CN110502499A - Data fault event-handling method and maintenance system based on bayesian algorithm - Google Patents

Data fault event-handling method and maintenance system based on bayesian algorithm Download PDF

Info

Publication number
CN110502499A
CN110502499A CN201910561178.6A CN201910561178A CN110502499A CN 110502499 A CN110502499 A CN 110502499A CN 201910561178 A CN201910561178 A CN 201910561178A CN 110502499 A CN110502499 A CN 110502499A
Authority
CN
China
Prior art keywords
event
maintenance system
data fault
failure information
fault event
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910561178.6A
Other languages
Chinese (zh)
Inventor
张志涛
陶晓龙
高志栋
高嘉良
杨洋
郭军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Telecom Wanwei Information Technology Co Ltd
Original Assignee
China Telecom Wanwei Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Telecom Wanwei Information Technology Co Ltd filed Critical China Telecom Wanwei Information Technology Co Ltd
Priority to CN201910561178.6A priority Critical patent/CN110502499A/en
Publication of CN110502499A publication Critical patent/CN110502499A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0793Remedial or corrective actions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3466Performance evaluation by tracing or monitoring
    • G06F11/3476Data logging
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases

Abstract

This application provides a kind of data fault event-handling method and maintenance system based on bayesian algorithm, for providing the troubleshooting strategy for meeting the solution demand of enterprise practical data fault event.Method includes: that maintenance system extracts event of failure information from the journal file of database server;Maintenance system carries out the classification of fault type according to bayesian algorithm, obtains the corresponding fault type label of data fault event of different faults type on the basis of event of failure information;Maintenance system obtains the corresponding troubleshooting strategy of data fault event of different faults type on the basis of event of failure information;When maintenance system monitors that new data fault event occurs for database server, maintenance system is new data fault event configuration target faults type label;Maintenance system obtains the corresponding target faults processing strategie of target faults type label;Maintenance system is to database server performance objective troubleshooting strategy.

Description

Data fault event-handling method and maintenance system based on bayesian algorithm
Technical field
This application involves database maintenance field more particularly to a kind of data fault event handlings based on bayesian algorithm Method and maintenance system.
Background technique
Enterprise, especially Internet enterprises, the management of database are the important ring in enterprise operation, number in database According to storage and its safety guarantee be to realize that cash one big ensures based on data.
In practical applications, the generation and data fault to prevent data fault generate harm to database, and enterprise is logical It is often equipped with the O&M engineer of database, data base administrator (Database Administrator, DBA) carries out in other words The maintenance of database.It being readily appreciated that, the energy of DBA, experience and treatment effeciency are often limited, for this purpose, in the prior art, enterprise Industry can also introduce the automatic maintenance system of database, it is intended to the data event that rapidly automatic data processing library occurs at the first time Barrier event.
However discovery is arrived in the application, different automatic maintenance systems is due to its function, the difference of operator, to different There is different troubleshooting strategies for data fault event, for this purpose, how to customize out the maintenance system for meeting enterprise's self-characteristic System, it is clear that for specific enterprise, there is certain application demand.
Summary of the invention
This application provides a kind of data fault event-handling method and maintenance system based on bayesian algorithm, is used for The troubleshooting strategy for meeting the solution demand of enterprise practical data fault event is provided.
The application is in a first aspect, provide a kind of data fault event-handling method based on bayesian algorithm, method packet It includes:
Maintenance system extracts event of failure information from the journal file of database server, and event of failure information is for referring to Show multiple data fault events that database server occurred;
Maintenance system carries out the classification of fault type according to bayesian algorithm, obtains on the basis of event of failure information The corresponding fault type label of the data fault event of different faults type;
On the basis of event of failure information, the data fault event for obtaining different faults type respectively corresponds maintenance system Troubleshooting strategy;
When maintenance system monitors that new data fault event occurs for database server, maintenance system is new data Event of failure configures target faults type label;
Maintenance system obtains the corresponding target faults processing strategie of target faults type label;
Maintenance system is to database server performance objective troubleshooting strategy, to solve new data fault event.
In conjunction with the application in a first aspect, maintenance system exists in the first possible implementation of the application first aspect On the basis of event of failure information, the classification of fault type is carried out according to bayesian algorithm, obtains the data of different faults type The corresponding fault type label of event of failure includes:
Spatial position locating for multiple database servers is divided into multiple three-dimensional grids by maintenance system, and is each three Tie up the corresponding Marking the cell (Identification, ID) of grid configuration;
Maintenance system is on the basis of event of failure information and disparate databases server corresponding difference grid ID, root The classification that fault type is carried out according to bayesian algorithm, obtains the corresponding failure classes of data fault event of different faults type Type label;
When maintenance system monitors that new data fault event occurs for database server, maintenance system is new data Event of failure configures target faults type label
When maintenance system monitors that new data fault event occurs for database server, maintenance system obtains new number According to the corresponding target gridding ID of event of failure and target data fault message;
Maintenance system is new data fault event configuration target according to target gridding ID and target data fault message Fault type label.
In conjunction with the application in a first aspect, maintenance system exists in second of possible implementation of the application first aspect On the basis of event of failure information, the corresponding troubleshooting strategy packet of data fault event of different faults type is obtained It includes:
Maintenance system reads the corresponding failure of data fault event of different faults type from event of failure information Processing strategie;
Maintenance system is by the corresponding troubleshooting policy store of the data fault event of different faults type in local In Mishap Database.
In conjunction with the application in a first aspect, maintenance system exists in the third possible implementation of the application first aspect On the basis of event of failure information, the corresponding troubleshooting strategy packet of data fault event of different faults type is obtained It includes:
On the basis of event of failure information, statistical data event of failure is executed and is completed at the failure solved maintenance system Reason strategy, forms the corresponding troubleshooting strategy of data fault event of different faults type.
In conjunction with the application in a first aspect, maintenance system exists in the 4th kind of possible implementation of the application first aspect On the basis of event of failure information, the corresponding troubleshooting strategy packet of data fault event of different faults type is obtained It includes:
Maintenance system receives user equipment (User Equipment, UE) and reports on the basis of event of failure information The corresponding troubleshooting strategy of the data fault event of different faults type.
In conjunction with the application in a first aspect, maintenance system exists in the 5th kind of possible implementation of the application first aspect On the basis of event of failure information, the corresponding troubleshooting strategy packet of data fault event of different faults type is obtained It includes:
Maintenance system carries out machine learning on the basis of event of failure information, and training obtains data fault model, data Fault model includes the corresponding relationship between the data fault event of different faults type and different troubleshooting strategies.
In conjunction with the application in a first aspect, in the 6th kind of possible implementation of the application first aspect, at target faults Reason strategy includes the investigation strategy, resolution policy and solution script of new data fault event.
The application provides a kind of maintenance system in second aspect, and system includes:
Extraction unit, for extracting event of failure information, event of failure information from the journal file of database server It is used to indicate multiple data fault events that database server occurred;
Taxon, for carrying out the classification of fault type according to bayesian algorithm on the basis of event of failure information, Obtain the corresponding fault type label of data fault event of different faults type;
Acquiring unit, on the basis of event of failure information, obtaining the data fault event point of different faults type Not corresponding troubleshooting strategy;
Configuration unit, for safeguarding when maintenance system monitors that new data fault event occurs for database server System is new data fault event configuration target faults type label;
Acquiring unit is also used to obtain the corresponding target faults processing strategie of target faults type label;
Execution unit is used for database server performance objective troubleshooting strategy, to solve new data fault thing Part.
In conjunction with the application second aspect, in the first possible implementation of the application second aspect, acquiring unit, tool Body is used for:
Spatial position locating for multiple database servers is divided into multiple three-dimensional grids, and is matched for each three-dimensional grid Set corresponding Marking the cell ID;
On the basis of event of failure information and disparate databases server corresponding difference grid ID, according to Bayes Algorithm carries out the classification of fault type, obtains the corresponding fault type label of data fault event of different faults type;
Configuration unit is specifically used for:
When maintenance system monitors that new data fault event occurs for database server, new data fault thing is obtained The corresponding target gridding ID of part and target data fault message;
It is new data fault event configuration target faults type according to target gridding ID and target data fault message Label.
In conjunction with the application second aspect, in second of possible implementation of the application second aspect, acquiring unit, tool Body is used for:
From event of failure information, the corresponding troubleshooting plan of data fault event of different faults type is read Slightly;
By the corresponding troubleshooting policy store of the data fault event of different faults type in local fault data In library.
In conjunction with the application second aspect, in the third possible implementation of the application second aspect, acquiring unit, tool Body is used for:
On the basis of event of failure information, statistical data event of failure executes and completes the troubleshooting strategy solved, Form the corresponding troubleshooting strategy of data fault event of different faults type.
In conjunction with the application second aspect, in the 4th kind of possible implementation of the application second aspect, acquiring unit, tool Body is used for:
On the basis of event of failure information, the data fault event of the different faults type of reported by user equipment UE is received Corresponding troubleshooting strategy.
In conjunction with the application second aspect, in the 5th kind of possible implementation of the application second aspect, acquiring unit, tool Body is used for:
On the basis of event of failure information, machine learning is carried out, training obtains data fault model, data fault model Corresponding relationship between data fault event and different troubleshooting strategies including different faults type.
In conjunction with the application second aspect, in the 6th kind of possible implementation of the application second aspect, at target faults Reason strategy includes the investigation strategy, resolution policy and solution script of new data fault event.
The third aspect, the application provide a kind of maintenance system, including processor, and processor is stored for executing in memory Computer program when realize as above-mentioned first aspect the data fault event-handling method based on bayesian algorithm any Step.
Fourth aspect, the application provide a kind of readable storage medium storing program for executing, are stored thereon with computer program, computer program quilt Any step of the data fault event-handling method based on bayesian algorithm such as above-mentioned first aspect is realized when processor executes Suddenly.
As can be seen from the above technical solutions, the application has the following advantages:
Maintenance system provided by the present application extracts event of failure from the journal file of the database server of enterprise On the one hand information assigns corresponding failure classes according to the data fault event of bayesian algorithm classification different faults type with this On the other hand type label obtains the corresponding troubleshooting strategy of different faults type with this, to occur in database server When new data fault event, after distributing target faults type label for it, the target faults type label can be got Corresponding target faults processing strategie, and executed on database server, the new data fault event is solved, due to Troubleshooting strategy is that the real data fault condition based on enterprise obtains, therefore, maintenance system when there is data fault, The troubleshooting strategy for meeting enterprise demand can be executed within first time, can not only meet the customization demand of enterprise, and also Quickly data event of failure can be solved, guarantee the data safety of enterprise.
Detailed description of the invention
Fig. 1 shows a kind of flow diagram of data fault event-handling method of the application based on bayesian algorithm;
Another process of data fault event-handling method Fig. 2 shows the application based on bayesian algorithm is illustrated Figure;
Fig. 3 shows another process signal of data fault event-handling method of the application based on bayesian algorithm Figure;
Fig. 4 shows another process signal of data fault event-handling method of the application based on bayesian algorithm Figure;
Fig. 5 shows a kind of structural schematic diagram of the application maintenance system;
Fig. 6 shows another structural schematic diagram of the application maintenance system.
Specific embodiment
This application provides a kind of data fault event-handling method and maintenance system based on bayesian algorithm, is used for The troubleshooting strategy for meeting the solution demand of enterprise practical data fault event is provided.
Below in conjunction with the attached drawing in the embodiment of the present application, technical solutions in the embodiments of the present application carries out clear, complete Site preparation description, it is clear that described embodiments are only a part of embodiments of the present application, instead of all the embodiments.It is based on Embodiment in the application, those skilled in the art's every other implementation obtained without creative efforts Example, shall fall in the protection scope of this application.
The description and claims of this application and term " first " in above-mentioned attached drawing, " second " etc. are for distinguishing Similar object, without being used to describe a particular order or precedence order.It should be understood that the data used in this way are in appropriate feelings It can be interchanged under condition, so that the embodiments described herein can be real with the sequence other than the content for illustrating or describing herein It applies.In addition, term " includes " and " having " and their any deformation, it is intended that cover it is non-exclusive include, for example, packet The process, method, system, product or equipment for having contained series of steps or module those of be not necessarily limited to be clearly listed step or Module, but may include other steps being not clearly listed or intrinsic for these process, methods, product or equipment or Module.Occur in this application to step carry out name perhaps number be not meant to must according to name or number Indicated time/logic sequencing executes the step in method flow, and named or number process step can be with Execution order is changed according to the technical purpose to be realized, as long as identical or similar technical effect can be reached.
The division of module appeared in the application is a kind of division in logic, can have when realizing in practical application Other division mode, such as multiple modules can be combined into or are integrated in another system, or some features can be ignored, Or do not execute, in addition, shown or discussion mutual coupling, direct-coupling or communication connection can be by one A little interfaces, the indirect coupling or communication connection between module can be electrical or other similar form, do not make in the application It limits.Also, module or submodule can be the separation that may not be physically as illustrated by the separation member, can be It can not be physical module, or can be distributed in multiple circuit modules, portion therein can be selected according to the actual needs Point or whole module realize the purpose of application scheme.
Firstly, before introducing the application, introduce first this application involves server and UE.
In this application, maintenance system can be directly deployed in the targeting database server of enterprise, or can also be incited somebody to action The server file of targeting database server is deployed in maintenance system, and maintenance system can directly be defaulted in other words will be local Database server be determined as targeting database server, and be based on the database server with provided by the present application The data fault event-handling method of bayesian algorithm, to provide the event for the solution demand for meeting enterprise practical data fault event Hinder processing strategie.
Maintenance system can also be deployed in physical host, physical server or physical equipment other than database server On, it establishes and is connected to the network with targeted database data library server.
Maintenance system system can also in real time according to UE initiate maintenance request entrained by database server server ID, location database server, and the data provided by the present application based on bayesian algorithm are run on the database server Event of failure processing method.
UE is specifically as follows smart phone, tablet computer, desktop computer, laptop, computer all-in-one machine, intelligent hand Ring or smartwatch etc. can establish the terminal device of network connection, the user of the side UE enterprise, in other words enterprise with maintenance system Database server relevant staff, UE can be passed through and access maintenance system, execute the hair of the above-mentioned maintenance request referred to It rises, maintenance condition of the database server under maintenance system such as checks at the sequence of operations.
In the following, starting to be based on bayesian algorithm to the application then based on the above-mentioned explanation to the application background context content Data fault event-handling method describe in detail.
Firstly, refering to fig. 1, Fig. 1 shows the one of data fault event-handling method of the application based on bayesian algorithm Kind flow diagram, specifically, data fault event-handling method of the application based on bayesian algorithm may include following steps:
Step S101, maintenance system extract event of failure information from the journal file of database server;
Wherein, event of failure information is used to indicate multiple data fault events that database server occurred.
It is appreciated that determining database server in maintenance system, and maintenance task is executed on the database server When, the event of failure information of the database server can be extracted first from the journal file of database server.
The event of failure information specifically may include the information such as fault time, failure functional node or malfunction routine section.
Step S102, maintenance system carry out fault type on the basis of event of failure information, according to bayesian algorithm Classification, obtains the corresponding fault type label of data fault event of different faults type;
On the one hand, after obtaining the corresponding event of failure information of database server, maintenance system can call Bayes Algorithm, the fault type based on database failure event are classified, and the data fault event difference of different faults type is obtained Corresponding fault type label.
Bayesian algorithm in practical applications, by extracting its different attributive character to different data fault events, So as to speculating the distinction and probability of happening of different faults type, obtaining different fault types and assigning corresponding failure Type label.
Step S103, maintenance system obtain the data fault thing of different faults type on the basis of event of failure information The corresponding troubleshooting strategy of part;
On the other hand, after obtaining the corresponding event of failure information of database server, maintenance system can be according to different The troubleshooting details of event of failure are searched, the corresponding failure of data fault event that statistics or generation enterprise are different Processing strategie.
It specifically may include the replacement of field, the rewriting of flag bit, system recovery, file backup in the troubleshooting strategy And its strategies such as recovery, the screening of virus and its isolation, interception of malicious traffic stream.
Step S104, when maintenance system monitors that new data fault event occurs for database server, maintenance system For new data fault event configuration target faults type label;
In the corresponding different faults type label of database server for classifying to obtain enterprise by bayesian algorithm, and After the different corresponding troubleshooting strategy of data fault event decibel of the database server of enterprise, if there is new data event Barrier event, maintenance system can encourage the new data fault event and be determined as current target data event of failure, and configuration pair The target faults type identification answered.
Step S105, maintenance system obtain the corresponding target faults processing strategie of target faults type label;
After having configured target faults type identification for current target data event of failure, maintenance system can be before The corresponding troubleshooting strategy of data fault event for stating the different faults type that step S103 is obtained gets target event Hinder the corresponding target faults processing strategie of type label.
Step S106, maintenance system is to database server performance objective troubleshooting strategy, to solve new data event Barrier event.
After obtaining the corresponding target faults processing strategie of target data event of failure, maintenance system can be directly in data The target faults processing strategie is executed on the server of library, is currently sent out so as to be automatically performed database server within first time The solution of raw target data event of failure.
Certainly, after monitoring that target data event of failure occurs for database server, maintenance system can pass through UE, Xiang Yong Family exports alarm, to prompt the timely focused data library server of user that target data event of failure and target data event occurs The corresponding fault details of barrier event.
Alternatively, maintenance system can also be in the target data event of failure for solving database server and occurring, to defeated to user Result out, to inform the solution details of the target data event of failure of user database server generation.
From the foregoing, it will be observed that maintenance system provided by the present application, from the journal file of the database server of enterprise, Event of failure information is extracted, is on the one hand assigned with this according to the data fault event of bayesian algorithm classification different faults type On the other hand corresponding fault type label obtains the corresponding troubleshooting strategy of different faults type with this, thus in data When new data fault event occurs in library server, after distributing target faults type label for it, the target can be got The corresponding target faults processing strategie of fault type label, and executed on database server, solve the new data Event of failure, since troubleshooting strategy is that the real data fault condition based on enterprise obtains, maintenance system is going out When existing data fault, the troubleshooting strategy for meeting enterprise demand can be executed within first time, can not only meet determining for enterprise Inhibition and generation demand, and also quickly data event of failure can be solved, guarantee the data safety of enterprise.
In practical applications, data fault event-handling method of the application based on bayesian algorithm, which can also have, has into one The optimization of step, is specifically detailed in the following contents:
In one embodiment, refering to Fig. 2 shows data fault event handling side of the application based on bayesian algorithm Another flow diagram of method, the corresponding event of data fault event that different faults type is obtained based on bayesian algorithm Hinder type label, can specifically be achieved by the steps of:
Spatial position locating for multiple database servers is divided into multiple three-dimensional grids by step S201, maintenance system, And corresponding grid ID is configured for each three-dimensional grid;
It is appreciated that database is realized often through multiple database servers in enterprise, multiple database clothes Device unified plan be engaged in an interior space, therefore, is directed to the data fault event that database server may occur, may be used also In view of related in the spatial position of the interior space to concrete database server.
For this purpose, the spatial position of multiple cdr database servers can be divided into multiple three dimensional networks by maintenance system Lattice, each three-dimensional grid is according to the actual deployment schemes of multiple database servers, it may include the database service of unequal number amount Device, and corresponding grid ID is assigned, with the spatial position for marking different three-dimensional grids corresponding and database server.
Step S201, maintenance system is the corresponding different grid ID's of event of failure information and disparate databases server On the basis of, the classification of fault type is carried out according to bayesian algorithm, the data fault event for obtaining different faults type is right respectively The fault type label answered.
After obtaining three-dimensional grid ID, maintenance system is i.e. in combination with the above-mentioned event of failure information got, using pattra leaves This algorithm carries out the classification of fault type, obtains the corresponding fault type mark of data fault event of different faults type Label.
It is corresponding, data fault event-handling method refering to the application shown in Fig. 3 based on bayesian algorithm it is another Kind flow diagram, is arranged, when maintenance system monitors that new data occur for database server based on above-mentioned three-dimensional grid When event of failure, maintenance system is new data fault event configuration target faults type label, specifically can be as follows It realizes:
Step S301, when maintenance system monitors that new data fault event occurs for database server, maintenance system Obtain the corresponding target gridding ID of new data fault event and target data fault message;
After the above-mentioned three-dimensional grid ID for the configuration of each database server, configured subsequent for target data event of failure During target faults type label, number of targets of the maintenance system in addition to the target data event of failure needed originally can be obtained According to fault message, the corresponding target gridding ID of concrete database server that target data event of failure occurs can be also obtained.
Step S302, maintenance system are new data fault thing according to target gridding ID and target data fault message Part configures target faults type label.
After obtaining the corresponding target three-dimensional grid ID of target data event of failure and target data fault message, Maintenance system in combination with the two, obtains the corresponding target faults type label of target data event of failure, obtains so as to subsequent To the corresponding target faults processing strategie of the target faults type label.
In another implementation, the statistics available database server of maintenance system in first operational process, for The performed troubleshooting strategy crossed of the data fault event of different faults type.Pattra leaves is based on refering to the application shown in Fig. 4 Another flow diagram of the data fault event-handling method of this algorithm, maintenance system is on the basis of event of failure information On, the corresponding troubleshooting strategy of data fault event of different faults type is obtained, it specifically can also be as follows It realizes:
Step S401, maintenance system read the data fault event difference of different faults type from event of failure information Corresponding troubleshooting strategy;
Maintenance system directly reads the corresponding troubleshooting strategy of data fault event from event of failure information, and ties It closes and states the classification processing based on bayesian algorithm and sorted out, the data fault event of the different faults type occurred Corresponding troubleshooting strategy.
Step S402, maintenance system deposit the corresponding troubleshooting strategy of the data fault event of different faults type Storage is in local Mishap Database.
After the troubleshooting strategy for obtaining crossing performed by the data fault event of different faults type, maintenance system can be straight Connect and it be stored in local Mishap Database, so as to when there is new data fault event, can it is convenient and can offline from In local Mishap Database, corresponding troubleshooting strategy is directly obtained.
In another implementation, the statistics available troubleshooting strategy for completing to solve of maintenance system, so as to subsequent number According to library server occur new data fault event be occurred originally and succeeded solution data fault event when, can be straight The original used troubleshooting strategy of execution is connect, convenient effective, corresponding, maintenance system is on the basis of event of failure information On, the corresponding troubleshooting strategy of data fault event for obtaining different faults type includes:
On the basis of event of failure information, statistical data event of failure is executed and is completed at the failure solved maintenance system Reason strategy, forms the corresponding troubleshooting strategy of data fault event of different faults type.
In another implementation, the corresponding failure of data fault event of different faults type in maintenance system Processing strategie can also manually be reported by the user of the side UE enterprise, which is more convenient for maintenance system in initial deployment, can Obtain more complete troubleshooting policy library.Corresponding, maintenance system obtains different events on the basis of event of failure information Barrier type the corresponding troubleshooting strategy of data fault event include:
Maintenance system receives the data fault event for the different faults type that UE is reported on the basis of event of failure information Corresponding troubleshooting strategy.
In another implementation, maintenance system can also introduce data fault model, by the way of machine learning, mould The data fault event of quasi- different faults type, and simulate different troubleshootings and survey strategy, thus it is speculated that different treatment effects, from And the troubleshooting strategy that different fault types can execute and obtain optimization process effect can be filtered out.Also, it can also be with Practical application of the maintenance system to data fault model, may also be combined with troubleshooting effect acquired by practical application, optimize The corresponding stock of different faults type is given to your processing strategie in data fault model.Corresponding, maintenance system is in failure thing On the basis of part information, the corresponding troubleshooting strategy of data fault event for obtaining different faults type includes:
Maintenance system carries out machine learning on the basis of event of failure information, and training obtains data fault model, data Fault model includes the corresponding relationship between the data fault event of different faults type and different troubleshooting strategies.
In another implementation, target faults processing strategie includes the investigation strategy of new data fault event, solution Script is omited and solved to decision.
Certainly, any one and any combination thereof that may also comprise above-mentioned three in target faults processing strategie, specifically exist This is without limitation.
It is the introduction of the data fault event-handling method to the application based on bayesian algorithm above, starts to introduce below The maintenance system of the application.
A kind of structural schematic diagram of the application maintenance system is shown refering to Fig. 5, Fig. 5, specifically, maintenance system may include Such as flowering structure:
Extraction unit 501, for extracting event of failure information from the journal file of database server;
Wherein, event of failure information is used to indicate multiple data fault events that database server occurred.
Taxon 502, for carrying out point of fault type according to bayesian algorithm on the basis of event of failure information Class obtains the corresponding fault type label of data fault event of different faults type;
Acquiring unit 503, for obtaining the data fault event of different faults type on the basis of event of failure information Corresponding troubleshooting strategy;
Configuration unit 504, for tieing up when maintenance system monitors that new data fault event occurs for database server Protecting system is new data fault event configuration target faults type label;
Acquiring unit 503 is also used to obtain the corresponding target faults processing strategie of target faults type label;
Execution unit 505 is used for database server performance objective troubleshooting strategy, to solve new data fault Event.
In one implementation, acquiring unit 503 are specifically used for:
Spatial position locating for multiple database servers is divided into multiple three-dimensional grids, and is matched for each three-dimensional grid Set corresponding Marking the cell ID;
On the basis of event of failure information and disparate databases server corresponding difference grid ID, according to Bayes Algorithm carries out the classification of fault type, obtains the corresponding fault type label of data fault event of different faults type;
Configuration unit 504, is specifically used for:
When maintenance system monitors that new data fault event occurs for database server, new data fault thing is obtained The corresponding target gridding ID of part and target data fault message;
It is new data fault event configuration target faults type according to target gridding ID and target data fault message Label.
In another implementation, acquiring unit 503 is specifically used for:
From event of failure information, the corresponding troubleshooting plan of data fault event of different faults type is read Slightly;
By the corresponding troubleshooting policy store of the data fault event of different faults type in local fault data In library.
In another implementation, acquiring unit 503 is specifically used for:
On the basis of event of failure information, statistical data event of failure executes and completes the troubleshooting strategy solved, Form the corresponding troubleshooting strategy of data fault event of different faults type.
In another implementation, acquiring unit 503 is specifically used for:
On the basis of event of failure information, the data fault event for receiving the different faults type that UE is reported is respectively corresponded Troubleshooting strategy.
In another implementation, acquiring unit 503 is specifically used for:
On the basis of event of failure information, machine learning is carried out, training obtains data fault model, data fault model Corresponding relationship between data fault event and different troubleshooting strategies including different faults type.
In another implementation, target faults processing strategie includes the investigation strategy of new data fault event, solution Script is omited and solved to decision.
The above is that the maintenance system of application is described from the angle of modular functionality entity, below then from hardware point of view introduction The maintenance system of the application shows another structural schematic diagram of maintenance system provided by the present application refering to Fig. 6, Fig. 6, specifically , maintenance system provided by the present application includes processor 601, and processor 601 is for executing the computer stored in memory 602 The each of the data fault event-handling method based on bayesian algorithm in corresponding any embodiment as shown in Figure 1 to Figure 4 is realized when program Step;Alternatively, being realized when processor 601 is for executing the computer program stored in memory 602 as in Fig. 5 corresponding embodiment The function of each unit.
Illustratively, computer program can be divided into one or more module/units, and one or more module/ Unit is stored in memory 602, and is executed by processor 601, to complete the application.One or more module/units can To be the series of computation machine program instruction section that can complete specific function, which is calculating for describing computer program Implementation procedure in machine device.
Maintenance system may include, but be not limited only to processor 601, memory 602.It will be understood by those skilled in the art that Signal is only the example of computer installation, does not constitute the restriction to maintenance system, may include more more or less than illustrating Component, perhaps combine certain components or different components, such as maintenance system can also include input-output equipment, net Network access device, bus etc., processor 601, memory 602, input-output equipment and network access equipment etc. pass through bus It is connected.
Processor 601 can be central processing unit (Central Processing Unit, CPU), can also be other General processor, digital signal processor (Digital Signal Processor, DSP), specific integrated circuit (Application Specific Integrated Circuit, ASIC), ready-made programmable gate array (Field- Programmable Gate Array, FPGA) either other programmable logic device, discrete gate or transistor logic, Discrete hardware components etc..General processor can be microprocessor or the processor is also possible to any conventional processor Deng processor is the control centre of computer installation, utilizes each portion of various interfaces and the entire computer installation of connection Point.
Memory 602 can be used for storing computer program and/or module, and processor 601 is stored in by operation or execution Computer program and/or module in memory 602, and the data being stored in memory 602 are called, realize computer dress The various functions of setting.Memory 602 can mainly include storing program area and storage data area, wherein storing program area can store Application program needed for operating system, at least one function (such as sound-playing function, image player function etc.) etc.;Store number It can be stored according to area and created data (such as audio data, video data etc.) etc. are used according to mobile phone.In addition, memory can It can also include nonvolatile memory, such as hard disk, memory, plug-in type hard disk, intelligence to include high-speed random access memory Energy storage card (Smart Media Card, SMC), secure digital (Secure Digital, SD) card, flash card (Flash Card), at least one disk memory, flush memory device or other volatile solid-state parts.
The application also provides a kind of readable storage medium storing program for executing, is stored thereon with computer program, computer program is by processor The data fault event-handling method based on bayesian algorithm in corresponding any embodiment as shown in Figure 1 to Figure 4 is realized when execution.
If it is appreciated that integrated unit be realized in the form of SFU software functional unit and as the sale of independent product or In use, can store in a computer readable storage medium.Based on this understanding, the technical solution sheet of the application The all or part of the part that contributes to existing technology or the technical solution can be with software product in other words in matter Form embodies, which is stored in a storage medium, including some instructions are used so that a meter Calculate the whole that machine equipment (can be personal computer, server or the network equipment etc.) executes the application each method embodiment Or part steps.And storage medium above-mentioned include: USB flash disk, mobile hard disk, read-only memory (Read-only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic or disk etc. are various can store program The medium of code.
It is apparent to those skilled in the art that for convenience and simplicity of description, the maintenance of foregoing description The specific work process of system and its unit, can be referring to figs. 1 to the data event in Fig. 4 corresponding embodiment based on bayesian algorithm Hinder the explanation of event-handling method, specific details are not described herein.
To sum up, a kind of data fault event-handling method and maintenance system based on bayesian algorithm provided by the present application System is extracted event of failure information, is on the one hand calculated with this according to Bayes from the journal file of the database server of enterprise The data fault event of method classification different faults type, assigns corresponding fault type label, on the other hand obtains difference with this The corresponding troubleshooting strategy of fault type, thus when new data fault event occurs in database server, for its point After target faults type label, the corresponding target faults processing strategie of the target faults type label can be got, and It is executed on database server, solves the new data fault event, since troubleshooting strategy is the reality based on enterprise Border data fault situation obtains, and therefore, maintenance system can execute within first time when there is data fault and meet enterprise The troubleshooting strategy of demand can not only meet the customization demand of enterprise, and can also quickly solve to data event of failure Certainly, guarantee the data safety of enterprise.
In several embodiments provided herein, it should be understood that the deployment system of disclosed fictitious host computer and Its unit, may be implemented in other ways.For example, the apparatus embodiments described above are merely exemplary, for example, The division of unit, only a kind of logical function partition, there may be another division manner in actual implementation, such as multiple units Or component can be combined or can be integrated into another system, or some features can be ignored or not executed.Another point, institute Display or the mutual coupling, direct-coupling or communication connection discussed can be through some interfaces, device or unit Indirect coupling or communication connection can be electrical property, mechanical or other forms.
Unit may or may not be physically separated as illustrated by the separation member, shown as a unit Component may or may not be physical unit, it can and it is in one place, or may be distributed over multiple networks On unit.It can some or all of the units may be selected to achieve the purpose of the solution of this embodiment according to the actual needs.
It, can also be in addition, each functional unit in each embodiment of the application can integrate in one processing unit It is that each unit physically exists alone, can also be integrated in one unit with two or more units.Above-mentioned integrated list Member both can take the form of hardware realization, can also realize in the form of software functional units.
More than, above embodiments are only to illustrate the technical solution of the application, rather than its limitations;Although referring to aforementioned reality Example is applied the application is described in detail, those skilled in the art should understand that: it still can be to aforementioned each Technical solution documented by embodiment is modified or equivalent replacement of some of the technical features;And these are modified Or replacement, the spirit and scope of each embodiment technical solution of the application that it does not separate the essence of the corresponding technical solution.

Claims (10)

1. a kind of data fault event-handling method based on bayesian algorithm, which is characterized in that the described method includes:
Maintenance system extracts event of failure information from the journal file of database server, and the event of failure information is for referring to Show multiple data fault events that the database server occurred;
The maintenance system carries out the classification of fault type according to bayesian algorithm on the basis of the event of failure information, Obtain the corresponding fault type label of data fault event of different faults type;
The maintenance system obtains the data fault event of different faults type on the basis of the event of failure information Corresponding troubleshooting strategy;
When the maintenance system monitors that new data fault event occurs for the database server, the maintenance system is The new data fault event configuration target faults type label;
The maintenance system obtains the corresponding target faults processing strategie of the target faults type label;
The maintenance system executes the target faults processing strategie to the database server, to solve the new data Event of failure.
2. the method according to claim 1, wherein the maintenance system is on the basis of the event of failure information On, the classification of fault type is carried out according to bayesian algorithm, the data fault event for obtaining different faults type is corresponding Fault type label includes:
Spatial position locating for multiple database servers is divided into multiple three-dimensional grids by the maintenance system, and is every A three-dimensional grid configures corresponding Marking the cell ID;
The maintenance system is the corresponding different grid ID's of the event of failure information and the disparate databases server On the basis of, the classification of fault type is carried out according to bayesian algorithm, the data fault event for obtaining different faults type is right respectively The fault type label answered;
It is described when the maintenance system monitors that new data fault event occurs for the database server, maintenance system System is that the new data fault event configuration target faults type label includes:
When the maintenance system monitors that the new data fault event occurs for the database server, the maintenance system System obtains the corresponding target gridding ID of the new data fault event and target data fault message;
The maintenance system is the new data fault event according to the target gridding ID and target data fault message Configure target faults type label.
3. the method according to claim 1, wherein the maintenance system is on the basis of the event of failure information On, the corresponding troubleshooting strategy of the data fault event for obtaining different faults type includes:
For the maintenance system from the event of failure information, the data fault event for reading different faults type is right respectively The troubleshooting strategy answered;
The maintenance system deposits the corresponding troubleshooting strategy of the data fault event of the different faults type Storage is in local Mishap Database.
4. the method according to claim 1, wherein the maintenance system is on the basis of the event of failure information On, the corresponding troubleshooting strategy of the data fault event for obtaining different faults type includes:
The maintenance system counts the data fault event and executes and complete to solve on the basis of the event of failure information The troubleshooting strategy, form the corresponding troubleshooting plan of the data fault event of the different faults type Slightly.
5. the method according to claim 1, wherein the maintenance system is on the basis of the event of failure information On, the corresponding troubleshooting strategy of the data fault event for obtaining different faults type includes:
The maintenance system receives the different faults type of reported by user equipment UE on the basis of the event of failure information The corresponding troubleshooting strategy of the data fault event.
6. the method according to claim 1, wherein the maintenance system is on the basis of the event of failure information On, the corresponding troubleshooting strategy of the data fault event for obtaining different faults type includes:
The maintenance system carries out machine learning on the basis of the event of failure information, and training obtains data fault model, The data fault model includes between the data fault event of different faults type and different troubleshooting strategies Corresponding relationship.
7. the method according to claim 1, wherein the target faults processing strategie includes the new data Investigation strategy, resolution policy and the solution script of event of failure.
8. a kind of maintenance system, which is characterized in that the system comprises:
Extraction unit, for extracting event of failure information, the event of failure information from the journal file of database server It is used to indicate multiple data fault events that the database server occurred;
Taxon, for carrying out the classification of fault type according to bayesian algorithm on the basis of the event of failure information, Obtain the corresponding fault type label of data fault event of different faults type;
Acquiring unit, for obtaining the data fault thing of different faults type on the basis of the event of failure information The corresponding troubleshooting strategy of part;
Configuration unit, for when the maintenance system monitors that new data fault event occurs for the database server, For the new data fault event configuration target faults type label;
The acquiring unit is also used to obtain the corresponding target faults processing strategie of the target faults type label;
Execution unit, for executing the target faults processing strategie to the database server, to solve the new number According to event of failure.
9. a kind of maintenance system, which is characterized in that the maintenance system includes processor, and processor is deposited for executing in memory The method as described in any one of claims 1 to 7 is realized when the computer program of storage.
10. a kind of readable storage medium storing program for executing, which is characterized in that be stored thereon with computer program, computer program is held by processor The method as described in any one of claims 1 to 7 is realized when row.
CN201910561178.6A 2019-06-26 2019-06-26 Data fault event-handling method and maintenance system based on bayesian algorithm Pending CN110502499A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910561178.6A CN110502499A (en) 2019-06-26 2019-06-26 Data fault event-handling method and maintenance system based on bayesian algorithm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910561178.6A CN110502499A (en) 2019-06-26 2019-06-26 Data fault event-handling method and maintenance system based on bayesian algorithm

Publications (1)

Publication Number Publication Date
CN110502499A true CN110502499A (en) 2019-11-26

Family

ID=68585813

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910561178.6A Pending CN110502499A (en) 2019-06-26 2019-06-26 Data fault event-handling method and maintenance system based on bayesian algorithm

Country Status (1)

Country Link
CN (1) CN110502499A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112256466A (en) * 2020-10-23 2021-01-22 上海中通吉网络技术有限公司 System stability improving method, device and equipment based on fault reason
CN113285977A (en) * 2020-08-08 2021-08-20 詹能勇 Network maintenance method and system based on block chain and big data

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105553700A (en) * 2015-12-11 2016-05-04 浪潮电子信息产业股份有限公司 Intelligent equipment fault recognition detection and solution providing system
CN106383760A (en) * 2016-09-19 2017-02-08 郑州云海信息技术有限公司 Computer fault management method and apparatus
CN106383916A (en) * 2016-11-09 2017-02-08 北京许继电气有限公司 Data processing method based on predictive maintenance of industrial equipment
CN106779505A (en) * 2017-02-28 2017-05-31 中国南方电网有限责任公司 A kind of transmission line malfunction method for early warning driven based on big data and system
CN107229732A (en) * 2017-06-12 2017-10-03 郑州云海信息技术有限公司 A kind of fault data information processing method and device
CN107341068A (en) * 2017-06-28 2017-11-10 北京优特捷信息技术有限公司 The method and apparatus that O&M troubleshooting is carried out by natural language processing
CN107846016A (en) * 2017-11-16 2018-03-27 中国南方电网有限责任公司 A kind of Distribution Network Failure localization method and equipment based on Bayes and Complex event processing
CN108153603A (en) * 2017-12-08 2018-06-12 上海陆家嘴国际金融资产交易市场股份有限公司 Database server fault handling method, device and storage medium
CN108627720A (en) * 2018-03-08 2018-10-09 国网四川省电力公司电力科学研究院 A kind of power equipment state monitoring method based on bayesian algorithm
US20180307713A1 (en) * 2017-04-19 2018-10-25 Xerox Corporation Diagnostic method and system utilizing historical event logging data
WO2019047072A1 (en) * 2017-09-06 2019-03-14 富璟科技(深圳)有限公司 Database detection method and terminal

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105553700A (en) * 2015-12-11 2016-05-04 浪潮电子信息产业股份有限公司 Intelligent equipment fault recognition detection and solution providing system
CN106383760A (en) * 2016-09-19 2017-02-08 郑州云海信息技术有限公司 Computer fault management method and apparatus
CN106383916A (en) * 2016-11-09 2017-02-08 北京许继电气有限公司 Data processing method based on predictive maintenance of industrial equipment
CN106779505A (en) * 2017-02-28 2017-05-31 中国南方电网有限责任公司 A kind of transmission line malfunction method for early warning driven based on big data and system
US20180307713A1 (en) * 2017-04-19 2018-10-25 Xerox Corporation Diagnostic method and system utilizing historical event logging data
CN107229732A (en) * 2017-06-12 2017-10-03 郑州云海信息技术有限公司 A kind of fault data information processing method and device
CN107341068A (en) * 2017-06-28 2017-11-10 北京优特捷信息技术有限公司 The method and apparatus that O&M troubleshooting is carried out by natural language processing
WO2019047072A1 (en) * 2017-09-06 2019-03-14 富璟科技(深圳)有限公司 Database detection method and terminal
CN107846016A (en) * 2017-11-16 2018-03-27 中国南方电网有限责任公司 A kind of Distribution Network Failure localization method and equipment based on Bayes and Complex event processing
CN108153603A (en) * 2017-12-08 2018-06-12 上海陆家嘴国际金融资产交易市场股份有限公司 Database server fault handling method, device and storage medium
CN108627720A (en) * 2018-03-08 2018-10-09 国网四川省电力公司电力科学研究院 A kind of power equipment state monitoring method based on bayesian algorithm

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113285977A (en) * 2020-08-08 2021-08-20 詹能勇 Network maintenance method and system based on block chain and big data
CN113285977B (en) * 2020-08-08 2022-07-05 山东鼹鼠人才知果数据科技有限公司 Network maintenance method and system based on block chain and big data
CN112256466A (en) * 2020-10-23 2021-01-22 上海中通吉网络技术有限公司 System stability improving method, device and equipment based on fault reason

Similar Documents

Publication Publication Date Title
US10498588B2 (en) Systems and methods for managing network health
CN106326219B (en) Method, device and system for checking business system data
CN104243569B (en) A kind of city operating system
CN109766696A (en) The setting method and device of software permission, storage medium, electronic device
KR102001898B1 (en) Method of processing alarm information, related devices and systems
CN108737182A (en) The processing method and system of system exception
EP3286656B1 (en) System and method for handling events involving computing systems and networks using fabric monitoring system
CN104125214B (en) A kind of security architecture system and safety governor for realizing software definition safety
CN103026345A (en) Dynamic multidimensional schemas for event monitoring priority
CN110502499A (en) Data fault event-handling method and maintenance system based on bayesian algorithm
CN112434302B (en) Multitask collaboration vulnerability platform and construction method and service method thereof
CN102611713A (en) Entropy operation-based network intrusion detection method and device
CN114679292B (en) Honeypot identification method, device, equipment and medium based on network space mapping
CN107465568A (en) A kind of monitoring method and monitoring system for terminal
CN103208047A (en) Airport security management system
CN109525645A (en) A kind of method and system for collecting the log of distributed storage cluster
CN111614639A (en) Network security analysis method based on boundary theory
CN104765672A (en) Error code monitoring method, device and equipment
CN104468818B (en) A kind of internet of things service processing system and its method
Rajaboevich et al. Methods and intelligent mechanisms for constructing cyberattack detection components on distance-learning systems
CN104539463B (en) A kind of network equipments configuration file on-line attribute cross-check method and system
CN106656550A (en) Project system monitoring method and device
CN101931544A (en) Method and system for identifying unauthorized amendment of website content
CN112688818B (en) Data transmission method and device, electronic equipment and readable storage medium
CN113971288A (en) Big data technology-based smart campus security management and control platform

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191126

RJ01 Rejection of invention patent application after publication