CN108111359A - Monitoring processing method, device, and monitoring processing system

Monitoring processing method, device, and monitoring processing system

Info

Publication number
CN108111359A
Authority
CN
China
Prior art keywords
background server
monitoring
probability value
offline
monitoring data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810052608.7A
Other languages
Chinese (zh)
Inventor
丁浩
吴岩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing QIYI Century Science and Technology Co Ltd
Original Assignee
Beijing QIYI Century Science and Technology Co Ltd

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00: Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06: Management of faults, events, alarms or notifications
    • H04L41/0604: Management of faults, events, alarms or notifications using filtering, e.g. reduction of information by using priority, element types, position or time

Abstract

The invention discloses a monitoring alarm processing method, a device, and a monitoring processing system. A trained model is deployed in advance on each background server; the trained model is generated by a control server through training on the historical monitoring data of each background server and the corresponding alarm handling operations. Each background server monitors its own operating condition, processes its monitoring data with the trained model deployed on it to obtain the offline probability value corresponding to that data, and determines its alarm handling operation from the offline probability value. Because each background server processes its own monitoring data and carries out the corresponding alarm handling operation, fault alarm efficiency is improved.

Description

Monitoring processing method, device, and monitoring processing system
Technical field
The present invention relates to the technical field of server monitoring, and in particular to a monitoring processing method, a monitoring processing device, and a monitoring processing system.
Background
A monitoring system is an important means by which IT enterprises manage background servers. It can monitor hardware-related metrics of a background server, such as disk utilization, CPU usage, network packet loss rate, and whether network ports are reachable.
Among the monitoring systems in current use, Zabbix is the most representative. At regular intervals (a configurable number of minutes), the system collects information from all background servers and uploads it to the control server of the monitoring system, which analyzes the information according to a preset monitoring strategy. The strategy is as follows: a threshold is set for each monitoring metric of a background server, and an alarm is raised if a given metric exceeds its threshold at two consecutive monitoring scans. After receiving an alarm, operations staff take one of two handling measures: taking the corresponding background server offline, or leaving it online.
However, under a monitoring strategy like that of Zabbix, the control server performs fault monitoring for all background servers. It must monitor the monitoring data of every background server, so the volume of data it has to process is large, and fault alarm efficiency is correspondingly low.
Summary of the invention
The object of the invention is to provide a monitoring processing method, a device, and a monitoring processing system that improve fault alarm efficiency.
To achieve this object, the invention provides the following technical solutions:
A monitoring processing method, applied to a monitoring processing system that comprises a control server and at least one background server. A trained model is deployed in advance on each background server; the trained model is generated by the control server through training on the historical monitoring data of each background server and the corresponding alarm handling operations, and is deployed on the corresponding background server. The method comprises:
when a monitoring processing condition is met, obtaining, by a processor in the background server, the monitoring data of the background server;
inputting the monitoring data to the pre-deployed trained model for processing, to obtain the offline probability value of the corresponding background server;
comparing the offline probability value with a preset probability value, and determining the alarm handling operation of the background server from the comparison result.
Preferably, the method further comprises:
controlling the corresponding background server to perform the alarm handling operation.
Preferably, the control server generating the trained model through training on the historical monitoring data of each background server and the corresponding alarm handling operations comprises:
obtaining training data, the training data comprising the historical monitoring data of each background server and the alarm handling operation of the corresponding background server;
using the historical monitoring data as feature values and the corresponding alarm handling operations as the target variable, and training a model by machine learning to generate the trained model.
Preferably, the monitoring processing condition is met when any one monitoring metric in the current monitoring data detected by the background server reaches its alarm threshold.
Preferably, the processor in the background server obtaining the monitoring data of the background server comprises:
starting from the current time, continuously obtaining K+1 groups of monitoring data of the background server over K time intervals.
Preferably, inputting the monitoring data to the pre-deployed trained model for processing to obtain the offline probability value of the corresponding background server comprises:
inputting each of the K+1 groups of monitoring data to the pre-deployed trained model, to obtain the offline probability value of the background server corresponding to each group;
determining the offline probability value of the corresponding background server from the offline probability values obtained for the K+1 groups of monitoring data.
Preferably, determining the offline probability value of the corresponding background server from the offline probability values obtained for the K+1 groups of monitoring data comprises:
averaging the offline probability values obtained for the K+1 groups of monitoring data, to obtain the offline probability value of the corresponding background server.
Preferably, comparing the offline probability value with the preset probability value and determining the alarm handling operation of the background server from the comparison result comprises:
comparing the offline probability value obtained for each of the K+1 groups of monitoring data with the preset probability value;
when the preset probability value is exceeded N times among the comparison results, that is, when N of the offline probability values of the background server are greater than the preset probability value, determining that the alarm handling operation of the corresponding background server is the offline operation.
The invention also discloses a monitoring processing device, comprising:
an acquisition module, configured to obtain the monitoring data of the background server, by means of the processor in the background server, when the monitoring processing condition is met;
a processing module, configured to input the monitoring data to the pre-deployed trained model for processing, to obtain the offline probability value of the corresponding background server;
a comparison module, configured to compare the offline probability value with the preset probability value and to determine the alarm handling operation of the background server from the comparison result.
The invention also discloses a monitoring processing system, comprising a control server and at least one background server, wherein:
the control server generates a trained model through training on the historical monitoring data of each background server and the corresponding alarm handling operations, and deploys it on the corresponding background server;
the background server is configured so that, when the monitoring processing condition is met, the processor in the background server obtains the monitoring data of the background server; the monitoring data is input to the pre-deployed trained model for processing, to obtain the offline probability value of the corresponding background server; and the offline probability value is compared with the preset probability value to determine the alarm handling operation of the background server from the comparison result.
As can be seen from the above technical solutions, compared with the prior art, the invention discloses a monitoring alarm processing method, a device, and a monitoring processing system. A trained model is first deployed in advance on each background server; the trained model is generated by the control server through training on the historical monitoring data of each background server and the corresponding alarm handling operations. When the monitoring processing condition is met, the processor in the background server obtains the monitoring data of the background server and inputs it to the trained model for processing, obtaining the offline probability value of the corresponding background server. When the offline probability value is greater than the preset probability value, the corresponding alarm handling operation is determined to be the offline operation, and the background server is controlled to perform it. With the invention, each background server monitors its own operating condition, processes its monitoring data with the trained model deployed on it to obtain its offline probability value, and determines the corresponding alarm handling operation from that value. Because each background server processes its own monitoring data and carries out the corresponding alarm handling operation, fault alarm efficiency is improved.
Description of the drawings
To describe the technical solutions of the embodiments of the invention or of the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below are merely embodiments of the invention; those of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a structural diagram of the monitoring processing system in an embodiment of the invention;
Fig. 2 is a flow diagram of the training of the trained model provided by an embodiment of the invention;
Fig. 3 is a schematic diagram of one embodiment of a monitoring processing method provided by an embodiment of the invention;
Fig. 4 is a schematic diagram of another embodiment of a monitoring processing method provided by an embodiment of the invention;
Fig. 5 is a schematic diagram of a further embodiment of a monitoring processing method provided by an embodiment of the invention;
Fig. 6 is an example diagram of monitoring alarm time states provided by an embodiment of the invention;
Fig. 7 is a structural diagram of a monitoring processing device provided by an embodiment of the invention.
Detailed description of the embodiments
The technical solutions in the embodiments of the invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the invention. All other embodiments obtained by those of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the scope of protection of the invention.
The terms "first", "second", "third", "fourth", and the like (if present) in the description, the claims, and the above drawings are used to distinguish similar objects and do not describe a specific order or sequence. It should be understood that data so used are interchangeable where appropriate, so that the embodiments described here can be implemented in orders other than those illustrated or described. In addition, the terms "comprise" and "have", and any variants of them, are intended to cover non-exclusive inclusion: a process, method, system, product, or device that contains a series of steps or units is not necessarily limited to the steps or units expressly listed, and may include other steps or units that are not expressly listed or that are inherent to the process, method, product, or device.
It should be understood that the invention is applied to a monitoring processing system. Referring to Fig. 1, a structural diagram of the monitoring processing system in an embodiment of the invention, the system includes one control server and at least one background server. The four background servers shown in Fig. 1 are only illustrative; in practice there may be more or fewer background servers, and no limitation is imposed here.
It should be noted that whether a background server goes offline is determined from its monitoring data. In the invention, the offline probability of a background server is determined mainly from the individual monitoring metrics in its monitoring data. When the offline probability of a background server reaches the preset probability value, the alarm handling operation of that background server is determined to be the offline operation, and the controller of the background server takes it offline. In other words, when the offline probability of a background server reaches the preset probability value, that background server is removed from the monitoring processing system; this is what "offline" means.
The core idea of the application is as follows. The control server generates a trained model through training on the historical monitoring data of each background server and the corresponding alarm handling operations, and then deploys the trained model to each corresponding background server. Each background server monitors its own monitoring data and inputs it to the trained model to obtain the offline probability value of the corresponding background server. The offline probability value is compared with the preset probability value, and the alarm handling operation of the background server is determined from the comparison result. Subsequently, the corresponding background server can also be controlled to perform the alarm handling operation so obtained.
Referring to Fig. 1, in the monitoring processing system disclosed in the embodiments of the application:
The control server generates a trained model through training on the historical monitoring data of each background server and the corresponding alarm handling operations, and deploys it on the corresponding background server;
The background server is configured so that, when the monitoring processing condition is met, the processor in the background server obtains the monitoring data of the background server; the monitoring data is input to the pre-deployed trained model for processing, to obtain the offline probability value of the corresponding background server; and the offline probability value is compared with the preset probability value to determine the alarm handling operation of the background server from the comparison result.
After the alarm handling operation is determined, the processor can also control the corresponding background server to perform it, for example by taking the server offline. With the monitoring processing system provided by the invention, each background server monitors its own operating condition, processes its monitoring data with the trained model deployed on it to obtain its offline probability value, and determines its alarm handling operation from that value. Because each background server processes its own monitoring data and carries out the corresponding alarm handling operation, fault alarm efficiency is improved.
In an embodiment of the invention, as shown in Fig. 2, the training process of the trained model comprises the following steps:
In step S201, training data is obtained. The training data includes the historical monitoring data of each background server and the corresponding alarm handling operations.
It should be noted that the training data is obtained from an existing monitoring and alarm system, in which monitoring metrics and corresponding thresholds are configured. If the corresponding monitoring metric exceeds its threshold at two consecutive monitoring scans, an alarm is raised.
For example, the monitoring metrics may include, but are not limited to, the number of server network connections, overall CPU usage, disk utilization, memory usage, the number of application-layer server connections, the TCP retransmission rate, and whether network port 80 is reachable. As an example of a threshold setting, an alarm may be configured for overall CPU usage reaching 90%: if two consecutive monitoring scans both find CPU usage at 90% or above, an alarm is raised. As another example, an alarm may be configured for network port 80 being unreachable: if two consecutive monitoring scans both find port 80 unavailable, an alarm is raised. When the monitoring system raises an alarm, operations staff handle it, and the way they handle it is recorded. Here there are only two possible outcomes: the background server is taken offline, or it is not.
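The two-scan alarm rule described above can be illustrated with a short sketch. The function name `raises_alarm`, the `required_scans` parameter, and the sample readings are hypothetical and chosen only for illustration; treating "reaches the threshold" as `>=` is also an assumption.

```python
from typing import Sequence


def raises_alarm(samples: Sequence[float], threshold: float,
                 required_scans: int = 2) -> bool:
    """Return True if the metric reached the threshold on
    `required_scans` consecutive monitoring scans.

    Assumption: "reaches the threshold" is interpreted as value >= threshold.
    """
    streak = 0  # number of consecutive scans at or above the threshold
    for value in samples:
        streak = streak + 1 if value >= threshold else 0
        if streak >= required_scans:
            return True
    return False


# Hypothetical CPU usage readings (percent), one per monitoring scan.
cpu_usage_scans = [72.0, 91.5, 88.0, 93.0, 95.2]
print(raises_alarm(cpu_usage_scans, threshold=90.0))
```

A single spike (the second reading) does not trigger the alarm in this sketch; only the last two readings, which both exceed 90%, do.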
In step S202, the historical monitoring data is used as feature values and the corresponding alarm handling operations as the target variable, and a model is trained by machine learning to generate the trained model.
It should be noted that the monitoring metrics at the time of a monitoring alarm are used as the feature values, and the handling applied by operations staff is used as the target variable; training on these yields the trained model.
For example, suppose the reported monitoring metrics are: number of background server network connections (numeric variable); overall CPU usage (numeric variable); disk utilization (numeric variable); memory usage (numeric variable); number of background server connections (numeric variable); TCP retransmission rate (numeric variable); and whether network port 80 is reachable (nominal variable: 1 if reachable, 0 if not). These metrics can be used as the feature values for training. The handling applied by operations staff is mapped to the target variable: 1 if the background server was taken offline, 0 if it was not. In practice, the numbers of monitoring metrics and target variables are not limited to these.
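As a minimal sketch of this encoding, the example below maps one alarm record to a numeric feature vector and a 0/1 target. The `MonitoringRecord` class, its field names, and the sample values are hypothetical; the patent text only lists the metrics and the 1/0 mapping.

```python
from dataclasses import dataclass, astuple
from typing import List


@dataclass
class MonitoringRecord:
    """One group of monitoring data (field names are illustrative)."""
    network_connections: int
    cpu_usage: float
    disk_utilization: float
    memory_usage: float
    server_connections: int
    tcp_retransmission_rate: float
    port_80_reachable: bool  # nominal variable: encoded as 1 or 0


def to_features(record: MonitoringRecord) -> List[float]:
    """Encode a record as a numeric feature vector, in field order."""
    return [float(value) for value in astuple(record)]


def to_target(taken_offline: bool) -> int:
    """Map the operator's handling to the target: 1 = offline, 0 = not."""
    return 1 if taken_offline else 0


record = MonitoringRecord(1200, 93.5, 71.0, 64.2, 350, 0.02, False)
print(to_features(record), to_target(True))
```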
It should be noted that the machine learning method may be a supervised learning algorithm such as CART (Classification And Regression Tree), naive Bayes, SVM (Support Vector Machine), or ID3; the resulting trained model may specifically be a decision tree model.
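To make the idea of a tree-based model that outputs an offline probability concrete, here is a heavily simplified sketch: a one-level decision stump chosen by weighted Gini impurity, the split criterion CART uses for classification. It is a stand-in for illustration only, not a full CART implementation and not the patent's actual training procedure; the function names and the toy data (CPU usage and the port-80 flag) are assumptions.

```python
from typing import List, Sequence, Tuple

Stump = Tuple[int, float, float, float]  # (feature, threshold, p_left, p_right)


def gini(labels: Sequence[int]) -> float:
    """Gini impurity of a set of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)


def fit_stump(X: List[List[float]], y: List[int]) -> Stump:
    """Pick the single split with the lowest weighted Gini impurity.

    Each side of the split stores the fraction of "offline" (1) labels,
    which serves as the predicted offline probability for that side.
    """
    best, best_score, n = None, float("inf"), len(y)
    for j in range(len(X[0])):
        values = sorted({row[j] for row in X})
        # Candidate thresholds are midpoints between adjacent distinct
        # values, so both sides of every split are non-empty.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2.0
            left = [lab for row, lab in zip(X, y) if row[j] <= t]
            right = [lab for row, lab in zip(X, y) if row[j] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if score < best_score:
                best_score = score
                best = (j, t, sum(left) / len(left), sum(right) / len(right))
    if best is None:
        raise ValueError("no feature has two distinct values to split on")
    return best


def offline_probability(stump: Stump, features: Sequence[float]) -> float:
    """Predicted probability that the server should be taken offline."""
    j, t, p_left, p_right = stump
    return p_left if features[j] <= t else p_right


# Toy training data: [cpu_usage, port_80_reachable]; target 1 = taken offline.
X = [[35.0, 1.0], [50.0, 1.0], [92.0, 1.0], [95.0, 0.0], [97.0, 1.0], [60.0, 1.0]]
y = [0, 0, 1, 1, 1, 0]
stump = fit_stump(X, y)
print(stump, offline_probability(stump, [93.0, 1.0]))
```

A real deployment would more likely use a full decision tree with several levels, so that leaf probabilities fall strictly between 0 and 1; the stump keeps the example short.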
Specifically, for example, for the nginx servers under Linux in current use, an additional monitoring module can be developed to implement the relevant functions. The monitoring metrics listed above can be obtained with the following commands:
Number of background server network connections: obtained with the netstat command;
Overall CPU usage: obtained with the top or dstat command;
Disk utilization: obtained with the df command;
Memory usage: obtained with the free command;
Number of background server connections: obtained from the nginx status module;
TCP retransmission rate: calculated from the information in /proc/net/netstat;
Whether network port 80 is reachable: judged from the result of a command such as "curl localhost".
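As a hedged illustration of how a monitoring module might gather two of these metrics without shelling out to commands, the sketch below uses Python's standard library instead of `df` and `curl`. This is a substitution for illustration, not the method the text describes: the function names are hypothetical, and the disk figure (used divided by total) may differ slightly from the percentage `df` reports.

```python
import shutil
import socket


def disk_utilization_percent(path: str = "/") -> float:
    """Percentage of the filesystem containing `path` that is in use."""
    usage = shutil.disk_usage(path)  # named tuple: total, used, free
    return 100.0 * usage.used / usage.total


def port_reachable(host: str, port: int, timeout: float = 1.0) -> int:
    """Return 1 if a TCP connection to host:port succeeds, else 0.

    Matches the nominal encoding in the text: 1 = reachable, 0 = not.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return 1
    except OSError:
        return 0


if __name__ == "__main__":
    print("disk utilization (%):", round(disk_utilization_percent("."), 1))
    print("port 80 reachable:", port_reachable("127.0.0.1", 80))
```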
For ease of understanding, refer to Fig. 3, a schematic diagram of one embodiment of the monitoring processing method as carried out by a given background server in an embodiment of the invention. As shown in Fig. 3, the method proceeds as follows:
In step 301, when the monitoring processing condition is met, the processor in the background server obtains the monitoring data of the background server.
It should be noted that, in this embodiment, the monitoring processing condition may be met when any one monitoring metric in the current monitoring data detected by the background server reaches its alarm threshold, or when a preset condition for starting monitoring holds; alternatively, the background server may check its monitoring metrics continuously.
In step 302, the monitoring data is input to the pre-deployed trained model for processing, to obtain the offline probability value of the corresponding background server.
It should be noted that, in this embodiment, the processor of the background server monitors the relevant data and inputs the monitoring data to the trained model deployed in advance on the background server for processing, obtaining the offline probability value of the corresponding background server.
In step 303, the offline probability value is compared with the preset probability value, and the alarm handling operation of the background server is determined from the comparison result.
It should be noted that, in this embodiment, the processor of the background server compares the offline probability value with the preset probability value; when the offline probability value is greater than the preset probability value, the alarm handling operation of the corresponding background server is determined to be the offline operation.
Further, after obtaining the alarm handling operation, the processor may also:
In step 304, control the corresponding background server to perform the alarm handling operation.
The controller directly controls the operation of the corresponding background server according to the alarm handling operation. When a fault occurs that requires the server to be taken offline, the server can be taken offline promptly, which improves fault handling efficiency.
It should be noted that, in this embodiment, when the offline probability value is greater than the preset probability value, the processor of the background server controls the corresponding background server to perform the offline operation, that is, takes the corresponding background server offline.
In the monitoring processing method provided by this embodiment of the invention, a trained model is first deployed in advance on each background server; the trained model is generated by the control server through training on the historical monitoring data of each background server and the corresponding alarm handling operations. When the monitoring processing condition is met, the processor in the background server obtains the monitoring data of the background server and inputs it to the trained model for processing, obtaining the offline probability value of the corresponding background server. When the offline probability value is greater than the preset probability value, the corresponding alarm handling operation is determined to be the offline operation, and the corresponding background server is controlled to perform it. With this method, each background server monitors its own operating condition, processes its monitoring data with the trained model deployed on it to obtain its offline probability value, and determines the corresponding alarm handling operation from that value. Because each background server processes its own monitoring data and carries out its own alarm handling operation, fault alarm efficiency is improved; fault handling efficiency is also improved, which further improves the stability of the system.
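Steps 302 and 303 of this embodiment reduce to a single model call followed by a threshold comparison. The sketch below assumes the deployed model is available as a callable that returns an offline probability; the names `decide_operation`, `OFFLINE`, and `KEEP_ONLINE`, and the toy model, are hypothetical.

```python
from typing import Callable, Sequence

OFFLINE = "offline"          # take the background server offline
KEEP_ONLINE = "keep_online"  # leave the background server running


def decide_operation(features: Sequence[float],
                     model: Callable[[Sequence[float]], float],
                     preset_probability: float) -> str:
    """Score one group of monitoring data and pick the alarm handling operation."""
    probability = model(features)  # step 302: offline probability value
    # Step 303: offline only if the probability exceeds the preset value.
    return OFFLINE if probability > preset_probability else KEEP_ONLINE


def toy_model(features: Sequence[float]) -> float:
    """Toy stand-in for the deployed model: high probability when CPU usage
    (features[0]) is at or above 90%."""
    return 0.85 if features[0] >= 90.0 else 0.10


print(decide_operation([93.0, 1.0], toy_model, preset_probability=0.6))
```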
For ease of understanding, refer to Fig. 4, a schematic diagram of another embodiment of the monitoring processing method in an embodiment of the invention. As shown in Fig. 4, the method proceeds as follows:
In step 401, when any one monitoring metric in the current monitoring data detected by the background server reaches its alarm threshold, the processor in the background server, starting from the current time, continuously obtains K+1 groups of monitoring data of the background server over K time intervals.
It should be noted that, in this embodiment, the monitoring processing condition may also be met when a preset condition for starting monitoring holds; alternatively, the background server may check its monitoring metrics continuously.
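Collecting K+1 groups of monitoring data over K time intervals means taking one sample immediately and then one more after each interval. A minimal sketch follows; `collect_samples`, `read_metrics`, and the injectable `sleep` parameter are hypothetical names introduced for illustration.

```python
import time
from typing import Callable, List, TypeVar

T = TypeVar("T")


def collect_samples(read_metrics: Callable[[], T], k: int,
                    interval_seconds: float,
                    sleep: Callable[[float], None] = time.sleep) -> List[T]:
    """Return K+1 groups of monitoring data spanning K time intervals.

    The first group is read at the current time; each of the remaining K
    groups is read after waiting one interval.
    """
    if k < 0:
        raise ValueError("k must be non-negative")
    samples = [read_metrics()]
    for _ in range(k):
        sleep(interval_seconds)
        samples.append(read_metrics())
    return samples


# Example with a fake metric source and no real waiting.
readings = iter([91.0, 94.5, 89.0, 96.0])
print(collect_samples(lambda: next(readings), k=3, interval_seconds=60.0,
                      sleep=lambda seconds: None))
```

Passing `sleep` as a parameter is a design choice that keeps the function testable without real delays.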
In step 402, each of the K+1 groups of monitoring data is input to the pre-deployed trained model, to obtain the offline probability value of the background server corresponding to each group.
It should be noted that, in this embodiment, the processor of the background server monitors the relevant data and inputs the monitoring data to the trained model deployed in advance on the background server for processing, obtaining the offline probability value of the corresponding background server.
In step 403, the offline probability value of the corresponding background server is determined from the offline probability values obtained for the K+1 groups of monitoring data.
It should be noted that, in this embodiment, the offline probability value of the corresponding background server can be obtained by averaging the offline probability values obtained for the K+1 groups of monitoring data.
In step 404, the offline probability value is compared with the preset probability value, and the alarm handling operation of the background server is determined from the comparison result.
It should be noted that, in this embodiment, the processor of the background server compares the offline probability value with the preset probability value; when the offline probability value is greater than the preset probability value, the alarm handling operation of the corresponding background server is determined to be the offline operation.
Further, after obtaining the alarm handling operation, the processor may also:
In step 405, control the corresponding background server to perform the alarm handling operation.
It should be noted that, in this embodiment, when the offline probability value is greater than the preset probability value, the processor of the background server controls the corresponding background server to perform the offline operation, that is, takes the corresponding background server offline.
In the monitoring processing method provided by this embodiment of the invention, when any one monitoring metric in the current monitoring data detected by the background server reaches its alarm threshold, the processor in the background server, starting from the current time, continuously obtains K+1 groups of monitoring data over K time intervals and inputs each group to the trained model for processing, obtaining the offline probability value corresponding to each of the K+1 groups. These offline probability values are averaged; when the average is greater than the preset probability value, the alarm handling operation of the corresponding background server is determined to be the offline operation, and the corresponding background server is controlled to perform it. This method further improves fault alarm efficiency.
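The averaging rule of this embodiment can be sketched in a few lines. The function name and the sample probabilities are hypothetical; the comparison uses "greater than", as in the text.

```python
from statistics import mean
from typing import Sequence


def offline_by_average(probabilities: Sequence[float],
                       preset_probability: float) -> bool:
    """Return True if the mean of the K+1 offline probability values
    is greater than the preset probability value."""
    return mean(probabilities) > preset_probability


# Hypothetical model outputs for K+1 = 3 groups of monitoring data.
per_group_probabilities = [0.9, 0.4, 0.8]
print(offline_by_average(per_group_probabilities, preset_probability=0.6))
```

Averaging smooths out a single low or high reading: here one group scores only 0.4, but the mean of roughly 0.7 still exceeds the preset value of 0.6.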
For ease of understanding, refer to Fig. 5, which is a schematic diagram of another embodiment of the monitoring processing method in the embodiments of the present invention. As shown in Fig. 5, the method is specifically as follows:
In step 501, when any monitoring indicator in the current monitoring data detected by the background server reaches its alarm threshold, the processor in the background server continuously acquires, starting from the current moment, K+1 groups of monitoring data over K time intervals of the background server.
It should be noted that, in this embodiment of the present invention, satisfying the monitoring processing condition may also include: a preset condition for starting monitoring; alternatively, the background server may detect the monitoring indicators continuously.
In step 502, the K+1 groups of monitoring data are each input into the training model deployed in advance to obtain the offline probability values of the background server corresponding to the K+1 groups of monitoring data.
It should be noted that, in this embodiment of the present invention, the processor of the background server monitors the relevant data and inputs the monitoring data into the training model deployed in advance on the background server for processing, obtaining the offline probability value of the background server corresponding to the monitoring data.
In step 503, the offline probability values of the background server corresponding to the K+1 groups of monitoring data are each compared with the preset probability value.
In step 504, when there are more than n comparison results, if the offline probability value of the background server exceeds the preset probability value n times, the alarm handling operation of the corresponding background server is determined to be the offline operation.
It should be noted that, in this embodiment of the present invention, the processor of the background server compares the offline probability value with the preset probability value, and when the offline probability value is greater than the preset probability value, determines that the alarm handling operation of the corresponding background server is the offline operation.
Further, after obtaining the alarm handling operation, the processor may also:
In step 505, control the corresponding background server to perform the alarm handling operation.
It should be noted that, in this embodiment of the present invention, when the offline probability value of the background server is greater than the preset probability value, the processor of the background server controls the corresponding background server to perform the offline operation, that is, takes the corresponding background server offline.
In the embodiments of the present invention, the control server deploys the pre-configured training model on each background server, where it is applied in combination with existing monitoring methods. The following example illustrates how the method provided by this embodiment implements monitoring alarm processing:
The background server monitors the relevant data in real time, and the monitoring interval may be small (for example, a monitoring instruction is executed once per second);
When a monitoring indicator is found to reach its alarm threshold, the corresponding background server enters an early-warning state: in the K consecutive time intervals starting from the current moment, the data obtained by each monitoring pass is fed into the above model for prediction, yielding the "offline probability value of the background server". This probability can be read as "if operations staff were to judge from the monitoring data at this moment, how likely is it that the background server needs to be taken offline". If, within the K time intervals (in which monitoring yields K+1 groups of monitoring data), the offline probability of the server is above m% in more than n of the results, the offline operation is performed; otherwise, it is not.
For example (Fig. 6), let K=5, n=3, and m=70, and suppose that at time t0 a monitoring indicator is found to reach its alarm threshold. The monitoring data at times t1, t2, t3, t4, and t5 is then fed into the model to calculate the offline probability value at each corresponding time. If the six calculated probabilities are 74%, 68%, 63%, 77%, 71%, and 69%, then, among the offline probabilities calculated by feeding the monitoring data at times t1 through t5 into the training model, only two (77% and 71%) exceed 70%, so the server does not need to be taken offline. In this embodiment, if at some moment, for example t3, a monitoring indicator again reaches its alarm threshold, it is necessary to continue with t3 as the starting point and monitor the probabilities calculated by the model for times t3 through t8.
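The counting rule in this example can be checked with a short sketch. The function names are hypothetical. The comparison `>= n` is an assumption, because the text can be read either as "n times" or as "more than n times"; for the numbers of this example both readings give the same outcome. The list below holds only the five values the example attributes to times t1 through t5.

```python
from typing import Sequence


def count_exceeding(probabilities: Sequence[float], m_percent: float) -> int:
    """Count how many offline probabilities (in percent) are above m%."""
    return sum(1 for p in probabilities if p > m_percent)


def should_go_offline(
    probabilities: Sequence[float], m_percent: float, n: int
) -> bool:
    """Assumed rule: go offline when at least n probabilities exceed m%."""
    return count_exceeding(probabilities, m_percent) >= n


# Offline probabilities (percent) for t1..t5 from the example, with m=70, n=3.
example = [68, 63, 77, 71, 69]
print(count_exceeding(example, 70))       # 2 (only 77 and 71 exceed 70)
print(should_go_offline(example, 70, 3))  # False, so no offline operation
```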
In the monitoring processing method provided by this embodiment of the present invention, when any monitoring indicator in the current monitoring data detected by the background server reaches its alarm threshold, the processor in the background server continuously acquires, starting from the current moment, K+1 groups of monitoring data over K time intervals, and inputs each of the K+1 groups into the training model for processing to obtain the offline probability values corresponding to the K+1 groups of monitoring data. These offline probability values are then analyzed. When they satisfy the preset condition, the alarm handling operation of the corresponding background server is determined to be the offline operation, and the corresponding background server is controlled to perform the offline operation. The monitoring processing method provided by the present invention thereby further improves fault alarm efficiency.
On the basis of the method disclosed above, the present invention also discloses a corresponding device.
A monitoring processing device provided by an embodiment of the present invention is introduced below. It should be noted that the description of this monitoring processing device may refer to the monitoring processing method provided above and is not repeated below.
For ease of understanding, refer to Fig. 7, which is a schematic diagram of an embodiment of the monitoring processing device in the embodiments of the present invention. As shown in Fig. 7, the device specifically includes:
An acquisition module 701, configured to obtain, by the processor in the background server, the monitoring data of the background server when the monitoring processing condition is satisfied.
A processing module 702, configured to input the monitoring data into the training model deployed in advance for processing, to obtain the offline probability value of the corresponding background server.
A comparison module 703, configured to compare the offline probability value with the preset probability value and determine the alarm handling operation of the background server corresponding to the comparison result.
An execution module 704, configured to control the background server to perform the alarm handling operation.
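One way to picture how modules 701 to 704 fit together is the sketch below. It is an illustrative arrangement only: the class name `MonitoringProcessor`, its method names, the `"none"` label, and the injected callables are assumptions, and the patent does not prescribe any particular code structure.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical representation of one group of monitoring data.
Sample = dict[str, float]


@dataclass
class MonitoringProcessor:
    read_sample: Callable[[], Sample]    # backs the acquisition module (701)
    predict: Callable[[Sample], float]   # backs the processing module (702)
    preset_probability: float            # used by the comparison module (703)
    take_offline: Callable[[], None]     # backs the execution module (704)

    def acquire(self) -> Sample:
        """Acquisition: obtain the monitoring data of the background server."""
        return self.read_sample()

    def process(self, sample: Sample) -> float:
        """Processing: run the deployed model to get an offline probability."""
        return self.predict(sample)

    def compare(self, probability: float) -> str:
        """Comparison: choose the alarm handling operation."""
        return "offline" if probability > self.preset_probability else "none"

    def execute(self, operation: str) -> None:
        """Execution: carry out the chosen alarm handling operation."""
        if operation == "offline":
            self.take_offline()

    def run_once(self) -> str:
        """Chain the four modules for a single group of monitoring data."""
        operation = self.compare(self.process(self.acquire()))
        self.execute(operation)
        return operation
```

Keeping each module behind its own method mirrors the division of functions in Fig. 7 while leaving the data source, the model, and the offline action replaceable.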
Preferably, the acquisition module is further configured to continuously acquire, starting from the current moment, K+1 groups of monitoring data over K time intervals of the background server.
Preferably, the processing module includes:
A processing unit, configured to input each of the K+1 groups of monitoring data into the training model deployed in advance to obtain the offline probability values of the background server corresponding to the K+1 groups of monitoring data.
A determination unit, configured to determine the offline probability value of the corresponding background server according to the offline probability values of the background server corresponding to the K+1 groups of monitoring data.
Preferably, the determination unit is further configured to average the offline probability values of the background server corresponding to the K+1 groups of monitoring data to obtain the offline probability value of the corresponding background server.
Preferably, the determination module includes:
A comparing unit, configured to compare each of the offline probability values of the background server corresponding to the K+1 groups of monitoring data with the preset probability value;
A determination subunit, configured to determine, when there are more than n comparison results and the offline probability value of the background server exceeds the preset probability value n times, that the alarm handling operation of the corresponding background server is the offline operation.
In the monitoring processing device provided by this embodiment of the present invention, a training model is first deployed in advance on the background server; the training model is generated by the control server through training on history monitoring data and the corresponding alarm handling operations. When the monitoring processing condition is satisfied, the processor in the background server obtains the monitoring data of the background server and inputs it into the training model for processing to obtain the offline probability value of the corresponding background server. When the offline probability value is greater than the preset probability value, the alarm handling operation of the corresponding background server is determined to be the offline operation, and the background server is controlled to perform the offline operation. With the monitoring processing device provided by the present invention, each background server monitors its own operating condition, processes the monitoring data with the training model deployed on it to obtain its offline probability value, and determines the corresponding alarm handling operation according to that value. Because each background server processes its own monitoring data and performs the corresponding alarm handling operation, fault alarm efficiency is improved.
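The training step mentioned above, in which history monitoring data serves as feature values and the corresponding alarm handling operation as the target variable, might look like the sketch below. The text says only that the model is generated by machine learning and names no algorithm, so logistic regression trained by batch gradient descent is a stand-in chosen here. The function name, hyperparameters, and feature layout are likewise assumptions.

```python
import math
from typing import Callable, Sequence


def _sigmoid(z: float) -> float:
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def train_offline_model(
    features: Sequence[Sequence[float]],  # history monitoring data (feature values)
    labels: Sequence[int],                # 1 = server was taken offline, 0 = it was not
    learning_rate: float = 0.5,
    epochs: int = 2000,
) -> Callable[[Sequence[float]], float]:
    """Fit a logistic regression and return a function giving an offline probability."""
    if not features or len(features) != len(labels):
        raise ValueError("features and labels must be non-empty and equally long")
    weights = [0.0] * len(features[0])
    bias = 0.0
    m = len(features)
    for _ in range(epochs):
        grad_w = [0.0] * len(weights)
        grad_b = 0.0
        for x, y in zip(features, labels):
            # Prediction error of the current model on this history record.
            error = _sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias) - y
            for j, xj in enumerate(x):
                grad_w[j] += error * xj
            grad_b += error
        # Batch gradient descent step on the averaged gradient.
        weights = [w - learning_rate * g / m for w, g in zip(weights, grad_w)]
        bias -= learning_rate * grad_b / m

    def predict(x: Sequence[float]) -> float:
        return _sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)

    return predict
```

In the arrangement the text describes, a function like this would run on the control server, and the resulting model would then be deployed to each background server for local prediction.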
Those skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the system, device, and units described above may refer to the corresponding processes in the foregoing method embodiments, and are not repeated here.
In the several embodiments provided in this application, it should be understood that the disclosed system, device, and method may be implemented in other ways. For example, the device embodiments described above are merely illustrative. For example, the division into units is only a division by logical function, and other divisions are possible in actual implementation: multiple units or components may be combined or integrated into another system, or some features may be ignored or not performed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical, mechanical, or in other forms.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network elements. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods of the embodiments of the present invention. The foregoing storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, read-only memory (ROM), random access memory (RAM), a magnetic disk, or an optical disc.
The above embodiments are merely intended to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they may still modify the technical solutions described in the foregoing embodiments or make equivalent substitutions for some of the technical features; such modifications or substitutions do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (10)

1. A monitoring processing method, characterized in that it is applied to a monitoring processing system, the monitoring processing system comprising: a control server and at least one background server; a training model is deployed in advance on each background server, the training model being generated by the control server through training on the history monitoring data of each background server and the corresponding alarm handling operations, and deployed on the corresponding background server; the method comprises:
when a monitoring processing condition is satisfied, obtaining, by a processor in the background server, monitoring data of the background server;
inputting the monitoring data into the training model deployed in advance for processing, to obtain an offline probability value of the corresponding background server;
comparing the offline probability value with a preset probability value, and determining the alarm handling operation of the background server corresponding to the comparison result.
2. The monitoring processing method according to claim 1, characterized by further comprising:
controlling the corresponding background server to perform the alarm handling operation.
3. The monitoring processing method according to claim 1, characterized in that the control server generating the training model through training on the history monitoring data of each background server and the corresponding alarm handling operations comprises:
obtaining training data, the training data comprising the history monitoring data of each background server and the alarm handling operations of the corresponding background server;
using the history monitoring data as feature values and the corresponding alarm handling operations as the target variable, performing model training by machine learning to generate the training model.
4. The monitoring processing method according to claim 1, characterized in that the monitoring processing condition being satisfied comprises: any monitoring indicator in the current monitoring data detected by the background server reaching an alarm threshold.
5. The monitoring processing method according to claim 4, characterized in that the processor in the background server obtaining the monitoring data of the background server comprises:
continuously acquiring, starting from the current moment, K+1 groups of monitoring data over K time intervals of the background server.
6. The monitoring processing method according to claim 5, characterized in that inputting the monitoring data into the training model deployed in advance for processing to obtain the offline probability value of the corresponding background server comprises:
inputting each of the K+1 groups of monitoring data into the training model deployed in advance to obtain the offline probability values of the background server corresponding to the K+1 groups of monitoring data;
determining the offline probability value of the corresponding background server according to the offline probability values of the background server corresponding to the K+1 groups of monitoring data.
7. The monitoring processing method according to claim 6, characterized in that determining the offline probability value of the corresponding background server according to the offline probability values of the background server corresponding to the K+1 groups of monitoring data comprises:
averaging the offline probability values of the background server corresponding to the K+1 groups of monitoring data to obtain the offline probability value of the corresponding background server.
8. The monitoring processing method according to claim 6, characterized in that comparing the offline probability value with the preset probability value and determining the alarm handling operation of the background server corresponding to the comparison result comprises:
comparing each of the offline probability values of the background server corresponding to the K+1 groups of monitoring data with the preset probability value;
when there are more than n comparison results, if the offline probability value of the background server exceeds the preset probability value n times, determining that the alarm handling operation of the corresponding background server is the offline operation.
9. A monitoring processing device, characterized by comprising:
an acquisition module, configured to obtain, by a processor in the background server, the monitoring data of the background server when a monitoring processing condition is satisfied;
a processing module, configured to input the monitoring data into the training model deployed in advance for processing, to obtain the offline probability value of the corresponding background server;
a comparison module, configured to compare the offline probability value with a preset probability value and determine the alarm handling operation of the background server corresponding to the comparison result.
10. A monitoring processing system, characterized by comprising: a control server and at least one background server, wherein:
the control server generates a training model through training on the history monitoring data of each background server and the corresponding alarm handling operations, and deploys it on the corresponding background server;
the background server is configured to: when a monitoring processing condition is satisfied, obtain, by the processor in the background server, the monitoring data of the background server; input the monitoring data into the training model deployed in advance for processing, to obtain the offline probability value of the corresponding background server; and compare the offline probability value with a preset probability value and determine the alarm handling operation of the background server corresponding to the comparison result.
CN201810052608.7A 2018-01-19 2018-01-19 A kind of monitor processing method, device and monitoring processing system Pending CN108111359A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810052608.7A CN108111359A (en) 2018-01-19 2018-01-19 A kind of monitor processing method, device and monitoring processing system

Publications (1)

Publication Number Publication Date
CN108111359A true CN108111359A (en) 2018-06-01

Family

ID=62218718

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810052608.7A Pending CN108111359A (en) 2018-01-19 2018-01-19 A kind of monitor processing method, device and monitoring processing system

Country Status (1)

Country Link
CN (1) CN108111359A (en)

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180601
