Monitoring alarm control method and system thereof
Technical field
The application relates to computer network field, is specifically related to a kind of monitoring alarm control method and system thereof.
Background technology
Existing monitoring and alarming system be by be arranged on Agent on monitored machine initiatively or passive status data to monitoring equipment, then obtained state operation conditions grade by the contrast of built-in or third-party alarm module according to the range of normal value (alarm threshold value data) of status data and definition and given the alarm according to actual conditions, related personnel is artificial troubleshooting after receiving and reporting to the police.Wherein, monitoring alarm technology is by judging that the status data received draws result of variations with the difference of corresponding range of normal value (alarm threshold value data) thus carrys out trigger alarm, as long as each status data will trigger alarm beyond the scope of alarm threshold value data, and in practical operation, such as, monitored machine sends several status data of alarm threshold value data that exceeds to monitoring equipment at short notice because of the emergentness fluctuation of external environment, and afterwards, monitored machine recovers again normal work.As long as but now monitoring equipment receives a status data exceeding alarm threshold value data area and just can send warning, when staff finds that warning will carry out troubleshooting operation, monitored machine has recovered normally to have worked, and now just can determine this time to report to the police as invalid warning.In prior art, this type of invalid warning often can occur, increase the work load of staff, cause unnecessary trouble, monitoring alarm control system accuracy is lower, and controllability is lower.
Summary of the invention
According to the first aspect of the application, the application provides a kind of monitoring alarm control method, comprising:
Monitoring client obtains the real-time monitoring data of monitored end and forms monitoring message queue, and every bar monitoring message comprises monitor data;
From monitoring message queue, read monitoring message, monitoring client threshold data corresponding with this monitor data for the monitor data in monitoring message is compared, determine whether to generate warning message according to this monitoring message according to comparative result;
If generate warning message, judge whether the warning message generated meets the warning trigger condition corresponding with this monitor data, determine whether to produce warning trigger pip according to judged result, described warning trigger condition gets rid of less desirable warning message for adopting the mode of time and/or alerting signal number of times.
According to the second aspect of the application, the application provides a kind of monitoring alarm control system, comprising:
Data acquisition module, for obtaining the real-time monitoring data of monitored end and forming monitoring message queue, every bar monitoring message comprises monitor data;
Forewarn module, for reading monitoring message from monitoring message queue, monitor data in monitoring message and the monitoring client threshold data corresponding with this monitor data are compared, determine whether to generate warning message according to this monitoring message according to comparative result, if generate warning message, then judge whether the warning message generated meets the warning trigger condition corresponding with this monitor data, determine whether to produce warning trigger pip according to judged result, described warning trigger condition gets rid of less desirable warning message for adopting the mode of time and/or generation warning trigger pip number of times.
The beneficial effect of the application is: the application adds the restriction of the warning trigger condition corresponding with it to warning message, and described warning trigger condition gets rid of less desirable warning message for adopting the mode of time and/or generation warning trigger pip number of times.Namely might not trigger alarm for the monitor message exceeding monitoring client threshold data, but judge whether monitor message meets the warning trigger condition corresponding with it, is limited in time and/or in alarm times by warning trigger condition to the warning coming from same monitored end further.Because the sudden wave time of external environment is extremely short, so the warning message quantity produced is few, even only produce a warning message, therefore by filtering the invalid warning coming from same monitored end in time and/or in alarm times, the warning trigger pip of trigger alarm can be avoided to be condition of instant error data.In addition, when carrying out troubleshooting to the monitored end of reporting to the police, now receive the meaning that the warning coming from this monitored end has lost warning again, therefore by filtering the invalid warning coming from same monitored end in time and/or in alarm times, also can reduce a staff the warning received in during troubleshooting.Therefore can reduce the invalid warning in monitor procedure, improve the accuracy of monitoring and alarming system; Also facilitating user arranges different monitoring conditions according to different monitored ends simultaneously, strengthens customization.
Accompanying drawing explanation
Fig. 1 is the structural representation of a kind of monitoring alarm control system in the embodiment of the present application one;
Fig. 2 is the workflow diagram of a kind of monitoring alarm control system in the embodiment of the present application one;
Fig. 3 is the structural representation of a kind of monitoring alarm control system in the embodiment of the present application two;
Fig. 4 is the workflow diagram of a kind of monitoring alarm control system in the embodiment of the present application two;
Fig. 5 is the structural representation of a kind of monitoring alarm control system in the embodiment of the present application three;
Fig. 6 is the workflow diagram of trigger control module of reporting to the police in a kind of monitoring alarm control system in the embodiment of the present application three;
Fig. 7 is the structural representation of a kind of monitoring alarm control system in the embodiment of the present application four;
Fig. 8 is the workflow diagram of mark module and tokens statistics module in a kind of monitoring alarm control system in the embodiment of the present application four;
Fig. 9 is the structural representation of a kind of monitoring alarm control system in the embodiment of the present application five;
Figure 10 is the workflow diagram of automatic troubleshooting module in a kind of monitoring alarm control system in the embodiment of the present application five;
Embodiment
By reference to the accompanying drawings the present invention is described in further detail below by embodiment.
Each monitoring client can monitor multiple monitored end, monitored end can be such as the various resource and service, router, switch etc. that associate under hard disk, CPU, server, server, in the present embodiment, monitored end is classified, implements different controlling alarm strategies according to different types.
Embodiment one:
Please refer to Fig. 1, Fig. 2, in the present embodiment, monitoring alarm control system comprises data acquisition module 101 and forewarn module 102.Data acquisition module 101 is for obtaining the real-time monitoring data of monitored end and forming monitoring message queue, and every bar monitoring message comprises monitor data, forewarn module 102 for reading monitoring message from monitoring message queue, monitor data in monitoring message and the monitoring client threshold data corresponding with this monitor data are compared, determine whether to generate warning message according to this monitoring message according to comparative result, if generate warning message, then judge whether the warning message generated meets the warning trigger condition corresponding with this monitor data, determine whether to produce warning trigger pip according to judged result, described warning trigger condition gets rid of less desirable warning message for adopting the mode of time and/or generation warning trigger pip number of times.
The specific works flow process of data acquisition module 101 and forewarn module 102 is as follows:
Step 103, data acquisition module 101 obtains the real-time monitoring data of monitored end.Monitor data be monitored end status data, produced by script file by monitored end, it can be timing or the status data sporadically initiatively reading monitored end that data acquisition module obtains the mode of status data, also can by monitored end regularly or sporadically status data to monitoring client.
Step 104, the monitor data that step 103 obtains by data acquisition module 101 forms monitoring message queue.
Step 105, forewarn module 102 reads single monitoring message from monitoring message queue.
Step 106, the monitor data comprised in the monitoring message of reading and threshold data compare by forewarn module 102, determine whether to generate warning message according to this monitoring message according to comparative result.Then continue step 107 when monitor data exceeds the scope of threshold data, then perform step 110 when monitor data does not exceed the scope of threshold data.
Step 107, general, illustrate that when monitor data exceeds threshold data scope corresponding monitored end breaks down, then forewarn module 102 generates warning message.
Step 108, the warning message of generation judges by forewarn module 102 further, judges whether warning message meets warning trigger condition.If met, then perform step 109, if do not met, then perform step 111.
Step 109, when forewarn module 102 judges that warning message meets warning trigger condition, then generates warning trigger pip, carries out warning open operation.
Step 110, abandons current monitor message, and system reads next monitoring message automatically.
Step 111, abandons current alerts information, reads next warning message in queue, or the monitor message not meeting warning trigger condition can also be recorded, apply for subsequent processing steps.
In the present embodiment, owing to adding the restriction of warning trigger condition in forewarn module 102 to warning message, filter operation is carried out to warning message, filtered out invalid warning.For example, when warning trigger condition is that the warning message number of times receiving identical warning message is no less than 5 times, only have and just can open warning when forewarn module to receive 5 times or more time warning message continuously.In practical operation, such as, produce because of the emergentness fluctuation of external environment the monitor data exceeding threshold data scope when monitored end, just corresponding warning message is generated, because the sudden wave time of external environment is extremely short, so the warning message quantity produced is few, even only produce a warning message, afterwards, monitored end recovers again normal work.In prior art, now system just can open alarm operation, but in the monitoring alarm control procedure provided at the present embodiment, alarm operation can't be opened after forewarn module receives warning message at once, but continue to receive, within a certain period of time, if the quantity receiving identical warning message is not more than 5, then illustrate that this warning message is invalid warning, do not carry out unlatching alarm operation.If the identical warning message quantity received is more than 5, then illustrates that monitored object is very large and really may there occurs fault, then open warning, remind staff to carry out troubleshooting.Therefore can filter invalid warning according to the difference of warning trigger condition in the monitoring alarm control procedure provided at the present embodiment, the unlatching of invalid warning in minimizing work, the work load reduced a staff, improves accuracy and the controllability of monitoring and alarming system.
In the present embodiment, described warning trigger condition can carry out difference setting according to the difference of monitored object, preferably, described warning trigger condition also comprises the frequency threshold value producing warning trigger pip, at least one in inspection interval time threshold and alarm interval time threshold, the described inspection interval time is judge the time of current monitor message and the interval time between judging from the time of the last monitoring message of same monitored end, the described alarm interval time is the interval time between the time of fire alarming of the last time judging current monitor message and same monitored end.Difference according to warning trigger condition is filtered different invalid warning messages, improves the accuracy of monitoring alarm control system.
Embodiment two:
Please refer to Fig. 3, Fig. 4, in the present embodiment, monitoring alarm control system comprises data acquisition module 201 and forewarn module 202.Wherein, data acquisition module 202 comprises reading unit 203, monitor data taxon 204 and buffer unit 205, and reading unit 203 is for obtaining real-time monitoring data from least one monitored end; Monitor data and attribute 210, for determining the attribute 210 of monitor data, are synthesized a monitoring message by monitor data taxon 204, at least comprise the type of the monitored end producing this monitor data in described attribute 210; Buffer unit 205 is for putting into monitoring message queue by monitoring message.
Forewarn module 202 for reading monitoring message from monitoring message queue, monitor data in monitoring message and the monitoring client threshold data corresponding with this monitor data are compared, determine whether to generate warning message according to this monitoring message according to comparative result, if generate warning message, then judge whether the warning message generated meets the warning trigger condition corresponding with this monitor data, determine whether to produce warning trigger pip according to judged result, described warning trigger condition gets rid of less desirable warning message for adopting the mode of time and/or generation warning trigger pip number of times.Forewarn module 202 can in advance for the warning trigger condition that each monitor data is different according to its setup of attribute, after data acquisition module 201 is each monitoring message determination attribute, forewarn module 202 can determine adopt which warning trigger condition according to the attribute of this monitoring message, just produces warning trigger pip when monitoring message meets warning trigger condition.
Based on the monitoring alarm control system in the present embodiment, the control flow of monitoring alarm comprises the following steps:
Step 206, obtains monitor data.Reading unit 203 obtains real-time monitoring data, based on monitor data generating monitoring information from monitored end.
Step 207, monitor data, attribute and monitoring client threshold data are synthesized a monitoring message by monitor data taxon 204.The attribute of each monitored object is pre-set in data acquisition module, attribute can comprise monitored object type, monitored object rank etc., for the convenience of the user to the control of reporting to the police, also comprises self-defined priority in attribute, self-defined priority is the interim setting value of user, and user can adjust as required.Often get the status data of a monitored object, first according to the attribute of predefined monitored object, determine the attribute of this status data, monitor data and attribute 210(thereof are such as comprised monitored object type, monitored object rank and self-defined priority) be integrated into a packet, generate a monitor message, monitor message is put into monitoring message queue.In an instantiation, monitor data and attribute 210 thereof, monitoring client threshold data 209, also according to the monitored object type determination monitoring client threshold data 209 of this monitor data, are synthesized monitor message, as shown in Fig. 4 by data acquisition module together.Monitoring client threshold data is a scope normally.
In the present embodiment, data acquisition module also according to monitored object type, monitored object rank and self-defined priority to arrange the processing priority calculating this monitor message other, processing priority Wei the weighted sum of monitored end type, monitored end service class and self-defined priority.
Step 208, buffer unit 205 does not determine the position of monitoring message in queue according to processing priority, monitoring message is put into monitoring message queue, and the monitoring message that processing priority is not high is arranged in by the position of priority processing.
The arrangement mode of message in message queue is normally ranked according to the time order and function put into, and is read according to putting in order and processes.In order to realize the controllability that monitored object is reported to the police, user is thought, and the warning of important monitored object is by priority processing, in a kind of instantiation of the present embodiment, do not determine the position of monitoring message in queue when monitoring message being put into monitoring message queue according to processing priority, the monitoring message that processing priority is not high is arranged in by the position of priority processing.The monitor message that such as priority is high is placed in the front end of message queue automatically, is preferentially read.
In the present embodiment, monitoring alarm control system is planned monitored end according to the importance of monitored end, the processing priority higher to the monitored end setting being in important status in system is other, preferentially can transmit its warning message generated when it breaks down and carry out alarm operation, the carrying out of the troubleshooting work that avoids delay.
In the present embodiment, the job step of forewarn module 202 comprises:
Step 211, read monitoring message, forewarn module 202 reads monitor message from queue.
Step 212, judges whether monitored object current monitor data exceed alarm threshold value data.Forewarn module 202 reads monitor message from queue, extracts the monitor data in monitor message and alarm threshold value data, and both is compared, judge whether current monitor data exceed the scope of alarm threshold value data.If monitor data is beyond threshold range, then perform step 213, otherwise perform step 216.In a further embodiment, those skilled in the art are to be understood that, also alarm threshold value data can not be comprised in monitor message, in this step, forewarn module 202 according to the attribute determination alarm threshold value data of monitor data, and judges whether current monitor data exceed the scope of alarm threshold value data.
Step 213, generates warning message.When judging the scope of current monitor data beyond alarm threshold value data in step 212, then illustrate that monitored object may have occurred fault, so generate warning message.
Step 214, judges whether warning message meets warning trigger condition.Forewarn module 202 does not set different warning trigger conditions according to different monitoring object type and/or processing priority, when warning message meets warning trigger condition, forewarn module 202 performs step 215, produce warning trigger pip, perform and open alarm operation, otherwise perform step 217.
Step 215, generates warning trigger pip, and notice opens alarm operation.
Step 216, abandons current monitor message, and system reads next monitoring message automatically.
Step 217, abandons current alerts information, reads next warning message in queue, or the monitor message not meeting warning trigger condition can also be recorded, apply for subsequent processing steps.
Otherwise turn to and perform step 212, judge next monitor message.Warning trigger condition can comprise alarm times and/or interval time.Alarm times can be above the number of times that monitoring client threshold data produces warning message, also can be the frequency threshold value that satisfied warning trigger condition produces warning trigger pip, interval time can be inspection interval time threshold or alarm interval time threshold, the inspection interval time is judge the time of current monitor message and the interval time between judging from the time of the last monitoring message of same monitored end, and the alarm interval time is the interval time between the time of fire alarming of the last time judging current monitor message and same monitored end.
In a kind of instantiation, warning trigger condition is the number of times producing warning message, such as, when warning message number of times is set to 5, the number of times receiving the identical warning message coming from same monitored object within a certain period of time when forewarn module 202 is only had to be greater than 5, namely illustrate that monitored object may exist fault, then produce warning trigger pip; The number of times receiving identical warning message when forewarn module 202 is within a certain period of time less than 5, namely illustrate that monitored object may because the emergentness fluctuation of external environment causes temporary transient fault, but recovered at short notice normally to have worked, then forewarn module 202 does not produce warning trigger pip.
In another kind of instantiation, warning trigger condition is the alarm interval time, and the alarm interval time can be arranged as required flexibly, also can by system Lookup protocol such as, after forewarn module 202 produces a warning trigger pip, it is 60 seconds by alarm interval set of time.If when forewarn module 202 receives next warning message, then judge whether interval time is greater than 60 seconds, if it is produce next warning trigger pip, otherwise, do not produce warning trigger pip, avoid producing invalid warning.Like this, both can avoid repeatedly receiving identical warning at short notice, and also can prevent from missing useful warning simultaneously.By the setting to warning trigger condition, invalid warning message can be filtered out more exactly, improve the accuracy of monitoring alarm control system.
Monitored object rank in the present embodiment, self-defined priority and warning trigger condition can be pre-set according to practical work demand.Monitoring alarm control system in the present embodiment can realize the preferential monitored object high to rank and carry out alarm operation, can also be filtered invalid warning by warning trigger condition simultaneously, the unlatching of invalid warning in minimizing work, the work load reduced a staff, improves accuracy and the controllability of monitoring and alarming system.
Embodiment three:
The present embodiment adds warning trigger control module 303 on the basis of above-described embodiment, below to increase warning trigger control module 303 on the basis of embodiment two, in order to be described, please refer to Fig. 5, Fig. 6, monitoring alarm control system comprises data acquisition module 301, forewarn module 302 and warning trigger control module 303.Wherein, data acquisition module 301, forewarn module 302 and the data acquisition module in above-described embodiment, forewarn module is identical, warning trigger element 303 is for determining whether trigger alarm based on the warning trigger pip produced, whether the warning trigger control module 303 monitored end corresponding according to the monitored end related information table determination current alerts trigger pip set up in advance has the monitoring associated up or down, if had, carry out the control of trigger alarm based on the warning trigger pip produced and related information, if the monitoring do not associated up or down, then based on the warning trigger pip trigger alarm produced.Warning trigger control module 303 is when determining that monitored end corresponding to warning trigger pip has the monitoring upwards associated, check whether higher level's associate device has trigger alarm within the very first time of setting, if had, then to current warning trigger pip no longer trigger alarm, otherwise based on current warning trigger pip trigger alarm; Described warning trigger control module 303 when determining that monitored end corresponding to warning trigger pip has the monitoring of association downwards, within the second time of setting, to the warning trigger pip no longer trigger alarm produced by downward associate device.
The specific works flow process of trigger control module of reporting to the police in the present embodiment 303 is as follows:
Step 304, warning trigger control module 303 receives the warning trigger pip that forewarn module occurs, whether the monitored end corresponding according to the monitored end related information table determination current alerts trigger pip set up in advance has the monitoring associated up or down, if there is no, perform step 305, if existed, perform step 306.
Step 305, if there is not association monitoring up or down in monitored end corresponding to current warning trigger pip, then based on current warning trigger pip trigger alarm.
Step 306, judges whether monitored end corresponding to current warning trigger pip exists association monitoring upwards, if so, then performs step 307.If there is no upwards association monitoring, then perform step 309.
Whether step 307, judge association monitoring trigger alarm upwards, if it is perform step 308, if association monitoring does not upwards have trigger alarm, then forward step 305 to, based on current warning trigger pip trigger alarm.
Step 308, to current warning trigger pip no longer trigger alarm, namely closes the warning of current monitored end.
Step 309, there is downward association monitoring in monitored end corresponding to current warning trigger pip, regardless of downward association monitoring whether trigger alarm, all to the warning trigger pip no longer trigger alarm produced by downward associate device, namely close the warning that downward associate device produces.And forward step 305 to, based on current warning trigger pip trigger alarm.
Such as, monitor the existing state of certain server, the various services that server runs have been monitored simultaneously, comprise server local resource etc., the warning when monitored object server triggers, illustrate that whole server is in inactive state, also just mean various service in its system and resource all unavailable, then the various service in system and resource but still can transmit warning message, after warning trigger control module 303 processes, only can carry out unlatching alarm operation to the warning message of parent server.In like manner, for the monitoring of certain switch, when the existing state of switch triggers warning, namely deducibility be connected to each server under switch and resource all unavailable, this type of situation, warning trigger control module 303 just only can send a warning message (the corresponding warning message of switch), and subordinate's monitored object information of additional association, and can not to the equal trigger alarm of each monitored object.
In the present embodiment, also first can judge whether monitored end corresponding to current warning trigger pip exists downward association monitoring, and then judge whether monitored end corresponding to current warning trigger pip exists association monitoring upwards.
After the monitored end of higher level opens warning, the warning message of its monitored end of corresponding subordinate can be defined as invalid warning message, therefore, the warning trigger control module 303 increased in the present embodiment can reduce partial invalidity alarm request, the work load reduced a staff, improves the accurate of monitoring alarm control system and controllability.
Embodiment four:
The present embodiment adds mark module and tokens statistics module on the basis of above-described embodiment, below to increase mark module and tokens statistics module on the basis of embodiment three, in order to be described, please refer to Fig. 7, Fig. 8, monitoring alarm control system comprises data acquisition module 401, forewarn module 402, warning trigger control module 403, mark module 404 and tokens statistics module 405.Wherein, data acquisition module 401, forewarn module 402, warning trigger control module 403 and the data acquisition module in above-described embodiment, forewarn module, warning trigger control module are identical.The difference of the present embodiment and embodiment three is to add mark module 404 and tokens statistics module 405.Mark module 404 is for marking described warning message; Tokens statistics module 405, for being added up by described label information, generates statistics.
In the present embodiment, the specific works flow process of mark module 404 and tokens statistics module 405 is as follows:
Step 406, mark module 404 is after receiving warning message, and the difference according to warning message type carries out different marks, generates label information.
Step 407, tokens statistics module 405 receives the label information that mark module 404 generates, and adds up it, generates statistics, and described statistics can be the chart or digital form that generate according to label information.
In the present embodiment, for the warning message triggered, mark module 404 does mark of correlation to it, generate label information so that statistical study is used, and in the equipment can remembered stored in database, and the warning triggered to be gathered according to label information by tokens statistics module 405, add up according to not isolabeling, effective statistics is provided, and chart data intuitively can be generated.Staff, by analysis statisticaling data, can be optimized adjustment to the service of high failure rate.
Embodiment five:
The present embodiment adds automatic troubleshooting module on the basis of above-described embodiment, below to increase automatic troubleshooting module on the basis of embodiment four, in order to be described, please refer to Fig. 9, Figure 10, in the present embodiment, monitoring alarm control system comprises data acquisition module 501, forewarn module 502, warning trigger control module 503, mark module 504, tokens statistics module 505 and automatic troubleshooting module 506.Wherein, data acquisition module 501, forewarn module 502, warning trigger control module 503, mark module 504, tokens statistics module 505 and the data acquisition module in above-described embodiment, forewarn module, warning trigger control module, mark module, tokens statistics module are identical.Automatic troubleshooting module 506 judges whether to carry out automatic troubleshooting to current alerts information according to label information, if label information is to there being the automatic troubleshooting handling procedure prestored, explanation can carry out automatic troubleshooting to current alerts information, then send automatic process information to the monitored end that warning message is corresponding, if cannot, then to current warning message trigger alarm, described automatic process information is used for notifying that monitored end runs processing procedure and automatically fixes a breakdown.
In the present embodiment, the specific works flow process of automatic troubleshooting module 506 is as follows:
Step 507, automatic troubleshooting module 506 judges whether to carry out automatic troubleshooting to current alerts information according to label information.
Step 508, if can carry out automatic troubleshooting to current alerts information, then troubleshooting module 506 sends automatic process information to monitored end automatically, and described automatic process information is used for notifying that monitored end runs processing procedure and automatically fixes a breakdown.
Step 509, monitored termination runs automatic processing procedure after receiving the automatic process information of automatic troubleshooting module 506 transmission, carries out automatic troubleshooting.
Step 510, if cannot carry out automatic troubleshooting to current alerts information, then to current alerts information trigger alarm.
In the present embodiment, for the warning message that can carry out automatic troubleshooting, automatic troubleshooting module 506 sends the corresponding monitored end of automatic process information instruction and runs automatic processing procedure and carry out automatic troubleshooting after the automatic troubleshooting instruction receiving user.Achieve the automatic business processing of troubleshooting operation in monitoring alarm control procedure, thus decrease artificial intervention or maintenance process.
Above content is in conjunction with concrete embodiment further description made for the present invention, can not assert that specific embodiments of the invention are confined to these explanations.For general technical staff of the technical field of the invention, without departing from the inventive concept of the premise, some simple deduction or replace can also be made.