CN116112342A - Alarm information processing method, device, electronic equipment and storage medium - Google Patents
Alarm information processing method, device, electronic equipment and storage medium Download PDFInfo
- Publication number
- CN116112342A CN116112342A CN202310017420.XA CN202310017420A CN116112342A CN 116112342 A CN116112342 A CN 116112342A CN 202310017420 A CN202310017420 A CN 202310017420A CN 116112342 A CN116112342 A CN 116112342A
- Authority
- CN
- China
- Prior art keywords
- alarm
- information
- alarm information
- initial
- level
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000010365 information processing Effects 0.000 title claims abstract description 28
- 238000003672 processing method Methods 0.000 title claims abstract description 19
- 238000012545 processing Methods 0.000 claims abstract description 41
- 238000000034 method Methods 0.000 claims abstract description 38
- 238000004590 computer program Methods 0.000 claims description 16
- 230000001629 suppression Effects 0.000 claims description 6
- 238000012544 monitoring process Methods 0.000 description 17
- 238000012423 maintenance Methods 0.000 description 13
- 230000008569 process Effects 0.000 description 9
- 238000004891 communication Methods 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 5
- 238000007726 management method Methods 0.000 description 5
- 230000033228 biological regulation Effects 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 230000006870 function Effects 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000013475 authorization Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000013499 data model Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000003062 neural network model Methods 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000001953 sensory effect Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0631—Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0604—Management of faults, events, alarms or notifications using filtering, e.g. reduction of information by using priority, element types, position or time
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0631—Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
- H04L41/064—Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis involving time analysis
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Debugging And Monitoring (AREA)
Abstract
The invention discloses an alarm information processing method, an alarm information processing device, electronic equipment and a storage medium. The method comprises the following steps: receiving initial alarm information of each service node, and respectively determining alarm levels corresponding to the initial alarm information; for any initial alarm information, when the alarm level of the current initial alarm information does not meet the preset advanced alarm condition, determining whether the alarm duration information of the initial alarm information meets the preset alarm duration threshold condition; if yes, carrying out alarm upgrading processing on the current initial alarm information to generate advanced alarm information corresponding to the current initial alarm information. According to the technical scheme disclosed by the invention, the problem that the low-level alarm is ignored for a long time in the prior art and is not recovered is solved by judging the specific condition of the alarm and upgrading the low-level alarm, so that the alarm processing efficiency is improved, and the system performance of the cloud platform is improved.
Description
Technical Field
The present invention relates to the field of computer technologies, and in particular, to a method and apparatus for processing alarm information, an electronic device, and a storage medium.
Background
Under the condition that technologies such as cloud computing, big data and the like are mature, service products of users are more and more, and requirements of users for responding to resource use conditions of own resources, running conditions and health of services and timely receiving abnormal alarms are more and more urgent. The monitoring is taken as a platform capable of carrying out three-dimensional monitoring on resources, and the failure of the alarm processing is particularly important.
Prometheus is an open source monitoring system, which is an open source version of Google BorgMon. With the rise of container and Kubernetes (K8 s for short) technology for managing containerized applications on multiple hosts in a cloud platform, promethaus is receiving increasing attention.
But it is now found in the monitoring process using promethaus that: low-level alarms are ignored for a long time, resulting in problems with alarms not being recovered, and thus resulting in the system performance of the monitored cloud platform being affected.
Disclosure of Invention
The invention provides an alarm information processing method, an alarm information processing device, electronic equipment and a storage medium, which are used for solving the problem that the alarm is not recovered because the low-level alarm is ignored for a long time in the prior art by judging the specific condition of the alarm and upgrading the low-level alarm, thereby improving the alarm processing efficiency and realizing the improvement of the system performance of a cloud platform.
In a first aspect, an embodiment of the present invention provides an alarm information processing method, where the method includes:
receiving initial alarm information of each service node, and respectively determining alarm levels corresponding to the initial alarm information;
for any initial alarm information, when the alarm level of the current initial alarm information does not meet the preset advanced alarm condition, determining whether the alarm duration information of the initial alarm information meets the preset alarm duration threshold condition;
if yes, carrying out alarm upgrading processing on the current initial alarm information to generate advanced alarm information corresponding to the current initial alarm information.
Optionally, the method further comprises:
and when the alarm level information of the current initial alarm information does not meet the preset advanced alarm condition and the information sending frequency of the current initial alarm information is larger than a preset frequency threshold value, performing alarm suppression processing on the current initial alarm information.
Optionally, the method further comprises:
and when the alarm level of the current initial alarm information meets the preset advanced alarm condition, generating advanced alarm information corresponding to the current initial alarm information based on the initial alarm information.
Optionally, the method further comprises:
and for any service node, acquiring node index data of the current service node, and generating initial alarm information of the current service node based on the node index data and a preset alarm triggering rule.
Optionally, the determining the alarm level corresponding to each initial alarm information includes:
for any initial alarm information, an alarm level field of the current initial alarm information is determined, and the alarm level of the current initial alarm information is determined based on the alarm level field.
Optionally, the method further comprises:
and generating advanced alarm prompt information of the service node corresponding to the initial alarm information based on the alarm node information of the current initial alarm information and the advanced alarm information.
Optionally, the method further comprises:
acquiring a data query instruction of node state data of any service node, and inputting the data query instruction into a preset query interface to realize data query;
and receiving the query result of the node state data of the current service node output by the query interface.
In a second aspect, an embodiment of the present invention further provides an alarm information processing apparatus, where the apparatus includes:
the alarm level determining module is used for receiving the initial alarm information of each service node and respectively determining the alarm level corresponding to each initial alarm information;
the alarm condition judging module is used for judging whether the alarm continuous information of the initial alarm information meets the preset alarm continuous threshold condition or not when the alarm level of the current initial alarm information does not meet the preset advanced alarm condition for any initial alarm information;
and the advanced alarm information generation module is used for carrying out alarm upgrading processing on the current initial alarm information if so, and generating advanced alarm information corresponding to the current initial alarm information.
In a third aspect, an embodiment of the present invention further provides an electronic device, including:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,,
the memory stores a computer program executable by the at least one processor to enable the at least one processor to perform the alert information processing method according to any one of the embodiments of the present invention.
In a fourth aspect, an embodiment of the present invention further provides a computer readable storage medium, where computer instructions are stored, where the computer instructions are configured to cause a processor to execute the alarm information processing method according to any one of the embodiments of the present invention.
The technical scheme of the embodiment of the invention specifically comprises the following steps: receiving initial alarm information of each service node, and respectively determining alarm levels corresponding to the initial alarm information; for any initial alarm information, when the alarm level of the current initial alarm information does not meet the preset advanced alarm condition, determining whether the alarm duration information of the initial alarm information meets the preset alarm duration threshold condition; if yes, carrying out alarm upgrading processing on the current initial alarm information, and generating advanced alarm information corresponding to the current initial alarm information. After receiving the alarm information sent by the promethaus, the Alert manager component preset in the technical scheme carries out corresponding processing on the alarm information according to the specific condition of the alarm information, for example, carries out upgrading processing on low-level alarm information which is not processed for a long time, so that the problem that the low-level alarm is ignored for a long time and the alarm is not recovered is solved, the alarm processing efficiency is improved, and the system performance of the cloud platform is improved.
It should be understood that the description in this section is not intended to identify key or critical features of the embodiments of the invention or to delineate the scope of the invention. Other features of the present invention will become apparent from the description that follows.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required for the description of the embodiments will be briefly described below, and it is apparent that the drawings in the following description are only some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flowchart of an alarm information processing method according to a first embodiment of the present invention;
FIG. 2 is a schematic diagram of an alarm information processing apparatus according to a second embodiment of the present invention;
fig. 3 is a schematic structural diagram of an electronic device implementing an alarm information processing method according to an embodiment of the present invention.
Detailed Description
In order that those skilled in the art will better understand the present invention, a technical solution in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in which it is apparent that the described embodiments are only some embodiments of the present invention, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the present invention without making any inventive effort, shall fall within the scope of the present invention.
It should be noted that the terms "first," "second," and the like in the description and the claims of the present invention and the above figures are used for distinguishing between similar objects and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used may be interchanged where appropriate such that the embodiments of the invention described herein may be implemented in sequences other than those illustrated or otherwise described herein.
The names of messages or information interacted between the various devices in the embodiments of the present disclosure are for illustrative purposes only and are not intended to limit the scope of such messages or information.
It will be appreciated that prior to using the technical solutions disclosed in the embodiments of the present disclosure, the user should be informed and authorized of the type, usage range, usage scenario, etc. of the personal information related to the present disclosure in an appropriate manner according to the relevant legal regulations.
For example, in response to receiving an active request from a user, a prompt is sent to the user to explicitly prompt the user that the operation it is requesting to perform will require personal information to be obtained and used with the user. Thus, the user can autonomously select whether to provide personal information to software or hardware such as an electronic device, an application program, a server or a storage medium for executing the operation of the technical scheme of the present disclosure according to the prompt information.
As an alternative but non-limiting implementation, in response to receiving an active request from a user, the manner in which the prompt information is sent to the user may be, for example, a popup, in which the prompt information may be presented in a text manner. In addition, a selection control for the user to select to provide personal information to the electronic device in a 'consent' or 'disagreement' manner can be carried in the popup window.
It will be appreciated that the above-described notification and user authorization process is merely illustrative and not limiting of the implementations of the present disclosure, and that other ways of satisfying relevant legal regulations may be applied to the implementations of the present disclosure.
It will be appreciated that the data (including but not limited to the data itself, the acquisition or use of the data) involved in the present technical solution should comply with the corresponding legal regulations and the requirements of the relevant regulations.
Example 1
Fig. 1 is a flowchart of an alarm information processing method according to an embodiment of the present invention, where the embodiment is applicable to a situation of monitoring a cloud platform.
Currently, promethaus (promiscus) is receiving more and more attention from various monitoring modes of each cloud platform in a cluster. Prometaus is written in Go language, acquires monitoring information in PULL mode, and provides multidimensional data model and flexible query interface. Prometaus can not only configure the monitoring object through a static file, but also support an automatic discovery mechanism, and can dynamically acquire the monitoring object. And promethaus periodically grabs the status of the monitored component through HTTP (hypertext transfer) protocol, and any component can realize access monitoring as long as a corresponding HTTP interface is provided. Specifically, prometheus directly pulls monitoring index data of preset interfaces in each cloud platform, and stores the monitoring index data into a time sequence database preset by Prometheus.
The time sequence database is a component of promethaus, is totally called as a time sequence database, and is mainly used for processing data with time labels (changing according to the sequence of time, namely time serialization). Specifically, the monitoring index data can be stored in a time sequence database in a time tag form, further, the monitoring index data is analyzed based on a preset alarm rule to obtain alarm information, and the alarm information is sent to an Alert manager component, so that the component sends the alarm information to an operation and maintenance personnel management end corresponding to the cloud platform, and the operation and maintenance personnel maintain the cloud platform, so that the system performance of the cloud platform is improved. However, in the existing implementation, the Alert manager component does not support a personalized alarm rule policy, and does not support the alarm to be updated according to the duration, so that problems such as redundant alarm information and alarm message storm are easily caused. Aiming at the technical problems, the technical scheme of the embodiment of the invention provides an alarm information processing method, which specifically comprises the following steps: after receiving the alarm information sent by the promethaus, the Alert manager component carries out corresponding processing on the alarm information according to the specific condition of the alarm information, for example, carries out upgrading processing on low-level alarm information which is not processed for a long time, thereby solving the problem that the low-level alarm is ignored for a long time and the alarm is not recovered, and further improving the alarm processing efficiency so as to improve the system performance of the cloud platform.
The method may be performed by an alert information processing apparatus, which may be implemented in hardware and/or software, and which may be configured in a cloud server. As shown in fig. 1, the method includes:
s110, receiving initial alarm information of each service node, and respectively determining alarm levels corresponding to the initial alarm information.
In the embodiment of the present invention, the service node may be understood as a cloud platform monitored by promethaus in the cluster, and in this embodiment, the cloud platform may be a private cloud or a public cloud, which is not limited thereto. The initial Alert information is an Alert record sent by promethaus to each service node of the Alert manager.
Optionally, in this embodiment, before monitoring the service nodes, corresponding promethaus meta information and alarm triggering rules need to be configured in advance based on node data of each service node.
Specifically, the configuration for the promethaus meta-information may include: firstly, cluster information in the monitored is input in the precursor, for example, when the cluster is k8s (kubernetes, orchestration management tool of the portable container) cluster, the corresponding kubeconfig is configured, and the corresponding precursor and the corresponding type are configured; optionally, if deployment is performed in an operator mode, the corresponding k8s clusters need to be associated; if the deployment is manual, the self-research sidecar needs to be deployed on the corresponding promethaus node.
Specific configurations for alarm triggering rules may include: and configuring a corresponding query expr according to the promethaus grammar. Optionally, the test and verification can be performed through a unified query interface. Further, configuring relevant labels of the alarm rules specifically comprises configuring key alarm grades, alarm duration time, belonging classifications and the like, associating alarm triggering rules related to each cloud platform with corresponding promethaus of each cloud platform, and automatically issuing the configured alarm triggering rules to the corresponding promethaus to realize subsequent alarm analysis.
Based on the foregoing embodiment, optionally, the technical solution of this embodiment further includes: and for any service node, acquiring node index data of the current service node, and generating initial alarm information of the current service node based on the node index data and a preset alarm triggering rule.
The node index data may be understood as each index data generated by the service node in the operation process. In this embodiment, the node index data includes, but is not limited to, monitoring indexes of the host, the process, the virtual machine, and the like of the service node.
Specifically, each service node in the cluster is associated with a corresponding promethaus. In the process of operating the service node, the prometaus periodically pulls the node index data of the service node, namely the monitoring index data, and performs alarm analysis on the node index data based on a preset alarm triggering rule. Specifically, when any index in the node index data exceeds a trigger threshold specified in the improved trigger rule, an alarm record corresponding to the index is generated, and the alarm record is further sent to an Alert manager component as initial alarm information of the current service node.
Further, when receiving an alarm record sent by prometaheus, the Alert manager takes the alarm record as initial alarm information, analyzes the initial alarm information, determines an alarm level of the initial alarm information based on an analysis result, and correspondingly prompts the initial alarm information based on the alarm level so as to improve the processing efficiency of the alarm information.
Optionally, the method for determining the alarm level of the initial alarm information in the technical solution of this embodiment may include: for any initial alarm information, an alarm level field of the current initial alarm information is determined, and an alarm level of the current initial alarm information is determined based on the alarm level field.
Specifically, when the initial alarm information is acquired, field analysis is performed on the initial alarm information to obtain each alarm field in the initial alarm information, alarm level fields in each alarm field are determined, and then the alarm level corresponding to the initial alarm information is determined based on the specific field content in the alarm level fields. Of course, the alarm level in the initial alarm information may also be determined based on other manners, such as a neural network model, which is not limited in this embodiment.
S120, for any initial alarm information, when the alarm level of the current initial alarm information does not meet the preset advanced alarm condition, determining whether the alarm duration information of the initial alarm information meets the preset alarm duration threshold condition.
In this embodiment, the alert levels include, but are not limited to, low-level alerts and high-level alerts. Specifically, for the initial alarm information of any service node, acquiring field content corresponding to an alarm level on the basis of determining an alarm level field of the initial alarm information, and comparing the field content with a preset advanced alarm condition, if the field content is greater than the advanced alarm condition, the alarm level of the initial alarm information is an advanced alarm, namely the preset advanced alarm condition is satisfied; otherwise, the alarm is a low-level alarm, and the preset high-level alarm condition is not met.
For example, the field content corresponding to the low-level alarm may be a value of 1, and the field content corresponding to the high-level alarm may be a value of 2; the preset advanced warning condition may be a value of 2 or more. Specifically, if the field content of the alarm field of the current initial alarm information is 1, it may be determined that the alarm level of the current initial alarm information does not meet the preset advanced alarm condition based on the preset advanced alarm condition.
Optionally, in the technical solution in this embodiment, when it is determined that the alert level of the current initial alert information does not meet the preset advanced alert condition, it is indicated that the alert level of the initial alert information is an advanced alert, and advanced alert information corresponding to the current initial alert information may be generated based on the initial alert information, and alert prompt may be performed on operation and maintenance personnel corresponding to the service node based on the advanced alert information.
In particular, advanced alert information may be understood as more urgent alert cues. For example, the call is required to notify the operation and maintenance personnel corresponding to the corresponding service node; and then, for example, popup prompt is performed at a distinct position, such as a central position, of a display interface of the service node.
Optionally, when it is determined that the alarm level of the current initial alarm information does not meet the preset advanced alarm condition, the alarm level of the initial alarm information is indicated to be a low-level alarm, the low-level alarm information corresponding to the current initial alarm information can be generated based on the initial alarm information, and the operation and maintenance personnel corresponding to the service node can be alerted based on the low-level alarm information.
In particular, low-level alert information may be understood as relatively loose alert cues. For example, the operation and maintenance personnel corresponding to the corresponding service node can be notified by a short message; and then, for example, popup window prompt is carried out at the edge position of the display interface of the service node.
In this embodiment, if the service node corresponding to the alarm information is not maintained, the service node will frequently generate the alarm information based on a preset alarm triggering rule, but because some alarm information is low-level alarm information and has a small influence on the operation of the service node, if other higher-level alarm information and more influenced alarm information appear, the operation and maintenance personnel may not be able to process the initial alarm information with the smaller influence in time, so that frequent sending of the alarm information may cause occurrence of scaling ladders such as alarm storm, and influence the maintenance of the operation and maintenance personnel on the service node. Optionally, in order to solve the above technical problem, in the technical solution of this embodiment, when the alert level information of the current initial alert information does not meet the preset advanced alert condition, and the information sending frequency of the current initial alert information is greater than the preset frequency threshold, the current initial alert information is subjected to alert suppression processing, so as to reduce the sending of the alert information, thereby avoiding the above problem.
In this embodiment, the alarm suppression processing may be understood as processing such as deleting and not prompting the alarm information, so that the alarm information is not frequently sent to the management end of the operation and maintenance personnel.
Optionally, in order to avoid the problem that the low-level alarm information is unmanned for a long time, the technical solution of this embodiment further includes obtaining alarm duration information of the initial alarm information, determining whether the alarm duration information meets a preset alarm duration threshold condition, and further determining whether to perform information processing on the initial alarm information based on a judgment result.
The alarm duration information may be understood as a prompting time length of the alarm information generated based on the alarm triggering rule when prompting. Specifically, on the basis of acquiring each alarm field of the initial alarm information, an alarm duration field corresponding to the alarm duration information in the initial alarm information is determined, field content corresponding to the alarm duration field is determined, and then the alarm duration information in the initial alarm information is determined based on the field content.
Specifically, the alarm duration information in the field content is compared with a preset alarm duration threshold condition, and if the alarm time length in the alarm duration information is longer than the threshold time length in the alarm duration threshold condition, the alarm duration information of the initial alarm information is indicated to meet the alarm duration threshold condition; otherwise, the preset advanced warning condition is not met.
And S130, if so, carrying out alarm upgrading processing on the current initial alarm information to generate advanced alarm information corresponding to the current initial alarm information.
In this embodiment, the alarm upgrading process may be understood as modifying the field content corresponding to the alarm level of the initial alarm information, so that the initial alarm information is changed from a low-level alarm to a high-level alarm.
Specifically, if it is determined that the alarm duration of the initial alarm information meets the preset alarm duration threshold condition, it is indicated that the service node corresponding to the current alarm information is not maintained for a long time, so that an alarm upgrading process needs to be performed on the initial alarm information, that is, an alarm level of the initial alarm information is upgraded from a low-level alarm to a high-level alarm, so as to generate the high-level alarm information corresponding to the initial alarm information.
On the basis of the foregoing embodiment, on the basis of obtaining the foregoing advanced warning information, the present embodiment further includes: and generating advanced alarm prompt information of the service node corresponding to the initial alarm information based on the alarm node information of the current initial alarm information.
The alarm node information includes, but is not limited to, node identification of the service node, management end of operation and maintenance personnel corresponding to the service node, and the like.
Specifically, after the advanced warning prompt information is generated, the advanced warning prompt information is distributed to operation and maintenance personnel corresponding to the service node based on a preset Alert Center component, so that the operation and maintenance personnel maintain the service node to improve the system performance.
Optionally, on the basis of the foregoing embodiment, when each node generates the initial alarm information, the low-level alarm information and the high-level alarm information, the technical solution of this embodiment stores each alarm information in correspondence with the service node, so as to implement subsequent backtracking of the alarm record, thereby providing a data basis for stability and quality analysis of the subsequent system.
Based on the foregoing embodiment, the technical solution of this embodiment further includes: acquiring a data query instruction of node state data of any service node, and inputting the data query instruction into a preset query interface to realize data query; and receiving the query result of the node state data of the current service node output by the query interface.
In this embodiment, the service node may be a private cloud or a public cloud platform. In the existing application, when only inquiring the data of each platform in a multi-cluster multi-cloud complex environment, the Prometaus corresponding to each platform is required to be predetermined, and then the data inquiry is carried out based on the Prometaus, and a unified inquiry entrance of each platform is not available, so that the technical scheme of the embodiment sets a query gateway in each Prometaus in advance to realize a unified data plane and support the unified inquiry of multiple platforms.
In this embodiment, the query gateway realizes a high-performance unified gateway proxy by the golang to realize http reverse distribution of the user query request. Specifically, a data query instruction is input to a query gateway, the query gateway sends the data query instruction to prometheus controller corresponding to the instruction, the prometheus controller processes relevant label of the promethaus to filter, sends the data query instruction distributed by the route to a back-end promethaus, further receives a query result fed back by the back-end, and feeds back the query result after aggregation to a query interface, so as to achieve the query result of node state data of the current service node.
The technical scheme of the embodiment of the invention specifically comprises the following steps: receiving initial alarm information of each service node, and respectively determining alarm levels corresponding to the initial alarm information; for any initial alarm information, when the alarm level of the current initial alarm information does not meet the preset advanced alarm condition, determining whether the alarm duration information of the initial alarm information meets the preset alarm duration threshold condition; if yes, carrying out alarm upgrading processing on the current initial alarm information, and generating advanced alarm information corresponding to the current initial alarm information. After receiving the alarm information sent by the promethaus, the Alert manager component preset in the technical scheme carries out corresponding processing on the alarm information according to the specific condition of the alarm information, for example, carries out upgrading processing on low-level alarm information which is not processed for a long time, so that the problem that the low-level alarm is ignored for a long time and the alarm is not recovered is solved, the alarm processing efficiency is improved, and the system performance of the cloud platform is improved.
Example two
Fig. 2 is a schematic structural diagram of an alarm information processing apparatus according to a third embodiment of the present invention. As shown in fig. 2, the apparatus includes: an alarm level determination module 210, an alarm condition judgment module 220, and an advanced alarm information generation module 230; wherein,,
the alarm level determining module 210 is configured to receive initial alarm information of each service node, and determine alarm levels corresponding to the initial alarm information respectively;
the alarm condition judging module 220 is configured to determine, for any initial alarm information, if an alarm level of a current initial alarm information does not meet a preset advanced alarm condition, whether alarm duration information of the initial alarm information meets a preset alarm duration threshold condition;
and the advanced warning information generating module 230 is configured to perform warning upgrading processing on the current initial warning information if yes, and generate advanced warning information corresponding to the current initial warning information.
The technical scheme of the embodiment of the invention specifically comprises the following steps: receiving initial alarm information of each service node, and respectively determining alarm levels corresponding to the initial alarm information; for any initial alarm information, when the alarm level of the current initial alarm information does not meet the preset advanced alarm condition, determining whether the alarm duration information of the initial alarm information meets the preset alarm duration threshold condition; if yes, carrying out alarm upgrading processing on the current initial alarm information, and generating advanced alarm information corresponding to the current initial alarm information. After receiving the alarm information sent by the promethaus, the Alert manager component preset in the technical scheme carries out corresponding processing on the alarm information according to the specific condition of the alarm information, for example, carries out upgrading processing on low-level alarm information which is not processed for a long time, so that the problem that the low-level alarm is ignored for a long time and the alarm is not recovered is solved, the alarm processing efficiency is improved, and the system performance of the cloud platform is improved.
On the basis of the above embodiment, optionally, the apparatus further includes:
and the alarm suppression processing module is used for performing alarm suppression processing on the current initial alarm information when the alarm level information of the current initial alarm information does not meet the preset advanced alarm condition and the information sending frequency of the current initial alarm information is larger than a preset frequency threshold.
On the basis of the above embodiment, optionally, the apparatus further includes:
and the advanced warning information generation module is used for generating advanced warning information corresponding to the current initial warning information based on the initial warning information when the warning level of the current initial warning information meets the preset advanced warning condition.
On the basis of the above embodiment, optionally, the apparatus further includes:
the system comprises an initial alarm information generation module, a current service node generation module and a current service node generation module, wherein the initial alarm information generation module is used for acquiring node index data of the current service node for any service node, and generating initial alarm information of the current service node based on the node index data and a preset alarm triggering rule.
Based on the foregoing embodiment, optionally, the alarm level determining module 210 includes:
and the alarm level determining unit is used for determining an alarm level field of the current initial alarm information for any initial alarm information and determining the alarm level of the current initial alarm information based on the alarm level field.
On the basis of the above embodiment, optionally, the apparatus further includes:
and the advanced warning prompt information generation module is used for generating advanced warning prompt information of the service node corresponding to the initial warning information based on the warning node information of the current initial warning information and the advanced warning information.
On the basis of the above embodiment, optionally, the apparatus further includes:
the data query instruction acquisition module is used for acquiring a data query instruction of node state data of any service node, and inputting the data query instruction into a preset query interface so as to realize data query;
and the query result receiving module is used for receiving the query result of the node state data of the current service node output by the query interface.
The alarm information processing device provided by the embodiment of the invention can execute the alarm information processing method provided by any embodiment of the invention, and has the corresponding functional modules and beneficial effects of the execution method.
Example III
Fig. 3 shows a schematic diagram of the structure of an electronic device 10 that may be used to implement an embodiment of the invention. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. Electronic equipment may also represent various forms of mobile devices, such as personal digital processing, cellular telephones, smartphones, wearable devices (e.g., helmets, glasses, watches, etc.), and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be exemplary only, and are not meant to limit implementations of the inventions described and/or claimed herein.
As shown in fig. 3, the electronic device 10 includes at least one processor 11, and a memory, such as a Read Only Memory (ROM) 12, a Random Access Memory (RAM) 13, etc., communicatively connected to the at least one processor 11, in which the memory stores a computer program executable by the at least one processor, and the processor 11 may perform various appropriate actions and processes according to the computer program stored in the Read Only Memory (ROM) 12 or the computer program loaded from the storage unit 18 into the Random Access Memory (RAM) 13. In the RAM 13, various programs and data required for the operation of the electronic device 10 may also be stored. The processor 11, the ROM 12 and the RAM 13 are connected to each other via a bus 14. An input/output (I/O) interface 15 is also connected to bus 14.
Various components in the electronic device 10 are connected to the I/O interface 15, including: an input unit 16 such as a keyboard, a mouse, etc.; an output unit 17 such as various types of displays, speakers, and the like; a storage unit 18 such as a magnetic disk, an optical disk, or the like; and a communication unit 19 such as a network card, modem, wireless communication transceiver, etc. The communication unit 19 allows the electronic device 10 to exchange information/data with other devices via a computer network, such as the internet, and/or various telecommunication networks.
The processor 11 may be a variety of general and/or special purpose processing components having processing and computing capabilities. Some examples of processor 11 include, but are not limited to, a Central Processing Unit (CPU), a Graphics Processing Unit (GPU), various specialized Artificial Intelligence (AI) computing chips, various processors running machine learning model algorithms, digital Signal Processors (DSPs), and any suitable processor, controller, microcontroller, etc. The processor 11 performs the respective methods and processes described above, such as an alarm information processing method.
In some embodiments, the alert information processing method may be implemented as a computer program tangibly embodied on a computer-readable storage medium, such as the storage unit 18. In some embodiments, part or all of the computer program may be loaded and/or installed onto the electronic device 10 via the ROM 12 and/or the communication unit 19. When the computer program is loaded into the RAM 13 and executed by the processor 11, one or more steps of the alert information processing method described above may be performed. Alternatively, in other embodiments, the processor 11 may be configured to perform the alert information processing method in any other suitable manner (e.g., by means of firmware).
Various implementations of the systems and techniques described here above may be implemented in digital electronic circuitry, integrated circuit systems, field Programmable Gate Arrays (FPGAs), application Specific Integrated Circuits (ASICs), application Specific Standard Products (ASSPs), systems On Chip (SOCs), load programmable logic devices (CPLDs), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include: implemented in one or more computer programs, the one or more computer programs may be executed and/or interpreted on a programmable system including at least one programmable processor, which may be a special purpose or general-purpose programmable processor, that may receive data and instructions from, and transmit data and instructions to, a storage system, at least one input device, and at least one output device.
A computer program for carrying out methods of the present invention may be written in any combination of one or more programming languages. These computer programs may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus, such that the computer programs, when executed by the processor, cause the functions/acts specified in the flowchart and/or block diagram block or blocks to be implemented. The computer program may execute entirely on the machine, partly on the machine, as a stand-alone software package, partly on the machine and partly on a remote machine or entirely on the remote machine or server.
In the context of the present invention, a computer-readable storage medium may be a tangible medium that can contain, or store a computer program for use by or in connection with an instruction execution system, apparatus, or device. The computer readable storage medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. Alternatively, the computer readable storage medium may be a machine readable signal medium. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
To provide for interaction with a user, the systems and techniques described here can be implemented on an electronic device having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and a pointing device (e.g., a mouse or a trackball) through which a user can provide input to the electronic device. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user may be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic input, speech input, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a background component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such background, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: local Area Networks (LANs), wide Area Networks (WANs), blockchain networks, and the internet.
The computing system may include clients and servers. The client and server are typically remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. The server can be a cloud server, also called a cloud computing server or a cloud host, and is a host product in a cloud computing service system, so that the defects of high management difficulty and weak service expansibility in the traditional physical hosts and VPS service are overcome.
It should be appreciated that various forms of the flows shown above may be used to reorder, add, or delete steps. For example, the steps described in the present invention may be performed in parallel, sequentially, or in a different order, so long as the desired results of the technical solution of the present invention are achieved, and the present invention is not limited herein.
The above embodiments do not limit the scope of the present invention. It will be apparent to those skilled in the art that various modifications, combinations, sub-combinations and alternatives are possible, depending on design requirements and other factors. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present invention should be included in the scope of the present invention.
Claims (10)
1. An alarm information processing method, comprising:
receiving initial alarm information of each service node, and respectively determining alarm levels corresponding to the initial alarm information;
for any initial alarm information, when the alarm level of the current initial alarm information does not meet the preset advanced alarm condition, determining whether the alarm duration information of the initial alarm information meets the preset alarm duration threshold condition;
if yes, carrying out alarm upgrading processing on the current initial alarm information to generate advanced alarm information corresponding to the current initial alarm information.
2. The method according to claim 1, wherein the method further comprises:
and when the alarm level information of the current initial alarm information does not meet the preset advanced alarm condition and the information sending frequency of the current initial alarm information is larger than a preset frequency threshold value, performing alarm suppression processing on the current initial alarm information.
3. The method according to claim 1, wherein the method further comprises:
and when the alarm level of the current initial alarm information meets the preset advanced alarm condition, generating advanced alarm information corresponding to the current initial alarm information based on the initial alarm information.
4. The method according to claim 1, wherein the method further comprises:
and for any service node, acquiring node index data of the current service node, and generating initial alarm information of the current service node based on the node index data and a preset alarm triggering rule.
5. The method of claim 1, wherein the determining the alarm level corresponding to each of the initial alarm information includes:
for any initial alarm information, an alarm level field of the current initial alarm information is determined, and the alarm level of the current initial alarm information is determined based on the alarm level field.
6. The method according to claim 1, wherein the method further comprises:
and generating advanced alarm prompt information of the service node corresponding to the initial alarm information based on the alarm node information of the current initial alarm information and the advanced alarm information.
7. The method according to claim 1, wherein the method further comprises:
acquiring a data query instruction of node state data of any service node, and inputting the data query instruction into a preset query interface to realize data query;
and receiving the query result of the node state data of the current service node output by the query interface.
8. An alert information processing apparatus, comprising:
the alarm level determining module is used for receiving the initial alarm information of each service node and respectively determining the alarm level corresponding to each initial alarm information;
the alarm condition judging module is used for judging whether the alarm continuous information of the initial alarm information meets the preset alarm continuous threshold condition or not when the alarm level of the current initial alarm information does not meet the preset advanced alarm condition for any initial alarm information;
and the advanced alarm information generation module is used for carrying out alarm upgrading processing on the current initial alarm information if so, and generating advanced alarm information corresponding to the current initial alarm information.
9. An electronic device, the electronic device comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,,
the memory stores a computer program executable by the at least one processor to enable the at least one processor to perform the alert information processing method of any one of claims 1-7.
10. A computer readable storage medium storing computer instructions for causing a processor to implement the alarm information processing method of any one of claims 1-7 when executed.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310017420.XA CN116112342A (en) | 2023-01-06 | 2023-01-06 | Alarm information processing method, device, electronic equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310017420.XA CN116112342A (en) | 2023-01-06 | 2023-01-06 | Alarm information processing method, device, electronic equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116112342A true CN116112342A (en) | 2023-05-12 |
Family
ID=86257541
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310017420.XA Pending CN116112342A (en) | 2023-01-06 | 2023-01-06 | Alarm information processing method, device, electronic equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116112342A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116612612A (en) * | 2023-05-19 | 2023-08-18 | 湖北清江水电开发有限责任公司 | Centralized control center alarm method for river basin step power plant, computer equipment and storage medium |
-
2023
- 2023-01-06 CN CN202310017420.XA patent/CN116112342A/en active Pending
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116612612A (en) * | 2023-05-19 | 2023-08-18 | 湖北清江水电开发有限责任公司 | Centralized control center alarm method for river basin step power plant, computer equipment and storage medium |
CN116612612B (en) * | 2023-05-19 | 2024-06-11 | 湖北清江水电开发有限责任公司 | Centralized control center alarm method for river basin step power plant, computer equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109257200B (en) | Method and device for monitoring big data platform | |
CN111124819B (en) | Method and device for full link monitoring | |
CN112911013B (en) | Cloud application processing method and device, computer equipment and storage medium | |
CN113220420B (en) | Service monitoring method, device, equipment, storage medium and computer program product | |
CN113377626B (en) | Visual unified alarm method, device, equipment and medium based on service tree | |
CN113590437B (en) | Alarm information processing method, device, equipment and medium | |
CN115396289B (en) | Fault alarm determining method and device, electronic equipment and storage medium | |
CN112948223B (en) | Method and device for monitoring running condition | |
CN112910733A (en) | Full link monitoring system and method based on big data | |
CN116112342A (en) | Alarm information processing method, device, electronic equipment and storage medium | |
CN110677271B (en) | Big data alarm method, device, equipment and storage medium based on ELK | |
CN113656239A (en) | Monitoring method and device for middleware and computer program product | |
CN115426287B (en) | System monitoring and optimizing method and device, electronic equipment and medium | |
CN115514618A (en) | Alarm event processing method and device, electronic equipment and medium | |
CN116302790A (en) | Operation resource management method, cloud gateway, electronic device and storage medium | |
CN115130112A (en) | Quick start-stop method, device, equipment and storage medium | |
CN114756301A (en) | Log processing method, device and system | |
CN114706893A (en) | Fault detection method, device, equipment and storage medium | |
CN114168371A (en) | Intelligent automatic fault alarm system | |
CN113282455A (en) | Monitoring processing method and device | |
CN118170617B (en) | Cluster anomaly detection method and device, electronic equipment and storage medium | |
CN116909757B (en) | Cluster management control system, method, electronic device and storage medium | |
CN115643091A (en) | System log monitoring method, device, equipment and medium | |
CN116668258A (en) | Alarm information processing method and device and electronic equipment | |
CN118819889A (en) | Message processing method, device, equipment, medium and program product |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |