WO2014040470A1 - Procédé et dispositif de traitement de message d'alarme - Google Patents

Procédé et dispositif de traitement de message d'alarme Download PDF

Info

Publication number
WO2014040470A1
WO2014040470A1 PCT/CN2013/081539 CN2013081539W WO2014040470A1 WO 2014040470 A1 WO2014040470 A1 WO 2014040470A1 CN 2013081539 W CN2013081539 W CN 2013081539W WO 2014040470 A1 WO2014040470 A1 WO 2014040470A1
Authority
WO
WIPO (PCT)
Prior art keywords
alarm
message
delay
recovery
received
Prior art date
Application number
PCT/CN2013/081539
Other languages
English (en)
Chinese (zh)
Inventor
胡锡文
李晶莹
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2014040470A1 publication Critical patent/WO2014040470A1/fr

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0604Management of faults, events, alarms or notifications using filtering, e.g. reduction of information by using priority, element types, position or time
    • H04L41/0622Management of faults, events, alarms or notifications using filtering, e.g. reduction of information by using priority, element types, position or time based on time

Definitions

  • the present invention relates to the field of communications, and in particular to a method and apparatus for processing an alert message.
  • fault management of equipment is an important component of system maintainability.
  • the device When the system equipment fails or an abnormality occurs, the device will report the alarm information, which is convenient for the maintenance personnel to locate the problem in time to solve the problem.
  • the alarm information is generated from the network element, and the network element receives the alarm message.
  • the corresponding element management system (Element Management System, EMS for short) is reported by the network element, and the maintenance personnel obtains the alarm information of the network element from the EMS. Locate and troubleshoot as quickly as possible. At present, the alarm information is reported to the EMS in the telecommunication system.
  • a method for processing an alarm message including: determining whether an alarm recovery message corresponding to an alarm message to be reported is received within a predetermined time period; The alarm message is reported to the alarm recovery message.
  • the determining, by the network element that generates the alarm message, whether the alarm attribute of the alarm message is set is an alarm delay; if yes, And setting the alarm delay to the predetermined time period, and determining whether an alarm recovery message is received within the predetermined time period.
  • determining whether the alarm recovery message is received within a predetermined time period comprises: in the alarm delay, the timer of the network element periodically detects whether the alarm recovery message is received; if not, the The alarm delay is subtracted from the time required for the timer to trigger a detection to obtain a new alarm delay; the predetermined time period is updated to the new alarm delay, and the determination is continued within the updated predetermined time period. The alarm recovery message is received.
  • the method further includes: when the alarm recovery message is received before the alarm delay of the alarm message is reduced to zero, the alarm message is cleared.
  • the method further includes: if the alarm attribute does not set an alarm delay, directly reporting the alarm message to the network management.
  • the method further includes: the network element processing the alarm message according to a priority of each of the alarm messages.
  • the method before determining whether the alarm recovery message is received in the predetermined time period, the method further includes: determining, by the network element, whether the number of times of reporting is greater than the first preset number of times and the number of times of recovery is greater than the second preset number of times The message; if yes, the alarm delay is set in the alarm attribute of the alarm message.
  • an apparatus for processing an alarm message including: a first determining module, configured to determine whether an alarm recovery message corresponding to an alarm message that needs to be reported is received within a predetermined time period; The module is configured to report the alarm message if the alarm recovery message is not received within the predetermined time period.
  • the first determining module includes: a determining unit, configured to determine whether an alarm delay is set in an alarm attribute of the alarm message; and a setting unit configured to set the alarm delay in an alarm attribute of the alarm message In the case of time, the alarm delay is set to the predetermined time period, and it is determined whether an alarm recovery message is received within the predetermined time period.
  • the device further includes: a second determining module, configured to determine whether there is an alarm message that the number of times of reporting is greater than the first preset number of times and the number of times of recovery is greater than the second preset number of times within a predetermined time; In the case that there is an alarm message whose number of times of reporting is greater than the first preset number of times and the number of times of recovery is greater than the second preset number of times, the alarm delay is set in the alarm attribute of the alarm message.
  • the embodiment of the present invention adopts the following method: It is determined whether a recovery message of an alarm message to be reported is received within a predetermined time period, and if the alarm recovery message is not received, the alarm message is reported.
  • an alarm message with frequent alarms and then resumed is processed, which solves the problem that a large number of alarms and alarm recovery messages are generated in a short period of time, which brings a large load to the system and occupies a large system. Resources, Therefore, the problem of the operating efficiency of the system is affected, thereby reducing the occupied system resources and improving the operating efficiency of the system.
  • FIG. 1 is a flowchart of a method for processing an alarm message according to an embodiment of the present invention
  • FIG. 2 is a flowchart of a method for processing an alarm message according to a preferred embodiment of the present invention
  • FIG. 1 is a flowchart of a method for processing an alarm message according to an embodiment of the present invention
  • FIG. 2 is a flowchart of a method for processing an alarm message according to a preferred embodiment of the present invention
  • FIG. 1 is a flowchart of a method for processing an alarm message according to an embodiment of the present invention
  • FIG. 2 is a flowchart of a method for processing an
  • FIG. 4 is a block diagram of a processing device for processing an alarm message according to an embodiment of the present invention
  • FIG. 5 is a first block diagram of a processing device for an alarm message according to an embodiment of the present invention
  • FIG. 6 is a structural block diagram of a processing apparatus for an alarm message according to an embodiment of the present invention.
  • the embodiment of the present invention provides an alarm according to the related art, because a large number of alarms and alarms are generated in a short period of time, which may cause a large load on the system, and occupy a large amount of system resources, thereby affecting the operating efficiency of the system.
  • the processing method of the message, the process of the method is as shown in FIG. 1 , and includes the step S102 to the step S104: Step S102, determining whether an alarm recovery message corresponding to the alarm message to be reported is received within a predetermined time period; Step S104, scheduling If the alarm recovery message is not received within the time range, the alarm message is reported.
  • the embodiment of the present invention adopts the following method: It is determined whether a recovery message of an alarm message to be reported is received within a predetermined time period, and if the alarm recovery message is not received, the alarm message is reported.
  • the alarm message with frequent alarms and then recovered is processed, and the short-term solution is solved.
  • a large number of alarms and alarm recovery messages will bring a lot of load to the system, occupying too much system resources, thus affecting the system's operating efficiency, thereby reducing the occupied system resources and improving the system's operating efficiency.
  • Different types of alarm messages may be generated due to different problems, for example, alarm messages generated by the device actually failing, and alarm messages generated due to high sensitivity.
  • the alarm message may be set differently, for example, an alarm delay is set for the frequently reported and restored alarm message, and no frequent reporting or Even if the alarm message is frequently reported but not frequently recovered, the alarm delay is not set.
  • An alarm delay is set for the alarm message that is frequently reported and restored.
  • the network element determines whether there is an alarm message that the number of times of reporting is greater than the first preset number of times and the number of times of recovery is greater than the second preset number of times. The message sets the alarm delay in the alarm attribute of the alarm message.
  • the first preset number of times and the second preset number of times may be the same or different, for example, the preset time is set to 10 minutes, and the first preset number of times and the second preset number of times are 20 times simultaneously, if the same If the alarm message is reported and restored more than 20 times in 10 minutes, set the alarm attribute of the alarm message and set its attribute to the attribute with alarm delay. This can be set by adding this option when there is no option for the alarm delay attribute, or by setting it to 0 and 1 when there is an attribute option for the alarm delay.
  • the process of determining whether the alarm recovery message corresponding to the alarm message to be reported is received in the predetermined time period may be as follows:
  • the network element generating the alarm message determines whether the alarm attribute of the alarm message is set with an alarm delay. If the alarm delay is set, the alarm delay is set to a predetermined time period, and it is determined whether an alarm recovery message is received within the predetermined time period.
  • the timer of the network element periodically detects whether an alarm recovery message is received. If no alarm recovery message is received, the alarm delay is subtracted from the time required for the timer to trigger a detection to obtain a new one.
  • Alarm delay update the predetermined time period to a new alarm delay, continue to determine whether an alarm recovery message is received within the updated scheduled time period, and continue to detect whether an alarm recovery message is received. If an alarm recovery message is received before the alarm delay of the alarm message is reduced to zero, the alarm message is cleared. If the alarm recovery message is not received when the alarm delay of the alarm message is reduced to zero, the alarm message is reported to the NMS. During the implementation, if the alarm attribute is not set with the alarm delay (or the status is not set), the alarm message does not belong to the alarm message that is frequently reported and restored, and the alarm message is directly reported to the NMS. During the process of the foregoing steps, the network element processes the alarm message according to the priority of each alarm message.
  • the A alarm message can be set to have a higher priority than the B alarm message.
  • the alarm message is processed, even if the B alarm message is generated before the A alarm message, The A alarm message is processed.
  • an alarm buffer pool can also be set to store the above alarm message.
  • multiple alarm messages may be processed according to the priority of the alarm message when processing multiple alarm messages.
  • the alarm message is specially processed by the foregoing method to effectively reduce the system load, and the operation and maintenance personnel also filter the alarm information for processing. The above embodiments will be described below in conjunction with the preferred embodiments.
  • the preferred embodiment provides a method for reporting an alarm message, which can effectively process the alarm information frequently reported in a short period of time, that is, the operation and maintenance personnel know that the system has the alarm, and the alarm is not generated in a short time. Excessive alarms.
  • the method for reporting the alarm message of the preferred embodiment includes the following steps:
  • the internal timer of the NE is enabled.
  • the timer is a loop timer, and the duration of the timer can be set as needed.
  • the timer message arrives, it is first determined whether the alarm delay of each alarm in the network element is 0. If the alarm is equal to 0, the alarm is reported to the EMS immediately. If the alarm is not 0, the alarm delay of the alarm is reduced by 1 time. The time it takes to trigger the timer. When the user cancels the alarm delay time of all alarms and sets the duration to 0, the timer is canceled.
  • Example 1 This embodiment provides a method for processing an alarm message. The flow is as shown in FIG. 2, and includes steps S202 to S216. Step S202: According to the alarm message of the EMS, analyze whether there is an alarm message that is frequently reported and restored in a short period of time.
  • step S204 if there is an alarm that is frequently reported and restored in a short period of time, the duration of most alarms is analyzed from the alarm information, and the duration of the alarm delay reporting may be set as needed.
  • Step S206 An abnormality is generated inside the network element, and a warning message is reported to the network element.
  • the alarm module is also set up in the network element, and the generated alarm message is reported to the alarm module inside the network element.
  • Step S208 determining whether an alarm delay is set in the alarm attribute of the alarm message.
  • the NE determines whether to report it to the EMS network management system based on the alarm attribute of the alarm message.
  • step S210 the startup timer detects whether an alarm recovery message is received within a preset delay duration. If yes, step S212 is performed, otherwise step S214 is performed. In step S212, the alarm message is discarded to the EMS. Step S214, the alarm message is reported to the EMS. Step S216: The generated alarm message is directly reported to the EMS. In the process of the step S208, when the alarm module of the network element receives the alarm reported by the other module, it is first determined whether the alarm is configured to report the alarm delay.
  • FIG. 3 is a flowchart of processing a timer message provided by this example. After receiving the reported alarm message, the timer is started first. The processing flow of the timer is described below.
  • the network element alarm module is separately set in the network element, which is part of the related art.
  • the alarm cache pool is also configured to store alarm messages that need to be reported.
  • the timer processing flow includes steps S302 to S320.
  • Step S302 the network element alarm module receives the timer message.
  • Step S304 determining whether the number of alarms in the current delay alarm buffer pool is 0. If yes, step S306 is performed; if not, step S308 is performed.
  • Step S306 the processing flow of the timer is ended.
  • Step S308 the first alarm message is taken out in priority order for processing.
  • Step S310 whether the delay time of the alarm delay of the received alarm message is 0. If yes, step S312 is performed; if no, step S314 is performed.
  • step S312 the alarm message is reported to the EMS network management device, and the alarm is cleared from the delay alarm.
  • Step S314 subtracting the duration of the alarm delay of the alarm message from the duration of triggering the timer.
  • Step S316 determining whether there is a next alarm message in the delayed alarm pool. If yes, step S318 is performed, otherwise step S320 is performed.
  • step S320 an alarm message is taken for processing. The processing of the next alarm message is started again from step S310. During the implementation, if the alarm recovery message arrives before the alarm delay time becomes 0, the alarm message is cleared in the alarm buffer pool and is not reported to the EMS.
  • the embodiment of the present invention further provides an apparatus for processing an alarm message, where the apparatus is used to implement the foregoing method, and the module in the apparatus may be implemented in a processor.
  • a processor includes a first determining module 10 and a reporting module. 20.
  • These modules may be implemented by software, for example, a software comprising a first decision module 10 and a report module 20, which software may also be stored in a computer readable medium.
  • the block diagram of the device is as shown in FIG. 4, and includes: a first determining module 10, configured to determine whether an alarm recovery message corresponding to an alarm message to be reported is received within a predetermined time period; the reporting module 20, and the first determining module If the alarm recovery message is not received within the predetermined time period, the alarm message is reported.
  • the first determining module 10 is further configured as shown in FIG. 5, and includes: a determining unit 102, configured to determine whether an alarm delay is set in an alarm attribute of the alarm message; and the setting unit 104 is coupled to the determining unit 102, and configured to In the case that the alarm attribute of the alarm message is set with the alarm delay, the alarm delay is set to a predetermined time period, and it is determined whether an alarm recovery message is received within the predetermined time period.
  • the processing device of the foregoing alarm message may also be as shown in FIG.
  • a second determining module 30, configured to determine whether the number of times of reporting is greater than the first preset number of times and the number of times of recovery is greater than the second preset number of times within a predetermined time
  • the setting module 40 is coupled to the second determining module 30 and the first determining module 10, and is configured to be configured to: when there is an alarm message that the number of times of reporting is greater than the first preset number of times and the number of times of recovery is greater than the second preset number of times, Set the alarm delay in the alarm attribute of the alarm message.
  • the first judging module 10 of the processing device of the foregoing alarm message may include the following unit: a unit configured to periodically detect whether an alarm recovery message is received within an alarm delay; and set to not receive an alarm recovery message, The alarm delay is subtracted from the time required for the timer to trigger a detection to obtain a new alarm delay; set to update the predetermined time period to a new alarm delay, and continue to determine whether to receive the updated time period.
  • the unit to the alarm recovery message may further include a unit that is set to clear the alarm message if the alarm recovery message is received before the alarm delay of the alarm message is reduced to zero. If it is determined that the alarm attribute of the above alarm message is not set to alarm delay, the alarm message is directly reported to the network management.
  • the processing of the alarm message processing device is used to effectively reduce the system load, and the operation and maintenance personnel also filter the alarm information for processing. From the above description, it can be seen that the following technical effects are achieved in the embodiment of the present invention:
  • the embodiment of the present invention adopts the following method: determining, in a predetermined time period, whether a recovery message of an alarm message to be reported is received, if not received The alarm message is reported to the alarm recovery message.
  • the alarm message that is frequently alarmed and then resumed is processed, which solves the problem that a large amount of alarms and alarm recovery messages are generated in a short period of time, which causes a large load on the system and occupies excessive system resources.
  • modules or steps of the embodiments of the present invention can be implemented by a general computing device, which can be concentrated on a single computing device or distributed in multiple computing devices. Alternatively, they may be implemented by program code executable by the computing device, such that they may be stored in the storage device by the computing device and, in some cases, may be different from The steps shown or described are performed sequentially, or they are separately fabricated into individual integrated circuit modules, or a plurality of modules or steps thereof are fabricated into a single integrated circuit module. Thus, the invention is not limited to any specific combination of hardware and software.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

L'invention concerne un procédé et un dispositif de traitement de message d'alarme. Le procédé consiste à : déterminer si un message de récupération d'alarme correspondant à un message d'alarme à rapporter est ou non reçu dans une période de temps préréglée ; et dans le cas dans lequel le message de récupération d'alarme n'est pas reçu dans la période de temps préréglée, rapporter le message d'alarme. La présente invention est appliquée pour traiter les messages d'alarme qui donnent l'alarme fréquemment puis récupérer cette alarme immédiatement, de façon à résoudre le problème selon lequel l'efficacité de fonctionnement d'un système est affectée en raison du fait qu'une charge importante est placée sur le système et trop de ressources de système sont occupées en raison du fait qu'un grand nombre d'alarmes et de messages de récupération d'alarme sont générés dans une courte période de temps, permettant ainsi de réduire les ressources de système occupées, et d'améliorer l'efficacité de fonctionnement du système.
PCT/CN2013/081539 2012-09-11 2013-08-15 Procédé et dispositif de traitement de message d'alarme WO2014040470A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201210334058.0 2012-09-11
CN201210334058.0A CN103684821A (zh) 2012-09-11 2012-09-11 告警消息的处理方法及装置

Publications (1)

Publication Number Publication Date
WO2014040470A1 true WO2014040470A1 (fr) 2014-03-20

Family

ID=50277592

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2013/081539 WO2014040470A1 (fr) 2012-09-11 2013-08-15 Procédé et dispositif de traitement de message d'alarme

Country Status (2)

Country Link
CN (1) CN103684821A (fr)
WO (1) WO2014040470A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105101243A (zh) * 2014-05-23 2015-11-25 中国移动通信集团四川有限公司 一种派发告警工单的方法、设备和系统

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106502673B (zh) * 2016-10-21 2019-08-06 中国民生银行股份有限公司 业务状态的显示方法和装置
CN112261597B (zh) * 2020-10-16 2021-09-21 国网安徽省电力有限公司阜阳供电公司 一种判断通道中断的短信阶梯告警方法

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004093359A1 (fr) * 2003-04-16 2004-10-28 Fujitsu Limited Procede et dispositif de gestion de reseau
CN1925427A (zh) * 2006-09-04 2007-03-07 华为技术有限公司 告警系统和告警方法
CN101009598A (zh) * 2007-01-08 2007-08-01 中兴通讯股份有限公司 告警同步方法
CN101022638A (zh) * 2007-03-12 2007-08-22 华为技术有限公司 一种告警上报方法和告警装置
CN101222725A (zh) * 2007-01-08 2008-07-16 中兴通讯股份有限公司 一种利用告警归并减少北向接口告警数量的方法

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101076174B (zh) * 2007-06-05 2010-09-29 中兴通讯股份有限公司 告警风暴的处理方法
CN101562826B (zh) * 2008-04-15 2012-04-18 中兴通讯股份有限公司 一种告警归并的方法

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004093359A1 (fr) * 2003-04-16 2004-10-28 Fujitsu Limited Procede et dispositif de gestion de reseau
CN1925427A (zh) * 2006-09-04 2007-03-07 华为技术有限公司 告警系统和告警方法
CN101009598A (zh) * 2007-01-08 2007-08-01 中兴通讯股份有限公司 告警同步方法
CN101222725A (zh) * 2007-01-08 2008-07-16 中兴通讯股份有限公司 一种利用告警归并减少北向接口告警数量的方法
CN101022638A (zh) * 2007-03-12 2007-08-22 华为技术有限公司 一种告警上报方法和告警装置

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105101243A (zh) * 2014-05-23 2015-11-25 中国移动通信集团四川有限公司 一种派发告警工单的方法、设备和系统

Also Published As

Publication number Publication date
CN103684821A (zh) 2014-03-26

Similar Documents

Publication Publication Date Title
CN107515796B (zh) 一种设备异常监控处理方法及装置
CN110830283B (zh) 故障检测方法、装置、设备和系统
EP2800024B1 (fr) Système et procédés permettant d'identifier des applications dans des réseaux mobiles
EP3142011A1 (fr) Procédé de récupération d'anomalie destiné à une machine virtuelle dans un environnement distribué
CN102404141B (zh) 一种告警抑制的方法及装置
US11050609B2 (en) Technique for reporting and processing alarm conditions occurring in a communication network
US9007200B2 (en) Process method and apparatus for preventing alarm jitter
CN106789445B (zh) 一种广电网络中网络设备的状态轮询方法和系统
CN101296135A (zh) 故障信息的处理方法和装置
CN103475696A (zh) 云计算集群服务器状态监控系统和方法
WO2016187979A1 (fr) Procédé et appareil d'émission pour message de détection bidirectionnelle avec réexpédition (bfd)
CN101989933A (zh) 一种故障检测的方法和系统
CN104243192B (zh) 故障处理方法及系统
CN111130821A (zh) 一种掉电告警的方法、处理方法及装置
CN111142801B (zh) 分布式存储系统网络亚健康检测方法及装置
WO2014040470A1 (fr) Procédé et dispositif de traitement de message d'alarme
CN103905271B (zh) 一种告警风暴抑制方法
CN110806924B (zh) 一种基于cpu占用率的网络处理方法及装置
WO2013071755A1 (fr) Procédé et appareil de mise en œuvre de l'auto-réparation de dispositifs de station de base
JP6421516B2 (ja) サーバ装置、冗長構成サーバシステム、情報引継プログラム及び情報引継方法
CN110224872B (zh) 一种通信方法、装置及存储介质
CN113612647B (zh) 一种告警处理方法及装置
CN107025148B (zh) 一种海量数据的处理方法和装置
JP2017521802A (ja) スーパーコンピュータ監視用の相関イベントのためのアーキテクチャ
CN104348676A (zh) 一种基于操作管理维护oam的链路检测方法及设备

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13837167

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 13837167

Country of ref document: EP

Kind code of ref document: A1