WO2013000282A1 - Alarm information reporting method, device, and system - Google Patents

Alarm information reporting method, device, and system

Info

Publication number
WO2013000282A1
Authority
WO
WIPO (PCT)
Prior art keywords
alarm information
managed device
management station
information
alarm
Prior art date
Application number
PCT/CN2012/070017
Other languages
English (en)
French (fr)
Inventor
骆庆开
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司
Publication of WO2013000282A1

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00: Arrangements for monitoring or testing data switching networks
    • H04L43/08: Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0823: Errors, e.g. transmission errors
    • H04L43/0829: Packet loss
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00: Arrangements for monitoring or testing data switching networks
    • H04L43/10: Active monitoring, e.g. heartbeat, ping or trace-route

Definitions

  • the present invention relates to the field of communications, and in particular to a method, device, and system for reporting alarm information.
  • SNMP: Simple Network Management Protocol
  • the passive reporting of alarm information by managed devices generally uses SNMP TRAP messages.
  • reliability includes two aspects: first, the integrity of the information; second, the order of multiple pieces of information.
  • the TRAP message of the Simple Network Management Protocol has no protocol-level mechanism that guarantees reliability. Because TRAP packets are carried over UDP (User Datagram Protocol), which is unreliable, packets may be lost while being transmitted in the network. In addition, because SNMP TRAP messages may be routed, the order in which they arrive at the destination may differ from the order in which the source sent them.
  • the receiving end (i.e., the management station, or the network management system) can only tell whether a message was lost when it receives the next alarm. If the next alarm arrives after a long time, it takes a long time to learn whether a message was lost.
  • a method for reporting alarm information is provided, including: a managed device receives a heartbeat check message from a management station, wherein the heartbeat check message carries number information indicating which alarm information the management station has currently received; the managed device determines, according to the number information and the numbers of the alarm information that the managed device has already sent, the numbers of the alarm information that needs to be retransmitted; and the managed device sends the alarm information corresponding to those numbers to the management station.
  • the method further includes: the managed device assigns a number to each piece of alarm information according to the sending order; the managed device carries the number in the alarm information, sends the alarm information to the management station, and caches the alarm information locally.
  • the number information includes: the maximum number of the alarm information that the management station has received. The managed device determining the numbers of the alarm information to be retransmitted, according to the number information and the numbers of the alarm information it has sent, includes: the managed device adds the alarm information whose numbers lie between the maximum number carried in the heartbeat check message and the highest number of the alarm information currently sent by the managed device to the set of alarm information to be checked for retransmission, and then determines whether each piece of alarm information in that set needs to be retransmitted.
  • the number information further includes: a number set of the alarm information that has not arrived at the management station. Before the managed device determines whether each piece of alarm information in the set to be checked for retransmission needs to be retransmitted, the method further includes: the managed device adds the alarm information corresponding to each number in the number set to the set of alarm information to be checked for retransmission.
  • when assigning a number to each piece of alarm information, the method further includes: determining whether the alarm information has preamble alarm information, and if so, using the number of the preamble alarm information as the preamble number of the alarm information and carrying the preamble number in the alarm information.
  • the method further includes: the management station compares the maximum number (M) of the alarm information it had received before receiving the current alarm information with the first number (S) of the currently received alarm information, wherein: if M+1 < S, the management station determines that the alarm information numbered M+1 to S-1 has not arrived, and adds the numbers M+1 to S-1 to the number set of alarm information that has not arrived.
  • the managed device determining whether each piece of alarm information in the set to be checked for retransmission needs to be retransmitted includes: the managed device acquires a first timestamp of the heartbeat check message; for each piece of alarm information in the set, the managed device acquires a second timestamp indicating the time at which the alarm information was sent, and determines whether the time difference between the second timestamp and the first timestamp is greater than a preset packet loss time threshold; if so, it determines that the alarm information needs to be retransmitted.
  • the method further includes: the managed device determines, according to the heartbeat check message, the alarm information that the management station has successfully received, and deletes from its cache the alarm information that the management station has successfully received.
  • an alarm information reporting device is provided on the managed device side, the device comprising: a first receiving module, configured to receive a heartbeat check message from the management station, wherein the heartbeat check message carries number information indicating which alarm information the management station has currently received; a first determining module, configured to determine the numbers of the alarm information to be retransmitted according to the number information and the numbers of the alarm information currently sent; and a first sending module, configured to send the alarm information corresponding to the numbers of the alarm information to be retransmitted to the management station.
  • the number information includes: the maximum number of the alarm information that the management station has currently received; the first determining module includes: an obtaining sub-module, configured to add the alarm information whose numbers lie between the maximum number carried in the heartbeat check message and the highest number of the alarm information currently sent by the managed device to the set of alarm information to be checked for retransmission; and a determining sub-module, configured to determine whether each piece of alarm information in that set needs to be retransmitted.
  • the number information further includes: a number set of the alarm information that has not arrived at the management station; the obtaining sub-module is further configured to add the alarm information corresponding to each number in the number set to the set of alarm information to be checked for retransmission.
  • the determining sub-module includes: a first acquiring unit, configured to acquire a first timestamp of the heartbeat check message; a second acquiring unit, configured to acquire a second timestamp indicating the time at which each piece of alarm information in the set was sent; and a determining unit, configured to determine, for each piece of alarm information in the set, whether the time difference between its second timestamp and the first timestamp is greater than a preset timeout threshold, and if so, to determine that the alarm information needs to be retransmitted.
  • an alarm information reporting system is provided, including: a management station and a managed device, wherein the managed device includes the alarm information reporting device described above, and the management station includes: a second receiving module, configured to receive the alarm information sent by the managed device, wherein the alarm information carries its number; a second determining module, configured to determine the number information indicating which alarm information the management station has currently received; and a second sending module, configured to send a heartbeat check message to the managed device, where the heartbeat check message carries the number information determined by the second determining module in the current heartbeat check message sending period.
  • the number information includes: the maximum number of the alarm information that the management station has received and a number set of the alarm information that has not arrived at the management station; the alarm information also carries the number of its preamble alarm information;
  • the alarm information corresponding to the first number S is added to the received alarm information; if not, the execution module is triggered to add the alarm information with the first number S to the received alarm information; if S < M, the execution module is triggered to add the alarm information with the first number S to the received alarm information; and the execution module is configured to perform the corresponding processing according to the comparison result of the comparison module and/or the retrieval result of the retrieval module.
  • the management station carries, in each heartbeat check message, the number information of the alarm information it has received in the current heartbeat period, so that the managed device can determine the numbers of the alarm information to be retransmitted according to the number information and then send the alarm information that needs to be retransmitted to the management station. This solves the problem of poor timeliness in detecting packet loss in the prior art: only one heartbeat period is needed to determine whether a packet was lost, and the managed device resends the lost alarm information to the management station promptly and automatically.
  • FIG. 2 is a schematic diagram of numbering and preamble numbering of TRAP protocol alarm information according to an embodiment of the present invention
  • FIG. 4 is a flowchart of processing performed by a management station on receiving new alarm information according to an embodiment of the present invention
  • FIG. 5 is a flowchart of processing of a received heartbeat check message by a managed device according to an embodiment of the present invention
  • FIG. 6 is a schematic structural diagram of an alarm information reporting apparatus according to an embodiment of the present invention
  • FIG. 7 is a schematic structural diagram of a first determining module of an alarm information reporting apparatus according to a preferred embodiment of the present invention
  • FIG. 8 is a schematic diagram of a first determining module according to a preferred embodiment of the present invention.
  • FIG. 9 is a schematic structural diagram of an alarm information reporting system according to an embodiment of the present invention.
  • FIG. 10 is a schematic structural diagram of a management station according to an embodiment of the present invention.
  • FIG. 11 is a schematic diagram of a management station according to an embodiment of the present invention;
  • Step S102: the managed device receives a heartbeat check message from the management station, where the heartbeat check message carries the number information indicating which alarm information the management station has received in the current heartbeat period.
  • the managed device may assign a number to each piece of alarm information according to the initial sending order.
  • the managed device may further determine whether the alarm information has preamble alarm information, and if so, use the number of the preamble alarm information as the preamble number of the alarm information; the preamble number is also carried in the alarm information and sent to the management station.
  • the number of the alarm information may be a continuous serial number with a step size of 1.
  • FIG. 2 is a schematic diagram of numbering and preamble numbering of TRAP protocol alarm information according to a preferred embodiment of the present invention.
  • the managed device may carry the number of the alarm information and the preamble number of the alarm information in the alarm information as number information and send it to the management station.
  • the managed device may also cache the alarm information locally when transmitting the alarm information to the management station.
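The numbering, preamble numbering, and local caching described above can be sketched as follows. This is only an illustrative sketch: the names `Alarm`, `AlarmSender`, `transport`, and `clock` are hypothetical and do not appear in the source, and the transport is modelled as a plain callable rather than a real SNMP TRAP sender.

```python
import time
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass
class Alarm:
    number: int               # consecutive serial number, step size 1
    preamble: Optional[int]   # number of the preamble alarm, if any
    payload: str
    sent_at: float            # send time, kept for the later loss check


class AlarmSender:
    """Hypothetical managed-device side: numbers, sends, and caches alarms."""

    def __init__(self, transport: Callable[[Alarm], None],
                 clock: Callable[[], float] = time.time):
        self._transport = transport   # assumption: any callable taking an Alarm
        self._clock = clock
        self._next_number = 1
        self.cache: Dict[int, Alarm] = {}

    def send(self, payload: str, preamble: Optional[int] = None) -> Alarm:
        alarm = Alarm(self._next_number, preamble, payload, self._clock())
        self._next_number += 1
        self.cache[alarm.number] = alarm   # keep a local copy for retransmission
        self._transport(alarm)
        return alarm
```

A caller would pass the number of an earlier alarm as `preamble` when the new alarm must be processed after it, for example `sender.send("link up", preamble=first.number)`.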
  • the management station may send a heartbeat check message to the managed device according to a preset heartbeat period. After receiving a new piece of alarm information sent by the managed device, the management station first parses the alarm information to obtain the number and the preamble number it contains, and derives the number information indicating which alarm information the management station has currently received. When the heartbeat check message is sent, it carries the number information that the management station obtained in the current heartbeat period.
  • the heartbeat check message may adopt the information structure shown in FIG. 3. In an actual application, once the management station obtains the number information, it carries the number information obtained in the current heartbeat period in the heartbeat check message currently to be sent, and sends that message to the managed device.
  • the number information includes, but is not limited to, a maximum number of the alarm information that has been received by the current management station, and a number set of the alarm information that has not arrived at the management station.
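As a rough illustration of the number information listed above, the heartbeat check message could be modelled as a small immutable record. The field names and the `build_heartbeat` helper are hypothetical; the actual layout is the one shown in FIG. 3, which is not reproduced here, and the timestamp field is an assumption based on the collection timestamp mentioned later in the description.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable


@dataclass(frozen=True)
class HeartbeatCheck:
    max_received: int            # M: maximum alarm number received so far
    unreached: FrozenSet[int]    # numbers of alarms that have not arrived
    timestamp: float             # time at which the station collected this information


def build_heartbeat(max_received: int, unreached: Iterable[int],
                    now: float) -> HeartbeatCheck:
    """Snapshot the station's receive state for the current heartbeat period."""
    return HeartbeatCheck(max_received, frozenset(unreached), now)
```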
  • Step S104: the managed device determines the numbers of the alarm information that needs to be retransmitted according to the number information and the numbers of the alarm information that the managed device has sent. In a preferred embodiment of the present invention, if the number information includes only the maximum number of the alarm information that the management station has received, the managed device adds the alarm information whose numbers lie between the maximum number carried in the heartbeat check message and the highest number of the alarm information that the managed device has sent to the set of alarm information to be checked for retransmission, and then determines whether each piece of alarm information in that set needs to be retransmitted.
  • if the number information also includes the number set of alarm information that has not arrived at the management station, the managed device also adds the alarm information corresponding to each number in the number set to the set of alarm information to be checked for retransmission.
  • the management station can obtain the number set of the alarm information that has not arrived at the management station in the following manner (in the case where the alarm information carries the preamble number): the management station compares the maximum number (M) of the alarm information it had received before receiving the current alarm information with the first number (S) of the currently received alarm information.
  • if M+1 < S, the alarm information numbered M+1 to S-1 has not arrived. The second number (H) of the preamble alarm information of the currently received alarm information is searched for in the set of unreached alarm information. If it is found, the second number (H) is associated with the first number (S), and after the alarm information corresponding to the second number (H) arrives, the alarm information corresponding to the second number (H) and the alarm information corresponding to the first number (S) are added to the received alarm information in the order of the second number (H) followed by the first number (S).
  • if it is not found, the alarm information with the first number (S) is added to the received alarm information; if S < M, the alarm information with the first number (S) is added to the received alarm information.
  • the management station can check whether the order in which the alarm information is received satisfies the required order.
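The gap detection and preamble-based ordering described above can be sketched roughly as follows. The class and attribute names are hypothetical, duplicates are simply ignored (an assumption not stated in the source), and a late alarm with S < M is delivered directly, as the text describes.

```python
from typing import Dict, List, Optional, Set


class ManagementStation:
    """Hypothetical receiver that tracks gaps and restores the sending order."""

    def __init__(self, max_received: int = 0):
        self.max_received = max_received          # M
        self.unreached: Set[int] = set()          # numbers known to be missing
        self.waiting: Dict[int, List[int]] = {}   # H -> numbers held back until H arrives
        self.delivered: List[int] = []            # numbers handed on for processing

    def receive(self, s: int, h: Optional[int] = None) -> None:
        if s > self.max_received:
            # Numbers M+1 .. S-1 were skipped, so record them as unreached.
            self.unreached.update(range(self.max_received + 1, s))
            self.max_received = s
            if h is not None and h in self.unreached:
                # The preamble alarm is still missing: hold S until H arrives.
                self.waiting.setdefault(h, []).append(s)
                return
            self._deliver(s)
        elif s in self.unreached:
            # A late or retransmitted alarm fills a gap.
            self.unreached.discard(s)
            self._deliver(s)
        # Otherwise the alarm is a duplicate and is ignored (assumption).

    def _deliver(self, s: int) -> None:
        self.delivered.append(s)
        # Release any alarms that were waiting for S as their preamble.
        for held in self.waiting.pop(s, []):
            self._deliver(held)
```

In this sketch, receiving alarms 1, 2, then 4 (with preamble 3) leaves 3 in `unreached` and holds 4 back; once 3 arrives, both are delivered in the order 3, 4.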
  • after receiving the heartbeat check message, the managed device can extract the maximum number M of the alarm information that has arrived at the management station, record the number C of the alarm information it has most recently sent, compute the numbers in the interval (M, C], and merge them with the number set of alarm information that has not arrived at the management station. This yields a new number set of alarm information that has not arrived at the management station (including the numbers of alarm information that has been lost).
  • this queue of unreached alarm numbers may be denoted Q and is used to determine the numbers of the alarm information that needs to be resent to the management station.
  • the managed device may denote the local alarm information cache queue (that is, the alarm information that has been sent to the management station) as Q(1). After obtaining the new unreached alarm number queue Q (i.e., the set of alarm information to be checked for retransmission), the managed device takes the alarm information whose numbers are in Q out of Q(1) to obtain a new local alarm information queue Q(2), which is the set of alarm information that the managed device further checks for retransmission.
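The construction of Q and Q(2) can be illustrated with two small set operations. The function names are hypothetical, and the cache Q(1) is modelled as a dictionary keyed by alarm number, which is an assumption about the data structure rather than something the source specifies.

```python
from typing import Dict, Iterable, Set


def unreached_numbers(m: int, c: int, reported: Iterable[int]) -> Set[int]:
    """Q: the numbers in (M, C] merged with the set reported by the management station."""
    return set(range(m + 1, c + 1)) | set(reported)


def retransmission_candidates(cache: Dict[int, str], q: Set[int]) -> Dict[int, str]:
    """Q(2): cached alarms from Q(1) whose numbers appear in Q, in ascending order."""
    return {n: cache[n] for n in sorted(q) if n in cache}
```

For example, with M = 5, C = 7, and a reported unreached set of {2, 4}, Q is {2, 4, 6, 7}, and Q(2) contains the cached alarms with exactly those numbers.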
  • when the heartbeat check message arrives, the managed device can obtain the first timestamp of the heartbeat check message (this timestamp can be taken as the management station's time for the retransmission decision on each piece of alarm information in the set).
  • for each piece of alarm information in the set to be checked for retransmission, the managed device may obtain a second timestamp indicating the time at which the alarm information was sent, and then determine whether the time difference between the second timestamp and the first timestamp is greater than a preset packet loss time threshold; if so, it determines that the alarm information needs to be retransmitted.
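The threshold check can be sketched as below. The sketch assumes that the two timestamps are comparable (for example, that the clocks of the management station and the managed device are synchronized) and computes the difference as heartbeat time minus send time; both points are assumptions, and the function names are hypothetical.

```python
from typing import Dict, List


def needs_retransmission(sent_at: float, heartbeat_ts: float, threshold: float) -> bool:
    """True when the alarm was sent more than `threshold` before the heartbeat snapshot."""
    return heartbeat_ts - sent_at > threshold


def select_lost(candidates: Dict[int, float], heartbeat_ts: float,
                threshold: float) -> List[int]:
    """Return the candidate numbers (number -> send time) judged lost, in ascending order."""
    return sorted(n for n, sent_at in candidates.items()
                  if needs_retransmission(sent_at, heartbeat_ts, threshold))
```

An alarm sent shortly before the heartbeat snapshot may still be in flight, so it falls under the threshold and is not retransmitted yet; it is checked again on a later heartbeat.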
  • the managed device sends the alarm information corresponding to the number of the alarm information that needs to be retransmitted to the management station.
  • the managed device may further determine, according to the heartbeat check message, the alarm information that has been successfully received by the management station, and delete from its local cache the alarm information that has been successfully received by the management station.
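A minimal sketch of this cache clean-up follows, assuming that an alarm counts as successfully received when its number is at most the maximum number M in the heartbeat check message and is not in the reported unreached set. The function name and the dictionary-based cache are hypothetical.

```python
from typing import Dict, List, Set


def prune_cache(cache: Dict[int, str], max_received: int,
                reported_unreached: Set[int]) -> List[int]:
    """Delete acknowledged alarms from the cache in place and return their numbers."""
    acknowledged = [n for n in sorted(cache)
                    if n <= max_received and n not in reported_unreached]
    for n in acknowledged:
        del cache[n]
    return acknowledged
```

Alarms numbered above M stay in the cache because the management station has not yet reported on them.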
  • with the foregoing method provided by the embodiment of the present invention, lost alarm information can be detected in time, so that it is resent to the management station promptly. Even when the management station restarts, the alarm information sent by the managed device can still be detected, which ensures that alarm information is not lost.
  • FIG. 4 is a flowchart of processing a new alarm information received by a management station in a preferred embodiment of the present invention. As shown in FIG. 4, the process mainly includes the following steps: Step 1. When the system is started, the network management (management station) initializes the maximum alarm number M that has been received, and initializes the queue of alarm information that has not arrived at the management station. At this time, the queue is empty. In the embodiment of the present invention, the initialization policy may synchronize the maximum alarm number M and the current alarm of the managed device with the management station during the establishment of the communication link.
  • Step 2 Receive an alarm message.
  • Step 3 The number (S) and the preamble number (H) of the bound (configuration) alarm are parsed from the alarm information of the TRAP protocol.
  • Step 4 Compare the sizes of S and M. If S is greater than M, proceed to step 9; if S is less than M, proceed to step 5.
  • Step 5 Put the received new alarm into the processing queue and wait for the processing thread to process.
  • Step 6 In the unreached queue, check whether S has an associated alarm. If there is an associated alarm, proceed to step 7. Otherwise, proceed to step 8.
  • Step 7. Put the associated alarm of S into the processing queue and wait for the processing thread to process.
  • Step 8. Remove S from the unreached queue.
  • Step 9. Determine whether S is equal to M+1. If they are equal, proceed to step 11. Otherwise, continue with step 10.
  • Step 10. Construct empty-shell (placeholder) alarm information using each integer in the open interval (M, S) as its number, and place it in the unreached queue.
  • Step 11. Determine whether the alarm numbered H is in the unreached queue. If it is in the queue, proceed to step 12; otherwise, go to step 14.
  • Step 12. Set the newly received alarm information as the associated alarm of the alarm numbered H in the queue, and wait for the alarm numbered H to arrive.
  • FIG. 5 is a flowchart of processing a received heartbeat check message by a managed device according to an embodiment of the present invention. As shown in FIG. 5, the process mainly includes the following steps: Steps 1, 2, and 3: when the link with the management station is established, the communication link is set up, the alarm information data is synchronized, and the maximum alarm number is synchronized. Step 4: after the link is successfully established, the heartbeat check message sent by the management station is received.
  • Step 5. Parse the data carried in the heartbeat check message, including the maximum alarm number (M) that has arrived, the sequence of unreached alarm information detected by the management station (denoted [N1, ..., Nk]), and the timestamp (Tm) at which the management station collected this information; also obtain the current alarm number (C) from the managed device.
  • Step 6 Compare the sizes of C and M. If C is greater than M, continue to step 7. Otherwise, perform the steps.
  • Step 7. Add the integer values in the interval (M, C] to the unreached sequence, updating the unreached sequence to [N1, ..., Nk, ..., Nc].
  • Step 8. Delete from the cache of sent alarm information the part that the management station has already received; that is, delete a piece of alarm information if its number is not in the unreached sequence.
  • Step 9. Scan the cache queue of sent alarm information and calculate the difference between the current time (Tc) and the management station's heartbeat information collection time (Tm). If the value is greater than the timeout threshold (that is, the packet loss time threshold), the alarm information is determined to be lost and is resent. In this way, by having the management station send the heartbeat check message to the managed device, the alarm information sending end learns in time whether packet loss has occurred, and the problem of poor timeliness is solved.
  • the present invention provides an alarm information reporting device, which is located on the side of the managed device.
  • FIG. 6 is a schematic structural diagram of an alarm information reporting device according to an embodiment of the present invention. As shown in FIG. 6, the device includes: a first receiving module 10, a first determining module 20, and a first sending module 30.
  • the first receiving module 10 is configured to receive a heartbeat check message from the management station, where the heartbeat check message carries number information indicating which alarm information the management station has received in the current heartbeat period; the first determining module 20 is connected to the first receiving module 10 and is configured to determine the numbers of the alarm information to be retransmitted according to the number information and the numbers of the alarm information currently sent.
  • the first sending module 30 is connected to the first determining module 20 and is configured to send the alarm information corresponding to the numbers of the alarm information to be retransmitted to the management station.
  • the number information includes: the maximum number of the alarm information that the management station has currently received. As shown in the corresponding figure, the first determining module 20 may include: an obtaining sub-module 22, configured to add the alarm information whose numbers lie between the maximum number carried in the heartbeat check message and the highest number of the alarm information currently sent by the managed device to the set of alarm information to be checked for retransmission; and a determining sub-module 24, connected to the obtaining sub-module 22 and configured to determine whether each piece of alarm information in that set needs to be retransmitted.
  • the management station can conveniently send an indication to the managed device, and the processing flow is relatively simple.
  • the number information may further include: a number set of the alarm information that has not arrived at the management station; the obtaining sub-module 22 is further configured to add the alarm information corresponding to each number in the number set to the set of alarm information to be checked for retransmission.
  • the managed device can determine the alarm information that needs to be retransmitted more accurately.
  • the determining sub-module 24 may include: a first acquiring unit 242, configured to acquire a first timestamp of the heartbeat check message; a second acquiring unit 244, configured to acquire a second timestamp indicating the time at which each piece of alarm information in the set to be checked for retransmission was sent; and a determining unit 246, connected to the first acquiring unit 242 and the second acquiring unit 244 and configured to determine, for each piece of alarm information in the set, whether the time difference between its second timestamp and the first timestamp is greater than a preset timeout threshold, and if so, to determine that the alarm information needs to be retransmitted.
  • the managed device can obtain the lost alarm information in a timely manner and perform retransmission in time, thereby improving the reliability of the alarm information transmission.
  • FIG. 9 is a schematic structural diagram of an alarm information reporting system according to an embodiment of the present invention. As shown in FIG. 9, the system includes: a management station 1 and a managed device 2.
  • the managed device 2 may include the alarm information reporting device in the above embodiment.
  • the management station 1 may include: a second receiving module 40, a second determining module 50, and a second transmitting module 60.
  • the second receiving module 40 is configured to receive the alarm information sent by the managed device, where the alarm information carries its number. The second determining module 50 is connected to the second receiving module 40 and is configured to determine the number information indicating which alarm information the management station has currently received. The second sending module 60 is connected to the second determining module 50 and is configured to send a heartbeat check message to the managed device, where the heartbeat check message carries the number information determined by the second determining module in the current heartbeat period.
  • the number information may include: a maximum number of the alarm information that the management station has received currently.
  • the number information may further include: a number set of the alarm information that has not arrived at the management station; and the alarm information further carries the number of its preamble alarm information.
  • as shown in FIG. 11, in the preferred embodiment, the second determining module 50 may include: a comparing module 52, configured to compare the maximum number M of the alarm information that had been received before the current alarm information with the first number S of the currently received alarm information.
  • the trigger execution module 54 will be M+1 to S-1.
  • a retrieval module 56, configured to search the number set for the second number H of the preamble alarm information of the alarm information; if it is found, the execution module 54 is triggered to associate the second number H with the first number S and, after the alarm information corresponding to the second number H arrives, to add the alarm information corresponding to the second number H and the alarm information corresponding to the first number S to the received alarm information in the order of the second number H followed by the first number S.
  • if it is not found, the execution module 54 is triggered to add the alarm information with the first number S to the received alarm information; if S < M, the execution module 54 is triggered to add the alarm information with the first number S to the received alarm information; the execution module 54 is configured to perform the corresponding processing according to the comparison result of the comparing module 52 and/or the retrieval result of the retrieval module 56.
  • in the alarm information reporting system provided by the foregoing embodiment of the invention, the management station sends a heartbeat check message to the managed device, so that the managed device (the alarm information sending end) can learn in time whether packet loss has occurred, which solves the problem of poor timeliness.
  • the order of the alarm information sent by the managed device is kept consistent with the order of the alarm information received by the management station, so that packet loss is discovered promptly and lost packets are retransmitted actively, while unnecessary retransmission is avoided. From the above description, it can be seen that the present invention achieves the following technical effects:
  • the management station carries the number information of the received alarm information in the heartbeat check message and sends it to the managed device (the alarm information sending end). The managed device determines the numbers of the alarm information to be retransmitted according to the number information and then sends the alarm information that needs to be resent to the management station, so that the managed device can learn in time whether packet loss has occurred, which solves the problem of poor timeliness in detecting packet loss in the prior art.
  • at the same time, by searching the unreached alarm information queue for the preamble alarm information of the currently received alarm information, the order of the alarm information sent by the managed device is kept consistent with the order of the alarm information received by the management station. As a result, only one heartbeat period is needed to determine whether a packet was lost, and the managed device resends the lost alarm information to the management station promptly and automatically.
  • Obviously, those skilled in the art should understand that the modules or steps of the present invention described above can be implemented by general-purpose computing devices; they can be concentrated on a single computing device or distributed over a network composed of multiple computing devices.
  • Optionally, they can be implemented as program code executable by a computing device, so that they can be stored in a storage device and executed by the computing device; in some cases the steps shown or described can be performed in an order different from the one given here. Alternatively, they can be fabricated as individual integrated circuit modules, or several of the modules or steps can be fabricated as a single integrated circuit module.
  • Thus, the invention is not limited to any specific combination of hardware and software.
  • The above are only preferred embodiments of the present invention and are not intended to limit it; for those skilled in the art, the present invention may have various modifications and changes. Any modification, equivalent substitution, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.
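
The comparison, retrieval, and execution behaviour described earlier in this list can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the class and field names (`Alarm`, `ManagementStation`, `held`, `processed`) are hypothetical, M is updated for every alarm with S > M, and an alarm whose number is neither new nor in the not-arrived set is treated as a duplicate and ignored, which the source does not specify.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Alarm:
    number: int                      # S: number assigned by the managed device
    preceding: Optional[int] = None  # H: number of the preceding alarm, if any
    text: str = ""


@dataclass
class ManagementStation:
    max_number: int = 0                            # M: largest number received so far
    not_arrived: set = field(default_factory=set)  # numbers known to be missing
    held: dict = field(default_factory=dict)       # H -> alarms waiting for H to arrive
    processed: list = field(default_factory=list)  # alarms released for processing

    def receive(self, alarm: Alarm) -> None:
        s, h, m = alarm.number, alarm.preceding, self.max_number
        if s > m:
            # Every number strictly between M and S has not arrived yet
            # (the range is empty when S == M + 1).
            self.not_arrived.update(range(m + 1, s))
            if h is not None and h in self.not_arrived:
                # The preceding alarm is still missing: hold S until H arrives.
                self.held.setdefault(h, []).append(alarm)
            else:
                self.processed.append(alarm)
            self.max_number = s
        elif s in self.not_arrived:
            # Late or retransmitted alarm: process it, then release the
            # alarms that were associated with it, H first and S second.
            self.not_arrived.discard(s)
            self.processed.append(alarm)
            self.processed.extend(self.held.pop(s, []))
        # Assumption: anything else is a duplicate and is silently ignored.

    def heartbeat_info(self):
        """Number information carried in the next heartbeat check message."""
        return self.max_number, sorted(self.not_arrived)
```

Feeding the station alarms 1, 2, 4 (with preceding number 3), 5, and then a retransmitted 3 releases them in the order 1, 2, 5, 3, 4, so alarm 4 is never processed before the alarm it depends on.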

Abstract

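The present invention discloses a method, device, and system for reporting alarm information. The method includes: a managed device receives a heartbeat check message from a management station, where the heartbeat check message carries number information indicating the management station's current reception of alarm information; the managed device determines the numbers of the alarm information to be retransmitted according to the number information and the numbers of the alarm information that the managed device has currently sent; and the managed device sends the alarm information corresponding to those numbers to the management station. With the present invention, lost alarm information can be resent promptly once packet loss is discovered, and the order of the alarm information received by the management station can be kept consistent with the order in which the managed device sent it.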
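
As a rough illustration of how the managed device might turn the number information into a list of alarm numbers to examine, the following Python sketch combines the interval above the management station's maximum received number with any numbers the station reports as not arrived. The function and parameter names are hypothetical and chosen only for this example.

```python
def retransmission_candidates(max_received, max_sent, not_arrived=()):
    """Return the alarm numbers the managed device should examine.

    max_received: largest number the management station reports (M).
    max_sent:     largest number the managed device has sent (C).
    not_arrived:  numbers the management station reports as missing.
    """
    candidates = set(not_arrived)
    # Numbers in the half-open interval (M, C] were sent but are not yet
    # known to the management station.
    candidates.update(range(max_received + 1, max_sent + 1))
    return sorted(candidates)


# The station has seen up to 7 but is missing 4 and 6; the device has sent up to 10.
print(retransmission_candidates(7, 10, {4, 6}))  # [4, 6, 8, 9, 10]
```

Whether each candidate is actually resent is a separate decision; the sketch only builds the candidate list.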

Description

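Alarm Information Reporting Method, Device, and System

Technical Field

The present invention relates to the field of communications, and in particular to a method, device, and system for reporting alarm information.

Background

In communication network management, a management station and a managed device generally communicate using SNMP (Simple Network Management Protocol), and the managed device generally uses SNMP TRAP messages to actively report alarm information.

In a network management system, the information held by the management station and by the managed device must be consistent, so the alarm reporting of the managed device requires high reliability. Reliability has two aspects: first, the integrity of the information; second, the order of multiple pieces of information.

The TRAP message of SNMP has no protocol-level mechanism for guaranteeing reliability. Because TRAP packets are carried over UDP (User Datagram Protocol), which is unreliable, packets may be lost in transit. In addition, because SNMP TRAP messages may be forwarded through routers, the order in which they arrive at the destination may differ from the order in which the source sent them.

For these two problems, the existing solution is for the receiving end (the management station, also called the network manager) to judge from the alarm serial number whether a TRAP packet has been lost and, if so, to actively send a request message asking the managed device to retransmit. In the prior art, however, the receiving end learns that a message was lost only when the next alarm arrives. If the next alarm arrives long afterwards, it takes a long time to learn of the loss, and by then retransmitting the message may no longer be necessary. Timeliness is therefore poor.

Summary of the Invention

The present invention provides an alarm information reporting method, device, and system to solve at least one of the above problems.

According to one aspect of the present invention, an alarm information reporting method is provided, including: a managed device receives a heartbeat check message from a management station, where the heartbeat check message carries number information indicating the management station's current reception of alarm information; the managed device determines the numbers of the alarm information to be retransmitted according to the number information and the numbers of the alarm information that the managed device has currently sent; and the managed device sends the alarm information corresponding to the numbers to be retransmitted to the management station.

Preferably, before the managed device receives the heartbeat check message from the management station, the method includes: the managed device assigns a number to each piece of alarm information in sending order; the managed device carries the number of the alarm information in that alarm information, sends it to the management station, and caches the alarm information locally.

Preferably, the number information includes the maximum number of the alarm information that the management station has currently received. Determining the numbers of the alarm information to be retransmitted according to the number information and the numbers of the alarm information that the managed device has currently sent includes: the managed device adds the alarm information whose numbers lie between the maximum number carried in the heartbeat check message and the maximum number of the alarm information that the managed device has currently sent to a retransmission-candidate alarm set; the managed device judges whether each piece of alarm information in the retransmission-candidate alarm set needs to be retransmitted.

Preferably, the number information further includes the number set of alarm information that has not arrived at the management station. Before the managed device judges whether each piece of alarm information in the retransmission-candidate alarm set needs to be retransmitted, the method further includes: the managed device adds the alarm information corresponding to each number in the number set to the retransmission-candidate alarm set.

Preferably, when a number is assigned to each piece of alarm information, the method further includes: judging whether the alarm information has preceding alarm information; if so, using the number of the preceding alarm information as the preceding number of the alarm information and carrying the preceding number in the alarm information.

Preferably, after the managed device carries the number of the alarm information in that alarm information and sends it to the management station, the method further includes: the management station compares the maximum number (M) of the alarm information received before the current alarm information with the first number (S) of the currently received alarm information, where: if M+1 < S, the management station determines that the alarm information numbered M+1 to S-1 has not arrived and adds the numbers M+1 to S-1 to the number set of not-arrived alarm information; if M+1 = S, the management station searches the number set of not-arrived alarm information for the second number H of the preceding alarm information of the currently received alarm information; if it is found, the second number H is associated with the first number S and, after the alarm information corresponding to the second number H arrives, the alarm information corresponding to the second number (H) and the alarm information corresponding to the first number (S) are added to the received alarm information with the second number (H) first and the first number (S) second; if it is not found, the alarm information with the first number (S) is added to the received alarm information; if S < M, the alarm information with the first number (S) is added to the received alarm information.

Preferably, the managed device judging whether each piece of alarm information in the retransmission-candidate alarm set needs to be retransmitted includes: the managed device obtains a first timestamp of the heartbeat check message; for each piece of alarm information in the retransmission-candidate alarm set, the managed device obtains a second timestamp indicating when that alarm information was sent, and judges whether the time difference between the second timestamp and the first timestamp is greater than a preset packet-loss time threshold; if so, it determines that the alarm information needs to be retransmitted.

Preferably, after the managed device sends the not-arrived alarm information corresponding to the numbers of the not-arrived alarm information to the management station, the method further includes: the managed device determines, according to the heartbeat check message, the alarm information that the management station has already received successfully, and deletes that alarm information from the managed device's cache.

According to another aspect of the present invention, an alarm information reporting device located on the managed device side is provided. The device includes: a first receiving module, configured to receive a heartbeat check message from a management station, where the heartbeat check message carries number information indicating the management station's current reception of alarm information; a first determining module, configured to determine the numbers of the alarm information to be retransmitted according to the number information and the numbers of the alarm information currently sent; and a first sending module, configured to send the alarm information corresponding to the numbers to be retransmitted to the management station.

Preferably, the number information includes the maximum number of the alarm information that the management station has currently received. The first determining module includes: an acquisition submodule, configured to add the alarm information whose numbers lie between the maximum number carried in the heartbeat check message and the maximum number of the alarm information that the managed device has currently sent to the retransmission-candidate alarm set; and a judging submodule, configured to judge whether each piece of alarm information in the retransmission-candidate alarm set needs to be retransmitted.

Preferably, the number information further includes the number set of alarm information that has not arrived at the management station; the acquisition submodule is further configured to add the alarm information corresponding to each number in the number set to the retransmission-candidate alarm set.

Preferably, the judging submodule includes: a first acquisition unit, configured to obtain a first timestamp of the heartbeat check message; a second acquisition unit, configured to obtain a second timestamp of the sending time of each piece of alarm information in the retransmission-candidate alarm set; and a judging unit, configured to judge, for each piece of alarm information in the retransmission-candidate alarm set, whether the time difference between its second timestamp and the first timestamp is greater than a preset packet-loss time threshold and, if so, to determine that the alarm information needs to be retransmitted.

According to another aspect of the present invention, an alarm information reporting system is provided, including a management station and a managed device, where the managed device includes the alarm information reporting device described above and the management station includes: a second receiving module, configured to receive alarm information sent by the managed device, where the alarm information carries its number; a second determining module, configured to determine number information indicating the management station's current reception of alarm information; and a second sending module, configured to send a heartbeat check message to the managed device, where the heartbeat check message carries the number information determined by the second determining module in the current heartbeat check message sending period.

Preferably, the number information includes the maximum number of the alarm information that the management station has currently received and the number set of alarm information that has not arrived at the management station; the alarm information also carries the number of its preceding alarm information. The second determining module includes: a comparison module, configured to compare the maximum number M of the alarm information received before the current alarm information with the first number S of the currently received alarm information; if M+1 < S, it determines that the alarm information numbered M+1 to S-1 has not arrived and triggers the execution module to add the numbers M+1 to S-1 to the number set of not-arrived alarm information; if M+1 = S, it triggers the retrieval module; a retrieval module, configured to search the number set of not-arrived alarm information for the second number H of the preceding alarm information of the currently received alarm information; if it is found, the retrieval module triggers the execution module to associate the second number H with the first number S and, after the alarm information corresponding to the second number H arrives, to add the alarm information corresponding to the second number H and the alarm information corresponding to the first number S to the received alarm information with the second number H first and the first number S second; if it is not found, it triggers the execution module to add the alarm information with the first number S to the received alarm information; if S < M, the execution module is triggered to add the alarm information with the first number S to the received alarm information; and an execution module, configured to perform the corresponding processing according to the comparison result of the comparison module and/or the retrieval result of the retrieval module.

Through the present invention, the management station carries in each heartbeat check message the number information of the alarm information received in the current heartbeat period, so that the managed device can determine from it the numbers of the alarm information to be retransmitted and then send that alarm information to the management station. This solves the prior-art problem of poor timeliness in determining packet loss: only one heartbeat period is needed to determine whether packet loss has occurred, and the managed device resends the lost alarm information to the management station promptly and automatically.

Brief Description of the Drawings

The drawings described here provide a further understanding of the present invention and form part of this application. The illustrative embodiments of the present invention and their descriptions are used to explain the present invention and do not unduly limit it. In the drawings:

Figure 1 is a flowchart of the alarm information reporting method according to an embodiment of the present invention;
Figure 2 is a schematic diagram of configuring the number and the preceding number of TRAP-protocol alarm information according to an embodiment of the present invention;
Figure 3 is a schematic diagram of the structure of the information carried when the management station sends a heartbeat message according to an embodiment of the present invention;
Figure 4 is a flowchart of the management station's processing of a newly received alarm message according to an embodiment of the present invention;
Figure 5 is a flowchart of the managed device's processing of a received heartbeat check message according to an embodiment of the present invention;
Figure 6 is a schematic structural diagram of the alarm information reporting device according to an embodiment of the present invention;
Figure 7 is a schematic structural diagram of the first determining module of the alarm information reporting device according to a preferred embodiment of the present invention;
Figure 8 is a schematic structural diagram of the judging submodule of the first determining module according to a preferred embodiment of the present invention;
Figure 9 is a schematic structural diagram of the alarm information reporting system according to an embodiment of the present invention;
Figure 10 is a schematic structural diagram of the management station according to an embodiment of the present invention;
Figure 11 is a schematic structural diagram of the second determining module of the management station according to a preferred embodiment of the present invention.

Detailed Description

The present invention is described in detail below with reference to the drawings and in conjunction with the embodiments. It should be noted that, where no conflict arises, the embodiments in this application and the features in the embodiments may be combined with one another.

Figure 1 is a flowchart of the alarm information reporting method according to an embodiment of the present invention. As shown in Figure 1, the method mainly includes the following steps (step S102 to step S106):

Step S102: the managed device receives a heartbeat check message from the management station, where the heartbeat check message carries number information indicating the management station's reception of alarm information in the current heartbeat period.

In this embodiment, on the managed device side, before sending alarm information the managed device may assign a number to each piece of alarm information according to the initial sending order.

In a preferred implementation of this embodiment, when assigning a number to each piece of alarm information, the managed device may also judge whether the alarm information has preceding alarm information; if so, it uses the number of the preceding alarm information as the preceding number of the alarm information and also carries the preceding number in the alarm information sent to the management station. For example, the number of the alarm information may be a continuous serial number with a step of 1. For example, Figure 2 is a schematic diagram of configuring the number and the preceding number of TRAP-protocol alarm information according to a preferred embodiment of the present invention; as shown in Figure 2, the managed device may carry both the number of the alarm information and its preceding number, as number information, in the alarm information sent to the management station. Preferably, to facilitate later retransmission, the managed device may also cache the alarm information locally when sending it to the management station.

On the management station side, the management station may send heartbeat check messages to the managed device according to a preset heartbeat period. Each time it receives a new piece of alarm information from the managed device, it first parses the alarm information to obtain the number and the preceding number carried in it, and thereby obtains the number information indicating its current reception of alarm information. When sending a heartbeat check message, it carries in the message the number information obtained in the current heartbeat period.

For example, the heartbeat check message may use the information structure shown in Figure 3. In practice, once the management station has obtained the number information, it carries the number information obtained in the current heartbeat period in the heartbeat check message currently to be sent to the managed device. The number information includes, but is not limited to: the maximum number of the alarm information that the management station has received so far, and the number set of alarm information that has not arrived at the management station.

Step S104: the managed device determines the numbers of the alarm information to be retransmitted according to the number information and the numbers of the alarm information that the managed device has currently sent.

In a preferred embodiment of the present invention, if the number information includes only the maximum number of the alarm information that the management station has currently received, the managed device adds the alarm information whose numbers lie between the maximum number carried in the heartbeat check message and the maximum number of the alarm information that the managed device has currently sent to the retransmission-candidate alarm set, and then judges whether each piece of alarm information in the retransmission-candidate alarm set needs to be retransmitted.

In another preferred embodiment, if the number information also includes the number set of alarm information that has not arrived at the management station, the managed device also adds the alarm information corresponding to each number in that set to the retransmission-candidate alarm set.

In a preferred implementation of this embodiment, the management station may obtain the number set of alarm information that has not arrived at the management station as follows (where the alarm information carries a preceding number): the management station compares the maximum number (denoted M) of the alarm information received before the current alarm information with the first number (denoted S) of the currently received alarm information, where: if M+1 < S, the management station determines that the alarm information numbered M+1 to S-1 has not arrived and adds the numbers M+1 to S-1 to the number set of not-arrived alarm information; if M+1 = S, it searches the number set of not-arrived alarm information for the second number H of the preceding alarm information of the currently received alarm information; if it is found, the second number H is associated with the first number S and, after the alarm information corresponding to the second number H arrives, the alarm information corresponding to the second number (H) and the alarm information corresponding to the first number (S) are added to the received alarm information with the second number (H) first and the first number (S) second; if it is not found, the alarm information with the first number (S) is added to the received alarm information; if S < M, the alarm information with the first number (S) is added to the received alarm information. With this implementation, the management station can check whether the order in which alarm information is received meets the required order.

For example, in practice, after receiving the heartbeat check message the managed device may extract from it the maximum serial number M of the alarm information that has reached the management station and, at the same time, record the serial number C of the alarm information it has most recently sent. It computes the serial numbers in the interval (M, C] and merges them with the number set of alarm information that has not arrived at the management station, obtaining a new number set of not-arrived alarm information (which includes the numbers of lost alarm information); for example, this queue of not-arrived alarm numbers may be labeled Q. The numbers of the alarm information that must be resent to the management station are determined from it.

For example, the managed device may label its local alarm information buffer queue (the alarm information already sent to the management station) Q(1). After obtaining the new not-arrived alarm number queue Q (that is, the retransmission-candidate alarm set mentioned above), it takes from Q(1) the alarm information whose numbers appear in Q, obtaining a new local alarm information buffer queue Q(2) (Q(2) is the retransmission-candidate alarm set that the managed device must examine further).

In a preferred implementation of the present invention, when the heartbeat check message arrives, the managed device may obtain a first timestamp of the heartbeat check message (this timestamp may be taken as the time at which each piece of alarm information in the retransmission-candidate alarm set would have arrived at the management station). For each piece of alarm information in the retransmission-candidate alarm set, the managed device may obtain a second timestamp indicating when that alarm information was sent, and then judge whether the time difference between the second timestamp and the first timestamp is greater than a preset packet-loss time threshold; if so, it determines that the alarm information needs to be retransmitted.

For example, the managed device records the timestamp at which the heartbeat check message was taken out as T(h), takes each element out of Q(2) in turn, records the timestamp of that alarm as T(a), and computes t = T(h) - T(a). It compares t with the preset packet-loss time threshold (denoted G): if t > G or t = G, the alarm information in Q(2) is added to the retransmission queue and deleted from the local cache; if t < G, the device continues to wait.

Step S106: the managed device sends the alarm information corresponding to the numbers to be retransmitted to the management station.

In this embodiment, after the managed device has sent the alarm information to be retransmitted to the management station, it may also determine, according to the heartbeat check message, the alarm information that the management station has already received successfully, and delete that alarm information from its local cache.

With the method provided by this embodiment, loss of alarm information can be detected promptly, so that lost alarm information is promptly resent to the management station. Moreover, alarm information sent by the managed device while the management station is restarting can also be detected, which ensures that such alarm information is not lost. In addition, in this embodiment the management station can check whether the order of the received alarm information is consistent with the sending order at the device end.

Figure 4 is a flowchart of the management station's processing of newly received alarm information in a preferred embodiment of the present invention. As shown in Figure 4, the flow mainly includes the following steps:

Step 1: when the network management (management station) system starts, initialize the maximum alarm number M received so far and initialize the queue of alarm information that has not arrived at the management station; at this point the queue is empty. In this embodiment, the initialization policy may be to synchronize the maximum alarm number M and the current alarms between the managed device and the management station while the communication link is being established; after synchronization completes, the queue is initialized to empty.
Step 2: receive alarm information.
Step 3: parse the bound (configured) alarm number (S) and preceding number (H) from the TRAP-protocol alarm information.
Step 4: compare S with M. If S is greater than M, go to step 9; if S is less than M, go to step 5.
Step 5: put the newly received alarm into the processing queue to await the processing thread.
Step 6: check in the not-arrived queue whether S has associated alarms. If it has, go to step 7; otherwise go to step 8.
Step 7: put the alarms associated with S into the processing queue to await the processing thread.
Step 8: delete S from the not-arrived queue.
Step 9: judge whether S equals M+1. If so, go to step 11; otherwise go to step 10.
Step 10: construct an empty placeholder alarm for each integer in the open interval (M, S), using that integer as its number, and put the placeholders into the not-arrived queue.
Step 11: judge whether the alarm numbered H is in the not-arrived queue. If it is, go to step 12; otherwise go to step 14.
Step 12: set the newly received alarm information as an associated alarm of the alarm numbered H in the not-arrived queue, to be processed together once the alarm numbered H arrives.
Step 13: update the maximum received alarm number by setting M = S.
Step 14: put the new alarm into the processing queue to await the processing thread.

In this embodiment, after the managed device sends TRAP-protocol alarm information to the management station, it may cache a copy of the alarm in a local cache as the sent queue; after the managed device receives a heartbeat check message, it may also update the cache.

For example, Figure 5 is a flowchart of the managed device's processing of a received heartbeat check message according to an embodiment of the present invention. As shown in Figure 5, the flow mainly includes the following steps:

Steps 1, 2, 3: after receiving the link-establishment command from the management station, establish the communication link, synchronize the alarm information data, and synchronize the maximum alarm number.
Step 4: after the link is established successfully, receive the heartbeat message sent by the management station.
Step 5: parse the data carried in the heartbeat check message, including the maximum alarm number that has arrived (M), the sequence of not-arrived alarm information detected by the management station (denoted [N1, …, Nk]), and the timestamp at which the management station collected this information (Tm), and obtain the current alarm number (C) from the managed device.
Step 6: compare C with M. If C is greater than M, go to step 7; otherwise, step 7 is skipped.
Step 7: add the integer values in the interval (M, C] to the not-arrived sequence, which is updated to [N1, …, Nk, …, Nc].
Step 8: delete from the sent-alarm cache the part that the management station has already received; that is, delete an alarm if its number is not in the not-arrived sequence.
Step 9: scan the sent-alarm cache queue and compute the difference between the current time (Tc) and the management station's heartbeat collection time Tm; if the difference is greater than the timeout threshold (the packet-loss time threshold), retransmit.

With the alarm information reporting method provided by the above embodiment, the management station sends heartbeat check messages to the managed device, so that the managed device (the sender of the alarm information) learns promptly whether packet loss has occurred, which solves the problem of poor timeliness. At the same time, retrieving the preceding alarm information of the currently received alarm information in the queue of not-arrived alarm information ensures that the order of the alarm information sent by the managed device is consistent with the order of the alarm information received by the management station, so that lost packets are discovered promptly and retransmitted actively while unnecessary retransmission is avoided.

Corresponding to the above alarm information reporting method, an embodiment of the present invention also provides an alarm information reporting device located on the managed device side.

Figure 6 is a schematic structural diagram of the alarm information reporting device according to an embodiment of the present invention. The device includes a first receiving module 10, a first determining module 20, and a first sending module 30. The first receiving module 10 is configured to receive a heartbeat check message from the management station, where the heartbeat check message carries number information indicating the management station's reception of alarm information in the current heartbeat period. The first determining module 20 is connected to the first receiving module 10 and is configured to determine the numbers of the alarm information to be retransmitted according to the number information and the numbers of the alarm information that the managed device has currently sent. The first sending module 30 is connected to the first determining module 20 and is configured to send the alarm information corresponding to the numbers to be retransmitted to the management station.

In a preferred implementation of this embodiment, the number information includes the maximum number of the alarm information that the management station has currently received. As shown in Figure 7, in this preferred implementation the first determining module 20 may include: an acquisition submodule 22, configured to add the alarm information whose numbers lie between the maximum number carried in the heartbeat check message and the maximum number of the alarm information that the managed device has currently sent to the retransmission-candidate alarm set; and a judging submodule 24, connected to the acquisition submodule 22 and configured to judge whether each piece of alarm information in the retransmission-candidate alarm set needs to be retransmitted. With this implementation, the management station can conveniently send an indication to the managed device, and the processing flow is relatively simple.

In another preferred implementation of this embodiment, the number information may further include the number set of alarm information that has not arrived at the management station; the acquisition submodule 22 is then further configured to add the alarm information corresponding to each number in that set to the retransmission-candidate alarm set. With this implementation, the managed device can determine fairly accurately which alarm information needs to be retransmitted.

In a preferred implementation of this embodiment, as shown in Figure 8, the judging submodule 24 may include: a first acquisition unit 242, configured to obtain the first timestamp of the heartbeat check message; a second acquisition unit 244, configured to obtain the second timestamp of the sending time of each piece of alarm information in the retransmission-candidate alarm set; and a judging unit 246, connected to the first acquisition unit 242 and the second acquisition unit 244 and configured to judge, for each piece of alarm information in the retransmission-candidate alarm set, whether the time difference between its second timestamp and the first timestamp is greater than the preset packet-loss time threshold and, if so, to determine that the alarm information needs to be retransmitted.

With the device provided by this embodiment, the managed device can identify lost alarm information fairly promptly and retransmit it in time, which improves the reliability of alarm information delivery.

Figure 9 is a schematic structural diagram of the alarm information reporting system according to an embodiment of the present invention. As shown in Figure 9, the system includes a management station 1 and a managed device 2. The managed device 2 may include the alarm information reporting device of the above embodiment.

As shown in Figure 10, the management station 1 may include a second receiving module 40, a second determining module 50, and a second sending module 60. The second receiving module 40 is configured to receive alarm information sent by the managed device, where the alarm information carries its number. The second determining module 50 is connected to the second receiving module 40 and is configured to determine the number information indicating the current reception of alarm information by the management station 1. The second sending module 60 is connected to the second determining module 50 and is configured to send a heartbeat check message to the managed device, where the heartbeat check message carries the number information determined by the second determining module in the current heartbeat period.

In a preferred implementation of this embodiment, the number information may include the maximum number of the alarm information that the management station has currently received.

In another preferred implementation of this embodiment, the number information may further include the number set of alarm information that has not arrived at the management station, and the alarm information also carries the number of its preceding alarm information. As shown in Figure 11, in this preferred implementation the second determining module 50 may include: a comparison module 52, configured to compare the maximum number M of the alarm information received before the current alarm information with the first number S of the currently received alarm information; if M+1 < S, it determines that the alarm information numbered M+1 to S-1 has not arrived and triggers the execution module 54 to add the numbers M+1 to S-1 to the number set of not-arrived alarm information; if M+1 = S, it triggers the retrieval module 56; a retrieval module 56, configured to search the number set of not-arrived alarm information for the second number H of the preceding alarm information of the currently received alarm information; if it is found, the retrieval module triggers the execution module to associate the second number H with the first number S and, after the alarm information corresponding to the second number H arrives, to add the alarm information corresponding to the second number H and the alarm information corresponding to the first number S to the received alarm information with the second number H first and the first number S second; if it is not found, it triggers the execution module 54 to add the alarm information with the first number S to the received alarm information; if S < M, the execution module 54 is triggered to add the alarm information with the first number S to the received alarm information; and an execution module 54, configured to perform the corresponding processing according to the comparison result of the comparison module 52 and/or the retrieval result of the retrieval module 56.

With the alarm information reporting system provided by the above embodiment, the management station sends heartbeat check messages to the managed device, so that the managed device (the sender of the alarm information) learns promptly whether packet loss has occurred, which solves the problem of poor timeliness. At the same time, retrieving the preceding alarm information of the currently received alarm information in the queue of not-arrived alarm information ensures that the order of the alarm information sent by the managed device is consistent with the order of the alarm information received by the management station, so that lost packets are discovered promptly and retransmitted actively while unnecessary retransmission is avoided.

From the above description, it can be seen that the present invention achieves the following technical effects: the management station carries the number information of the alarm information it has received in the heartbeat check message sent to the managed device (the sender of the alarm information), and the managed device determines from the number information the numbers of the alarm information to be retransmitted and then sends that alarm information to the management station. The managed device thus learns promptly whether packet loss has occurred, which solves the prior-art problem of poor timeliness in determining packet loss. At the same time, retrieving the preceding alarm information of the currently received alarm information in the queue of not-arrived alarm information ensures that the order of the alarm information sent by the managed device is consistent with the order of the alarm information received by the management station. As a result, only one heartbeat period is needed to determine whether packet loss has occurred, and the managed device resends the lost alarm information to the management station promptly and automatically.

Obviously, those skilled in the art should understand that the modules or steps of the present invention described above can be implemented by general-purpose computing devices; they can be concentrated on a single computing device or distributed over a network composed of multiple computing devices. Optionally, they can be implemented as program code executable by a computing device, so that they can be stored in a storage device and executed by the computing device; in some cases the steps shown or described can be performed in an order different from the one given here. Alternatively, they can be fabricated as individual integrated circuit modules, or several of the modules or steps can be fabricated as a single integrated circuit module. Thus, the present invention is not limited to any specific combination of hardware and software.

The above are only preferred embodiments of the present invention and are not intended to limit it; for those skilled in the art, the present invention may have various modifications and changes. Any modification, equivalent substitution, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.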
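
The managed device's reaction to a heartbeat check message, as described in the Figure 5 flow and the T(h)/T(a) example, might look like the following Python sketch. It is an illustration under assumptions rather than the patented code: the function name, the dictionary-based cache, and the use of plain numbers for timestamps are all hypothetical, and the comparison uses "greater than or equal to" to follow the worked example (t > G or t = G).

```python
def handle_heartbeat(cache, candidates, heartbeat_time, loss_threshold):
    """Prune acknowledged alarms and pick the ones to resend.

    cache:          dict mapping alarm number -> time the alarm was sent;
                    mutated in place.
    candidates:     numbers that have not arrived at the management station,
                    already merged with the interval (M, C].
    heartbeat_time: timestamp associated with the heartbeat, T(h).
    loss_threshold: packet-loss time threshold, G.
    Returns the sorted list of alarm numbers to resend.
    """
    candidates = set(candidates)

    # Alarms that are cached but no longer candidates have been received
    # by the management station, so their cached copies can be dropped.
    for number in list(cache):
        if number not in candidates:
            del cache[number]

    resend = []
    for number in sorted(candidates):
        sent_at = cache.get(number)
        if sent_at is None:
            continue  # nothing cached for this number, nothing to resend
        if heartbeat_time - sent_at >= loss_threshold:
            # Old enough to be considered lost: queue it for retransmission
            # and remove it from the cache, as in the worked example.
            resend.append(number)
            del cache[number]
        # Otherwise the alarm may still be in flight; keep waiting.
    return resend
```

With a cache of `{5: 100, 6: 108, 7: 109}`, candidates `{6, 7}`, a heartbeat time of 110, and a threshold of 2, the call returns `[6]` and leaves only alarm 7 in the cache. A variant could keep resent alarms cached until a later heartbeat confirms them; the source's example deletes them at once.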

Claims

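1. An alarm information reporting method, comprising:
a managed device receiving a heartbeat check message from a management station, wherein the heartbeat check message carries number information indicating the management station's current reception of alarm information in the current heartbeat period;
the managed device determining, according to the number information and the numbers of the alarm information that the managed device has currently sent, the numbers of the alarm information to be retransmitted;
the managed device sending the alarm information corresponding to the numbers of the alarm information to be retransmitted to the management station.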
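2. The method according to claim 1, wherein, before the managed device receives the heartbeat check message from the management station, the method comprises:
the managed device assigning a number to each piece of the alarm information in sending order; the managed device carrying the number of the alarm information in that alarm information, sending it to the management station, and caching the alarm information locally.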
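3. The method according to claim 2, wherein
the number information comprises: the maximum number of the alarm information that the management station has currently received; the managed device determining the numbers of the alarm information to be retransmitted according to the number information and the numbers of the alarm information that the managed device has currently sent comprises: the managed device adding the alarm information whose numbers lie between the maximum number carried in the heartbeat check message and the maximum number of the alarm information that the managed device has currently sent to a retransmission-candidate alarm set; the managed device judging whether each piece of alarm information in the retransmission-candidate alarm set is alarm information that needs to be retransmitted.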
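4. The method according to claim 3, wherein the number information further comprises: a number set of alarm information that has not arrived at the management station; before the managed device judges whether each piece of alarm information in the retransmission-candidate alarm set is alarm information that needs to be retransmitted, the method further comprises: the managed device adding the alarm information corresponding to each number in the number set to the retransmission-candidate alarm set.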
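5. The method according to claim 4, wherein, when a number is assigned to each piece of the alarm information, the method further comprises: judging whether the alarm information has preceding alarm information and, if so, using the number of the preceding alarm information as the preceding number of the alarm information and carrying the preceding number in the alarm information.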
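6. The method according to claim 5, wherein, after the managed device carries the number of the alarm information in that alarm information and sends it to the management station, the method further comprises:
the management station comparing the maximum number M of the alarm information received before the alarm information with the first number S of the currently received alarm information, wherein:
if M+1 < S, the management station determines that the alarm information numbered M+1 to S-1 has not arrived and adds the numbers M+1 to S-1 to the number set of not-arrived alarm information;
if M+1 = S, the management station searches the number set of not-arrived alarm information for the second number H of the preceding alarm information of the currently received alarm information; if it is found, the second number H is associated with the first number S and, after the alarm information corresponding to the second number H arrives, the alarm information corresponding to the second number H and the alarm information corresponding to the first number S are added to the received alarm information with the second number H first and the first number S second; if it is not found, the alarm information with the first number S is added to the received alarm information;
if S < M, the alarm information with the first number S is added to the received alarm information.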
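7. The method according to any one of claims 1 to 6, wherein the managed device judging whether each piece of alarm information in the retransmission-candidate alarm set is alarm information that needs to be retransmitted comprises: the managed device obtaining a first timestamp of the time at which the heartbeat check message was received; for each piece of alarm information in the retransmission-candidate alarm set, the managed device obtaining a second timestamp indicating when that alarm information was sent, and judging whether the time difference between the first timestamp and the second timestamp is greater than a preset packet-loss time threshold; if so, determining that the alarm information needs to be retransmitted.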
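8. The method according to any one of claims 2 to 6, wherein, after the managed device sends the not-arrived alarm information corresponding to the numbers of the not-arrived alarm information to the management station, the method further comprises:
the managed device determining, according to the heartbeat check message, the alarm information that has already been received successfully by the management station, and deleting from the managed device's cache the alarm information that has already been received successfully by the management station.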
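9. An alarm information reporting device, located on the managed device side, the device comprising: a first receiving module, configured to receive a heartbeat check message from a management station, wherein the heartbeat check message carries number information indicating the management station's current reception of alarm information in the current heartbeat period;
a first determining module, configured to determine the numbers of the alarm information to be retransmitted according to the number information and the numbers of the alarm information currently sent;
a first sending module, configured to send the alarm information corresponding to the numbers of the alarm information to be retransmitted to the management station.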
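10. The device according to claim 9, wherein
the number information comprises: the maximum number of the alarm information that the management station has currently received; the first determining module comprises:
an acquisition submodule, configured to add the alarm information whose numbers lie between the maximum number carried in the heartbeat check message and the maximum number of the alarm information that the managed device has currently sent to a retransmission-candidate alarm set;
a judging submodule, configured to judge whether each piece of alarm information in the retransmission-candidate alarm set is alarm information that needs to be retransmitted.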
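11. The device according to claim 10, wherein the number information further comprises: a number set of alarm information that has not arrived at the management station; the acquisition submodule is further configured to add the alarm information corresponding to each number in the number set to the retransmission-candidate alarm set.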
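12. The device according to claim 10, wherein the judging submodule comprises:
a first acquisition unit, configured to obtain a first timestamp of the heartbeat check message; a second acquisition unit, configured to obtain a second timestamp of the sending time of each piece of alarm information in the retransmission-candidate alarm set;
a judging unit, configured to judge, for each piece of alarm information in the retransmission-candidate alarm set, whether the time difference between the second timestamp and the first timestamp is greater than a preset packet-loss time threshold and, if so, to determine that the alarm information needs to be retransmitted.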
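13. An alarm information reporting system, comprising a management station and a managed device, wherein
the managed device comprises the device according to any one of claims 9 to 12; the management station comprises:
a second receiving module, configured to receive the alarm information sent by the managed device, wherein the alarm information carries the number of that alarm information;
a second determining module, configured to determine number information indicating the management station's current reception of alarm information;
a second sending module, configured to send a heartbeat check message to the managed device, wherein the heartbeat check message carries the number information determined by the second determining module in the current heartbeat period.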
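14. The system according to claim 13, wherein
the number information comprises: the maximum number of the alarm information that the management station has currently received and the number set of alarm information that has not arrived at the management station;
the alarm information also carries the number of the preceding alarm information of the alarm information; the second determining module comprises:
a comparison module, configured to compare the maximum number M of the alarm information received before the alarm information with the first number S of the currently received alarm information; if M+1 < S, to determine that the alarm information numbered M+1 to S-1 has not arrived and trigger the execution module to add the numbers M+1 to S-1 to the number set of not-arrived alarm information; if M+1 = S, to trigger the retrieval module; the retrieval module, configured to search the number set of not-arrived alarm information for the second number H of the preceding alarm information of the currently received alarm information; if it is found, to trigger the execution module to associate the second number H with the first number S and, after the alarm information corresponding to the second number H arrives, to add the alarm information corresponding to the second number H and the alarm information corresponding to the first number S to the received alarm information with the second number H first and the first number S second; if it is not found, to trigger the execution module to add the alarm information with the first number S to the received alarm information; if S < M, to trigger the execution module to add the alarm information with the first number S to the received alarm information;
the execution module, configured to perform the corresponding processing according to the comparison result of the comparison module and/or the retrieval result of the retrieval module.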
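
For readers who want a concrete picture of the numbering, preceding-number, and local caching behaviour recited for the managed device, here is a small Python sketch. Everything in it is an assumption made for illustration: the names `AlarmSender` and `SentAlarm`, the idea of grouping alarms by a `key` to decide which earlier alarm is the preceding one, and the injectable `transport` and `clock` callables are not taken from the patent text.

```python
import itertools
import time
from dataclasses import dataclass
from typing import Optional


@dataclass
class SentAlarm:
    number: int               # serial number, consecutive with step 1
    preceding: Optional[int]  # number of the preceding alarm, if any
    text: str
    sent_at: float            # send time, used later for the loss threshold


class AlarmSender:
    """Managed-device side: number each alarm, send it, and cache a copy."""

    def __init__(self, transport=None, clock=time.monotonic):
        self._numbers = itertools.count(1)
        self._last_for_key = {}  # key -> number of the latest alarm with that key
        self.cache = {}          # number -> SentAlarm, kept for retransmission
        self._transport = transport or (lambda alarm: None)
        self._clock = clock

    def send(self, key, text):
        number = next(self._numbers)
        # Assumption: the preceding alarm is the previous alarm about the
        # same object (for example, "link down" before "link up").
        preceding = self._last_for_key.get(key)
        alarm = SentAlarm(number, preceding, text, self._clock())
        self._last_for_key[key] = number
        self.cache[number] = alarm  # local copy for later retransmission
        self._transport(alarm)      # e.g. encode and send as an SNMP TRAP
        return alarm
```

Sending "link down" for one port, an unrelated alarm, and then "link up" for the first port yields numbers 1, 2, and 3, with the third alarm carrying preceding number 1.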
PCT/CN2012/070017 2011-06-27 2012-01-04 Alarm information reporting method, device, and system WO2013000282A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201110175334.9A CN102857354B (zh) 2011-06-27 2011-06-27 Alarm information reporting method, device, and system
CN201110175334.9 2011-06-27

Publications (1)

Publication Number Publication Date
WO2013000282A1 (zh) 2013-01-03

Family

ID=47403564

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/070017 WO2013000282A1 (zh) 2011-06-27 2012-01-04 Alarm information reporting method, device, and system

Country Status (2)

Country Link
CN (1) CN102857354B (zh)
WO (1) WO2013000282A1 (zh)

Also Published As

Publication number Publication date
CN102857354B (zh) 2018-08-03
CN102857354A (zh) 2013-01-02

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12803894

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 12803894

Country of ref document: EP

Kind code of ref document: A1