CN106961344B - Network fault detection method and device - Google Patents

Network fault detection method and device Download PDF

Info

Publication number
CN106961344B
CN106961344B CN201610014741.4A CN201610014741A CN106961344B CN 106961344 B CN106961344 B CN 106961344B CN 201610014741 A CN201610014741 A CN 201610014741A CN 106961344 B CN106961344 B CN 106961344B
Authority
CN
China
Prior art keywords
network element
node network
sent
message
messages
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610014741.4A
Other languages
Chinese (zh)
Other versions
CN106961344A (en
Inventor
易小聪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp filed Critical ZTE Corp
Priority to CN201610014741.4A priority Critical patent/CN106961344B/en
Priority to PCT/CN2016/080400 priority patent/WO2016197736A1/en
Publication of CN106961344A publication Critical patent/CN106961344A/en
Application granted granted Critical
Publication of CN106961344B publication Critical patent/CN106961344B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0677Localisation of faults
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0823Errors, e.g. transmission errors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/10Active monitoring, e.g. heartbeat, ping or trace-route

Abstract

The invention provides a network fault detection method and device, relates to the technical field of communication, and aims to guarantee operation of services in a network. The network fault detection method of the invention comprises the following steps: receiving a network fault detection instruction of a client, wherein the network fault detection instruction comprises an identifier of a tunnel to be detected; determining a network element included in the tunnel to be detected according to the identifier of the tunnel to be detected; sending a message counting instruction to the network element, so that the network element counts the number of messages received and/or sent by the network element according to the message counting instruction; receiving a message statistical result sent by the network element, and determining whether the network element fails according to the message statistical result, wherein the message statistical result comprises the number of messages received and/or sent by the network element. The invention is mainly used in the PTN technology.

Description

Network fault detection method and device
Technical Field
The present invention relates to the field of communications technologies, and in particular, to a network fault detection method and apparatus.
Background
PTN (Packet Transport Network) is a Packet-based routing architecture that can provide technical support for multiple services. In the PTN network, if a packet loss phenomenon occurs in an end-to-end tunnel, the traffic carried on the tunnel will be affected. In the existing scheme, whether a packet loss phenomenon occurs in a tunnel can be determined according to packet sending and receiving conditions of a tunnel head node. However, the existing scheme cannot locate the network element with the fault, so that the fault of the network is accurately solved, and the operation of the service in the network is influenced.
Disclosure of Invention
In view of this, the present invention provides a method and an apparatus for detecting a network failure, so as to ensure the operation of a service in a network.
In order to solve the above technical problem, the present invention provides a network fault detection method, including:
receiving a network fault detection instruction of a client, wherein the network fault detection instruction comprises an identifier of a tunnel to be detected;
determining a network element included in the tunnel to be detected according to the identifier of the tunnel to be detected;
sending a message counting instruction to the network element, so that the network element counts the number of messages received and/or sent by the network element according to the message counting instruction;
receiving a message statistical result sent by the network element, and determining whether the network element fails according to the message statistical result, wherein the message statistical result comprises the number of messages received and/or sent by the network element.
Preferably, the network element includes: a head node network element, other node network elements; the sending of the message statistic instruction to the network element includes:
and firstly sending a message statistical instruction to the other node network elements, and sending the message statistical instruction to the head node network element after a preset time interval.
Preferably, the network fault detection instruction further includes a detection configuration parameter; the method further comprises the following steps: and sending the detection configuration parameters to the head node network element.
Preferably, the detecting the configuration parameters includes: a detection mode; when the detection mode is the slow Ping detection mode, the detecting configuration parameters further include: packet length, repetition times, timeout time, reply mode; when the detection mode is the fast Ping detection mode, the detecting configuration parameters further include: bandwidth, priority, detection time, reply mode.
Preferably, when the detection mode is the slow Ping detection mode, the statistical result of receiving the message sent by the network element specifically is as follows: when the number of times of sending the message by the head node reaches the requirement of the parameter of the number of times of repeating, receiving the message statistical result sent by the network element;
when the detection mode is the fast Ping detection mode, the receiving of the statistical result of the message sent by the network element specifically comprises: and receiving the message statistical result sent by the network element when the detection time is up.
Preferably, when the message is transmitted unidirectionally, the receiving of the statistical result of the message sent by the network element determines whether the network element fails according to the statistical result of the message, where the statistical result of the message includes the number of messages received and/or sent by the network element:
if the network element is a head node network element, the message statistical result of the head node network element comprises the number of messages sent by the head node network element; if the number of the messages sent by the head node network element does not accord with the preset number of the sent messages, determining that the head node network element has a fault;
if the other node network element is an intermediate node network element between the head node network element and the tail node network element, the message statistical result of the intermediate node network element comprises the number of messages received and sent by the intermediate node network element; if the number of the messages received and/or sent by the intermediate node network element is not consistent with the number of the messages sent by the last node network element of the intermediate node network element, determining that the intermediate node network element has a fault;
if the other node network elements are tail node network elements, the message statistical result of the tail node network elements comprises the number of messages received by the tail node network elements; and if the number of the messages received by the tail node network element is not consistent with the number of the messages sent by the last node network element of the tail node network element, determining that the tail node network element has a fault.
Preferably, when the message is transmitted bidirectionally, the receiving the statistical result of the message sent by the network element determines whether the network element fails according to the statistical result of the message, wherein the statistical result of the message includes the number of the messages received and/or sent by the network element:
if the network element is a head node network element, the message statistical result of the head node network element comprises the number of messages sent by the head node network element in the forward direction and the number of messages received in the reverse direction; if the number of the messages sent by the head node network element in the forward direction is not consistent with the number of the preset sent messages or the number of the messages received by the head node network element in the reverse direction is not consistent with the number of the messages sent by the previous node network element of the head node network element, determining that the head node network element has a fault;
if the other node network element is an intermediate node network element between the head node network element and the tail node network element, the message statistical result of the intermediate node network element comprises the number of messages received and sent by the intermediate node network element; if the number of the messages received and/or sent by the intermediate node network element is not consistent with the number of the messages sent by the last node network element of the intermediate node network element, determining that the intermediate node network element has a fault;
if the other node network elements are tail node network elements, the message statistical result of the tail node network elements comprises the number of messages received and the number of messages sent by the tail node network elements; and if the number of the messages received and/or sent by the tail node network element is not consistent with the number of the messages sent by the last node network element of the tail node network element, determining that the tail node network element has a fault.
The invention also provides a network fault detection method, which comprises the following steps:
receiving a message statistical instruction sent by a server;
counting the number of received and/or sent messages according to the message counting instruction;
and sending a message statistical result to the server, wherein the message statistical result comprises the number of received and/or sent messages.
Preferably, the method further comprises:
receiving detection configuration parameters sent by the server; the detecting the configuration parameters comprises: a detection mode; when the detection mode is the slow Ping detection mode, the detecting configuration parameters further include: packet length, repetition times, timeout time, reply mode; when the detection mode is the fast Ping detection mode, the detecting configuration parameters further include: bandwidth, priority, detection time and reply mode;
the statistical result of the message sent to the server is specifically as follows: when the detection mode is a slow Ping detection mode, receiving a message statistical result sent by the network element when the number of times of sending messages by the head node reaches the requirement of the parameter of the number of times of repeating; and when the detection mode is the fast Ping detection mode, receiving the message statistical result sent by the network element when the detection time is up.
The present invention also provides a network fault detection device, including:
the system comprises a receiving unit, a judging unit and a judging unit, wherein the receiving unit is used for receiving a network fault detection instruction of a client, and the network fault detection instruction comprises an identifier of a tunnel to be detected;
a determining unit, configured to determine, according to the identifier of the tunnel to be detected, a network element included in the tunnel to be detected;
a sending unit, configured to send a message statistics instruction to the network element, so that the network element counts the number of messages received and/or sent by the network element according to the message statistics instruction;
and the processing unit is used for receiving a message statistical result sent by the network element and determining whether the network element fails according to the message statistical result, wherein the message statistical result comprises the number of messages received and/or sent by the network element.
Preferably, the network element includes: a head node network element, other node network elements; the sending unit is specifically configured to:
and firstly sending a message statistical instruction to the other node network elements, and sending the message statistical instruction to the head node network element after a preset time interval.
Preferably, the network fault detection instruction further includes a detection configuration parameter; the sending unit is further configured to send a detection configuration parameter to the head node network element.
Preferably, the processing unit includes:
a receiving module, configured to receive a statistical result of the message sent by the network element;
the analysis module is used for analyzing the message statistical result sent by the network element when the message is transmitted in a unidirectional mode; if the network element is a head node network element, the message statistical result of the head node network element comprises the number of messages sent by the head node network element; if the other node network element is an intermediate node network element between the head node network element and the tail node network element, the message statistical result of the intermediate node network element comprises the number of messages received and sent by the intermediate node network element; if the other node network elements are tail node network elements, the message statistical result of the tail node network elements comprises the number of messages received by the tail node network elements;
the processing module is used for determining that the head node network element has a fault if the number of the messages sent by the head node network element does not accord with the preset number of the sent messages when the messages are transmitted in a unidirectional mode; if the number of the messages received and/or sent by the intermediate node network element is not consistent with the number of the messages sent by the last node network element of the intermediate node network element, determining that the intermediate node network element has a fault; and if the number of the messages received by the tail node network element is not consistent with the number of the messages sent by the last node network element of the tail node network element, determining that the tail node network element has a fault.
Preferably, the processing unit includes:
a receiving module, configured to receive a statistical result of the message sent by the network element;
the analysis module is used for analyzing the message statistical result sent by the network element when the message is transmitted in two directions; if the network element is a head node network element, the message statistical result of the head node network element comprises the number of messages sent by the head node network element in the forward direction and the number of messages received in the reverse direction; if the other node network element is an intermediate node network element between the head node network element and the tail node network element, the message statistical result of the intermediate node network element comprises the number of messages received and sent by the intermediate node network element; if the other node network elements are tail node network elements, the message statistical result of the tail node network elements comprises the number of messages received and the number of messages sent by the tail node network elements;
a processing module, configured to determine that a head node network element fails if the number of messages sent by the head node network element in the forward direction is not consistent with a preset number of sent messages or the number of messages received by the head node network element in the reverse direction is not consistent with the number of messages sent by a previous node network element of the head node network element when a message is transmitted in a bidirectional manner; if the number of the messages received and/or sent by the intermediate node network element is not consistent with the number of the messages sent by the last node network element of the intermediate node network element, determining that the intermediate node network element has a fault; and if the number of the messages received and/or sent by the tail node network element is not consistent with the number of the messages sent by the last node network element of the tail node network element, determining that the tail node network element has a fault.
Preferably, when the detection mode is the slow Ping detection mode, the receiving module is specifically configured to receive a statistical result of the packet sent by the network element when the number of times that the header node sends the packet reaches the requirement of the parameter of the number of repetitions;
when the detection mode is the fast Ping detection mode, the receiving module is specifically configured to: and receiving the message statistical result sent by the network element when the detection time is up.
The present invention also provides a network fault detection device, including:
the receiving unit is used for receiving the message statistical instruction sent by the server;
a statistic unit for counting the number of received and/or sent messages according to the message statistic instruction;
and the sending unit is used for sending a message statistical result to the server, wherein the message statistical result comprises the number of received and/or sent messages.
Preferably, the receiving unit is further configured to receive a detection configuration parameter sent by the server; the detecting the configuration parameters comprises: a detection mode; when the detection mode is the slow Ping detection mode, the detecting configuration parameters further include: packet length, repetition times, timeout time, detection time; when the detection mode is the fast Ping detection mode, the detecting configuration parameters further include: packet length, bandwidth, priority, detection time;
the sending unit is specifically configured to: when the detection mode is a slow Ping detection mode, receiving a message statistical result sent by the network element when the number of times of sending messages by the head node reaches the requirement of the parameter of the number of times of repeating; and when the detection mode is the fast Ping detection mode, receiving the message statistical result sent by the network element when the detection time is up.
The technical scheme of the invention has the following beneficial effects:
in the embodiment of the invention, when the network fault is detected, the server receives the network fault detection instruction sent by the client, and then the server sends the message statistical instruction to the network element of the tunnel to be detected according to the network fault detection instruction, so that each network element counts the number of the received and/or sent messages and feeds the number back to the server according to the message statistical instruction, and the server can determine whether the network element has the fault according to the number of the received and/or sent messages fed back by each network element. Therefore, it can be seen from the above description that, in the embodiment of the present invention, the server can determine whether the network element fails according to the number of the received and/or sent messages of each network element, so as to accurately locate the failed network element, thereby facilitating the maintenance personnel to accurately solve the network failure, and further ensuring the operation of the service in the network.
Drawings
Fig. 1 is a flowchart of a network fault detection method according to a first embodiment of the present invention;
fig. 2 is a flowchart of a network fault detection method according to a second embodiment of the present invention;
fig. 3 is a flowchart of a network fault detection method according to a third embodiment of the present invention;
fig. 4 is a schematic diagram of a network fault detection apparatus according to a fourth embodiment of the present invention;
fig. 5 is a schematic diagram of a network fault detection apparatus according to a fifth embodiment of the present invention.
Detailed Description
The following detailed description of embodiments of the present invention will be made with reference to the accompanying drawings and examples. The following examples are intended to illustrate the invention but are not intended to limit the scope of the invention.
As shown in fig. 1, a network failure detection method according to a first embodiment of the present invention is executed by a server, and includes:
and step 11, receiving a network fault detection instruction of the client, wherein the network fault detection instruction comprises an identifier of a tunnel to be detected.
In the embodiment of the present invention, according to the characteristics of end-to-end communication of the PTN network architecture, a communication link between network element nodes at two ends is referred to as a tunnel. Herein, a tunnel that needs to detect whether a failure occurs is referred to as a tunnel to be detected. The transmission of the message in the tunnel may pass through a plurality of network elements. The identifier of the tunnel to be detected may be in any form, for example, an identifier formed by names of network element nodes at two ends. If it is assumed that the network element node at one end is a and the network element node at the other end is Z, the identifier of the tunnel to be detected here may be AZ or the like.
When a certain tunnel needs to be detected, the client sends a network fault detection instruction to the server, wherein the instruction comprises an identifier of the tunnel to be detected and is used for indicating the server to detect the fault of the corresponding tunnel.
And step 12, determining the network elements included in the tunnel to be detected according to the identification of the tunnel to be detected.
In a specific application, the server stores information of network elements through which each tunnel passes. Therefore, according to the identifier of the tunnel to be detected, the server can determine the network elements included in the tunnel to be detected. For example, assuming that the tunnel to be detected is a tunnel from node a to node Z, which is identified as AZ, and the network element that it passes through is A, B, C, Z, then the network elements included in the tunnel are respectively network elements A, B, C, Z.
And step 13, sending a message counting instruction to the network element, so that the network element counts the number of messages received and/or sent by the network element according to the message counting instruction.
After determining the network elements included in the tunnel to be detected, the server may send a message counting instruction to each network element, where the message counting instruction is used to enable the network element to count the number of messages received and/or sent by the network element according to the message counting instruction. In a specific application, in order to ensure the accuracy of the determined faulty network element, a message statistical instruction may be sent to the other node network elements first, and the message statistical instruction may be sent to the head node network element after a preset time interval. The preset time may be set arbitrarily, for example, set to 3 s.
Step 14, receiving a statistical result of the messages sent by the network element, and determining whether the network element has a fault according to the statistical result of the messages, wherein the statistical result of the messages includes the number of the messages received and/or sent by the network element.
When the message is transmitted in the tunnel to be detected, the message can be transmitted in a unidirectional transmission mode or can be transmitted in a bidirectional transmission mode. For example, suppose that the tunnels to be detected include network elements A, B, C, Z, where the network element a is called a head node network element, the network element Z is a tail node network element, and the network element B, C is called a middle node network element; and network element B, C, Z may be collectively referred to as other node network elements. In the unidirectional transmission mode, a message is sent by the network element A and transmitted to the network element Z through the network element B, C; in the bidirectional transmission mode, a message is sent by the network element a, transmitted to the network element Z through the network element B, C, and then sent by the network element Z, and the message sent by the network element Z is transmitted to the network element a through the network element B, C. The transmission direction from network elements a to Z may be referred to herein as forward transmission and the transmission direction from network elements Z to a may be referred to herein as reverse transmission.
Under different transmission modes, the message statistical results of each network element in the tunnel to be detected are different. In the embodiment of the invention, whether a network element node fails or not is determined mainly based on whether the number of messages received and sent by the network element node meets the requirement or not.
For example, in the unidirectional transmission mode, if a certain network element is a head node network element, the packet statistical result of the head node network element includes the number of packets sent by the head node network element. If a certain other node network element is an intermediate node network element between the head node network element and the tail node network element, the message statistical result of the intermediate node network element includes the number of messages received and sent by the intermediate node network element. If some other node network element is a tail node network element, the packet statistical result of the tail node network element includes the number of packets received by the tail node network element.
At this time, if the number of messages sent by the head node network element does not accord with the preset number of sent messages, determining that the head node network element has a fault; if the number of the messages received and/or sent by the intermediate node network element is not consistent with the number of the messages sent by the last node network element of the intermediate node network element, determining that the intermediate node network element has a fault; and if the number of the messages received by the tail node network element is not consistent with the number of the messages sent by the last node network element of the tail node network element, determining that the tail node network element has a fault. Otherwise, each node is considered to work normally.
For another example, in the bidirectional transmission mode, if a certain network element is a head node network element, the packet statistical result of the head node network element includes the number of packets sent by the head node network element in the forward direction and the number of packets received by the head node network element in the reverse direction. If a certain other node network element is an intermediate node network element between the head node network element and the tail node network element, the message statistical result of the intermediate node network element includes the number of messages received and sent by the intermediate node network element. If some other node network element is a tail node network element, the message statistical result of the tail node network element includes the number of messages received and the number of messages sent by the tail node network element.
At this time, if the number of messages sent by the head node network element in the forward direction is not consistent with the number of preset sent messages or the number of messages received by the head node network element in the reverse direction is not consistent with the number of messages sent by a previous node network element of the head node network element, determining that the head node network element fails; if the number of the messages received and/or sent by the intermediate node network element is not consistent with the number of the messages sent by the last node network element of the intermediate node network element, determining that the intermediate node network element has a fault; and if the number of the messages received and/or sent by the tail node network element is not consistent with the number of the messages sent by the last node network element of the tail node network element, determining that the tail node network element has a fault.
As can be seen from the above description, in the embodiment of the present invention, the server may determine whether the network element fails according to the number of the received and/or sent messages of each network element, so as to accurately locate the failed network element, thereby facilitating the maintenance personnel to accurately solve the network failure, and further ensuring the operation of the service in the network.
As shown in fig. 2, the network fault detection method according to the second embodiment of the present invention is executed by a network element, and includes:
and step 21, receiving a message statistical instruction sent by the server.
And step 22, counting the number of the received and/or sent messages according to the message counting instruction.
For example, in the unidirectional transmission mode, if a certain network element is a head node network element, the packet statistical result of the head node network element includes the number of packets sent by the head node network element. If a certain other node network element is an intermediate node network element between the head node network element and the tail node network element, the message statistical result of the intermediate node network element includes the number of messages received and sent by the intermediate node network element. If some other node network element is a tail node network element, the packet statistical result of the tail node network element includes the number of packets received by the tail node network element.
For another example, in the bidirectional transmission mode, if a certain network element is a head node network element, the packet statistical result of the head node network element includes the number of packets sent by the head node network element in the forward direction and the number of packets received by the head node network element in the reverse direction. If a certain other node network element is an intermediate node network element between the head node network element and the tail node network element, the message statistical result of the intermediate node network element includes the number of messages received and sent by the intermediate node network element. If some other node network element is a tail node network element, the message statistical result of the tail node network element includes the number of messages received and the number of messages sent by the tail node network element.
And step 23, sending a message statistical result to the server, wherein the message statistical result comprises the number of received and/or sent messages.
As can be seen from the above description, in the embodiment of the present invention, the server may determine whether the network element fails according to the number of the received and/or sent messages of each network element, so as to accurately locate the failed network element, thereby facilitating the maintenance personnel to accurately solve the network failure, and further ensuring the operation of the service in the network.
As shown in fig. 3, a network fault detection method according to a third embodiment of the present invention includes:
and step 31, the client sends a network fault detection instruction to the server.
The network fault detection instruction comprises a tunnel to be detected as a tunnel 1 from a network element A to a network element Z, and a corresponding identifier of the tunnel to be detected is the tunnel 1.
In addition, the network failure detection instruction may further include a detection configuration parameter, including: detection mode, packet length, repetition times, timeout time, reply mode, bandwidth, priority, detection time, etc. The detection mode comprises a slow Ping detection mode and a fast Ping detection mode. For the slow Ping detection mode, the required parameters include packet length, repetition times, timeout time and reply mode; for the fast Ping detection mode, the required parameters include bandwidth, priority, detection time, and reply mode.
And step 32, the server determines the network elements included in the tunnel to be detected according to the identification of the tunnel to be detected.
It is assumed that, in this step, the server determines, according to the tunnel configuration information stored in itself, that the network elements included in the tunnel 1 are the network element a, the network element B, and the network element Z, respectively. Wherein, the network element A is a head node, and the network element Z is a tail node.
And step 33, the server acquires the detection configuration parameters and sends the detection configuration parameters to the head node network element A.
Since the head node is mainly used to send the packet to other nodes, in this embodiment, only the detection configuration parameter may be sent to the head node network element a, which is used to instruct the head node network element a to send the packet according to the detection configuration parameter.
And step 34, the server sends a message statistic instruction to each network element.
Specifically, in this step, the message statistical instruction may be sent to the network element B and the network element Z, and then the message statistical instruction may be sent to the network element a after 3 seconds. Then, the network element A starts to send a message, and the message is transmitted to the network element Z through the network element B; if the two-way transmission is carried out, the message sent by the network element Z is transmitted to the network element A through the network element B. In this embodiment, it is assumed that the number of messages to be sent is 5.
And step 35, each network element counts the number of the messages according to the message counting instruction.
If the transmission is unidirectional, i.e. the message is transmitted only from the network element a to the network element Z, then for the network element a, it needs to count the number of messages it sends, for the network element Z, it needs to count the number of messages it receives, and for the network element Z, it needs to count the number of messages it receives and sends.
If the transmission is bidirectional, that is, the message is transmitted from the network element a to the network element Z (forward direction) and then transmitted from the network element Z to the network element Z (reverse direction), then for the network element a, it needs to count the number of the message sent in the forward direction and the number of the message received in the reverse direction, for the network element Z, it needs to count the number of the message received in the forward direction and the number of the message sent in the reverse direction, and for the network element Z, it needs to count the number of the message received in the forward direction and the number of the message sent in the reverse direction.
And step 36, each network element sends the message statistical result to the server, wherein the result comprises the message number counted by each network element.
And step 37, displaying the message statistical results of the network elements.
Specifically, the number of messages received and sent by each network element is displayed in this step.
And step 38, the server determines whether each network element has a fault according to the message statistical result of each network element.
In this embodiment, the message is transmitted in one direction. It is assumed that, through analysis, the number of messages sent by network element a is 5, the number of messages received by network element B is 5, the number of messages sent is 4, and the number of messages received by network element Z is 4. It can be determined that network element B is malfunctioning.
It is also assumed that, after analysis, the number of messages sent by network element a is 4, the number of messages received by network element B is 4, the number of messages sent is 4, and the number of messages received by network element Z is 3. It can be determined that a and network element Z are malfunctioning.
In this embodiment, the message is transmitted in two directions. It is assumed that, through analysis, the number of messages sent by network element a in the forward direction is 5, the number of messages received by network element B in the forward direction is 5, the number of messages sent by network element B in the forward direction is 4, and the number of messages received by network element Z in the forward direction is 4. The number of messages sent out by the network element Z in the reverse direction is 4, the number of messages received and sent out by the network element B in the reverse direction is 3 and 3 respectively, and the number of messages received by the network element A in the reverse direction is 2. Then it may be determined that network element B, A has failed.
Fig. 3 only shows the processes of sending the packet statistical instruction, counting the number of packets, and reporting the packet statistical result of the head node network element and the server, and the above processes of other nodes can be executed with reference to the corresponding description in fig. 3.
As can be seen from the above description, in the embodiment of the present invention, the server can determine whether the network element fails according to the number of the received and/or sent messages of each network element, so as to accurately locate the failed network element, thereby facilitating the maintenance personnel to accurately solve the network failure, and further ensuring the operation of the service in the network; and the scheme is simple and easy to realize.
As shown in fig. 4, a network failure detection apparatus according to a fourth embodiment of the present invention includes:
a receiving unit 41, configured to receive a network failure detection instruction of a client, where the network failure detection instruction includes an identifier of a tunnel to be detected; a determining unit 42, configured to determine, according to the identifier of the tunnel to be detected, a network element included in the tunnel to be detected; a sending unit 43, configured to send a message statistics instruction to the network element, so that the network element counts the number of messages received and/or sent by the network element according to the message statistics instruction; a processing unit 44, configured to receive a statistical result of the messages sent by the network element, and determine whether the network element fails according to the statistical result of the messages, where the statistical result of the messages includes the number of the messages received and/or sent by the network element.
In a specific application, the network element includes: a head node network element, and other node network elements. In this case, the sending unit 43 is specifically configured to: and firstly sending a message statistical instruction to the other node network elements, and sending the message statistical instruction to the head node network element after a preset time interval. Wherein the preset time may be set to 3 s.
In order to enable the message to be transmitted accurately, the network fault detection instruction further comprises a detection configuration parameter. Therefore, the sending unit 43 is further configured to send the detection configuration parameter to the head node network element.
Specifically, the processing unit 44 includes:
a receiving module, configured to receive a statistical result of the message sent by the network element; the analysis module is used for analyzing the message statistical result sent by the network element when the message is transmitted in a unidirectional mode; if the network element is a head node network element, the message statistical result of the head node network element comprises the number of messages sent by the head node network element; if the other node network element is an intermediate node network element between the head node network element and the tail node network element, the message statistical result of the intermediate node network element comprises the number of messages received and sent by the intermediate node network element; if the other node network elements are tail node network elements, the message statistical result of the tail node network elements comprises the number of messages received by the tail node network elements;
the processing module is used for determining that the head node network element has a fault if the number of the messages sent by the head node network element does not accord with the preset number of the sent messages when the messages are transmitted in a unidirectional mode; if the number of the messages received and/or sent by the intermediate node network element is not consistent with the number of the messages sent by the last node network element of the intermediate node network element, determining that the intermediate node network element has a fault; and if the number of the messages received by the tail node network element is not consistent with the number of the messages sent by the last node network element of the tail node network element, determining that the tail node network element has a fault.
Specifically, the processing unit 44 may further include:
a receiving module, configured to receive a statistical result of the message sent by the network element;
the analysis module is used for analyzing the message statistical result sent by the network element when the message is transmitted in two directions; if the network element is a head node network element, the message statistical result of the head node network element comprises the number of messages sent by the head node network element in the forward direction and the number of messages received in the reverse direction; if the other node network element is an intermediate node network element between the head node network element and the tail node network element, the message statistical result of the intermediate node network element comprises the number of messages received and sent by the intermediate node network element; if the other node network elements are tail node network elements, the message statistical result of the tail node network elements comprises the number of messages received and the number of messages sent by the tail node network elements;
a processing module, configured to determine that a head node network element fails if the number of messages sent by the head node network element in the forward direction is not consistent with a preset number of sent messages or the number of messages received by the head node network element in the reverse direction is not consistent with the number of messages sent by a previous node network element of the head node network element when a message is transmitted in a bidirectional manner; if the number of the messages received and/or sent by the intermediate node network element is not consistent with the number of the messages sent by the last node network element of the intermediate node network element, determining that the intermediate node network element has a fault; and if the number of the messages received and/or sent by the tail node network element is not consistent with the number of the messages sent by the last node network element of the tail node network element, determining that the tail node network element has a fault.
It can be seen from the above-mentioned functions of the processing units that the processing units of the two forms are identical, but perform different functions in different message transmission modes. Therefore, in a specific application, the processing units in the two forms can be combined into one form, so that the processing unit can execute the function corresponding to the transmission mode in different message transmission modes.
Due to the difference of the detection modes, the modes of the network element sending the message statistical result are different. When the detection mode is the slow Ping detection mode, the receiving module is specifically configured to receive a statistical result of the message sent by the network element when the number of times that the head node sends the message reaches the requirement of the parameter of the number of repetitions; when the detection mode is the fast Ping detection mode, the receiving module is specifically configured to: and receiving the message statistical result sent by the network element when the detection time is up.
The working principle of the device of the present invention can be referred to the description of the foregoing method embodiment, and the device can be located in a server.
As can be seen from the above description, in the embodiment of the present invention, the server may determine whether the network element fails according to the number of the received and/or sent messages of each network element, so as to accurately locate the failed network element, thereby facilitating the maintenance personnel to accurately solve the network failure, and further ensuring the operation of the service in the network.
As shown in fig. 5, a network failure detection apparatus according to a fifth embodiment of the present invention includes:
a receiving unit 51, configured to receive a message statistic instruction sent by a server; a counting unit 52, configured to count the number of received and/or sent messages according to the message counting instruction; a sending unit 53, configured to send a message statistical result to the server, where the message statistical result includes the number of received and/or sent messages.
In addition, the receiving unit 51 is further configured to receive the detection configuration parameters sent by the server; the detecting the configuration parameters comprises: a detection mode; when the detection mode is the slow Ping detection mode, the detecting configuration parameters further include: packet length, repetition times, timeout time, detection time; when the detection mode is the fast Ping detection mode, the detecting configuration parameters further include: packet length, bandwidth, priority, detection time. In this case, the sending unit 53 is specifically configured to: when the detection mode is a slow Ping detection mode, receiving a message statistical result sent by the network element when the number of times of sending messages by the head node reaches the requirement of the parameter of the number of times of repeating; and when the detection mode is the fast Ping detection mode, receiving the message statistical result sent by the network element when the detection time is up.
The working principle of the device of the present invention can refer to the description of the foregoing method embodiment, and the device can be located in each network element.
As can be seen from the above description, in the embodiment of the present invention, the server may determine whether the network element fails according to the number of the received and/or sent messages of each network element, so as to accurately locate the failed network element, thereby facilitating the maintenance personnel to accurately solve the network failure, and further ensuring the operation of the service in the network.
While the foregoing is directed to the preferred embodiment of the present invention, it will be understood by those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention as defined in the appended claims.

Claims (10)

1. A method for detecting network faults is characterized by comprising the following steps:
the method comprises the steps that a server receives a network fault detection instruction which is sent by a client and contains detection configuration parameters and an identification of a tunnel to be detected, and determines a network element included in the tunnel to be detected according to the identification of the tunnel to be detected, wherein the network element comprises: a head node network element and other node network elements;
after sending the detection configuration parameters to the head node network element, the server sends message statistical instructions to the network elements included in the tunnel to be detected respectively, so that the head node network element sends messages to other node network elements according to the detection configuration parameters after receiving the message statistical instructions, and the network elements count the number of the messages received and/or sent by the network elements according to the message statistical instructions;
the server receives a message statistical result sent by the network element, and determines whether the network element fails according to the message statistical result, wherein the message statistical result comprises the number of messages received and/or sent by the network element;
wherein the detecting the configuration parameter comprises: a detection mode;
when the detection mode is the slow Ping detection mode, the detecting configuration parameters further include: packet length, repetition times, timeout time, reply mode; when the detection mode is the fast Ping detection mode, the detecting configuration parameters further include: bandwidth, priority, detection time, reply mode.
2. The method according to claim 1, wherein said sending a message statistic instruction to the network element included in the tunnel to be detected comprises:
and firstly sending a message statistical instruction to the other node network elements, and sending the message statistical instruction to the head node network element after a preset time interval.
3. The method of claim 1,
when the detection mode is the slow Ping detection mode, the receiving of the statistical result of the message sent by the network element specifically comprises: when the number of times of sending the message by the head node reaches the requirement of the parameter of the number of times of repeating, receiving the message statistical result sent by the network element;
when the detection mode is the fast Ping detection mode, the receiving of the statistical result of the message sent by the network element specifically comprises: and receiving the message statistical result sent by the network element when the detection time is up.
4. The method according to any of claims 1-3, wherein when transmitting the message in one direction, said receiving the statistical result of the message sent by said network element, and determining whether said network element has a fault according to said statistical result of the message, wherein said statistical result of the message includes the number of messages received and/or sent by said network element includes:
if the network element is a head node network element, the message statistical result of the head node network element comprises the number of messages sent by the head node network element; if the number of the messages sent by the head node network element does not accord with the preset number of the sent messages, determining that the head node network element has a fault;
if the other node network element is an intermediate node network element between the head node network element and the tail node network element, the message statistical result of the intermediate node network element comprises the number of messages received and sent by the intermediate node network element; if the number of the messages received and/or sent by the intermediate node network element is not consistent with the number of the messages sent by the last node network element of the intermediate node network element, determining that the intermediate node network element has a fault;
if the other node network elements are tail node network elements, the message statistical result of the tail node network elements comprises the number of messages received by the tail node network elements; and if the number of the messages received by the tail node network element is not consistent with the number of the messages sent by the last node network element of the tail node network element, determining that the tail node network element has a fault.
5. The method according to any of claims 1-3, wherein when transmitting messages in both directions, said receiving a statistical result of messages sent by said network element, and determining whether said network element has a fault according to said statistical result of messages, wherein said statistical result of messages includes the number of messages received and/or sent by said network element includes:
if the network element is a head node network element, the message statistical result of the head node network element comprises the number of messages sent by the head node network element in the forward direction and the number of messages received in the reverse direction; if the number of the messages sent by the head node network element in the forward direction is not consistent with the number of the preset sent messages or the number of the messages received by the head node network element in the reverse direction is not consistent with the number of the messages sent by the previous node network element of the head node network element, determining that the head node network element has a fault;
if the other node network element is an intermediate node network element between the head node network element and the tail node network element, the message statistical result of the intermediate node network element comprises the number of messages received and sent by the intermediate node network element; if the number of the messages received and/or sent by the intermediate node network element is not consistent with the number of the messages sent by the last node network element of the intermediate node network element, determining that the intermediate node network element has a fault;
if the other node network elements are tail node network elements, the message statistical result of the tail node network elements comprises the number of messages received and the number of messages sent by the tail node network elements; and if the number of the messages received and/or sent by the tail node network element is not consistent with the number of the messages sent by the last node network element of the tail node network element, determining that the tail node network element has a fault.
6. A network fault detection apparatus, comprising:
the receiving unit is used for receiving a network fault detection instruction which is sent by the client and contains detection configuration parameters and the identification of the tunnel to be detected;
a determining unit, configured to determine, according to the identifier of the tunnel to be detected, a network element included in the tunnel to be detected, where the network element includes: a head node network element and other node network elements;
a sending unit, configured to send the detection configuration parameters to the head node network element, and then send message statistical instructions to network elements included in the tunnel to be detected, so that the head node network element sends a message to the other node network elements according to the detection configuration parameters after receiving the message statistical instructions, so that the network elements count the number of messages received and/or sent by the network elements according to the message statistical instructions;
a processing unit, configured to receive a statistical result of messages sent by the network element, and determine whether the network element fails according to the statistical result of the messages, where the statistical result of the messages includes the number of messages received and/or sent by the network element;
wherein the detecting the configuration parameter comprises: a detection mode;
when the detection mode is the slow Ping detection mode, the detecting configuration parameters further include: packet length, repetition times, timeout time, reply mode; when the detection mode is the fast Ping detection mode, the detecting configuration parameters further include: bandwidth, priority, detection time, reply mode.
7. The apparatus according to claim 6, wherein the sending unit is specifically configured to: and firstly sending a message statistical instruction to the other node network elements, and sending the message statistical instruction to the head node network element after a preset time interval.
8. The apparatus of claim 6, wherein the processing unit comprises:
a receiving module, configured to receive a statistical result of the message sent by the network element;
the analysis module is used for analyzing the message statistical result sent by the network element when the message is transmitted in a unidirectional mode; if the network element is a head node network element, the message statistical result of the head node network element comprises the number of messages sent by the head node network element; if the other node network element is an intermediate node network element between the head node network element and the tail node network element, the message statistical result of the intermediate node network element comprises the number of messages received and sent by the intermediate node network element; if the other node network elements are tail node network elements, the message statistical result of the tail node network elements comprises the number of messages received by the tail node network elements;
the processing module is used for determining that the head node network element has a fault if the number of the messages sent by the head node network element does not accord with the preset number of the sent messages when the messages are transmitted in a unidirectional mode; if the number of the messages received and/or sent by the intermediate node network element is not consistent with the number of the messages sent by the last node network element of the intermediate node network element, determining that the intermediate node network element has a fault; and if the number of the messages received by the tail node network element is not consistent with the number of the messages sent by the last node network element of the tail node network element, determining that the tail node network element has a fault.
9. The apparatus of claim 6, wherein the processing unit comprises:
a receiving module, configured to receive a statistical result of the message sent by the network element;
the analysis module is used for analyzing the message statistical result sent by the network element when the message is transmitted in two directions; if the network element is a head node network element, the message statistical result of the head node network element comprises the number of messages sent by the head node network element in the forward direction and the number of messages received in the reverse direction; if the other node network element is an intermediate node network element between the head node network element and the tail node network element, the message statistical result of the intermediate node network element comprises the number of messages received and sent by the intermediate node network element; if the other node network elements are tail node network elements, the message statistical result of the tail node network elements comprises the number of messages received and the number of messages sent by the tail node network elements;
a processing module, configured to determine that a head node network element fails if the number of messages sent by the head node network element in the forward direction is not consistent with a preset number of sent messages or the number of messages received by the head node network element in the reverse direction is not consistent with the number of messages sent by a previous node network element of the head node network element when a message is transmitted in a bidirectional manner; if the number of the messages received and/or sent by the intermediate node network element is not consistent with the number of the messages sent by the last node network element of the intermediate node network element, determining that the intermediate node network element has a fault; and if the number of the messages received and/or sent by the tail node network element is not consistent with the number of the messages sent by the last node network element of the tail node network element, determining that the tail node network element has a fault.
10. The apparatus according to claim 8 or 9, wherein when the detection mode is the slow Ping detection mode, the receiving module is specifically configured to receive the statistical result of the packet sent by the network element when the number of times that the header node sends the packet reaches the requirement of the parameter of the number of repetitions;
when the detection mode is the fast Ping detection mode, the receiving module is specifically configured to: and receiving the message statistical result sent by the network element when the detection time is up.
CN201610014741.4A 2016-01-08 2016-01-08 Network fault detection method and device Active CN106961344B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201610014741.4A CN106961344B (en) 2016-01-08 2016-01-08 Network fault detection method and device
PCT/CN2016/080400 WO2016197736A1 (en) 2016-01-08 2016-04-27 Network fault detection method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610014741.4A CN106961344B (en) 2016-01-08 2016-01-08 Network fault detection method and device

Publications (2)

Publication Number Publication Date
CN106961344A CN106961344A (en) 2017-07-18
CN106961344B true CN106961344B (en) 2021-02-09

Family

ID=57502903

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610014741.4A Active CN106961344B (en) 2016-01-08 2016-01-08 Network fault detection method and device

Country Status (2)

Country Link
CN (1) CN106961344B (en)
WO (1) WO2016197736A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109560945B (en) * 2017-09-25 2021-02-12 华为技术有限公司 Method, device and system for detecting service quality
CN109728931B (en) * 2017-10-31 2021-06-15 华为技术有限公司 Network fault delimitation method and device and computer storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1815970A (en) * 2005-02-01 2006-08-09 华为技术有限公司 Method for detecting network chain fault and positioning said fault
CN102355383A (en) * 2011-09-28 2012-02-15 深圳市深信服电子科技有限公司 Method and system for detecting network node failure
CN102594600A (en) * 2012-02-21 2012-07-18 中兴通讯股份有限公司 Method and system for determining failure position of bidirectional forwarding detection session
CN104601407A (en) * 2015-01-23 2015-05-06 北京奥普维尔科技有限公司 Intelligent network testing method
US20160006634A1 (en) * 2013-03-15 2016-01-07 Huawei Technologies Co., Ltd. Network Performance Monitoring Method and Apparatus

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101232343B (en) * 2008-01-21 2012-03-07 中兴通讯股份有限公司 Method for locating embedded control path fault
CN101808021A (en) * 2010-04-16 2010-08-18 华为技术有限公司 Fault detection method, device and system, message statistical method and node equipment
CN102137282B (en) * 2010-12-15 2014-02-19 华为技术有限公司 Method, device, nodes and system for detecting faulted link
CN102868553B (en) * 2012-08-28 2016-03-30 华为技术有限公司 Fault Locating Method and relevant device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1815970A (en) * 2005-02-01 2006-08-09 华为技术有限公司 Method for detecting network chain fault and positioning said fault
CN102355383A (en) * 2011-09-28 2012-02-15 深圳市深信服电子科技有限公司 Method and system for detecting network node failure
CN102594600A (en) * 2012-02-21 2012-07-18 中兴通讯股份有限公司 Method and system for determining failure position of bidirectional forwarding detection session
US20160006634A1 (en) * 2013-03-15 2016-01-07 Huawei Technologies Co., Ltd. Network Performance Monitoring Method and Apparatus
CN104601407A (en) * 2015-01-23 2015-05-06 北京奥普维尔科技有限公司 Intelligent network testing method

Also Published As

Publication number Publication date
WO2016197736A1 (en) 2016-12-15
CN106961344A (en) 2017-07-18

Similar Documents

Publication Publication Date Title
CN111052668B (en) Residence time measurement for optimizing network services
US20090003241A1 (en) A Method and System For Obtaining Path Maximum Transfer Unit in Network
CN104917641A (en) Method, device and system for testing packet loss
US10097366B2 (en) Methods, systems, and computer readable media for monitoring latency and/or time-based data locations of multicast communications
CN101808021A (en) Fault detection method, device and system, message statistical method and node equipment
CN102195832A (en) Loopback testing method, device and system
CN102325060B (en) Link bandwidth test method and router
CN106961344B (en) Network fault detection method and device
JP4311675B2 (en) Quality degradation isolation method and apparatus
CN103139014A (en) Method and device for network quality evaluating based on by-pass
CN108683568B (en) Method and system for sectional detection of VPN service channel quality
CN102487338A (en) Method and device for adjusting bidirectional forwarding detection (BFD) sending interval according to network jitter
US20120063331A1 (en) Path analyzer
KR101866377B1 (en) Packet loss link detection method in multicast of sdn
JP4169725B2 (en) Packet discard location search method and apparatus
CN105471667A (en) Multicast link detection method, network equipment and server
WO2022007969A1 (en) Network error performance measurement
JP3953999B2 (en) Congestion detection apparatus, congestion detection method and program for TCP traffic
EP2712121A1 (en) Method, apparatus and system for detecting packet path signal degraded
JP6378653B2 (en) Service impact cause estimation apparatus, service impact cause estimation program, and service impact cause estimation method
KR101831165B1 (en) Device for monitoring and analyzing time synchronization state between ntp sever and ntp client
CN102386997B (en) A kind of method and system detecting signal degradation state of grouping paths
JP6407133B2 (en) Communication quality degradation detection system, communication quality degradation detection method, and program
CN106412968B (en) Method, device and terminal for collecting wireless link information
WO2019001129A1 (en) Method, apparatus, and system for measuring transmission performance

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant