WO2015176603A1 - 网络故障定位方法和装置 - Google Patents

网络故障定位方法和装置 Download PDF

Info

Publication number
WO2015176603A1
WO2015176603A1 PCT/CN2015/078201 CN2015078201W WO2015176603A1 WO 2015176603 A1 WO2015176603 A1 WO 2015176603A1 CN 2015078201 W CN2015078201 W CN 2015078201W WO 2015176603 A1 WO2015176603 A1 WO 2015176603A1
Authority
WO
WIPO (PCT)
Prior art keywords
network
port
network device
alarm information
fault
Prior art date
Application number
PCT/CN2015/078201
Other languages
English (en)
French (fr)
Inventor
林铭
惠建恒
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Publication of WO2015176603A1 publication Critical patent/WO2015176603A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks

Definitions

  • the embodiments of the present invention relate to communication technologies, and in particular, to a network fault location method and apparatus.
  • a large and complex rule base is established based on historical experience data, and when a fault occurs, a fault is determined by matching a large and complicated rule base by describing a rule of occurrence of a fault.
  • Embodiments of the present invention provide a network fault location method and apparatus to improve the efficiency of network fault location.
  • a first aspect of the embodiments of the present invention provides a network fault locating method, where the network includes at least two network devices, each network device includes M network modules, and each network module includes N ports, where the M is An integer greater than or equal to 1, the N being an integer greater than or equal to 1, including:
  • the network topology information includes a connection relationship of each port of each network module of each network device;
  • the performing network fault according to the information of the second network device includes:
  • the locating the network fault includes a link failure between the first port and the second port.
  • the method further includes:
  • the locating the network fault includes the second network device itself being faulty, or the network module where the second port is located is faulty.
  • the locating the network fault comprises the second network device itself fault, or the second The network module where the port is located is faulty, including:
  • the method further includes:
  • the alarm information, the location of the network fault includes the second network device itself failure.
  • the method further includes:
  • the fault that the network fault is located is that the network module where the second port is located is faulty.
  • a second aspect of the embodiments of the present invention provides a network fault locating device, where the network includes at least two network devices, each network device includes M network modules, and each network module includes N ports, where the M is An integer greater than or equal to 1, the N being an integer greater than or equal to 1, including:
  • An acquiring module configured to acquire network topology information of the network, where the network topology information includes a connection relationship of each port of each network module of each network device;
  • a receiving module configured to receive alarm information of the first port reported by the first network device
  • a determining module configured to determine, according to the network topology information, a second network device where the second port connected to the first port is located;
  • a processing module configured to locate a network fault according to the information of the second network device.
  • the processing module is specifically configured to determine whether the alarm information of the second port is received, and if the alarm information of the second port is received And locating the network fault includes a link failure between the first port and the second port.
  • the processing module is further configured to: if the alarm information of the second port is not received, locate the location
  • the network failure includes the failure of the second network device itself, or the network module where the second port is located is faulty.
  • the processing module is specifically configured to determine whether the port that is sent by the second network device is received.
  • the alarm information if the alarm information sent by any port of the second network device is received, the network fault that is located by the second port is faulty.
  • the processing module is further configured to determine, according to the network topology information, that each port of the second network device is separately connected, if the alarm information sent by any port of the second network device is not received. Determining whether to receive the alarm information reported by the P third ports respectively connected to all the ports of the second network device, if receiving the connection with all the ports of the second network device respectively The alarm information reported by the P third ports is that the network fault is located, and the second network device itself is faulty.
  • the processing module is further configured to: if not all ports of the second network device are not respectively connected The alarm information reported by the P third ports is that the network fault is located, and the network module where the second port is located is faulty.
  • the network fault location method and device obtaineds the network topology information of the network, where the network topology information includes the connection relationship of each port of each network module of each network device; and receives the report reported by the first network device.
  • the alarm information of the first port determines, according to the network topology information, the second network device where the second port connected to the first port is located, and locates the network fault according to the information of the second network device, that is, only needs to generate and generate
  • the information about the second network device where the second port connected to the first port of the alarm information is located can locate the network fault, and does not need to establish a large and complicated rule base, and does not need to match with a large and complicated rule base. Improve the efficiency of network fault location.
  • Embodiment 1 is a schematic flowchart of Embodiment 1 of a network fault location method according to the present invention
  • FIG. 2 is a schematic diagram of a first application scenario of Embodiment 1 of a network fault location method according to the present invention
  • FIG. 3 is a schematic diagram of a second application scenario of Embodiment 1 of a network fault location method according to the present invention.
  • FIG. 4 is a schematic diagram of a third application scenario of Embodiment 1 of a network fault location method according to the present invention.
  • FIG. 5 is a schematic structural diagram of Embodiment 1 of a network fault locating device according to the present invention.
  • FIG. 6 is a schematic structural diagram of Embodiment 2 of a network fault locating device according to the present invention.
  • the technical solution of the present invention is mainly used for locating a network fault of a data link (L2) layer.
  • the network includes at least two network devices, each network device includes M network modules, and the network module may be, for example, a network card, and each network module It can include N ports, which are used to receive data and send data.
  • Each network device has a monitoring module for monitoring the status of each port of each network module of the network device. When the port is detected to be disconnected, the network is The management device reports the alarm information of the first port.
  • the disconnection of the port may be caused by the fault of the network device itself, the fault of the network module, or the link fault between the ports.
  • the fault of the network device itself refers to the fault of the network device. .
  • the main idea of the present invention is to obtain network topology information of a network, where the network topology information includes a connection relationship of each port of each network module of each network device, when receiving the report reported by the first network.
  • the second network device where the second port connected to the first port is located is determined according to the network topology information, and the network fault is located according to the information of the second network device, where the second network device
  • the information refers to whether the port of the second network device generates an alarm and whether an alarm is generated by the port connected to each port of the second network device.
  • the technical solution of the present invention only needs to locate a network fault according to the information of the second network device where the second port connected to the first port that generates the alarm information is located, and does not need to establish a large and complicated rule base, and does not need to be large and Complex rule bases are matched, thus improving the efficiency of network fault location.
  • FIG. 1 is a schematic flowchart of Embodiment 1 of a network fault locating method according to the present invention.
  • the executor of this embodiment is a network management device, and the method in this embodiment is as follows:
  • the network topology information includes a connection relationship of each port of each network module of each network device.
  • S102 Receive alarm information of the first port reported by the first network device.
  • the monitoring module of the first network device detects that the port is disconnected on the first network device, the alarm information of the first port is reported to the network management device, so that the network management device knows the fault condition in time and performs fault processing.
  • the present invention does not limit the order in which S101 and S102 are executed.
  • S103 Determine, according to the network topology information, a second network device where the second port connected to the first port is located.
  • S104 Locating a network fault according to the information of the second network device.
  • the first case is: determining whether the alarm information of the second port is received, and if the alarm information of the second port is received, the locating the network fault includes the first port and the second The link between the ports is faulty.
  • picture 2. 2 is a schematic diagram of a first application scenario of the network fault locating method according to the first embodiment of the present invention.
  • the first port 101 is disconnected, and the monitoring module of the network device where the first port 101 is located reports the first port to the network management device.
  • the alarm is disconnected, and the network management device learns the second port 201 connected to the first port 101 according to the network topology, and determines whether the alarm information of the second port 201 is received. If received, the first port is indicated.
  • the link between 101 and the second port 201 is faulty. If it is not received, it is judged whether it belongs to the second case or the third case described below.
  • the reason that the first port is disconnected is not caused by the link failure, and the positioning of the network fault includes the foregoing
  • the network device itself fails, or the network module where the second port is located is faulty.
  • the reason why the first port is disconnected is that the second network device itself is faulty or the network module where the second port is located is faulty, and the following scheme can be used for determining:
  • the reason for determining that the first port is disconnected is that the second network device itself is faulty or the network module where the second port is located is faulty. Further, determining whether to receive any port of the second network device The sent alarm information, if the alarm information sent by any port of the second network device is received, indicating that the second network device itself is not faulty, the network fault is located, and the network module where the second port is located is faulty.
  • a third case on the basis of the second case, if the alarm information sent by any port of the second network device is not received, determining, according to the network topology information, that all ports of the second network device are respectively connected
  • the third port is configured to determine whether the alarm information reported by the P third ports respectively connected to the ports of the second network device is received, and if the P is connected to all the ports of the second network device,
  • the alarm information reported by the third port, the location of the network fault includes the failure of the second network device itself.
  • Figure 3. 3 is a schematic diagram of a second application scenario of the network fault locating method in the first embodiment of the present invention. In FIG.
  • the first port 101 is disconnected, and the monitoring module of the network device 1 where the first port 101 is located reports the first to the network management device.
  • the alarm of disconnecting the port 101, the network management device knows the first port 101 according to the network topology structure.
  • the connected second port 201, the network port 2 where the second port 201 is located, all the ports are the second port 201, the port 202, the port 203, and the port 204, and the corresponding four third ports are the first port 101 and the port respectively. 301, port 401 and port 501.
  • Network device 2 is faulty. Because the network device 2 fails, the monitoring module of the network device 2 cannot report the alarm because the third port connected to all the ports of the network device 2 cannot perform data communication with the network device 2, and therefore, with all the ports of the network device 2 The three ports generate alarms, so in this case, it can be determined that the network device 2 is faulty.
  • the positioning network fault includes the second port.
  • the network module is faulty. Because the alarm information reported by the P third ports respectively connected to all the ports of the second network device is not received, it indicates that the second network device itself cannot be faulty, and therefore, the first port is disconnected. The reason is that the network module where the second port is located is faulty.
  • Figure 4. 4 is a schematic diagram of a third application scenario of the network fault locating method in the first embodiment of the present invention. In FIG.
  • the first port 101 is disconnected, and the monitoring module of the network device 1 where the first port 101 is located reports the first to the network management device. If the port 101 is disconnected, the network management device learns the second port 201 connected to the first port 101 according to the network topology, and determines whether the port connected to the network device 2 where the second port 201 is located is respectively connected. The alarm information of the third port is faulty. If the network device 2 is faulty, the network module where the second port 201 is located is faulty.
  • the network topology information of the network is obtained, and the network topology information includes the connection relationship of each port of each network module of each network device, and receives the alarm information of the first port reported by the first network device, according to the
  • the network topology information determines a second network device where the second port connected to the first port is located, and locates the network fault according to the information of the second network device, that is, only needs to be connected according to the first port that generates the alarm information.
  • the second network where the second port is located Device information can locate network faults, eliminate the need to build large and complex rule bases, and do not need to match large and complex rule bases. Therefore, the efficiency of network fault location can be improved.
  • FIG. 5 is a schematic structural diagram of Embodiment 1 of a network fault locating device according to the present invention.
  • the device in this embodiment may be deployed in a network management device, where the network includes at least two network devices, and each network device includes M network modules, each of which The network module includes N ports, where the M is an integer greater than or equal to 1, and the N is an integer greater than or equal to 1, and includes: an obtaining module 501, a receiving module 502, a determining module 503, and a processing module 504, where the acquiring module 501
  • the network topology information is used to obtain the network topology information of the foregoing network, where the network topology information includes a connection relationship of each port of each network module of each network device, and the receiving module 502 is configured to receive an alarm of the first port reported by the first network device.
  • the determining module 503 is configured to determine, according to the network topology information, a second network device where the second port connected to the first port is located, and the processing module 504 is configured to locate
  • the processing module 504 is specifically configured to determine whether the alarm information of the second port is received. If the alarm information of the second port is received, the network fault is located to include the first port and The link between the second ports is faulty.
  • the processing module 504 is further configured to: if the alarm information of the second port is not received, locate the network fault, including the second network device itself, or the second port is located. The network module is faulty.
  • the processing module 504 is specifically configured to determine whether the alarm information sent by any port of the second network device is received, and if the alarm information sent by any port of the second network device is received, Locating the network fault includes a failure of the network module where the second port is located.
  • the processing module 504 is further configured to determine, according to the network topology information, that all ports of the second network device are different, if the alarm information sent by any port of the second network device is not received. Connected to the P third ports; determine whether the alarm information reported by the P third ports respectively connected to all the ports of the second network device is received, if the And the alarm information reported by the P third ports respectively connected to the ports of the second network device, where the network fault is located includes the second network device itself being faulty.
  • the processing module 504 is further configured to: if the alarm information reported by the P third ports respectively connected to the ports of the second network device is not received, locate the network fault, including the second The network module where the port is located is faulty.
  • the device in the foregoing embodiment is applicable to the technical solution of the method embodiment shown in FIG. 1 , and the implementation principle and technical effects are similar, and details are not described herein again.
  • FIG. 6 is a schematic structural diagram of Embodiment 2 of a network fault locating device according to the present invention.
  • the device in this embodiment at least includes: a processor 601, a memory 602, a communication interface 603, and a bus 604.
  • the processor 601, the memory 602, and the communication interface 603 communicate via the bus 604.
  • the above memory 602 is used to store programs. Specifically, the program code may be included in the program, and the program code includes a computer execution instruction.
  • the above memory 602 may be a high speed RAM memory or a non-volatile memory such as at least one disk memory.
  • the processor 601 is configured to execute the execution instructions stored by the memory 602, which may be a single-core or multi-core CPU, or an ASIC, or one or more integrated circuits configured to implement the embodiments of the present invention.
  • the communication interface 603 described above is used to communicate with a network device.
  • the processor 601 runs a program to execute the following instructions:
  • the network topology information includes a connection relationship of each port of each network module of each network device; receiving the alarm information of the first port reported by the first network device; The network topology information determines a second network device where the second port connected to the first port is located, and locates a network fault according to the information of the second network device.
  • the device in the foregoing embodiment is applicable to the technical solution of the method embodiment shown in FIG. 1 , and the implementation principle and technical effects are similar, and details are not described herein again.
  • the steps can be completed by the relevant hardware of the program instructions.
  • the aforementioned program can be stored in a computer readable storage medium.
  • the program when executed, performs the steps including the foregoing method embodiments; and the foregoing storage medium includes various media that can store program codes, such as a ROM, a RAM, a magnetic disk, or an optical disk.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

本发明实施例提供一种网络故障定位方法和装置,通过获取网络的网络拓扑信息,网络拓扑信息包含每个网络设备的每个网络模块的每个端口的的连接关系;接收第一网络设备上报的第一端口的告警信息,根据网络拓扑信息,确定与上述第一端口相连的第二端口所在的第二网络设备,根据第二网络设备的信息,定位网络故障,也就是,仅需根据与产生告警信息的第一端口连接的第二端口所在的第二网络设备的信息,就可以定位网络故障,无需建立庞大且复杂的规则库,也无需与庞大且复杂的规则库进行匹配,因此,可以提高网络故障定位的效率。

Description

网络故障定位方法和装置 技术领域
本发明实施例涉及通信技术,尤其涉及一种网络故障定位方法和装置。
背景技术
随着通信技术的飞速发展,现有的网络系统架构日益复杂与庞大,人们对网络服务质量的要求也日益提高,若网络中链路或者网络设备发生故障,如何快速进行网络故障定位变得至关重要。
现有技术中,通过根据历史经验数据建立庞大且复杂的规则库,当故障发生时,通过描述故障发生的规则,与庞大且复杂的规则库进行匹配,对故障进行定位。
然而,采用现有技术的方法,需要建立庞大且复杂的规则库,并且要根据故障发生的规则与规则库进行匹配,网络故障定位的效率不高。
发明内容
本发明实施例提供一种网络故障定位方法和装置,以提高网络故障定位的效率。
本发明实施例第一方面提供一种网络故障定位方法,所述网络包含至少两个网络设备,每个网络设备包含M个网络模块,每个网络模块包含N个端口,其中,所述M为大于等于1的整数,所述N为大于等于1的整数,包括:
获取所述网络的网络拓扑信息,所述网络拓扑信息包含每个网络设备的每个网络模块的每个端口的的连接关系;
接收第一网络设备上报的第一端口的告警信息;
根据所述网络拓扑信息,确定与所述第一端口相连的第二端口所在的第二网络设备;
根据所述第二网络设备的信息,定位网络故障。
结合第一方面,在第一方面的第一种可能的实现方式中,所述根据所述第二网络设备的信息,定位网络故障,包括:
确定是否接收到所述第二端口的告警信息,如果接收到所述第二端口的告警信息,则定位所述网络故障包括所述第一端口和所述第二端口之间的链路故障。
结合第一方面的第一种可能的实现方式,在第一方面的第二种可能的实现方式中,所述方法还包括:
如果未接收到所述第二端口的告警信息,则定位所述网络故障包括所述第二网络设备自身故障,或者,所述第二端口所在的网络模块故障。
结合第一方面的第二种可能的实现方式,在第一方面的第三种可能的实现方式中,所述定位所述网络故障包括所述第二网络设备自身故障,或者,所述第二端口所在的网络模块故障,包括:
确定是否接收到所述第二网络设备的任一个端口发送的告警信息,如果接收到所述第二网络设备的任一个端口发送的告警信息,则定位所述网络故障包括所述第二端口所在的网络模块故障。
结合第一方面的第三种可能的实现方式,在第一方面的第四种可能的实现方式中,所述方法还包括:
如果未接收到所述第二网络设备的任一个端口发送的告警信息,则根据所述网络拓扑信息确定与所述第二网络设备的所有端口分别相连的P个第三端口;
确定是否接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,如果接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,则定位所述网络故障包括所述第二网络设备自身故障。
结合第一方面的第四种可能的实现方式,在第一方面的第五种可能的实现方式中,所述方法还包括:
如果未接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,则定位所述网络故障包括所述第二端口所在的网络模块故障。
本发明实施例第二方面提供一种网络故障定位装置,所述网络包含至少两个网络设备,每个网络设备包含M个网络模块,每个网络模块包含N个端口,其中,所述M为大于等于1的整数,所述N为大于等于1的整数,包括:
获取模块,用于获取所述网络的网络拓扑信息,所述网络拓扑信息包含每个网络设备的每个网络模块的每个端口的的连接关系;
接收模块,用于接收第一网络设备上报的第一端口的告警信息;
确定模块,用于根据所述网络拓扑信息,确定与所述第一端口相连的第二端口所在的第二网络设备;
处理模块,用于根据所述第二网络设备的信息,定位网络故障。
结合第二方面,在第二方面的第一种可能的实现方式中,所述处理模块具体用于确定是否接收到所述第二端口的告警信息,如果接收到所述第二端口的告警信息,则定位所述网络故障包括所述第一端口和所述第二端口之间的链路故障。
结合第二方面的第一种可能的实现方式,在第二方面的第二种可能的实现方式中,所述处理模块还用于如果未接收到所述第二端口的告警信息,则定位所述网络故障包括所述第二网络设备自身故障,或者,所述第二端口所在的网络模块故障。
结合第二方面的第二种可能的实现方式,在第二方面的第三种可能的实现方式中,所述处理模块具体用于确定是否接收到所述第二网络设备的任一个端口发送的告警信息,如果接收到所述第二网络设备的任一个端口发送的告警信息,则定位所述网络故障包括所述第二端口所在的网络模块故障。
结合第二方面的第三种可能的实现方式,在第二方面的第四种可能的实 现方式中,所述处理模块还用于如果未接收到所述第二网络设备的任一个端口发送的告警信息,则根据所述网络拓扑信息确定与所述第二网络设备的所有端口分别相连的P个第三端口;确定是否接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,如果接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,则定位所述网络故障包括所述第二网络设备自身故障。
结合第二方面的第四种可能的实现方式,在第二方面的第五种可能的实现方式中,所述处理模块还用于如果未接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,则定位所述网络故障包括所述第二端口所在的网络模块故障。
本发明实施例提供的网络故障定位方法和装置,通过获取网络的网络拓扑信息,网络拓扑信息包含每个网络设备的每个网络模块的每个端口的的连接关系;接收第一网络设备上报的第一端口的告警信息,根据网络拓扑信息,确定与上述第一端口相连的第二端口所在的第二网络设备,根据第二网络设备的信息,定位网络故障,也就是,仅需根据与产生告警信息的第一端口连接的第二端口所在的第二网络设备的信息,就可以定位网络故障,无需建立庞大且复杂的规则库,也无需与庞大且复杂的规则库进行匹配,因此,可以提高网络故障定位的效率。
附图说明
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。
图1为本发明网络故障定位方法实施例一的流程示意图;
图2为本发明网络故障定位方法实施例一的第一种应用场景示意图;
图3为本发明网络故障定位方法实施例一的第二种应用场景示意图;
图4为本发明网络故障定位方法实施例一的第三种应用场景示意图;
图5为本发明网络故障定位装置实施例一的结构示意图;
图6为本发明网络故障定位装置实施例二的结构示意图。
具体实施方式
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。
本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”、“第三”“第四”等(如果存在)是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本发明的实施例例如能够以除了在这里图示或描述的那些以外的顺序实施。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,例如,包含了一系列步骤或单元的过程、方法、系统、产品或设备不必限于清楚地列出的那些步骤或单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。
本发明的技术方案主要用于定位数据链路(L2)层的网络故障,网络中包含至少两个网络设备,每个网络设备包含M个网络模块,网络模块例如可以是网卡,每个网络模块可以包含N个端口,端口用于接收数据和发送数据;每个网络设备上具有一个监测模块,用于监测网络设备的各网络模块的各端口的状态,当监测到端口断开时,向网络管理设备上报第一端口的告警信息,端口断开可能是由于网络设备自身故障、网络模块故障、或者端口之间的链路故障等导致,这里所描述的网络设备自身故障是指网络设备整体故障。本发明的主要思想是获取网络的网络拓扑信息,网络拓扑信息包含每个网络设备的每个网络模块的每个端口的的连接关系,当接收第一网络上报的 第一端口的告警信息之后,根据所述网络拓扑信息,确定与第一端口相连的第二端口所在的第二网络设备,根据第二网络设备的信息,定位网络故障,其中,第二网络设备的信息是指第二网络设备的端口是否产生告警以及与第二网络设备的各端口相连的端口的是否产生告警。本发明的技术方案仅需根据与产生告警信息的第一端口连接的第二端口所在的第二网络设备的信息,就可以定位网络故障,无需建立庞大且复杂的规则库,也无需与庞大且复杂的规则库进行匹配,因此,可以提高网络故障定位的效率。
下面以具体地实施例对本发明的技术方案进行详细说明。下面这几个具体的实施例可以相互结合,对于相同或相似的概念或过程可能在某些实施例不再赘述。
图1为本发明网络故障定位方法实施例一的流程示意图,本实施例的执行主体是网络管理设备,本实施例的方法如下:
S101:获取网络的网络拓扑信息。
其中,网络拓扑信息包含每个网络设备的每个网络模块的每个端口的连接关系。
S102:接收第一网络设备上报的第一端口的告警信息。
第一网络设备的监测模块监测到第一网络设备上有端口断开时,则向网络管理设备上报该第一端口的告警信息,以使网络管理设备及时获知故障情况,进行故障处理。
本发明对S101和S102执行的先后顺序不做限定。
S103:根据所述网络拓扑信息,确定与第一端口相连的第二端口所在的第二网络设备。
S104:根据第二网络设备的信息,定位网络故障。
具体地,包括以下几种情况:
第一种情况:确定是否接收到上述第二端口的告警信息,如果接收到上述第二端口的告警信息,则定位上述网络故障包括上述第一端口和上述第二 端口之间的链路故障。如图2所示。图2为本发明网络故障定位方法实施例一的第一种应用场景示意图,图2中第一端口101断开,第一端口101所在的网络设备的监测模块则向网络管理设备上报第一端口101断开的告警,网络管理设备则根据网络拓扑结构,获知与第一端口101连接的第二端口201,则判断是否接收到第二端口201的告警信息,若接收到,则说明第一端口101和第二端口201之间的链路故障。若未接收到,则判断是否属于下述第二种情况或第三种情况。
在第一种情况的基础上,如果未接收到所述第二端口的告警信息,则说明导致第一端口断开的原因不是因为链路故障引起的,则定位所述网络故障包括所述第二网络设备自身故障,或者,所述第二端口所在的网络模块故障。具体地导致第一端口断开的原因是第二网络设备自身故障还是第二端口所在的网络模块故障,通过以下方案可以进行判断:
第二种情况:在确定导致第一端口断开的原因是第二网络设备自身故障或者,第二端口所在的网络模块故障,进一步地,确定是否接收到所述第二网络设备的任一个端口发送的告警信息,如果接收到所述第二网络设备的任一个端口发送的告警信息,说明第二网络设备自身未故障,则定位所述网络故障包括所述第二端口所在的网络模块故障。
第三种情况:在第二种情况的基础上,如果未接收到所述第二网络设备的任一个端口发送的告警信息,则根据网络拓扑信息确定与第二网络设备的所有端口分别相连的P个第三端口;确定是否接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,如果接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,则定位所述网络故障包括所述第二网络设备自身故障。如图3所示。图3为本发明网络故障定位方法实施例一的第二种应用场景示意图,图3中第一端口101断开,第一端口101所在的网络设备1的监测模块则向网络管理设备上报第一端口101断开的告警,网络管理设备则根据网络拓扑结构,获知与第一端口101 连接的第二端口201,第二端口201所在的网络设备2所有的端口为第二端口201、端口202、端口203和端口204,其分别对应的4个第三端口为第一端口101、端口301、端口401和端口501。如果未接收到第二端口201、端口202、端口203和端口204中的任一个端口发送的告警信息,如果接收到第一端口101、端口301、端口401和端口501上报的告警信息,则确定网络设备2故障。因为网络设备2故障,网络设备2的监测模块无法上报告警,因为与网络设备2的所有端口相连的第三端口无法与网络设备2进行数据通信,因此,与网络设备2的所有端口的第三端口都产生告警,因此,在这种情况下,可以确定,网络设备2故障。
第四种情况,在第三种情况的基础上,如果未接收到与第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,则定位网络故障包括所述第二端口所在的网络模块故障。因为,如果未接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,则说明不可能为第二网络设备自身故障,因此,引起第一端口断开的原因是第二端口所在的网络模块故障。例如:一种场景为如图4所示。图4为本发明网络故障定位方法实施例一的第三种应用场景示意图,图4中第一端口101断开,第一端口101所在的网络设备1的监测模块则向网络管理设备上报第一端口101断开的告警,网络管理设备则根据网络拓扑结构,获知与第一端口101连接的第二端口201,则判断是否接收到与第二端口201所在的网络设备2的所有端口分别相连的P个第三端口的告警信息,如果没有,排除网络设备2故障的情况,则说明第二端口201所在的网络模块故障。
本发明实施例,通过获取网络的网络拓扑信息,网络拓扑信息包含每个网络设备的每个网络模块的每个端口的的连接关系,接收第一网络设备上报的第一端口的告警信息,根据网络拓扑信息,确定与上述第一端口相连的第二端口所在的第二网络设备,根据第二网络设备的信息,定位网络故障,也就是,仅需根据与产生告警信息的第一端口连接的第二端口所在的第二网络 设备的信息,就可以定位网络故障,无需建立庞大且复杂的规则库,也无需与庞大且复杂的规则库进行匹配,因此,可以提高网络故障定位的效率。
图5为本发明网络故障定位装置实施例一的结构示意图,本实施例的装置可以部署在网络管理设备中,上述网络包含至少两个网络设备,每个网络设备包含M个网络模块,每个网络模块包含N个端口,其中,上述M为大于等于1的整数,上述N为大于等于1的整数,包括:获取模块501、接收模块502、确定模块503和处理模块504,其中,获取模块501用于获取上述网络的网络拓扑信息,上述网络拓扑信息包含每个网络设备的每个网络模块的每个端口的的连接关系;接收模块502用于接收第一网络设备上报的第一端口的告警信息;确定模块503用于根据上述网络拓扑信息,确定与上述第一端口相连的第二端口所在的第二网络设备;处理模块504用于根据上述第二网络设备的信息,定位网络故障。
在上述实施例中,处理模块504具体用于确定是否接收到所述第二端口的告警信息,如果接收到所述第二端口的告警信息,则定位所述网络故障包括所述第一端口和所述第二端口之间的链路故障。
在上述实施例中,处理模块504还用于如果未接收到所述第二端口的告警信息,则定位所述网络故障包括所述第二网络设备自身故障,或者,所述第二端口所在的网络模块故障。
在上述实施例中,处理模块504具体用于确定是否接收到所述第二网络设备的任一个端口发送的告警信息,如果接收到所述第二网络设备的任一个端口发送的告警信息,则定位所述网络故障包括所述第二端口所在的网络模块故障。
在上述实施例中,处理模块504还用于如果未接收到所述第二网络设备的任一个端口发送的告警信息,则根据所述网络拓扑信息确定与所述第二网络设备的所有端口分别相连的P个第三端口;确定是否接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,如果接收到与 所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,则定位所述网络故障包括所述第二网络设备自身故障。
在上述实施例中,处理模块504还用于如果未接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,则定位所述网络故障包括所述第二端口所在的网络模块故障。
上述实施例的装置对应的可用于执行图1所示方法实施例的技术方案,其实现原理和技术效果类似,在此不再赘述。
图6为本发明网络故障定位装置实施例二的结构示意图,如图6所示,本实施例的装置至少包括:处理器601、存储器602、通信接口603和总线604。其中,上述处理器601、上述存储器602和上述通信接口603通过上述总线604通信。
上述存储器602用于存放程序。具体的,程序中可以包括程序代码,上述程序代码包括计算机执行指令。上述存储器602可以为高速RAM存储器,也可以为非易失性存储器(non-volatile memory),例如至少一个磁盘存储器。
上述处理器601用于执行上述存储器602存储的执行指令,可能为单核或多核CPU,或者为ASIC,或者为被配置成实施本发明实施例的一个或多个集成电路。
上述通信接口603用于与网络设备进行通信。当网络故障定位装置运行时,处理器601运行程序,以执行以下指令:
获取所述网络的网络拓扑信息,所述网络拓扑信息包含每个网络设备的每个网络模块的每个端口的的连接关系;接收第一网络设备上报的第一端口的告警信息;根据所述网络拓扑信息,确定与所述第一端口相连的第二端口所在的第二网络设备;根据所述第二网络设备的信息,定位网络故障。
上述实施例的装置对应的可用于执行图1所示方法实施例的技术方案,其实现原理和技术效果类似,在此不再赘述。
本领域普通技术人员可以理解:实现上述各方法实施例的全部或部分步 骤可以通过程序指令相关的硬件来完成。前述的程序可以存储于一计算机可读取存储介质中。该程序在执行时,执行包括上述各方法实施例的步骤;而前述的存储介质包括:ROM、RAM、磁碟或者光盘等各种可以存储程序代码的介质。
最后应说明的是:以上各实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述各实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分或者全部技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的范围。

Claims (12)

  1. 一种网络故障定位方法,所述网络包含至少两个网络设备,每个网络设备包含M个网络模块,每个网络模块包含N个端口,其中,所述M为大于等于1的整数,所述N为大于等于1的整数,其特征在于,包括:
    获取所述网络的网络拓扑信息,所述网络拓扑信息包含每个网络设备的每个网络模块的每个端口的的连接关系;
    接收第一网络设备上报的第一端口的告警信息;
    根据所述网络拓扑信息,确定与所述第一端口相连的第二端口所在的第二网络设备;
    根据所述第二网络设备的信息,定位网络故障。
  2. 根据权利要求1所述的方法,其特征在于,所述根据所述第二网络设备的信息,定位网络故障,包括:
    确定是否接收到所述第二端口的告警信息,如果接收到所述第二端口的告警信息,则定位所述网络故障包括所述第一端口和所述第二端口之间的链路故障。
  3. 根据权利要求2所述的方法,其特征在于,还包括:
    如果未接收到所述第二端口的告警信息,则定位所述网络故障包括所述第二网络设备自身故障,或者,所述第二端口所在的网络模块故障。
  4. 根据权利要求3所述的方法,其特征在于,所述定位所述网络故障包括所述第二网络设备自身故障,或者,所述第二端口所在的网络模块故障,包括:
    确定是否接收到所述第二网络设备的任一个端口发送的告警信息,如果接收到所述第二网络设备的任一个端口发送的告警信息,则定位所述网络故障包括所述第二端口所在的网络模块故障。
  5. 根据权利要求4所述的方法,其特征在于,还包括:
    如果未接收到所述第二网络设备的任一个端口发送的告警信息,则根据 所述网络拓扑信息确定与所述第二网络设备的所有端口分别相连的P个第三端口;
    确定是否接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,如果接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,则定位所述网络故障包括所述第二网络设备自身故障。
  6. 根据权利要求5所述的方法,其特征在于,还包括:
    如果未接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,则定位所述网络故障包括所述第二端口所在的网络模块故障。
  7. 一种网络故障定位装置,所述网络包含至少两个网络设备,每个网络设备包含M个网络模块,每个网络模块包含N个端口,其中,所述M为大于等于1的整数,所述N为大于等于1的整数,其特征在于,包括:
    获取模块,用于获取所述网络的网络拓扑信息,所述网络拓扑信息包含每个网络设备的每个网络模块的每个端口的的连接关系;
    接收模块,用于接收第一网络设备上报的第一端口的告警信息;
    确定模块,用于根据所述网络拓扑信息,确定与所述第一端口相连的第二端口所在的第二网络设备;
    处理模块,用于根据所述第二网络设备的信息,定位网络故障。
  8. 根据权利要求7所述的装置,其特征在于,所述处理模块具体用于确定是否接收到所述第二端口的告警信息,如果接收到所述第二端口的告警信息,则定位所述网络故障包括所述第一端口和所述第二端口之间的链路故障。
  9. 根据权利要求8所述的装置,其特征在于,所述处理模块还用于如果未接收到所述第二端口的告警信息,则定位所述网络故障包括所述第二网络设备自身故障,或者,所述第二端口所在的网络模块故障。
  10. 根据权利要求9所述的装置,其特征在于,所述处理模块具体用于确定是否接收到所述第二网络设备的任一个端口发送的告警信息,如果接收到所述第二网络设备的任一个端口发送的告警信息,则定位所述网络故障包括所述第二端口所在的网络模块故障。
  11. 根据权利要求10所述的装置,其特征在于,所述处理模块还用于如果未接收到所述第二网络设备的任一个端口发送的告警信息,则根据所述网络拓扑信息确定与所述第二网络设备的所有端口分别相连的P个第三端口;确定是否接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,如果接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,则定位所述网络故障包括所述第二网络设备自身故障。
  12. 根据权利要求11所述的装置,其特征在于,所述处理模块还用于如果未接收到与所述第二网络设备的所有端口分别相连的P个第三端口上报的告警信息,则定位所述网络故障包括所述第二端口所在的网络模块故障。
PCT/CN2015/078201 2014-05-23 2015-05-04 网络故障定位方法和装置 WO2015176603A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410223410.2 2014-05-23
CN201410223410.2A CN103986604A (zh) 2014-05-23 2014-05-23 网络故障定位方法和装置

Publications (1)

Publication Number Publication Date
WO2015176603A1 true WO2015176603A1 (zh) 2015-11-26

Family

ID=51278431

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/078201 WO2015176603A1 (zh) 2014-05-23 2015-05-04 网络故障定位方法和装置

Country Status (2)

Country Link
CN (1) CN103986604A (zh)
WO (1) WO2015176603A1 (zh)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111953535A (zh) * 2020-07-31 2020-11-17 鹏城实验室 一种网络故障定位方法、终端及存储介质
CN114221882A (zh) * 2021-12-23 2022-03-22 锐捷网络股份有限公司 故障链路检测方法、装置、设备和存储介质
CN114520760A (zh) * 2020-11-20 2022-05-20 华为技术有限公司 一种跨域故障分析的方法及系统
CN115001573A (zh) * 2022-06-13 2022-09-02 中国电信股份有限公司 光缆故障定位方法、装置、电子设备及非易失性存储介质

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103986604A (zh) * 2014-05-23 2014-08-13 华为技术有限公司 网络故障定位方法和装置
CN107104856B (zh) * 2017-05-08 2019-07-30 北京北信源软件股份有限公司 一种hub设备识别方法及装置
CN112291075B (zh) * 2019-07-23 2022-08-30 中国移动通信集团浙江有限公司 网络故障定位方法、装置、计算机设备及存储介质
CN111277471B (zh) * 2020-02-24 2022-11-18 大连理工大学 一种无中心网络连接快速检测装置
CN111404728B (zh) * 2020-03-02 2023-06-06 广东优力普物联科技有限公司 网络设备故障监测方法及网络系统
CN113497721B (zh) * 2020-03-20 2023-08-01 中国移动通信集团四川有限公司 网络故障定位方法与装置
CN113114510B (zh) * 2021-04-22 2022-07-15 中国科学技术大学 一种网络故障信息的同步方法及装置
CN117424794A (zh) * 2022-07-11 2024-01-19 中兴通讯股份有限公司 根因定位方法、通信设备及计算机可读存储介质

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101707503A (zh) * 2009-07-03 2010-05-12 中兴通讯股份有限公司 嵌入式控制通道通讯故障自动定位方法和装置
CN102638375A (zh) * 2012-04-26 2012-08-15 北京星网锐捷网络技术有限公司 一种网络故障识别方法及装置
CN102739445A (zh) * 2012-06-18 2012-10-17 中兴通讯股份有限公司 一种环网故障快速定位的方法和系统
CN103986604A (zh) * 2014-05-23 2014-08-13 华为技术有限公司 网络故障定位方法和装置

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100550787C (zh) * 2006-08-29 2009-10-14 郑州威科姆技术开发有限公司 网络故障节点诊断方法
CN101931982A (zh) * 2010-08-18 2010-12-29 北京星网锐捷网络技术有限公司 一种网络故障定位方法及装置

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101707503A (zh) * 2009-07-03 2010-05-12 中兴通讯股份有限公司 嵌入式控制通道通讯故障自动定位方法和装置
CN102638375A (zh) * 2012-04-26 2012-08-15 北京星网锐捷网络技术有限公司 一种网络故障识别方法及装置
CN102739445A (zh) * 2012-06-18 2012-10-17 中兴通讯股份有限公司 一种环网故障快速定位的方法和系统
CN103986604A (zh) * 2014-05-23 2014-08-13 华为技术有限公司 网络故障定位方法和装置

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111953535A (zh) * 2020-07-31 2020-11-17 鹏城实验室 一种网络故障定位方法、终端及存储介质
CN111953535B (zh) * 2020-07-31 2023-06-09 鹏城实验室 一种网络故障定位方法、终端及存储介质
CN114520760A (zh) * 2020-11-20 2022-05-20 华为技术有限公司 一种跨域故障分析的方法及系统
CN114520760B (zh) * 2020-11-20 2023-08-22 华为技术有限公司 一种跨域故障分析的方法及系统
CN114221882A (zh) * 2021-12-23 2022-03-22 锐捷网络股份有限公司 故障链路检测方法、装置、设备和存储介质
CN115001573A (zh) * 2022-06-13 2022-09-02 中国电信股份有限公司 光缆故障定位方法、装置、电子设备及非易失性存储介质

Also Published As

Publication number Publication date
CN103986604A (zh) 2014-08-13

Similar Documents

Publication Publication Date Title
WO2015176603A1 (zh) 网络故障定位方法和装置
CN110752952B (zh) 网络故障定位方法、装置、网络设备及计算机存储介质
US20190140890A1 (en) Method and system of a dynamic high-availability mode based on current wide area network connectivity
CN106155260B (zh) 服务器的系统与管理方法以及计算机可读存储介质
MX2016009433A (es) Metodo de manejo de falla del servicio de red, sistema de gestion de servicio, y modulo de gestion de sistema.
CN106911648B (zh) 一种环境隔离方法及设备
WO2016206386A1 (zh) 一种故障关联方法和装置
CN103138988B (zh) 网络故障的定位处理方法及装置
WO2016112676A1 (zh) 告警处理方法及装置
US20130235718A1 (en) Path switch-back method and apparatus in transport network
CN106982244B (zh) 在云网络环境下实现动态流量的报文镜像的方法和装置
CN104615476A (zh) 用于所选择的虚拟机复制和虚拟机重新启动的方法和系统
CN102420820A (zh) 一种集群系统中的隔离方法和装置
US10417101B2 (en) Fault monitoring device, virtual network system, and fault monitoring method
CN101820359A (zh) 一种网络设备的故障处理方法和设备
CN102664755B (zh) 控制通道故障确定方法及其装置
WO2012075743A1 (zh) 一种以太环网链路保护倒换的方法及装置
WO2019079961A1 (zh) 一种确定共享风险链路组的方法及装置
CN110708715A (zh) 一种5g基站业务故障查找方法及装置
US20140192817A1 (en) Virtual link aggregation using state machine
WO2016082509A1 (zh) 一种检测标签交换路径连通性的方法及装置
CN115766405B (zh) 一种故障处理方法、装置、设备和存储介质
RU2693903C1 (ru) Способ, устройство и система обработки для расширенного порта
CN112653753B (zh) 基于rpc的多机房独立多活方法、系统及电子设备
US11916739B2 (en) Mitigation of physical network misconfigurations for clustered nodes

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15795674

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15795674

Country of ref document: EP

Kind code of ref document: A1