CN112953744B - Network fault monitoring method, system, computer equipment and readable storage medium - Google Patents

Network fault monitoring method, system, computer equipment and readable storage medium Download PDF

Info

Publication number
CN112953744B
CN112953744B CN201911257749.3A CN201911257749A CN112953744B CN 112953744 B CN112953744 B CN 112953744B CN 201911257749 A CN201911257749 A CN 201911257749A CN 112953744 B CN112953744 B CN 112953744B
Authority
CN
China
Prior art keywords
router
network
monitored
alarm information
virtual private
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911257749.3A
Other languages
Chinese (zh)
Other versions
CN112953744A (en
Inventor
王小冬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Unihub China Information Technology Co Ltd
Original Assignee
Unihub China Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Unihub China Information Technology Co Ltd filed Critical Unihub China Information Technology Co Ltd
Priority to CN201911257749.3A priority Critical patent/CN112953744B/en
Publication of CN112953744A publication Critical patent/CN112953744A/en
Application granted granted Critical
Publication of CN112953744B publication Critical patent/CN112953744B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/06Generation of reports
    • H04L43/065Generation of reports related to network devices
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0805Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
    • H04L43/0811Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking connectivity
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/46Interconnection of networks
    • H04L12/4641Virtual LANs, VLANs, e.g. virtual private networks [VPN]

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Environmental & Geological Engineering (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The embodiment of the invention provides a network fault monitoring method, a system, computer equipment and a readable storage medium, wherein the method comprises the following steps: receiving first alarm information of a connection state between a router and a router to be monitored, wherein the first alarm information is reported in real time by a route reflector in an ISP network to be monitored; and aiming at each router in the ISP network to be monitored, when the first alarm information of the connection state between the router and all the route reflectors connected with the router is acquired, determining that the router is off-line in IP and off-line in virtual private network service. According to the scheme, based on the connection state between the router reflector and the connected router, whether the router has an IP off-network fault or not can be judged, whether the VPN service off-network fault occurs or not can be judged, a plurality of devices can be prevented from being polled at intervals, the timeliness of detection can be relatively improved, the influence of network jitter on the detection result is avoided, and the accuracy of the detection result can be relatively improved.

Description

Network fault monitoring method, system, computer equipment and readable storage medium
Technical Field
The present invention relates to the field of network monitoring technologies, and in particular, to a network fault monitoring method, a system, a computer device, and a readable storage medium.
Background
In the network of the operator, the occurrence of "IP off-line" or "VPN ((Virtual Private Network): virtual private network) service off-line" of the network device is a very serious fault, which often affects the network service of a certain area or a certain large customer, so that the fault needs to be accurately monitored in real time in the network management system. The traditional monitoring mode is generally that a network manager polls and dials the connectivity between the network manager and the network equipment of an operator to judge whether the network equipment is off-line or not through a ping measurement mode. This approach has several drawbacks: 1. the polling detection interval and the number of ping detection devices can lead to the detection timeliness being reduced by 2. Network jitter between a network and operator network devices can influence detection results and even false alarm 3. Off-network faults of a VPN service layer can not be detected.
Disclosure of Invention
The embodiment of the invention provides a network fault monitoring method, which aims to solve the technical problems that the detection timeliness is low, the accuracy is low, and the VPN service off-network fault cannot be detected in the prior art. The method comprises the following steps:
receiving first alarm information of a connection state between a router and a router to be monitored, wherein the first alarm information is reported in real time by a route reflector in an Internet service provider network;
and aiming at each router in the Internet service provider network to be monitored, when first alarm information of connection states between the router and all route reflectors connected with the router is acquired, determining that IP (Internet protocol) off-network and virtual private network service off-network of the router occur.
The embodiment of the invention also provides a network fault monitoring system to solve the technical problems that the detection timeliness is low, the accuracy is low, and the VPN service off-network fault cannot be detected in the prior art. The system comprises:
the receiving server is used for receiving first alarm information of the connection state between the router and the router to be monitored, wherein the first alarm information is reported in real time by the route reflector in the Internet service provider network;
and the fault judging server is used for determining that the router is off-line with IP and off-line with the virtual private network service aiming at each router in the Internet service provider network to be monitored when the first alarm information of the connection state between the router and all the router reflectors connected with the router is acquired.
The embodiment of the invention also provides computer equipment, which comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the processor realizes any network fault monitoring method when executing the computer program so as to solve the technical problems that the detection timeliness is low, the accuracy is low and the VPN service off-network fault cannot be detected in the prior art.
The embodiment of the invention also provides a computer readable storage medium which stores a computer program for executing any network fault monitoring method, so as to solve the technical problems that the detection timeliness is low, the accuracy is low, and the VPN service off-line fault cannot be detected in the prior art.
In the embodiment of the invention, the first alarm information of the connection state between a Route Reflector (RR) and a connected router (R) in an Internet Service Provider (ISP) network to be monitored is received, and then when the first alarm information of the connection state between the route reflector and the router, which is reported by all route reflectors connected with the router, is obtained for each router in the ISP network to be monitored, the occurrence of IP off-network and Virtual Private Network (VPN) service off-network of the router is determined, namely, the determination of whether each router has the IP off-network fault or not is made based on the first alarm information of the connection state between the route reflector and the connected router, which is reported by the route reflector in real time, because the occurrence of the IP off-network of the router leads to the VPN service off-network, so that the occurrence of the IP off-network fault of the router is determined, and the occurrence of the VPN service off-network fault of the VPN can be determined at the same time; in addition, because the method determines whether each router has an IP off-network fault based on the first alarm information of the connection state between the router and itself reported by the route reflector in real time, compared with the scheme of judging whether the network equipment is off-network by polling and measuring the connectivity between the network manager and the network equipment of an operator in the prior art in a ping measurement mode, the method can avoid polling at intervals and polling a plurality of equipment, so that the timeliness of detection can be relatively improved, the detection result is prevented from being influenced by network jitter, and the accuracy of the detection result can be relatively improved.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate and together with the description serve to explain the invention. In the drawings:
fig. 1 is a flowchart of a network fault monitoring method according to an embodiment of the present invention;
fig. 2 is a functional schematic diagram of implementing the above network fault monitoring method in an ISP network according to an embodiment of the present invention;
FIG. 3 is a block diagram of a computer device according to an embodiment of the present invention;
fig. 4 is a block diagram of a network fault monitoring system according to an embodiment of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the following embodiments and the accompanying drawings, in order to make the objects, technical solutions and advantages of the present invention more apparent. The exemplary embodiments of the present invention and the descriptions thereof are used herein to explain the present invention, but are not intended to limit the invention.
In an embodiment of the present invention, a network fault monitoring method is provided, as shown in fig. 1, where the method includes:
step 102: receiving first alarm information of a connection state between a router and a router to be monitored, wherein the first alarm information is reported in real time by a route reflector in an Internet service provider network;
step 104: and aiming at each router in the Internet service provider network to be monitored, when first alarm information of connection states between the router and all route reflectors connected with the router is acquired, determining that IP (Internet protocol) off-network and virtual private network service off-network of the router occur.
As can be seen from the flow shown in fig. 1, in the embodiment of the present invention, it is proposed to receive the first alarm information of the connection state between the Router Reflector (RR) and the connected router (R) reported in real time in the Internet Service Provider (ISP) network to be monitored, and further, for each router in the ISP network to be monitored, when the first alarm information of the connection state between the router and all the router reflectors connected to the router is obtained, it is determined that the router is out of network for IP and Virtual Private Network (VPN) service, that is, it is proposed to determine whether each router is out of network for IP based on the first alarm information of the connection state between the router reflector and the connected router reported in real time, because the router is out of network for VPN service, so that it is determined that the router is out of network for IP service, and meanwhile, it is also possible to determine that the VPN service is out of network for VPN service fault to occur; in addition, because the method determines whether each router has an IP off-network fault based on the first alarm information of the connection state between the router and itself reported by the route reflector in real time, compared with the scheme of judging whether the network equipment is off-network by polling and measuring the connectivity between the network manager and the network equipment of an operator in the prior art in a ping measurement mode, the method can avoid polling at intervals and polling a plurality of equipment, so that the timeliness of detection can be relatively improved, the detection result is prevented from being influenced by network jitter, and the accuracy of the detection result can be relatively improved.
In specific implementation, the ISP network is built based on BGP (border gateway protocol), in order to control the number of BGP connections, RR (route reflector) and VRR (VPN route reflector) are also generally planned, the route transmitter and the router are generally CS (client and server mode) architecture, the route transmitter and the router enable iBGP protocol connection, and considering that each router is a dual-uplink route reflector, the inventor of the present application proposes to determine whether the router has an IP off-network based on the first alarm information of the connection state between itself and the router reported by the RR, and the first alarm information of the connection state between itself and the router reported by the RR may be BGP connection state, for example, the connection state between the RR and the router is BGP connection state, and the general connection state includes the following states:
(1) idle (Idle): idle is the first state of BGP connection, in Idle state, BGP waits for a start event, after which BGP initializes resources, resets a connection Retry timer (Connect-Retry), initiates a TCP connection, and transitions to Connect state.
(2) Connect (Connect): in the Connect state, BGP initiates a first TCP connection, and if a connection Retry timer (Connect-Retry) expires, re-initiates the TCP connection and continues to remain in the Connect state, and if the TCP connection is successful, transitions to the opencount state, and if the TCP connection fails, transitions to the Active state.
(3) Active (Active): in the Active state, BGP always tries to establish a TCP connection, returns to the Connect state if a connection Retry timer (Connect-Retry) expires, transitions to the opencount state if the TCP connection is successful, continues to remain in the Active state if the TCP connection fails, and continues to initiate the TCP connection.
(4) Opencount (open message sent): in the opencount state, a TCP connection has been established, BGP has also sent the first Open message, and the rest of the work, BGP waits for its peer to send the Open message. And carrying out correctness checking on the received Open message, if the error exists, sending an error notification message by the system and returning to the Idle state, if the error does not exist, starting sending a Keepalive message by the BGP, resetting a Keepalive timer, and starting timing. And simultaneously transitions to the OpenConfirm state.
(5) OpenConfirm state: in the OpenConfirm state, BGP sends a keep message, and resets the keep timer, if a keep message is received, the BGP neighbor relation is Established by switching to the published phase. If the TCP connection is interrupted, the Idle state is returned.
(6) Established connection: in the published state, BGP neighbor relations have been Established, at which point BGP will exchange Update messages with its neighbors while resetting the hold timer.
Among the above states, only the Established state will not generate an alarm, and the RR will not report the alarm information in this state, and in Idle, connect, active, opencount, openConfirm, etc. states, the alarm information is generated, and the RR will report the alarm information.
In a specific implementation, the function of receiving the first alarm information may be implemented by a receiving server, and the function of determining whether the router has an IP off-network or a VPN service off-network may be implemented by a fault determining server, for example, as shown in fig. 2, in an ISP network (operator), a framework of CS (client and server modes) is between the route reflectors RR1, RR2 and the routers R1 to R4, which enable iBGP protocol connection, each router is a dual-uplink route reflector, and if the router R1 is down for some reason, the route reflector RR1 and the route reflector RR2 detect a change in the connection state between itself and the router R1, so as to trigger an alarm that "BGP neighbors are not reachable (R1)", and meanwhile, the route reflector RR1 and the route reflector RR2 report the alarm message to the receiving server of the network. The receiving server forwards the received alarm message to the fault judging server, if the fault judging server receives the message of 'BGP neighbor unreachable (R1)' reported by the route reflector RR1 and the route reflector RR2 respectively, and in combination with network topology analysis, the route reflector connected with the router R1 only has the route reflector RR1 and the route reflector RR2, that is, all the route reflectors RR1 and RR2 connected with the router R1 report the alarm message that BGP connection of R1 is unreachable, it is judged that the router R1 has an IP off-line fault, but the router has an IP off-line, which results in VPN service off-line, so that it can also be judged that the router R1 has VPN service off-line at this time.
In a specific implementation, since the VPN service is not necessarily off-line with IP, in order to determine that the VPN service is off-line when the IP is not off-line, in this embodiment, the method further includes: receiving second alarm information of a connection state between a VPN Route Reflector (VRR) in the ISP network to be monitored and a connected router, wherein the second alarm information is reported in real time by the VPN Route Reflector (VRR);
and aiming at each router carrying VPN service in the ISP network to be monitored, determining that VPN service is off-line when second alarm information of connection states between the router and all VPN route reflectors connected with the router is acquired.
In the implementation, in the ISP network, the router carrying the VPN service needs to establish iBGP connection with the VRR router, i.e., the second alarm information may be alarm information of BGP connection status, for example, an alarm that "BGP neighbor is unreachable (Rn)".
In specific implementation, as shown in fig. 2, a router R4 carrying VPN traffic establishes connection with a VPN route reflector VRR1 and a VPN route reflector VRR2 respectively, if a receiving server receives a message of "BGP neighbor unreachable (R4)" reported by the VPN route reflector VRR1 and the VPN route reflector VRR2 at the same time, and forwards the message to a failure determination server, if the failure determination server receives a message of "BGP neighbor unreachable (R4)" reported by the VPN route reflector VRR1 and the VPN route reflector VRR2 respectively, in combination with network topology analysis, the route reflector connected by the router R4 only has VRR1 and VRR2, that is, all VPN route reflectors VRR1 and VRR2 connected by the router R4 report BGP connection unreachable alarm information of R1, it is determined that the PN traffic is off-network by the router R4.
In a specific implementation, in order to implement reporting of the alarm information, in this embodiment, the routing reflector and the VPN routing reflector may enable syslog service, the routing reflector reports the first alarm information through a syslog protocol, and the VPN routing reflector also reports the second alarm information through a syslog (system log) protocol, and a receiving server that receives the first alarm information and the second alarm information may be a syslog receiving server.
In a specific implementation, since the router and the VPN route reflector in the ISP network report a connection status fault through a syslog protocol, where the syslog protocol is a UDP protocol in a transport layer, the UDP protocol is an unreliable protocol, and it cannot be guaranteed that all syslog messages can be sent to a receiving server, so as to avoid that the accuracy of a detection result is reduced due to missing messages at the receiving server, in this embodiment, the method further includes:
polling a route reflector in the ISP network to be monitored, and acquiring first abnormal connection state information between the route reflector and a connected router;
and combining the first alarm information and the first abnormal connection state information to determine whether each router in the to-be-monitored Internet service provider network has IP off-network.
In the implementation, the first alarm information and the first abnormal connection state information are combined to determine whether each router in the internet service provider network to be monitored has an IP offline process, if the acquired first alarm information is enough to determine that the router has an IP offline process, the first abnormal connection state information is not needed to be considered, if the acquired first alarm information is insufficient to determine that the router has an IP offline process (such as failure message failure report of a certain router reflector), the first abnormal connection state information is considered to be combined to determine whether the router has an IP offline process. For example, taking fig. 2 as an example, if both the route reflector RR1 and the route reflector RR2 report an alarm message to the receiving server of the network. The receiving server forwards the received alarm message to the fault judging server, if the fault judging server receives the message of 'BGP neighbor unreachable (R1)' reported by the route reflector RR1 and the route reflector RR2 respectively, and the route reflector connected with the router R1 only has the route reflector RR1 and the route reflector RR2 combined with network topology analysis, namely all the route reflectors RR1 and RR2 connected with the router R1 report the alarm information of the unreachable BGP connection of the R1, the first abnormal connection state information about the router R1 acquired by the route reflector RR1 and the route reflector RR2 is discarded or not considered, and the IP off-network fault of the router R1 is directly judged. If the failure determination server only receives a message of "BGP neighbor unreachable (R1)" reported by the routing reflector RR1, in combination with network topology analysis, when the routing reflector connected by the routing reflector R1 has the routing reflector RR1 and the routing reflector RR2, that is, not all the routing reflectors RR1 and RR2 connected by the routing reflector R1 report alarm information that BGP connection of R1 is unreachable, in combination with consideration of first abnormal connection state information about the router R1 acquired by polling the routing reflector RR1 and the routing reflector RR2, if the first abnormal connection state information acquired by polling includes alarm information that BGP connection of R1 detected by the routing reflectors RR1 and RR2 is unreachable (when message missing occurs to the routing reflector RR 2), the router R1 is directly determined to have an IP off-network failure, otherwise, the determination that IP off-network failure occurs to the router R1 is not made.
In a specific implementation, in order to avoid that the accuracy of the detection result is reduced due to the missing message of the receiving server, in this embodiment, the method further includes: polling a VPN route reflector in the ISP network to be monitored to acquire second abnormal connection state information between the VPN route reflector and the connected router;
and combining the second alarm information and the second abnormal connection state information to determine whether VPN service disconnection occurs to each router carrying VPN service in the ISP network to be monitored.
Specifically, the process of determining whether the VPN service is off-line for each router carrying VPN service in the ISP network to be monitored by using the second alarm information and the second abnormal connection state information is similar to the process of determining whether the IP off-line for each router in the internet service provider network to be monitored by combining the first alarm information and the first abnormal connection state information, which is not described herein.
In specific implementation, the functions of the polling route reflector and the VPN route reflector can be realized through one acquisition server, so that the connection states between the route reflector and the VPN route reflector and the router can be actively acquired, the defect of a udp protocol is overcome, and the possibility of failure and missing report is reduced.
In particular implementations, the route reflector and VPN route reflector may initiate a snmp (simple network management protocol) service supporting polling of BGP session states of the route reflector and VPN route reflector to all routers by means of the snmp.
In the implementation, for the failure results of determining that the router is off-line in IP or off-line in VPN service, the failure results may be notified to a specific device responsible person in a manner of a short message or mail.
In specific implementation, the network fault monitoring method can be implemented on a network management network.
In this embodiment, a computer device is provided, as shown in fig. 3, including a memory 302, a processor 304, and a computer program stored on the memory and capable of running on the processor, where the processor implements any of the network failure monitoring methods described above when executing the computer program.
In particular, the computer device may be a computer terminal, a server or similar computing means.
In the present embodiment, a computer-readable storage medium storing a computer program for executing any of the above-described network failure monitoring methods is provided.
In particular, computer-readable storage media, including both permanent and non-permanent, removable and non-removable media, may be used to implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer-readable storage media include, but are not limited to, phase-change memory (PRAM), static Random Access Memory (SRAM), dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), read Only Memory (ROM), electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), digital Versatile Disks (DVD) or other optical storage, magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices, or any other non-transmission medium, which can be used to store information that can be accessed by a computing device. Computer-readable storage media, as defined herein, does not include transitory computer-readable media (transmission media), such as modulated data signals and carrier waves.
Based on the same inventive concept, the embodiment of the invention also provides a network fault monitoring system, as described in the following embodiment. Because the principle of solving the problem of the network fault monitoring system is similar to that of the network fault monitoring method, the implementation of the network fault monitoring system can refer to the implementation of the network fault monitoring method, and the repetition is omitted. As used below, the term "unit" or "module" may be a combination of software and/or hardware that implements the intended function. While the means described in the following embodiments are preferably implemented in software, implementation in hardware, or a combination of software and hardware, is also possible and contemplated.
Fig. 4 is a block diagram of a network fault monitoring system according to an embodiment of the present invention, as shown in fig. 4, the system includes:
a receiving server 402, configured to receive first alarm information of a connection state between itself and a connected router, which is reported in real time by a route reflector in an internet service provider network to be monitored;
the failure determination server 404 is configured to determine, for each router in the to-be-monitored internet service provider network, that the router is IP-off-network and virtual private network service-off-network when first alarm information of connection status with the router, reported by all route reflectors connected to the router, is acquired.
In one embodiment, the receiving server is further configured to receive second alarm information of a connection state between itself and a connected router, which is reported in real time by a virtual private network route reflector in the internet service provider network to be monitored;
the fault judging server is further configured to determine, for each router carrying a virtual private network service in the internet service provider network to be monitored, that the router is disconnected from the virtual private network service when second alarm information of connection states between the router and all virtual private network route reflectors connected to the router are acquired.
In one embodiment, the receiving server is configured to receive, through a syslog protocol, the first alarm information reported by a route reflector in an internet service provider network to be monitored; and receiving the second alarm information reported by the virtual private network routing reflector in the to-be-monitored Internet service provider network through a syslog protocol.
In one embodiment, the system further comprises:
the acquisition server is used for polling the route reflector in the Internet service provider network to be monitored and acquiring first abnormal connection state information between the route reflector and the connected router;
the failure determination server is further configured to determine whether each router in the to-be-monitored internet service provider network is IP-off-network by combining the first alarm information and the first abnormal connection state information.
In one embodiment, the collecting server is further configured to poll a virtual private network routing reflector in the internet service provider network to be monitored, and obtain second abnormal connection state information between the virtual private network routing reflector and the connected router;
the failure determination server is further configured to determine, in combination with the second alarm information and the second abnormal connection state information, whether each router carrying the virtual private network service in the internet service provider network to be monitored has a virtual private network service offline.
The embodiment of the invention realizes the following technical effects: the method comprises the steps of receiving first alarm information of a connection state between a Router Reflector (RR) and a connected router (R) reported in real time in an Internet Service Provider (ISP) network to be monitored, further determining that the router has IP off-network and Virtual Private Network (VPN) service off-network when acquiring the first alarm information of the connection state between the router and all the router reflectors connected with the router, namely determining whether each router has IP off-network fault or not based on the first alarm information of the connection state between the router reflector and the connected router reported in real time, wherein the router has IP off-network fault, so that the VPN service off-network fault can be judged when judging that the router has IP off-network fault; in addition, because the method determines whether each router has an IP off-network fault based on the first alarm information of the connection state between the router and itself reported by the route reflector in real time, compared with the scheme of judging whether the network equipment is off-network by polling and measuring the connectivity between the network manager and the network equipment of an operator in the prior art in a ping measurement mode, the method can avoid polling at intervals and polling a plurality of equipment, so that the timeliness of detection can be relatively improved, the detection result is prevented from being influenced by network jitter, and the accuracy of the detection result can be relatively improved.
It will be apparent to those skilled in the art that the modules or steps of the embodiments of the invention described above may be implemented in a general purpose computing device, they may be concentrated on a single computing device, or distributed across a network of computing devices, they may alternatively be implemented in program code executable by computing devices, so that they may be stored in a storage device for execution by computing devices, and in some cases, the steps shown or described may be performed in a different order than what is shown or described, or they may be separately fabricated into individual integrated circuit modules, or a plurality of modules or steps in them may be fabricated into a single integrated circuit module. Thus, embodiments of the invention are not limited to any specific combination of hardware and software.
The above description is only of the preferred embodiments of the present invention and is not intended to limit the present invention, and various modifications and variations can be made to the embodiments of the present invention by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (6)

1. A method for monitoring network faults, comprising:
receiving first alarm information of a connection state between a router and a router to be monitored, wherein the first alarm information is reported in real time by a route reflector in an Internet service provider network;
aiming at each router in the to-be-monitored Internet service provider network, when first alarm information of connection states between the router and all route reflectors connected with the router is obtained, determining that IP (Internet protocol) off-network and virtual private network service off-network of the router occur;
receiving second alarm information of the connection state between the virtual private network route reflector and the connected router, wherein the second alarm information is reported in real time by the virtual private network route reflector in the to-be-monitored Internet service provider network;
determining that the router has virtual private network service off-line when obtaining second alarm information of connection states between the router and all virtual private network routing reflectors connected with the router, aiming at each router carrying virtual private network service in the to-be-monitored Internet service provider network;
further comprises:
polling a route reflector in the to-be-monitored Internet service provider network to acquire first abnormal connection state information between the route reflector and a connected router;
determining whether each router in the internet service provider network to be monitored is out of network by combining the first alarm information and the first abnormal connection state information;
polling a virtual private network routing reflector in the to-be-monitored Internet service provider network to acquire second abnormal connection state information between the virtual private network routing reflector and a connected router;
and combining the second alarm information and the second abnormal connection state information to determine whether each router carrying the virtual private network service in the to-be-monitored Internet service provider network is off-line with the virtual private network service.
2. The network failure monitoring method of claim 1, wherein,
receiving the first alarm information reported by a route reflector in an Internet service provider network to be monitored through a system log protocol;
and receiving the second alarm information reported by the virtual private network routing reflector in the network of the Internet service provider to be monitored through a system log protocol.
3. A network failure monitoring system, comprising:
the receiving server is used for receiving first alarm information of the connection state between the router and the router to be monitored, wherein the first alarm information is reported in real time by the route reflector in the Internet service provider network;
the fault judging server is used for determining that the router is off-line with IP and off-line with virtual private network service when acquiring first alarm information of connection state between the router and all route reflectors connected with the router aiming at each router in the to-be-monitored Internet service provider network;
the receiving server is further configured to receive second alarm information of a connection state between itself and a connected router, which is reported in real time by a virtual private network routing reflector in the internet service provider network to be monitored;
the fault judging server is further configured to determine, for each router that carries a virtual private network service in the internet service provider network to be monitored, that the router is off-line when second alarm information of connection states with the router, reported by all virtual private network route reflectors connected to the router, is obtained;
further comprises: the acquisition server is used for polling the route reflector in the Internet service provider network to be monitored and acquiring first abnormal connection state information between the route reflector and the connected router;
the fault judging server is further used for determining whether each router in the to-be-monitored internet service provider network is off-line or not according to the first alarm information and the first abnormal connection state information;
the acquisition server is further used for polling the virtual private network routing reflector in the Internet service provider network to be monitored and acquiring second abnormal connection state information between the virtual private network routing reflector and the connected router;
the failure determination server is further configured to determine, in combination with the second alarm information and the second abnormal connection state information, whether each router carrying the virtual private network service in the internet service provider network to be monitored has a virtual private network service offline.
4. The network failure monitoring system of claim 3 wherein,
the receiving server is used for receiving the first alarm information reported by the route reflector in the Internet service provider network to be monitored through a system log protocol; and receiving the second alarm information reported by the virtual private network routing reflector in the network of the Internet service provider to be monitored through a system log protocol.
5. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the network failure monitoring method of any of claims 1 to 2 when executing the computer program.
6. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program that executes the network failure monitoring method according to any one of claims 1 to 2.
CN201911257749.3A 2019-12-10 2019-12-10 Network fault monitoring method, system, computer equipment and readable storage medium Active CN112953744B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911257749.3A CN112953744B (en) 2019-12-10 2019-12-10 Network fault monitoring method, system, computer equipment and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911257749.3A CN112953744B (en) 2019-12-10 2019-12-10 Network fault monitoring method, system, computer equipment and readable storage medium

Publications (2)

Publication Number Publication Date
CN112953744A CN112953744A (en) 2021-06-11
CN112953744B true CN112953744B (en) 2023-05-12

Family

ID=76225525

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911257749.3A Active CN112953744B (en) 2019-12-10 2019-12-10 Network fault monitoring method, system, computer equipment and readable storage medium

Country Status (1)

Country Link
CN (1) CN112953744B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114006800B (en) * 2021-10-11 2023-12-05 中盈优创资讯科技有限公司 Equipment off-network alarm method and device based on IGP-SPF algorithm

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7804766B2 (en) * 2007-11-16 2010-09-28 At&T Intellectual Property Ii, L.P. Devices, systems, and/or methods regarding virtual routing forwarding
US7804781B2 (en) * 2008-11-20 2010-09-28 At&T Intellectual Property I, L.P. Methods and apparatus to detect border gateway protocol session failures
US8023406B2 (en) * 2009-02-27 2011-09-20 Research In Motion Limited Route reflector for a communication system
CN106411727B (en) * 2016-09-22 2021-05-14 新华三技术有限公司 Message processing method, device and autonomous system
CN106878166B (en) * 2017-01-22 2020-04-03 新华三技术有限公司 Route notification method and device

Also Published As

Publication number Publication date
CN112953744A (en) 2021-06-11

Similar Documents

Publication Publication Date Title
US10771396B2 (en) Communications network failure detection and remediation
CA2493525C (en) Method and apparatus for outage measurement
US11171875B2 (en) Systems and methods of communications network failure detection and remediation utilizing link probes
US20090147698A1 (en) Network automatic discovery method and system
US8605603B2 (en) Route convergence based on ethernet operations, administration, and maintenance protocol
US20040105390A1 (en) Method and system for implementing a fast recovery process in a local area network
US20060126495A1 (en) System and methods for detecting network failure
US20090003241A1 (en) A Method and System For Obtaining Path Maximum Transfer Unit in Network
CN110224883B (en) Gray fault diagnosis method applied to telecommunication bearer network
WO2011100882A1 (en) Link detecting method, apparatus and system
US11307945B2 (en) Methods and apparatus for detecting, eliminating and/or mitigating split brain occurrences in high availability systems
US11711281B2 (en) Methods and network devices for detecting and resolving abnormal routes
US10742672B2 (en) Comparing metrics from different data flows to detect flaws in network data collection for anomaly detection
US20230070701A1 (en) Network performance monitoring and fault management based on wide area network link health assessments
CN109873719B (en) Fault detection method and device
CN112953744B (en) Network fault monitoring method, system, computer equipment and readable storage medium
US7898955B1 (en) System and method for real-time diagnosis of routing problems
US20220210038A1 (en) Leveraging operation, administration & maintenance protocols (oam) to add ethernet level intelligence to software-defined wide area network (sd-wan) functionality
EP4164190A1 (en) Wireless signal strength-based detection of poor network link performance
US11178107B2 (en) System and method for detecting surreptitious packet rerouting
CN113542052A (en) Node fault determination method and device and server
CN117354196B (en) Link detection method and device based on network interface
US20230076842A1 (en) Systems & methods for actively monitoring latency in a network fabric
US9560209B1 (en) Techniques for troubleshooting IP based telecommunications networks
CN110933051B (en) Intercommunication method between SIP signaling services

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP02 Change in the address of a patent holder

Address after: Room 702-2, No. 4811, Cao'an Highway, Jiading District, Shanghai

Patentee after: CHINA UNITECHS

Address before: 100872 5th floor, Renmin culture building, 59 Zhongguancun Street, Haidian District, Beijing

Patentee before: CHINA UNITECHS

CP02 Change in the address of a patent holder