CN109428785A - A kind of fault detection method and device - Google Patents

A kind of fault detection method and device Download PDF

Info

Publication number
CN109428785A
CN109428785A CN201710779439.2A CN201710779439A CN109428785A CN 109428785 A CN109428785 A CN 109428785A CN 201710779439 A CN201710779439 A CN 201710779439A CN 109428785 A CN109428785 A CN 109428785A
Authority
CN
China
Prior art keywords
delay
round
node
link
packet
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710779439.2A
Other languages
Chinese (zh)
Inventor
龚明贤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201710779439.2A priority Critical patent/CN109428785A/en
Publication of CN109428785A publication Critical patent/CN109428785A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0823Errors, e.g. transmission errors
    • H04L43/0829Packet loss
    • H04L43/0841Round trip packet loss
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0805Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
    • H04L43/0811Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking connectivity
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0852Delays
    • H04L43/0864Round trip delays
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/10Active monitoring, e.g. heartbeat, ping or trace-route

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Environmental & Geological Engineering (AREA)
  • Health & Medical Sciences (AREA)
  • Cardiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

This application provides a kind of fault detection method and devices, wherein this method comprises: obtaining at least one network packet in multiple round-trip delays of multiple link sections of link to be detected, wherein the link to be detected includes multiple link sections;According to the multiple round-trip delay, the packet loss or delay situation of the multiple link section are determined;Determine faulty link section.Using technical solution provided by the embodiments of the present application, can solve existing way automatical and efficient can not determine the technical issues of which section link breaks down, and reach through delay distribution come the technical effect of locating network fault link section.

Description

A kind of fault detection method and device
Technical field
The application belongs to Internet technical field more particularly to a kind of fault detection method and device.
Background technique
Currently, cloud computing using more and more, enterprise can use some virtual networks that some cloud platforms provide and produce Product (such as: virtual switch, dummy load balanced device etc.), by these virtual network products, can build much more Complicated network environment.
However, under such a huge and complicated cloud computing virtualized network environment, often due to network equipment event Barrier, line fault, physical machine resource seize, VM (Virtual Machine, virtual machine) overload, software product BUG, plan Slightly the various problems such as problem, network attack cause network access delay excessive, to affect the normal operation of customer service.
If inquiring these failures by way of manually checking, often lead to ratio occur because network link is too long Relatively time-consuming problem, and be difficult accurately to find out link of all the problems.And sometimes access delay is only by one section of network link It is sometimes also likely to be caused by being gone wrong as several sections of network links caused by problematic.
However, not yet proposing effective solution at present for the link section accurately and efficiently automatically determined out there are failure Scheme.
Summary of the invention
The application is designed to provide a kind of fault detection method and device, may be implemented to position net by delay distribution The technical effect of network faulty link section.
The application provides a kind of fault detection method and device is achieved in that
A kind of fault detection method, which comprises
At least one network packet is obtained in multiple round-trip delays of multiple link sections of link to be detected, wherein institute Stating link to be detected includes multiple link sections;
According to the multiple round-trip delay, the packet loss or delay situation of the multiple link section are determined;
Determine faulty link section.
A kind of fault detection means, described device include:
Obtain module, for obtain at least one network packet link to be detected multiple link sections it is multiple round-trip Time delay, wherein the link to be detected includes multiple link sections;
First determining module, for determining packet loss or the delay of the multiple link section according to the multiple round-trip delay Situation;
Second determining module, for determining faulty link section.
A kind of computer readable storage medium, is stored thereon with computer program, realization when which is executed by processor The step of above method.
Link to be detected is divided into multiple link sections, and determined by fault detection method and device provided by the present application The packet loss and delay situation of each link section are which link section either which link section occurs so as to effectively determination Failure, thus solve existing way automatical and efficient can not determine which section link break down the technical issues of, reach By delay distribution come the technical effect of locating network fault link section.
Detailed description of the invention
In order to illustrate the technical solutions in the embodiments of the present application or in the prior art more clearly, to embodiment or will show below There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this The some embodiments recorded in application, for those of ordinary skill in the art, in the premise of not making the creative labor property Under, it is also possible to obtain other drawings based on these drawings.
Fig. 1 is a kind of method flow diagram of embodiment of fault detection method provided by the present application;
Fig. 2 is that the link section of network link provided by the present application divides schematic diagram;
Fig. 3 be the application offer be in network link data packet in the transmission schematic diagram of each node of link;
Fig. 4 is that each file in the packet capturing library provided by the present application based on chain road closes multiple network packets Join the method flow diagram of analysis;
Fig. 5 is the method flow diagram of determining delay distribution situation provided by the present application;
Fig. 6 is a kind of model structure schematic diagram of embodiment of processing equipment provided by the present application;
Fig. 7 is a kind of modular structure schematic diagram of embodiment of fault detection means provided by the present application.
Specific embodiment
In order to make those skilled in the art better understand the technical solutions in the application, below in conjunction with the application reality The attached drawing in example is applied, the technical scheme in the embodiment of the application is clearly and completely described, it is clear that described implementation Example is merely a part but not all of the embodiments of the present application.Based on the embodiment in the application, this field is common The application protection all should belong in technical staff's every other embodiment obtained without creative efforts Range.
In view of network access delay is sometimes as caused by one section of link failure, it is also possible to due to several sections of links events Caused by barrier, when determining access delay reason, if only determining whether failure is clearly unreasonable to whole link.For Realize the purpose for being simple and efficient and determining which section link of chain road to be detected breaks down, in this example, it is contemplated that can will Link to be detected is divided into multiple link sections, then, carries out detection statistics and obtains the packet loss and delay situation of each link section, thus It determines specific faulty link section in outgoing link, allows the faulty link that finds specific to a certain section of link or a few Section link, rather than a whole link, to improve the precision and efficiency of link failure detection.
Fig. 1 is a kind of method flow diagram of determination method one embodiment of herein described faulty link section.Although this Application provides as the following examples or method operating procedure shown in the drawings or apparatus structure, but based on conventional or without wound The labour for the property made may include more or less operating procedure or modular unit in the method or device.In logicality In upper the step of there is no necessary causalities or structure, the execution sequence of these steps or the modular structure of device are not limited to this Apply for embodiment description and execution shown in the drawings sequence or modular structure.The method or modular structure in practice Device or end product according to embodiment or method shown in the drawings or modular structure connection progress sequence in application, can hold Capable or parallel execution (such as environment or even distributed processing environment of parallel processor or multiple threads).
Specifically, as shown in Figure 1, a kind of fault detection method that a kind of embodiment of the application provides may include walking as follows It is rapid:
Step 101: obtain at least one network packet in multiple round-trip delays of multiple link sections of link to be detected, Wherein, the link to be detected includes multiple link sections;
Wherein, round-trip delay (Round-Trip Time, referred to as RTT) is an important property in a computer network Energy index indicates that receiving the confirmation from receiving end to transmitting terminal, (receiving end receives data meeting since transmitting terminal sends data Confirmation is sent immediately) time experienced.Such as: transmitting terminal A, receiving end B send data packet X from transmitting terminal to receiving end, So the data packet send round-trip delay, be exactly from A to B send X, from A receive B based on the X acknowledgment message sent when Between.
Link to be detected, chain road are usually to have multiple network interface cards with forwarding capability, that is, a data packet is in data Chain road is forwarded generally by multiple network interface cards.It in realization, can be using the network interface card with forwarding capability as decomposition Node, the link between the adjacent network interface card of every two can be used as a link section, and generally going wrong is also that some network interface card occurs Link fails between problem or adjacent two network interface card, so that data packet forward delay interval.
It therefore, can be by the network interface card with forwarding capability as node is decomposed, in reality when carrying out Link Fragmentation When existing, the network interface card that chain road to be detected each has forwarding capability can be regard as a decomposition node, be also possible to Several network interface cards with forwarding capability of the chain road to be detected are chosen as node is decomposed, for example, intermittent selection, Huo Zhesui Machine selection etc..It can specifically be selected according to actual needs using the segmentation which kind of mode chooses decomposition node progress link, this Application is not construed as limiting this.
It, can be on each decomposition node top after the decomposition node by selection divides a link into multiple link sections Affix one's name to a monitoring point.It, can (the starting point of link to be detected to terminal end-to-end with automatic trigger in order to realize automatic detection Section) ping instruction, then pass through each monitoring point collected each decomposition node whithin a period of time decomposed on node Then the delay product of each monitoring point is associated analysis by the delay of transmitting data packet.
For example, as shown in Fig. 2, a link to be detected: A to F, wherein the network interface card with forwarding capability are as follows: A, B, C, D,E,F.Assuming that each the network interface card with forwarding capability is used as decomposition node, then, entire link is also just with this six network interface cards It is divided into multiple link sections as node is decomposed.
Assuming that currently needing to detect with the presence or absence of faulty link section in A to F, then can be as shown in figure 3, sending number from A According to packet, receiving end is set as F.In the transmission process of data packet, i.e., what the network interface card by each with forwarding capability carried out turns Hair, until being sent to receiving end F, then, receiving end F can send immediately a confirmation message and return after receiving the data packet Be back to transmitting terminal A, confirmation message be also sent back by each network interface card with forwarding capability come.
Because deploying monitoring node at each decomposition node, it can capture every in data packet transfer procedure A RTT (round-trip delay) decomposed at node.If there is multiple data packets transmit, then each data packet hair can be got When sending, in multiple data packets in each transmission of data packets each transmission node round-trip delay.
As shown in Figure 3, it is assumed that the transmission predefined paths of data packet are that (A point is transmitting terminal, and F point is to receive from A transmission F End), then the RTT of A point is: data packet is from A to F, and total time of the confirmation message from F to A, the RTT of B point is: data packet from B to The RTT of F, total time of the confirmation message from F to B, C point are: data packet is from C to F, total time of the confirmation message from F to C, D point RRT is: data packet is from D to F, and total time of the confirmation message from F to D, the RTT of E point is: data packet is from E to F, and confirmation message is from F To the total time of E.
Assuming that the transmission predefined paths of data packet are to be transferred to E (B point is transmitting terminal, and E point is receiving end) from B, then B point RTT be: data packet is from B to E, and total time of the confirmation message from E to B, the RTT of C point is: data packet is from C to E, confirmation message The RTT of total time from E to C, D point are: for data packet from D to E, at this moment total time of the confirmation message from D to B does not just have A point With the record of F because the transmission path of data packet be exactly B to E, A and F not in a transmission path.
In view of being that the statistics to delay situation or packet drop therefore, can to determine which link section goes wrong It, can not be into for there is no the data packet being delayed only to select the data packet in the presence of delay as needing the data packet that counts Row statistics, to reduce calculation amount.
In one embodiment, acquisition network packet, can in the round-trip delay of multiple link sections of link to be detected To include: that triggering carries out data packet transmission end to end on the link to be detected;It obtains in the predetermined time, respectively decomposes node The collected data packet round-trip delay in the monitoring point of upper deployment;To the collected number in monitoring point disposed on each decomposition node According to packet round-trip delay carry out Conjoint Analysis, obtain it is multiple there are the network packet of transmission delay link to be detected multiple chains The round-trip delay in section.Specifically, multiple data can be obtained to reach automatic realize by automatic trigger ping end to end The purpose of the round trip delay time of packet.
Step 102: according to the multiple round-trip delay, determining the packet loss or delay situation of multiple link sections.
Step 103: according to the packet loss of the multiple link section or delay situation, determining faulty link section.
That is, the acquisition for passing through a period of time, so that it may the collection of each monitoring point be got up and be associated from the background Analysis counts the packet drop and delay situation of each link section, such as: determine the delay number of A to F, packet loss number, The delay number of B to E, packet loss number etc. shown by this intuitive result, can effectively determine any section or which There is failure, and delay and which link section of packet drop than more serious in section link.
Further, the data packet being delayed time consumed by each link section can also will occur successively shows Which come, so that simply finding out section delay than more serious.
In one embodiment, according to the multiple round-trip delay, the packet loss or delay situation of multiple link sections are determined, May include:
The corresponding round-trip delay data of decomposition node that successively traversal current network data packet is passed through according to the following steps, with Obtain the packet loss or delay situation of multiple link sections based on current network data packet:
S1: node is decomposed using current decomposition node as first;
Assuming that decomposing node using node A as first.
S2: the first round-trip delay that the current network data packet decomposes node described first is read;
Such as: obtain round-trip delay rtt1 of the data packet P at node A.
S3: determine that described first decomposes node with the presence or absence of next decomposition node;
That is, determining in node A to whether there is other decomposition nodes between the receiving end of data packet P.
S4: if it does not exist, then determining whether first round-trip delay is infinity, if it is infinity, to institute The packet loss record for stating link to be detected executes plus an operation, if being not infinity, remembers to the delay of the link to be detected Record executes plus one operates and record the delayed data of the delay;
Assuming that receiving end is node B, that is, other decomposition nodes are not present between transmitting terminal and receiving end, indicate that number According to packet P using A as transmitting terminal in the case where, transmission path is exactly direct node A to node B, between without other nodes, that What rtt1 was characterized is exactly data packet from node A to node B, and confirmation message is from node B to the time of node A.Because determining Data packet P is a delay data packet, accordingly, it is determined that whether rtt1 is infinitely great (∞), infinity is indicated that and never received To confirmation message, that, which can also confirm, has occurred a packet loss operation between A to B, at this moment, so that it may which by the section, (A is to B's) Packet loss operation note executes plus an operation, if rtt1 be not it is infinitely great, indicate that it is subsequent receive confirmation message, also with regard to phase It is only a transmission delay when in there is no packet losses, then can (the delay record of A to B) executes plus a behaviour by the section Make, in order to enable every section specific duration of data packet transmission delay etc. can be specified, at this moment can recorde this data packet A extremely Delay time between B sections.
S4: if it is present decomposing node for next decomposition node as second, the current network data packet is read In the second round-trip delay that described second decomposes node, institute is determined according to first round-trip delay and second round-trip delay State current network data packet it is described first decompose node to it is described second decomposition node between delay or packet drop.
Specifically, determining that the current network data packet exists according to first round-trip delay and second round-trip delay It is described first decompose node to it is described second decomposition node between delay or packet drop, may include:
It 1) is in the case that infinitely great second round-trip delay is not infinity, to institute in first round-trip delay State the packet loss record plus one for the link section that the first decomposition node is decomposed to described second between node;
2) it is not in the case that infinitely great second round-trip delay is also not infinitely great in first round-trip delay, Determine whether the difference between first round-trip delay and second round-trip delay is greater than preset threshold, if it is greater than default Threshold value, then the delay record for decomposing node to the link section between the second decomposition node to described first executes plus an operation And record the delayed data of the delay.
Assuming that is currently judged is the transmission delay record between A to C, that data packet is passed through is A, B, C, then can be with Therefore the receiving end for being determined to data packet after obtaining the rtt1 of A to C, can also obtain other than A there are also node B The rtt2 of B to C.It based on rtt1 and rtt2, may be implemented to A to C, A to B, the statistics of delay and packet drop between B to C. It is specific:
If 1) rtt1 is infinity, rtt2 is also infinity, indicates that the link section between A to B is that there is no problem , it is subsequent when being judged using B point as first node, if there is no node after B, because the rtt between B to C is nothing Poor big, indicating that B node, there are packet losses between C.Therefore, prolonging for current data packet is being judged using A node as first node When packet drop when, if A to B, the rtt between B to C be all it is infinitely great, be ignored as regardless of waiting subsequent determination to be specifically The failure which link section occurs;
If 2), rtt1 is infinity, rtt2 be not it is infinitely great, it is what there is no problem between B to C that, which is indicated that, problem It appears between A and B, there are packet losses between A and B.Add an operation so as to execute A to B sections of packet loss record;
If 3) rtt1 be not it is infinitely great, rtt2 is also not infinitely great, that indicates that A and B have received the confirmation of C point and disappear At this moment breath can calculate rtt1-rtt2 for characterizing the data transmission period between A and B, preset if rtt1-rtt2 is greater than Delay threshold, indicate that between A to B be exist delay, therefore, can by the delay of A to B record execute plus one behaviour Make, in order to enable every section specific duration of data packet transmission delay etc. can be specified, at this moment can recorde this data packet A extremely Delay time between B sections.If rtt1-rtt2 is not more than preset delay threshold, indicate that between A to B it is that there is no prolong When, so that it may it is adjusted without record.
If being used as first node to carry out above-mentioned update based on node each in transmission path each delay data packet The update of delay and packet loss record finally can be obtained by each node (decomposing node) and be formed by each link in link section Delay and packet drop, then the data packet packet of occurred delay is successively shown in each link section institute elapsed time Come, the judgement being simple and efficient to the fault condition of each link section may be implemented.
It is illustrated below with reference to determination method of the specific embodiment to above-mentioned faulty link section, however, being worth note Meaning, the specific embodiment do not constitute an undue limitation on the present application merely to the application is better described.
In this example, it is contemplated that under the virtualized network environment of cloud computing, it is difficult that network failure link section is positioned manually And it is very time-consuming, a kind of method of determining link segment fault is proposed, so as to greatly improve the speed of service response, Save human cost.
Specifically, link to be detected first can be decomposed into multiple link sections (such as: N number of), it, can when decomposing A decomposition node is defined as with each network interface card with forwarding capability by chain road, naturally it is also possible to select certain several tool There is the network interface card of forwarding capability as decomposition node, rather than all network interface cards with forwarding capability is selected all to save as decomposition Point.When actually realizing, it can according to need the selection from link to be detected and decompose node, to realize to link to be detected Decomposition.
A monitoring point, then, automatic trigger end-to-end (both ends to be detected) are all disposed on each node Ping, then, by collected ICMP, (Internet Control Message Protocol, Internet control message association View) packet be stored in file respectively, by the acquisition of a period of time, then by the collection of each monitoring point come backstage carry out Association analysis, finally, intuitively show which or which section there is network failure, wherein so-called network failure Packet delay be can be greater than preset legal threshold value, or packet drop occur, and can will occur delay Packet successively shown in each link section institute elapsed time.
Specifically, can be as shown in figure 4, to each file in the packet capturing library based on chain road to multiple network packets It is associated analysis:
S1: first pcap file of load chain road is (for the text of the data packet interception of network interface, port and protocol Part);
S2: each record in first pcap file is traversed one by one:
If current is recorded as request (request) ICMP data packet, newly creates a file and add one newly Record, may include<id in specific record, seq, request timestamp, and response timestamp>;
If current is recorded as response (confirmation is replied) ICMP data packet, it is determined that whether delay is greater than threshold value, If it does, so more new record, records delay record, it, can be by the record deletion if being not less than threshold value.
S3: next pcap file of load chain road;
S4: each record in pcap file is traversed one by one:
If current is recorded as request (request) ICMP data packet, a blotter is established;
If current is recorded as response (confirmation is replied) ICMP data packet, it is determined that whether delay is greater than threshold value, If it does, so just searching whether to continue to record there are the record in the file of above-mentioned creation, if it is determined that delay is not Greater than threshold value, then blotter is deleted.
S5: it until having traversed all pcap files, is saved and is shown.
By association analysis above, RTT of the available all delay packages on each monitoring node, that is to say, that For the ICMP packet a being delayed record, if RTT is greater than setting detection threshold when by a monitoring node, There will be corresponding RTT record.Based on obtained record as a result, the diagnostic analysis of entire link can be carried out, specifically , delay distribution situation of the available all delays on each link section.
When determining delay distribution situation, it can be realized according to step as shown in Figure 5:
S1: the ICMP packet record of a delay is obtained;
S2: the corresponding RTT record of each monitoring node of data packet process is successively traversed:
A record rtt1 is obtained, determines whether there are also next rtt records after the rtt1: 1) if there is no next Rtt record, it is determined that rtt1 whether be it is infinitely great, if it is infinity, then respective links section increases a packet loss statistics, such as Fruit is not infinity, then respective links section increases a delay statistics, and records specific delay;2) if there is next Rtt record, reads next record rtt2, in the case where it is not infinity that rtt1, which is infinitely great but rtt1, respective links section Increase a packet loss statistics, in the case where rtt1 is not that infinitely great rtt2 is also not infinity, determines whether rtt1-rtt2 is big In preset threshold, if it is greater, then respective links increase a delay statistics, and specific delay is recorded, if it is not greater, S3 is thened follow the steps, until having traversed the corresponding all rtt records of the data packet.
S3: it exits.
In upper example, by the way that long and complex network link is carried out sectional monitoring, then again by the monitoring knot of each node Fruit is associated analysis to find faulty network link segment, to automatically solve locating network fault link Section.It solves existing under the virtualized network environment of cloud computing, it is difficult and very that network failure link section is positioned manually Time-consuming technical problem saves human cost to greatly improve the speed of service response.
Fig. 6 shows the schematic configuration diagram of the processing equipment of the exemplary embodiment according to the application.Referring to FIG. 6, In hardware view, which includes processor, internal bus, network interface, memory and nonvolatile memory, certainly It is also possible that hardware required for other business.Processor read from nonvolatile memory corresponding computer program to It is then run in memory, forms fault detection means on logic level.Certainly, other than software realization mode, the application Other implementations, such as logical device or the mode of software and hardware combining etc. is not precluded, that is to say, that following processing stream The executing subject of journey is not limited to each logic unit, is also possible to hardware or logical device.
Referring to FIG. 7, the fault detection means may include obtaining module, the first determining mould in Software Implementation Block and the second determining module.Wherein:
Obtain module, for obtain at least one network packet link to be detected multiple link sections it is multiple round-trip Time delay, wherein the link to be detected includes multiple link sections;
First determining module, for determining packet loss or the delay of the multiple link section according to the multiple round-trip delay Situation;
Second determining module, for determining faulty link section.
In one embodiment, link to be detected can be divided into multiple link sections by multiple decomposition nodes, In, the node that decomposes includes: the network interface card that the chain road to be detected has forwarding capability.
In one embodiment, obtaining module may include: trigger unit, for triggering on the chain road to be detected Data packet end to end is carried out to transmit;Acquiring unit, for obtaining in the predetermined time, the decomposition node of the chain road to be detected The collected data packet round-trip delay in the monitoring point of upper deployment;Analytical unit, for the monitoring to being disposed on the decomposition node The collected data packet round-trip delay of point carries out Conjoint Analysis, obtains that there are the network packets of transmission delay in the multiple chain Multiple round-trip delays in section.
In one embodiment, the first determining module specifically can be used for successively traversing current network according to the following steps The corresponding round-trip delay data of decomposition node that data packet is passed through, to obtain multiple link sections based on current network data packet Packet loss or delay situation:
Node is decomposed using current decomposition node as first;
Read the first round-trip delay that the current network data packet decomposes node described first;
Determine that described first decomposes node with the presence or absence of next decomposition node;
If it does not exist, then determine first round-trip delay whether be it is infinitely great, if it is infinity, to it is described to The packet loss record for detecting link executes plus an operation, if being not infinity, holds to the delay record of the link to be detected Row plus an operation and the delayed data for recording the delay;
If it is present decomposing node for next decomposition node as second, reads the current network data packet and exist Described second decomposes the second round-trip delay of node, according to first round-trip delay and second round-trip delay determination Current network data packet it is described first decompose node to it is described second decomposition node between delay or packet drop.
In one embodiment, the first determining module can according to first round-trip delay and it is described second it is round-trip when Prolong determine the current network data packet described first decompose node between the second decomposition node delay or packet loss Situation specifically includes:
It is in the case that infinitely great second round-trip delay is not infinity, to described in first round-trip delay First decomposes the packet loss record plus one for the link section that node is decomposed to described second between node;
It is not in the case that infinitely great second round-trip delay is also not infinitely great, really in first round-trip delay Whether the difference between fixed first round-trip delay and second round-trip delay is greater than preset threshold, if it is greater than default threshold Value then decomposes delay record execution plus an operation of the node to the link section between the second decomposition node simultaneously to described first Record the delayed data of the delay.
Link to be detected is divided into multiple link sections, and determined by fault detection method and device provided by the present application The packet loss and delay situation of each link section are which link section either which link section occurs so as to effectively determination Failure, thus solve existing way automatical and efficient can not determine which section link break down the technical issues of, reach By delay distribution come the technical effect of locating network fault link section.
Although this application provides the method operating procedure as described in embodiment or flow chart, based on conventional or noninvasive The labour for the property made may include more or less operating procedure.The step of enumerating in embodiment sequence is only numerous steps One of execution sequence mode, does not represent and unique executes sequence.It, can when device or client production in practice executes To execute or parallel execute (such as at parallel processor or multithreading according to embodiment or method shown in the drawings sequence The environment of reason).
The device or module that above-described embodiment illustrates can specifically realize by computer chip or entity, or by having The product of certain function is realized.For convenience of description, it is divided into various modules when description apparatus above with function to describe respectively. The function of each module can be realized in the same or multiple software and or hardware when implementing the application.It is of course also possible to Realization the module for realizing certain function is combined by multiple submodule or subelement.
Method, apparatus or module described herein can realize that controller is pressed in a manner of computer readable program code Any mode appropriate is realized, for example, controller can take such as microprocessor or processor and storage can be by (micro-) The computer-readable medium of computer readable program code (such as software or firmware) that processor executes, logic gate, switch, specially With integrated circuit (Application Specific Integrated Circuit, ASIC), programmable logic controller (PLC) and embedding Enter the form of microcontroller, the example of controller includes but is not limited to following microcontroller: ARC 625D, Atmel AT91SAM, Microchip PIC18F26K20 and Silicone Labs C8051F320, Memory Controller are also implemented as depositing A part of the control logic of reservoir.It is also known in the art that in addition to real in a manner of pure computer readable program code Other than existing controller, completely can by by method and step carry out programming in logic come so that controller with logic gate, switch, dedicated The form of integrated circuit, programmable logic controller (PLC) and insertion microcontroller etc. realizes identical function.Therefore this controller It is considered a kind of hardware component, and hardware can also be considered as to the device for realizing various functions that its inside includes Structure in component.Or even, it can will be considered as the software either implementation method for realizing the device of various functions Module can be the structure in hardware component again.
Part of module in herein described device can be in the general of computer executable instructions Upper and lower described in the text, such as program module.Generally, program module includes executing particular task or realization specific abstract data class The routine of type, programs, objects, component, data structure, class etc..The application can also be practiced in a distributed computing environment, In these distributed computing environment, by executing task by the connected remote processing devices of communication network.In distribution It calculates in environment, program module can be located in the local and remote computer storage media including storage equipment.
As seen through the above description of the embodiments, those skilled in the art can be understood that the application can It is realized by the mode of software plus required hardware.Based on this understanding, the technical solution of the application is substantially in other words The part that contributes to existing technology can be embodied in the form of software products, and can also pass through the implementation of Data Migration It embodies in the process.The computer software product can store in storage medium, such as ROM/RAM, magnetic disk, CD, packet Some instructions are included to use so that a computer equipment (can be personal computer, mobile terminal, server or network are set It is standby etc.) execute method described in certain parts of each embodiment of the application or embodiment.
Each embodiment in this specification is described in a progressive manner, the same or similar portion between each embodiment Dividing may refer to each other, and each embodiment focuses on the differences from other embodiments.The whole of the application or Person part can be used in numerous general or special purpose computing system environments or configuration.Such as: personal computer, server calculate Machine, handheld device or portable device, mobile communication terminal, multicomputer system, based on microprocessor are at laptop device System, programmable electronic equipment, network PC, minicomputer, mainframe computer, the distribution including any of the above system or equipment Formula calculates environment etc..
Although depicting the application by embodiment, it will be appreciated by the skilled addressee that the application there are many deformation and Variation is without departing from spirit herein, it is desirable to which the attached claims include these deformations and change without departing from the application's Spirit.

Claims (12)

1. a kind of fault detection method, which is characterized in that the described method includes:
At least one network packet is obtained in multiple round-trip delays of multiple link sections of link to be detected, wherein it is described to Detecting link includes multiple link sections;
According to the multiple round-trip delay, the packet loss or delay situation of the multiple link section are determined;
Determine faulty link section.
2. the method according to claim 1, wherein the link to be detected is divided by multiple decomposition nodes For multiple link sections, wherein the node that decomposes includes: the network interface card that the chain road to be detected has forwarding capability.
3. according to the method described in claim 2, it is characterized in that, obtaining at least one network packet in link to be detected Multiple round-trip delays of multiple link sections, comprising:
Triggering carries out data packet transmission end to end on the link to be detected;
It obtains in the predetermined time, the collected data packet in monitoring point disposed on the decomposition node of the chain road to be detected is round-trip Time delay;
Conjoint Analysis is carried out to the collected data packet round-trip delay in monitoring point disposed on the decomposition node, obtains the presence of biography Multiple round-trip delays of the network packet of defeated delay in the multiple link section.
4. according to the method described in claim 2, it is characterized in that, determining the multiple chain according to the multiple round-trip delay The packet loss in section or the situation that is delayed include:
The corresponding round-trip delay data of decomposition node that successively traversal current network data packet is passed through according to the following steps, to obtain The packet loss or delay situation of multiple link sections based on current network data packet:
Node is decomposed using current decomposition node as first;
Read the first round-trip delay that the current network data packet decomposes node described first;
Determine that described first decomposes node with the presence or absence of next decomposition node;
If it does not exist, then determining whether first round-trip delay is infinity, if it is infinity, to described to be detected The packet loss record of link executes plus one operates, if being not infinity, executes and adds to the delay record of the link to be detected One operates and records the delayed data of the delay;
If it is present decomposing node for next decomposition node as second, the current network data packet is read described Second decomposes the second round-trip delay of node, is determined according to first round-trip delay and second round-trip delay described current Network packet it is described first decompose node to it is described second decomposition node between delay or packet drop.
5. according to the method described in claim 4, it is characterized in that, according to first round-trip delay and it is described second it is round-trip when Prolong determine the current network data packet described first decompose node between the second decomposition node delay or packet loss Situation, comprising:
It is in the case that infinitely great second round-trip delay is not infinity, to described first in first round-trip delay Decompose the packet loss record plus one for the link section that node is decomposed to described second between node;
It is not in the case that infinitely great second round-trip delay is also not infinitely great, to determine institute in first round-trip delay Whether the difference stated between the first round-trip delay and second round-trip delay is greater than preset threshold, if it is greater than preset threshold, Then the delay record of the first decomposition node to the link section between the second decomposition node is executed plus one operates and remembers Record the delayed data of the delay.
6. according to the method described in claim 4, it is characterized in that, the delayed data includes at least one of: delay Link section, delay time.
7. a kind of fault detection means, which is characterized in that described device includes:
Obtain module, for obtain at least one network packet link to be detected multiple link sections it is multiple round-trip when Prolong, wherein the link to be detected includes multiple link sections;
First determining module, for determining the packet loss or delay situation of the multiple link section according to the multiple round-trip delay;
Second determining module, for determining faulty link section.
8. device according to claim 7, which is characterized in that the link to be detected is divided by multiple decomposition nodes For multiple link sections, wherein the node that decomposes includes: the network interface card that the chain road to be detected has forwarding capability.
9. device according to claim 8, which is characterized in that the acquisition module includes:
Trigger unit carries out data packet transmission end to end for triggering on the link to be detected;
Acquiring unit, for obtaining in the predetermined time, the monitoring point disposed on the decomposition node of the chain road to be detected is acquired The data packet round-trip delay arrived;
Analytical unit, for carrying out joint point to the collected data packet round-trip delay in monitoring point disposed on the decomposition node Analysis, obtains multiple round-trip delays there are the network packet of transmission delay in the multiple link section.
10. device according to claim 8, which is characterized in that first determining module is specifically used for according to following step Suddenly the corresponding round-trip delay data of decomposition node that successively traversal current network data packet is passed through, to obtain based on current network number According to the packet loss or delay situation of multiple link sections of packet:
Node is decomposed using current decomposition node as first;
Read the first round-trip delay that the current network data packet decomposes node described first;
Determine that described first decomposes node with the presence or absence of next decomposition node;
If it does not exist, then determining whether first round-trip delay is infinity, if it is infinity, to described to be detected The packet loss record of link executes plus one operates, if being not infinity, executes and adds to the delay record of the link to be detected One operates and records the delayed data of the delay;
If it is present decomposing node for next decomposition node as second, the current network data packet is read described Second decomposes the second round-trip delay of node, is determined according to first round-trip delay and second round-trip delay described current Network packet it is described first decompose node to it is described second decomposition node between delay or packet drop.
11. device according to claim 10, which is characterized in that first determining module according to described first it is round-trip when Prolong and determines that the current network data packet decomposes node described first and decomposes section to described second with second round-trip delay Delay or packet drop between point, specifically include:
It is in the case that infinitely great second round-trip delay is not infinity, to described first in first round-trip delay Decompose the packet loss record plus one for the link section that node is decomposed to described second between node;
It is not in the case that infinitely great second round-trip delay is also not infinitely great, to determine institute in first round-trip delay Whether the difference stated between the first round-trip delay and second round-trip delay is greater than preset threshold, if it is greater than preset threshold, Then the delay record of the first decomposition node to the link section between the second decomposition node is executed plus one operates and remembers Record the delayed data of the delay.
12. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that the program is by processor The step of any one of claims 1 to 6 the method is realized when execution.
CN201710779439.2A 2017-09-01 2017-09-01 A kind of fault detection method and device Pending CN109428785A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710779439.2A CN109428785A (en) 2017-09-01 2017-09-01 A kind of fault detection method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710779439.2A CN109428785A (en) 2017-09-01 2017-09-01 A kind of fault detection method and device

Publications (1)

Publication Number Publication Date
CN109428785A true CN109428785A (en) 2019-03-05

Family

ID=65512994

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710779439.2A Pending CN109428785A (en) 2017-09-01 2017-09-01 A kind of fault detection method and device

Country Status (1)

Country Link
CN (1) CN109428785A (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111256551A (en) * 2020-03-27 2020-06-09 北京中大爆破工程有限公司 Method for quickly determining fault detonator
CN111464374A (en) * 2020-02-21 2020-07-28 中国电子技术标准化研究院 Network delay control method, equipment and device
CN112311619A (en) * 2019-08-14 2021-02-02 北京字节跳动网络技术有限公司 Network message delay detection method and device and electronic equipment
CN112491635A (en) * 2019-08-20 2021-03-12 中兴通讯股份有限公司 Method, system, implementation equipment and storage medium for link quality detection
CN113099477A (en) * 2021-03-24 2021-07-09 Oppo广东移动通信有限公司 Time delay information processing method and related device
CN113395356A (en) * 2021-07-06 2021-09-14 山东电力工程咨询院有限公司 Health monitoring method and system of data center
CN114039889A (en) * 2021-09-27 2022-02-11 北京邮电大学 Network anomaly detection method based on round-trip delay time sequence and related device
CN114553678A (en) * 2022-02-09 2022-05-27 紫光云(南京)数字技术有限公司 Diagnosis method for soft SLB traffic problem of cloud network
CN115842747A (en) * 2022-11-21 2023-03-24 中盈优创资讯科技有限公司 Implementation method and device based on segmented monitoring network
CN116074184A (en) * 2023-03-21 2023-05-05 云南莱瑞科技有限公司 Network fault early warning system of power dispatching center

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103746874A (en) * 2013-12-30 2014-04-23 华为技术有限公司 Method and equipment for IP (Internet protocol) FPM (flow performance monitor)
CN105049299A (en) * 2015-08-27 2015-11-11 北京百度网讯科技有限公司 Detection method and device for time delay state information and network architecture
CN106713074A (en) * 2016-12-30 2017-05-24 贵州电网有限责任公司信息中心 Data network quality piecewise detection method and system based on service content

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103746874A (en) * 2013-12-30 2014-04-23 华为技术有限公司 Method and equipment for IP (Internet protocol) FPM (flow performance monitor)
CN105049299A (en) * 2015-08-27 2015-11-11 北京百度网讯科技有限公司 Detection method and device for time delay state information and network architecture
CN106713074A (en) * 2016-12-30 2017-05-24 贵州电网有限责任公司信息中心 Data network quality piecewise detection method and system based on service content

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
佟世文 等: "一种基于线性模型预测的网络化预测模糊控制方法", 《中南大学学报(自然科学版)》 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112311619A (en) * 2019-08-14 2021-02-02 北京字节跳动网络技术有限公司 Network message delay detection method and device and electronic equipment
CN112311619B (en) * 2019-08-14 2022-04-05 北京字节跳动网络技术有限公司 Network message delay detection method and device and electronic equipment
CN112491635A (en) * 2019-08-20 2021-03-12 中兴通讯股份有限公司 Method, system, implementation equipment and storage medium for link quality detection
CN111464374A (en) * 2020-02-21 2020-07-28 中国电子技术标准化研究院 Network delay control method, equipment and device
CN111256551A (en) * 2020-03-27 2020-06-09 北京中大爆破工程有限公司 Method for quickly determining fault detonator
CN113099477A (en) * 2021-03-24 2021-07-09 Oppo广东移动通信有限公司 Time delay information processing method and related device
CN113395356A (en) * 2021-07-06 2021-09-14 山东电力工程咨询院有限公司 Health monitoring method and system of data center
CN114039889A (en) * 2021-09-27 2022-02-11 北京邮电大学 Network anomaly detection method based on round-trip delay time sequence and related device
CN114553678A (en) * 2022-02-09 2022-05-27 紫光云(南京)数字技术有限公司 Diagnosis method for soft SLB traffic problem of cloud network
CN114553678B (en) * 2022-02-09 2024-02-13 紫光云(南京)数字技术有限公司 Cloud network soft SLB flow problem diagnosis method
CN115842747A (en) * 2022-11-21 2023-03-24 中盈优创资讯科技有限公司 Implementation method and device based on segmented monitoring network
CN116074184A (en) * 2023-03-21 2023-05-05 云南莱瑞科技有限公司 Network fault early warning system of power dispatching center

Similar Documents

Publication Publication Date Title
CN109428785A (en) A kind of fault detection method and device
US9306819B2 (en) Controller driven OAM for split architecture network
Yu et al. Profiling network performance for multi-tier data center applications
Handigol et al. I know what your packet did last hop: Using packet histories to troubleshoot networks
US9571373B2 (en) System and method for combining server side and network side transaction tracing and measurement data at the granularity level of individual transactions
JP2022500963A (en) Network security monitoring methods, network security monitoring devices and systems
US20140215077A1 (en) Methods and systems for detecting, locating and remediating a congested resource or flow in a virtual infrastructure
US20130258843A1 (en) Network system and apparatis
KR20170049509A (en) Collecting and analyzing selected network traffic
CN105897507B (en) The condition detection method and device of node device
CN109088794A (en) A kind of fault monitoring method and device of node
KR101443071B1 (en) Error Check System of Webpage
US10904096B2 (en) Deep network path analysis for identifying network segments affecting application performance
CN108566363A (en) Method and system is determined based on the Brute Force of streaming computing
CN112350854A (en) Flow fault positioning method, device, equipment and storage medium
US10659338B1 (en) Isolation of network segments affecting application performance
US10805144B1 (en) Monitoring interactions between entities in a network by an agent for particular types of interactions and indexing and establishing relationships of the components of each interaction
CN109002478A (en) The fault handling method and relevant device of distributed file system
CN104378223A (en) Link performance testing method and device, logic processor and network processor
WO2019079961A1 (en) Method and device for determining shared risk link group
US10009151B2 (en) Packet storage method, information processing apparatus, and non-transitory computer-readable storage medium
CN110995606B (en) Congestion analysis method and device
JP2012175389A (en) Log collection automated device, log collection automation test system and log collection control method
CN109120449A (en) A kind of detection method and device of link failure
CN104993944A (en) Experiment scene backtracking technology device and method based on network environment and test equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190305