CN102684902B

CN102684902B - Based on the network failure locating method of probe prediction

Info

Publication number: CN102684902B
Application number: CN201110066944.5A
Authority: CN
Inventors: 王智立; 高志鹏; 吴顺安; 邱雪松; 李文璟; 乔焰; 王颖
Original assignee: Beijing University of Posts and Telecommunications
Current assignee: Beijing University of Posts and Telecommunications
Priority date: 2011-03-18
Filing date: 2011-03-18
Publication date: 2015-10-14
Anticipated expiration: 2031-03-18
Also published as: CN102684902A

Abstract

The invention discloses a kind of network failure locating method based on probe prediction, comprise step: step 101: the detection set A that initialization is current and malfunctioning node set F are for empty; Wherein, detect set A and represent the issued probe selected from probe set M; Malfunctioning node set F represents the malfunctioning node set navigated to; Step 102: select K probe and send from probe set M; Step 103: obtain a sparse matrix by probe set A, and remove the probe sent from described probe set M; Step 104: predict obtained sparse matrix, obtains a perfect matrix; Step 105: select the most uncertain N number of probe and send from described perfect matrix; Step 106: utilize the result of the probe sent in step 105 to upgrade described perfect matrix; Step 107: the location of carrying out malfunctioning node, calculates malfunctioning node set.The method reduce the required number of probes sent of active probing technique.

Description

Based on the network failure locating method of probe prediction

Technical field

The present invention relates to network detection technical field, particularly relate to a kind of network failure locating method based on probe prediction.

Background technology

Along with popularizing of internet information, the scale of computer system network constantly expands and constantly increases with complexity, and the difficulty of network management is also increasing, how to manage network timely and effectively and becomes our current problem demanding prompt solution.

At present, received Network Management Function must comprise following five aspects: fault management, accounting management, configuration management, performance management and safety management.Fault management is again one of of paramount importance function in network management.When certain component failure in network, network manager must find the source of trouble timely and accurately and get rid of, with the reliability of Logistics networks.When network size is smaller, network manager according to processing alarm for information about, can be fixed a breakdown.And when network size complexity is higher, network manager just should perform some diagnostic tests to distinguish failure cause.And along with the continuous expansion of network size and the raising day by day of network complexity, the difficulty of fault management is also just increasing.

Failure diagnosis, as a Core Feature of fault management, plays increasing effect in current network management.Fault diagnosis technology the most frequently used is at present active probing technique.First active probing technique sends detection from selected base station to other all network equipment, then by calculating the entropy of detection, is then applied in Bayesian network model, by further compute location to the source of trouble in network.But along with the continuous expansion of network size, the increasing of all kinds of service application amount, and fault is in the appearance of different agreement layer, also more and more higher to the requirement of network detection.

The method of a current solution network failure location difficult problem mainly utilizes the technology of initiatively probe.Probe is used for the end-to-end test affairs collection of gathering system network performance information specially, initiatively probe is then utilize probability inference technology and the active from probe sets of combining information opinion method selects a small amount of probe most with amount of information, to carry out Rapid Inference to the state of grid.

The state of development of prior art is as follows:

Technical scheme 1: application number is the Chinese patent application of 200510074911.X, relate to a kind of method of recognition network malfunctioning node, its core technology is: first the network node of system sends detection messages to network node associated with it, then the response message of described associated network node is monitored, when the response message of the network node be associated described in confirmation does not receive, then determine that this related network node is failing network-node.

Technical scheme 2: application number be 200710188015.5 Chinese patent application disclose a kind of service failure diagnosis system based on active probe and method thereof, use the service performance in the effective monitoring network of active probe, ensure the performance using service in each Access Network, when monitoring notes abnormalities, the symptom according to monitoring orients fault rootstock rapidly and accurately.This application for a patent for invention monitoring overhead is little, has good performance of fault diagnosis, and this application for a patent for invention simultaneously chooses probe based on uncertainty models, with the verification and measurement ratio of less probe expense guarantee to each fault, and can obtain good diagnosis performance.

Literature protocol 1: " Adaptive diagnosis in distributed systems ", the document proposes initiatively probe technique the earliest.Probe is used for the end-to-end test affairs collection of gathering system network performance information specially, initiatively probe technique is then utilize probability inference technology and the active from probe sets of combining information opinion method selects a small amount of probe most with amount of information, to carry out Rapid Inference to the state of grid.This algorithm not only in the number of probes sent than traditional algorithm few a lot, and also greatly to reduce on the time spent by fault location.

Literature protocol 2: " Fault diagnosis in IP Networks via Noiseless Multicast Probing ", this document proposes one based on the fault model of extracting of multicasting technology, solve a fault location difficult problem.Extract in fault model at this, certain ancestor node of and if only if a reception probe breaks down, and this receives probe just can record this fault measuring.Utilize this model and most probable malfunctioning node can be calculated by probe value.The model treatment method that the document proposes makes full use of the fabric feature of multicast tree, more effective compared with common probability inference.

Discuss the defect of prior art below.

At present, from this subject of computer science, derive diversified network monitoring method, generally speaking can be divided into two classes: the detection method of passive collection information and the network detection method of active probe.The network management system adopting passive collection information to carry out this method of network detection requires that networking component provides the function of inside story to network management system, with the message active reporting oneself collected to network management system.But along with the continuous increase of network size and the increase of complexity, the network management system of passive collection information, due to the large and lower operational efficiency of its difficulty realized, cannot meet the needs managing the network become increasingly complex.Meanwhile, such as not be there is management interface by the managed entity of third party's assembly, middleware or other main frame exploitation, the function of reporting message cannot be realized.This just requires that we provide better model to detect network, and under this background, active probing technique is carried out.Network management system based on active probe uses the method for active probe detect managed device and analyze result of detection, to complete fault location, the message collected by it is reported without the need to managed device, there is active, efficient and adaptive characteristic, and the symptoms such as network, the system failure, serv-fail and performance degradation can be known as soon as possible with very little cost, thus provide foundation for the positioning analysis of root fault.

Although the method for active probe tool compared with the method for passive collection information has great advantage, its configuration overhead extra for network brings and flow load.First, the method for active probe needs to configure in a network the detection base station of some to ensure that the probe sent from base station can cover whole network, and the fault that can effectively occur in fixer network.And the existence of such special joint will introduce configuration to node and maintenance costs.Secondly, the probe of transmission is in fact also a kind of network overhead, although excessive network detection set is good for result of detection, also brings larger load to network, causes the congested of some network node.Therefore, from candidate probe set, how to select that one efficient and offered load is less detection set just seems particularly important.The uncertainty of network state should be reduced to minimum by the probe set that namely probe selection algorithm is selected, and the scale of probe set is little as much as possible simultaneously.

Specifically, the defect of technique scheme 1 is: the program sends to its network node be associated the state that monitoring message judges its network node be associated by network node, this scheme needs to send in a network to monitor message in a large number, very large on the flow load impact of network, and when cannot receive monitoring message when network congestion, can not determine it is that the network node be associated necessarily is in malfunction.

The defect of technique scheme 2 for: the program first sends the monitoring probe of some in institute's monitoring network, whether there to be fault in supervising network produce, the diagnostic probe of some is sent again to carry out fault location when there being fault to produce in network, this scheme needs to send a large amount of probes in network, when network is more complicated, the required number of probes sent is more, which increases the flow in network, larger to the load effect of network.

The defect of above-mentioned literature protocol 1 is: the document proposes the technology of active probe the earliest, the advantage of this algorithm be make use of probability theory technology and combining information opinion method has carried out reasoning to network state, compared to passive collection message method offered load expense little, accurate positioning.But this algorithm needs to carry out complicated calculating to the entropy of probe, and calculation cost is very high, and is only applicable to the situation occurring single fault in network.So this algorithm be not suitable for the more situation of current this network environment more complicated, failure ratio.

The defect of above-mentioned literature protocol 2 is: in the consideration technically adding time factor of active probe, and make the obtainable mutual information of probe set institute large as much as possible, not only fault location prepares more, and the average overhead of network is also less.But this algorithm needs to send a large amount of probe in network, still larger to the occupancy of Internet resources.

Summary of the invention

(1) technical problem that will solve

For needing the problem sending a large amount of probe in network in existing network Detection Techniques, the technical problem to be solved in the present invention is: reduce the required number of probes sent of active probing technique, and quick position is to the fault in network.

(2) technical scheme

In order to solve the problems of the technologies described above, the invention provides a kind of network failure locating method based on probe prediction, comprising the following steps:

Step 101: the detection set A that initialization is current and malfunctioning node set F are empty; Wherein, detect set A and represent select, issued probe from probe set M; Malfunctioning node set F represents the malfunctioning node set navigated to; Probe set M is made up of the probe sent to the network equipment in network to be positioned;

Step 102: select K probe and send from probe set M;

Step 103: obtain a sparse matrix by detecting set A, and remove the probe sent from described probe set M;

Step 104: predict obtained sparse matrix, obtains a perfect matrix;

Step 105: select the most uncertain N number of probe and send from described perfect matrix;

Step 106: utilize the result of the probe sent in step 105 to upgrade described perfect matrix;

Step 107: the location of carrying out malfunctioning node, calculates malfunctioning node set.

Wherein, in step 104, maximal margin matrix decomposition MMMF is utilized to predict obtained sparse matrix.

Wherein, in step 107, Bayesian network model is utilized to carry out the location of malfunctioning node.

Wherein, also comprise after step 107: judge whether the up-to-date malfunctioning node set calculated by Bayesian network model is changed compared with current malfunctioning node set, if unchanged, the up-to-date malfunctioning node set obtained is the result that will obtain, otherwise enters the fault location that step 104 carries out next round.

Wherein, before the fault location of carrying out next round, first the up-to-date perfect matrix obtained is transformed into two sub matrixs, then performs the fault location that next round is carried out in step 104 ~ 107, until malfunctioning node set no longer includes change.

Wherein: 201: the characteristic vector u of study sparse matrix and coefficient vector v; 202: the product calculating u and v, obtains perfect matrix.

Wherein, step 107 specifically comprises:

Step 301, conditional probability according to calculation of parameter all-network node to be positioned;

Step 302: calculate send as an envoy to described conditional probability maximum time condition, all node set meeting this condition are the malfunctioning node set that will calculate.

Wherein, in the perfect matrix obtained when adopting MMMF Forecasting Methodology to predict, the real number in element is filled to be [-1,1] interval, absolute value of a real number is less, and the uncertainty of probe is higher.

Wherein, described parameter to be positioned comprises the prior probability of all-network node and the result of all probes.

(3) beneficial effect

The network modelling of reality is become the Bayesian network model of two points by the present invention, this model is conducive to determining the relation between network node and all the other nodes, according to the feature of model, select the set of the probe that will send, but also not exclusively send and just send wherein a little, then the algorithm of the performance prediction end to end MMMF proposed is utilized to predict the result that all the other do not send probe, and then select a small amount of detection and send, to improve the accuracy of prediction, not only make the required number of probes sent in network less, thus avoid network failure to locate the unnecessary network overhead caused, and the accuracy of fault location is also very high, realize the fault location of the low expense of high efficiency.Its key problem in technology point is:

1) bayesian theory real network is utilized to carry out modeling.By setting up Bayesian model, can allowing each network node that their respective states need not be reported just can to carry out fault location, decreasing the network traffics produced because reporting.

2) adopt the method for probe prediction to carry out fault location, and do not send all probes selected.Predict the result of all probes by sending a small amount of probe, thus carry out fault location, so just further mitigate the flow burden in network.

3) adopt and select the most uncertain probe that predicts the outcome to resend from the result of detection predicted, improve the accuracy of prediction.In the process of carrying out probe prediction, if do not select part probe to re-start transmission, then the accuracy of probe prediction is very low, and the result of fault location is just not necessarily accurate.If Stochastic choice part probe re-starts transmission, then may select those probes sent, the accuracy of actual probes prediction improves seldom.And select predict the outcome in the most uncertain probe resend, avoid select those probes sent, also farthest improve the accuracy of probe prediction, thus failing network-node can be navigated to faster.

Accompanying drawing explanation

Fig. 1 is the method flow diagram of the embodiment of the present invention;

Fig. 2 is the schematic diagram that in the method for the embodiment of the present invention, particle carries out location updating;

Fig. 3 is the example of a Bayesian network model;

Fig. 4 is the flow chart utilizing Bayesian network model to carry out fault location;

Fig. 5 shows experimental result.

Embodiment

Below in conjunction with drawings and Examples, the specific embodiment of the present invention is described in further detail.Following examples for illustration of the present invention, but are not used for limiting the scope of the invention.

Mentality of designing of the present invention is: first in network, send a small amount of probe, utilize probe prediction technological prediction end to end to go out the result of remaining probe after obtaining result of detection, finally utilize the node broken down in the Bayesian network model computing network in active probing technique.

First the parameter will used when implementing the localization method of the embodiment of the present invention is described:

Network topology: the i.e. topology of networks of pending fault location, comprise network-termination device, communication equipment and transmission medium, network-termination device and communication equipment can be referred to as the network equipment.Wherein, network-termination device can be computer, server etc.; Communication equipment can be router, switch etc.; Transmission medium can be optical fiber, twisted-pair feeder and wireless etc.;

Probe set M: select multiple (getting 10 in the present embodiment) computer at random as probe base station, ping probe is sent to detect the packet loss between each network equipment according to critical path method (CPM) from these probe base stations to other all-network terminal equipment and communication equipment, namely multiple (getting 100 in the present embodiment) bag is sent with ping probe to other all-network terminal equipment and communication equipment, then detect and have received how many bags, the result of probe is defined as the bag number that receives divided by the bag number sent, set to the probe of the network equipment transmission in network to be positioned constitutes probe set M, wherein, ping probe is the computer network program for measuring end to end connectivity, and it is by sending a request data package and wait-receiving mode response data packet is confirmed whether to be communicated with opposite end to opposite end.

The number of probes K of initial transmission: set the initial number of probes sent as 1/4th (being obtained by experience) of total number of probes in the present embodiment, or add 20 or deduct 20 for 1/4th of total number of probes for 1/4th of total number of probes;

The number of probes N that each selection next round will send: because the number of probes sent at first is fewer, cause the prediction accuracy of the result of probe set M very low, so need select from probe set M some, least determine that the probe predicted the outcome sends, in the present embodiment, the value of N is 5,10,20 or 30.

The method flow diagram of the foundation embodiment of the present invention as shown in Figure 1, comprises the steps:

Step 101: the detection set A that initialization is current and malfunctioning node set F are empty.Wherein, detect set A and represent the issued probe selected from probe set M; Malfunctioning node set F represents the malfunctioning node set utilizing active probing technique to navigate to.

Step 102: select K probe and send from probe set M.

The system of selection of probe: select from probe set M at random, ensures that in the matrix corresponding with probe, every a line and each row are all non-full zeros.

Step 103: obtain a sparse matrix by probe set A, and remove the probe sent from probe set M.

A sparse matrix can be obtained by probe set A, the mode obtained is: the row of this sparse matrix corresponds to the start node of the probe sent, row correspond to the terminal note of the probe sent, if the result that this probe returns is normal, then in corresponding sparse matrix, the value of element is-1, the element value then corresponding for fault is 1, and do not have the element value in the corresponding sparse matrix of probe sent to be 0, the element that is in sparse matrix forms by 1,0 ,-1.

Step 104: utilize MMMF (Max Margin Matrix Factorization, maximal margin matrix decomposition) to predict obtained sparse matrix, obtain a perfect matrix.

The sparse matrix (this matrix is an ill-conditioned matrix) obtained by MMMF calculation procedure 103, obtain a perfect matrix, the probe that all elements in above-mentioned sparse matrix is namely 0 by this computational process is all predicted out.MMMF is the process of a synchronous study characteristic vector v and coefficient vector u.Computational process utilizes the nonindependence in matrix between element to calculate an a characteristic vector v and coefficient vector u, then utilize these two vectorial products to obtain a perfect matrix B=uv, and this perfect matrix is the approximate evaluation of ill-conditioned matrix.

Detailed MMMF predicts the step of ill-conditioned matrix as shown in Figure 2, comprising: step 201: the characteristic vector u and the coefficient vector v that learn out ill-conditioned matrix (namely sparse matrix) to be predicted; Step 202: calculate u*v, obtain the approximate evaluation of this ill-conditioned matrix, i.e. perfect matrix, the element set in this perfect matrix becomes the real number in [-1,1] interval.

Step 105: predict from step 104 in the perfect matrix obtained that selecting the most uncertain N number of probe sends.

The most uncertain probe is defined as: adopt MMMF Forecasting Methodology to carry out in the perfect matrix obtained when predicting, element is filled to be [-1,1] real number in interval, from 0 more close to (namely absolute value of a real number is less), uncertainty is higher, selects the highest N number of probe of uncertainty to send.

The value of N: when carrying out raising prediction accuracy, needs the probe selecting some from the perfect matrix doped to carry out the transmission of next round, and the quantity of each detection selected is different, and it is just different that prediction accuracy improves degree.As can be seen from following table 1, the number of probes selected is less, the accuracy raising degree of prediction is higher, but the number of probes selected is less, the number of repetition of the approximate matrix calculated is more, the time consumed is more, so need to balance the number of probes selected and the relation calculated between approximate matrix number of times.In the present invention, the value of N is 5,10,20 or 30, corresponding with initial N value.

Step 106: the probe utilizing step 105 to send upgrades the perfect matrix doped.

The result of the probe sent by epicycle replaces element value corresponding in the perfect matrix doped.

Step 107: utilize Bayesian network model to carry out the location of malfunctioning node.

Bayesian network model: the node in network and detection connect by Bayesian network model, form two layers of direct directed graph, node wherein in network is considered as father node, probe is considered as child node, the result of probe depends on the result of its father node, when and if only if its all father node is correct, this probe is just correct, and Fig. 3 is expressed as the example of a Bayesian network model.The node most possibly broken down can be located by the probability of malfunction calculating each node.

Figure 3 shows that set up a Bayesian network model example.

Utilize Bayesian network model to carry out the flow process of fault location as shown in Figure 4, comprising:

Step 301, (comprise the prior probability of all-network node and the result of all probes according to parameter to be positioned, described prior probability is according to previous experiences and analyzes the out of order probability of network node obtained, it is an estimated value, value in different network systems is different, this value is 0.9 in the present embodiment) calculate the conditional probability of all-network node, namely calculate P (t ₁, t ₂..., t _n, X ₁, X ₂..., X _m) probability, wherein t _ibe i-th network node, X _jfor jth bar probe, n is network node number, and m is probe number.P (t ₁, t ₂..., t _n, X ₁, X ₂..., X _m) computing formula be

P(t ₁，t ₂，...，t _n，X ₁，X ₂，...，X _m)

＝P(X ₁|Pa(X ₁))P(X ₂|Pa(X ₂))...P(X _m|Pa(X _m))P(t ₁)P(t ₂)...P(t _n)，

In above formula, Pa (X _i) represent all links of probe Xi process.As probe X _iin when having a node failure, P (X _i=1|Pa (X _i))=1, P (X _i=0|Pa (X _i))=0, P (X time trouble-free _i=1|Pa (X _i))=0, P (X _i=0|Pa (X _i))=1.And P (t ₁) P (t ₂) ... P (t _n)=0.1 ^k* 0.9 ^n-k, wherein k corresponds to the number of faults of the node occurred in network, i.e. t _ithe number of=0.Then P (t is calculated ₁, t ₂..., t _n, X ₁, X ₂..., X _m) middle t _i=0 and t _iprobability in=1 situation.Prior probability above-mentioned specifically refers to obtain the out of order probability of network node according to previous experiences and analysis, is an estimated value.

Step 302: calculate the P (t under all situations ₁, t ₂..., t _n, X ₁, X ₂..., X _m) after value, find out P (t ₁, t ₂..., t _n, X ₁, X ₂..., X _m) condition of value when maximum, i.e. t _ivalue (being 0 or 1).All t _ithe node set of=0 is the malfunctioning node set calculated.

The present embodiment can also comprise the following steps:

Step 108: judge whether the up-to-date malfunctioning node set calculated by Bayesian network model is changed compared with current malfunctioning node set, if unchanged, this up-to-date malfunctioning node set obtained is the result that will obtain, otherwise enters the fault location that step 104 carries out next round.When carrying out the fault location of next round, first the up-to-date perfect matrix obtained is transformed into two sub matrixs, namely be greater than in matrix 0 be transformed into 1, be less than 0 be transformed into-1, run MMMF method again, and apply the calculating that Bayesian network model method carries out probability of node failure, repeat above step until malfunctioning node set no longer changes.

The following describes embodiments of the invention:

We test the present invention in this network simulation environment of NS2.First utilize this instrument stochastic generation of GI-ITM topological diagram, number of nodes is 100, and number of links obeys random distribution, and the number of links of topological diagram is respectively 99.In order to the fault in analog network, add error model in simulations, the packet loss of normal link is set as 0.001, and the packet loss of faulty link is set as 0.9.The each number of probes N selecting next round to send of setting is 10, and in the topology of 100 nodes, K is 220, and initialization has detected set A and malfunctioning node set F is sky.Then from probe set M, select 220 probes send, obtain a sparse matrix:

Then utilize MMMF method to predict this sparse matrix, obtain perfect matrix:

Utilize Bayesian network model to calculate the probability of malfunction of all-network node after doping the result of all probes, namely in step 301

P(t ₁，t ₂，...，t _n，X ₁，X ₂，...，X _m)

＝P(X ₁|Pa(X ₁))P(X ₂|Pa(X ₂))...P(X _m|Pa(X _m))P(t ₁)P(t ₂)...P(t _n) (3)，

Wherein X _j, the value of j=1,2...m is corresponding with the value in matrix (2), and in matrix, element is-1, then X _jvalue be 1, element is 1, then X _jbe 0, then t _i=0 and t _ithe situation of=1 substitutes into formula (3) respectively, can obtain t _ip (t when=0 _i)=0.9, t _ip (t when=1 _i)=0.1, then can calculate P (t ₁, t ₂..., t _n, X ₁, X ₂..., X _m) probability in varied situations, find P (t ₁, t ₂..., t _n, X ₁, X ₂..., X _m) maximum that situation, i.e. t _icorresponding value, t _i=0 be malfunctioning node, the malfunctioning node set F calculated like this is { 38, 52, 67}, judge whether this malfunctioning node set changes, because failure collection becomes { 38 by empty set, 52, 67}, just from matrix (2), the minimum element of 10 absolute values is found out, send the probe corresponding to them, and the result obtained is updated in matrix (2), then matrix (2) is transformed into two sub matrixs, namely be greater than in matrix 0 be transformed into 1, be less than 0 be transformed into-1, run MMMF method again, and apply the calculating that Bayesian network model method carries out probability of node failure, repeat above step until malfunctioning node set no longer changes, namely malfunctioning node set is { 38, 53, 64} just no longer changes, then malfunctioning node set { 38, 53, 64} is the result wanted.Fig. 5 lists the result of this experiment.

As seen from Figure 5, the probe that will send required for the present invention is also fewer than 1/3rd of total probe set, simultaneously also can be as seen from Table 1, after predicting, reselect a small amount of probe at every turn re-start the larger of the accuracy raising sending rear prediction, this also make required for the negligible amounts of probe that resends, thus make the required total number of probes sent seldom just can complete fault location.Table 1 is expressed as the relation between the value of N and prediction error rate.

Table 1

The above is only embodiments of the present invention; it should be pointed out that for those skilled in the art, under the prerequisite not departing from the technology of the present invention principle; can also make some improvement and modification, these improve and modification also should be considered as protection scope of the present invention.

Claims

1. based on a network failure locating method for probe prediction, it is characterized in that, comprise the following steps:

Step 102: select K probe and send from probe set M, the system of selection of described probe is: select K probe at random from probe set M, and in the matrix corresponding with a described K probe every a line and each to arrange are all non-full zeros;

Step 103: obtain a sparse matrix by detecting set A, and the probe sent is removed from described probe set M, the acquisition pattern of described sparse matrix is: the row of described sparse matrix corresponds to the start node of the probe sent, row correspond to the terminal note of the probe sent, if the result that this probe returns is normal, element value then in corresponding sparse matrix is-1, if the result that this probe returns is fault, element value then in corresponding sparse matrix is 1, does not have the element value in the corresponding sparse matrix of probe sent to be 0;

Step 104: predict obtained sparse matrix, obtains a perfect matrix, is specially: utilize maximal margin matrix decomposition MMMF to predict obtained sparse matrix;

Step 105: select the most uncertain N number of probe and send from described perfect matrix, wherein, in the perfect matrix obtained when adopting MMMF Forecasting Methodology to predict, element is filled to be [-1,1] real number in interval, absolute value of a real number is less, and the uncertainty of probe is higher;

Step 107: the location of carrying out malfunctioning node, calculates malfunctioning node set, is specially: utilize Bayesian network model to carry out the location of malfunctioning node;

Described step 107 specifically comprises:

Step 301: according to the conditional probability of calculation of parameter all-network node to be positioned, described parameter to be positioned comprises the prior probability of all-network node and the result of all probes;

Step 302: calculate send as an envoy to described conditional probability maximum time condition, meet all t of this condition _ithe node set of=0 is the malfunctioning node set that will calculate, wherein, and t _ibe i-th network node;

Also comprise after step 107: judge whether the up-to-date malfunctioning node set calculated by Bayesian network model is changed compared with current malfunctioning node set, if unchanged, the up-to-date malfunctioning node set obtained is the result that will obtain, otherwise enters the fault location that step 104 carries out next round; Before the fault location of carrying out next round, first the up-to-date perfect matrix obtained is transformed into two sub matrixs, then performs the fault location that next round is carried out in step 104 ~ 107, until malfunctioning node set no longer includes change.

2. the method for claim 1, is characterized in that, step 104 specifically comprises:

Step 201: the characteristic vector u of study sparse matrix and coefficient vector v;

Step 202: the product calculating u and v, obtains perfect matrix.