CN102724078A - End-to-end network flow reconstruction method based on compression sensing in dynamic network - Google Patents
End-to-end network flow reconstruction method based on compression sensing in dynamic network Download PDFInfo
- Publication number
- CN102724078A CN102724078A CN2012102251452A CN201210225145A CN102724078A CN 102724078 A CN102724078 A CN 102724078A CN 2012102251452 A CN2012102251452 A CN 2012102251452A CN 201210225145 A CN201210225145 A CN 201210225145A CN 102724078 A CN102724078 A CN 102724078A
- Authority
- CN
- China
- Prior art keywords
- mtd
- flow
- mrow
- matrix
- mtr
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 54
- 230000006835 compression Effects 0.000 title abstract description 4
- 238000007906 compression Methods 0.000 title abstract description 4
- 239000011159 matrix material Substances 0.000 claims abstract description 120
- 238000005295 random walk Methods 0.000 claims abstract description 50
- 238000005259 measurement Methods 0.000 claims description 72
- 238000004364 calculation method Methods 0.000 claims description 8
- 238000012847 principal component analysis method Methods 0.000 claims description 3
- 238000004458 analytical method Methods 0.000 abstract description 2
- 238000010586 diagram Methods 0.000 description 16
- 238000000513 principal component analysis Methods 0.000 description 9
- 239000013598 vector Substances 0.000 description 9
- 238000000354 decomposition reaction Methods 0.000 description 8
- 238000004422 calculation algorithm Methods 0.000 description 6
- 230000008859 change Effects 0.000 description 4
- 238000005315 distribution function Methods 0.000 description 4
- 238000004088 simulation Methods 0.000 description 4
- 230000002123 temporal effect Effects 0.000 description 4
- ZEFNOZRLAWVAQF-UHFFFAOYSA-N Dinitolmide Chemical group CC1=C(C(N)=O)C=C([N+]([O-])=O)C=C1[N+]([O-])=O ZEFNOZRLAWVAQF-UHFFFAOYSA-N 0.000 description 3
- 230000001186 cumulative effect Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000005484 gravity Effects 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000003325 tomography Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 241000764238 Isis Species 0.000 description 1
- 238000007476 Maximum Likelihood Methods 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000000691 measurement method Methods 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000013146 percutaneous coronary intervention Methods 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 238000013179 statistical model Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Images
Landscapes
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
The invention discloses an end-to-end network flow reconstruction method based on compression sensing in a dynamic network. In a large-scale IP backbone network, the OD flow is selected by a random walk method; the flow value of partial OD flow acquired by a router subset is described by constructing a sparse flow matrix; a compressing sensing reconstruction module is established by adopting a main information analysis method; a relationship between the OD flow generated by the router sublet and all the end-to-end OD flow in the IP backbone network is descried by using the module, and further all the end-to-end OD flow of the whole IP backbone network is determined. The method can more accurately acquire end-to-end network flow detail characteristic, cannot consume mass hardware resources, can track the dynamic changes of the OD flow in a real time manner, and the reconstruction error is less.
Description
Technical Field
The invention belongs to the field of flow measurement and analysis under a dynamic network, and particularly relates to an end-to-end network flow reconstruction method based on compressed sensing under the dynamic network.
Background
In recent years, with the rapid development of the internet, an increasing number of network applications provide convenient services to users. But also makes the network increasingly complex. It is also increasingly difficult for network operators to manage and control networks. The traffic matrix is the most important input parameter in network traffic engineering, represents all end-to-end traffic in the network, and completely describes the traffic distribution, and provides the current network state for the network manager, but it is difficult to accurately obtain the traffic matrix in practical application.
Even though the traffic matrix is very important for the network operator, it is very difficult to accurately obtain an estimate of the traffic matrix. Therefore, the traffic matrix estimation becomes a very challenging research topic. In recent years, a great deal of research results have emerged. Vardi proposed the use of network tomography to solve the end-to-end traffic reconstruction problem, and this method was subsequently widely used and used to study IP network internal features. For the network tomography method, the end-to-end flow can be reconstructed through Poisson distribution and Gaussian distribution, but the spatial and temporal correlation of the end-to-end flow cannot be captured; conti et al estimate the flow matrix by calculating Hurst values using a maximum likelihood estimation algorithm. Nucci et al adds weights to the route configuration derived from multiple routing information to obtain a small traffic matrix estimation error. Zhang et al propose to use the gravity field model to describe the characteristic of the current end-to-end flow, and overcome the problem of high morbidity by obtaining additional constraint information; lakhina et al propose a principal component analysis method to directly measure and construct an end-to-end flow reconstruction model; soule et al propose an iterative Bayesian inversion algorithm to reconstruct end-to-end flow based on an independent identically distributed Poisson model assumption of end-to-end flow; liang et al propose a pseudo-likelihood reconstruction method that uses an improved EM algorithm to decompose the problem into several sub-problems containing one OD pair, reducing the error in the estimation accuracy.
Although statistical model correlation techniques can obtain an estimated value of a traffic matrix and are widely used in practice, these methods still cannot accurately estimate the traffic matrix. In fact, measuring end-to-end network traffic directly is more accurate than traffic estimation, and device manufacturers offer a large number of measurement techniques and devices, most typically Netflow. Netflow can obtain an end-to-end network traffic value by extracting and analyzing the source and destination IP addresses, source and destination port numbers, and protocol numbers of the data packets. But measuring each OD flow using NetFlow is impractical, if not impossible. This is because the ports of the router will consume a lot of hardware resources when running the NetFlow program, and the direct measurement will cause extra communication overhead. This reduces the store-and-forward efficiency of the router, thus increasing network latency and easily causing network congestion.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides an end-to-end network flow reconstruction method based on compressed sensing in a dynamic network, so as to achieve the purposes of reducing hardware resources, tracking the dynamic change of an OD flow in real time and reducing reconstruction errors.
A compression sensing-based end-to-end network flow reconstruction method in a dynamic network comprises the following steps:
and 4, according to the sparse measurement matrix constructed in the step 3 and the measurement value obtained by calculation, adopting a principal component analysis method to construct a compressed sensing reconstruction model, describing the relationship between the OD flows generated by the router subset and all end-to-end OD flows in the large-scale IP backbone network by using the model, and further determining all end-to-end OD flow in the whole IP backbone network.
The method for constructing the Boolean sparse measurement matrix in the step 2 comprises the following steps:
step 2-1, constructing an OD flow complete graph, wherein the complete graph comprises vertexes and edges, and the number of the vertexes is equal to the number of the OD flows;
step 2-2: setting an initial value of the random walk measurement times;
step 2-3: selecting a vertex as a unified starting point;
step 2-4: executing uniformly distributed random walks on the complete graph, and according to points found by the random walks, the state of the router corresponding to the points is an open state, otherwise, the state is a closed state;
step 2-5: if the random walk measurement times are smaller than the initialization set random walk measurement times, returning to the step 2-3; if the number of random walk measurements is greater than or equal to the number of initial random walk measurements, executing step 2-6;
step 2-6: and saving the Boolean sparse measurement matrix.
The Boolean sparse measurement matrix in the step 1 accords with a constraint equidistance criterion; the number of rows in the boolean sparse measurement matrix represents the number of times the large-scale IP backbone network is randomly walked, and the number of columns in the boolean sparse measurement matrix represents the total number of OD flows in the large-scale IP backbone network.
The calculation of the measured values described in step 3 satisfies the following formula:
measured value = boolean sparse measurement matrix x sparse flow matrix
The invention has the advantages that:
the invention relates to an end-to-end network flow reconstruction method based on compressed sensing in a dynamic network, which realizes end-to-end network flow reconstruction of a large IP backbone network by a partial direct measurement mode. The method can more accurately acquire the detailed characteristics of the end-to-end network flow, does not consume a large amount of hardware resources, can track the dynamic change of the OD flow in real time, and has smaller reconstruction error.
Drawings
Fig. 1 is a flowchart of a method for reconstructing an end-to-end network traffic based on compressive sensing in a dynamic network according to an embodiment of the present invention;
FIG. 2 is a schematic view of a flow reconstruction framework according to an embodiment of the present invention;
FIG. 3 is a flow chart of constructing a Boolean sparse measurement matrix according to one embodiment of the present invention;
FIG. 4 is a schematic diagram of a Boolean sparse measurement matrix constructed by random walk according to an embodiment of the present invention;
wherein 4-1 is a complete diagram;
FIG. 5 is a schematic representation of the rank variation of the NMAE with flow matrix according to an embodiment of the present invention;
FIG. 6 is a schematic representation of NMAE as a function of random walk length in accordance with an embodiment of the present invention;
FIG. 7 is a diagram illustrating the number of OD flows that need to be measured directly according to one embodiment of the present invention;
FIG. 8 is a schematic diagram illustrating the estimation of the 30 th and 60 th OD flow traffic matrices according to an embodiment of the present invention; a) a schematic diagram of the 30 th OD flow traffic matrix estimation; b) a schematic diagram of the 60 th OD flow matrix estimation;
FIG. 9 is a diagram illustrating a spatial relative error and a temporal relative error according to an embodiment of the present invention; a) is a schematic diagram of spatial relative error; b) is a schematic diagram of relative error of time;
FIG. 10 is a diagram illustrating a cumulative distribution function of spatial relative error and temporal error according to an embodiment of the present invention; a) is a schematic diagram of a spatial relative error cumulative distribution function; b) is a diagram of a time relative error accumulation distribution function.
Detailed Description
The following further describes the embodiments of the present invention with reference to the drawings.
Firstly, constructing a Boolean sparse measurement matrix according to random walk, and obeying RIP (restricted equal distance) criterion; then collecting part of end-to-end flow by using fewer router ports corresponding to the Boolean sparse measurement matrix, and calculating a measurement value according to the Boolean sparse measurement matrix, the sparse flow matrix and the linear relation among the measurement values; and finally, carrying out singular value decomposition on the flow matrix by utilizing a PCA (principal component analysis) model to meet the sparsification condition, and reconstructing all OD flow matrixes based on compressed sensing.
The real flow data of the Abilene backbone network is used in the embodiment of the invention, the real flow data comprises 12 nodes, 30 internal links, 24 external links and 144 end-to-end flows, and the simulation data adopts 5min time intervals and totals 2016 moments.
The traffic matrix in the Abilene backbone network can be as shown in equation (1),
m in the formula (1) represents a traffic matrix, and rows of the matrix M represent the total amount of traffic from a source node to a destination node in the large-scale Abilene backbone network.
Wherein, the number of network nodes is N, and the number of OD streams N = N2. Thus, M is an N × T traffic matrix.
mj(t): represents the t-th time of the j-th OD flow;
m (t): the t-th column representing M;
j∈{1,2,…,N};
t∈{1,2,…,T}。
the compressive sensing theory must satisfy two characteristics: firstly, the measurement matrix meets the RIP (restricted equal distance) criterion; one is that the traffic matrix is a sparse matrix. The following description is made with respect to the requirement for a matrix in the compressive sensing theory in conjunction with equation (2).
Due to the redundancy of the signals, it is assumed that the vector s can be expressed as:
wherein,
s is a discrete signal N-dimensional space RNAn N × 1 order column vector;
xieach column element being s, i ═ 1,2,3, …, N;
If and only if the signal is composed of K (K)<<N) order basis vectorThe signal s is sparse in the linear combination of the components, i.e. { x in equation (2) }iOnly K elements are non-zero. The signal s is considered compressible in compressed sensing theory. K also becomes the sparsity of the signal.
In the embodiment of the invention, the length of the random walk is set as r, the measurement times are set as p, whereinIn the embodiment of the invention, the random walk times p =19, the random walk step length r =11, and the rank q =5 (q = K) of the flow matrix, and the Boolean sparse measurement matrix constructed by the method obeys the RIP criterion.
Fig. 1 is a flowchart of a method for reconstructing an end-to-end network traffic based on compressive sensing in a dynamic network according to an embodiment of the present invention, including the following steps:
the rank of the flow matrix is the sparsity meeting the compressed sensing theory matrix, and the rank q of the flow matrix in the embodiment = 5;
in the embodiment of the invention, a router is selected by adopting a random walk mode, the selected router forms a router subset, the opening or closing state of the selected router subset is described by adopting a method for constructing a Boolean sparse measurement matrix, and a p multiplied by N Boolean sparse measurement matrix A is constructed according to p times of random walks. Fig. 2 is a schematic diagram of a flow-aware reconfiguration framework according to the present invention, in which a network operator collects end-to-end network traffic data on partially operating routers by controlling network devices (including hardware and software) through NetFlow, that is, a network management center controls the routers to be turned on or off, and collects traffic data from NetFlow operating on each router interface. After part of end-to-end network traffic is collected, all OD flow traffic matrixes are recovered in a network management center. Fig. 3 is a flowchart of constructing a boolean sparse measurement matrix according to an embodiment of the present invention, which specifically includes the following steps:
step 2-1, an OD flow complete graph G (V, E) is constructed (as in the complete graph 4-1 in FIG. 4), wherein V and E represent the set of graph vertices and edges, respectively. The number of vertices is equal to the number of OD flows N;
step 2-2: setting a measurement time parameter citeration=1;
Step 2-3: selecting a vertex as a unified starting point;
step 2-4: executing uniformly distributed random walks on G (V, E), and according to the points found by the random walks, the state of the router corresponding to the points is an open state (namely 1), otherwise, the state is a closed state (namely 0);
fig. 4 is a schematic diagram of a boolean sparse measurement matrix constructed by random walk in the present invention, wherein black vertices represent N OD streams, and the vertices are randomly walked. In the embodiment of the present invention, the vertices 10,15,18,70, and 72 are vertices found by random walk, and 10,15,18,70, and 72 OD flows are measured using NetFlow. a isiFor a row of the boolean sparse measurement matrix constructed for random walk, a p × N (19 × 144) boolean sparse measurement matrix a is obtained by p (p = 19) random walks, as follows:
aiis a row vector of:
ai=[ai1,ai2,ai3,…,aiN],i∈{1,2,3,…,p}. (4)
when a vertex is found by random walk, its corresponding matrix element value is 1. In FIG. 3, aiThe 10 th, 15 th, 18 th, 70 th and 72 th element values are 1, the rest are 0, namely, the setting ai10,ai15,ai18,ai70And ai72“1”。
Step 2-5: if c isiteration<p, then set up citeration=citeration+1, execution returns to step 204; if c isiteration>p, go to step 207;
step 2-6: and saving the Boolean sparse measurement matrix.
in the embodiment of the invention, all network flows are subjected to zero-mean processing according to formulas (5) to (7).
In the embodiment of the invention, a section of historical flow data is intercepted as prior information, and M is usedhisAnd (4) showing. The zero-mean process is expressed as:
M'=Mhis-Mmean (5)
wherein M' is a sparse flow matrix after zero-mean processing;
Mhisa historical traffic data matrix;
equation (6) is an N × T matrix whose elements are defined as follows:
wherein: t is the length of intercepting the historical flow;
in the embodiment of the invention, the measurement value Y is calculated by using the historical flow matrix according to the formulas (8) to (10).
the measured value y (t) at time t.
The formula (8) is equivalent to,
y(t)=A×m(t) (9)
for all moments, there is the following equation:
the measured value Y, the boolean sparse measurement matrix a, and the sparse traffic matrix M' are p × T, p × N, and N × T matrices, respectively.
Since a is a sparse random matrix, we can find the measurement value Y by knowing the element value corresponding to a in the matrix M' in equation (10).
In the embodiment of the present invention, a union of OD flow sets of all random walk measurements is obtained by equation (11).
Suppose thatRepresents the set of OD flows that need to be measured by the ith random walk, all the necessary OD flows are represented as:
wherein: sfRepresenting the union of the OD flows generated from the first random walk to the p-th random walk;
thus, all the necessary OD streams are generatedThe number of the corresponding necessary OD flows isThen the traffic matrix
each row represents an OD stream time series, the extraction requiring direct measurementStrip OD stream, the remaining OD streams correspond Behavior 0 of the matrix. In equation (10), since A is a Boolean sparse measurement matrix, it is only necessary to know the individual rows in MY can be obtained. That is Y = A × M' is equivalent to
The method of selecting a subset of OD flows to be measured directly is exemplified in detail below. As shown in equation (13):
in the formula, a boolean sparse measurement matrix a is a sparse matrix composed of "1" or "0", the row number of a represents the number of random walks, and the column number represents the OD flow traversed by each random walk. At any time t, the OD stream to be measured is indexed by a "1" in the column element of matrix a. Then a (5 rows and 10 columns) in fig. 4 is a boolean sparse measurement matrix, i.e. random walk 5 times, and the number of OD streams to traverse is 10. OD flows respectively indexed by using the column element '1' in the A for the first time are divided into OD4, OD6 and OD 8; OD flows indexed for the second time are OD4, OD 5; by analogy, the total OD flows to be measured obtained by 5 random walks are: OD1, OD4, OD5, OD6 and OD 8.
And 4, according to the sparse measurement matrix constructed in the step 3 and the measurement value obtained by calculation, adopting a PCA (principal component analysis) method to construct a compressed sensing reconstruction model, describing the relationship between the OD flow generated by the router subset and all end-to-end OD flows in the large-scale IP backbone network by using the model, and further determining all end-to-end OD flow in the whole IP backbone network.
In order to meet the requirement of compressed sensing on the sparsification characteristic of the traffic matrix, the PCA model is used for the historical traffic matrix in the embodiment of the inventionSingular value decomposition is performed.
Step 4-1: performing singular value decomposition on the measured value Y;
the principle of singular value decomposition is further explained below with reference to equations (14) to (17).
The Principal Component Analysis (PCA) has the main idea of projecting high dimensional data into a lower dimensional space. The m multiplied by n order flow matrix L in the PCA model is expressed as
L=U∑VT, (14)
Wherein: u is an orthogonal matrix;
UUTor UTU is an identity matrix;
Σ is a singular value of a diagonal matrix whose diagonal tuple value is L;
v is LTA feature vector of L;
LTis the transpose of L.
In practice, the formula (14) is that L utilizes an orthogonal baseU Σ is projected into space. The columns of the L matrix are basis vectors σkukThe normalized basis vector is expressed as:
wherein: u. ofk,vkK columns representing U and V, respectively;
min (m, n) represents the rank of the matrix L;
σkit is the singular values of the matrix L that represent the energy characteristics of L.
To facilitate understanding of the low rank approximation process, equation (14) may be expressed as:
therefore, large singular values can be selected to approximate the true matrix, and this process can be expressed by the mathematical formula:
wherein: the constant q < min (m, n);
σkis the q largest singular values;
In the embodiment of the invention, a historical traffic matrix M is assumedhisLength d of>And N is added. According to the formula (14),can be decomposed into:
Step 4-2: and further analyzing and calculating according to the historical flow matrix and the singular value decomposition method.
V can be calculated according to the formula (18)his∑his. Because each OD stream in the traffic matrix has a long correlation, the principal component remains substantially unchanged. Thus, in conjunction with equation (18), equation (10) can be approximately converted to:
Y=A×Vhis∑hisUT=ΘUT, (19)
wherein, AVhis∑his=Θ。
According to equation (19), the first q columns of Θ are truncated, and there is a new matrix ΘcolRepresents;
according to equation (16), Y can be approximately expressed as:
wherein,and uTAre respectively UTq.times.T (q)<p) and (N-q) xT order submatrices;
Step 4-3: matrix sparsification after singular value decomposition is realized through a convex optimization method:
thus, equation (20) can be transformed into:
wherein, thetacolIs q of theta<And p columns. Thus, l2The norm minimization problem can be used to solve uPCThat is to say that,
thus, equation (22) is an overdetermined problem and an optimal solution is easily found.
From equation (22), it can be known that ΘcolAnd uPCIs a column full rank matrix. Thus, the following formula can be obtained,
wherein the matrix uyObtained by singular value decomposition of Y;
matrix arrayIs YTA feature vector of Y;
because thetacolAnd uPCAre all column full rank matrices, then ΘcolIs equal to uyRank (Θ)col)=rank(uy) And with thetacolSpace expanded as a basis is equal to uPCSpace expanded for basis (theta)col)=span(uy))。
Step 4-4: the matrix is analyzed in a last step according to the orthogonal transformation.
Calculating R according to equation (25);
the orthogonal transformation matrix obeys the following constraint condition, there is a matrix R satisfying the following formula,
uy=ΘcolR. (25)
Therefore, the temperature of the molten metal is controlled,
in addition to this, the present invention is,
Wuy=WΘcolR=R. (27)
From equations (26) and (27), we can obtain:
By calculation ofObtaining a flow matrix M according to equation (18)hisAnd (6) estimating the value.
Through the steps, partial end-to-end network flow can be obtained according to direct measurement, and an estimation result of a network flow matrix is obtained. The matrix describes the size of all end-to-end traffic in the large-scale IP backbone.
Performance evaluation results:
fig. 5 is a schematic diagram of rank variation of NMAE with a flow matrix according to an embodiment of the present invention, and the performance of the present invention is evaluated by using Normalized Mean Absolute Error (NMAE), where the rank q in the embodiment of the present invention is 1,2,3,4,5,6,7, and 8 (as shown in fig. 6).
The NMAE calculation formula is:
wherein,representing the j-th OD flow estimation value at the t moment;
mjand (t) represents the true value of the j-th OD flow at the t moment.
Simulation results show that the performance of the algorithm of the invention does not change much with the change of q. However q is constrained by some conditions, first the rank q is equivalent to sparsity. That is, in the formula (20), when the rank is q, UTCan be prepared fromIn the approximation that the difference between the first and second values,is q sparse. The boolean sparsity measurement matrix a is a p × N matrix, p = O (Klog (N/K)). In the simulation, K = q, and thus, p = O (qlog (N/q)). The number of measurements p directly affects the total number of OD flows that need to be measured. In the embodiment of the invention, q is set to be equal to the total amount of OD streams according to p. In fig. 5, a fluctuation occurs in NMAE because p tends to flow boundary qlog (N/q) at this time. In this case, the smaller the p-qlog (N/q), the greater the probability of reconstruction failure. Fig. 6 is a schematic diagram of the variation of the random walk length of the NMAE according to the embodiment of the present invention, wherein the NMAE basically decreases with the increase of the random walk length. If r is small enough, robustness will be reduced; if r is too large, it will again produce a portion of the OD flow we have to measure.
FIG. 7 shows that the OD flow numbers are normally distributed, the intercept of the straight line represents the mean value of the OD flow numbers, and the slope represents the standard deviation. Simulation results show that: we need about 105 + 120 OD flows to recover the traffic matrix. In the flow compression reconstruction, the PCA method is adopted in the embodiment of the present invention without performing other preprocessing, so that the embodiment of the present invention requires 60% of links to operate NetFlow.
In FIG. 8, the first 500 points are used in the algorithm as a priori information for calculating Vhis∑his. The 30 th and 60 th OD streams were randomly selected for testing in the examples of the present invention. Three methods are found to track the dynamic changes of the flow in time. However, in fig. 8 (a), large fluctuations occur in SRSVD (sparse reconstruction singular value decomposition) and TomoG (gravity model), and FSR (flow sensing reconstruction) can accurately obtain the 30 th OD stream; fig. 8 (b) clearly shows that SRSVD and TomoG cannot reconstruct 60 OD streams accurately, and that FSR can more accurately derive the trend of flow rate changes. Therefore, experimental results prove that the method can better estimate the end-to-end network traffic.
In order to more accurately evaluate the advantages of the algorithm of the present invention, the embodiment of the present invention compares the three methods with another standard. Since network traffic exhibits time-varying and space-time correlation, embodiments of the present invention analyze FSR characteristics with reference to Spatial Relative Errors (SREs) and temporal relative errors (tress).
SREs and TREs are expressed as:
xTand (n) represents the real value of the nth OD flow at the time T.
Wherein:the estimated value of the Nth OD flow at the time t is shown;
xN(t) represents the true value of the Nth OD flow at time t.
Here, non-negative integers N and T denote the total number of OD flows and the measurement time instant, respectively. I | · | purple wind2Is represented by2And (4) norm. SERs represent the spatial error distribution of different OD streams. While TREs represents the time error distribution.
In FIG. 9 (a), the SRSVD and Tomog methods showed high fluctuations in the OD streams at lanes 10 and 40. Because the first half of the data for this OD flow is very small, the energy occupied in the overall network traffic is particularly small. It is difficult to accurately estimate these OD streams. Further, the average spatial correlation errors of FSR, SRSVD and Tomog were 0.47,0.77 and 0.68, respectively. The FSR method of embodiments of the present invention has the lowest spatial correlation error in the first half of the OD flow. This shows that there is a very stable and efficient estimation characteristic for the small-scale flow FSR. In FIG. 9 (b), all methods have mutations in the time domain. But FSR has very low mutations in small intervals. Meanwhile, the average time correlation errors of FSR, SRSVD, TomoG are 0.20,0.29, 0.26, respectively, so that FSR has the lowest time correlation error.
In the embodiment of the invention, the flow matrix estimation characteristics are obtained through the cumulative distribution functions of the spatial relative errors and the time relative errors of the three methods. FIG. 10 (a) shows that the relative spatial error obtained for the three methods FSR, SRSVD, Tomog is 0.83 for OD flows of about 92%, 73%, 84%. Furthermore, fig. 10 (b) shows that the relative error in time obtained for the embodiment of the present invention is 0.25, 0.31,0.30 for about 80% of the estimated time. This measurement therefore represents the same estimation result for the three methods, with the FSR method employed by embodiments of the present invention having the lowest estimation error.
In conclusion, it is demonstrated that the traffic matrix reconstructed by the FSR method is more accurate.
Claims (4)
1. An end-to-end network flow reconstruction method based on compressed sensing in a dynamic network is characterized in that: the method comprises the following steps:
step 1, setting the length and the random walk times of a system aiming at OD stream random walks;
step 2, in a large-scale IP backbone network, selecting OD flows in a random walk mode, and describing the opening and closing states of all routers by adopting a method for constructing a Boolean sparse measurement matrix according to the selected OD flows;
step 3, describing flow values of part of OD flows collected by a router subset by adopting a method for constructing a sparse flow matrix, wherein the router subset is a router in all opening states, and calculating a measurement value according to a Boolean sparse measurement matrix and the constructed sparse flow matrix;
and 4, according to the sparse measurement matrix constructed in the step 3 and the measurement value obtained by calculation, adopting a principal component analysis method to construct a compressed sensing reconstruction model, describing the relationship between the OD flows generated by the router subset and all end-to-end OD flows in the large-scale IP backbone network by using the model, and further determining all end-to-end OD flow in the whole IP backbone network.
2. The method according to claim 1, wherein the method for reconstructing the end-to-end network traffic based on compressed sensing in the dynamic network comprises: the method for constructing the Boolean sparse measurement matrix in the step 2 comprises the following steps:
step 2-1, constructing an OD flow complete graph, wherein the complete graph comprises vertexes and edges, and the number of the vertexes is equal to the number of the OD flows;
step 2-2: setting an initial value of the random walk measurement times;
step 2-3: selecting a vertex as a unified starting point;
step 2-4: executing uniformly distributed random walks on the complete graph, and according to points found by the random walks, the state of the router corresponding to the points is an open state, otherwise, the state is a closed state;
step 2-5: if the random walk measurement times are smaller than the initialization set random walk measurement times, returning to the step 2-3; if the number of random walk measurements is greater than or equal to the number of initial random walk measurements, executing step 2-6;
step 2-6: and saving the Boolean sparse measurement matrix.
3. The method according to claim 1, wherein the method for reconstructing the end-to-end network traffic based on compressed sensing in the dynamic network comprises: the Boolean sparse measurement matrix in the step 1 accords with a constraint equidistance criterion; the number of rows in the boolean sparse measurement matrix represents the number of times the large-scale IP backbone network is randomly walked, and the number of columns in the boolean sparse measurement matrix represents the total number of OD flows in the large-scale IP backbone network.
4. The method according to claim 1, wherein the method for reconstructing the end-to-end network traffic based on compressed sensing in the dynamic network comprises: the calculation of the measured values described in step 3 satisfies the following formula:
measurement = boolean sparse measurement matrix x sparse traffic matrix.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210225145.2A CN102724078B (en) | 2012-06-29 | 2012-06-29 | End-to-end network flow reconstruction method based on compression sensing in dynamic network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210225145.2A CN102724078B (en) | 2012-06-29 | 2012-06-29 | End-to-end network flow reconstruction method based on compression sensing in dynamic network |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102724078A true CN102724078A (en) | 2012-10-10 |
CN102724078B CN102724078B (en) | 2014-12-10 |
Family
ID=46949748
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210225145.2A Expired - Fee Related CN102724078B (en) | 2012-06-29 | 2012-06-29 | End-to-end network flow reconstruction method based on compression sensing in dynamic network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102724078B (en) |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103200043A (en) * | 2013-03-14 | 2013-07-10 | 东北大学 | Measurement method for time-varying non-stationary network flow |
CN103607292A (en) * | 2013-10-28 | 2014-02-26 | 国家电网公司 | Fast distributed monitoring method for electric-power communication network services |
CN105099973A (en) * | 2015-08-25 | 2015-11-25 | 西安电子科技大学 | Measurement matrix calibration method for broadband modulation conversion, compression and acquisition architecture |
CN105979541A (en) * | 2016-07-18 | 2016-09-28 | 国网辽宁省电力有限公司阜新供电公司 | Method and system for estimating dynamic traffic in data communication network |
CN106357479A (en) * | 2016-11-15 | 2017-01-25 | 中国人民解放军防空兵学院 | Full network flow monitoring method |
CN107170233A (en) * | 2017-04-20 | 2017-09-15 | 同济大学 | A kind of typical day transport need OD matrix acquisition methods based on matrix decomposition |
CN108156591A (en) * | 2017-12-21 | 2018-06-12 | 中南大学 | The method of data capture that a kind of compressed sensing and random walk combine |
CN109088796A (en) * | 2018-09-19 | 2018-12-25 | 哈尔滨工业大学 | Network flow Matrix Prediction Method based on network tomography technology |
CN109495328A (en) * | 2018-12-30 | 2019-03-19 | 李保安 | A kind of method of Logistics networks communication reliability |
CN110036600A (en) * | 2016-09-28 | 2019-07-19 | 亚马逊科技公司 | The convergence service of network health data |
CN110149245A (en) * | 2019-05-24 | 2019-08-20 | 广州大学 | The compressed sensing based high-speed network flow method of sampling and device |
CN110855485A (en) * | 2019-11-08 | 2020-02-28 | 西北工业大学青岛研究院 | Method and system for determining network flow of IP backbone network |
US11044168B2 (en) | 2019-07-02 | 2021-06-22 | Cisco Technology, Inc. | Fingerprinting application traffic in a network |
CN113379092A (en) * | 2020-03-09 | 2021-09-10 | 西北工业大学青岛研究院 | Backbone network multi-service traffic estimation method and system facing big data |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030043802A1 (en) * | 2001-08-31 | 2003-03-06 | Takeki Yazaki | Packet transferring method and apparatus that employs the same |
CN101150581A (en) * | 2007-10-19 | 2008-03-26 | 华为技术有限公司 | Detection method and device for DDoS attack |
CN102136087A (en) * | 2011-03-08 | 2011-07-27 | 湖南大学 | Multi-neural network-based traffic matrix estimation method |
CN102325090A (en) * | 2011-09-21 | 2012-01-18 | 电子科技大学 | Network flow estimating method |
-
2012
- 2012-06-29 CN CN201210225145.2A patent/CN102724078B/en not_active Expired - Fee Related
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030043802A1 (en) * | 2001-08-31 | 2003-03-06 | Takeki Yazaki | Packet transferring method and apparatus that employs the same |
CN101150581A (en) * | 2007-10-19 | 2008-03-26 | 华为技术有限公司 | Detection method and device for DDoS attack |
CN102136087A (en) * | 2011-03-08 | 2011-07-27 | 湖南大学 | Multi-neural network-based traffic matrix estimation method |
CN102325090A (en) * | 2011-09-21 | 2012-01-18 | 电子科技大学 | Network flow estimating method |
Non-Patent Citations (1)
Title |
---|
裴唯 等: "《城域网应用层流量异常检测与分析》", 《计算机应用研究》 * |
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103200043B (en) * | 2013-03-14 | 2015-07-29 | 东北大学 | Non-stationary network flow programming method method is become time a kind of |
CN103200043A (en) * | 2013-03-14 | 2013-07-10 | 东北大学 | Measurement method for time-varying non-stationary network flow |
CN103607292A (en) * | 2013-10-28 | 2014-02-26 | 国家电网公司 | Fast distributed monitoring method for electric-power communication network services |
CN103607292B (en) * | 2013-10-28 | 2017-01-18 | 国家电网公司 | Fast distributed monitoring method for electric-power communication network services |
CN105099973A (en) * | 2015-08-25 | 2015-11-25 | 西安电子科技大学 | Measurement matrix calibration method for broadband modulation conversion, compression and acquisition architecture |
CN105099973B (en) * | 2015-08-25 | 2018-03-20 | 西安电子科技大学 | The calculation matrix calibration method of compression collection framework is converted for wide-band modulation |
CN105979541B (en) * | 2016-07-18 | 2019-05-14 | 国网辽宁省电力有限公司阜新供电公司 | Dynamic flow estimation method and system in a kind of data communication network |
CN105979541A (en) * | 2016-07-18 | 2016-09-28 | 国网辽宁省电力有限公司阜新供电公司 | Method and system for estimating dynamic traffic in data communication network |
US12068938B2 (en) | 2016-09-28 | 2024-08-20 | Amazon Technologies, Inc. | Network health data aggregation service |
US11641319B2 (en) | 2016-09-28 | 2023-05-02 | Amazon Technologies, Inc. | Network health data aggregation service |
CN110036600B (en) * | 2016-09-28 | 2022-11-22 | 亚马逊科技公司 | Network health data convergence service |
CN110036600A (en) * | 2016-09-28 | 2019-07-19 | 亚马逊科技公司 | The convergence service of network health data |
CN106357479A (en) * | 2016-11-15 | 2017-01-25 | 中国人民解放军防空兵学院 | Full network flow monitoring method |
CN106357479B (en) * | 2016-11-15 | 2019-08-23 | 中国人民解放军防空兵学院 | A kind of whole network flow monitoring method |
CN107170233A (en) * | 2017-04-20 | 2017-09-15 | 同济大学 | A kind of typical day transport need OD matrix acquisition methods based on matrix decomposition |
CN108156591B (en) * | 2017-12-21 | 2020-08-11 | 中南大学 | Data collection method combining compressed sensing and random walk |
CN108156591A (en) * | 2017-12-21 | 2018-06-12 | 中南大学 | The method of data capture that a kind of compressed sensing and random walk combine |
CN109088796A (en) * | 2018-09-19 | 2018-12-25 | 哈尔滨工业大学 | Network flow Matrix Prediction Method based on network tomography technology |
CN109495328A (en) * | 2018-12-30 | 2019-03-19 | 李保安 | A kind of method of Logistics networks communication reliability |
CN110149245A (en) * | 2019-05-24 | 2019-08-20 | 广州大学 | The compressed sensing based high-speed network flow method of sampling and device |
US11044168B2 (en) | 2019-07-02 | 2021-06-22 | Cisco Technology, Inc. | Fingerprinting application traffic in a network |
CN110855485A (en) * | 2019-11-08 | 2020-02-28 | 西北工业大学青岛研究院 | Method and system for determining network flow of IP backbone network |
CN113379092A (en) * | 2020-03-09 | 2021-09-10 | 西北工业大学青岛研究院 | Backbone network multi-service traffic estimation method and system facing big data |
CN113379092B (en) * | 2020-03-09 | 2022-12-09 | 西北工业大学青岛研究院 | Backbone network multi-service traffic estimation method and system facing big data |
Also Published As
Publication number | Publication date |
---|---|
CN102724078B (en) | 2014-12-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102724078A (en) | End-to-end network flow reconstruction method based on compression sensing in dynamic network | |
Jiang et al. | A compressive sensing-based approach to end-to-end network traffic reconstruction | |
Kasai et al. | Network volume anomaly detection and identification in large-scale networks based on online time-structured traffic tensor tracking | |
Zhu et al. | Network latency estimation for personal devices: A matrix completion approach | |
Zhang et al. | Spatio-temporal compressive sensing and internet traffic matrices | |
Chen et al. | Robust network compressive sensing | |
Chua et al. | Network kriging | |
US8725871B2 (en) | Systems and methods for application dependency discovery | |
Forero et al. | Prediction of partially observed dynamical processes over networks via dictionary learning | |
Gjoka et al. | 2.5 k-graphs: from sampling to generation | |
Chou et al. | Efficient computation of robust average of compressive sensing data in wireless sensor networks in the presence of sensor faults | |
Coluccia et al. | Distribution-based anomaly detection via generalized likelihood ratio test: A general maximum entropy approach | |
Zhou et al. | Accurate traffic matrix completion based on multi-Gaussian models | |
Xie et al. | Accurate recovery of internet traffic data under dynamic measurements | |
Jiang et al. | A novel hybrid prediction algorithm to network traffic | |
Ustebay et al. | Efficient decentralized approximation via selective gossip | |
CN111800414A (en) | Convolutional neural network-based traffic anomaly detection method and system | |
Zhou et al. | Data reconstruction in internet traffic matrix | |
Nie et al. | A power laws-based reconstruction approach to end-to-end network traffic | |
Nie et al. | A compressive sensing-based reconstruction approach to network traffic | |
Wu et al. | Energy-efficient and robust tensor-encoder for wireless camera networks in Internet of Things | |
CN102647354A (en) | End-to-end flow reconfiguration method in time-varying dynamic network | |
KR101350782B1 (en) | Method and apparatus for sampling data | |
CN110839253A (en) | Method for determining wireless grid network flow | |
Xie et al. | Expectile tensor completion to recover skewed network monitoring data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20141210 Termination date: 20150629 |
|
EXPY | Termination of patent right or utility model |