CN102724078A - End-to-end network flow reconstruction method based on compression sensing in dynamic network - Google Patents

End-to-end network flow reconstruction method based on compression sensing in dynamic network Download PDF

Info

Publication number
CN102724078A
CN102724078A CN2012102251452A CN201210225145A CN102724078A CN 102724078 A CN102724078 A CN 102724078A CN 2012102251452 A CN2012102251452 A CN 2012102251452A CN 201210225145 A CN201210225145 A CN 201210225145A CN 102724078 A CN102724078 A CN 102724078A
Authority
CN
China
Prior art keywords
mtd
flow
mrow
matrix
mtr
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012102251452A
Other languages
Chinese (zh)
Other versions
CN102724078B (en
Inventor
蒋定德
姚成
袁珍
聂来森
许争争
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201210225145.2A priority Critical patent/CN102724078B/en
Publication of CN102724078A publication Critical patent/CN102724078A/en
Application granted granted Critical
Publication of CN102724078B publication Critical patent/CN102724078B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention discloses an end-to-end network flow reconstruction method based on compression sensing in a dynamic network. In a large-scale IP backbone network, the OD flow is selected by a random walk method; the flow value of partial OD flow acquired by a router subset is described by constructing a sparse flow matrix; a compressing sensing reconstruction module is established by adopting a main information analysis method; a relationship between the OD flow generated by the router sublet and all the end-to-end OD flow in the IP backbone network is descried by using the module, and further all the end-to-end OD flow of the whole IP backbone network is determined. The method can more accurately acquire end-to-end network flow detail characteristic, cannot consume mass hardware resources, can track the dynamic changes of the OD flow in a real time manner, and the reconstruction error is less.

Description

End-to-end network flow reconstruction method based on compressed sensing in dynamic network
Technical Field
The invention belongs to the field of flow measurement and analysis under a dynamic network, and particularly relates to an end-to-end network flow reconstruction method based on compressed sensing under the dynamic network.
Background
In recent years, with the rapid development of the internet, an increasing number of network applications provide convenient services to users. But also makes the network increasingly complex. It is also increasingly difficult for network operators to manage and control networks. The traffic matrix is the most important input parameter in network traffic engineering, represents all end-to-end traffic in the network, and completely describes the traffic distribution, and provides the current network state for the network manager, but it is difficult to accurately obtain the traffic matrix in practical application.
Even though the traffic matrix is very important for the network operator, it is very difficult to accurately obtain an estimate of the traffic matrix. Therefore, the traffic matrix estimation becomes a very challenging research topic. In recent years, a great deal of research results have emerged. Vardi proposed the use of network tomography to solve the end-to-end traffic reconstruction problem, and this method was subsequently widely used and used to study IP network internal features. For the network tomography method, the end-to-end flow can be reconstructed through Poisson distribution and Gaussian distribution, but the spatial and temporal correlation of the end-to-end flow cannot be captured; conti et al estimate the flow matrix by calculating Hurst values using a maximum likelihood estimation algorithm. Nucci et al adds weights to the route configuration derived from multiple routing information to obtain a small traffic matrix estimation error. Zhang et al propose to use the gravity field model to describe the characteristic of the current end-to-end flow, and overcome the problem of high morbidity by obtaining additional constraint information; lakhina et al propose a principal component analysis method to directly measure and construct an end-to-end flow reconstruction model; soule et al propose an iterative Bayesian inversion algorithm to reconstruct end-to-end flow based on an independent identically distributed Poisson model assumption of end-to-end flow; liang et al propose a pseudo-likelihood reconstruction method that uses an improved EM algorithm to decompose the problem into several sub-problems containing one OD pair, reducing the error in the estimation accuracy.
Although statistical model correlation techniques can obtain an estimated value of a traffic matrix and are widely used in practice, these methods still cannot accurately estimate the traffic matrix. In fact, measuring end-to-end network traffic directly is more accurate than traffic estimation, and device manufacturers offer a large number of measurement techniques and devices, most typically Netflow. Netflow can obtain an end-to-end network traffic value by extracting and analyzing the source and destination IP addresses, source and destination port numbers, and protocol numbers of the data packets. But measuring each OD flow using NetFlow is impractical, if not impossible. This is because the ports of the router will consume a lot of hardware resources when running the NetFlow program, and the direct measurement will cause extra communication overhead. This reduces the store-and-forward efficiency of the router, thus increasing network latency and easily causing network congestion.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides an end-to-end network flow reconstruction method based on compressed sensing in a dynamic network, so as to achieve the purposes of reducing hardware resources, tracking the dynamic change of an OD flow in real time and reducing reconstruction errors.
A compression sensing-based end-to-end network flow reconstruction method in a dynamic network comprises the following steps:
step 1, setting the length and the random walk times of a system aiming at OD stream random walks;
step 2, in a large-scale IP backbone network, selecting OD flows in a random walk mode, and describing the opening and closing states of all routers by adopting a method for constructing a Boolean sparse measurement matrix according to the selected OD flows;
step 3, describing flow values of part of OD flows collected by a router subset by adopting a method for constructing a sparse flow matrix, wherein the router subset is a router in all opening states, and calculating a measurement value according to a Boolean sparse measurement matrix and the constructed sparse flow matrix;
and 4, according to the sparse measurement matrix constructed in the step 3 and the measurement value obtained by calculation, adopting a principal component analysis method to construct a compressed sensing reconstruction model, describing the relationship between the OD flows generated by the router subset and all end-to-end OD flows in the large-scale IP backbone network by using the model, and further determining all end-to-end OD flow in the whole IP backbone network.
The method for constructing the Boolean sparse measurement matrix in the step 2 comprises the following steps:
step 2-1, constructing an OD flow complete graph, wherein the complete graph comprises vertexes and edges, and the number of the vertexes is equal to the number of the OD flows;
step 2-2: setting an initial value of the random walk measurement times;
step 2-3: selecting a vertex as a unified starting point;
step 2-4: executing uniformly distributed random walks on the complete graph, and according to points found by the random walks, the state of the router corresponding to the points is an open state, otherwise, the state is a closed state;
step 2-5: if the random walk measurement times are smaller than the initialization set random walk measurement times, returning to the step 2-3; if the number of random walk measurements is greater than or equal to the number of initial random walk measurements, executing step 2-6;
step 2-6: and saving the Boolean sparse measurement matrix.
The Boolean sparse measurement matrix in the step 1 accords with a constraint equidistance criterion; the number of rows in the boolean sparse measurement matrix represents the number of times the large-scale IP backbone network is randomly walked, and the number of columns in the boolean sparse measurement matrix represents the total number of OD flows in the large-scale IP backbone network.
The calculation of the measured values described in step 3 satisfies the following formula:
measured value = boolean sparse measurement matrix x sparse flow matrix
The invention has the advantages that:
the invention relates to an end-to-end network flow reconstruction method based on compressed sensing in a dynamic network, which realizes end-to-end network flow reconstruction of a large IP backbone network by a partial direct measurement mode. The method can more accurately acquire the detailed characteristics of the end-to-end network flow, does not consume a large amount of hardware resources, can track the dynamic change of the OD flow in real time, and has smaller reconstruction error.
Drawings
Fig. 1 is a flowchart of a method for reconstructing an end-to-end network traffic based on compressive sensing in a dynamic network according to an embodiment of the present invention;
FIG. 2 is a schematic view of a flow reconstruction framework according to an embodiment of the present invention;
FIG. 3 is a flow chart of constructing a Boolean sparse measurement matrix according to one embodiment of the present invention;
FIG. 4 is a schematic diagram of a Boolean sparse measurement matrix constructed by random walk according to an embodiment of the present invention;
wherein 4-1 is a complete diagram;
FIG. 5 is a schematic representation of the rank variation of the NMAE with flow matrix according to an embodiment of the present invention;
FIG. 6 is a schematic representation of NMAE as a function of random walk length in accordance with an embodiment of the present invention;
FIG. 7 is a diagram illustrating the number of OD flows that need to be measured directly according to one embodiment of the present invention;
FIG. 8 is a schematic diagram illustrating the estimation of the 30 th and 60 th OD flow traffic matrices according to an embodiment of the present invention; a) a schematic diagram of the 30 th OD flow traffic matrix estimation; b) a schematic diagram of the 60 th OD flow matrix estimation;
FIG. 9 is a diagram illustrating a spatial relative error and a temporal relative error according to an embodiment of the present invention; a) is a schematic diagram of spatial relative error; b) is a schematic diagram of relative error of time;
FIG. 10 is a diagram illustrating a cumulative distribution function of spatial relative error and temporal error according to an embodiment of the present invention; a) is a schematic diagram of a spatial relative error cumulative distribution function; b) is a diagram of a time relative error accumulation distribution function.
Detailed Description
The following further describes the embodiments of the present invention with reference to the drawings.
Firstly, constructing a Boolean sparse measurement matrix according to random walk, and obeying RIP (restricted equal distance) criterion; then collecting part of end-to-end flow by using fewer router ports corresponding to the Boolean sparse measurement matrix, and calculating a measurement value according to the Boolean sparse measurement matrix, the sparse flow matrix and the linear relation among the measurement values; and finally, carrying out singular value decomposition on the flow matrix by utilizing a PCA (principal component analysis) model to meet the sparsification condition, and reconstructing all OD flow matrixes based on compressed sensing.
The real flow data of the Abilene backbone network is used in the embodiment of the invention, the real flow data comprises 12 nodes, 30 internal links, 24 external links and 144 end-to-end flows, and the simulation data adopts 5min time intervals and totals 2016 moments.
The traffic matrix in the Abilene backbone network can be as shown in equation (1),
M = m 1 ( 1 ) m 1 ( 2 ) . . . m 1 ( T ) m 2 ( 1 ) m 2 ( 2 ) . . . m 2 ( T ) . . . . . . . . . m N ( 1 ) m N ( 2 ) . . . m N ( T ) - - - ( 1 )
m in the formula (1) represents a traffic matrix, and rows of the matrix M represent the total amount of traffic from a source node to a destination node in the large-scale Abilene backbone network.
Wherein, the number of network nodes is N, and the number of OD streams N = N2. Thus, M is an N × T traffic matrix.
mj(t): represents the t-th time of the j-th OD flow;
m (t): the t-th column representing M;
j∈{1,2,…,N};
t∈{1,2,…,T}。
the compressive sensing theory must satisfy two characteristics: firstly, the measurement matrix meets the RIP (restricted equal distance) criterion; one is that the traffic matrix is a sparse matrix. The following description is made with respect to the requirement for a matrix in the compressive sensing theory in conjunction with equation (2).
Due to the redundancy of the signals, it is assumed that the vector s can be expressed as:
Figure BDA00001834071500041
wherein,
s is a discrete signal N-dimensional space RNAn N × 1 order column vector;
xieach column element being s, i ═ 1,2,3, …, N;
Figure BDA00001834071500042
is a basis vector that is a function of,
Figure BDA00001834071500043
is RNA set of orthogonal bases.
If and only if the signal is composed of K (K)<<N) order basis vector
Figure BDA00001834071500044
The signal s is sparse in the linear combination of the components, i.e. { x in equation (2) }iOnly K elements are non-zero. The signal s is considered compressible in compressed sensing theory. K also becomes the sparsity of the signal.
In the embodiment of the invention, the length of the random walk is set as r, the measurement times are set as p, wherein
Figure BDA00001834071500045
In the embodiment of the invention, the random walk times p =19, the random walk step length r =11, and the rank q =5 (q = K) of the flow matrix, and the Boolean sparse measurement matrix constructed by the method obeys the RIP criterion.
Fig. 1 is a flowchart of a method for reconstructing an end-to-end network traffic based on compressive sensing in a dynamic network according to an embodiment of the present invention, including the following steps:
step 1, setting the length of random walk, the random walk times and the rank of a flow matrix for a router in a system;
the rank of the flow matrix is the sparsity meeting the compressed sensing theory matrix, and the rank q of the flow matrix in the embodiment = 5;
step 2, in a large-scale IP backbone network, selecting OD flows in a random walk mode, and describing the opening and closing states of all routers by adopting a method for constructing a Boolean sparse measurement matrix according to the selected OD flows;
in the embodiment of the invention, a router is selected by adopting a random walk mode, the selected router forms a router subset, the opening or closing state of the selected router subset is described by adopting a method for constructing a Boolean sparse measurement matrix, and a p multiplied by N Boolean sparse measurement matrix A is constructed according to p times of random walks. Fig. 2 is a schematic diagram of a flow-aware reconfiguration framework according to the present invention, in which a network operator collects end-to-end network traffic data on partially operating routers by controlling network devices (including hardware and software) through NetFlow, that is, a network management center controls the routers to be turned on or off, and collects traffic data from NetFlow operating on each router interface. After part of end-to-end network traffic is collected, all OD flow traffic matrixes are recovered in a network management center. Fig. 3 is a flowchart of constructing a boolean sparse measurement matrix according to an embodiment of the present invention, which specifically includes the following steps:
step 2-1, an OD flow complete graph G (V, E) is constructed (as in the complete graph 4-1 in FIG. 4), wherein V and E represent the set of graph vertices and edges, respectively. The number of vertices is equal to the number of OD flows N;
step 2-2: setting a measurement time parameter citeration=1;
Step 2-3: selecting a vertex as a unified starting point;
step 2-4: executing uniformly distributed random walks on G (V, E), and according to the points found by the random walks, the state of the router corresponding to the points is an open state (namely 1), otherwise, the state is a closed state (namely 0);
fig. 4 is a schematic diagram of a boolean sparse measurement matrix constructed by random walk in the present invention, wherein black vertices represent N OD streams, and the vertices are randomly walked. In the embodiment of the present invention, the vertices 10,15,18,70, and 72 are vertices found by random walk, and 10,15,18,70, and 72 OD flows are measured using NetFlow. a isiFor a row of the boolean sparse measurement matrix constructed for random walk, a p × N (19 × 144) boolean sparse measurement matrix a is obtained by p (p = 19) random walks, as follows:
A = a 1 a 2 . . . a p - - - ( 3 )
aiis a row vector of:
ai=[ai1,ai2,ai3,…,aiN],i∈{1,2,3,…,p}. (4)
when a vertex is found by random walk, its corresponding matrix element value is 1. In FIG. 3, aiThe 10 th, 15 th, 18 th, 70 th and 72 th element values are 1, the rest are 0, namely, the setting ai10,ai15,ai18,ai70And ai72“1”。
Step 2-5: if c isiteration<p, then set up citeration=citeration+1, execution returns to step 204; if c isiteration>p, go to step 207;
step 2-6: and saving the Boolean sparse measurement matrix.
Step 3, describing flow values of part of OD flows collected by a router subset by adopting a method for constructing a sparse flow matrix, wherein the router subset is a router in all opening states, and calculating a measurement value according to a Boolean sparse measurement matrix and the constructed sparse flow matrix;
in the embodiment of the invention, all network flows are subjected to zero-mean processing according to formulas (5) to (7).
In the embodiment of the invention, a section of historical flow data is intercepted as prior information, and M is usedhisAnd (4) showing. The zero-mean process is expressed as:
M'=Mhis-Mmean (5)
wherein M' is a sparse flow matrix after zero-mean processing;
Mhisa historical traffic data matrix;
M mean = m 1 mean , m 1 mean , m 1 mean , . . . , m 1 mean . . . m N mean , m N mean , m N mean , . . . , m N mean - - - ( 6 )
equation (6) is an N × T matrix whose elements are defined as follows:
<math> <mrow> <msubsup> <mi>m</mi> <mi>j</mi> <mi>mean</mi> </msubsup> <mo>=</mo> <mfrac> <mn>1</mn> <mi>T</mi> </mfrac> <munderover> <mi>&Sigma;</mi> <mrow> <mi>t</mi> <mo>=</mo> <mn>1</mn> </mrow> <mi>T</mi> </munderover> <msub> <mi>m</mi> <mi>j</mi> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>7</mn> <mo>)</mo> </mrow> </mrow> </math>
wherein: t is the length of intercepting the historical flow;
in the embodiment of the invention, the measurement value Y is calculated by using the historical flow matrix according to the formulas (8) to (10).
the measured value y (t) at time t.
<math> <mrow> <mfenced open='[' close=']'> <mtable> <mtr> <mtd> <msub> <mi>y</mi> <mn>1</mn> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> </mtd> </mtr> <mtr> <mtd> <msub> <mi>y</mi> <mn>2</mn> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <msub> <mi>y</mi> <mi>p</mi> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> </mtd> </mtr> </mtable> </mfenced> <mo>=</mo> <mfenced open='[' close=']'> <mtable> <mtr> <mtd> <msub> <mi>a</mi> <mn>1</mn> </msub> </mtd> </mtr> <mtr> <mtd> <msub> <mi>a</mi> <mn>2</mn> </msub> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <msub> <mi>a</mi> <mi>p</mi> </msub> </mtd> </mtr> </mtable> </mfenced> <mo>&times;</mo> <mfenced open='[' close=']'> <mtable> <mtr> <mtd> <msub> <mi>m</mi> <mn>1</mn> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> </mtd> </mtr> <mtr> <mtd> <msub> <mi>m</mi> <mn>2</mn> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <msub> <mi>m</mi> <mi>N</mi> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> </mtd> </mtr> </mtable> </mfenced> <mo>.</mo> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>8</mn> <mo>)</mo> </mrow> </mrow> </math>
The formula (8) is equivalent to,
y(t)=A×m(t) (9)
for all moments, there is the following equation:
<math> <mrow> <mi>Y</mi> <mo>=</mo> <mi>A</mi> <mo>&times;</mo> <msup> <mi>M</mi> <mo>&prime;</mo> </msup> </mrow> </math>
<math> <mrow> <mo>=</mo> <mfenced open='[' close=']'> <mtable> <mtr> <mtd> <msub> <mi>a</mi> <mn>1</mn> </msub> </mtd> </mtr> <mtr> <mtd> <msub> <mi>a</mi> <mn>2</mn> </msub> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <msub> <mi>a</mi> <mi>p</mi> </msub> </mtd> </mtr> </mtable> </mfenced> <mo>&times;</mo> <mfenced open='[' close=']'> <mtable> <mtr> <mtd> <msub> <mi>m</mi> <mn>1</mn> </msub> <mrow> <mo>(</mo> <mn>1</mn> <mo>)</mo> </mrow> </mtd> <mtd> <msub> <mi>m</mi> <mn>1</mn> </msub> <mrow> <mo>(</mo> <mn>2</mn> <mo>)</mo> </mrow> </mtd> <mtd> <mo>.</mo> <mo>.</mo> <mo>.</mo> </mtd> <mtd> <msub> <mi>m</mi> <mn>1</mn> </msub> <mrow> <mo>(</mo> <mi>T</mi> <mo>)</mo> </mrow> </mtd> </mtr> <mtr> <mtd> <msub> <mi>m</mi> <mn>2</mn> </msub> <mrow> <mo>(</mo> <mn>1</mn> <mo>)</mo> </mrow> </mtd> <mtd> <msub> <mi>m</mi> <mn>2</mn> </msub> <mrow> <mo>(</mo> <mn>2</mn> <mo>)</mo> </mrow> </mtd> <mtd> <mo>.</mo> <mo>.</mo> <mo>.</mo> </mtd> <mtd> <msub> <mi>m</mi> <mn>2</mn> </msub> <mrow> <mo>(</mo> <mi>T</mi> <mo>)</mo> </mrow> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> <mtd> <mo>.</mo> </mtd> <mtd> </mtd> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> <mtd> <mo>.</mo> </mtd> <mtd> </mtd> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> <mtd> <mo>.</mo> </mtd> <mtd> </mtd> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <msub> <mi>m</mi> <mi>N</mi> </msub> <mrow> <mo>(</mo> <mn>1</mn> <mo>)</mo> </mrow> </mtd> <mtd> <msub> <mi>m</mi> <mi>N</mi> </msub> <mrow> <mo>(</mo> <mn>2</mn> <mo>)</mo> </mrow> </mtd> <mtd> <mo>.</mo> <mo>.</mo> <mo>.</mo> </mtd> <mtd> <msub> <mi>m</mi> <mi>N</mi> </msub> <mrow> <mo>(</mo> <mi>T</mi> <mo>)</mo> </mrow> </mtd> </mtr> </mtable> </mfenced> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>10</mn> <mo>)</mo> </mrow> </mrow> </math>
the measured value Y, the boolean sparse measurement matrix a, and the sparse traffic matrix M' are p × T, p × N, and N × T matrices, respectively.
Since a is a sparse random matrix, we can find the measurement value Y by knowing the element value corresponding to a in the matrix M' in equation (10).
In the embodiment of the present invention, a union of OD flow sets of all random walk measurements is obtained by equation (11).
Suppose that
Figure BDA00001834071500071
Represents the set of OD flows that need to be measured by the ith random walk, all the necessary OD flows are represented as:
<math> <mrow> <mfenced open='{' close=''> <mtable> <mtr> <mtd> <msup> <mi>S</mi> <mi>f</mi> </msup> <mo>=</mo> <msubsup> <mo>&cup;</mo> <mrow> <mi>i</mi> <mo>=</mo> <mn>1</mn> </mrow> <mi>p</mi> </msubsup> <msubsup> <mi>S</mi> <mi>i</mi> <mi>f</mi> </msubsup> </mtd> </mtr> <mtr> <mtd> <mi>p</mi> <mo>&le;</mo> <mi>N</mi> </mtd> </mtr> </mtable> </mfenced> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>11</mn> <mo>)</mo> </mrow> </mrow> </math>
wherein: sfRepresenting the union of the OD flows generated from the first random walk to the p-th random walk;
Figure BDA00001834071500073
represents the set of OD flows that need to be measured by the ith random walk;
thus, all the necessary OD streams are generated
Figure BDA00001834071500074
The number of the corresponding necessary OD flows is
Figure BDA00001834071500075
Then the traffic matrix
<math> <mrow> <mover> <mi>M</mi> <mover> <mo>&OverBar;</mo> <mo>&OverBar;</mo> </mover> </mover> <mo>=</mo> <mfenced open='[' close=']'> <mtable> <mtr> <mtd> <msub> <mi>m</mi> <mn>1</mn> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> </mtd> </mtr> <mtr> <mtd> <msub> <mi>m</mi> <mn>2</mn> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <msub> <mi>m</mi> <mi>N</mi> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> </mtd> </mtr> </mtable> </mfenced> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>12</mn> <mo>)</mo> </mrow> </mrow> </math>
Wherein:
Figure BDA00001834071500078
is a sparse matrix containing the necessary OD streams;
Figure BDA00001834071500079
each row represents an OD stream time series, the extraction requiring direct measurement
Figure BDA000018340715000710
Strip OD stream, the remaining OD streams correspond
Figure BDA000018340715000711
Behavior 0 of the matrix. In equation (10), since A is a Boolean sparse measurement matrix, it is only necessary to know the individual rows in MY can be obtained. That is Y = A × M' is equivalent to
Figure BDA000018340715000713
The method of selecting a subset of OD flows to be measured directly is exemplified in detail below. As shown in equation (13):
<math> <mrow> <mfenced open='' close=''> <mtable> <mtr> <mtd> <mfenced open='' close=''> <mtable> <mtr> <mtd> <mi>Y</mi> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> </mtd> <mtd> <msub> <mi>A</mi> <mrow> <mo>(</mo> <mn>5</mn> <mo>&times;</mo> <mn>10</mn> <mo>)</mo> </mrow> </msub> </mtd> <mtd> <mi>m</mi> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> </mtd> </mtr> </mtable> </mfenced> </mtd> </mtr> <mtr> <mtd> <mfenced open='[' close=']'> <mtable> <mtr> <mtd> <msub> <mi>y</mi> <mn>1</mn> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> </mtd> </mtr> <mtr> <mtd> <msub> <mi>y</mi> <mn>2</mn> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> </mtd> </mtr> <mtr> <mtd> <msub> <mi>y</mi> <mn>3</mn> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> </mtd> </mtr> <mtr> <mtd> <msub> <mi>y</mi> <mn>4</mn> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> </mtd> </mtr> <mtr> <mtd> <msub> <mi>y</mi> <mn>5</mn> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> </mtd> </mtr> </mtable> </mfenced> <mo>=</mo> <mfenced open='[' close=']'> <mtable> <mtr> <mtd> <mn>0001010100</mn> </mtd> </mtr> <mtr> <mtd> <mn>0001100000</mn> </mtd> </mtr> <mtr> <mtd> <mn>1000010000</mn> </mtd> </mtr> <mtr> <mtd> <mn>1000110000</mn> </mtd> </mtr> <mtr> <mtd> <mn>0001000000</mn> </mtd> </mtr> </mtable> </mfenced> <mo>&times;</mo> <mfenced open='[' close=']'> <mtable> <mtr> <mtd> <msub> <mi>OD</mi> <mn>1</mn> </msub> </mtd> </mtr> <mtr> <mtd> <mn>0</mn> </mtd> </mtr> <mtr> <mtd> <mn>0</mn> </mtd> </mtr> <mtr> <mtd> <msub> <mi>OD</mi> <mn>4</mn> </msub> </mtd> </mtr> <mtr> <mtd> <msub> <mi>OD</mi> <mn>5</mn> </msub> </mtd> </mtr> <mtr> <mtd> <msub> <mi>OD</mi> <mn>6</mn> </msub> </mtd> </mtr> <mtr> <mtd> <mn>0</mn> </mtd> </mtr> <mtr> <mtd> <msub> <mi>OD</mi> <mn>8</mn> </msub> </mtd> </mtr> <mtr> <mtd> <mn>0</mn> </mtd> </mtr> <mtr> <mtd> <mn>0</mn> </mtd> </mtr> </mtable> </mfenced> </mtd> </mtr> </mtable> </mfenced> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>13</mn> <mo>)</mo> </mrow> </mrow> </math>
in the formula, a boolean sparse measurement matrix a is a sparse matrix composed of "1" or "0", the row number of a represents the number of random walks, and the column number represents the OD flow traversed by each random walk. At any time t, the OD stream to be measured is indexed by a "1" in the column element of matrix a. Then a (5 rows and 10 columns) in fig. 4 is a boolean sparse measurement matrix, i.e. random walk 5 times, and the number of OD streams to traverse is 10. OD flows respectively indexed by using the column element '1' in the A for the first time are divided into OD4, OD6 and OD 8; OD flows indexed for the second time are OD4, OD 5; by analogy, the total OD flows to be measured obtained by 5 random walks are: OD1, OD4, OD5, OD6 and OD 8.
And 4, according to the sparse measurement matrix constructed in the step 3 and the measurement value obtained by calculation, adopting a PCA (principal component analysis) method to construct a compressed sensing reconstruction model, describing the relationship between the OD flow generated by the router subset and all end-to-end OD flows in the large-scale IP backbone network by using the model, and further determining all end-to-end OD flow in the whole IP backbone network.
In order to meet the requirement of compressed sensing on the sparsification characteristic of the traffic matrix, the PCA model is used for the historical traffic matrix in the embodiment of the invention
Figure BDA00001834071500081
Singular value decomposition is performed.
Step 4-1: performing singular value decomposition on the measured value Y;
the principle of singular value decomposition is further explained below with reference to equations (14) to (17).
The Principal Component Analysis (PCA) has the main idea of projecting high dimensional data into a lower dimensional space. The m multiplied by n order flow matrix L in the PCA model is expressed as
L=U∑VT, (14)
Wherein: u is an orthogonal matrix;
UUTor UTU is an identity matrix;
Σ is a singular value of a diagonal matrix whose diagonal tuple value is L;
v is LTA feature vector of L;
LTis the transpose of L.
In practice, the formula (14) is that L utilizes an orthogonal baseU Σ is projected into space. The columns of the L matrix are basis vectors σkukThe normalized basis vector is expressed as:
<math> <mrow> <msub> <mi>u</mi> <mi>k</mi> </msub> <mo>=</mo> <mfrac> <msub> <mi>Lv</mi> <mi>k</mi> </msub> <msub> <mi>&sigma;</mi> <mi>k</mi> </msub> </mfrac> <mo>,</mo> <mi>k</mi> <mo>=</mo> <mn>1,2</mn> <mo>,</mo> <mo>.</mo> <mo>.</mo> <mo>.</mo> <mo>,</mo> <mi>min</mi> <mrow> <mo>(</mo> <mi>m</mi> <mo>,</mo> <mi>n</mi> <mo>)</mo> </mrow> <mo>,</mo> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>15</mn> <mo>)</mo> </mrow> </mrow> </math>
wherein: u. ofk,vkK columns representing U and V, respectively;
min (m, n) represents the rank of the matrix L;
σkit is the singular values of the matrix L that represent the energy characteristics of L.
To facilitate understanding of the low rank approximation process, equation (14) may be expressed as:
<math> <mrow> <mi>L</mi> <mo>=</mo> <msup> <mi>U&Sigma;V</mi> <mi>T</mi> </msup> <mo>=</mo> <munderover> <mi>&Sigma;</mi> <mrow> <mi>k</mi> <mo>=</mo> <mn>1</mn> </mrow> <mrow> <mi>min</mi> <mrow> <mo>(</mo> <mi>m</mi> <mo>,</mo> <mi>n</mi> <mo>)</mo> </mrow> </mrow> </munderover> <msub> <mi>&sigma;</mi> <mi>k</mi> </msub> <msub> <mi>u</mi> <mi>k</mi> </msub> <msubsup> <mi>v</mi> <mi>k</mi> <mi>T</mi> </msubsup> <mo>,</mo> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>16</mn> <mo>)</mo> </mrow> </mrow> </math>
therefore, large singular values can be selected to approximate the true matrix, and this process can be expressed by the mathematical formula:
<math> <mrow> <mover> <mi>L</mi> <mo>^</mo> </mover> <mo>=</mo> <munderover> <mi>&Sigma;</mi> <mrow> <mi>k</mi> <mo>=</mo> <mn>1</mn> </mrow> <mi>q</mi> </munderover> <msub> <mi>&sigma;</mi> <mi>k</mi> </msub> <msub> <mi>u</mi> <mi>k</mi> </msub> <msubsup> <mi>v</mi> <mi>k</mi> <mi>T</mi> </msubsup> <mo>.</mo> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>17</mn> <mo>)</mo> </mrow> </mrow> </math>
wherein: the constant q < min (m, n);
σkis the q largest singular values;
q is
Figure BDA00001834071500093
Is equal to the sparsity.
In the embodiment of the invention, a historical traffic matrix M is assumedhisLength d of>And N is added. According to the formula (14),
Figure BDA00001834071500094
can be decomposed into:
<math> <mrow> <msubsup> <mi>M</mi> <mi>his</mi> <mi>T</mi> </msubsup> <mo>=</mo> <msub> <mi>U</mi> <mi>his</mi> </msub> <msub> <mi>&Sigma;</mi> <mi>his</mi> </msub> <msubsup> <mi>V</mi> <mi>his</mi> <mi>T</mi> </msubsup> <mo>,</mo> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>18</mn> <mo>)</mo> </mrow> </mrow> </math>
wherein, Uhis,∑hisAnd
Figure BDA00001834071500096
are d × N, N × N and N × N matrices, respectively.
Step 4-2: and further analyzing and calculating according to the historical flow matrix and the singular value decomposition method.
V can be calculated according to the formula (18)hishis. Because each OD stream in the traffic matrix has a long correlation, the principal component remains substantially unchanged. Thus, in conjunction with equation (18), equation (10) can be approximately converted to:
Y=A×VhishisUT=ΘUT, (19)
wherein, AVhishis=Θ。
According to equation (19), the first q columns of Θ are truncated, and there is a new matrix ΘcolRepresents;
according to equation (16), Y can be approximately expressed as:
<math> <mrow> <mi>Y</mi> <mo>=</mo> <mi>&Theta;</mi> <msup> <mi>U</mi> <mi>T</mi> </msup> <mo>=</mo> <mi>&Theta;</mi> <mfenced open='[' close=']'> <mtable> <mtr> <mtd> <msubsup> <mi>u</mi> <mi>PC</mi> <mi>T</mi> </msubsup> </mtd> </mtr> <mtr> <mtd> <msup> <mi>u</mi> <mi>T</mi> </msup> </mtd> </mtr> </mtable> </mfenced> <mo>&ap;</mo> <mi>&Theta;</mi> <mfenced open='[' close=']'> <mtable> <mtr> <mtd> <msubsup> <mi>u</mi> <mi>PC</mi> <mi>T</mi> </msubsup> </mtd> </mtr> <mtr> <mtd> <mn>0</mn> </mtd> </mtr> </mtable> </mfenced> <mo>=</mo> <mi>&Theta;</mi> <msubsup> <mi>U</mi> <mi>PC</mi> <mi>T</mi> </msubsup> <mo>,</mo> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>20</mn> <mo>)</mo> </mrow> </mrow> </math>
wherein,and uTAre respectively UTq.times.T (q)<p) and (N-q) xT order submatrices;
Figure BDA00001834071500099
is a sparse matrix with sparsity q.
Step 4-3: matrix sparsification after singular value decomposition is realized through a convex optimization method:
<math> <mrow> <msubsup> <mover> <mi>U</mi> <mo>^</mo> </mover> <mi>PC</mi> <mi>T</mi> </msubsup> <mo>=</mo> <mi>arg</mi> <mi>min</mi> <msub> <mrow> <mo>|</mo> <mo>|</mo> <msubsup> <mi>U</mi> <mi>PC</mi> <mi>T</mi> </msubsup> <mo>|</mo> <mo>|</mo> </mrow> <mn>1</mn> </msub> <mi>s</mi> <mo>.</mo> <mi>t</mi> <mo>.</mo> <mi>&Theta;</mi> <msubsup> <mi>U</mi> <mi>PC</mi> <mi>T</mi> </msubsup> <mo>=</mo> <mi>Y</mi> <mo>,</mo> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>21</mn> <mo>)</mo> </mrow> </mrow> </math>
wherein:
Figure BDA000018340715000911
to make it possible toWhen taking the minimum valueA value of (d);
thus, equation (20) can be transformed into:
<math> <mrow> <mi>Y</mi> <mo>=</mo> <msub> <mi>&Theta;</mi> <mi>col</mi> </msub> <msubsup> <mi>u</mi> <mi>PC</mi> <mi>T</mi> </msubsup> <mo>,</mo> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>22</mn> <mo>)</mo> </mrow> </mrow> </math>
wherein, thetacolIs q of theta<And p columns. Thus, l2The norm minimization problem can be used to solve uPCThat is to say that,
<math> <mrow> <msubsup> <mover> <mi>u</mi> <mo>^</mo> </mover> <mi>PC</mi> <mi>T</mi> </msubsup> <mo>=</mo> <mi>arg</mi> <mi>min</mi> <msub> <mrow> <mo>|</mo> <mo>|</mo> <msubsup> <mi>u</mi> <mi>PC</mi> <mi>T</mi> </msubsup> <mo>|</mo> <mo>|</mo> </mrow> <mn>2</mn> </msub> <mi>s</mi> <mo>.</mo> <mi>t</mi> <mo>.</mo> <msub> <mi>&Theta;</mi> <mi>col</mi> </msub> <msubsup> <mi>u</mi> <mi>PC</mi> <mi>T</mi> </msubsup> <mo>=</mo> <mi>Y</mi> <mo>,</mo> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>23</mn> <mo>)</mo> </mrow> </mrow> </math>
thus, equation (22) is an overdetermined problem and an optimal solution is easily found.
From equation (22), it can be known that ΘcolAnd uPCIs a column full rank matrix. Thus, the following formula can be obtained,
<math> <mrow> <mi>Y</mi> <mo>=</mo> <msub> <mi>u</mi> <mi>y</mi> </msub> <msub> <mi>&Sigma;</mi> <mi>y</mi> </msub> <msubsup> <mi>v</mi> <mi>y</mi> <mi>T</mi> </msubsup> <mo>.</mo> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>24</mn> <mo>)</mo> </mrow> </mrow> </math>
wherein the matrix uyObtained by singular value decomposition of Y;
matrix arrayIs YTA feature vector of Y;
because thetacolAnd uPCAre all column full rank matrices, then ΘcolIs equal to uyRank (Θ)col)=rank(uy) And with thetacolSpace expanded as a basis is equal to uPCSpace expanded for basis (theta)col)=span(uy))。
Step 4-4: the matrix is analyzed in a last step according to the orthogonal transformation.
Calculating R according to equation (25);
the orthogonal transformation matrix obeys the following constraint condition, there is a matrix R satisfying the following formula,
uycolR. (25)
according to the formulas (26), (27), calculation
Figure BDA00001834071500105
Assuming that the matrix W satisfies
Figure BDA00001834071500106
And W thetacolI, I is the identity matrix.
Therefore, the temperature of the molten metal is controlled,
<math> <mrow> <mi>WY</mi> <mo>=</mo> <mi>W</mi> <msub> <mi>&Theta;</mi> <mi>col</mi> </msub> <msubsup> <mi>u</mi> <mi>PC</mi> <mi>T</mi> </msubsup> <mo>=</mo> <msubsup> <mi>u</mi> <mi>PC</mi> <mi>T</mi> </msubsup> <mo>.</mo> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>26</mn> <mo>)</mo> </mrow> </mrow> </math>
in addition to this, the present invention is,
Wuy=WΘcolR=R. (27)
according to (28), calculating
Figure BDA00001834071500108
From equations (26) and (27), we can obtain:
WY = u PC T Wu y = R . - - - ( 28 )
u PC T = ( Ru y T ) Y - - - ( 29 )
obtained according to the above steps
Figure BDA00001834071500112
By calculation of
Figure BDA00001834071500113
Obtaining a flow matrix M according to equation (18)hisAnd (6) estimating the value.
Through the steps, partial end-to-end network flow can be obtained according to direct measurement, and an estimation result of a network flow matrix is obtained. The matrix describes the size of all end-to-end traffic in the large-scale IP backbone.
Performance evaluation results:
fig. 5 is a schematic diagram of rank variation of NMAE with a flow matrix according to an embodiment of the present invention, and the performance of the present invention is evaluated by using Normalized Mean Absolute Error (NMAE), where the rank q in the embodiment of the present invention is 1,2,3,4,5,6,7, and 8 (as shown in fig. 6).
The NMAE calculation formula is:
<math> <mrow> <mi>NMAE</mi> <mo>=</mo> <mfrac> <mrow> <munder> <mi>&Sigma;</mi> <mrow> <mi>j</mi> <mo>,</mo> <mi>t</mi> </mrow> </munder> <mo>|</mo> <msub> <mover> <mi>m</mi> <mo>^</mo> </mover> <mi>j</mi> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> <mo>-</mo> <msub> <mi>m</mi> <mi>j</mi> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> <mo>|</mo> </mrow> <mrow> <munder> <mi>&Sigma;</mi> <mrow> <mi>j</mi> <mo>,</mo> <mi>t</mi> </mrow> </munder> <mo>|</mo> <msub> <mi>m</mi> <mi>j</mi> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>)</mo> </mrow> <mo>|</mo> </mrow> </mfrac> <mo>.</mo> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>29</mn> <mo>)</mo> </mrow> </mrow> </math>
wherein,representing the j-th OD flow estimation value at the t moment;
mjand (t) represents the true value of the j-th OD flow at the t moment.
Simulation results show that the performance of the algorithm of the invention does not change much with the change of q. However q is constrained by some conditions, first the rank q is equivalent to sparsity. That is, in the formula (20), when the rank is q, UTCan be prepared from
Figure BDA00001834071500116
In the approximation that the difference between the first and second values,
Figure BDA00001834071500117
is q sparse. The boolean sparsity measurement matrix a is a p × N matrix, p = O (Klog (N/K)). In the simulation, K = q, and thus, p = O (qlog (N/q)). The number of measurements p directly affects the total number of OD flows that need to be measured. In the embodiment of the invention, q is set to be equal to the total amount of OD streams according to p. In fig. 5, a fluctuation occurs in NMAE because p tends to flow boundary qlog (N/q) at this time. In this case, the smaller the p-qlog (N/q), the greater the probability of reconstruction failure. Fig. 6 is a schematic diagram of the variation of the random walk length of the NMAE according to the embodiment of the present invention, wherein the NMAE basically decreases with the increase of the random walk length. If r is small enough, robustness will be reduced; if r is too large, it will again produce a portion of the OD flow we have to measure.
FIG. 7 shows that the OD flow numbers are normally distributed, the intercept of the straight line represents the mean value of the OD flow numbers, and the slope represents the standard deviation. Simulation results show that: we need about 105 + 120 OD flows to recover the traffic matrix. In the flow compression reconstruction, the PCA method is adopted in the embodiment of the present invention without performing other preprocessing, so that the embodiment of the present invention requires 60% of links to operate NetFlow.
In FIG. 8, the first 500 points are used in the algorithm as a priori information for calculating Vhishis. The 30 th and 60 th OD streams were randomly selected for testing in the examples of the present invention. Three methods are found to track the dynamic changes of the flow in time. However, in fig. 8 (a), large fluctuations occur in SRSVD (sparse reconstruction singular value decomposition) and TomoG (gravity model), and FSR (flow sensing reconstruction) can accurately obtain the 30 th OD stream; fig. 8 (b) clearly shows that SRSVD and TomoG cannot reconstruct 60 OD streams accurately, and that FSR can more accurately derive the trend of flow rate changes. Therefore, experimental results prove that the method can better estimate the end-to-end network traffic.
In order to more accurately evaluate the advantages of the algorithm of the present invention, the embodiment of the present invention compares the three methods with another standard. Since network traffic exhibits time-varying and space-time correlation, embodiments of the present invention analyze FSR characteristics with reference to Spatial Relative Errors (SREs) and temporal relative errors (tress).
SREs and TREs are expressed as:
SRE ( n ) = | | x ^ T ( n ) - x T ( n ) | | 2 | | x T ( n ) | | 2 , n = 1,2 , . . . , N - - - ( 29 )
wherein:
Figure BDA00001834071500122
the nth OD flow estimation value at the T moment is represented;
xTand (n) represents the real value of the nth OD flow at the time T.
TRE ( t ) = | | x ^ N ( t ) - x N ( t ) | | 2 | | x N ( t ) | | 2 , t = 1,2 , . . . , T - - - ( 30 )
Wherein:the estimated value of the Nth OD flow at the time t is shown;
xN(t) represents the true value of the Nth OD flow at time t.
Here, non-negative integers N and T denote the total number of OD flows and the measurement time instant, respectively. I | · | purple wind2Is represented by2And (4) norm. SERs represent the spatial error distribution of different OD streams. While TREs represents the time error distribution.
In FIG. 9 (a), the SRSVD and Tomog methods showed high fluctuations in the OD streams at lanes 10 and 40. Because the first half of the data for this OD flow is very small, the energy occupied in the overall network traffic is particularly small. It is difficult to accurately estimate these OD streams. Further, the average spatial correlation errors of FSR, SRSVD and Tomog were 0.47,0.77 and 0.68, respectively. The FSR method of embodiments of the present invention has the lowest spatial correlation error in the first half of the OD flow. This shows that there is a very stable and efficient estimation characteristic for the small-scale flow FSR. In FIG. 9 (b), all methods have mutations in the time domain. But FSR has very low mutations in small intervals. Meanwhile, the average time correlation errors of FSR, SRSVD, TomoG are 0.20,0.29, 0.26, respectively, so that FSR has the lowest time correlation error.
In the embodiment of the invention, the flow matrix estimation characteristics are obtained through the cumulative distribution functions of the spatial relative errors and the time relative errors of the three methods. FIG. 10 (a) shows that the relative spatial error obtained for the three methods FSR, SRSVD, Tomog is 0.83 for OD flows of about 92%, 73%, 84%. Furthermore, fig. 10 (b) shows that the relative error in time obtained for the embodiment of the present invention is 0.25, 0.31,0.30 for about 80% of the estimated time. This measurement therefore represents the same estimation result for the three methods, with the FSR method employed by embodiments of the present invention having the lowest estimation error.
In conclusion, it is demonstrated that the traffic matrix reconstructed by the FSR method is more accurate.

Claims (4)

1. An end-to-end network flow reconstruction method based on compressed sensing in a dynamic network is characterized in that: the method comprises the following steps:
step 1, setting the length and the random walk times of a system aiming at OD stream random walks;
step 2, in a large-scale IP backbone network, selecting OD flows in a random walk mode, and describing the opening and closing states of all routers by adopting a method for constructing a Boolean sparse measurement matrix according to the selected OD flows;
step 3, describing flow values of part of OD flows collected by a router subset by adopting a method for constructing a sparse flow matrix, wherein the router subset is a router in all opening states, and calculating a measurement value according to a Boolean sparse measurement matrix and the constructed sparse flow matrix;
and 4, according to the sparse measurement matrix constructed in the step 3 and the measurement value obtained by calculation, adopting a principal component analysis method to construct a compressed sensing reconstruction model, describing the relationship between the OD flows generated by the router subset and all end-to-end OD flows in the large-scale IP backbone network by using the model, and further determining all end-to-end OD flow in the whole IP backbone network.
2. The method according to claim 1, wherein the method for reconstructing the end-to-end network traffic based on compressed sensing in the dynamic network comprises: the method for constructing the Boolean sparse measurement matrix in the step 2 comprises the following steps:
step 2-1, constructing an OD flow complete graph, wherein the complete graph comprises vertexes and edges, and the number of the vertexes is equal to the number of the OD flows;
step 2-2: setting an initial value of the random walk measurement times;
step 2-3: selecting a vertex as a unified starting point;
step 2-4: executing uniformly distributed random walks on the complete graph, and according to points found by the random walks, the state of the router corresponding to the points is an open state, otherwise, the state is a closed state;
step 2-5: if the random walk measurement times are smaller than the initialization set random walk measurement times, returning to the step 2-3; if the number of random walk measurements is greater than or equal to the number of initial random walk measurements, executing step 2-6;
step 2-6: and saving the Boolean sparse measurement matrix.
3. The method according to claim 1, wherein the method for reconstructing the end-to-end network traffic based on compressed sensing in the dynamic network comprises: the Boolean sparse measurement matrix in the step 1 accords with a constraint equidistance criterion; the number of rows in the boolean sparse measurement matrix represents the number of times the large-scale IP backbone network is randomly walked, and the number of columns in the boolean sparse measurement matrix represents the total number of OD flows in the large-scale IP backbone network.
4. The method according to claim 1, wherein the method for reconstructing the end-to-end network traffic based on compressed sensing in the dynamic network comprises: the calculation of the measured values described in step 3 satisfies the following formula:
measurement = boolean sparse measurement matrix x sparse traffic matrix.
CN201210225145.2A 2012-06-29 2012-06-29 End-to-end network flow reconstruction method based on compression sensing in dynamic network Expired - Fee Related CN102724078B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210225145.2A CN102724078B (en) 2012-06-29 2012-06-29 End-to-end network flow reconstruction method based on compression sensing in dynamic network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210225145.2A CN102724078B (en) 2012-06-29 2012-06-29 End-to-end network flow reconstruction method based on compression sensing in dynamic network

Publications (2)

Publication Number Publication Date
CN102724078A true CN102724078A (en) 2012-10-10
CN102724078B CN102724078B (en) 2014-12-10

Family

ID=46949748

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210225145.2A Expired - Fee Related CN102724078B (en) 2012-06-29 2012-06-29 End-to-end network flow reconstruction method based on compression sensing in dynamic network

Country Status (1)

Country Link
CN (1) CN102724078B (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103200043A (en) * 2013-03-14 2013-07-10 东北大学 Measurement method for time-varying non-stationary network flow
CN103607292A (en) * 2013-10-28 2014-02-26 国家电网公司 Fast distributed monitoring method for electric-power communication network services
CN105099973A (en) * 2015-08-25 2015-11-25 西安电子科技大学 Measurement matrix calibration method for broadband modulation conversion, compression and acquisition architecture
CN105979541A (en) * 2016-07-18 2016-09-28 国网辽宁省电力有限公司阜新供电公司 Method and system for estimating dynamic traffic in data communication network
CN106357479A (en) * 2016-11-15 2017-01-25 中国人民解放军防空兵学院 Full network flow monitoring method
CN107170233A (en) * 2017-04-20 2017-09-15 同济大学 A kind of typical day transport need OD matrix acquisition methods based on matrix decomposition
CN108156591A (en) * 2017-12-21 2018-06-12 中南大学 The method of data capture that a kind of compressed sensing and random walk combine
CN109088796A (en) * 2018-09-19 2018-12-25 哈尔滨工业大学 Network flow Matrix Prediction Method based on network tomography technology
CN109495328A (en) * 2018-12-30 2019-03-19 李保安 A kind of method of Logistics networks communication reliability
CN110036600A (en) * 2016-09-28 2019-07-19 亚马逊科技公司 The convergence service of network health data
CN110149245A (en) * 2019-05-24 2019-08-20 广州大学 The compressed sensing based high-speed network flow method of sampling and device
CN110855485A (en) * 2019-11-08 2020-02-28 西北工业大学青岛研究院 Method and system for determining network flow of IP backbone network
US11044168B2 (en) 2019-07-02 2021-06-22 Cisco Technology, Inc. Fingerprinting application traffic in a network
CN113379092A (en) * 2020-03-09 2021-09-10 西北工业大学青岛研究院 Backbone network multi-service traffic estimation method and system facing big data

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030043802A1 (en) * 2001-08-31 2003-03-06 Takeki Yazaki Packet transferring method and apparatus that employs the same
CN101150581A (en) * 2007-10-19 2008-03-26 华为技术有限公司 Detection method and device for DDoS attack
CN102136087A (en) * 2011-03-08 2011-07-27 湖南大学 Multi-neural network-based traffic matrix estimation method
CN102325090A (en) * 2011-09-21 2012-01-18 电子科技大学 Network flow estimating method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030043802A1 (en) * 2001-08-31 2003-03-06 Takeki Yazaki Packet transferring method and apparatus that employs the same
CN101150581A (en) * 2007-10-19 2008-03-26 华为技术有限公司 Detection method and device for DDoS attack
CN102136087A (en) * 2011-03-08 2011-07-27 湖南大学 Multi-neural network-based traffic matrix estimation method
CN102325090A (en) * 2011-09-21 2012-01-18 电子科技大学 Network flow estimating method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
裴唯 等: "《城域网应用层流量异常检测与分析》", 《计算机应用研究》 *

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103200043B (en) * 2013-03-14 2015-07-29 东北大学 Non-stationary network flow programming method method is become time a kind of
CN103200043A (en) * 2013-03-14 2013-07-10 东北大学 Measurement method for time-varying non-stationary network flow
CN103607292A (en) * 2013-10-28 2014-02-26 国家电网公司 Fast distributed monitoring method for electric-power communication network services
CN103607292B (en) * 2013-10-28 2017-01-18 国家电网公司 Fast distributed monitoring method for electric-power communication network services
CN105099973A (en) * 2015-08-25 2015-11-25 西安电子科技大学 Measurement matrix calibration method for broadband modulation conversion, compression and acquisition architecture
CN105099973B (en) * 2015-08-25 2018-03-20 西安电子科技大学 The calculation matrix calibration method of compression collection framework is converted for wide-band modulation
CN105979541B (en) * 2016-07-18 2019-05-14 国网辽宁省电力有限公司阜新供电公司 Dynamic flow estimation method and system in a kind of data communication network
CN105979541A (en) * 2016-07-18 2016-09-28 国网辽宁省电力有限公司阜新供电公司 Method and system for estimating dynamic traffic in data communication network
US12068938B2 (en) 2016-09-28 2024-08-20 Amazon Technologies, Inc. Network health data aggregation service
US11641319B2 (en) 2016-09-28 2023-05-02 Amazon Technologies, Inc. Network health data aggregation service
CN110036600B (en) * 2016-09-28 2022-11-22 亚马逊科技公司 Network health data convergence service
CN110036600A (en) * 2016-09-28 2019-07-19 亚马逊科技公司 The convergence service of network health data
CN106357479A (en) * 2016-11-15 2017-01-25 中国人民解放军防空兵学院 Full network flow monitoring method
CN106357479B (en) * 2016-11-15 2019-08-23 中国人民解放军防空兵学院 A kind of whole network flow monitoring method
CN107170233A (en) * 2017-04-20 2017-09-15 同济大学 A kind of typical day transport need OD matrix acquisition methods based on matrix decomposition
CN108156591B (en) * 2017-12-21 2020-08-11 中南大学 Data collection method combining compressed sensing and random walk
CN108156591A (en) * 2017-12-21 2018-06-12 中南大学 The method of data capture that a kind of compressed sensing and random walk combine
CN109088796A (en) * 2018-09-19 2018-12-25 哈尔滨工业大学 Network flow Matrix Prediction Method based on network tomography technology
CN109495328A (en) * 2018-12-30 2019-03-19 李保安 A kind of method of Logistics networks communication reliability
CN110149245A (en) * 2019-05-24 2019-08-20 广州大学 The compressed sensing based high-speed network flow method of sampling and device
US11044168B2 (en) 2019-07-02 2021-06-22 Cisco Technology, Inc. Fingerprinting application traffic in a network
CN110855485A (en) * 2019-11-08 2020-02-28 西北工业大学青岛研究院 Method and system for determining network flow of IP backbone network
CN113379092A (en) * 2020-03-09 2021-09-10 西北工业大学青岛研究院 Backbone network multi-service traffic estimation method and system facing big data
CN113379092B (en) * 2020-03-09 2022-12-09 西北工业大学青岛研究院 Backbone network multi-service traffic estimation method and system facing big data

Also Published As

Publication number Publication date
CN102724078B (en) 2014-12-10

Similar Documents

Publication Publication Date Title
CN102724078A (en) End-to-end network flow reconstruction method based on compression sensing in dynamic network
Jiang et al. A compressive sensing-based approach to end-to-end network traffic reconstruction
Kasai et al. Network volume anomaly detection and identification in large-scale networks based on online time-structured traffic tensor tracking
Zhu et al. Network latency estimation for personal devices: A matrix completion approach
Zhang et al. Spatio-temporal compressive sensing and internet traffic matrices
Chen et al. Robust network compressive sensing
Chua et al. Network kriging
US8725871B2 (en) Systems and methods for application dependency discovery
Forero et al. Prediction of partially observed dynamical processes over networks via dictionary learning
Gjoka et al. 2.5 k-graphs: from sampling to generation
Chou et al. Efficient computation of robust average of compressive sensing data in wireless sensor networks in the presence of sensor faults
Coluccia et al. Distribution-based anomaly detection via generalized likelihood ratio test: A general maximum entropy approach
Zhou et al. Accurate traffic matrix completion based on multi-Gaussian models
Xie et al. Accurate recovery of internet traffic data under dynamic measurements
Jiang et al. A novel hybrid prediction algorithm to network traffic
Ustebay et al. Efficient decentralized approximation via selective gossip
CN111800414A (en) Convolutional neural network-based traffic anomaly detection method and system
Zhou et al. Data reconstruction in internet traffic matrix
Nie et al. A power laws-based reconstruction approach to end-to-end network traffic
Nie et al. A compressive sensing-based reconstruction approach to network traffic
Wu et al. Energy-efficient and robust tensor-encoder for wireless camera networks in Internet of Things
CN102647354A (en) End-to-end flow reconfiguration method in time-varying dynamic network
KR101350782B1 (en) Method and apparatus for sampling data
CN110839253A (en) Method for determining wireless grid network flow
Xie et al. Expectile tensor completion to recover skewed network monitoring data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20141210

Termination date: 20150629

EXPY Termination of patent right or utility model