CN106301950B - A kind of analysis method and analytical equipment of OD flow - Google Patents
A kind of analysis method and analytical equipment of OD flow Download PDFInfo
- Publication number
- CN106301950B CN106301950B CN201610809587.XA CN201610809587A CN106301950B CN 106301950 B CN106301950 B CN 106301950B CN 201610809587 A CN201610809587 A CN 201610809587A CN 106301950 B CN106301950 B CN 106301950B
- Authority
- CN
- China
- Prior art keywords
- matrix
- flow
- rank
- principal component
- formula
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000004458 analytical method Methods 0.000 title claims abstract description 44
- 239000011159 matrix material Substances 0.000 claims abstract description 449
- 238000000513 principal component analysis Methods 0.000 claims abstract description 95
- 238000005457 optimization Methods 0.000 claims abstract description 76
- 230000003190 augmentative effect Effects 0.000 claims abstract description 48
- 238000000034 method Methods 0.000 claims abstract description 42
- 238000000354 decomposition reaction Methods 0.000 claims description 41
- 238000004422 calculation algorithm Methods 0.000 claims description 26
- 238000004364 calculation method Methods 0.000 claims description 25
- 230000008602 contraction Effects 0.000 claims description 19
- 238000005206 flow analysis Methods 0.000 claims description 17
- 238000005070 sampling Methods 0.000 claims description 9
- 108010046685 Rho Factor Proteins 0.000 claims description 6
- 230000002159 abnormal effect Effects 0.000 abstract description 20
- 238000005259 measurement Methods 0.000 abstract description 18
- 239000013598 vector Substances 0.000 abstract description 7
- 230000000694 effects Effects 0.000 abstract description 6
- 230000000737 periodic effect Effects 0.000 abstract description 5
- 230000006870 function Effects 0.000 description 16
- 238000012545 processing Methods 0.000 description 11
- 238000005516 engineering process Methods 0.000 description 3
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000012067 mathematical method Methods 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000007480 spreading Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/14—Network analysis or design
- H04L41/142—Network analysis or design using statistical or mathematical methods
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/14—Network analysis or design
- H04L41/145—Network analysis or design involving simulating, designing, planning or modelling of a network
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
- H04L43/0876—Network utilisation, e.g. volume of load or congestion level
- H04L43/0888—Throughput
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Mathematical Physics (AREA)
- Probability & Statistics with Applications (AREA)
- Pure & Applied Mathematics (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Algebra (AREA)
- Environmental & Geological Engineering (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
The present invention provides the analysis methods and analytical equipment of a kind of OD flow, it is related to network technique field, solves the problems, such as that the analytical effect for showing as the OD flow of the impact noise of random value to abnormal flow, burst flow and measurement noise etc. in the prior art is undesirable.A kind of analysis method of OD flow, periodic samples obtain the whole network OD flow, and the whole network OD flow is converted into traffic matrix;It is modeled using Robust Principal Component Analysis model traffic matrix, obtains the Robust Principal Component Analysis model of traffic matrix;The Robust Principal Component Analysis model of traffic matrix carries out convex optimization relaxation, obtains the convex Optimized model of Robust Principal Component Analysis;It is solved using convex Optimized model of the augmented vector approach to Robust Principal Component Analysis, traffic matrix is decomposed into low-rank matrix and sparse matrix;Obtained low-rank matrix and sparse matrix are exported.The embodiment of the present invention is for the analysis to OD flow.
Description
Technical Field
The present invention relates to the field of network technologies, and in particular, to an OD traffic analysis method and an OD traffic analysis device.
Background
With the increasing scale of networks, the variation of parameters of various technologies used in the networks is more complicated. For a network operator, it is necessary to clearly know how network traffic passes through a network of the network operator, and on this basis, the network may be designed and planned to perform works such as traffic engineering, network traffic anomaly detection, congestion control, and the like. However, many network traffic analysis efforts are limited to isolated single links, and the statistics and analysis of global traffic over a full network link is still very challenging. In order to observe the characteristics and flow direction of network traffic from the perspective of the whole network, it is necessary to use Origin and Destination (OD) flow to count and describe the network traffic. The OD flows reflect the actual traffic between each pair of nodes in the network, and are the set of all traffic entering the same ingress node and leaving the same egress node. Compared with local link traffic, the OD flow can more intuitively reflect the basic properties of the network, is the basis of global traffic analysis and is also the main input parameter of traffic engineering. Therefore, the statistics and analysis of the OD flows become a research hotspot in the field of home and abroad networks, and have important practical significance.
Although the OD traffic essentially reflects the actual condition of the network, there are difficulties in the processing and analysis of the OD traffic. The most significant problem is the high-dimensional multi-element structure of the OD stream. For example, even a medium-scale network may contain hundreds of OD streams, and the analysis of the OD streams requires spreading in the time dimension, further increasing the dimensionality of the data to be analyzed. On one hand, high-dimensional information contains abundant information for mining and utilization, and on the other hand, the large-scale data volume greatly increases the difficulty of analyzing and processing the data, which brings about a so-called dimensional disaster. Thus, the high dimensional nature of the OD flows is a difficult point in the overall network traffic analysis problem.
The common practice for processing high-dimensional data is to find a linear subspace of low-dimensional alternating projections that retains the important properties of the original data to approximate the high-dimensional data. The dimension reduction technology can better explore the information redundancy of high-dimensional data and the correlation between the data, and becomes an effective tool for understanding the original data structure. The most common method for processing high-dimensional data is Principal Component Analysis (PCA), given a set of high-dimensional data, PCA can search a new low-dimensional subspace, and the projection of the original data in the coordinate space has the smallest error, thereby achieving the purpose of reducing dimensions. When a high-dimensional data can be approximated by a low-dimensional subspace, we refer to the dimension of the low-dimensional space as the feature dimension.
PCA is very sensitive to the type of noise contained in the raw data, and can achieve a better estimation effect when facing additive gaussian noise, while the estimation effect is greatly affected when facing impulsive noise such as random values. In a network environment, short-term abnormal traffic, burst traffic and network abnormal conditions often occur, namely, impact noise with random values appears, and then the effect of performing dimensionality reduction and analysis on the OD stream by using the PCA is not ideal.
In summary, there is a problem in the prior art that the analysis effect on OD traffic, such as abnormal traffic, bursty traffic, and measurement noise, which shows impact noise of random values, is not ideal.
Disclosure of Invention
Embodiments of the present invention provide an OD traffic analysis method and an OD traffic analysis device, which solve the problem in the prior art that the OD traffic analysis effect is not ideal for impulse noise representing a random value, such as abnormal traffic, bursty traffic, measurement noise, and the like.
In order to achieve the above purpose, the embodiment of the invention adopts the following technical scheme:
in a first aspect, an embodiment of the present invention provides a method for analyzing an OD flow, including:
periodically sampling to obtain the OD flow of the whole network, and converting the OD flow of the whole network into a flow matrix;
modeling the flow matrix by adopting a robust principal component analysis model to obtain the robust principal component analysis model of the flow matrix;
performing convex optimization relaxation on the robust principal component analysis model of the flow matrix to obtain a convex optimization model of the robust principal component analysis;
solving a convex optimization model of robust principal component analysis by adopting an augmented Lagrange multiplier method, and decomposing a flow matrix into a low-rank matrix and a sparse matrix; the low-rank matrix is a low-rank matrix corresponding to the OD flow without the impact noise with a random value and represents an inherent low-rank structure of the flow matrix; the sparse matrix is a sparse matrix corresponding to the OD flow of the impact noise containing the random value;
and outputting the obtained low-rank matrix and sparse matrix.
Specifically, the modeling of the traffic matrix by using the robust principal component analysis model to obtain the robust principal component analysis model of the traffic matrix includes:
modeling the flow matrix by adopting a robust principal component analysis model to obtain the robust principal component analysis model of the flow matrix:
wherein,to represent the minimization of the target matrix A and matrix E, A is a low rank matrix, D is a traffic matrix, E is a sparse matrix, rank (A) represents the rank of matrix A, | E |0Is the number of non-zero elements in the matrix E, gamma is a compromise factor and is greater than 0, and s.t. is a constraint condition.
Specifically, performing convex optimization relaxation on the robust principal component analysis model of the flow matrix to obtain a convex optimization model of the robust principal component analysis, including:
performing convex optimization relaxation on the robust principal component analysis model of the flow matrix to obtain a convex optimization model of the robust principal component analysis:
wherein,to indicate that the minimum A of the target matrix A and the matrix E is a low rank matrix, D is a flow matrix, E is a sparse matrix, | A |*Representing the sum of all singular values, | E | in the matrix A1Is the sum of the absolute values of all elements in the matrix E, λ is the relaxation factor, and s.t. is the constraint.
Specifically, solving a convex optimization model of robust principal component analysis by adopting an augmented Lagrange multiplier method, and decomposing a flow matrix into a low-rank matrix and a sparse matrix, wherein the method comprises the following steps:
constructing an augmented Lagrangian function of the convex optimization model:
‖A‖*represents the sum of all singular values, | E | in the low rank matrix A1Is the sum of the absolute values of all elements in the sparse matrix E, λ is the relaxation factor, Y is the lagrange multiplier matrix,D-A-E is a constraint condition for increasing the penalty of the Lagrange function,<Y,D-A-E>the inner product of a Lagrange multiplier matrix Y and a constraint condition D-A-E is obtained, mu is a positive parameter, and F is the Frobenius norm of the matrix;
presetting a soft threshold shrinking operator and a singular value soft threshold operator, wherein the soft threshold shrinking operator is as follows:
wherein, when x>When epsilon, Sε(x) Equal to x-epsilon; when x is<When epsilon, Sε(x) Equal to x + ε;
when x ═ epsilon, Sε(x) Is equal to zero;
the singular value soft threshold operator is:
Dτ(X)=U Sτ(∑)VTa third formula;
wherein X ═ U ∑ VTSingular value decomposition for the matrix;
solving by using an augmented Lagrange multiplier algorithm:
setting initial parameters Y0 ═ 0, E0 ═ 0 and mu0Greater than 0, rho0Greater than 0, k is 0;
wherein Y0 denotes a Lagrangian multiplier matrix, E0 denotes a sparse matrix, k is an iteration number, k is an integer greater than or equal to 0, and mu0Is the initial value of the contraction factor, ρ is a parameter greater than 1, used to update the contraction factor μ;
by usingFormula IV
Performing singular value decomposition;
wherein svd represents singular value decomposition; here, the initialized parameter k is 0 and Y0=0、E0=0、μ0Substituting rho into the formula IV, and obtaining a matrix U, a matrix S and a matrix V through singular value decomposition;
by usingFormula five
Solving for Ak+1;
Substituting k as 0 into formula five, and substituting matrix S obtained from formula four into singular value soft threshold operator in formula three to obtain A1;
by usingFormula six
Solving for Ek+1;
Substituting k as 0 into the formula six, and substituting the matrix A1 obtained from the formula five into a soft threshold shrinking operator in the formula two to obtain E1;
using Yk+1=Yk+μk(D-Ak+1-Ek+1) Formula seven
Solving for Yk+1;
Taking k as 0 to the seven formula, updating a Lagrange multiplier matrix Y according to the residual value D-A-E, and obtaining Y1;
this completes the first iteration of the algorithm, updating the contraction factor, μk+1=ρμk(ii) a And k is k + 1;
and converging the augmented Lagrange multiplier algorithm to an optimal solution through iterative calculation to obtain the low-rank matrix A and the sparse matrix E of the flow matrix decomposition.
In a second aspect, an embodiment of the present invention provides an OD flow rate analyzing apparatus, including:
the data acquisition unit is used for periodically sampling and acquiring the OD flow of the whole network and converting the OD flow of the whole network into a flow matrix;
the modeling unit is used for modeling the traffic matrix converted by the data acquisition unit by adopting a robust principal component analysis model to obtain the robust principal component analysis model of the traffic matrix;
the optimization unit is used for performing convex optimization relaxation on the robust principal component analysis model of the traffic matrix obtained by the modeling unit to obtain a convex optimization model of the robust principal component analysis;
the solving unit is used for solving the convex optimization model of the robust principal component analysis obtained by the optimizing unit by adopting a non-precise augmented Lagrange multiplier method to obtain a recovered low-rank matrix and a recovered sparse matrix in the flow matrix; the low-rank matrix is a low-rank matrix corresponding to OD (origin-destination) flow without impact noise with a random value and represents an inherent low-rank structure of the flow matrix; the sparse matrix is a sparse matrix corresponding to the OD flow of the impact noise containing the random value;
and the data output unit is used for outputting the low-rank matrix and the sparse matrix obtained by the solving unit.
Specifically, the modeling unit is specifically configured to model the traffic matrix by using a robust principal component analysis model to obtain the robust principal component analysis model of the traffic matrix:
wherein,to represent the minimization of the target matrix A and matrix E, A is a low rank matrix, D is a traffic matrix, E is a sparse matrix, rank (A) represents the rank of matrix A,
‖E‖0is the number of non-zero elements in the matrix E, gamma is a compromise factor and is greater than 0, and s.t. is a constraint condition.
Specifically, the optimization unit is configured to perform convex optimization relaxation on the robust principal component analysis model of the traffic matrix to obtain a convex optimization model of the robust principal component analysis:
wherein,to represent that the minimization A of a target matrix A and a matrix E is the low rank matrix, D is the traffic matrix, E is the sparse matrix, | A |*Represents the sum of all singular values, | E | in the matrix A1Is the sum of the absolute values of all elements in the matrix E, λ is the relaxation factor, and s.t. is the constraint.
Specifically, the solving unit is specifically configured to construct an augmented lagrangian function of the convex optimization model:
‖A‖*represents the sum of all singular values, | E | in the low rank matrix A1Is the sum of the absolute values of all elements in the sparse matrix E, λ is the relaxation factor, Y is the lagrange multiplier matrix,D-A-E is a constraint condition for increasing the penalty of the Lagrange function,<Y,D-A-E>the inner product of a Lagrange multiplier matrix Y and a constraint condition D-A-E is obtained, mu is a positive parameter, and F is the Frobenius norm of the matrix;
presetting a soft threshold shrinking operator and a singular value soft threshold operator, wherein the soft threshold shrinking operator is as follows:
wherein, when x>When epsilon, Sε(x) Equal to x-epsilon; when x is<When epsilon, Sε(x) Equal to x + ε;
when x ═ epsilon, Sε(x) Is equal to zero;
the singular value soft threshold operator is:
Dτ(X)=U Sτ(∑)VTa third formula;
wherein X ═ U ∑ VTSingular value decomposition for the matrix;
solving by using an augmented Lagrange multiplier algorithm:
setting initial parameters Y0 ═ 0, E0 ═ 0 and mu0Greater than 0, rho0Greater than 0, k is 0;
wherein Y0 denotes a Lagrangian multiplier matrix, E0 denotes a sparse matrix, k is an iteration number, k is an integer greater than or equal to 0, and mu0Is the initial value of the contraction factor, ρ is a parameter greater than 1, used to update the contraction factor μ;
by usingFormula IV
Performing singular value decomposition;
wherein svd represents singular value decomposition; here, the initialized parameter k is 0 and Y0=0、E0=0、μ0Substituting rho into the formula IV, and obtaining a matrix U, a matrix S and a matrix V through singular value decomposition;
by usingFormula five
Solving for Ak+1;
Substituting k as 0 into formula five, and substituting matrix S obtained from formula four into singular value soft threshold operator in formula three to obtain A1;
by usingFormula six
Solving for Ek+1;
Substituting k as 0 into the formula six, and substituting the matrix A1 obtained from the formula five into a soft threshold shrinking operator in the formula two to obtain E1;
using Yk+1=Yk+μk(D-Ak+1-Ek+1) Formula seven
Solving for Yk+1;
Taking k as 0 to the seven formula, updating a Lagrange multiplier matrix Y according to the residual value D-A-E, and obtaining Y1;
this completes the first iteration of the algorithm, updating the contraction factor, μk+1=ρμk(ii) a And k is k + 1;
and converging the augmented Lagrange multiplier algorithm to an optimal solution through iterative calculation to obtain the low-rank matrix A and the sparse matrix E of the flow matrix decomposition.
The embodiment of the invention provides an OD flow analysis method, which comprises the steps of utilizing a robust component analysis model to model a flow matrix of total network OD flow obtained by periodic sampling, adopting a relaxation method to carry out convex optimization on the flow matrix, converting the flow matrix into a convex optimization model of robust principal component analysis, utilizing an augmented Lagrange multiplier method to solve the convex optimization model of robust principal component analysis, and decomposing the flow matrix into a low-rank matrix and a sparse matrix, so that the analysis problem of the flow matrix is converted into the decomposition problem of the low-rank matrix and the sparse matrix; the low-rank matrix is a low-rank matrix corresponding to the OD flow which does not contain the impact noise with a random value; the sparse matrix is a sparse matrix corresponding to the OD flow of the impact noise containing the random value; compared with PCA, the OD traffic analysis method provided by the embodiment of the invention can better find out a potential low-rank matrix of the traffic matrix of the OD traffic of the whole network in the calculation of the OD traffic of the impact noise which is represented as a random value, such as abnormal traffic, burst traffic, measurement noise and the like, and solves the problem that the calculation result of the OD traffic of the impact noise which is represented as a random value, such as abnormal traffic, burst traffic, measurement noise and the like, is not ideal in the prior art.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a flowchart of an OD flow analysis method according to an embodiment of the present invention;
fig. 2 is a schematic structural diagram of an OD flow analysis apparatus according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The embodiment of the invention provides an OD flow analysis method, which comprises the steps of utilizing a robust component analysis model to model a flow matrix of total network OD flow obtained by periodic sampling, adopting a relaxation method to carry out convex optimization on the flow matrix, converting the flow matrix into a convex optimization model of robust principal component analysis, utilizing an augmented Lagrange multiplier method to solve the convex optimization model of robust principal component analysis, and decomposing the flow matrix into a low-rank matrix and a sparse matrix, so that the analysis problem of the flow matrix is converted into the decomposition problem of the low-rank matrix and the sparse matrix; the low-rank matrix is a low-rank matrix corresponding to the OD flow which does not contain the impact noise with a random value; the sparse matrix is a sparse matrix corresponding to the OD flow of the impact noise containing the random value; compared with PCA, the OD traffic analysis method provided by the embodiment of the invention can better find out a potential low-rank matrix of the traffic matrix of the OD traffic of the whole network in the calculation of the OD traffic of the impact noise which is represented as a random value, such as abnormal traffic, burst traffic, measurement noise and the like, and solves the problem that the calculation result of the OD traffic of the impact noise which is represented as a random value, such as abnormal traffic, burst traffic, measurement noise and the like, is not ideal in the prior art.
It should be noted that the method for analyzing the OD traffic provided by the embodiment of the present invention may be implemented in a hardware manner, or may be implemented in a software instruction executed by a processor. The software instructions may be comprised of corresponding software modules that may be stored in Random Access Memory (RAM), flash Memory, Read Only Memory (ROM), Erasable Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), registers, a hard disk, a removable disk, a compact disc Read Only Memory (CD-ROM), or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor such the processor can read information from, and write information to, the storage medium. Of course, the storage medium may also be integral to the processor. The processor and the storage medium may reside in an ASIC. The OD flow analysis method and the OD flow analysis device provided by the embodiment of the invention are positioned in the IP backbone network equipment, and can capture the OD flow in real time or capture periodically by setting a capture period in the IP backbone network equipment, and then analyze the OD flow; the OD flow analysis device provided by the embodiment of the present invention or the electronic device using the OD flow analysis method provided by the embodiment of the present invention may be used for analyzing the OD flow.
The first embodiment of the present invention provides a method for analyzing OD traffic, as shown in fig. 1, including:
and periodically sampling to obtain the OD flow of the whole network, and converting the OD flow of the whole network into a flow matrix.
And modeling the flow matrix by adopting a robust principal component analysis model to obtain the robust principal component analysis model of the flow matrix.
And performing convex optimization relaxation on the robust principal component analysis model of the flow matrix to obtain a convex optimization model of the robust principal component analysis.
Solving a convex optimization model of robust principal component analysis by adopting an augmented Lagrange multiplier method, and decomposing a flow matrix into a low-rank matrix and a sparse matrix; the low-rank matrix is a low-rank matrix corresponding to the OD flow without the impact noise with a random value and represents an inherent low-rank structure of the flow matrix; the sparse matrix is a sparse matrix corresponding to the OD traffic of the impact noise containing the random value.
And outputting the obtained low-rank matrix and sparse matrix.
The embodiment of the invention provides an OD flow analysis method, which comprises the steps of utilizing a robust component analysis model to model a flow matrix of total network OD flow obtained by periodic sampling, adopting a relaxation method to carry out convex optimization on the flow matrix, converting the flow matrix into a convex optimization model of robust principal component analysis, utilizing an augmented Lagrange multiplier method to solve the convex optimization model of robust principal component analysis, and decomposing the flow matrix into a low-rank matrix and a sparse matrix, so that the analysis problem of the flow matrix is converted into the decomposition problem of the low-rank matrix and the sparse matrix; the low-rank matrix is a low-rank matrix corresponding to the OD flow which does not contain the impact noise with a random value; the sparse matrix is a sparse matrix corresponding to the OD flow of the impact noise containing the random value; compared with PCA, the OD traffic analysis method provided by the embodiment of the invention can better find out a potential low-rank matrix of the traffic matrix of the OD traffic of the whole network in the calculation of the OD traffic of the impact noise which is represented as a random value, such as abnormal traffic, burst traffic, measurement noise and the like, and solves the problem that the calculation result of the OD traffic of the impact noise which is represented as a random value, such as abnormal traffic, burst traffic, measurement noise and the like, is not ideal in the prior art.
The second embodiment of the present invention provides a method for analyzing OD traffic, as shown in fig. 1, including:
and periodically sampling to obtain the OD flow of the whole network, and converting the OD flow of the whole network into a flow matrix.
It should be noted that the OD traffic analysis method provided by the embodiment of the present invention first converts periodically sampled and acquired full-network OD traffic data into a traffic matrix for analysis; for a network with N nodes, the total number of OD traffic is N ═ N2Then, OD flow data are acquired through T periods; the OD flow of each cycle forms a column vector d ═ d1,d2,...,dN]TArranged in a periodic sequence T, the OD traffic matrix D can be represented as:
the obtained OD traffic matrix D is a traffic matrix corresponding to the OD traffic of the whole network, wherein the traffic matrix is a matrix of dimension N × T.
Modeling the flow matrix by adopting a robust principal component analysis model to obtain the robust principal component analysis model of the flow matrix:
wherein,to represent the minimization of the target matrix A and matrix E, A is a low rank matrix, D is a traffic matrix, E is a sparse matrix, rank (A) represents the rank of matrix A, | E |0Is the number of non-zero elements in the matrix E, gamma is a compromise factor and is greater than 0, and s.t. is a constraint condition.
It should be noted that, in the prior art, the topology of the network core node and the size of the traffic flowing through the network have relative stability, but this stability may be damaged by the impact noise, which is represented as a random value, such as abnormal traffic, bursty traffic, and measurement noise; for the damage problem generated by the impact noise of the random value, a mathematical method is used for conversion, and the rank of the flow matrix D is far smaller than the dimension of the flow matrix D, namely the low rank of the matrix, as the flow matrix D has low rank, namely the column vectors of the flow matrix D have correlation; however, the traffic matrix D may be contaminated by random impact noise, the low rank structure is destroyed, and abnormal traffic, bursty traffic, measurement noise, and the like are sparse with respect to the scale of the OD traffic matrix; therefore, the traffic matrix D can be modeled by robust principal component analysis, the purpose of modeling the traffic matrix D by robust principal component analysis is to recover a potential low-rank structure thereof, and impact noise with random values caused by abnormal traffic, bursty traffic, measurement noise and the like is separated from the traffic matrix D, that is, D is a + E, and the matrix a and the matrix E are a low-rank matrix and a sparse matrix of true values, respectively.
The mathematical formula mentioned in example two is additionally explained, wherein:
1. p range of vectorAnd (4) counting. For vector a ═ a1,a2,…an)T∈Rn×1Its P norm isWherein P > 0.
In particular, when p is 1,when p is 2, the compound is a compound,
‖A‖0is the zero norm of the vector, i.e., the number of non-zero elements in the vector. Infinite norm
2. Inner product of matrix. For two isotypes of m x n dimensional real matrices A, B, their inner product is
3. The Frobenius norm of the matrix. For matrix a ═ a (ai)j)m×n∈Rm ×nIts Frobenius norm is defined asZero norm A | | non-conducting phosphor0Is the number of non-zero elements in the matrix. Infinite norm(1,1) norm of(2,1) norm of
4. Singular value decomposition of the matrix. The matrix A ∈ Rm×nIts Singular Value Decomposition (SVD) isWherein U is E.Rm×mAnd V ∈ Rn×nAre all orthogonal matrices, diagonal matrix ∑r=diag(σ1,σ2,…,σr)∈Rr×rAnd diagonal elements satisfy σ1≥σ2≥…≥σr>0. The rank of matrix a is rank (a) r, U ═ U (U)1,u2,…,um),V=(v1,v2,…,vn) The singular value decomposition of the matrix is
5. The kernel norm of the matrix is defined asWherein ImRepresents an m-order identity matrix, and trace (·) represents a trace operator of the matrix. The kernel norm of matrix A can be represented by its singular values, i.e.
Performing convex optimization relaxation on the robust principal component analysis model of the flow matrix to obtain a convex optimization model of the robust principal component analysis:
to represent the minimization of the target matrix A and matrix E, A is a low rank matrix, D is a flow matrix, E is a sparse matrix, | A |*Representing the sum of all singular values, | E | in the matrix A1Is the sum of the absolute values of all elements in the matrix E, λ is the relaxation factor, and s.t. is the constraint.
It should be noted that, although the robust principal component analysis model is used to convert the flow matrix analysis problem into the decomposition problem of the low-rank matrix and the sparse matrix; however, in the robust principal component analysis model of the traffic matrix, both the target matrix a and the target matrix E are nonlinear and non-convex matrices, and it is difficult to directly solve a non-deterministic polynomial (NP for short), so that convex optimization relaxation needs to be performed on the robust principal component analysis model of the traffic matrix in order to solve both the target matrix a and the target matrix E, so as to obtain the convex optimization model of the robust principal component analysis.
In a specific calculation process, assume A0∈Rn1×n2(n1≥n2) The non-coherence condition for the parameter μ is satisfied as follows:
wherein eiRepresents a unit vector in whichRepresenting the singular value decomposition of the matrix.
Where r is the matrix A0Rank of (1), infinite norm | | | D | | non-woven phosphor∞=maxi,j|Di,j|,
And assuming that the support of E0 is evenly distributed in all coordinates, as long as:
rank(A0)≤ρrn2μ-1(logn1)-2,m≤ρsn1n2
then there is a numerical constant c that causes the problem of tracking by principal component (where) Can be at least 1-cn1 -10The original matrix is recovered by the probability of where pr、ρsIs a positive numerical constant.
Therefore, when the rank matrix A is low0Is reasonably distributed and the non-zero elements of the sparse matrix E0 are also uniformly distributed, the solution to the problem of principal component analysis can recover the original low-rank matrix a from unknown and arbitrary errors with a probability close to 10。
Solving a convex optimization model of robust principal component analysis by adopting an augmented Lagrange multiplier method, and decomposing a flow matrix into a low-rank matrix and a sparse matrix, wherein the method comprises the following steps:
constructing an augmented Lagrangian function of the convex optimization model:
‖A‖*represents the sum of all singular values, | E | in the low rank matrix A1Is the sum of the absolute values of all elements in the sparse matrix E, λ is the relaxation factor, Y is the lagrange multiplier matrix,D-A-E is a constraint condition for increasing the penalty of the Lagrange function,<Y,D-A-E>is lagrange multiplier matrix Y andthe inner product of the constraint conditions D-A-E, mu is a positive parameter, and F is the Frobenius norm of the matrix;
presetting a soft threshold shrinking operator and a singular value soft threshold operator, wherein the soft threshold shrinking operator is as follows:
wherein, when x>When epsilon, Sε(x) Equal to x-epsilon; when x is<When epsilon, Sε(x) Equal to x + ε;
when x ═ epsilon, Sε(x) Is equal to zero;
the singular value soft threshold operator is:
Dτ(X)=U Sτ(∑)VTa third formula;
wherein X ═ U ∑ VTSingular value decomposition for the matrix;
solving by using an augmented Lagrange multiplier algorithm:
it should be noted that the Augmented lagrangian Multiplier algorithm herein refers to an inaccurate Augmented lagrangian Multiplier algorithm (IALM).
Setting initial parameters Y0 ═ 0, E0 ═ 0 and mu0Greater than 0, rho0Greater than 0, k is 0;
wherein Y0 denotes a Lagrangian multiplier matrix, E0 denotes a sparse matrix, k is an iteration number, k is an integer greater than or equal to 0, and mu0Is the initial value of the contraction factor, ρ is a parameter greater than 1, used to update the contraction factor μ;
by usingFormula IV
Performing singular value decomposition;
wherein svd represents singular value decomposition; here, the initialized parameter k is 0 and Y0=0、E0=0、μ0Substituting rho into the formula IV, and obtaining a matrix U, a matrix and a matrix V through singular value decomposition;
by usingFormula five
Solving for Ak+1;
Substituting k as 0 into formula five, and substituting matrix S obtained from formula four into singular value soft threshold operator in formula three to obtain A1;
by usingFormula six
Solving for Ek+1;
Substituting k as 0 into the formula six, and substituting the matrix A1 obtained from the formula five into a soft threshold shrinking operator in the formula two to obtain E1;
using Yk+1=Yk+μk(D-Ak+1-Ek+1) Formula seven
Solving for Yk+1;
Taking k as 0 to the seven formula, updating a Lagrange multiplier matrix Y according to the residual value D-A-E, and obtaining Y1;
this completes the first iteration of the algorithm, updating the contraction factor, μk+1=ρμk(ii) a And k is k + 1;
it should be noted that since k is the number of iterations, the number of iterations k may be selected according to the accuracy of the data in practical applications; assuming that the number of iterations is N, N is an integer greater than or equal to 1,during calculation, firstly solving the augmented Lagrange multiplier algorithm with 0 k, and finishing first iteration to obtain A1, E1 and Y1; then according to the contraction factor, muk+1=ρμk(ii) a And k is k + 1; solving according to the fact that k is 1 to N and is brought into the augmented Lagrange multiplier algorithm, and therefore the augmented Lagrange multiplier algorithm is converged to an optimal solution; the optimal solution means that the values of the low-rank matrix A and the sparse matrix E are not changed no matter how many times of iterative computation is performed.
And converging the augmented Lagrange multiplier algorithm to an optimal solution through iterative calculation to obtain the low-rank matrix A and the sparse matrix E of the flow matrix decomposition.
It should be noted that, in a general form, the formula for solving the following convex optimization problem by using the Augmented Lagrangian Multiplier (ALM) is as follows:
min f(X),s.t.h(X)=0;
wherein f is Rn→R,h:Rn→Rm. Then, the augmented lagrange function of the convex optimization problem is defined as:
where μ is a positive parameter.
Augmented Lagrangian functions differ from Lagrangian functions in that they carry a penalty term for the constraint.
By comparison with the PCP model, let:
X=(A,E),f(X)=||A||*+λ||E||1,h(X)=D-A-E;
and the method is brought into an augmented Lagrangian function in a general form, so that the augmented Lagrangian function of the convex optimization model is constructed:
the embodiment of the invention provides an OD flow analysis method, which comprises the steps of utilizing a robust component analysis model to model a flow matrix of periodically acquired full-network OD flows, adopting a relaxation method to carry out convex optimization calculation on the flow matrix through the relaxation calculation of convex optimization, converting the flow matrix into a convex optimization model for robust principal component analysis, utilizing an augmented Lagrange multiplier method to solve the convex optimization model for the robust principal component analysis, and decomposing the flow matrix into a low-rank matrix and a sparse matrix, so that the analysis problem of the flow matrix is converted into the decomposition problem of the low-rank matrix and the sparse matrix; the low-rank matrix is a low-rank matrix corresponding to the OD flow which does not contain the impact noise with a random value; the sparse matrix is a sparse matrix corresponding to the OD flow of the impact noise containing the random value; compared with PCA, the OD traffic analysis method provided by the embodiment of the invention can better find out a potential low-rank matrix of the traffic matrix of the OD traffic of the whole network in the calculation of the OD traffic of the impact noise which is represented as a random value, such as abnormal traffic, burst traffic, measurement noise and the like, and solves the problem that the calculation result of the OD traffic of the impact noise which is represented as a random value, such as abnormal traffic, burst traffic, measurement noise and the like, is not ideal in the prior art.
Third embodiment, an embodiment of the present invention provides an OD flow rate analyzing apparatus 10, as shown in fig. 2, including:
a data obtaining unit 1010, configured to periodically sample and obtain a full-network OD traffic, and convert the full-network OD traffic into a traffic matrix;
a modeling unit 1020, configured to model the traffic matrix converted by the data obtaining unit by using a robust principal component analysis model to obtain a robust principal component analysis model of the traffic matrix;
an optimizing unit 1021, configured to perform convex optimization relaxation on the robust principal component analysis model of the traffic matrix obtained by the modeling unit to obtain a convex optimization model of the robust principal component analysis;
a solving unit 1022, configured to solve the convex optimization model of robust principal component analysis obtained by the optimizing unit by using a non-precise augmented lagrangian multiplier method, so as to obtain a recovered low-rank matrix and a recovered sparse matrix in the traffic matrix; the low-rank matrix is a low-rank matrix corresponding to the OD flow which does not contain the impact noise with a random value; the sparse matrix is a sparse matrix corresponding to the OD flow of the impact noise containing the random value;
the data output unit 1030 is configured to output the low-rank matrix and the sparse matrix obtained by the solving unit.
The embodiment of the invention provides an OD flow analysis device.A data processing unit utilizes a robust component analysis model to model a flow matrix of the whole network OD flow acquired by a data acquisition unit periodically; then the data processing unit performs convex optimization calculation on the flow matrix by adopting a relaxation method through the relaxation calculation of convex optimization, and converts the flow matrix into a convex optimization model for robust principal component analysis; finally, the data processing unit adopts an augmented Lagrange multiplier method to solve a convex optimization model of robust principal component analysis, and decomposes the flow matrix into a low-rank matrix and a sparse matrix, so that the analysis problem of the flow matrix is converted into the decomposition problem of the low-rank matrix and the sparse matrix; the low rank matrix is a low rank matrix corresponding to the OD traffic that does not contain the random value of the impact noise. The OD flow analysis device provided by the embodiment of the invention can better find out a potential low-rank matrix of the flow matrix of the OD flows of the whole network in the calculation of the OD flows of the impact noise which is represented as a random value, such as abnormal flows, burst flows, measurement noise and the like, and solves the problem that the calculation result of the OD flows of the impact noise which is represented as a random value, such as the abnormal flows, the burst flows, the measurement noise and the like, is not ideal in the prior art.
In a fourth embodiment, an OD flow rate analyzing apparatus 10 according to an embodiment of the present invention is provided, as shown in fig. 2, including:
a data obtaining unit 1010, configured to periodically collect and obtain a full-network OD traffic, and convert the full-network OD traffic into a traffic matrix;
the modeling unit 1020 is specifically configured to model the traffic matrix by using a robust principal component analysis model to obtain the robust principal component analysis model of the traffic matrix:
wherein,to represent the minimization of the target matrix A and matrix E, A is a low rank matrix, D is a traffic matrix, E is a sparse matrix, rank (A) represents the rank of matrix A, | E |0Is the number of non-zero elements in the matrix E, gamma is a compromise factor and is greater than 0, and s.t. is a constraint condition.
The optimization unit 1021 is configured to perform convex optimization relaxation on the robust principal component analysis model of the traffic matrix to obtain a convex optimization model of the robust principal component analysis:
wherein,to indicate that the minimum A of the target matrix A and the matrix E is a low rank matrix, D is a flow matrix, E is a sparse matrix, | A |*Representing the sum of all singular values, | E | in the matrix A1Is the sum of the absolute values of all elements in the matrix E, λ is the relaxation factor, and s.t. is the constraint.
The solving unit 1022 is specifically configured to construct an augmented lagrangian function of the convex optimization model:
‖A‖*represents the sum of all singular values, | E | in the low rank matrix A1Is the sum of the absolute values of all elements in the sparse matrix E, λ is the relaxation factor, Y is the lagrange multiplier matrix,D-A-E is a constraint condition for increasing the penalty of the Lagrange function,<Y,D-A-E>the inner product of a Lagrange multiplier matrix Y and a constraint condition D-A-E is obtained, mu is a positive parameter, and F is the Frobenius norm of the matrix;
presetting a soft threshold shrinking operator and a singular value soft threshold operator, wherein the soft threshold shrinking operator is as follows:
wherein, when x>When epsilon, Sε(x) Equal to x-epsilon; when x is<When epsilon, Sε(x) Equal to x + ε;
when x ═ epsilon, Sε(x) Is equal to zero;
the singular value soft threshold operator is:
Dτ(X)=U Sτ(∑)VTa third formula;
wherein X ═ U ∑ VTSingular value decomposition for the matrix;
solving by using an augmented Lagrange multiplier algorithm:
it should be noted that. The Augmented Lagrange Multiplier algorithm herein refers to an inaccurate Augmented Lagrange Multiplier algorithm (IALM).
Setting initial parameters Y0 ═ 0, E0 ═ 0 and mu0Greater than 0, rho0Greater than 0, k is 0;
wherein Y0 denotes a Lagrangian multiplier matrix, E0 denotes a sparse matrix, k is an iteration number, k is an integer greater than or equal to 0, and mu0Is the initial value of the contraction factor, ρ is a parameter greater than 1, used to update the contraction factor μ;
by usingFormula IV
Performing singular value decomposition;
wherein svd represents singular value decomposition; here, the initialized parameter k is 0 and Y0=0、E0=0、μ0Substituting rho into the formula IV, and obtaining a matrix U, a matrix and a matrix V through singular value decomposition;
by usingFormula five
Solving for Ak+1;
Substituting k as 0 into formula five, and substituting matrix S obtained from formula four into singular value soft threshold operator in formula three to obtain A1;
by usingFormula six
Solving for Ek+1;
Substituting k as 0 into the formula six, and substituting the matrix A1 obtained from the formula five into a soft threshold shrinking operator in the formula two to obtain E1;
using Yk+1=Yk+μk(D-Ak+1-Ek+1) Formula seven
Solving for Yk+1;
Taking k as 0 to the seven formula, updating a Lagrange multiplier matrix Y according to the residual value D-A-E, and obtaining Y1;
this completes the first iteration of the algorithm, updating the contraction factor, μk+1=ρμk(ii) a And k is k + 1;
and converging the augmented Lagrange multiplier algorithm to an optimal solution through iterative calculation to obtain the low-rank matrix A and the sparse matrix E of the flow matrix decomposition.
And the data output unit 1030 is configured to output the low-rank matrix a and the sparse matrix E obtained by the solving unit.
According to the OD flow analysis device provided by the embodiment of the invention, the data processing unit utilizes a robust component analysis model algorithm to model a flow matrix of the whole network OD flow acquired by the data acquisition unit periodically; then the data processing unit adopts a relaxation method to perform convex optimization calculation on the flow matrix, and converts the flow matrix into a convex optimization model for robust principal component analysis; finally, the data processing unit adopts an augmented Lagrange multiplier method to solve a convex optimization model of robust principal component analysis, and decomposes the flow matrix into a low-rank matrix and a sparse matrix, so that the analysis problem of the flow matrix is converted into the decomposition problem of the low-rank matrix and the sparse matrix; the low rank matrix is a low rank matrix corresponding to the OD traffic that does not contain the random value of the impact noise. The OD flow analysis device provided by the embodiment of the invention can better find out a potential low-rank matrix of the flow matrix of the OD flows of the whole network in the calculation of the OD flows of the impact noise which is represented as a random value, such as abnormal flows, burst flows, measurement noise and the like, and solves the problem that the calculation result of the OD flows of the impact noise which is represented as a random value, such as the abnormal flows, the burst flows, the measurement noise and the like, is not ideal in the prior art.
The data acquisition unit and the data output unit may be communication units of the OD traffic analysis device, and the modeling unit, the optimization unit, and the solving unit may be processors separately installed, or may be implemented by being integrated into one of the processors of the OD traffic analysis device, or may be stored in a memory of the OD traffic analysis device in the form of program codes, and the functions of the modeling unit, the optimization unit, and the solving unit may be invoked and executed by one of the processors of the OD traffic analysis device. The processor described herein may be a Central Processing Unit (CPU), or an Application Specific Integrated Circuit (ASIC), or one or more Integrated circuits configured to implement embodiments of the present invention.
Those skilled in the art will recognize that, in one or more of the examples described above, the functions described in this invention may be implemented in hardware, software, firmware, or any combination thereof. When implemented in software, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Computer-readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another. A storage media may be any available media that can be accessed by a general purpose or special purpose computer.
The above-mentioned embodiments, objects, technical solutions and advantages of the present invention are further described in detail, it should be understood that the above-mentioned embodiments are only exemplary embodiments of the present invention, and are not intended to limit the scope of the present invention, and any modifications, equivalent substitutions, improvements and the like made on the basis of the technical solutions of the present invention should be included in the scope of the present invention.
Claims (4)
1. An OD flow analysis method is characterized in that:
periodically sampling to obtain the OD flow of the whole network, and converting the OD flow of the whole network into a flow matrix;
modeling the flow matrix by adopting a robust principal component analysis model to obtain the robust principal component analysis model of the flow matrix;
performing convex optimization relaxation on the robust principal component analysis model of the flow matrix to obtain a convex optimization model of the robust principal component analysis;
solving the convex optimization model of the robust principal component analysis by adopting an augmented Lagrange multiplier method, and decomposing the flow matrix into a low-rank matrix and a sparse matrix; wherein the low-rank matrix does not contain the OD flow of the impact noise with random value, and represents the inherent low-rank structure of the flow matrix; the sparse matrix contains the OD flow of the impact noise with a random value;
outputting the obtained low-rank matrix and the sparse matrix;
the robust principal component analysis model is adopted to model the traffic matrix to obtain the robust principal component analysis model of the traffic matrix:
wherein,to represent a minimization of a target matrix A and a matrix E, A being the low rank matrix, D being the traffic matrix, E being the sparse matrix, rank (A) representing the rank of the matrix A, | E |0Is the number of non-zero elements in the matrix E, γ is a compromise factor and is greater than 0, and s.t. is a constraint condition;
performing convex optimization relaxation on the robust principal component analysis model of the flow matrix to obtain a convex optimization model of the robust principal component analysis:
wherein,to represent a minimization of a target matrix A and a matrix E, A being the low rank matrix, D being the traffic matrix, E being the sparse matrix, | A |*Represents the sum of all singular values, | E | in the matrix A1Is the sum of the absolute values of all elements in the matrix E, λIs a relaxation factor, and s.t. is a constraint condition; solving the convex optimization model of the robust principal component analysis by adopting an augmented Lagrange multiplier method, and decomposing the flow matrix into a low-rank matrix and a sparse matrix, wherein the method comprises the following steps of:
constructing an augmented Lagrangian function of the convex optimization model:
‖A‖*represents the sum of all singular values, | E | in the low rank matrix A1Is the sum of the absolute values of all elements in the sparse matrix E, λ is the relaxation factor, Y is the lagrange multiplier matrix,D-A-E is a constraint condition for increasing the penalty of the Lagrange function,<Y,D-A-E>is the inner product of a Lagrange multiplier matrix Y and a constraint condition D-A-E, mu is a positive parameter, and F is the Frobenius norm of the matrix.
2. The analysis method according to claim 1, wherein the solving of the convex optimization model of the robust principal component analysis using the augmented lagrange multiplier method to decompose the traffic matrix into a low rank matrix and a sparse matrix further comprises:
presetting a soft threshold shrinking operator and a singular value soft threshold operator, wherein the soft threshold shrinking operator is as follows:
wherein, when x > epsilon, Sε(x) Equal to x-epsilon; when x < epsilon, Sε(x) Equal to x + ε;
when x ═ epsilon, Sε(x) Is equal to zero;
the singular value soft threshold operator is:
Dτ(X)=U Sτ(∑)VTa third formula;
wherein X ═ U ∑ VTSingular value decomposition for the matrix;
solving by using an augmented Lagrange multiplier algorithm:
setting initial parameters Y0 ═ 0, E0 ═ 0 and mu0Greater than 0, rho0Greater than 0, k is 0;
wherein Y0 denotes a Lagrangian multiplier matrix, E0 denotes a sparse matrix, k is an iteration number, k is an integer greater than or equal to 0, and mu0Is the initial value of the contraction factor, ρ is a parameter greater than 1, used to update the contraction factor μ;
by usingPerforming singular value decomposition on the formula IV;
wherein svd represents singular value decomposition; here, the initialized parameter k is 0 and Y0=0、E0=0、μ0Substituting rho into the formula IV, and obtaining a matrix U, a matrix S and a matrix V through singular value decomposition;
by usingEquation five solves for Ak+1;
Substituting k as 0 into formula five, and substituting matrix S obtained from formula four into singular value soft threshold operator in formula three to obtain A1;
by usingSix solution of formula Ek+1;
Substituting k as 0 into the formula six, and substituting the matrix A1 obtained from the formula five into a soft threshold shrinking operator in the formula two to obtain E1;
using Yk+1=Yk+μk(D-Ak+1-Ek+1) Solving for Y by formula sevenk+1;
Taking k as 0 to the seven formula, updating a Lagrange multiplier matrix Y according to the residual value D-A-E, and obtaining Y1;
this completes the first iteration of the algorithm, updating the contraction factor, μk+1=ρμk(ii) a And k is k + 1;
and converging the augmented Lagrange multiplier algorithm to an optimal solution through iterative calculation to obtain the low-rank matrix A and the sparse matrix E of the flow matrix decomposition.
3. An OD flow analysis device, comprising:
the data acquisition unit is used for periodically sampling and acquiring the OD flow of the whole network and converting the OD flow of the whole network into a flow matrix;
the modeling unit is used for modeling the traffic matrix converted by the data acquisition unit by adopting a robust principal component analysis model to obtain the robust principal component analysis model of the traffic matrix;
the optimization unit is used for performing convex optimization relaxation on the robust principal component analysis model of the traffic matrix obtained by the modeling unit to obtain a convex optimization model of the robust principal component analysis;
the solving unit is used for solving the convex optimization model of the robust principal component analysis obtained by the optimizing unit by adopting a non-precise augmented Lagrange multiplier method to obtain a recovered low-rank matrix and a recovered sparse matrix in the flow matrix; the low-rank matrix is a low-rank matrix corresponding to OD (origin-destination) flow without impact noise with a random value and represents an inherent low-rank structure of the flow matrix; the sparse matrix is a sparse matrix corresponding to the OD flow of the impact noise containing the random value;
the data output unit is used for outputting the low-rank matrix and the sparse matrix obtained by the solving unit;
the modeling unit is specifically configured to model the traffic matrix by using a robust principal component analysis model to obtain the robust principal component analysis model of the traffic matrix:
wherein,to represent a minimization of a target matrix A and a matrix E, A being the low rank matrix, D being the traffic matrix, E being the sparse matrix, rank (A) representing the rank of the matrix A, | E |0Is the number of non-zero elements in the matrix E, γ is a compromise factor and is greater than 0, and s.t. is a constraint condition;
the optimization unit is configured to perform convex optimization relaxation on the robust principal component analysis model of the traffic matrix to obtain a convex optimization model of the robust principal component analysis:
wherein,to represent that the minimization A of a target matrix A and a matrix E is the low rank matrix, D is the traffic matrix, E is the sparse matrix, | A |*Represents the sum of all singular values, | E | in the matrix A1Is the sum of absolute values of all elements in the matrix E, λ is a relaxation factor, and s.t. is a constraint condition;
the solving unit is specifically used for constructing an augmented Lagrangian function of the convex optimization model, and comprises the following steps:
||A||*represents the sum of all singular values, | E | in the low rank matrix A1Is the sum of the absolute values of all elements in the sparse matrix E, λ is the relaxation factor, Y is the lagrange multiplier matrix,for the purpose of broadeningThe penalty term of the Lagrangian function, D-A-E is the constraint condition,<Y,D-A-E>is the inner product of a Lagrange multiplier matrix Y and a constraint condition D-A-E, mu is a positive parameter, and F is the Frobenius norm of the matrix.
4. The analysis device according to claim 3, wherein the solving unit is specifically configured to construct an augmented Lagrangian function of the convex optimization model, and further comprises:
presetting a soft threshold shrinking operator and a singular value soft threshold operator, wherein the soft threshold shrinking operator is as follows:
wherein, when x > epsilon, Sε(x) Equal to x-epsilon; when x < epsilon, Sε(x) Equal to x + ε;
when x ═ epsilon, Sε(x) Is equal to zero;
the singular value soft threshold operator is:
Dτ(X)=U Sτ(∑)VTa third formula;
wherein X ═ U ∑ VTSingular value decomposition for the matrix;
solving by using an augmented Lagrange multiplier algorithm:
setting initial parameters Y0 ═ 0, E0 ═ 0 and mu0Greater than 0, rho0Greater than 0, k is 0;
wherein Y0 denotes a Lagrangian multiplier matrix, E0 denotes a sparse matrix, k is an iteration number, k is an integer greater than or equal to 0, and mu0Is the initial value of the contraction factor, ρ is a parameter greater than 1, used to update the contraction factor μ;
by usingPerforming singular value decomposition on the formula IV;
wherein svd represents singular value decomposition; here, the initialized parameter k is 0 and Y0=0、E0=0、μ0Substituting rho into the formula IV, and obtaining a matrix U, a matrix S and a matrix V through singular value decomposition;
by usingEquation five solves for Ak+1;
Substituting k as 0 into formula five, and substituting matrix S obtained from formula four into singular value soft threshold operator in formula three to obtain A1;
by usingSix solution of formula Ek+1;
Substituting k as 0 into the formula six, and substituting the matrix A1 obtained from the formula five into a soft threshold shrinking operator in the formula two to obtain E1;
using Yk+1=Yk+μk(D-Ak+1-Ek+1) Solving for Y by formula sevenk+1;
Taking k as 0 to the seven formula, updating a Lagrange multiplier matrix Y according to the residual value D-A-E, and obtaining Y1;
this completes the first iteration of the algorithm, updating the contraction factor, μk+1=ρμk(ii) a And k is k + 1;
and converging the augmented Lagrange multiplier algorithm to an optimal solution through iterative calculation to obtain the low-rank matrix A and the sparse matrix E of the flow matrix decomposition.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610809587.XA CN106301950B (en) | 2016-09-07 | 2016-09-07 | A kind of analysis method and analytical equipment of OD flow |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610809587.XA CN106301950B (en) | 2016-09-07 | 2016-09-07 | A kind of analysis method and analytical equipment of OD flow |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106301950A CN106301950A (en) | 2017-01-04 |
CN106301950B true CN106301950B (en) | 2019-08-09 |
Family
ID=57710517
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610809587.XA Active CN106301950B (en) | 2016-09-07 | 2016-09-07 | A kind of analysis method and analytical equipment of OD flow |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106301950B (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107170233B (en) * | 2017-04-20 | 2020-08-18 | 同济大学 | Typical daily traffic demand OD matrix acquisition method based on matrix decomposition |
CN107992449B (en) * | 2017-12-05 | 2021-04-30 | 北京工业大学 | Subway abnormal flow detection method based on low-rank representation |
CN108320504B (en) * | 2018-01-22 | 2020-06-16 | 北京工业大学 | Dynamic OD matrix estimation method based on monitoring data |
CN111970156B (en) * | 2020-08-27 | 2023-04-18 | 广州华多网络科技有限公司 | Network fault root cause analysis method and device, computer equipment and storage medium |
CN113973013B (en) * | 2021-10-25 | 2024-02-02 | 杭州安恒信息技术股份有限公司 | Network traffic detection method, system and related components |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103607292A (en) * | 2013-10-28 | 2014-02-26 | 国家电网公司 | Fast distributed monitoring method for electric-power communication network services |
CN105608492A (en) * | 2016-03-02 | 2016-05-25 | 东北大学 | Robust random-weight neural network-based molten-iron quality multi-dimensional soft measurement method |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7773538B2 (en) * | 2008-05-16 | 2010-08-10 | At&T Intellectual Property I, L.P. | Estimating origin-destination flow entropy |
-
2016
- 2016-09-07 CN CN201610809587.XA patent/CN106301950B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103607292A (en) * | 2013-10-28 | 2014-02-26 | 国家电网公司 | Fast distributed monitoring method for electric-power communication network services |
CN105608492A (en) * | 2016-03-02 | 2016-05-25 | 东北大学 | Robust random-weight neural network-based molten-iron quality multi-dimensional soft measurement method |
Also Published As
Publication number | Publication date |
---|---|
CN106301950A (en) | 2017-01-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106301950B (en) | A kind of analysis method and analytical equipment of OD flow | |
Wu et al. | Generalized H2 fault detection for two-dimensional Markovian jump systems | |
Sundaresan et al. | Copula-based fusion of correlated decisions | |
Carrillo et al. | Robust compressive sensing of sparse signals: a review | |
Tillmann et al. | The computational complexity of the restricted isometry property, the nullspace property, and related concepts in compressed sensing | |
Zhou et al. | Spatio-temporal tensor completion for imputing missing internet traffic data | |
Kekatos et al. | From sparse signals to sparse residuals for robust sensing | |
Forero et al. | Sparsity-exploiting robust multidimensional scaling | |
Xie et al. | Accurate recovery of internet traffic data under dynamic measurements | |
Xiao et al. | Anomaly-tolerant network traffic estimation via noise-immune temporal matrix completion model | |
Li et al. | Tensor completion from one-bit observations | |
Zhang et al. | Grading learning for blind source separation | |
Cao et al. | Sketching for sequential change-point detection | |
Zhou et al. | Data reconstruction in internet traffic matrix | |
CN104468272A (en) | Flow matrix estimation method and device | |
Wu et al. | Energy-efficient and robust tensor-encoder for wireless camera networks in Internet of Things | |
Rajesh et al. | Data reconstruction in heterogeneous environmental wireless sensor networks using robust tensor principal component analysis | |
US9147162B2 (en) | Method for classification of newly arrived multidimensional data points in dynamic big data sets | |
Zhou et al. | Robust spatio-temporal tensor recovery for internet traffic data | |
Xiao et al. | A one-sample decentralized proximal algorithm for non-convex stochastic composite optimization | |
Bhushan et al. | Incremental principal component analysis based outlier detection methods for spatiotemporal data streams | |
Qiu et al. | Recursive sparse recovery in large but structured noise—Part 2 | |
Kiraly et al. | Error-minimizing estimates and universal entry-wise error bounds for low-rank matrix completion | |
Schizas | Online data dimensionality reduction and reconstruction using graph filtering | |
Stojnic | Towards improving ℓ 1 optimization in compressed sensing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |