CN114697272B

CN114697272B - Traffic classification method, system and computer readable storage medium

Info

Publication number: CN114697272B
Application number: CN202210203700.5A
Authority: CN
Inventors: 汤萍萍; 王再见; 汝佳冉
Original assignee: Anhui Normal University
Current assignee: Anhui Normal University
Priority date: 2022-03-03
Filing date: 2022-03-03
Publication date: 2023-06-16
Anticipated expiration: 2042-03-03
Also published as: CN114697272A

Abstract

The invention provides a flow classification method, a system and a computer readable storage medium, wherein the flow classification method comprises the following steps: s1, obtaining a network flow F to be classified _I Obtaining F _I A spatial sequence P and a temporal sequence T of (a); s2, establishing fractal features f in two dimensions of space and time according to the space sequence P and the time sequence T _P (alpha) and f _T (α) defining a high-dimensional fractal M; s3, defining a similarity measure according to the high-dimensional fractal M; s4, converting the vector matrix with similar measurement into similarity; s5, classifying the network flows according to the similarity. The flow classification method, the system and the computer readable storage medium reflect a series of change characteristics obtained by observing network flows from different space and time scales, represent the space-time correlation between data, so as to classify, not only ensure the stability of fine classification, but also enhance the accuracy of fine classification.

Description

Traffic classification method, system and computer readable storage medium

Technical Field

The present invention relates to the field of traffic classification technologies, and in particular, to a traffic classification method, system, and computer readable storage medium.

Background

With the increasing innovation of network technology, network traffic is exploded. Traffic with different QoS (Quality of Service) requirements, such as minute and second delays in video conferencing or loss of pictures can lead to economic losses or decision errors, while traffic classification can help to implement differentiated services; network bandwidth is occupied by a large amount of garbage flow (high consumption and low value), and the use of bandwidth resources can be optimized through online identification; in addition, malicious traffic online detection (such as broadcast storm attack) can enhance network security and ensure system confidentiality and availability. In short, traffic classification is a basic technology for solving a series of important problems such as resource management, network monitoring, security control and the like, and is an important research problem in the field of communication.

As network traffic is more and more classified, classification granularity is finer and finer. For fine-grained classification, the statistical feature method is invaginated by the "feature engineering" problem, 2227676s. The deep learning method obtains fine classification capability by enhancing detail features, but the network structure becomes complex, and setting parameters and super parameters is a difficult task; if new classes are added, all parameters and even the system architecture face readjustment, which severely restricts the application of online classification.

The existing fractal method generally obtains classification granularity in a mode of sacrificing classification speed, and is difficult to achieve both classification accuracy and classification speed.

Disclosure of Invention

In view of the above, the present invention aims to provide a flow classification method, system and computer readable storage medium, which can more accurately classify flows; in addition, unlike traditional continuous fractal, high-dimensional fractal is a discrete fractal feature, so that the calculated speed is greatly improved.

The technical scheme of the invention is realized as follows:

the invention provides a flow classification method, which comprises the following steps:

s1, obtaining a network flow F to be classified _I Obtaining F _I A spatial sequence P and a temporal sequence T of (a);

s2, establishing fractal features f in two dimensions of space and time according to the space sequence P and the time sequence T _P (alpha) and f _T (α), defining a high-dimensional fractal:

M＝f _P (α)*f _T (α) ^T

wherein f _P (alpha) and f _T The scale of the (α) observation is at least q=1 and at most

S3, defining a similarity measure according to the high-dimensional fractal M;

wherein M is _a Representing stream F _i ^a High-dimensional fractal of M _b Representing stream F _i ^b Is a high-dimensional fractal of (2);

s4, converting the vector matrix of the similarity measure into similarity:

s5, classifying the network flows according to the similarity.

Preferably, the spatial sequence P and the time sequence T establish fractal features f in two dimensions of space and time _P (alpha) and f _T (α) specifically includes:

the spatial sequence p= { P _i Sum time series t= { T _i Respectively brought into the following formula to form fractal features f in two dimensions of space and time _P (alpha) and f _T (α)；

Let x= { Xi, i=1, 2, …, N } be a discrete random sequence and possess fractal characteristics. Dividing the discrete sequence { X (i) } into m non-overlapping blocks, and carrying out merging operation on the blocks to obtain an m-order merging sequence:

q-order calculation and summation are carried out on the m-order coalescence sequence:

finally obtain

Preferably, the S4 specifically includes:

for similarity matrices A and P ^-1 AP，tr(P ^-1 AP)＝tr(PP ^-1 A) Tr (a), where tr (·) is the trace of the matrix and M is the fractal feature f of the spatial sequence _P Fractal features of (alpha) with time series f _T Cross multiplication of (α), thus tr (f) _P (α)f _T (α) ^T )＝f _T (α) ^T f _P (α) converting the vector matrix of similarity measures into a similarity:

preferably, the step S5 specifically includes:

s51: let there are L classes at present

Each class has several streams { …, F _I ^j ,F _I ^k … }, the center point is denoted as

The center point is determined by the following formula:

s52: for network flow F _I ^a When classifying, calculating the similarity between the stream and each center point

The following operations were selected to be most similar:

wherein the network flow F _I ^a And a center point P _l If the similarity of (2) is greater than or equal to the threshold T, then F _I ^a Belonging to class P _l The method comprises the steps of carrying out a first treatment on the surface of the If the similarity is less than the threshold T, then F _I ^a Not of class P _l 。

The invention also provides a flow classification system, which comprises:

an acquisition module for acquiring the network flow F to be classified _I Obtaining F _I A spatial sequence P and a temporal sequence T of (a);

the fractal module is used for establishing fractal characteristics f in two dimensions of space and time according to the space sequence P and the time sequence T _P (alpha) and f _T (α), defining a high-dimensional fractal:

M＝f _P (α)*f _T (α) ^T

The high-dimensional fractal module is used for defining a similarity measure according to the high-dimensional fractal M;

converting the vector matrix of similarity measures into similarity:

and the classification module is used for classifying the network flows according to the similarity.

PreferablyThe spatial sequence P and the time sequence T establish fractal characteristics f in two dimensions of space and time _P (alpha) and f _T (α) specifically includes:

finally obtain

Preferably, the high-dimensional fractal module is specifically configured to:

for similarity matrices A and P ^-1 AP，tr(P ^-1 AP)＝tr(PP ^-1 A) Tr (a), wherein tr (·) is the trace of the matrix;

m is the fractal feature f of the spatial sequence _P Fractal features of (alpha) with time series f _T Cross multiplication of (α), thus tr (f) _P (α)f _T (α) ^T )＝f _T (α) ^T f _P (α) converting the vector matrix of similarity measures into a similarity:

preferably, the classification module is specifically configured to:

let there are L classes at present

Each class has several streams { …, F _I ^j ,F _I ^k …, center point is denoted +.>

The center point is determined by the following formula:

for network flow F _I ^a When classifying, the similarity Sim (M _a ,M _Pl ) The following operations were selected to be most similar:

The invention also proposes a computer readable storage medium having stored thereon a computer program which, when executed by a processor, implements a flow classification method according to any of the preceding claims.

The discrete fractal features formed by carrying out space-time separation on the flow reflect a series of change features obtained by observing network flows from different space and time scales, embody the space-time correlation between data and classify the data, not only can ensure the stability of fine classification, but also can enhance the accuracy of fine classification.

Drawings

FIG. 1 is a flow chart of a flow classification method according to an embodiment of the present invention;

FIG. 2 is a block diagram of a flow classification system according to an embodiment of the present invention;

fig. 3 is a high-dimensional fractal flow detail profile.

Detailed Description

The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.

As shown in fig. 1, an embodiment of the present invention provides a flow classification method, which includes the following steps:

M＝f _P (α)*f _T (α) ^T

S3, defining a similarity measure according to the high-dimensional fractal M;

s4, converting the vector matrix of the similarity measure into similarity:

s5, classifying the network flows according to the similarity.

Specifically, the method specifically comprises the following steps:

space-time separation

In the flow fractal theory, flow is defined as the amount of data passing through the network device per unit time I, network flow F _I Is a group meeting five-tuple<Source IP, destination IP, source port, destination port, protocol>Is a data packet of (1):

here, P _i Refers to the size, T, of the ith data packet _i Refers to the time interval between the packet and the previous packet, and the resolution N is the number of packets contained in the stream.

In addition, stream F _I Can be divided into a plurality of substreams, the mth substream F _I ^(m) ：

Wherein N in the formula (2) _m The number of packets included in the mth substream is indicated. Next, fractal features are calculated for the spatial sequence P and the temporal sequence T, respectively. In practice f (α) is often estimated using numerical methods to obtain approximations, such as unbiased estimates under the legend transform: let x= { X _i I=1, 2, …, N } is a discrete random sequence and has fractal properties. Will leaveThe scattered sequence { X (i) } is divided into m non-overlapping blocks, and the blocks are combined to obtain an m-order merging sequence:

finally, the method comprises the following steps:

here, τ (q) is the fractal estimation spectrum f (α) under legendre transformation, and τ (q) is used in the present application to quickly calculate the fractal characteristics of the flow in order to achieve quick identification of the network flow.

High-dimensional fractal

The spatial sequence p= { P _i Sum time series t= { T _i Respectively brought into (4-6) to form fractal features f in both spatial and temporal dimensions _P (alpha) and f _T (alpha). The former describes the varying characteristics of the traffic packet size; the latter describes the bursty nature of the traffic packets over time. The two vectors are subjected to cross multiplication, and the physical meaning is that the variation characteristics of the data burst quantity reflected by the network flow on different spatial and time scales are defined, so that the high-dimensional fractal is defined:

M＝f _P (α)*f _T (α) ^T (7)

here, f _P (α) is a fractal feature built based on the spatial sequence P, the observed scale is minimum q=1, and maximum

Similarly, f _T And (alpha) is a fractal feature corresponding to the time sequence T. Fractal features in both the P and T dimensions describeWhen the observation scale q is from 1 to +.>

When in change, the flow data presents a change track in time and space. For this purpose, the observation scale f of the time series is fixed first _T (α)| _α＝q′ Only the spatially varying features f are observed _P (alpha). Observation scale f in time series _T (α)| _α＝q′ Under the feature vector f _Pa (alpha) has uniqueness, so that the orthogonal matrix f of all components _P (α)*f _T (α) ^T The network flow may be uniquely marked. Thus, the present application will distinguish between different types of network flows based on the high-dimensional fractal M.

High-dimensional fractal similarity

The high-dimensional fractal M describes a variation track of the flow along with the variation of the observation scale. One type of traffic always follows a specific protocol, transport, and therefore has similar trajectories reflecting some of the characteristics inherent to traffic. Therefore, based on the similarity of the high-dimensional fractal M, the accurate classification of the network flows is realized. To this end, the present application defines a similarity measure for the high-dimensional fractal M based on a matrix relationship:

here, M _a Representing stream F _i ^a High-dimensional fractal of M _b Representing stream F _i ^b Is a high-dimensional fractal of (2). For similarity matrices A and P ^-1 AP, tr (P- ¹ AP)＝tr(PP- ¹ A) Tr (a), where tr (·) is the trace of the matrix. I.e. the similarity matrix has the same trace. Furthermore, the fractal feature f of the spatial sequence is represented by formula (18) M _P Fractal features of (alpha) with time series f _T Cross multiplication of (α), thus tr (f) _P (α)f _T (α) ^T )＝f _T (α) ^T f _P (α), then, converting the vector matrix of the similarity measure shown in (9) into a scalar, and is called similarity:

here, sim (M) is obtainable according to formula (9) _a ,M _b )＝Sim(M _b ,M _a ) And Sim (·) ranges between 0 and 1; the larger the value, the higher the similarity between the two, and in the extreme case Sim (M _a ,M _a ) =1, i.e. there is perfect agreement between the two.

Classification

The classification process of the application refers to a classifier design method based on kmeans. Assume that there are L classes currently

Because Sim (·) obeys a uniform distribution over 0-1, the center point is determined by the following formula:

center point P _l With other points { …, F in the class _I ^j ,F _I ^k … are all of a relatively small amount. For network flow F _I ^a When classifying, calculating the similarity between the stream and each center point

The following operations were selected to be most similar:

the meaning of formula (11) here is: network flow F _I ^a And a center point P _l If the similarity of (a) is largeIs equal to or greater than the threshold value, then F _I ^a Belonging to class P _l The method comprises the steps of carrying out a first treatment on the surface of the If the similarity is less than the threshold, F _I ^a Not of class P _l 。

Examples

Software environment of experiment: capturing real-time traffic flow by using Wireshark software; the effectiveness of the model HFM (High-dimensional Fractal Model) method was verified with a MATLAB R2016a simulation tool. The hardware configuration environment is Win10professional (64 bit/SP 1), intel (R) Core (TM) i7-7500U@2.70GHz,8GB memory.

The data sets used in this experiment were: 1) An NJUPT data set acquired in a campus network of Nanjing university of post and telecommunications contains six types of traffic; 2) Internet traffic data set UNB ISCX Network Traffic ^[34] Traffic data containing numerous applications such as Vimeo, youTube, ICQ, skype, facebook, bitdorent, etc. The traffic of the dataset is divided into eight categories. 3) ISP data sets collected by regional data centers in China Mobile integrate ten types of traffic, such as video streams, online games, etc.

Step 1, obtaining a time sequence P and a space sequence T. Most traffic packet-grabbing software can provide the size of each packet as well as time-of-arrival information. Taking the example of a cool video stream, P and T can be obtained by Wireshark packet grabbing:

{P _i }＝{470,462,1494,…,68,1494,1494}

{T _i }＝{0.000428,0.00083,…,0.151786,0.05897}

and 2, generating high-dimensional fractal. For different observation scales

Generating corresponding fractal features f from the time series and the space series of (15-17) _P (alpha) and f _T (α)：

f _P (α)＝{20.513,10.436,7.237,5.288,4.362,3.538,3.192,2.641,2.407,2.215}

f _T (α)＝{6.285,3.217,2.163,1.722,1.338,1.176,1.035,0.919,0.814,0.752}

Generated by (7)High-dimensional fractal M _QQ ＝f _P (α)*f _T (α) ^T . As shown in fig. 3, the high-dimensional fractal through space-time separation characterizes more flow details in two dimensions of space and time. In a physical sense, the fractal characteristics of the spatial dimension reflect the change characteristics of the flow packet size; fractal features in the time dimension reflect the bursty nature of traffic packets over time. The high-dimensional fractal can obtain more fractal detail features only by a small amount of data (2000 data packets), so that the HFM greatly improves the calculation speed on the basis of ensuring the classification accuracy.

It should be specifically noted that, the present application sets the resolution to n=2000, and these packets are sufficient to obtain the variation characteristics of the network flow to implement classification. The smaller N is, the less the calculated amount is; however, as the number of packets decreases, the high-dimensional fractal features become unstable, which is detrimental to classification. Taking a cool video stream as an example, the influence of the size of the stream sequence resolution N on the high-dimensional fractal is studied. Convection sequence, respectively taking N _i = {10000,8000,6000,4000,2000,1500,1000,500}, calculating the high-dimensional fractal corresponding to the substreams; then, corresponding matrix similarity is counted

When N is ₁ When=10000, the matrix similarity Sim (C _j ,C _k ) 0.984.+ -. 0.006, this result is very stable. When N is reduced _i In the process, the stability is worse and worse, N ₈ When=500, sim (C _j ,C _k ) 0.469.+ -. 0.127, the difference between sub-streams is quite large and cannot be identified. For other types of flows, repeated experiments were performed, and the situation was largely similar. Therefore, the present application ultimately selects a resolution of n=n ₅ =2000, i.e. classification stability is guaranteed; the calculated amount and the memory amount are not excessively large.

The application adopts the index commonly used by a classification system: the accuracy, recall and F value are used for evaluating the classification accuracy. On the NJUPT data set, 5000 (1000 each type) are randomly selected for six types of flow such as streaming media video, voIP instant audio, web browsing, FTP file transmission, email and online game, and two-fold cross validation is performed; the average of 20 classification results was taken. The statistical results are shown in table 1. Average F, accuracy and recall were 0.953, 95.84% and 95.88%, respectively.

TABLE 1 identification rate statistics

The space-time complexity of HFM is minimal, and its complexity is mainly determined by the number of classified samples M and the resolution N, and is specifically analyzed as follows. The computational effort of HFM online classification is mainly focused on: 1) Preprocessing data, namely respectively generating fractal characteristics by a time sequence and a space sequence after space-time separation. As can be seen from equations (5-7), the calculated amount of this process is mainly the sum of the scan flows, i.e., O (Nlog (N)), N being the flow sequence resolution. 2) Generating a high-dimensional fractal. Taking the observation dimension

Therefore, the calculated amount for generating the high-dimensional fractal based on the formula (7) is O (log N) ² ). 3) And (5) classification. The process mainly comprises the steps of calculating the difference degree of the flow to be measured and each center point>

And then classified according to the similarity. Because tr (f) _P (α)f _T (α) ^T )＝f _T (α) ^T f _P (α)，/>

The calculation amount is therefore O (LlogN), L being the number of classes. The overall algorithm complexity is then obtained as O (Nlog (N) + (log N) ² +Llog N). If M streams participate in classification, the time complexity is O (MNLog (N)).

On the other hand, consider the spatial complexity. And comparing the flow to be measured with L class center points and classifying, so that the storage space required by calculation is mainly used for storing each high-dimensional fractal. Taking the observation dimension

Therefore, the memory space required by the high-dimensional fractal is O (log N) ² ). The L class centers are added with the flow to be measured, so the spatial complexity is O ((M+L) (log N) ² )。

From the above, the time complexity and the space complexity of the HFM are relatively small, and the HFM is suitable for online flow classification detection. Specifically, the traditional fractal (such as FS fractal spectrum) is to generate fractal features after fusing space dimension and time dimension; the HFM method establishes high-dimensional fractal based on two dimensions of space and time, the high-dimensional fractal shows finer fractal characteristics of flow in the space dimension and the time dimension, the detailed characteristics enable HFM to be obtained only by 2000 data packets, the FS method is based on classification accuracy which can be achieved only by 10000 data packets, and the HFM greatly improves classification rate.

As shown in fig. 2, the present invention further provides a flow classification system, including:

an acquisition module 1, configured to acquire a network flow F to be classified _I Obtaining F _I A spatial sequence P and a temporal sequence T of (a);

a fractal module 2 for establishing fractal characteristics f in two dimensions of space and time based on the spatial sequence P and the time sequence T _P (alpha) and f _T (α), defining a high-dimensional fractal:

M＝f _P (α)*f _T (α) ^T

The high-dimensional fractal module 3 is used for defining a similarity measure according to the high-dimensional fractal M;

converting the vector matrix of similarity measures into similarity:

and the classification module 4 is used for classifying the network flows according to the similarity.

Specifically, the spatial sequence P and the time sequence T establish fractal features f in two dimensions of space and time _P (alpha) and f _T (α) specifically includes:

finally obtain

In a preferred embodiment of the invention, the high-dimensional fractal module is specifically configured to:

in a preferred embodiment of the present invention, the classification module is specifically configured to:

let there are L classes at present

The center point is determined by the following formula:

for network flow F _I ^a When classifying, calculating the similarity between the stream and each center point

The following operations were selected to be most similar:

The flow classification method, the system and the computer readable storage medium introduce the attention of the self-adaptive width of the whole layer, so that the model can adjust the global attention when adjusting the attention width of each layer, and the model can learn the optimal attention range. The feedforward layer with the gate control unit reduces the training steps of the model by three quarters, and the model converges to the optimal state more quickly. Compared with the traditional transformer, the method greatly saves the calculation and display memory cost while increasing the maximum visible context length of the model.

From the above description of the embodiments, it will be apparent to those skilled in the art that the present application may be implemented by means of software plus necessary general purpose hardware, or of course may be implemented by dedicated hardware including application specific integrated circuits, dedicated CPUs, dedicated memories, dedicated components and the like. Generally, functions performed by computer programs can be easily implemented by corresponding hardware, and specific hardware structures for implementing the same functions can be varied, such as analog circuits, digital circuits, or dedicated circuits. However, a software program implementation is a preferred embodiment in many cases for the present application. Based on such understanding, the technical solution of the present application may be embodied essentially or in a part contributing to the prior art in the form of a software product stored in a readable storage medium, such as a floppy disk, a usb disk, a removable hard disk, a ROM, a RAM, or an optical disk of a computer, etc., including several instructions to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to perform the method of the embodiments of the present application.

In the above embodiments, it may be implemented in whole or in part by software, hardware, firmware, or any combination thereof. When implemented in software, may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, the processes or functions in accordance with embodiments of the present application are produced in whole or in part. The computer may be a general purpose computer, a special purpose computer, a computer network, or other programmable apparatus. The computer instructions may be stored in a computer-readable storage medium or transmitted from one computer-readable storage medium to another computer-readable storage medium, for example, the computer instructions may be transmitted from one website, computer, server, or data center to another website, computer, server, or data center by a wired (e.g., coaxial cable, fiber optic, digital Subscriber Line (DSL)) or wireless (e.g., infrared, wireless, microwave, etc.). Computer readable storage media can be any available media that can be stored by a computer or data storage devices such as servers, data centers, etc. that contain an integration of one or more available media. Usable media may be magnetic media (e.g., floppy disks, hard disks, magnetic tape), optical media (e.g., DVD), or semiconductor media (e.g., solid state disk), among others.

Finally, it should be noted that: the foregoing description is only illustrative of the preferred embodiments of the present invention, and is not intended to limit the scope of the present invention. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention are included in the protection scope of the present invention.

Claims

1. A method of classifying traffic, comprising the steps of:

M＝f _P (α)*f _T (α) ^T

wherein the method comprises the steps of，f _P (alpha) and f _T The scale of the (α) observation is at least q=1 and at most

S3, defining a similarity measure according to the high-dimensional fractal M:

s4, converting the vector matrix of the similarity measure into similarity:

s5, classifying the network flows according to the similarity.

2. The traffic classification method according to claim 1, wherein the spatial sequence P and the temporal sequence T establish fractal features f in both spatial and temporal dimensions _P (alpha) and f _T (α) specifically includes:

Let x= { Xi, i=1, 2,..n } be a discrete random sequence and possess fractal characteristics; dividing the discrete sequence { X (i) } into m non-overlapping blocks, and carrying out merging operation on the blocks to obtain an m-order merging sequence:

finally, the method comprises the following steps:

3. the traffic classification method according to claim 1, wherein S4 specifically comprises:

4. the flow classification method according to claim 1, wherein S5 specifically includes:

s51: let there are L classes at present

There are several streams per class {.. F (F) _I ^j ，F _I ^k ,..}, center point is denoted +.>

The center point is determined by the following formula:

The following operations were selected to be most similar:

5. A traffic classification system, comprising:

M＝f _P (α)*f _T (α) ^T

The high-dimensional fractal module is used for defining a similarity measure according to the high-dimensional fractal M:

wherein M is _a Representing stream F _i ^a Is higher than the height of (1)Dimension fractal, M _b Representing stream F _i ^b Is a high-dimensional fractal of (2);

converting the vector matrix of similarity measures into similarity:

6. The traffic classification system according to claim 5, wherein said spatial sequence P and temporal sequence T establish fractal features f in both spatial and temporal dimensions _P (alpha) and f _T (α) specifically includes:

finally obtain

7. The traffic classification system according to claim 5, wherein said high-dimensional fractal module is specifically configured to:

8. the traffic classification system according to claim 5, wherein said classification module is specifically configured to:

let there are L classes at present

The center point is determined by the following formula:

The following operations were selected to be most similar:

9. Computer readable storage medium, characterized in that the storage medium has stored thereon a computer program which, when executed by a processor, implements the flow classification method according to any of claims 1-4.