CN117034112A

CN117034112A - Malicious network traffic classification method based on sample enhancement and contrast learning

Info

Publication number: CN117034112A
Application number: CN202311005429.5A
Authority: CN
Inventors: 陈铁明; 谢京希; 吕明琪; 朱添田
Original assignee: Zhejiang University of Technology ZJUT
Current assignee: Zhejiang University of Technology ZJUT
Priority date: 2023-08-10
Filing date: 2023-08-10
Publication date: 2023-11-10

Abstract

The invention belongs to the technical field of network security and deep learning, and particularly relates to a malicious network traffic classification method based on sample enhancement and contrast learning. Acquiring network traffic, extracting network traffic characteristics, constructing a network traffic sample set, preprocessing the network traffic characteristics, and constructing a pre-training set; training a basic model, calculating a loss value, calculating parameters of the basic model according to the loss value, and updating to obtain a contrast learning malicious flow classification model; and carrying out malicious traffic classification in the new task by adopting a contrast learning malicious traffic classification model. The invention adopts the basic model of the shallow neural network architecture occupied by light resources, so that the occupied resources are less, and the operation efficiency is high; the heuristic method of the contrast task is constructed by randomly covering network flow characteristics, so that data enhancement can be performed; the contrast learning malicious traffic classification model has good classification performance to distinguish the known network traffic categories, and can classify unknown small sample malicious network traffic.

Description

Malicious network traffic classification method based on sample enhancement and contrast learning

Technical Field

The invention relates to network security and deep learning technology, in particular to a malicious network traffic classification method based on sample enhancement and contrast learning.

Background

With the increasing popularity of the internet in modern life, more and more devices are communicating through a network, and the security of network space is receiving more attention. The traffic intrusion classification system is used for effectively classifying various malicious attacks on the network, and is one of key systems for maintaining network space safety. From a machine learning perspective, an intrusion classification system may be defined as a system that classifies network traffic. In short, it is to distinguish normal traffic from malicious traffic of the network. With the development of machine learning technology, a network malicious traffic classification method based on machine learning is widely paid attention to.

For the machine learning method, the intrusion classification system can accurately classify the test samples as long as enough samples are trained. However, today's network environment is constantly changing, new types of malicious traffic are endless, and it is difficult to obtain enough samples to train the model in a short time. The insufficient number of samples makes it difficult to fully train the machine-learned model, thereby affecting the effect of classifying malicious traffic.

Aiming at the problems, how to realize the classification of novel network malicious traffic by deep learning under the condition that the collected network traffic samples are less is a problem to be solved urgently.

Disclosure of Invention

The invention aims to solve the problem that the model of machine learning is difficult to sufficiently train due to insufficient sample number, further influences the effect of classifying malicious traffic, and provides a malicious network traffic classification method based on sample enhancement and contrast learning.

In order to achieve the above object, the malicious network traffic classification method based on sample enhancement and contrast learning includes:

obtaining N types of network traffic, wherein each type of network traffic comprises a plurality of samples, extracting a plurality of network traffic characteristics of each sample to form a characteristic vector corresponding to the sample, and constructing a network traffic sample set, wherein the network traffic sample set comprises network traffic and corresponding network traffic characteristics;

preprocessing the network flow characteristics to obtain an enhancement set of each sample;

taking union sets for enhancement sets of all samples contained in each type of network traffic, traversing other samples in the same union set for each sample in each union set to form positive sample pairs, taking one other union set traversing sample to form negative sample pairs, setting labels for each sample pair, and taking the obtained set of the positive and negative sample pairs as a pre-training set;

constructing a basic model, normalizing the feature vectors of two samples in a sample pair in a pre-training set, and then taking the normalized feature vectors as the input of the basic model to obtain two processed feature vectors of the sample pair;

calculating the similarity of the two processed feature vectors of the sample pair, identifying the label of the sample pair to obtain a label judgment value, calculating a loss value, calculating the parameters of a basic model according to the loss value, and updating to obtain a contrast learning malicious flow classification model;

and carrying out malicious traffic classification in the new task by adopting a contrast learning malicious traffic classification model.

Further, the preprocessing the network traffic characteristics to obtain an enhanced set of each sample includes:

training n network traffic classification models on a network traffic sample set, and inputting feature vectors of the jth class of network traffic into the ith network traffic classification model to obtain classification accuracy acc (i, j);

inputting the d network traffic characteristics into an i network traffic classification model to obtain importance weight (i, d), wherein the d network traffic characteristics belong to the j network traffic class;

importance weights I (j, d) are calculated according to the classification accuracy acc (I, j) and the importance weights weight (I, d), and are expressed as follows:

wherein I (j, d) represents importance weight of the feature of the d-th network traffic to the j-th class of network traffic;

the a-th sample s in the network traffic sample set _a Is characterized by a feature vector x _a The sample s _a The method belongs to the j-th class of network traffic, calculates the covering probability of each network traffic characteristic in the j-th class of network traffic, and uses the formula to express as follows:

wherein P (j, d) represents the mask probability of the d-th network traffic feature for all samples of the j-th class of network traffic;

in the feature vector x according to the mask probability _a Randomly selecting L network traffic characteristics to mask, and obtaining a sample s after masking _a Repeating the operations of randomly selecting and masking m times to obtain m enhanced samples, and combining the feature vector x _a And a set of m enhanced samples as samples s _a Is described.

Further, for all positive sample pairs formed by the same and pooled samples, at least one sample between any two positive sample pairs is not repeated.

Further, the labeling each sample pair includes:

the label of the positive sample pair is set to 1 and the label of the negative sample pair is set to 0.

Further, a basic model is constructed, the feature vectors of two samples in a sample pair in a pre-training set are normalized and then used as the input of the basic model, and the two processed feature vectors of the sample pair are obtained, and the method comprises the following steps:

constructing a basic model by adopting a multi-layer perceptron, and normalizing the eigenvector x of the sample pair _p And feature vector x _q As a basic modelThe base model updates parameters for each layer of the multi-layer perceptron based on the following formula:

X ^(l+1) ＝σ(A ^(l) X ^(l) +b ^(l) )

wherein A is ^(l) Trainable parameter matrix for the first layer of the multi-layer perceptron, b ^(l) Is the parameter vector of the first layer of the multi-layer perceptron, X ^(l) X is the output of the first layer of the multi-layer perceptron ^(l+1) For the output of the first layer +1 of the multi-layer perceptron, sigma (…) is an activation function;

obtaining a characteristic vector x _p And feature vector x _q Feature vector processed by multi-layer perceptronAnd->

Further, the calculating the similarity of the two processed feature vectors of the sample pair includes:

the two processed eigenvectors of the sample pair are compared using a cosine similarity function, formulated as follows:

wherein,similarity value representing two feature vectors, ranging from [ -1,1]，/>Representing feature vector x _p Feature vector processed by multi-layer perceptron, < >>Representing feature vector x _q The feature vector is processed by the multi-layer perceptron;

to make the similarity valueScale to [0,1 ]]Expressed by the formula:

wherein,a scaled value representing the similarity value.

Further, the calculating the loss value, calculating the parameters of the basic model according to the loss value and updating, includes:

the loss value is calculated by a binary cross entropy loss function and formulated as follows:

wherein, loss_L2 (x _p ,x _q ) Representing feature vector x _p And feature vector x _q Alpha represents a regularization factor, W represents all weight sums of the base model, and y represents a tag judgment value;

the gradient is calculated using back propagation and the parameters of the neural network are calculated and updated using gradient descent.

Further, the step of classifying the malicious traffic in the new task by adopting the contrast learning malicious traffic classification model includes:

acquiring a trained contrast learning malicious flow classification model;

collecting training samples in a new task, wherein at least one malicious traffic type in the training samples is not repeated with the malicious traffic type in the network traffic sample set;

and carrying out malicious traffic classification in a new task by adopting a contrast learning malicious traffic classification model, carrying out contrast learning on a training sample and a sample to be detected, and selecting a label with highest output probability as a prediction label.

Compared with the prior art, the invention has the remarkable advantages that: 1. the base model of the shallow neural network architecture occupied by light resources is adopted, so that the occupied resources are few, and the operation efficiency is high. 2. The heuristic method for constructing the comparison task by randomly covering the network flow characteristics can be used for data enhancement, the structure of the characteristic vector before covering can be effectively reserved, the effectiveness of the enhanced data is ensured, and the training effect of the basic model is improved. 3. Based on the architecture of contrast learning, the contrast learning malicious traffic classification model has good classification performance to distinguish the class of the known network traffic, and can classify the unknown small sample malicious network traffic.

Drawings

FIG. 1 is a flow chart of a small sample malicious network traffic classification method based on sample enhancement and contrast learning according to the invention;

fig. 2 is a flow chart of the network traffic construction training set of the present invention.

Detailed Description

The present invention will be described in further detail with reference to the drawings and examples, in order to make the objects, technical solutions and advantages of the present invention more apparent. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention.

The invention provides a small sample malicious network traffic classification method based on sample enhancement and contrast learning, which is shown in fig. 1, and comprises the following steps:

(1) And grabbing the network traffic data, extracting the network traffic characteristics from the network traffic data, and finally enhancing the network traffic characteristics and constructing a pre-training set.

(1-1) network traffic feature extraction: first, a network traffic capture tool (e.g., tcpdump, wireshark) is used to capture and form a network traffic file (e.g., a pcap file). Then, a network traffic analysis tool (such as a CICFlowMeter) is adopted to analyze and statistically analyze the network traffic files to form network traffic characteristics, a plurality of network traffic characteristics are extracted from each network traffic file (i.e. sample), the types of the extracted network traffic characteristics of each network traffic file are the same, the number of the extracted network traffic characteristics is the same, and if the network traffic characteristics are not extracted for a certain network traffic characteristic type, the network traffic characteristics are set to 0. The number of network traffic characteristics is denoted D. And obtaining a network traffic sample set S (comprising N types of network traffic and corresponding network traffic characteristics, wherein the N types of network traffic are normal network traffic and N-1 types of malicious network traffic).

(1-2) as shown in fig. 2, the network traffic characteristics are enhanced, and a pre-training set is constructed, which comprises the following specific steps:

(1-2-1) calculating the importance weight of the network traffic characteristics: firstly, n machine learning algorithms are selected, and n network traffic classification models C are trained on S ₁ 、C ₂ 、…、C _n C is carried out by _i The classification accuracy of the j-th class of network traffic is denoted acc (i, j), where i is [1, n ]]，j∈[1,N]. C is C _i The importance weight of the calculated d-th network traffic feature is named weight (i, d), and the d-th network traffic feature belongs to the j-th network traffic. Then, the importance weight I (j, d) of the feature of the d-th network traffic to the j-th network traffic is calculated according to the formula (1).

(1-2-2) sample enhancement: first, for each sample s of each type of network traffic _a Its feature vector is denoted as x _a Let s assume _a And (3) calculating the covering probability of each network traffic characteristic d according to the formula (2) belonging to the j-th class of network traffic. Then, at x according to the mask probability _a Randomly selecting L network traffic characteristics to mask, wherein the default value of L is 10, namely the value of the selected network traffic characteristics is set to be 0, and the masked characteristic vector is s _a Is included. Repeating the above random pick and mask operation m times, then each network traffic sample s _a Generating m enhanced samples, and combining the feature vectors x _a The set of m enhanced samples is called s _a Is denoted AS AS _a 。

(1-2-3) constructing a pre-training set: 1) For one type of network traffic, taking the union of the enhancement sets of all samples contained in the one type of network traffic, and recording the union of the enhancement sets of the j type of network traffic as CAS _j . 2) For CAS (CAS) _j One sample s of (a) _jb Taking CAS _j Any one of the other samples s _jk And s is equal to _jb Form a positive sample pair (s _jb ,s _jk ) The tag is set to 1; union CAS that takes the enhancement set of other network traffic types _o Any one sample s in (o+.j) _ok And s is equal to _jb Forms a negative pair of samples (s _jb ,s _ok ) The tag is set to 0. 3) Repeating step 2) for CAS _j Traversing the CAS _j Other samples in (1) form positive sample pairs, traverse CAS _o The negative pairs are formed by all samples in (a). 4) Repeating the step 1) until all the samples in the union of the enhanced sets of the network traffic types are traversed. Positive sample pair generation for the same union sample, for positive sample pairs that are repeated with both samples in existing positive sample pairs, may be discarded directly after traversal, or may be deleted after traversal (e.g., positive sample pair (s _j1 ,s _j2 ) Sum(s) _j2 ,s _j1 ) Both belong to repeated pairs of samples, one is reserved). The final set of positive and negative sample pairs is denoted PNS as a pre-training set.

(2) Model training based on contrast learning: and constructing a basic model by adopting a shallow neural network, following a comparison learning method, and pre-training the model by adopting regularization and other technologies to obtain a comparison learning model.

(2-1) neural network base model definition: constructing a basic model by adopting a multi-layer perceptron (MLP), wherein the input of the basic model is the characteristic vector x of a certain positive and negative sample after normalization to the network flow _p And x _q The basic model updates parameters of each layer of the network flow based on the formula (3) to finally obtain x _p And x _q And the feature vector is processed by the multi-layer perceptron.

X ^(l+1) ＝σ(A ^(l) X ^(l) +b ^(l) ) (3)

Wherein A is ^(l) Trainable parameter matrix for the first layer of the multi-layer perceptron, b ^(l) Is the parameter vector of the first layer of the multi-layer perceptron, X ^(l) X is the output of the first layer of the multi-layer perceptron ^(l+1) For the output of layer 1+1 of the multi-layer perceptron, σ (…) is the activation function.

(2-2) comparing the eigenvectors of the two network traffic using a cosine similarity function, see equation (4).

Wherein,similarity value representing two feature vectors, ranging from [ -1,1]，/>Represents x _p Feature vector processed by multi-layer perceptron, < >>Represents x _q And the feature vector is processed by the multi-layer perceptron.

(2-3) passing through equation (5)Scaling the range value to [0,1 ]]。

Wherein,a scaled value representing the similarity value.

And (2-4) after obtaining the scaling value, identifying the label of the input sample pair to obtain a label judgment value y, wherein the label judgment value y in the embodiment is label 0 or 1 of the sample pair. And calculating a loss value by adopting a binary cross entropy loss function, and adding L2 regularization, see formula (6). After the loss value is obtained, the gradient is calculated by back propagation, and the parameters of the multi-layer perceptron are calculated and updated by gradient descent.

Where y represents a tag determination value, los_l2 (x _p ,x _q ) Representing the loss value, α represents the regularization factor, and W represents all the weight sums of the base model.

And (2-5) training a large number of positive sample pairs and negative sample pairs to obtain a contrast learning malicious flow classification model.

(3) Network traffic small sample classification: and for the target task, performing malicious flow small sample classification in the target task of the small sample by adopting a contrast learning malicious flow classification model.

(3-1) model initialization: and obtaining a trained contrast learning malicious flow classification model.

(3-2) data input: training samples in the new task (a small number of training samples containing new malicious traffic types, at least one malicious traffic type in the training samples not being repeated with the malicious traffic types in the network traffic sample set) are collected. The training samples in the new task are collected logically the same as steps (1-1) through (1-2-3).

(3-3) model implementation: and carrying out malicious traffic classification in a new task by adopting a contrast learning malicious traffic classification model, carrying out contrast learning on a training sample and a sample to be detected, and selecting a label with highest output probability as a prediction label.

And (3) in the small sample training process, carrying out malicious traffic classification in a new task by adopting a contrast learning malicious traffic classification model, wherein the logic of the steps is the same as that of the steps (2-1) to (2-4), and the network traffic sample set is replaced by a training sample.

And in the small sample classification process, when the training samples and the samples to be detected are subjected to contrast learning, taking samples in the training samples and the samples in the samples to be detected to form sample pairs, inputting feature vectors of the sample pairs into a contrast learning malicious flow classification model, calculating the similarity of the two processed feature vectors of the sample pairs, and taking the label (type) of the training samples in the sample pair with the highest similarity (highest output probability) as the type of the samples to be detected.

The above examples merely represent one or several embodiments of the present invention, which are described in more detail and are not to be construed as limiting the scope of the invention. It should be noted that it will be apparent to those skilled in the art that several variations and modifications can be made without departing from the spirit of the invention, which are all within the scope of the invention. Accordingly, the scope of protection of the present invention is to be determined by the appended claims.

Claims

1. The malicious network traffic classification method based on sample enhancement and contrast learning is characterized by comprising the following steps of:

2. The malicious network traffic classification method based on sample enhancement and contrast learning according to claim 1, wherein the preprocessing of the network traffic features to obtain an enhancement set of each sample includes:

3. The method of sample-enhanced and contrast learning-based malicious network traffic classification of claim 1, wherein at least one sample between any two positive sample pairs is not repeated for all positive sample pairs formed from the same pooled sample.

4. The method for classifying malicious network traffic based on sample enhancement and contrast learning according to claim 1, wherein the step of labeling each sample pair comprises:

5. The malicious network traffic classification method based on sample enhancement and contrast learning according to claim 1, wherein constructing a basic model, normalizing feature vectors of two samples in a sample pair in a pre-training set, and then taking the normalized feature vectors as an input of the basic model, to obtain two processed feature vectors of the sample pair, comprises:

constructing a basic model by adopting a multi-layer perceptron, and normalizing the eigenvector x of the sample pair _p And feature vector x _q As an input to the base model, the base model updates parameters for each layer of the multi-layer perceptron based on the following formula:

X ^(l+1) ＝σ(A ^(l) X ^(l) +b ^(l) )

6. The malicious network traffic classification method based on sample enhancement and contrast learning of claim 5, wherein the calculating the similarity of two processed feature vectors of a sample pair comprises:

wherein,similarity value representing two feature vectors, ranging from [ -1,1]，/>Representing feature vector x _p Feature vector processed by multi-layer perceptron, < >>Representing feature vector x _q Through the multi-layer sensing machineThe processed feature vector;

to make the similarity valueScale to [0,1 ]]Expressed by the formula:

wherein,a scaled value representing the similarity value.

7. The malicious network traffic classification method based on sample enhancement and contrast learning according to claim 6, wherein the calculating the loss value, calculating parameters of the basic model according to the loss value and updating, comprises:

8. The malicious network traffic classification method based on sample enhancement and contrast learning according to claim 1, wherein the malicious traffic classification in a new task using a contrast learning malicious traffic classification model comprises:

acquiring a trained contrast learning malicious flow classification model;