CN114785623A

CN114785623A - Network intrusion detection method and device based on discretization characteristic energy system

Info

Publication number: CN114785623A
Application number: CN202210703944.XA
Authority: CN
Inventors: 许成程; 翟江涛; 刘光杰; 戴跃伟
Original assignee: Nanjing University of Information Science and Technology
Current assignee: Nanjing University of Information Science and Technology
Priority date: 2022-06-21
Filing date: 2022-06-21
Publication date: 2022-07-22

Abstract

The invention relates to the technical field of network traffic identification, and in particular to a network intrusion detection method and device based on a discretization characteristic energy system. The method comprises the following steps: collecting normal network flow data and dividing it into network flows according to quintuple information; preprocessing the network flow features; discretizing the features using a feature discretization module; constructing a flow classifier based on the discretization characteristic energy system; and inputting the flow to be detected into the flow classifier and determining the nature of the network flow according to a threshold value. Using only normal network flows for training, the method and device can effectively classify the data under test as normal flow or malicious flow.

Description

Network intrusion detection method and device based on discretization characteristic energy system

Technical Field

The invention relates to the technical field of network traffic identification, in particular to a network intrusion detection method and a network intrusion detection device based on a discretization characteristic energy system.

Background

Traffic classification assigns network traffic to specific categories according to requirements, and has become a crucial component of cyberspace security management. For example, in network management, traffic may be classified by priority to guarantee the quality of service of the network. In cyberspace security, traffic is generally divided into normal traffic and malicious traffic for the purpose of network anomaly detection. In recent years, with the wide adoption of encryption in network applications, traffic encryption has become the mainstream trend. In particular, much malware encrypts its traffic with techniques such as TLS to evade detection by firewalls and network intrusion detection systems. These practices pose new challenges to traditional traffic classification approaches.

Traffic encryption techniques can be divided into application layer encryption, presentation layer encryption, and network layer encryption, depending on the layer at which encryption is applied. Application layer encryption, also known as conventional encryption, means that the application program implements its own secure data transmission protocol at the application layer. Presentation layer encryption and network layer encryption mean that the entire data packet from the upper layer is encrypted; typical technologies are tunneling technologies such as TLS and IPsec, on which VPNs, for example, are based. This type of encryption is also known as protocol encapsulation. In some cases, traffic that has already been encrypted by conventional encryption is encrypted again through protocol encapsulation.

In recent years, various classifiers based on conventional machine learning and deep learning have been proposed one after another. These flow-based classifiers can achieve very high accuracy. However, machine learning based classifiers need to be trained on labeled malicious traffic samples, and labeling real traffic is difficult, especially for malicious traffic. In addition, after being trained on a specific data distribution, a machine learning based classifier often performs poorly and shows weak domain adaptation capability when applied to other data with a slightly different distribution.

Disclosure of Invention

The present invention aims to provide a network intrusion detection method and device based on a discretization characteristic energy system, so as to solve the problems proposed in the background art.

The technical scheme of the invention is as follows: the network intrusion detection method based on the discretization characteristic energy system comprises the following steps:

step 1, collecting normal network flow data, and dividing network flow according to quintuple information;

step 2, preprocessing network flow characteristics;

step 3, discretizing the features by using a feature discretization module;

step 4, constructing a flow classifier based on a discretization characteristic energy system;

step 5, inputting the flow to be detected into the flow classifier based on the discretization characteristic energy system, and determining the nature of the network flow according to a threshold value.

Preferably, in step 1, the normal network traffic is captured by the flow collector using the Wireshark tool, exists in PCAP format, and is stored after being divided into flows according to the quintuple information SrcIP, SrcPort, DstIP, DstPort, and Protocol.
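The flow division of step 1 can be sketched as follows. This is a minimal illustration, not the collector described in the patent: the `Packet` record and its field names are hypothetical, packets are assumed to be already parsed from the PCAP file, and each direction of a connection is treated as its own flow.

```python
from collections import defaultdict
from typing import NamedTuple


class Packet(NamedTuple):
    """Hypothetical parsed packet record; field names are illustrative only."""
    src_ip: str
    src_port: int
    dst_ip: str
    dst_port: int
    protocol: str
    size: int


def split_into_flows(packets):
    """Group packets by (SrcIP, SrcPort, DstIP, DstPort, Protocol)."""
    flows = defaultdict(list)
    for pkt in packets:
        key = (pkt.src_ip, pkt.src_port, pkt.dst_ip, pkt.dst_port, pkt.protocol)
        flows[key].append(pkt)
    return dict(flows)


packets = [
    Packet("10.0.0.1", 50000, "10.0.0.2", 443, "TCP", 120),
    Packet("10.0.0.1", 50000, "10.0.0.2", 443, "TCP", 1460),
    Packet("10.0.0.3", 53000, "10.0.0.2", 53, "UDP", 80),
]
flows = split_into_flows(packets)
```

Whether the two directions of a connection should be merged into one flow is a design choice that the text does not specify.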

Preferably, in step 2, the network flow characteristic preprocessing includes the following steps:

step 2.1, inputting the network flow data into a feature extraction tool to obtain the flow statistical feature vector;

step 2.2, inputting the data packet size sequence into a multilayer perceptron network to extract the packet sequence features, and performing local feature amplification to obtain the packet sequence feature vector;

step 2.3, inputting the preprocessed original bytes of the network flow into a convolutional neural network to extract the original byte features, and performing local feature amplification to obtain the original byte feature vector;

step 2.4, combining the flow statistical features, the packet sequence features, and the original byte features to obtain a mixed feature tuple.

Preferably, in step 2.1, the flow statistical feature extraction uses the feature extraction tool CICFlowMeter to extract features from the divided network flows and uses the XGBoost method to perform feature dimension reduction: the value of each feature is traversed in turn and evaluated with an objective function composed of a loss function and a regularization penalty term, and the feature points that minimize the objective function are found, thereby obtaining the flow statistical feature tuple. The objective function is shown in formula (1). In formula (1), the loss function describes the difference between the true value and the predicted value; the tree model term is the tree model generated for a sample in the current round of fitting; the first-order and second-order coefficients are the first derivative and the second derivative of the loss function; and the penalty function is determined by the number of leaves of the tree model, a learning rate parameter, the prediction of the decision tree for the input sample, a constant parameter that controls the size of the penalty term, and the predicted value of each leaf node of the decision tree:

(1);
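For reference, a standard form of the XGBoost second-order objective that is consistent with the definitions given for formula (1) can be written as below. This is a sketch rather than a reproduction of formula (1): the symbol names (l, Ω, g_i, h_i, f_t, T, γ, λ, w_j) are assumptions taken from the usual XGBoost notation, and γ here is the per-leaf complexity parameter that the text refers to as a learning rate.

```latex
\mathrm{Obj}^{(t)} \approx \sum_{i=1}^{n} \Big[\, l\big(y_i, \hat{y}_i^{(t-1)}\big)
    + g_i\, f_t(x_i) + \tfrac{1}{2}\, h_i\, f_t^{2}(x_i) \Big] + \Omega(f_t),
\qquad
\Omega(f_t) = \gamma T + \tfrac{1}{2}\, \lambda \sum_{j=1}^{T} w_j^{2},

g_i = \frac{\partial\, l\big(y_i, \hat{y}_i^{(t-1)}\big)}{\partial\, \hat{y}_i^{(t-1)}},
\qquad
h_i = \frac{\partial^{2} l\big(y_i, \hat{y}_i^{(t-1)}\big)}{\partial \big(\hat{y}_i^{(t-1)}\big)^{2}}
```

Here f_t(x_i) is the tree fitted in round t evaluated on sample x_i, T is the number of leaves, and w_j is the predicted value of the j-th leaf.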

Preferably, in step 2.2, the packet sequence feature extraction uses a multilayer perceptron network to extract features from the data packet size sequence of the network flow and outputs a feature tuple; a linear mapping with local feature amplification, shown in formula (2), is applied between the fully connected layers to obtain an amplified feature tuple, which is added to the output feature tuple as an augmentation matrix; the multilayer perceptron network structure has three fully connected layers:

(2).

Preferably, in step 2.3, the original byte feature extraction inputs the original bytes of the network flow into a convolutional neural network to extract a feature tuple; a linear mapping with local feature amplification is applied between the fully connected layers to obtain an amplified feature tuple, which is added to the extracted feature tuple as an augmentation matrix; the convolutional neural network structure comprises two convolutional layers, two pooling layers, and two fully connected layers. In step 2.4, the feature fusion combines the flow statistical features after dimension reduction, the packet sequence features extracted by the multilayer perceptron, and the original byte features extracted by the convolutional neural network into a mixed feature tuple.

Preferably, in step 3, the feature discretization forms an ordered array from the values of each feature to represent its global distribution; taking the minimum value and the maximum value of the feature as endpoints, the value range of each feature array is divided into equal parts, and two additional intervals are added, forming H valid feature intervals.
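The discretization of step 3 can be sketched as follows. This is a minimal illustration under a stated assumption: the two additional intervals are taken to cover values below the observed minimum and above the observed maximum, which the text does not spell out. The function names and the `inner_bins` parameter are hypothetical; with `inner_bins` equal-width bins the total number of intervals is H = inner_bins + 2.

```python
from bisect import bisect_right


def build_edges(values, inner_bins):
    """Return equal-width bin edges spanning [min(values), max(values)]."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / inner_bins
    # Use hi itself as the last edge to avoid floating-point drift.
    return [lo + k * width for k in range(inner_bins)] + [hi]


def discretize(x, edges):
    """Map x to an interval index in 0 .. len(edges).

    Index 0 is the extra interval below the minimum, index len(edges) is the
    extra interval above the maximum, and indices 1 .. len(edges) - 1 are the
    equal-width bins between the minimum and the maximum.
    """
    if x < edges[0]:
        return 0
    if x > edges[-1]:
        return len(edges)
    # Clamp so that x equal to the maximum falls in the last equal-width bin.
    return min(bisect_right(edges, x), len(edges) - 1)


edges = build_edges([0.0, 4.0, 10.0], inner_bins=4)
indices = [discretize(x, edges) for x in (-1.0, 0.0, 2.5, 10.0, 11.0)]
```

The edges would be computed once from normal training traffic and then reused unchanged for flows seen at detection time.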

Preferably, in step 4, the constructing of the flow classifier based on the discretized characteristic energy system includes the following steps:

step 4.1, establishing a network flow-energy field system: using only the reconstructed features of the normal flows and their value tuples, each feature is mapped to a particle in an energy field, and all the features and their values together form the energy field;

step 4.2, a feature probability statistics module calculates the frequency of each feature value among all feature values and the frequency with which value combinations of multiple features occur together;

step 4.3, a Hamiltonian energy calculation module calculates the energy of the whole network flow, so as to obtain the energy characteristics of the normal samples and finally determine a threshold value.

Preferably, in step 4.1, the network flow energy field system is constructed by instantiating each normal network flow and its feature tuple: taking (A1, …, AN) as the feature N-tuple, flow k can be instantiated as (ak1, …, akN), where each aki takes its value from the set of possible values of the corresponding feature, and the size of the value range of every feature is the number of intervals; a system consisting of a plurality of feature nodes is established, in which the nodes are mutually associated, each node has a local energy, and the nodes have coupling energy with each other. In step 4.2, the network flow feature probability statistics module computes, as shown in formulas (3) and (4), the probability that a feature takes a given value and the joint probability that a pair of features takes a given pair of values:

(3)

(4);

In step 4.3, the Hamiltonian energy calculation module first calculates the coupling energy of each feature-pair value combination and then calculates the local energy of each feature value in the network flow, where the values are the interval values after feature discretization; the covariance matrix obtained from the single-feature probabilities and the joint probabilities is shown in formula (5), the coupling energy in formula (6), and the local energy in formula (7):

(5)

(6)

(7);

Finally, the Hamiltonian of each network flow is calculated. The Hamiltonian, obtained from the local energies and the coupling energies, represents the total energy of each flow: as shown in formula (8), the negative of the sum of the local energies of all feature nodes and the coupling energies is the total energy of the flow. The energy value of the sample at the 95th percentile of the energy distribution of the normal flow samples is taken as the preset threshold:

(8).
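The probability statistics of step 4.2 can be sketched as relative frequencies over the discretized normal flows. This is an illustration only: it assumes formulas (3) and (4) are plain frequency estimates without smoothing, and the function names and the dictionary key layout are hypothetical.

```python
from collections import Counter
from itertools import combinations


def single_probabilities(flows):
    """P_i(a): fraction of flows whose feature i takes discretized value a."""
    n = len(flows)
    counts = Counter((i, a) for flow in flows for i, a in enumerate(flow))
    return {key: c / n for key, c in counts.items()}


def joint_probabilities(flows):
    """P_ij(a, b): fraction of flows with feature i == a and feature j == b, i < j."""
    n = len(flows)
    counts = Counter(
        (i, j, flow[i], flow[j])
        for flow in flows
        for i, j in combinations(range(len(flow)), 2)
    )
    return {key: c / n for key, c in counts.items()}


# Four toy flows, each already discretized into three interval indices.
flows = [(1, 2, 0), (1, 2, 1), (1, 3, 1), (2, 3, 1)]
p_single = single_probabilities(flows)
p_joint = joint_probabilities(flows)
```

Keys are `(feature_index, value)` for single probabilities and `(i, j, value_i, value_j)` for joint probabilities; combinations that never occur are simply absent from the dictionaries.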

Preferably, in step 5, the flow to be detected is processed as follows: the flow classifier monitors the network and waits for a network flow to be captured; when a network flow is captured, its Hamiltonian energy is calculated according to formula (8) and compared with the preset threshold; if the energy exceeds the preset threshold, the network flow is classified as a malicious flow and intercepted; otherwise, it is classified as a normal flow and allowed to pass, and the flow classifier waits for the next flow.
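The decision rule of step 5 can be sketched as follows. The energies are assumed to have been computed already, and the function names and the action strings are hypothetical.

```python
def classify_flow(energy, threshold):
    """Return 'malicious' if the flow energy exceeds the threshold, else 'normal'."""
    return "malicious" if energy > threshold else "normal"


def monitor(energies, threshold):
    """Handle captured flows one at a time and record the action taken for each."""
    actions = []
    for energy in energies:
        label = classify_flow(energy, threshold)
        actions.append("intercept" if label == "malicious" else "pass")
    return actions


actions = monitor([-3.2, 1.5, -0.1], threshold=0.0)
```

A flow whose energy equals the threshold is treated as normal here, since the text only intercepts flows whose energy exceeds it.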

The device applied to the network intrusion detection method based on the discretization characteristic energy system comprises a flow classifier and a flow collector.

Compared with the prior art, the network intrusion detection method and device based on the discretization characteristic energy system provided by the invention have the following improvements and advantages:

the network intrusion detection method and device based on the discretization characteristic energy system can effectively classify the data to be tested into normal flow or malicious flow on the premise of only using normal network flow.

Drawings

The invention is further explained below with reference to the figures and examples:

FIG. 1 is a flow chart of a network intrusion detection method based on a discretized characteristic energy system according to the present invention;

FIG. 2 is a schematic diagram of feature extraction and feature fusion according to the present invention;

FIG. 3 is a schematic diagram of the amplification of fuzzy local features according to the present invention;

FIG. 4 shows the representation of a network flow mapped to an energy field structure according to the present invention;

FIG. 5 is a schematic diagram of a discretized feature energy architecture framework of the present invention;

FIG. 6 is a diagram illustrating the distribution of energy in normal and malicious network flows according to the present invention.

Detailed Description

The present invention is described in detail below, and the technical solutions in the embodiments of the present invention are described clearly and completely. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person skilled in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention.

The invention provides a network intrusion detection method and device based on a discretization characteristic energy system. The technical scheme of the invention is as follows:

As shown in FIG. 1 to FIG. 6, the network intrusion detection method based on the discretization characteristic energy system includes the following steps:

step 1, collecting normal network flow data, and dividing network flows according to quintuple information; the normal network traffic is captured by a flow collector using the Wireshark tool, exists in PCAP format, and is stored after being divided into flows according to the quintuple information SrcIP, SrcPort, DstIP, DstPort, and Protocol;

step 2, preprocessing network flow features; the feature extraction and fusion extracts and fuses features of different dimensions from each network flow, and comprises the stages of flow statistical feature extraction and dimension reduction, data packet sequence feature extraction, original byte feature extraction by a convolutional neural network, and local feature amplification after linear mapping;

step 3, discretizing the features by using a feature discretization module;

In step 2, the network flow characteristic preprocessing includes the following steps:

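step 2.1, inputting the network flow data into a feature extraction tool to obtain the flow statistical feature vector;

step 2.2, inputting the data packet size sequence into a multilayer perceptron network to extract the packet sequence features, and performing local feature amplification to obtain the packet sequence feature vector;

step 2.3, inputting the preprocessed original bytes of the network flow into a convolutional neural network to extract the original byte features, and performing local feature amplification to obtain the original byte feature vector;

step 2.4, combining the flow statistical features, the packet sequence features, and the original byte features to obtain a mixed feature tuple.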

In step 2.1, the flow statistical feature extraction uses the feature extraction tool CICFlowMeter to extract features from the divided network flows and uses the XGBoost method to perform feature dimension reduction: the value of each feature is traversed in turn and evaluated with an objective function composed of a loss function and a regularization penalty term, and the feature points that minimize the objective function are found, thereby obtaining the flow statistical feature tuple, which includes the port number, protocol type, flow byte rate, flow duration, forward and backward inter-arrival times, maximum value, minimum value, average value, variance, and the like. The objective function is shown in formula (1). In formula (1), the loss function describes the difference between the true value and the predicted value; the tree model term is the tree model generated for a sample in the current round of fitting; the first-order and second-order coefficients are the first derivative and the second derivative of the loss function; and the penalty function is determined by the number of leaves of the tree model, a learning rate parameter, the prediction of the decision tree for the input sample, a constant parameter that controls the size of the penalty term, and the predicted value of each leaf node of the decision tree:

(1);

In step 2.2, after the hidden layers of the multilayer perceptron network extract features from the data packet size sequence of the network flow, the first fully connected layer outputs a feature tuple of dimension 1 × 16. A linear mapping with local feature amplification, shown in formula (2), is applied between the two fully connected layers; this mapping amplifies fuzzy local features and thus plays the role of attention. The amplified feature tuple obtained after the fully connected layer is added to the first feature tuple as an augmentation matrix to form the packet sequence feature tuple. The multilayer perceptron network structure contains two hidden layers and two fully connected layers, and each hidden layer has 214 neurons:

(2).

In step 2.3, after the original bytes of the network flow are input into the convolutional neural network, a feature tuple is extracted at the first fully connected layer following the convolutional layers. After a linear mapping with local feature amplification between the two fully connected layers, an amplified feature tuple is obtained at the second fully connected layer and is added to the first feature tuple as an augmentation matrix to form the original byte feature tuple. The convolutional neural network structure comprises two convolutional layers, two pooling layers, and two fully connected layers; the activation function is ReLU, and a Dropout layer is placed between the pooling layers to prevent overfitting.

In step 3, the feature discretization forms an ordered array from the values of each feature to represent its global distribution; taking the minimum value and the maximum value of the feature as endpoints, the value range of each feature array is divided into equal parts, and two additional intervals are added, forming H valid feature intervals.

In step 4, the flow classifier based on the discretization characteristic energy system is constructed by the following steps:

step 4.1, establishing the relation between the network flow and the energy field; the energy system describes a network flow using the concept of an energy field from quantum mechanics: a single flow is represented by the feature tuple discretized in step 3 and is composed of its features; each feature has a set of all possible values and a local energy, and there is a set of all possible feature pairs, so that different network flows can be represented by combinations of different feature values. A graph of a plurality of feature nodes is created, in which the nodes are related to each other and have an associated coupling energy determined by a coupling function;

step 4.2, feature probability and covariance matrix; the probability that a feature takes a given value and the joint probability that a pair of features takes a given pair of values are calculated, and the covariance matrix is obtained from them; in order to eliminate the influence of indirect correlation in the data, the inverse matrix of the covariance matrix is used in the next calculation;
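One entry of the covariance matrix of step 4.2 can be sketched as the covariance between the indicator variables "feature i takes value a" and "feature j takes value b", which equals P_ij(a, b) − P_i(a)·P_j(b). Whether formula (5) uses exactly this form, for example with added pseudocounts, is an assumption, and the function name and key layout below are hypothetical. The subsequent matrix inversion is not shown.

```python
def covariance(p_single, p_joint, i, a, j, b):
    """C_ij(a, b) = P_ij(a, b) - P_i(a) * P_j(b) for feature indices i < j.

    Value combinations that never occurred are treated as probability 0.
    """
    joint = p_joint.get((i, j, a, b), 0.0)
    return joint - p_single.get((i, a), 0.0) * p_single.get((j, b), 0.0)


# Toy probabilities for two features; keys are (feature, value) and (i, j, a, b).
p_single = {(0, 1): 0.75, (0, 2): 0.25, (1, 2): 0.5, (1, 3): 0.5}
p_joint = {(0, 1, 1, 2): 0.5, (0, 1, 1, 3): 0.25, (0, 1, 2, 3): 0.25}

c_seen = covariance(p_single, p_joint, 0, 1, 1, 2)
c_unseen = covariance(p_single, p_joint, 0, 2, 1, 2)
```

A positive entry means the two values co-occur more often than independence would predict; a negative entry means they co-occur less often.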

step 4.3, calculating coupling energy; the coupling energy is calculated from the feature probabilities and the covariance matrix of step 4.2;

step 4.4, calculating local energy; the local energy is calculated from the coupling energy of step 4.3 and the feature probabilities and covariance matrix of step 4.2;

step 4.5, calculating network flow energy; the energy of a single flow is calculated as the negative of the sum of the coupling energies between its features from step 4.3 and the local energies from step 4.4. FIG. 4 shows the network flow representation mapped to an energy field structure, in which ai denotes a feature and its value mapped into the energy field. The particles (features and their values) in the energy field interact with one another to generate a plurality of coupling energy fields e(ai, aj) and local energy fields h(ai), and different interaction energies arise depending on the distance relation between particles, thereby representing and integrating the network flow; the coupling energy set is the set of all possible coupling energies between a pair of features, and the local energy set is the set of all possible local energies associated with a feature;
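The energy calculation of step 4.5 can be sketched as follows, using the notation h(ai) and e(ai, aj) from the description of FIG. 4. The local and coupling energies are assumed to be given as lookup tables; the function name, the key layout, and the choice to treat missing entries as zero are assumptions for illustration.

```python
from itertools import combinations


def flow_energy(flow, local_energy, coupling_energy):
    """H(flow) = -(sum_i h_i(a_i) + sum_{i<j} e_ij(a_i, a_j)).

    local_energy maps (i, a) -> h and coupling_energy maps (i, j, a, b) -> e.
    Missing entries contribute 0 (an assumption for values never seen in training).
    """
    total = 0.0
    for i, a in enumerate(flow):
        total += local_energy.get((i, a), 0.0)
    for i, j in combinations(range(len(flow)), 2):
        total += coupling_energy.get((i, j, flow[i], flow[j]), 0.0)
    return -total


local_energy = {(0, 1): 0.5, (1, 2): 0.25}
coupling_energy = {(0, 1, 1, 2): 1.0}

energy_seen = flow_energy((1, 2), local_energy, coupling_energy)
energy_other = flow_energy((2, 2), local_energy, coupling_energy)
```

In this toy example the flow whose value combination has large local and coupling energies receives the lower total energy, so flows that resemble the training data fall below flows that do not.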

step 4.6, determining a threshold value; the threshold is determined from the network flow energies calculated in step 4.5: training uses only the normal flow samples in the data set, the distribution of their energy values is calculated, and the energy value of the sample at the 95th percentile of the energy distribution of the normal flow samples is used as the preset threshold.
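The threshold selection of step 4.6 can be sketched with a nearest-rank percentile, which returns the energy of an actual sample at the 95th-percentile position. The nearest-rank convention and the function name are assumptions; the text does not say how ties or fractional positions are handled.

```python
def percentile_threshold(energies, percent=95):
    """Return the energy of the sample at rank ceil(percent * n / 100) in sorted order."""
    ordered = sorted(energies)
    n = len(ordered)
    # Integer ceiling division avoids floating-point rounding issues.
    rank = max(1, (percent * n + 99) // 100)
    return ordered[rank - 1]


normal_energies = list(range(1, 101))  # toy energies 1 .. 100
threshold = percentile_threshold(normal_energies)
```

With this choice roughly 5% of normal training flows lie above the threshold, which sets the expected false-alarm rate on traffic resembling the training data.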

Step 5, inputting the flow to be detected into a flow classifier based on a discretization characteristic energy system, and determining the network flow property according to a threshold value;

step 5.1, the flow classifier monitors the network and waits for a network flow to be captured;

step 5.2, the feature extraction and fusion module obtains the flow statistical features after dimension reduction, the packet sequence features extracted by the multilayer perceptron, and the original byte features extracted by the convolutional neural network, and combines them into a mixed feature tuple;

step 5.3, replacing the value of each feature in the mixed feature tuple with its interval value after feature discretization;

step 5.4, for the captured network flow, calculating the probability that each feature takes its observed value, and calculating the joint probability that each pair of features takes its observed pair of values;

step 5.5, calculating the coupling energy according to the single-feature probability and the joint feature probability of step 4.2;

step 5.6, calculating the local field value of each feature according to the coupling energy of step 4.3;

step 5.7, calculating the energy of the network flow using the variables from the above steps, with the energy initialized to 0 and then computed as in step 4.5;

step 5.8, comparing the preset threshold c with the energy of the network flow under test; if the energy exceeds the threshold, the flow is classified as a malicious flow and intercepted; otherwise, the flow classifier releases the flow and waits for another flow.

The previous description is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims

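1. A network intrusion detection method based on a discretization characteristic energy system, characterized by comprising the following steps:

step 1, collecting normal network flow data, and dividing network flows according to quintuple information;

step 2, preprocessing network flow features;

step 3, discretizing the features by using a feature discretization module;

step 4, constructing a flow classifier based on the discretization characteristic energy system;

step 5, inputting the flow to be detected into the flow classifier based on the discretization characteristic energy system, and determining the nature of the network flow according to a threshold value.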

2. The network intrusion detection method based on the discretized characteristic energy system according to claim 1, wherein: in step 1, the normal network traffic is captured by the flow collector using the Wireshark tool, exists in PCAP format, and is stored after being divided into flows according to the quintuple information SrcIP, SrcPort, DstIP, DstPort, and Protocol.

3. The network intrusion detection method based on the discretized characteristic energy system according to claim 1, wherein: in step 2, the network flow characteristic preprocessing includes the following steps:

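step 2.1, inputting the network flow data into a feature extraction tool to obtain the flow statistical feature vector;

step 2.2, inputting the data packet size sequence into a multilayer perceptron network to extract the packet sequence features, and performing local feature amplification to obtain the packet sequence feature vector;

step 2.3, inputting the preprocessed original bytes of the network flow into a convolutional neural network to extract the original byte features, and performing local feature amplification to obtain the original byte feature vector;

step 2.4, combining the flow statistical features, the packet sequence features, and the original byte features to obtain a mixed feature tuple.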

4. The network intrusion detection method based on the discretized characteristic energy system according to claim 3, wherein: in step 2.1, the flow statistical feature extraction uses the feature extraction tool CICFlowMeter to extract features from the divided network flows and uses the XGBoost method to perform feature dimension reduction: the value of each feature is traversed in turn and evaluated with an objective function composed of a loss function and a regularization penalty term, and the feature points that minimize the objective function are found, thereby obtaining the flow statistical feature tuple; the objective function is shown in formula (1), in which the loss function describes the difference between the true value and the predicted value, the tree model term is the tree model generated for a sample in the current round of fitting, the first-order and second-order coefficients are the first derivative and the second derivative of the loss function, and the penalty function is determined by the number of leaves of the tree model, a learning rate parameter, the prediction of the decision tree for the input sample, a constant parameter that controls the size of the penalty term, and the predicted value of each leaf node of the decision tree:

(1);

in step 2.2, the packet sequence feature extraction uses a multilayer perceptron network to extract features from the data packet size sequence of the network flow and outputs a feature tuple; a linear mapping with local feature amplification, shown in formula (2), is applied between the fully connected layers to obtain an amplified feature tuple, which is added to the output feature tuple as an augmentation matrix:

(2).

5. The network intrusion detection method based on the discretized characteristic energy system according to claim 3, wherein: in step 2.3, the original byte feature extraction inputs the original bytes of the network flow into a convolutional neural network to extract a feature tuple; an amplified feature tuple obtained by local feature amplification is added to the extracted feature tuple as an augmentation matrix; the convolutional neural network structure comprises two convolutional layers, two pooling layers, and two fully connected layers; in step 2.4, the feature fusion combines the flow statistical features after dimension reduction, the packet sequence features extracted by the multilayer perceptron, and the original byte features extracted by the convolutional neural network into a mixed feature tuple.

6. The network intrusion detection method based on the discretized characteristic energy system according to claim 1, wherein: in step 3, the feature discretization forms an ordered array from the values of each feature to represent its global distribution; taking the minimum value and the maximum value of the feature as endpoints, the value range of each feature array is divided into equal parts, and two additional intervals are added, forming H valid feature intervals.

7. The network intrusion detection method based on the discretized characteristic energy system according to claim 1, wherein: in step 4, the construction of the flow classifier based on the discretization characteristic energy system comprises the following steps:

step 4.1, establishing a network flow-energy field system: using only the reconstructed features of the normal flows and their value tuples, each feature is mapped to a particle in an energy field, and all the features and their values together form the energy field;

step 4.2, a feature probability statistics module calculates the frequency of each feature value among all feature values and the frequency with which value combinations of multiple features occur together;

step 4.3, a Hamiltonian energy calculation module calculates the energy of the whole network flow, so as to obtain the energy characteristics of the normal samples and finally determine the threshold value.

8. The network intrusion detection method based on the discretized characteristic energy system according to claim 7, wherein: in step 4.1, the network flow energy field system is constructed by instantiating each normal network flow and its feature tuple: taking (A1, …, AN) as the feature N-tuple, flow k can be instantiated as (ak1, …, akN), where each aki takes its value from the set of possible values of the corresponding feature, and the size of the value range of every feature is the number of intervals; a system consisting of a plurality of feature nodes is established, in which the nodes are mutually associated, each node has a local energy, and the nodes have interactive coupling energy; in step 4.2, the network flow feature probability statistics module computes, as shown in formulas (3) and (4), the probability that a feature takes a given value and the joint probability that a pair of features takes a given pair of values:

(3)

(4);

in step 4.3, the values are the interval values after feature discretization; the covariance matrix obtained from the single-feature probabilities and the joint probabilities is shown in formula (5), the coupling energy in formula (6), and the local energy in formula (7):

(5)

(6)

(7)

as shown in formula (8), the negative of the sum of the local energies and the coupling energies represents the total energy of each flow, and the energy value of the sample at the 95th percentile of the energy distribution of the normal flow samples is taken as the preset threshold:

(8).

9. The network intrusion detection method based on the discretized characteristic energy system according to claim 1, wherein: in step 5, the flow classifier monitors the network and waits for a network flow to be captured; when a network flow is captured, its Hamiltonian energy is calculated according to formula (8) and compared with a preset threshold; if the energy exceeds the preset threshold, the network flow is classified as a malicious flow and intercepted; otherwise, it is classified as a normal flow and allowed to pass, and the flow classifier waits for the next flow.

10. A device applying the network intrusion detection method based on the discretized characteristic energy system according to claim 2, characterized by comprising a flow classifier and a flow collector.