CN107104978B - Network risk early warning method based on deep learning

Info

Publication number: CN107104978B
Application number: CN201710375043.1A
Authority: CN
Inventors: 赖洪昌
Original assignee: Individual
Current assignee: Individual
Priority date: 2017-05-24
Filing date: 2017-05-24
Publication date: 2019-12-24
Anticipated expiration: 2037-05-24
Also published as: CN107104978A

Abstract

The invention discloses a network risk early warning method based on deep learning, comprising the following steps: A1. collecting network-space asset risk sample data for an entire network segment and storing the sample data in a database; A2. extracting data from the database and performing distributed training of a convolutional neural network to form an initial risk prediction model; A3. inputting production data into the risk prediction model, evaluating the risk value of the production data, and raising an alarm if an early warning threshold is reached. With this method and equipment, security risk assessment and early warning can be performed for multiple target networks or for targets without obvious vulnerabilities, and the security state of a network can be assessed as a whole; response speed is improved and risk points are found quickly; at the same time, maintenance cost and labor are reduced.

Description

Network risk early warning method based on deep learning

Technical Field

The invention relates to network risk early warning technology, and in particular to a method and a system for network risk early warning within a regional scope based on machine deep learning.

Background

In the current network security field, whether a target is secure is determined through traditional means such as vulnerability scanning and port scanning. These means are effective for a single target with obvious vulnerabilities, but for batches of targets, or for targets without obvious vulnerabilities, they cannot obtain the security state quickly and comprehensively.

Disclosure of Invention

To solve the above problems, the invention provides a network risk early warning method and device based on deep learning, which can quickly and comprehensively obtain the security state of batches of targets or of targets without obvious vulnerabilities.

The invention provides a network risk early warning method based on deep learning, characterized by comprising the following steps: A1. collecting network-space asset risk sample data for an entire network segment and storing the sample data in a database; A2. extracting data from the database and performing distributed training of a Convolutional Neural Network (CNN) to form a risk prediction model; A3. inputting production data into the risk prediction model, evaluating the risk value of the production data, and raising an alarm if an early warning threshold is reached.

Preferably, step A1 includes: A11. determining risk elements and collecting network-space asset risk sample data for the entire network segment; A12. performing vulnerability scanning on the collected risk sample data and assigning security levels.

Further preferably, the risk elements include one or more of: target IP, open ports, server system type and version, server application type and version, existing vulnerabilities, database type and version, weak passwords, whether CDN acceleration is employed, and whether a firewall is employed.

Further preferably, the security levels are: high-risk, medium-risk, low-risk and safe; the samples of the four security levels are in the ratio 1:1:1:1, and the number of samples at each security level is greater than or equal to 5000.

Further preferably, step A1 further includes: A13. converting the collected network-space asset risk sample data into binary sample data that deep learning can recognize.

Still more preferably, step A13 includes: A131. rendering the sample as a picture and cropping it to a uniform size; A132. whitening the cropped picture.

Preferably, the distributed training of step A2 is performed by gradient descent, with an initial gradient of 10^-4.

Preferably, step A2 includes: A21. preparing a training environment, wherein the training environment uses TensorFlow in GPU mode; A22. extracting training sample data from the database and performing model training with a convolutional neural network to obtain a risk prediction model; A23. extracting test sample data from the database and performing an evaluation test on the risk prediction model.

Still more preferably, step A22 includes: A221. the model network structure uses 3 convolutional layers, wherein the first convolutional layer uses a 3 x 3 convolution kernel and the second convolutional layer uses a 2 x 2 convolution kernel, each convolutional layer is followed by a max-pooling layer, the convolutional layers are followed by two hidden layers and an output layer, and the convolutional layers have 32, 64 and 128 feature maps respectively; A222. performing regression with a softmax function, wherein the final output layer does not need softmax regression; A223. training with the training sample data to obtain an initial risk prediction model.

The invention also provides a computer-readable storage medium containing a computer program which is executed by a computer to implement the method as described above.

The beneficial effects of the invention are as follows. Network-space asset risk samples of the entire network segment are collected, distributed training is performed with a Convolutional Neural Network (CNN), and self-learning and adjustment are carried out by combining all local results with neural network analysis, yielding a comprehensive, integrated risk prediction model. The risk prediction model can perform security risk assessment and early warning for multiple target networks or for targets without obvious vulnerabilities, and can assess the security state of a network as a whole; response speed is increased, risk points are found quickly, and the efficiency and accuracy of network security situation analysis and prediction are improved; at the same time, maintenance cost and labor are reduced.

Further advantages are obtained in the further preferred embodiments. The greatest obstacle to using a CNN for network security assessment and early warning is the construction of learning samples for the application scenario. The invention restricts the risk elements of the risk samples to the following: target IP, open ports, server system type and version, server application type and version, existing vulnerabilities, database type and version, weak passwords, whether CDN acceleration is employed, and whether a firewall is employed. This saves CNN distributed training time and improves the accuracy of the security assessment and early warning results.

Drawings

Fig. 1 is a schematic flow chart of a deep learning-based network risk early warning method according to an embodiment of the present invention.

Fig. 2 is a schematic diagram of a convolutional neural network distributed training learning process according to an embodiment of the present invention.

Detailed Description

The present invention is described in further detail below with reference to specific embodiments and the accompanying drawings. It should be emphasized that the following description is only exemplary and is not intended to limit the scope or application of the present invention.

As shown in Fig. 1, the embodiment provides a network risk early warning method based on deep learning, which includes the following steps:

Step 1, collecting network-space asset risk sample data for the entire network segment.

Step 1-1, establishing a database of network-space asset risk sample data and identifying the risk points of the assets to determine the risk elements, wherein the risk elements comprise: target IP, open ports, server system type and version, server application type and version, existing vulnerabilities, database type and version, weak passwords, whether CDN acceleration is employed, and whether a firewall is employed. Network-space asset risk sample data of the entire network segment is then collected according to these risk elements.

Forming the sample data from the risk elements that may have serious consequences for network security helps ensure that the predictions of the later deep learning stage are realistic and reliable. Some risk elements appear harmless on their own but can create fatal vulnerabilities when combined.
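As an illustration of how one collected sample might be held in memory, the following minimal sketch models the nine risk elements as a record. The class name, field names, serialization format and example values are all hypothetical; the patent does not specify a schema.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class RiskSample:
    """One asset described by the nine risk elements (field names are hypothetical)."""
    target_ip: str
    open_ports: List[int]
    os_type_version: str            # server system type and version
    app_type_version: str           # server application type and version
    vulnerabilities: List[str]      # identifiers of existing vulnerabilities
    db_type_version: Optional[str]  # None when no database was detected
    weak_password: bool
    uses_cdn: bool
    has_firewall: bool

    def to_text(self) -> str:
        """Serialize the elements in a fixed order so equal assets give equal text."""
        parts = [
            self.target_ip,
            ",".join(str(p) for p in sorted(self.open_ports)),
            self.os_type_version,
            self.app_type_version,
            ",".join(sorted(self.vulnerabilities)),
            self.db_type_version or "",
            str(int(self.weak_password)),
            str(int(self.uses_cdn)),
            str(int(self.has_firewall)),
        ]
        return "|".join(parts)


# Illustrative values only; 192.0.2.10 is a documentation-only address.
example = RiskSample(
    target_ip="192.0.2.10",
    open_ports=[443, 22],
    os_type_version="Ubuntu 16.04",
    app_type_version="nginx 1.10",
    vulnerabilities=["EXAMPLE-VULN-1"],
    db_type_version="MySQL 5.7",
    weak_password=True,
    uses_cdn=False,
    has_firewall=True,
)
```

Keeping all nine elements in one record makes it straightforward to examine combinations of elements rather than each element in isolation.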

The network-space risk sample data is collected as follows: sample collection is completed using techniques for detecting the service types and version information running on target network hosts, techniques for detecting information such as operating system and device type, techniques for identifying security vulnerabilities of the target host, and techniques for identifying CDNs (content delivery networks) and firewalls; distributed techniques are used to keep the collected samples up to date.

Step 1-2, performing vulnerability scanning on the collected risk sample data and dividing it into four security levels: high-risk, medium-risk, low-risk and safe. The samples of the four security levels are in the ratio 1:1:1:1, and the number of samples at each security level is greater than or equal to 5000.
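The two constraints on the labelled sample set (a 1:1:1:1 ratio and at least 5000 samples per level) can be checked mechanically. The sketch below is one way to do so; the function name and label strings are assumptions made for illustration.

```python
from collections import Counter

LEVELS = ("high-risk", "medium-risk", "low-risk", "safe")
MIN_PER_LEVEL = 5000  # lower bound stated in the text


def check_balance(labels, min_per_level=MIN_PER_LEVEL):
    """Return True when the labels are in a 1:1:1:1 ratio with enough samples each."""
    counts = Counter(labels)
    unknown = set(counts) - set(LEVELS)
    if unknown:
        raise ValueError(f"unknown security levels: {sorted(unknown)}")
    per_level = [counts.get(level, 0) for level in LEVELS]
    balanced = len(set(per_level)) == 1          # equal counts means 1:1:1:1
    enough = min(per_level) >= min_per_level     # every level reaches the minimum
    return balanced and enough
```

For example, `check_balance(list(LEVELS) * 5000)` passes, while a set with only 4999 samples per level, or with one extra high-risk sample, does not.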

By dividing samples into security levels, each level is associated with specified insecurity factors and the maximum loss that a vulnerability may cause. A user can thus get a preliminary grasp of the classification a vulnerability belongs to and the loss it may cause, and take specific defensive measures to reduce exposure to network risk.

Step 1-3, converting the network-space asset risk sample data into binary sample data that deep learning can recognize. Data from the collection nodes is aggregated on a control server, cleaned, and then stored.

The task of data cleaning is to filter out data that does not meet requirements, mainly incomplete data, erroneous data and duplicate data.
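A minimal sketch of such a cleaning pass is shown below. The record layout (a dictionary with `target_ip`, `open_ports`, `os` and `app` keys) and the specific validity rules are assumptions chosen for illustration; the patent names only the three categories of data to remove.

```python
import ipaddress

REQUIRED_FIELDS = ("target_ip", "open_ports", "os", "app")  # hypothetical schema


def is_valid_ip(text):
    """True when text parses as an IPv4 or IPv6 address."""
    try:
        ipaddress.ip_address(text)
        return True
    except ValueError:
        return False


def clean(records):
    """Drop incomplete, erroneous and duplicate records, preserving order."""
    seen = set()
    cleaned = []
    for rec in records:
        # Incomplete: a required field is missing or empty.
        if any(rec.get(name) in (None, "", []) for name in REQUIRED_FIELDS):
            continue
        # Erroneous: malformed address or port outside 1..65535.
        if not is_valid_ip(rec["target_ip"]):
            continue
        if not all(isinstance(p, int) and 0 < p < 65536 for p in rec["open_ports"]):
            continue
        # Duplicate: same identifying values as an earlier record.
        key = (rec["target_ip"], tuple(sorted(rec["open_ports"])), rec["os"], rec["app"])
        if key in seen:
            continue
        seen.add(key)
        cleaned.append(rec)
    return cleaned
```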

Since most of the result data in the database is text or numbers with many possible combinations, quantifying the sample parameters is very difficult and it is hard to form a model for deep learning; the sample data is therefore rendered as pictures.
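The text does not say how a record is turned into a picture. One possible encoding, shown purely as an assumption, maps the UTF-8 bytes of a serialized record to grey levels and pads the rest of a square grid with zeros. The function name and the default side length of 128 are hypothetical.

```python
def record_to_picture(text, side=128):
    """Encode text as a side x side grid of grey levels (0-255), zero-padded.

    This byte-to-pixel mapping is an illustrative assumption, not the
    encoding used by the patent, which leaves the conversion unspecified.
    """
    data = text.encode("utf-8")
    if len(data) > side * side:
        raise ValueError("record does not fit in the picture")
    padded = data + bytes(side * side - len(data))   # pad with zero bytes
    # Slice the flat byte string into rows; list() turns bytes into ints.
    return [list(padded[r * side:(r + 1) * side]) for r in range(side)]
```

With `side=4`, the text `"AB"` becomes a 4 x 4 grid whose first row is `[65, 66, 0, 0]` and whose remaining rows are all zeros.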

1) Sample picture processing: the sample pictures are uniformly cropped to a 100x100 pixel size, with the center region cropped for evaluation or randomly cropped for training.

2) Approximate whitening is applied to the picture, so that the model is insensitive to changes in the picture's dynamic range.

Step 2, extracting data from the database and performing distributed training of the convolutional neural network to form a risk prediction model.
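The cropping and approximate whitening of step 1-3 can be sketched as follows. Reading "approximate whitening" as per-image standardization (subtract the mean, divide by a standard deviation that is clamped away from zero) is an assumption, and the pictures are represented here as plain nested lists for simplicity.

```python
import math
import random


def center_crop(img, size=100):
    """Crop the central size x size region (used for evaluation)."""
    top = (len(img) - size) // 2
    left = (len(img[0]) - size) // 2
    return [row[left:left + size] for row in img[top:top + size]]


def random_crop(img, size=100, rng=random):
    """Crop a randomly placed size x size region (used for training)."""
    top = rng.randint(0, len(img) - size)
    left = rng.randint(0, len(img[0]) - size)
    return [row[left:left + size] for row in img[top:top + size]]


def approx_whiten(img):
    """Scale a picture to zero mean and, unless it is near-constant, unit variance."""
    pixels = [float(p) for row in img for p in row]
    n = len(pixels)
    mean = sum(pixels) / n
    variance = sum((p - mean) ** 2 for p in pixels) / n
    # Clamp the divisor so that a constant picture does not divide by zero.
    std = max(math.sqrt(variance), 1.0 / math.sqrt(n))
    return [[(p - mean) / std for p in row] for row in img]
```

Because every picture is rescaled by its own mean and spread, two pictures that differ only in overall brightness or contrast give nearly the same input to the network.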

A good learning model improves not only the learning speed but also the accuracy of the learning result; the number of samples must also be considered. Taking these together, the CNN model is currently the most suitable deep learning model. This step trains on pictures, exploiting the fact that convolutional neural networks are good at picture recognition.

The samples for the learning model are divided into two types: training samples and test samples. Training samples are the sample data used in the debugging and training stages; they are used to adjust the functions and methods of deep learning so as to steer the final result in the correct direction. Test samples are the sample data used in the evaluation stage; they verify whether the accuracy is sufficient for network risk assessment and early warning. The independent and dependent variables are known for both types of samples.
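One way to divide labelled samples into training and test sets while preserving the 1:1:1:1 balance is a stratified split, sketched below. The 20% test fraction, the fixed seed and the `(features, label)` tuple layout are assumptions; the patent does not give a split ratio.

```python
import random
from collections import defaultdict


def stratified_split(samples, test_fraction=0.2, seed=0):
    """Split (features, label) pairs so each label keeps the same test share."""
    by_label = defaultdict(list)
    for sample in samples:
        by_label[sample[1]].append(sample)
    rng = random.Random(seed)          # fixed seed keeps the split reproducible
    train, test = [], []
    for label in sorted(by_label):
        group = by_label[label]
        rng.shuffle(group)
        n_test = int(round(len(group) * test_fraction))
        test.extend(group[:n_test])
        train.extend(group[n_test:])
    return train, test
```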

From the divided samples, a risk prediction model is formed by machine training; the process is shown in Fig. 2.

Step 2-1, preparing a training environment. The training environment uses TensorFlow in GPU mode; a GPU computes faster than a CPU, which reduces the time cost of the training process.

Step 2-2, the model training stage. Model training is performed in the training environment using the training samples prepared in step 2, in combination with a convolutional neural network. The training process is as follows:

1) The model network structure is defined with 3 convolutional layers: the first convolutional layer uses a 3 x 3 convolution kernel and the second convolutional layer uses a 2 x 2 convolution kernel; each convolutional layer is followed by a max-pooling layer; the convolutional layers are followed by two hidden layers and an output layer; the convolutional layers have 32, 64 and 128 feature maps respectively.

2) Regression is performed using the softmax function, and the final output layer does not need the softmax function regression.

3) After the model network structure is defined, training is performed to obtain an initial risk prediction model.
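To make the network structure of item 1) concrete, the sketch below traces the tensor shape and parameter count through each layer for a 100 x 100 single-channel input. Several values are assumptions because the text does not give them: the third kernel size (taken as 2 x 2), unpadded stride-1 convolutions, 2 x 2 max pooling with stride 2, and hidden layers of 512 and 256 units.

```python
CONV_LAYERS = [
    # (kernel size, feature maps)
    (3, 32),    # first layer: 3 x 3 kernel, from the text
    (2, 64),    # second layer: 2 x 2 kernel, from the text
    (2, 128),   # third layer: kernel size not given in the text; 2 is an assumption
]
HIDDEN_UNITS = [512, 256]   # two hidden layers; sizes are assumptions
NUM_CLASSES = 4             # safe, low-risk, medium-risk, high-risk


def trace_network(height=100, width=100, channels=1):
    """Return (layer name, output shape, parameter count) for every layer."""
    layers = []
    h, w, c = height, width, channels
    for i, (k, maps) in enumerate(CONV_LAYERS, start=1):
        params = k * k * c * maps + maps         # weights plus one bias per map
        h, w, c = h - k + 1, w - k + 1, maps     # unpadded, stride-1 convolution
        layers.append((f"conv{i}", (h, w, c), params))
        h, w = h // 2, w // 2                    # 2 x 2 max pooling, stride 2
        layers.append((f"pool{i}", (h, w, c), 0))
    units = h * w * c
    layers.append(("flatten", (units,), 0))
    for i, n in enumerate(HIDDEN_UNITS + [NUM_CLASSES], start=1):
        name = f"hidden{i}" if i <= len(HIDDEN_UNITS) else "output"
        layers.append((name, (n,), units * n + n))
        units = n
    return layers


for name, shape, params in trace_network():
    print(f"{name:8s} {str(shape):16s} {params}")
```

Under these assumptions, almost all of the parameters sit in the first hidden layer, so the choice of hidden layer size dominates the model's memory footprint.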

Accuracy is optimized by gradient descent, with an initial gradient of 10^-4. Distributed CNN training is then performed: linear regression is carried out on the training data by gradient descent until a balanced state is reached, the factors with a large influence on the training result are identified, and distributed training is performed with this data as the CNN input. Data-parallel distributed training keeps a copy of the model on each GPU worker node, processes a different part of the data on each node, combines the results of the worker nodes, and synchronizes the model parameters among the nodes; this speeds up data training and model building.
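The data-parallel scheme described above can be illustrated on a toy problem. In the sketch below each "worker" is simply a list of data points, the model is a one-variable linear regression rather than a CNN, and the value 10^-4 is read as the initial step size of gradient descent; all three are simplifying assumptions.

```python
def mse(w, b, data):
    """Mean squared error of the model y = w*x + b on data."""
    return sum((w * x + b - y) ** 2 for x, y in data) / len(data)


def shard_gradient(w, b, shard):
    """Gradient of the mean squared error on one worker's shard."""
    n = len(shard)
    gw = sum(2 * (w * x + b - y) * x for x, y in shard) / n
    gb = sum(2 * (w * x + b - y) for x, y in shard) / n
    return gw, gb


def averaged_gradient(w, b, shards):
    """Combine the workers' results by averaging their gradients."""
    grads = [shard_gradient(w, b, shard) for shard in shards]  # one per worker
    gw = sum(g[0] for g in grads) / len(grads)
    gb = sum(g[1] for g in grads) / len(grads)
    return gw, gb


def parallel_step(w, b, shards, step=1e-4):
    """One synchronous update; every worker then holds the same (w, b)."""
    gw, gb = averaged_gradient(w, b, shards)
    return w - step * gw, b - step * gb
```

When the shards are of equal size, the averaged gradient equals the gradient computed over all the data at once, so the parallel update matches the single-node update while each worker processes only its own part of the data.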

Step 2-3, evaluating the model. Using the test samples prepared in step 2, an evaluation test is performed in the training environment on the initial risk prediction model obtained in step 2-2 to determine whether its accuracy is acceptable. The test method is as follows: the test samples are input into the initial risk prediction model, and the output is checked against the expected result. If they match, the model is put into production for network risk early warning. If not, the process returns to step 2-2 for algorithm optimization until the output matches the expected result. There are four types of output: safe, low-risk, medium-risk and high-risk.
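The evaluation test can be sketched as follows, assuming the output layer emits four raw scores (consistent with the statement that the output layer itself does not apply softmax) and that softmax is applied afterwards to obtain class probabilities. The ordering of the levels and the 0.9 acceptance threshold are hypothetical.

```python
import math

LEVELS = ("safe", "low-risk", "medium-risk", "high-risk")  # assumed output order
MIN_ACCURACY = 0.9   # hypothetical acceptance threshold; not given in the text


def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(scores)                       # subtract the maximum for stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def predict_level(scores):
    """Return the security level with the highest probability."""
    probs = softmax(scores)
    return LEVELS[probs.index(max(probs))]


def accuracy(all_scores, expected_levels):
    """Fraction of test samples whose predicted level matches the expected one."""
    hits = sum(predict_level(s) == e for s, e in zip(all_scores, expected_levels))
    return hits / len(expected_levels)


def is_qualified(all_scores, expected_levels, min_accuracy=MIN_ACCURACY):
    """True when the model may go to production; otherwise return to step 2-2."""
    return accuracy(all_scores, expected_levels) >= min_accuracy
```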

The accuracy of the risk prediction model established by this method varies with the risk elements that make up the sample data. Depending on the risk elements selected, both the time needed to form the risk prediction model and its accuracy differ; the results are as follows:

From the above table it can be seen that when the risk elements comprise the target IP, open ports, server system type and version, server application type and version, existing vulnerabilities, database type and version, weak passwords, whether CDN acceleration is employed, and whether a firewall is employed, the accuracy of the risk prediction model is high and the learning time is short; when any one of these risk elements is missing, the result loses accuracy.

When risk elements beyond these are included, experimental results show that the learning time is long, the risk prediction model takes a long time to form, and the cost is high. Choosing the appropriate risk elements (target IP, open ports, server system type and version, server application type and version, existing vulnerabilities, database type and version, weak passwords, whether CDN acceleration is employed, and whether a firewall is employed) keeps the learning time short and the accuracy of the resulting risk prediction model high; that is, the method is both fast and accurate.

Step 3, inputting production data into the risk prediction model, evaluating its risk value, and raising an alarm if an early warning threshold is reached.
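The text does not define how the risk value is computed from the model output. As one possible reading, the sketch below takes the probability-weighted average of a per-level score and compares it with a threshold. The weights, the threshold of 0.6 and the function names are all hypothetical.

```python
# Hypothetical score per level, from 0 (safe) to 1 (high-risk).
RISK_WEIGHTS = {"safe": 0.0, "low-risk": 1 / 3, "medium-risk": 2 / 3, "high-risk": 1.0}
ALERT_THRESHOLD = 0.6   # hypothetical early warning threshold


def risk_value(probabilities):
    """Probability-weighted risk score in the range 0 to 1."""
    return sum(RISK_WEIGHTS[level] * p for level, p in probabilities.items())


def check_asset(target, probabilities, threshold=ALERT_THRESHOLD):
    """Evaluate one production asset and flag it when the threshold is reached."""
    value = risk_value(probabilities)
    return {"target": target, "risk_value": value, "alert": value >= threshold}
```

An asset predicted as high-risk with probability 1 scores 1.0 and triggers an alarm; an asset predicted as safe with probability 1 scores 0.0 and does not.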

The foregoing is a more detailed description of the invention in connection with specific/preferred embodiments and is not intended to limit the practice of the invention to those descriptions. It will be apparent to those skilled in the art that various substitutions and modifications can be made to the described embodiments without departing from the spirit of the invention, and these substitutions and modifications should be considered to fall within the scope of the invention.

Claims

1. A network risk early warning method based on deep learning is characterized by comprising the following steps:

A1. collecting network-space asset risk sample data for an entire network segment, and storing the sample data in a database;

A2. extracting data from the database, and performing distributed training of a convolutional neural network to form a risk prediction model;

A3. inputting production data into the risk prediction model, evaluating the risk value of the production data, and raising an alarm if an early warning threshold is reached;

wherein step A1 includes:

A11. determining risk elements, and collecting network-space asset risk sample data for the entire network segment;

A12. performing vulnerability scanning on the collected risk sample data, and assigning security levels.

2. The method of claim 1, wherein the risk elements comprise one or more of: target IP, open ports, server system type and version, server application type and version, existing vulnerabilities, database type and version, weak passwords, whether CDN acceleration is employed, and whether a firewall is employed.

3. The method of claim 1, wherein the security levels are: high-risk, medium-risk, low-risk and safe; the samples of the four security levels are in the ratio 1:1:1:1, and the number of samples at each security level is greater than or equal to 5000.

4. The method of claim 1, wherein said step A1 further comprises:

A13. converting the network-space asset risk sample data into binary sample data that deep learning can recognize.

5. The method of claim 4, wherein said step A13 comprises:

A131. rendering the sample as a picture, and cropping it to a uniform size;

A132. whitening the cropped picture.

6. The method of claim 1, wherein the distributed training of step A2 is performed by gradient descent, with an initial gradient of 10^-4.

7. The method of claim 1, wherein said step A2 comprises:

A21. preparing a training environment, wherein the training environment uses TensorFlow in GPU mode;

A22. extracting training sample data from the database, and performing model training with a convolutional neural network to obtain an initial risk prediction model;

A23. extracting test sample data from the database, and performing an evaluation test on the initial risk prediction model.

8. The method of claim 7, wherein said step A22 comprises:

A221. the model network structure adopts 3 convolution layers, wherein the first convolution layer adopts a convolution kernel of 3 x 3, the second convolution layer adopts a convolution kernel of 2 x 2, each convolution layer is followed by a maximum pooling layer, and then followed by two hidden layers and an output layer, and the feature maps of each convolution layer respectively adopt 32, 64 and 128;

A222. performing regression by using a softmax function, wherein the final output layer does not need softmax regression;

A223. training is carried out by using training sample data to obtain an initial risk prediction model.

9. A computer-readable storage medium containing a computer program, the computer program being executable by a computer to perform the method of any one of claims 1 to 8.