Disclosure of Invention
The invention aims to provide a hybrid anomaly detection method based on an adversarial autoencoder, which improves the accuracy of anomaly detection.
In order to achieve the above object, the present invention provides a hybrid anomaly detection method based on an adversarial autoencoder, comprising the following steps:
improving an adversarial autoencoder model, and extracting features of noise-added input data by using the improved adversarial autoencoder model;
performing weighted fusion on the two groups of extracted feature vectors to obtain fused feature vectors;
taking the fused feature vectors as training data, and jointly training an iForest classifier, an LOF classifier and a K-means classifier in an ensemble learning manner to obtain a detection classifier;
and extracting two groups of feature vectors from a test set by using the improved adversarial autoencoder model, fusing the two groups of feature vectors, and inputting the fused feature vectors into the detection classifier to obtain an anomaly detection result.
Wherein improving the adversarial autoencoder model and extracting the features of the noise-added input data by using the improved adversarial autoencoder model comprises the following steps:
using a LeakyReLU function as the activation function of the first three convolutional layers and a Tanh function as the activation function of the fourth convolutional layer in each of the two encoders of the adversarial autoencoder model, forming an autoencoder from the generator and the first encoder, and improving the activation function and output mapping function of the discriminator;
normalizing the acquired data set, and adding noise to the partitioned training set;
and training the improved adversarial autoencoder model, and extracting two groups of features from the training set.
Training the improved adversarial autoencoder model and extracting two groups of features from the training set comprises:
inputting the noise-added training set into the improved adversarial autoencoder model, performing feature extraction with a first encoder, and reconstructing the training set with the generator, which takes the extracted first group of feature vectors as input;
and performing feature extraction on the reconstructed training set by using a second encoder to obtain a second group of feature vectors.
Wherein performing weighted fusion on the two groups of extracted feature vectors to obtain the fused feature vectors comprises:
multiplying the first group of feature vectors by a weighting coefficient, multiplying the second group of feature vectors by one minus the weighting coefficient, adding the two products, and attaching the corresponding labels to obtain the fused feature vectors.
Taking the fused feature vectors as training data and jointly training the iForest classifier, the LOF classifier and the K-means classifier in an ensemble learning manner to obtain the detection classifier comprises the following steps:
serializing the three classifiers by using the AdaBoost algorithm, inputting the fused feature vectors as a training set into the iForest classifier for training, adjusting the weight distribution to obtain a weight coefficient, and passing the training set to the next classifier until the iForest classifier, the LOF classifier and the K-means classifier have all been trained, then combining all the weight coefficients to obtain the detection classifier.
Wherein extracting two groups of feature vectors from the test set by using the improved adversarial autoencoder model, fusing them and inputting the result into the detection classifier to obtain an anomaly detection result comprises the following steps:
adding noise to the partitioned test set, and extracting two groups of feature vectors by using the first encoder and the second encoder of the improved adversarial autoencoder model;
and performing weighted fusion on the two groups of feature vectors, and inputting the fused feature vectors into the detection classifier to obtain an anomaly detection result for each test sample.
The hybrid anomaly detection method based on an adversarial autoencoder according to the invention first improves an adversarial autoencoder model and extracts features of noise-added input data with the improved model; the two groups of extracted features are then fused by weighting; the fused feature vectors are used as training data to jointly train an iForest classifier, an LOF classifier and a K-means classifier in an ensemble learning manner, yielding a detection classifier; finally, two groups of feature vectors are extracted from the test set with the improved adversarial autoencoder model, fused, and input into the detection classifier to obtain the anomaly detection result. Compared with the prior art, combining the adversarial autoencoder with traditional anomaly detection methods allows anomalies in the data set to be detected more accurately, improving the accuracy of anomaly detection.
Detailed Description
Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar elements or to elements having the same or similar function throughout. The embodiments described below with reference to the drawings are illustrative, are intended to explain the invention, and are not to be construed as limiting the invention.
In the description of the present invention, "a plurality" means two or more unless specifically defined otherwise.
Referring to fig. 1 to 3, the present invention provides a hybrid anomaly detection method based on an adversarial autoencoder, comprising the following steps:
S101, improving an adversarial autoencoder model, and extracting features of noise-added input data by using the improved adversarial autoencoder model.
Specifically, an improved adversarial autoencoder model is constructed to extract features of the input data. The model is an improved DCGAN comprising two encoders E1 and E2, a generator G and a discriminator D. The two encoders E1 and E2 adopt the same structure: four convolutional layers are used for dimensionality reduction, the second and third layers use Batch Normalization, the activation function of the first three layers is LeakyReLU, and the activation function of the last layer is Tanh. The generator G and the first encoder E1 form an autoencoder; the generator has four transposed-convolution layers, the first three of which use Batch Normalization with LeakyReLU activations, and the last of which uses Tanh as its activation function. The discriminator D has a network structure similar to that of the first encoder E1: a four-layer convolutional neural network in which the second and third layers use Batch Normalization, the activation function of the first three layers is LeakyReLU, and the last layer uses Sigmoid as its output mapping function, as shown in figs. 4 to 6.
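For illustration only, the four-layer structure just described can be sketched in PyTorch as follows; the input size (1 x 32 x 32), the channel widths and the latent dimension are assumptions made for this sketch, not values specified by the invention.

```python
# Minimal sketch of the improved adversarial-autoencoder building blocks:
# two encoders E1/E2, a generator G and a discriminator D.
import torch
import torch.nn as nn

class Encoder(nn.Module):
    """Four conv layers; BatchNorm on layers 2-3, LeakyReLU on layers 1-3, Tanh on layer 4."""
    def __init__(self, latent_dim=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, 32, 4, 2, 1), nn.LeakyReLU(0.2),
            nn.Conv2d(32, 64, 4, 2, 1), nn.BatchNorm2d(64), nn.LeakyReLU(0.2),
            nn.Conv2d(64, 128, 4, 2, 1), nn.BatchNorm2d(128), nn.LeakyReLU(0.2),
            nn.Conv2d(128, latent_dim, 4, 1, 0), nn.Tanh(),
        )
    def forward(self, x):                  # (B, 1, 32, 32) -> (B, latent_dim)
        return self.net(x).flatten(1)

class Generator(nn.Module):
    """Four transposed-conv layers; BatchNorm + LeakyReLU on layers 1-3, Tanh output."""
    def __init__(self, latent_dim=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose2d(latent_dim, 128, 4, 1, 0), nn.BatchNorm2d(128), nn.LeakyReLU(0.2),
            nn.ConvTranspose2d(128, 64, 4, 2, 1), nn.BatchNorm2d(64), nn.LeakyReLU(0.2),
            nn.ConvTranspose2d(64, 32, 4, 2, 1), nn.BatchNorm2d(32), nn.LeakyReLU(0.2),
            nn.ConvTranspose2d(32, 1, 4, 2, 1), nn.Tanh(),
        )
    def forward(self, z):                  # (B, latent_dim) -> (B, 1, 32, 32)
        return self.net(z.unsqueeze(-1).unsqueeze(-1))

class Discriminator(nn.Module):
    """Four conv layers; BatchNorm on layers 2-3, LeakyReLU features, Sigmoid output mapping."""
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 32, 4, 2, 1), nn.LeakyReLU(0.2),
            nn.Conv2d(32, 64, 4, 2, 1), nn.BatchNorm2d(64), nn.LeakyReLU(0.2),
            nn.Conv2d(64, 128, 4, 2, 1), nn.BatchNorm2d(128), nn.LeakyReLU(0.2),
        )
        self.head = nn.Sequential(nn.Conv2d(128, 1, 4, 1, 0), nn.Sigmoid())
    def forward(self, x):                  # returns (real/fake score, intermediate features)
        f = self.features(x)
        return self.head(f).flatten(1), f.flatten(1)
```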
The data set is normalized into single-channel or three-channel pictures of the same size and split into a training set and a test set at a ratio of 4:1, and the improved adversarial autoencoder model is trained on the training set. Noise is added to the training set before training so that features representing the input data can be extracted better. The parameters of each layer of the network are updated with the Adam algorithm, and the number of iterations is 200 epochs. The generator G and the discriminator D are trained first; after their training is completed, the encoders E1 and E2 are trained using the noise-added data. Three loss functions are involved:
L_enc = ||z_1 - z_2||_2
L_con = ||x - x'||_1
L_adv = ||f(x) - f(x')||_2
The specific training process is as follows: m noise-added single-channel grayscale images or noise-added RGB three-channel images of the same pixel size are input into the adversarial autoencoder model. The noise-added training set is denoted S = {(x_1, y_1), ..., (x_m, y_m)}, where x_i represents noise-added n-dimensional data and y_i is its data label. After convolutional feature extraction by the first encoder E1, each picture is represented as a first group of feature vectors Z_1 = {(z_11, y_1), ..., (z_1m, y_m)}, which is output and stored. The generator G takes these feature vectors as input to reconstruct the training set, outputting the reconstructed data S' = {(x_1', y_1), ..., (x_m', y_m)}. The discriminator D is responsible for ensuring that the reconstructed data is as similar as possible to the original data. The second encoder E2 then re-extracts features from the reconstructed data, giving a second group of feature vectors Z_2 = {(z_21, y_1), ..., (z_2m, y_m)}, which is also stored.
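A minimal sketch of one encoder-training iteration, reusing the classes above, is given below. The Gaussian noise scale, the equal weighting of the three losses, the squared-error proxies for the norms, and the use of the discriminator's intermediate features as f(x) are illustrative assumptions rather than values fixed by the invention.

```python
import torch
import torch.nn.functional as F

E1, E2, G, D = Encoder(), Encoder(), Generator(), Discriminator()
# G and D are assumed to have been trained beforehand; only E1 and E2 are updated here.
opt = torch.optim.Adam(list(E1.parameters()) + list(E2.parameters()), lr=2e-4)

def train_step(x):                            # x: batch of images normalized to [-1, 1]
    x_noisy = x + 0.1 * torch.randn_like(x)   # noise-adding step (assumed Gaussian)
    z1 = E1(x_noisy)                          # first group of feature vectors Z1
    x_rec = G(z1)                             # reconstructed training data x'
    z2 = E2(x_rec)                            # second group of feature vectors Z2
    _, f_x = D(x)                             # intermediate discriminator features f(x)
    _, f_rec = D(x_rec)                       # f(x')
    L_enc = F.mse_loss(z1, z2)                # squared-L2 proxy for ||z1 - z2||_2
    L_con = F.l1_loss(x_rec, x)               # ||x - x'||_1
    L_adv = F.mse_loss(f_rec, f_x)            # squared-L2 proxy for ||f(x) - f(x')||_2
    loss = L_enc + L_con + L_adv
    opt.zero_grad(); loss.backward(); opt.step()
    return z1.detach(), z2.detach()
```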
S102, performing weighted fusion on the two groups of extracted feature vectors to obtain fused feature vectors.
Specifically, the two groups of feature vectors obtained in S101, Z_1 = {(z_11, y_1), ..., (z_1m, y_m)} and Z_2 = {(z_21, y_1), ..., (z_2m, y_m)}, are fused pairwise by weighting:

z_3i = λ·z_1i + (1 - λ)·z_2i, i = 1, 2, ..., m

where λ is a weighting coefficient chosen between 0 and 1 according to the actual situation.

The original labels y_1, ..., y_m are attached to the resulting vectors z_31, ..., z_3m to form a new group of feature vectors Z_3 = {(z_31, y_1), ..., (z_3m, y_m)}.
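A short sketch of this fusion step follows; the variable names and the example value of λ are illustrative.

```python
import numpy as np

def fuse_features(Z1, Z2, lam=0.5):
    """Z1, Z2: arrays of shape (m, d) holding the two groups of feature vectors."""
    return lam * Z1 + (1.0 - lam) * Z2        # z3_i = lam * z1_i + (1 - lam) * z2_i

# Example: the fused vectors keep their original labels y.
# Z3 = fuse_features(Z1, Z2, lam=0.6); training_set = list(zip(Z3, y))
```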
S103, taking the fused feature vectors as training data, and jointly training an iForest classifier, an LOF classifier and a K-means classifier in an ensemble learning manner to obtain a detection classifier.
Specifically, the three classifiers are serialized using the AdaBoost algorithm: the fused feature vectors are input as a training set into the iForest classifier for training, the weight distribution is adjusted to obtain a weight coefficient, and the training set is passed to the next classifier until the iForest, LOF and K-means classifiers have all been trained; all the weight coefficients are then combined to obtain the detection classifier, as shown in fig. 7.

S31: The fused feature data Z_3 produced by the trained model is imported into the AdaBoost detection model. First, the weight distribution of the training data is initialized as W_1 = (w_11, ..., w_1m), with w_1i = 1/m for i = 1, 2, ..., m, so that each training sample is given the same weight; the fused feature vectors serve as the training set.
S32: The training set is used to train iForest, i.e. classifier h_1(z). The number of iTrees is tuned by cross-validation and the anomaly ratio is set according to the sample labels so that the loss function e_1 is minimized; the error e_1 of h_1(z) is then computed. If e_1 > 0.5, this classifier is skipped and training proceeds directly to the next classifier; otherwise a weight coefficient α_1 is calculated and the weight distribution is updated to W_2 = (w_21, ..., w_2m), with i = 1, 2, ..., m, where f(z) denotes the distribution function of the raw data used in the update.
S33: Training proceeds to the next classifier, LOF, i.e. classifier h_2(z), which is trained on the training set. The value of K in the LOF algorithm is tuned by cross-validation to minimize the loss function e_2, and the LOF threshold t_1 is obtained as (t_1', k) = argmin_(t_1', k) e_2, i.e. the t_1' that minimizes the loss function e_2 is recorded as t_1. A sample is judged abnormal by comparing its anomaly score with the threshold t_1, and the error e_2 of h_2(z) is computed. If e_2 > 0.5, this classifier is skipped and training proceeds directly to the next classifier; otherwise a weight coefficient α_2 is calculated and the weight distribution is updated to W_3 = (w_31, ..., w_3m), with i = 1, 2, ..., m, where f(z) denotes the distribution function of the raw data used in the update.
S34: Training proceeds to the last classifier, K-means, i.e. classifier h_3(z). The number of clusters k in the K-means algorithm is tuned by cross-validation, and the relative distance D of each sample to its cluster center is used as the anomaly score so as to minimize the error e_3. K-means is trained to obtain a threshold t_2 as (D, k) = argmin_(D, k) e_3, i.e. the D that minimizes the loss function e_3 is recorded as t_2. A sample is judged abnormal by comparing its relative distance D with the threshold t_2, and the error e_3 of h_3(z) is computed. If e_3 > 0.5, this classifier is skipped; otherwise a weight coefficient α_3 is calculated. The three classifiers are then linearly combined to obtain the final detection classifier, i.e. the final strong classifier h(z).
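The serialized training of S31 to S34 might look roughly like the following sketch, which is not the patented implementation. It assumes binary labels y in {0, 1} (1 = anomaly) for the fused features Z_3, uses the standard AdaBoost weight formula α = 0.5·ln((1 - e)/e), replaces cross-validated hyperparameters with fixed example values, thresholds the K-means distance at a quantile, and fits each detector unweighted (only the errors and the coefficients α use the boosting weights).

```python
import numpy as np
from sklearn.ensemble import IsolationForest
from sklearn.neighbors import LocalOutlierFactor
from sklearn.cluster import KMeans

def train_detection_classifier(Z3, y, contamination=0.1):
    m = len(Z3)
    w = np.full(m, 1.0 / m)                  # S31: uniform initial sample weights
    members = []                             # list of (alpha, predict_fn)

    def add(pred, predict_fn):
        nonlocal w
        err = float(np.sum(w[pred != y]))    # weighted training error of this detector
        if err > 0.5:                        # skip detectors that do worse than chance
            return
        alpha = 0.5 * np.log((1.0 - err) / max(err, 1e-12))
        w = w * np.exp(np.where(pred == y, -alpha, alpha))   # AdaBoost-style weight update
        w /= w.sum()
        members.append((alpha, predict_fn))

    # S32: iForest, anomaly ratio taken from the labels via `contamination`
    ifo = IsolationForest(n_estimators=100, contamination=contamination).fit(Z3)
    add((ifo.predict(Z3) == -1).astype(int),
        lambda X: (ifo.predict(X) == -1).astype(int))

    # S33: LOF in novelty mode so it can score unseen samples
    # (training-set predictions are used here for simplicity)
    lof = LocalOutlierFactor(n_neighbors=20, novelty=True,
                             contamination=contamination).fit(Z3)
    add((lof.predict(Z3) == -1).astype(int),
        lambda X: (lof.predict(X) == -1).astype(int))

    # S34: K-means, distance to the nearest cluster centre as the anomaly score
    km = KMeans(n_clusters=5, n_init=10).fit(Z3)
    d_train = km.transform(Z3).min(axis=1)
    t2 = np.quantile(d_train, 1.0 - contamination)            # distance threshold t_2
    add((d_train > t2).astype(int),
        lambda X: (km.transform(X).min(axis=1) > t2).astype(int))

    def detect(X):                           # weighted vote of the serialized detectors
        score = sum(a * f(X) for a, f in members)
        return (score > 0.5 * sum(a for a, _ in members)).astype(int)
    return detect
```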
S104, extracting two groups of feature vectors from the test set by using the improved adversarial autoencoder model, fusing the two groups of feature vectors, and inputting the fused feature vectors into the detection classifier to obtain an anomaly detection result.
Specifically, noise is first added to the test data set, which is then fed into the trained improved adversarial autoencoder model; two groups of feature vectors of the test set, Z_1' and Z_2', are extracted through the encoders E1 and E2, and the weighted fusion of step S102 is applied to obtain a fused feature vector Z'.
The fused feature vector Z' is taken as input data and sent to the detection classifier h(z) obtained in step S103; the strong classifier outputs the anomaly score of each test sample, thereby obtaining the anomaly detection result.
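Putting the pieces together at test time, reusing the sketches above (variable names such as X_test, lam and detect are hypothetical and carried over from the earlier examples):

```python
# Noise-add the test set, extract Z1' and Z2' with E1/E2, fuse as in S102,
# and score with the detection classifier obtained in S103.
X_test_noisy = X_test + 0.1 * torch.randn_like(X_test)
Z1_t = E1(X_test_noisy).detach().numpy()
Z2_t = E2(G(E1(X_test_noisy)).detach()).detach().numpy()
Z_fused = fuse_features(Z1_t, Z2_t, lam=0.6)
anomalies = detect(Z_fused)                  # 1 = anomalous test sample, 0 = normal
```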
In order to verify the effectiveness of the hybrid anomaly detection model based on the adversarial autoencoder, the method of the invention is compared with three traditional anomaly detection algorithms on the MNIST data set; the comparison results are shown in Table 1. Compared with the three traditional anomaly detection methods, the accuracy and the AUC of the proposed method are greatly improved, demonstrating that the method has higher reliability.
TABLE 1 Comparison of the four detection methods

                    iForest    LOF      OCSVM    Proposed method
  Accuracy (%)      89.81      84.57    70.82    92.38
  AUC               0.79       0.83     0.80     0.95
Compared with the prior art, the invention has the following remarkable advantages:
1. Compared with features extracted by traditional machine learning methods, the features extracted by this method are more abstract and representative, which effectively improves the accuracy of anomaly detection.
2. Ensemble learning is introduced: the three traditional anomaly detection algorithms iForest, LOF and K-means are integrated through the AdaBoost algorithm. Compared with using the traditional anomaly detection algorithms directly, this is more accurate, can handle high-dimensional data, and does not require feature selection.
3. The two groups of feature vectors produced by the adversarial autoencoder model are fused by weighting, so the extracted features of the input data are more representative. Compared with extracting features with an autoencoder alone, the method is more reliable and achieves a higher anomaly detection accuracy.
While the invention has been described with reference to a preferred embodiment, it will be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.