CN110674940A - Multi-index anomaly detection method based on neural network

Info

Publication number: CN110674940A
Application number: CN201910880142.4A
Authority: CN
Inventors: 葛晓波; 杨辰; 殷传旺
Original assignee: Shanghai Qing Chuang Information Technology Co Ltd
Current assignee: Shanghai Qing Chuang Information Technology Co Ltd
Priority date: 2019-09-18
Filing date: 2019-09-18
Publication date: 2020-01-10
Anticipated expiration: 2039-09-18
Also published as: CN110674940B

Abstract

The invention discloses a multi-index anomaly detection method based on a neural network, comprising the following steps: step 1: defining a data format; step 2: training a model of the system with a self-organizing map (SOM), this training being defined as the learning process; step 3: performing anomaly detection on input data, this detection being defined as the mapping process; and step 4: performing root cause localization when the model maps a sample as abnormal. The method uses the induced behavior model to predict previously unknown performance anomalies and to provide hints about their causes, and the model achieves higher prediction accuracy in benchmark tests; the SOM maps the high-dimensional input space into a low-dimensional map space while preserving the topological properties of the original input space, so that scalability and effective learning of system behavior are achieved.

Description

Multi-index anomaly detection method based on neural network

Technical Field

The invention relates to the technology in the field of computers, in particular to a multi-index abnormality detection method based on a neural network.

Background

An outlier is a data point that is sufficiently far from other points that it is suspected of being caused by another mechanism. Anomaly detection methods have been used in various fields of application, such as intrusion detection, financial fraud, medical diagnostics, law enforcement and natural sciences. The most common outlier detection methods include the use of distance-based methods, which, although they are old, have become the most popular and provide powerful results.

One particularly difficult case is high-dimensional anomaly detection, in which irrelevant attributes can hide outliers. Many different methods, such as feature bagging, high-contrast methods, statistical subspace selection, and spectral methods, are used to score points as anomalies in high-dimensional data. However, system metrics for real distributed applications often fluctuate because of noise from dynamic workloads or measurement, and conventional approaches that use statistical learning to detect abnormal data often require training data under specific assumptions, require a significant amount of human effort, and can only deal with previously known anomalies.

Achieving efficient multi-dimensional online system anomaly detection is a challenging task. The learning scheme must first be scalable, because learning over many metrics incurs a large overhead. Furthermore, system metrics for real distributed applications often fluctuate because of noise from dynamic workloads or measurement, which calls for a robust learning scheme. The SOM learning technique is chosen here to achieve scalable and effective multi-index anomaly detection. The SOM maps a high-dimensional input space into a low-dimensional map space, usually two-dimensional, while preserving the topological properties of the original input space; that is, two similar samples are projected to nearby positions on the map. The SOM can therefore handle multivariate system behavior learning well without missing any representative behavior.

Determining the root cause of an anomaly is a very important task. The SOM consists of a set of neurons arranged in a lattice that preserves the topological properties of the measurement space, and the model can use this information to identify the faulty metrics that caused the anomaly. The basic idea is to examine how the abnormal neuron differs from nearby normal neurons and to output the metrics that differ most as the faulty metrics.

Disclosure of Invention

The invention aims to provide a multi-index anomaly detection method based on a neural network. The method is based on the self-organizing map (SOM), an unsupervised neural network model. The network parameters and structure change in a self-organizing, self-adaptive way by automatically searching for inherent rules and attributes in the data, so that the data are gathered into different discrete regions according to their degree of similarity. The method comprises the following specific steps:

step 1: defining a data format;

the data set D has n data points and d dimensions, the d dimensions comprising index 1, index 2, index 3, …, index d; the i-th row of data is represented as a d-dimensional vector: $x(t) = (x_{i1}, x_{i2}, \dots, x_{id})$, where $x_{id}$ represents a system metric, and the vector of measurements is used as input for training the SOM; an SOM is composed of a group of neurons arranged in a lattice, each neuron is assigned its own weight vector and map coordinates, the weight vectors have the same length as the measurement vectors, and the weight vectors are dynamically updated according to the values of the measurement vectors in the training data;

step 2: performing model training on the system by using the SOM, and defining the training process as a learning process;

step1. initialize the weight of each neuron, $\{\, n_i = [w_{i1}, w_{i2}, w_{i3}, \dots, w_{ik}] \mid i = 1, \dots, n \,\}$; the neurons are arranged at equal intervals in a two-dimensional node matrix that forms the output layer; each node has a corresponding weight vector whose dimension equals the dimension of the input data;

step2. select an arbitrary input sample of dimension K, $x = [v_1, v_2, v_3, \dots, v_k]$, and calculate its distance to each neuron; all neurons of the network output layer compete with each other, and only one winning neuron, the BMU, is activated at a time; the activated neuron is determined by competitive learning: $c = \arg\min_i \{\mathrm{dist}(x, n_i)\}$;

step3. set a radius centered on the activated neuron; the region within this radius is called the winning region; the neurons in the winning region are selected according to the coordinates of the activated neuron and the neighborhood radius; in the initial stage the radius is set to a large value, the default initial radius being equal to the radius of the neural network map, and the radius shrinks continuously as the number of iterations increases; the shrinking function is as follows:

where $r_t$ is the radius at the t-th iteration, $r_0$ is the initial radius, $t$ is the current iteration number, and $\lambda$ is a constant;
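The symbols defined here ($r_t$, $r_0$, $t$, $\lambda$) match the exponential decay that is commonly used for SOM neighborhood radii. As an assumption rather than a formula confirmed by this text, such a schedule could be written as:

```latex
r_t = r_0 \exp\!\left(-\frac{t}{\lambda}\right)
```

Under this assumed form, the radius equals $r_0$ at $t = 0$ and falls to about 37% of $r_0$ when $t = \lambda$.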

step4. when a neuron is activated, it and the neurons in its winning region receive weight updates that make them more similar to the input sample; the update function is:

$W_{t+1} = W_t + \Theta(t)\,L(t)\,[V_t - W_t]$

where $W_{t+1}$ is the updated weight, $W_t$ is the weight before the update, and $V_t$ is the input sample; $\Theta(t)$ is the neighbor function, a Gaussian function that controls the update amplitude: the activated neuron receives the largest update, and the closer a neuron in the winning region is to the activated neuron, the larger its update; $L(t)$ is the learning rate function, which decays as the number of iterations increases so that the neuron weights gradually stabilize and the model converges;

step5. repeat step2 to step4 until the model converges, at which point training is finished;
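The learning process in step1 to step5 can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the patented implementation: the names `euclidean` and `train_som`, the exponential decay used for both the radius and the learning rate, the default constants, and the fixed iteration budget that stands in for a convergence test are all choices made here rather than details given in the text.

```python
import math
import random


def euclidean(a, b):
    """Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))


def train_som(samples, rows, cols, iterations=300, l0=0.5, lam=None, seed=0):
    """Return {(row, col): weight_vector} for a rows x cols lattice."""
    rng = random.Random(seed)
    k = len(samples[0])
    # step1: random initial weights, one vector of length k per lattice node.
    weights = {(r, c): [rng.random() for _ in range(k)]
               for r in range(rows) for c in range(cols)}
    r0 = max(rows, cols) / 2.0  # assumed: initial radius is half the map size
    lam = lam if lam is not None else iterations / 2.0  # assumed decay constant
    for t in range(iterations):  # assumed: fixed budget, no convergence test
        x = rng.choice(samples)
        # step2: competition, the BMU is the neuron nearest to the sample.
        bmu = min(weights, key=lambda pos: euclidean(x, weights[pos]))
        # step3: shrink the winning region (exponential decay is an assumption).
        radius = r0 * math.exp(-t / lam)
        rate = l0 * math.exp(-t / lam)  # L(t), assumed to decay the same way
        # step4: pull the BMU and its lattice neighbors toward the sample.
        for pos, w in weights.items():
            d = euclidean(pos, bmu)  # distance on the lattice, not in weight space
            if d <= radius:
                theta = math.exp(-(d * d) / (2.0 * radius * radius))  # Gaussian
                for j in range(k):
                    w[j] += theta * rate * (x[j] - w[j])
    # step5 is the loop itself: step2 to step4 repeat until the budget is spent.
    return weights
```

Each sample would be one row of the data set from step 1, for example a vector of CPU, memory, disk I/O and network traffic readings. Keying the weights by map coordinates keeps the lattice position and the weight vector of each neuron together, which the later steps rely on.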

step 3: carrying out anomaly detection on input data, and defining the anomaly detection as a mapping process;

step1. calculate the neighbor area of each neuron and sort the values, sort($Area_1, Area_2, Area_3, \dots, Area_n$); set a threshold, and treat neurons whose neighbor area exceeds the threshold as abnormal clusters; the neighbor area is defined from the distances between the selected neuron and its immediate neighbors, which lie above, below, to the left and to the right of the selected neuron's map coordinates ($N_T, N_B, N_L, N_R$); it is calculated as the mean of the Manhattan distances between the immediate neighbors and the selected neuron;

step2, selecting any input sample, calculating the distance from the input sample to each neuron and determining an activated neuron;

step3. compare the neighbor area of the activated neuron, $Area_{BMU}$, with the threshold to judge whether the sample is abnormal;
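The mapping process above can be sketched as follows. The function names are hypothetical, the map is assumed to be stored as a dictionary from (row, column) coordinates to weight vectors, and the nearest-rank percentile rule is an assumed convention, since the text does not say how the percentile is computed.

```python
import math


def manhattan(a, b):
    """Manhattan distance between two equal-length weight vectors."""
    return sum(abs(p - q) for p, q in zip(a, b))


def neighbor_areas(weights):
    """Mean Manhattan distance from each neuron to its top, bottom,
    left and right neighbors (edge neurons have fewer neighbors)."""
    areas = {}
    for (r, c), w in weights.items():
        adjacent = [(r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)]
        dists = [manhattan(w, weights[p]) for p in adjacent if p in weights]
        areas[(r, c)] = sum(dists) / len(dists) if dists else 0.0
    return areas


def percentile_threshold(areas, pct=98.0):
    """Value at the given percentile of the sorted areas.
    Nearest-rank is an assumed convention, not taken from the text."""
    ordered = sorted(areas.values())
    rank = max(1, math.ceil(pct / 100.0 * len(ordered)))
    return ordered[rank - 1]


def is_anomalous(x, weights, areas, threshold):
    """Map a sample to its BMU and compare the BMU's area with the threshold."""
    bmu = min(weights,
              key=lambda pos: sum((a - b) ** 2 for a, b in zip(x, weights[pos])))
    return areas[bmu] > threshold, bmu
```

A neuron with a large neighbor area sits far from its lattice neighbors in weight space, which is why samples mapped to it are treated as abnormal. On very small maps a 98% or 99% nearest-rank threshold coincides with the largest area, so no neuron can exceed it; the percentile is only meaningful on maps with enough neurons.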

step 4: when the model maps a sample as abnormal, root cause localization is carried out;

step1. when a measurement sample is mapped to an abnormal neuron, calculate the Euclidean distance from the abnormal neuron to a group of nearby normal neurons; neighboring abnormal neurons are excluded from the comparison because they represent unknown states and would give false indications of the cause of the anomaly; calculate the per-dimension difference between the activated neuron and each of the Q normal neurons around it to obtain Q difference arrays, each of length K:

$D_1 = [\,|W_{BMU_1} - W_{normal_{11}}|,\ |W_{BMU_2} - W_{normal_{12}}|,\ \dots,\ |W_{BMU_k} - W_{normal_{1k}}|\,]$

$D_Q = [\,|W_{BMU_1} - W_{normal_{Q1}}|,\ |W_{BMU_2} - W_{normal_{Q2}}|,\ \dots,\ |W_{BMU_k} - W_{normal_{Qk}}|\,]$

step2. normal neurons are selected preferentially from within the neighborhood radius; when the range does not contain enough normal neurons, the search radius is expanded until Q normal neurons are found;

step3. once a group of normal neurons is found, the differences are calculated; the absolute value of each difference is taken because the direction of the change, positive or negative, does not matter; each of the Q difference arrays is sorted from largest to smallest, and the dimensions from the largest difference to the smallest are assigned the scores K, K-1, K-2, …, 1 respectively, which yields Q index ranking tables;

step4. calculate the total score of each dimension by majority voting, and select the several dimensions with the highest total scores as the main factors.
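The root cause ranking can be sketched in Python as below. The function names are hypothetical. Two simplifications are assumed: taking the Q normal neurons nearest to the BMU on the lattice stands in for growing the search radius, and the "majority voting" tally is read as the sum of the per-array rank scores K, K-1, …, 1.

```python
def nearest_normal(bmu, weights, abnormal, q):
    """The q normal neurons closest to the BMU on the lattice.
    Assumed stand-in for expanding the search radius until q are found."""
    normal = [p for p in weights if p not in abnormal and p != bmu]
    normal.sort(key=lambda p: (p[0] - bmu[0]) ** 2 + (p[1] - bmu[1]) ** 2)
    return normal[:q]


def rank_root_causes(bmu, weights, abnormal, q=3, top=2):
    """Return (indices of the top dimensions, total score per dimension)."""
    k = len(weights[bmu])
    totals = [0] * k
    for pos in nearest_normal(bmu, weights, abnormal, q):
        # One difference array per normal neuron: |W_BMU_j - W_normal_j|.
        diffs = [abs(a - b) for a, b in zip(weights[bmu], weights[pos])]
        # Largest difference scores k, the next k-1, down to 1.
        order = sorted(range(k), key=lambda j: diffs[j], reverse=True)
        for place, j in enumerate(order):
            totals[j] += k - place
    ranked = sorted(range(k), key=lambda j: totals[j], reverse=True)
    return ranked[:top], totals
```

The returned indices refer to positions in the measurement vector, so with an input layout such as (CPU, memory, disk I/O) an index of 1 would point at memory as a main factor.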

Preferably, the system metric in step 1 includes CPU, memory, disk I/O, or network traffic.

Preferably, in step 3, the threshold is selected as a percentile of the sorted neighbor areas, and in the method the percentile is set to 98% or 99%.

Compared with the prior art, the invention has the advantages that:

1) the method uses the self-organizing map to learn the behavior of a high-dimensional system and capture anomalies, uses the induced behavior model to predict previously unknown performance anomalies and provide hints about their causes, and achieves higher prediction accuracy in benchmark tests.

2) the method uses the SOM to map the high-dimensional input space into a low-dimensional map space, usually two-dimensional, while preserving the topological properties of the original input space, so that scalability and effective system behavior learning are achieved.

Drawings

FIG. 1 is a flow chart of a multi-index anomaly detection method based on a neural network;

FIG. 2 shows the SOM input layer and competition layer;

FIG. 3 shows SOM neighbor neurons.

Detailed Description

The embodiments of the present invention are described in detail below with reference to the accompanying drawings. As shown in FIG. 1, the following steps are performed:

step 1: defining a data format;

the data set D has n data points and d dimensions, including time, index 1, index 2, index 3, …, index d; the i-th row of data can then be represented as a d-dimensional vector: $x(t) = (x_{i1}, x_{i2}, \dots, x_{id})$, where $x_{id}$ represents a system metric such as CPU, memory, disk I/O or network traffic, and the vector of measured values is used as input to train the SOM; an SOM is composed of a group of neurons arranged in a lattice, each neuron is assigned its own weight vector and map coordinates, the weight vectors have the same length as the measurement vectors, and the weight vectors are dynamically updated according to the values of the measurement vectors in the training data;

step 2: performing model training on the system by using the SOM, and defining the model training as the Learning Process;

step2. as shown in FIG. 2, select an arbitrary input sample of dimension K, $x = [v_1, v_2, v_3, \dots, v_k]$, and calculate its distance to each neuron; all neurons of the network output layer compete with each other, and only one winning neuron, the Best Matching Unit (BMU), is activated at a time; the activated neuron is determined by competitive learning: $c = \arg\min_i \{\mathrm{dist}(x, n_i)\}$;

step3. as shown in FIG. 3, set a radius centered on the activated neuron; the region within this radius is called the winning region; the neurons in the winning region are selected according to the coordinates of the activated neuron and the neighborhood radius; in the initial stage the radius is set to a large value, the default initial radius being equal to the radius of the neural network map, and the radius shrinks continuously as the number of iterations increases; the shrinking function is as follows:

the weight update function is: $W_{t+1} = W_t + \Theta(t)\,L(t)\,[V_t - W_t]$

step 3: carrying out anomaly detection on input data, and defining the anomaly detection as the Mapping Process;

step1. calculate the neighbor area of each neuron and sort the values, sort($Area_1, Area_2, Area_3, \dots, Area_n$); set a threshold (Threshold), and treat neurons whose neighbor area exceeds it as abnormal clusters; the threshold is selected as a percentile of the sorted values and is set to 98% or 99% in the method; the neighbor area is defined from the distances between the selected neuron and its immediate neighbors, which lie above, below, to the left and to the right of the selected neuron's map coordinates ($N_T, N_B, N_L, N_R$); it is calculated as the mean of the Manhattan distances between the immediate neighbors and the selected neuron;

step 4: when the model maps a sample as abnormal, root cause localization (Root Cause) is carried out;

$D_1 = [\,|W_{BMU_1} - W_{normal_{11}}|,\ |W_{BMU_2} - W_{normal_{12}}|,\ \dots,\ |W_{BMU_k} - W_{normal_{1k}}|\,]$

$D_Q = [\,|W_{BMU_1} - W_{normal_{Q1}}|,\ |W_{BMU_2} - W_{normal_{Q2}}|,\ \dots,\ |W_{BMU_k} - W_{normal_{Qk}}|\,]$

While the present invention has been described with reference to a limited number of embodiments and drawings, as described above, various modifications and changes will become apparent to those skilled in the art to which the present invention pertains. Accordingly, other embodiments are within the scope and spirit of the following claims and equivalents thereto.

Claims

1. A multi-index abnormality detection method based on a neural network is characterized by comprising the following specific steps:

step 1: defining a data format;

step2. select an arbitrary input sample of dimension K, $x = [v_1, v_2, v_3, \dots, v_k]$, and calculate its distance to each neuron, wherein all neurons of the network output layer compete with each other, and only one winning neuron, the BMU, is activated each time; the activated neuron is determined by competitive learning: $c = \arg\min_i \{\mathrm{dist}(x, n_i)\}$;

$W_{t+1} = W_t + \Theta(t)\,L(t)\,[V_t - W_t]$

step1. calculate and rank the neighbor area of each neuron, sort($Area_1, Area_2, Area_3, \dots, Area_n$); set a threshold, and take neurons whose neighbor area exceeds the threshold as abnormal clusters; the neighbor area is defined from the distances between the selected neuron and its immediate neighbors, which lie above, below, to the left and to the right of the selected neuron's map coordinates ($N_T, N_B, N_L, N_R$); it is calculated as the mean of the Manhattan distances between the immediate neighbors and the selected neuron;

$D_1 = [\,|W_{BMU_1} - W_{normal_{11}}|,\ |W_{BMU_2} - W_{normal_{12}}|,\ \dots,\ |W_{BMU_k} - W_{normal_{1k}}|\,]$

$D_Q = [\,|W_{BMU_1} - W_{normal_{Q1}}|,\ |W_{BMU_2} - W_{normal_{Q2}}|,\ \dots,\ |W_{BMU_k} - W_{normal_{Qk}}|\,]$

step2. selecting normal neurons from within the neighborhood radius, and expanding the search radius when the range does not contain enough normal neurons, until Q normal neurons are found;

2. The multi-index abnormality detection method according to claim 1, characterized in that: the system metric in step 1 comprises CPU, memory, disk I/O or network traffic.

3. The multi-index abnormality detection method according to claim 1, characterized in that: the threshold in step 3 is selected as a percentile of the sorted values, and the threshold is set to 98% or 99% in the method.