CN113076235A - Time series anomaly detection method based on state fusion

Info

Publication number: CN113076235A
Application number: CN202110382900.7A
Authority: CN
Inventors: 梁于宁; 吴维刚
Original assignee: Sun Yat Sen University
Current assignee: Sun Yat Sen University
Priority date: 2021-04-09
Filing date: 2021-04-09
Publication date: 2021-07-06
Anticipated expiration: 2041-04-09
Also published as: CN113076235B

Abstract

The invention provides a time series anomaly detection method based on state fusion, which aims to overcome the errors in cluster anomaly detection caused by time distortion of time series features. The method comprises the following steps: collecting the time series data to be detected and preprocessing it to obtain time series data X_T and its corresponding label Y_T; inputting the preprocessed time series data X_T into a semantic analysis network MarkNet to generate a marker sequence S_T; inputting the marker sequence S_T into a MeM module for state fusion analysis to generate a marker sequence S_U; inputting the marker sequence S_U into an anomaly monitoring network DetectNet for anomaly detection to obtain a confidence P(X); and comparing the confidence P(X) with a preset comparison threshold ε, outputting the detection result as abnormal when P(X) is greater than ε, and otherwise outputting it as normal.

Description

Time series anomaly detection method based on state fusion

Technical Field

The invention relates to the technical field of cluster resource management, and in particular to a time series anomaly detection method based on state fusion.

Background

At present, anomaly detection for a cluster usually starts from the resource usage of the cluster units. The resource usage of the cluster is recorded as time series, which are either sent to an analysis module for real-time monitoring or stored in a large-scale database for repeated analysis by subsequent tasks. Because time series data are temporally correlated, models that estimate a probability distribution cannot capture the temporal features, and detection accuracy drops sharply; most work therefore adds a deep learning model. For example, a convolutional neural network extracts segment features from the time series, a recurrent neural network records its temporal features, a fully connected network performs deep feature induction and decomposition, techniques such as max pooling and activation functions give the network its nonlinearity, and the network finally outputs a comprehensive judgment score.

However, current time series learning methods are generally trained and tested time step by time step, that is, the network outputs a value at every time step and the judgment is made on that value. Because data center monitoring produces dense time series, the temporal features of the monitored object suffer temporal distortion caused by performance fluctuation, task switching, data center resource fluctuation and the like. For example, a resource bottleneck in the data center may stretch the peak features of a time series that records user access volume, and a deep learning model trained frame by frame cannot capture such features well.

Two techniques are currently the main ways of handling temporal distortion of time series features. Dynamic Time Warping (DTW) is a time series similarity measure that uses dynamic programming to find the smallest distance between two time series; it can handle time series deformation under specific conditions and thus reduce misjudgments caused by time warping. However, DTW computes the shortest distance between two time series under a priori assumptions and gives no reasonable explanation for the misalignment itself. Connectionist Temporal Classification (CTC) is a sequence transformation method that merges adjacent identical output values of a neural network's prediction into one, yielding a more compact time series representation. This representation distinguishes the latent features of the time series and can be passed on to subsequent processing. However, CTC merges all identical adjacent feature states in a sequence, so when a state persists for a relatively long time the representation may be oversimplified and cause errors.
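The two prior techniques can be illustrated with a minimal Python sketch. `dtw_distance` is the textbook dynamic-programming form of DTW; the absolute-difference local cost is an assumption, since the text names no cost function. `ctc_collapse` is the usual CTC decoding rule of merging adjacent repeats and then dropping the blank. The function names and the use of `φ` as the blank symbol are illustrative.

```python
def dtw_distance(a, b):
    """Textbook DTW between two numeric sequences, local cost |x - y| (assumed)."""
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] = smallest accumulated cost aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(cost[i - 1][j],      # stretch b
                                     cost[i][j - 1],      # stretch a
                                     cost[i - 1][j - 1])  # advance both
    return cost[n][m]


def ctc_collapse(marks, blank="φ"):
    """Merge adjacent repeats, then drop the blank symbol."""
    out = []
    prev = None
    for s in marks:
        if s != prev and s != blank:
            out.append(s)
        prev = s
    return "".join(out)
```

Under this rule a state held for 2 steps and one held for 50 steps both collapse to a single symbol (`ctc_collapse("aab")` and `ctc_collapse("a" * 50 + "b")` both give `"ab"`), which is the oversimplification described above.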

Disclosure of Invention

The invention provides a time series anomaly detection method based on state fusion, in order to overcome the defect of the prior art that time distortion of time series features causes errors in cluster anomaly detection.

In order to solve the technical problems, the technical scheme of the invention is as follows:

A time series anomaly detection method based on state fusion comprises the following steps:

S1: collecting the time series data to be detected and preprocessing it to obtain time series data X_T and its corresponding label Y_T;

S2: inputting the preprocessed time series data X_T into a semantic analysis network MarkNet to generate a marker sequence S_T;

S3: inputting the marker sequence S_T into a MeM module for state fusion analysis to generate a marker sequence S_U;

S4: inputting the marker sequence S_U into an anomaly monitoring network DetectNet for anomaly detection to obtain a confidence P(X), comparing the confidence P(X) with a preset comparison threshold ε, and outputting the detection result as abnormal when P(X) is greater than ε, or as normal otherwise.
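The flow of steps S2 to S4 can be sketched as a small Python function. This is only an outline under stated assumptions: the three stages are passed in as callables, and the `stub_*` functions below are hypothetical placeholders that stand in for the trained MarkNet, the MeM module and DetectNet purely so the example runs.

```python
def detect(x_t, marknet, mem, detectnet, w, eps):
    """Run S2-S4 on preprocessed data x_t; return (result, confidence)."""
    s_t = marknet(x_t)              # S2: marker sequence S_T
    s_u = mem(s_t, w)               # S3: fused marker sequence S_U
    p = detectnet(s_u, x_t, eps)    # S4: confidence P(X) = D(S_U, X_T, eps)
    return ("abnormal" if p > eps else "normal"), p


# Hypothetical stand-ins, NOT the networks described in the text.
def stub_marknet(x_t):
    return ["a" if v < 0.5 else "b" for v in x_t]


def stub_mem(s_t, w):
    return list(s_t)                # no fusion in this placeholder


def stub_detectnet(s_u, x_t, eps):
    return max(x_t)                 # placeholder confidence


result, confidence = detect([0.1, 0.9], stub_marknet, stub_mem, stub_detectnet, w=3, eps=0.5)
```

Only the final comparison against ε is taken directly from step S4; everything else depends on the stage implementations.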

Preferably, preprocessing the time series data to be detected includes denoising and normalization.

Preferably, the semantic analysis network MarkNet comprises an LSTM neural network module, a fully connected layer and a softmax layer connected in sequence.

Preferably, in step S2, the marker sequence S_T is generated as follows: the preprocessed time series data X_T is input into the LSTM neural network module of the semantic analysis network MarkNet, which processes the input to obtain a hidden state h_t and an internal memory state c_t. The preprocessed time series data X_T and its label Y_T are expressed as:

X_T = {x_1, x_2, ..., x_T}, x_t ∈ R^m, t = 1, 2, ..., T

Y_T = {y_1, y_2, ..., y_T}, y_t ∈ {0, 1}

where x_t denotes the data at time t in the input time series data X_T, T is the length of the time series data, Y_T is the label of the time series data X_T, and y_t is the label corresponding to the data x_t. The LSTM neural network module is computed as:

(h_t, c_t) = f_lstm(x_{t-1}, h_{t-1}, c_{t-1})

The hidden state h_t is input into the fully connected layer to generate an (n+1)-dimensional output, a predicted probability distribution is obtained through the softmax layer, and the outputs s_t obtained from this distribution form the marker sequence S_T:

s_t = f_dense(h_t)

S_T = {s_1, s_2, ..., s_T}, s_t ∈ L = {l_1, l_2, ..., l_n, φ}

where L is the marker set, which contains n non-null semantic markers l_1, ..., l_n and one null semantic marker φ.

Preferably, the MeM module performs state fusion analysis on the input marker sequence S_T using a connectionist classification method constrained by a window and a fusion length limit.

Preferably, in step S3, the marker sequence S_U is generated as follows: the marker sequence S_T is input into the MeM module for state fusion analysis; the MeM module deduplicates adjacent, consecutive, identical marker segments whose length does not exceed w, merging each into a single new element, and directly removes the blank marker φ, thereby obtaining a marker sequence S_U that carries the refined information of the T time steps:

S_U = MeM(S_T, w)

where w is the parameter with which the MeM module controls the number of fused elements.

Preferably, the confidence P(X) output by the anomaly monitoring network DetectNet is calculated as:

P(X_T) = D(S_U, X_T, ε)

where D(·) denotes the DetectNet neural network model.

Preferably, in step S4, the anomaly monitoring network DetectNet performs time series decoding with an auto-encoder, feeds the original time series data into the decoder through a skip connection, and outputs the confidence P(X).

Preferably, the method further comprises the following steps: computing accuracy statistics from the confidence P(X) and the time series data label Y_T through a loss function, computing the error gradient, and performing gradient back-propagation and model parameter updates on the semantic analysis network MarkNet and the anomaly monitoring network DetectNet; judging, from the accuracy given by the loss function, whether the current accuracy meets a preset accuracy threshold; if so, finishing the training of the semantic analysis network MarkNet; if not, returning to step S2.

Preferably, the step of performing gradient back-propagation and model parameter updates on the semantic analysis network MarkNet and the anomaly monitoring network DetectNet comprises: judging from the computed error gradient whether the semantic analysis network MarkNet needs training; if so, fixing the parameters θ_D of the anomaly monitoring network DetectNet, computing the detection deviation of MarkNet, back-propagating according to the gradient and updating the parameters of MarkNet, thereby strengthening its semantic understanding capability; if not, fixing the parameters θ_M of the semantic analysis network MarkNet, computing the detection deviation of DetectNet, back-propagating according to the gradient and updating the parameters of DetectNet, thereby strengthening its anomaly detection capability.

Compared with the prior art, the technical scheme of the invention has the following beneficial effects: by marking the time series data to be detected, the method assigns semantic markers to unknown time series data and analyzes the latent semantics of general time series data, so that time series patterns deformed by time distortion can be better identified; by adopting state fusion, the method addresses the time distortion problem of time series and can effectively improve the accuracy of anomaly detection when the computer cluster runs complex tasks and the time series fluctuates or is distorted.

Drawings

FIG. 1 is a flowchart of the time series anomaly detection method based on state fusion according to the present invention.

FIG. 2 is a framework diagram of the time series anomaly detection method based on state fusion according to the present invention.

FIG. 3 is a schematic structural diagram of the LSTM neural network module of the embodiment.

FIG. 4 is a schematic diagram of the time series data to be detected and the marker sequence obtained by inputting it into MarkNet in the embodiment.

FIG. 5 is a diagram of the time series data data1 and its marker sequence mark1 in the embodiment.

FIG. 6 is a diagram of the time series data data2 and its marker sequence mark2 in the embodiment.

FIG. 7 is a diagram of the marker sequence MeM1 obtained by state fusion of the marker sequence mark1 in the embodiment.

FIG. 8 is a diagram of the marker sequence MeM2 obtained by state fusion of the marker sequence mark2 in the embodiment.

Detailed Description

The drawings are for illustrative purposes only and are not to be construed as limiting the patent.

It will be understood by those skilled in the art that certain well-known structures and their descriptions may be omitted from the drawings.

The technical solution of the present invention is further described below with reference to the accompanying drawings and examples.

Embodiment:

This embodiment provides a time series anomaly detection method based on state fusion; FIG. 1 and FIG. 2 show the flowchart and the framework diagram of the method.

The time series anomaly detection method based on state fusion provided by this embodiment comprises the following steps:

S1: collecting the time series data to be detected and preprocessing it to obtain time series data X_T and its corresponding label Y_T; setting the hyper-parameters of the semantic analysis network MarkNet, the MeM module and the anomaly monitoring network DetectNet of this embodiment, and initializing the model parameters.

In this step, preprocessing the time series data to be detected includes denoising and normalization; after processing, time series data X_T of length T and its corresponding label Y_T are obtained, expressed as:

X_T = {x_1, x_2, ..., x_T}, x_t ∈ R^m, t = 1, 2, ..., T

Y_T = {y_1, y_2, ..., y_T}, y_t ∈ {0, 1}

where x_t denotes the data at time t in the input time series data X_T, and y_t denotes the label corresponding to the data x_t.
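The preprocessing in S1 can be sketched as follows. The text only says "denoising and normalization", so the concrete choices here are assumptions: a centered moving average for denoising, min-max scaling for normalization, and a univariate series (m = 1). All function names are illustrative.

```python
def moving_average(values, window=3):
    """Denoise with a centered moving average; edges use the neighbours available."""
    half = window // 2
    out = []
    for i in range(len(values)):
        lo, hi = max(0, i - half), min(len(values), i + half + 1)
        out.append(sum(values[lo:hi]) / (hi - lo))
    return out


def min_max_normalize(values):
    """Scale values into [0, 1]; a constant series maps to all zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def preprocess(values, window=3):
    """Denoise first, then normalize, giving the series used as X_T."""
    return min_max_normalize(moving_average(values, window))


x_t = preprocess([0, 10, 0, 10, 0])
```

Other denoising filters or z-score normalization would fit the description equally well.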

S2: inputting the preprocessed time series data X_T into the semantic analysis network MarkNet to generate a marker sequence S_T.

The semantic analysis network MarkNet in this embodiment comprises an LSTM neural network module, a fully connected layer and a softmax layer connected in sequence. Specifically, the marker sequence S_T is generated as follows:

The preprocessed time series data X_T is input into the LSTM neural network module of the semantic analysis network MarkNet, which processes the input to obtain a hidden state h_t and an internal memory state c_t.

FIG. 3 is a schematic structural diagram of the LSTM neural network module of this embodiment.

The LSTM neural network module is computed as:

(h_t, c_t) = f_lstm(x_{t-1}, h_{t-1}, c_{t-1})

As the formula shows, the output of the LSTM neural network module at time t is determined by the input, the hidden state and the memory state at time t-1.
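A single application of f_lstm can be sketched in plain Python with the standard LSTM gate equations. For brevity this assumes scalar input and scalar hidden state (m = 1, hidden size 1); the `params` layout and all weight values are hypothetical, and a real MarkNet would use a vector-valued LSTM layer from a deep learning framework.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def lstm_step(x_prev, h_prev, c_prev, params):
    """One step (h_t, c_t) = f_lstm(x_{t-1}, h_{t-1}, c_{t-1}) for scalar states.

    params maps each gate name to (w_x, w_h, b): "i" input gate, "f" forget gate,
    "o" output gate, "g" candidate memory.
    """
    def pre(gate):
        w_x, w_h, b = params[gate]
        return w_x * x_prev + w_h * h_prev + b

    i = sigmoid(pre("i"))
    f = sigmoid(pre("f"))
    o = sigmoid(pre("o"))
    g = math.tanh(pre("g"))
    c = f * c_prev + i * g      # new memory state c_t
    h = o * math.tanh(c)        # new hidden state h_t
    return h, c


# Roll the cell over a short series with arbitrary illustrative weights.
params = {gate: (0.5, 0.5, 0.0) for gate in "ifog"}
h, c = 0.0, 0.0
for x in [0.2, 0.4, 0.9]:
    h, c = lstm_step(x, h, c, params)
```

The argument order of `lstm_step` mirrors the formula above, so each call consumes the previous input and states and returns the next pair.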

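The hidden state h_t is then input into the fully connected layer to generate an (n+1)-dimensional output, a predicted probability distribution is obtained through the softmax layer, and the outputs s_t obtained from this distribution form the marker sequence S_T:

s_t = f_dense(h_t)

S_T = {s_1, s_2, ..., s_T}, s_t ∈ L = {l_1, l_2, ..., l_n, φ}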
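The mapping from a hidden state to a marker can be sketched as a dense layer followed by softmax. Two points are assumptions rather than statements of the text: the hidden state is a scalar, and the marker is chosen as the most probable class (argmax). The marker names and weights are illustrative.

```python
import math

MARKS = ["l1", "l2", "φ"]   # n = 2 non-null markers plus the null marker (illustrative)


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def dense(h, weights, biases):
    """Map a scalar hidden state to n + 1 logits (hidden size 1 is an assumption)."""
    return [w * h + b for w, b in zip(weights, biases)]


def mark(h, weights, biases, marks=MARKS):
    """Return (s_t, probabilities); s_t is the most probable marker (assumed rule)."""
    probs = softmax(dense(h, weights, biases))
    best = max(range(len(probs)), key=probs.__getitem__)
    return marks[best], probs


s_t, probs = mark(0.8, weights=[2.0, -2.0, 0.0], biases=[0.0, 0.0, 0.0])
```

Applying `mark` to every h_t in turn yields the marker sequence S_T.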

The semantic analysis network MarkNet in this embodiment uses as its processing unit an LSTM neural network module, an improved variant of the recurrent neural network (RNN), which has clear advantages in processing sequence data and can remember temporal dependencies.

S3: inputting the marker sequence S_T into the MeM module for state fusion analysis to generate a marker sequence S_U.

In this step, the MeM module performs state fusion analysis on the input marker sequence S_T using a connectionist classification method constrained by a window and a fusion length limit. The marker sequence S_U is generated as follows: the marker sequence S_T is input into the MeM module for state fusion analysis; the MeM module deduplicates adjacent, consecutive, identical marker segments whose length does not exceed w, merging each into a single new element, and directly removes the blank marker φ, thereby obtaining a marker sequence S_U that carries the refined information of the T time steps:

S_U = MeM(S_T, w)

where w is the parameter with which the MeM module controls the number of fused elements and prevents fusion over an overly long time span; the null semantic marker φ is used to separate identical elements.

In one specific example, the results of the state fusion analysis performed by the MeM module on marker sequences S_T are:

MeM(abbbaaac, 3) = abac

MeM(abbbab, 2) = abbab

MeM(abbaφa, 2) = abaa
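The three examples above are reproduced by the following Python sketch. The fusion rule is inferred from those examples, not stated verbatim in the text: each run of identical markers is cut into chunks of at most w elements, every chunk is fused into one marker, and the blank φ is dropped while still separating the runs on either side of it. The function name `mem` is illustrative.

```python
from itertools import groupby
from math import ceil

BLANK = "φ"


def mem(marks, w, blank=BLANK):
    """Fuse each run of identical markers in chunks of at most w; drop blanks."""
    if w < 1:
        raise ValueError("w must be at least 1")
    out = []
    for symbol, run in groupby(marks):
        if symbol == blank:
            continue                      # blank only separates neighbouring runs
        run_length = sum(1 for _ in run)
        # A run longer than w is not fused completely: it keeps ceil(len / w) markers.
        out.extend([symbol] * ceil(run_length / w))
    return "".join(out)


print(mem("abbbaaac", 3))   # abac
print(mem("abbbab", 2))     # abbab
print(mem("abbaφa", 2))     # abaa
```

With w = 2 the run `bbb` becomes `bb` rather than `b`, which is how the window limits fusion over long time spans.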

The principle of the MeM module in this embodiment is analogous to speech recognition, where the same text can correspond to speech data of different lengths. In a data center, task execution times may be inconsistent, and training a frame-by-frame, time-step-by-time-step model requires the data to be strictly aligned, which is difficult when task execution times vary and resources fluctuate strongly. The MeM module therefore merges the predicted states of adjacent time steps and, within the allowed time span w, obtains a new marker sequence carrying the refined information of the T time steps; this new sequence is a more general pattern, which benefits subsequent anomaly detection.

S4: inputting the marker sequence S_U into the anomaly monitoring network DetectNet for anomaly detection to obtain a confidence P(X), comparing the confidence P(X) with a preset comparison threshold ε, and outputting the detection result as abnormal when P(X) is greater than ε, or as normal otherwise.

In this step, the confidence P(X) output by the anomaly monitoring network DetectNet is calculated as:

P(X_T) = D(S_U, X_T, ε)

where D(·) denotes the DetectNet neural network model.

The anomaly monitoring network DetectNet in this embodiment acts as a decoder: it reconstructs a time series from the marker sequence and compares it with the original time series, so that the influence of time can be robustly eliminated, and then produces the anomaly confidence P(X). In this embodiment, DetectNet performs time series decoding with an auto-encoder, feeds the original time series data into the decoder through a skip connection, and outputs the confidence P(X).

Further, accuracy statistics are computed from the confidence P(X) and the time series data label Y_T through a loss function, the error gradient is computed, and gradient back-propagation and model parameter updates are performed on the semantic analysis network MarkNet and the anomaly monitoring network DetectNet. Whether the current accuracy meets a preset accuracy threshold is judged from the accuracy given by the loss function; if so, the training of the semantic analysis network MarkNet is finished; if not, the procedure returns to step S2.
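The text does not name the loss function or how accuracy is measured, so the following is only one plausible reading: binary cross-entropy between per-time-step confidences and the labels y_t, with accuracy computed by thresholding the confidences at ε. All names and the per-time-step interpretation of P(X) are assumptions.

```python
import math


def bce_loss(confidences, labels, tiny=1e-12):
    """Mean binary cross-entropy between confidences in [0, 1] and 0/1 labels."""
    total = 0.0
    for p, y in zip(confidences, labels):
        p = min(max(p, tiny), 1.0 - tiny)    # keep log() finite
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(labels)


def accuracy(confidences, labels, eps):
    """Fraction of time steps where (confidence > eps) agrees with the label."""
    hits = sum((p > eps) == bool(y) for p, y in zip(confidences, labels))
    return hits / len(labels)


def training_done(confidences, labels, eps, target_accuracy):
    """Stopping test: has the preset accuracy threshold been reached?"""
    return accuracy(confidences, labels, eps) >= target_accuracy


p_x = [0.9, 0.1, 0.8, 0.2]
y_t = [1, 0, 0, 0]
done = training_done(p_x, y_t, eps=0.5, target_accuracy=0.7)
```

When `training_done` is false, the procedure described above would return to step S2 for another round.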

Gradient back-propagation and model parameter updates on the semantic analysis network MarkNet and the anomaly monitoring network DetectNet comprise:

(1) fixing the parameters θ_D of the anomaly monitoring network DetectNet, computing the detection deviation of the semantic analysis network MarkNet, back-propagating according to the gradient and updating the parameters of MarkNet, thereby strengthening its semantic understanding capability;

(2) fixing the parameters θ_M of the semantic analysis network MarkNet, computing the detection deviation of the anomaly monitoring network DetectNet, back-propagating according to the gradient and updating the parameters of DetectNet, thereby strengthening its anomaly detection capability.

These steps are analogous to the training process of a generative adversarial network: whether the semantic analysis network MarkNet needs training is judged from the computed error gradient; if so, step (1) is executed, otherwise step (2) is executed, followed by gradient back-propagation and model parameter updates. Whether the current accuracy meets the preset accuracy threshold is then judged from the accuracy given by the loss function; if so, the training of the semantic analysis network MarkNet is finished; if not, the procedure returns to step S2.
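The alternating update can be illustrated with a deliberately tiny example in which each network is reduced to one scalar parameter. Everything concrete here is an assumption: the quadratic `loss` is a toy stand-in for the detection loss, the rule "train whichever module currently has the larger gradient magnitude" is one possible reading of "judged from the computed error gradient", and a loss threshold replaces the accuracy threshold.

```python
def loss(theta_m, theta_d):
    """Toy stand-in for the detection loss; minimum at theta_m = 1, theta_d = 2."""
    return (theta_m - 1.0) ** 2 + (theta_d - 2.0) ** 2


def gradients(theta_m, theta_d):
    """Partial derivatives of the toy loss with respect to each parameter."""
    return 2.0 * (theta_m - 1.0), 2.0 * (theta_d - 2.0)


def train(theta_m=0.0, theta_d=0.0, lr=0.25, target_loss=1e-6, max_steps=1000):
    """Alternate updates: one module is frozen while the other takes a gradient step."""
    history = []
    for _ in range(max_steps):
        if loss(theta_m, theta_d) <= target_loss:
            break                                  # stopping threshold reached
        g_m, g_d = gradients(theta_m, theta_d)
        if abs(g_m) >= abs(g_d):
            theta_m -= lr * g_m                    # step (1): theta_d fixed
            history.append("MarkNet")
        else:
            theta_d -= lr * g_d                    # step (2): theta_m fixed
            history.append("DetectNet")
    return theta_m, theta_d, history


theta_m, theta_d, history = train()
```

In a framework such as PyTorch the same pattern is usually obtained by disabling gradients for the frozen module's parameters before each step.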

In a specific implementation, the method is coded and implemented with the PyTorch deep learning framework and the Python programming language, or other similar languages. In the implementation, time series data from several data center clusters are preprocessed, training data sets are generated by random sampling, and the method of the invention is applied to these data for anomaly detection. Compared with pure network models such as an LSTM model or a VAE detection model, the two-layer model with the MeM state fusion module can effectively detect deformed patterns.

The verification results of the embodiment show that the proposed method keeps the detection robust under time distortion.

FIG. 4 is a schematic diagram of the time series data to be detected and the marker sequence obtained by inputting it into MarkNet. At the dividing point t = 200 the frequency of the time series changes, that is, the time series data to be detected consists of two segments with different frequencies. FIG. 5 and FIG. 6 show the two segments and the marker sequences obtained by inputting them into MarkNet: the marker sequences produced for data1 (first frequency) and data2 (second frequency) are the curves mark1 and mark2. After passing through the state fusion window of the MeM module with w set to 200, the marker sequences of the two segments take the same shape, as shown in FIG. 7 and FIG. 8. With an appropriate state fusion window, the fused sequences MeM1 and MeM2 have the same shape. The marker sequences of time series with two different frequencies are thus converted into the same representation, which is used as a feature sequence of the time series for subsequent anomaly detection, giving a more accurate judgment.
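The effect shown in FIG. 5 to FIG. 8 can be imitated on synthetic marker strings. This sketch uses made-up sequences, not the data of the figures, and a compact state fusion function whose chunking rule is an assumption inferred from the MeM examples in this description.

```python
from itertools import groupby


def mem(marks, w, blank="φ"):
    """Assumed MeM rule: each run keeps ceil(run_length / w) markers; blanks dropped."""
    out = []
    for symbol, run in groupby(marks):
        if symbol != blank:
            run_length = len(list(run))
            out.extend([symbol] * -(-run_length // w))   # ceiling division
    return "".join(out)


slow = "aaaabbbb" * 3    # pattern stretched in time (lower frequency)
fast = "aabb" * 3        # same pattern at twice the frequency

same_shape = mem(slow, 4) == mem(fast, 4)    # window covers the longest run
too_small = mem(slow, 2) == mem(fast, 2)     # window shorter than the slow runs
```

With w = 4 both strings fuse to `ababab`, while with w = 2 the slow string keeps `aabbaabbaabb`, so the window has to be at least as long as the stretched states for the two shapes to coincide.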

This embodiment provides, on the one hand, a method for marking time series data, which assigns semantic markers to unknown time series data for use in subsequent time series processing. On the other hand, it provides an improved time series state fusion analysis mechanism that analyzes the latent semantics of a time series, so that time series patterns deformed by time warping can be better identified and used to judge the state of the computer cluster. The latent semantics can be analyzed for general time series, and the latent operating rules of a time series can be analyzed even when the task running behind it is unknown. Once semantic markers are assigned, the method can still identify the time series robustly when it is distorted and its latent pattern is scaled. Applying the model to task detection on cluster machines makes it possible to better handle abnormal pattern identification when data center tasks vary and resources are switched frequently, helps operations managers identify the health of computing units, the running state of monitored tasks and the usage of resources, improves resource utilization efficiency and creates more benefit. In addition, the model training process of this embodiment is analogous to that of a generative adversarial network, and training the two modules against each other can further improve the accuracy of time series analysis.

It should be understood that the above embodiments of the present invention are merely examples given to illustrate the invention clearly and are not intended to limit its implementations. Other variations and modifications will be apparent to persons skilled in the art in light of the above description. It is neither necessary nor possible to enumerate all embodiments here. Any modification, equivalent replacement or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the claims of the present invention.

Claims

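1. A time series anomaly detection method based on state fusion, characterized by comprising the following steps:

S1: collecting the time series data to be detected and preprocessing it to obtain time series data X_T and its corresponding label Y_T;

S2: inputting the preprocessed time series data X_T into a semantic analysis network MarkNet to generate a marker sequence S_T;

S3: inputting the marker sequence S_T into a MeM module for state fusion analysis to generate a marker sequence S_U;

S4: inputting the marker sequence S_U into an anomaly monitoring network DetectNet for anomaly detection to obtain a confidence P(X), comparing the confidence P(X) with a preset comparison threshold ε, and outputting the detection result as abnormal when P(X) is greater than ε, or as normal otherwise.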

2. The time series anomaly detection method based on state fusion according to claim 1, wherein preprocessing the time series data to be detected comprises denoising and normalization.

3. The time series anomaly detection method based on state fusion, wherein the semantic analysis network MarkNet comprises an LSTM neural network module, a fully connected layer and a softmax layer connected in sequence.

4. The method according to claim 3, wherein in step S2 the marker sequence S_T is generated as follows:

the preprocessed time series data X_T is input into the LSTM neural network module of the semantic analysis network MarkNet, which processes the input to obtain a hidden state h_t and an internal memory state c_t; the preprocessed time series data X_T and its label Y_T are expressed as:

X_T = {x_1, x_2, ..., x_T}, x_t ∈ R^m, t = 1, 2, ..., T

Y_T = {y_1, y_2, ..., y_T}, y_t ∈ {0, 1}

where x_t denotes the data at time t in the input time series data X_T, T is the length of the time series data, Y_T is the label of the time series data X_T, and y_t is the label corresponding to the data x_t; the LSTM neural network module is computed as:

(h_t, c_t) = f_lstm(x_{t-1}, h_{t-1}, c_{t-1})

the hidden state h_t is input into the fully connected layer to generate an (n+1)-dimensional output, a predicted probability distribution is obtained through the softmax layer, and the outputs s_t obtained from this distribution form the marker sequence S_T:

s_t = f_dense(h_t)

S_T = {s_1, s_2, ..., s_T}, s_t ∈ L = {l_1, l_2, ..., l_n, φ}

5. The method according to claim 4, wherein the MeM module performs state fusion analysis on the input marker sequence S_T using a connectionist classification method constrained by a window and a fusion length limit.

6. The time series anomaly detection method based on state fusion according to claim 5, wherein in step S3 the marker sequence S_U is generated as follows: the marker sequence S_T is input into the MeM module for state fusion analysis; the MeM module deduplicates adjacent, consecutive, identical marker segments whose length does not exceed w, merging each into a single new element, and directly removes the blank marker φ, thereby obtaining a marker sequence S_U that carries the refined information of the T time steps:

S_U = MeM(S_T, w)

7. The time series anomaly detection method based on state fusion according to claim 6, wherein the confidence P(X) output by the anomaly monitoring network DetectNet is calculated as:

P(X_T) = D(S_U, X_T, ε)

where D(·) denotes the DetectNet neural network model.

8. The method according to claim 7, wherein in step S4 the anomaly monitoring network DetectNet performs time series decoding with an auto-encoder, feeds the original time series data into the decoder through a skip connection, and outputs the confidence P(X).

9. The time series anomaly detection method based on state fusion according to any one of claims 1 to 8, characterized by further comprising the following steps: computing accuracy statistics from the confidence P(X) and the time series data label Y_T through a loss function, computing the error gradient, and performing gradient back-propagation and model parameter updates on the semantic analysis network MarkNet and the anomaly monitoring network DetectNet; judging, from the accuracy given by the loss function, whether the current accuracy meets a preset accuracy threshold; if so, finishing the training of the semantic analysis network MarkNet; if not, returning to step S2.

10. The time series anomaly detection method based on state fusion according to claim 9, wherein the step of performing gradient back-propagation and model parameter updates on the semantic analysis network MarkNet and the anomaly monitoring network DetectNet comprises: judging from the computed error gradient whether the semantic analysis network MarkNet needs training; if so, fixing the parameters θ_D of the anomaly monitoring network DetectNet, computing the detection deviation of MarkNet, back-propagating according to the gradient and updating the parameters of MarkNet, thereby strengthening its semantic understanding capability; if not, fixing the parameters θ_M of the semantic analysis network MarkNet, computing the detection deviation of DetectNet, back-propagating according to the gradient and updating the parameters of DetectNet, thereby strengthening its anomaly detection capability.