CN114841250A

CN114841250A - Industrial system production abnormity detection and diagnosis method based on multi-dimensional sensing data

Info

Publication number: CN114841250A
Application number: CN202210375778.5A
Authority: CN
Inventors: 吕明琪; 周丹; 朱添田; 陈铁明
Original assignee: Zhejiang University of Technology ZJUT
Current assignee: Zhejiang University of Technology ZJUT
Priority date: 2022-04-11
Filing date: 2022-04-11
Publication date: 2022-08-02
Also published as: WO2023197617A1

Abstract

The invention discloses a method for detecting and diagnosing production abnormity of an industrial system based on multidimensional sensing data, which comprises the following steps: preprocessing a multi-dimensional sensing data sample, and dividing the preprocessed multi-dimensional sensing data sample into a plurality of sub-samples by adopting a sliding window; an automatic coding machine is adopted, and an abnormal detection model is obtained based on normal sub-sample training in an unsupervised training mode; training a classification model according to the anomaly detection model; and carrying out real-time detection and diagnosis on the production abnormity of the industrial system based on the abnormity detection model and the classification model. The invention solves the problem that the abnormality diagnosis is difficult to carry out under the condition of the abnormality detection of the multi-dimensional sensing data by using a black box model at present.

Description

Industrial system production abnormity detection and diagnosis method based on multi-dimensional sensing data

Technical Field

The invention belongs to the technical field of data mining, and particularly relates to a method for detecting and diagnosing production abnormity of an industrial system based on multi-dimensional sensing data.

Background

The industrial internet aims to achieve automation control and resource allocation of more acute and efficient industrial manufacturing systems while improving the production efficiency of intelligent factories. However, as the industrial internet breaks the boundaries of the network world and the physical world, industrial manufacturing systems are more vulnerable to external malicious activities. In addition, there are inevitable production problems in industrial manufacturing systems, such as equipment failure, performance degradation, quality defects, and the like. If the abnormal conditions such as intrusion, failure and the like in the industrial production cannot be detected in time, the serious loss can be brought to the whole manufacturing system. Therefore, the anomaly detection and diagnosis are basic requirements of the industrial Internet, and have very important significance for intelligent manufacturing enterprises.

With the rapid development of the industrial internet, the modern industrial manufacturing system realizes the sensing and recording of the production running state and process through a sensor, a large amount of industrial production data is accumulated, and a data driving method becomes a mainstream means for anomaly detection. In recent years, deep learning has become a mainstream technique of a data-driven method. However, since the deep learning model is too complex, contains a large number of non-linear transformations, and is a black box as a whole, the prediction result is not interpretable. In the industrial system abnormality detection, the interpretation of the detection result is very important, and is a basis for realizing the diagnosis of the abnormality detection result. For example, the anomaly detection result diagnosis can help locate which device and which time period an anomaly occurred.

The existing methods for explaining the deep learning model all focus on supervised learning models, such as SHAP, LIME and other deep learning interpretable frameworks. However, the industrial production data is very complex, and the cost of manual labeling is too high, so that the obtained industrial production data is basically label-free, and therefore the anomaly detection model needs to be trained in an unsupervised mode. In particular, the existing deep learning interpretable framework can hardly learn the association between the abnormal sample and the semantic feature, so that the deep unsupervised learning model can not be interpreted.

Disclosure of Invention

The invention aims to provide a method for detecting and diagnosing production abnormity of an industrial system based on multi-dimensional sensing data, and improve the accuracy of abnormity diagnosis.

In order to achieve the purpose, the technical scheme adopted by the invention is as follows:

the industrial system production abnormity detection and diagnosis method based on the multi-dimensional sensing data comprises the following steps:

s1, preprocessing the multi-dimensional sensing data sample, and dividing the preprocessed multi-dimensional sensing data sample into a plurality of sub-samples by adopting a sliding window, wherein the sub-samples comprise normal sub-samples and abnormal sub-samples; given multi-dimensional sensing data sample s ∈ R ^N×T S is a two-dimensional matrix, wherein N is the characteristic dimension of s, namely the number of devices contained in the industrial system, and T is the data duration of s, namely the number of sampling points of the sensor;

s2, obtaining an abnormal detection model based on normal sub-sample training in an unsupervised training mode by adopting an automatic coding machine;

s3, training a classification model according to the abnormality detection model, including:

step 31, detecting the subsamples containing the normal subsamples and the abnormal subsamples by using an abnormal detection model, and adding marks to the subsamples according to a detection result to obtain a marked subsample set;

Step 32, assuming that F is a set of N features, and obtaining 2 by taking N features in the set F each time according to the combination of the features ^N -1 feature subset S, N-1, 2, …, N, generating a training subset from the set of labeled sub-samples according to each feature subset, the training subset comprising only features of the feature subset, training a classification model on each training subset in a supervised manner using XGBoost classifier, and obtaining 2 in total ^N -1 classification model;

s4, real-time detection and diagnosis are carried out on the production abnormity of the industrial system based on the abnormity detection model and the classification model, and the method comprises the following steps:

acquiring a real-time subsample to be detected, and finishing if the detection result of the abnormal detection model on the real-time subsample is a normal subsample; otherwise, the classification model is used for sequentially calculating the feature confidence coefficient corresponding to each feature dimension in the N feature dimensions according to the real-time subsamples, and abnormal features are diagnosed according to the feature confidence coefficients, namely, abnormal equipment in the industrial system is positioned.

Several alternatives are provided below, but not as an additional limitation to the above general solution, but merely as a further addition or preference, each alternative being combinable individually for the above general solution or among several alternatives without technical or logical contradictions.

Preferably, the preprocessing the multi-dimensional sensing data sample includes:

filling missing values in a multi-dimensional sensing data sample s by adopting an average value of front and back data;

the multidimensional sensing data sample s is normalized to have data in the range of [0,1 ].

Preferably, the dividing the preprocessed multidimensional sensing data sample into a plurality of sub-samples by using a sliding window includes:

dividing the multi-dimensional sensing data sample s by using a sliding window with the window size W to obtain continuous M sub-samples sS E for R ^N×W 。

Preferably, the network structure of the automatic coding machine comprises an input layer, a coding layer, a semantic layer, a decoding layer and an output layer, wherein:

the input layer: the input is subsamples Ss ∈ R ^N×W ；

The coding layer: two layers of LSTM are used as an encoder, and N-dimensional feature vectors x of W time instants in a subsample ss ₁ ,x ₂ ,…,x _W Sequentially inputting each unit of the first layer of LSTM, sequentially inputting the obtained W hidden vectors into each unit of the second layer of LSTM, and obtaining W hidden vectors h ₁ ,h ₂ ,…,h _W ；

The semantic layer: taking a hidden vector h _W As a low-dimensional semantic vector after encoding;

the decoding layer: two layers of LSTM are adopted as a decoder, and an implicit vector h is generated _W Repeating W times and inputting each unit of the first layer of LSTM in sequence to obtain W hidden vectorsInputting each unit of the second layer LSTM in sequence to obtain W hidden vectors g ₁ ,g ₂ ,…,g _w ；

The output layer: adopting a full connection layer to hide W vectors g ₁ ,g ₂ ,…,g _W Conversion into a vector y conforming to the dimension ss of the subsamples ₁ ,y ₂ ,…,y _W Vector y ₁ ,y ₂ ,…,y _W As output data rss.

Preferably, in the training of the anomaly detection model, the mean square error of the output data rss and the sub-sample ss is used as a loss function, and optimization iteration is performed in a gradient descent mode.

Preferably, the sequentially calculating the feature confidence corresponding to each feature dimension of the N feature dimensions according to the real-time subsample includes:

the feature confidence is computed for feature k as follows:

in the formula, phi _k For feature confidence of feature k, k is 1,2, …, N, CM _S (x _S ) Classification model CM trained for training subsets S corresponding to feature subsets that do not contain feature k _S At subsample x _S The output result is 0 or 1, the subsample x _S For sample data extracted from the real-time subsamples that are identical to the features contained in the feature subset S, CM _S∪{k} (x _S∪{k} ) Classification model CM trained using training subsets corresponding to feature subsets S { k } containing feature k _S∪{k} At subsample x _S∪{k} The output result is 0 or 1, the subsample x _S∪{k} The sample data extracted from the real-time subsamples and identical to the features contained in the feature subset S { k },

representing a feature subset S that does not contain feature k.

Preferably, the diagnosing the abnormal feature according to the feature confidence includes:

firstly, normalizing the confidence degrees of all the features by adopting a Sigmoid function to obtain a weight score, wherein the absolute value of the weight score indicates the influence value of the features on the final detection result, and the detection result is explained by utilizing a SHAP explanation model based on the influence value.

The industrial system production abnormity detection and diagnosis method based on the multidimensional sensing data provided by the invention adopts an automatic coding machine to train an abnormity detection model in an unsupervised mode without providing an abnormity marking sample; the output of the anomaly detection model is used for training the supervised classification model, so that the interpretation and diagnosis of the anomaly detection result are realized on the basis, and the problem that the anomaly diagnosis is difficult to perform under the condition that the multi-dimensional sensing data anomaly detection is performed by using the black box model at present is solved.

Drawings

FIG. 1 is a flow chart of a multi-dimensional sensory data based industrial system production anomaly detection and diagnosis method of the present invention;

FIG. 2 is a diagram of a trellis diagram of the automatic coding machine of the present invention;

FIG. 3 is a schematic diagram of the setup of parameters of each layer of the automatic coding machine according to the present invention;

FIG. 4 is an explanatory diagram of anomaly detection for an anomalous subsample output in accordance with the present invention;

FIG. 5 is an explanatory diagram of the anomaly detection of the normal sub-sample output according to the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention.

In order to solve the problem that abnormality diagnosis is difficult to perform under the condition of multi-dimensional sensing data abnormality detection by using a black box model in the prior art, the embodiment provides an industrial system production abnormality detection and diagnosis method based on multi-dimensional sensing data.

As shown in fig. 1, the embodiment provides a method for detecting and diagnosing production abnormality of an industrial system based on multidimensional sensing data, which includes the following steps:

s1, preprocessing the multi-dimensional sensing data sample, and dividing the preprocessed multi-dimensional sensing data sample into a plurality of sub-samples by adopting a sliding window, wherein the sub-samples comprise normal sub-samples and abnormal sub-samples.

In the embodiment, a given multidimensional sensing data sample s E R ^N×T S is a two-dimensional matrix, where N is the characteristic dimension of s, i.e., the number of devices included in the industrial system, and T is the data duration of s, i.e., the number of sampling points of the sensor. Therefore, the detailed operation of the present embodiment to preprocess the sample data is as follows:

1) data cleaning: and filling missing values in the multi-dimensional sensing data sample s by adopting the average value of the previous data and the next data.

2) Data normalization: the multidimensional sensing data sample s is normalized to have data in the range of [0,1 ].

In the embodiment, when data is divided, a sliding window with the window size W is used for dividing a multi-dimensional sensing data sample s to obtain continuous M sub-samples ss ∈ R ^N×W 。

It should be noted that, in the embodiment, since the normal subsample is required to be used for training the anomaly detection model, the subsamples in the embodiment include the normal subsample and the anomaly subsample, but the subsamples obtained by dividing in step 1 are not labeled, and the normal and anomaly differences correspond to the normal and anomaly of the original data.

And S2, obtaining an abnormal detection model based on normal sub-sample training in an unsupervised training mode by adopting an automatic coding machine.

In the embodiment, an automatic coding machine is adopted to train an anomaly detection model AM, the input of the automatic coding machine is an original sub-sample ss, the original sub-sample is converted into a low-dimensional feature space through an encoder, the low-dimensional feature is output as a reconstructed sub-sample rss through a decoder, and the training aim is to enable ss and rss to be as close as possible. Referring to fig. 2, the network structure of the employed automatic coding machine is as follows:

an input layer: the input is subsample ss ∈ R ^N×W 。

And (3) coding layer: using two layers of LSTM as the encoder, the N-dimensional feature vector x for each time instant (W time instants in total) in the subsample ss ₁ ,x ₂ ,…,x _W (for a total of W times, i.e. x ₁ For an N-dimensional feature vector, other same understanding) are sequentially inputted into each unit of the first layer of LSTM, and the obtained W hidden vectors are sequentially inputted into each unit of the second layer of LSTM, so as to obtain W hidden vectors h ₁ ,h ₂ ,…,h _W 。

And (3) semantic layer: taking a hidden vector h _W As encoded low-dimensional semantic vectors.

A decoding layer: two layers of LSTM are adopted as a decoder, and an implicit vector h is generated _W Repeating W times and inputting each unit of the first layer of LSTM in sequence to obtain W hidden vectors, and inputting each unit of the second layer of LSTM in sequence to obtain W hidden vectors g ₁ ,g ₂ ,…,g _w 。

An output layer: adopting a full connection layer to hide W vectors g ₁ ,g ₂ ,…,g _w Conversion into a vector y conforming to the dimension ss of the subsamples ₁ ,y ₂ ,…,y _W Vector y ₁ ,y ₂ ,…,y _W As output data rss.

In the training process of the anomaly detection model AM, on one hand, in order to minimize the difference between ss and rss, the mean square error of output data rss and a subsample ss is used as a loss function, and on the basis, the model is optimally trained in a gradient descent mode; on the other hand, in order to make the model learn the pattern of the normal subsample, the normal subsample is used for training. The parameter settings of each layer of the automatic coding machine adopted in the present embodiment are shown in fig. 3.

After the training is finished, a real-time sub-sample ss is given and input into a trained automatic coding machine (namely an anomaly detection model AM), and an output reconstructed sub-sample rss is obtained. And calculating the mean square error of ss and rss, if the mean square error is larger than a predefined threshold value, judging the subsample to be abnormal, otherwise, judging the subsample to be normal.

step 31, constructing a labeled sub-sample set: and detecting the subsamples containing the normal subsamples and the abnormal subsamples by using an abnormal detection model, adding marks to the subsamples according to a detection result, marking the detected abnormal subsamples as 1, marking the normal subsamples as 0, and obtaining a marked subsample set LSS.

Step 32, constructing a classification model: assuming that F is a set of N features, taking N features in the set F each time according to the combination of the features to obtain 2 ^N -1 feature subset S, N-1, 2, …, N, generating a training subset comprising only features in the feature subset from the labeled subsample set LSS according to each feature subset, training a classification model on each training subset in a supervised manner using an XGBoost classifier, and obtaining 2 in total ^N -1 classification model.

The number of the subsamples in each training subset is the same as that in the LSS labeled subsample set, and the labels of the subsamples are unchanged.

s41, acquiring a real-time subsample to be detected, and ending if the detection result of the abnormal detection model on the real-time subsample is a normal subsample; otherwise, executing the next step;

and S42, sequentially calculating the feature confidence corresponding to each feature dimension in the N feature dimensions according to the real-time subsamples by using the classification model.

The embodiment calculates the feature confidence corresponding to each feature when the real-time sub-sample is abnormal so as to calculate the feature of the feature kTo illustrate the confidence level, given a feature k, (a feature is one of N features) the confidence level φ for a feature k is evaluated by computing the difference between a classification model using the feature k and a classification model not using the feature k _k Degree of confidence phi _k The larger the feature k is, the higher the importance degree is.

The feature confidence is computed for feature k as follows:

representing a feature subset S that does not contain feature k.

And S43, diagnosing abnormal features according to the feature confidence, namely positioning abnormal equipment in the industrial system.

The abnormity judgment is established on the SHAP interpretation model, and the SHAP interpretation model is realized based on the Shapley value, so the calculation of the feature confidence coefficient in the embodiment is equivalent to the calculation of the Shapley value. And (4) obtaining abnormal characteristics (for example, the influence value is higher than a set threshold) based on the finally obtained influence value, and directly locating the equipment which can cause the industrial system to generate the abnormity because the characteristics correspond to the equipment.

For convenience of observation, the diagnosis result is further visualized in the present embodiment, and referring to fig. 4 and 5, in the figure, f (x) represents the probability that the output of the classification model is an abnormal result, the left side of f (x) represents positive correlation to the abnormal detection result, the right side represents negative correlation to the abnormal result, and the larger the feature region width is, the higher the weight score of the feature is, thereby diagnosing the cause of the abnormality. For example, in fig. 4, the influence values of the features (labels of corresponding devices) such as f4, f6, f1 are high, which indicates that the most probable cause of the abnormality is generated by the devices corresponding to the features.

The technical features of the embodiments described above may be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the embodiments described above are not described, but should be considered as being within the scope of the present specification as long as there is no contradiction between the combinations of the technical features.

The above examples are merely illustrative of several embodiments of the present invention, and the description thereof is more specific and detailed, but not to be construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present invention should be subject to the appended claims.

Claims

1. The industrial system production abnormity detection and diagnosis method based on the multidimensional sensing data is characterized by comprising the following steps:

s1, preprocessing the multi-dimensional sensing data sample, and dividing the preprocessed multi-dimensional sensing data sample into a plurality of sub-samples by adopting a sliding window, wherein the sub-samples comprise normal sub-samples and abnormal sub-samplesThen, the process is carried out; given multi-dimensional sensing data sample s ∈ R ^N×T S is a two-dimensional matrix, wherein N is the characteristic dimension of s, namely the number of devices contained in the industrial system, and T is the data duration of s, namely the number of sampling points of the sensor;

2. The method for detecting and diagnosing production anomalies in an industrial system based on multidimensional sensed data as recited in claim 1, wherein the preprocessing the multidimensional sensed data samples comprises:

3. The method for detecting and diagnosing production abnormality of industrial system based on multi-dimensional sensing data as claimed in claim 1, wherein the dividing the preprocessed multi-dimensional sensing data sample into a plurality of sub-samples by using a sliding window comprises:

dividing the multi-dimensional sensing data sample s by using a sliding window with the window size W to obtain continuous M sub-samples ss ∈ R ^N×W 。

4. The method of claim 3, wherein the network structure of the automatic coding machine comprises an input layer, a coding layer, a semantic layer, a decoding layer and an output layer, wherein:

the input layer: the input is subsample ss ∈ R ^N×W ；

the decoding layer: two layers of LSTM are adopted as a decoder, and an implicit vector h is generated _W Repeating W times and inputting each unit of the first layer of LSTM in sequence to obtain W hidden vectors, and inputting each unit of the second layer of LSTM in sequence to obtain W hidden vectors g ₁ ,g ₂ ,…,g _W ；

5. The method for detecting and diagnosing production abnormalities in an industrial system based on multidimensional sensed data as set forth in claim 4, wherein the mean square error of the output data rss and the subsample ss is used as a loss function in the training of the abnormality detection model, and the optimization iteration is performed in a gradient descent manner.

6. The method for detecting and diagnosing production anomalies of an industrial system based on multidimensional sensed data as recited in claim 1, wherein the sequentially calculating feature confidence corresponding to each of the N feature dimensions from real-time subsamples comprises:

Feature confidence is computed for feature k as follows:

representing a feature subset S that does not contain feature k.

7. The method of claim 1, wherein the diagnosing abnormal features based on feature confidence comprises: