WO2023197617A1 - Method for detecting and diagnosing production abnormality of industrial system on basis of multi-dimensional sensing data - Google Patents

Method for detecting and diagnosing production abnormality of industrial system on basis of multi-dimensional sensing data Download PDF

Info

Publication number
WO2023197617A1
WO2023197617A1 PCT/CN2022/135714 CN2022135714W WO2023197617A1 WO 2023197617 A1 WO2023197617 A1 WO 2023197617A1 CN 2022135714 W CN2022135714 W CN 2022135714W WO 2023197617 A1 WO2023197617 A1 WO 2023197617A1
Authority
WO
WIPO (PCT)
Prior art keywords
feature
sensing data
subsample
anomaly detection
layer
Prior art date
Application number
PCT/CN2022/135714
Other languages
French (fr)
Chinese (zh)
Inventor
吕明琪
陈铁明
朱添田
Original Assignee
浙江工业大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 浙江工业大学 filed Critical 浙江工业大学
Publication of WO2023197617A1 publication Critical patent/WO2023197617A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/088Non-supervised learning, e.g. competitive learning
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Definitions

  • the invention belongs to the technical field of data mining, and specifically relates to an industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data.
  • the Industrial Internet aims to achieve smarter and more efficient automated control and resource allocation of industrial manufacturing systems, while improving the production efficiency of smart factories.
  • the Industrial Internet breaks the boundaries between the online world and the physical world, industrial manufacturing systems are more vulnerable to external malicious behaviors.
  • production problems such as equipment failure, performance degradation, and quality defects are inevitable in industrial manufacturing systems. If abnormal situations such as intrusions and failures in industrial production cannot be detected in time, it may cause serious losses to the entire manufacturing system. Therefore, anomaly detection and diagnosis are basic requirements of the industrial Internet and are of great significance to intelligent manufacturing companies.
  • the purpose of the present invention is to provide an industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data to improve the accuracy of abnormal diagnosis.
  • An industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data includes:
  • Preprocess the multidimensional sensing data samples and use a sliding window to divide the preprocessed multidimensional sensing data samples into several subsamples, where the subsamples include normal subsamples and abnormal subsamples; given multidimensional sensing Data sample s ⁇ R N ⁇ T , s is a two-dimensional matrix, where N is the characteristic dimension of s, that is, the number of devices included in the industrial system, and T is the data duration of s, that is, the number of sampling points of the sensor;
  • Step 31 Use the anomaly detection model to detect subsamples containing normal subsamples and abnormal subsamples, and add labels to the subsamples based on the detection results to obtain a labeled subsample set;
  • the classification model is used to calculate the feature confidence corresponding to each of the N feature dimensions based on the real-time subsample. , and diagnose abnormal features based on feature confidence, that is, locate abnormal equipment in the industrial system.
  • each optional method can be independently implemented for the above-mentioned overall plan.
  • Combination can also be a combination between multiple optional methods.
  • the preprocessing of multi-dimensional sensing data samples includes:
  • the average value of the before and after data is used to fill
  • Multidimensional sensing data samples s are normalized so that the data is in the range of [0,1].
  • the sliding window is used to divide the preprocessed multi-dimensional sensing data samples into several sub-samples, including:
  • the multi-dimensional sensing data sample s is divided using a sliding window with window size W to obtain continuous M sub-samples ss ⁇ R N ⁇ W .
  • the network structure of the automatic encoding machine includes an input layer, a coding layer, a semantic layer, a decoding layer and an output layer, where:
  • the input layer the input is subsample ss ⁇ R N ⁇ W ;
  • the coding layer uses two layers of LSTM as the encoder.
  • the N-dimensional feature vectors x 1 , x 2 ,..., x W at W moments in the subsample ss are input into each unit of the first layer LSTM in sequence, and the obtained W
  • the hidden vectors are then input into each unit of the second layer LSTM in sequence, and W hidden vectors h 1 , h 2 ,..., h W are obtained;
  • the semantic layer takes the latent vector h W as the encoded low-dimensional semantic vector
  • the decoding layer uses two layers of LSTM as the decoder, repeats the hidden vector h W times W and inputs it into each unit of the first layer LSTM in sequence, and the obtained W hidden vectors are then input into each unit of the second layer LSTM in sequence. units, get W hidden vectors g 1 , g 2 ,...,g W ;
  • the output layer uses a fully connected layer to convert W hidden vectors g 1 , g 2 ,..., g W into vectors y 1 , y 2 ,..., y W , vectors y 1 , y consistent with the dimension of the subsample ss. 2 ,...,y W is used as the output data rss.
  • the mean square error of the output data rss and the subsample ss is used as the loss function, and gradient descent is used for optimization iteration.
  • the feature confidence corresponding to each of the N feature dimensions is calculated sequentially based on real-time subsamples, including:
  • the feature confidence is calculated as follows:
  • ⁇ k is the feature confidence of feature k
  • k 1, 2,...,N
  • CM S (x S ) is the classification model CM trained using the training subset corresponding to the feature subset S that does not contain feature k.
  • the output result of S on the subsample x S , the output result is 0 or 1.
  • the subsample x S is the sample data extracted from the real-time subsample with the same features as the feature subset S
  • CM S ⁇ k ⁇ (x S ⁇ k ⁇ ) is the output result of the classification model CM S ⁇ k ⁇ trained on the subsample x S ⁇ ⁇ k ⁇ using the training subset corresponding to the feature subset S ⁇ k ⁇ containing feature k
  • the output result is 0 or 1
  • the subsample x S ⁇ k ⁇ is the sample data extracted from the real-time subsample with the same features contained in the feature subset S ⁇ k ⁇ , Represents the feature subset S that does not contain feature k.
  • diagnosing abnormal features based on feature confidence includes:
  • the Sigmoid function is used to normalize the confidence of all features to obtain the weight score.
  • the absolute value of the weight score indicates the impact of the feature on the final detection result.
  • the SHAP explanation model is used to explain the detection results.
  • the industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data uses an automatic coding machine to train an anomaly detection model in an unsupervised manner without providing anomaly labeled samples; the output of the anomaly detection model is used to train supervised
  • the classification model realizes the interpretation and diagnosis of anomaly detection results on this basis, solving the problem of difficulty in anomaly diagnosis in the current black box model for anomaly detection of multi-dimensional sensing data.
  • Figure 1 is a flow chart of the industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data according to the present invention
  • Figure 2 is a grid structure diagram of the automatic encoding machine of the present invention.
  • Figure 3 is a schematic diagram of parameter settings for each layer of the automatic encoding machine of the present invention.
  • Figure 4 is an explanation diagram of abnormality detection for abnormal subsample output according to the present invention.
  • Figure 5 is an explanation diagram of abnormality detection for normal subsample output according to the present invention.
  • this embodiment provides a method for industrial system production anomaly detection and diagnosis based on multi-dimensional sensing data.
  • this embodiment proposes an industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data, which includes the following steps:
  • Preprocess the multi-dimensional sensing data samples and use a sliding window to divide the pre-processed multi-dimensional sensing data samples into several sub-samples, where the sub-samples include normal sub-samples and abnormal sub-samples.
  • a multi-dimensional sensing data sample s ⁇ R N ⁇ T is given, s is a two-dimensional matrix, where N is the characteristic dimension of s, that is, the number of devices included in the industrial system, and T is the data duration of s , that is, the number of sampling points of the sensor. Therefore, the detailed operations of preprocessing sample data in this embodiment are as follows:
  • a sliding window with a window size of W is used to divide the multi-dimensional sensing data sample s, and M continuous sub-samples ss ⁇ R N ⁇ W are obtained.
  • the subsamples in this embodiment include normal subsamples and abnormal subsamples, but the subsamples divided in step 1 are not marked. , the distinction between normality and abnormality corresponds to the normality and abnormality of the original data taken.
  • an automatic encoding machine is used to train the anomaly detection model AM.
  • the input of the automatic encoding machine is the original sub-sample ss.
  • the original sub-sample is first converted into a low-dimensional feature space through the encoder, and then the low-dimensional features are output into a heavy feature space through the decoder. Construct a subsample rss, and the training goal is to make ss and rss as close as possible.
  • the network structure of the automatic encoding machine used is as follows:
  • Input layer The input is subsample ss ⁇ R N ⁇ W .
  • the N-dimensional feature vectors x 1 , x 2 ,..., x W (a total of W moments) in the subsample ss at each moment (W moments in total), that is, x 1 is one N-dimensional feature vectors (others are understood in the same way) are input into each unit of the first layer LSTM in sequence, and the W hidden vectors obtained are input into each unit of the second layer LSTM in order, and W hidden vectors h 1 , h are obtained. 2 ,...,h W .
  • Semantic layer Take the latent vector h W as the encoded low-dimensional semantic vector.
  • Decoding layer Use two layers of LSTM as the decoder, repeat the hidden vector h W times W and input it into each unit of the first layer LSTM in sequence, and the obtained W hidden vectors are then input into each unit of the second layer LSTM in sequence. , get W hidden vectors g 1 , g 2 ,...,g W .
  • Output layer Use a fully connected layer to convert W hidden vectors g 1 , g 2 ,..., g W into vectors y 1 , y 2 ,..., y W , vectors y 1 , y 2 , consistent with the dimension of the subsample ss. ...,y W as the output data rss.
  • the mean square error of the output data rss and the sub-sample ss is used as the loss function, and on this basis, the gradient descent method is used to perform the model Optimize training; on the other hand, in order for the model to learn the pattern of normal subsamples, all normal subsamples are used for training.
  • the parameter settings of each layer of the automatic encoding machine used in this embodiment are shown in Figure 3.
  • the training After the training is completed, given a real-time subsample ss, input it into the trained automatic encoding machine (i.e., anomaly detection model AM) to obtain the output reconstructed subsample rss. Calculate the mean square error between ss and rss. If the mean square error is greater than the predefined threshold, the subsample is determined to be abnormal, otherwise it is determined to be normal.
  • the trained automatic encoding machine i.e., anomaly detection model AM
  • Step 31 Construction of labeled subsample set: Use the anomaly detection model to detect the subsamples containing normal subsamples and abnormal subsamples, and add labels to the subsamples based on the detection results. Mark the subsamples with detected abnormalities as 1. Normal subsamples are labeled as 0, and the labeled subsample set LSS is obtained.
  • the number of subsamples in each training subset is the same as the number of subsamples in the labeled subsample set LSS, and the labeling of each subsample remains unchanged.
  • This embodiment calculates the feature confidence corresponding to each feature when the real-time subsample is abnormal.
  • the calculation of the feature confidence of feature k is used as an example. Given feature k, (the feature is one of N features), by calculating The difference between the classification model using feature k and the classification model not using feature k is used to evaluate the confidence ⁇ k of feature k. The greater the confidence ⁇ k , the higher the importance of feature k.
  • the feature confidence is calculated as follows:
  • ⁇ k is the feature confidence of feature k
  • k 1, 2,...,N
  • CM S (x S ) is the classification model CM trained using the training subset corresponding to the feature subset S that does not contain feature k.
  • the output result of S on the subsample x S , the output result is 0 or 1.
  • the subsample x S is the sample data extracted from the real-time subsample with the same features as the feature subset S
  • CM S ⁇ k ⁇ (x S ⁇ k ⁇ ) is the output result of the classification model CM S ⁇ k ⁇ trained on the subsample x S ⁇ ⁇ k ⁇ using the training subset corresponding to the feature subset S ⁇ k ⁇ containing feature k
  • the output result is 0 or 1
  • the subsample x S ⁇ k ⁇ is the sample data extracted from the real-time subsample with the same features as the feature subset S ⁇ k ⁇ , Represents the feature subset S that does not contain feature k.
  • the Sigmoid function is used to normalize the confidence of all features to obtain the weight score.
  • the absolute value of the weight score indicates the impact of the feature on the final detection result.
  • the SHAP explanation model is used to explain the detection results.
  • the abnormality judgment in this application is based on the SHAP interpretation model, which is implemented based on the Shapley value. Therefore, the calculation of the feature confidence in this embodiment is equivalent to the calculation of the Shapley value.
  • Abnormal features are obtained based on the final impact value (for example, the impact value is higher than the set threshold). Since the features correspond to the equipment, equipment that may cause anomalies in the industrial system can also be directly located.
  • this embodiment further visualizes the diagnosis results.
  • f(x) represents the probability that the output of the classification model is an abnormal result.
  • the left side of f(x) represents a positive correlation with the abnormal detection results.
  • the right side indicates a negative correlation with the abnormal results, and the larger the width of the feature area, the higher the weight score of the feature, thereby diagnosing the cause of the abnormality.
  • features such as f4, f6, and f1 (corresponding to device numbers) have relatively high impact values, indicating that the cause of the anomaly is most likely caused by the devices corresponding to these features.

Abstract

A method for detecting and diagnosing a production abnormality of an industrial system on the basis of multi-dimensional sensing data. The method comprises: pre-processing a multi-dimensional sensing data sample, and dividing the pre-processed multi-dimensional sensing data sample into several sub-samples by using a sliding window; by using an automatic encoder, obtaining an abnormality detection model by means of unsupervised training and on the basis of training normal sub-samples; training a classification model according to the abnormality detection model; and performing real-time detection and diagnosis on a production abnormality of an industrial system on the basis of the abnormality detection model and the classification model. By using the method, the problem of it being difficult to perform abnormality diagnosis for abnormality detection of multi-dimensional sensing data by using a black-box model at present is solved.

Description

基于多维传感数据的工业系统生产异常检测与诊断方法Industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data 技术领域Technical field
本发明属于数据挖掘技术领域,具体涉及一种基于多维传感数据的工业系统生产异常检测与诊断方法。The invention belongs to the technical field of data mining, and specifically relates to an industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data.
背景技术Background technique
工业互联网旨在实现更敏锐、更高效的工业制造系统的自动化控制和资源分配,同时提高智能工厂的生产效率。然而,由于工业互联网打破了网络世界和物理世界的边界,使得工业制造系统更容易受到外部恶意行为的侵袭。此外,工业制造系统中不可避免的存在设备故障、性能下降、质量缺陷等生产问题。如果工业生产中的入侵、故障等异常情况不能被及时检测出来,将可能给整个制造体系带来严重的损失。因此,异常检测与诊断是工业互联网的基本要求,对智能制造企业具有十分重要的意义。The Industrial Internet aims to achieve smarter and more efficient automated control and resource allocation of industrial manufacturing systems, while improving the production efficiency of smart factories. However, because the Industrial Internet breaks the boundaries between the online world and the physical world, industrial manufacturing systems are more vulnerable to external malicious behaviors. In addition, production problems such as equipment failure, performance degradation, and quality defects are inevitable in industrial manufacturing systems. If abnormal situations such as intrusions and failures in industrial production cannot be detected in time, it may cause serious losses to the entire manufacturing system. Therefore, anomaly detection and diagnosis are basic requirements of the industrial Internet and are of great significance to intelligent manufacturing companies.
随着工业互联网的快速发展,现代化工业制造系统通过传感器,实现了对生产运行状态和过程的感知和记录,积累了大量的工业生产数据,数据驱动方法成为异常检测的主流手段。近年来,深度学习逐渐成为数据驱动方法的主流技术。然而,由于深度学习模型过于复杂、包含了大量的非线性变换,总体上是一个黑盒,其预测结果是不可解释的。在工业系统异常检测中,对检测结果的解释非常重要,是实现对异常检测结果诊断的基础。例如,异常检测结果诊断可以帮助定位哪个设备、哪个时间段发生了异常。With the rapid development of the industrial Internet, modern industrial manufacturing systems have realized the perception and recording of production operating status and processes through sensors, and accumulated a large amount of industrial production data. Data-driven methods have become the mainstream means of anomaly detection. In recent years, deep learning has gradually become a mainstream technology for data-driven methods. However, because the deep learning model is too complex and contains a large number of nonlinear transformations, it is generally a black box, and its prediction results are uninterpretable. In abnormal detection of industrial systems, the interpretation of detection results is very important and is the basis for diagnosis of abnormal detection results. For example, anomaly detection result diagnosis can help locate which device and which time period an anomaly occurred.
现有对深度学习模型进行解释的方法都专注于有监督学习模型,如SHAP、LIME等深度学习可解释框架。但由于工业生产数据十分复杂,人工标注的代价过大,导致获取到的工业生产数据基本都是无标注的,因此异常检测模型需要以无监督的方式训练。特别是自动编码机等新型的深度无监督学习模型,几乎无法让现有的深度学习可解释框架学习到异常样本与语义特征之间的关联,导致无法对深度无监督学习模型进行解释。Existing methods for interpreting deep learning models focus on supervised learning models, such as deep learning interpretable frameworks such as SHAP and LIME. However, due to the complexity of industrial production data and the high cost of manual annotation, the obtained industrial production data are basically unlabeled. Therefore, the anomaly detection model needs to be trained in an unsupervised manner. Especially for new deep unsupervised learning models such as automatic encoding machines, it is almost impossible for existing deep learning explainable frameworks to learn the association between abnormal samples and semantic features, making it impossible to explain deep unsupervised learning models.
发明内容Contents of the invention
本发明的目的在于提供一种基于多维传感数据的工业系统生产异常检测与 诊断方法,提高异常诊断准确性。The purpose of the present invention is to provide an industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data to improve the accuracy of abnormal diagnosis.
为实现上述目的,本发明所采取的技术方案为:In order to achieve the above objects, the technical solutions adopted by the present invention are:
一种基于多维传感数据的工业系统生产异常检测与诊断方法,所述基于多维传感数据的工业系统生产异常检测与诊断方法,包括:An industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data. The industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data includes:
S1、对多维传感数据样本进行预处理,并采用滑动窗口将预处理后的多维传感数据样本划分为若干个子样本,所述子样本包含正常子样本和异常子样本;给定多维传感数据样本s∈R N×T,s为一个二维矩阵,其中N为s的特征维度,即工业系统中所包含的设备个数,T为s的数据时长,即传感器的采样点个数; S1. Preprocess the multidimensional sensing data samples, and use a sliding window to divide the preprocessed multidimensional sensing data samples into several subsamples, where the subsamples include normal subsamples and abnormal subsamples; given multidimensional sensing Data sample s∈R N×T , s is a two-dimensional matrix, where N is the characteristic dimension of s, that is, the number of devices included in the industrial system, and T is the data duration of s, that is, the number of sampling points of the sensor;
S2、采用自动编码机,以无监督训练方式基于正常子样本训练得到异常检测模型;S2. Use an automatic encoding machine to train an anomaly detection model based on normal subsamples in an unsupervised training method;
S3、根据异常检测模型训练分类模型,包括:S3. Train the classification model based on the anomaly detection model, including:
步骤31、利用异常检测模型对含有正常子样本和异常子样本的子样本进行检测,并根据检测结果对子样本添加标记,得到有标注子样本集;Step 31: Use the anomaly detection model to detect subsamples containing normal subsamples and abnormal subsamples, and add labels to the subsamples based on the detection results to obtain a labeled subsample set;
步骤32、假定F为N个特征的集合,根据特征的组合每次取集合F中的n个特征得到2 N-1个特征子集S,n=1,2,…,N,根据每个特征子集从有标注子样本集中生成一个仅包含特征子集中的特征的训练子集,在每个训练子集上采用XGBoost分类器以有监督的方式训练一个分类模型,共得到2 N-1个分类模型; Step 32. Assume that F is a set of N features. According to the combination of features, take n features in the set F each time to obtain 2 N -1 feature subsets S, n = 1, 2,..., N. According to each The feature subset generates a training subset containing only the features in the feature subset from the labeled subsample set, and uses the XGBoost classifier to train a classification model in a supervised manner on each training subset, resulting in a total of 2 N -1 a classification model;
S4、基于异常检测模型和分类模型对工业系统生产异常进行实时检测与诊断,包括:S4. Real-time detection and diagnosis of industrial system production anomalies based on anomaly detection models and classification models, including:
获取待检测的实时子样本,若异常检测模型对实时子样本的检测结果为正常子样本则结束;否则利用分类模型根据实时子样本依次计算N个特征维度中每个特征维度对应的特征置信度,并根据特征置信度诊断异常特征,即定位工业系统中的异常设备。Obtain the real-time subsample to be detected. If the detection result of the real-time subsample by the anomaly detection model is a normal subsample, it ends; otherwise, the classification model is used to calculate the feature confidence corresponding to each of the N feature dimensions based on the real-time subsample. , and diagnose abnormal features based on feature confidence, that is, locate abnormal equipment in the industrial system.
以下还提供了若干可选方式,但并不作为对上述总体方案的额外限定,仅仅是进一步的增补或优选,在没有技术或逻辑矛盾的前提下,各可选方式可单独针对上述总体方案进行组合,还可以是多个可选方式之间进行组合。Several optional methods are also provided below, but they are not used as additional limitations on the above-mentioned overall plan. They are only further additions or preferences. On the premise that there are no technical or logical contradictions, each optional method can be independently implemented for the above-mentioned overall plan. Combination can also be a combination between multiple optional methods.
作为优选,所述对多维传感数据样本进行预处理,包括:Preferably, the preprocessing of multi-dimensional sensing data samples includes:
对多维传感数据样本s中的缺失值,采用前后数据的平均值进行填充;For the missing values in the multi-dimensional sensing data sample s, the average value of the before and after data is used to fill;
对多维传感数据样本s进行标准化处理使数据在[0,1]的范围内。Multidimensional sensing data samples s are normalized so that the data is in the range of [0,1].
作为优选,所述采用滑动窗口将预处理后的多维传感数据样本划分为若干 个子样本,包括:Preferably, the sliding window is used to divide the preprocessed multi-dimensional sensing data samples into several sub-samples, including:
使用窗口大小为W的滑动窗口对多维传感数据样本s进行划分,得到连续的M个子样本ss∈R N×WThe multi-dimensional sensing data sample s is divided using a sliding window with window size W to obtain continuous M sub-samples ss∈R N×W .
作为优选,所述自动编码机的网络结构包括输入层、编码层、语义层、解码层和输出层,其中:Preferably, the network structure of the automatic encoding machine includes an input layer, a coding layer, a semantic layer, a decoding layer and an output layer, where:
所述输入层:输入为子样本ss∈R N×WThe input layer: the input is subsample ss∈R N×W ;
所述编码层:采用两层LSTM作为编码器,子样本ss中W个时刻的N维特征向量x 1,x 2,…,x W按顺序输入第一层LSTM的每个单元,得到的W个隐向量再按顺序输入第二层LSTM的每个单元,得到W个隐向量h 1,h 2,…,h WThe coding layer: uses two layers of LSTM as the encoder. The N-dimensional feature vectors x 1 , x 2 ,..., x W at W moments in the subsample ss are input into each unit of the first layer LSTM in sequence, and the obtained W The hidden vectors are then input into each unit of the second layer LSTM in sequence, and W hidden vectors h 1 , h 2 ,..., h W are obtained;
所述语义层:取隐向量h W作为编码后的低维语义向量; The semantic layer: takes the latent vector h W as the encoded low-dimensional semantic vector;
所述解码层:采用两层LSTM作为解码器,将隐向量h W重复W次并按顺序输入第一层LSTM的每个单元,得到的W个隐向量再按顺序输入第二层LSTM的每个单元,得到W个隐向量g 1,g 2,…,g WThe decoding layer: uses two layers of LSTM as the decoder, repeats the hidden vector h W times W and inputs it into each unit of the first layer LSTM in sequence, and the obtained W hidden vectors are then input into each unit of the second layer LSTM in sequence. units, get W hidden vectors g 1 , g 2 ,...,g W ;
所述输出层:采用全连接层将W个隐向量g 1,g 2,…,g W转换为与子样本ss维度一致的向量y 1,y 2,…,y W,向量y 1,y 2,…,y W作为输出数据rss。 The output layer: uses a fully connected layer to convert W hidden vectors g 1 , g 2 ,..., g W into vectors y 1 , y 2 ,..., y W , vectors y 1 , y consistent with the dimension of the subsample ss. 2 ,…,y W is used as the output data rss.
作为优选,在异常检测模型的训练中采用输出数据rss与子样本ss的均方误差作为损失函数,并采用梯度下降的方式进行优化迭代。As a preferred option, in the training of the anomaly detection model, the mean square error of the output data rss and the subsample ss is used as the loss function, and gradient descent is used for optimization iteration.
作为优选,所述根据实时子样本依次计算N个特征维度中每个特征维度对应的特征置信度,包括:Preferably, the feature confidence corresponding to each of the N feature dimensions is calculated sequentially based on real-time subsamples, including:
对于特征k计算特征置信度如下:For feature k, the feature confidence is calculated as follows:
Figure PCTCN2022135714-appb-000001
Figure PCTCN2022135714-appb-000001
式中,φ k为特征k的特征置信度,k=1,2,…,N,CM S(x S)为使用不含有特征k的特征子集S对应的训练子集训练的分类模型CM S在子样本x S上的输出结果,输出结果为0或1,子样本x S为从实时子样本中提取的与特征子集S所包含的特征相同的样本数据,CM S∪{k}(x S∪{k})为使用含有特征k的特征子集S∪{k}对应的训练子集训练的分类模型CM S∪{k}在子样本x S∪{k}上的输出结果,输出结果为0或1,子样本x S∪{k}为从实时子样本中提取的与特征子集S∪{k}所包含的特征相同的样本数据,
Figure PCTCN2022135714-appb-000002
表示不含有特征k的特征子集S。
In the formula, φ k is the feature confidence of feature k, k = 1, 2,...,N, CM S (x S ) is the classification model CM trained using the training subset corresponding to the feature subset S that does not contain feature k. The output result of S on the subsample x S , the output result is 0 or 1. The subsample x S is the sample data extracted from the real-time subsample with the same features as the feature subset S, CM S∪{k} (x S∪{k} ) is the output result of the classification model CM S∪{k} trained on the subsample x S∪ {k} using the training subset corresponding to the feature subset S∪{k} containing feature k , the output result is 0 or 1, and the subsample x S∪{k} is the sample data extracted from the real-time subsample with the same features contained in the feature subset S∪{k},
Figure PCTCN2022135714-appb-000002
Represents the feature subset S that does not contain feature k.
作为优选,所述根据特征置信度诊断异常特征,包括:Preferably, diagnosing abnormal features based on feature confidence includes:
先采用Sigmoid函数对所有特征置信度进行归一化得到权重分数,权重分数的绝对值表明了该特征对最终检测结果的影响值,基于影响值利用SHAP解释模型对检测结果进行解释。First, the Sigmoid function is used to normalize the confidence of all features to obtain the weight score. The absolute value of the weight score indicates the impact of the feature on the final detection result. Based on the impact value, the SHAP explanation model is used to explain the detection results.
本发明提供的基于多维传感数据的工业系统生产异常检测与诊断方法,采用自动编码机、以无监督的方式训练异常检测模型,无需提供异常标注样本;利用异常检测模型的输出训练有监督的分类模型,在此基础上实现对异常检测结果的解释和诊断,解决了当前以黑盒模型针对多维传感数据异常检测的情况下难以进行异常诊断的问题。The industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data provided by the present invention uses an automatic coding machine to train an anomaly detection model in an unsupervised manner without providing anomaly labeled samples; the output of the anomaly detection model is used to train supervised The classification model realizes the interpretation and diagnosis of anomaly detection results on this basis, solving the problem of difficulty in anomaly diagnosis in the current black box model for anomaly detection of multi-dimensional sensing data.
附图说明Description of the drawings
图1为本发明的基于多维传感数据的工业系统生产异常检测与诊断方法的流程图;Figure 1 is a flow chart of the industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data according to the present invention;
图2为本发明自动编码机的网格结构图;Figure 2 is a grid structure diagram of the automatic encoding machine of the present invention;
图3为本发明自动编码机各层参数设置示意图;Figure 3 is a schematic diagram of parameter settings for each layer of the automatic encoding machine of the present invention;
图4为本发明针对异常子样本输出的异常检测解释图;Figure 4 is an explanation diagram of abnormality detection for abnormal subsample output according to the present invention;
图5为本发明针对正常子样本输出的异常检测解释图。Figure 5 is an explanation diagram of abnormality detection for normal subsample output according to the present invention.
具体实施方式Detailed ways
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, of the embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts fall within the scope of protection of the present invention.
除非另有定义,本文所使用的所有的技术和科学术语与属于本发明的技术领域的技术人员通常理解的含义相同。本文中在本发明的说明书中所使用的术语只是为了描述具体的实施例的目的,不是在于限制本发明。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the technical field to which the invention belongs. The terminology used herein in the description of the present invention is for the purpose of describing specific embodiments only and is not intended to limit the present invention.
为了解决现有技术中以黑盒模型针对多维传感数据异常检测的情况下难以进行异常诊断的问题,本实施例提供一种基于多维传感数据的工业系统生产异常检测与诊断方法。In order to solve the problem in the prior art that it is difficult to perform abnormal diagnosis when detecting multi-dimensional sensing data anomalies using a black box model, this embodiment provides a method for industrial system production anomaly detection and diagnosis based on multi-dimensional sensing data.
如图1所示,本实施例提出基于多维传感数据的工业系统生产异常检测与诊断方法,包括以下步骤:As shown in Figure 1, this embodiment proposes an industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data, which includes the following steps:
S1、对多维传感数据样本进行预处理,并采用滑动窗口将预处理后的多维 传感数据样本划分为若干个子样本,所述子样本包含正常子样本和异常子样本。S1. Preprocess the multi-dimensional sensing data samples, and use a sliding window to divide the pre-processed multi-dimensional sensing data samples into several sub-samples, where the sub-samples include normal sub-samples and abnormal sub-samples.
本实施例中给定多维传感数据样本s∈R N×T,s为一个二维矩阵,其中N为s的特征维度,即工业系统中所包含的设备个数,T为s的数据时长,即传感器的采样点个数。因此,本实施例对样本数据进行预处理的详细操作如下: In this embodiment, a multi-dimensional sensing data sample s∈R N×T is given, s is a two-dimensional matrix, where N is the characteristic dimension of s, that is, the number of devices included in the industrial system, and T is the data duration of s , that is, the number of sampling points of the sensor. Therefore, the detailed operations of preprocessing sample data in this embodiment are as follows:
1)数据清洗:对多维传感数据样本s中的缺失值,采用前后数据的平均值进行填充。1) Data cleaning: The missing values in the multi-dimensional sensing data samples s are filled with the average value of the before and after data.
2)数据标准化:对多维传感数据样本s进行标准化处理使数据在[0,1]的范围内。2) Data standardization: Standardize the multi-dimensional sensing data samples s so that the data is within the range of [0,1].
本实施例对数据进行划分时,使用窗口大小为W的滑动窗口对多维传感数据样本s进行划分,得到连续的M个子样本ss∈R N×WWhen dividing the data in this embodiment, a sliding window with a window size of W is used to divide the multi-dimensional sensing data sample s, and M continuous sub-samples ss∈R N×W are obtained.
需要说明的是,由于本实施例异常检测模型的训练需要使用正常子样本,因此本实施例中的子样本包含正常子样本和异常子样本,但步骤1中划分得到的子样本并未被标记,其正常和异常的区分对应于所取的原始数据的正常和异常。It should be noted that since the training of the anomaly detection model in this embodiment requires the use of normal subsamples, the subsamples in this embodiment include normal subsamples and abnormal subsamples, but the subsamples divided in step 1 are not marked. , the distinction between normality and abnormality corresponds to the normality and abnormality of the original data taken.
S2、采用自动编码机,以无监督训练方式基于正常子样本训练得到异常检测模型。S2. Use an automatic encoding machine to train an anomaly detection model based on normal subsamples in an unsupervised training method.
本实施例采用自动编码机训练异常检测模型AM,自动编码机的输入为原始子样本ss,先通过编码器将原始子样本转换到低维特征空间,再通过解码器将低维特征输出为重构子样本rss,训练目标是使ss和rss尽可能接近。参照图2所示,采用的自动编码机的网络结构如下:In this embodiment, an automatic encoding machine is used to train the anomaly detection model AM. The input of the automatic encoding machine is the original sub-sample ss. The original sub-sample is first converted into a low-dimensional feature space through the encoder, and then the low-dimensional features are output into a heavy feature space through the decoder. Construct a subsample rss, and the training goal is to make ss and rss as close as possible. Referring to Figure 2, the network structure of the automatic encoding machine used is as follows:
输入层:输入为子样本ss∈R N×WInput layer: The input is subsample ss∈R N×W .
编码层:采用两层LSTM作为编码器,子样本ss中每个时刻(共W个时刻)的N维特征向量x 1,x 2,…,x W(共W个时刻,即x 1为一个N维特征向量,其他同理理解)按顺序输入第一层LSTM的每个单元,得到的W个隐向量再按顺序输入第二层LSTM的每个单元,得到W个隐向量h 1,h 2,…,h WCoding layer: Two layers of LSTM are used as the encoder. The N-dimensional feature vectors x 1 , x 2 ,..., x W (a total of W moments) in the subsample ss at each moment (W moments in total), that is, x 1 is one N-dimensional feature vectors (others are understood in the same way) are input into each unit of the first layer LSTM in sequence, and the W hidden vectors obtained are input into each unit of the second layer LSTM in order, and W hidden vectors h 1 , h are obtained. 2 ,…,h W .
语义层:取隐向量h W作为编码后的低维语义向量。 Semantic layer: Take the latent vector h W as the encoded low-dimensional semantic vector.
解码层:采用两层LSTM作为解码器,将隐向量h W重复W次并按顺序输入第一层LSTM的每个单元,得到的W个隐向量再按顺序输入第二层LSTM的每个单元,得到W个隐向量g 1,g 2,…,g WDecoding layer: Use two layers of LSTM as the decoder, repeat the hidden vector h W times W and input it into each unit of the first layer LSTM in sequence, and the obtained W hidden vectors are then input into each unit of the second layer LSTM in sequence. , get W hidden vectors g 1 , g 2 ,...,g W .
输出层:采用全连接层将W个隐向量g 1,g 2,…,g W转换为与子样本ss维度一致的向量y 1,y 2,…,y W,向量y 1,y 2,…,y W作为输出数据rss。 Output layer: Use a fully connected layer to convert W hidden vectors g 1 , g 2 ,..., g W into vectors y 1 , y 2 ,..., y W , vectors y 1 , y 2 , consistent with the dimension of the subsample ss. …,y W as the output data rss.
在异常检测模型AM的训练过程中,一方面为了最小化ss和rss的差异,使用输出数据rss与子样本ss的均方误差作为损失函数,并在此基础上采用梯度下降的方式对模型进行优化训练;另一方面为了让模型学习到正常子样本的模式,用于训练的均为正常子样本。本实施例采用的自动编码机的各层参数设置如图3所示。During the training process of the anomaly detection model AM, on the one hand, in order to minimize the difference between ss and rss, the mean square error of the output data rss and the sub-sample ss is used as the loss function, and on this basis, the gradient descent method is used to perform the model Optimize training; on the other hand, in order for the model to learn the pattern of normal subsamples, all normal subsamples are used for training. The parameter settings of each layer of the automatic encoding machine used in this embodiment are shown in Figure 3.
训练完成后,给定一个实时子样本ss,将其输入训练好的自动编码机(即异常检测模型AM),得到输出的重构子样本rss。计算ss与rss的均方误差,若均方误差大于预定义的阈值,则判定该子样本为异常,反之则判定为正常。After the training is completed, given a real-time subsample ss, input it into the trained automatic encoding machine (i.e., anomaly detection model AM) to obtain the output reconstructed subsample rss. Calculate the mean square error between ss and rss. If the mean square error is greater than the predefined threshold, the subsample is determined to be abnormal, otherwise it is determined to be normal.
S3、根据异常检测模型训练分类模型,包括:S3. Train the classification model based on the anomaly detection model, including:
步骤31、有标注子样本集构建:利用异常检测模型对含有正常子样本和异常子样本的子样本进行检测,并根据检测结果对子样本添加标记,将检测出异常的子样本标注为1,正常的子样本标注为0,得到有标注子样本集LSS。Step 31. Construction of labeled subsample set: Use the anomaly detection model to detect the subsamples containing normal subsamples and abnormal subsamples, and add labels to the subsamples based on the detection results. Mark the subsamples with detected abnormalities as 1. Normal subsamples are labeled as 0, and the labeled subsample set LSS is obtained.
步骤32、分类模型构建:假定F为N个特征的集合,根据特征的组合每次取集合F中的n个特征得到2 N-1个特征子集S,n=1,2,…,N,根据每个特征子集从有标注子样本集LSS中生成一个仅包含特征子集中的特征的训练子集,在每个训练子集上采用XGBoost分类器以有监督的方式训练一个分类模型,共得到2 N-1个分类模型。 Step 32. Classification model construction: Assume that F is a set of N features. According to the combination of features, take n features from the set F each time to obtain 2 N -1 feature subsets S, n=1, 2,...,N , according to each feature subset, a training subset containing only features in the feature subset is generated from the labeled subsample set LSS, and an XGBoost classifier is used on each training subset to train a classification model in a supervised manner. A total of 2 N -1 classification models were obtained.
每个训练子集中的子样本数与有标注子样本集LSS中的子样本数相同,且各子样本的标注不变。The number of subsamples in each training subset is the same as the number of subsamples in the labeled subsample set LSS, and the labeling of each subsample remains unchanged.
S4、基于异常检测模型和分类模型对工业系统生产异常进行实时检测与诊断,包括:S4. Real-time detection and diagnosis of industrial system production anomalies based on anomaly detection models and classification models, including:
S41、获取待检测的实时子样本,若异常检测模型对实时子样本的检测结果为正常子样本则结束;否则执行下一步;S41. Obtain the real-time subsample to be detected. If the detection result of the real-time subsample by the anomaly detection model is a normal subsample, the end is completed; otherwise, proceed to the next step;
S42、利用分类模型根据实时子样本依次计算N个特征维度中每个特征维度对应的特征置信度。S42. Use the classification model to sequentially calculate the feature confidence corresponding to each of the N feature dimensions based on the real-time subsamples.
本实施例在实时子样本异常时计算每个特征对应的特征置信度,以计算特征k的特征置信度为例进行说明,给定特征k,(特征为N个特征中的一个特征)通过计算使用特征k的分类模型和不使用特征k的分类模型之间的差异,来评估特征k的置信度φ k,置信度φ k越大,特征k的重要程度越高。 This embodiment calculates the feature confidence corresponding to each feature when the real-time subsample is abnormal. The calculation of the feature confidence of feature k is used as an example. Given feature k, (the feature is one of N features), by calculating The difference between the classification model using feature k and the classification model not using feature k is used to evaluate the confidence φ k of feature k. The greater the confidence φ k , the higher the importance of feature k.
对于特征k计算特征置信度如下:For feature k, the feature confidence is calculated as follows:
Figure PCTCN2022135714-appb-000003
Figure PCTCN2022135714-appb-000003
式中,φ k为特征k的特征置信度,k=1,2,…,N,CM S(x S)为使用不含有特征k的特征子集S对应的训练子集训练的分类模型CM S在子样本x S上的输出结果,输出结果为0或1,子样本x S为从实时子样本中提取的与特征子集S所包含的特征相同的样本数据,CM S∪{k}(x S∪{k})为使用含有特征k的特征子集S∪{k}对应的训练子集训练的分类模型CM S∪{k}在子样本x S∪{k}上的输出结果,输出结果为0或1,子样本x S∪{k}为从实时子样本中提取的与特征子集S∪{k}所包含的特征相同的样本数据,
Figure PCTCN2022135714-appb-000004
表示不含有特征k的特征子集S。
In the formula, φ k is the feature confidence of feature k, k = 1, 2,...,N, CM S (x S ) is the classification model CM trained using the training subset corresponding to the feature subset S that does not contain feature k. The output result of S on the subsample x S , the output result is 0 or 1. The subsample x S is the sample data extracted from the real-time subsample with the same features as the feature subset S, CM S∪{k} (x S∪{k} ) is the output result of the classification model CM S∪{k} trained on the subsample x S∪ {k} using the training subset corresponding to the feature subset S∪{k} containing feature k , the output result is 0 or 1, and the subsample x S∪{k} is the sample data extracted from the real-time subsample with the same features as the feature subset S∪{k},
Figure PCTCN2022135714-appb-000004
Represents the feature subset S that does not contain feature k.
S43、根据特征置信度诊断异常特征,即定位工业系统中的异常设备。S43. Diagnose abnormal features based on feature confidence, that is, locate abnormal equipment in the industrial system.
先采用Sigmoid函数对所有特征置信度进行归一化得到权重分数,权重分数的绝对值表明了该特征对最终检测结果的影响值,基于影响值利用SHAP解释模型对检测结果进行解释。First, the Sigmoid function is used to normalize the confidence of all features to obtain the weight score. The absolute value of the weight score indicates the impact of the feature on the final detection result. Based on the impact value, the SHAP explanation model is used to explain the detection results.
本申请异常判断建立在SHAP解释模型之上,SHAP解释模型基于Shapley值实现,因此本实施例中特征置信度的计算相当于Shapley值的计算。基于最终得到的影响值得出异常特征(例如影响值高于设定阈值),由于特征对应设备,因此也直接定位出可能造成工业系统生成异常的设备。The abnormality judgment in this application is based on the SHAP interpretation model, which is implemented based on the Shapley value. Therefore, the calculation of the feature confidence in this embodiment is equivalent to the calculation of the Shapley value. Abnormal features are obtained based on the final impact value (for example, the impact value is higher than the set threshold). Since the features correspond to the equipment, equipment that may cause anomalies in the industrial system can also be directly located.
为了便于观察,本实施例进一步可视化诊断结果,参照图4和图5,图中f(x)表示分类模型的输出为异常结果的概率,f(x)的左侧表示对异常检测结果呈正相关,右侧表示对异常结果呈负相关,而特征区域宽度越大,表示该特征的权重分数越高,由此诊断异常产生的原因。例如,图4中,f4、f6、f1等特征(对应设备的标号)影响值较高,说明造成异常的原因最有可能是由这些特征对应的设备产生的。In order to facilitate observation, this embodiment further visualizes the diagnosis results. Refer to Figures 4 and 5. In the figure, f(x) represents the probability that the output of the classification model is an abnormal result. The left side of f(x) represents a positive correlation with the abnormal detection results. , the right side indicates a negative correlation with the abnormal results, and the larger the width of the feature area, the higher the weight score of the feature, thereby diagnosing the cause of the abnormality. For example, in Figure 4, features such as f4, f6, and f1 (corresponding to device numbers) have relatively high impact values, indicating that the cause of the anomaly is most likely caused by the devices corresponding to these features.
以上所述实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above-described embodiments can be combined in any way. To simplify the description, not all possible combinations of the technical features in the above-described embodiments are described. However, as long as there is no contradiction in the combination of these technical features, All should be considered to be within the scope of this manual.
以上所述实施例仅表达了本发明的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本发明构思的前提下,还可以做出若干变形和改进,这些都属于本发明的保护范围。因此,本发明的保护范围应以所附权利要求为 准。The above-described embodiments only express several implementation modes of the present invention. The descriptions are relatively specific and detailed, but should not be construed as limiting the scope of the invention. It should be noted that, for those of ordinary skill in the art, several modifications and improvements can be made without departing from the concept of the present invention, and these all belong to the protection scope of the present invention. Therefore, the protection scope of the present invention should be determined by the appended claims.

Claims (7)

  1. 一种基于多维传感数据的工业系统生产异常检测与诊断方法,其特征在于,所述基于多维传感数据的工业系统生产异常检测与诊断方法,包括:An industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data, characterized in that the industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data includes:
    S1、对多维传感数据样本进行预处理,并采用滑动窗口将预处理后的多维传感数据样本划分为若干个子样本,所述子样本包含正常子样本和异常子样本;给定多维传感数据样本s∈R N×T,s为一个二维矩阵,其中N为s的特征维度,即工业系统中所包含的设备个数,T为s的数据时长,即传感器的采样点个数; S1. Preprocess the multidimensional sensing data samples, and use a sliding window to divide the preprocessed multidimensional sensing data samples into several subsamples, where the subsamples include normal subsamples and abnormal subsamples; given multidimensional sensing Data sample s∈R N×T , s is a two-dimensional matrix, where N is the characteristic dimension of s, that is, the number of devices included in the industrial system, and T is the data duration of s, that is, the number of sampling points of the sensor;
    S2、采用自动编码机,以无监督训练方式基于正常子样本训练得到异常检测模型;S2. Use an automatic encoding machine to train an anomaly detection model based on normal subsamples in an unsupervised training method;
    S3、根据异常检测模型训练分类模型,包括:S3. Train the classification model based on the anomaly detection model, including:
    步骤31、利用异常检测模型对含有正常子样本和异常子样本的子样本进行检测,并根据检测结果对子样本添加标记,得到有标注子样本集;Step 31: Use the anomaly detection model to detect subsamples containing normal subsamples and abnormal subsamples, and add labels to the subsamples based on the detection results to obtain a labeled subsample set;
    步骤32、假定F为N个特征的集合,根据特征的组合每次取集合F中的n个特征得到2 N-1个特征子集S,n=1,2,…,N,根据每个特征子集从有标注子样本集中生成一个仅包含特征子集中的特征的训练子集,在每个训练子集上采用XGBoost分类器以有监督的方式训练一个分类模型,共得到2 N-1个分类模型; Step 32. Assume that F is a set of N features. According to the combination of features, take n features in the set F each time to obtain 2 N -1 feature subsets S, n = 1, 2,..., N. According to each The feature subset generates a training subset containing only the features in the feature subset from the labeled subsample set, and uses the XGBoost classifier to train a classification model in a supervised manner on each training subset, resulting in a total of 2 N -1 a classification model;
    S4、基于异常检测模型和分类模型对工业系统生产异常进行实时检测与诊断,包括:S4. Real-time detection and diagnosis of industrial system production anomalies based on anomaly detection models and classification models, including:
    获取待检测的实时子样本,若异常检测模型对实时子样本的检测结果为正常子样本则结束;否则利用分类模型根据实时子样本依次计算N个特征维度中每个特征维度对应的特征置信度,并根据特征置信度诊断异常特征,即定位工业系统中的异常设备。Obtain the real-time subsample to be detected. If the detection result of the real-time subsample by the anomaly detection model is a normal subsample, it ends; otherwise, the classification model is used to calculate the feature confidence corresponding to each of the N feature dimensions based on the real-time subsample. , and diagnose abnormal features based on feature confidence, that is, locate abnormal equipment in the industrial system.
  2. 如权利要求1所述的基于多维传感数据的工业系统生产异常检测与诊断方法,其特征在于,所述对多维传感数据样本进行预处理,包括:The industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data according to claim 1, characterized in that the preprocessing of multi-dimensional sensing data samples includes:
    对多维传感数据样本s中的缺失值,采用前后数据的平均值进行填充;For the missing values in the multi-dimensional sensing data sample s, the average value of the before and after data is used to fill;
    对多维传感数据样本s进行标准化处理使数据在[0,1]的范围内。Multidimensional sensing data samples s are normalized so that the data is in the range of [0,1].
  3. 如权利要求1所述的基于多维传感数据的工业系统生产异常检测与诊断方法,其特征在于,所述采用滑动窗口将预处理后的多维传感数据样本划分为若干个子样本,包括:The industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data according to claim 1, characterized in that the sliding window is used to divide the pre-processed multi-dimensional sensing data samples into several sub-samples, including:
    使用窗口大小为W的滑动窗口对多维传感数据样本s进行划分,得到连续的 M个子样本ss∈R N×WThe multi-dimensional sensing data sample s is divided using a sliding window with window size W to obtain continuous M sub-samples ss∈R N×W .
  4. 如权利要求3所述的基于多维传感数据的工业系统生产异常检测与诊断方法,其特征在于,所述自动编码机的网络结构包括输入层、编码层、语义层、解码层和输出层,其中:The industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data according to claim 3, characterized in that the network structure of the automatic encoding machine includes an input layer, a coding layer, a semantic layer, a decoding layer and an output layer, in:
    所述输入层:输入为子样本ss∈R N×WThe input layer: the input is subsample ss∈R N×W ;
    所述编码层:采用两层LSTM作为编码器,子样本ss中W个时刻的N维特征向量x 1,x 2,…,x W按顺序输入第一层LSTM的每个单元,得到的W个隐向量再按顺序输入第二层LSTM的每个单元,得到W个隐向量h 1,h 2,…,h WThe coding layer: uses two layers of LSTM as the encoder. The N-dimensional feature vectors x 1 , x 2 ,..., x W at W moments in the subsample ss are input into each unit of the first layer LSTM in sequence, and the obtained W The hidden vectors are then input into each unit of the second layer LSTM in sequence, and W hidden vectors h 1 , h 2 ,..., h W are obtained;
    所述语义层:取隐向量h W作为编码后的低维语义向量; The semantic layer: takes the latent vector h W as the encoded low-dimensional semantic vector;
    所述解码层:采用两层LSTM作为解码器,将隐向量h W重复W次并按顺序输入第一层LSTM的每个单元,得到的W个隐向量再按顺序输入第二层LSTM的每个单元,得到W个隐向量g 1,g 2,…,g WThe decoding layer: uses two layers of LSTM as the decoder, repeats the hidden vector h W times W and inputs it into each unit of the first layer LSTM in sequence, and the obtained W hidden vectors are then input into each unit of the second layer LSTM in sequence. units, get W hidden vectors g 1 , g 2 ,...,g W ;
    所述输出层:采用全连接层将W个隐向量g 1,g 2,…,g W转换为与子样本ss维度一致的向量y 1,y 2,…,y W,向量y 1,y 2,…,y W作为输出数据rss。 The output layer: uses a fully connected layer to convert W hidden vectors g 1 , g 2 ,..., g W into vectors y 1 , y 2 ,..., y W , vectors y 1 , y consistent with the dimension of the subsample ss. 2 ,…,y W is used as the output data rss.
  5. 如权利要求4所述的基于多维传感数据的工业系统生产异常检测与诊断方法,其特征在于,在异常检测模型的训练中采用输出数据rss与子样本ss的均方误差作为损失函数,并采用梯度下降的方式进行优化迭代。The industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data as claimed in claim 4, characterized in that the mean square error of the output data rss and the sub-sample ss is used as the loss function in the training of the anomaly detection model, and Optimization iteration is performed using gradient descent.
  6. 如权利要求1所述的基于多维传感数据的工业系统生产异常检测与诊断方法,其特征在于,所述根据实时子样本依次计算N个特征维度中每个特征维度对应的特征置信度,包括:The industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data according to claim 1, characterized in that the feature confidence corresponding to each of the N feature dimensions is calculated sequentially based on real-time sub-samples, including :
    对于特征k计算特征置信度如下:For feature k, the feature confidence is calculated as follows:
    Figure PCTCN2022135714-appb-100001
    Figure PCTCN2022135714-appb-100001
    式中,φ k为特征k的特征置信度,k=1,2,…,N,CM S(x S)为使用不含有特征k的特征子集S对应的训练子集训练的分类模型CM S在子样本x S上的输出结果,输出结果为0或1,子样本x S为从实时子样本中提取的与特征子集S所包含的特征相同的样本数据,CM S∪{k}(x S∪{k})为使用含有特征k的特征子集S∪{k}对应的训练子集训练的分类模型CM S∪{k}在子样本x S∪{k}上的输出结果,输出结果为0或1,子样本x S∪{k}为从实时子样本中提取的与特征子集S∪{k}所包含的特征相同的样本数据,
    Figure PCTCN2022135714-appb-100002
    表示不含有特征k的特征子集S。
    In the formula, φ k is the feature confidence of feature k, k = 1, 2,...,N, CM S (x S ) is the classification model CM trained using the training subset corresponding to the feature subset S that does not contain feature k. The output result of S on the subsample x S , the output result is 0 or 1. The subsample x S is the sample data extracted from the real-time subsample with the same features as the feature subset S, CM S∪{k} (x S∪{k} ) is the output result of the classification model CM S∪{k} trained on the subsample x S∪{k} using the training subset corresponding to the feature subset S∪{k} containing feature k , the output result is 0 or 1, and the subsample x S∪{k} is the sample data extracted from the real-time subsample with the same features as the feature subset S∪{k},
    Figure PCTCN2022135714-appb-100002
    Represents the feature subset S that does not contain feature k.
  7. 如权利要求1所述的基于多维传感数据的工业系统生产异常检测与诊断方法,其特征在于,所述根据特征置信度诊断异常特征,包括:The industrial system production anomaly detection and diagnosis method based on multi-dimensional sensing data according to claim 1, wherein the diagnosis of abnormal features based on feature confidence includes:
    先采用Sigmoid函数对所有特征置信度进行归一化得到权重分数,权重分数的绝对值表明了该特征对最终检测结果的影响值,基于影响值利用SHAP解释模型对检测结果进行解释。First, the Sigmoid function is used to normalize the confidence of all features to obtain the weight score. The absolute value of the weight score indicates the impact of the feature on the final detection result. Based on the impact value, the SHAP explanation model is used to explain the detection results.
PCT/CN2022/135714 2022-04-11 2022-11-30 Method for detecting and diagnosing production abnormality of industrial system on basis of multi-dimensional sensing data WO2023197617A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210375778.5 2022-04-11
CN202210375778.5A CN114841250A (en) 2022-04-11 2022-04-11 Industrial system production abnormity detection and diagnosis method based on multi-dimensional sensing data

Publications (1)

Publication Number Publication Date
WO2023197617A1 true WO2023197617A1 (en) 2023-10-19

Family

ID=82563882

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/135714 WO2023197617A1 (en) 2022-04-11 2022-11-30 Method for detecting and diagnosing production abnormality of industrial system on basis of multi-dimensional sensing data

Country Status (2)

Country Link
CN (1) CN114841250A (en)
WO (1) WO2023197617A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114841250A (en) * 2022-04-11 2022-08-02 浙江工业大学 Industrial system production abnormity detection and diagnosis method based on multi-dimensional sensing data
CN115474108B (en) * 2022-09-16 2023-09-22 南京品淳通信科技有限公司 Event monitoring system and method based on edge calculation
CN115358348B (en) * 2022-10-19 2023-03-24 成都数之联科技股份有限公司 Vehicle straight-through rate influence characteristic determination method, device, equipment and medium
CN116430817B (en) * 2023-04-26 2023-09-29 同心县启胜新能源科技有限公司 Data acquisition processing method and system applied to photovoltaic module production system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130282336A1 (en) * 2010-12-27 2013-10-24 Hitachi, Ltd. Anomaly Sensing and Diagnosis Method, Anomaly Sensing and Diagnosis System, Anomaly Sensing and Diagnosis Program and Enterprise Asset Management and Infrastructure Asset Management System
CN113051822A (en) * 2021-03-25 2021-06-29 浙江工业大学 Industrial system anomaly detection method based on graph attention network and LSTM automatic coding model
CN113255733A (en) * 2021-04-29 2021-08-13 西安交通大学 Unsupervised anomaly detection method under multi-modal data loss
CN114065613A (en) * 2021-10-27 2022-02-18 中国华能集团清洁能源技术研究院有限公司 Multi-working-condition process industrial fault detection and diagnosis method based on deep migration learning
CN114090396A (en) * 2022-01-24 2022-02-25 华南理工大学 Cloud environment multi-index unsupervised anomaly detection and root cause analysis method
CN114841250A (en) * 2022-04-11 2022-08-02 浙江工业大学 Industrial system production abnormity detection and diagnosis method based on multi-dimensional sensing data

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130282336A1 (en) * 2010-12-27 2013-10-24 Hitachi, Ltd. Anomaly Sensing and Diagnosis Method, Anomaly Sensing and Diagnosis System, Anomaly Sensing and Diagnosis Program and Enterprise Asset Management and Infrastructure Asset Management System
CN113051822A (en) * 2021-03-25 2021-06-29 浙江工业大学 Industrial system anomaly detection method based on graph attention network and LSTM automatic coding model
CN113255733A (en) * 2021-04-29 2021-08-13 西安交通大学 Unsupervised anomaly detection method under multi-modal data loss
CN114065613A (en) * 2021-10-27 2022-02-18 中国华能集团清洁能源技术研究院有限公司 Multi-working-condition process industrial fault detection and diagnosis method based on deep migration learning
CN114090396A (en) * 2022-01-24 2022-02-25 华南理工大学 Cloud environment multi-index unsupervised anomaly detection and root cause analysis method
CN114841250A (en) * 2022-04-11 2022-08-02 浙江工业大学 Industrial system production abnormity detection and diagnosis method based on multi-dimensional sensing data

Also Published As

Publication number Publication date
CN114841250A (en) 2022-08-02

Similar Documents

Publication Publication Date Title
WO2023197617A1 (en) Method for detecting and diagnosing production abnormality of industrial system on basis of multi-dimensional sensing data
WO2022160902A1 (en) Anomaly detection method for large-scale multivariate time series data in cloud environment
CN111504676B (en) Equipment fault diagnosis method, device and system based on multi-source monitoring data fusion
CN111460728B (en) Method and device for predicting residual life of industrial equipment, storage medium and equipment
CN111914873A (en) Two-stage cloud server unsupervised anomaly prediction method
CN110705812A (en) Industrial fault analysis system based on fuzzy neural network
CN110647911A (en) Bearing fault diagnosis method based on principal component analysis and deep belief network
CN112036435A (en) Brushless direct current motor sensor fault detection method based on convolutional neural network
CN112685476A (en) Periodic multivariate time series anomaly detection method and system
CN116520806A (en) Intelligent fault diagnosis system and method for industrial system
CN115994630A (en) Multi-scale self-attention-based equipment residual service life prediction method and system
CN114519923A (en) Intelligent diagnosis and early warning method and system for power plant
CN116929815A (en) Equipment working state monitoring system and method based on Internet of things
WO2022188425A1 (en) Deep learning fault diagnosis method integrating prior knowledge
CN117290800B (en) Timing sequence anomaly detection method and system based on hypergraph attention network
CN114500004A (en) Anomaly detection method based on conditional diffusion probability generation model
WO2024027487A1 (en) Health degree evaluation method and apparatus based on intelligent operations and maintenance scene
CN115564075B (en) Main and auxiliary integrated fault collaborative diagnosis method and system for urban power grid
Bond et al. A hybrid learning approach to prognostics and health management applied to military ground vehicles using time-series and maintenance event data
Sun et al. Intelligent fault diagnosis scheme for rolling bearing based on domain adaptation in one dimensional feature matching
Zhang et al. A Flexible Monitoring Framework via Dynamic-Multilayer Graph Convolution Network
CN115204272A (en) Industrial system fault diagnosis method and equipment based on multi-sampling rate data
CN115556099B (en) Sustainable learning industrial robot fault diagnosis system and method
CN117493980B (en) Bearing fault diagnosis method integrating feature extraction and sequencing causal discovery
US20230350402A1 (en) Multi-task learning based rul predication method under sensor fault condition

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22937249

Country of ref document: EP

Kind code of ref document: A1