US12050459B2 - Anomaly detection method and apparatus for dynamic control system, and computer-readable medium - Google Patents
Anomaly detection method and apparatus for dynamic control system, and computer-readable medium Download PDFInfo
- Publication number
- US12050459B2 US12050459B2 US18/262,630 US202118262630A US12050459B2 US 12050459 B2 US12050459 B2 US 12050459B2 US 202118262630 A US202118262630 A US 202118262630A US 12050459 B2 US12050459 B2 US 12050459B2
- Authority
- US
- United States
- Prior art keywords
- network
- time
- control system
- dynamic control
- point
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B23/00—Testing or monitoring of control systems or parts thereof
- G05B23/02—Electric testing or monitoring
- G05B23/0205—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults
- G05B23/0218—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults characterised by the fault detection method dealing with either existing or incipient faults
- G05B23/0221—Preprocessing measurements, e.g. data collection rate adjustment; Standardization of measurements; Time series or signal analysis, e.g. frequency analysis or wavelets; Trustworthiness of measurements; Indexes therefor; Measurements using easily measured parameters to estimate parameters difficult to measure; Virtual sensor creation; De-noising; Sensor fusion; Unconventional preprocessing inherently present in specific fault detection methods like PCA-based methods
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B23/00—Testing or monitoring of control systems or parts thereof
- G05B23/02—Electric testing or monitoring
- G05B23/0205—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults
- G05B23/0218—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults characterised by the fault detection method dealing with either existing or incipient faults
- G05B23/0224—Process history based detection method, e.g. whereby history implies the availability of large amounts of data
- G05B23/024—Quantitative history assessment, e.g. mathematical relationships between available data; Functions therefor; Principal component analysis [PCA]; Partial least square [PLS]; Statistical classifiers, e.g. Bayesian networks, linear regression or correlation analysis; Neural networks
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B23/00—Testing or monitoring of control systems or parts thereof
- G05B23/02—Electric testing or monitoring
- G05B23/0205—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults
- G05B23/0218—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults characterised by the fault detection method dealing with either existing or incipient faults
- G05B23/0243—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults characterised by the fault detection method dealing with either existing or incipient faults model based detection method, e.g. first-principles knowledge model
- G05B23/0254—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults characterised by the fault detection method dealing with either existing or incipient faults model based detection method, e.g. first-principles knowledge model based on a quantitative model, e.g. mathematical relationships between inputs and outputs; functions: observer, Kalman filter, residual calculation, Neural Networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
- G06N3/0442—Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
- G06N3/0455—Auto-encoder networks; Encoder-decoder networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0495—Quantised networks; Sparse networks; Compressed networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G06N3/067—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using optical means
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/01—Probabilistic graphical models, e.g. probabilistic networks
Definitions
- the present disclosure relates to anomaly detections.
- Various embodiments of the teachings herein include anomaly detection methods and/or apparatus for a dynamic control system and a computer-readable medium.
- Active state monitoring of a dynamic control system is vital to the safety and reliability of various industries (for example, discrete manufacturing, power generation, building asset management, and process industries).
- an anomaly detection system is usually deployed to monitor dynamic behaviors of a control system, including dynamic changes of the measurement values of a sensor and the state values of a trigger over time.
- it is still very difficult to build an effective anomaly detection model having a high true positive rate and a low false positive rate for a dynamic control system in practice. The reasons are:
- anomaly detection methods for a dynamic control system include: residual-based anomaly detection methods, density-based anomaly detection methods, one-class classification-based anomaly detection methods and rule-based anomaly detection methods.
- Residual-based anomaly detection methods depend on a neural network-based regressive prediction model (see Long Short - Term Memory published by Hochreiter, Sepp & Jürgen Schmidhuber on page 1735 to page 1780 of the journal of Neural Computation in 1977) or a reconstruction model of an auto-encoder (see Fast Learning Algorithm for Deep Belief Nets published by Hinton, Geoffrey E, Simon Osindero & Yee-Whye Teh on page 1527 to page 1554 of the journal Neural Computation in 2006, and Auto - Encoding Variational Bayes published by Kingma, Diederik P & Max Welling on the preprint platform arXiv in 2013) so that a measurement value of a sensor is compressed to obtain low-dimensionality characteristics and reconstruct a measurement value.
- a neural network-based regressive prediction model see Long Short - Term Memory published by Hochreiter, Sepp & Jürgen Schmidhuber on page 1735 to page 1780 of
- a predicted or reconstructed measurement value is compared with the measurement value obtained from real-time monitoring to generate a residual. If the residual exceeds a preset threshold, it is considered that an anomaly is detected. Since the sensor noise level and the prediction error or reconstruction error at each point of time are unknown in practice, it is very difficult to define a strict threshold between a normal measurement value and an abnormal measurement value of the sensor. The performance of a residual-based anomaly detection method will usually degrade when a measurement value of the sensor is interfered with by a high noise level or the prediction or reconstruction error of the model is unsteady.
- Density-based anomaly detection methods For density-based anomaly detection methods, a model is built for the probability distribution of a measurement value of the sensor at each point of time. When the likelihood value of a measurement value monitored in real time is smaller than the preset threshold, it is considered that an anomaly is detected. Density-based anomaly detection methods include the Kalman filter algorithm (see Kalman Filter published by C. K. Chui & G.
- density-based anomaly detection methods are generally more robust for sensor noise than residual-based anomaly detection methods
- density-based anomaly detection methods still have some limitations which restrict the practical application. For example, it is usually necessary to build a mathematical model of a physical dynamic process through system identification before the Kalman filter algorithm is applied, but system identification is very difficult in practice.
- high prior knowledge is usually required for many density-based anomaly detection methods in a physical dynamic process and/or the modeling of the distribution of measurement values of a sensor. When the physical dynamic process is highly nonlinear, the performance of these methods may degrade.
- One-class classification-based anomaly detection methods for example, the one-class support vector machine (SVM) (see One - Class Support Vector Machine for Document Classification published by Manevitz, Larry M & Malik Yousef on page 139 to page 154 of the journal of Machine Learning in 2001) and the isolation forest (see Isolation Forest published by Liu, Fei Tony, Kai Ming Ting & Zhi-Hua Zhou at the 8th IEEE International Conference on Data Mining in 2008) can be naturally applied to anomaly detections of a dynamic control system and can provide high explainability. Because of the curse of dimensionality and high nonlinearity of dynamic behaviors of the system, these methods are not applicable to current dynamic control systems.
- SVM one-class support vector machine
- Isolation Forest published by Liu, Fei Tony, Kai Ming Ting & Zhi-Hua Zhou at the 8th IEEE International Conference on Data Mining in 2008
- rule-based anomaly detection methods the state conditions, which must be maintained for a system, are obtained from prior knowledge. Any physical process value monitored in real time and destroying rules is classified into an anomaly.
- the rules are defined by field experts at the system design stage, and it is time-consuming and labor-consuming to manually process the rules.
- rule-based anomaly detection methods are usually subject to the fact that it is difficult to find enough rules.
- Various embodiments of teachings of the present disclosure include anomaly detection methods and/or apparatus for a dynamic control system, and a computer-readable medium.
- the structure of a specially designed neural network is used for system identification of a dynamic control system, and the system identification process is automatically completed by training the neural network.
- the generality of system identifications can significantly be improved, highly nonlinear dynamic behaviors of the dynamic control system can be obtained, and the common problem that a model having an average ability of expression may cause dimensionality disasters is overcome.
- the Bayes filter method is used and an anomaly is detected according to the likelihood of a measurement value of the sensor monitored in real time.
- an anomaly detection method for a dynamic control system may be implemented by a computer program.
- the method may comprise: using a g network to initialize a hidden state distribution of a dynamic control system; receiving a measurement value of a sensor and a state value of a trigger obtained from real-time monitoring in the dynamic control system at the current point of time t; inputting at least one first sampling point into an f network to perform a prediction to obtain at least one second sampling point, wherein the at least one first sampling point is used to represent a hidden state distribution of the dynamic control system at a neighboring point of time t ⁇ 1 before the current point of time t, and the at least one second sampling point is used to represent a prior hidden state distribution of the dynamic control system at the current point of time t; using an h network to map the at least one second sampling point into a sensor measurement value space to perform a prediction to obtain a probability distribution of a measurement value of the sensor in the dynamic control system at the current point of time t; determining whether an
- the g network, the f network and the h network are sub-networks in a neural network used to represent the dynamic distribution of the dynamic control system
- the g network is a feed-forward network and encodes a measurement value of a sensor into a low-dimensionality hidden state vector
- the f network encodes a measurement value of a sensor and a state value of a trigger in a sliding window into vectors and utilizes the hidden state vector obtained from the encoding of the g network at the current point of time to predict a hidden state vector at the next point of time
- the h network is a feed-forward network and decodes the hidden state vector obtained from the prediction at the next point of time into a measurement value of a sensor and decodes the low-dimensionality hidden state vector at the current point of time into a measurement value of a sensor
- the neural network is obtained by using measurement values of a sensor obtained in normal operating conditions of the dynamic control system to perform training.
- an anomaly detection apparatus for a dynamic control system comprises:
- the g network, the f network and the h network are sub-networks in a neural network used to represent the dynamic distribution of the dynamic control system
- the g network is a feed-forward network and encodes a measurement value of a sensor into a low-dimensionality hidden state vector
- the f network encodes a measurement value of a sensor and a state value of a trigger in a sliding window into vectors and utilizes the hidden state vector obtained from the encoding of the g network at the current point of time to predict a hidden state vector at the next point of time
- the h network is a feed-forward network and decodes the hidden state vector obtained from the prediction at the next point of time into a measurement value of a sensor and decodes the low-dimensionality hidden state vector at the current point of time into a measurement value of a sensor
- the neural network is obtained by using measurement values of a sensor obtained in normal operating conditions of the dynamic control system to perform training.
- an anomaly detection apparatus for a dynamic control system comprises at least a memory, configured to store computer-readable codes, and at least a processor, configured to cause the computer-readable codes to perform one or more of the methods described herein.
- a computer-readable medium stores computer-readable instructions, and a processor performs the steps of one or more of the methods described herein when the computer-readable instructions are executed by the processor.
- the posterior hidden state distribution of the dynamic control system at the current point of time t may be updated to obtain the first sampling point at a neighboring point of time t+1 after the current point of time t.
- the uncertainty of the hidden state of the system is tracked in real time and the reliability of anomaly monitoring is increased.
- the loss function adopted for training the neural network minimizes the sum of the reconstruction error and the prediction error of measurement values of a sensor used for training at different points of time.
- the end-to-end training method makes it very easy to implement the neural network in practical applications.
- the at least one first sampling point and the at least one second sampling point are both sigma sampling points. In this way, the probability distribution is highly efficiently expressed by use of a minimum number of sampling points and the efficiency of the method is greatly improved.
- FIG. 1 shows the structure of an example neural network for system identification in some embodiments of the teachings of the present disclosure
- FIG. 2 shows a comparison between the anomaly detection effects of the anomaly detection method adopted in various embodiments of the teachings of the present disclosure and prior anomaly detection methods;
- FIG. 3 shows the structure of an example anomaly detection apparatus incorporating teachings of the present disclosure
- FIG. 4 is a flowchart showing an example anomaly detection method incorporating teachings of the present disclosure.
- the term “comprise” and its variants are open terms and mean “include but are not limited to.”
- the term “on the basis of” means “at least partially on the basis of.”
- the terms “an embodiment” and “one embodiment” mean “at least one embodiment.”
- the term “another embodiment” means “at least one other embodiment.”
- the terms “first” and “second” may refer to different or identical objects. Other definitions, explicit or implicit, may be included below. Unless otherwise specified in the context, the definition of a term is consistent throughout the description.
- Control systems are classified into static control systems and dynamic control systems. The differences between a static control system and a dynamic control system are as follows:
- the state variable of a dynamic control system changes significantly with time and the state variable is a function of time.
- the state variable of a static control system changes little with time and it is difficult to observe and measure a change of the state variable.
- a dynamic control system consists of various variables or parameters and these variables are associated with each other and are constantly dynamic.
- the output of a static control system at any point of time is only related to the input at the point of time, but has nothing to do with the input before or after the point of time.
- the final state of a dynamic control system may be an equilibrium state or may be a non-equilibrium state.
- the final state of a static control system is an equilibrium state.
- a dynamic control system may further have highly nonlinear dynamic behaviors and the feature that the noise level of a sensor and the model error are unknown.
- System identification is a process of determining a mathematic model describing system behaviors according to the input and output time functions of the system.
- the purpose of building a mathematic model through system identification is to estimate important parameters representing system behaviors to build a model which can simulate real system behaviors.
- Time series are a series of ordered data. Usually, they are data sampled at equal intervals. If they are not sampled at equal intervals, the time scale of each data point will be marked.
- the neural network used for system identification of the dynamic control system and the process of anomaly detection by using the Bayes filter method in the embodiments of the present invention are described in combination with FIG. 1 and FIG. 2 .
- FIG. 1 shows the structure of an example neural network 10 for system identification in various embodiments of the present disclosure.
- the dynamic control system comprises some sensors and some triggers. Let x t be measurement values of the sensors at the point of time t and let u t be state values of the triggers at the point of time t.
- the following neural network structure is proposed to obtain dynamic changes of time series of the dynamic control system.
- the neural network 10 here may comprise three sub-networks, called g network, f network and h network, respectively.
- the g network is a feed-forward network, and uses the measurement values x t-1 of the sensors at the point of time t ⁇ 1 as inputs and encodes the measurement values of the sensors into low-dimensionality state vectors z t-1 .
- the measurement values of the sensors and the state values of the triggers in a sliding window with a length of l are used as inputs and are encoded into hidden vector h t-1 by using a long short-term memory (LSTM) neural network.
- LSTM long short-term memory
- the f network uses the hidden state vectors z t-1 as inputs and then utilizes the feed-forward network to predict the hidden state vectors z t at the next point of time.
- the h network is a feed-forward network and uses the hidden state vectors as inputs and decodes the hidden state vectors into corresponding measurement values of the sensors. It should be noted that the two h networks in FIG. 1 may share the same weight.
- the f network, the g network and the h network may be implemented as a simulation model or differential equation solver and their specific implementation modes are not restricted.
- the whole neural network 10 which may be denoted by F ⁇ , ⁇ , ⁇ , uses the measurement values x t-1 of the sensors at the point of time t ⁇ 1, and the measurement values x t-l:t-1 of the sensors and the state values u t-l:t-1 of the triggers in the sliding window as inputs and uses the measurement values ⁇ tilde over (x) ⁇ t-1 and ⁇ tilde over (x) ⁇ t of the sensors after the decoding of the hidden state vectors as outputs.
- the first two terms are respectively the reconstruction error and the prediction error of measurement values of the sensors, and the third term is a smoothing factor. In this way, continuous hidden state vectors at two points of time can be closer to each other.
- ⁇ , ⁇ and ⁇ are hyperparameters representing three weights.
- R is a covariance matrix of the reconstruction error and is obtained according to the estimation of the reconstruction value obtained from the formula below based on the same validation data set: x t ⁇ h ⁇ ( g ⁇ ( x t )), for all t 2.
- the Bayes filter can be used for anomaly detection and the time-varying probability distribution of the hidden state of the dynamic control system can be iteratively estimated.
- z t and P t can be used to track the probability distribution of the hidden state of the dynamic control system (hereinafter referred to as “hidden state distribution”), wherein z t represents a mean vector and P t represents a covariance matrix of the hidden state distribution at the point of time t.
- the whole process is divided into an initialization step, a prediction step, an updating step and an anomaly detection step.
- this step calculate the prior mean and the covariance of the hidden state distribution at the point of time t.
- a sampling function for example, a sigma function
- first sampling points here, and called sigma points if the sampling function is a sigma function
- Sampling of the sigma function for example, is described below.
- weights of these sigma points are W m and W c , wherein, one example of a sigma function is the use of the scaled sigma point algorithm presented by Van der Merwe (see Sigma - Point Kalman Filters for Probabilistic Inference in Dynamic State - Space Models published by Van der Merwe in 2004).
- Z,W m ,W c sigma function( z t-1 ,P t-1 ) (1)
- the mean and the covariance of the prior hidden state distribution can be obtained from the calculation of the unscented transformation function at the point of time t:
- the Mahalanobis distance exceeds a preset threshold ⁇ , the measurement values obtained from real-time monitoring will impossibly occur, even if the sensor noise of the predicted noise is considered. That is to say, an anomaly is detected.
- System data consist of measurement values of 52 sensors sampled every minute in 5 months.
- the data set contains 7 faults lasting from hours to days.
- the data set was divided into a training set, a validation set and a test set at ratios of 3:1:1. All the 7 faults happened in the period of the test set, which means that the training set and the validation set contain only data in the normal operating conditions.
- the training set is used to train the neural network and used the validation set to adjust the hyperparameters to obtain the optimal validation performance.
- the test set is used to evaluate the performance of the anomaly detection method.
- FIG. 2 shows the performances of the anomaly detection method described herein and above-mentioned other anomaly detection methods (isolation forest, Bayes estimation algorithm, and auto-encoders including sparse auto-encoder, variational auto-encoder and LSTM auto-encoder). The same data set is used for training all baseline models.
- the method provided by embodiments of the present invention is obviously superior to other methods, wherein No. 1 corresponds to the method provided by embodiments of the present invention, No. 2 corresponds to the isolation forest, No. 3 corresponds to the Seq2SeqLSTM, No. 4 corresponds to the dilated convolution neural network (dilated CNN), No. 5 corresponds to the sparse auto-encoder, No. 6 corresponds to the variational auto-encoder, No. 7 corresponds to the LSTM auto-encoder, and No. 8 corresponds to the Bayes estimation algorithm.
- No. 1 corresponds to the method provided by embodiments of the present invention
- No. 2 corresponds to the isolation forest
- No. 3 corresponds to the Seq2SeqLSTM
- No. 4 corresponds to the dilated convolution neural network (dilated CNN)
- No. 5 corresponds to the sparse auto-encoder
- No. 6 corresponds to the variational auto-encoder
- No. 7 corresponds to
- the anomaly detection apparatus 30 may be implemented as a network of computer processors to realize the anomaly detection method 400 for a dynamic control system as described herein.
- the anomaly detection apparatus 30 may also be a single computer shown in FIG. 3 , comprising at least one memory 301 , including a computer-readable medium (RAM).
- the apparatus 30 further comprises at least one processor 302 coupled with at least one memory 301 .
- Computer-executable instructions are stored in at least one memory 301 , and allow at least one processor 302 to perform the steps described in this document when executed by at least one processor 302 .
- At least one processor 302 may be a microprocessor, an application-specific integrated circuit (ASIC), a digital signal processor (DSP), a central processing unit (CPU), a graphics processing unit (GPU) or a state machine.
- ASIC application-specific integrated circuit
- DSP digital signal processor
- CPU central processing unit
- GPU graphics processing unit
- Embodiments of the computer-readable medium include, but are not limited to, floppy disks, CD-ROM, disks, memory chips, ROMs, RAMS, ASICs, configured processors, all-optical media, all magnetic tapes or other magnetic media, and any other medium from which a computer processor can read instructions.
- computer-readable media in other forms, which can send or carry instructions to a computer include routers, private or public networks, or other wired and wireless transmission equipment or channels. Instructions may include codes in any computer programming language, including C, C++, C language, Visual Basic, Java and JavaScript.
- the at least one memory 301 shown in FIG. 3 may contain an anomaly detection program 31 for a dynamic control system to allow at least one processor 302 to execute the anomaly detection method 400 for a dynamic control system provided by various embodiments of the present disclosure.
- the anomaly detection program 31 for a dynamic control system may comprise:
- the anomaly detection program 31 may further comprise an updating module 315 , configured to update the posterior hidden state distribution of the dynamic control system at the current point of time t to obtain the first sampling point at a neighboring point of time t+1 after the current point of time t.
- an updating module 315 configured to update the posterior hidden state distribution of the dynamic control system at the current point of time t to obtain the first sampling point at a neighboring point of time t+1 after the current point of time t.
- the loss function adopted for training the neural network minimizes the sum of the reconstruction error and the prediction error of measurement values of a sensor used for training at different points of time.
- the at least one first sampling point and the at least one second sampling point are both sigma sampling points.
- the anomaly detection apparatus 30 may further comprise a communication module 303 , and the communication module is connected with at least one processor 302 and at least one memory 301 via a bus and is used for communication of the anomaly detection apparatus 30 with external equipment.
- embodiments of the present disclosure may comprise apparatuses whose structures are different from what is shown in FIG. 3 .
- the above-mentioned structure is only exemplary and is used to explain the method 400 provided by embodiments of the present disclosure.
- the above-mentioned modules can also be considered as functional modules realized by hardware and are used to realize the functions involved when the image stitching apparatus 30 executes the image stitching method.
- the control logics of various processes involved in the image stitching method are burned into field-programmable gate array (FPGA) chips or complex programmable logic devices (CPLDs) in advance, and then these chips or devices execute the functions of the above-mentioned modules.
- FPGA field-programmable gate array
- CPLDs complex programmable logic devices
- the anomaly detection apparatus 30 may further comprise a communication module 303 , and the communication module is connected with at least one processor 302 and at least one memory 301 via a bus and is used for communication of the anomaly detection apparatus 30 with external equipment.
- the anomaly detection method 400 for a dynamic control system provided by teachings of the present disclosure is described in combination with FIG. 4 .
- the method may comprise:
- the method 400 may further comprise step S 406 : updating the posterior hidden state distribution of the dynamic control system at the current point of time t to obtain the first sampling point at a neighboring point of time t+1 after the current point of time t.
- the loss function adopted for training the neural network 10 minimizes the sum of the reconstruction error and the prediction error of measurement values of a sensor used for training at different points of time.
- At least one first sampling point and at least one second sampling point are both sigma sampling points.
- embodiments of the present disclosure further provide a computer-readable medium.
- Computer-readable instructions are stored in the computer-readable medium and a processor executes the above-mentioned anomaly detection method for a dynamic control system when the computer-readable instructions are executed by the processor.
- Embodiments of the computer-readable medium include a floppy disk, a hard disk, a magneto-optical disk, a compact disk (for example, CD-ROM, CD-R, CD-RW, DVD-ROM, DVD-RAM, DVD-RW, DVD+RW), a magnetic tape, a non-volatile memory card, and a ROM.
- computer-readable instructions can be downloaded from a server computer or cloud via a communication network.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Health & Medical Sciences (AREA)
- Automation & Control Theory (AREA)
- General Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Neurology (AREA)
- Testing And Monitoring For Control Systems (AREA)
Abstract
Description
-
- 1. The fault data volume is usually small and the anomaly detection model must be able to detect an unknown fault.
- 2. For a control system having highly nonlinear dynamic behaviors, the anomaly detection model must be able to accurately capture complex dynamic behaviors of the system.
- 3. An anomaly must be accurately detected when the sensor noise level and the model error at a random point of time are unknown.
-
- an initialization module, configured to use a g network to initialize a hidden state distribution of a dynamic control system;
- a data acquisition module, configured to receive a measurement value of a sensor and a state value of a trigger obtained from real-time monitoring in the dynamic control system at the current point of time t;
- a prediction module, configured to input at least one first sampling point into an f network to perform a prediction to obtain at least one second sampling point, wherein the at least one first sampling point is used to represent a hidden state distribution of the dynamic control system at a neighboring point of time t−1 before the current point of time t, the at least one second sampling point is used to represent a prior hidden state distribution of the dynamic control system at the current point of time t, and use an h network to map the at least one second sampling point into a sensor measurement value space to perform a prediction to obtain a probability distribution of a measurement value of the sensor in the dynamic control system at the current point of time t;
- an anomaly determination module, configured to determine whether an anomaly exists in the dynamic control system by comparing the measurement value obtained from real-time monitoring and the probability distribution obtained from a prediction.
| 10: Neural | 1: Detection result | 2: Detection result by using |
| network | by using the anomaly | the isolation forest method |
| detection method | ||
| provided in | ||
| embodiments of the | ||
| present invention | ||
| 3: Detection result by using the | 4: Detection result by using |
| Seq2SeqLSTM method | a dilated convolutional |
| neural network | |
| 5: Detection result by using a | 6: Detection result by using |
| sparse auto-encoder | a variational auto-encoder |
| 7: Detection result by using an | 8: Detection result by using |
| LSTM auto-encoder | the Bayes estimation |
| algorithm | |
| 30: Anomaly detection apparatus | 301: Memory |
| provided in embodiments of the | |
| present invention |
| 302: Processor | 303: | 31: Anomaly detection |
| Communication | program | |
| module |
| 311-315: Software program | |
| modules in the | |
| program | |
| 31 | |
| 400: Anomaly detection method | S401-S406: Steps of the |
| provided in embodiments of the | |
| present invention | |
z t =f θ(z t-1 ;z t-l:t-1 ,u t-l:t-1)+Q
x t =h φ(z t)+R
wherein Q is a covariance matrix of the prediction error and is obtained according to the estimation of the empirical value of the prediction error obtained from the formula below based on a validation data set:
g ω(x t)−f θ(g ω(x t-1);x t-l:t-1 ,u t-l:t-1), for all, t<l
x t −h φ(g ω(x t)), for all t
2. Bayes Filter for Anomaly Detection
Z,W m ,W c=sigma function(z t-1 ,P t-1) (1)
Y=f θ(Z,x t-l:t-1 ,u t-l:t-1) (2)
iii. Updating Step
L=h(Y) (5)
Z t =
P t =
iv. Anomaly Detection Step
√{square root over ((x t−μ)T(Σ)−1(x t−μ))}>τ (11)
-
- an
initialization module 311, configured to use a g network of theneural network 10 shown inFIG. 1 to initialize a hidden state distribution of a dynamic control system; - a
data acquisition module 312, configured to receive a measurement value of a sensor and a state value of a trigger obtained from real-time monitoring in the dynamic control system at the current point of time t; - a
prediction module 313, configured to input at least one first sampling point into an f network of theneural network 10 to perform a prediction to obtain at least one second sampling point, wherein the at least one first sampling point is used to represent a hidden state distribution of the dynamic control system at a neighboring point of time t−1 before the current point of time t, the at least one second sampling point is used to represent a prior hidden state distribution of the dynamic control system at the current point of time t, and use an h network of theneural network 10 to map the at least one second sampling point into a sensor measurement value space to perform a prediction to obtain a probability distribution of a measurement value of the sensor in the dynamic control system at the current point of time t; - an
anomaly determination module 314, configured to determine whether an anomaly exists in the dynamic control system by comparing the measurement value obtained from real-time monitoring and the probability distribution obtained from a prediction.
- an
-
- S401: using a g network of the
neural network 10 shown inFIG. 1 to initialize a hidden state distribution of a dynamic control system; - S402: receiving a measurement value of a sensor and a state value of a trigger obtained from real-time monitoring in the dynamic control system at the current point of time t;
- S403: inputting at least one first sampling point into an f network of the
neural network 10 to perform a prediction to obtain at least one second sampling point, wherein the at least one first sampling point is used to represent a hidden state distribution of the dynamic control system at a neighboring point of time t−1 before the current point of time t, and the at least one second sampling point is used to represent a prior hidden state distribution of the dynamic control system at the current point of time t; - S404: using an h network of the
neural network 10 to map at least one second sampling point into a sensor measurement value space to perform a prediction to obtain a probability distribution of a measurement value of the sensor in the dynamic control system at the current point of time t; - S405: determining whether an anomaly exists in the dynamic control system by comparing the measurement value obtained from real-time monitoring and the probability distribution obtained from a prediction.
- S401: using a g network of the
Claims (9)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202110112274.X | 2021-01-27 | ||
| CN202110112274.XA CN114815763B (en) | 2021-01-27 | 2021-01-27 | A method, device and computer readable medium for detecting abnormality of a dynamic control system |
| PCT/CN2021/141706 WO2022161069A1 (en) | 2021-01-27 | 2021-12-27 | Anomaly detection method and apparatus for dynamic control system, and computer-readable medium |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20240045411A1 US20240045411A1 (en) | 2024-02-08 |
| US12050459B2 true US12050459B2 (en) | 2024-07-30 |
Family
ID=82525196
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US18/262,630 Active US12050459B2 (en) | 2021-01-27 | 2021-12-27 | Anomaly detection method and apparatus for dynamic control system, and computer-readable medium |
Country Status (4)
| Country | Link |
|---|---|
| US (1) | US12050459B2 (en) |
| EP (1) | EP4266209A4 (en) |
| CN (1) | CN114815763B (en) |
| WO (1) | WO2022161069A1 (en) |
Families Citing this family (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11619932B2 (en) * | 2021-04-13 | 2023-04-04 | Accenture Global Solutions Limited | Prediction method and system for multivariate time series data in manufacturing systems |
| CN117787268A (en) * | 2022-09-19 | 2024-03-29 | 西门子股份公司 | Method, device, electronic device and readable storage medium for detecting data anomaly |
| CN115294674B (en) * | 2022-10-09 | 2022-12-20 | 南京信息工程大学 | A method for monitoring and evaluating the navigation status of unmanned boats |
| CN116383608B (en) * | 2023-04-06 | 2025-10-28 | 中国人民解放军空军工程大学 | An online prediction method for equipment failure with small sample size |
| CN119629086B (en) * | 2025-02-13 | 2025-06-13 | 深圳市速程精密科技有限公司 | A method for monitoring the communication status of a target ZR device |
Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5586221A (en) | 1994-07-01 | 1996-12-17 | Syracuse University | Predictive control of rolling mills using neural network gauge estimation |
| WO2007067471A1 (en) * | 2005-12-07 | 2007-06-14 | Siemens Corporate Research, Inc. | Evaluating anomaly for one-class classifiers in machine condition monitoring |
| EP1914638A1 (en) * | 2006-10-18 | 2008-04-23 | Bp Oil International Limited | Abnormal event detection using principal component analysis |
| US8606554B2 (en) * | 2009-10-19 | 2013-12-10 | Siemens Aktiengesellschaft | Heat flow model for building fault detection and diagnosis |
| CN104156422A (en) | 2014-08-06 | 2014-11-19 | 辽宁工程技术大学 | Gas concentration real-time prediction method based on dynamic neural network |
| CN111666982A (en) | 2020-05-19 | 2020-09-15 | 上海核工程研究设计院有限公司 | Electromechanical equipment fault diagnosis method based on deep neural network |
| US20210097438A1 (en) * | 2019-10-01 | 2021-04-01 | Kabushiki Kaisha Toshiba | Anomaly detection device, anomaly detection method, and anomaly detection program |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11010921B2 (en) * | 2019-05-16 | 2021-05-18 | Qualcomm Incorporated | Distributed pose estimation |
| CN110751199A (en) * | 2019-10-15 | 2020-02-04 | 南京航空航天大学 | A Satellite Anomaly Detection Method Based on Bayesian Neural Network |
| CN111160968A (en) * | 2019-12-27 | 2020-05-15 | 清华大学 | SKU-level commodity sales forecast method and device |
-
2021
- 2021-01-27 CN CN202110112274.XA patent/CN114815763B/en active Active
- 2021-12-27 WO PCT/CN2021/141706 patent/WO2022161069A1/en not_active Ceased
- 2021-12-27 US US18/262,630 patent/US12050459B2/en active Active
- 2021-12-27 EP EP21922658.6A patent/EP4266209A4/en not_active Withdrawn
Patent Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5586221A (en) | 1994-07-01 | 1996-12-17 | Syracuse University | Predictive control of rolling mills using neural network gauge estimation |
| WO2007067471A1 (en) * | 2005-12-07 | 2007-06-14 | Siemens Corporate Research, Inc. | Evaluating anomaly for one-class classifiers in machine condition monitoring |
| EP1914638A1 (en) * | 2006-10-18 | 2008-04-23 | Bp Oil International Limited | Abnormal event detection using principal component analysis |
| US8606554B2 (en) * | 2009-10-19 | 2013-12-10 | Siemens Aktiengesellschaft | Heat flow model for building fault detection and diagnosis |
| CN104156422A (en) | 2014-08-06 | 2014-11-19 | 辽宁工程技术大学 | Gas concentration real-time prediction method based on dynamic neural network |
| US20210097438A1 (en) * | 2019-10-01 | 2021-04-01 | Kabushiki Kaisha Toshiba | Anomaly detection device, anomaly detection method, and anomaly detection program |
| CN111666982A (en) | 2020-05-19 | 2020-09-15 | 上海核工程研究设计院有限公司 | Electromechanical equipment fault diagnosis method based on deep neural network |
Non-Patent Citations (15)
| Title |
|---|
| Extended European Search Report, Application No. 21922658.6, 10 pages, Mar. 15, 2024. |
| Feng, Cheng et al., "Time Series Anomaly Detection for Cyber-Physical Systems via Neural System Identification and Bayesian Filtering," arxiv.org, XP091124529; DOI: 10.1145/3447548.3467137, 10 pages, Jan. 7, 2022. |
| Hinton, Geoffrey E. et al., "A Fast Learning Algorithm for Deep Belief Nets," Neural Computation, 18(7), pp. 1527-1554, Jul. 1, 2006. |
| Hochreiter, Sepp et al., "Long Short-Term Memory," Neural Computation, 9(8), 32 pages, Dec. 1997. |
| Huang et al., "Using recurrent neural networks toward black-box system anomaly prediction", Oct. 18, 2016, IEEE/ACM 24th International Symposium on Quality of Service (IWQoS), pp. 1-10 (Year: 2016). * |
| Kingma, Diederik et al., "Adam: A Method for Stochastic Optimization," arXiv preprint, 15 pages, Dec. 22, 2014. |
| Kingma, Diederik et al., "Auto-Encoding Variational Bayes," arXiv preprint, 14 pages, Dec. 20, 2013. |
| Liu, Fei Tony et al., "Isolation Forest," 2008 Eighth IEEE International Conference on Data Mining, 10 pages, Dec. 15, 2008. |
| Manevitz, Larry M. et al., "One-Class SVMs for Document Classification," Journal of Machine Learning Research, vol. 2, pp. 139-154, Dec. 2001. |
| Search Report for International Application No. PCT/CN2021/141706, 9 pages, Mar. 9, 2022. |
| Van der Merwe, Rudolph, "Sigma-Point Kalman Filters for Probabilistic Inference in Dynamic State-Space Models," Ph.D. Dissertation, Oregon Health & Science University, 397 pages, Apr. 2004. |
| Wan, Eric A. et al, "The Unscented Kalman Filter for Nonlinear Estimation," Proceedings of the IEEE 2000 Adaptive Systems for Signal Processing, Communications, and Control Symposium, 6 pages, Oct. 4, 2000. |
| Xie, Xin et al., "Multivariate Abnormal Detection for Industrial Control Systems Using 1D CNN and GRU," IEEE Access, vol. 8, pp. 88348-88359, May 7, 2020. |
| Yuan, Chao et al., "Bayesian Sensor Estimation for Machine Condition Monitoring," Siemens Corporate Research Inc., 2007 IEEE International Conference on Acoustics, Speech and Signal Processing—ICASSP'07, vol. 2, pp. 517-520, Apr. 15, 2007. |
| Yuan, Chao et al., "Robust Sensor Estimation Using Temporal Information," 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 2077-2080, Mar. 31, 2008. |
Also Published As
| Publication number | Publication date |
|---|---|
| EP4266209A1 (en) | 2023-10-25 |
| WO2022161069A1 (en) | 2022-08-04 |
| US20240045411A1 (en) | 2024-02-08 |
| EP4266209A4 (en) | 2024-04-17 |
| CN114815763B (en) | 2025-06-27 |
| CN114815763A (en) | 2022-07-29 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US12050459B2 (en) | Anomaly detection method and apparatus for dynamic control system, and computer-readable medium | |
| JP6740247B2 (en) | Anomaly detection system, anomaly detection method, anomaly detection program and learned model generation method | |
| Mobtahej et al. | An LSTM‐Autoencoder Architecture for Anomaly Detection Applied on Compressors Audio Data | |
| US10754310B2 (en) | Incorporating change diagnosis using probabilistic tensor regression model for improving processing of materials | |
| CN111711608B (en) | Method and system for detecting abnormal flow of power data network and electronic equipment | |
| Ye et al. | A deep learning-based method for automatic abnormal data detection: Case study for bridge structural health monitoring | |
| KR102697617B1 (en) | Method for forecasting temperature and detecting anomaly in underground utility tunnel based on deep learning | |
| US11838192B2 (en) | Apparatus and method for monitoring network | |
| US10402289B2 (en) | Fine-grained causal anomaly inference for complex system fault diagnosis | |
| CN108257365B (en) | Industrial alarm design method based on global uncertainty evidence dynamic fusion | |
| Wang et al. | A hidden semi‐markov model with duration‐dependent state transition probabilities for prognostics | |
| CN112116070B (en) | Subway station environment parameter monitoring method and device | |
| CN119442104B (en) | Causally driven counterfactual data generation and its application method, device and storage medium in anomaly detection | |
| CN119293454A (en) | Spatiotemporal data prediction method, device and equipment | |
| Shiri et al. | Robust switching Kalman filter for diagnostics of long-term condition monitoring data in the presence of non-Gaussian noise | |
| KR20250068149A (en) | Method of data imputation for multivariate time series | |
| Moshkin et al. | Development of a time series forecasting method based on entropy measures of diagnostic information | |
| Jules et al. | Remaining useful life prediction of turbofans with virtual health indicator: a comparison of particle filter-based approaches | |
| CN115660832A (en) | Abnormal transaction detection method and device | |
| Melville et al. | Methods for data-centric small satellite anomaly detection and fault prediction | |
| US20240028872A1 (en) | Estimation apparatus, learning apparatus, methods and programs for the same | |
| Rong et al. | Hawkes network identification with log-sparsity penalty | |
| US20230177393A1 (en) | Machine learning apparatus, machine learning system, machine learning method, and program | |
| Fang et al. | Fault diagnosis using interval data-driven LPV observers and structural analysis | |
| Akofor et al. | Interactive fusion in distributed detection: Architecture and performance analysis |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
| AS | Assignment |
Owner name: SIEMENS LTD., CHINA, CHINA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:WEN, CHEN JIA;CHENG, FENG;CHAO, LI CONG;AND OTHERS;SIGNING DATES FROM 20230710 TO 20230723;REEL/FRAME:066622/0909 Owner name: SIEMENS AKTIENGESELLSCHAFT, GERMANY Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SIEMENS LTD., CHINA;REEL/FRAME:066622/0970 Effective date: 20230728 |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: AWAITING TC RESP., ISSUE FEE NOT PAID |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED |
|
| STCF | Information on status: patent grant |
Free format text: PATENTED CASE |