CN105740212A - Sensor exception detection method based on regularized vector autoregression model - Google Patents

Sensor exception detection method based on regularized vector autoregression model Download PDF

Info

Publication number
CN105740212A
CN105740212A CN201610072438.XA CN201610072438A CN105740212A CN 105740212 A CN105740212 A CN 105740212A CN 201610072438 A CN201610072438 A CN 201610072438A CN 105740212 A CN105740212 A CN 105740212A
Authority
CN
China
Prior art keywords
data
model
detection method
sensor
exception detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610072438.XA
Other languages
Chinese (zh)
Inventor
韦义明
何改云
王建
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University filed Critical Tianjin University
Priority to CN201610072438.XA priority Critical patent/CN105740212A/en
Publication of CN105740212A publication Critical patent/CN105740212A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • G06F18/24147Distances to closest patterns, e.g. nearest neighbour classification

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Evolutionary Biology (AREA)
  • Mathematical Physics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Operations Research (AREA)
  • Probability & Statistics with Applications (AREA)
  • Artificial Intelligence (AREA)
  • Algebra (AREA)
  • Evolutionary Computation (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention relates to a sensor exception detection method based on a regularized vector autoregression model. The sensor exception detection method comprises the following steps: (1) establishing a multielement linear regression model, and determining a target function; collecting data by a sensor; establishing a nearest neighbor graph of data points, wherein the graph consists of n vertexes, each vertex corresponds to one data point, and the weight matrix of edges is defined by considering a relationship between fixed points; constructing a bound term in order to keep similarity among original data points while the obstacles of high dimension and overfitting can be overcome; and utilizing the target function to train a model parameter to obtain an optimal parameter coefficient, and utilizing the model obtained by training to carry out exception detection. The sensor exception detection method can better predict original data.

Description

A kind of sensor abnormality detection method based on regularization Vector Autoression Models
Technical field
The invention belongs to sensor abnormality detection technique field, particularly relate to sensor abnormality detection technique.
Background technology
Abnormality detection refers to detection and finds not meet in observation sample data the data pattern of normally desired behavior, and it is one of problem very active in data mining research, in occupation of leading status, is widely used.The application of abnormality detection technology can effectively prevent the invasion of network; ensure industrial safety; the fault etc. of monitoring equipment; therefore the research of abnormality detection is rich in theory significance and actual application value; and had been subjected to concern widely, become a very active and popular research subject.Abnormality detection is the task of a great particularity, this is mainly in truthful data great majority and only meets the data pattern of expectation (normal class) behavior, and rare or unknown violation meets the data pattern of expectation (exception class) behavior, this two class observes the extreme disequilibrium (exception class sample number is much smaller than normal class sample) of sample allows abnormality detection become more difficult.
Vector Autoression Models (VAR) is a kind of multivariate statistical method, is used to the linear dependence catching between multivariate time series, has promoted single argument autoregression (AR) model, be widely used.VAR model is based on the statistical property of data and sets up model, with current variablees all in model, some lagged variables of all variablees is returned, it is possible to estimate the dynamic relationship between joint endogenous variables, and need not anything first constraints.VAR model is a popular multivariate time series analytical model, but maximum the haveing the drawback that of this model can meet with overparameterization, because the quantity of parameter becomes square number to increase along with the seasonal effect in time series increase comprised.When seasonal effect in time series number is relative to seasonal effect in time series length, standard estimation technique is inaccurate.This destruction identifies the ability of important relationship in data, thus can not make predictive ability accurately.In order to solve the excessive parameter problem in VAR model, the model (SVAR) of sparse estimation is suggested.Utilizing regularization penalty term to retrain VAR model can so that some parameter estimation be zero, and the method is sparse in this sense, and then can overcome the obstacle of higher-dimension and overfitting.Similar with the method basic thought, utilize figure regular terms VAR to carry out retraining the advantage being possible not only to keep the sparse constraint of SVAR, simultaneously can also in conjunction with the information between sensing data.Compared with SVAR method, utilize figure canonical constraint VAR to enable to model more sparse, and may apply to large-scale data concentration.
Sensing data is set up model based on the statistical property of data by VAR, it is prone to estimate, it is possible to matching initial data well, there is considerable flexibility and practicality, the generation process particularly describing little variables collection data is most suitable, and therefore this model is widely used.But, in many practical applications, such as sensing data, data volume is relatively more, and the predictive ability of the method will decline.In order to overcome the problem of higher-dimension and overfitting, adopt regularization penalty term that VAR is retrained to solve coefficient more sparse.Simultaneously taking account of relation implicit between data, this bound term can ensure that interior spatial structure original between data, whereby it can be detected that the relation between multivariate.Therefore, the sensor abnormality detection technique based on regularization Vector Autoression Models just has even more important researching value.
Summary of the invention
The present invention devises a kind of sensor abnormality detection technique method based on regularization Vector Autoression Models.Comparing with original Vector Autoression Models, the method has advantage, and the method take into account the similarity relationships between data, be ensure that by increase bound term and solves the openness of coefficient, also maintains the interior geometry between initial data simultaneously.
A kind of sensor abnormality detection method based on regularization Vector Autoression Models, comprises the following steps:
(1) set up multiple linear regression model, and determine object function;
(2) sensor acquisition data are utilized;
(3) arest neighbors figure, the figure that set up data point are made up of n summit, one data point of each of which vertex correspondence, it is considered to the relation between fixed point and fixed point, and the weight matrix W on definition limit is as follows:
Wherein Np(xi) represent data point xiThe data acquisition system of p nearest neighbor point composition, definition L=D-W, D are diagonal matrix, and its diagonal element is current line or column vector sum, namelyClaiming L is figure Laplacian Matrix;
(4) similarity for the obstacle of higher-dimension and overfitting can be overcome to be maintained with between original data point, structure constraint item:
(5) utilize object function training pattern parameter to draw the parameter coefficient of optimum, utilize the model of training gained to carry out abnormality detection.
Beneficial effects of the present invention is as follows:
First, when processing sensing data, so that model is more sparse, compared with VAR, solution procedure can reduce and select less parametric variable by increasing the constraint of figure canonical.
Second, set up regularization VAR model so that most of coefficient of model becomes 0, reduces the complexity solved, adopt figure canonical bound term, it is possible to find the inherent mutual relation between data, predict initial data better simultaneously.
Accompanying drawing explanation
Fig. 1 institute extracting method block diagram
Detailed description of the invention
Consider the geometry between data, when constructing regularization constraint item, use for reference the thought assumed based on stream: if two data points have close geometric distribution at higher dimensional space, then after dimensionality reduction, in gained space, the two data point also should be close, and stream is assumed to occupy an important position in Data Dimensionality Reduction Algorithm.The arest neighbors figure of data point can be utilized to approach when data stream is unknown, such that it is able to consider that the arest neighbors figure of data point constructs corresponding bound term.If a figure is made up of n summit, one data point of each of which vertex correspondence, consider the relation between fixed point and fixed point simultaneously, the weight matrix W on definition limit is as follows:
Wherein Np(xi) represent xiThe data acquisition system of p nearest neighbor point composition.Definition L=D-W, D are diagonal matrix, and its diagonal element is current line or column vector sum, namelyClaiming L is figure Laplacian Matrix.
In order to overcome the similarity that the obstacle of higher-dimension and overfitting is maintained with between original data point, it is possible to structure constraint item:
The present invention is described in detail below.
The general type of Vector Autoression Models is
xt=xt-1β1+…+xt-pβpt, t=p+1 ..., n (3)
For each t > 1, with X=[xt-1,…,xt-p]TRepresent xtP forward direction sample data, then Vector Autoression Models can be re-expressed as multiple linear regression model:
Y=XB+ ε (4)
Wherein Y=[x2,…,xn]T, B=[β2,…,βn]TIt is the unknown regression coefficient vector of p dimension, ε=[ε2,…,εn]TIt is error vector, and ε~N (0, σ2I), σ2Unknown.
In order to overcome the defect of higher-dimension and overfitting, SVAR model utilizes ElasticNet bound term to be diluted restriction, is actually and utilizes LASSO to return (l1Norm) and ridge regression ((l2Norm) VAR object function is increased bound term so that and expressions below reaches minimum B:
l1Norm item is exactly LASSO penalty term, and it is able to ensure that coefficient matrix B is sparse, even if its most elements is 0 value.l2Norm is ridge regression penalty term, and it is able to ensure that the slickness of coefficient matrix.
Consistent with the general idea of SVAR model, we also take the thought increasing bound term that archetype is limited, the difference is that we not only consider that the obstacle that can overcome higher-dimension and overfitting also keeps the similarity between original data point simultaneously.Such object function is just
Owing to above formula comprises l1Norm constraint item, when B has comprise 0 value time, above formula is non-differentiable, and the Optimization without restriction of standard can not directly be brought application and solve, and the method for coordinate gradient can be utilized to solve about this problem.
First, keep its dependent variable constant, individually more new variables βn.In order to pass through to minimize each βnSolve expression formula (6), then reconstructed error can be expressed again:
Laplce bound term Tr (BTLB) can also again be write as:
So (6) formula just can be expressed as again:
When updating βiTime, keep its dependent variable { βj}j≠iFixing, then just can be optimized by expressions below and solve βi:
Wherein, hi=2 λ2j≠iLiiβi),It is βiJth correlation coefficient.
Solving to be optimized by characteristic symbol searching algorithm and solve for (1.10), thus can obtain optimizedFor each variable βnAll take above-mentioned optimisation strategy to carry out one by one solving optimal solution, then can obtain the optimized coefficients matrix needed
As follows in actual treatment step:
1. initialize: prepare data set, and initialize B.
2. couple each βiCarry out formula iteration
3. utilize characteristic symbol searching algorithm optimizing expression (10)
4. draw the result B of optimum*

Claims (1)

1., based on a sensor abnormality detection method for regularization Vector Autoression Models, comprise the following steps:
(1) set up multiple linear regression model, and determine object function;
(2) sensor acquisition data are utilized;
(3) arest neighbors figure, the figure that set up data point are made up of n summit, one data point of each of which vertex correspondence, it is considered to the relation between fixed point and fixed point, and the weight matrix W on definition limit is as follows:
w i j = 1 , i f x i ∈ N p ( x j ) o r x j ∈ N p ( x i ) 0 , o t h e r w i s e
Wherein Np(xi) represent data point xiThe data acquisition system of p nearest neighbor point composition, definition L=D-W, D are diagonal matrix, and its diagonal element is current line or column vector sum, namelyClaiming L is figure Laplacian Matrix;
(4) similarity for the obstacle of higher-dimension and overfitting can be overcome to be maintained with between original data point, structure constraint item:
R = 1 2 Σ i , j = 1 N | | x i - x j | | 2 2 W i j = Σ j = 1 N x i T x i D i j - Σ i , j = 1 N x i T x j W i j
(5) utilize object function training pattern parameter to draw the parameter coefficient of optimum, utilize the model of training gained to carry out abnormality detection.
CN201610072438.XA 2016-02-02 2016-02-02 Sensor exception detection method based on regularized vector autoregression model Pending CN105740212A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610072438.XA CN105740212A (en) 2016-02-02 2016-02-02 Sensor exception detection method based on regularized vector autoregression model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610072438.XA CN105740212A (en) 2016-02-02 2016-02-02 Sensor exception detection method based on regularized vector autoregression model

Publications (1)

Publication Number Publication Date
CN105740212A true CN105740212A (en) 2016-07-06

Family

ID=56244797

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610072438.XA Pending CN105740212A (en) 2016-02-02 2016-02-02 Sensor exception detection method based on regularized vector autoregression model

Country Status (1)

Country Link
CN (1) CN105740212A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107133642A (en) * 2017-04-25 2017-09-05 东北大学 A kind of priori method for diagnosing faults based on Tennessee Yi Siman processes
CN109949437A (en) * 2019-03-13 2019-06-28 东北大学 Isomeric data based on rarefaction cooperates with industrial method for diagnosing faults
CN111551383A (en) * 2020-05-12 2020-08-18 山东大学 Mechanical state monitoring method and system based on heterogeneous multi-sensors
CN111881413A (en) * 2020-07-28 2020-11-03 中国人民解放军海军航空大学 Multi-source time sequence missing data recovery method based on matrix decomposition
WO2020252673A1 (en) * 2019-06-19 2020-12-24 纪震 Design method for improving detection performance of wearable and stretchable electrochemical sensor
CN113218433A (en) * 2021-03-31 2021-08-06 桂林电子科技大学 Sensor fault detection and data restoration method based on time-varying graph signal processing
CN117494030A (en) * 2024-01-02 2024-02-02 广东力创信息技术有限公司 Abnormal event identification method and related device based on distributed optical fiber acoustic wave sensing

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103207945A (en) * 2013-03-05 2013-07-17 重庆科技学院 Mixed auxiliary variable separation and dimension reduction method based on independent subspace false neighboring point discrimination

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103207945A (en) * 2013-03-05 2013-07-17 重庆科技学院 Mixed auxiliary variable separation and dimension reduction method based on independent subspace false neighboring point discrimination

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
CAI D 等: "Non-negative matrix factorization on manifold", 《IN PROCEEDINGS OF EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING》 *
李亭 等: "时间序列AR模型的正则化及其应用", 《交通科学与工程》 *
王靖华 等: "基于数据包字节长度的线性自回归和支持向量分类机的网络流量预测建模与分析", 《微型电脑应用》 *
肖政宏 等: "无线传感器网络中一种基于流量预测和相关系数的异常检测方法", 《微电子学与计算机》 *
邹柏贤 等: "基于AR模型的网络异常检测", 《微电子学与计算机》 *
陈乾 等: "基于距离和DF-RLS的时间序列异常检测", 《计算机工程》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107133642A (en) * 2017-04-25 2017-09-05 东北大学 A kind of priori method for diagnosing faults based on Tennessee Yi Siman processes
CN109949437A (en) * 2019-03-13 2019-06-28 东北大学 Isomeric data based on rarefaction cooperates with industrial method for diagnosing faults
CN109949437B (en) * 2019-03-13 2021-06-15 东北大学 Heterogeneous data collaborative industrial fault diagnosis method based on sparsification
WO2020252673A1 (en) * 2019-06-19 2020-12-24 纪震 Design method for improving detection performance of wearable and stretchable electrochemical sensor
CN111551383A (en) * 2020-05-12 2020-08-18 山东大学 Mechanical state monitoring method and system based on heterogeneous multi-sensors
CN111881413A (en) * 2020-07-28 2020-11-03 中国人民解放军海军航空大学 Multi-source time sequence missing data recovery method based on matrix decomposition
CN113218433A (en) * 2021-03-31 2021-08-06 桂林电子科技大学 Sensor fault detection and data restoration method based on time-varying graph signal processing
CN117494030A (en) * 2024-01-02 2024-02-02 广东力创信息技术有限公司 Abnormal event identification method and related device based on distributed optical fiber acoustic wave sensing
CN117494030B (en) * 2024-01-02 2024-03-19 广东力创信息技术有限公司 Abnormal event identification method and related device based on distributed optical fiber acoustic wave sensing

Similar Documents

Publication Publication Date Title
CN105740212A (en) Sensor exception detection method based on regularized vector autoregression model
Khodayar et al. Spatio-temporal graph deep neural network for short-term wind speed forecasting
Wang et al. Wind power curve modeling and wind power forecasting with inconsistent data
Rhee et al. Active and semi-supervised learning for object detection with imperfect data
Houari et al. Dimensionality reduction in data mining: A Copula approach
Jacques et al. Funclust: A curves clustering method using functional random variables density approximation
Aswani et al. Regression on manifolds: Estimation of the exterior derivative
An et al. IGAGCN: Information geometry and attention-based spatiotemporal graph convolutional networks for traffic flow prediction
Zhou et al. Multi-view spectral clustering with optimal neighborhood Laplacian matrix
Liu et al. An efficient matrix factorization based low-rank representation for subspace clustering
Deng et al. LSTMED: An uneven dynamic process monitoring method based on LSTM and Autoencoder neural network
Hu et al. Low-rank reconstruction-based autoencoder for robust fault detection
CN114969648A (en) Modal-based fault detection method and system for adaptive adjustment and dimension reduction
Agrawal et al. Community detection in networks using graph embedding
Rong et al. Fault diagnosis by locality preserving discriminant analysis and its kernel variation
Bodyanskiy et al. Adaptive Recovery of Distorted Data Based on Credibilistic Fuzzy Clustering Approach.
Li et al. Structured analysis dictionary learning based on discriminative Fisher pair
You et al. Robust structure low-rank representation in latent space
Bhatnagar et al. Computer model calibration with time series data using deep learning and quantile regression
CN110988631B (en) Partial discharge positioning detection method and device, readable storage medium and electrical equipment
Akbas et al. Network embedding: on compression and learning
Sha et al. Robust kernel principal component analysis and its application in blockage detection at the turn of conveyor belt
CN115544307A (en) Directed graph data feature extraction and expression method and system based on incidence matrix
Wang et al. A fuzzy modeling method via Enhanced Objective Cluster Analysis for designing TSK model
Silva Generalized autoregressive neural network models

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20160706