CN110727669A - A power system sensor data cleaning device and cleaning method - Google Patents
A power system sensor data cleaning device and cleaning method Download PDFInfo
- Publication number
- CN110727669A CN110727669A CN201910959898.8A CN201910959898A CN110727669A CN 110727669 A CN110727669 A CN 110727669A CN 201910959898 A CN201910959898 A CN 201910959898A CN 110727669 A CN110727669 A CN 110727669A
- Authority
- CN
- China
- Prior art keywords
- data
- sensor
- abnormal
- noise
- abnormal data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
- G06F16/215—Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/26—Visual data mining; Browsing structured data
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Testing Or Calibration Of Command Recording Devices (AREA)
- Testing And Monitoring For Control Systems (AREA)
Abstract
本发明属于无线传感器网与神经网络技术领域,具体涉及一种电力系统传感器数据清理装置及清理方法,主要用于解决如何区分由事件发生造成的异常数据和由环境等其它因素造成的异常数据。本发明包括:异常数据检测模块、异常数据分类模块和噪声数据修复模块。本发明构建了一种传感器数据清理模型,用于清理传感器数据,它将传感器数据进行分类,分成正常数据、噪声数据和故障数据,并完成了噪声数据的修复,去除了噪声数据的干扰,用其进行故障诊断分类可以大大增强准确率。该方法具有实时性,能够快速清理实时传送过来的数据;并具有通用性,可以适用于大部分的工业传感器网络。
The invention belongs to the technical field of wireless sensor networks and neural networks, and in particular relates to a power system sensor data cleaning device and a cleaning method, which are mainly used to solve how to distinguish abnormal data caused by events from abnormal data caused by other factors such as the environment. The invention includes: an abnormal data detection module, an abnormal data classification module and a noise data restoration module. The invention constructs a sensor data cleaning model for cleaning sensor data, which classifies the sensor data into normal data, noise data and fault data, completes the restoration of the noise data, removes the interference of the noise data, and uses Its fault diagnosis classification can greatly enhance the accuracy. The method is real-time and can quickly clean up the data transmitted in real-time; it is universal and can be applied to most industrial sensor networks.
Description
技术领域technical field
本发明属于无线传感器网与神经网络技术领域,具体涉及一种电力系统传感器数据清理装置及清理方法,主要用于解决如何区分由事件发生造成的异常数据和由环境等其它因素造成的异常数据。The invention belongs to the technical field of wireless sensor networks and neural networks, and in particular relates to a power system sensor data cleaning device and a cleaning method, which are mainly used to solve how to distinguish abnormal data caused by events from abnormal data caused by other factors such as the environment.
背景技术Background technique
电力系统各种设施的实时状态监测对安全生产管理非常重要。设备的监测数据可用于确定操作设备是否正常运行或发生何种故障。互联网的快速发展使得许多设备被各种类型的传感器自动监控,无线传感器网络技术也已经成功地应用在各大工厂中,传感器数据可以通过网络进行统一的传输,极大地节省了人工成本并改善了实时监控。然而,各种因素,如环境因素,传感器硬件性能,无线传输干扰等,导致监控数据容易出现异常。其次,由于监测设备状态的变化,如出现故障等,也会导致传感器数据出现异常。由于这两种异常有些极为相似,在进行数据清理时会大大降低状态分析和故障检测的精度,这些故障数据和噪声数据都属于异常数据。显然,这些数据对于判断设备是否有故障非常重要。The real-time condition monitoring of various facilities in the power system is very important to the safety production management. The monitoring data of the equipment can be used to determine whether the operating equipment is functioning properly or what kind of failure has occurred. The rapid development of the Internet has made many devices automatically monitored by various types of sensors. Wireless sensor network technology has also been successfully applied in major factories. Sensor data can be uniformly transmitted through the network, which greatly saves labor costs and improves. real time monitoring. However, various factors, such as environmental factors, sensor hardware performance, wireless transmission interference, etc., cause monitoring data to be prone to anomalies. Secondly, due to changes in the status of the monitoring equipment, such as failures, abnormal sensor data can also be caused. Since these two kinds of anomalies are somewhat similar, the accuracy of state analysis and fault detection will be greatly reduced during data cleaning, and these fault data and noise data are abnormal data. Obviously, these data are very important to judge whether the equipment is faulty or not.
由于噪声数据和故障数据往往十分相似,因此,在数据清理期间不应直接平滑或丢弃异常数据。Since noisy and faulty data tend to be quite similar, outliers should not be smoothed or discarded directly during data cleaning.
发明内容SUMMARY OF THE INVENTION
为解决传感器异常数据中的噪声数据和故障数据十分相似导致的用传感器数据进行精度状态分析和故障检测的准确性偏低的问题,本发明提出了一种电力系统传感器数据清理装置及方法,可以分类故障数据与噪声数据,并修复噪声数据,提高精度状态分析和故障检测的准确性。In order to solve the problem that the accuracy of state analysis and fault detection using sensor data is relatively low due to the very similar noise data and fault data in abnormal sensor data, the present invention proposes a power system sensor data cleaning device and method, which can Classify fault data from noisy data, and repair noisy data to improve the accuracy of state analysis and fault detection.
为实现上述发明目的,本发明是采用以下技术方案:In order to realize the above-mentioned purpose of the invention, the present invention adopts the following technical solutions:
一种电力系统传感器异常数据清理装置,包括:异常数据检测模块、异常数据分类模块和噪声数据修复模块;A power system sensor abnormal data cleaning device, comprising: an abnormal data detection module, an abnormal data classification module and a noise data repair module;
异常数据检测模块检测电力系统传感器异常数据;Abnormal data detection module detects abnormal data of power system sensors;
异常数据分类模块用于将传感器异常数据分为噪声数据和故障数据;The abnormal data classification module is used to divide the abnormal data of the sensor into noise data and fault data;
噪声数据修复模块将噪声数据拟合为正常数据,再将噪声数据替换为对应的SDAE拟合的数据,完成修复。The noise data repair module fits the noise data to normal data, and then replaces the noise data with the corresponding SDAE-fitted data to complete the repair.
所述异常数据检测模块利用传感器正常数据对SDAE进行训练,学习传感器正常数据的特征,对传感器正常数据进行拟合,将SDAE拟合数据与原数据的差值的最大值作为判断数据是否正常的阈值;当有新数据输入时,将新数据输入SDAE进行拟合,求出拟合数据与原数据的差值,差值大于阈值的数据判断为异常数据。The abnormal data detection module uses the normal data of the sensor to train SDAE, learns the characteristics of the normal data of the sensor, fits the normal data of the sensor, and uses the maximum value of the difference between the SDAE fitting data and the original data as the normal value of the data. Threshold; when new data is input, the new data is input into SDAE for fitting, and the difference between the fitted data and the original data is obtained, and the data with the difference greater than the threshold is judged as abnormal data.
所述异常数据分类模块用于将传感器数据集X按照时间划分为m个窗口X={L1,...,Lm},计算每个窗口内传感器数据之间的相关度,对于异常数据,找出同时刻所有异常数据并记录这些异常数据所在传感器与传感器i在该窗口内的相关度;当时刻t至少存在w-1个异常数据,且这些异常数据所在传感器时间序列与传感器的时间序列的相关度均大于最小相关度阈值时,认定该数据为故障数据;若不满足该条件,则为噪声数据;建立包括正常数据、故障数据和噪声数据的数据类别矩阵。The abnormal data classification module is used to divide the sensor data set X into m windows X={L 1 , . . . , L m } according to time, and calculate the correlation between the sensor data in each window. , find out all abnormal data at the same time and record the correlation between the sensor where these abnormal data are located and sensor i within the window; there are at least w-1 abnormal data at time t, and the time series of the sensor where these abnormal data are located and the time of the sensor When the correlations of the sequences are all greater than the minimum correlation threshold, the data is identified as fault data; if it does not meet this condition, it is noise data; a data category matrix including normal data, fault data and noise data is established.
所述传感器i与传感器j之间的相关度RTij的计算公式为:The calculation formula of the correlation degree RT ij between the sensor i and the sensor j is:
其中,xij表示第i个传感器的j时刻的数据,X_it是传感器在窗口内第t个数据,X_jt是传感器j在窗口内第t个数据,每个窗口长度为s。Among them, x ij represents the data of the ith sensor at time j, X_it is the t-th data of the sensor in the window, X_jt is the t-th data of the sensor j in the window, and the length of each window is s.
所述噪声数据修复模块用于建立数据矩阵Y={yij},令Y=X;找出数据类别矩阵中kij=1的数据xij,令数据矩阵Y为修复完成的数据集。The noise data repair module is used to establish a data matrix Y={y ij }, let Y=X; find out the data x ij of k ij =1 in the data category matrix, let The data matrix Y is the repaired dataset.
所述的一种电力系统传感器异常数据清理装置的数据清理方法,包括:The data cleaning method of a power system sensor abnormal data cleaning device includes:
将待清理的新数据输入训练好的SDAE,输出拟合后的数据;将拟合的数据与原数据做差,差值超过所规定的阈值的数据为异常数据;Input the new data to be cleaned into the trained SDAE, and output the fitted data; make the difference between the fitted data and the original data, and the data whose difference exceeds the specified threshold is abnormal data;
对每个异常数据,查找同一时刻其他传感器是否存在异常数据,记录在同一时刻出现异常的其他传感器;计算该传感器与记录的其他传感器之间的相关度,若相关度大于规定的阈值,则判定它们之间具有强相关性;若与该传感器具有强相关性的传感器的数量大于规定的数目阈值,则该传感器该时刻的异常数据是故障数据;否则,为噪声数据;For each abnormal data, find out whether there is abnormal data in other sensors at the same time, and record other sensors with abnormality at the same time; calculate the correlation between the sensor and other recorded sensors, if the correlation is greater than the specified threshold, then determine There is a strong correlation between them; if the number of sensors with strong correlation with the sensor is greater than the specified number threshold, the abnormal data of the sensor at this moment is fault data; otherwise, it is noise data;
对每一个噪声数据,找出其对应的SDAE拟合的数据;用拟合数据替换噪声数据,完成修复。For each noise data, find the corresponding SDAE fitting data; replace the noise data with the fitting data to complete the restoration.
所述异常数据检测的具体步骤包括:The specific steps of the abnormal data detection include:
S1:收集传感器正常数据,学习传感器正常数据的特征,对传感器正常数据进行拟合;S1: Collect the normal data of the sensor, learn the characteristics of the normal data of the sensor, and fit the normal data of the sensor;
S2:计算拟合正常数据时SDAE拟合数据与原数据的差值,将差值序列做直方图,画出其正态分布曲线,取置信区间0.99,得出上阈值ThL和下阈值ThU;S2: Calculate the difference between the SDAE fitting data and the original data when fitting normal data, make a histogram of the difference sequence, draw its normal distribution curve, take a confidence interval of 0.99, and obtain the upper threshold Th L and the lower threshold Th U ;
S3:假设新数据为X={xij},其中xij表示第i个传感器的j时刻的数据,将数据输入SDAE,对数据进行拟合,得到拟合数据其中表示第i个传感器的j时刻的拟合数据,将拟合数据与原数据做差,求得差值其中dij表示第i个传感器的j时刻的拟合数据与原数据的差值;S3: Suppose the new data is X={x ij }, where x ij represents the data of the ith sensor at time j, input the data into SDAE, fit the data, and obtain the fitted data in Indicates the fitted data at time j of the i-th sensor, and the fitted data is compared with the original data to obtain the difference where d ij represents the difference between the fitted data at time j of the ith sensor and the original data;
S4:将差值与阈值比较,若ThL<dij<ThU,则xij为正常数据,kij=0,否则,xij为异常数据,kij=1。S4: Compare the difference with the threshold, if Th L <d ij <Th U , then x ij is normal data, and k ij =0; otherwise, x ij is abnormal data, and k ij =1.
所述异常数据分类的具体步骤包括:The specific steps of the abnormal data classification include:
S1:将传感器数据集X按照时间划分为m个窗口X={L1,...,Lm},其中每个窗口S1: Divide the sensor data set X into m windows X={L 1 ,...,L m } according to time, where each window
Lj={X1j,...,Xnj}T,Xij={xi,(j-1)×s+1,...,xi,j×s},L j ={X 1j ,...,X nj } T ,X ij ={xi,(j-1)×s+1,...,xi,j×s},
每个窗口长度为s;The length of each window is s;
S2:计算每个窗口内传感器数据之间的相关度;S2: Calculate the correlation between sensor data in each window;
S3:对于kit=1的异常数据xit,找出t时刻所有的异常数据并记录这些异常数据所在传感器与传感器i在该窗口内的相关度;当时刻t至少存在w-1个异常数据,且这些异常数据所在传感器时间序列与传感器的时间序列的相关度均大于最小相关度阈值时,即Num((RTij>RTmin)&(kjt=1))>w-1,认定该数据为故障数据,令kit=2;若不满足该条件,则为噪声数据,kit=1不变;S3: For the abnormal data x it with k it = 1, find out all abnormal data at time t and record the correlation between the sensor where these abnormal data are located and sensor i within the window; at least w-1 abnormal data exists at time t , and the correlation between the time series of the sensor where these abnormal data are located and the time series of the sensor are all greater than the minimum correlation threshold, that is, Num((RT ij >RT min )&(k jt =1))>w-1, it is determined that this If the data is fault data, let k it =2; if this condition is not met, it is noise data, and k it =1 remains unchanged;
S4:建立数据类别矩阵K={kij},kij∈{0,1,2},kij=0表明数据xij为正常数据,kij=1表明数据xij为噪声数据,kij=2表明数据xij为故障数据。S4: Establish a data category matrix K={k ij }, k ij ∈ {0,1,2}, k ij =0 indicates that the data x ij is normal data, k ij =1 indicates that the data x ij is noise data, k ij =2 indicates that the data x ij is fault data.
所述传感器i与传感器j之间的相关度RTij的计算公式为:The calculation formula of the correlation degree RT ij between the sensor i and the sensor j is:
其中,xij表示第i个传感器的j时刻的数据,X_it是传感器在窗口内第t个数据,X_jt是传感器j在窗口内第t个数据,每个窗口长度为s。Among them, x ij represents the data of the ith sensor at time j, X_it is the t-th data of the sensor in the window, X_jt is the t-th data of the sensor j in the window, and the length of each window is s.
所述噪声数据修复的具体步骤如下:The specific steps of the noise data restoration are as follows:
S1:建立数据矩阵Y={yij},令Y=X;S1: establish a data matrix Y={y ij }, let Y=X;
S2:找出数据类别矩阵中kij=1的数据xij,令 S2: Find the data x ij with k ij =1 in the data category matrix, let
S3:使用数据矩阵Y作为修复完成的数据集。S3: Use the data matrix Y as the inpainted dataset.
本发明具有如下优点及有益效果:The present invention has the following advantages and beneficial effects:
本发明构建了一种传感器数据清理模型,用于清理传感器数据,它将传感器数据进行分类,分成正常数据、噪声数据和故障数据,并完成了噪声数据的修复,去除了噪声数据的干扰,用其进行故障诊断分类可以大大增强准确率。该方法具有实时性,能够快速清理实时传送过来的数据。并具有通用性,可以适用于大部分的工业传感器网络。The invention constructs a sensor data cleaning model for cleaning sensor data, which classifies the sensor data into normal data, noise data and fault data, completes the restoration of the noise data, removes the interference of the noise data, and uses Its fault diagnosis classification can greatly enhance the accuracy. The method is real-time and can quickly clean up the data transmitted in real time. It is universal and can be applied to most industrial sensor networks.
附图说明Description of drawings
为了便于本领域普通技术人员理解和实施本发明,下面结合附图及具体实施例对本发明作进一步的详细描述,但应当理解为本发明的保护范围并不受具体实施方式的限制。In order to facilitate the understanding and implementation of the present invention by those of ordinary skill in the art, the present invention is further described in detail below with reference to the accompanying drawings and specific embodiments, but it should be understood that the protection scope of the present invention is not limited by the specific embodiments.
图1为本发明实施例的方法流程图;1 is a flow chart of a method according to an embodiment of the present invention;
图2为本发明将5个传感器时间序列数据划分为几个窗口的示意图。FIG. 2 is a schematic diagram of dividing five sensor time series data into several windows according to the present invention.
具体实施方式Detailed ways
本发明是一种电力系统传感器数据清理装置及清理方法,如图1所示,图1为本发明实施例的方法流程图。The present invention is a power system sensor data cleaning device and cleaning method. As shown in FIG. 1 , FIG. 1 is a flowchart of a method according to an embodiment of the present invention.
本发明电力系统传感器数据清理装置分为三个阶段:异常数据检测模块、异常数据分类模块和噪声数据修复模块。这三个阶段从前至后依次运行,完成数据清理的任务。The power system sensor data cleaning device of the present invention is divided into three stages: an abnormal data detection module, an abnormal data classification module and a noise data restoration module. These three stages run sequentially from front to back to complete the task of data cleaning.
异常数据检测模块采用三层叠加去噪自编码机SDAE检测电力系统传感器异常数据;The abnormal data detection module adopts three-layer superimposed denoising self-encoder SDAE to detect abnormal data of power system sensors;
异常数据分类模块用于将传感器异常数据分为噪声数据和故障数据,利用传感器数据之间的相关性对异常数据分类,同一时刻的多个传感器数据同时被判定为异常数据则将其认定为故障数据,否则为噪声数据;The abnormal data classification module is used to divide the abnormal data of the sensor into noise data and fault data, and use the correlation between the sensor data to classify the abnormal data. If multiple sensor data at the same time are judged as abnormal data at the same time, it will be regarded as a fault. data, otherwise noise data;
噪声数据修复模块利用SDAE将噪声数据拟合为正常数据,再将噪声数据替换为对应的SDAE拟合的数据,完成修复。The noise data repair module uses SDAE to fit the noise data to normal data, and then replaces the noise data with the corresponding SDAE-fitted data to complete the repair.
所述异常数据检测阶段负责将传感器数据中异常数据检测出来;通过应用传感器正常数据对三层叠加去噪自编码机SDAE进行训练,可以令三层叠加去噪自编码机SDAE学习到传感器数据的特征,并可以对传感器数据进行拟合。记录三层叠加去噪自编码机SDAE拟合正常数据时的差值的最大值,作为判断正常、异常数据的阈值。当有新数据到来时,将数据输入SDAE进行拟合,求出拟合数据与原数据的差值,差值大于阈值的数据判断为异常数据。The abnormal data detection stage is responsible for detecting abnormal data in the sensor data; by applying the normal data of the sensor to train the three-layer superimposed denoising auto-encoder SDAE, the three-layer superimposed denoising auto-encoder SDAE can learn the sensor data. features and can fit the sensor data. The maximum value of the difference between the three-layer superposition denoising auto-encoder SDAE fitting normal data is recorded as the threshold for judging normal and abnormal data. When new data arrives, the data is input into SDAE for fitting, and the difference between the fitted data and the original data is obtained, and the data with the difference greater than the threshold value is judged as abnormal data.
所述异常数据分类阶段负责将异常数据分为噪声数据和故障数据,通过利用传感器数据之间的相关性,将之前检测出的异常数据进行分类,同一时刻的多个相关传感器数据同时被判定为异常数据则将其认定为故障数据,否则为噪声数据。The abnormal data classification stage is responsible for classifying abnormal data into noise data and fault data. By using the correlation between sensor data, the abnormal data detected before is classified, and multiple relevant sensor data at the same time are simultaneously determined as Abnormal data is regarded as fault data, otherwise it is noise data.
所述噪声数据修复阶段负责将上阶段判别出来的噪声数据修复为正常数据,令其不影响后续的数据分析处理工作。通过利用三层叠加去噪自编码机SDAE可以将噪声拟合为正常数据的特性,将噪声数据替换为对应的三层叠加去噪自编码机SDAE拟合的数据,完成修复。The noise data repairing stage is responsible for repairing the noise data discriminated in the previous stage into normal data, so that it does not affect subsequent data analysis and processing work. By using the three-layer superposition denoising auto-encoder SDAE, the noise can be fitted to the characteristics of normal data, and the noise data can be replaced with the data fitted by the corresponding three-layer superposition denoising auto-encoder SDAE to complete the restoration.
本发明一种电力系统传感器异常数据清理装置的数据清理方法,将数据进行分类,然后,修复噪声数据,同时保留正常数据和故障数据以分析设备故障。具体包括:The present invention is a data cleaning method of a power system sensor abnormal data cleaning device. The data is classified, and then the noise data is repaired, while the normal data and the fault data are retained to analyze the equipment failure. Specifically include:
异常数据检测,用正常传感器数据训练三层叠加去噪自编码机SDAE;将待清理的新数据输入训练好的SDAE,输出拟合后的数据;将拟合的数据与原数据做差,差值超过所规定的阈值的数据为异常数据;For abnormal data detection, use normal sensor data to train three-layer superimposed denoising auto-encoder SDAE; input the new data to be cleaned into the trained SDAE, and output the fitted data; make the difference between the fitted data and the original data, the difference is Data whose value exceeds the specified threshold is abnormal data;
异常数据分类:对每个异常数据,查找同一时刻其他传感器是否存在异常数据,记录在同一时刻出现异常的其他传感器;计算该传感器与记录的其他传感器之间的相关度,若相关度大于规定的阈值,则判定它们之间具有强相关性;若与该传感器具有强相关性的传感器的数量大于规定的数目阈值,则该传感器该时刻的异常数据是故障数据;否则,为噪声数据;Abnormal data classification: for each abnormal data, find out whether there is abnormal data in other sensors at the same time, and record other sensors with abnormality at the same time; calculate the correlation between the sensor and other recorded sensors, if the correlation is greater than the specified If the number of sensors with strong correlation with the sensor is greater than the specified number threshold, the abnormal data of the sensor at this moment is fault data; otherwise, it is noise data;
异常数据修复:对每一个噪声数据,找出其对应的SDAE拟合的数据;用拟合数据替换噪声数据,完成修复。Abnormal data repair: For each noise data, find the corresponding SDAE fitting data; replace the noise data with the fitted data to complete the repair.
所述异常数据检测模块的具体步骤包括:The specific steps of the abnormal data detection module include:
提前收集传感器正常数据,用于训练三层叠加去噪自编码机SDAE,可以令三层叠加去噪自编码机SDAE学习到传感器正常数据的特征,并可以对传感器正常数据进行拟合。The normal sensor data is collected in advance for training the three-layer denoising auto-encoder SDAE, which can make the three-layer denoising auto-encoder SDAE learn the characteristics of the sensor's normal data, and can fit the sensor's normal data.
记录三层叠加去噪自编码机SDAE拟合正常数据时的差值的最大值,作为判断正常、异常数据的阈值。由于有误差的存在,我们不能直接取最大值,首先计算拟合正常数据时三层叠加去噪自编码机SDAE拟合数据与原数据的差值,将差值序列做直方图,画出其正态分布曲线,取置信区间0.99,得出上阈值ThL和下阈值ThU。The maximum value of the difference between the three-layer superposition denoising auto-encoder SDAE fitting normal data is recorded as the threshold for judging normal and abnormal data. Due to the existence of errors, we cannot directly take the maximum value. First, calculate the difference between the SDAE fitting data of the three-layer denoising auto-encoder and the original data when fitting normal data, make a histogram of the difference sequence, and draw its The normal distribution curve, taking the confidence interval of 0.99, obtains the upper threshold Th L and the lower threshold Th U .
假设新数据为X={xij},其中xij表示第i个传感器的j时刻的数据,将数据输入三层叠加去噪自编码机SDAE,三层叠加去噪自编码机SDAE便可根据之前学习的特征对数据进行拟合,得到拟合数据其中表示第i个传感器的j时刻的拟合数据。将拟合数据与原数据做差,求得差值其中dij表示第i个传感器的j时刻的拟合数据与原数据的差值。Assuming that the new data is X={x ij }, where x ij represents the data of the ith sensor at time j, input the data into the three-layer denoising auto-encoder SDAE, and the three-layer denoising auto-encoder SDAE can be based on The previously learned features are fitted to the data to obtain the fitted data in Represents the fitted data at time j of the ith sensor. Calculate the difference between the fitted data and the original data where d ij represents the difference between the fitted data of the ith sensor at time j and the original data.
将差值与阈值比较,若ThL<dij<ThU,则xij为正常数据,kij=0,否则,xij为异常数据,kij=1。Compare the difference with the threshold, if Th L <d ij <Th U , then x ij is normal data, and k ij =0; otherwise, x ij is abnormal data, and k ij =1.
所述异常数据分类模块的具体步骤包括:The specific steps of the abnormal data classification module include:
将传感器数据集X按照时间划分为m个窗口X={L1,...,Lm},其中每个窗口Lj={X1j,...,Xnj}T,Xij={xi,(j-1)×s+1,...,xi,j×s},每个窗口长度为s,图2所示为对5个传感器数据进行的窗口划分,图中窗口长度为200。可以看出L3窗口内各传感器之间的相关度明显与其他窗口的相关度不同,因此,划分窗口后再进行相关度计算是十分有必要的。Divide the sensor dataset X into m windows X={L 1 ,...,L m } according to time, where each window L j ={X 1j ,...,X nj } T ,X ij ={ xi,(j-1)×s+1,...,xi,j×s}, the length of each window is s, Figure 2 shows the window division for 5 sensor data, the window length in the figure is 200. It can be seen that the correlation between the sensors in the L3 window is obviously different from that of other windows. Therefore, it is necessary to calculate the correlation after dividing the window.
计算每个窗口内传感器数据之间的相关度,传感器i与传感器j之间的相关度RTij的计算公式为:To calculate the correlation between sensor data in each window, the calculation formula of the correlation RT ij between sensor i and sensor j is:
对于kit=1的异常数据xit,找出t时刻所有的异常数据并记录这些异常数据所在传感器与传感器i在该窗口内的相关度。只有在时刻t,至少存在w-1个异常数据,且这些异常数据所在传感器时间序列与传感器的时间序列的相关度均大于最小相关度阈值时,即For the abnormal data x it with k it =1, find out all the abnormal data at time t and record the correlation between the sensor where these abnormal data are located and the sensor i within the window. Only at time t, there are at least w-1 abnormal data, and the correlation between the time series of the sensor where these abnormal data are located and the time series of the sensor are all greater than the minimum correlation threshold, that is
Num((RTij>RTmin)&(kjt=1))>w-1,认定该数据为故障数据,令kit=2;若不满足该条件,则为噪声数据,kit=1不变。Num((RT ij >RT min )&(k jt =1))>w-1, consider the data as fault data, let k it =2; if this condition is not met, it is noise data, k it =1 constant.
自此,数据类别矩阵K={kij},kij∈{0,1,2}建立完毕,异常数据的分类完成:kij=0表明数据xij为正常数据,kij=1表明数据xij为噪声数据,kij=2表明数据xij为噪声数据。Since then, the data category matrix K={k ij }, k ij ∈{0,1,2} has been established, and the classification of abnormal data has been completed: k ij =0 indicates that the data x ij is normal data, and k ij =1 indicates that the data x ij is noise data, and k ij =2 indicates that the data x ij is noise data.
所述噪声数据修复模块的具体步骤包括:The specific steps of the noise data repair module include:
建立数据矩阵Y={yij},令Y=X。找出数据类别矩阵中kij=1的数据xij,令矩阵Y便为修复完成的数据集。Create a data matrix Y={y ij }, let Y=X. Find the data x ij with k ij = 1 in the data category matrix, let The matrix Y is the inpainted dataset.
本领域内的技术人员应明白:本申请的实施例可提供为方法、系统、或计算机程序产品。因此,本申请可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本申请可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。It should be understood by those skilled in the art that the embodiments of the present application may be provided as a method, a system, or a computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein.
本申请是参照根据本申请实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the present application. It will be understood that each flow and/or block in the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general purpose computer, special purpose computer, embedded processor or other programmable data processing device to produce a machine such that the instructions executed by the processor of the computer or other programmable data processing device produce Means for implementing the functions specified in a flow or flow of a flowchart and/or a block or blocks of a block diagram.
这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory result in an article of manufacture comprising instruction means, the instructions The apparatus implements the functions specified in the flow or flow of the flowcharts and/or the block or blocks of the block diagrams.
这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded on a computer or other programmable data processing device to cause a series of operational steps to be performed on the computer or other programmable device to produce a computer-implemented process such that The instructions provide steps for implementing the functions specified in the flow or blocks of the flowcharts and/or the block or blocks of the block diagrams.
最后应当说明的是:以上实施例仅用以说明本发明的技术方案而非对其限制,尽管参照上述实施例对本发明进行了详细的说明,所属领域的普通技术人员应当理解:依然可以对本发明的具体实施方式进行修改或者等同替换,而未脱离本发明精神和范围的任何修改或者等同替换,其均应涵盖在本发明的权利要求保护范围之内。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present invention rather than to limit them. Although the present invention has been described in detail with reference to the above embodiments, those of ordinary skill in the art should understand that: the present invention can still be Modifications or equivalent replacements are made to the specific embodiments of the present invention, and any modifications or equivalent replacements that do not depart from the spirit and scope of the present invention shall be included within the protection scope of the claims of the present invention.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910959898.8A CN110727669B (en) | 2019-10-10 | 2019-10-10 | A power system sensor data cleaning device and cleaning method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910959898.8A CN110727669B (en) | 2019-10-10 | 2019-10-10 | A power system sensor data cleaning device and cleaning method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110727669A true CN110727669A (en) | 2020-01-24 |
CN110727669B CN110727669B (en) | 2023-10-13 |
Family
ID=69220927
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910959898.8A Active CN110727669B (en) | 2019-10-10 | 2019-10-10 | A power system sensor data cleaning device and cleaning method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110727669B (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112528227A (en) * | 2020-11-18 | 2021-03-19 | 浙江蓝卓工业互联网信息技术有限公司 | Sensor abnormal data identification method based on mathematical statistics |
CN114048811A (en) * | 2021-11-10 | 2022-02-15 | 中国核动力研究设计院 | Wireless sensor node fault diagnosis method and device based on deep learning |
CN116842459A (en) * | 2023-09-01 | 2023-10-03 | 国网信息通信产业集团有限公司 | Electric energy metering fault diagnosis method and diagnosis terminal based on small sample learning |
CN118427539A (en) * | 2024-07-04 | 2024-08-02 | 陕西苏秦佳宏药业有限公司 | Waste water management control system and method based on data processing |
CN119124096A (en) * | 2024-09-12 | 2024-12-13 | 重庆交通大学 | Method and system for monitoring subsidence of power tower |
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104537034A (en) * | 2014-12-22 | 2015-04-22 | 国家电网公司 | Electric-transmission-and-transformation-equipment state monitoring data cleaning method based on time series analysis |
CN105512474A (en) * | 2015-12-02 | 2016-04-20 | 国网山东省电力公司电力科学研究院 | Transformer state monitoring data anomaly detection method |
US20160112602A1 (en) * | 2014-10-15 | 2016-04-21 | Ricoh Company, Ltd. | Abnormal noise operation control device, image forming apparatus, abnormal noise operation control method, and non-transitory recording medium |
CN106909664A (en) * | 2017-02-28 | 2017-06-30 | 国网福建省电力有限公司 | A kind of power equipment data stream failure recognition methods |
CN108334907A (en) * | 2018-02-09 | 2018-07-27 | 哈尔滨工业大学(威海) | A kind of complex equipment point method for detecting abnormality and system based on deep learning |
CN109061341A (en) * | 2018-07-10 | 2018-12-21 | 杭州安脉盛智能技术有限公司 | Kalman filtering transformer fault prediction technique and system neural network based |
CN109115501A (en) * | 2018-07-12 | 2019-01-01 | 哈尔滨工业大学(威海) | A kind of Civil Aviation Engine Gas path fault diagnosis method based on CNN and SVM |
CN109190473A (en) * | 2018-07-29 | 2019-01-11 | 国网上海市电力公司 | The application of a kind of " machine vision understanding " in remote monitoriong of electric power |
CN109491339A (en) * | 2018-11-16 | 2019-03-19 | 国网江苏省电力有限公司盐城供电分公司 | A kind of operating condition of transformer station equipment early warning system based on big data |
CN109885951A (en) * | 2019-02-28 | 2019-06-14 | 中科云创(厦门)科技有限公司 | Equipment fault diagnosis method and device |
-
2019
- 2019-10-10 CN CN201910959898.8A patent/CN110727669B/en active Active
Patent Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160112602A1 (en) * | 2014-10-15 | 2016-04-21 | Ricoh Company, Ltd. | Abnormal noise operation control device, image forming apparatus, abnormal noise operation control method, and non-transitory recording medium |
CN104537034A (en) * | 2014-12-22 | 2015-04-22 | 国家电网公司 | Electric-transmission-and-transformation-equipment state monitoring data cleaning method based on time series analysis |
CN105512474A (en) * | 2015-12-02 | 2016-04-20 | 国网山东省电力公司电力科学研究院 | Transformer state monitoring data anomaly detection method |
CN106909664A (en) * | 2017-02-28 | 2017-06-30 | 国网福建省电力有限公司 | A kind of power equipment data stream failure recognition methods |
CN108334907A (en) * | 2018-02-09 | 2018-07-27 | 哈尔滨工业大学(威海) | A kind of complex equipment point method for detecting abnormality and system based on deep learning |
CN109061341A (en) * | 2018-07-10 | 2018-12-21 | 杭州安脉盛智能技术有限公司 | Kalman filtering transformer fault prediction technique and system neural network based |
CN109115501A (en) * | 2018-07-12 | 2019-01-01 | 哈尔滨工业大学(威海) | A kind of Civil Aviation Engine Gas path fault diagnosis method based on CNN and SVM |
CN109190473A (en) * | 2018-07-29 | 2019-01-11 | 国网上海市电力公司 | The application of a kind of " machine vision understanding " in remote monitoriong of electric power |
CN109491339A (en) * | 2018-11-16 | 2019-03-19 | 国网江苏省电力有限公司盐城供电分公司 | A kind of operating condition of transformer station equipment early warning system based on big data |
CN109885951A (en) * | 2019-02-28 | 2019-06-14 | 中科云创(厦门)科技有限公司 | Equipment fault diagnosis method and device |
Non-Patent Citations (4)
Title |
---|
J.H.LIU 等: "A Robust Acoustic Feature Extraction Approach Based on Stacked Denoising Autoencoder" * |
李星南 等: "基于孤立森林算法和BP神经网络算法的电力运维数据清洗方法" * |
杨挺 等: "人工智能在电力系统及综合能源系统中的应用综述" * |
赵敏 等: "基于自编码的电力信息通信资产数据清洗算法" * |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112528227A (en) * | 2020-11-18 | 2021-03-19 | 浙江蓝卓工业互联网信息技术有限公司 | Sensor abnormal data identification method based on mathematical statistics |
CN114048811A (en) * | 2021-11-10 | 2022-02-15 | 中国核动力研究设计院 | Wireless sensor node fault diagnosis method and device based on deep learning |
CN114048811B (en) * | 2021-11-10 | 2024-10-25 | 中国核动力研究设计院 | Wireless sensor node fault diagnosis method and device based on deep learning |
CN116842459A (en) * | 2023-09-01 | 2023-10-03 | 国网信息通信产业集团有限公司 | Electric energy metering fault diagnosis method and diagnosis terminal based on small sample learning |
CN116842459B (en) * | 2023-09-01 | 2023-11-21 | 国网信息通信产业集团有限公司 | An electric energy metering fault diagnosis method and diagnostic terminal based on small sample learning |
CN118427539A (en) * | 2024-07-04 | 2024-08-02 | 陕西苏秦佳宏药业有限公司 | Waste water management control system and method based on data processing |
CN118427539B (en) * | 2024-07-04 | 2024-08-30 | 陕西苏秦佳宏药业有限公司 | Waste water management control system and method based on data processing |
CN119124096A (en) * | 2024-09-12 | 2024-12-13 | 重庆交通大学 | Method and system for monitoring subsidence of power tower |
Also Published As
Publication number | Publication date |
---|---|
CN110727669B (en) | 2023-10-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110727669A (en) | A power system sensor data cleaning device and cleaning method | |
WO2021179574A1 (en) | Root cause localization method, device, computer apparatus, and storage medium | |
CA2931624A1 (en) | Systems and methods for event detection and diagnosis | |
CN111563524A (en) | Multi-station fusion system operation situation abnormity monitoring and alarm combining method | |
WO2025039538A1 (en) | Fault diagnosis method and apparatus, device, medium, and product | |
CN117170304B (en) | PLC remote monitoring control method and system based on industrial Internet of things | |
CN111314110B (en) | A Fault Early Warning Method for Distributed System | |
CN118656763B (en) | Safety management system and method for gas engineering construction | |
CN111949496B (en) | A data detection method and device | |
CN118334563A (en) | A method and system for detecting and analyzing abnormality of power equipment under monitoring environment | |
CN118885941A (en) | A fault prediction method based on 360° dynamic image detection system | |
CN118312896A (en) | Fault diagnosis expert system based on large language model and implementation method | |
CN118821835A (en) | A data processing, analysis and decision-making method and system based on large model intelligent agent | |
CN111380687B (en) | Fault diagnosis method of industrial motor bearing based on multi-local model decision fusion | |
CN113496332B (en) | Industrial Internet fault prediction method and system | |
CN114418042B (en) | Industrial robot operation trend diagnosis method based on cluster analysis | |
CN116049642A (en) | Fault diagnosis method, system, electronic equipment and computer storage medium | |
CN119782390A (en) | A method and system for generating substation inspection report | |
CN115526407A (en) | Power grid big data safety detection early warning method and system based on causal machine learning | |
CN114779739A (en) | Fault monitoring method for industrial process under cloud-side-device collaboration based on probabilistic graphical model | |
CN105187821A (en) | Video quality intelligent diagnostic analysis system | |
CN118859037A (en) | A method for power equipment fault analysis based on multi-source data fusion | |
CN113987697A (en) | A method for fault diagnosis of mechanical equipment based on vibration data | |
CN106228248B (en) | A System Automatic Fault Diagnosis Method Based on Fuzzy FMEA Analysis | |
CN115114124A (en) | Host risk assessment method and assessment device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
EE01 | Entry into force of recordation of patent licensing contract | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20200124 Assignee: NARI INFORMATION & COMMUNICATION TECHNOLOGY Co. Assignor: SHENYANG INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES Contract record no.: X2025980009706 Denomination of invention: A power system sensor data cleaning device and cleaning method Granted publication date: 20231013 License type: Common License Record date: 20250529 Application publication date: 20200124 Assignee: Shenyang Xinlongyuan Electric Meter Instrument Co.,Ltd. Assignor: SHENYANG INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES Contract record no.: X2025980009705 Denomination of invention: A power system sensor data cleaning device and cleaning method Granted publication date: 20231013 License type: Common License Record date: 20250529 Application publication date: 20200124 Assignee: Shenyang Bingkun Technology Co.,Ltd. Assignor: SHENYANG INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES Contract record no.: X2025980009703 Denomination of invention: A power system sensor data cleaning device and cleaning method Granted publication date: 20231013 License type: Common License Record date: 20250529 |