CN110286656B - False alarm filtering method and device for tolerance of error data - Google Patents

False alarm filtering method and device for tolerance of error data Download PDF

Info

Publication number
CN110286656B
CN110286656B CN201910374187.4A CN201910374187A CN110286656B CN 110286656 B CN110286656 B CN 110286656B CN 201910374187 A CN201910374187 A CN 201910374187A CN 110286656 B CN110286656 B CN 110286656B
Authority
CN
China
Prior art keywords
similarity
time sequence
suspected
data
alarm
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910374187.4A
Other languages
Chinese (zh)
Other versions
CN110286656A (en
Inventor
宋韶旭
刘志成
王建民
王晨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tsinghua University
Original Assignee
Tsinghua University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tsinghua University filed Critical Tsinghua University
Priority to CN201910374187.4A priority Critical patent/CN110286656B/en
Publication of CN110286656A publication Critical patent/CN110286656A/en
Application granted granted Critical
Publication of CN110286656B publication Critical patent/CN110286656B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00Programme-control systems
    • G05B19/02Programme-control systems electric
    • G05B19/418Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS] or computer integrated manufacturing [CIM]
    • G05B19/4184Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS] or computer integrated manufacturing [CIM] characterised by fault tolerance, reliability of production system
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00Program-control systems
    • G05B2219/30Nc systems
    • G05B2219/31From computer integrated manufacturing till monitoring
    • G05B2219/31088Network communication between supervisor and cell, machine group
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Manufacturing & Machinery (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Automation & Control Theory (AREA)
  • Alarm Systems (AREA)

Abstract

The embodiment of the invention provides a false alarm filtering method and device for tolerance of error data. The method comprises the steps of obtaining time sequence data corresponding to equipment measured by a sensor, and determining a suspected time sequence included in the time sequence data according to a state parameter value corresponding to normal operation of the equipment; according to a preset similarity matching method, calculating the similarity between each suspected time sequence and an alarm time sequence corresponding to a historical true alarm, determining the suspected time sequence with the corresponding similarity higher than a similarity threshold as the true alarm, and calculating the abnormality of each suspected time sequence with the corresponding similarity not higher than the similarity threshold according to a preset abnormality factor detection algorithm; and judging whether the suspected time sequence is a true alarm or not according to the abnormal degree of each corresponding suspected time sequence of which the similarity is not higher than the similarity threshold. The embodiment of the invention can filter out most false alarms caused by error data and improve the alarm accuracy.

Description

一种错误数据容忍的虚警过滤方法和装置False alarm filtering method and device for tolerance of erroneous data

技术领域technical field

本发明涉及计算机技术领域,尤其涉及一种错误数据容忍的虚警过滤方法和装置。The present invention relates to the technical field of computers, and in particular, to a false alarm filtering method and device for tolerance of erroneous data.

背景技术Background technique

在工业领域,企业为了监控设备的运行状态,会在设备上安装许多传感器,根据传感器测量得到的数据来判断设备是否正常工作。而传感器按照时间顺序测量得到的数据就是所谓的时序数据。企业利用传感器测量得到的时序数据,结合设备正常工作时的状态参数,按照一定的逻辑来判断设备的运行状态。In the industrial field, in order to monitor the running status of the equipment, enterprises will install many sensors on the equipment, and judge whether the equipment is working normally according to the data measured by the sensors. The data measured by the sensor in time sequence is the so-called time series data. Enterprises use the time series data measured by sensors, combined with the state parameters of the equipment during normal operation, to judge the operating state of the equipment according to a certain logic.

但是,由于传感器本身的一些因素,例如传感器可能损坏了,或是传感器本身测量的不准确,或是接收以及保存传感器数据的过程中数据被污染了等等,会出现测量得到的部分数据是错误数据,基于该错误数据判断设备的运行状态就可能得到一些“虚警”。所谓的“虚警”是指设备是正常工作的,但是错误数据却不在该设备正常工作的状态参数范围内,因此会得到设备此时不正常工作的警报。而传感器是在持续不断地产生数据,这些数据中的错误数据也会不断地产生虚警,这些虚警给企业的生产监控带来了很大的干扰。However, due to some factors of the sensor itself, such as the sensor may be damaged, or the measurement of the sensor itself is inaccurate, or the data is polluted in the process of receiving and saving sensor data, etc., some of the measured data may be wrong. Based on the error data, it is possible to obtain some "false alarms" by judging the operating status of the equipment. The so-called "false alarm" means that the device is working normally, but the wrong data is not within the range of the state parameters of the device's normal work, so an alarm that the device is not working properly at this time will be obtained. The sensor is continuously generating data, and the erroneous data in these data will also continuously generate false alarms. These false alarms have brought great interference to the production monitoring of enterprises.

因此,如何减少因错误数据而产生的虚警,成为业界亟待解决的技术问题。Therefore, how to reduce false alarms caused by erroneous data has become an urgent technical problem to be solved in the industry.

发明内容SUMMARY OF THE INVENTION

针对现有技术存在的问题,本发明实施例提供一种错误数据容忍的虚警过滤方法和装置。Aiming at the problems existing in the prior art, the embodiments of the present invention provide a false alarm filtering method and device tolerant to erroneous data.

第一方面,本发明实施例提供一种错误数据容忍的虚警过滤方法,包括:In a first aspect, an embodiment of the present invention provides a false alarm filtering method for tolerance of erroneous data, including:

获取传感器测量得到的设备对应的时序数据,根据所述设备正常运行所对应的状态参数值,确定所述时序数据中包括的疑似时间序列;Acquire the time series data corresponding to the device measured by the sensor, and determine the suspected time series included in the time series data according to the state parameter value corresponding to the normal operation of the device;

根据预设的相似度匹配方法,计算每一所述疑似时间序列与历史真警对应的报警时间序列之间的相似度,将对应的相似度高于相似度阈值的疑似时间序列确定为真警,根据预设的异常因子检测算法计算每个对应的相似度不高于所述相似度阈值的疑似时间序列的异常度;According to the preset similarity matching method, the similarity between each of the suspected time series and the alarm time series corresponding to the historical real alarm is calculated, and the corresponding suspected time series whose similarity is higher than the similarity threshold is determined as a real alarm , calculate the abnormality of each suspected time series whose similarity is not higher than the similarity threshold according to the preset abnormality factor detection algorithm;

根据每个对应的相似度不高于所述相似度阈值的疑似时间序列的异常度,对所述疑似时间序列是否为真警进行判断。Whether the suspected time series is a true alarm is judged according to the abnormality of each suspected time series whose similarity is not higher than the similarity threshold.

第二方面,本发明实施例提供一种错误数据容忍的虚警过滤装置,包括:In a second aspect, an embodiment of the present invention provides a false alarm filtering device for tolerance of erroneous data, including:

第一处理模块,用于获取传感器测量得到的设备对应的时序数据,根据所述设备正常运行所对应的状态参数值,确定所述时序数据中包括的疑似时间序列;a first processing module, configured to obtain time series data corresponding to the device measured by the sensor, and determine the suspected time series included in the time series data according to the state parameter value corresponding to the normal operation of the device;

第二处理模块,用于根据预设的相似度匹配方法,计算每一所述疑似时间序列与历史真警对应的报警时间序列之间的相似度,将对应的相似度高于相似度阈值的疑似时间序列确定为真警,根据预设的异常因子检测算法计算每个对应的相似度不高于所述相似度阈值的疑似时间序列的异常度;The second processing module is configured to calculate the similarity between each of the suspected time series and the alarm time series corresponding to the historical real alarm according to the preset similarity matching method, and calculate the similarity between the corresponding similarity higher than the similarity threshold. The suspected time series is determined to be a true alarm, and the abnormality of each suspected time series whose corresponding similarity is not higher than the similarity threshold is calculated according to a preset abnormal factor detection algorithm;

第三处理模块,用于根据每个对应的相似度不高于所述相似度阈值的疑似时间序列的异常度,对所述疑似时间序列是否为真警进行判断。The third processing module is configured to judge whether the suspected time series is a true alarm according to the abnormality of each corresponding suspected time series whose similarity is not higher than the similarity threshold.

第三方面,本发明实施例提供一种电子设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述程序时实现如第一方面所述的错误数据容忍的虚警过滤方法的步骤。In a third aspect, an embodiment of the present invention provides an electronic device, including a memory, a processor, and a computer program stored in the memory and running on the processor, the processor implementing the program as described in the first aspect when the processor executes the program The steps of the false alarm filtering method described in erroneous data tolerance.

第四方面,本发明实施例提供一种非暂态计算机可读存储介质,其上存储有计算机程序,该计算机程序被处理器执行时实现如第一方面所述的错误数据容忍的虚警过滤方法的步骤。In a fourth aspect, an embodiment of the present invention provides a non-transitory computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, the false alarm filtering for tolerance of error data as described in the first aspect is implemented steps of the method.

本发明实施例提供的错误数据容忍的虚警过滤方法和装置,通过将可能产生报警的疑似时间序列与历史真警做相似度匹配以及计算其异常度来过滤错误数据引起的虚警,能够过滤掉绝大部分错误数据引起的虚警,提高报警的准确率。The method and device for filtering false alarms tolerant to false data provided by the embodiments of the present invention filter false alarms caused by false data by matching the suspected time series that may generate alarms with historical true alarms and calculating their abnormality to filter false alarms. False alarms caused by most of the wrong data are eliminated, and the accuracy of alarms is improved.

附图说明Description of drawings

为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作一简单地介绍,显而易见地,下面描述中的附图是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the following briefly introduces the accompanying drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description These are some embodiments of the present invention. For those of ordinary skill in the art, other drawings can also be obtained according to these drawings without creative efforts.

图1为本发明一实施例提供的错误数据容忍的虚警过滤方法的流程示意图;FIG. 1 is a schematic flowchart of a false alarm filtering method for tolerance of erroneous data provided by an embodiment of the present invention;

图2为本发明另一实施例提供的错误数据容忍的虚警过滤方法的流程示意图;2 is a schematic flowchart of a false alarm filtering method for tolerance of wrong data provided by another embodiment of the present invention;

图3为本发明又一实施例提供的错误数据容忍的虚警过滤方法的流程示意图;3 is a schematic flowchart of a false alarm filtering method for tolerance of wrong data provided by another embodiment of the present invention;

图4为本发明实施例提供的错误数据容忍的虚警过滤装置的结构示意图;4 is a schematic structural diagram of a false-alarm filtering device tolerant to erroneous data provided by an embodiment of the present invention;

图5为本发明实施例提供的电子设备实体结构示意图。FIG. 5 is a schematic diagram of a physical structure of an electronic device according to an embodiment of the present invention.

具体实施方式Detailed ways

为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purposes, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments These are some embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

图1为本发明一实施例提供的错误数据容忍的虚警过滤方法的流程示意图,如图1所示,本发明实施例针对错误数据判断设备的运行状态出现“虚警”的情况,提供一种如下解决方案,具体包括如下步骤:FIG. 1 is a schematic flowchart of a false alarm filtering method for tolerance of erroneous data provided by an embodiment of the present invention. As shown in FIG. 1 , the embodiment of the present invention provides a “false alarm” for judging the operating state of a device for erroneous data. The following solution includes the following steps:

步骤100、获取传感器测量得到的设备对应的时序数据,根据所述设备正常运行所对应的状态参数值,确定所述时序数据中包括的疑似时间序列;Step 100: Acquire the time series data corresponding to the device measured by the sensor, and determine the suspected time series included in the time series data according to the state parameter value corresponding to the normal operation of the device;

企业为了监控设备的运行状态,会在设备上安装各类型的传感器,根据各个传感器来实时地检测设备的各项运行参数值。传感器是按照时间顺序来采集设备数据的,所采集的数据形成时序数据。In order to monitor the operating status of the equipment, enterprises will install various types of sensors on the equipment, and detect various operating parameter values of the equipment in real time according to each sensor. Sensors collect device data in chronological order, and the collected data form time series data.

本方法实施例流程的执行主体可以是用于执行错误数据容忍的虚警过滤方法的装置,例如CPU或DSP等,该装置从传感器中获取传感器所测量到的设备对应的时序数据。然后,该装置将该时序数据与该设备正常运行所对应的状态参数值进行比较,确定时序数据中包括的疑似时间序列。该设备正常运行所对应的状态参数值是该设备在没有出现任何告警的情况下所检测到的数据。The execution subject of the process of this embodiment of the method may be a device for executing the false alarm filtering method for tolerance of error data, such as a CPU or a DSP, and the device obtains time series data corresponding to the device measured by the sensor from the sensor. Then, the apparatus compares the time series data with the state parameter values corresponding to the normal operation of the equipment, and determines the suspected time series included in the time series data. The state parameter value corresponding to the normal operation of the device is the data detected by the device without any alarm.

本方法实施例中,以设备正常运行时的状态参数值为阈值,判断传感器测量得到的数据是否落在正常的数值范围内。判断的逻辑可以是:例如连续一分钟得到的设备的状态数据都低于该阈值,则此时设备可能是坏了,需要产生报警。这连续一分钟内的时序数据就是可能产生报警的疑似时间序列。以每辆设备正常工作的状态参数值为阈值,对传感器测量得到的时序数据进行过滤,过滤掉不符合报警逻辑的数据,得到的就是可能产生报警的疑似时间序列,本发明方法实施例中将通过该步骤的阈值过滤获得的疑似时间序列称为疑似时间序列。In this embodiment of the method, it is determined whether the data measured by the sensor falls within the normal numerical range by taking the state parameter value of the device as a threshold value during normal operation. The logic of the judgment may be: for example, the status data of the device obtained continuously for one minute is lower than the threshold, then the device may be broken at this time, and an alarm needs to be generated. This one-minute time series data is a suspected time series that may generate an alarm. Taking the state parameter value of the normal operation of each device as the threshold value, filter the time series data obtained by the sensor measurement, filter out the data that does not conform to the alarm logic, and obtain a suspected time series that may generate an alarm. The suspected time series obtained through the threshold filtering of this step is called the suspected time series.

步骤101、根据预设的相似度匹配方法,计算每一所述疑似时间序列与历史真警对应的报警时间序列之间的相似度,将对应的相似度高于相似度阈值的疑似时间序列确定为真警,根据预设的异常因子检测算法计算每个对应的相似度不高于所述相似度阈值的疑似时间序列的异常度;Step 101, according to the preset similarity matching method, calculate the similarity between each of the suspected time series and the alarm time series corresponding to the historical real alarm, and determine the suspected time series whose corresponding similarity is higher than the similarity threshold. If it is a true alarm, calculate the abnormality of each suspected time series whose corresponding similarity is not higher than the similarity threshold according to the preset abnormality factor detection algorithm;

在通过阈值过滤获得疑似时间序列后,可以根据预设的相似度匹配方法,计算每一个疑似时间序列与历史真警对应的报警时间序列之间的相似度。其中,所述的历史真警对应的报警时间序列存储在历史真警数据库中,所述的真警数据库中存储的每个报警时间序列都是造成设备真实报警的数据,而不是虚警。本方法实施例中将疑似时间序列与报警时间序列按照相似度匹配方法例如采用动态时间归整算法(Dynamic Time Warping,简称:DTW)来计算二者之间的相似度。After obtaining the suspected time series through threshold filtering, the similarity between each suspected time series and the alarm time series corresponding to the historical real alarm can be calculated according to the preset similarity matching method. Wherein, the alarm time series corresponding to the historical real alarms is stored in the historical real alarm database, and each alarm time series stored in the real alarm database is the data that caused the real alarm of the equipment, not the false alarm. In this embodiment of the method, the suspected time series and the alarm time series are calculated according to the similarity matching method, for example, a dynamic time warping algorithm (Dynamic Time Warping, DTW for short) is used to calculate the similarity between the two.

通过计算相似度,将对应的相似度高于相似度阈值的疑似时间序列确定为真警,即某个疑似时间序列与造成真实报警的报警时间序列之间的相似度若高于一个预先设定好的相似度阈值,则将该疑似时间序列被认定为真警。具体地,将通过阈值过滤获得的疑似时间序列,与历史上真实出现的报警序列计算相似度,相似度高的,说明该疑似时间序列与历史上的真警的很相似,历史上处于这种状态的设备报警了,说明此时的疑似时间序列有很大的概率就是真警。相似性度量方法可以使用DTW或其他时序数据度量方法,使用DTW来计算两个时序数据序列相似度的原因是时序数据序列可能存在缺失,导致两个序列的维度不一致,而DTW可以容忍数据缺失的情况,基于动态规划的思想,很好地实现对两个长短不一致的序列的匹配问题,根据匹配的结果判断两个序列的相似度。By calculating the similarity, the corresponding suspected time series whose similarity is higher than the similarity threshold are determined as true alarms, that is, if the similarity between a suspected time series and the alarm time series that caused the real alarm is higher than a preset value If the similarity threshold is good, the suspected time series is regarded as a true alarm. Specifically, the similarity between the suspected time series obtained by threshold filtering and the real alarm sequence in history is calculated. If the similarity is high, it means that the suspected time series is very similar to the real alarm in history. The equipment in the state has alarmed, indicating that the suspected time series at this time has a high probability of being a real alarm. The similarity measurement method can use DTW or other time series data measurement methods. The reason for using DTW to calculate the similarity of two time series data series is that the time series data series may be missing, resulting in inconsistent dimensions of the two series, and DTW can tolerate missing data. Based on the idea of dynamic programming, the matching problem of two sequences with inconsistent lengths is well realized, and the similarity of the two sequences is judged according to the matching results.

通过计算相似度,针对对应的相似度不高于相似度阈值的疑似时间序列,此时并不能判断其是否为虚警或真警,因此还需要进一步地进行判断。可以根据预设的异常因子检测算法例如局部异常因子检测算法(Local Outlier Factor;简称:LOF)计算每个对应的相似度不高于所述相似度阈值的疑似时间序列的异常度。通过计算疑似时间序列与真警的相似度,相似度高的认为其是真警,相似度低的不一定就不是真警,原因是可能该疑似时间序列在历史上未出现过,所以不能判断其就不是真警,需要进一步计算其异常度来判断。By calculating the similarity, it is not possible to judge whether it is a false alarm or a true alarm for a suspected time series whose corresponding similarity is not higher than the similarity threshold at this time, so further judgment is required. The abnormality of each corresponding suspected time series whose similarity is not higher than the similarity threshold may be calculated according to a preset abnormality factor detection algorithm, such as a local outlier factor detection algorithm (Local Outlier Factor; LOF for short). By calculating the similarity between the suspected time series and the real alarm, those with high similarity are considered to be real alarms, and those with low similarity are not necessarily not real alarms. The reason is that the suspected time series may not have appeared in history, so it cannot be judged. It is not a true alarm, and it needs to be further calculated to determine its abnormality.

步骤102、根据每个对应的相似度不高于所述相似度阈值的疑似时间序列的异常度,对所述疑似时间序列是否为真警进行判断。Step 102: Determine whether the suspected time series is a true alarm according to the abnormality of each corresponding suspected time series whose similarity is not higher than the similarity threshold.

针对对应的相似度不高于相似度阈值的疑似时间序列,根据LOF算法计算其对应的异常度,并根据该异常度对该疑似时间序列是否为真警进行判断。For the suspected time series whose corresponding similarity is not higher than the similarity threshold, the corresponding abnormality is calculated according to the LOF algorithm, and whether the suspected time series is a true alarm is judged according to the abnormality.

本发明实施例提供的错误数据容忍的虚警过滤方法,通过将可能产生报警的疑似时间序列与历史真警做相似度匹配以及计算其异常度来过滤错误数据引起的虚警,能够过滤掉绝大部分错误数据引起的虚警,提高报警的准确率。The false alarm filtering method for tolerance of erroneous data provided by the embodiment of the present invention filters false alarms caused by erroneous data by matching the suspected time series that may generate alarms with historical true alarms and calculating their abnormality. Most false alarms caused by wrong data improve the accuracy of alarms.

本发明实施例提供的错误数据容忍的虚警过滤方法,基于时序数据相似度匹配方法和异常因子检测算法来对错误数据产生的虚警进行过滤,可以实现过滤因为错误数据产生的绝大部分虚警,本发明实施例对虚警过滤的思想是:如果虚警与历史上出现的真警类似,则进行报警,如果虚警与历史上出现过的假警类似,则不再报警。时序数据相似度匹配方法可以是任何适用的计算相似度的方法,这里以现实场景中用到的DTW算法为例,异常因子检测算法可以是任何适用的异常因子检测方法,这里以现实场景中用到的LOF算法为例。The false alarm filtering method for tolerance of error data provided by the embodiment of the present invention filters the false alarms generated by error data based on the similarity matching method of time series data and the abnormal factor detection algorithm, which can filter most of the false alarms generated by error data. The idea of filtering false alarms in this embodiment of the present invention is: if the false alarms are similar to true alarms in history, then alarm, and if the false alarms are similar to false alarms in history, no longer alarm. The time series data similarity matching method can be any suitable method for calculating the similarity. Here, the DTW algorithm used in the real scene is taken as an example. The abnormal factor detection algorithm can be any suitable abnormal factor detection method. Take the LOF algorithm as an example.

图2为本发明另一实施例提供的错误数据容忍的虚警过滤方法的流程示意图,如图2所示,该方法包括如下步骤:FIG. 2 is a schematic flowchart of a false alarm filtering method for tolerance of wrong data provided by another embodiment of the present invention. As shown in FIG. 2 , the method includes the following steps:

步骤200、获取传感器测量得到的设备对应的时序数据,根据所述设备正常运行所对应的状态参数值,确定所述时序数据中包括的疑似时间序列;Step 200: Acquire the time series data corresponding to the device measured by the sensor, and determine the suspected time series included in the time series data according to the state parameter value corresponding to the normal operation of the device;

步骤201、根据预设的相似度匹配方法,计算每一所述疑似时间序列与历史真警对应的报警时间序列之间的相似度,将对应的相似度高于相似度阈值的疑似时间序列确定为真警,根据预设的异常因子检测算法计算每个对应的相似度不高于所述相似度阈值的疑似时间序列的异常度;Step 201, according to the preset similarity matching method, calculate the similarity between each of the suspected time series and the alarm time series corresponding to the historical real alarm, and determine the suspected time series whose corresponding similarity is higher than the similarity threshold. If it is a true alarm, calculate the abnormality of each suspected time series whose corresponding similarity is not higher than the similarity threshold according to the preset abnormality factor detection algorithm;

步骤202、根据每个对应的相似度不高于所述相似度阈值的疑似时间序列的异常度,对所述疑似时间序列是否为真警进行判断,具体包括,若对应的相似度不高于所述相似度阈值的疑似时间序列的异常度,低于异常度阈值,则过滤掉所述疑似时间序列;若对应的相似度不高于所述相似度阈值的疑似时间序列的异常度,不低于所述异常度阈值,则输出所述疑似时间序列以供人工判断是否为真警。Step 202: According to the abnormality of each suspected time series whose similarity is not higher than the similarity threshold, judge whether the suspected time series is a true alarm, specifically including, if the corresponding similarity is not higher than If the abnormality of the suspected time series of the similarity threshold is lower than the abnormality threshold, the suspected time series is filtered out; if the corresponding similarity is not higher than the abnormality of the suspected time series of the similarity threshold, no If it is lower than the abnormality threshold, the suspected time series is output for manual judgment whether it is a true alarm.

具体地,当设备的状态数值连续低于某个值时,设备可能处于异常状态,此时需要产生警报,这个值就是用于判断设备是否正常工作的阈值。而连续多长时间低于阈值则产生报警就是报警判断逻辑,通过报警判断逻辑和阈值对时序数据进行过滤,找到可能产生报警的疑似时间序列。Specifically, when the state value of the device is continuously lower than a certain value, the device may be in an abnormal state, and an alarm needs to be generated at this time, and this value is the threshold used to judge whether the device is working normally. The alarm judgment logic is used to filter the time series data through the alarm judgment logic and the threshold value to find the suspected time series that may generate an alarm.

将得到的疑似时间序列与历史上真实的报警时间序列做匹配,匹配的方法可以是DTW或其他时序数据相似度度量方法,相似度高的疑似时间序列判定为真警,相似度低的时间序列需要进一步判断。但过滤得到的与历史上的真警相似度低的疑似时间序列可能是虚警也可能是真警,需要进一步进行判断。可以采用离群点检测算法LOF或其他异常检测算法,通过计算疑似时间序列的异常度,认定异常度低的疑似时间序列为虚警,将这些虚警过滤。将得到的异常度高于一定阈值的疑似时间序列反馈给业务人员,由业务人员判断是否为真警,将确定为真警所对应的疑似时间序列,作为历史真警加入到历史真警数据库中。如果不是真警,进行过滤。Match the obtained suspected time series with the real alarm time series in history. The matching method can be DTW or other time series data similarity measurement methods. The suspected time series with high similarity is judged as true alarm, and the time series with low similarity Further judgment is required. However, the filtered suspected time series with low similarity to the historical real alarms may be false alarms or real alarms, and further judgment is required. The outlier detection algorithm LOF or other anomaly detection algorithms can be used to calculate the abnormality of the suspected time series, identify the suspected time series with low abnormality as false alarms, and filter these false alarms. Feed back the obtained suspected time series with an abnormality higher than a certain threshold to the business personnel, and the business personnel will judge whether it is a true alarm, and the suspected time series corresponding to the true alarm will be added to the historical true alarm database as a historical true alarm . If it is not a true alarm, filter it.

在本方法实施例提供的方法中,对过滤得到的与历史上真警相似度低的疑似时间序列,计算其异常度,异常度高的有很大的概率是真警,异常度低的说明这些序列在历史上出现过,且并没有产生报警,因此可以认定这些序列是虚警,可以进行过滤。在计算异常度的时候,疑似时间序列被当作计算对象,假设某个疑似时间序列表示为S_t={(t_1,v_1),(t_2,v_2),…,(t_n,v_n)},表示的是传感器在时刻点测得值。在计算过程中,我们把时间点略去,将每个子序列表示为(v_1,v_2,…,v_n)来计算其异常度。In the method provided by this embodiment of the method, the anomaly degree of the filtered suspected time series with low similarity to the historical real alarm is calculated, and a high anomaly degree has a high probability of being a true alarm, and a description of low anomaly degree These sequences have occurred in history and did not generate alarms, so these sequences can be considered false alarms and can be filtered. When calculating the anomaly degree, the suspected time series is regarded as the calculation object. Suppose a suspected time series is represented as S_t={(t_1,v_1),(t_2,v_2),…,(t_n,v_n)}, which means is the value measured by the sensor at the point in time. In the calculation process, we omit the time points and denote each subsequence as (v_1, v_2, ..., v_n) to calculate its anomaly degree.

进一步地,本方法实施例还包括在获取传感器测量得到的设备对应的时序数据后执行跳变清除的步骤,该步骤包括:获取传感器测量得到的设备对应的时序数据后,根据3σ准则对所述时序数据自行跳变清除的步骤,以删除所述时序数据中发生跳变的数据。Further, this embodiment of the method further includes the step of performing jump clearing after acquiring the time series data corresponding to the device measured by the sensor, the step includes: after acquiring the time series data corresponding to the device measured by the sensor, according to the 3σ criterion The step of clearing the time series data by self-jump, so as to delete the data that has jumped in the time series data.

具体地,时序数据中可能会出现一些跳变的数据,这些跳变的数据对虚警的判断会产生一定的干扰,因此需要删除掉这些跳变的数据。所谓的跳变数据,是根据3σ准则,产生速度变化异常、加速度变化异常的数据点。具体是通过3σ准则,发现引起速度变化异常和加速度变化异常的单点即跳变点,发现的跳变点很有可能是一些错误的数据,会影响到可能引起报警的疑似时间序列的判断,因此需要将其清除。Specifically, some hopping data may appear in the time series data, and these hopping data may interfere with the judgment of false alarms to a certain extent. Therefore, these hopping data need to be deleted. The so-called jump data are data points that produce abnormal changes in velocity and acceleration according to the 3σ criterion. Specifically, through the 3σ criterion, the single point that causes the abnormal speed change and the abnormal acceleration change is found, that is, the jump point. The found jump point is likely to be some wrong data, which will affect the judgment of the suspected time series that may cause an alarm. So it needs to be cleared.

本发明实施例提供的错误数据容忍的虚警过滤方法,通过3σ准则过滤掉数据中的跳变点;根据设备运行状态判断逻辑从过滤了跳变点后的数据中找到可能产生报警的疑似时间序列;将可能产生报警的疑似时间序列与历史的真警计算时序数据相似度,相似度高的则判断为真警,相似度低的则进行过滤;对过滤得到的相似度低的疑似时间序列计算序列异常度;将异常度超过一定阈值的时间序列反馈给业务人员,判断其是否为真警。The false alarm filtering method for tolerance of wrong data provided by the embodiment of the present invention filters out the transition points in the data through the 3σ criterion; according to the equipment operating state judgment logic, the suspected time that may generate an alarm is found from the data after filtering the transition points. Sequence; calculate the similarity of time series data between the suspected time series that may generate alarms and the historical real alarms, judge the ones with high similarity as true alarms, and filter those with low similarity; filter the suspected time series with low similarity Calculate the abnormality of the sequence; feed back the time series whose abnormality exceeds a certain threshold to the business personnel to determine whether it is a true alarm.

图3为本发明又一实施例提供的错误数据容忍的虚警过滤方法的流程示意图,如图3所示,该方法包括:FIG. 3 is a schematic flowchart of a false alarm filtering method for tolerance of wrong data provided by another embodiment of the present invention. As shown in FIG. 3 , the method includes:

步骤300、输入设备状态数据;Step 300, input device status data;

基于安装在设备上的传感器采集设备的各项运行参数值。传感器是按照时间顺序来采集设备数据的,所采集的数据形成时序数据。Various operating parameter values of the device are collected based on the sensors installed on the device. Sensors collect device data in chronological order, and the collected data form time series data.

步骤301、跳变清除,即过滤掉引起速度和加速度变化异常的单点;Step 301, clearing the jump, that is, filtering out the single point that causes abnormal changes in speed and acceleration;

根据3σ准则对所述时序数据自行跳变清除的步骤,具体可以包括:以时序数据作为输入,基于设备的实际运行所对应的状态参数值计算变化速度和变化加速度,依次求得变化速度的均值u和标准偏差σ,变化加速度的均值u和标准偏差σ。根据3σ准则,将变化速度和变化加速度落在区间(u-3*σ,u+3*σ)之外的数据确定为所述时序数据中发生跳变的数据,并删除所述异常点。The step of clearing the time series data by itself according to the 3σ criterion may specifically include: taking the time series data as input, calculating the change speed and change acceleration based on the state parameter values corresponding to the actual operation of the equipment, and sequentially obtaining the average value of the change speed u and standard deviation σ, mean u and standard deviation σ of varying acceleration. According to the 3σ criterion, the data whose change speed and change acceleration fall outside the interval (u-3*σ, u+3*σ) are determined as jumped data in the time series data, and the abnormal point is deleted.

通过变化速度和变化加速度异常点的位置确定设备状态原始数据的位置。假设变化速度的异常单点的位置在第m行,那么这个位置的变化速度是由设备状态数据的第m个时间点测得的数据和第m+1个时间点测得的数据计算而来的,因此将设备状态数据中的第m个时间点数据和第m+1个时间点数据标为待删除数据。假设变化加速度的异常单点的位置在第n行,那么该位置的变化加速度是由设备状态数据的第n个时间点、第n+1个时间点和第n+2个时间点测得的数据计算而来的,因此将设备状态数据中的第n+1个时间点数据标为待删除数据。将待删除数据点的位置取并集,在原始设备的状态数据的对应位置清除掉这些跳变的数据点。Determine the position of the raw data of the device state by the position of the abnormal point of the changing speed and the changing acceleration. Assuming that the position of the abnormal single point of change speed is in the mth row, then the change speed of this position is calculated from the data measured at the mth time point and the m+1th time point of the equipment status data. Therefore, the m th time point data and the m+1 th time point data in the device status data are marked as the data to be deleted. Suppose the position of the abnormal single point of changing acceleration is in the nth row, then the changing acceleration of this position is measured by the nth time point, the n+1th time point and the n+2th time point of the device status data Therefore, the data at the n+1th time point in the device status data is marked as the data to be deleted. Take the union of the positions of the data points to be deleted, and clear these jumped data points at the corresponding positions of the status data of the original device.

步骤302、阈值过滤,即通过报警判断逻辑找到疑似时间序列;Step 302, threshold filtering, that is, finding the suspected time series through the alarm judgment logic;

根据预先提供的设备异常工作的阈值,以及报警判断逻辑,在过滤掉跳变点的基础上,查找连续时间低于阈值的疑似时间序列。假设发动机连续一分钟测得的水温压力低于阈值s,则认为设备处于故障,此时应该产生警报。阈值过滤的逻辑是,以一分钟的时间间隔对传感器测得的时间序列数据分段,每段的数据都是同一分钟测得的设备的状态数据,判断这连续一分钟测得的水温压力是否低于阈值s,若是,则将这段时间序列数据视为可能产生报警的疑似时间序列数据,若不是,则进行过滤。在之前的步骤清除了跳变数据的基础上查找可能产生报警的疑似时间序列,并输出疑似时间序列。According to the pre-provided threshold for abnormal operation of the equipment and the alarm judgment logic, on the basis of filtering out the jump points, it searches for the suspected time series whose continuous time is lower than the threshold. Assuming that the water temperature and pressure measured by the engine for one minute is lower than the threshold s, the equipment is considered to be in failure, and an alarm should be generated at this time. The logic of threshold filtering is to segment the time-series data measured by the sensor at one-minute intervals, and each segment of data is the status data of the device measured in the same minute to determine whether the water temperature and pressure measured in this continuous minute are If it is lower than the threshold s, if it is, consider this time series data as suspected time series data that may generate an alarm, if not, filter it. On the basis of clearing the jump data in the previous steps, the suspected time series that may generate an alarm is searched, and the suspected time series is output.

步骤303、匹配真警,即与历史的真警计算相似度;Step 303, matching the real alarm, that is, calculating the similarity with the historical real alarm;

将上一步骤得到的每段疑似时间序列作为计算对象,分别与历史上真实报警的时间序列数据计算相似度。相似度度量方法可以是DTW,该算法基于动态规划的思想,解决了序列长短不一的模板匹配问题。采用DTW的一个重要原因是时间序列中可能存在数据缺失,即某些个时间点的数据没有采集到,如果按照传统的相似度计算方法,对长短不一致的时间序列计算的相似度误差可能非常大,与真警匹配的效果也会很差。基于DTW算法的相似度计算方法可以在疑似时间序列中找到与历史真警高度相似的序列,这些序列会被认定为真警,即执行步骤304。而相似度低的疑似时间序列将作为下一个步骤的输入,即自行步骤305。Take each suspected time series obtained in the previous step as the calculation object, and calculate the similarity with the time series data of real alarms in history. The similarity measurement method can be DTW, which is based on the idea of dynamic programming and solves the problem of template matching with different sequence lengths. An important reason for using DTW is that there may be missing data in the time series, that is, data at certain time points have not been collected. If the traditional similarity calculation method is used, the similarity error calculated for time series with inconsistent lengths may be very large. , the effect of matching with the real police will also be poor. The similarity calculation method based on the DTW algorithm can find sequences that are highly similar to historical true alarms in the suspected time series, and these sequences will be identified as true alarms, that is, step 304 is executed. The suspected time series with low similarity will be used as the input of the next step, that is, self-step 305 .

步骤304、相似度高的判定为真警;Step 304, determining that the similarity is high as a true alarm;

步骤305、异常过滤,即计算相似度低的疑似时间序列的异常度;Step 305, abnormality filtering, that is, calculating the abnormality degree of the suspected time series with low similarity;

与历史真警匹配度低的疑似时间序列将作为本步骤的输入,通过异常检测算法,计算每个序列的异常度。以局部异常因子检测算法LOF为例,通过设置参数k,以每个点的第k近邻的距离来计算每个点的局部可达密度,从而计算点的局部离群因子,得到每个点的异常度。异常度的值越大于1,说明该序列越有可能是异常序列,即越有可能是真警。在本步骤中,将每个疑似子序列作为计算对象,计算序列之间的距离,建立疑似时间序列的距离矩阵M,将M作为离群点检测算法LOF的输入,得到所有疑似子序列的异常度列表,返回所有疑似时间序列的异常度列表。The suspected time series with low matching degree with historical true alarms will be used as the input of this step, and the anomaly degree of each series will be calculated through the anomaly detection algorithm. Taking the local abnormal factor detection algorithm LOF as an example, by setting the parameter k, the local reachability density of each point is calculated by the distance of the k-th nearest neighbor of each point, so as to calculate the local outlier factor of the point, and get the value of each point. abnormality. The value of the abnormality degree is larger than 1, indicating that the sequence is more likely to be an abnormal sequence, that is, the more likely it is to be a true alarm. In this step, take each suspected subsequence as the calculation object, calculate the distance between the sequences, establish the distance matrix M of the suspected time series, use M as the input of the outlier detection algorithm LOF, and obtain the abnormality of all suspected subsequences Degree list, returns a list of anomaly degrees for all suspected time series.

步骤306、异常报警,即业务人员判断异常度高的时间序列是否真警;Step 306, abnormal alarm, that is, business personnel determine whether the time series with high abnormality is a real alarm;

根据步骤305得到的疑似时间序列的异常列表,设定异常度的阈值,将低于阈值的疑似时间序列过滤,将大等于阈值的疑似时间序列返回,反馈给业务人员,判断是否为真警。若是,则加入历史真警库,否则过滤。According to the abnormal list of suspected time series obtained in step 305, the threshold of abnormal degree is set, the suspected time series below the threshold is filtered, and the suspected time series greater than or equal to the threshold is returned and fed back to the business personnel to determine whether it is a true alarm. If it is, it will be added to the historical real police library, otherwise it will be filtered.

本发明实施例提供的错误数据容忍的虚警过滤方法,通过3σ准则过滤掉数据中的跳变点,基于相似度度量方法来匹配真警,又通过异常检测算法来查找异常度高的时间序列,判断其是否为真警,通过本发明提供的错误数据容忍的虚警过滤方法,可以实现对大部分错误数据产生的虚警进行过滤。The false alarm filtering method for tolerance of erroneous data provided by the embodiment of the present invention filters out the transition points in the data through the 3σ criterion, matches the true alarms based on the similarity measurement method, and searches for the time series with high abnormality degree through the abnormality detection algorithm. , to judge whether it is a true alarm, and through the false alarm filtering method for tolerance of erroneous data provided by the present invention, the false alarm generated by most of the erroneous data can be filtered.

图4为本发明实施例提供的错误数据容忍的虚警过滤装置的结构示意图,如图4所示,该装置包括第一处理模块401、第二处理模块402和第三处理模块403,其中:FIG. 4 is a schematic structural diagram of a false alarm filtering device for error data tolerance provided by an embodiment of the present invention. As shown in FIG. 4 , the device includes a first processing module 401, a second processing module 402, and a third processing module 403, wherein:

第一处理模块401用于获取传感器测量得到的设备对应的时序数据,根据所述设备正常运行所对应的状态参数值,确定所述时序数据中包括的疑似时间序列;第二处理模块402用于根据预设的相似度匹配方法,计算每一所述疑似时间序列与历史真警对应的报警时间序列之间的相似度,将对应的相似度高于相似度阈值的疑似时间序列确定为真警,根据预设的异常因子检测算法计算每个对应的相似度不高于所述相似度阈值的疑似时间序列的异常度;第三处理模块403用于根据每个对应的相似度不高于所述相似度阈值的疑似时间序列的异常度,对所述疑似时间序列是否为真警进行判断。The first processing module 401 is used to obtain the time series data corresponding to the device measured by the sensor, and determine the suspected time series included in the time series data according to the state parameter value corresponding to the normal operation of the device; the second processing module 402 is used for According to the preset similarity matching method, the similarity between each of the suspected time series and the alarm time series corresponding to the historical real alarm is calculated, and the corresponding suspected time series whose similarity is higher than the similarity threshold is determined as a real alarm , calculate the abnormality of each suspected time series whose similarity is not higher than the similarity threshold according to the preset abnormality factor detection algorithm; the third processing module 403 is used to calculate the abnormality of each corresponding similarity not higher than the similarity threshold; The abnormality of the suspected time series with the similarity threshold is determined, and whether the suspected time series is a true alarm is judged.

具体地,第一处理模块401通过报警逻辑来查找异常单点过滤后的数据中,可能引起报警的疑似时间序列,第二处理模块402将疑似时间序列发现模块找到的疑似时间序列与历史真警计算时序数据相似度,认定相似度高的疑似时间序列为真警,第三处理模块403采用异常检测算法对得到的相似度低的时间序列计算其异常度,设置阈值过滤掉异常度低的时间序列,返回异常度高的疑似时间序列,将这些时间序列反馈给业务人员。Specifically, the first processing module 401 uses the alarm logic to find the suspected time series that may cause an alarm in the data filtered by the abnormal single point, and the second processing module 402 compares the suspected time series found by the suspected time series discovery module with the historical real alarms Calculate the similarity of time series data, and determine the suspected time series with high similarity as true alarms. The third processing module 403 uses anomaly detection algorithm to calculate the abnormality of the obtained time series with low similarity, and sets a threshold to filter out the time with low abnormality. Sequence, return suspected time series with high abnormality, and feed these time series back to business personnel.

进一步地,第三处理模块403包括:第一单元,用于若对应的相似度不高于所述相似度阈值的疑似时间序列的异常度,低于异常度阈值,则过滤掉所述疑似时间序列;第二单元,用于若对应的相似度不高于所述相似度阈值的疑似时间序列的异常度,不低于所述异常度阈值,则输出所述疑似时间序列以供人工判断是否为真警。Further, the third processing module 403 includes: a first unit, configured to filter out the suspected time series if the abnormality degree of the suspected time series whose corresponding similarity is not higher than the similarity degree threshold is lower than the abnormality degree threshold sequence; the second unit is used to output the suspected time series for manual judgment if the abnormality degree of the suspected time series whose similarity is not higher than the similarity threshold value is not lower than the abnormality degree threshold value. For real police.

再进一步地,本实施提供的错误数据容忍的虚警过滤装置中还可以包括第四处理模块,用于计算数据变化速度的均值和标准偏差,计算数据变化加速度的均值和标准偏差,基于3σ准则,发现引起速度变化异常和加速度变化异常的数据单点,过滤掉这些跳变的数据点。Still further, the false alarm filtering device for tolerance of wrong data provided by this implementation may also include a fourth processing module for calculating the mean value and standard deviation of the data change speed, and calculating the mean value and standard deviation of the data change acceleration, based on the 3σ criterion. , find the single data points that cause abnormal changes in velocity and acceleration, and filter out these jumping data points.

本发明实施例提供的错误数据容忍的虚警过滤装置,具体用于执行上述各方法实施例提供的错误数据容忍的虚警过滤方法流程,其具体的功能和流程可以详见上述方法实施例,此处不再赘述。The false alarm filtering device for tolerance of error data provided by the embodiment of the present invention is specifically used to execute the process of the false alarm filtering method for tolerance of error data provided by the above method embodiments. It will not be repeated here.

本发明实施例提供的错误数据容忍的虚警过滤装置,根据3σ准则过滤引起数据变化速度异常和数据变化加速度异常的单点,在此基础上依据报警判断逻辑查找可能引起报警的疑似时间序列,将查找到的疑似时间序列与历史真警做相似度的匹配,再对相似度低的疑似时间序列计算异常度,将异常度值高的疑似时间序列返回,过滤掉所有异常度低的疑似时间序列,实现了因错误数据产生的虚警的过滤,可以实现任何类型场景下设备因错误数据产生的虚警的过滤。The false alarm filtering device for tolerance of erroneous data provided by the embodiment of the present invention filters the single points that cause abnormal data change speed and data change acceleration according to the 3σ criterion, and on this basis, searches for suspected time series that may cause an alarm according to the alarm judgment logic. Match the found suspected time series with the historical real police, then calculate the anomaly degree for the suspected time series with low similarity, return the suspected time series with high anomaly degree value, and filter out all the suspected time series with low anomaly degree The sequence realizes the filtering of false alarms generated by incorrect data, and can filter the false alarms generated by devices due to incorrect data in any type of scenario.

图5为本发明实施例提供的电子设备实体结构示意图,如图5所示,该服务器可以包括:处理器(processor)510、通信接口(Communications Interface)520、存储器(memory)530和通信总线540,其中,处理器510,通信接口520,存储器530通过通信总线540完成相互间的通信。处理器510可以调用存储器530中的计算机程序,以执行上述各实施例提供的方法,例如包括:获取传感器测量得到的设备对应的时序数据,根据所述设备正常运行所对应的状态参数值,确定所述时序数据中包括的疑似时间序列;根据预设的相似度匹配方法,计算每一所述疑似时间序列与历史真警对应的报警时间序列之间的相似度,将对应的相似度高于相似度阈值的疑似时间序列确定为真警,根据预设的异常因子检测算法计算每个对应的相似度不高于所述相似度阈值的疑似时间序列的异常度;根据每个对应的相似度不高于所述相似度阈值的疑似时间序列的异常度,对所述疑似时间序列是否为真警进行判断。FIG. 5 is a schematic diagram of the physical structure of an electronic device provided by an embodiment of the present invention. As shown in FIG. 5 , the server may include: a processor (processor) 510, a communication interface (Communications Interface) 520, a memory (memory) 530, and a communication bus 540 , wherein the processor 510 , the communication interface 520 , and the memory 530 communicate with each other through the communication bus 540 . The processor 510 can call the computer program in the memory 530 to execute the methods provided in the above embodiments, for example, including: acquiring the time series data corresponding to the device measured by the sensor, and determining according to the state parameter value corresponding to the normal operation of the device. The suspected time series included in the time series data; according to the preset similarity matching method, the similarity between each of the suspected time series and the alarm time series corresponding to the historical real alarm is calculated, and the corresponding similarity is higher than The suspected time series of the similarity threshold is determined as a true alarm, and the abnormality of each corresponding suspected time series whose similarity is not higher than the similarity threshold is calculated according to the preset abnormal factor detection algorithm; according to each corresponding similarity The abnormality of the suspected time series not higher than the similarity threshold is used to judge whether the suspected time series is a true alarm.

此外,上述的存储器530中的计算机程序可以通过软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。In addition, the above-mentioned computer program in the memory 530 can be implemented in the form of software functional units and can be stored in a computer-readable storage medium when sold or used as an independent product. Based on this understanding, the technical solution of the present invention can be embodied in the form of a software product in essence, or the part that contributes to the prior art or the part of the technical solution. The computer software product is stored in a storage medium, including Several instructions are used to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present invention. The aforementioned storage medium includes: U disk, mobile hard disk, Read-Only Memory (ROM, Read-Only Memory), Random Access Memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program codes .

本发明实施例还提供一种非暂态计算机可读存储介质,其上存储有计算机程序,该计算机程序被处理器执行时以执行上述各实施例提供的方法,例如包括:获取传感器测量得到的设备对应的时序数据,根据所述设备正常运行所对应的状态参数值,确定所述时序数据中包括的疑似时间序列;根据预设的相似度匹配方法,计算每一所述疑似时间序列与历史真警对应的报警时间序列之间的相似度,将对应的相似度高于相似度阈值的疑似时间序列确定为真警,根据预设的异常因子检测算法计算每个对应的相似度不高于所述相似度阈值的疑似时间序列的异常度;根据每个对应的相似度不高于所述相似度阈值的疑似时间序列的异常度,对所述疑似时间序列是否为真警进行判断。Embodiments of the present invention further provide a non-transitory computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, executes the methods provided by the foregoing embodiments, for example, comprising: acquiring data obtained by sensor measurement For the time series data corresponding to the device, according to the state parameter value corresponding to the normal operation of the device, determine the suspected time series included in the time series data; according to the preset similarity matching method, calculate each of the suspected time series and the history The similarity between the alarm time series corresponding to the true alarm, the suspected time series with the corresponding similarity higher than the similarity threshold is determined as the true alarm, and each corresponding similarity is calculated according to the preset abnormal factor detection algorithm. The abnormality of the suspected time series of the similarity threshold; according to the abnormality of each corresponding suspected time series whose similarity is not higher than the similarity threshold, determine whether the suspected time series is a true alarm.

以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。本领域普通技术人员在不付出创造性的劳动的情况下,即可以理解并实施。The device embodiments described above are only illustrative, wherein the units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in One place, or it can be distributed over multiple network elements. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution in this embodiment. Those of ordinary skill in the art can understand and implement it without creative effort.

通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到各实施方式可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件。基于这样的理解,上述技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品可以存储在计算机可读存储介质中,如ROM/RAM、磁碟、光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行各个实施例或者实施例的某些部分所述的方法。From the description of the above embodiments, those skilled in the art can clearly understand that each embodiment can be implemented by means of software plus a necessary general hardware platform, and certainly can also be implemented by hardware. Based on this understanding, the above-mentioned technical solutions can be embodied in the form of software products in essence or the parts that make contributions to the prior art, and the computer software products can be stored in computer-readable storage media, such as ROM/RAM, magnetic A disc, an optical disc, etc., includes several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to perform the methods described in various embodiments or some parts of the embodiments.

最后应说明的是:以上实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present invention, but not to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that it can still be The technical solutions described in the foregoing embodiments are modified, or some technical features thereof are equivalently replaced; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (10)

1. A method for false alarm filtering for tolerance of erroneous data, comprising:
acquiring time sequence data corresponding to equipment measured by a sensor, and determining a suspected time sequence included in the time sequence data according to a state parameter value corresponding to normal operation of the equipment;
according to a preset similarity matching method, calculating the similarity between each suspected time sequence and an alarm time sequence corresponding to a historical true alarm, determining the suspected time sequence with the corresponding similarity higher than a similarity threshold as the true alarm, and calculating the abnormality of each suspected time sequence with the corresponding similarity not higher than the similarity threshold according to a preset abnormality factor detection algorithm;
and judging whether the suspected time sequence is a true alarm or not according to the abnormal degree of each corresponding suspected time sequence of which the similarity is not higher than the similarity threshold.
2. The method of claim 1, wherein the determining whether the suspected time series is true according to the abnormality degree of each suspected time series whose similarity degree is not higher than the similarity degree threshold comprises:
if the corresponding similarity is not higher than the degree of abnormality of the suspected time sequence of the similarity threshold value and is lower than the degree of abnormality threshold value, filtering the suspected time sequence;
and if the corresponding similarity is not higher than the abnormality degree of the suspected time sequence of the similarity threshold and is not lower than the abnormality degree threshold, outputting the suspected time sequence for manual judgment of whether the suspected time sequence is a true alarm.
3. The method of claim 1 or 2, wherein the obtaining of the timing data corresponding to the device measured by the sensor comprises:
and after time sequence data corresponding to the equipment measured by the sensor is obtained, automatically jumping and clearing the time sequence data according to a 3 sigma criterion so as to delete the data jumping in the time sequence data.
4. The method of claim 3, wherein the step of self-jump clearing the time-series data according to 3 σ criterion comprises:
taking the time sequence data as input, calculating a change speed and a change acceleration based on a state parameter value corresponding to actual operation of the equipment, and calculating to obtain a mean value u and a standard deviation sigma of the change speed, and a mean value u and a standard deviation sigma of the change acceleration;
and determining data of which the change speed and the change acceleration fall outside an interval (u-3 sigma, u +3 sigma) as data of which the jump occurs in the time series data according to a 3 sigma criterion, and deleting a time point of the data of which the jump occurs.
5. The method of claim 1, wherein the similarity matching method is a dynamic time warping algorithm, and the anomaly detection algorithm is a local anomaly detection algorithm.
6. The method of false data tolerant false alarm filtering according to claim 1 or 2, wherein the method further comprises:
and adding the suspected time sequence determined to correspond to the true alarm as a historical true alarm into a historical true alarm database.
7. A false alarm filtering device for fault data tolerance, comprising:
the device comprises a first processing module, a second processing module and a third processing module, wherein the first processing module is used for acquiring time sequence data corresponding to equipment measured by a sensor and determining a suspected time sequence included in the time sequence data according to a state parameter value corresponding to normal operation of the equipment;
the second processing module is used for calculating the similarity between each suspected time sequence and an alarm time sequence corresponding to a historical true alarm according to a preset similarity matching method, determining the suspected time sequence with the corresponding similarity higher than a similarity threshold as the true alarm, and calculating the abnormality of each suspected time sequence with the corresponding similarity not higher than the similarity threshold according to a preset abnormality factor detection algorithm;
and the third processing module is used for judging whether the suspected time sequence is a true alarm or not according to the abnormal degree of each corresponding suspected time sequence, wherein the similarity of each suspected time sequence is not higher than the similarity threshold.
8. The erroneous-data-tolerant false alarm filtering device of claim 7, wherein the third processing module comprises:
the first unit is used for filtering the suspected time sequence if the corresponding similarity is not higher than the abnormality degree of the suspected time sequence of the similarity threshold value and is lower than the abnormality degree threshold value;
and the second unit is used for outputting the suspected time sequence for manual judgment of whether the suspected time sequence is a true alarm or not if the corresponding similarity of the suspected time sequence is not higher than the similarity threshold and is not lower than the abnormality threshold.
9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the steps of the error data tolerant false alarm filtering method according to any of claims 1 to 6 when executing the program.
10. A non-transitory computer readable storage medium, having stored thereon a computer program, which, when being executed by a processor, carries out the steps of the false alarm filtering method for fault data tolerance of any one of claims 1 to 6.
CN201910374187.4A 2019-05-07 2019-05-07 False alarm filtering method and device for tolerance of error data Active CN110286656B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910374187.4A CN110286656B (en) 2019-05-07 2019-05-07 False alarm filtering method and device for tolerance of error data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910374187.4A CN110286656B (en) 2019-05-07 2019-05-07 False alarm filtering method and device for tolerance of error data

Publications (2)

Publication Number Publication Date
CN110286656A CN110286656A (en) 2019-09-27
CN110286656B true CN110286656B (en) 2020-07-17

Family

ID=68001831

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910374187.4A Active CN110286656B (en) 2019-05-07 2019-05-07 False alarm filtering method and device for tolerance of error data

Country Status (1)

Country Link
CN (1) CN110286656B (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111338892B (en) * 2020-02-17 2022-02-15 北京理工大学 Time sequence rise abnormity identification method under extreme operation condition
CN113381874B (en) * 2020-03-10 2023-06-02 上海杰之能软件科技有限公司 Fault signal processing method, storage medium and terminal
CN111726341B (en) * 2020-06-02 2022-10-14 五八有限公司 Data detection method and device, electronic equipment and storage medium
CN111897695B (en) * 2020-07-31 2022-06-17 平安科技(深圳)有限公司 Method and device for acquiring KPI abnormal data sample and computer equipment
CN112115184A (en) * 2020-09-18 2020-12-22 平安科技(深圳)有限公司 Time series data detection method, device, computer equipment and storage medium
CN113077613A (en) * 2021-04-07 2021-07-06 浙江佳伯尔电子科技有限公司 Alarm for preventing sensor misjudgment and judgment method thereof
CN113918398B (en) * 2021-11-25 2025-02-28 中国商用飞机有限责任公司 Information suppression method, device, computer equipment and readable storage medium
CN115033453A (en) * 2022-06-14 2022-09-09 阿里巴巴(中国)有限公司 Abnormality detection method, device, equipment, storage medium and program
CN116821833B (en) * 2023-08-29 2023-11-14 好林(威海)新材料有限公司 Data anomaly detection method for machine filter cloth adhesion
CN117193164B (en) * 2023-11-08 2024-01-30 东莞市博思特数控机械有限公司 Fault monitoring method and system of numerical control machine tool
CN119089348A (en) * 2024-08-19 2024-12-06 武汉市丰谊商务国际旅行社有限公司 A self-service ticket purchasing human-computer interaction method and system based on artificial intelligence

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104075749A (en) * 2014-06-30 2014-10-01 通号通信信息集团有限公司 Abnormal state detecting method and system for equipment in internet of things
CN104700532A (en) * 2013-12-11 2015-06-10 杭州海康威视数字技术股份有限公司 Video alarm method and video alarm device
CN105764162A (en) * 2016-05-10 2016-07-13 江苏大学 Wireless sensor network abnormal event detecting method based on multi-attribute correlation
CN107798272A (en) * 2016-08-30 2018-03-13 佳能株式会社 Fast multi-target detects and tracking system
CN108319981A (en) * 2018-02-05 2018-07-24 清华大学 A kind of time series data method for detecting abnormality and device based on density
CN108595300A (en) * 2018-03-21 2018-09-28 北京奇艺世纪科技有限公司 A kind of method and device of configurable monitoring and alarm

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2897112B1 (en) * 2014-01-17 2019-03-06 Wincor Nixdorf International GmbH Method and apparatus for the prevention of false alarms in monitoring systems
US10726540B2 (en) * 2017-10-17 2020-07-28 International Business Machines Corporation Self-similarity analysis for defect detection on patterned industrial objects

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104700532A (en) * 2013-12-11 2015-06-10 杭州海康威视数字技术股份有限公司 Video alarm method and video alarm device
CN104075749A (en) * 2014-06-30 2014-10-01 通号通信信息集团有限公司 Abnormal state detecting method and system for equipment in internet of things
CN105764162A (en) * 2016-05-10 2016-07-13 江苏大学 Wireless sensor network abnormal event detecting method based on multi-attribute correlation
CN107798272A (en) * 2016-08-30 2018-03-13 佳能株式会社 Fast multi-target detects and tracking system
CN108319981A (en) * 2018-02-05 2018-07-24 清华大学 A kind of time series data method for detecting abnormality and device based on density
CN108595300A (en) * 2018-03-21 2018-09-28 北京奇艺世纪科技有限公司 A kind of method and device of configurable monitoring and alarm

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
False alarms in episodic recognition : an examination of base-rate, similarity-based and comprehensive theories;Angela M. Maguire.;《memory》;20050101;全文 *
搜索网络中虚警干扰内容隔离技术研究仿真;马一路;《计算机仿真》;20150123;全文 *
无线传感器网络环境下的一种轻量级事件容错检测算法;董传明, 刘克中, 罗广,等;《传感技术学报》;20140315;全文 *

Also Published As

Publication number Publication date
CN110286656A (en) 2019-09-27

Similar Documents

Publication Publication Date Title
CN110286656B (en) False alarm filtering method and device for tolerance of error data
CN109981328B (en) A kind of fault early warning method and device
KR102440335B1 (en) Anomaly detection and management method and device therefor
US11733688B2 (en) System and method for recognizing and forecasting anomalous sensory behavioral patterns of a machine
US9524223B2 (en) Performance metrics of a computer system
CN110598180B (en) Event detection method, device and system based on statistical analysis
CN112380073B (en) Fault position detection method and device and readable storage medium
CN107391335B (en) Method and equipment for checking health state of cluster
JP2015028700A (en) Failure detection device, failure detection method, failure detection program and recording medium
CN109034423B (en) Fault early warning judgment method, device, equipment and storage medium
CN114757380A (en) A thermal power plant fault early warning system, method, electronic device and storage medium
CN117061334B (en) A link alarm processing method, device, equipment and storage medium
Folmer et al. Detection of temporal dependencies in alarm time series of industrial plants
CN117473514B (en) Intelligent operation and maintenance method and system of industrial control system
CN112988512A (en) Method, device and equipment for detecting time sequence data abnormity and storage medium
CN110889597A (en) Method and device for detecting abnormal business timing sequence indexes
CN105096041A (en) Index change traceability and prediction method and apparatus
CN118174788B (en) Fault detection method, device and equipment of optical fiber wiring cabinet and storage medium
CN114039837A (en) Alarm data processing method, device, system, equipment and storage medium
CN102158372B (en) Distributed system abnormity detection method
CN113112036A (en) Data processing method and device for vehicle and computer equipment
KR102108975B1 (en) Apparatus and method for condition based maintenance support of naval ship equipment
CN115242457B (en) A method, device, electronic device and storage medium for detecting log data
CN106652393A (en) Method for determining false alarm
CN116248532A (en) Network abnormality detection method, network abnormality detection device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant