CN104598514A - Algorithm for finding rare environmental monitoring data combination mode based on clustering analysis - Google Patents

Algorithm for finding rare environmental monitoring data combination mode based on clustering analysis Download PDF

Info

Publication number
CN104598514A
CN104598514A CN201410685956.XA CN201410685956A CN104598514A CN 104598514 A CN104598514 A CN 104598514A CN 201410685956 A CN201410685956 A CN 201410685956A CN 104598514 A CN104598514 A CN 104598514A
Authority
CN
China
Prior art keywords
data
monitoring
algorithm
rare
environmental monitoring
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410685956.XA
Other languages
Chinese (zh)
Inventor
邹志强
王正
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING JINSHUI YONGLI TECHNOLOGY Co Ltd
Original Assignee
BEIJING JINSHUI YONGLI TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING JINSHUI YONGLI TECHNOLOGY Co Ltd filed Critical BEIJING JINSHUI YONGLI TECHNOLOGY Co Ltd
Priority to CN201410685956.XA priority Critical patent/CN104598514A/en
Publication of CN104598514A publication Critical patent/CN104598514A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Alarm Systems (AREA)

Abstract

The invention relates to an algorithm for finding a rare environmental monitoring data combination mode based on clustering analysis. According to the algorithm, the clustering analysis principle is adopted for analyzing the data mode of the multi-parameter data combination of the environmental monitoring data and finding the combination mode of the rare monitoring data hidden therein, so that environmental monitoring staff are reminded. The environmental monitoring data considered by the algorithm are not confined to a specific monitoring parameter, but are a monitoring data combination in the overall consideration of multiple parameters at the same time. According to the algorithm for finding the rare environmental monitoring data combination mode based on clustering analysis, the clustering analysis principle is adopted, the multi-parameter monitoring data combination is in the overall consideration and compared with historical data, and a data combination mode will be judged to be a rare data combination mode in history if the number of the occurrence times of the current data combination mode is less than the corresponding threshold value.

Description

A kind of algorithm finding the rare data assemblies pattern of environmental monitoring based on cluster analysis
Art
The present invention relates to a kind of algorithm finding the rare data assemblies pattern of environmental monitoring based on cluster analysis.This algorithm can utilize cluster analysis principle, analyzes the data pattern that the Multi-parameter data of environmental monitoring data combines, and finds the Monitoring Data integrated mode that wherein hidden history is rare, thus can remind the attention of environmental monitoring personnel.
Background technology
In environmental monitoring, the ANOMALOUS VARIATIONS of environmental monitoring data may be caused by Monitoring Data quality problems, also may be the change of hint environment.Therefore, for the discovery of abnormal data in environmental monitoring data, environmental monitoring department can be assisted to find environmental monitoring data quality problems, and environmental change is given warning in advance.As a kind of form of abnormal data, when the rare data assemblies pattern of history appears in environmental monitoring data, just need to remind environmental monitoring personnel may there is data quality problem and maybe may occur environmental change.
Algorithm described in the invention can be analyzed environmental monitoring data by cluster analysis, to find the rare data assemblies pattern of history occurred in environmental monitoring data.
Summary of the invention
The environmental monitoring data that algorithm described in the invention is considered is not limited to certain specific monitoring parameter, but considers the Monitoring Data combination of multiple parameters of synchronization.Algorithm described in the invention utilizes cluster analysis principle, consider the Monitoring Data combination of multiparameter, and compare with historical data, if when finding that the number of times that current data assemblies pattern occurs in history is less than respective threshold, this data assemblies pattern will be judged as the rare data assemblies pattern of history.
Ultimate principle of the present invention is that the environmental monitoring data with ground is periodically variable, if there is the data assemblies that history is rare, then can think that this data assemblies is abnormal conditions, the tendency change of Monitoring Data quality problems or environment may be reflected, therefore should cause the attention of environmental monitoring personnel.In algorithm, the present invention adopts gridding method to multi-parameter monitoring data construct history feature database, and compare with Real-time Monitoring Data and history feature database, if current Real-time Monitoring Data is combined in institute's occurrence number in history feature database and lower than setting threshold value, is then judged to be that this data assemblies is the rare data assemblies of history.
Arthmetic statement of the present invention as shown in Figure 1, this algorithm is realized by software development, can first generate history feature database, then the combination of current environment Monitoring Data is compared with history feature database, if current Monitoring Data combination is lower than set threshold value, be then judged to be the rare data assemblies pattern of environmental monitoring.
The invention has the beneficial effects as follows: by the analytical approach of cluster analysis, the rare data assemblies data exception of history in environmental monitoring data can be found, thus environmental monitoring personnel inspection Monitoring Data quality can be reminded or pay attention to Trend of Environmental Change.
Accompanying drawing explanation
Below in conjunction with accompanying drawing, the present invention is further described.
Fig. 1 finds the algorithm flow chart of the rare data assemblies pattern of environmental monitoring based on cluster analysis
Embodiment
In FIG, the algorithm flow finding the rare data assemblies pattern of environmental monitoring based on cluster analysis is described.Key step comprises:
1, statistical study is carried out to different monitoring parameter combination (P parameter), with each monitoring parameter for dimension, each dimension is divided into D decile, thus all values of this parameter combinations form are divided into (D^P) individual multi-dimensional grid
2, according to historical data, the number of times (N) that the historical data calculating this parameter combinations occurs in each grid
3, by data occurrence number history of forming property data base in grid composition and each grid
4, the data of this monitoring parameter data assemblies current and history feature database are contrasted
5, the judgement of rare data assemblies pattern: grid corresponding historical data combination occurrence number (N) < sets threshold value L belonging to current Monitoring Data combination, then judge that this Monitoring Data is combined as the rare data assemblies pattern of environmental monitoring, otherwise be not judged to be the rare data assemblies pattern of environmental monitoring.

Claims (3)

1. one kind finds the algorithm of the rare data assemblies pattern of environmental monitoring based on cluster analysis, it is characterized in that: the present invention adopts gridding method to multi-parameter monitoring data construct history feature database, and compare with Real-time Monitoring Data and history feature database, if current Real-time Monitoring Data is combined in institute's occurrence number in history feature database and lower than setting threshold value, is then judged to be that this data assemblies is the rare data assemblies of history.
2. a kind of algorithm finding the rare data assemblies pattern of environmental monitoring based on cluster analysis according to claim 1, its algorithm characteristics is: the environmental monitoring data that algorithm is considered is not limited to certain specific monitoring parameter, but considers the Monitoring Data combination of multiple parameters of synchronization.
3. a kind of algorithm finding the rare data assemblies pattern of environmental monitoring based on cluster analysis according to claim 1, its algorithm steps feature is:
1) statistical study is carried out to different monitoring parameter combination (P parameter), with each monitoring parameter for dimension, each dimension is divided into D decile, thus all values of this parameter combinations form are divided into (D^P) individual multi-dimensional grid
2) according to historical data, the number of times (N) that the historical data calculating this parameter combinations occurs in each grid
3) by data occurrence number history of forming property data base in grid composition and each grid
4) data of this monitoring parameter data assemblies current and history feature database are contrasted
5) judgement of rare data assemblies pattern: grid corresponding historical data combination occurrence number (N) < sets threshold value L belonging to current Monitoring Data combination, then judge that this Monitoring Data is combined as the rare data assemblies pattern of environmental monitoring, otherwise be not judged to be the rare data assemblies pattern of environmental monitoring.
CN201410685956.XA 2014-11-26 2014-11-26 Algorithm for finding rare environmental monitoring data combination mode based on clustering analysis Pending CN104598514A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410685956.XA CN104598514A (en) 2014-11-26 2014-11-26 Algorithm for finding rare environmental monitoring data combination mode based on clustering analysis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410685956.XA CN104598514A (en) 2014-11-26 2014-11-26 Algorithm for finding rare environmental monitoring data combination mode based on clustering analysis

Publications (1)

Publication Number Publication Date
CN104598514A true CN104598514A (en) 2015-05-06

Family

ID=53124300

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410685956.XA Pending CN104598514A (en) 2014-11-26 2014-11-26 Algorithm for finding rare environmental monitoring data combination mode based on clustering analysis

Country Status (1)

Country Link
CN (1) CN104598514A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113376327A (en) * 2021-07-08 2021-09-10 海南绿能环境工程有限公司 Environmental monitoring information management method and system based on big data

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103020642A (en) * 2012-10-08 2013-04-03 江苏省环境监测中心 Water environment monitoring and quality-control data analysis method
CN103176221A (en) * 2013-03-07 2013-06-26 李春华 Mineralizing abnormal feature identification method based on different geological unit background values and contrast values

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103020642A (en) * 2012-10-08 2013-04-03 江苏省环境监测中心 Water environment monitoring and quality-control data analysis method
CN103176221A (en) * 2013-03-07 2013-06-26 李春华 Mineralizing abnormal feature identification method based on different geological unit background values and contrast values

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
田启明等: "基于网格距离的聚类算法的设计、实现和应用", 《计算机应用》 *
舒红平等: "网格聚类在多雷达数据融合算法中的应用", 《电子科大学学报》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113376327A (en) * 2021-07-08 2021-09-10 海南绿能环境工程有限公司 Environmental monitoring information management method and system based on big data
CN113376327B (en) * 2021-07-08 2023-01-17 海南海笙信息科技有限公司 Environmental monitoring information management method and system based on big data

Similar Documents

Publication Publication Date Title
CN109726731B (en) Power consumption abnormality detection method, device, equipment and readable storage medium
CN108021736B (en) Load switching action monitoring method based on sliding window residual error model
US11531688B2 (en) Time-series data processing device, time-series data processing system, and time-series data processing method
Liu et al. A hybrid FCW-EMD and KF-BA-SVM based model for short-term load forecasting
Wang et al. An IoT application for fault diagnosis and prediction
EP2333629A1 (en) Error detection method and system
CN105630885A (en) Abnormal power consumption detection method and system
GB201108778D0 (en) System monitor and method of system monitoring
CN106708694B (en) Method and device for positioning burr abnormal point and computing equipment
JP2014511096A5 (en)
CN104102668A (en) Automatic database performance analysis and early warning system
Li et al. Data quality observation in pervasive environments
CN104020724A (en) Alarm monitoring method and device
CN102882895A (en) Method and device for identifying message attack
CN116304766A (en) Multi-sensor-based quick assessment method for state of switch cabinet
CN108932217A (en) The method and device of energy consumption statistic
CN104809525A (en) Electricity load prediction method and device
CN109446243B (en) Method for detecting power generation abnormity of photovoltaic power station based on big data analysis
CN111241208A (en) Method and device for monitoring abnormity of periodic time sequence data
Bo et al. Recognition of control chart patterns in auto-correlated process based on random forest
CN104598514A (en) Algorithm for finding rare environmental monitoring data combination mode based on clustering analysis
Ahmad et al. Statistical distributions of load profiling data
CN105183612B (en) The appraisal procedure of server free memory abnormal growth and operation conditions
CN111861121A (en) Abnormal electricity utilization judgment method based on user electricity fluctuation and line loss fluctuation analysis
CN110207987A (en) A kind of determination method of rolling bearing performance degeneration decline node

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20150506