CN112988815A - 一种大规模高维高速流数据在线异常检测的方法及系统 - Google Patents
一种大规模高维高速流数据在线异常检测的方法及系统 Download PDFInfo
- Publication number
- CN112988815A CN112988815A CN202110279428.4A CN202110279428A CN112988815A CN 112988815 A CN112988815 A CN 112988815A CN 202110279428 A CN202110279428 A CN 202110279428A CN 112988815 A CN112988815 A CN 112988815A
- Authority
- CN
- China
- Prior art keywords
- data
- matrix
- hash
- sketch
- model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 66
- 238000000034 method Methods 0.000 title claims abstract description 47
- 239000011159 matrix material Substances 0.000 claims abstract description 246
- 230000002159 abnormal effect Effects 0.000 claims abstract description 42
- 238000004364 calculation method Methods 0.000 claims abstract description 21
- 238000012545 processing Methods 0.000 claims abstract description 17
- 230000006870 function Effects 0.000 claims description 22
- 230000005856 abnormality Effects 0.000 claims description 18
- 230000008569 process Effects 0.000 claims description 16
- 238000013507 mapping Methods 0.000 claims description 9
- 238000012549 training Methods 0.000 claims description 6
- 238000000354 decomposition reaction Methods 0.000 claims description 5
- 238000012856 packing Methods 0.000 claims description 3
- 238000005516 engineering process Methods 0.000 abstract description 5
- 238000007418 data mining Methods 0.000 abstract description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000002955 isolation Methods 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 1
- 230000002547 anomalous effect Effects 0.000 description 1
- 230000002457 bidirectional effect Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2455—Query execution
- G06F16/24568—Data stream processing; Continuous queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2465—Query processing support for facilitating data mining operations in structured databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2474—Sequence data queries, e.g. querying versioned data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/243—Classification techniques relating to the number of classes
- G06F18/2433—Single-class perspective, e.g. one-against-all classification; Novelty detection; Outlier detection
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2216/00—Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
- G06F2216/03—Data mining
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Probability & Statistics with Applications (AREA)
- Mathematical Physics (AREA)
- Fuzzy Systems (AREA)
- Software Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110279428.4A CN112988815B (zh) | 2021-03-16 | 2021-03-16 | 一种大规模高维高速流数据在线异常检测的方法及系统 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110279428.4A CN112988815B (zh) | 2021-03-16 | 2021-03-16 | 一种大规模高维高速流数据在线异常检测的方法及系统 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112988815A true CN112988815A (zh) | 2021-06-18 |
CN112988815B CN112988815B (zh) | 2023-09-05 |
Family
ID=76336058
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110279428.4A Active CN112988815B (zh) | 2021-03-16 | 2021-03-16 | 一种大规模高维高速流数据在线异常检测的方法及系统 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112988815B (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114826675A (zh) * | 2022-03-28 | 2022-07-29 | 杭州趣链科技有限公司 | 基于数据块集成分类的网络流量异常检测方法、设备及存储介质 |
CN115563570A (zh) * | 2022-12-05 | 2023-01-03 | 上海飞旗网络技术股份有限公司 | 一种资源的异常检测方法、装置及设备 |
CN116029220A (zh) * | 2023-03-24 | 2023-04-28 | 国网福建省电力有限公司 | 一种电压互感器运行误差评估方法、系统、设备及介质 |
Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6389408B1 (en) * | 1999-06-30 | 2002-05-14 | The United States Of America As Represented By The Secretary Of The Army | Neural network systems for chemical and biological pattern recognition via the Mueller matrix |
WO2002057987A2 (en) * | 2001-01-16 | 2002-07-25 | Infolenz Corporation | System and method for association of object sets |
US20070240061A1 (en) * | 2006-03-29 | 2007-10-11 | Lucent Technologies Inc. | Method for distributed tracking of approximate join size and related summaries |
US7383253B1 (en) * | 2004-12-17 | 2008-06-03 | Coral 8, Inc. | Publish and subscribe capable continuous query processor for real-time data streams |
US20110052000A1 (en) * | 2009-08-31 | 2011-03-03 | Wesley Kenneth Cobb | Detecting anomalous trajectories in a video surveillance system |
CN102299897A (zh) * | 2010-06-23 | 2011-12-28 | 电子科技大学 | 基于特征关联的对等网络特征分析方法 |
US8977627B1 (en) * | 2011-11-01 | 2015-03-10 | Google Inc. | Filter based object detection using hash functions |
CN104731884A (zh) * | 2015-03-11 | 2015-06-24 | 北京航空航天大学 | 一种基于多特征融合的多哈希表的查询方法 |
CN105335975A (zh) * | 2015-10-22 | 2016-02-17 | 西安电子科技大学 | 基于低秩分解和直方图统计的极化sar图像分割方法 |
CN105894336A (zh) * | 2016-05-25 | 2016-08-24 | 北京比邻弘科科技有限公司 | 一种基于移动互联网的大数据挖掘方法及系统 |
CN109871379A (zh) * | 2018-12-10 | 2019-06-11 | 宁波大学 | 一种基于数据块学习的在线哈希最近邻查询方法 |
CN110023991A (zh) * | 2016-12-02 | 2019-07-16 | 皇家飞利浦有限公司 | 用于从对象类中识别对象的装置 |
CN111367187A (zh) * | 2015-08-27 | 2020-07-03 | 雾角系统公司 | 用于改进对分布式网络中的传感器流数据的处理的方法 |
CN112036460A (zh) * | 2020-08-24 | 2020-12-04 | 河海大学 | 一种识别量化控制泉流量潜在因素的方法 |
-
2021
- 2021-03-16 CN CN202110279428.4A patent/CN112988815B/zh active Active
Patent Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6389408B1 (en) * | 1999-06-30 | 2002-05-14 | The United States Of America As Represented By The Secretary Of The Army | Neural network systems for chemical and biological pattern recognition via the Mueller matrix |
WO2002057987A2 (en) * | 2001-01-16 | 2002-07-25 | Infolenz Corporation | System and method for association of object sets |
US7383253B1 (en) * | 2004-12-17 | 2008-06-03 | Coral 8, Inc. | Publish and subscribe capable continuous query processor for real-time data streams |
US20070240061A1 (en) * | 2006-03-29 | 2007-10-11 | Lucent Technologies Inc. | Method for distributed tracking of approximate join size and related summaries |
US20110052000A1 (en) * | 2009-08-31 | 2011-03-03 | Wesley Kenneth Cobb | Detecting anomalous trajectories in a video surveillance system |
CN102299897A (zh) * | 2010-06-23 | 2011-12-28 | 电子科技大学 | 基于特征关联的对等网络特征分析方法 |
US8977627B1 (en) * | 2011-11-01 | 2015-03-10 | Google Inc. | Filter based object detection using hash functions |
CN104731884A (zh) * | 2015-03-11 | 2015-06-24 | 北京航空航天大学 | 一种基于多特征融合的多哈希表的查询方法 |
CN111367187A (zh) * | 2015-08-27 | 2020-07-03 | 雾角系统公司 | 用于改进对分布式网络中的传感器流数据的处理的方法 |
CN105335975A (zh) * | 2015-10-22 | 2016-02-17 | 西安电子科技大学 | 基于低秩分解和直方图统计的极化sar图像分割方法 |
CN105894336A (zh) * | 2016-05-25 | 2016-08-24 | 北京比邻弘科科技有限公司 | 一种基于移动互联网的大数据挖掘方法及系统 |
CN110023991A (zh) * | 2016-12-02 | 2019-07-16 | 皇家飞利浦有限公司 | 用于从对象类中识别对象的装置 |
CN109871379A (zh) * | 2018-12-10 | 2019-06-11 | 宁波大学 | 一种基于数据块学习的在线哈希最近邻查询方法 |
CN112036460A (zh) * | 2020-08-24 | 2020-12-04 | 河海大学 | 一种识别量化控制泉流量潜在因素的方法 |
Non-Patent Citations (12)
Title |
---|
CONG LENG等: "Online Sketching Hashing", 《PROCEEDINGS OF THE IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)》 * |
CONG LENG等: "Online Sketching Hashing", 《PROCEEDINGS OF THE IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)》, 31 December 2015 (2015-12-31), pages 1 - 3 * |
HAO HUANG等: "Streaming Anomaly Detection Using Randomized Matrix Sketching", 《PROCEEDINGS OF THE VLDB ENDOWMEN》 * |
HAO HUANG等: "Streaming Anomaly Detection Using Randomized Matrix Sketching", 《PROCEEDINGS OF THE VLDB ENDOWMEN》, vol. 9, no. 3, 3 November 2015 (2015-11-03), pages 3 - 4 * |
XIN MU 等: "Streaming Classfication with Emerging New Class by Class Matrix Sketching", 《THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE》 * |
XIN MU 等: "Streaming Classfication with Emerging New Class by Class Matrix Sketching", 《THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE》, 13 February 2017 (2017-02-13), pages 2373 - 2379 * |
吴培: "基于矩阵素描和哈希学习的流数据在线异常检测方法研究", 《中国优秀博硕士学位论文全文数据库(硕士)信息科技辑》 * |
吴培: "基于矩阵素描和哈希学习的流数据在线异常检测方法研究", 《中国优秀博硕士学位论文全文数据库(硕士)信息科技辑》, no. 2022, 15 March 2022 (2022-03-15), pages 138 - 821 * |
曹晓莉等: "基于聚类支持向量机的船用污水处理装置故障诊断", 《计算机应用》 * |
曹晓莉等: "基于聚类支持向量机的船用污水处理装置故障诊断", 《计算机应用》, no. 10, 1 October 2008 (2008-10-01), pages 2648 - 2651 * |
潘旭等: "智能配电网多维数据质量评价方法", 《中国电机工程学报》 * |
潘旭等: "智能配电网多维数据质量评价方法", 《中国电机工程学报》, no. 05, 24 January 2018 (2018-01-24), pages 105 - 114 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114826675A (zh) * | 2022-03-28 | 2022-07-29 | 杭州趣链科技有限公司 | 基于数据块集成分类的网络流量异常检测方法、设备及存储介质 |
CN114826675B (zh) * | 2022-03-28 | 2024-05-28 | 杭州趣链科技有限公司 | 基于数据块集成分类的网络流量异常检测方法、设备及存储介质 |
CN115563570A (zh) * | 2022-12-05 | 2023-01-03 | 上海飞旗网络技术股份有限公司 | 一种资源的异常检测方法、装置及设备 |
CN116029220A (zh) * | 2023-03-24 | 2023-04-28 | 国网福建省电力有限公司 | 一种电压互感器运行误差评估方法、系统、设备及介质 |
CN116029220B (zh) * | 2023-03-24 | 2023-07-18 | 国网福建省电力有限公司 | 一种电压互感器运行误差评估方法、系统、设备及介质 |
Also Published As
Publication number | Publication date |
---|---|
CN112988815B (zh) | 2023-09-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112988815B (zh) | 一种大规模高维高速流数据在线异常检测的方法及系统 | |
Kumari et al. | Comparison and analysis of different software cost estimation methods | |
JP2004054370A (ja) | 時系列データに対する自己回帰モデル学習装置並びにそれを用いた外れ値および変化点の検出装置 | |
Zhou et al. | Deep learning enabled cutting tool selection for special-shaped machining features of complex products | |
CN113822284A (zh) | 一种基于边界注意力的rgbd图像语义分割方法 | |
CN108764541B (zh) | 一种结合时空特征和误差处理的风能预测方法 | |
Iturbide et al. | A comparison between LARS and LASSO for initialising the time-series forecasting auto-regressive equations | |
CN114580747A (zh) | 基于数据相关性和模糊系统的异常数据预测方法及系统 | |
Yang et al. | Parallel fractional hot-deck imputation and variance estimation for big incomplete data curing | |
CN115168326A (zh) | Hadoop大数据平台分布式能源数据清洗方法及系统 | |
Li et al. | Multi scale temporal graph networks for skeleton-based action recognition | |
CN118230415A (zh) | 一种图注意力网络驱动的人体异结构动作数据预测方法 | |
Cui | Complex industrial automation data stream mining algorithm based on random Internet of robotic things | |
CN111767324B (zh) | 一种智能关联的自适应数据分析方法及装置 | |
CN113098848A (zh) | 基于矩阵素描和哈希学习的流数据异常检测方法及其系统 | |
Zhang et al. | LIFE: Learning individual features for multivariate time series prediction with missing values | |
CN116821745B (zh) | 智能线切割慢走丝设备的控制方法及其系统 | |
US10339235B1 (en) | Massively parallel processing (MPP) large-scale combination of time series data | |
CN110175287B (zh) | 一种基于Flink的矩阵分解隐式反馈推荐方法和系统 | |
CN113297185A (zh) | 一种特征衍生方法及装置 | |
Ye et al. | Improved SVD algorithm based on Slope One | |
CN113835964B (zh) | 基于小样本学习的云数据中心服务器能耗预测方法 | |
AU2021106594A4 (en) | Online anomaly detection method and system for streaming data | |
CN108717444A (zh) | 一种基于分布式结构的大数据聚类方法和装置 | |
CN115935285A (zh) | 基于掩码图神经网络模型的多元时间序列异常检测方法和系统 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB03 | Change of inventor or designer information |
Inventor after: Fan Xingrong Inventor after: Zhang Xianming Inventor after: Wang Jianhui Inventor after: Guo Zhiwei Inventor after: Zhao Xiaolong Inventor after: Zhao Dujiang Inventor after: Shen Yu Inventor before: Fan Xingrong Inventor before: Wang Jianhui Inventor before: Guo Zhiwei Inventor before: Zhao Xiaolong Inventor before: Zhao Dujiang Inventor before: Shen Yu |
|
CB03 | Change of inventor or designer information | ||
GR01 | Patent grant | ||
GR01 | Patent grant |