CN1581166A - 通过在线和离线组件聚类进化数据流的方法和设备 - Google Patents
通过在线和离线组件聚类进化数据流的方法和设备 Download PDFInfo
- Publication number
- CN1581166A CN1581166A CNA2004100563262A CN200410056326A CN1581166A CN 1581166 A CN1581166 A CN 1581166A CN A2004100563262 A CNA2004100563262 A CN A2004100563262A CN 200410056326 A CN200410056326 A CN 200410056326A CN 1581166 A CN1581166 A CN 1581166A
- Authority
- CN
- China
- Prior art keywords
- statistical information
- data
- online
- cluster
- group
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2455—Query execution
- G06F16/24568—Data stream processing; Continuous queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99943—Generating database or data structure, e.g. via user interface
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
Claims (25)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/641,951 | 2003-08-14 | ||
US10/641,951 US7353218B2 (en) | 2003-08-14 | 2003-08-14 | Methods and apparatus for clustering evolving data streams through online and offline components |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1581166A true CN1581166A (zh) | 2005-02-16 |
CN100416560C CN100416560C (zh) | 2008-09-03 |
Family
ID=34136487
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2004100563262A Expired - Fee Related CN100416560C (zh) | 2003-08-14 | 2004-08-06 | 通过在线和离线组件聚类进化数据流的方法和设备 |
Country Status (3)
Country | Link |
---|---|
US (2) | US7353218B2 (zh) |
JP (1) | JP5089854B2 (zh) |
CN (1) | CN100416560C (zh) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN100442287C (zh) * | 2005-04-20 | 2008-12-10 | 国际商业机器公司 | 处理数据流的方法和设备 |
CN102495938A (zh) * | 2011-10-19 | 2012-06-13 | 武汉科技大学 | 对含噪声点的实时数据流进行聚类和聚类边界界定的方法 |
CN102693361A (zh) * | 2012-05-07 | 2012-09-26 | 北京航空航天大学 | 一种大数据量的趋势曲线绘制方法 |
CN107315760A (zh) * | 2012-04-05 | 2017-11-03 | 微软技术许可有限责任公司 | 用于连续图更新和计算的平台 |
CN108475218A (zh) * | 2016-01-14 | 2018-08-31 | 起元技术有限责任公司 | 可恢复流处理 |
Families Citing this family (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7937269B2 (en) * | 2005-08-22 | 2011-05-03 | International Business Machines Corporation | Systems and methods for providing real-time classification of continuous data streams |
US7421452B2 (en) * | 2006-06-14 | 2008-09-02 | International Business Machines Corporation | Method and apparatus for predicting future behavior of data streams |
WO2008034213A1 (en) * | 2006-09-18 | 2008-03-27 | Infobright Inc. | A method and system for data compression in a relational database |
US8266147B2 (en) * | 2006-09-18 | 2012-09-11 | Infobright, Inc. | Methods and systems for database organization |
US8195734B1 (en) | 2006-11-27 | 2012-06-05 | The Research Foundation Of State University Of New York | Combining multiple clusterings by soft correspondence |
JP4990696B2 (ja) * | 2007-06-27 | 2012-08-01 | 株式会社日立製作所 | ストリームデータの処理方法およびストリームデータ処理システム |
US20090171902A1 (en) * | 2007-12-28 | 2009-07-02 | Microsoft Corporation | Life recorder |
JP5647602B2 (ja) * | 2009-04-27 | 2015-01-07 | パナソニック インテレクチュアル プロパティ コーポレーション オブアメリカPanasonic Intellectual Property Corporation of America | データ処理装置、データ処理方法、プログラム、及び集積回路 |
US9141300B2 (en) * | 2009-09-22 | 2015-09-22 | Emc Corporation | Performance improvement of a capacity optimized storage system using a performance segment storage system and a segment storage system |
US8533318B2 (en) * | 2009-10-06 | 2013-09-10 | International Business Machines Corporation | Processing and presenting multi-dimensioned transaction tracking data |
US9195713B2 (en) * | 2009-11-08 | 2015-11-24 | Hewlett-Packard Development Company, L.P. | Outlier data point detection |
US8417727B2 (en) | 2010-06-14 | 2013-04-09 | Infobright Inc. | System and method for storing data in a relational database |
US8521748B2 (en) | 2010-06-14 | 2013-08-27 | Infobright Inc. | System and method for managing metadata in a relational database |
US9165051B2 (en) * | 2010-08-24 | 2015-10-20 | Board Of Trustees Of The University Of Illinois | Systems and methods for detecting a novel data class |
US20130140887A1 (en) * | 2010-12-09 | 2013-06-06 | Sanyo Electric Co., Ltd. | Clustering method, optimization method using the same, power supply control device |
JP5946423B2 (ja) * | 2013-04-26 | 2016-07-06 | インターナショナル・ビジネス・マシーンズ・コーポレーションInternational Business Machines Corporation | システム・ログの分類方法、プログラム及びシステム |
US9411632B2 (en) | 2013-05-30 | 2016-08-09 | Qualcomm Incorporated | Parallel method for agglomerative clustering of non-stationary data |
CN104699702A (zh) * | 2013-12-09 | 2015-06-10 | 中国银联股份有限公司 | 数据挖掘及分类方法 |
US10496921B2 (en) | 2016-05-03 | 2019-12-03 | Fujitsu Limited | Neural network mapping dictionary generation |
US11461372B1 (en) | 2021-03-18 | 2022-10-04 | Bae Systems Information And Electronic Systems Integration Inc. | Data clustering in logic devices using unsupervised learning |
Family Cites Families (31)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5257206A (en) * | 1991-04-08 | 1993-10-26 | Praxair Technology, Inc. | Statistical process control for air separation process |
DE69218912T2 (de) * | 1991-08-28 | 1997-10-09 | Becton Dickinson Co | Schwerkraftsattraktionsmaschine zur anpassungsfähigen autoclusterbildung n-dimensionaler datenströme |
US5765166A (en) * | 1996-04-23 | 1998-06-09 | Raytheon Company | Use of symmetric multiprocessors for multiple hypothesis tracking |
US5832182A (en) * | 1996-04-24 | 1998-11-03 | Wisconsin Alumni Research Foundation | Method and system for data clustering for very large databases |
US6026397A (en) * | 1996-05-22 | 2000-02-15 | Electronic Data Systems Corporation | Data analysis system and method |
US6134532A (en) * | 1997-11-14 | 2000-10-17 | Aptex Software, Inc. | System and method for optimal adaptive matching of users to most relevant entity and information in real-time |
US6012058A (en) * | 1998-03-17 | 2000-01-04 | Microsoft Corporation | Scalable system for K-means clustering of large databases |
US20030154072A1 (en) * | 1998-03-31 | 2003-08-14 | Scansoft, Inc., A Delaware Corporation | Call analysis |
US6092072A (en) * | 1998-04-07 | 2000-07-18 | Lucent Technologies, Inc. | Programmed medium for clustering large databases |
US6393460B1 (en) * | 1998-08-28 | 2002-05-21 | International Business Machines Corporation | Method and system for informing users of subjects of discussion in on-line chats |
US6006259A (en) * | 1998-11-20 | 1999-12-21 | Network Alchemy, Inc. | Method and apparatus for an internet protocol (IP) network clustering system |
US6564197B2 (en) * | 1999-05-03 | 2003-05-13 | E.Piphany, Inc. | Method and apparatus for scalable probabilistic clustering using decision trees |
JP3562572B2 (ja) * | 2000-05-02 | 2004-09-08 | インターナショナル・ビジネス・マシーンズ・コーポレーション | データベースのドキュメントにおける新規な事項・新規クラスの検出及び追跡 |
US7162482B1 (en) * | 2000-05-03 | 2007-01-09 | Musicmatch, Inc. | Information retrieval engine |
JP3606556B2 (ja) * | 2000-05-16 | 2005-01-05 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 情報整理方法、情報処理装置、記憶媒体、およびプログラム伝送装置 |
DE60116877T2 (de) * | 2000-08-11 | 2006-09-14 | British Telecommunications P.L.C. | System und verfahren zum erfassen von ereignissen |
US7003509B2 (en) * | 2003-07-21 | 2006-02-21 | Leonid Andreev | High-dimensional data clustering with the use of hybrid similarity matrices |
US6772375B1 (en) * | 2000-12-22 | 2004-08-03 | Network Appliance, Inc. | Auto-detection of limiting factors in a TCP connection |
US7194454B2 (en) * | 2001-03-12 | 2007-03-20 | Lucent Technologies | Method for organizing records of database search activity by topical relevance |
JP2002304400A (ja) * | 2001-04-03 | 2002-10-18 | Ricoh Co Ltd | 文書分類装置 |
US6915241B2 (en) * | 2001-04-20 | 2005-07-05 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Method for segmentation and identification of nonstationary time series |
JP2003044491A (ja) * | 2001-07-30 | 2003-02-14 | Toshiba Corp | 知識分析システムならびに同システムにおける分析条件設定方法、分析条件保存方法および再分析処理方法 |
KR100518781B1 (ko) * | 2001-10-17 | 2005-10-06 | 한국과학기술원 | 하이퍼사각형 기반의 다차원 데이터 세그먼테이션 장치,클러스터링 장치 및 그 방법 |
KR100483321B1 (ko) * | 2001-10-17 | 2005-04-15 | 한국과학기술원 | 하이퍼사각형 기반의 다차원 데이터 세그먼테이션을이용한 유사성 검색 장치와 그 방법 |
US7028230B2 (en) * | 2001-11-05 | 2006-04-11 | Nokia Corporation | Partially filling block interleaver for a communication system |
US6801917B2 (en) * | 2001-11-13 | 2004-10-05 | Koninklijke Philips Electronics N.V. | Method and apparatus for partitioning a plurality of items into groups of similar items in a recommender of such items |
US20040103013A1 (en) * | 2002-11-25 | 2004-05-27 | Joel Jameson | Optimal scenario forecasting, risk sharing, and risk trading |
US6765532B2 (en) * | 2002-12-17 | 2004-07-20 | Bae Systems Information And Electronic Systems Integration Inc. | Wideband signal detection and tracking system |
US6947933B2 (en) * | 2003-01-23 | 2005-09-20 | Verdasys, Inc. | Identifying similarities within large collections of unstructured data |
US7557805B2 (en) * | 2003-04-01 | 2009-07-07 | Battelle Memorial Institute | Dynamic visualization of data streams |
US7089266B2 (en) * | 2003-06-02 | 2006-08-08 | The Board Of Trustees Of The Leland Stanford Jr. University | Computer systems and methods for the query and visualization of multidimensional databases |
-
2003
- 2003-08-14 US US10/641,951 patent/US7353218B2/en active Active
-
2004
- 2004-08-06 CN CNB2004100563262A patent/CN100416560C/zh not_active Expired - Fee Related
- 2004-08-11 JP JP2004234267A patent/JP5089854B2/ja not_active Expired - Fee Related
-
2007
- 2007-05-30 US US11/755,473 patent/US20070226209A1/en not_active Abandoned
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN100442287C (zh) * | 2005-04-20 | 2008-12-10 | 国际商业机器公司 | 处理数据流的方法和设备 |
US7739284B2 (en) | 2005-04-20 | 2010-06-15 | International Business Machines Corporation | Method and apparatus for processing data streams |
CN102495938A (zh) * | 2011-10-19 | 2012-06-13 | 武汉科技大学 | 对含噪声点的实时数据流进行聚类和聚类边界界定的方法 |
CN107315760A (zh) * | 2012-04-05 | 2017-11-03 | 微软技术许可有限责任公司 | 用于连续图更新和计算的平台 |
CN102693361A (zh) * | 2012-05-07 | 2012-09-26 | 北京航空航天大学 | 一种大数据量的趋势曲线绘制方法 |
CN102693361B (zh) * | 2012-05-07 | 2014-11-26 | 北京航空航天大学 | 一种大数据量的趋势曲线绘制方法 |
CN108475218A (zh) * | 2016-01-14 | 2018-08-31 | 起元技术有限责任公司 | 可恢复流处理 |
Also Published As
Publication number | Publication date |
---|---|
JP2005100363A (ja) | 2005-04-14 |
CN100416560C (zh) | 2008-09-03 |
US7353218B2 (en) | 2008-04-01 |
US20070226209A1 (en) | 2007-09-27 |
JP5089854B2 (ja) | 2012-12-05 |
US20050038769A1 (en) | 2005-02-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN100416560C (zh) | 通过在线和离线组件聚类进化数据流的方法和设备 | |
Tao et al. | Spatio-temporal aggregation using sketches | |
Gan et al. | Moment-based quantile sketches for efficient high cardinality aggregation queries | |
Khalilian et al. | Data stream clustering by divide and conquer approach based on vector model | |
Phillips | Acceleration of k-means and related clustering algorithms | |
US20100179855A1 (en) | Large-Scale Behavioral Targeting for Advertising over a Network | |
KR100385528B1 (ko) | 다차원 데이터 표시 방법 및 기록 매체 | |
CN1855097A (zh) | 处理数据流的方法和设备 | |
US10210280B2 (en) | In-memory database search optimization using graph community structure | |
Gama et al. | Data stream processing | |
Gaber et al. | Data stream mining | |
Youn et al. | Efficient data stream clustering with sliding windows based on locality-sensitive hashing | |
US20110213740A1 (en) | System and method for resource adaptive classification of data streams | |
CN110334290B (zh) | 一种基于MF-Octree的时空数据快速检索方法 | |
US11567952B2 (en) | Systems and methods for accelerating exploratory statistical analysis | |
Elnekave et al. | Incremental clustering of mobile objects | |
CN114329094A (zh) | 一种基于Spark的大规模高维数据近似近邻查询系统和方法 | |
US20210117447A1 (en) | Adaptive data clustering for databases | |
Sun et al. | Spatio-temporal join selectivity | |
Ahsani et al. | Improvement of CluStream algorithm using sliding window for the clustering of data streams | |
CN108536823B (zh) | 一种物联网感知大数据的缓存设计和查询方法 | |
Gothwal et al. | The survey on skyline query processing for data-specific applications | |
Liu | Approximate Query Processing. | |
Brönnimann et al. | Efficient data-reduction methods for on-line association rule discovery | |
Wu et al. | NEIST: A neural-enhanced index for spatio-temporal queries |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: GOOGLE INC. Free format text: FORMER OWNER: INTERNATIONAL BUSINESS MACHINES CORP. Effective date: 20120503 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20120503 Address after: American California Patentee after: Google Inc. Address before: American New York Patentee before: International Business Machines Corp. |
|
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20080903 Termination date: 20170806 |
|
CF01 | Termination of patent right due to non-payment of annual fee |