JP2004503849A - 電子データを整理する方法および装置 - Google Patents
電子データを整理する方法および装置 Download PDFInfo
- Publication number
- JP2004503849A JP2004503849A JP2002509880A JP2002509880A JP2004503849A JP 2004503849 A JP2004503849 A JP 2004503849A JP 2002509880 A JP2002509880 A JP 2002509880A JP 2002509880 A JP2002509880 A JP 2002509880A JP 2004503849 A JP2004503849 A JP 2004503849A
- Authority
- JP
- Japan
- Prior art keywords
- cluster
- distance
- data
- level
- dataset
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Cash Registers Or Receiving Machines (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP00114636 | 2000-07-07 | ||
EP00115867 | 2000-07-24 | ||
EP00125503A EP1170674A3 (fr) | 2000-07-07 | 2000-11-21 | Procédé et dispostif pour commander des données électroniques |
PCT/EP2001/007801 WO2002005084A2 (fr) | 2000-07-07 | 2001-07-06 | Procede et appareil d'ordonnancement de donnees electroniques |
Publications (1)
Publication Number | Publication Date |
---|---|
JP2004503849A true JP2004503849A (ja) | 2004-02-05 |
Family
ID=27223067
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2002509880A Pending JP2004503849A (ja) | 2000-07-07 | 2001-07-06 | 電子データを整理する方法および装置 |
Country Status (5)
Country | Link |
---|---|
US (1) | US20030145014A1 (fr) |
EP (1) | EP1170674A3 (fr) |
JP (1) | JP2004503849A (fr) |
AU (1) | AU2001272527A1 (fr) |
WO (1) | WO2002005084A2 (fr) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2005063341A (ja) * | 2003-08-20 | 2005-03-10 | Nec Soft Ltd | 集合の動的形成システム、集合の動的形成方法及びそのプログラム |
Families Citing this family (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070233659A1 (en) * | 1998-05-23 | 2007-10-04 | Lg Electronics Inc. | Information auto classification method and information search and analysis method |
US7747624B2 (en) * | 2002-05-10 | 2010-06-29 | Oracle International Corporation | Data summarization |
US7788327B2 (en) | 2002-11-28 | 2010-08-31 | Panasonic Corporation | Device, program and method for assisting in preparing email |
JP4189246B2 (ja) | 2003-03-28 | 2008-12-03 | 日立ソフトウエアエンジニアリング株式会社 | データベース検索経路表示方法 |
JP4189248B2 (ja) | 2003-03-31 | 2008-12-03 | 日立ソフトウエアエンジニアリング株式会社 | データベース検索経路判定方法 |
US20050044487A1 (en) * | 2003-08-21 | 2005-02-24 | Apple Computer, Inc. | Method and apparatus for automatic file clustering into a data-driven, user-specific taxonomy |
DE102005014761A1 (de) * | 2005-03-31 | 2006-10-05 | Siemens Ag | Verfahren zum Anordnen von Objektdaten in elektronischen Karten |
US20070067278A1 (en) * | 2005-09-22 | 2007-03-22 | Gtess Corporation | Data file correlation system and method |
DE602005025881D1 (de) * | 2005-10-21 | 2011-02-24 | Hewlett Packard Development Co | Grafische Anordnung von IT-Netzwerkkomponenten |
US7653659B2 (en) * | 2005-12-12 | 2010-01-26 | International Business Machines Corporation | System for automatic arrangement of portlets on portal pages according to semantical and functional relationship |
EP1860578A1 (fr) * | 2006-05-22 | 2007-11-28 | Caterpillar Inc. | Système d'analyse de brevets |
US7555480B2 (en) * | 2006-07-11 | 2009-06-30 | Microsoft Corporation | Comparatively crawling web page data records relative to a template |
US9298722B2 (en) * | 2009-07-16 | 2016-03-29 | Novell, Inc. | Optimal sequential (de)compression of digital data |
WO2011126491A1 (fr) * | 2010-04-09 | 2011-10-13 | Hewlett-Packard Development Company, L.P. | Groupement de projets et visualisation de relations |
US8832103B2 (en) | 2010-04-13 | 2014-09-09 | Novell, Inc. | Relevancy filter for new data based on underlying files |
US8671111B2 (en) * | 2011-05-31 | 2014-03-11 | International Business Machines Corporation | Determination of rules by providing data records in columnar data structures |
WO2012174639A1 (fr) * | 2011-06-22 | 2012-12-27 | Rogers Communications Inc. | Systèmes et procédés de classement de groupes de documents |
DE102012102797B4 (de) * | 2012-03-30 | 2017-08-10 | Beyo Gmbh | Kamerabasiertes Mobilfunkgerät zur Konvertierung eines Dokuments anhand von aufgenommenen Bildern in ein Format zur optimierten Anzeige auf dem kamerabasierten Mobilfunkgerät |
US9336302B1 (en) | 2012-07-20 | 2016-05-10 | Zuci Realty Llc | Insight and algorithmic clustering for automated synthesis |
US20140164376A1 (en) * | 2012-12-06 | 2014-06-12 | Microsoft Corporation | Hierarchical string clustering on diagnostic logs |
US9355105B2 (en) * | 2012-12-19 | 2016-05-31 | International Business Machines Corporation | Indexing of large scale patient set |
US10572926B1 (en) * | 2013-01-31 | 2020-02-25 | Amazon Technologies, Inc. | Using artificial intelligence to efficiently identify significant items in a database |
US9471662B2 (en) | 2013-06-24 | 2016-10-18 | Sap Se | Homogeneity evaluation of datasets |
US10290092B2 (en) * | 2014-05-15 | 2019-05-14 | Applied Materials Israel, Ltd | System, a method and a computer program product for fitting based defect detection |
US10605842B2 (en) * | 2016-06-21 | 2020-03-31 | International Business Machines Corporation | Noise spectrum analysis for electronic device |
US10444945B1 (en) | 2016-10-10 | 2019-10-15 | United Services Automobile Association | Systems and methods for ingesting and parsing datasets generated from disparate data sources |
US11205103B2 (en) | 2016-12-09 | 2021-12-21 | The Research Foundation for the State University | Semisupervised autoencoder for sentiment analysis |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5040133A (en) * | 1990-01-12 | 1991-08-13 | Hughes Aircraft Company | Adaptive clusterer |
US5442778A (en) * | 1991-11-12 | 1995-08-15 | Xerox Corporation | Scatter-gather: a cluster-based method and apparatus for browsing large document collections |
US5483650A (en) * | 1991-11-12 | 1996-01-09 | Xerox Corporation | Method of constant interaction-time clustering applied to document browsing |
US5710916A (en) * | 1994-05-24 | 1998-01-20 | Panasonic Technologies, Inc. | Method and apparatus for similarity matching of handwritten data objects |
US5787422A (en) * | 1996-01-11 | 1998-07-28 | Xerox Corporation | Method and apparatus for information accesss employing overlapping clusters |
US5933823A (en) * | 1996-03-01 | 1999-08-03 | Ricoh Company Limited | Image database browsing and query using texture analysis |
US5926812A (en) * | 1996-06-20 | 1999-07-20 | Mantra Technologies, Inc. | Document extraction and comparison method with applications to automatic personalized database searching |
US5848404A (en) * | 1997-03-24 | 1998-12-08 | International Business Machines Corporation | Fast query search in large dimension database |
US6012058A (en) * | 1998-03-17 | 2000-01-04 | Microsoft Corporation | Scalable system for K-means clustering of large databases |
US6842876B2 (en) * | 1998-04-14 | 2005-01-11 | Fuji Xerox Co., Ltd. | Document cache replacement policy for automatically generating groups of documents based on similarity of content |
JP3855551B2 (ja) * | 1999-08-25 | 2006-12-13 | 株式会社日立製作所 | 検索方法及び検索システム |
US6584456B1 (en) * | 2000-06-19 | 2003-06-24 | International Business Machines Corporation | Model selection in machine learning with applications to document clustering |
-
2000
- 2000-11-21 EP EP00125503A patent/EP1170674A3/fr not_active Withdrawn
-
2001
- 2001-07-06 WO PCT/EP2001/007801 patent/WO2002005084A2/fr active Application Filing
- 2001-07-06 JP JP2002509880A patent/JP2004503849A/ja active Pending
- 2001-07-06 US US10/332,234 patent/US20030145014A1/en not_active Abandoned
- 2001-07-06 AU AU2001272527A patent/AU2001272527A1/en not_active Abandoned
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2005063341A (ja) * | 2003-08-20 | 2005-03-10 | Nec Soft Ltd | 集合の動的形成システム、集合の動的形成方法及びそのプログラム |
Also Published As
Publication number | Publication date |
---|---|
WO2002005084A2 (fr) | 2002-01-17 |
WO2002005084A3 (fr) | 2002-04-25 |
US20030145014A1 (en) | 2003-07-31 |
EP1170674A3 (fr) | 2002-04-17 |
AU2001272527A1 (en) | 2002-01-21 |
EP1170674A2 (fr) | 2002-01-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP2004503849A (ja) | 電子データを整理する方法および装置 | |
US8332439B2 (en) | Automatically generating a hierarchy of terms | |
JP5391633B2 (ja) | オントロジー空間を規定するタームの推奨 | |
US9317593B2 (en) | Modeling topics using statistical distributions | |
JP5391634B2 (ja) | 文書の段落分析によるその文書のタグの選択 | |
US7409404B2 (en) | Creating taxonomies and training data for document categorization | |
JP5353173B2 (ja) | 文書の具体性の決定 | |
US20020087567A1 (en) | Unified binary model and methodology for knowledge representation and for data and information mining | |
US20090327259A1 (en) | Automatic concept clustering | |
JP5391632B2 (ja) | ワードと文書の深さの決定 | |
EP4165487A1 (fr) | Architecture d'analyse de document | |
Rossi et al. | Building a topic hierarchy using the bag-of-related-words representation | |
Salih et al. | Semantic Document Clustering using K-means algorithm and Ward's Method | |
JP4426041B2 (ja) | カテゴリ因子による情報検索方法 | |
Bouakkaz et al. | OLAP textual aggregation approach using the Google similarity distance | |
D’hondt et al. | Topic identification based on document coherence and spectral analysis | |
Sarmento et al. | An approach to web-scale named-entity disambiguation | |
Kadhim et al. | Combined chi-square with k-means for document clustering | |
Moradi | Small-world networks for summarization of biomedical articles | |
Irshad et al. | SwCS: Section-Wise Content Similarity Approach to Exploit Scientific Big Data. | |
Lee et al. | A classifier-based text mining approach for evaluating semantic relatedness using support vector machines | |
Bhopale et al. | Optimised Clustering Based Approach for Healthcare Data Analytics. | |
Kanaan et al. | kNN Arabic text categorization using IG feature selection | |
Bellandi et al. | A Comparative Study of Clustering Techniques Applied on Covid-19 Scientific Literature | |
Nowak-Brzezińska | Feature Selection Approach for Rule-Based Knowledge Bases |