CA2348420C - Data display method and apparatus for use in text mining - Google Patents
Data display method and apparatus for use in text mining Download PDFInfo
- Publication number
- CA2348420C CA2348420C CA002348420A CA2348420A CA2348420C CA 2348420 C CA2348420 C CA 2348420C CA 002348420 A CA002348420 A CA 002348420A CA 2348420 A CA2348420 A CA 2348420A CA 2348420 C CA2348420 C CA 2348420C
- Authority
- CA
- Canada
- Prior art keywords
- phrases
- words
- components
- analysis axis
- mining
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/313—Selection or weighting of terms for indexing
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99944—Object-oriented database structure
- Y10S707/99945—Object-oriented database structure processing
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99948—Application of database or data structure, e.g. distributed, multimedia, or image
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2001-042690 | 2001-02-20 | ||
| JP2001042690A JP2002245070A (ja) | 2001-02-20 | 2001-02-20 | データ表示方法及び装置並びにその処理プログラムを記憶した媒体 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CA2348420A1 CA2348420A1 (en) | 2002-08-20 |
| CA2348420C true CA2348420C (en) | 2006-07-11 |
Family
ID=18904949
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CA002348420A Expired - Fee Related CA2348420C (en) | 2001-02-20 | 2001-06-05 | Data display method and apparatus for use in text mining |
Country Status (4)
| Country | Link |
|---|---|
| US (1) | US6738786B2 (enExample) |
| EP (1) | EP1233349A3 (enExample) |
| JP (1) | JP2002245070A (enExample) |
| CA (1) | CA2348420C (enExample) |
Families Citing this family (23)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP4116329B2 (ja) * | 2002-05-27 | 2008-07-09 | 株式会社日立製作所 | 文書情報表示システム、文書情報表示方法及び文書検索方法 |
| GB2390704A (en) | 2002-07-09 | 2004-01-14 | Canon Kk | Automatic summary generation and display |
| GB2399427A (en) * | 2003-03-12 | 2004-09-15 | Canon Kk | Apparatus for and method of summarising text |
| US7613731B1 (en) * | 2003-06-11 | 2009-11-03 | Quantum Reader, Inc. | Method of analysis, abstraction, and delivery of electronic information |
| US7346839B2 (en) | 2003-09-30 | 2008-03-18 | Google Inc. | Information retrieval based on historical data |
| US7689433B2 (en) * | 2004-08-13 | 2010-03-30 | Accenture Global Services Gmbh | Active relationship management |
| US20060074928A1 (en) * | 2004-09-28 | 2006-04-06 | Microsoft Corporation | Selection based container listing |
| US8745054B1 (en) | 2005-11-30 | 2014-06-03 | At&T Intellectual Property Ii, L.P. | Method and apparatus for large volume text summary and visualization |
| JP4761460B2 (ja) * | 2006-05-01 | 2011-08-31 | コニカミノルタビジネステクノロジーズ株式会社 | 検索装置による情報検索方法、情報検索装置及び情報検索処理プログラム |
| US20080288488A1 (en) * | 2007-05-15 | 2008-11-20 | Iprm Intellectual Property Rights Management Ag C/O Dr. Hans Durrer | Method and system for determining trend potentials |
| US8825693B2 (en) * | 2007-12-12 | 2014-09-02 | Trend Micro Incorporated | Conditional string search |
| US8176419B2 (en) * | 2007-12-19 | 2012-05-08 | Microsoft Corporation | Self learning contextual spell corrector |
| JPWO2009101954A1 (ja) * | 2008-02-15 | 2011-06-09 | 日本電気株式会社 | テキスト情報分析システム |
| JP5153390B2 (ja) * | 2008-03-07 | 2013-02-27 | 富士フイルム株式会社 | 関連語辞書作成方法及び装置、並びに関連語辞書作成プログラム |
| US8577884B2 (en) * | 2008-05-13 | 2013-11-05 | The Boeing Company | Automated analysis and summarization of comments in survey response data |
| JP5330046B2 (ja) * | 2009-03-23 | 2013-10-30 | 株式会社東芝 | 共起表現抽出装置及び共起表現抽出方法 |
| US20100299132A1 (en) * | 2009-05-22 | 2010-11-25 | Microsoft Corporation | Mining phrase pairs from an unstructured resource |
| US20110044447A1 (en) * | 2009-08-21 | 2011-02-24 | Nexidia Inc. | Trend discovery in audio signals |
| JPWO2011118428A1 (ja) * | 2010-03-26 | 2013-07-04 | 日本電気株式会社 | 要求獲得システム、要求獲得方法、及び要求獲得用プログラム |
| JP6166980B2 (ja) * | 2013-08-02 | 2017-07-19 | エヌ・ティ・ティ・コムウェア株式会社 | 情報処理装置、情報処理方法、および情報処理プログラム |
| US10733221B2 (en) * | 2016-03-30 | 2020-08-04 | Microsoft Technology Licensing, Llc | Scalable mining of trending insights from text |
| CN108346474B (zh) * | 2018-03-14 | 2021-09-28 | 湖南省蓝蜻蜓网络科技有限公司 | 基于单词的类内分布与类间分布的电子病历特征选择方法 |
| US12277389B2 (en) | 2021-05-10 | 2025-04-15 | International Business Machines Corporation | Text mining based on document structure information extraction |
Family Cites Families (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0437615B1 (en) | 1989-06-14 | 1998-10-21 | Hitachi, Ltd. | Hierarchical presearch-type document retrieval method, apparatus therefor, and magnetic disc device for this apparatus |
| JP2987099B2 (ja) | 1996-03-27 | 1999-12-06 | 株式会社日立国際ビジネス | 文書作成支援システム及び用語辞書 |
| US6038561A (en) * | 1996-10-15 | 2000-03-14 | Manning & Napier Information Services | Management and analysis of document information text |
| WO1999005614A1 (en) * | 1997-07-23 | 1999-02-04 | Datops S.A. | Information mining tool |
| US6006223A (en) * | 1997-08-12 | 1999-12-21 | International Business Machines Corporation | Mapping words, phrases using sequential-pattern to find user specific trends in a text database |
| AU1108199A (en) * | 1997-10-22 | 1999-05-10 | Glaxo Group Limited | Computer thesaurus manager |
| US6446061B1 (en) * | 1998-07-31 | 2002-09-03 | International Business Machines Corporation | Taxonomy generation for document collections |
| US6212532B1 (en) * | 1998-10-22 | 2001-04-03 | International Business Machines Corporation | Text categorization toolkit |
| JP4025443B2 (ja) * | 1998-12-04 | 2007-12-19 | 富士通株式会社 | 文書データ提供装置及び文書データ提供方法 |
| US6510406B1 (en) * | 1999-03-23 | 2003-01-21 | Mathsoft, Inc. | Inverse inference engine for high performance web search |
| US6611825B1 (en) * | 1999-06-09 | 2003-08-26 | The Boeing Company | Method and system for text mining using multidimensional subspaces |
| US6388592B1 (en) * | 2001-01-18 | 2002-05-14 | International Business Machines Corporation | Using simulated pseudo data to speed up statistical predictive modeling from massive data sets |
-
2001
- 2001-02-20 JP JP2001042690A patent/JP2002245070A/ja active Pending
- 2001-06-05 EP EP01113754A patent/EP1233349A3/en not_active Withdrawn
- 2001-06-05 CA CA002348420A patent/CA2348420C/en not_active Expired - Fee Related
- 2001-06-06 US US09/874,005 patent/US6738786B2/en not_active Expired - Fee Related
Also Published As
| Publication number | Publication date |
|---|---|
| CA2348420A1 (en) | 2002-08-20 |
| US6738786B2 (en) | 2004-05-18 |
| JP2002245070A (ja) | 2002-08-30 |
| EP1233349A2 (en) | 2002-08-21 |
| US20020116398A1 (en) | 2002-08-22 |
| EP1233349A3 (en) | 2004-10-13 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CA2348420C (en) | Data display method and apparatus for use in text mining | |
| CN111401045B (zh) | 一种文本生成方法、装置、存储介质和电子设备 | |
| CA2423033C (en) | A document categorisation system | |
| EP1124189A1 (en) | Document sorting method, document sorter, and recorded medium on which document sorting program is recorded | |
| JPH11110416A (ja) | データベースからドキュメントを検索するための方法および装置 | |
| US20030233350A1 (en) | System and method for electronic catalog classification using a hybrid of rule based and statistical method | |
| WO1999034307A1 (en) | Extraction server for unstructured documents | |
| Huffman | Language-independent document categorization by N-grams | |
| CN111052123A (zh) | 同义词辞典制作装置、同义词辞典制作程序以及同义词辞典制作方法 | |
| US20040128292A1 (en) | Search data management | |
| WO2000026839A9 (en) | Advanced model for automatic extraction of skill and knowledge information from an electronic document | |
| JP3583631B2 (ja) | 情報マイニング方法、情報マイニング装置、および情報マイニングプログラムを記録したコンピュータ読み取り可能な記録媒体 | |
| JP3198932B2 (ja) | 文書検索装置 | |
| TWI396990B (zh) | 引用文獻記錄擷取系統、方法及程式產品 | |
| Torrisi et al. | Automated bundle pagination using machine learning | |
| Kumova Metin et al. | Collocation extraction in Turkish texts using statistical methods | |
| JPH11143902A (ja) | n−gramを用いた類似文書検索方法 | |
| JP2002288189A (ja) | 文書分類方法及び文書分類装置並びに文書分類処理プログラムを記録した記録媒体 | |
| Smith et al. | SWIFT: A software program for the analysis of written comments | |
| Waegel | The Development of Text-Mining Tools and Algorithms | |
| Mahdi et al. | A citation-based approach to automatic topical indexing of scientific literature | |
| Dandapat et al. | Statistical investigation of Bengali noun-verb (NV) collocations as multi-word-expressions | |
| KR101088483B1 (ko) | 이종 분류체계들을 매핑시키는 방법 및 장치 | |
| Taghva et al. | Results and implications of the noisy data projects | |
| Svatek et al. | URL as starting point for WWW document categorization. |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| EEER | Examination request | ||
| MKLA | Lapsed |