CA2348420C - Method and apparatus for displaying data for text analysis

Publication number
CA2348420C
Authority
CA
Canada
Prior art keywords
phrases
words
components
analysis axis
mining
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CA002348420A
Other languages
English (en)
Other versions
CA2348420A1 (fr)
Inventor
Natsuko Sugaya
Katsumi Tada
Yoshifumi Sato
Tadataka Matsubayashi
Yasuhiko Inaba
Mikihiko Tokunaga
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hitachi Ltd
Original Assignee
Hitachi Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hitachi Ltd
Publication of CA2348420A1
Application granted
Publication of CA2348420C
Anticipated expiration
Expired - Fee Related

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31 Indexing; Data structures therefor; Storage structures
    • G06F16/313 Selection or weighting of terms for indexing
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/99941 Database schema or data structure
    • Y10S707/99944 Object-oriented database structure
    • Y10S707/99945 Object-oriented database structure processing
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/99941 Database schema or data structure
    • Y10S707/99948 Application of database or data structure, e.g. distributed, multimedia, or image

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

CA002348420A 2001-02-20 2001-06-05 Method and apparatus for displaying data for text analysis Expired - Fee Related CA2348420C (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2001-042690 2001-02-20
JP2001042690A JP2002245070A (ja) 2001-02-20 2001-02-20 Data display method and apparatus, and medium storing the processing program therefor

Publications (2)

Publication Number Publication Date
CA2348420A1 (fr) 2002-08-20
CA2348420C (fr) 2006-07-11

Family

ID=18904949

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002348420A Expired - Fee Related CA2348420C (fr) 2001-02-20 2001-06-05 Method and apparatus for displaying data for text analysis

Country Status (4)

Country Link
US (1) US6738786B2 (fr)
EP (1) EP1233349A3 (fr)
JP (1) JP2002245070A (fr)
CA (1) CA2348420C (fr)

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4116329B2 (ja) * 2002-05-27 2008-07-09 株式会社日立製作所 Document information display system, document information display method, and document search method
GB2390704A (en) 2002-07-09 2004-01-14 Canon Kk Automatic summary generation and display
GB2399427A (en) * 2003-03-12 2004-09-15 Canon Kk Apparatus for and method of summarising text
US7613731B1 (en) * 2003-06-11 2009-11-03 Quantum Reader, Inc. Method of analysis, abstraction, and delivery of electronic information
US7346839B2 (en) 2003-09-30 2008-03-18 Google Inc. Information retrieval based on historical data
US7689433B2 (en) 2004-08-13 2010-03-30 Accenture Global Services Gmbh Active relationship management
US20060074928A1 (en) * 2004-09-28 2006-04-06 Microsoft Corporation Selection based container listing
US8745054B1 (en) 2005-11-30 2014-06-03 At&T Intellectual Property Ii, L.P. Method and apparatus for large volume text summary and visualization
JP4761460B2 (ja) * 2006-05-01 2011-08-31 コニカミノルタビジネステクノロジーズ株式会社 Information retrieval method using a retrieval device, information retrieval device, and information retrieval processing program
US20080288488A1 (en) * 2007-05-15 2008-11-20 Iprm Intellectual Property Rights Management Ag C/O Dr. Hans Durrer Method and system for determining trend potentials
US8825693B2 (en) * 2007-12-12 2014-09-02 Trend Micro Incorporated Conditional string search
US8176419B2 (en) * 2007-12-19 2012-05-08 Microsoft Corporation Self learning contextual spell corrector
WO2009101954A1 (fr) * 2008-02-15 2009-08-20 Nec Corporation Text information analysis system
JP5153390B2 (ja) * 2008-03-07 2013-02-27 富士フイルム株式会社 Related-word dictionary creation method and apparatus, and related-word dictionary creation program
US8577884B2 (en) * 2008-05-13 2013-11-05 The Boeing Company Automated analysis and summarization of comments in survey response data
JP5330046B2 (ja) * 2009-03-23 2013-10-30 株式会社東芝 Co-occurrence expression extraction device and co-occurrence expression extraction method
US20100299132A1 (en) * 2009-05-22 2010-11-25 Microsoft Corporation Mining phrase pairs from an unstructured resource
US20110044447A1 (en) * 2009-08-21 2011-02-24 Nexidia Inc. Trend discovery in audio signals
US9262394B2 (en) * 2010-03-26 2016-02-16 Nec Corporation Document content analysis and abridging apparatus
JP6166980B2 (ja) * 2013-08-02 2017-07-19 エヌ・ティ・ティ・コムウェア株式会社 Information processing device, information processing method, and information processing program
US10733221B2 (en) * 2016-03-30 2020-08-04 Microsoft Technology Licensing, Llc Scalable mining of trending insights from text
CN108346474B (zh) * 2018-03-14 2021-09-28 湖南省蓝蜻蜓网络科技有限公司 Electronic medical record feature selection method based on the intra-class and inter-class distributions of words

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1990016036A1 (fr) 1989-06-14 1990-12-27 Hitachi, Ltd. Document search method with hierarchical presearch, apparatus therefor, and magnetic disk device for the apparatus
JP2987099B2 (ja) 1996-03-27 1999-12-06 株式会社日立国際ビジネス Document creation support system and term dictionary
US6038561A (en) * 1996-10-15 2000-03-14 Manning & Napier Information Services Management and analysis of document information text
WO1999005614A1 (fr) * 1997-07-23 1999-02-04 Datops S.A. Information extraction tool
US6006223A (en) * 1997-08-12 1999-12-21 International Business Machines Corporation Mapping words, phrases using sequential-pattern to find user specific trends in a text database
AU1108199A (en) * 1997-10-22 1999-05-10 Glaxo Group Limited Computer thesaurus manager
US6446061B1 (en) * 1998-07-31 2002-09-03 International Business Machines Corporation Taxonomy generation for document collections
US6212532B1 (en) * 1998-10-22 2001-04-03 International Business Machines Corporation Text categorization toolkit
JP4025443B2 (ja) * 1998-12-04 2007-12-19 富士通株式会社 Document data providing device and document data providing method
US6510406B1 (en) * 1999-03-23 2003-01-21 Mathsoft, Inc. Inverse inference engine for high performance web search
US6611825B1 (en) * 1999-06-09 2003-08-26 The Boeing Company Method and system for text mining using multidimensional subspaces
US6388592B1 (en) * 2001-01-18 2002-05-14 International Business Machines Corporation Using simulated pseudo data to speed up statistical predictive modeling from massive data sets

Also Published As

Publication number Publication date
EP1233349A3 (fr) 2004-10-13
JP2002245070A (ja) 2002-08-30
US6738786B2 (en) 2004-05-18
US20020116398A1 (en) 2002-08-22
CA2348420A1 (fr) 2002-08-20
EP1233349A2 (fr) 2002-08-21

Similar Documents

Publication Publication Date Title
CA2348420C (fr) Method and apparatus for displaying data for text analysis
US7971150B2 (en) Document categorisation system
JPH11110416A (ja) Method and apparatus for retrieving documents from a database
JP5086799B2 (ja) Question answering method, apparatus, program, and recording medium storing the program
US20030233350A1 (en) System and method for electronic catalog classification using a hybrid of rule based and statistical method
WO1999034307A1 (fr) Extraction server
CN111401045A (zh) Text generation method, apparatus, storage medium, and electronic device
US20040128292A1 (en) Search data management
WO2007113585A1 (fr) Methods and systems for indexing and retrieving documents
JP3198932B2 (ja) Document retrieval device
Ghanem et al. Stemming effectiveness in clustering of Arabic documents
KR102150560B1 (ko) Apparatus and method for target analysis using topics
Torrisi et al. Automated bundle pagination using machine learning
WO2000026839A9 (fr) Advanced model for automatically extracting skill and knowledge information from an electronic document
Mahdi et al. A citation-based approach to automatic topical indexing of scientific literature
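JPH11120183A (ja) Keyword extraction method and apparatus
JP2000172691A (ja) Information mining method, information mining device, and computer-readable recording medium storing an information mining program
TWI396990B (zh) Citation record extraction system, method, and program product
JPH11143902A (ja) Similar document retrieval method using n-grams
JP2002288189A (ja) Document classification method, document classification device, and recording medium storing a document classification processing program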
Waegel The Development of Text-Mining Tools and Algorithms
Dandapat et al. Statistical investigation of Bengali noun-verb (NV) collocations as multi-word-expressions
KR101088483B1 (ko) Method and apparatus for mapping heterogeneous classification systems
Stubbs The ARL Library Index and Quantitative Relationships in the ARL.
Svatek et al. URL as starting point for WWW document categorization.

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed