ATE450012T1 - Computerunterstütztes abrufen von dokumenten - Google Patents

Computerunterstütztes abrufen von dokumenten

Info

Publication number
ATE450012T1
ATE450012T1 AT04787046T AT04787046T ATE450012T1 AT E450012 T1 ATE450012 T1 AT E450012T1 AT 04787046 T AT04787046 T AT 04787046T AT 04787046 T AT04787046 T AT 04787046T AT E450012 T1 ATE450012 T1 AT E450012T1
Authority
AT
Austria
Prior art keywords
term
computer
documents
document retrieval
probability distribution
Prior art date
Application number
AT04787046T
Other languages
English (en)
Inventor
David Patterson
Vladimir Dobrynin
Original Assignee
Univ Ulster
St Petersburg State University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Univ Ulster, St Petersburg State University filed Critical Univ Ulster
Application granted granted Critical
Publication of ATE450012T1 publication Critical patent/ATE450012T1/de

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3346Query execution using probabilistic model
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • Y10S707/99935Query augmenting and refining, e.g. inexact access

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • General Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)
  • Silicon Polymers (AREA)
  • Transition And Organic Metals Composition Catalysts For Addition Polymerization (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
AT04787046T 2003-09-26 2004-09-27 Computerunterstütztes abrufen von dokumenten ATE450012T1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GBGB0322600.8A GB0322600D0 (en) 2003-09-26 2003-09-26 Thematic retrieval in heterogeneous data repositories
PCT/EP2004/010877 WO2005031600A2 (en) 2003-09-26 2004-09-27 Computer aided document retrieval

Publications (1)

Publication Number Publication Date
ATE450012T1 true ATE450012T1 (de) 2009-12-15

Family

ID=29286916

Family Applications (1)

Application Number Title Priority Date Filing Date
AT04787046T ATE450012T1 (de) 2003-09-26 2004-09-27 Computerunterstütztes abrufen von dokumenten

Country Status (11)

Country Link
US (1) US7747593B2 (de)
EP (1) EP1673704B1 (de)
AT (1) ATE450012T1 (de)
AU (1) AU2004276906B2 (de)
CA (1) CA2540241C (de)
DE (1) DE602004024324D1 (de)
DK (1) DK1673704T3 (de)
ES (1) ES2336678T3 (de)
GB (1) GB0322600D0 (de)
NZ (1) NZ546763A (de)
WO (1) WO2005031600A2 (de)

Families Citing this family (48)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1704957A (zh) * 2004-05-28 2005-12-07 国际商业机器公司 动态组装业务流程模型的装置和方法
EP1825395A4 (de) * 2004-10-25 2010-07-07 Yuanhua Tang Volltextanfrage- und -suchsysteme und benutzungsverfahren
US20080077570A1 (en) * 2004-10-25 2008-03-27 Infovell, Inc. Full Text Query and Search Systems and Method of Use
BRPI0616809B1 (pt) * 2005-10-04 2018-10-23 Thomson Global Resources sistemas, métodos e software para determinar ambigüidade de termos médicos
US7529740B2 (en) * 2006-08-14 2009-05-05 International Business Machines Corporation Method and apparatus for organizing data sources
CN100557608C (zh) * 2006-11-14 2009-11-04 株式会社理光 基于文档非内容特征的查询结果优化方法及装置
US20080201292A1 (en) * 2007-02-20 2008-08-21 Integrated Device Technology, Inc. Method and apparatus for preserving control information embedded in digital data
US8478747B2 (en) * 2008-06-05 2013-07-02 Samsung Electronics Co., Ltd. Situation-dependent recommendation based on clustering
US8671104B2 (en) 2007-10-12 2014-03-11 Palo Alto Research Center Incorporated System and method for providing orientation into digital information
US8165985B2 (en) 2007-10-12 2012-04-24 Palo Alto Research Center Incorporated System and method for performing discovery of digital information in a subject area
US8073682B2 (en) * 2007-10-12 2011-12-06 Palo Alto Research Center Incorporated System and method for prospecting digital information
US20090132236A1 (en) * 2007-11-16 2009-05-21 Iac Search & Media, Inc. Selection or reliable key words from unreliable sources in a system and method for conducting a search
US8316041B1 (en) 2007-11-28 2012-11-20 Adobe Systems Incorporated Generation and processing of numerical identifiers
US7849081B1 (en) * 2007-11-28 2010-12-07 Adobe Systems Incorporated Document analyzer and metadata generation and use
US8090724B1 (en) * 2007-11-28 2012-01-03 Adobe Systems Incorporated Document analysis and multi-word term detector
US7831588B2 (en) * 2008-02-05 2010-11-09 Yahoo! Inc. Context-sensitive query expansion
US7958136B1 (en) 2008-03-18 2011-06-07 Google Inc. Systems and methods for identifying similar documents
US7979426B2 (en) * 2008-06-05 2011-07-12 Samsung Electronics Co., Ltd. Clustering-based interest computation
US8285719B1 (en) 2008-08-08 2012-10-09 The Research Foundation Of State University Of New York System and method for probabilistic relational clustering
US20100057536A1 (en) * 2008-08-28 2010-03-04 Palo Alto Research Center Incorporated System And Method For Providing Community-Based Advertising Term Disambiguation
US8010545B2 (en) * 2008-08-28 2011-08-30 Palo Alto Research Center Incorporated System and method for providing a topic-directed search
US8209616B2 (en) * 2008-08-28 2012-06-26 Palo Alto Research Center Incorporated System and method for interfacing a web browser widget with social indexing
US20100057577A1 (en) * 2008-08-28 2010-03-04 Palo Alto Research Center Incorporated System And Method For Providing Topic-Guided Broadening Of Advertising Targets In Social Indexing
US8560298B2 (en) * 2008-10-21 2013-10-15 Microsoft Corporation Named entity transliteration using comparable CORPRA
US8549016B2 (en) 2008-11-14 2013-10-01 Palo Alto Research Center Incorporated System and method for providing robust topic identification in social indexes
US8356044B2 (en) * 2009-01-27 2013-01-15 Palo Alto Research Center Incorporated System and method for providing default hierarchical training for social indexing
US8239397B2 (en) * 2009-01-27 2012-08-07 Palo Alto Research Center Incorporated System and method for managing user attention by detecting hot and cold topics in social indexes
US8452781B2 (en) * 2009-01-27 2013-05-28 Palo Alto Research Center Incorporated System and method for using banded topic relevance and time for article prioritization
US7953679B2 (en) * 2009-07-22 2011-05-31 Xerox Corporation Scalable indexing for layout based document retrieval and ranking
US9355171B2 (en) * 2009-10-09 2016-05-31 Hewlett Packard Enterprise Development Lp Clustering of near-duplicate documents
CA2789010C (en) * 2010-02-05 2013-10-22 Fti Technology Llc Propagating classification decisions
US9031944B2 (en) 2010-04-30 2015-05-12 Palo Alto Research Center Incorporated System and method for providing multi-core and multi-level topical organization in social indexes
US8639773B2 (en) * 2010-06-17 2014-01-28 Microsoft Corporation Discrepancy detection for web crawling
US8645298B2 (en) 2010-10-26 2014-02-04 Microsoft Corporation Topic models
WO2013049864A1 (en) * 2011-09-30 2013-04-04 Willem Morkel Van Der Westhuizen Method for human-computer interaction on a graphical user interface (gui)
US8572089B2 (en) * 2011-12-15 2013-10-29 Business Objects Software Ltd. Entity clustering via data services
US20130253910A1 (en) * 2012-03-23 2013-09-26 Sententia, LLC Systems and Methods for Analyzing Digital Communications
US9336302B1 (en) * 2012-07-20 2016-05-10 Zuci Realty Llc Insight and algorithmic clustering for automated synthesis
US9483463B2 (en) * 2012-09-10 2016-11-01 Xerox Corporation Method and system for motif extraction in electronic documents
RU2583739C2 (ru) 2013-10-16 2016-05-10 Общество С Ограниченной Ответственностью "Яндекс" Сервер для определения поисковой выдачи на поисковый запрос и электронное устройство
US8837835B1 (en) * 2014-01-20 2014-09-16 Array Technology, LLC Document grouping system
US20150220680A1 (en) * 2014-01-31 2015-08-06 International Business Machines Corporation Inferring biological pathways from unstructured text analysis
US9959364B2 (en) * 2014-05-22 2018-05-01 Oath Inc. Content recommendations
US10657186B2 (en) * 2015-05-29 2020-05-19 Dell Products, L.P. System and method for automatic document classification and grouping based on document topic
US10698908B2 (en) * 2016-07-12 2020-06-30 International Business Machines Corporation Multi-field search query ranking using scoring statistics
US11397558B2 (en) 2017-05-18 2022-07-26 Peloton Interactive, Inc. Optimizing display engagement in action automation
EP3779733A1 (de) 2019-08-12 2021-02-17 Universität Bern Informationsabrufverfahren
CN113569012B (zh) * 2021-07-28 2023-12-26 卫宁健康科技集团股份有限公司 医疗数据查询方法、装置、设备及存储介质

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4839853A (en) * 1988-09-15 1989-06-13 Bell Communications Research, Inc. Computer information retrieval using latent semantic structure
US5442778A (en) * 1991-11-12 1995-08-15 Xerox Corporation Scatter-gather: a cluster-based method and apparatus for browsing large document collections
US5787422A (en) * 1996-01-11 1998-07-28 Xerox Corporation Method and apparatus for information accesss employing overlapping clusters
US5839106A (en) * 1996-12-17 1998-11-17 Apple Computer, Inc. Large-vocabulary speech recognition using an integrated syntactic and semantic statistical language model
JP2940501B2 (ja) * 1996-12-25 1999-08-25 日本電気株式会社 ドキュメント分類装置及び方法
US6128613A (en) * 1997-06-26 2000-10-03 The Chinese University Of Hong Kong Method and apparatus for establishing topic word classes based on an entropy cost function to retrieve documents represented by the topic words
US6564197B2 (en) * 1999-05-03 2003-05-13 E.Piphany, Inc. Method and apparatus for scalable probabilistic clustering using decision trees
US6757646B2 (en) * 2000-03-22 2004-06-29 Insightful Corporation Extended functionality for an inverse inference engine based web search
US6584456B1 (en) * 2000-06-19 2003-06-24 International Business Machines Corporation Model selection in machine learning with applications to document clustering
US6687696B2 (en) * 2000-07-26 2004-02-03 Recommind Inc. System and method for personalized search, information filtering, and for generating recommendations utilizing statistical latent class models
KR100426382B1 (ko) * 2000-08-23 2004-04-08 학교법인 김포대학 엔트로피 정보와 베이지안 에스오엠을 이용한 문서군집기반의 순위조정 방법
US6993535B2 (en) * 2001-06-18 2006-01-31 International Business Machines Corporation Business method and apparatus for employing induced multimedia classifiers based on unified representation of features reflecting disparate modalities
US20030120630A1 (en) * 2001-12-20 2003-06-26 Daniel Tunkelang Method and system for similarity search and clustering
US7174343B2 (en) * 2002-05-10 2007-02-06 Oracle International Corporation In-database clustering
US7080063B2 (en) * 2002-05-10 2006-07-18 Oracle International Corporation Probabilistic model generation
US7590642B2 (en) * 2002-05-10 2009-09-15 Oracle International Corp. Enhanced K-means clustering
DE10237310B4 (de) * 2002-08-14 2006-11-30 Wismüller, Axel, Dipl.-Phys. Dr.med. Verfahren, Datenverarbeitungseinrichtung und Computerprogrammprodukt zur Datenverarbeitung
US7383258B2 (en) * 2002-10-03 2008-06-03 Google, Inc. Method and apparatus for characterizing documents based on clusters of related words
US7231393B1 (en) * 2003-09-30 2007-06-12 Google, Inc. Method and apparatus for learning a probabilistic generative model for text
US7454428B2 (en) * 2003-10-29 2008-11-18 Oracle International Corp. Network data model for relational database management system

Also Published As

Publication number Publication date
EP1673704B1 (de) 2009-11-25
AU2004276906B2 (en) 2010-03-04
US20070174267A1 (en) 2007-07-26
ES2336678T3 (es) 2010-04-15
WO2005031600A3 (en) 2005-07-21
NZ546763A (en) 2008-03-28
AU2004276906A1 (en) 2005-04-07
WO2005031600A2 (en) 2005-04-07
DE602004024324D1 (de) 2010-01-07
EP1673704A2 (de) 2006-06-28
DK1673704T3 (da) 2010-04-12
CA2540241A1 (en) 2005-04-07
GB0322600D0 (en) 2003-10-29
CA2540241C (en) 2013-09-17
US7747593B2 (en) 2010-06-29

Similar Documents

Publication Publication Date Title
ATE450012T1 (de) Computerunterstütztes abrufen von dokumenten
Yin et al. Efficiently mining top-k high utility sequential patterns
WO2004063863A3 (en) Document management apparatus, system and method
WO2007005975A3 (en) Risk modeling system
IL175956A0 (en) Method for indexing and identifying multimedia documents
WO2011034502A8 (en) Textual query based multimedia retrieval system
CN103530284A (zh) 短句切分装置、机器翻译系统及对应切分方法和翻译方法
CN103235812B (zh) 查询多意图识别方法和系统
WO2009129425A3 (en) Forum web page clustering based on repetitive regions
WO2007059232A3 (en) Methods and apparatus for probe-based clustering
CN104519323A (zh) 一种人车目标分类系统和方法
CN102622353B (zh) 一种固定音频检索方法
CA2912019C (en) Systems and methods for generating issue networks
CN104731811A (zh) 一种面向大规模动态短文本的聚类信息演化分析方法
ATE418106T1 (de) Generisches suchverfahren für verschiedenen objekttypen
CN103714178A (zh) 一种基于词间相关性的图像自动标注方法
CN103530344A (zh) 一种基于改进的tf-idf方法的检索词实时修正方法
CN103514214B (zh) 数据查询方法及装置
Mirabi et al. PS+ Pre/Post: A novel structure and access mechanism for wireless XML stream supporting twig pattern queries
CN110309139B (zh) 高维近邻对搜索方法和系统
Masood et al. Load balance: Energy efficient routing protocol in wireless sensor network
CN105389359A (zh) 搜索方法及系统
Pliakos et al. Tree based feature induction for biomedical data
Parapar et al. Compression-based document length prior for language models
Fan et al. A diverse niche radii niching technique for multimodal function optimization

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties