WO2009029924A3 - Indexing role hierarchies for words in a search index - Google Patents

Indexing role hierarchies for words in a search index Download PDF

Info

Publication number
WO2009029924A3
WO2009029924A3 PCT/US2008/074987 US2008074987W WO2009029924A3 WO 2009029924 A3 WO2009029924 A3 WO 2009029924A3 US 2008074987 W US2008074987 W US 2008074987W WO 2009029924 A3 WO2009029924 A3 WO 2009029924A3
Authority
WO
WIPO (PCT)
Prior art keywords
words
role
query
document
documents
Prior art date
Application number
PCT/US2008/074987
Other languages
French (fr)
Other versions
WO2009029924A2 (en
Inventor
Den Berg Martin H Van
Giovanni L Thione
Chad P Walters
Richard S Crouch
Original Assignee
Powerset Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US12/201,721 external-priority patent/US8229730B2/en
Application filed by Powerset Inc filed Critical Powerset Inc
Priority to EP08799057.8A priority Critical patent/EP2181403B1/en
Priority to CN200880105548A priority patent/CN101796510A/en
Publication of WO2009029924A2 publication Critical patent/WO2009029924A2/en
Publication of WO2009029924A3 publication Critical patent/WO2009029924A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/313Selection or weighting of terms for indexing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Abstract

Methods, systems and computer readable media for finding documents in a data store that match a natural language query submitted by a user are provided. The documents and queries are matched by determining that words within the query have the same relationship to each other as the same words in the document. Documents are semantically analyzed and words in the document are indexed along with the role the word plays in a sentence. The initial semantic role may be generalized using a role hierarchy and stored in the index along with the original role. A similar analysis may be used with the search query to find words used in the same role in both the query and the document.
PCT/US2008/074987 2007-08-31 2008-09-02 Indexing role hierarchies for words in a search index WO2009029924A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP08799057.8A EP2181403B1 (en) 2007-08-31 2008-09-02 Indexing role hierarchies for words in a search index
CN200880105548A CN101796510A (en) 2007-08-31 2008-09-02 Indexing role hierarchies for words in a search index

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US96949007P 2007-08-31 2007-08-31
US60/969,490 2007-08-31
US12/201,721 US8229730B2 (en) 2007-08-31 2008-08-29 Indexing role hierarchies for words in a search index
US12/201,721 2008-08-29

Publications (2)

Publication Number Publication Date
WO2009029924A2 WO2009029924A2 (en) 2009-03-05
WO2009029924A3 true WO2009029924A3 (en) 2009-05-14

Family

ID=42062053

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2008/074987 WO2009029924A2 (en) 2007-08-31 2008-09-02 Indexing role hierarchies for words in a search index

Country Status (3)

Country Link
EP (1) EP2181403B1 (en)
CN (1) CN101796510A (en)
WO (1) WO2009029924A2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112434127B (en) * 2020-11-03 2023-10-17 咪咕文化科技有限公司 Text information searching method, apparatus and readable storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6246977B1 (en) * 1997-03-07 2001-06-12 Microsoft Corporation Information retrieval utilizing semantic representation of text and based on constrained expansion of query words
KR100546743B1 (en) * 2003-10-02 2006-01-26 한국전자통신연구원 Method for automatically creating a question and indexing the question-answer by language-analysis and the question-answering method and system
US7171349B1 (en) * 2000-08-11 2007-01-30 Attensity Corporation Relational text index creation and searching
US20070073533A1 (en) * 2005-09-23 2007-03-29 Fuji Xerox Co., Ltd. Systems and methods for structural indexing of natural language text

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7526425B2 (en) 2001-08-14 2009-04-28 Evri Inc. Method and system for extending keyword searching to syntactically and semantically annotated data

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6246977B1 (en) * 1997-03-07 2001-06-12 Microsoft Corporation Information retrieval utilizing semantic representation of text and based on constrained expansion of query words
US7171349B1 (en) * 2000-08-11 2007-01-30 Attensity Corporation Relational text index creation and searching
KR100546743B1 (en) * 2003-10-02 2006-01-26 한국전자통신연구원 Method for automatically creating a question and indexing the question-answer by language-analysis and the question-answering method and system
US20070073533A1 (en) * 2005-09-23 2007-03-29 Fuji Xerox Co., Ltd. Systems and methods for structural indexing of natural language text

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
DONG-IL HAN ET AL.: "A Study on the Conceptual Modeling and Implementation of a Semantic Search System", KOREA INTELLIGENT INFORMATION SYSTEMS SOCIETY, vol. 14, no. 1, March 2008 (2008-03-01), pages 67 - 84, XP055100125 *

Also Published As

Publication number Publication date
EP2181403A4 (en) 2017-11-22
EP2181403B1 (en) 2022-05-11
EP2181403A2 (en) 2010-05-05
WO2009029924A2 (en) 2009-03-05
CN101796510A (en) 2010-08-04

Similar Documents

Publication Publication Date Title
WO2007064887A3 (en) Methods and systems for optimizing text searches over structured data in a multi-tenant environment
WO2007008263A3 (en) Self-organized concept search and data storage method
WO2006086179A3 (en) Method and system for semantic search and retrieval of electronic documents
WO2007019311A3 (en) Systems for and methods of finding relevant documents by analyzing tags
WO2008031062A3 (en) System and method for building and retriving a full text index
WO2007143666A3 (en) Element query method and system
WO2007047971A3 (en) Real time query trends with multi-document summarization
WO2009152370A3 (en) Searching using patterns of usage
WO2007016440A3 (en) Carousel control for metadata navigation and assignment
WO2008051750A3 (en) Associating geographic-related information with objects
WO2008092018A3 (en) Cross-lingual information retrieval
NO20053640D0 (en) Phrase-based browsing in an information retrieval system
WO2007021842A3 (en) Data object search and retrieval
WO2007008492A3 (en) Processing collocation mistakes in documents
NO20053637D0 (en) Phrase-based indexing in an information retrieval system
WO2006081325A3 (en) Multiple index based information retrieval system
WO2012135437A3 (en) Management and storage of distributed bookmarks
WO2008066637A3 (en) Generation of a multidimensional dataset from an associative database
BRPI0502063A (en) Combining multidimensional expressions and data mining extensions to explore olap cubes
JP2010538375A5 (en)
WO2010062737A3 (en) Retrieval using a generalized sentence collocation
CN102081660B (en) Method for searching and sequencing keywords of XML documents based on semantic correlation
GB2463221A (en) Biological database index and query searching
Bast et al. A case for semantic full-text search
GSK et al. Multilingual document clustering using wikipedia as external knowledge

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200880105548.5

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08799057

Country of ref document: EP

Kind code of ref document: A2

WWE Wipo information: entry into national phase

Ref document number: 2008799057

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE