WO2009029924A3 - Indexing role hierarchies for words in a search index - Google Patents
Indexing role hierarchies for words in a search index Download PDFInfo
- Publication number
- WO2009029924A3 WO2009029924A3 PCT/US2008/074987 US2008074987W WO2009029924A3 WO 2009029924 A3 WO2009029924 A3 WO 2009029924A3 US 2008074987 W US2008074987 W US 2008074987W WO 2009029924 A3 WO2009029924 A3 WO 2009029924A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- words
- role
- query
- document
- documents
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/313—Selection or weighting of terms for indexing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
Abstract
Methods, systems and computer readable media for finding documents in a data store that match a natural language query submitted by a user are provided. The documents and queries are matched by determining that words within the query have the same relationship to each other as the same words in the document. Documents are semantically analyzed and words in the document are indexed along with the role the word plays in a sentence. The initial semantic role may be generalized using a role hierarchy and stored in the index along with the original role. A similar analysis may be used with the search query to find words used in the same role in both the query and the document.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP08799057.8A EP2181403B1 (en) | 2007-08-31 | 2008-09-02 | Indexing role hierarchies for words in a search index |
CN200880105548A CN101796510A (en) | 2007-08-31 | 2008-09-02 | Indexing role hierarchies for words in a search index |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US96949007P | 2007-08-31 | 2007-08-31 | |
US60/969,490 | 2007-08-31 | ||
US12/201,721 US8229730B2 (en) | 2007-08-31 | 2008-08-29 | Indexing role hierarchies for words in a search index |
US12/201,721 | 2008-08-29 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2009029924A2 WO2009029924A2 (en) | 2009-03-05 |
WO2009029924A3 true WO2009029924A3 (en) | 2009-05-14 |
Family
ID=42062053
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2008/074987 WO2009029924A2 (en) | 2007-08-31 | 2008-09-02 | Indexing role hierarchies for words in a search index |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP2181403B1 (en) |
CN (1) | CN101796510A (en) |
WO (1) | WO2009029924A2 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112434127B (en) * | 2020-11-03 | 2023-10-17 | 咪咕文化科技有限公司 | Text information searching method, apparatus and readable storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6246977B1 (en) * | 1997-03-07 | 2001-06-12 | Microsoft Corporation | Information retrieval utilizing semantic representation of text and based on constrained expansion of query words |
KR100546743B1 (en) * | 2003-10-02 | 2006-01-26 | 한국전자통신연구원 | Method for automatically creating a question and indexing the question-answer by language-analysis and the question-answering method and system |
US7171349B1 (en) * | 2000-08-11 | 2007-01-30 | Attensity Corporation | Relational text index creation and searching |
US20070073533A1 (en) * | 2005-09-23 | 2007-03-29 | Fuji Xerox Co., Ltd. | Systems and methods for structural indexing of natural language text |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7526425B2 (en) | 2001-08-14 | 2009-04-28 | Evri Inc. | Method and system for extending keyword searching to syntactically and semantically annotated data |
-
2008
- 2008-09-02 CN CN200880105548A patent/CN101796510A/en active Pending
- 2008-09-02 EP EP08799057.8A patent/EP2181403B1/en active Active
- 2008-09-02 WO PCT/US2008/074987 patent/WO2009029924A2/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6246977B1 (en) * | 1997-03-07 | 2001-06-12 | Microsoft Corporation | Information retrieval utilizing semantic representation of text and based on constrained expansion of query words |
US7171349B1 (en) * | 2000-08-11 | 2007-01-30 | Attensity Corporation | Relational text index creation and searching |
KR100546743B1 (en) * | 2003-10-02 | 2006-01-26 | 한국전자통신연구원 | Method for automatically creating a question and indexing the question-answer by language-analysis and the question-answering method and system |
US20070073533A1 (en) * | 2005-09-23 | 2007-03-29 | Fuji Xerox Co., Ltd. | Systems and methods for structural indexing of natural language text |
Non-Patent Citations (1)
Title |
---|
DONG-IL HAN ET AL.: "A Study on the Conceptual Modeling and Implementation of a Semantic Search System", KOREA INTELLIGENT INFORMATION SYSTEMS SOCIETY, vol. 14, no. 1, March 2008 (2008-03-01), pages 67 - 84, XP055100125 * |
Also Published As
Publication number | Publication date |
---|---|
EP2181403A4 (en) | 2017-11-22 |
EP2181403B1 (en) | 2022-05-11 |
EP2181403A2 (en) | 2010-05-05 |
WO2009029924A2 (en) | 2009-03-05 |
CN101796510A (en) | 2010-08-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2007064887A3 (en) | Methods and systems for optimizing text searches over structured data in a multi-tenant environment | |
WO2007008263A3 (en) | Self-organized concept search and data storage method | |
WO2006086179A3 (en) | Method and system for semantic search and retrieval of electronic documents | |
WO2007019311A3 (en) | Systems for and methods of finding relevant documents by analyzing tags | |
WO2008031062A3 (en) | System and method for building and retriving a full text index | |
WO2007143666A3 (en) | Element query method and system | |
WO2007047971A3 (en) | Real time query trends with multi-document summarization | |
WO2009152370A3 (en) | Searching using patterns of usage | |
WO2007016440A3 (en) | Carousel control for metadata navigation and assignment | |
WO2008051750A3 (en) | Associating geographic-related information with objects | |
WO2008092018A3 (en) | Cross-lingual information retrieval | |
NO20053640D0 (en) | Phrase-based browsing in an information retrieval system | |
WO2007021842A3 (en) | Data object search and retrieval | |
WO2007008492A3 (en) | Processing collocation mistakes in documents | |
NO20053637D0 (en) | Phrase-based indexing in an information retrieval system | |
WO2006081325A3 (en) | Multiple index based information retrieval system | |
WO2012135437A3 (en) | Management and storage of distributed bookmarks | |
WO2008066637A3 (en) | Generation of a multidimensional dataset from an associative database | |
BRPI0502063A (en) | Combining multidimensional expressions and data mining extensions to explore olap cubes | |
JP2010538375A5 (en) | ||
WO2010062737A3 (en) | Retrieval using a generalized sentence collocation | |
CN102081660B (en) | Method for searching and sequencing keywords of XML documents based on semantic correlation | |
GB2463221A (en) | Biological database index and query searching | |
Bast et al. | A case for semantic full-text search | |
GSK et al. | Multilingual document clustering using wikipedia as external knowledge |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 200880105548.5 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 08799057 Country of ref document: EP Kind code of ref document: A2 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2008799057 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |