WO2007149623A3 - Full text query and search systems and method of use - Google Patents
Full text query and search systems and method of use Download PDFInfo
- Publication number
- WO2007149623A3 WO2007149623A3 PCT/US2007/067439 US2007067439W WO2007149623A3 WO 2007149623 A3 WO2007149623 A3 WO 2007149623A3 US 2007067439 W US2007067439 W US 2007067439W WO 2007149623 A3 WO2007149623 A3 WO 2007149623A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- measure
- itoms
- hits
- shared
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Roughly described, a database searching method for searching a database, in which hits are ranked in dependence upon an information measure of itoms shared by both the hit and the query. The information measure can be a Shannon information score, or another measure which indicates the information value of the shared itoms. An itom can be a word or other token, or a multi-word phrase, and can overlap with each other. Synonyms can be substituted for itoms in the query, with the information measure of substituted itoms being derated in accordance with a predetermined measure of the synonyms' similarity. Indirect searching methods are described in which hit from other search engines are re-ranked in dependence upon the information measures of shared itoms. Structured and completely unstructured databases may be searched, with hits being demarcated dynamically. Hits may be clustered based upon distances in an information- measure- weighted distance space.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP07761298A EP2013788A4 (en) | 2006-04-25 | 2007-04-25 | Full text query and search systems and method of use |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US74560506P | 2006-04-25 | 2006-04-25 | |
US74560406P | 2006-04-25 | 2006-04-25 | |
US60/745,605 | 2006-04-25 | ||
US60/745,604 | 2006-04-25 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2007149623A2 WO2007149623A2 (en) | 2007-12-27 |
WO2007149623A3 true WO2007149623A3 (en) | 2009-02-12 |
Family
ID=38834185
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2007/067439 WO2007149623A2 (en) | 2006-04-25 | 2007-04-25 | Full text query and search systems and method of use |
Country Status (2)
Country | Link |
---|---|
EP (1) | EP2013788A4 (en) |
WO (1) | WO2007149623A2 (en) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9348912B2 (en) | 2007-10-18 | 2016-05-24 | Microsoft Technology Licensing, Llc | Document length as a static relevance feature for ranking search results |
US8364679B2 (en) | 2009-09-17 | 2013-01-29 | Cpa Global Patent Research Limited | Method, system, and apparatus for delivering query results from an electronic document collection |
TWI486797B (en) * | 2010-03-09 | 2015-06-01 | Alibaba Group Holding Ltd | Methods and devices for sorting search results |
US9495462B2 (en) | 2012-01-27 | 2016-11-15 | Microsoft Technology Licensing, Llc | Re-ranking search results |
US10692015B2 (en) * | 2016-07-15 | 2020-06-23 | Io-Tahoe Llc | Primary key-foreign key relationship determination through machine learning |
CN106789895B (en) * | 2016-11-18 | 2020-03-27 | 东软集团股份有限公司 | Compressed text detection method and device |
US11604841B2 (en) | 2017-12-20 | 2023-03-14 | International Business Machines Corporation | Mechanistic mathematical model search engine |
US10394555B1 (en) | 2018-12-17 | 2019-08-27 | Bakhtgerey Sinchev | Computing network architecture for reducing a computing operation time and memory usage associated with determining, from a set of data elements, a subset of at least two data elements, associated with a target computing operation result |
CN110413734B (en) * | 2019-07-25 | 2023-02-17 | 万达信息股份有限公司 | Intelligent search system and method for medical service |
CN111079036B (en) * | 2019-11-25 | 2023-11-07 | 罗靖涛 | Field type searching method |
CN111222040B (en) * | 2019-12-30 | 2023-06-13 | 航天信息股份有限公司企业服务分公司 | Scheme self-matching processing method and system based on training requirements |
US11900272B2 (en) * | 2020-05-13 | 2024-02-13 | Factset Research System Inc. | Method and system for mapping labels in standardized tables using machine learning |
CN113327572B (en) * | 2021-06-02 | 2024-02-09 | 清华大学深圳国际研究生院 | Controllable emotion voice synthesis method and system based on emotion type label |
US11546142B1 (en) | 2021-12-22 | 2023-01-03 | Bakhtgerey Sinchev | Cryptography key generation method for encryption and decryption |
CN116595973B (en) * | 2023-05-19 | 2023-10-03 | 广东职教桥数据科技有限公司 | Post function identification method based on natural language processing classification technology |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5761497A (en) * | 1993-11-22 | 1998-06-02 | Reed Elsevier, Inc. | Associative text search and retrieval system that calculates ranking scores and window scores |
US5812998A (en) * | 1993-09-30 | 1998-09-22 | Omron Corporation | Similarity searching of sub-structured databases |
US20020111941A1 (en) * | 2000-12-19 | 2002-08-15 | Xerox Corporation | Apparatus and method for information retrieval |
US6633817B1 (en) * | 1999-12-29 | 2003-10-14 | Incyte Genomics, Inc. | Sequence database search with sequence search trees |
US20040024583A1 (en) * | 2000-03-20 | 2004-02-05 | Freeman Robert J | Natural-language processing system using a large corpus |
US20060026147A1 (en) * | 2004-07-30 | 2006-02-02 | Cone Julian M | Adaptive search engine |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060212441A1 (en) * | 2004-10-25 | 2006-09-21 | Yuanhua Tang | Full text query and search systems and methods of use |
-
2007
- 2007-04-25 EP EP07761298A patent/EP2013788A4/en not_active Withdrawn
- 2007-04-25 WO PCT/US2007/067439 patent/WO2007149623A2/en active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5812998A (en) * | 1993-09-30 | 1998-09-22 | Omron Corporation | Similarity searching of sub-structured databases |
US5761497A (en) * | 1993-11-22 | 1998-06-02 | Reed Elsevier, Inc. | Associative text search and retrieval system that calculates ranking scores and window scores |
US6633817B1 (en) * | 1999-12-29 | 2003-10-14 | Incyte Genomics, Inc. | Sequence database search with sequence search trees |
US20040024583A1 (en) * | 2000-03-20 | 2004-02-05 | Freeman Robert J | Natural-language processing system using a large corpus |
US20020111941A1 (en) * | 2000-12-19 | 2002-08-15 | Xerox Corporation | Apparatus and method for information retrieval |
US20060026147A1 (en) * | 2004-07-30 | 2006-02-02 | Cone Julian M | Adaptive search engine |
Non-Patent Citations (1)
Title |
---|
See also references of EP2013788A4 * |
Also Published As
Publication number | Publication date |
---|---|
WO2007149623A2 (en) | 2007-12-27 |
EP2013788A4 (en) | 2012-04-25 |
EP2013788A2 (en) | 2009-01-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2007149623A3 (en) | Full text query and search systems and method of use | |
Zhang et al. | Entity linking leveraging automatically generated annotation | |
WO2006047654A3 (en) | Full text query and search systems and methods of use | |
WO2005010691A3 (en) | Disambiguation of search phrases using interpretation clusters | |
WO2008009017A3 (en) | Method and system for qualifying keywords in query strings | |
NZ578672A (en) | Information-retrieval systems, methods, and software with concept-based searching and ranking | |
WO2005017682A3 (en) | Product placement engine and method | |
WO2006118814A3 (en) | Method for finding semantically related search engine queries | |
WO2005032235A3 (en) | Increasing a number of relevant advertisements using a relaxed match | |
WO2007038713A3 (en) | Search engine determining results based on probabilistic scoring of relevance | |
WO2008073502A3 (en) | Viewport-relative scoring for location search queries | |
WO2007130716A3 (en) | Methods and apparatus for computerized searching | |
BRPI0501320A (en) | Suggested Related Terms for a Multisense Query | |
WO2007101194A3 (en) | System and method for identifying related queries for languages with multiple writing systems | |
WO2008051750A3 (en) | Associating geographic-related information with objects | |
WO2007016232A3 (en) | Processor for fast phrase searching | |
WO2008058146A3 (en) | Method and system for generating scored recommendations based on scored references | |
WO2002089004A3 (en) | Search data management | |
Crimp et al. | Refining query expansion terms using query context | |
Vechtomova | Using Subjective Adjectives in Opinion Retrieval from Blogs. | |
van Engers | Thesaurus-based retrieval of case law | |
US20180101606A1 (en) | Method and system for searching for relevant items in a collection of documents given user defined documents | |
Wood et al. | Orthogonal query recommendations for children | |
Xu et al. | Using multiple features and statistical model to calculate text units similarity | |
Selvi et al. | An approach to improve precision and recall for ad-hoc information retrieval using sbir algorithm |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 200780023220.4 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 07761298 Country of ref document: EP Kind code of ref document: A2 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2007761298 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |