WO2007149623A3 - Full text query and search systems and method of use - Google Patents

Full text query and search systems and method of use

Info

Publication number
WO2007149623A3
Authority
WO
WIPO (PCT)
Prior art keywords
information
measure
itoms
hits
shared
Application number
PCT/US2007/067439
Other languages
French (fr)
Other versions
WO2007149623A2 (en)
Inventor
Yuanhua Tom Tang
Qianjin Hu
Yonghong Grace Yang
Chunnuan Chen
Minghua Mei
Original Assignee
Infovell Inc
Yuanhua Tom Tang
Qianjin Hu
Yonghong Grace Yang
Chunnuan Chen
Minghua Mei
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2006-04-25
Filing date
2007-04-25
Publication date
2009-02-12
Application filed by Infovell Inc, Yuanhua Tom Tang, Qianjin Hu, Yonghong Grace Yang, Chunnuan Chen, Minghua Mei
Priority to EP07761298A (EP2013788A4)
Publication of WO2007149623A2
Publication of WO2007149623A3

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30: Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33: Querying
    • G06F16/3331: Query processing
    • G06F16/334: Query execution

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Roughly described, a database searching method in which hits are ranked in dependence upon an information measure of itoms shared by both the hit and the query. The information measure can be a Shannon information score, or another measure which indicates the information value of the shared itoms. An itom can be a word or other token, or a multi-word phrase, and itoms can overlap with each other. Synonyms can be substituted for itoms in the query, with the information measure of substituted itoms being derated in accordance with a predetermined measure of the synonyms' similarity. Indirect searching methods are described in which hits from other search engines are re-ranked in dependence upon the information measures of shared itoms. Structured and completely unstructured databases may be searched, with hits being demarcated dynamically. Hits may be clustered based upon distances in an information-measure-weighted distance space.
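
The ranking idea in the abstract can be illustrated with a minimal Python sketch. It is not the applicants' implementation: it assumes that an itom is a single lowercase whitespace-separated token, that an itom's Shannon information is -log2 of its relative frequency in the corpus, and that a synonym match contributes the synonym's information multiplied by a similarity value between 0 and 1. The function names (itom_information, score_hit, rank) and the synonym table layout are hypothetical.

```python
import math
from collections import Counter


def itom_information(corpus):
    """Estimate a Shannon information score, -log2(p), for every itom.

    Assumption: an itom is a lowercase whitespace-separated token and p is
    its relative frequency over all tokens in the corpus.
    """
    counts = Counter(tok for doc in corpus for tok in doc.lower().split())
    total = sum(counts.values())
    return {itom: -math.log2(n / total) for itom, n in counts.items()}


def score_hit(query, hit, info, synonyms=None):
    """Sum the information of itoms shared by the query and the hit.

    synonyms maps a query itom to {synonym: similarity in [0, 1]}
    (hypothetical layout). When the query itom itself is absent from the
    hit, the best matching synonym contributes its own information
    multiplied by its similarity, i.e. a derated score.
    """
    synonyms = synonyms or {}
    hit_itoms = set(hit.lower().split())
    score = 0.0
    for itom in set(query.lower().split()):
        if itom in hit_itoms:
            score += info.get(itom, 0.0)
        else:
            candidates = synonyms.get(itom, {})
            score += max(
                (sim * info.get(syn, 0.0)
                 for syn, sim in candidates.items() if syn in hit_itoms),
                default=0.0,
            )
    return score


def rank(query, corpus, synonyms=None):
    """Return (score, document) pairs, highest shared information first."""
    info = itom_information(corpus)
    scored = [(score_hit(query, doc, info, synonyms), doc) for doc in corpus]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


corpus = ["the cat sat", "the dog sat", "the rare aardvark"]
ranking = rank("the aardvark", corpus)
```

In this toy corpus the rare itom "aardvark" carries more information than the common itom "the", so the document containing both is ranked first even though every document shares "the" with the query.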

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP07761298A EP2013788A4 (en) 2006-04-25 2007-04-25 Full text query and search systems and method of use

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US74560506P 2006-04-25 2006-04-25
US74560406P 2006-04-25 2006-04-25
US60/745,605 2006-04-25
US60/745,604 2006-04-25

Publications (2)

Publication Number Publication Date
WO2007149623A2 (en) 2007-12-27
WO2007149623A3 (en) 2009-02-12

Family

ID=38834185

Family Applications (1)

Application Number Priority Date Filing Date Title
PCT/US2007/067439 WO2007149623A2 (en) 2006-04-25 2007-04-25 Full text query and search systems and method of use

Country Status (2)

Country Link
EP (1) EP2013788A4 (en)
WO (1) WO2007149623A2 (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9348912B2 (en) 2007-10-18 2016-05-24 Microsoft Technology Licensing, Llc Document length as a static relevance feature for ranking search results
US8364679B2 (en) 2009-09-17 2013-01-29 Cpa Global Patent Research Limited Method, system, and apparatus for delivering query results from an electronic document collection
TWI486797B (en) * 2010-03-09 2015-06-01 Alibaba Group Holding Ltd Methods and devices for sorting search results
US9495462B2 (en) 2012-01-27 2016-11-15 Microsoft Technology Licensing, Llc Re-ranking search results
US10692015B2 (en) * 2016-07-15 2020-06-23 Io-Tahoe Llc Primary key-foreign key relationship determination through machine learning
CN106789895B (en) * 2016-11-18 2020-03-27 东软集团股份有限公司 Compressed text detection method and device
US11604841B2 (en) 2017-12-20 2023-03-14 International Business Machines Corporation Mechanistic mathematical model search engine
US10394555B1 (en) 2018-12-17 2019-08-27 Bakhtgerey Sinchev Computing network architecture for reducing a computing operation time and memory usage associated with determining, from a set of data elements, a subset of at least two data elements, associated with a target computing operation result
CN110413734B (en) * 2019-07-25 2023-02-17 万达信息股份有限公司 Intelligent search system and method for medical service
CN111079036B (en) * 2019-11-25 2023-11-07 罗靖涛 Field type searching method
CN111222040B (en) * 2019-12-30 2023-06-13 航天信息股份有限公司企业服务分公司 Scheme self-matching processing method and system based on training requirements
US11900272B2 (en) * 2020-05-13 2024-02-13 Factset Research System Inc. Method and system for mapping labels in standardized tables using machine learning
CN113327572B (en) * 2021-06-02 2024-02-09 清华大学深圳国际研究生院 Controllable emotion voice synthesis method and system based on emotion type label
US11546142B1 (en) 2021-12-22 2023-01-03 Bakhtgerey Sinchev Cryptography key generation method for encryption and decryption
CN116595973B (en) * 2023-05-19 2023-10-03 广东职教桥数据科技有限公司 Post function identification method based on natural language processing classification technology

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5761497A (en) * 1993-11-22 1998-06-02 Reed Elsevier, Inc. Associative text search and retrieval system that calculates ranking scores and window scores
US5812998A (en) * 1993-09-30 1998-09-22 Omron Corporation Similarity searching of sub-structured databases
US20020111941A1 (en) * 2000-12-19 2002-08-15 Xerox Corporation Apparatus and method for information retrieval
US6633817B1 (en) * 1999-12-29 2003-10-14 Incyte Genomics, Inc. Sequence database search with sequence search trees
US20040024583A1 (en) * 2000-03-20 2004-02-05 Freeman Robert J Natural-language processing system using a large corpus
US20060026147A1 (en) * 2004-07-30 2006-02-02 Cone Julian M Adaptive search engine

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060212441A1 (en) * 2004-10-25 2006-09-21 Yuanhua Tang Full text query and search systems and methods of use

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP2013788A4 *

Also Published As

Publication number Publication date
WO2007149623A2 (en) 2007-12-27
EP2013788A4 (en) 2012-04-25
EP2013788A2 (en) 2009-01-14

Similar Documents

Publication Publication Date Title
WO2007149623A3 (en) Full text query and search systems and method of use
Zhang et al. Entity linking leveraging automatically generated annotation
WO2006047654A3 (en) Full text query and search systems and methods of use
WO2005010691A3 (en) Disambiguation of search phrases using interpretation clusters
WO2008009017A3 (en) Method and system for qualifying keywords in query strings
NZ578672A (en) Information-retrieval systems, methods, and software with concept-based searching and ranking
WO2005017682A3 (en) Product placement engine and method
WO2006118814A3 (en) Method for finding semantically related search engine queries
WO2005032235A3 (en) Increasing a number of relevant advertisements using a relaxed match
WO2007038713A3 (en) Search engine determining results based on probabilistic scoring of relevance
WO2008073502A3 (en) Viewport-relative scoring for location search queries
WO2007130716A3 (en) Methods and apparatus for computerized searching
BRPI0501320A (en) Suggested Related Terms for a Multisense Query
WO2007101194A3 (en) System and method for identifying related queries for languages with multiple writing systems
WO2008051750A3 (en) Associating geographic-related information with objects
WO2007016232A3 (en) Processor for fast phrase searching
WO2008058146A3 (en) Method and system for generating scored recommendations based on scored references
WO2002089004A3 (en) Search data management
Crimp et al. Refining query expansion terms using query context
Vechtomova Using Subjective Adjectives in Opinion Retrieval from Blogs.
van Engers Thesaurus-based retrieval of case law
US20180101606A1 (en) Method and system for searching for relevant items in a collection of documents given user defined documents
Wood et al. Orthogonal query recommendations for children
Xu et al. Using multiple features and statistical model to calculate text units similarity
Selvi et al. An approach to improve precision and recall for ad-hoc information retrieval using sbir algorithm

Legal Events

Date Code Title Description
WWE WIPO information: entry into national phase
Ref document number: 200780023220.4
Country of ref document: CN

121 EP: the EPO has been informed by WIPO that EP was designated in this application
Ref document number: 07761298
Country of ref document: EP
Kind code of ref document: A2

WWE WIPO information: entry into national phase
Ref document number: 2007761298
Country of ref document: EP

NENP Non-entry into the national phase
Ref country code: DE