WO2007124385A3 - Processing of query terms - Google Patents

Processing of query terms Download PDF

Info

Publication number
WO2007124385A3
WO2007124385A3 PCT/US2007/067014 US2007067014W WO2007124385A3 WO 2007124385 A3 WO2007124385 A3 WO 2007124385A3 US 2007067014 W US2007067014 W US 2007067014W WO 2007124385 A3 WO2007124385 A3 WO 2007124385A3
Authority
WO
WIPO (PCT)
Prior art keywords
query
synonyms
method includes
another aspect
language
Prior art date
Application number
PCT/US2007/067014
Other languages
French (fr)
Other versions
WO2007124385A2 (en
Inventor
Ruchira S Datta
Fabio Lopiano
Original Assignee
Google Inc
Ruchira S Datta
Fabio Lopiano
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US11/407,860 external-priority patent/US7835903B2/en
Priority claimed from US11/408,245 external-priority patent/US8762358B2/en
Priority claimed from US11/408,243 external-priority patent/US7475063B2/en
Priority claimed from US11/408,242 external-priority patent/US8255376B2/en
Application filed by Google Inc, Ruchira S Datta, Fabio Lopiano filed Critical Google Inc
Priority to CN2007800219021A priority Critical patent/CN101467125B/en
Priority to EP07760955A priority patent/EP2016486A4/en
Publication of WO2007124385A2 publication Critical patent/WO2007124385A2/en
Publication of WO2007124385A3 publication Critical patent/WO2007124385A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation
    • G06F16/3338Query expansion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/247Thesauruses; Synonyms
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/263Language identification

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)

Abstract

Methods, systems, and apparatus, including computer program products, to perform operations relating to processing query terms in a search query presented to a search engine. In one aspect, a method includes determining a query language from the query terms and the language of a user interface. In another aspect, a method includes using the interface language to select one or more mappings and using the mappings to simplify each query term; and applying each simplified query term to a synonyms map to identify possible synonyms with which to augment the search query. In another aspect, a synonyms map is generated from a corpus of documents. In another aspect, a method includes identifying one or more potential synonyms for a query term by looking up simplified query term in a synonyms map, the synonyms map mapping each of a plurality of keys to one or more variants, each variant being a word associated with one or more document languages.
PCT/US2007/067014 2006-04-19 2007-04-19 Processing of query terms WO2007124385A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN2007800219021A CN101467125B (en) 2006-04-19 2007-04-19 Processing of query terms
EP07760955A EP2016486A4 (en) 2006-04-19 2007-04-19 Processing of query terms

Applications Claiming Priority (8)

Application Number Priority Date Filing Date Title
US11/408,245 2006-04-19
US11/407,860 2006-04-19
US11/408,242 2006-04-19
US11/407,860 US7835903B2 (en) 2006-04-19 2006-04-19 Simplifying query terms with transliteration
US11/408,245 US8762358B2 (en) 2006-04-19 2006-04-19 Query language determination using query terms and interface language
US11/408,243 US7475063B2 (en) 2006-04-19 2006-04-19 Augmenting queries with synonyms selected using language statistics
US11/408,242 US8255376B2 (en) 2006-04-19 2006-04-19 Augmenting queries with synonyms from synonyms map
US11/408,243 2006-04-19

Publications (2)

Publication Number Publication Date
WO2007124385A2 WO2007124385A2 (en) 2007-11-01
WO2007124385A3 true WO2007124385A3 (en) 2008-08-28

Family

ID=38625747

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2007/067014 WO2007124385A2 (en) 2006-04-19 2007-04-19 Processing of query terms

Country Status (3)

Country Link
EP (1) EP2016486A4 (en)
CN (1) CN102024026B (en)
WO (1) WO2007124385A2 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8661012B1 (en) * 2006-12-29 2014-02-25 Google Inc. Ensuring that a synonym for a query phrase does not drop information present in the query phrase
KR100930617B1 (en) * 2008-04-08 2009-12-09 한국과학기술정보연구원 Multiple object-oriented integrated search system and method
JP2012114584A (en) * 2010-11-22 2012-06-14 Buffalo Inc Radio communication system
CN105354026A (en) * 2015-10-29 2016-02-24 杭州佳谷数控技术有限公司 Multilingual implementation method of underwear machine control system
CN113035170B (en) * 2019-12-25 2022-07-12 中国科学院声学研究所 Voice recognition method and system of Turkish based on vowel harmony
CN111539228B (en) * 2020-04-29 2023-08-08 支付宝(杭州)信息技术有限公司 Vector model training method and device and similarity determining method and device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5717913A (en) * 1995-01-03 1998-02-10 University Of Central Florida Method for detecting and extracting text data using database schemas
US5956711A (en) * 1997-01-16 1999-09-21 Walter J. Sullivan, III Database system with restricted keyword list and bi-directional keyword translation

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6999932B1 (en) 2000-10-10 2006-02-14 Intel Corporation Language independent voice-based search system
FR2848688A1 (en) * 2002-12-17 2004-06-18 France Telecom Text language identifying device for linguistic analysis of text, has analyzing unit to analyze chain characters of words extracted from one text, where each chain is completed so that each time chains are found in word
US7451129B2 (en) * 2003-03-31 2008-11-11 Google Inc. System and method for providing preferred language ordering of search results
CN1598814A (en) * 2003-09-19 2005-03-23 鸿富锦精密工业(深圳)有限公司 Classification retrieval system and method for synonym

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5717913A (en) * 1995-01-03 1998-02-10 University Of Central Florida Method for detecting and extracting text data using database schemas
US5956711A (en) * 1997-01-16 1999-09-21 Walter J. Sullivan, III Database system with restricted keyword list and bi-directional keyword translation

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP2016486A4 *

Also Published As

Publication number Publication date
EP2016486A4 (en) 2011-08-10
EP2016486A2 (en) 2009-01-21
CN102024026A (en) 2011-04-20
WO2007124385A2 (en) 2007-11-01
CN102024026B (en) 2013-03-27

Similar Documents

Publication Publication Date Title
US9916304B2 (en) Method of creating translation corpus
WO2007062397A3 (en) Inferring search category synonyms from user logs
CA2656425C (en) Recognizing text in images
WO2007106858A3 (en) System, method, and computer program product for data mining and automatically generating hypotheses from data repositories
WO2007124385A3 (en) Processing of query terms
WO2005033967A3 (en) Systems and methods for searching using queries written in a different character-set and/or language from the target pages
NZ578672A (en) Information-retrieval systems, methods, and software with concept-based searching and ranking
WO2004072757A3 (en) Text and attribute searches of data stores that include business object
WO2006072882A3 (en) Embedded translation-enhanced search
WO2008097490A3 (en) A method and an apparatus to disambiguate requests
WO2007101194A3 (en) System and method for identifying related queries for languages with multiple writing systems
WO2012015958A3 (en) Semantically generating personalized recommendations based on social feeds to a user in real-time and display methods thereof
WO2005124599A3 (en) Content search in complex language, such as japanese
WO2007005536A3 (en) Information retrieving and displaying method and computer-readable medium
WO2007114932A3 (en) Search system and method with text function tagging
WO2007033468A3 (en) System and method configuring contextual based content with publisher content for display on a user interface
WO2011035007A3 (en) Systems and methods for providing advanced search result page content
WO2008157021A3 (en) Text prediction with partial selection in a variety of domains
WO2005010691A3 (en) Disambiguation of search phrases using interpretation clusters
WO2010062737A3 (en) Retrieval using a generalized sentence collocation
Hou et al. Classifications and typologies: Labeling sign languages and signing communities
NZ583751A (en) A system and method using graphical user interfaces for jury veridct information
EP1675019A3 (en) System and method for disambiguating non diacritized arabic words in a text
WO2013002940A3 (en) Method and apparatus for creating a search index for a composite document and searching same
WO2014204701A1 (en) Providing web-based alternate text options

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200780021902.1

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07760955

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2007760955

Country of ref document: EP