WO2007124385A3 - Processing of query terms - Google Patents
Processing of query terms Download PDFInfo
- Publication number
- WO2007124385A3 WO2007124385A3 PCT/US2007/067014 US2007067014W WO2007124385A3 WO 2007124385 A3 WO2007124385 A3 WO 2007124385A3 US 2007067014 W US2007067014 W US 2007067014W WO 2007124385 A3 WO2007124385 A3 WO 2007124385A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- query
- synonyms
- method includes
- another aspect
- language
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/3332—Query translation
- G06F16/3338—Query expansion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/237—Lexical tools
- G06F40/247—Thesauruses; Synonyms
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/263—Language identification
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
Abstract
Methods, systems, and apparatus, including computer program products, to perform operations relating to processing query terms in a search query presented to a search engine. In one aspect, a method includes determining a query language from the query terms and the language of a user interface. In another aspect, a method includes using the interface language to select one or more mappings and using the mappings to simplify each query term; and applying each simplified query term to a synonyms map to identify possible synonyms with which to augment the search query. In another aspect, a synonyms map is generated from a corpus of documents. In another aspect, a method includes identifying one or more potential synonyms for a query term by looking up simplified query term in a synonyms map, the synonyms map mapping each of a plurality of keys to one or more variants, each variant being a word associated with one or more document languages.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2007800219021A CN101467125B (en) | 2006-04-19 | 2007-04-19 | Processing of query terms |
EP07760955A EP2016486A4 (en) | 2006-04-19 | 2007-04-19 | Processing of query terms |
Applications Claiming Priority (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/408,245 | 2006-04-19 | ||
US11/407,860 | 2006-04-19 | ||
US11/408,242 | 2006-04-19 | ||
US11/407,860 US7835903B2 (en) | 2006-04-19 | 2006-04-19 | Simplifying query terms with transliteration |
US11/408,245 US8762358B2 (en) | 2006-04-19 | 2006-04-19 | Query language determination using query terms and interface language |
US11/408,243 US7475063B2 (en) | 2006-04-19 | 2006-04-19 | Augmenting queries with synonyms selected using language statistics |
US11/408,242 US8255376B2 (en) | 2006-04-19 | 2006-04-19 | Augmenting queries with synonyms from synonyms map |
US11/408,243 | 2006-04-19 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2007124385A2 WO2007124385A2 (en) | 2007-11-01 |
WO2007124385A3 true WO2007124385A3 (en) | 2008-08-28 |
Family
ID=38625747
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2007/067014 WO2007124385A2 (en) | 2006-04-19 | 2007-04-19 | Processing of query terms |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP2016486A4 (en) |
CN (1) | CN102024026B (en) |
WO (1) | WO2007124385A2 (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8661012B1 (en) * | 2006-12-29 | 2014-02-25 | Google Inc. | Ensuring that a synonym for a query phrase does not drop information present in the query phrase |
KR100930617B1 (en) * | 2008-04-08 | 2009-12-09 | 한국과학기술정보연구원 | Multiple object-oriented integrated search system and method |
JP2012114584A (en) * | 2010-11-22 | 2012-06-14 | Buffalo Inc | Radio communication system |
CN105354026A (en) * | 2015-10-29 | 2016-02-24 | 杭州佳谷数控技术有限公司 | Multilingual implementation method of underwear machine control system |
CN113035170B (en) * | 2019-12-25 | 2022-07-12 | 中国科学院声学研究所 | Voice recognition method and system of Turkish based on vowel harmony |
CN111539228B (en) * | 2020-04-29 | 2023-08-08 | 支付宝(杭州)信息技术有限公司 | Vector model training method and device and similarity determining method and device |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5717913A (en) * | 1995-01-03 | 1998-02-10 | University Of Central Florida | Method for detecting and extracting text data using database schemas |
US5956711A (en) * | 1997-01-16 | 1999-09-21 | Walter J. Sullivan, III | Database system with restricted keyword list and bi-directional keyword translation |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6999932B1 (en) | 2000-10-10 | 2006-02-14 | Intel Corporation | Language independent voice-based search system |
FR2848688A1 (en) * | 2002-12-17 | 2004-06-18 | France Telecom | Text language identifying device for linguistic analysis of text, has analyzing unit to analyze chain characters of words extracted from one text, where each chain is completed so that each time chains are found in word |
US7451129B2 (en) * | 2003-03-31 | 2008-11-11 | Google Inc. | System and method for providing preferred language ordering of search results |
CN1598814A (en) * | 2003-09-19 | 2005-03-23 | 鸿富锦精密工业(深圳)有限公司 | Classification retrieval system and method for synonym |
-
2007
- 2007-04-19 CN CN2010105465806A patent/CN102024026B/en active Active
- 2007-04-19 WO PCT/US2007/067014 patent/WO2007124385A2/en active Application Filing
- 2007-04-19 EP EP07760955A patent/EP2016486A4/en not_active Withdrawn
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5717913A (en) * | 1995-01-03 | 1998-02-10 | University Of Central Florida | Method for detecting and extracting text data using database schemas |
US5956711A (en) * | 1997-01-16 | 1999-09-21 | Walter J. Sullivan, III | Database system with restricted keyword list and bi-directional keyword translation |
Non-Patent Citations (1)
Title |
---|
See also references of EP2016486A4 * |
Also Published As
Publication number | Publication date |
---|---|
EP2016486A4 (en) | 2011-08-10 |
EP2016486A2 (en) | 2009-01-21 |
CN102024026A (en) | 2011-04-20 |
WO2007124385A2 (en) | 2007-11-01 |
CN102024026B (en) | 2013-03-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9916304B2 (en) | Method of creating translation corpus | |
WO2007062397A3 (en) | Inferring search category synonyms from user logs | |
CA2656425C (en) | Recognizing text in images | |
WO2007106858A3 (en) | System, method, and computer program product for data mining and automatically generating hypotheses from data repositories | |
WO2007124385A3 (en) | Processing of query terms | |
WO2005033967A3 (en) | Systems and methods for searching using queries written in a different character-set and/or language from the target pages | |
NZ578672A (en) | Information-retrieval systems, methods, and software with concept-based searching and ranking | |
WO2004072757A3 (en) | Text and attribute searches of data stores that include business object | |
WO2006072882A3 (en) | Embedded translation-enhanced search | |
WO2008097490A3 (en) | A method and an apparatus to disambiguate requests | |
WO2007101194A3 (en) | System and method for identifying related queries for languages with multiple writing systems | |
WO2012015958A3 (en) | Semantically generating personalized recommendations based on social feeds to a user in real-time and display methods thereof | |
WO2005124599A3 (en) | Content search in complex language, such as japanese | |
WO2007005536A3 (en) | Information retrieving and displaying method and computer-readable medium | |
WO2007114932A3 (en) | Search system and method with text function tagging | |
WO2007033468A3 (en) | System and method configuring contextual based content with publisher content for display on a user interface | |
WO2011035007A3 (en) | Systems and methods for providing advanced search result page content | |
WO2008157021A3 (en) | Text prediction with partial selection in a variety of domains | |
WO2005010691A3 (en) | Disambiguation of search phrases using interpretation clusters | |
WO2010062737A3 (en) | Retrieval using a generalized sentence collocation | |
Hou et al. | Classifications and typologies: Labeling sign languages and signing communities | |
NZ583751A (en) | A system and method using graphical user interfaces for jury veridct information | |
EP1675019A3 (en) | System and method for disambiguating non diacritized arabic words in a text | |
WO2013002940A3 (en) | Method and apparatus for creating a search index for a composite document and searching same | |
WO2014204701A1 (en) | Providing web-based alternate text options |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 200780021902.1 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 07760955 Country of ref document: EP Kind code of ref document: A2 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2007760955 Country of ref document: EP |