CN103106220B - Search method, search apparatus and search engine system - Google Patents

Search method, search apparatus and search engine system

Info

Publication number
CN103106220B
Authority
CN
China
Prior art keywords
search
real
words
string
matching
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201110361975.3A
Other languages
English (en)
Chinese (zh)
Other versions
CN103106220A (zh)
Inventor
郎皓
唐超
张小洵
薛贵荣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd
Priority to CN201610311962.8A (CN105956137B)
Priority to CN201110361975.3A (CN103106220B)
Priority to TW101107359A (TW201319842A)
Priority to US13/677,147 (US8959080B2)
Priority to PCT/US2012/065096 (WO2013074685A1)
Priority to JP2014541422A (JP6006327B2)
Priority to EP12816542.0A (EP2780837A1)
Publication of CN103106220A
Priority to HK13108164.8A (HK1181132B)
Priority to US14/589,883 (US9477761B2)
Application granted
Publication of CN103106220B
Priority to JP2016175645A (JP6291001B2)
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/903 Querying
    • G06F16/9032 Query formulation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
CN201110361975.3A 2011-11-15 2011-11-15 Search method, search apparatus and search engine system Active CN103106220B (zh)

Priority Applications (10)

Application Number Priority Date Filing Date Title
CN201610311962.8A CN105956137B (zh) 2011-11-15 2011-11-15 Search method, search apparatus and search engine system
CN201110361975.3A CN103106220B (zh) 2011-11-15 2011-11-15 Search method, search apparatus and search engine system
TW101107359A TW201319842A (zh) 2011-11-15 2012-03-05 Search method, search apparatus and search engine system
PCT/US2012/065096 WO2013074685A1 (en) 2011-11-15 2012-11-14 Search method, search apparatus and search engine system
JP2014541422A JP6006327B2 (ja) 2011-11-15 2012-11-14 Search method, search apparatus and search engine system
EP12816542.0A EP2780837A1 (en) 2011-11-15 2012-11-14 Search method, search apparatus and search engine system
US13/677,147 US8959080B2 (en) 2011-11-15 2012-11-14 Search method, search apparatus and search engine system
HK13108164.8A HK1181132B (en) 2013-07-12 A searching method, searching device and searching engine system
US14/589,883 US9477761B2 (en) 2011-11-15 2015-01-05 Search method, search apparatus and search engine system
JP2016175645A JP6291001B2 (ja) 2011-11-15 2016-09-08 Search method, search apparatus and search engine system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110361975.3A CN103106220B (zh) 2011-11-15 2011-11-15 Search method, search apparatus and search engine system

Related Child Applications (1)

Application Number Priority Date Filing Date Title
CN201610311962.8A (Division) CN105956137B (zh) 2011-11-15 2011-11-15 Search method, search apparatus and search engine system

Publications (2)

Publication Number Publication Date
CN103106220A (zh) 2013-05-15
CN103106220B (zh) 2016-08-03

Family

ID=47594974

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201110361975.3A Active CN103106220B (zh) 2011-11-15 2011-11-15 Search method, search apparatus and search engine system
CN201610311962.8A Active CN105956137B (zh) 2011-11-15 2011-11-15 Search method, search apparatus and search engine system

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN201610311962.8A Active CN105956137B (zh) 2011-11-15 2011-11-15 Search method, search apparatus and search engine system

Country Status (6)

Country Link
US (2) US8959080B2 (en)
EP (1) EP2780837A1 (en)
JP (2) JP6006327B2 (en)
CN (2) CN103106220B (en)
TW (1) TW201319842A (en)
WO (1) WO2013074685A1 (en)

Families Citing this family (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103544266B (zh) * 2013-10-16 2017-05-31 北京奇虎科技有限公司 Method and device for generating search suggestion words
US9418103B2 (en) * 2013-12-06 2016-08-16 Quixey, Inc. Techniques for reformulating search queries
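CN105446982A (zh) * 2014-06-30 2016-03-30 国际商业机器公司 Method and apparatus for managing a data storage system
CN104462575B (zh) * 2014-12-29 2019-03-08 北京奇虎科技有限公司 Method and device for implementing comprehensive music search
CN105138535A (zh) * 2015-06-30 2015-12-09 百度在线网络技术(北京)有限公司 Method and device for displaying search results
CN104991943A (zh) * 2015-07-10 2015-10-21 百度在线网络技术(北京)有限公司 Music search method and device
CN109643322B (zh) * 2016-09-02 2022-11-29 株式会社日立高新技术 Method for constructing a character string dictionary, method for searching a character string dictionary, and system for processing a character string dictionary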
US11170005B2 (en) * 2016-10-04 2021-11-09 Verizon Media Inc. Online ranking of queries for sponsored search
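CN106446235B (zh) * 2016-10-10 2021-04-06 Tcl科技集团股份有限公司 Video search method and device
TWI645303B (zh) * 2016-12-21 2018-12-21 財團法人工業技術研究院 String verification method, string expansion method and verification model training method
CN106844482B (zh) * 2016-12-23 2021-01-29 北京奇虎科技有限公司 Search-engine-based retrieval information matching method and device
CN106933947B (zh) * 2017-01-20 2018-12-04 北京三快在线科技有限公司 Search method and device, and electronic equipment
CN107480162B (zh) * 2017-06-15 2021-09-21 北京百度网讯科技有限公司 Artificial-intelligence-based search method, device, equipment and computer-readable storage medium
CN107256267B (zh) * 2017-06-19 2020-07-24 北京百度网讯科技有限公司 Query method and device
CN107704525A (zh) * 2017-09-04 2018-02-16 优酷网络技术(北京)有限公司 Video search method and device
CN110472058B (zh) 2018-05-09 2023-03-03 华为技术有限公司 Entity search method, related equipment and computer storage medium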
US10585922B2 (en) 2018-05-23 2020-03-10 International Business Machines Corporation Finding a resource in response to a query including unknown words
US11379487B2 (en) 2018-08-27 2022-07-05 International Business Machines Corporation Intelligent and interactive knowledge system
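CN109543016A (zh) * 2018-11-15 2019-03-29 北京搜狗科技发展有限公司 Data processing method and device, and device for data processing
CN109902149B (zh) 2019-02-21 2021-08-13 北京百度网讯科技有限公司 Query processing method and device, and computer-readable medium
CN110162535B (zh) * 2019-03-26 2023-11-07 腾讯科技(深圳)有限公司 Method, device, equipment and storage medium for performing personalized search
CN109977294B (zh) * 2019-04-03 2020-04-28 三角兽(北京)科技有限公司 Information/query processing device, query processing/text query method, and storage medium
CN110489032B (zh) * 2019-08-14 2021-08-24 掌阅科技股份有限公司 Dictionary lookup method for e-books and electronic equipment
CN111090771B (zh) * 2019-10-31 2023-08-25 腾讯音乐娱乐科技(深圳)有限公司 Song search method, device and computer storage medium
CN111782962B (zh) * 2020-09-04 2021-01-12 浙江口碑网络技术有限公司 Pattern matching method, device and electronic equipment
CN112182321B (zh) * 2020-09-28 2023-12-15 严永存 Map-technology-based Internet information publishing and search method
CN112163104B (zh) * 2020-09-29 2022-04-15 北京字跳网络技术有限公司 Method, device, electronic equipment and storage medium for searching target content
CN112434072B (zh) * 2021-01-27 2021-04-30 浙江口碑网络技术有限公司 Search method, device, electronic equipment and storage medium
CN112965992B (zh) * 2021-03-22 2023-08-15 三门核电有限公司 Human-computer interaction method and device for multi-parameter-constrained data retrieval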
US20220398251A1 (en) * 2021-06-14 2022-12-15 Bank Of America Corporation Data processing system and method for implementing a search engine based on detecting intent from a search string
CN113312523B (zh) * 2021-07-30 2021-12-14 北京达佳互联信息技术有限公司 Dictionary generation and search keyword recommendation method, device and server
US11816427B1 (en) * 2022-10-27 2023-11-14 Intuit, Inc. Automated data classification error correction through spatial analysis using machine learning
CN117493641B (zh) * 2024-01-02 2024-03-22 中国电子科技集团公司第二十八研究所 Secondary fuzzy search method based on semantic metadata

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101145153A (zh) * 2006-09-13 2008-03-19 阿里巴巴公司 Method and system for searching information
CN101770498A (zh) * 2009-01-05 2010-07-07 李铭 Step-by-step search method
CN102043833A (zh) * 2010-11-25 2011-05-04 北京搜狗科技发展有限公司 Method and search device for searching based on query terms

Family Cites Families (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999063425A1 (fr) * 1998-06-02 1999-12-09 Sony Corporation Information processing method and apparatus, and information providing medium
JP2002288201A (ja) * 2001-03-23 2002-10-04 Fujitsu Ltd Question answering processing method, question answering processing program, recording medium for the question answering processing program, and question answering processing device
US7269545B2 (en) * 2001-03-30 2007-09-11 Nec Laboratories America, Inc. Method for retrieving answers from an information retrieval system
JP2003108584A (ja) * 2001-09-28 2003-04-11 Casio Comput Co Ltd Information retrieval system and program
US7840547B1 (en) * 2004-03-31 2010-11-23 Google Inc. Methods and systems for efficient query rewriting
US7519581B2 (en) * 2004-04-30 2009-04-14 Yahoo! Inc. Method and apparatus for performing a search
US7860875B2 (en) 2004-05-26 2010-12-28 International Business Machines Corporation Method for modifying a query by use of an external system for managing assignment of user and data classifications
US20060106769A1 (en) * 2004-11-12 2006-05-18 Gibbs Kevin A Method and system for autocompletion for languages having ideographs and phonetic characters
US7620628B2 (en) * 2004-12-06 2009-11-17 Yahoo! Inc. Search processing with automatic categorization of queries
US7401073B2 (en) 2005-04-28 2008-07-15 International Business Machines Corporation Term-statistics modification for category-based search
US7844599B2 (en) 2005-08-24 2010-11-30 Yahoo! Inc. Biasing queries to determine suggested queries
US8676868B2 (en) * 2006-08-04 2014-03-18 Chacha Search, Inc Macro programming for resources
US7860886B2 (en) * 2006-09-29 2010-12-28 A9.Com, Inc. Strategy for providing query results based on analysis of user intent
US8010529B2 (en) 2006-10-23 2011-08-30 Yahoo! Inc. System and method for determining a relationship between available content and current interests to identify a need for content
US20080313142A1 (en) * 2007-06-14 2008-12-18 Microsoft Corporation Categorization of queries
US20090094224A1 (en) * 2007-10-05 2009-04-09 Google Inc. Collaborative search results
WO2009061390A1 (en) * 2007-11-05 2009-05-14 Enhanced Medical Decisions, Inc. Machine learning systems and methods for improved natural language processing
US8041733B2 (en) * 2008-10-14 2011-10-18 Yahoo! Inc. System for automatically categorizing queries
US20100094826A1 (en) * 2008-10-14 2010-04-15 Omid Rouhani-Kalleh System for resolving entities in text into real world objects using context
US20100094835A1 (en) * 2008-10-15 2010-04-15 Yumao Lu Automatic query concepts identification and drifting for web search
CN101770499A (zh) * 2009-01-07 2010-07-07 上海聚力传媒技术有限公司 Information retrieval method in a search engine and corresponding search engine
US8745076B2 (en) * 2009-01-13 2014-06-03 Red Hat, Inc. Structured query language syntax rewriting
US8533181B2 (en) * 2009-04-29 2013-09-10 Oracle International Corporation Partition pruning via query rewrite
US20100299342A1 (en) 2009-05-22 2010-11-25 Nbc Universal, Inc. System and method for modification in computerized searching
US8161035B2 (en) * 2009-06-04 2012-04-17 Oracle International Corporation Query optimization by specifying path-based predicate evaluation in a path-based query operator
US9405841B2 (en) 2009-10-15 2016-08-02 A9.Com, Inc. Dynamic search suggestion and category specific completion
US20120259829A1 (en) 2009-12-30 2012-10-11 Xin Zhou Generating related input suggestions
US8719246B2 (en) 2010-06-28 2014-05-06 Microsoft Corporation Generating and presenting a suggested search query
US20120117102A1 (en) * 2010-11-04 2012-05-10 Microsoft Corporation Query suggestions using replacement substitutions and an advanced query syntax
US8219575B2 (en) * 2010-11-12 2012-07-10 Business Objects Software Ltd. Method and system for specifying, preparing and using parameterized database queries
US8515986B2 (en) * 2010-12-02 2013-08-20 Microsoft Corporation Query pattern generation for answers coverage expansion
US8799312B2 (en) * 2010-12-23 2014-08-05 Microsoft Corporation Efficient label acquisition for query rewriting
CN102073725B (zh) * 2011-01-11 2013-05-08 百度在线网络技术(北京)有限公司 Structured data search method and search engine system implementing the search method
US20120179705A1 (en) * 2011-01-11 2012-07-12 Microsoft Corporation Query reformulation in association with a search box
CN102214208B (zh) * 2011-04-27 2014-04-09 百度在线网络技术(北京)有限公司 Method and device for generating structured information entities from unstructured text
US8667007B2 (en) * 2011-05-26 2014-03-04 International Business Machines Corporation Hybrid and iterative keyword and category search technique
US20130086509A1 (en) 2011-09-29 2013-04-04 Microsoft Corporation Alternative query suggestions by dropping query terms

Also Published As

Publication number Publication date
CN105956137B (zh) 2019-10-01
US9477761B2 (en) 2016-10-25
JP2016201153A (ja) 2016-12-01
US20130124493A1 (en) 2013-05-16
JP6291001B2 (ja) 2018-03-14
US8959080B2 (en) 2015-02-17
CN103106220A (zh) 2013-05-15
WO2013074685A1 (en) 2013-05-23
CN105956137A (zh) 2016-09-21
HK1181132A1 (zh) 2013-11-01
TW201319842A (zh) 2013-05-16
US20150161263A1 (en) 2015-06-11
JP2014533407A (ja) 2014-12-11
JP6006327B2 (ja) 2016-10-12
EP2780837A1 (en) 2014-09-24

Similar Documents

Publication Publication Date Title
CN103106220B (zh) Search method, search apparatus and search engine system
US10261954B2 (en) Optimizing search result snippet selection
US9864808B2 (en) Knowledge-based entity detection and disambiguation
CN104239340B (zh) Search result filtering method and device
US10146862B2 (en) Context-based metadata generation and automatic annotation of electronic media in a computer network
He et al. Crawling deep web entity pages
Shinzato et al. Tsubaki: An open search engine infrastructure for developing information access methodology
CN105022827B (zh) Domain-topic-oriented dynamic aggregation method for Web news
CN102200975B (zh) Vertical search engine system using semantic analysis
US20160034514A1 (en) Providing search results based on an identified user interest and relevance matching
Gupta et al. An overview of social tagging and applications
CN105389328B (zh) Search ranking optimization method for large-scale open source software
WO2018013400A1 (en) Contextual based image search results
CN105279231A (zh) Method for aggregated search of music resources
CN104252487A (zh) Method and device for generating entry information
US20150088859A1 (en) Click magnet images
Djuana et al. Personalization in tag ontology learning for recommendation making
Zhang et al. A semantics-based method for clustering of Chinese web search results
Djuana Tjhwa et al. Learning personalized tag ontology from user tagging information
Al-Hamami et al. Development of an opinion blog mining system
Kahng et al. Ranking objects by following paths in entity-relationship graphs
TWI423053B (zh) Domain Interpretation Data Retrieval Method and Its System
TW201124860A (en) Method and apparatus for identifying synonym, and searching method and apparatus utilizing the same.
HK1181132B (en) A searching method, searching device and searching engine system
Yamamoto et al. An editable browser for reranking web search results

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code (Ref country code: HK; Ref legal event code: DE; Ref document number: 1181132; Country of ref document: HK)

C14 Grant of patent or utility model
GR01 Patent grant
REG Reference to a national code (Ref country code: HK; Ref legal event code: GR; Ref document number: 1181132; Country of ref document: HK)