TWI547815B - Information retrieval method and device - Google Patents
Information retrieval method and device Download PDFInfo
- Publication number
- TWI547815B TWI547815B TW101103773A TW101103773A TWI547815B TW I547815 B TWI547815 B TW I547815B TW 101103773 A TW101103773 A TW 101103773A TW 101103773 A TW101103773 A TW 101103773A TW I547815 B TWI547815 B TW I547815B
- Authority
- TW
- Taiwan
- Prior art keywords
- synonym
- pair
- spectrum
- word
- information
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/3332—Query translation
- G06F16/3338—Query expansion
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201110391864.7A CN103136262B (zh) | 2011-11-30 | 2011-11-30 | 信息检索方法及装置 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| TW201322020A TW201322020A (zh) | 2013-06-01 |
| TWI547815B true TWI547815B (zh) | 2016-09-01 |
Family
ID=47470148
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW101103773A TWI547815B (zh) | 2011-11-30 | 2012-02-06 | Information retrieval method and device |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US20130138429A1 (enExample) |
| EP (1) | EP2786275A1 (enExample) |
| JP (1) | JP6124917B2 (enExample) |
| CN (1) | CN103136262B (enExample) |
| TW (1) | TWI547815B (enExample) |
| WO (1) | WO2013082506A1 (enExample) |
Families Citing this family (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| NZ589787A (en) * | 2010-12-08 | 2012-03-30 | S L I Systems Inc | A method for determining relevant search results |
| US20150019382A1 (en) * | 2012-10-19 | 2015-01-15 | Rakuten, Inc. | Corpus creation device, corpus creation method and corpus creation program |
| US10339216B2 (en) | 2013-07-26 | 2019-07-02 | Nuance Communications, Inc. | Method and apparatus for selecting among competing models in a tool for building natural language understanding models |
| CN104598613B (zh) * | 2015-01-30 | 2017-11-03 | 百度在线网络技术(北京)有限公司 | 一种用于垂直领域的概念关系构建方法和装置 |
| CN105069086B (zh) * | 2015-07-31 | 2017-07-11 | 焦点科技股份有限公司 | 一种优化电子商务商品搜索的方法及系统 |
| CN106815265B (zh) * | 2015-12-01 | 2020-07-03 | 北京国双科技有限公司 | 裁判文书的搜索方法及装置 |
| CN106844571B (zh) * | 2017-01-03 | 2020-04-07 | 北京齐尔布莱特科技有限公司 | 识别同义词的方法、装置和计算设备 |
| CN109002432B (zh) * | 2017-06-07 | 2022-01-04 | 北京京东尚科信息技术有限公司 | 同义词的挖掘方法及装置、计算机可读介质、电子设备 |
| CN108881945B (zh) * | 2018-07-11 | 2020-09-22 | 深圳创维数字技术有限公司 | 消除关键词歧义的方法、电视及可读存储介质 |
| CN109522547B (zh) * | 2018-10-23 | 2020-09-18 | 浙江大学 | 基于模式学习的中文同义词迭代抽取方法 |
| CN110688837B (zh) * | 2019-09-27 | 2023-10-31 | 北京百度网讯科技有限公司 | 数据处理的方法及装置 |
| JP7747403B2 (ja) * | 2020-02-21 | 2025-10-01 | 日本電気株式会社 | シナリオ生成装置、シナリオ生成方法、及びプログラム |
| CN114791973A (zh) * | 2021-01-23 | 2022-07-26 | 北京猎户星空科技有限公司 | 一种数据处理方法、装置、设备及介质 |
| CN114817625B (zh) * | 2022-05-25 | 2025-08-22 | 腾讯音乐娱乐科技(深圳)有限公司 | 文本处理方法及装置 |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20040003005A1 (en) * | 2002-06-28 | 2004-01-01 | Surajit Chaudhuri | Detecting duplicate records in databases |
| CN101432685A (zh) * | 2006-02-28 | 2009-05-13 | 电子湾有限公司 | 数据库搜索查询的扩展 |
Family Cites Families (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP3379608B2 (ja) * | 1994-11-24 | 2003-02-24 | 日本電信電話株式会社 | 単語間意味類似性判別方法 |
| JP2003091552A (ja) * | 2001-09-17 | 2003-03-28 | Hitachi Ltd | 検索要求情報抽出方法及びその実施システム並びにその処理プログラム |
| WO2005020094A1 (en) * | 2003-08-21 | 2005-03-03 | Idilia Inc. | System and method for associating documents with contextual advertisements |
| NO325864B1 (no) * | 2006-11-07 | 2008-08-04 | Fast Search & Transfer Asa | Fremgangsmåte ved beregning av sammendragsinformasjon og en søkemotor for å støtte og implementere fremgangsmåten |
| US7890521B1 (en) * | 2007-02-07 | 2011-02-15 | Google Inc. | Document-based synonym generation |
| US20100094835A1 (en) * | 2008-10-15 | 2010-04-15 | Yumao Lu | Automatic query concepts identification and drifting for web search |
-
2011
- 2011-11-30 CN CN201110391864.7A patent/CN103136262B/zh active Active
-
2012
- 2012-02-06 TW TW101103773A patent/TWI547815B/zh not_active IP Right Cessation
- 2012-11-30 JP JP2014544948A patent/JP6124917B2/ja active Active
- 2012-11-30 EP EP12808973.7A patent/EP2786275A1/en not_active Withdrawn
- 2012-11-30 WO PCT/US2012/067411 patent/WO2013082506A1/en not_active Ceased
- 2012-11-30 US US13/691,268 patent/US20130138429A1/en not_active Abandoned
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20040003005A1 (en) * | 2002-06-28 | 2004-01-01 | Surajit Chaudhuri | Detecting duplicate records in databases |
| CN101432685A (zh) * | 2006-02-28 | 2009-05-13 | 电子湾有限公司 | 数据库搜索查询的扩展 |
Non-Patent Citations (2)
| Title |
|---|
| Giriprasad Sridhara, Emily Hill, Lori Pollock and K. Vijay-Shanker, "Identifying Word Relations in Software: A Comparative Study of Semantic Similarity Tools", The 16th IEEE International Conference on Program Comprehension, ICPC 2008. Pp. 123-132, 10-13 June 2008 * |
| Hu Yan, Li Wei, Qiu Ying, Wu Wei, "Research of Duplicate Record Cleaning Technology Based on a Reformative Keywords Matching Algorithm", 2009 International Conference on E-Business and Information System Security, Pp. 1-5, 23-24 May 2009 * |
Also Published As
| Publication number | Publication date |
|---|---|
| TW201322020A (zh) | 2013-06-01 |
| CN103136262A (zh) | 2013-06-05 |
| WO2013082506A1 (en) | 2013-06-06 |
| JP2015500525A (ja) | 2015-01-05 |
| CN103136262B (zh) | 2016-08-24 |
| US20130138429A1 (en) | 2013-05-30 |
| EP2786275A1 (en) | 2014-10-08 |
| JP6124917B2 (ja) | 2017-05-10 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| TWI547815B (zh) | Information retrieval method and device | |
| KR102080362B1 (ko) | 쿼리 확장 | |
| Madhu et al. | Intelligent semantic web search engines: a brief survey | |
| JP5391633B2 (ja) | オントロジー空間を規定するタームの推奨 | |
| US8478749B2 (en) | Method and apparatus for determining relevant search results using a matrix framework | |
| US8965872B2 (en) | Identifying query formulation suggestions for low-match queries | |
| CN102722498B (zh) | 搜索引擎及其实现方法 | |
| CN102722501B (zh) | 搜索引擎及其实现方法 | |
| CN112988969A (zh) | 用于文本检索的方法、装置、设备以及存储介质 | |
| CN102737021B (zh) | 搜索引擎及其实现方法 | |
| CN101364239A (zh) | 一种分类目录自动构建方法及相关系统 | |
| US20160321365A1 (en) | Systems and methods for evaluating search query terms for improving search results | |
| KR20160042896A (ko) | 마이닝된 하이퍼링크 텍스트 스니펫을 통한 이미지 브라우징 | |
| CN102722499A (zh) | 搜索引擎及其实现方法 | |
| CN105389328B (zh) | 一种大规模开源软件搜索排序优化方法 | |
| CN106599304B (zh) | 一种针对中小型网站的模块化用户检索意图建模方法 | |
| CN103942232B (zh) | 用于挖掘意图的方法和设备 | |
| US20150149448A1 (en) | Method and system for generating dynamic themes for social data | |
| CN105512224A (zh) | 基于光标位置序列的搜索引擎用户满意度自动评估方法 | |
| US10255246B1 (en) | Systems and methods for providing a searchable concept network | |
| CN117370932A (zh) | 基于多模态数据融合感知的交通运输情报处理及感知方法 | |
| CN108932247A (zh) | 一种优化文本搜索的方法及装置 | |
| KR20240115436A (ko) | 트렌드 키워드 구축 및 활용 시스템 및 방법 | |
| CN107562904B (zh) | 融合项权值与频度的英文词间加权正负关联模式挖掘方法 | |
| CN111222918A (zh) | 关键词挖掘方法、装置、电子设备及存储介质 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| MM4A | Annulment or lapse of patent due to non-payment of fees |