TW200715152A - Systems for and methods of finding relevant documents by analyzing tags - Google Patents
Systems for and methods of finding relevant documents by analyzing tagsInfo
- Publication number
- TW200715152A TW200715152A TW095128551A TW95128551A TW200715152A TW 200715152 A TW200715152 A TW 200715152A TW 095128551 A TW095128551 A TW 095128551A TW 95128551 A TW95128551 A TW 95128551A TW 200715152 A TW200715152 A TW 200715152A
- Authority
- TW
- Taiwan
- Prior art keywords
- tag
- relevance
- objects
- algorithms
- search query
- Prior art date
Links
Abstract
A method of determining relevancies of objects to a search query includes associating multiple tags with multiple objects, recording bookmarks to the multiple objects, or both, and determining a relevance score for each of the multiple objects and a search query. One embodiment of the method combines full-text relevance algorithms with tag relevance algorithms. Other embodiments include statistical relevance algorithms such as statistical classification or rank regression algorithms. When a user executes a search query, a results list containing the objects is returned, with the objects organized based on the relevance scores. The objects are organized by, for example, listing those with the highest relevance scores first or by marking them with an indication of their relevance. Preferably, relevance scores for a tag-object pair are based on a number of times a term in the tag has been associated with the object, a number of tags associated with the object, a number of times that the tag has been associated with the multiple objects, a number of tag-object pairs that contain a term in the tag, a number of tag-object pairs that contain a reference to the object, or any combination of these.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US70570405P | 2005-08-03 | 2005-08-03 |
Publications (2)
Publication Number | Publication Date |
---|---|
TW200715152A true TW200715152A (en) | 2007-04-16 |
TWI391834B TWI391834B (en) | 2013-04-01 |
Family
ID=40014928
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW95128551A TWI391834B (en) | 2005-08-03 | 2006-08-03 | Systems for and methods of finding relevant documents by analyzing tags |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN101283353B (en) |
TW (1) | TWI391834B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI411926B (en) * | 2009-01-05 | 2013-10-11 | Inventec Corp | Generating dynamic web pages system and method thereof |
TWI451273B (en) * | 2007-05-04 | 2014-09-01 | Microsoft Corp | Method, system, and computer readable medium for link spam detection using smooth classification function |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101593187B (en) * | 2008-05-30 | 2012-05-30 | 国际商业机器公司 | Method and system for managing book marks |
WO2011022867A1 (en) * | 2009-08-24 | 2011-03-03 | Hewlett-Packard Development Company, L.P. | Method and apparatus for searching electronic documents |
US20120002884A1 (en) * | 2010-06-30 | 2012-01-05 | Alcatel-Lucent Usa Inc. | Method and apparatus for managing video content |
US8880517B2 (en) * | 2011-02-18 | 2014-11-04 | Microsoft Corporation | Propagating signals across a web graph |
US10402407B2 (en) * | 2013-06-17 | 2019-09-03 | Lenovo (Singapore) Pte. Ltd. | Contextual smart tags for content retrieval |
US20150046418A1 (en) * | 2013-08-09 | 2015-02-12 | Microsoft Corporation | Personalized content tagging |
US10769229B2 (en) * | 2016-04-14 | 2020-09-08 | Microsoft Technology Licensing, Llc | Separation of work and personal content |
CN105956016A (en) * | 2016-04-21 | 2016-09-21 | 成都数联铭品科技有限公司 | Associated information visualization processing system |
CN107463711B (en) * | 2017-08-22 | 2020-07-28 | 山东浪潮云服务信息科技有限公司 | Data tag matching method and device |
US10733243B2 (en) * | 2017-08-30 | 2020-08-04 | Microsoft Technology Licensing, Llc | Next generation similar profiles |
CN108241745B (en) * | 2018-01-08 | 2020-04-28 | 阿里巴巴集团控股有限公司 | Sample set processing method and device and sample query method and device |
CN109977318B (en) * | 2019-04-04 | 2021-06-29 | 掌阅科技股份有限公司 | Book searching method, electronic device and computer storage medium |
CN110704624B (en) * | 2019-09-30 | 2021-08-10 | 武汉大学 | Geographic information service metadata text multi-level multi-label classification method |
CN111125566B (en) * | 2019-12-11 | 2021-08-31 | 贝壳找房(北京)科技有限公司 | Information acquisition method and device, electronic equipment and storage medium |
CN112100506B (en) * | 2020-11-10 | 2021-03-16 | 中国电力科学研究院有限公司 | Information pushing method, system, equipment and storage medium |
CN116431686B (en) * | 2023-06-08 | 2023-09-01 | 成都航空职业技术学院 | Training data query method and system based on heterogeneous archives |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6070176A (en) * | 1997-01-30 | 2000-05-30 | Intel Corporation | Method and apparatus for graphically representing portions of the world wide web |
US6360215B1 (en) * | 1998-11-03 | 2002-03-19 | Inktomi Corporation | Method and apparatus for retrieving documents based on information other than document content |
US6718365B1 (en) * | 2000-04-13 | 2004-04-06 | International Business Machines Corporation | Method, system, and program for ordering search results using an importance weighting |
US6601075B1 (en) * | 2000-07-27 | 2003-07-29 | International Business Machines Corporation | System and method of ranking and retrieving documents based on authority scores of schemas and documents |
US6944609B2 (en) * | 2001-10-18 | 2005-09-13 | Lycos, Inc. | Search results using editor feedback |
US6983280B2 (en) * | 2002-09-13 | 2006-01-03 | Overture Services Inc. | Automated processing of appropriateness determination of content for search listings in wide area network searches |
US8666983B2 (en) * | 2003-06-13 | 2014-03-04 | Microsoft Corporation | Architecture for generating responses to search engine queries |
-
2006
- 2006-08-03 TW TW95128551A patent/TWI391834B/en active
- 2006-08-03 CN CN200680036981.9A patent/CN101283353B/en active Active
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI451273B (en) * | 2007-05-04 | 2014-09-01 | Microsoft Corp | Method, system, and computer readable medium for link spam detection using smooth classification function |
TWI411926B (en) * | 2009-01-05 | 2013-10-11 | Inventec Corp | Generating dynamic web pages system and method thereof |
Also Published As
Publication number | Publication date |
---|---|
TWI391834B (en) | 2013-04-01 |
CN101283353A (en) | 2008-10-08 |
CN101283353B (en) | 2015-11-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
TW200715152A (en) | Systems for and methods of finding relevant documents by analyzing tags | |
NZ578672A (en) | Information-retrieval systems, methods, and software with concept-based searching and ranking | |
CN105593851A (en) | A method and an apparatus for tracking microblog messages for relevancy to an entity identifiable by an associated text and an image | |
Copenheaver et al. | Lack of gender bias in citation rates of publications by dendrochronologists: What is unique about this discipline? | |
TW200636512A (en) | Methods of and systems for searching by incorporating user-entered information | |
Kim et al. | COLIEE-2015: evaluation of legal question answering | |
Hamed et al. | Measuring climate change on Twitter using Google’s algorithm: Perception and events | |
Zhang et al. | USTB at INEX2014: Social Book Search Track. | |
US10242090B1 (en) | Method and device for measuring relevancy of a document to a keyword(s) | |
Minack et al. | Current Approaches to Search Result Diversification. | |
Weren et al. | Exploring Information Retrieval Features for Author Profiling. | |
Koolen et al. | Social Book Search: The Impact of Professional and User-Generated Content on Book Suggestions. | |
Hafsi et al. | LaHC at INEX 2014: Social book search track | |
Potey et al. | A survey of query log processing techniques and evaluation of web query intent identification | |
Zhou et al. | Topic Categorization for Relevancy and Opinion Detection. | |
Hall | The impact of tourism knowledge: Google scholar, citations and the opening up of academic space | |
Weren et al. | Using simple content features for the author profiling task | |
Guardiola-Wanden-Berghe et al. | Medical subject headings versus American Psychological Association Index Terms: indexing eating disorders | |
Azpiazu et al. | Is readability a valuable signal for hashtag recommendations? | |
He et al. | Heuristic Ranking and Diversification of Web Documents. | |
Cao et al. | ICTNET at Microblog Track TREC 2011. | |
Asghar et al. | Finding correlation between content based features and the popularity of a celebrity on Twitter | |
Kamps et al. | Using anchor text, spam filtering and wikipedia for web search and entity ranking | |
Mejova et al. | TREC Blog and TREC Chem: A View from the Corn Fields. | |
Ryang et al. | Ranking book reviews based on user discussion |