TW200715152A - Systems for and methods of finding relevant documents by analyzing tags - Google Patents

Systems for and methods of finding relevant documents by analyzing tags

Info

Publication number
TW200715152A
TW200715152A TW095128551A TW95128551A TW200715152A TW 200715152 A TW200715152 A TW 200715152A TW 095128551 A TW095128551 A TW 095128551A TW 95128551 A TW95128551 A TW 95128551A TW 200715152 A TW200715152 A TW 200715152A
Authority
TW
Taiwan
Prior art keywords
tag
relevance
objects
algorithms
search query
Prior art date
Application number
TW095128551A
Other languages
Chinese (zh)
Other versions
TWI391834B (en
Inventor
yun-shan Lu
Michael Tanne
Original Assignee
Wink Technologies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wink Technologies Inc filed Critical Wink Technologies Inc
Publication of TW200715152A publication Critical patent/TW200715152A/en
Application granted granted Critical
Publication of TWI391834B publication Critical patent/TWI391834B/en

Links

Abstract

A method of determining relevancies of objects to a search query includes associating multiple tags with multiple objects, recording bookmarks to the multiple objects, or both, and determining a relevance score for each of the multiple objects and a search query. One embodiment of the method combines full-text relevance algorithms with tag relevance algorithms. Other embodiments include statistical relevance algorithms such as statistical classification or rank regression algorithms. When a user executes a search query, a results list containing the objects is returned, with the objects organized based on the relevance scores. The objects are organized by, for example, listing those with the highest relevance scores first or by marking them with an indication of their relevance. Preferably, relevance scores for a tag-object pair are based on a number of times a term in the tag has been associated with the object, a number of tags associated with the object, a number of times that the tag has been associated with the multiple objects, a number of tag-object pairs that contain a term in the tag, a number of tag-object pairs that contain a reference to the object, or any combination of these.
TW95128551A 2005-08-03 2006-08-03 Systems for and methods of finding relevant documents by analyzing tags TWI391834B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US70570405P 2005-08-03 2005-08-03

Publications (2)

Publication Number Publication Date
TW200715152A true TW200715152A (en) 2007-04-16
TWI391834B TWI391834B (en) 2013-04-01

Family

ID=40014928

Family Applications (1)

Application Number Title Priority Date Filing Date
TW95128551A TWI391834B (en) 2005-08-03 2006-08-03 Systems for and methods of finding relevant documents by analyzing tags

Country Status (2)

Country Link
CN (1) CN101283353B (en)
TW (1) TWI391834B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI411926B (en) * 2009-01-05 2013-10-11 Inventec Corp Generating dynamic web pages system and method thereof
TWI451273B (en) * 2007-05-04 2014-09-01 Microsoft Corp Method, system, and computer readable medium for link spam detection using smooth classification function

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101593187B (en) * 2008-05-30 2012-05-30 国际商业机器公司 Method and system for managing book marks
WO2011022867A1 (en) * 2009-08-24 2011-03-03 Hewlett-Packard Development Company, L.P. Method and apparatus for searching electronic documents
US20120002884A1 (en) * 2010-06-30 2012-01-05 Alcatel-Lucent Usa Inc. Method and apparatus for managing video content
US8880517B2 (en) * 2011-02-18 2014-11-04 Microsoft Corporation Propagating signals across a web graph
US10402407B2 (en) * 2013-06-17 2019-09-03 Lenovo (Singapore) Pte. Ltd. Contextual smart tags for content retrieval
US20150046418A1 (en) * 2013-08-09 2015-02-12 Microsoft Corporation Personalized content tagging
US10769229B2 (en) * 2016-04-14 2020-09-08 Microsoft Technology Licensing, Llc Separation of work and personal content
CN105956016A (en) * 2016-04-21 2016-09-21 成都数联铭品科技有限公司 Associated information visualization processing system
CN107463711B (en) * 2017-08-22 2020-07-28 山东浪潮云服务信息科技有限公司 Data tag matching method and device
US10733243B2 (en) * 2017-08-30 2020-08-04 Microsoft Technology Licensing, Llc Next generation similar profiles
CN108241745B (en) * 2018-01-08 2020-04-28 阿里巴巴集团控股有限公司 Sample set processing method and device and sample query method and device
CN109977318B (en) * 2019-04-04 2021-06-29 掌阅科技股份有限公司 Book searching method, electronic device and computer storage medium
CN110704624B (en) * 2019-09-30 2021-08-10 武汉大学 Geographic information service metadata text multi-level multi-label classification method
CN111125566B (en) * 2019-12-11 2021-08-31 贝壳找房(北京)科技有限公司 Information acquisition method and device, electronic equipment and storage medium
CN112100506B (en) * 2020-11-10 2021-03-16 中国电力科学研究院有限公司 Information pushing method, system, equipment and storage medium
CN116431686B (en) * 2023-06-08 2023-09-01 成都航空职业技术学院 Training data query method and system based on heterogeneous archives

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6070176A (en) * 1997-01-30 2000-05-30 Intel Corporation Method and apparatus for graphically representing portions of the world wide web
US6360215B1 (en) * 1998-11-03 2002-03-19 Inktomi Corporation Method and apparatus for retrieving documents based on information other than document content
US6718365B1 (en) * 2000-04-13 2004-04-06 International Business Machines Corporation Method, system, and program for ordering search results using an importance weighting
US6601075B1 (en) * 2000-07-27 2003-07-29 International Business Machines Corporation System and method of ranking and retrieving documents based on authority scores of schemas and documents
US6944609B2 (en) * 2001-10-18 2005-09-13 Lycos, Inc. Search results using editor feedback
US6983280B2 (en) * 2002-09-13 2006-01-03 Overture Services Inc. Automated processing of appropriateness determination of content for search listings in wide area network searches
US8666983B2 (en) * 2003-06-13 2014-03-04 Microsoft Corporation Architecture for generating responses to search engine queries

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI451273B (en) * 2007-05-04 2014-09-01 Microsoft Corp Method, system, and computer readable medium for link spam detection using smooth classification function
TWI411926B (en) * 2009-01-05 2013-10-11 Inventec Corp Generating dynamic web pages system and method thereof

Also Published As

Publication number Publication date
TWI391834B (en) 2013-04-01
CN101283353A (en) 2008-10-08
CN101283353B (en) 2015-11-25

Similar Documents

Publication Publication Date Title
TW200715152A (en) Systems for and methods of finding relevant documents by analyzing tags
NZ578672A (en) Information-retrieval systems, methods, and software with concept-based searching and ranking
CN105593851A (en) A method and an apparatus for tracking microblog messages for relevancy to an entity identifiable by an associated text and an image
Copenheaver et al. Lack of gender bias in citation rates of publications by dendrochronologists: What is unique about this discipline?
TW200636512A (en) Methods of and systems for searching by incorporating user-entered information
Kim et al. COLIEE-2015: evaluation of legal question answering
Hamed et al. Measuring climate change on Twitter using Google’s algorithm: Perception and events
Zhang et al. USTB at INEX2014: Social Book Search Track.
US10242090B1 (en) Method and device for measuring relevancy of a document to a keyword(s)
Minack et al. Current Approaches to Search Result Diversification.
Weren et al. Exploring Information Retrieval Features for Author Profiling.
Koolen et al. Social Book Search: The Impact of Professional and User-Generated Content on Book Suggestions.
Hafsi et al. LaHC at INEX 2014: Social book search track
Potey et al. A survey of query log processing techniques and evaluation of web query intent identification
Zhou et al. Topic Categorization for Relevancy and Opinion Detection.
Hall The impact of tourism knowledge: Google scholar, citations and the opening up of academic space
Weren et al. Using simple content features for the author profiling task
Guardiola-Wanden-Berghe et al. Medical subject headings versus American Psychological Association Index Terms: indexing eating disorders
Azpiazu et al. Is readability a valuable signal for hashtag recommendations?
He et al. Heuristic Ranking and Diversification of Web Documents.
Cao et al. ICTNET at Microblog Track TREC 2011.
Asghar et al. Finding correlation between content based features and the popularity of a celebrity on Twitter
Kamps et al. Using anchor text, spam filtering and wikipedia for web search and entity ranking
Mejova et al. TREC Blog and TREC Chem: A View from the Corn Fields.
Ryang et al. Ranking book reviews based on user discussion