BRPI0502189A - Método e sistema para classificação de documentos de um resultado de busca para aperfeiçoar riqueza de informação e diversidade - Google Patents
Método e sistema para classificação de documentos de um resultado de busca para aperfeiçoar riqueza de informação e diversidadeInfo
- Publication number
- BRPI0502189A BRPI0502189A BR0502189-8A BRPI0502189A BRPI0502189A BR PI0502189 A BRPI0502189 A BR PI0502189A BR PI0502189 A BRPI0502189 A BR PI0502189A BR PI0502189 A BRPI0502189 A BR PI0502189A
- Authority
- BR
- Brazil
- Prior art keywords
- documents
- search result
- diversity
- document
- information
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9536—Search customisation based on social or collaborative filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9538—Presentation of query results
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
"MéTODO E SISTEMA PARA CLASSIFICAçãO DE DOCUMENTOS DE UM RESULTADO DE BUSCA PARA APERFEIçOAR RIQUEZA DE INFORMAçãO E DIVERSIDADE". é descrito um método e sistema para classificar documentos de resultados de busca com base em riqueza de informação e diversidade de tópicos. Um sistema de classificação determina a riqueza de informação de cada documento dentro de um resultado de busca. O sistema de classificação agrupa documentos de um resultado de busca com base nos seus relacionamentos, significando que eles são direcionados para tópicos similares. O sistema de classificação classifica os documentos para garantir que os documentos de classificação mais alta podem incluir pelo menos um documento que cobre cada tópico, ou seja, um documento de cada um dos grupos. O sistema de classificação seleciona o documento de cada grupo que tem a riqueza de informação mais alta dos documentos dentro do grupo. Quando os documentos forem apresentados a um usuário em uma ordem de classificação, o usuário provavelmente encontrará na primeira página do resultado da busca os documentos que cobrem uma variedade de tópicos, em vez de apenas um único tópico popular.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/837,540 US7664735B2 (en) | 2004-04-30 | 2004-04-30 | Method and system for ranking documents of a search result to improve diversity and information richness |
Publications (1)
Publication Number | Publication Date |
---|---|
BRPI0502189A true BRPI0502189A (pt) | 2006-01-10 |
Family
ID=34939598
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
BR0502189-8A BRPI0502189A (pt) | 2004-04-30 | 2005-04-28 | Método e sistema para classificação de documentos de um resultado de busca para aperfeiçoar riqueza de informação e diversidade |
Country Status (10)
Country | Link |
---|---|
US (1) | US7664735B2 (pt) |
EP (1) | EP1591923A1 (pt) |
JP (1) | JP4845420B2 (pt) |
KR (1) | KR101130535B1 (pt) |
CN (1) | CN100573513C (pt) |
AU (1) | AU2005201824A1 (pt) |
BR (1) | BRPI0502189A (pt) |
CA (1) | CA2505904C (pt) |
MX (1) | MXPA05004681A (pt) |
RU (1) | RU2383922C2 (pt) |
Families Citing this family (68)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6560600B1 (en) * | 2000-10-25 | 2003-05-06 | Alta Vista Company | Method and apparatus for ranking Web page search results |
US7809548B2 (en) * | 2004-06-14 | 2010-10-05 | University Of North Texas | Graph-based ranking algorithms for text processing |
US20070073708A1 (en) * | 2005-09-28 | 2007-03-29 | Smith Adam D | Generation of topical subjects from alert search terms |
US20070094242A1 (en) * | 2005-10-26 | 2007-04-26 | John Dove | System and method for returning search results |
US20070112898A1 (en) * | 2005-11-15 | 2007-05-17 | Clairvoyance Corporation | Methods and apparatus for probe-based clustering |
US20070112867A1 (en) * | 2005-11-15 | 2007-05-17 | Clairvoyance Corporation | Methods and apparatus for rank-based response set clustering |
US8171128B2 (en) | 2006-08-11 | 2012-05-01 | Facebook, Inc. | Communicating a newsfeed of media content based on a member's interactions in a social network environment |
US7827208B2 (en) * | 2006-08-11 | 2010-11-02 | Facebook, Inc. | Generating a feed of stories personalized for members of a social network |
US7644074B2 (en) * | 2005-12-22 | 2010-01-05 | Microsoft Corporation | Search by document type and relevance |
US7814099B2 (en) * | 2006-01-31 | 2010-10-12 | Louis S. Wang | Method for ranking and sorting electronic documents in a search result list based on relevance |
US7818315B2 (en) * | 2006-03-13 | 2010-10-19 | Microsoft Corporation | Re-ranking search results based on query log |
US20080005137A1 (en) * | 2006-06-29 | 2008-01-03 | Microsoft Corporation | Incrementally building aspect models |
US9779441B1 (en) * | 2006-08-04 | 2017-10-03 | Facebook, Inc. | Method for relevancy ranking of products in online shopping |
US20080109435A1 (en) * | 2006-11-07 | 2008-05-08 | Bellsouth Intellectual Property Corporation | Determining Sort Order by Traffic Volume |
US8156112B2 (en) * | 2006-11-07 | 2012-04-10 | At&T Intellectual Property I, L.P. | Determining sort order by distance |
US8301621B2 (en) | 2006-11-07 | 2012-10-30 | At&T Intellectual Property I, L.P. | Topic map for navigational control |
US20080114750A1 (en) * | 2006-11-14 | 2008-05-15 | Microsoft Corporation | Retrieval and ranking of items utilizing similarity |
US7958126B2 (en) * | 2006-12-19 | 2011-06-07 | Yahoo! Inc. | Techniques for including collection items in search results |
US20080154878A1 (en) * | 2006-12-20 | 2008-06-26 | Rose Daniel E | Diversifying a set of items |
US20080215571A1 (en) * | 2007-03-01 | 2008-09-04 | Microsoft Corporation | Product review search |
US8117137B2 (en) * | 2007-04-19 | 2012-02-14 | Microsoft Corporation | Field-programmable gate array based accelerator system |
US8024327B2 (en) * | 2007-06-26 | 2011-09-20 | Endeca Technologies, Inc. | System and method for measuring the quality of document sets |
US8935249B2 (en) | 2007-06-26 | 2015-01-13 | Oracle Otc Subsidiary Llc | Visualization of concepts within a collection of information |
US8543380B2 (en) * | 2007-10-05 | 2013-09-24 | Fujitsu Limited | Determining a document specificity |
US20090094209A1 (en) * | 2007-10-05 | 2009-04-09 | Fujitsu Limited | Determining The Depths Of Words And Documents |
CN101855631B (zh) * | 2007-11-08 | 2016-06-29 | 上海惠普有限公司 | 用于聚焦爬行的导航排名 |
US8321406B2 (en) | 2008-03-31 | 2012-11-27 | Google Inc. | Media object query submission and response |
KR100926876B1 (ko) * | 2008-04-01 | 2009-11-16 | 엔에이치엔(주) | 랭크 발생 확률을 이용한 랭크 학습 모델 생성 방법 및랭크 학습 모델 생성 시스템 |
US20090287668A1 (en) * | 2008-05-16 | 2009-11-19 | Justsystems Evans Research, Inc. | Methods and apparatus for interactive document clustering |
JP5146108B2 (ja) * | 2008-05-27 | 2013-02-20 | 日本電気株式会社 | 文書重要度算出システム、文書重要度算出方法およびプログラム |
CN101625680B (zh) * | 2008-07-09 | 2012-08-29 | 东北大学 | 面向专利领域的文档检索方法 |
US8301638B2 (en) * | 2008-09-25 | 2012-10-30 | Microsoft Corporation | Automated feature selection based on rankboost for ranking |
US8131659B2 (en) * | 2008-09-25 | 2012-03-06 | Microsoft Corporation | Field-programmable gate array based accelerator system |
US9135396B1 (en) | 2008-12-22 | 2015-09-15 | Amazon Technologies, Inc. | Method and system for determining sets of variant items |
US8458171B2 (en) * | 2009-01-30 | 2013-06-04 | Google Inc. | Identifying query aspects |
US8533202B2 (en) | 2009-07-07 | 2013-09-10 | Yahoo! Inc. | Entropy-based mixing and personalization |
US8245135B2 (en) * | 2009-09-08 | 2012-08-14 | International Business Machines Corporation | Producing a visual summarization of text documents |
CN101650746B (zh) * | 2009-09-27 | 2011-06-29 | 中国电信股份有限公司 | 一种对排序结果进行验证的方法和系统 |
US8849807B2 (en) | 2010-05-25 | 2014-09-30 | Mark F. McLellan | Active search results page ranking technology |
US9240020B2 (en) | 2010-08-24 | 2016-01-19 | Yahoo! Inc. | Method of recommending content via social signals |
EP2568396A1 (en) * | 2011-09-08 | 2013-03-13 | Axel Springer Digital TV Guide GmbH | Method and apparatus for generating a sorted list of items |
US8838583B1 (en) * | 2011-10-05 | 2014-09-16 | Amazon Technologies, Inc | Diversity within search results |
US9075498B1 (en) * | 2011-12-22 | 2015-07-07 | Symantec Corporation | User interface for finding similar documents |
US9501566B1 (en) | 2012-01-17 | 2016-11-22 | Veritas Technologies Llc | User interface for transparent concept search |
JP6149434B2 (ja) * | 2012-04-10 | 2017-06-21 | 株式会社リコー | 情報処理装置、文書管理サーバ、プログラム、ファイルシステム |
US20140075282A1 (en) * | 2012-06-26 | 2014-03-13 | Rediff.Com India Limited | Method and apparatus for composing a representative description for a cluster of digital documents |
US9400789B2 (en) * | 2012-07-20 | 2016-07-26 | Google Inc. | Associating resources with entities |
US9536001B2 (en) * | 2012-11-13 | 2017-01-03 | Microsoft Technology Licensing, Llc | Intent-based presentation of search results |
US9129020B2 (en) | 2012-12-21 | 2015-09-08 | Microsoft Technology Licensing, Llc | Search results through interest circles |
CN103927545B (zh) * | 2014-03-14 | 2017-10-17 | 小米科技有限责任公司 | 聚类方法及相关装置 |
US9355227B2 (en) | 2014-06-30 | 2016-05-31 | Konica Minolta Laboratory U.S.A., Inc. | Dynamic document display personalization implemented in a digital rights management system |
US9992262B2 (en) * | 2014-07-29 | 2018-06-05 | Konica Minolta Laboratory U.S.A., Inc. | Personalized document content aggregation and document association implemented in a digital rights management system |
US9858251B2 (en) | 2014-08-14 | 2018-01-02 | Rakuten Kobo Inc. | Automatically generating customized annotation document from query search results and user interface thereof |
KR102243286B1 (ko) * | 2014-09-18 | 2021-04-22 | 경북대학교 산학협력단 | 데이터베이스 구축 방법, 이를 수행하기 위한 기록매체 |
CN104881798A (zh) * | 2015-06-05 | 2015-09-02 | 北京京东尚科信息技术有限公司 | 基于商品图像特征的个性化搜索装置及方法 |
US11392568B2 (en) | 2015-06-23 | 2022-07-19 | Microsoft Technology Licensing, Llc | Reducing matching documents for a search query |
US10242071B2 (en) | 2015-06-23 | 2019-03-26 | Microsoft Technology Licensing, Llc | Preliminary ranker for scoring matching documents |
US10467215B2 (en) * | 2015-06-23 | 2019-11-05 | Microsoft Technology Licensing, Llc | Matching documents using a bit vector search index |
US11281639B2 (en) | 2015-06-23 | 2022-03-22 | Microsoft Technology Licensing, Llc | Match fix-up to remove matching documents |
US10685029B2 (en) | 2015-11-23 | 2020-06-16 | Google Llc | Information ranking based on properties of a computing device |
GB2545931A (en) * | 2015-12-31 | 2017-07-05 | Francis Murphy Dominic | Defining edges and their weights between nodes in a network |
CN105955990A (zh) * | 2016-04-15 | 2016-09-21 | 北京理工大学 | 一种兼顾多样性和有效性的评论排序和筛选方法 |
RU2630427C2 (ru) * | 2016-08-12 | 2017-09-07 | Дмитрий Владимирович Мительков | Способ и система семантической обработки текстовых документов |
US10733359B2 (en) * | 2016-08-26 | 2020-08-04 | Adobe Inc. | Expanding input content utilizing previously-generated content |
GB2570447A (en) * | 2018-01-23 | 2019-07-31 | Canon Kk | Method and system for improving construction of regions of interest |
US11699094B2 (en) * | 2018-10-31 | 2023-07-11 | Salesforce, Inc. | Automatic feature selection and model generation for linear models |
US11328238B2 (en) * | 2019-04-01 | 2022-05-10 | Microsoft Technology Licensing, Llc | Preemptively surfacing relevant content within email |
CN110516062B (zh) * | 2019-08-26 | 2022-11-04 | 腾讯科技(深圳)有限公司 | 一种文档的搜索处理方法及装置 |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5020019A (en) * | 1989-05-29 | 1991-05-28 | Ricoh Company, Ltd. | Document retrieval system |
US5598557A (en) * | 1992-09-22 | 1997-01-28 | Caere Corporation | Apparatus and method for retrieving and grouping images representing text files based on the relevance of key words extracted from a selected file to the text files |
US5576954A (en) * | 1993-11-05 | 1996-11-19 | University Of Central Florida | Process for determination of text relevancy |
US5642502A (en) * | 1994-12-06 | 1997-06-24 | University Of Central Florida | Method and system for searching for relevant documents from a text database collection, using statistical ranking, relevancy feedback and small pieces of text |
US5870740A (en) * | 1996-09-30 | 1999-02-09 | Apple Computer, Inc. | System and method for improving the ranking of information retrieval results for short queries |
US6601075B1 (en) * | 2000-07-27 | 2003-07-29 | International Business Machines Corporation | System and method of ranking and retrieving documents based on authority scores of schemas and documents |
US20020194161A1 (en) * | 2001-04-12 | 2002-12-19 | Mcnamee J. Paul | Directed web crawler with machine learning |
CA2496567A1 (en) | 2002-09-16 | 2004-03-25 | The Trustees Of Columbia University In The City Of New York | System and method for document collection, grouping and summarization |
JP4356347B2 (ja) * | 2003-04-16 | 2009-11-04 | セイコーエプソン株式会社 | 文書抽出システム |
-
2004
- 2004-04-30 US US10/837,540 patent/US7664735B2/en not_active Expired - Fee Related
-
2005
- 2005-04-28 BR BR0502189-8A patent/BRPI0502189A/pt not_active IP Right Cessation
- 2005-04-29 MX MXPA05004681A patent/MXPA05004681A/es not_active Application Discontinuation
- 2005-04-29 EP EP05103553A patent/EP1591923A1/en not_active Withdrawn
- 2005-04-29 AU AU2005201824A patent/AU2005201824A1/en not_active Abandoned
- 2005-04-29 CA CA2505904A patent/CA2505904C/en not_active Expired - Fee Related
- 2005-04-29 RU RU2005113189/09A patent/RU2383922C2/ru not_active IP Right Cessation
- 2005-04-29 KR KR1020050036407A patent/KR101130535B1/ko not_active IP Right Cessation
- 2005-04-30 CN CNB2005100896477A patent/CN100573513C/zh not_active Expired - Fee Related
- 2005-05-02 JP JP2005134488A patent/JP4845420B2/ja not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CA2505904A1 (en) | 2005-10-30 |
RU2005113189A (ru) | 2006-11-10 |
US20050246328A1 (en) | 2005-11-03 |
JP2005322244A (ja) | 2005-11-17 |
CN1758244A (zh) | 2006-04-12 |
JP4845420B2 (ja) | 2011-12-28 |
MXPA05004681A (es) | 2006-03-08 |
AU2005201824A1 (en) | 2005-11-17 |
EP1591923A1 (en) | 2005-11-02 |
RU2383922C2 (ru) | 2010-03-10 |
KR101130535B1 (ko) | 2012-04-12 |
CA2505904C (en) | 2013-09-03 |
CN100573513C (zh) | 2009-12-23 |
KR20060047664A (ko) | 2006-05-18 |
US7664735B2 (en) | 2010-02-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BRPI0502189A (pt) | Método e sistema para classificação de documentos de um resultado de busca para aperfeiçoar riqueza de informação e diversidade | |
Clarke et al. | Overview of the TREC 2011 Web Track. | |
Jiang et al. | A ranking approach to keyphrase extraction | |
Clarke et al. | Overview of the TREC 2009 Web Track. | |
BRPI0503779A (pt) | métodos de indexação de documentos numa coleção de documentos | |
BR0103391A (pt) | Sistema de classificação de documentos eletrônicos | |
BR0017306A (pt) | Método e sistema para gerenciar informação eletrônica e dispositivo de computador | |
BRPI0503781A (pt) | métodos de identificação e de computação para identificação de frases relacionadas numa coleção de documentos e produto de programa de computador | |
Ho et al. | Asia. com: Asia encounters the Internet | |
Kyriakopoulou et al. | Using clustering to enhance text classification | |
Choi et al. | What is this song about anyway?: Automatic classification of subject using user interpretations and lyrics | |
Pulijala et al. | Hierarchical text classification | |
Yang et al. | Effectiveness of web page classification on finding list answers | |
Kumaran et al. | Biasing web search results for topic familiarity | |
Chandar et al. | Diversification of search results using webgraphs | |
Derman et al. | Democracy, development, and human rights in Zimbabwe: A contradictory terrain | |
FALAHATKAR | A co-evolutionary approach to graph coloring problem | |
Shou et al. | Experiments on data fusion using headline information | |
秦兵 et al. | Research on multi-document summarization based on latent semantic indexing | |
Grodzinski | The War of 1812: An Annotated Bibliography | |
Nomoto et al. | Conceptualizing documents with wikipedia | |
Okayama | A Transparent Dynamic Traffic Balancing on Multihomed Networks | |
Roussinov et al. | Discretization based learning approach to information retrieval | |
Sakai et al. | Evaluating retrieval performance for Japanese question answering: what are best passages? | |
Zhu et al. | Query classification using asymmetric learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
B03A | Publication of a patent application or of a certificate of addition of invention [chapter 3.1 patent gazette] | ||
B08F | Application dismissed because of non-payment of annual fees [chapter 8.6 patent gazette] |
Free format text: REFERENTE AS 6A E 7A ANUIDADES. |
|
B08K | Patent lapsed as no evidence of payment of the annual fee has been furnished to inpi [chapter 8.11 patent gazette] |
Free format text: REFERENTE AO DESPACHO 8.6 PUBLICADO NA RPI 2158 DE 15/05/2012. |
|
B15K | Others concerning applications: alteration of classification |
Free format text: PROCEDIMENTO AUTOMATICO DE RECLASSIFICACAO. A CLASSIFICACAO IPC ANTERIOR ERA G06F 17/60. Ipc: G06F 17/30 (2006.01) Ipc: G06F 17/30 (2006.01) |