BR0204257A - Métodos, sistemas e artigos de fabricação para grupamento hierárquico temporário de objetos coocorrentes - Google Patents
Métodos, sistemas e artigos de fabricação para grupamento hierárquico temporário de objetos coocorrentesInfo
- Publication number
- BR0204257A BR0204257A BR0204257-6A BR0204257A BR0204257A BR 0204257 A BR0204257 A BR 0204257A BR 0204257 A BR0204257 A BR 0204257A BR 0204257 A BR0204257 A BR 0204257A
- Authority
- BR
- Brazil
- Prior art keywords
- articles
- systems
- methods
- hierarchy
- document
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
- G06F16/355—Class or cluster creation or modification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/231—Hierarchical techniques, i.e. dividing or merging pattern sets so as to obtain a dendrogram
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/762—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using clustering, e.g. of similar faces in social networks
- G06V10/7625—Hierarchical techniques, i.e. dividing or merging patterns to obtain a tree-like representation; Dendograms
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99944—Object-oriented database structure
- Y10S707/99945—Object-oriented database structure processing
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Databases & Information Systems (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Probability & Statistics with Applications (AREA)
- Multimedia (AREA)
- Software Systems (AREA)
- Medical Informatics (AREA)
- Computing Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
"MéTODOS, SISTEMAS E ARTIGOS DE FABRICAçãO PARA GRUPAMENTO HIERáRQUICO TEMPORáRIO DE OBJETOS COOCORRENTES". Métodos, sistemas e artigos de fabricação consistentes com determinados princípios relacionados à presente invenção possibilitam que um sistema de computação execute grupamento hierárquico topical de dados de texto em função de modelagem estatística de coocorrências de pares (documento, palavra). O sistema de computação pode ser configurado para receber uma coleção de documentos, cada documento incluindo uma pluralidade de palavras, e executar um processo de Expectativa-Maximização (EM) modificado de recozimento determinístico na coleção para produzir uma hierarquia de nós atribuída temporariamente ("softly"). O processo pode envolver atribuir documentos e fragmentos de documentos a múltiplos nós na hierarquia baseada em palavras incluídas na hierarquia, com isto eliminando a atribuição rígida de documentos na hierarquia.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/982,236 US7644102B2 (en) | 2001-10-19 | 2001-10-19 | Methods, systems, and articles of manufacture for soft hierarchical clustering of co-occurring objects |
Publications (2)
Publication Number | Publication Date |
---|---|
BR0204257A true BR0204257A (pt) | 2003-09-16 |
BRPI0204257B1 BRPI0204257B1 (pt) | 2016-05-17 |
Family
ID=25528969
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
BRPI0204257A BRPI0204257B1 (pt) | 2001-10-19 | 2002-10-18 | método implementado por computador para agrupar uma pluralidade de documentos em uma estrutura hierárquica de dados, método implementado por computador para agrupar dados que refletem usuários em uma estrutura hierárquica de dados e método implementado por computador para agrupar uma pluralidade de imagens baseadas em texto associado às imagens em uma estrutura hierárquica de dados |
Country Status (4)
Country | Link |
---|---|
US (1) | US7644102B2 (pt) |
EP (1) | EP1304627B1 (pt) |
JP (1) | JP4384398B2 (pt) |
BR (1) | BRPI0204257B1 (pt) |
Families Citing this family (51)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7035864B1 (en) * | 2000-05-18 | 2006-04-25 | Endeca Technologies, Inc. | Hierarchical data-driven navigation system and method for information retrieval |
US7617184B2 (en) * | 2000-05-18 | 2009-11-10 | Endeca Technologies, Inc. | Scalable hierarchical data-driven navigation system and method for information retrieval |
US7831467B1 (en) * | 2000-10-17 | 2010-11-09 | Jpmorgan Chase Bank, N.A. | Method and system for retaining customer loyalty |
US8090717B1 (en) * | 2002-09-20 | 2012-01-03 | Google Inc. | Methods and apparatus for ranking documents |
US20040117366A1 (en) * | 2002-12-12 | 2004-06-17 | Ferrari Adam J. | Method and system for interpreting multiple-term queries |
US7395256B2 (en) | 2003-06-20 | 2008-07-01 | Agency For Science, Technology And Research | Method and platform for term extraction from large collection of documents |
US8175908B1 (en) | 2003-09-04 | 2012-05-08 | Jpmorgan Chase Bank, N.A. | Systems and methods for constructing and utilizing a merchant database derived from customer purchase transactions data |
US20070244690A1 (en) * | 2003-11-21 | 2007-10-18 | Koninklijke Philips Electronic, N.V. | Clustering of Text for Structuring of Text Documents and Training of Language Models |
JP4637113B2 (ja) * | 2003-11-28 | 2011-02-23 | キヤノン株式会社 | 階層データの好ましいビューを構築するための方法 |
US7139754B2 (en) * | 2004-02-09 | 2006-11-21 | Xerox Corporation | Method for multi-class, multi-label categorization using probabilistic hierarchical modeling |
US7457808B2 (en) * | 2004-12-17 | 2008-11-25 | Xerox Corporation | Method and apparatus for explaining categorization decisions |
US7630980B2 (en) * | 2005-01-21 | 2009-12-08 | Prashant Parikh | Automatic dynamic contextual data entry completion system |
US7672830B2 (en) * | 2005-02-22 | 2010-03-02 | Xerox Corporation | Apparatus and methods for aligning words in bilingual sentences |
US20070050388A1 (en) * | 2005-08-25 | 2007-03-01 | Xerox Corporation | Device and method for text stream mining |
US8019752B2 (en) * | 2005-11-10 | 2011-09-13 | Endeca Technologies, Inc. | System and method for information retrieval from object collections with complex interrelationships |
US7720848B2 (en) * | 2006-03-29 | 2010-05-18 | Xerox Corporation | Hierarchical clustering with real-time updating |
US8676802B2 (en) * | 2006-11-30 | 2014-03-18 | Oracle Otc Subsidiary Llc | Method and system for information retrieval with clustering |
US20080140707A1 (en) * | 2006-12-11 | 2008-06-12 | Yahoo! Inc. | System and method for clustering using indexes |
US7711747B2 (en) * | 2007-04-06 | 2010-05-04 | Xerox Corporation | Interactive cleaning for automatic document clustering and categorization |
US8108392B2 (en) | 2007-10-05 | 2012-01-31 | Fujitsu Limited | Identifying clusters of words according to word affinities |
US9317593B2 (en) * | 2007-10-05 | 2016-04-19 | Fujitsu Limited | Modeling topics using statistical distributions |
US8543380B2 (en) | 2007-10-05 | 2013-09-24 | Fujitsu Limited | Determining a document specificity |
US7856434B2 (en) | 2007-11-12 | 2010-12-21 | Endeca Technologies, Inc. | System and method for filtering rules for manipulating search results in a hierarchical search and navigation system |
US8189930B2 (en) * | 2008-07-17 | 2012-05-29 | Xerox Corporation | Categorizer with user-controllable calibration |
JP4636141B2 (ja) * | 2008-08-28 | 2011-02-23 | ソニー株式会社 | 情報処理装置および方法、並びにプログラム |
US8126891B2 (en) * | 2008-10-21 | 2012-02-28 | Microsoft Corporation | Future data event prediction using a generative model |
US8386437B2 (en) * | 2009-04-02 | 2013-02-26 | Xerox Corporation | Apparatus and method for document collection and filtering |
US8339680B2 (en) | 2009-04-02 | 2012-12-25 | Xerox Corporation | Printer image log system for document gathering and retention |
US8165974B2 (en) | 2009-06-08 | 2012-04-24 | Xerox Corporation | System and method for assisted document review |
WO2011004529A1 (ja) * | 2009-07-06 | 2011-01-13 | 日本電気株式会社 | 分類階層再作成システム、分類階層再作成方法及び分類階層再作成プログラム |
US8566349B2 (en) | 2009-09-28 | 2013-10-22 | Xerox Corporation | Handwritten document categorizer and method of training |
EP2488970A4 (en) * | 2009-10-15 | 2016-03-16 | Rogers Comm Tnc | SYSTEM AND METHOD FOR CLASSIFYING MULTIPLE DATA STREAMS |
US8356045B2 (en) * | 2009-12-09 | 2013-01-15 | International Business Machines Corporation | Method to identify common structures in formatted text documents |
US8407228B1 (en) * | 2010-03-26 | 2013-03-26 | Cadence Design Systems, Inc | Method and mechanism for maintaining existence information for electronic layout data |
US8509537B2 (en) | 2010-08-05 | 2013-08-13 | Xerox Corporation | Learning weights of fonts for typed samples in handwritten keyword spotting |
WO2013133844A1 (en) | 2012-03-08 | 2013-09-12 | New Jersey Institute Of Technology | Image retrieval and authentication using enhanced expectation maximization (eem) |
WO2013142852A1 (en) * | 2012-03-23 | 2013-09-26 | Sententia, LLC | Method and systems for text enhancement |
US8880525B2 (en) | 2012-04-02 | 2014-11-04 | Xerox Corporation | Full and semi-batch clustering |
US9189473B2 (en) | 2012-05-18 | 2015-11-17 | Xerox Corporation | System and method for resolving entity coreference |
US9569327B2 (en) | 2012-10-03 | 2017-02-14 | Xerox Corporation | System and method for labeling alert messages from devices for automated management |
US8930181B2 (en) | 2012-12-06 | 2015-01-06 | Prashant Parikh | Automatic dynamic contextual data entry completion |
US9639881B2 (en) * | 2013-05-20 | 2017-05-02 | TCL Research America Inc. | Method and system for personalized video recommendation based on user interests modeling |
US20150127323A1 (en) * | 2013-11-04 | 2015-05-07 | Xerox Corporation | Refining inference rules with temporal event clustering |
US9483738B2 (en) * | 2014-01-17 | 2016-11-01 | Hulu, LLC | Topic model based media program genome generation |
US9992209B1 (en) * | 2016-04-22 | 2018-06-05 | Awake Security, Inc. | System and method for characterizing security entities in a computing environment |
US10997231B2 (en) | 2019-01-17 | 2021-05-04 | International Business Machines Corporation | Image-based ontology refinement using clusters |
CN110377823A (zh) * | 2019-06-28 | 2019-10-25 | 厦门美域中央信息科技有限公司 | 一种Hadoop框架下的热点挖掘系统的构建 |
US11675766B1 (en) | 2020-03-03 | 2023-06-13 | Amazon Technologies, Inc. | Scalable hierarchical clustering |
US11514321B1 (en) | 2020-06-12 | 2022-11-29 | Amazon Technologies, Inc. | Artificial intelligence system using unsupervised transfer learning for intra-cluster analysis |
US11423072B1 (en) | 2020-07-31 | 2022-08-23 | Amazon Technologies, Inc. | Artificial intelligence system employing multimodal learning for analyzing entity record relationships |
US11620558B1 (en) | 2020-08-25 | 2023-04-04 | Amazon Technologies, Inc. | Iterative machine learning based techniques for value-based defect analysis in large data sets |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3669016B2 (ja) | 1994-09-30 | 2005-07-06 | 株式会社日立製作所 | 文書情報分類装置 |
US6460036B1 (en) * | 1994-11-29 | 2002-10-01 | Pinpoint Incorporated | System and method for providing customized electronic newspapers and target advertisements |
US5761418A (en) * | 1995-01-17 | 1998-06-02 | Nippon Telegraph And Telephone Corp. | Information navigation system using clusterized information resource topology |
US5864855A (en) * | 1996-02-26 | 1999-01-26 | The United States Of America As Represented By The Secretary Of The Army | Parallel document clustering process |
EP1486891A3 (en) * | 1997-02-12 | 2005-03-09 | Kokusai Denshin Denwa Co., Ltd | Document retrieval apparatus |
JPH10228486A (ja) * | 1997-02-14 | 1998-08-25 | Nec Corp | 分散ドキュメント分類システム及びプログラムを記録した機械読み取り可能な記録媒体 |
US5819258A (en) * | 1997-03-07 | 1998-10-06 | Digital Equipment Corporation | Method and apparatus for automatically generating hierarchical categories from large document collections |
US6154213A (en) * | 1997-05-30 | 2000-11-28 | Rennison; Earl F. | Immersive movement-based interaction with large complex information structures |
US6233575B1 (en) * | 1997-06-24 | 2001-05-15 | International Business Machines Corporation | Multilevel taxonomy based on features derived from training documents classification using fisher values as discrimination values |
US6742003B2 (en) * | 2001-04-30 | 2004-05-25 | Microsoft Corporation | Apparatus and accompanying methods for visualizing clusters of data and hierarchical cluster classifications |
US6556958B1 (en) * | 1999-04-23 | 2003-04-29 | Microsoft Corporation | Fast clustering with sparse data |
US6460025B1 (en) * | 1999-07-27 | 2002-10-01 | International Business Machines Corporation | Intelligent exploration through multiple hierarchies using entity relevance |
US20020129038A1 (en) * | 2000-12-18 | 2002-09-12 | Cunningham Scott Woodroofe | Gaussian mixture models in a data mining system |
US7039638B2 (en) * | 2001-04-27 | 2006-05-02 | Hewlett-Packard Development Company, L.P. | Distributed data clustering system and method |
-
2001
- 2001-10-19 US US09/982,236 patent/US7644102B2/en not_active Expired - Fee Related
-
2002
- 2002-10-15 JP JP2002300829A patent/JP4384398B2/ja not_active Expired - Fee Related
- 2002-10-18 EP EP02023413.4A patent/EP1304627B1/en not_active Expired - Fee Related
- 2002-10-18 BR BRPI0204257A patent/BRPI0204257B1/pt not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
EP1304627A3 (en) | 2007-03-07 |
JP2003140942A (ja) | 2003-05-16 |
US20030101187A1 (en) | 2003-05-29 |
US7644102B2 (en) | 2010-01-05 |
JP4384398B2 (ja) | 2009-12-16 |
EP1304627B1 (en) | 2014-04-02 |
EP1304627A2 (en) | 2003-04-23 |
BRPI0204257B1 (pt) | 2016-05-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BR0204257A (pt) | Métodos, sistemas e artigos de fabricação para grupamento hierárquico temporário de objetos coocorrentes | |
Ide et al. | GrAF: A graph-based format for linguistic annotations | |
Cordeiro et al. | The Beta‐Half‐Cauchy Distribution | |
BRPI0410112A (pt) | método, sistema e produto de programa de computador para mapeamento de dados de exibição | |
Giannakopoulos et al. | AutoSummENG and MeMoG in Evaluating Guided Summaries. | |
BR0111691A (pt) | Método de formação de perfil interativo de uma estrutura | |
BRPI0503785A (pt) | sistema eletrÈnico de permuta de dados | |
RU2004129675A (ru) | Система для идентификации перефразирования с использованием технологии машинного перевода | |
BR0316728A (pt) | Sistema e método para criar, agregar, e transferir reduções de emissões ambientais | |
BR0306695A (pt) | Processamento de tinta eletrônica | |
Gul et al. | Qualitative Analysis of Implicit Dirichlet Boundary Value Problem for Caputo‐Fabrizio Fractional Differential Equations | |
BR0316335A (pt) | Processo para a geração de uma corrente de bits a partir de uma árvore de indexação | |
ATE345533T1 (de) | System und verfahren in einer datentabelle um rekursive, skalierbare schabloneninstanzen herzustellen | |
Ferrone et al. | Towards syntax-aware compositional distributional semantic models | |
Daciuk | Comparison of construction algorithms for minimal, acyclic, deterministic, finite-state automata from sets of strings | |
Van Eynde et al. | Number agreement in copular constructions: A treebank-based investigation | |
Georgakopoulos et al. | Framing the difference between Sources and Goals in Change of Possession events: A corpus-based study in German and Modern Greek | |
CN104699666A (zh) | 基于近邻传播模型从图书目录中学习层次结构的方法 | |
BR112023003044A2 (pt) | Agendamento vinculado à memória | |
MY135390A (en) | Methods and systems for screening chinese address data | |
Pakray et al. | Transliterated search system for Indian languages | |
BR0009354A (pt) | Planejamento de processo para fabricação distribuìda e conserto | |
Yoshizumi et al. | A graph grammar for entity relationship diagrams | |
Bağrıaçık et al. | Greek and Turkish influences in the clausal complements of Cunda Turkish | |
Rychlikowski et al. | Named entity recognition and linking augmented with large-scale structured data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
B07A | Application suspended after technical examination (opinion) [chapter 7.1 patent gazette] | ||
B06A | Patent application procedure suspended [chapter 6.1 patent gazette] | ||
B09A | Decision: intention to grant [chapter 9.1 patent gazette] | ||
B16A | Patent or certificate of addition of invention granted [chapter 16.1 patent gazette] |
Free format text: PRAZO DE VALIDADE: 10 (DEZ) ANOS CONTADOS A PARTIR DE 17/05/2016, OBSERVADAS AS CONDICOES LEGAIS. |
|
B21F | Lapse acc. art. 78, item iv - on non-payment of the annual fees in time |
Free format text: REFERENTE A 20A ANUIDADE. |
|
B24J | Lapse because of non-payment of annual fees (definitively: art 78 iv lpi, resolution 113/2013 art. 12) |
Free format text: EM VIRTUDE DA EXTINCAO PUBLICADA NA RPI 2692 DE 09-08-2022 E CONSIDERANDO AUSENCIA DE MANIFESTACAO DENTRO DOS PRAZOS LEGAIS, INFORMO QUE CABE SER MANTIDA A EXTINCAO DA PATENTE E SEUS CERTIFICADOS, CONFORME O DISPOSTO NO ARTIGO 12, DA RESOLUCAO 113/2013. |