AR069933A1 - Sistemas, metodos y software para resoluc ion de bases de datos por relaciones de entidades (erd) - Google Patents

Sistemas, metodos y software para resoluc ion de bases de datos por relaciones de entidades (erd)

Info

Publication number
AR069933A1
AR069933A1 ARP080105667A ARP080105667A AR069933A1 AR 069933 A1 AR069933 A1 AR 069933A1 AR P080105667 A ARP080105667 A AR P080105667A AR P080105667 A ARP080105667 A AR P080105667A AR 069933 A1 AR069933 A1 AR 069933A1
Authority
AR
Argentina
Prior art keywords
records
entities
public
candidate
resolution
Prior art date
Application number
ARP080105667A
Other languages
English (en)
Inventor
Jack G Conrad
Christopher C Dozier
Harsha Veeramachaneni
Original Assignee
Thomson Reuters Glo Resources
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Reuters Glo Resources filed Critical Thomson Reuters Glo Resources
Publication of AR069933A1 publication Critical patent/AR069933A1/es

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2471Distributed queries

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Fuzzy Systems (AREA)
  • Quality & Reliability (AREA)
  • Mathematical Physics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Storage Device Security (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Para facilitar el acceso a registros publicos, los inventores idearon, entre otras cosas, un sistema de resolucion de entidades. El sistema ejemplificativo incluye la base de datos de registros maestros de 300 millones de entidades, que está particionada en multiples porciones diferenciadas. El sistema ejemplificativo extrae informacion sobre entidades de los registros publicos ingresados y construye una o más consultas en bloque respecto de porciones específicas de la base de datos de registros maestros para identificar uno o más conjuntos de registros de candidatos. Se definen vectores de características para los registros de candidatos y se usan técnicas de instruccion automática, tales como la máquina de vectores de apoyo (SVM), para determinar cuáles de los registros candidatos de la base de datos de registros maestros coinciden con los registros publicos ingresados. Los registros de candidatos que coinciden están logicamente asociados con los registros publicos, lo cual posibilita el fácil acceso por medio de consultas directas o indirectas.
ARP080105667A 2007-12-21 2008-12-22 Sistemas, metodos y software para resoluc ion de bases de datos por relaciones de entidades (erd) AR069933A1 (es)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US889107P 2007-12-21 2007-12-21

Publications (1)

Publication Number Publication Date
AR069933A1 true AR069933A1 (es) 2010-03-03

Family

ID=40428450

Family Applications (1)

Application Number Title Priority Date Filing Date
ARP080105667A AR069933A1 (es) 2007-12-21 2008-12-22 Sistemas, metodos y software para resoluc ion de bases de datos por relaciones de entidades (erd)

Country Status (5)

Country Link
US (3) US9600509B2 (es)
EP (2) EP2631822A1 (es)
AR (1) AR069933A1 (es)
CA (1) CA2710427C (es)
WO (1) WO2009086311A1 (es)

Families Citing this family (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2631822A1 (en) 2007-12-21 2013-08-28 Thomson Reuters Global Resources Systems, methods, and software for entity relationship resolution
US9454606B2 (en) * 2009-09-11 2016-09-27 Lexisnexis Risk & Information Analytics Group Inc. Technique for providing supplemental internet search criteria
US8260763B2 (en) * 2010-01-15 2012-09-04 Hewlett-Packard Devlopment Company, L.P. Matching service entities with candidate resources
US8352460B2 (en) * 2010-03-29 2013-01-08 International Business Machines Corporation Multiple candidate selection in an entity resolution system
US8719267B2 (en) * 2010-04-19 2014-05-06 Alcatel Lucent Spectral neighborhood blocking for entity resolution
US8918393B2 (en) 2010-09-29 2014-12-23 International Business Machines Corporation Identifying a set of candidate entities for an identity record
US8498998B2 (en) 2010-10-11 2013-07-30 International Business Machines Corporation Grouping identity records to generate candidate lists to use in an entity and relationship resolution process
US8843501B2 (en) * 2011-02-18 2014-09-23 International Business Machines Corporation Typed relevance scores in an identity resolution system
US8635197B2 (en) 2011-02-28 2014-01-21 International Business Machines Corporation Systems and methods for efficient development of a rule-based system using crowd-sourcing
US9619494B2 (en) * 2011-05-25 2017-04-11 Qatar Foundation Scalable automatic data repair
US10417263B2 (en) * 2011-06-03 2019-09-17 Robert Mack Method and apparatus for implementing a set of integrated data systems
CN102831127B (zh) * 2011-06-17 2015-04-22 阿里巴巴集团控股有限公司 重复数据处理方法、装置及系统
US8972387B2 (en) 2011-07-28 2015-03-03 International Business Machines Corporation Smarter search
US8965848B2 (en) * 2011-08-24 2015-02-24 International Business Machines Corporation Entity resolution based on relationships to a common entity
US11507548B1 (en) 2011-09-21 2022-11-22 Amazon Technologies, Inc. System and method for generating a classification model with a cost function having different penalties for false positives and false negatives
US9047278B1 (en) 2012-11-09 2015-06-02 Google Inc. Identifying and ranking attributes of entities
US20140164378A1 (en) * 2012-12-11 2014-06-12 International Business Machines Corporation Source record management for master data
WO2014149555A1 (en) * 2013-03-15 2014-09-25 Thomson Reuters Global Resources Method and system for generating and using a master entity associative data network
US20150012530A1 (en) * 2013-07-05 2015-01-08 Accenture Global Services Limited Determining an emergent identity over time
US9922290B2 (en) 2014-08-12 2018-03-20 Microsoft Technology Licensing, Llc Entity resolution incorporating data from various data sources which uses tokens and normalizes records
US20160132830A1 (en) * 2014-11-12 2016-05-12 Adp, Llc Multi-level score based title engine
EP3289483A1 (en) 2015-05-01 2018-03-07 Entit Software LLC Secure multi-party information retrieval
WO2017017533A1 (en) 2015-06-11 2017-02-02 Thomson Reuters Global Resources Risk identification and risk register generation system and engine
US10997134B2 (en) * 2015-06-18 2021-05-04 Aware, Inc. Automatic entity resolution with rules detection and generation system
US20170083820A1 (en) * 2015-09-21 2017-03-23 International Business Machines Corporation Posterior probabilistic model for bucketing records
US10783268B2 (en) 2015-11-10 2020-09-22 Hewlett Packard Enterprise Development Lp Data allocation based on secure information retrieval
EP3507721B1 (en) 2016-09-02 2022-11-23 FutureVault Inc. Real-time document filtering systems and methods
US11080301B2 (en) 2016-09-28 2021-08-03 Hewlett Packard Enterprise Development Lp Storage allocation based on secure data comparisons via multiple intermediaries
US20190207946A1 (en) * 2016-12-20 2019-07-04 Google Inc. Conditional provision of access by interactive assistant modules
US10127227B1 (en) 2017-05-15 2018-11-13 Google Llc Providing access to user-controlled resources by automated assistants
US11436417B2 (en) 2017-05-15 2022-09-06 Google Llc Providing access to user-controlled resources by automated assistants
US20190130050A1 (en) * 2017-10-31 2019-05-02 Sap Se Dynamically generating normalized master data
US11501111B2 (en) 2018-04-06 2022-11-15 International Business Machines Corporation Learning models for entity resolution using active learning
JP2021529385A (ja) * 2018-06-25 2021-10-28 セールスフォース ドット コム インコーポレイティッド エンティティー間の関係の調査するためのシステム及び方法
EP3937030A1 (en) 2018-08-07 2022-01-12 Google LLC Assembling and evaluating automated assistant responses for privacy concerns
US11875253B2 (en) 2019-06-17 2024-01-16 International Business Machines Corporation Low-resource entity resolution with transfer learning
US11281640B2 (en) 2019-07-02 2022-03-22 Walmart Apollo, Llc Systems and methods for interleaving search results
US11556845B2 (en) * 2019-08-29 2023-01-17 International Business Machines Corporation System for identifying duplicate parties using entity resolution
US11544477B2 (en) 2019-08-29 2023-01-03 International Business Machines Corporation System for identifying duplicate parties using entity resolution
US11474983B2 (en) 2020-07-13 2022-10-18 International Business Machines Corporation Entity resolution of master data using qualified relationship score
JP2023548532A (ja) * 2020-11-03 2023-11-17 ライブランプ インコーポレーテッド アイデンティティ・グラフ・データ構造におけるアサートされた関係マッチング
KR102608736B1 (ko) * 2020-12-15 2023-12-01 주식회사 포티투마루 질의에 대한 문서 검색 방법 및 장치
US11501075B1 (en) * 2021-07-01 2022-11-15 Fmr Llc Systems and methods for data extraction using proximity co-referencing

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5974420A (en) * 1998-01-27 1999-10-26 International Business Machines Corporation Information exchange operator for a tuplespace
US6658412B1 (en) 1999-06-30 2003-12-02 Educational Testing Service Computer-based method and system for linking records in data files
US7152060B2 (en) * 2002-04-11 2006-12-19 Choicemaker Technologies, Inc. Automated database blocking and record matching
US7155039B1 (en) * 2002-12-18 2006-12-26 Motorola, Inc. Automatic fingerprint identification system and method
US7403942B1 (en) * 2003-02-04 2008-07-22 Seisint, Inc. Method and system for processing data records
US7783614B2 (en) * 2003-02-13 2010-08-24 Microsoft Corporation Linking elements of a document to corresponding fields, queries and/or procedures in a database
US7805299B2 (en) * 2004-03-01 2010-09-28 Coifman Robert E Method and apparatus for improving the transcription accuracy of speech recognition software
US20040243588A1 (en) 2003-05-29 2004-12-02 Thomas Tanner Systems and methods for administering a global information database
US7624126B2 (en) * 2003-06-25 2009-11-24 Microsoft Corporation Registering for and retrieving database table change information that can be used to invalidate cache entries
US20050273346A1 (en) * 2004-06-02 2005-12-08 Frost Richard N Real property information management system and method
US20050273453A1 (en) * 2004-06-05 2005-12-08 National Background Data, Llc Systems, apparatus and methods for performing criminal background investigations
US7031860B2 (en) * 2004-09-22 2006-04-18 Taiwan Semiconductor Manufacturing Co., Ltd. Method and system of semiconductor fabrication fault analysis
US7870151B2 (en) * 2007-02-07 2011-01-11 Fair Issac Corporation Fast accurate fuzzy matching
EP2631822A1 (en) 2007-12-21 2013-08-28 Thomson Reuters Global Resources Systems, methods, and software for entity relationship resolution

Also Published As

Publication number Publication date
EP2245554A1 (en) 2010-11-03
US9690816B2 (en) 2017-06-27
US20090198678A1 (en) 2009-08-06
US20150161191A1 (en) 2015-06-11
WO2009086311A1 (en) 2009-07-09
EP2631822A1 (en) 2013-08-28
US9600509B2 (en) 2017-03-21
US20130191329A1 (en) 2013-07-25
CA2710427C (en) 2018-04-24
US8990152B2 (en) 2015-03-24
CA2710427A1 (en) 2009-07-09

Similar Documents

Publication Publication Date Title
AR069933A1 (es) Sistemas, metodos y software para resoluc ion de bases de datos por relaciones de entidades (erd)
CO2019003638A2 (es) Método y aparato para el acceso a datos bioinformáticos estructurados en unidades de acceso
BRPI0401092A (pt) Busca em computador com associações
WO2014113772A3 (en) Methods and systems for mapping repair orders within a database
ES2664744T3 (es) Sistema de identificación individual
WO2012177794A3 (en) Identifying information related to a particular entity from electronic sources, using dimensional reduction and quantum clustering
CL2013000844A1 (es) Metodo implementado por computador para focalizar mensajes y anuncios, comprende: recibir a traves del computador datos de identificacion del usuario de un dispositivo del usuario, recuperar a traves del computador, la informacion del usuario con base en los datos de identificacion del usuario, filtrar a traves del computador la informacion del usuario para crear un perfil anonimo, uno o mas medios de almacenamiento; sistema.
BR112015031703A8 (pt) meio de armazenamento legível por computador, método para proporcionar terminais de lançamento de aplicação a partir de múltiplos centros de dados tendo diferentes conjuntos de locação e sistema
RU2014109361A (ru) Отслеживание " грязных" областей энергонезависимых носителей
BR112012008653A2 (pt) sistema de localização e método para operar um sistema de localização
BR112013019236A2 (pt) sistema servidor para fornecer acesso seguro a um registro de dados, token de hardware para uso com um terminal de usuário em comunicação com o sistema servidor, sistema, método de fornecimento de acesso seguro a um registro de dados e produto de programa de computador
IN2012CH04230A (es)
RU2013155626A (ru) Рекомендательная система для пополнения данных
BR112014007472A2 (pt) recuperação de imagens
ATE544119T1 (de) Anfragenverwaltung in einem verteilten datenbanksystem
TW200745887A (en) Navigation system, procedure and computer program product for the operation the same
BR112015012250A2 (pt) método e sistema para identificar defeitos em vidro
MY175611A (en) Information-processing system
WO2009158664A8 (en) Library description of the user interface for federated search results
BR112014006450A2 (pt) gerenciamento de identidades de dispositivo móvel
BR102015028272A8 (pt) Dispositivos de computação de acesso a base de dados e de armazenamento legível por computador, e, método para exibir informação em relação a uma ou mais peças em um produto
MX2019011597A (es) Sistemas y metodos para optimizacion de consulta e indice para recuperar datos en casos de una estructura de datos de formulacion desde una base de datos.
Araujo et al. SERIMI results for OAEI 2011
GB2463221A (en) Biological database index and query searching
BR112012008645A2 (pt) circuito de acionamento de exibição, dispositivo de exibição e método de acionamento de exibição

Legal Events

Date Code Title Description
FA Abandonment or withdrawal