MY192169A - System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository - Google Patents
System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repositoryInfo
- Publication number
- MY192169A MY192169A MYPI2018001926A MYPI2018001926A MY192169A MY 192169 A MY192169 A MY 192169A MY PI2018001926 A MYPI2018001926 A MY PI2018001926A MY PI2018001926 A MYPI2018001926 A MY PI2018001926A MY 192169 A MY192169 A MY 192169A
- Authority
- MY
- Malaysia
- Prior art keywords
- knowledge base
- entities
- duplicates
- module
- production knowledge
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
- G06F16/215—Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Devices For Executing Special Programs (AREA)
Abstract
Disclosed is a system and method for managing one or more duplicate entities based on a relationship cardinality in a production knowledge base repository. The method comprises steps of performing a first level detection of duplicates in existing data present in the production knowledge base repository through an object harmonisation module (202). The first level detection identifies duplicates of one or more attribute objects within a specific entity. The object harmonisation module (202) implements a sanitization and standardization operation on the identified attribute objects. Then the method performs a second level detection of duplicates between entities of a specific concept through a homogeneity recognition module (204). The homogeneity recognition module (204) identifies duplicates according to base-attributes of the specific concept based on a predefined similarity threshold. The method then enables a user to determine the similarity of the entities and further enables the user to merge the similar entities through an entity conflation and merging module (206). (FIG. 2)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
MYPI2018001926A MY192169A (en) | 2018-11-14 | 2018-11-14 | System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository |
PCT/MY2019/050093 WO2020101478A1 (en) | 2018-11-14 | 2019-11-14 | System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
MYPI2018001926A MY192169A (en) | 2018-11-14 | 2018-11-14 | System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository |
Publications (1)
Publication Number | Publication Date |
---|---|
MY192169A true MY192169A (en) | 2022-08-03 |
Family
ID=70730534
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
MYPI2018001926A MY192169A (en) | 2018-11-14 | 2018-11-14 | System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository |
Country Status (2)
Country | Link |
---|---|
MY (1) | MY192169A (en) |
WO (1) | WO2020101478A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112001451A (en) * | 2020-08-27 | 2020-11-27 | 上海擎感智能科技有限公司 | Data redundancy processing method, system, medium and device |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014012576A1 (en) * | 2012-07-16 | 2014-01-23 | Qatar Foundation | A method and system for integrating data into a database |
KR101740317B1 (en) * | 2013-04-10 | 2017-05-26 | 한국전자통신연구원 | Method and apparatus for memory management |
US9569491B2 (en) * | 2013-09-13 | 2017-02-14 | Nec Corporation | MISO (multistore-online-tuning) system |
KR20150121505A (en) * | 2014-04-21 | 2015-10-29 | 삼성전자주식회사 | Method and device for data deduplication |
-
2018
- 2018-11-14 MY MYPI2018001926A patent/MY192169A/en unknown
-
2019
- 2019-11-14 WO PCT/MY2019/050093 patent/WO2020101478A1/en active Application Filing
Also Published As
Publication number | Publication date |
---|---|
WO2020101478A1 (en) | 2020-05-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8977646B2 (en) | Leveraging graph databases in a federated database system | |
MX2019014440A (en) | Method and system for information extraction from document images using conversational interface and database querying. | |
US10878000B2 (en) | Extracting graph topology from distributed databases | |
Sadiq et al. | Data quality: The role of empiricism | |
US8412652B2 (en) | Apparatus and methods for operator training in information extraction | |
GB2598493A (en) | Inferring temporal relationships for cybersecurity events | |
WO2019118469A3 (en) | Methods and systems for management of media content associated with message context on mobile computing devices | |
CN103234549B (en) | A kind of differential data generation method for upgrading map | |
EP3822875A2 (en) | Method and apparatus for outputting information, device, storage medium, and computer program product | |
EP3922950A3 (en) | Road information processing method and apparatus, electronic device, storage medium and program | |
GB2574537A (en) | Managing large scale association sets using optimized bit map representations | |
TW201915942A (en) | Hierarchical image classification method and system | |
CN112000773A (en) | Data association relation mining method based on search engine technology and application | |
US20120197925A1 (en) | Optimization of Database Driver Performance | |
TW202004526A (en) | Index creating method and apparatus based on NoSQL database of mobile terminal | |
CN112116331A (en) | Talent recommendation method and device | |
CN111666419A (en) | Knowledge graph construction method and device for legal data | |
MY192169A (en) | System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository | |
US11036718B2 (en) | Linking entities in dynamic graphs | |
CN110472034A (en) | Detection method, device, equipment and the computer readable storage medium of question answering system | |
TWI731469B (en) | Apparatus and method for verfication of information | |
US10671668B2 (en) | Inferring graph topologies | |
CN108268462A (en) | A kind of data quality checking system of relation integraity | |
US9244988B2 (en) | Dynamic relevant reporting | |
DE602004022479D1 (en) | System and method for generating a query of information about selected objects |