MY192169A - System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository - Google Patents

System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository

Info

Publication number
MY192169A
MY192169A MYPI2018001926A MYPI2018001926A MY192169A MY 192169 A MY192169 A MY 192169A MY PI2018001926 A MYPI2018001926 A MY PI2018001926A MY PI2018001926 A MYPI2018001926 A MY PI2018001926A MY 192169 A MY192169 A MY 192169A
Authority
MY
Malaysia
Prior art keywords
knowledge base
entities
duplicates
module
production knowledge
Prior art date
Application number
MYPI2018001926A
Inventor
Binti Mohamed Sa'niah
Zarina Binti Ishak Ros'aleza
Stella Tabora Domingo Ma
Wooi Kin Goon
Raziq Ramesh Bin Abdullah Muhammad
Original Assignee
Mimos Berhad
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mimos Berhad filed Critical Mimos Berhad
Priority to MYPI2018001926A priority Critical patent/MY192169A/en
Priority to PCT/MY2019/050093 priority patent/WO2020101478A1/en
Publication of MY192169A publication Critical patent/MY192169A/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Devices For Executing Special Programs (AREA)

Abstract

Disclosed is a system and method for managing one or more duplicate entities based on a relationship cardinality in a production knowledge base repository. The method comprises steps of performing a first level detection of duplicates in existing data present in the production knowledge base repository through an object harmonisation module (202). The first level detection identifies duplicates of one or more attribute objects within a specific entity. The object harmonisation module (202) implements a sanitization and standardization operation on the identified attribute objects. Then the method performs a second level detection of duplicates between entities of a specific concept through a homogeneity recognition module (204). The homogeneity recognition module (204) identifies duplicates according to base-attributes of the specific concept based on a predefined similarity threshold. The method then enables a user to determine the similarity of the entities and further enables the user to merge the similar entities through an entity conflation and merging module (206). (FIG. 2)
MYPI2018001926A 2018-11-14 2018-11-14 System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository MY192169A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
MYPI2018001926A MY192169A (en) 2018-11-14 2018-11-14 System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository
PCT/MY2019/050093 WO2020101478A1 (en) 2018-11-14 2019-11-14 System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
MYPI2018001926A MY192169A (en) 2018-11-14 2018-11-14 System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository

Publications (1)

Publication Number Publication Date
MY192169A true MY192169A (en) 2022-08-03

Family

ID=70730534

Family Applications (1)

Application Number Title Priority Date Filing Date
MYPI2018001926A MY192169A (en) 2018-11-14 2018-11-14 System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository

Country Status (2)

Country Link
MY (1) MY192169A (en)
WO (1) WO2020101478A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112001451A (en) * 2020-08-27 2020-11-27 上海擎感智能科技有限公司 Data redundancy processing method, system, medium and device

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014012576A1 (en) * 2012-07-16 2014-01-23 Qatar Foundation A method and system for integrating data into a database
KR101740317B1 (en) * 2013-04-10 2017-05-26 한국전자통신연구원 Method and apparatus for memory management
US9569491B2 (en) * 2013-09-13 2017-02-14 Nec Corporation MISO (multistore-online-tuning) system
KR20150121505A (en) * 2014-04-21 2015-10-29 삼성전자주식회사 Method and device for data deduplication

Also Published As

Publication number Publication date
WO2020101478A1 (en) 2020-05-22

Similar Documents

Publication Publication Date Title
US8977646B2 (en) Leveraging graph databases in a federated database system
MX2019014440A (en) Method and system for information extraction from document images using conversational interface and database querying.
US10878000B2 (en) Extracting graph topology from distributed databases
Sadiq et al. Data quality: The role of empiricism
US8412652B2 (en) Apparatus and methods for operator training in information extraction
GB2598493A (en) Inferring temporal relationships for cybersecurity events
WO2019118469A3 (en) Methods and systems for management of media content associated with message context on mobile computing devices
CN103234549B (en) A kind of differential data generation method for upgrading map
EP3822875A2 (en) Method and apparatus for outputting information, device, storage medium, and computer program product
EP3922950A3 (en) Road information processing method and apparatus, electronic device, storage medium and program
GB2574537A (en) Managing large scale association sets using optimized bit map representations
TW201915942A (en) Hierarchical image classification method and system
CN112000773A (en) Data association relation mining method based on search engine technology and application
US20120197925A1 (en) Optimization of Database Driver Performance
TW202004526A (en) Index creating method and apparatus based on NoSQL database of mobile terminal
CN112116331A (en) Talent recommendation method and device
CN111666419A (en) Knowledge graph construction method and device for legal data
MY192169A (en) System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository
US11036718B2 (en) Linking entities in dynamic graphs
CN110472034A (en) Detection method, device, equipment and the computer readable storage medium of question answering system
TWI731469B (en) Apparatus and method for verfication of information
US10671668B2 (en) Inferring graph topologies
CN108268462A (en) A kind of data quality checking system of relation integraity
US9244988B2 (en) Dynamic relevant reporting
DE602004022479D1 (en) System and method for generating a query of information about selected objects