CN115380281A - 根据数据字段的语义标签生成用于数据字段的数据处理值的规则 - Google Patents

根据数据字段的语义标签生成用于数据字段的数据处理值的规则 Download PDF

Info

Publication number
CN115380281A
CN115380281A CN202180022638.3A CN202180022638A CN115380281A CN 115380281 A CN115380281 A CN 115380281A CN 202180022638 A CN202180022638 A CN 202180022638A CN 115380281 A CN115380281 A CN 115380281A
Authority
CN
China
Prior art keywords
data
field
value
tag
fields
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202180022638.3A
Other languages
English (en)
Chinese (zh)
Inventor
约翰·乔伊斯
马歇尔·A·伊斯曼
S·梅尔布希
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ab Initio Technology LLC
Original Assignee
Ab Initio Technology LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ab Initio Technology LLC filed Critical Ab Initio Technology LLC
Publication of CN115380281A publication Critical patent/CN115380281A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • G06N5/046Forward inferencing; Production systems
    • G06N5/047Pattern matching networks; Rete networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Quality & Reliability (AREA)
  • Computational Linguistics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
CN202180022638.3A 2020-02-26 2021-02-25 根据数据字段的语义标签生成用于数据字段的数据处理值的规则 Pending CN115380281A (zh)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US202062981646P 2020-02-26 2020-02-26
US62/981,646 2020-02-26
US17/006,504 2020-08-28
US17/006,504 US11886399B2 (en) 2020-02-26 2020-08-28 Generating rules for data processing values of data fields from semantic labels of the data fields
PCT/US2021/019572 WO2021173777A1 (en) 2020-02-26 2021-02-25 Generating rules for data processing values of data fields from semantic labels of the data fields

Publications (1)

Publication Number Publication Date
CN115380281A true CN115380281A (zh) 2022-11-22

Family

ID=77366048

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202180022638.3A Pending CN115380281A (zh) 2020-02-26 2021-02-25 根据数据字段的语义标签生成用于数据字段的数据处理值的规则

Country Status (9)

Country Link
US (4) US11886399B2 (https=)
EP (2) EP4521261A3 (https=)
JP (2) JP7512402B2 (https=)
CN (1) CN115380281A (https=)
AR (1) AR121459A1 (https=)
AU (1) AU2021226330B2 (https=)
BR (1) BR112022016716A2 (https=)
CA (1) CA3167627A1 (https=)
WO (1) WO2021173777A1 (https=)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115840748A (zh) * 2022-12-15 2023-03-24 金蝶软件(中国)有限公司 数据处理方法、系统及相关设备
CN115952174A (zh) * 2023-03-13 2023-04-11 青岛庚泽信息技术有限公司 一种数据表联接方法、系统、终端及存储介质
CN116126704A (zh) * 2023-01-03 2023-05-16 四川新网银行股份有限公司 一种基于语义识别的mock自动造数方法、装置及介质
CN116150225A (zh) * 2023-01-04 2023-05-23 建信金融科技有限责任公司 数据字段处理方法、装置、设备、介质和程序产品
CN117668090A (zh) * 2024-02-01 2024-03-08 安徽容知日新科技股份有限公司 数据交换方法、装置、电子设备和计算机可读存储介质
CN120849471A (zh) * 2025-09-24 2025-10-28 北京科杰科技有限公司 一种基于人工智能的质量规则配置方法及其相关装置

Families Citing this family (62)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11194845B2 (en) 2019-04-19 2021-12-07 Tableau Software, LLC Interactive lineage analyzer for data assets
US11704494B2 (en) 2019-05-31 2023-07-18 Ab Initio Technology Llc Discovering a semantic meaning of data fields from profile data of the data fields
US11651003B2 (en) 2019-09-27 2023-05-16 Tableau Software, LLC Interactive data visualization interface for data and graph models
US11423217B2 (en) 2019-11-07 2022-08-23 Tableau Software, LLC Flexible table based visualizations
US12264301B2 (en) 2019-11-08 2025-04-01 Coors Brewing Company Method of brewing non-alcoholic beer
US11886399B2 (en) * 2020-02-26 2024-01-30 Ab Initio Technology Llc Generating rules for data processing values of data fields from semantic labels of the data fields
US11422985B2 (en) * 2020-07-30 2022-08-23 Tableau Software, LLC Interactive data modeling
JP2022033625A (ja) * 2020-08-17 2022-03-02 富士フイルムビジネスイノベーション株式会社 情報処理装置
US11698906B2 (en) * 2020-08-26 2023-07-11 Jpmorgan Chase Bank, N.A. Method and apparatus for automatically mapping physical data models/objects to logical data models and business terms
US11514109B2 (en) 2020-10-29 2022-11-29 Google Llc Inferring semantic label(s) for assistant device(s) based on device-specific signal(s)
US12141323B2 (en) * 2020-11-25 2024-11-12 Anonomatic, Inc. Processing personally identifiable information from a schema
US11520929B2 (en) * 2020-12-15 2022-12-06 Netapp, Inc. Methods and systems for securely storing unstructured data in a storage system
US20220215107A1 (en) * 2021-01-07 2022-07-07 Salesforce.Com, Inc. System and methods to perform row level field masking leveraging attribute based access control in a multi tenant environment
US20220300707A1 (en) * 2021-03-18 2022-09-22 Capital One Services, Llc Systems and methods for generating term definitions using recurrent neural networks
US12001416B1 (en) 2021-04-20 2024-06-04 The Travelers Indemnity Company Systems and methods for generic data parsing applications
US12321583B1 (en) 2021-04-20 2025-06-03 The Travelers Indemnity Company Systems and methods for artificial intelligence (AI)-driven data mapping user-interface (UI) generation
US20220366341A1 (en) * 2021-05-17 2022-11-17 Dataworkz Inc System and method for managing dataset quality in a computing environment
US20220383283A1 (en) * 2021-05-27 2022-12-01 Mastercard International Incorporated Systems and methods for rules management for a data processing network
US12229145B2 (en) 2021-06-01 2025-02-18 Tableau Software, LLC Metadata inheritance for data assets
EP4102424A1 (en) * 2021-06-09 2022-12-14 ABB Schweiz AG Broker entity to bridge semantic gaps for information produced in industrial plants
US12229555B2 (en) * 2021-06-20 2025-02-18 International Business Machines Corporation Generating masks for formats including masking restrictions
US12423333B2 (en) 2021-07-08 2025-09-23 Tableau Software, LLC Data processing for visualizing hierarchical data
US11941151B2 (en) * 2021-07-16 2024-03-26 International Business Machines Corporation Dynamic data masking for immutable datastores
US20230059083A1 (en) 2021-08-23 2023-02-23 Tableau Software, LLC Generating shortcut paths between related data types
US12346297B2 (en) * 2021-08-29 2025-07-01 Technion Research & Development Foundation Limited Database record lineage and vector search
US12105742B2 (en) 2021-08-31 2024-10-01 Tableau Software, LLC Providing data flow directions for data objects
US12105687B2 (en) * 2021-09-29 2024-10-01 Jpmorgan Chase Bank, N.A. Systems and methods for automated data quality semantic constraint identification using rich data type inferences
CN113836146B (zh) * 2021-09-29 2024-04-26 五八同城信息技术有限公司 一种特征标签生成方法、装置、电子设备及存储介质
CN113642030B (zh) * 2021-10-14 2022-02-15 广东鸿数科技有限公司 敏感数据多层识别方法
US11567975B1 (en) * 2021-11-05 2023-01-31 NVISNX, Inc. System and method for user interactive contextual model classification based on metadata
KR102622434B1 (ko) * 2021-11-12 2024-01-09 주식회사 스타캣 데이터의 타입을 자동으로 판별하여 메타데이터를 생성하는 방법 및 이를 위한 머신러닝/딥러닝 모델을 이용한 데이터 타입 판별 장치
US11797430B2 (en) * 2021-12-03 2023-10-24 T-Mobile Usa, Inc. Configuration-driven data conversion and hosting for software development systems and methods
US20230177026A1 (en) * 2021-12-06 2023-06-08 Microsoft Technology Licensing, Llc Data quality specification for database
US20230177379A1 (en) * 2021-12-06 2023-06-08 Microsoft Technology Licensing, Llc Data quality machine learning model
CN114722103A (zh) * 2022-05-12 2022-07-08 江苏数兑科技有限公司 一种基于人口法人基础库的智能数据治理方法
US11875017B2 (en) * 2022-05-17 2024-01-16 Sap Se Intelligent adjustment of documents via machine learning
CN115080616B (zh) * 2022-06-13 2025-09-02 京东方科技集团股份有限公司 字典数据获取方法、装置、存储介质及电子设备
DE102022207482B4 (de) * 2022-07-21 2024-03-07 Zf Friedrichshafen Ag Computerimplementiertes Verfahren zum Bestimmen eines Datenqualitätsindex, Computerprogramm und Steuereinheit
US12361414B2 (en) * 2022-08-23 2025-07-15 Plaid Inc. Parsing event data for clustering and classification
WO2024064705A1 (en) 2022-09-20 2024-03-28 Ab Initio Technology Llc Techniques for discovering and updating semantic meaning of data fields
CN115543977B (zh) * 2022-09-29 2024-07-19 河北雄安睿天科技有限公司 一种供水行业数据清洗方法
US12271355B2 (en) * 2022-09-30 2025-04-08 Bmc Software, Inc. Semantic classification for data management
TWI822388B (zh) * 2022-10-12 2023-11-11 財團法人資訊工業策進會 資安防護偵測規則的標示方法及資安威脅策略、技術與攻擊流程標示裝置
US20240211800A1 (en) * 2022-12-23 2024-06-27 The Johns Hopkins University Processing event data based on machine learning
US12393903B2 (en) 2023-01-27 2025-08-19 Tableau Software, LLC Determining shortcut relationships in data models
WO2024163398A1 (en) * 2023-01-31 2024-08-08 Mastercard International Incorporated Expert systems implementing prioritization techniques for improved transaction categorization
CN115827644B (zh) * 2023-02-13 2023-06-09 明度智云(浙江)科技有限公司 一种基于可视化视图配置的报表生成方法、系统和服务器
US12182176B2 (en) * 2023-03-28 2024-12-31 Accenture Global Solutions Limited System and method for intelligent synthetic test data generation
WO2024205582A1 (en) * 2023-03-29 2024-10-03 Pricewaterhousecoopers Llp Ai-augmented composable and configurable microservices for record linkage and reconciliation
US12265553B2 (en) 2023-03-29 2025-04-01 PwC Product Sales LLC AI-augmented composable and configurable microservices for record linkage and reconciliation
CN116028481B (zh) * 2023-03-30 2023-06-27 紫金诚征信有限公司 一种数据质量检测方法、装置、设备和存储介质
CN116415199B (zh) * 2023-04-13 2023-10-20 广东铭太信息科技有限公司 基于审计中间表的业务数据离群分析方法
US12327282B2 (en) * 2023-06-07 2025-06-10 Wells Fargo Bank, N.A. User interface for identifying and correcting results of data processing rules preventing transmission of data
US12518052B2 (en) * 2023-08-04 2026-01-06 Dell Products L.P. Automated privacy profiling framework for machine learning workspaces
US20250103750A1 (en) * 2023-09-27 2025-03-27 Acronis International Gmbh Systems and methods for applying data anonymization schemes based on versions of a software
US12579116B2 (en) * 2023-12-29 2026-03-17 Truist Bank Database and data structure management systems
US12566878B2 (en) 2024-01-01 2026-03-03 Bank Of America Corporation Data sanitizer
US12561297B2 (en) * 2024-04-03 2026-02-24 Sap Se Rule remediation actions
US20250328505A1 (en) 2024-04-19 2025-10-23 Anomalo, Inc. Benchmarking Algorithms for Data Quality Monitoring
CN119127864B (zh) * 2024-11-08 2025-01-28 云南省大数据有限公司 关联数据源跨数据层库和表的数据质量检测方法、系统
CN120277509B (zh) * 2025-06-10 2025-09-02 山东泰开互感器有限公司 输变电装备制造全流程数据检核方法、系统、终端及介质
CN120850352B (zh) * 2025-09-25 2026-01-27 浪潮云信息技术股份公司 一种多字段数据脱敏方法、装置、设备及介质

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180107694A1 (en) * 2016-10-17 2018-04-19 Sap Se Performing data quality functions using annotations
CN109191030A (zh) * 2018-11-21 2019-01-11 深圳越海全球供应链有限公司 一种提升数据质量的方法及提高仓库运作效率的方法
CN110442568A (zh) * 2019-07-30 2019-11-12 北京明略软件系统有限公司 字段标签的获取方法及装置、存储介质、电子装置

Family Cites Families (59)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7426520B2 (en) 2003-09-10 2008-09-16 Exeros, Inc. Method and apparatus for semantic discovery and mapping between data sources
US8868580B2 (en) 2003-09-15 2014-10-21 Ab Initio Technology Llc Data profiling
US7912904B2 (en) * 2004-03-31 2011-03-22 Google Inc. Email system with conversation-centric user interface
US9792351B2 (en) 2005-06-10 2017-10-17 International Business Machines Corporation Tolerant and extensible discovery of relationships in data using structural information and data analysis
AU2006260795A1 (en) 2005-06-20 2006-12-28 Future Route Limited Analytical system for discovery and generation of rules to predict and detect anomalies in data and financial fraud
US7587636B2 (en) 2005-08-04 2009-09-08 Microsoft Corporation Unit test generalization
US20080201319A1 (en) 2006-04-25 2008-08-21 Mcnamar Richard Timothy Method, system and computer software for using an XBRL medical record for diagnosis, treatment, and insurance coverage
US7783564B2 (en) 2006-07-25 2010-08-24 Visa U.S.A. Inc. Compliance control in a card based program
KR20100102607A (ko) 2007-11-12 2010-09-24 바이파 사이언스 인코포레이티드 Parp 억제제를 단독으로 사용하거나 항종양제와 병용하여 자궁암 및 난소암을 치료하는 방법
US8140531B2 (en) 2008-05-02 2012-03-20 International Business Machines Corporation Process and method for classifying structured data
EP2307951A4 (en) 2008-07-29 2012-12-19 Textwise Llc METHOD AND APPARATUS FOR ASSOCIATING DATA SETS USING SEMANTIC VECTORS AND KEYWORD ANALYSIS
US8176072B2 (en) 2009-07-28 2012-05-08 Vulcan Technologies Llc Method and system for tag suggestion in a tag-associated data-object storage system
US8379939B1 (en) 2009-09-08 2013-02-19 Adobe Systems Incorporated Efficient and scalable face recognition in photo albums
US20120254333A1 (en) 2010-01-07 2012-10-04 Rajarathnam Chandramouli Automated detection of deception in short and multilingual electronic messages
US8751218B2 (en) 2010-02-09 2014-06-10 Siemens Aktiengesellschaft Indexing content at semantic level
US8719207B2 (en) 2010-07-27 2014-05-06 Oracle International Corporation Method and system for providing decision making based on sense and respond
US20130006914A1 (en) 2011-06-28 2013-01-03 Microsoft Corporation Exposing search history by category
US10248672B2 (en) * 2011-09-19 2019-04-02 Citigroup Technology, Inc. Methods and systems for assessing data quality
CN107451225B (zh) 2011-12-23 2021-02-05 亚马逊科技公司 用于半结构化数据的可缩放分析平台
US9461876B2 (en) 2012-08-29 2016-10-04 Loci System and method for fuzzy concept mapping, voting ontology crowd sourcing, and technology prediction
CA2886603A1 (en) 2012-09-28 2014-04-03 Alkis Papadopoullos A method and system for monitoring social media and analyzing text to automate classification of user posts using a facet based relevance assessment model
US20140095296A1 (en) * 2012-10-01 2014-04-03 Ebay Inc. Systems and methods for analyzing and reporting geofence performance metrics
US9613125B2 (en) 2012-10-11 2017-04-04 Nuance Communications, Inc. Data store organizing data using semantic classification
US10489360B2 (en) 2012-10-17 2019-11-26 Ab Initio Technology Llc Specifying and applying rules to data
KR102113366B1 (ko) 2012-10-22 2020-05-20 아브 이니티오 테크놀로지 엘엘시 데이터 저장 시스템에서 데이터 소스 특성화
US9239889B2 (en) 2013-03-15 2016-01-19 Sugarcrm Inc. Adaptive search and navigation through semantically aware searching
US9836986B2 (en) 2013-05-21 2017-12-05 Pearson Education, Inc. Dynamic response entry
US20150032609A1 (en) 2013-07-29 2015-01-29 International Business Machines Corporation Correlation of data sets using determined data types
US10528718B2 (en) * 2013-09-27 2020-01-07 Paypal, Inc. Method and apparatus for a data confidence index
WO2015084408A1 (en) 2013-12-06 2015-06-11 Hewlett-Packard Development Company, L.P. Flexible schema table
GB2521198A (en) 2013-12-13 2015-06-17 Ibm Refactoring of databases to include soft type information
GB201322057D0 (en) * 2013-12-13 2014-01-29 Qatar Foundation Descriptive and prescriptive data cleaning
US10346358B2 (en) 2014-06-04 2019-07-09 Waterline Data Science, Inc. Systems and methods for management of data platforms
US10198460B2 (en) 2014-06-04 2019-02-05 Waterline Data Science, Inc. Systems and methods for management of data platforms
US10169642B2 (en) 2014-08-06 2019-01-01 Facebook, Inc. Systems and methods for face alert
US20160055427A1 (en) 2014-10-15 2016-02-25 Brighterion, Inc. Method for providing data science, artificial intelligence and machine learning as-a-service
US10210246B2 (en) 2014-09-26 2019-02-19 Oracle International Corporation Techniques for similarity analysis and data enrichment using knowledge sources
US10409802B2 (en) 2015-06-12 2019-09-10 Ab Initio Technology Llc Data quality analysis
US9910842B2 (en) 2015-08-12 2018-03-06 Captricity, Inc. Interactively predicting fields in a form
US20170068891A1 (en) 2015-09-04 2017-03-09 Infotech Soft, Inc. System for rapid ingestion, semantic modeling and semantic querying over computer clusters
US10067972B2 (en) 2015-11-17 2018-09-04 International Business Machines Corporation Semantic database driven form validation
WO2018025706A1 (ja) 2016-08-05 2018-02-08 日本電気株式会社 テーブル意味推定システム、方法およびプログラム
US10552443B1 (en) * 2016-08-11 2020-02-04 MuleSoft, Inc. Schemaless to relational representation conversion
US20180060404A1 (en) 2016-08-29 2018-03-01 Linkedin Corporation Schema abstraction in data ecosystems
US10565177B2 (en) * 2016-11-14 2020-02-18 At&T Intellectual Property I, L.P. Software defined entities for digital service transactions
CN106528874B (zh) 2016-12-08 2019-07-19 重庆邮电大学 基于Spark内存计算大数据平台的CLR多标签数据分类方法
CN106897424A (zh) 2017-02-24 2017-06-27 北京时间股份有限公司 信息标注系统及方法
JP7235269B2 (ja) 2017-03-13 2023-03-08 日本電気株式会社 データ項目名推定装置、データ項目名推定プログラム、及びデータ項目名推定方法
CN111771364B (zh) 2018-01-10 2022-08-23 爱维士软件有限责任公司 经由dns属性在远程网络中进行基于云的异常流量检测和保护
US20200110736A1 (en) * 2018-03-29 2020-04-09 Robert Paul Bauman Natural language, flat field, record management and file system that defines, integrates and operates records comprising best practices and establishes collaborative peer networks to evolve new best practice records
US12455778B2 (en) 2018-07-06 2025-10-28 Capital One Services, Llc Systems and methods for data stream simulation
US11474978B2 (en) 2018-07-06 2022-10-18 Capital One Services, Llc Systems and methods for a data search engine based on data profiles
CN109635288B (zh) 2018-11-29 2023-05-23 东莞理工学院 一种基于深度神经网络的简历抽取方法
US11328004B2 (en) 2019-03-22 2022-05-10 Microsoft Technology Licensing, Llc Method and system for intelligently suggesting tags for documents
US11704494B2 (en) 2019-05-31 2023-07-18 Ab Initio Technology Llc Discovering a semantic meaning of data fields from profile data of the data fields
CA3160259A1 (en) * 2019-12-19 2021-06-24 Ryan Michael McKay Self-optimizing labeling platform
US11886399B2 (en) * 2020-02-26 2024-01-30 Ab Initio Technology Llc Generating rules for data processing values of data fields from semantic labels of the data fields
US11630853B2 (en) 2021-01-29 2023-04-18 Snowflake Inc. Metadata classification
WO2024064705A1 (en) 2022-09-20 2024-03-28 Ab Initio Technology Llc Techniques for discovering and updating semantic meaning of data fields

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180107694A1 (en) * 2016-10-17 2018-04-19 Sap Se Performing data quality functions using annotations
CN109191030A (zh) * 2018-11-21 2019-01-11 深圳越海全球供应链有限公司 一种提升数据质量的方法及提高仓库运作效率的方法
CN110442568A (zh) * 2019-07-30 2019-11-12 北京明略软件系统有限公司 字段标签的获取方法及装置、存储介质、电子装置

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115840748A (zh) * 2022-12-15 2023-03-24 金蝶软件(中国)有限公司 数据处理方法、系统及相关设备
CN116126704A (zh) * 2023-01-03 2023-05-16 四川新网银行股份有限公司 一种基于语义识别的mock自动造数方法、装置及介质
CN116126704B (zh) * 2023-01-03 2026-01-30 四川新网银行股份有限公司 一种基于语义识别的mock自动造数方法、装置及介质
CN116150225A (zh) * 2023-01-04 2023-05-23 建信金融科技有限责任公司 数据字段处理方法、装置、设备、介质和程序产品
CN115952174A (zh) * 2023-03-13 2023-04-11 青岛庚泽信息技术有限公司 一种数据表联接方法、系统、终端及存储介质
CN115952174B (zh) * 2023-03-13 2023-05-30 青岛庚泽信息技术有限公司 一种数据表联接方法、系统、终端及存储介质
CN117668090A (zh) * 2024-02-01 2024-03-08 安徽容知日新科技股份有限公司 数据交换方法、装置、电子设备和计算机可读存储介质
CN117668090B (zh) * 2024-02-01 2024-06-04 安徽容知日新科技股份有限公司 数据交换方法、装置、电子设备和计算机可读存储介质
CN120849471A (zh) * 2025-09-24 2025-10-28 北京科杰科技有限公司 一种基于人工智能的质量规则配置方法及其相关装置
CN120849471B (zh) * 2025-09-24 2026-01-27 北京科杰科技有限公司 一种基于人工智能的质量规则配置方法及其相关装置

Also Published As

Publication number Publication date
US12242443B2 (en) 2025-03-04
JP2023516139A (ja) 2023-04-18
WO2021173777A1 (en) 2021-09-02
JP7806137B2 (ja) 2026-01-26
CA3167627A1 (en) 2021-09-02
JP7512402B2 (ja) 2024-07-08
US12242442B2 (en) 2025-03-04
US12242444B2 (en) 2025-03-04
US20240126735A1 (en) 2024-04-18
BR112022016716A2 (pt) 2022-10-11
AR121459A1 (es) 2022-06-08
EP4111325B1 (en) 2025-04-02
US20240152495A1 (en) 2024-05-09
AU2021226330A1 (en) 2022-08-25
US11886399B2 (en) 2024-01-30
JP2024144425A (ja) 2024-10-11
EP4521261A2 (en) 2025-03-12
US20240126734A1 (en) 2024-04-18
EP4111325A1 (en) 2023-01-04
US20210263900A1 (en) 2021-08-26
AU2021226330B2 (en) 2025-02-27
EP4521261A3 (en) 2025-05-07

Similar Documents

Publication Publication Date Title
JP7806137B2 (ja) データフィールドの意味論的ラベルからのデータフィールドのデータ処理値に対するルールの生成
US12456016B2 (en) Discovering a semantic meaning of data fields from profile data of the data fields
Qahtan et al. FAHES: A robust disguised missing values detector
US12141107B2 (en) Techniques for discovering and updating semantic meaning of data fields
De Bruin Record linkage toolkit documentation
HK40079584A (en) Generating rules for data processing values of data fields from semantic labels of the data fields
HK40079584B (en) Generating rules for data processing values of data fields from semantic labels of the data fields
CA3268252C (en) Techniques for discovering and updating semantic meaning of data fields
US20250036602A1 (en) Machine learning techniques for discovering keys in relational datasets
US20260030241A1 (en) Automated generation of pairs of natural language questions and database queries
Bhadauria et al. A Catalog of Data Errors
Mondal et al. Identify Researchers’ Credibility on Citation Using Self-citation Detection by Author Name Disambiguation
Papastergios et al. Data and Information Quality
Huang Relational Data Curation by Deduplication, Anonymization, and Diversification

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination