MX2023010157A - Generacion y ejecucion de flujos de trabajo de procesamiento para corregir los problemas de calidad de datos en conjuntos de datos. - Google Patents

Generacion y ejecucion de flujos de trabajo de procesamiento para corregir los problemas de calidad de datos en conjuntos de datos.

Info

Publication number
MX2023010157A
MX2023010157A MX2023010157A MX2023010157A MX2023010157A MX 2023010157 A MX2023010157 A MX 2023010157A MX 2023010157 A MX2023010157 A MX 2023010157A MX 2023010157 A MX2023010157 A MX 2023010157A MX 2023010157 A MX2023010157 A MX 2023010157A
Authority
MX
Mexico
Prior art keywords
data
data quality
workflow
state
results
Prior art date
Application number
MX2023010157A
Other languages
English (en)
Inventor
Jonathan Martin
Adam Weiss
Original Assignee
Ab Initio Technology Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ab Initio Technology Llc filed Critical Ab Initio Technology Llc
Publication of MX2023010157A publication Critical patent/MX2023010157A/es

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0793Remedial or corrective actions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0751Error or fault detection not based on redundancy
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0631Resource planning, allocation, distributing or scheduling for enterprises or organisations
    • G06Q10/06311Scheduling, planning or task assignment for a person or group
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0631Resource planning, allocation, distributing or scheduling for enterprises or organisations
    • G06Q10/06316Sequencing of tasks or work

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Human Resources & Organizations (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Economics (AREA)
  • Databases & Information Systems (AREA)
  • Educational Administration (AREA)
  • Development Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • Stored Programmes (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Debugging And Monitoring (AREA)
  • Detection And Prevention Of Errors In Transmission (AREA)
  • Facsimile Image Signal Circuits (AREA)

Abstract

Los sistemas y métodos son para ejecutar, mediante un sistema de procesamiento de datos, un flujo de trabajo para procesar datos de resultados que indican una salida de una comprobación de calidad de datos en registros de datos al generar, en respuesta a la recepción de datos de resultados y metadatos que describen los datos de resultados, un problema de calidad de datos asociado con un estado y una o más etapas de procesamiento del flujo de trabajo para resolver un error de calidad de datos asociado con la comprobación de calidad de datos. Las operaciones incluyen generar un flujo de trabajo para procesar datos de resultados basado en un estado especificado por un problema de calidad de datos. Generar el flujo de trabajo incluye: asignar, basado en los datos de resultados y el estado del problema de calidad de datos, una entidad responsable de resolver el error de calidad de datos; determinar, basado en los metadatos, una o más acciones para satisfacer la condición de calidad de datos especificada en la comprobación de calidad de datos; y actualizar el estado asociado con el problema de calidad de datos.
MX2023010157A 2021-03-01 2022-03-01 Generacion y ejecucion de flujos de trabajo de procesamiento para corregir los problemas de calidad de datos en conjuntos de datos. MX2023010157A (es)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202163155148P 2021-03-01 2021-03-01
PCT/US2022/018310 WO2022187224A1 (en) 2021-03-01 2022-03-01 Generation and execution of processing workflows for correcting data quality issues in data sets

Publications (1)

Publication Number Publication Date
MX2023010157A true MX2023010157A (es) 2023-09-11

Family

ID=80780731

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2023010157A MX2023010157A (es) 2021-03-01 2022-03-01 Generacion y ejecucion de flujos de trabajo de procesamiento para corregir los problemas de calidad de datos en conjuntos de datos.

Country Status (10)

Country Link
US (1) US20220276920A1 (es)
EP (1) EP4302193A1 (es)
JP (1) JP2024508643A (es)
CN (1) CN116917869A (es)
AU (1) AU2022229349A1 (es)
BR (1) BR112023017346A2 (es)
CA (1) CA3208255A1 (es)
DE (1) DE112022001326T5 (es)
MX (1) MX2023010157A (es)
WO (1) WO2022187224A1 (es)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220366341A1 (en) * 2021-05-17 2022-11-17 Dataworkz Inc System and method for managing dataset quality in a computing environment
CN117131037A (zh) * 2023-10-25 2023-11-28 北京集度科技有限公司 一种数据质量检测方法、装置、系统及智能车辆
CN117648388B (zh) * 2024-01-29 2024-04-12 成都七柱智慧科技有限公司 一种可视化的安全实时的数据仓库实现方法及其系统

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6850806B2 (en) * 1999-04-16 2005-02-01 Siemens Energy & Automation, Inc. Method and apparatus for determining calibration options in a motion control system
US6684349B2 (en) * 2000-01-18 2004-01-27 Honeywell International Inc. Reliability assessment and prediction system and method for implementing the same
US6701451B1 (en) * 2000-08-11 2004-03-02 Emc Corporation Selective correction of data errors
US6886108B2 (en) * 2001-04-30 2005-04-26 Sun Microsystems, Inc. Threshold adjustment following forced failure of storage device
FR2837043B1 (fr) * 2002-03-05 2004-06-04 Cit Alcatel Systeme de commutation, dispositif de transmission, procede de transmission et procede de commutation pour satellite
US7428663B2 (en) * 2004-06-01 2008-09-23 Alcatel Lucent Electronic device diagnostic methods and systems
US7984220B2 (en) * 2004-09-02 2011-07-19 International Business Machines Corporation Exception tracking
JP2006146833A (ja) * 2004-11-25 2006-06-08 Hitachi Global Storage Technologies Netherlands Bv ディスク装置の整合性検査支援方法およびディスクアレイ装置の整合性検査方法
US7849062B1 (en) * 2005-03-18 2010-12-07 Beyondcore, Inc. Identifying and using critical fields in quality management
US8352412B2 (en) * 2009-02-27 2013-01-08 International Business Machines Corporation System for monitoring global online opinions via semantic extraction
US9086983B2 (en) * 2011-05-31 2015-07-21 Micron Technology, Inc. Apparatus and methods for providing data integrity
US20140195448A1 (en) * 2013-01-08 2014-07-10 Where 2 Get It, Inc. Social Location Data Management Methods and Systems
US9286153B2 (en) * 2013-12-12 2016-03-15 International Business Machines Corporation Monitoring the health of a question/answer computing system
US9898553B2 (en) * 2014-07-08 2018-02-20 Jpmorgan Chase Bank, N.A. Capturing run-time metadata
US10776740B2 (en) * 2016-06-07 2020-09-15 International Business Machines Corporation Detecting potential root causes of data quality issues using data lineage graphs
US10379920B2 (en) * 2017-06-23 2019-08-13 Accenture Global Solutions Limited Processing data to improve a quality of the data
US20200210401A1 (en) * 2018-12-28 2020-07-02 Microsoft Technology Licensing, Llc Proactive automated data validation
CN111143334A (zh) * 2019-11-13 2020-05-12 深圳市华傲数据技术有限公司 一种数据质量闭环控制方法
US11379465B2 (en) * 2020-01-09 2022-07-05 Raytheon Company Autonomous self-healing application data validation using database configurations
US11436204B2 (en) * 2020-06-04 2022-09-06 Bank Of America Corporation Enterprise data flow lineage from enterprise data testing metadata

Also Published As

Publication number Publication date
CA3208255A1 (en) 2022-09-09
DE112022001326T5 (de) 2024-02-08
US20220276920A1 (en) 2022-09-01
CN116917869A (zh) 2023-10-20
JP2024508643A (ja) 2024-02-28
WO2022187224A1 (en) 2022-09-09
EP4302193A1 (en) 2024-01-10
BR112023017346A2 (pt) 2023-09-26
AU2022229349A1 (en) 2023-08-17

Similar Documents

Publication Publication Date Title
MX2023010157A (es) Generacion y ejecucion de flujos de trabajo de procesamiento para corregir los problemas de calidad de datos en conjuntos de datos.
MX2019011590A (es) Metodos y sistemas para realizar pruebas en aplicaciones web.
US20160092290A1 (en) Processing data errors for a data processing system
CA2716266A1 (en) Content based audio copy detection
CN107025224B (zh) 一种监控任务运行的方法和设备
CN104657274B (zh) 软件界面测试方法及装置
EP3547145A3 (en) Systems and methods for reducing storage required for code coverage results
CN103699637A (zh) 一种代码生产率统计方法及其系统
GB2596438A (en) Computer model machine learning based on correlations of training data with performance trends
CN108228443B (zh) 一种web应用的测试方法及装置
US20150339286A1 (en) Automatically generating certification documents
CN105930257A (zh) 一种确定目标测试用例的方法及装置
CN110109824B (zh) 大数据自动回归测试方法、装置、计算机设备和存储介质
US20200319874A1 (en) Predicting downtimes for software system upgrades
MY189491A (en) Database data modification request processing method and apparatus
US20170091082A1 (en) Test db data generation apparatus
CN110716843B (zh) 系统故障分析处理方法、装置、存储介质及电子设备
CN111857981A (zh) 一种数据处理方法以及装置
CN113138990A (zh) 一种数据血缘构建、追溯方法、装置及设备
CN109783369B (zh) 一种自然语言理解模块回归测试方法、装置及电子设备
IN2015DE00970A (es)
CN104657267A (zh) 弹性的源代码语法树解析系统及方法
GB2602238A (en) Language statement processing in computing system
GB2582509A (en) Error handling
Higo et al. Correlation analysis between code clone metrics and project data on the same specification projects