CN103593586A - Data quality evaluation method - Google Patents

Data quality evaluation method Download PDF

Info

Publication number
CN103593586A
CN103593586A CN201310636049.1A CN201310636049A CN103593586A CN 103593586 A CN103593586 A CN 103593586A CN 201310636049 A CN201310636049 A CN 201310636049A CN 103593586 A CN103593586 A CN 103593586A
Authority
CN
China
Prior art keywords
logic
logic rules
rule
descriptive language
rules
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310636049.1A
Other languages
Chinese (zh)
Inventor
付萍萍
陶振文
陈燕青
洪微明
余鹏飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
Information and Telecommunication Branch of State Grid Jiangxi Electric Power Co Ltd
Original Assignee
State Grid Corp of China SGCC
Information and Telecommunication Branch of State Grid Jiangxi Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC, Information and Telecommunication Branch of State Grid Jiangxi Electric Power Co Ltd filed Critical State Grid Corp of China SGCC
Priority to CN201310636049.1A priority Critical patent/CN103593586A/en
Publication of CN103593586A publication Critical patent/CN103593586A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Stored Programmes (AREA)

Abstract

The invention relates to a data quality evaluation method. The data quality evaluation method comprises the following steps that firstly, a logic object rule model is built, logic processing requirements and rules are abstracted, and a logic rule model of logic objects is built; secondly, a logic rule description language is built according to the logic object rule model, and namely the logic rule description language for data quality evaluation is built according to logic rules and standards, wherein the logic rule description language is composed of logic defining, logic analysis and logic execution; thirdly, logic rule analysis is carried out, and a rule engine of the logic rule description language is built for the logic rule description language to accomplish logic rule description language processing so as to achieve logic rule judgment and accomplish data quality evaluation, wherein the logic rule engine includes logic rule description language analysis, correctness checking and rule execution. The data quality evaluation method has the advantages of being easy to maintain and expand and strong in adaptability.

Description

A kind of method that the quality of data is evaluated
Technical field
The present invention relates to a kind of evaluation method, the method for especially quality of data being evaluated.
Background technology
Along with the propagation and employment of all kinds of service application infosystems, increasing enterprise starts to pay close attention to the quality of miscellaneous service data, and aspect the lifting of the quality of data, is taking up to carry out a large amount of controls; Meanwhile, the corresponding check-up measure and evaluation for Various types of data quality is also along with the application of infosystem is further improved, and most units have all carried out quality of data examination work for aspects such as different departments, specialty, fields from enterprise.
Meanwhile, along with the continuous variation of quality of data checking evaluation standard with rule, existing evaluation infosystem is difficult to adapt to the variation adjustment demand of these evaluation ways.Under this background, the present invention proposes a kind of method that the quality of data is evaluated, and utilizes informationization technology means, and realization can adapt to the variation of various evaluations fast.
Summary of the invention
The object of the present invention is to provide a kind of method that the quality of data is evaluated, comprise the following steps:
A, set up object logic rule model, take out the requirement and rule of logical process, set up a kind of logic rules model of object logic;
B, according to object logic rule model, set up logic rules descriptive language, according to logic rules and standard, by analyzing with abstract, set up a kind of logic rules descriptive language that carries out quality of data evaluation, described logic rules descriptive language consists of following several parts,
(1) logical definition: according to the requirement of data evaluation, carry out the definition of logic rules;
(2) logical analysis: by take out the logic rules of definition from logic rules storehouse, by resolving, change into the language syntax that computer program can be identified, carry out for program, obtain result;
(3) logic is carried out: according to logical analysis result, carry out and calculate;
C, logic rules are resolved, for logic rules descriptive language, set up the regulation engine of logic rules descriptive language, by logic rules engine, automatically complete the processing to logic rules descriptive language, thereby the judgement of the logic of implementation rule, the evaluation of complete paired data quality; Logic rules engine comprises the parsing of logic rules descriptive language, verifying correctness and rule and carries out three parts;
(1) logic rules descriptive language is resolved: according to logic rules, logic rules descriptive language is carried out to dissection process;
(2) verifying correctness: identify irrational analysis result, logic rules setting is carried out to verifying correctness analysis;
(3) rule is carried out: by resolving, change into the language syntax that computer program can be identified, program, according to logical analysis result, is carried out and calculated;
Described logic rules comprise interval type logic rules and non-interval type logic rules.
Beneficial effect of the present invention:
(1) be easy to safeguard: the change procedure of quality of data evaluation assignment logic is described with logic rules descriptive language, independent with program, be convenient to general service personnel miscellaneous service logic rules are dynamically updated.
(2) be easy to expansion: these logic rules have been summed up various types of definition and the parsing in the application of Various types of data quality assessment practical business, are easy to expansion.
(3) strong adaptability: by using based on quality of data evaluation logic rule definition and parsing and calculating, can adapt to application and the variation of various evaluation type systematics.
Accompanying drawing explanation
Fig. 1 is logic rules dissection process process flow diagram of the present invention;
Fig. 2 is that logic rules of the present invention are processed structural drawing.
Embodiment
Below in conjunction with accompanying drawing, to of the present invention, be elaborated.
The object of the present invention is to provide a kind of method that the quality of data is evaluated, comprise the following steps:
A, set up object logic rule model, take out the requirement and rule of logical process, set up a kind of logic rules model of object logic;
B, according to object logic rule model, set up logic rules descriptive language, according to logic rules and standard, by analyzing with abstract, set up a kind of logic rules descriptive language that carries out quality of data evaluation, described logic rules descriptive language consists of following several parts,
(1) logical definition: according to the requirement of data evaluation, carry out the definition of logic rules;
(2) logical analysis: by take out the logic rules of definition from logic rules storehouse, by resolving, change into the language syntax that computer program can be identified, carry out for program, obtain result;
(3) logic is carried out: according to logical analysis result, carry out and calculate;
C, logic rules are resolved, for logic rules descriptive language, set up the regulation engine of logic rules descriptive language, by logic rules engine, automatically complete the processing to logic rules descriptive language, thereby the judgement of the logic of implementation rule, the evaluation of complete paired data quality; Logic rules engine comprises the parsing of logic rules descriptive language, verifying correctness and rule and carries out three parts;
(1) logic rules descriptive language is resolved: according to logic rules, logic rules descriptive language is carried out to dissection process;
(2) verifying correctness: identify irrational analysis result, logic rules setting is carried out to verifying correctness analysis; The logic preanalysis that verifying correctness comprises the result after logic rules are resolved, identifies irrational logic rules definition, provides and prevents from arranging wrong regular method.
(3) rule is carried out: by resolving, change into the language syntax that computer program can be identified, program, according to logical analysis result, is carried out and calculated; Executing rule is the systematic knowledge expression way of using multiple production, and its primary expression mode is: CASE object logic WHEN logic rules THEN (consequent).
Described logic rules comprise interval type logic rules and non-interval type logic rules.
The logic rules of interval type mainly realize the processing of certain data when certain logic interval range, and the business description of this logic is summarized as follows:
1, the interval of vertex type: rule setting and processing when this Interval Type allows logic judgement interval to be certain point value, be described below: when Z=X1, value=K1 (or carry out certain and calculate); During Z=X2, value=K2 (or carry out certain and calculate); During Z=X3, value=K3 (or carry out certain and calculate) When not meeting interval, value=K0 (or carry out certain and calculate).The length range class that the processing of this interval type allows in system can unrestrictedly expand.
2, continuous interval: rule setting and processing when this Interval Type allows logic judgement interval to be certain section of continuous value, be described below:
When Z>=X1, value=K1 (or carry out certain and calculate); During X1<Z<=X2, value=K2 (or carry out certain and calculate); During X2<Z<=X3, value=K3 (or carry out certain and calculate) When not meeting interval, value=K0 (or carry out certain and calculate).The length range class that the processing of this interval type allows in system can unrestrictedly expand.(wherein, X1<X2<X3<Xn)
3, discrete interval: rule setting and processing when this Interval Type allows logic judgement interval to be the discrete value of multistage, be described below:
When Z>=X1, value=K1 (or carry out certain and calculate); During X2<Z<X4, value=K2 (or carry out certain and calculate); During X5<Z<=X6, value=K3 (or carry out certain and calculate) When not meeting interval, value=K0 (or carry out certain and calculate).The length range class that the processing of this interval type allows in system can unrestrictedly expand.(wherein, X1<X2<X3< ... <Xn)
The logic rules of non-interval type mainly realize the processing of certain data when non-interval range, and the logic rules of all non-interval types in this article, are referred to as " non-interval type logic rules "; The business description of this logic is summarized as follows:
The logic rules of 1 constant type: this type of logic rules are processed not to be needed to calculate, are directly used and in logic rules, define numerical value, as: settings constant is K, as long as the program that adopts these logic rules to configure is all calculated as automatically: value=K; As: acquiescence was to 10 minutes.
2, the logic rules of computing formula class: this logic rules meet definition, the parsing of legal arbitrarily computing formula and calculate, as: (K1-X)/K2*K3.Wherein, K1, K2, K3 is constant, and X is certain variate-value, and this value obtains after by system-computed, then this formula of substitution calculates result.
3, the logic rules of Decline type: this logic rules meet " add up to Z, often X is individual less, and button Y, till having detained ", are less than within X and do not detain, and surpass X just button; By the relation of X, Y, Z, draw after critical value, draw certain computing formula, define and calculate in conjunction with 2, obtain a result.As: 100% gets full marks, and every minimizing 1% button 1 minute, till having detained.
4, increase progressively the logic rules of type: this logic rules are satisfied, and " add up to Z, " every increase X, button Y, till having detained ", increases within X and do not detain, and surpasses X hour and just detains; By the relation of X, Y, Z, draw after critical value, draw certain computing formula, define and calculate in conjunction with 2, obtain a result.As: 0% gets full marks, and every increase by 1% button 1 minute, till having detained.
This method defines, resolves and processes mainly for the evaluation rule of the quality of data, often be applied in the data evaluation demands such as standardization of promptness, integrality, accuracy and data detail to the quality of data, as: the aspects such as the inspection of the quality of data, the formulation of quality of data evaluation criterion and quality of data evaluation.
Basic thought of the present invention is: rule and the standard of data-driven quality evaluation index checking system, take out the requirement and rule of logical process, set up a kind of logic rules model that meets various index evaluation application, according to this rule model, design a kind of descriptive language that can realize these logic rules models, realize the method based on self-defining checking evaluation standard logical language rule definition and parsing and calculating.By all kinds of logic rules being analyzed with abstract, design a kind of based on self-defining checking evaluation standard logic rules descriptive language and analytics engine.Make actual business rule independent from program, the variation of business logic processing rule, only need to define logic rules, adjust; Provide that a kind of business personnel easily understands, the logic rules language of easy maintenance; Thereby realized the service application of all kinds of index systems in all kinds of evaluations.

Claims (2)

1. a method of the quality of data being evaluated, is characterized in that, comprises the following steps:
A, set up object logic rule model, take out the requirement and rule of logical process, set up a kind of logic rules model of object logic;
B, according to object logic rule model, set up logic rules descriptive language, according to logic rules and standard, by analyzing with abstract, set up a kind of logic rules descriptive language that carries out quality of data evaluation, described logic rules descriptive language consists of following several parts,
(1) logical definition: according to the requirement of data evaluation, carry out the definition of logic rules;
(2) logical analysis: by take out the logic rules of definition from logic rules storehouse, by resolving, change into the language syntax that computer program can be identified, carry out for program, obtain result;
(3) logic is carried out: according to logical analysis result, carry out and calculate;
C, logic rules are resolved, for logic rules descriptive language, set up the regulation engine of logic rules descriptive language, by logic rules engine, automatically complete the processing to logic rules descriptive language, thereby the judgement of the logic of implementation rule, the evaluation of complete paired data quality; Logic rules engine comprises the parsing of logic rules descriptive language, verifying correctness and rule and carries out three parts;
(1) logic rules descriptive language is resolved: according to logic rules, logic rules descriptive language is carried out to dissection process;
(2) verifying correctness: identify irrational analysis result, logic rules setting is carried out to verifying correctness analysis;
(3) rule is carried out: by resolving, change into the language syntax that computer program can be identified, program, according to logical analysis result, is carried out and calculated.
2. a kind of method that the quality of data is evaluated according to claim 1, is characterized in that: described logic rules comprise interval type logic rules and non-interval type logic rules.
CN201310636049.1A 2013-12-03 2013-12-03 Data quality evaluation method Pending CN103593586A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310636049.1A CN103593586A (en) 2013-12-03 2013-12-03 Data quality evaluation method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310636049.1A CN103593586A (en) 2013-12-03 2013-12-03 Data quality evaluation method

Publications (1)

Publication Number Publication Date
CN103593586A true CN103593586A (en) 2014-02-19

Family

ID=50083723

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310636049.1A Pending CN103593586A (en) 2013-12-03 2013-12-03 Data quality evaluation method

Country Status (1)

Country Link
CN (1) CN103593586A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105117980A (en) * 2015-08-24 2015-12-02 云南电网有限责任公司 Power grid equipment state automatic evaluation method
CN109597606A (en) * 2018-10-24 2019-04-09 中国平安人寿保险股份有限公司 Method, equipment and the storage medium of operational decision making are carried out using regulation engine
CN111932103A (en) * 2020-08-04 2020-11-13 红旗智行科技(北京)有限公司 Carrier, driver and transportation demand matching calculation method and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080059925A1 (en) * 2006-08-29 2008-03-06 International Business Machines Corporation Method, System, and Program Product for Automated Verification of Gating Logic Using Formal Verification
CN102789450A (en) * 2012-07-12 2012-11-21 卢玉敏 Definable semantic analysis system and method on basis of rules
CN102929646A (en) * 2011-12-09 2013-02-13 江西省电力公司信息通信中心 Application program production method and device
CN102968305A (en) * 2012-02-24 2013-03-13 江西省电力公司信息通信中心 Logical processing method, logical processing device and evaluation system
CN103034703A (en) * 2012-12-10 2013-04-10 江西省电力公司信息通信分公司 Method for data exchange among multiple systems based on rule configuration

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080059925A1 (en) * 2006-08-29 2008-03-06 International Business Machines Corporation Method, System, and Program Product for Automated Verification of Gating Logic Using Formal Verification
CN102929646A (en) * 2011-12-09 2013-02-13 江西省电力公司信息通信中心 Application program production method and device
CN102968305A (en) * 2012-02-24 2013-03-13 江西省电力公司信息通信中心 Logical processing method, logical processing device and evaluation system
CN102789450A (en) * 2012-07-12 2012-11-21 卢玉敏 Definable semantic analysis system and method on basis of rules
CN103034703A (en) * 2012-12-10 2013-04-10 江西省电力公司信息通信分公司 Method for data exchange among multiple systems based on rule configuration

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105117980A (en) * 2015-08-24 2015-12-02 云南电网有限责任公司 Power grid equipment state automatic evaluation method
CN105117980B (en) * 2015-08-24 2019-02-12 云南电网有限责任公司 A kind of automatic evaluation method of grid equipment state
CN109597606A (en) * 2018-10-24 2019-04-09 中国平安人寿保险股份有限公司 Method, equipment and the storage medium of operational decision making are carried out using regulation engine
CN111932103A (en) * 2020-08-04 2020-11-13 红旗智行科技(北京)有限公司 Carrier, driver and transportation demand matching calculation method and system

Similar Documents

Publication Publication Date Title
CN103106605B (en) A kind of used car pricing system
CN103345209B (en) production monitoring method and system
CN103107509B (en) Full automatic relay protection fixed value setting calculation and validation method based on spreadsheet
Cao et al. Performance evaluation and enhancement of multistage manufacturing systems with rework loops
CN103593586A (en) Data quality evaluation method
Haridy et al. An attribute chart for monitoring the process mean and variance
CN103971022A (en) Aircraft part quality stability control algorithm based on T2 control chart
CN103488169B (en) Continuous chemical plant installations and control loop performance real-time estimating method, device
CN107606745A (en) Metro Air conditioner season by when ring control energy consumption Forecasting Methodology
CN103280779B (en) Auditing processing method for relay protection setting value
CN102968305B (en) Logical process method, device and evaluation system
Gu et al. Reliability modeling of manufacturing systems based on the task network evolved by key quality characteristics
CN105117606A (en) Method for determining element fault probability change tendency
CN103714440A (en) Safety production integrated information management system
Stogniy et al. Using accuracy measurements to evaluate simulation model simplification
CN104537423A (en) Occupational hazard quantitative evaluation and enterprise occupational security prediction system
Kruse et al. Simulation-based assessment and optimization of the energy consumption in multi variant production
Cai et al. Method on integrated reliability assessment of test data based on Duane model
Fung et al. Note on the productivity convergence of airports in China
CN104954174A (en) Method for setting double-threshold warning
Danilevich et al. Use of simulation modelling for checking monitoring and testing procedures
Atluru et al. Statistical process monitoring with MTConnect
CN103078319A (en) Real-time plan balancing capability evaluation method and real-time plan balancing capability evaluation system for power grids
Chen et al. The model and method of trustworthiness level evaluation for software product
CN109472480A (en) Medicine patent valve estimating system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20140219