CN112783882A - Big data quality inspection method, system, storage medium and equipment - Google Patents

Big data quality inspection method, system, storage medium and equipment Download PDF

Info

Publication number
CN112783882A
CN112783882A CN202110085577.7A CN202110085577A CN112783882A CN 112783882 A CN112783882 A CN 112783882A CN 202110085577 A CN202110085577 A CN 202110085577A CN 112783882 A CN112783882 A CN 112783882A
Authority
CN
China
Prior art keywords
quality inspection
index data
data
inspection index
rule
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110085577.7A
Other languages
Chinese (zh)
Inventor
朱水斌
胡乔治
范燎
罗稳
刘建
邓振强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhuoeer Purchase Information Technology Wuhan Co ltd
Original Assignee
Zhuoeer Purchase Information Technology Wuhan Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhuoeer Purchase Information Technology Wuhan Co ltd filed Critical Zhuoeer Purchase Information Technology Wuhan Co ltd
Priority to CN202110085577.7A priority Critical patent/CN112783882A/en
Publication of CN112783882A publication Critical patent/CN112783882A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24564Applying rules; Deductive queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • G06Q10/06393Score-carding, benchmarking or key performance indicator [KPI] analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • G06Q10/06395Quality analysis or management

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Economics (AREA)
  • Strategic Management (AREA)
  • Development Economics (AREA)
  • Educational Administration (AREA)
  • Databases & Information Systems (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Operations Research (AREA)
  • Marketing (AREA)
  • Game Theory and Decision Science (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • General Factory Administration (AREA)

Abstract

The invention relates to a big data quality inspection method, a system, a storage medium and equipment, wherein the method comprises the steps of reading service data information in a database, calculating quality inspection index data according to the service data information and a predefined rule, and constructing a quality inspection index data pool; determining a corresponding quality inspection rule according to the quality inspection target task, and calculating a quality inspection result according to the quality inspection rule and the quality inspection index data; and directly comparing the quality inspection result with a corresponding preset quality inspection threshold value, and generating early warning information when the quality inspection result and the preset quality inspection threshold value trigger an early warning rule. According to the quality inspection method and the quality inspection system, the quality inspection index data are calculated in advance according to the service data information and the predefined rule, and the quality inspection index data pool is constructed, so that the quality inspection result can be calculated by directly adopting the data in the quality inspection index data pool, the workload of repeated calculation is greatly reduced, the data in the quality inspection index data pool can be shared, the expansibility is strong, the flexibility and the convenience are realized, the quality inspection efficiency is improved, and the quality inspection result can be directly returned, so that the convenience and the intuition are realized.

Description

Big data quality inspection method, system, storage medium and equipment
Technical Field
The invention relates to the technical field of big data, in particular to a big data quality inspection method, a big data quality inspection system, a big data quality inspection storage medium and big data quality inspection equipment.
Background
In the prior art, a quality inspection method for big data generally includes that a quality inspection request is sent by a structure client, then a corresponding data quality inspection scheme is obtained according to an inspection task identifier carried by the quality inspection request, then a parameter value of an inspection parameter carried by the quality inspection request is utilized to analyze the data quality inspection scheme to generate a data quality inspection instruction executable by a big data platform, then the data quality inspection instruction is sent to the big data platform, and finally a data quality inspection result returned after the big data platform executes the data quality inspection instruction is received. The quality inspection method of big data mainly has the following problems and pain points: 1. data quality inspection schemes need to be configured in advance, and a plurality of quality inspection schemes can have the condition that the same field of the same table needs to be checked, so that the inspection results of the schemes cannot be shared, repeated calculation exists, resource waste is caused, and if the schemes need to be modified, the schemes are troublesome and are not convenient to expand; 2. the background needs SQL sentences generated through quality inspection, the same problems exist, the same object to be detected appears in different sentences, the detection granularity is too coarse, and resources cannot be shared; 3. the quality inspection result cannot be returned in real time, and the larger the data volume is, the slower the time for returning the result is; there is also a problem of resource waste.
Disclosure of Invention
The present invention provides a method, a system, a storage medium and a device for quality inspection of big data, aiming at the above-mentioned deficiencies of the prior art.
The technical scheme for solving the technical problems is as follows: a big data quality inspection method comprises the following steps:
s1: reading service data information in a database, calculating quality inspection index data according to the service data information and a predefined rule, and constructing a quality inspection index data pool;
s2: determining a corresponding quality inspection rule according to a quality inspection target task, and calculating a quality inspection result according to the quality inspection rule and quality inspection index data in a quality inspection index data pool;
s3: and directly comparing the quality inspection result with a corresponding preset quality inspection threshold value, and generating early warning information when the quality inspection result and the preset quality inspection threshold value trigger an early warning rule.
According to the big data quality inspection method, the quality inspection index data is calculated in advance according to the service data information and the predefined rule, and the quality inspection index data pool is constructed, so that the quality inspection result can be calculated by directly adopting the quality inspection index data in the quality inspection index data pool for a specific quality inspection task subsequently, the method is plug and play, the process is simple, the repeated calculation workload is greatly reduced, the quality inspection work difficulty is obviously reduced, the data in the quality inspection index data pool can be shared, the expansibility is strong, flexibility and convenience are realized, the quality inspection efficiency is improved, the performance is not influenced along with the increase of the data amount, and the quality inspection result can be directly returned, so that the method is convenient and visual.
On the basis of the technical scheme, the invention can be further improved as follows:
further: before the step S2, the method further includes the following steps:
and screening the quality inspection index data in the quality inspection index data pool according to a quality inspection target task, and marking the screened quality inspection index data.
The beneficial effects of the further scheme are as follows: by screening and marking the quality inspection index data in the quality inspection index data pool, the corresponding required quality inspection index data can be conveniently screened out according to the mark aiming at a specific quality inspection task, so that the quality inspection result is quickly calculated, the quality inspection efficiency is improved, and the system performance is favorably optimized.
Further: the specific method for screening the quality inspection index data in the quality inspection index data pool according to the quality inspection target task comprises the following steps:
and determining the type of target service data according to the quality inspection target task, and screening quality inspection index data of which the data type is matched with the type of the target quality inspection index data from the quality inspection index data pool according to the type of the target quality inspection index data.
The beneficial effects of the further scheme are as follows: by determining the data type of the quality inspection target task, quality inspection index data with the data type matched with the target quality inspection index data type can be accurately screened from the quality inspection index data pool, and the quality inspection index data are quality inspection index data required by the current quality inspection target task, so that the quality inspection result calculation of the current quality inspection target task can be conveniently completed.
Further: in step S1, the step of calculating quality inspection index data according to the service data information and the predefined rule specifically includes the following steps:
s11: judging the data volume of the service data information, and determining a sampling proportion according to the data volume of the service data information;
s12: sampling the service data information according to the sampling proportion, and calculating quality inspection index data according to the service data information obtained by sampling and a predefined rule.
The beneficial effects of the further scheme are as follows: by sampling in a corresponding proportion according to the data volume of the service data information, the calculation amount of quality inspection index data can be reduced, the calculation efficiency is improved, and the system performance is ensured.
The invention also provides a big data quality inspection system, which comprises a quality inspection index data pool module, a quality inspection rule engine module and an early warning module;
the quality inspection index data pool module is used for reading the service data information in the database, calculating quality inspection index data according to the service data information and a predefined rule, and constructing a quality inspection index data pool;
the quality inspection rule engine module is used for determining a corresponding quality inspection rule according to a quality inspection target task and calculating a quality inspection result according to the quality inspection rule and quality inspection index data in the quality inspection index data pool;
and the early warning module is used for directly comparing the quality inspection result with a corresponding preset quality inspection threshold value and generating early warning information when the quality inspection result and the preset quality inspection threshold value trigger an early warning rule.
According to the big data quality inspection system, the quality inspection index data is calculated in advance according to the service data information and the predefined rule, and the quality inspection index data pool is constructed, so that the quality inspection result can be calculated by directly adopting the quality inspection index data in the quality inspection index data pool for a specific quality inspection task subsequently, the system is plug and play, the process is simple, the repeated calculation workload is greatly reduced, the quality inspection work difficulty is obviously reduced, the data in the quality inspection index data pool can be shared, the expansibility is strong, the system is flexible and convenient, the quality inspection efficiency is improved, the performance is not influenced along with the increase of the data amount, and the quality inspection result can be directly returned, so that the system is convenient and visual.
On the basis of the technical scheme, the invention can be further improved as follows:
further: the big data quality inspection system also comprises a screening and marking module which is used for screening the quality inspection index data in the quality inspection index data pool according to the quality inspection target task and marking the screened quality inspection index data.
The beneficial effects of the further scheme are as follows: by screening and marking the quality inspection index data in the quality inspection index data pool, the corresponding required quality inspection index data can be conveniently screened out according to the mark aiming at a specific quality inspection task, so that the quality inspection result is quickly calculated, the quality inspection efficiency is improved, and the system performance is favorably optimized.
Further: the screening and marking module screens the quality inspection index data in the quality inspection index data pool according to the quality inspection target task, and the screening and marking module specifically realizes the following steps:
and determining the type of target service data according to the quality inspection target task, and screening quality inspection index data of which the data type is matched with the type of the target quality inspection index data from the quality inspection index data pool according to the type of the target quality inspection index data.
The beneficial effects of the further scheme are as follows: by determining the data type of the quality inspection target task, quality inspection index data with the data type matched with the target quality inspection index data type can be accurately screened from the quality inspection index data pool, and the quality inspection index data are quality inspection index data required by the current quality inspection target task, so that the quality inspection result calculation of the current quality inspection target task can be conveniently completed.
Further: the quality inspection index data pool module calculates quality inspection index data according to the service data information and the predefined rule, and the specific implementation is as follows:
judging the data volume of the service data information, and determining a sampling proportion according to the data volume of the service data information;
sampling the service data information according to the sampling proportion, and calculating quality inspection index data according to the service data information obtained by sampling and a predefined rule.
The beneficial effects of the further scheme are as follows: by sampling in a corresponding proportion according to the data volume of the service data information, the calculation amount of quality inspection index data can be reduced, the calculation efficiency is improved, and the system performance is ensured.
The invention also provides a computer readable storage medium, on which a computer program is stored, which, when executed by a processor, implements the big data quality inspection method.
The invention also provides big data quality inspection equipment which comprises the storage medium and a processor, wherein the processor realizes the steps of the big data quality inspection method when executing the computer program on the storage medium.
Drawings
FIG. 1 is a schematic flow chart illustrating a big data quality inspection method according to an embodiment of the present invention;
fig. 2 is a block diagram of a big data quality inspection system according to an embodiment of the present invention.
Detailed Description
The principles and features of this invention are described below in conjunction with the following drawings, which are set forth by way of illustration only and are not intended to limit the scope of the invention.
As shown in fig. 1, a big data quality inspection method includes the following steps:
s1: reading service data information in a database, calculating quality inspection index data according to the service data information and a predefined rule, and constructing a quality inspection index data pool;
s2: determining a corresponding quality inspection rule according to a quality inspection target task, and calculating a quality inspection result according to the quality inspection rule and quality inspection index data in a quality inspection index data pool;
s3: and directly comparing the quality inspection result with a corresponding preset quality inspection threshold value, and generating early warning information when the quality inspection result and the preset quality inspection threshold value trigger an early warning rule.
According to the big data quality inspection method, the quality inspection index data is calculated in advance according to the service data information and the predefined rule, and the quality inspection index data pool is constructed, so that the quality inspection result can be calculated by directly adopting the quality inspection index data in the quality inspection index data pool for a specific quality inspection task subsequently, the method is plug and play, the process is simple, the repeated calculation workload is greatly reduced, the quality inspection work difficulty is obviously reduced, the data in the quality inspection index data pool can be shared, the expansibility is strong, flexibility and convenience are realized, the quality inspection efficiency is improved, the performance is not influenced along with the increase of the data amount, and the quality inspection result can be directly returned, so that the method is convenient and visual.
Optionally, in one or more embodiments of the present invention, before the step S2, the method further includes the following steps:
and screening the quality inspection index data in the quality inspection index data pool according to a quality inspection target task, and marking the screened quality inspection index data.
By screening and marking the quality inspection index data in the quality inspection index data pool, the corresponding required quality inspection index data can be conveniently screened out according to the mark aiming at a specific quality inspection task, so that the quality inspection result is quickly calculated, the quality inspection efficiency is improved, and the system performance is favorably optimized.
Specifically, in one or more embodiments of the present invention, the specific method for screening the quality inspection index data in the quality inspection index data pool according to the quality inspection target task is as follows:
and determining the type of target service data according to the quality inspection target task, and screening quality inspection index data of which the data type is matched with the type of the target quality inspection index data from the quality inspection index data pool according to the type of the target quality inspection index data.
By determining the data type of the quality inspection target task, quality inspection index data with the data type matched with the target quality inspection index data type can be accurately screened from the quality inspection index data pool, and the quality inspection index data are quality inspection index data required by the current quality inspection target task, so that the quality inspection result calculation of the current quality inspection target task can be conveniently completed.
In one or more embodiments of the present invention, in step S1, the calculating quality inspection index data according to the service data information and the predefined rule specifically includes the following steps:
s11: judging the data volume of the service data information, and determining a sampling proportion according to the data volume of the service data information;
s12: sampling the service data information according to the sampling proportion, and calculating quality inspection index data according to the service data information obtained by sampling and a predefined rule.
By sampling in a corresponding proportion according to the data volume of the service data information, the calculation amount of quality inspection index data can be reduced, the calculation efficiency is improved, and the system performance is ensured.
In practice, the data amount and the sampling ratio of the service data information can be flexibly adjusted according to actual conditions. It should be noted that, in order to ensure the uniformity of the data and not to substantially affect the calculation result, the service data information of the same data type is sampled according to the sampling ratio.
The big data quality inspection method of the present invention will be explained below by taking the purchase cost of a product as an example. The total cost of purchasing commodities is commodity purchasing cost (a) + transportation cost (b) + inventory cost (c) + interest cost (financing) (d) + management cost (e) + insurance cost (f) + other cost (g), wherein the total cost of purchasing commodities is purchasing amount (a1) × purchasing unit price (a 2).
In the embodiment of the invention, the data volume of the service data information exceeds 30 ten thousand, which is relatively large, sampling is carried out according to the proportion of 15:1, then the data volume of the obtained service data information exceeds 2 ten thousand, and then the quality inspection index data is calculated according to the service data information and the predefined rule, as follows:
a total number of purchases (a1) of the item in the month;
the transportation cost (b) is the total transportation cost of the purchased commodities in the month;
a purchase unit price (a2) which is an average price for purchasing the commodity in the month;
the inventory cost (c) is the total inventory cost of all the commodities in the month;
interest charge (d) total interest financed in the month;
the management cost (e) is the total cost generated by purchasing the commodities in the month;
insurance cost (f): the total cost of purchasing commodity insurance in the same month;
and the other cost (g) is the total cost of other purchased commodities in the month.
And after the quality inspection index data is calculated, a quality inspection index data pool can be constructed. And then, determining a corresponding quality inspection rule according to the quality inspection target task, and calculating a quality inspection result according to the quality inspection rule and the quality inspection index data.
Specifically, the early warning rule is set for the total purchase expense
(1) The total purchase cost cm ═ a1 a2+ b + c + d + e + f + g > con (constant, custom);
description of the drawings: and if the total purchasing cost cm is greater than a preset purchasing cost quality inspection threshold con, early warning is given.
(2) Stock cost (cd) ═ c > con1 (constant, custom);
description of the drawings: if the inventory cost cm is larger than a preset inventory cost quality inspection threshold con1, early warning is given;
(3) logistics and inventory costs (cld) c + b > con2 (constant, custom);
description of the drawings: if the logistics and inventory fees cld > the preset logistics and inventory fees quality inspection threshold con2, early warning is given;
(4) commodity procurement cost (cp) a1 a2> con3 (constant, custom)
Description of the drawings: and if the commodity purchasing charge cp > the preset commodity purchasing charge quality inspection threshold con2, early warning is given.
In practice, corresponding quality inspection rules are determined for different quality inspection tasks, and then quality inspection results are calculated according to the quality inspection rules and quality inspection index data, so that quality inspection of large data is completed, repeated calculation from the source of the large data in a database is not needed, quality inspection index data in a quality inspection index data pool are directly adopted, and calculation of the quality inspection results can be completed very quickly through the quality inspection index data matched with the corresponding data types by combining with the target service data types determined by the corresponding quality inspection target tasks.
As shown in fig. 2, the present invention further provides a big data quality inspection system, which includes a quality inspection index data pool module, a quality inspection rule engine module and an early warning module;
the quality inspection index data pool module is used for reading the service data information in the database, calculating quality inspection index data according to the service data information and a predefined rule, and constructing a quality inspection index data pool;
the quality inspection rule engine module is used for determining a corresponding quality inspection rule according to a quality inspection target task and calculating a quality inspection result according to the quality inspection rule and quality inspection index data in the quality inspection index data pool;
and the early warning module is used for directly comparing the quality inspection result with a corresponding preset quality inspection threshold value and generating early warning information when the quality inspection result and the preset quality inspection threshold value trigger an early warning rule.
According to the big data quality inspection system, the quality inspection index data is calculated in advance according to the service data information and the predefined rule, and the quality inspection index data pool is constructed, so that the quality inspection result can be calculated by directly adopting the quality inspection index data in the quality inspection index data pool for a specific quality inspection task subsequently, the system is plug and play, the process is simple, the repeated calculation workload is greatly reduced, the quality inspection work difficulty is obviously reduced, the data in the quality inspection index data pool can be shared, the expansibility is strong, the system is flexible and convenient, the quality inspection efficiency is improved, the performance is not influenced along with the increase of the data amount, and the quality inspection result can be directly returned, so that the system is convenient and visual.
Optionally, in one or more embodiments of the present invention, the big data quality inspection system further includes a screening and marking module, configured to screen the quality inspection index data in the quality inspection index data pool according to a quality inspection target task, and mark the screened quality inspection index data.
By screening and marking the quality inspection index data in the quality inspection index data pool, the corresponding required quality inspection index data can be conveniently screened out according to the mark aiming at a specific quality inspection task, so that the quality inspection result is quickly calculated, the quality inspection efficiency is improved, and the system performance is favorably optimized.
Specifically, in one or more embodiments of the present invention, the screening and labeling module, according to a quality inspection target task, specifically implements the following steps of:
and determining the type of target service data according to the quality inspection target task, and screening quality inspection index data of which the data type is matched with the type of the target quality inspection index data from the quality inspection index data pool according to the type of the target quality inspection index data.
By determining the data type of the quality inspection target task, quality inspection index data with the data type matched with the target quality inspection index data type can be accurately screened from the quality inspection index data pool, and the quality inspection index data are quality inspection index data required by the current quality inspection target task, so that the quality inspection result calculation of the current quality inspection target task can be conveniently completed.
In one or more embodiments of the present invention, the specific implementation of the quality inspection index data pool module calculating the quality inspection index data according to the service data information and the predefined rule is as follows:
judging the data volume of the service data information, and determining a sampling proportion according to the data volume of the service data information;
sampling the service data information according to the sampling proportion, and calculating quality inspection index data according to the service data information obtained by sampling and a predefined rule.
By sampling in a corresponding proportion according to the data volume of the service data information, the calculation amount of quality inspection index data can be reduced, the calculation efficiency is improved, and the system performance is ensured.
The invention also provides a computer readable storage medium, on which a computer program is stored, which, when executed by a processor, implements the big data quality inspection method.
The invention also provides big data quality inspection equipment which comprises the storage medium and a processor, wherein the processor realizes the steps of the big data quality inspection method when executing the computer program on the storage medium.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims (10)

1. A big data quality inspection method is characterized by comprising the following steps:
s1: reading service data information in a database, calculating quality inspection index data according to the service data information and a predefined rule, and constructing a quality inspection index data pool;
s2: determining a corresponding quality inspection rule according to a quality inspection target task, and calculating a quality inspection result according to the quality inspection rule and quality inspection index data in a quality inspection index data pool;
s3: and directly comparing the quality inspection result with a corresponding preset quality inspection threshold value, and generating early warning information when the quality inspection result and the preset quality inspection threshold value trigger an early warning rule.
2. The big data quality inspection method according to claim 1, wherein before the step S2, the method further comprises the steps of:
and screening the quality inspection index data in the quality inspection index data pool according to a quality inspection target task, and marking the screened quality inspection index data.
3. The big data quality inspection method according to claim 2, wherein the specific method for screening the quality inspection index data in the quality inspection index data pool according to the quality inspection target task is as follows:
and determining the type of target service data according to the quality inspection target task, and screening quality inspection index data of which the data type is matched with the type of the target quality inspection index data from the quality inspection index data pool according to the type of the target quality inspection index data.
4. The big data quality inspection method according to any one of claims 1 to 3, wherein in the step S1, the calculating quality inspection index data according to the business data information and the predefined rule specifically includes the steps of:
s11: judging the data volume of the service data information, and determining a sampling proportion according to the data volume of the service data information;
s12: sampling the service data information according to the sampling proportion, and calculating quality inspection index data according to the service data information obtained by sampling and a predefined rule.
5. A big data quality inspection system is characterized in that: the quality control system comprises a quality control index data pool module, a quality control rule engine module and an early warning module;
the quality inspection index data pool module is used for reading the service data information in the database, calculating quality inspection index data according to the service data information and a predefined rule, and constructing a quality inspection index data pool;
the quality inspection rule engine module is used for determining a corresponding quality inspection rule according to a quality inspection target task and calculating a quality inspection result according to the quality inspection rule and quality inspection index data in the quality inspection index data pool;
and the early warning module is used for directly comparing the quality inspection result with a corresponding preset quality inspection threshold value and generating early warning information when the quality inspection result and the preset quality inspection threshold value trigger an early warning rule.
6. The big data quality inspection system of claim 5, wherein: the quality inspection target task screening and marking module is used for screening the quality inspection index data in the quality inspection index data pool according to the quality inspection target task and marking the screened quality inspection index data.
7. The big data quality inspection system of claim 6, wherein: the screening and marking module screens the quality inspection index data in the quality inspection index data pool according to the quality inspection target task, and the screening and marking module specifically realizes the following steps:
and determining the type of target service data according to the quality inspection target task, and screening quality inspection index data of which the data type is matched with the type of the target quality inspection index data from the quality inspection index data pool according to the type of the target quality inspection index data.
8. The big data quality inspection system according to any one of claims 5 to 7, wherein: the quality inspection index data pool module calculates quality inspection index data according to the service data information and the predefined rule, and the specific implementation is as follows:
judging the data volume of the service data information, and determining a sampling proportion according to the data volume of the service data information;
sampling the service data information according to the sampling proportion, and calculating quality inspection index data according to the service data information obtained by sampling and a predefined rule.
9. A computer-readable storage medium, having stored thereon a computer program which, when executed by a processor, implements the big data quality inspection method of any one of claims 1 to 4.
10. A big data quality inspection apparatus comprising the storage medium of claim 9 and a processor, the processor implementing the steps of the big data quality inspection method of any one of claims 1 to 4 when executing the computer program on the storage medium.
CN202110085577.7A 2021-01-22 2021-01-22 Big data quality inspection method, system, storage medium and equipment Pending CN112783882A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110085577.7A CN112783882A (en) 2021-01-22 2021-01-22 Big data quality inspection method, system, storage medium and equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110085577.7A CN112783882A (en) 2021-01-22 2021-01-22 Big data quality inspection method, system, storage medium and equipment

Publications (1)

Publication Number Publication Date
CN112783882A true CN112783882A (en) 2021-05-11

Family

ID=75758477

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110085577.7A Pending CN112783882A (en) 2021-01-22 2021-01-22 Big data quality inspection method, system, storage medium and equipment

Country Status (1)

Country Link
CN (1) CN112783882A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116975044A (en) * 2023-09-21 2023-10-31 云粒智慧科技有限公司 Quality inspection rule determining method, quality inspection rule determining device, quality inspection rule determining equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1826601A (en) * 2003-08-29 2006-08-30 瑞士银行股份有限公司 Redundancy-free provision of multi-purpose data
CN110019566A (en) * 2019-03-13 2019-07-16 平安信托有限责任公司 Data checking, device, computer equipment and storage medium based on data warehouse
CN111026749A (en) * 2019-11-11 2020-04-17 支付宝(杭州)信息技术有限公司 Service alarm method and device
CN111414376A (en) * 2020-03-02 2020-07-14 中国建设银行股份有限公司 Data early warning method and device
CN111563074A (en) * 2020-04-28 2020-08-21 厦门市美亚柏科信息股份有限公司 Data quality detection method and system based on multi-dimensional label

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1826601A (en) * 2003-08-29 2006-08-30 瑞士银行股份有限公司 Redundancy-free provision of multi-purpose data
CN110019566A (en) * 2019-03-13 2019-07-16 平安信托有限责任公司 Data checking, device, computer equipment and storage medium based on data warehouse
CN111026749A (en) * 2019-11-11 2020-04-17 支付宝(杭州)信息技术有限公司 Service alarm method and device
CN111414376A (en) * 2020-03-02 2020-07-14 中国建设银行股份有限公司 Data early warning method and device
CN111563074A (en) * 2020-04-28 2020-08-21 厦门市美亚柏科信息股份有限公司 Data quality detection method and system based on multi-dimensional label

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116975044A (en) * 2023-09-21 2023-10-31 云粒智慧科技有限公司 Quality inspection rule determining method, quality inspection rule determining device, quality inspection rule determining equipment and storage medium
CN116975044B (en) * 2023-09-21 2023-12-22 云粒智慧科技有限公司 Quality inspection rule determining method, quality inspection rule determining device, quality inspection rule determining equipment and storage medium

Similar Documents

Publication Publication Date Title
JP6707564B2 (en) Data quality analysis
CN107168854B (en) Internet advertisement abnormal click detection method, device, equipment and readable storage medium
CN108734561B (en) Electronic device, order data processing method, and computer-readable storage medium
CN109191133B (en) Payment channel selection method and terminal equipment
WO2019095665A1 (en) Interview method, server and computer-readable storage medium
CN111210321B (en) Risk early warning method and system based on contract management
CN112712417A (en) Bidding management method, system and storage medium
CN112783882A (en) Big data quality inspection method, system, storage medium and equipment
CN114969040A (en) Data display method and device, electronic equipment and storage medium
CN109697203B (en) Index transaction analysis method and device, computer storage medium, and computer device
JP2020057356A (en) Intelligent prediction of bundles of spare parts
CN111680941A (en) Premium recommendation method, device, equipment and storage medium
CN115168509A (en) Processing method and device of wind control data, storage medium and computer equipment
CN109345301A (en) A kind of data price-determining system and determining method
CN112329814B (en) Invoice data processing method and equipment
CN111427900B (en) Label library updating method, device, equipment and readable storage medium
CN108109002B (en) Data processing method and device
CN111833085A (en) Method and device for calculating price of article
CN112541514A (en) Event distribution method, server, terminal and storage medium
CN113409025B (en) Service data extraction method, device and storage medium
CN111598638A (en) Click rate determination method, device and equipment
CN114996113B (en) Real-time monitoring and early warning method and device for abnormal operation of large-data online user
CN113688645B (en) Identification method, system and equipment
CN113656486B (en) Method, device, terminal equipment and storage medium for generating visualized object
KR102473175B1 (en) Method and apparatus of valuation of business model using jump model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20210511

RJ01 Rejection of invention patent application after publication