CN113225221B - Effective data quality management method for industrial internet industry - Google Patents

Effective data quality management method for industrial internet industry Download PDF

Info

Publication number
CN113225221B
CN113225221B CN202110370784.7A CN202110370784A CN113225221B CN 113225221 B CN113225221 B CN 113225221B CN 202110370784 A CN202110370784 A CN 202110370784A CN 113225221 B CN113225221 B CN 113225221B
Authority
CN
China
Prior art keywords
data
executing
collected
judging whether
gateway
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110370784.7A
Other languages
Chinese (zh)
Other versions
CN113225221A (en
Inventor
聂清
方森涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Jiuxin Internet Of Things Science & Technology Co ltd
Original Assignee
Hangzhou Jiuxin Internet Of Things Science & Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Jiuxin Internet Of Things Science & Technology Co ltd filed Critical Hangzhou Jiuxin Internet Of Things Science & Technology Co ltd
Priority to CN202110370784.7A priority Critical patent/CN113225221B/en
Publication of CN113225221A publication Critical patent/CN113225221A/en
Application granted granted Critical
Publication of CN113225221B publication Critical patent/CN113225221B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0823Errors, e.g. transmission errors

Abstract

The invention provides an effective data quality management method for the industrial internet industry, which comprises the following steps: s1, inputting all gateway MAC addresses, and calculating each data quality judgment index; s2, judging whether point location data of each gateway is completely collected or not, and if all the point location data are collected, executing the step S3; otherwise, informing related personnel to check the problems in the acquisition flow and solve the problems; s3, judging whether the key fields are abnormal or not, and if not, executing the step S4; otherwise, informing related personnel to check the problems in the acquisition flow and solve the problems; and S4, judging that the data acquisition and the data quality are normal. The method is simple and practical, is beneficial to data quality check of data acquired by the data processing platform, and ensures data quality reliability of data acquisition in the industrial internet industry. In addition, the method can play a role in early warning, for example, the gateway can timely find and early warn corresponding personnel to solve problems and prevent data from being lost for a long time when the data is abnormally collected.

Description

Effective data quality management method for industrial internet industry
[ technical field ] A method for producing a semiconductor device
The invention relates to the technical field of industrial Internet/Internet of things, in particular to an effective data quality management method for the industrial Internet industry.
[ background of the invention ]
In the existing industrial internet/internet of things industry, data are acquired through different types of industrial gateways according to certain data source formats, types, frequencies and the like; the collected data are transmitted and summarized to a data processing platform through an Internet of things protocol; the data processing platform extracts, converts and loads various data, analyzes and processes the data and stores the data into a target database; the quality of the acquired data is not checked, and the quality of the acquired data cannot be verified.
However, the above technical solutions have certain drawbacks:
1) the data processing platform does not carry out data quality verification on the acquired data, whether the data are complete or not, whether the data meet consistency or not and whether the data are normally acquired or not are judged, and manual inquiry and determination are needed when one gateway is added, so that time and labor are wasted.
2) If the industrial internet platform does not carry out data quality verification, the reliability of the data quality cannot be ensured.
[ summary of the invention ]
The invention aims to solve the problems in the prior art and provides an effective data quality management method for the industrial internet industry, which can be used for carrying out data quality verification on collected data and ensuring data quality reliability.
In order to achieve the purpose, the invention provides an effective data quality management method for the industrial internet industry, which sequentially comprises the following steps:
s1, inputting all gateway MAC addresses, and calculating each data quality judgment index;
s2, judging whether point location data of each gateway is completely collected or not, and if all the point location data are collected, executing the step S3; otherwise, informing related personnel to check the problems in the acquisition flow and solve the problems;
s3, judging whether the key fields are abnormal or not, and if not, executing the step S4; otherwise, informing related personnel to check the problems in the acquisition flow and solve the problems;
and S4, judging that the data acquisition and the data quality are normal.
Preferably, the method specifically comprises the following steps:
s01, inputting all gateway MAC addresses;
s02, calculating the number of the configured point positions of each gateway;
s03, calculating the number of the collected point positions of each gateway;
s04, calculating the number of null values of each key field;
s05, calculating the repeated number of each key field;
s06, judging whether the number of the configured point locations is equal to that of the collected point locations, if so, representing that all point location data are collected, and executing a step S07; otherwise, representing that part of point location data is not collected, and executing the step S010;
s07, judging whether a null value exists in the related key field, if not, executing the step S08; otherwise, executing step S010;
s08, judging whether a key field has a repeated value, if not, executing the step S09; otherwise, executing step S010;
s09, judging whether the data acquisition frequency is normal or not, if not, executing the step S010, otherwise, executing the step S012;
s010, informing related personnel to check and check the problems in the acquisition process and solving the problems;
s011, judging whether the problem is solved or not, and if the problem is solved, returning to execute the step S01; otherwise, re-executing step S010;
and S012, normal data acquisition and normal data quality.
The invention has the beneficial effects that:
1) the method is simple and practical, is beneficial to data quality check of data acquired by the data processing platform, and ensures data quality reliability of data acquisition in the industrial internet industry.
2) The method can play a role in early warning, for example, the gateway can timely find and early warn corresponding personnel to solve problems and prevent data from being lost for a long time when the gateway acquires abnormal data.
The features and advantages of the present invention will be described in detail by embodiments in conjunction with the accompanying drawings.
[ description of the drawings ]
Fig. 1 is a flowchart of an effective data quality management method for the industrial internet industry according to the present invention.
[ detailed description ] embodiments
The invention relates to an effective data quality management method for the industrial internet industry, which is provided by the weak data quality of the industrial internet. The data quality not only relates to the accuracy of a data application presentation result, but also shows the rationality of the whole flow of data acquisition, processing, cleaning and the like. This method is described in detail below.
The invention mainly manages the data quality in the aspect of three dimensional indexes, namely a consistency index, a normative index and an accuracy index, wherein the specific indexes are realized in a program.
A method for managing the quality of effective data in the industrial Internet industry sequentially comprises the following steps:
s1, collecting all gateway MAC addresses;
s2, judging whether point location data of each gateway is completely collected or not, and if all the point location data are collected, executing the step S3; otherwise, informing related personnel to check the problems in the acquisition flow and solve the problems;
s3, judging whether the key fields are abnormal or not, and if not, executing the step S4; otherwise, informing related personnel to check the problems in the acquisition flow and solve the problems;
and S4, judging that the data acquisition and the data quality are normal.
Referring to fig. 1, the method specifically includes the following steps:
1) acquiring all gateway MAC addresses, wherein the gateway addresses have unique identifiers and can represent the uniqueness of each gateway;
2) calculating the configured point location number a1 of each gateway, wherein a1 represents the configured point location number of each gateway;
3) calculating the number a2 of the point locations collected by each gateway, wherein a2 represents the number of the point locations collected by each gateway;
4) calculating the number b of null values of each key field;
5) calculating the repeated number c of each key field;
6) and judging whether the configured point location is equal to the collected point location. If a1 is a2, all the points are collected, and step 7 is executed; if a1 ≠ a2, indicating that the partial point data was not collected, then step 10 is performed.
7) And judging whether each gateway key field has a null value. If b is 0, representing that no null value exists in the key field, executing step 8; if b ≠ 0, it represents a data quality anomaly and step 10 is performed.
8) And judging whether each gateway key field has a repeated value. If c is 0, representing that there is no duplicate value case, then step 9 is executed; if c ≠ 0, it indicates data quality anomaly and step 10 is performed.
9) And judging whether the data acquisition frequency is normal or not. If not, executing step 10; if so, step 12 is performed.
10) Informing relevant personnel to check and check the problems in the acquisition flow and solving the problems;
11) it is determined whether the problem has been resolved. If the problem has been solved, perform step 1; if not, step 10 is re-executed.
12) The data acquisition is normal, and the data quality is normal.
The above embodiments are illustrative of the present invention, and are not intended to limit the present invention, and any simple modifications of the present invention are within the scope of the present invention.

Claims (1)

1. A method for managing the quality of effective data in the industrial Internet industry is characterized in that: the method sequentially comprises the following steps:
s1, inputting all gateway MAC addresses, and calculating each data quality judgment index;
s2, judging whether point location data of each gateway is completely collected or not, and if all the point location data are collected, executing the step S3; otherwise, informing related personnel to check the problems in the acquisition flow and solve the problems;
s3, judging whether the key fields are abnormal or not, and if not, executing the step S4; otherwise, informing related personnel to check the problems in the acquisition flow and solve the problems;
s4, judging that data acquisition and data quality are normal;
the method specifically comprises the following steps:
s01, inputting all gateway MAC addresses;
s02, calculating the number of the configured point positions of each gateway;
s03, calculating the number of the collected point positions of each gateway;
s04, calculating the number of null values of each key field;
s05, calculating the repeated number of each key field;
s06, judging whether the number of the configured point locations is equal to that of the collected point locations, if so, representing that all point location data are collected, and executing a step S07; otherwise, representing that part of point location data is not collected, and executing the step S010;
s07, judging whether a null value exists in the related key field, if not, executing the step S08; otherwise, executing step S010;
s08, judging whether a key field has a repeated value, if not, executing the step S09; otherwise, executing step S010;
s09, judging whether the data acquisition frequency is normal or not, if not, executing the step S010, otherwise, executing the step S012;
s010, informing related personnel to check and check the problems in the acquisition process and solving the problems;
s011, judging whether the problem is solved or not, and if the problem is solved, returning to execute the step S01; otherwise, re-executing step S010;
and S012, normal data acquisition and normal data quality.
CN202110370784.7A 2021-04-07 2021-04-07 Effective data quality management method for industrial internet industry Active CN113225221B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110370784.7A CN113225221B (en) 2021-04-07 2021-04-07 Effective data quality management method for industrial internet industry

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110370784.7A CN113225221B (en) 2021-04-07 2021-04-07 Effective data quality management method for industrial internet industry

Publications (2)

Publication Number Publication Date
CN113225221A CN113225221A (en) 2021-08-06
CN113225221B true CN113225221B (en) 2022-08-05

Family

ID=77086473

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110370784.7A Active CN113225221B (en) 2021-04-07 2021-04-07 Effective data quality management method for industrial internet industry

Country Status (1)

Country Link
CN (1) CN113225221B (en)

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100574224C (en) * 2007-03-07 2009-12-23 中控科技集团有限公司 The detection method of Industrial Ethernet data monitoring and device
US20180091390A1 (en) * 2016-09-27 2018-03-29 Ca, Inc. Data validation across monitoring systems
CN110495138B (en) * 2017-05-31 2023-09-29 西门子股份公司 Industrial control system and monitoring method for network security thereof
CN108282026A (en) * 2017-12-27 2018-07-13 河南平高电气股份有限公司 A kind of high-tension switch gear novel maintenance system
CN109656913B (en) * 2018-12-20 2022-12-27 江苏昂内斯电力科技股份有限公司 Data acquisition abnormity recruitment method based on Internet of things
CN109857732A (en) * 2019-01-31 2019-06-07 山东省电子信息产品检验院 A kind of industry internet platform monitoring data transmission switching method and system
CN109981402A (en) * 2019-03-05 2019-07-05 山东浪潮云信息技术有限公司 The real-time detection and appraisal procedure and system of a kind of data collection effect
CN110336703A (en) * 2019-07-12 2019-10-15 河海大学常州校区 Industrial big data based on edge calculations monitors system
CN110941677B (en) * 2019-11-27 2023-04-18 上海西码智能科技股份有限公司 Gateway device, control method and Internet of things control system based on edge calculation
CN111047431A (en) * 2019-12-11 2020-04-21 深圳微众信用科技股份有限公司 Credit service processing device, method and equipment based on big data
CN112235159B (en) * 2020-10-13 2022-05-10 中移(杭州)信息技术有限公司 Gateway quality portrait generation method, system, network equipment and storage medium
CN112506097A (en) * 2020-11-27 2021-03-16 江苏科技大学 Jig frame remote monitoring system and method based on industrial internet

Also Published As

Publication number Publication date
CN113225221A (en) 2021-08-06

Similar Documents

Publication Publication Date Title
JP2008139995A5 (en)
CN113225221B (en) Effective data quality management method for industrial internet industry
CN111782456B (en) Anomaly detection method, device, computer equipment and storage medium
CN111049882B (en) Cache state processing system, method, device and computer readable storage medium
CN113806343B (en) Evaluation method and system for Internet of vehicles data quality
CN113612567B (en) Alignment method and device for data collected by multiple sensors of equipment and electronic equipment
CN117061170B (en) Intelligent manufacturing industry big data analysis method based on feature selection
CN117312290A (en) Method for improving heterogeneous system data quality
CN111754334A (en) Vehicle mortgage risk early warning method and device
KR20180130630A (en) Vulnerability diagnosing and managing system and method of information system using automatic diagnosis tool
CN109688236B (en) Sinkhole domain name processing method and server
CN114866546B (en) PaaS-based one-stop management system for monitoring platform
CN112491584B (en) Service operation safety condition judgment method and device, electronic medium and storage medium
CN112990711B (en) Aluminum alloy template construction monitoring method and system based on site construction
CN112291085B (en) Fault positioning method, device, equipment and medium
CN112687030A (en) Vehicle condition information processing method and device
CN113986990A (en) Data resource acquisition and labeling method and device based on block chain data mining
CN113360359A (en) Index abnormal data tracing method, device, equipment and storage medium
CN112819071A (en) Fault information clustering method and device, computer equipment and storage medium
CN113572628A (en) Data association method and device, computing equipment and computer storage medium
CN112579559A (en) Key value pair management verification method, device, equipment and storage medium
CN112148459B (en) Processing method, device, readable medium and equipment for node association data
CN110968862B (en) Data anomaly detection method and terminal
CN111209130B (en) Fault processing method, system, equipment and medium based on MySQL master-slave replication cluster
CN117171740A (en) Method and device for determining health degree of open source assembly

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant