WO2023050229A1 - Industrial data processing method and apparatus, electronic device, and storage medium - Google Patents

Industrial data processing method and apparatus, electronic device, and storage medium Download PDF

Info

Publication number
WO2023050229A1
WO2023050229A1 PCT/CN2021/121953 CN2021121953W WO2023050229A1 WO 2023050229 A1 WO2023050229 A1 WO 2023050229A1 CN 2021121953 W CN2021121953 W CN 2021121953W WO 2023050229 A1 WO2023050229 A1 WO 2023050229A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
quality
field
site
interaction
Prior art date
Application number
PCT/CN2021/121953
Other languages
French (fr)
Chinese (zh)
Inventor
于禾
王琪
周文晶
田德钰
Original Assignee
西门子股份公司
西门子(中国)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 西门子股份公司, 西门子(中国)有限公司 filed Critical 西门子股份公司
Priority to PCT/CN2021/121953 priority Critical patent/WO2023050229A1/en
Publication of WO2023050229A1 publication Critical patent/WO2023050229A1/en

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00Programme-control systems
    • G05B19/02Programme-control systems electric
    • G05B19/04Programme control other than numerical control, i.e. in sequence controllers or logic controllers
    • G05B19/048Monitoring; Safety

Definitions

  • the present application relates to the technical field of data processing, and in particular to an industrial data processing method, device, electronic equipment and storage medium.
  • the basis of industrial digitalization is field data collected by field devices, such as field data collected by sensors.
  • Industrial systems include control systems, operating systems, management systems, and cloud computing platforms to monitor and control the production process based on field data. Field data collected by field devices has data loss due to communication interruption, and data quality problems of dirty data due to data redundancy.
  • the on-site data of industrial equipment is collected, the on-site data is directly stored in the database.
  • the required data is read from the database through the data warehouse technology (Extract-Transform-Load, ETL). After cleaning, load it to the data demand side.
  • the collected field data is directly stored in the database, and the field data needs to be processed through ETL every time the field data is used, resulting in industrial production
  • the amount of data processing in the process is relatively large.
  • the industrial data processing method, device, electronic equipment and storage medium provided by the present application can reduce the amount of data processing in the industrial production process.
  • an industrial data processing method for processing data generated in an industrial production process comprising:
  • the on-site data that has undergone data quality improvement processing is stored in a pre-created database.
  • the performing quality inspection on the on-site data to obtain data quality information indicating the quality of the on-site data includes:
  • the data difference feature being used to indicate a change trend of the on-site data over time
  • Acquiring data quality information including the data difference feature and each of the first field information.
  • performing data quality improvement processing on the on-site data according to the data quality information includes:
  • the data difference characteristics included in the data quality information determine whether the field data has abnormal characteristics, and if the field data has the abnormal characteristics, delete the abnormal data corresponding to the abnormal characteristics of the field data ;
  • the heartbeat signal of the data source of the on-site data it is judged whether there is data missing in the on-site data, and if there is data missing in the on-site data, data enhancement processing is performed on the on-site data according to a preset data enhancement rule, so as to Supplement missing data in said field data;
  • the first data quality field included in the data quality information it is judged whether the data format of the field data is the target data format, and if the data format of the field data is not the target data format, the field The data format of the data is converted to the target data format.
  • the industrial data processing method further includes:
  • the industrial data processing method further includes:
  • At least one preset second data quality field respectively acquire second field information corresponding to each of the second data quality fields of the second interaction data, where the second data quality field indicates the first Two fields for the quality of the interaction data;
  • If there is a data quality problem in the second interaction data perform data quality improvement processing on the second interaction data according to information in each of the second fields, so as to delete abnormal data in the second interaction data. Supplement missing data in the second interactive data, and perform data format conversion on the second interactive data;
  • an industrial data processing device for processing data generated during industrial production, the device comprising:
  • the acquisition module is used to acquire field data generated in the industrial production process
  • a detection module configured to perform quality detection on the on-site data acquired by the acquisition module, and obtain data quality information indicating the quality of the on-site data
  • a processing module configured to perform data quality improvement processing on the on-site data according to the data quality information acquired by the detection module, so as to solve quality problems existing in the on-site data;
  • An output module configured to store the on-site data processed by the processing module for data quality improvement into a pre-created database.
  • an electronic device including: a processor, a memory, a communication interface, and a communication bus, and the processor, the memory, and the communication interface are completed through the communication bus mutual communication;
  • the memory is used to store at least one executable instruction, and the executable instruction causes the processor to perform operations corresponding to the industrial data processing method provided in the first aspect or any possible implementation manner of the first aspect.
  • a computer-readable storage medium stores computer instructions, and when the computer instructions are executed by a processor, the processing The processor executes the industrial data processing method provided in the first aspect or any possible implementation manner of the first aspect.
  • a computer program including computer-executable instructions. When executed, the computer-executable instructions cause at least one processor to perform the above-mentioned first aspect or the first aspect.
  • An industrial data processing method provided by any possible implementation.
  • a computer program product the computer program product is tangibly stored on a computer-readable medium and includes computer-executable instructions, and the computer-executable instructions are executed When at least one processor executes the industrial data processing method provided in the first aspect or any possible implementation manner of the first aspect.
  • the on-site data is firstly inspected for data quality, and the data quality information indicating the quality of the on-site data is obtained, and then the on-site data is subjected to data quality improvement processing according to the data quality information.
  • Solve the data quality problems existing in the field data and then store the field data processed by data quality improvement into the pre-created database. Since the on-site data stored in the database is processed through data quality improvement, the data quality problems existing in the on-site data have been solved, and the applications and services in the industrial production system can directly read the required on-site data from the database for use. It is not necessary to process the field data through ETL every time the field data is read, thereby reducing the amount of data processing in the industrial production process.
  • FIG. 1 is a flow chart of an industrial data processing method provided in Embodiment 1 of the present application.
  • FIG. 2 is a flow chart of a method for acquiring data quality information provided in Embodiment 2 of the present application;
  • FIG. 3 is a flow chart of a data quality improvement processing method provided in Embodiment 3 of the present application.
  • FIG. 4 is a flowchart of an industrial data processing method provided in Embodiment 4 of the present application.
  • Embodiment 5 is a flow chart of an industrial data processing method provided in Embodiment 5 of the present application.
  • FIG. 6 is a schematic diagram of an industrial data processing device provided in Embodiment 6 of the present application.
  • FIG. 7 is a schematic diagram of an electronic device provided in Embodiment 7 of the present application.
  • Processing module 604 Output module 702: Processor
  • the collected field data is directly stored in the database.
  • the Warehouse technology ETL Extract-Transform-Load
  • the warehouse technology ETL Extract-Transform-Load
  • the collected field data is stored in the database, so the field data stored in the database has data quality problems such as missing data and dirty data. Every time an application or service in an industrial production system uses on-site data, it needs to process the on-site data through ETL, resulting in a large amount of data processing in the industrial production process.
  • the quality of the on-site data is inspected to obtain the data quality information used to indicate the quality of the on-site data, and then the data quality of the on-site data is improved according to the data quality information processing to solve the data quality problems existing in the field data, and then store the field data that has undergone data quality improvement processing into the database. Since the on-site data stored in the database is processed by data quality improvement, the data quality problems existing in the on-site data have been solved.
  • the application or service in the industrial production system uses the on-site data, it can directly read the required data from the database. There is no need to process the field data through ETL every time the field data is used, which can reduce the amount of data processing in the industrial production process.
  • FIG. 1 is a flow chart of an industrial data processing method 100 provided in Embodiment 1 of the present application, which is used to process data generated in the industrial production process. As shown in FIG. 1 , the industrial data processing method 100 includes the following steps:
  • Step 101 acquiring on-site data generated in the industrial production process.
  • on-site data In the process of industrial production, industrial equipment in the industrial production system will generate on-site data, and the on-site data is used to indicate the operating status of the industrial equipment.
  • On-site data can be obtained from gateway devices, sensors, on-site applications or on-site systems, etc., and manual on-site operation information can also be obtained as on-site data.
  • on-site data can be order information, temperature information, vibration information, rotational speed information, etc.
  • field data can be obtained through different types of communication protocols, such as through OPC UA (OPC is the abbreviation of OLE for Process Control, OLE technology applied to process control, OLE technology is Object Linking and Embedding, refers to Object connection and embedding, UA is the abbreviation of Unified Architecture, refers to unified architecture), message queue telemetry transmission (Message Queuing Telemetry Transport, MQTT) or hypertext transfer protocol (Hyper Text Transfer Protocol, HTTP) and other communication protocols to obtain field data.
  • OPC UA OPC is the abbreviation of OLE for Process Control
  • OLE technology is Object Linking and Embedding
  • Object connection and embedding Object connection and embedding
  • UA is the abbreviation of Unified Architecture, refers to unified architecture
  • message queue telemetry transmission Message Queuing Telemetry Transport, MQTT
  • HTTP Hyper Text Transfer Protocol
  • the data quality of the field data is detected based on the communication protocol for obtaining the field data, and the data quality information for indicating the quality of the field data is obtained. Through the data quality information, it can be determined whether there are data quality problems such as missing data and dirty data in the field data.
  • the quality inspection of on-site data can obtain on-site data from different data sources through different communication protocols through the software development kit (Software Development Kit, SDK) or embedded program modules, and perform data analysis on the received on-site data. Quality inspection to obtain corresponding data quality information.
  • Step 103 according to the data quality information, perform data quality improvement processing on the on-site data, so as to solve the quality problems existing in the on-site data.
  • the data quality information can indicate the data quality problems existing in the field data, it is possible to improve the data quality of the field data based on the data quality information to solve the data quality problems existing in the field data. For example, if the data quality information indicates that there is data missing in the on-site data, the missing data in the on-site data can be supplemented according to the data quality information. Dirty data deletion.
  • Step 104 storing the on-site data processed for data quality improvement into a pre-created database.
  • the on-site data is stored in the database for use in applications or services in industrial production systems.
  • the application or service in the industrial production system can read the required field data from the database through the data reading interface of the database, so as to perform industrial equipment control, production plan formulation, maintenance plan formulation, etc. based on the field data.
  • the embodiment of the present application after obtaining the on-site data of the industrial production system, first perform data quality inspection on the on-site data, obtain data quality information indicating the quality of the on-site data, and then perform data quality improvement processing on the on-site data according to the data quality information , solve the data quality problems existing in the field data, and then store the field data processed by data quality improvement into the pre-created database. Since the on-site data stored in the database is processed through data quality improvement, the data quality problems existing in the on-site data have been solved, and the applications and services in the industrial production system can directly read the required on-site data from the database for use. It is not necessary to process the field data through ETL every time the field data is read, thereby reducing the amount of data processing in the industrial production process.
  • Embodiment 1 On the basis of the industrial data processing method provided in Embodiment 1, when the data quality information is obtained by performing quality inspection on the on-site data, the change trend of the on-site data over time and some key fields in the on-site data that can reflect the data quality as data quality information.
  • FIG. 2 is a flowchart of a data quality information acquisition method 200 provided in Embodiment 2 of the present application. As shown in FIG. 2 , the data quality information acquisition method 200 includes the following steps:
  • Step 201 calculating data difference characteristics of field data.
  • Data variance features are used to indicate trends in field data over time. For time series data, if there is a sudden change in the data and the data source is running normally, it means that abnormal data is generated due to interference factors. Therefore, according to the data difference characteristics, it can be judged whether the field data includes abnormal data.
  • the data difference feature may be the variance, mean, and standard deviation of field data, etc., which characterize the parameter values of field data distribution.
  • Step 202 Obtain first field information corresponding to each first data quality field in the field data according to at least one preset first data quality field.
  • the first data quality field is a field that can indicate the data quality of the field data in the on-site data.
  • the first data quality field can include data acquisition time, numerical value, data sequence, signal quality, and the last variance value (last D-value ), the number of characters and data bits included, etc.
  • the first data quality field may be set through configuration information or a user experience (User Experience, UX) service.
  • UX User Experience
  • the first field information corresponding to each first data quality field can be obtained directly from the communication field, For example, when obtaining field data through OPC UA, the first field information can be obtained directly from the communication field.
  • first field information corresponding to each first data quality field may be obtained by detecting and calculating on-site data.
  • Step 203 Obtain data quality information including data difference features and first field information.
  • the data difference characteristics and each first field information are used as data quality information to indicate the field data data quality.
  • the data difference characteristics and the first field information can be embedded into the communication field according to the communication protocol used by the SDK or embedded program module, and the field data Send them together to the data quality controller, and then the data quality controller can improve the data quality of the received field data according to the received data quality information.
  • the data difference feature can indicate the change trend of the field data over time, so it can be determined according to the data difference feature whether there is abnormal data in the field data that suddenly increases or decreases suddenly, and the first data quality field is the indicator data The key field of quality, so according to the first field information corresponding to the first data field, it can be determined whether there are problems such as data format in the field data.
  • Obtaining data quality information including data difference characteristics and first field information, and then improving the data quality of field data according to data difference characteristics and each first field information, can more comprehensively solve data quality problems existing in field data and ensure
  • the on-site data stored in the database has no or less data quality problems, thereby ensuring that applications and services in the industrial production system can operate normally based on the on-site data in the database.
  • the abnormal data problems and data missing problems included in the field data can be analyzed according to the data quality information and format exceptions are handled.
  • FIG. 3 is a flowchart of a data quality improvement processing method 300 provided in Embodiment 3 of the present application. As shown in FIG. 3 , the data quality improvement processing method 300 includes the following steps:
  • Step 301 according to the data difference features included in the data quality information, judge whether there are abnormal features in the field data, if yes, go to step 302 , if not N, go to step 303 .
  • step 302 is executed accordingly; if there is no sudden increase or decrease in the field data, it is determined that there is no abnormal feature in the field data, and step 303 is executed accordingly.
  • Anomaly features can be parameters such as variance, mean, and standard deviation of field data that reflect changes in field data over time.
  • Step 302 deleting the abnormal data corresponding to the abnormal feature of the field data.
  • the abnormal data that causes the abnormal characteristics to appear in the field data can be determined. Dirty data problem.
  • Step 303 according to the heartbeat signal of the data source of the on-site data, judge whether there is data missing in the on-site data, if yes, go to step 304 , if not N, go to step 305 .
  • the data source of the field data sends a heartbeat signal to determine whether the communication is connected. If the heartbeat signal of the data source of the field data times out, that is, the data source from the field data is not received within the preset signal cycle The heartbeat signal indicates that the communication connection with the data source of the field data has been disconnected, so the field data cannot be obtained from the data source of the field data, resulting in data loss in the field data. If the heartbeat signal from the data source of the field data can be received according to the preset signal cycle, it means that the communication with the data source of the field data remains connected, and the field data can be obtained from the data source of the field data normally, so as to determine There is no missing data in field data.
  • Step 304 perform data enhancement processing on the field data according to preset data enhancement rules.
  • data enhancement processing is performed on the on-site data to generate corresponding data as missing data to supplement the on-site data, so as to solve the problem of missing data in the on-site data.
  • the missing field data can be supplemented according to the change trend of the field data over time, or the average value of the obtained field data can be calculated, and the calculated average value can be used as the missing field data.
  • Step 305 according to the first data quality field included in the data quality information, judge whether the data format of the field data is the target data format, if yes, execute step 307 , if not N, execute step 306 .
  • the first data quality field is a field that reflects the quality of on-site data. According to the first data quality field, it can be judged whether the data format of the on-site data is the target data format required by the application and service in the industrial production system. If the data format of the on-site data is the target data format, the on-site data can be directly stored in the database, and step 307 is executed accordingly. If the data format of the field data is not the target data format, the data format of the field data needs to be converted, and step 306 is executed accordingly.
  • Step 306 converting the data format of the field data into the target data format.
  • the data format of the field data is converted, and the data format of the field data is converted into the target data format.
  • the field data is converted to include 16 bits by complementing 0 or 1. Bit-by-bit data to meet the needs of applications or services in industrial production systems.
  • Step 307 store the field data in the database.
  • the dirty data included in the field data is deleted, the missing data is supplemented in the field data, and the data format of the field data is converted to the target data required by the application or service in the industrial production system format, which can more comprehensively solve the data quality problems of on-site data, ensure that the on-site data stored in the data block does not exist or has only a few data quality problems, and thus ensures that the applications and services in the industrial production system can directly use the data On-site data in the block, thereby reducing the amount of data processing in the industrial production process.
  • FIG. 4 is a flowchart of an industrial data processing method 400 provided in Embodiment 4 of the present application. As shown in FIG. 4 , the industrial data processing method 400 includes the following steps:
  • Step 401 Obtain the first interaction data output by the first data output terminal in the industrial production system through the presentation layer state conversion interface.
  • Applications and services in the industrial production system can interact through the presentation layer state transition interface (Restful interface).
  • An application can interact with other applications or services through the presentation layer state transition interface, and a service can interact with other applications or services through the presentation layer state transition interface.
  • the interface interacts with other services or applications, so the first data output end may be an application or service, and the corresponding first data receiving end may also be an application or service.
  • the first interaction data output by the first data output terminal is acquired.
  • Step 402 perform semantic search on the fields in the first interaction data, and judge whether the fields in the first interaction match the pre-created semantic template, if yes, perform step 404 , if not N, perform step 403 .
  • the semantic template defines the interaction data between the first data output terminal and the first data receiving terminal.
  • the fields to include, and the data format for each field.
  • step 403 is executed accordingly.
  • Step 403 Perform data quality improvement processing on the first interaction data, so as to solve quality problems existing in the first interaction data.
  • the first interaction data does not match the semantic template, it means that there is a data quality problem in the first interaction data, and data quality improvement processing is performed on the first interaction data to solve the data quality problem in the first interaction data.
  • the data quality improvement process of the field data in the third embodiment such as deleting dirty data in the first interaction data, supplementing missing data in the first interaction data, and improving the data quality of the first interaction data.
  • Step 404 Send the first interaction data after the quality problem is solved to the first data receiving end.
  • the first interaction data matches the semantic template, that is, there is no data quality problem in the first interaction data
  • the first interaction data is directly sent to the first data receiving end. If it is determined that the first interaction data does not match the semantic template, that is, there is a data quality problem in the first interaction data, after performing data quality improvement processing on the first interaction data, the first interaction data that has undergone data quality improvement processing is sent to the first interaction data.
  • a data receiving end If it is determined that the first interaction data matches the semantic template, that is, there is no data quality problem in the first interaction data, the first interaction data is directly sent to the first data receiving end. If it is determined that the first interaction data does not match the semantic template, that is, there is a data quality problem in the first interaction data, after performing data quality improvement processing on the first interaction data, the first interaction data that has undergone data quality improvement processing is sent to the first interaction data. A data receiving end.
  • the semantic template is created in advance, and whether the first interaction data matches the semantic template is judged through semantic search, so as to determine whether the first interaction data has data quality problems, and the first interaction data with data quality problems Perform data quality improvement processing, send the first interactive data after solving the data quality problem to the first data receiving end, and ensure that the first data output end and the first data receiving end interacting through the presentation layer state conversion interface can interact normally, so that In the industrial production system, the applications and services that interact through the state transition interface of the presentation layer can normally obtain the interactive data, thereby ensuring the normal operation of the industrial production system and the efficiency of industrial production.
  • the industrial data processing method provided by Embodiment 4 is used to solve the quality problem of the interactive data transmitted through the presentation layer state transition interface.
  • the application and service in the industrial production system can also interact through the message queue (Message queue).
  • Message queue message queue
  • the quality of data in the production system can also solve the quality problem of interactive data transmitted through message queues.
  • Fig. 5 is a flowchart of an industrial data processing method 500 provided in Embodiment 5 of the present application. As shown in Fig. 5, the industrial data processing method 500 includes the following steps:
  • Step 501 acquire the second interaction data output by the second data output terminal in the industrial production system through the message queue.
  • Applications and services in industrial production systems can interact through message queues.
  • An application can interact with other applications or services through message queues, and a service can also interact with other services or applications through message queues, so output data
  • the second data output end may be an application or a service, and the corresponding second data receiving end receiving data may also be an application or a service.
  • the second interaction data output by the second data output terminal is acquired.
  • Step 502 Acquire second field information corresponding to each second data quality field of the second interaction data according to at least one preset second data quality field.
  • At least one second data quality field used to indicate the data quality of the second interaction data is predetermined, and after the second interaction data is obtained, each The second field information corresponding to the second data quality field.
  • the second data quality field may include data acquisition time, value size, data sequence, signal quality, last variance value (last D-value), number of included characters and number of data bits, etc.
  • the second data quality field can be set through configuration information or user experience (User Experience, UX) service.
  • UX User Experience
  • Step 503 judging whether there is a data quality problem in the second interaction data according to the information of each second field, if yes, go to step 504 , if not N, go to step 505 .
  • the second data quality field is a field indicating the quality of the second interaction data
  • Step 504 Perform data quality improvement processing on the second interaction data according to the information of each second field.
  • the second interaction data When there is a data quality problem in the second interaction data, perform data quality improvement processing on the second interaction data according to the information of each second field, so as to delete abnormal data in the second interaction data and supplement missing data in the second interaction data, And performing data format conversion on the second interaction data.
  • Deleting the abnormal data in the second interactive data is mainly to delete dirty data such as redundant data in the second interactive data.
  • the data format conversion of the second interaction data is mainly to convert the second field information in the second interaction data into the data format required by the second data receiving end.
  • Step 505. Send the second interaction data after the quality problem is solved to the second data receiving end.
  • the second interaction data is directly sent to the second data receiving end. If it is determined that there is no data quality problem in the second interaction data, the second interaction data is directly sent to the second data receiving end. If it is determined that there is a data quality problem in the second interaction data, after data quality improvement processing is performed on the second interaction data, the second interaction data that has undergone data quality improvement processing is sent to the second data receiving end.
  • the second data quality field used to indicate the quality of the second interaction data is predetermined, and the second field information corresponding to each second data quality field is extracted from the second interaction data, and then according to the The two-field information can perform data quality improvement processing on the second interaction data, and send the second interaction data after solving the data quality problem to the second data receiving end, so as to ensure the second data output end and the second data receiving end through the message queue interaction
  • the terminal can interact normally, so that the applications and services interacting through the message queue in the industrial production system can normally obtain the interactive data, thereby ensuring the normal operation of the industrial production system and the efficiency of industrial production.
  • Fig. 6 is a schematic diagram of an industrial data processing device 600 provided in Embodiment 6 of the present application, which is used to process data generated in the industrial production process. As shown in Fig. 6, the industrial data processing device 600 includes:
  • An acquisition module 601, configured to acquire field data generated during industrial production
  • the detection module 602 is configured to perform quality detection on the on-site data acquired by the acquisition module 601, and obtain data quality information indicating the quality of the on-site data;
  • the processing module 603 is configured to perform data quality improvement processing on the on-site data according to the data quality information acquired by the detection module 602, so as to solve the quality problems existing in the on-site data;
  • the output module 604 is configured to store the on-site data processed by the processing module 603 for data quality improvement into a pre-created database.
  • the first acquisition module 601 can be used to execute step 101 in the first embodiment above
  • the detection module 602 can be used to execute step 102 in the first embodiment above
  • the processing module 603 can be used to execute the step 102 in the first embodiment above.
  • the output module 604 can be used to execute step 104 in the first embodiment above.
  • the detection module 602 may be configured to perform the following operations:
  • first field information corresponding to each first data quality field of the field data is respectively obtained, and the first data quality field is a field indicating the quality of the field data;
  • Data quality information including data difference features and information of each first field is acquired.
  • the detection module 602 may be used to execute steps 201 to 203 in the second embodiment above.
  • processing module 603 may be configured to perform the following operations:
  • the data difference characteristics included in the data quality information it is judged whether there are abnormal characteristics in the field data, and if there are abnormal characteristics in the field data, the abnormal data corresponding to the abnormal characteristics of the field data is deleted;
  • the heartbeat signal of the data source of the on-site data it is judged whether there is data missing in the on-site data. If there is data missing in the on-site data, data enhancement processing is performed on the on-site data according to the preset data enhancement rules to supplement the missing data in the on-site data. ;
  • the data format of the field data is the target data format, and if the data format of the field data is not the target data format, then the data format of the field data is converted into the target data format.
  • the processing module 603 may be configured to execute steps 301 to 306 in the third embodiment above.
  • the obtaining module 601 is also used to obtain the first interaction data output by the first data output terminal in the industrial production system through the presentation layer state conversion interface;
  • the detection module 602 is further configured to perform semantic retrieval on the fields in the first interaction data acquired by the acquisition module 601, and determine whether the fields in the first interaction data match the pre-created semantic template;
  • the processing module 603 is further configured to perform data quality improvement processing on the first interaction data when the detection module 602 determines that the fields in the first interaction data do not match the semantic template, so as to solve the quality problems existing in the first interaction data;
  • the output module 604 is further configured to send the first interaction data processed by the processing module 603 to improve the data quality to the first data receiving end in the industrial production system.
  • the acquisition module 601 can be used to execute the step 401 in the fourth embodiment above
  • the detection module 602 can be used to execute the step 402 in the fourth embodiment above
  • the processing module 603 can be used to execute the steps in the fourth embodiment above 403.
  • the output module 604 can be used to execute step 404 in the fourth embodiment above.
  • the obtaining module 601 is also used to obtain the second interaction data output by the second data output terminal in the industrial production system through the message queue;
  • the detection module 602 is further configured to respectively acquire second field information corresponding to each second data quality field of the second interaction data according to at least one preset second data quality field, where the second data quality field indicates the second The field of the quality of the interactive data, and judge whether there is a data quality problem in the second interactive data according to the information of each second field;
  • the processing module 603 is further configured to determine in the detection module 602 that there is a data quality problem in the second interaction data, and perform data quality improvement processing on the second interaction data according to the information of each second field, so as to delete abnormal data in the second interaction data , supplementing missing data in the second interaction data, and performing data format conversion on the second interaction data;
  • the output module 604 is further configured to send the second interaction data processed by the processing module 603 to improve the data quality to the second data receiving end in the industrial production system.
  • the acquiring module 601 can be used to execute step 501 in the fifth embodiment above
  • the detection module 602 can be used to execute step 502 and step 503 in the fifth embodiment above
  • the processing module 603 can be used to execute the fifth embodiment above
  • the output module 604 can be used to execute step 505 in the fifth embodiment above.
  • FIG. 7 is a schematic diagram of an electronic device provided in Embodiment 7 of the present application.
  • the specific embodiment of the present application does not limit the specific implementation of the electronic device.
  • an electronic device 700 provided by an embodiment of the present application includes: a processor (processor) 702 , a communication interface (Communications Interface) 704 , a memory (memory) 706 , and a communication bus 708 . in:
  • the processor 702 , the communication interface 704 , and the memory 706 communicate with each other through the communication bus 708 .
  • the communication interface 704 is used for communicating with other electronic devices or servers.
  • the processor 702 is configured to execute the program 710, specifically, may execute relevant steps in the above-mentioned embodiments of the industrial data processing method.
  • the program 710 may include program codes including computer operation instructions.
  • the processor 702 may be a central processing unit CPU, or an ASIC (Application Specific Integrated Circuit), or one or more integrated circuits configured to implement the embodiments of the present application.
  • the one or more processors included in the smart device may be of the same type, such as one or more CPUs, or may be different types of processors, such as one or more CPUs and one or more ASICs.
  • the memory 706 is used for storing the program 710 .
  • the memory 706 may include a high-speed RAM memory, and may also include a non-volatile memory (non-volatile memory), such as at least one disk memory.
  • the program 710 may specifically be used to enable the processor 702 to execute the industrial data processing method in any of the foregoing embodiments.
  • the data quality inspection is first performed on the on-site data, and the data quality information indicating the quality of the on-site data is obtained, and then the data quality of the on-site data is improved according to the data quality information Processing to solve the data quality problems existing in the field data, and then store the field data processed by data quality improvement into the pre-created database. Since the on-site data stored in the database is processed through data quality improvement, the data quality problems existing in the on-site data have been solved, and the applications and services in the industrial production system can directly read the required on-site data from the database for use. It is not necessary to process the field data through ETL every time the field data is read, thereby reducing the amount of data processing in the industrial production process.
  • the present application also provides a computer-readable storage medium storing instructions for causing a machine to execute the industrial data processing method as described herein.
  • a system or device equipped with a storage medium may be provided, on which a software program code for realizing the functions of any of the above embodiments is stored, and the computer (or CPU or MPU of the system or device) ) to read and execute the program code stored in the storage medium.
  • the program code itself read from the storage medium can realize the function of any one of the above-mentioned embodiments, so the program code and the storage medium storing the program code constitute a part of the present application.
  • Examples of storage media for providing program code include floppy disks, hard disks, magneto-optical disks, optical disks (such as CD-ROM, CD-R, CD-RW, DVD-ROM, DVD-RAM, DVD-RW, DVD+RW), Tape, non-volatile memory card, and ROM.
  • the program code can be downloaded from a server computer via a communication network.
  • the program code read from the storage medium is written into the memory provided in the expansion board inserted into the computer or written into the memory provided in the expansion module connected to the computer, and then based on the program code
  • the instruction causes the CPU installed on the expansion board or the expansion module to perform some or all of the actual operations, thereby realizing the functions of any one of the above-mentioned embodiments.
  • the embodiment of the present application also provides a computer program, including computer executable instructions, when executed, the computer executable instructions cause at least one processor to execute the industrial data processing methods provided in the above embodiments.
  • the embodiment of the present application also provides a computer program product, the computer program product is tangibly stored on a computer-readable medium and includes computer-executable instructions, and the computer-executable instructions cause at least one processor to Execute the industrial data processing methods provided by the above-mentioned embodiments. It should be understood that the solutions in this embodiment have the corresponding technical effects in the foregoing method embodiments, and details are not repeated here.
  • the hardware modules may be implemented mechanically or electrically.
  • a hardware module may include permanently dedicated circuitry or logic (such as a dedicated processor, FPGA or ASIC) to perform the corresponding operations.
  • the hardware modules may also include programmable logic or circuits (such as general-purpose processors or other programmable processors), which can be temporarily set by software to complete corresponding operations.
  • the specific implementation mechanical way, or a dedicated permanent circuit, or a temporary circuit

Landscapes

  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Automation & Control Theory (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Provided are an industrial data processing method and apparatus, an electronic device, and a storage medium. The industrial data processing method is used for processing data generated in an industrial production process, and comprises: acquiring field data generated in an industrial production process (101); performing quality inspection on the field data to obtain data quality information for indicating the quality of the field data (102); performing data quality improvement on the field data according to the data quality information, to solve a quality problem existing in the field data (103); and storing the field data subjected to the data quality improvement into a pre-created database (104). The amount of data to be processed in an industrial production process can be reduced.

Description

工业数据处理方法、装置、电子设备和存储介质Industrial data processing method, device, electronic device and storage medium 技术领域technical field
本申请涉及数据处理技术领域,尤其涉及一种工业数据处理方法、装置、电子设备和存储介质。The present application relates to the technical field of data processing, and in particular to an industrial data processing method, device, electronic equipment and storage medium.
背景技术Background technique
随着工业数字化进程的不断推进,工业生产过程中会产生大量数据,基于数据可以实现生产过程控制、生产计划制定、生产设备维护等,使得工业生产更加智能,在提高生成效率的基础上,降低人员的劳动强度。工业数字化的基础是现场设备采集的现场数据,比如传感器采集的现场数据,工业系统包括的控制系统、运营系统、管理系统及云计算平台基于现场数据实现生产过程的监测和控制。通过现场设备采集的现场数据,存在由于通信中断而导致的数据缺失,以及由于数据冗余而导致存在脏数据的数据质量问题。With the continuous advancement of the industrial digitalization process, a large amount of data will be generated in the industrial production process. Based on the data, production process control, production plan formulation, production equipment maintenance, etc. can be realized, making industrial production more intelligent. On the basis of improving production efficiency, reduce Labor intensity of personnel. The basis of industrial digitalization is field data collected by field devices, such as field data collected by sensors. Industrial systems include control systems, operating systems, management systems, and cloud computing platforms to monitor and control the production process based on field data. Field data collected by field devices has data loss due to communication interruption, and data quality problems of dirty data due to data redundancy.
目前,采集到工业设备的现场数据后,直接将现场数据存储到数据库中,在使用现场数据时,通过数据仓库技术(Extract-Transform-Load,ETL)从数据库中读取所需数据,对数据进行清洗后加载到数据需求端。At present, after the on-site data of industrial equipment is collected, the on-site data is directly stored in the database. When using the on-site data, the required data is read from the database through the data warehouse technology (Extract-Transform-Load, ETL). After cleaning, load it to the data demand side.
由于采集到的现场数据存在数据缺失、包括脏数据等数据质量问题,直接将采集到的现场数据存储到数据库中,在每次使用现场数据时都需要通过ETL对现场数据进行处理,导致工业生产过程中的数据处理量较大。Due to data quality problems such as missing data and dirty data in the collected field data, the collected field data is directly stored in the database, and the field data needs to be processed through ETL every time the field data is used, resulting in industrial production The amount of data processing in the process is relatively large.
发明内容Contents of the invention
有鉴于此,本申请提供的工业数据处理方法、装置、电子设备和存储介质,能够降低工业生产过程中的数据处理量。In view of this, the industrial data processing method, device, electronic equipment and storage medium provided by the present application can reduce the amount of data processing in the industrial production process.
根据本申请实施例的第一方面,提供了一种工业数据处理方法,用于对工业生产过程中产生的数据进行处理,所述方法包括:According to the first aspect of the embodiments of the present application, there is provided an industrial data processing method for processing data generated in an industrial production process, the method comprising:
获取工业生产过程中产生的现场数据;Obtain on-site data generated during industrial production;
对所述现场数据进行质量检测,获得用于指示所述现场数据的质量的数据质量信息;performing quality inspection on the on-site data, and obtaining data quality information indicating the quality of the on-site data;
根据所述数据质量信息,对所述现场数据进行数据质量提升处理,以解决所述现场数据存在的质量问题;performing data quality improvement processing on the on-site data according to the data quality information, so as to solve quality problems existing in the on-site data;
将经过数据质量提升处理的所述现场数据存储到预先创建的数据库中。The on-site data that has undergone data quality improvement processing is stored in a pre-created database.
在第一种可能的实现方式中,结合上述第一方面,所述对所述现场数据进行质量检测,获得用于指示所述现场数据的质量的数据质量信息,包括:In a first possible implementation manner, in combination with the first aspect above, the performing quality inspection on the on-site data to obtain data quality information indicating the quality of the on-site data includes:
计算所述现场数据的数据差异特征,所述数据差异特征用于指示所述现场数据随时间的变化趋势;calculating a data difference feature of the on-site data, the data difference feature being used to indicate a change trend of the on-site data over time;
根据预先设定的至少一个第一数据质量字段,分别获取所述现场数据对应于每个所述第一数据质量字段的第一字段信息,所述第一数据质量字段为指示所述现场数据的质量的字段;Acquire the first field information corresponding to each of the first data quality fields of the field data according to at least one preset first data quality field, and the first data quality fields indicate the fields of the field data quality field;
获取包括所述数据差异特征和各所述第一字段信息的数据质量信息。Acquiring data quality information including the data difference feature and each of the first field information.
在第二种可能的实现方式中,结合上述第一种可能的实现方式,所述根据所述数据质量信息,对所述现场数据进行数据质量提升处理,包括:In a second possible implementation manner, in combination with the first possible implementation manner above, performing data quality improvement processing on the on-site data according to the data quality information includes:
根据所述数据质量信息包括的所述数据差异特征,判断所述现场数据是否存在异常特征,若所述现场数据存在所述异常特征,则删除所述现场数据的所述异常特征对应的异常数据;According to the data difference characteristics included in the data quality information, determine whether the field data has abnormal characteristics, and if the field data has the abnormal characteristics, delete the abnormal data corresponding to the abnormal characteristics of the field data ;
根据所述现场数据的数据源的心跳信号,判断所述现场数据是否存在数据缺失,若所述现场数据存在数据缺失,则按照预设的数据增强规则对所述现场数据进行数据增强处理,以在所述现场数据中补充缺失的数据;According to the heartbeat signal of the data source of the on-site data, it is judged whether there is data missing in the on-site data, and if there is data missing in the on-site data, data enhancement processing is performed on the on-site data according to a preset data enhancement rule, so as to Supplement missing data in said field data;
根据所述数据质量信息包括的所述第一数据质量字段,判断所述现场数据的数据格式是否为目标数据格式,若所述现场数据的数据格式不是所述目标数据格式,则将所述现场数据的数据格式转换为所述目标数据格式。According to the first data quality field included in the data quality information, it is judged whether the data format of the field data is the target data format, and if the data format of the field data is not the target data format, the field The data format of the data is converted to the target data format.
在第三种可能的实现方式中,结合上述第一方面,所述工业数据处理方法还包括:In a third possible implementation manner, in combination with the first aspect above, the industrial data processing method further includes:
获取工业生产系统中第一数据输出端通过表现层状态转换接口输出的第一交互数据;Obtain the first interaction data output by the first data output terminal in the industrial production system through the presentation layer state conversion interface;
对所述第一交互数据中的字段进行语义检索,判断所述第一交互数据中的字段是否与预先创建的语义模板相匹配;Perform semantic search on the fields in the first interaction data, and judge whether the fields in the first interaction data match the pre-created semantic template;
若所述第一交互数据中的字段与所述语义模板不匹配,则对所述第一交互数据进行数据质量提升处理,以解决所述第一交互数据存在的质量问题;If the fields in the first interaction data do not match the semantic template, perform data quality improvement processing on the first interaction data, so as to solve the quality problems existing in the first interaction data;
将经过数据质量提升处理的所述第一交互数据发送给所述工业生产系统中的第一数据接收端。Sending the first interaction data that has undergone data quality improvement processing to a first data receiving end in the industrial production system.
在第四种可能的实现方式中,结合上述第一方面或第一方面的任一可能的实现方式,所述工业数据处理方法还包括:In a fourth possible implementation manner, in combination with the above first aspect or any possible implementation manner of the first aspect, the industrial data processing method further includes:
获取工业生产系统中第二数据输出端通过消息队列输出的第二交互数据;Obtain the second interaction data output by the second data output terminal in the industrial production system through the message queue;
根据预先设定的至少一个第二数据质量字段,分别获取所述第二交互数据对应于每个所述第二数据质量字段的第二字段信息,所述第二数据质量字段为指示所述第二交互数据的质量的字段;According to at least one preset second data quality field, respectively acquire second field information corresponding to each of the second data quality fields of the second interaction data, where the second data quality field indicates the first Two fields for the quality of the interaction data;
根据各所述第二字段信息判断所述第二交互数据是否存在数据质量问题;judging whether there is a data quality problem in the second interaction data according to the information in each second field;
若所述第二交互数据存在数据质量问题,则根据各所述第二字段信息,对所述第二交互数据进行数据质量提升处理,以删除所述第二交互数据中的异常数据,在所述第二交互数据中补充缺失的数据,以及对所述第二交互数据进行数据格式转换;If there is a data quality problem in the second interaction data, perform data quality improvement processing on the second interaction data according to information in each of the second fields, so as to delete abnormal data in the second interaction data. Supplement missing data in the second interactive data, and perform data format conversion on the second interactive data;
将经过数据质量提升处理的所述第二交互数据发送给所述工业生产系统中的第二数据接收端。Sending the second interaction data that has undergone data quality improvement processing to a second data receiving end in the industrial production system.
根据本申请实施例的第二方面,还提供了一种工业数据处理装置,用于对工业生产过程中产生的数据进行处理,所述装置包括:According to the second aspect of the embodiments of the present application, there is also provided an industrial data processing device for processing data generated during industrial production, the device comprising:
获取模块,用于获取工业生产过程中产生的现场数据;The acquisition module is used to acquire field data generated in the industrial production process;
检测模块,用于对所述获取模块获取到的所述现场数据进行质量检测,获得用于指示所述现场数据的质量的数据质量信息;A detection module, configured to perform quality detection on the on-site data acquired by the acquisition module, and obtain data quality information indicating the quality of the on-site data;
处理模块,用于根据所述检测模块获取到的所述数据质量信息,对所述现场数据进行数据质量提升处理,以解决所述现场数据存在的质量问题;A processing module, configured to perform data quality improvement processing on the on-site data according to the data quality information acquired by the detection module, so as to solve quality problems existing in the on-site data;
输出模块,用于将经过所述处理模块的数据质量提升处理的所述现场数据存储到预先创建的数据库中。An output module, configured to store the on-site data processed by the processing module for data quality improvement into a pre-created database.
根据本申请实施例的第三方面,还提供了一种电子设备,包括:处理器、存储器、通信接口和通信总线,所述处理器、所述存储器和所述通信接口通过所述通信总线完成相互间的通信;According to the third aspect of the embodiments of the present application, there is also provided an electronic device, including: a processor, a memory, a communication interface, and a communication bus, and the processor, the memory, and the communication interface are completed through the communication bus mutual communication;
所述存储器用于存储至少一可执行指令,所述可执行指令使所述处理器执行如上述第一方面或第一方面的任一可能的实现方式提供的工业数据处理方法对应的操作。The memory is used to store at least one executable instruction, and the executable instruction causes the processor to perform operations corresponding to the industrial data processing method provided in the first aspect or any possible implementation manner of the first aspect.
根据本申请实施例的第四方面,还提供了一种算机可读存储介质,所述计算机可读存储介质上存储有计算机指令,所述计算机指令在被处理器执行时,使所述处理器执行如上述第一方面或第一方面的任一可能的实现方式提供的工业数据处理方法。According to the fourth aspect of the embodiments of the present application, there is also provided a computer-readable storage medium, the computer-readable storage medium stores computer instructions, and when the computer instructions are executed by a processor, the processing The processor executes the industrial data processing method provided in the first aspect or any possible implementation manner of the first aspect.
根据本申请实施例的第五方面,还提供了一种计算机程序,包括计算机可执行指令,所述计算机可执行指令在被执行时使至少一个处理器执行如上述第一方面或第一方面的任一可能的实现方式提供的工业数据处理方法。According to a fifth aspect of the embodiments of the present application, there is also provided a computer program, including computer-executable instructions. When executed, the computer-executable instructions cause at least one processor to perform the above-mentioned first aspect or the first aspect. An industrial data processing method provided by any possible implementation.
根据本申请实施例的第六方面,还提供了一种计算机程序产品,所述计算机程序产品被有形地存储在计算机可读介质上并且包括计算机可执行指令,所述计算机可执行指令在被执行时使至少一个处理器执行如上述第一方面或第一方面的任一可能的实现方式提供的工业数据处理方法。According to a sixth aspect of the embodiments of the present application, there is also provided a computer program product, the computer program product is tangibly stored on a computer-readable medium and includes computer-executable instructions, and the computer-executable instructions are executed When at least one processor executes the industrial data processing method provided in the first aspect or any possible implementation manner of the first aspect.
由上述技术方案可知,获取到工业生产系统的现场数据后,首先对现场数据进行数据质量检测,获得指示现场数据的质量的数据质量信息,然后根据数据质量信息对现场数据进行数据质量提升处理,解决现场数据存在的数据质量问题,然后将经过数据质量提升处理的现场数据存储到预先创建的数据库中。由于存储到数据库中的现场数据是经过数据质量提升处理的,已经解决了现场数据存在的数据质量问题,工业生产系统中的应用和服务可以直接从数据库中读取所需的现场数据进行使用,而无需每次读取现场数据都需要通过ETL对现场数据进行处理,从而能够降低工业生产过程中的数据处理量。It can be seen from the above technical solution that after obtaining the on-site data of the industrial production system, the on-site data is firstly inspected for data quality, and the data quality information indicating the quality of the on-site data is obtained, and then the on-site data is subjected to data quality improvement processing according to the data quality information. Solve the data quality problems existing in the field data, and then store the field data processed by data quality improvement into the pre-created database. Since the on-site data stored in the database is processed through data quality improvement, the data quality problems existing in the on-site data have been solved, and the applications and services in the industrial production system can directly read the required on-site data from the database for use. It is not necessary to process the field data through ETL every time the field data is read, thereby reducing the amount of data processing in the industrial production process.
附图说明Description of drawings
图1是本申请实施例一提供的一种工业数据处理方法的流程图;FIG. 1 is a flow chart of an industrial data processing method provided in Embodiment 1 of the present application;
图2是本申请实施例二提供的一种数据质量信息获取方法的流程图;FIG. 2 is a flow chart of a method for acquiring data quality information provided in Embodiment 2 of the present application;
图3是本申请实施例三提供的一种数据质量提升处理方法的流程图;FIG. 3 is a flow chart of a data quality improvement processing method provided in Embodiment 3 of the present application;
图4是本申请实施例四提供的一种工业数据处理方法的流程图;FIG. 4 is a flowchart of an industrial data processing method provided in Embodiment 4 of the present application;
图5是本申请实施例五提供的一种工业数据处理方法的流程图;5 is a flow chart of an industrial data processing method provided in Embodiment 5 of the present application;
图6是本申请实施例六提供的一种工业数据处理装置的示意图;FIG. 6 is a schematic diagram of an industrial data processing device provided in Embodiment 6 of the present application;
图7是本申请实施例七提供的一种电子设备的示意图。FIG. 7 is a schematic diagram of an electronic device provided in Embodiment 7 of the present application.
附图标记列表:List of reference signs:
101:获取工业生产过程中产生的现场数据101: Obtain field data generated during industrial production
102:对现场数据进行质量检测,获得用于指示现场数据的质量的数据质量信息102: Perform quality inspection on on-site data, and obtain data quality information used to indicate the quality of on-site data
103:对现场数据进行数据质量提升处理,以解决现场数据存在的质量问题103: Perform data quality improvement processing on field data to solve quality problems existing in field data
104:将经过数据质量提升处理的现场数据存储到预先创建的数据库中104: Store field data processed for data quality improvement into a pre-created database
201:计算现场数据的数据差异特征201: Calculate data difference characteristics of field data
202:分别获取现场数据中对应于每个预设的第一数据质量字段的第一字段信息202: Obtain the first field information corresponding to each preset first data quality field in the field data respectively
203:获取包括数据差异特征和各第一字段信息的数据质量信息203: Obtain data quality information including data difference features and first field information
301:根据数据质量信息包括的数据差异特征,判断现场数据是否存在异常特征301: According to the data difference characteristics included in the data quality information, determine whether there are abnormal characteristics in the field data
302:删除现场数据的异常特征对应的异常数据302: Delete the abnormal data corresponding to the abnormal characteristics of the field data
303:根据现场数据的数据源的心跳信号,判断现场数据是否存在数据缺失303: According to the heartbeat signal of the data source of the field data, determine whether there is data missing in the field data
304:根据预设的数据增强规则,对现场数据进行数据增强处理304: Perform data enhancement processing on the field data according to the preset data enhancement rules
305:根据第一数据质量字段,判断现场数据的数据格式是否为目标数据格式305: According to the first data quality field, determine whether the data format of the field data is the target data format
306:将现场数据的数据格式转换为目标数据格式306: Convert the data format of field data to the target data format
307:将现场数据存储到数据库中307: Store field data in database
401:获取第一数据输出端通过表现层状态转换接口输出的第一交互数据401: Obtain the first interaction data output by the first data output terminal through the presentation layer state transition interface
402:判断第一交互中的字段是否与预先创建的语义模板相匹配402: Determine whether the fields in the first interaction match the pre-created semantic template
403:对第一交互数据进行数据质量提升处理,以解决第一交互数据存在的质量问题403: Perform data quality improvement processing on the first interaction data to solve quality problems existing in the first interaction data
404:将解决质量问题后的第一交互数据发送给第一数据接收端404: Send the first interaction data after solving the quality problem to the first data receiving end
501:获取工业生产系统中第二数据输出端通过消息队列输出的第二交互数据501: Obtain the second interaction data output by the second data output terminal in the industrial production system through the message queue
502:分别获取第二交互数据对应于每个预设的第二数据质量字段的第二字段信息502: Obtain the second field information of the second interaction data corresponding to each preset second data quality field
503:根据各第二字段信息判断第二交互数据是否存在数据质量问题503: Judging whether there is a data quality problem in the second interactive data according to the information of each second field
504:根据各第二字段信息,对第二交互数据进行数据质量提升处理504: Perform data quality improvement processing on the second interaction data according to the information in each second field
505:将解决质量问题后的第二交互数据发送给第二数据接收端505: Send the second interaction data after solving the quality problem to the second data receiving end
100:工业数据处理方法  200:数据质量信息获取方法  300:数据质量提升处理方法100: Industrial data processing method 200: Data quality information acquisition method 300: Data quality improvement processing method
400:工业数据处理方法  500:工业数据处理方法      600:工业数据处理装置400: Industrial data processing method 500: Industrial data processing method 600: Industrial data processing device
700:电子设备          601:获取模块              602:检测模块700: Electronic equipment 601: Obtaining module 602: Detection module
603:处理模块          604:输出模块              702:处理器603: Processing module 604: Output module 702: Processor
704:通信接口          706:存储器                708:通信总线704: Communication interface 706: Memory 708: Communication bus
710:程序710: Procedure
具体实施方式Detailed ways
如前所述,在目前的工业生产系统中,从现场设备采集到现场数据后,直接将采集到的现场数据存储到数据库中,工业生产系统包括的应用或服务需要使用现场数据时,通过数据仓库技术ETL(Extract-Transform-Load),从数据库读取所需的现场数据,对读取的现场数据进行清洗后加载到请求数据的应用或服务。在采集现场数据的过程中,可能会出现通信中断以及重复采集相同数据的问题,通信中断会导致所采集的现场数据存在数据缺失,重复采集相同数据会导致所采集的现场数据包括脏数据,直接将所采集的现场数据存储到数据库中,因此数据库中存储的现场数据存在数据缺失、包括脏数据等数据质量问题。工业生产系统中的应用或服务每次使用现场数据时,都需要通过ETL对现场数据进行处理,导致工业生产过程中的数据处理量较大。As mentioned above, in the current industrial production system, after the field data is collected from the field equipment, the collected field data is directly stored in the database. When the application or service included in the industrial production system needs to use the field data, the Warehouse technology ETL (Extract-Transform-Load), reads the required field data from the database, cleans the read field data and loads it to the application or service that requests the data. In the process of collecting field data, there may be problems of communication interruption and repeated collection of the same data. Communication interruption will cause data loss in the collected field data, and repeated collection of the same data will cause the collected field data to include dirty data. The collected field data is stored in the database, so the field data stored in the database has data quality problems such as missing data and dirty data. Every time an application or service in an industrial production system uses on-site data, it needs to process the on-site data through ETL, resulting in a large amount of data processing in the industrial production process.
本申请实施例中,获取到工业生产过程中产生的现场数据后,对现场数据进行质量检测,获得用于指示现场数据的质量的数据质量信息,然后根据数据质量信息对现场数据进行数据质量提升处理,以解决现场数据存在的数据质量问题,然后将经过数据质量提升处理的现场数据存储到数据库中。由于存储到数据库中的现场数据是经过数据质量提升处理的,已解决了现场数据存在的数据质量问题,工业生产系统中的应用或服务在使用现场数据时,可以直接从数据库中读取所需的现场数据进行使用,无需每次使用现场数据都需要通过ETL对现场数据进行处理,从而能够降低工业生产过程中的数据处理量。In the embodiment of this application, after obtaining the on-site data generated in the industrial production process, the quality of the on-site data is inspected to obtain the data quality information used to indicate the quality of the on-site data, and then the data quality of the on-site data is improved according to the data quality information processing to solve the data quality problems existing in the field data, and then store the field data that has undergone data quality improvement processing into the database. Since the on-site data stored in the database is processed by data quality improvement, the data quality problems existing in the on-site data have been solved. When the application or service in the industrial production system uses the on-site data, it can directly read the required data from the database. There is no need to process the field data through ETL every time the field data is used, which can reduce the amount of data processing in the industrial production process.
下面结合附图对本申请实施例提供的工业数据处理方法和装置进行详细说明。The industrial data processing method and device provided in the embodiments of the present application will be described in detail below with reference to the accompanying drawings.
实施例一Embodiment one
图1是本申请实施例一提供的一种工业数据处理方法100的流程图,用于对工业生产过程中产生的数据进行处理,如图1所示,该工业数据处理方法100包括如下步骤:FIG. 1 is a flow chart of an industrial data processing method 100 provided in Embodiment 1 of the present application, which is used to process data generated in the industrial production process. As shown in FIG. 1 , the industrial data processing method 100 includes the following steps:
步骤101、获取工业生产过程中产生的现场数据。 Step 101, acquiring on-site data generated in the industrial production process.
在工业生产的过程中,工业生产系统中的工业设备会产生现场数据,现场数据用于指示工业设备的运行状态。现场数据可以从网关设备、传感器、现场应用或现场系统等获取,还可以获取人工的现场操作信息作为现场数据。比如,现场数据可以是订单信息、温度信息、振动信息、转速信息等。In the process of industrial production, industrial equipment in the industrial production system will generate on-site data, and the on-site data is used to indicate the operating status of the industrial equipment. On-site data can be obtained from gateway devices, sensors, on-site applications or on-site systems, etc., and manual on-site operation information can also be obtained as on-site data. For example, on-site data can be order information, temperature information, vibration information, rotational speed information, etc.
在获取现场数据时,可以通过不同类型的通信协议获取现场数据,比如通过OPC UA(OPC为OLE for Process Control的缩写,为应用于过程控制的OLE技术,OLE技术为Object Linking and Embedding,是指对象连接与嵌入,UA是Unified Architecture的缩写,是指统一架构)、消息队列遥测传输(Message Queuing Telemetry Transport,MQTT)或超文本传输协议(Hyper Text Transfer Protocol,HTTP)等通信协议获取现场数据。When obtaining field data, field data can be obtained through different types of communication protocols, such as through OPC UA (OPC is the abbreviation of OLE for Process Control, OLE technology applied to process control, OLE technology is Object Linking and Embedding, refers to Object connection and embedding, UA is the abbreviation of Unified Architecture, refers to unified architecture), message queue telemetry transmission (Message Queuing Telemetry Transport, MQTT) or hypertext transfer protocol (Hyper Text Transfer Protocol, HTTP) and other communication protocols to obtain field data.
102、对现场数据进行质量检测,获得用于指示现场数据的质量的数据质量信息。102. Perform quality inspection on the on-site data, and obtain data quality information for indicating the quality of the on-site data.
在获取现场数据的过程中,基于获取现场数据的通信协议对现场数据的数据质量进行检测,获得用于指示现场数据的质量的数据质量信息。通过数据质量信息,可以确定现场数据是否存在数据缺失、包括脏数据等数据质量问题。In the process of obtaining the field data, the data quality of the field data is detected based on the communication protocol for obtaining the field data, and the data quality information for indicating the quality of the field data is obtained. Through the data quality information, it can be determined whether there are data quality problems such as missing data and dirty data in the field data.
对现场数据进行质量检测,可以通过软件开发工具包(Software Development Kit,SDK)或嵌入式程序模块,通过不同的通信协议从不同的数据源获取现场数据,并对所接收到的现场数据进行数据质量检测,获得相应的数据质量信息。The quality inspection of on-site data can obtain on-site data from different data sources through different communication protocols through the software development kit (Software Development Kit, SDK) or embedded program modules, and perform data analysis on the received on-site data. Quality inspection to obtain corresponding data quality information.
步骤103、根据数据质量信息,对现场数据进行数据质量提升处理,以解决现场数据存在的质量问题。 Step 103, according to the data quality information, perform data quality improvement processing on the on-site data, so as to solve the quality problems existing in the on-site data.
由于数据质量信息能够指示现场数据存在的数据质量问题,因此可以基于数据质量信息 对现场数据进行数据质量提升处理,解决现场数据存在的数据质量问题。比如数据质量信息指示现场数据存在数据缺失,则可以根据数据质量信息对现场数据中缺失的数据进行补充,再比如数据质量信息指示现场数据包括脏数据,则可以根据数据质量信息将现场数据中的脏数据删除。Since the data quality information can indicate the data quality problems existing in the field data, it is possible to improve the data quality of the field data based on the data quality information to solve the data quality problems existing in the field data. For example, if the data quality information indicates that there is data missing in the on-site data, the missing data in the on-site data can be supplemented according to the data quality information. Dirty data deletion.
步骤104、将经过数据质量提升处理的现场数据存储到预先创建的数据库中。 Step 104, storing the on-site data processed for data quality improvement into a pre-created database.
通过对现场数据进行数据质量提升处理,解决现场数据存在的数据质量问题后,将现场数据存储到数据库中,以备工业生产系统中的应用或服务使用。工业生产系统中的应用或服务,可以通过数据库的数据读取接口,从数据库中读取所需的现场数据,以基于现场数据进行工业设备控制、生产计划制定、维护计划制定等。By improving the data quality of on-site data and solving the data quality problems existing in the on-site data, the on-site data is stored in the database for use in applications or services in industrial production systems. The application or service in the industrial production system can read the required field data from the database through the data reading interface of the database, so as to perform industrial equipment control, production plan formulation, maintenance plan formulation, etc. based on the field data.
在本申请实施例中,获取到工业生产系统的现场数据后,首先对现场数据进行数据质量检测,获得指示现场数据的质量的数据质量信息,然后根据数据质量信息对现场数据进行数据质量提升处理,解决现场数据存在的数据质量问题,然后将经过数据质量提升处理的现场数据存储到预先创建的数据库中。由于存储到数据库中的现场数据是经过数据质量提升处理的,已经解决了现场数据存在的数据质量问题,工业生产系统中的应用和服务可以直接从数据库中读取所需的现场数据进行使用,而无需每次读取现场数据都需要通过ETL对现场数据进行处理,从而能够降低工业生产过程中的数据处理量。In the embodiment of the present application, after obtaining the on-site data of the industrial production system, first perform data quality inspection on the on-site data, obtain data quality information indicating the quality of the on-site data, and then perform data quality improvement processing on the on-site data according to the data quality information , solve the data quality problems existing in the field data, and then store the field data processed by data quality improvement into the pre-created database. Since the on-site data stored in the database is processed through data quality improvement, the data quality problems existing in the on-site data have been solved, and the applications and services in the industrial production system can directly read the required on-site data from the database for use. It is not necessary to process the field data through ETL every time the field data is read, thereby reducing the amount of data processing in the industrial production process.
实施例二Embodiment two
在实施例一所提供工业数据处理方法的基础上,通过对现场数据进行质量检测以获得数据质量信息时,可以将现场数据随时间的变化趋势以及现场数据中一些能够反映数据质量的关键字段作为数据质量信息。On the basis of the industrial data processing method provided in Embodiment 1, when the data quality information is obtained by performing quality inspection on the on-site data, the change trend of the on-site data over time and some key fields in the on-site data that can reflect the data quality as data quality information.
图2是本申请实施例二提供的一种数据质量信息获取方法200的流程图,如图2所示,该数据质量信息获取方法200包括如下步骤:FIG. 2 is a flowchart of a data quality information acquisition method 200 provided in Embodiment 2 of the present application. As shown in FIG. 2 , the data quality information acquisition method 200 includes the following steps:
步骤201、计算现场数据的数据差异特征。 Step 201, calculating data difference characteristics of field data.
数据差异特征用于指示现场数据随时间的变化趋势。对于时序数据而言,如果数据发生突变,而数据源运行正常,则说明由于干扰因素而产生了异常数据,因此根据数据差异特征,可以判断现场数据是否包括异常数据。Data variance features are used to indicate trends in field data over time. For time series data, if there is a sudden change in the data and the data source is running normally, it means that abnormal data is generated due to interference factors. Therefore, according to the data difference characteristics, it can be judged whether the field data includes abnormal data.
数据差异特征可以是现场数据的方差、均值、标准差等表征现场数据分布的参数值。The data difference feature may be the variance, mean, and standard deviation of field data, etc., which characterize the parameter values of field data distribution.
步骤202、根据预先设定的至少一个第一数据质量字段,分别获取现场数据中对应于每个第一数据质量字段的第一字段信息。Step 202: Obtain first field information corresponding to each first data quality field in the field data according to at least one preset first data quality field.
第一数据质量字段是现场数据中能够指示现场数据的数据质量的字段,比如第一数据质量字段可以包括数据采集时间、数值大小、数据序列、信号质量、最后一个方差值(last D- value)、所包括字符数和数据位数等。The first data quality field is a field that can indicate the data quality of the field data in the on-site data. For example, the first data quality field can include data acquisition time, numerical value, data sequence, signal quality, and the last variance value (last D-value ), the number of characters and data bits included, etc.
第一数据质量字段可以通过配置信息或用户体验(User Experience,UX)服务进行设置。The first data quality field may be set through configuration information or a user experience (User Experience, UX) service.
需要说明的是,根据获取现场数据时所使用通信协议的不同,对于通信字段包括第一数据质量字段的通信协议,可以直接从通信字段中获取各第一数据质量字段对应的第一字段信息,比如通过OPC UA获取现场数据时,便可以直接从通信字段中获取第一字段信息。对于通信字段不包括第一数据质量字段的通信协议,则可以通过对现场数据进行检测和计算,以获得各第一数据质量字段对应的第一字段信息。It should be noted that, depending on the communication protocol used when acquiring field data, for a communication protocol whose communication field includes a first data quality field, the first field information corresponding to each first data quality field can be obtained directly from the communication field, For example, when obtaining field data through OPC UA, the first field information can be obtained directly from the communication field. For a communication protocol whose communication field does not include a first data quality field, first field information corresponding to each first data quality field may be obtained by detecting and calculating on-site data.
步骤203、获取包括数据差异特征和各第一字段信息的数据质量信息。 Step 203. Obtain data quality information including data difference features and first field information.
在计算出现场数据的数据差异特征,并获取到现场数据中每个第一数据质量字段对应的第一字段信息后,将数据差异特征和各第一字段信息作为数据质量信息,指示现场数据的数据质量。After calculating the data difference characteristics of the field data and obtaining the first field information corresponding to each first data quality field in the field data, the data difference characteristics and each first field information are used as data quality information to indicate the field data data quality.
在通过SDK或嵌入式程序模块提供的通信协议接收现场数据时,可以根据SDK或嵌入式程序模块所使用的通信协议,将数据差异特征以及各第一字段信息嵌入到通信字段中,与现场数据一起发送给数据质量控制器,进而数据质量控制器可以根据接收到的数据质量信息,对接收到的现场数据进行数据质量提升处理。When receiving field data through the communication protocol provided by the SDK or embedded program module, the data difference characteristics and the first field information can be embedded into the communication field according to the communication protocol used by the SDK or embedded program module, and the field data Send them together to the data quality controller, and then the data quality controller can improve the data quality of the received field data according to the received data quality information.
在本申请实施例中,数据差异特征可以指示现场数据随时间的变化趋势,因此可以根据数据差异特征确定现场数据是否存在突然增大或突然减小的异常数据,第一数据质量字段是指示数据质量的关键字段,因此根据第一数据字段对应的第一字段信息可以确定现场数据是否存在数据格式等问题。获取包括数据差异特征和第一字段信息的数据质量信息,进而根据数据差异特征和各第一字段信息对现场数据进行数据质量提升处理时,可以更加全面的解决现场数据存在的数据质量问题,保证存储到数据库中的现场数据没有或仅具有较少的数据质量问题,进而保证工业生产系统中的应用和服务可以基于数据库中的现场数据正常运行。In the embodiment of this application, the data difference feature can indicate the change trend of the field data over time, so it can be determined according to the data difference feature whether there is abnormal data in the field data that suddenly increases or decreases suddenly, and the first data quality field is the indicator data The key field of quality, so according to the first field information corresponding to the first data field, it can be determined whether there are problems such as data format in the field data. Obtaining data quality information including data difference characteristics and first field information, and then improving the data quality of field data according to data difference characteristics and each first field information, can more comprehensively solve data quality problems existing in field data and ensure The on-site data stored in the database has no or less data quality problems, thereby ensuring that applications and services in the industrial production system can operate normally based on the on-site data in the database.
实施例三Embodiment Three
在实施例而所提供数据质量信息获取方法的基础上,在获取到包括数据差异特征和第一字段信息的数据质量信息后,可以根据数据质量信息对现场数据包括的异常数据问题、数据缺失问题和格式异常问题进行处理。On the basis of the data quality information acquisition method provided in the embodiment, after acquiring the data quality information including the data difference feature and the first field information, the abnormal data problems and data missing problems included in the field data can be analyzed according to the data quality information and format exceptions are handled.
图3是本申请实施例三提供的一种数据质量提升处理方法300的流程图,如图3所示,该数据质量提升处理方法300包括如下步骤:FIG. 3 is a flowchart of a data quality improvement processing method 300 provided in Embodiment 3 of the present application. As shown in FIG. 3 , the data quality improvement processing method 300 includes the following steps:
步骤301、根据数据质量信息包括的数据差异特征,判断现场数据是否存在异常特征,如果是Y,执行步骤302,如果否N,执行步骤303。 Step 301 , according to the data difference features included in the data quality information, judge whether there are abnormal features in the field data, if yes, go to step 302 , if not N, go to step 303 .
由于数据差异特征指示现场数据随时间的变化趋势,从而可以根据数据差异特征判断现 场数据中是否存在突然增大或突然减小的情况,如果现场数据存在突然增大或突然减小的情况,则确定现场数据存在异常特征,相应的执行步骤302,如果现场数据不存在突然增大和突然减小的情况,则确定现场数据不存在异常特征,相应的执行步骤303。Since the data difference feature indicates the change trend of the field data over time, it can be judged whether there is a sudden increase or decrease in the field data according to the data difference feature. If there is a sudden increase or decrease in the field data, then If it is determined that there is an abnormal feature in the field data, step 302 is executed accordingly; if there is no sudden increase or decrease in the field data, it is determined that there is no abnormal feature in the field data, and step 303 is executed accordingly.
异常特征可以是现场数据的方差、均值、标准差等反映现场数据随时间变化的参数。Anomaly features can be parameters such as variance, mean, and standard deviation of field data that reflect changes in field data over time.
步骤302、删除现场数据的异常特征对应的异常数据。 Step 302, deleting the abnormal data corresponding to the abnormal feature of the field data.
根据现场数据的异常特征,可以确定现场数据中导致异常特征出现的异常数据,该异常数据即为现场数据中的脏数据,进而将现场数据中导致异常特征出现的异常数据删除,解决现场数据包括脏数据的问题。According to the abnormal characteristics of the field data, the abnormal data that causes the abnormal characteristics to appear in the field data can be determined. Dirty data problem.
比如,现场数据中存在由于人为干扰而导致突然增大的异常数据,该异常数据会导致现场数据的方差增大,反过来看,根据现场数据的方差增大(异常特征)可以定位到现场数据中的异常数据,进而将现场数据中的异常数据删除。For example, there are abnormal data in the field data that suddenly increase due to human interference. Abnormal data in the field data, and then delete the abnormal data in the field data.
步骤303、根据现场数据的数据源的心跳信号,判断现场数据是否存在数据缺失,如果是Y,执行步骤304,如果否N,执行步骤305。 Step 303 , according to the heartbeat signal of the data source of the on-site data, judge whether there is data missing in the on-site data, if yes, go to step 304 , if not N, go to step 305 .
在获取现场数据的过程中,现场数据的数据源发送心跳信号以确定通信是否连接,如果现场数据的数据源的心跳信号超时,即在预设的信号周期内没有接收到来自现场数据的数据源的心跳信号,则说明与现场数据的数据源之间的通信连接已经断开,所以无法从现场数据的数据源获取现场数据,导致现场数据存在数据缺失。如果能够按照预设的信号周期接收到来自现场数据的数据源的心跳信号,则说明与现场数据的数据源之间的通信保持连接状态,可以正常从现场数据的数据源获取现场数据,从而确定现场数据不存在数据缺失。In the process of acquiring field data, the data source of the field data sends a heartbeat signal to determine whether the communication is connected. If the heartbeat signal of the data source of the field data times out, that is, the data source from the field data is not received within the preset signal cycle The heartbeat signal indicates that the communication connection with the data source of the field data has been disconnected, so the field data cannot be obtained from the data source of the field data, resulting in data loss in the field data. If the heartbeat signal from the data source of the field data can be received according to the preset signal cycle, it means that the communication with the data source of the field data remains connected, and the field data can be obtained from the data source of the field data normally, so as to determine There is no missing data in field data.
步骤304、根据预设的数据增强规则,对现场数据进行数据增强处理。 Step 304, perform data enhancement processing on the field data according to preset data enhancement rules.
在确定现场数据存在数据缺失后,根据预设的数据增强规则,对现场数据进行数据增强处理,以生成相应的数据作为缺失数据补充到现场数据中,达到解决现场数据存在数据缺失的问题。After it is determined that there is data missing in the on-site data, according to the preset data enhancement rules, data enhancement processing is performed on the on-site data to generate corresponding data as missing data to supplement the on-site data, so as to solve the problem of missing data in the on-site data.
在对现场数据进行数据增强处理时,可以根据现场数据随时间的变化趋势,补充缺失的现场数据,或者可以计算获取到的现场数据的平均值,将计算出的平均值作为缺失的现场数据。When performing data enhancement processing on the field data, the missing field data can be supplemented according to the change trend of the field data over time, or the average value of the obtained field data can be calculated, and the calculated average value can be used as the missing field data.
步骤305、根据数据质量信息包括的第一数据质量字段,判断现场数据的数据格式是否为目标数据格式,如果是Y,执行步骤307,如果否N,执行步骤306。 Step 305 , according to the first data quality field included in the data quality information, judge whether the data format of the field data is the target data format, if yes, execute step 307 , if not N, execute step 306 .
第一数据质量字段是反映现场数据质量的字段,根据第一数据质量字段可以判断现场数据的数据格式是否为工业生产系统中应用和服务所需的目标数据格式,如果现场数据的数据格式是目标数据格式,则可以直接将现场数据存储到数据库中,相应的执行步骤307。如果 现场数据的数据格式不是目标数据格式,则需要对现场数据的数据格式进行转换,相应的执行步骤306。The first data quality field is a field that reflects the quality of on-site data. According to the first data quality field, it can be judged whether the data format of the on-site data is the target data format required by the application and service in the industrial production system. If the data format of the on-site data is the target data format, the on-site data can be directly stored in the database, and step 307 is executed accordingly. If the data format of the field data is not the target data format, the data format of the field data needs to be converted, and step 306 is executed accordingly.
步骤306、将现场数据的数据格式转换为目标数据格式。 Step 306, converting the data format of the field data into the target data format.
在确定现场数据的数据格式不是目标数据格式后,对现场数据的数据格式进行转换,将现场数据的数据格式转换为目标数据格式。After determining that the data format of the field data is not the target data format, the data format of the field data is converted, and the data format of the field data is converted into the target data format.
例如,现场数据的二进制位数为8位,而工业生产系统中应用和服务所能够处理的数据的二进制位数为16则,则通过补0或补1的方式,将现场数据转换为包括16位二进制位数的数据,以满足工业生产系统中应用或服务的需求。For example, if the binary digits of on-site data are 8 bits, and the binary digits of the data that can be processed by applications and services in the industrial production system are 16 bits, then the field data is converted to include 16 bits by complementing 0 or 1. Bit-by-bit data to meet the needs of applications or services in industrial production systems.
步骤307、将现场数据存储到数据库中。 Step 307, store the field data in the database.
在本申请实施例中,根据数据质量信息,删除现场数据包括的脏数据,在现场数据中补充缺失的数据,并将现场数据的数据格式转换至工业生产系统中应用或服务所需的目标数据格式,能够更加全面地解决现场数据存在的数据质量问题,保证存储到数据块中的现场数据不存在或仅存在较少的数据质量问题,进而保证工业生产系统中的应用和服务能够直接使用数据块中的现场数据,从而减小工业生产过程中对数据的处理量。In the embodiment of this application, according to the data quality information, the dirty data included in the field data is deleted, the missing data is supplemented in the field data, and the data format of the field data is converted to the target data required by the application or service in the industrial production system format, which can more comprehensively solve the data quality problems of on-site data, ensure that the on-site data stored in the data block does not exist or has only a few data quality problems, and thus ensures that the applications and services in the industrial production system can directly use the data On-site data in the block, thereby reducing the amount of data processing in the industrial production process.
实施例四Embodiment four
在工业生产系统中,不仅现场数据存在数据质量问题,应用与应用、应用与服务及服务与服务之间的交互数据也存在数据质量问题,为了保证应用与应用、应用与服务及服务与服务之间能够正常交互,需要解决交互数据的质量问题。In the industrial production system, not only the on-site data has data quality problems, but also the interactive data between applications and applications, applications and services, and services and services has data quality problems. To be able to interact normally, it is necessary to solve the quality problem of the interactive data.
图4是本申请实施例四提供的一种工业数据处理方法400的流程图,如图4所示,该工业数据处理方法400包括如下步骤:FIG. 4 is a flowchart of an industrial data processing method 400 provided in Embodiment 4 of the present application. As shown in FIG. 4 , the industrial data processing method 400 includes the following steps:
步骤401、获取工业生产系统中第一数据输出端通过表现层状态转换接口输出的第一交互数据。 Step 401. Obtain the first interaction data output by the first data output terminal in the industrial production system through the presentation layer state conversion interface.
工业生产系统中的应用和服务之间可以通过表现层状态转换接口(Restful interface)进行交互,一个应用可以通过表现层状态转换接口与其他的应用或服务进行交互,一个服务可以通过表现层状态转换接口与其他的服务或应用进行交互,因此第一数据输出端可以是应用或服务,相应的第一数据接收端也可以是应用或服务。Applications and services in the industrial production system can interact through the presentation layer state transition interface (Restful interface). An application can interact with other applications or services through the presentation layer state transition interface, and a service can interact with other applications or services through the presentation layer state transition interface. The interface interacts with other services or applications, so the first data output end may be an application or service, and the corresponding first data receiving end may also be an application or service.
在第一数据输出端和第一数据接收端通过表现层状态转换接口进行交互时,获取第一数据输出端输出的第一交互数据。When the first data output terminal and the first data receiving terminal interact through the presentation layer state transition interface, the first interaction data output by the first data output terminal is acquired.
步骤402、对第一交互数据中的字段进行语义检索,判断第一交互中的字段是否与预先创建的语义模板相匹配,如果是Y,执行步骤404,如果否N,执行步骤403。 Step 402 , perform semantic search on the fields in the first interaction data, and judge whether the fields in the first interaction match the pre-created semantic template, if yes, perform step 404 , if not N, perform step 403 .
预先根据第一数据输出端和第一数据接收端之间通信接口的结构,及交互数据的字段信 息创建语义模板,语义模板定义了第一数据输出端和第一数据接收端之间交互数据所包括的字段,以及每个字段的数据格式。在获取到第一数据输出端输出的第一交互数据后,对第一交互数据进行语义搜索,确定第一交互数据所包括的字段以及各字段对应的数据,如果第一交互数据包括的字段与语义模板定义的字段一一匹配,而且第一交互数据中每个字段对应的数据满足语义模板定义的数据格式,则确定第一交互数据与语义模板相匹配,相应的执行步骤404。如果第一交互数据包括的字段与语义模板定义的字段相比,存在缺失字段或多出其他字段,或者第一交互数据中某一个或多个字段对应的数据不满足语义模板定义的数据格式,则确定第一交互数据与语义模板不匹配,相应的执行步骤403。Create a semantic template in advance according to the structure of the communication interface between the first data output terminal and the first data receiving terminal, and the field information of the interactive data. The semantic template defines the interaction data between the first data output terminal and the first data receiving terminal. The fields to include, and the data format for each field. After obtaining the first interaction data output by the first data output terminal, perform a semantic search on the first interaction data to determine the fields included in the first interaction data and the data corresponding to each field, if the fields included in the first interaction data are the same as The fields defined by the semantic template are matched one by one, and the data corresponding to each field in the first interaction data satisfies the data format defined by the semantic template, then it is determined that the first interaction data matches the semantic template, and step 404 is executed accordingly. If the fields included in the first interaction data are missing or have more fields than the fields defined in the semantic template, or the data corresponding to one or more fields in the first interaction data does not meet the data format defined in the semantic template, Then it is determined that the first interaction data does not match the semantic template, and step 403 is executed accordingly.
步骤403、对第一交互数据进行数据质量提升处理,以解决第一交互数据存在的质量问题。Step 403: Perform data quality improvement processing on the first interaction data, so as to solve quality problems existing in the first interaction data.
在确定第一交互数据与语义模板不匹配时,说明第一交互数据存在数据质量问题,则对第一交互数据进行数据质量提升处理,以解决第一交互数据存在的数据质量问题。When it is determined that the first interaction data does not match the semantic template, it means that there is a data quality problem in the first interaction data, and data quality improvement processing is performed on the first interaction data to solve the data quality problem in the first interaction data.
对第一交互数据进行数据质量提升处理,可以参见实施例三中对现场数据进行数据质量提升处理,比如删除第一交互数据中的脏数据、在第一交互数据中补充缺失的数据以及对第一交互数据进行数据格式转换等。To improve the data quality of the first interaction data, please refer to the data quality improvement process of the field data in the third embodiment, such as deleting dirty data in the first interaction data, supplementing missing data in the first interaction data, and improving the data quality of the first interaction data. - Interactive data for data format conversion, etc.
步骤404、将解决质量问题后的第一交互数据发送给第一数据接收端。Step 404: Send the first interaction data after the quality problem is solved to the first data receiving end.
如果确定第一交互数据与语义模板相匹配,即第一交互数据不存在数据质量问题,则直接将第一交互数据发送给第一数据接收端。如果确定第一交互数据与语义模板不匹配,即第一交互数据存在数据质量问题,则在对第一交互数据进行数据质量提升处理后,将经过数据质量提升处理的第一交互数据发送给第一数据接收端。If it is determined that the first interaction data matches the semantic template, that is, there is no data quality problem in the first interaction data, the first interaction data is directly sent to the first data receiving end. If it is determined that the first interaction data does not match the semantic template, that is, there is a data quality problem in the first interaction data, after performing data quality improvement processing on the first interaction data, the first interaction data that has undergone data quality improvement processing is sent to the first interaction data. A data receiving end.
在本申请实施例中,预先创建语义模板,通过语义搜索判断第一交互数据是否与语义模板相匹配,从而确定第一交互数据是否存在数据质量问题,并对存在数据质量问题的第一交互数据进行数据质量提升处理,将解决数据质量问题后的第一交互数据发送给第一数据接收端,保证通过表现层状态转换接口交互的第一数据输出端和第一数据接收端能够正常交互,使得工业生产系统中通过表现层状态转换接口交互的应用和服务能够正常获取到交互数据,进而保证工业生产系统能够正常运行,保证工业生产的效率。In this embodiment of the present application, the semantic template is created in advance, and whether the first interaction data matches the semantic template is judged through semantic search, so as to determine whether the first interaction data has data quality problems, and the first interaction data with data quality problems Perform data quality improvement processing, send the first interactive data after solving the data quality problem to the first data receiving end, and ensure that the first data output end and the first data receiving end interacting through the presentation layer state conversion interface can interact normally, so that In the industrial production system, the applications and services that interact through the state transition interface of the presentation layer can normally obtain the interactive data, thereby ensuring the normal operation of the industrial production system and the efficiency of industrial production.
实施例五Embodiment five
实施例四提供的工业数据处理方法,用于解决通过表现层状态转换接口传输的交互数据的质量问题,工业生产系统中应用和服务还可以通过消息队列(Message queue)进行交互,为了进一步提高工业生产系统中数据的质量,还可以对解决通过消息队列传输的交互数据的质量问题。The industrial data processing method provided by Embodiment 4 is used to solve the quality problem of the interactive data transmitted through the presentation layer state transition interface. The application and service in the industrial production system can also interact through the message queue (Message queue). In order to further improve the industrial The quality of data in the production system can also solve the quality problem of interactive data transmitted through message queues.
图5是本申请实施例五提供的一种工业数据处理方法500的流程图,如图5所示,该工业数据处理方法500包括如下步骤:Fig. 5 is a flowchart of an industrial data processing method 500 provided in Embodiment 5 of the present application. As shown in Fig. 5, the industrial data processing method 500 includes the following steps:
步骤501、获取工业生产系统中第二数据输出端通过消息队列输出的第二交互数据。 Step 501, acquire the second interaction data output by the second data output terminal in the industrial production system through the message queue.
工业生产系统中的应用和服务之间可以通过消息队列进行交互,一个应用可以通过消息队列与其他的应用或服务进行交互,一个服务也可以通过消息队列与其他服务或应用进行交互,因此输出数据的第二数据输出端可以是应用或服务,相应的接收数据的第二数据接收端也可以是应用或服务。Applications and services in industrial production systems can interact through message queues. An application can interact with other applications or services through message queues, and a service can also interact with other services or applications through message queues, so output data The second data output end may be an application or a service, and the corresponding second data receiving end receiving data may also be an application or a service.
在第二数据输出端和第二数据接收端通过消息队列进行交互时,获取第二数据输出端输出的第二交互数据。When the second data output terminal and the second data receiving terminal interact through the message queue, the second interaction data output by the second data output terminal is acquired.
步骤502、根据预先设定的至少一个第二数据质量字段,分别获取第二交互数据对应于每个第二数据质量字段的第二字段信息。Step 502: Acquire second field information corresponding to each second data quality field of the second interaction data according to at least one preset second data quality field.
针对通过消息队列传输的第二交互数据,预先确定用于指示该第二交互数据的数据质量的至少一个第二数据质量字段,在获取到第二交互数据后,从第二交互数据中提取每个第二数据质量字段对应的第二字段信息。第二数据质量字段可以包括数据采集时间、数值大小、数据序列、信号质量、最后一个方差值(last D-value)、所包括字符数和数据位数等。For the second interaction data transmitted through the message queue, at least one second data quality field used to indicate the data quality of the second interaction data is predetermined, and after the second interaction data is obtained, each The second field information corresponding to the second data quality field. The second data quality field may include data acquisition time, value size, data sequence, signal quality, last variance value (last D-value), number of included characters and number of data bits, etc.
第二数据质量字段可以通过配置信息或用户体验(User Experience,UX)服务进行设置。The second data quality field can be set through configuration information or user experience (User Experience, UX) service.
步骤503、根据各第二字段信息判断第二交互数据是否存在数据质量问题,如果是Y,执行步骤504,如果否N,执行步骤505。 Step 503 , judging whether there is a data quality problem in the second interaction data according to the information of each second field, if yes, go to step 504 , if not N, go to step 505 .
由于第二数据质量字段为指示第二交互数据的质量的字段,因此根据第二数据质量字段对应的第二字段信息可以确定第二交互数据是否存在数据质量问题,比如第二交互数据是否缺失字段、第二字段信息的格式不正确、包括多余的字段等。如果第二交互数据存在数据质量问题,则执行步骤504,以对第二交互数据存在的数据质量问题进行处理。如果第二交互数据不存在数据质量问题,则执行步骤505,直接将第二交互数据发送给第二数据接收端。Since the second data quality field is a field indicating the quality of the second interaction data, it can be determined whether there is a data quality problem in the second interaction data according to the second field information corresponding to the second data quality field, such as whether the second interaction data is missing a field , The format of the second field information is incorrect, including redundant fields, etc. If there is a data quality problem in the second interaction data, step 504 is executed to deal with the data quality problem in the second interaction data. If there is no data quality problem in the second interaction data, step 505 is performed to directly send the second interaction data to the second data receiving end.
步骤504、根据各第二字段信息,对第二交互数据进行数据质量提升处理。Step 504: Perform data quality improvement processing on the second interaction data according to the information of each second field.
在第二交互数据存在数据质量问题时,根据各第二字段信息对第二交互数据进行数据质量提升处理,以删除第二交互数据中的异常数据,在第二交互数据中补充缺失的数据,以及对第二交互数据进行数据格式转换。删除第二交互数据中的异常数据,主要是删除第二交互数据中的冗余数据等脏数据。对第二交互数据进行数据格式转换,主要是将第二交互数据中各第二字段信息转换为第二数据接收端所需的数据格式。When there is a data quality problem in the second interaction data, perform data quality improvement processing on the second interaction data according to the information of each second field, so as to delete abnormal data in the second interaction data and supplement missing data in the second interaction data, And performing data format conversion on the second interaction data. Deleting the abnormal data in the second interactive data is mainly to delete dirty data such as redundant data in the second interactive data. The data format conversion of the second interaction data is mainly to convert the second field information in the second interaction data into the data format required by the second data receiving end.
步骤505、将解决质量问题后的第二交互数据发送给第二数据接收端。 Step 505. Send the second interaction data after the quality problem is solved to the second data receiving end.
如果确定第二交互数据不存在数据质量问题,则直接将第二交互数据发送给第二数据接 收端。如果确定第二交互数据存在数据质量问题,则在对第二交互数据进行数据质量提升处理后,将经过数据质量提升处理的第二交互数据发送给第二数据接收端。If it is determined that there is no data quality problem in the second interaction data, the second interaction data is directly sent to the second data receiving end. If it is determined that there is a data quality problem in the second interaction data, after data quality improvement processing is performed on the second interaction data, the second interaction data that has undergone data quality improvement processing is sent to the second data receiving end.
在本申请实施例中,预先确定用于指示第二交互数据的质量的第二数据质量字段,从第二交互数据中提取每个第二数据质量字段对应的第二字段信息,进而根据各第二字段信息可以对第二交互数据进行数据质量提升处理,将解决数据质量问题后的第二交互数据发送给第二数据接收端,保证通过消息队列交互的第二数据输出端和第二数据接收端能够正常交互,使得工业生产系统中通过消息队列交互的应用和服务能够正常获取到交互数据,进而保证工业生产系统能够正常运行,保证工业生产的效率。In this embodiment of the present application, the second data quality field used to indicate the quality of the second interaction data is predetermined, and the second field information corresponding to each second data quality field is extracted from the second interaction data, and then according to the The two-field information can perform data quality improvement processing on the second interaction data, and send the second interaction data after solving the data quality problem to the second data receiving end, so as to ensure the second data output end and the second data receiving end through the message queue interaction The terminal can interact normally, so that the applications and services interacting through the message queue in the industrial production system can normally obtain the interactive data, thereby ensuring the normal operation of the industrial production system and the efficiency of industrial production.
实施例六Embodiment six
图6是本申请实施例六提供的一种工业数据处理装置600的示意图,用于对工业生产过程中产生的数据进行处理,如图6所示,该工业数据处理装置600包括:Fig. 6 is a schematic diagram of an industrial data processing device 600 provided in Embodiment 6 of the present application, which is used to process data generated in the industrial production process. As shown in Fig. 6, the industrial data processing device 600 includes:
获取模块601,用于获取工业生产过程中产生的现场数据;An acquisition module 601, configured to acquire field data generated during industrial production;
检测模块602,用于对获取模块601获取到的现场数据进行质量检测,获得用于指示现场数据的质量的数据质量信息;The detection module 602 is configured to perform quality detection on the on-site data acquired by the acquisition module 601, and obtain data quality information indicating the quality of the on-site data;
处理模块603,用于根据检测模块602获取到的数据质量信息,对现场数据进行数据质量提升处理,以解决现场数据存在的质量问题;The processing module 603 is configured to perform data quality improvement processing on the on-site data according to the data quality information acquired by the detection module 602, so as to solve the quality problems existing in the on-site data;
输出模块604,用于将经过处理模块603的数据质量提升处理的现场数据存储到预先创建的数据库中。The output module 604 is configured to store the on-site data processed by the processing module 603 for data quality improvement into a pre-created database.
在本申请实施例中,第一获取模块601可用于执行上述实施例一中的步骤101,检测模块602可用于执行上述实施例一中的步骤102,处理模块603可用于执行上述实施例一中的步骤103,输出模块604可用于执行上述实施例一中的步骤104。In this embodiment of the present application, the first acquisition module 601 can be used to execute step 101 in the first embodiment above, the detection module 602 can be used to execute step 102 in the first embodiment above, and the processing module 603 can be used to execute the step 102 in the first embodiment above. In step 103, the output module 604 can be used to execute step 104 in the first embodiment above.
在一种可能的实现方式中,检测模块602可用于执行如下操作:In a possible implementation manner, the detection module 602 may be configured to perform the following operations:
计算现场数据的数据差异特征,数据差异特征用于指示现场数据随时间的变化趋势;Calculate the data difference characteristics of the field data, and the data difference characteristics are used to indicate the change trend of the field data over time;
根据预先设定的至少一个第一数据质量字段,分别获取现场数据对应于每个第一数据质量字段的第一字段信息,第一数据质量字段为指示现场数据的质量的字段;According to at least one preset first data quality field, first field information corresponding to each first data quality field of the field data is respectively obtained, and the first data quality field is a field indicating the quality of the field data;
获取包括数据差异特征和各第一字段信息的数据质量信息。Data quality information including data difference features and information of each first field is acquired.
在本申请实施例中,检测模块602可用于执行上述实施例二中的步骤201至步骤203。In this embodiment of the present application, the detection module 602 may be used to execute steps 201 to 203 in the second embodiment above.
在一种可能的实现方式中,处理模块603可用于执行如下操作:In a possible implementation manner, the processing module 603 may be configured to perform the following operations:
根据数据质量信息包括的数据差异特征,判断现场数据是否存在异常特征,若现场数据存在异常特征,则删除现场数据的异常特征对应的异常数据;According to the data difference characteristics included in the data quality information, it is judged whether there are abnormal characteristics in the field data, and if there are abnormal characteristics in the field data, the abnormal data corresponding to the abnormal characteristics of the field data is deleted;
根据现场数据的数据源的心跳信号,判断现场数据是否存在数据缺失,若现场数据存在 数据缺失,则按照预设的数据增强规则对现场数据进行数据增强处理,以在现场数据中补充缺失的数据;According to the heartbeat signal of the data source of the on-site data, it is judged whether there is data missing in the on-site data. If there is data missing in the on-site data, data enhancement processing is performed on the on-site data according to the preset data enhancement rules to supplement the missing data in the on-site data. ;
根据数据质量信息包括的第一数据质量字段,判断现场数据的数据格式是否为目标数据格式,若现场数据的数据格式不是目标数据格式,则将现场数据的数据格式转换为目标数据格式。According to the first data quality field included in the data quality information, it is judged whether the data format of the field data is the target data format, and if the data format of the field data is not the target data format, then the data format of the field data is converted into the target data format.
在本申请实施例中,处理模块603可用于执行上述实施例三中的步骤301至步骤306。In this embodiment of the present application, the processing module 603 may be configured to execute steps 301 to 306 in the third embodiment above.
在一种可能的实现方式中,如图6所示,In a possible implementation, as shown in Figure 6,
获取模块601,还用于获取工业生产系统中第一数据输出端通过表现层状态转换接口输出的第一交互数据;The obtaining module 601 is also used to obtain the first interaction data output by the first data output terminal in the industrial production system through the presentation layer state conversion interface;
检测模块602,还用于对获取模块601获取到的第一交互数据中的字段进行语义检索,判断第一交互数据中的字段是否与预先创建的语义模板相匹配;The detection module 602 is further configured to perform semantic retrieval on the fields in the first interaction data acquired by the acquisition module 601, and determine whether the fields in the first interaction data match the pre-created semantic template;
处理模块603,还用于在检测模块602确定第一交互数据中的字段与语义模板不匹配时,对第一交互数据进行数据质量提升处理,以解决第一交互数据存在的质量问题;The processing module 603 is further configured to perform data quality improvement processing on the first interaction data when the detection module 602 determines that the fields in the first interaction data do not match the semantic template, so as to solve the quality problems existing in the first interaction data;
输出模块604,还用于将经过处理模块603的数据质量提升处理的第一交互数据发送给工业生产系统中的第一数据接收端。The output module 604 is further configured to send the first interaction data processed by the processing module 603 to improve the data quality to the first data receiving end in the industrial production system.
在本申请实施例中,获取模块601可用于执行上述实施例四中的步骤401,检测模块602可用于执行上述实施例四中的步骤402,处理模块603可用于执行上述实施例四中的步骤403,输出模块604可用于执行上述实施例四中的步骤404。In this embodiment of the present application, the acquisition module 601 can be used to execute the step 401 in the fourth embodiment above, the detection module 602 can be used to execute the step 402 in the fourth embodiment above, and the processing module 603 can be used to execute the steps in the fourth embodiment above 403. The output module 604 can be used to execute step 404 in the fourth embodiment above.
在一种可能的实现方式中,如图6所示,In a possible implementation, as shown in Figure 6,
获取模块601,还用于获取工业生产系统中第二数据输出端通过消息队列输出的第二交互数据;The obtaining module 601 is also used to obtain the second interaction data output by the second data output terminal in the industrial production system through the message queue;
检测模块602,还用于根据预先设定的至少一个第二数据质量字段,分别获取第二交互数据对应于每个第二数据质量字段的第二字段信息,第二数据质量字段为指示第二交互数据的质量的字段,并根据各第二字段信息判断第二交互数据是否存在数据质量问题;The detection module 602 is further configured to respectively acquire second field information corresponding to each second data quality field of the second interaction data according to at least one preset second data quality field, where the second data quality field indicates the second The field of the quality of the interactive data, and judge whether there is a data quality problem in the second interactive data according to the information of each second field;
处理模块603,还用于在检测模块602确定第二交互数据存在数据质量问题,则根据各第二字段信息,对第二交互数据进行数据质量提升处理,以删除第二交互数据中的异常数据,在第二交互数据中补充缺失的数据,以及对第二交互数据进行数据格式转换;The processing module 603 is further configured to determine in the detection module 602 that there is a data quality problem in the second interaction data, and perform data quality improvement processing on the second interaction data according to the information of each second field, so as to delete abnormal data in the second interaction data , supplementing missing data in the second interaction data, and performing data format conversion on the second interaction data;
输出模块604,还用于将经过处理模块603的数据质量提升处理的第二交互数据发送给工业生产系统中的第二数据接收端。The output module 604 is further configured to send the second interaction data processed by the processing module 603 to improve the data quality to the second data receiving end in the industrial production system.
在本申请实施例中,获取模块601可用于执行上述实施例五中的步骤501,检测模块602可用于执行上述实施例五中的步骤502和步骤503,处理模块603可用于执行上述实施 例五中的步骤504,输出模块604可用于执行上述实施例五中的步骤505。In this embodiment of the application, the acquiring module 601 can be used to execute step 501 in the fifth embodiment above, the detection module 602 can be used to execute step 502 and step 503 in the fifth embodiment above, and the processing module 603 can be used to execute the fifth embodiment above In step 504, the output module 604 can be used to execute step 505 in the fifth embodiment above.
需要说明的是,上述装置实施例中各个模块之间的交互与前述方法实施例基于同一发明构思,具体内容可以参见前述方法实施例中的描述,在此不再赘述。It should be noted that the interaction between the various modules in the above-mentioned device embodiment is based on the same inventive concept as that of the foregoing method embodiment, and details can be found in the description of the foregoing method embodiment, which will not be repeated here.
实施例七Embodiment seven
图7是本申请实施例七提供的一种电子设备的示意图,本申请具体实施例并不对电子设备的具体实现做限定。参见图7,本申请实施例提供的电子设备700包括:处理器(processor)702、通信接口(Communications Interface)704、存储器(memory)706、以及通信总线708。其中:FIG. 7 is a schematic diagram of an electronic device provided in Embodiment 7 of the present application. The specific embodiment of the present application does not limit the specific implementation of the electronic device. Referring to FIG. 7 , an electronic device 700 provided by an embodiment of the present application includes: a processor (processor) 702 , a communication interface (Communications Interface) 704 , a memory (memory) 706 , and a communication bus 708 . in:
处理器702、通信接口704、以及存储器706通过通信总线708完成相互间的通信。The processor 702 , the communication interface 704 , and the memory 706 communicate with each other through the communication bus 708 .
通信接口704,用于与其它电子设备或服务器进行通信。The communication interface 704 is used for communicating with other electronic devices or servers.
处理器702,用于执行程序710,具体可以执行上述工业数据处理方法实施例中的相关步骤。The processor 702 is configured to execute the program 710, specifically, may execute relevant steps in the above-mentioned embodiments of the industrial data processing method.
具体地,程序710可以包括程序代码,该程序代码包括计算机操作指令。Specifically, the program 710 may include program codes including computer operation instructions.
处理器702可能是中央处理器CPU,或者是特定集成电路ASIC(Application Specific Integrated Circuit),或者是被配置成实施本申请实施例的一个或多个集成电路。智能设备包括的一个或多个处理器,可以是同一类型的处理器,如一个或多个CPU;也可以是不同类型的处理器,如一个或多个CPU以及一个或多个ASIC。The processor 702 may be a central processing unit CPU, or an ASIC (Application Specific Integrated Circuit), or one or more integrated circuits configured to implement the embodiments of the present application. The one or more processors included in the smart device may be of the same type, such as one or more CPUs, or may be different types of processors, such as one or more CPUs and one or more ASICs.
存储器706,用于存储程序710。存储器706可能包含高速RAM存储器,也可能还包括非易失性存储器(non-volatile memory),例如至少一个磁盘存储器。The memory 706 is used for storing the program 710 . The memory 706 may include a high-speed RAM memory, and may also include a non-volatile memory (non-volatile memory), such as at least one disk memory.
程序710具体可以用于使得处理器702执行前述任一实施例中的工业数据处理方法。The program 710 may specifically be used to enable the processor 702 to execute the industrial data processing method in any of the foregoing embodiments.
程序710中各步骤的具体实现可以参见上述工业数据处理方法实施例中的相应步骤和单元中对应的描述,在此不赘述。所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的设备和模块的具体工作过程,可以参考前述方法实施例中的对应过程描述,在此不再赘述。For the specific implementation of each step in the program 710, reference may be made to the corresponding description of the corresponding steps and units in the above-mentioned embodiment of the industrial data processing method, and details are not repeated here. Those skilled in the art can clearly understand that for the convenience and brevity of description, the specific working process of the above-described devices and modules can refer to the corresponding process description in the foregoing method embodiments, and details are not repeated here.
通过本实施例的电子设备,获取到工业生产系统的现场数据后,首先对现场数据进行数据质量检测,获得指示现场数据的质量的数据质量信息,然后根据数据质量信息对现场数据进行数据质量提升处理,解决现场数据存在的数据质量问题,然后将经过数据质量提升处理的现场数据存储到预先创建的数据库中。由于存储到数据库中的现场数据是经过数据质量提升处理的,已经解决了现场数据存在的数据质量问题,工业生产系统中的应用和服务可以直接从数据库中读取所需的现场数据进行使用,而无需每次读取现场数据都需要通过ETL对现场数据进行处理,从而能够降低工业生产过程中的数据处理量。Through the electronic device of this embodiment, after the on-site data of the industrial production system is obtained, the data quality inspection is first performed on the on-site data, and the data quality information indicating the quality of the on-site data is obtained, and then the data quality of the on-site data is improved according to the data quality information Processing to solve the data quality problems existing in the field data, and then store the field data processed by data quality improvement into the pre-created database. Since the on-site data stored in the database is processed through data quality improvement, the data quality problems existing in the on-site data have been solved, and the applications and services in the industrial production system can directly read the required on-site data from the database for use. It is not necessary to process the field data through ETL every time the field data is read, thereby reducing the amount of data processing in the industrial production process.
本申请还提供了一种计算机可读存储介质,存储用于使一机器执行如本文所述的工业数据处理方法的指令。具体地,可以提供配有存储介质的系统或者装置,在该存储介质上存储着实现上述实施例中任一实施例的功能的软件程序代码,且使该系统或者装置的计算机(或CPU或MPU)读出并执行存储在存储介质中的程序代码。The present application also provides a computer-readable storage medium storing instructions for causing a machine to execute the industrial data processing method as described herein. Specifically, a system or device equipped with a storage medium may be provided, on which a software program code for realizing the functions of any of the above embodiments is stored, and the computer (or CPU or MPU of the system or device) ) to read and execute the program code stored in the storage medium.
在这种情况下,从存储介质读取的程序代码本身可实现上述实施例中任何一项实施例的功能,因此程序代码和存储程序代码的存储介质构成了本申请的一部分。In this case, the program code itself read from the storage medium can realize the function of any one of the above-mentioned embodiments, so the program code and the storage medium storing the program code constitute a part of the present application.
用于提供程序代码的存储介质实施例包括软盘、硬盘、磁光盘、光盘(如CD-ROM、CD-R、CD-RW、DVD-ROM、DVD-RAM、DVD-RW、DVD+RW)、磁带、非易失性存储卡和ROM。可选择地,可以由通信网络从服务器计算机上下载程序代码。Examples of storage media for providing program code include floppy disks, hard disks, magneto-optical disks, optical disks (such as CD-ROM, CD-R, CD-RW, DVD-ROM, DVD-RAM, DVD-RW, DVD+RW), Tape, non-volatile memory card, and ROM. Alternatively, the program code can be downloaded from a server computer via a communication network.
此外,应该清楚的是,不仅可以通过执行计算机所读出的程序代码,而且可以通过基于程序代码的指令使计算机上操作的操作系统等来完成部分或者全部的实际操作,从而实现上述实施例中任意一项实施例的功能。In addition, it should be clear that not only by executing the program code read by the computer, but also by making the operating system on the computer complete part or all of the actual operations through instructions based on the program code, so as to realize the function of any one of the embodiments.
此外,可以理解的是,将由存储介质读出的程序代码写到插入计算机内的扩展板中所设置的存储器中或者写到与计算机相连接的扩展模块中设置的存储器中,随后基于程序代码的指令使安装在扩展板或者扩展模块上的CPU等来执行部分和全部实际操作,从而实现上述实施例中任一实施例的功能。In addition, it can be understood that the program code read from the storage medium is written into the memory provided in the expansion board inserted into the computer or written into the memory provided in the expansion module connected to the computer, and then based on the program code The instruction causes the CPU installed on the expansion board or the expansion module to perform some or all of the actual operations, thereby realizing the functions of any one of the above-mentioned embodiments.
本申请实施例还提供了一种计算机程序,包括计算机可执行指令,所述计算机可执行指令在被执行时使至少一个处理器执行上述各实施例提供的工业数据处理方法。The embodiment of the present application also provides a computer program, including computer executable instructions, when executed, the computer executable instructions cause at least one processor to execute the industrial data processing methods provided in the above embodiments.
本申请实施例还提供了一种计算机程序产品,所述计算机程序产品被有形地存储在计算机可读介质上并且包括计算机可执行指令,所述计算机可执行指令在被执行时使至少一个处理器执行上述各实施例提供的工业数据处理方法。应理解,本实施例中的各方案具有上述方法实施例中对应的技术效果,此处不再赘述。The embodiment of the present application also provides a computer program product, the computer program product is tangibly stored on a computer-readable medium and includes computer-executable instructions, and the computer-executable instructions cause at least one processor to Execute the industrial data processing methods provided by the above-mentioned embodiments. It should be understood that the solutions in this embodiment have the corresponding technical effects in the foregoing method embodiments, and details are not repeated here.
需要说明的是,上述各流程和各装置结构图中不是所有的步骤和模块都是必须的,可以根据实际的需要忽略某些步骤或模块。各步骤的执行顺序不是固定的,可以根据需要进行调整。上述各实施例中描述的系统结构可以是物理结构,也可以是逻辑结构,即,有些模块可能由同一物理实体实现,或者,有些模块可能分由多个物理实体实现,或者,可以由多个独立设备中的某些部件共同实现。It should be noted that not all the steps and modules in the above flow charts and device structure diagrams are necessary, and some steps or modules can be ignored according to actual needs. The execution order of each step is not fixed and can be adjusted as required. The system structures described in the above embodiments may be physical structures or logical structures, that is, some modules may be realized by the same physical entity, or some modules may be realized by multiple physical entities, or may be realized by multiple Certain components in individual devices are implemented together.
以上各实施例中,硬件模块可以通过机械方式或电气方式实现。例如,一个硬件模块可 以包括永久性专用的电路或逻辑(如专门的处理器,FPGA或ASIC)来完成相应操作。硬件模块还可以包括可编程逻辑或电路(如通用处理器或其它可编程处理器),可以由软件进行临时的设置以完成相应操作。具体的实现方式(机械方式、或专用的永久性电路、或者临时设置的电路)可以基于成本和时间上的考虑来确定。In the above embodiments, the hardware modules may be implemented mechanically or electrically. For example, a hardware module may include permanently dedicated circuitry or logic (such as a dedicated processor, FPGA or ASIC) to perform the corresponding operations. The hardware modules may also include programmable logic or circuits (such as general-purpose processors or other programmable processors), which can be temporarily set by software to complete corresponding operations. The specific implementation (mechanical way, or a dedicated permanent circuit, or a temporary circuit) can be determined based on cost and time considerations.
上文通过附图和优选实施例对本申请进行了详细展示和说明,然而本申请不限于这些已揭示的实施例,基与上述多个实施例本领域技术人员可以知晓,可以组合上述不同实施例中的代码审核手段得到本申请更多的实施例,这些实施例也在本申请的保护范围之内。The above has shown and described the application in detail through the accompanying drawings and preferred embodiments, but the application is not limited to these disclosed embodiments, and those skilled in the art based on the above-mentioned multiple embodiments can know that the above-mentioned different embodiments can be combined The code auditing means in the present application obtains more embodiments, and these embodiments are also within the protection scope of the present application.

Claims (10)

  1. 一种工业数据处理方法(100),用于对工业生产过程中产生的数据进行处理,所述方法包括:An industrial data processing method (100), used for processing data generated in an industrial production process, the method comprising:
    获取工业生产过程中产生的现场数据;Obtain on-site data generated during industrial production;
    对所述现场数据进行质量检测,获得用于指示所述现场数据的质量的数据质量信息;performing quality inspection on the on-site data, and obtaining data quality information indicating the quality of the on-site data;
    根据所述数据质量信息,对所述现场数据进行数据质量提升处理,以解决所述现场数据存在的质量问题;performing data quality improvement processing on the on-site data according to the data quality information, so as to solve quality problems existing in the on-site data;
    将经过数据质量提升处理的所述现场数据存储到预先创建的数据库中。The on-site data that has undergone data quality improvement processing is stored in a pre-created database.
  2. 根据权利要求1所述的方法,其中,所述对所述现场数据进行质量检测,获得用于指示所述现场数据的质量的数据质量信息,包括:The method according to claim 1, wherein said performing quality inspection on said on-site data and obtaining data quality information indicating the quality of said on-site data comprises:
    计算所述现场数据的数据差异特征,所述数据差异特征用于指示所述现场数据随时间的变化趋势;calculating a data difference feature of the on-site data, the data difference feature being used to indicate a change trend of the on-site data over time;
    根据预先设定的至少一个第一数据质量字段,分别获取所述现场数据对应于每个所述第一数据质量字段的第一字段信息,所述第一数据质量字段为指示所述现场数据的质量的字段;Acquire the first field information corresponding to each of the first data quality fields of the field data according to at least one preset first data quality field, and the first data quality fields indicate the fields of the field data quality field;
    获取包括所述数据差异特征和各所述第一字段信息的数据质量信息。Acquiring data quality information including the data difference feature and each of the first field information.
  3. 根据权利要求2所述的方法,其中,所述根据所述数据质量信息,对所述现场数据进行数据质量提升处理,包括:The method according to claim 2, wherein, performing data quality improvement processing on the field data according to the data quality information includes:
    根据所述数据质量信息包括的所述数据差异特征,判断所述现场数据是否存在异常特征,若所述现场数据存在所述异常特征,则删除所述现场数据的所述异常特征对应的异常数据;According to the data difference characteristics included in the data quality information, determine whether the field data has abnormal characteristics, and if the field data has the abnormal characteristics, delete the abnormal data corresponding to the abnormal characteristics of the field data ;
    根据所述现场数据的数据源的心跳信号,判断所述现场数据是否存在数据缺失,若所述现场数据存在数据缺失,则按照预设的数据增强规则对所述现场数据进行数据增强处理,以在所述现场数据中补充缺失的数据;According to the heartbeat signal of the data source of the on-site data, it is judged whether there is data missing in the on-site data, and if there is data missing in the on-site data, data enhancement processing is performed on the on-site data according to a preset data enhancement rule, so as to Supplement missing data in said field data;
    根据所述数据质量信息包括的所述第一数据质量字段,判断所述现场数据的数据格式是否为目标数据格式,若所述现场数据的数据格式不是所述目标数据格式,则将所述现场数据的数据格式转换为所述目标数据格式。According to the first data quality field included in the data quality information, it is judged whether the data format of the field data is the target data format, and if the data format of the field data is not the target data format, the field The data format of the data is converted to the target data format.
  4. 根据权利要求1所述的方法,所述方法还包括:The method according to claim 1, said method further comprising:
    获取工业生产系统中第一数据输出端通过表现层状态转换接口输出的第一交互数据;Obtain the first interaction data output by the first data output terminal in the industrial production system through the presentation layer state conversion interface;
    对所述第一交互数据中的字段进行语义检索,判断所述第一交互数据中的字段是否与预先创建的语义模板相匹配;Perform semantic search on the fields in the first interaction data, and judge whether the fields in the first interaction data match the pre-created semantic template;
    若所述第一交互数据中的字段与所述语义模板不匹配,则对所述第一交互数据进行数据质量提升处理,以解决所述第一交互数据存在的质量问题;If the fields in the first interaction data do not match the semantic template, perform data quality improvement processing on the first interaction data, so as to solve the quality problems existing in the first interaction data;
    将经过数据质量提升处理的所述第一交互数据发送给所述工业生产系统中的第一数据接 收端。Sending the first interaction data that has undergone data quality improvement processing to the first data receiving end in the industrial production system.
  5. 根据权利要求1至4中任一所述的方法,所述方法还包括:The method according to any one of claims 1 to 4, further comprising:
    获取工业生产系统中第二数据输出端通过消息队列输出的第二交互数据;Obtain the second interaction data output by the second data output terminal in the industrial production system through the message queue;
    根据预先设定的至少一个第二数据质量字段,分别获取所述第二交互数据对应于每个所述第二数据质量字段的第二字段信息,所述第二数据质量字段为指示所述第二交互数据的质量的字段;According to at least one preset second data quality field, respectively acquire second field information corresponding to each of the second data quality fields of the second interaction data, where the second data quality field indicates the first Two fields for the quality of the interaction data;
    根据各所述第二字段信息判断所述第二交互数据是否存在数据质量问题;judging whether there is a data quality problem in the second interaction data according to the information in each second field;
    若所述第二交互数据存在数据质量问题,则根据各所述第二字段信息,对所述第二交互数据进行数据质量提升处理,以删除所述第二交互数据中的异常数据,在所述第二交互数据中补充缺失的数据,以及对所述第二交互数据进行数据格式转换;If there is a data quality problem in the second interaction data, perform data quality improvement processing on the second interaction data according to information in each of the second fields, so as to delete abnormal data in the second interaction data. Supplement missing data in the second interactive data, and perform data format conversion on the second interactive data;
    将经过数据质量提升处理的所述第二交互数据发送给所述工业生产系统中的第二数据接收端。Sending the second interaction data that has undergone data quality improvement processing to a second data receiving end in the industrial production system.
  6. 一种工业数据处理装置(600),用于对工业生产过程中产生的数据进行处理,所述装置包括:An industrial data processing device (600), used for processing data generated in an industrial production process, said device comprising:
    获取模块(601),用于获取工业生产过程中产生的现场数据;An acquisition module (601), configured to acquire on-site data generated during industrial production;
    检测模块(602),用于对所述获取模块(601)获取到的所述现场数据进行质量检测,获得用于指示所述现场数据的质量的数据质量信息;A detection module (602), configured to perform quality detection on the on-site data acquired by the acquisition module (601), and obtain data quality information indicating the quality of the on-site data;
    处理模块(603),用于根据所述检测模块(602)获取到的所述数据质量信息,对所述现场数据进行数据质量提升处理,以解决所述现场数据存在的质量问题;A processing module (603), configured to perform data quality improvement processing on the on-site data according to the data quality information acquired by the detection module (602), so as to solve quality problems existing in the on-site data;
    输出模块(604),用于将经过所述处理模块(603)的数据质量提升处理的所述现场数据存储到预先创建的数据库中。An output module (604), configured to store the on-site data processed by the processing module (603) for data quality improvement into a pre-created database.
  7. 一种电子设备(700),包括:处理器(702)、通信接口(704)、存储器(706)和通信总线(708),所述处理器(702)、所述存储器(706)和所述通信接口(704)通过所述通信总线(708)完成相互间的通信;An electronic device (700), comprising: a processor (702), a communication interface (704), a memory (706) and a communication bus (708), the processor (702), the memory (706) and the The communication interface (704) completes mutual communication through the communication bus (708);
    所述存储器(706)用于存储至少一可执行指令,所述可执行指令使所述处理器(702)执行如权利要求1-5中任一项所述的工业数据处理方法对应的操作。The memory (706) is used to store at least one executable instruction, and the executable instruction causes the processor (702) to execute the operation corresponding to the industrial data processing method according to any one of claims 1-5.
  8. 一种计算机可读存储介质,所述计算机可读存储介质上存储有计算机指令,所述计算机指令在被处理器执行时,使所述处理器执行权利要求1-5中任一项所述的方法。A computer-readable storage medium, where computer instructions are stored on the computer-readable storage medium, and when the computer instructions are executed by a processor, the processor executes the method described in any one of claims 1-5. method.
  9. 一种计算机程序,包括计算机可执行指令,所述计算机可执行指令在被执行时使至少一个处理器执行根据权利要求1-5中任一项所述的方法。A computer program comprising computer-executable instructions which, when executed, cause at least one processor to perform the method according to any one of claims 1-5.
  10. 一种计算机程序产品,所述计算机程序产品被有形地存储在计算机可读介质上并且包括计算机可执行指令,所述计算机可执行指令在被执行时使至少一个处理器执行根据权利要求1-5中任一项所述的方法。A computer program product tangibly stored on a computer-readable medium and comprising computer-executable instructions which, when executed, cause at least one processor to perform the any one of the methods described.
PCT/CN2021/121953 2021-09-29 2021-09-29 Industrial data processing method and apparatus, electronic device, and storage medium WO2023050229A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2021/121953 WO2023050229A1 (en) 2021-09-29 2021-09-29 Industrial data processing method and apparatus, electronic device, and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2021/121953 WO2023050229A1 (en) 2021-09-29 2021-09-29 Industrial data processing method and apparatus, electronic device, and storage medium

Publications (1)

Publication Number Publication Date
WO2023050229A1 true WO2023050229A1 (en) 2023-04-06

Family

ID=85781112

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/121953 WO2023050229A1 (en) 2021-09-29 2021-09-29 Industrial data processing method and apparatus, electronic device, and storage medium

Country Status (1)

Country Link
WO (1) WO2023050229A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116719876A (en) * 2023-08-11 2023-09-08 国网信息通信产业集团有限公司 Time sequence data processing method and terminal based on rule engine

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102566429A (en) * 2012-01-09 2012-07-11 上海交通大学 Petri network based high speed railway train working permission transmission and analysis system and application
EP3352013A1 (en) * 2017-01-23 2018-07-25 ASML Netherlands B.V. Generating predicted data for control or monitoring of a production process
CN109656913A (en) * 2018-12-20 2019-04-19 江苏昂内斯电力科技股份有限公司 Data acquisition abnormity based on Internet of Things mends trick method
CN111143438A (en) * 2019-12-30 2020-05-12 江苏安控鼎睿智能科技有限公司 Workshop field data real-time monitoring and anomaly detection method based on stream processing
KR102125320B1 (en) * 2019-11-29 2020-06-22 주식회사 아이티로 Providing method, apparatus and computer-readable medium of

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102566429A (en) * 2012-01-09 2012-07-11 上海交通大学 Petri network based high speed railway train working permission transmission and analysis system and application
EP3352013A1 (en) * 2017-01-23 2018-07-25 ASML Netherlands B.V. Generating predicted data for control or monitoring of a production process
CN109656913A (en) * 2018-12-20 2019-04-19 江苏昂内斯电力科技股份有限公司 Data acquisition abnormity based on Internet of Things mends trick method
KR102125320B1 (en) * 2019-11-29 2020-06-22 주식회사 아이티로 Providing method, apparatus and computer-readable medium of
CN111143438A (en) * 2019-12-30 2020-05-12 江苏安控鼎睿智能科技有限公司 Workshop field data real-time monitoring and anomaly detection method based on stream processing

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116719876A (en) * 2023-08-11 2023-09-08 国网信息通信产业集团有限公司 Time sequence data processing method and terminal based on rule engine
CN116719876B (en) * 2023-08-11 2023-10-20 国网信息通信产业集团有限公司 Time sequence data processing method and terminal based on rule engine

Similar Documents

Publication Publication Date Title
WO2022237507A1 (en) Intelligent server fault pushing method, apparatus, and device, and storage medium
CN108829560B (en) Data monitoring method and device, computer equipment and storage medium
US7873594B2 (en) System analysis program, system analysis method, and system analysis apparatus
US8612372B2 (en) Detection rule-generating facility
CN109800259B (en) Data acquisition method and device and terminal equipment
US7793151B2 (en) Dynamic partitioning of event patterns for determining symptoms
WO2023050229A1 (en) Industrial data processing method and apparatus, electronic device, and storage medium
CN112883031A (en) Industrial control asset information acquisition method and device
CN117692216A (en) Abnormal login behavior management method and device, storage medium and electronic equipment
JP6440810B1 (en) Data collection system, control device, control program, gateway device, and gateway program
WO2023224764A1 (en) Multi-modality root cause localization for cloud computing systems
US8429458B2 (en) Method and apparatus for system analysis
CN113315769B (en) Industrial control asset information collection method and device
CN115002243A (en) Data processing method and device
US20150277858A1 (en) Performance evaluation device, method, and medium for information system
CN114546780A (en) Data monitoring method, device, equipment, system and storage medium
JP6048555B1 (en) Classification information creation device, classification information creation method, classification information creation program, search device, search method, and search program
CN118200197B (en) Meteorological data cooperative transmission monitoring method and system
WO2022038812A1 (en) Information processing device
CN114064391A (en) Distributed alarm or event information processing method and device
US10805150B2 (en) Regenerative telemetry method for resource reduction
CN117615010A (en) Conversion method and device of API (application program interface) request data, electronic equipment and storage medium
CN114996355A (en) Data synchronization method and related equipment thereof
CN118761704A (en) Logistics supply chain visual monitoring and early warning method, device, equipment and medium
CN118467193A (en) Processing method and system for delay data

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21958816

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21958816

Country of ref document: EP

Kind code of ref document: A1