CN111258984B

CN111258984B - End-edge-cloud collaborative forecasting method for product quality in industrial big data environment

Info

Publication number: CN111258984B
Application number: CN202010051048.0A
Authority: CN
Inventors: 丁进良; 马宇飞; 刘长鑫; 柴天佑; 曾诚
Original assignee: Northeastern University China
Current assignee: Northeastern University China
Priority date: 2020-01-17
Filing date: 2020-01-17
Publication date: 2021-06-22
Anticipated expiration: 2040-01-17
Also published as: CN111258984A

Abstract

The invention provides an end-edge-cloud collaborative forecasting method for product quality in an industrial big data environment, and relates to the technical fields of industrial big data processing and intelligent modeling of complex industrial processes. The method uses industrial big data to train the forecasting model on a cloud server while continuously correcting the relevant parameters of the forecasting model on the edge server and the terminal servers, so that the results of the forecasting model are more accurate; at the same time, product quality is forecast in real time on the terminal servers. The invention makes effective use of the real-time data generated in the production process to continuously correct the parameters of the forecasting model, so that the model adapts to real-time changes in the product, thereby continuously improving forecasting accuracy and production benefit.

Description

Product quality end-edge-cloud collaborative forecasting method under industrial big data environment

Technical Field

The invention relates to the technical field of industrial big data processing and complex industrial intelligent modeling, in particular to a product quality end-edge-cloud collaborative forecasting method in an industrial big data environment.

Background

In recent years, as artificial intelligence has matured in both theory and technology, big data has been applied ever more widely, and relatively mature results have been obtained in fields such as medicine, electronic information, and image recognition. In the field of intelligent modeling of complex industrial processes, the application of industrial big data is very important, and product quality is an important index of whether an industrial production process is up to standard.

Although existing intelligent modeling algorithms can effectively process the high-dimensional data found in industrial big data and automatically mine the latent features hidden in production process data, most traditional algorithms are designed for static data sets and are difficult to apply to real-time systems. A forecasting model built this way reflects only the regularities hidden in historical data and cannot be corrected as small changes occur in the production process.

In an actual industrial field, production data accumulate continuously as production proceeds. If the sample data generated in real time could be used effectively, and the small changes in the data during production could be mined, the forecasting model could be improved continuously and its accuracy raised. However, traditional intelligent modeling methods need a large amount of training sample data for every round of model training and train slowly, so the model cannot be updated in real time. The problem to be solved is therefore how, as production advances, to make effective use of the data samples generated in real time, detect the slight changes in those samples, and at the same time save computing resources and time.

Disclosure of Invention

The invention aims to solve the technical problem of providing a product quality end-edge-cloud collaborative forecasting method under an industrial big data environment aiming at the defects of the prior art.

In order to solve the technical problems, the technical scheme adopted by the invention is as follows:

the invention provides a product quality end-edge-cloud collaborative forecasting method under an industrial big data environment, which comprises the following steps:

step 1: acquiring actual production process data of a product in an actual industrial field by using a sensor in the actual industrial field;

step 2: removing abnormal data samples and data samples containing missing values from the collected production process data by using a data cleaning algorithm to form an initial sample data set; preprocessing the data in the initial sample data set by using a data filling algorithm so that all samples have the same data dimension, and storing the preprocessed sample data in an edge database; establishing a cloud database on a cloud server and, when the number of samples in the edge database is greater than n, synchronizing the sample data in the edge database to the cloud database and emptying the edge database;

step 3: judging whether the total number of samples in the cloud database is greater than H; if not, returning to step 1; if so, selecting, on the cloud server, an intelligent modeling method suited to the characteristics of the production process and of the production process data of the product, and establishing forecasting models of product quality;

establishing w forecasting models on the cloud server, one for each of the w quality indexes of the product, to form a model library; the forecasting model established for the i-th quality index is:

$$\hat{y}_i = f_i(I, \theta_i)$$

wherein $I$ represents the preprocessed sample data input to the forecasting model, $\hat{y}_i$ represents the forecast value of the i-th quality index, $f_i(\cdot)$ represents the structure of the established forecasting model, and $\theta_i$ represents the parameter set of the established forecasting model;

based on the industrial production process, the data characteristics of the model input data, and an analysis of the correlation between the input data of the forecasting model and the quality index, $\theta_i$ is further divided into three parameter sets;

step 4: according to the actual production sequence of the product, extracting the most recent K samples from the cloud database to form a training set D, and recording the total number of data samples in the cloud database at this time as S; training all parameters of each forecasting model in the model library by using the sample data in the training set D, and recording the trained forecasting model library as $F^c = \{F_1^c, F_2^c, \dots, F_w^c\}$, wherein $F_i^c$ represents the forecasting model of the i-th quality index;

taking the production process data of each sample as input data and the i-th quality index data of the sample as label data, training the forecasting model of the i-th quality index on the cloud server to obtain $F_i^c$; that is, training all of the parameters in the parameter set $\theta_i$ of step 3;

step 5: transmitting the forecasting model library $F^c$ from the cloud server to the edge server; the edge server downloads the different forecasting models to different terminal servers for operation, and the user forecasts the different quality indexes of the product through the forecasting models on the terminal servers;

step 6: acquiring input data of a forecasting model after data cleaning and data preprocessing are carried out on actual production process data acquired from a sensor in an industrial field, transmitting the input data to all terminal servers, forecasting each quality index of a product on each terminal server by using the forecasting model respectively, and transmitting a forecasting result to a user;

step 7: when the production process of the product is finished, on each terminal server keeping the designated parameter sets of the corresponding forecasting model unchanged, wherein i ∈ {1, 2, …, w}, and using the actual production process data of the product to correct, in real time, the parameters in the remaining parameter set of each forecasting model to obtain a new forecasting model $F_i^t$, which replaces the original forecasting model on the terminal server and is used to forecast subsequent products; in the real-time correction, different correction methods are applied to the parameters in that parameter set depending on the industrial field and the modeling algorithm;

step 8: storing the actual production data and the quality index data of the product in the historical database of the edge end; checking the number of samples in the historical database of the edge end at this time; if the number of samples is less than n, returning to step 6 and continuing to forecast the quality indexes of subsequent products; if the number of samples is greater than n, proceeding to step 9;

step 9: extracting the production data of n products from the historical database of the edge end as a new training set d and, for each forecasting model in the forecasting model library, using the sample data in the training set d on the edge server to correct the corresponding parameter set of the model in real time to obtain a forecasting model $F_i^e$; all the corrected forecasting models form a new forecasting model library $F^e = \{F_1^e, F_2^e, \dots, F_w^e\}$;

step 10: using the edge server to download the forecasting models in $F^e$ to the corresponding terminal servers, where they replace the original forecasting models; the user calls the retrained forecasting models through the different terminal servers to forecast the product data of a new round; synchronizing the data samples in the edge database to the cloud database, emptying the edge database, and storing the product data of the new round of production in the edge database;

step 11: checking the number of samples in the cloud database at this time and judging whether the total number of samples has increased by N compared with S, where N is greater than n; if so, returning to step 4, updating the recorded total to S = S + N, and retraining the forecasting models in $F^c$; if not, returning to step 6 and forecasting product quality by using the forecasting models on the terminal servers.

Of the three parameter sets into which $\theta_i$ is divided in step 3, one describes the variation pattern of the data samples produced within one large batch; one describes the changes in the data characteristics of the data samples across different small-batch production processes; and one describes the data characteristics specific to each individual data sample; wherein the number of products produced in one large batch in the industrial production process is M, each large batch is divided into r small batches, and the number of products produced in each small batch is m.

The beneficial effects of the above technical scheme are as follows: in the product quality end-edge-cloud collaborative forecasting method under an industrial big data environment provided by the invention, the parameters of the established forecasting model are divided into three types: parameters sensitive to changes in a large number of data samples, parameters sensitive to changes in a small number of data samples, and parameters sensitive to changes in a single data sample; the parameters of the forecasting model are continuously trained and updated by the end-edge-cloud collaborative forecasting method. At the same time, the invention makes effective use of the real-time data generated in the production process to continuously correct the parameters of the forecasting model, so that the model adapts to real-time changes in the product, thereby continuously improving forecasting accuracy and production benefit.

Drawings

Fig. 1 is a structural block diagram of a product quality end-edge-cloud collaborative prediction method in an industrial big data environment according to an embodiment of the present invention;

fig. 2 is a flowchart of a product quality end-edge-cloud collaborative forecasting method in an industrial big data environment according to an embodiment of the present invention.

Detailed Description

The following detailed description of embodiments of the present invention is provided in connection with the accompanying drawings and examples. The following examples are intended to illustrate the invention but are not intended to limit the scope of the invention.

As shown in fig. 2, the method of the present embodiment is as follows.

The invention provides a product quality end-edge-cloud collaborative forecasting method under an industrial big data environment, which is characterized in that a forecasting model is trained on a cloud server by utilizing industrial big data, and related parameters in the forecasting model are continuously corrected on an edge end server and a terminal server at the same time, as shown in figure 1, so that the result of the forecasting model is more accurate. The method for intelligently forecasting the quality of the steel plate product in the embodiment comprises the following steps:

step 1: acquiring actual production process data of a steel plate product in an industrial field by using a sensor in the industrial field of the steel plate;

step 2: removing abnormal data samples and data samples containing missing values from the collected production process data by using a data cleaning algorithm to form an initial sample data set; then, because different types of steel plate yield sample data of different dimensions, preprocessing the data in the initial sample data set by using a data filling algorithm to obtain sample data of identical dimension, and storing the preprocessed sample data in an edge database. The preprocessed sample data serve as the input data of the product quality forecasting model and the final quality label data of the product serve as its output data, and together they are used to construct the product quality forecasting model. Filler data that carry no information are added around the input data; this does not affect the final forecasting result and unifies the input dimensions of all samples. A cloud database is established on the cloud server to store all the preprocessed sample data, which makes it convenient to analyze and model the data of the industrial production process. When the number of samples in the edge database is greater than n, the sample data in the edge database are synchronized to the cloud database and the edge database is emptied; the edge database on the edge server stores the sample data generated during small-batch production.
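The cleaning, filling, and edge-to-cloud synchronization described in step 2 can be sketched as follows. This is only an illustrative sketch: the function and class names, the range test used to flag abnormal samples, and the choice to append zeros as filler are assumptions, since the text does not specify the cleaning or filling algorithm.

```python
import math

def clean_samples(samples, low, high):
    """Drop samples with missing values (None/NaN) or values outside [low, high]."""
    cleaned = []
    for s in samples:
        if any(v is None or (isinstance(v, float) and math.isnan(v)) for v in s):
            continue  # sample contains a missing value
        if any(v < low or v > high for v in s):
            continue  # treated as abnormal in this sketch (assumed rule)
        cleaned.append(list(s))
    return cleaned

def pad_samples(samples, target_dim, fill=0.0):
    """Append filler values so every sample has target_dim entries.

    Assumes target_dim is at least the length of the longest sample.
    """
    return [s + [fill] * (target_dim - len(s)) for s in samples]

class EdgeDatabase:
    """Buffers samples at the edge and flushes them to the cloud past n rows."""

    def __init__(self, n):
        self.n = n
        self.rows = []

    def store(self, rows, cloud_rows):
        self.rows.extend(rows)
        if len(self.rows) > self.n:
            cloud_rows.extend(self.rows)  # synchronize to the cloud database
            self.rows.clear()             # then empty the edge database
```

Here `cloud_rows` is a plain list standing in for the cloud database; a real deployment would replace it with a database client.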

step 3: judging whether the total number of samples in the cloud database is greater than H; if not, returning to step 1; if so, selecting on the cloud server a suitable intelligent modeling method, such as a convolutional neural network, a graph neural network, or a random forest, according to the characteristics of the steel plate production process and of the steel plate production process data; in this embodiment, a modeling method combining mechanism and data is selected to establish the forecasting model of product quality;

Five quality evaluation indexes are used to describe whether a steel plate product is qualified, namely the size, surface, shape, internal quality, and performance of the steel plate. Therefore, on the cloud server, a forecasting model is established for each quality index to form a model library.

The forecasting model established for the i-th quality index is:

$$\hat{y}_i = f_i(I, \theta_i)$$

wherein $I$ represents the preprocessed sample data input to the forecasting model, $\hat{y}_i$ represents the forecast value of the i-th quality index given by the forecasting model, $f_i(\cdot)$ represents the structure of the established forecasting model, and $\theta_i$ represents the parameter set of the established forecasting model;

based on the steel plate production process, the data characteristics of the model input data, and an analysis of the correlation between the input data of the forecasting model and the quality index, $\theta_i$ is further divided into three parameter sets.

In the industrial production process, the number of products produced in one large batch is M, each large batch is divided into r small batches, and the number of products produced in each small batch is m; one of the parameter sets describes the data characteristics specific to each individual data sample;

the correlation analysis in this embodiment may use a typical correlation analysis algorithm such as the maximal information coefficient (MIC) or the Pearson correlation coefficient;
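As an illustration of the correlation analysis mentioned here, the Pearson correlation coefficient between each input variable and one quality index can be computed as below. This is a sketch under assumptions: the function name is hypothetical, every input column is assumed to be non-constant, and the text does not say how the resulting coefficients are used to assign parameters to the three sets.

```python
import numpy as np

def pearson_by_feature(X, y):
    """Pearson correlation of each input column of X with the quality index y."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    Xc = X - X.mean(axis=0)  # center each input column
    yc = y - y.mean()        # center the quality index
    denom = np.sqrt((Xc ** 2).sum(axis=0) * (yc ** 2).sum())
    return (Xc * yc[:, None]).sum(axis=0) / denom
```

Columns with a coefficient near +1 or -1 are strongly linearly related to the quality index; MIC would additionally capture non-linear dependence but requires a dedicated library.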

The parameters in the first parameter set mainly describe the variation pattern of the input data of products produced within one large batch and are used to mine the features implicit in a large number of data samples; they are the main parameters of the forecasting model. The input data corresponding to these parameters are often very complex, so more data samples and a more complex model structure are needed to extract the implicit features, and these parameters do not change appreciably over a long period of production. Examples include the temperature variation pattern of the steel plate during production and the equipment information of the production line.

The parameters in the second parameter set mainly describe the slight changes in the data characteristics of the input data across different small-batch production processes. These parameters do not change during the production of a given batch but change from one small batch to another; they usually suit the data samples of one small batch but not necessarily those of other batches. Examples include the type of steel plate, the production mode, the heating temperature of the heating furnace, and the gas supply amount within one small batch.

The parameters in the third parameter set mainly describe the data characteristics specific to each data sample. These parameters are sensitive to changes in the data sample and change with it, and the corresponding input data are often specific to a single sample. Examples include the amount of roll wear during steel plate production and the cooling water temperature during cooling.
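One way to picture a forecasting model whose parameters are split into three sets is the sketch below. It is purely illustrative: the linear model form, the field names `large_batch`, `small_batch`, and `per_sample`, and the helper `build_model_library` are assumptions and are not taken from the text, which leaves the model structure to the chosen modeling method.

```python
from dataclasses import dataclass
import numpy as np

@dataclass
class PartitionedModel:
    large_batch: np.ndarray  # weights capturing slow, large-batch regularities
    small_batch: np.ndarray  # adjustment that varies between small batches
    per_sample: float = 0.0  # offset that tracks individual samples

    def predict(self, x):
        """Forecast one quality index from a preprocessed input vector x."""
        return float(x @ (self.large_batch + self.small_batch) + self.per_sample)

def build_model_library(quality_indexes, dim):
    """Create one untrained model per quality index (the model library)."""
    return {q: PartitionedModel(np.zeros(dim), np.zeros(dim)) for q in quality_indexes}
```

Keeping the three groups in separate fields makes it straightforward to retrain one group while leaving the others untouched.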

step 4: according to the actual production sequence of the product, extracting the most recent K = 15000 samples from the cloud database to form a training set D, and recording the total number of data samples in the cloud database at this time as S = 28700; the first round of training starts from the model library of 5 forecasting models established in step 3: each forecasting model in the library is trained with the sample data in the training set D, yielding a forecasting model library covering every quality index, recorded as $F^c = \{F_1^c, F_2^c, \dots, F_5^c\}$, wherein $F_i^c$ represents the forecasting model of the i-th quality index;

taking the production process data of each sample as input data and the i-th quality index data of the sample as label data, training the forecasting model of the i-th quality index on the cloud server to obtain $F_i^c$; that is, training all of the parameters in the parameter set $\theta_i$ of step 3;
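Selecting the training set D in step 4 amounts to ordering the cloud samples by production sequence and keeping the most recent K. A minimal sketch, assuming each sample is a dictionary carrying a hypothetical `seq` field for its production order and that K is positive:

```python
def latest_training_set(cloud_rows, k):
    """Return (D, S): the k most recent samples and the total sample count S."""
    ordered = sorted(cloud_rows, key=lambda row: row["seq"])  # production order
    return ordered[-k:], len(ordered)
```

With the embodiment's values this would be called as `latest_training_set(cloud_rows, 15000)`, and the returned count S is kept for the later retraining check.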

step 5: transmitting the forecasting model library $F^c$ from the cloud server to the edge server; the edge server downloads the different forecasting models to different terminal servers for operation, and the user forecasts the different quality indexes of the product through the forecasting models on the terminal servers;

step 6: performing data cleaning and data preprocessing on the actual production process data acquired from the sensors in the steel plate production field to obtain the input data of the forecasting models, transmitting the input data to all terminal servers, forecasting each quality index of the product on each terminal server by using the corresponding forecasting model, and transmitting the forecasting results to the user, who makes decisions about the production process according to the forecasting results so as to improve the qualification rate of the product;
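The distribution of models to terminal servers and the per-terminal forecasting of steps 5 and 6 can be sketched as follows. The names `deploy` and `forecast_all`, the terminal identifiers, and the representation of a model as a plain callable are all assumptions made for illustration; network transfer is omitted.

```python
def deploy(model_library, terminals):
    """Pair each forecasting model with one terminal server."""
    if len(model_library) != len(terminals):
        raise ValueError("this sketch assumes one terminal per model")
    return {t: item for t, item in zip(terminals, model_library.items())}

def forecast_all(deployment, x):
    """Run every deployed model on the same preprocessed input x."""
    return {index: model(x) for index, model in deployment.values()}
```

Each terminal holds a `(quality_index, model)` pair, so the same cleaned input can be broadcast to all terminals and the results collected per quality index.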

step 7: when the production process of the steel plate is finished, on each terminal server keeping the designated parameter sets of the corresponding forecasting model unchanged and correcting the parameters in the remaining parameter set;
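The terminal-side correction is not spelled out in the text, which leaves the correction method to the industrial field and modeling algorithm. As one possible illustration, the sketch below assumes that the corrected parameter is a single additive offset, that all other parameters are frozen inside a hypothetical `predict_core` function, and that the offset is moved a fraction `rate` of the way toward the observed forecast error once the true quality value is known.

```python
def correct_offset(predict_core, offset, x, y_true, rate=0.5):
    """Return an updated offset after one finished product is observed.

    predict_core: frozen part of the model (assumed, not corrected here)
    offset:       the single parameter corrected at the terminal (assumed)
    """
    y_hat = predict_core(x) + offset          # forecast with current offset
    return offset + rate * (y_true - y_hat)   # move toward the observed error
```

For example, with `predict_core = lambda x: 2 * x`, an offset of 0, input 1 and true value 3, the forecast is 2 and the corrected offset is 0.5, which halves the error on that sample.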

step 8: storing the actual production data and the quality index data of the steel plate in the historical database of the edge end; checking the number of samples in the historical database of the edge end at this time; if the number of samples is less than n = 1000, returning to step 6 and continuing to forecast the quality indexes of subsequent products; if the number of samples is greater than n = 1000, proceeding to step 9;

step 9: extracting the production data of 1000 products from the historical database of the edge end as a new training set d and, for each forecasting model in the forecasting model library, using the sample data in the training set d on the edge server to correct the corresponding parameter set of the model. Because the parameters describing large-batch regularities do not change with the small-batch data samples arising in production, that parameter set is held fixed in the forecasting model, and the remaining parameter set of each forecasting model is corrected on the edge server using the data samples in the training set d, yielding a new forecasting model $F_i^e$ and hence a new forecasting model library. Compared with $F_i^c$, $F_i^e$ differs only in the corrected parameter set, and the number of samples in the training set d is much smaller than that in the training set D, so training $F_i^e$ consumes fewer computing resources and is faster.
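The idea of correcting only part of the parameters on the edge server while holding the large-batch parameters fixed can be illustrated with a small gradient-descent sketch. The linear model, the names `w_large` and `w_small`, the mean-squared-error loss, and the learning-rate settings are all assumptions chosen for illustration; the text does not prescribe a training algorithm.

```python
import numpy as np

def edge_correct(w_large, w_small, X, y, lr=0.1, epochs=200):
    """Fit w_small on the small training set (X, y) with w_large frozen.

    Model (assumed): y_hat = X @ (w_large + w_small).
    Only w_small receives gradient updates; w_large is never modified.
    """
    w_small = w_small.copy()
    for _ in range(epochs):
        residual = X @ (w_large + w_small) - y
        grad = 2.0 * X.T @ residual / len(y)  # gradient of MSE w.r.t. w_small
        w_small -= lr * grad
    return w_small
```

Because only the small parameter group is updated and the training set d is small, each correction round touches far fewer parameters and samples than the full cloud-side training.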

step 11: checking the number of samples in the cloud database at this time and judging whether the total number of samples has increased by N = 10000 compared with S (N is much greater than n); if so, returning to step 4, updating the recorded total to S = S + N, and retraining the forecasting models in $F^c$; if not, returning to step 6 and forecasting product quality by using the forecasting models on the terminal servers.
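The two count-based branches of the method (the edge-side check against n in step 8 and the cloud-side check against N in step 11) can be written as small decision functions. The function names are hypothetical, and two boundary choices are assumptions: the text does not say what happens when the edge count equals n exactly (the sketch returns to step 6), and "increased by N" is read as "increased by at least N".

```python
def edge_branch(edge_count, n=1000):
    """Step 8: return the number of the next step to execute (6 or 9)."""
    return 9 if edge_count > n else 6

def cloud_retrain_check(cloud_total, s_recorded, big_n=10000):
    """Step 11: return (retrain?, updated S)."""
    if cloud_total - s_recorded >= big_n:
        return True, s_recorded + big_n  # S = S + N, then retrain in step 4
    return False, s_recorded             # keep forecasting from step 6
```

With the embodiment's values S = 28700 and N = 10000, cloud retraining is triggered once the cloud database holds 38700 samples.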

Finally, it should be noted that the above examples are intended only to illustrate the technical solution of the present invention, not to limit it. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art will understand that the technical solutions described in the foregoing embodiments may still be modified, or some or all of their technical features may be equivalently replaced, and that such modifications and substitutions do not cause the corresponding technical solutions to depart from the scope of the present invention as defined in the appended claims.

Claims

1. A product quality end-edge-cloud collaborative forecasting method under an industrial big data environment, characterized by comprising the following steps:

Step 1: Use the sensors in the actual industrial site to collect the actual production process data of the products in the industrial site;

Step 2: Use a data cleaning algorithm to remove abnormal data samples and data samples containing missing values from all the collected production process data to form an initial sample data set; use a data filling algorithm to preprocess the data in the initial sample data set so that all data dimensions are the same, and store the preprocessed sample data in the edge database; establish a cloud database on the cloud server; when the number of samples in the edge database is greater than n, synchronize the sample data in the edge database to the cloud database and clear the data samples in the edge database at the same time;

Step 3: Determine whether the total number of data in the cloud database is greater than H; if not, go to step 1; if so, select an intelligent modeling method on the cloud server according to the characteristics of the product production process and production process data, and establish a product quality forecast model;

According to the w quality indicators of the product, w forecast models are respectively established on the cloud server to form a model library; the forecast model established for the i-th quality index is as follows:

$$\hat{y}_i = f_i(I, \theta_i)$$

where $I$ represents the preprocessed sample data input to the forecast model, $\hat{y}_i$ represents the forecast value of the i-th quality index, $f_i(\cdot)$ represents the structure of the established forecast model, and $\theta_i$ represents the parameter set of the established forecast model;

According to the industrial production process, the data characteristics of the model input data, and the analysis of the correlation between the input data of the forecast model and the quality indicators, $\theta_i$ is divided into three parameter sets;

Step 4: According to the actual production order of the product, extract the latest K sample data from the cloud database to form a training set D, and record the total number of data samples in the cloud database at this time as S; use the sample data in the training set D to train all parameters in each forecast model in the model library, and record the trained forecast model library as $F^c = \{F_1^c, F_2^c, \dots, F_w^c\}$, where $F_i^c$ represents the forecast model of the i-th quality index;

Taking the production process data of the sample as input data and the i-th quality index data of the sample as label data, train the forecast model of the i-th quality index on the cloud server to obtain $F_i^c$; that is, train all parameters in the parameter set $\theta_i$ of step 3;

Step 5: Transfer the forecast model library $F^c$ from the cloud server to the edge server; the edge server downloads different forecast models to different terminal servers for operation, and the user forecasts the different quality indicators of the product separately through the forecast models in the terminal servers;

Step 6: Perform data cleaning and data preprocessing on the actual production process data collected from the sensors on the industrial site to obtain the input data of the forecast models, transmit the input data to all terminal servers, use the forecast model on each terminal server to forecast each quality index of the product, and transmit the forecast results to the user;

Step 7: When the production process of the product ends, on each terminal server, keep the designated parameter sets of the corresponding forecast model unchanged, where i∈{1,2,…,w}, and use the actual production process data of the product to correct, in real time, the parameters in the remaining parameter set of each forecast model to obtain a new forecast model $F_i^t$, which replaces the original forecast model in the terminal server and is used to forecast subsequent products; in the real-time correction, different correction methods are applied to the parameters in that parameter set depending on the industrial field and the modeling algorithm;

Step 8: Store the actual production data and the quality index data of the product in the historical database of the edge end; check the number of all samples in the historical database of the edge end at this time; if the number of samples is less than n, go to step 6 and continue to forecast the quality index of subsequent products; if the number of samples is greater than n, go to step 9;

Step 9: Extract the production data of n products from the historical database at the edge as a new training set d; on the edge server, for each forecast model in the forecast model library, use the sample data in the training set d to correct the corresponding parameter set of the model in real time to obtain the forecast model $F_i^e$, and combine all the corrected forecast models into a new forecast model library $F^e = \{F_1^e, F_2^e, \dots, F_w^e\}$;

Step 10: Use the edge server to download the forecast models in $F^e$ to the corresponding terminal servers respectively, and replace the original forecast models; the user calls the retrained forecast models through different terminal servers to forecast the product data of a new round; synchronize the data samples in the edge database to the cloud database, clear the data information in the edge database, and store the new round of product data in the edge database;

Step 11: Determine the number of samples in the cloud database at this time, and determine whether the total number of samples in the cloud database has increased by N samples compared to S, where N is greater than n; if so, return to step 4, update the recorded total number of samples in the cloud database to S = S + N, and retrain the forecast models in $F^c$; if not, return to step 6 and use the forecast models on the terminal servers to forecast the product quality.

2. The end-edge-cloud collaborative forecasting method for product quality under an industrial big data environment according to claim 1, wherein: of the three parameter sets into which $\theta_i$ is divided in step 3, one parameter set is used to describe the variation pattern of the data samples produced in one large batch; one parameter set is used to describe the changes in the data characteristics of data samples during different small-batch production processes; and one parameter set is used to describe the unique data features included in each data sample; wherein the number of products produced in one large batch in the industrial production process is M, each large batch is divided into r small batches, and the number of products produced in each small batch is m.