CN114035529B

CN114035529B - ATL-BMA-based nonlinear industrial process low-cost modeling method

Info

Publication number: CN114035529B
Application number: CN202111411517.6A
Authority: CN
Inventors: 褚菲; 朱安强; 丁珮宽; 陆宁云; 熊刚; 王军; 王福利; 马小平
Original assignee: China University of Mining and Technology CUMT
Current assignee: China University of Mining and Technology CUMT
Priority date: 2021-11-25
Filing date: 2021-11-25
Publication date: 2023-09-08
Anticipated expiration: 2041-11-25
Also published as: CN114035529A

Abstract

The invention provides a nonlinear industrial process low-cost modeling method based on ATL-BMA, which comprises the steps of selecting N groups of similar old process modeling data; collecting a new process modeling initial dataset; dividing new and old process data into two parts respectively, and carrying out normalization processing respectively; converting N groups of old process data into N groups of old process data with new process information, mixing the N groups of old process data with corresponding old process data to obtain N groups of mixed data sets, and training a support vector machine model to obtain N old process basic models with the new process information; mapping the input variables of the new process training set into the operation interval of the input variables of the similar old process, and obtaining the fusion output of the N prediction models; and taking the fusion output of the old process SVM model and the new process input data as the input data of the multi-model migration strategy, and training to obtain a new process model. The method can effectively solve the problems of high modeling cost, limited acquired modeling data and long modeling period of the complex industrial process.

Description

ATL-BMA-based nonlinear industrial process low-cost modeling method

Technical Field

The invention belongs to the technical field of industrial process construction performance prediction models, and particularly relates to a nonlinear industrial process low-cost modeling method based on ATL-BMA.

Background

Modern industrial processes are moving toward upsizing, high efficiency and integration in order to meet the demands of the market for products with multiple specifications, multiple varieties and high quality. On the one hand, with the gradual expansion of the production scale, new industrial production processes are added in the actual production process continuously to meet different product requirements, which leads to the higher and higher complexity of the actual industrial production process. On the other hand, changes in the operating environment and increases in the operating time can cause changes in the characteristics of the actual industrial process. Both of these aspects result in variable characteristics of the process data. Modeling an industrial process with a data-driven approach in this case requires solving a troublesome problem: due to various factors such as cost, modeling data obtained from a new industrial process are seriously insufficient, an accurate process prediction model cannot be established by using a data-driven modeling method under the support of a small amount of modeling data, and meanwhile, the generalization capability of the obtained model is low. In the face of this, it is desirable that existing long run-time industrial process data or knowledge can assist in guiding the creation of predictive models of new industrial processes. Although the characteristics of the operation data of the new and old industrial processes are different to a certain extent, the physical and chemical mechanism which is followed by the inside of the process is unchanged or very similar, so that the new industrial process data and the old industrial process data have the same or similar characteristic space and label space (the dimensions of the input and output data of the new industrial process data are consistent). As shown in fig. 1, the new and old industrial processes can be treated as a target domain and a source domain, respectively, and then the old industrial process data is used to assist in building a new industrial process prediction model through a transfer learning method. However, when the source domain data is far more than the target domain data, the phenomenon of 'negative migration' is easy to occur when the source domain data is used for carrying out supplementary learning on the target domain data under the traditional migration learning structure.

Disclosure of Invention

Aiming at the problems existing in the prior art, the invention provides a nonlinear industrial process low-cost modeling method based on ATL-BMA, which can effectively solve the problems of high modeling cost, limited acquired modeling data and long modeling period of a complex industrial process, and can solve the phenomenon of negative migration which can occur when old process data are far more than new process data in migration learning.

In order to achieve the above object, the present invention provides a method for modeling a nonlinear industrial process at low cost based on an ATL-BMA, comprising the steps of:

step 1: selecting N groups of similar old industrial process modeling data, and determining a stable operation range of an input variable according to information of an actual process to be modeled; meanwhile, a Latin hypercube method is selected for sampling and collecting an initial data set of modeling of a target nonlinear industrial process; the method comprises the following specific steps:

step 1.1: selecting N groups of similar old industrial process modeling data, and recording the data asModeling an ith old industrial process according to equation (1);

where X and X are old industrial process input data, y is old industrial process output data, k _i Representing the modeling data volume of the ith old industrial process, wherein n is the dimension of an input variable of the ith old industrial process, and the dimension of the input variable of all industrial processes is consistent and n is equal because of certain similarity of the new industrial process and the old industrial process;

step 1.2: according to the information of the actual process to be modeled, determining the stable operation range of the input variable, selecting discrete sparse data distribution points for sampling and collecting new industrial process modeling data, and obtaining collected new industrial process data D according to a formula (2) _new ；

Wherein, l represents the modeling data amount of the new industrial process;

step 2: dividing new industrial process data and old industrial process data into two parts, namely a training data set and a testing data set in a new modeling process and an old modeling process respectively, and carrying out normalization processing on an initial data set of the new industrial process and the modeling data of the old industrial process respectively; wherein, the liquid crystal display device comprises a liquid crystal display device,for new industrial process data, it is divided into new industrial process training data setsAnd new industrial process test data set +.>And mapping the data to [0,1 ] using equation (3)]A section;

wherein z is _i Representing the results after normalization of the input or output data of an industrial process, x _i Is the data before normalization, x _max Is the maximum value before data normalization, x _min Is the minimum value;

step 3: converting N groups of old industrial process data into N groups of old industrial process data with new industrial process information by using a new and old industrial process data migration algorithm based on a Cycle GANs; wherein the old industrial process training data set isThe method comprises the following specific steps:

step 3.1: initializing parameters: g parameter θ _G ，D _o Parameter omega _o F parameter θ _F ，D _n Parameter omega _n ，n _critic ＝5，α＝0.00005、β ₁ ＝0、β ₂ ＝0.7，m＝5，λ＝0.5，Epoch＝20000；

Wherein: g represents a generator function of old industrial process to new industrial process data, D _o Representing the corresponding discriminators of the old industrial process, F representing the generator function from the new industrial process to the old industrial process, D _n Indicating the corresponding discriminant of the new industrial process, n _critic Representing the number of training discrimination models, alpha, beta, after training a generator ₁ And beta ₂ The parameters of the Adam optimizer are that m is the sampling number and Epoch is the model cycle training times;

step 3.2: will be from the first through generator Gi old industrial process dataM samples taken from (1)Conversion to m new industrial process data, denoted X _o→n ＝F(X _o ) The method comprises the steps of carrying out a first treatment on the surface of the From the new industrial process data +.>M samples collected in->Conversion to m old industrial process data, denoted X _n→o ＝F(X _n )；

Step 3.3: obtaining a discriminator loss and two forward circulation consistent losses according to a formula (4) and a formula (5);

step 3.4: updating the discriminant D by equation (6) and equation (7) _o Parameter omega _o And D _n Parameter omega _n ；

Step 3.5: repeating the steps 3.2 to 3.4n _critic Secondary times;

step 3.6: repeating the step 3.2;

step 3.7: calculating two forward loop coincidence losses through a formula (8) and a formula (9);

step 3.8: calculating two generator losses by equation (10) and equation (11);

step 3.9: updating the generator G parameter θ by equation (12) and equation (13) _G And F parameter θ _F ；

Step 3.10: repeating the steps 3.6-3.9 Epoch times, and converting the new industrial process data into the j old industrial process data by using the trained F, and recording as

Step 3.11: repeating the steps 3.1-3.9 by using each set of old industrial process data, transferring the new industrial process data into the old industrial process domain, obtaining N sets of old industrial process data with new industrial process information by the new industrial process data through anti-transfer learning, and recording as

Step 4: mixing old industrial process data with new industrial process information in the step 3 with corresponding old industrial process data to obtain N groups of mixed data sets;

step 5: dividing a hybrid dataset into hybrid training setsAnd hybrid test dataset->At the same time, N old industrial process training data sets are combined +.>And a new industrial process prediction model y=f (x), training support vector machine SVM models by using N groups of mixed data sets respectively to obtain N old industrial process basic models with new industrial process information, and marking as f ₁ (·)-f _N (. Cndot.); wherein (1)>k _train Is training data set size, +.>k _test Is the test data set size, any ith old industrial process, < >>n _i Is the ith old industrial process training set size; the method comprises the following specific steps:

step 5.1: initializing parameters;

step 5.2: converting new industrial process data into N groups of old industrial process data carrying new industrial process information through new and old industrial process data migration algorithm based on Cycle GANsMixing D according to equation (14) _n→o And D _o Obtaining N groups of basic model training data D _Basic ；

Step 5.3: by D _Basic Training N SVMs to obtain N old industrial process basic models with new industrial process information, denoted as f ₁ (·)-f _N (·)；

Step 6: mapping the input variables of the new industrial process training set into the operation intervals of the input variables of the similar old industrial process through a model fusion formula (15), and recording the input data of the converted new industrial process training set asFusion output of the N prediction models is obtained through Bayesian model averaging algorithm>

Step 7: fusion output of SVM model of old industrial processAnd new industrial process input data->As input data of a multi-model migration strategy, training a new industrial process model by using a least square support vector machine algorithm to obtain new industrial process model output +.>Completing modeling of a new industrial process;

step 8: model verification, namely evaluating the effectiveness of the SVM model by utilizing root mean square error and a determination coefficient according to a formula (16) and a formula (17), and completing a modeling process if the prediction accuracy of the model obtained in the step 7 on a test data set meets an experimental set threshold value; otherwise, repeating the steps 3 to 7, adding new N groups of old industrial process data samples containing new industrial process information into the mixed samples, and continuing training the new industrial process model until the experimental stop condition is met;

where N is the number of test data, y _i Is the output of the predictive model and,is the average value of the predicted output, Y _i Is the true output of the new industrial process.

The method comprises the steps of firstly, collecting a small sample data set of nonlinear industrial process modeling by using a Latin hypercube method, combining a plurality of similar old process data, and learning a conversion mapping function between new industrial process data and old industrial process data by using an contrast migration algorithm, so that a small amount of new process data is converted into multiple types of old industrial process data with new industrial process information; then a plurality of 'old process models with new process information' are obtained through a support vector machine regression algorithm, and a foundation is established for modeling of a subsequent new industrial process; and finally, migrating a plurality of trained old industrial process prediction models with new industrial process information by using a multi-model migration strategy and a Bayesian model average theory, and combining a small amount of new industrial process data to obtain a final new industrial process performance prediction model. The invention transfers the useful information of a plurality of existing similar old industrial processes to help to establish a new industrial process performance prediction model, and reduces the modeling cost of the new industrial process; meanwhile, in order to effectively solve the problem of 'negative migration' which can occur when old process data is far more than new process data, a new and old process data migration method based on anti-migration learning is adopted, and migration modeling effect is improved. The method effectively solves the problems of high modeling cost and long modeling period of the complex industrial process, fully utilizes the useful information of the existing similar old industrial process model, simultaneously solves the problem of 'negative migration' possibly occurring when old process data are far more than new process data in migration learning, completes the modeling of the new industrial process, reduces the modeling cost, accelerates the modeling speed and improves the modeling precision.

Drawings

FIG. 1 is a flow chart of migration modeling;

FIG. 2 is a flow chart of a non-linear industrial process low-cost modeling method based on challenge transfer learning and Bayesian model averaging theory;

FIG. 3 is a graph of predicted values of the ATL-BMA model, and SVM model over a compressor A test set;

FIG. 4 is a RMSE histogram of ATL-BMA model, and SVM model predictions versus true values;

FIG. 5 is an R of predicted and actual values of ATL-BMA model, BMA model and SVM model ² A histogram.

Detailed Description

The invention is further described below with reference to examples and figures.

As shown in fig. 1 to 5, the present invention provides a non-linear industrial process low-cost modeling method based on ATL-BMA (anti-migration learning (Adversarial Transform Learning, ATL) and bayesian model averaging (Bayesian Model Averaging, BMA)), comprising the steps of:

step 1: selecting N groups of similar old industrial process modeling data, and determining a stable operation range of an input variable according to information of an actual process to be modeled; meanwhile, selecting a Latin super-vertical (Latin Hypercube Design, LHD) method to sample and collect a modeling initial data set of a target nonlinear industrial process (new industrial process); the information of the actual process to be modeled comprises parameter rated values, performance curves and the like; the method comprises the following specific steps:

where X is the old industrial process input data set, X is the old industrial process input data, y is the old industrial process output data, k _i Representing the modeling data volume of the ith old industrial process, wherein n is the dimension of an input variable of the ith old industrial process, and the dimension of the input variable of all industrial processes is consistent and n is equal because of certain similarity of the new industrial process and the old industrial process;

Wherein, l represents the modeling data amount of the new industrial process;

step 2: dividing the new industrial process data and the old industrial process data into two parts, namely a training data set and a testing data set in the new and old modeling processes respectively; in order to avoid adverse effects caused by data dimension differences in the stability of the subsequent training process, it is necessary to ensure that the data are normalized, and respectively performing normalization processing on the new industrial process initial data set and the old industrial process modeling data; wherein, for new industrial process data, it is divided into new industrial process training data setsAnd new industrial process test data set +.>And mapping the data to [0,1 ] using a maximum minimum data normalization method according to equation (3)]A section;

step 3.2: from the ith old industrial process data by generator GM samples taken from (1)Conversion to m new industrial process data, denoted X _o→n ＝F(X _o ) The method comprises the steps of carrying out a first treatment on the surface of the From the new industrial process data +.>M samples collected in->Conversion to m old industrial process data, denoted X _n→o ＝F(X _n )；

Step 3.5: repeating the steps 3.2 to 3.4n _critic Secondary times;

step 3.6: repeating the step 3.2;

step 3.8: calculating two generator losses by equation (10) and equation (11);

step 5: dividing a hybrid dataset into hybrid training setsAnd hybrid test dataset->At the same time, N old industrial process training data sets are combined +.>And a new industrial process prediction model y=f (x), training a support vector machine SVM (Support Vector Machine) model by using N groups of mixed data sets respectively to obtain N old industrial process basic models with new industrial process information, and marking as f ₁ (·)-f _N (. Cndot.); wherein, the liquid crystal display device comprises a liquid crystal display device,

k _train is the size of the training data set and,k _test is the test data set size, any ith old industrial process,n _i is the ith old industrial process training set size; the method comprises the following specific steps:

step 5.1: initializing parameters;

Step 7: fusion output of SVM model of old industrial processAnd new industrial process input data->As input data of a multi-model migration strategy, training a new industrial process model by utilizing a least squares support vector machine (Least Squares Support Vector Machine, LSSVM) algorithm to obtain a new industrial process model output ∈>Completing modeling of a new industrial process;

step 8: model verification using root mean Square error (Root Mean Square Error, RMSE) and coefficient determination (R-Square, R) according to equation (16) and equation (17), respectively ² ) Evaluating the effectiveness of the SVM model, and if the prediction accuracy of the model obtained in the step 7 on the test data set meets the experimental set threshold, completing the modeling process; otherwise, repeating the steps 3 to 7, adding new N groups of old industrial process data samples containing new industrial process information into the mixed samples, and continuing training the new industrial process model until the experimental stop condition is met;

To verify the effectiveness of the method, experimental data was generated using a laboratory centrifugal compressor mechanism model, and a performance prediction model of the centrifugal compressor was built to verify the effectiveness of the proposed modeling method. Four different but similar compressor models were generated A, B, C, D for simulation experiments by modifying the key geometric parameter simulations of the compressor mechanism model. For A, B, C and D four centrifugal compressors, where compressor a was the new compressor to be modeled, a small amount of new industrial process modeling data was generated, while B, C and D centrifugal compressors were old compressors with long run times, generating a large amount of old industrial process modeling data to assist in the creation of new industrial process prediction models. The stable movement intervals of the new compressor and the old compressor are shown in table 1.

Table 1 centrifugal compressor A, B, C, D steady operation interval and corresponding One-Hot encoding

And comparing the prediction effect of the built model with the prediction effects of two groups of comparison experiment models, and further displaying the superiority of the method. The three sets of comparison methods were specifically as follows:

method 1: a small amount of new industrial process data is converted into old industrial process data through anti-migration learning, the old industrial process data are mixed with each group of old industrial process data, a plurality of SVM models are obtained through training, then a new compressor prediction model is built through a multi-model migration strategy, and finally model accuracy is tested by using new compressor test data. The ATL-BMA method was designated in the analysis of experimental results.

Method 2: and training a plurality of old compressor SVM models by using a plurality of sets of old compressor data, then establishing a new compressor prediction model by combining a few new compressor training data through a multi-model migration strategy, and finally testing model accuracy by using new compressor test data. The BMA method was recorded in the analysis of experimental results.

Method 3: and only a small amount of new compressor training data is used for establishing a new compressor SVM model, the new compressor SVM model is used as a new compressor prediction model, and finally, the new compressor test data is used for testing the model precision. The SVM method is marked in the analysis of experimental results.

Figure 3 shows the predicted values of the model on the compressor a test set for three methods. The graph shows that the predicted value of the model built by the ATL-BMA method is the highest in matching degree with the test set, which proves that the ATL-BMA method can effectively utilize the useful information of similar old industrial processes to help the establishment of a new industrial process model, and simultaneously, the ATL-BMA method can effectively utilize the information between the new and old industrial processes more than a simple multi-model migration method.

To further compare the accuracy of the three models, FIGS. 4 and 5 illustrate the RMSE and R of the predicted and actual values of the three models ² As can be seen from the figure, the method provided by the chapter can fully utilize a large amount of old industrial process data and a small amount of new industrial process data, effectively improve the prediction precision of the model and reduce the cost of establishing the model.

According to the analysis, the performance prediction model is built for the new industrial process by adopting the anti-migration learning method and the Bayesian model average theory, the existing similar old industrial process performance prediction model in the industry is fully utilized, old and new industrial process data are migrated, a plurality of old industrial process prediction models containing new industrial process information are built by using a support vector machine, and finally the old industrial process model is trained by utilizing the Bayesian model average theory, so that the modeling speed of the new industrial process is accelerated, the modeling cost is reduced, the negative migration effect caused by more old industrial process data than the new industrial process data in the old and new industrial process migration modeling is solved, and the prediction model meeting the precision requirement is obtained. Meanwhile, the method can effectively utilize the information between new and old industrial processes compared with a simple multi-model migration method. Closer to the actual output, a significant cost reduction for modeling industrial processes.

Claims

1. A method for modeling a nonlinear industrial process at low cost based on ATL-BMA, comprising the steps of:

step 1.1: selecting N groups of similar old industrial process modelsData, noted asModeling an ith old industrial process according to equation (1);

Wherein, l represents the modeling data amount of the new industrial process;

step 2: dividing new industrial process data and old industrial process data into two parts, namely a training data set and a testing data set in a new modeling process and an old modeling process respectively, and carrying out normalization processing on an initial data set of the new industrial process and the modeling data of the old industrial process respectively; wherein, for new industrial process data, it is divided into new industrial process training data setsAnd new industrial process test data set +.>And utilizeEquation (3) maps data to [0,1 ]]A section;

step 3.2: from the ith old industrial process data by generator GM samples collected in->Conversion to m new industrial process data, denoted X _o→n ＝F(X _o ) The method comprises the steps of carrying out a first treatment on the surface of the From the new industrial process data +.>M samples collected in->Conversion to m old industrial process data, denoted X _n→o ＝F(X _n )；

Step 3.5: repeating the steps 3.2 to 3.4n _critic Secondary times;

step 3.6: repeating the step 3.2;

step 3.8: calculating two generator losses by equation (10) and equation (11);

step 5.1: initializing parameters;