CN114334024A

CN114334024A - Fermentation process soft measurement modeling method based on rapid component transfer learning

Info

Publication number: CN114334024A
Application number: CN202111633686.4A
Authority: CN
Inventors: 刘毅; 戴云; 朱佳良
Original assignee: Zhejiang University of Technology ZJUT
Current assignee: Zhejiang University of Technology ZJUT
Priority date: 2021-12-28
Filing date: 2021-12-28
Publication date: 2022-04-12

Abstract

The invention discloses a fermentation process soft measurement modeling method based on rapid component transfer learning, which comprises the following steps: 1) acquiring penicillin data; 2) preprocessing penicillin data and dividing a data set; 3) adapting the edge distribution of the source domain and the target domain; 4) establishing a soft measurement model based on a regularization extreme learning machine; 5) and (6) evaluating the performance of the model. The invention reduces the characteristic distribution distance of the source domain and the target domain based on the migration component analysis method, so that the characteristic distribution of the two groups of data is similar. And then establishing a soft measurement model on the mapped source domain data by adopting a regularization extreme learning machine method, and predicting the concentration of the penicillin in the target domain.

Description

Fermentation process soft measurement modeling method based on rapid component transfer learning

Technical Field

The invention relates to the technical field of fermentation process soft measurement modeling, in particular to a fermentation process soft measurement modeling method based on rapid component transfer learning.

Background

In the chemical production process, stable operation of production equipment and product quality guarantee are greatly related to specific key variables. At present, a data-driven soft measurement modeling method is mostly adopted for a key index prediction task in an industrial process. The soft measurement modeling is dependent on methods such as statistical analysis or machine learning, and potential information in data is mined. The dependence on the internal mechanism or mathematical model of the industrial process is reduced, and the requirement on the prior knowledge of the process is greatly reduced. In a plurality of soft measurement modeling methods, an extreme learning machine is widely applied as a typical neural network algorithm by virtue of the advantages of quick training, simple structure, strong generalization capability and the like.

The penicillin fermentation process is a typical batch process, namely, the penicillin fermentation process has the characteristic of a multi-working-condition process. For the actual penicillin fermentation process, a soft measurement technology is mostly adopted to effectively monitor key variables such as the concentration of penicillin. The establishment of the soft measurement model is based on the auxiliary variables and the key variables, and in the actual multi-working-condition process, the collection of sufficient key variables, namely label values, for soft measurement modeling is time-consuming and labor-consuming, and even the situation that some working conditions are not label occurs. Meanwhile, in the traditional soft measurement modeling method, due to the fact that nonlinear relations among different working conditions are different and data distribution is different, a model established in a specific working condition cannot be directly used in other working conditions. Therefore, how to model the behavior of the unlabeled data is a question to be discussed.

Disclosure of Invention

In order to solve the problem of difficult modeling of unlabeled data under a certain working condition in the penicillin process, the invention provides a fermentation process soft measurement modeling method based on rapid component transfer learning. The distance of edge probability distribution between two working conditions is reduced by a Transfer Component Analysis (TCA) method, so that the characteristic distributions of the two working conditions are similar, and then a Regularized Extreme Learning Machine (RELM) method is adopted to quickly establish a model in a data characteristic alignment space to predict the penicillin concentration of the non-label data working condition.

The technical scheme of the invention is as follows:

a fermentation process soft measurement modeling method based on rapid component transfer learning comprises the following steps:

1) acquisition and pretreatment of penicillin data:

the penicillin data adopted by the invention is obtained by analog simulation of a Benchmark simulation platform (Pensim). In order to accelerate the convergence speed of the model and reduce the influence among different dimension data, the original data is normalized. In the modeling stage of the soft measurement model in the multi-working-condition process, one working condition is a source domain working condition with auxiliary variables, namely characteristic variables and key variables, and the other working condition is a target domain working condition with only the characteristic variables and no key variables.

2) Establishing a model of rapid component transfer learning:

based on the TCA method, the marginal probability distribution distance between the source domain and the target domain is reduced, thereby completing the feature migration. And in the space of the source domain and the target domain which are aligned through the TCA, establishing a regularization limit learning machine model based on the mapped source domain data, and quickly predicting the penicillin concentration of the working condition of the target domain.

3) And (3) evaluating model performance:

in order to evaluate the method proposed by the present invention more objectively, the Root Mean Square Error (RMSE) and Mean Absolute Error (MAE) of the evaluation index are introduced.

Further, the process of the step 1) is as follows:

step 1.1) obtaining penicillin data:

the penicillin data adopted by the invention is obtained by analog simulation of a Benchmark simulation platform (Pensim). The penicillin concentration is a key variable to be predicted in the process, and six related variables are used as auxiliary variables, namely carbon dioxide concentration, aeration rate, substrate feeding temperature, stirrer power, culture dish volume and pH value.

Step 1.2) data normalization treatment:

in order to accelerate the convergence speed of the model, reduce the training time of the model and simultaneously reduce the influence among different dimensional data, the data is normalized, and the formula is as follows:

in the formula, x is data after normalization processing; a is the collected raw data; a is_minIs the minimum value in the original data; a is_maxIs the maximum value in the raw data.

Step 1.3) determining working condition data of a source domain and a target domain:

and randomly selecting one working condition from the different working condition data sets after the normalization processing as a source domain working condition data set, and randomly selecting one working condition from the rest working conditions as a target domain working condition data set. The source domain data has both characteristic variables and key variables, denoted as { X }_S,Y_SThe target domain data only has characteristic variables and is marked as { X }_T}。

Further, the process of step 2) is as follows:

step 2.1) feature migration of the source domain and the target domain:

it is assumed that there is a feature map ψ such that the distribution P (ψ (X) of the source domain and the destination domain after mapping_S))＝P(ψ(X_T)). The distance between the source domain and the target domain after mapping is measured by Maximum Mean Difference (MMD), which is expressed as:

where m is the number of source domain samples, n is the number of target domain samples, the above equation represents the distance of two distributions measured in the regenerated kernel hilbert space, and H represents the regenerated kernel hilbert space.

Transforming the MMD distance solving process into a kernel function learning process by using a kernel function, D (X)_S,X_T) Conversion to the following form:

tr(KL)-λtr(K)

wherein tr (·) is the inverse of the matrix, λ is the introduced parameter, K is the kernel matrix obtained by kernel function mapping, L is the matrix introduced by the MMD algorithm, and the calculation method of each element is as follows:

wherein D is_SRepresenting the source domain, D_TRepresenting the target domain, x_iAnd x_jRepresenting the characteristic variables of any two samples in the source domain or the target domain. Then, will findThe problem of solving tr (KL) -lambda tr (K) is converted into the following optimization problem:

s.t.W^TKMKW＝I_m

where M is the central matrix, μ is the introduced parameter, I_mIs an m-dimensional identity matrix. W is a matrix of lower dimension than K, whose solution is (KLK + muI)^-1KMK. Using TCA algorithm to convert source domain characteristic variable X_SAnd target domain feature variable X_TMapping to a new space to obtain a new source domain data characteristic X_SnewAnd target domain data characteristics X_Tnew. The number of rows in matrix X is the number of samples and the columns are the total number of features.

Step 2.2) establishing a soft measurement model based on RELM:

after the above TCA process operation, X_SnewAnd X_TnewThe distribution in space is similar. With { X_Snew,Y_SEstablishing a regularization extreme learning machine soft measurement model, wherein an optimization objective function of the regularization extreme learning machine soft measurement model is as follows:

s.t.j h(x_i)β＝y_i-ξ_i,i＝1,2,...,m

wherein x is_SiIs a characteristic variable of the source domain sample, y_SiFor the key variable of the source domain sample reality, h (-) is a function for solving the hidden layer matrix, beta is the output weight, xi_iThe prediction error of the ith sample is shown, and gamma is the regularization coefficient of the model. Substituting the constraint term into a target loss function formula to convert into:

obtaining a model by Moore-Penrose (Moore-Penrose generalized inverse) methodOutput matrix of

H is the hidden layer matrix of the RELM model. Thus, for unlabeled regime X_TnewThe label value obtained by the regularization limit learning machine is:

further, the process of the step 3) is as follows:

step 3.1) root mean square error RMSE evaluation:

the root mean square error is defined as follows:

in the formula: r is the total amount of the test set samples; y is_tRepresenting input samples x_tThe true tag value of;

representing input samples x_tThe predicted value of (2). The smaller the RMSE is, the better the prediction performance of the model is;

step 3.2) evaluation of average absolute error MAE value:

the mean absolute error can be expressed as:

the smaller the MAE, the smaller the prediction error of the model, and the more excellent the method can be demonstrated.

The invention has the following beneficial effects: the invention reduces the characteristic distribution distance of the source domain and the target domain based on the migration component analysis method, so that the two groups of data are distributed similarly. And then, a soft measurement model is quickly established on the mapped source domain penicillin data by adopting a regularization extreme learning machine method, and the concentration of the target domain penicillin is predicted. The method solves the problem that the soft measurement model is difficult to establish when the penicillin data has no label data under certain working condition.

Drawings

FIGS. 1(a), (b) results of predicting penicillin concentrations in condition 1 and condition 2, respectively, for the RELM model established for condition 1;

FIGS. 2(a) and (b) are the results of predicting penicillin concentrations under working

conditions

1 and 2 by using the RELM model established under working condition 1 after being treated by the TCA method, respectively;

FIGS. 3(a), (b) results of predicting penicillin concentrations in condition 1 and condition 3, respectively, for the RELM model established for condition 1;

FIGS. 4(a) and (b) are the results of predicting penicillin concentrations under working

conditions

1 and 3 by using the RELM model established under working condition 1 after being treated by the TCA method, respectively;

FIGS. 5(a), (b) results of predicting penicillin concentrations for working condition 2 and working condition 3, respectively, for the RELM model established for working condition 2;

FIGS. 6(a) and (b) are the results of predicting penicillin concentrations under working

conditions

2 and 3 by the RELM model established under working condition 2 after being processed by the TCA method, respectively;

fig. 7(a), (b), and (c) are data distribution scatter diagrams before and after the TCA method is adopted when the working condition 1 is migrated to the working condition 2, the working condition 1 is migrated to the working condition 3, and the working condition 2 is migrated to the working condition 3, respectively;

FIG. 8 is a flow chart of the present invention.

Detailed Description

The invention is further described below with reference to the accompanying drawings.

Referring to fig. 1 to 8, a fermentation process soft measurement modeling method based on rapid component transfer learning includes the following specific steps:

(1) acquisition and pretreatment of penicillin data:

step 1.1: obtaining penicillin data

Step 1.2: data normalization processing

Step 1.3: determining source domain and target domain condition data

(2) Establishing a model of rapid component transfer learning:

step 2.1: feature migration of source domain and target domain

Transforming the solving process of the MMD distance into a kernel function by adopting a kernel functionLearning Process of numbers, D (X)_S,X_T) Conversion to the following form:

tr(KL)-λtr(K)

wherein D is_SRepresenting the source domain, D_TRepresenting the target domain, x_iAnd x_jRepresenting the characteristic variables of any two samples in the source domain or the target domain. Then, the problem of solving tr (KL) - λ tr (K) is converted into the following optimization problem:

s.t.W^TKMKW＝I_m

Step 2.2: soft measurement model establishment based on RELM

s.t.j h(x_i)β＝y_i-ξ_i,i＝1,2,...,m

the Moore-Penrose generalized inverse method is adopted to obtain an output matrix of the model as

(3) and (3) evaluating model performance:

step 3.1: root Mean Square Error (RMSE) evaluation

The root mean square error is defined as follows:

step 3.2: mean absolute error MAE value evaluation

The mean absolute error can be expressed as:

(4) Prediction of penicillin data:

penicillin data of 3 working conditions are obtained, namely working condition 1, working condition 2 and working condition 3, and each working condition has 200 samples. The effectiveness of the method is verified through 3 pairs of working condition migration, namely the working condition 1 is migrated to the working condition 2, the working condition 1 is migrated to the working condition 3, and the working condition 2 is migrated to the working condition 3. FIGS. 1(a) and (b) show the results of predicting penicillin concentrations in

conditions

1 and 2, respectively, using the RELM method alone to model condition 1. Aligning the characteristic distribution of the working condition 1 and the working condition 2 by a TCA method, then establishing an RELM model based on the mapped working condition 1 data, and respectively obtaining penicillin concentration prediction results of the working condition 1 and the working condition 2 in the graphs (a) and (b) of the graphs. As can be seen from the figure, the penicillin concentration under the working condition 2 predicted by the TCA and RELM methods is better fitted with the original data value, and meanwhile, the accuracy of the penicillin concentration prediction result under the working condition 1 is also ensured. Similarly, fig. 3(a) and (b) are the results of predicting penicillin concentrations in condition 1 and condition 3 by means of the RELM model established in condition 1, respectively, and fig. 4(a) and (b) are the results of predicting penicillin concentrations in condition 1 and condition 3 by means of the RELM model established in condition 1 after the TCA method, respectively. Fig. 5(a) and (b) are the results of prediction of penicillin concentrations in

conditions

2 and 3 by means of the RELM model established in condition 2, respectively, and fig. 6(a) and (b) are the results of prediction of penicillin concentrations in

conditions

2 and 3 by means of the RELM model established in condition 2 after the TCA method. At the same time, the RMSE and MAE values of the target domain prediction results for each set of operating conditions are recorded in table 1. From table 1, it can be seen that the TCA + rel method can significantly reduce the RMSE and MAE values of target domain prediction, further verifying the effectiveness of the TCA + rel method in the prediction of unlabeled penicillin data concentration.

Fig. 7(a), (b), and (c) are data distribution scatter diagrams before and after the working condition 1 is migrated to the working condition 2, the working condition 1 is migrated to the working condition 3, and the working condition 2 is migrated to the working condition 3, respectively, and it can be seen from the diagrams that the data distribution between the original two working conditions is far, and the TCA method draws the distance between the two working conditions, which explains well why TCA + RELM can improve the accuracy of the target domain penicillin concentration prediction.

TABLE 1

The method is based on the knowledge of transfer learning, and solves the problem of difficult modeling of unlabeled data under a certain working condition in the penicillin process. And reducing the distance between the characteristic distributions of the source domain and the target domain by a TCA method, then establishing a RELM model in a space with similar characteristic distributions, and predicting the concentration of the penicillin in the working condition of the target domain only with characteristic data. The method has high accuracy, universality and universality.

The embodiments described in this specification are merely illustrative of implementations of the inventive concept and the scope of the present invention should not be considered limited to the specific forms set forth in the embodiments but rather by the equivalents thereof as may occur to those skilled in the art upon consideration of the present inventive concept.

Claims

1. A fermentation process soft measurement modeling method based on rapid component transfer learning is characterized by comprising the following steps:

1) acquisition and pretreatment of penicillin data:

penicillin data is obtained through Pensim simulation of a Benchmark simulation platform, and normalization processing is carried out on original data in order to accelerate the convergence speed of a model and reduce the influence among data of different dimensions; in the modeling stage of the soft measurement model in the multi-working-condition process, one working condition is a source domain working condition with auxiliary variables, namely characteristic variables and key variables, and the other working condition is a target domain working condition with only the characteristic variables and no key variables;

2) establishing a model of rapid component transfer learning:

based on a migration component analysis TCA method, reducing the marginal probability distribution distance between a source domain and a target domain, thereby completing feature migration; in the space of the source domain and the target domain which are aligned through the TCA, establishing a regularization limit learning machine model based on the mapped source domain data, and quickly predicting the penicillin concentration of the working condition of the target domain;

3) and (3) evaluating model performance:

and introducing an evaluation index root mean square error RMSE and an average absolute error MAE.

2. The fermentation process soft measurement modeling method based on rapid component transfer learning according to claim 1, characterized in that the process of step 1) is as follows:

step 1.1) obtaining penicillin data:

the penicillin concentration is a key variable to be predicted in the process, and six correlation variables are used as auxiliary variables, namely carbon dioxide concentration, aeration rate, substrate feeding temperature, stirrer power, culture dish volume and pH value;

step 1.2) data normalization treatment:

in the formula, x is data after normalization processing; a is the collected raw data; a is_minIs the minimum value in the original data; a is_maxIs the maximum value in the original data;

randomly selecting one working condition from different working condition data sets after normalization processing as a source domain working condition data set, and randomly selecting one working condition from the rest working conditions as a target domain working condition data set; the source domain data has both characteristic variables and key variables, denoted as { X }_S,Y_SThe target domain data is uniqueSign variable, denoted as { X_T}。

3. The fermentation process soft measurement modeling method based on rapid component transfer learning according to claim 1, characterized in that the process of step 2) is:

step 2.1) feature migration of the source domain and the target domain:

it is assumed that there is a feature map ψ such that the distribution P (ψ (X) of the source domain and the destination domain after mapping_S))＝P(ψ(X_T) ); and measuring the distance between the source domain and the target domain after mapping by adopting the MMD with the maximum mean difference, wherein the distance is expressed as follows:

wherein m is the number of source domain samples, n is the number of target domain samples, the above formula represents the distance of two distributions measured in the regenerated kernel hilbert space, and H represents the regenerated kernel hilbert space;

tr(KL)-λtr(K)

wherein D is_SRepresenting the source domain, D_TRepresenting the target domain, x_iAnd x_jA feature variable representing any two samples in the source domain or the target domain; then, the problem of solving tr (KL) - λ tr (K) is converted into the following optimization problem:

s.t.W^TKMKW＝I_m

where M is the central matrix, μ is the introduced parameter, I_mIs an m-dimensional identity matrix, W is a matrix of lower dimension than K, the solution of which is (KLK + muI)^-1KMK, the first p eigenvalues; using TCA algorithm to convert source domain characteristic variable X_SAnd target domain feature variable X_TMapping to a new space to obtain a new source domain data characteristic X_SnewAnd target domain data characteristics X_TnewThe row number of the matrix X is the sample number, and the columns are the total characteristic number;

step 2.2) establishing a soft measurement model based on a regularization extreme learning machine RELM:

after the above TCA process operation, X_SnewAnd X_TnewThe distribution in space is similar; with { X_Snew,Y_SEstablishment of regularized extreme learning machine soft measurement model, Y_SThe key variable of the source domain is represented, and the optimization objective function of the key variable is as follows:

s.t.j h(x_Si)β＝y_Si-ξ_i,i＝1,2,...,m

wherein x is_SiIs a characteristic variable of the source domain sample, y_SiFor the key variable of the source domain sample reality, h (-) is a function for solving the hidden layer matrix, beta is the output weight, xi_iRepresenting the prediction error of the ith sample, wherein gamma is a regularization coefficient of the model; substituting the constraint term into a target loss function formula to convert into:

the output matrix of the model obtained by adopting a Muel-Penrose generalized inverse Moore-Penrose method is

H is a hidden layer matrix of the RELM model; thus, for unlabeled regime X_TnewThe label value obtained by the regularization limit learning machine is:

4. the fermentation process soft measurement modeling method based on rapid component transfer learning according to claim 1, characterized in that the process of step 3) is:

step 3.1) root mean square error RMSE evaluation:

the root mean square error is defined as follows:

representing input samples x_tThe predicted value of (2); the smaller the RMSE is, the better the prediction performance of the model is;

step 3.2) evaluation of average absolute error MAE value:

the mean absolute error can be expressed as: