CN115952685A

CN115952685A - Sewage treatment process soft measurement modeling method based on integrated deep learning

Info

Publication number: CN115952685A
Application number: CN202310053332.5A
Authority: CN
Inventors: 熊金琳; 彭甜; 李正波; 陶孜菡; 张楚; 赵环宇; 伏咏妍; 王宇涵; 黄小龙; 花磊
Original assignee: Huaiyin Institute of Technology
Current assignee: Cao Liang
Priority date: 2023-02-02
Filing date: 2023-02-02
Publication date: 2023-04-11
Anticipated expiration: 2043-02-02
Also published as: CN115952685B

Abstract

The invention discloses a sewage treatment process soft measurement modeling method based on integrated deep learning. Firstly, acquiring sewage data as an auxiliary variable; selecting several collected variables by using KPCA characteristics and then using the selected variables as the input of a model; establishing a sewage soft measurement integrated model, wherein the integrated model has two layers, the first layer comprises three base learners of BiLSTM, LSSVM and XGboost, a 5-fold cross validation method is adopted for training, and the second layer adopts ELM as an element learner; and finally, carrying out error correction on the initial prediction result by adopting an extreme learning machine. In order to improve the performance of the model, an RSA algorithm is provided to optimize the model parameters; and according to the problems of the RSA algorithm in the aspects of convergence precision, easy falling into local optimization and the like, the Latin hypercube, the nonlinear factor, the golden sine and the somersault strategy are used for improving the RSA algorithm. Compared with the traditional soft measurement method, the method can integrate the advantages of each model, and has stronger generalization capability and higher prediction precision of the whole model.

Description

Sewage treatment process soft measurement modeling method based on integrated deep learning

Technical Field

The invention relates to the field of soft measurement modeling in an industrial sewage treatment process, in particular to a soft measurement modeling method for key water quality parameters in a sewage treatment process based on integrated deep learning.

Background

The wastewater treatment process is very complex, relates to a complex dynamic physical reaction process, a biological reaction process and a chemical reaction process, has strong nonlinearity, uncertainty, time-varying characteristic and wide hysteresis, and is difficult to establish an accurate model. In order to maintain a good environment of a sewage treatment system, ensure the stability, the running speed, the reliability test and the like of the control system, ensure the sewage discharge quality to reach the discharge standard, and carry out real-time inspection and monitoring on a plurality of important technical processes such as water meters, water quality parameters, environmental parameters and the like in the sewage treatment process. However, in practice, some quality variables in industrial processes can be difficult to measure in real time due to the lack of on-line measuring instruments and corresponding measuring sensors, or their need to operate in extremely harsh environments, which are expensive to purchase and maintain. The soft measurement technology can realize real-time monitoring and control of the dominant variable by establishing a mathematical model between the variable (auxiliary variable) which is easy to measure and the variable (dominant variable) which is difficult to directly measure in the process. The method has the advantages of convenient maintenance and low time delay, and is developed rapidly.

The deep learning has a more complex multilayer structure than the traditional model, has more information, better data extraction and stronger nonlinear feature expression capability, and can accurately map the hidden complex mapping relation under the industrial data. By utilizing the advantages of deep learning and combining soft measurement, the data characteristic information can be fully extracted, and the prediction precision of the model is improved. In the deep learning development, various deep learning models are continuously developed so far, and many new deep models are introduced, however, a single soft measurement model has many problems, such as local optimization and insufficient precision. In view of a large number of implicit data characteristics contained in data, various algorithms and network models are integrated, and the advantages of the algorithms and the network models are combined, so that the overall accuracy and the generalization capability of the models are improved, and the method becomes a new research direction in the soft measurement modeling of the complex industrial process.

Disclosure of Invention

The purpose of the invention is as follows: aiming at the problem that the water quality index biochemical oxygen demand in the sewage treatment process is difficult to realize on-line measurement, the method for soft measurement modeling in the sewage treatment process based on the integrated deep learning is provided.

The technical scheme is as follows: the invention discloses a soft measurement modeling method based on integrated deep learning in a sewage treatment process, which comprises the following steps of:

s1, acquiring sewage data and performing data pretreatment;

s2, performing feature selection on the processed data by using KPCA (kernel principal component analysis), and selecting a proper auxiliary variable so as to construct a soft measurement data sample set;

s3, the sewage data set processed in the S2 is used as the original input of a first-layer base learner in a Stacking integrated framework, wherein the first-layer base learner comprises a bidirectional long-short term memory network (BilsTM), a limiting gradient lifting XGboost and a Least Square Support Vector Machine (LSSVM), and the prediction result of the first-layer base learner is obtained by performing 5-fold cross validation on each base learner;

s4, the second layer adopts ELM as a meta-learner, and the result obtained from the first layer of the base learner is used as a training set of the second layer of the meta-learner to finish the training of the second layer of the meta-learner;

s5, optimizing model parameters of the base learner by adopting an improved reptile search algorithm, wherein the improved reptile search algorithm comprises the following steps: adopting Latin hypercube initialization, in an iterative process, adopting a nonlinear method to improve an evolution significance ES value, utilizing a gold sine and tumbling strategy to improve an individual optimization mode, and utilizing an optimized model to carry out soft measurement to obtain a prediction result of the biochemical oxygen demand;

and S6, correcting errors of the initial prediction result by adopting ELM to obtain a final prediction result.

Further, the sewage data in the step S1 includes suspended matter concentration SS, total nitrogen TN, ammonia nitrogen NH3-N, total phosphorus TP, chemical oxygen demand COD, and biochemical oxygen demand BOD at a historical time.

Further, in the step S2, feature extraction is performed on the data by using KPCA, and the specific steps are as follows:

s3.1 setting the sample data of the training set as X = (X) ₁ ,x ₂ ,...,x _m ) X is mapped by a mapping function phi (x) _i Mapping to a high-dimensional feature space;

s3.2, calculating a covariance matrix C of the feature space:

s3.3, calculating a characteristic equation of the covariance matrix:

λ _i ξ _i ＝Cξ _i (3)

wherein ,λ_i Is the eigenvalue, ξ, of the covariance matrix C _i Is corresponding to the characteristic value lambda _i The feature vector of (2);

s3.4 defines a kernel matrix K:

K＝φ(x _i )·φ(x _i ) ^T (4)

s3.5, calculating a characteristic equation of the kernel matrix K:

wherein ,

is the eigenvector of the kernel matrix K, alpha _i Is corresponding to the characteristic value->

The feature vector of (2);

s3.6 substituting the covariance matrix C and the kernel matrix K into the characteristic equation of the kernel matrix K, then the eigenvector xi of the covariance matrix C _i Can use a non-linear function phi (x) _i ) Is represented as follows:

wherein ,

is xi _i The corresponding ith coefficient;

s3.7 calculating the eigenvalues of the kernel matrix K

The characteristic values are arranged in descending order as->

S3.8 calculating the contribution rate eta of the characteristic values in sequence _i And the cumulative contribution P is as follows:

s3.9, selecting the characteristic that the cumulative contribution rate P is more than or equal to 85 percent as a main auxiliary variable input by the sewage soft measurement model.

Further, in step S3, the step of establishing the bidirectional long and short term memory network prediction model is as follows:

s4.1, using the determined auxiliary variable as an input vector x of the network;

s4.2 setting the forward hidden layer state to

The reverse hidden layer state is->

w is different weight matrix, then the output y of BilSTM is calculated as:

s4.3 mixing of y _t As a result of the prediction of the model.

Further, in step S3, the step of establishing the extreme gradient boost prediction model is as follows:

s5.1 setting the training data set as T = { (x) ₁ ,y ₁ ),(x ₂ ,y ₂ ),...,(x _n ,y _n ) A loss function of

The regularization term is Ω (f) _k ) Then the overall objective function can be written as:

where L (φ) is a representation in linear space, i is the ith sample, k is the kth tree,

is the ith sample x _i The predicted value of (2);

s5.2 fit the residual of the previous tree prediction using the prediction of each tree:

s5.3, the predicted result of the t-th tree is obtained in the previous step, the predicted result is equal to the predicted result of the previous t-1 trees in value, and the performance of the t-th tree is added, and for the t-th tree, the objective function is as follows:

s5.4 Taylor expansion is used to approximate the original target in equation (14), defining

Equation (14) can be:

s5.5, solving the optimal solution of the objective function as follows:

wherein

I _j ＝{i|q(x _i ) = j } indicates that a certain sample maps to a set of nodes;

s5.6, calculating a Gain value Gain, updating the maximum Gain _ max, updating the separation points, and finally obtaining the optimal separation points;

s5.7 the above process is repeated to recursively build the tree until a termination condition.

Further, in step S3, the step of establishing the least squares support vector machine prediction model is as follows:

s6.1, optimizing a target, and defining a loss function as:

with the constraint:

where ω is the weight, ξ _i Is an error variable, b is a deviation, c > 0 is a penalty coefficient;

s6.2 introduces Lagrange multiplier, and the formula (18) can be converted into:

wherein the Lagrange multiplier a _i ＞0(i＝1,2,...,N)；

S6.3 solving optimal conditions

S6.4, obtaining an optimal regression function by combining the formulas as follows:

wherein ,K(x_i ,y _j ) Is a kernel function, x _i Is the center of the kernel function, x is the input of the training sample, y _i Is the output of the training sample.

Further, the meta-learner prediction model calculation formula in step S4 is as follows:

where L is the number of hidden layer units, N is the number of training samples, β is the weight vector between the ith hidden layer and the output layer, w is the weight vector between the input and output, g is the activation function, b is the bias vector, and x is the input vector.

Further, the steps of the modified reptile search algorithm in step S5 are as follows:

s5.1, adopting Latin hypercube sampling initialization to replace random initialization of an RSA algorithm, and setting the upper and lower search boundaries, the population size and the iteration times of IRSA;

s5.2 surrounding phase, the crocodile individual starts to surround the prey, and the mathematical model is as follows:

η _ij ＝B _j (t)×P _ij (24)

wherein ,B_j (t) represents the location of the optimal solution; s _i,j (t + 1) represents the next update position; t is the current iteration number; t is _max Is the maximum number of iterations; eta _ij Representing a hunting operator; r _ij Is a reduction function to reduce the search space; alpha and beta are sensitive parameters and control the search precision; r is a radical of hydrogen ₁ ，r ₂ Are all [1,N]A random number within; r is ₃ Is [ -1,1]A random integer of (1); ES (t) is evolutionary; p _ij Representing a percentage difference between the best solution position and the current solution position; m(s) _i ) Represents the average position of the ith solution;

s5.3, the evolutionary meaning parameter of the formula (26) is improved, and the improved evolutionary meaning expression is as follows:

s5.4 hunting stage, the crocodile individual starts hunting, and the mathematical model is as follows:

s5.5, introducing a golden sine and tumbling strategy into the position updating formula (30), wherein the improved formula is as follows:

wherein ,γ₁ ，γ ₂ Are each [0,2 pi]And [0, π]A random number within; gamma ray ₃ ，γ ₄ Is [0,1]A random number within; f =2 is the somersault coefficient, defining the position relative to the prey; x is the number of ₁ ，x ₂ Is the golden sine coefficient, x ₁ and x₂ The calculation formula of (c) is as follows:

x ₁ ＝a*(1-γ)+b*σ (32)

x ₂ ＝a*γ+b*(1-σ) (33)

wherein a and b are golden section ratio search initial values,

the golden ratio.

Further, in the step S5, optimizing parameters of the model using the improved reptile search algorithm includes: the learning rate, the number of hidden layer nodes and the extreme gradient of the bidirectional long-short term memory network BilsTM improve the weight and the learning rate of XGboost, and the optimal penalty coefficient and the kernel function width value of the least square support vector machine LSSVM.

Further, in step S6, the error correction step using the ELM is as follows:

s6.1, subtracting the initial predicted value obtained by the integrated model from the original observed value to construct an error sequence;

s6.2, predicting an error sequence by using an ELM network;

and S6.3, linearly adding the initial prediction sequence and the error prediction sequence to obtain a final prediction result.

Has the advantages that:

the depth-based and integrated learning method provided by the invention gives consideration to the training principle difference of different algorithms, and gives full play to the advantages of each model in the prediction process. The stronger the learning ability of the base learner is, the smaller the degree of mutual association is, and the better the final prediction effect is. Aiming at the problem that the super parameters of the model are difficult to determine, a reptile search algorithm is introduced to optimize the model, and the optimal model parameters are selected. And an algorithm improvement is proposed to improve the optimizing capability of the algorithm, and the improvement is as follows: firstly, aiming at the problem that random initialization of an algorithm can not enable a population to be uniformly distributed in the whole optimizing space, latin hypercube initialization is introduced to ensure that an initial population uniformly covers the whole distribution space; secondly, in the iterative process, because the random decreasing strategy of the evolution significance ES value between-2 and 2 cannot completely explain the actual convergence optimization process, the nonlinear strategy is adopted for improvement, so that the algorithm can more effectively balance the global and local search capability, and the convergence accuracy of the algorithm is improved. And finally, an individual optimizing mode is improved by utilizing a golden sine and tumbling strategy, so that a common individual can exchange information with the optimal individual in each iteration, the position difference information between the common individual and the optimal individual is completely absorbed, and the algorithm searching performance and the searching accuracy are improved. Compared with the traditional single model prediction, the sewage treatment process soft measurement modeling method based on depth and integrated learning has higher precision and generalization capability.

Drawings

FIG. 1 is a schematic diagram of a multi-model training framework of a sewage treatment process soft measurement modeling method based on integrated deep learning provided by the invention;

FIG. 2 is a flow chart of algorithm optimization model parameters in the integrated deep learning-based sewage treatment process soft measurement modeling method provided by the invention;

FIG. 3 is a flow chart of the integrated deep learning-based sewage treatment process soft measurement modeling method provided by the invention.

Detailed Description

Embodiments of the present invention will be further described with reference to the accompanying drawings.

As shown in FIG. 1, the invention provides a sewage treatment process soft measurement modeling method based on integrated deep learning, which comprises the following steps:

s1, acquiring sewage data from an international standard BSM 1 simulation platform and performing data pretreatment.

S1.1, sewage data including ammonia nitrogen NH3-N, suspended matter concentration SS, chemical oxygen demand COD, total nitrogen TN, total phosphorus TP and biochemical oxygen demand BOD at historical time are obtained from an international standard simulation platform.

S1.2, preprocessing the acquired sewage data into normalizing the data, wherein the formula is as follows:

in the formula ,S^* Representing the data after normalization, S representing the raw data, S _max and S_min Respectively representing the maximum and minimum values in the raw data.

And S2, performing feature selection on the processed data by using KPCA (kernel principal component analysis), and selecting the most appropriate auxiliary variable, thereby constructing a soft measurement data sample set.

S2.1 setting the sample data of the training set as X = (X) ₁ ,x ₂ ,...,x _m ) X is mapped by a mapping function phi (x) _i Mapping to a high-dimensional feature space.

S2.2, calculating a covariance matrix C of the feature space:

s2.3, calculating a characteristic equation of the covariance matrix:

λ _i ξ _i ＝Cξ _i (3)

in the formula ,λ_i Is the eigenvalue, ξ, of the covariance matrix C _i Is corresponding to the characteristic value lambda _i The feature vector of (2).

S2.4 defines a kernel matrix K:

K＝φ(x _i )·φ(x _i ) ^T (4)

s2.5, calculating a characteristic equation of the kernel matrix K:

in the formula ,

is the eigenvector of the kernel matrix K, alpha _i Is corresponding to a characteristic value>

The feature vector of (2).

S2.6, substituting the covariance matrix C and the kernel matrix K into the characteristic equation of the kernel matrix K, so that the characteristic vector xi of the covariance matrix C _i Can use a non-linear function phi (x) _i ) Watch (A)Shown below:

in the formula ,

is xi _i The corresponding ith coefficient.

S2.7 calculating the eigenvalues of the kernel matrix K

The characteristic values are arranged in descending order as->

S2.8 calculating contribution rate eta of these characteristic values in turn _i And the cumulative contribution P is as follows:

s2.9, selecting the characteristic that the cumulative contribution rate P is more than or equal to 85% as a main auxiliary variable input by the sewage soft measurement model.

And S3, taking the sewage data set processed in the S2 as the original input of a first-layer base learning device in a Stacking integrated framework, wherein the first-layer base learning device comprises a bidirectional long-short term memory network (BiLSTM), a limiting gradient lifting XGboost and a Least Square Support Vector Machine (LSSVM), and the prediction result of the first-layer base learning device is obtained by performing 5-fold cross validation on each base learning device.

S3.1, establishing a bidirectional long-short term memory network prediction model.

S3.1.1 takes the determined auxiliary variables as the input vector x to the network.

S3.1.2 forward hiddenContaining a layer state of

The reverse hidden layer state is->

w is different weight matrix, the output y of BilSTM is calculated as follows: />

S3.1.3 coupling y _t As a result of the prediction of the model.

S3.2, establishing a limit gradient lifting prediction model.

S3.2.1 the training data set is T = { (x) ₁ ,y ₁ ),(x ₂ ,y ₂ ),...,(x _n ,y _n ) A loss function of

is the ith sample x _i The predicted value of (2).

S3.2.2 fitting the residual error of the predicted result of the previous tree by using the predicted result of each tree, so that the overall tree model effect is better and better;

s3.2.3 gets the predicted result of the t-th tree from the previous step, which is numerically equal to the predicted result of the previous t-1 trees, plus the performance of the t-th tree. For the t tree, the objective function is:

s3.2.4 approximate equation (14) to the original target with Taylor expansion, define

Equation (14) can be:

s3.2.5 the optimal solution for the objective function is:

in the formula

I _j ＝{i|q(x _i ) = j } indicates that a certain sample maps to a set of nodes. />

S3.2.6 calculates a Gain value Gain, updates the maximum Gain _ max, and updates the division point to finally obtain the optimal division point.

S3.2.7 repeats the above process recursively building the tree until a termination condition.

And S3.3, establishing a least square support vector machine prediction model.

S3.3.1 the optimization objective defines the loss function as:

with the constraint:

where ω is the weight, ξ _i Is the error variable, b is the deviation, and c > 0 is the penalty factor.

S3.3.2 introduces the lagrange multiplier, equation (18) can be converted to:

in the formula, lagrange multiplier a _i ＞0(i＝1,2,...,N)。

S3.3.3 solving for optimal conditions

S3.3.4 the optimal regression function is obtained by combining the above equations as follows:

in the formula ,K(x_i ,y _j ) Is a kernel function, x _i Is the center of the kernel function, x is the input of the training sample, y _i Is the output of the training sample.

S4, using the result obtained from the first layer of base learner as a training set of a second layer of meta-learner to complete the training of the second layer of meta-learner, wherein the meta-learner prediction model is ELM, and the calculation formula is as follows:

And S5, optimizing the model parameters of the base learner by adopting an improved reptile search algorithm, and performing soft measurement by using the optimized model to obtain a prediction result of the biochemical oxygen demand.

S5.1, random initialization of an RSA algorithm is replaced by Latin hypercube sampling initialization, and the upper and lower search boundaries, the population size and the iteration number of the IRSA are set.

The Latin hypercube sampling initialization method comprises the following steps:

s5.1.1 determining the population size A and the dimension D.

S5.1.2 sets the variable a to the interval [ low, up ], where up and low are the upper and lower bounds of the variable a, respectively.

S5.1.3 divides the interval of variable a into N equal sub-intervals.

S5.1.4 randomly selects a point from each subinterval for each dimension.

S5.1.5 combines each of the points selected to form an initial population.

η _ij ＝B _j (t)×P _ij (24)

in the formula ,B_j (t) represents the location of the optimal solution; s. the _i,j (t + 1) represents the next update position; t is the current iteration number; t is _max Is the maximum number of iterations; eta _ij Representing a hunting operator; r _ij Is a reduction function to reduce the search space; alpha and beta are sensitive parameters and control the search precision; r is ₁ ，r ₂ Are all [1,N]A random number within; r is ₃ Is [ -1,1]A random integer of (1); ES (t) is evolutionary; p _ij Representing a percentage difference between the best solution position and the current solution position; m(s) _i ) The average position of the ith solution is indicated.

in the formula ,γ₁ ，γ ₂ are each [0,2 pi]And [0, pi ]]A random number within; gamma ray ₃ ，γ ₄ Is [0,1]A random number within; f =2 is the somersault coefficient, defining the position relative to the prey; x is the number of ₁ ，x ₂ Is the golden sine coefficient, x ₁ and x₂ The calculation formula of (a) is as follows:

x ₁ ＝a*(1-γ)+b*σ (32)

x ₂ ＝a*γ+b*(1-σ) (33)

wherein a and b are golden section ratio search initial values,

the golden ratio.

S5.6, optimizing the learning rate and the number of nodes of a hidden layer of the model BilsTM, the weight and the learning rate of XGboost, and the optimal penalty coefficient and the kernel function width value of the LSSVM by using an improved reptile search algorithm.

And S6, carrying out error correction on the initial prediction result by adopting ELM to obtain a final prediction result.

S6.1, subtracting the initial predicted value obtained by the integrated model from the original observed value, and constructing an error sequence.

S6.2 predict error sequences using the ELM network.

And S7, jointly constructing a sewage treatment soft measurement platform based on QT, python and MATLAB, wherein the sewage treatment soft measurement platform comprises a user login interface, a sewage data monitoring and online prediction module, and the visual target of a soft measurement system is realized.

The invention also realizes an intelligent BOD soft measurement system which comprises a data acquisition module, a data processing module, a model training module, a parameter optimization module, an error correction module and an online monitoring module.

A data acquisition module: is used for acquiring sewage data including ammonia nitrogen NH3-N, suspended matter concentration SS, chemical oxygen demand COD, total nitrogen TN, total phosphorus TP and biochemical oxygen demand BOD at historical time.

A data processing module: and (3) performing feature extraction on the collected sewage data by using a KPCA (kernel principal component analysis) method, selecting features with high correlation, and screening out auxiliary variables most suitable for model input.

A model training module: an integration model based on a Stacking method is established, and the method performs Stacking integration on the BilSTM, the XGboost and the LSSVM, so that the overfitting is relieved, and the model effect is improved.

A parameter optimization module: and optimizing parameters of the model by using an IRSA algorithm, wherein the parameters comprise the learning rate and hidden layer node number of the BilsTM, the weight and learning rate of the XGboost, and the optimal penalty coefficient and kernel function width value of the LSSVM.

An error correction module: error sequence based correction of the primary prediction results using ELM.

An online monitoring module: the system comprises a user login interface, a sewage data monitoring and online prediction module, and realizes the visual target of a soft measurement system.

The present invention is not limited to the above embodiments, and any simple modification, equivalent change and modification made to the above embodiments according to the technical spirit of the present invention are within the scope of the technical solution of the present invention.

Claims

1. A sewage treatment process soft measurement modeling method based on integrated deep learning is characterized by comprising the following steps:

s1, acquiring sewage data and performing data pretreatment;

s4, the second layer adopts ELM as a meta-learner, and the result obtained from the first layer of the base learner is used as a training set of the second layer of the meta-learner to finish training of the second layer of the meta-learner;

2. The integrated deep learning-based sewage treatment process soft measurement modeling method according to claim 1, wherein the sewage data in step S1 comprises suspended matter concentration SS, total nitrogen TN, ammonia nitrogen NH3-N, total phosphorus TP, chemical oxygen demand COD, and biochemical oxygen demand BOD at historical time.

3. The integrated deep learning-based sewage treatment process soft measurement modeling method according to claim 1, wherein in the step S2, KPCA is used for feature extraction of data, and the specific steps are as follows:

s3.2, calculating a covariance matrix C of the feature space:

s3.3, calculating a characteristic equation of the covariance matrix:

λ _i ξ _i ＝Cξ _i (3)

s3.4 defines the kernel matrix K:

K＝φ(x _i )·φ(x _i ) ^T (4)

s3.5, calculating a characteristic equation of the kernel matrix K:

wherein ,

The feature vector of (2);

s3.6, substituting the covariance matrix C and the kernel matrix K into the characteristic equation of the kernel matrix K, so that the characteristic vector xi of the covariance matrix C _i Can use a non-linear function phi (x) _i ) Is represented as follows:

/>

wherein ,

is xi _i The corresponding ith coefficient;

s3.7 calculating the eigenvalues of the kernel matrix K

Characteristic values arranged in descending order to->

S3.8 calculating contribution rate eta of these characteristic values in sequence _i And the cumulative contribution P is as follows:

4. The integrated deep learning-based soft measurement modeling method for sewage treatment process according to claim 1, wherein in the step S3, the step of establishing the bidirectional long-short term memory network prediction model is as follows:

s4.2 setting the state of the hidden layer to be

The reverse hidden layer state is->

w is different weight matrix, then the output y of BilSTM is calculated as:

s4.3 reaction of y _t As a mouldThe predicted result of type.

5. The integrated deep learning-based soft measurement modeling method for the sewage treatment process according to claim 1, wherein in the step S3, the step of establishing the extreme gradient boost prediction model is as follows:

is the ith sample x _i The predicted value of (2);

……

Equation (14) can be:

s5.5, solving the optimal solution of the objective function as follows:

wherein

6. The integrated deep learning-based sewage treatment process soft measurement modeling method according to claim 1, wherein in the step S3, the step of establishing a least squares support vector machine prediction model is as follows:

s6.1, optimizing a target, and defining a loss function as:

with the constraint:

wherein the Lagrange multiplier a _i ＞0(i＝1,2,...,N)；

S6.3 solving optimal conditions

7. The integrated deep learning-based sewage treatment process soft measurement modeling method according to claim 1, wherein the meta-learner prediction model calculation formula in the step S4 is as follows:

8. The integrated deep learning-based soft measurement modeling method for sewage treatment process according to claim 1, wherein the modified reptile search algorithm in step S5 comprises the following steps:

η _ij ＝B _j (t)×P _ij (24)

wherein ,B_j (t) represents the location of the optimal solution; s _i,j (t + 1) represents the next update position; t is the current iteration number; t is _max Is the maximum number of iterations; eta _ij Representing a hunting operator; r _ij Is a reduction function to reduce the search space; alpha and beta are sensitive parameters and control the search precision; r is ₁ ，r ₂ Are all [1,N]A random number within; r is ₃ Is [ -1,1]A random integer of (1); ES (t) is evolutionary; p _ij Representing a percentage difference between the best solution position and the current solution position; m(s) _i ) Represents the average position of the ith solution;

s5.3, the evolutionary meaning parameters of the formula (26) are improved, and the improved evolutionary meaning expression is as follows:

s5.5, introducing a gold sine and tendon-turning strategy into the position updating formula (30), wherein the improved formula is as follows:

wherein ,γ₁ ，γ ₂ Are respectively [0,2 pi ]]And [0, pi ]]A random number within; gamma ray ₃ ，γ ₄ Is [0,1]A random number within; f =2 is the overturning coefficientDefining a position relative to a prey; x is the number of ₁ ，x ₂ Is the golden sine coefficient, x ₁ and x₂ The calculation formula of (a) is as follows:

x ₁ ＝a*(1-γ)+b*σ (32)

x ₂ ＝a*γ+b*(1-σ) (33)

wherein a and b are golden section ratio search initial values,

the golden ratio.

9. The integrated deep learning-based soft measurement modeling method for sewage treatment process according to claim 8, wherein in the step S5, optimizing the parameters of the model by using the improved reptile search algorithm comprises: the learning rate, the number of hidden layer nodes and the extreme gradient of the bidirectional long-short term memory network BilsTM improve the weight and the learning rate of XGboost, and the optimal penalty coefficient and the kernel function width value of the least square support vector machine LSSVM.

10. The integrated deep learning-based sewage treatment process soft measurement modeling method according to any one of claims 1 to 9, wherein in the step S6, the error correction step using the ELM is as follows:

s6.2, predicting an error sequence by using an ELM network;

s6.3, the initial prediction sequence and the error prediction sequence are linearly added to obtain a final prediction result.