CN114611398A

CN114611398A - Brain-like modular neural network-based soft measurement method for nitrogen oxides in urban solid waste incineration process

Info

Publication number: CN114611398A
Application number: CN202210266639.9A
Authority: CN
Inventors: 蒙西; 王岩; 乔俊飞
Original assignee: Beijing University of Technology
Current assignee: Beijing University of Technology
Priority date: 2022-03-17
Filing date: 2022-03-17
Publication date: 2022-06-10
Anticipated expiration: 2042-03-17
Also published as: CN114611398B

Abstract

The invention relates to a brain-like modular neural network-based soft measurement method for nitrogen oxides in urban solid waste incineration process, which realizes NO soft measurement_XThe method for accurately acquiring the concentration in real time comprises the following steps: firstly, acquiring data; preprocessing the acquired data, and determining an input variable and an output variable of the model; then, establishing a soft measurement model by adopting a brain-like modular neural network; and finally, the test data is used as the input of the model, and the validity of the model is verified. The invention effectively realizes NO_XThe real-time accurate detection of the concentration has important theoretical significance and application value.

Description

Brain-like modular neural network-based soft measurement method for nitrogen oxides in urban solid waste incineration process

Technical Field

The invention relates to Nitrogen Oxide (NO) in the process of burning urban solid wastes_X) A soft measurement method; establishes NO based on Brain-like Modular Neural Network (BIMNN)_XSoft measurement model, realizes to NO_XAnd (4) accurately acquiring the concentration in real time. Not only belongs to the field of urban solid waste treatment, but also belongs to the field of intelligent modeling.

Background

With the rapid development of Chinese economy and the continuous acceleration of urbanization process, the urban solid waste yield increases day by dayCities face the crisis of "solid waste enclosure". The incineration treatment mode of solid wastes increasingly becomes the main mode of urban solid waste treatment in China. And NO_XIs one of main pollutants generated in the process of incinerating urban solid wastes, and seriously influences the health of people and the quality of ecological environment. With the increasing requirements of environmental protection and treatment in China, the control of NOx emission becomes one of the key problems to be solved urgently in municipal solid waste incineration plants, and the real-time accurate detection of NOx is one of the important prerequisites for improving the denitration efficiency of the municipal solid waste incineration plants. Thus, realize NO_XThe real-time accurate detection has important theoretical significance and application value.

Disclosure of Invention

The invention aims to provide a brain-like modular neural network-based urban solid waste incineration process NO_XSoft measurement method, adopting brain-like modular neural network to establish NO_XSoft measurement model, implementation to NO_XAnd (4) accurately acquiring the concentration in real time.

The invention adopts the following technical scheme and implementation steps:

1. collecting data;

2. determining the input and output variables of the model: determining the input variable of the model by adopting a maximum correlation minimum redundancy (mRMR) algorithm, wherein the output variable of the model is NO at the current moment_XConcentration;

the method for determining the input variables of the model by adopting the mRMR algorithm is as follows:

given two random variables a and b, the mutual information between the two random variables is calculated as follows:

wherein, I is mutual information between random variables a and b, and p (a) and p (b) are edge probability distribution of the random variables a and b respectively; p (a, b) is the joint distribution of random variables a and b;

firstly, based on mutual information, finding a feature subset S having the maximum correlation with a variable c to be measured:

wherein ,m_iThe method comprises the following steps of taking characteristic variables in a characteristic subset S, | S | is the number of the characteristic variables in the characteristic subset S, D is the correlation between the selected characteristic variables and a variable c to be measured, and if D is larger, the correlation between the selected characteristic variables and the variable c to be measured is higher;

considering that certain similarity exists among the selected feature variables, and the elimination of the 'redundant' feature does not affect the model performance, therefore, the redundancy among the features is calculated, and the 'mutually exclusive' feature is found:

wherein ,m_i，n_jThe characteristic variables in the characteristic subset S are taken as R, the redundancy among the variables in the characteristic subset S is taken as R, and the smaller the R is, the lower the redundancy is;

in the mRMR algorithm application process, the maximum correlation index D and the minimum redundancy index R are generally unified into an evaluation function Φ ═ D-R, and then an optimal feature subset S is determined by finding the maximum value of the evaluation function:

maxΦ(D,R),Φ＝D-R (4)

3. designed for NO_XA brain-like modular neural network model for soft measurement of concentration;

(1) task decomposition

In order to measure and evaluate the modularization degree of a network and simulate the modularization characteristic of a brain network, a modularization index (MQ) facing a modularization neural network is provided; the modularization index consists of the density in the module and the sparsity among the modules, wherein the density in the module is calculated as follows:

wherein ,J_CIs the density in the moduleSet degree, P is the number of modules in the current network, N_lNumber of samples assigned to the l-th module, x_iTo input samples, h_l and r_lRespectively representing the position and the action range of the core node of the ith module;

the sparsity between the modules is calculated based on the Euclidean distance:

wherein ,J_SD (h) is the degree of sparsity between modules_l,h_s) Representing the distance between the core node of the l-th module and the core node of the s-th module, and comprehensively considering the density J in the module_CAnd degree of sparsity J between modules_SThe modularization index measurement mode is provided as follows:

therefore, the larger the value of MQ is, the higher the modularization degree of the network is, so a brain-like modularization partition method is proposed, and the main idea is as follows: firstly, distributing training samples through core nodes to determine whether to distribute the training samples to a current existing module or a newly added module; then, a new core node of an existing module is determined by seeking the maximum "modularity" degree of the network, and therefore, the modular structure construction can be divided into two cases: adding new modules and updating existing modules;

adding new module

At the initial moment, the number of modules of the whole network is 0;

when the first data sample enters the network, setting the first data sample as a core node of a first submodule:

h₁＝x₁ (8)

wherein ,h₁ and r₁Respectively representing the location and range of action, x, of the first module core node₁As input vector for the first training sample, d_maxFor training sample x_iAnd x_jThe maximum distance therebetween;

at time t, when the tth training sample enters the network, assuming that k modules exist, finding the core node closest to the sample:

wherein ,x_tIs the input vector of the t-th training sample, h_sDenotes the location of the core node of the s-th module, k_minRepresenting distance training samples x_tThe nearest core node;

if the t-th training sample is not in k_minIn the action range of the core node, a new module is needed to learn the current sample, and the parameters of the core node corresponding to the new module are set as follows:

h_k+1＝x_t (12)

wherein ,h_k+1，r_k+1Position and range of action of core node corresponding to newly added module, x_tIs the input vector for the t-th training sample,

the farthest distance from other core nodes to the core node of the newly added module;

② optimizing existing modules

Otherwise, the sample is considered to be classifiedIs k_minIn the module, in order to make the network have the optimal modularization degree, according to the formula (7), modularization index values MQ of the whole network under the condition that the current sample and the original core node are respectively used as the core nodes are respectively calculated_tAnd

if it is

If the network modularization degree of the current input sample as the core node is considered to be higher, the sample is used for replacing the existing core node to become a new core node, and the initial parameter setting is as follows:

wherein ,

respectively as a new core node and an action range of the module; n is a radical of_kIs the number of samples allocated to the kth module;

if it is

The location of the current core node

Keeping unchanged, only adjusting the action range of the node

Namely:

wherein ,

is the original core node of the module;

after all training samples are compared, the samples are distributed to different sub-modules, a partition structure is formed, the modularization degree of the current network can be considered to be the maximum, and then a sub-network needs to be constructed according to the task of each sub-module;

(2) subnetwork structure design

Training data set

Is divided into M subsets;

wherein ,x_i，y_iRespectively the input variables and the output variables of the model,

representing the domain, V being the input vector x_iN is the number of model input variables and output variable data;

an adaptive task-oriented radial basis function neural network (ATO-RBF) is adopted to construct a sub-network corresponding to each module, and the design of the sub-network comprises three parts: network structure growth, network structure pruning and network parameter adjustment;

(ii) network architecture growth

The center, radius and connection weight to the output layer of the node of the first hidden layer of the s-th module are based on the sample with the largest absolute output

Setting:

wherein ,

for the sample with the largest absolute output,

respectively representing the input variable and the output variable corresponding to the sample with the largest absolute output, r_sIs the range of action of the s-th module, N_sIs the number of samples assigned to the s-th module;

the center and the radius of a first hidden layer node of the s-th module and the connection weight to the output layer are respectively;

at time t_sAt time, training error vector e (t)_s) Obtained by the following formula:

wherein ,y_fIs the desired output of the f-th sample,

is the f-th sample at time t_sIs calculated by the following equation:

wherein H is the number of neurons in the hidden layer,

is the function of the jth hidden layer node of the s-th module,

for the jth hidden layer node of the s-th module to the output layer connection weight,

the center and radius of the jth hidden layer node of the s-th module;

finding the sample with the largest difference between the expected output and the network output

Then, adding an RBF neuron pair

Learning is carried out on each sample, and the initial parameters of the newly added neurons are as follows:

wherein

And

respectively representing the center of the new neuron of the s-th module and the connection weight to the output layer;

is a first

The input variable corresponding to each of the samples,

are respectively the first

Expected output sum corresponding to each sample at t_sA network output value of a time;

when the following relation is satisfied, the influence of the existing neurons on the newly added neurons is small:

wherein

Is the center of the neuron closest to the newly added neuron;

from equations (27) and (28), the radius of the newly added neuron is set as:

when a neuron is newly added, network parameters are adjusted through a second-order learning algorithm; when reaching the preset maximum structure J_maxOr period of timeInspection training precision E₀The network structure growth process ends; in the course of the experiment J_max＝10，E₀The training accuracy of the network is measured using Root Mean Square Error (RMSE) at 0.0001, and is calculated as follows:

wherein ,y_i and y_iThe expected output and the network output of the ith sample are respectively;

② trimming of network structure

In order to avoid redundancy of the network structure, it is proposed to measure the contribution value of hidden layer neurons based on the index of the connection weight:

wherein SI (j) is the contribution value of the jth hidden layer node,

is the weight from the jth hidden layer node of the s-th module to the output layer;

finding the hidden layer node with the smallest contribution value:

wherein ,

the hidden layer node with the minimum contribution value in the s-th module, and J is the number of the hidden layer nodes in the network;

therefore, deleting the hidden node with the smallest contribution, adjusting parameters through a second-order learning algorithm, and comparing the root mean square error value (RMSE _1) after the node is deleted with the root mean square error value (RMSE _0) when the node is not deleted, wherein the calculation formula of RMSE is shown as a formula (30); if RMSE _1 is less than or equal to RMSE _0, the selected nodes can be pruned under the condition of not sacrificing the network learning capability, and then the process is repeated, otherwise, the selected nodes cannot be deleted; at this time, the network structure trimming process is finished, and the sub-network construction is completed; adjusting parameters by using a second-order learning algorithm every time when the neurons are deleted;

adjusting network parameters

The second order learning algorithm is as follows:

θ_L+1＝θ_L-(Q_L+μ_LE)^-1g_L (33)

wherein θ refers to parameters to be adjusted, including center, radius and connection weight, Q is a Hessian-like matrix, μ is a learning coefficient (0.01 in the experiment), E is an identity matrix, g is a gradient vector, and L is an iteration step number (set to 50 in the experiment);

in order to reduce memory requirements and calculation time, the calculation of the Hessian-like matrix Q and the gradient vector g is converted into Hessian-like sub-matrix summation and gradient sub-vector summation:

q_zis a Hessian-like sub-matrix, η_zThe gradient subvectors can be calculated by the following formula:

wherein ,e_zIs the desired output y of the z-th sample_zAnd network prediction output

Difference of j_zFor the Jacobian vector, the calculation is as follows:

according to the chain derivation rule, each component in the jacobian vector in equation (39) is calculated as follows:

wherein

The center and radius of the jth neuron of the s-th module and the connection weight value x to the output layer_zIs the input vector of the z-th training sample;

4、NO_Xsoft measurement of concentration;

taking test sample data as the input of the brain-like modular neural network, wherein the output of the model is NO_XSoft measurements of concentration; and at time T, after the Tth test sample enters the BIMNN, searching a core node closest to the sample, and activating a sub-module to which the core node belongs:

wherein ,x_TAs input vector for the Tth test sample, h_sIs the core node of the s-th module, l_actIs a distance of the Tth test sample x_TThe nearest core node belongs to the sub-network, and A is the number of sub-network modules;

therefore, the actual output of BIMNN is the l_actOutput of the sub-network:

wherein ,

for the actual output of the BIMNN at time T,

is the first_actThe actual output of the sub-network;

and (3) quantitatively evaluating the test precision by adopting a Root Mean Square Error (RMSE), an average percent error (MAPE) and a Correlation Coefficient (CC), wherein the RMSE is calculated according to a formula (30), and the MAPE and CC are calculated according to the following formulas:

in the formula ,y_iAnd

expected output and net output, N, respectively, for the ith sample_TThe number of samples to be tested.

The invention has the following obvious advantages and beneficial effects:

1 the invention is based on the classGood nonlinear mapping capability and generalization capability of brain modular neural network, and establishes stable and effective NO_XSoft concentration measurement model for NO_XThe concentration is accurately obtained in real time, and NO is generated in the process of burning urban solid wastes_XEmission control is of great significance.

Drawings

FIG. 1 is a flow chart of the present invention;

FIG. 2 is a diagram of a brain-like modular neural network architecture;

FIG. 3 is a diagram of the structure of an ATO-RBF neural network;

FIG. 4 is a graph of the training results of subnetwork 1;

FIG. 5 is a graph of the results of the training of subnetwork 2;

FIG. 6 is a graph of the results of the training of subnetwork 3;

FIG. 7 is a BIMNN soft measurement model test output diagram;

FIG. 8 is a BIMNN soft measurement test error graph.

Detailed Description

The invention uses training data to establish for NO_XA brain-like modular neural network model for soft measurement of concentration; verifying NO output by brain-like modular neural network soft measurement model by using test data set_XAccuracy of real-time concentration.

As an example, the effectiveness of the method provided by the invention is verified by using actual data from a solid waste incineration plant in a certain city of Beijing. After removing obvious abnormal data, 1000 groups of 96-dimensional experimental data are obtained. Based on the obtained data sample, adopting an mRMR algorithm to perform feature selection, selection and NO_XThe 20 variables with larger correlation are used as input variables of the soft measurement model, and are shown in table 1 in detail.

TABLE 1

For 1000 groups of data after dimension reduction, 750 groups of data are used for establishing a soft measurement model, and the other 250 groups of data are used for testing the performance of the model;

(1) based on 750 groups of training data, adopting a brain-like modular partitioning method, dividing 750 groups of training data into three subsets, wherein the sample number of each subset is 215, 273 and 262 respectively; correspondingly, the BIMNN consists of 3 sub-networks, and the sub-networks are established by the sub-network construction method in step 3; fig. 4, 5, and 6 are graphs of training results of respective sub-networks, where X-axis: number of training samples, in units of units per sample, Y-axis: NO_XConcentration in mg/Nm³；

(2) NO by brain-like modular neural network based on 250 sets of test data_XConcentration soft measurement, the test results are shown in fig. 7, X-axis: number of samples tested, in units of units per sample, Y-axis: NO_XConcentration in mg/Nm³(ii) a Test error as shown in fig. 8, X-axis: number of training samples, in units of units per sample, Y-axis: NO_XError in concentration measurement in mg/Nm³；

(3) The measurement accuracy is quantitatively evaluated by using the root mean square error RMSE, the average percentage error MAPE and the correlation coefficient CC, and the calculation result is that RMSE is 6.5031, MAPE is 3.8514% and CC is 0.9777.

It will be apparent to those skilled in the art that various modifications and variations can be made in the present invention without departing from the spirit or scope of the invention. It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that modifications and variations can be made without departing from the spirit and scope of the invention as defined by the appended claims.

Claims

1. A brain-like modular neural network-based soft measurement method for nitrogen oxides in an urban solid waste incineration process is characterized by comprising the following steps:

step 1, data acquisition;

step 2, determining model input and output variables;

model input variables passing maximum phaseDetermining by a minimum redundancy mRMR algorithm, wherein the output variable of the model is NO at the current moment_XConcentration;

step 3, designing a brain-like modular neural network, and establishing a soft measurement model;

step 4, using the test data as the input of the model, wherein the output of the model is NO at the current moment_XA concentration measurement;

in step 2, the input variable selection method based on the mRMR algorithm is as follows:

considering that certain similarity exists among the selected characteristic variables, and the elimination of the 'redundant' characteristic does not influence the performance of the model; therefore, the redundancy between features is calculated, and the "mutually exclusive" features are found:

in the mRMR algorithm application process, the maximum correlation index D and the minimum redundancy index R are generally unified into an evaluation function Φ — D-R, and then the optimal feature subset S is determined by finding the maximum value of the evaluation function:

maxΦ(D,R),Φ＝D-R (4)

in step 3, the design method of the soft measurement model based on the brain-like modular neural network is as follows:

(1) task decomposition

In order to measure and evaluate the modularization degree of the network and simulate the modularization characteristic of the brain network, a modularization index MQ facing to a modularization neural network is provided, the modularization index MQ consists of the density degree in the modules and the sparsity degree among the modules, wherein the density degree in the modules is calculated as follows:

wherein ,J_CFor the density within a module, P is the number of modules in the current network, N_lNumber of samples assigned to the l-th module, x_iTo input samples, h_l and r_lRespectively representing the position and the action range of the core node of the first module;

the sparsity between the modules is calculated based on the Euclidean distance:

therefore, the larger the value of MQ is, the higher the modularization degree of the network is, so a brain-like modularization partition method is proposed, and the main idea is as follows: firstly, distributing training samples through core nodes to determine whether to distribute the training samples to a current existing module or a newly added module; then, determining a new core node of the existing module by seeking the maximum modularization degree of the network; thus, modular structure construction can be divided into two cases: adding new modules and updating existing modules;

adding new module

At the initial moment, the number of modules of the whole network is 0;

when the first data sample enters the network, setting the first data sample as a core node of a first sub-module:

h₁＝x₁ (8)

wherein ,x_tIs the input vector of the t-th training sample, h_sDenotes the location of the core node of the s-th module, k_minRepresenting distance training sample x_tThe nearest core node;

if the t-th training sample x_tIs not at k_minIn the action range of the core node, a new module is needed to learn the current sample, and the parameters of the core node corresponding to the new module are set as follows:

h_k+1＝x_t (12)

② optimizing existing modules

Otherwise, the sample is considered to be classified as k_minIn the module, in order to make the network have the optimal modularization degree, according to the formula (7), modularization index values MQ of the whole network under the condition that the current sample and the original core node are respectively used as the core nodes are respectively calculated_tAnd

if it is

Then the network modularization degree of the current input sample selected as the core node is considered to be higher, and the sample is used for replacing the existing core nodeThe new core node, the initial parameters set as follows:

wherein ,

respectively as a new core node and an action range of the module; n is a radical of_kIs the number of samples assigned to the kth module;

if it is

The location of the current core node

Keeping unchanged, only adjusting the action range of the node

Namely:

wherein ,

is the original core node of the module;

after all training samples are compared, the samples are distributed to different sub-modules, a partition structure is formed, the modularization degree of the current network is considered to be the maximum, and then a sub-network needs to be constructed according to the task of each sub-module;

(2) subnetwork structure design

Training data set

Is divided into M subsets;

(ii) network architecture growth

Setting:

wherein ,

for the sample with the largest absolute output,

wherein ,y_fIs the desired output of the f-th sample,

is the f-th sample at time t_sIs calculated by the following equation:

wherein H is the number of neurons in the hidden layer,

is the function of the jth hidden layer node of the s-th module,

the center and radius of the jth hidden layer node of the s-th module;

Then, adding a new RBF neuron pair

wherein

And

is a first

The input variable corresponding to each of the samples,

are respectively the first

wherein

Is the center of the neuron closest to the newly added neuron;

from equations (27) and (28), the radius of the newly added neuron is set as:

when a neuron is newly added, network parameters are adjusted through a second-order learning algorithm; when reaching the preset maximum structure J_maxOr desired training accuracy E₀The network structure growth process ends; in the course of the experiment J_max＝10，E₀0.0001, adoptThe root mean square error RMSE is used to measure the training accuracy of the network and is calculated as follows:

② trimming of network structure

wherein SI (j) is the contribution value of the jth hidden layer node,

finding the hidden layer node with the smallest contribution value:

wherein ,

therefore, the hidden node with the smallest contribution is deleted, parameters are adjusted through a second-order learning algorithm, the root mean square error value (RMSE _1) after the node is deleted is compared with the root mean square error value (RMSE _0) when the node is not deleted, and the calculating formula of RMSE is shown as a formula (30); if RMSE _1 is less than or equal to RMSE _0, the selected nodes can be pruned under the condition of not sacrificing the network learning capability, and then the process is repeated, otherwise, the selected nodes cannot be deleted; at this time, the network structure trimming process is finished, and the sub-network construction is completed; adjusting parameters by using a second-order learning algorithm every time when the neurons are deleted;

network parameter adjustment

The second order learning algorithm is as follows:

θ_L+1＝θ_L-(Q_L+μ_LE)^-1g_L (33)

wherein theta refers to parameters needing to be adjusted and comprises a center, a radius and a connection weight, Q is a Hessian-like matrix, mu is a learning coefficient, and 0.01 is taken; e is an identity matrix, g is a gradient vector, and L is an iteration step number set to be 50;

Difference of j_zFor the Jacobian vector, the calculation is as follows:

wherein

in step 4, the test data is used as the input of the model, and the output of the model is NO at the current moment_XA concentration measurement;

and at time T, after the Tth test sample enters the BIMNN, searching a core node closest to the sample, and activating a sub-module to which the core node belongs:

therefore, the actual output of BIMNN is the l_actOutput of the sub-network:

wherein ,

for the actual output of the BIMNN at time T,

is the first_actThe actual output of the sub-network.