CN114383295B

CN114383295B - Ventilation control method based on indoor pollution concentration estimation model

Info

Publication number: CN114383295B
Application number: CN202111535815.6A
Authority: CN
Inventors: 牟淑志; 鞠全勇; 牟福元; 高素美; 张玉; 李永琳; 吴恩; 姜玉东; 陈俊桦
Original assignee: Jinling Institute of Technology
Current assignee: Jinling Institute of Technology
Priority date: 2021-12-15
Filing date: 2021-12-15
Publication date: 2023-05-16
Anticipated expiration: 2041-12-15
Also published as: CN114383295A

Abstract

The invention relates to a ventilation control method based on an indoor pollution concentration estimation model. The method comprises the following steps: step 1, building an indoor air detection system; step 2, PM2.5 and CO collected in the step 1 are collected ₂ Carrying out normalization processing on smoke signal data; step 3, establishing an indoor pollution concentration estimation model; step 4, establishing an indoor ventilation control algorithm model; step 5, embedding the designed indoor pollution concentration estimation model and the indoor ventilation control algorithm model into an air detection data processing module to complete the control of indoor ventilation; and 6, the indoor air detection data processing module detects outdoor wind speed and raindrop value in real time, and when the defined threshold value is exceeded, the controller closes the ventilation device. The invention establishes indoor PM2.5 and CO ₂ Based on the dynamic estimation model of concentration and smoke signals, PM2.5 and CO are dynamically estimated according to the rewarding value of ventilation control quantity ₂ The control of the window opening and the fan is completed by the concentration and smoke signals, so that the energy consumption can be saved and the service life of equipment can be prolonged compared with the traditional mechanical ventilation system.

Description

Ventilation control method based on indoor pollution concentration estimation model

Technical Field

The invention relates to the field of indoor pollution, in particular to a ventilation control method based on an indoor pollution concentration estimation model.

Background

With the development of modern technology and the improvement of life quality requirements of people, the functions of modern buildings are more and more perfect, and more human activities can be transferred from outdoor to indoor. Research shows that modern people spend most of their time in building rooms, so that the quality of the air in the rooms has a very important influence on the health and comfort of the human body. In recent years, pollution problems of indoor air around the world are frequent, and indoor air quality problems are also becoming more and more important.

Indoor air pollutants are mainly classified into solid and gas. The solid pollutants mainly comprise suspended particles, dust, pollen of partial plants, cells of microorganisms, partial smoke and the like; the gas pollutants mainly comprise Volatile Organic Compounds (VOCs), ozone (O3), nitrogen oxides (NOx), sulfur dioxide (SO 2), radon gas and the like. Prolonged exposure of the human body to these indoor air pollutants can suffer from serious respiratory diseases, cardiovascular diseases, skin diseases, cancers, etc., and even die. The invention provides a ventilation control method based on an indoor pollution concentration estimation model, which is used for establishing an indoor PM2.5, CO2 and smoke concentration dynamic estimation model and controlling ventilation equipment according to the dynamic estimation model.

Disclosure of Invention

In order to solve the problems, the invention collects PM2.5 and CO in the room ₂ Based on smoke concentration data and outdoor raindrops and wind speed data, a ventilation control method based on an indoor pollution concentration estimation model is provided, and indoor PM2.5 and CO are established ₂ Based on the dynamic estimation model of concentration and smoke signals, PM2.5 and CO are dynamically estimated according to the rewarding value of ventilation control quantity ₂ The concentration and smoke signals are used for controlling the opening of a window and a fan, and the specific steps of the invention are as follows, and the invention is characterized in that:

step 1, building an indoor air detection system: the indoor air detection system is composed of PM2.5 detection sensor and CO ₂ The device comprises a detection sensor, a smoke detection sensor, a wind speed detection sensor, a raindrop sensor, a signal conditioning circuit module, a power supply circuit module and an NI acquisition card;

step 2, PM2.5 and CO collected in the step 1 are collected ₂ Normalizing the smoke signal data to obtain PM2.5 and CO ₂ Normalizing the smoke signal data to a range of 0 to 1 to reduce dimensional effects between the data;

step 3, establishing an indoor pollution concentration estimation model: PM2.5 and CO after normalization in the step 2 ₂ The smoke data and the output value of the controller to the actuating mechanism are used as input, and the indoor PM2.5 and CO at the next moment are used ₂ The smoke data is used as output to establish an indoor pollution concentration estimation model;

step 4, building an indoor ventilation control algorithm model: indoor PM2.5 and CO at the next moment calculated by wind speed, ventilation action and indoor pollution concentration estimation model ₂ Smoke data and current indoor PM2.5, CO ₂ The smoke data is used as an input training depth deterministic strategy gradient network, and the depth deterministic strategy gradient network is utilized to output ventilation values;

step 5, embedding the designed indoor pollution concentration estimation model and the indoor ventilation control algorithm model into an air detection data processing module to complete the control of indoor ventilation;

and 6, the indoor air detection data processing module detects outdoor wind speed and raindrop value in real time, and when the defined threshold value is exceeded, the controller closes the ventilation device.

Further, the process of building the indoor air detection system in step 1 may be represented as follows:

PM2.5 detection sensor and CO are respectively detected by a power supply circuit ₂ Detection sensor, smoke detection sensor, wind speed detection sensor, and raindropThe sensor is powered and installed in proper position to detect PM2.5 and CO ₂ And the smoke, wind speed and raindrop signals are converted by the conditioning circuit and then are uniformly connected to the NI acquisition card, and the NI acquisition card transmits the signals to the indoor air detection data processing module of the upper computer through the PCIE interface.

Further, the process of establishing the indoor pollution concentration estimation model in step 3 may be expressed as follows:

step 3.1: the indoor pollution concentration estimation model consists of an input door, a forgetting door, a cell state and an output door; firstly, an input door selectively memorizes input, and a forget door selectively forgets input transmitted by a cell state; secondly, the outputs of the input gate and the forgetting gate are overlapped to the next cell state; finally, the output gate scales the output of the cell state and outputs the predicted value of the next moment;

step 3.2: input door for constructing indoor pollution concentration estimation model and filtering PM2.5 and CO ₂ The smoke data and the invalid information of the output value of the controller to the actuating mechanism, and inputting valid information to the cell state

Entry is indicated as follows:

i _t ＝g(W _xi x _t +W _hi h _t-1 +W _ci c _t-1 +b _i +W _ni n _t (1)

wherein x is _t Is normalized PM2.5 and CO ₂ Smoke data and data of output value of controller to actuator, g () is activation function, W _xi Is an input gate and x _t Weight coefficient between h _t-1 Is a hidden state at t-1, W _hi Is the weight coefficient between the input gate and the hidden state, c _t-1 Is the cell state at t-1, W _ci Is the weight coefficient between the input gate sample and the cell state, b _i Is the bias term of the input gate, i _t To input gate sample, n _t Is the noise interference at t, W _ni The weight coefficient between the input gate and the noise interference is used for improving the robustness of the model by adding the noise interference;

step 3.3: and constructing a forgetting gate of an indoor pollution concentration estimation model to delete part of invalid information in the cell state, wherein the forgetting gate is expressed as follows:

f _t ＝g(W _xf x _t +W _hf h _t-1 +W _cf c _t-1 +b _f ) (2)

wherein f _t For forgetting gate output, W _xf Is forgetting door and x _t Weight coefficient of each other, W _hf Is the weight coefficient between the forget gate and the hidden state, W _cf For forgetting door sample and cell state c _t-1 Weight coefficient of b _f Is forgetting

A bias term for the gate;

step 3.3: constructing a model cell state:

c _t ＝f _t c _t-1 +i _t tanh(W _xc x _t +W _hc h _t-1 +b _c ) (3)

in which W is _xc Is the cell state and x _t Weight coefficient of the two, W _hc Is cell state and hidden state h _t-1 Weight coefficient of b _c A bias term for the state of the cell;

step 3.4: building a model output gate to output indoor PM2.5 and CO at the next moment ₂ Smoke data:

s _t ＝g(W _xs x _t +W _hs h _t-1 +W _cs c _t +b _s ) (4)

wherein s is _t PM2.5 and CO in the room at the next moment of output of the output gate ₂ Normalized data of smoke, W _xs Is an output gate and x _t Weight coefficient of each other, W _hs Is to output the door sample and h _t-1 Weight coefficient of the two, W _cs Is the output gate sample, c _t Weight coefficient between b _s Bias item for output gate

Step 3.5: hidden state h of t moment through output gate _t Updating:

h _t ＝s _t tanh(c _t ) (5)

and 3.6, repeating the steps 3.2-3.5, and updating the weight and the bias parameters in the model by utilizing an SGD algorithm until the loss function reaches a convergence threshold value, wherein the convergence threshold value is set to be 1e-4.

Further, the process of building the indoor ventilation control algorithm model in step 4 may be represented as follows:

step 4.1, the indoor ventilation control algorithm model comprises a strategy network and a value network, wherein the value network aims at selecting an optimal action value, the value network aims at minimizing a loss function to update network parameters, the strategy network aims at selecting a current action according to a current state, and generating a next moment action according to an environment state;

step 4.2, respectively initializing a strategy network and a value network, wherein the strategy network and the value network respectively comprise a real network and a target network, and randomly initializing parameters of the strategy network and the value network: θ ^Q 、θ ^μ And the following steps:

θ ^Q′ ＝θ ^Q (6)

θ ^μ′ ＝θμ (7)

in θ ^Q Representing real value network parameters, theta ^Q′ Representing the target value network parameter, θ ^μ Representing realistic policy network parameters, θ ^μ′ Representing target policy network parameters

Step 4.3, selecting actuator action a under the current state _t ：

a _t ＝μ(s _t |θ ^μ )+N _t (8)

Wherein N is _t Is random noise, and the function mu () is the optimal behavior strategy;

step 4.4, by performing action a _t Obtain the prize r _t+1 And a state s estimated by an indoor pollution concentration estimation model _t+1 Save the current state s _t Action a _t Prize r _t+1 Sum state s _t+1 And placing the model into an experience pool; ,

step 4.5, when the empirical pool data reaches the training conditions, randomly selecting N samples {(s) from the empirical pool _t ，a _t ，r _t+1 ，s _t+1 )}；

Step 4.6, strategy network and value network parameters θ are measured using the samples in the experience pool ^Q 、θ ^μ Updating the target action value y calculated through the value network:

y＝r _t+1 +γQ(s _t+1 ，a _t+1 /θ ^Q′ ) (9)

gamma is the discount factor, a _t+1 Is an action at time t+1;

step 4.7, updating the target network once at intervals:

θ ^Q′ ＝τθ ^Q +(1-τ)θ ^Q ′ (10)

θ ^μ′ ＝τθ ^μ +(1-τ)θ ^μ′ (11)

wherein τ is a learning rate;

step 4.8, returning to the step 4.3 to carry out loop iteration, and knowing the maximum round number;

step 4.9, outputting ventilation control quantity a according to the indoor ventilation control algorithm model after training is completed _t 。

The ventilation control method based on the indoor pollution concentration estimation model has the beneficial effects that:

according to the invention, on the basis of establishing a dynamic estimation model of indoor PM2.5, CO2 concentration and smoke signals, the control of the window opening and the fan is finished according to the rewarding value of the ventilation control quantity and the dynamic estimation of the PM2.5, CO2 concentration and smoke signals, and compared with the traditional mechanical ventilation system, the energy consumption can be saved and the service life of equipment can be prolonged;

according to the invention, the outdoor wind speed and the raindrop value are detected in real time, and when the air speed and the raindrop value exceed the limiting threshold value, the controller turns off the ventilation device, so that the safe operation of the ventilation system is ensured;

in order to increase the self-adaption and anti-interference capacity of the ventilation system, the invention designs a novel ventilation control method based on an indoor concentration estimation model, and the system executor is controlled by utilizing the online weight learning capacity, so that the uncertainty and the interference of the ventilation system can be automatically controlled;

the invention provides an important technical means for an indoor pollution ventilation control method.

Drawings

FIG. 1 is a flow chart of the present invention;

fig. 2 is a model diagram of an indoor ventilation control algorithm of the present invention.

Detailed Description

The invention provides a ventilation control method based on an indoor pollution concentration estimation model, aims to improve the running stability and service life of ventilation control equipment while improving the indoor air quality, and provides a principle and a prediction framework based on an indoor pollution concentration estimation algorithm and a ventilation control algorithm. FIG. 1 is a flow chart of the present invention, and the present invention is further described below with reference to the drawings and the detailed description:

PM2.5 detection sensor and CO are respectively detected by a power supply circuit ₂ The detection sensor, the smoke detection sensor, the wind speed detection sensor and the raindrop sensor are powered, and meanwhile, the sensors are installed at proper positions to detect PM2.5 and CO ₂ And the smoke, wind speed and raindrop signals are converted by the conditioning circuit and then are uniformly connected to the NI acquisition card, and the NI acquisition card transmits the signals to the indoor air detection data processing module of the upper computer through the PCIE interface.

step 3, establishing an indoor pollution concentration estimation model: PM2.5 and CO after normalization in the step 2 ₂ Pair of smoke data and controllerThe constructed output value is used as input to make indoor PM2.5 and CO at the next moment ₂ The smoke data is used as output to establish an indoor pollution concentration estimation model;

step 3.2: input door for constructing indoor pollution concentration estimation model and filtering PM2.5 and CO ₂ And the smoke data and the invalid information of the output value of the controller to the execution mechanism, and inputting valid information to the cell state, wherein the input gate is expressed as follows:

i _t ＝gW _xi x _t +W _hi h _t-1 +W _ci c _t-1 +b _i +W _ni n _t (I)

f _t ＝g(W _xf x _t +W _hf h _t-1 +W _cf c _t-1 +b _f ) (2)

wherein f _t For forgetting gate output, W _xf Is forgetting door and x _t Weight coefficient of each other, W _hf Is the weight coefficient between the forget gate and the hidden state, W _cf For forgetting door sample and cell state c _t-1 Weight coefficient of b _f Is an offset item of the forgetting door;

step 3.3: constructing a model cell state:

c _t ＝f _t c _t-1 +i _t tanh(W _xc x _t +W _hc h _t-1 +b _c ) (3)

s _t ＝g(W _xs x _t +W _hs h _t-1 +W _cs c _t +b _s ) (4)

Step 3.5: hidden state h of t moment through output gate _t Updating:

h _t ＝s _t tanh(c _t ) (5)

Step 4, building an indoor ventilation control algorithm model: wind speed and ventilationIndoor PM2.5 and CO at next moment calculated by action and indoor pollution concentration estimation model ₂ Smoke data and current indoor PM2.5, CO ₂ The smoke data is used as an input training depth deterministic strategy gradient network, and the depth deterministic strategy gradient network is utilized to output ventilation values;

the indoor ventilation control algorithm model comprises a strategy network and a value network, wherein the value network aims at selecting the optimal action value, the value network aims at minimizing a loss function to update network parameters, the strategy network aims at selecting the current action according to the current state and generating the action at the next moment according to the environmental state, and the indoor ventilation control algorithm model diagram is shown in figure 2;

step 4.1, respectively initializing a strategy network and a value network, wherein the strategy network and the value network respectively comprise a real network and a target network, and randomly initializing parameters of the strategy network and the value network: θ ^Q 、θ ^μ And the following steps:

θ ^Q′ ＝θ ^Q (6)

θ ^μ′ ＝θμ (7)

Step 4.2, selecting actuator action a under the current state _t ：

a _t ＝μ(s _t |θ ^μ )+N _t (8)

step 4.3 by performing action a _t Obtain the prize r _t+1 And a state s estimated by an indoor pollution concentration estimation model _t+1 Save the current state s _t Action a _t Prize r _t+1 Sum state s _t+1 And placing the model into an experience pool; ,

step 4.4, when the experience pool data reaches the training conditionAt this time, N samples {(s) are randomly selected from the experience pool _t ，a _t ，r _t+1 ，s _t+1 )}；

Step 4.5, strategy network and value network parameters θ are measured using the samples in the experience pool ^Q 、θ ^μ Updating the target action value y calculated through the value network:

＝r _t+1 +γQ(s _t+1 ，a _t+1 /θ ^Q′ ) (9)

gamma is the discount factor, a _t+1 Is an action at time t+1;

step 4.6, updating the target network once at intervals:

θ ^Q′ ＝τθ ^Q +(1-τ)θ ^Q′ (10)

θ ^μ′ ＝τθ ^μ +(1-τ)θ ^μ′ (11)

wherein τ is a learning rate;

step 4.7, returning to the step 4.2 to carry out loop iteration, and knowing the maximum round number;

step 4.8, outputting ventilation control quantity a according to the indoor ventilation control algorithm model after training _t 。

The above description is only of the preferred embodiment of the present invention, and is not intended to limit the present invention in any other way, but is intended to cover any modifications or equivalent variations according to the technical spirit of the present invention, which fall within the scope of the present invention as defined by the appended claims.

Claims

1. The ventilation control method based on the indoor pollution concentration estimation model comprises the following specific steps of:

step 1, lappingBuilding an indoor air detection system: the indoor air detection system is composed of PM2.5 detection sensor and CO ₂ The device comprises a detection sensor, a smoke detection sensor, a wind speed detection sensor, a raindrop sensor, a signal conditioning circuit module, a power supply circuit module and an NI acquisition card;

the process of building the indoor air detection system in step 1 can be expressed as follows:

PM2.5 detection sensor and CO are respectively detected by a power supply circuit ₂ The detection sensor, the smoke detection sensor, the wind speed detection sensor and the raindrop sensor are powered, and meanwhile, the sensors are installed at proper positions to detect PM2.5 and CO ₂ The smoke, wind speed and raindrop signals are converted by the conditioning circuit and then are uniformly connected to the NI acquisition card, and the NI acquisition card transmits the signals to the indoor air detection data processing module of the upper computer through the PCIE interface;

the process of establishing the indoor pollution concentration estimation model in the step 3 is represented as follows:

step 3.2: input door for constructing indoor pollution concentration estimation model and filtering PM2.5 and CO ₂ Pair of smoke data and controllerConstruct invalid information of output value, and input valid information to cell state, input gate is expressed as follows:

i _t ＝g(W _xi x _t +W _hi h _t-1 +W _ci c _t-1 +b _i )+W _ni n _t (1)

f _t ＝g(W _xf x _t +W _hf h _t-1 +W _cf c _t-1 +b _f ) (2)

step 3.3: constructing a model cell state:

c _t ＝f _t c _t-1 +i _t tanh(W _xc x _t +W _hc h _t-1 +b _c ) (3)

s _t ＝g(W _xs x _t +W _hs h _t-1 +W _cs c _t +b _s ) (4)

Step 3.5: hidden state h of t moment through output gate _t Updating:

h _t ＝s _t tanh(c _t ) (5)

step 3.6, repeating the steps 3.2-3.5, and updating the weight and the bias parameters in the model by utilizing an SGD algorithm until the loss function reaches a convergence threshold value, wherein the convergence threshold value is set to be 1e-4;

the process of building the indoor ventilation control algorithm model in step 4 can be expressed as follows:

θ ^Q′ ＝θ ^Q (6)

θ ^μ′ ＝θ ^μ (7)

Step 4.2, selecting actuator action a under the current state _t ：

a _t ＝μ(s _t |θ ^μ )+N _t (8)

step 4.3 by performing action a _t Obtain the prize r _t+1 And a state s estimated by an indoor pollution concentration estimation model _t+1 Save the current state s _t Action a _t Prize r _t+1 Sum state s _t+1 And placing the model into an experience pool;

step 4.4, when the empirical pool data reaches the training conditions, randomly selecting N samples {(s) from the empirical pool _t ，a _t ，r _t+1 ，s _t+1 )}；

y＝r _t+1 +γQ(s _t+1 ，a _t+1 /θ ^Q′ ) (9)

gamma is the discount factor, a _t+1 Is an action at time t+1;

step 4.6, updating the target network once at intervals:

θ ^Q′ ＝τθ ^Q +(1-τ)θ ^Q′ (10)

θ ^μ′ ＝τθ ^μ +(1-τ)θ ^μ′ (11)

wherein τ is a learning rate;

step 4.8, modeling according to the indoor ventilation control algorithm after trainingOutput ventilation control amount a _t ；