CN114065335A

CN114065335A - Building energy consumption prediction method based on multi-scale convolution cyclic neural network

Info

Publication number: CN114065335A
Application number: CN202111020470.0A
Authority: CN
Inventors: 马武彬; 顾桐菲; 吴亚辉; 邓苏; 周浩浩; 皇甫先鹏
Original assignee: National University of Defense Technology
Current assignee: National University of Defense Technology
Priority date: 2021-09-01
Filing date: 2021-09-01
Publication date: 2022-02-18

Abstract

The invention discloses a building energy consumption prediction method based on a multi-scale convolution cyclic neural network, which comprises the following steps of: building energy consumption prediction models based on the multi-scale convolution cyclic neural network are built; training the building energy consumption prediction model by using training set data; and inputting the test set data into the trained building energy consumption prediction model, and calculating to obtain the predicted value of the building energy consumption. The method introduces the multi-scale convolutional layer into the recurrent neural network, and attention mechanisms are distributed from different scales, so that the model can acquire historical information from different scales; the bidirectional GRU layer can more fully acquire context information of sequence data, the whole model adopts a convolution structure to fuse recognition outputs of attention mechanisms of different scales, and output is screened and recognized by different scales through convolution connection, so that better accuracy is acquired for prediction of building energy consumption values.

Description

Building energy consumption prediction method based on multi-scale convolution cyclic neural network

Technical Field

The invention belongs to the technical field of building energy consumption prediction, and particularly relates to a building energy consumption prediction method based on a multi-scale convolution cyclic neural network.

Background

The problem of energy consumption is one of the important issues of social widespread concern. The proportion of the building power consumption to the total social power consumption exceeds 50%, and the problem of power consumption prediction of a certain building or a family is one of the key problems, so that attention of vast personnel is attracted. The prediction of the future power consumption can provide early warning for the abnormal use of the power supply, and meanwhile, the power supply system can also provide decision support for power supply strategies and scheduling of power supply departments, and has great significance.

The prediction accuracy for energy consumption is still insufficient at present. The traditional machine learning methods such as linear regression, Support Vector Regression (SVR), random forest, XBBboost, ensemble learning and the like can predict the energy consumption, but because the factors influencing the energy consumption are more and the relationship is more complex, the traditional machine learning methods are difficult to capture the long-term dependence relationship, and the time sequence importance among the factors is not well acquired. Recently, researchers have adopted deep learning methods (RNN, LSTM, GRU, Bi-LSTM, etc.) to predict energy consumption, and the method has a good effect. However, both the conventional machine learning method and the deep learning method which is popular in recent years do not capture the correlation characteristics between the elements from the time sequence, and the prediction accuracy is not ideal.

Disclosure of Invention

In view of the above, in order to solve the problem of accurate prediction of building energy consumption, the present invention aims to provide a method for predicting building energy consumption based on a multi-scale convolution cyclic neural network, which predicts the power consumption of a certain building by combining outdoor air pressure, temperature, humidity, wind power and visible light sensor data with indoor temperature and humidity sensors of the building.

Based on the purpose, the building energy consumption prediction method based on the multi-scale convolution cyclic neural network is provided, and comprises the following steps:

step 1, constructing a building energy consumption prediction model based on a multi-scale convolution cyclic neural network;

step 2, training the building energy consumption prediction model by using training set data, wherein the training set data comprises influence factor data and known building energy consumption data;

and 3, inputting the test set data into the trained building energy consumption prediction model, and calculating to obtain a predicted value of the building energy consumption.

Specifically, the building energy consumption prediction model comprises a first convolution layer, a first bidirectional GRU layer, a first multi-scale convolution layer, a second bidirectional GRU layer, a second multi-scale convolution layer, a first full-connection layer and a second full-connection layer, wherein the layers are sequentially connected, the output of the first convolution layer and the output of the first bidirectional GRU layer are connected and then simultaneously used as the input of the first multi-scale convolution layer and the second multi-scale convolution layer, the bidirectional GRU layer is formed by connecting a forward GRU model and a backward GRU model in parallel to form a bidirectional structure, the bidirectional GRU layer outputs two combined GRU signals, the output layer of the first full connection layer is 100, the output layer of the second full connection layer is 1, the input of the first convolution layer in the building energy consumption prediction model is an influence factor data sequence, and the output of the second full-connection layer is a building energy consumption value.

Specifically, the energy consumption prediction model for the building is

x₀,...,x_TTo influence the factor sequence data, (y)₀,...,y_K),K<T is a known building energy consumption value, (y)_K+1,...,y_T) To the extent that a predicted building energy consumption value is required,

for the corresponding estimated value, the input is x₀,...,x_T,y₀,...,y_KAnd (3) sequentially inputting variables into the building energy consumption prediction model to start training, wherein the loss function adopts standard normalized MSE, and the activation function adopts Relu function.

Specifically, the analytical expression of the building energy consumption prediction model is as follows:

C⁴ _t＝η²([C² _t,C³ _t])

wherein x is_tIs an input to the model at time t, η¹(. eta.) and eta²(. is) two convolution operations, [, ]]For merge join operations, MutiScalConv (. circle., Scale1) and MutiScalConv (. circle., Scale2) are two multi-ruler scales Scale1 and Scale2, respectivelyThe degree convolution operation, the specific process of fusion convolution, is as follows:

first convolution layer η¹(x_t) Accepting sequence data x_tInput and output of

Simultaneously as inputs to the first multi-scale convolutional layer and the second multi-scale convolutional layer;

is the output of the first bi-directional GRU layer,

indicating the output to be GRU in forward direction

And backward GRU output

Carrying out merging connection;

is to multiply the first bidirectional GRU layer by a weight vector

And adding the offset vector

The result of (1);

will be provided with

And η¹(x_t) Output of (2)

Are combined into

As an input to the first multi-scale convolutional layer;

is the output of the first multi-Scale convolutional layer with Scale1, connected to the second bidirectional GRU layer;

is the output of the second bidirectional GRU layer,

indicating the output of the GRU in forward direction therein

And backward GRU output

Carrying out merging connection;

is to multiply the second bidirectional GRU layer by a weight vector

And adding the offset vector

The result of (1);

by analogy, the expression is obtained

By a convolution operation with Scale2, for [ C² _t,C³ _t]Extracting to reserve the more important scale information of the target and obtain output

Then is fully connectedOperation to obtain output O_t；

Wherein the content of the first and second substances,

and

all are obtained by learning and training.

Specifically, the bidirectional GRU layer is a bidirectional structure formed by connecting two GRU models of a forward GRU and a backward GRU, and the first layer of the forward GRU is forgotten to be output by a gate: f. of¹ _t＝σ(W¹ _f[H¹ _t-1,x”_t]+B¹ _f)，σ(x)＝1/(1+e^-x) In the forward GRU update gate, the first output is: z is a radical of¹ _t＝σ(W¹ _z[H¹ _t-1,x”_t]+B¹ _z) And the second output is:

similarly, the corresponding first output to the GRU is: z is a radical of² _t＝σ(W² _z[H² _t-1,x”_t]+B² _z) And a second output:

intermediate output of forward GRU

And backward GRU intermediate output

Obtaining an output by an aggregation operation on the intermediate output

Show thatForward GRU output

And backward GRU output

Performing merged connection as output of bidirectional GRU layer

x”_tFor the input of the bidirectional GRU layer, [ W ]¹ _f,B¹ _f]， [W¹ _Z,B¹ _Z]，[W¹ _h,B¹ _h]For forward GRU model parameters, [ W ]² _f,B² _f],[W² _Z,B² _Z]，[W² _h,B² _h]For the inverse GRU model parameters, [ W ]¹² _o,B¹² _o]Are output layer parameters.

The output of the first bidirectional GRU layer, correspondingly,

is the output of the second bidirectional GRU layer.

Preferably, the convolutional layer is a 1-dimensional convolutional network.

Specifically, the influencing factor data includes: the temperature and humidity of each room in the building, as well as the outside air pressure, outside humidity and outside wind speed.

The building energy consumption prediction model in the method mainly comprises a multi-scale convolution layer, a bidirectional GRU layer and a full-connection layer, the multi-scale convolution layer is introduced into a recurrent neural network, a plurality of improved attention units are connected in series, attention mechanisms are distributed on different scales, therefore, the model can collect historical information from different scales, distinguish the influence of different input elements on a prediction result, the bidirectional GRU layer can more fully acquire the context information of sequence data on the basis of GRUs, the whole model adopts a convolution structure to fuse the recognition output of attention mechanisms of different scales, and the output is screened and identified by different scales through convolution connection, so that the scale information which is more important to the target can be reserved, and the better precision is obtained for predicting the energy consumption value of the building.

Drawings

FIG. 1 is a schematic flow diagram of the process of the present invention;

FIG. 2 is a data processing flow diagram of the method of the present invention;

FIG. 3 is a schematic diagram of a 1-dimensional convolutional network in the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

As shown in fig. 1, the method for predicting the energy consumption of a building based on the multi-scale convolution cyclic neural network is provided, and includes the following steps:

Specifically, the building energy consumption prediction model comprises a first convolution layer, a first bidirectional GRU layer, a first multi-scale convolution layer, a second bidirectional GRU layer, a second multi-scale convolution layer, a first full-connection layer and a second full-connection layer, wherein the layers are sequentially connected in sequence, the output of the first convolution layer and the output of the first bidirectional GRU layer are connected and then simultaneously used as the input of the first multi-scale convolution layer and the input of the second multi-scale convolution layer, the bidirectional GRU layer is formed by connecting a forward GRU model and a backward GRU model in parallel to form a bidirectional structure, the bidirectional GRU layer outputs two combined GRU signals, the output layer of the first full-connection layer is 100, the output layer of the second full-connection layer is 1, the input of the first convolution layer in the building energy consumption prediction model is an influence factor data sequence, and the output of the full-connection layer is a building energy consumption value.

Specifically, the energy consumption prediction model for the building is

C⁴ _t＝η²([C² _t,C³ _t])

wherein x is_tIs an input to the model at time t, η¹(. eta.) and eta²(. is) two convolution operations, [, ]]For merge join operations, mutiscalaconv (·, Scale1) and mutiscalaconv (·, Scale2) are two multi-Scale convolution operations with scales Scale1 and Scale2, respectively, and the specific process of fusion convolution is as follows:

is the output of the first bi-directional GRU layer,

indicating the output to be GRU in forward direction

And backward GRU output

Carrying out merging connection;

is to multiply the first bidirectional GRU layer by a weight vector

And adding the offset vector

The result of (1);

will be provided with

And η¹(x_t) Output of (2)

Are combined into

As an input to the first multi-scale convolutional layer;

is the output of the second bidirectional GRU layer,

indicating the output of the GRU in forward direction therein

And backward GRU output

Carrying out merging connection;

is to multiply the second bidirectional GRU layer by a weight vector

And adding the offset vector

The result of (1);

by analogy, the expression is obtained

Then obtaining output O through full connection operation_t(ii) a The data processing flow of the method of the invention is shown in FIG. 2;

wherein the content of the first and second substances,

and

all are obtained by learning and training.

intermediate output of forward GRU

And backward GRU intermediate output

Obtaining an output by an aggregation operation on the intermediate output

Indicating the output to be GRU in forward direction

And backward GRU output

Performing merged connection as output of bidirectional GRU layer

The output of the first bidirectional GRU layer, correspondingly,

is the output of the second bidirectional GRU layer.

Preferably, the convolutional layer is a 1-dimensional convolutional network.

Preferably, the convolutional layer is a 1-dimensional convolutional network. Convolutional neural networks generally include 1-dimensional convolution, 2-dimensional convolution, and 3-dimensional convolution networks. The one-dimensional convolution network is mainly used for sequence data such as audio data, equipment maintenance sampling data and the like, the two-dimensional convolution is mainly used for image processing such as image classification, target recognition, image segmentation and the like, and the three-dimensional convolution network is mainly used for video processing such as medical image video, motion detection and the like. In this embodiment, the time series data is mainly analyzed, and a 1-dimensional convolution network result is adopted. A typical 1-dimensional convolutional network results are shown in fig. 3. This includes a one-dimensional convolution kernel vector with a filter size k of 4. Convolution factors (convolution factors) d is 1.

For the element s that currently needs to be convolved, the mathematical expression of the one-dimensional convolution operation is:

wherein f (i) represents a convolution kernel, X_s-d·iIndicating that sample values at interval d are taken forward.

To better show the details of this example, the experiment used an energy consumption data set for a building house in belgium. The data description is shown in table 1.

TABLE 1 data set data item meanings

The experimental parameters of the model in this example are shown in table 2:

table 2: algorithm variable parameter valuing

The experimental environment is as follows: the experimental background used in this example is: computer experiment environment: the experimental background used in this example is: the computer is mainly configured as follows: pentium (R) Dual-core 3.06CPU, 8G RAM memory.

And (3) effect evaluation: parameters employed herein for performance evaluation of algorithms include RMSE, MAE, MAPE, and CC:

RMSE (Root Mean Square Error) is calculated as:

MAE (Mean absolute Error) is calculated as:

MAPE (Mean absolute percent Error) was calculated as:

r2 (coeffient of Determination), determining the coefficient by the following calculation method:

it should be noted that RMSE, MAE and MAPE are measures of prediction errors, and the smaller the value is, the more accurate the value is, and the R2 parameter represents the coefficient for determining the number of two sequences, and the larger the value is, the more relevant the two sequence data is, the better the prediction effect is.

For the building energy consumption data set, the periodicity of each sequence data change is not strong, which indicates that the energy consumption problem does not show periodic changes along with days. Seasonally, during the data acquisition period of 4.5 months, as the weather becomes gradually hot (the data of T1-T9 are in an ascending trend), the air humidity gradually decreases, and the overall amplitude of energy consumption is reduced.

Experiments are performed on the data to predict the future energy consumption of the building, and the prediction accuracy is shown in table 3.

Table 3: predicted result values of different algorithms

As can be seen from table 3, the prediction accuracy of the method of the present invention is generally higher than that of the conventional machine learning model in the prediction calculation for building energy consumption. While the MCRNN model reduces the RMSE by 47.83%, 38.72%, 16.62%, 15.67%, 13.29%, 13.58%, 7.55%, 3.09% relative to the SVM, Random Forest, LSTM, GRU, Bi-LSTM, Bi-GRU, Bi-Conv-LSTM, Bi-Conv-GRU network models, from the RMSE index. The average accuracy is respectively improved by 37.81%, 70.38%, 32.50%, 16.09%, 25.59%, 27.43%, 4.93% and 4.39%, the prediction correlation is respectively improved by 83.09%, 69.38%, 8.12%, 11.47%, 8.80%, 8.73%, 5.22% and 2.08%, and the method of the invention has better performance than other network models in terms of average percentage error.

According to the invention content and the embodiment, the building energy consumption prediction model in the method mainly comprises a multi-scale convolutional layer, a bidirectional GRU layer and a full-connection layer, the multi-scale convolutional layer is introduced into a recurrent neural network, a plurality of improved attention units are connected in a series connection mode and a jump connection mode, attention mechanisms are distributed on different scales, therefore, the model can collect historical information from different scales, distinguish the influence of different input elements on a prediction result, the bidirectional GRU layer can more fully acquire the context information of sequence data on the basis of GRUs, the whole model adopts a convolution structure to fuse the recognition output of attention mechanisms of different scales, and the output is screened and identified by different scales through convolution connection, so that the scale information which is more important to the target can be reserved, and the better precision is obtained for predicting the energy consumption value of the building.

Claims

1. The building energy consumption prediction method based on the multi-scale convolution cyclic neural network is characterized by comprising the following steps of:

step 3, inputting the test set data into the trained building energy consumption prediction model, and calculating to obtain a prediction value of the building energy consumption;

the building energy consumption prediction model comprises a first convolution layer, a first bidirectional GRU layer, a first multi-scale convolution layer, a second bidirectional GRU layer, a second multi-scale convolution layer, a first full-connection layer and a second full-connection layer, wherein the layers are sequentially connected, the output of the first convolution layer and the output of the first bidirectional GRU layer are connected and then simultaneously used as the input of the first multi-scale convolution layer and the second multi-scale convolution layer, the bidirectional GRU layer is formed by connecting a forward GRU model and a backward GRU model in parallel to form a bidirectional structure, the bidirectional GRU layer outputs two combined GRU signals, the output layer of the first full-connection layer is 100, the output layer of the second full-connection layer is 1, the input of the first convolution in the building energy consumption prediction model is an influence factor data sequence, and the output of the second full-connection layer is a building energy consumption value.

2. The building energy consumption prediction method based on the multi-scale convolution cyclic neural network as claimed in claim 1, wherein the building energy consumption prediction model is

3. The building energy consumption prediction method based on the multi-scale convolution cyclic neural network as claimed in claim 1, wherein the analytical expression of the building energy consumption prediction model is as follows:

is the output of the first bi-directional GRU layer,

indicating the output to be GRU in forward direction

And backward GRU output

Carrying out merging connection;

is to multiply the first bidirectional GRU layer by a weight vector

And adding the offset vector

The result of (1);

will be provided with

And η¹(x_t) Output of (2)

Are combined into P_t ¹As input to the first multi-scale convolutional layer;

is the output of the second bidirectional GRU layer,

indicating the output of the GRU in forward direction therein

And backward GRU output

Carrying out merging connection;

is to multiply the second bidirectional GRU layer by a weight vector

And adding the offset vector

The result of (1);

by analogy, the expression is obtained

Then obtaining output O through full connection operation_t；

Wherein the content of the first and second substances,

and

all are obtained by learning and training.

4. The building energy consumption prediction method based on the multi-scale convolution cyclic neural network as claimed in claim 1 or 3, characterized in that the bidirectional GRU layer is a bidirectional structure formed by connecting two GRU models, namely a forward GRU and a backward GRU, and the first layer of the forward GRU is left to be output through a gate: f. of¹ _t＝σ(W¹ _f[H¹ _t-1,x”_t]+B¹ _f)，σ(x)＝1/(1+e^-x) In the forward GRU update gate, the first output is: z is a radical of¹ _t＝σ(W¹ _z[H¹ _t-1,x”_t]+B¹ _z) And the second output is:

intermediate output of forward GRU

And backward GRU intermediate output

Obtaining an output by an aggregation operation on the intermediate output

Indicating the output to be GRU in forward direction

And backward GRU output

Performing merged connection as output of bidirectional GRU layer

x”_tFor the input of the bidirectional GRU layer, [ W ]¹ _f,B¹ _f]，[W¹ _Z,B¹ _Z]，[W¹ _h,B¹ _h]For forward GRU model parameters, [ W ]² _f,B² _f],[W² _Z,B² _Z]，[W² _h,B² _h]For the inverse GRU model parameters, [ W ]¹² _o,B¹² _o]Are output layer parameters.

5. The method of claim 4, wherein the convolutional layer is a 1-dimensional convolutional network.

6. The method of claim 1, wherein the influence factor data comprises: the temperature and humidity of each room in the building, as well as the outside air pressure, outside humidity and outside wind speed.