CN113052214A - Heat exchange station ultra-short term heat load prediction method based on long and short term time series network - Google Patents

Heat exchange station ultra-short term heat load prediction method based on long and short term time series network Download PDF

Info

Publication number
CN113052214A
CN113052214A CN202110274414.3A CN202110274414A CN113052214A CN 113052214 A CN113052214 A CN 113052214A CN 202110274414 A CN202110274414 A CN 202110274414A CN 113052214 A CN113052214 A CN 113052214A
Authority
CN
China
Prior art keywords
data
model
term
short
layer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110274414.3A
Other languages
Chinese (zh)
Other versions
CN113052214B (en
Inventor
刘旭东
李硕
范青武
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Technology
Original Assignee
Beijing University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Technology filed Critical Beijing University of Technology
Priority to CN202110274414.3A priority Critical patent/CN113052214B/en
Publication of CN113052214A publication Critical patent/CN113052214A/en
Application granted granted Critical
Publication of CN113052214B publication Critical patent/CN113052214B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/243Classification techniques relating to the number of classes
    • G06F18/24323Tree-organised classifiers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Computing Systems (AREA)
  • Business, Economics & Management (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Human Resources & Organizations (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Economics (AREA)
  • Strategic Management (AREA)
  • Development Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

本发明公开了一种换热站超短期热负荷预测的方法。首先利用随机森林算法对特征进行筛选降维;然后对数据进行标准化处理;接着建立基于长短期时间序列网络的热负荷预测模型,模型通过卷积层和循环层捕捉长短期的特征信息,然后引入循环跳跃层这一概念,捕捉更长期的特征信息,同时利用自回归算法为模型添加线性处理能力,增强了模型的鲁棒性。该方法利用逐时负荷自身的周期特性来解决神经网络在处理长序列数据时信息丢失的问题,从而提高了模型预测的性能。

Figure 202110274414

The invention discloses a method for ultra-short-term heat load prediction of a heat exchange station. Firstly, the random forest algorithm is used to filter the features and reduce the dimension; then the data is standardized; then a heat load prediction model based on a long-term and short-term time series network is established. The concept of cyclic skip layer captures longer-term feature information, while using autoregressive algorithms to add linear processing capabilities to the model, enhancing the robustness of the model. The method uses the periodic characteristics of the hourly load itself to solve the problem of information loss when the neural network processes long sequence data, thereby improving the performance of model prediction.

Figure 202110274414

Description

Heat exchange station ultra-short term heat load prediction method based on long and short term time series network
Technical Field
The invention relates to the technical field of centralized heating, in particular to a method for predicting ultra-short-term heat load of a heat exchange station. The invention relates to a specific application of a data-driven method in the field of heat load prediction in a centralized heating process.
Background
With the continuous development of the economic society of China and the continuous improvement of the urbanization level, the centralized heat supply is gradually covered in cities and rural areas in the northern China. According to the disclosure of the national statistical bureau, the central heating area of cities in China reaches 87.80 hundred million square meters by 2018, and is increased by 5.67 percent compared with the area at the end of 2017. Fossil fuel consumed by centralized heat supply can cause serious environmental pollution and haze, and in order to realize energy conservation and environmental protection and avoid uneven heat supply, prediction of heat load becomes an important research problem. And a heat supply company is researched and researched to know that the heat supply area of the heat supply company is about 350 ten thousand square meters, and if the temperature is reduced by 0.5 ℃ in the heat supply process, nearly ten thousand yuan can be saved. Therefore, from the aspects of energy conservation, environmental protection and economic benefit, the heat load prediction has very important practical significance. The central heating system is a nonlinear large-scale system, which comprises a plurality of valves and pumps, and an accurate mathematical model is difficult to establish, so that the data-driven method is more suitable for the field of heat load prediction.
The method is mainly designed for the ultra-short-term heat load forecasting task of the heat exchange station. The heat exchange station is directly connected with a heat user, distributes and distributes heat, and the heat supply company directly regulates and controls the heat exchange station. In real life, the heat exchange station is close to the district heat user, and the lag period of heat supply is close to 1 hour. Therefore, the heat exchange station and the ultra-short period respectively serve as the object of research and have better practical significance in time dimension.
The traditional heat load prediction method mainly comprises gray prediction, time series prediction, regression and other methods, and with the continuous development of intelligent algorithms, a plurality of algorithms of machine learning and neural networks are applied to the field, such as: support Vector Regression (SVR), Recurrent Neural Network (RNN), Long-Short Term Memory (LSTM), and the like. However, the above methods are based on the time sequence characteristic of the thermal load, when the input sequence is long, the gradient disappears, which easily causes the loss of information, so that the correlation of information in a long period is lost, and the prediction accuracy needs to be improved.
Disclosure of Invention
In order to solve the problem that the Long-term information is easy to lose when the neural Network processes the Long-sequence information, the invention provides a Long Short-term Time-series Network (LSTNet) model to deal with the problem. Thermal loading is a typical time series problem, and time-wise thermal loading is periodic. The long-short time sequence network model provided by the invention utilizes the characteristic and introduces the idea of cyclic jump, thereby effectively solving the problem of information loss. Firstly, the model utilizes a Random Forest (RF) algorithm to screen and reduce dimensions of features; and then, a thermal load prediction model based on a long-term and short-term time sequence network is established, long-term and short-term characteristic information is captured through the convolution layer and the cycle layer, then the concept of a cycle jump layer is introduced, longer-term characteristic information is captured, and meanwhile, linear processing capacity is added to the model by utilizing an autoregressive algorithm, so that the robustness of the model is enhanced.
The invention adopts the following technical scheme and implementation steps:
s1, selecting meteorological data and heating data in a certain time period, and constructing a data set as an input variable Xn
S2, preprocessing the data, including identifying and correcting missing values and outliers, and standardizing the data;
s3, screening the input variables by using an RF method, and performing dimensionality reduction operation on the data set to obtain XmAnd the data set is divided into 8: 2, dividing the ratio into a training set and a testing set;
s4 inputs the training set into the LSTNet model item by item, the weights and biases of the training model:
s401, firstly, capturing short-term local characteristic information by using a convolutional layer;
s402, utilizing the circulation layer to capture the long-term macro information, and outputting ht R(ii) a Simultaneous cycle of skip floor benefit
The periodic characteristics of the sequence are used to capture longer-term information, the output is ht S
S403, connecting the outputs of the cycle layer and the cycle jump layer in a mode of full connection layer to obtain the output yt D
S404 thenLinear components are added for prediction by combining the output of the AR process, and meanwhile, the model can capture the scale change of the input, the robustness of the model is enhanced, and the output y is obtainedt A
The output module of S405 integrates the output of the neural network part and the output of the AR model to obtain a final prediction model.
S5, inputting the test set into the well-trained LSTNet model one by one to obtain a predicted value
Figure BDA0002975389790000021
Advantageous effects
Compared with the prior art, the method fully considers the periodic characteristic of the ultra-short-term thermal load, and makes up the problem of information loss of the conventional neural network caused by gradient descent by introducing the concept of the cycle jump layer. Different from the traditional neural network algorithm, the method fully considers the periodic characteristic of the time-by-time heat load, the characteristic is more representative, and the prediction task of the ultra-short-term heat load can be better completed.
Drawings
FIG. 1 is a diagram of a model architecture according to the present invention;
FIG. 2 is a graph of simulated heat load data presentation of the present invention;
FIG. 3 is a graph showing the predicted results of the LSTNet model of the present invention;
Detailed Description
The technical features and advantages of the present invention will become more apparent from the following detailed description of the embodiments of the present invention when taken in conjunction with the accompanying drawings.
S1, selecting as many related characteristic variable data as possible, wherein the related characteristic variable data may include meteorological data, operation condition data, heat load data and the like, so as to construct a heat load data set to obtain Xn={x1,x2,…,xnN is the number of characteristic variables;
s2, after the data set is constructed, preprocessing the data:
s201 compensates for the missing value, i.e. the value of 0 or null data, and calculates using the following formula:
xi=0.4xi-1+0.4xi+1+0.2xi+2 (1)
in the formula xiIs the current miss value, xi-1、xi+1And xi+2The values of the previous moment, the next moment and the next two moments are respectively;
s202 treats an outlier, that is, a value exceeding 3 times or more the predetermined range, as a missing value;
s203, standardizing each dimension input variable, wherein the adopted calculation formula is as follows:
Figure BDA0002975389790000031
in the formula yiIs a normalized value; x is the number ofiIs the original value;
Figure BDA0002975389790000032
and s represent the mean and variance of the raw data, respectively. The normalized data mean is 0, variance is 1, and there is no dimension.
S3, screening and dimension reduction are carried out on the feature variables by using an RF algorithm, the idea of evaluating feature importance by using a random forest is simple, and the method mainly comprises the steps of determining how much each feature contributes to each tree in the random forest, then averaging, and finally comparing the contribution sizes of different features. The importance of a certain feature x is denoted as IMP, and the calculation method is as follows:
s301, for each decision tree in the random forest, calculates its Out-Of-Bag data error, denoted as OOB error1, using the corresponding Out-Of-Bag data (Out Of Bag, OOB), and the calculation formula is as follows:
Figure BDA0002975389790000041
and taking the out-of-bag data as input, bringing the out-of-bag data into a random forest classifier, performing classification comparison on the O pieces of data by using the classifier, and counting the number of classification errors to be set as X.
S302, randomly adding noise interference to the characteristic x of all samples of the data outside the bag, and calculating the error of the data outside the bag again and recording the error as OOBERR 2;
s303, assuming there are N trees in the random forest, the importance IMP of the feature x is shown in formula 1:
Figure BDA0002975389790000042
after noise is randomly added to a certain feature, the accuracy rate outside the bag is greatly reduced, which indicates that the feature has a great influence on the classification result of the sample, that is, the feature has a high importance degree.
The invention utilizes random forest to sort the importance of the characteristic variables in a descending order, then determines the deletion ratio, and eliminates the unimportant indexes of the corresponding ratio from the current characteristic variables, thereby obtaining a new characteristic set, wherein the characteristic of the new characteristic set is Xm={x1,x2,…,xm}. Wherein m is<n, the deleting proportion is determined according to the number of the characteristic variables in the original data set. After dimensionality reduction of the dataset, the dataset is scaled by 8: the scale of 2 is divided into a training set and a test set.
S4, inputting training set data into the LSTNet model item by item according to a time sequence, wherein the weights and the bias of the training model are shown in the overall structure of the LSTNet model in FIG. 1:
s401, the first module of the network is a convolution layer, and the function of the convolution layer is to extract features and capture local short-term feature information. The convolutional layer module consists of a number of filters, where the width is ω, the height is m, and m is the same as the number of features. The output of the ith filter is then:
hi=ReLU(Wi*X+bi) (5)
in which h is outputiAs a vector, ReLU is an activation function, and ReLU (x) max (0, x). Is a convolution operation, WiAnd biRespectively weight matrix and bias.
S402 the convolutional layer module outputs the loop layer and the loop-jump layer simultaneously input to the second module. What is used by the loop layer is a gated loop Unit (GRU), in which ReLU is used as an activation function for implicit updates. Then the hidden state output h of the cell at time tt RComprises the following steps:
Figure BDA0002975389790000051
wherein z istAnd rtThe outputs of the update gate and reset gate in the GRU neuron respectively,
Figure BDA0002975389790000052
output for an intermediate state; σ is sigmoid activation function, xtAn input at this layer at time t, which is an elemental product; w, U and b are the weight matrix and offset, respectively, for each gate cell. The output of this layer is the hidden state at each time step.
The GRU network can capture long-term history information, but because the gradient disappears, all the previous information cannot be saved, so that the correlation of the longer-term information is lost. In the LSTNet model, the problem is solved by a jumping idea, which is based on periodic data, by the hyper-parameter of period p, obtaining very far time information. When the time t is predicted, the time data information of the previous period, the previous period and the earlier period can be predicted. Since this type of dependency is difficult to capture by the cyclic unit due to the long time of one cycle, introducing a cyclic network structure with hopping connections can extend the time span of the information flow to obtain longer-term data information. Its output h at time tt SComprises the following steps:
Figure BDA0002975389790000053
the input to this layer is the same as the recycle layer and is the output of the convolutional layer. Where p is the number of skipped hidden units, i.e. the period. The general period is easily determined, and according to engineering experience or data trend, if the data is non-periodic or the periodicity is dynamically changed, attention mechanism method can be cited to dynamically update the period p.
S403, connecting the two layers, and combining the outputs of the two layers by adopting a full-connection layer mode by the model. The output of this layer at time t is:
Figure BDA0002975389790000054
wherein WRAnd WSWeights assigned to the loop layer and the loop jump layer, respectively, and b is an offset value.
S404 in the actual data set, the input scale changes non-periodically, but the neural network is not sensitive to the scale changes of the input and output, so the prediction accuracy of the neural network model is significantly reduced by this problem. Therefore, in the model, in order to solve the deficiency, a linear part is added in the model, and a classical Autoregressive (AR) model is adopted to enhance the robustness of the model. The output y of the AR model at time tt AComprises the following steps:
Figure BDA0002975389790000061
wherein q isAIs the input window size on the input matrix.
The S405 output module integrates the output of the neural network part and the output of the AR model to obtain the final output of the LSTNet model
Figure BDA0002975389790000062
Comprises the following steps:
Figure BDA0002975389790000063
wherein
Figure BDA0002975389790000064
Is the final predicted value of the model at time t.
S406, in the model training process, a Mean Square Error (MSE) function is used as a loss function, and the formula is as follows:
Figure BDA0002975389790000065
where n is the number of valid data,
Figure BDA0002975389790000066
and yiRespectively predicted values and actual values tested.
S5, inputting the test set into the well-trained LSTNet model one by one to obtain a predicted value
Figure BDA0002975389790000067
To verify the effectiveness of the method, we used normal data from a heating season for verification. The data is obtained by simulating the 120-day heating process of the heat exchange station in one district of Zheng State in Henan by EnergyPlus software, and a data diagram is shown in FIG. 2. The comparison experiment is performed by using methods such as AR, Integrated Moving Average Autoregressive (ARIMA), MLR, SVR, GRU and the like, the experimental result is shown in FIG. 3, and the evaluation index result of each model is shown in Table 1.
TABLE 1 comparison of evaluation indexes for models of thermal load prediction
Model (model) RMSE(×103) MAE(×103) R-Squared
AR 40.815 27.213 76.724%
ARIMA 33.892 19.028 83.951%
MLR 31.631 20.857 86.020%
SVR 29.220 18.662 88.070%
GRU 24.994 17.249 91.268%
LSTNet 15.833 12.341 96.501%
From the above experimental results, it can be seen that the LSTNet model utilized herein predicts performance better than other models for time-wise thermal load prediction, closer to 1 on the R-square index than other models. Compared with a GRU model, the RMSE of the LSTNet model in the model is reduced by 36.7%, the MAE is reduced by 28.5%, and the model precision is obviously improved.

Claims (4)

1.一种基于长短期时间序列网络的换热站超短期负荷预测方法,其特征在于所述方法步骤为:1. an ultra-short-term load forecasting method for a heat exchange station based on a long-term and short-term time series network, characterized in that the method steps are: S1:选取一定时间段的气象数据和供热数据,构建数据集作为输入变量XnS1: Select meteorological data and heating data for a certain period of time, and construct a data set as input variable X n ; S2:对数据进行预处理,其中包括对缺失值、离群值的识别与修正,并对数据进行标准化处理;S2: Preprocess the data, including identifying and correcting missing values and outliers, and standardizing the data; S3:采用RF方法对输入变量进行筛选,对数据集进行降维操作得到Xm,并将数据集以8:2的比例分为训练集和测试集;S3: Use the RF method to filter the input variables, perform dimensionality reduction operation on the data set to obtain X m , and divide the data set into a training set and a test set in a ratio of 8:2; S4:将训练集逐条输入到LSTNet模型中,训练模型的权重和偏置,得到训练好的网络模型;S4: Input the training set into the LSTNet model one by one, train the weights and biases of the model, and obtain the trained network model; S5:将测试集逐条输入到训练好的LSTNet模型中,得到预测值
Figure FDA0002975389780000014
S5: Input the test set into the trained LSTNet model one by one to get the predicted value
Figure FDA0002975389780000014
2.根据权利要求1所述基于长短期时间序列网络的换热站超短期负荷预测方法,其特征在于,所述步骤S2进行了数据的预处理,其步骤为:2. The ultra-short-term load forecasting method for a heat exchange station based on a long- and short-term time series network according to claim 1, wherein the step S2 has carried out data preprocessing, and the steps are: S201:针对缺失值,可采用下式来计算:S201: For missing values, the following formula can be used to calculate: xi=0.4xi-1+0.4xi+1+0.2xi+2 (1)x i =0.4x i-1 +0.4x i+1 +0.2x i+2 (1) 式中xi为当前缺失值,xi-1、xi+1和xi+2分别为上一时刻、下一时刻、下二时刻的值;where x i is the current missing value, x i-1 , x i+1 and x i+2 are the values of the previous moment, the next moment, and the next two moments respectively; S202:针对离群值,即超过规定范围3倍以上的值,将该值作为缺失值处理;S202: For outliers, that is, values that exceed the specified range by more than 3 times, treat the values as missing values; S203:对每一维数据进行标准化,标准化的公式为:S203: Standardize each dimension of data, and the standardization formula is:
Figure FDA0002975389780000011
Figure FDA0002975389780000011
式中yi为标准化后的值;xi为原始值;
Figure FDA0002975389780000012
和s分别代表原始数据的均值和方差;标准化后的数据均值为0,方差为1,且无量纲。
where y i is the standardized value; xi is the original value;
Figure FDA0002975389780000012
and s represent the mean and variance of the original data, respectively; the standardized data has a mean of 0, a variance of 1, and is dimensionless.
3.根据权利要求1所述基于长短期时间序列网络的换热站超短期负荷预测方法,其特征在于,所述步骤S3降维操作的方法包括:3. The ultra-short-term load prediction method for a heat exchange station based on a long-term and short-term time series network according to claim 1, wherein the method for dimensionality reduction operation in step S3 comprises: S301:计算随机森林中每一棵决策树的袋外数据误差;S301: Calculate the out-of-bag data error of each decision tree in the random forest; 使用相应的袋外数据(Out Of Bag,OOB)来计算它的袋外数据误差,记为OOBError1,其计算公式如下所示:Use the corresponding out-of-bag data (Out Of Bag, OOB) to calculate its out-of-bag data error, denoted as OOBError1, and its calculation formula is as follows:
Figure FDA0002975389780000013
Figure FDA0002975389780000013
其中O为袋外数据总数,以袋外数据作为输入,带入随机森林分类器,利用分类器给这O条数据进行分类比较,统计分类错误的数目,设为X;Among them, O is the total number of out-of-bag data, take out-of-bag data as input, bring it into the random forest classifier, use the classifier to classify and compare the O data, count the number of classification errors, and set it as X; S302:对特征x加入噪声干扰,再次计算随机森林中每一棵决策树的袋外数据误差;S302: Add noise interference to the feature x, and calculate the out-of-bag data error of each decision tree in the random forest again; S303:计算每个特征的重要性IMP,其计算公式为:S303: Calculate the importance IMP of each feature, and its calculation formula is:
Figure FDA0002975389780000021
Figure FDA0002975389780000021
式中OOBError1和OOBError2分别为加入噪声前后的袋外误差;N为随机森林中决策树的总数。where OOBError1 and OOBError2 are the out-of-bag errors before and after adding noise, respectively; N is the total number of decision trees in the random forest.
4.根据权利要求1所述基于长短期时间序列网络的换热站超短期负荷预测方法,其特征在于,所述步骤S4中所提出的LSTNet模型,该模型的具体方法包括:4. The ultra-short-term load prediction method for a heat exchange station based on a long-term and short-term time series network according to claim 1, wherein the LSTNet model proposed in the step S4, the specific method of the model comprises: S401:网络的第一模块为卷积层,该层由多个滤波器组成,第i个滤波器的输出公式为:S401: The first module of the network is a convolution layer, which consists of multiple filters, and the output formula of the ith filter is: hi=ReLU(Wi*X+bi) (5)h i =ReLU(W i *X+ bi ) (5) 其中输出的hi为向量,ReLU是激活函数,并且ReLU(x)=max(0,x)*。为卷积运算,Wi和bi分别为权重矩阵与偏置;where the output hi is a vector, ReLU is the activation function, and ReLU(x)=max(0,x)*. is the convolution operation, and Wi and bi are the weight matrix and bias, respectively; S402:第二模块为循环层和循环跳跃层,作用是获得长期和较长期的特征信息,循环层和循环跳跃层在t时刻单元的隐藏状态输出
Figure FDA0002975389780000025
Figure FDA0002975389780000026
的公式分别为:
S402: The second module is a cyclic layer and a cyclic skip layer, whose functions are to obtain long-term and longer-term feature information, and output the hidden state of the unit at time t in the cyclic layer and the cyclic skip layer
Figure FDA0002975389780000025
and
Figure FDA0002975389780000026
The formulas are:
Figure FDA0002975389780000022
Figure FDA0002975389780000022
Figure FDA0002975389780000023
Figure FDA0002975389780000023
其中zt和rt分别为GRU神经元中更新门和重置门的输出,
Figure FDA0002975389780000024
为中间的状态输出;σ为sigmoid激活函数,xt是t时刻在该层的输入,⊙为元素乘积;p为跳过隐藏单元的数量,即为周期;W、U和b分别是各门单元的权重矩阵和偏置;
where z t and r t are the outputs of the update gate and the reset gate in the GRU neuron, respectively,
Figure FDA0002975389780000024
is the intermediate state output; σ is the sigmoid activation function, x t is the input at the layer at time t, ⊙ is the element product; p is the number of skipped hidden units, that is, the period; W, U and b are the gates respectively the weight matrix and bias of the cell;
S403:为连接上面两层,模型采用全连接层的方式来组合两个层的输出;该层在t时刻的输出公式为:S403: In order to connect the upper two layers, the model adopts a fully connected layer to combine the outputs of the two layers; the output formula of this layer at time t is:
Figure FDA0002975389780000031
Figure FDA0002975389780000031
其中WR和WS分别为循环层和循环跳跃层分配的权重,b为偏置值;where WR and W S are the weights assigned by the recurrent layer and the recurrent skip layer, respectively, and b is the bias value; S404:为捕捉输入尺度的变化,在模型中添加了AR过程,该过程在t时刻的输出
Figure FDA0002975389780000032
为:
S404: In order to capture the change of the input scale, an AR process is added to the model, and the output of the process at time t
Figure FDA0002975389780000032
for:
Figure FDA0002975389780000033
Figure FDA0002975389780000033
其中qA为输入矩阵上的输入窗口大小;where q A is the input window size on the input matrix; S405:将神经网络部分的输出和AR模型的输出整合,得到了模型的最终预测输出
Figure FDA0002975389780000034
为:
S405: Integrate the output of the neural network part with the output of the AR model to obtain the final prediction output of the model
Figure FDA0002975389780000034
for:
Figure FDA0002975389780000035
Figure FDA0002975389780000035
其中
Figure FDA0002975389780000036
为模型在t时刻处的最终预测值;
in
Figure FDA0002975389780000036
is the final predicted value of the model at time t;
S406:在模型训练过程中,采用均方误差(Mean Square Error,MSE)函数作为损失函数,其公式为:S406: In the model training process, the Mean Square Error (MSE) function is used as the loss function, and the formula is:
Figure FDA0002975389780000037
Figure FDA0002975389780000037
式中n为有效数据数目,
Figure FDA0002975389780000038
和yi分别为预测的值与测试的真实值。
where n is the number of valid data,
Figure FDA0002975389780000038
and y i are the predicted value and the tested true value, respectively.
CN202110274414.3A 2021-03-14 2021-03-14 Heat exchange station ultra-short-term heat load prediction method based on long-short-term time sequence network Active CN113052214B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110274414.3A CN113052214B (en) 2021-03-14 2021-03-14 Heat exchange station ultra-short-term heat load prediction method based on long-short-term time sequence network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110274414.3A CN113052214B (en) 2021-03-14 2021-03-14 Heat exchange station ultra-short-term heat load prediction method based on long-short-term time sequence network

Publications (2)

Publication Number Publication Date
CN113052214A true CN113052214A (en) 2021-06-29
CN113052214B CN113052214B (en) 2024-05-28

Family

ID=76512106

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110274414.3A Active CN113052214B (en) 2021-03-14 2021-03-14 Heat exchange station ultra-short-term heat load prediction method based on long-short-term time sequence network

Country Status (1)

Country Link
CN (1) CN113052214B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113256036A (en) * 2021-07-13 2021-08-13 国网浙江省电力有限公司 Power supply cost analysis and prediction method based on Prophet-LSTNet combined model
CN113821344A (en) * 2021-09-18 2021-12-21 中山大学 A method and system for cluster load prediction based on machine learning
CN114266343A (en) * 2021-12-29 2022-04-01 北京百度网讯科技有限公司 Training method of information determination model, and method and device for determining environmental information
CN114912169A (en) * 2022-04-24 2022-08-16 浙江英集动力科技有限公司 Industrial building heat supply autonomous optimization regulation and control method based on multi-source information fusion
CN115860270A (en) * 2023-02-21 2023-03-28 保定博堃元信息科技有限公司 Network supply load prediction system and method based on LSTM neural network
CN118074112A (en) * 2024-02-21 2024-05-24 北京智芯微电子科技有限公司 Photovoltaic power prediction method based on similar day and long-short period time sequence network
CN118114091A (en) * 2024-01-18 2024-05-31 广东电网有限责任公司江门供电局 An adaptive high-voltage circuit breaker diagnostic system
CN118332496A (en) * 2024-04-22 2024-07-12 内蒙古自治区气候中心 Abnormal snowfall prediction method in winter based on meteorological data and statistical model

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110414747A (en) * 2019-08-08 2019-11-05 东北大学秦皇岛分校 A spatio-temporal long-term and short-term urban pedestrian flow prediction method based on deep learning
CN110610232A (en) * 2019-09-11 2019-12-24 南通大学 A long-term and short-term traffic flow forecasting model construction method based on deep learning
CN110619430A (en) * 2019-09-03 2019-12-27 大连理工大学 Space-time attention mechanism method for traffic prediction
CN111275169A (en) * 2020-01-17 2020-06-12 北京石油化工学院 A method for short-term building heat load forecasting
CN111309577A (en) * 2020-02-19 2020-06-19 北京工业大学 Spark-oriented batch processing application execution time prediction model construction method
AU2020101854A4 (en) * 2020-08-17 2020-09-24 China Communications Construction Co., Ltd. A method for predicting concrete durability based on data mining and artificial intelligence algorithm

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110414747A (en) * 2019-08-08 2019-11-05 东北大学秦皇岛分校 A spatio-temporal long-term and short-term urban pedestrian flow prediction method based on deep learning
CN110619430A (en) * 2019-09-03 2019-12-27 大连理工大学 Space-time attention mechanism method for traffic prediction
CN110610232A (en) * 2019-09-11 2019-12-24 南通大学 A long-term and short-term traffic flow forecasting model construction method based on deep learning
CN111275169A (en) * 2020-01-17 2020-06-12 北京石油化工学院 A method for short-term building heat load forecasting
CN111309577A (en) * 2020-02-19 2020-06-19 北京工业大学 Spark-oriented batch processing application execution time prediction model construction method
AU2020101854A4 (en) * 2020-08-17 2020-09-24 China Communications Construction Co., Ltd. A method for predicting concrete durability based on data mining and artificial intelligence algorithm

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
庄家懿;杨国华;郑豪丰;王煜东;胡瑞琨;丁旭;: "并行多模型融合的混合神经网络超短期负荷预测", 电力建设, no. 10, 1 October 2020 (2020-10-01) *
荀港益;: "基于聚类分析与随机森林的短期负荷滚动预测", 智能城市, no. 09, 14 May 2018 (2018-05-14) *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113256036A (en) * 2021-07-13 2021-08-13 国网浙江省电力有限公司 Power supply cost analysis and prediction method based on Prophet-LSTNet combined model
CN113256036B (en) * 2021-07-13 2021-10-12 国网浙江省电力有限公司 Analysis and Prediction Method of Power Supply Cost Based on Prophet-LSTNet Combination Model
CN113821344A (en) * 2021-09-18 2021-12-21 中山大学 A method and system for cluster load prediction based on machine learning
CN113821344B (en) * 2021-09-18 2024-04-05 中山大学 Cluster load prediction method and system based on machine learning
CN114266343A (en) * 2021-12-29 2022-04-01 北京百度网讯科技有限公司 Training method of information determination model, and method and device for determining environmental information
CN114912169A (en) * 2022-04-24 2022-08-16 浙江英集动力科技有限公司 Industrial building heat supply autonomous optimization regulation and control method based on multi-source information fusion
CN114912169B (en) * 2022-04-24 2024-05-31 浙江英集动力科技有限公司 Industrial building heat supply autonomous optimization regulation and control method based on multisource information fusion
CN115860270A (en) * 2023-02-21 2023-03-28 保定博堃元信息科技有限公司 Network supply load prediction system and method based on LSTM neural network
CN118114091A (en) * 2024-01-18 2024-05-31 广东电网有限责任公司江门供电局 An adaptive high-voltage circuit breaker diagnostic system
CN118074112A (en) * 2024-02-21 2024-05-24 北京智芯微电子科技有限公司 Photovoltaic power prediction method based on similar day and long-short period time sequence network
CN118332496A (en) * 2024-04-22 2024-07-12 内蒙古自治区气候中心 Abnormal snowfall prediction method in winter based on meteorological data and statistical model
CN118332496B (en) * 2024-04-22 2024-12-20 内蒙古自治区气候中心 Winter abnormal snowfall prediction method based on meteorological data and statistical model

Also Published As

Publication number Publication date
CN113052214B (en) 2024-05-28

Similar Documents

Publication Publication Date Title
CN113052214A (en) Heat exchange station ultra-short term heat load prediction method based on long and short term time series network
CN109754113B (en) Load prediction method based on dynamic time warping and long-and-short time memory
Zheng et al. An accurate GRU-based power time-series prediction approach with selective state updating and stochastic optimization
CN112990556A (en) User power consumption prediction method based on Prophet-LSTM model
CN112116147A (en) River water temperature prediction method based on LSTM deep learning
CN113554466B (en) Short-term electricity consumption prediction model construction method, prediction method and device
Tian Backtracking search optimization algorithm-based least square support vector machine and its applications
CN111985719B (en) Power load prediction method based on improved long-term and short-term memory network
CN109931678A (en) Air-conditioning fault diagnosis method based on deep learning LSTM
CN108280551A (en) A kind of photovoltaic power generation power prediction method using shot and long term memory network
CN108009674A (en) Air PM2.5 concentration prediction methods based on CNN and LSTM fused neural networks
CN112989711B (en) Aureomycin fermentation process soft measurement modeling method based on semi-supervised ensemble learning
CN116542382A (en) Sewage treatment dissolved oxygen concentration prediction method based on mixed optimization algorithm
CN111027772A (en) Multi-factor short-term load forecasting method based on PCA-DBILSTM
CN111814956B (en) A multi-task learning air quality prediction method based on multi-dimensional quadratic feature extraction
CN114282443A (en) Remaining service life prediction method based on MLP-LSTM supervised joint model
CN113537469B (en) Urban water demand prediction method based on LSTM network and Attention mechanism
CN117277279A (en) A deep learning short-term load forecasting method based on particle swarm optimization
CN113516304A (en) Spatiotemporal joint prediction method and device for regional pollutants based on spatiotemporal graph network
CN113868954B (en) Air conditioner load prediction method and system
CN116169670A (en) A short-term non-resident load forecasting method and system based on improved neural network
CN118970924A (en) A short-term load forecasting method and system based on LSTM-DNN hybrid neural network
Jang et al. Comparative Analysis of Deep Learning Techniques for Load Forecasting in Power Systems Using Single‐Layer and Hybrid Models
CN114254828B (en) A power load forecasting method based on hybrid convolutional feature extractor and GRU
CN110533109A (en) A kind of storage spraying production monitoring data and characteristic analysis method and its device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant