CN113962456A

CN113962456A - Medium-and-long-term load prediction method considering industry relevance

Info

Publication number: CN113962456A
Application number: CN202111215938.1A
Authority: CN
Inventors: 姜宇轩; 李昆明; 马吉科; 王江辉; 曹卫青; 张鸿鸣; 张昆明; 章天晗; 林之岸; 杨莉; 林振智
Original assignee: Zhejiang University ZJU; Jiangsu Fangtian Power Technology Co Ltd
Current assignee: Zhejiang University ZJU; Jiangsu Fangtian Power Technology Co Ltd
Priority date: 2021-10-19
Filing date: 2021-10-19
Publication date: 2022-01-21
Anticipated expiration: 2041-10-19
Also published as: CN113962456B

Abstract

The invention discloses a medium and long term load prediction method considering industry relevance. The method comprises the following steps: the method comprises the steps of firstly, preprocessing industrial load data aiming at the problems of industrial load data loss and large data magnitude difference, providing an industrial monthly load data restoration method based on iterative interpolation, restoring monthly load data of a typical industry, and carrying out normalization processing on the load data by adopting a min-max standardization method. On the basis, the industry relevance is analyzed and quantified based on the Shapiro-Wilk test and the Pearson correlation coefficient, and the strongly relevant industries are screened according to the relevance indexes. And finally, on the basis of the industry load data, the air temperature data and the industry relevance, predicting the Long-Term load in the industry by using a Long Short Term Memory (LSTM) neural network. The method can realize high-precision medium and long term load prediction by introducing industry relevance analysis.

Description

Medium-and-long-term load prediction method considering industry relevance

Technical Field

The invention relates to the technical field of power systems, in particular to a medium-long term load prediction method considering industry relevance.

Background

With the rapid development of electric power industry in China, the scale of an electric power system is increasingly large, a smart power grid is developed vigorously, and electric power loads are influenced by more and more uncertain factors, such as climate, GDP, industrial structures, policies and the like, which provide greater challenges for load prediction of medium-and long-term electric power systems. In addition to these objective factors, an industry located in an industry chain must be affected by the upstream and downstream industries, so the influence of the association between industries on the load prediction of the industry cannot be ignored. The comprehensive consideration of the multiple influence factors to improve the load prediction precision becomes a hot topic in the current power system research field.

The medium and long-term load prediction result has important significance for resource scheduling and planning of the power system. For example, when a power plant is newly built, a construction site, a power generation scale, a construction period, and the like may be selected based on the result of the prediction of the medium-and long-term load in the area. The high-precision load prediction has important value on the operation planning of the power grid, and the development of big data and artificial intelligence technology also provides certain technical support for the high-precision load prediction. Currently, many researches are based on big data and deep learning technology, and the influences of various natural factors and social factors on load prediction are analyzed, so that various load prediction models are proposed. However, most of the research on load prediction of middle and long-term industries is mainly based on multivariate data such as load data, meteorological data, regional data and economic data, and the influence of the relevance among the industries on the load prediction is not considered, so that the precision of the load prediction of the industries is often not high.

Therefore, the influence of the industry relevance on the load prediction is considered, and a prediction method for improving the medium-long term load prediction precision is yet to be researched.

Disclosure of Invention

The invention provides a medium-long term load prediction method considering the industry relevance in order to overcome the defects in the prior art, and the influence of the industry relevance on load prediction can be considered, so that the accuracy of the industry load prediction is improved.

The invention adopts the following technical scheme:

a medium-long term load prediction method considering industry relevance comprises the following steps:

aiming at the problems of industry load data loss and large data magnitude difference, an industry monthly load data restoration method based on iterative interpolation is provided, monthly load loss data are restored, and load data are normalized by adopting a min-max standardization method;

analyzing and quantifying the association between industries based on Shapiro-Wilk test and Pearson correlation coefficient, and screening out strong correlation industries according to the association index (namely Pearson correlation coefficient);

and on the basis of the industry load data, the air temperature data and the industry relevance, predicting the long-term load and the medium-term load of the industry by using the long-term and short-term memory neural network.

In the above technical solution, further, for the problem of industry load data loss and a large difference between data magnitudes, an industry monthly load data restoration method based on iterative interpolation is proposed to restore monthly load data loss, and a min-max standardization method is adopted to normalize the load data, including the following steps:

because the original load data usually has the problems of data loss, large magnitude difference and the like, the original load data needs to be repaired and preprocessed. The method comprises the following steps of collecting middle and long term load data of the industry at intervals of months, and forming a data input matrix which can be expressed as follows:

in the formula, y_ijThe month load value of the j month of the i industry; m is the total number of industries; n is the total number of months of data acquisition; t is_jIndicating the meteorological parameters for month j.

Aiming at the problem of load data missing,

and (2) completing the data by adopting an iterative interpolation method, namely, by means of multiple iterations, taking the load data of the industry where the missing data is located as a dependent variable y, taking the rest of related characteristic data as an independent variable x, performing fitting modeling and interpolation by using a linear regression, performing multiple iteration cycles, and outputting complete load data, wherein the calculation of the y value is shown as the following formula:

y＝a₁x₁+a₂x₂+...a_Mx_M

the magnitude of the load may vary greatly from industry to industry, which may lead to increased difficulty in model training. The gradient descent can be accelerated by adopting normalization, so that the convergence speed of the model is improved. Meanwhile, the normalized data is limited to be between 0 and 1, so that the evaluation standard can be unified, and the calculation speed can be improved. And (3) carrying out normalization processing on the load data by adopting a min-max standardization method:

in the formula, x_i' represents the load data of month i after min-max normalization; x is the number of_iRepresenting the original load data of the ith month; x is the number of_minRepresenting the minimum of elements in the original load data; x is the number of_maxRepresenting the maximum value of the elements in the original load data.

Further, the association between industries is analyzed and quantified based on Shapiro-Wilk test and Pearson correlation coefficient, and strong related industries are screened out according to the association index, and the steps are as follows:

the correlation coefficient is used to evaluate the strength and direction of the linear relationship between two random variables. And analyzing the correlation among random variables by adopting a Pearson correlation coefficient to meet the condition of normal distribution of sample data, and performing normality test on load data of each industry based on a Shapiro-Wilk normal test method. Suppose H₀: the sample data obeys normal distribution; h₁: the sample data does not follow a normal distribution. And (3) judging the similarity degree of the distribution of the sample to be detected and the normal distribution by calculating the statistic W:

wherein W is the statistic returned by Shapiro-Wilk test, and the closer the value of W is to 1, the closer the data is to the standard normal distribution; x is the number of_iRepresenting the ith data in the sample;

represents an average value of sample data; a is_iThe coefficients for the Shapiro-Wilk test can be obtained by looking up the table.

Based on the statistic W, a significance level alpha is set, and then the quantile W thereof is obtained_α. If W is less than W_αThen H is rejected₀Otherwise, accept H₀. The Shapiro-Wilk test also calculates the P value of the data samples, which is expressed in the original hypothesis H₀The probability of observing the sample and more extreme cases occurring under the true premise. If the P value is less than the significance level alpha (statistically, typically 0.05), H is rejected₀。

The Pearson correlation coefficient r can be used for analyzing the relevance between the industry X and the industry Y, and for the industry X to be predicted and the related industry Y, the Pearson correlation coefficient calculation formula is as follows:

in the formula, r represents a correlation coefficient between industry X and industry Y, and the value of r is between-1 and + 1. X_iAnd Y_iRespectively represent the electricity load data of the ith month of the industry X and the industry Y,

and

the monthly electrical load means of industry X and Y are shown respectively.

The value range of the correlation coefficient r is between-1 and +1, namely | r | < 1. The closer | r | is to 1, the higher the degree of linear correlation of X with Y. r ═ -1, indicating a completely negative linear correlation between X and Y; r ═ 1, indicating a completely positive linear correlation between X and Y; r is 0, indicating that there is no linear correlation between the two.

Further, on the basis of industry load data, air temperature data and industry relevance, a long-term and short-term memory neural network is used for predicting the industry medium-term and long-term loads, and the method comprises the following steps:

the LSTM Neural Network is a modified Recurrent Neural Network (RNN). Compared with RNN, LSTM has the greatest advantages of solving the long-term dependence problem and enabling the neural network to have the characteristic information of the hidden load of the node with earlier memory time, so that the LSTM has better performance when processing a long-term sequence and is suitable for medium-term and long-term load prediction. The core concept of long-short term memory neural networks is the state memory unit and the "gate" structure. The state memory unit forms a path for transmitting the load prediction information, so that the information can be transmitted in a time sequence; the structure of the gate determines the degree of memory of the load characteristic information in the transmission process. The long-short term memory neural network realizes the information transmission through three basic 'gate' structures, namely a forgetting gate, an input gate and an output gate, and the transmission flow and the updating steps of the industry load characteristic information in the LSTM unit are divided into the following 4 steps:

(1) forgetting door

The role of the forgetting gate in the LSTM is to determine the degree of feature forgetting of the state memory cell. The forgetting gate predicts the load predicted value h from the previous moment_t-1And current input feature x_tSimultaneously transmitting a sigmoid function and outputting f of the sigmoid function_tAs a forgetting factor of the state memory cell. f. of_tCan be expressed as:

f_t＝σ(W_f[h_t-1,x_t]+b_f)

in the formula, W_fOutput h for forgetting the moment before the door_t-1And current input feature x_tA multiplied weight matrix; b_fBiasing the item for the forgetting gate; sigma represents a sigmoid function; h is_t-1A load prediction output representing a previous time; x is the number of_tAnd the load prediction characteristic factors representing the current input comprise industry relevance characteristics, air temperature characteristics and the like.

(2) Input gate

The input gate functions to generate the state memory cell intermediate value. First, the input gate outputs the predicted output value h of the previous time_t-1And newly input x_tThe common input sigmoid function outputs i_tAs input node state g_tThe update coefficient of (2). Input node state g_tFrom h_t-1And x_tAnd (4) jointly determining.

Sigmoid function output i of input gate_tThe expression is as follows:

i_t＝σ(W_i[h_t-1,x_t]+b_i)

in the formula, W_iPassing h of sigmoid function into input gate_t-1And input feature x_tA multiplied weight matrix; b_iBiasing the term for the input gate sigmoid function.

Input node state g_tThe expression is as follows:

g_t＝tanh(W_g[h_t-1,x_t]+b_g)

in the formula, W_gTransmitting h of tanh function to input gate_t-1And input feature x_tA multiplied weight matrix; b_gThe term is biased for the input gate tanh function.

(3) Refresh state memory cell

The third step of the LSTM workflow is to update the state memory, i.e., C in the map_t. Firstly, the state of the state memory unit at the previous time and the forgetting gate are output_tMultiplying and outputting the selective memory result. Then, the value is added point by point with the intermediate value of the state memory cell to update the state memory cell C_t，C_tThe expression is as follows:

in the formula (I), the compound is shown in the specification,

is the Hadamard product.

(4) Output gate

The output gate is used for determining the load predicted value h at the current moment_tFirstly, predict the load h of the previous time_t-1And input feature x_tTransmitting sigmoid function into the output of o_t. Secondly, will get moreNew state memory cell C_tPasses to tanh function, and outputs with o_tMultiplying to obtain the predicted value h of the load at the current moment_t. Wherein o is_tThe expression of (a) is as follows:

o_t＝σ(W_o[h_t-1,x_t]+b_o)

in the formula, W_oIs h_t-1And input feature x_tA multiplied weight matrix; b_oThe gate bias term is output.

Load predicted value h at current moment_tThe expression is as follows:

。

and training the model by taking the target industry load data, the strongly-related industry load data and the air temperature data as a data set based on the LSTM neural network model. To evaluate the accuracy of the prediction model, the following evaluation indexes were employed:

in the formula, y_kFor real industry load data, y_k' load data predicted for model, R₁As mean absolute percentage error, R₂Is the root mean square error.

And when the evaluation index meets the requirement, completing model training, and if the evaluation index does not meet the requirement, modifying the LSTM model parameter and continuing training.

The invention has the beneficial effects that:

the invention adopts an industry monthly load data restoration method based on iterative interpolation to realize the industry monthly load missing data restoration, the restoration precision can reach more than 95 percent, and the problem of data missing in the existing engineering is effectively solved; the method for quantifying the industry relevance based on the Pearson correlation coefficient is provided, strong relevant industries can be screened out according to quantification results, a load prediction model is built by combining the industry relevance with an LSTM neural network and taking the industry relevance as characteristics, the middle and long term load of the industry is predicted, the precision of the middle and long term load prediction is effectively improved, and the error is kept within 4%.

Drawings

FIG. 1 is a schematic overall flow diagram of the present invention.

Fig. 2 is a schematic diagram of an LSTM network element of the present invention.

FIG. 3 is a graph of the prediction result of the LSTM-based load prediction of the agricultural and sideline food processing industry.

FIG. 4 is a graph of the results of the LSTM-based "food manufacturing" load prediction of the present invention.

FIG. 5 is a graph showing the results of the LSTM-based "alcoholic beverage and purified tea manufacturing" load prediction according to the present invention.

Detailed Description

For better understanding of the objects, technical solutions and effects of the present invention, the present invention will be further explained with reference to the accompanying drawings.

The invention provides a medium-long term load prediction method considering industry relevance, which comprises the following detailed steps:

step 1, an industry monthly load data restoration method based on iterative interpolation is provided, monthly load missing data of a typical industry are restored, and load data are normalized by adopting a min-max standardization method.

Aiming at the problem of load data missing,

completing the data by adopting an iterative interpolation method, namely, performing multiple iterations, regarding the load data of the industry where the missing data is located as a dependent variable y, regarding the rest of related characteristic data as an independent variable x, performing fitting modeling and interpolating the missing y value by using a linear regression, performing multiple iteration cycles, and outputting complete load data; the y value is calculated as shown in the following equation:

y＝a₁x₁+a₂x₂+...+a_Mx_M

And 2, analyzing and quantifying the association between industries based on Shapiro-Wilk test and Pearson correlation coefficient, and screening out strongly-related industries according to association indexes.

The correlation coefficient is used to evaluate the strength and direction of the linear relationship between two random variables. Analyzing the correlation among random variables by adopting a Pearson correlation coefficient, wherein the correlation needs to meet the condition of normal distribution of sample data, and correcting the load data of each industry based on a Shapiro-Wilk normal test methodAnd (5) checking attitude. Suppose H₀: the sample data obeys normal distribution; h₁: the sample data does not follow a normal distribution. And (3) judging the similarity degree of the distribution of the sample to be detected and the normal distribution by calculating the statistic W:

and

the monthly electrical load means of industry X and Y are shown respectively.

Based on the theory of the correlation coefficient, an industry relevance evaluation index is formulated as shown in table 1.

TABLE 1 industry Association evaluation index

Pearson correlation coefficient range \| r-	Industry relevance evaluation
		0.8～1.0	Very strong correlation
0.6～0.8	Strong correlation
		0.4～0.6	Moderate correlation
0.2～0.4	Weak correlation
		0～0.2	Very weak correlation

And evaluating the industry correlation according to the calculated Pearson correlation coefficient, and screening out the extremely strong related industries.

And 3, predicting the middle-long term load of the industry by using the long-short term memory neural network on the basis of the industry load data, the air temperature data and the industry relevance.

The LSTM Neural Network is a modified Recurrent Neural Network (RNN). Compared with RNN, LSTM has the greatest advantages of solving the long-term dependence problem and enabling the neural network to have the characteristic information of the hidden load of the node with earlier memory time, so that the LSTM has better performance when processing a long-term sequence and is suitable for medium-term and long-term load prediction. The core concept of long-short term memory neural networks is the state memory unit and the "gate" structure. The state memory unit forms a path for transmitting the load prediction information, so that the information can be transmitted in a time sequence; the structure of the gate determines the degree of memory of the load characteristic information in the transmission process. The long-short term memory neural network realizes the information transmission through three basic 'gate' structures, namely a forgetting gate, an input gate and an output gate, as shown in fig. 2. The transmission process and the updating step of the industry load characteristic information in the LSTM unit are divided into the following 4 steps:

(1) forgetting door

f_t＝σ(W_f[h_t-1,x_t]+b_f)

in the formula, W_fOutput h for forgetting the moment before the door_t-1And current input feature x_tA multiplied weight matrix; b_fBiasing the item for the forgetting gate; sigma represents a sigmoid function; h is_t-1A load prediction output representing a previous time; x is the number of_tLoad prediction characteristic factors representing current inputs, including industry association characteristics, gasTemperature characteristics, etc.

(2) Input gate

Sigmoid function output i of input gate_tThe expression is as follows:

i_t＝σ(W_i[h_t-1,x_t]+b_i)

Tan h function output g of input gate_tThe expression is as follows:

g_t＝tanh(W_g[h_t-1,x_t]+b_g)

(3) Refresh state memory cell

in the formula (I), the compound is shown in the specification,

is the Hadamard product.

(4) Output gate

The output gate is used for determining the load predicted value h at the current moment_tThe output gate firstly predicts the load predicted value h at the previous moment_t-1And input feature x_tTransmitting sigmoid function into the output of o_t. Next, the output gate will refresh the state memory cell C_tPasses to tanh function, and outputs with o_tMultiplying to obtain the predicted value h of the load at the current moment_t. Wherein o is_tThe expression of (a) is as follows:

o_t＝σ(W_o[h_t-1,x_t]+b_o)

Load predicted value h at current moment_tThe expression is as follows:

in the formula, C_tIs an updated state memory cell.

in the formula, y_kFor real industry load data, y_k' load data predicted for model, R₁As mean absolute percentage error, R₂To all areSquare root error.

In order to verify the effectiveness and accuracy of the proposed middle-long term load prediction method considering the industry relevance, example simulation is carried out on 2016-year-month 2020 load data of a certain city, wherein 2 major industries and more than 30 related industries are included.

Taking the load data of food manufacturing industry as an example, randomly selecting the load data of 2 months as a data missing point to carry out iterative interpolation, comparing the interpolated load data with the real load data and carrying out error analysis, wherein the result is shown in table 2.

TABLE 2 comparison of interpolated data with actual data

	Data missing Point 1	Data missing Point 2
			Interpolating data	16566.41	10648.95
Real data	15944.91	10277.74
			Relative error	3.90％	3.61％

As can be seen from the above table, the result of the iterative interpolation matches the real situation well.

Next, taking the food-related industry as an example, the industry load correlation analysis was performed for "agri-sideline food processing industry", "food manufacturing industry", "wine and beverage and purified tea manufacturing industry", and "tobacco manufacturing industry". The above industry load data was first subjected to the Shapiro-Wilk test, and the results are shown in Table 3.

TABLE 3 Normal distribution test results for industry load data

From Table 3, it can be seen that "agricultural and sideline food processing industry", "food manufacturing industry", "wine and beverage and refined tea manufacturing industry" obeyed normal distribution and could be analyzed by Pearson's correlation coefficient method.

And after verifying that the industry load data meet the normal distribution condition, calculating a correlation coefficient matrix between industries by using a Pearson correlation coefficient method. Correlation analysis is performed in a period of 12 months, 4 correlation coefficients between every two industries in 2016-2019 years are finally obtained, and an average value is taken as a final correlation coefficient matrix element, as shown in table 4.

TABLE 4 industry load data correlation coefficient matrix

Industry	Agricultural and sideline food processing industry	Food manufacturing industry	Production of alcoholic beverages and refined tea
				Agricultural and sideline food processing industry	1	0.962799	0.916886
Food manufacturing industry	0.962799	1	0.792809
				Production of alcoholic beverages and refined tea	0.916886	0.792809	1

Based on the LSTM neural network prediction method considering the industry relevance, the industry load data is divided into a training set and a testing set, and the model is trained to obtain the industry load prediction curves shown in figures 3 to 5 and the industry load prediction error analysis table shown in table 5.

According to the table 5, the error of the load prediction result of the food related industry is small after the industry relevance is considered, and the prediction result well meets the precision requirement.

TABLE 5 industry load prediction error analysis

Industry	R₁	R₂
			Agricultural and sideline food processing industry	3.963％	839.881
Food manufacturing industry	1.166％	232.081
			Production of alcoholic beverages and refined tea	0.987％	81.246

In order to evaluate the value of the industry relevance for the long-term load prediction in the industry, the load prediction method without considering the industry relevance is compared with the method on the basis of the load data of the agricultural and sideline food processing industry, the food manufacturing industry and the wine and refined tea manufacturing industry, and the comparison prediction result is shown in table 6.

TABLE 6 comparison of forecasted results without trade associations

As can be seen from Table 6, the prediction evaluation indexes considering the industry relevance are all superior to those under the condition that the industry relevance is not considered, and the prediction precision of the long-term load in the industry is effectively improved by considering the industry relevance.

In conclusion, the method realizes monthly load missing data recovery in typical industries, the recovery precision reaches more than 96%, the problem of data missing in actual services is solved, meanwhile, the medium-long term load prediction error is kept within 5%, and the medium-long term load prediction precision is effectively improved.

Claims

1. A medium-long term load prediction method considering industry relevance is characterized by comprising the following steps:

(1) restoring monthly load missing data by an industry monthly load data restoration method based on iterative interpolation, and performing normalization processing on load data by adopting a min-max standardization method;

(2) analyzing and quantifying the association between industries based on Shapiro-Wilk test and Pearson correlation coefficient, and screening out strong related industries according to association indexes;

(3) and on the basis of the industry load data, the air temperature data and the industry relevance, predicting the long-term load and the medium-term load of the industry by using the long-term and short-term memory neural network.

2. The method for predicting the medium-long term load considering the industry relevance as claimed in claim 1, wherein: the step (1) is specifically as follows:

the method comprises the following steps of collecting middle and long term load data of the industry at intervals of months, and forming a data input matrix which can be expressed as follows:

in the formula, y_ijThe month load value of the j month of the i industry; m is the total number of industries; n is the total number of months of data acquisition; t is_jWeather parameters representing month j;

aiming at the problem of load data loss, an iterative interpolation method is adopted to complement data, namely, through multiple iterations, the load data of the industry where the loss data is located is regarded as a dependent variable y, the rest relevant characteristic data is regarded as an independent variable x, a linear regression device is used for carrying out fitting modeling and interpolation on the lost y value, multiple iteration cycles are carried out, and complete load data are output; the y value is calculated as shown in the following equation:

y＝a₁x₁+a₂x₂+...+a_Mx_M

and (3) carrying out normalization processing on the load data by adopting a min-max standardization method:

3. The method for predicting the medium-long term load considering the industry relevance as claimed in claim 1, wherein: the step (2) is specifically as follows:

analyzing the correlation among random variables by adopting a Pearson correlation coefficient, wherein the correlation needs to meet the condition of normal distribution of sample data, and carrying out normality test on load data of each industry based on Shapiro-Wilk test; suppose H₀: the sample data obeys normal distribution; h₁: sample data does not follow normal distribution; and (3) judging the similarity degree of the distribution of the sample to be detected and the normal distribution by calculating the statistic W:

represents an average value of sample data; a is_iThe coefficients for the Shapiro-Wilk test can be obtained by table lookup;

based on the statistic W, a significance level alpha is set, and then the quantile W thereof is obtained_α(ii) a If it isW＜W_αThen H is rejected₀Otherwise, accept H₀(ii) a And calculating the P value of the data sample using the Shapiro-Wilk test, and rejecting H if the P value is less than a significance level alpha₀；

The Pearson correlation coefficient r is used for analyzing the relevance between the industry X and the industry Y, and for the industry X to be predicted and the related industry Y, the Pearson correlation coefficient calculation formula is as follows:

wherein r represents a correlation coefficient between industry X and industry Y; x_iAnd Y_iRespectively represent the electricity load data of the ith month of the industry X and the industry Y,

and

respectively representing monthly electric load mean values of industries X and Y;

the value range of the correlation coefficient r is between-1 and +1, namely | r | is less than or equal to 1; the closer the | r | is to 1, the higher the linear correlation degree of X and Y is; r ═ -1, indicating a completely negative linear correlation between X and Y; r ═ 1, indicating a completely positive linear correlation between X and Y; r is 0, indicating that there is no linear correlation between the two.

4. The method for predicting the medium-long term load considering the industry relevance as claimed in claim 1, wherein: the step (3) is specifically as follows:

(1) predicting the load h from the previous moment_t-1And current input feature x_tSimultaneously transmitting a sigmoid function and outputting f of the sigmoid function_tAs a forgetting coefficient of the state memory unit; f. of_tCan be expressed as:

f_t＝σ(W_f[h_t-1,x_t]+b_f)

in the formula, W_fOutput h for forgetting the moment before the door_t-1And current input feature x_tA multiplied weight matrix; b_fBiasing the item for the forgetting gate; sigma represents a sigmoid function; h is_t-1Indicating a predicted load value at a previous time; x is the number of_tRepresenting the currently input load prediction characteristic factors, including industry relevance characteristics and air temperature characteristics;

(2) the predicted output value h of the previous moment is output_t-1And newly input x_tThe common input sigmoid function outputs i_tAs input node state g_tThe update coefficient of (2);

output i_tThe expression is as follows:

i_t＝σ(W_i[h_t-1,x_t]+b_i)

in the formula, W_iPassing h of sigmoid function into input gate_t-1And input feature x_tA multiplied weight matrix; b_iBiasing the term for the sigmoid function of the input gate;

input node state g_tThe expression is as follows:

g_t＝tanh(W_g[h_t-1,x_t]+b_g)

in the formula, W_gTransmitting h of tanh function to input gate_t-1And input feature x_tA multiplied weight matrix; b_gBiasing the term for the input gate tanh function;

(3) refresh state memory cell C_t: firstly, the state of the state memory unit at the previous time and the forgetting gate are output_tMultiplying and outputting a selective memory result; then, the selective memory result is added to the intermediate value of the state memory cell point by point to update the state memory cell C_t，C_tThe expression is as follows:

in the formula (I), the compound is shown in the specification,

is a Hadamard product;

(4) determining the load predicted value h at the current moment_tFirstly, predict the load h of the previous time_t-1And input feature x_tTransmitting sigmoid function into the output of o_t(ii) a Next, the updated state memory cell C_tPasses to tanh function, and outputs with o_tMultiplying to obtain the predicted value h of the load at the current moment_t(ii) a Wherein o is_tThe expression of (a) is as follows:

o_t＝σ(W_o[h_t-1,x_t]+b_o)

in the formula, W_oFor the load prediction value h of the previous moment_t-1And input feature x_tA multiplied weight matrix; b_oAn output gate bias term;

load predicted value h at current moment_tThe expression is as follows: