CN111985719A - Power load prediction method based on improved long-term and short-term memory network - Google Patents

Power load prediction method based on improved long-term and short-term memory network Download PDF

Info

Publication number
CN111985719A
CN111985719A CN202010878240.7A CN202010878240A CN111985719A CN 111985719 A CN111985719 A CN 111985719A CN 202010878240 A CN202010878240 A CN 202010878240A CN 111985719 A CN111985719 A CN 111985719A
Authority
CN
China
Prior art keywords
power load
load
memory network
improved long
term memory
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010878240.7A
Other languages
Chinese (zh)
Other versions
CN111985719B (en
Inventor
覃晖
裴少乾
卢桂源
吕昊
谢伟
曲昱华
付佳龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huazhong University of Science and Technology
Original Assignee
Huazhong University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huazhong University of Science and Technology filed Critical Huazhong University of Science and Technology
Priority to CN202010878240.7A priority Critical patent/CN111985719B/en
Publication of CN111985719A publication Critical patent/CN111985719A/en
Application granted granted Critical
Publication of CN111985719B publication Critical patent/CN111985719B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/06Electricity, gas or water supply
    • HELECTRICITY
    • H02GENERATION; CONVERSION OR DISTRIBUTION OF ELECTRIC POWER
    • H02JCIRCUIT ARRANGEMENTS OR SYSTEMS FOR SUPPLYING OR DISTRIBUTING ELECTRIC POWER; SYSTEMS FOR STORING ELECTRIC ENERGY
    • H02J3/00Circuit arrangements for ac mains or ac distribution networks
    • H02J3/003Load forecast, e.g. methods or systems for forecasting future load demand
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y04INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
    • Y04SSYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
    • Y04S10/00Systems supporting electrical power generation, transmission or distribution
    • Y04S10/50Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications

Abstract

The invention discloses a power load prediction method based on an improved long-short term memory network, which adopts the characteristic that a maximum information coefficient initially screens a historical load, and further selects the historical load by adopting a maximum correlation minimum redundancy algorithm in combination with the consideration of the influence brought by a load correlation factor, takes the screened characteristic set as the input of a model, adopts the improved long-short term memory network to predict the power load, verifies the obtained prediction result with the actual power grid load, and proves the practicability of the model. The forecasting method (H-ILSTM) accurately considers the power load and relevant factors influencing the load, effectively improves the precision of power load forecasting, and improves the safety and the economy of power grid operation to a certain extent.

Description

Power load prediction method based on improved long-term and short-term memory network
Technical Field
The invention relates to the technical field of power load prediction, in particular to a power load prediction method based on an improved long-term and short-term memory network.
Background
The power load prediction plays a key role in the operation of the power grid, and the safety and the economy of the operation of the power grid can be greatly improved by obtaining accurate short-term load. In addition, the method has important significance for optimal combination, economic dispatching, optimal power flow and electric power market transaction of the unit. The higher the accuracy of the power load prediction, the better the utilization rate of the power generation equipment and the better the economic dispatching effect. However, the electrical load is sensitive to external factors, such as climate change, date type and social activity, etc. These uncertainties increase the randomness of the payload sequence. Therefore, how to identify the strong relevant factors of the extracted load from the many influencing factors and realize accurate prediction of the short-term load is a problem to be solved at present.
In the deep learning method, a long-short term memory network (LSTM) is composed of a recurrent neural network of complex units, and has good capability of representing sequence information. The method has good results in solving the time series prediction problem. However, the input of LSTM is typically human-defined and is physically less meaningful and less interpretable for the model. The method has the advantages that the relation between the relevant factors influencing the power load is analyzed by adopting the characteristic engineering, so that the important significance is realized on improving the prediction precision of the power load, and the physical significance and the explanatory property of the model are enhanced. Therefore, how to accurately analyze the relationship between the power load and the related influence factors and combine the relationship into a prediction model to improve the accuracy of power load prediction is a theoretical and practical engineering problem which needs to be solved urgently.
Feature engineering is the selection of a representative subset of features in a feature set. These features are highly correlated with output variables and are the most common method of extracting valid features. Common feature selection methods include auto-correlation (AC), Mutual Information (MI), relieff (rf), feature selection (CFS) based on correlation, and the like.
In the present invention, the maximum information factor is used to preliminarily screen the characteristics of the historical power load. And performing secondary screening on the characteristics by combining a maximum correlation minimum redundancy method with a prediction model.
In summary, the problems of the prior art are as follows:
(1) there are many factors that affect the power load and these factors cannot be reasonably considered in the load prediction problem.
(2) Most of the existing feature choices related to power load prediction are to consider the influence of a certain feature on the power load, and the effect of the combination of the features is not considered.
(3) Because there are many factors in the input of the prediction model, the traditional machine learning method cannot predict the power load with high precision. The traditional LSTM can achieve better prediction effect, but the structure is complicated, the training time is long, and inconvenience is brought to practical application.
The difficulty of solving the technical problems is as follows: how to properly handle these continuous and discrete features is one of the technical difficulties in order to consider the influence of the correlation factors on the power load. The influence between the factors is complicated, a single factor may not greatly improve the prediction, but the prediction precision can be greatly improved after the factors are combined, and how to screen the combined factors and input the combined factors into a model for training is the second difficulty of the technology; in addition, how to change the structure of the LSTM is difficult to improve the prediction accuracy and reduce the required training time in solving the load prediction problem.
After the technical problem is solved, the significance is brought as follows: in order to consider the mutual influence between factors, the invention provides a two-stage hybrid feature extraction method and a novel network (ILSTM) applied to power load prediction. The network structure is different from the traditional LSTM, and the correct derivation of the forward propagation and backward propagation formulas of the network is the difficulty of realizing the network.
Most of the existing power load prediction performed by adopting a deep learning method is to predict the power load in the next time period. The method can accurately predict the power load of a plurality of future time intervals, and has practical application significance. Therefore, the method is very beneficial to popularization of H-ILSTM.
Disclosure of Invention
The present invention is directed to solving the above-mentioned problems, and an object of the present invention is to provide a power load prediction method based on an improved long and short term memory network, which can accurately analyze the influence between related features, screen out an optimal feature set, and perform prediction by using a novel network, so as to obtain a high-precision power load prediction result.
The invention realizes the purpose through the following technical scheme:
the invention comprises the following steps:
(1) collecting data of the power load and relevant factors thereof, and carrying out primary screening according to the influence of the historical power load on the future;
the method comprises the steps of collecting historical power load sequences, analyzing the relation between loads in 168 time intervals (one time interval is 1 hour) in seven days in the history to predicted loads required in the same day by adopting a Maximum Information Coefficient (MIC), forming a preselected set by using historical loads with the calculated MIC larger than 0.6, and forming the rest loads into a candidate set.
(2) Performing primary processing on the power load related factor;
factors relevant to the power load are collected, wherein the factors comprise a continuous characteristic and a discrete characteristic, and the discrete characteristic needs to be processed in a certain mode to be applied by the prediction model. The invention adopts a LabelEncoder coding mode to code the date type and the time type. Wherein the discrete features include a date category and a time category. The date types are divided into workdays, holidays and legal holidays, and the legal holidays can be classified into 7 types because the legal holidays in China have seven days at most. In the time category, three periods are divided into one domain according to the power load characteristics.
(3) Performing secondary screening on the historical power load by combining the power load related factor and the prediction model;
the relationship between the loads in the candidate subset and the load currently to be predicted is analyzed. And analyzing the characteristics of the candidate subset by adopting a maximum correlation minimum redundancy method. The following formula:
Figure BDA0002653297510000031
in the formula, Sm-1Represents a preselected set, X-Sm-1Representing a candidate set, c is a category variable, m is the number of features, xjIs the jth feature. I (x)jAnd c) represents the mutual information number between the jth characteristic and the category variable.
This method finds features that maximize the value of the above formula in the remaining feature space based on the selected features. And placing the features with the maximum calculated coefficient lambda into a preselected feature subset, normalizing all the features by combining other related factors influencing the power load, and then placing the normalized features into a prediction model for prediction. The root mean square error RMSE is used as the discrimination factor and if the RMSE is reduced, the process continues until the RMSE becomes higher than before the addition of the feature.
(4) Putting the screened final feature set into an improved long and short memory network ILSTM for prediction, and firstly setting parameters of the NLSTM, including the number of input layer nodes, the number of hidden layer nodes, the number of output layer nodes, learning efficiency, batch size and training round number;
(5) the individual weight parameters of the ILSTM are trained on a training set using an adaptive moment estimation (Adam) optimizer in conjunction with a mini-batch mechanism. (ii) a
(6) And inputting the test set into the trained ILSTM for prediction to obtain a power load prediction result. Further, in step (5), the step and the calculation formula of the information forward propagation of the improved long and short memory network ILSTM in the t-th time period are as follows:
Figure BDA0002653297510000041
ut=σ(netut) (3)
Figure BDA0002653297510000042
Figure BDA0002653297510000043
Figure BDA0002653297510000044
Figure BDA0002653297510000045
ot=σ(netot) (8)
ht=ot*tanh(ct) (9)
zt=Wy·ht+by (10)
yt=σ(zt) (11)
in the above formula, netut,
Figure BDA0002653297510000051
netotAnd ztIs the state of the current stage of the t period; wu,
Figure BDA0002653297510000052
And WoTheir weight matrices, respectively; bu,
Figure BDA0002653297510000053
And boRespectively represent their deviation vectors; u. oft,
Figure BDA0002653297510000054
Ct,ot,htAnd ytRespectively an updating gate, an information state, a cell state, an output gate, a hidden layer output and a predicted value in a time period t; tanh and σ are the tan h and sigmoid activation functions, respectively; the symbols sum represent matrix multiplication and multiplication between matrix elements, respectively.
Further, in the step (5), the step of information back propagation in the t-th time period and the calculation formula are as follows:
a. defining the most common square error function as the target to be optimized
Figure BDA0002653297510000055
b. Calculating errors of output layers
Figure BDA0002653297510000056
Figure BDA0002653297510000057
Figure BDA0002653297510000058
Figure BDA0002653297510000059
c. Calculating errors of hidden layers
Figure BDA00026532975100000510
Figure BDA00026532975100000511
Figure BDA00026532975100000512
Figure BDA00026532975100000513
Figure BDA00026532975100000514
Figure BDA0002653297510000061
Figure BDA0002653297510000062
Figure BDA0002653297510000063
Figure BDA0002653297510000064
Figure BDA0002653297510000065
Figure BDA0002653297510000066
W=W-η·W (28)
d. Using Adam optimization algorithm with [ w ]h,wx,b]And [ wy,by]To update [ wh,wx,b]And [ wy,by](ii) a To better illustrate the update process, the weight is denoted by the symbol W, the gradient of the weight is denoted by W, and the general formula for Adam updating the weight is:
mt=β1·mt-1+(1-β1)·Wt (29)
vt=β2·vt-1+(1-β2)·(Wt)2 (30)
Figure BDA0002653297510000067
Figure BDA0002653297510000068
Figure BDA0002653297510000069
wherein EtAs an error function, ytAnd YtRespectively, predicted values and observed values. m istAnd
Figure BDA00026532975100000610
is the deviation of the first moment estimated and corrected, vtAnd
Figure BDA00026532975100000611
is the deviation of the biased second moment estimate and the corrected second moment. Beta is a12And Adam's parameters, default to 0.9,0.999 and 10, respectively-8(ii) a Eta represents learning efficiency;
according to the formula, the predicted value is calculated by forward propagation, then the weight is updated by backward propagation, each round of training set is trained by taking a batch with a certain size, and each batch is updated once.
(7) And inputting the test set into the trained ILSTM for prediction to obtain a power load prediction result.
The invention has the beneficial effects that:
the invention relates to a power load prediction method based on an improved long-short term memory network, which is characterized in that related factors influencing the power load are analyzed through feature engineering, a two-stage feature extraction method is adopted to screen features, and the screening method comprises a filtering type and a wrapping type, so that a feature set of an adaptive model can be better obtained. And the ILSTM is adopted for power load prediction, compared with the traditional LSTM, on one hand, the time required by model training is saved, and on the other hand, the prediction precision of the model is improved. The prediction model provided by the invention can accurately predict the future power load, has higher practical application significance and is beneficial to popularization.
Drawings
FIG. 1 is a flow chart of a method for predicting power load of an improved long-short term memory network according to an embodiment of the present invention;
FIG. 2 is a diagram of an ILSTM network architecture provided by an embodiment of the present invention;
FIG. 3 is a diagram illustrating a causal relationship of wind speed in a Xinjiang Fuzi station case and an equivalent tree thereof according to an embodiment of the present invention;
FIG. 4 is a comparative graph of wind speed prediction results of the Xinjiang Fuchun station case provided by the embodiment of the invention.
Detailed Description
The invention will be further described with reference to the accompanying drawings in which:
fig. 1 is a general flowchart of the method for predicting the power load of the improved long-short term memory network of the present invention, which specifically includes the following steps:
(1) collecting data of the power load and relevant factors thereof, and carrying out primary screening according to the influence of the historical power load on the future;
the method comprises the steps of collecting historical power load sequences, analyzing the relation between loads in 168 time intervals (one time interval is 1 hour) in seven days in the history to predicted loads required in the same day by adopting a Maximum Information Coefficient (MIC), forming a preselected set by using historical loads with the calculated MIC larger than 0.6, and forming the rest loads into a candidate set.
(2) Performing primary processing on the power load related factor;
factors relevant to the power load are collected, wherein the factors comprise a continuous characteristic and a discrete characteristic, and the discrete characteristic needs to be processed in a certain mode to be applied by the prediction model. The invention adopts a LabelEncoder coding mode to code the date type and the time type. Wherein the discrete features include a date category and a time category. The date types are divided into working days, rest days and legal holidays, and the specific coding mode is shown in table 1 because the legal holidays in China have seven days at most:
TABLE 1 date type code table
Working day Saturday wine Sunday day Holiday of one day Two-day holiday
1 2 3 4 5
Three day holiday Four day holiday Holiday of five days Six day holiday Seven day holiday
6 7 8 9 10
In the time category, three time intervals are divided into a domain according to the characteristics of the power load, and the specific coding mode is as shown in table 2:
TABLE 2 time type coding scheme
00:00-03:00 03:00-06:00 06:00-09:00 09:00-12:00
1 2 3 4
12:00-15:00 15:00-18:00 18:00-21:00 21:00-24:00
5 6 7 8
(3) Performing secondary screening on the historical power load by combining the power load related factor and the prediction model;
the relationship between the loads in the candidate subset and the load currently to be predicted is analyzed. And analyzing the characteristics of the candidate subset by adopting a maximum correlation minimum redundancy method. The following formula:
Figure BDA0002653297510000081
in the formula, Sm-1Represents a preselected set, X-Sm-1Representing a candidate set, c is a category variable, m is the number of features, xjIs the jth feature. I (x)jAnd c) represents the mutual information number between the jth characteristic and the category variable.
This method finds features that maximize the value of the above formula in the remaining feature space based on the selected features. And placing the features with the maximum calculated coefficient lambda into a preselected feature subset, normalizing all the features by combining other related factors influencing the power load, and then placing the normalized features into a prediction model for prediction. The root mean square error RMSE is used as the discrimination factor and if the RMSE is reduced, the process continues until the RMSE becomes higher than before the addition of the feature.
(4) Putting the screened final feature set into an improved long and short memory network ILSTM for prediction, and firstly setting parameters of the NLSTM, including the number of input layer nodes, the number of hidden layer nodes, the number of output layer nodes, learning efficiency, batch size, training round number and initializing weight;
(5) the individual weight parameters of the ILSTM are trained on a training set using an adaptive moment estimation (Adam) optimizer in conjunction with a mini-batch mechanism.
The step and the calculation formula of the information forward propagation in the t-th time period are as follows:
Figure BDA0002653297510000091
ut=σ(netut) (3)
Figure BDA0002653297510000092
Figure BDA0002653297510000093
Figure BDA0002653297510000094
Figure BDA0002653297510000095
ot=σ(netot) (8)
ht=ot*tanh(ct) (9)
zt=Wy·ht+by (10)
yt=σ(zt) (11)
in the above formula, netut,
Figure BDA0002653297510000096
netotAnd ztIs the state of the current stage of the t period; wu,
Figure BDA0002653297510000097
And WoTheir weight matrices, respectively; bu,
Figure BDA0002653297510000098
And boRespectively represent their deviation vectors; u. oft,
Figure BDA0002653297510000099
Ct,ot,htAnd ytRespectively an updating gate, an information state, a cell state, an output gate, a hidden layer output and a predicted value in a time period t; tanh and σ are the tan h and sigmoid activation functions, respectively; the symbols sum represent matrix multiplication and multiplication between matrix elements, respectively.
The step and the calculation formula of information back propagation in the t-th time period are as follows:
a. defining the most common square error function as the target to be optimized
Figure BDA0002653297510000101
b. Calculating errors of output layers
Figure BDA0002653297510000102
Figure BDA0002653297510000103
Figure BDA0002653297510000104
Figure BDA0002653297510000105
c. Calculating errors of hidden layers
Figure BDA0002653297510000106
Figure BDA0002653297510000107
Figure BDA0002653297510000108
Figure BDA0002653297510000109
Figure BDA00026532975100001010
Figure BDA00026532975100001011
Figure BDA00026532975100001012
Figure BDA0002653297510000111
Figure BDA0002653297510000112
Figure BDA0002653297510000113
Figure BDA0002653297510000114
W=W-η·W (28)
d. Using Adam optimization algorithm with [ w ]h,wx,b]And [ wy,by]To update [ wh,wx,b]And [ wy,by](ii) a To better illustrate the update process, the weight is denoted by the symbol W, the gradient of the weight is denoted by W, and the general formula for Adam updating the weight is:
mt=β1·mt-1+(1-β1)·Wt (29)
vt=β2·vt-1+(1-β2)·(Wt)2 (30)
Figure BDA0002653297510000115
Figure BDA0002653297510000116
Figure BDA0002653297510000117
wherein EtAs an error function, ytAnd YtRespectively, predicted values and observed values. m istAnd
Figure BDA0002653297510000118
is the deviation of the first moment estimated and corrected, vtAnd
Figure BDA0002653297510000119
is the deviation of the biased second moment estimate and the corrected second moment. Beta is a12And Adam's parameters, default to 0.9,0.999 and 10, respectively-8(ii) a Eta represents learning efficiency;
according to the formula, the predicted value is calculated by forward propagation, then the weight is updated by backward propagation, each round of training set is trained by taking a batch with a certain size, and each batch is updated once.
(6) And inputting the test set into the trained ILSTM for prediction to obtain a power load prediction result.
Fig. 2 is a diagram showing the structure of the ILSTM network.
Fig. 3 is a radar distribution diagram showing the maximum information coefficient of the power load for the first 7 days (168 periods).
The use of the present invention is further described below in conjunction with specific experiments.
The method carries out the prediction of the power load in two periods of the future by taking the power load of Wuhan in Huazhong as an object, and adopts meteorological data of two months from 22 days 3 and 22 days 2015 to 23 days 5 and 2015. The data time step is 1 hour, 1488 time intervals are totally, the first 1191 time intervals are divided into training sets, and the last 297 time intervals are divided into testing sets. Relevant factors affecting the electrical load include temperature, humidity, dew point, date category and time category. The Maximum Information Coefficient (MIC) for the load of the first 7 days of calculation is shown in fig. 3. And divides the load into a preselected set and a candidate set according to the MIC. And further screening the load by adopting a maximum correlation minimum redundancy method, and combining the model to obtain a final characteristic set. And synthesizing the feature set and the forecast factor set into a training set to train the model, and finally inputting the test set into the trained ILSTM to predict to obtain a power load prediction result.
To verify the predictive performance of the ILSTM, the following eight models were constructed to predict average wind speed and compared:
H-ILSTM: the method adopts ILSTM, and the feature selection adopts the feature extraction method of the invention;
ILSTM: the method adopts ILSTM, and the feature selection only adopts a single maximum information coefficient method for analysis;
③ H-LSTM: the method adopts the traditional LSTM, and the feature extraction method is adopted for feature selection;
fourthly, LSTM: the method adopts the traditional LSTM, and the characteristic selection is only analyzed by a single maximum information coefficient method;
H-GRU: the method adopts GRU, and the feature extraction method is adopted for feature selection;
sixthly, GRU: the method adopts GRU, and the characteristic selection is only analyzed by a single maximum information coefficient method;
and (c) H-SVR: the method adopts SVR, and the feature extraction method is adopted for feature selection;
and (v) SVR: the method adopts SVR, and the feature selection only adopts a single maximum information coefficient method for analysis;
to avoid the effect of randomness, each model was averaged 10 times. Table 3 lists the evaluation indices for the eight model predictions. The evaluation indexes adopt Root Mean Square Error (RMSE), average absolute error percentage (MAPE) and average absolute error (MAE), and the smaller the values of the indexes are, the higher the prediction precision is. As can be seen from Table 3, the prediction accuracy of H-ILSTM is higher than that of ILSTM, which shows that the feature extraction method of the present invention improves the prediction accuracy. The prediction precision of H-ILSTM is higher than that of H-LSTM, and The Time (TT) required by training is short, which shows that the ILSTM of the method is better than the standard LSTM. The difference in prediction accuracy for the 8 models can be seen more clearly in fig. 4.
TABLE 3 eight model prediction index Table
Figure BDA0002653297510000131
The foregoing shows and describes the general principles and features of the present invention, together with the advantages thereof. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are described in the specification and illustrated only to illustrate the principle of the present invention, but that various changes and modifications may be made therein without departing from the spirit and scope of the present invention, which fall within the scope of the invention as claimed. The scope of the invention is defined by the appended claims and equivalents thereof.

Claims (8)

1. A power load prediction method based on an improved long-short term memory network is characterized by comprising the following steps:
step one, preliminarily screening the characteristics of historical loads according to the historical power loads as the input of a model;
secondly, considering relevant factors influencing the power load, and performing secondary screening on the historical load by adopting a maximum relevant minimum redundancy method;
and step three, taking the screened final feature set as the input of the model, and predicting by adopting an improved long-short memory network to obtain a final prediction result.
2. The method for predicting the power load based on the improved long-short term memory network as claimed in claim 1, wherein: in the first step, the historical power load is analyzed by adopting the maximum information coefficient MIC, and the historical load with the maximum information coefficient larger than 0.6 is used as a preselected set [ x [)1,x2,...,xk]Remaining unselected historical loads as an alternative [ c ]1,c2,...,ck]。
3. The method for predicting the power load based on the improved long-short term memory network as claimed in claim 1, wherein: in step two, collecting data of relevant factors influencing the power load comprises temperature [ TI1,TI2,...,TIk]Humidity [ HI ]1,HI2,...,HIk]Dew point [ DI1,DI2,...,DIk]Date category, time period category; the date type and the time section type are discrete characteristics and are coded; that is, discrete data is converted into a number between 1 and n, where n is the number of different values of a list, and can be considered as the number of all different values of a certain feature; and further analyzing the relevant factors influencing the power load by adopting a feature extraction method to obtain a feature set.
4. The improved long-short term memory network based power load prediction method as claimed in claim 3, wherein: the characteristic extraction method adopts a wrapping type characteristic selection method to carry out secondary screening on the historical load; analyzing the historical load under a preselected set and the historical load under an alternative set by adopting a maximum correlation minimum redundancy method to obtain the characteristic of the maximum coefficient, putting the characteristic into the preselected set, and combining with other correlation factors influencing the load to form a primary training set TR (x)tr,Y]And a test set TE ═ x consisting of predictor factors onlyte]And carrying out normalization processing on the data; and training the preliminary training set by adopting an improved long and short memory network, comparing the obtained prediction result with the test set, and judging whether to continue adding the features to the preselected set or not by adopting the root mean square error as a threshold value.
5. The improved long-short term memory network based power load prediction method as claimed in claim 4, wherein: and when the root mean square error RMSE is adopted for judgment, the RMSE obtained at the previous time of adding the features is taken as a record, the RMSE obtained after adding the features is compared with the previous time, and if the RMSE becomes larger after adding the features, the addition of the features is stopped.
6. The method for predicting the power load based on the improved long-short term memory network as claimed in claim 1, wherein: in the third step, the step and the calculation formula of the information forward propagation of the improved long and short memory network ILSTM in the t-th time period are as follows:
Figure FDA0002653297500000021
ut=σ(netut) (2)
Figure FDA0002653297500000022
Figure FDA0002653297500000023
Figure FDA0002653297500000024
Figure FDA0002653297500000025
ot=σ(netot) (7)
ht=ot*tanh(ct) (8)
zt=Wy·ht+by (9)
yt=σ(zt) (10)
in the above formula, netut,netct,netotAnd ztIs the state of the current stage of the t period; wu,
Figure FDA0002653297500000026
And WoTheir weight matrices, respectively; bu,
Figure FDA0002653297500000027
And boRespectively represent their deviation vectors; u. oft,
Figure FDA0002653297500000028
Ct,ot,htAnd ytRespectively an updating gate, an information state, a cell state, an output gate, a hidden layer output and a predicted value in a time period t; tanh and σ are the tan h and sigmoid activation functions, respectively; the symbols sum represent matrix multiplication and multiplication between matrix elements, respectively.
7. The improved long-short term memory network based power load prediction method as claimed in claim 6, wherein: in the ILSTM, the steps and calculation formula of information back propagation in the t-th period are:
a. defining the most common square error function as the target to be optimized
Figure FDA0002653297500000031
b. Calculating errors of output layers
Figure FDA0002653297500000032
Figure FDA0002653297500000033
Figure FDA0002653297500000034
Figure FDA0002653297500000035
c. Calculating errors of hidden layers
Figure FDA0002653297500000036
Figure FDA0002653297500000037
Figure FDA0002653297500000038
Figure FDA0002653297500000039
Figure FDA00026532975000000310
Figure FDA00026532975000000311
Figure FDA00026532975000000312
Figure FDA00026532975000000313
Figure FDA0002653297500000041
Figure FDA0002653297500000042
Figure FDA0002653297500000043
W=W-η·W (27)。
8. The improved long-short term memory network based power load prediction method as claimed in claim 6, wherein: in the ILSTM, the step of updating the weights of the tth period includes:
using Adam optimization algorithm with [ w ]h,wx,b]And [ wy,by]To update [ wh,wx,b]And [ wy,by](ii) a To better illustrate the update process, the weight is denoted by the symbol W, the gradient of the weight is denoted by W, and the general formula for Adam updating the weight is:
mt=β1·mt-1+(1-β1)·Wt (28)
vt=β2·vt-1+(1-β2)·(Wt)2 (29)
Figure FDA0002653297500000044
Figure FDA0002653297500000045
Figure FDA0002653297500000046
wherein EtAs an error function, ytAnd YtAre respectively predictionA value and an observed value; m istAnd
Figure FDA0002653297500000047
is the deviation of the first moment estimated and corrected, vtAnd
Figure FDA0002653297500000048
is the deviation of the biased second moment estimation and correction second moment; beta is a12And Adam's parameters, default to 0.9,0.999 and 10, respectively-8(ii) a Eta represents learning efficiency;
according to the formula, the predicted value is calculated by forward propagation, then the weight is updated by backward propagation, each round of training set is trained by taking a batch with a certain size, and each batch is updated once.
CN202010878240.7A 2020-08-27 2020-08-27 Power load prediction method based on improved long-term and short-term memory network Active CN111985719B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010878240.7A CN111985719B (en) 2020-08-27 2020-08-27 Power load prediction method based on improved long-term and short-term memory network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010878240.7A CN111985719B (en) 2020-08-27 2020-08-27 Power load prediction method based on improved long-term and short-term memory network

Publications (2)

Publication Number Publication Date
CN111985719A true CN111985719A (en) 2020-11-24
CN111985719B CN111985719B (en) 2023-07-25

Family

ID=73441162

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010878240.7A Active CN111985719B (en) 2020-08-27 2020-08-27 Power load prediction method based on improved long-term and short-term memory network

Country Status (1)

Country Link
CN (1) CN111985719B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113505923A (en) * 2021-07-08 2021-10-15 广西大学 Regional power grid short-term load prediction method and system
CN113837480A (en) * 2021-09-29 2021-12-24 河北工业大学 Impact load prediction method based on improved GRU and differential error compensation
CN114187059A (en) * 2021-12-27 2022-03-15 广东工业大学 Short-term electricity price prediction method
CN114881343A (en) * 2022-05-18 2022-08-09 清华大学 Short-term load prediction method and device of power system based on feature selection

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106779505A (en) * 2017-02-28 2017-05-31 中国南方电网有限责任公司 A kind of transmission line malfunction method for early warning driven based on big data and system
CN108830487A (en) * 2018-06-21 2018-11-16 王芊霖 Methods of electric load forecasting based on long neural network in short-term
CN109948845A (en) * 2019-03-15 2019-06-28 国网江苏省电力有限公司经济技术研究院 A kind of distribution network load shot and long term Memory Neural Networks prediction technique
CN110298501A (en) * 2019-06-21 2019-10-01 河海大学常州校区 Electric load prediction technique based on long Memory Neural Networks in short-term
US20210209467A1 (en) * 2018-09-25 2021-07-08 Ennew Digital Technology Co., Ltd. Method and device for predicting thermal load of electrical system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106779505A (en) * 2017-02-28 2017-05-31 中国南方电网有限责任公司 A kind of transmission line malfunction method for early warning driven based on big data and system
CN108830487A (en) * 2018-06-21 2018-11-16 王芊霖 Methods of electric load forecasting based on long neural network in short-term
US20210209467A1 (en) * 2018-09-25 2021-07-08 Ennew Digital Technology Co., Ltd. Method and device for predicting thermal load of electrical system
CN109948845A (en) * 2019-03-15 2019-06-28 国网江苏省电力有限公司经济技术研究院 A kind of distribution network load shot and long term Memory Neural Networks prediction technique
CN110298501A (en) * 2019-06-21 2019-10-01 河海大学常州校区 Electric load prediction technique based on long Memory Neural Networks in short-term

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113505923A (en) * 2021-07-08 2021-10-15 广西大学 Regional power grid short-term load prediction method and system
CN113837480A (en) * 2021-09-29 2021-12-24 河北工业大学 Impact load prediction method based on improved GRU and differential error compensation
CN113837480B (en) * 2021-09-29 2023-11-07 河北工业大学 Impact load prediction method based on improved GRU and differential error compensation
CN114187059A (en) * 2021-12-27 2022-03-15 广东工业大学 Short-term electricity price prediction method
CN114881343A (en) * 2022-05-18 2022-08-09 清华大学 Short-term load prediction method and device of power system based on feature selection
CN114881343B (en) * 2022-05-18 2023-11-14 清华大学 Short-term load prediction method and device for power system based on feature selection

Also Published As

Publication number Publication date
CN111985719B (en) 2023-07-25

Similar Documents

Publication Publication Date Title
Afrasiabi et al. Advanced deep learning approach for probabilistic wind speed forecasting
CN109902801B (en) Flood collective forecasting method based on variational reasoning Bayesian neural network
CN111985719B (en) Power load prediction method based on improved long-term and short-term memory network
CN111260030B (en) A-TCN-based power load prediction method and device, computer equipment and storage medium
CN111260136A (en) Building short-term load prediction method based on ARIMA-LSTM combined model
CN109284866B (en) Commodity order prediction method and device, storage medium and terminal
CN109063939B (en) Wind speed prediction method and system based on neighborhood gate short-term memory network
CN110059867B (en) Wind speed prediction method combining SWLSTM and GPR
CN111553543A (en) Power load prediction method based on TPA-Seq2Seq and related assembly
CN111738520A (en) System load prediction method fusing isolated forest and long-short term memory network
CN113554466A (en) Short-term power consumption prediction model construction method, prediction method and device
CN111882157A (en) Demand prediction method and system based on deep space-time neural network and computer readable storage medium
CN110633859A (en) Hydrological sequence prediction method for two-stage decomposition integration
CN113255900A (en) Impulse load prediction method considering improved spectral clustering and Bi-LSTM neural network
CN112990587A (en) Method, system, equipment and medium for accurately predicting power consumption of transformer area
CN115456306A (en) Bus load prediction method, system, equipment and storage medium
CN111539558A (en) Power load prediction method adopting optimized extreme learning machine
CN116542701A (en) Carbon price prediction method and system based on CNN-LSTM combination model
Wang et al. A Transformer-based multi-entity load forecasting method for integrated energy systems
CN116578551A (en) GRU-GAN-based power grid data restoration method
CN115619028A (en) Clustering algorithm fusion-based power load accurate prediction method
CN115794805A (en) Medium-low voltage distribution network measurement data supplementing method
CN115545319A (en) Power grid short-term load prediction method based on meteorological similar day set
CN115310355A (en) Multi-energy coupling-considered multi-load prediction method and system for comprehensive energy system
Piotrowski et al. On the importance of training methods and ensemble aggregation for runoff prediction by means of artificial neural networks

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant