CN116108742A

CN116108742A - Low-voltage transformer area ultra-short-term load prediction method and system based on improved GRU-NP model

Info

Publication number: CN116108742A
Application number: CN202211595583.8A
Authority: CN
Inventors: 周玉; 赵双双; 李悦; 高凡; 张震; 纪峰; 穆卓文; 崔高颖; 周超; 王舒; 夏宇航; 冯可
Original assignee: State Grid Jiangsu Electric Power Co ltd Marketing Service Center
Current assignee: State Grid Jiangsu Electric Power Co ltd Marketing Service Center
Priority date: 2022-12-13
Filing date: 2022-12-13
Publication date: 2023-05-12

Abstract

The method and the system for predicting the ultra-short-term load of the low-voltage transformer area based on the improved GRU-NP model firstly carry out data cleaning on original load data, introduce a distance correlation coefficient to carry out characteristic correlation analysis and reject the characteristic with lower correlation; then normalizing the data set and dividing the data set into a training set and a testing set according to the proportion of 8:2; building GRU optimized based on RMSProp algorithm and NP model optimized by using Optuna; then, searching the optimal weight fused by the GRU model and the NP model by using a multi-layer perceptron MLP, and obtaining a mixed model according to the optimal weight; and loading the data to be predicted into a trained GRU-NP model to predict the load value at the future moment. Aiming at ultra-short-term load prediction of a low-voltage transformer area, an improved GRU-NP model is adopted to predict a load value of 15 minutes in the future, and experiments show that the prediction precision of multi-characteristic load data of the low-voltage transformer area is effectively improved.

Description

Low-voltage transformer area ultra-short-term load prediction method and system based on improved GRU-NP model

Technical Field

The invention belongs to the field of load prediction in an electric power environment, and particularly relates to a low-voltage transformer area ultra-short-term load prediction method and system based on an improved GRU-NP (gated recurrent unit-nerve propset) model.

Background

The gating recurrent neural network (gated recurrent neural network) is proposed to better capture the dependency of the time step distance in the time series, and controls the flow of information through a gate that can be learned. Among them, the gate-controlled loop unit (GRU, gated recurrent unit) is a common gate-controlled loop neural network. It introduces the concept of reset gate and update gate, thus modifying the way the hidden state is calculated in the recurrent neural network.

The Neural propset is a decomposable time sequence prediction model developed by a core data science team of Meta company, is a user-friendly time sequence prediction tool realized based on PyTorch, and has the main functions of the 2018 open source prediction tool propset, and is mainly used for time sequence data analysis. The Neural propset consists of different components such as trends, seasonal, auto-regressions, other regressions, etc., with three major model components being trend, seasonal, and holiday.

The multi-layer perceptron (MLP, multilayer Perceptron) is also called an artificial neural network (ANN, artificial Neural Network), which may have multiple hidden layers in between, in addition to input and output layers, and the MLP model may be implemented using different algorithms. For the hybrid model, the MLP model is implemented using back propagation, as this approach can give the predicted results higher accuracy.

Disclosure of Invention

In order to solve the defects in the prior art, the invention provides a low-voltage transformer area ultra-short-term load prediction method and system based on an improved GRU-NP model, and the accuracy of low-voltage transformer area multi-feature load data prediction is improved.

The invention adopts the following technical scheme.

The ultra-short-term load prediction method for the low-voltage transformer area based on the improved GRU-NP model comprises the following steps:

step 1, carrying out normalization processing on an original data set, calculating a distance correlation coefficient of temperature, humidity and rainfall characteristics and load data, selecting the strong correlation characteristics as external factors, and cleaning and completing the load data;

step 2, building a GRU model based on RMSProp optimization, training the model by using the load data obtained in the step 1 and the strongly related characteristic data, building an Optuna super-parameter-based optimizing NP model, and training the NP model by using the load data obtained in the step 1 and combining the time sequence characteristics;

step 3, searching the optimal weight of the combination of the GRU model and the NP model by using a three-layer MLP network, thereby obtaining an improved GRU-NP model;

and 4, predicting the prediction results of the GRU model and the NP model obtained in the step 2 by using an improved GRU-NP model to obtain a future load value, and taking the future load value as a final prediction result.

Preferably, step 1 specifically includes:

step 1.1, carrying out Min-Max normalization on an original data set, wherein the Min-Max normalization formula is as follows:

in the method, in the process of the invention,

x represents the data before the processing and,

x' represents the data after the processing and,

X _max represents the maximum value of the data and,

X _min representing a minimum value of the data;

step 1.2, introducing a distance correlation coefficient to perform characteristic correlation analysis on temperature, humidity and rainfall in the original data set, and calculating by using a distance correlation coefficient formula;

step 1.3, calculating a temperature and load data distance correlation coefficient, a humidity and load data distance correlation coefficient, a rainfall and load data distance correlation coefficient according to the step 1.2, wherein the distance correlation coefficient is 0-0.2, which means that the two sequences are irrelevant or weakly relevant, and selecting a characteristic with the distance correlation coefficient larger than 0.2 as an external factor to participate in training of the GRU model;

and 1.4, cleaning the load data, filling in the blank value and removing the abnormal value.

Preferably, in step 1.2, the distance correlation coefficient formula is as follows:

in the method, in the process of the invention,

mu represents the sequence of load data,

v represents a characteristic data sequence of temperature, humidity and rainfall,

dcov (μ, v) represents the covariance of μ and v,

dcorr (μ, v) represents the distance correlation coefficient of μ and v.

Preferably, step 2 specifically includes:

step 2.1, dividing the load data obtained in the step 1 and the strong correlation characteristic data into a training set and a testing set according to a proportion;

step 2.2, building a GRU model based on RMSProp optimization, adding a Dropout layer to reduce the risk of overfitting, wherein each gating cycle unit comprises: an update gate, a reset gate; the updating gate is used for controlling the output of the hidden state at the previous moment and the degree to which the input of the hidden state at the current moment is brought into the hidden state at the current moment; the reset gate is used for controlling the hidden state output of the previous moment to flow into the candidate hidden state of the current moment;

step 2.3, setting GRU model super parameters, and optimizing the GRU model by using a RMSProp algorithm;

and 2.4, inputting the training set into the GRU model based on RMSProp optimization for training, and obtaining a trained GRU model.

Preferably, the parameter update procedure using RMSProp algorithm is as follows:

updating the state component s _t Calculating a random gradient grad _t A prime weighted moving average of the square terms of (a) is given by:

s _t ＝γs _t-1 +(1-γ)grad _t ⊙grad _t

in the method, in the process of the invention,

t represents a step of time in time,

gamma represents the attenuation coefficient, is a super parameter, takes the value (0, 1),

grad _t representing a random gradient of the gradient,

s _t-1 representing the state component of the current time instant,

S _t representing a previous time state component;

updating argument x _t The learning rate of each element in the independent variable is readjusted by element operation, and the formula is as follows:

in the method, in the process of the invention,

epsilon represents a constant added to maintain the stability of the numerical value, and the value is 10 ^-6 ，

x _t-1 The input data representing the previous moment in time,

x _t input data representing the current time of day is displayed,

l _r representing the global learning rate.

Preferably, step 2 further comprises:

step 2.5, performing super-parameter optimization on the NP model by using Optuna; selecting a change point list, the number of change points, the regular trend and the learning rate as a super-parameter list to be optimized, and selecting a hysteresis order through a comparison experiment, wherein the hysteresis order is finally fixed to be 4;

and 2.6, building an NP model by adopting an Optuna optimized super-parameter list, wherein the formula is as follows:

y(t)＝T(t)+S(t)+H(t)+E(t)

in the method, in the process of the invention,

t (T) represents a trend term, S (T) represents a season term, H (T) represents holidays, special event effects, E (T) represents a random error term;

and 2.7, inputting the training set into the NP model for training to obtain a trained NP model.

Preferably, step 3 specifically includes:

step 3.1, constructing a three-layer MLP network, wherein the formula is as follows:

f(x)＝G(b ₂ +w ₂ *f(w ₁ *X+b ₁ )))

in the method, in the process of the invention,

x represents the input load data and,

f (x) represents the predicted value of the output,

w ₁ the weight coefficients representing the hidden layers of the first layer,

w ₂ the weight coefficients representing the hidden layer of the second layer,

b ₁ the bias term representing the hidden layer of the first layer,

b ₂ representing the bias term of the hidden layer of the second layer,

f(w ₁ *x+b ₁ ) Representing the result of the first layer hidden layer calculation,

g represents an activation function, which is a ReLU activation function;

and 3.2, training the GRU model and NP model prediction results obtained in the step 2 by using an MLP network to obtain an optimal weight coefficient, and combining the models by using the optimal weight coefficient to obtain an improved GRU-NP model.

A low-voltage area ultra-short-term load prediction system based on an improved GRU-NP model comprises: the system comprises a data processing module, a model building module, a model improvement module and a prediction module, wherein:

the data processing module is used for carrying out normalization processing on the original data set, calculating the distance correlation coefficient of temperature, humidity and rainfall characteristics and the load data, selecting the strong correlation characteristics as external factors, and cleaning and completing the load data;

the model building module is used for building a GRU model based on RMSProp optimization, training the model by using the load data obtained by the data processing module and the strongly related characteristic data, building an NP model based on Optuna super-parameter optimization, and training the NP model by combining the load data with time sequence characteristics;

the model improvement module is used for searching the optimal weight of the combination of the GRU model and the NP model by using the three-layer MLP network so as to obtain an improved GRU-NP model;

the prediction module is used for predicting the prediction results of the GRU model and the NP model by using the improved GRU-NP model to obtain a load value of one minute in the future, and the load value is used as a final prediction result.

A terminal comprising a processor and a storage medium; wherein:

the storage medium is used for storing instructions;

the processor is operative in accordance with the instructions to perform steps in accordance with a low voltage region ultra-short term load prediction method based on an improved GRU-NP model.

A computer readable storage medium having stored thereon a computer program which when executed by a processor performs the steps of a method for low voltage region ultra-short term load prediction based on an improved GRU-NP model.

The invention has the advantages that compared with the prior art,

according to the improved GRU-NP model, the weak correlation characteristic is removed by calculating the distance correlation coefficient through characteristic engineering, RMSProp (root mean square prop) and Optuna are introduced to optimize the GRU model and the NP model respectively, and the optimal weight for fusion is found through a multi-layer perceptron, so that the temperature characteristic advantage of the GRU model and the time sequence, season and holiday influence components of the NP model are combined, and the ultra-short-term load prediction precision of a low-voltage station area is effectively improved.

Compared with the traditional neural network (GRU, LSTM, BP), the hybrid model has the timing sequence characteristics which are not possessed by the hybrid model, compared with the Prophet model (the previous generation of the NP model), the method has the advantages that the prediction precision is improved by introducing the autoregressive network, and compared with the single NP model, the method has the temperature characteristics. In addition, the NP model super-parameters are optimized by introducing Optuna, so that the model prediction precision is higher.

Drawings

FIG. 1 is a flow chart of a low-voltage area ultra-short-term load prediction method based on an improved GRU-NP model;

FIG. 2 is a graph of temperature versus load data;

FIG. 3 is a graph showing the comparison of the predicted result and the true value of the improved GRU-NP model;

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. The embodiments described herein are merely some, but not all embodiments of the invention. All other embodiments, which can be made by those skilled in the art without making any inventive effort, are within the scope of the present invention.

The data set used in the method is a real load data set in a 2016-year electrotechnical modeling competition A, the Time range is from 1 month and 1 day in 2012 to 12 months and 31 days in 2013, load data sampled every 15 minutes at intervals, the total load value is about 3.5 ten thousand times, the data is preprocessed firstly, then characteristic engineering is carried out to remove weak correlation characteristics, and a training set and a test value are constructed by taking Time step as 4. Then respectively constructing a 4-feature GRU model and an NP model, adopting MSE, RMSE, MAE, R ² As an evaluation index, a model training result shows that the 4-feature GRU model and the NP model do not perform well, so that the ultra-short-term load prediction method of the low-voltage transformer area based on the improved GRU-NP model is provided, a distance correlation coefficient is calculated for carrying out correlation analysis, RMSProp, optuna is introduced for respectively optimizing the GRU model and the NP model, and finally the advantages of the GRU model and the NP model are combined in a mode of distributing optimal weights through a multi-layer perceptron.

Example 1.

The method for predicting the ultra-short-term load of the low-voltage transformer area based on the improved GRU-NP model is shown in fig. 1, and comprises the following specific operation steps:

step 1, carrying out normalization processing on an original data set, calculating a distance correlation coefficient of temperature, humidity and rainfall characteristics and load data, selecting the strong correlation characteristics as external factors, and then cleaning and completing the load data.

Step 1.1, carrying out Min-Max normalization on an original data set, eliminating dimension influence, enabling a model training gradient to drop faster, and adopting the following Min-Max normalization formula:

wherein X is data before processing, X' is data after processing, X _max Is the maximum value of the data, X _min Is the minimum value of the data;

step 1.2, introducing a distance correlation coefficient to replace a pearson correlation coefficient to perform characteristic correlation analysis on temperature, humidity and rainfall in an original data set, and overcoming the defect that the pearson correlation coefficient is not necessarily independent between two variables even if 0, wherein the distance correlation coefficient is expressed as follows:

in the method, in the process of the invention,

mu represents the sequence of load data,

dcov (μ, v) represents the covariance of μ and v,

dcorr (μ, v) represents the distance correlation coefficient of μ and v.

Step 1.3, as shown in Table 1, calculating a temperature and load data distance correlation coefficient dcorr _t 0.6951, the relationship between temperature and load data is shown in FIG. 2, and the relationship between humidity and load data distance is a coefficient dcorr _h 0.1827, a rainfall-to-load data distance correlation coefficient dcorr _r 0.1159, the correlation coefficient is 0-0.2, and the two sequences can be regarded as uncorrelated or weakly correlated, so that the characteristic temperature with the distance correlation coefficient larger than 0.2 is selected as an external factor to participate in the training of the GRU model;

table 1 distance correlation coefficient of three features and load data

And 2, building a GRU model based on RMSProp optimization, training the model by using the mode of combining the load data obtained in the step 1 with temperature factors, building an Optuna super-parameter optimizing NP model, and training the model by using the load data obtained in the step 1 and time sequence characteristics. And updating the internal weight parameters of the NP model to obtain the trained NP model.

Wherein the timing characteristics include: year, month, quarter, holiday, etc.

Step 2.1, dividing a training set and a testing set according to 8:2, and constructing training Batch data according to the Time step of 4 and the Batch Size of 128;

step 2.2, building a GRU model based on RMSProp optimization, adding a Dropout layer to reduce the risk of overfitting, wherein each gating cycle unit comprises: an update gate, a reset gate; updating door z _t For controlling the hidden state output h of the previous time t-1 _t-1 Hidden state input x at current time t _t A hidden state h brought to the current instant t _t Reset gate r _t For controlling the hidden state output h of the previous time t-1 _t-1 How much flows into the candidate hidden state at the current time t

In (a) and (b);

step 2.3, setting GRU model super parameters, and optimizing the GRU model by using a RMSProp algorithm; the RMSProp algorithm is a self-adaptive learning rate method, adopts an exponential weighted moving average, solves the problem that the learning rate is too small to approach an optimal solution in the later period of iteration, and introduces a parameter updating process of the RMSProp algorithm as follows:

updating the state component s _t I.e. calculating random gradient grad _t A prime weighted moving average of the square terms of (a), the formula is described as follows:

s _t ＝γs _t-1 +(1-γ)grad _t ⊙grad _t

in the method, in the process of the invention,

t represents a step of time in time,

grad _t representing a random gradient of the gradient,

s _t-1 representing the state component of the current time instant,

s _t representing the state component at the previous time.

Updating argument x _t In the independent variablesThe learning rate of each element is readjusted by element operation, and the formula is described as follows:

in the method, in the process of the invention,

x _t-1 The input data representing the previous moment in time,

x _t input data representing the current time of day is displayed,

lr denotes the global learning rate.

Step 2.4, inputting a training set into the GRU MODEL based on RMSProp optimization for training to obtain a trained MODEL MODEL_GRU, and updating weight parameters w in the MODEL;

step 2.5, performing super-parameter list optimization on the NP model by using Optuna; selecting a change point list (changepoints), the number (n_changepoints), trend regularization (trend_reg) and learning rate (learning_rate) as super-parameter lists to be optimized, and selecting a hysteresis order (n_lags) through a comparison experiment, wherein the hysteresis order (n_lags) is finally fixed to be 4; the comparison experiment refers to selecting different hysteresis orders (n_lags), and comparing the evaluation indexes of R2 values, mae, mse and map.

Step 2.6, building an NP model by adopting an Optuna optimized super-parameter list; the NP model consists of a number of modules, the formula being described as:

y(t)＝T(t)+S(t)+H(t)+E(t)

in the method, in the process of the invention,

y (t) represents the output of the NP model,

step 2.7, inputting the training set into an NP MODEL for training, and updating weights to obtain a trained MODEL MODEL_NP;

step 3, searching the optimal weight of the GRU model combined with the NP model by using a three-layer MLP network, obtaining a mixed model according to the optimal weight,

step 3.1, constructing a multi-layer perceptron (MLP) network, which is formed by a plurality of fully connected layers, wherein the hybrid model uses three layers of MLP, and the formula is described as follows:

f(x)＝G(b ₂ +w ₂ *f(w ₁ *x+b ₁ )))

in the method, in the process of the invention,

x represents the input load data and,

f (x) represents the predicted value of the output,

w ₁ weight coefficient representing first layer hidden layer

b ₁ the bias term representing the hidden layer of the first layer,

b ₂ representing the bias term of the hidden layer of the second layer,

g represents the activation function, which is a ReLU activation function,

f represents a first layer activation function, and a nonlinear factor is introduced into the nerve cells, so that the nerve network can approach the nonlinear function, and a ReLU activation function is selected;

and 3.2, training the GRU model and NP model prediction results obtained in the step 2 by using an MLP network to obtain an optimal weight coefficient, and combining the model with the improved GRU-NP model by using the optimal weight coefficient for load prediction.

And 4, inputting the prediction results of the GRU model and the NP model obtained in the step 2 into an improved GRU-NP model for prediction to obtain a load value of one minute in the future, and taking the load value as a final prediction result. The prediction results are shown in fig. 3.

The present embodiment preferably divides the training set and the test set by 8:2 for the prediction results of the GRU model and the NP model.

Quantitative secondary Mean Square Error (MSE), root Mean Square Error (RMSE), mean Absolute Error (MAE), R ² Score four evaluation criteria.

Example 2.

Example 3.

Embodiment 3 of the present invention provides a computer-readable storage medium.

A computer readable storage medium having stored thereon a program which when executed by a processor performs the steps in low voltage region ultra-short term load prediction based on an improved GRU-NP model according to one embodiment of the present invention.

The detailed steps are the same as those of the low-voltage area ultra-short-term load prediction method based on the improved GRU-NP model provided in embodiment 1, and will not be described in detail here.

Example 4.

The embodiment 4 of the invention provides electronic equipment.

An electronic device comprising a memory, a processor and a program stored on the memory and executable on the processor, the processor implementing the steps in the method for low voltage region ultra-short term load prediction based on the improved GRU-NP model according to embodiment 1 of the present invention when the program is executed.

The invention has the advantages that compared with the prior art,

The present disclosure may be a system, method, and/or computer program product. The computer program product may include a computer readable storage medium having computer readable program instructions embodied thereon for causing a processor to implement aspects of the present disclosure.

The computer readable storage medium may be a tangible device that can hold and store instructions for use by an instruction execution device. The computer readable storage medium may be, for example, but not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer-readable storage medium would include the following: portable computer disks, hard disks, random Access Memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), static Random Access Memory (SRAM), portable compact disk read-only memory (CD-ROM), digital Versatile Disks (DVD), memory sticks, floppy disks, mechanical coding devices, punch cards or in-groove structures such as punch cards or grooves having instructions stored thereon, and any suitable combination of the foregoing. Computer-readable storage media, as used herein, are not to be construed as transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through waveguides or other transmission media (e.g., optical pulses through fiber optic cables), or electrical signals transmitted through wires.

The computer readable program instructions described herein may be downloaded from a computer readable storage medium to a respective computing/processing device or to an external computer or external storage device over a network, such as the internet, a local area network, a wide area network, and/or a wireless network. The network may include copper transmission cables, fiber optic transmissions, wireless transmissions, routers, firewalls, switches, gateway computers and/or edge servers. The network interface card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium in the respective computing/processing device.

Computer program instructions for performing the operations of the present disclosure can be assembly instructions, instruction Set Architecture (ISA) instructions, machine-related instructions, microcode, firmware instructions, state setting data, or source or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, c++ or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The computer readable program instructions may be executed entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider). In some embodiments, aspects of the present disclosure are implemented by personalizing electronic circuitry, such as programmable logic circuitry, field Programmable Gate Arrays (FPGAs), or Programmable Logic Arrays (PLAs), with state information of computer readable program instructions, which can execute the computer readable program instructions.

Finally, it should be noted that the above embodiments are only for illustrating the technical solution of the present invention and not for limiting the same, and although the present invention has been described in detail with reference to the above embodiments, it should be understood by those skilled in the art that: modifications and equivalents may be made to the specific embodiments of the invention without departing from the spirit and scope of the invention, which is intended to be covered by the claims.

Claims

1. The low-voltage station ultra-short-term load prediction method based on the improved GRU-NP model is characterized by comprising the following steps of:

2. The method for predicting ultra-short term load of low voltage transformer area based on improved GRU-NP model of claim 1,

the step 1 specifically comprises the following steps:

in the method, in the process of the invention,

x represents the data before the processing and,

x' represents the data after the processing and,

X _max represents the maximum value of the data and,

X _min representing a minimum value of the data;

3. The method for predicting ultra-short term load of low voltage transformer area based on improved GRU-NP model of claim 2,

in step 1.2, the formula of the distance correlation coefficient is as follows:

in the method, in the process of the invention,

mu represents the sequence of load data,

dcov (μ, v) represents the covariance of μ and v,

dcorr (μ, v) represents the distance correlation coefficient of μ and v.

4. The method for predicting ultra-short term load of low-voltage transformer area based on improved GRU-NP model of claim 3,

step 2, specifically comprising:

5. The method for predicting ultra-short term load of low voltage transformer area based on improved GRU-NP model of claim 4,

the parameter update procedure using RMSProp algorithm is as follows:

s _t ＝γs _t-1 +(1-γ)grad _t ⊙grad _t

in the method, in the process of the invention,

t represents a step of time in time,

grad _t representing a random gradient of the gradient,

s _t-1 representing the state component of the current time instant,

S _t representing a previous time state component;

in the method, in the process of the invention,

x _t-1 The input data representing the previous moment in time,

x _t input data representing the current time of day is displayed,

lr denotes the global learning rate.

6. The method for predicting ultra-short term load of low voltage transformer area based on improved GRU-NP model of claim 5,

step 2 further comprises:

y(t)＝T(t)+S(t)+H(t)+E(t)

in the method, in the process of the invention,

7. The method for ultra-short term load prediction of a low voltage region based on an improved GRU-NP model of claim 6,

the step 3 specifically comprises the following steps:

f(x)＝G(b ₂ +w ₂ *f(w ₁ *X+b ₁ )))

in the method, in the process of the invention,

x represents the input load data and,

f (x) represents the predicted value of the output,

b ₁ the bias term representing the hidden layer of the first layer,

b ₂ representing the bias term of the hidden layer of the second layer,

g represents an activation function, which is a ReLU activation function;

8. A low voltage domain ultrashort term load prediction system based on an improved GRU-NP model using the method of any one of claims 1-7, comprising: the model comprises a data processing module, a model building module, a model improvement module and a prediction module, and is characterized in that:

9. A terminal comprising a processor and a storage medium; the method is characterized in that:

the storage medium is used for storing instructions;

the processor is operative according to the instructions to perform the steps of the low voltage region ultra-short term load prediction method based on the modified GRU-NP model according to any one of claims 1-7.

10. A computer readable storage medium having stored thereon a computer program, which when executed by a processor, implements the steps of the improved GRU-NP model based low voltage district ultra short term load prediction method according to any one of claims 1 to 7.