CN109063939B

CN109063939B - Wind speed prediction method and system based on neighborhood gate short-term memory network

Info

Publication number: CN109063939B
Application number: CN201811296424.1A
Authority: CN
Inventors: 覃晖; 张振东; 欧阳硕; 刘永琦; 戴明龙; 邵骏; 李�杰; 裴少乾; 朱龙军
Original assignee: Huazhong University of Science and Technology; Bureau of Hydrology Changjiang Water Resources Commission
Current assignee: Huazhong University of Science and Technology; Bureau of Hydrology Changjiang Water Resources Commission
Priority date: 2018-11-01
Filing date: 2018-11-01
Publication date: 2020-08-18
Anticipated expiration: 2038-11-01
Also published as: CN109063939A

Abstract

The invention belongs to the technical field of wind speed prediction, and discloses a wind speed prediction method and a system based on a neighborhood gate long-term and short-term memory network, wherein linear and nonlinear correlations among variables are explored by respectively adopting a Pearson correlation coefficient and a maximum information coefficient to screen a wind speed correlation factor; on the basis of correlation analysis, a Glan's cause and effect relationship test is utilized to explore the cause and effect relationship of the wind speed and the wind speed factor in the statistical sense; dividing the causal relationship structure into 5 types, and unifying all types of causal relationships into an equivalent tree causal relationship structure by a decomposition-virtual variable-pruning method; and aiming at the causal relationship structure of the equivalent tree, a long-term and short-term memory network model based on a neighborhood gate is provided to predict the wind speed. The forecasting method (NLSTM) accurately considers the causal relationship between the wind speed and the wind speed factor, effectively improves the forecasting precision of the wind speed, and plays a vital role in wind power application and power grid dispatching.

Description

Wind speed prediction method and system based on neighborhood gate short-term memory network

Technical Field

The invention belongs to the technical field of wind speed prediction, and particularly relates to a wind speed prediction method and system based on a neighborhood gate length short-term memory network.

Background

Currently, the current state of the art commonly used in the industry is such that:

wind energy is a promising renewable clean energy source and has received widespread attention in recent years from all over the world. More and more wind power is connected to the power system, so that the power system becomes unreliable, which is caused by strong fluctuation and strong randomness of wind speed. Therefore, accurately predicting wind speed plays a crucial role in the utilization of wind energy and efficient scheduling of power systems. Wind speed is affected by many meteorological factors, including factors such as air pressure, temperature, humidity, etc. The wind speed prediction is difficult due to the complex relationship among the factors, and the accuracy of the wind speed prediction by the traditional machine learning method is limited.

Deep learning methods long short term memory networks (LSTM) have a high prediction accuracy when solving time series prediction problems like wind speed, but LSTM is often used as a black box model, which makes the model less interpretable. The correlation of the wind speed influence factors is analyzed through characteristic engineering, and the causal relationship between the wind speed influence factors is cleared, so that the wind speed prediction precision is improved, and the interpretability of the model is enhanced. Therefore, how to analyze the causal relationship between the wind speed and the related factors and accurately consider the causal relationship into the LSTM is a theoretical and practical engineering problem to be solved urgently so as to improve the wind speed prediction accuracy and enhance the interpretability of the model.

In the feature engineering, commonly used correlation analysis methods include a graph method, a correlation coefficient method, a covariance method, a maximum information coefficient method, and the like. Common causal relationship analysis methods include theoretical analysis, transmission entropy and glovey causal relationship test.

In the present invention, the pearson correlation coefficient is used to explore the linear correlation between the factors, and the maximum information coefficient is used to explore the non-linear correlation between the factors. The glange causal relationship test is used to explore causal relationships between factors.

In summary, the problems of the prior art are as follows:

the causal relationship structure types among the factors are complex and various, and few scholars classify the causal relationship structures at present, so that how to scientifically and completely classify all the causal relationship structure types is also a problem in the prior art.

How to conveniently and effectively use the causal relationship structure after classification by using the case which is not available for reference at present, so that unifying the causal relationship structure after classification into a universal causal relationship structure is also a problem faced by the prior art.

In the prior art, the wind speed is difficult to predict due to the complex relationship among factors, and the accuracy of predicting the wind speed by using the traditional machine learning method is limited.

In the prior art, because the LSTM only has one characteristic input interface, the LSTM can only input all factors without distinction and cannot accurately consider the causal relationship obtained through characteristic engineering, and the causal relationship structure of the wind speed factor cannot be accurately considered into the LSTM.

The difficulty and significance for solving the technical problems are as follows:

the difficulty with classifying causal structures is how the classification can encompass all causal structure types. The completeness of classification is therefore the basis for the subsequent techniques.

The difficulty in unifying the causal structures is how to find the common points among the classified causal structures to obtain a common structure. This generic structure may not only represent all types of causal structures but also needs to be an easily predictable structure. Therefore, the representativeness and operability of the unified causal relationship structure play a role in the invention.

After the technical problem is solved, the significance is brought as follows:

in order to enable the LSTM to have the capability of accurately considering the obtained wind speed causal relationship structure, the invention provides a long-short term memory Network (NLSTM) based on a neighborhood gate. NLSTM differs from LSTM in network structure, and correctly deducing forward and backward propagation formulas of NLSTM is a difficulty in realizing NLSTM. The accurate realization of NLSTM is the guarantee for improving the wind speed prediction precision.

Because the causal relationship structures of wind speeds in different areas may be different, it is also one of the difficulties to popularize the technology of the present invention. Therefore, the designed NLSTM can be correspondingly changed according to different causal relationship structures, and the popularization of the NLSTM is very favorable.

Disclosure of Invention

Aiming at the problems in the prior art, the invention provides a wind speed prediction method and system based on a neighborhood gate length short-term memory network, which can accurately consider the causal relationship structure between wind speed and wind speed influence factors and can obtain a high-precision wind speed prediction result.

The invention is realized in such a way that a wind speed prediction method based on a neighborhood gate length short-term memory network comprises the following steps: the wind speed prediction method based on the neighborhood gate long and short term memory network screens wind speed related factors by respectively adopting Pearson related coefficients and linear and nonlinear correlations between maximum information coefficient analysis variables;

analyzing the causal relationship of the wind speed and the wind speed factor in the statistical significance by utilizing the Glan's causal relationship on the basis of the correlation analysis; dividing the causal relationship structure into 5 types, and unifying all types of causal relationships into an equivalent tree causal relationship structure by a decomposition-virtual variable-pruning method;

and predicting the wind speed of the causal relationship structure of the equivalent tree through a long-term and short-term memory network model based on the neighborhood gate.

The method specifically comprises the following steps:

(1) collecting data of wind speed Y and possibly the influencing factor of the wind speed

(2) Linear and non-linear correlations between wind speed and possibly wind speed influencing factors are analyzed using Pearson correlation coefficient (MIC) and Maximum Information Coefficient (MIC), respectively, to obtain wind speed correlation factor [ x ]₁,x₂,,…,x_n]The influence factor of the absolute value of the Pearson correlation coefficient with the wind speed or the maximum information coefficient of 0.5 or more can be used as the wind speed correlation factor.

(3) Method for detecting and exploring wind speed Y and wind speed related factor [ x ] by using Glanberg causal relationship₁,x₂,,…,x_n]Causal relationship in a statistical sense.

(4) According to the shape of the causal relationship among the wind speed and the wind speed related factors, the causal relationship structure is divided into five structures including a central hub, a chain structure, a ring structure, a tree structure and a network structure. Through careful analysis, it can be found that the central hub structure and the chain structure are special cases of the tree structure in the horizontal extension and the vertical extension, respectively, and the ring structure can be decomposed into a series of chain structures, so that the first three kinds of causal relationship structures can be converted into the tree causal relationship structure.

(5) The network-like structure is a general form of a causal structure. Decomposing a network structure into a plurality of chain structures in the direction of an arrow of inverse causal relationship from wind speed, replacing and distinguishing factors in a plurality of decomposition lines by virtual variables (the variables have the same attribute with actual factors but different numbers and are virtualized for distinguishing the same factor in different decomposition lines), combining all the decomposition lines into a tree structure (the recombined tree is very huge), and pruning the tree structure according to the size of computing resources to obtain a final equivalent tree structure, so that all types of causal relationship structures can be converted into the equivalent tree causal relationship structure.

(6) Constructing a training set D consisting of a wind speed factor and a wind speed according to the causal relationship structure of the equivalent tree^Ta＝[x^Ta,Y^Ta]And test set D consisting of predictor only^Te＝[x^Te]And carrying out normalization processing on the data.

(7) Constructing a long-short term memory Network (NLSTM) based on a neighborhood gate according to the causal relationship of an equivalent tree, and setting parameters of the NLSTM, including the number n of nodes of an input layer_iNumber of hidden layer nodes n_hNumber of nodes of output layer n_oFixed learning rate η, batch size T, number of training rounds Ep.

(8) Adam optimization algorithm combined with mini-batch mechanism is adopted in training set D^TaNLSTM was trained.

(9) Test set D^TeAnd inputting the wind speed prediction result into a trained NLSTM for prediction to obtain a wind speed prediction result y.

Further, in step (8), the step of information forward propagation of the t-th time period and the calculation formula are as follows:

a. each node completes the forward propagation of standard LSTM independently

f_it＝σ(net_f,i,t)＝σ(w_fh,i·h_i,t-1+w_fx,i·x_it+b_f,i) (38)

i_it＝σ(net_i,i,t)＝σ(w_ih,i·h_i,t-1+w_ix,i·x_it+b_i,i) (39)

a_it＝tanh(net_a,i,t)＝tanh(w_ah,i·h_i,t-1+w_ax,i·x_it+b_a,i) (40)

C_it＝f_it*C_i,t-1+i_it*a_it(41)

o_it＝σ(net_o,i,t)＝σ(w_oh,i·h_i,t-1+w_ox,i·x_it+b_o,i) (42)

h_it＝o_it*tanh(C_it) (43)

b. Each node propagates forward along the tree from the leaf node to the root node

n_1it＝σ(net_n1,i,t)＝σ(w_n1h,i·h_i,t-1+w_n1x,i·x_it+b_n1,i) (46)

n_2it＝σ(net_n2,i,t)＝σ(w_n2h,i·h_i,t-1+w_n2x,i·x_it+b_n2,i) (47)

N_it＝n_1it*R_it+n_2it*h_it(48)

y_t＝σ(z_t)＝σ(w_y·N_mt+b_y) (49)

Wherein f is_it,i_it,a_it,C_it,o_it,h_it,R_it,N_itAnd y_itRespectively, a forgetting gate, an input gate, an information state, a cell state, an output gate, a hidden layer output, a central pivot, a neighborhood and a predicted value of the ith node in a time period t; n is_1itAnd n_2itAll are neighborhood gates of the ith node in the time period t; p_ijIs the jth child node of the ith node; tanh and σ are the tan h and sigmoid activation functions, respectively; the symbols sum represent matrix multiplication and multiplication between matrix elements, respectively; the remaining variables are all intermediate variables.

Further, in step (8), the step of t-th period error back propagation and the calculation formula are as follows:

a. defining the most common square error function as the target to be optimized

b. Calculating errors of output layers

c. Counter-propagating errors from root node to leaf node against tree direction

d. Using Adam optimization algorithm with [ w ]_Lh,i,w_Lx,i,b_L,i]And [ w_y,b_y]To update [ w_Lh,w_Lx,b_L]And [ w_y,b_y](ii) a For generality, the weight is denoted by the symbol W, the gradient of the weight is denoted by W, and the general formula for Adam to update the weight is:

m_ti＝β₁·m_ti-1+(1-β₁)·W_ti(70)

v_ti＝β₂·v_ti-1+(1-β₂)·(W_ti)²(71)

wherein E_tAs an error function, y_tAnd Y_tβ for predicted and observed values, respectively₁,β₂And Adam's parameters, default to 0.9,0.999 and 10, respectively^-8(ii) a ti is the current update times of the weight W, and is distinguished from the time period t;

calculating a predicted value by forward propagation according to the formula, and then updating the weight by backward propagation, which is called primary updating; a total iteration of Ep rounds, each round of which will train set D^TaAnd (5) taking batches with the size of T for training, and finishing updating once in each batch.

Another object of the present invention is to provide a computer program for implementing the wind speed prediction method based on the neighborhood gate length short term memory network.

The invention also aims to provide an information data processing terminal for realizing the wind speed prediction method based on the neighborhood gate length short-term memory network.

It is another object of the present invention to provide a computer-readable storage medium, comprising instructions which, when executed on a computer, cause the computer to perform the wind speed prediction method based on a neighborhood gate short term memory network.

The invention also aims to provide a wind speed prediction control system based on the neighborhood gate length short-term memory network, which realizes the wind speed prediction method based on the neighborhood gate length short-term memory network.

The invention also aims to provide the power equipment for utilizing the wind energy by predicting the wind speed, which is provided with the wind speed prediction control system based on the neighborhood gate length short-term memory network.

In summary, the advantages and positive effects of the invention are:

the invention provides a wind speed prediction method based on a neighborhood gate length short-term memory network, which analyzes the causal relationship between wind speed and wind speed factors through characteristic engineering and converts the causal relationship structure into an equivalent tree causal relationship structure by adopting a decomposition-virtual variable-pruning method. NLSTM significantly enhances model interpretability, and can accurately consider the equivalent tree causal structure, which the prior art cannot accurately consider.

The model NLSTM provided by the invention has good universality and easy popularization, and a corresponding network structure can be transformed according to different wind speed causal relationship structures.

The wind speed prediction result obtained by the model provided by the invention has high precision, and the wind speed prediction result can be analyzed from the comparison of the wind speed prediction indexes in the attached table 4 and can also be visually seen from the attached table 5.

Drawings

FIG. 1 is a flow chart of a method for predicting wind speed of a neighborhood gate length short term memory network according to an embodiment of the present invention;

FIG. 2 is a schematic diagram of an Equivalence Tree causal relationship structure provided by an embodiment of the present invention;

fig. 3 is a structural diagram of an NLSTM network according to an embodiment of the present invention;

FIG. 4 is a diagram illustrating a causal relationship of wind speed in a Xinjiang Fuzi station case and an equivalent tree thereof according to an embodiment of the present invention;

FIG. 5 is a comparative graph of wind speed prediction results of the Xinjiang Fuchun station case provided by the embodiment of the invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail with reference to the following embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

The method firstly adopts a 'decomposition-virtual variable-pruning' method to convert all types of causal relationship structures into a uniform equivalent tree structure, and then establishes a long-short term memory Network (NLSTM) based on a neighborhood gate corresponding to the equivalent tree structure to predict the wind speed. The structure of NLSTM is the same as the equivalent tree structure, so that the causal relationship structure among factors can be accurately considered.

Fig. 1 is a general flowchart of a wind speed prediction method of a neighborhood gate length short-term memory network according to the present invention, which specifically includes the following steps:

(2) Linear and non-linear correlations between wind speed and possibly wind speed influencing factors are analyzed using Pearson correlation coefficient (MIC) and Maximum Information Coefficient (MIC), respectively, to determine the wind speedObtaining the wind speed related factor [ x₁,x₂,,…,x_n]The influence factor of the absolute value of the Pearson correlation coefficient with the wind speed or the maximum information coefficient of 0.5 or more can be used as the wind speed correlation factor.

(7) Constructing a long-short term memory Network (NLSTM) based on a neighborhood gate according to the causal relationship of an equivalent tree, and setting parameters of the NLSTM, including the number n of nodes of an input layer_iNumber of hidden layer nodes n_hNumber of nodes of output layer n_oFixed learning rate η, batchSize T, number of training rounds Ep. The weights of the nodes are initialized according to parameters, including w_fh,i,w_fx,i,b_f,i],[w_ih,i,w_ix,i,b_i,i],[w_ah,i,w_ax,i,b_a,i],[w_oh,i,w_ox,i,b_o,i],[w_n1h,i,w_n1x,i,b_n1,i],[w_n2h,i,w_n2x,i,b_n2,i],[w_rh,i,j,w_rx,i,j,b_r,i]And [ w_y,b_y]。

(8) Adam optimization algorithm combined with mini-batch mechanism is adopted in training set D^TaNLSTM was trained. The implementation of NLSTM involves forward propagation of information and backward propagation of errors.

The step and the calculation formula of the forward propagation of the information of the t-th time interval are as follows:

a. each node completes the forward propagation of standard LSTM independently

f_it＝σ(net_f,i,t)＝σ(w_fh,i·h_i,t-1+w_fx,i·x_it+b_f,i) (75)

i_it＝σ(net_i,i,t)＝σ(w_ih,i·h_i,t-1+w_ix,i·x_it+b_i,i) (76)

a_it＝tanh(net_a,i,t)＝tanh(w_ah,i·h_i,t-1+w_ax,i·x_it+b_a,i) (77)

C_it＝f_it*C_i,t-1+i_it*a_it(78)

o_it＝σ(net_o,i,t)＝σ(w_oh,i·h_i,t-1+w_ox,i·x_it+b_o,i) (79)

h_it＝o_it*tanh(C_it) (80)

n_1it＝σ(net_n1,i,t)＝σ(w_n1h,i·h_i,t-1+w_n1x,i·x_it+b_n1,i) (83)

n_2it＝σ(net_n2,i,t)＝σ(w_n2h,i·h_i,t-1+w_n2x,i·x_it+b_n2,i) (84)

N_it＝n_1it*R_it+n_2it*h_it(85)

y_t＝σ(z_t)＝σ(w_y·N_mt+b_y) (86)

Wherein f is_it,i_it,a_it,C_it,o_it,h_it,R_it,N_itAnd y_itRespectively, a forgetting gate, an input gate, an information state, a cell state, an output gate, a hidden layer output, a central pivot, a neighborhood and a predicted value of the ith node in a time period t; n is_1itAnd n_2itAll are neighborhood gates of the ith node in the time period t; p_ijIs the jth child node of the ith node; tan h and σ are tan h and sigma activation functions, respectively; the symbols sum represent matrix multiplication and multiplication between matrix elements, respectively; the remaining variables are all intermediate variables.

The step and the calculation formula of the error back propagation in the t period are as follows:

a. defining the most common square error function as the target to be optimized

b. Calculating errors of output layers

m_ti＝β₁·m_ti-1+(1-β₁)·W_ti(107)

v_ti＝β₂·v_ti-1+(1-β₂)·(W_ti)²(108)

wherein E_tAs an error function, y_tAnd Y_tβ for predicted and observed values, respectively₁,β₂And Adam's parameters, default to 0.9,0.999 and 10, respectively^-8. ti is the current update times of the weight W, and is distinguished from the time period t.

The remaining variables are synonymous with the previously mentioned variables, and the previously non-mentioned variables are intermediate variables, and no specific meaning is required.

According to the formula, the predicted value is calculated by forward propagation, and then the updating weight is updated by backward propagation, which is called once updating. A total iteration of Ep rounds, each round of which will train set D^TaAnd (5) taking batches with the size of T for training, and finishing updating once in each batch.

FIG. 2 is a diagram illustrating an Equivalence Tree cause and effect relationship structure;

fig. 3 shows a structure diagram of the NLSTM network.

The use of the present invention is further described below in conjunction with specific experiments.

The method takes meteorological data of Xinjiang Fuziji sites as objects, and the data adopts the meteorological data of one month from 7 and 15 days in 2018 to 8 and 14 days in 2018. The data time step is 1 hour, 744 time periods are totally, 595 time periods before division are taken as a training set, and 149 time periods after division are taken as a test set. The meteorological data includes a total of 20 factors as shown in table 1. And selecting the values of the first two time periods of each factor as the characteristics of the current time period. Pearson's Correlation Coefficient (PCC) and Maximum Information Coefficient (MIC) among the 20 factors were calculated as shown in Table 2. Based on the correlation analysis, the respective factors were subjected to the glange causal relationship test, as shown in table 3. The causal relationships associated with the average wind speed (AWS,13) are plotted in fig. 4(a) and converted to two equivalent tree structures as shown in fig. 4(b) and (d) based on computational resource size.

To validate the predictive performance of NLSTM, the following four models were constructed to predict average wind speed and compared:

LSTM-1: the method adopts standard LSTM, and is characterized in that [5,6,7,8,9,11,15,13] is obtained without considering the cause and effect relationship;

② LSTM-2: the method adopts standard LSTM, and only takes the characteristics of [13 ];

③ NLSTM-1: the method adopts NLSTM, and adopts an equivalent tree causal relationship structure shown in FIG. 4 (b);

(iv) NLSTM-2: the method adopts NLSTM, and adopts an equivalent tree causal relationship structure shown in FIG. 4 (d);

to avoid randomness, 4 models were run 20 times each. Table 4 lists the evaluation indices for the four models to predict the average wind speed. The evaluation indexes adopt Root Mean Square Error (RMSE) and mean absolute error percentage (MAPE), and the smaller the two indexes are, the higher the prediction precision is. As can be seen from Table 4, the prediction accuracy of NLSTM-2 and NLSTM-1 is higher than that of both LSTM-1 and LSTM-2, which shows that the method NLSTM of the present invention is superior to standard LSTM. The higher prediction accuracy of NLSTM-2 compared with NLSTM-1 shows that the closer the equivalent tree causal relationship structure is to the real causal relationship structure, the higher the accuracy of the obtained wind speed prediction result is under the permission of computing resources. The difference in prediction accuracy of the 4 models can be seen more clearly in fig. 5.

TABLE 1 Meteorological factor description Table

TABLE 2 correlation analysis table for Xinjiang Fuyun station

。

TABLE 3 analysis table of causal relationship between Xinjiang Fuyuntang Glandujie

TABLE 4 comparison table of wind speed prediction indexes of Xinjiang Fuyun station

In the above embodiments, the implementation may be wholly or partially realized by software, hardware, firmware, or any combination thereof. When used in whole or in part, can be implemented in a computer program product that includes one or more computer instructions. When loaded or executed on a computer, cause the flow or functions according to embodiments of the invention to occur, in whole or in part. The computer may be a general purpose computer, a special purpose computer, a network of computers, or other programmable device. The computer instructions may be stored in a computer readable storage medium or transmitted from one computer readable storage medium to another, for example, the computer instructions may be transmitted from one website site, computer, server, or data center to another website site, computer, server, or data center via wire (e.g., coaxial cable, fiber optic, Digital Subscriber Line (DSL), or wireless (e.g., infrared, wireless, microwave, etc.)). The computer-readable storage medium can be any available medium that can be accessed by a computer or a data storage device, such as a server, a data center, etc., that includes one or more of the available media. The usable medium may be a magnetic medium (e.g., floppy Disk, hard Disk, magnetic tape), an optical medium (e.g., DVD), or a semiconductor medium (e.g., Solid State Disk (SSD)), among others.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.

Claims

1. A wind speed prediction method based on a neighborhood gate long and short term memory network is characterized in that the wind speed prediction method based on the neighborhood gate long and short term memory network respectively adopts Pearson correlation coefficient and maximum information coefficient to analyze linear and nonlinear correlation among variables to screen wind speed correlation factors;

analyzing the causal relationship of the wind speed and the wind speed factor in the statistical significance by utilizing the Glan's causal relationship on the basis of the correlation analysis; classifying the causal relationship structures, and unifying all types of causal relationships into an equivalent tree causal relationship structure by a decomposition-virtual variable-pruning method;

predicting the wind speed of the causal relationship structure of the equivalent tree through a long-term and short-term memory network model based on a neighborhood gate;

the wind speed prediction method based on the neighborhood gate length short-term memory network specifically comprises the following steps:

(2) Respectively analyzing linear and nonlinear correlations between wind speed and possible wind speed influence factors by using the Pearson correlation coefficient MIC and the maximum information coefficient MIC to obtain a wind speed correlation factor [ x ]₁,x₂,,…,x_n]The influence factors of which the absolute value of the Pearson correlation coefficient or the maximum information coefficient of the wind speed is more than 0.5 are taken as the wind speed correlation factors;

(3) analysis of wind speed Y and related factor of wind speed [ x ] by using Glanberg causal relationship₁,x₂,,…,x_n]Causal relationships in a statistical sense;

(4) dividing the causal relationship structure into five structures including a central hub, a chain structure, a ring structure, a tree structure and a network structure according to the shape of the causal relationship between the wind speed and the wind speed related factors; the central hub structure and the chain structure are special cases of the horizontal expansion and the vertical expansion of the tree structure respectively, the ring structure is decomposed into a series of chain structures, and the central hub structure, the chain structure and the ring structure are all converted into tree causal structures;

(5) decomposing a network structure into a plurality of chain structures from the wind speed according to the inverse causal relationship, replacing and distinguishing factors existing in a plurality of decomposition lines by using virtual variables, combining all the decomposition lines into a tree structure, and pruning the tree structure according to the size of calculation resources to obtain a final equivalent tree structure, so that all types of causal relationship structures are converted into equivalent tree causal relationship structures;

(6) constructing a training set D consisting of a wind speed factor and a wind speed according to the causal relationship structure of the equivalent tree^Ta＝[x^Ta,Y^Ta]And test set D consisting of predictor only^Te＝[x^Te]And carrying out normalization processing on the data;

(7) constructing a long-short term memory network NLSTM based on a neighborhood gate according to the causal relationship of an equivalent tree, and setting parameters of the NLSTM, including the number n of nodes of an input layer_iNumber of hidden layer nodes n_hNumber of nodes of output layer n_oFixed learning rate η, batch size T, number of training rounds Ep;

(8) adam optimization algorithm combined with mini-batch mechanism is adopted in training set D^TaTraining NLSTM;

2. The wind speed prediction method based on the neighborhood gate length short-term memory network as claimed in claim 1, wherein in the step (8), the step of forward propagation of the information of the t-th time period and the calculation formula are as follows:

a. each node completes the forward propagation of standard LSTM independently

f_it＝σ(net_f,i,t)＝σ(w_fh,i·h_i,t-1+w_fx,i·x_it+b_f,i) (1)

i_it＝σ(net_i,i,t)＝σ(w_ih,i·h_i,t-1+w_ix,i·x_it+b_i,i) (2)

a_it＝tanh(net_a,i,t)＝tanh(w_ah,i·h_i,t-1+w_ax,i·x_it+b_a,i) (3)

C_it＝f_it*C_i,t-1+i_it*a_it(4)

o_it＝σ(net_o,i,t)＝σ(w_oh,i·h_i,t-1+w_ox,i·x_it+b_o,i) (5)

h_it＝o_it*tanh(C_it) (6)

n_1it＝σ(net_n1,i,t)＝σ(w_n1h,i·h_i,t-1+w_n1x,i·x_it+b_n1,i) (9)

n_2it＝σ(net_n2,i,t)＝σ(w_n2h,i·h_i,t-1+w_n2x,i·x_it+b_n2,i) (10)

N_it＝n_1it*R_it+n_2it*h_it(11)

y_t＝σ(z_t)＝σ(w_y·N_mt+b_y) (12)

3. The wind speed prediction method based on the neighborhood gate length short-term memory network as claimed in claim 1, wherein in the step (8), the step of error back propagation in the t-th period and the calculation formula are as follows:

a. defining the most common square error function as the target to be optimized

b. Calculating errors of output layers

Wherein E_tAs an error function, y_tAnd Y_tRespectively, predicted values and observed values.

4. The wind speed prediction method based on the neighborhood gate length short term memory network as claimed in claim 3, wherein the step of updating the weight of the t-th time period comprises:

using Adam optimization algorithm with [ w ]_Lh,i,w_Lx,i,b_L,i]And [ w_y,b_y]To update [ w_Lh,w_Lx,b_L]And [ w_y,b_y](ii) a For generality, the weight is denoted by the symbol W, the gradient of the weight is denoted by W, and the general formula for Adam to update the weight is:

m_ti＝β₁·m_ti-1+(1-β₁)·W_ti(33)

v_ti＝β₂·v_ti-1+(1-β₂)·(W_ti)²(34)

β₁,β₂and Adam's parameters, default to 0.9,0.999 and 10, respectively^-8(ii) a ti is the current update times of the weight W, and is distinguished from the time period t;

5. A computer-readable storage medium comprising instructions that, when executed on a computer, cause the computer to perform the method for wind speed prediction based on a neighborhood gate short term memory network of any of claims 1-4.