CN111667098B

CN111667098B - Wind power station output power prediction method based on multi-model combination optimization

Info

Publication number: CN111667098B
Application number: CN202010405975.8A
Authority: CN
Inventors: 曾亮; 狄飞超; 王珊珊; 刘哲; 舒文强; 邹心怡; 陈新彦; 杨文戈; 雷舒敏
Original assignee: Hubei University of Technology
Current assignee: Hubei University of Technology
Priority date: 2020-05-14
Filing date: 2020-05-14
Publication date: 2023-04-18
Anticipated expiration: 2040-05-14
Also published as: CN111667098A

Abstract

The invention provides a wind power station output power prediction method based on multi-model combination optimization. Acquiring meteorological influence factor data and wind power data of a target wind power station, sequentially performing data missing value completion and data normalization, and analyzing the association degree of the meteorological influence factor data and the wind power data through a grey association degree analysis method; further establishing a support vector machine prediction model, and optimizing parameters through a grey wolf group optimization algorithm; taking the optimized support vector machine model as a weak regressor, constructing a strong regressor by establishing an AdaBoost integration model, and predicting by using the strong regressor to obtain a training set wind power predicted value sequence; further constructing a gated cycle unit network to predict the error sequence E; and superposing the corrected prediction error value and the prediction result of the AdaBoost integrated prediction model to obtain the final prediction result. The method improves the prediction efficiency and the prediction precision of the wind power prediction model, and is beneficial to scientific scheduling and grid-connected operation safety and stability of the power system.

Description

Wind power station output power prediction method based on multi-model combination optimization

Technical Field

The invention belongs to the technical field of wind power generation, and relates to a wind power station output power prediction method based on multi-model combination optimization.

Background

Along with the rapid development of economy, the demand of each country on energy sources is greater and greater, and the requirement on the environment is higher and higher. The wind energy is used as a clean energy, so that the outstanding problems of energy shortage, environmental pollution, greenhouse effect and the like at present can be effectively relieved; meanwhile, as a renewable energy source, the wind energy has a wide development prospect under the background that fossil energy is gradually exhausted. Data shows that in 2018, the newly added grid-connected wind power installed capacity in China is 2059 thousands of kilowatts, the accumulated grid-connected installed capacity reaches 1.84 hundred million kilowatts, and the accumulated grid-connected installed capacity accounts for 9.7 percent of the total installed capacity of power generation. In 2019, in 1-6 months, 909 million kilowatts of wind power installed capacity are newly added in China, wherein 40 million kilowatts of offshore wind power are added, and the accumulated grid-connected installed capacity reaches 1.93 hundred million kilowatts.

Due to uncertainty, volatility and intermittency of wind power, when wind power generation is carried out, severe examination can be brought to safe and stable operation of a power grid and safety and reliability of grid connection of a power system. The method has great significance for reducing the influence of wind power generation on a power system and realizing accurate prediction of wind power of a wind power plant.

Disclosure of Invention

The invention aims to provide a wind power station output power prediction method based on multi-model combination optimization, so as to improve the prediction precision of wind power output power and realize scientific scheduling and optimized operation of a power system.

In order to solve the above technical problem, an embodiment of the present invention provides the following technical solutions:

a wind power station output power prediction method based on multi-model combination optimization predicts wind power generation power in a future time period according to historical wind power characteristic data.

The wind power station output power prediction method based on the combined model comprises the following steps:

step 1, acquiring meteorological influence factor data and wind power data of a target wind power station, sequentially supplementing the meteorological influence factor data and the wind power data through data missing values and a data normalization method to obtain preprocessed meteorological influence factor data and preprocessed wind power data, taking the preprocessed meteorological influence factor data as a comparison array, taking the preprocessed wind power data as a reference array, and obtaining the association degree of each meteorological influence factor and the wind power through a grey association degree analysis method;

step 2, sorting the relevance degrees of the meteorological influence factors from large to small, taking the meteorological influence factors with the relevance degrees larger than a preset threshold value as weather influence factors after feature selection, taking the meteorological influence factors after feature selection as a training set, and further establishing a support vector machine prediction model;

step 3, respectively optimizing the punishment parameters of the support vector machine and the kernel function parameters of the support vector machine through a grey wolf colony optimization algorithm to obtain the punishment parameters of the optimized support vector machine and the kernel function parameters of the optimized support vector machine;

step 4, dividing the meteorological influence factors after feature selection into a training set and a test set according to a certain proportion, taking an optimized support vector machine model as a weak regressor, initializing sample weights, obtaining a plurality of weak regressors, calculating relative errors of each sample in the training set, calculating error rates of the plurality of weak regressors by combining the relative errors, further calculating weight coefficients of the weak regressors to update the sample weights, establishing an AdaBoost integration model to construct a strong regressor, optimizing the number, learning rate and loss function models of the weak regressors by a grid search method, predicting each sample in the training set by using the strong regressor, obtaining a predicted value sequence of wind power in the training set, and further calculating an error sequence;

step 5, constructing a gated circulation unit network, predicting an error sequence E through the gated circulation unit network, taking the meteorological factor with the selected characteristics as input data of the gated circulation unit network during error correction, and optimizing parameters of the gated circulation unit network according to a loss function model of the gated circulation unit network to obtain the optimized gated circulation unit network;

step 6, overlapping the corrected prediction error value with a prediction result of the AdaBoost integrated prediction model to obtain a final prediction result;

preferably, the preprocessed meteorological influence factor data in step 1 is:

X _i (k)，i∈[1，n]，k∈[1，m]

wherein, X _i (k) Representing kth point data in the ith preprocessed meteorological influence factor, wherein n is the type number of the meteorological influence factors, and m is the number of data points in each meteorological factor;

step 1, the preprocessed wind power data are as follows:

X _o (k)，k∈[1，m]

wherein, X _o (k) Representing preprocessed wind power data when the kth point data is selected from the preprocessed meteorological influence factor data, wherein m is the number of data points in each meteorological factor;

selecting kth point data from the preprocessed meteorological influence factor data as follows:

{X ₁ (k)，X ₂ (k)，...，X _n (k)}

step 1, obtaining the correlation of each weather influence factor by a grey correlation analysis method, specifically:

calculating a gray correlation coefficient:

i∈[1，n]，k∈[1，m]

wherein, X _i (k) Represents the kth point data theta in the ith preprocessed meteorological influence factor _i (k) Representing the gray correlation coefficient, X, of the kth point in the weather influence factor after the ith preprocessing _o (k) Representing the preprocessed wind power data when the kth point data is selected from the preprocessed meteorological influence factor data, wherein n is the type number of the meteorological influence factors, m is the number of data points in each meteorological factor, rho is a resolution coefficient,

represents the two-stage minimum difference between the wind power and the ith meteorological influence factor, and is used for judging whether the wind power is greater than or equal to the preset value>

Representing the two-stage maximum difference between the wind power and the ith meteorological influence factor;

step 1, the correlation degree of each meteorological influence factor is as follows:

γ _i ∈[0，1]

wherein, gamma is _i Correlation degree of ith meteorological influence factor and wind power, gamma _i The smaller the error between the power value and 1 is, the higher the correlation degree between the ith meteorological influence factor and the wind power is;

preferably, the meteorological influence factor data after the feature selection in step 2 is:

X _j (k)，j∈[1，c]，k∈[1，m]，c＜n

wherein, X _j (k) Representing kth point data in the meteorological influence factors after the jth characteristic selection, wherein c is the type number of the meteorological influence factors after the characteristic selection, and m is the number of data points in each meteorological factor;

simultaneously, a relaxation factor xi is introduced to avoid the relaxation factor xi _i If the size is too large, adding a penalty factor C, and supporting the target function expression of a vector machine prediction model as follows:

wherein, C represents the punishment parameter of the support vector machine as the parameter to be optimized, xi _i Represents the relaxation factor, x, of the ith sample data _i Represents input sample data, y _i Representing a label, b representing a displacement term, determining the distance between the hyperplane and the origin, ω representing a normal vector, determining the direction of the hyperplane,

kernel function K (x) of support vector machine _p ，x _q ) The method is characterized by comprising the following steps of (1) selecting a Gaussian radial basis kernel function (RBF) as follows:

x _p ＝[X ₁ (p) X ₂ (p) ... X _j (p) ... X _c (p)]

x _q ＝[X ₁ (q) X ₂ (q) ... X _j (q) ... X _c (q)]

wherein x is _p Denotes the p-th sample data, x _q It is indicated that the q-th sample data,

representing the Euclidean distance between two points of the square of the distance between the two points, σ being the width of the function, g =1/2 σ ² Representing the kernel function parameter of the support vector machine as a parameter to be optimized;

preferably, in step 3, the penalty parameter of the support vector machine and the kernel function parameter of the support vector machine are optimized by a grey wolf group optimization algorithm, respectively, as follows:

step 3.1, initializing algorithm parameters of the gray wolf colony optimization algorithm: the population scale N, the maximum iteration times maxgen, the penalty parameter C, the value range of the kernel function parameter g and the position of the wolf individual;

step 3.2, randomly generating a population of N wolf individuals according to the population scale N, wherein the positions of the N wolf individuals D are both composed of a penalty parameter C and a kernel function parameter g, and random initialization is carried out according to the value ranges of the penalty parameter C and the kernel function parameter g;

the positions of the N individual wolfs are as follows in sequence: (C) _b ，g _b )，b∈[1，N]；

Step 3.3, the mean square error MSE is used as a fitness function, the population of N wolf individuals is divided into four different levels of alpha, beta, delta and omega, and the positions of the wolfs are respectively updated according to the fitness function value and the fitness function value relations of the alpha wolf, the beta wolf, the delta wolf and the omega wolf;

predicting the meteorological influence factors after the feature selection through the SVM model established in the step 2 to obtain a wind power predicted value

The fitness function is the sum of the mean square errors of the predicted value of the wind power and the actual value of the wind power, and specifically comprises the following steps:

k∈[1，S]S＜m

wherein, X _o (k) Selecting the actual value of the wind power when the k point data is selected for the meteorological influence factor data after each characteristic is selected,

when the kth point data is selected for the meteorological influence factor data after the features are selected, the wind power predicted value predicted by a support vector machine prediction model is obtained, S is the number of wind power data points in a training set, and m is the number of the wind power data points;

and 3.4, updating the direction and the position of the omega wolf searching individual in the grey wolf group according to a formula, wherein the specific formula is as follows:

D _s (t)＝|C _u X _s (t)-X _u (t)|u＝1，2，3，s＝α，β，δ

X _u (t+1)＝X _s (t)-A _u |C _u X _s (t)-X _u (t)|u＝1，2，3，s＝α，β，δ

where t represents the current number of iterations, D _s (t) watchShowing the distances between the alpha, beta and delta level wolfs and the omega level wolf, X _s (t) represents the current position of the alpha-level wolf, the beta-level wolf, the delta-level wolf, X _u (t) represents the current position of the omega level wolf, X _u (t + 1) represents the alpha-level wolf, the beta-level wolf, the delta-level wolf determines the position of the next movement of the omega-level wolf, and X (t + 1) represents the position of the next moment of the omega-level wolf;

and 3.5, updating parameters alpha, A and C according to a formula, and determining a new position of an alpha-level wolf, a new position of a beta-level wolf and a new position of a delta-level wolf according to the updated optimal fitness function value, wherein the specific formula is as follows:

A＝α*(2r ₁ -1)

C＝2r ₂

wherein, denoted as matrix product, A, C is the first cooperative coefficient and the second cooperative coefficient respectively, a is linearly decreased from 2 to 0 in the whole iteration process, r1 and r2 are the first random vector and the second random variable between [0,1 ];

and 3.6, if the maximum iteration times maxgen are exceeded, terminating the algorithm, outputting the global optimal positions in the N wolf group individuals, namely the penalty parameter C of the optimized support vector machine ^* And kernel function parameter g of optimized support vector machine ^* Otherwise, skipping to the step 3.4 to continue optimization;

step 4, the weight of the initialization sample is w _zi ；

Step 4, the plurality of weak regressors are z weak regressors F _z (x)；

Step 4, calculating the relative error of each sample in the training set, specifically as follows:

wherein i is E [1,c]，w _zi Weight of the kth sample representing the z-th weak regressor, e _zi Expressed as the relative error of the kth sample of the z-th weak regressor, X _o (k) After selecting for each featureWind power actual value F when the k point data is selected from the meteorological influence factor data _z (X _i (k) Is the predicted value of the wind power of the z-th weak regressor at the k-th data point, E _z The maximum error of the z-th weak regressor on the training set;

and 4, calculating error rates of a plurality of weak regressors by combining the relative errors, wherein the error rates are as follows:

according to the relative error, the error rate e of the z-th weak regressor can be obtained _kz The method comprises the following steps:

step 4, further calculating the weight coefficient of the weak regressor to update the sample weight as follows:

determining the weight coefficient alpha of a weak regressor _z The method comprises the following steps:

the sample weights are updated as follows:

wherein, ω is _z+1，i Sample weight coefficient, Z, representing the Z +1 th weak regressor _a In order to normalize the factors, the method comprises the steps of,

and 4, constructing a strong regressor by establishing an AdaBoost integration model: superposing the product of the weight of each weak regression and the predicted value to obtain the final strong regression;

and 4, predicting each sample in the training set by using a strong regressor to obtain a training set wind power predicted value sequence as follows:

the training set is the meteorological influence factor after the feature selection in the step 2;

training a wind power predicted value sequence, which is specifically as follows:

the sequence of the predicted values of the wind power is tested as follows:

wherein S represents the number of wind power data points in the training set, m represents the number of wind power data points,

represents the sequence of k wind power predicted values in the training set, and>

representing k wind power predicted value sequences, X, in a test set _f (k) Represents the k wind power predicted value, X, in the training set _g (k) Representing the k wind power predicted value in the test set,

step 4, the calculation error sequence is as follows:

calculating a training set predicted power value

Historical wind power value X of wind power station _o (k) The difference in (c) is denoted as error sequence E, as follows:

preferably, the step 5 of constructing the gated cyclic unit network is as follows:

constructing a gated circulation unit network through an input layer, a plurality of gated circulation unit layers and an output layer;

an updating gate and a resetting gate corresponding to each gating circulation unit layer exist on each gating circulation unit layer;

the update gate determines information that was retained to the current time at a previous time, and the reset gate determines the extent to which information was discarded;

the updating gate and the placing gate are sequentially defined as follows:

Z _t ＝σ(W _Z X _t +U _Z h _t - ₁ +b _Z )

r _t ＝σ(W _r X _t +U _r h _t-1 +b _r )

wherein z is _t To refresh the door, r _t To reset the gate, X _t Input representing the current time, W _Z And U _Z Respectively, updating the weight matrix of the gate, W _r And U _r Weight matrices, b, of reset gates, respectively _Z And b _r Respectively are offset vectors, and sigma is a sigmoid activation function;

the hyper-parameters of the gated loop unit include: the number of layers of the neural network, the number of neurons in each layer and the dropout rate; wherein the dropout rate is a proportion of the number of discarded neurons and is used for inhibiting an overfitting phenomenon of the model, and the dropout belongs to (0,1);

by setting the value range of the parameters to be optimized simultaneously, finding out the corresponding parameters, namely the optimal number of neural network layers, the optimal number of neurons in each layer and the optimal dropout rate of the GRU model according to the minimum principle of a loss function (MAE) on a training set;

step 5, the loss function model of the gated cyclic unit network is an average absolute error, and the loss function model of the gated cyclic unit network specifically includes the following steps:

k∈[1，S] S＜m

selecting a wind power predicted value predicted by the gate control cycle unit network when the kth point data is selected for the meteorological influence factor data after each feature is selected;

and 5, optimizing the parameters of the gated cyclic unit network as follows: optimizing and solving the gating circulation unit network through an Adam algorithm to obtain a weight matrix Q of the updated gate after optimization _Z 、U* _Z Weight matrix W of reset gate after optimization _r 、U* _r Updating the offset vector b of the gate after optimization _*z Bias vector b of reset gate after optimization _r ；

After parameter optimization is carried out on the training set, prediction is carried out on the test set to obtain a predicted value sequence of the test set errors, and the predicted value sequence is recorded as

The following were used:

preferably, the final prediction result in step 6 is as follows:

wherein X' (k) is the final prediction result of k wind power in the test set,

for a sequence of predictors in a test set error>

And m-S is the number of samples of the test set.

Compared with the prior art, the invention has the following advantages:

the innovative combined model prediction scheme is as follows: because the single model has low prediction accuracy, a large error exists. The invention innovatively applies a technical scheme of predicting five model combinations of gray correlation analysis GRA, support vector machine SVM, gray wolf group optimization algorithm GWOO, adaBoost integration and gating circulation unit GRU, and improves the prediction precision of wind power to a certain extent.

And performing feature selection by utilizing a Gray Relevance Analysis (GRA), and selecting the first few influence factors with larger influence from a plurality of features as the input of the model, thereby improving the prediction efficiency of the model.

The parameters of the SVM prediction model are solved by constructing a complete optimization problem model, and a gray wolf group optimization algorithm GWO is adopted in the solving method.

And (3) constructing a strong regression by using an AdaBoost integration model, integrating on the basis of the shallow model, and improving the prediction precision of the model.

And finally, predicting the error by using a gated circulation unit GRU, and superposing the corrected predicted error and the predicted value of the wind power so as to further improve the prediction precision of the model.

The invention has the beneficial technical effects that:

according to the invention, the accurate prediction of the wind power is realized through the GRA-SVM-GWO-AdaBoost-GRU combined model, and the safety and stability of scientific scheduling and grid-connected operation of the power system are facilitated.

Drawings

FIG. 1: is a flow chart of the method of the present invention.

FIG. 2 is a schematic diagram: the invention provides a flow chart of a method for optimizing penalty parameters and kernel function parameters in a support vector machine by a wolf group optimization algorithm.

FIG. 3: the invention provides a comparison graph of regression prediction data and original data obtained by predicting wind power by a combined model.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention. In addition, the technical features involved in the embodiments of the present invention described below may be combined with each other as long as they do not conflict with each other.

In order to verify the effectiveness of the model and the method provided by the invention, the method is applied to a Germany wind power plant for short-term wind power prediction, in order to prove the effectiveness and generalization capability of the model, the data of two time periods are respectively predicted, wind power experimental data of 2016 year 1, month 1 and day 10 and 2016 year 11, month 11 and day 18 are respectively selected for simulation experiments, and the sampling interval between experimental data points is 5 minutes. In the first section of data, the data (2021 data points) of the first 7 days are used as a training set, the data (671 data points) of the last 3 days are used as a test set, and the wind power in the next 3 days is predicted; and the second stage data also adopts the data of the first 7 days as a training set to predict the wind power within 1 day in the future.

FIG. 1 shows a flow chart of the method of the present invention.

The following describes a specific embodiment of the present invention with reference to fig. 1 to 3, and a method for predicting output power of a wind power plant based on multi-model combination optimization specifically includes the following steps:

step 1, acquiring meteorological influence factor data and wind power data of a target wind power station, sequentially supplementing the meteorological influence factor data and the wind power data with missing data values and performing a data normalization method to obtain preprocessed meteorological influence factor data and preprocessed wind power data, taking the preprocessed meteorological influence factor data as a comparison series, taking the preprocessed wind power data as a reference series, and obtaining the association degree of each meteorological influence factor and the wind power through a gray association degree analysis method;

step 1, the preprocessed meteorological influence factor data are as follows:

X _i (k)，i∈[1，n]，k∈[1，m]

step 1, the preprocessed wind power data are as follows:

X _o (k)，k∈[1，m]

wherein X _o (k) Representing the preprocessed wind power data when the kth point data is selected from the preprocessed meteorological influence factor data, wherein m is the number of data points in each meteorological factor;

selecting the kth point data from the preprocessed meteorological influence factor data as follows:

{X ₁ (k)，X ₂ (k)，...，X _n (k)}

calculating a gray correlation coefficient:

i∈[1，n]，k∈[1，m]

wherein, X _i (k) Represents the kth point data in the meteorological influence factor after the ith preprocessing, theta _i (k) Representing the gray correlation coefficient, X, of the kth point in the weather influence factor after the ith preprocessing _o (k) Representing the preprocessed wind power data when the kth point data is selected from the preprocessed meteorological influence factor data, wherein n is the type number of the meteorological influence factors, m is the number of data points in each meteorological factor, rho is a resolution coefficient,

represents the two-stage minimum difference between the wind power and the i-th weather influence factor>

γ _i ∈[0，1]

step 2, the meteorological influence factor data after the feature selection is as follows:

X _j (k)，j∈[1，c]，k∈[1，m]，c＜n

wherein X _j (k) Representing kth point data in the meteorological influence factors after the jth characteristic selection, wherein c is the type number of the meteorological influence factors after the characteristic selection, and m is the number of data points in each meteorological factor;

wherein, C represents the punishment parameter of the support vector machine as the parameter to be optimized, xi _i Represents the relaxation factor, x, of the ith sample data _i Representing input sample data, y _i Representing a label, b representing a displacement term, determining the distance between the hyperplane and the origin, ω representing a normal vector, determining the direction of the hyperplane,

kernel function K (x) of support vector machine _p ，x _q ) The method is characterized by comprising the following steps of (1) selecting a Gaussian Radial Basis Function (RBF) as follows:

x _p ＝[X ₁ (p) X ₂ (p) ... X _j (p) ... X _c (p)]

x _q ＝[X ₁ (q) X ₂ (q) ... X _j (q) ... X _c (q)]

wherein x is _p Denotes the p-th sample data, x _q It means that the q-th sample data,

preferably, the preset threshold is 0.9;

and 3, respectively optimizing the punishment parameters of the support vector machine and the kernel function parameters of the support vector machine through a grey wolf colony optimization algorithm to obtain the punishment parameters C of the optimized support vector machine ^* And kernel function parameter g of optimized support vector machine ^* ；

As shown in fig. 2, in step 3, the penalty parameter of the support vector machine and the kernel function parameter of the support vector machine are respectively optimized by the gray wolf group optimization algorithm as follows:

step 3.1, initializing algorithm parameters of a graying wolf colony optimization algorithm: the population scale N, the maximum iteration times maxgen, the penalty parameter C, the value range of the kernel function parameter g and the position of the wolf individual;

the positions of the N grey wolf individuals are as follows in sequence: (C) _b ，g _b )，b∈[1，N]；

Step 3.3, taking the mean square error MSE as a fitness function, dividing the population of N wolf individuals into four different levels of alpha, beta, delta and omega, and respectively updating the positions of the wolfs according to the fitness function value and the fitness function value relations of the alpha wolf, the beta wolf, the delta wolf and the omega wolf;

k∈[1，S] S＜m

D _s (t)＝|C _u X _s (t)-X _u (t)|u＝1，2，3，s＝α，β，δ

where t represents the current number of iterations, D _s (t) represents the distance between the alpha-level wolf, the beta-level wolf and the delta-level wolf and the omega-level wolf respectively in the t iteration, X _s (t) represents the current position of the alpha-level wolf, the beta-level wolf, the delta-level wolf, X _u (t) represents the current position of the omega level wolf, X _u (t + 1) represents the alpha-level wolf, the beta-level wolf, the delta-level wolf determines the position of the next movement of the omega-level wolf, and X (t + 1) represents the position of the next moment of the omega-level wolf;

step 3.5, parameters alpha, A and C are updated according to a formula, a new position of the alpha-level wolf, a new position of the beta-level wolf and a new position of the delta-level wolf are determined according to the updated optimal fitness function value, and the specific formula is as follows:

A＝α*(2r ₁ -1)

C＝2r ₂

wherein, denotes as matrix product, A, C is the first and second cooperative coefficient respectively, a is reduced from 2 to 0 in whole iteration process, r1 and r2 are the first and second random vector between [0,1 ];

step 4, dividing the meteorological influence factors after feature selection into a training set and a test set according to a certain proportion, taking an optimized support vector machine model as a weak regressor, initializing sample weights, obtaining a plurality of weak regressors, calculating the relative error of each sample in the training set, calculating the error rates of the plurality of weak regressors by combining the relative errors, further calculating the weight coefficients of the weak regressors to update the sample weights, establishing an AdaBoost integration model to construct a strong regressor, optimizing the number, the learning rate and the loss function model of the weak regressors by a grid search method, predicting each sample in the training set by the strong regressor, obtaining a predicted value sequence of the wind power in the training set, and further calculating an error sequence;

step 4, the weight of the initialization sample is w _zi ；

Step 4, the plurality of weak regressors are z weak regressors F _z (x)；

wherein i is E [1,c]，w _zi Weight of the kth sample representing the z-th weak regressor, e _zi Expressed as the relative error of the kth sample of the z-th weak regressor, X _o (k) Selecting the actual value of wind power F when the k point data is selected for the meteorological influence factor data after each characteristic is selected _z (X _i (k) Is the predicted value of the wind power of the z-th weak regressor at the k-th data point, E _z The maximum error of the z-th weak regressor on the training set;

determining weight coefficients alpha of a weak regressor _z The method comprises the following steps:

the sample weights are updated as follows:

step 4, establishing an AdaBoost integration model to construct a strong regressor: superposing the product of the weight of each weak regression and the predicted value to obtain the final strong regression;

the sequence of the predicted values of the wind power is tested as follows:

represents a sequence of k wind power prediction values in the training set, in combination>

Representing k wind power predicted value sequences, X, in a test set _f (k) Represents the k wind power predicted value, X, in the training set _g (k) Representing the k wind power predicted value in the test set，/>

Step 4, calculating an error sequence as follows:

calculating a training set predicted power value

step 5, constructing a gated circulation unit network, predicting an error sequence E through the gated circulation unit network, taking meteorological factors with selected characteristics as input data of the gated circulation unit network during error correction, optimizing parameters of the gated circulation unit network according to a loss function model of the gated circulation unit network to obtain the optimized gated circulation unit network

And 5, constructing a gating cycle unit network:

the updating gate and the placing gate are sequentially defined as follows:

Z _t ＝σ(W _Z X _t +U _Z h _t-1 +b _Z )

r _t ＝σ(W _r X _t +U _r h _t-1 +b _r )

the hyper-parameters of the gated loop unit include: the number of neural network layers, the number of neurons in each layer and the dropout rate; wherein the dropout rate is a proportion of the number of discarded neurons and is used for inhibiting an overfitting phenomenon of the model, and the dropout belongs to (0,1);

step 5, the loss function model of the gated cyclic unit network is an average absolute error, and the loss function model of the gated cyclic unit network specifically comprises the following steps:

k∈[1，S] S＜m

and 5, optimizing the parameters of the gated loop unit network as follows: optimizing and solving the gate control cycle unit network through an Adam algorithm to obtain a weight matrix W of the updated gate after optimization _Z 、U* _Z Weight matrix W of reset gate after optimization _r 、U* _r Updating bias vector b of gate after optimization _Z Bias vector b of reset gate after optimization _r ；

The following were used:

/>

and 6, superposing the corrected prediction error value and the prediction result of the AdaBoost integrated prediction model to obtain a final prediction result, wherein the method specifically comprises the following steps:

wherein X' (k) is the final prediction result of k wind power in the test set,

for the predictor sequence of the test set error, ->

And m-S is the number of samples of the test set.

Fig. 3 is a comparison diagram of regression prediction data obtained by predicting wind power by applying the combined model of the invention according to the third embodiment of the invention and original data. The black curve in the graph is a prediction curve graph of the method, the blue curve is a real value of the wind power, and the other color curves are prediction curves of the comparison model.

The feasibility and the accuracy of the method are verified by carrying out comparison research by totally adopting 5 models, wherein the model 1 is a support vector machine (SVR), the model 2 is a gray relevance analysis-support vector machine (GRA-SVR), the model 3 is a gray relevance analysis-Grey wolf group optimization algorithm-support vector machine (GRA-GWO-SVR), the model 4 is a gray relevance analysis-Grey wolf group optimization algorithm-support vector machine integration model (GRA-GWO-SVR-AdaBoost), and the model 5 is a gray relevance analysis-Grey wolf group optimization algorithm-support vector machine integration-gating cycle unit (GRA-GWO-SVR-AdaBoost-GRU).

Specific evaluation indexes of the 5 models are shown in table 1.

TABLE 1 evaluation indexes of 5 different prediction models in two different time periods

As can be seen from Table 1, the prediction accuracy of the model and method provided by the invention is very high, the root mean Square error RMSE is about 0.1, the average absolute error MAE is about 0.07, and the R-Square exceeds 0.995. This demonstrates the effectiveness and superiority of the proposed model and method of the present invention.

In conclusion, the invention provides the wind power station output power prediction method based on the combined model of GRA-SVM-GWO-AdaBoost-GRU, so that the accurate prediction of the short-term output power of the wind power station is realized, and the scientific scheduling of a power system and the safety and stability of grid-connected operation are facilitated.

The above-mentioned embodiments only express the embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the present invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present patent should be subject to the appended claims.

Claims

1. A wind power station output power prediction method based on multi-model combination optimization is characterized by comprising the following steps:

and 6, overlapping the corrected prediction error value with the prediction result of the AdaBoost integrated prediction model to obtain a final prediction result.

2. The multi-model combinatorial optimization-based wind power plant output power prediction method of claim 1, characterized in that:

step 1, the preprocessed meteorological influence factor data are as follows:

X _i (k)，i∈[1，n]，k∈[1，m]

wherein X _i (k) Representing kth point data in the ith preprocessed meteorological influence factor, wherein n is the type number of the meteorological influence factors, and m is the number of data points in each meteorological factor;

step 1, the preprocessed wind power data are as follows:

X _o (k)，k∈[1，m]

wherein, X _o (k) Representing the preprocessed wind power data when the kth point data is selected from the preprocessed meteorological influence factor data, wherein m is the number of data points in each meteorological factor;

{X ₁ (k)，X ₂ (k)，...，X _n (k)}

calculating a grey correlation coefficient:

/>

i∈[1，n]，k∈[1，m]

wherein, X _i (k) Represents the kth point data in the meteorological influence factor after the ith preprocessing, theta _i (k) Representing the k point gray correlation coefficient, X, in the meteorological influence factor after the i type of pretreatment _o (k) Representing the preprocessed wind power data when the kth point data is selected from the preprocessed meteorological influence factor data, wherein n is the type number of the meteorological influence factors, m is the number of data points in each meteorological factor, rho is a resolution coefficient,

representing the two-stage minimum difference between the wind power and the ith meteorological influence factor,

γ _i ∈[0，1]

wherein, γ _i Correlation degree of the ith meteorological influence factor and wind power, gamma _i The smaller the error between 1, the higher the correlation degree of the ith meteorological influence factor and the wind power.

3. The wind power plant output power prediction method based on multi-model combinatorial optimization according to claim 1, characterized in that:

X _j (k)，j∈[1，c]，k∈[1，m]，c＜n

at the same time, a relaxation factor xi is introduced to avoid the relaxation factor xi _i If the size is too large, adding a penalty factor C, and supporting the target function expression of a vector machine prediction model as follows:

wherein, C represents the punishment parameter of the support vector machine as the to-be-searchedOptimization parameter xi _i Represents the relaxation factor, x, of the ith sample data _i Representing input sample data, y _i Representing a label, b representing a displacement term, determining the distance between the hyperplane and the origin, ω representing a normal vector, determining the direction of the hyperplane,

x _p ＝[X ₁ (p)X ₂ (p)...X _j (p)...X _c (p)]

x _q ＝[X ₁ (q)X ₂ (q)...X _j (q)...X _c (q)]

representing the Euclidean distance between two points of the square of the distance between the two points, σ being the width of the function, g =1/2 σ ² And representing the kernel function parameters of the support vector machine as the parameters to be optimized.

4. The multi-model combinatorial optimization-based wind power plant output power prediction method of claim 1, characterized in that:

in the step 3, the penalty parameter of the support vector machine and the kernel function parameter of the support vector machine are respectively optimized by a gray wolf group optimization algorithm as follows:

k∈[1，S]S＜m

wherein, X _o (k) Selecting the actual value of the wind power when the kth point data is selected for the meteorological influence factor data after each characteristic is selected,

D _s (t)＝|C _u X _s (t)-X _u (t)|u＝1，2，3，s＝α，β，δ

where t represents the current number of iterations, D _s (t) represents the distance between the alpha-level wolf, the beta-level wolf and the delta-level wolf and the omega-level wolf respectively in the t-th iteration, X _s (t) represents the current position of the alpha-level wolf, the beta-level wolf, the delta-level wolf, X _u (t) represents the current position of the omega level wolf, X _u (t + 1) represents the alpha-level wolf, the beta-level wolf, the delta-level wolf determines the position of the next movement of the omega-level wolf, and X (t + 1) represents the position of the next moment of the omega-level wolf;

A＝α*(2r ₁ -1)

C＝2r ₂

and 3.6, if the maximum iteration times maxgen are exceeded, terminating the algorithm, outputting the global optimal positions in the N wolf group individuals, namely the penalty parameter C of the optimized support vector machine ^* And kernel function parameter g of optimized support vector machine ^* And otherwise, skipping to step 3.4 to continue optimization.

5. The multi-model combinatorial optimization-based wind power plant output power prediction method of claim 1, characterized in that:

step 4, the weight of the initialization sample is w _zi ；

Step 4, the plurality of weak regressors are z weak regressors F _z (x)；

wherein i is E [1,c]，w _zi Weight of the kth sample representing the z-th weak regressor, e _zi Expressed as the relative error of the kth sample of the z-th weak regressor, X _o (k) Selecting the actual value of wind power F when the k point data is selected for the meteorological influence factor data after each characteristic is selected _z (X _i (k) Is the predicted value of the wind power at the kth data point for the z-th weak regressor, E _z The maximum error of the z-th weak regressor on the training set;

the sample weights are updated as follows:

the sequence of the predicted values of the wind power in the test set is as follows:

Representing k wind power predicted value sequences, X, in a test set _f (k) Represents the k-th wind power predicted value, X, in the training set _g (k) Representing the k wind power predicted value in the test set,

step 4, the calculation error sequence is as follows:

calculating a training set predicted power value

Historical wind power value X of wind power station _o (k) The difference of (d), denoted as error sequence E, is as follows:

6. the multi-model combinatorial optimization-based wind power plant output power prediction method of claim 1, characterized in that:

step 5, constructing the gating cycle unit network comprises the following steps:

constructing a gated cyclic unit network through an input layer, a plurality of gated cyclic unit layers and an output layer;

the update gate determines information that was retained to the current time from a previous time, and the reset gate determines the extent to which the information is discarded;

the updating gate and the placing gate are sequentially defined as follows:

z _t ＝σ(W _Z X _t +U _Z h _t-1 +b _Z )

r _t ＝σ(W _r X _t +U _r h _t-1 +b _r )

the hyper-parameters of the gated loop unit include: the number of neural network layers, the number of neurons in each layer and the dropout rate; wherein, the dropout rate is a proportion of the number of discarded neurons and is used for inhibiting the overfitting phenomenon of the model, and dropout belongs to (0,1);

k∈[1，S]S＜m

The following were used:

7. the multi-model combinatorial optimization-based wind power plant output power prediction method of claim 1, characterized in that:

and 6, the final prediction result is as follows:

wherein X' (k) is the final prediction result of k wind power in the test set,

to test the sequence of predicted values of the set error,

and m-S is the number of samples of the test set. />