CN113094891A

CN113094891A - Multi-wind-farm power modeling, PDF (Portable document Format) construction and prediction scene generation method and system

Info

Publication number: CN113094891A
Application number: CN202110358831.6A
Authority: CN
Inventors: 涂青宇; 苗世洪; 陈霞; 姚福星; 殷浩然; 张迪; 杨炜晨; 韩佶; 尹斌鑫
Original assignee: Huazhong University of Science and Technology
Current assignee: Huazhong University of Science and Technology
Priority date: 2021-03-24
Filing date: 2021-03-24
Publication date: 2021-07-09
Anticipated expiration: 2041-03-24
Also published as: CN113094891B

Abstract

The invention discloses a method and a system for modeling power of multiple wind power plants, building PDF (Portable document Format), and generating a prediction scene, and belongs to the field of scene prediction of wind power. The method establishes a probability distribution model of the prediction error, and adds the probability distribution of the prediction error and the point prediction power of each wind power plant power to be used as an edge distribution model of each wind power plant power. And calculating the cumulative probability of the power data of each wind power plant as input data of the time-varying R vine Copula model. The ARIMA-GARCH-t model and the time-varying R vine Copula model are combined, so that a joint probability distribution model of the high-dimensional wind power data is established. And fitting model parameters based on the historical power data of each wind power plant, and providing a multi-wind-plant power day-ahead prediction scene generation method by combining the predicted power data of the future day of each wind power plant. The day-ahead prediction scene generation model established by the invention can better fit the time-space correlation characteristics of the power of the multiple wind power plants, and improves the accuracy and the effectiveness of the day-ahead prediction scene of the power of the multiple wind power plants.

Description

Multi-wind-farm power modeling, PDF (Portable document Format) construction and prediction scene generation method and system

Technical Field

The invention belongs to the field of scene prediction of wind power, and particularly relates to a method and a system for multi-wind power plant power modeling, PDF (Portable document Format) construction and scene prediction generation prediction.

Background

In recent years, wind power generation has been receiving attention to clean energy supply and reduce carbonization. However, at present, accurate prediction of wind power is still difficult to achieve, and under the background that wind power grid-connected capacity is rapidly increased, the problem of uncertainty of a power supply side caused by wind power prediction errors is increasingly prominent. From the viewpoint of operation of the power system, the reliability of the dispatching plan is affected, so that not only can a serious wind abandoning problem be caused, but also potential risks are brought to safe and stable operation of the power grid. Therefore, a prediction method capable of reflecting the uncertainty of the wind power is urgently needed.

In order to solve the above problems, researchers have conducted intensive research on a scene prediction method of wind power. The scene prediction is a main method for wind power probability prediction, and the basic principle is that a probability distribution model of wind power or prediction error is established through a statistical method, and then a prediction scene is generated through sampling. A plurality of scene prediction methods are proposed in the prior research, part of documents assume that prediction errors obey Beta distribution or t-location-scale distribution, and part of the research establishes a probability distribution model of the prediction errors based on a non-parameter estimation method, wherein the probability distribution model comprises a quantile regression model, an empirical cumulative probability distribution model, a nuclear density estimation model and the like.

The research aims to use long-term frequency distribution of wind power or prediction errors as probability distribution, but the current research starts to focus on the time sequence characteristic of the wind power in consideration of the autocorrelation characteristic of the wind power in a period of time, and further provides a wind power prediction scene generation method based on the time sequence characteristic. Part of documents simulate the time-varying process of wind power generation by using a Markov method; some scholars propose Auto Regression (AR) models; on the basis, the subsequent research further establishes an autoregressive-moving average (ARMA) model. The results of the above studies indicate that, taking into account the wind power time series characteristics, the irregular fluctuations and noise components in the generated prediction scene will be significantly reduced.

On the other hand, with the expansion of the wind power construction scale, a prediction scene generation method suitable for a multi-wind power plant needs to be further provided. However, when the traditional single wind farm prediction scene generation method is directly applied to a plurality of wind farms, the method has limitations, and the key problem is that the time-space correlation characteristics of the power of the plurality of wind farms need to be fully described while the advantages of the original method are kept. A better approach to this problem is to use the Copula model. The Copula model consists of two parts: the edge distribution model and the high-dimensional data are of a dependency structure, wherein the edge distribution model is an independent probability distribution model of a single-dimensional wind power sequence and can be directly applied to part of existing research results for a single wind power plant, so that good continuity is guaranteed; and the dependency structure fits the spatio-temporal correlation features between the multiple wind farm powers. Some documents model joint probability distribution of high-dimensional wind power data based on Gauss Copula models, follow-up research further provides C rattan and D rattan Copula models, and compared with the Gauss Copula models, the topological structure has higher flexibility, so that the accuracy of the models is improved.

Z.Wang and W.S.Wang, et al, in the paper "forcecast scenes of Regional farm Based on regulated Vine video Copula", adopt a static R Vine Copula model, which takes a KDE model as an edge distribution model, takes a static binary Copula model as a basic unit, generates an R Vine topological structure Based on a maximum rotation tree algorithm, establishes a static R Vine Copula model, and provides a day-ahead prediction scene generation method of multi-Wind-field power Based on the model.

However, the above method still has the following disadvantages: 1) an edge distribution model is established based on a Kernel Density Estimation (KDE) model, and the autocorrelation of the wind power in a time dimension, namely the time correlation, cannot be fitted. 2) A basic binary Copula model is adopted in the static R rattan Copula model as a part (basic unit) of the model, the dependency structure of any two wind power plants cannot be accurately fitted, and the time-varying characteristic of the power dependency of any two wind power plants is ignored, so that the spatial dependency of the power of a plurality of wind power plants cannot be accurately fitted. 3) Based on the scene generation method provided by the KDE model and the static R rattan Copula model, the time-space correlation of the generated scene is relatively larger in deviation with the time-space correlation of the actual wind power plant power.

Disclosure of Invention

Aiming at the defect that the existing model in the prior art cannot fully describe the time-space correlation characteristics of the power of multiple wind power plants and the improvement requirement, the invention provides a method and a system method for modeling the power of the multiple wind power plants, constructing PDF (Portable document Format), and generating a predicted scene, and aims to improve the accuracy of the predicted scene of the power of the multiple wind power plants in the future.

To achieve the above object, according to a first aspect of the present invention, there is provided a method for modeling a power-dependent structure of a multiple wind farm, the method comprising the steps of:

s1, respectively calculating the cumulative probability of each wind power plant power according to a wind power plant power edge probability distribution function to obtain a cumulative probability sequence of each wind power plant power;

s2, inputting the cumulative probability sequences of the power of each wind power plant to a time-varying R rattan Copula model together to obtain the time-varying R rattan Copula model of the power of the multi-wind power plant;

the time-varying R vine Copula model is constructed in the following way:

(1) establishing a pair Copula model based on a mixed time-varying binary Copula model, wherein the pair Copula model is used for representing power-dependent structural information of every two wind power plants, and the expression of the pair Copula model is as follows:

wherein u, v represent input data of the pair Copula model; c_MixRepresenting a mixed time-varying binary Copula model; c_iRepresentation selection for composition C_MixThe binary Copula model of (a); omega_iIs represented by C_iThe weight coefficient of (a);

and

respectively represent C_MixAnd C_iThe method comprises the steps that a parameter set of a model is formed, and n represents the number of binary Copula models participating in building of a mixed time-varying binary Copula model;

(2) based on the R rattan Copula model structure generation method, all mixed time-varying binary Copula models are expanded into a multi-wind-farm power-dependent structure model.

Preferably, the binary Copula model is a t Copula model, a Clayton Copula model, or a Gumbel Copula model.

Preferably, the parameters of the mixed time-varying binary Copula model are estimated by using a maximum likelihood estimation method, and the expression is as follows:

wherein the content of the first and second substances,

representing a hybrid time-varying binary Copula model C_MixThe result of the parameter estimation of (2); argmax represents a function of each parameter value when the maximum is found.

Preferably, in step (2), the method for generating the R rattan structure is as follows:

step 1: taking the cumulative probability sequence of each wind power plant power as input data of a first layer tree structure in R vine, and fitting a pair Copula of any two columns of input data by using a mixed time-varying binary Copula model;

step 2: evaluating the accuracy of each pair Copula in the step 1 by using an AIC index;

and step 3: on the basis of ensuring no ring structure, determining a first layer tree structure by taking the minimum sum of AIC index values of all PairCopula models in the layer tree structure as a target, and initializing k to be 1;

and 4, step 4: in the case where the kth level "tree" structure has been generated, the input data of the (k +1) th level "tree" structure is calculated as follows:

wherein, F_β(e)|γ(e)And F_α(e)|γ(e)Two columns of input data each representing a kth level "tree" structure; c_{α(e)，β(e)|γ(e)}Representing F-base in the kth level 'tree' structure_β(e)|γ(e)And F_α(e)|γ(e)Fitting a pair Copula function; f_{α(e)|{γ(e)，β(e)}}And F_{β(e)|{γ(e)，α(e)}}Respectively representing possible input data of a (k +1) th layer 'tree' structure;

representing the derivation of a partial derivative;

and 5: when the kth level "tree" structure topology has been generated, enumerating all possible topologies for the (k +1) th level "tree" structure, the valid topology will satisfy the following constraints:

wherein e is_a，e_bRepresents two "edges", i.e., the pair Copula function;

represents the set of all "edges" in the (k +1) th level "tree" structure; # denotes the collective potential calculation;

step 6: fitting a pair Copula function by using a mixed time-varying binary Copula model for all possible topologies of the (k +1) th layer tree structure in the step 5;

and 7: determining the topology of the (k +1) th layer tree structure by taking the minimum sum of AIC index values of all pair Copula models in the (k +1) th layer tree structure as a target, wherein k is k + 1;

and 8: repeating steps 4 to 7 until the topology of all "tree" structures is generated, which together constitute the R rattan structure of the time-varying R rattan Copula model.

To achieve the above object, according to a second aspect of the present invention, there is provided a method for constructing a joint probability density function of multiple wind farm powers, the method comprising the steps of:

s1, respectively determining power edge probability distribution functions of each wind power plant based on historical power data of each wind power plant;

s2, on the basis of determining the power edge probability distribution function of each wind power plant, constructing a joint probability density function of the power of the multi-wind power plant by adopting a time-varying R vine Copula model;

the time-varying R vine Copula model is modeled by a method as described in the first aspect.

Preferably, in step S1, for each wind farm, the following operations are performed:

(1) establishing a probability density function of the power prediction error of the wind power plant based on an ARIMA-GARCH-t model;

(2) and adding the probability density function of the wind power plant power prediction error and the point prediction power of the wind power plant to obtain a sum, wherein the sum is used as the probability density function of the wind power plant power.

Has the advantages that: according to the method, the edge distribution model of each wind power plant power is established based on the ARIMA-GARCH-t model, so that the autocorrelation of the wind power in the time dimension can be reflected, namely, the time correlation can be fitted more accurately.

To achieve the above object, according to a third aspect of the present invention, there is provided a method for generating a multiple wind farm power day-ahead prediction scene, the method comprising the steps of:

the method comprises the following steps: generate an [ N ]_sc×N_pt×N_wf]All elements in the random matrix obey a uniform distribution U (0, 1), where N_scPredicting the number of scenes for a given number of days ahead, N_ptIndicating the number of predicted power points, N, contained in each of the specified scenes_wfRepresenting the number of wind farms;

step two: decomposing the combined probability density function of the multiple wind power plants into a form of multiplying a Pair Copula density function and an edge distribution probability density function, wherein the combined probability density function of the multiple wind power plants is generated by adopting the method in the second aspect;

step three: the following operations are performed for each scene until N is generated_scA complete day-ahead power prediction scenario;

a. assigning the values in the random matrix Rnd to the partial input data values under the structure of each level "tree" in the time-varying R rattan Copula model as follows:

wherein the content of the first and second substances,

inputting data for a part under each hierarchy 'tree' structure;

representing each wind farm power; i.e. i_scRepresents a scene number; j is a function of_ptRepresenting a data point number;

b: input data of two adjacent layers of tree structures in time-varying R vine Copula model

And

with certain data values, the pair Copula function associated with the data values is calculated according to the following formula

The value:

will be known

Value as a function

Is input with one dimension of data, will

Value and calculated function

Value substitution function

In and out of the function

Input data of another dimension, gamma denotes

The condition set of (1);

c: repeating the step b until the jth_ptThe cumulative probability value corresponding to each wind power plant power under the data point is obtained by all calculation and is recorded as

d: to be provided with

As input data, baseObtaining the jth function of the inverse function of the prediction error accumulation probability function of each wind power plant_ptThe expression of the sampling result of the prediction error of each wind power plant under the data points is as follows:

wherein the content of the first and second substances,

respectively represent the k-th_wfWind farm at jth_ptThe prediction error sampling result, the accumulative probability calculation result and the inverse function of the prediction error accumulative probability function of each data point;

e: d, preprocessing the sampling result of the prediction error of each wind power plant obtained in the step d, so that the sampling result is between the minimum value and the maximum value of the prediction error in the historical data used for fitting the model;

f: adding the prediction error sampling result and the point predicted value to obtain a prediction scene value of each wind power plant power:

wherein the content of the first and second substances,

respectively represent the k-th_wfWind farm at jth_ptPredicting scene values and point predicted values by the power of the data points;

g: j is the j th wind power plant calculated in the step f_ptCalculating the (j) th predicted scene value of the data point_pt+1) probability distribution of wind farm power and prediction error for each data point, j_pt+1) parameter values of all pair Copula in the time-varying R rattan Copula model at the data points;

h: and repeating the steps a to g until a complete day-ahead power prediction scene is generated for all the wind power plants.

To achieve the above object, according to a fourth aspect of the present invention, there is provided a multiple wind farm power day-ahead prediction scene generation system, comprising: a computer-readable storage medium and a processor;

the computer-readable storage medium is used for storing executable instructions;

the processor is configured to read executable instructions stored in the computer-readable storage medium, and execute the method for generating the multiple wind farm power day-ahead prediction scene according to the third aspect.

Generally, by the above technical solution conceived by the present invention, the following beneficial effects can be obtained:

(1) the invention provides a modeling method for a power-dependent structure of a multi-wind-power-field, which is characterized in that a mixed time-varying binary Copula model is used as a pair Copula model, high dimensionality of the model is realized based on an R rattan structure generation method, and a time-varying R rattan Copula model with high dimensionality is established. The Copula model has a good application effect in the aspect of depicting the complex nonlinear correlation characteristics of the wind power, and by establishing the mixed time-varying binary Copula model as the pair Copula model, the defects of symmetry and relatively fixed sensitivity of the upper tail and the lower tail of the traditional binary Copula model are overcome, and the dependency structure of any two wind power plants can be more flexibly and accurately depicted. Meanwhile, parameters of the hybrid time-varying binary Copula model are calculated by introducing time-varying correlation coefficient calculation models such as DCC and pattern, and time-varying characteristics of the power spatial correlation of the multi-wind power plant can be reflected. On the basis, the model is high-dimensional based on the R rattan structure, pre-assumption on the correlation of the power of multiple wind power plants is not needed, and the differentiated high-dimensional Copula model structure topology can be flexibly fitted to the wind power plants distributed in different geographies. Therefore, the method is based on the method for establishing the dependency structure model of the power of the multiple wind power plants, and the spatial correlation characteristics of the power of the multiple wind power plants can be fitted more accurately.

(2) The invention provides a method for constructing a combined probability density function of multiple wind power plants. In the aspect of an edge distribution model, an ARIMA model and a GARCH-t model are combined, a cumulative probability function model of prediction errors is established, the model and the point prediction power of each wind power plant are added, and the edge distribution model of the power of each wind power plant is calculated. The ARIMA model and the GARCH-t model are both models based on time series characteristics, the probability distribution of subsequent data points is fitted through the size and the change trend of data in a previous period of time, the GARCH-t model further reflects the characteristic of the conditional variance of the data, and the defect that the residual flexibility of the traditional GARCH model is insufficient due to Gauss distribution fitting is overcome, so that the ARIMA-GARCH-t model is based on the ARIMA-GARCH-t model to build an edge distribution model, and the autocorrelation characteristic of the wind power in the time dimension, namely the time correlation, can be favorably described. On the other hand, as can be seen from (1), establishing a dependency structure model of a plurality of wind farm powers based on a time-varying R rattan Copula model helps to reflect "spatial correlation". Therefore, the RIMA-GARCH-t model and the time-varying R rattan Copula model are combined to establish a combined probability density function model of the power of the multi-wind power plant, the characteristics of time-space correlation can be considered, and the accuracy is higher.

(3) The invention provides a method for generating a day-ahead prediction scene of power of a multi-wind farm, which is used for generating the day-ahead prediction scene of the power of the multi-wind farm on the basis of a multi-wind farm power combined probability density function model established on the basis of a time-varying R vine Copula model. The basis of scene generation is "sampling", which is essentially the inverse of the above model. Because the edge distribution model and the Copula model of the adopted joint probability distribution model are time-varying models, the traditional sampling method of the static R rattan Copula model is not applicable any more, and a sampling and prediction scene generation method with higher applicability needs to be provided. On the other hand, the time-varying R rattan Copula model fits the time-space correlation characteristics of the multi-wind-farm power, so that the generated scene has the time dimension autocorrelation characteristics and the space dimension cross correlation characteristics which are more similar to the actually-measured wind power curve, and the accuracy and the effectiveness of the multi-wind-farm power day-ahead scene prediction are improved.

Drawings

FIG. 1 is a schematic diagram of the inventive concept provided by the present invention;

FIG. 2 is a time-varying R rattan Copula model modeling flowchart provided by the present invention;

FIG. 3 is a flow chart of ARIMA-GARCH-t model-based wind farm power edge distribution modeling provided by the invention;

FIG. 4 is a flow chart of a method for generating a multi-wind farm power day-ahead predictive scene provided by the present invention;

FIG. 5 is a multi-farm total power prediction scenario generated by the different models provided by the present invention, (a) a prediction scenario generated for the proposed model (model 1); (b) a prediction scenario generated for the independent ARIMA-GARCH-t model (model 2); (c) a prediction scenario generated for a static D vine Copula model (model 3); (d) a prediction scenario generated for the independent KDE model (model 4);

FIG. 6 is a comparison of ACF indicators for a multi-wind farm total power prediction scenario generated by different models provided by the present invention, wherein (a) the ACF indicators for the prediction scenario for the proposed model (model 1); (b) predicting ACF indicators for the scene for the independent ARIMA-GARCH-t model (model 2); (c) predicting ACF indexes of scenes for a static D rattan Copula model (model 3); (d) predicting ACF index of the scene for the independent KDE model (model 4);

FIG. 7 is a comparison of CCF indices for 2 wind farm power prediction scenarios generated by different models provided by the present invention, where (a) the CCF indices for the proposed model (model 1) prediction scenario; (b) predicting a CCF index for the scene for the independent ARIMA-GARCH-t model (model 2); (c) predicting the CCF index of the scene for a static D rattan Copula model (model 3); (d) the CCF index of the scene is predicted for the independent KDE model (model 4).

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention. Furthermore, the technical features mentioned in the embodiments of the present invention described below can be combined with each other as long as they do not conflict with each other.

As shown in fig. 1, the inventive concept of the present invention is as follows: 1. and expressing the power of each wind power plant as a point prediction power and prediction error sum form, establishing a probability distribution model of the prediction error based on an ARIMA-GARCH-t model, and adding the probability distribution of the prediction error and the point prediction power of the power of each wind power plant to obtain an edge distribution model of the power of each wind power plant. And calculating the cumulative probability of the power data of each wind power plant as input data of the time-varying R vine Copula model. 2. A time-varying R vine Copula model is established in two steps: firstly, establishing a pair Copula model based on a mixed time-varying binary Copula model, and secondly, expanding the mixed time-varying binary Copula model into a high-dimensional model based on an R vine Copula model topological structure generation method. 3. A combined probability distribution model of high-dimensional wind power data is established by combining an ARIMA-GARCH-t model and a time-varying R rattan Copula model (abbreviated as a TRVMC model). On the basis of fitting model parameters of historical power data of each wind power plant, and combining predicted power data of points of each wind power plant in the future day, a multi-wind-plant power day-ahead prediction scene generation method is provided. The multi-wind-farm-power day-ahead prediction scene generation model established by the invention can better fit the time-space correlation characteristics of the multi-wind-farm power, and the accuracy and the effectiveness of the multi-wind-farm-power day-ahead prediction scene are improved.

The invention provides a modeling method for a power-dependent structure of multiple wind power plants, which comprises the following steps:

and S1, respectively calculating the cumulative probability of the power of each wind power plant according to the power marginal probability distribution function of the wind power plant to obtain a cumulative probability sequence of the power of each wind power plant.

And converting the original wind power data into a uniform sequence taking [0, 1] as a boundary by calculating the cumulative probability, wherein the converted sequence is used as input data of a Copula model.

And S2, inputting the cumulative probability sequences of the power of each wind power plant to a time-varying R rattan Copula model together to obtain the time-varying R rattan Copula model of the power of the multi-wind power plant.

The definition of the R vine Copula model can be expressed as follows:

(1) r rattan Copula model (marked as M-dimensional data) established aiming at M-dimensional data

) Is composed of M layers of mutually nested Tree structures. The M-level tree structure is respectively marked as

Where i is 1, 2, …, M represents a hierarchy of a "tree" structure, and so

(2) Each layer of 'tree' structure

The node (edge) set is composed of two parts, namely a node set and an edge set which are respectively marked as

And

"node" means the input data of the "tree" structure, and "edge" means the joint probability density function in the "tree" structure, which is fitted by a binary Copula function for 2-dimensional input data, is called pair Copula function.

(3) For the first level "tree" structure (i.e., i ═ 1), the "set of nodes" is computed from an edge distribution model, where the edge distribution model refers to an independent probability distribution model of the input data. For other levels of "tree" structures (i.e., i-2, …, M), the set of "edges" of the previous level of "tree" structure will correspond to the set of "nodes" in the next level of "tree" structure, i.e., M

The next level of "tree" structure thus has 1-dimensional less input data than the previous level of "tree" structure.

(4) The R rattan Copula model should satisfy the following constraints: any two "edges" under any "tree" structure share at most one "node".

"nodes" and "edges" are generally denoted as a combination of "condition sets" and "conditioned sets". Examples are as follows: for an "edge" e, it is assumed to be subordinate to the i-th level tree structure

a, b are two 'nodes' forming 'edge' e, satisfy

Then "edge" e can be written as e ═ α (e), β (e) | γ (e), where α (e) and β (e) are the conditioned set of "edge" e, and γ (e) is the conditioned set of "edge" e. Also, for "nodes" a, b, as defined by the PDF, it corresponds to the previous level of the "tree" structure

2 "sides" in (1) can be respectively marked as e_a＝α(e_a)，β(e_a)|γ(e_a) And e_b＝α(e_b)，β(e_b)|γ(e_b) Then, the relationship between the condition set and the conditioned set between the "nodes" a, b and the "edges" e is as follows:

γ(e)＝{α(e_a)，β(e_a)，γ(e_a)}∩{α(e_b)，β(e_b)，γ(e_b)}

{α(e)，β(e)}＝({α(e_a)，β(e_a)，γ(e_a)}\γ(e))∩({α(e_b)，β(e_b)，γ(e_b)}\γ(e))

based on the above formula, the pair Copula function corresponding to the "edge" e can be abbreviated as C_{α(e)，β(e)|γ(e)}. "node" a, b represents the 2-dimensional input data of pair Copula, and is two conditional cumulative probability sequences, which can be abbreviated as F_α(e)|γ(e)And F_β(e)|γ(e)。

On this basis, a joint Probability Density Function (PDF) corresponding to the M-dimensional data can be calculated as follows:

wherein x is_iIs the ith column of input data;

is a collection of all input data, i.e.

f_jntIs a collection of input data

The joint PDF of (1); f. of_iIs input data x_iThe edge distribution model of (1);

represents the ith pair Copula function in the jth tree structure, F_α(e)|γ(e)And F_β(e)|γ(e)Representing the corresponding input data.

For ease of description, the parts of the pair Copula function in parentheses are omitted from the subsequent discussion. Pair Copula is the basic unit of the R rattan Copula model, whose function is to fit a joint probability density function of two-dimensional data. And establishing a mixed time-varying binary Copula model as a pair Copula model.

As shown in fig. 2, the time-varying R vine Copula model is constructed by:

and

respectively represent C_MixAnd C_iAnd n represents the number of the binary Copula models participating in constructing the mixed time-varying binary Copula model.

The expressions are respectively as follows:

wherein, C_t、C_Cl、C_GRespectively representing t Copula, Clayton Copula and Gumbel Copula models; rho_t、k、

Representing model parameters;

representing the inverse of the t-distribution model.

On this basis, in order to fit the time-varying process of the correlation between the powers of the multiple wind farms, parameters of each basic binary Copula model need to be dynamically calculated, and the process is as follows:

(a) t-Copula model

For the t Copula model, the DCC (1, 1) model can be used to calculate model parameters, which are expressed as follows:

where ρ is_tRepresenting t Copula model parameters; alpha is alpha_DAnd beta_DIs a parameter of the DCC model;

is the covariance coefficient of the input data; epsilon_t-1Representing an array of input data at time (t-1), i.e. ∈_t-1＝[u_t-1，v_t-1]Wherein u is_t-1、v_t-1Respectively (t-1) time 2-dimensional input data.

(b) Clayton Copula and Gumbel Copula models

For the Clayton Copula model and Gumbel Copula model, the parameters can be calculated using the pattern model, which is expressed as follows:

wherein R is_tA correlation coefficient representing data; omega_ptt、α_PttAnd beta_PttIs a parameter of the pattern model; p represents the correlation coefficient R for fitting time t_tThe length of the history data sequence of (2) is usually set to 10. Lambda_PttThe logistic function is represented and functions to keep the computation results of the pattern model within the desired range.

In addition, the calculation result of the pattern model needs to be further converted into parameters of the Copula model, and the calculation method is as follows:

wherein the content of the first and second substances,

respectively representing Clayton Copula and Gumbel Copula model parameters, R_tRepresenting the results of the computation of the pattern model.

In combination with the above calculation process, the parameter set of each basic binary Copula model and the mixed time-varying binary Copula model in equation (9) can be represented as:

wherein the content of the first and second substances,

parameter sets respectively representing t Copula, Clayton Copula, Gumbel Copula models, { α_D，β_D，k}、

Representing parameters in the corresponding model;

set of parameters, ω, representing a mixed time-varying binary Copula model_t、ω_Cl、ω_GAnd weight coefficients of t Copula, Clayton Copula and Gumbel Copula models are shown.

wherein the content of the first and second substances,

the expression is as follows:

I_AIC＝2n_ml-2ln(L_m)

wherein, I_AICRepresenting an AIC index value; n is_mlRepresenting the number of parameters in the pair Copula model; l is_mRepresenting the value of the maximum likelihood function.

the generation process of each layer of 'tree' structure in R vine is an optimization problem, and the optimization goal is to minimize the sum of AIC index values of all pair Copula models in the layer of 'tree' structure. In particular, for the first layer "tree" structure, it can be regarded as a shortest path problem, where the weighted value of each pair Copula model, i.e. the AIC index value, can adopt Prim algorithm for the problem.

wherein, F_β(e)|γ(e)And F_α(e)|γ(e)Two columns of input data each representing a kth level "tree" structure; g_{α(e)，β(e)|γ(e)}Representing F-base in the kth level 'tree' structure_β(e)|γ(e)And F_α(e)|γ(e)Fitting a pair copula function; f_{α(e)|{γ(e)，β(e)}}And F_{β(e)|{γ(e)，α(e)}}Respectively representing possible input data of a (k +1) th layer 'tree' structure;

representing the derivation of a partial derivative;

wherein e is_a，e_bRepresents two "edges", i.e., the pair Copula function;

The invention provides a method for constructing a joint probability density function of multiple wind power plant powers, which comprises the following steps:

s1, respectively determining power marginal probability distribution functions of the wind power plants based on historical power data of the wind power plants.

The power data of each wind power plant are divided into two parts, namely point prediction power data and prediction error data, wherein the point prediction refers to the existing prediction method for the expected value of the wind power. Firstly, aiming at the prediction error data of each wind power plant, establishing a probability distribution model based on an ARIMA-GARCH-t model; secondly, adding the probability distribution model of the prediction error of each wind power plant and the point prediction power, wherein the calculation result is the probability distribution model of the power of each wind power plant, and the expression is as follows:

wherein, W_rl、W_fc、W_errRespectively representing the actual power, the predicted power and the prediction error of the wind power plant; f. of_rlAnd f_errProbability density functions representing actual power and prediction error, respectively; the representation variables obey a certain probability distribution.

The ARIMA-GARCH-t model is a combined model, and the specific modeling process is shown in FIG. 3:

ARIMA(P_AR，d，Q_MA) The model can be represented as follows:

wherein, y_tRepresenting input wind powerAnd predicting an error sequence. B denotes the hysteresis operator, i.e. B^jy_t＝y_t-j；

The method is characterized by representing an n-order difference calculation process, having the function of converting input data into a stable sequence, and using an augmented Dickey-Fuller test to evaluate the stability of the sequence; mu.s_AMRepresenting fixed parameters obtained by fitting; epsilon_tRepresents the residual; e represents an expected value operation; var denotes a variance operation, i.e.

Is a residual epsilon_tThe variance of the sequence; Φ (B), Θ (B) represent AR and MA polynomials, respectively, and the expressions are as follows:

wherein, P_AR、Q_MARepresenting the order of the polynomial;

and

the fitted AR and MA coefficients are shown.

The ARIMA model can be viewed as a predictive model in which the prediction error is { ε_t}. Through an ARIMA model, the estimation of the probability distribution of the prediction error of the original wind power can be converted into a sequence { epsilon ] of residual errors_tEstimation of probability distribution, and the function of the GARCH-t model is to fit the residual sequence ε_tAnd (4) probability distribution. GARCH (P)_G，Q_AC) The t-model can be expressed as follows:

wherein N represents a normal distribution; representing variables subject to a certain probabilityDistributing; i is_t-1Represents all history information before the time (t-1); h is_tRepresents a conditional variance; v. of_tIs an independent same distribution variable which follows standard normal distribution, i.i.d. represents independent same distribution; mu.s_GA、

And

is a fixed parameter, an ARCH coefficient and a GARCH coefficient obtained by fitting; p_G、Q_ACIndicating the order of the GARCH coefficient and ARCH coefficient.

Combining the calculation results of the ARIMA model and the GARCH-t model, the probability density function of the original wind power prediction error sequence can be obtained as follows:

wherein the content of the first and second substances,

and

respectively representing the calculation results of the ARIMA model and the GARCH-t model; the representation variables obey a certain probability distribution.

It should be noted that the power of each wind farm and the edge distribution of the prediction error are not the same, so the parameters of the ARIMA-GARCH-t model are also different, and the ARIMA-GARCH-t model needs to be fitted separately during modeling. In other words, for M wind farms, there are M columns of prediction error data, so M ARIMA-GARCH-t models with different parameters need to be fitted separately.

S2, on the basis of determining the power edge probability distribution function of each wind power plant, constructing a joint probability density function of the power of the wind power plants by adopting a time-varying R rattan Copula model, wherein the time-varying R rattan Copula model is modeled by the method in the first aspect.

As shown in fig. 4, the present invention provides a method for generating a multiple wind farm power day-ahead prediction scene, which comprises the following steps:

the method comprises the following steps: generate an [ N ]_sc×N_pt×N_wf]All elements in the random matrix obey a uniform distribution U (0, 1), where N_scPredicting the number of scenes for a given number of days ahead, N_ptIndicating the number of predicted power points, N, contained in each of the specified scenes_wfRepresenting the number of wind farms.

Step two: and decomposing the combined probability density function of the multiple wind power plants into a form of multiplying the Pair Copula density function and the edge distribution probability density function, wherein the combined probability density function of the multiple wind power plants is generated by adopting the method of the second aspect.

Step three: the following operations are performed for each scene until N is generated_scThe complete day-ahead power prediction scenario.

wherein the content of the first and second substances,

inputting data for a part under each hierarchy 'tree' structure;

representing each wind farm power; i.e. i_scRepresents a scene number; j is a function of_ptRepresenting the data point number.

And

The value:

will be known

Value as a function

Is input with one dimension of data, will

Value and calculated function

Value substitution function

In and out of the function

Input data of another dimension, gamma denotes

The condition set of (1). Interpolation can be used for the calculation.

d: to be provided with

As input data, the j (th) is obtained based on the inverse function of the prediction error cumulative probability function of each wind power plant_ptThe expression of the sampling result of the prediction error of each wind power plant under the data points is as follows:

wherein the content of the first and second substances,

respectively represent the k-th_wfWind farm at jth_ptThe prediction error sampling result, the accumulative probability calculation result and the inverse function of the prediction error accumulative probability function of each data point.

e: and d, preprocessing the sampling result of the prediction error of each wind power plant obtained in the step d, so that the sampling result is between the minimum value and the maximum value of the prediction error in the historical data used for fitting the model.

In order to ensure the rationality of the prediction error sampling result, the size of the sampled value needs to be limited. Assuming that the minimum and maximum values of the prediction error in the historical data used to fit the model are respectively

When the sampling result is greater than

At that time, set as

Is less than

At that time, set as

Further, the prediction error sampling result is added with the point prediction value, so that the prediction scene value of each wind power plant power can be obtained:

wherein the content of the first and second substances,

respectively represent the k-th_wfWind farm at jth_ptAnd predicting scene values and point predicted values by the power of the data points.

wherein the content of the first and second substances,

g: j is the j th wind power plant calculated in the step f_ptCalculating the (j) th predicted scene value of the data point_pt+1) probability distribution of wind farm power and prediction error for each data point, j_pt+1) parameter values for all pair Copula in the time-varying R rattan Copula model at the data points.

Next, the validity of the model is verified by a simulation example.

Evaluation index

For the generated multi-wind farm power prediction scene, an autocorrelation function (ACF) and a cross-correlation function (CCF) are evaluated. When the ACF and the CCF of the generated wind power prediction scene are similar to those of the actually measured wind power, the scene effectiveness is high.

For data sequence y_t＝{y₀，y₁，y₂，…，y_tThe ACF index refers to the sequence y_tAnd the sequence y_t+k＝{y_0+k，y_1+k，y_2+k，…，y_t+kAnd (5) a correlation coefficient between the wind power plants, wherein the correlation coefficient is used for evaluating the autocorrelation of the total power prediction scene of the multiple wind power plants in the time dimension. The expression of ACF is as follows:

wherein, I_ACFRepresenting an ACF index value; e represents an expected value operation;

and

representing the sequence y_tAnd y_t+kAverage value of (d);

and

representing the sequence y_tAnd y_t+kStandard deviation of (d); k represents a delay time, and when the delay time k is 0, I_ACF(0)＝1。

For data sequences

And

CCF index refers to sequence

And sequence

And the correlation coefficient is used for evaluating the time-space coupling correlation between two wind power plant power prediction scenes. The expression of CCF is as follows:

wherein, I_CCFRepresenting a CCF index value;

respectively representing predicted power scenes of two wind power plants;

and

representing a sequence

And

average value of (d);

and

representing a sequence

And

standard deviation of (d); k denotes a delay time, and when the delay time k is O, CCF is equivalent to a commonly used Pearson correlation coefficient.

Description of simulation data

The method is based on data development analysis of 8 northwest wind power plants, the sampling time is 3 months (90 days), and the sampling frequency is 96 points/day. Firstly, calculating parameters of an ARIMA-GARCH-t model and a TRVMC model based on point predicted power and actual measured power of each wind power plant in the previous 85 days; and secondly, taking one day of the last 5 days as an object, taking the point predicted power of each wind power plant as the point predicted power of the next day, generating 100 day-ahead predicted scenes for the power of each wind power plant, and comparing the ACF and the CCF of the generated scenes with the ACF and the CCF of the actually measured power.

Description of comparative model

In this section, 4 kinds of prediction scene generation models are selected for comparison:

model 1: the model combining ARIMA-GARCH-t and TRVMC, which is provided by the patent, is abbreviated as 'deployed' model;

model 2: independent ARIMA-GARCH-t model;

model 3: a static D vine Copula model;

model 4: independent Kernel Density Estimation (KDE) model.

In the

models

2 and 4, "independent" means that correlation among power of a plurality of wind power plants is ignored, models are built for the wind power plants respectively, and a power prediction scene is generated. On the other hand, the

models

2 and 4 are edge distribution models of the models 1 and 3, respectively.

On the basis of generating the power prediction scenes of all the wind power plants, the prediction scenes of the total power of the multiple wind power plants can be obtained through superposition calculation.

Simulation result

FIG. 5 shows a prediction scenario for the total power of 8 wind farms, generated with 4 models each. According to the simulation result, when a prediction scene of the total power of 8 wind power plants is generated, the fitting results of 4 models have larger difference. On the one hand, compared with the models 1 and 3, the

models

2 and 4 neglect the correlation among a plurality of wind farms, so that when the power of one wind farm is larger, the power of other wind farms may be smaller, and the fluctuation range of the total power is greatly reduced after superposition calculation, thereby causing that the prediction scene cannot envelop the actually measured total power curve in some cases (see (b) in fig. 5). On the other hand, compared with the

models

1 and 2, the models 3 and 4 ignore the time series characteristics of the wind power, so that when the wind power at the current moment in the generated scene is larger, the wind power at the next moment may be too small, which results in more frequent fluctuation and larger amplitude of the predicted scene generated by the models 3 and 4, and the fluctuation amplitude of the scene in some cases will greatly exceed the fluctuation amplitude of the actually-measured total power curve (see (c) in fig. 5). Compared with

models

2, 3 and 4, the prediction scene generated by the model provided by the invention not only can effectively envelop the actually measured power curve, but also has the fluctuation frequency and amplitude similar to the actually measured curve, thereby effectively depicting the uncertainty condition of the wind power.

FIG. 6 shows ACF index values for 8 wind farm total power prediction scenarios generated with 4 models, respectively. According to the simulation result, the ACF value of the scene generated by the model (model 1) provided by the invention is closest to the ACF value of the actually measured total power curve, so that the wind power prediction scene generated by the model provided by the invention has the time dimension self-correlation characteristic close to the actually measured value. Theoretical analysis shows that the model 3 and the model 4 ignore the time series characteristics of the wind power, so that the predicted power of adjacent data points in the generated scene is relatively independent. For example, when the wind power of the current moment in the generated scene is large, the wind power of the next moment may be small, so that the total fluctuation range of the scene greatly exceeds the fluctuation range of the actually-measured total power curve, which weakens the autocorrelation of the generated scene, and the ACF index value is always lower than the actual value. On the other hand, compared with the model 1, the model 2 ignores the correlation among a plurality of wind power plants, so that after the power prediction scenes of the wind power plants are superposed and calculated, the fluctuation range of the model 2 for generating the total power prediction scene is greatly reduced, the autocorrelation of the generated scene is enhanced, the ACF index value is usually higher than the actual value, and one direct embodiment is that the generated scene is too smooth.

FIG. 7 shows CCF index values for a power prediction scenario generated with 4 models, respectively, for 2 of the wind farms. According to simulation results, the CCF value of a scene generated by the model (model 1) provided by the invention is closest to the CCF value of an actual measurement power curve of the wind power plant 2, so that the power prediction scenes of a plurality of wind power plants generated by the model provided by the invention have time-space correlation characteristics close to the actual measurement values. Theoretical analysis shows that the CCF value is almost always lower than the actual value, since model 2 and model 4 ignore the correlation between wind farms. Meanwhile, the CCF curves of the model 2 and the model 4 in the prediction scenes are distributed relatively dispersedly, which shows that the correlation among the 2 wind power plants in different scenes is greatly different and is not consistent with the actual situation. On the other hand, although the models 1 and 3 are based on Copula models and fit the correlation among a plurality of wind power plants, the model 1 further considers the time series characteristic of each wind power plant power and the time-varying characteristic of the correlation among the plurality of wind power plants, so that the similarity between the CCF value of the scene generated by the model 1 and the actual value is ensured. In contrast, since model 3 only takes into account the overall correlation of the wind farm, more specifically the Kendall correlation coefficient, the CCF value of the model 3 generation scenario is close to the actual value only when the delay time is 0, and is still less than the actual value.

In conclusion, the multi-wind-farm-power day-ahead prediction scene generation model established by the invention can better fit the time-space correlation characteristics of the multi-wind-farm power, and the accuracy and the effectiveness of the multi-wind-farm-power day-ahead prediction scene are improved.

It will be understood by those skilled in the art that the foregoing is only a preferred embodiment of the present invention, and is not intended to limit the invention, and that any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims

1. A modeling method for a multi-wind-farm power-dependent structure is characterized by comprising the following steps:

the time-varying R vine Copula model is constructed in the following way:

and

2. The method of claim 1, wherein the binary Copula model is a t Copula model, a Clayton Copula model, or a Gumbel Copula model.

3. The method of claim 1, wherein the parameters of the hybrid time-varying binary Copula model are estimated using a maximum likelihood estimation method, expressed as follows:

wherein the content of the first and second substances,

4. The method of claim 1, wherein in step (2), the R rattan structure is generated by:

step 1: taking the cumulative probability sequence of each wind power plant power as input data of a first layer tree structure in R rattan, and fitting the pair Copula of any two columns of input data by using a mixed time-varying binary Copula model;

and step 3: on the basis of ensuring no ring structure, determining a first layer tree structure by taking the minimum sum of AIC index values of all pair Copula models in the layer tree structure as a target, and initializing k to be 1;

wherein, F_β(e)|γ(e)And F_α(e)|γ(e)Two columns of input data each representing a kth level "tree" structure; g_{α(e)，β(e)|y(e)}Representing F-base in the kth level 'tree' structure_β(e)|γ(e)And F_α(e)|γ(e)Fitting a pair Copula function; f_{α(e)|{γ(e)，β(e)}}And F_{β(e)|{γ(e)，α(e)}}Respectively representing possible input data of a (k +1) th layer 'tree' structure;

representing the derivation of a partial derivative;

wherein e is_a，e_bRepresents two "edges", i.e., the pair Copula function;

5. A method for constructing a joint probability density function of power of multiple wind power plants is characterized by comprising the following steps:

the time-varying R vine Copula model is modeled by the method of any one of claims 1 to 4.

6. The method according to claim 5, characterized in that in step S1, for each wind farm, the following operations are performed:

7. A method for generating a multi-wind-farm power day-ahead prediction scene is characterized by comprising the following steps:

step two: decomposing a joint probability density function of the multiple wind farm powers into a form of multiplying a Pair Copula density function by an edge distribution probability density function, the joint probability density function of the multiple wind farm powers being generated by the method of claim 5 or 6;

wherein the content of the first and second substances,

for parts under the structure of a "tree" of levelsInputting data;

And

The value:

will be known

Value as a function

Is input with one dimension of data, will

Value and calculated function

Value substitution function

In and out of the function

Input data of another dimension, gamma denotes

The condition set of (1);

d: to be provided with

wherein the content of the first and second substances,

wherein the content of the first and second substances,

8. A multiple wind farm power day-ahead predictive scene generation system, comprising: a computer-readable storage medium and a processor;

the processor is configured to read executable instructions stored in the computer-readable storage medium and execute the method of generating multiple wind farm power day-ahead predictive scenes of claim 7.