CN113361761A

CN113361761A - Short-term wind power integration prediction method and system based on error correction

Info

Publication number: CN113361761A
Application number: CN202110608220.2A
Authority: CN
Inventors: 杨明; 丁婷婷; 于一潇; 李鹏
Original assignee: Shandong University
Current assignee: Shandong University
Priority date: 2021-06-01
Filing date: 2021-06-01
Publication date: 2021-09-07

Abstract

The invention discloses a short-term wind power integrated prediction method and a short-term wind power integrated prediction system based on error correction, wherein the method comprises the following steps: acquiring operation data and weather forecast data of a wind power plant; inputting the data into a trained wind power prediction model and a wind power error prediction model respectively for prediction; adding the prediction results output by the two models to obtain a final short-term wind power prediction result; the training data set of the wind power prediction model is historical operation data and weather data in a preset time period of the wind power plant; and taking the power error data set, historical operating data and weather data in a preset time period of the wind power plant as a training data set of the wind power error prediction model. The method improves the short-term wind power prediction precision and improves the consumption capacity of the power grid to new energy power generation.

Description

Short-term wind power integration prediction method and system based on error correction

Technical Field

The invention relates to the technical field of wind power prediction in a new energy power generation process, in particular to a short-term wind power integrated prediction method and system based on error correction.

Background

The statements in this section merely provide background information related to the present disclosure and may not necessarily constitute prior art.

With the increasingly prominent environmental problems and the increasing demand for energy, the development of new energy represented typically by wind power has become a common consensus in countries around the world. Wind energy has the advantages of no pollution, reproducibility, wide resources and the like, and is vigorously developed and applied by many countries, but wind power output power has strong randomness and volatility, large-scale wind power is accessed into a power grid to cause the difficulty of distribution and utilization balance to be increased, the uncertainty of operation of a power system is obviously increased, and the contradiction between safe operation of the system and efficient consumption of new energy is increasingly prominent. Therefore, it becomes important to improve the wind power prediction accuracy.

Based on the division of time scales, wind power prediction is mainly divided into ultra-short-term prediction, short-term prediction and medium-long-term prediction. The short-term wind power prediction result can adjust the unit combination scheme, optimize the conventional power supply unit power generation plan and improve the consumption capacity of the power grid on new energy power generation, so the short-term wind power prediction is a hotspot of current research.

At present, most researches on short-term wind power prediction adopt a single model for prediction, but the single prediction model is used for prediction based on a certain assumed space, so that the wind power prediction by using the single model inevitably has prediction errors; in addition, most of the current research methods adopt a certain prediction model for prediction and directly use the model prediction result as the final prediction result, but the research methods do not consider that the results obtained by prediction based on any model have errors.

Disclosure of Invention

In order to solve the problems, the invention provides a short-term wind power integrated prediction method and system based on error correction, which comprehensively utilize an XGboost model and a random forest model in an integrated learning algorithm to predict the wind power, so that the defect that a single prediction model has larger prediction error at some points is avoided, and the short-term wind power prediction precision is improved by a residual error learning method.

In some embodiments, the following technical scheme is adopted:

a short-term wind power integration prediction method based on error correction comprises the following steps:

acquiring operation data and weather forecast data of a wind power plant;

inputting the data into a trained wind power prediction model and a wind power error prediction model respectively for prediction;

adding the prediction results output by the two models to obtain a final short-term wind power prediction result;

the training data set of the wind power prediction model is historical operation data and weather data in a preset time period of the wind power plant; inputting historical operating data and weather data in a preset time period of a wind power plant into a wind power prediction model, outputting a wind power prediction value, and obtaining a power error data set based on the wind power prediction value; and the power error data set, historical operating data in a preset time period of the wind power plant and weather data are used as a training data set of the wind power error prediction model.

As a further approach, the wind farm operational data and weather forecast data include, but are not limited to, generated power, wind direction, wind speed, temperature, humidity, and barometric pressure data for the wind farm.

As a further scheme, the wind power prediction model is constructed by adopting an XGboost model optimized by an improved genetic algorithm.

As a further scheme, during construction of the wind power prediction model, binary coding is adopted, the minimum leaf node weight, the maximum depth, the learning rate and the gamma value of the XGboost model are selected as independent variable parameters, a population is initialized in a random mode, each obtained chromosome contains the independent variable parameters of the XGboost model, the minimum normalized mean absolute error NMAE of the model prediction result is used as the model fitness, and then the optimal solution of the XGboost model independent variable parameters is found through replication, timely-changing improved intersection and variation.

As a further scheme, the wind power error prediction model is constructed by utilizing a random forest model optimized by an improved genetic algorithm.

As a further scheme, when the wind power error prediction model is constructed, binary coding is adopted, the number of trees of a random forest model, the randomly selected characteristic number of each decision tree and the deepest depth of the tree are used as parameters, a population is initialized in a random mode, each obtained chromosome contains three parameters of the random forest model, the normalized mean absolute error NMAE minimum of a model prediction result is used as the model fitness, and then the optimal solution of the three parameters of the random forest model is found through copying, timely-changing improved intersection and variation.

As a further scheme, operation and numerical weather forecast data of the wind power plant are collected in real time, the trained XGboost and random forest models are used for forecasting respectively, a wind power preliminary forecasting result and a wind power error forecasting result are correspondingly obtained, and the two model forecasting results are added to obtain a final wind power forecasting result.

In other embodiments, the following technical solutions are adopted:

a short-term wind power integrated prediction system based on error correction comprises:

the data acquisition module is used for acquiring operation data and weather forecast data of the wind power plant;

the model prediction module is used for inputting the data into a trained wind power prediction model and a wind power error prediction model respectively for prediction;

the data output module is used for adding the prediction results output by the two models to obtain a final short-term wind power prediction result;

In other embodiments, the following technical solutions are adopted:

a terminal device comprising a processor and a memory, the processor being arranged to implement instructions; the memory is used for storing a plurality of instructions which are suitable for being loaded by the processor and executing the short-term wind power integration prediction method based on error correction.

In other embodiments, the following technical solutions are adopted:

a computer readable storage medium, wherein a plurality of instructions are stored, the instructions are suitable for being loaded by a processor of a terminal device and executing the short-term wind power integration prediction method based on error correction.

Compared with the prior art, the invention has the beneficial effects that:

1. the XGboost model and the random forest model in the integrated learning algorithm are comprehensively utilized to establish the wind power

The prediction model avoids the defect that a single prediction model has large prediction errors at certain measuring points.

2. The invention optimizes the parameters of the XGboost model and the random forest model by using the improved genetic algorithm, and overcomes the defect of using the improved genetic algorithm

The traditional genetic algorithm carries out parameter optimization and falls into the defect of local optimization.

3. According to the method, the wind power preliminary prediction result is corrected by using the information of the wind power prediction error, the short-term wind power prediction precision is improved by using a residual error learning method, the power generation plan of a conventional power supply unit is optimized, and the consumption capacity of a power grid on new energy power generation is improved.

Additional features and advantages of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention.

Drawings

FIG. 1 is a wind power prediction flow chart based on error correction in an embodiment of the present invention;

FIG. 2 is a correlation coefficient diagram between different meteorological elements and wind power in the embodiment of the invention;

FIG. 3 is a flow chart of an improved genetic algorithm in an embodiment of the present invention;

FIG. 4 is a schematic diagram of a Bagging algorithm in an embodiment of the invention;

FIG. 5 is a graph of NMAE values for each model in an embodiment of the present invention;

FIG. 6 is a graph of NRMSE values for various models in an embodiment of the present invention;

FIG. 7 shows the predicted result of wind power No. 4/month 1 in 2019 of each model in the embodiment of the present invention;

FIG. 8 shows the predicted results of wind power generation No. 7/month 1 in 2019 for each model in the embodiment of the present invention;

FIG. 9 shows the wind power prediction results of model No. 10/month 1 in 2019 in the embodiment of the present invention;

FIG. 10 shows the wind power prediction results of model No. 1/2019 in the embodiment of the present invention;

FIG. 11 is a No.6 wind farm relative prediction error variation curve in the embodiment of the invention.

Detailed Description

It should be noted that the following detailed description is exemplary and is intended to provide further explanation of the disclosure. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs.

It is noted that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of example embodiments according to the present application. As used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, and it should be understood that when the terms "comprises" and/or "comprising" are used in this specification, they specify the presence of stated features, steps, operations, devices, components, and/or combinations thereof, unless the context clearly indicates otherwise.

The embodiments and features of the embodiments of the present invention may be combined with each other without conflict.

Example one

According to the embodiment of the invention, a short-term wind power integration prediction method based on error correction is disclosed, and with reference to fig. 1, the method comprises the following steps:

(1) acquiring real-time operation data and weather forecast data of the wind power plant;

specifically, wind farm operation and numerical weather forecast data refer to any data related to wind power generation, including but not limited to: and generating power, wind direction, wind speed, temperature, humidity, air pressure and other data of the wind power plant.

(2) Inputting the data into a trained wind power prediction model and a wind power error prediction model respectively for prediction;

specifically, operation and numerical weather forecast data in a preset time period of the wind power plant are collected, and data of meteorological elements such as 10-meter wind speed and 100-meter wind speed are extracted.

The historical data set is divided into a training set A and a training set B, and a wind power prediction model is initially established by utilizing the XGboost optimized by the improved genetic algorithm based on the training set A.

Inputting meteorological data of the training set B into a trained XGboost model for prediction to obtain a wind power prediction value, further obtaining a power error data set D, and establishing a wind power error prediction model by utilizing a random forest optimized by an improved genetic algorithm based on the training set B and the prediction error set D.

The embodiment makes full use of the advantages that integrated learning can be combined with a plurality of weak supervision models to obtain a comprehensive strong supervision model, and provides a short-term wind power integrated prediction model based on error correction based on a residual error learning method. Firstly, preliminarily establishing a wind power prediction model by using an XGboost model optimized by an improved genetic algorithm, then establishing a wind power error prediction model by using a random forest optimized by the improved genetic algorithm based on a prediction error of the XGboost model, and finally adding prediction results of the XGboost model and the random forest to obtain a final prediction result. The method improves the short-term wind power prediction precision by utilizing the advantages of the integrated learning model and through a residual learning method, further optimizes the power generation plan of the conventional power supply unit and improves the consumption capacity of the power grid to new energy power generation.

The detailed method comprises the following steps:

(1) data preparation

To determine the input amount of the prediction model, the present embodiment first uses the correlation coefficient ρ_xyTo analyze meteorological factors such as wind speed and wind direction

Correlation between element and wind power, ρ_xyThe larger the value of (A), the larger is x_tAnd y_tThe stronger the correlation between two time series, ρ_xyIs defined as:

wherein the content of the first and second substances,

and

are the average of the two time series, respectively, and N is the number of samples tested.

Table 1 shows the relationship between the correlation coefficient and the degree of correlation between two time series, and thus the correlation coefficient can be based on

To determine the degree of correlation between the two time series.

TABLE 1 correlation coefficient and degree of correlation

Fig. 2 shows a correlation coefficient diagram between the wind power (P) and seven meteorological quantities including the wind speed of 10 meters (WS10), the wind direction of 10 meters (WD10), the wind speed of 100 meters (WS100), the wind direction of 100 meters (WD100), the ground temperature (Tem), the ground pressure (AP) and the ground humidity (Hum), from which it can be seen that the wind speed of 10 meters and the wind speed and power of 100 meters are significantly correlated, the wind speed of 10 meters and the wind direction of 100 meters and power are low correlated, and the temperature, air pressure and humidity and wind power are weakly correlated, so that the two meteorological quantities of the wind speed of 10 meters and the wind speed of 100 meters at the target time are selected as model inputs in the present embodiment.

(2) Preliminary wind power prediction model based on XGboost

The integrated learning algorithm is mainly divided into a Bagging algorithm and a Boosting algorithm, the XGboost algorithm belongs to one of the Boosting algorithms, a new classification regression tree is generated in each iteration of the algorithm, the new tree can continuously fit the residual error of the previous tree, and the previous experimental result is continuously repaired, so that a learning model with higher accuracy is constructed. When a sample needs to be predicted, corresponding scores are generated on the leaf nodes corresponding to each tree, and thus the sum of all the scores is the predicted value of the sample. The XGboost model can realize parallel processing, and compared with the GBDT algorithm of the Boosting algorithm, the XGboost model has great improvement on the processing speed.

1) Brief introduction to the model

The XGboost algorithm utilizes a Hessian matrix to expand a loss function Taylor to the second order, the original optimization problem is converted into a convex function to solve the optimal solution, and the problem of distributed calculation which is difficult to implement in the GBDT algorithm is solved. Meanwhile, the XGboost algorithm carries out regularization constraint on the complexity of the tree, and the probability of overfitting of the model is reduced. Assuming that the model has K trees, the XGboost model can be expressed as:

wherein, y_i' As model predictor, x_iFor the ith sample, K is the number of trees and F is the space of the regression tree.

The XGboost model generates a new tree in each iteration for fitting the residual error of the previous tree, and the predicted value in the t iteration is assumed to be y_i(t)', then

The optimization objective function can now be expressed as:

the objective function is thus partially composed of a loss value, a regularization term, and a constant term 3, expanding the loss function to the second order, and removing the constant term, the objective function becomes:

the canonical terms that define the model are:

where γ and λ are model parameters, T is the total number of leaf nodes in the tree, w_jThe weight of the jth leaf node in the tree.

Definition of

I_jIs the sample set on the jth leaf node, the objective function is further simplified as:

therefore, the objective function of the model is converted into a quadratic equation of a single element related to the leaf node weight, and the optimal and immediate change of the tree structure is solved

To solve the function optimal solution problem, the final objective function becomes:

f_objthe smaller the value of (A), the better the structure of the whole tree.

2) Model parameter setting

The XGboost model parameters mainly comprise four parameters, namely a minimum leaf node weight min _ child _ weight, a maximum depth max _ depth, a learning rate eta and a gamma value of a combination of a penalty term and a leaf node. The selection of the model parameters greatly affects the model prediction effect, and the XGboost model parameters are optimized by using an improved genetic algorithm in the embodiment in consideration of the fact that the value of each parameter is random.

The genetic algorithm simulates chromosomes by using a solution of a problem, whether the chromosomes are suitable for the environment is judged by using the numerical value of a fitness function, the larger the numerical value of the fitness function is, the more the chromosomes can be suitable for the environment, the survival principle of fitters in the biological evolution theory is used, the chromosomes with larger fitness function values are copied from a parent to be crossed and mutated to form new chromosomes, and the diversity of the solution is maintained.

Coding mode

The coding mode is determined as the basic step of the genetic algorithm, and the binary coding mode and the decimal coding mode are two commonly used coding modes. Each chromosome in the binary coding mode is represented by 0 or 1 to represent each gene position, the coding mode is simple and convenient in decoding and decoding, and steps in crossing and mutation are easy, so that the binary coding is simple and easy to understand. However, when the problem of complex processing and high precision requirement is processed by using binary coding, certain error exists because of discretization of the problem, and when the problem of high dimensionality is processed, the calculation speed is influenced because the binary coding is longer. The decimal code, i.e. decimal number, represents each gene position on each chromosome, the length of the chromosome is determined by the control variable of the model, obviously, for the same model, the length of the chromosome is shorter by adopting the decimal code than by adopting the binary code, and the coding by adopting the decimal code does not need decoding and decoding, so the calculated amount is smaller by adopting the decimal code in general, but each gene position in the chromosome can not be out of range in the crossing and variation operation process.

Generation of initial population

A population consists of many chromosomes and a solution corresponds to one chromosome of the population. The size of the population represents the space size of the model solution, for a large population, although the calculation speed is relatively slow, the optimal solution of the model is easy to find, and conversely, if the population is small, although the calculation speed is relatively fast, the solution of the optimization model is relatively single and is easy to fall into local convergence, so that the initial population size is generally 30-100. The initial population can be obtained by a random method and can also be obtained according to the actual distribution of the population.

Calculating fitness function value

The fitness function is the basis of the genetic algorithm, and is related to the realization of the optimal solution and the calculation speed of the genetic algorithm. In genetic algorithms, the value of the fitness function reflects whether an individual can adapt to the environment and decides whether a parent can be inherited to children. If the fitness function value of an individual is larger, the individual has larger probability to be transmitted to the next generation, otherwise, the individual has high probability to be eliminated. The method for constructing the fitness function mainly comprises a direct construction and an indirect construction, wherein the direct construction is to directly use the objective function as the fitness function without conversion, and the indirect construction is to construct the fitness function by changing the objective function to a certain degree. In this embodiment, the minimum value of the normalized Mean Absolute error NMAE (normalized Mean Absolute error) of the prediction result of the model is used as the fitness, where NMAE is defined as:

where n is the number of predicted sample points, P_iTo predict the power value, P_i ^*As power observations, P_NIs the rated installed capacity.

Reproduction

The replication is a process of simulating natural selection, so the replication process is a selection process, that is, an individual with strong adaptability is selected from a population, and an individual with a large fitness function value is selected by generally adopting a proportional method, a sorting method and a proportional sorting method. The proportion method is widely applied and belongs to one of random selection methods, and is also called as gambling plate selection. If n individuals exist, the circular game board is divided into n sectors, the area of each sector respectively represents the probability of the corresponding individual being selected, and the individual corresponding to the sector with the large area is easy to select, which accords with the principle of survival of the opponent. However, in the case that the population has only few individuals with high fitness, the population may propagate too fast by using the proportional selection method, so that the final result is a locally optimal solution.

Fifth, cross

The cross simulation is a gene recombination process in a biological evolution process, excellent characteristics in individuals are combined to generate filial generations different from parents, the crossing selects points at the same position of the two parents to exchange genes according to a certain probability, and then new filial generations are obtained, and in a traditional genetic algorithm, single-cut-point crossing with small calculation amount is usually utilized, namely, a cut point is selected to exchange genes behind the corresponding cut points of the two parents P1 and P2, so that the filial generations C1 and C2 are obtained. For example, P1 is 10101011, P2 is 10001001 in the parent, and when the position of the cut point is 6, the child C1 is 10101001 and C2 is 10001011 after the intersection. The generation of the new individuals is mainly completed through crossing, the speed of crossing the chromosomes depends on the size of the crossing probability, the search is slowed down or even stopped if the crossing probability is too small, so the value of the crossing probability is generally large, but the new individuals are high in speed and easy to damage if the value is too large, and therefore the crossing probability is generally 0.6-0.95.

Variation of

The variant similar organisms are mutated during evolution, if the algorithm adopts binary coding, the corresponding gene positions are inverted, namely 1 is changed into 0, and 0 is changed into 1, and if the coding adopts decimal coding, the numerical values of the corresponding gene positions can be changed to a certain extent during variation, new individuals can be generated due to variation, the gene types of the population can be increased, and thus the genetic algorithm can not obtain the local optimal solution during solving. The genetic algorithm is changed into a simple random search process due to the large numerical value of the variation probability, but if the variation probability is too small, new individuals are difficult to appear in the population, so the value range of the variation probability is generally 0.0001-0.1.

Seventhly, outputting the optimal solution

In general, convergence criteria of a mathematical optimization algorithm are strict, and the judgment basis of a genetic algorithm is heuristic, and there are two common judgment methods: one is judged according to the quality of the solution, namely the iteration process is considered to be finished when the arrangement sequence of the group solution sets does not change any more; another approach is to allow the maximum number of iterations possible as a convergence criterion.

However, the mutation method in the traditional genetic algorithm has the problem of early maturity because of the cross probability P_cAnd the probability of variation P_mThe convergence characteristic of the genetic algorithm is determined, so that the method for timely changing the crossover and mutation probabilities is adopted in the embodiment for ensuring the global optimal solution, even if the crossover probability and the mutation probability in the algorithm are timely changed along with the value of the individual fitness, the crossover and mutation probabilities are reduced when the solution space of the algorithm cannot be converged, and the crossover probability P is increased when the algorithm is in local optimal_cAnd the probability of variation P_mTo jump out of the algorithm. If the fitness of an individual is lower than the average fitness, the individual is made to have a cross probability P_cAnd the probability of variation P_mLarger values eliminate them, whereas individuals have a lower cross probability P if their fitness is higher than the average fitness_cAnd the probability of variation P_mSo that it can enter belowOne generation, the specific calculation is shown in equations (12) and (13):

where f' is the maximum of the two volume fitness values for the crossover operation, and f_avgIs an average fitness value, f_maxIs the maximum fitness value, f_minThe minimum fitness value is f, and the variant individual fitness value is f.

The flow chart of the improved genetic algorithm provided by the embodiment is shown in fig. 3, and the adaptive cross probability and the mutation probability enable the population to have good diversity on one hand, and enable the genetic algorithm to have good convergence on the other hand, so that the convergence speed of the genetic algorithm can be obviously improved, and the algorithm is effectively prevented from falling into local optimum. The present embodiment employs a double-point crossover operation, and sets the parameters as: p_c1＝0.9，P_c2＝0.6，P_m1＝0.1，P_m2＝0.001。

(3) Wind power error prediction model based on random forest

The random forest algorithm belongs to one of Bagging algorithms, a self-help sampling technology is adopted in the algorithm, a regression tree is utilized to form a combined model, and a schematic diagram of the Bagging algorithm is shown in FIG. 4.

Bagging is characterized by random sampling, wherein samples with the same number as that of a training set are randomly collected and collected for T times to obtain a sampling set. For a sample, in a random sampling of m samples, the probability of being acquired each time is 1/m, and the probability of not being acquired in m samples is:

taking the limit on m can obtain

Therefore, for each round of random sampling of Bagging, about 36.8% of the data is not collected and is called "out-of-bag data", and the data does not participate in fitting of the training set model, but can be used as a test set to test the generalization capability of the model, and the test result is called "outsource estimation". Bagging does not limit the weak learners, decision trees and neural networks are usually selected, and for the regression problem, Bagging usually adopts a simple average method to perform arithmetic average on regression results obtained by the T weak learners to obtain final model output.

1) Brief introduction to the model

The random forest algorithm is improved on the basis of Bagging algorithm, and the algorithm generates a series of mutually independent regression trees

The model is combined by averaging the output of the model, and the mathematical expression is as follows:

wherein N is the number of regression tree models.

In the process of establishing each regression tree, random sampling is carried out on a sample space and a feature space, random attributes are introduced, further, the correlation among regression tree models is reduced, and the generalization of the models is improved by combining a large number of regression trees, so that the algorithm has the characteristics of high efficiency, accuracy and the like. In the process of establishing the model, the number N of the regression tree models is a main model parameter, and the gradual increase of N can cause the generalization error of the random forest to be converged to a limit without an overfitting phenomenon, so that the number of the generated regression tree models can be increased for data containing a large amount of noise, the random forest algorithm can be ensured to show good performance, and the value range of N is generally hundreds to thousands.

2) Model parameter optimization

The random forest model parameters mainly comprise three parameters of tree number trees _ num of the forest, feature number max _ features randomly selected by each decision tree and deepest depth max _ depth of the tree, and similarly, the XGboost model parameters are determined, and the model parameters of the random forest are optimized by using an improved genetic algorithm.

(3) Adding the prediction results output by the two models to obtain a final short-term wind power prediction result;

specifically, operation and numerical weather forecast data of a wind power plant are collected in real time, the XGboot model and the random forest model are input to obtain a wind power preliminary prediction result and a wind power error prediction result respectively, and finally the wind power preliminary prediction result and the power error prediction value are added to obtain a final wind power prediction value.

The embodiment also provides corresponding prediction and evaluation indexes, which are specifically as follows:

in this embodiment, the normalized Mean absolute error NMAE and the normalized Root Mean Square error NRMSE (normalized Root Mean Square error) are selected as the evaluation indexes of the prediction result, and the NRMSE is defined as follows:

where n is the number of predicted sample points, P_iFor the wind power prediction, P_i ^*For the actual wind power value, P_NIs the rated installed capacity.

The relative prediction error RE of the wind power is defined as follows:

wherein, P_iFor the wind power prediction, P_i ^*And the actual wind power value is obtained.

The predicted results are as follows:

the analysis of the embodiment utilizes 10 wind power plants in Jilin province to simulate the wind power prediction with a prospective 72 hours. The data set comprises NWP data and historical wind power data of each wind power plant in 2017 and 2019, the time resolution is 15 minutes, and the data of the data set in 2017, 2018 and 2019 are divided into a training set A, a training set B and a testing set respectively.

In order to verify the effectiveness of the proposed model, three models, namely M1, M2 and M3, are selected as reference models for comparison in the present embodiment, and the model proposed in the present embodiment is denoted as M4.

M1: and predicting by using an XGboost model optimized by an improved genetic algorithm.

M2: and (4) predicting by using a random forest model optimized by an improved genetic algorithm.

M3: and predicting by using the BP neural network optimized by the improved genetic algorithm.

Table 2 shows the NMAE value of each wind farm when each model is used for prediction and the average NMAE value of 10 wind farms, and it can be seen from the data in the table that when each wind farm is predicted by using the model provided in this embodiment, the prediction error is smaller than that of the other three models, and when 10 wind farms are predicted by using the model provided in this embodiment, the average NMAE value is 9.26%, whereas when 3 models, M1, M2 and M3, are used for prediction, the average NMAE values are 10.35%, 11.92% and 12.13%, respectively. Particularly, the prediction accuracy of the model provided by the embodiment is improved by 2.87% compared with that of the commonly-used BP neural network.

Table 2 NMAE value (%) -of each model prediction result

Fig. 5 and 6 show the NMAE value and the NRMSE value of 10 wind farms respectively predicted by using the models in four different seasons, namely spring, summer, autumn and winter, respectively, and it can be seen from the data in the graphs that the values of the NMAE and the NRMSE of the prediction results obtained by using the model provided in the present embodiment for prediction are the minimum in any season, such as spring, summer, autumn and winter, and the prediction error fluctuation is small when using the model provided in the present embodiment for prediction in four seasons, namely spring, summer, autumn and winter.

In order to intuitively explain the prediction effect, taking a No.6 wind power plant as an example, fig. 7-10 show the wind power prediction results of the wind power plant in four seasons, namely spring, summer, autumn and winter, respectively, and it can be seen from the graph that the wind power prediction result obtained by using the wind power integrated prediction model based on error correction provided by the embodiment is closer to the real power, and the result verifies that the model provided by the embodiment has good deterministic prediction performance. Fig. 11 shows a variation curve of the relative prediction error of the No.6 wind farm with time, and it can be seen from the diagram that the relative prediction error value of the wind power is close to 0, although the absolute value of the relative prediction error of the wind power increases with the increase of the prediction time, the absolute value of the prediction error is always kept within 15%, and the prediction result verifies that the proposed model can obtain a good wind power prediction result.

In general, the model provided by the embodiment can improve the short-term wind power prediction precision. The method fully utilizes the advantages that integrated learning can be combined with a plurality of weak supervision models to obtain a comprehensive strong supervision model, the XGboost is utilized to establish a wind power preliminary prediction model, an error prediction model based on a random forest is established according to a residual error learning method, the prediction results of the XGboost and the random forest are added to serve as a final prediction result, the effectiveness of the method is proved through example result analysis, and the comparison result shows that the model provided by the example can improve the short-term wind power prediction precision, further optimize the power generation plan of a conventional power unit and improve the consumption and acceptance of a power grid for new energy power generation.

Example two

According to the embodiment of the invention, the short-term wind power integrated prediction system based on error correction is disclosed, and comprises the following components:

It should be noted that specific implementation manners of the modules are already described in detail in the first embodiment, and are not described again.

EXAMPLE III

According to an embodiment of the present invention, an embodiment of a terminal device is disclosed, which includes a processor and a memory, the processor being configured to implement instructions; the memory is used for storing a plurality of instructions, and the instructions are suitable for being loaded by the processor and executing the short-term wind power integration prediction method based on error correction in the first embodiment.

In other embodiments, a computer-readable storage medium is disclosed, wherein a plurality of instructions are stored, and the instructions are suitable for being loaded by a processor of a terminal device and executing the short-term wind power integration prediction method based on error correction in the first embodiment.

Although the embodiments of the present invention have been described with reference to the accompanying drawings, it is not intended to limit the scope of the present invention, and it should be understood by those skilled in the art that various modifications and variations can be made without inventive efforts by those skilled in the art based on the technical solution of the present invention.

Claims

1. A short-term wind power integration prediction method based on error correction is characterized by comprising the following steps:

acquiring operation data and weather forecast data of a wind power plant;

2. The short-term wind power integration prediction method based on error correction as claimed in claim 1, wherein the wind farm operational data and weather forecast data include but are not limited to power generation, wind direction, wind speed, temperature, humidity and barometric pressure data of a wind farm.

3. The short-term wind power integration prediction method based on error correction as claimed in claim 1, wherein the wind power prediction model is constructed by using an XGboost model optimized by an improved genetic algorithm.

4. The short-term wind power integration prediction method based on error correction as claimed in claim 3, characterized in that during construction of the wind power prediction model, binary coding is adopted, the minimum leaf node weight, the maximum depth, the learning rate and the gamma value of the XGboost model are selected as independent variable parameters, population is initialized in a random mode, each obtained chromosome contains the independent variable parameters of the XGboost model, the normalized mean absolute error NMAE minimum of the model prediction result is used as the model fitness, and then the optimal solution of the independent variable parameters of the XGboost model is found through replication, timely change, improved intersection and variation.

5. The short-term wind power integration prediction method based on error correction as claimed in claim 1, wherein the wind power error prediction model is constructed by using a random forest model optimized by an improved genetic algorithm.

6. The short-term wind power integration prediction method based on error correction as claimed in claim 1, characterized in that, during construction of the wind power error prediction model, binary coding is adopted, the number of trees of the random forest model is selected, the feature number randomly selected by each decision tree, the deepest depth of the trees is used as a parameter, the population is initialized in a random manner, each obtained chromosome contains three parameters of the random forest model, the normalized mean absolute error NMAE of the model prediction result is minimized as the model fitness, and then the optimal solution of the three parameters of the random forest model is found through replication, timely-varying improved intersection and variation.

7. The short-term wind power integrated prediction method based on error correction as claimed in claim 1, characterized in that the operation and numerical weather forecast data of the wind farm are collected in real time, the trained XGboost and random forest models are respectively used for prediction, a wind power preliminary prediction result and a wind power error prediction result are correspondingly obtained, and the final wind power prediction result can be obtained by adding the two model prediction results.

8. A short-term wind power integrated prediction system based on error correction is characterized by comprising the following components:

9. A terminal device comprising a processor and a memory, the processor being arranged to implement instructions; the memory is used for storing a plurality of instructions, wherein the instructions are suitable for being loaded by the processor and executing the short-term wind power integration prediction method based on error correction according to any one of claims 1-8.

10. A computer-readable storage medium having stored thereon a plurality of instructions, wherein the instructions are adapted to be loaded by a processor of a terminal device and to perform the error correction based short-term wind power integration prediction method of any one of claims 1-8.