CN116341717A - Wind speed prediction method based on error compensation - Google Patents

Wind speed prediction method based on error compensation Download PDF

Info

Publication number
CN116341717A
CN116341717A CN202310156314.XA CN202310156314A CN116341717A CN 116341717 A CN116341717 A CN 116341717A CN 202310156314 A CN202310156314 A CN 202310156314A CN 116341717 A CN116341717 A CN 116341717A
Authority
CN
China
Prior art keywords
error
value
wind speed
sequence
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310156314.XA
Other languages
Chinese (zh)
Inventor
焦绪国
张道源
田艳兵
杨秦敏
孟文超
刘文锋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University ZJU
Qingdao University of Technology
Original Assignee
Zhejiang University ZJU
Qingdao University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University ZJU, Qingdao University of Technology filed Critical Zhejiang University ZJU
Priority to CN202310156314.XA priority Critical patent/CN116341717A/en
Publication of CN116341717A publication Critical patent/CN116341717A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Economics (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • General Physics & Mathematics (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biophysics (AREA)
  • Game Theory and Decision Science (AREA)
  • Development Economics (AREA)
  • Public Health (AREA)
  • Water Supply & Treatment (AREA)
  • Operations Research (AREA)
  • Primary Health Care (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Quality & Reliability (AREA)
  • Biomedical Technology (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a wind speed prediction method based on error compensation, which is characterized in that an ARMA model is established by utilizing historical data, the order determined by the ARMA model is used for dividing a data set, then a SVR model is trained by utilizing the divided data and wind speed prediction is carried out, and an error training set is obtained by subtracting a real wind speed value from a predicted wind speed value of the training set. And (3) the obtained error training set is re-utilized by the method, an ARMA model is established for the error sequence, and the bias autocorrelation coefficient is utilized as a basis to divide the error training set. After the error sequence is processed, the ELM is used for predicting the error, and the obtained error prediction result and the wind speed prediction result are added to obtain a wind speed prediction value after error compensation. The verification experiment proves that compared with the traditional direct prediction method, the error compensation prediction wind speed prediction method adopted by the invention has better fitting effect in the period of higher frequency and quicker wind speed change, and the accuracy of wind speed prediction is improved as a whole.

Description

Wind speed prediction method based on error compensation
Technical Field
The invention belongs to the technical field of wind power prediction, and can be used for predicting univariate wind speed time series data, in particular to an ARMA and SVR wind speed prediction method based on ELM error compensation.
Background
Human demand for energy has increased more and more since the 21 st century. However, combustion of fossil fuels such as coal, oil, and natural gas still occupies the main body of the world's energy structure. However, the combustion of fossil fuels entails the emission of greenhouse gases, harmful gases, which cause serious environmental problems, and, due to their non-renewable nature, the reserves of fossil energy on earth are becoming smaller and smaller, eventually depleted.
Scientists began to study emerging energy sources in order to address the energy crisis. At present, common new energy technologies are: wind power generation, hydroelectric power generation, photovoltaic power generation, tidal energy, nuclear energy, geothermal energy and the like, wherein the wind power generation technology is one of the new technologies with the fastest growth of installed capacity in recent years, and is more than photovoltaic power generation, and the wind power generation technology is a second most renewable energy source next to hydropower in the world energy market.
However, the wind often has instability and fluctuation, which causes difficulty in wind power utilization, large fluctuation of power generation power, difficulty in grid connection, and even damage to a fan when the wind speed changes rapidly, large-area off-grid caused by the damage to the fan, and influences the power generation quality, thereby endangering personnel safety. In order to stabilize the generated power and improve the generated quality, it is necessary to predict the wind speed.
Wind speed prediction is very important in the field of wind power generation. Early wind speed predictions typically use linear models, such as autoregressive moving average (ARMA), to find relationships between historical values, and after 90 s, artificial intelligence techniques are increasingly emerging, and many nonlinear methods are used to predict wind speed, such as Artificial Neural Networks (ANNs), support Vector Machines (SVMs), extreme Learning Machines (ELMs), long-short term memory networks (LSTM), convolutional Neural Networks (CNNs), etc., which typically build predictive models by taking historical wind speeds and other features affecting wind speed as inputs. Although these methods can achieve good predictive effects as a whole, in some time periods where the frequency is relatively high and the speed changes drastically, the predictive effects may not be ideal.
In order to further improve the prediction precision of the model, the application provides a prediction method based on ELM error compensation, wherein the ARMA modeling method is used for dividing historical wind speed data to serve as input to establish SVR model to predict future wind speed, then the error of the historical wind speed predicted value is used for establishing ELM model to predict future error value, and then the error value is added with the wind speed predicted value to obtain the wind speed predicted value after error compensation. Finally, through verification experiments, the method is proved to be capable of improving the accuracy of wind speed prediction on the whole.
Disclosure of Invention
In general, the more features it is for an object to be predicted, the more accurate the prediction effect. In wind speed prediction, wind speed is the object of prediction, the value of which is affected by factors such as ambient temperature, atmospheric pressure, ambient humidity, air density and the like, and if more data including these features is collected, the prediction is more accurate, but in actual measurement, many features are ignored and only the object to be predicted is left. For wind speed predictions, many wind farms collect only historical wind speeds and few other factors, so how to correctly select the number of input historical wind speeds for the predictive model becomes critical. Meanwhile, the conventional methods, such as ARMA, often have a problem that the wind speed prediction accuracy is lowered as the prediction time is prolonged. To this end, the invention provides a wind speed prediction method based on error compensation, comprising the steps of: .
(1) Obtaining historical wind speed data of a wind power plant, carrying out stability test on the sequence, modeling an ARMA model if the stability test is passed, and converting the unstable sequence into a stable sequence by adopting a difference method if the stability test is not passed;
(2) Using the stable sequence obtained in the step (1) for ARMA modeling, adopting an AIC criterion, and solving a partial autocorrelation coefficient p and an autocorrelation coefficient q which enable the AIC value to be minimum by using an iterative algorithm;
wherein, ARMA model is:
Figure BDA0004092482270000021
wherein alpha is i Is an autoregressive coefficient, beta j Is a moving average coefficient, both of which are predetermined coefficients and are non-zero, ε t As an error term at time t, x t Represents the value of time t, epsilon t-j Is the error term at the time t-j, x t-i A value at time t-i;
(3) Dividing the data set by using the p value obtained in the step (2) as a basis;
(4) SVR modeling is carried out, a Gaussian kernel function is selected by adopting a manual trial-and-error method, the value of a penalty factor is determined, then a model is trained, and the accuracy of model prediction is checked by using a test set;
(5) Predicting the training set by using the trained SVR model to obtain a wind speed predicted value of the training set, and subtracting the predicted wind speed value from the real wind speed value of the training set to obtain an error training set;
(6) The error training set is processed by adopting the same processing method as the wind speed training set, namely, the steps (1) and (2) are repeated on the error sequence, the ARMA modeling method is used, the stability of the error sequence is firstly judged, if the error sequence is unstable, differential processing is needed, an ARMA model is built, and an AIC criterion is utilized and an iterative algorithm is used for solving a partial autocorrelation coefficient p 'and an autocorrelation coefficient q' which enable the AIC value of the ARMA model of the error sequence to be minimum;
(7) Dividing the error data set by using the partial autocorrelation coefficient p' in the step (6);
the dividing process of the step (3) and the step (7) is the same, namely, from the first value to the p 'value of the sequence, a group of data is formed at intervals of p' units, then, from the second value of the sequence, p 'units are formed, and p' +1 values are formed as a second group of data, and the like, until the data set is divided. For each set of data, the first p '-1 value is taken as input and the p' th value is taken as output, i.e. the p 'th value is predicted using the first p' -1 value.
(8) And (3) training an ELM model by using the error training set in the step (7), and carrying out error prediction by using the model to obtain an error prediction value. And finally, adding the error predicted value and the wind speed predicted value in the step (4) to obtain the wind speed predicted value after error compensation.
Output y of ELM i The method comprises the following steps:
Figure BDA0004092482270000031
wherein x is i Is an input to ELM; n represents the input dimension, namely the number of neurons of the input layer; k is the number of neurons in the hidden layer; omega j And b j Weights and biases between the jth hidden layer neuron to the input, respectively; beta j Weights between the jth hidden layer neuron and the output; g (·) is an activation function, a Sigmoid function or an RBF function can be selected, and the Sigmoid function is used in the invention;
(9) And finally, evaluating the accuracy of model prediction by using the root mean square error.
Root Mean Square Error (RMSE) is defined as:
Figure BDA0004092482270000032
wherein the method comprises the steps of
Figure BDA0004092482270000035
Representing the predicted value, x i Representing the true value, N represents the total number of samples.
Preferably, in the step (1), the plateau sequence means:
Figure BDA0004092482270000033
wherein E (X) t ) Representing sequence X t Var (X) t ) Representing variance, r t,t-i And r 0,i Autocorrelation coefficients, μ, σ, respectively representing values at times t and t-i and times 0 and i 2 These two numbers are constants.
Preferably, in the step (1), the stationary sequence means that the mean and variance of the sequence do not change with time, and the autocorrelation coefficients at the same time intervals are the same, that is:
Figure BDA0004092482270000034
wherein E (X) t ) Representing sequence X t Var (X) t ) Representing variance, r t,t-i And r 0,i Autocorrelation coefficients, μ, σ, respectively representing values at times t and t-i and times 0 and i 2 Is a constant;
the difference method refers to:
y t =Y t -Y t-1
wherein y is t Represents the value after the difference at time t, Y t And Y t-1 The values at time t and time t-1 of the original data are shown, respectively. Preferably, in the step (2), the AIC criterion means:
AIC=2k-2ln(L)
where k is the number of unknown parameters in the model and L is the maximum likelihood function value of the model.
Preferably, in the step (2), the ARMA model is:
Figure BDA0004092482270000041
wherein alpha is i Is an autoregressive coefficient, beta j Is a moving average coefficient, both of which are predetermined coefficients and are non-zero, ε t As an error term at time t, x t Represents the value of time t, epsilon t-j Is the error term at the time t-j, x t-i Is the value at time t-i.
Preferably, in the step (3), the autocorrelation function means:
Figure BDA0004092482270000042
wherein ρ is t,t-k X represents t And X t-k Is used as a function of the autocorrelation function of (c),
Figure BDA0004092482270000046
representing sequence X t Is a mean value of (c).
Preferably, in the step (3), the manner of dividing the training set and the test set is as follows: the first three quarters of the divided data are used as training sets, and the last quarter is used as testing set.
Preferably, in the step (4), the original model of SVR is:
Figure BDA0004092482270000043
Figure BDA0004092482270000044
wherein s.t. represents constraint conditions, C is penalty factor, m is sample number, and ζ i And
Figure BDA0004092482270000047
is a relaxation variable, ε is a tolerance deviation, y i Is the output, w and b are hyperplane parameters that need to be optimized.
Preferably, in the step (4), the gaussian kernel function is:
Figure BDA0004092482270000045
where x 'represents the origin of the kernel function, x represents the sampling point, |x-x' || 2 Representing the euclidean distance between the sampling point and the origin, σ controls the correlation between equally spaced sampling points of the gaussian kernel.
Preferably, in the step (4), the Root Mean Square Error (RMSE) is defined as:
Figure BDA0004092482270000051
wherein the method comprises the steps of
Figure BDA0004092482270000052
Representing the predicted value, x i Representing the true value, N represents the total number of samples.
Compared with the prior art, the invention has the following advantages:
1. the invention provides a wind speed prediction method based on error compensation, which is an ARMA and SVR combination method based on ELM error compensation, and can divide a univariate wind speed time sequence into a plurality of sections of sequences for model training. Compared with the data partitioning method by the trial-and-error method, the SVR model training set is optimized, the selection of the input quantity is more reasonable, the error compensation prediction is carried out by combining the ELM, the prediction error in the period with larger wind speed fluctuation is made up, and the good prediction effect is finally obtained.
2. According to the SVR model, different amounts of intercepted data are selected as input, so that compared with a trial-and-error method, the SVR model has a better prediction effect.
3. According to the invention, by adopting the ELM as an error compensation method, the historical error is predicted and added with the wind speed predicted value, so that the prediction effect in a period with higher frequency and faster wind speed change is improved.
Drawings
The invention is further described below with reference to the accompanying drawings.
FIG. 1 is wind speed data of a certain wind farm in China;
FIG. 2 is a sequence after the difference of the original wind speeds of the training set, wherein the sequence is a stable sequence;
FIG. 3 is a graph of autocorrelation and partial autocorrelation functions after differentiating the original data;
FIG. 4 is a flow chart of ARMA and SVR wind speed prediction method design based on ELM error compensation;
FIG. 5 shows the predicted effect of the trained model when p is 4;
FIG. 6 shows the predicted effect of the trained model when p is 2;
FIG. 7 shows the predicted effect of the trained model when p is 6;
FIG. 8 is error training set data;
FIG. 9 is error training set data after differencing;
FIG. 10 is a graph of autocorrelation and partial autocorrelation functions of an error training set;
FIG. 11 is an ELM error prediction result;
FIG. 12 is a final wind speed prediction based on ELM with error compensation.
Detailed Description
The invention will be described in further detail with reference to the drawings and the specific examples.
For wind speed prediction, the wind speed is generally affected by factors such as ambient temperature, atmospheric pressure, ambient humidity, air density and the like, and these characteristics can be used as the object of machine learning, but in the actually collected data, in consideration of cost or other factors, some wind farms generally cannot detect these factors, only the object to be predicted, namely the wind speed, is collected, and the finally obtained data becomes a univariate time series only containing the historical wind speed. For this data, it cannot be directly used as input to some methods such as SVR, so that the original wind speed sequence must be processed in order to predict wind speed using some more advanced machine learning method. The processed sequence is characterized by being only historical values of a plurality of wind speeds, so that the prediction effect is possibly poor in some places with larger frequency fluctuation in the prediction process, and an error compensation method can be adopted to compensate the part of errors.
The invention aims to treat a group of univariate wind speed sequences into sequences which can be input as SVR models through an ARMA modeling method, then predict future wind speeds by using the SVR models, process historical errors through ELM to obtain predicted values of errors, and finally add the predicted future wind speeds with the predicted errors to obtain future wind speeds after error compensation.
According to the method, an ARMA model is established by utilizing historical data, the order determined by the ARMA model is used for dividing a data set, then the SVR model is trained by utilizing the divided data, wind speed prediction is carried out, and the real wind speed value of the training set is subtracted from the predicted wind speed value to obtain an error training set. And (3) the obtained error training set is re-utilized by the method, an ARMA model is established for the error sequence, and the bias autocorrelation coefficient is utilized as a basis to divide the error training set. After the error sequence is processed, the ELM is used for predicting the error, and the obtained error prediction result and the wind speed prediction result are added to obtain a wind speed prediction value after error compensation.
Before dividing a data set, carrying out fixed order on the original data set by using an ARMA modeling method, dividing the data set by using the obtained value of the partial autocorrelation coefficient p to obtain a time sequence with stronger multi-section correlation, dividing the divided data into a training set and a test set, training the training set for an SVR model, then checking the prediction effect of the trained model by using the test set, finally subtracting a true value from a predicted value to obtain an error sequence, dividing the data by using the partial autocorrelation coefficient of the ARMA model for the same processing of the error sequence and the original wind speed data, carrying out error prediction by using ELM, and adding the error prediction result and the wind speed prediction result to obtain the wind speed predicted value after error compensation.
As shown in fig. 4, the invention adopts the ARMA and SVR combined prediction method with ELM error compensation, and specifically comprises the following steps:
step 1, collecting wind speed data of a wind power plant, carrying out ADF stability test, carrying out ARMA modeling if the ADF stability test passes the test, and carrying out differential processing if the ARMA modeling does not pass the test. The wind speed data of the wind farm is shown in fig. 1, and is obviously an unstable sequence, because the mean value and the variance of the wind farm are continuously changed, so that differential processing is performed, and in fig. 2, the sequence is a stable sequence, and ARMA modeling can be performed.
The plateau sequence refers to:
Figure BDA0004092482270000061
wherein E (X) t ) Representing sequence X t Var (X) t ) Representing variance, r t,t-i And r 0,i Autocorrelation coefficients, μ, σ, respectively representing values at times t and t-i and times 0 and i 2 These two numbers are constants.
The difference method refers to:
y t =Y t -Y t-1
wherein y is t Represents the value after the difference at time t, Y t And Y t-1 The values at time t and time t-1 of the original data are shown, respectively.
And 2, using the stable sequence obtained in the step 1 for ARMA modeling, adopting an AIC criterion, and solving a partial autocorrelation coefficient p and an autocorrelation coefficient q which enable the AIC value to be minimum by using an iterative algorithm. In the invention, the partial autocorrelation coefficient p of the model is mainly utilized, because p reflects the correlation between the current value and the historical value, the value of p represents that the correlation degree of the current value and all the historical values of the hysteresis p number is large, and the p values are utilized to form a group of data, so that reference can be provided for dividing the data, the data waste can not be caused when the number of the divided data is selected, and the machine performance loss can not be caused because the data is selected too much.
The ARMA model in this embodiment is:
Figure BDA0004092482270000071
wherein alpha is i Is an autoregressive coefficient, beta j Is a moving average coefficient, both of which are predetermined coefficients and are non-zero, ε t As an error term at time t, x t Represents the value of time t, epsilon t-j Is the error term at the time t-j, x t-i Is the value at time t-i. From the above, it can be seen that the current value x t With the first p values
Next, the p and q values that minimize the AIC values are found using AIC criteria, in combination with the iterative idea. The AIC criterion gives the best estimation of ARMA model order and parameters at the same time, and is suitable for the problem of less sample data. Wherein the expression of the AIC criterion is:
AIC=2k-2ln(L)
where k is the number of unknown parameters in the model and L is the maximum likelihood function value of the model.
By using the above steps, the obtained p value can be used as a basis for dividing the data set.
And 3, utilizing the p interception data obtained in the step 2, wherein p obtained in the embodiment is 4. The specific intercepting method comprises the following steps: from the data at the first moment to the data at the p-th moment, each p values are used as a group of new data, the previous p-1 number is selected as input data, and the p-th number is used as output; and then, according to the same rule, carrying out the same dividing operation on the data from the second moment to the p+1st moment, and repeating the steps until the data set is completely divided.
And 4, SVR modeling is carried out, a manual trial and error method is adopted to determine ideal punishment factors and kernel function parameters, then the divided data set is utilized for training, the accuracy of the model is checked by using the test set, and the prediction effect is shown in figure 5. The original model of SVR is as follows:
Figure BDA0004092482270000081
Figure BDA0004092482270000082
wherein s.t. represents constraint conditions, C is penalty factor, m is sample number, and ζ i And
Figure BDA0004092482270000083
is a relaxation variable, ε is a tolerance deviation, y i Is the output, w and b are hyperplane parameters that need to be optimized, +.>
Figure BDA0004092482270000084
Is a kernel function, important in SVR, and the kernel function of this embodiment is implemented by using a gaussian kernel function (radial basis function), and its expression is:
Figure BDA0004092482270000085
where x 'represents the origin of the kernel function, x represents the sampling point, |x-x' || 2 Representing the euclidean distance between the sampling point and the origin, σ controls the correlation between equally spaced sampling points of the gaussian kernel.
Step 5, in order to prove that the partitioning basis selected by the method is relatively good, a trial-and-error method is utilized, different p values are selected as the partitioning basis to retrain the model, the prediction result is compared with the result of the step 4, the accuracy of the prediction is judged by using root mean square error, and referring to fig. 6, p=2 and 7,p =6.
The comparison shows that when p=4, namely the p value solved in the step 2, the prediction effect is best, at the moment, the root mean square error is only 0.91226, when p=2, the root mean square error is 0.91539, when p=6, the root mean square error is 0.91718, and the root mean square error is higher than the model when p obtains the optimal value, therefore, by comparison, the data intercepted according to the p value determined by the ARMA model is proved to be used as input, the trained SVR model has better prediction effect, and the prediction precision is improved.
And 6, testing the training set by using the model trained in the step 4, wherein the aim of the step is to obtain a predicted value of the training set by using the predicted model, and subtracting the predicted wind speed value of the training set from the actual wind speed value of the training set, so that an error training set can be obtained to establish an error prediction model.
And 7, for the error training set obtained in the step 6, adopting the same processing method as that of the air speed training set, namely repeating the steps 1 and 2 for the error sequence, and firstly judging the stability of the error sequence by using an ARMA modeling method, and if the stability is unstable, carrying out differential processing. As shown in fig. 8, it is apparent that the error sequence is an unstable time sequence, and it is necessary to perform differential processing to make it stable. The differential error sequence is shown in fig. 9, where the sequence is a plateau sequence.
Step 8, building an ARMA model, and solving a partial autocorrelation coefficient p 'and an autocorrelation coefficient q' which minimize the AIC value of the error sequence ARMA model by using an AIC criterion and an iterative algorithm;
step 9, dividing an error data set by using the partial autocorrelation coefficient p' obtained in the step 8; this step is the same as the partitioning process of step 3, i.e. starting from the first value to the p-th value of the sequence, every p units as a set of data, starting from the second value of the sequence, then further partitioning by p units, to the p+1-th value as a second set of data, and so on, until the data set is partitioned. For each set of data, the first p-1 values are taken as inputs and the p-th value is taken as output, i.e., the p-th value is predicted using the first p-1 values.
And step 10, establishing a prediction model for the divided error training set obtained in the step 8 by using ELM, and predicting future errors. The output of ELM is:
Figure BDA0004092482270000091
wherein x is i Is an input to ELM; n represents the input dimension, namely the number of neurons of the input layer; k is the number of neurons in the hidden layer; omega j And b j Weights and biases between the jth hidden layer neuron to the input, respectively; beta j To the output for the jth hidden layer neuronThe weight of the space; g (·) is the activation function. Here the activation function selects the Sigmoid function, whose expression is:
Figure BDA0004092482270000092
the comparison of error using ELM prediction with true error is shown in fig. 11. It can be seen that the prediction error fits the true error well as a whole, only burrs occur at individual points, but the final prediction result is not greatly affected in view of the small unit of error.
And finally, adding the obtained error predicted value and the wind speed predicted value to obtain a final wind speed predicted value, wherein the result is shown in fig. 12, the root mean square error of a predicted model is 0.89609, which is lower than the root mean square error 0.91226 of fig. 5, and the wind speed predicted effect after error compensation prediction is better than that of the direct predicted graph 5.
The verification experiment proves that compared with the traditional direct prediction method, the error compensation prediction wind speed prediction method adopted by the invention has better fitting effect in the period of higher frequency and quicker wind speed change, and the accuracy of wind speed prediction is improved as a whole.
The above description is only of the preferred embodiments of the present invention and is not intended to limit the present invention, but various modifications and variations can be made to the present invention by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (9)

1. A wind speed prediction method based on error compensation, comprising the steps of:
(1) Obtaining historical wind speed data of a wind power plant, carrying out stability test on the sequence, modeling an ARMA model if the stability test is passed, and converting the unstable sequence into a stable sequence by adopting a difference method if the stability test is not passed;
(2) Using the stable sequence obtained in the step (1) for ARMA modeling, adopting an AIC criterion, and solving a partial autocorrelation coefficient p and an autocorrelation coefficient q which enable the AIC value to be minimum by using an iterative algorithm;
wherein, ARMA model is:
Figure FDA0004092482260000011
wherein alpha is i Is an autoregressive coefficient, beta j Is a moving average coefficient, both of which are predetermined coefficients and are non-zero, ε t As an error term at time t, x t Represents the value of time t, epsilon t-j Is the error term at the time t-j, x t-i A value at time t-i;
(3) Dividing the data set by using the p value obtained in the step (2) as a basis;
(4) SVR modeling is carried out, a Gaussian kernel function is selected by adopting a manual trial-and-error method, the value of a penalty factor is determined, then a model is trained, and the accuracy of model prediction is checked by using a test set;
(5) Predicting the training set by using the trained SVR model to obtain a wind speed predicted value of the training set, and subtracting the predicted wind speed value from the real wind speed value of the training set to obtain an error training set;
(6) The same processing method as that of the wind speed training set is adopted for the error training set, namely, the steps (1) and (2) are repeated for the error sequence, the ARMA modeling method is adopted, the stability of the error sequence is firstly judged, if the error sequence is unstable, differential processing is needed, an ARMA model is built, and the AIC criterion is utilized and an iterative algorithm is used for solving the partial autocorrelation coefficient p which enables the AIC value of the ARMA model of the error sequence to be minimum And autocorrelation coefficient q
(7) Using the partial autocorrelation coefficient p in step (6) Dividing the error data set;
wherein the division of step (3) and step (7) is the same, i.e. starting from the first value of the sequence to the p-th Every other p In units of a set of data, followed by a further p-interval starting from the second value of the sequence Units of p The +1 values are the second set of data, and so on until the data set is partitioned. Each group of data, the first p -1 value as input, p The value being taken as output, i.e. p before use -1 value, predict p A value.
(8) And (3) training an ELM model by using the error training set in the step (7), and carrying out error prediction by using the model to obtain an error prediction value. And finally, adding the error predicted value and the wind speed predicted value in the step (4) to obtain the wind speed predicted value after error compensation.
Output y of ELM i The method comprises the following steps:
Figure FDA0004092482260000021
wherein x is i Is an input to ELM; n represents the input dimension, namely the number of neurons of the input layer; k is the number of neurons in the hidden layer; omega j And b j Weights and biases between the jth hidden layer neuron to the input, respectively; beta j Weights between the jth hidden layer neuron and the output; g (·) is an activation function, a Sigmoid function or an RBF function can be selected, and the Sigmoid function is used in the invention;
(9) And finally, evaluating the accuracy of model prediction by using the root mean square error.
Root Mean Square Error (RMSE) is defined as:
Figure FDA0004092482260000022
wherein the method comprises the steps of
Figure FDA0004092482260000025
Representing the predicted value, x i Representing the true value, N represents the total number of samples.
Preferably, in the step (1), the plateau sequence means:
Figure FDA0004092482260000023
wherein E (X) t ) Representing sequence X t Var (X) t ) Representing variance, r t,t-i And r 0,i Autocorrelation coefficients, μ, σ, respectively representing values at times t and t-i and times 0 and i 2 These two numbers are constants.
2. The method of claim 1, wherein in the step (1), the stationary sequences means that the mean and variance of the sequences do not change with time, and the autocorrelation coefficients at the same time intervals are the same, namely:
Figure FDA0004092482260000024
wherein E (X) t ) Representing sequence X t Var (X) t ) Representing variance, r t,t-i And r 0,i Autocorrelation coefficients, μ, σ, respectively representing values at times t and t-i and times 0 and i 2 Is a constant;
the difference method refers to:
y t =Y t -Y t-1
wherein y is t Represents the value after the difference at time t, Y t And Y t-1 The values at time t and time t-1 of the original data are shown, respectively.
3. The method of claim 1, wherein in the step (2), the AIC criterion is:
AIC=2k-2ln(L)
where k is the number of unknown parameters in the model and L is the maximum likelihood function value of the model.
4. The method of claim 1, wherein in the step (2), the ARMA model is:
Figure FDA0004092482260000031
wherein alpha is i Is an autoregressive coefficient, beta j Is a moving average coefficient, both of which are predetermined coefficients and are non-zero, ε t As an error term at time t, x t Represents the value of time t, epsilon t-j Is the error term at the time t-j, x t-i Is the value at time t-i.
5. The method of claim 1, wherein in the step (3), the autocorrelation function means:
Figure FDA0004092482260000032
wherein ρ is t,t-k X represents t And X t-k Is used as a function of the autocorrelation function of (c),
Figure FDA0004092482260000037
representing sequence X t Is a mean value of (c).
6. The method of claim 1, wherein in the step (3), the training set and the test set are divided in such a way that: the first three quarters of the divided data are used as training sets, and the last quarter is used as testing set.
7. The method of claim 1, wherein in the step (4), the original model of SVR is:
Figure FDA0004092482260000033
Figure FDA0004092482260000034
wherein s.t. represents constraint conditions, C is penalty factor, m is sample number, and ζ i And
Figure FDA0004092482260000035
is a relaxation variable, ε is a tolerance deviation, y i Is the output, w and b are hyperplane parameters that need to be optimized.
8. The method of claim 1, wherein in the step (4), the gaussian kernel function is:
Figure FDA0004092482260000036
where x 'represents the origin of the kernel function, x represents the sampling point, |x-x' || 2 Representing the euclidean distance between the sampling point and the origin,
sigma controls the correlation between equally spaced sampling points of the gaussian kernel.
9. The method of claim 1, wherein in the step (4), the Root Mean Square Error (RMSE) is defined as:
Figure FDA0004092482260000041
wherein the method comprises the steps of
Figure FDA0004092482260000042
Representing the predicted value, x i Representing the true value, N represents the total number of samples.
CN202310156314.XA 2023-02-23 2023-02-23 Wind speed prediction method based on error compensation Pending CN116341717A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310156314.XA CN116341717A (en) 2023-02-23 2023-02-23 Wind speed prediction method based on error compensation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310156314.XA CN116341717A (en) 2023-02-23 2023-02-23 Wind speed prediction method based on error compensation

Publications (1)

Publication Number Publication Date
CN116341717A true CN116341717A (en) 2023-06-27

Family

ID=86878209

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310156314.XA Pending CN116341717A (en) 2023-02-23 2023-02-23 Wind speed prediction method based on error compensation

Country Status (1)

Country Link
CN (1) CN116341717A (en)

Similar Documents

Publication Publication Date Title
Kavousi-Fard Modeling uncertainty in tidal current forecast using prediction interval-based SVR
CN111008728A (en) Method for predicting short-term output of distributed photovoltaic power generation system
CN113107626B (en) Load prediction method of combined cycle generator set based on multivariable LSTM
CN111898828A (en) Hydroelectric power generation prediction method based on extreme learning machine
CN110555548B (en) ICS-ELM ultra-short-term wind power prediction method based on data mining original error correction
CN109599872B (en) Power system probability load flow calculation method based on stack noise reduction automatic encoder
CN110942205A (en) Short-term photovoltaic power generation power prediction method based on HIMVO-SVM
CN112149883A (en) Photovoltaic power prediction method based on FWA-BP neural network
CN111242353A (en) Wind power combined prediction modeling and prediction method
Omer et al. Comparison of decision tree based ensemble methods for prediction of photovoltaic maximum current
CN116632823A (en) Short-term photovoltaic power prediction method based on power conversion model and multi-layer perceptron
CN115995810A (en) Wind power prediction method and system considering weather fluctuation self-adaptive matching
Xu et al. Short-term wind speed prediction based on GRU
CN114298132A (en) Wind power prediction method and device and electronic equipment
CN113627674A (en) Distributed photovoltaic power station output prediction method and device and storage medium
Brusca et al. Wind farm power forecasting: New algorithms with simplified mathematical structure
Khatib et al. Assessment of three learning machines for long-term prediction of wind energy in Palestine
CN115640916B (en) Distributed power supply output and multi-energy load situation sensing method and system
CN113095534A (en) Wind power prediction method combining ARIMA and improved Elman neural network
Qiao et al. Short-term tidal current prediction based on GA-BP neural network
CN115907131A (en) Method and system for building electric heating load prediction model in northern area
CN116128211A (en) Wind-light-water combined short-term optimization scheduling method based on wind-light uncertainty prediction scene
CN116341717A (en) Wind speed prediction method based on error compensation
Terzi et al. New hybrid method proposal for wind speed prediction: a case study of Lüleburgaz
CN109165792B (en) Photovoltaic short-term output power prediction method based on SOA-WNN

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination