CN116231749A - New energy power system dispatching method based on digital twin - Google Patents

New energy power system dispatching method based on digital twin Download PDF

Info

Publication number
CN116231749A
CN116231749A CN202310062843.3A CN202310062843A CN116231749A CN 116231749 A CN116231749 A CN 116231749A CN 202310062843 A CN202310062843 A CN 202310062843A CN 116231749 A CN116231749 A CN 116231749A
Authority
CN
China
Prior art keywords
prediction
power
new energy
model
dispatching
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN202310062843.3A
Other languages
Chinese (zh)
Inventor
白易杰
张林音
谭勇
胡晓华
王璐
李�真
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Henan Electric Power Co Neixiang County Power Supply Co
Original Assignee
State Grid Henan Electric Power Co Neixiang County Power Supply Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Henan Electric Power Co Neixiang County Power Supply Co filed Critical State Grid Henan Electric Power Co Neixiang County Power Supply Co
Priority to CN202310062843.3A priority Critical patent/CN116231749A/en
Publication of CN116231749A publication Critical patent/CN116231749A/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H02GENERATION; CONVERSION OR DISTRIBUTION OF ELECTRIC POWER
    • H02JCIRCUIT ARRANGEMENTS OR SYSTEMS FOR SUPPLYING OR DISTRIBUTING ELECTRIC POWER; SYSTEMS FOR STORING ELECTRIC ENERGY
    • H02J3/00Circuit arrangements for ac mains or ac distribution networks
    • H02J3/38Arrangements for parallely feeding a single network by two or more generators, converters or transformers
    • H02J3/46Controlling of the sharing of output between the generators, converters, or transformers
    • H02J3/466Scheduling the operation of the generators, e.g. connecting or disconnecting generators to meet a given demand
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0631Resource planning, allocation, distributing or scheduling for enterprises or organisations
    • G06Q10/06312Adjustment or analysis of established resource schedule, e.g. resource or task levelling, or dynamic rescheduling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply
    • HELECTRICITY
    • H02GENERATION; CONVERSION OR DISTRIBUTION OF ELECTRIC POWER
    • H02JCIRCUIT ARRANGEMENTS OR SYSTEMS FOR SUPPLYING OR DISTRIBUTING ELECTRIC POWER; SYSTEMS FOR STORING ELECTRIC ENERGY
    • H02J3/00Circuit arrangements for ac mains or ac distribution networks
    • H02J3/004Generation forecast, e.g. methods or systems for forecasting future energy generation
    • HELECTRICITY
    • H02GENERATION; CONVERSION OR DISTRIBUTION OF ELECTRIC POWER
    • H02JCIRCUIT ARRANGEMENTS OR SYSTEMS FOR SUPPLYING OR DISTRIBUTING ELECTRIC POWER; SYSTEMS FOR STORING ELECTRIC ENERGY
    • H02J3/00Circuit arrangements for ac mains or ac distribution networks
    • H02J3/24Arrangements for preventing or reducing oscillations of power in networks
    • HELECTRICITY
    • H02GENERATION; CONVERSION OR DISTRIBUTION OF ELECTRIC POWER
    • H02JCIRCUIT ARRANGEMENTS OR SYSTEMS FOR SUPPLYING OR DISTRIBUTING ELECTRIC POWER; SYSTEMS FOR STORING ELECTRIC ENERGY
    • H02J3/00Circuit arrangements for ac mains or ac distribution networks
    • H02J3/38Arrangements for parallely feeding a single network by two or more generators, converters or transformers
    • H02J3/381Dispersed generators
    • HELECTRICITY
    • H02GENERATION; CONVERSION OR DISTRIBUTION OF ELECTRIC POWER
    • H02JCIRCUIT ARRANGEMENTS OR SYSTEMS FOR SUPPLYING OR DISTRIBUTING ELECTRIC POWER; SYSTEMS FOR STORING ELECTRIC ENERGY
    • H02J2203/00Indexing scheme relating to details of circuit arrangements for AC mains or AC distribution networks
    • H02J2203/10Power transmission or distribution systems management focussing at grid-level, e.g. load flow analysis, node profile computation, meshed network optimisation, active network management or spinning reserve management
    • HELECTRICITY
    • H02GENERATION; CONVERSION OR DISTRIBUTION OF ELECTRIC POWER
    • H02JCIRCUIT ARRANGEMENTS OR SYSTEMS FOR SUPPLYING OR DISTRIBUTING ELECTRIC POWER; SYSTEMS FOR STORING ELECTRIC ENERGY
    • H02J2203/00Indexing scheme relating to details of circuit arrangements for AC mains or AC distribution networks
    • H02J2203/20Simulating, e g planning, reliability check, modelling or computer assisted design [CAD]
    • HELECTRICITY
    • H02GENERATION; CONVERSION OR DISTRIBUTION OF ELECTRIC POWER
    • H02JCIRCUIT ARRANGEMENTS OR SYSTEMS FOR SUPPLYING OR DISTRIBUTING ELECTRIC POWER; SYSTEMS FOR STORING ELECTRIC ENERGY
    • H02J2300/00Systems for supplying or distributing electric power characterised by decentralized, dispersed, or local generation
    • H02J2300/20The dispersed energy generation being of renewable origin
    • H02J2300/22The renewable source being solar energy
    • H02J2300/24The renewable source being solar energy of photovoltaic origin
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y04INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
    • Y04SSYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
    • Y04S10/00Systems supporting electrical power generation, transmission or distribution
    • Y04S10/50Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Economics (AREA)
  • Power Engineering (AREA)
  • Theoretical Computer Science (AREA)
  • Strategic Management (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Software Systems (AREA)
  • Marketing (AREA)
  • Tourism & Hospitality (AREA)
  • Health & Medical Sciences (AREA)
  • Development Economics (AREA)
  • Quality & Reliability (AREA)
  • General Engineering & Computer Science (AREA)
  • Educational Administration (AREA)
  • Computing Systems (AREA)
  • Game Theory and Decision Science (AREA)
  • Medical Informatics (AREA)
  • Evolutionary Computation (AREA)
  • Operations Research (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Public Health (AREA)
  • Water Supply & Treatment (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Supply And Distribution Of Alternating Current (AREA)

Abstract

The invention belongs to the technical field of new energy power system dispatching, and particularly relates to a new energy power system dispatching method based on digital twinning; it comprises the following steps: constructing a short-term direct probability prediction model suitable for photovoltaic power; constructing an electric power digital twin system, and utilizing ubiquitous Internet of things and electric power data flow to assist decision making of power grid operation management regulation through real-time situation awareness and real-time virtual deduction; based on the new energy interval prediction result, a robust scheduling model is established, the automatic power generation control response of the unit is considered to cope with the new energy output fluctuation, and the power balance of the power grid is ensured; the invention has higher prediction area coverage rate and smaller average width ratio of the prediction interval, based on the existing real-time data information of each level at the dispatching side, the prediction model has real-time updating property, the applicability and generalization of the prediction model to the local power grid situation are improved, and the optimization and improvement of the existing dispatching system are realized based on the real-time prediction result of the new energy interval.

Description

New energy power system dispatching method based on digital twin
Technical Field
The invention belongs to the technical field of new energy power system dispatching, and particularly relates to a new energy power system dispatching method based on digital twinning.
Background
The new energy is connected into the power grid in large capacity and large scale, various comprehensive new energy power systems such as wind, light, hydrogen and the like are accelerating to form, the power grid dispatching faces new challenges, and the risk of stable operation is gradually increased.
In order to ensure safe and stable and economic operation of a large power grid, the power grid dispatching system is provided with a load prediction module, the load prediction module can make a pre-judgment before the system has large load fluctuation, and the power quality problems such as voltage sag and even interruption of the power grid are prevented by adjusting emergency control modes such as system output and load shedding, so that a solid foundation is laid for reasonable formulation of a dispatching plan.
However, unlike the flexibility and controllability of the conventional thermal power generating unit, the new energy has strong volatility and randomness, and the access of the large-scale new energy power station reduces the reliability of the scheduling while increasing the diversity of the system scheduling, how to reasonably arrange the scheduling plan on the basis of fully calling various energy forms? How do the grid operate safely and stably and economically? These become the scheduling problem of the new energy power system, and the addition of a new energy unit power prediction module is one of the keys for solving the scheduling problem.
The existing new energy prediction system is generally subjected to function customization according to power grid operation, is various in variety, and has the precision and the robustness to be improved; meanwhile, most of existing new energy prediction systems are offline models, historical data is adopted for model training, and real-time operation information such as real-time meteorological data, output adjustment strategies, sensor positions and channel information is not fully utilized, so that the model generalization capability is low, and prediction results with higher precision cannot be given according to the actual conditions of a local power grid. Based on the background, the existing dispatching platform is in an offline management mode for new energy, and power dispatching personnel cannot timely master the real-time running state of a new energy device and a system and cannot reasonably regulate and control according to current actual running output. Along with the deep digital transformation and refined operation requirements of the power grid, in order to further eliminate new energy, the control capability of regulating personnel on the operation safety risk of the new energy power grid is further improved, the operation state of the new energy power station is required to be predicted more accurately at the main station side, and the power grid dispatching plan is reasonably adjusted according to the current actual operation condition.
Disclosure of Invention
The invention aims to overcome the defects of the prior art, provides a new energy power system dispatching method based on digital twinning, which has higher prediction area coverage rate and smaller average width ratio of a prediction interval, enables a prediction model to have real-time updating property based on all levels of real-time data information existing on a dispatching side, improves the applicability and generalization of the prediction model to the situation of a local power grid, and realizes optimization and improvement of the existing dispatching system based on a new energy interval real-time prediction result.
The purpose of the invention is realized in the following way: the new energy power system dispatching method based on digital twin comprises the following steps:
s1, constructing a short-term direct probability prediction model suitable for photovoltaic power;
s2, constructing an electric power digital twin system, and utilizing ubiquitous Internet of things and electric power data flow to assist decision making of power grid operation management regulation through real-time situation awareness and real-time virtual deduction;
and step S3, based on a new energy interval prediction result, a robust scheduling model is established, and the automatic power generation control (AGC) response of the unit is considered to cope with the fluctuation of the new energy output, so that the power balance of the power grid is ensured.
The step S1 of constructing a short-term direct probability prediction model suitable for photovoltaic power comprises the following steps: preprocessing an initial data set, removing abnormal values of the data set based on the base detection, and extracting strongly-correlated weather variables by gray correlation analysis; improving the NGBoost meta-model by means of a generalized natural gradient calculation method; fusion is carried out by using a Blending architecture, so that the model learning effect is further enhanced;
model dataset D contains n D Samples, m features, i.e. d= { (x) i ,y i )}(x i ∈R m ,y i E R), where x i Characterizing the eigenvector of the ith sample, y i Representing the label value (true value) corresponding to the ith sample, i E (1, n) D )。
Preprocessing an initial data set, removing abnormal values of the data set based on drawing base detection, and extracting strongly-correlated weather variables by gray correlation analysis comprises the following steps:
1) Normalizing the time series of each variable, and taking the kth of the n meteorological variable series as a comparison series S k (t) the photovoltaic power sequence is the reference sequence S 0 (t) calculating the difference between the two as the absolute value sequence delta k (t), where k ε (1, n);
Δ k (t)=|S k (t)-S 0 (t)| (1)
2) Calculating the correlation coefficient eta k (t):
Figure SMS_1
Wherein: min (·) and Max (·) represent minimum and maximum values of the sequences; ρ is the resolution factor;
3) Solving the association degree gamma k
Figure SMS_2
Wherein: t (T) n Is the sequence length;
4) Setting a threshold value
Figure SMS_3
A variable with a degree of association greater than a threshold value is selected,a new model dataset is composed.
The improvement of the NGBoost metamodel by means of a generalized natural gradient calculation method comprises:
the solution process of the natural gradient is improved, and a relation is established between a general gradient and the natural gradient through Fisher information quantity, and the method is concretely as follows:
in y i Establishes a scoring function S (theta, y) based on the shannon information amount i ),
S(θ,y i )=-log P θ (y i ) (4)
Wherein: p (P) θ (y i ) Is y i Probability values in the predictive probability distribution; θ is a parameter vector of the predictive probability distribution;
taylor expansion is performed and the third order and the remainder are removed:
Figure SMS_4
wherein: d' is theta edge
Figure SMS_5
Moving an infinitely small step vector; />
Figure SMS_6
Representing a natural gradient;
converting European space into statistical manifold, processing formula (5) under Riemann space,
Figure SMS_7
wherein the calculation of the primary term can be simplified to:
Figure SMS_8
the remainder is denoted as:
Figure SMS_9
wherein:
Figure SMS_10
thereby achieving the calculation of natural gradients by general gradients:
Figure SMS_11
an improved NGBoost metamodel is built based on equation (10): taking theta DEG as an initial parameter vector, performing calculation until the mth iteration, and calculating y from a common gradient by a formula (10) i And corresponding parameter vector thereof
Figure SMS_12
Natural gradient +.>
Figure SMS_13
And generating a new set of basis learners along the natural gradient direction, thereby realizing parameter vector update, and the final prediction result can be expressed as a formula (11):
Figure SMS_14
wherein: alpha m Is a scale factor; beta is the unified learning rate; b (B) m For the unified representation of the base learners, the value of each sample point satisfies the Gaussian distribution, namely, theta= (mu, sigma), and the m training stage of theta correspondingly generates two base learners
Figure SMS_15
It is uniformly expressed as +.>
Figure SMS_16
/>
The Blending architecture is utilized to blend, and the further strengthening of the model learning effect comprises the following steps:
1) Raw dataset segmentation
Dividing an original training set into a sub training set DT and a test set DA according to a proportion, and defining an original prediction data set as DP;
2) Model fusion
Given the confidence level, building V NGBoost meta-models MO 1 、MO 2 、…、MO V Learning DT by using the metamodels, and outputting a predicted result DA_ P, DP _P of DA and DP on the metamodels after training is completed; wherein DA_ P, DP _P is an initial statistical parameter vector of the predicted value corresponding to DA and DP;
the predicted mean value determined by DA_P and the actual result DA_OUT corresponding to the original DA data form a new data set, and a new meta-model MO is established DA Training and obtaining predicted output MO DA P; wherein MO is DA P is the modified predicted statistical parameter vector, and MO is the same as DA_P DA P has higher accuracy and less sharpness performance, and shows the advantage of model fusion.
MO is prepared from DA P and DP P form new data set, and new meta-model MO is built P Training is performed so as to output a final prediction statistical parameter vector, the upper limit and the lower limit of a predicted value under a given confidence level can be calculated through the vector, and the points are connected into a predicted value upper limit curve and a predicted value lower limit curve.
Step S2 is to construct a power digital twin system, and the decision making of assisting the power grid operation management regulation comprises the following steps of: based on the existing data information of each level of the dispatching side, the prediction model is trained and updated in real time by combining a quasi-real-time dispatching log, an overhaul list, line conditions, unit power information and the like in an EMS (energy management system) of a dispatching master station.
Step S3 establishes a robust scheduling model based on the new energy interval prediction result, considers the automatic power generation control (AGC) response of the unit to cope with the new energy output fluctuation, and ensures the power balance of the power grid, wherein the step S3 comprises the following steps:
proposing the coverage rate I of the prediction area F Average width ratio of prediction interval I P As basic index, build the comprehensive score I C As final index, the above index is calculated specificallyThe method comprises the following steps:
1) Prediction area coverage
By introducing I F The accuracy of the probabilistic predictions is measured to quantify the reliability of the model, the index referencing the number of actual values falling within the confidence interval at a given confidence level, the larger the value, the more accurate the model,
Figure SMS_17
wherein: n (N) t To predict the number of samples; omega shape i For the ith sample, whether the ith sample falls into the marked value of the confidence interval or not, the format is Boolean constant, the sample falls into 1 and does not fall into 0;
2) Average width ratio of prediction interval
By introducing I P Measuring sharpness of probability prediction result, avoiding simple pursuit of I F The confidence interval is too wide, the predicted result loses the reference value, I P The larger the value is, the wider the confidence interval is, the higher the sharpness of the prediction distribution is, and the worse the prediction effect is;
Figure SMS_18
wherein: i P0 The confidence interval width under the initial parameters; u (U) i 、L i The upper limit and the lower limit of the confidence interval corresponding to the ith prediction sample;
3) Comprehensive score
Introduction of I C Pair I F I P Performing comprehensive evaluation, I C Higher values indicate better overall model performance, reduced sharpness while ensuring accuracy,
Figure SMS_19
the construction of the power digital twin system comprises the following steps:
collecting offline information: the related primary equipment parameters of the power grid comprise plant stations, generator sets, main transformers, buses, circuits, information of safety control devices and the like; collecting quasi-real-time information: scheduling instructions, maintenance orders, operation tickets and other information; collecting real-time information: real-time meteorological data, fault discrimination of a safety control device and related line power flow, unit power and related switch state information related to a unit to be predicted.
The invention has the beneficial effects that: the invention discloses a new energy power system dispatching method based on digital twin, which comprises the following steps of S1, constructing a short-term direct probability prediction model applicable to photovoltaic power; s2, constructing an electric power digital twin system, and utilizing ubiquitous Internet of things and electric power data flow to assist decision making of power grid operation management regulation through real-time situation awareness and real-time virtual deduction; step S3, a robust scheduling model is established based on a new energy interval prediction result, and automatic power generation control (AGC) response of a unit is considered to cope with new energy output fluctuation, so that power balance of a power grid is ensured; the new energy power system dispatching method based on digital twin has higher prediction area coverage rate and smaller average width ratio of the prediction interval, enables the prediction model to have real-time updating property based on the existing real-time data information of each level of the dispatching side, improves the applicability and generalization of the prediction model to the local power grid situation, and realizes the optimization and improvement of the existing dispatching system based on the real-time prediction result of the new energy interval.
Drawings
Fig. 1 is a schematic flow chart of a new energy power system dispatching method based on digital twin.
FIG. 2 is a schematic diagram of a Blending fusion step.
Detailed Description
The invention is further described below with reference to the accompanying drawings.
The new energy power system dispatching method based on digital twin is shown in the attached figure 1, and comprises the following steps:
s1, constructing a short-term direct probability prediction model suitable for photovoltaic power;
s2, constructing an electric power digital twin system, and utilizing ubiquitous Internet of things and electric power data flow to assist decision making of power grid operation management regulation through real-time situation awareness and real-time virtual deduction;
and step S3, based on a new energy interval prediction result, a robust scheduling model is established, and the automatic power generation control (AGC) response of the unit is considered to cope with the fluctuation of the new energy output, so that the power balance of the power grid is ensured.
Aiming at the application defect of the ensemble learning algorithm in the probability prediction problem, the university of Steady team of AndrewY.Ng leading rank proposes a natural gradient lifting (Natural Gradient Boosting, NGboost) model, and although the popularization and application of Boosting type algorithm are realized, the following defects still exist when the actual engineering problem of photovoltaic power short-term direct probability prediction is solved: 1) The model lacks a data preprocessing link, and the generalization capability and the robustness of the NGBoost model to different photovoltaic fields are weak; 2) The natural gradient has complex calculation principle and difficult practical engineering application; 3) It is difficult for a single NGBoost meta-model to guarantee the accuracy and sharpness of probability prediction.
For better effect, the step S1 of constructing a short-term direct probability prediction model applicable to photovoltaic power includes: preprocessing an initial data set, removing abnormal values of the data set based on the base detection, and extracting strongly-correlated weather variables by gray correlation analysis; improving the NGBoost meta-model by means of a generalized natural gradient calculation method; fusion is carried out by using a Blending architecture, so that the model learning effect is further enhanced;
model dataset D contains n D Samples, m features, i.e. d= { (x) i ,y i )}(x i ∈R m ,y i E R), where x i Characterizing the eigenvector of the ith sample, y i Representing the label value (true value) corresponding to the ith sample, i E (1, n) D )。
Considering actual engineering conditions such as measurement errors, the initial data set has more abnormal values, which can cause the overall deviation of the prediction model. Therefore, the present item first eliminates outliers in the initial data using a box graph proposed by the collectist john's graph base.
Photovoltaic power is related to meteorological variables such as humidity and cloud cover. However, the correlation between various meteorological variables and photovoltaic power varies from one photovoltaic field to another, subject to factors such as the geographic location of the photovoltaic field and the local microclimate of the location.
For better effect, the preprocessing the initial data set, eliminating the abnormal value of the data set based on the graph base detection, and extracting the strongly-correlated weather variable by using gray correlation analysis comprises the following steps:
1) Normalizing the time series of each variable, and taking the kth of the n meteorological variable series as a comparison series S k (t) the photovoltaic power sequence is the reference sequence S 0 (t) calculating the difference between the two as the absolute value sequence delta k (t), where k ε (1, n);
Δ k (t)=|S k (t)-S 0 (t)| (1)
2) Calculating the correlation coefficient eta k (t):
Figure SMS_20
Wherein: min (·) and Max (·) represent minimum and maximum values of the sequences; ρ is the resolution factor;
3) Solving the association degree gamma k
Figure SMS_21
Wherein: t (T) n Is the sequence length;
4) Setting a threshold value
Figure SMS_22
And selecting variables with the association degree larger than a threshold value to form a new model data set.
The key point of NGBoost is the solution of natural gradient, however, the related concept is taken from extremely complex information geometry, which brings inconvenience to popularization and application in actual engineering.
For better effect, the improvement of the NGBoost metamodel by means of a generalized natural gradient calculation method comprises:
the solution process of the natural gradient is improved, and a relation is established between a general gradient and the natural gradient through Fisher information quantity, and the method is concretely as follows:
in y i Establishes a scoring function S (theta, y) based on the shannon information amount i ),
S(θ,y i )=-log P θ (y i ) (4)
Wherein: p (P) θ (y i ) Is y i Probability values in the predictive probability distribution; θ is a parameter vector of the predictive probability distribution;
taylor expansion is performed and the third order and the remainder are removed:
Figure SMS_23
wherein: d' is theta edge
Figure SMS_24
Moving an infinitely small step vector; />
Figure SMS_25
Representing a natural gradient;
converting European space into statistical manifold, processing formula (5) under Riemann space,
Figure SMS_26
wherein the calculation of the primary term can be simplified to:
Figure SMS_27
the remainder is denoted as:
Figure SMS_28
wherein:
Figure SMS_29
thereby achieving the calculation of natural gradients by general gradients:
Figure SMS_30
an improved NGBoost metamodel is built based on equation (10): taking theta DEG as an initial parameter vector, performing calculation until the mth iteration, and calculating y from a common gradient by a formula (10) i And corresponding parameter vector thereof
Figure SMS_31
Natural gradient +.>
Figure SMS_32
And generating a new set of basis learners along the natural gradient direction, thereby realizing parameter vector update, and the final prediction result can be expressed as a formula (11):
Figure SMS_33
wherein: alpha m Is a scale factor; beta is the unified learning rate; b (B) m For the unified representation of the base learners, the value of each sample point satisfies the Gaussian distribution, namely, theta= (mu, sigma), and the m training stage of theta correspondingly generates two base learners
Figure SMS_34
It is uniformly expressed as +.>
Figure SMS_35
/>
The fusion of the meta-model can strengthen the learning effect, and can not cause excessive redundancy of the whole model, and is widely applied in solving the prediction problem in recent years, particularly the fusion of the Stacking model. However, stacking model fusion is too complex, and the problem of data crossing of training data referencing global statistics occurs in the training process, which is not suitable for solving the problem of probability prediction.
In order to achieve a better effect, the Blending architecture is utilized to blend, and the further strengthening of the model learning effect comprises the following steps:
1) Raw dataset segmentation
Dividing an original training set into a sub training set DT and a test set DA according to a proportion, and defining an original prediction data set as DP;
2) Model fusion
Given the confidence level, building V NGBoost meta-models MO 1 、MO 2 、…、MO V Learning DT by using the metamodels, and outputting a predicted result DA_ P, DP _P of DA and DP on the metamodels after training is completed; wherein DA_ P, DP _P is an initial statistical parameter vector of the predicted value corresponding to DA and DP;
the predicted mean value determined by DA_P and the actual result DA_OUT corresponding to the original DA data form a new data set, and a new meta-model MO is established DA Training and obtaining predicted output MO DA P; wherein MO is DA P is the modified predicted statistical parameter vector, and MO is the same as DA_P DA P has higher accuracy and less sharpness performance, and shows the advantage of model fusion.
MO is prepared from DA P and DP P form new data set, and new meta-model MO is built P Training is performed so as to output a final prediction statistical parameter vector, the upper limit and the lower limit of a predicted value under a given confidence level can be calculated through the vector, and the points are connected into a predicted value upper limit curve and a predicted value lower limit curve.
In order to achieve a better effect, the step S2 is to construct a digital twin system of electric power, and the decision making for assisting the regulation and control of the power grid operation and management by using the ubiquitous internet of things and the electric power data stream through real-time situation awareness and real-time virtual deduction comprises the following steps: based on the existing data information of each level of the dispatching side, the prediction model is trained and updated in real time by combining a quasi-real-time dispatching log, an overhaul list, line conditions, unit power information and the like in an EMS (energy management system) of a dispatching master station.
In order to achieve a better effect, the step S3 of establishing a robust scheduling model based on the new energy interval prediction result, considering automatic power generation control (AGC) response of the unit to cope with new energy output fluctuation, and ensuring power balance of the power grid includes:
proposing the coverage rate I of the prediction area F Average width ratio of prediction interval I P As basic index, build the comprehensive score I C As a final index, the specific calculation method of the above index is as follows:
1) Prediction area coverage
By introducing I F The accuracy of the probabilistic predictions is measured to quantify the reliability of the model, the index referencing the number of actual values falling within the confidence interval at a given confidence level, the larger the value, the more accurate the model,
Figure SMS_36
wherein: n (N) t To predict the number of samples; omega shape i For the ith sample, whether the ith sample falls into the marked value of the confidence interval or not, the format is Boolean constant, the sample falls into 1 and does not fall into 0;
2) Average width ratio of prediction interval
By introducing I P Measuring sharpness of probability prediction result, avoiding simple pursuit of I F The confidence interval is too wide, the predicted result loses the reference value, I P The larger the value is, the wider the confidence interval is, the higher the sharpness of the prediction distribution is, and the worse the prediction effect is;
Figure SMS_37
wherein: i P0 The confidence interval width under the initial parameters; u (U) i 、L i The upper limit and the lower limit of the confidence interval corresponding to the ith prediction sample;
3) Comprehensive score
Introduction of I C Pair I F I P Performing comprehensive evaluation, I C Higher values indicate better overall model performance, reduced sharpness while ensuring accuracy,
Figure SMS_38
for better effect, the construction of the electric power digital twin system comprises:
collecting offline information: the related primary equipment parameters of the power grid comprise plant stations, generator sets, main transformers, buses, circuits, information of safety control devices and the like; collecting quasi-real-time information: scheduling instructions, maintenance orders, operation tickets and other information; collecting real-time information: real-time meteorological data, fault discrimination of a safety control device and related line power flow, unit power and related switch state information related to a unit to be predicted.
In summary, the digital twin-based new energy power system scheduling method comprises the steps of S1, constructing a short-term direct probability prediction model applicable to photovoltaic power; s2, constructing an electric power digital twin system, and utilizing ubiquitous Internet of things and electric power data flow to assist decision making of power grid operation management regulation through real-time situation awareness and real-time virtual deduction; step S3, a robust scheduling model is established based on a new energy interval prediction result, and automatic power generation control (AGC) response of a unit is considered to cope with new energy output fluctuation, so that power balance of a power grid is ensured; the new energy power system dispatching method based on digital twin has higher prediction area coverage rate and smaller average width ratio of the prediction interval, enables the prediction model to have real-time updating property based on the existing real-time data information of each level of the dispatching side, improves the applicability and generalization of the prediction model to the local power grid situation, and realizes the optimization and improvement of the existing dispatching system based on the real-time prediction result of the new energy interval.

Claims (8)

1. The new energy power system dispatching method based on digital twin is characterized by comprising the following steps of:
s1, constructing a short-term direct probability prediction model suitable for photovoltaic power;
s2, constructing an electric power digital twin system, and utilizing ubiquitous Internet of things and electric power data flow to assist decision making of power grid operation management regulation through real-time situation awareness and real-time virtual deduction;
and step S3, based on a new energy interval prediction result, a robust scheduling model is established, and the automatic power generation control (AGC) response of the unit is considered to cope with the fluctuation of the new energy output, so that the power balance of the power grid is ensured.
2. The method for dispatching a new energy power system based on digital twin according to claim 1, wherein the step S1 of constructing a short-term direct probability prediction model applicable to photovoltaic power comprises: preprocessing an initial data set, removing abnormal values of the data set based on the base detection, and extracting strongly-correlated weather variables by gray correlation analysis; improving the NGBoost meta-model by means of a generalized natural gradient calculation method; fusion is carried out by using a Blending architecture, so that the model learning effect is further enhanced;
model dataset D contains n D Samples, m features, i.e. d= { (x) i ,y i )}(x i ∈R m ,y i E R), where x i Characterizing the eigenvector of the ith sample, y i Representing the label value (true value) corresponding to the ith sample, i E (1, n) D )。
3. The method for dispatching a new energy power system based on digital twinning according to claim 2, wherein the preprocessing the initial dataset, removing the outliers of the dataset based on the graph base detection, and extracting the strongly correlated meteorological variables by using gray correlation analysis comprises:
1) Normalizing the time series of each variable, and taking the kth of the n meteorological variable series as a comparison series S k (t) the photovoltaic power sequence is the reference sequence S 0 (t) calculating the difference between the two as the absolute value sequence delta k (t), where k ε (1, n);
Δ k (t)=|S k (t)-S 0 (t)| (1)
2) Calculating the correlation coefficient eta k (t):
Figure FDA0004061475630000021
Wherein: min (·) and Max (·) represent minimum and maximum values of the sequences; ρ is the resolution factor;
3) Solving the association degree gamma k
Figure FDA0004061475630000022
Wherein: t (T) n Is the sequence length;
4) Setting a threshold value
Figure FDA0004061475630000023
And selecting variables with the association degree larger than a threshold value to form a new model data set.
4. The method for dispatching a new energy power system based on digital twinning as claimed in claim 2, wherein said improving the NGBoost metamodel by means of generalized natural gradient calculation method comprises:
the solution process of the natural gradient is improved, and a relation is established between a general gradient and the natural gradient through Fisher information quantity, and the method is concretely as follows:
in y i Establishes a scoring function S (theta, y) based on the shannon information amount i ),
S(θ,y i )=-log P θ (y i ) (4)
Wherein: p (P) θ (y i ) Is y i Probability values in the predictive probability distribution; θ is a parameter vector of the predictive probability distribution;
taylor expansion is performed and the third order and the remainder are removed:
Figure FDA0004061475630000024
wherein: d' is theta edge
Figure FDA0004061475630000025
Moving an infinitely small step vector; />
Figure FDA0004061475630000026
Representing a natural gradient;
converting European space into statistical manifold, processing formula (5) under Riemann space,
Figure FDA0004061475630000031
wherein the calculation of the primary term can be simplified to:
Figure FDA0004061475630000032
the remainder is denoted as:
Figure FDA0004061475630000033
wherein:
Figure FDA0004061475630000034
thereby achieving the calculation of natural gradients by general gradients:
Figure FDA0004061475630000035
an improved NGBoost metamodel is built based on equation (10): taking theta DEG as an initial parameter vector, performing calculation until the mth iteration, and calculating y from a common gradient by a formula (10) i And corresponding parameter vector thereof
Figure FDA0004061475630000036
Natural gradient +.>
Figure FDA0004061475630000037
And generating a new set of basis learners along the natural gradient direction, thereby realizing parameter vector update, and the final prediction result can be expressed as a formula (11):
Figure FDA0004061475630000041
wherein: alpha m Is a scale factor; beta is the unified learning rate; b (B) m For the unified representation of the base learners, the value of each sample point satisfies the Gaussian distribution, namely, theta= (mu, sigma), and the m training stage of theta correspondingly generates two base learners
Figure FDA0004061475630000042
It is uniformly expressed as +.>
Figure FDA0004061475630000043
5. The method for dispatching the new energy power system based on digital twin according to claim 2, wherein the fusing is performed by using a Blending architecture, and further strengthening the model learning effect comprises:
1) Raw dataset segmentation
Dividing an original training set into a sub training set DT and a test set DA according to a proportion, and defining an original prediction data set as DP;
2) Model fusion
Given the confidence level, building V NGBoost meta-models MO 1 、MO 2 、…、MO V Learning DT by using the metamodels, and outputting a predicted result DA_ P, DP _P of DA and DP on the metamodels after training is completed; wherein DA_ P, DP _P is an initial statistical parameter vector of the predicted value corresponding to DA and DP;
the predicted mean value determined by DA_P and the actual result DA_OUT corresponding to the original DA data form a new data set, and a new meta-model MO is established DA Training and obtaining predicted output MO DA P; wherein MO is DA P is the modified predicted statistical parameter vector, and MO is the same as DA_P DA P has higher accuracy and less sharpness performance, and shows the advantage of model fusion.
MO is prepared from DA P and DP P form new data set, and new meta-model MO is built P Training is performed so as to output a final prediction statistical parameter vector, the upper limit and the lower limit of a predicted value under a given confidence level can be calculated through the vector, and the points are connected into a predicted value upper limit curve and a predicted value lower limit curve.
6. The method for dispatching the new energy power system based on digital twinning as claimed in claim 1, wherein the step S2 is to construct a power digital twinning system, and the decision making of assisting the grid operation management regulation by using ubiquitous internet of things, power data flow, real-time situation awareness and real-time virtual deduction comprises the following steps: based on the existing data information of each level of the dispatching side, the prediction model is trained and updated in real time by combining a quasi-real-time dispatching log, an overhaul list, line conditions, unit power information and the like in an EMS (energy management system) of a dispatching master station.
7. The method for dispatching the new energy power system based on digital twin according to claim 1, wherein the step S3 of establishing a robust dispatching model based on the new energy interval prediction result, considering automatic power generation control (AGC) response of the unit to cope with new energy output fluctuation, and ensuring power balance of the power grid comprises:
proposing the coverage rate I of the prediction area F Average width ratio of prediction interval I P As basic index, build the comprehensive score I C As a final index, the specific calculation method of the above index is as follows:
1) Prediction area coverage
By introducing I F The accuracy of the probabilistic predictions is measured to quantify the reliability of the model, the index referencing the number of actual values falling within the confidence interval at a given confidence level, the larger the value, the more accurate the model,
Figure FDA0004061475630000051
wherein: n (N) t To predict the number of samples; omega shape i For the ith sample, whether the ith sample falls into the marked value of the confidence interval or not, the format is Boolean constant, the sample falls into 1 and does not fall into 0;
2) Average width ratio of prediction interval
By introducing I P Measuring sharpness of probability prediction result, avoiding simple pursuit of I F The confidence interval is too wide, the predicted result loses the reference value, I P The larger the value is, the wider the confidence interval is, the higher the sharpness of the prediction distribution is, and the worse the prediction effect is;
Figure FDA0004061475630000052
wherein: i P0 The confidence interval width under the initial parameters; u (U) i 、L i The upper limit and the lower limit of the confidence interval corresponding to the ith prediction sample;
3) Comprehensive score
Introduction of I C Pair I F I P Performing comprehensive evaluation, I C Higher values indicate better overall model performance, reduced sharpness while ensuring accuracy,
Figure FDA0004061475630000061
8. the method for dispatching a new energy power system based on digital twinning according to claim 6, wherein the constructing the power digital twinning system comprises:
collecting offline information: the related primary equipment parameters of the power grid comprise plant stations, generator sets, main transformers, buses, circuits, information of safety control devices and the like; collecting quasi-real-time information: scheduling instructions, maintenance orders, operation tickets and other information; collecting real-time information: real-time meteorological data, fault discrimination of a safety control device and related line power flow, unit power and related switch state information related to a unit to be predicted.
CN202310062843.3A 2023-01-17 2023-01-17 New energy power system dispatching method based on digital twin Withdrawn CN116231749A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310062843.3A CN116231749A (en) 2023-01-17 2023-01-17 New energy power system dispatching method based on digital twin

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310062843.3A CN116231749A (en) 2023-01-17 2023-01-17 New energy power system dispatching method based on digital twin

Publications (1)

Publication Number Publication Date
CN116231749A true CN116231749A (en) 2023-06-06

Family

ID=86570830

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310062843.3A Withdrawn CN116231749A (en) 2023-01-17 2023-01-17 New energy power system dispatching method based on digital twin

Country Status (1)

Country Link
CN (1) CN116231749A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117150935A (en) * 2023-10-30 2023-12-01 中国电力科学研究院有限公司 Novel operation method, device, equipment and medium of digital twin system of power system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117150935A (en) * 2023-10-30 2023-12-01 中国电力科学研究院有限公司 Novel operation method, device, equipment and medium of digital twin system of power system
CN117150935B (en) * 2023-10-30 2023-12-29 中国电力科学研究院有限公司 Operation method, device, equipment and medium of digital twin system of power system

Similar Documents

Publication Publication Date Title
CN109711620B (en) Short-term power load prediction method based on GRU neural network and transfer learning
CN103683326B (en) A kind of regional power grid wind-powered electricity generation multiple spot accesses the computational methods of best receiving ability
CN110880789B (en) Economic dispatching method for wind power and photovoltaic combined power generation system
CN109523084A (en) A kind of ultrashort-term wind power prediction method based on pivot analysis and machine learning
CN102478584B (en) Wind power station wind speed prediction method based on wavelet analysis and system thereof
CN105389634A (en) Combined short-term wind power prediction system and method
CN111488896B (en) Distribution line time-varying fault probability calculation method based on multi-source data mining
CN103117546A (en) Ultrashort-term slide prediction method for wind power
CN110601250B (en) Method for evaluating stability of static voltage of access point based on new energy power station
CN106451418A (en) Online clustering equivalent modeling method of photovoltaic power plant
CN115965132A (en) Distributed photovoltaic digital twin system power prediction method based on GA-BP neural network
CN116231749A (en) New energy power system dispatching method based on digital twin
CN115860177A (en) Photovoltaic power generation power prediction method based on combined machine learning model and application thereof
CN116167531A (en) Photovoltaic power generation prediction method based on digital twin
Chen et al. Research on wind power prediction method based on convolutional neural network and genetic algorithm
CN114021848A (en) Generating capacity demand prediction method based on LSTM deep learning
CN113313306A (en) Elastic neural network load prediction method based on improved wolf optimization algorithm
CN108694479A (en) Consider the distribution network reliability prediction technique that weather influences time between overhaul
CN113991711B (en) Capacity configuration method for energy storage system of photovoltaic power station
CN108694475B (en) Short-time-scale photovoltaic cell power generation capacity prediction method based on hybrid model
CN111967660B (en) Ultra-short-term photovoltaic prediction residual error correction method based on SVR
CN116316611B (en) Power supply method and system based on low-voltage transformer area
CN111697560B (en) Method and system for predicting load of power system based on LSTM
CN116415708B (en) Power grid robust planning method considering confidence level
CN111756031B (en) Power grid operation trend estimation method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20230606

WW01 Invention patent application withdrawn after publication