CN112952847A - Multi-region active power distribution system peak regulation optimization method considering electricity demand elasticity - Google Patents

Multi-region active power distribution system peak regulation optimization method considering electricity demand elasticity Download PDF

Info

Publication number
CN112952847A
CN112952847A CN202110366783.5A CN202110366783A CN112952847A CN 112952847 A CN112952847 A CN 112952847A CN 202110366783 A CN202110366783 A CN 202110366783A CN 112952847 A CN112952847 A CN 112952847A
Authority
CN
China
Prior art keywords
load
region
decision time
power
area
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110366783.5A
Other languages
Chinese (zh)
Other versions
CN112952847B (en
Inventor
唐昊
曹永伦
王正风
吴旭
李智
吕凯
谭琦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hefei University of Technology
State Grid Anhui Electric Power Co Ltd
Original Assignee
Hefei University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hefei University of Technology filed Critical Hefei University of Technology
Priority to CN202110366783.5A priority Critical patent/CN112952847B/en
Publication of CN112952847A publication Critical patent/CN112952847A/en
Application granted granted Critical
Publication of CN112952847B publication Critical patent/CN112952847B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H02GENERATION; CONVERSION OR DISTRIBUTION OF ELECTRIC POWER
    • H02JCIRCUIT ARRANGEMENTS OR SYSTEMS FOR SUPPLYING OR DISTRIBUTING ELECTRIC POWER; SYSTEMS FOR STORING ELECTRIC ENERGY
    • H02J3/00Circuit arrangements for ac mains or ac distribution networks
    • H02J3/12Circuit arrangements for ac mains or ac distribution networks for adjusting voltage in ac networks by changing a characteristic of the network load
    • H02J3/14Circuit arrangements for ac mains or ac distribution networks for adjusting voltage in ac networks by changing a characteristic of the network load by switching loads on to, or off from, network, e.g. progressively balanced loading
    • H02J3/144Demand-response operation of the power transmission or distribution network
    • HELECTRICITY
    • H02GENERATION; CONVERSION OR DISTRIBUTION OF ELECTRIC POWER
    • H02JCIRCUIT ARRANGEMENTS OR SYSTEMS FOR SUPPLYING OR DISTRIBUTING ELECTRIC POWER; SYSTEMS FOR STORING ELECTRIC ENERGY
    • H02J3/00Circuit arrangements for ac mains or ac distribution networks
    • H02J3/007Arrangements for selectively connecting the load or loads to one or several among a plurality of power lines or power sources
    • H02J3/0075Arrangements for selectively connecting the load or loads to one or several among a plurality of power lines or power sources for providing alternative feeding paths between load and source according to economic or energy efficiency considerations, e.g. economic dispatch
    • HELECTRICITY
    • H02GENERATION; CONVERSION OR DISTRIBUTION OF ELECTRIC POWER
    • H02JCIRCUIT ARRANGEMENTS OR SYSTEMS FOR SUPPLYING OR DISTRIBUTING ELECTRIC POWER; SYSTEMS FOR STORING ELECTRIC ENERGY
    • H02J2203/00Indexing scheme relating to details of circuit arrangements for AC mains or AC distribution networks
    • H02J2203/20Simulating, e g planning, reliability check, modelling or computer assisted design [CAD]
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02BCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO BUILDINGS, e.g. HOUSING, HOUSE APPLIANCES OR RELATED END-USER APPLICATIONS
    • Y02B70/00Technologies for an efficient end-user side electric power management and consumption
    • Y02B70/30Systems integrating technologies related to power network operation and communication or information technologies for improving the carbon footprint of the management of residential or tertiary loads, i.e. smart grids as climate change mitigation technology in the buildings sector, including also the last stages of power distribution and the control, monitoring or operating management systems at local level
    • Y02B70/3225Demand response systems, e.g. load shedding, peak shaving
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P80/00Climate change mitigation technologies for sector-wide applications
    • Y02P80/10Efficient use of energy, e.g. using compressed air or pressurized fluid as energy carrier
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y04INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
    • Y04SSYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
    • Y04S20/00Management or operation of end-user stationary applications or the last stages of power distribution; Controlling, monitoring or operating thereof
    • Y04S20/20End-user application control systems
    • Y04S20/222Demand response systems, e.g. load shedding, peak shaving

Landscapes

  • Engineering & Computer Science (AREA)
  • Power Engineering (AREA)
  • Supply And Distribution Of Alternating Current (AREA)

Abstract

The invention discloses a peak regulation optimization method of a multi-region active power distribution system considering power demand elasticity, which comprises the steps of firstly establishing a mathematical model of a photovoltaic and all-vanadium redox flow battery energy storage system and a multi-region flexible load scheduling unit under a power elasticity environment; then, establishing a DTMDP model by considering the peak regulation optimization problem of the multi-region active power distribution system with the elasticity of power demand; and finally, solving the mathematical model by combining reinforcement learning and an intelligent algorithm to obtain a multi-region scheduling optimization control strategy meeting peak regulation requirements. The layered learning mechanism avoids the problem of dimension disaster of reinforcement learning to a certain extent, and promotes the rapid solution of the scheduling strategy; meanwhile, the combination of reinforcement learning and an intelligent algorithm further enhances the exploration capability of the algorithm, and is beneficial to solving the optimal peak regulation strategy; potential scheduling information of the active power distribution system can be further obtained by considering the elasticity of power demand, and smooth and safe operation of the system is promoted.

Description

Multi-region active power distribution system peak regulation optimization method considering electricity demand elasticity
Technical Field
The invention belongs to the field of dispatching optimization of a multi-region active power distribution system, and particularly relates to a dynamic dispatching optimization method of the multi-region active power distribution system, which takes the peak regulation demand and the power consumption demand elasticity of a power grid into consideration and aims at achieving stable and economic operation of the system.
Background
Currently, the research focus of the active power distribution system includes planning design, hierarchical control, operation management, and the like. The research in the planning aspect mainly develops around the aspects of the optimal configuration of the distributed power supply, the net rack planning and the like; the research of the hierarchical coordination control provides technical support for scheduling and managing various resources, and the overall optimization is achieved by managing the hierarchical distributed energy sources; and the aspects of multi-enclosure reactive power compensation, scheduling optimization and the like are researched in the aspect of operation management.
The traditional power grid peak regulation problem research mainly considers the starting, stopping and output regulation control of a generating side unit, and particularly the combined operation optimization of a multi-energy complementary system containing an energy storage device is proved to be capable of effectively relieving the peak regulation pressure of the system. With the development of an active power distribution system and the application of a flexible load scheduling technology, various types of high-quality peak shaving resources on a demand side are scheduled and optimized, so that economic peak shaving and energy utilization rate improvement are realized, an important trend of current power grid peak shaving research is formed, and the method is also an effective supplement for peak shaving on a power generation side.
The current research is more in the analysis and research of the whole load of the regional power grid, the research on the load characteristics of different industries is less, the influence caused by the difference and the proportional change of the loads of different industries is ignored to a certain extent, and the change rule of the power load is not easy to grasp more accurately. Meanwhile, the electric power of different types of electric users is flexibly depicted, which is not beneficial to guiding electric power users to select reasonable electric power utilization time and adjusting the electric power utilization of the users in the planning process of an electric power system so as to achieve better peak clipping and valley filling effects.
Disclosure of Invention
Aiming at the defects in the prior art, the invention provides a peak shaving optimization method of a multi-region active power distribution system considering the elasticity of power consumption requirements, so that a power consumer can be guided to select reasonable power consumption time and adjust the power consumption of the power consumer, and the effects of peak shaving and valley filling are achieved, so that the power consumption load rate is improved, and the safety and the stability of power grid operation are improved.
The invention adopts the following technical scheme for solving the technical problems:
the invention relates to a peak regulation optimization method of a multi-region active power distribution system considering power demand elasticity, which is characterized by comprising the following steps of:
step 1, constructing a multi-region active power distribution system, comprising: the system comprises a dispatching center, an industrial park dispatching area, a commercial park dispatching area and municipal and residential life park dispatching areas; recording any one of the industrial park dispatching area, the commercial park dispatching area and the municipal and residential life park dispatching areas as an area i;
the area i comprises: the system comprises an ith PV generating unit, an ith VRB energy storage unit and an ith load scheduling unit; the load type of the ith load scheduling unit comprises: the ith load-reducible load and the ith rigid load; wherein, the load type of the load scheduling unit in the industrial park scheduling area further comprises: the load can be transferred;
step 2, determining predicted values of power grid peak regulation requirements, photovoltaic output and various types of load requirements based on historical data of the area i at any decision time t in a dispatching day; wherein, the peak load regulation demand of the power grid is recorded as
Figure BDA0003007424460000021
Photovoltaic output is noted
Figure BDA0003007424460000022
Reducible load in various types of load demandsThe load demand is recorded as
Figure BDA0003007424460000023
And transferable load demand is noted
Figure BDA0003007424460000024
Step 3, establishing mathematical models of various load scheduling units, VRB energy storage units and PV power generation units in industrial, commercial, municipal and life parks, a region scheduling time attribute mathematical model and a peak shaving task allocation mechanism considering region elasticity amplitude:
step 3.1, establishing mathematical models of various types of loads of multiple areas under the elastic environment:
obtaining the minimum and maximum load reduction quantity constraints of the region i at the decision time t by using the formula (1):
Figure BDA0003007424460000025
in the formula (1), the reaction mixture is,
Figure BDA0003007424460000026
the maximum load amount of the reducible load of the area i at the decision time t is reduced;
Figure BDA0003007424460000027
actually reducing the load of the area i at the decision time t;
the transferable load force constraint is obtained by using the formula (2) to the formula (4):
Figure BDA0003007424460000028
Figure BDA0003007424460000029
Figure BDA00030074244600000210
in the formulae (2) to (4),
Figure BDA00030074244600000211
and
Figure BDA00030074244600000212
respectively determining the maximum allowable load increase and the maximum allowable load decrease of the transferable load in the area i at the decision time t;
Figure BDA00030074244600000213
the load increment of the transferable load corresponding to the area i at the decision time t;
Figure BDA00030074244600000214
the load reduction quantity of the area i at the decision time t;
defining the accumulated increase and decrease load quantity after action is taken at each decision time from the initial time to the decision time t as
Figure BDA00030074244600000215
Suppose a single scheduling day has tK-1At the moment of decision, then
Figure BDA00030074244600000216
Is the remaining period tK-1-t is a margin of resilience to transferable load;
obtaining margin constraints of the region i at the decision time t by using the equations (5) and (6)
Figure BDA00030074244600000217
And transfer direction constraints
Figure BDA00030074244600000218
Thereby obtaining transferable load constraint of the area i at the decision time t
Figure BDA00030074244600000219
Figure BDA00030074244600000220
Figure BDA0003007424460000031
In the formulae (5) and (6), aacctrIs the elastic margin coefficient; e is a natural constant; alpha is alphadirSelecting a direction coefficient for the transfer motion;
Figure BDA0003007424460000032
the load amount is increased or decreased by the accumulated transfer to the t-1 moment;
Figure BDA0003007424460000033
when the load is increased and decreased correspondingly after the action is taken for the transferable load at the current decision time t and the load is set to be increased,
Figure BDA0003007424460000034
the value of (d) is positive, and when the load amount is reduced,
Figure BDA0003007424460000035
the value of (d) is negative, when not active,
Figure BDA0003007424460000036
is 0;
3.2, establishing a mathematical model of the output of the VRB energy storage unit:
establishing the constraint condition of the VRB energy storage unit in one scheduling day by using an equation (7) to an equation (10), wherein the constraint condition comprises the following steps: terminal voltage constraint, charge-discharge power constraint, charge state constraint and initial and final charge state consistency constraint:
Figure BDA0003007424460000037
Figure BDA0003007424460000038
Figure BDA0003007424460000039
Figure BDA00030074244600000310
in the formulae (7) to (10),
Figure BDA00030074244600000311
the upper limit and the lower limit of the end voltage of the VRB energy storage unit of the area i,
Figure BDA00030074244600000312
the minimum and maximum charge-discharge power of the VRB energy storage unit in the area i at the decision time t,
Figure BDA00030074244600000313
the actual charging and discharging power of the VRB energy storage unit at decision time t for region i,
Figure BDA00030074244600000314
the remaining capacity of the VRB energy storage cells for region i constrains the upper and lower limits,
Figure BDA00030074244600000315
for the remaining capacity of the VRB energy storage unit at decision time t for region i, ts、teTo schedule the beginning and end of the day, CconSetting the expected value of the state of charge of the VRB energy storage unit;
3.3, establishing a mathematical model of photovoltaic power generation output:
obtaining a predicted value of the photovoltaic output power of the area i at the decision time t by using the formula (11)
Figure BDA00030074244600000316
Figure BDA00030074244600000317
In the formula (11), etapvThe photoelectric conversion efficiency; n ispvThe number of the photovoltaic cell panels; spvThe surface area of the photovoltaic cell panel receiving illumination is increased;
Figure BDA00030074244600000318
the solar radiation intensity of the area i at the decision time t; alpha is alphapvIn order to be the temperature conversion coefficient,
Figure BDA00030074244600000319
the outdoor temperature of the area i at the decision time t;
step 3.4, establishing a mathematical model of the regional scheduling time attribute:
obtaining the time attribute T of the area i at the decision time T by using the formula (12)i,t
Figure BDA0003007424460000041
In the formula (12), the reaction mixture is,
Figure BDA0003007424460000042
is the time magnitude parameter of the region i; c is a constant; p is a radical ofi,tThe power output at decision time t for region i,
Figure BDA0003007424460000043
maximum output power of the region i on the dispatching day;
step 3.5, establishing a peak regulation task allocation mechanism considering the elastic amplitude of the region:
setting the elastic amplitude of the area i at the decision time t as Ei,tElastic amplitude Ei,tUpper bound of (2) which can reduce the upper bound of the load
Figure BDA0003007424460000044
+ transferable increased load upper bound
Figure BDA0003007424460000045
+ an energy storage discharge margin; elastic amplitude Ei,tLower bound of (2) reducible load lower bound + transferable reduced lower bound
Figure BDA0003007424460000046
+ an energy storage charging margin; the elastic amplitude of the region i is formed by the span between the upper and lower bounds;
step 4, modeling of continuous variable discretization and uncertain random variable dynamic change processes:
step 4.1, establishing a multi-region power grid peak regulation demand uncertainty model:
at decision time t, issuing the power grid to the maximum range interval of the random uncertain part of the peak regulation demand instruction of the area i in real time
Figure BDA0003007424460000047
Is dispersed into
Figure BDA0003007424460000048
In total
Figure BDA0003007424460000049
A plurality of levels, wherein,
Figure BDA00030074244600000410
for the maximum value of the upward fluctuation based on the peak shaver demand predicted power for the region i at the decision time t,
Figure BDA00030074244600000411
the maximum value of downward fluctuation based on the peak load demand prediction power of the area i at the decision time t;
Figure BDA00030074244600000412
maximum discrete levels of upward and downward fluctuation amounts based on the predicted power of the peak shaving demand of the region i;
and (3) obtaining the actual peak regulation demand of the power grid of the area i at the decision time t by using the formula (13):
Figure BDA00030074244600000413
in the formula (13), the reaction mixture is,
Figure BDA00030074244600000414
predicting power for the peak shaver demand of the power grid in the area i at the decision time t,
Figure BDA00030074244600000415
the power level of the uncertain part of the peak shaving demand of the power grid in the area i at the decision time t,
Figure BDA00030074244600000416
the minimum unit power of the power grid peak regulation instruction uncertainty part discretization of the area i at the decision time t is obtained;
step 4.2, establishing a photovoltaic output uncertain model;
the maximum range interval of the photovoltaic output uncertain part of the area i at the decision time t
Figure BDA00030074244600000417
Is dispersed into
Figure BDA00030074244600000418
In total
Figure BDA00030074244600000419
A plurality of levels, wherein,
Figure BDA00030074244600000420
the maximum value of the upward fluctuation based on the photovoltaic output predicted power of the area i at the decision time t,
Figure BDA00030074244600000421
the maximum value of the downward fluctuation based on the photovoltaic output predicted power of the area i at the decision time t;
Figure BDA00030074244600000422
is illuminated by an area iMaximum discrete levels of upward and downward fluctuation amounts based on the predicted power of the volt-ampere output;
obtaining the actual photovoltaic output of the region i at the decision time t by using the formula (14)
Figure BDA00030074244600000423
Figure BDA0003007424460000051
In the formula (14), the compound represented by the formula (I),
Figure BDA0003007424460000052
the power is predicted for the photovoltaic output at decision time t for zone i,
Figure BDA0003007424460000053
the power level of the uncertain photovoltaic output part of the area i at the decision time t is determined;
Figure BDA0003007424460000054
the minimum unit power of the area i after the photovoltaic output uncertain part is dispersed at decision time t is obtained;
step 4.3, establishing a multi-region uncertain model of each type of load demand:
the maximum range interval of the random uncertain part of the reducible load and the transferable load of the area i at the decision time t
Figure BDA0003007424460000055
And
Figure BDA0003007424460000056
respectively dispersed into corresponding state grades
Figure BDA0003007424460000057
And
Figure BDA0003007424460000058
in total
Figure BDA0003007424460000059
And
Figure BDA00030074244600000510
a plurality of levels, wherein,
Figure BDA00030074244600000511
and
Figure BDA00030074244600000512
respectively the maximum value of the upward fluctuation based on the forecasted power of the reducible load and the transferable load demand of the area i at the decision time t,
Figure BDA00030074244600000513
and
Figure BDA00030074244600000514
maximum values of downward fluctuation based on demand predicted power of reducible load and transferable load in the area i at decision time t respectively;
Figure BDA00030074244600000515
and
Figure BDA00030074244600000516
maximum discrete levels of upward and downward fluctuation amounts based on predicted power required by reducible load and transferable load in the region i respectively;
obtaining the actual power demand of the region i at the decision time t, which can reduce the load, by using the equations (15) to (16)
Figure BDA00030074244600000517
And the actual power demand of the load can be transferred
Figure BDA00030074244600000518
Figure BDA00030074244600000519
Figure BDA00030074244600000520
In the formulae (15) to (16),
Figure BDA00030074244600000521
power is predicted for the demand that region i can shed loads and transferable loads at decision time t,
Figure BDA00030074244600000522
respectively the power levels of the uncertain parts of the reducible load and the transferable load demand power in the area i at the decision time t;
Figure BDA00030074244600000523
respectively the minimum unit power of the region i after the uncertain part of the reducible load and the transferable load demand power is dispersed under the decision time t;
step 5, establishing a corresponding DTMDP model according to the peak regulation optimization problem of the multi-region active power distribution system considering the elasticity of power consumption requirements:
step 5.1, system state space and action set of the DTMDP model:
dividing a scheduling day into K e {0, 1.,. K-1}, and K decision periods; the time length of each decision period is delta t, and the decision time of the kth decision period is tkThe end time of the scheduling day is tK-1
The formula (17) -formula (18) is used for obtaining the decision time t of the scheduling centerkState of
Figure BDA00030074244600000524
Figure BDA00030074244600000525
Figure BDA0003007424460000061
In the formulae (17) to (18),
Figure BDA0003007424460000062
as a decision time tkThe real-time peak regulation demand state grade of the lower power grid,
Figure BDA0003007424460000063
for region i at decision time tkEnvironmental information of the environment, photovoltaic output state class
Figure BDA0003007424460000064
VRB energy storage unit charging and discharging state grade
Figure BDA0003007424460000065
Multi-type load scheduling unit load demand state grade
Figure BDA0003007424460000066
Elastic margin state rating
Figure BDA0003007424460000067
And zone elastic amplitude state level
Figure BDA0003007424460000068
Composition is carried out; supThe state space of a dispatching center, N is the number of regions;
setting the multi-type load containing type as M, if the area i does not consider the load j of a certain type, corresponding to the state number
Figure BDA0003007424460000069
Is 0; the total number of states N is obtained by the formula (19)up,s
Figure BDA00030074244600000610
In the formula (19), NpeakThe maximum state grade of the real-time peak regulation demand of the power grid,
Figure BDA00030074244600000611
the photovoltaic output maximum state grade of the region i,
Figure BDA00030074244600000612
for the region iVRB energy storage cell charge-discharge maximum state level,
Figure BDA00030074244600000613
for the region i elasticity margin maximum state level,
Figure BDA00030074244600000614
for the state class with the largest elastic amplitude in the region i,
Figure BDA00030074244600000615
the maximum state grade of the load demand of the region i is set;
the dispatching center is arranged at decision time tkMaximum interval of lower random peak shaving demand power
Figure BDA00030074244600000616
The dispersion is 0 to Nap-1 to NapA plurality of levels, wherein,
Figure BDA00030074244600000617
for the scheduling centre at decision time tkLower total peak shaver power requirement, NapThe maximum discrete level is required for the total peak regulation of the dispatching center;
the peak-shaving task quantity distributed to the area i by the dispatching center is obtained by using the formula (20)
Figure BDA00030074244600000618
Figure BDA00030074244600000619
In the formula (20), the reaction mixture is,
Figure BDA00030074244600000620
for adjustingDegree center at decision time tkDescending the peak shaving task action allocated to the area i;
the peaker task action assignment constraint is established using equation (21):
Figure BDA00030074244600000621
in the formula (21), the compound represented by the formula,
Figure BDA00030074244600000622
Aia set of all possible peak shaver task motion vectors for the region i;
the formula (22) is used for obtaining the decision time t of the dispatching centerkAction vector of
Figure BDA00030074244600000623
Figure BDA00030074244600000624
In the formula (22), AupA set of all possible action vectors, namely an action set, for a scheduling center; the total number of actions of the dispatching center is Nup,a=Nap
Obtaining region i at decision time t using equation (23)kState of
Figure BDA0003007424460000071
Figure BDA0003007424460000072
In the formula (23), the compound represented by the formula,
Figure BDA0003007424460000073
is the state space of the region i;
the total number of states of the region i is obtained by equation (24)
Figure BDA0003007424460000074
Figure BDA0003007424460000075
Obtaining region i at decision time t using equation (25)kDownward movement
Figure BDA0003007424460000076
Figure BDA0003007424460000077
In the formula (25), the reaction mixture,
Figure BDA0003007424460000078
for region i at decision time tkThe lower VRB energy storage unit acts, and the three values are respectively a discharging action, an idle action and a charging action;
Figure BDA0003007424460000079
for region i at decision time tkThe lower load scheduling unit adjusting action comprises load reduction action capable of reducing load
Figure BDA00030074244600000710
Load shifting actions
Figure BDA00030074244600000711
Figure BDA00030074244600000712
For region i at decision time tkNext different actuation control actions;
Figure BDA00030074244600000713
is the set of all possible motion vectors in the region i, i.e. the motion set of the region i;
the total number of operations of the region i is obtained by the equation (26)
Figure BDA00030074244600000714
Figure BDA00030074244600000715
Step 5.2, defining the state transition process of the DTMDP model:
the state-of-charge transfer equation for the VRB energy storage unit is established using equation (27):
Figure BDA00030074244600000716
in the formula (27), N is the number of the single batteries connected in series by the electric pile, IdIn order to charge and discharge the current,
Figure BDA00030074244600000717
the total capacity of the VRB energy storage unit;
Figure BDA00030074244600000718
VRB energy storage unit of area i at current decision time tkThe state of charge of the battery,
Figure BDA00030074244600000719
taking charging and discharging actions for VRB energy storage unit
Figure BDA00030074244600000720
A later state of charge;
a reducible load state transition equation is established using equation (28):
Figure BDA00030074244600000721
in the formula (28), the reaction mixture is,
Figure BDA0003007424460000081
for region i at decision time tkTake a curtailment action
Figure BDA0003007424460000082
The latter can reduce the load demand situation,
Figure BDA0003007424460000083
for region i at decision time tkCan reduce the predicted power of the load demand,
Figure BDA0003007424460000084
decision time tkThe uncertain part of the load demand can be reduced,
Figure BDA0003007424460000085
the maximum discrete level of the load demand can be reduced;
the transferable load state transfer equation is established using equation (29):
Figure BDA0003007424460000086
in the formula (29), the reaction mixture,
Figure BDA0003007424460000087
for region i at decision time tkTake transfer action down
Figure BDA0003007424460000088
The latter transferable load demand situation,
Figure BDA0003007424460000089
at the end decision time t for region iK-1The next transfer action to be taken is,
Figure BDA00030074244600000810
for region i at decision time tkThe transferable load demand of (a) predicts the power,
Figure BDA00030074244600000811
as a decision time tkThe uncertain portion of the lower transferable load demand,
Figure BDA00030074244600000812
the maximum discrete level of transferable load demand;
step 5.3, establishing an objective function of the DTMDP model:
the upper layer cost in the decision period k is obtained by using the formula (30)
Figure BDA00030074244600000813
Figure BDA00030074244600000814
In the formula (30), ci,kGenerating a cost for the region i in the state transition process of the decision period k;
the starting and ending state of charge consistency constraint of the VRB energy storage unit is established by using the formula (31):
Figure BDA00030074244600000815
in the formula (31), the reaction mixture,
Figure BDA00030074244600000816
is the weight coefficient of the last state of the VRB energy storage unit,
Figure BDA00030074244600000817
and
Figure BDA00030074244600000818
respectively setting the actual capacity grade of the VRB energy storage unit at the last decision moment and the expected capacity grade;
step 5.4, establishing an optimization target of the DTMDP model:
obtaining scheduling center-in-strategy pi by using formula (32)upThe initial state is s0For a limited period of time to optimize the performance criterion
Figure BDA00030074244600000819
Figure BDA00030074244600000820
The upper layer optimization target is in a strategy set omegaupFind the optimal strategy
Figure BDA00030074244600000821
Obtaining region i in strategy pi using equation (33)dow,iThe initial state is s0For a limited period of time to optimize the performance criterion
Figure BDA0003007424460000091
Figure BDA0003007424460000092
The lower layer optimization target is in a strategy set omegadow,iFind the optimal strategy
Figure BDA0003007424460000093
Step 6, solving the DTMDP model established in the step 5 by adopting Q learning based on simulated annealing;
firstly, initializing parameters, learning parameters, upper and lower layer Q value tables, current learning step numbers and decision periods of a DTMDP model; then the upper and lower layers randomly select the corresponding action of the current state according to the strategy, generate the corresponding cost and update the Q value table; and repeatedly and iteratively updating the Q value table until the termination condition is met, and obtaining a scheduling strategy of each scheduling resource in each decision period meeting the peak regulation requirement of the scheduling center within one scheduling day.
Compared with the prior art, the invention has the beneficial effects that:
1. aiming at the scheduling optimization problem of a multi-region active power distribution system, the invention considers the peak regulation and the power demand elasticity of a power grid, utilizes various power elasticity resources to carry out cooperative optimization, and carries out strategy solution through a Q learning algorithm, thereby improving the load utilization rate and the peak regulation participation degree of each power consumption main body to a certain extent.
2. The invention adopts a layered learning algorithm, decomposes the active power distribution system into a plurality of different subsystems by copying boundary nodes, each subsystem is relatively independent, and performs overall coordination only by exchanging boundary node data, decomposes the original huge knowledge matrix into a plurality of smaller knowledge matrices, reduces the number of state action pairs, avoids dimension disaster to a certain extent, and is suitable for the problem of cooperative scheduling of a plurality of regions;
3. aiming at the scheduling problem of a multi-region active power distribution system, the simulated annealing algorithm is combined with Q learning, so that the exploration capability of the algorithm is further enhanced, the phenomenon of local optimization is avoided, and the most peak regulation strategy is more favorably obtained; meanwhile, the scheduling center peak shaving task allocation is carried out under the condition of considering the elastic amplitude of each region, so that the optimization time can be reduced to a certain extent; potential scheduling information of the active power distribution system can be further obtained by considering the elasticity of power demand, an intelligent solution is provided for scheduling of the active power distribution system, and stable and safe operation of the system is promoted.
Drawings
FIG. 1 is a schematic diagram of a multi-zone active power distribution system according to the present invention;
FIG. 2 is a flow chart of hierarchical coordinated scheduling as employed by the present invention;
fig. 3 is a flowchart of an algorithm for solving the problem of dynamic scheduling of a multi-zone active power distribution system according to the present invention.
Detailed Description
In this embodiment, a method for dynamically scheduling and optimizing a multi-region active power distribution system is applied to the multi-region active power distribution system shown in fig. 1, where the elastic power resources include: photovoltaic arrays, VRB energy storage devices and multi-type flexible loads in the region; in the process that the dispatching center distributes the peak shaving tasks to the areas, various electric power elastic resources and the day-ahead prediction of peak shaving requirements in each period are used as initial input, dispatching plan distribution is carried out under the condition that the elastic amplitude of each area is considered, and the dispatching plans are corrected in a rolling mode on a real-time scale.
Referring to fig. 2, the scheduling optimization method for the multi-zone active power distribution system is performed according to the following steps:
step 1, constructing a multi-region active power distribution system, comprising: the system comprises a dispatching center, an industrial park dispatching area, a commercial park dispatching area and municipal and residential life park dispatching areas; recording any one of an industrial park dispatching area, a commercial park dispatching area and municipal and residential life park dispatching areas as an area i;
the area i includes: the system comprises an ith PV generating unit, an ith VRB energy storage unit and an ith load scheduling unit; the load types of the ith load scheduling unit include: the ith load-reducible load and the ith rigid load; wherein, the load type of load scheduling unit in the industrial park scheduling region still includes: the load can be transferred;
step 2, determining predicted values of power grid peak regulation requirements, photovoltaic output and various types of load requirements based on historical data of the area i at any decision time t in a dispatching day; wherein, the peak load regulation demand of the power grid is recorded as
Figure BDA0003007424460000101
Photovoltaic output is noted
Figure BDA0003007424460000102
Reducible load demand among various types of load demands is noted
Figure BDA0003007424460000103
And transferable load demand is noted
Figure BDA0003007424460000104
Step 3, establishing mathematical models of various load scheduling units, VRB energy storage units and PV power generation units in industrial, commercial, municipal and life parks, a region scheduling time attribute mathematical model and a peak shaving task allocation mechanism considering region elasticity amplitude:
step 3.1, establishing mathematical models of various types of loads of multiple areas under the elastic environment:
obtaining the minimum and maximum load reduction quantity constraints of the region i at the decision time t by using the formula (1):
Figure BDA0003007424460000105
in the formula (1), the reaction mixture is,
Figure BDA0003007424460000106
the maximum load amount of the reducible load of the area i at the decision time t is reduced;
Figure BDA0003007424460000107
actually reducing the load of the area i at the decision time t;
compared with the power environment without considering the elasticity of the power demand, the variable excitation control is set, so that the load can be reduced to generate a certain degree of upward deviation on the basis of the originally declared reduction upper limit.
The transferable load is characterized in that a user only needs to change the service time of electric energy without reducing the self life requirement or interrupting the production task, and the total electricity consumption of the user in a period is ensured to be unchanged, so that the increase and the decrease of the transferable load can be balanced. Considering that users have certain uncontrollable basic loads in the scheduling process, namely, the load increasing and decreasing amount in each time interval has certain limitation, the output constraint of transferable loads is obtained by using the formulas (2) to (4):
Figure BDA0003007424460000111
Figure BDA0003007424460000112
Figure BDA0003007424460000113
in the formulae (2) to (4),
Figure BDA0003007424460000114
and
Figure BDA0003007424460000115
respectively determining the maximum allowable load increase and the maximum allowable load decrease of the transferable load in the area i at the decision time t;
Figure BDA0003007424460000116
the load increment of the transferable load corresponding to the area i at the decision time t;
Figure BDA0003007424460000117
the load reduction quantity of the area i at the decision time t;
defining the accumulated increase and decrease load quantity after action is taken at each decision time from the initial time to the decision time t as
Figure BDA0003007424460000118
Suppose a single scheduling day has tK-1At the moment of decision, then
Figure BDA0003007424460000119
Is the remaining period tK-1T is an elastic margin of the transferable load, wherein the elastic margin can represent the fluctuation change of the transferable load range in the remaining time period in real time so as to influence the scheduling decision of the transferable load;
obtaining margin constraints of the region i at the decision time t by using the equations (5) and (6)
Figure BDA00030074244600001110
And transfer direction constraints
Figure BDA00030074244600001111
Thereby obtaining transferable load constraint of the area i at the decision time t
Figure BDA00030074244600001112
Figure BDA00030074244600001113
Figure BDA00030074244600001114
In the formulae (5) and (6), aacctrIs the elastic margin coefficient; e is a natural constant; alpha is alphadirSelecting a direction coefficient for the transfer motion;
Figure BDA00030074244600001115
the load amount is increased or decreased by the accumulated transfer to the t-1 moment;
Figure BDA00030074244600001116
when the load is increased and decreased correspondingly after the action is taken for the transferable load at the current decision time t and the load is set to be increased,
Figure BDA00030074244600001117
the value of (d) is positive, and when the load amount is reduced,
Figure BDA00030074244600001118
the value of (d) is negative, when not active,
Figure BDA00030074244600001119
is 0;
3.2, establishing a mathematical model of the output of the VRB energy storage unit:
in order to fully reflect the dynamic change characteristics of terminal voltage, terminal current, energy storage state of charge (SOC) and the like in the charging and discharging process of the all-vanadium redox flow battery (VRB), the constraint conditions of the VRB energy storage unit in one scheduling day are established by using an equation (7) to an equation (10), and the constraint conditions comprise the following steps: terminal voltage constraint, charge-discharge power constraint, charge state constraint and initial and final charge state consistency constraint:
Figure BDA00030074244600001120
Figure BDA00030074244600001121
Figure BDA0003007424460000121
Figure BDA0003007424460000122
in the formulae (7) to (10),
Figure BDA0003007424460000123
the upper limit and the lower limit of the end voltage of the VRB energy storage unit of the area i,
Figure BDA0003007424460000124
the minimum and maximum charge-discharge power of the VRB energy storage unit in the area i at the decision time t,
Figure BDA0003007424460000125
the actual charging and discharging power of the VRB energy storage unit at decision time t for region i,
Figure BDA0003007424460000126
the remaining capacity of the VRB energy storage cells for region i constrains the upper and lower limits,
Figure BDA0003007424460000127
for the remaining capacity of the VRB energy storage unit at decision time t for region i, ts、teTo schedule the beginning and end of the day, CconSetting the expected value of the state of charge of the VRB energy storage unit;
in actual operation, the SOC of the energy storage device is controlled to be 0.2-0.8 so as to ensure that the energy storage device works in a safety area, and the charging and discharging efficiency of the battery is improved.
3.3, establishing a mathematical model of photovoltaic power generation output:
obtaining a predicted value of the photovoltaic output power of the area i at the decision time t by using the formula (11)
Figure BDA0003007424460000128
Figure BDA0003007424460000129
In the formula (11), etapvThe photoelectric conversion efficiency; n ispvThe number of the photovoltaic cell panels; spvThe surface area of the photovoltaic cell panel receiving illumination is m2
Figure BDA00030074244600001210
Solar radiation intensity in kW/m for area i at decision time t2;αpvThe order of magnitude is 10^ -3 in general for the temperature conversion coefficient,
Figure BDA00030074244600001211
outdoor temperature in units of area i at decision time t;
step 3.4, establishing a mathematical model of the regional scheduling time attribute:
when the cooperative scheduling of different areas is involved, the time dimension is introduced by considering the difference of different area characteristics. When a scheduling center issues peak shaving tasks and multi-elastic resource coordination adjustment, compared with the situation that longitudinal time difference is not considered, adjustment sequences or adjustment amount changes may exist in adjustment of different regional main bodies, and potential scheduling information of the active power distribution system is further mined. Considering that the scheduling time attribute is related to the self electricity utilization level and the electricity utilization characteristics of the region, the time attribute T of the region i at the decision time T is obtained by using an equation (12) and taking the absolute value of the difference between the electricity utilization load in each time period and the daily electricity utilization peak value of the region as an independent variable on the basis of the typical daily load curve of each regioni,t
Figure BDA00030074244600001212
In the formula (12), the reaction mixture is,
Figure BDA00030074244600001213
is the time magnitude parameter of the region i; c is a constant, the magnitude of the order is visible
Figure BDA00030074244600001214
And in turn, the temperature of the molten metal is controlled,
Figure BDA00030074244600001215
when the value is MW, c can be 10^ 2; p is a radical ofi,tThe power output at decision time t for region i,
Figure BDA00030074244600001216
maximum output power of the region i on the dispatching day;
considering that transferable loads have a time strong constraint condition, the scheduling time attribute is temporarily limited in reducible loads.
Step 3.5, establishing a peak regulation task allocation mechanism considering the elastic amplitude of the region:
setting the elastic amplitude of the area i at the decision time t as Ei,tElastic amplitude Ei,tUpper bound of (2) which can reduce the upper bound of the load
Figure BDA0003007424460000131
+ transferable increased load upper bound
Figure BDA0003007424460000132
+ an energy storage discharge margin; elastic amplitude Ei,tLower bound of (2) reducible load lower bound + transferable reduced lower bound
Figure BDA0003007424460000133
+ an energy storage charging margin; the elastic amplitude of the region i is formed by the span between the upper and lower bounds;
compared with random task allocation, the method has the advantages that the peak regulation task allocation decision made by considering the elastic amplitude state information of each region at the current moment is taken into consideration when the peak regulation task is allocated, the system electric power elastic resource information is further utilized, and the power utilization efficiency is improved to a certain extent. And then, on the basis, performing rolling correction on the scheduling plan, and establishing an in-day real-time scheduling model considering the multi-period response characteristic of the demand response resource.
Step 4, modeling of continuous variable discretization and uncertain random variable dynamic change processes:
step 4.1, establishing a multi-region power grid peak regulation demand uncertainty model:
at decision time t, issuing the power grid to the maximum range interval of the random uncertain part of the peak regulation demand instruction of the area i in real time
Figure BDA0003007424460000134
Is dispersed into
Figure BDA0003007424460000135
In total
Figure BDA0003007424460000136
A plurality of levels, wherein,
Figure BDA0003007424460000137
for the maximum value of the upward fluctuation based on the peak shaver demand predicted power for the region i at the decision time t,
Figure BDA0003007424460000138
the maximum value of downward fluctuation based on the peak load demand prediction power of the area i at the decision time t;
Figure BDA0003007424460000139
maximum discrete levels of upward and downward fluctuation amounts based on the predicted power of the peak shaving demand of the region i;
and (3) obtaining the actual peak regulation demand of the power grid of the area i at the decision time t by using the formula (13):
Figure BDA00030074244600001310
in the formula (13), the reaction mixture is,
Figure BDA00030074244600001311
make decisions for region iThe peak shaving demand of the power grid at time t predicts power,
Figure BDA00030074244600001312
the power level of the uncertain part of the peak shaving demand of the power grid in the area i at the decision time t,
Figure BDA00030074244600001313
the minimum unit power of the power grid peak regulation instruction uncertainty part discretization of the area i at the decision time t is obtained;
step 4.2, establishing a photovoltaic output uncertain model;
the maximum range interval of the photovoltaic output uncertain part of the area i at the decision time t
Figure BDA00030074244600001314
Is dispersed into
Figure BDA00030074244600001315
In total
Figure BDA00030074244600001316
A plurality of levels, wherein,
Figure BDA00030074244600001317
the maximum value of the upward fluctuation based on the photovoltaic output predicted power of the area i at the decision time t,
Figure BDA00030074244600001318
the maximum value of the downward fluctuation based on the photovoltaic output predicted power of the area i at the decision time t;
Figure BDA00030074244600001319
maximum discrete levels of upward and downward fluctuation amounts based on the photovoltaic output predicted power of the region i;
obtaining the actual photovoltaic output of the region i at the decision time t by using the formula (14)
Figure BDA0003007424460000141
Figure BDA0003007424460000142
In the formula (14), the compound represented by the formula (I),
Figure BDA0003007424460000143
the power is predicted for the photovoltaic output at decision time t for zone i,
Figure BDA0003007424460000144
the power level of the uncertain photovoltaic output part of the area i at the decision time t is determined;
Figure BDA0003007424460000145
the minimum unit power of the area i after the photovoltaic output uncertain part is dispersed at decision time t is obtained;
step 4.3, establishing a multi-region uncertain model of each type of load demand:
the maximum range interval of the random uncertain part of the reducible load and the transferable load of the area i at the decision time t
Figure BDA0003007424460000146
And
Figure BDA0003007424460000147
respectively dispersed into corresponding state grades
Figure BDA0003007424460000148
And
Figure BDA0003007424460000149
in total
Figure BDA00030074244600001410
And
Figure BDA00030074244600001411
a plurality of levels, wherein,
Figure BDA00030074244600001412
and
Figure BDA00030074244600001413
respectively the maximum value of the upward fluctuation based on the forecasted power of the reducible load and the transferable load demand of the area i at the decision time t,
Figure BDA00030074244600001414
and
Figure BDA00030074244600001415
maximum values of downward fluctuation based on demand predicted power of reducible load and transferable load in the area i at decision time t respectively;
Figure BDA00030074244600001416
and
Figure BDA00030074244600001417
maximum discrete levels of upward and downward fluctuation amounts based on predicted power required by reducible load and transferable load in the region i respectively;
obtaining the actual power demand of the region i at the decision time t, which can reduce the load, by using the equations (15) to (16)
Figure BDA00030074244600001418
And the actual power demand of the load can be transferred
Figure BDA00030074244600001419
Figure BDA00030074244600001420
Figure BDA00030074244600001421
In the formulae (15) to (16),
Figure BDA00030074244600001422
power is predicted for the demand that region i can shed loads and transferable loads at decision time t,
Figure BDA00030074244600001423
respectively the power levels of the uncertain parts of the reducible load and the transferable load demand power in the area i at the decision time t;
Figure BDA00030074244600001424
respectively the minimum unit power of the region i after the uncertain part of the reducible load and the transferable load demand power is dispersed under the decision time t;
the process of the variable uncertain part changing along with the time is approximately described by a first-order Markov process, and the transition probability at each moment follows discrete Gaussian distribution taking the state of the variable uncertain part as the center.
Step 5, establishing a corresponding DTMDP model according to the peak regulation optimization problem of the multi-region active power distribution system considering the elasticity of power consumption requirements:
step 5.1, system state space and action set of the DTMDP model:
dividing a scheduling day into K e {0, 1.,. K-1}, and K decision periods; the time length of each decision period is delta t, and the decision time of the kth decision period is tkThe end time of the scheduling day is tK-1
The formula (17) -formula (18) is used for obtaining the decision time t of the scheduling centerkState of
Figure BDA0003007424460000151
Figure BDA0003007424460000152
Figure BDA0003007424460000153
In the formulae (17) to (18),
Figure BDA0003007424460000154
as a decision time tkThe real-time peak regulation demand state grade of the lower power grid,
Figure BDA0003007424460000155
for region i at decision time tkEnvironmental information of the environment, photovoltaic output state class
Figure BDA0003007424460000156
VRB energy storage unit charging and discharging state grade
Figure BDA0003007424460000157
Multi-type load scheduling unit load demand state grade
Figure BDA0003007424460000158
Elastic margin state rating
Figure BDA0003007424460000159
And zone elastic amplitude state level
Figure BDA00030074244600001510
Composition is carried out; supThe state space of a dispatching center, N is the number of regions;
setting the multi-type load containing type as M, if the area i does not consider the load j of a certain type, corresponding to the state number
Figure BDA00030074244600001511
Is 0; the total number of states N is obtained by the formula (19)up,s
Figure BDA00030074244600001512
In the formula (19), NpeakThe maximum state grade of the real-time peak regulation demand of the power grid,
Figure BDA00030074244600001513
is a regionThe maximum state class of photovoltaic output of the field i,
Figure BDA00030074244600001514
for the region iVRB energy storage cell charge-discharge maximum state level,
Figure BDA00030074244600001515
for the region i elasticity margin maximum state level,
Figure BDA00030074244600001516
for the state class with the largest elastic amplitude in the region i,
Figure BDA00030074244600001517
the maximum state grade of the load demand of the region i is set;
the dispatching center is arranged at decision time tkMaximum interval of lower random peak shaving demand power
Figure BDA00030074244600001518
The dispersion is 0 to Nap-1 to NapA plurality of levels, wherein,
Figure BDA00030074244600001519
for the scheduling centre at decision time tkLower total peak shaver power requirement, NapThe maximum discrete level is required for the total peak regulation of the dispatching center;
the peak-shaving task quantity distributed to the area i by the dispatching center is obtained by using the formula (20)
Figure BDA00030074244600001520
Figure BDA00030074244600001521
In the formula (20), the reaction mixture is,
Figure BDA00030074244600001522
for the scheduling centre at decision time tkDescending the peak shaving task action allocated to the area i;
the peaker task action assignment constraint is established using equation (21):
Figure BDA00030074244600001523
in the formula (21), the compound represented by the formula,
Figure BDA00030074244600001524
Aia set of all possible peak shaver task motion vectors for the region i;
the formula (22) is used for obtaining the decision time t of the dispatching centerkAction vector of
Figure BDA0003007424460000161
Figure BDA0003007424460000162
In the formula (22), AupA set of all possible action vectors, namely an action set, for a scheduling center; the total number of actions of the dispatching center is Nup,a=Nap
Obtaining region i at decision time t using equation (23)kState of
Figure BDA0003007424460000163
Figure BDA0003007424460000164
In the formula (23), the compound represented by the formula,
Figure BDA0003007424460000165
is the state space of the region i;
the total number of states of the region i is obtained by equation (24)
Figure BDA0003007424460000166
Figure BDA0003007424460000167
Obtaining region i at decision time t using equation (25)kDownward movement
Figure BDA0003007424460000168
Figure BDA0003007424460000169
In the formula (25), the reaction mixture,
Figure BDA00030074244600001610
for region i at decision time tkThe lower VRB energy storage unit acts, and the three values are respectively a discharging action, an idle action and a charging action;
Figure BDA00030074244600001611
for region i at decision time tkThe lower load scheduling unit adjusting action comprises load reduction action capable of reducing load
Figure BDA00030074244600001612
Load shifting actions
Figure BDA00030074244600001613
Figure BDA00030074244600001614
For region i at decision time tkNext different actuation control actions;
Figure BDA00030074244600001615
is the set of all possible motion vectors in the region i, i.e. the motion set of the region i;
the total number of operations of the region i is obtained by the equation (26)
Figure BDA00030074244600001616
Figure BDA00030074244600001617
Step 5.2, defining the state transition process of the DTMDP model:
the state-of-charge transfer equation for the VRB energy storage unit is established using equation (27):
Figure BDA00030074244600001618
in the formula (27), N is the number of the single batteries connected in series by the electric pile, IdIn order to charge and discharge the current,
Figure BDA00030074244600001619
the total capacity of the VRB energy storage unit;
Figure BDA00030074244600001620
VRB energy storage unit of area i at current decision time tkThe state of charge of the battery,
Figure BDA00030074244600001621
taking charging and discharging actions for VRB energy storage unit
Figure BDA00030074244600001622
A later state of charge;
a reducible load state transition equation is established using equation (28):
Figure BDA0003007424460000171
in the formula (28), the reaction mixture is,
Figure BDA0003007424460000172
for region i at decision time tkTake a curtailment action
Figure BDA0003007424460000173
Later can reduce the load demandIn the case of a situation in which,
Figure BDA0003007424460000174
for region i at decision time tkCan reduce the predicted power of the load demand,
Figure BDA0003007424460000175
decision time tkThe uncertain part of the load demand can be reduced,
Figure BDA0003007424460000176
the maximum discrete level of the load demand can be reduced;
considering that the transferable load needs to meet the constraint that the total amount is unchanged before and after the transfer, a state transfer equation of the transferable load is established by using an equation (29):
Figure BDA0003007424460000177
in the formula (29), the reaction mixture,
Figure BDA0003007424460000178
for region i at decision time tkTake transfer action down
Figure BDA0003007424460000179
The latter transferable load demand situation,
Figure BDA00030074244600001710
at the end decision time t for region iK-1The next transfer action to be taken is,
Figure BDA00030074244600001711
for region i at decision time tkThe transferable load demand of (a) predicts the power,
Figure BDA00030074244600001712
as a decision time tkThe uncertain portion of the lower transferable load demand,
Figure BDA00030074244600001713
the maximum discrete level of transferable load demand; the total amount before and after the transfer is not changed and restrained by the last moment adjusting action:
Figure BDA00030074244600001714
in the case of positive numbers, 0 and negative numbers,
Figure BDA00030074244600001715
the values of (a) correspond to-1, 0 and 1, respectively;
step 5.3, establishing an objective function of the DTMDP model:
the upper layer cost in the decision period k is obtained by using the formula (30)
Figure BDA00030074244600001716
Upper layer cost
Figure BDA00030074244600001717
Returning a cost c for each step of operation of each region of the lower layeri,kSuperposition of (2):
Figure BDA00030074244600001718
in the formula (30), ci,kGenerating a cost for the region i in the state transition process of the decision period k;
the active power distribution system scheduling is considered to be periodic, and the initial and final charge states of the energy storage device also meet the consistent constraint. The starting and ending state of charge consistency constraint of the VRB energy storage unit is established by using the formula (31):
Figure BDA00030074244600001719
in the formula (31), the reaction mixture,
Figure BDA00030074244600001720
is the weight coefficient of the last state of the VRB energy storage unit,
Figure BDA00030074244600001721
and
Figure BDA00030074244600001722
respectively setting the actual capacity grade of the VRB energy storage unit at the last decision moment and the expected capacity grade;
step 5.4, establishing an optimization target of the DTMDP model:
obtaining scheduling center-in-strategy pi by using formula (32)upThe initial state is s0For a limited period of time to optimize the performance criterion
Figure BDA0003007424460000181
Figure BDA0003007424460000182
The upper layer optimization target is in a strategy set omegaupFind the optimal strategy
Figure BDA0003007424460000183
Obtaining region i in strategy pi using equation (33)dow,iThe initial state is s0For a limited period of time to optimize the performance criterion
Figure BDA00030074244600001821
Figure BDA0003007424460000184
The lower layer optimization target is in a strategy set omegadow,iFind the optimal strategy
Figure BDA0003007424460000185
And 6, referring to fig. 3, solving the established DTMDP model by adopting simulated annealing-based Q learning according to the following steps:
and 6.1, initializing system model parameters. Comprising a single sample blockThe strategy cycle number K, the maximum level N of the task allocation of the dispatching centerpeakMaximum photovoltaic output rating in region i
Figure BDA0003007424460000186
Maximum grade of output force of energy storage device
Figure BDA0003007424460000187
Multi-type flexible load power adjustment maximum grade
Figure BDA0003007424460000188
Time of use electricity price
Figure BDA0003007424460000189
Coefficient of rebound load beta1、β2、β3Operation weight coefficient gammavrb、λpvEtc.;
and 6.2, initializing system learning parameters. Including the total number of sample tracks M, the learning rate of the dispatching center alphaupDiscount factor gammaupAnd learning rate update coefficient etaup(ii) a Learning rate of region i
Figure BDA00030074244600001810
Discount factor
Figure BDA00030074244600001811
And learning rate update coefficient
Figure BDA00030074244600001812
Simulated annealing temperature TtempAnd simulated annealing coefficient etatemp
Step 6.3, initializing Q value table Q of dispatching centerupAnd Q value table of area i
Figure BDA00030074244600001813
Scheduling center and state data of each region to determine the state of the scheduling center
Figure BDA00030074244600001814
And initializing current learningThe step number m is equal to 0, and the current decision period k is equal to 0;
step 6.4 according to QupAnd greedy policy greedypolicyupSelecting the current state
Figure BDA00030074244600001815
Greedy action for down-per-region peak shaving task allocation
Figure BDA00030074244600001816
Simultaneously randomly selecting valid actions
Figure BDA00030074244600001817
If it is
Figure BDA00030074244600001818
The current scheduling center action is
Figure BDA00030074244600001819
Otherwise
Figure BDA00030074244600001820
Assigning peaking tasks to actions
Figure BDA0003007424460000191
Transmitting the status to each region, and observing the status of each region
Figure BDA0003007424460000192
Step 6.5, according to region i
Figure BDA0003007424460000193
And greedy strategy
Figure BDA0003007424460000194
Selecting a current state
Figure BDA0003007424460000195
Greedy action for lower corresponding region i
Figure BDA0003007424460000196
Simultaneously randomly selecting valid actions
Figure BDA0003007424460000197
If it is
Figure BDA0003007424460000198
The action of the current area i is
Figure BDA0003007424460000199
Otherwise
Figure BDA00030074244600001910
Observing the next period state of the dispatching center
Figure BDA00030074244600001911
And (3) counting the cost in the process and feeding the cost back to the dispatching center, and updating the Q value table of each area by using a formula (34):
Figure BDA00030074244600001912
step 6.6, update Q with equation (35)upIf K is less than K, skipping to step 6.4; otherwise, jumping to step 6.7:
Figure BDA00030074244600001913
step 6.7, executing the action selected by the current dispatching center and each area i
Figure BDA00030074244600001914
Calculating the cost generated in the process of executing action state transition in the decision period K
Figure BDA00030074244600001915
Updating the Q value table Q of the scheduling center and each area i by using the formula (36)up,
Figure BDA00030074244600001916
And (2) enabling m to be m +1:
Figure BDA00030074244600001917
step 6.8, if M is less than M, updating the learning rate alphaup:=ηupαup
Figure BDA00030074244600001918
And updates the temperature Ttemp:=ηtempTtempReturning to the step 6.4; otherwise, finishing the learning optimization method to obtain a scheduling strategy of each scheduling resource in each decision period meeting the peak regulation requirement of the scheduling center within one scheduling day.
In conclusion, the invention can effectively deal with the randomness of each elastic resource in a multi-region active power distribution system and ensure the stable and safe operation of the system.

Claims (1)

1. A peak regulation optimization method of a multi-region active power distribution system considering power demand elasticity is characterized by comprising the following steps:
step 1, constructing a multi-region active power distribution system, comprising: the system comprises a dispatching center, an industrial park dispatching area, a commercial park dispatching area and municipal and residential life park dispatching areas; recording any one of the industrial park dispatching area, the commercial park dispatching area and the municipal and residential life park dispatching areas as an area i;
the area i comprises: the system comprises an ith PV generating unit, an ith VRB energy storage unit and an ith load scheduling unit; the load type of the ith load scheduling unit comprises: the ith load-reducible load and the ith rigid load; wherein, the load type of the load scheduling unit in the industrial park scheduling area further comprises: the load can be transferred;
step 2, determining predicted values of power grid peak regulation requirements, photovoltaic output and various types of load requirements based on historical data of the area i at any decision time t in a dispatching day; wherein, the peak load regulation demand of the power grid is recorded as
Figure FDA0003007424450000011
Photovoltaic output is noted
Figure FDA0003007424450000012
Reducible load demand among various types of load demands is noted
Figure FDA0003007424450000013
And transferable load demand is noted
Figure FDA0003007424450000014
Step 3, establishing mathematical models of various load scheduling units, VRB energy storage units and PV power generation units in industrial, commercial, municipal and life parks, a region scheduling time attribute mathematical model and a peak shaving task allocation mechanism considering region elasticity amplitude:
step 3.1, establishing mathematical models of various types of loads of multiple areas under the elastic environment:
obtaining the minimum and maximum load reduction quantity constraints of the region i at the decision time t by using the formula (1):
Figure FDA0003007424450000015
in the formula (1), the reaction mixture is,
Figure FDA0003007424450000016
the maximum load amount of the reducible load of the area i at the decision time t is reduced;
Figure FDA0003007424450000017
actually reducing the load of the area i at the decision time t;
the transferable load force constraint is obtained by using the formula (2) to the formula (4):
Figure FDA0003007424450000018
Figure FDA0003007424450000019
Figure FDA00030074244500000110
in the formulae (2) to (4),
Figure FDA00030074244500000111
and
Figure FDA00030074244500000112
respectively determining the maximum allowable load increase and the maximum allowable load decrease of the transferable load in the area i at the decision time t;
Figure FDA00030074244500000113
the load increment of the transferable load corresponding to the area i at the decision time t;
Figure FDA00030074244500000114
the load reduction quantity of the area i at the decision time t;
defining the accumulated increase and decrease load quantity after action is taken at each decision time from the initial time to the decision time t as
Figure FDA0003007424450000021
Suppose a single scheduling day has tK-1At the moment of decision, then
Figure FDA0003007424450000022
Is the remaining period tK-1-t is a margin of resilience to transferable load;
obtaining margin constraints of the region i at the decision time t by using the equations (5) and (6)
Figure FDA0003007424450000023
And transfer direction constraints
Figure FDA0003007424450000024
Thereby obtaining transferable load constraint of the area i at the decision time t
Figure FDA0003007424450000025
Figure FDA0003007424450000026
Figure FDA0003007424450000027
In the formulae (5) and (6), aacctrIs the elastic margin coefficient; e is a natural constant; alpha is alphadirSelecting a direction coefficient for the transfer motion;
Figure FDA0003007424450000028
the load amount is increased or decreased by the accumulated transfer to the t-1 moment;
Figure FDA0003007424450000029
when the load is increased and decreased correspondingly after the action is taken for the transferable load at the current decision time t and the load is set to be increased,
Figure FDA00030074244500000210
the value of (d) is positive, and when the load amount is reduced,
Figure FDA00030074244500000211
the value of (d) is negative, when not active,
Figure FDA00030074244500000212
is 0;
3.2, establishing a mathematical model of the output of the VRB energy storage unit:
establishing the constraint condition of the VRB energy storage unit in one scheduling day by using an equation (7) to an equation (10), wherein the constraint condition comprises the following steps: terminal voltage constraint, charge-discharge power constraint, charge state constraint and initial and final charge state consistency constraint:
Figure FDA00030074244500000213
Figure FDA00030074244500000214
Figure FDA00030074244500000215
Figure FDA00030074244500000216
in the formulae (7) to (10),
Figure FDA00030074244500000217
the upper limit and the lower limit of the end voltage of the VRB energy storage unit of the area i,
Figure FDA00030074244500000218
the minimum and maximum charge-discharge power of the VRB energy storage unit in the area i at the decision time t,
Figure FDA00030074244500000219
the actual charging and discharging power of the VRB energy storage unit at decision time t for region i,
Figure FDA00030074244500000220
the remaining capacity of the VRB energy storage cells for region i constrains the upper and lower limits,
Figure FDA00030074244500000221
for the remaining capacity of the VRB energy storage unit at decision time t for region i, ts、teTo schedule the beginning and end of the day, CconSetting the expected value of the state of charge of the VRB energy storage unit;
3.3, establishing a mathematical model of photovoltaic power generation output:
obtaining a predicted value of the photovoltaic output power of the area i at the decision time t by using the formula (11)
Figure FDA00030074244500000222
Figure FDA0003007424450000031
In the formula (11), etapvThe photoelectric conversion efficiency; n ispvThe number of the photovoltaic cell panels; spvThe surface area of the photovoltaic cell panel receiving illumination is increased;
Figure FDA0003007424450000032
the solar radiation intensity of the area i at the decision time t; alpha is alphapvIn order to be the temperature conversion coefficient,
Figure FDA0003007424450000033
the outdoor temperature of the area i at the decision time t;
step 3.4, establishing a mathematical model of the regional scheduling time attribute:
obtaining the time attribute T of the area i at the decision time T by using the formula (12)i,t
Figure FDA0003007424450000034
In the formula (12), the reaction mixture is,
Figure FDA0003007424450000035
is the time magnitude parameter of the region i; c is a constant; p is a radical ofi,tThe power output at decision time t for region i,
Figure FDA0003007424450000036
maximum output power of the region i on the dispatching day;
step 3.5, establishing a peak regulation task allocation mechanism considering the elastic amplitude of the region:
setting the elastic amplitude of the area i at the decision time t as Ei,tElastic amplitude Ei,tUpper bound of (2) which can reduce the upper bound of the load
Figure FDA0003007424450000037
Transferable increased load upper bound
Figure FDA0003007424450000038
Energy storage and discharge allowance; elastic amplitude Ei,tLower bound of (2) reducible load lower bound + transferable reduced lower bound
Figure FDA0003007424450000039
Energy storage charging allowance; the elastic amplitude of the region i is formed by the span between the upper and lower bounds;
step 4, modeling of continuous variable discretization and uncertain random variable dynamic change processes:
step 4.1, establishing a multi-region power grid peak regulation demand uncertainty model:
at decision time t, issuing the power grid to the maximum range interval of the random uncertain part of the peak regulation demand instruction of the area i in real time
Figure FDA00030074244500000310
Is dispersed into
Figure FDA00030074244500000311
In total
Figure FDA00030074244500000312
A plurality of levels, wherein,
Figure FDA00030074244500000313
for the maximum value of the upward fluctuation based on the peak shaver demand predicted power for the region i at the decision time t,
Figure FDA00030074244500000314
the maximum value of downward fluctuation based on the peak load demand prediction power of the area i at the decision time t;
Figure FDA00030074244500000315
maximum discrete levels of upward and downward fluctuation amounts based on the predicted power of the peak shaving demand of the region i;
and (3) obtaining the actual peak regulation demand of the power grid of the area i at the decision time t by using the formula (13):
Figure FDA00030074244500000316
in the formula (13), the reaction mixture is,
Figure FDA00030074244500000317
predicting power for the peak shaver demand of the power grid in the area i at the decision time t,
Figure FDA00030074244500000318
the power level of the uncertain part of the peak shaving demand of the power grid in the area i at the decision time t,
Figure FDA00030074244500000319
the minimum unit power of the power grid peak regulation instruction uncertainty part discretization of the area i at the decision time t is obtained;
step 4.2, establishing a photovoltaic output uncertain model;
the maximum range interval of the photovoltaic output uncertain part of the area i at the decision time t
Figure FDA0003007424450000041
Is dispersed into
Figure FDA0003007424450000042
In total
Figure FDA0003007424450000043
A plurality of levels, wherein,
Figure FDA0003007424450000044
the maximum value of the upward fluctuation based on the photovoltaic output predicted power of the area i at the decision time t,
Figure FDA0003007424450000045
the maximum value of the downward fluctuation based on the photovoltaic output predicted power of the area i at the decision time t;
Figure FDA0003007424450000046
maximum discrete levels of upward and downward fluctuation amounts based on the photovoltaic output predicted power of the region i;
obtaining the actual photovoltaic output of the region i at the decision time t by using the formula (14)
Figure FDA0003007424450000047
Figure FDA0003007424450000048
In the formula (14), the compound represented by the formula (I),
Figure FDA0003007424450000049
the power is predicted for the photovoltaic output at decision time t for zone i,
Figure FDA00030074244500000410
for the indeterminate part of the photovoltaic output of zone i at decision time tA power level;
Figure FDA00030074244500000411
the minimum unit power of the area i after the photovoltaic output uncertain part is dispersed at decision time t is obtained;
step 4.3, establishing a multi-region uncertain model of each type of load demand:
the maximum range interval of the random uncertain part of the reducible load and the transferable load of the area i at the decision time t
Figure FDA00030074244500000412
And
Figure FDA00030074244500000413
respectively dispersed into corresponding state grades
Figure FDA00030074244500000414
And
Figure FDA00030074244500000415
in total
Figure FDA00030074244500000416
And
Figure FDA00030074244500000417
a plurality of levels, wherein,
Figure FDA00030074244500000418
and
Figure FDA00030074244500000419
respectively the maximum value of the upward fluctuation based on the forecasted power of the reducible load and the transferable load demand of the area i at the decision time t,
Figure FDA00030074244500000420
and
Figure FDA00030074244500000421
maximum values of downward fluctuation based on demand predicted power of reducible load and transferable load in the area i at decision time t respectively;
Figure FDA00030074244500000422
and
Figure FDA00030074244500000423
maximum discrete levels of upward and downward fluctuation amounts based on predicted power required by reducible load and transferable load in the region i respectively;
obtaining the actual power demand of the region i at the decision time t, which can reduce the load, by using the equations (15) to (16)
Figure FDA00030074244500000424
And the actual power demand of the load can be transferred
Figure FDA00030074244500000425
Figure FDA00030074244500000426
Figure FDA00030074244500000427
In the formulae (15) to (16),
Figure FDA00030074244500000428
power is predicted for the demand that region i can shed loads and transferable loads at decision time t,
Figure FDA00030074244500000429
the uncertain part of the required power of the reducible load and the transferable load in the area i at the decision time tA fractional power level;
Figure FDA00030074244500000430
respectively the minimum unit power of the region i after the uncertain part of the reducible load and the transferable load demand power is dispersed under the decision time t;
step 5, establishing a corresponding DTMDP model according to the peak regulation optimization problem of the multi-region active power distribution system considering the elasticity of power consumption requirements:
step 5.1, system state space and action set of the DTMDP model:
dividing a scheduling day into K e {0, 1.,. K-1}, and K decision periods; the time length of each decision period is delta t, and the decision time of the kth decision period is tkThe end time of the scheduling day is tK-1
The formula (17) -formula (18) is used for obtaining the decision time t of the scheduling centerkState of
Figure FDA0003007424450000051
Figure FDA0003007424450000052
Figure FDA0003007424450000053
In the formulae (17) to (18),
Figure FDA0003007424450000054
as a decision time tkThe real-time peak regulation demand state grade of the lower power grid,
Figure FDA0003007424450000055
for region i at decision time tkEnvironmental information of the environment, photovoltaic output state class
Figure FDA0003007424450000056
VRB energy storage unit charging and discharging state grade
Figure FDA0003007424450000057
Multi-type load scheduling unit load demand state grade
Figure FDA0003007424450000058
Elastic margin state rating
Figure FDA0003007424450000059
And zone elastic amplitude state level
Figure FDA00030074244500000510
Composition is carried out; supThe state space of a dispatching center, N is the number of regions;
setting the multi-type load containing type as M, if the area i does not consider the load j of a certain type, corresponding to the state number
Figure FDA00030074244500000511
Is 0; the total number of states N is obtained by the formula (19)up,s
Figure FDA00030074244500000512
In the formula (19), NpeakThe maximum state grade of the real-time peak regulation demand of the power grid,
Figure FDA00030074244500000513
the photovoltaic output maximum state grade of the region i,
Figure FDA00030074244500000514
for the region iVRB energy storage cell charge-discharge maximum state level,
Figure FDA00030074244500000515
is a regioni the maximum state level of the margin of elasticity,
Figure FDA00030074244500000516
for the state class with the largest elastic amplitude in the region i,
Figure FDA00030074244500000517
the maximum state grade of the load demand of the region i is set;
the dispatching center is arranged at decision time tkMaximum interval of lower random peak shaving demand power
Figure FDA00030074244500000518
The dispersion is 0 to Nap-1 to NapA plurality of levels, wherein,
Figure FDA00030074244500000519
for the scheduling centre at decision time tkLower total peak shaver power requirement, NapThe maximum discrete level is required for the total peak regulation of the dispatching center;
the peak-shaving task quantity distributed to the area i by the dispatching center is obtained by using the formula (20)
Figure FDA00030074244500000520
Figure FDA00030074244500000521
In the formula (20), the reaction mixture is,
Figure FDA00030074244500000522
for the scheduling centre at decision time tkDescending the peak shaving task action allocated to the area i;
the peaker task action assignment constraint is established using equation (21):
Figure FDA0003007424450000061
in the formula (21), the compound represented by the formula,
Figure FDA0003007424450000062
Aia set of all possible peak shaver task motion vectors for the region i;
the formula (22) is used for obtaining the decision time t of the dispatching centerkAction vector of
Figure FDA0003007424450000063
Figure FDA0003007424450000064
In the formula (22), AupA set of all possible action vectors, namely an action set, for a scheduling center; the total number of actions of the dispatching center is Nup,a=Nap
Obtaining region i at decision time t using equation (23)kState of
Figure FDA0003007424450000065
Figure FDA0003007424450000066
In the formula (23), the compound represented by the formula,
Figure FDA0003007424450000067
is the state space of the region i;
the total number of states of the region i is obtained by equation (24)
Figure FDA0003007424450000068
Figure FDA0003007424450000069
Obtaining region i at decision time t using equation (25)kDownward movement
Figure FDA00030074244500000610
Figure FDA00030074244500000611
In the formula (25), the reaction mixture,
Figure FDA00030074244500000612
for region i at decision time tkThe lower VRB energy storage unit acts, and the three values are respectively a discharging action, an idle action and a charging action;
Figure FDA00030074244500000613
for region i at decision time tkThe lower load scheduling unit adjusting action comprises load reduction action capable of reducing load
Figure FDA00030074244500000614
Load shifting actions
Figure FDA00030074244500000615
Figure FDA00030074244500000616
For region i at decision time tkNext different actuation control actions;
Figure FDA00030074244500000617
is the set of all possible motion vectors in the region i, i.e. the motion set of the region i;
the total number of operations of the region i is obtained by the equation (26)
Figure FDA00030074244500000618
Figure FDA00030074244500000619
Step 5.2, defining the state transition process of the DTMDP model:
the state-of-charge transfer equation for the VRB energy storage unit is established using equation (27):
Figure FDA00030074244500000620
in the formula (27), N is the number of the single batteries connected in series by the electric pile, IdIn order to charge and discharge the current,
Figure FDA00030074244500000621
the total capacity of the VRB energy storage unit;
Figure FDA0003007424450000071
VRB energy storage unit of area i at current decision time tkThe state of charge of the battery,
Figure FDA0003007424450000072
taking charging and discharging actions for VRB energy storage unit
Figure FDA0003007424450000073
A later state of charge;
a reducible load state transition equation is established using equation (28):
Figure FDA0003007424450000074
in the formula (28), the reaction mixture is,
Figure FDA0003007424450000075
for region i at decision time tkTake a curtailment action
Figure FDA0003007424450000076
The latter can reduce the load demand situation,
Figure FDA0003007424450000077
for region i at decision time tkCan reduce the predicted power of the load demand,
Figure FDA0003007424450000078
decision time tkThe uncertain part of the load demand can be reduced,
Figure FDA0003007424450000079
the maximum discrete level of the load demand can be reduced;
the transferable load state transfer equation is established using equation (29):
Figure FDA00030074244500000710
in the formula (29), the reaction mixture,
Figure FDA00030074244500000711
for region i at decision time tkTake transfer action down
Figure FDA00030074244500000712
The latter transferable load demand situation,
Figure FDA00030074244500000713
at the end decision time t for region iK-1The next transfer action to be taken is,
Figure FDA00030074244500000714
for region i at decision time tkThe transferable load demand of (a) predicts the power,
Figure FDA00030074244500000715
as a decision time tkThe uncertain portion of the lower transferable load demand,
Figure FDA00030074244500000716
the maximum discrete level of transferable load demand;
step 5.3, establishing an objective function of the DTMDP model:
the upper layer cost in the decision period k is obtained by using the formula (30)
Figure FDA00030074244500000717
Figure FDA00030074244500000718
In the formula (30), ci,kGenerating a cost for the region i in the state transition process of the decision period k;
the starting and ending state of charge consistency constraint of the VRB energy storage unit is established by using the formula (31):
Figure FDA00030074244500000719
in the formula (31), the reaction mixture,
Figure FDA00030074244500000720
is the weight coefficient of the last state of the VRB energy storage unit,
Figure FDA00030074244500000721
and
Figure FDA00030074244500000722
respectively setting the actual capacity grade of the VRB energy storage unit at the last decision moment and the expected capacity grade;
step 5.4, establishing an optimization target of the DTMDP model:
obtaining scheduling center-in-strategy pi by using formula (32)upThe initial state is s0Is optimized within a limited period of timePerformance criteria
Figure FDA0003007424450000081
Figure FDA0003007424450000082
The upper layer optimization target is in a strategy set omegaupFind the optimal strategy
Figure FDA0003007424450000083
Obtaining region i in strategy pi using equation (33)dow,iThe initial state is s0For a limited period of time to optimize the performance criterion
Figure FDA0003007424450000084
Figure FDA0003007424450000085
The lower layer optimization target is in a strategy set omegadow,iFind the optimal strategy
Figure FDA0003007424450000086
Step 6, solving the DTMDP model established in the step 5 by adopting Q learning based on simulated annealing;
firstly, initializing parameters, learning parameters, upper and lower layer Q value tables, current learning step numbers and decision periods of a DTMDP model; then the upper and lower layers randomly select the corresponding action of the current state according to the strategy, generate the corresponding cost and update the Q value table; and repeatedly and iteratively updating the Q value table until the termination condition is met, and obtaining a scheduling strategy of each scheduling resource in each decision period meeting the peak regulation requirement of the scheduling center within one scheduling day.
CN202110366783.5A 2021-04-06 2021-04-06 Multi-region active power distribution system peak regulation optimization method considering electricity demand elasticity Active CN112952847B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110366783.5A CN112952847B (en) 2021-04-06 2021-04-06 Multi-region active power distribution system peak regulation optimization method considering electricity demand elasticity

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110366783.5A CN112952847B (en) 2021-04-06 2021-04-06 Multi-region active power distribution system peak regulation optimization method considering electricity demand elasticity

Publications (2)

Publication Number Publication Date
CN112952847A true CN112952847A (en) 2021-06-11
CN112952847B CN112952847B (en) 2022-09-16

Family

ID=76232390

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110366783.5A Active CN112952847B (en) 2021-04-06 2021-04-06 Multi-region active power distribution system peak regulation optimization method considering electricity demand elasticity

Country Status (1)

Country Link
CN (1) CN112952847B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113690878A (en) * 2021-08-03 2021-11-23 北京京能能源技术研究有限责任公司 Micro-grid three-switch control method
CN113706018A (en) * 2021-08-27 2021-11-26 广东电网有限责任公司 User peak regulation behavior evaluation model establishing method, evaluation method and device
CN116760043A (en) * 2023-05-29 2023-09-15 中国建筑科学研究院有限公司 Heat pump system power grid peak shaving potential and effect evaluation method considering multi-dimension index
CN117937474A (en) * 2024-03-20 2024-04-26 保定博堃元信息科技有限公司 New energy station energy storage management method and system

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160189175A1 (en) * 2014-12-24 2016-06-30 Hanshuang LI System and method of sensitivity-driven pricing service for non-stationary demand management
CN105741017A (en) * 2016-01-22 2016-07-06 江苏省电力公司电力科学研究院 User demand response assessment method in intelligent power grid environment
CN106777487A (en) * 2016-11-18 2017-05-31 清华大学 A kind of credible capacity calculation methods of the photovoltaic plant containing energy-storage system and system
CN108964042A (en) * 2018-07-24 2018-12-07 合肥工业大学 Regional power grid operating point method for optimizing scheduling based on depth Q network
CN109103912A (en) * 2018-07-18 2018-12-28 合肥工业大学 Consider the industrial park active distribution system method for optimizing scheduling of peaking demand of power grid
WO2019035527A1 (en) * 2017-08-17 2019-02-21 한국전력공사 Blockchain-based power trading operation system, method therefor, and computer readable storage medium that stores said method
CN109494727A (en) * 2018-11-30 2019-03-19 国网江西省电力有限公司经济技术研究院 Consider the active and idle coordination optimization operation method of power distribution network of demand response
CN109524958A (en) * 2018-11-08 2019-03-26 国网浙江省电力有限公司经济技术研究院 Consider the electric system flexibility Optimization Scheduling of depth peak regulation and demand response
CN110112767A (en) * 2019-03-19 2019-08-09 华北电力大学 The polymorphic Demand-side load of wide area participates in the lotus source optimization control method of peak-load regulating
CN111628503A (en) * 2020-06-20 2020-09-04 东北电力大学 Day-ahead-day two-stage rolling optimization scheduling method considering generalized energy storage and thermal power combined peak shaving

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160189175A1 (en) * 2014-12-24 2016-06-30 Hanshuang LI System and method of sensitivity-driven pricing service for non-stationary demand management
CN105741017A (en) * 2016-01-22 2016-07-06 江苏省电力公司电力科学研究院 User demand response assessment method in intelligent power grid environment
CN106777487A (en) * 2016-11-18 2017-05-31 清华大学 A kind of credible capacity calculation methods of the photovoltaic plant containing energy-storage system and system
WO2019035527A1 (en) * 2017-08-17 2019-02-21 한국전력공사 Blockchain-based power trading operation system, method therefor, and computer readable storage medium that stores said method
CN109103912A (en) * 2018-07-18 2018-12-28 合肥工业大学 Consider the industrial park active distribution system method for optimizing scheduling of peaking demand of power grid
CN108964042A (en) * 2018-07-24 2018-12-07 合肥工业大学 Regional power grid operating point method for optimizing scheduling based on depth Q network
CN109524958A (en) * 2018-11-08 2019-03-26 国网浙江省电力有限公司经济技术研究院 Consider the electric system flexibility Optimization Scheduling of depth peak regulation and demand response
CN109494727A (en) * 2018-11-30 2019-03-19 国网江西省电力有限公司经济技术研究院 Consider the active and idle coordination optimization operation method of power distribution network of demand response
CN110112767A (en) * 2019-03-19 2019-08-09 华北电力大学 The polymorphic Demand-side load of wide area participates in the lotus source optimization control method of peak-load regulating
CN111628503A (en) * 2020-06-20 2020-09-04 东北电力大学 Day-ahead-day two-stage rolling optimization scheduling method considering generalized energy storage and thermal power combined peak shaving

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
BERK CELIK ET AL.: "Quantifying the Impact of Solar Photovoltaic and Energy Storage Assets on the Performance of a Residential Energy Aggregator", 《IEEE TRANSACTIONS ON SUSTAINABLE ENERGY》, vol. 11, no. 1, 31 January 2020 (2020-01-31), pages 405 - 414, XP011762385, DOI: 10.1109/TSTE.2019.2892603 *
XIAO-YAN GE ET AL.: "Energy-Sustainable Traffic Signal Timings for a Congested Road Network With Heterogeneous Users", 《 IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS》, vol. 15, no. 3, 30 June 2014 (2014-06-30), pages 1016 - 1025, XP011549671, DOI: 10.1109/TITS.2013.2291612 *
吴熙等: "基于Q学习算法的综合能源系统韧性提升方法", 《电力自动化设备》, vol. 40, no. 04, 30 April 2020 (2020-04-30), pages 146 - 152 *
曹筱欧: "适应间歇式新能源发电负荷调峰策略研究", 《微型电脑应用》, vol. 35, no. 1, 31 December 2019 (2019-12-31), pages 91 - 97 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113690878A (en) * 2021-08-03 2021-11-23 北京京能能源技术研究有限责任公司 Micro-grid three-switch control method
CN113690878B (en) * 2021-08-03 2023-11-21 北京京能能源技术研究有限责任公司 Three-phase switching control method for micro-grid
CN113706018A (en) * 2021-08-27 2021-11-26 广东电网有限责任公司 User peak regulation behavior evaluation model establishing method, evaluation method and device
CN116760043A (en) * 2023-05-29 2023-09-15 中国建筑科学研究院有限公司 Heat pump system power grid peak shaving potential and effect evaluation method considering multi-dimension index
CN117937474A (en) * 2024-03-20 2024-04-26 保定博堃元信息科技有限公司 New energy station energy storage management method and system

Also Published As

Publication number Publication date
CN112952847B (en) 2022-09-16

Similar Documents

Publication Publication Date Title
CN112952847B (en) Multi-region active power distribution system peak regulation optimization method considering electricity demand elasticity
Hossain et al. Modified PSO algorithm for real-time energy management in grid-connected microgrids
CN109103912B (en) Industrial park active power distribution system scheduling optimization method considering power grid peak regulation requirements
CN106532751B (en) A kind of distributed generation resource efficiency optimization method and system
CN111293682B (en) Multi-microgrid energy management method based on cooperative model predictive control
CN104158203A (en) Micro-grid power supply capacity optimization configuration method
CN112800658A (en) Active power distribution network scheduling method considering source storage load interaction
CN109114662B (en) Heating control method and system of electrothermal energy storage device based on multiple intelligent agents
CN112508325B (en) Household micro-grid multi-time scale electric energy scheduling method
CN113765154B (en) High-elasticity power grid source network load storage multielement collaborative optimization control method
CN114243794B (en) Wind, light and water multi-energy complementary system flexibility requirement quantification and coordination optimization method
CN115147245B (en) Virtual power plant optimal scheduling method for industrial load participating in peak shaving auxiliary service
CN117060470B (en) Power distribution network voltage optimization control method based on flexible resources
Dong et al. Optimal scheduling framework of electricity-gas-heat integrated energy system based on asynchronous advantage actor-critic algorithm
CN115036914A (en) Power grid energy storage double-layer optimization method and system considering flexibility and new energy consumption
CN111817296A (en) Electric energy scheduling method and system for micro-grid
CN111709574A (en) Distributed cluster configuration scheduling method, computer equipment and storage medium
CN114462854A (en) Hierarchical scheduling method and system containing new energy and electric vehicle grid connection
Tang et al. Schedule of air-conditioning systems with thermal energy storage considering wind power forecast errors
CN117559526A (en) Router-simulated energy regulation and control method based on optical storage and charging integrated charging station
Desdouits et al. Multisource elevator energy optimization and control
Yunjia et al. Independent microgrid day-ahead optimization based on demand response
CN116961008A (en) Micro-grid capacity double-layer optimization method considering power spring and load demand response
CN116384655A (en) New energy consumption-oriented source network load side energy storage system optimization planning method and system
CN114421538A (en) Source-storage collaborative planning configuration method containing source-load time sequence similarity constraint

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20220804

Address after: 230001 No. 193, Tunxi Road, Hefei, Anhui

Applicant after: Hefei University of Technology

Applicant after: State Grid Anhui Electric Power Company

Address before: Tunxi road in Baohe District of Hefei city of Anhui Province, No. 193 230009

Applicant before: Hefei University of Technology

GR01 Patent grant
GR01 Patent grant