CN115986748A - Data-driven microgrid voltage control method - Google Patents

Data-driven microgrid voltage control method Download PDF

Info

Publication number
CN115986748A
CN115986748A CN202211708121.2A CN202211708121A CN115986748A CN 115986748 A CN115986748 A CN 115986748A CN 202211708121 A CN202211708121 A CN 202211708121A CN 115986748 A CN115986748 A CN 115986748A
Authority
CN
China
Prior art keywords
network
voltage
model
power
time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211708121.2A
Other languages
Chinese (zh)
Inventor
沈嘉伟
李培帅
韩静
董彦昊
陈敏强
王艺涵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University of Science and Technology
Original Assignee
Nanjing University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University of Science and Technology filed Critical Nanjing University of Science and Technology
Priority to CN202211708121.2A priority Critical patent/CN115986748A/en
Publication of CN115986748A publication Critical patent/CN115986748A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Supply And Distribution Of Alternating Current (AREA)

Abstract

The invention discloses a data-driven microgrid voltage control method, which relates to the technical field of microgrid application and comprises the following steps: dividing the microgrid into a plurality of sub-networks which are mutually connected and coupled through tide based on a distributed architecture, and constructing an agent corresponding to the internal control of the sub-networks aiming at each sub-network; constructing a combined control model with the reactive output of the photovoltaic inverter as a main part and the active output of the energy storage system as an auxiliary part, and constructing a barrel-shaped voltage barrier function model by combining a V-shaped voltage barrier function model and a U-shaped voltage barrier function model; taking voltage constraint based on the network total power loss and the voltage barrier function model as target control, balancing voltage deviation and network power loss by using a weighting sum algorithm, and finally establishing a micro-grid distributed VVC model; the micro-grid distributed VVC model is fitted into a partially observable Markov decision POMG model, and a state action equation is modified from adaptation of discrete action to adaptation of continuous action so as to adapt to the real-time control requirement.

Description

Data-driven microgrid voltage control method
Technical Field
The invention relates to the technical field of micro-grid application, in particular to a data-driven micro-grid voltage control method.
Background
A micro-grid (micro-grid) refers to a small-sized power generation and distribution system formed by collecting a distributed power supply, an energy storage device, an energy conversion device, and related load, monitoring and protection devices, and is an autonomous system capable of realizing self-control, protection and management, and can be operated in a grid-connected manner with an external power grid or in an isolated manner. Is an important component of the smart grid. The microgrid has a dual role: for a power grid, a micro-grid is used as an intelligent load with changeable size, a schedulable load is provided for a local power system, a response can be made within a few seconds to meet the system requirement, and powerful support is provided for the large power grid in due time; the system can be maintained without influencing the load of a client; the method can reduce (prolong) updating of the power distribution network, adopts the IEEE1547.4 standard to guide the operation of the distributed power supply island, and can eliminate technical obstacles generated by certain special operation requirements. For the user, the microgrid acts as a customizable power supply, which can meet the diverse needs of the user, for example, enhancing local power supply reliability, reducing feed losses, supporting local voltage, improving efficiency by utilizing waste heat, providing correction of voltage sag, or serving as an uninterruptible power supply, etc. The micro-grid not only solves the problem of large-scale access of the distributed power supply, gives full play to various advantages of the distributed power supply, but also brings other multi-aspect benefits to users. The micro-grid fundamentally changes the traditional mode of coping with load increase, and has great potential in the aspects of reducing energy consumption, improving reliability and flexibility of a power system and the like.
For micro-grids, some voltage/reactive power control (VVC) methods have been developed to solve the voltage regulation problem and reduce the power consumption of the grid and other power management problems. Among other things, inverters and energy storage devices have been extensively studied and used to participate in microgrid regulation control due to their increasing availability, flexibility and rapid response speed of advanced power electronics technologies. The control framework for the microgrid mainly comprises 3 control frameworks of centralized control, local control and distributed control at present. Distributed methods have been increasingly studied because they do not require complex communication networks and can take into account the privacy, complexity, scalability, and globality issues that exist.
A common research idea for distributed control is to transform a non-convex problem into a convex optimization problem through reasonable approximation and simplification, and then solve the problem using a distributed algorithm. To cope with the uncertainty of distributed new energy and load demand, model-based distributed control methods typically require a predetermined optimal solution. However, when new situations arise, conventional model control methods must solve optimization problems, which require a large amount of computation. Furthermore, model-based control methods typically require accurate and comprehensive grid parameters, which are often difficult to achieve.
Disclosure of Invention
In order to solve the above-mentioned drawbacks of the prior art, the present invention provides a data-driven microgrid voltage control method.
The purpose of the invention can be realized by the following technical scheme: a data-driven microgrid voltage control method comprises the following steps:
dividing a micro-grid into a plurality of sub-networks which are mutually connected and coupled through power flow on the basis of a distributed architecture, and constructing an intelligent agent corresponding to the internal control of the sub-networks for each sub-network;
constructing a combined control model with the reactive output of the photovoltaic inverters as a main part and the active output of the energy storage system as an auxiliary part, controlling all the photovoltaic inverters and energy storage equipment in the sub-networks by a single intelligent body corresponding to each sub-network, and realizing effective control on the voltage of the micro-grid by controlling the reactive output of the photovoltaic inverters and the active output of the energy storage system;
combining the V-shaped voltage barrier function model and the U-shaped voltage barrier function model to construct a barrel-shaped voltage barrier function model;
taking voltage constraint based on a network total power loss and voltage barrier function model as target control, balancing voltage deviation and network power loss by using a weighting sum algorithm, establishing a space-time uncertainty model based on photovoltaic and load prediction intervals, establishing a network power flow constraint model based on Newton-Raphson method solution, integrating the space-time uncertainty model based on the photovoltaic and load prediction intervals, a combined control model taking the reactive power output of a photovoltaic inverter as a main part and taking the active power output of an energy storage system as an auxiliary part, balancing a control target after voltage deviation and network power loss and the network power flow constraint model, and establishing a micro-grid distributed VVC model;
fitting a micro-grid distributed VVC model into a partially observable Markov decision POMG model, and modifying a state action equation from adaptive discrete action to adaptive continuous action to adapt to real-time control requirements;
converting the constructed space-time uncertainty model based on the photovoltaic and load prediction interval into a random planning model, and adding a network solving failure punishment in the return reward of each training set to improve the network solving success rate;
a multi-agent depth certainty strategy gradient MADDPG algorithm is used, an algorithm process is designed, and the algorithm process is applied to a micro-grid.
Preferably, the distributed architecture does not require a central coordinator to collect all the information generated in the network, requiring global coordination issues to be considered; in distributed optimization, the sub-network represented by each constructed agent exchanges only limited boundary physical information and global reward return with its neighbors, collectively seeking a global optimal solution, during each operation the trained controller implements locally measured VVC within the respective sub-network.
Preferably, the joint control model with the photovoltaic inverter reactive output as the main and the energy storage system active output as the auxiliary is as follows:
Figure BDA0004025461920000031
Figure BDA0004025461920000032
Figure BDA0004025461920000033
Figure BDA0004025461920000041
Figure BDA0004025461920000042
Figure BDA0004025461920000043
represents the real-time active power of the photovoltaic output->
Figure BDA0004025461920000044
Represents the real-time reactive power output by the photovoltaic inverter,
Figure BDA0004025461920000045
represents the complex power of the photovoltaic inverter->
Figure BDA0004025461920000046
Represents the photovoltaic active power output boundary at t, the delta photovoltaic inverter reactive power capacity factor, and->
Figure BDA0004025461920000047
And &>
Figure BDA0004025461920000048
Represents the minimum and maximum active power that the energy storage system can emit and absorb, ->
Figure BDA0004025461920000049
Represents the active power output value of the stored energy at t->
Figure BDA00040254619200000410
Represents the real-time capacity of the stored energy at t->
Figure BDA00040254619200000411
Representing the maximum capacity of stored energy.
The photovoltaic inverters preferentially provide reactive power when needed, when the reactive power compensation capability is insufficient, the energy storage system can act, the reactive power output by each photovoltaic inverter is limited in a preset proportion of the apparent power capacity, a positive value indicates that the reactive power is injected into a power grid, and a negative value indicates that the reactive power of the power grid is absorbed; the energy storage system is arranged similarly to the inverter, positive values indicate that active power is injected into the power grid, negative values indicate that the active power of the power grid is absorbed, and the residual electric quantity of the energy storage system is always positive.
Preferably, the barrel-type voltage barrier function model is as follows:
Figure BDA00040254619200000412
in the formula, v a Real-time voltage magnitude for the node; v. of ref Taking 1.00p.u. as a network voltage reference value; l v (v a ) Real-time reward for the node voltage;
and the voltage barrier function model combines the advantages of V-shaped and U-shaped: on one hand, it has a slow gradient in a safe range, and better voltage conditions are obtained; on the other hand, a larger gradient outside the safe range ensures faster policy guidance.
Preferably, the control target after balancing the voltage deviation and the network power loss is as follows:
Figure BDA00040254619200000413
Figure BDA00040254619200000414
Figure BDA0004025461920000051
in the formula, lv (vi, t) is a real-time voltage barrier function value of an i node at the t moment; n is the number of network nodes; n is a radical of m Is a network node set;
Figure BDA0004025461920000052
is a network branch set; r is ij And x ij Respectively representing the resistance and reactance of a branch between the i node and the j node; v. of i,t And v j,t Respectively representing the voltage amplitudes of an i node and a j node at the time t;
then, a weighting sum algorithm is adopted to convert the multi-objective function into an equivalent single objective function with weighting factors, the targets are normalized by using Utox points and nadier points, and the weighting sum of the normalized targets obtained for any sub-network m is expressed as:
Figure BDA0004025461920000053
in combination with>
Figure BDA0004025461920000054
Represents the voltage deviation at the moment t of the m-network after normalization, and>
Figure BDA0004025461920000055
and the network active loss at the t moment of the m network after normalization is represented, and alpha and beta represent normalized coefficients.
Preferably, the process of establishing the photovoltaic and load prediction interval-based spatio-temporal uncertainty model is as follows:
before each operation period, generating a spatial uncertainty scene in a given prediction interval randomly through Monte Carlo sampling, generating a time uncertainty scene through Monte Carlo sampling in each operation period, considering time lag through the time uncertainty interval, calculating voltage deviation and network loss under each time uncertainty scene, and correcting the two targets by using scene occurrence probability, wherein the corrected normalized target is represented as:
Figure BDA0004025461920000056
Figure BDA0004025461920000057
normalizing the sum of the values for the case in the stochastic programming (mean voltage deviation/loss at time t of the m-network);
Figure BDA0004025461920000058
is the sum of the normalized values in the case of u; xi shape u Indicating the probability of occurrence of the u-situation.
Preferably, the process of modifying the state action equation from adapting to discrete action to adapting to continuous action is as follows:
the state-action function is used to indicate the current state or the integrated return of a state-action pair as follows:
Figure BDA0004025461920000061
in the formula, τ i Represents a history of agent i; a is -i =× j≠i a j To accommodate continuous motion, the state-motion function is modified to the form:
Figure BDA0004025461920000062
in the formula (II), based on a' i Is distributed by pi i (a′ i ∣τ i ) It is shown that,
Figure BDA0004025461920000063
obtained by monte carlo sampling, as:
Figure BDA0004025461920000064
preferably, the process of adding a network solution failure penalty to increase the network solution success rate is as follows:
adding a network solution failure penalty function F:
F=-f,t f <T max
in the formula, f is a penalty occurrence constant and is a large positive number; t is t f The duration of successful network solving in the training set with interruption due to failure of network solving; t is max Maximum time step for each training set;
the reward for each training set interrupted by the occurrence of network solution failure is adjusted to R mf
R mf =R m +F
In the formula, R m The normal prize value obtained before the training set break.
Preferably, an apparatus comprises:
one or more processors;
a memory for storing one or more programs;
when executed by one or more of the processors, cause the one or more processors to implement a data-driven microgrid voltage control method as described above.
Preferably, a storage medium contains computer executable instructions for performing a data driven microgrid voltage control method as described above when executed by a computer processor.
The invention has the beneficial effects that:
based on a data driving method and a distributed architecture, the invention adopts a mode of combining centralized training and decentralized execution, and obtains an effective operation control strategy by performing offline training on historical data, thereby being capable of making a real-time joint output decision on line according to the real-time operation condition of the network, and ensuring the reliability, economy and safety in the network operation process.
Drawings
In order to more clearly illustrate the embodiments or technical solutions in the prior art of the present invention, the drawings used in the description of the embodiments or prior art will be briefly described below, and it is obvious for those skilled in the art to obtain other drawings without creative efforts;
FIG. 1 is a schematic diagram of the overall framework of the network of the present invention;
FIG. 2 is a schematic diagram of an algorithm flow framework of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
As shown in fig. 1, a method for controlling voltage of a data-driven microgrid comprises the following steps:
step (1): the microgrid is divided into a plurality of sub-networks which are mutually connected and coupled through tide based on a distributed architecture, and an agent controlled inside the corresponding sub-network is constructed for each sub-network.
Considering that the traditional centralized VVC framework has the problems of communication and computation burden, privacy, and the like, and the problem that the traditional local control method cannot perform global coordination, a regional coordination microgrid VVC framework is constructed, as shown in fig. 1. The micro-grid is divided into a plurality of sub-networks which are mutually connected and coupled through power flow on the basis of a distributed architecture, and an agent corresponding to internal control of each sub-network is constructed. Each constructed agent's sub-network only exchanges limited boundary physical information and global reward return with its neighboring sub-networks, collectively seeking a global optimal solution. During each operation, the trained controller implements locally measured VVCs within its respective sub-network.
Step (2): and constructing a combined control model with the reactive output of the photovoltaic inverters as a main part and the active output of the energy storage system as an auxiliary part, controlling all the photovoltaic inverters and energy storage equipment in the sub-networks by a single intelligent body corresponding to each sub-network, and realizing effective control on the voltage of the micro-grid by controlling the reactive output of the photovoltaic inverters and the active output of the energy storage system.
The inverter control model is shown in (1) to (3). The reactive power output by each inverter is also limited within a preset proportion of the apparent power capacity of the inverter, positive values indicate that the reactive power is injected into the power grid, and negative values indicate that the reactive power of the power grid is absorbed.
Figure BDA0004025461920000081
Figure BDA0004025461920000082
Figure BDA0004025461920000083
Wherein the content of the first and second substances,
Figure BDA0004025461920000084
represents the real-time active power of the photovoltaic output->
Figure BDA0004025461920000085
Represents the real-time reactive power output by the photovoltaic inverter and is greater or less>
Figure BDA0004025461920000086
Representing photovoltaicsThe complex power of the inverter is->
Figure BDA0004025461920000087
And a photovoltaic active power output boundary at t, and a delta photovoltaic inverter reactive power capacity factor are represented.
The photovoltaic inverter droop control scheduling model is shown as (4) and (5). The energy storage system is arranged similarly to the inverter, positive values indicate that active power is injected into the power grid, negative values indicate that the active power of the power grid is absorbed, and the residual electric quantity of the energy storage system is always positive.
Figure BDA0004025461920000088
Figure BDA0004025461920000089
Wherein the content of the first and second substances,
Figure BDA00040254619200000810
and &>
Figure BDA00040254619200000811
Represents the minimum and maximum active power that the energy storage system can emit and absorb->
Figure BDA0004025461920000091
Represents the active power output value of the stored energy at t->
Figure BDA0004025461920000092
Represents the real-time capacity of the stored energy at t->
Figure BDA0004025461920000093
Representing the maximum capacity of stored energy.
The photovoltaic inverters preferentially provide reactive power when needed, when the reactive power compensation capability is insufficient, the energy storage system can act, the reactive power output by each photovoltaic inverter is limited in a preset proportion of the apparent power capacity, a positive value indicates that the reactive power is injected into a power grid, and a negative value indicates that the reactive power of the power grid is absorbed; the energy storage system is arranged similarly to the inverter, positive values indicate that active power is injected into the power grid, negative values indicate that the active power of the power grid is absorbed, and the residual electric quantity of the energy storage system is always positive.
And (3): combining the V-shaped voltage barrier function model and the U-shaped voltage barrier function model to construct a barrel-shaped voltage barrier function model;
Figure BDA0004025461920000094
Figure BDA0004025461920000095
in the formula (6), the V-shaped voltage barrier function has a larger gradient in the range of 0.95p.u. -1.05p.u., so that better voltage conditions can be realized, but fine adjustment cannot be realized in the range. The U-shaped voltage barrier function in the formula (7) has a small gradient outside the range of 0.95p.u. -1.05p.u, and cannot enter a safety range quickly.
Combining the advantages of the two types of the voltage barrier function model:
Figure BDA0004025461920000096
in the formula (8), v a The real-time voltage magnitude of the node; v. of ref Generally, 1.00p.u. is taken as a network voltage reference value; l v (v a ) Real-time awards for the node voltage.
The model combines the advantages of V-shaped and U-shaped models: on one hand, the gradient of the voltage gradient is slow in a safe range, so that better voltage conditions can be obtained; on the other hand, a larger gradient outside the safe range may ensure faster policy guidance.
And (4): the method comprises the steps of taking voltage constraint based on a network total power loss and voltage barrier function model as target control, balancing voltage deviation and network power loss by using a weighting sum algorithm, establishing a space-time uncertainty model based on photovoltaic and load prediction intervals, establishing a network power flow constraint model based on Newton-Raphson method solution, integrating the space-time uncertainty model based on the photovoltaic and load prediction intervals, taking the reactive power output of a photovoltaic inverter as a main combined control model and taking the active power output of an energy storage system as an auxiliary combined control model, balancing a control target after voltage deviation and network power loss and the network power flow constraint model, and establishing a micro-grid distributed VVC model;
two VVC targets, i.e., voltage constraint and network active power minimization, are set per subnet as shown in (9). Where (10) is e.g. the voltage deviation and (11) the total power loss at time t of the network.
Figure BDA0004025461920000101
Figure BDA0004025461920000102
Figure BDA0004025461920000103
In the formula (10), l v (v i,t ) The value of the real-time voltage barrier function of the i node at the time t; n is the number of network nodes; n is a radical of m Is a network node set; in formula (11)
Figure BDA0004025461920000104
For a set of network legs, r ij And x ij Representing the resistance and reactance of the branch between the i and j nodes, respectively. v. of i,t And v j,t Respectively, the voltage amplitudes of the i node and the j node at time t.
And converting the multi-objective function into an equivalent single objective function with a weighting factor by adopting a classical weighted sum algorithm. Here we normalize these targets using the utopia point and nadir point, and for subnetwork m we get a weighted sum of the normalized targets expressed as:
Figure BDA0004025461920000105
in the formula (I), the compound is shown in the specification,
Figure BDA0004025461920000106
represents the voltage deviation at the moment t of the m-network after normalization, and>
Figure BDA0004025461920000107
and the network active loss at the t moment of the m network after normalization is represented, and alpha and beta represent normalized coefficients.
And establishing a space-time uncertainty model based on photovoltaic and load prediction intervals.
A spatial uncertainty model is first established based on the prediction interval of photovoltaic power generation and load, and then optimized at each decision time step. The time interval in the optimized model varies with time and is correlated with the real-time measurement at t.
Spatial and temporal uncertainties need to be addressed to ensure operational constraints, taking into account the location of renewable energy generation and load changes, as well as short term intermittency and fluctuations.
Firstly, a space uncertainty model is established based on the predicted photovoltaic power generation time interval and load, and the uncertainty which can be realized at the position t0 is limited in a prediction interval:
Figure BDA0004025461920000111
Figure BDA0004025461920000112
in the formula (I), the compound is shown in the specification,
Figure BDA0004025461920000113
and representing the upper and lower limits of active and reactive power requirements of the maximum power point/bus i of the space uncertainty inverter.
In the above spatial uncertainty model, communication delay may cause response delay of the inverter, and in order to eliminate the influence of these delays, a time uncertainty model is established, and each decision time step is optimized and expressed as:
Figure BDA0004025461920000114
Figure BDA0004025461920000115
in the formula (I), the compound is shown in the specification,
Figure BDA0004025461920000116
and (3) showing the time uncertainty upper and lower limits of the maximum power point/bus active power and reactive power requirements of the inverter. The spacing of this time uncertainty model varies with time and is related to the real-time measurement at t.
The intervals of the time uncertainty model vary with time and are related to real-time measurements at t time.
And establishing a network power flow constraint model based on Newton Raphson method solution as shown in a formula (18).
Figure BDA0004025461920000117
In the formula p i,t And q is i,t Respectively representing active power and reactive power injected by the i node at the moment t, v i,t Representing the magnitude of the voltage at time t of the i-node, g ij And b ij Representing the conductance and susceptance, θ, of the busbar between the i-node and the j-node ij,t And N is an index of all nodes of the microgrid, and represents the voltage phase angle difference between the i node and the j node at the time t.
And (3) integrating a space-time uncertainty model based on photovoltaic and load prediction intervals, a combined control model with photovoltaic inverter reactive power output as a main part and energy storage system active power output as an auxiliary part, a balanced voltage deviation, a network power loss control target and a network power flow constraint model, and establishing a distributed VVC model of the microgrid as shown in a formula (19).
Figure BDA0004025461920000121
In the formula (19), M is a network partition index; t is the total duration; v. of i,t Is a real-time voltage.
And (5): fitting a micro-grid distributed VVC model into a partially observable Markov decision POMG model, and modifying a state action equation from adaptive discrete action to adaptive continuous action to adapt to real-time control requirements;
firstly, setting an observation space containing v i,t
Figure BDA0004025461920000122
And &>
Figure BDA0004025461920000123
Respectively representing node real-time voltage, active power consumption of a node connection load, reactive power consumption of the node connection load, active power of a real-time photovoltaic injection power grid and real-time energy storage residual energy; the successive action spaces comprising->
Figure BDA0004025461920000124
And &>
Figure BDA0004025461920000125
And respectively representing the reactive output power and the energy storage active output power of the inverter.
The goal of determining each actor is to maximize its expected revenue over a time horizon T, i.e.
Figure BDA0004025461920000126
The state-motion equations are modified from adapting discrete motion to adapting continuous motion to accommodate real-time control requirements.
The state-action function is used to indicate the current state or the integrated return of a state-action pair:
Figure BDA0004025461920000131
in the formula, τ i Represents a history of agent i; a is -i =× j≠i a j . To accommodate the continuous motion, we modify it to the following form:
Figure BDA0004025461920000132
in the formula (II), based on a' i Is distributed by pi i (a′ i ∣τ i ) And (4) showing. In the practical application of the method, the material is,
Figure BDA0004025461920000133
obtained by approximation of a monte carlo sample, which can be rewritten as:
Figure BDA0004025461920000134
and (6): converting the constructed space-time uncertainty model based on the photovoltaic and load prediction interval into a random planning model, and adding a network solving failure punishment in the return reward of each training set to improve the network solving success rate;
a spatial uncertainty scene is generated by Monte Carlo sampling according to the prediction intervals of equations (13) and (14), a time uncertainty scene is generated by Monte Carlo sampling according to equations (15) and (16) in consideration of time lag, and the voltage deviation and the network loss are corrected by using the scene occurrence probability. Xi shape u The occurrence probability of the situation U belongs to U, and the situation is normalized:
Figure BDA0004025461920000135
/>
in equation (24), U considers each decisionThere are U time uncertainty scenes xi on the time step u Is the probability of occurrence of the scenario U e U.
Figure BDA0004025461920000136
The sum of the values (average voltage deviation/grid loss at time instant m network t) is normalized for the case in stochastic programming.
Figure BDA0004025461920000137
Is the sum of the normalized values in the case of u.
Finally, a strategy gradient function is given:
Figure BDA0004025461920000138
in order to reduce the problem of network solution failure in the training process caused by the problem of the bearing capacity of the power network, a network solution failure penalty function F is added:
F=-f,t f <T max (25)
in the formula, f is a penalty occurrence constant and is a larger positive number; t is t f The duration of successful network solving in the training set with interruption due to failure of network solving; t is max The maximum time step for each training set.
Then the reward for each training set interrupted by the occurrence of network solution failure at this time is adjusted to:
R mf =R m +F(26)
in the formula (26), R m The normal prize value obtained before the training set interruption.
And (7): a multi-agent depth certainty strategy gradient MADDPG algorithm is used, a reasonable algorithm flow is designed, and the method is applied to a micro-grid.
The specific algorithm flow is shown in fig. 2. During initialization, the capacity of the energy storage system is set to be 50% of the total capacity, and if the energy storage action is carried out, the action value same as that of the photovoltaic is adopted. For each training set, the buffer stores 480 time steps (i.e., 1 day) of PV and load data. Further, the bonus is obtained based on the calculation result of the buffer data according to the PandaPower package. The received state assigns a zone according to each agent to an observation before providing feedback to the agent. Each agent receives only local observations and global rewards before making the next decision. The above process will be repeated until the training single set is finished.
After each set of actions is completed, a reward is obtained. A training comprises a plurality of training sets, and the number of sets required by training is manually set according to needs. Behavior change and policy model preservation occurs once every 40 training sets and the targets are updated once every 120 sets. Before each policy model update, 10 sets of tests will be performed. The test data is based on the total data sample and the average of the test will be used to evaluate the effectiveness of the strategy.
Based on the same inventive concept, the present invention also provides a computer apparatus, comprising: one or more processors, and memory for storing one or more computer programs; the program includes program instructions and the processor is configured to execute the program instructions stored by the memory. The Processor may be a Central Processing Unit (CPU), or may be other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other Programmable logic device, discrete gate or transistor logic device, discrete hardware component, etc., which is a computing core and a control core of the terminal and is used for implementing one or more instructions, and in particular for loading and executing one or more instructions in a computer storage medium, so as to implement the method described above.
It should be further noted that, based on the same inventive concept, the present invention also provides a computer storage medium, on which a computer program is stored, and the computer program is executed by a processor to perform the above method. The storage medium may take any combination of one or more computer-readable media. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. The computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electrical, magnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of the present invention, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
In the description herein, references to the description of "one embodiment," "an example," "a specific example," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the disclosure. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
The foregoing illustrates and describes the general principles, principal features, and advantages of the present disclosure. It will be understood by those skilled in the art that the present disclosure is not limited to the embodiments described above, which are presented solely for illustrating the principles of the disclosure, but that various changes and modifications may be made without departing from the spirit and scope of the disclosure, which fall within the scope of the claimed disclosure.

Claims (10)

1. A data-driven microgrid voltage control method is characterized by comprising the following steps:
dividing the microgrid into a plurality of sub-networks which are mutually connected and coupled through tide based on a distributed architecture, and constructing an agent corresponding to the internal control of the sub-networks aiming at each sub-network;
constructing a combined control model with the reactive output of the photovoltaic inverters as a main part and the active output of the energy storage system as an auxiliary part, controlling all the photovoltaic inverters and energy storage equipment in the sub-networks by a single intelligent body corresponding to each sub-network, and realizing effective control on the voltage of the micro-grid by controlling the reactive output of the photovoltaic inverters and the active output of the energy storage system;
combining the V-shaped voltage barrier function model and the U-shaped voltage barrier function model to construct a barrel-shaped voltage barrier function model;
taking voltage constraint based on a network total power loss and voltage barrier function model as target control, balancing voltage deviation and network power loss by using a weighting sum algorithm, establishing a space-time uncertainty model based on photovoltaic and load prediction intervals, establishing a network power flow constraint model based on Newton-Raphson method solution, integrating the space-time uncertainty model based on the photovoltaic and load prediction intervals, a combined control model taking the reactive power output of a photovoltaic inverter as a main part and taking the active power output of an energy storage system as an auxiliary part, balancing a control target after voltage deviation and network power loss and the network power flow constraint model, and establishing a micro-grid distributed VVC model;
fitting a micro-grid distributed VVC model into a partially observable Markov decision POMG model, and adapting to real-time control requirements by improving a state action equation from adapting to discrete actions to adapting to continuous actions;
converting the well-constructed space-time uncertainty model based on the photovoltaic and load prediction interval into a stochastic programming model, and adding a network solution failure punishment in the return reward of each training set to improve the success rate of network solution;
a multi-agent depth certainty strategy gradient MADDPG algorithm is used, an algorithm process is designed, and the algorithm process is applied to a micro-grid.
2. The voltage control method of the data-driven microgrid according to claim 1, characterized in that the distributed architecture does not require a central coordinator to collect all the information generated in the network, and needs to consider global coordination problems; in distributed optimization, the sub-network represented by each constructed agent exchanges only limited boundary physical information and global reward return with its neighbors, collectively seeking a global optimal solution, during each operation the trained controller implements locally measured VVC within the respective sub-network.
3. The method for controlling the voltage of the data-driven microgrid according to claim 1, wherein the combined control model with the photovoltaic inverter having the reactive output as the main and the energy storage system having the active output as the auxiliary is as follows:
Figure FDA0004025461910000021
Figure FDA0004025461910000022
Figure FDA0004025461910000023
Figure FDA0004025461910000024
Figure FDA0004025461910000025
Figure FDA0004025461910000026
represents the real-time active power of the photovoltaic output->
Figure FDA0004025461910000027
Represents the real-time reactive power output by the photovoltaic inverter and is combined with the voltage value>
Figure FDA0004025461910000028
Represents the complex power of the photovoltaic inverter->
Figure FDA0004025461910000029
Represents the photovoltaic active power output boundary at t, the delta photovoltaic inverter reactive power capacity factor, and->
Figure FDA00040254619100000210
And &>
Figure FDA00040254619100000211
Represents the minimum and maximum active power that the energy storage system can emit and absorb, ->
Figure FDA00040254619100000212
Represents the active power output value of the stored energy at t->
Figure FDA00040254619100000213
Represents the real-time capacity of the stored energy at t->
Figure FDA00040254619100000214
Represents the maximum capacity of stored energy;
the photovoltaic inverters preferentially provide reactive power when needed, when the reactive power compensation capability is insufficient, the energy storage system can act, the reactive power output by each photovoltaic inverter is limited in a preset proportion of the apparent power capacity, a positive value indicates that the reactive power is injected into a power grid, and a negative value indicates that the reactive power of the power grid is absorbed; the energy storage system is arranged similarly to the inverter, positive values indicate that active power is injected into the power grid, negative values indicate that the active power of the power grid is absorbed, and the residual electric quantity of the energy storage system is always positive.
4. The method of claim 1, wherein the bucket type voltage barrier function model is as follows:
Figure FDA0004025461910000031
in the formula, v a The real-time voltage magnitude of the node; v. of ref Taking 1.00p.u. as a network voltage reference value; l v (v a ) Real-time reward for node voltage;
and the voltage barrier function model combines the advantages of the V type and the U type: on one hand, it has a slow gradient in a safe range, and better voltage conditions are obtained; on the other hand, a larger gradient outside the safe range ensures faster policy guidance.
5. The method of claim 1, wherein the trade-off between voltage deviation and network power consumption is the control objective:
Figure FDA0004025461910000032
Figure FDA0004025461910000033
Figure FDA0004025461910000034
in the formula I v (v i,t ) The value of the real-time voltage barrier function of the i node at the time t; n is the number of network nodes; n is a radical of hydrogen m Is a network node set;
Figure FDA0004025461910000035
is a network branch set; r is ij And x ij Respectively representing the resistance and reactance of a branch between the i node and the j node; v. of i,t And v j,t Respectively represent t timesVoltage amplitudes of an i node and a j node are carved;
then, a weighting sum algorithm is adopted to convert the multi-objective function into an equivalent single objective function with weighting factors, the targets are normalized by using Utox points and nadier points, and the weighting sum of the normalized targets obtained for any sub-network m is expressed as:
Figure FDA0004025461910000036
in the formula (II)>
Figure FDA0004025461910000037
Represents the voltage deviation at the moment t of the m-network after normalization, and>
Figure FDA0004025461910000038
and the network active loss at the t moment of the m network after normalization is represented, and alpha and beta represent normalized coefficients.
6. The data-driven microgrid voltage control method of claim 1, characterized in that the process of establishing a photovoltaic and load prediction interval-based spatio-temporal uncertainty model is as follows:
before each operation period, generating a spatial uncertainty scene in a given prediction interval randomly through Monte Carlo sampling, generating a time uncertainty scene through Monte Carlo sampling in each operation period, considering time lag through the time uncertainty interval, calculating voltage deviation and network loss under each time uncertainty scene, and correcting the two targets by using scene occurrence probability, wherein the corrected normalized target is represented as:
Figure FDA0004025461910000041
Figure FDA0004025461910000042
represents the normalized average voltage deviation/network power loss under scenario u; />
Figure FDA0004025461910000043
The normalized average voltage deviation/network power loss of the network m at time t, i.e. the sum of the normalized values in all cases in stochastic programming; xi u Representing the probability of occurrence of the scene u.
7. The method of claim 1, wherein the process of modifying the equation of state action from adaptive discrete action to adaptive continuous action is as follows:
the state-action function is used to indicate the current state or the integrated return of a state-action pair as follows:
Figure FDA0004025461910000044
in the formula, τ i Represents a history of agent i; a is -i =× j≠i a j To accommodate continuous motion, the state-motion function is modified to the following form:
Figure FDA0004025461910000045
in the formula (II), based on a' i Is distributed by pi i (a′ i ∣τ i ) It is shown that,
Figure FDA0004025461910000046
obtained by monte carlo sampling, as:
Figure FDA0004025461910000047
8. the data-driven microgrid voltage control method according to claim 1, wherein the process of adding a network solution failure penalty to increase a network solution success rate is as follows:
adding a network solution failure penalty function F:
F=-f,t f <T max
in the formula, f is a penalty occurrence constant and is a large positive number; t is t f The duration of successful network solving in the training set with interruption due to failure of network solving; t is a unit of max Maximum time step for each training set;
the reward for each training set interrupted by the occurrence of network solution failure is adjusted to R mf
R mf =R m +F
In the formula, R m The normal prize value obtained before the training set break.
9. An apparatus, comprising:
one or more processors;
a memory for storing one or more programs;
when executed by one or more of the processors, cause the one or more processors to implement a data-driven microgrid voltage control method of any of claims 1-8.
10. A storage medium containing computer executable instructions for performing a data driven microgrid voltage control method according to any one of claims 1 to 8 when executed by a computer processor.
CN202211708121.2A 2022-12-29 2022-12-29 Data-driven microgrid voltage control method Pending CN115986748A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211708121.2A CN115986748A (en) 2022-12-29 2022-12-29 Data-driven microgrid voltage control method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211708121.2A CN115986748A (en) 2022-12-29 2022-12-29 Data-driven microgrid voltage control method

Publications (1)

Publication Number Publication Date
CN115986748A true CN115986748A (en) 2023-04-18

Family

ID=85973701

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211708121.2A Pending CN115986748A (en) 2022-12-29 2022-12-29 Data-driven microgrid voltage control method

Country Status (1)

Country Link
CN (1) CN115986748A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116581766A (en) * 2023-07-11 2023-08-11 南京理工大学 Virtual power plant strengthening online voltage control method considering sagging characteristic

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116581766A (en) * 2023-07-11 2023-08-11 南京理工大学 Virtual power plant strengthening online voltage control method considering sagging characteristic
CN116581766B (en) * 2023-07-11 2023-11-28 南京理工大学 Virtual power plant strengthening online voltage control method considering sagging characteristic

Similar Documents

Publication Publication Date Title
Bahramipanah et al. Decentralized voltage control of clustered active distribution network by means of energy storage systems
Venkatesan et al. Optimal power flow control of hybrid renewable energy system with energy storage: A WOANN strategy
CN114725936A (en) Power distribution network optimization method based on multi-agent deep reinforcement learning
CN112651112B (en) Collaborative decision-making method, system and equipment for electric energy transaction and system operation of internet micro-grid
Ameur et al. Intelligent optimization and management system for renewable energy systems using multi-agent
CN115986748A (en) Data-driven microgrid voltage control method
Nguyen et al. Three-stage inverter-based peak shaving and Volt-VAR control in active distribution networks using online safe deep reinforcement learning
CN114142532A (en) Method and system for coordinated control of distributed photovoltaic participation source network load storage
Xiang et al. Deep reinforcement learning based topology-aware voltage regulation of distribution networks with distributed energy storage
Hossain et al. Deep reinforcement learning-based volt-var optimization in distribution grids with inverter-based resources
Ye et al. A hybrid charging management strategy for solving the under-voltage problem caused by large-scale EV fast charging
Taghizadeh et al. Deep reinforcement learning-aided bidding strategies for transactive energy market
Huang et al. Distributed real-time economic dispatch for islanded microgrids with dynamic power demand
CN113780622A (en) Multi-micro-grid power distribution system distributed scheduling method based on multi-agent reinforcement learning
Bonthu et al. Energy cost optimization in microgrids using model predictive control and mixed integer linear programming
Ebell et al. Coordinated multi-agent reinforcement learning for swarm battery control
Pinthurat et al. Unbalanced powers compensation by distributed batteries in microgrid: A data-driven approach
CN114884049B (en) Optimized operation control method for flexible direct-current power distribution network
Han et al. Hybrid control scheme for distributed energy resource management in a market context
Guo et al. Impact of power system partitioning on the efficiency of distributed multi-step optimization
Dehghanpour et al. Intelligent microgrid power management using the concept of Nash bargaining solution
Seervi et al. Renewable Energy Forecasting for Energy Storage Sizing: A Review
Huo et al. Data-driven predictive voltage control for distributed energy storage in active distribution networks
Kuppannagari et al. Risk aware net load balancing in micro grids with high DER penetration
CN116581766B (en) Virtual power plant strengthening online voltage control method considering sagging characteristic

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination