CN112467686A - Power distribution network fault recovery method and device, control equipment and storage medium - Google Patents
Power distribution network fault recovery method and device, control equipment and storage medium Download PDFInfo
- Publication number
- CN112467686A CN112467686A CN202011214337.4A CN202011214337A CN112467686A CN 112467686 A CN112467686 A CN 112467686A CN 202011214337 A CN202011214337 A CN 202011214337A CN 112467686 A CN112467686 A CN 112467686A
- Authority
- CN
- China
- Prior art keywords
- distribution network
- power distribution
- control action
- action
- control
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000011084 recovery Methods 0.000 title claims abstract description 80
- 238000000034 method Methods 0.000 title claims abstract description 57
- 230000009471 action Effects 0.000 claims abstract description 234
- 238000013024 troubleshooting Methods 0.000 claims description 44
- 230000008030 elimination Effects 0.000 claims description 23
- 238000003379 elimination reaction Methods 0.000 claims description 23
- 238000004590 computer program Methods 0.000 claims description 22
- 238000012544 monitoring process Methods 0.000 claims description 7
- 238000004364 calculation method Methods 0.000 abstract description 21
- 230000000694 effects Effects 0.000 abstract description 9
- 230000008569 process Effects 0.000 abstract description 9
- 230000006870 function Effects 0.000 description 27
- 238000013528 artificial neural network Methods 0.000 description 8
- 238000012545 processing Methods 0.000 description 8
- 238000012549 training Methods 0.000 description 7
- 230000007613 environmental effect Effects 0.000 description 6
- 238000005457 optimization Methods 0.000 description 6
- 238000010586 diagram Methods 0.000 description 5
- 238000010606 normalization Methods 0.000 description 5
- 230000005540 biological transmission Effects 0.000 description 3
- 230000005611 electricity Effects 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 3
- 238000012806 monitoring device Methods 0.000 description 3
- 238000004422 calculation algorithm Methods 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 238000005286 illumination Methods 0.000 description 2
- 230000001360 synchronised effect Effects 0.000 description 2
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 238000011478 gradient descent method Methods 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 230000007257 malfunction Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010248 power generation Methods 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H02—GENERATION; CONVERSION OR DISTRIBUTION OF ELECTRIC POWER
- H02H—EMERGENCY PROTECTIVE CIRCUIT ARRANGEMENTS
- H02H3/00—Emergency protective circuit arrangements for automatic disconnection directly responsive to an undesired change from normal electric working condition with or without subsequent reconnection ; integrated protection
- H02H3/02—Details
- H02H3/06—Details with automatic reconnection
- H02H3/063—Details concerning the co-operation of many similar arrangements, e.g. in a network
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y04—INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
- Y04S—SYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
- Y04S10/00—Systems supporting electrical power generation, transmission or distribution
- Y04S10/50—Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications
- Y04S10/52—Outage or fault management, e.g. fault detection or location
Landscapes
- Supply And Distribution Of Alternating Current (AREA)
Abstract
The power distribution network fault recovery method comprises the steps of inquiring a historical self-healing action database according to a current observable state to obtain a control action set, determining to obtain an optimal control action according to fault removal overhead after control actions are executed each time, and finally performing fault recovery on the power distribution network based on the optimal control action. In the whole fault recovery process, a model does not need to be established, a large amount of calculation is not needed, only the historical self-healing action database is required to be inquired, and after the fault recovery is simulated, an optimal control action is selected according to a plurality of fault removal expenses to carry out fault recovery on the power distribution network. The method solves the technical problems that in the prior art, the calculation amount is large and the complexity is high in the solving process, and an optimal solution is difficult to find, and achieves the technical effect of greatly reducing the calculation amount and the calculation complexity on the premise of ensuring that the optimal solution can be found.
Description
Technical Field
The application relates to the technical field of power grid safety, in particular to a power distribution network fault recovery method, a power distribution network fault recovery device, control equipment and a storage medium.
Background
With the proposal of the concept of the power distribution network, the key point of power grid safety attention extends from small-range and frequent faults to large-range and catastrophic events, the recovery control gradually covers the relatively fragile power distribution network from the power transmission network, and a new idea for dealing with extreme natural disasters is provided from the aspects of defense and rapid recovery after disasters for the recovery control of the power distribution network.
At present, a method for rapidly recovering a power distribution network after a disaster is to respectively establish an adaptive multi-stage fault recovery model aiming at different fault stages of the power distribution network under an extreme disaster, and perform optimization solution by using a traditional optimization planning method (such as a commercial optimization solver) to obtain an optimal recovery control scheme. However, the traditional optimization planning method only considers very limited typical working conditions and scenes, and under extreme external environments or in the actual fault recovery process, due to the complex environment, various fault types and serious action linkage effect, the calculation amount and complexity in the solving process are large, and therefore an optimal solution is difficult to find.
Disclosure of Invention
In view of the above, it is necessary to provide a power distribution network fault recovery method, apparatus, control device and storage medium.
In a first aspect, a method for recovering a fault of a power distribution network is provided, and the method includes:
under the condition that the power distribution network fails, acquiring the current observable state of the power distribution network, wherein the observable state comprises failure state parameters and environmental parameters causing the power distribution network to fail;
inquiring a historical self-healing action database according to the current observable state to obtain a control action set corresponding to the current observable state, wherein a plurality of corresponding relations between the observable state and the control action are stored in the historical self-healing action database;
sequentially simulating and executing the control actions in the control action set, and acquiring the troubleshooting overhead corresponding to the executed control actions after each execution;
determining an optimal control action from the control action set according to the obtained multiple troubleshooting overheads;
and performing fault recovery on the power distribution network based on the optimal control action.
In an optional embodiment of the present application, after each execution, obtaining a troubleshooting overhead corresponding to the executed control action includes: after each execution, determining an execution expense corresponding to the executed control action and a compensation value of the flexible load in the power distribution network to the execution expense; and after each execution, determining the failure elimination overhead according to the execution overhead and the compensation value corresponding to the executed control action.
In an optional embodiment of the present application, determining the troubleshooting cost according to the execution cost corresponding to the executed control action and the compensation value includes: and determining a first difference value between the execution overhead and the compensation value to obtain the troubleshooting overhead.
In an optional embodiment of the present application, determining an optimal control action from the set of control actions according to the obtained plurality of troubleshooting costs includes: respectively calculating second difference values between the plurality of fault elimination overheads and the preset fault elimination overhead to obtain a plurality of second difference values; and determining the corresponding control action of the minimum value in the plurality of second difference values in the control action set as the optimal control action.
In an optional embodiment of the present application, the observable conditions include distributed power, flexible load power and line damage conditions in the power distribution network.
In an optional embodiment of the present application, further comprising: and carrying out normalization processing on the distributed power supply power, the flexible load power and the line damage state to obtain an observable state.
In an optional embodiment of the present application, the set of control actions comprises an action state of a tie switch in the power distribution network.
In a second aspect, a power distribution network fault recovery device is provided, the device comprising: the system comprises a state acquisition module, a control action determination module, an overhead determination module, an optimal action determination module and a fault recovery module.
The state acquisition module is used for acquiring the current observable state of the power distribution network under the condition that the power distribution network fails, wherein the observable state comprises failure state parameters and environment parameters causing the power distribution network to fail;
the control action determining module is used for inquiring a historical self-healing action database according to the current observable state to obtain a control action set corresponding to the current observable state, and a plurality of corresponding relations between the observable state and the control action are stored in the historical self-healing action database;
the overhead determining module is used for sequentially simulating and executing the control actions in the control action set and acquiring the fault elimination overhead corresponding to the executed control actions after each execution;
the optimal action determining module is used for determining an optimal control action from the control action set according to the obtained multiple troubleshooting expenses;
the fault recovery module is used for carrying out fault recovery on the power distribution network based on the optimal control action.
In an optional embodiment of the present application, the overhead determination module comprises: a first overhead module and a second overhead module. The first overhead module is used for determining execution overhead corresponding to the executed control action and a compensation value of the flexible load in the power distribution network to the execution overhead after each execution; the second overhead module is used for determining the fault elimination overhead according to the execution overhead and the compensation value corresponding to the executed control action after each execution.
In an optional embodiment of the present application, the second overhead module is specifically configured to determine a first difference between the execution overhead and the compensation value, so as to obtain the troubleshooting overhead.
In an optional embodiment of the present application, the optimal action determination module comprises: a first action determination submodule and a second action determination submodule. The first action determining submodule is used for respectively calculating second differences between the plurality of fault elimination overheads and the preset fault elimination overhead to obtain a plurality of second differences; the second action determining submodule is used for determining the corresponding control action of the minimum value in the second difference values in the control action set as the optimal control action.
In an optional embodiment of the present application, the observable conditions include distributed power, flexible load power and line damage conditions in the power distribution network.
In an optional embodiment of the present application, the system further includes a processing module, where the processing module is configured to perform normalization processing on the distributed power source power, the flexible load power, and the line damage state, so as to obtain an observable state.
In an optional embodiment of the present application, the set of control actions comprises an action state of a tie switch in the power distribution network.
In a third aspect, a control device is provided, comprising a memory storing a computer program and a processor implementing the steps of the above method when executing the computer program.
In a fourth aspect, a computer-readable storage medium is provided, on which a computer program is stored, which computer program, when being executed by a processor, realizes the steps of the above method.
The embodiment of the application provides a power distribution network fault recovery method, which is characterized in that under the condition that a power distribution network has a fault, an observable state of the power distribution network is obtained in real time, and a historical self-healing action database is inquired according to the current observable state to obtain a control action set. And finally, performing fault recovery on the power distribution network based on the optimal control action. In the whole fault recovery process, a model does not need to be established, a large amount of calculation is not needed, only the historical self-healing action database is required to be inquired, and after the fault recovery is simulated, an optimal control action is selected according to a plurality of fault removal expenses to carry out fault recovery on the power distribution network. The power distribution network fault recovery method provided by the embodiment of the application solves the technical problems that in the prior art, the calculation amount is large in the solving process, the complexity is high, and an optimal solution is difficult to find, and achieves the technical effect of greatly reducing the calculation amount and the calculation complexity on the premise of ensuring that the optimal solution can be found.
Drawings
FIG. 1 is a diagram of an exemplary implementation of a method for fault recovery in a power distribution network;
FIG. 2 is a schematic flow chart of a method for recovering from a fault in a power distribution network according to an embodiment;
FIG. 3 is a schematic flow chart of a method for recovering from a fault in a power distribution network according to an embodiment;
FIG. 4 is a schematic flow chart of a method for recovering from a fault in a power distribution network according to an embodiment;
FIG. 5 is a schematic flow chart of a method for recovering from a fault in a power distribution network according to an embodiment;
FIG. 6 is a schematic flow chart of a method for recovering from a fault in a power distribution network according to an embodiment;
FIG. 7 is a block diagram of a power distribution network fault recovery device in one embodiment;
fig. 8 is a block diagram showing the configuration of a control device in one embodiment.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
At present, for the post-disaster recovery of the power distribution network, the recovery fault is generally controlled by adopting a traditional optimization planning method such as a commercial optimization solution method. However, in extreme disasters, the fault derivation may pass through different stages, and different fault recovery models need to be continuously established or updated to adapt to fault recovery in different stages. The fault recovery model is complex to establish and takes long time, so that the current fault recovery model only carries out modeling aiming at very limited typical working conditions and scenes, but under an extreme external environment or in the actual fault recovery process, due to the complex environment, various fault types and serious action linkage effect, the calculation amount and the complexity in the solving process are large, and an optimal solution is difficult to find.
In view of this, the embodiment of the present application provides a power distribution network fault recovery method, which acquires an observable state of a power distribution network in real time when the power distribution network fails, and queries a historical self-healing action database according to the current observable state to obtain a control action set. And finally, performing fault recovery on the power distribution network based on the optimal control action. In the whole fault recovery process, a model does not need to be established, a large amount of calculation is not needed, only the historical self-healing action database is required to be inquired, and after the fault recovery is simulated, an optimal control action is selected according to a plurality of fault removal expenses to carry out fault recovery on the power distribution network. The power distribution network fault recovery method provided by the embodiment of the application solves the technical problems that in the prior art, the calculation amount is large in the solving process, the complexity is high, and an optimal solution is difficult to find, and achieves the technical effect of greatly reducing the calculation amount and the calculation complexity on the premise of ensuring that the optimal solution can be found.
In the following, a brief description will be given of an implementation environment related to the power distribution network fault recovery method provided by the embodiment of the present application.
Referring to fig. 1, the method for recovering a fault of a power distribution network provided in the embodiment of the present application is applied to the power distribution network, where the power distribution network includes a power distribution network body, a monitoring device, a control device, and a control terminal. The monitoring equipment is arranged at different positions of the power grid body and used for detecting running signals and environment signals of different nodes or equipment in the power grid body, the control equipment is in signal connection with different electric equipment in the power grid body and used for controlling the work of the different electric equipment, the control terminal is in signal connection with the monitoring equipment and the control equipment respectively and used for sending control instructions to the control equipment to control the action of the control equipment according to the running signals and the environment signals collected by the monitoring equipment and sending control instructions to the control equipment according to the running signals and the environment signals. Wherein, supervisory equipment can include voltage acquisition equipment, current acquisition equipment, resistance appearance, temperature acquisition equipment, humidity measurement appearance, meteorological instrument etc. and this application embodiment does not do specifically and restricts.
Referring to fig. 2, an embodiment of the present application provides a method for recovering a fault of a power distribution network, which may be applied to the power distribution network, and the following embodiment is applied to the control device in fig. 1, and is used to specifically describe the fault recovery of the power distribution network, including the following steps 201 to 205:
The monitoring equipment is used for acquiring parameters such as voltage, current, resistance, on-off of a switch and the like of the power distribution network in real time, so that the fault state parameters are obtained. The monitoring equipment is used for acquiring the temperature, the humidity, the illumination and the like in the power distribution network in real time, so that the environmental parameters which cause the power distribution network to break down are obtained. It should be noted that the fault state parameters are not limited to voltage, current, resistance, and switch on/off parameters, but may include any other parameters that can characterize the fault state of the power distribution network. Similarly, the environmental parameters are not limited to temperature, humidity and illumination parameters, but may also include any other parameters that can represent the environment causing the power distribution network to malfunction.
The historical self-healing action database is prestored in the control terminal and can also be stored in other storage devices of the power distribution network, the control terminal can access the historical self-healing action database in real time, and when the control terminal accesses the historical self-healing action database, all corresponding relations can be inquired. The historical self-healing action database comprises a plurality of corresponding relations, and one or more control actions corresponding to one observable state in each corresponding relation. Firstly, the control terminal carries out information matching in the historical self-healing action database, finds the observable state which is the same as the current observable state, and then determines the control action historically executed in the observable state according to the corresponding relation. In the historical failure recovery, a plurality of different control actions may occur in the same observable state, and the plurality of different control actions constitute the control action set.
And step 203, the control terminal sequentially simulates and executes the control actions in the control action set, and acquires the troubleshooting overhead corresponding to the executed control actions after each execution.
The control terminal stores a power grid fault model in advance, the control terminal inputs the control actions in the control action set to the power grid fault model in sequence, the control actions act in the power grid fault model, and the state of the power grid fault model changes. At this time, the control terminal determines the current troubleshooting overhead according to the current power grid fault model recovery situation and other factors such as the electric quantity expenditure spent in fault recovery, wherein the troubleshooting overhead refers to the electric quantity cost or other expenditures required by the power grid fault needing to be cleared. For example, when the troubleshooting overhead is a cost of electricity, it may be determined by determining the amount of electricity needed to perform the control action when the failure occurs.
And step 204, the control terminal determines the optimal control action from the control action set according to the obtained plurality of troubleshooting overheads.
The control terminal obtains a fault removal expense each time the control action is executed by the power grid fault model, and obtains a plurality of fault removal expenses after a plurality of control actions are executed respectively. And the control terminal determines a minimum cost according to the plurality of fault elimination costs, then inquires a control action corresponding to the minimum cost in the control action set, and determines that the control action is the optimal control action.
And step 205, the control terminal performs fault recovery on the power distribution network based on the optimal control brake.
After the optimal control action is determined, the control terminal controls and adjusts the power distribution network body through the control equipment, for example, the operation state of the power distribution network body is adjusted through adjusting certain node switches in the control equipment or restarting certain controllers, so that the fault of the power distribution network is recovered to be normal, the fault in the power distribution network is eliminated, and the purpose of recovering the fault of the power distribution network is achieved.
The embodiment of the application provides a power distribution network fault recovery method, which is characterized in that under the condition that a power distribution network has a fault, an observable state of the power distribution network is obtained in real time, and a historical self-healing action database is inquired according to the current observable state to obtain a control action set. And finally, performing fault recovery on the power distribution network based on the optimal control action. In the whole fault recovery process, a model does not need to be established, a large amount of calculation is not needed, only the historical self-healing action database is required to be inquired, and after the fault recovery is simulated, an optimal control action is selected according to a plurality of fault removal expenses to carry out fault recovery on the power distribution network. The power distribution network fault recovery method provided by the embodiment of the application solves the technical problems that in the prior art, the calculation amount is large in the solving process, the complexity is high, and an optimal solution is difficult to find, and achieves the technical effect of greatly reducing the calculation amount and the calculation complexity on the premise of ensuring that the optimal solution can be found.
In an optional embodiment of the present application, the observable conditions include distributed power, flexible load power and line damage conditions in the power distribution network.
In a first aspect, the control terminal collects the current and the voltage of the distributed power source and the flexible load in the power distribution network in real time through the current collecting device and the voltage collecting device in the monitoring device, and then calculates the power of the distributed power source and the power of the flexible load respectively through a power calculation formula. Flexible loads refer to loads that can be interrupted in a power distribution network. Alternatively, the observable state may be determined by the following equation:
wherein, in the formula (1), StRepresenting the observable state of the distribution network at time t,representing the output power, P, of the ith distributed power supply at time tt jRepresenting the power of the compliant load at time t,indicating the damaged status of the ith line at time t, NDGRepresenting a collection of distributed power sources, NLRepresenting a set of flexible loads, NfRepresenting a collection of vulnerable lines.
In a second aspect, the control terminal detects whether a line in the power distribution network is damaged through an infrared detector and the like in the monitoring device, and under another possible condition, the current data and the voltage data acquired by the control terminal in real time through the current acquisition device and the voltage acquisition device are compared with a preset normal working current range and a preset normal working voltage range to determine whether the current line is damaged. Wherein, the damaged state includes two states of damaged and not damaged.
In an optional embodiment of the present application, the set of control actions comprises an action state of a tie switch in the power distribution network.
The interconnection switch is an important control device in the power distribution network and controls whether different power transmission lines or electrical devices work or not and electrical connection among the different power transmission lines or the electrical devices. Simultaneously, the contact switch is comparatively fragile in the distribution network, an easy impaired node, consequently carries out fault recovery with this contact switch as the control point to this distribution network, and the coverage is wider, and recovery effect is better. The state of the tie switch is only closed and open, and can be determined, for example, by the following action function:
wherein, atWhich is indicative of a control action to be taken,at the moment of t +1, the state variable of the mth interconnection switch only has two values of 0 or 1, the value of 0 represents the connection, the value of 1 represents the disconnection, and N represents the disconnectionconThe branch set of the interconnection switch in the power distribution network.
In an optional embodiment of the present application, further comprising: and the control terminal performs normalization processing on the distributed power supply power, the flexible load power and the line damage state to obtain an observable state.
In a first aspect, taking the power of the flexible load as an example, the power can be determined by the following formula:
wherein the power P of the flexible loadt jThe maximum value and the minimum value of the self-healing action can be obtained from a historical self-healing action database or obtained by inquiring according to historical power grid data records.
In the second aspect, in the case of a line damage state, if the line is broken due to a fault at time t, the line is brokenAnd taking the value as 1, otherwise, taking the value as 0 by default. Therefore, the values of all the states after normalization are all [0,1]]In between, reduce the calculated amount greatly, improve the efficiency of distribution network fault recovery of this application embodiment.
Referring to fig. 3, in an alternative embodiment of the present application, step 203 includes steps 301-302:
After the control terminal executes the control action on the power grid fault model each time, the control terminal calculates and determines the execution overhead in the process of executing the control action and the compensation value of the flexible load in the power distribution network. The execution overhead refers to the amount of power consumed by the distribution grid when performing control actions on the grid fault model. The flexible load refers to a load which can be interrupted in the power distribution network, namely the flexible load can be subjected to power failure treatment in the actual fault recovery process, and the power consumption is not needed. In this embodiment, the simulation result of the power grid fault model is combined with actual factors to determine the fault elimination overhead, so as to improve the reliability of the power distribution network fault recovery method in the embodiment of the present application when fault recovery is actually performed.
The control terminal may determine a first difference between the execution overhead and the compensation value to obtain a troubleshooting overhead. The troubleshooting overhead may be determined by the following equation:
wherein r ist+1A failure-recovery overhead is indicated and,for the cost per unit output power of the distributed power supply,representing the output power of the ith distributed power supply at time t,cost per unit power for the jth flexible load, Pt jRepresenting the power of the flexible load at time t, NDGRepresenting a collection of distributed power sources, NILIs the set of flexible loads in the system.
Optionally, in the embodiment of the present application, the troubleshooting overhead may also be calculated by using an electricity generation ratio method based on the flexible load power failure loss characteristic:
wherein λ iskThe power failure loss of unit electric quantity when power failure occurs to the k flexible load of the distribution network node, L represents the power generation ratio, betakAnd the weight of the flexible load of the node k of the power distribution network.
Referring to fig. 4, in an alternative embodiment of the present application, step 204 includes steps 401-402:
The preset troubleshooting overhead is stored in the control terminal in advance, and when the control terminal obtains one troubleshooting overhead each time, the troubleshooting overhead is compared with the preset troubleshooting overhead stored in the control terminal, and a difference is calculated to obtain a second difference. After the control terminal carries out comparison for a plurality of times, a plurality of second difference values can be obtained. The preset troubleshooting overhead may be determined by historical experience or comprehensive consideration of various factors, and the embodiment is not particularly limited.
The control terminal calculates a plurality of second difference values, the second difference values are used for representing the difference value between the corresponding fault elimination overhead and the preset fault elimination overhead when each control action is executed, the smaller the second difference value is, the closer the fault elimination overhead is to the preset fault elimination overhead is, especially when the second difference value is a negative value, the fault elimination overhead when the fault recovery is actually carried out is far lower than the preset fault elimination overhead, and the control action is more optimal.
Referring to fig. 5, in an alternative embodiment of the present application, the control terminal may further perform network training through DDQN algorithm to obtain the above-mentioned optimal control action:
The control terminal respectively uses two neural networks to fit a state estimation function network of the observable state to fit a fault state estimation function V(s) in the observable statet) And a merit estimation function A(s) for each control action in the current set of control actionst,at) Wherein the fault state evaluation function V(s)t) Is a function of the fault state parameters of the power distribution network, and is a dominance estimation function A(s)t,at) Is about the above-mentioned ring causing the failure of the distribution networkFunction of environmental parameters, estimating function V(s) based on the fault statet) And the merit estimation function A(s)t,at) Obtaining an action valuation function:
wherein, Q (S)t,at) The action valuation function, A represents the set of control actions, | A | represents the number of control actions in the set of control actions, V(s)t) An evaluation function representing the fault state, A(s)t,at) Representing the merit estimation function.
In this embodiment, since only one optimal control action can be selected in each observable state, only one Q value can be obtained, and the optimal control action cannot be decomposed into a unique state estimation function V value and an action dominance function a value, the action dominance function is set as an individual action dominance function minus an average value of all action dominance functions in the current state, so as to remove redundant degrees of freedom and improve algorithm stability.
Optionally, the control terminal obtains a target value of the optimal action estimation function by using a bellman equation:
updating the action valuation function according to the current observable state, wherein the specific formula is as follows:
wherein, in the above formulas (7) to (8), the discount factor λ ∈ [0,1], and the learning rate 0< α ≦ 1.
Referring to fig. 6, in an alternative embodiment of the present application, step 502 includes steps 601-603:
601, the control terminal introduces an epsilon-greedy strategy to select the control actions in the control action set:
wherein, the fixed constant in the strategy with epsilon-greedy is a random number, T is the total training times, and T is the current training times.
To eliminate timing dependencies between samples in the short term, memory replay is employed to store a historical self-healing action database. Establishing an experience pool with the capacity of N, storing corresponding relation samples of the observable state and the control action of the power distribution network into the experience pool in each training period, randomly extracting a small batch of samples from the experience pool when the number of the samples exceeds the playback starting capacity of M, carrying out artificial neural network training, and training a neural network by randomly extracting the samples to avoid the occurrence of phenomena such as overfitting. If the number of samples exceeds the maximum capacity of the experience pool, the earliest sample is removed and stored in a new sample, and the latest observation state of the neural network learning is guaranteed.
The DDQN performs forward calculation to obtain Q values of all control actions, in the embodiment, the loss function refers to the mean square error between the target Q value and the predicted Q value output by the neural network, and the neural network loss function is determined by formulas (7) - (9), wherein the formula is
Training a neural network by using a small batch gradient descent method, then acquiring a real-time observable state of the power distribution network by using monitoring equipment, and selecting an estimated value maximum action a by using the trained neural networkkI.e. byTo control the optimal strategy, i.e. the optimal control action described above.
It should be understood that, although the steps in the flowchart are shown in order as indicated by the arrows, the steps are not necessarily performed in order as indicated by the arrows. The steps are not performed in the exact order shown and described, and may be performed in other orders, unless explicitly stated otherwise. Moreover, at least some of the steps in the figures may include multiple steps or multiple stages, which are not necessarily performed at the same time, but may be performed at different times, and the order of execution of the steps or stages is not necessarily sequential, but may be performed alternately or in alternation with other steps or at least some of the other steps or stages.
Referring to fig. 7, an embodiment of the present application provides a power distribution network fault recovery apparatus 10, which includes: a state acquisition module 100, a control action determination module 200, an overhead determination module 300, an optimal action determination module 400, and a failure recovery module 500.
The state acquisition module 100 is configured to acquire a current observable state of the power distribution network when the power distribution network fails, where the observable state includes a fault state parameter and an environmental parameter that causes the power distribution network to fail;
the control action determining module 200 is configured to query a historical self-healing action database according to the current observable state to obtain a control action set corresponding to the current observable state, where multiple correspondence relationships between observable states and control actions are stored in the historical self-healing action database;
the overhead determining module 300 is configured to sequentially simulate and execute the control actions in the control action set, and obtain a troubleshooting overhead corresponding to the executed control action after each execution;
the optimal action determining module 400 is configured to determine an optimal control action from the control action set according to the obtained multiple troubleshooting costs;
the fault recovery module 500 is configured to perform fault recovery on the power distribution network based on the optimal control action.
In an optional embodiment of the present application, the overhead determining module 300 includes: a first overhead module and a second overhead module.
The first overhead module is used for determining execution overhead corresponding to the executed control action and a compensation value of the flexible load in the power distribution network to the execution overhead after each execution;
the second overhead module is used for determining the troubleshooting overhead according to the execution overhead corresponding to the executed control action and the compensation value after each execution.
In an optional embodiment of the present application, the second overhead module is specifically configured to determine a first difference between the execution overhead and the compensation value, so as to obtain the troubleshooting overhead.
In an optional embodiment of the present application, the optimal action determination module 400 includes: a first action determination submodule and a second action determination submodule.
The first action determining submodule is used for respectively calculating a plurality of second differences between the fault elimination overhead and the preset fault elimination overhead to obtain a plurality of second differences;
the second action determining submodule is configured to determine that the control action corresponding to the minimum value in the plurality of second difference values in the control action set is the optimal control action.
In an optional embodiment of the present application, the observable conditions include distributed power, flexible load power and line damage conditions in the power distribution network.
In an optional embodiment of the present application, the system further includes a processing module, where the processing module is configured to perform normalization processing on the distributed power source power, the flexible load power, and the line damage state, so as to obtain the observable state.
In an optional embodiment of the application, the set of control actions comprises an action state of a tie switch in the power distribution network.
For specific limitations of the power distribution network fault recovery device 10, reference may be made to the above limitations of the power distribution network fault recovery method, which are not described herein again. The various modules in the power distribution network fault recovery apparatus 10 described above may be implemented in whole or in part by software, hardware, and combinations thereof. The modules can be embedded in a hardware form or independent from a processor in the control device, and can also be stored in a memory in the control device in a software form, so that the processor can call and execute operations corresponding to the modules.
Fig. 8 is a schematic diagram of an internal structure of a control device in an embodiment of the present application, where the control device may be a server. As shown in fig. 8, the control device includes a processor, a memory, and a communication component connected by a system bus. Wherein the processor is used for providing calculation and control capability and supporting the operation of the whole control device. The memory may include a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The computer program can be executed by a processor for implementing a power distribution network fault recovery method provided by the above embodiments. The internal memory provides a cached execution environment for the operating system and computer programs in the non-volatile storage medium. The control device may communicate with other control devices (e.g., STAs) through the communication component.
It will be appreciated by those skilled in the art that the configuration shown in fig. 8 is a block diagram of only a portion of the configuration associated with the present application and does not constitute a limitation on the control device to which the present application is applied, and a particular control device may include more or less components than those shown in the figures, or combine certain components, or have a different arrangement of components.
In one embodiment, there is provided a control apparatus including: the system comprises a memory and a processor, wherein the memory stores a computer program, and the processor executes the computer program to realize the following steps:
under the condition that a power distribution network fails, acquiring the current observable state of the power distribution network, wherein the observable state comprises failure state parameters and environment parameters causing the power distribution network to fail;
inquiring a historical self-healing action database according to the current observable state to obtain a control action set corresponding to the current observable state, wherein the historical self-healing action database stores a plurality of corresponding relations between observable states and control actions;
sequentially simulating and executing the control actions in the control action set, and acquiring the troubleshooting overhead corresponding to the executed control actions after each execution;
determining an optimal control action from the control action set according to the obtained plurality of troubleshooting costs;
and performing fault recovery on the power distribution network based on the optimal control brake.
In one embodiment of the application, the processor when executing the computer program further performs the steps of: after each execution, determining an execution overhead corresponding to the executed control action and a compensation value of a flexible load in the power distribution network to the execution overhead; and after each execution, determining the troubleshooting cost according to the execution cost corresponding to the executed control action and the compensation value.
In one embodiment of the application, the processor when executing the computer program further performs the steps of: and determining a first difference value between the execution overhead and the compensation value to obtain the troubleshooting overhead.
In one embodiment of the application, the processor when executing the computer program further performs the steps of: respectively calculating second difference values between the plurality of fault elimination overheads and the preset fault elimination overhead to obtain a plurality of second difference values; and determining the control action corresponding to the minimum value in the plurality of second difference values in the control action set as the optimal control action.
In one embodiment of the present application, the observable conditions include distributed power, flexible load power, and line damage conditions in the power distribution network.
In one embodiment of the application, the processor when executing the computer program further performs the steps of: and normalizing the distributed power supply power, the flexible load power and the line damage state to obtain the observable state.
In one embodiment of the application, the set of control actions includes an action state of a tie switch in the power distribution network.
The implementation principle and technical effect of the control device provided in the embodiment of the present application are similar to those of the method embodiment described above, and are not described herein again.
In one embodiment, a computer-readable storage medium is provided, having a computer program stored thereon, which when executed by a processor, performs the steps of:
under the condition that a power distribution network fails, acquiring the current observable state of the power distribution network, wherein the observable state comprises failure state parameters and environment parameters causing the power distribution network to fail;
inquiring a historical self-healing action database according to the current observable state to obtain a control action set corresponding to the current observable state, wherein the historical self-healing action database stores a plurality of corresponding relations between observable states and control actions;
sequentially simulating and executing the control actions in the control action set, and acquiring the troubleshooting overhead corresponding to the executed control actions after each execution;
determining an optimal control action from the control action set according to the obtained plurality of troubleshooting costs;
and performing fault recovery on the power distribution network based on the optimal control brake.
In one embodiment of the application, the computer program when executed by the processor further performs the steps of: after each execution, determining an execution overhead corresponding to the executed control action and a compensation value of a flexible load in the power distribution network to the execution overhead; and after each execution, determining the troubleshooting cost according to the execution cost corresponding to the executed control action and the compensation value.
In one embodiment of the application, the computer program when executed by the processor further performs the steps of: and determining a first difference value between the execution overhead and the compensation value to obtain the troubleshooting overhead.
In one embodiment of the application, the computer program when executed by the processor further performs the steps of: respectively calculating second difference values between the plurality of fault elimination overheads and the preset fault elimination overhead to obtain a plurality of second difference values; and determining the control action corresponding to the minimum value in the plurality of second difference values in the control action set as the optimal control action.
In one embodiment of the present application, the observable conditions include distributed power, flexible load power, and line damage conditions in the power distribution network.
In one embodiment of the application, the computer program when executed by the processor further performs the steps of: and normalizing the distributed power supply power, the flexible load power and the line damage state to obtain the observable state.
In one embodiment of the application, the set of control actions includes an action state of a tie switch in the power distribution network.
The implementation principle and technical effect of the computer-readable storage medium provided by this embodiment are similar to those of the above-described method embodiment, and are not described herein again.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by hardware instructions of a computer program, which can be stored in a non-volatile computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory, among others. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in M forms, such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), synchronous Link (SyMchliMk) DRAM (SLDRAM), RaMbus (RaMbus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
The technical features of the above embodiments can be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the above embodiments are not described, but should be considered as the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present patent shall be subject to the appended claims.
Claims (10)
1. A method for fault recovery of a power distribution network, the method comprising:
under the condition that a power distribution network fails, acquiring the current observable state of the power distribution network, wherein the observable state comprises failure state parameters and environment parameters causing the power distribution network to fail;
inquiring a historical self-healing action database according to the current observable state to obtain a control action set corresponding to the current observable state, wherein the historical self-healing action database stores a plurality of corresponding relations between observable states and control actions;
sequentially simulating and executing the control actions in the control action set, and acquiring the troubleshooting overhead corresponding to the executed control actions after each execution;
determining an optimal control action from the control action set according to the obtained plurality of troubleshooting costs;
and performing fault recovery on the power distribution network based on the optimal control brake.
2. The method for recovering the power distribution network fault according to claim 1, wherein the obtaining of the fault-removal overhead corresponding to the executed control action after each execution comprises:
after each execution, determining an execution overhead corresponding to the executed control action and a compensation value of a flexible load in the power distribution network to the execution overhead;
and after each execution, determining the troubleshooting cost according to the execution cost corresponding to the executed control action and the compensation value.
3. The method for recovering from the power distribution network fault according to claim 2, wherein the determining the troubleshooting cost according to the execution cost corresponding to the executed control action and the compensation value comprises:
and determining a first difference value between the execution overhead and the compensation value to obtain the troubleshooting overhead.
4. The method according to claim 1, wherein the determining an optimal control action from the set of control actions according to the obtained plurality of troubleshooting costs comprises:
respectively calculating second difference values between the plurality of fault elimination overheads and the preset fault elimination overhead to obtain a plurality of second difference values;
and determining the control action corresponding to the minimum value in the plurality of second difference values in the control action set as the optimal control action.
5. The power distribution network fault recovery method of claim 1, wherein the observable states include distributed power, flexible load power, and line damage status in the power distribution network.
6. The method for fault recovery in a power distribution network of claim 5, further comprising:
and normalizing the distributed power supply power, the flexible load power and the line damage state to obtain the observable state.
7. The method of power distribution network fault recovery of claim 1, wherein the set of control actions includes an action state of a tie switch in the power distribution network.
8. A power distribution network fault recovery device, the device comprising:
the system comprises a state acquisition module, a state acquisition module and a state monitoring module, wherein the state acquisition module is used for acquiring the current observable state of the power distribution network under the condition that the power distribution network fails, and the observable state comprises failure state parameters and environment parameters which cause the power distribution network to fail;
the control action determining module is used for inquiring a historical self-healing action database according to the current observable state to obtain a control action set corresponding to the current observable state, and a plurality of corresponding relations between the observable state and the control action are stored in the historical self-healing action database;
the overhead determining module is used for sequentially simulating and executing the control actions in the control action set and acquiring the troubleshooting overhead corresponding to the executed control actions after each execution;
an optimal action determining module, configured to determine an optimal control action from the control action set according to the obtained multiple troubleshooting overheads;
and the fault recovery module is used for carrying out fault recovery on the power distribution network based on the optimal control action.
9. A control device comprising a memory and a processor, the memory storing a computer program, characterized in that the processor realizes the steps of the method of any one of claims 1 to 7 when executing the computer program.
10. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method of any one of claims 1 to 7.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011214337.4A CN112467686B (en) | 2020-11-04 | 2020-11-04 | Power distribution network fault recovery method, device, control equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011214337.4A CN112467686B (en) | 2020-11-04 | 2020-11-04 | Power distribution network fault recovery method, device, control equipment and storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112467686A true CN112467686A (en) | 2021-03-09 |
CN112467686B CN112467686B (en) | 2023-09-01 |
Family
ID=74835942
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011214337.4A Active CN112467686B (en) | 2020-11-04 | 2020-11-04 | Power distribution network fault recovery method, device, control equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112467686B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113541092A (en) * | 2021-09-15 | 2021-10-22 | 国网天津市电力公司城南供电分公司 | Power distribution network relay protection artificial intelligence setting method based on reinforcement learning algorithm |
CN114336500A (en) * | 2021-12-30 | 2022-04-12 | 广东电网有限责任公司 | Fault diagnosis and power restoration method, device and equipment for regional power grid |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020080539A1 (en) * | 2000-12-27 | 2002-06-27 | Mcclure Graeme N. | Loop restoration scheme for distribution feeders |
JP2018025968A (en) * | 2016-08-10 | 2018-02-15 | 日本電信電話株式会社 | Restoration control system and method |
CN108123439A (en) * | 2018-01-03 | 2018-06-05 | 燕山大学 | A kind of repairing of active distribution network multiple faults and restoration methods based on multi-agent system |
CN109193652A (en) * | 2018-10-30 | 2019-01-11 | 贵州电网有限责任公司 | A kind of distribution network failure self-healing system containing distributed generation resource based on Situation Awareness |
CN110224383A (en) * | 2019-06-28 | 2019-09-10 | 深圳供电局有限公司 | Power distribution network fault self-healing and line loss cooperative optimization method and system |
CN111082401A (en) * | 2019-11-15 | 2020-04-28 | 国网河南省电力公司郑州供电公司 | Self-learning mechanism-based power distribution network fault recovery method |
CN111751655A (en) * | 2020-05-29 | 2020-10-09 | 深圳供电局有限公司 | Fault self-healing method and device for distribution line, computer equipment and storage medium |
-
2020
- 2020-11-04 CN CN202011214337.4A patent/CN112467686B/en active Active
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020080539A1 (en) * | 2000-12-27 | 2002-06-27 | Mcclure Graeme N. | Loop restoration scheme for distribution feeders |
JP2018025968A (en) * | 2016-08-10 | 2018-02-15 | 日本電信電話株式会社 | Restoration control system and method |
CN108123439A (en) * | 2018-01-03 | 2018-06-05 | 燕山大学 | A kind of repairing of active distribution network multiple faults and restoration methods based on multi-agent system |
CN109193652A (en) * | 2018-10-30 | 2019-01-11 | 贵州电网有限责任公司 | A kind of distribution network failure self-healing system containing distributed generation resource based on Situation Awareness |
CN110224383A (en) * | 2019-06-28 | 2019-09-10 | 深圳供电局有限公司 | Power distribution network fault self-healing and line loss cooperative optimization method and system |
CN111082401A (en) * | 2019-11-15 | 2020-04-28 | 国网河南省电力公司郑州供电公司 | Self-learning mechanism-based power distribution network fault recovery method |
CN111751655A (en) * | 2020-05-29 | 2020-10-09 | 深圳供电局有限公司 | Fault self-healing method and device for distribution line, computer equipment and storage medium |
Non-Patent Citations (3)
Title |
---|
孙秀飞等: "考虑光伏波动性与负荷时变性的ADN多故障修复策略", 分布式能源, no. 02, pages 30 - 39 * |
张亦凌: "地区电网故障恢复系统的设计与实现", 《山西电力》 * |
张亦凌: "地区电网故障恢复系统的设计与实现", 《山西电力》, no. 06, 30 December 2015 (2015-12-30), pages 12 - 15 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113541092A (en) * | 2021-09-15 | 2021-10-22 | 国网天津市电力公司城南供电分公司 | Power distribution network relay protection artificial intelligence setting method based on reinforcement learning algorithm |
CN113541092B (en) * | 2021-09-15 | 2021-11-26 | 国网天津市电力公司城南供电分公司 | Power distribution network relay protection artificial intelligence setting method based on reinforcement learning algorithm |
CN114336500A (en) * | 2021-12-30 | 2022-04-12 | 广东电网有限责任公司 | Fault diagnosis and power restoration method, device and equipment for regional power grid |
CN114336500B (en) * | 2021-12-30 | 2024-05-28 | 广东电网有限责任公司 | Fault diagnosis and power restoration method, device and equipment for regional power grid |
Also Published As
Publication number | Publication date |
---|---|
CN112467686B (en) | 2023-09-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104934968B (en) | Distribution based on multiple agent answers calamity to recover control method for coordinating and device | |
CN112215722B (en) | Dominant instability mode discrimination model construction method and dominant instability mode discrimination method | |
CN112467686A (en) | Power distribution network fault recovery method and device, control equipment and storage medium | |
US5627760A (en) | Method and apparatus for real time recursive parameter energy management system | |
CN113759868A (en) | Industrial Ethernet fault prediction method based on neural network | |
CN111669123A (en) | Method and device for fault diagnosis of photovoltaic string | |
CN117791597B (en) | Power distribution network fault self-healing method and system based on machine learning | |
CN111489539A (en) | Household appliance system fault early warning method, system and device | |
CN115877198A (en) | Primary and secondary fusion switch fault diagnosis early warning system based on edge calculation | |
CN116609651A (en) | Detection method of multifunctional circuit breaker for photovoltaic | |
CN117639228A (en) | Power distribution network running state prediction method and system based on digital twin | |
CN117933447A (en) | Fault prediction method and system based on data analysis | |
CN116976488A (en) | Air conditioner energy consumption prediction method for optimizing BP neural network based on sparrow search algorithm | |
CN117913895B (en) | LightGBM-LSTM-based power distribution network topology identification method and device | |
CN117787915A (en) | Digital twin intelligent brain construction method for power distribution network | |
CN114091750A (en) | Power grid load abnormity prediction method, system and storage medium | |
CN116794545A (en) | Yun Bian cooperation-based battery health prediction method and system | |
CN110533257A (en) | Prediction method for district heating load | |
CN113204280B (en) | Method, system, equipment and medium for diagnosing power failure | |
CN110232482B (en) | Equipment management method and device based on neural network | |
CN115693916A (en) | Intelligent online monitoring method and system for direct-current power supply of transformer substation | |
Weng et al. | An optimal control-based cascading failure risk assessment method | |
CN118465439B (en) | Electric power grid state intelligent diagnosis system and method based on artificial intelligence | |
CN117722306A (en) | On-line monitoring method and system for water turbine speed regulator | |
CN117973606A (en) | New energy output prediction method and device, electronic equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |