CN112290535A

CN112290535A - Online scheduling method of electricity-gas integrated energy system based on deep strategy optimization

Info

Publication number: CN112290535A
Application number: CN202011008104.9A
Authority: CN
Inventors: 胡维昊; 杜月芳; 李坚; 张斌; 曹迪; 黄越辉; 王晓蓉; 许潇; 邓惠文; 王浩
Original assignee: University of Electronic Science and Technology of China; China Electric Power Research Institute Co Ltd CEPRI
Current assignee: University of Electronic Science and Technology of China; China Electric Power Research Institute Co Ltd CEPRI
Priority date: 2020-09-23
Filing date: 2020-09-23
Publication date: 2021-01-29

Abstract

The invention discloses an online scheduling method of an electric-natural gas integrated energy system based on a depth certainty gradient strategy, which aims at the double uncertainties of intermittence of wind power, randomness of real-time electric power and natural gas markets and the like, simultaneously considers the net load fluctuation of an electric power system and the economic income of the system, adopts a depth reinforcement learning method, converts the real-time income model solving process of the electric-natural gas integrated energy system into a limited Markov decision process, and adopts a depth certainty gradient strategy (DDPG) algorithm to solve the decision problem; the electric-natural gas comprehensive energy system has high stability and economy because net load fluctuation is considered in the optimized operation of the electric-natural gas comprehensive energy system, and wind power is smoothly connected into a power grid.

Description

Online scheduling method of electricity-gas integrated energy system based on deep strategy optimization

Technical Field

The invention belongs to the technical field of new energy power generation, and particularly relates to an on-line scheduling method of an electricity-natural gas comprehensive energy system based on a depth certainty gradient strategy.

Background

The comprehensive energy system is expected to solve the contradiction of renewable energy penetration and is developed into a new mode of future energy utilization. Natural gas is used as an important primary energy source and is most closely related to a power grid. Therefore, the electricity-natural gas integrated energy system has become the basis of the energy internet.

The traditional energy management method has deterministic rules and abstract models. Both have limitations respectively: deterministic rules can only run optimally in stationary systems and are limited by fixed variables; the process of creating an abstract model is complex and generally depends on creating an actual model, the performance of which is related to the skill and experience of a modeler. In addition, due to the intermittent characteristic of wind power, the operation risk of a power grid containing high-permeability wind power is high, the emergency reserve capacity is large, and the stability and the economy of the power grid are not guaranteed. In the existing optimization scheduling research of the electricity-natural gas integrated energy system, wind power is mostly scheduled based on day-ahead optimization, and because wind power generation has a fluctuation characteristic which is difficult to predict, the method is difficult to ensure the real-time optimal operation of the photovoltaic system.

Disclosure of Invention

The invention aims to overcome the defects of the prior art and provides an on-line scheduling method of an electric-natural gas integrated energy system based on a depth certainty gradient strategy, which aims to minimize the net load fluctuation while considering the lowest system operation cost and realizes the real-time optimized operation of the electric-natural gas integrated energy system through a depth certainty gradient strategy algorithm.

In order to achieve the aim, the invention provides an on-line scheduling method of an electricity-natural gas integrated energy system based on a depth deterministic gradient strategy, which comprises the following steps:

(1) collecting historical power generation data of the wind power station wp and recording the historical power generation data as P_wp,tWherein t represents time, t is 1,2,3, …; collecting historical on-line electricity price of the electricity-natural gas comprehensive energy system and recording as lambda_t(ii) a Market price of collected natural gas, recorded as kappa_t(ii) a Collecting the user load of the integrated energy system of electric-natural gas, and recording as P_l,t；

(2) Constructing a power output model of each component of the electricity-natural gas integrated energy system (IEngs);

(2.1) converting electricity into gas p2g equipment output model;

wherein the content of the first and second substances,

for the conversion efficiency of the P2g plant, P_p2g,tFor the power consumed by the p2g device at time t, H_gIs the heating value of natural gas, Q_p2g,tThe output natural gas flow rate of the plant at time t, p2 g;

(2.2) a gas turbine gt equipment output model;

wherein the content of the first and second substances,

for the conversion efficiency of the plant, P_gt,tElectric energy output for gt device at time t, Q_gt,tThe natural gas flow consumed by the equipment at the time t;

(2.3) a thermal power generating unit g equipment output model;

P_g,t＝P_p2g,t+P_l,t-α_tP_wp,t-P_gt,t

wherein, P_g,tFor the output electric power of g equipment of the thermal power generating unit, P_l,tFor the consumer electrical load at time t, α_tThe ratio of wind power to power grid at the moment t;

(3) constructing an online scheduling objective function and constraint conditions of the electricity-natural gas integrated energy system;

the objective function is:

wherein f is_oc,tFor the operating cost of the electric-natural gas integrated energy system at time t, f (p)_g,t) Expressed as the running cost of the thermal power generating unit at the moment t, { C_n,C_e,C_wp,C_p2g,C_gtThe cost coefficients are respectively a supply cost coefficient of a natural gas source, an air outlet cost coefficient of an air storage tank, a cost coefficient of a wind power access power grid, an operation cost coefficient of p2g, an operation cost coefficient of gt, and Q_n,tThe time t is the gas output of the natural gas source, when Q is_n,tLess than 0, which means that the natural gas source gives out gas to the natural gas market at the time t to obtain profit when Q is_n,tGreater than 0, which indicates that the natural gas source is changed to electricity-natural gas at the time tThe gas system supplies natural gas; p_net,tExpressed as the net load of the electro-natural gas system at time t, omega is the economic reduction factor, f_pls,tEconomic losses expressed as net load fluctuations adjacent in time t; { a_g,b_g,c_gThe coefficient is the operation cost coefficient of the thermal power generating unit; t is an optimized operation period, and F is the operation cost and the net load fluctuation economic conversion cost of the electricity-natural gas integrated energy system in the operation period;

the constraint conditions are as follows:

wherein the content of the first and second substances,

represents the minimum gas output of the natural gas source,

represents the maximum gas output of the natural gas source,

indicating the maximum amount of intake air of the air tank,

indicating the minimum amount of intake air of the air tank,

which indicates the maximum amount of intake air of the gas turbine,

represents the maximum electrical energy input for electrical conversion,

the minimum output of the thermal power generating unit is shown,

representing the maximum output of the thermal power generating unit;

(4) establishing and training an electric-natural gas integrated energy system online scheduling model based on a Deep Deterministic Gradient Policy (DDPG);

(4.1) converting an online scheduling objective function of the electric-natural gas integrated energy system in one operation period into a Markov decision process comprising a state set S, an action set A and a reward function r;

wherein S comprises all states in the decision process, and the state S at the moment t_t＝{P_wp,t,λ_t,κ_t,SoC_t,P_l.t}; a comprises all actions of the decision process, and the action at the moment t

At time t at s_tLower execution a_tThe real-time reward obtained is denoted as r_t(a_t|s_t)；

r_t(a_t|s_t)＝-(f_oc,t+f_pls,t)

(4.2) four groups of neural networks with the same structure required by the DDPG algorithm are constructed;

constructing two action networks on line, recording the two action networks as mu and mu', wherein parameter sets are respectively recorded as theta^μ、θ^μ'For realizing an input state s_tTo the output action a_t；

Two evaluation networks are constructed on line and are recorded as upsilon and upsilon', and parameter sets are recorded as theta respectively^υ、θ^υ'For realizing an input state s_tAnd an output action a_tTo the action merit function Q^π(s_t,a_t) Pi is a mapping strategy;

(4.3) setting the total iteration number N of an electric-natural gas integrated energy system online scheduling model based on a depth certainty gradient strategy algorithm and the iteration step number T of a Markov process; setting a memory bank, recording the capacity of the memory bank as M, and initializing the memory bank to be empty; initializing parameter sets of all neural networks, initializing n to 1, initializing a learning rate alpha and initializing a counter m;

(4.4) resetting the electricity-natural gas comprehensive energy system, setting t to 1 and acquiring the current state s_tThen a Markov process is performed once;

(4.5) judging whether T is smaller than T, if T is smaller than T, entering the step (4.6), otherwise, entering the step (4.14);

(4.6) mixing s_tInput to the action network mu to obtain the output action a_t；

(4.7) operation a according to the output_tCalculating the objective function value r in the step (4.1)_tWhile obtaining a_tState s at the next moment after the action_t+1；

(4.8) construction of tuple information s_t,a_t,r_t,s_t+1Storing the position M% M in a memory bank, and then assigning M to be M + 1;

(4.9) judging whether M is larger than M, if so, entering the step (4.10); otherwise, entering the step (4.13);

(4.10) updating the parameter set theta of the action network mu on line based on the depth deterministic gradient strategy^μ；

Where b represents the number of tuple information with equal probability sampling put back from the memory bank, s_i,a_iRepresenting the corresponding state and action in the ith tuple information;

is shown in state s_iAnd action a_iCalculating a for the action cost function_iThe gradient of (a) of (b) is,

is shown in state s_iOf lower network muA gradient;

(4.11) updating the parameter set theta of the evaluation network upsilon on line by minimizing a loss function^υ；

Wherein, L (theta)^υ) A loss function of the network v; q(s)_i,a_i) Is shown in state s_iAnd action a_iEvaluating the action value function value output by the network upsilon; y is_iAn estimate representing the υ' network; gamma is a discount factor that is a function of,

a gradient representing a loss function of the network v;

(4.12), updating a parameter set of the action network mu 'and the evaluation network upsilon';

wherein tau is an update coefficient;

(4.13) updating the current state, assigning t as t +1 and s_t＝s_t+1And then returning to the step (4.7);

(4.14), making N equal to N +1, judging whether N is larger than N, if yes, entering the step (4.15); otherwise, entering the step (4.4);

(4.15) stopping iteration, and outputting a neural network parameter set to obtain an on-line scheduling model of the electricity-natural gas integrated energy system;

(5) real-time collection of photovoltaic output

Electricity price data

Natural gas price data

And user load

Then determining the real time

And constructs the real-time state at the moment t

Finally will be

Inputting the data into an on-line scheduling model of the electricity-natural gas integrated energy system to obtain real-time output action

Then follow

And the on-line scheduling of the electricity-natural gas comprehensive energy system is realized.

The invention aims to realize the following steps:

the invention relates to an on-line scheduling method of an electric-natural gas integrated energy system based on a depth certainty gradient strategy, which adopts a depth reinforcement learning method to convert a real-time income model solving process of the electric-natural gas integrated energy system into a limited Markov decision process and adopts a depth certainty gradient strategy (DDPG) algorithm to solve the decision problem aiming at the double uncertainties of intermittence of wind power, randomness of real-time electric power and natural gas markets and the like and simultaneously considering the net load fluctuation of the electric power system and the economic income of the system; the electric-natural gas comprehensive energy system has high stability and economy because net load fluctuation is considered in the optimized operation of the electric-natural gas comprehensive energy system, and wind power is smoothly connected into a power grid.

Meanwhile, the on-line scheduling method of the electricity-natural gas integrated energy system based on the depth certainty gradient strategy also has the following beneficial effects:

(1) compared with the traditional excellent Particle Swarm Optimization (PSO) and deep Q network, the result shows that the DDPG algorithm-based online scheduling method for the electric-natural gas integrated energy system has the advantages of short time consumption, good optimization result and the like, so that the DDPG algorithm has better superiority in coping with various uncertain factors in the system.

(2) The invention considers the net load fluctuation during the optimized operation, and enables the wind power to be smoothly connected into the power grid, thereby ensuring the real-time optimal operation of the electricity-natural gas comprehensive energy system and having higher stability and economy.

Drawings

FIG. 1 is a flow chart of an online scheduling method of an electric-natural gas integrated energy system based on a depth deterministic gradient strategy;

FIG. 2 is a block diagram of an electric-natural gas integrated energy system;

FIG. 3 is a training flow diagram of an electric-natural gas integrated energy system online scheduling model;

FIG. 4 is a graph of the output of various components based on four scenarios;

Detailed Description

The following description of the embodiments of the present invention is provided in order to better understand the present invention for those skilled in the art with reference to the accompanying drawings. It is to be expressly noted that in the following description, a detailed description of known functions and designs will be omitted when it may obscure the subject matter of the present invention.

Examples

In the present embodiment, as shown in fig. 2, the electricity-natural gas integrated energy system is composed of a wind farm, a power grid and a natural gas network;

the wind power plant is connected with the bus bar through a transformer; the busbar is connected to the grid via an overhead transmission line.

The method for optimizing the electricity-natural gas integrated energy system in real time based on the depth deterministic gradient strategy is described in detail with reference to fig. 2.

In this embodiment, as shown in fig. 1, the method for online scheduling of an electricity-natural gas integrated energy system based on a depth deterministic gradient strategy of the present invention includes the following steps:

s1, collecting historical power generation data of the wind power station wp in the past year and recording the historical power generation data as P_wp,tWherein t represents a time, and t is 1,2, 3.; the historical online electricity price of the past year of the electricity-natural gas comprehensive energy system is collected and recorded as lambda_t(ii) a The price of the collected natural gas market over the past year is recorded as kappa_t(ii) a Collecting the user load of the past year of the electric-natural gas integrated energy system, and recording as P_l,t；

S2, constructing output models of all components of the electricity-natural gas integrated energy system (IEngs);

wherein the content of the first and second substances,

the conversion efficiency of the P2g equipment is 0.8, P_p2g,tFor the power consumed by the p2g device at time t, H_gIs the heating value of natural gas, Q_p2g,tThe output natural gas flow rate of the plant at time t, p2 g;

the conversion efficiency of the gt device is taken as 0.8, P_gt,tElectric energy output for gt device at time t, Q_gt,tThe natural gas flow consumed by the equipment at the time t; p_g,tFor the output electric power of g equipment of the thermal power generating unit, P_l,tFor the consumer electrical load at time t, α_tThe value range of the ratio of the wind power to the power grid at the moment t is [0,1 ]]；

S3, constructing an online scheduling objective function and constraint conditions of the electricity-natural gas integrated energy system;

the objective function is:

wherein f is_oc,tFor the operating cost of the electric-natural gas integrated energy system at time t, f (p)_g,t) Expressed as the running cost of the thermal power generating unit at the moment t, { C_n,C_e,C_wp,C_p2g,C_gtThe values of the supply cost coefficient of the natural gas source, the air outlet cost coefficient of the air storage tank, the abandoned wind cost coefficient, the operation cost coefficient of p2g and the operation cost coefficient of gt are shown in a table 1, and Q is shown in the table_n,tThe time t is the gas output of the natural gas source, when Q is_n,tLess than 0, which means that the natural gas source gives out gas to the natural gas market at the time t to obtain profit when Q is_n,tWhen the natural gas source is more than 0, the natural gas source supplies natural gas to the gas inlet of the electric-natural gas system at the time t; p_net,tExpressed as the net load of the electro-natural gas system at time t, omega is the economic reduction factor, f_pls,tEconomic losses expressed as net load fluctuations adjacent in time t; { a_g,b_g,c_gThe coefficient is the operation cost coefficient of the thermal power generating unit; t is an optimized operation period, and F is the operation cost and the net load fluctuation economic conversion cost of the electricity-natural gas integrated energy system in the operation period;

C_n	C_e	C_wp	C_p2g	C_gt
					30($/m³)	20($/m³)	80($/MW)	200($/MW)	200($/MW)

TABLE 1

The constraint conditions are as follows:

wherein the content of the first and second substances,

the minimum gas output of the natural gas source is shown and is 10m³，

The maximum gas output of the natural gas source is represented and is 500m³，

The maximum air inflow of the air storage tank is represented and is 300m³，

The minimum air inflow of the air storage tank is represented and is 50m³，

The maximum air inflow of the gas turbine is shown and is 200m³，

Represents the maximum electric energy input of the electric conversion gas, takes the value of 10MW,

indicating minimum discharge of thermal power generating unitThe force, which takes the value of 1MW,

the maximum output of the thermal power generating unit is represented, and the value is 50 MW;

s4, as shown in FIG. 3, building and training an electric-natural gas integrated energy system online scheduling model based on a Deep Deterministic Gradient Policy (DDPG);

s4.1, converting an online scheduling objective function of the electricity-natural gas integrated energy system in an operation period into a Markov decision process comprising a state set S, an action set A and a reward function r;

r_t(a_t|s_t)＝-(f_oc,t+f_pls,t)

S4.2, constructing four groups of neural networks with the same structure required by the DDPG algorithm;

s4.3, setting the total iteration number N of an electric-natural gas integrated energy system online scheduling model based on a depth certainty gradient strategy algorithm to 10000 and the iteration step number T of a Markov process to 24; setting a memory bank, recording the capacity of the memory bank as M40000, and initializing the memory bank to be empty; initializing parameter sets of all neural networks, wherein n is 1, learning rate alpha is 0.0001, and a counter m is 0;

s4.4, judging whether T is smaller than T, if T is smaller than T, entering the step S4.5, otherwise, entering the step S4.13;

s4.5, mixing S_tInput to the action network mu to obtain the output action a_t；

S4.6, according to the output action a_tThe value of the objective function r in step S4.1 is calculated_tWhile obtaining a_tState s at the next moment after the action_t+1；

S4.7, establishing tuple information S_t,a_t,r_t,s_t+1Storing the position M% M in a memory bank, and then assigning M to be M + 1;

s4.8, judging whether M is larger than M, if so, entering a step S4.9; otherwise, the step S4.4 is carried out;

s4.9, updating parameter set theta of action network mu on line based on depth certainty gradient strategy^μ；

is shown in state s_iGradient of lower network μ;

s4.10, updating the parameter set theta of the evaluation network upsilon on line through a minimum loss function^υ；

a gradient representing a loss function of the network v;

s4.11, updating a parameter set of the action network mu 'and the evaluation network upsilon';

wherein tau is an update coefficient and takes a value of 0.01;

s4.12, updating the current state, assigning t as t +1 and S_t＝s_t+1Then returning to step S4.7;

s4.13, let N be N +1, and then determine whether N is greater than N, if yes, go to step S4.14; otherwise, the step S4.3 is carried out;

s4.14, stopping iteration, and outputting a neural network parameter set to obtain an online scheduling model of the electricity-natural gas integrated energy system;

s5, collecting photovoltaic output in real time

Electricity price data

Natural gas price data

And user load

Then determining the real time

And constructs the real-time state at the moment t

Finally will be

Then follow

In this embodiment, the processor is selected as an Inter (R) core (TM) i9-9820X CPU @3.30GHz hardware platform, and Python3.7 and tensoflow1.8.0 are used to implement the embodiment of the method of the present invention. And carrying out real-time test on the successfully trained neural network based on four scenes, namely the following four scenes.

Scene one: the p2g equipment and a net load fluctuation model are not contained in the electricity-natural gas integrated energy system, and the optimization target only considers economic benefits;

scene two: the p2g equipment is contained in the electricity-natural gas integrated energy system, and the optimization goal only considers the economic benefit;

scene three: the electric-natural gas integrated energy system comprises a net load fluctuation model, and the optimization target simultaneously considers economic benefit and net load fluctuation to convert economic cost;

scene four: the electric-natural gas integrated energy system simultaneously comprises p2g equipment and a net load fluctuation model, and the optimization objective simultaneously considers economic benefit and net load fluctuation to reduce economic cost;

the result shows that as shown in fig. 4, it can be seen in the figure that when a p2g device is introduced into the electricity-natural gas integrated energy system, a large amount of wind power is absorbed, wind curtailment is obviously reduced, which is beneficial to increase economic benefit, meanwhile, since a net load fluctuation model is not considered, the net load curve fluctuation is obvious, the stability of the power grid is reduced, but the fluctuation amplitude of the net load fluctuation is obviously reduced compared with the net load fluctuation when the p2g device is not introduced, which indicates that p2g plays a role in suppressing the net load fluctuation. When the p2g equipment and the net load fluctuation model are introduced simultaneously, the economic benefit and the stability of the electricity-natural gas integrated energy system are both considered. In addition, in order to further embody the superiority of the method of the present invention, in the embodiment, comparison is made with a conventional particle swarm optimization algorithm (PSO) and a deep Q network algorithm (DQN). The results show that: the DDPG method is a method provided by the invention, and because a large amount of simulation training is carried out on historical wind power output, namely, a neural network describes the output probability characteristic of the wind power, real-time scheduling can be realized; the PSO method is used for carrying out day-ahead optimal economic dispatching according to predicted wind power output, electricity price data and natural gas price data when the economic dispatching of the electricity-natural gas comprehensive energy system is realized, and the day-ahead dispatching strictly depends on the accuracy of a data predicted value; the traditional DQN algorithm neural network is simple in structure and short in time consumption, but the actions need to be discretized, all possible actions in all systems are not considered, and the comprehensiveness of the optimization effect is not enough.

Table 2 shows the comparison of the different methods;

method of producing a composite material	Average running cost ($/day)	Calculating elapsed time (seconds)
			DQN	21453	32.2
PSO	26883	37812
			DDPG	18724	62.3

TABLE 2

Although illustrative embodiments of the present invention have been described above to facilitate the understanding of the present invention by those skilled in the art, it should be understood that the present invention is not limited to the scope of the embodiments, and various changes may be made apparent to those skilled in the art as long as they are within the spirit and scope of the present invention as defined and defined by the appended claims, and all matters of the invention which utilize the inventive concepts are protected.

Claims

1. An online scheduling method of an electricity-gas integrated energy system based on deep strategy optimization comprises the following steps:

(2) Constructing an output model of each component of the electricity-natural gas integrated energy system;

(2.1) converting electricity into gas p2g equipment output model;

wherein the content of the first and second substances,

(2.2) a gas turbine gt equipment output mode model;

wherein, P_gt,tElectric energy output for gt device at time t, Q_gt,tThe natural gas flow consumed by the equipment at the time t;

(2.3) a thermal power generating unit g equipment output model;

P_g,t＝P_p2g,t+P_l,t-α_tP_wp,t-P_gt,t

the objective function is:

wherein f is_oc,tFor the operating cost of the electric-natural gas integrated energy system at time t, f (p)_g,t) Expressed as the running cost of the thermal power generating unit at the moment t, { C_n,C_e,C_wp,C_p2g,C_gtThe cost coefficients are respectively a supply cost coefficient of a natural gas source, an air outlet cost coefficient of an air storage tank, a cost coefficient of a wind power access power grid, an operation cost coefficient of p2g, an operation cost coefficient of gt, and Q_n,tThe time t is the gas output of the natural gas source, when Q is_n,tLess than 0, which means that the natural gas source gives out gas to the natural gas market at the time t to obtain profit when Q is_n,tWhen the natural gas source is more than 0, the natural gas source supplies natural gas to the gas inlet of the electric-natural gas system at the time t; p_net,tExpressed as the net load of the electro-natural gas system at time t, omega is the economic reduction factor, f_pls,tEconomic losses expressed as net load fluctuations adjacent in time t; { a_g,b_g,c_gThe coefficient is the operation cost coefficient of the thermal power generating unit; t is an optimized operation period, and F is the operation cost and the net load fluctuation economic conversion cost of the electricity-natural gas integrated energy system in the operation period;

the constraint conditions are as follows: