CN116128543B

CN116128543B - Comprehensive simulation operation method and system for load declaration and clearing of electricity selling company

Info

Publication number: CN116128543B
Application number: CN202211622614.4A
Authority: CN
Inventors: 梁波; 杨明; 员江洋; 王鑫; 张海静; 王旭东; 孙小斌; 李函奇; 王所钺; 王孜旭; 刘畅; 冯延坤
Original assignee: Shandong University; Marketing Service Center of State Grid Shandong Electric Power Co Ltd
Current assignee: Shandong University; Marketing Service Center of State Grid Shandong Electric Power Co Ltd
Priority date: 2022-12-16
Filing date: 2022-12-16
Publication date: 2024-05-24
Anticipated expiration: 2042-12-16
Also published as: CN116128543A

Abstract

The invention discloses a comprehensive simulation operation method and system for load reporting and clearing of an electricity selling company, comprising the following steps: according to the acquired electric power market clearing information, a Markov decision model is built with the aim of minimizing the comprehensive cost of an electric power selling company; adopting a double-depth Q network algorithm to obtain an optimal strategy for reporting the load of an electricity selling company; and aiming at minimizing the running cost of the power system, taking system constraint, unit constraint and network security constraint into consideration, constructing a market clearing model, carrying out power market clearing simulation according to the load reporting optimal strategy of the electric company, and obtaining the power market clearing information of the next stage so as to formulate the load reporting optimal strategy of the electric company in the next stage. The load reporting decision-making behavior of the electricity selling company is intelligently simulated by adopting a double-depth Q network algorithm; by providing a market clearing model, a data-driven decision strategy is embedded in the market clearing model to reduce the clearing cost of an electricity company in market transactions.

Description

Comprehensive simulation operation method and system for load declaration and clearing of electricity selling company

Technical Field

The invention relates to the technical field of electric power markets, in particular to a comprehensive simulation operation method and system for load reporting and clearing of an electricity selling company.

Background

The statements in this section merely provide background information related to the present disclosure and may not necessarily constitute prior art.

The intelligent decision method is researched in the face of new situations of various electric power selling companies participating in electric power market transaction, and the intelligent decision method has very important significance and value for optimizing self behavior strategies of the electric power selling companies and realizing income maximization and optimal allocation of electric power market resources.

In the electric power market environment, an electricity selling company directly participates in market transaction, and electricity purchasing cost is reduced by changing self decision-making behavior; the main business is to purchase electric energy in each trade market and sell the electric energy to various end users in a retail mode. In which an electricity vending company typically provides a fixed price of electricity supply to the user, the rate adjustment is not frequent. In this context, in order to ensure self-income, the electricity selling company must minimize the cost of electricity purchasing on the basis of satisfying the user's demand, maintaining the average electricity purchasing price lower than the electricity selling price. Thus, the major challenges facing electricity companies are fluctuating electricity prices and random consumer load demands.

Many students at home and abroad research the electricity purchasing decision of electricity selling companies, and the system introduces the decision problem under the uncertainty of the electric power market. Research mainly comprises load prediction, electricity purchasing strategies, retail pricing strategies and the like. Load prediction is the basic basis for the electricity selling company to make optimal decisions; in the actual power market, however, the accuracy of load forecasting remains a difficult problem for power companies due to data quality limitations and the influence of many variables.

The electric energy purchasing process is a key step of decision making, and a plurality of research results exist in the related field at home and abroad, and the research results mainly focus on influence factor analysis and purchasing optimization strategy formulation in the purchasing process. But in different cases the above-described method lacks versatility.

Retail pricing strategies are one of the core challenges for retailers to increase profitability, but existing approaches require implementation of fixed models and lack the ability to accommodate complex and dynamic environments.

Disclosure of Invention

In order to solve the problems, the invention provides a comprehensive simulation operation method and a system for reporting and clearing the load of an electricity selling company, which adopts a double-depth Q network algorithm to intelligently simulate the load reporting decision-making behavior of the electricity selling company; by providing a market clearing model, a data-driven decision strategy is embedded in the market clearing model to reduce the clearing cost of an electricity company in market transactions.

In order to achieve the above purpose, the present invention adopts the following technical scheme:

In a first aspect, the invention provides a comprehensive simulation operation method for reporting and clearing loads of an electricity selling company, which comprises the following steps:

According to the acquired electric power market clearing information, a Markov decision model is built with the aim of minimizing the comprehensive cost of the electric power selling company, and a state space, an action space and a cumulative rewarding function of the electric power selling company are formulated;

Generating an action value of an agent of the electric company by adopting a double-depth Q network algorithm for the Markov decision model, and obtaining an optimal strategy of load declaration of the electric company by iterative training until the accumulated rewards of the agent of the electric company are maximum;

aiming at minimizing the running cost of the power system, taking system constraint, unit constraint and network security constraint into consideration to construct a market clearing model;

And according to the optimal strategy of the load reporting of the electric company, adopting a market clearing model to perform electric power market clearing simulation, and obtaining electric power market clearing information of the next stage so as to formulate the optimal strategy of the load reporting of the electric company in the next stage.

As an alternative embodiment, the electric power market clearing information includes: current load prediction, day-ahead electricity prices, real-time electricity prices, and real-time load demands.

As an alternative embodiment, the objective function of the market clearing model is:

Wherein: lambda _i,t is the unit marginal cost of the generator i; p _i,t is the output of the unit i in the t period; The no-load cost, the starting cost and the stopping cost of the unit i are respectively represented; u _i,t is the start-stop state of the unit; lambda _j,t is the electricity price corresponding to the section t of the electricity selling company j; q _j,t is the electricity purchase amount of electricity company j in the section t.

As an alternative embodiment, the system constraint includes a system active power balance constraint; the active power balance constraint of the system is as follows:

∑q_j,t-∑p_i,t＝0

wherein q _j,t is the electricity purchased by electricity selling company j in the t section; p _i,t is the output of the unit i in the t period.

Alternatively, the system constraint includes a system rotation reserve constraint; the system rotation reserve constraint is:

wherein, p _i,t is the output of the unit i in the t period; u _i,t is the start-stop state of the unit.

Alternatively, the jackpot function is:

Wherein: an optimal strategy obtained by the agent in the period t for the electricity selling company d; /(I) Comprehensive electricity price for market in the future; /(I)Comprehensive electricity price for real-time market; /(I)Obtaining a load prediction curve for an agent of an electricity selling company in a market in the day before; Is a real-time load curve; k _d is the action value of the agent of the electricity selling company, and M is the uniform equal division times; d is an electricity selling company set; is a real-time load demand.

As an alternative embodiment, the state space is composed of a latest comprehensive price curve, a 96-point load prediction curve and a historical actual load curve; the action space is composed of action values generated by the agent of the electricity selling company.

In a second aspect, the present invention provides a comprehensive simulation operation system for reporting and clearing loads of an electricity selling company, including:

The decision model construction module is configured to construct a Markov decision model with the aim of minimizing the comprehensive cost of the electric company according to the acquired electric power market clearing information, and to formulate a state space, an action space and a cumulative rewarding function of the electric company;

The load reporting strategy optimization module is configured to adopt a double-depth Q network algorithm to the Markov decision model to generate an action value of an agent of the electricity selling company, and obtain an optimal load reporting strategy of the electricity selling company after iterative training until the accumulated rewards of the agent of the electricity selling company are maximum;

the system comprises a clearing model construction module, a model generation module and a model generation module, wherein the clearing model construction module is configured to take system constraint, unit constraint and network security constraint into consideration to construct a market clearing model aiming at minimizing the running cost of the power system;

The simulation module is configured to adopt a market clearing model to simulate the electric power market clearing according to the load reporting optimal strategy of the electric power selling company, and obtain electric power market clearing information of the next stage so as to formulate the load reporting optimal strategy of the electric power selling company in the next stage.

In a third aspect, the invention provides an electronic device comprising a memory and a processor and computer instructions stored on the memory and running on the processor, which when executed by the processor, perform the method of the first aspect.

In a fourth aspect, the present invention provides a computer readable storage medium storing computer instructions which, when executed by a processor, perform the method of the first aspect.

Compared with the prior art, the invention has the beneficial effects that:

the invention provides a comprehensive simulation operation method and a system for load declaration and clearing of an electricity selling company, which effectively apply deep reinforcement learning to simulate market quotation behaviors of the electricity selling company, and make Markov decision behaviors so as to form an intelligent quotation decision system of the electricity selling company; aiming at different types of market users, the decision-making model of the electricity selling company can stably promote the decision-making level; and developing a quick clearing model, and carrying out market clearing after the electric company obtains an optimal load reporting strategy, so as to compare and analyze the clearing cost per se and improve the use efficiency of a decision and clearing comprehensive system of the electric company.

The invention provides a comprehensive simulation operation method and a system for reporting and clearing the load of an electric company, which are used for formulating an action space, a state space and a cumulative rewarding function based on a Markov decision model, and iteratively simulating the load reporting decision behavior of an agent of the electric company by adopting an efficient and reliable double-depth Q network algorithm. The intelligent decision model of the electricity selling company based on the deep reinforcement learning has the characteristics of data driving, no model, closed-loop control and the like, can overcome the problems of complex calculation and expansibility related to an accurate model, and can also meet the real-time requirement of intelligent decision of the electricity selling company.

The invention provides a comprehensive simulation operation method and system for reporting and clearing the load of an electricity selling company, which aims at the electricity selling company in the market in the day-ahead, and for intelligently simulating the decision-making behavior of the load of the electricity selling company, adopts a double-depth Q network algorithm to simulate the decision-making behavior of the intelligent agent of the electricity selling company and provides a data-driven intelligent decision-making strategy; secondly, a market clearing model is provided, and a data-driven decision strategy is embedded in the market clearing model, so that the clearing cost of an electricity selling company in market transaction is reduced.

Additional aspects of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.

Drawings

The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the invention.

FIG. 1 is a schematic diagram of an implementation mechanism of an intelligent decision model for reporting the load of an electricity-selling company provided in embodiment 1 of the present invention;

Fig. 2 is a schematic diagram of implementation of integrated decision making and clearing of a load declaration of an electric company based on deep reinforcement learning according to embodiment 1 of the present invention;

FIG. 3 is a block diagram of a comprehensive system for determining and clearing the load declaration of an electric company based on deep reinforcement learning according to embodiment 1 of the present invention;

FIG. 4 is a flowchart of a comprehensive simulation operation method for reporting and clearing the load of an electric company provided in embodiment 1 of the present invention;

FIG. 5 is a topology structure diagram of an IEEE 8 node system in example verification provided in embodiment 1 of the present invention;

FIG. 6 is a graph of the latest historical real-time load in the example verification provided in embodiment 1 of the present invention;

FIG. 7 is a graph of historical day-ahead, real-time electricity prices for example verification provided in example 1 of the present invention;

FIG. 8 is a graph of current load prediction in example verification provided in example 1 of the present invention;

FIG. 9 is a graph of the cumulative rewards and average rewards variation of the training process of the example verification provided in example 1 of the invention;

FIG. 10 is a graph showing the optimal decision of the electric company after training in the example verification provided in the embodiment 1 of the present invention;

FIG. 11 is a graph showing the comparison between the optimal reporting curve and the load prediction of the electric company 1 in the example verification provided in the embodiment 1 of the present invention;

FIG. 12 is a graph showing the comparison between the optimal reporting curve and the load forecast of the electric company 2 in the example verification provided in the embodiment 1 of the present invention;

FIG. 13 is a graph showing the comparison between the optimal reporting curve and the load forecast of the electric company 3 in the example verification provided in the embodiment 1 of the present invention;

Fig. 14 is a graph showing the ratio of the electricity purchase cost obtained by the electricity selling company according to the load prediction to the electricity purchase cost based on DDQN algorithm in the example verification provided in the embodiment 1 of the present invention.

Detailed Description

The invention is further described below with reference to the drawings and examples.

It should be noted that the following detailed description is exemplary and is intended to provide further explanation of the invention. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.

It is noted that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of exemplary embodiments according to the present invention. As used herein, unless the context clearly indicates otherwise, the singular forms also are intended to include the plural forms, and furthermore, it is to be understood that the terms "comprises" and "comprising" and any variations thereof are intended to cover non-exclusive inclusions, such as, for example, processes, methods, systems, products or devices that comprise a series of steps or units, are not necessarily limited to those steps or units that are expressly listed, but may include other steps or units that are not expressly listed or inherent to such processes, methods, products or devices.

Embodiments of the invention and features of the embodiments may be combined with each other without conflict.

Example 1

The embodiment provides a comprehensive simulation operation method for decision-making and clearing of an electricity selling company based on deep reinforcement learning, which comprises the following steps:

The method of the present embodiment is described in detail below with reference to fig. 1-4.

In this embodiment, various types of clearing information sent by the electric power market include: current load prediction, day-ahead electricity price, real-time electricity price and real-time load demand;

according to the various clearing information, a Markov decision model is built with the aim of minimizing the comprehensive cost of the electric company, and an action space, a state space and a cumulative rewarding function of the electric company are formulated;

and dynamically generating an action value of the electric company agent by adopting a double-depth Q network (DDQN) algorithm on the Markov decision model, and iteratively training a load reporting decision curve until the accumulated rewards of the electric company agent are maximum, thereby obtaining an optimal electric company load reporting decision curve.

In this embodiment, in the market before date, the electricity selling company may make a load reporting optimal policy according to information such as load prediction, historical real-time load, current price before date, and real-time price of electricity. The electricity market can be seen as a competitive environment for electricity vendors, each of which is an agent that reacts to changes in the market environment. In fact, this is a typical Markov decision behavior, reinforcement Learning (RL) is an effective way to solve the Markov Decision Problem (MDP). According to the characteristics of the electricity selling company load reporting decision, an MDP model is designed as follows:

state space: the state space of the MDP consists of a latest comprehensive price curve issued by a power dispatching mechanism, a 96-point load prediction curve and a historical actual load curve, as shown in a formula (1).

Action space: the action value is generated by the agent of the electricity selling company d, and the action space is expressed as follows:

A＝{k_d,min,...,k_d,...,k_d,max}|k_d＝(k_d,max-k_d,min)/M (2)

Bonus function: in the power market environment, the optimal control aim is to minimize the long-term electricity purchasing cost of an electricity selling company. In the current work, delay rewards are adopted as the optimization basis of the controller in consideration of the time sequence of market release information.

The utility company declares the load curve in the future, i.e. the work order output by the controller will instruct the utility company to declare the load in the future one hour. Because the release information of the market stage before the day is limited, the state space only has the predicted load of the market before the day and the comprehensive electricity price before the day in the load reporting process, the market in the stage needs to combine the latest information such as the real-time load, the real-time comprehensive electricity price and the like released by the market in the last period to develop the settlement of the electricity cost of the market reporting.

Based on the above flow, the reward function of each action of the controller is:

Wherein: Comprehensive electricity price for market in the future; /(I) Comprehensive electricity price for real-time market; /(I)Obtaining a load prediction curve for an agent of an electricity selling company in a market in the day before; /(I)Is a real-time load curve; k _d is the action value of the agent of the electricity selling company, and M is the uniform equal division times; /(I)An optimal strategy obtained by the agent in the period t for the electricity selling company d; d is an electricity selling company set; is a real-time load demand.

In the embodiment, a DDQN algorithm architecture decision model is adopted, a Markov decision model is utilized to determine a learning method of an optimal strategy, and the optimal strategy is declared by simulating the load of an electricity selling company according to the learning method of the optimal strategy. The DDQN algorithm is a deep reinforcement learning method based on Q learning and DQN, not only inherits the value-based characteristic of the Q learning, but also introduces a neural network to predict the value of the behavior so as to avoid the problem that the Q learning is easy to occur. In addition, DDQN decouples the selection process and the evaluation process, avoiding the problem of overestimation of the Q value. The DDQN algorithm updates the agent parameters during training by the prize values returned by the external environment. The agent dynamically selects the action value through different state spaces to obtain the highest prize.

In fact, the DDQN algorithm differs from the DQN algorithm in the process of selecting the Q value. DQN always selects the maximum Q value of the target network; and DDQN first selects an action with the maximum Q value from the online network using equation (4), then evaluates the action policy by calculating the corresponding target Q value through the target network, and evaluates by equation (5), the time difference error (TD error) of DDQN is expressed as equation (6).

a^*＝argmax_aQ(S_t+1,a;θ_t) (4)

Wherein: q (S _t+1,a;θ_t) is the current Q value of action a _t in state S _t+1 output by the online Q network; q (S _t+1,argmax_aQ(S_t+1,a;θ_t);θ′_t) is a target Q value output by a target Q network; θ _t and θ' _t are online Q network and target Q network parameters, respectively; gamma is the attenuation coefficient; r _t+1 is the jackpot value.

In DDQN algorithm, in order to break the association relation between sample data, probability extraction sample learning such as experience playback pool is adopted to update neural network parameters. However, for the sparse rewards, there are fewer samples that can excite the controller to act correctly, resulting in a less efficient training of the uniform sampling mechanism. Therefore, when the experience playback pool sample data is extracted for training, the sampling probability of each experience sample is determined according to the quality of the experience sample by adopting a priority sampling mode, so that samples with higher priorities are extracted and learned more frequently. The sampling mode can obviously improve the convergence speed of DDQN algorithm and reduce the number of samples required by the convergence of the neural network. The above procedure can be defined as formula (7):

wherein: p (i) is the conversion priority at sample i, with an exponential power distribution characteristic, the exponent determining the priority; Can be obtained from the absolute value of the TD error; p _i can be expressed as p _i =1/rank (i), and rank (i) is the rank of the conversion.

When using probability distribution sampling for preferential playback, there is a bias in the estimation of the expected value. To correct for this deviation, it is typically multiplied by an importance sampling weight, as shown in equation (8):

wherein: n is a capacity replay buffer; beta is an index value; δ _j+1 is obtained by formula (6); η is the step size.

In the embodiment, constraint conditions of system balance, power grid safety and unit operation are considered, a market clearing model is established, and a decision model for reporting the load of the electricity selling company is embedded in the market clearing model so as to simulate the market clearing of an optimal strategy for reporting the load of the electricity selling company;

According to the optimal strategy of the load declaration of the electric company, carrying out electric power market clearing simulation through a market clearing model to obtain the current clearing cost of the electric company for comparison and evaluation; meanwhile, clear information of the electric power market in the next stage is formed, so that an electric power selling company can formulate a load reporting optimal strategy in the next stage; the method is used for realizing balance of market supply and demand, meeting load demands of users, and simultaneously controlling power grid operation strategies such as power output of a generator set, start and stop of loads and the like in a power grid while meeting the load demands of the users, so as to realize balance of power supply demands of the power grid and power consumption demands of the users.

In the embodiment, the intelligent decision model for reporting the load of the electric company based on deep reinforcement learning adjusts the action value of the intelligent agent when meeting the constraint condition of the reporting load of the electric company, so that the cost of the electric company is minimized; meanwhile, a market clearing model aiming at minimizing the running cost of the power system is adopted, and in a certain scheduling period, the given load balance is met, meanwhile, certain boundary conditions and standby requirements are met, and the on-off state of the unit is reasonably distributed, so that the power generation cost is minimized.

In the market in the day-ahead, an independent operator firstly distributes related information of market in the day-ahead and real-time market transaction to each electricity selling company; secondly, the electricity selling company optimizes the strategy of the electricity selling company through an intelligent decision model and forms a declaration load curve. And then, calling a market clearing program to calculate the transaction clearing result of the declared strategy, and comparing and evaluating the transaction clearing result with the transaction clearing result obtained by the forecasting method.

In the embodiment, a multi-type power supply is connected into a power system, constraint conditions such as system balance, power grid safety and unit operation are considered, and a market clearing model aiming at minimizing the operation cost of the power system is established; wherein the objective function is expressed as:

In this embodiment, constraint conditions of the market clearing model include system constraint, unit constraint, and network security constraint;

Wherein the system constraint comprises a system active power balance constraint and a system rotation reserve constraint;

(1) The active power balance constraint of the system is as follows:

Σq_j,t-Σp_i,t＝0 (11)

(2) The system rotation reserve constraint is:

the unit constraint is as follows:

wherein: i=1, 2,..n.

The network security constraints are:

F_l-F_l ^M≤0 (15)

-F_l-F_l ^M≤0 (16)

Wherein: f _l is the power flow of line l; f _l ^M is line l power limit.

In this embodiment, the market clearing model adopts mixed integer programming modeling to solve, performs electric power market clearing simulation according to the load reporting optimal strategy of the electric power selling company, and obtains electric power market clearing information of the next stage so as to formulate the load reporting optimal strategy of the electric power selling company in the next stage.

As shown in fig. 5, the present embodiment introduces an IEEE 8 node system as an example, verifying the feasibility of the method of the present embodiment. Wherein, the transmission capacity of each line is set to 300MW, the relevant parameters of the generator set are shown in table 1, and the detailed information of DDQN super parameters is listed in table 2. In the action space, the capacity of each electric company agent is set to k _min to-50 MW and k _max to 50MW. The motion space is divided into 20 motions at equal intervals, that is, M is 20. The historical real-time load of the electricity selling company is shown in fig. 6, the day-ahead electricity price and the real-time electricity price are shown in fig. 7, and the predicted load curve of the electricity selling company is shown in fig. 8.

Table 1 generator set parameters related to the generator set

Table 2 DDQN super parameters

Super parameter	Value of
		E	300
T	96
		Batch size	32
Experience pool size	5000
		Attenuation Rate	0.9
K	50
		α	0.6
β	0.4
		Learning rate	0.001

In each iteration, the agent adjusts the updating of the parameters according to the rewarding value returned by the external environment in the training process, and obtains better actions from the action space. Fig. 9 shows that the prize value increases with the number of exercises, indicating that the agent has learned effectively from previous experience. It can be seen that during the sample accumulation phase, the evaluation of the Q value and training of the neural network is not ideal due to the insufficient number of samples stored in the experience pool, and thus the revenue of this phase is low. As samples accumulate, agent learning is continually optimized, so revenue also rises significantly, and eventually reaches a steady value as training times increase. This means that the parameters of the agent have converged and can be applied in the actual scenario. Fig. 10 shows the optimal decision curve of the electricity vending company.

In an actual application scenario, the data sample of the training set can be used as a test set, and the 96-point test set in the training steady state is intercepted as an optimal strategy of an electricity selling company. In order to highlight the optimization effect, the DDQN obtained 96-point optimal decision curve is compared with a load prediction curve, as shown in fig. 11-13; meanwhile, fig. 14 shows the ratio of the electricity purchase cost obtained by the electricity selling company according to the load prediction to the electricity purchase cost based on DDQN algorithm. It can be observed that the ratio is almost greater than 1, and it can be stated that the electricity purchase cost obtained by using DDQN algorithm is generally smaller than that obtained using the predictive method. Therefore, the method can significantly improve the decision level of the electricity selling company.

To further verify the performance of the proposed method, the present embodiment performs simulation in the following two scenarios, as follows: (1) The electricity vending company uses the proposed method to optimize its own strategy and then makes market clearing. (2) The electricity selling company runs the market clear according to the load prediction curve issued by the ISO. The optimal decision curves and load prediction curves of the electricity vending company are shown in fig. 11-13. The results of the clearing of the electricity selling company in both cases are shown in table 3, and it is apparent that the electric clearing cost of the electricity selling company obtained by DDQN method is lower than the clearing cost of the use prediction method.

TABLE 3 analysis of market clearing results

Example 2

The embodiment provides a comprehensive simulation operation system for reporting and clearing loads of an electricity selling company, which comprises the following components:

It should be noted that the above modules correspond to the steps described in embodiment 1, and the above modules are the same as examples and application scenarios implemented by the corresponding steps, but are not limited to those disclosed in embodiment 1. It should be noted that the modules described above may be implemented as part of a system in a computer system, such as a set of computer-executable instructions.

In further embodiments, there is also provided:

An electronic device comprising a memory and a processor and computer instructions stored on the memory and running on the processor, which when executed by the processor, perform the method described in embodiment 1. For brevity, the description is omitted here.

It should be understood that in this embodiment, the processor may be a central processing unit CPU, and the processor may also be other general purpose processors, digital signal processors DSP, application specific integrated circuits ASIC, off-the-shelf programmable gate array FPGA or other programmable logic device, discrete gate or transistor logic devices, discrete hardware components, or the like. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.

The memory may include read only memory and random access memory and provide instructions and data to the processor, and a portion of the memory may also include non-volatile random access memory. For example, the memory may also store information of the device type.

A computer readable storage medium storing computer instructions which, when executed by a processor, perform the method described in embodiment 1.

The method in embodiment 1 may be directly embodied as a hardware processor executing or executed with a combination of hardware and software modules in the processor. The software modules may be located in a random access memory, flash memory, read only memory, programmable read only memory, or electrically erasable programmable memory, registers, etc. as well known in the art. The storage medium is located in a memory, and the processor reads the information in the memory and, in combination with its hardware, performs the steps of the above method. To avoid repetition, a detailed description is not provided herein.

Those of ordinary skill in the art will appreciate that the elements of the various examples described in connection with the present embodiments, i.e., the algorithm steps, can be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.

While the foregoing description of the embodiments of the present invention has been presented in conjunction with the drawings, it should be understood that it is not intended to limit the scope of the invention, but rather, it is intended to cover all modifications or variations within the scope of the invention as defined by the claims of the present invention.

Claims

1. The utility model provides a comprehensive simulation operation method for the load declaration and clearing of an electricity selling company, which is characterized by comprising the following steps:

According to the optimal strategy of the load declaration of the electric company, adopting a market clearing model to perform electric power market clearing simulation, and obtaining electric power market clearing information of the next stage so as to formulate the optimal strategy of the load declaration of the electric company in the next stage;

the objective function of the market clearing model is as follows:

Wherein: lambda _i,t is the unit marginal cost of the generator i; p _i,t is the output of the unit i in the t period; The no-load cost, the starting cost and the stopping cost of the unit i are respectively represented; u _i,t is the start-stop state of the unit; lambda _j,t is the electricity price corresponding to the section t of the electricity selling company j; q _j,t is the electricity quantity purchased by the electricity selling company j in the section t;

The cumulative bonus function is:

Wherein: an optimal strategy obtained by the agent in the period t for the electricity selling company d; /(I) Comprehensive electricity price for market in the future; /(I)Comprehensive electricity price for real-time market; /(I)Obtaining a load prediction curve for an agent of an electricity selling company in a market in the day before; /(I)Is a real-time load curve; k _d is the action value of the agent of the electricity selling company, and M is the uniform equal division times; d is an electricity selling company set; /(I)Is a real-time load demand.

2. The integrated simulation operation method for load declaration and clearing of an electric company according to claim 1, wherein the electric power market clearing information includes: current load prediction, day-ahead electricity prices, real-time electricity prices, and real-time load demands.

3. The method for integrated simulated operation of utility load reporting and clearing as claimed in claim 1, wherein said system constraints comprise system active power balance constraints; the active power balance constraint of the system is as follows:

∑q_j,t-∑p_i,t＝0

4. The method for integrated simulation operation of load declaration and clearing of an electric company according to claim 1, wherein the system constraint includes a system rotation reserve constraint; the system rotation reserve constraint is:

5. The integrated simulation operation method for the load declaration and clearing of an electric company according to claim 1, wherein the state space consists of a latest integrated price curve, a 96-point load prediction curve and a historical actual load curve; the action space is composed of action values generated by the agent of the electricity selling company.

6. The utility model provides a comprehensive simulation operation system of selling electric company load declaration and clearing which characterized in that includes:

The simulation module is configured to adopt a market clearing model to simulate the electric power market clearing according to the load reporting optimal strategy of the electric power selling company, and obtain electric power market clearing information of the next stage so as to formulate the load reporting optimal strategy of the electric power selling company in the next stage;

the objective function of the market clearing model is as follows:

The cumulative bonus function is:

7. An electronic device comprising a memory and a processor and computer instructions stored on the memory and running on the processor, which when executed by the processor, perform the method of any one of claims 1-5.

8. A computer readable storage medium storing computer instructions which, when executed by a processor, perform the method of any of claims 1-5.