CN112001752A - Multi-virtual power plant dynamic game transaction behavior analysis method based on limited rationality - Google Patents

Multi-virtual power plant dynamic game transaction behavior analysis method based on limited rationality Download PDF

Info

Publication number
CN112001752A
CN112001752A CN202010832963.3A CN202010832963A CN112001752A CN 112001752 A CN112001752 A CN 112001752A CN 202010832963 A CN202010832963 A CN 202010832963A CN 112001752 A CN112001752 A CN 112001752A
Authority
CN
China
Prior art keywords
vpp
power
dso
strategy
bidding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010832963.3A
Other languages
Chinese (zh)
Inventor
高红均
张凡
刘友波
刘俊勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sichuan University
Original Assignee
Sichuan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sichuan University filed Critical Sichuan University
Priority to CN202010832963.3A priority Critical patent/CN112001752A/en
Publication of CN112001752A publication Critical patent/CN112001752A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • G06Q30/0206Price or cost determination based on market factors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/06Electricity, gas or water supply

Abstract

The invention discloses a multi-virtual power plant dynamic game transaction behavior analysis method based on rationality, which comprises the following steps of: firstly, each virtual power plant bidding main body fully considers the target requirement of the development stage of the main body and researches the dynamic pricing behavior of an upper-layer operator; secondly, performing transaction behavior analysis on the multiple virtual power plants by adopting different transaction target modeling, performing evolutionary learning on limited transaction behavior information by adopting a particle swarm algorithm, and further improving the self target to gradually become optimal by learning a competitor strategy; finally, a dynamic game computing process of a multi-virtual power plant is researched, and a dynamic game particle swarm optimization algorithm is provided to be combined with an optimization tool box to solve a game model; the solution of the dynamic game model has good convergence, and a new idea and reference are provided for the virtual power plant to formulate different transaction targets to participate in market transaction.

Description

Multi-virtual power plant dynamic game transaction behavior analysis method based on limited rationality
Technical Field
The invention belongs to the technical field of transaction behavior analysis of multiple virtual power plants, and particularly relates to a multiple virtual power plant dynamic game transaction behavior analysis method based on rationality.
Background
Under the power development strategy of renewable energy and electric energy substitution, distributed power sources, energy storage and responsive loads are widely popularized and applied at the level of a power distribution network, so that the traditional large-scale power production and long-distance transmission gradually develop towards distributed energy. But large-scale networking thereof poses a great challenge to the safe operation of the power grid. At present, scholars at home and abroad start from simple profit subjects and have made a lot of researches on aspects such as power generation scheduling and bidding modes of virtual power plants. But as the national grid development strategy has changed in recent years, allowing more diverse social capital to emerge into the electricity market, each VPP will likely belong to a different stakeholder to operate. Under the driving of the self-rationality of each benefit agent, each benefit agent pursues self benefit maximization, and the traditional optimization scheduling method of a single agent is difficult to apply. However, most of the research has focused on the game relationship between the VPPs on the premise of a given transaction price, but the following problems still exist: firstly, the influence of a virtual power plant on market trading power price is not considered, and the virtual power plant is only used as a price acceptor to participate in market operation and is essentially still optimized in a VPP (virtual private Point); secondly, the decision environment is over-ideal, which is mostly game research under complete rationality conditions, not only the principal itself is required to have perfect rationality, but also the principals are required to trust the rationality of each other, which is difficult to achieve in practical situations. Meanwhile, most of the existing literature researches are to traditionally make an optimal bidding scheme with the goal of maximizing profit or minimizing operation cost, but in the future diversified market, each benefit agent has different development requirements in different development stages, and the trading behavior with the goal of maximizing profit or minimizing operation cost cannot occupy the bidding advantage in the fully competitive market.
Disclosure of Invention
The invention aims to provide a multi-virtual power plant dynamic game transaction behavior analysis method based on rationality, which is used for solving one of the technical problems in the prior art, such as: in the prior art, the following problems still exist: firstly, the influence of a virtual power plant on market trading power price is not considered, and the virtual power plant is only used as a price acceptor to participate in market operation and is essentially still optimized in a VPP (virtual private Point); secondly, the decision environment is over-ideal, which is mostly game research under complete rationality conditions, not only the principal itself is required to have perfect rationality, but also the principals are required to trust the rationality of each other, which is difficult to achieve in practical situations. Meanwhile, most of the existing literature researches are to traditionally make an optimal bidding scheme with the goal of maximizing profit or minimizing operation cost, but in the future diversified market, each benefit agent has different development requirements in different development stages, and the trading behavior with the goal of maximizing profit or minimizing operation cost cannot occupy the bidding advantage in the fully competitive market.
In order to achieve the purpose, the technical scheme of the invention is as follows:
a multi-virtual power plant dynamic game transaction behavior analysis method based on limited rationality comprises the following steps:
s1: each virtual power plant bidding main body researches dynamic pricing behaviors of upper operators according to target requirements of a development stage of the virtual power plant bidding main body;
s2: adopting different trading target modeling to analyze trading behaviors of the multiple virtual power plants;
s3: carrying out evolutionary learning on the limited rational transaction behavior information based on a particle swarm algorithm, and gradually optimizing the target of the user by learning a competitor strategy;
s4: the dynamic game computing process of the multi-virtual power plant is researched, and a dynamic game particle swarm optimization algorithm is provided to be combined with an optimization tool box to solve a game model.
Further, according to the influence of different stages of VPP development and decision behaviors of competitors on self optimization decisions, a decision scheme with different targets of multiple VPPs is established; considering the power purchasing and selling quantity and the distributed power output in a VPP as strategies, designing a plurality of VPP bidding decision schemes with different targets, wherein the VPP can select different transaction targets and decision schemes by considering the difference of power consumption of users in different areas and the different pursuits of VPP alliance targets; secondly, in the process of playing the game, each main body can change the behavior rule of the main body after integrating the market transaction behaviors of other competitors to convert the target; in the game playing process, information of competitors can be continuously acquired through evolutionary learning; the strategy made along with the continuous increase of the self-acquired information can be gradually optimized; eventually, no principal can benefit unilaterally from changing bidding strategies, i.e., a stable strategy is achieved.
Further, the stable equilibrium strategy of the multi-benefit agent under the limited rationality condition can be defined as: in dynamic gaming with N principals of interest, there is a stable policy combination X ═ X (X)1,X2,...,XN) For any other policy combination Y ═ (Y)1,Y2,...,YN) I Y ≠ X, there are main bodies i and0e (0,1) is such that:
Figure BDA0002638655300000021
in the formula: i (Y)i,S-i) And I (X)i,S-i) Respectively adopting Y for main body iiAnd XiTarget utility conditions at the time of the policy; s-iIs a combination of policies for other entities of the system than entity i.
Furthermore, the VPP game strategy considers the output plan of each distributed power supply in the VPP, the charging and discharging power of stored energy, the response power of a response load, and the output power of clean energy such as wind and light; the VPP obtains the system load demand and possible bidding strategies of competitors according to the limited market information and other VPP bidding historical data which are mastered at present, so that the maximum self-effectiveness is realized.
Further, the different transaction targets in step S2 include:
the game objective function of the conservative VPP is considered as maximizing self-benefit, and internal resources including distributed power generation cost, energy storage cost and responsive load compensation cost are fully integrated; deciding an optimal bidding strategy; the VPP bidding objective function is:
Figure BDA0002638655300000031
Figure BDA0002638655300000032
Figure BDA0002638655300000033
Figure BDA0002638655300000034
in the formula: lambda [ alpha ]D-V,sell、λD-V,buyRespectively selling electricity price and purchasing electricity price from VPP to DSO;
Figure BDA0002638655300000035
the electric quantity purchased and sold by the VPP to the DSO respectively; n is a VPP total number set, and VPPi belongs to N; n is a radical ofiIs DER set contained in ith VPP;
Figure BDA0002638655300000036
for the cost of power generation by MT in DER, aj、bj、cjFor the cost factor of the MT unit in DERj,
Figure BDA0002638655300000037
output power for MT;
Figure BDA0002638655300000038
in order to be a cost of the ESS operation,
Figure BDA0002638655300000039
for the scheduling cost factor of the ESS,
Figure BDA00026386553000000310
the charging and discharging power of the ESS, the value of which is more than 0 represents discharging, and the value of which is less than 0 represents charging;
Figure BDA00026386553000000311
in order to be able to respond to the load cost,
Figure BDA00026386553000000312
in response toThe load compensates for the electricity price,
Figure BDA00026386553000000313
response power for the response load; theta is subsidy electricity price of the government for new energy power generation;
Figure BDA00026386553000000314
respectively photovoltaic power output and wind power output;
the game objective function of the aggressive VPP is considered as the maximization of the market share scale, and the electric quantity bidding is carried out in the market in a maximized manner on the premise that the internal resources are integrated to meet the constraint output of each unit, so that the exchange electric quantity is obtained on the DSO level as much as possible, and the maximum power purchase quantity is obtained; the VPP bidding objective function is:
Figure BDA00026386553000000315
bidding constraint:
Figure BDA0002638655300000041
Figure BDA0002638655300000042
in the formula:
Figure BDA0002638655300000043
the power purchased from the DSO for the ith VPP,
Figure BDA0002638655300000044
power sold from the DSO for the ith VPP;
Figure BDA0002638655300000045
maximum value of purchasing and selling electric quantity bidding is carried out for VPPi respectively;
after the DSO uniform purchase and sale electricity price and the transaction electricity quantity are obtained, under the premise that an aggressive VPP objective function is met, other schedulable resources in the VPP are coordinated to meet the surplus load electricity quantity, and an electricity quantity shortage scheduling principle meets a lowest cost principle;
Figure BDA0002638655300000046
the game objective function of the stable VPP is considered as the maximum utilization of renewable energy, the renewable energy is considered as wind power and photovoltaic, meanwhile, the uncertainty of the power generation of the renewable energy is fully considered, and the influence of the fluctuation of the output of the renewable energy on the power supply stability of the system is stabilized by using the ESS; then, the output conditions of other units in the VPP and the electricity quantity bidding to the upper DSO are considered, and the VPP bidding objective function is as follows:
Figure BDA0002638655300000047
Figure BDA0002638655300000048
Figure BDA0002638655300000049
Figure BDA00026386553000000410
in the formula: alpha and beta are objective function weight coefficients; rhoj,tThe wind, light and storage combined electricity selling price is obtained;
Figure BDA00026386553000000411
wind, light and storage combined output; f is the fluctuation variance of the wind, light and storage combined output power; f. of0The minimum allowed fluctuation variance value of the system.
Further, when the VPP performs a decision response, no matter which transaction target is executed, the constraint conditions that need to be satisfied include:
and power balance constraint:
Figure BDA0002638655300000051
in the formula:
Figure BDA0002638655300000052
electricity usage that is unresponsive load;
Figure BDA0002638655300000053
is the output power of VPPi;
gas turbine set constraint:
Figure BDA0002638655300000054
Figure BDA0002638655300000055
in the formula:
Figure BDA0002638655300000056
the maximum power and the minimum power of the MT output are respectively the upper limit and the lower limit of the maximum power and the minimum power; r isd,i、ru,iThe downward slope climbing rate and the upward slope climbing rate of the unit j are respectively; Δ T is an operating period;
responsive to load constraints:
Figure BDA0002638655300000057
in the formula:
Figure BDA0002638655300000058
the maximum response quantity of the responsive load;
and (3) renewable energy output constraint:
Figure BDA0002638655300000059
Figure BDA00026386553000000510
in the formula:
Figure BDA00026386553000000511
respectively predicting the maximum value of wind and light output according to historical data;
energy storage restraint:
Figure BDA00026386553000000512
Figure BDA00026386553000000513
Figure BDA00026386553000000514
Figure BDA00026386553000000515
in the formula:
Figure BDA00026386553000000516
a 0-1 variable for ESS discharge and charge status; n is a radical ofESSAn upper limit for the number of discharge and charge state transitions for the ESS;
Figure BDA0002638655300000061
the upper and lower limits of the discharge power;
Figure BDA0002638655300000062
the charging power is the upper and lower limits;
Figure BDA0002638655300000063
at t for ESSCapacity status of the time period;
Figure BDA0002638655300000064
upper and lower capacity limits for consideration of ESS life and other factors;
Figure BDA0002638655300000065
respectively charge and discharge efficiency coefficient, the value range of which is
Figure BDA0002638655300000066
Further, the transaction is considered at the DSO level as:
DSO establishes electricity selling price lambda for VPPD-V,sellAnd the electricity purchase price lambdaD-V,buy(ii) a The DSO takes the maximized self-benefit as a game target, including the electricity purchasing and selling cost and benefit between the DSO and a superior power grid and a lower multi-VPP;
maxuDSO=FVPP-FDSO
in the formula: fVPPRevenue obtained for transactions between the DSO and the VPP; fDSOCosts spent in trading DSO with superior grids; specific expressions of the terms are as follows:
Figure BDA0002638655300000067
Figure BDA0002638655300000068
in the formula: t is a total scheduling time interval; n is a radical ofVPPIn order to be the number of the VPPs,
Figure BDA0002638655300000069
for the real-time electricity rates established by the DSO,
Figure BDA00026386553000000610
a power value for the ith VPP to interact with the DSO in the tth scheduling period;
Figure BDA00026386553000000611
for the border node electricity prices between the DSO and the upper grid,
Figure BDA00026386553000000612
the value of the transmission power between the DSO and the upper power grid is greater than 0, which indicates that the power is purchased from the upper power grid, and less than 0 indicates that the power is sold to the upper power grid.
Further, after obtaining the competitive bidding data of the competitor, the competitor strategy judgment index coefficient is as follows:
Figure BDA00026386553000000613
Figure BDA00026386553000000614
Figure BDA00026386553000000615
in the formula: xi123Index coefficient for judging competitor strategy, when xi1≥ξlim1When the measured value is "aggressive", when xi is2≤ξlim2When it is stable, the product is judged to be stable3≤ξlim3The time is judged to be 'conservative', wherein xi islim1lim2lim3Determining a constant for the index coefficient, and obtaining the constant through historical data of each VPP bidding; if the strategy index judgment is not within the index coefficient judgment constant limit value at the end of one game round, the strategy result is not output, and the next iteration round is continued;
each VPP determines the total benefit of the VPP through selecting a game strategy through evolutionary learning; each strategy corresponds to a transaction behavior mode under the strategy and is used by rhomnRepresents the condition switching state of VPPi trading behavior mode x (u) from strategy m to strategy n; after obtaining the competitive bidding information of competitors, VPPi learns and evolves the strategy of the competitors; if the strategy n is more than the strategy mWith greater utility, then strategy m will learn strategy n and make changes; expressed as:
Figure BDA0002638655300000071
Figure BDA0002638655300000072
in the formula: x is the number ofm(u) and xn(u) transaction behavior patterns of VPPi under policies m and n, respectively, when
Figure BDA0002638655300000073
If the value is more than 0, updating the VPP strategy according to the condition switching state, and if the value is more than 0, updating the VPP strategy according to the condition switching state
Figure BDA0002638655300000074
When the value is always less than 0, a stable equilibrium strategy is achieved; u. ofmAnd unRespectively representing the utility of the corresponding trading behavior modes of the strategies m and n, un-um]+Is max {0, un-um}。
Compared with the prior art, the invention has the beneficial effects that:
the virtual power plant bidding method has the innovation points that firstly, each virtual power plant bidding main body fully considers the target requirement of the development stage of the virtual power plant bidding main body and researches the dynamic pricing behavior of an upper-layer operator; secondly, performing transaction behavior analysis on the multiple virtual power plants by adopting different transaction target modeling, performing evolutionary learning on limited transaction behavior information by adopting a particle swarm algorithm, and further improving the self target to gradually become optimal by learning a competitor strategy; finally, a dynamic game computing process of a multi-virtual power plant is researched, and a dynamic game particle swarm optimization algorithm is provided to be combined with an optimization tool box to solve a game model; the solution of the dynamic game model has good convergence, and a new idea and reference are provided for the virtual power plant to formulate different transaction targets to participate in market transaction.
Drawings
Fig. 1 is a flow chart of a multi-virtual power plant dynamic game transaction behavior analysis method based on rationality.
FIG. 2 is a system block diagram of a multiple virtual power plant.
FIG. 3 is a schematic diagram of different transaction objectives of multiple virtual power plants.
Fig. 4 is a multi-virtual power plant dynamic game solving process.
Fig. 5 is a schematic diagram of the algorithm convergence process.
Fig. 6 is a schematic diagram of transaction electricity price optimization.
Fig. 7 is a schematic diagram of the transaction result of the virtual power plant 1.
Fig. 8 is a schematic diagram of the transaction result of the virtual power plant 2.
Fig. 9 is a schematic diagram of the transaction result of the virtual power plant 3.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to fig. 1 to 9 of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Example (b):
a multi-virtual power plant dynamic game transaction behavior analysis method based on limited rationality is characterized by comprising the following steps:
each virtual power plant bidding main body fully considers the target requirements of the development stage of the main body and researches the dynamic pricing behaviors of upper-layer operators;
adopting different trading target modeling to analyze trading behaviors of the multiple virtual power plants;
carrying out evolutionary learning on the limited rational transaction behavior information based on a particle swarm algorithm, and further improving the self target to gradually become optimal by learning a competitor strategy;
the dynamic game computing process of the multi-virtual power plant is researched, and a dynamic game particle swarm optimization algorithm is provided to be combined with an optimization tool box to solve a game model.
Considering the influence of different stages of VPP development and decision behaviors of competitors on self optimization decision, a decision scheme with different targets of multiple VPPs is established. The method mainly considers the power consumption including the power purchasing and selling in the VPP, the distributed power supply and other output as strategies, and designs a plurality of VPP bidding decision schemes with different targets, wherein firstly, the VPP possibly selects different transaction targets and decision schemes by considering the power consumption difference of users in VPPs in different areas and the pursuit of VPP alliance targets to be different; and secondly, in the process of playing the game, each main body can change own behavior rules after integrating market transaction behaviors of other competitors to convert targets. In the process of playing the game, the information of competitors is continuously acquired through evolutionary learning. The strategy made along with the increasing of self-acquired information can be gradually optimized. Eventually, no principal can benefit unilaterally from changing bidding strategies, i.e., a stable strategy is achieved.
The stable equilibrium strategy of multi-benefit agents under limited rational conditions can be defined as: in dynamic gaming with N principals of interest, there is a stable policy combination X ═ X (X)1,X2,...,XN) For any other policy combination Y ═ (Y)1,Y2,...,YN) I Y ≠ X, there are main bodies i and0e (0,1) is such that:
Figure BDA0002638655300000091
in the formula: i (Y)i,S-i) And I (X)i,S-i) Respectively adopting Y for main body iiAnd XiTarget utility conditions at the time of the policy; s-iIs a combination of policies for other entities of the system than entity i.
The VPP game strategy considers the output plan of each distributed power supply in the VPP, the charging and discharging power of stored energy, the response power of a response load, and the output power of clean energy such as wind and light. The VPP obtains the system load demand and possible bidding strategies of competitors according to the limited market information and other VPP bidding historical data which are mastered at present, so that the maximum self-effectiveness is realized. It should be noted that the "utility" herein is different from the utility function of the conventional optimized scheduling, and refers to making an optimal strategy according to the development of itself.
The game objective function of the conservative VPP is considered to maximize self-benefit, and internal resources including the costs of distributed power generation, energy storage, responsive load compensation and the like are fully integrated. And (5) deciding an optimal bidding strategy. The VPP bidding objective function is:
Figure BDA0002638655300000092
Figure BDA0002638655300000093
Figure BDA0002638655300000094
Figure BDA0002638655300000095
in the formula: lambda [ alpha ]D-V,sell、λD-V,buyRespectively selling electricity price and purchasing electricity price from VPP to DSO;
Figure BDA0002638655300000096
the electric quantity purchased and sold by the VPP to the DSO respectively; n is a VPP total number set, and VPPi belongs to N; n is a radical ofiIs DER set contained in ith VPP;
Figure BDA0002638655300000097
for the cost of power generation by MT in DER, aj、bj、cjFor the cost factor of the MT unit in DERj,
Figure BDA0002638655300000101
output power for MT;
Figure BDA0002638655300000102
in order to be a cost of the ESS operation,
Figure BDA0002638655300000103
for the scheduling cost factor of the ESS,
Figure BDA0002638655300000104
the charging and discharging power of the ESS, the value of which is more than 0 represents discharging, and the value of which is less than 0 represents charging;
Figure BDA0002638655300000105
in order to be able to respond to the load cost,
Figure BDA0002638655300000106
in order to compensate the electricity prices in response to the load,
Figure BDA0002638655300000107
response power for the response load; theta is subsidy electricity price of the government for new energy power generation;
Figure BDA0002638655300000108
respectively photovoltaic power output and wind power output. The main parties in this document consider operational benefits, neglecting the relatively low operating and maintenance costs of new energy generation.
The game objective function of the radical VPP is considered as the maximization of the market share scale, and the electric quantity bidding is carried out in the market in a maximized mode on the premise that internal resources are integrated to meet the constraint output of each unit, so that the exchange electric quantity is obtained on the DSO level as much as possible, and the maximum power purchase quantity is obtained. The VPP bidding objective function is:
Figure BDA0002638655300000109
bidding constraint:
Figure BDA00026386553000001010
Figure BDA00026386553000001011
in the formula:
Figure BDA00026386553000001012
the power purchased from the DSO for the ith VPP,
Figure BDA00026386553000001013
power sold from the DSO for the ith VPP;
Figure BDA00026386553000001014
the maximum value of the purchased and sold electric quantity bidding is respectively carried out for the VPPi.
After the DSO unified purchase and sale electricity price and the transaction electricity quantity are obtained, on the premise that the objective function formula (5) is met, other schedulable resources in the VPP are coordinated, the surplus load electricity quantity shortage is met, and the electricity quantity shortage scheduling principle meets the lowest cost principle.
Figure BDA00026386553000001015
The game objective function of the stable VPP is considered to be the maximum utilization of renewable energy, the renewable energy is considered to be wind power and photovoltaic, meanwhile, the uncertainty of the power generation of the renewable energy is fully considered, and the influence of the fluctuation of the output of the renewable energy on the power supply stability of the system is stabilized by using the ESS. Then, the output conditions of other units in the VPP and the electricity quantity bidding to the upper DSO are considered, and the VPP bidding objective function is as follows:
Figure BDA0002638655300000111
Figure BDA0002638655300000112
Figure BDA0002638655300000113
Figure BDA0002638655300000114
in the formula: alpha and beta are objective function weight coefficients; rhoj,tThe wind, light and storage combined electricity selling price is obtained;
Figure BDA0002638655300000115
wind, light and storage combined output; f is the fluctuation variance of the wind, light and storage combined output power; f. of0The minimum allowed fluctuation variance value of the system.
1) And power balance constraint:
Figure BDA0002638655300000116
in the formula:
Figure BDA0002638655300000117
electricity usage that is unresponsive load;
Figure BDA0002638655300000118
is the output power of VPPi.
2) Gas turbine set constraint:
Figure BDA0002638655300000119
Figure BDA00026386553000001110
in the formula:
Figure BDA00026386553000001111
the maximum power and the minimum power of the MT output are respectively the upper limit and the lower limit of the maximum power and the minimum power; r isd,i、ru,iThe downward slope climbing rate and the upward slope climbing rate of the unit j are respectively; Δ T is an operating period.
3) Responsive to load constraints:
Figure BDA00026386553000001112
in the formula:
Figure BDA00026386553000001113
is the maximum response quantity of the responsive load.
4) And (3) renewable energy output constraint:
Figure BDA00026386553000001114
Figure BDA0002638655300000121
in the formula:
Figure BDA0002638655300000122
respectively, the maximum value of the wind-solar output predicted according to historical data.
5) Energy storage restraint:
Figure BDA0002638655300000123
Figure BDA0002638655300000124
Figure BDA0002638655300000125
Figure BDA0002638655300000126
in the formula:
Figure BDA0002638655300000127
a 0-1 variable for ESS discharge and charge status; n is a radical ofESSAn upper limit for the number of discharge and charge state transitions for the ESS;
Figure BDA0002638655300000128
the upper and lower limits of the discharge power;
Figure BDA0002638655300000129
the charging power is the upper and lower limits;
Figure BDA00026386553000001210
is the capacity state of the ESS in the t period;
Figure BDA00026386553000001211
upper and lower capacity limits for consideration of ESS life and other factors;
Figure BDA00026386553000001212
respectively charge and discharge efficiency coefficient, the value range of which is
Figure BDA00026386553000001213
DSO establishes electricity selling price lambda for VPPD-V,sellAnd the electricity purchase price lambdaD-V,buy. The DSO takes the maximization of self-benefit as a game target, and comprises the electricity purchasing and selling cost and benefit between the DSO and an upper-level power grid and a lower-level multi-VPP.
maxuDSO=FVPP-FDSO
In the formula: fVPPRevenue obtained for transactions between the DSO and the VPP; fDSOThe cost spent for the DSO to transact with the upper level grid. Specific expressions of the terms are as follows:
Figure BDA00026386553000001214
Figure BDA00026386553000001215
in the formula: t is a total scheduling time interval; n is a radical ofVPPIn order to be the number of the VPPs,
Figure BDA00026386553000001216
for the real-time electricity rates established by the DSO,
Figure BDA00026386553000001217
a power value for the ith VPP to interact with the DSO in the tth scheduling period;
Figure BDA0002638655300000131
for the border node electricity prices between the DSO and the upper grid,
Figure BDA0002638655300000132
the value of the transmission power between the DSO and the upper power grid is greater than 0, which indicates that the power is purchased from the upper power grid, and less than 0 indicates that the power is sold to the upper power grid.
After obtaining the competitive bidding data of the competitor, the competitor strategy judgment index coefficient is as follows:
Figure BDA0002638655300000133
Figure BDA0002638655300000134
Figure BDA0002638655300000135
in the formula: xi123Index coefficient for judging competitor strategy, when xi1≥ξlim1When the measured value is "aggressive", when xi is2≤ξlim2When it is stable, the product is judged to be stable3≤ξlim3The time is judged to be 'conservative', wherein xi islim1lim2lim3And (4) judging a constant for the index coefficient, and obtaining through historical data of each VPP bidding. If the strategy index judgment is not within the index coefficient judgment constant limit value at the end of one game round, the strategy result is not output, and the next iteration round is continued.
Each VPP determines the total benefit of the VPP through the selection of the game strategy through evolutionary learning. Each strategy corresponds to a transaction behavior mode under the strategy and is used by rhomnRepresents the conditional switch state of VPPi trade behavior pattern x (u) from policy m to policy n. After obtaining the competitive bidding information of competitors, VPPi learns to evolve the strategy of the competitors. If policy n has a higher utility than policy m, then policy m will learn policy n and make changes. Expressed as:
Figure BDA0002638655300000136
Figure BDA0002638655300000137
in the formula: x is the number ofm(u) and xn(u) transaction behavior patterns of VPPi under policies m and n, respectively, when
Figure BDA0002638655300000138
If the value is more than 0, updating the VPP strategy according to the condition switching state, and if the value is more than 0, updating the VPP strategy according to the condition switching state
Figure BDA0002638655300000139
When the value is always less than 0, a stable equilibrium strategy is achieved; u. ofmAnd unRespectively representing the utility of the corresponding trading behavior modes of the strategies m and n, un-um]+Is max {0, un-um}。
Referring to fig. 1, fig. 1 is a flowchart of a method for analyzing dynamic game transaction behaviors of a multi-virtual power plant based on rationality; the flowchart includes steps 101 to 104;
in step 101, each virtual power plant bidding main body fully considers the target demand of the development stage of the main body and researches the dynamic pricing behavior of an upper-layer operator;
in step 102, transaction behavior analysis is carried out on the multiple virtual power plants by adopting different transaction target modeling;
in one embodiment of the present invention, each federated VPP may experience different types of electricity purchase and sale states throughout the operation of the power distribution system:
the method comprises three types of electricity surplus (electricity selling VPP), electricity shortage (electricity purchasing VPP) and electricity self-sufficiency. Therefore, there is a demand for power purchase in a multi-VPP system.
The VPP comprehensively considers the output of each unit, the power generation of renewable energy sources, the electricity price information, the strategy information of other operators and the like on the premise of meeting the internal power supply of the VPP, coordinates the internal resources, establishes the target which best meets the self development, and plays games with other competitors under the condition of meeting various constraints.
The DSO layer scheduling center comprehensively considers the electric quantity information uploaded by each VPP of the lower layer, considers the maximized social benefit, formulates the uniform market purchase and sale electricity price and the transaction electric quantity of each VPP, reasonably adjusts the internal resource output of the VPPs according to the electricity price and the transaction electric quantity formulated by the DSO to each VPP, and formulates the transaction electric quantity with the DSO.
In step 103, evolutionary learning is carried out on the limited rational transaction behavior information based on the particle swarm optimization, and the competitor strategy is learned to further improve the self target so that the self target is gradually optimized;
after the game starts, the participants can learn and improve according to each round of strategy of the two parties, and the game is a continuous and repeated continuous transaction process.
And (3) taking each VPP in the market as an individual to participate in the game, respectively considering decision information of other competitors to make an objective function strategy for maximizing the self utility, solving the self optimal decision by the VPPi after obtaining the competitive bidding data of the competitors, and repeating the i ∈ N to obtain the first round game strategy of each main body.
And each main body updates the previous game strategy until each main body can not benefit from changing the bidding strategy in a unilateral way, and the final stable and balanced state is achieved.
In step 104, a dynamic game computing process of the multi-virtual power plant is researched, and a dynamic game particle swarm optimization algorithm is provided to solve a game model in combination with an optimization tool box.
In one embodiment of the invention, fig. 4 illustrates a solving process of a multi-virtual power plant dynamic game.
In the model solution, DSO gives the purchase price. In the game process, each VPP learns continuously acquired electricity price information and strategy information formulated by other competitors to select the optimal solution of the most suitable self-transaction target.
In the DSO layer, the electricity price strategy is used as a particle, the income under the electricity price is used as the fitness of the particle, and the optimal electricity price strategy is searched by using an intelligent optimization algorithm. The whole solving process can be regarded as two parts of DSO electricity price optimizing and multi-VPP strategy learning solving. An improved particle swarm algorithm is combined with a Yalmip platform to call a CPLEX solver to solve the dynamic game evolutionary learning model of the multiple VPPs with different transaction targets.
Referring to fig. 2, fig. 2 is a system configuration diagram of a multiple virtual power plant. The VPP mainly comprises a micro gas turbine set, a wind turbine set, a photovoltaic system, an energy storage system, a responsive load and the like. The micro gas turbine set has stable power output, can respond quickly and can flexibly meet the requirements required and met by a load change system, so that the micro gas turbine set is used as a schedulable distributed power supply inside a VPP.
The VPP is used as a whole to externally add DSO and schedule a power market, and is mainly used for carrying out coordinated operation control on a plurality of DER inside the VPP and carrying out negotiation interaction on a plurality of parties such as user loads and equipment maintenance managers. The VPP regulates and controls each internal resource and carries out energy and information interaction on the upper DSO through the energy management platform.
The energy management platform has the functions of day-ahead prediction, decision optimization, real-time monitoring and the like, and can store and process DER internal load historical power utilization conditions, output conditions of each power supply, user information, weather data and upper-layer DSO price information in a database, so that VPPs can make more accurate decisions.
Referring to fig. 3, firstly, considering that the difference of power consumption of users in different areas is different and the target pursuit of VPP alliance is different, the VPP may select different transaction target and decision scheme; and secondly, in the process of playing the game, each main body can change own behavior rules after integrating market transaction behaviors of other competitors to convert targets. An objective function of different transaction objective types is formulated for the VPP.
In one embodiment of the present invention, the internal constituent elements of each VPP are shown in Table 1. And selecting a corresponding cost function, constraint conditions and an initial objective function to form a limited rational game model containing 3 VPPs. Based on the example system, after the DSO receives bidding strategies submitted by various VPP operators, market trading power rates are determined, and social benefits are maximized. After 19 rounds of games, the selection strategy of each VPP tends to be stable after learning, and the obtained income also tends to be stable. Each VPP cannot change own strategy from one side to obtain more benefits, and at the moment, each main strategy target reaches the optimal equilibrium solution.
TABLE 1 VPP internal resources
Figure BDA0002638655300000151
In one embodiment of the invention, the following two operating strategies are set:
case 1: and the power grid electricity price and the internet electricity price are used as the purchase and sale electricity price, the VPPs are independently optimized by adopting a non-cooperative game, and the VPPs are respectively optimized by selecting a response target strategy.
Case 2: competitive bidding strategies of competitors are evolved and learned among VPPs, the rationality degree is improved, and meanwhile, the price optimization is carried out, namely, the game model provided by the invention is adopted.
As can be seen from the simulation result of FIG. 5, the dynamic game equilibrium solution for solving different transaction targets of the multi-virtual power plant by the algorithm of the invention is effective and convergent, and can achieve good solving precision and calculation efficiency.
In two scenarios, the DSO makes a purchase and sale electricity price as shown in fig. 6, a state of 3 exists in the VPP at each moment, and if the transaction electricity quantity is greater than 0, it indicates that the VPP sells electricity to the DSO, which is a "electricity selling VPP"; if the transaction electric quantity is less than 0, the VPP purchases electricity from the DSO, and the electricity purchasing VPP is determined; if the transaction electric quantity is equal to 0, the supply and demand in the VPP are balanced, and the VPP is a self-balancing VPP (in the figure 8, the drawing is convenient, the transaction electric quantity is positive to represent electricity purchasing, and negative to represent electricity selling). At this time, if the change of the remaining competitor policy causes the change of the purchase and sale price, the change of the VPP purchase and sale policy may be caused.
7-9 show that VPP1 adopts a 'conservative' type to aim at maximizing income per se, VPP1 supplies more than demand in time periods 4-19, so electricity is sold to the superior operator, and the rest time periods have electricity shortage, so electricity needs to be purchased to the superior operator; the VPP2 adopts an 'aggressive type' to maximize the electricity purchasing of the superior operator and improve the operation scale of the VPP, and except for time periods 8-10 and 16, the electricity purchasing is carried out on the superior operator in other time periods. VPP2 has the greatest demand during peak load periods compared to VPP1 and VPP 3. Meanwhile, load increment and reduction which are most beneficial to self scale expansion are set through RL regulation and control, and the occupation ratio of the load increment and the reduction to the electric quantity transaction of a superior operator is increased. The VPP3 adopts stable type to consider renewable energy sources to generate power maximally, and simultaneously ensures the power supply stability of the VPP3, so that a wind-light-storage combined power generation unit is established. As can be seen from fig. 9, after the output fluctuation of the renewable energy is stabilized by the energy storage unit through the output scheduling of the joint unit, the stability of the presented power curve is greatly improved. In the time period 15-18, a certain amount of wind curtailment light curtailment is generated in order to keep the fluctuation variance from exceeding the limit due to the energy storage unit capacity and the charge-discharge power limit per time period. In the time periods 9-12 and 16-18, the power is required to be purchased by the superior operators to meet the power balance due to the shortage of the self-electricity.
Table 2 shows the revenue comparison of DSO using the Case1 electricity pricing strategy and the Case2 electricity pricing strategy, and Table 3 shows the revenue comparison (units/dollars) of each VPP using the Case1 electricity pricing strategy and the Case2 electricity pricing strategy.
TABLE 2 comparison of benefits of different electricity price strategies for DSO
Figure BDA0002638655300000161
Figure BDA0002638655300000171
TABLE 3 VPP different Electricity price policy revenue comparison
Figure BDA0002638655300000172
By comparison, after the DSO is optimized by electricity price, the electricity purchasing and selling quantity of each VPP is increased and balanced. Compared with the trading of the power grid price, the trading yield between the DSO and the VPP is improved by 7.8%. Under the optimal target operation condition of the game, the VPPs can reduce the operation cost and achieve the new balance of transaction.
The established dynamic game model considering the participation of the multiple virtual power plants under the limited rationality condition of different transaction targets obtains the following conclusion by carrying out example analysis on the multiple virtual power plant system integrating different distributed energy sources: 1) compared with the traditional single objective function with the minimum operation cost, the method can effectively meet the diversified development requirements of each virtual power plant in different stages of development in the future; 2) the established limited rational transaction behavior learning enables the VPP to achieve the improvement of rationality degree through the integration of internal resources and the learning of competitive bidding strategies of competitors, and can also generate larger influence on competitive bidding results. Enabling each VPP to gain greater utility in market operations through trading; 3) the intelligent algorithm and the lower-layer mixed integer programming problem are combined to be solved to obtain the equilibrium solution, and compared with the traditional intelligent algorithm used alone, the method greatly saves the solving time and can quickly converge.
The above are preferred embodiments of the present invention, and all changes made according to the technical scheme of the present invention that produce functional effects do not exceed the scope of the technical scheme of the present invention belong to the protection scope of the present invention.

Claims (8)

1. A multi-virtual power plant dynamic game transaction behavior analysis method based on limited rationality is characterized by comprising the following steps:
s1: each virtual power plant bidding main body researches dynamic pricing behaviors of upper operators according to target requirements of a development stage of the virtual power plant bidding main body;
s2: adopting different trading target modeling to analyze trading behaviors of the multiple virtual power plants;
s3: carrying out evolutionary learning on the limited rational transaction behavior information based on a particle swarm algorithm, and gradually optimizing the target of the user by learning a competitor strategy;
s4: the dynamic game computing process of the multi-virtual power plant is researched, and a dynamic game particle swarm optimization algorithm is provided to be combined with an optimization tool box to solve a game model.
2. The limited rationality-based multi-virtual power plant dynamic game transaction behavior analysis method is characterized in that a multi-VPP decision scheme with different targets is established according to the influence of different stages of VPP development and decision behaviors of competitors on self optimization decisions; considering the power purchasing and selling quantity and the distributed power output in a VPP as strategies, designing a plurality of VPP bidding decision schemes with different targets, wherein the VPP can select different transaction targets and decision schemes by considering the difference of power consumption of users in different areas and the different pursuits of VPP alliance targets; secondly, in the process of playing the game, each main body can change the behavior rule of the main body after integrating the market transaction behaviors of other competitors to convert the target; in the game playing process, information of competitors can be continuously acquired through evolutionary learning; the strategy made along with the continuous increase of the self-acquired information can be gradually optimized; eventually, no principal can benefit unilaterally from changing bidding strategies, i.e., a stable strategy is achieved.
3. The rational-limited-based multi-virtual-power-plant dynamic game transaction behavior analysis method according to claim 2, wherein a stability balancing strategy of a multi-benefit agent under the rational-limited condition is defined as: in N stakeholdersIn the body-played dynamic game, a stable strategy combination X ═ X (X) exists1,X2,...,XN) For any other policy combination Y ═ (Y)1,Y2,...,YN) I Y ≠ X, there are main bodies i and0e (0,1) is such that:
Figure FDA0002638655290000011
in the formula: i (Y)i,S-i) And I (X)i,S-i) Respectively adopting Y for main body iiAnd XiTarget utility conditions at the time of the policy; s-iIs a combination of policies for other entities of the system than entity i.
4. The rational-limited-based multi-virtual-power-plant dynamic game transaction behavior analysis method according to claim 3, wherein the game strategy of the VPP considers the output plan of each distributed power supply in the VPP, the charging and discharging power of stored energy, the response power of a responsive load, and the output power of clean energy such as wind and light; the VPP obtains the system load demand and possible bidding strategies of competitors according to the limited market information and other VPP bidding historical data which are mastered at present, so that the maximum self-effectiveness is realized.
5. The limited rationality-based multi-virtual power plant dynamic game transaction behavior analysis method according to claim 1, wherein the different transaction objectives in step S2 include:
the game objective function of the conservative VPP is considered as maximizing self-benefit, and internal resources including distributed power generation cost, energy storage cost and responsive load compensation cost are fully integrated; deciding an optimal bidding strategy; the VPP bidding objective function is:
Figure FDA0002638655290000021
Figure FDA0002638655290000022
Figure FDA0002638655290000023
Figure FDA0002638655290000024
in the formula: lambda [ alpha ]D-V,sell、λD-V,buyRespectively selling electricity price and purchasing electricity price from VPP to DSO;
Figure FDA0002638655290000025
the electric quantity purchased and sold by the VPP to the DSO respectively; n is a VPP total number set, and VPPi belongs to N; n is a radical ofiIs DER set contained in ith VPP;
Figure FDA0002638655290000026
for the cost of power generation by MT in DER, aj、bj、cjFor the cost factor of the MT unit in DERj,
Figure FDA0002638655290000027
output power for MT;
Figure FDA0002638655290000028
in order to be a cost of the ESS operation,
Figure FDA0002638655290000029
for the scheduling cost factor of the ESS,
Figure FDA00026386552900000210
the charging and discharging power of the ESS, the value of which is more than 0 represents discharging, and the value of which is less than 0 represents charging;
Figure FDA00026386552900000211
in order to be able to respond to the load cost,
Figure FDA00026386552900000212
in order to compensate the electricity prices in response to the load,
Figure FDA00026386552900000213
response power for the response load; theta is subsidy electricity price of the government for new energy power generation;
Figure FDA00026386552900000214
respectively photovoltaic power output and wind power output;
the game objective function of the aggressive VPP is considered as the maximization of the market share scale, and the electric quantity bidding is carried out in the market in a maximized manner on the premise that the internal resources are integrated to meet the constraint output of each unit, so that the exchange electric quantity is obtained on the DSO level as much as possible, and the maximum power purchase quantity is obtained; the VPP bidding objective function is:
Figure FDA0002638655290000031
bidding constraint:
Figure FDA0002638655290000032
Figure FDA0002638655290000033
in the formula:
Figure FDA0002638655290000034
the power purchased from the DSO for the ith VPP,
Figure FDA0002638655290000035
coming out of DSO for ith VPP(ii) power sold;
Figure FDA0002638655290000036
maximum value of purchasing and selling electric quantity bidding is carried out for VPPi respectively;
after the DSO uniform purchase and sale electricity price and the transaction electricity quantity are obtained, under the premise that an aggressive VPP objective function is met, other schedulable resources in the VPP are coordinated to meet the surplus load electricity quantity, and an electricity quantity shortage scheduling principle meets a lowest cost principle;
Figure FDA0002638655290000037
the game objective function of the stable VPP is considered as the maximum utilization of renewable energy, the renewable energy is considered as wind power and photovoltaic, meanwhile, the uncertainty of the power generation of the renewable energy is fully considered, and the influence of the fluctuation of the output of the renewable energy on the power supply stability of the system is stabilized by using the ESS; then, the output conditions of other units in the VPP and the electricity quantity bidding to the upper DSO are considered, and the VPP bidding objective function is as follows:
Figure FDA0002638655290000038
Figure FDA0002638655290000039
Figure FDA00026386552900000310
Figure FDA00026386552900000311
in the formula: alpha and beta are objective function weight coefficients; rhoj,tThe wind, light and storage combined electricity selling price is obtained;
Figure FDA00026386552900000312
wind, light and storage combined output; f is the fluctuation variance of the wind, light and storage combined output power; f. of0The minimum allowed fluctuation variance value of the system.
6. The limited rationality-based multi-virtual power plant dynamic game transaction behavior analysis method according to claim 5, wherein when the VPP performs decision response, no matter which transaction target is executed, the constraint conditions required to be met include:
and power balance constraint:
Figure FDA0002638655290000041
in the formula:
Figure FDA0002638655290000042
electricity usage that is unresponsive load;
Figure FDA0002638655290000043
is the output power of VPPi;
gas turbine set constraint:
Figure FDA0002638655290000044
Figure FDA0002638655290000045
in the formula:
Figure FDA0002638655290000046
the maximum power and the minimum power of the MT output are respectively the upper limit and the lower limit of the maximum power and the minimum power; r isd,i、ru,iThe downward slope climbing rate and the upward slope climbing rate of the unit j are respectively; Δ T is oneAn operational period;
responsive to load constraints:
Figure FDA0002638655290000047
in the formula:
Figure FDA0002638655290000048
the maximum response quantity of the responsive load;
and (3) renewable energy output constraint:
Figure FDA0002638655290000049
Figure FDA00026386552900000410
in the formula:
Figure FDA00026386552900000411
respectively predicting the maximum value of wind and light output according to historical data;
energy storage restraint:
Figure FDA00026386552900000412
Figure FDA00026386552900000413
Figure FDA00026386552900000414
Figure FDA0002638655290000051
in the formula:
Figure FDA0002638655290000052
a 0-1 variable for ESS discharge and charge status; n is a radical ofESSAn upper limit for the number of discharge and charge state transitions for the ESS;
Figure FDA0002638655290000053
the upper and lower limits of the discharge power;
Figure FDA0002638655290000054
the charging power is the upper and lower limits;
Figure FDA0002638655290000055
is the capacity state of the ESS in the t period;
Figure FDA0002638655290000056
upper and lower capacity limits for consideration of ESS life and other factors;
Figure FDA0002638655290000057
respectively charge and discharge efficiency coefficient, the value range of which is
Figure FDA0002638655290000058
7. The limited rationality-based multi-virtual power plant dynamic gaming transaction behavior analysis method of claim 6, wherein the transaction is considered at the DSO level as:
DSO establishes electricity selling price lambda for VPPD-V,sellAnd the electricity purchase price lambdaD-V,buy(ii) a The DSO takes the maximized self-benefit as a game target, including the electricity purchasing and selling cost and benefit between the DSO and a superior power grid and a lower multi-VPP;
maxuDSO=FVPP-FDSO
in the formula: fVPPIs DSO andrevenue obtained from transactions between VPPs; fDSOCosts spent in trading DSO with superior grids; specific expressions of the terms are as follows:
Figure FDA0002638655290000059
Figure FDA00026386552900000510
in the formula: t is a total scheduling time interval; n is a radical ofVPPIn order to be the number of the VPPs,
Figure FDA00026386552900000511
for the real-time electricity rates established by the DSO,
Figure FDA00026386552900000512
a power value for the ith VPP to interact with the DSO in the tth scheduling period; lambda [ alpha ]t DSOFor the boundary node between DSO and the upper grid, Pt DSOThe value of the transmission power between the DSO and the upper power grid is greater than 0, which indicates that the power is purchased from the upper power grid, and less than 0 indicates that the power is sold to the upper power grid.
8. The limited rationality-based multi-virtual power plant dynamic game transaction behavior analysis method according to claim 7, wherein after competitor bidding data is obtained, the competitor strategy judgment index coefficients are as follows:
Figure FDA0002638655290000061
Figure FDA0002638655290000062
Figure FDA0002638655290000063
in the formula: xi123Index coefficient for judging competitor strategy, when xi1≥ξlim1When the measured value is "aggressive", when xi is2≤ξlim2When it is stable, the product is judged to be stable3≤ξlim3The time is judged to be 'conservative', wherein xi islim1lim2lim3Determining a constant for the index coefficient, and obtaining the constant through historical data of each VPP bidding; if the strategy index judgment is not within the index coefficient judgment constant limit value at the end of one game round, the strategy result is not output, and the next iteration round is continued;
each VPP determines the total benefit of the VPP through selecting a game strategy through evolutionary learning; each strategy corresponds to a transaction behavior mode under the strategy and is used by rhomnRepresents the condition switching state of VPPi trading behavior mode x (u) from strategy m to strategy n; after obtaining the competitive bidding information of competitors, VPPi learns and evolves the strategy of the competitors; if policy n has a higher utility than policy m, then policy m will learn policy n and make changes; expressed as:
Figure FDA0002638655290000064
Figure FDA0002638655290000065
in the formula: x is the number ofm(u) and xn(u) transaction behavior patterns of VPPi under policies m and n, respectively, when
Figure FDA0002638655290000066
If the value is more than 0, updating the VPP strategy according to the condition switching state, and if the value is more than 0, updating the VPP strategy according to the condition switching state
Figure FDA0002638655290000067
When the value is always less than 0, a stable equilibrium strategy is achieved; u. ofmAnd unRespectively representing the utility of the corresponding trading behavior modes of the strategies m and n, un-um]+Is max {0, un-um}。
CN202010832963.3A 2020-08-18 2020-08-18 Multi-virtual power plant dynamic game transaction behavior analysis method based on limited rationality Pending CN112001752A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010832963.3A CN112001752A (en) 2020-08-18 2020-08-18 Multi-virtual power plant dynamic game transaction behavior analysis method based on limited rationality

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010832963.3A CN112001752A (en) 2020-08-18 2020-08-18 Multi-virtual power plant dynamic game transaction behavior analysis method based on limited rationality

Publications (1)

Publication Number Publication Date
CN112001752A true CN112001752A (en) 2020-11-27

Family

ID=73473888

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010832963.3A Pending CN112001752A (en) 2020-08-18 2020-08-18 Multi-virtual power plant dynamic game transaction behavior analysis method based on limited rationality

Country Status (1)

Country Link
CN (1) CN112001752A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113112287A (en) * 2021-03-18 2021-07-13 国电南瑞南京控制系统有限公司 Virtual power plant energy management method and device based on micro-grid scheduling
CN113610357A (en) * 2021-07-16 2021-11-05 远景智能国际私人投资有限公司 Transaction flow management method, device, equipment and medium for virtual power plant
CN115796406B (en) * 2023-02-13 2023-04-18 浙江浙能能源服务有限公司 Optimal adjustment method and system for virtual power plant
CN115995850A (en) * 2023-03-06 2023-04-21 华北电力大学 Collaborative scheduling optimization method and device for virtual power plant group
CN117541300A (en) * 2024-01-08 2024-02-09 国网浙江省电力有限公司宁波供电公司 Virtual power plant transaction management method, system, equipment and storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006158189A (en) * 2004-11-04 2006-06-15 Tokyo Electric Power Co Inc:The Cost minimization power control system using combination of electric power transaction and distributed power supply
CN108734396A (en) * 2018-05-16 2018-11-02 国网山东省电力公司电力科学研究院 A kind of virtual plant method for optimizing scheduling bidded and provided multiple forms of energy to complement each other based on Demand-side
CN108921368A (en) * 2018-05-03 2018-11-30 东南大学 Balanced cooperative game controller based on virtual power plant
CN108960510A (en) * 2018-07-04 2018-12-07 四川大学 A kind of virtual plant optimization trading strategies model based on two stage stochastic programming
CN109902884A (en) * 2019-03-27 2019-06-18 合肥工业大学 A kind of virtual plant Optimization Scheduling based on leader-followers games strategy
CN110135613A (en) * 2018-10-23 2019-08-16 上海交通大学 It is a kind of based on receive assorted negotiation more virtual plants collaboration optimization operating scheme
CN111402015A (en) * 2020-03-17 2020-07-10 南京工业大学 Virtual power plant double-layer bidding method and system based on purchasing and selling risks

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006158189A (en) * 2004-11-04 2006-06-15 Tokyo Electric Power Co Inc:The Cost minimization power control system using combination of electric power transaction and distributed power supply
CN108921368A (en) * 2018-05-03 2018-11-30 东南大学 Balanced cooperative game controller based on virtual power plant
CN108734396A (en) * 2018-05-16 2018-11-02 国网山东省电力公司电力科学研究院 A kind of virtual plant method for optimizing scheduling bidded and provided multiple forms of energy to complement each other based on Demand-side
CN108960510A (en) * 2018-07-04 2018-12-07 四川大学 A kind of virtual plant optimization trading strategies model based on two stage stochastic programming
CN110135613A (en) * 2018-10-23 2019-08-16 上海交通大学 It is a kind of based on receive assorted negotiation more virtual plants collaboration optimization operating scheme
CN109902884A (en) * 2019-03-27 2019-06-18 合肥工业大学 A kind of virtual plant Optimization Scheduling based on leader-followers games strategy
CN111402015A (en) * 2020-03-17 2020-07-10 南京工业大学 Virtual power plant double-layer bidding method and system based on purchasing and selling risks

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
HONGJUN GAO等: "Bounded Rationality Based Multi-VPP Trading in Local Energy Markets: A Dynamic Game Approach with Different Trading Targets" *
刘继春等: "考虑多个虚拟发电厂参与的多阶段市场交易方法" *
周博;吕林;高红均;刘俊勇;陈庆攀;谭心怡;: "多虚拟电厂日前鲁棒交易策略研究" *
周博等: "考虑热电联合调度的虚拟电厂交易策略研究" *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113112287A (en) * 2021-03-18 2021-07-13 国电南瑞南京控制系统有限公司 Virtual power plant energy management method and device based on micro-grid scheduling
CN113610357A (en) * 2021-07-16 2021-11-05 远景智能国际私人投资有限公司 Transaction flow management method, device, equipment and medium for virtual power plant
CN115796406B (en) * 2023-02-13 2023-04-18 浙江浙能能源服务有限公司 Optimal adjustment method and system for virtual power plant
CN115995850A (en) * 2023-03-06 2023-04-21 华北电力大学 Collaborative scheduling optimization method and device for virtual power plant group
CN117541300A (en) * 2024-01-08 2024-02-09 国网浙江省电力有限公司宁波供电公司 Virtual power plant transaction management method, system, equipment and storage medium

Similar Documents

Publication Publication Date Title
CN112001752A (en) Multi-virtual power plant dynamic game transaction behavior analysis method based on limited rationality
CN110276698B (en) Distributed renewable energy transaction decision method based on multi-agent double-layer collaborative reinforcement learning
CN110147907B (en) Method for constructing virtual power plant double-layer optimization model considering network constraints
Li et al. Trading strategy and benefit optimization of load aggregators in integrated energy systems considering integrated demand response: A hierarchical Stackelberg game
CN112467722B (en) Active power distribution network source-network-load-storage coordination planning method considering electric vehicle charging station
Maity et al. Simulation and pricing mechanism analysis of a solar-powered electrical microgrid
Zhang et al. Optimal bidding strategy and profit allocation method for shared energy storage-assisted VPP in joint energy and regulation markets
CN111008739B (en) Optimal regulation and control and income distribution method and system for cogeneration virtual power plant
CN113592648B (en) Multi-main-body transaction method and system of comprehensive energy system
CN109919450B (en) Game optimization method for solving scheduling of comprehensive intelligent energy system
CN112036934A (en) Quotation method for participation of load aggregators in demand response considering thermoelectric coordinated operation
CN112217195B (en) Cloud energy storage charging and discharging strategy forming method based on GRU multi-step prediction technology
CN112564102B (en) Multi-microgrid load optimization scheduling method and system
CN111311012A (en) Multi-agent-based micro-grid power market double-layer bidding optimization method
CN115147245B (en) Virtual power plant optimal scheduling method for industrial load participating in peak shaving auxiliary service
Gao et al. Bounded rationality based multi-VPP trading in local energy markets: a dynamic game approach with different trading targets
CN114256836A (en) Capacity optimization configuration method for new energy power station shared energy storage
CN115313520A (en) Distributed energy system game optimization scheduling method, system, equipment and medium
CN110322092B (en) Source-network-load multi-layer game economic dispatching method
CN116961008A (en) Micro-grid capacity double-layer optimization method considering power spring and load demand response
CN114188940B (en) Active power distribution network scheduling method considering benefits of multiple subjects
CN116432862A (en) Multi-main-body game optimization method and device for renewable energy micro-grid
CN115358633A (en) Optimized game method for realizing coordination of power generation power among virtual power plants
Zhu et al. Transmission loss-aware peer-to-peer energy trading in networked microgrids
CN113690875B (en) Real-time interactive equivalent model building method for micro-grid

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20201127