CN114269019A - Resource allocation method based on double-shooting game in aerospace information network - Google Patents

Resource allocation method based on double-shooting game in aerospace information network Download PDF

Info

Publication number
CN114269019A
CN114269019A CN202111558461.7A CN202111558461A CN114269019A CN 114269019 A CN114269019 A CN 114269019A CN 202111558461 A CN202111558461 A CN 202111558461A CN 114269019 A CN114269019 A CN 114269019A
Authority
CN
China
Prior art keywords
low
double
orbit satellite
ewa
computing resources
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111558461.7A
Other languages
Chinese (zh)
Inventor
姚海鹏
金雅晴
张尼
宫永康
吴云峰
韩庆敏
韩宝磊
忻向军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Posts and Telecommunications
6th Research Institute of China Electronics Corp
Original Assignee
Beijing University of Posts and Telecommunications
6th Research Institute of China Electronics Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Posts and Telecommunications, 6th Research Institute of China Electronics Corp filed Critical Beijing University of Posts and Telecommunications
Priority to CN202111558461.7A priority Critical patent/CN114269019A/en
Publication of CN114269019A publication Critical patent/CN114269019A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention discloses a resource allocation method based on a double-beat game in an aerospace information network, which is characterized by comprising a low-orbit satellite system based on a High Altitude Platform (HAP) and an EWA algorithm mechanism based on the double-beat game, wherein the low-orbit satellite system can directly access computing resources of an adjacent High Altitude Platform, the computing resources are purchased from the High Altitude Platform by a low-orbit satellite, a resource trading market is established, and a buyer and a seller can dynamically adjust the bidding and asking price strategies through interaction with the environment based on the EWA algorithm. The invention introduces a high-altitude platform to unload the computing task of the low-orbit satellite so as to reduce the transmission delay. And calculating the resource allocation problem between the near-orbit satellite and the high-altitude platform based on a double-shot selling mechanism. Then, a nash equilibrium search algorithm based on Empirical Weight Attraction (EWA) was designed, which combines the advantages of reinforcement learning and belief learning, and is performed on each participant.

Description

Resource allocation method based on double-shooting game in aerospace information network
Technical Field
The invention relates to the technical field, in particular to a resource allocation method based on double-beat game selling in an aerospace information network.
Background
In recent years, satellite communication technology has gradually changed our lives and is an important communication technology that is indispensable. Satellite communication has many advantages, such as long communication distance, no geographical limitation, high system reliability, wide coverage, etc. Compared with a high Orbit Earth (GEO) satellite and a Medium Orbit Earth (MEO) satellite, the Low Orbit Earth (LEO) satellite is close to the ground, communication delay is short, data transmission rate is high, the weight and the volume of the mobile terminal are almost the same as those of personal mobile equipment, and the mobile terminal is more suitable for popularization.
However, satellite communication also has its disadvantages, due to limitations in computing power and insufficient energy, the satellite has to offload the data to the ground station for further processing. Long-range transmission has a large impact on many real-time services. Therefore, effective offloading mechanisms have received extensive attention from both academia and industry.
In this context, a High Altitude Platform (HAP) was introduced to offload the computational tasks of low earth orbit satellites. The high altitude platform with the height of 17-30km can be used as a relay station to transfer resources between the low orbit satellite system and the ground station. The introduction of the high-altitude platform can reduce transmission delay and effectively avoid interference to the troposphere. Meanwhile, the high-altitude platform has strong computing power and can help process computing resources unloaded by the satellite. With the increase of high-altitude platforms, lower transmission delay and lower cost can be obtained in the real-time remote sensing event. Nevertheless, low earth orbit satellites and high altitude platforms remain separate, self-administered individuals. Improving the resource allocation mechanism between low earth orbit satellites and high altitude platforms remains an urgent issue.
Disclosure of Invention
The invention aims to solve the technical problem that data has to be unloaded to a ground station for further processing due to the limitation of the computing capacity and insufficient energy of a low-orbit satellite at present.
A double-beat game-based resource allocation method in an aerospace information network comprises the steps of designing a High Altitude Platform (HAP) -based low orbit satellite system and an EWA algorithm mechanism based on a double-beat game, wherein the low orbit satellite system can directly access computing resources of adjacent High Altitude platforms, the computing resources are purchased from the High Altitude platforms through low orbit satellites, a resource trading market is established, and a buyer and a seller can dynamically adjust bidding strategies and asking price strategies through interaction with the environment based on the EWA algorithm.
Preferably, an M/M/1 queuing model is adopted in the low-orbit satellite system to quantify the urgency degree of low-orbit satellite tasks, and lambda is usediRepresenting LEOiTask arrival rate of, and HAPjWill be expressed as mujIn addition, since the distance between the high altitude platform and the low orbit satellite is short, the main service latency can be attributed to the processing delay, and the total delay of the mission can be expressed as:
Figure BDA0003419810370000021
equal to the wait time plus the service time; then, the average latency can be calculated as:
Figure BDA0003419810370000022
according to the formula Lq=λiTwThe average length of the queue can be expressed as:
Figure BDA0003419810370000023
each low earth orbit satellite has different traffic demands, a higher λ indicates a higher frequency of arrival of the task, and a smaller μ indicates that the task requires a large amount of computation, particularly when LqVery large means that the low earth orbit satellites have very urgent access to additional computing resources.
Preferably, a double-beat game mechanism is adopted in the low-orbit satellite system, and A ═ a is usedj1,2,3 … N to describe the asking price strategy for high-altitude platforms, where ajRepresents SSjAcceptable minimum price; use of bi={bi,j1,2,3 … N to describe LEOiBidding prices for different high-altitude platforms; using the matrix B ═ B i1,2,3 … M represents the bidding strategy for all low orbit satellites; beta is ai,jFor representing LEOiPaying HAP for computing resourcesjThe amount of (c); an aerial platform has several items available for sale to N buyers, so that β is a matrix whose columns record the HAPsjA trade price that sells its computing resources to a certain low earth orbit satellite, such that:
Figure BDA0003419810370000024
is HAPjTotal reward in all successful transactions.
Preferably, the EWA algorithm mechanism based on the double-auction game introduces Sagents and Bagents which are agents traded by buyers and sellers; the bids and asks range from 0 to PmaxDiscrete and the number of bids and asks P (P ═ P)max+1);
Figure BDA0003419810370000031
Is SagentjFor HAPjThe set of inquiry policies of (1),
Figure BDA0003419810370000032
Figure BDA0003419810370000033
meaning that it will select a certain asking price, obviously SsjIs equal to aj. Also, in the same manner as above,
Figure BDA0003419810370000034
denotes BagentiThe policy of Bagent is to bid biOf the vector of (1), the element b thereofi,jIs for SagentjThe number of Sagents, the number of elements, etc.; one seller has P bids and n sellers, so the number of strategies is PnAnd, SbiIs equal to
Figure BDA0003419810370000035
Using Ss={Ss1,Ss2,…,SsnAs a set of inquiry strategies for n sellers, i.e. SsDefine S as As-j={Ss1,Ss2,…,Ss(j-1),Ss(j+1),…,SsnDescription except SagentjOr equal A- { ajS of }sAnd (4) other strategies. For batents, Sb={S1,S2,…,SpIs its set of bidding strategies, i.e. Sb=B。Sb-i={Sb1,Sb2,…,Sb(i-1),Sb(i+1),…,SbnS or Ss=B-{biIs except Bagenti(ii) a policy other than;
key parameters in EWA:
for Bagents and Sagents, pi is usedbi(Sbi,Sb-i)=UiAnd pisj(Ssj,Ss-j)=UjIndicating the rewards of Bagents and Sagents, the rewards and the policy of the Agent itself and its opponents.
The invention introduces a high-altitude platform to unload the computing task of the low-orbit satellite so as to reduce the transmission delay. And calculating the resource allocation problem between the near-orbit satellite and the high-altitude platform based on a double-shot selling mechanism. Then, a nash equilibrium search algorithm based on Empirical Weight Attraction (EWA) was designed, which combines the advantages of reinforcement learning and belief learning, and is performed on each participant. Auction participants can obtain information from other opponents and accumulate experience and thought. Thus, they can complete transactions with other participants in a fuzzy environment and maximize the overall interest of the participants. The convergence of this EWA algorithm is verified by simulation.
Drawings
The following describes embodiments of the present invention in further detail with reference to the accompanying drawings.
FIG. 1 is a schematic diagram of a low-orbit satellite system based on a high-altitude platform in a double-beat-sale-game-based resource allocation method in an aerospace information network according to the invention;
fig. 2 is a schematic diagram of a double-auction process in a resource allocation method based on a double-auction game in an aerospace information network according to the present invention;
FIG. 3 is a schematic diagram of convergence of an EWA-based resource allocation algorithm in a double-beat-sale-game-based resource allocation method in an aerospace information network according to the present invention;
FIG. 4 is a schematic diagram illustrating the average cost of a seller and the average asking price of the seller in the method for allocating resources based on double-auction game in the air-space information network according to the present invention;
fig. 5 is a schematic diagram of average buyer value and average buyer bid price in a double auction game-based resource allocation method in an aerospace information network according to the present invention.
Detailed Description
In order that the objects and advantages of the invention will be more clearly understood, the invention is further described below with reference to examples; it should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
Preferred embodiments of the present invention are described below with reference to the accompanying drawings. It should be understood by those skilled in the art that these embodiments are only for explaining the technical principle of the present invention, and do not limit the scope of the present invention.
The invention provides a resource allocation method based on a double-beat selling game in an aerospace information network, which comprises the following steps: please refer to fig. 1: a low-earth satellite system based on High Altitude Platform (HAP) is designed, wherein the low-earth satellite can directly access to the computing resources of the adjacent High Altitude Platform, and the computing resources are purchased from the High Altitude Platform by the low-earth satellite so as to reduce the task burden of the low-earth satellite, thereby establishing a resource trading market.
Suppose there are M low-orbit satellites labeled X ═ Mi},i=1,2,3…M。C={ c i1,2,3 … M denotes a low-earth satellite MiThe resource requirements of (1). Suppose there are N aerial platforms, denoted Y ═ N respectivelyj},j=1,2,3…N。R={rjJ is 1, …, N represents HAPjThe remaining capacity of the resource is calculated.
To evaluate the estimated revenue of the additional computing resources of different low earth orbit satellites, it is assumed that the estimated revenue is positively correlated with the urgency of the mission. In the present invention, an M/M/1 queuing model is used to quantify the urgency of low-earth-orbit satellite missions. Using λiRepresenting LEOiTask arrival rate of, and HAPjWill be expressed as muj. In addition, the main service latency can be attributed to processing delays due to the short distance between the high altitude platform and the low orbit satellite. The total delay of a task can be expressed as:
Figure BDA0003419810370000051
equal to the latency plus the service time. Then, the average latency can be calculated as:
Figure BDA0003419810370000052
according to the formula Lq=λiTwThe average length of the queue can be expressed as:
Figure BDA0003419810370000053
each low earth satellite has different traffic demands. A higher lambda indicates a higher frequency of arrival of the task, while a smaller mu indicates that the task requires a larger amount of computation. In particular, when LqVery large means that the low earth orbit satellites have very urgent access to additional computing resources.
How to properly allocate limited high altitude platform computing resources among low earth orbit satellites is a complex problem. In the context of this trading market model, on the one hand, the overhead platform should acquire a fee that at least matches its total fee. Low earth orbit satellites, on the other hand, are willing to purchase as much additional computing resources as possible to satisfy different types of tasks. For example, the car networking task is time sensitive and computationally intensive, and therefore it is critical whether these low earth orbit satellites can accept computing resources in a timely manner.
The double auction gaming mechanism:
in the present invention, a ═ a is usedj1,2,3 … N to describe the asking price strategy for high-altitude platforms, where ajRepresents SSjThe lowest price that is acceptable. For simplicity, it is assumed that the computing tasks from low earth orbit satellites are indivisible, which means that the computing tasks can only be offloaded on one high altitude platform. Then, using bi={ b i,j1,2,3 … N to describe LEOiAnd (4) bidding prices for different high-altitude platforms. Due to quality of goods (μ) from different vendorsj) Different, LEOiTending to different bid prices. In other words, for LEOiHAP ofjQuality of service determines LEOiThe preference for high altitude platforms is different. Using the matrix B ═ B i1,2,3 … M for all low earth orbit satellitesAnd (4) bidding strategies.
As shown in FIG. 2, in a two-way auction, each buyer and seller submit an ask price and a bid price simultaneously. The broker then determines the HAPjWhether to accept LEOiThe bid of (1). In the present invention, βi,jFor representing LEOiPaying HAP for computing resourcesjThe amount of (c). If LEOiAnd HAPjWithout transaction between, then βi,jEqual to 0. Since a high-altitude platform has several commodities available to several buyers, β is a matrix whose columns record HAPsjA trade price that sells its computing resources to a certain low-orbit satellite. Thus, it is possible to prevent the occurrence of,
Figure BDA0003419810370000061
is HAPjTotal reward in all successful transactions. Similarly, another matrix α ═ α is definedj,iIts element represents HAPjTo LEOiThe amount of computing resources of. The sum of the j row of alpha records the HAPjTotal transaction amount of (1). If LEOiAnd HAPjNo transaction occurs between, then αj,iEqual to 0. Agent can calculate HAP by the following methodjHow many resources remain in:
Figure BDA0003419810370000062
if the amount of computing resources remaining is insufficient, the transaction will not be allowed.
Game analysis:
since the transactions are conducted by brokers, the private information is not known to each other. In such an opaque information environment, the seller or buyer can only know whether he has completed the transaction. Therefore, to improve their asking price and bidding strategy, it is necessary to define a utility function to simulate the participant's profit in a round of trading by:
Uj=Pj-Costj
represents HAPjA utility function of, wherein PjIs the total reward for a round of double auctions, and CostjIncluding construction costs and maintenance costs. Also, LEOiThe utility function of (a) is:
Figure BDA0003419810370000063
if LEOiIf no transaction is reached with all sellers, UiIs equal to-Vi(Lq)。Vi(Lq) Is LEOiAn estimate or value of how much profit it can make from a successful transaction. Performing its computational tasks on a high-altitude platform depends on importance or urgency. Furthermore, this importance and the estimation of computational resources may be reflected in the need for computational delay.
Thus, using LqTo estimate the return of low earth orbit satellites in receiving high altitude platform computing resources. A large amount of LqMeaning that the low earth orbit satellite has accumulated sufficient mission and LEOiThere is now an urgent need to obtain resources from high altitude platforms. As a result, purchasing additional computing resources is more attractive to them.
In double auction, brokers strive to maximize the sum of the utility functions of both buyers and sellers. This problem can be described as follows:
Figure BDA0003419810370000071
Figure BDA0003419810370000072
in the above maximization problem, there are some constraints, one of which is the expected amount of resources, and the remaining resources must be greater than 0. In addition, the bid price of the buyer and the ask price of the seller must be between 0 and the maximum price, which means that there is a limitation in the price given by the buyer and the seller.
Therefore, we have formulated a double auction model for the resource allocation problem described above. Both buyers and sellers can continually adjust their policies to maximize their profits. On the one hand, buyers desire to obtain resources at the lowest price that sellers can accept. On the other hand, buyers need to compete with other buyers for limited computing resources. When the buyer and seller adjust the strategy after each auction round, this is a game that the two parties are not working. Specifically, the goal of this game is to find nash equilibrium.
Nash equilibrium is the best result of gaming, with each participant in the auction finding the best strategy to obtain the maximum profit, and no participant has an incentive to change his strategy after considering the choice of opponents. In our question, nash equilibrium for double-beat game is defined as follows:
definition 1: in our question, let A*And B*Optimal bidding strategies and interrogation strategies for low earth orbit satellites and high altitude platforms. Then, if the following condition is satisfied, point (A)*,B*) Is the nash equilibrium point:
Ui(A*,B*)≥Ui(A*,B)
and the number of the first and second groups,
Uj(A*,B*)≥Uj(A,B*)
the EWA mechanism:
in the present invention, in order to search Nash balance, a multi-agent machine learning algorithm, called EWA algorithm, is introduced in the system model.
An EWA mechanism based on double-beat selling game:
the EWA algorithm combines belief learning and reinforcement learning. Belief learning enables agents to select their next strategy based on observations of other Agent selections, reinforcement learning enables agents to learn from experience. Thus, based on the EWA algorithm, buyers and sellers can dynamically adjust their bid and ask strategies through interaction with the environment.
For simplicity, Sagents and Bagents, which are agents traded by buyers and sellers, were introduced. In the present invention, the bid and ask ranges from 0 to PmaxDiscrete and the number of bids and asks P (P ═ P)max+1)。
Figure BDA0003419810370000081
Is SagentjFor HAPjThe set of interrogation strategies of.
Figure BDA0003419810370000082
Figure BDA0003419810370000083
Meaning that it will select a certain asking price. Obviously, SsjIs equal to aj. Also, in the same manner as above,
Figure BDA0003419810370000084
denotes BagentiThe bidding strategy of (1). The strategy of Bagent is to bid biOf the vector of (1), the element b thereofi,jIs for SagentjThe number of elements, etc. Since one seller has P bids and n sellers have P policiesn. And, SbiIs equal to
Figure BDA0003419810370000085
Next, S is useds={Ss1,Ss2,…,SsnAs a set of inquiry strategies for n sellers, i.e. SsA. Definition of Ss-j={Ss1,Ss2,…,Ss(j-1),Ss(j+1),…,SsnDescription except SagentjOr equal A- { ajS of }sAnd (4) other strategies. For batents, Sb={S1,S2,…,SpIs its set of bidding strategies, i.e. Sb=B。Sb-i={Sb1,Sb2,…,Sb(i-1),Sb(i+1),…,SbnS or Ss=B-{biIs except BagentiAnd (4) other policies.
Key parameters in EWA:
for theBagents and Sagents, using πbi(Sbi,Sb-i)=UiAnd pisj(Ssj,Ss-j)=UjIndicating the rewards of Bagents and Sagents, the rewards and the policy of the Agent itself and its opponents. The reward of one broker depends not only on its behavior but also on the bidding and asking strategies of other brokers in the same auction. In the EWA mechanism, there are two basic parameters that can help an Agent update its policy.
1) N (t): n (t) is the "empirical equivalent" of past experience. It is usually not equal to the number of past observations because the importance of experience decreases as the number of auction rounds increases. The update rule is as follows:
N(t)=ρN(t-1)+1
ρ is the influence factor, which acts to attenuate the influence of past strategies. t is a time factor equal to the iteration or auction round.
2)A(t):
Figure BDA0003419810370000091
And
Figure BDA0003419810370000092
is BagentiStrategies of tau and SagentjAttraction value of strategy τ. Use of
Figure BDA0003419810370000093
And
Figure BDA0003419810370000094
to represent BagentiAnd SagentjThe significance of the historical experience in selecting a policy. The update rule of τ is:
Figure BDA0003419810370000095
also, in the same manner as above,
Figure BDA0003419810370000096
i (x, y) is 1 only when x is equal to y, otherwise I (x, y) is 0. Phi is a parameter as the learning rate.
δ in A (t) is a notional factor that encourages an Agent to explore the environment and attempt to select an unknown strategy. Depending on the award amount, the Agent may adjust its policy to find the most appropriate policy in the double-auction game.
Each policy or price has a corresponding A (t) indicating a desire to select that price as either a bid or ask price. This equation includes experience and observation of others' behavior, which is characteristic of reinforcement learning and belief learning. The EWA algorithm is an intermediate state between reinforcement learning and belief-based learning.
EWA and reinforcement learning:
in reinforcement learning, three parts are involved: state S, action a and reward R. In each step, the Agent observes the current state s (t) s and makes a decision a (t) a according to the current policy. Following this action, the principal gets an immediate reward R (a, S) from the environment, and the state transitions to the next state S (t +1) e S.
The reward update equation for the RL is:
Figure BDA0003419810370000097
from this equation, the relationship between RL and EWA can be found, i.e. when δ is 0, ρ is 0,
Figure BDA0003419810370000098
and N (0) ═ 1, a (t) will degrade into an accumulation of r (t) to obtain some features that enhance learning.
EWA and confidence learning:
in the belief learning model, the history of the adversary strategy set is composed of
Figure BDA0003419810370000099
Representation, which is a set of policies
Figure BDA00034198103700000910
Is compared to a hypothetical count of observed values (e.g., buyer). The sum of these frequencies is:
Figure BDA00034198103700000911
the initial a priori beliefs are:
Figure BDA0003419810370000101
the beliefs are updated by adding a depreciation coefficient p to the actions actually selected by the other participants.
Figure BDA0003419810370000102
Expressing beliefs with historical beliefs:
Figure BDA0003419810370000103
the expected return over time period T by choosing strategy τ is:
Figure BDA0003419810370000104
combining the last two equations, the update rule of the expectation function in belief learning can be obtained as follows:
Figure BDA0003419810370000105
strategy selection:
the EWA mechanism can switch from belief learning to reinforcement learning, or combine the two methods into one, depending on the different allocations of δ, ρ, Φ. Furthermore, we can find these parameters important.
Each participant who executes the EWA algorithm submits its bid or ask priceTo the broker. The price is calculated by the policy. Thus, the solution to the double beat game is a distributed approach where each low orbit satellite or high altitude platform separately runs its specific EWA algorithm. Each Agent is based on A (t) and is Bagent in the following manneriSelecting a strategy:
Figure BDA0003419810370000106
for Sagentj
Figure BDA0003419810370000107
P (t +1) is the probability of selecting strategy P for the next cycle. After receiving all bidding and asking price strategies, the Agent can finally decide how to effectively allocate the computing resources of all high-altitude platforms.
It is crucial that the calculations of the proposed algorithm cannot be too complex due to the limited computational power of low earth orbit satellites and high altitude platforms. What an Agent does in an auction is to select one of 0 to P{max}The price of (c). Further, after the broker has allocated an allocation plan, the broker resumes the attractiveness of each price. Therefore, the computational complexity of low-earth orbit satellites and high-altitude platforms is o (n). In addition, brokers need to determine a trade between buyers and sellers. The computational complexity of the transaction decision process is O (n)2)。
The double-clap selling algorithm based on the EWA in the air-space information network specifically comprises the following steps:
step 1: input V ═ Vi(Lq)},Cost={Costj},R,C;
Step 2: broker initializes β, P and α, Conv ═ 0, Flag ═ 0;
and step 3: for each low earth orbit satellite and high altitude platform, if HAPjOr LEOiIf the new addition is made, the broker allocates the agents with the initial parameters of A (0) and N (0) for the new addition, otherwise, the former agents and the former parameters are reserved;
and 4, step 4: starting to determine a resource allocation plan in T time;
and 5: repeating the following substeps until Conv ═ 1;
a: based on (18), (19), calculating A and B of all agents;
b: in ascending order aj∈A;
C: descending order of arrangement bi,j∈B;
D: let Flag equal to 0, find the intersection (i, j) of two price sequences equal to argmin(i,j){bi,j-aj(ii) removing (i, j); if r isj≥ciThen update betai=bi,j,Pj=Pj+bi,jj,i=ciFlag 1, price in descending order
Removal of B from sequence Bi,jRepeating the above steps until Flag is 0;
e: if a and B are equal in the last round of transaction, Conv is 1;
f: all agents calculated Cost, β, V, P according to (9), (10);
g: updating A (t) of all the agents;
step 6: assigning α to HAPs, LEOs;
and 7: returning to alpha.
Simulation result and analysis:
convergence performance of double clap selling algorithm based on EWA:
as shown in fig. 3, the EWA algorithm exhibits excellent convergence performance. After approximately 800 double-beat games, the strategy converges for all participants. Furthermore, we find that the total reward does not reach a maximum until the algorithm converges, which indicates that the EWA converges to an optimal solution.
Average cost of seller and average ask of seller:
as shown in fig. 4, it can be seen that the seller gradually increases the average asking price as the average cost increases. As the cost increases, the seller will raise the ask slightly, as shown, although the average cost increases from 8 to 32, the change in ask is not extreme, as the broker will protect private information and the seller will not be able to give an aggressive price. In addition, as the average capacity becomes higher, the seller will lower the average ask, with higher capacity meaning that more low-orbit satellites can be provided with computing resources and earn more profits to offset the cost. In this market, all participants trade under the broker's limits, and the broker's actions can avoid corruption and direct the market win-win.
Average buyer value and average buyer bid:
as shown in FIG. 5, we observed a change in bid price as the average buyer value increased, with little fluctuation, and a smooth increase. While computing resources may bring more profit or value to the buyer, they are willing to pay more money to acquire computing resources to help them beat other resources less valuable than their buyer. However, when their demand is higher, they must also bid higher to compete with others. While the large resource demands generally mean that they can make more profits, the total resources of high altitude platforms are also limited. The high bid for a unit of computing resource means that their task can be completed before other tasks. Thus, they may tie up computing resources to meet their needs in advance.
The invention introduces a high-altitude platform to unload the computing task of the low-orbit satellite so as to reduce the transmission delay. And calculating the resource allocation problem between the near-orbit satellite and the high-altitude platform based on a double-shot selling mechanism. Then, a nash equilibrium search algorithm based on Empirical Weight Attraction (EWA) was designed, which combines the advantages of reinforcement learning and belief learning, and is performed on each participant. Auction participants can obtain information from other opponents and accumulate experience and thought. Thus, they can complete transactions with other participants in a fuzzy environment and maximize the overall interest of the participants. The convergence of this EWA algorithm is verified by simulation.
So far, the technical solutions of the present invention have been described in connection with the preferred embodiments shown in the drawings, but it is easily understood by those skilled in the art that the scope of the present invention is obviously not limited to these specific embodiments. Equivalent changes or substitutions of related technical features can be made by those skilled in the art without departing from the principle of the invention, and the technical scheme after the changes or substitutions can fall into the protection scope of the invention.

Claims (4)

1. A resource allocation method based on double auction game in an aerospace information network is characterized by comprising a low orbit satellite system based on a High Altitude Platform (HAP) and an EWA algorithm mechanism based on double auction game, wherein the low orbit satellite system can directly access computing resources of an adjacent High Altitude Platform, the computing resources are purchased from the High Altitude Platform by a low orbit satellite, a resource trading market is established, and a buyer and a seller can dynamically adjust the bidding and asking price strategies through interaction with the environment based on the EWA algorithm.
2. The method for allocating resources based on the double-beat game in the aerospace information network as claimed in claim 1, wherein M/1 queuing model is adopted in the low-earth orbit satellite system to quantify the urgency degree of low-earth orbit satellite tasks, and λ is usediRepresenting LEOiTask arrival rate of, and HAPjWill be expressed as mujIn addition, since the distance between the high altitude platform and the low orbit satellite is short, the main service latency can be attributed to the processing delay, and the total delay of the mission can be expressed as:
Figure FDA0003419810360000011
equal to the wait time plus the service time; then, the average latency can be calculated as:
Figure FDA0003419810360000012
according to the formula Lq=λiTwThe average length of the queue can be expressed as:
Figure FDA0003419810360000013
each low earth orbit satellite has different traffic demands, a higher λ indicates a higher frequency of arrival of the task, and a smaller μ indicates that the task requires a large amount of computation, particularly when LqVery large means that the low earth orbit satellites have very urgent access to additional computing resources.
3. The method for allocating resources based on the double-auction game in the aerospace information network according to claim 1, wherein a double-auction game mechanism is adopted in the low-earth orbit satellite system, and a ═ is usedj1,2,3 … N to describe the asking price strategy for high-altitude platforms, where ajRepresents SSjAcceptable minimum price; use of bi={bi,j1,2,3 … N to describe LEOiBidding prices for different high-altitude platforms; using the matrix B ═ Bi1,2,3 … M represents the bidding strategy for all low orbit satellites; beta is ai,jFor representing LEOiPaying HAP for computing resourcesjThe amount of (c); an aerial platform has several items available for sale to N buyers, so that β is a matrix whose columns record the HAPsjA trade price that sells its computing resources to a certain low earth orbit satellite, such that:
Figure FDA0003419810360000021
is HAPjTotal reward in all successful transactions.
4. The resource allocation method based on the double-auction game in the aerospace information network according to claim 1, wherein the EWA algorithm mechanism based on the double-auction game introduces Sagents and Bagents which are agents traded by buyers and sellers; the bids and asks range from 0 to PmaxDiscrete and the number of bids and asks P (P ═ P)max+1);
Figure FDA0003419810360000022
Is SagentjFor HAPjThe set of inquiry policies of (1),
Figure FDA0003419810360000023
Figure FDA0003419810360000024
meaning that it will select a certain asking price, obviously SsjIs equal to aj. Also, in the same manner as above,
Figure FDA0003419810360000025
denotes BagentiThe policy of Bagent is to bid biOf the vector of (1), the element b thereofi,jIs for SagentjThe number of Sagents, the number of elements, etc.; one seller has P bids and n sellers, so the number of strategies is PnAnd, SbiIs equal to
Figure FDA0003419810360000026
Using Ss={Ss1,Ss2,…,SsnAs a set of inquiry strategies for n sellers, i.e. SsDefine S as As-j={Ss1,Ss2,…,Ss(j-1),Ss(j+1),…,SsnDescription except SagentjOr equal A- { ajS of }sAnd (4) other strategies. For batents, Sb={S1,S2,…,SpIs its set of bidding strategies, i.e. Sb=B。Sb-i={Sb1,Sb2,…,Sb(i-1),Sb(i+1),…,SbnS or Ss=B-{biIs except Bagenti(ii) a policy other than;
key parameters in EWA:
for Bagents and Sagents, pi is usedbi(Sbi,Sb-i)=UiAnd pisj(Ssj,Ss-j)=UjIndicating the rewards of Bagents and Sagents, the rewards and the policy of the Agent itself and its opponents.
CN202111558461.7A 2021-12-20 2021-12-20 Resource allocation method based on double-shooting game in aerospace information network Pending CN114269019A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111558461.7A CN114269019A (en) 2021-12-20 2021-12-20 Resource allocation method based on double-shooting game in aerospace information network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111558461.7A CN114269019A (en) 2021-12-20 2021-12-20 Resource allocation method based on double-shooting game in aerospace information network

Publications (1)

Publication Number Publication Date
CN114269019A true CN114269019A (en) 2022-04-01

Family

ID=80828281

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111558461.7A Pending CN114269019A (en) 2021-12-20 2021-12-20 Resource allocation method based on double-shooting game in aerospace information network

Country Status (1)

Country Link
CN (1) CN114269019A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115616902A (en) * 2022-11-07 2023-01-17 中国人民解放军国防科技大学 Cluster spacecraft task allocation method and device based on robust auction algorithm

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115616902A (en) * 2022-11-07 2023-01-17 中国人民解放军国防科技大学 Cluster spacecraft task allocation method and device based on robust auction algorithm
CN115616902B (en) * 2022-11-07 2023-03-07 中国人民解放军国防科技大学 Cluster spacecraft task allocation method and device based on robust auction algorithm

Similar Documents

Publication Publication Date Title
US20240054560A1 (en) System and method for optimizing execution of trading orders
US20230245226A1 (en) System and method for routing a trading order based upon quantity
US7870050B2 (en) System and method for an efficient dynamic multi-unit auction
US8275687B2 (en) Allocation of commissions
CN108335182B (en) Cloud platform Web service transaction system and method based on bilateral auction mechanism
US20120284141A1 (en) System and method for a hybrid clock and proxy auction
Lee et al. A real-time group auction system for efficient allocation of cloud internet applications
US20090006115A1 (en) Establishing and updating reputation scores in online participatory systems
CN110544147A (en) multitask cross-server resource allocation method based on two-way auction in MEC
US20060167787A1 (en) System and method for an efficient dynamic multi-unit auction
CN110276670A (en) A kind of computational resource allocation method of task based access control migration
CN114269019A (en) Resource allocation method based on double-shooting game in aerospace information network
JP2002540510A (en) System and method for implementing a progressive second price auction approach
US20200349570A1 (en) System and method for enhanced clock auctions and combinatorial clock auctions
CN111178947A (en) Advertisement space recommendation method and device, computer-readable storage medium and electronic equipment
CN113543055B (en) Resource allocation method in vehicle edge calculation based on bidirectional auction mechanism
US20080167940A1 (en) Method and structure for increasing revenue for on-demand environments
US20170046777A1 (en) Bidder system for efficient dynamic multi-unit auction
US20040254847A1 (en) Automated negotiation with multiple parties
EP1170691A1 (en) System and method for an efficient dynamic multi-unit auction
US8447662B2 (en) System and method for an efficient dynamic multi-unit auction
Baisa An efficient auction for budget constrained bidders with multi-dimensional types
US20140195369A1 (en) System and method for a hybrid clock and proxy auction
KR20200115720A (en) The Method of Charge Scheduling for Multi-Drone Networks and the device thereof
JP5456558B2 (en) Electronic auction equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication