CN112101728A

CN112101728A - Energy optimization distribution method for mobile edge computing system

Info

Publication number: CN112101728A
Application number: CN202010829335.XA
Authority: CN
Inventors: 林伟伟; 许银海; 黄天晟
Original assignee: South China University of Technology SCUT
Current assignee: South China University of Technology SCUT
Priority date: 2020-08-18
Filing date: 2020-08-18
Publication date: 2020-12-18

Abstract

The invention discloses an energy optimization distribution method of a mobile edge computing system, which comprises the following steps: assuming that the time intervals of the arrival events are exponentially distributed, converting the MEC system energy distribution problem into a Markov decision model of the contact time; the Markov decision model comprises six elements of a system state s, a system action a, an award r (s, a), a strategy pi, a value function V(s) and a state transition probability p (s' | s, a); obtaining the exact state transition probability, and solving the Model by adopting a Model-Based method; and solving the accurate solution of each state value function by a value iteration method to obtain the optimal strategy of energy distribution. The invention relates to a mobile edge computing system energy allocation problem, which is converted into a continuous time Markov decision model, and an accurate solution of each state value function is solved by a value iteration method, so that an optimal strategy of energy allocation is found, and long-term sustainable computation of an MEC system is realized.

Description

Energy optimization distribution method for mobile edge computing system

Technical Field

The invention belongs to the technical field of energy distribution of a mobile edge computing system, and particularly relates to an energy optimization distribution method of the mobile edge computing system.

Background

With the wide popularization of internet technology and the vigorous development of mobile communication technology, the mobile internet has become an important part of people's daily life. Smart mobile terminals such as smart phones, tablet computers, smart wearable devices, and in-vehicle mobile devices have gradually replaced personal computers, and have become main tools used by people in work, study, social contact, and entertainment. In order to solve the problem of energy limitation of mobile devices, a new network architecture, Mobile Edge Computing (MEC), is proposed. According to the ETSI definition, the moving edge is calculated as: and providing a cloud computing function and an IT service environment required by an application opener and a content service provider at the edge of the network so as to reduce the time delay of network operation and service delivery and improve the user experience.

Mobile Edge Computing (MEC) requires the distribution of millions of small servers in a city, with too much power overhead to rely on traditional power grid facilities. Due to advances in Energy Harvesting (EH) technology, renewable energy sources such as solar radiation and wind energy have become viable and promising power sources for MEC systems. However, the attainment of these energy sources is highly random and therefore requires the storage of the collected energy sources in the MEC system cells. If there is not enough available energy, the system may not operate properly, and therefore, the energy usage plan calculated for each request may have a profound effect on the sustainable operation of the system. For example, if not enough energy is provided to handle the incoming request, the computation speed may not meet the user's demand. Conversely, if too much energy is allocated, the MEC system may miss several subsequent requests due to low battery power. Therefore, reasonable distribution of the energy of the MEC system is realized, and long-term sustainable energy calculation of the MEC system is guaranteed to become a new challenge.

Disclosure of Invention

The invention mainly aims to overcome the defects and shortcomings of the prior art and provides an energy optimization allocation method for a mobile edge computing system, which converts the energy allocation problem of the mobile edge computing system into a continuous-time Markov decision model and solves the accurate solution of each state value function by a value iteration method, thereby finding the optimal strategy of energy allocation and realizing the long-term sustainable calculation of an MEC system.

In order to achieve the purpose, the invention adopts the following technical scheme:

the invention provides an energy optimization distribution method of a mobile edge computing system, which comprises the following steps:

assuming that the time intervals of arrival events are exponentially distributed, converting the energy distribution problem of the mobile edge computing MEC system into a Markov decision model of the contact time; the MEC system comprises a scheduler, wherein the scheduler is used for judging whether the MEC system receives tasks or not and distributing virtual machines and energy sources for the received tasks; the Markov decision model comprises six elements of a system state s, a system action a, an award r (s, a), a strategy pi, a value function V(s) and a state transition probability p (s' | s, a); when a new arrival event arrives, the MEC system generates state transition, wherein the arrival event comprises a task arrival event, an energy arrival event and a task completion event; the probability of the state transition is the same as the probability of the occurrence of the next arrival event, and the value is determined by the ratio of the probability of the specific arrival event to the total events of the system arrival;

obtaining the exact state transition probability according to the state transition probabilities under different system states and different system action conditions, and solving the Model by adopting a Model-Based method;

and solving the accurate solution of each system state value function by a value iteration method to obtain the optimal strategy of energy distribution.

Further, the system state s is specifically represented as follows:

wherein b is the residual energy of the MEC system in the current system state and represents the number of the distributed running virtual machines,

representing the number of running virtual machines, k, allocated to a unit of energy_nRepresenting the amount of unit energy allocated to the virtual machine;

in any system state, the arrival event e satisfies:

wherein A is_r、A_eAnd

respectively representing a task arrival event, an energy arrival event and a task completion event.

Further, when a task arrival event arrives, i.e., e ═ a_rIf the system action a is equal to 0, the MEC scheduler refuses the task arrival event, and if the system action a is equal to k_nThen it means that the MEC system assigns a value k to the arriving task request_nA virtual machine of a unit energy; bringing q for the MEC system each time when the energy arrival event arrives_eJ energy source; when other events arrive, the MEC scheduler does not perform any substantive operation.

Further, the system award r (s, a) is specifically expressed as follows:

r(s,a)＝g(s,a)-c(s,a)τ(s,a)

wherein g (s, a) represents a direct reward, c (s, a) and τ (s, a) represent cost rate and dwell time between the current task arrival event and the next task arrival event, respectively, and c (s, a) τ (s, a) represents the added delay of all computation requests between two successive arrival event time points;

the direct prize g (s, a) is specifically expressed as follows:

wherein U represents the local computation time to the task;

the cost rate c (s, a) is specifically expressed as follows:

wherein the content of the first and second substances,

representing the number of running virtual machines in the MEC system, the number of virtual machines not changing between event arrivals; 1_{a＞0}Is shown in system state a>0 is equal to 1, otherwise 0.

Further, the task arrival event and the energy arrival event are respectively subject to a parameter lambda_r、λ_ePoisson distribution of (a); the interval time obeying parameter of the task arrival event and the task completion event is mu_c(k_n) Index distribution of (d), mu_c(k_n) Specifically, the following are shown:

where φ represents the average data size of the offload request, v represents the number of CPU cycles required to compute a bit of offload request data, and κ represents the effective switch capacitance of the MEC system processor.

Further, the residence time between arrival events follows an exponential distribution with a parameter β (s, a), which represents the sum of the occurrence rates of all possible events, and is specifically expressed as follows:

wherein λ is_r、λ_eRespectively representing probability expectations of task arrival events and energy arrival events;

the probability density function of the dwell time between arrival events is specifically as follows:

when the MEC scheduler accepts a task arrival event, a new virtual machine needs to be opened, i.e.

And is formed byIt is assigned k_jThe energy source of (2), when the energy source is distributed k_jBecomes the task completion rate of the virtual machine

When a task completion event is reached, i.e.

When an event occurs, the system will shut down the virtual machine running this task, at which point k is assigned_jBecomes (σ) the task completion rate of the virtual machine_kj-1)μ_c(k_j)。

Further, the obtaining of the exact state transition probability specifically includes:

the state transition probabilities under different system states and different system operating conditions are as follows:

when in use

When the temperature of the water is higher than the set temperature,

wherein s' represents the next system state;

when in use

In time, namely, the new arriving event is the energy arriving event, the probability of occurrence of the three arriving events will not change:

wherein the content of the first and second substances,

equivalent to min (b +1, b)_m)；

When in use

When the temperature of the water is higher than the set temperature,

further, the reward function adopts a discount model, and the prospective discount reward is derived as follows:

where η is the discount factor.

Further, when the fixed policy pi is followed, the discount reward in an infinite time range is obtained, which is specifically expressed as follows:

wherein, t_pRepresents the start time, s, of the p-th action of the MEC system_pRepresents the system state at the p-th action of the MEC system,

is shown in system state s_pThe reward of following the policy pi is then,

is shown at t_pFollowing the system actions taken by the policy π time MEC system, the rewards obtained all require a discount function

The continuous-time markov model target is a strategy for achieving a maximum reward, expressed as follows:

wherein, pi^*Representing an optimal strategy;

for all system states S ∈ S, the bellman optimal equation is satisfied, which is as follows:

wherein the content of the first and second substances,

further, the value iteration specifically includes the following steps:

initialization, for all system states S ∈ S, q ═ 0, V^q(s)＝0；

The update value function, for the system state S e S,

q ═ q +1, if | V^q-V^q-1Ii >, return to the update value function step;

and searching an optimal energy distribution strategy, and for all system states S E S,

compared with the prior art, the invention has the following advantages and beneficial effects:

the invention converts the energy distribution problem of the mobile edge computing system into a Markov decision model of continuous time by assuming that the time interval of the arrival event is exponentially distributed, solves the accurate solution of each state value function by adopting a model-based method of value iteration, finds the optimal strategy of energy distribution, solves the problems of task loss, unsatisfied delay and the like of the mobile edge computing system, and realizes the long-term sustainable computation of the MEC system.

Drawings

FIG. 1 is a simplified schematic diagram of an energy allocation model of a mobile edge computing system in accordance with the method of the present invention;

FIG. 2 is a median iterative solution algorithm of the method of the present invention.

Detailed Description

The present invention will be described in further detail with reference to examples and drawings, but the present invention is not limited thereto.

Examples

Three alternative energy scores for incoming requests are considered-1, 2 and 3 energy units. The MEC system starts from its initial state (a battery containing one energy unit) and receives an additional energy unit upon encountering an energy arrival event. A task request then arrives at the MEC system, which decides to reserve 1 energy unit for the task request, with system action a being 1. Accordingly, the battery charge will be reduced by 1, the number of currently operating 1 unit of energy allocation VM (recorded as α)₁) Jump to 1. After receiving another unit of energy, the computation in the VM has been completed (task completion event arrives), then α₁A jump to 0 will occur. The last event is another task arrival event, but this time the MEC system schedules 2 units of energy allocation for it, i.e. a-2.

As shown in fig. 1, the invention relates to a method for optimizing and allocating energy of a mobile edge computing system, comprising the following steps:

s1, assuming that the time intervals of arriving events are distributed exponentially, converting the MEC system energy allocation problem into a markov decision model of continuous events, where the model includes six elements, namely a system state S, a system action a, a reward r (S, a), a policy pi, a value function v (S), and a state transition probability p (S' | S, a):

A) a system state s, said system state s being represented as follows:

b is the remaining energy of the MEC system in the current state,

representing the number of allocated running virtual machines, e representing an arrival event, the arrival of a new event generating a system state transition, the arrival event comprising a task arrival event, an energy arrival event and a task completion event.

In any system state, the arrival event e satisfies:

wherein A is_r、A_eAnd

In this embodiment, the MEC system includes an MEC scheduler.

B) System actions a,; when the task arrives, i.e. e ═ A_rIf the system action a is equal to 0, the MEC scheduler refuses the task arrival event, and if the system action a is equal to k_nThen it means that the MEC system assigns a value k to the arriving task request_nA virtual machine of a unit energy;

in this embodiment, it is assumed that the energy arrival event arrives, bringing 1 unit (q) for the MEC system each time_eJ) The energy source of (1). When other events arrive, the MEC scheduler does not perform any substantive operation.

C) A system reward r (s, a), which is specifically expressed as follows:

r(s,a)＝g(s,a)-c(s,a)τ(s,a)

wherein g (s, a) represents a direct reward, c (s, a) and τ (s, a) represent the cost rate and residence time between the current task arrival event and the next task arrival event, respectively, and c (s, a) τ (s, a) represents the added delay of all computation requests within two consecutive arrival event time points;

the direct prize g (s, a) is specifically expressed as follows:

wherein U represents the local computation time to the task; in other words, using MEC calculations, the MEC system will save U units of time for the requester.

In this embodiment, the different local computation times caused by the different capacities of the local mobile devices are not taken into account, i.e. the immediate payback expectation for each request is the same.

The cost rate c (s, a) is specifically expressed as follows:

wherein the content of the first and second substances,

representing the number of running virtual machines in the MEC server, which does not change between event arrivals; 1_{a＞0}Is shown in system a>0 is equal to 1, otherwise 0.

In this embodiment, the sum of the computation service delay times of the current task request will be increased by c for each second_tSecond, wherein c_tThe total number of the existing virtual machines.

The task arrival event and the energy arrival event are respectively obeyed with a parameter lambda_r、λ_ePoisson distribution of (a); the interval time obeying parameter of the task arrival event and the task completion event is mu_c(k_n) Index distribution of (d), mu_c(k_n) Specifically, the following are shown:

where φ represents the average data size of the offload request, v represents the number of CPU cycles required to compute data for the offload request, and κ represents the effective switched capacitance of the MEC system processor.

The residence time between arrival events obeys an exponential distribution with a parameter β (s, a) representing the sum of the occurrence rates of the possible events, as follows:

in this embodiment, when the MEC scheduler accepts a task arrival event, a new virtual machine needs to be opened, i.e., a new virtual machine is opened

And assign k thereto_jThe energy source of (2), when the energy source is distributed k_jBecomes the task completion rate of the virtual machine

When a task completion event is reached, i.e.

The reward function adopts a discount model, and the expected discount reward is deduced as follows:

where η is the discount factor. In the present embodiment, e is in the form of an index^-ηtThe expected value can be simplified as a discount compared to etat.

D) And a strategy pi, wherein when the fixed strategy pi is followed, discount rewards in an infinite time range are obtained, and the strategy pi is specifically expressed as follows:

wherein, t_pRepresents the start time, s, of the p-th action of the system_pIndicating the state of the system at the p-th action of the system,

is shown in state s_pNext, following the reward of the policy pi,

is shown at t_pFollowing the actions taken by the policy pi system, the reward obtained for each action requires a discount function

wherein, pi^*Representing an optimal strategy; to simplify the notation, in the present embodiment, let

E) For all system states S ∈ S, the bellman optimal equation is satisfied, and the value function v (S) is as follows:

wherein the content of the first and second substances,

s2, obtaining the exact state transition probability, and solving the Model by adopting a Model-Based method, wherein the method specifically comprises the following steps:

F) probability of state transition p (s' | s, a)

when in use

When the temperature of the water is higher than the set temperature,

wherein s' represents the next system state;

when in use

In time, namely, the arrival event is an energy arrival event, the occurrence probability of the three events will not change:

wherein the content of the first and second substances,

equivalent to min (b +1, b)_m)；

When in use

When the temperature of the water is higher than the set temperature,

s3, solving the accurate solution of each system state value function through a value iteration method to find out the optimal energy distribution scheme, which specifically comprises the following steps:

s31, initializing, wherein S e S, q is 0, V for all system states^q(s)＝0；

S32, updating the value function, for the system state S e S,

s33, q ═ q +1, if | V^q-V^q-1Ii >, return to step S32;

s34, finding the optimal energy distribution strategy, for all system states S belonging to S,

it should also be noted that in this specification, terms such as "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.

The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims

1. A method for optimizing energy distribution of a mobile edge computing system, comprising the steps of:

2. The method according to claim 1, wherein the system state s is specifically expressed as follows:

in any system state, the arrival event e satisfies:

wherein A is_r、A_eAnd

3. The method of claim 2, wherein the task-arrival event is e-A when the task-arrival event arrives_rIf the system action a is equal to 0, the MEC scheduler refuses the task arrival event, and if the system action a is equal to k_nThen it means that the MEC system assigns a value k to the arriving task request_nA virtual machine of a unit energy; bringing q for the MEC system each time when the energy arrival event arrives_eJ energy source; when other events arrive, the MEC scheduler does not perform any substantive operation.

4. The method of claim 2, wherein the system reward r (s, a) is expressed as follows:

r(s，a)＝g(s，a)-c(s，a)τ(s，a)

the direct prize g (s, a) is specifically expressed as follows:

wherein U represents the local computation time to the task;

the cost rate c (s, a) is specifically expressed as follows:

wherein the content of the first and second substances,

representing the number of running virtual machines in the MEC system, the number of virtual machines not changing between event arrivals; 1_{a＞0}Indicating a 1 when system state a > 0, otherwise 0.

5. The method as claimed in claim 3, wherein the task arrival event and the energy arrival event are respectively subject to a parameter λ_r、λ_ePoisson distribution of (a); the interval time obeying parameter of the task arrival event and the task completion event is mu_c(k_n) Index distribution of (d), mu_c(k_n) Specifically, the following are shown:

where φ represents the average data size of the offload requests, v represents the number of CPU cycles required to compute a bit of offload request data, and K represents the effective switched capacitance of the MEC system processor.

6. The method according to claim 4 or 5, wherein the residence time between arrival events is subject to an exponential distribution with a parameter β (s, a), where β (s, a) represents the sum of the occurrence rates of all possible events, and is specifically expressed as follows:

When a task completion event is reached, i.e.

7. The method according to claim 6, wherein the obtaining the exact state transition probability specifically comprises:

when in use

When the temperature of the water is higher than the set temperature,

wherein s' represents the next system state;

when in use

wherein the content of the first and second substances,

equivalent to min (b +1, b)_m)；

When in use

When the temperature of the water is higher than the set temperature,

8. the method of claim 6, wherein the reward function employs a discount model, and the prospective discount reward is derived as follows:

where η is the discount factor.

9. The method according to claim 7 or 8, wherein when the fixed policy pi is followed, the discount reward within an infinite time range is obtained, which is specifically expressed as follows:

wherein, t_pRepresents the start time of the p-th action of the MEC system, sp represents the system state of the MEC system at the p-th action,

is shown in system state s_pThe reward of following the policy pi is then,

wherein, pi^*Representing an optimal strategy;

wherein the content of the first and second substances,

10. the method according to claim 9, wherein the value iteration comprises the following steps:

initialization, for all system states S ∈ S, q ═ 0, V^q(s)＝0；

The update value function, for the system state S e S,

q ═ q +1, if | | | V^q-V^q-1I >, return to the update value function step;