CN113542376A

CN113542376A - Task unloading method based on energy consumption and time delay weighting

Info

Publication number: CN113542376A
Application number: CN202110749538.2A
Authority: CN
Inventors: 曾伟良; 周镖华; 黄永慧; 孙为军; 杨超
Original assignee: Guangdong University of Technology
Current assignee: Guangdong University of Technology
Priority date: 2021-07-01
Filing date: 2021-07-01
Publication date: 2021-10-22

Abstract

The invention discloses a task offloading method based on energy consumption and time delay weighting, which comprises the following steps: S1, constructing a system model according to the communication between the task vehicle and the cooperative vehicle and the communication between the task vehicle and an MEC server configured on a roadside unit arranged at the edge of the intersection, to obtain an objective function; S2, judging whether the task vehicle can complete its own task within the maximum tolerable delay; if so, performing local computation, otherwise entering step S3; S3, converting the task allocation decision problem into a Markov decision process according to the constructed system model and the obtained objective function; S4, solving the policy selection problem in the Markov decision process by applying the DQN algorithm, thereby obtaining the optimal offloading node. The method effectively reduces the computation delay of the task vehicle, relieves the computing pressure on the task vehicle and the MEC server, and addresses the problem that heuristic algorithms fall into local optima.

Description

Task unloading method based on energy consumption and time delay weighting

Technical Field

The invention relates to the technical field of the Internet of Vehicles, and in particular to a task offloading method based on energy consumption and time delay weighting.

Background

With the development of technologies such as the Internet of Things, artificial intelligence and virtual reality, computation-intensive services with high energy consumption keep increasing, and the conflict between computation-intensive applications and resource-limited mobile computing systems poses unprecedented challenges to the development of future mobile services. To address this challenge, Mobile Cloud Computing (MCC) is often used to offload computing tasks from mobile terminals to resource-rich remote clouds. However, the conventional MCC approach suffers from long delay and low reliability because data are transmitted over a wide area network.

In recent years, Mobile Edge Computing (MEC), which provides cloud computing capability in the vicinity of mobile users, has been proposed as one of the key technologies of 5G. Offloading a user's computing tasks to a nearby MEC server, i.e., mobile edge computing offloading, is considered a promising solution to the above challenges. Compared with the conventional MCC scheme, edge computing can achieve lower latency and higher reliability, and it has become a research hotspot.

With the proliferation of vehicles on roads, a large amount of computing resources is required to meet the computing demands of users; at the same time, the growing number of vehicles brings many idle computing resources that can be used. For vehicles, the main optimization problems in task offloading with edge computing are: (1) reducing the task computation delay, i.e., how to allocate resources reasonably so as to minimize the time from the generation of a task by the vehicle to the vehicle's receipt of the task result; (2) reducing the energy consumption, i.e., minimizing the energy consumed when the vehicle offloads its task.

Currently existing vehicle task offloading strategies are:

1. Research on a minimum-delay multi-edge-node offloading balancing strategy monitors the data flow in the network through a Software Defined Network (SDN) and steers the data of a hotspot area to be offloaded to peripheral nodes in a multi-hop manner for task execution, so as to relieve the load of the hotspot area and reduce the task execution delay; an edge node offloading algorithm based on quantum particle swarm optimization and an edge node load balancing algorithm based on a heuristic algorithm are proposed to solve the problem.

2. Internet of Vehicles cooperative service caching and computation offloading based on multi-agent reinforcement learning studies the cooperation of multiple roadside units (RSUs) to perform service caching and task scheduling for vehicle applications, and models service caching and task scheduling as a mixed-integer nonlinear programming problem that aims to minimize the service delay of the Internet of Vehicles system. To reduce the difficulty of solving the optimization problem, a two-layer multi-RSU cooperative caching framework is first proposed to decouple the problem. The outer layer adopts a multi-agent meta reinforcement learning method in which a long short-term memory (LSTM) network serves as a meta-agent that balances local decisions and accelerates learning while each local agent learns its own decisions, so that an optimal RSU caching strategy is obtained; the inner layer uses the Lagrange multiplier method to solve for the optimal cooperative offloading strategy, thereby realizing task allocation among RSUs.

3. A genetic-algorithm-based multi-site collaborative computation offloading algorithm, GAMCCO, is proposed. The algorithm abstracts an application program into a task dependency graph model, analyzes the dependencies among tasks, models the multi-site collaborative computation offloading problem as a cost model, and searches for the offloading scheme with the minimum cost by using a genetic algorithm.

However, the above solutions have the following disadvantages:

1) energy consumption is not considered; 2) the long short-term memory network that is adopted needs a large amount of training data, and its training process is time-consuming; 3) the genetic algorithm that is applied is a heuristic algorithm and easily falls into a local optimum, so that the global optimum is missed.

Disclosure of Invention

The invention aims to overcome the defects of the prior art and provides a task unloading method based on energy consumption and time delay weighting.

In order to achieve the purpose, the technical scheme provided by the invention is as follows:

A task offloading method based on energy consumption and time delay weighting, which releases the computing resources of cooperative vehicles with idle computing resources on the road to provide computing services for task vehicles, comprises the following steps:

S1, constructing a system model according to the communication between the task vehicle and the cooperative vehicle and the communication between the task vehicle and an MEC server configured on a roadside unit arranged at the edge of the intersection, to obtain an objective function;

S2, judging whether the task vehicle can complete its own task within the maximum tolerable delay; if so, performing local computation, otherwise entering step S3;

S3, converting the task allocation decision problem into a Markov decision process according to the system model constructed in step S1 and the obtained objective function;

S4, for the Markov decision process obtained in step S3, solving the policy selection problem by using the DQN algorithm, so as to obtain the optimal offloading node, namely the node that provides the computing service for the task vehicle.

Further, the step S1 includes:

S1-1, carrying out scene modeling analysis, including dividing the offloading modes into a vehicle-to-vehicle (V2V) offloading mode and an MEC offloading mode;

S1-2, building a mathematical model for the system model.

Further, the step S1-2 specifically includes:

V2V offloading mode:

in the V2V offloading mode, it is assumed that each task vehicle can find its best cooperative vehicle and form an offloading pair with it; according to Shannon's formula, the transmission rate between the task vehicle and the cooperative vehicle is given by formula (1), in which ω_V2V is the bandwidth, P_i is the transmission power, N₀ is the noise, and the remaining symbol is the channel gain;

in the V2V offloading mode, the delay comprises the data upload time and the computation time on the cooperative vehicle; the return time is ignored because the processed result of the task is small;

the delay and the energy consumption of the transmission process are given by formula (2), in which D_i is the data size of the task and the index i ranges over the set of task vehicles that cannot complete their own tasks within the maximum tolerable delay;

the computation delay and the energy consumption of the cooperative vehicle are given by formula (3), in which B_i is the number of CPU cycles required to process one bit of data, f_j is the computing capability of the cooperative vehicle, P_j is the power consumption of the cooperative vehicle, and μ is an energy parameter;

the total delay and the total energy consumption in the V2V offloading mode are therefore given by formula (4);

considering the delay and energy overhead together, the total cost function in the V2V offloading mode is given by formula (5), in which α is the weight factor between delay and energy consumption: the closer α is to 1, the more the delay is emphasized; otherwise, the more the energy consumption is emphasized;
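One reading of formulas (1) to (5) that is consistent with the symbol definitions above can be written as follows. This is a sketch under stated assumptions, not the published formulas: the symbols r_{i,j}, h_{i,j}, T and E with their superscripts are assumed notation, and the computing power P_j = μ f_j² is taken from the relation given in the detailed description.

```latex
\begin{align}
r_{i,j} &= \omega_{V2V}\,\log_2\!\left(1 + \frac{P_i\,h_{i,j}}{N_0}\right) \tag{1}\\
T^{tr}_{i,j} &= \frac{D_i}{r_{i,j}}, \qquad E^{tr}_{i,j} = P_i\,T^{tr}_{i,j} \tag{2}\\
T^{c}_{i,j} &= \frac{D_i B_i}{f_j}, \qquad E^{c}_{i,j} = P_j\,T^{c}_{i,j} = \mu f_j^{2}\,\frac{D_i B_i}{f_j} \tag{3}\\
T^{V2V}_{i} &= T^{tr}_{i,j} + T^{c}_{i,j}, \qquad E^{V2V}_{i} = E^{tr}_{i,j} + E^{c}_{i,j} \tag{4}\\
C^{V2V}_{i} &= \alpha\,T^{V2V}_{i} + (1-\alpha)\,E^{V2V}_{i} \tag{5}
\end{align}
```

With α = 1 the cost reduces to the total delay alone, and with α = 0 to the total energy alone, which matches the stated role of the weight factor.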

MEC offloading mode:

when the task vehicle selects the MEC offloading mode, it communicates with the roadside unit through the cellular network and thereby with the MEC server deployed on the roadside unit;

according to Shannon's formula, the transmission rate between the task vehicle and the MEC server is given by formula (6), in which ω_MEC is the bandwidth;

in the MEC offloading mode, the delay and the energy consumption each consist of two parts: uploading the task to the MEC server and computing on the MEC server;

the delay and the energy consumption of the task-upload part are given by formula (7);

the delay and the energy consumption of the computation on the MEC server are given by formula (8), in which f_MEC is the computing capability of the MEC server;

the total delay and the total energy consumption in the MEC offloading mode are therefore the sums of the upload and computation parts;

considering the delay and energy overhead together, the total cost function in the MEC offloading mode is given by formula (11), in which α is the weight factor between delay and energy consumption: the closer α is to 1, the more the delay is emphasized; otherwise, the more the energy consumption is emphasized;
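The MEC-side formulas can be sketched in the same way. The notation below is assumed (r_{i,m}, h_{i,m}, the superscripted T and E, and a server power P_MEC that the text does not define); only the equation numbers cited in the text are attached, and the unnumbered line stands for the total delay and energy.

```latex
\begin{align}
r_{i,m} &= \omega_{MEC}\,\log_2\!\left(1 + \frac{P_i\,h_{i,m}}{N_0}\right) \tag{6}\\
T^{up}_{i,m} &= \frac{D_i}{r_{i,m}}, \qquad E^{up}_{i,m} = P_i\,T^{up}_{i,m} \tag{7}\\
T^{c}_{i,m} &= \frac{D_i B_i}{f_{MEC}}, \qquad E^{c}_{i,m} = P_{MEC}\,T^{c}_{i,m} \tag{8}\\
T^{MEC}_{i} &= T^{up}_{i,m} + T^{c}_{i,m}, \qquad E^{MEC}_{i} = E^{up}_{i,m} + E^{c}_{i,m} \notag\\
C^{MEC}_{i} &= \alpha\,T^{MEC}_{i} + (1-\alpha)\,E^{MEC}_{i} \tag{11}
\end{align}
```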

the cost function of the whole task is then given by formula (12), in which β ∈ {0, 1} is the offloading decision parameter: β = 0 if the task vehicle selects the V2V offloading mode, and β = 1 otherwise;

with this cost function, the objective function takes the form of formula (13), in which the constraint C1 means that the computation in neither the V2V nor the MEC offloading mode may exceed the maximum tolerable delay of the task.
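The weighted cost of one offloading choice, the selection by β and the check of constraint C1 can be sketched in Python as follows. This is a minimal sketch, not the patented implementation: the closed forms (Shannon rate, upload delay = D_i / rate, computing power = μ f²) are assumed from the symbol definitions, the same energy model is assumed for cooperative vehicles and MEC servers, and names such as `Task`, `offload_cost` and `total_cost` are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class Task:
    data_bits: float        # D_i, task data size in bits
    cycles_per_bit: float   # B_i, CPU cycles needed per bit
    max_delay: float        # tau_i, maximum tolerable delay in seconds


def shannon_rate(bandwidth_hz, tx_power, channel_gain, noise):
    """Transmission rate in bit/s; assumed form of formulas (1) and (6)."""
    return bandwidth_hz * math.log2(1.0 + tx_power * channel_gain / noise)


def offload_cost(task, rate, tx_power, node_freq, mu, alpha):
    """Return (delay, energy, weighted cost) for one offloading node."""
    t_up = task.data_bits / rate                               # upload delay
    e_up = tx_power * t_up                                     # upload energy
    t_cmp = task.data_bits * task.cycles_per_bit / node_freq   # compute delay
    e_cmp = mu * node_freq ** 2 * t_cmp                        # assumes P = mu * f^2
    delay, energy = t_up + t_cmp, e_up + e_cmp
    return delay, energy, alpha * delay + (1.0 - alpha) * energy


def total_cost(beta, cost_v2v, cost_mec):
    """Assumed form of formula (12): beta = 0 selects V2V, beta = 1 selects MEC."""
    return (1 - beta) * cost_v2v + beta * cost_mec


def satisfies_c1(delay, task):
    """Constraint C1: the offloaded computation must finish within tau_i."""
    return delay <= task.max_delay
```

For example, with D_i = 10^6 bit, B_i = 100 cycle/bit, a 2 Mbit/s link, unit transmit power, a 1 GHz node, μ = 10^-18 and α = 0.5, `offload_cost` gives a delay of about 0.6 s, so a task with τ_i = 0.5 s would violate C1 on that node.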

Further, in step S3, converting the task allocation decision problem into a Markov decision process specifically includes:

when a task vehicle offloads tasks, its tasks are continuously offloaded to an MEC server or a cooperative vehicle; the state of the system consists of the data size of the current task, the task complexity, the maximum tolerable delay, and the communication state between the vehicle and the offloading points, and the state space of the system is expressed as follows:

S_t＝{T_i，V2V₁，V2V₂，...，V2V_j，MEC₁，MEC₂，...，MEC_m} (14)

in formula (14), T_i is the task type, V2V₁，V2V₂，...，V2V_j are the communication states of the vehicle in the V2V offloading mode, and MEC₁，MEC₂，...，MEC_m are the communication states of the vehicle in the MEC offloading mode;

the action space consists of the selectable offloading points and is expressed as:

A_t＝{V₁，V₂，...，V_j，M₁，M₂，...，M_m} (15)

in formula (15), V₁，V₂，...，V_j are the cooperative vehicles and M₁，M₂，...，M_m are the MEC servers;

the task vehicle selects an optimal offloading node according to a policy, i.e., it selects an action from A_t to perform the current task;

the reward function is set as the negative of the total cost function, where s is the current state and a is the action, so that maximizing the reward over the whole process minimizes the total cost;
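The state, action space and reward described above can be represented as in the following sketch. It is illustrative only: the class and function names are hypothetical, and encoding each communication state as a single float (for instance a channel gain) is an assumption, since the text does not fix an encoding.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class State:
    task: Tuple[float, float, float]   # T_i = (D_i, B_i, tau_i)
    v2v_links: Tuple[float, ...]       # V2V_1 .. V2V_j communication states
    mec_links: Tuple[float, ...]       # MEC_1 .. MEC_m communication states

    def as_vector(self) -> List[float]:
        """Flatten the state in the order used by formula (14)."""
        return list(self.task) + list(self.v2v_links) + list(self.mec_links)


def action_space(num_vehicles: int, num_servers: int) -> List[Tuple[str, int]]:
    """Formula (15): cooperative vehicles V_1..V_j, then MEC servers M_1..M_m."""
    vehicles = [("V", k) for k in range(1, num_vehicles + 1)]
    servers = [("M", k) for k in range(1, num_servers + 1)]
    return vehicles + servers


def reward(total_cost: float) -> float:
    """Reward is the negative total cost, so lower cost means higher reward."""
    return -total_cost
```

A flat vector of this kind is what a Q-network would typically take as input, with one output per entry of the action space.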

the offloading policy π is the basis on which the vehicle takes actions in the state space S_t; the quality of a policy is judged by policy-based value functions, which are divided into a state-value function and an action-value function;

wherein the action-value function is q^π(s，a)＝E(G_t|S_t＝s，A_t＝a), and the state-value function is the corresponding expectation conditioned on the state s alone;

through the Bellman equation, the state-value function is converted into formula (16), in which s is the current state, r(s) is the reward function in the current state, γ ∈ [0, 1] is the discount factor, s′ is the next state, P(s′|s) is the transition probability from the current state to the next state, and V(s′) is the value of the next state;

from the Bellman equation, it can be seen that the value of the current state equals the reward of the current state plus the discounted expected value of the next state.
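In standard notation, the state-value function and the Bellman form described here can be written as below. The definition of the return G_t is the usual discounted sum and is an assumption, as is the exact layout of formula (16); the symbols r(s), γ, P(s′|s) and V(s′) are those named in the text.

```latex
\begin{align}
G_t &= \sum_{k=0}^{\infty} \gamma^{k}\, R_{t+k+1} \notag\\
V^{\pi}(s) &= \mathbb{E}\left[\,G_t \mid S_t = s\,\right] \notag\\
V(s) &= r(s) + \gamma \sum_{s'} P(s' \mid s)\, V(s') \tag{16}
\end{align}
```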

Further, in step S4, solving the policy selection problem in the Markov decision process by using the DQN algorithm specifically includes:

the action-value function is updated according to formula (17), in which R is the reward function, the step-size coefficient is the learning rate, Q(s, a) is the current action-value function, and Q^*(s, a) is the updated action-value function;

the optimal policy obtains the optimal action-value function through formula (17); the policy is to select, as the optimal action, the action with the maximum action-value function value in the next state;

in the DQN algorithm, a neural network is used to fit the Q values; in each time slot, the current state information is input to the trained model to obtain the Q values and the corresponding actions, i.e., the corresponding offloading nodes; an ε-greedy strategy is used, in which a new offloading node is explored with probability ε and the offloading node is selected according to the current Q values with probability 1−ε.
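The ε-greedy selection and the Q-value update can be illustrated with the short sketch below. To stay self-contained it uses a plain lookup table in place of the neural network that DQN would use to fit Q, and it assumes that formula (17) has the usual temporal-difference form Q*(s, a) = Q(s, a) + lr · (R + γ · max Q(s′, a′) − Q(s, a)); the function names are hypothetical.

```python
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """Pick a node index: explore with probability epsilon, else exploit."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))                        # explore a random node
    return max(range(len(q_values)), key=lambda a: q_values[a])    # best known node


def q_update(q, state, action, reward, next_state, lr, gamma):
    """One temporal-difference update of q[state][action], done in place."""
    best_next = max(q[next_state])               # max over a' of Q(s', a')
    target = reward + gamma * best_next
    q[state][action] += lr * (target - q[state][action])
    return q[state][action]
```

In a full DQN, the table `q` would be replaced by a network that maps the state vector to one Q value per offloading node, and the same target would serve as the regression label for the chosen action.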

Compared with the prior art, the principle and the advantages of the scheme are as follows:

1) The scheme takes into account the vehicles on the road that have idle computing resources (cooperative vehicles), makes full use of them and releases their idle computing resources, which effectively reduces the computation delay of the task vehicle and relieves the computing pressure on the task vehicle and the MEC server.

2) The scheme applies the DQN algorithm when obtaining the offloading node. Because the state of the vehicle is high-dimensional, an ordinary Q-table is limited by the dimensionality, whereas the DQN algorithm is not. Given this high dimensionality, applying a deep neural network can avoid falling into a local optimum, which addresses the problem that heuristic algorithms fall into local optima.

Drawings

In order to illustrate the embodiments of the present invention or the technical solutions in the prior art more clearly, the drawings required for describing the embodiments or the prior art are briefly introduced below. Obviously, the drawings described below show only some embodiments of the present invention, and those skilled in the art can obtain other drawings from them without creative effort.

FIG. 1 is a schematic flow chart of a task offloading method based on energy consumption and time delay weighting according to the present invention;

FIG. 2 is a schematic diagram of the system model involved in the present invention.

Detailed Description

The invention will be further illustrated with reference to specific examples:

according to the task unloading method based on energy consumption and time delay weighting, the computing resources of the cooperative vehicles with free computing resources in the road are released, and computing services are provided for the task vehicles.

As shown in FIG. 1, the method comprises steps S1 to S4. Step S1 includes:

S1-1, carrying out scene modeling analysis:

As shown in FIG. 2, the considered scene is an intersection. A Road Side Unit (RSU) is deployed on the road and an MEC server is configured on the RSU; the MEC servers are denoted M＝{M₁，M₂，...，M_m}. Vehicles with idle computing resources on the road, i.e., cooperative vehicles, can provide computing services and are denoted V＝{V₁，V₂，...，V_j}; there are N task vehicles. This embodiment discretizes time into time slots and defines the time set as t＝{1, 2, 3, ...}.

It is assumed that the state information of the vehicles does not change within a time slot and that the task type of each task vehicle is T_i＝(D_i，B_i，τ_i), where D_i is the data size of the task in bit, B_i is the number of CPU cycles required to process one bit of data in cycle/bit, and τ_i is the maximum tolerable delay of the task in s.

The task vehicles are divided into two groups.

One group satisfies D_iB_i/f_i≤τ_i, where f_i is the computing capability of the current task vehicle; the computing tasks of this group can be completed on board without exceeding the maximum tolerable delay, so no task offloading is needed, and this embodiment records these task vehicles as the set L＝{1, 2, ...}.

The other group satisfies D_iB_i/f_i＞τ_i; that is, the computing task of the current task vehicle cannot be completed within the maximum tolerable delay and must be offloaded, and this embodiment records these vehicles as a second set.

A task in the second set has two offloading modes, the Vehicle-to-Vehicle (V2V) offloading mode and the MEC offloading mode. An offloading decision parameter β ∈ {0, 1} is defined, where β = 0 if the task vehicle selects the V2V mode for offloading and β = 1 otherwise.
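The grouping rule can be expressed directly in code. The sketch below assumes each task vehicle is given as a tuple (D_i, B_i, τ_i, f_i); the function name and the tuple layout are hypothetical.

```python
def split_task_vehicles(vehicles):
    """Split vehicles into (local, offload) index lists.

    Each vehicle is a tuple (D_i, B_i, tau_i, f_i): data size in bits,
    CPU cycles per bit, maximum tolerable delay in seconds, and local
    computing capability in cycles per second.
    """
    local, offload = [], []
    for idx, (d_i, b_i, tau_i, f_i) in enumerate(vehicles):
        local_delay = d_i * b_i / f_i            # time to compute on board
        (local if local_delay <= tau_i else offload).append(idx)
    return local, offload
```

Only the vehicles in the `offload` list go on to choose between the V2V and MEC modes; those in `local` compute on board.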

S1-2, establishing a mathematical model for the system model, specifically comprising:

V2V offloading mode:

in formula (1), N₀ is the noise and the remaining symbol is the channel gain;

the delay and energy consumption of the transmission process are given by formula (2), in which D_i is the data size of the task;

the computation delay and energy consumption of the cooperative vehicle are given by formula (3), in which B_i is the number of CPU cycles required to process one bit of data, f_j is the computing capability of the cooperative vehicle, P_j is the power consumption of the cooperative vehicle with P_j＝μ(f_j)², and μ is an energy parameter;

the total delay and total energy consumption in the V2V offloading mode are given by formula (4);

MEC offloading mode:

in formula (6), ω_MEC is the bandwidth;

the delay and energy consumption of the task-upload part are given by formula (7);

in formula (8), f_MEC is the computing capability of the MEC server;

the total delay and energy consumption in the MEC offloading mode are the sums of the upload and computation parts;

the cost function of the whole task is given by formula (12), and with this cost function the objective function takes the form of formula (13).

This step is equivalent to placing the task vehicles that can complete their tasks within the maximum tolerable delay in the set L and placing those that cannot in the other set; the task vehicles in the other set then proceed to steps S3 and S4 for further processing.

S3, converting the task allocation decision problem into a Markov decision process according to the system model constructed in step S1 and the obtained objective function, which specifically comprises the following:

S_t＝{T_i，V2V₁，V2V₂，...，V2V_j，MEC₁，MEC₂，...，MEC_m} (14)

in formula (14), T_i is the task type, V2V₁，V2V₂，...，V2V_j are the communication states of the vehicle in the V2V offloading mode, and MEC₁，MEC₂，...，MEC_m are the communication states of the vehicle in the MEC offloading mode;

A_t＝{V₁，V₂，...，V_j，M₁，M₂，...，M_m} (15)

the task vehicle selects an optimal offloading node according to a policy, i.e., it selects an action from A_t to perform the current task;

the reward function is set as the negative of the total cost function;

wherein the state-value function gives the expected reward that can be achieved over all actions in state s, and

the action-value function q^π(s，a)＝E(G_t|S_t＝s，A_t＝a) gives the reward that can be obtained by taking action a in state s;

through the Bellman equation, the state-value function is converted into formula (16).

S4 is the last step of this embodiment and specifically includes the following:

the action-value function is updated according to formula (17), in which R is the reward function and the step-size coefficient is the learning rate.

The above-mentioned embodiments are merely preferred embodiments of the present invention, and the scope of the present invention is not limited thereto, so that variations based on the shape and principle of the present invention should be covered within the scope of the present invention.

Claims

1. A task unloading method based on energy consumption and time delay weighting releases computing resources of cooperative vehicles with idle computing resources in roads to provide computing services for task vehicles, and is characterized by comprising the following steps:

2. The method for task offloading based on energy consumption and latency weighting as claimed in claim 1, wherein the step S1 comprises:

s1-2, building a mathematical model for the system model.

3. The method for task offloading based on energy consumption and latency weighting according to claim 2, wherein the step S1-2 specifically includes:

V2V offloading mode:

in formula (1), N₀ is the noise and the remaining symbol is the channel gain;

the delay and energy consumption of the transmission process are given by formula (2), in which D_i is the data size of the task;

the computation delay and energy consumption of the cooperative vehicle are given by formula (3);

the total delay and total energy consumption in the V2V offloading mode are given by formula (4);

MEC offloading mode:

in formula (6), ω_MEC is the bandwidth;

the delay and energy consumption of the task-upload part are given by formula (7);

in formula (8), f_MEC is the computing capability of the MEC server;

the total delay and energy consumption in the MEC offloading mode are the sums of the upload and computation parts;

the cost function of the whole task is given by formula (12);

with this cost function, the objective function takes the form of formula (13), subject to the constraint C1.

4. The method for unloading tasks based on energy consumption and time delay weighting as claimed in claim 1, wherein in step S3, the step of converting the task allocation process decision problem into a markov decision process specifically includes:

S_t＝{T_i，V2V₁，V2V₂，...，V2V_j，MEC₁，MEC₂，...，MEC_m} (14)

A_t＝{V₁，V₂，...，V_j，M₁，M₂，...，M_m} (15)

the task vehicle selects an optimal offloading node according to a policy, i.e., it selects an action from A_t to perform the current task;

the reward function is set as the negative of the total cost function;

the offloading policy π is the basis on which the vehicle takes actions in the state space S_t; the quality of a policy is judged by policy-based value functions, which are divided into a state-value function and an action-value function;

wherein the action-value function is q^π(s，a)＝E(G_t|S_t＝s，A_t＝a), and the state-value function is the corresponding expectation conditioned on the state s alone;

through the Bellman equation, the state-value function is converted into formula (16);

from the Bellman equation, it can be seen that the value of the current state equals the reward of the current state plus the discounted expected value of the next state.

5. The method for unloading tasks based on energy consumption and time delay weighting as claimed in claim 4, wherein in the step S4, the method for solving the problem of strategy selection in the Markov decision process by applying the DQN algorithm specifically comprises:

the action-value function is updated according to formula (17), in which R is the reward function and the step-size coefficient is the learning rate.