CN113590232B

CN113590232B - Relay edge network task unloading method based on digital twinning

Info

Publication number: CN113590232B
Application number: CN202110965259.XA
Authority: CN
Inventors: 徐江; 柏基成; 李斌; 谈昊哲
Original assignee: Nanjing University of Information Science and Technology
Current assignee: Nanjing University of Information Science and Technology
Priority date: 2021-08-23
Filing date: 2021-08-23
Publication date: 2023-04-25
Anticipated expiration: 2041-08-23
Also published as: CN113590232A

Abstract

The invention discloses a relay edge network task unloading method based on digital twinning, which comprises the steps of constructing a relay edge network task unloading strategy model; updating the state of each corresponding part in the digital twin body environment; transmitting the parameters of the digital twin bodies into an analog task unloading system for iterative training to obtain an optimal task unloading strategy model; transmitting the optimal task unloading strategy model to a simulation manual control interface for backup; transmitting the current digital twin parameter training model and the optimal task unloading strategy model to a digital twin environment cache, and forwarding the digital twin environment cache to each relay node by an edge server in reality, and forwarding the relay node to a user terminal in communication with the relay node; and the user terminal and the relay node perform corresponding task unloading according to the optimal task unloading strategy model. The invention can reduce the trial-and-error cost of the real 5G edge computing technology in the landing process and improve the landing efficiency.

Description

Relay edge network task unloading method based on digital twinning

Technical Field

The invention belongs to the technical field of mobile edge calculation, and particularly relates to a relay edge network task unloading method based on digital twinning.

Background

With the rapid development of 5G and industrial internet, the demand for edge computing is increasing, and the fields of intelligent manufacturing, smart city, internet of vehicles, cloud games and the like all provide requirements for edge computing services.

At present, edge computing technology test points are mostly carried out on 4G or early 5G networks, but the development of the edge computing technology and the deployment of edge servers are limited by the ecology of limited resources and fragmentation at present, so that most users still cannot directly enjoy the services of the edge computing technology. These users may apply to offload tasks that cannot be calculated locally in time to an edge server for calculation. However, the communication link cannot be established directly with the edge server due to factors such as being too far away or being blocked by an obstacle such as a building.

It is an unavoidable challenge to reasonably allocate edge computing resources in the face of differentiated user demands and terminal devices of varying performance. In the pilot process, the change of the resource allocation policy can have a considerable impact on the edge server and the end user in reality.

Most of the existing edge computing related technologies directly assume that states of an edge server and end user equipment are known to perform decision optimization, so that energy consumption and time delay are reduced. However, the optimal solution is not necessarily achieved for more complex reality situations.

The digital twin technology can fully utilize data such as a physical model, a sensor, an operation history and the like, integrate multidisciplinary and multiscale simulation processes, construct images of entities in a virtual space, reflect the full life cycle process of the corresponding physical entities, and is very suitable for adapting and landing the reality situation by the aid of an edge computing technology in the current stage.

Disclosure of Invention

Aiming at the defects of the prior art, the invention provides a relay edge network task unloading method based on digital twinning, which helps a user terminal to carry out task unloading, obtains good effect within an acceptable cost range, and helps an edge computing technology at the current stage to adapt to the actual situation and land.

In order to achieve the technical purpose, the invention adopts the following technical scheme:

a relay edge network task offloading method based on digital twinning includes:

step (1): building a relay edge network task offloading policy model, comprising: the system comprises a physical communication unloading environment, a digital twin environment, an analog task unloading system and an analog manual control interface;

the physical communication offload environment includes: a physical edge server, a relay node and a user terminal set;

the digital twin environment is an environment constructed by aggregating digital twin parameter models obtained by training a relay node and a user terminal by an edge server, and comprises the following steps: the state of the edge server, the state of each relay node and the state of the user terminal;

the simulated task offloading system includes: an artificial intelligent algorithm model library corresponding to each unloading condition, an unloading strategy selection module based on an DQN algorithm and a task unloading strategy model cache module;

the simulated manual control interface is a virtual control environment constructed by an edge server through virtual and real information transmission with a real manual control interface, and a digital twin parameter training model and a task unloading strategy model which are really used are determined;

step (2): the physical entity updates the states of the corresponding parts in the digital twin body environment through the digital twin parameter model;

step (3): the digital twin body environment transmits the parameters of the digital twin body into the simulation task unloading system for iterative training to obtain an optimal task unloading strategy model;

step (4): transmitting the optimal task unloading strategy model to a simulation manual control interface for backup;

step (5): the simulation manual control interface transmits the current digital twin parameter training model and the optimal task unloading strategy model to the digital twin environment cache, and the current digital twin parameter training model and the optimal task unloading strategy model are forwarded to each relay node by an edge server in reality, and the relay nodes are forwarded to a user terminal in communication with the relay nodes;

step (6): and the user terminal and the relay node perform corresponding task unloading according to the optimal task unloading strategy model.

In order to optimize the technical scheme, the specific measures adopted further comprise:

the user terminal comprises a smart phone, a notebook computer, a mobile tablet and other devices;

the user terminal is out of coverage of the edge server.

The states of the edge server include the processor frequency, the available memory capacity, the available channels and the working state of the edge server;

the state of the relay node comprises the processor frequency, transmission power, available channels and working state of the relay node;

the state of the user terminal includes the processor frequency of the user terminal, the transmission power, the data size and computational complexity of the task, the task type, the device type, the remaining energy.

The step (2) includes:

firstly, a user terminal trains a digital twin parameter model in a local iteration mode, and the trained digital twin parameter model is transmitted to a relay node together when a task is unloaded;

the relay node packages and transmits the digital twin parameter model trained by the user terminal and the relay node to the edge server, the edge server aggregates the models of the user terminal and the relay node while training the edge server model, and the state of the corresponding part in the digital twin environment is updated after training is completed.

Step (3) above, the training process in the simulation task offloading system includes:

step (3.1): and (3) calling an artificial intelligence algorithm model library to obtain: the task is locally calculated at the user terminal without task unloading, task unloading is carried out to the relay node for calculation, task unloading is carried out to the edge server for calculation, and the optimal task cost of each of the three conditions is transmitted into the unloading strategy selection module;

step (3.2): the unloading strategy selection module firstly synthesizes the optimal task cost of the three conditions in the step (3.1) into final unloading cost, combines the state parameters of the digital twin body environment, establishes an optimal model for minimizing the final cost, and trains by using a DQN algorithm to obtain an optimal task unloading strategy model;

step (3.3): detecting and evaluating an optimal task unloading strategy model by using historical data of a digital twin environment, and temporarily storing the model and the score to a task unloading model cache module;

step (3.4): and (3.1) repeating the step (3.3) until the score meets the standard or training is finished, and obtaining a final optimal task unloading strategy model.

In the step (3.1), the task is locally calculated at the user end, and when the task is not unloaded, the optimal task cost obtained by calling the related artificial intelligence algorithm is recorded as

In the step (3.1), when the task is unloaded to the relay node for calculation, the corresponding optimal task cost calculation method is as follows:

the transmission rate between the relay node j and the user terminal i is noted as

The transmission delay is recorded as->

The energy consumption during transmission is +.>

The time required for the task to calculate at relay node j is noted +.>

The time required by the relay node j to calculate the digital twin parameter training model and package is as follows:

wherein ,

for the CPU frequency of the relay node j, D _j For the local data set of the relay node j, H is the number of user terminals communicating with the relay node j, and the digital twin parameter model of the relay node j is obtained after training and packaging>

The energy consumption of the relay node is ignored;

when the task is unloaded to the relay node for calculation, the corresponding optimal task cost is as follows:

calling related artificial intelligence algorithm to obtain optimal cost

In the step (3.1), when the task is unloaded to the edge server for calculation, the corresponding optimal task cost calculation method is as follows:

the signal of the user terminal i which can be directly received by the edge server is:

wherein ,

is the channel between the edge server and the user terminal i,/i>

Is a noise signal between the edge server and the user terminal i. The edge server receives the signal of the user terminal i in an auxiliary way through the relay node j as follows:

wherein ,

is the transmission power of relay node j, +.>

Is the auxiliary channel that the relay node j distributes to the user terminal i to the edge server, +.>

Is the noise signal at the edge server on the corresponding channel,/or->

Is a normalization parameter;

the signal to noise ratio of the user terminal i is obtained by maximum ratio amplitude synthesis at the edge server as follows:

wherein ,P_i ^UT Is the transmission power of the user terminal i,

is the channel between the relay node j and the user terminal i;

the transmission rate between the user terminal i and the edge server is:

wherein ,W_i Is the bandwidth between the edge server and the user terminal i. The transmission delay is recorded as

The transmission energy consumption is recorded as

The time required for the task to calculate at the edge server is recorded as +.>

The time required by the edge server to calculate the digital twin parameter training model and aggregate the digital twin parameter models of the relay node and the user terminal is as follows:

wherein ,f^ECS CPU frequency of edge server, D ^ECS The method comprises the steps that N is the number of relay nodes communicated with an edge server and is a local data set of the edge server;

due to

Is small, and f ^ECS Very high, compared to the time taken to train the edge server model, the time taken to aggregate the relay node model +.>

Negligible;

obtaining the digital twin parameter model of the edge server after training

The energy consumption of the edge server is ignored, and when the task is unloaded to the edge server for calculation, the corresponding optimal task cost is as follows:

calling related artificial intelligence algorithm to obtain optimal cost

In the step (3.2), the unloading policy selection module builds an optimization model that minimizes the final cost:

the final offload cost for user terminal i in three cases of step (3.1) is expressed as:

wherein ,a_i E {0,1}, i=1, 2,3 and

for each calculation task of the user terminal i, selecting which case to calculate can minimize the final cost, and an optimization model for minimizing the final cost is as follows:

wherein ,

is the expected energy consumption threshold of the user terminal.

In step (3.2) above, the present invention uses DQN as a framework for the DRL algorithm.

In the training process, the unloading strategy selection module interacts with the digital twin body environment to obtain the state of each iteration t task unloading system:

the action of the learning Agent is expressed as:

A _t ＝{a _t |a _t ∈I _t }

wherein a_t Is from a set of possible decision actions I _t The selected action;

the bonus function reflects that the selected action is in the system state s _t Is expressed as:

/>

wherein ψ is a guaranteed R _t A fixed parameter that is positive, λ is learning rate, μ _i (t) is the final cost at iteration t;

and approximating the optimal action cost function by using a neural network Q (s, a; w) in combination with a time difference algorithm to obtain an optimal task unloading strategy model, and transmitting the optimal task unloading strategy model into a task unloading model buffer module.

The invention has the following beneficial effects:

(1) The invention adopts a digital twin method to carry out the unloading decision of the simulation task, so that the trial-and-error cost of the real 5G edge computing technology in the process of landing can be reduced to a great extent; helping a user obtain a result within an acceptable cost range under the condition of limited edge computing resources at the present stage;

(2) Compared with other task unloading methods, the digital twin environment provided by the invention is updated along with the change of physical entities, so that an unloading strategy model obtained by a simulated task unloading decision system is more close to the real situation, and the landing efficiency is improved.

Drawings

FIG. 1 is a schematic diagram of a physical communication offload environment of the present invention;

FIG. 2 is a frame structure diagram of a relay edge network task offloading method based on digital twinning;

FIG. 3 is a workflow diagram of a task offloading policy model.

Detailed Description

Embodiments of the present invention are described in further detail below with reference to the accompanying drawings.

Referring to fig. 1 and 2, a relay edge network task offloading method based on digital twinning of the present invention includes:

in an embodiment, the user terminal includes a smart phone, a notebook computer, a mobile tablet, and other devices;

the user terminal is out of coverage of the edge server.

The state of the edge server comprises the processor frequency, the available memory capacity, the available channels and the working state of the edge server;

step (5): the simulation manual control interface trains the model m with the current digital twin parameters ^ECS 、m ^RN 、m ^UT Optimal task offloading policy model um ^RN 、um ^UT Transmitting to the digital twin body environment cache, and forwarding to each relay node by the edge server in reality, and forwarding the relay node to the user terminal in communication with the relay node;

In an embodiment, the step (2) includes:

In an embodiment, the step (3) includes simulating a training process in the task offloading system including:

In an example, step (3.1): and (5) calling an artificial intelligent algorithm model library to obtain the optimal task cost corresponding to the three conditions.

Case one: when the task is locally calculated at the user side and task unloading is not performed, the corresponding optimal task cost calculating method comprises the following steps:

for each user terminal i, the task to be calculated is denoted as T _i ＝(C _i ,L _i), wherein C_i Is the computational complexity of the task, L _i Is the data size of the task;

the time required for the task to calculate locally is noted as

The energy consumption required is +.>

The time required by the user terminal i to calculate the digital twin parameter training model is as follows:

wherein ,f_i ^UT For CPU frequency of user terminal i, D _i The local data set of the user terminal i is trained to obtain a digital twin parameter model of the user terminal i

The energy consumption required for this process is +.>

The total cost required by the user terminal i to complete the task is:

/>

wherein ,α_i E (0, 1) and beta _i E (0, 1) is a weight coefficient of time delay and energy consumption determined based on task type and equipment type; calling related artificial intelligence algorithm to obtain optimal task cost

And a second case: when the task is unloaded to the relay node for calculation, the corresponding optimal task cost calculation method comprises the following steps:

The transmission delay is recorded as->

The energy consumption during transmission is +.>

The time required for the task to calculate at relay node j is noted +.>

wherein ,

The energy consumption of the relay node is ignored;

calling related artificial intelligence algorithm to obtain optimal cost

And a third case: when the task is unloaded to the edge server for calculation, the corresponding optimal task cost calculation method comprises the following steps:

wherein ,

is the channel between the edge server and the user terminal i,/i>

wherein ,

is the transmission power of relay node j, +.>

Is the noise signal at the edge server on the corresponding channel,/or->

Is a normalization parameter;

wherein ,P_i ^UT Is the transmission power of the user terminal i,

is the channel between the relay node j and the user terminal i;

the transmission rate between the user terminal i and the edge server is:

The transmission energy consumption is->

due to

Negligible;

obtaining the digital twin parameter model of the edge server after training

calling related artificial intelligence algorithm to obtain optimal cost

In an embodiment, in the step (3.2), the unloading policy selection module builds an optimization model that minimizes the final cost:

wherein ,a_i E {0,1}, i=1, 2,3 and

wherein ,

is the expected energy consumption threshold of the user terminal.

In said step (3.2), the present invention uses DQN as a framework for the DRL algorithm.

the action of the learning Agent is expressed as:

A _t ＝{a _t |a _t ∈I _t }

Fig. 3 specifically illustrates a process of executing a task offloading policy model by a user terminal and a relay node, where the user terminal offloading policy model determines a final object that is responsible for a computing task, and executes a corresponding optimization target policy according to a task type during local computing, and the offloading policy model of the relay node and an edge server executes the corresponding optimization target policy according to the task type.

The above is only a preferred embodiment of the present invention, and the protection scope of the present invention is not limited to the above examples, and all technical solutions belonging to the concept of the present invention belong to the protection scope of the present invention. It should be noted that modifications and adaptations to the invention without departing from the principles thereof are intended to be within the scope of the invention as set forth in the following claims.

Claims

1. The relay edge network task offloading method based on digital twinning is characterized by comprising the following steps of:

step (6): the user terminal and the relay node carry out corresponding task unloading according to the optimal task unloading strategy model;

the step (2) comprises:

the relay node packages and transmits the digital twin parameter model trained by the user terminal and the relay node to the edge server, the edge server aggregates the models of the user terminal and the relay node while training the edge server model, and the state of the corresponding part in the digital twin environment is updated after training is completed;

the step (3) is to simulate the training process in the task unloading system, which comprises the following steps:

step (3.4): repeating the steps (3.1) - (3.3) until the score meets the standard or training is finished, and obtaining a final optimal task unloading strategy model;

in the step (3.1), the task is locally calculated at the user terminal, and when the task is not unloaded, the corresponding optimal task cost calculating method comprises the following steps:

the time required for the task to calculate locally is noted as

The energy consumption required is +.>

The energy consumption required for this process is +.>

The total cost required by the user terminal i to complete the task is:

wherein ,α_i E (0, 1) and beta _i E (0, 1) is a weight coefficient of time delay and energy consumption determined based on task type and equipment type;

invoking artificial intelligence algorithm to obtain optimal task cost

In the step (3.1), when the task is unloaded to the relay node for calculation, the corresponding optimal task cost calculation method comprises the following steps:

The transmission delay is recorded as->

The energy consumption during transmission is +.>

The time required for the task to calculate at relay node j is noted +.>

wherein ,

The energy consumption of the relay node is ignored;

invoking artificial intelligence algorithm to obtain optimal cost

In the step (3.1), when the task is unloaded to the edge server for calculation, the corresponding optimal task cost calculation method comprises the following steps:

wherein ,

is the channel between the edge server and the user terminal i,/i>

Is a noise signal between the edge server and the user terminal i; the edge server receives the signal of the user terminal i in an auxiliary way through the relay node j as follows:

wherein ,

is the transmission power of relay node j, +.>

Is the noise signal at the edge server on the corresponding channel,/or->

Is a normalization parameter;

wherein ,P_i ^UT Is the transmission power of the user terminal i,

is the channel between the relay node j and the user terminal i;

the transmission rate between the user terminal i and the edge server is:

wherein ,W_i Is the bandwidth between the edge server and the user terminal i; the transmission delay is recorded as

The transmission energy consumption is->

wherein due to

Negligible;

f ^ECS CPU frequency of edge server, D ^ECS The method comprises the steps that N is the number of relay nodes communicated with an edge server and is a local data set of the edge server;

obtaining the digital twin parameter model of the edge server after training

The energy consumption of the edge server is negligible, and the task is offloaded to the edge serverWhen the server calculates, the corresponding optimal task cost is as follows:

invoking artificial intelligence algorithm to obtain optimal cost

wherein ,a_i E {0,1}, i=1, 2,3 and

for each computing task of user terminal i, the optimization model that minimizes the final cost is:

/>

wherein ,

an expected energy consumption threshold for the user terminal;

in said step (3.2), DQN is used as a framework for the DRL algorithm;

the action of the learning Agent is expressed as:

A _t ＝{a _t |a _t ∈I _t }

the neural network Q (s, a; w) is used in combination with a time difference algorithm to approximate the optimal action cost function to obtain an optimal task offloading strategy model.

2. The method for offloading tasks in a relay edge network based on digital twinning according to claim 1, wherein the user terminal comprises a smart phone, a notebook computer, a mobile tablet;

the user terminal is out of coverage of the edge server.

3. The method for offloading tasks in a relay edge network based on digital twinning according to claim 1, wherein the states of the edge server include processor frequency, available memory capacity, available channels and operating states of the edge server;