CN114268923A - Internet of vehicles task unloading scheduling method and system - Google Patents

Internet of vehicles task unloading scheduling method and system

Info

Publication number
CN114268923A
Authority
CN
China
Prior art keywords
task
vehicle
time slot
vehicles
queue
Prior art date
Legal status
Pending
Application number
CN202111535739.9A
Other languages
Chinese (zh)
Inventor
鲁蔚锋
刘锐
徐佳
徐力杰
蒋凌云
Current Assignee
Nanjing University of Posts and Telecommunications
Original Assignee
Nanjing University of Posts and Telecommunications
Priority date
Filing date
Publication date
Application filed by Nanjing University of Posts and Telecommunications
Priority to CN202111535739.9A
Publication of CN114268923A


Classifications

    • Y: GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02: TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D: CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00: Reducing energy consumption in communication networks
    • Y02D30/70: Reducing energy consumption in communication networks in wireless communication networks

Abstract

The invention provides a task offloading scheduling method and system for the Internet of Vehicles. The method comprises: designing a queue model for each vehicle in view of the communication model and the computation model in the Internet of Vehicles; designing a system objective function under energy consumption and delay constraints; modeling task offloading scheduling as a Markov decision process; solving the optimal task offloading schedule based on a double deep Q network; and carrying out deep reinforcement learning training based on federated learning. The invention fully considers the computation and caching of computing tasks in the vehicle, uses federated learning to find an effective task scheduling strategy, guarantees the requirements of delay-sensitive tasks, minimizes the delay loss, energy loss and service charge of the system, and protects the privacy of user vehicles by adopting a distributed training method.

Description

Internet of vehicles task unloading scheduling method and system
Technical Field
The invention relates to a task offloading scheduling method and system for the Internet of Vehicles, and in particular to a task offloading scheduling method and system for the intelligent Internet of Vehicles based on federated learning.
Background
In recent years, the Internet of Things and autonomous vehicles have received much attention. In the intelligent Internet of Vehicles, not only the computing, communication and caching functions of the terminal vehicle but also the delay requirements of tasks must be considered, and these requirements depend to a great extent on the communication and computing capability of the system. Guaranteeing a reasonable allocation of computing and communication resources through computation offloading is a necessary condition for realizing an intelligent Internet of Vehicles. Vehicular edge computing migrates computing tasks to the network edge, which effectively reduces end-to-end delay and meets the low-delay, high-reliability requirements of Internet of Vehicles applications.
The dynamic changes in the Internet of Vehicles also introduce storage and communication complexity, because the topology of the network is constantly changing due to the high mobility of the vehicles. In such a dynamic environment, resource allocation is usually a non-convex optimization problem with complex objective functions and constraints that traditional optimization algorithms struggle to solve, whereas deep reinforcement learning handles such complex optimization problems well. With the development of 5G networks, terminal vehicles now have the conditions required for training artificial intelligence models, so training models on the vehicle has become possible. Federated learning, as a distributed machine learning method, can further reduce communication delay and protect the privacy of end users. Compared with traditional computation offloading in the Internet of Vehicles, the federated-learning-based task offloading scheduling method for the intelligent Internet of Vehicles jointly optimizes communication, caching and computing resources.
In view of the above, there is a need to design a federated-learning-based task offloading scheduling method and system for the Internet of Vehicles to solve the above problems.
Disclosure of Invention
The invention aims to solve the joint communication and computation optimization problem in the Internet of Vehicles environment, design a corresponding task queue and energy queue for each vehicle, find an effective task scheduling strategy by using federated learning, guarantee the requirements of delay-sensitive tasks, minimize the delay loss, energy loss and service charge of the system, and protect the privacy of user vehicles.
In order to achieve the above object, the present invention provides a task offloading scheduling method for the Internet of Vehicles. In the Internet of Vehicles, the whole road is divided into M disjoint road sections according to the coverage areas of the roadside units, a plurality of vehicles are present within the coverage area of one roadside unit, and the vehicles and the roadside unit complete the computation and offloading of tasks through wireless links. The method comprises the following steps:
Step 1: considering the communication model and the computation model in the Internet of Vehicles, designing a queue model for each vehicle;
Step 2: considering the energy consumption constraint and the delay constraint, designing a system objective function;
Step 3: modeling task offloading scheduling as a Markov decision process;
Step 4: solving the optimal task offloading schedule based on a double deep Q network;
Step 5: carrying out deep reinforcement learning training based on federated learning.
A further development of the invention is that step 1 comprises the following steps:
Step 1.1: calculate the wireless communication rate between the vehicle and the roadside unit, $r_i^m = B\log_2\left(1+\frac{P_i L_0 d_{i,m}^{-\alpha}}{P_w}\right)$; when task $k$ is offloaded to the roadside unit for computation, the uplink transmission delay between the vehicle and the RSU is $t_u(i,m,k)=\frac{I_k}{r_i^m}$, where $L_0$ is the path loss, $P_i$ is the transmit power of vehicle $v_i$, $P_w$ is the Gaussian white noise power, $\alpha$ is the path loss exponent, $d_{i,m}$ is the distance between the vehicle and the RSU, and $B$ is the channel bandwidth;
Step 1.2: each computation task may either be computed locally in the vehicle or offloaded to the RSU for computation; when the task is offloaded to the RSU, the computation time of offloading task $k$ to RSU $m$ is $t_m(k)=\frac{I_k c_k |V_i^m|}{F_m}$, where $F_m$ is the CPU frequency of edge server $m$ and $|V_i^m|$ is the number of vehicles in the vehicle set of roadside unit $m$; the transmission energy consumed by offloading to the edge server in one time slot is the product of the amount of data transmitted in that slot and the energy consumption per unit of data; when the task is computed locally in the vehicle, the local computation delay is $t_c(i,k)=\frac{I_k c_k}{f_i^{local}}$ and the local computation energy is $e_c(i,k)=P_i^{local}\,t_c(i,k)$, where $f_i^{local}$ and $P_i^{local}$ are the CPU frequency and power of the vehicle, respectively;
Step 1.3: calculate the total time for vehicle $i$ to process task $k$, i.e. the uplink transmission delay plus the RSU computation time when the task is offloaded, or the local computation delay when it is computed locally;
Step 1.4: each vehicle has $T_s$ task priority queues, where $T_s$ is the maximum delay limit over all task types, i.e. $T_s=\max\{T_k, k\in\{1,2,\dots,K\}\}$; the capacity of the vehicle task queues is defined per time slot, i.e. the maximum queue capacity ensures that the task queue can hold any task arriving in any time slot, and the task queues are numbered $\{1,2,\dots,l,\dots,T_s\}$;
Step 1.5: calculate the initial priority $pr(i,k)$ of each task $k$ generated in vehicle $i$ in each time slot from the task's data size and remaining processing time; tasks with smaller initial priority values are processed first;
Step 1.6: update the vehicle energy queue, whose change in each time slot is determined by the energy consumed by local computation and uplink transmission in that slot;
wherein the set of $n$ vehicles is $V=\{v_1,v_2,\dots,v_n\}$, and each vehicle $v_i\in V$ travels on the road at speed $S_i$; the set of roadside units is $G=\{R_1,R_2,\dots,R_M\}$, and the communication range of each roadside unit $R_m\in G$ is a circle of diameter $d_m$; the set of vehicles within the communication range of the same $R_m$ is denoted $V_i^m$, indicating that the vehicles within the communication range of an RSU can communicate with the corresponding RSU via V2I; the total time is divided into $N$ equal time slots $\tau$; there are $K$ different types of tasks, each type with a different generation probability, and the generation probability of the $k$-th type of task in each time slot is $\lambda_k$, where $k$ indexes the task types; each task is represented by a triple $a_k=\langle I_k,c_k,T_k\rangle$, where $I_k$, $c_k$ and $T_k$ respectively denote the data size of task $a_k$, the number of CPU cycles required to compute the task, and the task delay limit; $x_{i,k}^{t,m}$ and $y_{i,k}^{t}$ are binary decision variables: $x_{i,k}^{t,m}=1$ indicates that in time slot $t$ task $k$ of vehicle $v_i$ is offloaded to roadside unit $R_m$ for computation, $y_{i,k}^{t}=1$ indicates that in time slot $t$ task $k$ is computed locally by vehicle $v_i$, and when both are zero task $k$ remains in the vehicle's task queue during time slot $t$.
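A minimal sketch of the communication and computation quantities in step 1, assuming a Shannon-form rate, delay equal to cycles divided by frequency, and energy equal to power multiplied by time; the function names and closed forms below are illustrative assumptions rather than the patent's exact formulas, which are given only as images in the original filing.

```python
import math

def uplink_rate(B, P_i, L0, d_im, alpha, P_w):
    # Assumed Shannon-form rate r_i^m built from the quantities named in step 1.1.
    return B * math.log2(1 + (P_i * L0 * d_im ** (-alpha)) / P_w)

def uplink_delay(I_k, r_im):
    # t_u(i, m, k): time to push the I_k bits of task data over the V2I link.
    return I_k / r_im

def rsu_compute_time(I_k, c_k, num_vehicles_m, F_m):
    # t_m(k): RSU m's CPU frequency F_m is shared by the vehicles it serves.
    return I_k * c_k * num_vehicles_m / F_m

def local_compute_time(I_k, c_k, f_local):
    # t_c(i, k): assumed cycles / frequency (the translated text prints a product).
    return I_k * c_k / f_local

def local_compute_energy(P_local, t_c):
    # e_c(i, k): assumed vehicle CPU power multiplied by the local computation time.
    return P_local * t_c
```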
A further development of the invention is that step 2 comprises the following steps:
Step 2.1: calculate the amount of type-$k$ tasks processed locally by vehicle $i$ in one time slot;
Step 2.2: calculate the amount of tasks $h_t$ exceeding the delay limit in time slot $t$, which is determined by the amount of tasks in the queue with index 1 of vehicle $i$'s task queues in time slot $t$;
Step 2.3: minimize the total cost of system energy consumption and task processing; the system objective function minimizes this total cost (1), subject to:
$x_{i,k}^{t,m},\, y_{i,k}^{t} \in \{0,1\}$ (2)
each vehicle can only select one decision variable in one time slot (3)
$f_i \le f_{max}$ (4)
the task delay constraint of each time slot (5)
the task energy consumption constraint of each time slot (6)
where $W_2$ is a weight constant; constraint (2) represents that the offloading decision variables can only take the value 0 or 1; constraint (3) represents that each vehicle can only select one decision variable in one time slot; constraint (4) represents the vehicle CPU frequency constraint; constraint (5) represents the task delay constraint of each time slot; constraint (6) represents the task energy consumption constraint of each time slot.
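As a rough illustration of the objective in step 2.3 and the checks behind constraints (2)-(6), the following sketch combines the per-slot delay, energy, service charge and the penalty on overdue tasks with the weights $W_1$ and $W_2$; the additive form and the argument names are assumptions, since the exact objective expression appears only as a formula image.

```python
def slot_cost(delay, energy, service_charge, overdue_tasks, W1=1.0, W2=1.0):
    # Assumed per-slot cost: delay plus weighted energy plus service charge plus
    # the penalty W1 on the task amount h_t that exceeded its delay limit.
    return delay + W2 * energy + service_charge + W1 * overdue_tasks

def decision_is_feasible(x_vars, y_var, f_i, f_max, delay, T_k, energy, e_budget):
    # Constraint (2): binary decision variables.
    binary = all(v in (0, 1) for v in x_vars + [y_var])
    # Constraint (3): at most one decision per slot (offload, local, or keep).
    one_choice = sum(x_vars) + y_var <= 1
    # Constraints (4)-(6): CPU frequency cap, per-slot delay and energy limits.
    return binary and one_choice and f_i <= f_max and delay <= T_k and energy <= e_budget
```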
A further development of the invention is that said step 3 comprises the following steps:
Step 3.1: design the system state space $S_t$, which comprises the state of the vehicles in the $m$-th road section in time slot $t$, where $X_t$ denotes the position of the vehicle at time $t$, obtained from its position in the previous slot and the vehicle speed, the number of vehicles in the vehicle set of road section $m$ in time slot $t$, and the state of each task queue and of the energy queue of vehicle $i$ in time slot $t$;
Step 3.2: design the system action space $A_t$, which describes the task offloading decision space of a vehicle, i.e. the action space; its elements are the offloading decision variables $x_{i,k}^{t,m}$ ($1 \le m \le M$) and $y_{i,k}^{t}$;
Step 3.3: energy queue state transition: the energy queue of the next time slot is obtained from the current energy queue and the energy consumed in the current time slot;
Step 3.4: task queue state transition: the task queues of the next time slot are obtained from the current queues by removing the tasks offloaded through V2I communication or computed locally, shifting the remaining tasks to the queue with the next lower index, and inserting the newly generated tasks according to their initial priority;
Step 3.5: design the loss function of the system; in time slot $t$, with the system in state $S_t$ and the vehicle taking action $A_t$, the loss is $Loss_t(S_t,A_t)$;
Step 3.6: the optimal scheduling policy of the system is $\pi^{*}=\arg\min_{\pi}\mathbb{E}\left[\sum_{t}\eta^{t}\,Loss_t(S_t,A_t)\right]$, where $0 < \eta < 1$ is a discount factor that indicates the impact of future losses on the current operation.
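A small sketch of how the state of step 3.1 and the discounted objective of step 3.6 might be represented; the field layout of the state container is an assumption, and only the quantities named in the text (position, task queues, energy queue) are included.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class VehicleState:
    position: float                 # X_t, from the last-slot position and speed
    task_queues: List[List[float]]  # T_s priority queues of pending task amounts
    energy: float                   # energy-queue level in slot t

def discounted_total_loss(per_slot_losses, eta=0.9):
    # Step 3.6: the optimal policy minimizes the eta-discounted sum of losses.
    return sum(eta ** t * loss for t, loss in enumerate(per_slot_losses))
```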
A further development of the invention is that said step 4 comprises the following steps:
Step 4.1: initialize the weights $\theta$ of the double deep Q network, the values of the Q function, and the experience replay buffer;
Step 4.2: give the initial state $S_0$ of the Internet of Vehicles system;
Step 4.3: for each time slot $t=0,1,\dots,t_{max}$, execute steps 4.4-4.15;
Step 4.4: in state $S_t$, compute the expected minimum loss representation (Q value) of taking action $A_t$;
Step 4.5: compute the optimal decision of the deep Q network, i.e. the action that minimizes the Q value in state $S_t$;
Step 4.6: draw a probability value $p$ uniformly at random;
Step 4.7: if $p \le \varepsilon$, choose a random action $A_t$;
Step 4.8: if $p > \varepsilon$, select the action $A_t=\arg\min_{A}Q(S_t,A;\theta_t)$;
Step 4.9: perform action $A_t$ and obtain the state $S_{t+1}$ of the next time slot;
Step 4.10: calculate the loss function $Loss_t$;
Step 4.11: put the experience $(S_t,A_t,Loss_t,S_{t+1})$ into the experience replay buffer;
Step 4.12: randomly draw a batch of experiences from the buffer as training samples and compute the double deep Q function target, in which the online network selects the action for the next state and the target network evaluates it;
Step 4.13: compute the loss $L(\theta_t)$ between the double deep Q target and the current Q value estimate;
Step 4.14: calculate the gradient of $L(\theta_t)$;
Step 4.15: update $\theta_t$ based on the gradient descent method;
Step 4.16: the training model converges, and the trained model is obtained.
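The distinguishing element of step 4.12 is the double-Q target, in which one network chooses the next action and the other evaluates it. A minimal sketch follows; since the scheduler minimizes losses rather than maximizes rewards, argmin replaces the usual argmax, and the split into an online network and a target network is the standard double-DQN assumption rather than notation taken from the patent.

```python
import numpy as np

def double_q_target(loss_t, q_online_next, q_target_next, eta=0.9):
    # q_online_next / q_target_next: Q values of all actions in S_{t+1} under the
    # online network (theta_t) and the target network, respectively.
    a_star = int(np.argmin(q_online_next))       # online network selects the action
    return loss_t + eta * q_target_next[a_star]  # target network evaluates it
```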
A further development of the invention is that said step 5 comprises the following steps:
Step 5.1: given a road section $m$ and a time slot $t$, obtain the set of vehicles in this road section;
Step 5.2: for each time slot $t=1,2,\dots,T$, perform steps 5.3-5.10;
Step 5.3: randomly select a set of $n$ vehicles;
Step 5.4: obtain the global parameters of roadside unit $m$ from the previous time slot;
Step 5.5: on each vehicle, update the local model parameters with the global parameters of roadside unit $m$ from the previous time slot;
Step 5.6: obtain the vehicle's local data;
Step 5.7: train the model with the vehicle's local data to obtain the vehicle's local training parameters and training time;
Step 5.8: upload the local model parameters and the training time to roadside unit $m$;
Step 5.9: roadside unit $m$ receives the vehicle parameters and carries out global aggregation;
Step 5.10: the aggregation produces an improved global model, which is redistributed to the terminal vehicles;
Step 5.11: the training model converges, and the federated learning training is completed.
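The global aggregation in step 5.9 can be sketched as a weighted parameter average in the style of FedAvg; weighting each vehicle by its local data size is an assumption made here for illustration, since the patent gives the aggregation rule only as a formula image.

```python
import numpy as np

def global_aggregation(local_params, data_sizes):
    # RSU-side aggregation: weighted average of the parameter vectors uploaded by
    # the selected vehicles, each weighted by its (assumed) local data size.
    total = float(sum(data_sizes))
    return sum((n / total) * np.asarray(p, dtype=float)
               for p, n in zip(local_params, data_sizes))
```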
A further development of the invention consists in that, before step 5.2, a step of initializing the model parameters of the roadside units and of the vehicle is also included.
In order to achieve the purpose of the invention, the invention further provides a task unloading scheduling system of the internet of vehicles, which is used for implementing the method of any one of the preceding claims.
A further development of the invention is that the system comprises a Road Side Unit (RSU) and an end vehicle.
In a further development of the invention, each vehicle and roadside unit has its own neural network training model, and vehicles on the same road section can complete distributed federated learning together with the RSU.
The invention has the following beneficial effects: the invention considers the joint communication and computation optimization problem in the Internet of Vehicles environment, designs a corresponding task queue and energy queue for each vehicle, and fully considers the computation and caching of computing tasks in the vehicle; by using federated learning, the invention provides a federated-learning-based task offloading scheduling scheme for the intelligent Internet of Vehicles, finds an effective task scheduling strategy, guarantees the requirements of delay-sensitive tasks, minimizes the delay loss, energy loss and service charge of the system, and protects the privacy of user vehicles by adopting a distributed training method.
Drawings
FIG. 1 is a schematic diagram of the Internet of vehicles task offloading system of the present invention.
FIG. 2 is a schematic diagram of a vehicle mission queue.
FIG. 3 is a flow diagram for solving the optimal task offloading schedule using a double deep Q network.
FIG. 4 is a flow chart of a deep reinforcement learning training based on federated learning.
Fig. 5 is a schematic representation of the federated learning process.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in detail with reference to the accompanying drawings and specific embodiments.
It should be emphasized that, in describing the present invention, formulas and constraints are identified with consistent labels; however, the use of different labels to identify the same formula and/or constraint is not precluded, and the labels are provided only for the purpose of illustrating the features of the present invention more clearly.
The invention designs a federated-learning-based task offloading scheduling system and method for the intelligent Internet of Vehicles. The system mainly comprises two types of equipment, namely roadside units and terminal vehicles. Considering the computation offloading problem in edge computing, a vehicle's computing task can be offloaded to a roadside unit for computation as well as computed locally on the vehicle, and the optimal offloading scheduling scheme must be found to minimize the total loss of the system.
The present invention is further described in detail with reference to the following embodiments. The entire road is divided into M disjoint road sections according to the coverage areas of the roadside units, a plurality of vehicles are located within the coverage area of one roadside unit, and the vehicles and the roadside units complete the computation and offloading of tasks through wireless links. As shown in FIG. 1, there are two vehicles within the coverage area of roadside unit RSU 2, each having $T_s$ task priority queues and an energy queue; the task priority queues are used for storing the tasks to be processed, and the energy queue provides the corresponding energy for the vehicle's computing tasks. Meanwhile, as shown on the right side of FIG. 1, each vehicle and roadside unit has its own neural network training model, and vehicles in the same road section can complete distributed federated learning together with the RSU.
Considering the dynamic nature of the Internet of Vehicles environment, the topology of the network and the states of the vehicle queues differ in each time slot. The state transition of the vehicle task queues is shown in FIG. 2, which represents how the tasks in the queues change as each time slot passes, while the change of the energy queue is obtained through the energy queue change equation. By calculating the energy consumption of the system and the delay of the computing tasks, an objective optimization function can be established, namely minimizing the total cost of system energy consumption and task processing under the delay constraint and the energy consumption constraint that guarantee the vehicle tasks in each time slot.
The invention finds the optimal task offloading scheduling scheme by means of a double deep Q network: a Markov decision process is designed from the system state space $S_t$, the action space $A_t$ and the state transition equations, a converged model is trained following the flow shown in FIG. 3, and the optimal task offloading decision $\pi^{*}$ is found.
Distributed federated learning is adopted in the training process of deep reinforcement learning, and the federated learning process of one road section in the Internet of Vehicles is shown in FIG. 5. First, the terminal vehicle downloads an initial training model from the RSU and performs local model training using its own data to minimize the predefined loss function, then updates the trained model weights to the RSU via encrypted transmission. The RSU then collects the updated parameters from the terminal vehicles to produce an improved global model, i.e. global aggregation. Finally, the output of the RSU training model is redistributed to the terminal vehicles, and the terminals perform further local training using the global model as a reference. The training process is repeated until a specified number of iterations is reached. Federated learning reduces the communication load of edge-cloud interaction and protects the privacy of the terminal vehicles.
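From the vehicle's point of view, one round of the process above amounts to downloading the global parameters and taking a few local gradient steps before uploading. A minimal sketch, assuming plain gradient descent with an illustrative learning rate; the gradients are stand-ins for the DDQN gradients of step 4.14 and are not computed here.

```python
import numpy as np

def vehicle_local_update(global_params, local_grads, lr=0.01, steps=5):
    # Start from the downloaded global model, take a few gradient steps on the
    # locally computed gradients, and return the parameters for upload to the RSU.
    params = np.asarray(global_params, dtype=float).copy()
    for g in local_grads[:steps]:
        params -= lr * np.asarray(g, dtype=float)
    return params
```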
The invention provides a federated-learning-based task offloading scheduling method for the intelligent Internet of Vehicles, which specifically comprises the following steps:
Step 1: considering the communication model and the computation model in the Internet of Vehicles system, designing a queue model for each vehicle;
Step 2: considering the energy consumption constraint and the delay constraint, designing a system objective function;
Step 3: modeling task offloading scheduling as a Markov decision process;
Step 4: solving the optimal task offloading schedule based on a double deep Q network;
Step 5: carrying out deep reinforcement learning training based on federated learning.
The following will explain the task unloading scheduling method of the internet of vehicles in detail.
In the invention, the set of vehicles is $V=\{v_1,v_2,\dots,v_n\}$, and each vehicle $v_i\in V$ travels on the road at speed $S_i$. Each vehicle is provided with several task queues and an energy queue; the task queues are used for storing the tasks to be processed, and the energy queue provides the corresponding energy for the vehicle's computing tasks. The set of roadside units (RSUs) is $G=\{R_1,R_2,\dots,R_M\}$, and the communication range of each roadside unit $R_m\in G$ is a circle of diameter $d_m$. A vehicle can only communicate with one roadside unit (RSU) in a time slot, and the set of vehicles within the communication range of the same $R_m$ is denoted $V_i^m$, indicating that vehicles within the communication range of the RSU can communicate with the corresponding RSU via V2I. The whole road is divided into M road sections according to the communication ranges of the M RSUs.
To handle the time-varying nature of the Internet of Vehicles, a time-slicing technique is adopted to divide the total time into N equal time slots $\tau$, where $\tau$ is a small time interval. The system state is assumed to remain unchanged within each slot. Each $R_m$ communicates via V2I with the vehicles $V_i^m$ in its communication range to jointly complete the computation offloading of tasks.
The set of task types is $\{a_1,a_2,\dots,a_K\}$, and the generation probability of the $k$-th type of task in each time slot is $\lambda_k$. Each task is represented by a triple $a_k=\langle I_k,c_k,T_k\rangle$, where $I_k$, $c_k$ and $T_k$ respectively denote the data size of task $a_k$, the number of CPU cycles required to compute the task, and the task delay constraint. The decision variable $x_{i,k}^{t,m}=1$ indicates that in time slot $t$ task $k$ of vehicle $v_i$ is offloaded to roadside unit $R_m$ for computation, the decision variable $y_{i,k}^{t}=1$ indicates that in time slot $t$ task $k$ is computed locally by vehicle $v_i$, and when both are zero task $k$ remains in the vehicle's task queue during time slot $t$.
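For concreteness, the task triple and the per-slot decision can be represented as follows; the class and member names are illustrative only and are not notation used by the patent.

```python
from dataclasses import dataclass
from enum import Enum

@dataclass(frozen=True)
class TaskType:
    # a_k = <I_k, c_k, T_k>: data size, required CPU cycles, delay limit in slots.
    I_k: float
    c_k: float
    T_k: int

class SlotDecision(Enum):
    # Per-slot decision for task k of vehicle i (the binary variables x and y).
    OFFLOAD_TO_RSU = "x=1"   # offloaded to roadside unit R_m for computation
    COMPUTE_LOCALLY = "y=1"  # computed on the vehicle's own CPU
    KEEP_IN_QUEUE = "x=y=0"  # remains in the vehicle's task queue this slot
```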
Considering the communication model of the system, the wireless communication rate $r_i^m$ between the vehicle and the roadside unit can be calculated; from this communication rate, the uplink transmission delay $t_u(i,m,k)$ when task $k$ is offloaded to the roadside unit can be calculated. Considering the computation model of the system, the delay $t_c(i,k)$ of computing task $k$ locally in the vehicle and the computation time $t_m(k)$ of offloading the task to the roadside unit can be calculated, so that the total time for vehicle $i$ to process task $k$ is obtained, together with the energy consumption of local computation on the vehicle and the energy consumption of uplink transmission. Each vehicle has $T_s$ task priority queues, which store the tasks to be processed, and an energy queue, which provides the corresponding energy for the vehicle's computing tasks. For the tasks newly generated in each time slot, the initial priority $pr(i,k)$ is calculated according to the data size and the remaining processing time of the task, and the tasks are sorted by initial priority from small to large and placed into the vehicle task queue with the corresponding index. The vehicle task queue changes as shown in FIG. 2: for a task queue $l$, there are two input sources, namely the tasks received by the vehicle itself in time slot $t$ and the tasks transferred from the previous time slot, i.e. the leftover tasks in task queue $l+1$; there are three output destinations, namely the tasks transmitted by the vehicle to the roadside unit for computation through V2I communication in time slot $t$, the tasks transferred to task queue $l-1$ as the time slot elapses, and the tasks computed locally. The change of the vehicle energy queue is obtained from the energy change formula.
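The queue dynamics just described, i.e. the per-slot shift toward the queue with index 1 and the priority-based insertion of new tasks, can be sketched as below; representing each queue as a plain list and mapping the initial priority directly to a queue index are simplifying assumptions.

```python
def advance_task_queues(queues):
    # One time slot passes: whatever is still in queue 1 has exceeded its delay
    # limit; every other queue l+1 shifts down to queue l; the last queue empties.
    overdue = list(queues[0])
    shifted = [list(q) for q in queues[1:]] + [[]]
    return shifted, overdue

def enqueue_new_task(queues, task, priority_index):
    # A newly generated task is placed into the queue whose index comes from its
    # initial priority pr(i, k); smaller values are served sooner (assumed mapping).
    idx = min(max(priority_index, 1), len(queues)) - 1
    queues[idx].append(task)
```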
Therefore, considering the communication model and the computation model in the Internet of Vehicles system, step 1 of designing the queue model for each vehicle specifically includes the following:
Step 1.1: calculate the wireless communication rate $r_i^m$ between the vehicle and the roadside unit; when task $k$ is offloaded to the roadside unit for computation, the uplink transmission delay between the vehicle and the RSU is $t_u(i,m,k)=\frac{I_k}{r_i^m}$;
Step 1.2: when the task is offloaded to the RSU for computation, the computation time of offloading task $k$ to RSU $m$ is $t_m(k)=\frac{I_k c_k |V_i^m|}{F_m}$; the transmission energy consumed by offloading to the edge server in one time slot is the product of the amount of data transmitted in that slot and the energy consumption per unit of data; when the task is computed locally in the vehicle, the local computation delay is $t_c(i,k)=\frac{I_k c_k}{f_i^{local}}$ and the local computation energy is $e_c(i,k)=P_i^{local}\,t_c(i,k)$;
Step 1.3: calculate the total time for vehicle $i$ to process task $k$;
Step 1.4: the vehicle task queues are numbered $\{1,2,\dots,l,\dots,T_s\}$, where $T_s$ is the maximum delay limit over all task types, i.e. $T_s=\max\{T_k, k\in\{1,2,\dots,K\}\}$;
Step 1.5: calculate the initial priority $pr(i,k)$ of each task $k$ generated in vehicle $i$ in each time slot;
Step 1.6: update the vehicle energy queue according to the energy change formula.
For tasks exceeding the delay limit we introduce a penalty factor $W_1$: the tasks in the task queue with index 1 have only one time slot of processing time left, so if they cannot be processed in time they exceed the delay limit of the task, and the amount of tasks $h_t$ exceeding the delay limit in time slot $t$ is calculated accordingly. The total cost for vehicle $i$ to process task $k$ in time slot $t$ consists of the delay, the energy consumption and the service charges, the latter mainly including the service charge $cs_{k,m}$ of the roadside unit and the local computation cost $cs_{k,i}$ of the vehicle. The aim of the invention is to minimize the total system loss while guaranteeing the delay constraint and the energy consumption constraint of the vehicle tasks in each time slot.
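One way to picture the per-task cost just described is the following sketch, which selects the delay, energy and service-charge terms according to the offloading decision; the additive composition and the placement of the weight $W_2$ are assumptions for illustration only.

```python
def task_cost(offloaded, t_u, t_m, t_c, e_u, e_c, cs_rsu, cs_local, W2=1.0):
    # Cost of handling task k in slot t: delay plus weighted energy plus the
    # service charge, with the terms chosen by the offloading decision.
    if offloaded:
        return (t_u + t_m) + W2 * e_u + cs_rsu
    return t_c + W2 * e_c + cs_local
```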
Therefore, considering the limitations of the energy consumption constraint and the delay constraint, step 2 of designing the system objective function specifically includes the following steps:
Step 2.1: calculate the amount of type-$k$ tasks processed locally by vehicle $i$ in one time slot;
Step 2.2: the delay constraint of a delay-sensitive task is $T_k$; since a task exceeding the delay limit cannot be completed, a penalty mechanism is introduced that imposes a certain penalty on the amount of tasks exceeding the delay limit, with penalty factor $W_1$. The tasks in the task queue with index 1 have only one time slot of processing time left, and if they cannot be processed in time they exceed the delay limit of the task; $h_t$ is the amount of tasks exceeding the delay limit in time slot $t$, determined by the amount of tasks in the queue with index 1 of vehicle $i$'s task queues in time slot $t$;
Step 2.3: the goal of the system is to minimize the total cost of system energy consumption and task processing while guaranteeing the delay constraint and the energy consumption constraint of the vehicle tasks in each time slot, so the system objective function minimizes this total cost (1), subject to:
$x_{i,k}^{t,m},\, y_{i,k}^{t} \in \{0,1\}$ (2)
each vehicle can only select one decision variable in one time slot (3)
$f_i \le f_{max}$ (4)
the task delay constraint of each time slot (5)
the task energy consumption constraint of each time slot (6)
where $W_2$ is a weight constant; constraint (2) represents that the offloading decision variables can only take the value 0 or 1; constraint (3) represents that each vehicle can only select one decision variable in one time slot; constraint (4) represents the vehicle CPU frequency constraint; constraint (5) represents the task delay constraint of each time slot; constraint (6) represents the task energy consumption constraint of each time slot.
Further, step 3 of modeling task offloading scheduling as a Markov decision process comprises the following:
Step 3.1: design the system state space $S_t$, which comprises the state of the vehicles in the $m$-th road section in time slot $t$, where $X_t$ represents the position of the vehicle at time $t$ and can be determined from its position in the previous slot and the vehicle speed, the number of vehicles in the vehicle set of road section $m$ in time slot $t$, and the state of each task queue and of the energy queue of vehicle $i$ in time slot $t$;
Step 3.2: design the system action space $A_t$, which describes the task offloading decision space of a vehicle, i.e. the action space; its elements are the offloading decision variables $x_{i,k}^{t,m}$ ($1 \le m \le M$) and $y_{i,k}^{t}$, which determine whether the $k$-th task of the vehicle is executed locally, offloaded to an RSU (roadside unit), or kept in the vehicle's task queue;
Step 3.3: energy queue state transition: the energy queue of the next time slot is obtained from the current energy queue and the energy consumed in the current time slot;
Step 3.4: task queue state transition: the amount of tasks offloaded through V2I communication in time slot $t$, the amount of tasks computed locally by the vehicle in time slot $t$, and the amount of tasks generated by the vehicle in the time slot are calculated, where $l$ is the index of the task queue and an arriving task is placed into the task queue whose index is determined by its initial priority;
Step 3.5: design the loss function of the system; in time slot $t$, with the system in state $S_t$ and the vehicle taking action $A_t$, the loss is $Loss_t(S_t,A_t)$;
Step 3.6: the optimal scheduling policy of the system is $\pi^{*}=\arg\min_{\pi}\mathbb{E}\left[\sum_{t}\eta^{t}\,Loss_t(S_t,A_t)\right]$, where $0 < \eta < 1$ is a discount factor that indicates the impact of future losses on the current operation.
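The per-queue bookkeeping behind the task queue transition of step 3.4 can be sketched as a simple balance; treating the quantities as scalar task amounts is an assumption for illustration.

```python
def next_queue_amount(carried_over, newly_generated, offloaded_v2i, computed_locally):
    # What remains for the next slot is the carried-over amount plus new arrivals,
    # minus what was offloaded over V2I and what was computed locally (floored at 0).
    return max(carried_over + newly_generated - offloaded_v2i - computed_locally, 0.0)
```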
Further, as shown in FIG. 3, step 4 of solving the optimal task offloading schedule based on the double deep Q network includes the following steps:
Step 4.1: initialize the weights $\theta$ of the double deep Q network, the values of the Q function, and the experience replay buffer;
Step 4.2: give the initial state $S_0$ of the Internet of Vehicles system;
Step 4.3: for each time slot $t=0,1,\dots,t_{max}$, execute steps 4.4-4.15;
Step 4.4: in state $S_t$, compute the expected minimum loss representation (Q value) of taking action $A_t$;
Step 4.5: compute the optimal decision of the deep Q network, i.e. the action that minimizes the Q value in state $S_t$;
Step 4.6: draw a probability value $p$ uniformly at random;
Step 4.7: if $p \le \varepsilon$, choose a random action $A_t$;
Step 4.8: if $p > \varepsilon$, select the action $A_t=\arg\min_{A}Q(S_t,A;\theta_t)$;
Step 4.9: perform action $A_t$ and obtain the state $S_{t+1}$ of the next time slot;
Step 4.10: calculate the loss function $Loss_t$;
Step 4.11: put the experience $(S_t,A_t,Loss_t,S_{t+1})$ into the experience replay buffer;
Step 4.12: randomly draw a batch of experiences from the buffer as training samples and compute the double deep Q function target, in which the online network selects the action for the next state and the target network evaluates it;
Step 4.13: compute the loss $L(\theta_t)$ between the double deep Q target and the current Q value estimate;
Step 4.14: calculate the gradient of $L(\theta_t)$;
Step 4.15: update $\theta_t$ based on the gradient descent method;
Step 4.16: the training model converges, and the trained model is obtained.
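A compact sketch of the experience pool of steps 4.11-4.12 and the epsilon-greedy selection of steps 4.6-4.8; the buffer capacity, batch handling and the use of argmin (losses are minimized) are illustrative choices, not values prescribed by the patent.

```python
import random
from collections import deque

import numpy as np

class ReplayBuffer:
    # Experience pool of (S_t, A_t, Loss_t, S_{t+1}) tuples.
    def __init__(self, capacity=10000):
        self.buffer = deque(maxlen=capacity)

    def push(self, s, a, loss, s_next):
        self.buffer.append((s, a, loss, s_next))

    def sample(self, batch_size):
        return random.sample(list(self.buffer), min(batch_size, len(self.buffer)))

def epsilon_greedy(q_values, epsilon):
    # With probability epsilon explore; otherwise take the action with the
    # smallest predicted loss.
    if random.random() <= epsilon:
        return random.randrange(len(q_values))
    return int(np.argmin(q_values))
```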
further, as shown in fig. 4, for the model training process in each road segment, the step 5 of performing deep reinforcement learning training based on federal learning specifically includes the following steps:
step 5.1: given a road section m, and a time slot t, the set of vehicles in this road section
Figure BDA0003412539310000176
Step 5.2: performing steps 5.3-5.10 for time slot T ═ 1, 2., T;
step 5.3: randomly selecting n vehicle sets
Figure BDA0003412539310000177
Step 5.4: obtaining the global parameter of a last time slot of the roadside unit m:
Figure BDA0003412539310000178
step 5.5: using global parameters of a time slot on each vehicular roadside unit mTo update the model local parameters:
Figure BDA0003412539310000179
step 5.6: obtaining vehicle local data:
Figure BDA0003412539310000181
step 5.7: utilizing vehicle local data
Figure BDA0003412539310000182
Training the model to obtain vehicle local training parameters and time:
Figure BDA0003412539310000183
step 5.8: uploading model parameters
Figure BDA0003412539310000184
And
Figure BDA0003412539310000185
to the roadside unit m;
step 5.9: the roadside unit m receives the vehicle parameters to carry out global aggregation:
Figure BDA0003412539310000186
step 5.10: aggregating to produce an improved global model and reassigning to the end vehicles;
step 5.11: and (5) the training model is converged, and the federal learning training is completed.
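Putting steps 5.1-5.10 together for one road section, a schematic training loop might look as follows; the helpers local_train and aggregate and the vehicle objects are hypothetical stand-ins for the vehicle-side DDQN update and the RSU-side aggregation, not interfaces defined by the patent.

```python
import random

def federated_training_on_segment(rsu_params, vehicles, n, T, local_train, aggregate):
    # One road section m: in every slot, n randomly chosen vehicles pull the RSU's
    # last-slot global parameters, train locally, and the RSU aggregates the uploads.
    for t in range(1, T + 1):
        selected = random.sample(vehicles, min(n, len(vehicles)))
        uploads = [local_train(v, list(rsu_params)) for v in selected]
        rsu_params = aggregate(uploads)  # improved global model for the next slot
    return rsu_params
```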
Of course, a step of initializing the model parameters of the roadside units and the vehicle is also included before step 5.2.
In conclusion, the invention considers the joint communication and computation optimization problem in the Internet of Vehicles environment, designs a corresponding task queue and energy queue for each vehicle, and fully considers the computation and caching of computing tasks in the vehicle. In addition, by using federated learning, the invention provides a federated-learning-based task offloading scheduling scheme for the intelligent Internet of Vehicles, finds an effective task scheduling strategy, guarantees the requirements of delay-sensitive tasks, minimizes the delay loss, energy loss and service charge of the system, and protects the privacy of user vehicles by adopting a distributed training method.
Although the present invention has been described in detail with reference to the preferred embodiments, it will be understood by those skilled in the art that various changes may be made and equivalents may be substituted for elements thereof without departing from the spirit and scope of the present invention.

Claims (10)

1. A task offloading scheduling method for the Internet of Vehicles, characterized in that, in the Internet of Vehicles, the whole road is divided into M mutually disjoint road sections according to the coverage areas of the roadside units, a plurality of vehicles are located within the coverage area of one roadside unit, and the vehicles and the roadside units complete the computation and offloading of tasks through wireless links, the method comprising the following steps:
Step 1: considering the communication model and the computation model in the Internet of Vehicles, designing a queue model for each vehicle;
Step 2: considering the energy consumption constraint and the delay constraint, designing a system objective function;
Step 3: modeling task offloading scheduling as a Markov decision process;
Step 4: solving the optimal task offloading schedule based on a double deep Q network;
Step 5: carrying out deep reinforcement learning training based on federated learning.
2. The method of claim 1, wherein step 1 comprises the following steps:
Step 1.1: calculate the wireless communication rate between the vehicle and the roadside unit, $r_i^m = B\log_2\left(1+\frac{P_i L_0 d_{i,m}^{-\alpha}}{P_w}\right)$; when task $k$ is offloaded to the roadside unit for computation, the uplink transmission delay between the vehicle and the RSU is $t_u(i,m,k)=\frac{I_k}{r_i^m}$, where $L_0$ is the path loss, $P_i$ is the transmit power of vehicle $v_i$, $P_w$ is the Gaussian white noise power, $\alpha$ is the path loss exponent, $d_{i,m}$ is the distance between the vehicle and the RSU, and $B$ is the channel bandwidth;
Step 1.2: each computation task may either be computed locally in the vehicle or offloaded to the RSU for computation; when the task is offloaded to the RSU, the computation time of offloading task $k$ to RSU $m$ is $t_m(k)=\frac{I_k c_k |V_i^m|}{F_m}$, where $F_m$ is the CPU frequency of edge server $m$ and $|V_i^m|$ is the number of vehicles in the vehicle set of roadside unit $m$; the transmission energy consumed by offloading to the edge server in one time slot is the product of the amount of data transmitted in that slot and the energy consumption per unit of data; when the task is computed locally in the vehicle, the local computation delay is $t_c(i,k)=\frac{I_k c_k}{f_i^{local}}$ and the local computation energy is $e_c(i,k)=P_i^{local}\,t_c(i,k)$, where $f_i^{local}$ and $P_i^{local}$ are the CPU frequency and power of the vehicle, respectively;
Step 1.3: calculate the total time for vehicle $i$ to process task $k$;
Step 1.4: each vehicle has $T_s$ task priority queues, where $T_s$ is the maximum delay limit over all task types, i.e. $T_s=\max\{T_k, k\in\{1,2,\dots,K\}\}$; the capacity of the vehicle task queues is defined per time slot, i.e. the maximum queue capacity ensures that the task queue can hold any task arriving in any time slot, and the task queues are numbered $\{1,2,\dots,l,\dots,T_s\}$;
Step 1.5: calculate the initial priority $pr(i,k)$ of each task $k$ generated in vehicle $i$ in each time slot; tasks with smaller initial priority values are processed first;
Step 1.6: update the vehicle energy queue, whose change in each time slot is determined by the energy consumed by local computation and uplink transmission in that slot;
wherein the set of $n$ vehicles is $V=\{v_1,v_2,\dots,v_n\}$, and each vehicle $v_i\in V$ travels on the road at speed $S_i$; the set of $M$ roadside units is $G=\{R_1,R_2,\dots,R_M\}$, and the communication range of each roadside unit $R_m\in G$ is a circle of diameter $d_m$; the set of vehicles within the communication range of the same $R_m$ is denoted $V_i^m$, indicating that the vehicles within the communication range of an RSU can communicate with the corresponding RSU via V2I; the total time is divided into $N$ equal time slots $\tau$; there are $K$ different types of tasks, each type with a different generation probability, and the generation probability of the $k$-th type of task in each time slot is $\lambda_k$, where $k$ indexes the task types; each task is represented by a triple $a_k=\langle I_k,c_k,T_k\rangle$, where $I_k$, $c_k$ and $T_k$ respectively denote the data size of task $a_k$, the number of CPU cycles required to compute the task, and the task delay limit; $x_{i,k}^{t,m}$ and $y_{i,k}^{t}$ are binary decision variables: $x_{i,k}^{t,m}=1$ indicates that in time slot $t$ task $k$ of vehicle $v_i$ is offloaded to roadside unit $R_m$ for computation, $y_{i,k}^{t}=1$ indicates that in time slot $t$ task $k$ is computed locally by vehicle $v_i$, and when both are zero task $k$ remains in the vehicle's task queue during time slot $t$.
3. The method of claim 2, wherein step 2 comprises the following steps:
Step 2.1: calculate the amount of type-$k$ tasks processed locally by vehicle $i$ in one time slot;
Step 2.2: calculate the amount of tasks $h_t$ exceeding the delay limit in time slot $t$, which is determined by the amount of tasks in the queue with index 1 of vehicle $i$'s task queues in time slot $t$;
Step 2.3: minimize the total cost of system energy consumption and task processing; the system objective function minimizes this total cost (1), subject to:
$x_{i,k}^{t,m},\, y_{i,k}^{t} \in \{0,1\}$ (2)
each vehicle can only select one decision variable in one time slot (3)
$f_i \le f_{max}$ (4)
the task delay constraint of each time slot (5)
the task energy consumption constraint of each time slot (6)
where $W_2$ is a weight constant; constraint (2) represents that the offloading decision variables can only take the value 0 or 1; constraint (3) represents that each vehicle can only select one decision variable in one time slot; constraint (4) represents the vehicle CPU frequency constraint; constraint (5) represents the task delay constraint of each time slot; constraint (6) represents the task energy consumption constraint of each time slot.
4. The method of claim 3, wherein step 3 comprises the following steps:
Step 3.1: design the system state space $S_t$, which comprises the state of the vehicles in the $m$-th road section in time slot $t$, where $X_t$ denotes the position of the vehicle at time $t$, obtained from its position in the previous slot and the vehicle speed, the number of vehicles in the vehicle set of road section $m$ in time slot $t$, and the state of each task queue and of the energy queue of vehicle $i$ in time slot $t$;
Step 3.2: design the system action space $A_t$, which describes the task offloading decision space of a vehicle, i.e. the action space; its elements are the offloading decision variables $x_{i,k}^{t,m}$ ($1 \le m \le M$) and $y_{i,k}^{t}$;
Step 3.3: energy queue state transition: the energy queue of the next time slot is obtained from the current energy queue and the energy consumed in the current time slot;
Step 3.4: task queue state transition: the task queues of the next time slot are obtained from the current queues by removing the tasks offloaded through V2I communication or computed locally, shifting the remaining tasks to the queue with the next lower index, and inserting the newly generated tasks according to their initial priority;
Step 3.5: design the loss function of the system; in time slot $t$, with the system in state $S_t$ and the vehicle taking action $A_t$, the loss is $Loss_t(S_t,A_t)$;
Step 3.6: the optimal scheduling policy of the system is $\pi^{*}=\arg\min_{\pi}\mathbb{E}\left[\sum_{t}\eta^{t}\,Loss_t(S_t,A_t)\right]$, where $0 < \eta < 1$ is a discount factor that indicates the impact of future losses on the current operation.
5. The method of claim 4, wherein step 4 comprises the following steps:
Step 4.1: initialize the weights $\theta$ of the double deep Q network, the values of the Q function, and the experience replay buffer;
Step 4.2: give the initial state $S_0$ of the Internet of Vehicles system;
Step 4.3: for each time slot $t=0,1,\dots,t_{max}$, execute steps 4.4-4.15;
Step 4.4: in state $S_t$, compute the expected minimum loss representation (Q value) of taking action $A_t$;
Step 4.5: compute the optimal decision of the deep Q network, i.e. the action that minimizes the Q value in state $S_t$;
Step 4.6: draw a probability value $p$ uniformly at random;
Step 4.7: if $p \le \varepsilon$, choose a random action $A_t$;
Step 4.8: if $p > \varepsilon$, select the action $A_t=\arg\min_{A}Q(S_t,A;\theta_t)$;
Step 4.9: perform action $A_t$ and obtain the state $S_{t+1}$ of the next time slot;
Step 4.10: calculate the loss function $Loss_t$;
Step 4.11: put the experience $(S_t,A_t,Loss_t,S_{t+1})$ into the experience replay buffer;
Step 4.12: randomly draw a batch of experiences from the buffer as training samples and compute the double deep Q function target, in which the online network selects the action for the next state and the target network evaluates it;
Step 4.13: compute the loss $L(\theta_t)$ between the double deep Q target and the current Q value estimate;
Step 4.14: calculate the gradient of $L(\theta_t)$;
Step 4.15: update $\theta_t$ based on the gradient descent method;
Step 4.16: the training model converges, and the trained model is obtained.
6. The method of claim 5, wherein step 5 comprises the following steps:
Step 5.1: given a road section $m$ and a time slot $t$, obtain the set of vehicles in this road section;
Step 5.2: for each time slot $t=1,2,\dots,T$, perform steps 5.3-5.10;
Step 5.3: randomly select a set of $n$ vehicles;
Step 5.4: obtain the global parameters of roadside unit $m$ from the previous time slot;
Step 5.5: on each vehicle, update the local model parameters with the global parameters of roadside unit $m$ from the previous time slot;
Step 5.6: obtain the vehicle's local data;
Step 5.7: train the model with the vehicle's local data to obtain the vehicle's local training parameters and training time;
Step 5.8: upload the local model parameters and the training time to roadside unit $m$;
Step 5.9: roadside unit $m$ receives the vehicle parameters and carries out global aggregation;
Step 5.10: the aggregation produces an improved global model, which is redistributed to the terminal vehicles;
Step 5.11: the training model converges, and the federated learning training is completed.
7. The method of claim 6, wherein: before step 5.2, the method also comprises the step of initializing model parameters of the roadside units and the vehicles.
8. An Internet of Vehicles task offloading scheduling system, characterized in that the system is configured to carry out the method of any one of claims 1 to 7.
9. The system of claim 8, wherein: the system comprises roadside units (RSUs) and terminal vehicles; the whole road is divided into M mutually disjoint road sections according to the coverage areas of the roadside units, a plurality of vehicles are located within the coverage area of one roadside unit, and the vehicles and the roadside units complete the computation and offloading of tasks through wireless links, wherein each vehicle has $T_s$ task priority queues and an energy queue, the task priority queues are used for storing the tasks to be processed, and the energy queue provides the corresponding energy for the vehicle's computing tasks.
10. The system of claim 9, wherein: each vehicle and roadside unit has its own neural network training model, and vehicles in the same road section can complete distributed federated learning together with the RSU.
CN202111535739.9A 2021-12-15 2021-12-15 Internet of vehicles task unloading scheduling method and system Pending CN114268923A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111535739.9A CN114268923A (en) 2021-12-15 2021-12-15 Internet of vehicles task unloading scheduling method and system


Publications (1)

Publication Number Publication Date
CN114268923A 2022-04-01

Family

ID=80827399

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111535739.9A Pending CN114268923A (en) 2021-12-15 2021-12-15 Internet of vehicles task unloading scheduling method and system

Country Status (1)

Country Link
CN (1) CN114268923A (en)


Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114548608A (en) * 2022-04-26 2022-05-27 腾讯科技(深圳)有限公司 Model processing method and device, target traffic equipment and storage medium
CN114980029A (en) * 2022-05-20 2022-08-30 重庆邮电大学 Unloading method based on task relevance in Internet of vehicles
CN114860345A (en) * 2022-05-31 2022-08-05 南京邮电大学 Cache-assisted calculation unloading method in smart home scene
CN114860345B (en) * 2022-05-31 2023-09-08 南京邮电大学 Calculation unloading method based on cache assistance in smart home scene
CN115756873A (en) * 2022-12-15 2023-03-07 北京交通大学 Mobile edge computing unloading method and platform based on federal reinforcement learning
CN115756873B (en) * 2022-12-15 2023-10-13 北京交通大学 Mobile edge computing and unloading method and platform based on federation reinforcement learning
CN116506829A (en) * 2023-04-25 2023-07-28 江南大学 Federal edge learning vehicle selection method based on C-V2X communication
CN116506829B (en) * 2023-04-25 2024-05-10 广东北斗烽火台卫星定位科技有限公司 Federal edge learning vehicle selection method based on C-V2X communication

Similar Documents

Publication Publication Date Title
CN114268923A (en) Internet of vehicles task unloading scheduling method and system
Yu et al. Toward resource-efficient federated learning in mobile edge computing
CN112601197B (en) Resource optimization method in train-connected network based on non-orthogonal multiple access
CN109756378B (en) Intelligent computing unloading method under vehicle-mounted network
CN111918245B (en) Multi-agent-based vehicle speed perception calculation task unloading and resource allocation method
CN112104502B (en) Time-sensitive multitask edge computing and cache cooperation unloading strategy method
Huang et al. Vehicle speed aware computing task offloading and resource allocation based on multi-agent reinforcement learning in a vehicular edge computing network
US20220217792A1 (en) Industrial 5g dynamic multi-priority multi-access method based on deep reinforcement learning
CN113132943B (en) Task unloading scheduling and resource allocation method for vehicle-side cooperation in Internet of vehicles
CN113032904B (en) Model construction method, task allocation method, device, equipment and medium
CN109753751A (en) A kind of MEC Random Task moving method based on machine learning
CN110557732A (en) vehicle edge computing network task unloading load balancing system and balancing method
CN111915142B (en) Unmanned aerial vehicle auxiliary resource allocation method based on deep reinforcement learning
CN113641417B (en) Vehicle security task unloading method based on branch-and-bound method
CN114973673B (en) Task unloading method combining NOMA and content cache in vehicle-road cooperative system
CN114884949B (en) Task unloading method for low-orbit satellite Internet of things based on MADDPG algorithm
CN112153145A (en) Method and device for unloading calculation tasks facing Internet of vehicles in 5G edge environment
CN111352713B (en) Automatic driving reasoning task workflow scheduling method oriented to time delay optimization
CN115629873A (en) System and method for controlling unloading of vehicle-road cloud cooperative tasks and stability of task queue
CN114189869A (en) Unmanned vehicle collaborative path planning and resource allocation method based on edge calculation
CN114363803A (en) Energy-saving multi-task allocation method and system for mobile edge computing network
Vishnoi et al. Deep reinforcement learning based throughput maximization scheme for d2d users underlaying noma-enabled cellular network
CN114629769B (en) Traffic map generation method of self-organizing network
CN115052262A (en) Potential game-based vehicle networking computing unloading and power optimization method
CN114928611A (en) Internet of vehicles energy-saving calculation unloading optimization method based on IEEE802.11p protocol

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination