CN113326076A - Vehicle-mounted fog-assisted vehicle fleet task unloading method based on semi-Markov decision process - Google Patents

Vehicle-mounted fog-assisted vehicle fleet task unloading method based on semi-Markov decision process Download PDF

Info

Publication number
CN113326076A
CN113326076A CN202110594462.0A CN202110594462A CN113326076A CN 113326076 A CN113326076 A CN 113326076A CN 202110594462 A CN202110594462 A CN 202110594462A CN 113326076 A CN113326076 A CN 113326076A
Authority
CN
China
Prior art keywords
vehicle
task
fleet
fog
semi
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110594462.0A
Other languages
Chinese (zh)
Other versions
CN113326076B (en
Inventor
吴琼
王思远
葛红梅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangnan University
Original Assignee
Jiangnan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangnan University filed Critical Jiangnan University
Priority to CN202110594462.0A priority Critical patent/CN113326076B/en
Publication of CN113326076A publication Critical patent/CN113326076A/en
Application granted granted Critical
Publication of CN113326076B publication Critical patent/CN113326076B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/445Program loading or initiating
    • G06F9/44594Unloading
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/12Protocols specially adapted for proprietary or special-purpose networking environments, e.g. medical networks, sensor networks, networks in vehicles or remote metering networks

Abstract

The invention relates to a vehicle-mounted fog-assisted vehicle fleet task unloading method based on a semi-Markov decision process. The invention simultaneously considers factors such as sending time delay, calculating time delay and the like in task unloading and establishes a task unloading model based on a semi-Markov decision process. Then, a system state set and an action set are respectively defined, a system state transition probability formula and a system reward function are deduced, and then an SMDP model is solved by using a value iterative algorithm based on a Bellman equation to obtain an optimal task unloading strategy. The scheme has moderate computational complexity and reasonable system model, and fully considers how the tasks are distributed and various time delays involved in the task unloading process. Simulation results show that the scheme can obtain larger long-term benefits of the system on the premise of ensuring the task unloading time delay.

Description

Vehicle-mounted fog-assisted vehicle fleet task unloading method based on semi-Markov decision process
Technical Field
The invention relates to the technical field of vehicle-mounted task unloading, in particular to a vehicle-mounted fog-assisted vehicle fleet task unloading method based on a semi-Markov decision process.
Background
Along with the development of science and technology, unmanned motorcades are more and more popularized, so that the safety of roads is improved. Wherein each fleet consists of a head vehicle and several member vehicles. Specifically, the head vehicle controls the speed, acceleration, and direction of travel of the entire fleet, and the member vehicles travel on the same lane one by one following the head vehicle at the same speed. Unmanned vehicles are typically equipped with cameras, radars, etc., which produce a large amount of redundant data. At this time, the vehicle is required to calculate and analyze the data in order to extract useful information. From Intel corporation's data, a vehicle needs to compute, analyze and fuse large amounts of sensor data (about 1GB/s) in order to be able to make safety decisions. However, the computing power of a vehicle is limited, and the vehicle may be processing other tasks when a new task arrives, so that the computing resources of the vehicle are occupied, and the task needs to be unloaded to other vehicles in the fleet for cooperative processing, and then the result of the computation is returned to the vehicle. However, because the number of vehicles in the fleet is limited, the resource shortage of the fleet can occur, and at this time, the task cannot be processed in time, so that the processing result cannot be received in time, and the vehicles cannot make further safety decisions.
A vehicle-mounted fog computing (VFC) system can provide powerful computing resources to solve the above problems. VFC systems consist of a number of vehicles in the road, each of which is capable of providing computing resources to assist in processing tasks. When the resource of the fleet is short, the vehicles in the fleet can select to transmit the task to the head vehicle, the head vehicle unloads the task to the vehicles running in the VFC system beside the fleet for cooperation processing, then the vehicles in the VFC system return the calculation result to the head vehicle, and the head vehicle forwards the calculation result to the corresponding vehicle. The VFC system can provide various low latency and real-time applications to vehicles in a fleet of vehicles when fleet resources are tight, thereby greatly increasing network capacity. In the task unloading process, the communication modes of the vehicles all adopt an IEEE 802.11p distributed coordination function. For an automatic driving vehicle, the time delay of task unloading is of great importance, and the fact that the time delay is too long means that the vehicle can receive the calculation result after the task is unloaded, so that the vehicle cannot timely make corresponding reactions according to the calculation result, and the probability of safety accidents is greatly increased. The offload delay includes transmit delay, compute delay, and backhaul delay. The sending delay is the time occupied by the sending task, the calculating delay is the time occupied by the calculating resource processing task, and the returning delay is the time occupied by the feedback calculating result.
In a fleet of vehicles, there are typically sufficient computing resources so that one vehicle is sufficient to handle one computing task. However, each vehicle in the fleet has different computing resources, and therefore, the computing time delay generated when different vehicles process tasks is different.
In on-vehicle fog, the vehicle is generally composed of a private car. The computational resources of each vehicle are limited and approximately the same size. The present invention selects one or more vehicles for processing when offloading the task to VFC processing. Specifically, the head vehicle divides the task into several subtasks with the same size, and the 802.11pDCF protocol is adopted to transmit the subtasks to the corresponding several vehicles in the VFC, respectively. Due to the fact that the number of vehicles for processing unloading tasks is increased, the sending time delay caused by the fact that the head vehicle unloads the sub tasks to the corresponding sub tasks in the VFC is also increased, and meanwhile due to the fact that the computing resources for processing the tasks are increased, the corresponding computing time delay is reduced. And the vehicles in the VFC arrive and leave randomly, which causes the resources in the VFC to change dynamically.
In resource-limited fleets and VFCs, how to consider the selection of vehicles in the fleet to handle the task, the number of vehicles in the VFC to handle the task, and the random arrival and departure of the vehicles in the VFC, maximizing the long-term return in the system is an important issue. Where the return is related to the transmission delay, the computation delay and the return delay.
At present, no vehicle-mounted fog computing system for optimizing fleet task unloading under the 802.11p DCF protocol is researched.
Disclosure of Invention
Therefore, the technical problem to be solved by the invention is to overcome the problems of limited resources in a fleet, dynamic change of resources in vehicle-mounted fog, diversity of tasks, time delay of task unloading and the like in the prior art.
In order to solve the technical problems, the invention provides a vehicle-mounted fog-assisted vehicle fleet task unloading method based on a semi-Markov decision process, which comprises the following steps:
step S1: defining a state set and an action set of an on-vehicle fog-assisted fleet task unloading system based on a semi-Markov decision process;
step S2: obtaining the current state s and the action a(s) of the system according to the state set and the action set of the system, and calculating the state transition probability of the system according to the current state s and the action a(s) of the system;
step S3: establishing a Bellman equation according to the system reward and the system state transition probability and solving an optimal unloading strategy in the system;
the system reward is the system reward for the system to take action a(s) in state s, expressed as the difference between the immediate gain of the system and the cost consumed during the transition of the system state, calculated as follows:
R(s,a)=U(s,a)-G(s,a)
wherein U (s, a) represents the immediate gain of the system after the system takes action a(s) in state s; g (s, a) represents the consumption of the system during the transition from the current state to the next state.
In one embodiment of the present invention, in step S2, the method for calculating the transition probability includes: and respectively calculating and normalizing the transition probability when the fleet sends a task request, the transition probability when the task processed by the vehicle leaves the system, the transition probability when the task processed by the computing unit in the vehicle-mounted fog leaves the system, the transition probability when the vehicle arrives at the system and the transition probability when the vehicle leaves the system.
In one embodiment of the present invention, in step S3, the expression of the immediate gain is:
Figure BDA0003090433560000041
where eta represents the unit price of time, ElIndicating the time required for processing by the task requester itself, TpRepresenting the transmission time of transmission tasks within a vehicle fleet, each task requiring d computing resources, vehicle V of the vehicle fleetiThe computational resource of (i ═ 1.., N) is fi
Figure BDA0003090433560000042
And the penalty parameter is zeta, and the penalty parameter represents the transmission time for transmitting the task to j calculation units in the vehicle-mounted fog by the head vehicle for common processing.
In one embodiment of the present invention, the T ispAnd
Figure BDA0003090433560000043
having the same formula of calculation, using TtrRepresents:
Ttr=θ·Etr·Tslot
where θ represents the number of tasks that the vehicle needs to transmit, and its value is constant at 1 in the fleet, and in the network consisting of the first vehicle and the vehicle-mounted fog, its value depends on the decision of the system, when
Figure BDA0003090433560000044
Dividing the task into j subtasks with the same size by the head vehicle and respectively transmitting the j subtasks to j vehicles in the vehicle-mounted fog, wherein the value of theta is j; etrRepresenting the average number of time slots required for transmitting the tasks; t isslotRepresenting the average duration of each slot.
In one embodiment of the invention, the average duration T of each time slotslotThe expression is as follows:
Tslot=qidle·Tidle+qs·Ts+qc·Tc
wherein
Figure BDA0003090433560000045
Indicating the probability that the time slot is free,
Figure BDA0003090433560000046
indicating the probability that the slot is in successful transmission of data, qc=1-qidle-qsIndicates the probability, T, that the slot is in data collisionidleIndicating the duration, T, of each free time slots=Header+E[P]The value of/theta + SIFS + delta + ACK + DIFS + delta indicates that the time slot is in a data transmission state successfullyDuration of (D), Tc=Header+E[P]The/theta + SIFS + delta + ACKtimeout represents the time length of the time slot in the data collision state; wherein Header is PHYh+MAChIndicating the header length of the data packet, wherein the value of the header length of the data packet is equal to the sum of the header lengths of the physical layer and the medium access layer of the data packet; e [ P ]]Representing the length of the task; δ represents the propagation delay; SIFS, ACK, and DIFS indicate the lengths of SIFS, ACK, and DIFS, respectively. And is
Figure BDA0003090433560000051
Figure BDA0003090433560000052
Indicating the probability of a collision occurring with the vehicle transmitting data.
In one embodiment of the invention, the average number of time slots E required for the transmission of a tasktrThe expression of (a) is:
Figure BDA0003090433560000053
where m is the number of retransmissions, q is the collision probability, WminIs the minimum contention window.
In one embodiment of the present invention, in step S3, the consumption of the system is:
Figure BDA0003090433560000054
where α represents the continuous time discounting factor, C (s, a) is the number of vehicles in the system that are processing the task after taking action in the current state, and its expression is:
Figure BDA0003090433560000055
β (x, a) represents the sum of all event arrival rates at the next moment in the system, and the calculation expression of β (x, a) is:
Figure BDA0003090433560000061
in one embodiment of the present invention, the bellman optimal equation is expressed as:
Figure BDA0003090433560000062
wherein
Figure BDA0003090433560000063
Represents the normalized system award after the normalization,
Figure BDA0003090433560000064
the normalized discount factor is represented by the number of discount units,
Figure BDA0003090433560000065
expressing the normalized transition probability, introducing a constant, wherein N is the number of vehicles in the fleet, and the arrival rate of the event A at the next moment is NlambdapEvent F+1And F-1Respectively, is lambdavAnd muv,fiFor vehicles V in a vehicle fleetiM represents the number of computing units in the vehicle-mounted fog, NRRepresenting the number of computing units to which a single task can be maximally allocated in the on-board fog, fvIs the computational resource of each vehicle in the vehicle fog, and d is the computational resource required for each computational task.
In an embodiment of the present invention, the solving of the optimal unloading strategy in the vehicle-mounted fog-assisted vehicle fleet task unloading system includes: and setting a positive number epsilon, iterating each state in the system based on a Bellman optimal equation until the difference value of the absolute value of the state cost function of each system is smaller than a set value after two adjacent iterations, and stopping iteration.
In an embodiment of the present invention, the optimal offload policy calculation expression is:
Figure BDA0003090433560000071
compared with the prior art, the technical scheme of the invention has the following advantages:
the invention provides a vehicle-mounted fog-assisted vehicle fleet task unloading scheme based on a semi-Markov decision process, aiming at the problems of limited vehicle fleet internal resources, dynamic change of vehicle-mounted fog internal resources, diversity of tasks, task unloading time delay and the like. Firstly, an IEEE 802.11p protocol is used for transmitting tasks, and factors such as sending time delay, calculation time delay and the like in task unloading are considered, so that a task unloading model based on a semi-Markov decision process is established. Then, a system state set and an action set are respectively defined, a system state transition probability formula and a system reward function are deduced, and then an SMDP model is solved by using a value iterative algorithm based on a Bellman equation to obtain an optimal task unloading strategy. The method has moderate computational complexity and reasonable system model, and fully considers how the tasks are distributed and various time delays involved in the task unloading process. Simulation results show that the scheme can obtain larger long-term benefits of the system on the premise of ensuring the task unloading time delay.
Drawings
In order that the present disclosure may be more readily and clearly understood, reference is now made to the following detailed description of the embodiments of the present disclosure taken in conjunction with the accompanying drawings, in which
FIG. 1 is a system block diagram.
Fig. 2 is a graph of the long term benefit obtained for the inventive scheme compared to other schemes (λ p ═ 20).
Fig. 3 is a graph of the long term benefit obtained with the inventive scheme compared to other schemes (λ p ═ 13).
Detailed Description
The present invention is further described below in conjunction with the following figures and specific examples so that those skilled in the art may better understand the present invention and practice it, but the examples are not intended to limit the present invention.
The invention discloses a vehicle-mounted fog-assisted vehicle fleet task unloading method based on a semi-Markov decision process, which comprises the following steps of:
step S1: a state set of the system is first defined. In the semi-Markov decision model, the system state comprises the processing task state of the vehicles in the fleet, the number of tasks processed by different numbers of vehicles in the vehicle-mounted fog computing system, the total number of vehicles in the vehicle-mounted fog and the current event. Figure 1 gives a system framework. The state set of the system is represented as follows:
Figure BDA0003090433560000081
wherein n isiIndicating vehicle Vi(i ═ 1.. multidata., N) number of tasks processed, Ni0 denotes a vehicle ViFor free vehicles, ni1 denotes a vehicle ViA task is being processed; b isjIndicating the number of tasks commonly handled by the j computing units in the on-board fog, NRThe number of computing units to which a single task can be allocated at most in the vehicle-mounted fog is represented; m represents the number of computing units in the vehicle-mounted fog and satisfies
Figure BDA0003090433560000082
e represents the currently occurring event.
Step S2: a set of actions of the system is defined. From the collection
Figure BDA0003090433560000083
The relationship between the event and the action is expressed as follows:
Figure BDA0003090433560000084
where-1 indicates that the current event is by vehicle ViA processed task leaving system, a task leaving system processed by j computing units in the vehicle-mounted fog, and a vehicle ViArrival and vehicle ViWhen leaving, the system takes no task action; b represents events for generating computational tasks for a fleet of vehiclesA task, and system computing resources are scarce, the system refuses to process the task;
Figure BDA0003090433560000085
indicating fleet production of computing tasks, the system assigns the tasks to fleet vehicles ViPerforming intermediate treatment;
Figure BDA0003090433560000086
indicating that the system transmits the task to j calculation units in the vehicle-mounted fog for processing.
Figure BDA0003090433560000087
Representing a set of events, N being the number of vehicles in the fleet, A representing the calculation tasks issued by the vehicles in the fleet, DiRepresenting vehicles V in a fleetiThe task of (i ═ 1.. multidot.n) processing is completed, Lj(j=1,...,NR) Indicating that the tasks jointly processed by the j computing units in the on-board fog leave the system, F+1And F-1Respectively, vehicle arrival and departure from on-board fog.
Step S3: system state transition probabilities are defined. In the semi-Markov decision process, the transition probability P (x | s, a) is calculated in five cases based on the current state s and the action a(s). Wherein the values of action a(s) are selected from the set
Figure BDA0003090433560000088
And (4) selecting. The specific expression is as follows:
(1)
Figure BDA0003090433560000091
i.e. the fleet issues a task request.
The first case is when the fleet issues a task request and the system assigns the task to a fleet vehicle ViAt this time, the total number of vehicles in the vehicle fleet is unchanged, and j (j 1.., N.) is added to the vehicle fogR) The number of tasks processed by the computing units is not changed, and the arrival rate of the computing tasks sent by the vehicles in the event fleet at the next moment of the event A is NlambdapWhere N is the number of vehicles in the fleet, λpIs the task arrival rate. Event LjHas an arrival rate of Bj·jfvAnd d. Wherein B isjFor the number of tasks collectively processed by the j computing units in the on-vehicle fog, fvIs the computational resource of each vehicle in the vehicle fog, and d is the computational resource required for each computational task. Event F+1And F-1Respectively, is lambdavAnd muv. The arrival rate of the mission leaving the system handled by the event fleet vehicle needs to be discussed in two cases. Where E represents what happens at the next moment.
1)
Figure BDA0003090433560000092
The current event is a task generated by the motorcade and processed by the ith vehicle of the motorcade, then the vehicle ViThe number of tasks processed is increased by one, so at the next instant the event vehicle ViThe arrival rate of processing tasks leaving the system is (n)i+1)·fiAnd d. Wherein f isiFor vehicles V in a vehicle fleetiThe size of the computing resource of (2).
2)
Figure BDA0003090433560000093
The current event is a task generated by the motorcade and processed by the ith vehicle of the motorcade, then the vehicle ViThe number of the processed tasks is increased by one, and the next event is the departure of the task processed by the kth vehicle in the motorcade, because the vehicle VkIf the number of the processed tasks is not changed, the event D is carried outkHas an arrival rate of nk·fk/d。
The second situation is that when the fleet sends out task request and the system distributes the task to j vehicles in the vehicle fog for common processing, the total number of vehicles in the fleet is not changed and the number of processed tasks is not changed, the arrival rate of the event A at the next moment is NlambdapEvent DiAn arrival rate of (i ═ 1.., N) is Ni·fiAnd d. Event F+1And F-1Respectively, is lambdavAnd muv. The event is given by j (j 1.., N) in the vehicle fogR) Tasks processed by a plurality of computing unitsThe rate of arrival of the departure needs to be discussed in two cases.
1)
Figure BDA0003090433560000094
This indicates that the system distributes tasks to j vehicles in the on-vehicle fog for common processing, and the number of tasks commonly processed by j vehicles in the on-vehicle fog is increased by one, so that the event L is processed at the next timejHas an arrival rate of (B)j+1)·jfv/d。
2)
Figure BDA0003090433560000101
This case indicates that the event occurring at the next time is a departure of a task commonly handled by m vehicles in the on-vehicle fog, and since the number of tasks commonly handled by m vehicles is not changed, the event L is generated at the next timemHas an arrival rate of Bm·mfv/d。
In summary, given a system state
Figure BDA0003090433560000102
And an act a(s), where the expression of the system state transition probability is:
Figure BDA0003090433560000111
(2)
Figure BDA0003090433560000112
i.e. vehicle ViThe processed task leaves the system, and at the moment, the expression of the system state transition probability is as follows:
Figure BDA0003090433560000121
(3)
Figure BDA0003090433560000122
i.e. j in the vehicle fogThe tasks processed by the computing units jointly leave the system, and the expression of the system state transition probability is as follows:
Figure BDA0003090433560000123
(4)
Figure BDA0003090433560000124
i.e. vehicle ViAnd when the system arrives, the expression of the system state transition probability is as follows:
Figure BDA0003090433560000131
(5)
Figure BDA0003090433560000132
i.e. vehicle ViLeaving the system, wherein the expression of the system state transition probability is as follows:
Figure BDA0003090433560000133
where β (x, a) represents the sum of all event arrival rates at the next time in the system. The transition probability of an event is the ratio of the arrival rate of the event at the next time to the arrival rates of all events at the next time. In summary, under different system states s and actions a(s), the calculation expression of β (x, a) is:
Figure BDA0003090433560000141
step S4: a reward model for the system is defined. The system reward for taking action a(s) at state s may be expressed as the difference between the immediate gain of the system and the cost expended during the transition to the system state. Is represented as follows:
R(s,a)=U(s,a)-G(s,a)
where U (s, a) represents the immediate gain of the system after taking action a(s) in the current state; g (s, a) represents the consumption of the system during the transition from the current state to the next state.
The calculation of the immediate gain can be divided into the following:
(1) when the value of e is equal to that of A,
Figure BDA0003090433560000142
1, when N:
vehicle V when requesting a vehicle to issue a computational task and the system decides to offload it to the fleetiWhen the vehicle is requested to carry out in-line communication through the DCF function of the 802.11p protocol during processing, the calculation task is transmitted to the specified vehicle, and the immediate gain of the system is eta (E)l-Tp-d/fi). Where eta represents the unit price of time, ElIndicating the time required for processing by the task requester itself, TpRepresenting the transmission time of transmission tasks within the fleet, d is the computational resource required for each computational task issued by a requesting vehicle, V is the fleet vehicleiThe computational resource of (i ═ 1.., N) is fi
(2) When the value of e is equal to that of A,
Figure BDA0003090433560000143
j=1,...,NRthe method comprises the following steps:
when the system determines that j calculation units in the vehicle-mounted fog process the tasks, the vehicle is requested to transmit the tasks to a head vehicle through the communication in a vehicle fleet, the head vehicle divides the tasks into j subtasks with the same size, and then the subtasks are respectively transmitted to the j calculation units in the vehicle-mounted fog, so that the immediate gain of the system is equal to
Figure BDA0003090433560000151
Wherein
Figure BDA0003090433560000152
And the transmission time for transmitting the task to j calculation units in the vehicle-mounted fog by the head vehicle for processing together is shown.
(3) When e is a, a is b:
when the fleet of vehicles sends out a calculation task and the calculation resources in the system are scarce, the system refuses to process the task and is punished, and the punishment parameter is zeta.
(4) When in use
Figure BDA0003090433560000153
The method comprises the following steps:
when the task processed by the vehicle in the fleet leaves the vehicle, the task processed by the vehicle in the vehicle-mounted fog leaves and the vehicle arrives, the system does not take the task action, and the immediate benefit is zero.
(5) When e ═ F-1,
Figure BDA0003090433560000154
The method comprises the following steps:
when the idle vehicle leaves in the vehicle-mounted fog, the system does not take task action, and the immediate benefit is zero.
(6) When e ═ F-1,
Figure BDA0003090433560000155
The method comprises the following steps:
when a vehicle in the vehicle fog which is processing a task leaves, the task processing fails, the system is punished, and the parameter is zeta.
In summary, the expression of the system immediate gain is:
Figure BDA0003090433560000156
transmission time delay TpAnd
Figure BDA0003090433560000157
further determinations are needed. Because the vehicles in the fleet adopt the DCF function of the 802.11p protocol to transmit tasks in a unicast mode, and the network formed by the head vehicle of the fleet and the mobile vehicle-mounted fog also adopts the DCF function of the 802.11p protocol to transmit tasks in a unicast mode, only the number of the vehicles in the two networks is different, the transmission delay T ispAnd
Figure BDA0003090433560000161
has the advantages ofThe same calculation formula. With NtrIndicating the number of vehicles in the network, N for a fleet networktrA total number of vehicles N equal to the number of fleets; and for the network consisting of the first vehicle and the vehicle-mounted fog, the value is M + 1. Next, an expression of the network transmission delay will be derived, for convenience, using TtrRepresenting the propagation delay, but the formula applies to the propagation delay TpAnd
Figure BDA0003090433560000162
network transmission delay TtrThe expression of (a) is:
Ttr=θ·Etr·Tslot
wherein theta represents the number of tasks required to be transmitted by the vehicle, and the value of theta is constant to be 1 in the vehicle fleet; in the network formed by the first vehicle and the vehicle-mounted fog, the value depends on the decision of the system
Figure BDA0003090433560000163
The head vehicle divides the task into j subtasks with the same size and respectively transmits the j subtasks to j vehicles in the vehicle-mounted fog, so that the value of theta is j; etrRepresenting the average number of time slots required for transmitting the tasks; t isslotRepresenting the average duration of each slot.
Average duration per time slot TslotThe expression is as follows:
Tslot=qidle·Tidle+qs·Ts+qc·Tc
wherein
Figure BDA0003090433560000164
Indicating the probability that the time slot is free,
Figure BDA0003090433560000165
indicating the probability that the slot is in successful transmission of data, qc=1-qidle-qsIndicates the probability, T, that the slot is in data collisionidleIndicating the duration, T, of each free time slots=Header+E[P]/θ+SIFS+δ+ACK+DIFS+ δ represents the duration of the time slot in the successful transmission data state, Tc=Header+E[P]The/θ + SIFS + δ + ACKtimeout represents the duration of the slot in the data collision state. Wherein Header is PHYh+MAChIndicating the header length of the data packet, wherein the value of the header length of the data packet is equal to the sum of the header lengths of the physical layer and the medium access layer of the data packet; e [ P ]]Representing the length of the task; δ represents the propagation delay; SIFS, ACK, and DIFS indicate the lengths of SIFS, ACK, and DIFS, respectively. And is
Figure BDA0003090433560000166
Figure BDA0003090433560000167
Indicating the probability of a collision occurring with the vehicle transmitting data.
Average number of time slots E required for transmission of taskstrThe expression of (a) is:
Figure BDA0003090433560000171
where m is the number of retransmissions, q is the collision probability, WminIs the minimum contention window.
The system consumption G (s, a) is:
Figure BDA0003090433560000172
where α represents a continuous time discount factor, C (s, a) is the number of vehicles in the system that are processing the task after taking action in the current state, and the calculation expression is:
Figure BDA0003090433560000173
step S5: and solving an optimal unloading strategy in the vehicle-mounted fog-assisted vehicle fleet task unloading system. And solving the optimal unloading scheme by adopting a value iterative algorithm according to the Bellman equation.
The bellman optima equation can be expressed as:
Figure BDA0003090433560000174
wherein
Figure BDA0003090433560000175
Represents the normalized system award after the normalization,
Figure BDA0003090433560000176
the normalized discount factor is represented by the number of discount units,
Figure BDA0003090433560000177
and (4) expressing the normalized transition probability and introducing a constant.
And setting a very small positive number epsilon, iterating each state in the system based on the above Bellman optimal equation formula until the difference of the absolute value of the state cost function of each system is less than epsilon (1-gamma)/2 gamma after two adjacent iterations, and stopping iteration. At this time, each system state obtains the optimal strategy pi according to the following formula*(s)。
Figure BDA0003090433560000181
The value iteration algorithm yields the pseudo-code as follows:
algorithm 1-value iterative algorithm
Step 1: for each system state S ∈ S, a state value function is initialized
Figure BDA0003090433560000182
Setting a very small positive number epsilon, and setting the iteration number k to be 0;
step 2: for each system state, a state cost function is calculated using the Bellman equation
Figure BDA0003090433560000183
Namely, it is
Figure BDA0003090433560000184
Step 3: if S ∈ S is satisfied for each system state
Figure BDA0003090433560000185
Entering Step 4; otherwise, adding 1 to the iteration number, namely k is k +1, and entering Step 2;
step 4: for each system state S E S, calculating an optimal unloading strategy pi*(s) by the formula
Figure BDA0003090433560000186
The convergence error in the value iteration algorithm is:
Figure BDA0003090433560000187
fig. 2 and fig. 3 verify that the unloading scheme of the present invention has better performance than the unloading scheme based on the greedy algorithm under different task arrival rates, that is, the scheme of the present invention obtains more long-term benefits of the system.
According to the method, firstly, the tasks are transmitted by using an IEEE 802.11p protocol, and meanwhile, factors such as sending time delay, calculation time delay and the like in task unloading are considered, so that a task unloading model based on a semi-Markov decision process is established. Then, a system state set and an action set are respectively defined, a system state transition probability formula and a system reward function are deduced, and then an SMDP model is solved by using a value iterative algorithm based on a Bellman equation to obtain an optimal task unloading strategy. The method has moderate computational complexity and reasonable system model, and fully considers how the tasks are distributed and various time delays involved in the task unloading process. Simulation results show that the scheme can obtain larger long-term benefits of the system on the premise of ensuring the task unloading time delay.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
It should be understood that the above examples are only for clarity of illustration and are not intended to limit the embodiments. Other variations and modifications will be apparent to persons skilled in the art in light of the above description. And are neither required nor exhaustive of all embodiments. And obvious variations or modifications of the invention may be made without departing from the spirit or scope of the invention.

Claims (10)

1. The vehicle-mounted fog-assisted vehicle fleet task unloading method based on the semi-Markov decision process is characterized by comprising the following steps of:
step S1: defining a state set and an action set of an on-vehicle fog-assisted fleet task unloading system based on a semi-Markov decision process;
step S2: obtaining the current state s and the action a(s) of the system according to the state set and the action set of the system, and calculating the state transition probability of the system according to the current state s and the action a(s) of the system;
step S3: establishing a Bellman equation according to the system reward and the system state transition probability and solving an optimal unloading strategy in the system;
the system reward is the system reward for the system to take action a(s) in state s, expressed as the difference between the immediate gain of the system and the cost consumed during the transition of the system state, calculated as follows:
R(s,a)=U(s,a)-G(s,a)
wherein U (s, a) represents the immediate gain of the system after the system takes action a(s) in state s; g (s, a) represents the consumption of the system during the transition from the current state to the next state.
2. The vehicle-mounted fog-aided fleet task offloading method based on semi-markov decision process of claim 1, wherein in step S2, the transition probability is calculated by: and respectively calculating and normalizing the transition probability when the fleet sends a task request, the transition probability when the task processed by the vehicle leaves the system, the transition probability when the task processed by the computing unit in the vehicle-mounted fog leaves the system, the transition probability when the vehicle arrives at the system and the transition probability when the vehicle leaves the system.
3. The semi-markov decision process vehicle fog assisted fleet task offloading method of claim 1, wherein in step S3, said immediate gain is expressed as:
Figure FDA0003090433550000021
where eta represents the unit price of time, ElIndicating the time required for processing by the task requester itself, TpRepresenting the transmission time of transmission tasks within a vehicle fleet, each task requiring d computing resources, vehicle V of the vehicle fleetiThe computational resource of (i ═ 1.., N) is fi
Figure FDA0003090433550000022
And the penalty parameter is zeta, and the penalty parameter represents the transmission time for transmitting the task to j calculation units in the vehicle-mounted fog by the head vehicle for common processing.
4. The semi-Markov decision process vehicle fog-assisted fleet task off-loading method of claim 3, wherein T ispAnd
Figure FDA0003090433550000023
having the same formula of calculation, using TtrRepresents:
Ttr=θ·Etr·Tslot
where θ represents the number of tasks that the vehicle needs to transmit, and its value is constant at 1 in the fleet, and in the network consisting of the first vehicle and the vehicle-mounted fog, its value depends on the decision of the system, when
Figure FDA0003090433550000024
Dividing the task into j subtasks with the same size by the head vehicle and respectively transmitting the j subtasks to j vehicles in the vehicle-mounted fog, wherein the value of theta is j; etrPresentation renderingThe average time slot number required by the task transmission; t isslotRepresenting the average duration of each slot.
5. The semi-Markov decision process vehicle fog-assisted fleet task offloading method of claim 4, wherein the average duration T of each timeslotslotThe expression is as follows:
Tslot=qidle·Tidle+qs·Ts+qc·Tc
wherein
Figure FDA0003090433550000025
Indicating the probability that the time slot is free,
Figure FDA0003090433550000026
indicating the probability that the slot is in successful transmission of data, qc=1-qidle-qsIndicates the probability, T, that the slot is in data collisionidleIndicating the duration, T, of each free time slots=Header+E[P]Theta + SIFS + delta + ACK + DIFS + delta represents the duration of the time slot in the state of successfully transmitting data, Tc=Header+E[P]The/theta + SIFS + delta + ACKtimeout represents the time length of the time slot in the data collision state; wherein Header is PHYh+MAChIndicating the header length of the data packet, wherein the value of the header length of the data packet is equal to the sum of the header lengths of the physical layer and the medium access layer of the data packet; e [ P ]]Representing the length of the task; δ represents the propagation delay; SIFS, ACK, and DIFS indicate the lengths of SIFS, ACK, and DIFS, respectively. And is
Figure FDA0003090433550000031
Figure FDA0003090433550000032
Indicating the probability of a collision occurring with the vehicle transmitting data.
6. The semi-Markov decision process vehicle fog-assisted fleet vehicle driver of claim 4Traffic offloading method, characterized in that the average number of timeslots E required for the transmission of a tasktrThe expression of (a) is:
Figure FDA0003090433550000033
where m is the number of retransmissions, q is the collision probability, WminIs the minimum contention window.
7. The semi-markov decision process vehicle fog assisted fleet task offloading method of claim 1, wherein in step S3, the system consumption is:
Figure FDA0003090433550000034
where α represents the continuous time discounting factor, C (s, a) is the number of vehicles in the system that are processing the task after taking action in the current state, and its expression is:
Figure FDA0003090433550000035
β (x, a) represents the sum of all event arrival rates at the next moment in the system, and the calculation expression of β (x, a) is:
Figure FDA0003090433550000041
8. the semi-markov decision process vehicle fog-assisted fleet task off-loading method of claim 7, wherein the bellman optimal equation is expressed as:
Figure FDA0003090433550000042
wherein
Figure FDA0003090433550000043
Represents the normalized system award after the normalization,
Figure FDA0003090433550000044
the normalized discount factor is represented by the number of discount units,
Figure FDA0003090433550000045
expressing the normalized transition probability, introducing a constant, wherein N is the number of vehicles in the fleet, and the arrival rate of the event A at the next moment is NlambdapEvent F+1And F-1Respectively, is lambdavAnd muv,fiFor vehicles V in a vehicle fleetiM represents the number of computing units in the vehicle-mounted fog, NRRepresenting the number of computing units to which a single task can be maximally allocated in the on-board fog, fvIs the computational resource of each vehicle in the vehicle fog, and d is the computational resource required for each computational task.
9. The semi-markov decision process vehicle fog assisted fleet task offloading method according to claim 1 or 8, wherein said solving for an optimal offloading strategy in a vehicle fog assisted fleet task offloading system comprises: and setting a positive number epsilon, iterating each state in the system based on a Bellman optimal equation until the difference value of the absolute value of the state cost function of each system is smaller than a set value after two adjacent iterations, and stopping iteration.
10. The semi-markov decision process vehicle-borne fog-aided fleet task offloading method of claim 9, wherein the optimal offloading policy calculation expression is:
Figure FDA0003090433550000051
CN202110594462.0A 2021-05-28 2021-05-28 Vehicle-mounted fog-assisted vehicle fleet task unloading method based on semi-Markov decision process Active CN113326076B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110594462.0A CN113326076B (en) 2021-05-28 2021-05-28 Vehicle-mounted fog-assisted vehicle fleet task unloading method based on semi-Markov decision process

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110594462.0A CN113326076B (en) 2021-05-28 2021-05-28 Vehicle-mounted fog-assisted vehicle fleet task unloading method based on semi-Markov decision process

Publications (2)

Publication Number Publication Date
CN113326076A true CN113326076A (en) 2021-08-31
CN113326076B CN113326076B (en) 2022-10-18

Family

ID=77422288

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110594462.0A Active CN113326076B (en) 2021-05-28 2021-05-28 Vehicle-mounted fog-assisted vehicle fleet task unloading method based on semi-Markov decision process

Country Status (1)

Country Link
CN (1) CN113326076B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109656703A (en) * 2018-12-19 2019-04-19 重庆邮电大学 A kind of mobile edge calculations auxiliary vehicle task discharging method
CN109831522A (en) * 2019-03-11 2019-05-31 西南交通大学 A kind of vehicle connection cloud and mist system dynamic resource Optimal Management System and method based on SMDP
CN110489218A (en) * 2019-07-26 2019-11-22 江南大学 Vehicle-mounted mist computing system task discharging method based on semi-Markovian decision process
CN111611063A (en) * 2020-05-27 2020-09-01 江南大学 Cloud-aware mobile fog computing system task unloading method based on 802.11p

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109656703A (en) * 2018-12-19 2019-04-19 重庆邮电大学 A kind of mobile edge calculations auxiliary vehicle task discharging method
CN109831522A (en) * 2019-03-11 2019-05-31 西南交通大学 A kind of vehicle connection cloud and mist system dynamic resource Optimal Management System and method based on SMDP
CN110489218A (en) * 2019-07-26 2019-11-22 江南大学 Vehicle-mounted mist computing system task discharging method based on semi-Markovian decision process
CN111611063A (en) * 2020-05-27 2020-09-01 江南大学 Cloud-aware mobile fog computing system task unloading method based on 802.11p

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
QIONGWU等: "A Task Offloading Scheme in Vehicular Fog and Cloud Computing System", 《 IEEE ACCESS》 *
QIONGWU等: "Delay-Sensitive Task Offloading in the 802.11p-Based Vehicular Fog Computing Systems", 《IEEE INTERNET OF THINGS JOURNAL》 *
董晓丹 等: "车载云计算系统中资源分配的优化方法", 《中国电子科学研究院学报》 *

Also Published As

Publication number Publication date
CN113326076B (en) 2022-10-18

Similar Documents

Publication Publication Date Title
CN110928658A (en) Cooperative task migration system and algorithm of vehicle-side cloud cooperative architecture
CN111464976A (en) Vehicle task unloading decision and overall resource allocation method based on fleet
CN111831427B (en) Distributed inter-vehicle task unloading method based on mobile edge calculation
CN110650457B (en) Joint optimization method for task unloading calculation cost and time delay in Internet of vehicles
CN113316116B (en) Vehicle calculation task unloading method
CN115297171B (en) Edge computing and unloading method and system for hierarchical decision of cellular Internet of vehicles
CN116204315A (en) Track-based dynamic task unloading method for vehicle resource pool in Internet of vehicles
CN112115505A (en) New energy automobile charging station charging data transmission method based on mobile edge calculation and block chain technology
CN116566838A (en) Internet of vehicles task unloading and content caching method with cooperative blockchain and edge calculation
CN113709249B (en) Safe balanced unloading method and system for driving assisting service
CN116030623A (en) Collaborative path planning and scheduling method based on blockchain in cognitive Internet of vehicles scene
Hazarika et al. Multi-agent DRL-based computation offloading in multiple RIS-aided IoV networks
CN111311091B (en) Expressway task detection and scheduling method and system based on vehicle-mounted cloud and unmanned aerial vehicle
CN113326076B (en) Vehicle-mounted fog-assisted vehicle fleet task unloading method based on semi-Markov decision process
CN115550357A (en) Multi-agent multi-task cooperative unloading method
CN115964178A (en) Internet of vehicles user computing task scheduling method and device and edge service network
CN113207085B (en) MEC (Meter coordinated Circuit) assisted speed self-adaptive access method for fleet network
CN114928611A (en) Internet of vehicles energy-saving calculation unloading optimization method based on IEEE802.11p protocol
CN115118783A (en) Task unloading method based on heterogeneous communication technology ultra-reliable low-delay reinforcement learning
CN112750298A (en) Truck formation dynamic resource allocation method based on SMDP and DRL
CN115348560B (en) Task processing method based on NOMA communication in Internet of vehicles scene
CN115065683B (en) Vehicle edge network task allocation and unloading method based on vehicle clustering
CN115080249B (en) Vehicle networking multidimensional resource allocation method and system based on federal learning
Wang et al. Joint Optimization for MEC Computation Offloading and Resource Allocation in IoV Based on Deep Reinforcement Learning
CN115567979A (en) Efficient computation unloading method based on resources and time in VEC

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant