CN114116209A - Spectrum map construction and distribution method and system based on deep reinforcement learning - Google Patents

Spectrum map construction and distribution method and system based on deep reinforcement learning Download PDF

Info

Publication number
CN114116209A
CN114116209A CN202111341780.2A CN202111341780A CN114116209A CN 114116209 A CN114116209 A CN 114116209A CN 202111341780 A CN202111341780 A CN 202111341780A CN 114116209 A CN114116209 A CN 114116209A
Authority
CN
China
Prior art keywords
spectrum map
strategy
network
cognitive user
reinforcement learning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111341780.2A
Other languages
Chinese (zh)
Inventor
周力
刘兴光
谭翔
魏急波
赵海涛
熊俊
高文颖
黄圣春
张姣
曹阔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National University of Defense Technology
Original Assignee
National University of Defense Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National University of Defense Technology filed Critical National University of Defense Technology
Priority to CN202111341780.2A priority Critical patent/CN114116209A/en
Publication of CN114116209A publication Critical patent/CN114116209A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5072Grid computing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/50Indexing scheme relating to G06F9/50
    • G06F2209/502Proximity

Abstract

The invention discloses a method and a system for building and distributing a frequency spectrum map based on deep reinforcement learning. The invention provides a method for constructing and distributing a frequency spectrum map with an expandable network scale in a mobile edge network scene, which provides a real-time high-precision frequency spectrum map service for network cognitive users by using the minimum frequency spectrum data load and the minimum calculation overhead, and effectively improves the frequency spectrum efficiency and the robustness of a mobile edge network.

Description

Spectrum map construction and distribution method and system based on deep reinforcement learning
Technical Field
The invention relates to the technical field of wireless communication networks, in particular to a method and a system for building and distributing a frequency spectrum map based on deep reinforcement learning.
Background
In the mobile edge network, cognitive users determine own unloading strategies according to perceived available spectrum information and own computing resources, single agents or uncooperative multiple agents do not cooperate or exchange respective operations, frequency conflicts can be generated among the cognitive users in the process of building and distributing a mobile edge network spectrum map, or an edge server is overloaded, so that network energy consumption is increased and user service quality is reduced.
Disclosure of Invention
The invention provides a spectrum map construction and distribution method and system based on deep reinforcement learning, which are used for overcoming the defects that single agents or uncooperative multiple agents do not cooperate or exchange respective operations and the like in the prior art.
In order to achieve the above object, the present invention provides a spectrum map construction and distribution method based on deep reinforcement learning, which includes the following steps:
modeling the construction and distribution problems of the spectrum map in the mobile edge network into a calculation and communication compromise model, and constructing a centralized training and distributed execution reinforcement learning framework; the reinforcement learning framework comprises an offline training module and an online execution module;
acquiring available bandwidth information according to the self frequency spectrum sensing capability of the cognitive user;
according to the available bandwidth information and the computing power of the cognitive user terminal, resource allocation is carried out on the bandwidth of the mobile edge network and the computing power of the edge server by using an offline training module, and an unloading strategy is selected; the unloading strategies comprise a full unloading strategy, a partial unloading strategy and a local computing strategy;
through a trained online execution module, performing data distribution, unloading and calculation of the cognitive user by using a selected unloading strategy, and constructing a frequency spectrum map step by step;
and monitoring available bandwidth information perceived by a cognitive user in real time, and when the change of the available bandwidth information is monitored, retraining the online execution module by using the offline training module, and determining an unloading strategy of a new environment so as to adapt to a complex and changeable communication environment.
In order to achieve the above object, the present invention further provides a spectrum map construction and distribution system based on deep reinforcement learning, including:
the model building module is used for modeling the building and distribution problems of the spectrum map in the mobile edge network into a calculation and communication compromise model and building a centralized training and distributed execution reinforcement learning framework; the reinforcement learning framework comprises an offline training module and an online execution module;
the frequency spectrum sensing module is used for acquiring available bandwidth information according to the self frequency spectrum sensing capability of the cognitive user;
the resource allocation and strategy selection module is used for allocating resources to the bandwidth of the mobile edge network and the computing capacity of the edge server by utilizing the offline training module according to the available bandwidth information and the computing capacity of the cognitive user terminal and selecting an unloading strategy; the unloading strategies comprise a full unloading strategy, a partial unloading strategy and a local computing strategy;
the spectrum map building module is used for distributing, unloading and calculating data of the cognitive user by using the selected unloading strategy through the trained online execution module, and building a spectrum map step by step;
and the real-time monitoring module is used for monitoring the available bandwidth information sensed by the cognitive user in real time, and when the change of the available bandwidth information is monitored, the offline training module is used for retraining the online execution module and determining an unloading strategy of a new environment so as to be self-adaptive to the complex and changeable communication environment.
To achieve the above object, the present invention further provides a computer device, which includes a memory and a processor, wherein the memory stores a computer program, and the processor implements the steps of the method when executing the computer program.
To achieve the above object, the present invention further proposes a computer-readable storage medium having a computer program stored thereon, which, when being executed by a processor, implements the steps of the above method.
Compared with the prior art, the invention has the beneficial effects that:
the invention provides a spectrum map construction and distribution method based on deep reinforcement learning, which models the problems of joint unloading and resource management in a mobile edge network into a calculation and communication compromise model in the mobile edge network, and constructs a reinforcement learning framework with centralized training and distributed execution, wherein the reinforcement learning framework comprises an offline training module and an online execution module, the online execution module carries out step-by-step construction of the spectrum map by utilizing a learned unloading strategy, and the offline training module dynamically updates the online execution module according to the unloading calculation result of a cognitive user. The invention provides a method for constructing and distributing a frequency spectrum map with an expandable network scale in a mobile edge network scene, which provides a real-time high-precision frequency spectrum map service for network cognitive users by using the minimum frequency spectrum data load and the minimum calculation overhead, and effectively improves the frequency spectrum efficiency and the robustness of a mobile edge network.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the structures shown in the drawings without creative efforts.
FIG. 1 is a schematic diagram of a centralized training, distributed execution reinforcement learning framework according to the present invention;
fig. 2 is a schematic view of an unloading strategy in the spectrum map construction and distribution method based on deep reinforcement learning according to the present invention.
The implementation, functional features and advantages of the objects of the present invention will be further explained with reference to the accompanying drawings.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In addition, the technical solutions in the embodiments of the present invention may be combined with each other, but it must be based on the realization of those skilled in the art, and when the technical solutions are contradictory or cannot be realized, such a combination of technical solutions should not be considered to exist, and is not within the protection scope of the present invention.
The invention provides a spectrum map construction and distribution method based on deep reinforcement learning, which comprises the following steps:
101: modeling the construction and distribution problem of the spectrum map in the mobile edge network into a calculation and communication compromise model, and constructing a centralized training and distributed execution reinforcement learning framework (as shown in figure 1); the reinforcement learning framework comprises an offline training module and an online execution module;
102: acquiring available bandwidth information according to the self frequency spectrum sensing capability of the cognitive user;
103: according to the available bandwidth information and the computing power of the cognitive user terminal, resource allocation is carried out on the bandwidth of the mobile edge network and the computing power of the edge server by using an offline training module, and an unloading strategy is selected; the unloading strategies comprise a full unloading strategy, a partial unloading strategy and a local computing strategy;
104: through a trained online execution module, performing data distribution, unloading and calculation of the cognitive user by using a selected unloading strategy, and constructing a frequency spectrum map step by step;
105: and monitoring available bandwidth information perceived by a cognitive user in real time, and when the change of the available bandwidth information is monitored, retraining the online execution module by using the offline training module, and determining an unloading strategy of a new environment so as to adapt to a complex and changeable communication environment.
The invention models the combined unloading and resource management problem in the mobile edge network into a calculation and communication compromise model in the mobile edge network, and constructs a centralized training and distributed execution reinforcement learning framework, wherein the reinforcement learning framework comprises an offline training module and an online execution module, the online execution module utilizes the learned unloading strategy to construct a spectrum map step by step, and the offline training module dynamically updates the online execution module according to the unloading calculation result of a cognitive user. The invention provides a method for constructing and distributing a frequency spectrum map with an expandable network scale in a mobile edge network scene, which provides a real-time high-precision frequency spectrum map service for network cognitive users by using the minimum frequency spectrum data load and the minimum calculation overhead, and effectively improves the frequency spectrum efficiency and the robustness of a mobile edge network.
In one embodiment, for step 101, the offline training module includes a centralized trainer that is built by an edge server (e.g., a small cell, a wireless access point, or a drone assisted edge computing server, etc.).
The reinforcement learning framework is a reinforcement learning framework with centralized training and distributed execution.
The online execution module comprises a policy network, and the policy network is loaded on the cognitive user side.
In the next embodiment, for step 101, the offline training module collects available bandwidth information perceived by the cognitive users through a common channel, trains a mutually-coordinated policy network for each cognitive user by using the collected available bandwidth information, and sends the trained policy network parameters to the corresponding cognitive user through the common channel to update the parameters of the policy network of the corresponding cognitive user side; the policy network comprises an unloading policy of a cognitive user, computing resource allocation of an edge server and bandwidth allocation of a mobile edge network.
In one embodiment, training a cooperative policy network for each cognitive user using the collected available bandwidth information includes:
001: taking the sum of bandwidth, energy and time consumed in the process of constructing and distributing the frequency spectrum map as an objective function of the mobile edge network system;
Figure BDA0003352433390000061
wherein N represents the total number of cognitive users, WnThe bandwidth consumed by the edge server to distribute the spectrum map to the nth cognitive user, EnThe edge server constructs and distributes a frequency spectrum map to the nth cognitive user to consume the total calculation and communication energy, TnThe total time consumed by the edge server in the process of constructing and distributing the spectrum map to the nth cognitive user is included.
002: establishing a reward function in a reinforcement learning framework according to the target function;
rt=(w'-w)/w
in the formula, rtRepresenting rewards of all cognitive users after the cognitive users execute the strategy network parameters at the time t; w' represents a target function obtained after all cognitive users execute the strategy network parameters at the time t; and w represents an objective function obtained after all cognitive users execute the strategy network parameters at the time of t-1.
003: and outputting the reward value of the current strategy network by using the reward function according to the target function obtained by the cognitive user executing the uninstalling strategy.
004: and inputting the reward value into a neural network for training, and training a mutually-cooperative strategy network for each cognitive user.
During offline centralized training, a strategy network is obtained through neural network training according to the reward value output by the reward function, namely, the spectrum bandwidth distributed by the cognitive user, the server computing resource and the unloading strategy are obtained.
In the embodiment, an edge server corresponds to N cognitive users in a mobile edge network, the cognitive users sense available bandwidth through the spectrum sensing capability of the cognitive users, the edge server collects information of all the cognitive users through a common channel, an unloading strategy is established for the cognitive users through neural network training, spectrum bandwidth is distributed to the cognitive users, server computing resources are calculated, the cognitive users execute a strategy network in a distributed mode, an objective function is obtained through calculation, the objective functions of all the cognitive users are added to serve as an objective function of the mobile edge network, the objective function establishes a reward function of the strategy network in a reinforcement learning frame, and the strategy network is updated through neural network training.
In another embodiment, for step 103, the full offload policy, as shown in fig. 2, includes:
constructing original spectrum data acquired by a cognitive user into a low-resolution spectrum map by using an edge server and adopting a kriging interpolation algorithm;
and constructing the low-resolution frequency spectrum map into a high-resolution frequency spectrum map by utilizing a super-resolution algorithm, compressing the high-resolution frequency spectrum map and distributing the compressed high-resolution frequency spectrum map to mobile edge network terminal users.
In a next embodiment, for step 103, the partial offload policy, as shown in fig. 2, includes:
constructing the original spectrum data collected by the cognitive user into a low-resolution spectrum map by using an edge server and adopting a kriging interpolation algorithm
And distributing the low-resolution frequency spectrum map to a terminal user of a mobile edge network, and converting the low-resolution frequency spectrum map into a high-resolution frequency spectrum map by using a super-resolution algorithm by using the terminal user.
In one embodiment, for step 103, locally computing the policy, as shown in fig. 2, includes:
distributing original spectrum data acquired by the cognitive user to a mobile edge network terminal user by using an edge server;
and constructing the original spectrum data into a low-resolution spectrum map by using a kriging interpolation algorithm by using a terminal user, and constructing the low-resolution spectrum map into a high-resolution spectrum map by using a super-resolution algorithm.
The invention also provides a spectrum map construction and distribution system based on deep reinforcement learning, which comprises the following steps:
the model building module is used for modeling the building and distribution problems of the spectrum map in the mobile edge network into a calculation and communication compromise model and building a centralized training and distributed execution reinforcement learning framework; the reinforcement learning framework comprises an offline training module and an online execution module;
the frequency spectrum sensing module is used for acquiring available bandwidth information according to the self frequency spectrum sensing capability of the cognitive user;
the resource allocation and strategy selection module is used for allocating resources to the bandwidth of the mobile edge network and the computing capacity of the edge server by utilizing the offline training module according to the available bandwidth information and the computing capacity of the cognitive user terminal and selecting an unloading strategy; the unloading strategies comprise a full unloading strategy, a partial unloading strategy and a local computing strategy;
the spectrum map building module is used for distributing, unloading and calculating data of the cognitive user by using the selected unloading strategy through the trained online execution module, and building a spectrum map step by step;
and the real-time monitoring module is used for monitoring the available bandwidth information sensed by the cognitive user in real time, and when the change of the available bandwidth information is monitored, the offline training module is used for retraining the online execution module and determining an unloading strategy of a new environment so as to be self-adaptive to the complex and changeable communication environment.
The invention further provides a computer device, which includes a memory and a processor, wherein the memory stores a computer program, and the processor implements the steps of the method when executing the computer program.
The invention also proposes a computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method described above.
The above description is only a preferred embodiment of the present invention, and is not intended to limit the scope of the present invention, and all modifications and equivalents of the present invention, which are made by the contents of the present specification and the accompanying drawings, or directly/indirectly applied to other related technical fields, are included in the scope of the present invention.

Claims (10)

1. A spectrum map construction and distribution method based on deep reinforcement learning is characterized by comprising the following steps:
modeling the construction and distribution problems of the spectrum map in the mobile edge network into a calculation and communication compromise model, and constructing a centralized training and distributed execution reinforcement learning framework; the reinforcement learning framework comprises an offline training module and an online execution module;
acquiring available bandwidth information according to the self frequency spectrum sensing capability of the cognitive user;
according to the available bandwidth information and the computing power of the cognitive user terminal, resource allocation is carried out on the bandwidth of the mobile edge network and the computing power of the edge server by using an offline training module, and an unloading strategy is selected; the unloading strategies comprise a full unloading strategy, a partial unloading strategy and a local computing strategy;
through a trained online execution module, performing data distribution, unloading and calculation of the cognitive user by using a selected unloading strategy, and constructing a frequency spectrum map step by step;
and monitoring available bandwidth information perceived by a cognitive user in real time, and when the change of the available bandwidth information is monitored, retraining the online execution module by using the offline training module, and determining an unloading strategy of a new environment so as to adapt to a complex and changeable communication environment.
2. The deep reinforcement learning-based spectrum map building and distributing method according to claim 1, wherein the offline training module comprises a centralized trainer, and the centralized trainer is built through an edge server; the online execution module comprises a policy network, and the policy network is loaded on the cognitive user side.
3. The method for building and distributing spectrum map based on deep reinforcement learning according to claim 2, wherein the offline training module collects available bandwidth information perceived by the cognitive users through a common channel, trains a mutually cooperative strategy network for each cognitive user by using the collected available bandwidth information, and sends the trained strategy network parameters to the corresponding cognitive user through the common channel to update the parameters of the strategy network of the corresponding cognitive user side; the policy network comprises an unloading policy of a cognitive user, computing resource allocation of an edge server and bandwidth allocation of a mobile edge network.
4. The deep reinforcement learning-based spectrum map building and distributing method according to claim 3, wherein training a cooperative strategy network for each cognitive user by using the collected available bandwidth information comprises:
taking the sum of bandwidth, energy and time consumed in the process of constructing and distributing the frequency spectrum map as an objective function of the mobile edge network system;
establishing a reward function in a reinforcement learning framework according to the target function;
outputting the reward value of the current strategy network by using the reward function according to a target function obtained by the cognitive user executing the uninstalling strategy;
and inputting the reward value into a neural network for training, and training a mutually-cooperative strategy network for each cognitive user.
5. The deep reinforcement learning-based spectrum map building and distributing method according to claim 1, wherein the full offloading strategy comprises:
constructing original spectrum data acquired by a cognitive user into a low-resolution spectrum map by using an edge server and adopting a kriging interpolation algorithm;
and constructing the low-resolution frequency spectrum map into a high-resolution frequency spectrum map by utilizing a super-resolution algorithm, compressing the high-resolution frequency spectrum map and distributing the compressed high-resolution frequency spectrum map to mobile edge network terminal users.
6. The deep reinforcement learning-based spectrum map building and distributing method according to claim 1, wherein the partial offloading policy includes:
constructing original spectrum data acquired by a cognitive user into a low-resolution spectrum map by using an edge server and adopting a kriging interpolation algorithm;
and distributing the low-resolution frequency spectrum map to a terminal user of a mobile edge network, and converting the low-resolution frequency spectrum map into a high-resolution frequency spectrum map by using a super-resolution algorithm by using the terminal user.
7. The deep reinforcement learning-based spectrum map building and distributing method according to claim 1, wherein the local computing strategy comprises:
distributing original spectrum data acquired by the cognitive user to a mobile edge network terminal user by using an edge server;
and constructing the original spectrum data into a low-resolution spectrum map by using a kriging interpolation algorithm by using a terminal user, and constructing the low-resolution spectrum map into a high-resolution spectrum map by using a super-resolution algorithm.
8. A spectrum map construction and distribution system based on deep reinforcement learning is characterized by comprising the following components:
the model building module is used for modeling the building and distribution problems of the spectrum map in the mobile edge network into a calculation and communication compromise model and building a centralized training and distributed execution reinforcement learning framework; the reinforcement learning framework comprises an offline training module and an online execution module;
the frequency spectrum sensing module is used for acquiring available bandwidth information according to the self frequency spectrum sensing capability of the cognitive user;
the resource allocation and strategy selection module is used for allocating resources to the bandwidth of the mobile edge network and the computing capacity of the edge server by utilizing the offline training module according to the available bandwidth information and the computing capacity of the cognitive user terminal and selecting an unloading strategy; the unloading strategies comprise a full unloading strategy, a partial unloading strategy and a local computing strategy;
the spectrum map building module is used for distributing, unloading and calculating data of the cognitive user by using the selected unloading strategy through the trained online execution module, and building a spectrum map step by step;
and the real-time monitoring module is used for monitoring the available bandwidth information sensed by the cognitive user in real time, and when the change of the available bandwidth information is monitored, the offline training module is used for retraining the online execution module and determining an unloading strategy of a new environment so as to be self-adaptive to the complex and changeable communication environment.
9. A computer device comprising a memory and a processor, the memory storing a computer program, wherein the processor when executing the computer program implements the steps of the method of any of claims 1-8.
10. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 8.
CN202111341780.2A 2021-11-12 2021-11-12 Spectrum map construction and distribution method and system based on deep reinforcement learning Pending CN114116209A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111341780.2A CN114116209A (en) 2021-11-12 2021-11-12 Spectrum map construction and distribution method and system based on deep reinforcement learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111341780.2A CN114116209A (en) 2021-11-12 2021-11-12 Spectrum map construction and distribution method and system based on deep reinforcement learning

Publications (1)

Publication Number Publication Date
CN114116209A true CN114116209A (en) 2022-03-01

Family

ID=80379228

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111341780.2A Pending CN114116209A (en) 2021-11-12 2021-11-12 Spectrum map construction and distribution method and system based on deep reinforcement learning

Country Status (1)

Country Link
CN (1) CN114116209A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114826459A (en) * 2022-03-08 2022-07-29 南京航空航天大学 Spectrum map accurate construction method based on cross-domain reasoning
CN115129088A (en) * 2022-08-26 2022-09-30 中国人民解放军国防科技大学 Unmanned aerial vehicle track planning and obstacle avoidance method and system based on frequency spectrum map

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111405569A (en) * 2020-03-19 2020-07-10 三峡大学 Calculation unloading and resource allocation method and device based on deep reinforcement learning
CN111556572A (en) * 2020-04-21 2020-08-18 北京邮电大学 Spectrum resource and computing resource joint allocation method based on reinforcement learning
CN112967357A (en) * 2021-02-19 2021-06-15 中国人民解放军国防科技大学 Frequency spectrum map construction method based on convolutional neural network
CN113504987A (en) * 2021-06-30 2021-10-15 广州大学 Mobile edge computing task unloading method and device based on transfer learning
CN113612843A (en) * 2021-08-02 2021-11-05 吉林大学 MEC task unloading and resource allocation method based on deep reinforcement learning
US20220377137A1 (en) * 2020-01-08 2022-11-24 Shanghai Jiao Tong University Method for task offloading based on power control and resource allocation in industrial internet of things

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220377137A1 (en) * 2020-01-08 2022-11-24 Shanghai Jiao Tong University Method for task offloading based on power control and resource allocation in industrial internet of things
CN111405569A (en) * 2020-03-19 2020-07-10 三峡大学 Calculation unloading and resource allocation method and device based on deep reinforcement learning
CN111556572A (en) * 2020-04-21 2020-08-18 北京邮电大学 Spectrum resource and computing resource joint allocation method based on reinforcement learning
CN112967357A (en) * 2021-02-19 2021-06-15 中国人民解放军国防科技大学 Frequency spectrum map construction method based on convolutional neural network
CN113504987A (en) * 2021-06-30 2021-10-15 广州大学 Mobile edge computing task unloading method and device based on transfer learning
CN113612843A (en) * 2021-08-02 2021-11-05 吉林大学 MEC task unloading and resource allocation method based on deep reinforcement learning

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114826459A (en) * 2022-03-08 2022-07-29 南京航空航天大学 Spectrum map accurate construction method based on cross-domain reasoning
CN114826459B (en) * 2022-03-08 2023-02-03 南京航空航天大学 Spectrum map accurate construction method based on cross-domain reasoning
CN115129088A (en) * 2022-08-26 2022-09-30 中国人民解放军国防科技大学 Unmanned aerial vehicle track planning and obstacle avoidance method and system based on frequency spectrum map
CN115129088B (en) * 2022-08-26 2022-11-15 中国人民解放军国防科技大学 Unmanned aerial vehicle track planning and obstacle avoidance method and system based on frequency spectrum map

Similar Documents

Publication Publication Date Title
Li et al. SMDP-based coordinated virtual machine allocations in cloud-fog computing systems
CN113225377B (en) Internet of things edge task unloading method and device
CN114116209A (en) Spectrum map construction and distribution method and system based on deep reinforcement learning
CN114340016B (en) Power grid edge calculation unloading distribution method and system
CN113778677B (en) SLA-oriented intelligent optimization method for cloud-edge cooperative resource arrangement and request scheduling
CN112672382B (en) Hybrid collaborative computing unloading method and device, electronic equipment and storage medium
CN114126066B (en) MEC-oriented server resource allocation and address selection joint optimization decision method
CN112954736A (en) Policy-based computation offload of wireless energy-carrying internet-of-things equipment
CN112433852B (en) Internet of things edge calculation control method, device, equipment and storage medium
CN113641504A (en) Information interaction method for improving multi-agent reinforcement learning edge calculation effect
CN115175217A (en) Resource allocation and task unloading optimization method based on multiple intelligent agents
CN111813539A (en) Edge computing resource allocation method based on priority and cooperation
CN115686846B (en) Container cluster online deployment method integrating graph neural network and reinforcement learning in edge calculation
CN114090108B (en) Method and device for executing computing task, electronic equipment and storage medium
CN113821346A (en) Computation uninstalling and resource management method in edge computation based on deep reinforcement learning
CN111158893A (en) Task unloading method, system, equipment and medium applied to fog computing network
CN116489712B (en) Mobile edge computing task unloading method based on deep reinforcement learning
CN116489708B (en) Meta universe oriented cloud edge end collaborative mobile edge computing task unloading method
CN112312299A (en) Service unloading method, device and system
CN116954866A (en) Edge cloud task scheduling method and system based on deep reinforcement learning
CN114615705B (en) Single-user resource allocation strategy method based on 5G network
CN115665869A (en) Multi-user collaboration platform and method based on edge calculation and directed acyclic graph
CN115361453A (en) Load fair unloading and transferring method for edge service network
CN113157344B (en) DRL-based energy consumption perception task unloading method in mobile edge computing environment
Gaivoronski Stochastic optimization problems in telecommunications

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination