CN114757352A - 智能体训练方法、跨域异构环境任务调度方法及相关装置 - Google Patents
智能体训练方法、跨域异构环境任务调度方法及相关装置 Download PDFInfo
- Publication number
- CN114757352A CN114757352A CN202210663848.7A CN202210663848A CN114757352A CN 114757352 A CN114757352 A CN 114757352A CN 202210663848 A CN202210663848 A CN 202210663848A CN 114757352 A CN114757352 A CN 114757352A
- Authority
- CN
- China
- Prior art keywords
- task
- cross
- agent
- reinforcement learning
- heterogeneous environment
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 206
- 238000012549 training Methods 0.000 title claims abstract description 124
- 230000002787 reinforcement Effects 0.000 claims abstract description 123
- 230000008569 process Effects 0.000 claims abstract description 52
- 238000012545 processing Methods 0.000 claims abstract description 33
- 230000009471 action Effects 0.000 claims description 67
- 238000004422 calculation algorithm Methods 0.000 claims description 40
- 230000006870 function Effects 0.000 claims description 35
- 238000004364 calculation method Methods 0.000 claims description 21
- 238000003860 storage Methods 0.000 claims description 15
- 238000013468 resource allocation Methods 0.000 claims description 11
- 230000008901 benefit Effects 0.000 claims description 9
- 238000004590 computer program Methods 0.000 claims description 8
- 238000011156 evaluation Methods 0.000 claims description 6
- 230000002860 competitive effect Effects 0.000 claims description 2
- 230000001965 increasing effect Effects 0.000 abstract description 3
- 230000005540 biological transmission Effects 0.000 description 7
- 238000010586 diagram Methods 0.000 description 6
- 238000013461 design Methods 0.000 description 5
- 238000010801 machine learning Methods 0.000 description 5
- 238000004891 communication Methods 0.000 description 4
- 238000004088 simulation Methods 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 238000013135 deep learning Methods 0.000 description 3
- 230000009977 dual effect Effects 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000005457 optimization Methods 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 238000007792 addition Methods 0.000 description 2
- 238000013480 data collection Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 230000010355 oscillation Effects 0.000 description 2
- 238000003491 array Methods 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000007477 logistic regression Methods 0.000 description 1
- 238000013178 mathematical model Methods 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/48—Program initiating; Program switching, e.g. by interrupt
- G06F9/4806—Task transfer initiation or dispatching
- G06F9/4843—Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
- G06F9/4881—Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5083—Techniques for rebalancing the load in a distributed system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/047—Probabilistic or stochastic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/01—Probabilistic graphical models, e.g. probabilistic networks
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Software Systems (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Probability & Statistics with Applications (AREA)
- Algebra (AREA)
- Computational Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Feedback Control In General (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210663848.7A CN114757352B (zh) | 2022-06-14 | 2022-06-14 | 智能体训练方法、跨域异构环境任务调度方法及相关装置 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210663848.7A CN114757352B (zh) | 2022-06-14 | 2022-06-14 | 智能体训练方法、跨域异构环境任务调度方法及相关装置 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114757352A true CN114757352A (zh) | 2022-07-15 |
CN114757352B CN114757352B (zh) | 2022-09-23 |
Family
ID=82337024
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210663848.7A Active CN114757352B (zh) | 2022-06-14 | 2022-06-14 | 智能体训练方法、跨域异构环境任务调度方法及相关装置 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114757352B (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115237581A (zh) * | 2022-09-21 | 2022-10-25 | 之江实验室 | 一种面向异构算力的多策略智能调度方法和装置 |
CN115293451A (zh) * | 2022-08-24 | 2022-11-04 | 中国西安卫星测控中心 | 基于深度强化学习的资源动态调度方法 |
CN116566805A (zh) * | 2023-07-10 | 2023-08-08 | 中国人民解放军国防科技大学 | 一种面向体系容灾抗毁的节点跨域调度方法、装置 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110781614A (zh) * | 2019-12-06 | 2020-02-11 | 北京工业大学 | 基于深度强化学习的舰载机出动回收在线调度方法 |
US20210004735A1 (en) * | 2018-03-22 | 2021-01-07 | Siemens Corporation | System and method for collaborative decentralized planning using deep reinforcement learning agents in an asynchronous environment |
CN113422795A (zh) * | 2021-05-06 | 2021-09-21 | 江苏大学 | 一种基于深度强化学习的车载边缘任务集中调度与资源分配联合优化方法 |
CN113485826A (zh) * | 2021-06-25 | 2021-10-08 | 中国电子科技集团公司第五十四研究所 | 一种边缘服务器负载均衡方法、系统 |
CN114048689A (zh) * | 2022-01-13 | 2022-02-15 | 南京信息工程大学 | 基于深度强化学习的多无人机空中充电和任务调度方法 |
-
2022
- 2022-06-14 CN CN202210663848.7A patent/CN114757352B/zh active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20210004735A1 (en) * | 2018-03-22 | 2021-01-07 | Siemens Corporation | System and method for collaborative decentralized planning using deep reinforcement learning agents in an asynchronous environment |
CN110781614A (zh) * | 2019-12-06 | 2020-02-11 | 北京工业大学 | 基于深度强化学习的舰载机出动回收在线调度方法 |
CN113422795A (zh) * | 2021-05-06 | 2021-09-21 | 江苏大学 | 一种基于深度强化学习的车载边缘任务集中调度与资源分配联合优化方法 |
CN113485826A (zh) * | 2021-06-25 | 2021-10-08 | 中国电子科技集团公司第五十四研究所 | 一种边缘服务器负载均衡方法、系统 |
CN114048689A (zh) * | 2022-01-13 | 2022-02-15 | 南京信息工程大学 | 基于深度强化学习的多无人机空中充电和任务调度方法 |
Non-Patent Citations (3)
Title |
---|
刘星 等: "基于深度强化学习算法的财务机器人任务分配的研究", 《中国管理信息化》 * |
刘晓平等: "基于Q学习的任务调度问题的改进研究", 《图学学报》 * |
黄亭飞 等: "基于DQN的多类型拦截装备复合式反无人机任务分配方法", 《控制与决策》 * |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115293451A (zh) * | 2022-08-24 | 2022-11-04 | 中国西安卫星测控中心 | 基于深度强化学习的资源动态调度方法 |
CN115293451B (zh) * | 2022-08-24 | 2023-06-16 | 中国西安卫星测控中心 | 基于深度强化学习的资源动态调度方法 |
CN115237581A (zh) * | 2022-09-21 | 2022-10-25 | 之江实验室 | 一种面向异构算力的多策略智能调度方法和装置 |
CN115237581B (zh) * | 2022-09-21 | 2022-12-27 | 之江实验室 | 一种面向异构算力的多策略智能调度方法和装置 |
CN116566805A (zh) * | 2023-07-10 | 2023-08-08 | 中国人民解放军国防科技大学 | 一种面向体系容灾抗毁的节点跨域调度方法、装置 |
CN116566805B (zh) * | 2023-07-10 | 2023-09-26 | 中国人民解放军国防科技大学 | 一种面向体系容灾抗毁的节点跨域调度方法、装置 |
Also Published As
Publication number | Publication date |
---|---|
CN114757352B (zh) | 2022-09-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN114757352B (zh) | 智能体训练方法、跨域异构环境任务调度方法及相关装置 | |
CN111556461B (zh) | 一种基于深度q网络的车载边缘网络任务分发卸载方法 | |
Qi et al. | Knowledge-driven service offloading decision for vehicular edge computing: A deep reinforcement learning approach | |
Zhang et al. | Deep learning empowered task offloading for mobile edge computing in urban informatics | |
CN109669768B (zh) | 一种面向边云结合架构的资源分配和任务调度方法 | |
CN110365514B (zh) | 基于强化学习的sdn多级虚拟网络映射方法和装置 | |
CN111835827A (zh) | 物联网边缘计算任务卸载方法及系统 | |
CN111629380A (zh) | 面向高并发多业务工业5g网络的动态资源分配方法 | |
Huang et al. | Toward decentralized and collaborative deep learning inference for intelligent iot devices | |
CN114172937A (zh) | 基于深度强化学习的动态服务功能链编排方法及系统 | |
CN112672382B (zh) | 混合协作计算卸载方法、装置、电子设备及存储介质 | |
CN115190033B (zh) | 一种基于强化学习的云边融合网络任务卸载方法 | |
Alghamdi et al. | On the optimality of task offloading in mobile edge computing environments | |
Qi et al. | Vehicular edge computing via deep reinforcement learning | |
CN114827284B (zh) | 工业物联网中服务功能链编排方法、装置及联邦学习系统 | |
Yamansavascilar et al. | Deepedge: A deep reinforcement learning based task orchestrator for edge computing | |
Ebrahim et al. | A deep learning approach for task offloading in multi-UAV aided mobile edge computing | |
Gao et al. | Fast adaptive task offloading and resource allocation via multiagent reinforcement learning in heterogeneous vehicular fog computing | |
CN115913712A (zh) | 基于多用户多接入点的任务卸载隐私保护系统及方法 | |
CN116489226A (zh) | 一种服务质量保障的在线资源调度方法 | |
CN111211984A (zh) | 优化cdn网络的方法、装置及电子设备 | |
Hu et al. | Dynamic task offloading in MEC-enabled IoT networks: A hybrid DDPG-D3QN approach | |
CN114090108A (zh) | 算力任务执行方法、装置、电子设备及存储介质 | |
Li et al. | Optimal service selection and placement based on popularity and server load in multi-access edge computing | |
CN111385151A (zh) | 一种基于多目标优化的虚拟网络的映射方法及装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CB03 | Change of inventor or designer information |
Inventor after: Sun Changyong Inventor after: Zhu Daofei Inventor after: Yang Tan Inventor after: Xu Guoai Inventor after: Liu Xinran Inventor after: Wang Yanwei Inventor after: Du Jiong Inventor after: Zheng Yongbo Inventor before: Sun Changyong Inventor before: Yang Tan Inventor before: Wang Yanwei Inventor before: Du Jiong Inventor before: Zheng Yongbo |
|
CB03 | Change of inventor or designer information |