CN112434464B - Cooperative arc welding method for multiple ship robotic arms based on the MADDPG algorithm - Google Patents
- Publication number
- CN112434464B (application CN202011240612.XA)
- Authority
- CN
- China
- Prior art keywords
- welding
- value
- action
- state
- function
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F30/00—Computer-aided design [CAD]
- G06F30/20—Design optimisation, verification or simulation
- G06F30/27—Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Medical Informatics (AREA)
- Computer Hardware Design (AREA)
- Geometry (AREA)
- Feedback Control In General (AREA)
- Manipulator (AREA)
Abstract
Description
Claims (1)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011240612.XA CN112434464B (zh) | 2020-11-09 | 2020-11-09 | Cooperative arc welding method for multiple ship robotic arms based on the MADDPG algorithm |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011240612.XA CN112434464B (zh) | 2020-11-09 | 2020-11-09 | Cooperative arc welding method for multiple ship robotic arms based on the MADDPG algorithm |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112434464A (zh) | 2021-03-02 |
CN112434464B (zh) | 2021-09-10 |
Family ID: 74701145
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011240612.XA Active CN112434464B (zh) | Cooperative arc welding method for multiple ship robotic arms based on the MADDPG algorithm | 2020-11-09 | 2020-11-09 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112434464B (zh) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
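CN112966816A (zh) * | 2021-03-31 | 2021-06-15 | Southeast University | A multi-agent reinforcement learning method for formation encirclement |
CN115107948B (zh) * | 2022-06-24 | 2023-08-25 | Dalian Maritime University | An efficient reinforcement learning method for autonomous ship collision avoidance |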
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
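CN107102619A (zh) * | 2016-02-19 | 2017-08-29 | FANUC Corporation | Machine learning device, industrial machine cell, manufacturing system, and machine learning method |
CN108052004A (zh) * | 2017-12-06 | 2018-05-18 | Hubei University of Technology | Automatic control method for industrial robotic arms based on deep reinforcement learning |
CN109948642A (zh) * | 2019-01-18 | 2019-06-28 | Sun Yat-sen University | Multi-agent cross-modal deep deterministic policy gradient training method based on image input |
CN110390845A (zh) * | 2018-04-18 | 2019-10-29 | Beijing Jingdong Shangke Information Technology Co., Ltd. | Robot training method and device in a virtual environment, storage medium, and computer system |
CN111881772A (zh) * | 2020-07-06 | 2020-11-03 | Shanghai Jiao Tong University | Multi-robotic-arm cooperative assembly method and system based on deep reinforcement learning |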
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11295174B2 (en) * | 2018-11-05 | 2022-04-05 | Royal Bank Of Canada | Opponent modeling with asynchronous methods in deep RL |
Non-Patent Citations (2)
Title |
---|
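"Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"; Ryan Lowe et al.; https://arxiv.org/pdf/1706.02275.pdf; 2020-03-14; pp. 1-16 * |
"In Depth: OpenAI Proposes a New Reinforcement Learning Method That Lets Agents Learn to Cooperate, Compete, and Communicate"; compiled by 机器之心 (Synced); https://www.sohu.com/a/147687356_465975; 2017-06-10; pp. 1-6 * |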
Also Published As
Publication number | Publication date |
---|---|
CN112434464A (zh) | 2021-03-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zhang et al. | Deep interactive reinforcement learning for path following of autonomous underwater vehicle | |
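CN113900445A (zh) | UAV cooperative control training method and system based on multi-agent reinforcement learning | |
CN112434464B (zh) | Cooperative arc welding method for multiple ship robotic arms based on the MADDPG algorithm | |
CN112427843B (zh) | Cooperative weld-point welding method for multiple ship robotic arms based on the QMIX reinforcement learning algorithm | |
CN111168684B (zh) | An on-orbit assembly sequence planning method for large space structures | |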
Machmudah et al. | Polynomial joint angle arm robot motion planning in complex geometrical obstacles | |
Larsen et al. | Path planning of cooperating industrial robots using evolutionary algorithms | |
Zhu et al. | Deep reinforcement learning for real-time assembly planning in robot-based prefabricated construction | |
Kelasidi et al. | Multi-objective optimization for efficient motion of underwater snake robots | |
CN113485323A (zh) | A flexible formation method for cascaded multiple mobile robots | |
Chen et al. | Maddpg algorithm for coordinated welding of multiple robots | |
Kurdi et al. | Proposed system of artificial Neural Network for positioning and navigation of UAV-UGV | |
CN117606490B (zh) | A cooperative search path planning method for autonomous underwater vehicles | |
Sun et al. | A Fuzzy-Based Bio-Inspired Neural Network Approach for Target Search by Multiple Autonomous Underwater Vehicles in Underwater Environments. | |
Mousa et al. | Path planning for a 6 DoF robotic arm based on whale optimization algorithm and genetic algorithm | |
Vesentini et al. | Velocity obstacle-based trajectory planner for anthropomorphic arms | |
Raiesdana | A hybrid method for industrial robot navigation | |
CN116796843A (zh) | A many-to-many UAV pursuit-evasion game method based on PSO-M3DDPG | |
Wang et al. | Integrated reinforcement and imitation learning for tower crane lift path planning | |
Wang et al. | Path planning optimization for teaching and playback welding robot | |
CN115542921A (zh) | Autonomous path planning method for multiple robots | |
Chen et al. | Mitigating Imminent Collision for Multi-robot Navigation: A TTC-force Reward Shaping Approach | |
Liao et al. | Qmix algorithm for coordinated welding of multiple robots | |
Petrenko et al. | Machine Learning Algorithm for Anthropomorphic Manipulator Control System | |
Guan | Self-inspection method of unmanned aerial vehicles in power plants using deep q-network reinforcement learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| PB01 | Publication | |
| SE01 | Entry into force of request for substantive examination | |
| GR01 | Patent grant | |
| CP01 | Change in the name or title of a patent holder | Address after: 222061 No.18, Shenghu Road, Lianyungang City, Jiangsu Province; Patentee after: The 716th Research Institute of China Shipbuilding Corp.; Patentee after: JIANGSU JARI TECHNOLOGY GROUP Co.,Ltd.; Address before: 222061 No.18, Shenghu Road, Lianyungang City, Jiangsu Province; Patentee before: 716TH RESEARCH INSTITUTE OF CHINA SHIPBUILDING INDUSTRY Corp.; Patentee before: JIANGSU JARI TECHNOLOGY GROUP Co.,Ltd. |
| TR01 | Transfer of patent right | Effective date of registration: 2022-10-12; Address after: 222061 No.18, Shenghu Road, Lianyungang City, Jiangsu Province; Patentee after: The 716th Research Institute of China Shipbuilding Corp.; Patentee after: JIANGSU JARI TECHNOLOGY GROUP Co.,Ltd.; Patentee after: CSIC Information Technology Co.,Ltd.; Address before: 222061 No.18, Shenghu Road, Lianyungang City, Jiangsu Province; Patentee before: The 716th Research Institute of China Shipbuilding Corp.; Patentee before: JIANGSU JARI TECHNOLOGY GROUP Co.,Ltd. |
| CP01 | Change in the name or title of a patent holder | Address after: 222061 No.18, Shenghu Road, Lianyungang City, Jiangsu Province; Patentee after: The 716th Research Institute of China Shipbuilding Corp.; Patentee after: JIANGSU JARI TECHNOLOGY GROUP Co.,Ltd.; Patentee after: China Shipbuilding Digital Information Technology Co.,Ltd.; Address before: 222061 No.18, Shenghu Road, Lianyungang City, Jiangsu Province; Patentee before: The 716th Research Institute of China Shipbuilding Corp.; Patentee before: JIANGSU JARI TECHNOLOGY GROUP Co.,Ltd.; Patentee before: CSIC Information Technology Co.,Ltd. |