JP2019159888A5 - - Google Patents
Download PDFInfo
- Publication number
- JP2019159888A5 JP2019159888A5 JP2018046510A JP2018046510A JP2019159888A5 JP 2019159888 A5 JP2019159888 A5 JP 2019159888A5 JP 2018046510 A JP2018046510 A JP 2018046510A JP 2018046510 A JP2018046510 A JP 2018046510A JP 2019159888 A5 JP2019159888 A5 JP 2019159888A5
- Authority
- JP
- Japan
- Prior art keywords
- evaluation
- reward
- value
- machine learning
- scale
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000011156 evaluation Methods 0.000 claims 32
- 230000006870 function Effects 0.000 claims 18
- 238000000034 method Methods 0.000 claims 12
- 238000010801 machine learning Methods 0.000 claims 10
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2018046510A JP6902487B2 (ja) | 2018-03-14 | 2018-03-14 | 機械学習システム |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2018046510A JP6902487B2 (ja) | 2018-03-14 | 2018-03-14 | 機械学習システム |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2019159888A JP2019159888A (ja) | 2019-09-19 |
| JP2019159888A5 true JP2019159888A5 (enExample) | 2020-04-09 |
| JP6902487B2 JP6902487B2 (ja) | 2021-07-14 |
Family
ID=67996270
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2018046510A Expired - Fee Related JP6902487B2 (ja) | 2018-03-14 | 2018-03-14 | 機械学習システム |
Country Status (1)
| Country | Link |
|---|---|
| JP (1) | JP6902487B2 (enExample) |
Families Citing this family (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110328668B (zh) * | 2019-07-27 | 2022-03-22 | 南京理工大学 | 基于速度平滑确定性策略梯度的机械臂路径规划方法 |
| WO2021156516A1 (en) * | 2020-02-07 | 2021-08-12 | Deepmind Technologies Limited | Multi-objective reinforcement learning using objective-specific action-value functions |
| CN112853560B (zh) * | 2020-12-31 | 2021-11-23 | 盐城师范学院 | 一种基于环锭纺纱线质量的全局工序共享控制系统及方法 |
| CN112953844B (zh) * | 2021-03-02 | 2023-04-28 | 中国农业银行股份有限公司 | 一种网络流量优化方法及装置 |
| CN115284285B (zh) * | 2022-07-29 | 2025-01-14 | 腾讯科技(深圳)有限公司 | 目标对象控制方法和装置、计算设备、存储介质 |
| CN116468106A (zh) * | 2023-04-12 | 2023-07-21 | 华北电力大学 | 基于预训练和知识引导的深度强化学习经济调度方法 |
| CN116976442B (zh) * | 2023-06-04 | 2025-09-26 | 西北工业大学 | 一种基于me-ddpg的无人机多对一追捕博弈方法 |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP3086206B2 (ja) * | 1998-07-17 | 2000-09-11 | 科学技術振興事業団 | エージェント学習装置 |
| JP5330138B2 (ja) * | 2008-11-04 | 2013-10-30 | 本田技研工業株式会社 | 強化学習システム |
-
2018
- 2018-03-14 JP JP2018046510A patent/JP6902487B2/ja not_active Expired - Fee Related