JP7420236B2 - Learning device, learning method, and learning program - Google Patents
Learning device, learning method, and learning program
- Publication number
- JP7420236B2 (application number JP2022522086A)
- Authority
- JP
- Japan
- Prior art keywords
- target
- learning
- objective function
- output
- outputs
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/092—Reinforcement learning
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Mathematical Physics (AREA)
- General Physics & Mathematics (AREA)
- Computing Systems (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Medical Informatics (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/JP2020/018767 WO2021229625A1 (ja) | 2020-05-11 | 2020-05-11 | Learning device, learning method, and learning program |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JPWO2021229625A1 | 2021-11-18 |
| JPWO2021229625A5 | 2023-01-24 |
| JP7420236B2 (ja) | 2024-01-23 |
Family
ID=78525971
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2022522086A Active JP7420236B2 (ja) | 2020-05-11 | 2020-05-11 | Learning device, learning method, and learning program |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US20230281506A1 |
| JP (1) | JP7420236B2 |
| WO (1) | WO2021229625A1 |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20250161753A1 (en) * | 2022-03-30 | 2025-05-22 | Nec Corporation | Workout support apparatus, workout support method, training apparatus, and storage medium |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20190390867A1 (en) | 2019-07-03 | 2019-12-26 | LG Electronics Inc. | Air conditioner and method for operating the air conditioner |
Family Cites Families (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10902347B2 (en) * | 2017-04-11 | 2021-01-26 | International Business Machines Corporation | Rule creation using MDP and inverse reinforcement learning |
| US11308401B2 (en) * | 2018-01-31 | 2022-04-19 | Royal Bank Of Canada | Interactive reinforcement learning with dynamic reuse of prior knowledge |
| WO2020159692A1 (en) * | 2019-01-28 | 2020-08-06 | Mayo Foundation For Medical Education And Research | Estimating latent reward functions from experiences |
| US20220390909A1 (en) * | 2019-11-14 | 2022-12-08 | Nec Corporation | Learning device, learning method, and learning program |
| US20220318917A1 (en) * | 2019-12-25 | 2022-10-06 | Nec Corporation | Intention feature value extraction device, learning device, method, and program |
| JP7464115B2 (ja) * | 2020-05-11 | 2024-04-09 | NEC Corporation | Learning device, learning method, and learning program |
2020
- 2020-05-11: JP application JP2022522086A, publication JP7420236B2 (ja), status: Active
- 2020-05-11: WO application PCT/JP2020/018767, publication WO2021229625A1 (ja), status: Ceased
- 2020-05-11: US application US17/922,029, publication US20230281506A1 (en), status: Pending
Non-Patent Citations (1)
| Title |
|---|
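| Gakuto Masuyama (増山 岳人) et al., "Estimation of a reward function considering the learner's preferences by inverse reinforcement learning," 32nd Annual Conference of the Robotics Society of Japan, 2014-09-29, pp. 1102-1104 |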
Also Published As
| Publication number | Publication date |
|---|---|
| WO2021229625A1 (ja) | 2021-11-18 |
| JPWO2021229625A1 | 2021-11-18 |
| US20230281506A1 (en) | 2023-09-07 |
Similar Documents
| Publication | Title |
|---|---|
| Walsh et al. | Exploring compact reinforcement-learning representations with linear regression |
| JP7529145B2 (ja) | Learning device, learning method, and learning program |
| CN118154089B (zh) | Intelligent warehousing system and method based on supply chain management |
| CN113287124A (zh) | System and method for ride order dispatching |
| CN115146764B (zh) | Training method and apparatus for a prediction model, electronic device, and storage medium |
| CN119067273A (zh) | Method and system for logistics route planning |
| CN119558385B (zh) | Hierarchical federated learning method with high generalization ability |
| CN118760100A (zh) | Large-scale fuzzy flexible job shop scheduling method and related device |
| JP7464115B2 (ja) | Learning device, learning method, and learning program |
| Gaidar et al. | Mathematical method for optimising the transport and logistics industry |
| Xu et al. | Empty container repositioning problem using a reinforcement learning framework with multi-weight adaptive reward function |
| Workneh et al. | Learning to schedule (L2S): Adaptive job shop scheduling using double deep Q network |
| JP7420236B2 (ja) | Learning device, learning method, and learning program |
| Li et al. | A reinforcement learning framework for efficient task allocation among AGVs in smart warehouse |
| Chen et al. | Genetic programming with reinforcement learning trained transformer for real-world dynamic scheduling problems |
| Su et al. | Multi-objective optimization for dynamic logistics scheduling based on hierarchical deep reinforcement learning |
| Huang et al. | Deep reinforcement learning for solving car resequencing with selectivity banks in automotive assembly shops |
| Zabraoui et al. | AI-driven optimization of vehicle routing problems in supply chain: integrating graph neural networks and reinforcement learning |
| US20240037452A1 (en) | Learning device, learning method, and learning program |
| CN114298870A (zh) | Path planning method and apparatus, electronic device, and computer-readable medium |
| Workneh et al. | Deep Q network method for dynamic job shop scheduling problem |
| WO2025223072A1 (zh) | Production scheduling plan generation method and apparatus, and computer-readable storage medium |
| Oh et al. | Applying multi-agent reinforcement learning and graph neural networks to flexible job shop scheduling problem |
| Beier et al. | Towards supervised learning of optimal replenishment policies |
| Hamada | Neural Network Intelligence Model for Software Projects Cost Prediction |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 2022-10-26 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
| 2022-10-26 | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621 |
| 2023-09-19 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 |
| 2023-10-31 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
| | TRDD | Decision of grant or rejection written | |
| 2023-12-12 | A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01 |
| 2023-12-25 | A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61 |
| | R151 | Written notification of patent or utility model registration | Ref document number: 7420236; Country of ref document: JP; Free format text: JAPANESE INTERMEDIATE CODE: R151 |