JPWO2021229625A5 - Google Patents
- Publication number
- JPWO2021229625A5
- Authority
- JP
- Japan
- Prior art keywords
- target
- objective function
- change
- learning
- result
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000006870 function Effects 0.000 claims 15
- 238000000034 method Methods 0.000 claims 7
- 238000005457 optimization Methods 0.000 claims 6
- 230000002787 reinforcement Effects 0.000 claims 3
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/JP2020/018767 WO2021229625A1 (ja) | 2020-05-11 | 2020-05-11 | Learning device, learning method, and learning program |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JPWO2021229625A1 | 2021-11-18 |
| JPWO2021229625A5 | 2023-01-24 |
| JP7420236B2 (ja) | 2024-01-23 |
Family
ID=78525971
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2022522086A (Active, JP7420236B2) | Learning device, learning method, and learning program | 2020-05-11 | 2020-05-11 |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US20230281506A1 |
| JP (1) | JP7420236B2 |
| WO (1) | WO2021229625A1 |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20250161753A1 (en) * | 2022-03-30 | 2025-05-22 | Nec Corporation | Workout support apparatus, workout support method, training apparatus, and storage medium |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10902347B2 (en) * | 2017-04-11 | 2021-01-26 | International Business Machines Corporation | Rule creation using MDP and inverse reinforcement learning |
| US11308401B2 (en) * | 2018-01-31 | 2022-04-19 | Royal Bank Of Canada | Interactive reinforcement learning with dynamic reuse of prior knowledge |
| WO2020159692A1 (en) * | 2019-01-28 | 2020-08-06 | Mayo Foundation For Medical Education And Research | Estimating latent reward functions from experiences |
| KR102653617B1 (ko) * | 2019-07-03 | 2024-04-01 | LG Electronics Inc. | Air conditioner and method for operating the air conditioner |
| US20220390909A1 (en) * | 2019-11-14 | 2022-12-08 | Nec Corporation | Learning device, learning method, and learning program |
| US20220318917A1 (en) * | 2019-12-25 | 2022-10-06 | Nec Corporation | Intention feature value extraction device, learning device, method, and program |
| JP7464115B2 (ja) * | 2020-05-11 | 2024-04-09 | NEC Corporation | Learning device, learning method, and learning program |
2020
- 2020-05-11: JP application JP2022522086A, patent JP7420236B2 (ja), status: Active
- 2020-05-11: WO application PCT/JP2020/018767, publication WO2021229625A1 (ja), status: Ceased
- 2020-05-11: US application US17/922,029, publication US20230281506A1 (en), status: Pending
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR101899101B1 (ko) | | Apparatus and method for generating a prediction model based on an artificial neural network |
| JPWO2016151620A1 (ja) | | Simulation system, simulation method, and simulation program |
| CN111433689B (zh) | | Generation of a control system for a target system |
| JPWO2021130916A5 | | |
| JPWO2020234984A5 | | |
| JP2019185127A5 (ja) | | Neural network learning device and control method therefor |
| KR20150084596A (ko) | | Optimization method for searching for optimal design parameters |
| JP2019520642A (ja) | | Control objective function integration system, control objective function integration method, and control objective function integration program |
| JP2017049801A5 | | |
| CN111077769B (zh) | | Method for controlling or regulating a technical system |
| JPWO2021090518A5 (ja) | | Learning device, learning method, and program |
| JPWO2022013933A5 (ja) | | Control device, control method, and program |
| JPWO2021229625A5 | | |
| WO2018083950A1 (ja) | | Parameter optimization device, parameter optimization method, and computer-readable recording medium |
| Wang et al. | | Incorporating structural constraints into continuous optimization for causal discovery |
| JPWO2021250720A5 (ja) | | Information processing device, information processing method, and program |
| JPWO2021229626A5 | | |
| Xu et al. | | A novel topology adaptation strategy for dynamic sparse training in deep reinforcement learning |
| JPWO2021229648A5 | | |
| JPWO2023166573A5 (ja) | | Learning device, control device, learning method, and program |
| JPWO2022180870A5 (ja) | | Learning device, learning method, and program |
| JPWO2022254626A5 | | |
| JPWO2022091413A5 | | |
| CN114254764B (zh) | | Feedback-based machine learning model search method, system, device, and medium |
| Arjun et al. | | Comparison of actor-critic reinforcement learning methods for adaptive cut selection for integer programming with applications to sensor network design |