JP7623879B2 - 行動制御計画装置、行動制御計画方法及び行動制御計画システム - Google Patents
行動制御計画装置、行動制御計画方法及び行動制御計画システム Download PDFInfo
- Publication number
- JP7623879B2 JP7623879B2 JP2021067848A JP2021067848A JP7623879B2 JP 7623879 B2 JP7623879 B2 JP 7623879B2 JP 2021067848 A JP2021067848 A JP 2021067848A JP 2021067848 A JP2021067848 A JP 2021067848A JP 7623879 B2 JP7623879 B2 JP 7623879B2
- Authority
- JP
- Japan
- Prior art keywords
- episode
- data
- learning
- episode data
- episodes
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Landscapes
- Traffic Control Systems (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2021067848A JP7623879B2 (ja) | 2021-04-13 | 2021-04-13 | 行動制御計画装置、行動制御計画方法及び行動制御計画システム |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2021067848A JP7623879B2 (ja) | 2021-04-13 | 2021-04-13 | 行動制御計画装置、行動制御計画方法及び行動制御計画システム |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2022162824A JP2022162824A (ja) | 2022-10-25 |
| JP2022162824A5 JP2022162824A5 (https=) | 2024-03-04 |
| JP7623879B2 true JP7623879B2 (ja) | 2025-01-29 |
Family
ID=83724661
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2021067848A Active JP7623879B2 (ja) | 2021-04-13 | 2021-04-13 | 行動制御計画装置、行動制御計画方法及び行動制御計画システム |
Country Status (1)
| Country | Link |
|---|---|
| JP (1) | JP7623879B2 (https=) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPWO2024075851A1 (https=) | 2022-10-07 | 2024-04-11 |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2019529135A (ja) | 2016-09-15 | 2019-10-17 | グーグル エルエルシー | ロボット操作のための深層強化学習 |
| JP2020162438A (ja) | 2019-03-28 | 2020-10-08 | セコム株式会社 | 監視システム及び飛行ロボット |
| KR102169876B1 (ko) | 2020-05-22 | 2020-10-27 | 주식회사 애자일소다 | 조건부 에피소드 구성을 이용한 강화학습 장치 및 방법 |
| JP2020190854A (ja) | 2019-05-20 | 2020-11-26 | ヤフー株式会社 | 学習装置、学習方法及び学習プログラム |
Family Cites Families (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2021018484A (ja) * | 2019-07-17 | 2021-02-15 | 国立研究開発法人 海上・港湾・航空技術研究所 | 周辺状態表現方法、避航動作学習プログラム、避航動作学習システム、及び船舶 |
-
2021
- 2021-04-13 JP JP2021067848A patent/JP7623879B2/ja active Active
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2019529135A (ja) | 2016-09-15 | 2019-10-17 | グーグル エルエルシー | ロボット操作のための深層強化学習 |
| JP2020162438A (ja) | 2019-03-28 | 2020-10-08 | セコム株式会社 | 監視システム及び飛行ロボット |
| JP2020190854A (ja) | 2019-05-20 | 2020-11-26 | ヤフー株式会社 | 学習装置、学習方法及び学習プログラム |
| KR102169876B1 (ko) | 2020-05-22 | 2020-10-27 | 주식회사 애자일소다 | 조건부 에피소드 구성을 이용한 강화학습 장치 및 방법 |
Non-Patent Citations (1)
| Title |
|---|
| 松本耕平, 外3名,"予測状態表現に基づく深層強化学習を用いた動的環境下における移動ロボットナビゲーション",第38回日本ロボット学会学術講演会予稿集DVD-ROM 2020年,日本,一般社団法人日本ロボット学会,2020年10月09日,p.1-4 |
Also Published As
| Publication number | Publication date |
|---|---|
| JP2022162824A (ja) | 2022-10-25 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN110383298A (zh) | 用于连续控制任务的数据高效强化学习 | |
| Bogert et al. | Multi-robot inverse reinforcement learning under occlusion with interactions | |
| Lee et al. | Learning a Super Mario controller from examples of human play | |
| CN118132414B (zh) | 一种机器人智能性测评方法及系统 | |
| Tastan et al. | Learning to intercept opponents in first person shooter games | |
| JP7623879B2 (ja) | 行動制御計画装置、行動制御計画方法及び行動制御計画システム | |
| Mothanna et al. | Review on reinforcement learning in cartpole game | |
| Hamadouche et al. | Comparison of value iteration, policy iteration and q-learning for solving decision-making problems | |
| CN117232522A (zh) | 基于时空交互图和危险区域的机器人人群导航方法及系统 | |
| DE112020004135T5 (de) | Robotersteuerungsvorrichtung | |
| CN119487523A (zh) | 使用多步逆模型的可控的潜在空间发现 | |
| Ji et al. | Evaluating the learning and performance characteristics of self-organizing systems with different task features | |
| Mendonça et al. | Reinforcement learning with optimized reward function for stealth applications | |
| Sheh et al. | Behavioural cloning for driving robots over rough terrain | |
| Toaz et al. | Vector Cost Behavioral Planning for Autonomous Robotic Systems with Contemporary Validation Strategies | |
| KR20230079804A (ko) | 상태 전이를 선형화하는 강화 학습에 기반한 전자 장치 및 그 방법 | |
| Colbert et al. | Evaluating navigation behavior of agents in games using non-parametric statistics | |
| JP7761527B2 (ja) | 行動制御計画装置及び行動制御計画方法 | |
| JP6714058B2 (ja) | 動きを予測する方法、装置およびプログラム | |
| KR20230018569A (ko) | 개체 상태 예측 장치, 개체 상태 예측 방법 및 개체 상태 예측 방법을 실행시키도록 기록매체에 저장된 컴퓨터 프로그램 | |
| Li | Study, design, and evaluation of exploration strategies for autonomous mobile robots | |
| Løvlid et al. | Data-driven behavior modeling for computer generated forces | |
| Bucher et al. | Adversarial curiosity | |
| Mai | Apply reinforcement learning in AWS DeepRacer | |
| 최석진 | Learning to Adapt: Latent Skill-Based Reinforcement Learning for Uncertain Objects and Environments |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20240222 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20240222 |
|
| A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20241031 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20241112 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20241218 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20250114 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20250117 |
|
| R150 | Certificate of patent or registration of utility model |
Ref document number: 7623879 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |