JP7623879B2 - 行動制御計画装置、行動制御計画方法及び行動制御計画システム - Google Patents

行動制御計画装置、行動制御計画方法及び行動制御計画システム Download PDF

Info

Publication number
JP7623879B2
JP7623879B2 JP2021067848A JP2021067848A JP7623879B2 JP 7623879 B2 JP7623879 B2 JP 7623879B2 JP 2021067848 A JP2021067848 A JP 2021067848A JP 2021067848 A JP2021067848 A JP 2021067848A JP 7623879 B2 JP7623879 B2 JP 7623879B2
Authority
JP
Japan
Prior art keywords
episode
data
learning
episode data
episodes
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2021067848A
Other languages
English (en)
Japanese (ja)
Other versions
JP2022162824A5 (https=
JP2022162824A (ja
Inventor
利浩 森澤
雅佳 古林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hitachi Ltd
Original Assignee
Hitachi Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hitachi Ltd filed Critical Hitachi Ltd
Priority to JP2021067848A priority Critical patent/JP7623879B2/ja
Publication of JP2022162824A publication Critical patent/JP2022162824A/ja
Publication of JP2022162824A5 publication Critical patent/JP2022162824A5/ja
Application granted granted Critical
Publication of JP7623879B2 publication Critical patent/JP7623879B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Traffic Control Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
JP2021067848A 2021-04-13 2021-04-13 行動制御計画装置、行動制御計画方法及び行動制御計画システム Active JP7623879B2 (ja)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2021067848A JP7623879B2 (ja) 2021-04-13 2021-04-13 行動制御計画装置、行動制御計画方法及び行動制御計画システム

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2021067848A JP7623879B2 (ja) 2021-04-13 2021-04-13 行動制御計画装置、行動制御計画方法及び行動制御計画システム

Publications (3)

Publication Number Publication Date
JP2022162824A JP2022162824A (ja) 2022-10-25
JP2022162824A5 JP2022162824A5 (https=) 2024-03-04
JP7623879B2 true JP7623879B2 (ja) 2025-01-29

Family

ID=83724661

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2021067848A Active JP7623879B2 (ja) 2021-04-13 2021-04-13 行動制御計画装置、行動制御計画方法及び行動制御計画システム

Country Status (1)

Country Link
JP (1) JP7623879B2 (https=)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPWO2024075851A1 (https=) 2022-10-07 2024-04-11

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2019529135A (ja) 2016-09-15 2019-10-17 グーグル エルエルシー ロボット操作のための深層強化学習
JP2020162438A (ja) 2019-03-28 2020-10-08 セコム株式会社 監視システム及び飛行ロボット
KR102169876B1 (ko) 2020-05-22 2020-10-27 주식회사 애자일소다 조건부 에피소드 구성을 이용한 강화학습 장치 및 방법
JP2020190854A (ja) 2019-05-20 2020-11-26 ヤフー株式会社 学習装置、学習方法及び学習プログラム

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2021018484A (ja) * 2019-07-17 2021-02-15 国立研究開発法人 海上・港湾・航空技術研究所 周辺状態表現方法、避航動作学習プログラム、避航動作学習システム、及び船舶

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2019529135A (ja) 2016-09-15 2019-10-17 グーグル エルエルシー ロボット操作のための深層強化学習
JP2020162438A (ja) 2019-03-28 2020-10-08 セコム株式会社 監視システム及び飛行ロボット
JP2020190854A (ja) 2019-05-20 2020-11-26 ヤフー株式会社 学習装置、学習方法及び学習プログラム
KR102169876B1 (ko) 2020-05-22 2020-10-27 주식회사 애자일소다 조건부 에피소드 구성을 이용한 강화학습 장치 및 방법

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
松本耕平, 外3名,"予測状態表現に基づく深層強化学習を用いた動的環境下における移動ロボットナビゲーション",第38回日本ロボット学会学術講演会予稿集DVD-ROM 2020年,日本,一般社団法人日本ロボット学会,2020年10月09日,p.1-4

Also Published As

Publication number Publication date
JP2022162824A (ja) 2022-10-25

Similar Documents

Publication Publication Date Title
CN110383298A (zh) 用于连续控制任务的数据高效强化学习
Bogert et al. Multi-robot inverse reinforcement learning under occlusion with interactions
Lee et al. Learning a Super Mario controller from examples of human play
CN118132414B (zh) 一种机器人智能性测评方法及系统
Tastan et al. Learning to intercept opponents in first person shooter games
JP7623879B2 (ja) 行動制御計画装置、行動制御計画方法及び行動制御計画システム
Mothanna et al. Review on reinforcement learning in cartpole game
Hamadouche et al. Comparison of value iteration, policy iteration and q-learning for solving decision-making problems
CN117232522A (zh) 基于时空交互图和危险区域的机器人人群导航方法及系统
DE112020004135T5 (de) Robotersteuerungsvorrichtung
CN119487523A (zh) 使用多步逆模型的可控的潜在空间发现
Ji et al. Evaluating the learning and performance characteristics of self-organizing systems with different task features
Mendonça et al. Reinforcement learning with optimized reward function for stealth applications
Sheh et al. Behavioural cloning for driving robots over rough terrain
Toaz et al. Vector Cost Behavioral Planning for Autonomous Robotic Systems with Contemporary Validation Strategies
KR20230079804A (ko) 상태 전이를 선형화하는 강화 학습에 기반한 전자 장치 및 그 방법
Colbert et al. Evaluating navigation behavior of agents in games using non-parametric statistics
JP7761527B2 (ja) 行動制御計画装置及び行動制御計画方法
JP6714058B2 (ja) 動きを予測する方法、装置およびプログラム
KR20230018569A (ko) 개체 상태 예측 장치, 개체 상태 예측 방법 및 개체 상태 예측 방법을 실행시키도록 기록매체에 저장된 컴퓨터 프로그램
Li Study, design, and evaluation of exploration strategies for autonomous mobile robots
Løvlid et al. Data-driven behavior modeling for computer generated forces
Bucher et al. Adversarial curiosity
Mai Apply reinforcement learning in AWS DeepRacer
최석진 Learning to Adapt: Latent Skill-Based Reinforcement Learning for Uncertain Objects and Environments

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20240222

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20240222

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20241031

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20241112

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20241218

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20250114

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20250117

R150 Certificate of patent or registration of utility model

Ref document number: 7623879

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150