JP2022162824A5 - - Google Patents

Download PDF

Info

Publication number
JP2022162824A5
JP2022162824A5 JP2021067848A JP2021067848A JP2022162824A5 JP 2022162824 A5 JP2022162824 A5 JP 2022162824A5 JP 2021067848 A JP2021067848 A JP 2021067848A JP 2021067848 A JP2021067848 A JP 2021067848A JP 2022162824 A5 JP2022162824 A5 JP 2022162824A5
Authority
JP
Japan
Prior art keywords
episode
data
behavior control
episode data
learning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2021067848A
Other languages
English (en)
Japanese (ja)
Other versions
JP7623879B2 (ja
JP2022162824A (ja
Filing date
Publication date
Application filed filed Critical
Priority to JP2021067848A priority Critical patent/JP7623879B2/ja
Priority claimed from JP2021067848A external-priority patent/JP7623879B2/ja
Publication of JP2022162824A publication Critical patent/JP2022162824A/ja
Publication of JP2022162824A5 publication Critical patent/JP2022162824A5/ja
Application granted granted Critical
Publication of JP7623879B2 publication Critical patent/JP7623879B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2021067848A 2021-04-13 2021-04-13 行動制御計画装置、行動制御計画方法及び行動制御計画システム Active JP7623879B2 (ja)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2021067848A JP7623879B2 (ja) 2021-04-13 2021-04-13 行動制御計画装置、行動制御計画方法及び行動制御計画システム

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2021067848A JP7623879B2 (ja) 2021-04-13 2021-04-13 行動制御計画装置、行動制御計画方法及び行動制御計画システム

Publications (3)

Publication Number Publication Date
JP2022162824A JP2022162824A (ja) 2022-10-25
JP2022162824A5 true JP2022162824A5 (https=) 2024-03-04
JP7623879B2 JP7623879B2 (ja) 2025-01-29

Family

ID=83724661

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2021067848A Active JP7623879B2 (ja) 2021-04-13 2021-04-13 行動制御計画装置、行動制御計画方法及び行動制御計画システム

Country Status (1)

Country Link
JP (1) JP7623879B2 (https=)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPWO2024075851A1 (https=) 2022-10-07 2024-04-11

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109906132B (zh) * 2016-09-15 2022-08-09 谷歌有限责任公司 机器人操纵的深度强化学习
JP7155062B2 (ja) * 2019-03-28 2022-10-18 セコム株式会社 監視システム及び飛行ロボット
JP7145813B2 (ja) * 2019-05-20 2022-10-03 ヤフー株式会社 学習装置、学習方法及び学習プログラム
JP2021018484A (ja) * 2019-07-17 2021-02-15 国立研究開発法人 海上・港湾・航空技術研究所 周辺状態表現方法、避航動作学習プログラム、避航動作学習システム、及び船舶
KR102169876B1 (ko) * 2020-05-22 2020-10-27 주식회사 애자일소다 조건부 에피소드 구성을 이용한 강화학습 장치 및 방법

Similar Documents

Publication Publication Date Title
Puik et al. Assessment of reconfiguration schemes for Reconfigurable Manufacturing Systems based on resources and lead time
CN105580031B (zh) 在多维范围上对包括可分离子系统的系统的评估
DE102020114218A1 (de) Verfahren und Vorrichtungen zum Verbessern der Laufzeitleistung auf einem heterogenen System ausgeführter Software
Ma et al. Application of grazing land models in ecosystem management: Current status and next frontiers
JP2017509982A (ja) 原位置ニューラルネットワークコプロセッシング
JP2016539407A (ja) 因果顕著性時間推論
CN114529010B (zh) 一种机器人自主学习方法、装置、设备及存储介质
TW201539335A (zh) 實現神經網路處理器
CN105531724A (zh) 用于调制神经设备的训练的方法和装置
Schmid et al. Self-adaptation based on big data analytics: a model problem and tool
CN117149410A (zh) 一种基于ai智能模型训练调度指挥监控系统
KR102577188B1 (ko) 목표 시스템에 대한 제어 시스템 생성
TW201528162A (zh) 在尖峰神經網路中使用重放來實施突觸學習
JP2019139295A5 (ja) 情報処理方法、情報処理装置およびプログラム
CN114662427B (zh) 一种逻辑系统设计的调试方法及设备
CN114613159A (zh) 基于深度强化学习的交通信号灯控制方法、装置及设备
CN118917259B (zh) 基于强化学习的与非图优化方法、装置、计算机设备、可读存储介质和程序产品
JP2022162824A5 (https=)
CN106104586B (zh) 神经元形态模型开发的上下文实时反馈
RU2016129653A (ru) Способ автоматизированного проектирования производства и эксплуатации прикладного программного обеспечения и система для его осуществления
CN117744760A (zh) 文本信息的识别方法、装置、存储介质及电子设备
US20200184029A1 (en) Simulation apparatus, simulation method, and non-transitory computer readable medium storing program
US20220326922A1 (en) Method for optimizing program using reinforcement learning
CN114386288B (zh) 一种电磁兼容仿真方法、装置、设备及可读存储介质
Santucci et al. Intrinsic motivation signals for driving the acquisition of multiple tasks: a simulated robotic study