JP2019031268A5 - Google Patents

Info

Publication number
JP2019031268A5
Authority
JP
Japan
Prior art keywords
control
vehicle
collected data
value
policy
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2018091189A
Other languages
English (en)
Japanese (ja)
Other versions
JP6856575B2 (ja)
JP2019031268A (ja)
Filing date
Publication date
Priority claimed from US15/594,020 (US10061316B2)
Application filed
Publication of JP2019031268A
Publication of JP2019031268A5
Application granted
Publication of JP6856575B2
Expired - Fee Related (current legal status)
Anticipated expiration

JP2018091189A 2017-05-12 2018-05-10 Control policy learning and vehicle control method based on reinforcement learning without active exploration Expired - Fee Related JP6856575B2 (ja)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US15/594,020 US10061316B2 (en) 2016-07-08 2017-05-12 Control policy learning and vehicle control method based on reinforcement learning without active exploration
US15/594,020 2017-05-12

Publications (3)

Publication Number Publication Date
JP2019031268A (ja) 2019-02-28
JP2019031268A5 2020-10-15
JP6856575B2 (ja) 2021-04-07

Family

ID=65522935

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2018091189A Expired - Fee Related JP6856575B2 (ja) 2017-05-12 2018-05-10 Control policy learning and vehicle control method based on reinforcement learning without active exploration

Country Status (1)

Country Link
JP (1) JP6856575B2

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
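KR102267316B1 (ko) * 2019-03-05 2021-06-21 네이버랩스 주식회사 Method and system for training an autonomous driving agent based on deep reinforcement learning
KR102463146B1 (ko) * 2020-07-14 2022-11-03 중앙대학교 산학협력단 HEMS optimization method and apparatus using hierarchical deep reinforcement learning
JP7433205B2 (ja) * 2020-12-17 2024-02-19 本田技研工業株式会社 Vehicle control device, vehicle control method, and program
CN112590792B (zh) * 2020-12-18 2024-05-10 的卢技术有限公司 Vehicle merging control method based on a deep reinforcement learning algorithm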
CA3210127A1 (en) * 2021-09-10 2023-03-16 Yann KOEBERLE Simulation based method and data center to obtain geo-fenced driving policy
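CN114735027B (zh) * 2022-04-13 2025-08-19 北京京东乾石科技有限公司 Operation decision-making method and device applied to unmanned vehicles
JP7788979B2 (ja) 2022-09-21 2025-12-19 本田技研工業株式会社 Estimation device, estimation method, and program
CN115909780B (zh) * 2022-11-09 2023-07-21 江苏大学 Highway merging control system and method based on intelligent connectivity and an RBF neural network
CN116843023B (zh) * 2023-06-08 2025-09-19 南京航空航天大学 Method for updating health parameters of aero-engine rotating components based on deep reinforcement learning
CN116946162B (zh) * 2023-09-19 2023-12-15 东南大学 Safe driving decision-making method for intelligent connected commercial vehicles considering road adhesion conditions
CN117911414B (zh) * 2024-03-20 2024-10-15 安徽大学 Motion control method for autonomous vehicles based on reinforcement learning
CN118343164B (zh) * 2024-06-17 2024-10-01 北京理工大学前沿技术研究院 Behavior decision-making method, system, device, and storage medium for autonomous vehicles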

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7107107B2 (en) * 2003-01-31 2006-09-12 Matsushita Electric Industrial Co., Ltd. Predictive action decision device and action decision method
JP2004348394A (ja) * 2003-05-21 2004-12-09 Toyota Central Res & Dev Lab Inc Environment changing device and action guideline information generating and presenting device
US10296004B2 (en) * 2017-06-21 2019-05-21 Toyota Motor Engineering & Manufacturing North America, Inc. Autonomous operation for an autonomous vehicle objective in a multi-vehicle environment
US10235881B2 (en) * 2017-07-28 2019-03-19 Toyota Motor Engineering & Manufacturing North America, Inc. Autonomous operation capability configuration for a vehicle

Similar Documents

Publication Publication Date Title
JP2019031268A5
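CN111796514B (zh) Controlling and monitoring a physical system based on a trained Bayesian neural network
EP3772707B1 (en) Dynamics model for globally stable modeling of system dynamics
CN109492763B (zh) Automatic parking method based on reinforcement learning network training
McKinnon et al. Learn fast, forget slow: Safe predictive learning control for systems with unknown and changing dynamics performing repetitive tasks
CN111507458B (zh) Method and device for providing personalized and adaptive deep learning models
CN111898206B (zh) Parameter optimization method based on an improved genetic algorithm, computer device, and storage medium
JP6845529B2 (ja) Action decision system and automated driving control device
CN113874865A (zh) Method and device for determining model parameters of a control strategy for a technical system by means of Bayesian optimization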
US20210263526A1 (en) Method and device for supporting maneuver planning for an automated driving vehicle or a robot
US20200026296A1 (en) Method and device for driving dynamics control for a transportation vehicle
US10493625B2 (en) System for generating sets of control data for robots
US20220055217A1 (en) Method for operating a robot in a multi-agent system, robot, and multi-agent system
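JPWO2020009139A5 (ja) Control device, system, control method, policy update method, and generation method
KR102502125B1 (ko) Method and apparatus for motion modeling and control of an unmanned surface vehicle for autonomous navigation
CN113939775B (zh) Method and device for determining a control strategy for a technical system
JP6986503B2 (ja) Electronic control device and neural network update system
US12246449B2 (en) Device and method for controlling a robotic device
JP7520238B2 (ja) Apparatus and method for controlling a system with uncertainty in its dynamics
CN118618342A (zh) Automatic parking control method and device for a vehicle, vehicle, and storage medium
Peng et al. Chance-constrained sneaking trajectory planning for reconnaissance robots
CN118439040B (zh) Method, device, storage medium, and electronic equipment for estimating vehicle weight and road gradient
CN113544611B (zh) Method and device for operating an automated vehicle
US20210008718A1 (en) Method, device and computer program for producing a strategy for a robot
CN108154231B (zh) System-error-based parameter self-tuning method for a MISO full-format model-free controller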