JP2018037064A5 - Google Patents
- Publication number
- JP2018037064A5 (application JP2017131700A)
- Authority
- JP
- Japan
- Prior art keywords
- cost
- arrival
- vehicle
- function
- approximated
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
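Concepts
| Concept ID | Name | Type | Query match | Sections | Count |
|---|---|---|---|---|---|
| 230000006870 | function | Effects | 0.000 | claims, description | 58 |
| 238000000034 | method | Methods | 0.000 | claims, description | 41 |
| 230000005855 | radiation | Effects | 0.000 | claims, description | 2 |
| 230000006399 | behavior | Effects | 0.000 | claims | 1 |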
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US15/205,558 | 2016-07-08 | | |
| US15/205,558 (US10065654B2, en) | 2016-07-08 | 2016-07-08 | Online learning and vehicle control method based on reinforcement learning without active exploration |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2018037064A (ja) | 2018-03-08 |
| JP2018037064A5 | 2020-08-20 |
| JP7036545B2 (ja) | 2022-03-15 |
Family
ID=60892997
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2017131700A (Active, JP7036545B2, ja) | 2016-07-08 | 2017-07-05 | Online learning method and vehicle control method based on reinforcement learning without active exploration |
Country Status (2)
| Country | Link |
|---|---|
| US (1) | US10065654B2 |
| JP (1) | JP7036545B2 |
Families Citing this family (42)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10296004B2 (en) * | 2017-06-21 | 2019-05-21 | Toyota Motor Engineering & Manufacturing North America, Inc. | Autonomous operation for an autonomous vehicle objective in a multi-vehicle environment |
| EP3721340A4 (en) * | 2017-12-04 | 2021-09-08 | Optimum Semiconductor Technologies Inc. | NEURONAL NETWORK ACCELERATOR SYSTEM AND ARCHITECTURE |
| JP6884685B2 (ja) * | 2017-12-08 | 2021-06-09 | 三菱重工業株式会社 | Control device, unmanned system, control method, and program |
| US11130497B2 (en) | 2017-12-18 | 2021-09-28 | Plusai Limited | Method and system for ensemble vehicle control prediction in autonomous driving vehicles |
| US20190185012A1 (en) | 2017-12-18 | 2019-06-20 | PlusAI Corp | Method and system for personalized motion planning in autonomous driving vehicles |
| US11273836B2 (en) * | 2017-12-18 | 2022-03-15 | Plusai, Inc. | Method and system for human-like driving lane planning in autonomous driving vehicles |
| US20210116930A1 (en) * | 2018-02-28 | 2021-04-22 | Sony Corporation | Information processing apparatus, information processing method, program, and mobile object |
| JP7035734B2 (ja) * | 2018-03-30 | 2022-03-15 | 富士通株式会社 | Reinforcement learning program, reinforcement learning method, and reinforcement learning device |
| US10990096B2 (en) * | 2018-04-27 | 2021-04-27 | Honda Motor Co., Ltd. | Reinforcement learning on autonomous vehicles |
| CN108959467B (zh) * | 2018-06-20 | 2021-10-15 | 华东师范大学 | Method for computing the relevance between question sentences and answer sentences based on reinforcement learning |
| US11823039B2 (en) | 2018-08-24 | 2023-11-21 | International Business Machines Corporation | Safe and fast exploration for reinforcement learning using constrained action manifolds |
| EP3629105A1 (en) * | 2018-09-27 | 2020-04-01 | Bayerische Motoren Werke Aktiengesellschaft | High-level decision making for safe and reasonable autonomous lane changing using reinforcement learning |
| US10940863B2 (en) * | 2018-11-01 | 2021-03-09 | GM Global Technology Operations LLC | Spatial and temporal attention-based deep reinforcement learning of hierarchical lane-change policies for controlling an autonomous vehicle |
| JP7206874B2 (ja) | 2018-12-10 | 2023-01-18 | 富士電機株式会社 | Control device, control method, and program |
| EP3877117B1 (en) | 2018-12-13 | 2024-10-16 | Siemens Aktiengesellschaft | Automated system including reachability analysis |
| CN109901572B (zh) * | 2018-12-13 | 2022-06-28 | 华为技术有限公司 | Autonomous driving method, training method, and related apparatus |
| EP3693243B1 (en) * | 2019-02-06 | 2024-11-06 | Zenuity AB | Method and system for controlling an automated driving system of a vehicle |
| DE102019206908B4 (de) * | 2019-05-13 | 2022-02-17 | Psa Automobiles Sa | Method for training at least one algorithm for a control unit of a motor vehicle, computer program product, motor vehicle, and system |
| JP2021017168A (ja) * | 2019-07-22 | 2021-02-15 | 本田技研工業株式会社 | Damper control system, vehicle, information processing device, control methods therefor, and program |
| EP3783446B1 (de) * | 2019-08-21 | 2021-08-11 | dSPACE digital signal processing and control engineering GmbH | Computer-implemented method and test unit for approximating a subset of test results |
| CN110843746B (zh) * | 2019-11-28 | 2022-06-14 | 的卢技术有限公司 | Anti-lock braking control method and system based on reinforcement learning |
| CN113552869B (zh) * | 2020-04-23 | 2023-07-07 | 华为技术有限公司 | Method for optimizing decision-making and planning control, method for controlling vehicle driving, and related apparatus |
| US12246699B2 (en) * | 2020-06-26 | 2025-03-11 | Mitsubishi Electric Research Laboratories, Inc. | System and method for data-driven reference generation |
| CN111796522B (zh) * | 2020-07-16 | 2022-06-03 | 上海智驾汽车科技有限公司 | Vehicle state estimation method |
| CN112289044B (zh) * | 2020-11-02 | 2021-09-07 | 南京信息工程大学 | Highway cooperative control system and method based on deep reinforcement learning |
| US12384410B2 (en) | 2021-03-05 | 2025-08-12 | The Research Foundation For The State University Of New York | Task-motion planning for safe and efficient urban driving |
| US11669593B2 (en) | 2021-03-17 | 2023-06-06 | Geotab Inc. | Systems and methods for training image processing models for vehicle data collection |
| US12322192B2 (en) | 2021-03-17 | 2025-06-03 | Geotab Inc. | Systems and methods for vehicle data collection by image analysis |
| US11682218B2 (en) | 2021-03-17 | 2023-06-20 | Geotab Inc. | Methods for vehicle data collection by image analysis |
| US11886196B2 (en) * | 2021-04-05 | 2024-01-30 | Mitsubishi Electric Research Laboratories, Inc. | Controlling machine operating in uncertain environment discoverable by sensing |
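| CN113253612B (zh) * | 2021-06-01 | 2021-09-17 | 苏州浪潮智能科技有限公司 | Autonomous driving control method, apparatus, device, and readable storage medium |
| CN113359476B (zh) * | 2021-07-09 | 2022-09-16 | 广东华中科技大学工业技术研究院 | Design method for a consensus control algorithm for discrete-time multi-agent systems |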
| US12272184B2 (en) | 2021-11-05 | 2025-04-08 | Geotab Inc. | AI-based input output expansion adapter for a telematics device |
| US11693920B2 (en) * | 2021-11-05 | 2023-07-04 | Geotab Inc. | AI-based input output expansion adapter for a telematics device and methods for updating an AI model thereon |
| EP4507956A1 (en) * | 2022-04-14 | 2025-02-19 | Mitsubishi Electric Corporation | System and method for motion and path planning for trailer-based vehicle |
| US20230376832A1 (en) * | 2022-05-18 | 2023-11-23 | GM Global Technology Operations LLC | Calibrating parameters within a virtual environment using reinforcement learning |
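| CN117437771A (zh) * | 2022-07-15 | 2024-01-23 | 北京图森智途科技有限公司 | Target state estimation method, apparatus, electronic device, and medium |
| CN115230720A (zh) * | 2022-07-22 | 2022-10-25 | 江苏大学 | Autonomous vehicle control system and method based on a deep Lagrangian neural network |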
| US20240157978A1 (en) * | 2022-11-11 | 2024-05-16 | Waabi Innovation Inc. | Mixed reality simulation for autonomous systems |
| US20240326815A1 (en) * | 2023-03-31 | 2024-10-03 | Ford Global Technologies, Llc | Vehicle maneuvering |
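| CN116149262B (zh) * | 2023-04-23 | 2023-07-04 | 山东科技大学 | Tracking control method and system for a servo system |
| CN119739039B (zh) * | 2024-12-20 | 2025-11-25 | 北京航空航天大学 | Method, apparatus, device, and storage medium for obtaining an interception guidance strategy |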
Family Cites Families (22)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| AU3477397A (en) * | 1996-06-04 | 1998-01-05 | Paul J. Werbos | 3-brain architecture for an intelligent decision and control system |
| JPH10254505A (ja) * | 1997-03-14 | 1998-09-25 | Toyota Motor Corp | Automatic control device |
| US6532454B1 (en) * | 1998-09-24 | 2003-03-11 | Paul J. Werbos | Stable adaptive control using critic designs |
| US6882992B1 (en) * | 1999-09-02 | 2005-04-19 | Paul J. Werbos | Neural networks for intelligent control |
| US6411871B1 (en) | 2000-08-05 | 2002-06-25 | American Gnc Corporation | Autonomous navigation, guidance and control using LDRI |
| JP2006313512A (ja) | 2005-04-04 | 2006-11-16 | Sony Corp | Learning control device, learning control method, and program |
| US8854199B2 (en) | 2009-01-26 | 2014-10-07 | Lytx, Inc. | Driver risk assessment system and method employing automated driver log |
| US8812226B2 (en) * | 2009-01-26 | 2014-08-19 | GM Global Technology Operations LLC | Multiobject fusion module for collision preparation system |
| US8380367B2 (en) | 2009-03-26 | 2013-02-19 | The University Of North Dakota | Adaptive surveillance and guidance system for vehicle collision avoidance and interception |
| JP2012533056A (ja) * | 2009-07-09 | 2012-12-20 | トムトム インターナショナル ベスローテン フエンノートシャップ | Navigation device using map data together with route search acceleration data |
| WO2012117044A2 (de) | 2011-03-01 | 2012-09-07 | Continental Teves Ag & Co. Ohg | Method and device for the prediction and adaptation of motion trajectories of motor vehicles |
| US8965834B2 (en) * | 2011-12-07 | 2015-02-24 | Extendabrain Corporation | Particle methods for nonlinear control |
| US10366325B2 (en) * | 2011-12-07 | 2019-07-30 | Paul Burchard | Sparse neural control |
| CN104247467A (zh) | 2012-02-17 | 2014-12-24 | 英特托拉斯技术公司 | Method and system for vehicle policy enforcement |
| US9134707B2 (en) * | 2012-03-30 | 2015-09-15 | Board Of Regents, The University Of Texas System | Optimal online adaptive controller |
| US20140142948A1 (en) | 2012-11-21 | 2014-05-22 | Somya Rathi | Systems and methods for in-vehicle context formation |
| US10133250B2 (en) * | 2014-06-20 | 2018-11-20 | Veritone Alpha, Inc. | Managing construction of decision modules to control target systems |
| US20160202670A1 (en) * | 2015-01-08 | 2016-07-14 | Northwestern University | System and method for sequential action control for nonlinear systems |
| US9511767B1 (en) * | 2015-07-01 | 2016-12-06 | Toyota Motor Engineering & Manufacturing North America, Inc. | Autonomous vehicle action planning using behavior prediction |
| US9916703B2 (en) * | 2015-11-04 | 2018-03-13 | Zoox, Inc. | Calibration for autonomous vehicle operation |
| US9568915B1 (en) * | 2016-02-11 | 2017-02-14 | Mitsubishi Electric Research Laboratories, Inc. | System and method for controlling autonomous or semi-autonomous vehicle |
| US10061316B2 (en) * | 2016-07-08 | 2018-08-28 | Toyota Motor Engineering & Manufacturing North America, Inc. | Control policy learning and vehicle control method based on reinforcement learning without active exploration |
- 2016
  - 2016-07-08: US application US15/205,558, patent US10065654B2 (en), not active, Expired - Fee Related
- 2017
  - 2017-07-05: JP application JP2017131700A, patent JP7036545B2 (ja), active
Similar Documents
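| Publication | Publication Date | Title |
|---|---|---|
| JP2018037064A5 | | |
| JP6824382B2 (ja) | | Training machine learning models on multiple machine learning tasks |
| JP7235813B2 (ja) | | Reinforcement learning with auxiliary tasks |
| CN113874865B (zh) | | Method and device for determining model parameters of a control strategy for a technical system |
| JP2019031268A5 | | |
| CN108885717B (zh) | | Asynchronous deep reinforcement learning |
| JP6728495B2 (ja) | | Environment prediction using reinforcement learning |
| JP6513015B2 (ja) | | Method for controlling the operation of a machine, and control system for iteratively controlling the operation of a machine |
| US11402808B2 (en) | | Configuring a system which interacts with an environment |
| JP6483667B2 (ja) | | Systems and methods for performing Bayesian optimization |
| WO2018224695A1 (en) | | Training action selection neural networks |
| WO2017091629A1 (en) | | Reinforcement learning using confidence scores |
| CN118246513A (zh) | | Training action selection neural networks |
| WO2018156891A1 (en) | | Training policy neural networks using path consistency learning |
| JP6901450B2 (ja) | | Machine learning device, control device, and machine learning method |
| CN108701251A (zh) | | Reinforcement learning using advantage estimates |
| JP6840363B2 (ja) | | Network learning device, action determination device, network learning method, and program |
| JP2021060988A5 | | |
| JP6955155B2 (ja) | | Learning device, learning method, and learning program |
| JP6718500B2 (ja) | | Optimization of output efficiency in a production system |
| JP2014041547A (ja) | | Time-series data analysis device, method, and program |
| JP2020035182A (ja) | | Control device and control method |
| JP2014160456A (ja) | | Sparse variable optimization device, sparse variable optimization method, and sparse variable optimization program |
| CN118752492A (zh) | | Motion control method for multi-task, multi-robot systems based on deep reinforcement learning |
| JP2020091611A (ja) | | Action determination program, action determination method, and action determination device |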