JP7579632B2 - 推定装置、システム及び方法 - Google Patents

推定装置、システム及び方法 Download PDF

Info

Publication number
JP7579632B2
JP7579632B2 JP2019209036A JP2019209036A JP7579632B2 JP 7579632 B2 JP7579632 B2 JP 7579632B2 JP 2019209036 A JP2019209036 A JP 2019209036A JP 2019209036 A JP2019209036 A JP 2019209036A JP 7579632 B2 JP7579632 B2 JP 7579632B2
Authority
JP
Japan
Prior art keywords
information
state
dynamics
dynamics parameters
behavior
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2019209036A
Other languages
English (en)
Japanese (ja)
Other versions
JP2021082014A (ja
JP2021082014A5 (enrdf_load_stackoverflow
Inventor
新一 前田
バラドワジ ホマンガ
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Preferred Networks Inc
Original Assignee
Preferred Networks Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Preferred Networks Inc filed Critical Preferred Networks Inc
Priority to JP2019209036A priority Critical patent/JP7579632B2/ja
Publication of JP2021082014A publication Critical patent/JP2021082014A/ja
Publication of JP2021082014A5 publication Critical patent/JP2021082014A5/ja
Application granted granted Critical
Publication of JP7579632B2 publication Critical patent/JP7579632B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Feedback Control In General (AREA)
JP2019209036A 2019-11-19 2019-11-19 推定装置、システム及び方法 Active JP7579632B2 (ja)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2019209036A JP7579632B2 (ja) 2019-11-19 2019-11-19 推定装置、システム及び方法

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2019209036A JP7579632B2 (ja) 2019-11-19 2019-11-19 推定装置、システム及び方法

Publications (3)

Publication Number Publication Date
JP2021082014A JP2021082014A (ja) 2021-05-27
JP2021082014A5 JP2021082014A5 (enrdf_load_stackoverflow) 2022-11-29
JP7579632B2 true JP7579632B2 (ja) 2024-11-08

Family

ID=75966346

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2019209036A Active JP7579632B2 (ja) 2019-11-19 2019-11-19 推定装置、システム及び方法

Country Status (1)

Country Link
JP (1) JP7579632B2 (enrdf_load_stackoverflow)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12367509B2 (en) * 2022-10-14 2025-07-22 Shipt, Inc. Markup optimization
CN116038716B (zh) * 2023-03-14 2023-07-18 煤炭科学研究总院有限公司 机器人的控制方法和机器人的控制模型的训练方法

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007052589A (ja) 2005-08-17 2007-03-01 Advanced Telecommunication Research Institute International エージェント学習装置、エージェント学習方法及びエージェント学習プログラム
JP2019079227A (ja) 2017-10-24 2019-05-23 日本電信電話株式会社 状態遷移規則獲得装置、行動選択学習装置、行動選択装置、状態遷移規則獲得方法、行動選択方法、およびプログラム
JP2019165602A (ja) 2018-03-20 2019-09-26 株式会社Lixil 制御システム、学習装置、制御装置、及び制御方法
JP2019191821A (ja) 2018-04-23 2019-10-31 株式会社Preferred Networks モーション処理装置、モーション処理方法、およびプログラム

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007052589A (ja) 2005-08-17 2007-03-01 Advanced Telecommunication Research Institute International エージェント学習装置、エージェント学習方法及びエージェント学習プログラム
JP2019079227A (ja) 2017-10-24 2019-05-23 日本電信電話株式会社 状態遷移規則獲得装置、行動選択学習装置、行動選択装置、状態遷移規則獲得方法、行動選択方法、およびプログラム
JP2019165602A (ja) 2018-03-20 2019-09-26 株式会社Lixil 制御システム、学習装置、制御装置、及び制御方法
JP2019191821A (ja) 2018-04-23 2019-10-31 株式会社Preferred Networks モーション処理装置、モーション処理方法、およびプログラム

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
久保田 展行,深層学習の活用を加速するIoT向けソフトウエア基盤,日経エレクトロニクス,日本,日経BP社,2017年04月20日,第1179号,pp.59-69,ISSN:0385-1680
菱沼 徹 ほか,ロバスト方策を用いた探索木によるベイジアン強化学習アプローチ,一般社団法人 人工知能学会 第31回全国大会論文集DVD [DVD-ROM],日本,一般社団法人 人工知能学会 ,2017年05月23日,4C2-3,pp.1-4

Also Published As

Publication number Publication date
JP2021082014A (ja) 2021-05-27

Similar Documents

Publication Publication Date Title
JP6824382B2 (ja) 複数の機械学習タスクに関する機械学習モデルのトレーニング
CN110168578B (zh) 具有任务特定路径的多任务神经网络
US10603790B2 (en) Workpiece picking device and workpiece picking method for improving picking operation of workpieces
Kessler et al. Same state, different task: Continual reinforcement learning without interference
JP7050740B2 (ja) 物体を把持するための奥行知覚モデリング
CN112119406A (zh) 利用快速更新循环神经网络和慢速更新循环神经网络的深度强化学习
JP6513015B2 (ja) 機械の動作を制御する方法、および機械の動作を反復的に制御する制御システム
US20200349473A1 (en) Method for generating universal learned model
CN115812180A (zh) 使用奖励预测模型的机器人控制的离线学习
CN115427968A (zh) 边缘计算设备中的鲁棒人工智能推理
CN116776964A (zh) 用于分布式强化学习的方法、程序产品和存储介质
CN110023965A (zh) 用于选择由机器人智能体执行的动作的神经网络
CN112119404A (zh) 样本高效的强化学习
US10860895B2 (en) Imagination-based agent neural networks
JP6718500B2 (ja) 生産システムにおける出力効率の最適化
CN114529010B (zh) 一种机器人自主学习方法、装置、设备及存储介质
CN112016611B (zh) 生成器网络和策略生成网络的训练方法、装置和电子设备
US20220343216A1 (en) Information processing apparatus and information processing method
JP7579632B2 (ja) 推定装置、システム及び方法
JP7340055B2 (ja) 強化学習ポリシを訓練する方法
KR20200084010A (ko) 목표 시스템에 대한 제어 시스템 생성
US20200134498A1 (en) Dynamic boltzmann machine for predicting general distributions of time series datasets
KR20190141581A (ko) 데이터 예측을 위한 인공신경망을 학습하는 방법 및 장치
JP7179672B2 (ja) 計算機システム及び機械学習方法
WO2025074369A1 (en) System and method for efficient collaborative marl training using tensor networks

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20221118

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20221118

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20231031

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20231110

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20240109

A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 20240329

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20240624

A911 Transfer to examiner for re-examination before appeal (zenchi)

Free format text: JAPANESE INTERMEDIATE CODE: A911

Effective date: 20240704

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20241004

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20241028

R150 Certificate of patent or registration of utility model

Ref document number: 7579632

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150