JP7579632B2 - 推定装置、システム及び方法 - Google Patents
推定装置、システム及び方法 Download PDFInfo
- Publication number
- JP7579632B2 JP7579632B2 JP2019209036A JP2019209036A JP7579632B2 JP 7579632 B2 JP7579632 B2 JP 7579632B2 JP 2019209036 A JP2019209036 A JP 2019209036A JP 2019209036 A JP2019209036 A JP 2019209036A JP 7579632 B2 JP7579632 B2 JP 7579632B2
- Authority
- JP
- Japan
- Prior art keywords
- information
- state
- dynamics
- dynamics parameters
- behavior
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims description 62
- 230000009471 action Effects 0.000 claims description 66
- 238000009826 distribution Methods 0.000 claims description 53
- 230000006399 behavior Effects 0.000 claims description 46
- 230000015654 memory Effects 0.000 claims description 35
- 238000004088 simulation Methods 0.000 claims description 29
- 230000002787 reinforcement Effects 0.000 claims description 17
- 238000012549 training Methods 0.000 description 97
- 230000008569 process Effects 0.000 description 38
- 238000003860 storage Methods 0.000 description 21
- 238000012545 processing Methods 0.000 description 20
- 230000006870 function Effects 0.000 description 15
- 230000014509 gene expression Effects 0.000 description 15
- 238000004364 calculation method Methods 0.000 description 14
- 230000007704 transition Effects 0.000 description 14
- 230000001143 conditioned effect Effects 0.000 description 8
- 239000013598 vector Substances 0.000 description 8
- 230000008878 coupling Effects 0.000 description 7
- 238000010168 coupling process Methods 0.000 description 7
- 238000005859 coupling reaction Methods 0.000 description 7
- 238000012360 testing method Methods 0.000 description 7
- 230000008859 change Effects 0.000 description 6
- 238000004891 communication Methods 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 238000013528 artificial neural network Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 5
- 230000010365 information processing Effects 0.000 description 4
- 238000010801 machine learning Methods 0.000 description 4
- 230000000052 comparative effect Effects 0.000 description 3
- 230000000875 corresponding effect Effects 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 238000005457 optimization Methods 0.000 description 3
- 239000004065 semiconductor Substances 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 238000007792 addition Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 238000004422 calculation algorithm Methods 0.000 description 2
- 238000013527 convolutional neural network Methods 0.000 description 2
- 238000005401 electroluminescence Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 230000001052 transient effect Effects 0.000 description 2
- 102100023774 Cold-inducible RNA-binding protein Human genes 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000906744 Homo sapiens Cold-inducible RNA-binding protein Proteins 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000005484 gravity Effects 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000003062 neural network model Methods 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000001902 propagating effect Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Images
Landscapes
- Feedback Control In General (AREA)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2019209036A JP7579632B2 (ja) | 2019-11-19 | 2019-11-19 | 推定装置、システム及び方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2019209036A JP7579632B2 (ja) | 2019-11-19 | 2019-11-19 | 推定装置、システム及び方法 |
Publications (3)
Publication Number | Publication Date |
---|---|
JP2021082014A JP2021082014A (ja) | 2021-05-27 |
JP2021082014A5 JP2021082014A5 (enrdf_load_stackoverflow) | 2022-11-29 |
JP7579632B2 true JP7579632B2 (ja) | 2024-11-08 |
Family
ID=75966346
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2019209036A Active JP7579632B2 (ja) | 2019-11-19 | 2019-11-19 | 推定装置、システム及び方法 |
Country Status (1)
Country | Link |
---|---|
JP (1) | JP7579632B2 (enrdf_load_stackoverflow) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US12367509B2 (en) * | 2022-10-14 | 2025-07-22 | Shipt, Inc. | Markup optimization |
CN116038716B (zh) * | 2023-03-14 | 2023-07-18 | 煤炭科学研究总院有限公司 | 机器人的控制方法和机器人的控制模型的训练方法 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2007052589A (ja) | 2005-08-17 | 2007-03-01 | Advanced Telecommunication Research Institute International | エージェント学習装置、エージェント学習方法及びエージェント学習プログラム |
JP2019079227A (ja) | 2017-10-24 | 2019-05-23 | 日本電信電話株式会社 | 状態遷移規則獲得装置、行動選択学習装置、行動選択装置、状態遷移規則獲得方法、行動選択方法、およびプログラム |
JP2019165602A (ja) | 2018-03-20 | 2019-09-26 | 株式会社Lixil | 制御システム、学習装置、制御装置、及び制御方法 |
JP2019191821A (ja) | 2018-04-23 | 2019-10-31 | 株式会社Preferred Networks | モーション処理装置、モーション処理方法、およびプログラム |
-
2019
- 2019-11-19 JP JP2019209036A patent/JP7579632B2/ja active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2007052589A (ja) | 2005-08-17 | 2007-03-01 | Advanced Telecommunication Research Institute International | エージェント学習装置、エージェント学習方法及びエージェント学習プログラム |
JP2019079227A (ja) | 2017-10-24 | 2019-05-23 | 日本電信電話株式会社 | 状態遷移規則獲得装置、行動選択学習装置、行動選択装置、状態遷移規則獲得方法、行動選択方法、およびプログラム |
JP2019165602A (ja) | 2018-03-20 | 2019-09-26 | 株式会社Lixil | 制御システム、学習装置、制御装置、及び制御方法 |
JP2019191821A (ja) | 2018-04-23 | 2019-10-31 | 株式会社Preferred Networks | モーション処理装置、モーション処理方法、およびプログラム |
Non-Patent Citations (2)
Title |
---|
久保田 展行,深層学習の活用を加速するIoT向けソフトウエア基盤,日経エレクトロニクス,日本,日経BP社,2017年04月20日,第1179号,pp.59-69,ISSN:0385-1680 |
菱沼 徹 ほか,ロバスト方策を用いた探索木によるベイジアン強化学習アプローチ,一般社団法人 人工知能学会 第31回全国大会論文集DVD [DVD-ROM],日本,一般社団法人 人工知能学会 ,2017年05月23日,4C2-3,pp.1-4 |
Also Published As
Publication number | Publication date |
---|---|
JP2021082014A (ja) | 2021-05-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6824382B2 (ja) | 複数の機械学習タスクに関する機械学習モデルのトレーニング | |
CN110168578B (zh) | 具有任务特定路径的多任务神经网络 | |
US10603790B2 (en) | Workpiece picking device and workpiece picking method for improving picking operation of workpieces | |
Kessler et al. | Same state, different task: Continual reinforcement learning without interference | |
JP7050740B2 (ja) | 物体を把持するための奥行知覚モデリング | |
CN112119406A (zh) | 利用快速更新循环神经网络和慢速更新循环神经网络的深度强化学习 | |
JP6513015B2 (ja) | 機械の動作を制御する方法、および機械の動作を反復的に制御する制御システム | |
US20200349473A1 (en) | Method for generating universal learned model | |
CN115812180A (zh) | 使用奖励预测模型的机器人控制的离线学习 | |
CN115427968A (zh) | 边缘计算设备中的鲁棒人工智能推理 | |
CN116776964A (zh) | 用于分布式强化学习的方法、程序产品和存储介质 | |
CN110023965A (zh) | 用于选择由机器人智能体执行的动作的神经网络 | |
CN112119404A (zh) | 样本高效的强化学习 | |
US10860895B2 (en) | Imagination-based agent neural networks | |
JP6718500B2 (ja) | 生産システムにおける出力効率の最適化 | |
CN114529010B (zh) | 一种机器人自主学习方法、装置、设备及存储介质 | |
CN112016611B (zh) | 生成器网络和策略生成网络的训练方法、装置和电子设备 | |
US20220343216A1 (en) | Information processing apparatus and information processing method | |
JP7579632B2 (ja) | 推定装置、システム及び方法 | |
JP7340055B2 (ja) | 強化学習ポリシを訓練する方法 | |
KR20200084010A (ko) | 목표 시스템에 대한 제어 시스템 생성 | |
US20200134498A1 (en) | Dynamic boltzmann machine for predicting general distributions of time series datasets | |
KR20190141581A (ko) | 데이터 예측을 위한 인공신경망을 학습하는 방법 및 장치 | |
JP7179672B2 (ja) | 計算機システム及び機械学習方法 | |
WO2025074369A1 (en) | System and method for efficient collaborative marl training using tensor networks |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20221118 |
|
A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20221118 |
|
A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20231031 |
|
A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20231110 |
|
A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20240109 |
|
A02 | Decision of refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A02 Effective date: 20240329 |
|
A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20240624 |
|
A911 | Transfer to examiner for re-examination before appeal (zenchi) |
Free format text: JAPANESE INTERMEDIATE CODE: A911 Effective date: 20240704 |
|
TRDD | Decision of grant or rejection written | ||
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20241004 |
|
A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20241028 |
|
R150 | Certificate of patent or registration of utility model |
Ref document number: 7579632 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |