JP2022547529A5 - - Google Patents

Info

Publication number
JP2022547529A5
JP2022547529A5 JP2022515598A JP2022515598A JP2022547529A5 JP 2022547529 A5 JP2022547529 A5 JP 2022547529A5 JP 2022515598 A JP2022515598 A JP 2022515598A JP 2022515598 A JP2022515598 A JP 2022515598A JP 2022547529 A5 JP2022547529 A5 JP 2022547529A5
Authority
JP
Japan
Prior art keywords
state
feature
subsets
aforementioned
features
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2022515598A
Other languages
English (en)
Japanese (ja)
Other versions
JP7438336B2 (ja
JP2022547529A (ja
Filing date
Publication date
Priority claimed from US16/568,284 external-priority patent/US11574244B2/en
Application filed filed Critical
Publication of JP2022547529A publication Critical patent/JP2022547529A/ja
Publication of JP2022547529A5 publication Critical patent/JP2022547529A5/ja
Application granted granted Critical
Publication of JP7438336B2 publication Critical patent/JP7438336B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2022515598A 2019-09-12 2020-08-11 強化学習モデルのための状態シミュレータ Active JP7438336B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16/568,284 2019-09-12
US16/568,284 US11574244B2 (en) 2019-09-12 2019-09-12 States simulator for reinforcement learning models
PCT/EP2020/072487 WO2021047842A1 (en) 2019-09-12 2020-08-11 States simulator for reinforcement learning models

Publications (3)

Publication Number Publication Date
JP2022547529A JP2022547529A (ja) 2022-11-14
JP2022547529A5 true JP2022547529A5 (https=) 2022-12-13
JP7438336B2 JP7438336B2 (ja) 2024-02-26

Family

ID=72050874

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2022515598A Active JP7438336B2 (ja) 2019-09-12 2020-08-11 強化学習モデルのための状態シミュレータ

Country Status (5)

Country Link
US (1) US11574244B2 (https=)
EP (1) EP4028959A1 (https=)
JP (1) JP7438336B2 (https=)
CN (1) CN114365157A (https=)
WO (1) WO2021047842A1 (https=)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102338304B1 (ko) * 2020-10-20 2021-12-13 주식회사 뉴로코어 강화 학습을 이용한 공장 시뮬레이터 기반 스케줄링 시스템
CN115617796A (zh) * 2022-10-12 2023-01-17 中电智元数据科技有限公司 一种分布式数据库索引选择方法
CN118837737B (zh) * 2024-06-27 2025-09-09 西安交通大学 一种水下推进电机故障诊断方法、装置、设备及存储介质

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8918866B2 (en) * 2009-06-29 2014-12-23 International Business Machines Corporation Adaptive rule loading and session control for securing network delivered services
JP2013242761A (ja) * 2012-05-22 2013-12-05 Internatl Business Mach Corp <Ibm> マルコフ決定過程システム環境下における方策パラメータを更新するための方法、並びに、その制御器及び制御プログラム
US9128739B1 (en) * 2012-12-31 2015-09-08 Emc Corporation Determining instances to maintain on at least one cloud responsive to an evaluation of performance characteristics
US20160260024A1 (en) * 2015-03-04 2016-09-08 Qualcomm Incorporated System of distributed planning
US10540598B2 (en) 2015-09-09 2020-01-21 International Business Machines Corporation Interpolation of transition probability values in Markov decision processes
CN108701252B (zh) 2015-11-12 2024-02-02 渊慧科技有限公司 使用优先化经验存储器训练神经网络
US10839302B2 (en) 2015-11-24 2020-11-17 The Research Foundation For The State University Of New York Approximate value iteration with complex returns by bounding
CN108230057A (zh) * 2016-12-09 2018-06-29 阿里巴巴集团控股有限公司 一种智能推荐方法及系统
US20180342004A1 (en) * 2017-05-25 2018-11-29 Microsoft Technology Licensing, Llc Cumulative success-based recommendations for repeat users
WO2020005240A1 (en) * 2018-06-27 2020-01-02 Google Llc Adapting a sequence model for use in predicting future device interactions with a computing system
US10963313B2 (en) * 2018-08-27 2021-03-30 Vmware, Inc. Automated reinforcement-learning-based application manager that learns and improves a reward function
US11468322B2 (en) * 2018-12-04 2022-10-11 Rutgers, The State University Of New Jersey Method for selecting and presenting examples to explain decisions of algorithms
EP3776347B1 (en) * 2019-06-17 2025-07-02 Google LLC Vehicle occupant engagement using three-dimensional eye gaze vectors

Similar Documents

Publication Publication Date Title
JP2022547529A5 (https=)
US11907821B2 (en) Population-based training of machine learning models
JP6382354B2 (ja) ニューラルネットワーク及びニューラルネットワークのトレーニング方法
JP2023520420A5 (https=)
JP2021524099A5 (https=)
JP2023538923A5 (https=)
JP2022518646A5 (https=)
EP3475891A1 (en) Targeting content to underperforming users in clusters
JP2017518588A5 (https=)
JP2019192246A (ja) 自然言語質問回答システム用のトレーニングデータを提供する方法および装置
Jansen Using Knowledge about the Opponent in Game-Tree Search.
JP2019164753A5 (https=)
JP2023512665A5 (https=)
CN116521850A (zh) 一种基于强化学习的交互方法及装置
JP2023098647A5 (https=)
JP2023040035A5 (https=)
CN119917069B (zh) 一种基于ai的软件需求智能分析与预测方法及系统
CN113657330A (zh) 一种字体书写笔顺生成方法、系统及其应用方法
CN112685318A (zh) 产生测试脚本的方法和系统
JP2025003721A5 (https=)
Zhang et al. Core: Mitigating catastrophic forgetting in continual learning through cognitive replay
Sridharan et al. Multimodal learning analytics for students behavior prediction using multi-scale dilated deep temporal convolution network with improved chameleon Swarm algorithm
CN112884129B (zh) 一种基于示教数据的多步规则提取方法、设备及存储介质
CN119358652B (zh) 基于知识图谱的学习路径分析方法、装置、设备及介质
JPWO2023181219A5 (ja) 分析装置、分析方法及びプログラム