SG11202010172WA - Determining action selection policies of execution device - Google Patents
Determining action selection policies of execution deviceInfo
- Publication number
- SG11202010172WA SG11202010172WA SG11202010172WA SG11202010172WA SG11202010172WA SG 11202010172W A SG11202010172W A SG 11202010172WA SG 11202010172W A SG11202010172W A SG 11202010172WA SG 11202010172W A SG11202010172W A SG 11202010172WA SG 11202010172W A SG11202010172W A SG 11202010172WA
- Authority
- SG
- Singapore
- Prior art keywords
- execution device
- action selection
- selection policies
- determining action
- determining
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G07—CHECKING-DEVICES
- G07F—COIN-FREED OR LIKE APPARATUS
- G07F17/00—Coin-freed apparatus for hiring articles; Coin-freed facilities or services
- G07F17/32—Coin-freed apparatus for hiring articles; Coin-freed facilities or services for games, toys, sports, or amusements
- G07F17/3225—Data transfer within a gaming system, e.g. data sent between gaming machines and users
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/01—Probabilistic graphical models, e.g. probabilistic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/01—Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound
-
- G—PHYSICS
- G07—CHECKING-DEVICES
- G07F—COIN-FREED OR LIKE APPARATUS
- G07F17/00—Coin-freed apparatus for hiring articles; Coin-freed facilities or services
- G07F17/32—Coin-freed apparatus for hiring articles; Coin-freed facilities or services for games, toys, sports, or amusements
- G07F17/326—Game play aspects of gaming systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Mathematical Physics (AREA)
- General Engineering & Computer Science (AREA)
- Computational Mathematics (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Optimization (AREA)
- Mathematical Analysis (AREA)
- Algebra (AREA)
- Probability & Statistics with Applications (AREA)
- Computational Linguistics (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Debugging And Monitoring (AREA)
- Stored Programmes (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2019/124942 WO2020098822A2 (en) | 2019-12-12 | 2019-12-12 | Determining action selection policies of an execution device |
Publications (1)
Publication Number | Publication Date |
---|---|
SG11202010172WA true SG11202010172WA (en) | 2020-11-27 |
Family
ID=70733015
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
SG11202010172WA SG11202010172WA (en) | 2019-12-12 | 2019-12-12 | Determining action selection policies of execution device |
Country Status (5)
Country | Link |
---|---|
US (1) | US11144841B2 (zh) |
CN (1) | CN112997198B (zh) |
SG (1) | SG11202010172WA (zh) |
TW (1) | TWI763120B (zh) |
WO (1) | WO2020098822A2 (zh) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113779870A (zh) * | 2021-08-24 | 2021-12-10 | 清华大学 | 并行化不完美信息博弈策略生成方法、装置、电子设备以及存储介质 |
CN114580642B (zh) * | 2022-03-17 | 2023-04-07 | 中国科学院自动化研究所 | 构建博弈ai模型和数据处理的方法、装置、设备及介质 |
Family Cites Families (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8606608B2 (en) | 2010-12-17 | 2013-12-10 | Microsoft Corporation | Offline counterfactual analysis |
US9047423B2 (en) | 2012-01-12 | 2015-06-02 | International Business Machines Corporation | Monte-Carlo planning using contextual information |
US8545332B2 (en) * | 2012-02-02 | 2013-10-01 | International Business Machines Corporation | Optimal policy determination using repeated stackelberg games with unknown player preferences |
US8650110B2 (en) | 2012-03-12 | 2014-02-11 | Intuit Inc. | Counterfactual testing of finances using financial objects |
US20140039913A1 (en) | 2012-07-31 | 2014-02-06 | Tuomas W. Sandholm | Medical treatment planning via sequential games |
US10217528B2 (en) * | 2014-08-29 | 2019-02-26 | General Electric Company | Optimizing state transition set points for schedule risk management |
FR3044438A1 (fr) | 2015-11-27 | 2017-06-02 | Thales Sa | Systeme et procede d'aide a la decision |
US10678125B2 (en) | 2016-03-02 | 2020-06-09 | Shin-Etsu Chemical Co., Ltd. | Photomask blank and method for preparing photomask |
US10057367B2 (en) | 2016-03-02 | 2018-08-21 | Huawei Technologies Canada Co., Ltd. | Systems and methods for data caching in a communications network |
CN106296006A (zh) | 2016-08-10 | 2017-01-04 | 哈尔滨工业大学深圳研究生院 | 非完备信息博弈中风险与收益均衡的最少遗憾的评估方法 |
US10694526B2 (en) | 2016-09-30 | 2020-06-23 | Drexel University | Adaptive pursuit learning method to mitigate small-cell interference through directionality |
CN118117305A (zh) * | 2016-12-21 | 2024-05-31 | 英特尔公司 | 无线通信技术、装置和方法 |
US11138513B2 (en) | 2017-06-13 | 2021-10-05 | Princeton University | Dynamic learning system |
US11886988B2 (en) * | 2017-11-22 | 2024-01-30 | International Business Machines Corporation | Method for adaptive exploration to accelerate deep reinforcement learning |
CN108108822B (zh) | 2018-01-16 | 2020-06-26 | 中国科学技术大学 | 并行训练的异策略深度强化学习方法 |
US11480971B2 (en) * | 2018-05-01 | 2022-10-25 | Honda Motor Co., Ltd. | Systems and methods for generating instructions for navigating intersections with autonomous vehicles |
US11263531B2 (en) * | 2018-05-18 | 2022-03-01 | Deepmind Technologies Limited | Unsupervised control using learned rewards |
CN108985458A (zh) * | 2018-07-23 | 2018-12-11 | 东北大学 | 一种序贯同步博弈的双树蒙特卡洛搜索算法 |
WO2020147074A1 (en) | 2019-01-17 | 2020-07-23 | Alibaba Group Holding Limited | Sampling schemes for strategy searching in strategic interaction between parties |
CN110222874B (zh) | 2019-05-14 | 2021-06-04 | 清华大学 | 信息处理方法及装置、存储介质及计算设备 |
CN110404265B (zh) * | 2019-07-25 | 2022-11-01 | 哈尔滨工业大学(深圳) | 一种基于博弈残局在线解算的多人非完备信息机器博弈方法、装置、系统及存储介质 |
WO2020143847A2 (en) | 2020-04-02 | 2020-07-16 | Alipay (Hangzhou) Information Technology Co., Ltd. | Determining action selection policies of an execution device |
SG11202102364YA (en) | 2020-04-02 | 2021-04-29 | Alipay Hangzhou Inf Tech Co Ltd | Determining action selection policies of an execution device |
-
2019
- 2019-12-12 WO PCT/CN2019/124942 patent/WO2020098822A2/en active Application Filing
- 2019-12-12 SG SG11202010172WA patent/SG11202010172WA/en unknown
- 2019-12-12 CN CN201980028594.8A patent/CN112997198B/zh active Active
-
2020
- 2020-10-29 US US17/084,241 patent/US11144841B2/en active Active
- 2020-11-12 TW TW109139535A patent/TWI763120B/zh active
Also Published As
Publication number | Publication date |
---|---|
TWI763120B (zh) | 2022-05-01 |
WO2020098822A2 (en) | 2020-05-22 |
WO2020098822A3 (en) | 2020-10-22 |
CN112997198B (zh) | 2022-07-15 |
CN112997198A (zh) | 2021-06-18 |
US11144841B2 (en) | 2021-10-12 |
US20210182718A1 (en) | 2021-06-17 |
TW202125398A (zh) | 2021-07-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
SG11202103113XA (en) | Determining action selection policies of an execution device | |
SG11202001804QA (en) | Determining action selection policies of an execution device | |
MA52967A (fr) | Composés antagonistes du pcsk9 | |
MA52413A (fr) | Inhibiteurs de cd73 | |
EP3903448A4 (en) | PREDICTION OF USE OR COMPLIANCE | |
GB201903347D0 (en) | Execution unit | |
IL276813A (en) | Arginase inhibitors | |
SG11202002915SA (en) | Determining action selection policies of an execution device | |
DE112020001082A5 (de) | Löschvorrichtung | |
GB201903346D0 (en) | Execution unit | |
SG11202102364YA (en) | Determining action selection policies of an execution device | |
UA41933S (uk) | Графічний користувацький інтерфейс | |
GB2582143B (en) | Execution unit | |
GB201903348D0 (en) | Execution unit | |
SG11202010172WA (en) | Determining action selection policies of execution device | |
SG11202010204TA (en) | Determining action selection policies of an execution device | |
DK3746231T3 (da) | Sigteindretning | |
SG11202002890QA (en) | Determining action selection policies of an execution device | |
GB2607200B (en) | Mass Spectrometric Determination of Particular Tissue States | |
GB201810746D0 (en) | Use of sclerostin antagonist | |
SG11202010721QA (en) | Determining action selection policies of execution device | |
SG11202002910RA (en) | Determining action selection policies of an execution device | |
MA55016A (fr) | Utilisation du spiropidion | |
EP3870099C0 (en) | SCREWABLE ANCHOR | |
ES1240311Y (es) | Dispositivo cubrepaellas |