SG11202010172WA - Determining action selection policies of execution device - Google Patents

Determining action selection policies of execution device

Info

Publication number
SG11202010172WA
SG11202010172WA SG11202010172WA SG11202010172WA SG11202010172WA SG 11202010172W A SG11202010172W A SG 11202010172WA SG 11202010172W A SG11202010172W A SG 11202010172WA SG 11202010172W A SG11202010172W A SG 11202010172WA SG 11202010172W A SG11202010172W A SG 11202010172WA
Authority
SG
Singapore
Prior art keywords
execution device
action selection
selection policies
determining action
determining
Prior art date
Application number
SG11202010172WA
Other languages
English (en)
Inventor
Hui Li
Le Song
Original Assignee
Alipay Hangzhou Inf Tech Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alipay Hangzhou Inf Tech Co Ltd filed Critical Alipay Hangzhou Inf Tech Co Ltd
Publication of SG11202010172WA publication Critical patent/SG11202010172WA/en

Links

Classifications

    • GPHYSICS
    • G07CHECKING-DEVICES
    • G07FCOIN-FREED OR LIKE APPARATUS
    • G07F17/00Coin-freed apparatus for hiring articles; Coin-freed facilities or services
    • G07F17/32Coin-freed apparatus for hiring articles; Coin-freed facilities or services for games, toys, sports, or amusements
    • G07F17/3225Data transfer within a gaming system, e.g. data sent between gaming machines and users
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/01Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound
    • GPHYSICS
    • G07CHECKING-DEVICES
    • G07FCOIN-FREED OR LIKE APPARATUS
    • G07F17/00Coin-freed apparatus for hiring articles; Coin-freed facilities or services
    • G07F17/32Coin-freed apparatus for hiring articles; Coin-freed facilities or services for games, toys, sports, or amusements
    • G07F17/326Game play aspects of gaming systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Mathematical Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Mathematics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Analysis (AREA)
  • Algebra (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Debugging And Monitoring (AREA)
  • Stored Programmes (AREA)
SG11202010172WA 2019-12-12 2019-12-12 Determining action selection policies of execution device SG11202010172WA (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2019/124942 WO2020098822A2 (en) 2019-12-12 2019-12-12 Determining action selection policies of an execution device

Publications (1)

Publication Number Publication Date
SG11202010172WA true SG11202010172WA (en) 2020-11-27

Family

ID=70733015

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11202010172WA SG11202010172WA (en) 2019-12-12 2019-12-12 Determining action selection policies of execution device

Country Status (5)

Country Link
US (1) US11144841B2 (zh)
CN (1) CN112997198B (zh)
SG (1) SG11202010172WA (zh)
TW (1) TWI763120B (zh)
WO (1) WO2020098822A2 (zh)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113779870A (zh) * 2021-08-24 2021-12-10 清华大学 并行化不完美信息博弈策略生成方法、装置、电子设备以及存储介质
CN114580642B (zh) * 2022-03-17 2023-04-07 中国科学院自动化研究所 构建博弈ai模型和数据处理的方法、装置、设备及介质

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8606608B2 (en) 2010-12-17 2013-12-10 Microsoft Corporation Offline counterfactual analysis
US9047423B2 (en) 2012-01-12 2015-06-02 International Business Machines Corporation Monte-Carlo planning using contextual information
US8545332B2 (en) * 2012-02-02 2013-10-01 International Business Machines Corporation Optimal policy determination using repeated stackelberg games with unknown player preferences
US8650110B2 (en) 2012-03-12 2014-02-11 Intuit Inc. Counterfactual testing of finances using financial objects
US20140039913A1 (en) 2012-07-31 2014-02-06 Tuomas W. Sandholm Medical treatment planning via sequential games
US10217528B2 (en) * 2014-08-29 2019-02-26 General Electric Company Optimizing state transition set points for schedule risk management
FR3044438A1 (fr) 2015-11-27 2017-06-02 Thales Sa Systeme et procede d'aide a la decision
US10678125B2 (en) 2016-03-02 2020-06-09 Shin-Etsu Chemical Co., Ltd. Photomask blank and method for preparing photomask
US10057367B2 (en) 2016-03-02 2018-08-21 Huawei Technologies Canada Co., Ltd. Systems and methods for data caching in a communications network
CN106296006A (zh) 2016-08-10 2017-01-04 哈尔滨工业大学深圳研究生院 非完备信息博弈中风险与收益均衡的最少遗憾的评估方法
US10694526B2 (en) 2016-09-30 2020-06-23 Drexel University Adaptive pursuit learning method to mitigate small-cell interference through directionality
CN118117305A (zh) * 2016-12-21 2024-05-31 英特尔公司 无线通信技术、装置和方法
US11138513B2 (en) 2017-06-13 2021-10-05 Princeton University Dynamic learning system
US11886988B2 (en) * 2017-11-22 2024-01-30 International Business Machines Corporation Method for adaptive exploration to accelerate deep reinforcement learning
CN108108822B (zh) 2018-01-16 2020-06-26 中国科学技术大学 并行训练的异策略深度强化学习方法
US11480971B2 (en) * 2018-05-01 2022-10-25 Honda Motor Co., Ltd. Systems and methods for generating instructions for navigating intersections with autonomous vehicles
US11263531B2 (en) * 2018-05-18 2022-03-01 Deepmind Technologies Limited Unsupervised control using learned rewards
CN108985458A (zh) * 2018-07-23 2018-12-11 东北大学 一种序贯同步博弈的双树蒙特卡洛搜索算法
WO2020147074A1 (en) 2019-01-17 2020-07-23 Alibaba Group Holding Limited Sampling schemes for strategy searching in strategic interaction between parties
CN110222874B (zh) 2019-05-14 2021-06-04 清华大学 信息处理方法及装置、存储介质及计算设备
CN110404265B (zh) * 2019-07-25 2022-11-01 哈尔滨工业大学(深圳) 一种基于博弈残局在线解算的多人非完备信息机器博弈方法、装置、系统及存储介质
WO2020143847A2 (en) 2020-04-02 2020-07-16 Alipay (Hangzhou) Information Technology Co., Ltd. Determining action selection policies of an execution device
SG11202102364YA (en) 2020-04-02 2021-04-29 Alipay Hangzhou Inf Tech Co Ltd Determining action selection policies of an execution device

Also Published As

Publication number Publication date
TWI763120B (zh) 2022-05-01
WO2020098822A2 (en) 2020-05-22
WO2020098822A3 (en) 2020-10-22
CN112997198B (zh) 2022-07-15
CN112997198A (zh) 2021-06-18
US11144841B2 (en) 2021-10-12
US20210182718A1 (en) 2021-06-17
TW202125398A (zh) 2021-07-01

Similar Documents

Publication Publication Date Title
SG11202103113XA (en) Determining action selection policies of an execution device
SG11202001804QA (en) Determining action selection policies of an execution device
MA52967A (fr) Composés antagonistes du pcsk9
MA52413A (fr) Inhibiteurs de cd73
EP3903448A4 (en) PREDICTION OF USE OR COMPLIANCE
GB201903347D0 (en) Execution unit
IL276813A (en) Arginase inhibitors
SG11202002915SA (en) Determining action selection policies of an execution device
DE112020001082A5 (de) Löschvorrichtung
GB201903346D0 (en) Execution unit
SG11202102364YA (en) Determining action selection policies of an execution device
UA41933S (uk) Графічний користувацький інтерфейс
GB2582143B (en) Execution unit
GB201903348D0 (en) Execution unit
SG11202010172WA (en) Determining action selection policies of execution device
SG11202010204TA (en) Determining action selection policies of an execution device
DK3746231T3 (da) Sigteindretning
SG11202002890QA (en) Determining action selection policies of an execution device
GB2607200B (en) Mass Spectrometric Determination of Particular Tissue States
GB201810746D0 (en) Use of sclerostin antagonist
SG11202010721QA (en) Determining action selection policies of execution device
SG11202002910RA (en) Determining action selection policies of an execution device
MA55016A (fr) Utilisation du spiropidion
EP3870099C0 (en) SCREWABLE ANCHOR
ES1240311Y (es) Dispositivo cubrepaellas