CN103198358B - 信息处理设备、信息处理方法和程序 - Google Patents

信息处理设备、信息处理方法和程序 Download PDF

Info

Publication number
CN103198358B
CN103198358B CN201210366351.5A CN201210366351A CN103198358B CN 103198358 B CN103198358 B CN 103198358B CN 201210366351 A CN201210366351 A CN 201210366351A CN 103198358 B CN103198358 B CN 103198358B
Authority
CN
China
Prior art keywords
action
data
reward
information processing
estimator
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201210366351.5A
Other languages
English (en)
Chinese (zh)
Other versions
CN103198358A (zh
Inventor
小林由幸
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Publication of CN103198358A publication Critical patent/CN103198358A/zh
Application granted granted Critical
Publication of CN103198358B publication Critical patent/CN103198358B/zh
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/12Computing arrangements based on biological models using genetic models
    • G06N3/126Evolutionary algorithms, e.g. genetic algorithms or genetic programming

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Genetics & Genomics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Physiology (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
CN201210366351.5A 2011-10-12 2012-09-28 信息处理设备、信息处理方法和程序 Expired - Fee Related CN103198358B (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2011-224639 2011-10-12
JP2011224639A JP5879899B2 (ja) 2011-10-12 2011-10-12 情報処理装置、情報処理方法、及びプログラム

Publications (2)

Publication Number Publication Date
CN103198358A CN103198358A (zh) 2013-07-10
CN103198358B true CN103198358B (zh) 2017-05-24

Family

ID=48086658

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210366351.5A Expired - Fee Related CN103198358B (zh) 2011-10-12 2012-09-28 信息处理设备、信息处理方法和程序

Country Status (3)

Country Link
US (1) US9165249B2 (enExample)
JP (1) JP5879899B2 (enExample)
CN (1) CN103198358B (enExample)

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9852419B2 (en) * 2012-09-17 2017-12-26 Capital One Financial Corporation Systems and methods for providing near field communications
US20140164220A1 (en) * 2012-12-06 2014-06-12 Microsoft Corporation Payment instrument selection
JP6516406B2 (ja) * 2013-12-13 2019-05-22 インターナショナル・ビジネス・マシーンズ・コーポレーションInternational Business Machines Corporation 処理装置、処理方法、およびプログラム
KR20150107418A (ko) * 2014-03-14 2015-09-23 삼성전자주식회사 전자 지갑을 활용한 결제 방법, 장치 그리고 시스템
JP6465106B2 (ja) * 2014-03-31 2019-02-06 日本電気株式会社 教師データ生成装置、電気機器監視システム、教師データ生成方法及びプログラム
EP3326114B1 (en) 2015-07-24 2024-09-04 DeepMind Technologies Limited Continuous control with deep reinforcement learning
JP6630522B2 (ja) * 2015-09-16 2020-01-15 株式会社バンダイナムコエンターテインメント ゲーム装置及びプログラム
JP2018049563A (ja) * 2016-09-23 2018-03-29 カシオ計算機株式会社 電子機器、サーバ、対価設定方法及びプログラム
JP7031603B2 (ja) * 2016-11-29 2022-03-08 ソニーグループ株式会社 情報処理装置及び情報処理方法
JPWO2018150654A1 (ja) * 2017-02-15 2019-12-12 ソニー株式会社 情報処理装置、および情報処理方法、並びにプログラム
WO2018156891A1 (en) * 2017-02-24 2018-08-30 Google Llc Training policy neural networks using path consistency learning
JP6795090B2 (ja) 2017-04-28 2020-12-02 富士通株式会社 行動選択学習装置、行動選択学習プログラム、行動選択学習方法及び行動選択学習システム
EP3596662B1 (en) * 2017-05-19 2025-08-27 DeepMind Technologies Limited Imagination-based agent neural networks
JP6612306B2 (ja) * 2017-11-21 2019-11-27 株式会社 ディー・エヌ・エー 情報処理装置及び情報処理プログラム
JP2019118461A (ja) * 2017-12-28 2019-07-22 株式会社 ディー・エヌ・エー 情報処理装置及び情報処理プログラム
US20210042584A1 (en) * 2018-01-30 2021-02-11 Nec Corporation Information processing apparatus, control method, and non-transitory storage medium
US20210125039A1 (en) * 2018-06-11 2021-04-29 Nec Solution Innovators, Ltd. Action learning device, action learning method, action learning system, program, and storage medium
JP7048455B2 (ja) * 2018-08-30 2022-04-05 本田技研工業株式会社 学習装置、シミュレーションシステム、学習方法、およびプログラム
JP7199203B2 (ja) * 2018-11-16 2023-01-05 株式会社Cygames ゲームプログラムを検査するためのシステム、方法、プログラム、機械学習支援装置、及びデータ構造
US11928556B2 (en) * 2018-12-29 2024-03-12 International Business Machines Corporation Removing unnecessary history from reinforcement learning state
KR102209917B1 (ko) * 2018-12-31 2021-01-29 아주대학교산학협력단 심층 강화 학습을 위한 데이터 처리 장치 및 방법
CN110327624B (zh) * 2019-07-03 2023-03-17 广州多益网络股份有限公司 一种基于课程强化学习的游戏跟随方法和系统
JP7335739B2 (ja) * 2019-07-16 2023-08-30 株式会社 ディー・エヌ・エー ゲームを提供するためのシステム、方法、及びプログラム
EP4014165A1 (en) * 2019-09-13 2022-06-22 DeepMind Technologies Limited Data-driven robot control
JP6812583B1 (ja) * 2020-02-28 2021-01-13 株式会社Cygames ゲームスクリプトの作成を支援するためのシステム及び方法
CN113877202B (zh) * 2020-07-01 2025-04-29 中移(苏州)软件技术有限公司 一种游戏控制方法及装置、存储介质

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1627251A (zh) * 2003-12-09 2005-06-15 微软公司 使用图形处理单元加速并优化机器学习技术的处理

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7403904B2 (en) * 2002-07-19 2008-07-22 International Business Machines Corporation System and method for sequential decision making for customer relationship management
US7837543B2 (en) * 2004-04-30 2010-11-23 Microsoft Corporation Reward-driven adaptive agents for video games
US7668632B2 (en) * 2004-11-22 2010-02-23 The Boeing Company System, method and computer program product for real-time event identification and course of action interpretation
JP4525477B2 (ja) * 2005-02-23 2010-08-18 ソニー株式会社 学習制御装置および学習制御方法、並びに、プログラム
US20070203871A1 (en) * 2006-01-23 2007-08-30 Tesauro Gerald J Method and apparatus for reward-based learning of improved systems management policies
JP4392620B2 (ja) 2007-08-14 2010-01-06 ソニー株式会社 情報処理装置、情報処理方法、演算装置、演算方法、プログラム、および記録媒体
JP4803212B2 (ja) * 2008-05-28 2011-10-26 ソニー株式会社 データ処理装置、データ処理方法、及びプログラム
JP5440840B2 (ja) * 2009-06-11 2014-03-12 ソニー株式会社 情報処理装置、情報処理方法、及び、プログラム

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1627251A (zh) * 2003-12-09 2005-06-15 微软公司 使用图形处理单元加速并优化机器学习技术的处理

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"active learning for reward estimation in inverse reinforcement learning";Manuel Lopes等;《active learning for reward estimation in inverse reinforcement learning》;20091231;第33-40页 *

Also Published As

Publication number Publication date
CN103198358A (zh) 2013-07-10
US20130097107A1 (en) 2013-04-18
JP5879899B2 (ja) 2016-03-08
JP2013081683A (ja) 2013-05-09
US9165249B2 (en) 2015-10-20

Similar Documents

Publication Publication Date Title
CN103198358B (zh) 信息处理设备、信息处理方法和程序
JP5874292B2 (ja) 情報処理装置、情報処理方法、及びプログラム
JP7399277B2 (ja) 情報処理方法、装置、コンピュータプログラム及び電子装置
US12268963B2 (en) Game character behavior control method and apparatus, storage medium, and electronic device
US11260302B2 (en) Apparatus and method of creating agent in game environment
CN113230650A (zh) 一种数据处理方法、装置及计算机可读存储介质
Stephenson et al. General game heuristic prediction based on ludeme descriptions
CN113543861A (zh) 用于多任务学习的方法及系统
Karavolos et al. Pairing character classes in a deathmatch shooter game via a deep-learning surrogate model
CN113946604B (zh) 分阶段围棋教学方法、装置、电子设备及存储介质
Martínez et al. Extending neuro-evolutionary preference learning through player modeling
Iwasaki et al. A Framework for Generating Playstyles of Game AI with Clustering of Play Logs.
Zhou et al. Discovering of game AIs’ characters using a neural network based AI imitator for AI clustering
Chen et al. WILD-SCAV: Benchmarking FPS gaming AI on Unity3D-based environments
JP7408213B2 (ja) 仮想アプリケーションオブジェクトの出力方法、装置及びコンピュータプログラム
Salge et al. Relevant information as a formalised approach to evaluate game mechanics
Gonzalez Enhanced Monte Carlo Tree Search in Game-Playing AI: Evaluating Deepmind's Algorithms
Handa Neuroevolution with manifold learning for playing Mario
Handa Dimensionality reduction of scene and enemy information in Mario
Langenhoven et al. Swarm tetris: Applying particle swarm optimization to tetris
HK40021455B (zh) 一种虚拟应用对象输出方法、装置以及计算机存储介质
HK40051664B (en) A data processing method, device and computer readable storage medium
CN120952117A (zh) 一种基于遗传算法的游戏卡牌组合生成方法、装置及设备
Chiang Memory Replay With Trajectory for Side-Scrolling Video Games
HK40071022B (zh) 行为模型的训练方法、结构扩容模型的训练方法

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20170524

CF01 Termination of patent right due to non-payment of annual fee