JP7459328B1

JP7459328B1 - Game systems, programs, learning devices, and game provision methods

Info

Publication number: JP7459328B1
Application number: JP2023009241A
Authority: JP
Inventors: ジェロームボサー
Original assignee: Kabushiki Kaisha Bandai Namco Entertainment (also trading as Bandai Namco Entertainment Inc.); Namco Ltd
Current assignee: Kabushiki Kaisha Bandai Namco Entertainment (also trading as Bandai Namco Entertainment Inc.); Namco Ltd
Priority date: 2023-01-25
Filing date: 2023-01-25
Publication date: 2024-04-01
Anticipated expiration: 2043-01-25

Abstract

【課題】第１のキャラクタの行動及び当該行動に関連する関連情報に基づいて、学習された学習データを用いて第２のキャラクタの行動を制御する対戦ゲームにおいて、効率よく学習ができるゲームシステム等を提供すること。【解決手段】第１のキャラクタの行動と当該行動に関連する関連情報とを学習する。プレーヤの情報に基づいて、学習前に、前記関連情報を構成する複数の項目の組み合わせ、学習頻度、及び、前記関連情報を構成する複数の項目それぞれの重み、の少なくとも１つを設定する。【選択図】図３[Problem] To provide a game system or the like that allows efficient learning in a fighting game in which the actions of a second character are controlled using learned learning data based on the actions of a first character and related information related to said actions. [Solution] The actions of a first character and related information related to said actions are learned. Based on player information, at least one of the combination of multiple items that make up the related information, the learning frequency, and the weight of each of the multiple items that make up the related information is set before learning. [Selected Figure] Figure 3

Description

本発明は、ゲームシステム、プログラム、学習装置及びゲーム提供方法
に関する。 The present invention relates to a game system, a program, a learning device, and a game providing method.

従来から、仮想的な３次元空間内において、第１キャラクタと第２のキャラクタとに攻撃又は防御の動作を行わせ対戦格闘を行うゲームが知られている。 Conventionally, there are known games in which a first character and a second character perform offensive and defensive movements in a fighting match in a virtual three-dimensional space.

近年、データを分析及び学習することで生成された学習モデルを利用するゲームシステムが存在する。 In recent years, there have been game systems that utilize learning models generated by analyzing and learning data.

例えば、特許文献１に示す従来技術では、所定タイミング毎に、入力操作部の操作に応じてキャラクタの繰り出す技に関連する操作データと、画面表示に関連する画面状態データとを収集して学習データ記憶部に書き込み、当該操作データと当該画面状態データとに基づいて深層学習の計算処理を行うことによって、学習結果の重みを最適化する。そして、プレーヤの操作対象の第１のキャラクタと、コンピュータの制御対象の第２のキャラクタとが対戦する場合、第２のキャラクタに対して、推論結果のキーデータを取得して、キー履歴に追加し、キー履歴とコマンド表を参照して動作（又は技）を決定する（特許文献１の００６１～００６２、００７７～００７９段落、図９参照）。 For example, in the conventional technology disclosed in Patent Document 1, operation data related to the character's performing technique and screen state data related to the screen display are collected at predetermined timings according to the operation of the input operation unit, and the learning data is collected. The weight of the learning result is optimized by writing into the storage unit and performing deep learning calculation processing based on the operation data and the screen state data. When a first character to be controlled by the player and a second character to be controlled by the computer compete against each other, key data of the inference result is obtained for the second character and added to the key history. Then, the action (or technique) is determined by referring to the key history and command table (see paragraphs 0061 to 0062 and 0077 to 0079 of Patent Document 1, and FIG. 9).

特開２０１９－１９５５１２号公報JP 2019-195512 Publication

複数のキャラクタが対戦ゲームを行う場合、プレーヤの操作対象の第１のキャラクタの行動と当該行動に関連する関連情報とを学習し、学習された学習データを用いて、第２のキャラクタの行動を推論する場合がある。このような学習では、効率性や精度が向上することが好ましい。 When multiple characters play a competitive game, the actions of the first character to be operated by the player and related information related to the actions are learned, and the learned data is used to control the actions of the second character. May be inferred. It is preferable that such learning improves efficiency and accuracy.

本発明は、上記課題に鑑みたものであり、第１のキャラクタの行動及び当該行動に関連する関連情報に基づいて、学習された学習データを用いて第２のキャラクタの行動を制御する対戦ゲームにおいて、効率よく学習ができるゲームシステム等を提供することを目的とする。 The present invention has been made in view of the above-mentioned problems, and provides a competitive game in which the actions of a second character are controlled using learned data based on the actions of a first character and related information related to the actions. The purpose is to provide a game system etc. that allows for efficient learning.

（１）本発明は、
プレーヤの操作入力に基づいて行動を制御する第１のキャラクタと、コンピュータによって制御される第２のキャラクタとが対戦する対戦ゲームにおいて、第２のキャラクタの行動を制御するゲームシステムであって、
第１のキャラクタの行動と当該行動に関連する関連情報とを学習する学習部と、
前記学習部において学習された学習データを用いて、第２のキャラクタの行動を制御する行動制御部と、
前記プレーヤの情報に基づいて、学習前に、前記関連情報を構成する複数の項目の組み合わせ、学習頻度、及び、前記関連情報を構成する複数の項目それぞれの重み、の少なくとも１つを設定する設定部と、を含むことを特徴とするゲームシステムに関する。 (1) The present invention provides
A game system for controlling an action of a second character in a battle game in which a first character, the action of which is controlled based on a player's operation input, and a second character, the second character, are battled against each other, the game system comprising:
a learning unit that learns an action of a first character and related information related to the action;
a behavior control unit that controls a behavior of a second character using the learning data learned by the learning unit;
and a setting unit that sets, based on the player's information, at least one of a combination of multiple items constituting the related information, a learning frequency, and a weight for each of the multiple items constituting the related information before learning.

また、本発明は、上記各部を含む学習装置（端末装置、ゲーム装置）に関する。本発明は、上記各部を含むサーバ装置に関する。本発明は、上記各部としてコンピュータを機能させるプログラムに関する。また、本発明は、コンピュータが読み取り可能な情報記憶媒体であって、上記各部としてコンピュータを機能させるプログラムを記憶する情報記憶媒体に関する。 The present invention also relates to a learning device (terminal device, game device) including the above-mentioned units. The present invention relates to a server device including the above-mentioned parts. The present invention relates to a program that causes a computer to function as each of the above units. The present invention also relates to a computer-readable information storage medium that stores a program that causes the computer to function as each of the above units.

また、本発明は、プレーヤの操作入力に基づいて行動を制御する第１のキャラクタと、コンピュータによって制御される第２のキャラクタとが対戦する対戦ゲームにおいて、第２のキャラクタの行動を制御するサーバ装置から、プレーヤの端末装置に、ゲームに関する情報を提供するためのゲーム提供方法であって、前記端末装置によって受け付けられた前記プレーヤの操作入力に関する情報を前記サーバ装置に送信し、サーバ装置が上記各部を含む、ゲーム提供方法に関する。 The present invention also provides a server that controls the actions of the second character in a competitive game in which a first character whose actions are controlled based on a player's operation input competes with a second character controlled by a computer. A game providing method for providing information regarding the game from a device to a player's terminal device, the server device transmitting information regarding the player's operation input accepted by the terminal device to the server device, and the server device Regarding the game providing method, including each part.

本発明によれば、プレーヤの情報に基づいて、学習前に、関連情報を構成する複数の項目の組み合わせ、学習頻度、及び、関連情報を構成する複数の項目それぞれの重み、の少なくとも１つを設定するので、学習の早い段階から、効率よく学習ができる。 According to the present invention, at least one of the combination of multiple items constituting related information, the learning frequency, and the weight of each of the multiple items constituting related information is set based on the player's information before learning, allowing efficient learning from an early stage.

「学習」とは、例えば、機械学習である。また、「学習」とは、情報を蓄積（記憶）することでもよい。 "Learning" is, for example, machine learning. Moreover, "learning" may also mean accumulating (memorizing) information.

「学習データ」とは、例えば、学習されたデータそのものでもよいし、機械学習によって生成される学習モデルでもよい。 The "learning data" may be, for example, the learned data itself or a learning model generated by machine learning.

（２）本発明は、
プレーヤの操作入力に基づいて行動を制御する第１のキャラクタと、コンピュータによって制御される第２のキャラクタとが対戦する対戦ゲームにおいて、第２のキャラクタの行動を制御するゲームシステムであって、
第１のキャラクタの行動と当該行動に関連する関連情報とを学習する学習部と、
前記学習部において学習された学習データを用いて、第２のキャラクタの行動を制御する行動制御部と、
前記プレーヤの情報に基づいて、ゲーム進行状況に応じて、前記関連情報を構成する複数の項目の組み合わせ、学習頻度、及び、前記関連情報を構成する複数の項目それぞれの重み、の少なくとも１つを設定する設定部と、を含むことを特徴とするゲームシステムに関する。 (2) The present invention:
A game system for controlling the actions of a second character in a competitive game in which a first character whose actions are controlled based on a player's operation input competes with a second character controlled by a computer, the game system comprising:
a learning section that learns the actions of the first character and related information related to the actions;
a behavior control unit that controls the behavior of the second character using the learning data learned in the learning unit;
Based on the player's information, at least one of a combination of a plurality of items constituting the related information, a learning frequency, and a weight of each of the plurality of items constituting the related information is determined according to the progress of the game. The present invention relates to a game system characterized by including a setting section for making settings.

本発明によれば、プレーヤの情報に基づいて、ゲーム進行状況に応じて、関連情報を構
成する複数の項目の組み合わせ、学習頻度、及び、関連情報を構成する複数の項目それぞれの重み、の少なくとも１つを設定するので、リアルタイムに効率よく学習ができる。 According to the present invention, based on the player's information and in accordance with the progress of the game, at least the combination of a plurality of items constituting the related information, the learning frequency, and the weight of each of the plurality of items constituting the related information are determined. By setting one, you can study efficiently in real time.

（３）また、本発明に係るゲームシステム、学習装置、サーバ装置、プログラム、情報記憶媒体及びゲーム提供方法では、
前記設定部は、
前記プレーヤの操作入力に基づき、前記関連情報を構成する複数の項目のうち、重みが高い項目の選択を受け付けるようにしてもよい。本発明によれば、プレーヤが、重みが高い項目を選択できるので、プレーヤの意思を重視して重みの高い項目を設定できる。 (3) Furthermore, in the game system, learning device, server device, program, information storage medium, and game providing method according to the present invention,
The setting section includes:
Based on the operation input from the player, selection of an item with a higher weight among a plurality of items making up the related information may be accepted. According to the present invention, since the player can select an item with a high weight, it is possible to set a high weight item with emphasis on the player's intention.

（４）また、本発明に係るゲームシステム、学習装置、サーバ装置、プログラム、情報記憶媒体及びゲーム提供方法では、
前記設定部は、
前記プレーヤの操作入力に基づき、前記関連情報の複数の項目それぞれの優先順位を設定するようにしてもよい。本発明によれば、プレーヤが複数の項目それぞれの優先順位を設定できるので、プレーヤの意思を重視してレーヤが複数の項目それぞれの優先順位を設定できる。 (4) Furthermore, in the game system, learning device, server device, program, information storage medium, and game providing method according to the present invention,
The setting section includes:
The priority order of each of the plurality of items of the related information may be set based on the player's operation input. According to the present invention, since the player can set the priority order of each of the plurality of items, the player can set the priority order of each of the plurality of items with emphasis on the player's intention.

（５）また、本発明に係るゲームシステム、学習装置、サーバ装置、プログラム、情報記憶媒体及びゲーム提供方法では、
前記プレーヤの情報は、前記プレーヤのプレイ履歴に関する情報を含むようにしてもよい。本発明によれば、プレーヤのプレイ履歴を考慮した重みの設定ができる。 (5) Furthermore, in the game system, learning device, server device, program, information storage medium, and game providing method according to the present invention,
The player information may include information regarding the player's play history. According to the present invention, weights can be set in consideration of a player's play history.

（６）また、本発明に係るゲームシステム、学習装置、サーバ装置、プログラム、情報記憶媒体及びゲーム提供方法では、
前記プレーヤの情報は、前記プレーヤの過去のプレイ傾向に関する情報を含むようにしてもよい。本発明によれば、プレーヤの過去のプレイ傾向を考慮した重みの設定ができる。 (6) Furthermore, in the game system, learning device, server device, program, information storage medium, and game providing method according to the present invention,
The player information may include information regarding the player's past play tendencies. According to the present invention, weights can be set in consideration of the player's past play tendencies.

（７）また、本発明に係るゲームシステム、学習装置、サーバ装置、プログラム、情報記憶媒体及びゲーム提供方法では、
前記プレーヤの情報は、前記プレーヤの所定情報に基づいて決定される他のプレーヤに関連する情報を含むようにしてもよい。本発明によれば、プレーヤの所定情報（例えば、レベル）に基づいて決定される他のプレーヤに関連する情報を考慮して、重みの設定ができる。 (7) Furthermore, in the game system, learning device, server device, program, information storage medium, and game providing method according to the present invention,
The player information may include information related to other players determined based on predetermined information about the player. According to the present invention, weights can be set in consideration of information related to other players determined based on predetermined information (for example, level) of the player.

本実施形態のゲームシステムの構成の一例を示す図。FIG. 1 is a diagram showing an example of the configuration of a game system according to an embodiment of the present invention. 本実施形態のサーバ装置の機能ブロックを示す図。FIG. 3 is a diagram showing functional blocks of a server device according to the present embodiment. 本実施形態の端末装置の機能ブロックを示す図。FIG. 3 is a diagram showing functional blocks of a terminal device according to the present embodiment. 本実施形態のコントローラの一例を示す図。FIG. 1 is a diagram showing an example of a controller according to the present embodiment. 本実施形態のゲーム画面の一例を示す図。The figure which shows an example of the game screen of this embodiment. 本実施形態の行動及び関連情報の説明図。An explanatory diagram of behavior and related information according to the present embodiment. 本実施形態の関連情報を探索するための説明図。An explanatory diagram for searching for related information according to the present embodiment. 本実施形態の行動に対応付けられたコマンドの一例を示す図。FIG. 11 is a diagram showing an example of a command associated with an action according to the embodiment. 本実施形態の行動情報及び関連情報の取得の説明図。FIG. 4 is an explanatory diagram of acquisition of behavior information and related information according to the present embodiment. 本実施形態の関連情報を説明するための図。FIG. 3 is a diagram for explaining related information of this embodiment. 本実施形態の所定期間を説明するための図。FIG. 3 is a diagram for explaining a predetermined period of the present embodiment. 本実施形態の所定期間の設定画面を説明するための図。FIG. 3 is a diagram for explaining a setting screen for a predetermined period according to the present embodiment. 本実施形態の連続行動の説明するための図。FIG. 4 is a diagram for explaining continuous behavior according to the present embodiment. 本実施形態の重み設定画面の一例を示す図。The figure which shows an example of the weight setting screen of this embodiment. 本実施形態の重み設定画面の一例を示す図。The figure which shows an example of the weight setting screen of this embodiment. 本実施形態の重み設定画面の一例を示す図。The figure which shows an example of the weight setting screen of this embodiment. 本実施形態の処理の流れの一例を示すフローチャート。5 is a flowchart illustrating an example of the flow of processing according to the present embodiment. 本実施形態の処理の流れの一例を示すフローチャート。5 is a flowchart illustrating an example of the flow of processing according to the present embodiment.

以下、本実施形態について説明する。なお、以下に説明する本実施形態は、特許請求の範囲に記載された本発明の内容を不当に限定するものではない。また本実施形態で説明される構成の全てが、本発明の必須構成要件であるとは限らない。 This embodiment will be described below. Note that this embodiment described below does not unduly limit the content of the present invention described in the claims. Furthermore, not all of the configurations described in this embodiment are essential components of the present invention.

［１］ゲームシステム
まず、図１を用いて本実施形態のゲームシステムの概要及び概要構成について説明する。なお、図１は、本実施形態のゲームシステムの構成を示すシステム構成の一例を示す図である。 [1] Game System First, an overview and a general configuration of the game system of this embodiment will be described with reference to Fig. 1. Fig. 1 is a diagram showing an example of a system configuration showing the configuration of the game system of this embodiment.

サーバ装置１０は、インターネット（ネットワークの一例）を介して通信接続された端末装置（ゲーム装置）２０を用いて、所与のサービスを提供することが可能な情報処理装置である。 The server device 10 is an information processing device that can provide a given service using a terminal device (game device) 20 that is communicatively connected via the Internet (an example of a network).

端末装置２０は、家庭用テレビゲーム機、パーソナルコンピュータ（ＰＣとも言う。）スマートフォン、携帯電話、ＰＨＳ、コンピュータ、ＰＤＡ、携帯型ゲーム機等、画像生成装置、業務用ゲーム装置、アーケードゲーム装置、筐体などの情報処理装置であり、インターネット（ＷＡＮ）、ＬＡＮなどのネットワークを介してサーバ装置１０に接続可能な装置である。端末装置２０には、入力部２６０が接続されるとともに、表示部２９０が接続される。 The terminal device 20 includes a home video game machine, a personal computer (also referred to as a PC), a smartphone, a mobile phone, a PHS, a computer, a PDA, a portable game machine, an image generating device, an arcade game device, an arcade game device, and a housing. It is an information processing device such as a computer, and is a device that can be connected to the server device 10 via a network such as the Internet (WAN) or LAN. An input section 260 is connected to the terminal device 20, and a display section 290 is also connected thereto.

本実施形態の端末装置２０は、学習機能（例えば、機械学習による学習機能）を有する。端末装置２０は、学習装置と言い換えてもよい。また、端末装置２０は、機械学習による推論機能（予測機能）を有する。端末装置２０は、推論装置（予測装置）と言い換えてもよい。 The terminal device 20 of this embodiment has a learning function (for example, a learning function using machine learning). The terminal device 20 may also be referred to as a learning device. Furthermore, the terminal device 20 has an inference function (prediction function) based on machine learning. The terminal device 20 may also be referred to as an inference device (prediction device).

［１．１］クライアントサーバモデルで構成されるゲームシステム
本実施形態のゲームシステムは、図１に示すように、サーバ装置１０と、端末装置２０（例えば、端末装置２０Ａ、２０Ｂ、２０Ｃ）とが、インターネット（ネットワークの一例）に接続可能に構成される。 [1.1] Game system configured in a client-server model As shown in FIG. 1, the game system of this embodiment is configured such that a server device 10 and terminal devices 20 (e.g., terminal devices 20A, 20B, 20C) are connectable to the Internet (an example of a network).

プレーヤは、端末装置２０からサーバ装置１０にアクセスすることにより、インターネットを介してサーバ装置１０から送信される情報に基づき、ゲームをプレイすることができる。 By accessing the server device 10 from the terminal device 20, the player can play the game based on information transmitted from the server device 10 via the Internet.

特に、本実施形態では、第１のキャラクタと第２のキャラクタとが対戦するゲームを実行させるサーバ装置１０から、プレーヤの端末装置２０に、ゲームに関する情報を提供する。端末装置２０によって受け付けられたプレーヤの操作入力に関する情報をサーバ装置１０に送信する。サーバ装置１０から端末装置２０にゲームに関する情報を送信する。 In particular, in this embodiment, the server device 10, which executes a game in which a first character and a second character play against each other, provides information about the game to the player's terminal device 20. Information about the player's operation input accepted by the terminal device 20 is transmitted to the server device 10. Information about the game is transmitted from the server device 10 to the terminal device 20.

本実施形態は、一つのサーバ装置１０によって各ゲームを端末装置２０に提供してもよいし、複数のサーバ装置１０を連動させてサーバシステムを構築し、各ゲームを端末装置２０に提供してもよい。 In this embodiment, each game may be provided to the terminal device 20 by one server device 10, or a server system may be constructed by linking a plurality of server devices 10, and each game may be provided to the terminal device 20. Good too.

なお、サーバ装置１０だけで本発明に関する各種の処理を行うようにしてもよいし、端末装置２０だけで本発明に関する各種の処理を行うようにしてもよい。 Note that the server device 10 alone may perform various processes related to the present invention, or the terminal device 20 alone may perform various processes related to the present invention.

［１．２］端末装置
本実施形態のゲームシステムは、サーバ装置１０の機能を備えた単一の端末装置、すなわち、サーバ装置などの他の機器に依存せず単独で動作する装置（スタンドアローン）によって実現してもよい。 [1.2] Terminal Device The game system of this embodiment may be realized by a single terminal device having the functions of the server device 10, that is, a device that operates independently (standalone) without relying on other devices such as a server device.

また、本実施形態では、サーバ装置１０と接続せずに、端末装置２０だけで本発明を実現するものであってもよい。例えば、Ｐ２Ｐ（ピア・ツー・ピア方式）による通信によって複数の端末装置２０で実現するゲームシステムであってもよい。 Further, in this embodiment, the present invention may be implemented only by the terminal device 20 without being connected to the server device 10. For example, it may be a game system realized by a plurality of terminal devices 20 through P2P (peer-to-peer) communication.

そして、このような端末装置を有線又は無線によって複数連結させ、一の端末装置がホスト（サーバ装置１０）として機能して、複数の端末装置によって実現してもよい。 The system may be realized by connecting multiple such terminal devices by wire or wirelessly, with one terminal device functioning as a host (server device 10).

なお、端末装置は、端末装置だけでなく、タブレット型情報端末装置やパーソナルコンピュータ、又は、アミューズメントパークに設置される端末装置（筐体）でもよい。 Note that the terminal device may be not only a terminal device but also a tablet type information terminal device, a personal computer, or a terminal device (housing) installed in an amusement park.

［１．３］クラウド型のゲームシステム
また、本実施形態のゲームシステムはクラウド型のゲームシステムでもよい。例えば、クラウド型のゲームシステムでは、サーバ装置１０（クラウド装置）だけで基本的な処理（ゲーム処理、描画処理などを含む）を行い、端末装置２０側ではサーバ装置１０（クラウド装置）の処理結果の表示制御のみを行う。また、端末装置２０は、サーバ装置１０に対してコントローラの操作入力（操作情報、入力情報）などの情報を送信する。なお、かかる場合、端末装置２０は、操作入力以外の情報を送信してもよい。 [1.3] Cloud-based Game System The game system of this embodiment may be a cloud-based game system. For example, in a cloud-based game system, basic processing (including game processing, drawing processing, etc.) is performed only by the server device 10 (cloud device), and the terminal device 20 only controls the display of the processing results of the server device 10 (cloud device). The terminal device 20 transmits information such as operation input (operation information, input information) of the controller to the server device 10. In this case, the terminal device 20 may transmit information other than the operation input.

さらに、本実施形態では、入力を除き、上記の端末装置２０の処理部２００の各機能及びゲームプログラムの実行をサーバ装置１０で実行し、当該端末装置２０は、入力とストリーミングによる画像表示を実行することによって、上記のゲームを実現してもよい。 Furthermore, in this embodiment, the server device 10 executes each function of the processing unit 200 of the terminal device 20 described above and the game program, except for input, and the terminal device 20 executes input and image display by streaming. The above game may be realized by doing the following.

また、ゲームシステムは、インターネット上の所与の記憶領域に、サーバ装置１０に記憶される情報、端末装置２０に記憶される情報を記憶するようにしてもよい。 Further, the game system may store information stored in the server device 10 and information stored in the terminal device 20 in a given storage area on the Internet.

［１．４］複数のコントローラを備えた端末装置
また、本実施形態では、１台の端末装置２０が複数のコントローラ（入力部２６０）を備えたものでもよい。なお、複数のプレーヤが１台の端末装置２０の各プレーヤのコントローラを用いてゲームをプレイする場合、複数のプレーヤが現実的に近くにいることが想定される。 [1.4] Terminal device including multiple controllers In the present embodiment, one terminal device 20 may include multiple controllers (input section 260). Note that when a plurality of players play a game using the controllers of each player of one terminal device 20, it is assumed that the plurality of players are realistically close to each other.

［１．５］ソーシャルゲームの例
また、本実施形態では、コミュニケーション型のサービスを提供するＳＮＳサーバとして機能してもよい。ここで、ＳＮＳサーバとは、複数のプレーヤ間でコミュニケーションを提供することが可能なサービスを提供する情報処理装置であってもよい。 [1.5] Example of Social Game Additionally, in this embodiment, the server may function as an SNS server that provides communication-type services. Here, the SNS server may be an information processing device that provides a service that allows communication between multiple players.

また、本実施形態では、例えば、ＳＮＳサーバとして機能する場合には、提供するＳＮＳの動作環境（ＡＰＩ（アプリケーションプログラミングインタフェース）、プラットフォーム等）を利用して実行されるソーシャルゲーム（ＳｏｃｉａｌＧａｍｅ）と呼ばれるゲームを提供できる。 In addition, in this embodiment, for example, when functioning as an SNS server, it is possible to provide games known as social games that are executed using the operating environment (API (Application Programming Interface), platform, etc.) of the SNS provided.

なお、ソーシャルゲームとは、既存のオンラインゲームとは違い、専用のクライアント
ソフトウェアを必要とせず、ＷｅｂブラウザとＳＮＳのアカウントのみで利用可能なゲームが含まれる。また、本実施形態では、ネットワークを介して他のプレーヤの端末装置２０と接続し、オンラインで同時に同じゲーム進行を共有することができるオンラインゲームを提供することが可能な構成を有している。 Note that, unlike existing online games, social games include games that do not require dedicated client software and can be used only with a web browser and SNS account. Furthermore, this embodiment has a configuration that can provide an online game in which the player can connect to the terminal device 20 of another player via a network and share the same game progress online at the same time.

［１．６］ブラウザゲームの例
特に、本実施形態では、端末装置２０のＷｅｂブラウザ上で提供されるゲーム、例えばＨＴＭＬ、ＦＬＡＳＨ（登録商標）、ＣＧＩ、ＰＨＰ、ｓｈｏｃｋｗａｖｅ、Ｊａｖａ（登録商標）アプレット、ＪａｖａＳｃｒｉｐｔ（登録商標）など様々な言語で作られたブラウザゲーム（Ｗｅｂブラウザで設置サイトを開くだけで起動するゲーム）を提供してもよい。 [1.6] Examples of browser games In particular, in this embodiment, games provided on the web browser of the terminal device 20, such as HTML, FLASH (registered trademark), CGI, PHP, shockwave, Java (registered trademark) applets Browser games (games that can be started simply by opening an installation site with a Web browser) created in various languages such as , JavaScript (registered trademark), etc. may be provided.

また、端末装置２０は、Ｗｅｂページ（ＨＴＭＬ形式のデータ）を閲覧可能なＷｅｂブラウザを備えている。すなわち、端末装置２０は、サーバ装置１０との通信を行うための通信制御機能、及びサーバ装置１０から受信したデータ（Ｗｅｂデータ、ＨＴＭＬ形式で作成されたデータなど）を用いて表示制御を行うとともに、プレーヤ操作のデータをサーバ装置１０に送信するＷｅｂブラウザ機能などを備え、画面をプレーヤに提供する各種の処理を実行し、プレーヤによってゲームを実行させるようになっている。ただし、端末装置２０は、サーバ装置１０から提供されたゲーム制御情報を取得して所定のゲーム処理を実行し、ゲーム処理に基づくゲームを実行してもよい。 The terminal device 20 also has a web browser capable of viewing web pages (data in HTML format). That is, the terminal device 20 has a communication control function for communicating with the server device 10, and a web browser function for controlling display using data received from the server device 10 (web data, data created in HTML format, etc.), as well as transmitting data of player operation to the server device 10, and executes various processes for providing a screen to the player and allowing the player to play the game. However, the terminal device 20 may also acquire game control information provided by the server device 10, execute a predetermined game process, and play a game based on the game process.

具体的には、端末装置２０は、所定ゲームを行う旨の要求をサーバ装置１０に対して行うと、サーバ装置１０のゲームサイトに接続され、ゲームが開始される。特に、端末装置２０は、必要に応じてＡＰＩを用いることにより、ＳＮＳサーバとして機能するサーバ装置１０に所定の処理を行わせ、又は、ＳＮＳサーバとして機能するサーバ装置１０が管理するプレーヤの情報等を取得させてゲームを実行する構成を有してもよい。 Specifically, when the terminal device 20 makes a request to the server device 10 to play a predetermined game, it is connected to the game site of the server device 10 and starts the game. In particular, the terminal device 20 uses the API as necessary to cause the server device 10 functioning as an SNS server to perform predetermined processing, or to perform player information etc. managed by the server device 10 functioning as an SNS server. It may also have a configuration in which the game is executed by causing the player to acquire the following information.

［１．７］その他
本実施形態では、１つの（装置、プロセッサ）で構成されていてもよいし、複数の（装置、プロセッサ）で構成されていてもよい。サーバ装置１０の記憶領域（後述する記憶部１７０）に記憶される情報（例えば、プレーヤの情報、ゲーム情報等）を、ネットワーク（イントラネット又はインターネット）を介して接続されたデータベース（広義には記憶装置、メモリ）に記憶するようにしてもよい。なお、端末装置２０とサーバ装置１０との通信回線は、有線でもよいし無線でもよい。 [1.7] Others This embodiment may be configured with one (device, processor), or may be configured with a plurality of (devices, processors). Information (for example, player information, game information, etc.) stored in the storage area (storage unit 170 described later) of the server device 10 is stored in a database (in a broad sense, a storage device) connected via a network (intranet or Internet). , memory). Note that the communication line between the terminal device 20 and the server device 10 may be wired or wireless.

［２］サーバ装置
次に、図２を用いて本実施形態のサーバ装置１０について説明する。なお、図２は、本実施形態のサーバ装置１０の機能ブロックを示す図である。また、本実施形態の本実施形態では図２の構成要素（各部）の一部を省略した構成としてもよい。 [2] Server Device Next, the server device 10 of this embodiment will be described using FIG. 2. Note that FIG. 2 is a diagram showing functional blocks of the server device 10 of this embodiment. Further, in this embodiment of the present invention, some of the constituent elements (each part) in FIG. 2 may be omitted.

本実施形態では、管理者やその他の入力に用いるための入力部１６０、所定の情報が記憶された情報記憶媒体１８０、端末装置２０やその他と通信を行う通信部１９６、主に提供するゲームに関する処理を実行する処理部１００、及び、主にゲームに用いる各種のデータを記憶する記憶部１７０を含む。 In this embodiment, an input section 160 used for administrator and other input, an information storage medium 180 in which predetermined information is stored, a communication section 196 for communicating with the terminal device 20 and others, and a communication section 196 mainly related to the game to be provided. It includes a processing section 100 that executes processing, and a storage section 170 that stores various data mainly used for games.

入力部１６０は、システム管理者等がゲームに関する設定やその他の必要な設定、データの入力に用いるものである。例えば、本実施形態の入力部１６０は、マウスやキーボード等によって構成される。 The input unit 160 is used by a system administrator or the like to input game-related settings and other necessary settings and data. For example, the input unit 160 of this embodiment is configured with a mouse, a keyboard, or the like.

情報記憶媒体１８０（コンピュータにより読み取り可能な媒体）は、プログラムやデー
タなどを格納するものであり、その機能は、光ディスク（ＣＤ、ＤＶＤ）、光磁気ディスク（ＭＯ）、磁気ディスク、ハードディスク、磁気テープ、或いはメモリ（ＲＯＭ）などによって構成される。 The information storage medium 180 (a computer-readable medium) stores programs, data, etc., and its functions are realized by an optical disk (CD, DVD), a magneto-optical disk (MO), a magnetic disk, a hard disk, a magnetic tape, or a memory (ROM), etc.

通信部１９６は、外部（例えば、端末、他のサーバや他のネットワークシステム）との間で通信を行うための各種制御を行うものであり、その機能は、各種プロセッサ又は通信用ＡＳＩＣなどのハードウェアや、プログラムなどによって構成される。 The communication unit 196 performs various controls for communicating with the outside (for example, terminals, other servers, and other network systems), and its functions are performed by hardware such as various processors or communication ASICs. It consists of software, programs, etc.

記憶部１７０は、処理部１００や通信部１９６などのワーク領域となるもので、その機能は、ＲＡＭ（ＶＲＡＭ）などによって構成される。なお、記憶部１７０に記憶される情報は、データベースで管理してもよい。 The storage unit 170 serves as a work area for the processing unit 100, the communication unit 196, and the like, and its functions are configured by RAM (VRAM) and the like. Note that the information stored in the storage unit 170 may be managed in a database.

また、本実施形態の記憶部１７０は、主記憶部１７１の他に、ゲームに関する情報を示すゲーム情報が記憶されるゲーム情報記憶部１７４、及び、プレーヤ情報記憶部（ユーザ情報記憶部）１７６を有している。 In addition to the main storage section 171, the storage section 170 of this embodiment includes a game information storage section 174 in which game information indicating information about the game is stored, and a player information storage section (user information storage section) 176. have.

特に、ゲーム情報記憶部１７４には、ゲームが実施される仮想空間（ゲームフィールド）の情報、仮想カメラの情報、ゲームで用いられる設定値等が記憶される。 In particular, the game information storage unit 174 stores information on a virtual space (game field) in which the game is played, information on a virtual camera, setting values used in the game, and the like.

また、プレーヤ情報記憶部１７６には、プレーヤ毎（ユーザ毎）に、プレーヤＩＤ（ユーザＩＤ）に対応付けてプレーヤ情報（ユーザの情報）が記憶される。プレーヤＩＤは、プレーヤを識別するための識別情報である。 Further, the player information storage unit 176 stores player information (user information) for each player (for each user) in association with the player ID (user ID). The player ID is identification information for identifying the player.

処理部１００は、記憶部１７０内の主記憶部１７１をワーク領域として各種処理を行う。処理部１００の機能は各種プロセッサ（ＣＰＵ、ＤＳＰ等）、ＡＳＩＣ（ゲートアレイ等）などのハードウェアや、プログラムにより実現できる。 The processing unit 100 performs various processes using the main storage unit 171 within the storage unit 170 as a work area. The functions of the processing unit 100 can be realized by hardware such as various processors (CPU, DSP, etc.), ASIC (gate array, etc.), and programs.

処理部１００は、情報記憶媒体１８０に格納されるプログラム（データ）に基づいて本実施形態の種々の処理を行う。即ち情報記憶媒体１８０には、本実施形態の各部としてコンピュータを機能させるためのプログラム（各部の処理をコンピュータに実行させるためのプログラム）が記憶される。 The processing unit 100 performs various processes of this embodiment based on programs (data) stored in the information storage medium 180. That is, the information storage medium 180 stores a program for making the computer function as each part of this embodiment (a program for causing the computer to execute the processing of each part).

例えば、処理部１００（プロセッサ）は、情報記憶媒体１８０に記憶されているプログラムに基づいて、サーバ装置１０全体の制御を行うとともに、各部間におけるデータ等の受け渡しの制御などの各種の処理を行う。さらに、端末装置２０からの要求に応じた各種サービスを提供する処理を行う。 For example, the processing unit 100 (processor) controls the entire server device 10 based on a program stored in the information storage medium 180, and performs various processes such as controlling the exchange of data etc. between each unit. . Furthermore, processing for providing various services in response to requests from the terminal device 20 is performed.

なお、本実施形態では、サーバ装置１０が処理部１００の一部又は全部の処理を行ってもよいし、端末装置２０が処理部１００の一部の処理を行ってもよい。 Note that in this embodiment, the server device 10 may perform some or all of the processing of the processing section 100, or the terminal device 20 may perform some of the processing of the processing section 100.

処理部１００は、ゲーム処理部１１１、表示制御部１１２、受け付け部１１９、通信制御部１２０、Ｗｅｂ処理部１２１、管理部１２２、通知部１２３を含む。 The processing unit 100 includes a game processing unit 111, a display control unit 112, a reception unit 119, a communication control unit 120, a Web processing unit 121, a management unit 122, and a notification unit 123.

ゲーム処理部１１１は、本実施形態のゲーム（例えば、格闘ゲーム）を提供するための各種制御を行う。例えば、処理部１００は、端末装置２０と連動し、当該端末装置２０を介して入力されたプレーヤの入力に基づいて、各プレーヤにおいて本実施形態で提供するゲームのゲーム処理を実行する。 The game processing unit 111 performs various controls for providing the game (for example, a fighting game) of this embodiment. For example, the processing unit 100 works in conjunction with the terminal device 20, and executes game processing of the game provided in this embodiment in each player based on player input input via the terminal device 20.

また、処理部１００は、タイマ機能を有し、ゲームの進行状況を管理するため、各端末装置２０と同期を取るために用いるようにしてもよい。特に、現在時刻や予め設定された
時刻を各部に出力するようにしてもよい。 Further, the processing unit 100 may have a timer function and may be used to synchronize with each terminal device 20 in order to manage the progress of the game. In particular, the current time or a preset time may be output to each section.

ゲーム処理部１１１は、プレーヤの操作入力（操作情報）に基づいて、ゲーム処理を実行する。例えば、ゲーム処理部１１１は、プレーヤの端末装置２０から当該プレーヤの操作入力（操作情報）を受信して、ゲーム処理を行うようにしてもよい。また、オンライン対戦時には、各プレーヤとのマッチング処理等を行う。 The game processing unit 111 executes game processing based on the player's operation input (operation information). For example, the game processing unit 111 may receive the player's operation input (operation information) from the player's terminal device 20 and perform game processing. Furthermore, during online competition, matching processing with each player is performed.

表示制御部１１２は、ゲーム状況に応じてサーバ装置１０が生成した画面（画像）を、端末装置２０の表示部２９０に表示するように制御する。つまり、サーバ装置１０が生成した画面（画像）の表示先を、端末装置２０の表示部２９０とする。 The display control unit 112 controls the screen (image) generated by the server device 10 according to the game situation to be displayed on the display unit 290 of the terminal device 20. In other words, the screen (image) generated by the server device 10 is displayed on the display unit 290 of the terminal device 20 .

表示制御部１１２は、プレーヤ毎に、プレーヤの端末装置２０に表示する画面（ゲーム画像、演出画像）を表示するための制御を行う。 The display control unit 112 controls the display of the screen (game images, presentation images) on the player's terminal device 20 for each player.

受け付け部１１９は、ゲームに関する情報を受け付けてもよい。例えば、受け付け部１１９は、受け付け部１１９は、端末装置２０から操作入力の内容を受信して受け付けてもよい。 The accepting unit 119 may accept information regarding the game. For example, the receiving unit 119 may receive and accept the contents of the operation input from the terminal device 20.

また、受け付け部１１９は、通信制御部１２０によるデータ送受信によって、プレーヤの入力やゲームに関する情報などを受け付けるようにしてもよい。 Further, the receiving unit 119 may receive player input, information regarding the game, etc. through data transmission and reception by the communication control unit 120.

通信制御部１２０は、端末装置２０との接続（セッションやコネクション）を確立し、ネットワークを介してデータを通信（送受信）する処理を行う。 The communication control unit 120 establishes a connection (session or connection) with the terminal device 20 and performs processing for communicating (sending and receiving) data via the network.

例えば、通信制御部１２０は、プレーヤの端末装置２０に、ゲームに関する情報を送信する。また、通信制御部１２０は、プレーヤの端末装置２０から、当該端末装置２０によって受け付けられたプレーヤの操作情報を受信する。また、通信制御部１２０は、端末装置２０にゲームに関する情報を送信する。 For example, the communication control unit 120 transmits information regarding the game to the player's terminal device 20. Further, the communication control unit 120 receives, from the player's terminal device 20, the player's operation information accepted by the terminal device 20. Further, the communication control unit 120 transmits information regarding the game to the terminal device 20.

Ｗｅｂ処理部１２１は、Ｗｅｂサーバとして機能する。例えば、Ｗｅｂ処理部１２１は、ＨＴＴＰ（ＨｙｐｅｒｔｅｘｔＴｒａｎｓｆｅｒＰｒｏｔｏｃｏｌ）等の通信プロトコルを通じて、端末装置２０にインストールされているＷｅｂブラウザの要求に応じてデータを送信する処理、及び、端末装置２０のＷｅｂブラウザによって送信されるデータを受信する処理を行う。 The web processing unit 121 functions as a web server. For example, the web processing unit 121 performs a process of transmitting data in response to a request from a web browser installed in the terminal device 20 through a communication protocol such as HTTP (Hypertext Transfer Protocol), and a process of receiving data transmitted by the web browser of the terminal device 20.

管理部１２２は、ゲーム情報や、プレーヤ毎に、プレーヤの識別情報に対応付けられたプレーヤの情報を管理する。 The management unit 122 manages game information and player information associated with player identification information for each player.

また、通知部１２３は、プレーヤに所与の情報を通知する。なお、「通知」するとは、プレーヤ（プレーヤの端末装置２０）に対して情報を通知することである。なお、「通知」を、提供、提示、表示、送信という文言に言い換えて解釈してもよい。 The notification unit 123 also notifies the player of given information. Note that "notifying" means notifying the player (the player's terminal device 20) of information. Note that "notifying" may also be interpreted as providing, presenting, displaying, or transmitting.

通知部１２３は、画面を端末装置２０において通知するように送信制御をしてもよい。つまり、通知部１２３は、画面（画像）の表示制御情報を生成し、生成した表示制御情報を、プレーヤの端末装置２０（例えば、端末装置２０の表示部２９０）に表示するように制御する。 The notification unit 123 may perform transmission control so as to notify the screen on the terminal device 20. That is, the notification unit 123 generates display control information for the screen (image), and controls the generated display control information to be displayed on the player's terminal device 20 (for example, the display unit 290 of the terminal device 20).

なお、サーバ装置１０は、以下に説明する端末装置２０の全部又は一部の機能を有していてもよい。 Note that the server device 10 may have all or part of the functions of the terminal device 20 described below.

［３］端末装置
次に、図３を用いて本実施形態の端末装置２０について説明する。なお、図３は、本実施形態における端末装置２０の構成を示す機能ブロック図の一例である。また、本実施形態の端末装置２０は図３の構成要素（各部）の一部を省略した構成としてもよい。 [3] Terminal Device Next, the terminal device 20 of this embodiment will be described using FIG. 3. Note that FIG. 3 is an example of a functional block diagram showing the configuration of the terminal device 20 in this embodiment. Further, the terminal device 20 of this embodiment may have a configuration in which some of the components (each unit) in FIG. 3 are omitted.

情報記憶媒体２８０（コンピュータにより読み取り可能な媒体）は、プログラムやデータなどを格納するものであり、その機能は、光ディスク（ＣＤ、ＤＶＤ）、光磁気ディスク（ＭＯ）、磁気ディスク、ハードディスク、磁気テープ、或いはメモリ（ＲＯＭ）などにより実現できる。 The information storage medium 280 (computer readable medium) stores programs, data, etc., and its functions include optical disks (CD, DVD), magneto-optical disks (MO), magnetic disks, hard disks, and magnetic tapes. , or a memory (ROM).

入力部２６０は、操作入力（操作信号）を入力可能なボタンやレバーを備える。図４は、入力部２６０の一例のコントローラ３０の外観図である。なお、コントローラ３０は、ゲームセンターのアーケードゲームと同じコントローラでもよい。 The input unit 260 includes buttons and levers through which operation inputs (operation signals) can be input. FIG. 4 is an external view of the controller 30 as an example of the input section 260. Note that the controller 30 may be the same controller used for arcade games at game centers.

端末装置２０は、プレーヤに表示部２９０に表示されたゲーム画面を視聴させながら、対戦格闘ゲームを興趣させるため、記憶部２７０や情報記憶媒体２８０から読み出したゲームのプログラムやゲーム情報、又は、コントローラ３０から入力される操作入力等に基づいて、格闘ゲームを実行する。 The terminal device 20 reads the game program or game information from the storage unit 270 or the information storage medium 280, or the controller in order to make the player enjoy the fighting game while viewing the game screen displayed on the display unit 290. The fighting game is executed based on the operation input etc. input from 30.

コントローラ３０は、４つの押点を有する十字型の方向スイッチ３１が装着され、４つの押ボタン３３（３３ａ～３３ｄ）が十字形（上下左右の４方向）に配置して装着される。 The controller 30 is equipped with a cross-shaped direction switch 31 having four push points, and four push buttons 33 (33a to 33d) arranged in a cross shape (in four directions: up, down, left, and right).

また、コントローラ３０は、アナログジョイスティック（以下「ジョイスティック」と略称する）３２と、ジョイスティック３４が傾動自在に装着される。 Further, an analog joystick (hereinafter abbreviated as "joystick") 32 and a joystick 34 are attached to the controller 30 so as to be tiltable.

方向スイッチ３１、ジョイスティック３２、押ボタン３３及びジョイスティック３４は、キャラクタの移動方向の指示、又は技の種類を指定するための操作に用いられる。具体的には、方向スイッチ３１は、上下左右の４方向に押点を有し、上下左右の何れかの押点を押すことによって、上下左右の何れかの方向を指示するための入力を行うものである。また、ジョイスティック３２及びジョイスティック３４は、上下左右の４方向と、４方向の中間となる右斜め上（又は右上），右斜め下（又は右下），左斜め下（又は左下），左斜め上（又は左上）の４方向を加えた８方向の入力が可能である。 The direction switch 31, the joystick 32, the push button 33, and the joystick 34 are used for operations for instructing the moving direction of the character or specifying the type of technique. Specifically, the direction switch 31 has push points in four directions (up, down, left, and right), and by pressing any of the push points in the up, down, left, and right directions, input for instructing any of the up, down, left, and right directions is performed. It is something. Furthermore, the joystick 32 and the joystick 34 are arranged in four directions: up, down, left and right, and diagonally upper right (or upper right), diagonally lower right (or lower right), diagonally lower left (or lower left), and diagonally upper left, which are in the middle of the four directions. Inputs can be made in eight directions, including the four directions (or upper left).

なお、方向スイッチ３１を用いて８方向を指示する場合は、上と右の押点を同時に押して「右斜め上」、右と下の押点を同時に押して「右斜め下」、下と左の押点を同時に押して「左斜め下」、左と上の押点を同時に押して「左斜め上」のそれぞれの入力（斜め４方向）を指示することもできる。 When specifying eight directions using the direction switch 31, press the top and right touch points at the same time to select "diagonally up right", press the right and bottom touch points simultaneously to select "diagonally right down", and press the right and bottom touch points simultaneously to select "diagonally right down" It is also possible to instruct the respective inputs (in four diagonal directions), such as "diagonally down left" by pressing the touch points at the same time, and "diagonally up left" by pushing the left and upper touch points simultaneously.

押ボタン３３ａ～３３ｄは、その上面に△〇×□等の記号が付されてもよい。押ボタン３３ａ～３３ｄは、ゲームにおけるキャラクタの行動（例えば、攻撃）を指示するために用いられる。例えば、本実施形態では、押ボタン３３ａ～３３ｄには、右パンチ、右キック、左キック、左パンチの機能が割り当てられる。例えば、プレーヤは、ゲームプレイする場合、左手親指で方向スイッチ３１又はジョイスティック３２を操作し、右手親指で押ボタン３３ａ～３３ｄ又はジョイスティック３４を操作する。 The pushbuttons 33a to 33d may be marked with symbols such as △〇×□ on their upper surfaces. The pushbuttons 33a to 33d are used to instruct the character's actions (eg, attack) in the game. For example, in this embodiment, the functions of right punch, right kick, left kick, and left punch are assigned to the pushbuttons 33a to 33d. For example, when playing a game, the player operates the direction switch 31 or joystick 32 with the thumb of the left hand, and operates the push buttons 33a to 33d or the joystick 34 with the thumb of the right hand.

なお、入力部２６０は、タッチパネル又はタッチパネル型ディスプレイなどにより実現できてもよい。すなわち、入力部２６０は、画像が表示される画面上における２次元の指示位置座標（ｘ，ｙ）を検出可能な検出部２６２を備えている。例えば、入力部２６０は、接触検出領域（タッチパネル）における、２次元の接触位置座標（ｘ，ｙ）を検出可能
な検出部２６２を備えている。なお、表示画面（以下、特別な場合を除き「タッチパネル」という。）への接触操作は、指先を用いて行うようにしてもよいし、タッチペンなどの入力機器を用いて行うようにしてもよい。 Note that the input unit 260 may be realized by a touch panel, a touch panel type display, or the like. That is, the input unit 260 includes a detection unit 262 that can detect two-dimensional designated position coordinates (x, y) on the screen where an image is displayed. For example, the input unit 260 includes a detection unit 262 that can detect two-dimensional contact position coordinates (x, y) in a contact detection area (touch panel). Note that the touch operation on the display screen (hereinafter referred to as the "touch panel" except in special cases) may be performed using a fingertip, or may be performed using an input device such as a touch pen. .

また、入力部２６０は、キーボード、ステアリング、マイク、加速度センサなどを備えていてもよい。 Further, the input unit 260 may include a keyboard, a steering wheel, a microphone, an acceleration sensor, and the like.

記憶部２７０は、記憶部２７０は、２００や通信部２９６などのワーク領域となるもので、その機能はＲＡＭ（ＶＲＡＭ）などにより実現できる。 The memory unit 270 serves as a work area for the memory unit 200 and the communication unit 296, and its functions can be realized by a RAM (VRAM) or the like.

記憶部２７０の記憶領域の一例の図である。そして、本実施形態の記憶部２７０は、主記憶部２７１と、画像バッファ２７２と、ゲーム情報記憶部２７４、プレーヤ情報記憶部２７６と、学習記憶部２７８、学習モデル記憶部２７９を含む。なお、これらの一部を省略する構成としてもよいし、サーバ装置１０の記憶部１７０或いはネットワークを介して接続された所与の記憶装置（クラウド上の記憶領域）がその一部を構成してもよい。 3 is a diagram illustrating an example of a storage area of a storage unit 270. FIG. The storage unit 270 of this embodiment includes a main storage unit 271, an image buffer 272, a game information storage unit 274, a player information storage unit 276, a learning storage unit 278, and a learning model storage unit 279. Note that some of these may be omitted, or the storage unit 170 of the server device 10 or a given storage device (storage area on the cloud) connected via a network may constitute part of the storage section. Good too.

主記憶部２７１は、ワーク領域として使用される。画像バッファ２７２には、画像生成部２３０によって生成された画像データ（表示画像）が記憶される。なお、サーバ装置１０が生成した画像データを受信し、当該画像データを画像バッファ２７２に記憶してもよい。 Main storage section 271 is used as a work area. The image buffer 272 stores image data (display image) generated by the image generation unit 230. Note that image data generated by the server device 10 may be received and the image data may be stored in the image buffer 272.

ゲーム情報記憶部２７４には、ゲーム情報が記憶される。なお、端末装置２０は、サーバ装置１０から、ゲーム情報を受信して、ゲーム情報記憶部２７４に、当該ゲーム情報を記憶してもよい。 Game information is stored in the game information storage section 274. Note that the terminal device 20 may receive game information from the server device 10 and store the game information in the game information storage section 274.

ゲーム情報は、コマンド情報、キャラクタ情報を含む。ゲーム情報記憶部２７４に記憶されるゲーム情報は、キャラクタの行動情報及び当該行動情報に関連する関連情報を含んでもよい。また、ゲーム情報記憶部２７４に記憶されるゲーム情報は、ゲームフィールド（ゲーム空間）の情報、仮想カメラの情報、その他ゲームで用いられる設定値等を含んでもよい。 Game information includes command information and character information. The game information stored in the game information storage unit 274 may include character behavior information and related information related to the behavior information. Further, the game information stored in the game information storage unit 274 may include information on the game field (game space), information on the virtual camera, and other setting values used in the game.

コマンド情報は、キャラクタ毎に用意される。コマンド情報は、予め定義されたキーデータである。つまり、コマンド情報は、行動に対応する、コントローラ３０の入力すべきキー（入力すべきキーが複数の場合はそのキーの入力順）を示す。 Command information is prepared for each character. Command information is predefined key data. In other words, the command information indicates the key to be inputted from the controller 30 (if there are multiple keys to be inputted, the order in which the keys are inputted), which corresponds to the action.

キャラクタ情報は、キャラクタに関する情報である。例えば、キャラクタのパラメータ（体力値、攻撃力、防御力等のパラメータ）、キャラクタが有するアイテム（武器）、レベル（段位、ランク）等である。レベルは、技量レベルの高さ、キャラクタの強さを示す。本実施形態では、レベルは、ゲーム成績によって順次更新される。 Character information is information regarding a character. For example, the information includes parameters of the character (parameters such as physical strength, attack power, defense power, etc.), items (weapons) possessed by the character, level (grade, rank), and the like. The level indicates the skill level and strength of the character. In this embodiment, the level is sequentially updated according to game results.

プレーヤ情報記憶部２７６は、プレーヤの情報が記憶される。なお、端末装置２０は、サーバ装置１０から、プレーヤＩＤに対応するプレーヤ情報を受信して、プレーヤ情報記憶部２７６に記憶するようにしてもよい。 The player information storage section 276 stores player information. Note that the terminal device 20 may receive player information corresponding to the player ID from the server device 10 and store it in the player information storage section 276.

例えば、プレーヤ情報記憶部２７６には、プレーヤのレベル、プレイ履歴、プレイ傾向、プレーヤに関連する他のプレーヤの情報、等が記憶される。 For example, the player information storage unit 276 stores the player's level, play history, play tendency, information on other players related to the player, and the like.

なお、プレーヤ情報は、プレーヤの操作入力（操作情報）を含んでもよい。また、プレーヤ情報は、プレーヤに対応付けられたプレーヤの情報だけでなく、プレーヤのゲーム媒体（例えば、プレーヤが所有するキャラクタ、カード、アイテムなどのゲーム媒体）の情
報等を含む概念であってもよい。 The player information may include the operation input (operation information) of the player. Furthermore, the player information may be a concept including not only the player information associated with the player, but also information on the player's game media (e.g., game media such as characters, cards, and items owned by the player), etc.

学習記憶部２７８には、学習（例えば、機械学習）のための情報（例えば、学習データ、教師データ、重みデータ、各種パラメータ）が記憶される。 The learning storage unit 278 stores information (for example, learning data, teacher data, weight data, various parameters) for learning (for example, machine learning).

機械学習とは、入力値と出力値から、そこに潜むパターンを獲得することを目的とした学習方法である。ニューラルネットワーク（深層学習）は、機械学習で用いられる１つの枠組みである。ニューラルネットワークは、人工ニューロンという数式的なモデル（関数）で表現したものである。本実施形態の機械学習は、ニューラルネットワークに限らず、他のアルゴリズム（ロジスティック回帰、決定木、ブースティング、ランダムフォレスト、サポートベクターマシン等）でもよい。 Machine learning is a learning method that aims to acquire latent patterns from input and output values. Neural networks (deep learning) are one framework used in machine learning. A neural network is expressed using a mathematical model (function) called an artificial neuron. Machine learning in this embodiment is not limited to neural networks, and may be other algorithms (logistic regression, decision tree, boosting, random forest, support vector machine, etc.).

学習モデル記憶部２７９には、学習モデルが記憶される。学習モデルは、入力データと出力データとの相関関係（関係性）や傾向を数式（関数）で表したものである。学習モデル記憶部２７９には、学習部２１８において機械学習された学習モデルが記憶される。 The learning model storage unit 279 stores learning models. A learning model is a formula (function) that expresses the correlation and tendency between input data and output data. The learning model storage unit 279 stores a learning model that has been subjected to machine learning in the learning unit 218.

情報記憶媒体２８０について説明すると、情報記憶媒体２８０は、本実施形態の各部としてコンピュータを機能させるためのプログラム（各部の処理をコンピュータに実行させるためのプログラム）を記憶することができる。なお、処理部２００は、後述するように、情報記憶媒体２８０に格納されるプログラム（データ）に基づいて本実施形態の種々の処理を行う。 Regarding the information storage medium 280, the information storage medium 280 can store programs for causing a computer to function as each part of this embodiment (programs for causing a computer to execute the processing of each part). Note that the processing unit 200 performs various processes of this embodiment based on the programs (data) stored in the information storage medium 280, as described below.

表示部２９０は、本実施形態により生成された画像を出力するものであり、その機能は、ＣＲＴ、ＬＣＤ、タッチパネル型ディスプレイ、或いはＨＭＤ（ヘッドマウントディスプレイ）などにより実現できる。 The display unit 290 outputs the image generated according to the present embodiment, and its function can be realized by a CRT, LCD, touch panel display, HMD (head mounted display), or the like.

特に、本実施形態では表示部２９０は、タッチパネルディスプレイを用いることによりプレーヤがゲーム操作を行う入力部２６０としても機能する。ここでタッチパネルとして、例えば抵抗膜方式（４線式、５線式）、静電容量方式、電磁誘導方式、超音波表面弾性波方式、赤外線走査方式などのタッチパネルを用いることができる。 In particular, in this embodiment, the display section 290 also functions as an input section 260 through which the player operates the game by using a touch panel display. Here, as the touch panel, for example, a resistive film type (4-wire type, 5-wire type), a capacitance type, an electromagnetic induction type, an ultrasonic surface acoustic wave type, an infrared scanning type, or the like can be used.

音出力部２９２は、本実施形態により生成された音を出力するものであり、その機能は、スピーカ、或いはヘッドフォンなどにより実現できる。 The sound output unit 292 outputs the sound generated according to this embodiment, and its function can be realized by a speaker, headphones, or the like.

通信部２９６は、外部（例えばサーバ装置１０や他の端末装置２０）との間で通信を行うための各種制御を行うものであり、その機能は、各種プロセッサ又は通信用ＡＳＩＣなどのハードウェアや、プログラムなどにより実現できる。 The communication unit 296 performs various controls for communicating with the outside (for example, the server device 10 and other terminal devices 20), and its functions are performed by hardware such as various processors or communication ASICs. This can be achieved by using a program, etc.

なお、端末装置２０は、サーバ装置１０が有する情報記憶媒体１８０や記憶部１７０に記憶されている本実施形態の各部としてコンピュータを機能させるためのプログラムやデータを、ネットワークを介して受信し、受信したプログラムやデータを情報記憶媒体２８０や記憶部２７０に記憶してもよい。このようにプログラムやデータを受信して端末装置２０を機能させる場合も本発明の範囲内に含めることができる。 Note that the terminal device 20 receives programs and data stored in the information storage medium 180 and the storage unit 170 of the server device 10 for making the computer function as each part of the present embodiment via the network. The programs and data may be stored in the information storage medium 280 or the storage unit 270. A case where the terminal device 20 functions by receiving programs and data in this manner can also be included within the scope of the present invention.

処理部２００（プロセッサ）は、入力部２６０からの入力データやプログラムなどに基づいて、サーバ装置１０と連動して、ゲーム処理、画像生成処理、或いは音生成処理などの処理を行う。 The processing unit 200 (processor) performs processing such as game processing, image generation processing, sound generation processing, etc. in conjunction with the server device 10 based on input data and programs from the input unit 260.

特に、本実施形態においては、ゲーム処理として、ゲーム開始条件が満たされた場合にゲームを開始する処理、ゲームを進行させる処理、キャラクタ、敵キャラクタなどのオブ
ジェクトを配置する処理、オブジェクトを表示する処理、ゲーム結果を演算する処理、或いはゲーム終了条件が満たされた場合にゲームを終了する処理などが含まれる。 In particular, in this embodiment, the game processing includes processing for starting the game when game start conditions are met, processing for advancing the game, processing for arranging objects such as characters and enemy characters, and processing for displaying objects. , processing for calculating game results, or processing for ending the game when game ending conditions are met.

また、処理部２００は、入力部２６０によって入力されたプレーヤの操作入力に基づき、ゲーム処理（例えば、対戦ゲーム等の処理）、等を行う。また、処理部２００は、サーバ装置１０と連動して実行してもよいし、その一部又は全部がサーバ装置１０に形成されていてもよい。 Further, the processing unit 200 performs game processing (for example, processing of a competitive game, etc.) based on the player's operation input input through the input unit 260. Further, the processing unit 200 may be executed in conjunction with the server device 10, or a part or all of it may be formed in the server device 10.

また、処理部２００は、記憶部２７０をワーク領域として各種処理を行う。処理部２００の機能は各種プロセッサ（ＣＰＵ、ＤＳＰ等）、ＡＳＩＣ（ゲートアレイ等）などのハードウェアや、プログラムにより実現できる。 Furthermore, the processing unit 200 performs various processes using the storage unit 270 as a work area. The functions of the processing unit 200 can be realized by hardware such as various processors (CPU, DSP, etc.), ASIC (gate array, etc.), and programs.

処理部２００は、通信制御部２１１、オブジェクト空間設定部２１２、移動・動作制御部２１３、操作入力受付部２１４、ゲーム制御部２１５、仮想カメラ制御部２１６、行動制御部２１７、学習部２１８、設定部２１９、判断部２２０、探索部２２１、表示制御部２２２、変化部２２３、画像生成部２３０、音処理部２４０を含む。なお、これらの一部を省略する構成としてもよい。 The processing section 200 includes a communication control section 211, an object space setting section 212, a movement/motion control section 213, an operation input reception section 214, a game control section 215, a virtual camera control section 216, an action control section 217, a learning section 218, and a setting section. section 219 , determination section 220 , search section 221 , display control section 222 , change section 223 , image generation section 230 , and sound processing section 240 . Note that a configuration may be adopted in which some of these are omitted.

通信制御部２１１は、サーバ装置１０、それぞれとデータを送受信する処理を行う。また、通信制御部２１１は、サーバ装置１０から受信したデータを記憶部２７０に格納する処理、受信したデータを解析する処理、その他のデータの送受信に関する制御処理等を行う。 The communication control unit 211 performs processing for transmitting and receiving data with each of the server devices 10. The communication control unit 211 also performs processing for storing data received from the server device 10 in the storage unit 270, processing for analyzing the received data, and other control processing related to transmission and reception of data.

通信制御部２１１は、サーバ装置１０から受信した情報に基づいて対戦ゲームを実行する場合に、又は、ネットワークを介して他のプレーヤと対戦する場合に、サーバ装置１０や他の端末装置２０から送信された各種の情報を受信し、サーバ装置１０や他の端末装置２０に送信する。 The communication control unit 211 transmits information from the server device 10 or other terminal devices 20 when executing a competitive game based on information received from the server device 10 or when competing against other players via a network. It receives various types of information and transmits them to the server device 10 and other terminal devices 20.

通信制御部２１１は、プレーヤからの通信開始の操作入力を受け付けた場合に、他の端末装置２０又はサーバ装置１０との通信を行うようにしてもよい。 The communication control unit 211 may be configured to communicate with another terminal device 20 or the server device 10 when it receives an operation input from the player to start communication.

また、通信制御部２１１は、他の端末装置２０又はサーバ装置１０と通信接続が確立されてから当該通信接続が切断されるまで、データを所定周期で互いに送受信する処理を行う。 Furthermore, the communication control unit 211 performs a process of mutually transmitting and receiving data at a predetermined cycle from the time a communication connection is established with another terminal device 20 or the server device 10 until the communication connection is disconnected.

通信制御部２１１は、サーバ装置１０にプレーヤの識別情報や操作入力を送信して、データ（プレーヤのＷｅｂページ、画面等）をサーバ装置１０から受信する処理を行う。 The communication control unit 211 performs a process of transmitting player identification information and operation input to the server device 10 and receiving data (player's web page, screen, etc.) from the server device 10 .

通信制御部２１１は、所定周期でサーバ装置１０とデータ送受信を行ってもよいし、入力部２６０からの操作入力を受け付けた場合に、サーバ装置１０とデータ送受信を行ってもよい。特に、本実施形態の通信制御部２１１は、所与の画面情報（例えば、画面の情報等）を、サーバ装置１０から受信する処理を行うようにしてもよい。 The communication control unit 211 may transmit and receive data to and from the server device 10 at a predetermined interval, or may transmit and receive data to and from the server device 10 when an operation input is received from the input unit 260. In particular, the communication control unit 211 of this embodiment may perform a process of receiving given screen information (e.g., screen information, etc.) from the server device 10.

さらに、通信制御部２１１は、複数の端末装置２０によって構成されるネットワークシステムの場合は、複数の端末装置２０間でデータの送受信を行いながらオンラインゲームを実行するピア・ツー・ピア（いわゆるＰ２Ｐ）方式によって通信制御を実行してもよいし、サーバ装置１０を介して各端末装置２０がデータ（情報）の送受信を行いながらオンラインゲームを実行するクライアント・サーバ方式によって通信制御を実行してもよい。
なお、本実施形態のゲームシステムでは、有線通信のみならず無線通信でデータを送受信してもよい。 Furthermore, in the case of a network system composed of multiple terminal devices 20, the communication control unit 211 may perform communication control using a peer-to-peer (so-called P2P) method in which an online game is played while sending and receiving data between the multiple terminal devices 20, or may perform communication control using a client-server method in which each terminal device 20 plays an online game while sending and receiving data (information) via the server device 10.
In the game system of this embodiment, data may be transmitted and received not only by wired communication but also by wireless communication.

オブジェクト空間設定部２１２は、オブジェクトをオブジェクト空間（仮想的三次元空間）に配置する処理を行う。例えば、仮想三次元仮想空間中にゲーム空間（例えば、キャラクタが格闘を繰り広げるためのステージ）を形成する処理を行う。キャラクタの他に、建物、球場、車、樹木、柱、壁、マップ（地形）などの表示物を、オブジェクト空間に配置する処理を行う。ここでオブジェクト空間とは、仮想的なゲーム空間であり、例えば、ワールド座標系、仮想カメラ座標系のように、三次元座標（Ｘ，Ｙ，Ｚ）においてオブジェクトが配置される空間である。 The object space setting unit 212 performs processing for arranging objects in an object space (virtual three-dimensional space). For example, a process is performed to form a game space (for example, a stage for characters to fight) in a virtual three-dimensional virtual space. In addition to characters, display objects such as buildings, stadiums, cars, trees, pillars, walls, and maps (terrain) are placed in the object space. Here, the object space is a virtual game space, and is a space in which objects are arranged in three-dimensional coordinates (X, Y, Z), such as a world coordinate system or a virtual camera coordinate system, for example.

例えば、オブジェクト空間設定部２１２は、ワールド座標系にオブジェクト（ポリゴン、自由曲面又はサブディビジョンサーフェスなどのプリミティブで構成されるオブジェクト）を配置する。また、例えば、ワールド座標系でのオブジェクトの位置や回転角度（向き、方向と同義）を決定し、その位置（Ｘ、Ｙ、Ｚ）にその回転角度（Ｘ、Ｙ、Ｚ軸回りでの回転角度）でオブジェクトを配置する。 For example, the object space setting unit 212 arranges an object (an object composed of primitives such as a polygon, a free-form surface, or a subdivision surface) in the world coordinate system. Also, for example, you can determine the position and rotation angle (synonymous with orientation and direction) of an object in the world coordinate system, and add the rotation angle (rotation around the X, Y, and Z axes) to that position (X, Y, Z). position the object at an angle).

オブジェクト空間設定部２１２は、スクリーン（二次元画像、画面、スクリーン座標系）上に、特殊コマンドの入力指示オブジェクト、マーカーなどの指示オブジェクトを配置する処理を行うようにしてもよい。また、オブジェクト空間設定部２１２は、オブジェクト空間（三次元空間、ワールド座標系、仮想カメラ座標系、モデル座標系）に、指示オブジェクトを配置するようにしてもよい。 The object space setting unit 212 may perform a process of arranging a special command input instruction object, marker, or other instruction object on the screen (two-dimensional image, screen, screen coordinate system). Further, the object space setting unit 212 may arrange the instruction object in an object space (three-dimensional space, world coordinate system, virtual camera coordinate system, model coordinate system).

移動・動作制御部２１３は、オブジェクト空間にあるオブジェクトの移動・動作演算を行う。すなわちコントローラ３０から受け付けた操作入力や、プログラム（移動・動作アルゴリズム）や、各種データ（モーションデータ）などに基づいて、オブジェクトをオブジェクト空間内で移動させたり、オブジェクトを動作（モーション、アニメーション）させたりする処理を行う。 The movement/motion control unit 213 performs movement/motion calculations for objects in the object space. That is, based on the operation input received from the controller 30, the program (movement/motion algorithm), various data (motion data), etc., the object is moved within the object space, or the object is made to move (motion, animation). Perform the processing to do.

移動・動作制御部２１３は、オブジェクト空間において、主要なオブジェクトであるキャラクタオブジェクト（キャラクタ）の挙動を制御する。 The movement/motion control unit 213 controls the behavior of the character object (character), which is the main object, in the object space.

具体的には、移動・動作制御部２１３は、コントローラ３０への操作入力に基づくキーデータに基づいて、自機を操作するプレーヤの操作対象のプレーヤキャラクタの動作を制御する処理、当該キャラクタが繰り出す攻撃技のヒット判定処理及び、相手キャラクタから攻撃を受けた際のダメージ処理などを行う。 Specifically, the movement/motion control unit 213 performs a process of controlling the motion of a player character to be operated by the player who operates the player's own machine, based on key data based on the operation input to the controller 30, Performs hit determination processing for attack techniques, damage processing when attacked by an opponent character, etc.

移動・動作制御部２１３は、対戦相手が実プレーヤである場合、対戦相手の端末装置２０において入力されたキーデータを取得し、取得したキーデータに基づいて、相手キャラクタが繰り出す攻撃技の移動・動作、当該攻撃技のヒット判定処理、及び、プレーヤキャラクタから攻撃を受けた際のダメージの移動・動作処理などを行う。 When the opponent is a real player, the movement/motion control unit 213 acquires the key data input on the opponent's terminal device 20, and controls the movement/motion of the attack technique performed by the opponent character based on the acquired key data. It performs actions, hit determination processing for the attack technique, and damage movement/action processing when attacked by the player character.

移動・動作制御部２１３は、対戦相手がコンピュータ（ＣＰＵ）である場合、所定のプログラム、アルゴリズムに基づいて、コンピュータキャラクタ（ＮＰＣ（ノンプレーヤキャラクタ）ともいう）が繰り出す攻撃技の移動・動作、当該攻撃技のヒット判定処理、及び、プレーヤキャラクタから攻撃を受けた際のダメージの移動・動作処理などを行う。つまり、対戦相手がコンピュータである場合、プレーヤによって操作されずに、コンピュータキャラクタの移動・動作の制御を行う。 When the opponent is a computer (CPU), the movement/movement control unit 213 controls the movement/movement of attack techniques performed by a computer character (also referred to as an NPC (non-player character)) based on a predetermined program or algorithm. Performs hit determination processing for attack techniques, damage movement and movement processing when attacked by the player character, etc. That is, when the opponent is a computer, the movements and actions of the computer character are controlled without being operated by the player.

移動・動作制御部２１３は、コンピュータキャラクタが、ＡＩ（Ａｒｔｉｆｉｃｉａｌ
Ｉｎｔｅｌｌｉｇｅｎｃｅの略）によって制御されるＡＩキャラクタである場合、学習モデルを用いて推論される行動による移動・動作を行う。 The movement/motion control unit 213 controls the computer character to
In the case of an AI character controlled by AI (abbreviation for Intelligence), the character moves and moves based on actions inferred using a learning model.

また、移動・動作制御部２１３は、係る処理を１フレーム（例えば、１／６０秒）毎に行う。なお、フレームは、オブジェクトの移動・動作処理や画像生成処理を行う時間の単位である。 Furthermore, the movement/motion control unit 213 performs such processing every frame (for example, 1/60 seconds). Note that a frame is a unit of time for performing object movement/motion processing and image generation processing.

操作入力受付部２１４は、プレーヤによってコントローラ３０に入力された操作入力に基づいて、操作指示コマンドを受け付ける。 The operation input reception unit 214 receives operation instruction commands based on operation inputs input to the controller 30 by the player.

具体的には、本実施形態の操作入力受付部２１４は、入力部２６０によって入力されたキーの種別やキーの指示方向、キーの押し下げ回数、押し下げ時間、押し下げタイミング、及び、これらの組み合わせに基づいて、プレーヤによって入力された操作指示コマンドを検出する。 Specifically, the operation input reception unit 214 of this embodiment detects the operation instruction command input by the player based on the type of key, the direction of the key instruction, the number of times the key is pressed, the press time, the press timing, and a combination of these, input by the input unit 260.

ゲーム制御部２１５は、対戦ゲームの進行に係る処理を実行する。例えば、ゲーム開始条件が満たされた場合にゲームを開始する処理（以下、「ゲーム開始処理」ともいう。）、プレーヤの操作入力に応じて、プレーヤキャラクタを制御して対戦ゲームの進行を制御するゲーム処理、ゲーム終了条件が満たされた場合にゲームを終了する処理（以下、「ゲーム終了処理」ともいう。）、最終ステージをクリアした場合にはエンディングを進行させる処理などがある。 The game control unit 215 executes processing related to the progress of the competitive game. For example, the process of starting the game when game start conditions are met (hereinafter also referred to as "game start process"), and controlling the player character in response to the player's operation input to control the progress of the competitive game. There are game processing, processing for ending the game when game ending conditions are met (hereinafter also referred to as "game ending processing"), and processing for advancing the ending when the final stage is cleared.

プレーヤがコンピュータと対戦する場合、ゲーム制御部２１５は、端末装置２０の操作入力受付部２１４によって受け付けられたプレーヤの操作入力の情報と、コンピュータで制御される情報（例えば、行動制御部２１７によって決定された行動の情報を含む）に基づいて、ゲームの進行を行う。 When a player plays against a computer, the game control unit 215 uses information on the player's operation input accepted by the operation input reception unit 214 of the terminal device 20 and information controlled by the computer (for example, determined by the behavior control unit 217). (including information on the actions taken), the game progresses.

実プレーヤ同士で対戦する場合、ゲーム制御部２１５は、端末装置２０Ａの操作入力受付部２１４によって受け付けられた第１のプレーヤの操作入力の情報と、第２のプレーヤの端末装置２０Ｂから送信された情報であって、端末装置２０Ｂにおける第２のプレーヤの操作入力の情報と、に応じて、ゲームの進行を行う。 When real players play against each other, the game control unit 215 receives information on the first player's operation input received by the operation input reception unit 214 of the terminal device 20A, and information on the operation input of the second player transmitted from the terminal device 20B. The game progresses in accordance with the information of the operation input by the second player on the terminal device 20B.

ゲーム制御部２１５は、第１のキャラクタＯＢ１と第２のキャラクタＯＢ２とが、相互に攻撃や防御を行い、両者の体力値を減少させる。そして、両者の体力値に基づいて勝敗を判定する。例えば、１ラウンドのゲーム中に、先に体力値が所定値（０）に達した方が敗者となり他方が勝者となる。１ラウンドのゲーム期間の終了時において、両者ともに体力値が１以上である場合、体力値が高い方を勝者とし、他方を敗者と判定する。 The game control unit 215 causes the first character OB1 and the second character OB2 to attack and defend against each other, thereby reducing the physical strength values of both characters. Then, the winner or loser is determined based on the physical strength values of both players. For example, during one round of the game, the player whose physical strength reaches a predetermined value (0) first becomes the loser and the other player becomes the winner. At the end of the game period of one round, if both players have a physical strength value of 1 or more, the one with the higher physical strength value is determined to be the winner, and the other player is determined to be the loser.

仮想カメラ制御部２１６は、オブジェクト空間内の所与（任意）の視点から見える画像を生成するための仮想カメラ（視点）の制御処理を行う。 The virtual camera control unit 216 performs control processing of a virtual camera (viewpoint) to generate an image that can be seen from a given (arbitrary) viewpoint in the object space.

具体的には、仮想カメラ制御部２１６は、三次元の画像を生成する場合には、ワールド座標系における仮想カメラの位置（Ｘ、Ｙ、Ｚ）、回転角度（例えば、Ｘ、Ｙ、Ｚ軸の各軸の正方向からみて時計回りに回る場合における回転角度）を制御する処理を行う。 Specifically, when generating a three-dimensional image, the virtual camera control unit 216 controls the position (X, Y, Z) and rotation angle (for example, X, Y, Z axes) of the virtual camera in the world coordinate system. (rotation angle when rotating clockwise when viewed from the positive direction of each axis).

すなわち、仮想カメラ制御部２１６は、仮想カメラの視点位置、視線方向、画角、移動方向、移動速度の少なくとも１つを制御する処理を行う。 That is, the virtual camera control unit 216 performs processing to control at least one of the viewpoint position, line of sight direction, angle of view, moving direction, and moving speed of the virtual camera.

行動制御部２１７は、キャラクタの行動を制御する処理を行う。例えば、行動制御部２１７は、プレーヤの操作入力に基づいて、当該プレーヤの操作対象のキャラクタ（以下、「第１のキャラクタ」ともいう。）の行動を制御する。 The behavior control unit 217 performs processing to control the behavior of the character. For example, the behavior control unit 217 controls the behavior of a character to be operated by the player (hereinafter also referred to as "first character") based on the player's operation input.

また、行動制御部２１７は、学習部２１８において学習（例えば、機械学習）された学習データ（例えば、学習モデル）を用いて、コンピュータによって制御されるコンピュータキャラクタ（以下、「第２のキャラクタ」ともいう。）の行動を制御する。 The behavior control unit 217 also uses learning data (for example, a learning model) learned (for example, machine learning) in the learning unit 218 to create a computer character (hereinafter also referred to as a “second character”) that is controlled by the computer. ) to control their behavior.

行動制御部２１７は、探索部２２１において探索された関連情報に基づいて、第２のキャラクタの行動を制御してもよい。 The behavior control unit 217 may control the behavior of the second character based on the related information searched by the search unit 221.

「行動」は、１つの行動でもよいし、継続条件を満たす複数の行動に基づく連続行動を含んでもよい。 "Action" may be one action, or may include continuous action based on a plurality of actions that satisfy the continuation condition.

行動制御部２１７は、予め第１のキャラクタの行動の種類毎に、当該行動に対応する第２のキャラクタの行動を定義してもよい。 The behavior control unit 217 may predefine, for each type of behavior of the first character, a behavior of the second character that corresponds to that behavior.

行動制御部２１７は、第２のキャラクタの行動を定義したデータ、及び、学習データの少なくとも一方を用いて、第２のキャラクタの行動を制御してもよい。予め定義される第２のキャラクタの行動の数は１つでもよいし複数でもよい。 The behavior control unit 217 may control the behavior of the second character using at least one of data defining the behavior of the second character and learning data. The number of predefined actions of the second character may be one or more.

例えば、行動制御部２１７は、探索された関連情報の一致度が所定値以上でない場合に、検出された未学習の当該第１のキャラクタの行動に対応する予め定義された行動を、第２のキャラクタに行わせるようにしてもよい。 For example, if the matching degree of the searched related information is not greater than or equal to a predetermined value, the behavior control unit 217 may change the predefined behavior corresponding to the detected unlearned behavior of the first character to a second one. You may also have the character do it.

また、行動制御部２１７は、探索された関連情報の一致度が所定値以上でない場合に、検出された未学習の当該第１のキャラクタの行動に対応する複数の予め定義された行動の中から、１つの行動を所定の条件で選択し、選択された行動を第２のキャラクタに行わせるようにしてもよい。 In addition, when the degree of matching of the searched related information is not equal to or higher than a predetermined value, the behavior control unit 217 selects one of a plurality of predefined behaviors corresponding to the detected unlearned behavior of the first character. , one action may be selected under predetermined conditions, and the second character may be made to perform the selected action.

学習部２１８は、第１のキャラクタの行動と当該行動に関連する関連情報とを学習（例えば、機械学習）する。例えば、端末装置２０は、関連情報を入力すると、キャラクタの行動（行動情報）が出力される学習モデルを機械学習によって生成する。 The learning unit 218 learns (for example, machine learning) the behavior of the first character and related information related to the behavior. For example, when the terminal device 20 receives related information, it generates a learning model that outputs a character's behavior (behavior information) by machine learning.

設定部２１９は、ゲーム期間において、学習（例えば、機械学習）を行う頻度が高い期間として、所定期間を設ける。例えば、設定部２１９は、ゲーム期間の開始期間において、所定期間を設けるようにしてもよい。また、設定部２１９は、プレーヤの操作入力に基づいて、所定期間を設定可能とするようにしてもよい。 The setting unit 219 sets a predetermined period during the game period as a period in which learning (for example, machine learning) is performed frequently. For example, the setting unit 219 may set a predetermined period in the start period of the game period. Further, the setting unit 219 may be configured to be able to set the predetermined period based on the player's operation input.

設定部２１９は、関連情報を構成する複数の項目の組み合わせを設定してもよい。なお、「項目」は、要素或いは種類を言い換えてもよい。 The setting unit 219 may set a combination of a plurality of items that constitute the related information. Note that "item" may also be interpreted as an element or type.

設定部２１９は、プレーヤの情報に基づいて、学習前（例えば、機械学習前）に、関連情報を構成する複数の項目の組み合わせ、学習頻度、及び、関連情報を構成する複数の項目それぞれの重み、の少なくとも１つを設定してもよい。 Based on player information, the setting unit 219 determines, before learning (for example, before machine learning), a combination of a plurality of items constituting the related information, a learning frequency, and a weight of each of the plurality of items constituting the related information. , at least one of the following may be set.

また、設定部２１９は、プレーヤの情報に基づいて、ゲーム進行状況に応じて、関連情報を構成する複数の項目の組み合わせ、学習頻度、及び、関連情報を構成する複数の項目それぞれの重み、の少なくとも１つを設定してもよい。 Furthermore, the setting unit 219 determines the combination of the plurality of items forming the related information, the learning frequency, and the weight of each of the plurality of items forming the related information according to the game progress based on the player's information. At least one may be set.

設定部２１９は、プレーヤの情報に応じて、関連情報を構成する複数の項目それぞれの重みを設定してもよい。
The setting unit 219 may set weights for each of a plurality of items forming the related information according to the player's information.

設定部２１９は、第１のキャラクタの情報に応じて、関連情報を構成する各項目の重みを設定してもよい。 The setting unit 219 may set the weight of each item forming the related information according to the information of the first character.

設定部２１９は、ゲーム情報に応じて、関連情報を構成する各項目の重みを設定してもよい。 The setting unit 219 may set the weight of each item constituting the related information according to the game information.

設定部２１９は、プレーヤの操作入力に基づき、関連情報を構成する複数の項目のうち、重みが高い項目の選択を受け付けるようにしてもよい。設定部２１９は、プレーヤの操作入力に基づき、関連情報の複数の項目それぞれの優先順位を設定するようにしてもよい。設定部２１９は、なお、プレーヤの情報は、当該プレーヤのプレイ履歴に関する情報を含んでもよい。また、プレーヤの情報は、当該プレーヤの過去のプレイ傾向に関する情報を含んでもよい。また、プレーヤの情報は、当該プレーヤの所定情報（例えば、レベル）に基づいて決定される他のプレーヤに関連する情報を含むようにしてもよい。 The setting unit 219 may accept a selection of a high-weighted item from among multiple items constituting the related information based on the player's operation input. The setting unit 219 may set a priority order for each of the multiple items of the related information based on the player's operation input. The player information of the setting unit 219 may include information on the play history of the player. The player information may also include information on the player's past play tendencies. The player information may also include information related to other players that is determined based on predetermined information (e.g., level) of the player.

判断部２２０は、未学習の第１のキャラクタの行動及び当該行動の関連情報を検出したか否かを判断する。 The determination unit 220 determines whether an unlearned action of the first character and related information of the action have been detected.

探索部２２１は、未学習の第１のキャラクタの行動及び当該行動の関連情報を検出した場合に、学習済みの第１のキャラクタの行動に関連する関連情報の中から、検出された未学習の当該第１のキャラクタの行動に関連する関連情報と一致度が高い関連情報を探索する。 When the search unit 221 detects an unlearned behavior of the first character and related information of the behavior, the search unit 221 selects the detected unlearned behavior from among the related information related to the learned behavior of the first character. Search for related information that has a high degree of matching with related information related to the behavior of the first character.

つまり、探索部２２１は、未学習の第１のキャラクタの行動及び当該行動の関連情報を検出した場合に、学習済みの第１のキャラクタの行動に関連する関連情報の中から、検出された未学習の当該第１のキャラクタの行動に関連する関連情報と同一の関連情報を探索する。学習済みの第１のキャラクタの行動に関連する関連情報の中に、検出された未学習の当該第１のキャラクタの行動に関連する関連情報と同一の関連情報がない場合は、最も近似する関連情報を探索する。 That is, when the search unit 221 detects an unlearned behavior of the first character and related information of the behavior, the search unit 221 selects the detected unlearned behavior from among the related information related to the learned behavior of the first character. Search for related information that is the same as related information related to the behavior of the first character in learning. If there is no related information related to the behavior of the learned first character that is the same as the detected related information related to the behavior of the unlearned first character, the most similar related information Explore information.

探索部２２１は、未学習の第１のキャラクタの連続行動及び当該連続行動に関連する関連情報を検出した場合に、学習済みの第１のキャラクタの連続行動に関連する関連情報の中から、検出された未学習の当該第１のキャラクタの連続行動に関連する関連情報と一致度が高い関連情報を探索する。 When the search unit 221 detects a continuous action of an unlearned first character and related information related to the continuous action, the search unit 221 detects the detected continuous action from among the related information related to the continuous action of the learned first character. The related information that has a high degree of matching with the related information related to the continuous actions of the unlearned first character is searched.

表示制御部２２２は、画面を表示部２９０に表示する処理を行う。例えば、端末装置２０は、生成した画像を表示部２９０に表示する。 The display control unit 222 performs processing to display a screen on the display unit 290. For example, the terminal device 20 displays the generated image on the display unit 290.

表示制御部２２２は、サーバ装置１０から受信した画面の表示情報を表示する。なお、表示制御部２２２は、Ｗｅｂブラウザを用いて表示情報や画面を表示してもよい。 The display control unit 222 displays the screen display information received from the server device 10. Note that the display control unit 222 may display display information and screens using a web browser.

例えば、表示制御部２２２は、ゲーム画面に、現在の第１のキャラクタＯＢ１のキャラクタ名、レベル（段位）、体力値を表示し、第２のキャラクタＯＢ２のキャラクタ名、レベル（段位）、体力値を表示する。 For example, the display control unit 222 displays on the game screen the character name, level (rank), and stamina value of the current first character OB1, and the character name, level (rank), and stamina value of the second character OB2.

変化部２２３は、プレーヤの情報又は第１のキャラクタの情報に基づいて、重みづけを行う期間を変化させる。 The changing unit 223 changes the weighting period based on the player's information or the first character's information.

画像生成部２３０は、処理部２００で行われる種々の処理の結果に基づいて描画処理を行い、これにより画像を生成し、表示部２９０に出力する。 The image generation unit 230 performs drawing processing based on the results of various processes performed by the processing unit 200, thereby generating an image and outputting it to the display unit 290.

すなわち、画像生成部２３０は、オブジェクト空間において、仮想カメラから見える画像を生成する。 That is, the image generation unit 230 generates an image visible from the virtual camera in the object space.

例えば、画像生成部２３０は、オブジェクト（モデル）の各頂点の頂点データ（頂点の位置座標、テクスチャ座標、色データ、法線ベクトル或いはα値等）を含むオブジェクトデータ（モデルデータ）が入力され、入力されたオブジェクトデータに含まれる頂点データに基づいて、頂点処理（頂点シェーダによるシェーディング）が行われる。なお頂点処理を行うに際して、必要に応じてポリゴンを再分割するための頂点生成処理（テッセレーション、曲面分割、ポリゴン分割）を行うようにしてもよい。 For example, the image generation unit 230 receives object data (model data) including vertex data (vertex position coordinates, texture coordinates, color data, normal vector, α value, etc.) of each vertex of the object (model), Vertex processing (shading by a vertex shader) is performed based on vertex data included in the input object data. Note that when performing vertex processing, vertex generation processing (tessellation, curved surface division, polygon division) for redividing polygons may be performed as necessary.

頂点処理では、頂点処理プログラム（頂点シェーダプログラム、第１のシェーダプログラム）に従って、頂点の移動処理や、座標変換、例えばワールド座標変換、視野変換（カメラ座標変換）、クリッピング処理、透視変換（投影変換）、ビューポート変換等のジオメトリ処理が行われ、その処理結果に基づいて、オブジェクトを構成する頂点群について与えられた頂点データを変更（更新、調整）する。 In vertex processing, according to the vertex processing program (vertex shader program, first shader program), vertex movement processing, coordinate transformation, such as world coordinate transformation, field of view transformation (camera coordinate transformation), clipping processing, perspective transformation (projection transformation) ), geometry processing such as viewport conversion is performed, and based on the processing results, vertex data given for a group of vertices forming the object is changed (updated, adjusted).

そして、頂点処理後の頂点データに基づいてラスタライズ（走査変換）が行われ、ポリゴン（プリミティブ）の面とピクセルとが対応づけられる。そして、ラスタライズに続いて、画像を構成するピクセル（表示画面を構成するフラグメント）を描画するピクセル処理（ピクセルシェーダによるシェーディング、フラグメント処理）が行われる。 Then, rasterization (scan conversion) is performed based on the vertex data after vertex processing, and the faces of polygons (primitives) and pixels are associated. Then, following rasterization, pixel processing (shading by a pixel shader, fragment processing) is performed to draw pixels that make up the image (fragments that make up the display screen).

ピクセル処理では、ピクセル処理プログラム（ピクセルシェーダプログラム、第２のシェーダプログラム）に従って、テクスチャの読出し（テクスチャマッピング）、色データの設定／変更、半透明合成、アンチエイリアス等の各種処理を行って、画像を構成するピクセルの最終的な描画色を決定し、透視変換されたオブジェクトの描画色を画像バッファ２７２（ピクセル単位で画像情報を記憶できるバッファ。ＶＲＡＭ、レンダリングターゲット、フレームバッファ）に出力（描画）する。 In pixel processing, various processes such as reading textures (texture mapping), setting/changing color data, semi-transparent compositing, and anti-aliasing are performed according to a pixel processing program (pixel shader program, second shader program) to determine the final drawing color of the pixels that make up the image, and the drawing color of the perspective transformed object is output (drawn) to the image buffer 272 (a buffer that can store image information on a pixel-by-pixel basis; VRAM, rendering target, frame buffer).

すなわち、ピクセル処理では、画像情報（色、法線、輝度、α値等）をピクセル単位で設定あるいは変更するパーピクセル処理を行う。これにより、オブジェクト空間内において仮想カメラ（所与の視点）から見える画像が生成される。 That is, pixel processing involves per-pixel processing, where image information (color, normal, brightness, alpha value, etc.) is set or changed on a pixel-by-pixel basis. This generates an image that can be seen from a virtual camera (a given viewpoint) in object space.

なお、頂点処理やピクセル処理は、シェーディング言語によって記述されたシェーダプログラムによって、ポリゴン（プリミティブ）の描画処理をプログラム可能にするハードウェア、いわゆるプログラマブルシェーダ（頂点シェーダやピクセルシェーダ）により実現される。 Note that vertex processing and pixel processing are realized by hardware that allows drawing processing of polygons (primitives) to be programmed using a shader program written in a shading language, a so-called programmable shader (vertex shader or pixel shader).

プログラマブルシェーダでは、頂点単位の処理やピクセル単位の処理がプログラム可能になることで描画処理内容の自由度が高く、従来のハードウェアによる固定的な描画処理に比べて表現力を大幅に向上させることができる。 Programmable shaders have a high degree of freedom in the content of drawing processing because they allow processing on a vertex-by-vertex and pixel-by-pixel basis, greatly improving expressiveness compared to fixed drawing processing using conventional hardware. I can do it.

そして画像生成部２３０は、オブジェクトを描画する際に、ジオメトリ処理、テクスチャマッピング、隠面消去処理、αブレンディング等を行う。 The image generation unit 230 performs geometry processing, texture mapping, hidden surface removal processing, α-blending, etc. when drawing the object.

ジオメトリ処理では、オブジェクトに対して、座標変換、クリッピング処理、透視投影変換、或いは光源計算等の処理が行われる。そして、ジオメトリ処理後（透視投影変換後）のオブジェクトデータ（オブジェクトの頂点の位置座標、テクスチャ座標、色データ（輝度データ）、法線ベクトル、或いはα値等）は、記憶部２７０に保存される。 In the geometry processing, processing such as coordinate transformation, clipping processing, perspective projection transformation, or light source calculation is performed on the object. Then, object data (position coordinates of object vertices, texture coordinates, color data (luminance data), normal vector, or α value, etc.) after geometry processing (perspective projection transformation) is stored in the storage unit 270. .

テクスチャマッピングは、記憶部２７０に記憶されるテクスチャ（テクセル値）をオブ
ジェクトにマッピングするための処理である。 Texture mapping is a process for mapping a texture (texel value) stored in the storage unit 270 to an object.

具体的には、オブジェクトの頂点に設定（付与）されるテクスチャ座標等を用いて記憶部２７０からテクスチャ（色（ＲＧＢ）、α値などの表面プロパティ）を読み出す。そして、二次元の画像であるテクスチャをオブジェクトにマッピングする。この場合に、ピクセルとテクセルとを対応づける処理や、テクセルの補間としてバイリニア補間などを行う。 Specifically, the texture (surface properties such as color (RGB), alpha value, etc.) is read from the storage unit 270 using texture coordinates and the like set (assigned) to the vertices of the object. Then, the texture, which is a two-dimensional image, is mapped onto the object. In this case, a process is performed to associate pixels with texels, and bilinear interpolation is performed to interpolate the texels.

隠面消去処理としては、描画ピクセルのＺ値（奥行き情報）が格納されるＺバッファ（奥行きバッファ）を用いたＺバッファ法（奥行き比較法、Ｚテスト）による隠面消去処理を行うことができる。 Hidden surface removal processing can be performed using a Z buffer method (depth comparison method, Z test) using a Z buffer (depth buffer) in which the Z value (depth information) of a drawn pixel is stored. .

すなわち、オブジェクトのプリミティブに対応する描画ピクセルを描画する際に、Ｚバッファに格納されるＺ値を参照する。そして参照されたＺバッファのＺ値と、プリミティブの描画ピクセルでのＺ値とを比較し、描画ピクセルでのＺ値が、仮想カメラから見て手前側となるＺ値（例えば小さなＺ値）である場合には、その描画ピクセルの描画処理を行うとともにＺバッファのＺ値を新たなＺ値に更新する。 That is, when drawing a drawing pixel corresponding to a primitive of an object, the Z value stored in the Z buffer is referred to. Then, the Z value of the referenced Z buffer is compared with the Z value of the drawing pixel of the primitive, and the Z value of the drawing pixel is the Z value that is on the near side as seen from the virtual camera (for example, a small Z value). If so, the drawing process for that drawing pixel is performed and the Z value of the Z buffer is updated to a new Z value.

αブレンディング（α合成）は、α値（Ａ値）に基づく半透明合成処理（通常αブレンディング、加算αブレンディング又は減算αブレンディング等）のことである。 α blending (α synthesis) is a translucent synthesis process (normal α blending, additive α blending, subtractive α blending, etc.) based on the α value (A value).

例えば、αブレンディングでは、これから画像バッファ２７２に描画する描画色（上書きする色）Ｃ１と、既に画像バッファ２７２（レンダリングターゲット）に描画されている描画色（下地の色）Ｃ２とを、α値に基づいて線形合成処理を行う。つまり、最終的な描画色をＣとすると、Ｃ＝Ｃ１＊α＋Ｃ２＊（１－α）によって求めることができる。 For example, in alpha blending, a linear blending process is performed based on the alpha value on the drawing color (overwrite color) C1 that will be drawn in the image buffer 272 and the drawing color (base color) C2 that has already been drawn in the image buffer 272 (rendering target). In other words, if the final drawing color is C, it can be calculated as C = C1 * alpha + C2 * (1 - alpha).

なお、α値は、各ピクセル（テクセル、ドット）に関連づけて記憶できる情報であり、例えば色情報以外のプラスアルファの情報である。α値は、マスク情報、半透明度（透明度、不透明度と等価）、バンプ情報などとして使用できる。 Note that the α value is information that can be stored in association with each pixel (texel, dot), and is, for example, plus alpha information other than color information. The α value can be used as mask information, translucency (equivalent to transparency or opacity), bump information, etc.

また、画像生成部２３０は、他の端末（第２の端末）とネットワークを介してデータを送受信するマルチプレーヤオンラインゲームを行う場合は、端末（第１の端末）の操作対称のオブジェクトの移動に追従する仮想カメラ（端末（第１の端末）で制御される仮想カメラ）から見える画像を生成する処理を行う。つまり、各端末が独立した描画処理を行う。 When playing a multiplayer online game in which data is sent and received over a network with another terminal (second terminal), the image generation unit 230 performs processing to generate an image seen from a virtual camera (a virtual camera controlled by the terminal (first terminal)) that follows the movement of an object that is the subject of operation of the terminal (first terminal). In other words, each terminal performs independent drawing processing.

音処理部２４０は、処理部２００で行われる種々の処理の結果に基づいて音処理を行い、ＢＧＭ、効果音、又は音声などのゲーム音を生成し、音出力部２９２に出力する。なお、音処理部２４０は、サーバ装置１０の音処理部１４０と全部又は一部の処理と同様の処理を行ってもよい。 The sound processing unit 240 performs sound processing based on the results of various processes performed by the processing unit 200, generates game sounds such as BGM, sound effects, or voices, and outputs them to the sound output unit 292. Note that the sound processing unit 240 may perform the same processing as the sound processing unit 140 of the server device 10 in whole or in part.

なお、本実施形態の端末装置２０は、１人のプレーヤのみがプレイできるシングルプレーヤモード、或いは、複数のプレーヤがプレイできるマルチプレーヤモードでゲームプレイできるように制御してもよい。例えば、マルチプレーヤモードで制御する場合には、ネットワークを介して他の端末装置２０とデータを送受信してゲーム処理を行うようにしてもよいし、１つの端末装置２０が、複数の入力部からの操作入力に基づいて処理を行うようにしてもよい。 Note that the terminal device 20 of this embodiment may be controlled so that the game can be played in a single player mode in which only one player can play, or in a multiplayer mode in which a plurality of players can play. For example, when controlling in multi-player mode, game processing may be performed by transmitting and receiving data to and from other terminal devices 20 via a network, or one terminal device 20 may receive and receive data from multiple input units. The process may be performed based on the operation input.

本実施形態の端末装置２０が、Ｐ２Ｐなどによって複数の端末装置２０でゲームを行う場合は、一の端末装置２０がホストとしての役割を担い、処理を実行してもよい。かかる
場合、ホストとしての端末装置２０は、サーバ装置１０の各処理部（各処理部の一部又は全部）の処理を行うようにしてもよい。 When the terminal device 20 of this embodiment plays a game with a plurality of terminal devices 20 using P2P or the like, one terminal device 20 may play the role of a host and execute the process. In such a case, the terminal device 20 as a host may perform the processing of each processing section (a part or all of each processing section) of the server device 10.

また、本実施形態の端末装置２０は、サーバ装置１０と同等の処理を行い、当該端末装置２０の表示部に画面を表示するようにしてもよい。 Further, the terminal device 20 of this embodiment may perform the same processing as the server device 10 and display a screen on the display unit of the terminal device 20.

［４］概要
本実施形態の端末装置（学習装置）２０は、格闘ゲームをプレイ可能な装置である。本実施形態では、説明の便宜上、端末装置２０において、プレーヤＰ１がゲームプレイする場合を例にとり説明する。 [4] Overview The terminal device (learning device) 20 of this embodiment is a device that can play fighting games. In the present embodiment, for convenience of explanation, an example will be described in which the player P1 plays a game on the terminal device 20.

図５は、プレーヤＰ１の端末装置２０に表示されるゲーム画面の一例を示す。ゲームは、プレーヤＰ１の操作対象のキャラクタＯＢ１（第１のキャラクタＯＢ１とも言う）と、対戦相手のキャラクタＯＢ２（第２のキャラクタＯＢ２とも言う）とが、相互に攻撃や防御を行い互いの体力値を減少させ、体力値に基づいて勝敗を判定する格闘ゲームである。 FIG. 5 shows an example of a game screen displayed on the terminal device 20 of the player P1. In the game, the character OB1 (also referred to as the first character OB1) operated by the player P1 and the opponent character OB2 (also referred to as the second character OB2) attack and defend each other, and each other's physical strength It is a fighting game that determines victory or defeat based on physical strength.

本実施形態では、プレーヤＰ１が格闘ゲームを練習するために、実在しない仮想の対戦相手と対戦する練習モードのゲームを提供する。仮想の対戦相手とは、例えば、コンピュータ（ＣＰＵ）で制御されるコンピュータキャラクタである。本実施形態のコンピュータキャラクタは、ＡＩで行動情報が決定されるため、ＡＩキャラクタと言い換えてもよい。 In this embodiment, a practice mode game is provided in which player P1 can practice a fighting game by fighting against a non-existent virtual opponent. The virtual opponent is, for example, a computer character controlled by a computer (CPU). The computer character in this embodiment may also be called an AI character, since its behavior information is determined by AI.

本実施形態の端末装置２０は、図６に示すように、第１のキャラクタＯＢ１の行動（例えば、右パンチ）の情報及び当該行動の関連情報（例えば、右パンチ発動時の第１のキャラクタＯＢ１と第２のキャラクタＯＢ２との距離等）とを学習データ（例えば、教師データ）とし、当該学習データを用いて機械学習によって学習モデルを生成する。 As shown in FIG. 6, the terminal device 20 of the present embodiment includes information about the action of the first character OB1 (for example, right punch) and information related to the action (for example, when the first character OB1 executes the right punch). and the distance to the second character OB2) as learning data (for example, teacher data), and a learning model is generated by machine learning using the learning data.

つまり、端末装置２０は、「第１のキャラクタＯＢ１の行動」と、「第１のキャラクタＯＢ１の行動に関連する関連情報」とを機械学習する。 In other words, the terminal device 20 performs machine learning of the "behavior of the first character OB1" and "related information related to the behavior of the first character OB1."

そして、本実施形態では、学習済の学習モデルを利用して、第２のキャラクタＯＢ２の行動を制御する。つまり、第２のキャラクタＯＢ２は、プレーヤＰ１の操作態様を再現する。 In this embodiment, the behavior of the second character OB2 is controlled using the learned learning model. In other words, the second character OB2 reproduces the operation mode of the player P1.

例えば、端末装置２０は、第１のキャラクタＯＢ１が「右パンチ」を発動したときに、「右パンチ」の行動の情報と、「右パンチ」の関連情報とをセットで機械学習する。 For example, when the first character OB1 performs a "right punch", the terminal device 20 performs machine learning on the action information of the "right punch" and the related information of the "right punch" as a set.

端末装置２０は、機械学習後、第２のキャラクタＯＢ２は次のように制御する。例えば、図６に示すように、第１のキャラクタＯＢ１の「右パンチ」が発動（開始）する。そして、第１のキャラクタＯＢ１が「右パンチ」を繰り出したときの関連情報を学習モデルに入力し、出力データを取得する。例えば、端末装置２０は、出力データとして「足技カウンター」の行動を取得すると、取得した「足技カウンター」の行動に基づいて、第２のキャラクタＯＢ２に対し、「足技カウンター」の行動の移動・動作制御を開始する。 After machine learning, the terminal device 20 controls the second character OB2 as follows. For example, as shown in FIG. 6, the first character OB1 launches (starts) a "right punch." Then, relevant information when the first character OB1 delivers the "right punch" is input into the learning model to obtain output data. For example, when the terminal device 20 obtains a "footwork counter" action as output data, it begins to control the movement and motion of the "footwork counter" action for the second character OB2 based on the obtained "footwork counter" action.

ところで、第１のキャラクタＯＢ１の行動や当該行動の関連情報が、学習モデルにおいて学習済みであれば第２のキャラクタＯＢ２の行動に問題ない。しかし、第１のキャラクタＯＢ１の行動やその関連情報が学習モデルにおいて未学習（学習不足）である場合、第２のキャラクタＯＢ２の行動が決められない、或いは、行動が不適切になる問題がある。 However, if the actions of the first character OB1 and related information about said actions have already been learned in the learning model, there is no problem with the actions of the second character OB2. However, if the actions of the first character OB1 and related information about said actions have not yet been learned (lack of learning) in the learning model, there is a problem that the actions of the second character OB2 cannot be determined, or the actions become inappropriate.

そこで、端末装置２０は、未学習の、第１のキャラクタＯＢ１の行動及び当該行動の関連情報が検出された場合に、次のように処理を行う。 Therefore, when the unlearned behavior of the first character OB1 and related information of the behavior are detected, the terminal device 20 performs the following process.

例えば、図７に示すように、未学習の、第１のキャラクタＯＢ１の「ジャンプキック」及び「ジャンプキック」の関連情報が検出されたとする。 For example, as shown in FIG. 7, it is assumed that unlearned "jump kick" and "jump kick" related information of the first character OB1 are detected.

本実施形態では、第１のキャラクタＯＢ１の行動（例えば、「ジャンプキック」）及び「ジャンプキック」の関連情報が、学習モデルにおいて未学習である場合、第１のキャラクタＯＢ１の「ジャンプキック」の関連情報（例えば、第１のキャラクタＯＢ１と第２のキャラクタＯＢ２との距離）から、一致度の高い、既に学習済みの第１のキャラクタＯＢ１の行動の関連情報を探索（検索）する。例えば、探索結果として、「投げ技」の関連情報が探索される。すると、当該関連情報（「投げ技」に対応する行動の関連情報）とセットで学習された行動「投げ技」に対応する行動（例えば、「後ずさり防御」）を、第２のキャラクタＯＢ２に行わせる。 In the present embodiment, if the action of the first character OB1 (for example, "jump kick") and the related information of the "jump kick" have not been learned in the learning model, the "jump kick" of the first character OB1 is Based on the related information (for example, the distance between the first character OB1 and the second character OB2), information related to the behavior of the first character OB1 that has a high degree of matching and has already been learned is searched for. For example, as a search result, information related to "throwing technique" is searched. Then, the second character OB2 performs an action (for example, "backward defense") that corresponds to the action "throwing technique" learned in combination with the related information (related information of the action corresponding to "throwing technique"). let

このように、本実施形態では、未学習の第１のキャラクタＯＢ１の行動（例えば、「ジャンプキック」）に遭遇しても、関連情報の一致度が高い行動（例えば、「投げ技」）に対応する行動（例えば、「後ずさり防御」）を、第２のキャラクタＯＢ２に行わせることができる。 In this way, in this embodiment, even if the second character OB2 encounters an unlearned action of the first character OB1 (e.g., a "jump kick"), it is possible to have the second character OB2 perform an action (e.g., "stepping back to defend") that corresponds to an action with a high degree of match in the related information (e.g., a "throw").

また、本実施形態では、プレーヤの操作入力に基づいて行動を制御する第１のキャラクタの行動及び当該行動の関連情報を機械学習するので、プレーヤＰ１は、プレーヤＰ１の癖を反映した第２のキャラクタＯＢ２と対戦できる。 In addition, in this embodiment, since the behavior of the first character whose behavior is controlled based on the player's operation input and the related information of the behavior are machine learned, the player P1 can learn the behavior of the first character that controls the behavior based on the player's operation input, and therefore the player P1 can learn the behavior of the first character that controls the behavior based on the player's operation input. You can fight against character OB2.

なお、本実施形態のゲームシステムは、もちろん、実際に存在する複数のプレーヤ同士で対戦ゲームプレイできるが、説明の便宜上、プレーヤＰ１の操作対象のキャラクタＯＢ１と、コンピュータによって制御されるキャラクタＯＢ２とが対戦するゲームの例について説明する。 Note that, of course, the game system of this embodiment allows a plurality of players who actually exist to play a competitive game, but for the sake of explanation, the character OB1 to be operated by the player P1 and the character OB2 controlled by the computer are An example of a competitive game will be explained.

［５］行動情報の説明
［５．１］行動の種類
次にキャラクタの行動について説明する。「行動」とは、前進、後退、ジャンプ、しゃがむなどの移動動作、右パンチ、右キック、左キック、左パンチ、投げ技などの攻撃の動作、後ずさり防御、ガードなどの防御の動作、コンボなどの連続行動、何もしないニュートラルを含む。 [5] Description of behavior information [5.1] Types of behavior Next, the behavior of the character will be explained. ``Actions'' include moving movements such as moving forward, retreating, jumping, and crouching; attacking movements such as right punch, right kick, left kick, left punch, and throwing techniques; defensive movements such as retreating defense, guarding, and combos. Continuous actions, including neutrality of doing nothing.

なお、「連続行動」は、所定の継続条件を満たす複数の行動であり、「コンボ」又は「連続攻撃」と言ってもよい。本実施形態の行動の種類（項目）は複数存在するが、少なくとも１つの種類があればよい。 Note that a "continuous action" is a plurality of actions that satisfy a predetermined continuation condition, and may also be referred to as a "combo" or a "continuous attack." Although there are multiple types (items) of actions in this embodiment, it is sufficient to have at least one type.

［５．２］コマンド
コマンドは、プレーヤが入力すべきキーの種類や入力順を定義した情報である。コマンドは、キャラクタ毎に、予め、行動に対応付けて定義される。 [5.2] Commands A command is information that defines the type of keys to be input by the player and the input order. Commands are defined in advance for each character in association with their actions.

本実施形態の端末装置２０は、プレーヤＰ１から入力されたキー履歴に基づき、行動を決定する。 The terminal device 20 of this embodiment determines an action based on the key history input from the player P1.

図８は、キャラクタ（例えば、第１のキャラクタＯＢ１）の行動内容及び行動に対応付けられるコマンドの一例を示す。なお、キャラクタが右に向いているときのコマンドの一例である。また、行動ＩＤに対応付けて行動内容及びコマンドをゲーム情報記憶部２７４に記憶する。 FIG. 8 shows an example of the action content of a character (for example, first character OB1) and commands associated with the action. Note that this is an example of a command when the character is facing right. Further, the action details and commands are stored in the game information storage unit 274 in association with the action ID.

例えば、操作対象のキャラクタＯＢ１を前方向に移動させる「前進」は十字型の方向スイッチの右方向への入力を行う。操作対象のキャラクタＯＢ１を後ろ方向に移動させる「後退」は十字型の方向スイッチの左方向への入力を行う。 For example, "forward" to move the character OB1 to be operated forward is performed by inputting a cross-shaped direction switch to the right. ``Backward'' to move the character OB1 to be operated backward is performed by inputting the cross-shaped direction switch to the left.

操作対象の第１のキャラクタＯＢ１を右パンチさせる場合は、△マークのボタン３３ａの入力を行う。 If the first character OB1 to be operated is to be punched to the right, the button 33a marked with a △ mark is input.

操作対象の第１のキャラクタＯＢ１にコンボＡを行わせる場合は、プレーヤは、例えば、３秒以内に、右パンチのコマンド（例えば、△マークのボタン３３ａ）を２回入力した後、続けて左キックのコマンド（例えば、×マークのボタン３３ｃ）を１回入力する。 In order to have the first character OB1 to be operated perform Combo A, the player inputs the right punch command (for example, the △ mark button 33a) twice within 3 seconds, and then consecutively inputs the left punch command. A kick command (for example, the x mark button 33c) is input once.

［５．３］第２のキャラクタの行動
本実施形態では、機械学習された学習済みの学習モデルを用いて行動を決定し、決定された行動を第２のキャラクタＯＢ２に行わせる。例えば、図９に示すように、第１のキャラクタＯＢ１の行動（右パンチ）の行動（技）がタイミングＴ１１で開始（発動）する。端末装置２０は、タイミングＴ１１で、第１のキャラクタＯＢ１の右パンチの行動を検出する。例えば、端末装置２０は、タイミングＴ１２において、関連情報を学習モデルに入力し、出力結果である「足技カウンター」の行動を取得し、第２のキャラクタＯＢ２に「足技カウンター」の動作を開始する。タイミングＴ１１とタイミングＴ１２とは、若干のずれがある（１～数フレームのずれが生じる）。また、第２のキャラクタＯＢ２が新たな行動が開始できない状態も有り得る。例えば、第２のキャラクタＯＢ２が硬直状態（行動不能状態）や、既に行っているアクションフレームの実行中の場合は、新たな行動を開始できない。かかる場合は、第２のキャラクタＯＢ２が新たな行動ができるようになった時点の関連情報を学習モデルに入力し、出力結果の行動を取得し、第２のキャラクタＯＢ２に取得した行動を開始する。 [5.3] Action of the second character In the present embodiment, an action is determined using a machine-learned learning model, and the second character OB2 is caused to perform the determined action. For example, as shown in FIG. 9, the action (skill) of the first character OB1 (right punch) starts (activates) at timing T11. The terminal device 20 detects the right punch action of the first character OB1 at timing T11. For example, at timing T12, the terminal device 20 inputs the related information into the learning model, obtains the output result of the "foot technique counter" action, and starts the "foot skill counter" action on the second character OB2. There is a slight difference between timing T11 and timing T12 (a difference of one to several frames occurs). Furthermore, there may be a situation in which the second character OB2 cannot start a new action. For example, if the second character OB2 is in a rigid state (incapable of action) or is currently executing an action frame that has already been performed, a new action cannot be started. In such a case, the relevant information at the time when the second character OB2 is able to perform a new action is input into the learning model, the output result action is obtained, and the second character OB2 starts the obtained action. .

［５．４］行動情報の項目
行動情報は、行動を特定するための情報であり、例えばアクション番号（識別子）とすることができる。なお、行動情報は、複数の項目（要素）によって構成されていてもよい。例えば、行動情報の項目には、アクション番号や、アクションフレームを含む。 [5.4] Items of Behavior Information Behavior information is information for specifying behavior, and can be, for example, an action number (identifier). Note that the behavior information may be composed of a plurality of items (elements). For example, the action information items include an action number and an action frame.

アクション番号は、キャラクタの行動に対応して振られる番号で、例えば、前進する動き、右パンチの攻撃の動き、右パンチがヒットしたときの攻撃を受ける動き、投げ技においての投げ掴みの動き、投げ技で投げられる動き、投げ技を回避するための投げ抜ける動き、等の番号である。なお、アクションフレームは、アクション番号に対応する動き（行動）の動き始めから終わりまでのどの段階かを示すものである。例えば、右パンチが４フレームの期間で行われる場合、右パンチという技が４つのアクションフレーム１～４で表現され、４段階のアクションフレームの全体で１つのアクションを特定するためのアクション番号が付される。 The action number is a number assigned in response to the character's actions, such as forward movement, right punch attack movement, receiving attack when the right punch hits, throw and grab movement in throwing techniques, This is the number for the movement thrown in a throwing technique, the movement to throw through to avoid a throwing technique, etc. Note that the action frame indicates at which stage from the beginning to the end of the movement (behavior) corresponding to the action number. For example, if a right punch is performed in a period of 4 frames, the right punch technique is expressed in 4 action frames 1 to 4, and an action number is assigned to identify one action in all 4 action frames. be done.

［６］関連情報
本実施形態では、図９に示すように、第１のキャラクタＯＢ１の行動（例えば、右パンチ）が発生したタイミングＴ１１の関連情報を取得し、第１のキャラクタＯＢ１の行動（例えば、右パンチ）と、当該関連情報とを、セットで学習記憶部２７８に記憶する。 [6] Related Information In the present embodiment, as shown in FIG. 9, related information at timing T11 when the action (for example, right punch) of the first character OB1 occurs is acquired, and the action (for example, right punch) of the first character OB1 is acquired. For example, a right punch) and the relevant information are stored in the learning storage unit 278 as a set.

関連情報は、キャラクタが行動を行った際の当該行動に関連する情報であり、別の言い方をすればゲーム状況である。端末装置２０は、第１のキャラクタＯＢ１が行動したタイミングに基づいて、当該行動に関連する情報を取得する。例えば、図９に示すように、第１のキャラクタＯＢ１がＴ１１のタイミングで右パンチの行動を開始した場合、タイミングＴ１１において関連情報を取得する。そして、右パンチの行動の情報と、当該関連情報
とをセットで学習記憶部２７８に記憶する。 The related information is information related to the action taken by the character, or in other words, the game situation. The terminal device 20 acquires information related to the action based on the timing of the action of the first character OB1. For example, as shown in FIG. 9, when the first character OB1 starts a right punch action at timing T11, related information is acquired at timing T11. Then, the information on the action of the right punch and the relevant information are stored as a set in the learning storage unit 278.

また、タイミングＴ１１の後に、更に、タイミングＴ１４でキャラクタＯＢ１が左キックの行動を開始した場合、タイミングＴ１４において関連情報を取得する。そして、左キックの行動の情報と、当該関連情報とをセットで学習記憶部２７８に記憶する。 Further, when the character OB1 starts a left kick action at timing T14 after timing T11, related information is acquired at timing T14. Then, the information on the left kick action and the related information are stored as a set in the learning storage unit 278.

このように、端末装置２０は、キャラクタＯＢ１の行動が発生したタイミングの関連情報を取得する。 In this way, the terminal device 20 obtains information related to the timing at which the action of the character OB1 occurred.

関連情報は、複数の項目（複数の要素）が存在する。各項目に変数名及びその値を対応付ける。図１０は、第１のキャラクタＯＢ１が右パンチの行動を行ったタイミングＴ１１に基づく、当該右パンチに関連する関連情報の一例を示す。 The related information includes multiple items (multiple elements). Assign a variable name and its value to each item. FIG. 10 shows an example of related information related to the right punch based on the timing T11 when the first character OB1 performs the right punch action.

例えば、関連情報は、第１のキャラクタＯＢ１と第２のキャラクタＯＢ２との距離の項目を含む。当該項目の変数名は、Ｄｉｓｔａｎｃｅであり、例えば、タイミングＴ１１においてＤｉｓｔａｎｃｅの値は「２」であることを示す。 For example, the related information includes an item of distance between the first character OB1 and the second character OB2. The variable name of this item is Distance, which indicates that the value of Distance is "2" at timing T11, for example.

また、関連情報は、１ラウンドの残り時間（第１のキャラクタＯＢ１の行動タイミングからゲーム終了時点までの期間）の項目を含む。当該項目の変数名は、Ｔｉｍｅであり、例えば、タイミングＴ１１においてＴｉｍｅの値は「４７」（残り時間が４７秒）であることを示す。 The related information also includes an item for the remaining time of one round (the period from when the first character OB1 takes action to the end of the game). The variable name of this item is Time, and for example, at timing T11, the value of Time is "47" (47 seconds remaining).

また、関連情報は、第１のキャラクタＯＢ１の体力値の項目を含む。当該項目の変数名は、ＭｅＨＰであり、例えば、タイミングＴ１１においてＭｅＨＰの値は「９０」であることを示す。 Further, the related information includes an item of the physical strength value of the first character OB1. The variable name of the item is MeHP, which indicates that the value of MeHP is "90" at timing T11, for example.

また、関連情報は、第２のキャラクタＯＢ２の体力値の項目を含む。当該項目の変数名は、ＯｐＨＰであり、例えば、タイミングＴ１１においてＯｐＨＰの値は「８０」であることを示す。 Further, the related information includes an item of physical strength value of the second character OB2. The variable name of the item is OpHP, which indicates that the value of OpHP is "80" at timing T11, for example.

体力値が残っているキャラクタは、精神状態に余裕があり、体力値が少ないキャラクタは、切羽詰まっている状況と予想される。したがって、これらの体力値を機械学習に生かしている。 It is assumed that a character with a remaining physical strength value is in a relaxed state of mind, and a character with a low physical strength value is in a desperate situation. Therefore, these physical strength values are used in machine learning.

また、関連情報は、第１のキャラクタＯＢ１の地面からの高さの項目を含む。当該項目の変数名は、ＭｅＯｎＦｌｏｏｒであり、例えば、タイミングＴ１１においてＭｅＯｎＦｌｏｏｒの値は「０」であることを示す。 Further, the related information includes an item of the height of the first character OB1 from the ground. The variable name of this item is MeOnFloor, which indicates that the value of MeOnFloor is "0" at timing T11, for example.

また、関連情報は、第１のキャラクタＯＢ１の地面からの高さの項目を含む。当該項目の変数名は、ＯｐＯｎＦｌｏｏｒであり、例えば、タイミングＴ１１においてＯｐＯｎＦｌｏｏｒの値は「０」であることを示す。 Further, the related information includes an item of the height of the first character OB1 from the ground. The variable name of the item is OpOnFloor, which indicates that the value of OpOnFloor is "0" at timing T11, for example.

また、関連情報は、第１のキャラクタＯＢ１の防御期間（防御開始時点から第１のキャラクタＯＢ１の行動タイミングまでの期間）の項目を含む。当該項目の変数名は、ＧｕａｒｄＴｉｍｅであり、例えば、タイミングＴ１１においてＧｕａｒｄＴｉｍｅの値は「０」であることを示す。 Further, the related information includes an item of the defense period of the first character OB1 (the period from the start of defense to the action timing of the first character OB1). The variable name of this item is GuardTime, which indicates that the value of GuardTime is "0" at timing T11, for example.

また、本実施形態の関連情報は、図示していないが、第１のキャラクタＯＢ１のゲーム空間の位置座標（Ｘ、Ｙ、Ｚ）の項目、第２のキャラクタＯＢ２のゲーム空間の位置座標（Ｘ、Ｙ、Ｚ）の項目、第１のキャラクタＯＢ１の対戦エリアの外周からの距離（例えば
、壁からの距離）の項目、第２のキャラクタＯＢ２の対戦エリアの外周からの距離（例えば、壁からの距離）の項目、経過時間（１ラウンドのゲーム開始時点から第１のキャラクタＯＢ１の行動タイミングまでの期間）の項目、第１のキャラクタＯＢ１のキャラクタのレベル（段位）の項目、第１のキャラクタＯＢ１の向きの項目、第２のキャラクタＯＢ２の向きの項目、第１のキャラクタＯＢ１の身長の項目、第２のキャラクタの身長の項目、第１のキャラクタＯＢ１の体力値と第２のキャラクタＯＢ２の体力値との体力差の項目、など他の項目を有していてもよい。 Although not shown, the related information of this embodiment includes items such as the position coordinates (X, Y, Z) of the first character OB1 in the game space, and the position coordinates (X, Y, Z) of the second character OB2 in the game space. . Item of elapsed time (period from the start of the game of one round to the action timing of the first character OB1) Item of the character level (grade) of the first character OB1, Item of the first character OB1 OB1's orientation item, second character OB2's orientation item, first character OB1's height item, second character's height item, first character OB1's physical strength value and second character OB2's physical strength value It may also include other items such as an item of the difference in physical strength from the physical strength value.

なお、関連情報は、画像情報（映像情報）の項目を有していてよい。例えば、第１のキャラクタＯＢ１が、行動を行ったタイミングの画像情報の項目を含んでもよい。 Note that the related information may include an item of image information (video information). For example, the first character OB1 may include an item of image information of the timing at which the first character OB1 performed an action.

関連情報は、画像の明度、画像のキャラクタＯＢ１の表示位置（Ｘ、Ｙ）の項目、画像のキャラクタＯＢ２の表示位置（Ｘ、Ｙ）の項目、画像上の第１のキャラクタＯＢ１と第２のキャラクタＯＢ２との位置関係の項目を、有していてもよい。 The related information may include items such as the brightness of the image, the display position (X, Y) of the character OB1 in the image, the display position (X, Y) of the character OB2 in the image, and the positional relationship between the first character OB1 and the second character OB2 on the image.

格闘ゲームは画面の中央を境界としてキャラクタ同士が対面する構図が多いが、両者が画面の右側或いは左側にいる場合は、他方が追い込まれているゲーム状況と考えられ、これらの情報を機械学習に生かすことができる。 Fighting games often have characters facing each other with the center of the screen as the boundary, but if both characters are on the right or left side of the screen, the other player is considered to be in a corner, and this information can be used in machine learning. You can make use of it.

また、関連情報は、仮想カメラの位置、向き、画角の少なくとも１つの項目を含んでもよい。例えば、仮想カメラが第１のキャラクタＯＢ１が近い場合は、遠い場合よりもアップに表示され、ゲーム状況が白熱している状況とも考えられる。この状況を機械学習に生かすことができる。 Further, the related information may include at least one item of the position, orientation, and angle of view of the virtual camera. For example, when the virtual camera is close to the first character OB1, the first character OB1 is displayed closer than when it is far away, and it can be considered that the game situation is heated. This situation can be utilized in machine learning.

関連情報は、その行動情報が連続行動情報（コンボ行動情報）か否かを示す情報（フラグ）の項目を含んでもよい。このようにすれば、機械学習時にコンボ特有の予測を行うことができる。 The related information may include an item of information (flag) indicating whether the action information is continuous action information (combo action information). In this way, combo-specific predictions can be made during machine learning.

関連情報は、キーデータ（操作入力）の項目を含んでもよい。例えば、プレーヤＰ１のキーデータの履歴であるキー履歴を含んでもよい。キー履歴は、フレーム番号に対応付けて、プレーヤが操作するキャラクタの操作入力（キーデータ）を記憶する。 The related information may include items of key data (operation input). For example, it may include a key history that is a history of key data of the player P1. The key history stores operation inputs (key data) of the character operated by the player in association with frame numbers.

また、関連情報は、第１のキャラクタＯＢ１の移動速度、加速度、移動方向、第２のキャラクタＯＢ２の移動速度、加速度、移動方向の少なくとも１つの項目を含んでもよい。 Further, the related information may include at least one item of the moving speed, acceleration, and moving direction of the first character OB1, and the moving speed, acceleration, and moving direction of the second character OB2.

［７］記憶するデータの説明
本実施形態の端末装置２０は、プレーヤＰ１の操作対象のキャラクタＯＢ１の行動の情報と当該行動に関連する関連情報とをセットで学習記憶部２７８に記憶する。 [7] Description of Data to be Stored The terminal device 20 of the present embodiment stores information on the action of the character OB1 to be operated by the player P1 and related information related to the action as a set in the learning storage unit 278.

なお、端末装置２０は、これらの情報を学習記憶部２７８以外の記憶領域（例えば、クラウド上の記憶領域）に記憶するようにしてもよい。 Note that the terminal device 20 may store this information in a storage area other than the learning storage unit 278 (for example, a storage area on the cloud).

本実施形態では、所与の１ラウンドのゲームが開始されてからそのラウンドのゲームが終了するまでの間（１ラウンド分が終了するまでの間）、第１のキャラクタＯＢ１の行動の情報及び当該行動の関連情報を、学習データとして、累積的に学習記憶部２７８に記憶する。 In this embodiment, from the start of a given round of the game to the end of that round of the game (until the end of one round), information on the actions of the first character OB1 and information related to those actions are cumulatively stored as learning data in the learning memory unit 278.

なお、本実施形態では、１ラウンドのうちの一部の期間において、学習データを取得し記憶してもよい。また、学習データを取得し記憶する頻度を調整してもよい。また、１ラウンドに限らず、複数ラウンドにおいて、学習データを取得してもよい。 Note that in this embodiment, learning data may be acquired and stored during a part of one round. Furthermore, the frequency of acquiring and storing learning data may be adjusted. Further, learning data may be acquired not only in one round but also in multiple rounds.

また、本実施形態では、学習モデルの機械学習のために、学習データの一部又は全部を加工してもよい。 Further, in this embodiment, part or all of the learning data may be processed for machine learning of the learning model.

［８］学習モデル
本実施形態では、プレーヤＰ１の操作対象の第１のキャラクタＯＢ１の行動と当該行動に関連する関連情報とを機械学習する学習モデルを生成する。学習モデルは、プレーヤＰ１が操作する第１のキャラクタＯＢ１の行動を再現可能な学習モデルである。なお、学習モデルは、式、関数と言い換えることができる。 [8] Learning Model In this embodiment, a learning model is generated that performs machine learning on the behavior of the first character OB1 operated by the player P1 and related information related to the behavior. The learning model is a learning model that can reproduce the behavior of the first character OB1 operated by the player P1. The learning model can be rephrased as an equation or a function.

本実施形態では、プレーヤにとって最適なＡＩキャラクタが練習相手となるように、プレーヤ毎に個別の学習モデルを生成する。例えば、プレーヤＰ１の端末装置２０Ａでは、プレーヤＰ１について、プレーヤＰ１に対応する学習モデルを生成する。また、プレーヤＰ２の端末装置２０Ｂでは、プレーヤＰ２について、プレーヤＰ２に対応する学習モデルを生成する。 In this embodiment, an individual learning model is generated for each player so that the most suitable AI character for the player becomes the training partner. For example, the terminal device 20A of the player P1 generates a learning model corresponding to the player P1. Furthermore, the terminal device 20B of the player P2 generates a learning model corresponding to the player P2.

［９］未学習の場合の第２のキャラクタの行動制御
［９．１］概要
本実施形態では、プレーヤＰ１の操作入力に基づき決定された第１のキャラクタＯＢ１の行動（例えば、「ジャンプキック」）及び当該行動の関連情報が、学習モデルにおいて未学習である場合、第２のキャラクタＯＢ２の行動制御ができない、或いは、行動が不適切という問題がある。 [9] Behavior control of second character in case of unlearning [9.1] Overview In this embodiment, the behavior of the first character OB1 determined based on the operation input of the player P1 (for example, "jump kick") ) and related information of the behavior have not been learned in the learning model, there is a problem that the behavior of the second character OB2 cannot be controlled or the behavior is inappropriate.

そこで、本実施形態では、第１のキャラクタＯＢ１の行動（例えば、「ジャンプキック」）及び当該行動の関連情報が、学習モデルにおいて未学習である場合、次のように処理する。 Therefore, in this embodiment, if the action of the first character OB1 (e.g., "jump kick") and related information about that action have not yet been learned in the learning model, the following processing is performed.

端末装置２０は、図７に示すように、第１のキャラクタＯＢ１の行動（例えば、ジャンプキック）の関連情報と一致度が高い関連情報を、学習済みの第１のキャラクタＯＢ１の行動に関連する関連情報の中から、探索（検索、抽出）する。例えば、学習記憶部２７８に記憶されている関連情報を参照する。 As shown in FIG. 7, the terminal device 20 associates related information with a high degree of matching with the related information of the action (for example, jump kick) of the first character OB1 to the learned action of the first character OB1. Search (search, extract) from related information. For example, the related information stored in the learning storage unit 278 is referred to.

例えば、第１のキャラクタＯＢ１の行動（例えば、ジャンプキックの情報）の関連情報の１項目である「第１のキャラクタＯＢ１と第２のキャラクタＯＢ２との距離」が１メートルであるとする。すると、学習済みの関連情報の中から、第１のキャラクタＯＢ１と第２のキャラクタＯＢ２との距離が１メートルである関連情報を探索する。 For example, assume that "distance between first character OB1 and second character OB2", which is one item of information related to the action of first character OB1 (for example, jump kick information), is 1 meter. Then, related information in which the distance between the first character OB1 and the second character OB2 is 1 meter is searched from among the learned related information.

学習済みの関連情報の中から、第１のキャラクタＯＢ１と第２のキャラクタＯＢ２との距離が１メートルである関連情報がない場合、当該距離が１メートルに最も近似する値を有する関連情報を探索する。 If there is no related information in which the distance between the first character OB1 and the second character OB2 is 1 meter from among the learned related information, search for related information whose distance is closest to 1 meter. do.

そして、学習済みの関連情報の中から、最も一致度の高い関連情報（例えば、第１のキャラクタＯＢ１と第２のキャラクタＯＢ２との距離で１メートル）が探索されると、当該関連情報と当該関連情報に対応する学習済みの行動（例えば、投げ技）に基づいて、第２のキャラクタＯＢ２の行動を制御する。 Then, when the related information with the highest degree of matching (for example, the distance between the first character OB1 and the second character OB2 is 1 meter) is searched from among the learned related information, the related information and the corresponding The behavior of the second character OB2 is controlled based on the learned behavior (eg, throwing technique) corresponding to the related information.

端末装置２０は、予め「投げ技」に対応する行動として「後ずさり防御」を行うことが決められている場合、第２のキャラクタＯＢ２に対し、「後ずさり防御」の行動の移動・動作制御を行う。 If it is decided in advance to perform "backward defense" as an action corresponding to the "throwing technique", the terminal device 20 controls the movement and movement of the "backward defense" action for the second character OB2. .

例えば、端末装置２０は、探索された関連情報とセットで学習された行動である「投げ技」を、第１のキャラクタＯＢ１が繰り出した時に、第２のキャラクタＯＢ２が対応したときの学習済みの関連情報を学習モデルに入力し、出力データを取得してもよい。当該出力データが「ガード」の行動である場合、端末装置２０は、取得した「ガード」の行動に基づいて、第２のキャラクタＯＢ２に対し、「ガード」の移動・動作制御を行う。 For example, when the first character OB1 performs a "throwing technique," which is an action learned in combination with the searched related information, the terminal device 20 determines the learned behavior when the second character OB2 responds. Relevant information may be input into the learning model and output data may be obtained. When the output data is the behavior of the "guard", the terminal device 20 controls the movement and movement of the "guard" on the second character OB2 based on the acquired behavior of the "guard".

なお、第１のキャラクタＯＢ１の行動（例えば、ジャンプキック）に関連する関連情報を取得するタイミングは、例えば、第１のキャラクタＯＢ１の行動（例えば、ジャンプキックの動作）を開始したタイミングであるが、当該行動（例えば、ジャンプキック）のアクションフレーム（動作中）の所与のタイミングでもよい。行動に関連する関連情報は、例えば、第１のキャラクタＯＢ１と第２のキャラクタＯＢ２との距離、第１のキャラクタＯＢ１のパラメータ（例えば、体力値）等でもよいし、他の項目でもよい。 Note that the timing at which the related information related to the action (for example, jump kick) of the first character OB1 is acquired is, for example, the timing at which the action (for example, the jump kick action) of the first character OB1 is started. , it may be a given timing of the action frame (during movement) of the action (for example, jump kick). The related information related to the action may be, for example, the distance between the first character OB1 and the second character OB2, a parameter of the first character OB1 (for example, physical strength value), or other items.

なお、本実施形態では、第１のキャラクタＯＢ１の行動（例えば、ジャンプキックの情報）の関連情報の複数の項目のうち、重要度の高い項目を参照して、一致度の高い関連情報を探索してもよい。例えば、学習済みの関連情報の中から、重要度の高い「第１のキャラクタＯＢ１と第２のキャラクタＯＢ２との距離」と「第１のキャラクタＯＢ１の体力値」とを探索する。つまり、本実施形態では、「第１のキャラクタＯＢ１と第２のキャラクタＯＢ２との距離」について一致度が高く、かつ、「第１のキャラクタＯＢ１の体力値」について一致度が高い関連情報を探索してもよい。 In this embodiment, related information with a high degree of match may be searched for by referring to items of high importance among multiple items of related information of the action of the first character OB1 (e.g., information on a jump kick). For example, "the distance between the first character OB1 and the second character OB2" and "the physical strength value of the first character OB1" which are of high importance are searched for from among the learned related information. In other words, in this embodiment, related information with a high degree of match for "the distance between the first character OB1 and the second character OB2" and for "the physical strength value of the first character OB1" may be searched for.

［９．２］未学習である場合の判断
本実施形態では、プレーヤＰ１の操作入力に基づき決定された第１のキャラクタＯＢ１の行動（例えば、「ジャンプキック」）及び当該行動の関連情報が、学習モデルにおいて未学習であるか否かの判断は、種々考えられる。 [9.2] Judgment when unlearned In the present embodiment, the action of the first character OB1 (for example, "jump kick") determined based on the operation input of the player P1 and the related information of the action are Various methods can be considered for determining whether the learning model has not yet been learned.

例えば、端末装置２０は、第２のキャラクタＯＢ２に対して予め学習済の行動リストを記憶部２７０に登録（記憶）し、第１のキャラクタＯＢ１の行動（例えば、「ジャンプキック」）が行動リストに登録されていない場合に、未学習であると判断してもよい。 For example, the terminal device 20 registers (stores) in the storage unit 270 a pre-learned action list for the second character OB2, and the action of the first character OB1 (for example, "jump kick") is registered (stored) in the action list. If the information is not registered in , it may be determined that the information has not been learned.

また、端末装置２０は、学習記憶部２７８を参照して、第１のキャラクタＯＢ１の行動の情報が記憶されていない場合に、未学習であると判断してもよい。 Further, the terminal device 20 may refer to the learning storage unit 278 and determine that the first character OB1 has not yet been learned if the information on its behavior is not stored.

また、例えば、第１のキャラクタＯＢ１の関連情報を学習モデルに入力し、学習モデルの出力結果に基づいて、当該行動が未学習であるか否かを判断してもよい。例えば、学習モデルの出力値が所定値以下である場合、未学習と判断してもよい。 Alternatively, for example, the related information of the first character OB1 may be input into a learning model, and it may be determined whether the behavior is unlearned based on the output result of the learning model. For example, if the output value of the learning model is less than or equal to a predetermined value, it may be determined that the learning model has not been learned.

［９．３］探索手法
本実施形態では、プレーヤＰ１の操作入力に基づき決定された第１のキャラクタＯＢ１の行動（例えば、「ジャンプキック」）が、学習モデルにおいて未学習である場合、学習済みの第１のキャラクタＯＢ１の行動に関連する関連情報の中から、検出された未学習の当該第１のキャラクタの行動（例えば、「ジャンプキック」）に関連する関連情報に一致する度合が高い関連情報を探索する。 [9.3] Search method In this embodiment, if the action of the first character OB1 determined based on the operation input of the player P1 (for example, "jump kick") is unlearned in the learning model, it is determined that the action has been learned. Among the related information related to the behavior of the first character OB1, the association has a high degree of matching with the related information related to the detected unlearned behavior of the first character (for example, "jump kick") Explore information.

かかる場合、例えば、端末装置２０は、学習記憶部２７８に記憶されている第１のキャラクタＯＢ１の行動及び関連情報の中から、検出された未学習の当該第１のキャラクタの行動（例えば、「ジャンプキック」）に関連する関連情報に一致する度合が高い関連情報を探索する。 In such a case, for example, the terminal device 20 may determine the detected unlearned behavior of the first character (for example, " Search for related information that has a high degree of matching with related information related to "Jump Kick").

探索結果の関連情報が複数存在する場合がある。例えば、探索結果として、「投げ技」
の関連情報と「ハイキック」の関連情報とが検出される場合もあり得る。かかる場合は、いずれか１つの探索結果の行動を用いて、第２のキャラクタＯＢ２の行動を決定する。 There may be multiple pieces of information related to the search results. For example, as a search result, "throwing technique"
It is possible that information related to "High Kick" and information related to "High Kick" may be detected. In such a case, the action of the second character OB2 is determined using any one of the actions resulting from the search.

［９．４］探索の具体例
（Ａ）関連情報の体力値を参照する例
本実施形態では、第１のキャラクタＯＢ１の「ジャンプキック」の行動が、学習モデルにおいて未学習である場合、「ジャンプキック」の第１のキャラクタＯＢ１の関連情報の１項目である体力値（つまり、ジャンプキック時の第１のキャラクタＯＢ１の体力値）から、学習済みの第１のキャラクタＯＢ１の関連情報の中から、一致度が最も高い関連情報を探索してもよい。 [9.4] Specific example of search (A) Example of referring to physical strength value of related information In this embodiment, if the "jump kick" action of the first character OB1 is not learned in the learning model, " From the physical strength value, which is one item of the related information of the first character OB1 of "Jump Kick" (that is, the physical strength value of the first character OB1 at the time of jump kick), to the learned related information of the first character OB1. You may search for related information with the highest degree of matching.

例えば、ジャンプキック時の第１のキャラクタＯＢ１の体力値が「１０」であり、学習済みの第１のキャラクタＯＢ１の「左キック」時の第１のキャラクタＯＢ１の体力値が「１０」であり、体力値が同じであり最も一致度が高いと判断される。すると、第１のキャラクタＯＢ１の「左キック」の行動の関連情報に基づいて、第２のキャラクタＯＢ２の行動を制御してもよい。例えば、「左キック」に対応する行動が「ガード」である場合、第２のキャラクタＯＢ２の行動を「ガード」に決定し、「ガード」を第２のキャラクタＯＢ２に行わせる。つまり、端末装置２０は、第１のキャラクタＯＢ１が「ジャンプキック」をした場合に、第２のキャラクタＯＢ２に「ガード」の行動を行わせる。 For example, the physical strength value of the first character OB1 when performing a jump kick is "10", and the physical strength value of the first character OB1 when the learned first character OB1 performs a "left kick" is "10". , the physical strength values are the same, and it is determined that the degree of matching is the highest. Then, the action of the second character OB2 may be controlled based on the information related to the "left kick" action of the first character OB1. For example, when the action corresponding to "left kick" is "guard", the action of the second character OB2 is determined to be "guard", and the second character OB2 is caused to perform "guard". That is, when the first character OB1 performs a "jump kick", the terminal device 20 causes the second character OB2 to perform a "guard" action.

（Ｂ）残りゲーム時間を参照する例
本実施形態では、第１のキャラクタＯＢ１の「ジャンプキック」の行動が、学習モデルにおいて未学習である場合、「ジャンプキック」の第１のキャラクタＯＢ１の関連情報の１項目である残りゲーム時間（例えば、ジャンプキック時の残りゲーム時間）から、学習済みの第１のキャラクタＯＢ１の関連情報の中から、一致度が最も高い関連情報を探索してもよい。 (B) Example of referencing remaining game time In this embodiment, when the "jump kick" action of the first character OB1 has not been learned in the learning model, the remaining game time (e.g., the remaining game time at the time of the jump kick), which is one item of the related information of the first character OB1 for the "jump kick," may be used to search for related information of the first character OB1 that has already been learned that has the highest degree of match.

例えば、ジャンプキック時の第１のキャラクタＯＢ１の残りゲーム時間が「１５」であり、学習済みの第１のキャラクタＯＢ１の「左パンチ」時の第１のキャラクタＯＢ１の残りゲーム時間が「１５」であり、残りゲーム時間が同じである最も一致度が高いと判断される。すると、第１のキャラクタＯＢ１の「左パンチ」の行動の関連情報に基づいて、第２のキャラクタＯＢ２の行動を決定する。例えば、「左パンチ」に対応する行動が「後退」である場合、第２のキャラクタＯＢ２の行動を「後退」に決定し、「後退」を第２のキャラクタＯＢ２に行わせる。つまり端末装置２０は、第１のキャラクタＯＢ１が「ジャンプキック」をした場合に、第２のキャラクタＯＢ２に「後退」の行動を行わせる。 For example, the remaining game time of the first character OB1 when performing a jump kick is "15", and the remaining game time of the first character OB1 when the learned first character OB1 performs a "left punch" is "15". , and it is determined that the degree of coincidence is highest since the remaining game time is the same. Then, the action of the second character OB2 is determined based on the related information of the "left punch" action of the first character OB1. For example, when the action corresponding to "left punch" is "retreat", the action of the second character OB2 is determined to be "retreat", and the second character OB2 is caused to perform "retreat". In other words, when the first character OB1 performs a "jump kick", the terminal device 20 causes the second character OB2 to perform a "retreat" action.

［９．５］効果
本実施形態によれば、機械学習されていない第１のキャラクタＯＢ１に遭遇しても、学習済みの第１のキャラクタＯＢ１の行動の関連情報から、第２のキャラクタのＯＢ２の行動を制御するので、適切な行動を第２のキャラクタＯＢ２に行わせることができる。また、本実施形態によれば、プレーヤの癖を反映した最適なＡＩキャラクタ（例えば、第２のキャラクタＯＢ２）を練習相手として提供することができる。例えば、プレーヤＰ１の第１のキャラクタＯＢ１が、パンチ攻撃が多い場合、ＡＩキャラクタもパンチ攻撃が多いキャラクタとなる。プレーヤＰ１は、自分と似たプレイを行うコンピュータ対戦で練習を行うことにより、自分の癖を直す練習や、自分の癖を活かした練習を行うことができる。また、プレーヤＰ１は、未学習の行動（例えば、ジャンプキック）を行ったとして、ＡＩキャラクタが適切な行動を行うことができる。 [9.5] Effects According to this embodiment, even if the player encounters the first character OB1 that has not been machine-learned, the action of the second character OB2 is controlled from the related information of the action of the learned first character OB1, so that the second character OB2 can be made to take an appropriate action. In addition, according to this embodiment, an optimal AI character (e.g., the second character OB2) that reflects the player's habits can be provided as a practice partner. For example, if the first character OB1 of the player P1 has many punch attacks, the AI character will also be a character that has many punch attacks. The player P1 can practice correcting his/her habits or making use of his/her habits by practicing against a computer that plays similarly to the player P1. In addition, the player P1 can perform an unlearned action (e.g., a jump kick) and the AI character can take an appropriate action.

［９．６］第２のキャラクタの行動及び当該行動の関連情報の学習
本実施形態では、第２のキャラクタＯＢ２の行動と当該行動に関連する関連情報とを機
械学習してもよい。 [9.6] Learning of second character's behavior and related information of the behavior In this embodiment, the behavior of the second character OB2 and related information related to the behavior may be learned by machine.

例えば、本実施形態では、未学習の第１のキャラクタＯＢ１の行動（例えば、「ジャンプキック」）及び当該行動の関連情報に遭遇した場合、第２のキャラクタＯＢ２がとった行動（例えば、「後ずさり防御」）と、当該行動時（例えば、「後ずさり防御」の行動開始時）の関連情報とをセットで機械学習してもよい。 For example, in the present embodiment, when an unlearned action of the first character OB1 (e.g., "jump kick") and related information of the action are encountered, the action taken by the second character OB2 (e.g., "backwards") is encountered. Machine learning may be performed on a set of information related to the behavior (for example, when the behavior of "backward defense" is started) and the relevant information at the time of the behavior (for example, when the behavior of "backward defense" is started).

［１０］学習モードと実戦モード
本実施形態では、学習モデルにより学習をさせる学習モード（学習期間）と、当該学習モデルを用いて、第２のキャラクタＯＢ２の行動を予測（推論）しゲームを進行させる実践モード（実践期間）とを同時に行っている。しかし、本実施形態では、学習モデルを生成する学習モードの期間と実戦モードの期間とを、一部又は全部が異なる期間となるように、設定してもよい。 [10] Learning mode and actual battle mode In this embodiment, a learning mode (learning period) in which learning is performed using a learning model, and the learning model is used to predict (infer) the behavior of the second character OB2 and advance the game. A practical mode (practical period) is conducted at the same time. However, in the present embodiment, the period of the learning mode for generating the learning model and the period of the practical mode may be set so that some or all of them are different periods.

例えば、第１のキャラクタと、コンピュータキャラクタとの対戦で、学習データを収集し、ある程度の学習データが蓄積された段階で、収集した学習データに基づいて学習モデルを生成してもよい。 For example, learning data may be collected in a match between a first character and a computer character, and a learning model may be generated based on the collected learning data when a certain amount of learning data has been accumulated.

また、本実施形態では、プレーヤＰ１の端末装置２０以外の装置（例えば、他のプレーヤの端末装置２０、サーバ装置、又は学習装置）において、学習モデルを生成してもよい。そして、生成された当該学習モデルを、当該装置から受信して、学習モデル記憶部２７９に記憶し、当該学習モデルを用いてもよい。また、学習データを、当該装置から受信して、学習記憶部２７８に記憶してもよい。 Furthermore, in this embodiment, the learning model may be generated in a device other than the terminal device 20 of the player P1 (for example, a terminal device 20 of another player, a server device, or a learning device). Then, the generated learning model may be received from the device, stored in the learning model storage section 279, and used. Further, the learning data may be received from the device and stored in the learning storage section 278.

［１１］学習を行う頻度が高い期間
本実施形態の端末装置２０は、図１１に示すように、ゲーム期間（格闘ゲームの１ラウンドの期間）において、学習（例えば、機械学習）を行う頻度が高い期間として所定期間を設けるようにしてもよい。所定期間の長さは、ゲーム期間（例えば、６０秒間）よりも短い期間であり、例えば、２０秒間の長さとする。 [11] Period where learning is performed frequently As shown in FIG. A predetermined period may be provided as the high period. The length of the predetermined period is shorter than the game period (for example, 60 seconds), and is, for example, 20 seconds.

「学習を行う頻度が高い」とは、例えば、所定期間において学習データを取得して学習モデルにおいて機械学習する回数が、所定期間以外の期間よりも多いことを意味する。 "Learning is performed frequently" means, for example, that the number of times learning data is acquired and machine learning is performed on a learning model in a predetermined period is greater than in periods other than the predetermined period.

例えば、図１１に示すように、所定期間（ＴＳ～Ｔ１の期間）において、機械学習する回数は、少なくとも１０回以上とし、所定期間外（Ｔ１～ＴＥの期間）において機械学習する回数は、少なくとも５回以下とする。 For example, as shown in FIG. 11, the number of times machine learning is performed during the predetermined period (period TS to T1) is at least 10 times, and the number of times machine learning is performed outside the predetermined period (period T1 to TE) is at least 10. No more than 5 times.

なお、端末装置２０は、学習前に、所定期間の設定を行う。なお、端末装置２０は、学習中においても、所定期間の設定や、所定期間の変更を行ってもよい。例えば、端末装置２０は、ゲーム進行状況に応じて、所定期間の設定や、所定期間の変更を行ってもよい。 Note that the terminal device 20 sets a predetermined period of time before learning. Note that the terminal device 20 may set or change the predetermined period even during learning. For example, the terminal device 20 may set a predetermined period or change a predetermined period depending on the progress of the game.

［１１．１］所定期間の設定例
本実施形態の端末装置２０は、ゲーム期間の開始期間において、所定期間を設定してもよい。 [11.1] Example of setting a predetermined period The terminal device 20 of this embodiment may set a predetermined period in the start period of the game period.

例えば、図１１に示すように、ゲーム期間の開始時点と所定期間の開始時点とを同じタイミングＴＳに設定する。所定期間の終了時点Ｔ１は、開始時点ＴＳから１０秒後の時点とする。 For example, as shown in FIG. 11, the start time of the game period and the start time of the predetermined period are set to the same timing TS. The end time T1 of the predetermined period is the time 10 seconds after the start time TS.

また、所定期間（ＴＳ～Ｔ１の期間）において学習データとして取得する行動の種類を
、単純行動、技、連続行動の全ての行動の種類とし、所定期間外（Ｔ１～ＴＥの期間）において学習対象として取得する行動の種類を、技及び連続行動に限定してもよい。 In addition, the types of actions to be acquired as learning data during the predetermined period (period from TS to T1) are all types of actions, such as simple actions, techniques, and continuous actions, and the types of actions to be acquired as learning data during the predetermined period (period from T1 to T1) are all types of actions, and outside the predetermined period (period from T1 to TE), the learning data is The types of actions to be acquired may be limited to techniques and continuous actions.

ゲーム期間の開始期間は、ゲームの流れや試合の方向性を左右する重要な場面である。例えば、先制攻撃をした方はどちらのキャラクタであるか、先に技を決めた方がどちらのキャラクタであるのか、ゲームの重要な要因になる。そして、ゲーム期間の開始期間は、プレーヤの癖が出る傾向にある。 The starting period of a game period is an important moment that influences the flow of the game and the direction of the match. For example, important factors in the game include which character makes the first attack, and which character makes the move first. During the start period of the game period, players tend to develop their habits.

そのため、本実施形態では、効率的に学習を行うために、ゲーム期間の開始期間に所定期間を設けて多くの行動を学習する。そして、所定期間経過後は、技や連続行動（コンボ）などの行動について学習をする。 Therefore, in this embodiment, in order to learn efficiently, a predetermined period is set at the start of the game period to learn many actions. Then, after the predetermined period has passed, the player learns actions such as techniques and consecutive actions (combos).

［１１．２］プレーヤの操作入力に基づいて所定期間を設定する例
本実施形態の端末装置２０は、第１のキャラクタＯＢ１を操作するプレーヤＰ１の操作入力に基づいて、所定期間を設定してもよい。 [11.2] Example of Setting Predetermined Period Based on Operation Input by Player The terminal device 20 of this embodiment may set a predetermined period based on operation input by the player P1 who controls the first character OB1.

図１２は、設定画面の一例を示す。例えば、図１２に示すように、プレーヤＰ１は、設定画面において、ゲーム開始時点を０とし、ゲーム終了時点を６０とし、所定期間の開始タイミングＡ１と終了タイミングＡ２とを入力する。 FIG. 12 shows an example of a setting screen. For example, as shown in FIG. 12, the player P1 inputs, on the setting screen, a game start time point of 0, a game end time point of 60, and a start timing A1 and an end timing A2 of a predetermined period.

端末装置２０は、プレーヤＰ１は、開始タイミングＡ１（例えば、５０）と、終了タイミングＡ２（例えば、６０）とを受け付けると、開始タイミングＡ１（例えば、５０）と、終了タイミングＡ２（例えば、６０）に基づいて、所定期間を設定する。 When the player P1 receives the start timing A1 (e.g., 50) and the end timing A2 (e.g., 60), the terminal device 20 sets a predetermined period based on the start timing A1 (e.g., 50) and the end timing A2 (e.g., 60).

このようにすれば、プレーヤＰ１は、所定期間を意識してプレーヤＰ１がゲームプレイを行うことができる。例えば、所定期間において、技や連続行動の練習をすれば、それに対応する返し技の行動を学習させることができ。その結果、プレーヤＰ１に最適な練習環境を提供できる。 In this way, the player P1 can play the game keeping in mind the predetermined period. For example, if you practice a technique or a series of actions for a predetermined period of time, you can learn the corresponding counterattack behavior. As a result, an optimal practice environment can be provided to the player P1.

また、プレーヤＰ１が自分の癖が最後にでやすいことを認識している場合、最後の期間（残り１０秒間）を、所定期間を設けるようにすれば、プレーヤＰ１にとって最適な練習環境を創出できる。 Furthermore, if player P1 is aware that his own habits tend to emerge at the end, by setting a predetermined period for the final period (10 seconds remaining), an optimal practice environment for player P1 can be created. .

［１１．３］学習頻度の設定
端末装置２０は、１ラウンドのゲーム期間（例えば、６０秒間）を時系列で複数に区分し、各期間においての学習頻度を設定してもよい。例えば、ゲーム開始タイミングから１０秒経過するまでは、１０回以上の学習を行い、ゲーム開始タイミングから１０秒経過後２０秒経過するまでは、１０回未満５回以上の学習を行い、ゲーム開始タイミングから２０秒経過後は、５回未満の学習を行うように、学習頻度を設定する。 [11.3] Setting Learning Frequency The terminal device 20 may divide the game period of one round (for example, 60 seconds) into a plurality of periods in chronological order, and may set the learning frequency for each period. For example, until 10 seconds have elapsed from the game start timing, learning is performed 10 or more times, and from 10 seconds after the game start timing until 20 seconds have elapsed, learning is performed 5 or more times less than 10 times, and at the game start timing After 20 seconds have passed, the learning frequency is set so that learning is performed less than 5 times.

また、端末装置２０は、１ラウンドのゲーム期間（例えば、６０秒間）を時系列で複数に区分し、各期間において、学習する行動の種類を設定してもよい。 Further, the terminal device 20 may divide the game period of one round (for example, 60 seconds) into a plurality of periods in chronological order, and set the type of behavior to be learned in each period.

まず、行動の種類を、単純行動、技、コンボの３種類に分ける。例えば、ゲーム開始タイミングから１０秒経過するまでは、単純行動、技、コンボの全ての種類の学習を行い、ゲーム開始タイミングから１０秒経過後２０秒経過するまでは、単純行動、技、コンボの行動うち、技、コンボの学習を行い、ゲーム開始タイミングから２０秒経過後は、単純行動、技、コンボのうち、コンボの種類の学習を行うように、学習頻度を設定する。 First, the types of actions are divided into three types: simple actions, techniques, and combos. For example, all types of simple actions, techniques, and combos are learned until 10 seconds have passed from the game start timing, and all types of simple actions, techniques, and combos have been learned from 10 seconds after the game start timing until 20 seconds have passed. Among actions, techniques and combos are learned, and after 20 seconds from the game start timing, the learning frequency is set so that the types of combos among simple actions, techniques, and combos are learned.

本実施形態の端末装置２０は、プレーヤＰ１の操作入力に基づいて、学習頻度を設定し
てもよい。つまり、端末装置２０は、プレーヤＰ１の操作入力に基づいて、ゲーム期間を複数に区分した各期間においての学習頻度、各期間においての学習する行動の種類を設定してもよい。 The terminal device 20 of this embodiment may set the learning frequency based on the operation input of the player P1. In other words, the terminal device 20 may set the learning frequency in each period in which the game period is divided into a plurality of periods and the type of behavior to be learned in each period based on the operation input from the player P1.

［１２］連続行動に関する説明
［１２．１］連続行動の説明
本実施形態の行動は、所定の継続条件を満たす複数の行動である連続行動（コンボ）を含む。連続行動は、別の言い方をすれば連続攻撃である。連続攻撃は、相手キャラクタからの攻撃の隙を与えないようにでき、更に相手キャラクタに大きなダメージを与えることができる有益な攻撃手法である。キャラクタに連続行動を行わせる場合、複数のコマンド入力が要求される。連続して行う行動の数は、２つでもよいし３つ以上の行動でもよい。 [12] Description regarding continuous action [12.1] Description of continuous action The actions of this embodiment include continuous actions (combo) that are a plurality of actions that satisfy a predetermined continuation condition. Continuous actions are, in other words, continuous attacks. Continuous attacks are a useful attack method that can prevent the opponent character from attacking, and can also cause great damage to the opponent character. When making a character perform continuous actions, multiple command inputs are required. The number of consecutive actions may be two or three or more actions.

連続行動は、例えば、複数の行動の順列組み合わせ（右パンチ、右パンチ、左キックの順の連続行動）である。図８に示すように、本実施形態では、予め連続行動を行わせるためのコンボのコマンドを定義する。 A consecutive action is, for example, a sequential combination of multiple actions (a consecutive action of right punch, right punch, left kick). As shown in FIG. 8, in this embodiment, a combo command for performing consecutive actions is defined in advance.

例えば、コンボＡを発動させる場合、図１３に示すように、連続行動期間内（例えば、３秒以内）に連続して、右パンチ、右パンチ、左キックの３つの行動のコマンド入力を行うことが要求される。 For example, when activating Combo A, as shown in Figure 13, commands for three actions, right punch, right punch, and left kick, must be input consecutively within a continuous action period (for example, within 3 seconds). is required.

コンボＡが成功する例は次の通りである。図１３に示すように、例えば、タイミングＴ２１で、プレーヤＰ１の操作入力に基づき第１のキャラクタＯＢ１が右パンチの行動のコマンドを入力し、右パンチの動作が開始される。キャラクタＯＢ１の右パンチの行動に基づき、タイミングＴ２２で、第１のキャラクタＯＢ１が第２のキャラクタＯＢ２にヒットし、右パンチが成功する。なお、第２のキャラクタＯＢ２は第１のキャラクタＯＢ１からヒットを受けると、数フレームの間、硬直状態となりキーデータが反映されない無防備となる。そして、タイミングＴ２３で、プレーヤＰ１の操作入力に基づき第１のキャラクタＯＢ１が右パンチの行動のコマンドを入力し、当該右パンチの動作が開始される。キャラクタＯＢ１のタイミングＴ２３で入力した右パンチの行動に基づき、タイミングＴ２４で、第１のキャラクタＯＢ１が第２のキャラクタＯＢ２にヒットし、右パンチが成功する。そして、タイミングＴ２５で、プレーヤＰ１の操作入力に基づき第１のキャラクタＯＢ１が左キックの行動のコマンドを入力し、当該左キックの動作が開始される。キャラクタＯＢ１のタイミングＴ２５で発動した左キックの行動に基づき、タイミングＴ２６で、第１のキャラクタＯＢ１が第２のキャラクタＯＢ２にヒットし、左キックが成功する。このようにして、コンボＡが成功する。 An example of a successful combo A is as follows. As shown in FIG. 13, for example, at timing T21, the first character OB1 inputs a right punch action command based on an operation input from the player P1, and the right punch action is started. Based on the action of the right punch of the character OB1, the first character OB1 hits the second character OB2 at timing T22, and the right punch is successful. Note that when the second character OB2 receives a hit from the first character OB1, the second character OB2 remains in a rigid state for several frames and is defenseless, with no key data being reflected. Then, at timing T23, the first character OB1 inputs a right punch action command based on the operation input from the player P1, and the right punch action is started. Based on the right punch action input by the character OB1 at timing T23, the first character OB1 hits the second character OB2 at timing T24, and the right punch is successful. Then, at timing T25, the first character OB1 inputs a left kick action command based on the operation input from the player P1, and the left kick action is started. Based on the left kick action performed by the character OB1 at timing T25, the first character OB1 hits the second character OB2 at timing T26, and the left kick is successful. In this way, Combo A is successful.

「継続条件」は、種々考えられる。例えば、「継続条件」は、連続行動期間内に、予め決められた連続行動（コンボＡの場合、右パンチ、右パンチ、左キックの連続行動）を行うことである。別の言い方をすれば、「継続条件」は、相手キャラクタから攻撃を挟まれずに、連続して行動を複数回行うことである。 Various "continuation conditions" can be considered. For example, the "continuation condition" is to perform a predetermined continuous action (in the case of Combo A, a continuous action of right punch, right punch, and left kick) within the continuous action period. In other words, the "continuation condition" is to perform the action multiple times in succession without being attacked by the opponent character.

連続行動は、単発の行動よりもキャラクタに強いダメージを与える（攻撃対象の体力値を、単発の行動よりも多く減少させる）。例えば、タイミングＴ２４の右パンチがヒットしたときは、単発の右パンチよりも強いダメージを与えるようにしてもよい。また、タイミングＴ２６の左キックがヒットしたときは、単発の左キックよりも強いダメージを与えるようにしてもよい。 Continuous actions cause more damage to the character than single actions (decreases the attack target's physical strength more than single actions). For example, when a right punch at timing T24 hits, stronger damage may be caused than a single right punch. Furthermore, when the left kick at timing T26 hits, it may cause more damage than a single left kick.

［１２．２］連続行動と、連続行動の関連情報の学習
連続行動や連続行動に伴うヒット判定等は、図１３に示すように、所与の連続行動期間（Ｔ２１～Ｔ２７の期間）で生じるものである。本実施形態では、連続行動に関連する関
連情報として、この連続行動期間（Ｔ２１～Ｔ２７の期間）で生じる情報を取得する。 [12.2] Learning continuous actions and information related to continuous actions Continuous actions and hit judgments associated with continuous actions occur in a given continuous action period (period T21 to T27), as shown in Figure 13. It is something. In this embodiment, information occurring during this continuous action period (period T21 to T27) is acquired as related information related to the continuous action.

例えば、連続行動の関連情報は、連続行動期間の各フレーム（１／６０秒毎）で取得したキャラクタＯＢ１とキャラクタＯＢ２との距離の平均値（平均距離）の項目を含む。 For example, the continuous action related information includes an item of the average value (average distance) of the distance between character OB1 and character OB2 acquired in each frame (every 1/60 second) of the continuous action period.

また、連続行動の関連情報は、連続行動期間のキャラクタＯＢ２の体力値の減少値の項目を含んでもよい。例えば、連続行動期間（Ｔ２１～Ｔ２７の期間）において、キャラクタＯＢ２の体力値が１００から１０に減少した場合、その差分の９０が減少値となる。 Further, the continuous action related information may include an item of the decrease value of the physical strength value of the character OB2 during the continuous action period. For example, if character OB2's physical strength value decreases from 100 to 10 during the continuous action period (period T21 to T27), the difference, 90, becomes the decreased value.

また、連続行動の関連情報は、連続行動期間の各フレーム（１／６０秒毎）で取得したキャラクタＯＢ１とキャラクタＯＢ２との距離の合計値（延べ値、合計距離）の項目を含んでもよい。 In addition, the related information of the continuous action may include an item of the total value (total value, total distance) of the distance between character OB1 and character OB2 obtained in each frame (every 1/60th of a second) of the continuous action period.

また、連続行動の関連情報は、連続行動期間のキャラクタＯＢ１がキャラクタＯＢ２に与えたダメージの変化率の項目を含んでもよい。 Further, the continuous action related information may include an item of a rate of change in damage inflicted by character OB1 on character OB2 during the continuous action period.

また、連続行動の関連情報は、連続行動を構成する複数の行動の組み合わせの項目を含んでもよい。例えば、コンボＡの場合、右パンチ２回、左キック１回からなる組み合わせである。 Moreover, the related information of a continuous action may include an item of a combination of a plurality of actions that constitute a continuous action. For example, combo A is a combination of two right punches and one left kick.

また、連続行動の関連情報は、連続行動を構成する複数の行動の順番（攻撃順）の項目を含んでもよい。例えば、コンボＡの場合、右パンチ、右パンチ、左キックの順の情報である。 Further, the related information of a continuous action may include an item of the order (attack order) of a plurality of actions that constitute the continuous action. For example, in the case of combo A, the information is the order of right punch, right punch, and left kick.

また、連続行動の関連情報は、連続行動を構成する複数の行動の行動回数（例えば、攻撃回数）の項目を含んでもよい。例えば、コンボＡの場合、右パンチ、右パンチ、左キックの攻撃を行うので、攻撃回数は３回である。 Further, the related information of the continuous action may include an item of the number of actions (for example, the number of attacks) of a plurality of actions that constitute the continuous action. For example, in the case of Combo A, the number of attacks is three because the attacks are a right punch, a right punch, and a left kick.

［１２．３］学習データ
端末装置２０は、連続行動の情報及び、当該連続行動の関連情報として取得した情報を、学習データとして用いる。 [12.3] Learning Data The terminal device 20 uses information on a continuous action and information acquired as information related to the continuous action as learning data.

例えば、端末装置２０は、第１のキャラクタＯＢ１の連続行動の情報（コンボＡの情報）及び、当該連続行動（コンボＡ）の関連情報として取得した情報（例えば、キャラクタ間の平均距離、等）の情報を、セットで学習記憶部２７８に記憶する。 For example, the terminal device 20 includes information on the continuous action of the first character OB1 (combo A information) and information acquired as related information of the continuous action (combo A) (for example, average distance between characters, etc.) The information is stored in the learning storage unit 278 as a set.

そして、端末装置２０は、連続行動の情報と当該連続行動の関連情報とを機械学習する。 Then, the terminal device 20 performs machine learning on the information on the continuous action and the related information on the continuous action.

なお、連続行動期間中は、第１のキャラクタＯＢ１の連続行動及びその関連情報を取得しているが、併せて、第１のキャラクタＯＢ１の単発の行動（例えば、右パンチ、右パンチ、左キックそれぞれの行動）及び各関連情報を取得し、機械学習してもよい。 Note that during the continuous action period, the continuous actions of the first character OB1 and related information are acquired, but in addition, the single actions of the first character OB1 (for example, right punch, right punch, left kick) are acquired. Each action) and each related information may be acquired and machine learning may be performed.

［１２．４］連続行動の探索
本実施形態では、第１のキャラクタＯＢ１の連続行動及び当該連続行動の関連情報が、学習モデルにおいて未学習である場合に、当該連続行動に関連する関連情報に基づいて、学習済みの行動（単一の行動、又は、連続行動）の関連情報を探索する。 [12.4] Searching for consecutive actions In this embodiment, when the consecutive actions of the first character OB1 and related information for the consecutive actions have not been learned in the learning model, related information for learned actions (single actions or consecutive actions) is searched for based on related information related to the consecutive actions.

端末装置２０は、第１のキャラクタＯＢ１の「コンボＡ」の行動が、学習モデルにおいて未学習である場合、学習済みの関連情報の中から、コンボＡの関連情報と一致度の高い
関連情報を探索する。 When the action of "combo A" of the first character OB1 has not been learned in the learning model, the terminal device 20 searches for related information that has a high degree of match with the related information of combo A from among the learned related information.

例えば、関連情報のうち、第１のキャラクタＯＢ１と第２のキャラクタＯＢ２の平均距離に基づいて探索を行う場合、次のように処理する。 For example, when performing a search based on the average distance between the first character OB1 and the second character OB2 among related information, the following processing is performed.

関連情報の１項目である「コンボの攻撃順」の項目に重きをおいて、探索を行う場合の例を説明する。例えば、コンボＡが、右パンチ、右パンチ、左キックの順で３つの攻撃を行う場合、端末装置２０は、攻撃順が最も似ている学習済みのコンボを探索する。例えば、コンボＢが、右パンチ、右パンチ、右キックが、コンボＡに最も似ている学習済みのコンボとして探索される。すると、端末装置２０は、コンボＢに対応する行動を、第２のキャラクタＯＢ２に行わせる。 An example of a case where a search is performed with emphasis on the item "combo attack order", which is one item of related information, will be explained. For example, when combo A performs three attacks in the order of right punch, right punch, and left kick, the terminal device 20 searches for a learned combo that has the most similar attack order. For example, combo B is searched as a learned combo that is most similar to combo A, including a right punch, right punch, and right kick. Then, the terminal device 20 causes the second character OB2 to perform an action corresponding to combo B.

例えば、図１３に示すように、端末装置２０は、コンボＢに対応する行動が「起き上がりキック」の場合、第２のキャラクタＯＢ２に「起き上がりキック」の行動を行わせる。つまり、端末装置２０は、学習モデルに、第１のキャラクタＯＢ１の「コンボＡ」の連続行動期間の終了後（例えばタイミングＴ２７以後）であって、第２のキャラクタＯＢ２が行動可能状態となったタイミングＴ２８で、第２のキャラクタＯＢ２に対し、「起き上がりキック」の移動・動作制御を行う。 For example, as shown in FIG. 13, when the action corresponding to combo B is "get up kick", the terminal device 20 causes the second character OB2 to perform the "get up kick" action. That is, the terminal device 20 uses the learning model to determine that the second character OB2 is ready to act after the continuous action period of "Combo A" of the first character OB1 ends (for example, after timing T27). At timing T28, the second character OB2 is controlled to move and perform a "stand-up kick."

次に、関連情報の１項目である「キャラクタＯＢ１とキャラクタＯＢ２との平均距離」の項目に重きをおいて探索を行う場合の例を説明する。例えば、コンボＡの連続行動期間の平均距離（第１のキャラクタＯＢ１と第２のキャラクタＯＢ２の平均距離）が０．８メートルである場合、端末装置２０は、当該平均距離と一致度の高い学習済みの行動（連続行動又は単一の行動）を探索する。例えば、学習済みのコンボＣの当該平均距離（第１のキャラクタＯＢ１と第２のキャラクタＯＢ２の平均距離）が０．８メートルであり、コンボＣが、コンボＡに最も一致度が高い学習済みのコンボとして探索される。すると、端末装置２０は、コンボＣに対応する行動を、第２のキャラクタＯＢ２に行わせる。 Next, an example of a case where a search is performed with emphasis on the "average distance between character OB1 and character OB2", which is one item of related information, will be described. For example, if the average distance during the continuous action period of combo A (average distance between the first character OB1 and the second character OB2) is 0.8 meters, the terminal device 20 searches for learned actions (continuous actions or single actions) that match the average distance. For example, the average distance (average distance between the first character OB1 and the second character OB2) of learned combo C is 0.8 meters, and combo C is searched for as the learned combo that matches combo A most closely. Then, the terminal device 20 causes the second character OB2 to perform the action corresponding to combo C.

例えば、端末装置２０は、コンボＣに対応する行動が「後方受け身」の場合、第２のキャラクタＯＢ２に「後方受け身」の行動を行わせる。つまり、端末装置２０は、学習モデルに、第１のキャラクタＯＢ１の「コンボＡ」の連続行動期間の終了後であって、第２のキャラクタＯＢ２が行動可能状態となったタイミングＴ２８で、第２のキャラクタＯＢ２に対し、「後方受け身」の移動・動作制御を行う。 For example, when the action corresponding to combo C is "backward passive", the terminal device 20 causes the second character OB2 to perform the "backward passive" action. In other words, the terminal device 20 uses the second character OB2 in the learning model at timing T28 after the end of the continuous action period of "Combo A" of the first character OB1 and when the second character OB2 becomes ready for action. ``Backward passive'' movement and movement control is performed for the character OB2.

このように本実施形態によれば、コンボＡが未学習であったとしても、コンボＡの一連の流れがどれだけ一致しているかを学習済みのコンボの中から探索し、第２のキャラクタＯＢ２の行動を適切に行う。 In this way, according to the present embodiment, even if combo A has not been learned, the degree to which the sequence of combo A matches is searched among the learned combos, and the second character OB2 Conduct appropriate actions.

なお、連続行動を探索する場合において、重視する関連情報の項目を変えて探索してもよい。例えば、コンボのキャラクタ間の平均距離、攻撃順だけでなく、他の関連情報の項目の一例として、例えば、攻撃回数を重視する場合、コンボＡが、右パンチ２回、左キック１回の合計３回の攻撃回数による行動であるので、学習済みであって、かつ、攻撃回数が「３回」である連続行動を探索する。例えば、コンボＣが、攻撃回数が３回であり、コンボＡに最も一致度が高い学習済みのコンボとして探索されると、端末装置２０は、コンボＣに対応する行動を、第２のキャラクタＯＢ２に行わせる。 Note that when searching for continuous actions, the items of related information to be emphasized may be changed. For example, in addition to the average distance between characters in a combo and the order of attacks, other related information items include, for example, if emphasis is placed on the number of attacks, Combo A is a total of two right punches and one left kick. Since the action is based on the number of attacks of three times, a continuous action that has been learned and has the number of attacks of "3" is searched for. For example, when combo C has three attacks and is searched as a learned combo that has the highest degree of matching with combo A, the terminal device 20 transmits the action corresponding to combo C to second character OB2. have it done.

［１３］関連情報の項目の設定
本実施形態の端末装置２０は、予め機械学習で用いる関連情報の項目を設定してもよい。また、本実施形態の端末装置２０は、プレーヤＰ１の入力情報に基づいて、関連情報の複数の項目を設定してもよい。 [13] Setting of Related Information Items The terminal device 20 of the present embodiment may set items of related information to be used in machine learning in advance. In addition, the terminal device 20 of the present embodiment may set multiple items of related information based on input information of the player P1.

端末装置２０は、プレーヤＰ１の操作入力に基づいて、関連情報の複数の項目のうち、機械学習で用いる少なくとも２以上の項目の組み合わせを決定してもよい。 The terminal device 20 may determine a combination of at least two or more items to be used in machine learning among the plurality of items of related information based on the operation input of the player P1.

関連情報の各項目の組み合わせとは、例えば、関連情報の複数の項目のうち少なくとも２以上の項目の組を意味する。 A combination of items of related information means, for example, a set of at least two or more items among a plurality of items of related information.

［１４］重みづけ
端末装置２０は、は、関連情報の各項目の重みづけを決定する。重みづけは各項目の重要度を評価（点数化）することである。つまり、本実施形態では、学習対象の関連情報の各項目について、重要度を数値化することにより、重みづけを決定する。 [14] Weighting The terminal device 20 determines the weighting of each item of related information. Weighting involves evaluating (scoring) the importance of each item. That is, in this embodiment, weighting is determined by quantifying the degree of importance for each item of related information to be learned.

例えば、端末装置２０は、プレーヤの情報、第１のキャラクタの情報、ゲーム情報の少なくとも１つに応じて、重みづけを決定してもよい。 For example, the terminal device 20 may determine the weighting according to at least one of player information, first character information, and game information.

端末装置２０は、重みづけによる数値（重みの値）は、機械学習の演算に用いてもよい。 The terminal device 20 may use the weighted numerical value (weight value) for machine learning calculations.

端末装置２０は、学習前（機械学習前）に重みづけを行う。このようにすれば、早期に機械学習を効率的に行うことができる。つまり、リアルタイムに重みづけをする場合は、ある程度の時間を要するが、本実施形態では、早期に重みづけをすることで、効率的に学習を図ることができる。 The terminal device 20 performs weighting before learning (before machine learning). In this way, machine learning can be performed efficiently at an early stage. That is, when weighting is performed in real time, it takes a certain amount of time, but in this embodiment, by performing weighting early, learning can be achieved efficiently.

端末装置２０は、ゲーム進行状況に応じて（例えば、学習中）に重みづけを行ってもよい。このようにすれば、リアルタイムに効率よく学習できる。また、ゲーム進行状況に適した重みづけを行うことができる。 The terminal device 20 may perform weighting depending on the progress of the game (for example, during learning). In this way, you can learn efficiently in real time. Further, it is possible to perform weighting appropriate to the progress of the game.

例えば、初期状態で、関連情報の各項目の重みの値を「１」に設定し、重要度の高い項目の値を「１」よりも大きな値にする。重要度の高い項目は、１つでもよいし複数あってもよい。 For example, in the initial state, the weight value of each item of related information is set to "1", and the value of an item with high importance is set to a value larger than "1". There may be one or more items with high importance.

具体的に説明すると、端末装置２０は、プレーヤＰ１の操作入力に基づいて、学習対象の関連情報の各項目のうち「第１のキャラクタＯＢ１と第２のキャラクタＯＢ２との距離」の項目と、「１ラウンドの残り時間」の項目とに重きを置きたい項目として選択を受け付けた場合、この２つの項目の重みを「３」に設定し、この項目以外の重みを「１」のまま維持する。 Specifically, based on the operation input from the player P1, the terminal device 20 selects the item "distance between the first character OB1 and the second character OB2" among the items of related information to be learned; If the item "Remaining time of 1 round" is selected as an item to be prioritized, set the weight of these two items to ``3'', and maintain the weight of other items as ``1''. .

［１４．１］プレーヤ情報に基づく重みづけの設定
端末装置２０は、プレーヤ情報に応じて、学習対象の関連情報の各項目の重みづけを決定してもよい。事前に設定されたプレーヤの情報から、学習する際に優先順位の高い項目に重みづけを設定し、ユーザの満足度の高い動作を実現するための学習を実現することができる。 [14.1] Setting Weighting Based on Player Information The terminal device 20 may determine the weighting of each item of related information to be learned, depending on the player information. Based on player information set in advance, weighting can be set for items with high priority during learning, and learning can be performed to achieve actions that are highly satisfying to the user.

プレーヤ情報とは、プレーヤのレベル、ランク、初心者、中級者又は上級者のうちいずれに該当するかの情報、格闘ゲームのゲームプレイ時間、勝率、対戦回数、戦績、等である。 The player information includes the player's level, rank, information on whether the player is a beginner, intermediate, or advanced player, game play time of the fighting game, winning rate, number of battles, battle record, and the like.

例えば、初心者の場合は、体力値を気にする傾向があるが、中級者や上級者は体力値を気にせずに、相手との距離を気にする傾向がある。したがって、初心者は体力値の項目の重みの値を「３」に設定し、この項目以外の重みを「１」のまま維持する。 For example, beginners tend to be concerned about their physical strength, while intermediate and advanced players tend not to care about their physical strength but rather the distance to their opponent. Therefore, a beginner sets the weight value of the physical strength value item to "3" and maintains the weight value of other items as "1".

中級者や上級者は体力値の項目の重みの値を「３」に設定し、この項目以外の重みを「１」のまま維持する。 Intermediate and advanced players should set the weight value for the stamina item to "3" and keep the weights for all other items at "1".

プレーヤ情報とは、プレーヤＰ１の情報（年齢や性別）、に応じて、学習する各項目の重みづけを決定してもよい。 Player information may determine the weighting of each item to be learned based on information about player P1 (age and gender).

［１４．２］プレーヤキャラクタ情報に基づく重みづけの設定
そして、端末装置２０は、プレーヤキャラクタ情報に応じて、学習対象の関連情報の各項目の重みづけを決定してもよい。 [14.2] Setting Weighting Based on Player Character Information The terminal device 20 may determine the weighting of each item of related information to be studied according to the player character information.

プレーヤキャラクタ情報とは、例えば、プレーヤＰ１が操作したキャラクタの数、キャラクタの種類等である。また、プレーヤが管理するキャラクタ毎のキャラクタの技（コンボを含む）の習得度でもよい。技の習得度は、キャラクタの全ての技の数のうち、当該キャラクタが習得した技の数の割合を示す。例えば、プレーヤＰ１が練習モードにおいて成功した技については習得したとみなして判定する。 The player character information includes, for example, the number of characters operated by the player P1, the type of characters, and the like. Alternatively, the learning level of character skills (including combos) for each character managed by the player may be used. The degree of skill acquisition indicates the ratio of the number of skills that the character has learned out of all the number of skills of the character. For example, a technique that the player P1 succeeds in in the practice mode is determined to have been learned.

例えば、習得度の高いキャラクタ（例えば、習得度５０％以上のキャラクタ）の場合、関連情報の各項目のうち、距離の項目の重みを「３」に設定し、他の項目よりも高くする。 For example, in the case of a character with a high learning level (for example, a character with a learning level of 50% or more), the weight of the distance item among the related information items is set to "3", which is higher than the other items.

例えば、習得度の低いキャラクタ（例えば、習得度５０％未満のキャラクタ）の場合、関連情報の各項目のうち、体力値の重みを「３」に設定し、他の項目よりも高くする。 For example, for a character with a low level of proficiency (e.g., a character with a level of proficiency of less than 50%), the weight of the stamina value among the related information items is set to "3", making it higher than the other items.

［１４．３］ゲーム情報に基づく重みづけの設定
そして、端末装置２０は、ゲーム情報に応じて、学習対象の関連情報の各項目の重みづけを決定してもよい。 [14.3] Setting Weighting Based on Game Information Then, the terminal device 20 may determine the weighting of each item of related information of the learning target according to the game information.

ゲーム情報とは、例えば、プレーヤＰ１及び対戦相手プレーヤそれぞれのレベル、レベル差、プレーヤＰ１の操作対象のキャラクタＯＢ１及び対戦相手のキャラクタＯＢ２のレベル、レベル差、等である。 The game information includes, for example, the level and level difference between the player P1 and the opponent player, the level and level difference between the character OB1 to be operated by the player P1 and the opponent character OB2, and the like.

例えば、レベル差が１０以上ある場合、関連情報の各項目のうち、距離の項目の重みを「３」に設定するなど、距離以外項目よりも高くする
また、レベル差が９以下である場合、関連情報の各項目のうち、体力値の重みを「３」に設定し、体力値以外項目よりも高くする。 For example, if the level difference is 10 or more, the weight of the distance item among the related information items is set to "3", so that it is higher than the weight of non-distance items.Also, if the level difference is 9 or less, Among the items of related information, the weight of the physical strength value is set to "3", making it higher than items other than the physical strength value.

また、ゲーム情報は、カジュアルモード、又は真剣モードなどのモードの種類でもよい。カジュアルモードは、気楽に行うモードであり、真剣モードは、運営会社が開催するイベント、大会やＥスポーツなどの真剣な格闘ゲームを行うモードである。 Further, the game information may be the type of mode such as casual mode or serious mode. The casual mode is a mode for playing casually, and the serious mode is a mode for playing serious fighting games such as events, tournaments, and E-sports held by the management company.

また、ゲームモードが真剣モードである場合、関連情報の各項目のうち、距離及び体力値の重みを「３」に設定し、体力値以外項目よりも高くする。 In addition, when the game mode is serious mode, the weighting of the distance and stamina value among the related information items is set to "3", which is higher than the weighting of items other than stamina value.

［１５］プレーヤの操作入力に基づく設定
［１５．１］重みの設定画面の具体例
本実施形態では、プレーヤＰ１の操作入力に基づき、第１のキャラクタＯＢ１の関連情報の複数の項目のうち、重みが高い項目の選択を受け付けるようにしてもよい。 [15] Settings based on player's operation input [15.1] Specific example of weight setting screen In this embodiment, based on the player P1's operation input, among the plurality of items of related information of the first character OB1, It may also be possible to accept selections of items with high weights.

図１４は、重み設定画面の一例を示す。例えば、図１４に示すように、プレーヤＰ１が
第１のキャラクタＯＢ１の体力値を気にしている場合であって、第１のキャラクタＯＢ１の体力値を重視したいと考えている場合、体力値の項目の重みを高くする項目として選択する。 Fig. 14 shows an example of a weight setting screen. For example, as shown in Fig. 14, if the player P1 is concerned about the vitality value of the first character OB1 and wants to place importance on the vitality value of the first character OB1, the player P1 selects the item of vitality value as an item to be weighted higher.

また、図示していないが、プレーヤＰ１が、第１のキャラクタＯＢ１と第２のキャラクタＯＢ２との距離を重視したい場合、距離の項目の重みを高くする項目として選択する。 Further, although not shown, if the player P1 wants to place importance on the distance between the first character OB1 and the second character OB2, he selects the distance item as an item to have a higher weight.

本実施形態の重み設定画面では、図１５に示すように、プレーヤＰ１の操作入力に基づき、第１のキャラクタＯＢ１の関連情報の複数の項目それぞれに百分率（パーセンテージ）で重みの配分率を決めてもよい。そして、関連情報の各項目の重みを、配分率に応じてとの重みの値を決定する。配分率が高い程、重みの値が高くなるように重みを設定する。逆に言えば、配分率が低い程、重みの値が低くなるように、重みを設定する。 In the weight setting screen of this embodiment, as shown in FIG. 15, based on the operation input of the player P1, a weight distribution rate is determined in percentage for each of a plurality of items of related information of the first character OB1. Good too. Then, the weight value of each item of related information is determined according to the distribution ratio. The weights are set such that the higher the distribution rate, the higher the weight value. Conversely, the weights are set such that the lower the allocation rate, the lower the weight value.

また、本実施形態の重み設定画面では、プレーヤＰ１の操作入力に基づき、関連情報の複数の項目それぞれの優先順位を設定してもよい。 Further, in the weight setting screen of this embodiment, the priority order of each of the plurality of items of related information may be set based on the operational input of the player P1.

例えば、図１６に示すように、第１のキャラクタＯＢ１の関連情報の複数の項目の優先順位を決めてもよい。端末装置２０は、優先順位に基づいて、第１のキャラクタＯＢ１の関連情報の複数の項目それぞれの重みを決定する。つまり、優先順位が高い程、重みが高くなるように重みを決定する。このようにすれば、プレーヤは簡単に重みの設定を行うことができる。また、関連情報の全ての項目のうちの一部の項目（例えば、上位５項目）について優先順位を設定してもよい。 For example, as shown in FIG. 16, priorities may be determined for a plurality of items of information related to the first character OB1. The terminal device 20 determines the weight of each of the plurality of items of related information of the first character OB1 based on the priority order. In other words, the weights are determined so that the higher the priority, the higher the weight. In this way, the player can easily set the weights. Furthermore, priorities may be set for some items (for example, the top five items) among all items of related information.

［１６］プレーヤの情報に基づく重み等の設定
本実施形態の端末装置２０は、プレーヤの情報に基づいて、学習前（例えば、機械学習前）に、第１のキャラクタＯＢ１の関連情報を構成する複数の項目の組み合わせ、学習頻度、及び、関連情報を構成する複数の項目それぞれの重みの少なくとも１つを設定してもよい。 [16] Setting weights, etc. based on player information The terminal device 20 of the present embodiment configures the related information of the first character OB1 before learning (for example, before machine learning) based on the player information. At least one of a combination of a plurality of items, a learning frequency, and a weight of each of a plurality of items constituting the related information may be set.

また、端末装置２０は、プレーヤの情報に基づいて、ゲーム進行状況に応じて、第１のキャラクタＯＢ１の関連情報を構成する複数の項目の組み合わせ、学習頻度、及び、関連情報を構成する複数の項目それぞれの重みの少なくとも１つを設定してもよい。 Furthermore, the terminal device 20 determines the combination of a plurality of items constituting the related information of the first character OB1, the learning frequency, and the plurality of items constituting the related information according to the game progress based on the player's information. At least one weight may be set for each item.

別の言い方をすると、端末装置２０は、プレーヤの情報に基づいて、ゲーム中、学習中、学習後、ゲーム進行度、又は、ゲーム進捗度に応じて、第１のキャラクタＯＢ１の関連情報を構成する複数の項目の組み合わせ、学習頻度、及び、関連情報を構成する複数の項目それぞれの重みの少なくとも１つを設定してもよい。 In other words, the terminal device 20 configures the related information of the first character OB1 during the game, during learning, after learning, the progress of the game, or according to the progress of the game, based on the player's information. At least one of a combination of a plurality of items, a learning frequency, and a weight of each of a plurality of items making up the related information may be set.

［１６．１］プレーヤの情報の説明
プレーヤの情報について詳細に説明する。プレーヤの情報は、格闘ゲームのゲームプレイ時間、ゲーム経験の有無、格闘ゲームの戦歴、対戦数、対戦成績、勝率、今まで対戦した対戦相手の情報、対戦相手のレベル（段位、ランク）である。 [16.1] Explanation of Player Information The player information will be explained in detail. Player information includes fighting game playing time, experience with the game, fighting game history, number of matches, match results, winning percentage, information on opponents played so far, and the level of the opponent (dan, rank). .

また、プレーヤの情報は、プレーヤが格闘ゲームで使用したキャラクタの情報や、当該キャラクタで習得した技や連続行動（コンボ）、当該キャラクタが使用した技や連続行動を含む。 The player information also includes information on the character used by the player in the fighting game, techniques and continuous actions (combos) learned by the character, and techniques and continuous actions used by the character.

また、プレーヤが格闘ゲームで使用したキャラクタのうち、最も多く使用しているキャラクタの情報を含む。 It also includes information about the character that the player uses most often among the characters used in the fighting game.

例えば、格闘ゲームのゲームプレイ時間は、プレーヤが、格闘ゲームを起動（ログイン、電源オン）してから、終了（スリープ、ログアウト、電源オフ等）するまでの期間の合計期間である。 For example, the game playing time of a fighting game is the total period from when the player starts the fighting game (login, power on) until it ends (sleep, logout, power off, etc.).

ゲームプレイ時間が、１時間未満である場合、初心者であるとみなされ、体力値に重みをおくように重みを決定する。例えば、関連情報のキャラクタＯＢ１の体力値の項目の重みが、他の項目よりも高くなるように重みを設定する。 If the game play time is less than one hour, the player is considered to be a beginner, and the weight is determined so that weight is given to the physical strength value. For example, the weight is set so that the item of the physical strength value of the character OB1 in the related information has a higher weight than other items.

プレーヤが行った行動のうち、遠距離攻撃よりも近距離攻撃が多い場合、第１のキャラクタＯＢ１と第２のキャラクタＯＢ２との距離を気にしている可能性が高い。そのため、関連情報の第１のキャラクタＯＢ１と第２のキャラクタＯＢ２との距離の項目の重みが、他の項目よりも高くなるように重みを設定する。 If there are more short-range attacks than long-range attacks among the actions performed by the player, there is a high possibility that the player is concerned about the distance between the first character OB1 and the second character OB2. Therefore, the weight is set so that the item of the distance between the first character OB1 and the second character OB2 of the related information has a higher weight than other items.

［１６．２］プレイ履歴
プレーヤの情報は、プレーヤのプレイ履歴に関する情報を含む。例えば、プレーヤＰ１のゲームプレイ履歴は、プレーヤＰ１の操作対象の第１のキャラクタＯＢ１の行動情報の履歴、関連情報の履歴である。例えば、プレーヤＰ１のゲームプレイ履歴から攻撃パターンを判定する。つまり、第１のキャラクタＯＢ１が、第２のキャラクタＯＢ２を壁に追い詰めて、技を繰り出す行動が多い場合、第１のキャラクタＯＢ１の位置情報が重要視される。そのため、関連情報の第１のキャラクタＯＢ１の位置情報の項目の重みが、他の項目よりも高くなるように重みを設定する。 [16.2] Play History The player information includes information regarding the player's play history. For example, the game play history of the player P1 is a history of behavior information and related information of the first character OB1 to be operated by the player P1. For example, the attack pattern is determined from the game play history of the player P1. In other words, when the first character OB1 often corners the second character OB2 against a wall and performs techniques, the positional information of the first character OB1 is considered important. Therefore, the weight is set so that the item of position information of the first character OB1 in the related information has a higher weight than other items.

［１６．３］過去のプレイ傾向
プレーヤの情報は、プレーヤの過去のプレイ傾向に関する情報を含んでもよい。 [16.3] Past Play Tendency The player information may include information regarding the player's past play tendency.

例えば、プレーヤＰ１の過去のプレイ傾向は、カジュアルモード又は真剣モードのいずれのモードでのゲームプレイが多いかを示す情報、上級者、中級者又は初心者のうち、いずれのプレーヤとの対戦が多いかを示す情報、プレーヤＰ１が、どのようなプレイスタイルが多いかを示す情報等である。 For example, player P1's past play tendencies include information indicating whether he plays games in casual mode or serious mode, and which player he plays most often among advanced players, intermediate players, or beginners. , and information indicating what kind of play style the player P1 tends to play.

プレイスタイルとしては、どういうキャラクタを使用して、どのような試合の流れが多いのか（例えば、ゲーム開始１０秒以内に大技を繰り出して相手キャラクタの体力値を０にする傾向が強い、後半３０秒間で、猛攻が繰り広げられる傾向）等である。 In terms of play style, what kind of character do you use and what kind of flow does the match have? (For example, players tend to perform a big move within the first 10 seconds of the game and reduce the opponent's character's health to 0. (a tendency for an onslaught of attacks to unfold within seconds).

例えば、プレーヤの過去のプレイ傾向が、カジュアルモードよりも真剣モードが多い場合、関連情報の第１のキャラクタＯＢ１の体力値の項目の重みが、他の項目よりも高くなるように重みを設定する。 For example, if the player's past playing tendency is to play in serious mode more often than casual mode, the weighting of the stamina value item of the first character OB1 in the related information is set higher than the weighting of other items.

また、プレーヤの過去のプレイ傾向が、初心者が多い場合、関連情報の第１のキャラクタＯＢ１の体力値の項目の重みが、他の項目よりも高くなるように重みを設定する。 Furthermore, if the player's past play tendency is that there are many beginners, the weight is set so that the item of the physical strength value of the first character OB1 in the related information is weighted higher than other items.

また、プレーヤの過去のプレイ傾向が、ゲーム開始１０秒以内に大技を繰り出して相手キャラクタの体力値を０にする傾向がある場合、関連情報の第１のキャラクタＯＢ１の体力値の項目と第２のキャラクタＯＢ２の体力値の項目との重みが、他の項目よりも高くなるように重みを設定する。 In addition, if the player's past play tendency is to perform a big move and reduce the opponent character's physical strength to 0 within 10 seconds of the start of the game, the item of physical strength of the first character OB1 in the related information and the The weight is set so that the weight with the physical strength value item of the character OB2 of No. 2 is higher than the other items.

［１６．４］他のプレーヤに関連する情報
プレーヤの情報は、プレーヤの情報に基づいて決定される他のプレーヤに関連する情報を含んでもよい。 [16.4] Information related to other players The player information may include information related to other players determined based on the player information.

例えば、プレーヤＰ１の情報に基づいて決定される他のプレーヤに関連する情報とは、プレーヤＰ１とレベルが似ている他のプレーヤである。 For example, the information related to other players determined based on the information of player P1 is other players whose level is similar to player P1.

例えば、プレーヤＰ１が上級者レベルである場合、上級者レベルの他のプレーヤＰ２の関連情報のうち重みの高い項目と同じ項目について、重みを高くする項目を決定する。 For example, when player P1 is at an expert level, an item to be given a higher weight is determined for the same item as an item with a higher weight among related information of another player P2 at an expert level.

上級者レベルの他のプレーヤＰ２の関連情報のうち重みの高い項目が、壁からの距離である場合、プレーヤＰ１の関連情報の壁からの距離の項目の重みが、他の項目よりも高くなるように重みを設定する。 If the item with a high weight among the related information of another player P2 at an advanced level is the distance from the wall, the weight of the distance from the wall item of the related information of the player P1 will be higher than other items. Set the weight as follows.

例えば、サーバ装置１０が、サーバ装置１０が管理する各プレーヤの関連情報を収集して一元管理してもよい。そして、プレーヤＰ１の端末装置２０が、サーバ装置１０から、プレーヤＰ１の情報に基づいて決定される他のプレーヤＰ２に関連する情報として、例えば、「プレーヤＰ２の関連情報の複数項目のうち重みの高い項目」を取得する。「プレーヤＰ２の関連情報の複数項目のうち重みの高い項目」は、最も重みの高い項目１つでもよいし、重みの高い上位（例えば、上位５つ）の項目でもよい。 For example, the server device 10 may collect and centrally manage information related to each player managed by the server device 10. Then, the terminal device 20 of the player P1 receives, from the server device 10, information related to another player P2 determined based on the information of the player P1, for example, "the weight of the plurality of items of related information of the player P2". Get "high item". The "item with the highest weight among the plurality of items of related information about the player P2" may be the single item with the highest weight, or may be the item with the highest weight (for example, the top five).

［１６．５］ゲーム進行状況に応じて重みづけを設定する例の補足説明
端末装置２０は、プレーヤの情報に基づいて、ゲーム進行状況に応じて、リアルタイムに第１のキャラクタＯＢ１の関連情報を構成する複数の項目それぞれの重みの設定を行ってもよい。 [16.5] Supplementary explanation of the example of setting weighting according to the game progress The terminal device 20 sets the related information of the first character OB1 in real time according to the game progress based on the player's information. Weights may be set for each of the plurality of constituent items.

例えば、端末装置２０は、第１のキャラクタＯＢ１の体力値が減少している体力減少状況（ゲーム進行状況の一例）を検出すると、第１のキャラクタＯＢ１の関連情報を構成する各項目の重みの値を、当該体力減少状況に対応する各項目の重みの値に設定する。 For example, when the terminal device 20 detects a physical strength reduction situation (an example of a game progress situation) in which the physical strength value of the first character OB1 is decreased, the terminal device 20 changes the weight of each item constituting the related information of the first character OB1. The value is set as the weight value of each item corresponding to the physical strength reduction situation.

また、例えば、端末装置２０は、第１のキャラクタＯＢ１の攻撃が成功している攻撃成功状況（ゲーム進行状況の一例）を検出すると、第１のキャラクタＯＢ１の関連情報を構成する各項目の重みの値を、当該攻撃成功状況に対応する各項目の重みの値に設定する。 Further, for example, when the terminal device 20 detects a successful attack situation (an example of a game progress situation) in which the attack of the first character OB1 is successful, the terminal device 20 determines the weight of each item constituting the related information of the first character OB1. The value of is set as the weight value of each item corresponding to the attack success situation.

［１６．６］複数の項目の組み合わせの設定
端末装置２０は、学習前において、学習前のプレーヤの情報に基づいて、関連情報を構成する複数の項目の組み合わせを、設定してもよい。例えば、プレーヤが初心者である場合は、初心者用の関連情報を構成する複数の項目の組み合わせを、設定する。プレーヤが上級者である場合は、上級者用の関連情報を構成する複数の項目の組み合わせを、設定する。 [16.6] Setting a combination of a plurality of items Before learning, the terminal device 20 may set a combination of a plurality of items constituting the related information based on the player's information before learning. For example, if the player is a beginner, a combination of a plurality of items constituting the related information for beginners is set. If the player is an advanced player, a combination of a plurality of items constituting related information for advanced players is set.

また、端末装置２０は、ゲーム開始後において、ゲーム進行状況に応じて、リアルタイムに第１のキャラクタＯＢ１の関連情報を構成する複数の項目の組み合わせを設定してもよい。例えば、体力減少状況を検出すると、当該体力減少状況に対応する複数の項目の組み合わせを設定する。例えば、攻撃成功状況を検出すると、当該攻撃成功状況に対応する複数の項目の組み合わせを設定する。 Further, after the game starts, the terminal device 20 may set a combination of a plurality of items that constitute the related information of the first character OB1 in real time according to the progress of the game. For example, when a state of decrease in physical strength is detected, a combination of a plurality of items corresponding to the state of decrease in physical strength is set. For example, when a successful attack situation is detected, a combination of a plurality of items corresponding to the successful attack situation is set.

［１６．７］学習頻度の設定
端末装置２０は、学習前において、学習前のプレーヤの情報に基づいて、学習頻度を設定してもよい。例えば、プレーヤが初心者である場合は、初心者用の学習頻度を設定する。プレーヤが上級者である場合は、上級者用の学習頻度を、設定する。 [16.7] Setting Learning Frequency Before learning, the terminal device 20 may set the learning frequency based on the player's information before learning. For example, if the player is a beginner, a learning frequency for beginners is set. If the player is an advanced player, a learning frequency for advanced players is set.

また、端末装置２０は、ゲーム開始後において、ゲーム進行状況に応じて、リアルタイムに学習頻度を設定してもよい。例えば、体力減少状況を検出すると、当該体力減少状況
に対応する学習頻度を設定する。例えば、攻撃成功状況を検出すると、当該攻撃成功状況に対応する学習頻度を設定する。 Further, the terminal device 20 may set the learning frequency in real time according to the progress of the game after starting the game. For example, when a state of decrease in physical strength is detected, a learning frequency corresponding to the state of decrease in physical strength is set. For example, when a successful attack situation is detected, a learning frequency corresponding to the successful attack situation is set.

［１７］フローチャート
図１７Ａ、図１７Ｂは、本実施形態の行動制御の一例を示すフローチャートを示す。図１７Ａ、図１７Ｂを用いて、プレーヤＰ１の端末装置２０において、プレーヤＰ１の操作対象のキャラクタＯＢ１と、ＡＩキャラクタである第２のキャラクタＯＢ２とが対戦する場合において、第２のキャラクタの行動制御の流れについて説明する。 17A and 17B are flowcharts showing an example of behavior control according to the present embodiment. Using these flowcharts, a flow of behavior control of a second character OB2, which is an AI character, will be described when a character OB1 operated by a player P1 competes against a second character OB2 on the terminal device 20 of the player P1.

まず、図１７Ａに示すように、重みを設定する（ステップＳ１）。例えば、行動に関連する関連情報の複数の項目それぞれについて重みを設定する。重みの設定は、プレーヤＰ１の操作入力に基づいて行ってもよい。 First, as shown in FIG. 17A, weights are set (step S1). For example, a weight is set for each of a plurality of items of related information related to an action. The weights may be set based on the operational input of the player P1.

そして、ゲームが開始されると（ステップＳ２）、プレーヤＰ１の操作入力に基づいて第１のキャラクタＯＢ１の行動を制御する（ステップＳ３）。 Then, when the game starts (step S2), the behavior of the first character OB1 is controlled based on the operation input from the player P1 (step S3).

未学習の第１のキャラクタＯＢ１の行動と当該行動に関連する関連情報に遭遇したか否かを判断する（ステップＳ４）。 Determine whether or not the unlearned behavior of the first character OB1 and related information related to that behavior have been encountered (step S4).

未学習の第１のキャラクタＯＢ１の行動と当該行動に関連する関連情報に遭遇しない場合（ステップＳ４のＮ）、関連情報に基づいて、学習モデルを用いて、第２のキャラクタＯＢ２の行動を決定する（ステップＳ５）。そして、学習を行う（ステップＳ６）。例えば、第１のキャラクタＯＢ１の行動と当該行動に関連する関連情報とを学習する。そして、ステップＳ１４に進む。 If the action of the unlearned first character OB1 and related information related to the action are not encountered (N in step S4), the action of the second character OB2 is determined using the learning model based on the related information. (Step S5). Then, learning is performed (step S6). For example, the behavior of the first character OB1 and related information related to the behavior are learned. Then, the process advances to step S14.

なお、未学習の第１のキャラクタＯＢ１の行動と当該行動に関連する関連情報に遭遇した場合（ステップＳ４のＹ）、図１７ＢのステップＳ１１に進む。そして、図１７Ｂに示すように、第１のキャラクタＯＢ１の行動に関連する関連情報に基づき、学習済みの第１のキャラクタＯＢ１の行動の関連情報を探索する（ステップＳ１１）。 Note that if the action of the unlearned first character OB1 and related information related to the action are encountered (Y in step S4), the process advances to step S11 in FIG. 17B. Then, as shown in FIG. 17B, based on the related information related to the behavior of the first character OB1, information related to the learned behavior of the first character OB1 is searched (step S11).

そして、探索された関連情報に基づいて、第２のキャラクタＯＢ２の行動を決定する（ステップＳ１２）。 Then, based on the searched related information, the action of the second character OB2 is determined (step S12).

そして、学習を行う（ステップＳ１３）。例えば、第１のキャラクタＯＢ１の行動と当該行動に関連する関連情報とを学習する。また、第２のキャラクタＯＢ２の行動と当該行動に関連する関連情報とを学習する。そして、ステップＳ１４に進む。 Then, learning is performed (step S13). For example, the behavior of the first character OB1 and related information related to the behavior are learned. Furthermore, the behavior of the second character OB2 and related information related to the behavior are learned. Then, the process advances to step S14.

ゲームが終了したか否かを判断し（ステップＳ１４）、ゲームが終了した場合（ステップＳ１４のＹ）、処理を終了する。一方、ゲームが終了していない場合（ステップＳ１４のＮ）、ステップＳ３に戻る。以上で処理を終了する。 It is determined whether the game has ended (step S14), and if the game has ended (Y in step S14), the process ends. On the other hand, if the game has not ended (N in step S14), the process returns to step S3. This completes the process.

［１８］機械学習の詳細な説明
本実施形態では、機械学習により重みづけを最適化する。端末装置２０は、学習記憶部２７８に記憶されている、行動（行動情報）と、当該行動の関連情報とに基づいて深層学習の処理を行うことによって学習に関する重みを最適化する。端末装置２０は、第１のキャラクタ及び第２のキャラクタによる後続する対戦ゲームに際して、最適化した学習結果の重みを反映させて、入力部２６０による入力に基づくことなく第２のキャラクタＯＢ２の動きを制御する。 [18] Detailed explanation of machine learning In this embodiment, weighting is optimized by machine learning. The terminal device 20 optimizes the weights related to learning by performing deep learning processing based on the behavior (behavior information) and related information of the behavior stored in the learning storage unit 278. In the subsequent competitive game between the first character and the second character, the terminal device 20 reflects the weight of the optimized learning result and controls the movement of the second character OB2 without being based on the input from the input unit 260. Control.

学習部２１８は、各関連情報を、複数層からなるリカレント・ニューラル・ネットワー
ク（再帰型ニューラルネットワーク）を用いて深層学習の計算処理を行って、行動情報を算出し、学習結果の重みを最適化する。 The learning unit 218 performs deep learning calculation processing on each relevant information using a recurrent neural network (recursive neural network) consisting of multiple layers, calculates behavioral information, and optimizes the weight of the learning result. do.

例えば、学習部２１８は、第２のキャラクタＯＢ２の行動情報を移動・動作制御部２１３に送り、移動・動作制御部２１３は、学習結果の重みを反映した行動情報を学習部２１８から受け取り、第２のキャラクタＯＢ２の行動（移動・動作）の制御を行う。 For example, the learning unit 218 sends behavioral information of the second character OB2 to the movement and movement control unit 213, and the movement and movement control unit 213 receives behavioral information reflecting the weight of the learning result from the learning unit 218 and controls the behavior (movement and movement) of the second character OB2.

また、学習記憶部２７８は、行動に関連する関連情報の履歴を記憶する履歴データ記憶部と、深層学習の処理結果として複数の層別の重みの計算結果を記憶する層別の重み記憶部とを含んでもよい。 The learning storage unit 278 also includes a history data storage unit that stores a history of related information related to an action, and a stratified weight storage unit that stores calculation results of a plurality of stratified weights as deep learning processing results. May include.

また、端末装置２０は、収集した関連情報を、学習記憶部２７８に書き込むとともに、履歴データ記憶部に記憶されている各関連情報を第一層の入力として、複数の層においてそれぞれの入力に対する重みを乗算して、層毎の出力を計算し、その出力を次の層の入力として計算する処理を逐次的に行い、最終層の出力として得た行動情報と、実際に行われた行動情報の差分を用いて、層別の重みを最適化して層別の重み記憶部に記憶させてもよい。 In addition, the terminal device 20 writes the collected related information to the learning storage unit 278, and uses each related information stored in the history data storage unit as input to the first layer, and weights each input in the plurality of layers. The output of each layer is calculated by multiplying by , and the output is calculated as the input of the next layer. The weights for each layer may be optimized using the difference and stored in the weight storage section for each layer.

また、学習部２１８は、現在の関連情報と、その時点で最新の重みとを掛け合わせて、キャラクタが行ったであろう行動を導き出すための推論処理部と、現在の関連情報とプレーヤＰ１が実際に操作入力に基づくキャラクタＯＢ１の行動を深層学習の重みに反映するための学習処理部とを含んでもよい。 The learning unit 218 also includes an inference processing unit for deriving an action that the character would have taken by multiplying the current related information by the latest weight at that time, and a It may also include a learning processing unit for reflecting the behavior of the character OB1 based on the actual operation input in the weight of deep learning.

推論処理部は、収集した関連情報を、履歴データ記憶部に書き込むとともに、履歴データ記憶部に記憶されている各関連情報を第一層の入力として、それぞれの層において重みを乗算して、層別の出力を算出し、その出力を次の層の入力として逐次的に計算することで、最終層の出力として行動情報を得るものである。 The inference processing unit writes the collected related information to the historical data storage unit, uses each related information stored in the historical data storage unit as input to the first layer, multiplies the weights in each layer, and creates a layer. By calculating another output and sequentially calculating that output as input to the next layer, behavioral information is obtained as the output of the final layer.

学習処理部は、推論処理部によって算出された行動情報と、現在の関連情報の時点に対応するプレーヤの操作入力に基づいて決定される行動情報とに基づいて、深層学習の重みを最適化する。 The learning processing unit optimizes deep learning weights based on the behavior information calculated by the inference processing unit and the behavior information determined based on the player's operation input corresponding to the time of the current related information. .

学習部２１８は、深層学習の処理として、少なくとも１つの全対全結合層による計算処理と、複数のゲーテッド・リカレント・ユニット層による計算処理を実行するものである。全対全結合層による計算処理では、重み行列を入力のベクトルと乗算する処理を実行する。各ゲーテッド・リカレント・ユニット層による計算処理では、複数のゲーテッド・リカレント・ユニット層のそれぞれにおいて、重みを使って忘れる量を計算し、重みを使って反映量を計算し、出力候補を計算する処理を順次行ってもよい。 The learning unit 218 executes, as deep learning processing, calculation processing using at least one all-to-fully connected layer and calculation processing using a plurality of gated recurrent unit layers. In the calculation process using the all-to-all connected layer, a process is performed in which a weight matrix is multiplied by an input vector. In the calculation process by each gated recurrent unit layer, in each of the multiple gated recurrent unit layers, the amount to be forgotten is calculated using the weight, the reflected amount is calculated using the weight, and the output candidate is calculated. may be performed sequentially.

学習部２１８は、各ゲーテッド・リカレント・ユニット層による計算処理として、出力候補の計算後に、選択的にドロップアウト処理をして出力してもよい。 As a calculation process performed by each gated recurrent unit layer, the learning unit 218 may selectively perform dropout processing and output after calculating the output candidates.

端末装置２０は、ゲーム情報記憶部２７４に、プレーヤによる入力部２６０の操作に応じた操作データを、先入れ先出し態様で記憶するようにしてもよい。 The terminal device 20 may store operation data corresponding to the player's operation of the input unit 260 in the game information storage unit 274 in a first-in, first-out manner.

端末装置２０は、学習記憶部２７８は、学習に関する層別の重みを記憶してもよい。 In the terminal device 20, the learning storage unit 278 may store stratified weights related to learning.

学習記憶部２７８（重み記憶部）は、深層学習後の各層の重みを記憶する複数層分の重み記憶部を含んでもよい。 The learning storage unit 278 (weight storage unit) may include weight storage units for multiple layers that store the weights of each layer after deep learning.

学習部２１８は、ゲームプレイ中の行動（行動情報）と当該行動の関連情報とを先入れ先出し態様でゲーム情報記憶部２７４に記憶させるとともに、深層学習後の各層の重みデータを対応する層別の重みデータ記憶部に記憶させるようにしてもよい。 The learning unit 218 stores actions (action information) during game play and information related to the actions in the game information storage unit 274 in a first-in, first-out manner, and also stores the weight data of each layer after deep learning as the weight of the corresponding layer. It may also be stored in a data storage unit.

次に、関連情報を入力した場合、出力結果として、第２のキャラクタＯＢ２の行動情報を得るための学習モデルの生成について説明する。 Next, generation of a learning model for obtaining behavior information of the second character OB2 as an output result when related information is input will be described.

本実施形態では、複数層（例えば、６層）のニューラルネットワークで構成される。ニューラルネットワークは、例えば、リカレントニューラルネットワーク（再帰型ニューラルネットワーク）で構成する。 In this embodiment, the neural network is configured with multiple layers (e.g., six layers). The neural network is configured, for example, as a recurrent neural network.

推論処理において、記憶部２７０の関連情報が記憶されると、関連情報の各項目をニューラルネットワークの第１層へ並列的に入力させる。入力に対するそれぞれの重みを乗算して出力を求める。 In the inference process, when the related information in the storage unit 270 is stored, each item of the related information is input in parallel to the first layer of the neural network. Find the output by multiplying the respective weights for the inputs.

続いて、ニューラルネットワークの各層では、前段の層の出力が次段の層で重み計算されて、次々段の層の入力となる。このようにして、出力層においての出力結果が、推論した第２のキャラクタＯＢ２の行動情報となる。推論した第２のキャラクタＯＢ２の行動情報は、学習処理の入力とする。なお、推論と学習を同時に行う必要はなく、個別に行ってもよい。 Next, in each layer of the neural network, the output of the previous layer is weighted and calculated in the next layer, and becomes the input to the next layer. In this way, the output result in the output layer becomes the inferred behavior information of the second character OB2. The inferred behavior information of the second character OB2 is used as input for the learning process. Note that inference and learning do not need to be performed at the same time, and may be performed separately.

続いて、学習について説明する。学習処理は、まず、推論した行動情報と、正解となる行動情報との差分の誤差を求める。層別に最適化された重みデータに更新的に書き込まれる。層別に最適化された重みが層別に更新的に記憶される。 Next, learning will be explained. In the learning process, first, the error of the difference between the inferred behavior information and the correct behavior information is determined. The weight data optimized for each layer is written in an updated manner. The weights optimized for each layer are updated and stored for each layer.

次に、各層の出力を計算する処理（推論処理）の一例について説明する。推論処理は、例えば、第１層の入力層と、隠れ層（第２層～第５層）と、第６層の出力層で構成される。そして、各層のノードの計算において、入力値と重みの積和を算出することによって出力を得る。なお、ノードの計算には、各層の入力値と重みの積和にバイアスを足してもよい。 Next, an example of processing (inference processing) for calculating the output of each layer will be described. The inference process includes, for example, a first input layer, hidden layers (second to fifth layers), and a sixth output layer. Then, in the calculation of the nodes of each layer, the output is obtained by calculating the sum of products of the input value and the weight. Note that for node calculation, a bias may be added to the sum of products of input values and weights of each layer.

例えば、入力層は、関連情報の項目数であり、出力が第２のキャラクタＯＢ２の行動情報の要素数である。 For example, the input layer is the number of items of related information, and the output is the number of elements of the behavior information of the second character OB2.

出力層（最終層）の出力は、関連情報に対応した推定される第２のキャラクタＯＢ２の行動情報となる。つまり、出力層には、行動情報の項目数（例えば、行動の種類が、６４種類あれば６４）のノードを用意する。例えば、各ノードの値が得られると、最も数値の高い項目が、例えば、「足技カウンター」の項目であれば、第２のキャラクタＯＢ２の行動情報を足技カウンターに決定する。 The output of the output layer (final layer) is the action information of the second character OB2 that is estimated to correspond to the related information. In other words, the output layer is provided with nodes equal to the number of action information items (for example, if there are 64 types of actions, then 64). For example, when the values of each node are obtained, if the item with the highest value is, for example, the "footwork counter" item, the action information of the second character OB2 is determined to be footwork counter.

次に、各層の重みデータを使って出力を計算する処理（推論処理）を説明する。本実施形態では、推論処理において重みを調整する。重みの修正が必要な場合は出力側に一番近い層から行う必要がある。重みは、例えば、６４×６４の行列である。複数種類の重みを用意してもよい。これらの重みは、初期状態において、プレーヤの操作入力或いは予め決められた情報に基づき、予め設定されてもよいが、学習処理を重ねるに従って徐々に適正化され、最終的には最適化される。 Next, a process (inference process) for calculating an output using weight data of each layer will be explained. In this embodiment, weights are adjusted in inference processing. If it is necessary to modify the weights, it must be done from the layer closest to the output side. The weights are, for example, a 64x64 matrix. You may prepare multiple types of weights. These weights may be set in advance based on the player's operational input or predetermined information in the initial state, but as the learning process is repeated, they are gradually optimized and finally optimized.

本実施形態では、１ラウンドの対戦が終了する度に、各ラウンドの６層分の重みデータがニューラルネットワークの各重みとして、学習記憶部２７８に累積的に記憶されることになる。 In this embodiment, each time one round of competition ends, weight data for six layers of each round is cumulatively stored in the learning storage unit 278 as each weight of the neural network.

重みデータでは、次回ゲーム時に再び読みだして学習を行うか、学習した行動情報を再現するのに使用してもよい。重みデータは、ネットワークを介して複数の端末装置２０間で送受信されもよい。 The weight data may be read out again the next time the game is played for further learning, or may be used to reproduce learned behavioral information. The weight data may be transmitted and received between multiple terminal devices 20 via a network.

本実施形態によれば、機械学習を用いて、プレーヤＰ１のキャラクタＯＢ１の操作を再現可能な、最適な対戦相手のキャラクタＯＢ２を創出することができる。また、事前に学習させた結果を使用することによって、より人間らしい行動情報をコンピュータ相手の対戦でも実現することができる。 According to the present embodiment, using machine learning, it is possible to create an optimal opponent character OB2 that can reproduce the operations of the player P1's character OB1. Furthermore, by using the results of pre-learning, more human-like behavioral information can be realized even when playing against a computer opponent.

［１９］その他の補足説明
［１９．１］サーバ装置
本実施形態では、機械学習によるゲームを提供する場合において、端末装置２０が、学習機能及び推論機能を有する例について説明したが、サーバ装置１０が、上述した端末装置２０が行う学習機能及び推論機能を、有していてもよい。 [19] Other supplementary explanations [19.1] Server device In the present embodiment, an example has been described in which the terminal device 20 has a learning function and an inference function when providing a game based on machine learning. However, it may have the learning function and inference function performed by the terminal device 20 described above.

［１９．２］学習データ
本実施形態では、プレーヤＰ１がゲームプレイしながら学習し、第２のキャラクタＯＢ２の行動を予測するゲームについて説明したが、予め、プレーヤＰ１がコンピュータと対戦する期間（プレーヤＰ１の操作対象の第１のキャラクタＯＢ１とコンピュータキャラクタである第２のキャラクタＯＢ２とが対戦する期間）を設け、当該期間に学習データ（第１のキャラクタＯＢ１の行動の情報及び当該行動の関連情報）を収集してもよい。また、収集した学習データに基づいて、学習済みの学習モデルを生成するようにしてもよい。 [19.2] Learning Data In this embodiment, a game has been described in which the player P1 learns while playing the game and predicts the actions of the second character OB2. A period in which the first character OB1 to be operated by P1 and the second character OB2, which is a computer character, compete is provided, and learning data (information on the behavior of the first character OB1 and information related to the behavior) is provided during the period. ) may be collected. Furthermore, a trained learning model may be generated based on the collected learning data.

本実施形態では、予め、プレーヤＰ１が、実在するプレーヤＰ２と対戦する期間（プレーヤＰ１の操作対象の第１のキャラクタＯＢ１とプレーヤＰ１の操作対象である第２のキャラクタＯＢ２とが対戦する期間）を設け、当該期間に学習データ（第１のキャラクタＯＢ１行動の情報及び当該行動の関連情報及び／又は第２のキャラクタＯＢ２行動の情報及び当該行動の関連情報）を収集してもよい。また、収集した学習データに基づいて、学習済みの学習モデルを生成するようにしてもよい。 In this embodiment, a period during which player P1 plays against real player P2 (a period during which a first character OB1 operated by player P1 plays against a second character OB2 operated by player P1) may be set in advance, and learning data (information on the actions of first character OB1 and related information about said actions and/or information on the actions of second character OB2 and related information about said actions) may be collected during that period. In addition, a learned learning model may be generated based on the collected learning data.

［１９．３］重みづけを行う期間及びレベルの補足の説明
本実施形態では、プレーヤの情報又は第１のキャラクタの情報に基づいて、重みづけを行う期間を変化させてもよい。第１のキャラクタの情報は、第１のキャラクタの学習データ（第１のキャラクタの行動の情報及び当該行動の関連情報）を含む概念でもよい。なお、「重みづけを行う期間」とは、関連情報を構成する複数の項目のうち、所定の項目（例えば、プレーヤの操作入力に基づき、選択された項目等）について、重みを高くする期間である。 [19.3] Supplementary explanation of weighting period and level In this embodiment, the weighting period may be changed based on the player's information or the first character's information. The first character information may be a concept including learning data of the first character (information on the first character's behavior and information related to the behavior). Note that the "weighting period" is a period in which a given item (for example, an item selected based on the player's operation input) is given a higher weight among the multiple items that make up the related information. be.

例えば、端末装置２０は、プレーヤの情報に基づき、プレーヤが初心者、中級者、上級者のいずれに該当するかを判定する。プレーヤのレベルが、例えば、１０未満の場合、初心者と判定する。プレーヤのレベルが１０以上であって８０未満の場合、中級者と判定する。プレーヤのレベルが８０以上の場合、上級者（ベテラン）と判定する。 For example, the terminal device 20 determines whether the player is a beginner, intermediate, or advanced based on the player's information. If the player's level is, for example, less than 10, the player is determined to be a beginner. If the player's level is 10 or higher but lower than 80, the player is determined to be an intermediate player. If the player's level is 80 or higher, the player is determined to be an advanced player (veteran).

そして、端末装置２０は、プレーヤのレベルに応じて、重みづけを行う期間を決定する。例えば、初心者のプレーヤの場合、ゲーム期間において、ゲーム開始時点ＴＳから３０秒の期間に、重みづけを行う期間として決定する。中級者のプレーヤの場合、ゲーム期間において、ゲーム開始時点ＴＳから１０秒の期間と、ゲーム開始時点ＴＳから３０秒を経過した時点から４０秒を経過した時点までの１０秒間とに、重みづけを行う期間として決定する。上級者のプレーヤの場合、ゲーム期間において、ゲーム開始時点ＴＳから２０秒
の期間と、ゲーム終了時点ＴＥを含む最後の５秒間とに、重みづけを行う期間として決定する。このようにすれば、プレーヤのレベルに応じて、多様な期間で、学習の重みづけを行うことができる。 Then, the terminal device 20 determines a period for weighting according to the level of the player. For example, in the case of a novice player, a period of 30 seconds from the game start time TS during the game period is determined as the period for weighting. For intermediate players, weighting is applied to the 10 second period from the game start time TS and the 10 second period from 30 seconds after the game start time TS to 40 seconds after the game start point TS. Decide on the period in which it will be carried out. In the case of an expert player, a period of 20 seconds from the game start time TS and the last 5 seconds including the game end time TE are determined as weighting periods during the game period. In this way, learning can be weighted in various periods depending on the level of the player.

なお、プレーヤのレベルと同様に、プレーヤが操作する対象の第１のキャラクタＯＢ１のレベルに基づいて、初心者、中級車、上級者のいずれかに該当するかを判定してもよい。そして、第１のキャラクタＯＢ１の情報（例えば、レベル）に基づいて、重みづけを行う期間を変化させてもよい。 Note that, similarly to the level of the player, it may be determined whether the first character OB1 to be operated by the player falls under the category of beginner, intermediate, or advanced based on the level of the first character OB1 to be operated by the player. Then, the period during which weighting is performed may be changed based on the information (for example, level) of the first character OB1.

［１９．４］第２のキャラクタの行動制御の補足説明
本実施形態では、予め第１のキャラクタの行動の種類毎に、当該行動に対応する第２のキャラクタの行動を定義してもよい。 [19.4] Supplementary explanation of second character's behavior control In this embodiment, for each type of behavior of the first character, the behavior of the second character corresponding to the behavior may be defined in advance.

例えば、第１のキャラクタＯＢ１の行動の「ジャンプキック」に対応する第２のキャラクタの行動として、「ガード」を定義する。また、第１のキャラクタＯＢ１の行動の「投げ技」に対応する第２のキャラクタの行動として、「投げ抜け」を定義する。また、第１のキャラクタＯＢ１の行動の「キック」に対応する第２のキャラクタの行動として、「しゃがむ」を定義する。このように、端末装置２０は、予め、第１のキャラクタＯＢ１の行動（行動ＩＤ）それぞれに対応付けて、第２のキャラクタＯＢ２の行動を定義する。 For example, "guard" is defined as the action of the second character that corresponds to the action "jump kick" of the first character OB1. Furthermore, "throw through" is defined as the second character's action corresponding to the "throwing move" action of the first character OB1. Further, "crouch" is defined as the second character's action corresponding to the "kick" action of the first character OB1. In this way, the terminal device 20 defines in advance the actions of the second character OB2 in association with each action (action ID) of the first character OB1.

端末装置２０は、予め第１のキャラクタの行動の種類毎に、当該行動に対応する第２のキャラクタの行動を定義したデータ（以下、「固定データ」とも言う）、及び、学習データの少なくとも一方を用いて、第２のキャラクタの行動を制御してもよい。 The terminal device 20 stores at least one of data (hereinafter also referred to as "fixed data") that defines the behavior of a second character corresponding to the behavior of the first character for each type of behavior of the first character, and learning data. may be used to control the actions of the second character.

例えば、端末装置２０は、学習データが不足し、第１のキャラクタＯＢ１の行動や当該行動の関連情報の学習が不十分である場合は、固定データ及び学習データの少なくとも一方を用いて第２のキャラクタの行動を制御する。かかる場合、固定データの使用か、学習データの使用かについては、所与の抽選処理に基づいて選択してもよい。例えば、学習データが不足している場合（例えば、学習データセットが１００件以下の場合）は、固定データの当選確率を８０％にし、固定データが抽選で選択される可能性が高くなるようにする。このようにすれば、第２のキャラクタに対する不自然な行動を極力減らすことができる。なお、端末装置２０は、学習データの数が増えるにつれて、当選確率を減少させるようにしてもよい。 For example, if the learning data is insufficient and learning of the behavior of the first character OB1 and information related to the behavior is insufficient, the terminal device 20 uses at least one of the fixed data and the learning data to learn the second character OB1's behavior and information related to the behavior. Control your character's actions. In such a case, whether to use fixed data or learning data may be selected based on a given lottery process. For example, if the training data is insufficient (for example, the training dataset is 100 or less), the winning probability of the fixed data is set to 80%, so that the fixed data has a higher probability of being selected by lottery. do. In this way, unnatural behavior toward the second character can be reduced as much as possible. Note that the terminal device 20 may reduce the probability of winning as the number of learning data increases.

［１９．５］一致度の補足説明
端末装置２０は、未学習の第１のキャラクタＯＢ１の行動に関連する関連情報と、探索された関連情報とを対比し、未学習の第１のキャラクタＯＢ１の行動に関連する関連情報と、探索された関連情報との一致度Ｋを百分率（パーセンテージ）で数値化する。 [19.5] Supplementary explanation on the degree of similarity The terminal device 20 compares the related information related to the behavior of the unlearned first character OB1 with the searched related information, and quantifies the degree of similarity K between the related information related to the behavior of the unlearned first character OB1 and the searched related information as a percentage.

所与のアルゴリズムや数式に基づき、一致度Ｋを算出する。また、検出された未学習の第１のキャラクタの行動（例えば、「ジャンプキック」）に関連する関連情報と一致度が高い学習済みの関連情報を探索する場合は、その探索過程、又は、探索後に、探索された関連情報の一致度Ｋを算出する。 The degree of matching K is calculated based on a given algorithm or formula. In addition, when searching for learned related information that has a high degree of match with the related information related to the detected unlearned first character's action (for example, "jump kick"), the search process or the search Afterwards, the matching degree K of the searched related information is calculated.

端末装置２０は、各項目の重みに基づいて、探索された関連情報の一致度Ｋを、算出してもよい。 The terminal device 20 may calculate the matching degree K of the searched related information based on the weight of each item.

また、端末装置２０は、探索された関連情報を構成する各項目それぞれの一致度Ｌを算出する。なお、項目の一致度Ｌを、項目一致度と言い換えてもよい。 The terminal device 20 also calculates the degree of match L for each item that constitutes the searched related information. The degree of match L for an item may also be referred to as the degree of item match.

例えば、関連情報の１項目である「体力値」の一致度Ｌを算出する例について説明する。第１のキャラクタＯＢ１のジャンプキック時の第１のキャラクタＯＢ１の体力値が「１００」である場合において、探索された学習済みの「体力値」が「１００」である場合、「体力値」の一致度Ｌが１００％であると判定する。探索された学習済みの「体力値」が「８０」である場合、一致度Ｌが８０％であると判定する。探索された学習済みの「体力値」が「３０」である場合、「体力値」の一致度Ｌが３０％であると判定する。 For example, an example will be described in which the degree of coincidence L of "physical strength value", which is one item of related information, is calculated. If the physical strength value of the first character OB1 at the time of the jump kick of the first character OB1 is "100", and the learned "physical strength value" that has been searched is "100", the "physical strength value" It is determined that the degree of coincidence L is 100%. If the learned "physical strength value" found is "80", it is determined that the degree of matching L is 80%. If the learned "physical strength value" that has been searched for is "30", it is determined that the matching degree L of the "physical strength value" is 30%.

端末装置２０は、探索された関連情報の各項目の一致度Ｌに基づいて、関連情報の一度Ｋを求める。例えば、探索された関連情報の各項目の一致度Ｌの平均値を、関連情報の一度Ｋとして算出してもよいし、全ての項目数（例えば、１００項目ある場合は「１００」）に対する、一致度Ｌが８０％以上の項目の数（例えば、一致度が８０％以上の項目が２０あれば「２０」）の割合（例えば、２０％）を、関連情報の一度度Ｋとしてもよい。 The terminal device 20 obtains K of the related information based on the matching degree L of each item of the searched related information. For example, the average value of the matching degree L of each item of the related information searched may be calculated as once K of the related information, or for all the number of items (for example, "100" if there are 100 items), The ratio (for example, 20%) of the number of items with a matching degree L of 80% or more (for example, if there are 20 items with a matching degree of 80% or more, "20") may be set as the degree K of related information.

［１９．６］探索手法の補足説明
端末装置２０は、重みづけが行われた項目に着目して探索をする場合は、項目の重みづけの値に基づいて、探索する。例えば、重みづけの値が高い項目から順に、項目単位での一致度Ｌの高い関連情報を探索するようにしてもよい。 [19.6] Supplementary explanation of the search method When the terminal device 20 performs a search focusing on weighted items, the terminal device 20 performs the search based on the weighting value of the item. For example, the terminal device 20 may search for related information with a high degree of match L on an item-by-item basis, starting from the item with the highest weighting value.

具体的に説明すると、関連情報の複数の項目のうち、「体力値」の重みが「３」であり、「残りゲーム時間」の重みが「２」であり、その他の項目が「１」である場合、重みの値が高い「体力値」の項目について、最も一致度Ｌの高い関連情報を探索する。「体力値」について、最も一致度の高い関連情報が複数探索された場合（例えば、体力値の一致度Ｌが１００％である関連情報が複数探索された場合）は、その中で、「残りゲーム時間」の項目について、最も一致度Ｌの高い関連情報を探索する。 Specifically, among multiple items of related information, the weight of "physical strength" is "3", the weight of "remaining game time" is "2", and the weight of other items is "1". If there is, related information with the highest matching degree L is searched for the item "physical strength value" which has a high weight value. When multiple pieces of related information with the highest matching degree are searched for "physical strength value" (for example, when multiple pieces of related information are searched for where the matching degree L of physical strength value is 100%), among them, "remaining Regarding the item "game time", related information with the highest matching degree L is searched.

「体力値」及び「残りゲーム時間」の各項目について、最も一致度の高い関連情報が複数探索された場合（例えば、「体力値」及び「残りゲーム時間」いずれも一致度が１００％である関連情報が複数探索された場合）は、他の項目の一致度Ｌが最も高い関連情報を探索する。 When multiple pieces of related information with the highest matching degree are searched for each item of "physical strength value" and "remaining game time" (for example, if the matching degree is 100% for both "physical strength value" and "remaining game time") If a plurality of pieces of related information are searched), the related information with the highest matching degree L for other items is searched.

［１９．７］一致度に基づく第２のキャラクタの行動制御
本実施形態では、探索された関連情報の一致度Ｋが所定値以上でない場合に、検出された未学習の当該第１のキャラクタの行動に対応する予め定義された行動を、第２のキャラクタに行わせるようにしてもよい。このようにすれば、第２のキャラクタに対する不自然な行動を極力減らすことができる。 [19.7] Behavior control of second character based on matching degree In this embodiment, when the matching degree K of the searched related information is not equal to or higher than a predetermined value, the detected unlearned first character The second character may be made to perform a predefined action corresponding to the action. In this way, unnatural behavior toward the second character can be reduced as much as possible.

例えば、未学習の当該第１のキャラクタの行動として「ジャンプキック」及び「ジャンプキック」の関連情報が検出されたとする。端末装置２０は、「ジャンプキック」の関連情報の一致度Ｋが、所定値以上（例えば、７０％以上）でない場合に、「ジャンプキック」に対応する予め定義された行動（例えば、「ガード」）を、第２のキャラクタに行わせる。 For example, assume that related information of "jump kick" and "jump kick" is detected as the unlearned actions of the first character. The terminal device 20 executes a predefined action corresponding to the "jump kick" (for example, "guard") when the matching degree K of the related information of "jump kick" is not equal to or higher than a predetermined value (for example, 70% or higher). ) to the second character.

また、本実施形態では、探索された関連情報の一致度Ｋが所定値以上でない場合に、検出された未学習の当該第１のキャラクタの行動に対応する複数の予め定義された行動の中から、１つの行動を所定の条件で選択し、選択された行動を第２のキャラクタに行わせるようにしてもよい。 In addition, in this embodiment, when the matching degree K of the searched related information is not equal to or higher than a predetermined value, a plurality of predefined actions corresponding to the detected action of the unlearned first character are selected. , one action may be selected under predetermined conditions, and the second character may be made to perform the selected action.

例えば、第１のキャラクタＯＢ１の「ジャンプキック」に対応する予め定義された行動が、「ガード」、「後ずさり防御」、「しゃがみ」の３つの行動が定義されていたとする。端末装置２０は、「ガード」、「後ずさり防御」、「しゃがみ」の３つの行動の中から
、１つの行動を所定の条件で選択し、選択された行動を第２のキャラクタに行わせる。所定の条件とは、抽選処理で選択することや、ゲーム状況（ゲーム情報）が所定の状況である場合、等である。 For example, it is assumed that three predefined actions corresponding to the "jump kick" of the first character OB1 are "guard", "backward defense", and "crouch". The terminal device 20 selects one action from among the three actions of "guard", "backward defense", and "crouch" under predetermined conditions, and causes the second character to perform the selected action. The predetermined conditions include selection in a lottery process, a case where the game situation (game information) is a predetermined situation, and the like.

例えば、端末装置２０は、「ガード」、「後ずさり防御」、「しゃがみ」の３つの行動の中から、抽選処理によって「ガード」が当選されると、「ガード」を選択し、第２のキャラクタに「ガード」を行わせる。 For example, if "guard" is selected in the lottery process from among the three actions "guard", "backward defense", and "crouch", the terminal device 20 selects "guard" and selects the second character. have them perform a “guard”.

また、端末装置２０は、ゲーム状況が所定の状況である場合、抽選処理ではなく、「ガード」を必ず選択するようにする、等の制御を行う。所定の状況とは、例えば、残りゲーム時間が残り１０秒以内である場合、第２のキャラクタの体力値が１０以下である場合、等である。 Furthermore, when the game situation is a predetermined situation, the terminal device 20 performs control such as ensuring that "guard" is selected instead of the lottery process. The predetermined situation is, for example, when the remaining game time is within 10 seconds, when the physical strength value of the second character is 10 or less, and the like.

また、本実施形態では、連続行動（コンボ）についても同様の処理を行う。つまり、端末装置２０は、未学習の第１のキャラクタの連続行動が検出された場合、学習済みの関連情報の中から探索された関連情報の一致度Ｋが所定値以上でない場合に、検出された未学習の当該第１のキャラクタの連続行動に対応する予め定義された行動を、第２のキャラクタに行わせる。 Furthermore, in this embodiment, similar processing is performed for continuous actions (combos). That is, when the continuous action of the unlearned first character is detected, the terminal device 20 detects the unlearned first character when the matching degree K of the related information searched from the learned related information is not higher than the predetermined value. The second character is caused to perform a predefined action corresponding to the unlearned continuous action of the first character.

例えば、第１のキャラクタＯＢ１が連続してキックを２回行う連続行動を行った場合において、「下段ガード」、「キック」、「離れる移動」のいずれか１つの行動を抽選で選択する。そして、選択された行動を、第２のキャラクタＯＢ２に行わせる。 For example, if the first character OB1 performs a consecutive action of kicking twice in succession, one of the actions "low guard," "kick," or "moving away" is selected by lottery. The second character OB2 is then made to perform the selected action.

また、第１のキャラクタＯＢ１が連続してキックを２回行う連続行動を行った場合において、第２のキャラクタＯＢ２の体力値が第１の所定値未満（例えば、上限値の５０％未満）であって、第２の所定値以上（例えば、上限値の３０％以上）のとき、「下段ガード」を、第２のキャラクタＯＢ２に行わせ、第２のキャラクタＯＢ２の体力値が第２の所定値未満（例えば、上限値の３０％未満）である場合は、「離れる移動」を、第２のキャラクタＯＢ２に行わせるように、制御してもよい。 Further, when the first character OB1 performs a continuous action of kicking twice in a row, the physical strength value of the second character OB2 is less than the first predetermined value (for example, less than 50% of the upper limit). When the physical strength value is equal to or higher than the second predetermined value (for example, 30% or more of the upper limit value), the second character OB2 performs "lower guard", and the physical strength value of the second character OB2 is set to the second predetermined value. If the value is less than the upper limit value (for example, less than 30% of the upper limit value), the second character OB2 may be controlled to "move away".

［２０］応用例
本実施形態では、対戦ゲームだけでなく、レースゲーム、シューティングゲーム、音楽ゲーム、ＲＰＧ（ロールプレイングゲーム）、アクションゲーム、スポーツゲーム、育成シミュレーションゲームなどに応用可能である。 [20] Application Example This embodiment is applicable not only to competitive games, but also to racing games, shooting games, music games, RPGs (role playing games), action games, sports games, training simulation games, and the like.

また、３次元の仮想空間でユーザのキャラクタ（アバター）を配置して、仮想空間を楽しむサービスにも応用可能である。 It is also applicable to a service where a user's character (avatar) is placed in a three-dimensional virtual space to enjoy the virtual space.

［２１］その他
本発明は、上記実施形態で説明したものに限らず、種々の変形実施が可能である。例えば、明細書又は図面中の記載において広義や同義な用語として引用された用語は、明細書又は図面中の他の記載においても広義や同義な用語に置き換えることができる。 [21] Others The present invention is not limited to the above-described embodiment, and various modifications are possible. For example, a term cited in the description of the specification or drawings as a term with a broad meaning or synonymous meaning can be replaced with a term with a broad meaning or synonymous meaning in other descriptions of the specification or drawings.

本発明は、実施形態で説明した構成と実質的に同一の構成（例えば、機能、方法及び結果が同一の構成、あるいは目的及び効果が同一の構成）を含む。また、本発明は、実施形態で説明した構成の本質的でない部分を置き換えた構成を含む。また、本発明は、実施形態で説明した構成と同一の作用効果を奏する構成又は同一の目的を達成することができる構成を含む。また、本発明は、実施形態で説明した構成に公知技術を付加した構成を含む。 The present invention includes configurations that are substantially the same as the configurations described in the embodiments (for example, configurations with the same functions, methods, and results, or configurations with the same purpose and effect). The present invention also includes configurations that replace non-essential parts of the configurations described in the embodiments. The present invention also includes configurations that achieve the same effects as the configurations described in the embodiments, or that can achieve the same purpose. The present invention also includes configurations in which publicly known technology is added to the configurations described in the embodiments.

上記のように、本発明の実施形態について詳細に説明したが、本発明の新規事項及び効果から実体的に逸脱しない多くの変形が可能であることは当業者には容易に理解できるであろう。したがって、このような変形例はすべて本発明の範囲に含まれるものとする。 As mentioned above, the embodiments of the present invention have been described in detail, but those skilled in the art will easily understand that many modifications can be made without substantially departing from the novelty and effects of the present invention. . Therefore, all such modifications are included within the scope of the present invention.

１０サーバ装置、２０、２０Ａ、２０Ｂ、２０Ｃ端末装置、
１００処理部、１１１ゲーム処理部、１１２表示制御部、
１１９受け付け部、１２０通信制御部、１２１Ｗｅｂ処理部、
１２２管理部、１２３通知部、
１６０入力部、１７０記憶部、１７１主記憶部、
１７４ゲーム情報記憶部、１７６プレーヤ情報記憶部、
１８０情報記憶媒体、１９６通信部、
２００処理部、２１１通信制御部、２１２オブジェクト空間設定部、
２１３移動・動作制御部、２１４操作入力受付部、２１５ゲーム制御部、
２１６仮想カメラ制御部、２１７行動制御部、２１８学習部、２１９設定部、
２２０判断部、２２１探索部、２２２表示制御部、
２２３変化部、２３０画像生成部、
２４０音処理部、２６０入力部、２６２検出部、２７０記憶部、
２７１主記憶部、２７２画像バッファ、２７４ゲーム情報記憶部、
２７６プレーヤ情報記憶部、２７８学習記憶部、２７９学習モデル記憶部、
２８０情報記憶媒体、２９０表示部、
２９２音出力部、２９６通信部 10 server device, 20, 20A, 20B, 20C terminal device,
100 processing unit, 111 game processing unit, 112 display control unit,
119 reception unit, 120 communication control unit, 121 web processing unit,
122 Management Department, 123 Notification Department,
160 input section, 170 storage section, 171 main storage section,
174 Game information storage unit, 176 Player information storage unit,
180 Information storage medium, 196 Communication Department,
200 processing unit, 211 communication control unit, 212 object space setting unit,
213 movement/motion control unit, 214 operation input reception unit, 215 game control unit,
216 virtual camera control unit, 217 behavior control unit, 218 learning unit, 219 setting unit,
220 judgment unit, 221 search unit, 222 display control unit,
223 changing unit, 230 image generating unit,
240 sound processing unit, 260 input unit, 262 detection unit, 270 storage unit,
271 main storage unit, 272 image buffer, 274 game information storage unit,
276 player information storage unit, 278 learning storage unit, 279 learning model storage unit,
280 information storage medium, 290 display section,
292 Sound output section, 296 Communication section

Claims

A game system for controlling the actions of a second character in a competitive game in which a first character whose actions are controlled based on a player's operation input competes with a second character controlled by a computer, the game system comprising:
a learning section that learns the actions of the first character and related information related to the actions;
a behavior control unit that controls the behavior of the second character using the learning data learned in the learning unit;
A setting for setting at least one of a combination of a plurality of items constituting the related information, a learning frequency, and a weight of each of the plurality of items constituting the related information, based on the information of the player, before learning. A game system comprising:

A game system for controlling the actions of a second character in a competitive game in which a first character whose actions are controlled based on a player's operation input competes with a second character controlled by a computer, the game system comprising:
a learning section that learns the actions of the first character and related information related to the actions;
a behavior control unit that controls the behavior of the second character using the learning data learned in the learning unit;
Based on the player's information, at least one of a combination of a plurality of items constituting the related information, a learning frequency, and a weight of each of the plurality of items constituting the related information is determined according to the game progress. A game system comprising: a setting section for making settings.

In claim 1 or 2,
The setting section includes:
A game system characterized in that, based on an operation input by the player, selection of an item having a higher weight among a plurality of items forming the related information is accepted.

In claim 1 or 2,
The setting unit is
A game system characterized in that a priority order is set for each of a plurality of items of said related information based on operational input by said player.

In claim 1 or 2,
A game system, wherein the player information includes information regarding the player's play history.

In claim 1 or 2,
The game system is characterized in that the player's information includes information regarding the player's past play tendencies.

In claim 1 or 2,
A game system characterized in that the player's information includes information related to other players determined based on predetermined information of the player.

A program that causes a computer to control the actions of a second character in a competitive game in which a first character whose actions are controlled based on a player's operation input competes with a second character controlled by a computer. hand,
a learning section that learns the actions of the first character and related information related to the actions;
a behavior control unit that controls the behavior of the second character using the learning data learned in the learning unit;
A setting for setting at least one of a combination of a plurality of items constituting the related information, a learning frequency, and a weight of each of the plurality of items constituting the related information, based on the information of the player, before learning. A program that, as a part, causes a computer to function.

In a battle game in which a first character whose behavior is controlled based on a player's operation input competes against a second character controlled by a computer, a program for causing a computer to control the behavior of the second character, comprising:
a learning unit that learns an action of a first character and related information related to the action;
a behavior control unit that controls a behavior of a second character using the learning data learned by the learning unit;
A program that causes a computer to function as a setting unit that sets at least one of the combination of multiple items that make up the related information, the learning frequency, and the weighting of each of the multiple items that make up the related information, based on the player's information and in accordance with the game progress.

A learning device for controlling the behavior of a second character in a competitive game in which a first character whose behavior is controlled based on a player's operation input competes with a second character controlled by a computer, the learning device comprising:
a learning section that learns the actions of the first character and related information related to the actions;
a behavior control unit that controls the behavior of the second character using the learning data learned in the learning unit;
A setting for setting at least one of a combination of a plurality of items constituting the related information, a learning frequency, and a weight of each of the plurality of items constituting the related information, based on the information of the player, before learning. A learning device comprising:

A learning device for controlling the behavior of a second character in a competitive game in which a first character whose behavior is controlled based on a player's operation input competes with a second character controlled by a computer, the learning device comprising:
a learning section that learns the actions of the first character and related information related to the actions;
a behavior control unit that controls the behavior of the second character using the learning data learned in the learning unit;
Based on the player's information, at least one of a combination of a plurality of items constituting the related information, a learning frequency, and a weight of each of the plurality of items constituting the related information is determined according to the game progress. A learning device comprising: a setting section for setting.

In a competitive game in which a first character whose actions are controlled based on the player's operation input competes with a second character whose actions are controlled by a computer, the player's terminal is sent from a server device that controls the actions of the second character. A game providing method for providing information regarding a game to a device, the method comprising:
transmitting information regarding the player's operation input accepted by the terminal device to the server device;
transmitting information regarding the game from the server device to the terminal device;
The server device includes:
a learning section that learns the actions of the first character and related information related to the actions;
a behavior control unit that controls the behavior of the second character using the learning data learned in the learning unit;
A setting for setting at least one of a combination of a plurality of items constituting the related information, a learning frequency, and a weight of each of the plurality of items constituting the related information, based on the information of the player, before learning. A game providing method characterized by comprising:

In a competitive game in which a first character whose actions are controlled based on the player's operation input competes with a second character whose actions are controlled by a computer, the player's terminal is sent from a server device that controls the actions of the second character. A game providing method for providing information regarding a game to a device, the method comprising:
transmitting information regarding the player's operation input accepted by the terminal device to the server device;
transmitting information regarding the game from the server device to the terminal device;
The server device includes:
a learning section that learns the actions of the first character and related information related to the actions;
a behavior control unit that controls the behavior of the second character using the learning data learned in the learning unit;
Based on the player's information, at least one of a combination of a plurality of items constituting the related information, a learning frequency, and a weight of each of the plurality of items constituting the related information is determined according to the game progress. A game providing method, comprising: a setting section for making settings.