JP2023079293A

JP2023079293A - Information processing device, information processing system, information processing method, and program

Info

Publication number: JP2023079293A
Application number: JP2021192687A
Authority: JP
Inventors: 康貴田中; Yasutaka Tanaka; 明弘小出; Akihiro Koide; 朋史市丸; Tomofumi Ichimaru; 浩司塚本; Koji Tsukamoto; 健志手塚; Kenji Tezuka
Original assignee: Yahoo Japan Corp
Current assignee: Yahoo Japan Corp
Priority date: 2021-11-29
Filing date: 2021-11-29
Publication date: 2023-06-08
Anticipated expiration: 2041-11-29
Also published as: JP7351887B2

Abstract

To provide an information processing device, system, method and program for determining more appropriate incentive for each user.SOLUTION: In an information processing system in which a service server, a learning device and an information processing device communicate with one another through a network, the information processing device includes an acquisition part for acquiring features of a user of an object user, and a determination processing part for inputting the features of the object user to a first model and a second model, and determining an incentive to be given to the object user on the basis of an integrated score obtained by integrating a first score outputted by the first model and a second score outputted by the second model. The first model outputs a score showing the certainty that the object user performs an intended action in the case of assuming that no incentive or a first incentive is given to the user. The second model outputs a score showing the certainty that the object user performs an intended action in the case of assuming that a second incentive higher than the first incentive is given.SELECTED DRAWING: Figure 7

Description

本発明は、情報処理装置、情報処理システム、情報処理方法、およびプログラムに関する。 The present invention relates to an information processing device, an information processing system, an information processing method, and a program.

従来、インセンティブが付与されたユーザの購入履歴と、前記インセンティブが付与されたユーザごとの素性情報に基づいて生成されたインセンティブごとのユーザによる購入傾向をモデル化した予測モデルを用いて、インセンティブを付与することによって生じる購入行動に関する事象を最適化するための確率を予測する情報解析装置が開示されている（例えば、特許文献１参照）。 Conventionally, incentives are given using a prediction model that models the purchase trends of users for each incentive generated based on the purchase history of users who have been given incentives and the background information of each user who has been given incentives. An information analysis device is disclosed that predicts the probability for optimizing an event related to purchase behavior caused by purchasing (see, for example, Patent Literature 1).

特開２０１８－０７３３４９号公報JP 2018-073349 A

しかしながら、上記従来の技術では、ユーザごとに適切なインセンティブを決定することができない場合があった。 However, with the conventional technology described above, there are cases where an appropriate incentive cannot be determined for each user.

本発明は、このような事情が考慮されたものであり、ユーザごとにより適切なインセンティブを決定することができる情報処理装置、情報処理システム、情報処理方法、およびプログラムを提供することを目的の一つとする。 SUMMARY OF THE INVENTION It is an object of the present invention to provide an information processing apparatus, an information processing system, an information processing method, and a program capable of determining an appropriate incentive for each user. one.

本発明の一態様は、対象のユーザの特徴を取得する取得部と、前記対象のユーザの特徴を第１モデルおよび第２モデルに入力し、前記第１モデルが出力した第１スコアと前記第２モデルが出力した第２スコアとを統合して得られた統合スコアに基づいて、前記対象のユーザに付与するインセンティブを決定する決定部と、を備え、前記第１モデルは、インセンティブ無またはユーザに第１インセンティブを付与したと想定した場合に意図する行動を前記対象のユーザが行う確からしさを示すスコアを出力するモデルであり、前記第２モデルは、前記第１インセンティブより高い第２インセンティブを付与したと想定した場合に意図する行動を前記対象のユーザが行う確からしさを示すスコアを出力するモデルである情報処理装置である。 One aspect of the present invention is an acquisition unit that acquires features of a target user, inputs the features of the target user into a first model and a second model, outputs a first score and the first a determination unit that determines an incentive to be given to the target user based on the integrated score obtained by integrating the second score output by the two models, wherein the first model is provided with no incentive or the user is a model that outputs a score indicating the likelihood that the target user will perform the intended action when it is assumed that the first incentive is given to the second model, the second incentive is higher than the first incentive The information processing device is a model for outputting a score indicating the probability that the target user will perform the intended action when it is assumed that the target user performs the intended action.

本発明の一態様によれば、ユーザごとにより適切なインセンティブを決定することができる。 According to one aspect of the present invention, more appropriate incentives can be determined for each user.

情報処理装置を含む情報処理システム１の機能構成の一例を示す図である。1 is a diagram showing an example of a functional configuration of an information processing system 1 including an information processing device; FIG. 学習装置１００の機能構成の一例を示す図である。2 is a diagram showing an example of the functional configuration of the learning device 100; FIG. 第１学習情報１２２の内容の一例を示す図である。4 is a diagram showing an example of the contents of first learning information 122. FIG. 第１入会情報について説明するための図である。It is a figure for demonstrating the 1st admission information. 第２学習情報１２６の内容の一例を示す図である。4 is a diagram showing an example of the contents of second learning information 126. FIG. 第２入会情報について説明するための図である。It is a figure for demonstrating the 2nd admission information. 情報処理装置２００の機能構成の一例を示す図である。2 is a diagram showing an example of a functional configuration of an information processing device 200; FIG. ユーザ情報２２２の内容の一例を示す図である。4 is a diagram showing an example of contents of user information 222. FIG. 第１スコアおよび第２スコアが取得される処理について説明するための図である。FIG. 10 is a diagram for explaining a process of obtaining a first score and a second score; FIG. 統合スコアが取得される処理について説明するための図である。FIG. 11 is a diagram for explaining processing for obtaining an integrated score; FIG. インセンティブが決定される処理について説明するための図である。FIG. 4 is a diagram for explaining a process of determining incentives; FIG. 所定のユーザから得られた統合スコアと入会率との関係を示す図である。FIG. 10 is a diagram showing the relationship between integrated scores obtained from predetermined users and admission rates; 閾値の決定手法の一例について説明するための図（その１）である。FIG. 11 is a diagram (part 1) for explaining an example of a threshold determination method; 閾値の決定手法の一例について説明するための図（その２）である。FIG. 11 is a diagram (part 2) for explaining an example of a threshold determination method; 本実施形態および比較例の処理による入会者数およびインセンティブの合計を示す図である。FIG. 10 is a diagram showing the total number of enrollees and incentives according to the processing of the present embodiment and the comparative example; 情報処理システム１により実行される処理の流れの一例を示すシーケンス図である。3 is a sequence diagram showing an example of the flow of processing executed by the information processing system 1; FIG.

以下、図面を参照し、本発明の情報処理装置、情報処理システム、情報処理方法、およびプログラムの実施形態について説明する。 Hereinafter, embodiments of an information processing apparatus, an information processing system, an information processing method, and a program according to the present invention will be described with reference to the drawings.

［概要］
本実施形態の一態様の情報処理装置は、対象のユーザの特徴を取得する取得部と、対象のユーザの特徴を第１モデルおよび第２モデルに入力し、第１モデルが出力した第１スコアと第２モデルが出力した第２スコアとを統合して得られた統合スコアに基づいて、対象のユーザに付与するインセンティブを決定する決定部とを備える。 [overview]
An information processing apparatus according to one aspect of the present embodiment includes an acquisition unit that acquires features of a target user; and a determining unit that determines an incentive to be given to the target user based on the integrated score obtained by integrating the second score output by the second model.

ユーザの特徴とは、例えば、ユーザの行動履歴やユーザの属性などである。ユーザの行動履歴とは、商品またはサービスの購買を示す情報や、購買の金額を示す情報など種々の情報である。ユーザの行動履歴とは、ネットワーク上におけるユーザの行動履歴である。ユーザの行動履歴は、例えば、ネットワークを介して提供されているサービスにおけるユーザの行動履歴（購入した商品やサービス、商品の価格、サービスの価格などの購買履歴情報や、決済金額、検索クエリなど）である。 User features are, for example, the user's action history, user attributes, and the like. The user's action history includes various types of information such as information indicating purchases of products or services and information indicating purchase amounts. A user's action history is a user's action history on the network. The user's action history is, for example, the user's action history in services provided via the network (purchasing history information such as purchased products and services, product prices, service prices, payment amounts, search queries, etc.) is.

統合スコアとは、第１スコアと第２スコアから得られたスコアである。統合スコアは、第１スコアと第２スコアとを統計処理して得られたスコアである。統合スコアは、例えば、第１スコアと第２スコアとの差分に基づくスコアである。 An integrated score is a score obtained from the first score and the second score. An integrated score is a score obtained by statistically processing the first score and the second score. The integrated score is, for example, a score based on the difference between the first score and the second score.

第１モデルは、インセンティブ無またはユーザに第１インセンティブを付与したと想定した場合に意図する行動を対象のユーザが行う確からしさを示すスコアを出力するモデルである。第２モデルは、第１インセンティブより高い第２インセンティブを付与したと想定した場合に意図する行動を対象のユーザが行う確からしさを示すスコアを出力するモデルである。 The first model is a model that outputs a score indicating the probability that the target user will perform the intended action when it is assumed that there is no incentive or that the user is given the first incentive. The second model is a model that outputs a score indicating the probability that the target user will perform the intended action when it is assumed that the second incentive higher than the first incentive is given.

意図する行動とは、情報処理システム１の管理者やサービス提供者などのインセンティブの付与の主体（またはユーザ以外の所定の者）が意図する行動である。意図する行動とは、例えば、所定のサービスに対する申し込みや、アンケートの回答、会員登録、アプリケーションプログラムのインストール、商品の購入、サービスの購入など任意の行動である。意図する行動は、インセンティブの付与の主体（またはユーザ以外の所定の者）が意図するものであればよい。以下の説明では、意図する行動は、クレジットカードの入会（申し込み）であるものとして説明する。 An intended action is an action intended by an incentive granting subject (or a predetermined person other than the user) such as an administrator of the information processing system 1 or a service provider. The intended action is, for example, any action such as applying for a predetermined service, answering a questionnaire, registering as a member, installing an application program, purchasing a product, or purchasing a service. The intended behavior may be intended by the incentive granting subject (or a predetermined person other than the user). In the following explanation, it is assumed that the intended action is membership (application) for a credit card.

［情報処理システム］
図１は、情報処理装置を含む情報処理システム１の機能構成の一例を示す図である。情報処理システム１は、例えば、一以上のユーザの端末装置Ｕと、一以上のサービスサーバ１０と、学習装置１００と、情報処理装置２００とを備える。一以上のサービスサーバ１０と、学習装置１００と、情報処理装置２００とは（或いは情報処理装置２００と一以上のサービスサーバ１０とは）、一つのまたは同じグループに属する管理者によって管理されている。サービスサーバ１０と、学習装置１００と、情報処理装置２００とは、ネットワークＮＷを介して互いに通信する。ネットワークＮＷは、インターネットやＬＡＮ（Local Area Network）、ＷＡＮ（Wide Area Network）、セルラー網などを含む。また、サービスサーバ１０と端末装置Ｕとは、上記のネットワークＮＷまたは他のネットワークを介して互いに通信する。 [Information processing system]
FIG. 1 is a diagram showing an example of a functional configuration of an information processing system 1 including an information processing device. The information processing system 1 includes, for example, one or more user terminal devices U, one or more service servers 10 , a learning device 100 , and an information processing device 200 . One or more service servers 10, learning device 100, and information processing device 200 (or information processing device 200 and one or more service servers 10) are managed by one or the same group of administrators. . Service server 10, learning device 100, and information processing device 200 communicate with each other via network NW. The network NW includes the Internet, a LAN (Local Area Network), a WAN (Wide Area Network), a cellular network, and the like. Also, the service server 10 and the terminal device U communicate with each other via the above network NW or another network.

［端末装置］
端末装置Ｕは、スマートフォンやパーソナルコンピュータ、タブレット端末などの通信機能と表示機能を有するコンピュータ装置である。端末装置Ｕでは、ＣＰＵ（Central Processing Unit）などのプロセッサが端末装置Ｕの記憶装置に記憶されたプログラムを実行することで各種処理を実行する。 [Terminal device]
The terminal device U is a computer device having a communication function and a display function, such as a smart phone, a personal computer, or a tablet terminal. In the terminal device U, a processor such as a CPU (Central Processing Unit) executes a program stored in a storage device of the terminal device U to execute various processes.

［サービスサーバ］
サービスサーバ１０は、端末装置Ｕにサービスを提供する装置である。サービスサーバ１０は、例えば、ショッピングサービス（電子商取引のサービス）を提供するサーバ装置や、オークションサービスを提供するサーバ装置、フリーマーケットサービスを提供するサーバ装置、レストランの予約やホテルの予約、旅行の予約などのサービスを提供するサーバ装置、検索サービスを提供するサーバ装置など種々のサービスを提供するサーバ装置を含む。 [Service server]
The service server 10 is a device that provides the terminal device U with a service. The service server 10 includes, for example, a server device that provides shopping services (electronic commerce services), a server device that provides auction services, a server device that provides flea market services, restaurant reservations, hotel reservations, and travel reservations. and a server device that provides various services such as a server device that provides a search service.

［学習装置］
図２は、学習装置１００の機能構成の一例を示す図である。学習装置１００は、例えば、取得部１０２と、第１学習部１０４と、第２学習部１０６と、提供部１０８と、記憶部１２０とを備える。取得部１０２、第１学習部１０４、第２学習部１０６、および提供部１０８は、例えば、ＣＰＵ（Central Processing Unit）などのハードウェアプロセッサがプログラム（ソフトウェア）を実行することにより実現される。これらの構成要素のうち一部または全部は、ＬＳＩ（Large Scale Integration）やＡＳＩＣ（Application Specific Integrated Circuit）、ＦＰＧＡ（Field-Programmable Gate Array）、ＧＰＵ（Graphics Processing Unit）などのハードウェア（回路部；circuitryを含む）によって実現されてもよいし、ソフトウェアとハードウェアの協働によって実現されてもよい。プログラムは、予めＨＤＤ（Hard Disk Drive）やフラッシュメモリなどの記憶装置（非一過性の記憶媒体を備える記憶装置）に格納されていてもよいし、ＤＶＤやＣＤ－ＲＯＭなどの着脱可能な記憶媒体（非一過性の記憶媒体）に格納されており、記憶媒体がドライブ装置に装着されることで記憶装置にインストールされてもよい。各構成要素の機能については後述する。 [Learning device]
FIG. 2 is a diagram showing an example of the functional configuration of the learning device 100. As shown in FIG. The learning device 100 includes an acquisition unit 102, a first learning unit 104, a second learning unit 106, a providing unit 108, and a storage unit 120, for example. Acquisition unit 102, first learning unit 104, second learning unit 106, and providing unit 108 are implemented by a hardware processor such as a CPU (Central Processing Unit) executing a program (software), for example. Some or all of these components are hardware (circuit part; circuitry) or by cooperation of software and hardware. The program may be stored in advance in a storage device (a storage device with a non-transitory storage medium) such as a HDD (Hard Disk Drive) or flash memory, or may be stored in a removable storage such as a DVD or CD-ROM. It may be stored in a medium (non-transitory storage medium) and installed in the storage device by loading the storage medium into the drive device. The function of each component will be described later.

記憶部１２０は、例えば、例えば、ＲＯＭ（Read Only Memory）、フラッシュメモリ、ＳＤカード、ＲＡＭ（Random Access Memory）、ＨＤＤ（Hard Disc Drive）、レジスタ等によって実現される。また、記憶部１２０の一部または全部は、ＮＡＳ（Network Attached Storage）や外部ストレージサーバ装置等であってもよい。 The storage unit 120 is implemented by, for example, a ROM (Read Only Memory), a flash memory, an SD card, a RAM (Random Access Memory), a HDD (Hard Disc Drive), a register, and the like. Also, part or all of the storage unit 120 may be a NAS (Network Attached Storage), an external storage server device, or the like.

記憶部１２０には、例えば、第１学習情報１２２、第１モデル１２４、第２学習情報１２６、および第２モデル１２８が記憶されている。これらの情報の詳細については後述する。第１モデル１２４または第２モデル１２８は、例えば、機械学習を利用したモデルであって、勾配ブースティングを用いたモデルや、決定木アルゴリズムを用いたモデルであってもよいし、サポートベクターマシンや、ニューラルネットワークを、ディープラーニングなどを用いたモデルであってもよい。なお、第１学習情報１２２は、「意図した行動をユーザが行ってもインセンティブ無であることを示してまたは意図した行動をユーザが行った場合に第１インセンティブが付与されることを示してユーザが意図した行動を行ったことを示す情報と、意図した行動を行ったユーザの特徴とが対応付けられた情報」の一例である。第２学習情報１２６は、「意図した行動をユーザが行った場合に第２インセンティブが付与されることを示してユーザが意図した行動を行ったことを示す情報と、意図した行動を行ったユーザの特徴とが対応付けられた情報」の一例である。 The storage unit 120 stores, for example, first learning information 122, first model 124, second learning information 126, and second model 128. FIG. Details of these pieces of information will be described later. The first model 124 or the second model 128 may be, for example, a model using machine learning, a model using gradient boosting, a model using a decision tree algorithm, a support vector machine, or , a neural network, or a model using deep learning or the like. Note that the first learning information 122 is "a user who indicates that there is no incentive even if the user performs the intended action, or that the first incentive is given when the user performs the intended action." is an example of "information in which information indicating that a user performed an intended action is associated with the characteristics of the user who performed the intended action". The second learning information 126 includes “information indicating that the user has performed the intended action by indicating that a second incentive will be given when the user performs the intended action, and information indicating that the user has performed the intended action. is an example of "information associated with the features of

取得部１０２は、記憶部１２０に記憶された第１学習情報１２２および第２学習情報１２６（後述）を取得し、取得した第１学習情報１２２を第１学習部１０４に提供し、取得した第２学習情報１２６を第２学習部１０６に提供する。 Acquisition unit 102 acquires first learning information 122 and second learning information 126 (described later) stored in storage unit 120, provides acquired first learning information 122 to first learning unit 104, and acquires the acquired first learning information 122. 2 provides learning information 126 to the second learning unit 106;

第１学習部１０４は、第１学習情報１２２を学習して第１モデル１２４を生成する。図３は、第１学習情報１２２の内容の一例を示す図である。第１学習情報１２２は、ユーザの識別情報（ＩＤ）に対して、ネットワークＮＷ上におけるユーザの行動履歴と、インセンティブなしでクレジットカードの入会の案内がユーザに提供された場合のユーザが入会を行ったか否かを示す第１入会情報とが対応付けられた情報である。ユーザのＩＤは、例えば、サービスサーバ１０が提供するサービスを利用するために、ユーザがログインに利用するユーザのＩＤである。ユーザのＩＤは、ＨＴＴＰＣｏｏｋｉｅを利用したユーザのＩＤであってもよい。 The first learning unit 104 learns the first learning information 122 to generate the first model 124 . FIG. 3 is a diagram showing an example of the contents of the first learning information 122. As shown in FIG. The first learning information 122 includes, for user identification information (ID), the user's action history on the network NW and the user's membership when the user is provided with credit card membership guidance without incentives. This information is associated with the first membership information indicating whether or not the membership has been completed. The user ID is, for example, a user ID used for login by the user in order to use the service provided by the service server 10 . The user ID may be a user ID using HTTP Cookie.

ユーザの行動履歴は、例えば、電子商取引のサービスにおける行動履歴、オークションサービスにおける行動履歴、または検索サービスにおける行動履歴のうち一以上の行動履歴である。ユーザの行動履歴とは、例えば、どのサービスをどの程度利用しているかを示す情報や、どのような商品またはサービスを購入したかを示す情報（購買履歴情報）、どの程度の金額をサービスで利用したかを示す情報（例えば購入した商品の金額など）、検索に用いた検索クエリなど任意の行動履歴である。 The user's action history is, for example, one or more action histories of an action history in an electronic commerce service, an action history in an auction service, or an action history in a search service. The user's action history includes, for example, information indicating which services are used and how much, information indicating what products or services have been purchased (purchase history information), and how much money has been used for services. It is arbitrary action history such as information indicating whether or not the product was purchased (for example, the price of the product purchased), search query used for search, and the like.

図４は、第１入会情報について説明するための図である。サービスサーバ１０は、例えば、ユーザが端末装置Ｕを操作して商品の購入を行う操作を行った場合、図４に示すコンテンツを端末装置Ｕに提供する。ユーザは、例えば、ユーザＩＤを用いてサービスサーバ１０が提供するサービスを利用するためにログインしたユーザである。このコンテンツは、クレジットカードの入会をレコメンドするコンテンツであり、クレジットカードのデザインや機能、年会費などクレジットカードに関する情報、およびクレジットカードの入会に申し込むための申し込みボタンが含まれる。 FIG. 4 is a diagram for explaining the first membership information. The service server 10 provides the terminal device U with the content shown in FIG. A user is, for example, a user who logs in to use a service provided by the service server 10 using a user ID. This content is content for recommending credit card membership, and includes credit card design and functions, credit card information such as annual membership fees, and an application button for applying for credit card membership.

例えば、ユーザが申し込みボタンを操作した場合、クレジットカードの申し込み画面が表示され、この画面においてユーザは、各種情報を入力することでクレジットカードの申し込みを行うことができる。クレジットカードの申し込み画面は、サービスサーバ１０が提供してもよいし、サービスサーバ１０と連携している他のサービスサーバ１０（クレジットカードの申し込みを受けるためのサービスサーバ１０やクレジットカードのサービスを管理するサービスサーバ）が提供してもよい。サービスサーバ１０は、上記のユーザの操作に関する情報を管理し、管理する情報を学習装置１００に提供する。学習装置１００は、サービスサーバ１０から取得した情報を第１学習情報１２２として管理する。 For example, when the user operates an application button, a credit card application screen is displayed, and the user can apply for a credit card by inputting various information on this screen. The credit card application screen may be provided by the service server 10, or may be provided by another service server 10 cooperating with the service server 10 (the service server 10 for receiving credit card applications and managing credit card services). service server) may provide. The service server 10 manages the information about the user's operation and provides the learning device 100 with the information to be managed. The learning device 100 manages information acquired from the service server 10 as first learning information 122 .

クレジットカードの申し込みを行ったユーザの情報は正例データ（第１正例データ）であり、クレジットカードの申し込みを行わなかったユーザの情報は負例データ（第１負例データ）である。第１学習部１０４は、第１正例データおよび第１負例データのユーザの行動履歴をベクトル化したユーザベクトルを生成する。ユーザベクトルは、ユーザの行動履歴を分散表現化したものである。ユーザベクトルは、ユーザの特徴を示す。第１学習部１０４は、第１正例データのユーザベクトルをモデルに入力した場合にモデルが出力するスコアが所定値（例えば「１」）に近づき、第１負例データのユーザベクトルをモデルに入力した場合にモデルが出力するスコアが所定値（例えば「ゼロ」）に近づくように学習を行って、第１モデル１２４を生成する。 Information on users who have applied for a credit card is positive example data (first positive example data), and information on users who have not applied for a credit card is negative example data (first negative example data). The first learning unit 104 generates a user vector by vectorizing the user's action history of the first positive example data and the first negative example data. A user vector is a distributed representation of a user's action history. A user vector indicates user characteristics. The first learning unit 104 determines that when the user vector of the first positive data is input to the model, the score output by the model approaches a predetermined value (for example, "1"), and the user vector of the first negative data is used as the model. The first model 124 is generated by performing learning so that the score output by the model when input approaches a predetermined value (for example, "zero").

これにより、第１モデル１２４は、第１正例データのユーザベクトルが入力された場合に「１」または「１」に近いスコアを出力する傾向となり、第１負例データのユーザベクトルが入力された場合に「ゼロ」または「ゼロ」に近いスコアを出力する傾向となる。第１モデル１２４が出力するスコアは、「インセンティブ無またはユーザに第１インセンティブを付与したと想定した場合に意図する行動を前記対象のユーザが行う確からしさを示すスコア」の一例である。 As a result, the first model 124 tends to output a score of "1" or close to "1" when the user vector of the first positive data is input, and the user vector of the first negative data is input. tend to output a score of “zero” or close to “zero” when The score output by the first model 124 is an example of "a score indicating the probability that the target user will perform the intended action when it is assumed that there is no incentive or that the user is given the first incentive".

第１モデル１２４は、「意図した行動をユーザが行ってもインセンティブ無であることをユーザに示しても意図した行動を行ったユーザの特徴が入力されると、インセンティブが与えられなくても意図する行動を行う傾向が高いことを示す第１スコアを出力するように学習されたモデル」または「意図した行動をユーザが行った場合に第１インセンティブが付与されることを示して意図した行動を行ったユーザの特徴が入力されると、第１インセンティブが与えられることで意図する行動を行う傾向が高いことを示す第１スコアを出力するように学習されたモデル」の一例である。 The first model 124 is such that even if the user performs the intended action, even if the user is shown that there is no incentive, when the characteristics of the user who performed the intended action are input, the intentional behavior is determined even if no incentive is given. A model trained to output a first score indicating that the user is likely to perform the intended action, or a model that indicates that the user will be given the first incentive when the intended action is performed. This is an example of a "model that has been learned to output a first score indicating that there is a high tendency to perform an intended action by being given a first incentive when characteristics of the user who performed the action are input.

第２学習部１０６は、第２学習情報１２６を学習して第２モデル１２８を生成する。図５は、第２学習情報１２６の内容の一例を示す図である。第２学習情報１２６は、ユーザの識別情報（ＩＤ）に対して、ネットワークＮＷ上におけるユーザの行動履歴と、インセンティブありでクレジットカードの入会の案内がユーザに提供された場合のユーザが入会を行ったか否かを示す第２入会情報とが対応付けられた情報である。 The second learning unit 106 learns the second learning information 126 to generate a second model 128 . FIG. 5 is a diagram showing an example of the contents of the second learning information 126. As shown in FIG. The second learning information 126 includes, for user identification information (ID), the user's action history on the network NW and the user's membership when the user is provided with incentives and credit card membership guidance. This information is associated with the second membership information indicating whether or not the membership has been completed.

図６は、第２入会情報について説明するための図である。第１入金情報との相違点について説明する。サービスサーバ１０は、例えば、ユーザＩＤを用いてサービスサーバ１０が提供するサービスを利用するためにログインしたユーザが端末装置Ｕを操作して商品の購入を行う操作を行った場合、図６に示すコンテンツを端末装置Ｕに提供する。このコンテンツには、前述した図４で示したコンテンツに含まれる情報の他にインセンティブに関する情報が含まれる。インセンティブは、例えば、申し込みの特典である。インセンティブは、例えば、クレジットカードにユーザが入会した場合にユーザに所定のポイントが付与されることである。学習装置１００は、サービスサーバ１０から取得した情報を第２学習情報１２６として管理する。なお、インセンティブは、ポイントに限られず、ユーザが入会を行う動機付けとなると考えられる他の特典（例えば割引、クーポン付与等）であってもよい。 FIG. 6 is a diagram for explaining the second membership information. Differences from the first deposit information will be explained. For example, when a user who has logged in using a user ID to use a service provided by the service server 10 operates the terminal device U to purchase a product, the service server 10 performs an operation shown in FIG. The content is provided to the terminal device U. This content includes information about incentives in addition to the information included in the content shown in FIG. Incentives are, for example, subscription benefits. The incentive is, for example, that the user is given predetermined points when the user signs up for a credit card. The learning device 100 manages information acquired from the service server 10 as second learning information 126 . Incentives are not limited to points, and may be other benefits (for example, discounts, coupons, etc.) that are considered to motivate users to become members.

クレジットカードの申し込みを行ったユーザの情報は正例データ（第２正例データ）であり、クレジットカードの申し込みを行わなかったユーザの情報は負例データ（第２負例データ）である。第２学習部１０６は、第２正例データおよび第２負例データのユーザの行動履歴をベクトル化したユーザベクトルを生成する。第２学習部１０６は、第２正例データのユーザベクトルをモデルに入力した場合にモデルが出力するスコアが所定値（例えば「１」）に近づき、第２負例データのユーザベクトルをモデルに入力した場合にモデルが出力するスコアが所定値（例えば「ゼロ」）に近づくように学習を行って、第２モデル１２８を生成する。 Information on users who have applied for a credit card is positive example data (second positive example data), and information on users who have not applied for a credit card is negative example data (second negative example data). The second learning unit 106 generates a user vector by vectorizing the user's action history of the second positive example data and the second negative example data. The second learning unit 106 determines that when the user vector of the second positive example data is input to the model, the score output by the model approaches a predetermined value (for example, “1”), and the user vector of the second negative example data is applied to the model. A second model 128 is generated by performing learning so that the score output by the model when input approaches a predetermined value (for example, "zero").

これにより、第２モデル１２８は、第２正例データのユーザベクトルが入力された場合に「１」または「１」に近いスコアを出力する傾向となり、第２負例データのユーザベクトルが入力された場合に「ゼロ」または「ゼロ」に近いスコアを出力する傾向となる。第２モデル１２８が出力するスコアは、「前記第１インセンティブより高い第２インセンティブを付与したと想定した場合に意図する行動を前記対象のユーザが行う確からしさを示すスコア」の一例である。 As a result, the second model 128 tends to output a score of "1" or close to "1" when the user vector of the second positive data is input, and the user vector of the second negative data is input. tend to output a score of “zero” or close to “zero” when The score output by the second model 128 is an example of "a score indicating the likelihood that the target user will perform the intended action when it is assumed that a second incentive higher than the first incentive is given."

第２モデル１２８は「意図した行動をユーザが行った場合に第２インセンティブが付与されることを示して意図した行動を行ったユーザの特徴が入力されると、第２インセンティブが与えられることで意図する行動を行う傾向が高いことを示す第２スコアを出力するように学習されたモデル」の一例である。 The second model 128 is defined as "when the characteristics of the user who performed the intended action are input, the second incentive is given by indicating that the second incentive is given when the user performs the intended action. This is an example of a model that has been trained to output a second score that indicates a high tendency to perform the intended action.

提供部１０８は、生成された第１モデル１２４および第２モデル１２８を情報処理装置２００に提供する。 The providing unit 108 provides the generated first model 124 and second model 128 to the information processing apparatus 200 .

なお、上記の例では、ユーザの行動履歴に基づいてユーザベクトルが生成されるものとして説明したが、ユーザの行動履歴に加え（または代えて）、ユーザの属性が加味されたユーザベクトルが利用されてもよい。また、後述するように情報処理装置２００も同様に、ユーザの行動履歴に加え（または代えて）、ユーザの属性が加味されたユーザベクトルを利用して、ユーザに対するインセンティブを決定してもよい。ユーザの属性とは、例えば、ユーザのＩＤに対応付けられている性別や年齢、家族構成などの種々の情報である。 In the above example, the user vector is generated based on the user's action history. However, in addition to (or instead of) the user's action history, a user vector that takes into account the user's attributes is used. may Further, as will be described later, the information processing apparatus 200 may similarly determine incentives for the user by using a user vector in which the user's attributes are added in addition to (or instead of) the user's action history. User attributes are, for example, various information such as sex, age, and family structure associated with the user's ID.

［情報処理装置］
図７は、情報処理装置２００の機能構成の一例を示す図である。情報処理装置２００は、例えば、情報管理部２０２と、第１処理部２０４と、第２処理部２０６と、決定処理部２０８と、記憶部２２０とを備える。情報管理部２０２、第１処理部２０４、第２処理部２０６、および決定処理部２０８は、例えば、ＣＰＵなどのハードウェアプロセッサがプログラム（ソフトウェア）を実行することにより実現される。これらの構成要素のうち一部または全部は、ＬＳＩやＡＳＩＣ、ＦＰＧＡ、ＧＰＵなどのハードウェア（回路部；circuitryを含む）によって実現されてもよいし、ソフトウェアとハードウェアの協働によって実現されてもよい。プログラムは、予めＨＤＤやフラッシュメモリなどの記憶装置（非一過性の記憶媒体を備える記憶装置）に格納されていてもよいし、ＤＶＤやＣＤ－ＲＯＭなどの着脱可能な記憶媒体（非一過性の記憶媒体）に格納されており、記憶媒体がドライブ装置に装着されることで記憶装置にインストールされてもよい。各構成要素の機能については後述する。 [Information processing device]
FIG. 7 is a diagram showing an example of the functional configuration of the information processing device 200. As shown in FIG. The information processing apparatus 200 includes, for example, an information management unit 202, a first processing unit 204, a second processing unit 206, a determination processing unit 208, and a storage unit 220. The information management unit 202, the first processing unit 204, the second processing unit 206, and the decision processing unit 208 are implemented by, for example, a hardware processor such as a CPU executing a program (software). Some or all of these components may be realized by hardware (including circuitry) such as LSI, ASIC, FPGA, GPU, etc., or by cooperation of software and hardware. good too. The program may be stored in advance in a storage device such as an HDD or flash memory (a storage device with a non-transitory storage medium), or may be stored in a removable storage medium such as a DVD or CD-ROM (non-transitory storage medium). physical storage medium), and may be installed in the storage device by mounting the storage medium in the drive device. The function of each component will be described later.

記憶部２２０は、例えば、ＲＯＭ、フラッシュメモリ、ＳＤカード、ＲＡＭ、ＨＤＤ、レジスタ等によって実現される。また、記憶部２２０の一部または全部は、ＮＡＳや外部ストレージサーバ装置等であってもよい。記憶部２２０には、例えば、ユーザ情報２２２、第１モデル２２４、および第２モデル２２６が記憶されている。 The storage unit 220 is implemented by, for example, a ROM, flash memory, SD card, RAM, HDD, register, and the like. Also, part or all of the storage unit 220 may be a NAS, an external storage server device, or the like. User information 222, a first model 224, and a second model 226 are stored in the storage unit 220, for example.

図８は、ユーザ情報２２２の内容の一例を示す図である。ユーザ情報２２２は、サービスサーバ１０に提供された情報や、サービスサーバ１０の情報を管理するサーバ装置により提供された情報である。ユーザ情報２２２は、ユーザの識別情報と、ユーザの行動履歴とが対応付けられた情報である。 FIG. 8 is a diagram showing an example of the contents of the user information 222. As shown in FIG. The user information 222 is information provided to the service server 10 or information provided by a server device that manages the information of the service server 10 . The user information 222 is information in which user identification information and user action history are associated with each other.

情報管理部２０２は、サービスサーバ１０や学習装置１００に提供された情報を取得し、取得した情報を管理する。 The information management unit 202 acquires information provided to the service server 10 and the learning device 100, and manages the acquired information.

第１処理部２０４は、第１モデル２２４を用いて第１スコア（後述）を取得する。第２処理部２０６は、第２モデル２２６を用いて第２スコア（後述）を取得する。決定処理部２０８は、第１スコアと第２スコアとを統合して統合スコアを取得する。決定処理部２０８は、取得した統合スコアに基づいて対象のユーザに付与するインセンティブを決定する。決定処理部２０８は、例えば、第２スコアから第１スコアを減算した統合スコアに基づいてユーザに付与するインセンティブを決定する。インセンティブを決定するとは、インセンティブを付与することを決定することや、インセンティブの内容を決定することである。 The first processing unit 204 acquires a first score (described later) using the first model 224 . The second processing unit 206 uses the second model 226 to obtain a second score (described later). The decision processing unit 208 acquires an integrated score by integrating the first score and the second score. The determination processing unit 208 determines an incentive to be given to the target user based on the acquired integrated score. The decision processing unit 208 decides the incentive to be given to the user, for example, based on the integrated score obtained by subtracting the first score from the second score. Determining an incentive means determining to give an incentive or determining the content of the incentive.

［情報処理装置に行われる処理］
図９は、第１スコアおよび第２スコアが取得される処理について説明するための図である。情報管理部２０２が、対象のユーザの行動履歴を取得する。第１処理部２０４は、対象のユーザの行動履歴に基づいて生成されたユーザベクトルを生成する。 [Processing performed by information processing device]
FIG. 9 is a diagram for explaining the process of obtaining the first score and the second score. The information management unit 202 acquires the action history of the target user. The first processing unit 204 generates a user vector generated based on the action history of the target user.

第１処理部２０４は、ユーザベクトルを第１モデル２２４に入力し、第１モデル２２４が出力した第１スコアを取得する。第１スコアは、インセンティブなしでクレジットカードに入会する可能性を示すスコアである。例えば、第１スコアが「１」に近いほど、入会する可能性が高い。 The first processing unit 204 inputs the user vector to the first model 224 and acquires the first score output by the first model 224 . The first score is a score that indicates the possibility of joining the credit card without any incentive. For example, the closer the first score is to "1", the higher the possibility of becoming a member.

第２処理部２０６は、前述したユーザベクトルを第２モデル２２６に入力し、第２モデル２２６が出力した第２スコアを取得する。第２スコアは、インセンティブありでクレジットカードに入会する可能性を示すスコアである。例えば、第２スコアが「１」に近いほど、入会する可能性が高い。 The second processing unit 206 inputs the above-described user vector to the second model 226 and acquires the second score output by the second model 226 . The second score is a score that indicates the possibility of joining a credit card with an incentive. For example, the closer the second score is to "1", the higher the possibility of becoming a member.

図１０は、統合スコアが取得される処理について説明するための図である。決定処理部２０８は、第２スコアから第１スコアを減算したスコアを統合スコアとして取得する。例えば、インセンティブありに対応する第２スコア「０．９」からインセンティブなしに対応する第１スコア「０．５」を減算した統合スコアが「プラス０．４」である場合、すなわち統合スコアが正である場合、比較的、対象のユーザはインセンティブなしでは入会を行わずにインセンティブありで入会を行う傾向であると推測される。 FIG. 10 is a diagram for explaining the process of acquiring the integrated score. The decision processing unit 208 acquires a score obtained by subtracting the first score from the second score as an integrated score. For example, if the integrated score obtained by subtracting the first score "0.5" corresponding to no incentive from the second score "0.9" corresponding to incentive is "plus 0.4", that is, the integrated score is positive , it is estimated that the target user tends to join with an incentive rather than joining without an incentive.

例えば、インセンティブありに対応する第２スコア「０．５」からインセンティブなしに対応する第１スコア「０．９」を減算した統合スコアが「マイナス０．４」である場合、すなわち統合スコアが負である場合、比較的、対象のユーザはインセンティブなしでも入会を行う傾向であると推測される。 For example, if the integrated score obtained by subtracting the first score "0.9" corresponding to no incentive from the second score "0.5" corresponding to incentive is "minus 0.4", that is, the integrated score is negative. If , it is estimated that the target user is relatively likely to join the membership without any incentive.

図１１は、インセンティブが決定される処理について説明するための図である。決定処理部２０８は、統合スコアと閾値とを比較してインセンティブを付与するかを決定する。決定処理部２０８は、例えば、図１１の上図のように統合スコアが閾値Ｔｈ以上である場合、対象のユーザにインセンティブを付与すると決定し、図１１の下図のように統合スコアが閾値Ｔｈ未満である場合、対象のユーザにインセンティブを付与しないと決定する。閾値の決定手法については後述する。 FIG. 11 is a diagram for explaining the process of determining incentives. The decision processing unit 208 compares the integrated score with the threshold to decide whether to give an incentive. For example, when the integrated score is equal to or greater than the threshold Th as shown in the upper diagram of FIG. 11, the determination processing unit 208 determines to give the incentive to the target user, , it is determined not to give the incentive to the target user. A method for determining the threshold will be described later.

これにより、例えば、インセンティブを付与しないと決定されたユーザに対しては、インセンティブなしでクレジットカードを入会するレコメンドがされ、インセンティブを付与すると決定されたユーザに対しては、インセンティブの付与ありでクレジットカードを入会するレコメンドがされる。 As a result, for example, a user determined not to give an incentive is recommended to apply for a credit card without an incentive, and a user determined to give an incentive is credited with an incentive. A recommendation to join the card is made.

なお、上述した図９－図１１で説明した処理の一部または全部は、サービスサーバ１０において実行されてもよい。例えば、上記の情報処理装置２００の機能構成の一部または全部は、他の装置に搭載されてもよい。 Part or all of the processing described with reference to FIGS. 9 to 11 may be executed in the service server 10. For example, part or all of the functional configuration of the information processing device 200 described above may be installed in another device.

図１２は、所定のユーザから得られた統合スコアと入会率との関係を示す図である。図１２の縦軸は入会率を示し、図１２の横軸は統合スコアを示している。図１２のプロット線のそれぞれは、インセンティブなしの場合の入会率と統合スコアとの関係を示すプロット線（インセンティブなし）、インセンティブＡを付与した場合の入会率と統合スコアとの関係を示すプロット線（インセンティブＡ）、インセンティブＢ（＞Ａ）を付与した場合の入会率と統合スコアとの関係を示すプロット線（インセンティブＢ）を示している。 FIG. 12 is a diagram showing the relationship between integrated scores obtained from predetermined users and membership rates. The vertical axis in FIG. 12 indicates the membership rate, and the horizontal axis in FIG. 12 indicates the integrated score. Each of the plot lines in FIG. 12 is a plot line showing the relationship between the admission rate and the integrated score when there is no incentive (no incentive), and a plot line showing the relationship between the admission rate and the integrated score when incentive A is given. (Incentive A) and a plot line (Incentive B) showing the relationship between the admission rate and the integrated score when incentive B (>A) is given.

具体的には、プロット線（インセンティブなし）は、過去にインセンティブなしで入会をレコメンドしたユーザごとの統合スコアおよび入会の有無の情報から得たものである。プロット線（インセンティブなし）の統合スコア「ゼロ」に着目した場合の入会率は、例えば、過去にインセンティブなしで入会をレコメンドされたユーザのうち統合スコアが「ゼロ」のユーザにおいて、入会を行った割合である。他のプロット線も同様のため、具体的な説明は省略する。このように、プロット線の統合スコアからレコメンドの条件および統合スコアが同じユーザにおける入会率が認識可能である。 Specifically, the plot line (without incentive) is obtained from the integrated score for each user who recommended membership without incentive in the past and the information on whether or not there was membership. The admission rate when focusing on the integrated score "zero" of the plot line (no incentive) is, for example, among the users who were recommended to join without incentives in the past, the users with the integrated score of "zero" joined. percentage. Since other plot lines are similar, detailed description is omitted. In this way, it is possible to recognize the joining rate of users with the same recommendation conditions and the same integrated score from the integrated score of the plot line.

プロット線（インセンティブなし）を参照すると、統合スコアが大きい場合よりも小さい場合（ゼロやマイナスである場合）の方が、入会率が高いことがわかる。これにより、統合スコアが高くないユーザは、インセンティブを付与しなくても比較的入会する可能性が高いことがわかる。プロット線（インセンティブＡ）およびプロット線（インセンティブＢ）を参照すると、インセンティブの付与度合が小さい場合よりも大きい場合の方が、入会率が高いことがわかる。更に、各プロット線の統合スコアに着目すれば、インセンティブの付与度合によって、どの程度、入会率が変化するかがわかる。 Looking at the plot line (without incentives), we can see that the enrollment rate is higher when the integrated score is low (zero or negative) than when it is high. As a result, it can be seen that users whose integrated scores are not high are likely to become members even if incentives are not given. Referring to the plotted line (incentive A) and the plotted line (incentive B), it can be seen that the membership rate is higher when the degree of incentive given is greater than when the degree of incentive is small. Furthermore, by focusing on the integrated score of each plot line, it can be understood to what degree the admission rate changes depending on the degree of incentive provision.

［閾値が設定される処理］
上記の傾向に基づいて、インセンティブの付与が決定される。例えば、決定処理部２０８が用いる閾値が決定される。図１３は、閾値の決定手法の一例について説明するための図（その１）である。図１２との相違点について説明する。例えば、統合スコアに対して閾値を設定する。例えば、閾値未満の統合スコアのユーザに対してインセンティブなしで入会をレコメンドした場合に想定される入会者数と、閾値以上の統合スコアのユーザに対してインセンティブありで入会をレコメンドした場合に想定される入会者数との合計が所望の入会数以上や最大になるように閾値が最適化される。 [Processing in which a threshold is set]
Based on the above trends, incentives are determined. For example, a threshold used by the decision processing unit 208 is determined. FIG. 13 is a diagram (part 1) for explaining an example of a threshold determination method; Differences from FIG. 12 will be described. For example, a threshold is set for the integrated score. For example, the expected number of enrollees when recommending enrollment without incentives to users with integrated scores below the threshold, and the number of enrollees when recommending enrollment with incentives to users with integrated scores above the threshold The threshold is optimized so that the sum of the number of enrollees and the number of enrollees is the desired number of enrollees or more or the maximum.

更に、最適化される際に、インセンティブの合計が条件を満たすように最適化されてもよい。例えば、インセンティブがポイント付与の場合、ユーザに付与されるポイントの合計が所望のポイント内で入会者数の合計が所望の入会数以上や最大になるように最適化されてもよい。更に、上記と同じ考え方で、各種条件が設定されてもよい。各種条件とは、インセンティブの合計が最小となることや、サービスサーバ１０の利用頻度が所定度合以上のユーザの入会が基準を満たすことなどである。 Further, when optimized, the total incentive may be optimized to meet the conditions. For example, when the incentive is to give points, the total number of points given to the user may be optimized so that the total number of new members is greater than or equal to the desired number of new members. Furthermore, various conditions may be set based on the same concept as above. The various conditions include that the total of incentives is minimized, that the membership of a user whose frequency of use of the service server 10 is equal to or higher than a predetermined degree, and the like.

例えば、閾値Ｔｈ１が設定されてもよい。閾値Ｔｈ１は、図１３に示すように、インセンティブなしであっても所望の入会率や入会者数の獲得が見込まれる統合スコアに対応する閾値である。閾値Ｔｈ１は、例えば、プロット線（インセンティブなし）の入会率が、プロット線（インセンティブＢ）の入会率と交わる（超える）統合スコアであってもよい。このように、閾値Ｔｈ１が設定されることで、閾値Ｔｈ１未満のユーザに対してはインセンティブを付与しなくても、高い入会率や入会者数が期待される。これにより、インセンティブの付与を抑制しつつ、効率的に入会へ誘導することができる。 For example, a threshold Th1 may be set. The threshold Th1, as shown in FIG. 13, is a threshold corresponding to an integrated score that is expected to achieve a desired enrollment rate and enrollment number even without incentives. The threshold Th1 may be, for example, an integrated score at which the admission rate of the plot line (no incentive) intersects (exceeds) the admission rate of the plot line (incentive B). By setting the threshold Th1 in this way, a high admission rate and a high number of new members can be expected without giving incentives to users who are less than the threshold Th1. As a result, it is possible to efficiently induce membership while suppressing the provision of incentives.

上記の例では、統合スコアが閾値Ｔｈ１未満であるか否かによってインセンティブの付与を決定したが、これに代えて、２以上の閾値に基づいて、インセンティブの付与や、付与するインセンティブの度合が決定されてもよい。 In the above example, granting of incentives is determined depending on whether or not the integrated score is less than the threshold Th1. Alternatively, granting of incentives and the degree of incentives to be granted are determined based on thresholds of 2 or more. may be

図１４は、閾値の決定手法の一例について説明するための図（その２）である。図１３との相違点について説明する。図１４では、閾値Ｔｈ２が設定される。閾値Ｔｈ２は、閾値Ｔｈ１よりも大きい統合スコアに対応する値である。統合スコアが、閾値Ｔｈ１以上、且つ閾値Ｔｈ２未満のユーザに対しては、インセンティブＡが付与され、統合スコアが、閾値Ｔｈ２以上のユーザに対しては、インセンティブＢが付与される。上記の場合においても、例えば、付与するインセンティブの合計や、獲得が期待できるユーザ数、期待される入会率、種々の制約条件等に基づいて、閾値Ｔｈ１および閾値Ｔｈ２が設定される。 FIG. 14 is a diagram (part 2) for explaining an example of a threshold determination method. Differences from FIG. 13 will be described. In FIG. 14, a threshold Th2 is set. The threshold Th2 is a value corresponding to an integrated score greater than the threshold Th1. Incentive A is given to users whose integrated score is greater than or equal to threshold Th1 and less than threshold Th2, and incentive B is given to users whose integrated score is greater than or equal to threshold Th2. Also in the above case, the threshold Th1 and the threshold Th2 are set based on, for example, the total incentives to be given, the number of users that can be expected to be acquired, the expected membership rate, various constraints, and the like.

なお、上記の閾値は、情報処理装置２００（決定処理部２０８）が統計処理を行って導出してもよいし、他の装置から提供されてもよい。また、例えば、管理者の所望の条件（所望の獲得入会者数や制約条件）が入力されると、例えば、情報処理装置２００が、自動で条件を満たすように閾値を導出してもよい。また、この場合に、一つの閾値を導出することや、複数の閾値を導出することを指定することが入力された場合、この指定に基づいて、情報処理装置２００は、閾値を導出してもよい。更に、この場合、情報処理装置２００は、閾値で区切られた区分ごとに、インセンティブの有無や内容を自動で決定してもよい。 The above threshold may be derived by statistical processing performed by the information processing apparatus 200 (the determination processing unit 208), or may be provided from another apparatus. Further, for example, when the administrator's desired conditions (desired number of acquired members and constraint conditions) are input, for example, the information processing apparatus 200 may automatically derive a threshold value so as to satisfy the conditions. Further, in this case, if a specification to derive one threshold or to derive a plurality of thresholds is input, the information processing apparatus 200 may derive thresholds based on this specification. good. Furthermore, in this case, the information processing apparatus 200 may automatically determine the presence or absence of incentives and the content thereof for each category divided by thresholds.

［比較例との比較］
図１５は、本実施形態および比較例の処理による入会者数およびインセンティブの合計を示す図である。図１５の左図は、比較例の処理による入会者数およびインセンティブの合計である。比較例では、経路ごとにインセンティブの有無や度合を変更している。経路とは、どのようなコンテンツを辿ったかなどのネットワークにおける経路である。 [Comparison with Comparative Example]
FIG. 15 is a diagram showing the total number of enrollees and incentives according to the processing of the present embodiment and the comparative example. The left diagram of FIG. 15 shows the total number of enrollees and incentives by the processing of the comparative example. In the comparative example, the presence or absence and degree of incentives are changed for each route. A route is a route in a network such as what kind of content is followed.

図１５の右図は、本実施形態の情報処理装置２００を利用してユーザごとにインセンティブの付与の有無や度合を変更した場合の入会者数およびインセンティブの合計である。本実施形態では、上述したようにインセンティブを付与しなくても入会が期待されるユーザに対してはインセンティブを付与せずに、インセンティブを付与することで入会が期待されるユーザに対してはインセンティブを付与する。本実施形態では、インセンティブを効率的に付与することで、より多くの入会者数を効率的に獲得することができる。 The right diagram of FIG. 15 shows the total number of enrollees and incentives when the information processing apparatus 200 of the present embodiment is used to change the presence or absence and degree of incentives for each user. In this embodiment, as described above, users who are expected to join without giving incentives are not given incentives, and incentives are given to users who are expected to join. to give In this embodiment, by efficiently providing incentives, it is possible to efficiently acquire a larger number of members.

特に、本実施形態では、第２スコアから第１スコアを減算した統合スコアを用いて（統合スコアと閾値を用いて）、ユーザに対するインセンティブを決定することができる。例えば、上記とは異なる他の比較例である「インセンティブが付与されたユーザの購入履歴と、前記インセンティブが付与されたユーザごとの素性情報に基づいて生成されたインセンティブごとのユーザによる購入傾向をモデル化した予測モデルを用いて、インセンティブを付与することによって生じる購入行動に関する事象を最適化するための確率を予測する情報解析装置」では、上記のような統合スコアは用いていなく、適切なインセンティブを決定することができなかったり、適切なインセンティブを簡易に決定することができなかったりすることがあった。これに対して、本実施形態では、簡易な手法で、適切なインセンティブを決定することができる。 In particular, in this embodiment, an integrated score obtained by subtracting the first score from the second score (using the integrated score and the threshold) can be used to determine incentives for the user. For example, another comparative example different from the above is a model of the purchasing tendency of users for each incentive generated based on the purchase history of users who have been given incentives and the background information of each user who has been given incentives. "Information analysis device that predicts the probability for optimizing events related to purchase behavior caused by giving incentives using a customized prediction model" does not use the above integrated score, and provides appropriate incentives. In some cases, they were unable to make decisions or easily determine appropriate incentives. On the other hand, in this embodiment, a suitable incentive can be determined by a simple method.

［シーケンス図］
図１６は、情報処理システム１により実行される処理の流れの一例を示すシーケンス図である。まず、情報処理装置２００が、学習装置１００から第１モデル２２４および第２モデル２２６を取得する（Ｓ１０）。次に、ユーザの端末装置Ｕが、ユーザの操作に応じて所定の画面（例えば商品購入の決済を行うための画面）を表示するためのリクエストをサービスサーバ１０に送信する（Ｓ１２）。 [Sequence Diagram]
FIG. 16 is a sequence diagram showing an example of the flow of processing executed by the information processing system 1. As shown in FIG. First, the information processing device 200 acquires the first model 224 and the second model 226 from the learning device 100 (S10). Next, the terminal device U of the user transmits a request to the service server 10 to display a predetermined screen (for example, a screen for making payment for purchasing a product) according to the user's operation (S12).

次に、サービスサーバ１０が、リクエストを取得したことに応じて、統合スコアの提供のリクエストを、情報処理装置２００に送信する（Ｓ１４）。Ｓ１４のリクエストには、例えば、ユーザの行動履歴が含まれている。次に、情報処理装置２００が、リクエストに応じて統合スコアを導出する（Ｓ１６）。情報処理装置２００は、ユーザの行動履歴から得たユーザベクトルを、上述したように第１モデル２２４および第２モデル２２６に入力して、第１モデル２２４が出力した第１スコアおよび第２モデル２２６が出力した第２スコアを統合して統合スコアを取得する。
情報処理装置２００は、統合スコアと閾値とに基づいて、ユーザにインセンティブを付与するか、ユーザに付与するインセンティブの内容を決定する（Ｓ１８）。 Next, the service server 10 transmits a request for providing an integrated score to the information processing device 200 in response to obtaining the request (S14). The request of S14 includes, for example, the user's action history. Next, the information processing device 200 derives an integrated score according to the request (S16). The information processing device 200 inputs the user vector obtained from the user's action history to the first model 224 and the second model 226 as described above, and converts the first score and the second model 226 output by the first model 224 into Integrate the second scores output by to obtain an integrated score.
The information processing apparatus 200 gives an incentive to the user or determines the content of the incentive to be given to the user based on the integrated score and the threshold (S18).

なお、Ｓ１４のリクエストには、ユーザＩＤが含まれ、情報処理装置２００は、ユーザ情報２２２を参照して、ユーザＩＤからユーザの行動履歴を取得してもよい。 Note that the request of S14 includes the user ID, and the information processing apparatus 200 may refer to the user information 222 and acquire the user's action history from the user ID.

次に、情報処理装置２００は、決定したインセンティブの情報をサービスサーバ１０に送信する（Ｓ２０）。次に、サービスサーバ１０は、送信されたインセンティブの情報に基づいて、ユーザに対するインセンティブを決定し、決定したインセンティブの情報を含むコンテンツを端末装置Ｕの表示部に表示させる（Ｓ２２）。例えば、インセンティブなしのユーザに対して、前述した図４に示すようなコンテンツが提供され、例えば、インセンティブありのユーザに対して、前述した図６に示すようなコンテンツが提供される。 Next, the information processing device 200 transmits information on the determined incentive to the service server 10 (S20). Next, the service server 10 determines an incentive for the user based on the transmitted incentive information, and causes the display unit of the terminal device U to display content including the determined incentive information (S22). For example, the content shown in FIG. 4 described above is provided to users without incentives, and the content shown in FIG. 6 described above is provided to users with incentives, for example.

上記のように、情報処理システム１は、ユーザに対する適切なインセンティブを決定し、決定したインセンティブの情報をユーザに提供することで、効率的にユーザに意図した行動の実施を促すことができる。 As described above, the information processing system 1 determines an appropriate incentive for the user and provides information on the determined incentive to the user, thereby efficiently prompting the user to perform the intended action.

以上説明した実施形態によれば、情報処理装置２００が、対象のユーザの特徴を第１モデルおよび第２モデルに入力し、第１モデルが出力した第１スコアと第２モデルが出力した第２スコアとを統合して得られた統合スコアに基づいて、対象のユーザに付与するインセンティブを決定することにより、ユーザごとにより適切なインセンティブを決定することができる。 According to the embodiment described above, the information processing apparatus 200 inputs the features of the target user to the first model and the second model, and the first score output by the first model and the second score output by the second model are input. By determining the incentive to be given to the target user based on the integrated score obtained by integrating the scores, it is possible to determine a more appropriate incentive for each user.

以上、本発明を実施するための形態について実施形態を用いて説明したが、本発明はこうした実施形態に何等限定されるものではなく、本発明の要旨を逸脱しない範囲内において種々の変形及び置換を加えることができる。 As described above, the mode for carrying out the present invention has been described using the embodiments, but the present invention is not limited to such embodiments at all, and various modifications and replacements can be made without departing from the scope of the present invention. can be added.

１‥情報処理システム
１００‥学習装置
１０２‥取得部
１０４‥第１学習部
１０６‥第２学習部
１０８‥提供部
１２２‥第１学習情報
１２４‥第１モデル
１２６‥第２学習情報
１２８‥第２モデル
２００‥情報処理装置
２０２‥情報管理部
２０４‥第１処理部
２０６‥第２処理部
２０８‥決定処理部
２２４‥第１モデル
２２６‥第２モデル 1 -- information processing system 100 -- learning device 102 -- acquisition unit 104 -- first learning unit 106 -- second learning unit 108 -- providing unit 122 -- first learning information 124 -- first model 126 -- second learning information 128 -- second Model 200... Information processing device 202... Information management unit 204... First processing unit 206... Second processing unit 208... Decision processing unit 224... First model 226... Second model

Claims

an acquisition unit that acquires characteristics of a target user;
The characteristics of the target user are input to the first model and the second model, and the integrated score obtained by integrating the first score output by the first model and the second score output by the second model a determination unit that determines an incentive to be given to the target user based on,
The first model is a model that outputs a score indicating the likelihood that the target user will perform the intended action when it is assumed that there is no incentive or that the user is given the first incentive,
The second model is a model that outputs a score indicating the likelihood that the target user will perform the intended action when it is assumed that a second incentive higher than the first incentive is given.
Information processing equipment.

The first model indicates that there is no incentive even if the user performs the intended action, or that the first incentive is given if the user performs the intended action. is a learned model generated by learning learning information in which information indicating that the is performed the intended action is associated with the characteristics of the user who performed the intended action,
The second model includes information indicating that the user has performed the intended action by indicating that the second incentive is given when the user performs the intended action, and information indicating that the user has performed the intended action. A trained model generated by learning learning information associated with the characteristics of the user who performed
The information processing device according to claim 1 .

In the first model, even if the user is shown that there is no incentive even if the user performs the intended action, the incentive is not given when the characteristics of the user who performed the intended action are input. output a first score indicating that there is a high tendency to perform the intended action even if the
The first model indicates that a first incentive is given when the user performs the intended action, and when the characteristics of the user who performed the intended action are input, the first incentive is given. Outputting a first score indicating that there is a high tendency to perform the intended action when given,
The second model indicates that a second incentive is given when the user performs the intended action, and when the characteristics of the user who performed the intended action are input, the second incentive is given. Outputting a second score indicating a high tendency to perform the intended action given
The information processing apparatus according to claim 2.

The determination unit determines an incentive to be given to the user based on an integrated score obtained by subtracting the first score from the second score.
The information processing apparatus according to any one of claims 1 to 3.

The features of the user are features based on the user's action history in the network,
The information processing apparatus according to any one of claims 1 to 4.

The user's action history is one or more of an action history in an e-commerce service, an action history in an auction service, or an action history in a search service,
The information processing device according to claim 5 .

The action history in the e-commerce service or the action history in the auction service includes information on the product purchased by the user or the amount of money required for the purchase,
The information processing device according to claim 6 .

The intended action is card enrollment, questionnaire response, member registration, or application program installation.
The information processing apparatus according to any one of claims 1 to 7.

Incentives are discounts, awarding of points, or the granting of coupons;
The information processing apparatus according to any one of claims 1 to 8.

an information processing apparatus according to any one of claims 1 to 9;
a service server that provides services to the user in cooperation with the user's terminal device,
The information processing device provides the service server with the content of the determined incentive for the user,
The service server causes the display unit of the terminal device to display the provided content based on the incentive;
Information processing system.

an acquisition unit that acquires the first learning information and the second learning information;
a first learning unit that learns the first learning information to generate a first model;
a second learning unit that learns the second learning information to generate a second model;
The first learning information indicates that there is no incentive even if the user performs the intended action, or indicates that the first incentive is given if the user performs the intended action, and the user Information in which information indicating that the intended action was performed is associated with characteristics of the user who performed the intended action,
In the first model, even if the user is shown that there is no incentive even if the user performs the intended action, the incentive is not given when the characteristics of the user who performed the intended action are input. A model trained to output a first score indicating that there is a high tendency to perform the intended action even if the
The first model indicates that a first incentive is given when the user performs the intended action, and when the characteristics of the user who performed the intended action are input, the first incentive is given. A model trained to output a first score indicating a high tendency to perform the intended action when given,
The second learning information includes information indicating that the user has performed the intended action by indicating that a second incentive will be given when the user performs the intended action, and information indicating that the user has performed the intended action. Information associated with the characteristics of the user who performed,
The second model indicates that a second incentive is given when the user performs the intended action, and when the characteristics of the user who performed the intended action are input, the second incentive is given. A model trained to output a second score that indicates a high tendency to perform the intended action when given
Information processing equipment.

the computer
Get the characteristics of the target user,
The characteristics of the target user are input to the first model and the second model, and the integrated score obtained by integrating the first score output by the first model and the second score output by the second model Based on, determine the incentive to be given to the target user,
The first model is a model that outputs a score indicating the likelihood that the target user will perform the intended action when it is assumed that there is no incentive or that the user is given the first incentive,
The second model is a model that outputs a score indicating the likelihood that the target user will perform the intended action when it is assumed that a second incentive higher than the first incentive is given.
Information processing methods.

to the computer,
Acquire the characteristics of the target user,
The characteristics of the target user are input to the first model and the second model, and the integrated score obtained by integrating the first score output by the first model and the second score output by the second model Based on, determine the incentive to be given to the target user,
The first model is a model that outputs a score indicating the likelihood that the target user will perform the intended action when it is assumed that there is no incentive or that the user is given the first incentive,
The second model is a model that outputs a score indicating the likelihood that the target user will perform the intended action when it is assumed that a second incentive higher than the first incentive is given.
program.