JP7096093B2

JP7096093B2 - Learning equipment, learning methods and learning programs

Info

Publication number: JP7096093B2
Application number: JP2018140429A
Authority: JP
Inventors: 伸裕鍜治
Original assignee: Yahoo Japan Corp
Current assignee: Yahoo Japan Corp
Priority date: 2018-07-26
Filing date: 2018-07-26
Publication date: 2022-07-05
Anticipated expiration: 2038-07-26
Also published as: JP2020017126A

Description

本発明は、学習装置、学習方法及び学習プログラムに関する。 The present invention relates to a learning device, a learning method and a learning program.

従来、各種の情報を利用者に対して提供する技術が知られている。このような技術の一例として、利用者が過去に選択した情報と関連する情報に基づいて、利用者と関連性が高い情報や利用者が興味を有する情報等、利用者と対応する情報を推定し、推定した情報を利用者に提供する技術が知られている。 Conventionally, techniques for providing various types of information to users have been known. As an example of such a technique, based on the information related to the information selected by the user in the past, the information corresponding to the user such as the information highly related to the user or the information that the user is interested in is estimated. However, there are known techniques for providing estimated information to users.

特開２０１２－１５０５６１号公報Japanese Unexamined Patent Publication No. 2012-150561

しかしながら、上述した従来技術では、利用者と対応する情報を適切に推定できない恐れがある。 However, with the above-mentioned conventional technique, there is a possibility that the information corresponding to the user cannot be estimated appropriately.

例えば、上述した従来技術では、新規の利用者等、過去に情報を選択していない利用者に対しては、利用者と対応する情報を適切に推定できない。 For example, in the above-mentioned conventional technique, it is not possible to appropriately estimate information corresponding to a user for a user who has not selected information in the past, such as a new user.

本願は、上記に鑑みてなされたものであって、利用者と対応する情報の推定精度を向上させることを目的とする。 The present application has been made in view of the above, and an object thereof is to improve the estimation accuracy of the information corresponding to the user.

本願に係る学習装置は、配信対象を示すベクトルであって、当該配信対象に含まれる情報と対応する次元に所定の値が格納された配信対象ベクトルと、利用者を示す利用者ベクトルとを取得する取得部と、前記配信対象ベクトルと前記利用者ベクトルとに基づいて、当該配信対象ベクトルと対応する配信対象が、当該利用者ベクトルと対応する利用者と対応するか否かを判定するモデルを学習する学習部とを有することを特徴とする。 The learning device according to the present application is a vector indicating a distribution target, and acquires a distribution target vector in which a predetermined value is stored in a dimension corresponding to the information included in the distribution target and a user vector indicating a user. Based on the acquisition unit, the distribution target vector, and the user vector, a model for determining whether or not the distribution target corresponding to the distribution target vector corresponds to the user corresponding to the user vector is determined. It is characterized by having a learning unit for learning.

実施形態の一態様によれば、利用者と対応する情報の推定精度を向上させることができるという効果を奏する。 According to one aspect of the embodiment, there is an effect that the estimation accuracy of the information corresponding to the user can be improved.

図１は、実施形態に係るネットワークシステムの構成例を示す図である。FIG. 1 is a diagram showing a configuration example of a network system according to an embodiment. 図２は、実施形態に係る学習処理の一例を示す図である。FIG. 2 is a diagram showing an example of a learning process according to an embodiment. 図３は、実施形態に係る学習装置の構成例を示す図である。FIG. 3 is a diagram showing a configuration example of the learning device according to the embodiment. 図４は、実施形態に係る訓練データ記憶部の一例を示す図である。FIG. 4 is a diagram showing an example of a training data storage unit according to an embodiment. 図５は、実施形態に係る学習装置による学習処理手順を示すフローチャートである。FIG. 5 is a flowchart showing a learning processing procedure by the learning device according to the embodiment. 図６は、ハードウェア構成の一例を示す図である。FIG. 6 is a diagram showing an example of a hardware configuration.

以下に、本願に係る学習装置、学習方法及び学習プログラムを実施するための形態（以下、「実施形態」と呼ぶ）について図面を参照しつつ詳細に説明する。なお、この実施形態により本願に係る学習装置、学習方法及び学習プログラムが限定されるものではない。また、各実施形態は、処理内容を矛盾させない範囲で適宜組み合わせることが可能である。また、以下の各実施形態において同一の部位には同一の符号を付し、重複する説明は省略する。 Hereinafter, the learning device, the learning method, and the embodiment for implementing the learning program (hereinafter referred to as “the embodiment”) according to the present application will be described in detail with reference to the drawings. It should be noted that this embodiment does not limit the learning device, learning method and learning program according to the present application. In addition, each embodiment can be appropriately combined as long as the processing contents do not contradict each other. Further, in each of the following embodiments, the same parts are designated by the same reference numerals, and duplicate description will be omitted.

〔１．ネットワークシステムの構成〕
まず、図１を参照して、実施形態に係るネットワークシステム１の構成について説明する。図１は、実施形態に係るネットワークシステム１の構成例を示す図である。図１に示すように、実施形態に係るネットワークシステム１には、端末装置１０と、提供装置２０と、学習装置１００とが含まれる。端末装置１０、提供装置２０および学習装置１００は、それぞれネットワークＮと有線又は無線により接続される。 [1. Network system configuration]
First, the configuration of the network system 1 according to the embodiment will be described with reference to FIG. FIG. 1 is a diagram showing a configuration example of the network system 1 according to the embodiment. As shown in FIG. 1, the network system 1 according to the embodiment includes a terminal device 10, a providing device 20, and a learning device 100. The terminal device 10, the providing device 20, and the learning device 100 are connected to the network N by wire or wirelessly, respectively.

図１中では図示していないが、ネットワークシステム１は、端末装置１０_１～１０_ｎを含んでもよい。本明細書では、端末装置１０_１～１０_ｎを区別する必要がない場合は、端末装置１０_１～１０_ｎを「端末装置１０」と総称する。また、ネットワークシステム１は、複数台の提供装置２０や、複数台の学習装置１００を含んでもよい。 Although not shown in FIG. 1, the network system 1 may include terminal devices 10 ₁ to 10 _n . In the present specification, when it is not necessary to distinguish the terminal devices 10 ₁ to 10 _n , the terminal devices 10 ₁ to 10 _n are collectively referred to as "terminal device 10". Further, the network system 1 may include a plurality of providing devices 20 and a plurality of learning devices 100.

端末装置１０は、ユーザによって利用される情報処理装置である。端末装置１０は、スマートフォン、デスクトップ型ＰＣ（Personal Computer）、ノート型ＰＣ、タブレット型ＰＣ、ＰＤＡ（Personal Digital Assistant）を含む、任意のタイプの情報処理装置であってもよい。 The terminal device 10 is an information processing device used by the user. The terminal device 10 may be any type of information processing device including a smartphone, a desktop PC (Personal Computer), a notebook PC, a tablet PC, and a PDA (Personal Digital Assistant).

提供装置２０は、端末装置１０および学習装置１００に、各種情報を提供するサーバ装置である。例えば、提供装置２０は、端末装置１０に対して、コンテンツ（例えば、プッシュ通知）を提供する。また、例えば、提供装置２０は、学習装置１００に対して、コンテンツに関連する判定モデルを生成するための訓練データを提供する。 The providing device 20 is a server device that provides various information to the terminal device 10 and the learning device 100. For example, the providing device 20 provides content (for example, push notification) to the terminal device 10. Further, for example, the providing device 20 provides the learning device 100 with training data for generating a determination model related to the content.

学習装置１００は、ユーザに適合する情報を判定するモデルを学習するサーバ装置である。学習装置１００は、ネットワークＮを介して、有線又は無線により端末装置１０および提供装置２０と通信を行う。 The learning device 100 is a server device that learns a model for determining information suitable for a user. The learning device 100 communicates with the terminal device 10 and the providing device 20 by wire or wirelessly via the network N.

〔２．学習処理〕
次に、図２を参照して、実施形態に係る学習処理の一例について説明する。図２は、実施形態に係る学習処理の一例を示す図である。 [2. Learning process]
Next, an example of the learning process according to the embodiment will be described with reference to FIG. FIG. 2 is a diagram showing an example of a learning process according to an embodiment.

図２の例では、提供装置２０は、「ユーザＵ１」によって利用される端末装置１０_１に対して、プッシュ型で「通知Ｎ１」を送信する。「プッシュ型」という語は、情報が、情報を提供する装置のトリガに基づいて送信される仕組みを指す。言い換えると、提供装置２０は、端末装置１０_１に、通知Ｎ１を、プッシュ通知として送信する。 In the example of FIG. 2, the providing device 20 transmits the “notification N1” in _a push type to the terminal device 101 used by the “user U1”. The term "push" refers to the mechanism by which information is transmitted based on the trigger of the device that provides the information. In other words, the providing device 20 transmits the notification N1 to the terminal device 10 ₁ as a push notification.

図２の例では、提供装置２０は、通知Ｎ１に対する応答を受信する。一例では、提供装置２０は、通知Ｎ１が選択されたことを示す選択情報を、端末装置１０_１から受信する。この場合、通知Ｎ１は、ユーザＵ１に適合すると考えられる。別の例では、提供装置２０は、通知Ｎ１が選択されなかったことを示す選択情報を、端末装置１０_１から受信する。この場合、通知Ｎ１は、ユーザＵ１に適合しないと考えられる。 In the example of FIG. 2, the providing device 20 receives a response to the notification N1. In _one example, the providing device 20 receives selection information from the terminal device 101 indicating that the notification N1 has been selected. In this case, the notification N1 is considered to be suitable for the user U1. In _another example, the providing device 20 receives selection information from the terminal device 101 indicating that the notification N1 has not been selected. In this case, the notification N1 is considered to be incompatible with the user U1.

図２の例では、提供装置２０は、訓練データを、学習装置１００に送信する。訓練データは、ユーザＵ１の属性情報ＡＩ１（例えば、ユーザＵ１のデモグラフィック属性）と、通知Ｎ１の通知情報ＮＩ１（例えば、通知Ｎ１のタイトル）と、ユーザＵ１が通知Ｎ１を選択したか否かを示す選択情報ＳＩ１（例えば、選択フラグ）とを含む。 In the example of FIG. 2, the providing device 20 transmits the training data to the learning device 100. The training data includes the attribute information AI1 of the user U1 (for example, the demographic attribute of the user U1), the notification information NI1 of the notification N1 (for example, the title of the notification N1), and whether or not the user U1 has selected the notification N1. The selection information SI1 (for example, a selection flag) is included.

図２の例では、学習装置１００は、提供装置２０から受信された訓練データを用いて、所定の通知がユーザに適合するか否かを判定する順伝播型ニューラルネットを学習する。より具体的には、学習装置１００は、ユーザＵ１の属性情報ＡＩ１と通知Ｎ１の通知情報ＮＩ１とを、訓練データの入力として用いる。また、学習装置１００は、通知Ｎ１が選択されたか否か示す選択情報ＳＩ１を、訓練データの出力として用いる。 In the example of FIG. 2, the learning device 100 uses the training data received from the providing device 20 to learn a forward propagation neural network that determines whether or not a predetermined notification is suitable for the user. More specifically, the learning device 100 uses the attribute information AI1 of the user U1 and the notification information NI1 of the notification N1 as input of training data. Further, the learning device 100 uses the selection information SI1 indicating whether or not the notification N1 is selected as the output of the training data.

図２の例では、ユーザＵ１の属性情報ＡＩ１は、ユーザＵ１に対応するワンホットベクトル（one hot vector）ＯＶ１を含む。ワンホットベクトルＯＶ１の複数の成分のうちゼロでない１つの成分は、ユーザＵ１のユーザＩＤに対応する。一例では、ワンホットベクトルＯＶ１の次元数は、ユーザＩＤの数と同一である。また、ユーザＵ１の属性情報ＡＩ１は、ワンホットベクトルＯＶ１に対応する埋め込みベクトルＵＥ１を含む。一例では、埋め込みベクトルＵＥ１は、ユーザＵ１のデモグラフィック属性に対応する。例えば、ユーザＵ１の属性が「３０代」の場合に、埋め込みベクトルＵＥ１は、ユーザ属性「３０代」に対応する分散表現である。 In the example of FIG. 2, the attribute information AI1 of the user U1 includes a one hot vector OV1 corresponding to the user U1. One non-zero component of the plurality of components of the one-hot vector OV1 corresponds to the user ID of the user U1. In one example, the number of dimensions of the one-hot vector OV1 is the same as the number of user IDs. Further, the attribute information AI1 of the user U1 includes an embedded vector UE1 corresponding to the one-hot vector OV1. In one example, the embedded vector UE1 corresponds to the demographic attribute of user U1. For example, when the attribute of the user U1 is "30s", the embedded vector UE1 is a distributed expression corresponding to the user attribute "30s".

図２の例では、通知Ｎ１の通知情報ＮＩ１は、ＢｏＷベクトル（bag of words vector）ＢＶ１を含む。ＢｏＷベクトルＢＶ１の複数の成分のうちゼロでない少なくとも１つの成分は、通知Ｎ１の通知情報ＮＩ１に含まれる少なくとも１つの単語に対応する。また、通知情報ＮＩ１は、ＢｏＷベクトルＢＶ１に対応する埋め込みベクトルＮＥ１を含む。一例では、ＢｏＷベクトルＢＶ１は、通知Ｎ１のタイトルに対応する。例えば、通知Ｎ１のタイトルが、単語「今夜」、「映画」、「テレビ」を含む場合に、埋め込みベクトルＮＥ１は、単語「今夜」、「映画」、「テレビ」にそれぞれ対応する３つの分散表現の平均ベクトルである。 In the example of FIG. 2, the notification information NI1 of the notification N1 includes a BoW vector (bag of words vector) BV1. At least one non-zero component of the plurality of components of the BoW vector BV1 corresponds to at least one word contained in the notification information NI1 of the notification N1. Further, the notification information NI1 includes an embedded vector NE1 corresponding to the BoW vector BV1. In one example, the BoW vector BV1 corresponds to the title of notification N1. For example, if the title of notification N1 includes the words "tonight", "movie", and "television", the embedded vector NE1 has three distributed representations corresponding to the words "tonight", "movie", and "television", respectively. Is the average vector of.

図２の例では、通知Ｎ１が選択されたか否か示す選択情報ＳＩ１は、０または１の選択フラグを含む。一例では、通知Ｎ１が選択されたことを示す選択フラグは、「１」である。この例では、通知Ｎ１が選択されなかったことを示す選択フラグは、「０」である。 In the example of FIG. 2, the selection information SI1 indicating whether or not the notification N1 is selected includes a selection flag of 0 or 1. In one example, the selection flag indicating that notification N1 has been selected is "1". In this example, the selection flag indicating that notification N1 was not selected is "0".

はじめに、学習装置１００は、提供装置２０から受信された訓練データを参照し、ユーザＵ１のワンホットベクトルＯＶ１に対応する埋め込みベクトルＵＥ１を取得する。 First, the learning device 100 refers to the training data received from the providing device 20 and acquires the embedded vector UE1 corresponding to the one-hot vector OV1 of the user U1.

次いで、学習装置１００は、訓練データを参照し、通知Ｎ１のＢｏＷベクトルＢＶ１に対応する埋め込みベクトルＮＥ１を取得する。 Next, the learning device 100 refers to the training data and acquires the embedded vector NE1 corresponding to the BoW vector BV1 of the notification N1.

次いで、学習装置１００は、埋め込みベクトルＵＥ１と埋め込みベクトルＮＥ１とを連結（concatenate）することで、ベクトルＣＶ１を生成する。埋め込みベクトルＵＥ１が次元ｎ（ｎは、任意の自然数）を持ち、埋め込みベクトルＮＥ１が次元ｍ（ｍは、任意の自然数）を持つ場合に、生成されたベクトルＣＶ１は、次元（ｎ＋ｍ）を持つ。この場合、ベクトルＣＶ１は、（ｎ＋ｍ）次元の分散表現である。 Next, the learning device 100 concatenates the embedded vector UE1 and the embedded vector NE1 to generate the vector CV1. When the embedded vector UE1 has a dimension n (n is an arbitrary natural number) and the embedded vector NE1 has a dimension m (m is an arbitrary natural number), the generated vector CV1 has a dimension (n + m). In this case, the vector CV1 is a (n + m) dimensional distributed representation.

次いで、学習装置１００は、第１の重み行列にベクトルＣＶ１を乗ずることで、ベクトル（図示せず）を算出する。そして、学習装置１００は、算出されたベクトルを活性化関数（例えば、ＲｅＬＵ（Rectified Linear unit）関数）に代入することで、隠れ状態ベクトルＨＶ１を生成する。 Next, the learning device 100 calculates a vector (not shown) by multiplying the first weight matrix by the vector CV1. Then, the learning device 100 generates the hidden state vector HV1 by substituting the calculated vector into the activation function (for example, the ReLU (Rectified Linear unit) function).

次いで、学習装置１００は、第２の重み行列に隠れ状態ベクトルＨＶ１を乗ずることで、関連度ＲＳ１を算出する。 Next, the learning device 100 calculates the relevance degree RS1 by multiplying the second weight matrix by the hidden state vector HV1.

次いで、学習装置１００は、選択情報ＳＩ１に含まれる選択フラグの値と、算出された関連度ＲＳ１とを用いて、所定の通知がユーザＵ１に適合するか否かを判定する順伝播型ニューラルネットを学習する。例えば、学習装置１００は、誤差逆伝播法を用いて、選択情報ＳＩ１に含まれる選択フラグの値および算出された関連度ＲＳ１に対応する目的関数（例えば、交差エントロピー損失関数）が最小化されるように、順伝播型ニューラルネットを学習する。 Next, the learning device 100 uses the value of the selection flag included in the selection information SI1 and the calculated relevance degree RS1 to determine whether or not the predetermined notification conforms to the user U1. To learn. For example, the learning device 100 uses an error backpropagation method to minimize the value of the selection flag included in the selection information SI1 and the objective function (for example, the cross entropy loss function) corresponding to the calculated relevance degree RS1. As you can see, we learn a forward propagation neural network.

通知Ｎ１以外の通知についても、学習装置１００は、同様の学習処理で、順伝播型ニューラルネットを学習することができる。学習装置１００は、かかる学習処理を繰り返すことで、汎化誤差が最小化されるように、順伝播型ニューラルネットを学習することができる。 For notifications other than the notification N1, the learning device 100 can learn the forward propagation type neural network by the same learning process. The learning device 100 can learn the forward propagation type neural network so that the generalization error is minimized by repeating the learning process.

その後、学習装置１００は、学習された順伝播型ニューラルネットを用いて、ユーザＵ１と通知との間の関連性を示す関連性スコアを出力する。より具体的には、学習装置１００は、ユーザＵ１に対応する埋め込みベクトルＵＥ１と、所定の通知に対応する埋め込みベクトルとを、判定モデルに入力することで、ユーザＵ１が所定の通知を選択する事後確率を、ユーザＵ１と通知との間の関連性スコアとして出力する。 After that, the learning device 100 uses the learned forward propagation neural network to output a relevance score indicating the relevance between the user U1 and the notification. More specifically, the learning device 100 inputs the embedded vector UE1 corresponding to the user U1 and the embedded vector corresponding to the predetermined notification into the determination model, so that the user U1 selects the predetermined notification. The probability is output as a relevance score between the user U1 and the notification.

上述のように、実施形態に係る学習装置１００は、ユーザに対応する埋め込みベクトルと、通知に対応する埋め込みベクトルとを、訓練データの入力として用いる。また、学習装置１００は、通知の選択フラグの値を、訓練データの出力として用いる。そして、学習装置１００は、汎化誤差が最小化されるように、順伝播型ニューラルネットを学習する。これにより、学習装置１００は、所定の通知がユーザに適合するか否かを判定する判定モデルを学習することができる。 As described above, the learning device 100 according to the embodiment uses the embedded vector corresponding to the user and the embedded vector corresponding to the notification as input of training data. Further, the learning device 100 uses the value of the selection flag of the notification as the output of the training data. Then, the learning device 100 learns the forward propagation type neural network so that the generalization error is minimized. As a result, the learning device 100 can learn a determination model for determining whether or not a predetermined notification is suitable for the user.

ところで、図２の例では、提供装置２０は、端末装置１０に、通知をプッシュ通知として送信している。一般的に、プッシュ通知は、季節性のある情報である。例えば、提供装置２０は、速報性の高い情報（例えば、今夜の番組）を１回だけプッシュ通知として送信する。したがって、このようなプッシュ通知に対応する配信履歴が存在しない場合がある。このような状況は、コールドスタート問題（cold start problem）に関連する。この点に関し、プッシュ通知に対応する配信履歴が存在しない場合でも、学習装置１００は、上述の判定モデルを用いて、プッシュ通知がユーザに適合するか否かを判定することができる。このため、学習装置１００は、ユーザが、ユーザが関心のあるプッシュ通知をタイムリーに受け取ることを可能にする。以下、このような学習処理を実現する学習装置１００について詳細に説明する。 By the way, in the example of FIG. 2, the providing device 20 transmits the notification to the terminal device 10 as a push notification. In general, push notifications are seasonal information. For example, the providing device 20 transmits information with high breaking news (for example, tonight's program) only once as a push notification. Therefore, there may be no delivery history corresponding to such push notifications. Such a situation is associated with a cold start problem. In this regard, the learning device 100 can determine whether or not the push notification is suitable for the user by using the above-mentioned determination model even when the distribution history corresponding to the push notification does not exist. Therefore, the learning device 100 enables the user to receive push notifications of interest to the user in a timely manner. Hereinafter, the learning device 100 that realizes such a learning process will be described in detail.

〔３．学習装置の構成〕
次に、図３を参照して、実施形態に係る学習装置１００の構成例について説明する。図３は、実施形態に係る学習装置１００の構成例を示す図である。図３に示すように、学習装置１００は、通信部１１０と、記憶部１２０と、制御部１３０とを有する。なお、学習装置１００は、学習装置１００を利用する管理者等から各種操作を受け付ける入力部（例えば、キーボードやマウス等）や、各種情報を表示するための表示部（液晶ディスプレイ等）を有してもよい。 [3. Configuration of learning device]
Next, a configuration example of the learning device 100 according to the embodiment will be described with reference to FIG. FIG. 3 is a diagram showing a configuration example of the learning device 100 according to the embodiment. As shown in FIG. 3, the learning device 100 includes a communication unit 110, a storage unit 120, and a control unit 130. The learning device 100 has an input unit (for example, a keyboard, a mouse, etc.) that receives various operations from an administrator or the like who uses the learning device 100, and a display unit (liquid crystal display, etc.) for displaying various information. You may.

（通信部１１０）
通信部１１０は、例えば、ＮＩＣ（Network Interface Card）等によって実現される。通信部１１０は、ネットワーク網と有線又は無線により接続され、ネットワーク網を介して、端末装置１０および提供装置２０との間で情報の送受信を行う。 (Communication unit 110)
The communication unit 110 is realized by, for example, a NIC (Network Interface Card) or the like. The communication unit 110 is connected to the network network by wire or wirelessly, and transmits / receives information to / from the terminal device 10 and the providing device 20 via the network network.

（記憶部１２０）
記憶部１２０は、例えば、ＲＡＭ（Random Access Memory)、フラッシュメモリ（Flash Memory）等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。図３に示すように、記憶部１２０は、訓練データ記憶部１２１と、判定モデル記憶部１２２とを有する。 (Memory unit 120)
The storage unit 120 is realized by, for example, a semiconductor memory element such as a RAM (Random Access Memory) or a flash memory (Flash Memory), or a storage device such as a hard disk or an optical disk. As shown in FIG. 3, the storage unit 120 has a training data storage unit 121 and a determination model storage unit 122.

（訓練データ記憶部１２１）
図４は、実施形態に係る訓練データ記憶部１２１の一例を示す図である。訓練データ記憶部１２１は、コンテンツに関連する予測モデルを生成するための訓練データを記憶する。予測モデルは、例えば、所定の通知がユーザに適合するか否かを判定する判定モデルである。一例では、判定モデルは、順伝播型ニューラルネットである。 (Training data storage unit 121)
FIG. 4 is a diagram showing an example of the training data storage unit 121 according to the embodiment. The training data storage unit 121 stores training data for generating a prediction model related to the content. The prediction model is, for example, a determination model for determining whether or not a predetermined notification is suitable for the user. In one example, the decision model is a forward propagation neural network.

訓練データ記憶部１２１は、例えば、受信部１３１によって受信された訓練データを記憶する。図４の例では、訓練データ記憶部１２１には、「訓練データ」が「ユーザＩＤ」ごとに記憶される。例示として、「訓練データ」には、項目「属性情報」、「通知ＩＤ」、「通知情報」および「選択情報」が含まれる。 The training data storage unit 121 stores, for example, the training data received by the reception unit 131. In the example of FIG. 4, "training data" is stored in the training data storage unit 121 for each "user ID". By way of example, the "training data" includes the items "attribute information", "notification ID", "notification information" and "selection information".

「ユーザＩＤ」は、ユーザを識別するための識別子を示す。「属性情報」は、ユーザの属性に関する情報を示す。ユーザの属性は、例えば、ユーザのデモグラフィック属性や、サイコグラフィック属性等である。 The "user ID" indicates an identifier for identifying a user. "Attribute information" indicates information about the user's attributes. User attributes are, for example, user demographic attributes, psychographic attributes, and the like.

属性情報は、ユーザに対応するワンホットベクトルを含む。ワンホットベクトルの複数の成分のうちゼロでない１つの成分が、ユーザのユーザＩＤに対応する。一例では、ワンホットベクトルの次元数は、ユーザＩＤの数と同一である。また、属性情報は、ワンホットベクトルに対応する埋め込みベクトル（例えば、分散表現）を含む。埋め込みベクトルは、例えば、ユーザのデモグラフィック属性や、サイコグラフィック属性等に対応する。例えば、ユーザの属性が「女性」の場合に、埋め込みベクトルは、ユーザ属性「女性」に対応する分散表現である。ユーザ属性に対応する分散表現は、所定のコーパスと、ニューラル言語モデルによって獲得され得る。 The attribute information includes a one-hot vector corresponding to the user. One non-zero component of the plurality of components of the one-hot vector corresponds to the user ID of the user. In one example, the number of dimensions of the one-hot vector is the same as the number of user IDs. Further, the attribute information includes an embedded vector (for example, a distributed representation) corresponding to the one-hot vector. The embedded vector corresponds to, for example, a user's demographic attribute, a psychographic attribute, and the like. For example, when the user attribute is "female", the embedded vector is a distributed representation corresponding to the user attribute "female". The distributed representation corresponding to the user attribute can be acquired by a predetermined corpus and a neural language model.

「通知ＩＤ」は、通知を識別するための識別子を示す。「通知情報」は、通知の内容に関する情報を示す。通知の内容は、例えば、通知のタイトルや、通知のボディ（例えば、本文のテキスト）や、通知が選択された場合のアクション（例えば、リンク先への画面遷移）等である。 The "notification ID" indicates an identifier for identifying the notification. "Notification information" indicates information regarding the content of the notification. The content of the notification is, for example, the title of the notification, the body of the notification (for example, the text of the text), the action when the notification is selected (for example, the screen transition to the link destination), and the like.

通知情報は、通知に対応するＢｏＷベクトル（ＢｏＷ表現とも呼ばれる）を含む。ＢｏＷベクトルの複数の成分のうちゼロでない少なくとも１つの成分は、通知の通知情報に含まれる少なくとも１つの単語に対応する。また、通知情報は、ＢｏＷベクトルに対応する埋め込みベクトルを含む。一例では、ＢｏＷベクトルは、通知のタイトルに対応する。例えば、通知のタイトルが「ジムに行く？」場合に、埋め込みベクトルは、単語「ジム」、「行く」にそれぞれ対応する２つの分散表現の平均ベクトルである。 The notification information includes a BoW vector (also referred to as a BoW representation) corresponding to the notification. At least one non-zero component of the plurality of components of the BoW vector corresponds to at least one word contained in the notification information of the notification. Further, the notification information includes an embedded vector corresponding to the Bow vector. In one example, the BoW vector corresponds to the title of the notification. For example, if the title of the notification is "Go to Jim?", The embedded vector is the average vector of the two distributed representations corresponding to the words "Jim" and "Go" respectively.

「選択情報」は、通知の選択に関する情報を示す。例えば、選択情報は、通知が選択されたか否か示す。例えば、選択情報は、通知が選択されたか否か示す選択フラグを含む。図４の例では、通知が選択されたことを示す選択フラグは、「１」である。この例では、通知が選択されなかったことを示す選択フラグは、「０」である。 "Selection information" indicates information regarding the selection of notifications. For example, the selection information indicates whether or not the notification has been selected. For example, the selection information includes a selection flag indicating whether or not the notification was selected. In the example of FIG. 4, the selection flag indicating that the notification has been selected is "1". In this example, the selection flag indicating that the notification was not selected is "0".

例えば、図４は、ユーザＩＤ「Ｕ１」で識別されるユーザが、属性情報ＡＩ１に対応することを示している。また、例えば、図４は、通知ＩＤ「Ｎ１」で識別される通知が、通知情報ＮＩ１に対応することを示している。また、例えば、図４は、ユーザＩＤ「Ｕ１」で識別されるユーザが、通知ＩＤ「Ｎ１」で識別される通知を選択したことを示している。 For example, FIG. 4 shows that the user identified by the user ID “U1” corresponds to the attribute information AI1. Further, for example, FIG. 4 shows that the notification identified by the notification ID “N1” corresponds to the notification information NI1. Further, for example, FIG. 4 shows that the user identified by the user ID “U1” has selected the notification identified by the notification ID “N1”.

（判定モデル記憶部１２２）
図３に戻ると、判定モデル記憶部１２２は、配信対象に関連する判定モデルを記憶する。判定モデル記憶部１２２は、例えば、学習部１３３によって学習されたモデルであって、配信対象ベクトルと対応する配信対象が、利用者ベクトルと対応する利用者と対応するか否かを判定するモデルを記憶する。 (Judgment model storage unit 122)
Returning to FIG. 3, the determination model storage unit 122 stores the determination model related to the distribution target. The determination model storage unit 122 is, for example, a model learned by the learning unit 133, and determines whether or not the distribution target corresponding to the distribution target vector corresponds to the user corresponding to the user vector. Remember.

（制御部１３０）
制御部１３０は、コントローラ（controller）であり、例えば、ＣＰＵ（Central Processing Unit）、ＭＰＵ（Micro Processing Unit）等のプロセッサによって、学習装置１００内部の記憶装置に記憶されている各種プログラムがＲＡＭ等を作業領域として実行されることにより実現される。また、制御部１３０は、コントローラ（controller）であり、例えば、ＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）等の集積回路により実現されてもよい。 (Control unit 130)
The control unit 130 is a controller, and for example, various programs stored in a storage device inside the learning device 100 by a processor such as a CPU (Central Processing Unit) or an MPU (Micro Processing Unit) store a RAM or the like. It is realized by being executed as a work area. Further, the control unit 130 is a controller, and may be realized by, for example, an integrated circuit such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array).

（受信部１３１）
受信部１３１は、提供装置２０から、コンテンツに関連する判定モデルを生成するための訓練データを受信する。訓練データは、ユーザの属性情報と、通知の通知情報と、ユーザが通知を選択したか否かを示す選択情報とを含む。受信部１３１は、受信された訓練データを、訓練データ記憶部１２１に格納してもよい。 (Receiver 131)
The receiving unit 131 receives training data from the providing device 20 for generating a determination model related to the content. The training data includes the user's attribute information, the notification information of the notification, and the selection information indicating whether or not the user has selected the notification. The receiving unit 131 may store the received training data in the training data storage unit 121.

受信部１３１は、提供装置２０から、配信対象を示すベクトルであって、かかる配信対象に含まれる情報と対応する次元に所定の値が格納された配信対象ベクトルを受信する。例えば、受信部１３１は、配信対象となる文章に含まれる単語と対応する次元に所定の値が格納された配信対象ベクトルを受信する。一例では、受信部１３１は、通知に対応する埋め込みベクトルを受信する。上述の通知情報は、配信対象を示すベクトルであって、かかる配信対象に含まれる情報と対応する次元に所定の値が格納された配信対象ベクトルを含んでもよい。受信部１３１は、受信された配信対象ベクトルを、訓練データ記憶部１２１に格納してもよい。受信部１３１は、配信対象に含まれる情報を受信してもよい。 The receiving unit 131 receives from the providing device 20 a distribution target vector that is a vector indicating a distribution target and in which a predetermined value is stored in a dimension corresponding to the information included in the distribution target. For example, the receiving unit 131 receives a distribution target vector in which a predetermined value is stored in a dimension corresponding to a word included in a sentence to be distributed. In one example, the receiving unit 131 receives the embedded vector corresponding to the notification. The above-mentioned notification information is a vector indicating a distribution target, and may include a distribution target vector in which a predetermined value is stored in a dimension corresponding to the information included in the distribution target. The receiving unit 131 may store the received distribution target vector in the training data storage unit 121. The receiving unit 131 may receive the information included in the distribution target.

上述の配信対象は、例えば、通知（例えば、プッシュ通知）、ニュース記事、テキストベースのメッセージ（例えば、レコメント情報）等である。配信対象がプッシュ通知である場合に、上述の配信対象ベクトルは、例えば、プッシュ通知のタイトルに含まれる単語と対応する次元に所定の値が格納されたベクトル（例えば、分散表現）である。 The above-mentioned distribution target is, for example, a notification (for example, push notification), a news article, a text-based message (for example, recomment information), or the like. When the delivery target is a push notification, the above-mentioned delivery target vector is, for example, a vector (for example, a distributed representation) in which a predetermined value is stored in a dimension corresponding to a word included in the title of the push notification.

受信部１３１は、提供装置２０から、利用者を示す利用者ベクトルを受信する。例えば、受信部１３１は、利用者が有する属性と対応する次元に所定の値が格納された利用者ベクトルを受信する。一例では、受信部１３１は、ユーザに対応する埋め込みベクトルを受信する。上述の属性情報は、利用者を示す利用者ベクトルを含んでもよい。受信部１３１は、受信された利用者ベクトルを、訓練データ記憶部１２１に格納してもよい。受信部１３１は、利用者が有する属性に関する情報を受信してもよい。 The receiving unit 131 receives a user vector indicating a user from the providing device 20. For example, the receiving unit 131 receives a user vector in which a predetermined value is stored in a dimension corresponding to an attribute possessed by the user. In one example, the receiving unit 131 receives the embedded vector corresponding to the user. The above-mentioned attribute information may include a user vector indicating a user. The receiving unit 131 may store the received user vector in the training data storage unit 121. The receiving unit 131 may receive information regarding the attributes possessed by the user.

（取得部１３２）
取得部１３２は、コンテンツに関連する判定モデルを生成するための訓練データを取得する。取得部１３２は、例えば、受信部１３１によって受信された訓練データを取得する。取得部１３２は、訓練データ記憶部１２１から訓練データを取得してもよい。 (Acquisition unit 132)
The acquisition unit 132 acquires training data for generating a determination model related to the content. The acquisition unit 132 acquires, for example, the training data received by the reception unit 131. The acquisition unit 132 may acquire training data from the training data storage unit 121.

取得部１３２は、配信対象を示すベクトルであって、かかる配信対象に含まれる情報と対応する次元に所定の値が格納された配信対象ベクトルを取得する。例えば、受信部１３１は、配信対象となる文章に含まれる単語と対応する次元に所定の値が格納された配信対象ベクトルを取得する。 The acquisition unit 132 acquires a distribution target vector that is a vector indicating a distribution target and in which a predetermined value is stored in a dimension corresponding to the information included in the distribution target. For example, the receiving unit 131 acquires a distribution target vector in which a predetermined value is stored in a dimension corresponding to a word included in a sentence to be distributed.

取得部１３２は、例えば、受信部１３１によって受信された配信対象ベクトルを取得する。取得部１３２は、訓練データ記憶部１２１から配信対象ベクトルを取得してもよい。 The acquisition unit 132 acquires, for example, the distribution target vector received by the reception unit 131. The acquisition unit 132 may acquire the distribution target vector from the training data storage unit 121.

取得部１３２は、利用者を示す利用者ベクトルを取得する。例えば、取得部１３２は、利用者が有する属性と対応する次元に所定の値が格納された利用者ベクトルを取得する。 The acquisition unit 132 acquires a user vector indicating a user. For example, the acquisition unit 132 acquires a user vector in which a predetermined value is stored in a dimension corresponding to an attribute possessed by the user.

取得部１３２は、例えば、受信部１３１によって受信された利用者ベクトルを取得する。取得部１３２は、訓練データ記憶部１２１から利用者ベクトルを取得してもよい。 The acquisition unit 132 acquires, for example, the user vector received by the reception unit 131. The acquisition unit 132 may acquire a user vector from the training data storage unit 121.

（学習部１３３）
学習部１３３は、配信対象ベクトルと利用者ベクトルとに基づいて、配信対象ベクトルと対応する配信対象が、利用者ベクトルと対応する利用者と対応するか否かを判定するモデルを学習する。例えば、学習部１３３は、利用者に配信された配信対象と対応する配信対象ベクトルと、利用者の利用者ベクトルとの共起性をモデルに学習させる。 (Learning Department 133)
The learning unit 133 learns a model for determining whether or not the distribution target corresponding to the distribution target vector corresponds to the user corresponding to the user vector based on the distribution target vector and the user vector. For example, the learning unit 133 makes the user learn the co-occurrence between the distribution target vector corresponding to the distribution target and the user vector of the user as a model.

より具体的には、学習部１３３は、利用者に配信された配信対象が選択されたか否か示す選択情報（例えば、選択フラグ）に基づいて、配信対象ベクトルと利用者ベクトルとの共起性をモデルに学習させる。例えば、学習部１３３は、利用者ベクトルおよび配信対象ベクトルを訓練データの入力として用い、かつ利用者に配信された配信対象が選択されたか否か示す選択情報を訓練データの出力として用いることで、配信対象ベクトルと利用者ベクトルとの共起性に関する距離をモデルに学習させる。 More specifically, the learning unit 133 has co-occurrence between the distribution target vector and the user vector based on the selection information (for example, the selection flag) indicating whether or not the distribution target delivered to the user has been selected. To train the model. For example, the learning unit 133 uses the user vector and the distribution target vector as the input of the training data, and uses the selection information indicating whether or not the distribution target distributed to the user has been selected as the output of the training data. Let the model learn the distance related to the co-occurrence between the distribution target vector and the user vector.

一例では、モデルは、ニューラルネットである。この例では、配信対象ベクトルは、配信対象（例えば、プッシュ通知）に対応する埋め込みベクトル（例えば、分散表現）である。また、この例では、利用者ベクトルは、利用者（例えば、ユーザ）に対応する埋め込みベクトルである。 In one example, the model is a neural network. In this example, the delivery target vector is an embedded vector (eg, distributed representation) corresponding to the delivery target (eg, push notification). Further, in this example, the user vector is an embedded vector corresponding to a user (for example, a user).

学習部１３３は、配信対象ベクトルと利用者ベクトルとを連結した連結ベクトルから、配信対象ベクトルと対応する配信対象が、利用者ベクトルと対応する利用者と対応するか否かを判定するモデルを学習する。言い換えれば、学習部１３３は、配信対象ベクトルと利用者ベクトルとの連結（concatenation）であるベクトルから、モデルを学習する。例えば、学習部１３３は、利用者ベクトルの末尾に配信対象ベクトルを連結した連結ベクトルから、配信対象が利用者と対応するか否かを判定するモデルを学習する。例えば、学習部１３３は、次元数が削減されるように連結ベクトルを徐々に畳み込むことで、配信対象が利用者と対応するか否かを示す値を生成するモデルを学習する。 The learning unit 133 learns a model for determining whether or not the distribution target corresponding to the distribution target vector corresponds to the user corresponding to the user vector from the concatenated vector in which the distribution target vector and the user vector are connected. do. In other words, the learning unit 133 learns the model from the vector which is a concatenation between the distribution target vector and the user vector. For example, the learning unit 133 learns a model for determining whether or not the distribution target corresponds to the user from the connection vector in which the distribution target vector is connected to the end of the user vector. For example, the learning unit 133 learns a model that generates a value indicating whether or not the distribution target corresponds to the user by gradually convolving the connection vector so that the number of dimensions is reduced.

一例では、学習部１３３は、配信対象に対応する埋め込みベクトルと利用者に対応する埋め込みベクトルとを連結することで、第１のベクトルを生成する。そして、学習部１３３は、生成された第１のベクトルに対応する中間層を含むニューラルネットであって、配信対象が利用者と対応するか否かを判定するニューラルネットを学習する。 In one example, the learning unit 133 generates the first vector by connecting the embedded vector corresponding to the distribution target and the embedded vector corresponding to the user. Then, the learning unit 133 learns a neural network including an intermediate layer corresponding to the generated first vector, and determines whether or not the distribution target corresponds to the user.

例えば、学習部１３３は、第１の重み行列に第１のベクトルを乗ずることで、第２のベクトルを算出する。次いで、学習部１３３は、算出された第２のベクトルを活性化関数に代入することで、隠れ状態ベクトルを生成する。次いで、学習部１３３は、第２の重み行列に隠れ状態ベクトを乗ずることで、関連度を算出する。その後、学習部１３３は、配信対象が選択されたか否か示す選択情報（例えば、選択フラグ）と、算出された関連度とを用いて、所定の配信対象が利用者に適合するか否かを判定するニューラルネットを学習する。 For example, the learning unit 133 calculates the second vector by multiplying the first weight matrix by the first vector. Next, the learning unit 133 generates a hidden state vector by substituting the calculated second vector into the activation function. Next, the learning unit 133 calculates the degree of relevance by multiplying the second weight matrix by the hidden state vector. After that, the learning unit 133 uses the selection information (for example, the selection flag) indicating whether or not the distribution target is selected and the calculated relevance to determine whether or not the predetermined distribution target is suitable for the user. Learn the neural network to judge.

学習部１３３は、配信対象ベクトルから、配信対象ベクトルが示す配信対象に含まれる情報の特徴を示す特徴ベクトルの平均値の和となるベクトルを生成し、生成されたベクトルに基づいて、利用者と配信対象とが対応するか否かを判定するモデルを学習する。 The learning unit 133 generates a vector from the distribution target vector, which is the sum of the average values of the feature vectors indicating the characteristics of the information included in the distribution target indicated by the distribution target vector, and based on the generated vector, with the user. Learn a model that determines whether or not it corresponds to the delivery target.

一例では、情報の特徴を示す特徴ベクトルは、分散表現である。この例では、配信対象に含まれる情報が、第１～３の特徴を含む場合に、配信対象に対応する埋め込みベクトルは、第１～３の特徴にそれぞれ対応する３つの分散表現の平均ベクトルである。 In one example, the feature vector that represents the feature of the information is a distributed representation. In this example, when the information included in the distribution target includes the first to third features, the embedded vector corresponding to the delivery target is the average vector of the three distributed representations corresponding to the first to third features, respectively. be.

学習部１３３は、利用者ベクトルと、利用者ベクトルが示す利用者が選択した配信対象を示す配信対象ベクトルとが入力された際に、利用者ベクトルが示す利用者と配信対象ベクトルが示す配信対象とが対応する旨を出力し、利用者ベクトルと、利用者ベクトルが示す利用者が選択しなかった配信対象を示す配信対象ベクトルとが入力された際に、利用者ベクトルが示す利用者と配信対象ベクトルが示す配信対象とが対応しない旨が出力されるように、モデルを学習する。 When the user vector and the distribution target vector indicating the distribution target selected by the user indicated by the user vector are input, the learning unit 133 includes the user indicated by the user vector and the distribution target indicated by the distribution target vector. Is output, and when the user vector and the distribution target vector indicating the distribution target not selected by the user indicated by the user vector are input, the user and distribution indicated by the user vector are input. The model is trained so that it is output that the distribution target indicated by the target vector does not correspond.

例えば、学習部１３３は、利用者ベクトルと、利用者が選択した配信対象を示す配信対象ベクトル（例えば、テストデータである配信対象ベクトル）とが入力された際に、モデルによって出力される値が、配信対象が選択されたことを示す値（例えば、選択フラグの選択に対応する値「１」）に近づくように、訓練用のデータである配信対象ベクトルを用いてモデルを訓練する。また、例えば、学習部１３３は、正則化手法（例えば、Ｌ２正則化等）を用いて、汎化誤差が最小化されるようにモデルを学習する。 For example, in the learning unit 133, when the user vector and the distribution target vector indicating the distribution target selected by the user (for example, the distribution target vector which is test data) are input, the value output by the model is set. , The model is trained using the distribution target vector, which is the training data, so as to approach the value indicating that the distribution target has been selected (for example, the value “1” corresponding to the selection of the selection flag). Further, for example, the learning unit 133 uses a regularization method (for example, L2 regularization, etc.) to learn the model so that the generalization error is minimized.

学習部１３３は、例えば、配信対象が選択されたか否か示す選択情報およびモデルの出力に対応する目的関数が最小化されるように、モデルを学習する。一例では、学習部１３３は、誤差逆伝播法を用いて、選択情報に含まれる選択フラグの値およびニューラルネットワークの出力（例えば、上述の関連度）に対応する目的関数が最小化されるように、ニューラルネットを学習する。 The learning unit 133 learns the model so that, for example, the selection information indicating whether or not the distribution target is selected and the objective function corresponding to the output of the model are minimized. In one example, the learning unit 133 uses an error backpropagation method to minimize the objective function corresponding to the value of the selection flag contained in the selection information and the output of the neural network (for example, the degree of relevance described above). , Learn neural nets.

（生成部１３４）
生成部１３４は、配信対象の候補となる配信対象に含まれる情報と対応する次元に所定の値が格納された候補ベクトルと、配信対象の候補の配信先となる利用者を示す配信先ベクトルとを生成する。例えば、生成部１３４は、所定のサーバから、配信対象の候補となる配信対象に含まれる情報に含まれるキーワードに対応する少なくとも１つのベクトル（例えば、分散表現）を取得する。また、例えば、生成部１３４は、所定のサーバから、配信対象の候補の配信先となる利用者の特徴に対応する少なくとも１つのベクトル（例えば、分散表現）を取得する。そして、生成部１３４は、情報に含まれるキーワードに対応する少なくとも１つのベクトルに基づいて、候補ベクトルを生成する。また、生成部１３４は、利用者の特徴に対応する少なくとも１つのベクトルに基づいて、配信先ベクトルを生成する。 (Generation unit 134)
The generation unit 134 includes a candidate vector in which a predetermined value is stored in a dimension corresponding to the information included in the distribution target that is a candidate for distribution target, and a distribution destination vector that indicates a user who is a distribution destination of the candidate distribution target. To generate. For example, the generation unit 134 acquires at least one vector (for example, a distributed representation) corresponding to a keyword included in the information included in the distribution target that is a candidate for distribution target from a predetermined server. Further, for example, the generation unit 134 acquires at least one vector (for example, a distributed representation) corresponding to the characteristics of the user who is the delivery destination of the candidate to be delivered from a predetermined server. Then, the generation unit 134 generates a candidate vector based on at least one vector corresponding to the keyword included in the information. Further, the generation unit 134 generates a delivery destination vector based on at least one vector corresponding to the characteristics of the user.

生成部１３４は、配信対象に含まれる情報を用いて、配信対象を示すベクトルであって、かかる配信対象に含まれる情報と対応する次元に所定の値が格納された配信対象ベクトルを生成してもよい。また、生成部１３４は、利用者が有する属性に関する情報を用いて、利用者が有する属性と対応する次元に所定の値が格納された利用者ベクトルを生成してもよい。 The generation unit 134 uses the information included in the distribution target to generate a distribution target vector in which a predetermined value is stored in a dimension corresponding to the information included in the distribution target, which is a vector indicating the distribution target. May be good. Further, the generation unit 134 may generate a user vector in which a predetermined value is stored in a dimension corresponding to the attribute possessed by the user by using the information regarding the attribute possessed by the user.

生成部１３４は、配信対象に含まれる情報を用いて、配信対象に対応するＢｏＷベクトルを生成してもよい。また、生成部１３４は、利用者が有する属性に関する情報を用いて、利用者に対応するワンホットベクトルを生成してもよい。 The generation unit 134 may generate a BoW vector corresponding to the distribution target by using the information included in the distribution target. Further, the generation unit 134 may generate a one-hot vector corresponding to the user by using the information regarding the attribute possessed by the user.

（判定部１３５）
判定部１３５は、学習部１３３により学習されたモデルを用いて、候補ベクトルと配信先ベクトルとから、配信対象の候補を配信先となる利用者に配信するか否かを判定する。また、判定部１３５は、候補ベクトルと配信先ベクトルとが入力された際に、候補ベクトルが示す配信候補が、配信先ベクトルが示す利用者と対応する旨をモデルが出力した場合は、配信候補を利用者に配信すると判定する。 (Judgment unit 135)
The determination unit 135 determines whether or not to distribute the candidate to be distributed to the user who is the distribution destination from the candidate vector and the distribution destination vector by using the model learned by the learning unit 133. Further, when the determination unit 135 outputs that the delivery candidate indicated by the candidate vector corresponds to the user indicated by the delivery destination vector when the candidate vector and the delivery destination vector are input, the determination unit 135 outputs that the delivery candidate corresponds to the user indicated by the delivery destination vector. Is determined to be delivered to the user.

一例では、判定部１３５は、学習されたモデル（例えば、ニューラルネット）を用いて、利用者と配信対象の候補との間の関連性を示す関連性スコアを出力する。より具体的には、判定部１３５は、配信対象の候補の配信先に対応する配信先ベクトルと、配信対象の候補に対応する候補ベクトルとを、モデルに入力することで、利用者が配信対象の候補を選択する確率を、利用者と配信対象の候補との間の関連性スコアとして出力する。 In one example, the determination unit 135 uses a trained model (eg, a neural network) to output a relevance score indicating the relevance between the user and the candidate to be distributed. More specifically, the determination unit 135 inputs a delivery destination vector corresponding to the delivery destination of the candidate to be delivered and a candidate vector corresponding to the candidate to be delivered into the model, so that the user can deliver the target. The probability of selecting a candidate is output as a relevance score between the user and the candidate to be delivered.

（提供部１３６）
提供部１３６は、学習部１３３により学習されたモデルを提供する。例えば、提供部１３６は、学習されたモデルを提供装置２０に提供する。提供部１３６は、判定部１３５による判定結果を、提供装置２０に提供してもよい。 (Providing Department 136)
The providing unit 136 provides a model learned by the learning unit 133. For example, the providing unit 136 provides the trained model to the providing device 20. The providing unit 136 may provide the determination result by the determination unit 135 to the providing device 20.

〔４．学習処理のフロー〕
次に、実施形態に係る学習装置１００による学習処理の手順について説明する。図５は、実施形態に係る学習装置１００による学習処理手順を示すフローチャートである。 [4. Learning process flow]
Next, the procedure of the learning process by the learning device 100 according to the embodiment will be described. FIG. 5 is a flowchart showing a learning processing procedure by the learning device 100 according to the embodiment.

図５に示すように、はじめに、学習装置１００は、配信対象を示すベクトルであって、配信対象に含まれる情報と対応する次元に所定の値が格納された配信対象ベクトルと、利用者を示す利用者ベクトルとを取得する（ステップＳ１０１）。例えば、学習装置１００は、利用者が有する属性と対応する次元に所定の値が格納された利用者ベクトルを取得する。また、例えば、学習装置１００は、信対象となる文章に含まれる単語と対応する次元に所定の値が格納された配信対象ベクトルと取得する。 As shown in FIG. 5, first, the learning device 100 is a vector indicating a distribution target, and shows a distribution target vector in which a predetermined value is stored in a dimension corresponding to the information included in the distribution target, and a user. Acquire the user vector (step S101). For example, the learning device 100 acquires a user vector in which a predetermined value is stored in a dimension corresponding to an attribute possessed by the user. Further, for example, the learning device 100 acquires a distribution target vector in which a predetermined value is stored in a dimension corresponding to a word included in a sentence to be trusted.

次いで、学習装置１００は、配信対象ベクトルと利用者ベクトルとに基づいて、配信対象ベクトルと対応する配信対象が、利用者ベクトルと対応する利用者と対応するか否かを判定するモデルを学習する（ステップＳ１０２）。例えば、学習装置１００は、利用者に配信された配信対象と対応する配信対象ベクトルと、利用者の利用者ベクトルとの共起性を前記モデルに学習させる。 Next, the learning device 100 learns a model for determining whether or not the distribution target corresponding to the distribution target vector corresponds to the user corresponding to the user vector based on the distribution target vector and the user vector. (Step S102). For example, the learning device 100 causes the model to learn the co-occurrence between the distribution target vector corresponding to the distribution target distributed to the user and the user vector of the user.

より具体的には、学習装置１００は、利用者ベクトルおよび配信対象ベクトルを訓練データの入力として用い、かつ利用者に配信された配信対象が選択されたか否か示す選択情報を訓練データの出力として用いることで、配信対象ベクトルと利用者ベクトルとの共起性に関する距離をモデルに学習させる。 More specifically, the learning device 100 uses the user vector and the distribution target vector as input of training data, and outputs selection information indicating whether or not the distribution target distributed to the user is selected as the output of the training data. By using it, the model learns the distance related to the coexistence between the distribution target vector and the user vector.

〔５．その他〕
また、上記実施形態において説明した各処理のうち、自動的に行われるものとして説明した処理の一部を手動的に行うこともできる。あるいは、手動的に行われるものとして説明した処理の全部または一部を公知の方法で自動的に行うこともできる。この他、上記文書中や図面中で示した処理手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて任意に変更することができる。例えば、各図に示した各種情報は、図示した情報に限られない。 [5. others〕
Further, among the processes described in the above-described embodiment, a part of the processes described as being automatically performed can also be performed manually. Alternatively, all or part of the process described as being performed manually can be automatically performed by a known method. In addition, information including processing procedures, specific names, various data and parameters shown in the above documents and drawings can be arbitrarily changed unless otherwise specified. For example, the various information shown in each figure is not limited to the information shown in the figure.

また、図示した各装置の各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。 Further, each component of each of the illustrated devices is a functional concept, and does not necessarily have to be physically configured as shown in the figure. That is, the specific form of distribution / integration of each device is not limited to the one shown in the figure, and all or part of them may be functionally or physically distributed / physically in any unit according to various loads and usage conditions. Can be integrated and configured.

例えば、図３に示した記憶部１２０の一部又は全部は、学習装置１００によって保持されるのではなく、ストレージサーバ等に保持されてもよい。この場合、学習装置１００は、ストレージサーバにアクセスすることで、訓練データ等の各種情報を取得する。 For example, a part or all of the storage unit 120 shown in FIG. 3 may not be held by the learning device 100, but may be held by a storage server or the like. In this case, the learning device 100 acquires various information such as training data by accessing the storage server.

〔６．ハードウェア構成〕
また、上述してきた実施形態に係る学習装置１００は、例えば図６に示すような構成のコンピュータ１０００によって実現される。図６は、ハードウェア構成の一例を示す図である。コンピュータ１０００は、出力装置１０１０、入力装置１０２０と接続され、演算装置１０３０、一次記憶装置１０４０、二次記憶装置１０５０、出力ＩＦ（Interface）１０６０、入力ＩＦ１０７０、ネットワークＩＦ１０８０がバス１０９０により接続された形態を有する。 [6. Hardware configuration]
Further, the learning device 100 according to the above-described embodiment is realized by, for example, a computer 1000 having a configuration as shown in FIG. FIG. 6 is a diagram showing an example of a hardware configuration. The computer 1000 is connected to the output device 1010 and the input device 1020, and the arithmetic device 1030, the primary storage device 1040, the secondary storage device 1050, the output IF (Interface) 1060, the input IF 1070, and the network IF 1080 are connected by the bus 1090. Have.

演算装置１０３０は、一次記憶装置１０４０や二次記憶装置１０５０に格納されたプログラムや入力装置１０２０から読み出したプログラム等に基づいて動作し、各種の処理を実行する。一次記憶装置１０４０は、ＲＡＭ等、演算装置１０３０が各種の演算に用いるデータを一時的に記憶するメモリ装置である。また、二次記憶装置１０５０は、演算装置１０３０が各種の演算に用いるデータや、各種のデータベースが登録される記憶装置であり、ＲＯＭ(Read Only Memory)、ＨＤＤ、フラッシュメモリ等により実現される。 The arithmetic unit 1030 operates based on a program stored in the primary storage device 1040 or the secondary storage device 1050, a program read from the input device 1020, or the like, and executes various processes. The primary storage device 1040 is a memory device such as a RAM that temporarily stores data used by the arithmetic unit 1030 for various calculations. Further, the secondary storage device 1050 is a storage device in which data used by the arithmetic unit 1030 for various calculations and various databases are registered, and is realized by a ROM (Read Only Memory), an HDD, a flash memory, or the like.

出力ＩＦ１０６０は、モニタやプリンタといった各種の情報を出力する出力装置１０１０に対し、出力対象となる情報を送信するためのインタフェースであり、例えば、ＵＳＢ（Universal Serial Bus）やＤＶＩ（Digital Visual Interface）、ＨＤＭＩ（登録商標）（High Definition Multimedia Interface）といった規格のコネクタにより実現される。また、入力ＩＦ１０７０は、マウス、キーボード、およびスキャナ等といった各種の入力装置１０２０から情報を受信するためのインタフェースであり、例えば、ＵＳＢ等により実現される。 The output IF 1060 is an interface for transmitting information to be output to an output device 1010 that outputs various information such as a monitor and a printer. For example, USB (Universal Serial Bus), DVI (Digital Visual Interface), and the like. It is realized by a standard connector such as HDMI (registered trademark) (High Definition Multimedia Interface). Further, the input IF 1070 is an interface for receiving information from various input devices 1020 such as a mouse, a keyboard, a scanner, and the like, and is realized by, for example, USB.

なお、入力装置１０２０は、例えば、ＣＤ（Compact Disc）、ＤＶＤ（Digital Versatile Disc）、ＰＤ（Phase change rewritable Disk）等の光学記録媒体、ＭＯ（Magneto-Optical disk）等の光磁気記録媒体、テープ媒体、磁気記録媒体、または半導体メモリ等から情報を読み出す装置であってもよい。また、入力装置１０２０は、ＵＳＢメモリ等の外付け記憶媒体であってもよい。 The input device 1020 is, for example, an optical recording medium such as a CD (Compact Disc), a DVD (Digital Versatile Disc), a PD (Phase change rewritable Disk), a magneto-optical recording medium such as an MO (Magneto-Optical disk), or a tape. It may be a device that reads information from a medium, a magnetic recording medium, a semiconductor memory, or the like. Further, the input device 1020 may be an external storage medium such as a USB memory.

ネットワークＩＦ１０８０は、ネットワークＮを介して他の機器からデータを受信して演算装置１０３０へ送り、また、ネットワークＮを介して演算装置１０３０が生成したデータを他の機器へ送信する。 The network IF 1080 receives data from another device via the network N and sends it to the arithmetic unit 1030, and also transmits the data generated by the arithmetic unit 1030 to the other device via the network N.

演算装置１０３０は、出力ＩＦ１０６０や入力ＩＦ１０７０を介して、出力装置１０１０や入力装置１０２０の制御を行う。例えば、演算装置１０３０は、入力装置１０２０や二次記憶装置１０５０からプログラムを一次記憶装置１０４０上にロードし、ロードしたプログラムを実行する。 The arithmetic unit 1030 controls the output device 1010 and the input device 1020 via the output IF 1060 and the input IF 1070. For example, the arithmetic unit 1030 loads a program from the input device 1020 or the secondary storage device 1050 onto the primary storage device 1040, and executes the loaded program.

例えば、コンピュータ１０００が学習装置１００として機能する場合、コンピュータ１０００の演算装置１０３０は、一次記憶装置１０４０上にロードされたプログラムを実行することにより、制御部１３０の機能を実現する。 For example, when the computer 1000 functions as the learning device 100, the arithmetic unit 1030 of the computer 1000 realizes the function of the control unit 130 by executing the program loaded on the primary storage device 1040.

〔７．効果〕
上述してきたように、実施形態に係る学習装置１００は、取得部１３２と、学習部１３３とを有する。取得部１３２は、配信対象を示すベクトルであって、配信対象に含まれる情報と対応する次元に所定の値が格納された配信対象ベクトルと、利用者を示す利用者ベクトルとを取得する。学習部１３３は、配信対象ベクトルと利用者ベクトルとに基づいて、配信対象ベクトルと対応する配信対象が、利用者ベクトルと対応する利用者と対応するか否かを判定するモデルを学習する。 [7. effect〕
As described above, the learning device 100 according to the embodiment has an acquisition unit 132 and a learning unit 133. The acquisition unit 132 acquires a distribution target vector, which is a vector indicating a distribution target and in which a predetermined value is stored in a dimension corresponding to the information included in the distribution target, and a user vector indicating a user. The learning unit 133 learns a model for determining whether or not the distribution target corresponding to the distribution target vector corresponds to the user corresponding to the user vector based on the distribution target vector and the user vector.

また、実施形態に係る学習装置１００において、取得部１３２は、利用者が有する属性と対応する次元に所定の値が格納された利用者ベクトルを取得する。 Further, in the learning device 100 according to the embodiment, the acquisition unit 132 acquires a user vector in which a predetermined value is stored in a dimension corresponding to the attribute possessed by the user.

また、実施形態に係る学習装置１００において、取得部１３２は、配信対象となる文章に含まれる単語と対応する次元に所定の値が格納された配信対象ベクトルを取得する。 Further, in the learning device 100 according to the embodiment, the acquisition unit 132 acquires a distribution target vector in which a predetermined value is stored in a dimension corresponding to a word included in a sentence to be distributed.

また、実施形態に係る学習装置１００において、学習部１３３は、利用者に配信された配信対象と対応する配信対象ベクトルと、利用者の利用者ベクトルとの共起性をモデルに学習させる。 Further, in the learning device 100 according to the embodiment, the learning unit 133 learns the co-occurrence between the distribution target vector corresponding to the distribution target distributed to the user and the user vector of the user as a model.

また、実施形態に係る学習装置１００において、学習部１３３は、配信対象ベクトルと利用者ベクトルとを連結した連結ベクトルから、配信対象ベクトルと対応する配信対象が、利用者ベクトルと対応する利用者と対応するか否かを判定するモデルを学習する。 Further, in the learning device 100 according to the embodiment, the learning unit 133 uses a connection vector in which the distribution target vector and the user vector are connected, and the distribution target corresponding to the distribution target vector is the user corresponding to the user vector. Learn a model that determines whether or not it corresponds.

また、実施形態に係る学習装置１００において、学習部１３３は、利用者ベクトルの末尾に配信対象ベクトルを連結した連結ベクトルから、配信対象が利用者と対応するか否かを判定するモデルを学習する。 Further, in the learning device 100 according to the embodiment, the learning unit 133 learns a model for determining whether or not the distribution target corresponds to the user from the connection vector in which the distribution target vector is connected to the end of the user vector. ..

また、実施形態に係る学習装置１００において、学習部１３３は、次元数が削減されるように連結ベクトルを徐々に畳み込むことで、配信対象が利用者と対応するか否かを示す値を生成するモデルを学習する。 Further, in the learning device 100 according to the embodiment, the learning unit 133 gradually convolves the connection vector so that the number of dimensions is reduced, thereby generating a value indicating whether or not the distribution target corresponds to the user. Learn the model.

また、実施形態に係る学習装置１００において、学習部１３３は、配信対象ベクトルから、配信対象ベクトルが示す配信対象に含まれる情報の特徴を示す特徴ベクトルの平均値の和となるベクトルを生成し、生成されたベクトルに基づいて、利用者と配信対象とが対応するか否かを判定するモデルを学習する。 Further, in the learning device 100 according to the embodiment, the learning unit 133 generates a vector from the distribution target vector, which is the sum of the average values of the feature vectors indicating the characteristics of the information included in the distribution target indicated by the distribution target vector. Based on the generated vector, learn a model that determines whether or not the user and the distribution target correspond to each other.

また、実施形態に係る学習装置１００において、学習部１３３は、利用者ベクトルと、利用者ベクトルが示す利用者が選択した配信対象を示す配信対象ベクトルとが入力された際に、利用者ベクトルが示す利用者と配信対象ベクトルが示す配信対象とが対応する旨を出力し、利用者ベクトルと、利用者ベクトルが示す利用者が選択しなかった配信対象を示す配信対象ベクトルとが入力された際に、利用者ベクトルが示す利用者と配信対象ベクトルが示す配信対象とが対応しない旨が出力されるように、モデルを学習する。 Further, in the learning device 100 according to the embodiment, when the user vector and the distribution target vector indicating the distribution target selected by the user indicated by the user vector are input, the learning unit 133 uses the user vector. When it is output that the indicated user and the distribution target indicated by the distribution target vector correspond to each other, and the user vector and the distribution target vector indicating the distribution target not selected by the user indicated by the user vector are input. The model is trained so that it is output that the user indicated by the user vector and the distribution target indicated by the distribution target vector do not correspond to each other.

また、実施形態に係る学習装置１００は、配信対象の候補となる配信対象に含まれる情報と対応する次元に所定の値が格納された候補ベクトルと、配信対象の候補の配信先となる利用者を示す配信先ベクトルとを生成する生成部１３４を有する。また、実施形態に係る学習装置１００は、学習部１３３により学習されたモデルを用いて、候補ベクトルと配信先ベクトルとから、配信対象の候補を配信先となる利用者に配信するか否かを判定する判定部１３５を有する。 Further, the learning device 100 according to the embodiment has a candidate vector in which a predetermined value is stored in a dimension corresponding to the information included in the distribution target that is a candidate for distribution target, and a user that is a distribution destination of the candidate distribution target. It has a generation unit 134 that generates a delivery destination vector indicating the above. Further, the learning device 100 according to the embodiment uses the model learned by the learning unit 133 to determine whether or not to distribute the candidate to be distributed to the user who is the distribution destination from the candidate vector and the distribution destination vector. It has a determination unit 135 for determination.

また、実施形態に係る学習装置１００において、判定部１３５は、候補ベクトルと配信先ベクトルとが入力された際に、候補ベクトルが示す配信候補が、配信先ベクトルが示す利用者と対応する旨をモデルが出力した場合は、配信候補を利用者に配信すると判定する。 Further, in the learning device 100 according to the embodiment, when the candidate vector and the delivery destination vector are input, the determination unit 135 indicates that the delivery candidate indicated by the candidate vector corresponds to the user indicated by the delivery destination vector. When the model outputs, it is determined that the delivery candidate is delivered to the user.

上述した各処理により、学習装置１００は、所定の通知がユーザに適合するか否かを判定する判定モデルを学習することができる。 By each of the above-mentioned processes, the learning device 100 can learn a determination model for determining whether or not a predetermined notification is suitable for the user.

以上、本願の実施形態のいくつかを図面に基づいて詳細に説明したが、これらは例示であり、発明の開示の欄に記載の態様を始めとして、当業者の知識に基づいて種々の変形、改良を施した他の形態で本発明を実施することが可能である。 Although some of the embodiments of the present application have been described in detail with reference to the drawings, these are examples, and various modifications are made based on the knowledge of those skilled in the art, including the embodiments described in the disclosure column of the invention. It is possible to carry out the present invention in other modified forms.

また、上述した学習装置１００は、複数のサーバコンピュータで実現してもよく、また、機能によっては外部のプラットフォーム等をＡＰＩ（Application Programming Interface）やネットワークコンピューティングなどで呼び出して実現するなど、構成は柔軟に変更できる。 Further, the above-mentioned learning device 100 may be realized by a plurality of server computers, and depending on the function, it may be realized by calling an external platform or the like by API (Application Programming Interface), network computing, or the like. It can be changed flexibly.

また、上述してきた「部（section、module、unit）」は、「手段」や「回路」などに読み替えることができる。例えば、受信部は、受信手段や受信回路に読み替えることができる。 Further, the above-mentioned "section, module, unit" can be read as "means" or "circuit". For example, the receiving unit can be read as a receiving means or a receiving circuit.

１ネットワークシステム
１０端末装置
２０提供装置
１００学習装置
１２０記憶部
１２１訓練データ記憶部
１２２判定モデル記憶部
１３０制御部
１３１受信部
１３２取得部
１３３学習部
１３４生成部
１３５判定部
１３６提供部 1 Network system 10 Terminal device 20 Providing device 100 Learning device 120 Storage unit 121 Training data storage unit 122 Judgment model storage unit 130 Control unit 131 Reception unit 132 Acquisition unit 133 Learning unit 134 Generation unit 135 Judgment unit 136 Providing unit

Claims

The distribution target vector, which is a distributed expression corresponding to the word included in the title of the first distribution target, which is a push notification, the user vector, which is a distributed expression corresponding to the demographic attribute of the first user, and the first. An acquisition unit that acquires selection information indicating whether the user of 1 has selected the first distribution target, and
The distribution target vector and the user vector are used as input of training data, and the selection information is used as output of training data, and the second user selects a second distribution target which is a push notification . It has a learning unit that learns a model to determine whether
The learning unit
By gradually convolving the connection vector that connects the distribution target vector and the user vector so that the number of dimensions is reduced, the distribution target corresponding to the distribution target vector can be used corresponding to the user vector. Learn a model that determines whether or not to correspond to a person
A learning device that features that.

The acquisition unit
The learning device according to claim 1, wherein a user vector in which a predetermined value is stored in a dimension corresponding to the demographic attribute possessed by the first user is acquired.

The acquisition unit
The learning device according to claim 1 or 2, wherein a distribution target vector in which a predetermined value is stored in a dimension corresponding to a word included in a sentence to be distributed is acquired.

The learning unit
Of claims 1 to 3, a model for determining whether or not the distribution target corresponds to the user is learned from a connection vector in which the distribution target vector is connected to the end of the user vector. The learning device according to any one .

The learning unit
From the distribution target vector, an average vector of a feature vector indicating the characteristics of the information included in the distribution target indicated by the distribution target vector is generated, and based on the generated average vector, the second user and the second user. The learning device according to any one of claims 1 to 4, wherein the learning device is characterized by learning a model for determining whether or not the delivery target is compatible with the above.

The learning unit
When the user vector and the distribution target vector indicating the distribution target selected by the user indicated by the user vector are input, the user indicated by the user vector and the distribution target indicated by the distribution target vector Outputs the correspondence, and when the user vector and the distribution target vector indicating the distribution target not selected by the user indicated by the user vector are input, the user indicated by the user vector is input. The learning device according to any one of claims 1 to 5 , wherein the model is trained so that it is output that the delivery target and the delivery target indicated by the delivery target vector do not correspond to each other.

Generation that generates a candidate vector in which a predetermined value is stored in a dimension corresponding to the information contained in the distribution target that is a candidate for distribution target, and a distribution destination vector that indicates a user who is a distribution destination of the candidate distribution target. Department and
Using the model learned by the learning unit, it has a determination unit for determining whether or not to distribute the candidate to be distributed to the user who is the distribution destination from the candidate vector and the distribution destination vector. The learning device according to any one of claims 1 to 6, wherein the learning device is characterized by the above.

The determination unit
When the candidate vector and the delivery destination vector are input, if the model outputs that the delivery candidate indicated by the candidate vector corresponds to the user indicated by the delivery destination vector, the delivery candidate is selected. The learning device according to claim 7 , wherein it is determined to deliver to the user.

The way the learning device performs,
The distribution target vector, which is a distributed expression corresponding to the word included in the title of the first distribution target, which is a push notification, the user vector, which is a distributed expression corresponding to the demographic attribute of the first user, and the first. An acquisition process for acquiring selection information indicating whether the user of 1 has selected the first distribution target, and
The distribution target vector and the user vector are used as input of training data, and the selection information is used as output of training data, and the second user selects a second distribution target which is a push notification . Including a learning process to learn a model to determine
The learning process is
By gradually convolving the connection vector that connects the distribution target vector and the user vector so that the number of dimensions is reduced, the distribution target corresponding to the distribution target vector can be used corresponding to the user vector. Learn a model that determines whether or not to correspond to a person
A learning method characterized by that.

The distribution target vector, which is a distributed expression corresponding to the word included in the title of the first distribution target, which is a push notification, the user vector, which is a distributed expression corresponding to the demographic attribute of the first user, and the first. An acquisition procedure for acquiring selection information indicating whether the user of 1 has selected the first distribution target, and
The distribution target vector and the user vector are used as input of training data, and the selection information is used as output of training data, and the second user selects a second distribution target which is a push notification . Let the computer perform the learning procedure to learn the model to determine
The learning procedure is
By gradually convolving the connection vector that connects the distribution target vector and the user vector so that the number of dimensions is reduced, the distribution target corresponding to the distribution target vector can be used corresponding to the user vector. Learn a model that determines whether or not to correspond to a person
A learning program characterized by that .