JP7041018B2

JP7041018B2 - Learning equipment, learning methods and learning programs

Info

Publication number: JP7041018B2
Application number: JP2018139835A
Authority: JP
Inventors: 伸裕鍜治
Original assignee: Yahoo Japan Corp
Current assignee: Yahoo Japan Corp
Priority date: 2018-07-25
Filing date: 2018-07-25
Publication date: 2022-03-23
Anticipated expiration: 2038-07-25
Also published as: JP2020017089A

Description

本発明は、学習装置、学習方法及び学習プログラムに関する。 The present invention relates to a learning device, a learning method and a learning program.

従来、各種の情報を利用者に対して提供する技術が知られている。このような技術の一例として、利用者が過去に選択した情報と関連する情報に基づいて、利用者と関連性が高い情報や利用者が興味を有する情報等、利用者と対応する情報を推定し、推定した情報を利用者に提供する技術が知られている。 Conventionally, a technique for providing various types of information to a user has been known. As an example of such technology, based on the information related to the information selected by the user in the past, the information corresponding to the user such as the information highly related to the user and the information that the user is interested in is estimated. However, there is known a technique for providing the estimated information to the user.

特開２０１２－１５０５６１号公報Japanese Unexamined Patent Publication No. 2012-150561

しかしながら、上述した従来技術では、利用者と対応する情報を適切に推定できない恐れがある。 However, with the above-mentioned conventional technology, there is a possibility that the information corresponding to the user cannot be estimated appropriately.

例えば、上述した従来技術では、新規の利用者等、過去に情報を選択していない利用者に対しては、利用者と対応する情報を適切に推定できない。 For example, in the above-mentioned conventional technology, it is not possible to appropriately estimate information corresponding to a user for a user who has not selected information in the past, such as a new user.

本願は、上記に鑑みてなされたものであって、利用者と対応する情報の推定精度を向上させることを目的とする。 The present application has been made in view of the above, and an object thereof is to improve the estimation accuracy of the information corresponding to the user.

本願に係る学習装置は、配信対象を示す複数の配信対象ベクトルを取得する取得部と、
所定の利用者に配信された配信対象を示す複数の配信対象ベクトルと、配信候補を示す配信候補ベクトルとが入力されると、当該配信候補ベクトルとの類似性に基づく重みが設定された複数の配信対象ベクトルと、前記配信候補ベクトルとに基づいて、前記配信候補が前記所定の利用者と対応するか否かを判定するモデルを、前記取得部により取得された配信対象ベクトルを用いて学習する学習部とを有することを特徴とする。 The learning device according to the present application includes an acquisition unit that acquires a plurality of distribution target vectors indicating distribution targets, and an acquisition unit.
When a plurality of distribution target vectors indicating a distribution target delivered to a predetermined user and a distribution candidate vector indicating a distribution candidate are input, a plurality of weights set based on the similarity with the distribution candidate vector are set. Based on the distribution target vector and the distribution candidate vector, a model for determining whether or not the distribution candidate corresponds to the predetermined user is learned using the distribution target vector acquired by the acquisition unit. It is characterized by having a learning unit.

実施形態の一態様によれば、利用者と対応する情報の推定精度を向上させることができるという効果を奏する。 According to one aspect of the embodiment, there is an effect that the estimation accuracy of the information corresponding to the user can be improved.

図１は、実施形態に係るネットワークシステムの構成例を示す図である。FIG. 1 is a diagram showing a configuration example of a network system according to an embodiment. 図２は、実施形態に係る学習処理の一例を示す図である。FIG. 2 is a diagram showing an example of a learning process according to an embodiment. 図３は、実施形態に係る学習装置の構成例を示す図である。FIG. 3 is a diagram showing a configuration example of the learning device according to the embodiment. 図４は、実施形態に係る訓練データ記憶部の一例を示す図である。FIG. 4 is a diagram showing an example of a training data storage unit according to an embodiment. 図５は、実施形態に係る学習装置による学習処理手順を示すフローチャートである。FIG. 5 is a flowchart showing a learning processing procedure by the learning device according to the embodiment. 図６は、ハードウェア構成の一例を示す図である。FIG. 6 is a diagram showing an example of a hardware configuration.

以下に、本願に係る学習装置、学習方法及び学習プログラムを実施するための形態（以下、「実施形態」と呼ぶ）について図面を参照しつつ詳細に説明する。なお、この実施形態により本願に係る学習装置、学習方法及び学習プログラムが限定されるものではない。また、各実施形態は、処理内容を矛盾させない範囲で適宜組み合わせることが可能である。また、以下の各実施形態において同一の部位には同一の符号を付し、重複する説明は省略する。 Hereinafter, the learning device, the learning method, and the embodiment for implementing the learning program (hereinafter referred to as “the embodiment”) according to the present application will be described in detail with reference to the drawings. It should be noted that this embodiment does not limit the learning device, learning method and learning program according to the present application. In addition, each embodiment can be appropriately combined as long as the processing contents do not contradict each other. Further, in each of the following embodiments, the same parts are designated by the same reference numerals, and duplicate description will be omitted.

〔１．ネットワークシステムの構成〕
まず、図１を参照して、実施形態に係るネットワークシステム１の構成について説明する。図１は、実施形態に係るネットワークシステム１の構成例を示す図である。図１に示すように、実施形態に係るネットワークシステム１には、端末装置１０と、提供装置２０と、学習装置１００とが含まれる。端末装置１０、提供装置２０および学習装置１００は、それぞれネットワークＮと有線又は無線により接続される。 [1. Network system configuration]
First, the configuration of the network system 1 according to the embodiment will be described with reference to FIG. FIG. 1 is a diagram showing a configuration example of the network system 1 according to the embodiment. As shown in FIG. 1, the network system 1 according to the embodiment includes a terminal device 10, a providing device 20, and a learning device 100. The terminal device 10, the providing device 20, and the learning device 100 are connected to the network N by wire or wirelessly, respectively.

図１中では図示していないが、ネットワークシステム１は、端末装置１０_１～１０_ｎを含んでもよい。本明細書では、端末装置１０_１～１０_ｎを区別する必要がない場合は、端末装置１０_１～１０_ｎを「端末装置１０」と総称する。また、ネットワークシステム１は、複数台の提供装置２０や、複数台の学習装置１００を含んでもよい。 Although not shown in FIG. 1, the network system 1 may include terminal devices 10 ₁ to 10 _n . In the present specification, when it is not necessary to distinguish the terminal devices 10 ₁ to 10 _n , the terminal devices 10 ₁ to 10 _n are collectively referred to as "terminal device 10". Further, the network system 1 may include a plurality of providing devices 20 and a plurality of learning devices 100.

端末装置１０は、ユーザによって利用される情報処理装置である。端末装置１０は、スマートフォン、デスクトップ型ＰＣ（Personal Computer）、ノート型ＰＣ、タブレット型ＰＣ、ＰＤＡ（Personal Digital Assistant）を含む、任意のタイプの情報処理装置であってもよい。 The terminal device 10 is an information processing device used by the user. The terminal device 10 may be any type of information processing device including a smartphone, a desktop PC (Personal Computer), a notebook PC, a tablet PC, and a PDA (Personal Digital Assistant).

提供装置２０は、端末装置１０および学習装置１００に、各種情報を提供するサーバ装置である。例えば、提供装置２０は、端末装置１０に対して、コンテンツ（例えば、プッシュ通知）を提供する。また、例えば、提供装置２０は、学習装置１００に対して、コンテンツに関連する判定モデルを生成するための訓練データを提供する。 The providing device 20 is a server device that provides various information to the terminal device 10 and the learning device 100. For example, the providing device 20 provides content (for example, push notification) to the terminal device 10. Further, for example, the providing device 20 provides the learning device 100 with training data for generating a determination model related to the content.

学習装置１００は、ユーザに適合する情報を判定するモデルを学習するサーバ装置である。学習装置１００は、ネットワークＮを介して、有線又は無線により端末装置１０および提供装置２０と通信を行う。 The learning device 100 is a server device that learns a model for determining information suitable for the user. The learning device 100 communicates with the terminal device 10 and the providing device 20 by wire or wirelessly via the network N.

〔２．学習処理〕
次に、図２を参照して、実施形態に係る学習処理の一例について説明する。図２は、実施形態に係る学習処理の一例を示す図である。 [2. Learning process]
Next, an example of the learning process according to the embodiment will be described with reference to FIG. FIG. 2 is a diagram showing an example of a learning process according to an embodiment.

図２の例では、提供装置２０は、「ユーザＵ１」によって利用される端末装置１０_１に対して、プッシュ型で複数の「通知Ｎ１～Ｎｎ（ｎは、任意の自然数）」および「通知ＮＭ１～ＮＭｍ（ｍは、任意の自然数）」を送信する。「プッシュ型」という語は、情報が、情報を提供する装置のトリガに基づいて送信される仕組みを指す。言い換えると、提供装置２０は、端末装置１０_１に、通知Ｎ１～Ｎｎおよび通知ＮＭ１～ＮＭｍを、プッシュ通知として送信する。 In the example of FIG. 2, the providing device 20 is _a push type and a plurality of "notifications N1 to Nn (n is an arbitrary natural number)" and "notification NM1" with respect to the terminal device 101 used by the "user U1". ~ NMm (m is an arbitrary natural number) ”is transmitted. The term "push" refers to the mechanism by which information is transmitted based on the trigger of the device that provides the information. In other words, the providing device 20 transmits the notifications N1 to Nn and the notifications NM1 to NMm to the terminal device 10 ₁ as push notifications.

図２の例では、提供装置２０は、通知Ｎ１～Ｎｎおよび通知ＮＭ１～ＮＭｍに対する応答を受信する。一例では、提供装置２０は、通知Ｎ１が選択されたことを示す選択情報を、端末装置１０_１から受信する。この場合、通知Ｎ１は、ユーザＵ１に適合すると考えられる。別の例では、提供装置２０は、通知Ｎ１が選択されなかったことを示す選択情報を、端末装置１０_１から受信する。この場合、通知Ｎ１は、ユーザＵ１に適合しないと考えられる。 In the example of FIG. 2, the providing device 20 receives the responses to the notifications N1 to Nn and the notifications NM1 to NMm. In _one example, the providing device 20 receives selection information from the terminal device 101 indicating that the notification N1 has been selected. In this case, the notification N1 is considered to be suitable for the user U1. In _another example, the providing device 20 receives selection information from the terminal device 101 indicating that the notification N1 has not been selected. In this case, the notification N1 is considered to be incompatible with the user U1.

図２の例では、提供装置２０は、訓練データを、学習装置１００に送信する。訓練データは、通知Ｎ１～Ｎｎおよび通知ＮＭ１～ＮＭｍの通知情報（例えば、通知Ｎ１～Ｎｎおよび通知ＮＭ１～ＮＭｍのタイトル）と、ユーザが通知Ｎ１～Ｎｎおよび通知ＮＭ１～ＮＭｍを選択したか否かを示す選択情報（例えば、選択フラグ）とを含む。 In the example of FIG. 2, the providing device 20 transmits the training data to the learning device 100. The training data includes notification information of notifications N1 to Nn and notifications NM1 to NMm (for example, titles of notifications N1 to Nn and notifications NM1 to NMm), and whether or not the user has selected notifications N1 to Nn and notifications NM1 to NMm. Includes selection information (eg, selection flag) indicating.

図２の例では、学習装置１００は、提供装置２０から受信された訓練データを用いて、所定の通知がユーザに適合するか否かを判定する、注意機構（attention mechanism）付きのモデル（アテンションモデルとも呼ばれる）を学習する。より具体的には、学習装置１００は、通知Ｎ１～Ｎｎおよび通知ＮＭ１～ＮＭｍの通知情報を、訓練データの入力として用いる。また、学習装置１００は、通知Ｎ１～Ｎｎが選択されたか否か示す選択情報を、訓練データの出力として用いる。 In the example of FIG. 2, the learning device 100 uses the training data received from the providing device 20 to determine whether or not a predetermined notification is suitable for the user, and is a model (attention) with an attention mechanism. Also called a model). More specifically, the learning device 100 uses the notification information of the notifications N1 to Nn and the notifications NM1 to NMm as input of training data. Further, the learning device 100 uses the selection information indicating whether or not the notifications N1 to Nn are selected as the output of the training data.

図２の例では、通知Ｎ１～Ｎｎおよび通知ＮＭ１～ＮＭｍの通知情報は、「ＢｏＷベクトル（bag of words vector）ＢＶＮ１～ＢＶＮｎ」および「ＢｏＷベクトルＢＶＮＭ１～ＢＶＮＭｍ」を含む。各ＢｏＷベクトルの複数の成分のうちゼロでない少なくとも１つの成分が、各通知の通知情報に含まれる少なくとも１つの単語に対応する。また、通知情報は、ＢｏＷベクトルＢＶＮ１～ＢＶＮｎおよびＢｏＷベクトルＢＶＮＭ１～ＢＶＮＭｍに対応する「埋め込みベクトルＮＥＮ１～ＮＥＮｎ」および「埋め込みベクトルＮＥＮＭ１～ＮＥＮＭｍ」を含む。一例では、ＢｏＷベクトルは、通知のタイトルに対応する。例えば、通知のタイトルが、単語「今夜」、「映画」、「テレビ」を含む場合に、埋め込みベクトルは、単語「今夜」、「映画」、「テレビ」にそれぞれ対応する３つの分散表現の平均ベクトルである。 In the example of FIG. 2, the notification information of the notifications N1 to Nn and the notifications NM1 to NMm includes "Bow vector (bag of words vector) BVN1 to BVNn" and "BoW vector BVNM1 to BVNMm". At least one non-zero component of the plurality of components of each BoW vector corresponds to at least one word contained in the notification information of each notification. Further, the notification information includes "embedded vectors NEN1 to NENn" and "embedded vectors NENM1 to NENMm" corresponding to the BoW vectors BVN1 to BVNn and the BoW vectors BVNM1 to BVNMm. In one example, the BoW vector corresponds to the title of the notification. For example, if the title of the notification contains the words "tonight", "movie", and "television", the embedded vector is the average of the three distributed representations corresponding to the words "tonight", "movie", and "television" respectively. It is a vector.

図２の例では、通知Ｎ１～Ｎｎが選択されたか否か示す選択情報は、０または１の選択フラグを含む。通知Ｎ１が選択されたことを示す選択フラグは、「１」である。また、通知Ｎ１が選択されなかったことを示す選択フラグは、「０」である。一方、通知ＮＭ１～ＮＭｍが選択されたか否か示す選択情報は、「１」の選択フラグを含む。すなわち、通知ＮＭ１～ＮＭｍは、全て過去に選択された通知である。このように、通知ＮＭ１～ＮＭｍは、注意機構のメモリに相当する。 In the example of FIG. 2, the selection information indicating whether or not the notifications N1 to Nn are selected includes a selection flag of 0 or 1. The selection flag indicating that the notification N1 has been selected is "1". The selection flag indicating that the notification N1 has not been selected is "0". On the other hand, the selection information indicating whether or not the notifications NM1 to NMm are selected includes the selection flag of "1". That is, the notifications NM1 to NMm are all notifications selected in the past. As described above, the notifications NM1 to NMm correspond to the memory of the attention mechanism.

はじめに、学習装置１００は、訓練データを参照し、ＢｏＷベクトルＢＶＮ１に対応する埋め込みベクトルＮＥＮ１を取得する。 First, the learning device 100 refers to the training data and acquires the embedded vector NEN1 corresponding to the BoW vector BVN1.

次いで、学習装置１００は、訓練データを参照し、ＢｏＷベクトルＢＶＮＭ１～ＢＶＮＭｍに対応する埋め込みベクトルＮＥＮＭ１～ＮＥＮＭｍを取得する。 Next, the learning device 100 refers to the training data and acquires the embedded vectors NENM1 to NENMm corresponding to the BoW vectors BVNM1 to BVNMm.

次いで、学習装置１００は、埋め込みベクトルＮＥＮ１と埋め込みベクトルＮＥＮＭ１～ＮＥＮＭｍとの間の重みα_１～α_ｍを算出する。学習装置１００は、例えば、ソフト注意機構における重みα_１～α_ｍを算出する。重みα_１～α_ｍは、埋め込みベクトルＮＥＮＭ１～ＮＥＮＭｍの重要度をそれぞれ示す。一例では、重みα_１～α_ｍは、アラインメントモデル（例えば、順伝播型ニューラルネットとしてパラメトライズされたアラインメントモデル）に基づいて算出される。別の例では、重みα_１～α_ｍは、埋め込みベクトルＮＥＮ１と、埋め込みベクトルＮＥＮＭ１～ＮＥＮＭｍの各々との内積に基づいて算出される。 Next, the learning device 100 calculates the weights α ₁ to α _m between the embedded vector NEN1 and the embedded vectors NENM1 to NENMm. The learning device 100 calculates, for example, the weights α ₁ to α _m in the soft attention mechanism. The weights α ₁ to α _m indicate the importance of the embedded vectors NENM 1 to NENM m, respectively. In one example, the weights α ₁ to α _m are calculated based on an alignment model (eg, an alignment model parameterized as a forward propagation neural network). In another example, the weights α ₁ to α _m are calculated based on the inner product of the embedded vector NEN1 and each of the embedded vectors NENM1 to NENMm.

次いで、学習装置１００は、算出された重みα_１～α_ｍを用いて、埋め込みベクトルＮＥＮＭ１～ＮＥＮＭｍの重み付き平均ＭＥＭ１を算出する。 Next, the learning device 100 calculates the weighted average MEM1 of the embedded vectors NENM1 to NENMm using the calculated weights α ₁ to α _m .

次いで、学習装置１００は、埋め込みベクトルＮＥＮ１と、算出された重み付き平均ＭＥＭ１（すなわち、重み付き平均のベクトル）とを連結（concatenate）することで、ベクトルを生成する。そして、生成されたベクトルに、活性化関数（例えば、ＲｅＬＵ（Rectified Linear unit）関数）を適用することで、隠れ状態ベクトルを生成する。 Next, the learning device 100 concatenates the embedded vector NEN1 and the calculated weighted average MEM1 (that is, the weighted average vector) to generate a vector. Then, a hidden state vector is generated by applying an activation function (for example, a ReLU (Rectified Linear unit) function) to the generated vector.

次いで、学習装置１００は、重み行列に生成された隠れ状態ベクトルを乗ずることで、関連度ＲＳ１を算出する。 Next, the learning device 100 calculates the relevance degree RS1 by multiplying the weight matrix by the generated hidden state vector.

次いで、学習装置１００は、通知Ｎ１が選択されたか否か示す選択情報に含まれる選択フラグの値と、算出された関連度ＲＳ１とを用いて、所定の通知がユーザに適合するか否かを判定する、注意機構付きのモデルを学習する。例えば、学習装置１００は、誤差逆伝播法を用いて、通知Ｎ１が選択されたか否か示す選択情報に含まれる選択フラグの値と算出された関連度ＲＳ１とに対応する目的関数（例えば、交差エントロピー損失関数）が最小化されるように、ソフト注意機構付きのモデルを学習する。 Next, the learning device 100 uses the value of the selection flag included in the selection information indicating whether or not the notification N1 is selected and the calculated relevance degree RS1 to determine whether or not the predetermined notification is suitable for the user. Learn a model with an attention mechanism to judge. For example, the learning device 100 uses an error backpropagation method and uses an objective function (for example, an intersection) corresponding to the value of the selection flag included in the selection information indicating whether or not the notification N1 is selected and the calculated relevance RS1. Learn a model with a soft attention mechanism so that the entropy loss function) is minimized.

通知Ｎ１以外の通知についても、学習装置１００は、例えば、通知Ｎ２～Ｎｎに対応する訓練データを用いて、ソフト注意機構付きのモデルを学習することができる。学習装置１００は、かかる学習処理を繰り返すことで、汎化誤差が最小化されるように、ソフト注意機構付きのモデルを学習することができる。 For notifications other than notification N1, the learning device 100 can learn a model with a soft attention mechanism using, for example, training data corresponding to notifications N2 to Nn. By repeating the learning process, the learning device 100 can learn a model with a soft attention mechanism so that the generalization error is minimized.

その後、学習装置１００は、学習された注意機構付きのモデルを用いて、ユーザＵ１と通知との間の関連性を示す関連性スコアを出力する。例えば、学習装置１００は、所定の通知に対応する埋め込みベクトルと、通知ＮＭ１～ＮＭｍに対応する埋め込みベクトル（すなわち、埋め込みベクトルＮＥＮＭ１～ＮＥＮＭｍ）とを、判定モデルに入力することで、ユーザＵ１が所定の通知を選択する事後確率を、ユーザＵ１と通知との間の関連性スコアとして出力する。 After that, the learning device 100 outputs a relevance score indicating the relevance between the user U1 and the notification by using the model with the learned attention mechanism. For example, the learning device 100 inputs the embedded vector corresponding to the predetermined notification and the embedded vector corresponding to the notifications NM1 to NMm (that is, the embedded vectors NENM1 to NENMm) into the determination model, so that the user U1 determines. The posterior probability of selecting the notification of is output as the relevance score between the user U1 and the notification.

上述のように、実施形態に係る学習装置１００は、通知に対応する埋め込みベクトルと、過去に選択された特定の通知（すなわち、注意機構のメモリ）に対応する埋め込みベクトルとを、訓練データの入力として用いる。また、学習装置１００は、通知の選択フラグの値を、訓練データの出力として用いる。そして、学習装置１００は、汎化誤差が最小化されるように、注意機構付きのモデルを学習する。これにより、学習装置１００は、所定の通知がユーザに適合するか否かを判定する判定モデルを学習することができる。 As described above, the learning device 100 according to the embodiment inputs the training data of the embedded vector corresponding to the notification and the embedded vector corresponding to the specific notification (that is, the memory of the attention mechanism) selected in the past. Used as. Further, the learning device 100 uses the value of the selection flag of the notification as the output of the training data. Then, the learning device 100 learns a model with an attention mechanism so that the generalization error is minimized. As a result, the learning device 100 can learn a determination model for determining whether or not a predetermined notification is suitable for the user.

ところで、図２の例では、提供装置２０は、端末装置１０に、通知をプッシュ通知として送信している。一般的に、プッシュ通知は、季節性のある情報である。例えば、提供装置２０は、速報性の高い情報（例えば、今夜の番組）を１回だけプッシュ通知として送信する。したがって、このようなプッシュ通知に対応する配信履歴が存在しない場合がある。このような状況は、コールドスタート問題（cold start problem）に関連する。この点に関し、プッシュ通知に対応する配信履歴が存在しない場合でも、学習装置１００は、上述の判定モデルを用いて、プッシュ通知がユーザに適合するか否かを判定することができる。このため、学習装置１００は、ユーザが、ユーザが関心のあるプッシュ通知をタイムリーに受け取ることを可能にする。以下、このような学習処理を実現する学習装置１００について詳細に説明する。 By the way, in the example of FIG. 2, the providing device 20 transmits the notification to the terminal device 10 as a push notification. In general, push notifications are seasonal information. For example, the providing device 20 transmits information with high breaking news (for example, tonight's program) only once as a push notification. Therefore, there may be no delivery history corresponding to such push notifications. Such a situation is associated with a cold start problem. In this regard, the learning device 100 can determine whether or not the push notification is suitable for the user by using the above-mentioned determination model even when the distribution history corresponding to the push notification does not exist. Therefore, the learning device 100 enables the user to receive push notifications of interest to the user in a timely manner. Hereinafter, the learning device 100 that realizes such a learning process will be described in detail.

〔３．学習装置の構成〕
次に、図３を参照して、実施形態に係る学習装置１００の構成例について説明する。図３は、実施形態に係る学習装置１００の構成例を示す図である。図３に示すように、学習装置１００は、通信部１１０と、記憶部１２０と、制御部１３０とを有する。なお、学習装置１００は、学習装置１００を利用する管理者等から各種操作を受け付ける入力部（例えば、キーボードやマウス等）や、各種情報を表示するための表示部（液晶ディスプレイ等）を有してもよい。 [3. Configuration of learning device]
Next, a configuration example of the learning device 100 according to the embodiment will be described with reference to FIG. FIG. 3 is a diagram showing a configuration example of the learning device 100 according to the embodiment. As shown in FIG. 3, the learning device 100 includes a communication unit 110, a storage unit 120, and a control unit 130. The learning device 100 has an input unit (for example, a keyboard, a mouse, etc.) that receives various operations from an administrator or the like who uses the learning device 100, and a display unit (liquid crystal display, etc.) for displaying various information. You may.

（通信部１１０）
通信部１１０は、例えば、ＮＩＣ（Network Interface Card）等によって実現される。通信部１１０は、ネットワーク網と有線又は無線により接続され、ネットワーク網を介して、端末装置１０および提供装置２０との間で情報の送受信を行う。 (Communication unit 110)
The communication unit 110 is realized by, for example, a NIC (Network Interface Card) or the like. The communication unit 110 is connected to the network network by wire or wirelessly, and transmits / receives information to / from the terminal device 10 and the providing device 20 via the network network.

（記憶部１２０）
記憶部１２０は、例えば、ＲＡＭ（Random Access Memory)、フラッシュメモリ（Flash Memory）等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。図３に示すように、記憶部１２０は、訓練データ記憶部１２１と、判定モデル記憶部１２２とを有する。 (Memory unit 120)
The storage unit 120 is realized by, for example, a semiconductor memory element such as a RAM (Random Access Memory) or a flash memory (Flash Memory), or a storage device such as a hard disk or an optical disk. As shown in FIG. 3, the storage unit 120 has a training data storage unit 121 and a determination model storage unit 122.

（訓練データ記憶部１２１）
図４は、実施形態に係る訓練データ記憶部１２１の一例を示す図である。訓練データ記憶部１２１は、コンテンツに関連する予測モデルを生成するための訓練データを記憶する。予測モデルは、例えば、所定の通知がユーザに適合するか否かを判定する判定モデルである。一例では、判定モデルは、注意機構付きのモデルである。 (Training data storage unit 121)
FIG. 4 is a diagram showing an example of the training data storage unit 121 according to the embodiment. The training data storage unit 121 stores training data for generating a prediction model related to the content. The prediction model is, for example, a determination model for determining whether or not a predetermined notification is suitable for the user. In one example, the determination model is a model with an attention mechanism.

訓練データ記憶部１２１は、例えば、受信部１３１によって受信された訓練データを記憶する。図４の例では、訓練データ記憶部１２１には、「訓練データ」が「ユーザＩＤ」ごとに記憶される。「ユーザＩＤ」は、ユーザを識別するための識別子を示す。例示として、「訓練データ」には、項目「通知ＩＤ」、「通知情報」および「選択情報」が含まれる。 The training data storage unit 121 stores, for example, the training data received by the reception unit 131. In the example of FIG. 4, "training data" is stored in the training data storage unit 121 for each "user ID". The "user ID" indicates an identifier for identifying a user. By way of example, the "training data" includes the items "notification ID", "notification information" and "selection information".

「通知ＩＤ」は、通知を識別するための識別子を示す。「通知情報」は、通知の内容に関する情報を示す。通知の内容は、例えば、通知のタイトルや、通知のボディ（例えば、本文のテキスト）や、通知が選択された場合のアクション（例えば、リンク先への画面遷移）等である。 The "notification ID" indicates an identifier for identifying the notification. "Notification information" indicates information regarding the content of the notification. The content of the notification is, for example, the title of the notification, the body of the notification (for example, the text of the text), the action when the notification is selected (for example, the screen transition to the link destination), and the like.

通知情報は、通知に対応するＢｏＷベクトル（ＢｏＷ表現とも呼ばれる）を含む。ＢｏＷベクトルの複数の成分のうちゼロでない少なくとも１つの成分は、通知の通知情報に含まれる少なくとも１つの単語に対応する。また、通知情報は、ＢｏＷベクトルに対応する埋め込みベクトルを含む。一例では、ＢｏＷベクトルは、通知のタイトルに対応する。例えば、通知のタイトルが「ジムに行く？」場合に、埋め込みベクトルは、単語「ジム」、「行く」にそれぞれ対応する２つの分散表現の平均ベクトルである。 The notification information includes a BoW vector (also referred to as a BoW representation) corresponding to the notification. At least one non-zero component of the plurality of components of the BoW vector corresponds to at least one word contained in the notification information of the notification. Further, the notification information includes an embedded vector corresponding to the Bow vector. In one example, the BoW vector corresponds to the title of the notification. For example, if the title of the notification is "Go to Jim?", The embedded vector is the average vector of the two distributed representations corresponding to the words "Jim" and "Go" respectively.

「選択情報」は、通知の選択に関する情報を示す。例えば、選択情報は、通知が選択されたか否か示す。例えば、選択情報は、通知が選択されたか否か示す選択フラグを含む。図４の例では、通知が選択されたことを示す選択フラグは、「１」である。この例では、通知が選択されなかったことを示す選択フラグは、「０」である。 "Selection information" indicates information regarding the selection of notifications. For example, the selection information indicates whether or not the notification has been selected. For example, the selection information includes a selection flag indicating whether or not the notification was selected. In the example of FIG. 4, the selection flag indicating that the notification has been selected is "1". In this example, the selection flag indicating that the notification was not selected is "0".

例えば、図４は、通知ＩＤ「Ｎ１」で識別される通知が、通知情報ＮＩ１に対応することを示している。また、例えば、図４は、ユーザＩＤ「Ｕ１」で識別されるユーザが、通知ＩＤ「Ｎ１」で識別される通知を選択したことを示している。 For example, FIG. 4 shows that the notification identified by the notification ID “N1” corresponds to the notification information NI1. Further, for example, FIG. 4 shows that the user identified by the user ID “U1” has selected the notification identified by the notification ID “N1”.

例えば、図４は、通知ＩＤ「ＮＭ１」～「ＮＭｍ」にそれぞれ対応する複数の通知が、ユーザＩＤ「Ｕ１」で識別されるユーザによって全て過去に選択されたことを示している。通知ＩＤ「ＮＭ１」～「ＮＭｍ」にそれぞれ対応する複数の通知は、注意機構のメモリに相当する。 For example, FIG. 4 shows that a plurality of notifications corresponding to the notification IDs “NM1” to “NMm” have all been selected in the past by the user identified by the user ID “U1”. The plurality of notifications corresponding to the notification IDs "NM1" to "NMm" correspond to the memory of the attention mechanism.

（判定モデル記憶部１２２）
図３に戻ると、判定モデル記憶部１２２は、配信対象に関連する判定モデルを記憶する。判定モデル記憶部１２２は、例えば、学習部１３３によって学習されたモデルであって、配信候補が所定の利用者と対応するか否かを判定するモデルを記憶する。 (Judgment model storage unit 122)
Returning to FIG. 3, the determination model storage unit 122 stores the determination model related to the distribution target. The determination model storage unit 122 stores, for example, a model learned by the learning unit 133 and determines whether or not the distribution candidate corresponds to a predetermined user.

（制御部１３０）
制御部１３０は、コントローラ（controller）であり、例えば、ＣＰＵ（Central Processing Unit）、ＭＰＵ（Micro Processing Unit）等のプロセッサによって、学習装置１００内部の記憶装置に記憶されている各種プログラムがＲＡＭ等を作業領域として実行されることにより実現される。また、制御部１３０は、コントローラ（controller）であり、例えば、ＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）等の集積回路により実現されてもよい。 (Control unit 130)
The control unit 130 is a controller, and for example, various programs stored in a storage device inside the learning device 100 by a processor such as a CPU (Central Processing Unit) or an MPU (Micro Processing Unit) store a RAM or the like. It is realized by being executed as a work area. Further, the control unit 130 is a controller, and may be realized by, for example, an integrated circuit such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array).

（受信部１３１）
受信部１３１は、提供装置２０から、コンテンツに関連する判定モデルを生成するための訓練データを受信する。訓練データは、通知の通知情報と、ユーザが通知を選択したか否かを示す選択情報とを含む。受信部１３１は、受信された訓練データを、訓練データ記憶部１２１に格納してもよい。 (Receiver 131)
The receiving unit 131 receives training data from the providing device 20 for generating a determination model related to the content. The training data includes notification information of the notification and selection information indicating whether or not the user has selected the notification. The receiving unit 131 may store the received training data in the training data storage unit 121.

受信部１３１は、提供装置２０から、配信対象を示す複数の配信対象ベクトルを受信する。例えば、受信部１３１は、配信対象となる文章に含まれる単語と対応する次元に所定の値が格納された配信対象ベクトルを受信する。一例では、受信部１３１は、通知に対応する埋め込みベクトルを受信する。上述の通知情報は、配信対象を示す複数の配信対象ベクトルを含んでもよい。受信部１３１は、受信された複数の配信対象ベクトルを、訓練データ記憶部１２１に格納してもよい。受信部１３１は、配信対象に含まれる情報を受信してもよい。 The receiving unit 131 receives a plurality of distribution target vectors indicating the distribution target from the providing device 20. For example, the receiving unit 131 receives a distribution target vector in which a predetermined value is stored in a dimension corresponding to a word included in a sentence to be distributed. In one example, the receiving unit 131 receives the embedded vector corresponding to the notification. The above-mentioned notification information may include a plurality of distribution target vectors indicating distribution targets. The receiving unit 131 may store a plurality of received distribution target vectors in the training data storage unit 121. The receiving unit 131 may receive the information included in the distribution target.

上述の配信対象は、例えば、通知（例えば、プッシュ通知）、ニュース記事、テキストベースのメッセージ（例えば、レコメント情報）等である。配信対象がプッシュ通知である場合に、上述の配信対象ベクトルは、例えば、プッシュ通知のタイトルに含まれる単語と対応する次元に所定の値が格納されたベクトル（例えば、分散表現）である。 The above-mentioned distribution target is, for example, a notification (for example, push notification), a news article, a text-based message (for example, recomment information), or the like. When the delivery target is a push notification, the above-mentioned delivery target vector is, for example, a vector (for example, a distributed representation) in which a predetermined value is stored in a dimension corresponding to a word included in the title of the push notification.

（取得部１３２）
取得部１３２は、コンテンツに関連する判定モデルを生成するための訓練データを取得する。取得部１３２は、例えば、受信部１３１によって受信された訓練データを取得する。取得部１３２は、訓練データ記憶部１２１から訓練データを取得してもよい。 (Acquisition unit 132)
The acquisition unit 132 acquires training data for generating a determination model related to the content. The acquisition unit 132 acquires, for example, the training data received by the reception unit 131. The acquisition unit 132 may acquire training data from the training data storage unit 121.

取得部１３２は、配信対象を示す複数の配信対象ベクトルを取得する。例えば、取得部１３２は、配信対象となる文章に含まれる単語と対応する次元に所定の値が格納された配信対象ベクトルを取得する。 The acquisition unit 132 acquires a plurality of distribution target vectors indicating the distribution target. For example, the acquisition unit 132 acquires a distribution target vector in which a predetermined value is stored in a dimension corresponding to a word included in a sentence to be distributed.

取得部１３２は、例えば、受信部１３１によって受信された配信対象ベクトルを取得する。取得部１３２は、訓練データ記憶部１２１から配信対象ベクトルを取得してもよい。 The acquisition unit 132 acquires, for example, the distribution target vector received by the reception unit 131. The acquisition unit 132 may acquire the distribution target vector from the training data storage unit 121.

（学習部１３３）
学習部１３３は、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルと、配信候補を示す配信候補ベクトルとが入力されると、配信候補ベクトルとの類似性に基づく重みが設定された複数の配信対象ベクトルと、配信候補ベクトルとに基づいて、配信候補が所定の利用者と対応するか否かを判定するモデルを、取得部１３２により取得された配信対象ベクトルを用いて学習する。例えば、学習部１３３は、所定の利用者が選択した配信対象を示す配信対象ベクトルと、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルとの共起性をモデルに学習させる。 (Learning Department 133)
When a plurality of distribution target vectors indicating distribution targets distributed to a predetermined user and distribution candidate vectors indicating distribution candidates are input, the learning unit 133 sets weights based on the similarity with the distribution candidate vectors. A model for determining whether or not a distribution candidate corresponds to a predetermined user based on a plurality of distribution target vectors and a distribution candidate vector is learned using the distribution target vector acquired by the acquisition unit 132. do. For example, the learning unit 133 learns the co-occurrence between the distribution target vector indicating the distribution target selected by the predetermined user and the plurality of distribution target vectors indicating the distribution target distributed to the predetermined user as a model. ..

より具体的には、学習部１３３は、所定の利用者に配信された配信対象が選択されたか否か示す選択情報（例えば、選択フラグ）に基づいて、所定の利用者が選択した配信対象を示す配信対象ベクトルと、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルとの共起性をモデルに学習させる。例えば、学習部１３３は、所定の利用者が選択した配信対象を示す配信対象ベクトルおよび所定の利用者に配信された配信対象を示す複数の配信対象ベクトルを訓練データの入力として用い、かつ所定の利用者に配信された配信対象が選択されたか否か示す選択情報を訓練データの出力として用いることで、所定の利用者が選択した配信対象を示す配信対象ベクトルと、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルとの共起性に関する距離をモデルに学習させる。 More specifically, the learning unit 133 selects the distribution target selected by the predetermined user based on the selection information (for example, the selection flag) indicating whether or not the distribution target delivered to the predetermined user is selected. The model learns the co-occurrence between the distribution target vector shown and a plurality of distribution target vectors indicating the distribution target delivered to a predetermined user. For example, the learning unit 133 uses a distribution target vector indicating a distribution target selected by a predetermined user and a plurality of distribution target vectors indicating a distribution target distributed to a predetermined user as input of training data, and is predetermined. By using the selection information indicating whether or not the distribution target distributed to the user has been selected as the output of the training data, the distribution target vector indicating the distribution target selected by the predetermined user and the distribution target vector indicating the distribution target selected by the predetermined user are distributed to the predetermined user. Let the model learn the distance related to co-occurrence with multiple distribution target vectors indicating the distribution target.

一例では、モデルは、注意機構付きのモデルである。この例では、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルは、複数の配信対象（例えば、プッシュ通知）にそれぞれ対応する複数の埋め込みベクトル（例えば、分散表現）である。また、この例では、所定の利用者が選択した配信対象を示す配信対象ベクトルは、所定の利用者（例えば、ユーザ）によって過去に選択された配信対象に対応する埋め込みベクトルである。言い換えると、所定の利用者が選択した配信対象を示す配信対象ベクトルは、注意機構のメモリに相当する。 In one example, the model is a model with an attention mechanism. In this example, the plurality of distribution target vectors indicating the distribution targets delivered to a predetermined user are a plurality of embedded vectors (for example, distributed representation) corresponding to the plurality of distribution targets (for example, push notification). Further, in this example, the distribution target vector indicating the distribution target selected by the predetermined user is an embedded vector corresponding to the distribution target previously selected by the predetermined user (for example, a user). In other words, the distribution target vector indicating the distribution target selected by a predetermined user corresponds to the memory of the attention mechanism.

学習部１３３は、所定の利用者が選択した配信対象を示す配信対象ベクトルの上述の重み（すなわち、配信候補ベクトルとの類似性に基づく重み）による重み付き平均から、配信候補が所定の利用者と対応するか否かを判定するモデルを学習する。例えば、学習部１３３は、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルのうちの１つと重み付き平均とを連結した連結ベクトルから、配信候補が所定の利用者と対応するか否かを判定するモデルを学習する。言い換えれば、学習部１３３は、複数の配信対象ベクトルのうちの１つと重み付き平均との連結（concatenation）であるベクトルから、モデルを学習する。例えば、学習部１３３は、次元数が削減されるように連結ベクトルを徐々に畳み込むことで、配信候補が所定の利用者と対応するか否かを示す値を生成するモデルを学習する。 In the learning unit 133, the distribution candidate is a predetermined user from the weighted average by the above-mentioned weight (that is, the weight based on the similarity with the distribution candidate vector) of the distribution target vector indicating the distribution target selected by the predetermined user. Learn a model that determines whether or not it corresponds to. For example, in the learning unit 133, the delivery candidate corresponds to the predetermined user from the concatenated vector in which one of the plurality of delivery target vectors indicating the delivery target delivered to the predetermined user and the weighted average are concatenated. Learn the model to determine whether or not. In other words, the learning unit 133 learns the model from a vector that is a concatenation of one of the plurality of distribution target vectors and the weighted average. For example, the learning unit 133 learns a model that generates a value indicating whether or not the distribution candidate corresponds to a predetermined user by gradually convolving the connection vector so that the number of dimensions is reduced.

一例では、学習部１３３は、所定の利用者に配信された配信対象に対応する埋め込みベクトルと、所定の利用者が選択した複数の配信対象にそれぞれ対応する複数の埋め込みベクトルとの間の複数の重みを算出する。そして、学習部１３３は、所定の利用者が選択した複数の配信対象にそれぞれ対応する複数の埋め込みベクトルのかかる複数の重みによる重み付き平均（すなわち、重み付き平均のベクトル）に対応する中間層を含む注意機構付きのモデルであって、配信候補が所定の利用者と対応するか否かを判定する注意機構付きのモデルを学習する。 In one example, the learning unit 133 has a plurality of embedded vectors corresponding to a distribution target delivered to a predetermined user and a plurality of embedded vectors corresponding to a plurality of distribution targets selected by the predetermined user. Calculate the weight. Then, the learning unit 133 provides an intermediate layer corresponding to the weighted average (that is, the weighted average vector) by the plurality of weights of the plurality of embedded vectors corresponding to the plurality of distribution targets selected by the predetermined user. A model with a caution mechanism including, and a model with a caution mechanism for determining whether or not a delivery candidate corresponds to a predetermined user is learned.

例えば、学習部１３３は、所定の利用者に配信された配信対象に対応する埋め込みベクトルと、所定の利用者が選択した複数の配信対象にそれぞれ対応する複数の埋め込みベクトルの複数の重みによる重み付き平均とを連結することで、第１のベクトルを生成する。次いで、学習部１３３は、生成された第１のベクトルに活性化関数を適用することで、隠れ状態ベクトルを生成する。次いで、学習部１３３は、重み行列に隠れ状態ベクトを乗ずることで、関連度を算出する。その後、学習部１３３は、所定の利用者に配信された配信対象が選択されたか否か示す選択情報（例えば、選択フラグ）と、算出された関連度とを用いて、配信候補が所定の利用者と対応するか否かを判定する注意機構付きのモデルを学習する。 For example, the learning unit 133 is weighted by a plurality of weights of an embedded vector corresponding to a distribution target delivered to a predetermined user and a plurality of embedded vectors corresponding to a plurality of distribution targets selected by the predetermined user. A first vector is generated by concatenating with the average. Next, the learning unit 133 generates a hidden state vector by applying an activation function to the generated first vector. Next, the learning unit 133 calculates the degree of relevance by multiplying the weight matrix by the hidden state vector. After that, the learning unit 133 uses the selection information (for example, a selection flag) indicating whether or not the distribution target distributed to the predetermined user has been selected, and the calculated relevance, and the distribution candidate uses the predetermined use. Learn a model with an attention mechanism that determines whether or not it corresponds to a person.

学習部１３３は、複数の配信対象ベクトルのうちの１つから、複数の配信対象ベクトルのうちの１つが示す配信対象に含まれる情報の特徴を示す特徴ベクトルの平均値の和となるベクトルを生成し、生成されたベクトルに基づいて、配信候補が所定の利用者と対応するか否かを判定するモデルを学習する。 The learning unit 133 generates a vector that is the sum of the average values of the feature vectors indicating the characteristics of the information included in the distribution target indicated by one of the plurality of distribution target vectors from one of the plurality of distribution target vectors. Then, based on the generated vector, a model for determining whether or not the distribution candidate corresponds to a predetermined user is learned.

一例では、情報の特徴を示す特徴ベクトルは、分散表現である。この例では、配信対象に含まれる情報が、第１～３の特徴を含む場合に、配信対象に対応する埋め込みベクトルは、第１～３の特徴にそれぞれ対応する３つの分散表現の平均ベクトルである。 In one example, the feature vector that represents the feature of the information is a distributed representation. In this example, when the information included in the distribution target includes the first to third features, the embedded vector corresponding to the delivery target is the average vector of the three distributed representations corresponding to the first to third features, respectively. be.

学習部１３３は、所定の利用者が選択した配信対象を示す配信対象ベクトルと、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルのうちの１つであり、かつ所定の利用者が選択した配信対象を示す配信対象ベクトルとが入力された際に、配信候補が所定の利用者と対応する旨を出力し、所定の利用者が選択した配信対象を示す配信対象ベクトルと、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルのうちの１つであり、かつ所定の利用者が選択しなかった配信対象を示す配信対象ベクトルとが入力された際に、配信候補が所定の利用者と対応しない旨を出力されるように、モデルを学習する。 The learning unit 133 is one of a distribution target vector indicating a distribution target selected by a predetermined user and a plurality of distribution target vectors indicating a distribution target distributed to a predetermined user, and is a predetermined use. When a distribution target vector indicating a distribution target selected by a user is input, it is output that the distribution candidate corresponds to a predetermined user, and a distribution target vector indicating a distribution target selected by the predetermined user and a distribution target vector. When one of a plurality of distribution target vectors indicating a distribution target delivered to a predetermined user and a distribution target vector indicating a distribution target not selected by the predetermined user is input, The model is learned so that it is output that the delivery candidate does not correspond to the predetermined user.

例えば、学習部１３３は、所定の利用者が選択した配信対象を示す配信対象ベクトル（例えば、テストデータである配信対象ベクトル）と、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルのうちの１つであり、かつ所定の利用者が選択した配信対象を示す配信対象ベクトル（例えば、注意機構のメモリ）とが入力された際に、モデルによって出力される値が、配信対象が選択されたことを示す値（例えば、選択フラグ「１」）に近づくように、訓練用のデータである配信対象ベクトルを用いてモデルを訓練する。また、例えば、学習部１３３は、正則化手法（例えば、Ｌ２正則化等）を用いて、汎化誤差が最小化されるようにモデルを学習する。 For example, the learning unit 133 has a distribution target vector indicating a distribution target selected by a predetermined user (for example, a distribution target vector which is test data), and a plurality of distribution targets indicating a distribution target distributed to a predetermined user. The value output by the model when one of the vectors and the distribution target vector indicating the distribution target selected by the predetermined user (for example, the memory of the attention mechanism) is input is the distribution target. The model is trained using the distribution target vector, which is the training data, so as to approach the value indicating that is selected (for example, the selection flag “1”). Further, for example, the learning unit 133 uses a regularization method (for example, L2 regularization, etc.) to learn the model so that the generalization error is minimized.

学習部１３３は、例えば、所定の利用者に配信された配信対象が選択されたか否か示す選択情報およびモデルの出力に対応する目的関数が最小化されるように、モデルを学習する。一例では、学習部１３３は、誤差逆伝播法を用いて、選択情報に含まれる選択フラグの値および注意機構付きのモデルの出力（例えば、上述の関連度）に対応する目的関数が最小化されるように、注意機構付きのモデルを学習する。 The learning unit 133 learns the model so that, for example, the selection information indicating whether or not the distribution target distributed to a predetermined user is selected and the objective function corresponding to the output of the model are minimized. In one example, the learning unit 133 uses an error backpropagation method to minimize the objective function corresponding to the value of the selection flag contained in the selection information and the output of the model with the attention mechanism (eg, the degree of relevance described above). So, learn a model with an attention mechanism.

（生成部１３４）
生成部１３４は、配信対象の候補となる配信対象に含まれる情報と対応する次元に所定の値が格納された候補ベクトルを生成する。例えば、生成部１３４は、所定のサーバから、配信対象の候補となる配信対象に含まれる情報に含まれるキーワードに対応する少なくとも１つのベクトル（例えば、分散表現）を取得する。そして、生成部１３４は、情報に含まれるキーワードに対応する少なくとも１つのベクトルに基づいて、候補ベクトルを生成する。 (Generation unit 134)
The generation unit 134 generates a candidate vector in which a predetermined value is stored in a dimension corresponding to the information included in the distribution target that is a candidate for the distribution target. For example, the generation unit 134 acquires at least one vector (for example, a distributed expression) corresponding to a keyword included in the information included in the distribution target that is a candidate for distribution target from a predetermined server. Then, the generation unit 134 generates a candidate vector based on at least one vector corresponding to the keyword included in the information.

生成部１３４は、配信対象に含まれる情報を用いて、配信対象を示す配信対象ベクトルを生成してもよい。また、配信対象に含まれる情報を用いて、配信対象に対応するＢｏＷベクトルを生成してもよい。 The generation unit 134 may generate a distribution target vector indicating a distribution target by using the information included in the distribution target. Further, the BoW vector corresponding to the distribution target may be generated by using the information included in the distribution target.

（判定部１３５）
判定部１３５は、学習部１３３により学習されたモデルを用いて、候補ベクトルから、配信対象の候補を所定の利用者に配信するか否かを判定する。また、判定部１３５は、候補ベクトルが入力された際に、候補ベクトルが示す配信候補が、所定の利用者と対応する旨をモデルが出力した場合は、配信候補を所定の利用者に配信すると判定する。 (Judgment unit 135)
The determination unit 135 determines whether or not to distribute the candidate to be distributed to a predetermined user from the candidate vector using the model learned by the learning unit 133. Further, when the model outputs that the distribution candidate indicated by the candidate vector corresponds to a predetermined user when the candidate vector is input, the determination unit 135 distributes the distribution candidate to the predetermined user. judge.

一例では、判定部１３５は、学習されたモデル（例えば、注意機構付きのモデル）を用いて、所定の利用者と配信対象の候補との間の関連性を示す関連性スコアを出力する。より具体的には、判定部１３５は、所定の利用者が選択した配信対象を示す配信対象ベクトルと、配信対象の候補に対応する候補ベクトルとを、モデルに入力することで、所定の利用者が配信対象の候補を選択する確率を、所定の利用者と配信対象の候補との間の関連性スコアとして出力する。 In one example, the determination unit 135 uses a learned model (for example, a model with an attention mechanism) to output a relevance score indicating a relevance between a predetermined user and a candidate to be delivered. More specifically, the determination unit 135 inputs a distribution target vector indicating a distribution target selected by a predetermined user and a candidate vector corresponding to the candidate of the distribution target into the model, so that the predetermined user Outputs the probability of selecting a candidate for distribution as a relevance score between a predetermined user and the candidate for distribution.

（提供部１３６）
提供部１３６は、学習部１３３により学習されたモデルを提供する。例えば、提供部１３６は、学習されたモデルを提供装置２０に提供する。提供部１３６は、判定部１３５による判定結果を、提供装置２０に提供してもよい。 (Providing Department 136)
The providing unit 136 provides a model learned by the learning unit 133. For example, the providing unit 136 provides the trained model to the providing device 20. The providing unit 136 may provide the determination result by the determination unit 135 to the providing device 20.

〔４．学習処理のフロー〕
次に、実施形態に係る学習装置１００による学習処理の手順について説明する。図５は、実施形態に係る学習装置１００による学習処理手順を示すフローチャートである。 [4. Learning process flow]
Next, the procedure of the learning process by the learning device 100 according to the embodiment will be described. FIG. 5 is a flowchart showing a learning processing procedure by the learning device 100 according to the embodiment.

図５に示すように、はじめに、学習装置１００は、配信対象を示す複数の配信対象ベクトルを取得する（ステップＳ１０１）。例えば、学習装置１００は、配信対象となる文章に含まれる単語と対応する次元に所定の値が格納された配信対象ベクトルを取得する。 As shown in FIG. 5, first, the learning device 100 acquires a plurality of distribution target vectors indicating distribution targets (step S101). For example, the learning device 100 acquires a distribution target vector in which a predetermined value is stored in a dimension corresponding to a word included in a sentence to be distributed.

次いで、学習装置１００は、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルと、配信候補を示す配信候補ベクトルとが入力されると、配信候補ベクトルとの類似性に基づく重みが設定された複数の配信対象ベクトルと、配信候補ベクトルとに基づいて、配信候補が所定の利用者と対応するか否かを判定するモデルを、取得された配信対象ベクトルを用いて学習する（ステップＳ１０２）。例えば、学習装置１００は、所定の利用者が選択した配信対象を示す配信対象ベクトルと、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルとの共起性をモデルに学習させる。 Next, when a plurality of distribution target vectors indicating distribution targets distributed to a predetermined user and distribution candidate vectors indicating distribution candidates are input, the learning device 100 weights based on the similarity with the distribution candidate vectors. A model for determining whether or not a distribution candidate corresponds to a predetermined user based on a plurality of distribution target vectors in which is set and a distribution candidate vector is learned using the acquired distribution target vector (. Step S102). For example, the learning device 100 makes a model learn the co-occurrence between a distribution target vector indicating a distribution target selected by a predetermined user and a plurality of distribution target vectors indicating a distribution target distributed to a predetermined user. ..

より具体的には、学習装置１００は、所定の利用者が選択した配信対象を示す配信対象ベクトルおよび所定の利用者に配信された配信対象を示す複数の配信対象ベクトルを訓練データの入力として用い、かつ所定の利用者に配信された配信対象が選択されたか否か示す選択情報を訓練データの出力として用いることで、所定の利用者が選択した配信対象を示す配信対象ベクトルと、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルとの共起性に関する距離をモデルに学習させる。 More specifically, the learning device 100 uses a distribution target vector indicating a distribution target selected by a predetermined user and a plurality of distribution target vectors indicating a distribution target distributed to a predetermined user as input of training data. In addition, by using the selection information indicating whether or not the distribution target delivered to the predetermined user has been selected as the output of the training data, the distribution target vector indicating the distribution target selected by the predetermined user and the predetermined use Let the model learn the distance related to the co-occurrence with a plurality of delivery target vectors indicating the delivery target delivered to the person.

〔５．その他〕
また、上記実施形態において説明した各処理のうち、自動的に行われるものとして説明した処理の一部を手動的に行うこともできる。あるいは、手動的に行われるものとして説明した処理の全部または一部を公知の方法で自動的に行うこともできる。この他、上記文書中や図面中で示した処理手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて任意に変更することができる。例えば、各図に示した各種情報は、図示した情報に限られない。 [5. others〕
Further, among the processes described in the above-described embodiment, a part of the processes described as being automatically performed can also be performed manually. Alternatively, all or part of the process described as being performed manually can be automatically performed by a known method. In addition, information including processing procedures, specific names, various data and parameters shown in the above documents and drawings can be arbitrarily changed unless otherwise specified. For example, the various information shown in each figure is not limited to the information shown in the figure.

また、図示した各装置の各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。 Further, each component of each of the illustrated devices is a functional concept, and does not necessarily have to be physically configured as shown in the figure. That is, the specific form of distribution / integration of each device is not limited to the one shown in the figure, and all or part of them may be functionally or physically distributed / physically in any unit according to various loads and usage conditions. Can be integrated and configured.

例えば、図３に示した記憶部１２０の一部又は全部は、学習装置１００によって保持されるのではなく、ストレージサーバ等に保持されてもよい。この場合、学習装置１００は、ストレージサーバにアクセスすることで、訓練データ等の各種情報を取得する。 For example, a part or all of the storage unit 120 shown in FIG. 3 may not be held by the learning device 100, but may be held by a storage server or the like. In this case, the learning device 100 acquires various information such as training data by accessing the storage server.

〔６．ハードウェア構成〕
また、上述してきた実施形態に係る学習装置１００は、例えば図６に示すような構成のコンピュータ１０００によって実現される。図６は、ハードウェア構成の一例を示す図である。コンピュータ１０００は、出力装置１０１０、入力装置１０２０と接続され、演算装置１０３０、一次記憶装置１０４０、二次記憶装置１０５０、出力ＩＦ（Interface）１０６０、入力ＩＦ１０７０、ネットワークＩＦ１０８０がバス１０９０により接続された形態を有する。 [6. Hardware configuration]
Further, the learning device 100 according to the above-described embodiment is realized by, for example, a computer 1000 having a configuration as shown in FIG. FIG. 6 is a diagram showing an example of a hardware configuration. The computer 1000 is connected to the output device 1010 and the input device 1020, and the arithmetic unit 1030, the primary storage device 1040, the secondary storage device 1050, the output IF (Interface) 1060, the input IF 1070, and the network IF 1080 are connected by the bus 1090. Have.

演算装置１０３０は、一次記憶装置１０４０や二次記憶装置１０５０に格納されたプログラムや入力装置１０２０から読み出したプログラム等に基づいて動作し、各種の処理を実行する。一次記憶装置１０４０は、ＲＡＭ等、演算装置１０３０が各種の演算に用いるデータを一時的に記憶するメモリ装置である。また、二次記憶装置１０５０は、演算装置１０３０が各種の演算に用いるデータや、各種のデータベースが登録される記憶装置であり、ＲＯＭ(Read Only Memory)、ＨＤＤ、フラッシュメモリ等により実現される。 The arithmetic unit 1030 operates based on a program stored in the primary storage device 1040 or the secondary storage device 1050, a program read from the input device 1020, or the like, and executes various processes. The primary storage device 1040 is a memory device such as a RAM that temporarily stores data used by the arithmetic unit 1030 for various operations. Further, the secondary storage device 1050 is a storage device in which data used by the arithmetic unit 1030 for various calculations and various databases are registered, and is realized by a ROM (Read Only Memory), an HDD, a flash memory, or the like.

出力ＩＦ１０６０は、モニタやプリンタといった各種の情報を出力する出力装置１０１０に対し、出力対象となる情報を送信するためのインタフェースであり、例えば、ＵＳＢ（Universal Serial Bus）やＤＶＩ（Digital Visual Interface）、ＨＤＭＩ（登録商標）（High Definition Multimedia Interface）といった規格のコネクタにより実現される。また、入力ＩＦ１０７０は、マウス、キーボード、およびスキャナ等といった各種の入力装置１０２０から情報を受信するためのインタフェースであり、例えば、ＵＳＢ等により実現される。 The output IF 1060 is an interface for transmitting information to be output to an output device 1010 that outputs various information such as a monitor and a printer. For example, USB (Universal Serial Bus), DVI (Digital Visual Interface), and the like. It is realized by a connector of a standard such as HDMI (registered trademark) (High Definition Multimedia Interface). Further, the input IF 1070 is an interface for receiving information from various input devices 1020 such as a mouse, a keyboard, a scanner, and the like, and is realized by, for example, USB.

なお、入力装置１０２０は、例えば、ＣＤ（Compact Disc）、ＤＶＤ（Digital Versatile Disc）、ＰＤ（Phase change rewritable Disk）等の光学記録媒体、ＭＯ（Magneto-Optical disk）等の光磁気記録媒体、テープ媒体、磁気記録媒体、または半導体メモリ等から情報を読み出す装置であってもよい。また、入力装置１０２０は、ＵＳＢメモリ等の外付け記憶媒体であってもよい。 The input device 1020 is, for example, an optical recording medium such as a CD (Compact Disc), a DVD (Digital Versatile Disc), a PD (Phase change rewritable Disk), a magneto-optical recording medium such as an MO (Magneto-Optical disk), or a tape. It may be a device that reads information from a medium, a magnetic recording medium, a semiconductor memory, or the like. Further, the input device 1020 may be an external storage medium such as a USB memory.

ネットワークＩＦ１０８０は、ネットワークＮを介して他の機器からデータを受信して演算装置１０３０へ送り、また、ネットワークＮを介して演算装置１０３０が生成したデータを他の機器へ送信する。 The network IF 1080 receives data from another device via the network N and sends it to the arithmetic unit 1030, and also transmits the data generated by the arithmetic unit 1030 to the other device via the network N.

演算装置１０３０は、出力ＩＦ１０６０や入力ＩＦ１０７０を介して、出力装置１０１０や入力装置１０２０の制御を行う。例えば、演算装置１０３０は、入力装置１０２０や二次記憶装置１０５０からプログラムを一次記憶装置１０４０上にロードし、ロードしたプログラムを実行する。 The arithmetic unit 1030 controls the output device 1010 and the input device 1020 via the output IF 1060 and the input IF 1070. For example, the arithmetic unit 1030 loads a program from the input device 1020 or the secondary storage device 1050 onto the primary storage device 1040, and executes the loaded program.

例えば、コンピュータ１０００が学習装置１００として機能する場合、コンピュータ１０００の演算装置１０３０は、一次記憶装置１０４０上にロードされたプログラムを実行することにより、制御部１３０の機能を実現する。 For example, when the computer 1000 functions as the learning device 100, the arithmetic unit 1030 of the computer 1000 realizes the function of the control unit 130 by executing the program loaded on the primary storage device 1040.

〔７．効果〕
上述してきたように、実施形態に係る学習装置１００は、取得部１３２と、学習部１３３とを有する。取得部１３２は、配信対象を示す複数の配信対象ベクトルを取得する。学習部１３３は、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルと、配信候補を示す配信候補ベクトルとが入力されると、配信候補ベクトルとの類似性に基づく重みが設定された複数の配信対象ベクトルと、配信候補ベクトルとに基づいて、配信候補が所定の利用者と対応するか否かを判定するモデルを、取得部１３２により取得された配信対象ベクトルを用いて学習する。 [7. effect〕
As described above, the learning device 100 according to the embodiment has an acquisition unit 132 and a learning unit 133. The acquisition unit 132 acquires a plurality of distribution target vectors indicating the distribution target. When a plurality of distribution target vectors indicating distribution targets distributed to a predetermined user and distribution candidate vectors indicating distribution candidates are input, the learning unit 133 sets weights based on the similarity with the distribution candidate vectors. A model for determining whether or not a distribution candidate corresponds to a predetermined user based on a plurality of distribution target vectors and a distribution candidate vector is learned using the distribution target vector acquired by the acquisition unit 132. do.

また、実施形態に係る学習装置１００において、取得部１３２は、配信対象となる文章に含まれる単語と対応する次元に所定の値が格納された配信対象ベクトルを取得する。 Further, in the learning device 100 according to the embodiment, the acquisition unit 132 acquires a distribution target vector in which a predetermined value is stored in a dimension corresponding to a word included in a sentence to be distributed.

また、実施形態に係る学習装置１００において、学習部１３３は、所定の利用者が選択した配信対象を示す配信対象ベクトルと、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルとの共起性を前記モデルに学習させる。 Further, in the learning device 100 according to the embodiment, the learning unit 133 includes a distribution target vector indicating a distribution target selected by a predetermined user, and a plurality of distribution target vectors indicating a distribution target distributed to the predetermined user. Let the model learn the co-occurrence of.

また、実施形態に係る学習装置１００において、学習部１３３は、配信候補ベクトルとの類似性に基づく重みによる、所定の利用者が選択した配信対象を示す配信対象ベクトルの重み付き平均から、配信候補が所定の利用者と対応するか否かを判定するモデルを学習する。 Further, in the learning device 100 according to the embodiment, the learning unit 133 is a distribution candidate from a weighted average of the distribution target vectors indicating the distribution target selected by a predetermined user by the weight based on the similarity with the distribution candidate vector. Learns a model that determines whether or not corresponds to a given user.

また、実施形態に係る学習装置１００において、学習部１３３は、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルのうちの１つと重み付き平均とを連結した連結ベクトルから、配信候補が所定の利用者と対応するか否かを判定するモデルを学習する。 Further, in the learning device 100 according to the embodiment, the learning unit 133 distributes from a concatenated vector in which one of a plurality of distribution target vectors indicating a distribution target distributed to a predetermined user and a weighted average are concatenated. Learn a model that determines whether a candidate corresponds to a given user.

また、実施形態に係る学習装置１００において、学習部１３３は、次元数が削減されるように連結ベクトルを徐々に畳み込むことで、配信候補が所定の利用者と対応するか否かを示す値を生成するモデルを学習する。 Further, in the learning device 100 according to the embodiment, the learning unit 133 gradually convolves the connection vector so that the number of dimensions is reduced, so that a value indicating whether or not the distribution candidate corresponds to a predetermined user is set. Learn the model to generate.

また、実施形態に係る学習装置１００において、学習部１３３は、複数の配信対象ベクトルのうちの１つから、複数の配信対象ベクトルのうちの１つが示す配信対象に含まれる情報の特徴を示す特徴ベクトルの平均値の和となるベクトルを生成し、生成されたベクトルに基づいて、配信候補が前記所定の利用者と対応するか否かを判定するモデルを学習する。 Further, in the learning device 100 according to the embodiment, the learning unit 133 is a feature showing the characteristics of the information included in the distribution target indicated by one of the plurality of distribution target vectors from one of the plurality of distribution target vectors. A vector to be the sum of the average values of the vectors is generated, and a model for determining whether or not the distribution candidate corresponds to the predetermined user is learned based on the generated vector.

また、実施形態に係る学習装置１００において、学習部１３３は、所定の利用者が選択した配信対象を示す配信対象ベクトルと、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルのうちの１つであり、かつ所定の利用者が選択した配信対象を示す配信対象ベクトルとが入力された際に、配信候補が所定の利用者と対応する旨を出力し、所定の利用者が選択した配信対象を示す配信対象ベクトルと、所定の利用者に配信された配信対象を示す複数の配信対象ベクトルのうちの１つであり、かつ所定の利用者が選択しなかった配信対象を示す配信対象ベクトルとが入力された際に、配信候補が所定の利用者と対応しない旨を出力されるように、モデルを学習する。 Further, in the learning device 100 according to the embodiment, the learning unit 133 has a distribution target vector indicating a distribution target selected by a predetermined user, and a plurality of distribution target vectors indicating a distribution target distributed to the predetermined user. When one of them and the distribution target vector indicating the distribution target selected by the predetermined user is input, the distribution candidate outputs that the distribution candidate corresponds to the predetermined user, and the predetermined user Indicates a distribution target vector indicating a selected distribution target and a distribution target that is one of a plurality of distribution target vectors indicating a distribution target distributed to a predetermined user and is not selected by the predetermined user. The model is learned so that when the distribution target vector is input, it is output that the distribution candidate does not correspond to the predetermined user.

また、実施形態に係る学習装置１００は、配信対象の候補となる配信対象に含まれる情報と対応する次元に所定の値が格納された候補ベクトルを生成する生成部１３４を有する。また、実施形態に係る学習装置１００は、学習部１３３により学習されたモデルを用いて、候補ベクトルから、配信対象の候補を所定の利用者に配信するか否かを判定する判定部１３５を有する。 Further, the learning device 100 according to the embodiment has a generation unit 134 that generates a candidate vector in which a predetermined value is stored in a dimension corresponding to the information included in the distribution target that is a candidate for the distribution target. Further, the learning device 100 according to the embodiment has a determination unit 135 that determines whether or not to distribute the candidate to be distributed to a predetermined user from the candidate vector using the model learned by the learning unit 133. ..

また、実施形態に係る学習装置１００において、判定部１３５は、候補ベクトルが入力された際に、候補ベクトルが示す配信候補が、所定の利用者と対応する旨をモデルが出力した場合は、配信候補を所定の利用者に配信すると判定する。 Further, in the learning device 100 according to the embodiment, when the candidate vector is input, the determination unit 135 distributes when the model outputs that the distribution candidate indicated by the candidate vector corresponds to a predetermined user. It is determined that the candidate will be delivered to a predetermined user.

上述した各処理により、学習装置１００は、所定の通知がユーザに適合するか否かを判定する判定モデルを学習することができる。 By each of the above-mentioned processes, the learning device 100 can learn a determination model for determining whether or not a predetermined notification is suitable for the user.

以上、本願の実施形態のいくつかを図面に基づいて詳細に説明したが、これらは例示であり、発明の開示の欄に記載の態様を始めとして、当業者の知識に基づいて種々の変形、改良を施した他の形態で本発明を実施することが可能である。 Although some of the embodiments of the present application have been described in detail with reference to the drawings, these are examples, and various modifications are made based on the knowledge of those skilled in the art, including the embodiments described in the disclosure column of the invention. It is possible to carry out the present invention in other modified forms.

また、上述した学習装置１００は、複数のサーバコンピュータで実現してもよく、また、機能によっては外部のプラットフォーム等をＡＰＩ（Application Programming Interface）やネットワークコンピューティングなどで呼び出して実現するなど、構成は柔軟に変更できる。 Further, the above-mentioned learning device 100 may be realized by a plurality of server computers, and depending on the function, it may be realized by calling an external platform or the like by API (Application Programming Interface), network computing, or the like. It can be changed flexibly.

また、上述してきた「部（section、module、unit）」は、「手段」や「回路」などに読み替えることができる。例えば、受信部は、受信手段や受信回路に読み替えることができる。 Further, the above-mentioned "section, module, unit" can be read as "means" or "circuit". For example, the receiving unit can be read as a receiving means or a receiving circuit.

１ネットワークシステム
１０端末装置
２０提供装置
１００学習装置
１２０記憶部
１２１訓練データ記憶部
１２２判定モデル記憶部
１３０制御部
１３１受信部
１３２取得部
１３３学習部
１３４生成部
１３５判定部
１３６提供部 1 Network system 10 Terminal device 20 Providing device 100 Learning device 120 Storage unit 121 Training data storage unit 122 Judgment model storage unit 130 Control unit 131 Reception unit 132 Acquisition unit 133 Learning unit 134 Generation unit 135 Judgment unit 136 Providing unit

Claims

A first distribution target vector indicating a first distribution target distributed to a predetermined user, and a plurality of second distribution targets distributed to the predetermined user and selected by the predetermined user. An acquisition unit that acquires a plurality of second distribution target vectors, respectively, and selection information indicating whether or not the predetermined user has selected the first distribution target .
Using the first distribution target vector and the plurality of second distribution target vectors as input of training data and using the selection information as output of the training data, the predetermined user becomes a distribution candidate . It is a model for determining whether or not it matches, and uses the distribution candidate vector indicating the distribution candidate and the plurality of second distribution target vectors as inputs, and the distribution candidate vector and the plurality of the second distribution targets. A learning device characterized by having a learning unit for learning a model in which a plurality of weights are set between the distribution target vector and the vector .

The acquisition unit
The first distribution target vector corresponding to the vector in which a predetermined value is stored in the dimension corresponding to the word included in the first distribution target sentence is acquired , and the corresponding second distribution target sentence is obtained. Acquires each second distribution target vector corresponding to the vector in which a predetermined value is stored in the dimension corresponding to the word contained in.
The learning device according to claim 1.

The learning unit
From the weighted average of the plurality of second distribution target vectors calculated by using the weights between the first distribution target vector and the plurality of second distribution target vectors, the predetermined user is said to be the said. The predetermined probability of selecting the first distribution target is calculated, and the selection information and the calculated probability are used to determine whether or not the predetermined user is suitable for the distribution candidate. The learning device according to claim 1 , wherein the user learns the model for calculating the probability of selecting the delivery candidate .

The learning unit
From the connection vector generated by concatenating the first distribution target vector and the weighted average, the probability that the predetermined user selects the first distribution target is calculated.
The learning device according to claim 3 , wherein the learning device is characterized by the above.

The learning unit
By applying an activation function to the concatenated vector, a hidden state vector is generated, and by multiplying the generated hidden state vector by a weight matrix, the probability that the predetermined user selects the first delivery target. To calculate
The learning device according to claim 4 , wherein the learning device is characterized in that.

The learning unit
An average vector of a plurality of distributed expressions corresponding to a plurality of words included in the first distribution target sentence is generated as the first distribution target vector, and the corresponding second distribution target sentence is generated. Generate an average vector of multiple distributed representations corresponding to each of the included words as a second distribution target vector for each.
The learning device according to any one of claims 1 to 5 , wherein the learning device is characterized in that.

A generation unit that generates the distribution candidate vector using the information included in the distribution candidate, and
By inputting the distribution candidate vector generated by the generation unit and the plurality of second distribution target vectors into the model learned by the learning unit, the predetermined user selects the distribution candidate. The learning device according to any one of claims 1 to 6 , further comprising a determination unit that outputs a probability as a relevance score indicating a relevance between the predetermined user and a delivery candidate. ..

The way the learning device performs,
A first distribution target vector indicating a first distribution target distributed to a predetermined user, and a plurality of second distribution targets distributed to the predetermined user and selected by the predetermined user. An acquisition process for acquiring a plurality of second distribution target vectors, respectively, and selection information indicating whether or not the predetermined user has selected the first distribution target .
Using the first distribution target vector and the plurality of second distribution target vectors as input of training data and using the selection information as output of the training data, the predetermined user becomes a distribution candidate . It is a model for determining whether or not it matches, and uses the distribution candidate vector indicating the distribution candidate and the plurality of second distribution target vectors as inputs, and the distribution candidate vector and the plurality of the second distribution targets. A learning method comprising a learning process of learning a model in which multiple weights are set with and to a delivery target vector of .

A first distribution target vector indicating a first distribution target distributed to a predetermined user, and a plurality of second distribution targets distributed to the predetermined user and selected by the predetermined user. An acquisition procedure for acquiring a plurality of second distribution target vectors, respectively, and selection information indicating whether or not the predetermined user has selected the first distribution target, and an acquisition procedure.
Using the first distribution target vector and the plurality of second distribution target vectors as input of training data and using the selection information as output of the training data, the predetermined user becomes a distribution candidate . It is a model for determining whether or not it is compatible, and uses the distribution candidate vector indicating the distribution candidate and the plurality of second distribution target vectors as inputs, and the distribution candidate vector and the plurality of the second distribution targets. A learning program for a computer to perform a learning procedure to learn a model in which multiple weights are set between the delivery target vector and the .