JP2021149408A

JP2021149408A - Information processing apparatus, information processing method, and information processing program

Info

Publication number: JP2021149408A
Application number: JP2020047633A
Authority: JP
Inventors: 俊平大倉; Shumpei Okura; 智亮岩瀬; Tomoaki Iwase; 翔悟湯浅; Shogo Yuasa; 直樹石川; Naoki Ishikawa
Original assignee: Yahoo Japan Corp
Current assignee: Yahoo Japan Corp
Priority date: 2020-03-18
Filing date: 2020-03-18
Publication date: 2021-09-27
Anticipated expiration: 2040-03-18
Also published as: JP7293158B2

Abstract

To provide an information processing apparatus which can properly determine a content in which a user is interested, an information processing method, and an information processing program.SOLUTION: A server device 10 includes: a score calculation unit 134 which calculates a score indicating how a user is interested in a content, on the basis of a user feature vector indicating characteristics of the user and a content feature vector indicating characteristics of the content; a score correction unit 135 which corrects the score by adding a random number of which an absolute value is equal to or smaller than a first threshold to a score for each content; and a distribution unit 136 which selects a predetermined number of contents with the corrected scores in descending order and distributes them to a terminal device of the user.SELECTED DRAWING: Figure 2

Description

本発明は、ユーザにコンテンツをレコメンドする情報処理装置、情報処理方法、及び情報処理プログラムに関する。 The present invention relates to an information processing device, an information processing method, and an information processing program that recommends contents to a user.

従来、インターネットを介して、ニュースや広告等のコンテンツを、ユーザが所有する端末装置にレコメンドする情報処理装置が知られている（例えば、特許文献１参照）。
このような従来の情報処理装置では、ユーザの特徴を示すユーザ特徴ベクトルと、コンテンツの特徴を示すコンテンツ特徴ベクトルとを用いて、ユーザに対するコンテンツのスコアを算出し、当該スコアが高い順にコンテンツを並び替えて、ユーザの端末装置に配信（レコメンド）する。 Conventionally, there is known an information processing device that recommends contents such as news and advertisements to a terminal device owned by a user via the Internet (see, for example, Patent Document 1).
In such a conventional information processing device, a content score for a user is calculated using a user feature vector indicating the user's characteristics and a content feature vector indicating the content characteristics, and the contents are arranged in descending order of the score. Instead, it is distributed (recommended) to the user's terminal device.

特開２０１０-２６２３８３号公報Japanese Unexamined Patent Publication No. 2010-262383

ところで、上記のようなレコメンドを配信する情報処理装置において、各コンテンツを類似度が近い順に横軸に並べ、各コンテンツに対するユーザのスコアを縦軸としたグラフ（スコアを曲線で示したグラフ）を用いると、特定のコンテンツでスコアがピークとなり、当該コンテンツの近傍に配置されたコンテンツのスコアも総じて高くなる。これは、特定のコンテンツに対応したジャンルに対して、ユーザが興味を持っていることを示している。
従来の情報処理装置では、このようなピーク近傍のコンテンツを優先的にユーザにレコメンドする。ここで、ピークのスコアが十分に高い場合、ユーザの当該ジャンルへの興味が際立っていることを示しており、適切なコンテンツのレコメンドができているといえる。
一方で、ピークのスコアが小さく、全てのコンテンツのスコアがなだらかに変化する場合がある。この場合、ユーザがどのようなジャンルに興味があるのか不明であり、ピークに対応するジャンルがユーザの興味が高いジャンルとは限らない。したがって、ユーザの興味があるジャンルを適切に判定するために、多種ジャンルのコンテンツを配信することが好ましい。しかしながら、従来の情報処理装置では、このような場合でも、ピークに対応するコンテンツを優先的に配信してしまうため、ユーザの興味があるコンテンツを適切に判定することが困難となる。 By the way, in an information processing device that delivers recommendations as described above, a graph (graph in which the scores are shown by curves) in which each content is arranged on the horizontal axis in order of similarity and the user's score for each content is on the vertical axis is displayed. When used, the score peaks for specific content, and the score for content placed in the vicinity of the content is generally high. This indicates that the user is interested in the genre corresponding to the specific content.
In the conventional information processing device, the content near the peak is preferentially recommended to the user. Here, when the peak score is sufficiently high, it indicates that the user's interest in the genre is outstanding, and it can be said that appropriate content is recommended.
On the other hand, the peak score is small, and the scores of all contents may change gently. In this case, it is unclear what kind of genre the user is interested in, and the genre corresponding to the peak is not necessarily the genre that the user is highly interested in. Therefore, it is preferable to distribute contents of various genres in order to appropriately determine the genre of interest to the user. However, in the conventional information processing apparatus, even in such a case, the content corresponding to the peak is preferentially delivered, so that it is difficult to appropriately determine the content of interest to the user.

本発明は、ユーザの興味があるコンテンツを適切に判定可能な情報処理装置、情報処理方法、及び情報処理プログラムを提供することを目的とする。 An object of the present invention is to provide an information processing device, an information processing method, and an information processing program capable of appropriately determining a content of interest to a user.

本発明の情報処理装置は、ユーザの特徴を示すユーザ特徴ベクトル、及び、コンテンツの特徴を示すコンテンツ特徴ベクトルに基づいて、前記ユーザの前記コンテンツへの興味を示すスコアを算出するスコア算出部と、各前記コンテンツに対する前記スコアに対して、絶対値が第一閾値以下となる乱数を加算して前記スコアを補正するスコア補正部と、補正された前記スコアが高い順に所定数の前記コンテンツを選択して前記ユーザが所有する端末装置に配信する配信部と、を備える。
The information processing apparatus of the present invention includes a score calculation unit that calculates a score indicating the user's interest in the content based on a user feature vector indicating the user's characteristics and a content feature vector indicating the content characteristics. A score correction unit that corrects the score by adding a random number whose absolute value is equal to or less than the first threshold value to the score for each content, and a predetermined number of the contents in descending order of the corrected score are selected. It is provided with a distribution unit that distributes the content to the terminal device owned by the user.

本発明では、各コンテンツに対するスコアが一様であり、ユーザの興味があるジャンルを特定できない場合に、多種多様なジャンルのコンテンツをランダムでレコメンド配信することができる。これにより、当該レコメンドに対して、ユーザがいずれかを選択することで、ユーザが興味のあるコンテンツのジャンルを特定でき、ユーザの興味があるコンテンツを適切に判定することができる。 In the present invention, when the score for each content is uniform and the genre of interest of the user cannot be specified, the content of a wide variety of genres can be randomly recommended and distributed. Thereby, when the user selects one of the recommendations, the genre of the content that the user is interested in can be specified, and the content that the user is interested in can be appropriately determined.

本発明の一実施形態の情報処理システムを示す概略図。The schematic diagram which shows the information processing system of one Embodiment of this invention. 本実施形態のサーバ装置の概略構成を示すブロック図。The block diagram which shows the schematic structure of the server apparatus of this embodiment. 本実施形態における情報処理方法を示すフローチャート。The flowchart which shows the information processing method in this embodiment. 本実施形態のステップＳ３で算出されたスコアの一例を示す図。The figure which shows an example of the score calculated in step S3 of this embodiment. 本実施形態のステップＳ３で算出されたスコアの一例を示す図。The figure which shows an example of the score calculated in step S3 of this embodiment. 本実施形態において、図４のスコアに対して乱数を加算した補正スコアの一例を示す図。In this embodiment, the figure which shows an example of the correction score which added the random number to the score of FIG. 本実施形態において、図５のスコアに対して乱数を加算した補正スコアの一例を示す図。In this embodiment, the figure which shows an example of the correction score which added the random number to the score of FIG. 図４のスコアに対する従来のレコメンド表示欄の一例を示す図。The figure which shows an example of the conventional recommendation display column for the score of FIG. 図６の補正スコアに基づいた、本実施形態のレコメンド表示欄の一例を示す図The figure which shows an example of the recommendation display column of this embodiment based on the correction score of FIG. 図７の補正スコアに基づいた、本実施形態のレコメンド表示欄の一例を示す図。The figure which shows an example of the recommendation display column of this embodiment based on the correction score of FIG.

以下、本発明に係る一実施形態について説明する。
［情報処理システムの概要］
図１は、本実施形態の情報処理システムを示す概略図である。
本実施形態の情報処理システムは、図１に示すように、情報処理装置であるサーバ装置１０、サーバ装置１０に対してインターネットを介して接続された複数の端末装置２０を備える。この情報処理システムは、サーバ装置１０から端末装置２０にコンテンツであるニュースコンテンツ（以降、ニュースと略す場合がある）をレコメンドするニュース配信システムであり、端末装置２０を所有するユーザの特徴に応じたニュースをレコメンドする。
以下、このような情報処理システムの、特にサーバ装置１０について詳細に説明する。 Hereinafter, an embodiment according to the present invention will be described.
[Overview of information processing system]
FIG. 1 is a schematic view showing an information processing system of the present embodiment.
As shown in FIG. 1, the information processing system of the present embodiment includes a server device 10 which is an information processing device, and a plurality of terminal devices 20 connected to the server device 10 via the Internet. This information processing system is a news distribution system that recommends news content (hereinafter, may be abbreviated as news) which is content from the server device 10 to the terminal device 20, and corresponds to the characteristics of the user who owns the terminal device 20. Recommend news.
Hereinafter, such an information processing system, particularly the server device 10, will be described in detail.

［サーバ装置１０の構成］
図２は、サーバ装置１０の概略構成を示すブロック図である。
サーバ装置１０は、一般的なコンピュータにより構成されており、図２に示すように、通信部１１、記憶部１２、プロセッサ１３等の、コンピュータを構成する各部を備えている。なお、サーバ装置１０を構成するコンピュータの数は特に限定されない。例えば、１台のコンピュータによってサーバ装置１０が構成されてもよく、複数のコンピュータをネットワークで接続して構築されるクラウドサーバをサーバ装置１０としてもよい。 [Configuration of server device 10]
FIG. 2 is a block diagram showing a schematic configuration of the server device 10.
The server device 10 is composed of a general computer, and as shown in FIG. 2, includes each part constituting the computer, such as a communication unit 11, a storage unit 12, and a processor 13. The number of computers constituting the server device 10 is not particularly limited. For example, the server device 10 may be configured by one computer, or a cloud server constructed by connecting a plurality of computers via a network may be used as the server device 10.

通信部１１は、インターネットに接続され、インターネットを介して端末装置２０等の各装置と通信する。 The communication unit 11 is connected to the Internet and communicates with each device such as the terminal device 20 via the Internet.

記憶部１２は、サーバ装置１０を制御するための各種情報や情報処理プログラムを記録する。
また、記憶部１２は、端末装置２０に配信するニュースを記録するニュース蓄積部１２１、及び、各ユーザに関するユーザ情報を記録するユーザ情報蓄積部１２２等のデータベースを備えている。
なお、ここでは、サーバ装置１０の記憶部１２に、ニュース蓄積部１２１やユーザ情報蓄積部１２２が設けられる例を示すが、サーバ装置１０とネットワークを介して通信可能に接続された他のデータサーバやクラウドストレージに、これらの情報が記録される構成としてもよい。 The storage unit 12 records various information and information processing programs for controlling the server device 10.
Further, the storage unit 12 includes a database such as a news storage unit 121 that records news delivered to the terminal device 20 and a user information storage unit 122 that records user information related to each user.
Here, an example is shown in which a news storage unit 121 and a user information storage unit 122 are provided in the storage unit 12 of the server device 10, but another data server that is communicably connected to the server device 10 via a network. Or cloud storage may be configured to record this information.

ニュース蓄積部１２１には、端末装置２０に配信可能なニュースが蓄積されるデータベースであり、具体的には、複数のニュース情報が記録されている。これらのニュース情報は、例えばサーバ装置１０の管理者が生成してもよく、ニュースの編集者が所有する編集者端末から送信されるものであってもよい。あるいは、サーバ装置１０が、インターネット上の所定のウェブコンテンツをクローリングして収集してもよい。
ニュース情報は、コンテンツＩＤ、入稿日時情報、ニュース詳細情報等を含む。また、ニュース情報として、さらに、ニュースのジャンル等が記録されていてもよい。
コンテンツＩＤは、ニュース情報を識別する識別情報である。
入稿日時情報はニュースが入稿された日時である。
ニュース詳細情報は、ニュースの詳細な内容であり、例えばテキストデータや、イメージデータを含んで構成されている。ニュース詳細情報に含まれるテキストデータには、ニュースの詳細に関係する多数のキーワードが含まれる。 The news storage unit 121 is a database in which news that can be distributed to the terminal device 20 is stored, and specifically, a plurality of news information is recorded. These news information may be generated by, for example, the administrator of the server device 10, or may be transmitted from an editor terminal owned by the editor of the news. Alternatively, the server device 10 may crawl and collect predetermined web content on the Internet.
The news information includes content ID, submission date / time information, detailed news information, and the like. Further, as news information, the genre of news and the like may be further recorded.
The content ID is identification information that identifies news information.
The submission date and time information is the date and time when the news was submitted.
The detailed news information is the detailed content of the news, and is configured to include, for example, text data and image data. The text data contained in the news details includes a number of keywords related to the news details.

ユーザ情報蓄積部１２２には、上述のように複数のユーザ情報が記録されている。このユーザ情報は、ユーザＩＤ、ユーザ属性情報、閲覧履歴情報等を含む。
ユーザＩＤは、ユーザを識別する識別情報である。
ユーザ属性情報は、ユーザの様々な属性を記録する。例えば、ユーザの性別、年齢層、居所等の個人情報が含まれてもよく、ユーザの趣味等の嗜好性が記録されていてもよい。
閲覧履歴情報は、サーバ装置１０からレコメンドされたニュースに対して、ユーザが選択（閲覧）したニュースの履歴情報である。
なお、ユーザ情報としては、その他、ユーザが端末装置２０を操作して実施した検索処理における検索キーワードの履歴、インターネットを介した商品売買履歴、ユーザのウェブコンテンツの閲覧履歴等が含まれてもよい。 As described above, a plurality of user information is recorded in the user information storage unit 122. This user information includes user ID, user attribute information, browsing history information, and the like.
The user ID is identification information that identifies the user.
The user attribute information records various attributes of the user. For example, personal information such as the user's gender, age group, and whereabouts may be included, and the user's hobbies and other preferences may be recorded.
The browsing history information is the history information of the news selected (browsed) by the user with respect to the news recommended by the server device 10.
The user information may also include a history of search keywords in a search process performed by the user operating the terminal device 20, a history of buying and selling products via the Internet, a history of browsing user web contents, and the like. ..

プロセッサ１３は、ＣＰＵ等の演算回路、ＲＡＭ等の記録回路により構成される。プロセッサ１３は、記憶部１２に記録されている情報処理プログラムをＲＡＭに展開し、ＲＡＭに展開されたプログラムとの協働で、各種処理を実行する。
そして、プロセッサ１３は、情報処理プログラムを読み込み実行することで、図２に示すように、ユーザ情報取得部１３１、ユーザ特徴算出部１３２（ユーザ補正部）、コンテンツ特徴算出部１３３、スコア算出部１３４、スコア補正部１３５、及び配信部１３６として機能する。 The processor 13 is composed of an arithmetic circuit such as a CPU and a recording circuit such as a RAM. The processor 13 expands the information processing program recorded in the storage unit 12 into the RAM, and executes various processes in cooperation with the program expanded in the RAM.
Then, the processor 13 reads and executes the information processing program, and as shown in FIG. 2, the user information acquisition unit 131, the user feature calculation unit 132 (user correction unit), the content feature calculation unit 133, and the score calculation unit 134. , The score correction unit 135, and the distribution unit 136.

ユーザ情報取得部１３１は、本発明の選択コンテンツ取得部としても機能し、端末装置２０からユーザ情報を取得する。すなわち、ユーザ情報取得部１３１は、サーバ装置１０から端末装置２０にレコメンドしたニュースに対して、ユーザがいずれかのニュースを選択及び閲覧した場合に、その選択されたニュースを選択コンテンツとして取得する。つまり、ユーザのニュースの閲覧履歴情報を取得する。
また、ユーザ情報取得部１３１は、その他、端末装置２０でのユーザの入力操作によって入力されたユーザ属性を取得したり、検索履歴等の他の行動履歴を取得したりしてもよい。 The user information acquisition unit 131 also functions as a selected content acquisition unit of the present invention, and acquires user information from the terminal device 20. That is, when the user selects and browses any news with respect to the news recommended from the server device 10 to the terminal device 20, the user information acquisition unit 131 acquires the selected news as selected content. That is, the browsing history information of the user's news is acquired.
In addition, the user information acquisition unit 131 may acquire the user attributes input by the user's input operation on the terminal device 20, or acquire other action histories such as the search history.

ユーザ特徴算出部１３２は、本発明のユーザ補正部としても機能し、ユーザ情報取得部１３１によって取得されたユーザ情報に基づいて、ユーザの特徴を示す複数次元の情報であるユーザ特徴ベクトルを算出する。なお、本実施形態では、ユーザ特徴算出部１３２は、ユーザ情報の閲覧履歴情報に基づいて、ユーザ特徴ベクトルを算出する例を示すが、閲覧履歴情報に加え、ユーザ属性や、その他の行動履歴情報などを用いてユーザ特徴ベクトルを算出してもよい。なお、ユーザ特徴ベクトルの算出は公知の技術を利用でき、例えば、適合フィードバックなどを利用できる。 The user feature calculation unit 132 also functions as a user correction unit of the present invention, and calculates a user feature vector, which is multidimensional information indicating a user's characteristics, based on the user information acquired by the user information acquisition unit 131. .. In this embodiment, the user feature calculation unit 132 calculates an example of calculating the user feature vector based on the browsing history information of the user information. However, in addition to the browsing history information, the user attribute and other action history information The user feature vector may be calculated using the above. A known technique can be used for the calculation of the user feature vector, and for example, conformity feedback can be used.

コンテンツ特徴算出部１３３は、ニュースの特徴であるコンテンツ特徴ベクトルを、ニュース詳細情報に基づいて算出する。上述したように、ニュース詳細情報は、ニュースの内容を示すテキストデータを含み、多数のキーワードが含まれる。コンテンツ特徴ベクトルの算出は公知の技術を利用でき、例えば、ｔｆ−ｉｄｆ（Term Frequency - Inverse Document Frequency）、Ｋｅｙ-Ｖａｌｕｅ方式、Ｗ２Ｖ（Word 2 Vector）等の技術を用いて、ニュース詳細情報からコンテンツ特徴ベクトルを算出する。 The content feature calculation unit 133 calculates the content feature vector, which is a feature of the news, based on the detailed news information. As described above, the detailed news information includes text data indicating the content of the news and includes a large number of keywords. Known technology can be used to calculate the content feature vector. For example, using technologies such as tf-idf (Term Frequency --Inverse Document Frequency), Key-Value method, and W2V (Word 2 Vector), content can be calculated from detailed news information. Calculate the feature vector.

スコア算出部１３４は、ニュース蓄積部１２１に記録される各ニュースに対してユーザがどの程度興味を有するかを予測するスコアを算出する。
スコア補正部１３５は、スコア算出部１３４により算出された各ニュースのスコアに対して、絶対値が第一閾値以下となる乱数を加算してスコアを補正する。すなわち、各スコアを所定範囲内で増減して補正する。
なお、スコア算出部１３４及びスコア補正部１３５の詳細な処理については後述する。
配信部１３６は、スコア補正部１３５により補正されたスコアに基づいて、スコアが高い順にニュースを選出して端末装置２０にレコメンドする。 The score calculation unit 134 calculates a score that predicts how much the user is interested in each news recorded in the news storage unit 121.
The score correction unit 135 corrects the score by adding a random number whose absolute value is equal to or less than the first threshold value to the score of each news calculated by the score calculation unit 134. That is, each score is increased or decreased within a predetermined range to correct it.
The detailed processing of the score calculation unit 134 and the score correction unit 135 will be described later.
The distribution unit 136 selects news in descending order of score based on the score corrected by the score correction unit 135, and recommends the news to the terminal device 20.

［端末装置２０の構成］
端末装置２０は、ユーザが管理するコンピュータであり、例えばスマートフォン、タブレット端末、パーソナルコンピューター等により構成されている。
端末装置２０の具体的な構成の図示は省略するが、端末装置２０は、一般的なコンピュータが有する基本的な構成を有する。すなわち、端末装置２０は、操作者の操作を受け付ける入力操作部、画像情報を表示させるディスプレイ、各種情報を記録する記録装置、各種情報を演算処理する演算回路（ＣＰＵ等）を備えている。 [Configuration of terminal device 20]
The terminal device 20 is a computer managed by a user, and is composed of, for example, a smartphone, a tablet terminal, a personal computer, or the like.
Although the illustration of the specific configuration of the terminal device 20 is omitted, the terminal device 20 has a basic configuration of a general computer. That is, the terminal device 20 includes an input operation unit that accepts the operation of the operator, a display that displays image information, a recording device that records various information, and an arithmetic circuit (CPU or the like) that performs arithmetic processing of various information.

［情報処理方法］
次に、本実施形態の情報処理システムでのレコメンド方法について、特に、サーバ装置１０での情報処理方法を中心に説明する。
図３は、本実施形態におけるレコメンド方法を示すフローチャートである。
サーバ装置１０は、端末装置２０へのニュースのレコメンド処理にあたり、事前にユーザ情報及びニュース情報を取得しており、記憶部１２に記憶しているものとする。また、サーバ装置１０のコンテンツ特徴算出部１３３は、ニュース情報の取得時に当該ニュースのコンテンツ特徴ベクトルを算出しているものとする。 [Information processing method]
Next, the recommendation method in the information processing system of the present embodiment will be described with particular focus on the information processing method in the server device 10.
FIG. 3 is a flowchart showing the recommendation method in the present embodiment.
It is assumed that the server device 10 has acquired user information and news information in advance and stores the news information in the storage unit 12 when recommending news to the terminal device 20. Further, it is assumed that the content feature calculation unit 133 of the server device 10 calculates the content feature vector of the news when the news information is acquired.

サーバ装置１０は、端末装置２０から、ニュース配信を要求する要求情報及びユーザＩＤを受信する（ステップＳ１）。そして、ユーザ特徴算出部１３２は、要求情報とともに受信したユーザＩＤに基づいてユーザ情報を読み込み、当該ユーザに対するユーザ特徴ベクトルを算出する（ステップＳ２）。
この後、スコア算出部１３４は、算出されたユーザ特徴ベクトルと、コンテンツ特徴ベクトルとに基づいて、各ニュース情報に対するスコアを算出する（ステップＳ３：スコア算出ステップ）。このステップＳ３では、スコア算出部１３４は、例えばユーザ特徴ベクトルと、コンテンツ特徴ベクトルとの内積を算出することで、スコアを算出する。 The server device 10 receives the request information and the user ID requesting news distribution from the terminal device 20 (step S1). Then, the user feature calculation unit 132 reads the user information based on the user ID received together with the request information, and calculates the user feature vector for the user (step S2).
After that, the score calculation unit 134 calculates the score for each news information based on the calculated user feature vector and the content feature vector (step S3: score calculation step). In step S3, the score calculation unit 134 calculates the score by, for example, calculating the inner product of the user feature vector and the content feature vector.

次に、スコア補正部１３５は、ステップＳ３で算出されたスコアに対して、乱数を加算する（ステップＳ４：スコア補正ステップ）。 Next, the score correction unit 135 adds a random number to the score calculated in step S3 (step S4: score correction step).

ここで、ステップＳ４で加算される乱数について説明する。
図４及び図５は、ステップＳ３で算出されたスコアの一例を示す図である。
図４及び図５において、横軸は各ニュース、縦軸は各ニュースに対するスコアを示している。ここで、横軸には、コンテンツ特徴ベクトルの類似度が近い順にニュースを配置しており、互いに近い位置に配置されているニュースは、ジャンルが近いニュースであることを示している。
例えば、図４では、スポーツのジャンルに対して高い興味を持つユーザの各ニュースに対するスコアの一例であって、スポーツのジャンルにおけるニュースＮ_１でスコアがピーク値となる。ニュースＮ_１の周辺は、ニュースＮ_１とコンテンツ特徴ベクトルが類似する他のニュースであって、ニュースＮ_１と略同じようなジャンル（スポーツ）を有している。これは、ユーザの過去のニュースの閲覧履歴において、他のジャンルのニュースに比べて、スポーツのニュースをより多く閲覧していたことを示しており、ユーザ特徴算出部１３２は、このような過去のニュースの閲覧履歴を用い、機械学習によりユーザ特徴ベクトルを推算する。これにより、スポーツのニュースに対するスコアが、他のジャンルのニュースより高くなる。本実施形態では、スコアが所定の第二閾値Ｓ_Ｔｈ以上となる場合に、スコアが、他のジャンルに比べて際立って高いと判定する。一般に、ニュースをレコメンドする場合、スコアが高いニュースは、ユーザの最も興味が高いジャンルのニュースであると予測でき、スコアが高い順にニュースを並び替えて端末装置２０にレコメンドする。したがって、図４に示すように、特定のジャンルでスコアが第二閾値Ｓ_Ｔｈ以上となる場合、ユーザの興味に対応したニュースを適切に配信可能である。 Here, the random numbers added in step S4 will be described.
4 and 5 are diagrams showing an example of the score calculated in step S3.
In FIGS. 4 and 5, the horizontal axis represents each news and the vertical axis represents the score for each news. Here, on the horizontal axis, news is arranged in order of similarity of content feature vectors, and news arranged at positions close to each other indicates that news has similar genres.
For example, in Figure 4, an example of a score for each news users with high interest in the genre of sports, scores news N ₁ is the peak value in the genre of sports. Near News N ₁ is an other news News N ₁ and the content characteristic vector is similar, and has a substantially similar genre as news N ₁ (sports). This indicates that the user has browsed more sports news than other genres of news in the past news browsing history of the user, and the user feature calculation unit 132 has browsed such past news. The user feature vector is estimated by machine learning using the news browsing history. This results in a higher score for sports news than for other genres of news. In the present embodiment, when the score is equal to or higher than the predetermined second threshold value S _Th , it is determined that the score is remarkably high as compared with other genres. Generally, when recommending news, it can be predicted that the news with a high score is the news of the genre most of interest to the user, and the news is sorted in descending order of the score and recommended to the terminal device 20. Therefore, as shown in FIG. 4, when the score is _{equal to or higher than the second threshold value S Th} in a specific genre, news corresponding to the user's interest can be appropriately distributed.

一方、図５は、様々なジャンルに対するニュースを一様に閲覧しているユーザの、各ニュースに対するスコアの一例である。ユーザが様々なジャンルのニュースを閲覧した場合、ユーザ特徴算出部１３２により算出されるユーザ特徴ベクトルも、多数の次元での値が一様となり、各ニュースに対するスコアも、図５のように、一様になだらかに変化する。すなわち、図５の例では、全ニュースに対するスコアが、第二閾値Ｓ_Ｔｈ未満となっている。このようなスコアが得られる場合、ユーザの興味があるジャンルを特定できていないことを示している。
ここで、従来のニュースレコメンド方法では、スコアが高い順にニュースを並べ替えて端末装置２０に送信するので、スコアが一様であっても、その中でスコアが高い順にニュースが配信されることになる。例えば、図５の例では、スポーツのニュースＮ_１に対するスコアがピークとなり、他のニュースよりも若干高くなっている。このため、図４のパターンと同様、スポーツのニュースＮ_１及びその周辺のスポーツのジャンルのニュースが、レコメンドされるニュースの上位を占めることになる。しかしながら、ユーザの興味があるジャンルは特定できていないので、従来のレコメンド方法を用いると、ユーザが他のジャンルに興味があるのも関わらず、スポーツのニュースのみが上位に配信されることになる。 On the other hand, FIG. 5 is an example of a score for each news of a user who uniformly browses news for various genres. When the user browses news of various genres, the user feature vector calculated by the user feature calculation unit 132 also has uniform values in many dimensions, and the score for each news is also one, as shown in FIG. It changes gently. That is, in the example of FIG. 5, the score for all news is less than _{the second threshold value S Th.} If such a score is obtained, it indicates that the genre of interest to the user has not been identified.
Here, in the conventional news recommendation method, news is sorted in descending order of score and transmitted to the terminal device 20, so that even if the score is uniform, the news is delivered in descending order of score. Become. For example, in the example of FIG. 5, the score is the peak for the sports news N _1, which is slightly higher than the other news. Therefore, similar to the pattern in FIG. 4, sports news N ₁ and genre news sports periphery thereof, will occupy the top news is recommended. However, since the genre that the user is interested in has not been identified, if the conventional recommendation method is used, only sports news will be delivered to the top even though the user is interested in other genres. ..

これに対して、本実施形態ではステップＳ４で、図４や図５に示すスコアに対して乱数が加算される。
図６は、図４に対して乱数を加算した補正スコアの一例を示す図であり、図７は、図５に対して乱数を加算した補正スコアの一例を示す図である。図６及び図７において、一点鎖線は、補正前のスコアを示し、実線は、乱数を加算して補正した補正スコアの一例を示し、二点鎖線で挟まれる領域は、補正スコアとして取りうる範囲を示している。
ステップＳ４では、スコア補正部１３５は、乱数として、絶対値が第一閾値α以下となる乱数（正数及び負数を含む）を加算する。なお、第一閾値αは、第二閾値ＳＴｈを超えるニュースのレコメンド率をどの程度に設定するかによって、適宜変更することができる。すなわち、第二閾値Ｓ_Ｔｈ以上のスコアのニュースが有る場合、ユーザが当該ニュースに対して強い興味を持っていることを示しており、乱数を加算することで、当該第二閾値Ｓ_Ｔｈを大幅に超えるスコア（第二閾値Ｓ_Ｔｈよりも、所定値Ｍ以上大きいスコア）が、第二閾値Ｓ_Ｔｈ未満の他のニュースのスコアよりも低下する場合、ユーザの興味に対応したニュースがレコメンドできない。したがって、乱数を加えた場合でも、第二閾値Ｓ_Ｔｈを大幅に超えるスコアのニュースのレコメンド率が低下しないように、第一閾値αが適宜設定される。例えば、第一閾値αを、α＜Ｍ／２とすることで、スコアがＳ_Ｔｈ＋Ｍを超えるニュースの補正スコアが、第二閾値Ｓ_Ｔｈ未満のニュースの補正スコアよりも小さくなることを抑制できる。
これにより、図６に示すように、第二閾値Ｓ_Ｔｈを所定値Ｍ以上、上回るスコアのニュースＮ_１やその周辺のニュースでは、たとえ負の乱数が加算されて補正されても、補正スコアが他のジャンルのニュースよりも高くなる。つまり、補正スコアが高い順にニュースを並び替えると、ニュースＮ_１と同じジャンルのニュースが上位に並ぶことになり、ユーザの興味に即したレコメンドが行える（レコメンド精度の低下が抑制できる）。
一方、図７に示すように、各ニュースのスコアが一様である場合、ランダムにスコアが高いニュースが現れることになる。つまり、補正スコアが高い順にニュースを並び替えると、コンテンツ特徴ベクトルがそれぞれ異なる、様々なジャンルのニュースが選出され、これらの様々なジャンルのニュースがレコメンドされることになる。 On the other hand, in the present embodiment, in step S4, a random number is added to the scores shown in FIGS. 4 and 5.
FIG. 6 is a diagram showing an example of a correction score obtained by adding a random number to FIG. 4, and FIG. 7 is a diagram showing an example of a correction score obtained by adding a random number to FIG. In FIGS. 6 and 7, the alternate long and short dash line indicates the score before correction, the solid line indicates an example of the correction score corrected by adding random numbers, and the area sandwiched by the alternate long and short dash line is the range that can be taken as the correction score. Is shown.
In step S4, the score correction unit 135 adds a random number (including a positive number and a negative number) whose absolute value is equal to or less than the first threshold value α as a random number. The first threshold value α can be appropriately changed depending on how much the recommendation rate of news exceeding the second threshold value STh is set. That is, when there is news with a score equal to or higher than the second threshold value S _Th, it indicates that the user has a strong interest in the news, and by adding a random number, the second threshold value S _Th can be significantly increased. a score above (than the second threshold value S _Th, score greater than a predetermined value M) is, if lower than the score of the other news below the second threshold value S _Th, news corresponding to the user's interest can not be recommended. Therefore, even when a random number is added, the first threshold value α is appropriately set so that the news recommendation rate of the score significantly exceeding _{the second threshold value S Th does not decrease.} For example, by setting the first threshold value α to α <M / 2, it is possible to prevent the correction score of news whose score exceeds _{S Th} + M from becoming smaller than the correction score of news whose score is less than _{the second threshold value S Th.} ..
Thus, as shown in FIG. 6, the second threshold value S _Th a predetermined value or more M, the News News N ₁ and around the above scores, be corrected is even negative random numbers addition, the correction score Higher than other genres of news. That is, when the correction score rearranges high news order will be the same genre news and news N ₁ are arranged in the upper, can be performed recommendation in line with the user's interest (decrease in the recommendation accuracy can be suppressed).
On the other hand, as shown in FIG. 7, when the scores of each news are uniform, news with a high score appears randomly. In other words, if the news is sorted in descending order of correction score, news of various genres with different content feature vectors will be selected, and news of these various genres will be recommended.

この後、配信部１３６は、スコア補正部１３５によって補正されたスコア（補正スコア）が高い順にニュースを選択して、端末装置２０にレコメンドする（ステップＳ５：配信ステップ）。
図８は、図４に示すように、スポーツに対するニュースＮ_１のスコアがピークとなる場合に端末装置２０に配信される、従来のニュースのレコメンド表示欄４０を示す図である。
図９は、図６に示すように図４のスコアを補正した場合に、端末装置２０に配信される、本実施形態のニュースのレコメンド表示欄５０を示す図である。
図１０は、図７に示すように図５のスコアを補正した場合に、端末装置２０に配信される、本実施形態のニュースのレコメンド表示欄５０を示す図である。
なお、詳細な説明は省略するが、図８から図１０に示す例は、主にスマートフォン等の携帯端末にインストールされるニュース閲覧アプリケーションでの表示例である。通常、このようなニュース閲覧アプリケーションでは、ユーザ情報に基づいたニュースがレコメンドされる「おすすめ」タブ、最新のニュースを新着順に表示させる「ＴＯＰ」タブ、各種ジャンルに対応したタブ等が表示され、これらのいずれかを選択することで、タブに対応したニュースがレコメンドされる。
ここでは、ユーザ情報に基づいてニュースのレコメンドを行う「おすすめ」タブが選択された場合に、レコメンド表示欄４０，５０に表示されるニュースについて説明する。 After that, the distribution unit 136 selects news in descending order of the score (correction score) corrected by the score correction unit 135, and recommends the news to the terminal device 20 (step S5: distribution step).
8, as shown in FIG. 4, the score news N ₁ is distributed to the terminal device 20 when the peak for sports, is a diagram illustrating a recommendation display section 40 of a conventional news.
FIG. 9 is a diagram showing a recommendation display column 50 of the news of the present embodiment, which is delivered to the terminal device 20 when the score of FIG. 4 is corrected as shown in FIG.
FIG. 10 is a diagram showing a recommendation display column 50 of the news of the present embodiment, which is delivered to the terminal device 20 when the score of FIG. 5 is corrected as shown in FIG. 7.
Although detailed description is omitted, the examples shown in FIGS. 8 to 10 are display examples in a news viewing application mainly installed on a mobile terminal such as a smartphone. Normally, in such a news viewing application, a "recommended" tab that recommends news based on user information, a "TOP" tab that displays the latest news in the order of new arrival, tabs corresponding to various genres, etc. are displayed. By selecting one of the above, news corresponding to the tab will be recommended.
Here, the news displayed in the recommendation display fields 40 and 50 when the "Recommended" tab for recommending news based on the user information is selected will be described.

従来の情報処理装置では、図４に示すように例えばスポーツに対するニュースＮ_１のスコアが高い場合では、図８に示すような、上位にスポーツに対するニュースが配置されたレコメンド表示欄４０が端末装置２０に表示される。なお、ユーザが端末装置２０を操作して、レコメンド表示欄４０を下方にスクロールすることで、レコメンド表示欄４０に他のニュースが表示される。
また、従来の情報処理装置では、図５に示すように、各ニュースのスコアが一様である場合でも、上位スコアのニュースを選択する。したがって、ニュースＮ_１のスコアが第二閾値Ｓ_Ｔｈ未満であったとしても、図８のように、上位にスポーツに対するニュースが配置されたレコメンド表示欄４０が表示される。この場合、ユーザは、レコメンド表示欄４０を下方にスクロールさせることで、自分が興味のあるニュースを探す必要がある。また、仮に、興味が無いにもかかわらず、レコメンド表示欄４０に表示された上位のニュースを選択、閲覧すると、当該閲覧履歴情報に基づいてユーザ特性ベクトルが補正される。したがって、ユーザの興味に対応したユーザ特徴ベクトルを正しく算出することがより困難になってしまう。 In the conventional information processing apparatus, when the score news N ₁ against for example sports, as shown in FIG. 4 is high, as shown in FIG. 8, the recommendation display section 40 is a terminal device that news is arranged for sports top 20 Is displayed in. When the user operates the terminal device 20 and scrolls the recommendation display field 40 downward, other news is displayed in the recommendation display field 40.
Further, in the conventional information processing apparatus, as shown in FIG. 5, even if the scores of the news are uniform, the news with the highest score is selected. Therefore, even if _{the score of the news N 1} is less than the second threshold value S _Th , the recommendation display column 40 in which the news about the sport is arranged at the upper level is displayed as shown in FIG. In this case, the user needs to search for news that he / she is interested in by scrolling the recommendation display field 40 downward. Further, if the higher-ranking news displayed in the recommendation display field 40 is selected and browsed even though he / she is not interested, the user characteristic vector is corrected based on the browsing history information. Therefore, it becomes more difficult to correctly calculate the user feature vector corresponding to the user's interest.

一方、本実施形態では、スコアに対して乱数を加算した補正スコアに基づいてニュースが選出される。ここで、図４のように、第二閾値Ｓ_Ｔｈを大きく超えるスコアのニュースが有る場合、当該ニュースのスコアに乱数として負の値が加算されたとしても、第二閾値Ｓ_Ｔｈを超える。この場合、図９のように、スコア上位の並び順が多少変化するものの、選出されるニュースは、従来と同様となり、レコメンド精度の低下はない。 On the other hand, in the present embodiment, news is selected based on a correction score obtained by adding a random number to the score. Here, as shown in FIG. 4, _{when there is news with a score that greatly exceeds the second threshold value S Th} , even if a negative value is added as a random number to the score of the news, the second threshold value S _Th is exceeded. In this case, as shown in FIG. 9, although the order of the top scores changes slightly, the news to be selected is the same as in the past, and the recommendation accuracy does not decrease.

また、本実施形態では、図５のように、各ニュースのスコアが一様であっても、図７の様にスコアが補正される。このため、スコアを高い順に選択すると、様々なジャンルのニュースが、ランダムで選出され、図１０に示すように、端末装置２０に表示されるレコメンド表示欄５０に、多種多様なジャンルのニュースＮ_Ｘが配置される。
この場合、ユーザは、レコメンド表示欄５０の上位に配置されるニュースに、興味のあるニュースが含まれる場合、レコメンド表示欄５０を下方にスクロールさせることなく、当該ニュースを選択、閲覧することができる。
また、レコメンド表示欄５０の上位に興味のあるニュースが配置される可能性も高くなり、１つのジャンルのニュースのみが配置される場合に比べて、ユーザの興味に対応した閲覧履歴情報を適正に取得でき、ユーザが真に興味を有するジャンルを適正に学習することができる。 Further, in the present embodiment, even if the score of each news is uniform as shown in FIG. 5, the score is corrected as shown in FIG. 7. Therefore, by selecting a score in descending order, of various genres news, elected randomly, as shown in FIG. 10, the recommendation display column 50 displayed on the terminal device 20, a wide variety of genres News N _X Is placed.
In this case, if the news arranged above the recommendation display field 50 includes news of interest, the user can select and view the news without scrolling the recommendation display field 50 downward. ..
In addition, there is a high possibility that news of interest will be placed at the top of the recommendation display column 50, and the browsing history information corresponding to the user's interest will be properly placed compared to the case where only news of one genre is placed. It can be acquired, and the genre that the user is really interested in can be properly learned.

この後、端末装置２０において、レコメンド表示欄５０に配置されたニュースのうちのいずれかがユーザの操作によって選択されると、端末装置２０から選択されたニュース（選択コンテンツ）を示す選択コンテンツ情報（例えば、コンテンツＩＤ）がサーバ装置１０に送信される。
サーバ装置１０のユーザ情報取得部１３１は、端末装置２０から選択コンテンツ情報を受信したか否かを判定する（ステップＳ６）。
ステップＳ６でＮＯと判定される場合、つまり、選択コンテンツ情報が受信されず、端末装置２０において、ユーザによるニュースの閲覧が終了した場合等では、処理を終了させる。
一方、ステップＳ６でＹＥＳと判定される場合、つまり、選択コンテンツ情報を受信した場合、ユーザ情報取得部１３１は、閲覧履歴情報を更新する（ステップＳ７）。例えば、選択コンテンツ情報を受信した日時と、選択コンテンツに対応するコンテンツＩＤとを閲覧履歴情報に記録する。 After that, when any one of the news arranged in the recommendation display field 50 is selected by the user's operation in the terminal device 20, the selected content information (selected content) indicating the news (selected content) selected from the terminal device 20 (selected content). For example, the content ID) is transmitted to the server device 10.
The user information acquisition unit 131 of the server device 10 determines whether or not the selected content information has been received from the terminal device 20 (step S6).
When it is determined as NO in step S6, that is, when the selected content information is not received and the user finishes viewing the news in the terminal device 20, the process is terminated.
On the other hand, if YES is determined in step S6, that is, when the selected content information is received, the user information acquisition unit 131 updates the browsing history information (step S7). For example, the date and time when the selected content information is received and the content ID corresponding to the selected content are recorded in the browsing history information.

ステップＳ７の後、ユーザ特徴算出部１３２は、さらに、更新された閲覧履歴情報を含むユーザ情報に基づいて、ユーザ特徴ベクトルを算出する（ステップＳ８）。
この後、スコア算出部１３４は、ステップＳ８で算出されたユーザ特徴ベクトルに基づいて、各ニュースに対するスコアを再度算出する（ステップＳ９：スコア算出ステップ）。 After step S7, the user feature calculation unit 132 further calculates the user feature vector based on the user information including the updated browsing history information (step S8).
After that, the score calculation unit 134 recalculates the score for each news based on the user feature vector calculated in step S8 (step S9: score calculation step).

さらに、スコア算出部１３４は、受信した選択コンテンツ情報に対応したニュースが、第二閾値Ｓ_Ｔｈ以上のスコアのニュースであるか否かを判定する（ステップＳ１０）。
ステップＳ１０でＹＥＳと判定される場合、つまり、選択されたニュースが、第二閾値Ｓ_Ｔｈ以上のスコアのニュースである場合、ステップＳ４に戻る。 Further, the score calculation unit 134 determines whether or not the news corresponding to the received selected content information is _{news with a score equal to or higher than the second threshold value S Th (step S10).}
If YES is determined in step S10, that is, if the selected news has _{a score equal to or higher than the second threshold value S Th} , the process returns to step S4.

一方、ステップＳ１０でＮＯと判定される場合、所定の判定期間（例えば１週間）の間で、選択されなかった第二閾値Ｓ_Ｔｈ以上のスコアのニュースが有るか否かを判定する（ステップＳ１１）。
このステップＳ１１でＮＯと判定される場合、つまり、判定期間の間で選択されなかった第二閾値Ｓ_Ｔｈ以上のスコアのニュースがない場合は、ステップＳ４に戻る。 On the other hand, if NO is determined in step S10, it is determined whether or not there is news with a score _{equal to} or higher than the second threshold value S Th that has not been selected during a predetermined determination period (for example, one week) (step S11). ).
If NO is determined in step S11, that is, if there is no news with a score equal to or higher than _{the second threshold value S Th that was not selected during the determination period, the process returns to step S4.}

ステップＳ１１でＹＥＳと判定される場合、つまり、判定期間の間で選択されなかった第二閾値Ｓ_Ｔｈ以上のスコアのニュースが有る場合、当該ニュース(対象ニュース)は、ユーザにとって興味がないニュース、又は、ユーザの興味が薄れているニュースである可能性がある。
この場合、スコア算出部１３４は、対象ニュースのスコアを、所定量低減させる（ステップＳ１２）。スコアの低減値は、例えば一定の値であってもよく、一定の割合であってもよい。この後、ステップＳ４に戻る。
これにより、スコアが高いニュースであっても、所定の判定期間の間でユーザに閲覧されないニュースはスコアが徐々に低減し、スコア補正部１３５によって乱数が加算された場合に、他のニュースのスコアよりも下回る可能性が出てくる。したがって、他のニュースがレコメンド表示欄５０の上位に表示される可能性も高くなり、ユーザが真に興味を有するジャンルを適正に学習することができる。 If YES is determined in step S11, that is, if there is _{news with a score equal to or higher than the second threshold value S Th} that was not selected during the determination period, the news (target news) is news that is not of interest to the user. Alternatively, the news may be less interesting to the user.
In this case, the score calculation unit 134 reduces the score of the target news by a predetermined amount (step S12). The score reduction value may be, for example, a constant value or a constant ratio. After that, the process returns to step S4.
As a result, even if the news has a high score, the score of news that is not viewed by the user during the predetermined judgment period is gradually reduced, and when a random number is added by the score correction unit 135, the score of other news is added. There is a possibility that it will be lower than. Therefore, there is a high possibility that other news will be displayed at the top of the recommendation display column 50, and the genre that the user is really interested in can be properly learned.

［本実施形態の作用効果］
本実施形態のサーバ装置１０は、記憶部１２と、プロセッサ１３とを備え、プロセッサ１３は、記憶部１２に記憶された情報処理プログラムを読み込むことで、スコア算出部１３４、スコア補正部１３５、及び配信部１３６として機能する。
スコア算出部１３４は、ユーザの特徴を示すユーザ特徴ベクトル、及び、コンテンツ（ニュース）の特徴を示すコンテンツ特徴ベクトルに基づいて、ユーザのコンテンツへの興味を示すスコアを算出する。スコア補正部１３５は、各ニュースに対するスコアに対して、絶対値が第一閾値α以下となる乱数を加算してスコアを補正する。配信部１３６は、補正スコアが高い順に所定数のニュースを選択して端末装置２０にレコメンドする。 [Action and effect of this embodiment]
The server device 10 of the present embodiment includes a storage unit 12 and a processor 13, and the processor 13 reads the information processing program stored in the storage unit 12 to read the score calculation unit 134, the score correction unit 135, and the score correction unit 135. It functions as a distribution unit 136.
The score calculation unit 134 calculates a score indicating the user's interest in the content based on the user feature vector indicating the user's characteristics and the content feature vector indicating the content (news) characteristics. The score correction unit 135 corrects the score by adding a random number whose absolute value is equal to or less than the first threshold value α to the score for each news. The distribution unit 136 selects a predetermined number of news items in descending order of the correction score and recommends them to the terminal device 20.

これにより、各ニュースに対するスコアが一様であり、ユーザの興味があるジャンルを特定できない場合に、多種多様なニュースをランダムでレコメンド配信することができる。よって、当該レコメンドに対して、ユーザがいずれかを選択することで、ユーザが興味のあるニュースのジャンルを閲覧履歴情報として取得でき、ユーザの興味を学習することができる。一方、特定のニュースに対して第二閾値以上のスコアを有する場合は、ユーザが当該特定のニュースのジャンルに対して際立って興味を有することを意味する。このような場合に、当該ニュースがレコメンドされない場合、ユーザの興味に対応したニュースが配信されず、レコメンド精度が低下する。しかしながら、本実施形態では、第一閾値以下の乱数を加算するものであり、このように、際立って高いスコアに対して乱数を加算しても、ニュース全体から見ると、当該ジャンルのニュースのスコアは依然として高く、当該ジャンルのニュースがレコメンドされるニュースとして選択される。よって、レコメンド精度が低下することがない。 As a result, when the score for each news is uniform and the genre of interest of the user cannot be specified, a wide variety of news can be randomly recommended and distributed. Therefore, when the user selects one of the recommendations, the genre of news that the user is interested in can be acquired as browsing history information, and the user's interest can be learned. On the other hand, when the score is equal to or higher than the second threshold value for a specific news, it means that the user is remarkably interested in the genre of the specific news. In such a case, if the news is not recommended, the news corresponding to the user's interest is not delivered, and the recommendation accuracy is lowered. However, in the present embodiment, the random numbers below the first threshold value are added, and even if the random numbers are added to the remarkably high score in this way, the score of the news of the genre is viewed from the whole news. Is still high and news of the genre is selected as recommended news. Therefore, the recommendation accuracy does not decrease.

本実施形態では、サーバ装置１０のプロセッサ１３は、ユーザ情報取得部１３１、及びユーザ特徴算出部１３２としても機能する。ユーザ情報取得部１３１は、本発明の選択コンテンツ取得部であり、配信部１３６により端末装置２０に配信された複数のニュースのうち、端末装置２０において選択及び閲覧されたニュースを選択コンテンツ情報として取得する。そして、ユーザ特徴算出部１３２は、本発明のユーザ補正部としても機能し、選択コンテンツ情報に基づいて更新された閲覧履歴情報に基づいて、ユーザ特徴ベクトルを算出する。
上述のように、ユーザの興味ジャンルが特定されていない場合、スコアに対して乱数が加算された補正スコアに基づいたニュースの配信を行い、複数のジャンルのニュースがランダムでレコメンドされることになる。この際、配信されたニュースのいずれかを選択する操作が端末装置２０で実施されることで、閲覧履歴情報が更新され、ユーザ特徴算出部１３２は、その閲覧履歴情報に基づいたユーザ特徴ベクトルを算出する。つまり、ランダムで配信されたニュースから、ユーザが興味を持つニュースのジャンルを特定でき、ユーザの興味を適切に学習することができる。よって、ユーザが興味を持つニュースのジャンルに対するベクトル値が高いユーザ特徴ベクトルを算出でき、これによって、真にユーザが興味を持つジャンルに対するニュースをレコメンドすることができる。 In the present embodiment, the processor 13 of the server device 10 also functions as a user information acquisition unit 131 and a user feature calculation unit 132. The user information acquisition unit 131 is the selected content acquisition unit of the present invention, and among a plurality of news distributed to the terminal device 20 by the distribution unit 136, the news selected and viewed by the terminal device 20 is acquired as the selected content information. do. Then, the user feature calculation unit 132 also functions as a user correction unit of the present invention, and calculates the user feature vector based on the browsing history information updated based on the selected content information.
As described above, if the user's interest genre is not specified, news will be distributed based on the correction score obtained by adding a random number to the score, and news of multiple genres will be recommended at random. .. At this time, the operation of selecting one of the delivered news is performed by the terminal device 20, so that the browsing history information is updated, and the user feature calculation unit 132 calculates the user feature vector based on the browsing history information. calculate. That is, the genre of news that the user is interested in can be identified from the randomly distributed news, and the user's interest can be appropriately learned. Therefore, it is possible to calculate a user feature vector having a high vector value for the news genre that the user is interested in, and thereby it is possible to recommend news for the genre that the user is really interested in.

本実施形態では、ユーザ情報取得部１３１が、判定期間の間、スコアが第二閾値Ｓ_Ｔｈ以上となるニュースに対する選択コンテンツ情報を取得しなかった場合に、スコア算出部１３４は、当該ニュースに対するスコアを所定の低減値だけ低減させる。
これにより、スコアが高いもののユーザの興味がないジャンルのニュース、又は、ユーザの興味が薄れてきたニュースに対するスコアが低減され、乱数が加算された場合に、他のニュースの補正スコアよりも低くなる可能性が生じる。この場合、スコアが高いもののユーザの興味がないジャンルのニュース、又は、ユーザの興味が薄れてきたニュースが、レコメンド表示欄５０の上位に表示されるニュースから外される。したがって、スコアが高いもののユーザによって選択されないジャンルのニュースが、いつまでもレコメンド表示欄５０の上位に表示される不都合を低減でき、ユーザの興味に即したニュースのレコメンドが可能となる。 In the present embodiment, when the user information acquisition unit 131 does _{not acquire the selected content information for the news whose score is equal to or higher than the second threshold value S Th} during the determination period, the score calculation unit 134 performs the score for the news. Is reduced by a predetermined reduction value.
This reduces the score for news in genres that are high but not of interest to the user, or news that the user is less interested in, and when random numbers are added, it is lower than the correction score of other news. Possibility arises. In this case, news of a genre that has a high score but is not of interest to the user, or news of which the user's interest has diminished is excluded from the news displayed at the top of the recommendation display column 50. Therefore, it is possible to reduce the inconvenience that news of a genre that has a high score but is not selected by the user is displayed at the top of the recommendation display field 50 forever, and it is possible to recommend news that suits the user's interests.

［変形例］
なお、本発明は、上述した実施形態に限定されるものではなく、本発明の目的を達成できる範囲で、以下に示される変形をも含むものである。 [Modification example]
The present invention is not limited to the above-described embodiment, but also includes the modifications shown below to the extent that the object of the present invention can be achieved.

［変形例１］
上記実施形態では、ステップＳ７により閲覧履歴情報が更新された後、ステップＳ８によってユーザ特徴ベクトルを算出し、その後、選択されたニュースが第二閾値Ｓ_Ｔｈ以上のニュースではなく、判定期間が経過した場合に、スコア算出部１３４が、選択されなかった第二閾値Ｓ_Ｔｈ以上のニュースのスコアをステップＳ１２で低減させる例を示した。
これに対して、選択されたニュースが第二閾値Ｓ_Ｔｈ以上のニュースではなく、所定期間が経過した場合に、ステップＳ９によってユーザ特徴ベクトルを算出する際に、選択されなかった第二閾値以上のニュースに対応したジャンルのベクトル値を低減させるように、ユーザ特徴ベクトルを算出してもよい。 [Modification 1]
In the above embodiment, after the browsing history information is updated in step S7, the user feature vector is calculated in step S8, and then the selected news is not the news of the second threshold value _STh or more, and the determination period has elapsed. In this case, the score calculation unit 134 shows an example in which the score of news having a _{second threshold value S Th or higher that is not selected is reduced in step S12.}
On the other hand, when the selected news is not the news of the second threshold value S _Th or more and the predetermined period has elapsed, the second threshold value or more that was not selected when calculating the user feature vector in step S9 is performed. The user feature vector may be calculated so as to reduce the vector value of the genre corresponding to the news.

［変形例２］
上記実施形態では、第二閾値Ｓ_Ｔｈ以上のスコアのニュースが、一定の判定期間の間で、選択されなかった場合にスコアを低減させる例を示した。これに対して、判定期間は、スコアの値によって変化させてもよい。
例えば、スコアと第二閾値Ｓ_Ｔｈとの差が大きくなる程、判定期間を短くしてもよい。すなわち、スコアが高いほど、ユーザがそのニュースに対する興味が高いことを示しているが、当該ニュースをユーザが閲覧しなかった場合、ユーザは、当該ニュースに対して予想している程の興味があるわけではないと判断できる。このような場合、スコアの低減期間を短くすることで、ユーザが真にそのジャンルに興味があるか否かを適正に判断できる。また、真にユーザが興味を持っているニュースであれば、短い判定期間であっても、選択される可能性が高く、スコアが低減される可能性も低くなる。よって、ユーザの興味に対応したニュースのレコメンド率が低下する等の問題は生じにくい。
一方、他のニュースに比べてスコアが高く、第二閾値Ｓ_Ｔｈを超えているものの、スコアと第二閾値Ｓ_Ｔｈとの差が小さい場合、当該ニュースに対する興味は高いものの、当該ジャンルに対するニュースの数が少なく、スコアが十分に得られていない可能性もある。この場合、判定期間を長くすることで、真にユーザが興味を持っているニュースに対するスコアが高い頻度で低減されることがなく、レコメンド精度の低下を抑制できる。 [Modification 2]
In the above embodiment, an example is shown in which news with a score equal to or higher than _{the second threshold value S Th reduces the score when it is not selected during a certain determination period.} On the other hand, the determination period may be changed depending on the value of the score.
For example, the larger the difference between the score and the second threshold value S _Th , the shorter the determination period may be. That is, the higher the score, the higher the user's interest in the news, but if the user does not view the news, the user is as interested in the news as expected. It can be judged that this is not the case. In such a case, by shortening the score reduction period, it is possible to properly judge whether or not the user is really interested in the genre. Also, if the news is truly of interest to the user, it is more likely to be selected, even for a short decision period, and less likely to reduce the score. Therefore, problems such as a decrease in the news recommendation rate corresponding to the user's interest are unlikely to occur.
On the other hand, higher score than other news, although exceeds a second threshold value S _Th, when the difference between the score and the second threshold value S _Th is smaller, but interest in the news high, news for the genre It is possible that the number is small and the score is not sufficient. In this case, by lengthening the determination period, the score for the news that the user is really interested in is not reduced at a high frequency, and the decrease in the recommendation accuracy can be suppressed.

［変形例３］
また、スコア算出部１３４は、ステップＳ１２で低減させるスコアの低減値を、スコアの値によって変化させてもよい。
例えば、スコアと第二閾値Ｓ_Ｔｈとの差が小さくなる程、低減値を小さくしてもよい。すなわち、スコアが高いほど、ユーザがそのニュースに対する興味が高いはずであるが、当該ニュースをユーザが閲覧しなかった場合、ユーザは当該ニュースに対して予想している程の興味があるわけではないと判断できる。よって、当該ニュースに対するスコアの低減値を大きくすることで、ユーザが興味ジャンルを適正に判定できる。また、真にユーザが興味を持っているニュースであれば、当該ニュースが頻繁に閲覧される可能性が高いので、スコアが低減されにくく、低減されたとしても、再び高いスコアを得ることができる。よって、ユーザの興味に対応したニュースのレコメンド率が低下する等の問題は生じにくい。
一方、他のニュースに比べてスコアが高く、第二閾値Ｓ_Ｔｈを超えているものの、スコアと第二閾値Ｓ_Ｔｈとの差が小さい場合（例えば、スコアがＳ_Ｔｈ〜Ｓ_Ｔｈ＋Ｍの間の場合）、上述のように、当該ニュースに対する興味は高いものの、当該ジャンルに対するニュースの数が少なく、スコアが十分に得られていない可能性もある。この場合、低減値を小さくすることで、真にユーザが興味を持っているニュースに対するスコアが大きく低減されることがなく、レコメンド精度の低下を抑制できる。 [Modification 3]
Further, the score calculation unit 134 may change the score reduction value to be reduced in step S12 depending on the score value.
For example, the smaller the difference between the score and the second threshold value S _Th , the smaller the reduction value may be. That is, the higher the score, the more the user should be interested in the news, but if the user does not view the news, the user is not as interested in the news as expected. Can be judged. Therefore, by increasing the reduction value of the score for the news, the user can appropriately determine the genre of interest. Also, if the news is truly of interest to the user, the news is likely to be viewed frequently, so the score is unlikely to be reduced, and even if it is reduced, a high score can be obtained again. .. Therefore, problems such as a decrease in the news recommendation rate corresponding to the user's interest are unlikely to occur.
On the other hand, higher score than other news, although exceeds a second threshold value S _Th, when the difference between the score and the second threshold value S _Th is small (for example, scores of between _{S Th ~S Th} + _M In the case), as mentioned above, although the interest in the news is high, the number of news in the genre is small, and it is possible that the score is not sufficiently obtained. In this case, by reducing the reduction value, the score for the news that the user is really interested in is not significantly reduced, and the decrease in the recommendation accuracy can be suppressed.

［変形例４］
上記実施形態では、全てのニュースのスコアに対して乱数を加算する例を示したが、これに限定されない。例えば、第二閾値Ｓ_Ｔｈ以上のスコアに対して乱数を加算せず、第二閾値Ｓ_Ｔｈ未満のスコアに対して乱数を加算してもよい。
また、スコアの値によって、第一閾値αの値を変化させてもよい。例えば、スコアの値が小さい程、乱数の範囲である第一閾値αの値を大きくしてもよい。スコア算出部１３４により算出されるスコアが極端に低いニュースは、入稿されるニュースの数が少ないジャンルや、ユーザが知識を有さず、ユーザが興味を持つか否かが未知数のジャンル等が含まれることがあり、このようなニュースは、ユーザにとって興味があるジャンルである可能性もある。これに対して、上記の様に、スコアが低い場合に、乱数で加算される値の範囲を大きくすることで、当該ニュースが、端末装置２０にレコメンドされる可能性が高まり、ユーザが興味を有するジャンルを学習することができる。 [Modification example 4]
In the above embodiment, an example of adding random numbers to all news scores has been shown, but the present invention is not limited to this. For example, a random number may not be added to a score having a second threshold value S _Th or higher, and a random number may be added to a score having a second threshold value S _{Thh or} less.
Further, the value of the first threshold value α may be changed depending on the value of the score. For example, the smaller the score value, the larger the value of the first threshold value α, which is the range of random numbers. News with an extremely low score calculated by the score calculation unit 134 includes genres in which the number of submitted news is small, genres in which the user does not have knowledge, and whether or not the user is interested is unknown. It may be included, and such news may be a genre of interest to the user. On the other hand, as described above, when the score is low, by increasing the range of values added by random numbers, the possibility that the news is recommended to the terminal device 20 increases, and the user is interested. You can learn the genre you have.

［変形例５］
さらに、第一閾値αとして、正値を加算する場合の閾値と、負値を加算する閾値とを、それぞれ異ならせてもよく、上記のようにスコアに応じて、閾値を異ならせてもよい。
例えば、スコアが第二閾値Ｓ_Ｔｈ以上である場合に、−α₋〜＋α_＋の範囲の乱数を加算し、スコアが低くなるに従って、｜α₋｜を小さくし、｜α_＋｜を大きくしてもよい。 [Modification 5]
Further, as the first threshold value α, the threshold value for adding the positive value and the threshold value for adding the negative value may be different from each other, or the threshold value may be different according to the score as described above. ..
For example, when the score is equal to or higher than _{the second threshold value S Th} _{, random numbers in the range of −α −} to + α ₊ are added, and as the score decreases, | α ₋ | decreases and | α ₊ | increases. You may.

［変形例６］
上記実施形態では、コンテンツの一例としてニュースを例示したが、これに限定されない。コンテンツとして、例えば商品等を広告する広告コンテンツであってもよく、動画配信サービスで配信される動画コンテンツ等であってもよい。 [Modification 6]
In the above embodiment, news is illustrated as an example of content, but the present invention is not limited to this. The content may be, for example, advertising content that advertises a product or the like, or video content or the like distributed by a video distribution service.

［変形例７］
上記実施形態では、ユーザ特徴算出部１３２は、ユーザ情報の閲覧履歴情報に基づいてユーザ特徴ベクトルを算出する例を示したが、ニュースの閲覧履歴情報に加え、ユーザ情報に含まれるユーザ属性や、インターネット上の検索履歴等のその他の行動履歴に基づいて、ユーザ特徴ベクトルを算出してもよい。 [Modification 7]
In the above embodiment, the user feature calculation unit 132 shows an example of calculating the user feature vector based on the browsing history information of the user information. However, in addition to the news browsing history information, the user attributes included in the user information and the user attributes are included. The user feature vector may be calculated based on other action history such as search history on the Internet.

１０…サーバ装置（情報処理装置）、１２…記憶部、１３…プロセッサ、２０…端末装置、５０…レコメンド表示欄、１２１…ニュース蓄積部、１２２…ユーザ情報蓄積部、１３１…ユーザ情報取得部（選択コンテンツ取得部）、１３２…ユーザ特徴算出部（ユーザ補正部）、１３３…コンテンツ特徴算出部、１３４…スコア算出部、１３５…スコア補正部、１３６…配信部、Ｓ_Ｔｈ…第二閾値、α…第一閾値。 10 ... Server device (information processing device), 12 ... Storage unit, 13 ... Processor, 20 ... Terminal device, 50 ... Recommendation display field, 121 ... News storage unit, 122 ... User information storage unit, 131 ... User information acquisition unit ( Selected content acquisition unit), 132 ... User feature calculation unit (user correction unit) 133 ... Content feature calculation unit, 134 ... Score calculation unit, 135 ... Score correction unit, 136 ... Distribution unit, S _Th ... Second threshold, α … First threshold.

Claims

A score calculation unit that calculates a score indicating the user's interest in the content based on the user feature vector indicating the user's characteristics and the content feature vector indicating the content characteristics.
A score correction unit that corrects the score by adding a random number whose absolute value is equal to or less than the first threshold value with respect to the score for each content.
A distribution unit that selects a predetermined number of the contents in descending order of the corrected score and distributes the contents to the terminal device owned by the user.
An information processing device characterized by being equipped with.

In the information processing apparatus according to claim 1,
Among the plurality of contents distributed to the terminal device by the distribution unit, the selected content acquisition unit that acquires the content selected by the terminal device as the selected content, and the selected content acquisition unit.
A user correction unit that corrects the user feature vector based on the selected content,
An information processing device characterized by further providing.

In the information processing apparatus according to claim 2,
The score calculation unit reduces the score of the content when the selected content acquisition unit does not acquire the content whose score is equal to or higher than the second threshold value as the selected content during a predetermined determination period. An information processing device characterized by being made to operate.

It is an information processing method that distributes content by computer.
The computer includes a score calculation unit, a score correction unit, and a distribution unit.
A score calculation step in which the score calculation unit calculates a score indicating the user's interest in the content based on the user feature vector indicating the user's characteristics and the content feature vector indicating the content's characteristics.
A score correction step in which the score correction unit corrects the score by adding a random number whose absolute value is equal to or less than the first threshold value with respect to the score for each content.
A distribution step in which the distribution unit selects a predetermined number of the contents in descending order of the corrected score and distributes the content to the terminal device owned by the user.
An information processing method characterized by carrying out.

An information processing program that can be read and executed by a computer.
An information processing program characterized in that the computer functions as the information processing device according to any one of claims 1 to 3.