JP2012203663A

JP2012203663A - Evaluation method of recommendation method

Info

Publication number: JP2012203663A
Application number: JP2011067930A
Authority: JP
Inventors: Naoyuki Tanaka; 均幸田中; Naoto Yamamoto; 直人山本; Masao Kano; 正雄加納
Original assignee: Nippon Telegraph and Telephone West Corp
Current assignee: Nippon Telegraph and Telephone West Corp
Priority date: 2011-03-25
Filing date: 2011-03-25
Publication date: 2012-10-22

Abstract

PROBLEM TO BE SOLVED: To evaluate a method of recommending items such as a community or merchandise from the past action record of a user so as to compare the result and method itself.SOLUTION: For data lowly related to past attribute data, after the lapse of a fixed period, only data for which a user has actually taken action to the attribute data is handled as an unexpected object. Then, of all the recommended attribute data, a ratio occupied by new objects and a ratio occupied by the unexpected objects are obtained.

Description

この発明は、ユーザに対して、趣味嗜好に沿った属性データを選び出して推薦する様々な手法について、その手法自体を評価する方法に関する。 The present invention relates to a method for evaluating a method itself for various methods for selecting and recommending attribute data according to hobbies and preferences to a user.

ソーシャルネットワークサービスの運営や、マーケティングにあたり、ユーザの過去の行動を解析して、その行動に関連があるコミュニティを勧めたり、過去に購入した商品と関連ある商品を勧めたりといったことが行われている。ソーシャルネットワークサービスであれば、一のユーザがアクセスしたり登録したりしたコミュニティやパーソナルデータについて、同じコミュニティやパーソナルデータにアクセスしたり登録したりした別のユーザが別途アクセスしたり登録したりしたコミュニティやパーソナルデータを、当該一のユーザに勧めるべく、当該一のユーザの利用する画面に表示するといったことが行われている。また、マーケティングにあたっては、ある商品を購入した一のユーザに対し、同じ商品を購入した別のユーザが購入した別の商品を、当該一のユーザに勧めるべく、マーケティングサイトのお勧め欄に表示したり、メールマガジンのお勧め欄に記載したりするといったことが行われている。 In the operation and marketing of social network services, the past behavior of users is analyzed, and a community related to the behavior is recommended, and products related to products purchased in the past are recommended. . If it is a social network service, the community or personal data accessed or registered by one user will be accessed or registered by another user who has accessed or registered the same community or personal data. And personal data are displayed on a screen used by the one user so as to be recommended to the one user. In marketing, for one user who purchased a product, another product purchased by another user who purchased the same product is displayed in the recommendation column of the marketing site in order to recommend it to the one user. Or listed in the recommendation column of an e-mail magazine.

このような推奨方法は、ソーシャルネットワークサービスにおけるコミュニティの拡張や、マーケティングにおいて大きな効果を発揮するが、単純に趣味嗜好が類似しているユーザの中で特に多いものを選んで提示し続けていくと、提示するコミュニティや商品の傾向が偏ってしまい、コミュニティの拡大や商品の販売の拡大に限界があり、一定の範囲に留まりやすい。そうなると、提示できるコミュニティや商品が利用者にとって新鮮味のあるものでなくなり、驚きを覚えるような体験を提供できず、引いては当該サービスに対する興味を減退させてしまう。このため、趣味嗜好が類似しているユーザが行動の対象とする件数が最も多いものといった類似性の高いものを優先して推奨するだけではなく、比較的件数が少ないものの、一部のユーザ層が重なっている意外な対象を勧められる推薦手法が求められる場合がある。 Such a recommendation method has a great effect on the expansion of the community in social network services and marketing, but if you continue to select and present a particularly large number of users with similar hobbies and preferences The tendency of the community and the product to be presented is biased, and there is a limit to the expansion of the community and the sales of the product, and it tends to stay within a certain range. If this happens, the community and products that can be presented will not be fresh to the user, and it will not be possible to provide a surprising experience, which in turn will reduce interest in the service. For this reason, it is not only recommended that users with similar hobbies and preferences have the highest number of cases to be acted on, but also recommends those with a high degree of similarity. There is a case where a recommendation method is recommended for recommending an unexpected object with overlapping.

例えば特許文献１に記載の方法は、利用者の行動記録に基づいて、その利用者の嗜好モデルを学習して記憶し、複数の推薦候補についてその嗜好モデルに基づいて選択確率を計算して、各推薦候補について利用者が選択する確率を表す評価リストを作成し、利用者の行動記録に関する情報から当該利用者の習慣性を特定して、前記複数の推薦候補についてのデフォルト予測値を求め、前記選択確率からデフォルト予測値を引いた差を、意外性評価値として求めることで、コンテンツの意外性を評価し、意外性の高いコンテンツを提示するものである。すなわち、閾値から離れたデータも、意外性のあるものとして廃棄せずに提示する。 For example, the method described in Patent Literature 1 learns and stores a user's preference model based on a user's behavior record, calculates a selection probability based on the preference model for a plurality of recommended candidates, Create an evaluation list representing the probability that the user selects for each recommendation candidate, identify the user's habit from the information about the user's behavior record, and obtain a default prediction value for the plurality of recommendation candidates, The difference obtained by subtracting the default prediction value from the selection probability is obtained as an unexpected evaluation value, thereby evaluating the unexpectedness of the content and presenting highly unexpected content. In other words, data that deviates from the threshold value is presented as being unexpected without being discarded.

一方で、意外性の高いものばかり推奨しては困るケースもある。例えば、急激なコミュニティの拡大を求めない場合や、安定した顧客に対して安定した販売を継続したい場合などである。 On the other hand, there are cases where it is difficult to recommend only those that are surprising. For example, there is a case where rapid expansion of the community is not required, or when it is desired to continue stable sales to stable customers.

従って、推薦手法は最良解があるわけではなく、様々な推薦手法を検討する必要がある。また、一つの推薦手法でも、パラメータを変更することで推薦結果が大きく変わることもある。 Therefore, the recommendation method does not have the best solution, and it is necessary to consider various recommendation methods. Even with one recommendation method, the recommendation result may change greatly by changing the parameter.

特開２００８−２６２３９８号公報JP 2008-262398 A

しかしながら、それらの推薦手法がどのような性質を持っているかということは容易に把握することは難しく、結果として、どのような推薦手法を用いればよいかという指標を設けることも難しかった。 However, it is difficult to easily grasp what kind of properties these recommendation methods have, and as a result, it is difficult to provide an index as to what recommendation method should be used.

また、特許文献１の手法により、意外性があるとして提示するものは、あくまで、関連性が薄いものを意外なものとして提示するだけである。その結果、意外であって驚きを持って迎えられるコンテンツもある一方で、実際にはほとんど無関係で興味を持ちうるはずもないコンテンツまで、意外性があるものとして分類してしまい、実体とかけ離れた結果を出してしまうという問題があった。 In addition, what is presented as being unexpected by the method of Patent Document 1 is merely presented as an unexpected thing that is less relevant. As a result, while some of the contents are unexpected and greeted with surprise, even content that is almost irrelevant and should not be of interest is classified as surprising and far from the substance. There was a problem of giving results.

そこでこの発明は、推薦手法によって提示される候補や、推薦手法そのものの性質を、実体に則した意外性を踏まえて評価することにより、種種の推薦手法の性質を比較検討したり、推薦手法に代入するパラメータの効果などを検討できるようにすることを目的とする。 Therefore, the present invention compares the properties of various recommendation methods by evaluating the candidates presented by the recommendation method and the properties of the recommendation method based on the unexpectedness in accordance with the substance. The purpose is to make it possible to study the effects of parameters to be substituted.

この発明は、実態に則した意外性の評価として、利用者にとって新規な属性データを推薦するにあたり、過去の属性データに対して関連性の低いデータは、一定期間経過するまでに実際に当該利用者がその属性データに対して行動を起こしたもののみ、意外性がある対象として扱うようにし、その上で、一の利用者に、一のグループに、又は全ての利用者に対して、全ての推薦する属性データのうち、新規性のある対象の占める割合と、意外性のある対象の占める割合とを求め、これらの割合の数値により、上記の課題を解決したのである。すなわち、過去の属性データを採取するだけでなく、一定期間経過後にさらにデータを収集して、意外性のある対象とすべきか否かの判断に用いたことを特徴とする。推薦しても、利用者がそれに対して行動結果を示さなければ、推薦する意味が無いからである。 The present invention recommends new attribute data for a user as an evaluation of unexpectedness in accordance with the actual situation, and data that is less relevant to past attribute data is actually used before a certain period of time. Only those that have acted on the attribute data are treated as unexpected objects, and then all to one user, to one group, or to all users In the recommended attribute data, the ratio of the novel object and the ratio of the unexpected object were obtained, and the above problem was solved by the numerical values of these ratios. That is, it is characterized by not only collecting past attribute data but also collecting data after a certain period of time and determining whether or not it should be an unexpected object. This is because even if the recommendation is made, if the user does not show an action result for the recommendation, there is no point in recommending it.

この数値を一の利用者に提示された推薦内容について算出すると、提示された内容の新規性と意外性の評価として用いることができる。また、複数の利用者について同様の数値を算出した上で、その値について、平均、中間値、分布などの統計値を求めたり、グラフにプロットしたりする数的処理を行うことにより、その推薦手法がそれら複数の利用者に対して提示した結果の評価と、当該推薦手法の傾向を掴むことができる。さらに、同一の利用者に対して、異なる推薦手法による前記の数値を算出し、個々の推薦手法ごとに平均値などを求める数的処理を行い、推薦手法ごとの値を比較することで、複数の推薦手法についてその性質、利点、適正などを把握することができる。 If this numerical value is calculated for the recommended content presented to one user, it can be used as an evaluation of novelty and unexpectedness of the presented content. In addition, after calculating the same numerical value for multiple users, it is recommended to calculate the statistical value such as mean, median, distribution, etc., and plot it on a graph. It is possible to grasp the tendency of the recommendation method and the evaluation of the results presented to the plurality of users by the method. Furthermore, for the same user, the above numerical values by different recommendation methods are calculated, an average value is obtained for each recommendation method, and the values for each recommendation method are compared. It is possible to grasp the nature, advantages, appropriateness, etc. of the recommendation method.

推薦手法の利用形態例の概略図Schematic diagram of an example of how to use the recommendation method 行動記録のタイミングを示す概念図Conceptual diagram showing the timing of action recording 推薦コミュニティの例図Example of recommended community ユーザ１に示された推薦コミュニティの類似度の概念図Conceptual diagram of similarity of recommended community shown to user 1 類似度計算を行う第一の演算部の処理フロー図Process flow diagram of the first computing unit that calculates similarity 類似度に基づく新規性と意外性の分類の概念図Conceptual diagram of novelty and unexpected classification based on similarity 評価方法全体の処理フロー図Process flow diagram of the entire evaluation method 類似度計算を行う第二の演算部の処理フロー図Process flow diagram of the second calculation unit for calculating similarity

以下、この発明について詳細に説明する。この発明は、一つ又は複数の推薦手法により、利用者に対して、その利用者が過去に行った行動記録を基礎として、関連性が様々に異なる属性データを推薦する結果又はその推薦手法自体の性質を評価する方法である。この評価を実行する環境としては、主に一つ又は複数のＷＥＢサイトにおけるユーザの行動への推薦と評価が挙げられるが、ネットワーク上の事象に限らず、実店舗における会員登録した利用者の購入履歴を用いたマーケティングなど、実空間上の事象に対しても適用可能である。 The present invention will be described in detail below. The present invention provides a result of recommending attribute data having different relevance to a user based on action records performed by the user in the past by one or a plurality of recommendation methods or the recommendation method itself. It is a method for evaluating the properties of As an environment for executing this evaluation, there are mainly recommendations and evaluations of user behaviors on one or a plurality of WEB sites, but not limited to events on the network, the purchase of registered users at actual stores It can also be applied to real space events such as marketing using history.

ここで属性データとは、人の趣味、嗜好、性格などを反映させた物理的な、又は概念的な属性を有するデータであり、他のデータと何らかの関連性を有するものである。アイテム、又はコンテンツと呼ばれるものも含む場合がある。具体的には、商品、趣味、コミュニティ、検索単語、入力単語、位置情報サービスでの立ち寄り場所、旅行先、走行ルートの選択など、趣味嗜好等が反映され得るものであれば特に限定されない。 Here, the attribute data is data having physical or conceptual attributes reflecting a person's hobbies, preferences, personality, and the like, and has some relationship with other data. It may also include what is called an item or content. Specifically, there is no particular limitation as long as hobbies and preferences can be reflected, such as selection of merchandise, hobbies, communities, search words, input words, stopover locations in location information services, travel destinations, and travel routes.

具体例を挙げる。例えば、マーケティングサイトやスポーツ用品店における商品について、「バスケットボール」と「バスケットシューズ」と「マラソンシューズ」が属性データとしてあったとする。バスケットボールを購入した利用者はバスケットシューズにも興味がある可能性が高いが、マラソンシューズに興味がある可能性はそれよりも低くなる。「バスケットボール」と「バスケットシューズ」とは高い関連性を有するが、「バスケットボール」と「マラソンシューズ」との関連性はそれに比べれば低くなる。 A specific example is given. For example, it is assumed that “basketball”, “basket shoes”, and “marathon shoes” are attribute data for products at a marketing site or a sporting goods store. Users who have purchased basketball are more likely to be interested in basketball shoes, but less likely to be interested in marathon shoes. “Basketball” and “basket shoes” are highly related, but “basketball” and “marathon shoes” are less related.

また例えば、ソーシャルネットワークサービスにおけるコミュニティで、「野球」と「サッカー」と「アイスホッケー」と「ガーデニング」と「リアル系巨大ロボットアニメ作品」といった趣味のコミュニティを属性データとして例にとる。「野球」と「サッカー」と両方のコミュニティに所属している利用者は、スポーツ全般に興味がある可能性が高く、「アイスホッケー」のコミュニティについても興味を持つ可能性がそれなりに高いが、「ガーデニング」や「リアル系巨大ロボットアニメ作品」といったコミュニティに興味を持つ可能性はそれに比べると低くなる。ただし、一戸建ての家庭を持つ社会人が興味を抱きやすい傾向にある「ガーデニング」に比べると、スポーツ全般を楽しむ１０代から２０代の男性が好みやすい「リアル系巨大ロボットアニメ作品」は、やや関連性が高くなる。このため、「野球」を基準にすると、関連性の高いものから「サッカー」「アイスホッケー」「リアル系巨大ロボットアニメ作品」「ガーデニング」のような順となる。なお、ソーシャルネットワークサービス（ＳＮＳ）とは、対人関係を構築するための手段を用意したサービスである。コミュニティとはそのＳＮＳの一機能で、ある事象について興味を有したり、ある事象について賛同したりする、同一の方向性を持った利用者について、タグ付けやテーブルへの登録などによりまとめることができ、利用者間での交流が可能な専用の場を有していたりするものが挙げられる。 Further, for example, a community in a social network service, such as “baseball”, “soccer”, “ice hockey”, “gardening”, and “real giant robot anime work”, is taken as an example of attribute data. Users who belong to both “baseball” and “soccer” communities are more likely to be interested in sports overall, and more likely to be interested in the “ice hockey” community. The possibility of being interested in communities such as “gardening” and “real giant robot anime works” is low. However, compared to “gardening”, which tends to be of interest to members of society who have single-family homes, “real-type giant robot anime works” that are easy to like for men in their teens and 20s who enjoy sports in general are somewhat related. Increases nature. For this reason, when “baseball” is used as a reference, the order from “relevant” to “soccer”, “ice hockey”, “real-type giant robot anime work”, “gardening” is the order. Note that the social network service (SNS) is a service that provides means for establishing an interpersonal relationship. A community is a function of the SNS. Users with the same direction who are interested in or agree with a certain event can be grouped by tagging or registering in a table. And can have a dedicated place where users can interact with each other.

さらに例えば、ニュースサイトでは、「少女アイドルグループ（１）」と「少女アイドルグループ（２）」と「大学受験」と「３０代男性アイドルグループ（３）」といった、ニュース文書に含まれるキーワードやタグを属性データとして例にとる。「少女アイドルグループ（１）」についてのニュースを見ている利用者は他の「少女アイドルグループ（２）」についてのニュースに興味を持つ可能性が高いが、「大学受験」についてのニュースに興味を持つ可能性はそれより低く、「３０代男性アイドルグループ（３）」に興味を持つ可能性はほとんど無い。「少女アイドルグループ（１）」と「少女アイドルグループ（２）」との関連性は高く、「少女アイドルグループ（１）」と「大学受験」との関連性はそれより低くなり、「少女アイドルグループ（１）」と「３０代男性アイドルグループ（３）」との関連性はさらに低くなる。ここでは、内容としてはアイドルグループに関するもので似通っている「少女アイドルグループ（１）」と「３０代男性アイドルグループ（３）」であるが、属性データの関連性で見ると、前者は１０代から３０代の男性が興味を持ち、後者は４０代以上の女性が興味を持つ傾向が強く、閲覧する層が全く異なる。このため、例えば１０代男性層は「少女アイドルグループ（１）」と「大学受験」とのいずれにも興味を持ちやすいので、「少女アイドルグループ（１）」は「３０代男性アイドルグループ（３）」よりも、「大学受験」との類似度の方が高いということになる。 Further, for example, in the news site, keywords and tags included in news documents such as “Girls idol group (1)”, “Girls idol group (2)”, “College exam” and “30s male idol group (3)” Is taken as an example of attribute data. Users watching news about “Girl Idol Group (1)” are likely to be interested in news about other “Girl Idol Group (2)”, but they are interested in news about “College Exam” It is unlikely that they have an interest in the “30's male idol group (3)”. The relationship between “Girl idol group (1)” and “Girl idol group (2)” is high, and the relationship between “Girl idol group (1)” and “University examination” is lower than that. The relationship between the “group (1)” and the “30s male idol group (3)” is further reduced. Here, the contents are “girl idol group (1)” and “30's male idol group (3)”, which are similar to those related to idol groups. Men in their 30s are interested, and the latter has a strong tendency to be interested in women in their 40s and the browsing layer is completely different. For this reason, for example, the teenage male group is likely to be interested in both the “girl idol group (1)” and the “university examination”, so the “girl idol group (1)” is the “30s male idol group (3 ) ”Is more similar to“ university exam ”.

属性データの関連性のモデル化にはいくつかの方式があり、一般的にはコンテンツに基づくフィルタリングと協調フィルタリングとが挙げられる。前者は、それぞれの属性データの内容に基づき情報を取捨選択するものであり、後者は、ｗｅｂサイト、或いはネットワークにおいて同じ好みを有するものが共通して好むものを選択するものである。コンテンツによるフィルタリングの方式としては、ルールベース方式、メモリベース方式（概念ベース方式）、モデルベース方式の三種類が主に挙げられる。協調フィルタリングの方式としては、ユーザベース方式（メモリベース方式）、アイテムベース方式（メモリベース方式）、モデルベース方式の三種類が主に挙げられる。コンテンツに基づくフィルタリングは、属性データをキーワードやタグとして処理しているため、ニュースサイトや検索サイトにおける記事の推奨に向いており、協調フィルタリングは、属性データそのものがいわゆるアイテム単位であるため、コミュニティや商品の推奨に向いている。 There are several methods for modeling the relevance of attribute data, and in general, content-based filtering and collaborative filtering are included. The former selects information based on the contents of each attribute data, and the latter selects what is commonly used by those having the same preference in a web site or network. There are mainly three types of content filtering methods: rule-based methods, memory-based methods (concept-based methods), and model-based methods. As a collaborative filtering method, there are mainly three types: a user-based method (memory-based method), an item-based method (memory-based method), and a model-based method. Since filtering based on content processes attribute data as keywords and tags, it is suitable for recommending articles on news sites and search sites. Collaborative filtering is based on so-called item units for attribute data itself. Suitable for product recommendations.

この発明で利用する利用者の過去の行動記録は、アンケートやページ評価などの利用者が自ら行う明示的（直接的）手法と、アクセス解析や閲覧時間評価などの暗黙的（間接的）手法とがある。明示的手法は正確性が高いが利用者にかかる負担が大きい。暗黙的手法は利用者に対して推薦手法の評価をしていることを気づかせない程度に負荷が小さい。 The user's past action records used in the present invention are classified into explicit (direct) methods such as questionnaires and page evaluations, and implicit (indirect) methods such as access analysis and browsing time evaluation. There is. Although the explicit method has high accuracy, the burden on the user is large. The implicit method has a small load so that the user is not aware that the recommendation method is being evaluated.

この発明が対象とする推薦手法とは、利用者の過去の行動記録から、上記の属性データを抽出し、基礎とする過去属性データとして扱い、この過去属性データから、あるアルゴリズムによって、別の属性データ（推薦属性データ）を提示、推薦することができるものである。この行動記録とは、コミュニティページや商品ページ、ニュースページ等のアクセス履歴だけでなく、登録コミュニティの一覧、商品の購入履歴、検索履歴、入力単語の履歴、ＲＳＳの登録項目などが挙げられる。基本的には、過去属性データとそれから提示推薦できる推薦属性データとは共通である。例えば、登録コミュニティの一覧から提示推薦できるのは、次に登録してはどうかと提案される登録コミュニティであり、ニュースページのアクセス履歴から提示推薦できるのは、別のニュースページである。しかし、実体的な内容を紐付けることができれば、過去属性データと推薦属性データが異なっていてもよい。例えば、登録コミュニティとしてＮＢＡ（ナショナルバスケットアソシエーション）ファンクラブに登録されている利用者に対して、推薦属性データとして、ＮＢＡチームのユニフォームの宣伝を提示するといったことが可能である。 The recommendation method targeted by the present invention is to extract the above-mentioned attribute data from the user's past action record and treat it as the basic past attribute data. Data (recommendation attribute data) can be presented and recommended. This action record includes not only the access history of community pages, product pages, news pages, etc., but also a list of registered communities, product purchase history, search history, input word history, RSS registration items, and the like. Basically, past attribute data and recommended attribute data that can be recommended for presentation are common. For example, a registered community that can be recommended from the list of registered communities is a registered community that is proposed to register next, and another news page that can be recommended from the access history of a news page. However, the past attribute data and the recommended attribute data may be different as long as substantial contents can be associated. For example, it is possible to present an NBA team uniform advertisement as recommended attribute data to a user who is registered in an NBA (National Basketball Association) fan club as a registered community.

この発明では、上記の類似度として、それぞれの上記属性データの間の、利用者にとっての関連性の高さを数値的に比較できる値が算出できるものであれば、その算定基準と計算方式を特に限定するものではない。類似度の数値が高いほど関連性が高いと一応の想定ができ、類似度の数値がある閾値以上であるか否かによって、興味の飛躍が生じる、すなわち、意外であり得るとされるか否かを分類できればよい。一般的には、当該属性データにアクセスしたり、登録したり、購入したりする利用者がどの程度一致しているかを、個々の属性データの全てについて算出することで、基本的な値の算出は可能であるが、特にそれに限定されるものではない。 In the present invention, as the above-mentioned similarity, if a value that can numerically compare the degree of relevance for the user between the respective attribute data can be calculated, the calculation standard and calculation method are set. There is no particular limitation. It can be assumed that the higher the similarity value is, the higher the relevance is, and a jump in interest occurs depending on whether the similarity value is greater than a certain threshold value. I just need to be able to classify. In general, the basic value is calculated by calculating the degree to which users accessing, registering, and purchasing the attribute data match for all individual attribute data. Is possible, but not particularly limited thereto.

例えば上記属性データとして、上記のＳＮＳにおけるコミュニティ同士の類似度の値を求める方法として、合計利用者数で共通利用者数を割るといった方法が挙げられる。あるコミュニティＸとコミュニティＹがあり、一定期間中に、ＸにはユーザＡ，Ｂ，Ｃ，Ｄ，Ｅの５人がアクセスしており、ＹにはユーザＡ，Ｃ，Ｄ，Ｆ，Ｇの５人がアクセスしていて、全部で７人いるうち、共通してアクセスしていたのがＡ，Ｃ，Ｄの３人である場合、３／７＝４３％という計算をして、類似度を４３％、とすることができる。 For example, as the attribute data, there is a method of dividing the number of common users by the total number of users as a method for obtaining a similarity value between communities in the SNS. There is a community X and a community Y, and during a certain period, X is accessed by five users A, B, C, D, and E, and Y has users A, C, D, F, and G If 5 people are accessing and there are 7 people in total, A, C, and D who are accessing in common, the calculation is 3/7 = 43%, and the degree of similarity Can be 43%.

また、ニュースサイトにおけるニュース文書の類似度を求める方法として、ニュース文書から属性データであるキーワードを概念抽出し、単語ベクトル間の兄弟概念や上位概念との単語ベクトル間の距離値を求める方法がある。これは、大量のニュース文書を取り込み、似た概念を有する単語の出現パターンが似通ってくることを利用して、単語の概念ベクトルを収集してデータベース化した上で、算出可能となる個々のキーワード間の距離を類似度として用いるものである。 Also, as a method for obtaining the similarity of news documents in a news site, there is a method of extracting a keyword as attribute data from a news document and obtaining a distance value between word vectors with a sibling concept between word vectors and a superordinate concept. . This is because individual keywords that can be calculated after collecting a large number of news documents, collecting the concept vectors of words using a similar appearance pattern of words having similar concepts, and creating a database. The distance between them is used as the similarity.

上記過去属性データが複数ある場合、過去属性データを一つのまとまり（群）として見なし、それ以外の属性データとの類似度は一意に定まる。これについて、上記のコミュニティの例に沿って計算例を示す。コミュニティＺには、ユーザＡ，Ｂ，Ｃ，Ｇ，Ｈ，Ｉの６人がアクセスしていたとする。ここでＸとＹのコミュニティ群（以下、ＸＹ群）とＺの類似度を計算する。ＸとＹに共通的にアクセスしているユーザはＡ，Ｂ，Ｃ，Ｄ，Ｅ，Ｆ，Ｇとなる。そのため、ＸＹ群とＺとの類似度は、上記と同じ手法で計算すると、全部で９人（ＡＢＣＤＥＦＧＨＩ）いるうち、共通してアクセスしていたのが４人（ＡＢＣＧ）なので、４／９≒４４％となる。 When there are a plurality of the past attribute data, the past attribute data is regarded as one group (group), and the similarity with other attribute data is uniquely determined. In this regard, an example of calculation will be shown along with the above community example. It is assumed that six users A, B, C, G, H, and I are accessing the community Z. Here, the similarity between the community group of X and Y (hereinafter referred to as XY group) and Z is calculated. Users who commonly access X and Y are A, B, C, D, E, F, and G. Therefore, if the similarity between the XY group and Z is calculated by the same method as described above, out of a total of 9 people (ABCDEFGHI), 4 people (ABCCG) are accessing in common, so 4 / 9≈ 44%.

ただし、類似度が低いことがすなわち意外性があるわけではない。行動記録上ある過去属性データを有する利用者に対して、その過去属性データとの類似度が低い推薦属性データを提示し、その推薦属性データに対して利用者が所定の行動を起こせば、それは意外性があるということになり、行動を起こさなければ、意外ではなく、無縁であったということになる。一方、既に行動の対象となった属性データが、推薦属性データに仮に含まれたとしても、以下の新規性対象、意外性対象からは除外する。 However, it is not surprising that the similarity is low. If the recommended attribute data with a low similarity to the past attribute data is presented to the user having the past attribute data on the action record, and the user takes a predetermined action on the recommended attribute data, It means that there is unexpectedness, and if you do not take action, it means that you were not surprised and not related. On the other hand, even if attribute data that has already been an object of action is included in the recommended attribute data, it is excluded from the following novelty object and unexpected object.

この提示された推薦属性データについて類似度を求める。すなわち、推薦属性データのうち、過去属性データに含まれないものについて、それぞれの属性データごとに、比較対象となる当該利用者の過去属性データ群との類似度を求める。この類似度が閾値以上であるものを、新規性対象、すなわち、意外ではないが、未踏、未訪問、未登録、未見である新規なものとする。一方、この類似度が閾値未満であるものは、提示開始から一定期間経過までの間に、利用者の登録、アクセス、購入などといった、それぞれに適した行動の対象となったら、意外性対象、すなわち、関連性は薄くとも利用者が興味を持った意外なものとする、一定期間経過までに利用者の行動の対象とならなかったら、その数緯線属性データは用を為さなかったものとする。なお、この閾値も本発明によって評価可能なパラメータであり、閾値が変わることで、当該推薦手法の評価内容は変化する。 The degree of similarity is obtained for the presented recommended attribute data. That is, among the recommended attribute data that is not included in the past attribute data, the similarity with the past attribute data group of the user to be compared is obtained for each attribute data. A thing whose similarity is equal to or greater than a threshold value is a novelty object, that is, a novel object that is not unexpected but has not yet been visited, unvisited, unregistered, or unseen. On the other hand, if this similarity is less than the threshold, if it becomes the target of appropriate actions such as user registration, access, purchase, etc. between the start of presentation and the elapse of a certain period of time, In other words, even if the relevance is low, it is surprising that the user is interested in it. If it is not subject to the user's action by a certain period of time, the latitude and longitude attribute data was not used. To do. This threshold value is also a parameter that can be evaluated according to the present invention, and the evaluation content of the recommendation method changes as the threshold value changes.

その上で、推薦属性データの全数のうち、新規性対象の占める割合である新規性対象率と、前記意外性対象の占める割合である意外性対象率とを求める。これにより、当該利用者に対して提示された推薦属性データの内訳を評価できる。 Then, the novelty target ratio that is the ratio of the novelty target and the unexpected target ratio that is the ratio of the unexpected target are obtained from the total number of recommended attribute data. Thereby, the breakdown of the recommendation attribute data presented to the user can be evaluated.

このようにして得られた新規性対象率と意外性対象率とを、複数の利用者について求め、それらの値について数的評価を行う。複数の利用者とは、全利用者でもよいし、抽出した利用者でもよい。対象とする利用者数が多い方が評価の信憑性は向上するが、その分解析に時間がかかる。ここで数的評価とは、統計値の算出や、グラフ上へのプロット、又はその両方が挙げられる。統計値としては、一般的な平均値の他、中間値、最小値、最大値、最頻値、分散、標準偏差などが挙げられる。これにより、当該推薦手法により提示されたデータの全体的な傾向を掴むことができる。例えば、対象とする利用者中での新規性対象率の平均値が高く、意外性対象率の平均値が低ければ、提示された推薦属性データは、堅実で類似性の高いものを推奨するものであるとわかる。逆に意外性対象率の平均値の方が高ければ、利用者の利用を飛躍的に高める傾向にあることがわかる。また、新規性対象率と意外性対象率とのいずれの平均値も低い場合は、その推薦手法は有用性に問題があることがわかる。また、最小値と最大値の幅が大きかったり、分散が大きかったりすると、推薦する内容にムラがあり、逆であればムラが少なく、照準が定めやすいということがわかる。 The novelty target rate and the unexpected target rate obtained in this way are obtained for a plurality of users, and numerical evaluation is performed on those values. The plurality of users may be all users or extracted users. Although the credibility of the evaluation improves as the number of target users increases, the analysis takes time. Here, the numerical evaluation includes calculation of statistical values, plotting on a graph, or both. Examples of the statistical value include a general average value, an intermediate value, a minimum value, a maximum value, a mode value, a variance, a standard deviation, and the like. Thereby, the general tendency of the data presented by the recommendation method can be grasped. For example, if the average value of the novelty target rate among the target users is high and the average value of the unexpected target rate is low, the recommended attribute data presented is recommended to be solid and highly similar I understand that. Conversely, if the average value of the unexpectedness target rate is higher, it can be seen that the user's usage tends to be dramatically increased. In addition, when the average value of the novelty target rate and the unexpected target rate is low, it can be understood that the recommendation method has a problem in usefulness. It can also be seen that if the width between the minimum value and the maximum value is large or the variance is large, the recommended content is uneven, and if it is the other way around, the unevenness is small and the aim is easy to determine.

このような統計値の算出を、複数の推薦手法により得られた推薦属性データについて行い、推薦手法ごとの値を求めて互いに比較することで、他の推薦手法と比べて、その推薦手法がどのような傾向にあり、どういう場合にどの推薦手法を採用すればよいかを判断できる。なお、ここで複数の推薦手法とは、アルゴリズムが同一であってもパラメータの違いによって異なる結果が得られるものであれば、複数の推薦手法として考えることができる。堅実な拡大路線を勧めようとするならば、新規性対象率が高い推薦手法を選び、常に興味を引きつけて利用を拡大させていこうとするならば意外性対象率が高い推薦手法を選ぶ、といったような選択ができる。 This kind of statistical value calculation is performed on recommended attribute data obtained by multiple recommendation methods, and the value of each recommendation method is obtained and compared with each other. Therefore, it is possible to determine which recommendation method should be adopted under what circumstances. Here, a plurality of recommendation methods can be considered as a plurality of recommendation methods as long as different results can be obtained depending on the parameter even if the algorithm is the same. If you want to recommend a steady expansion route, select a recommendation method with a high novelty target rate, and if you want to constantly expand interest and attract use, select a recommendation method with a high unexpected target rate. You can choose

以上のような評価方法を、具体例を挙げて説明する。推薦手法を採用する対象は、ＳＮＳサイトにおけるコミュニティの推薦である。ここでコミュニティとは、サービス内である事柄について同項の士が集まる掲示板機能や新規通知機能などを有し、当該事柄についての話題をやりとりできる仮想的な場、ページのことである。推薦手法の利用形態の概略図を図１に示す。利用者が当該サイトを利用して、複数あるコミュニティにアクセスすることにより（Ｓ１０１）、行動記録であるアクセスログが蓄積されていく。このログは、それぞれの利用者がどのコミュニティにアクセスしたか、というデータを主に有する。このログを一定期間に亘って蓄積する（Ｓ１０２）。次に、蓄積した行動記録を過去属性データとして、評価しようとする複数の推薦手法を実施する（Ｓ１０３）。これにより、それぞれの利用者に対して推薦するコミュニティのリストである推薦属性データが得られる（Ｓ１０４）。 The evaluation method as described above will be described with specific examples. The object for which the recommendation method is adopted is the recommendation of the community on the SNS site. Here, the community is a virtual place or page that has a bulletin board function or a new notification function for gathering persons in the same section about matters in the service and can exchange topics about the matters. A schematic diagram of the usage pattern of the recommendation method is shown in FIG. When a user accesses a plurality of communities using the site (S101), an access log as an action record is accumulated. This log mainly has data on which community each user has accessed. This log is accumulated over a certain period (S102). Next, a plurality of recommendation methods to be evaluated are implemented using the accumulated action records as past attribute data (S103). Thereby, recommendation attribute data which is a list of communities recommended for each user is obtained (S104).

この推薦属性データを一定期間利用者に対して提示する。具体的には、当該利用者が当該ＳＮＳサイトを利用する（Ｓ１０１）際のスタートページなどに表示し続けることで、推薦するコミュニティへのアクセスを促す。これを一定期間に亘って続けて、アクセスログを収集する（Ｓ１０２）。この評価期間が終了した後、それらの推薦表示したコミュニティの内容、およびアクセスしたか否かを踏まえて、新規性、意外性の評価を行う（Ｓ１１１）。すなわち、この発明では利用者の行動を、図２に示すように、少なくとも二回の期間に亘って記録する。一つは、推薦属性データを導き出すための過去属性データの取得のためであり、もう一つは、推薦属性データに対する利用者の行動結果を記録し、意外性や新規性の評価に用いるためである。 This recommended attribute data is presented to the user for a certain period. Specifically, by continuing to display on the start page or the like when the user uses the SNS site (S101), the user is prompted to access the recommended community. This is continued for a certain period, and an access log is collected (S102). After the evaluation period is over, the novelty and the unexpectedness are evaluated based on the contents of the recommended and displayed communities and whether or not they have been accessed (S111). That is, in the present invention, the user's behavior is recorded over at least two periods as shown in FIG. One is for obtaining past attribute data for deriving recommended attribute data, and the other is for recording user behavior results for recommended attribute data for use in evaluating unexpectedness and novelty. is there.

図３に、推薦コミュニティの例を示す。利用者を推薦手法の数だけグループ分けし、それぞれのグループについて、一の推薦手法を実行して、推薦コミュニティのリストを作成する。ここでは、概念ベース技術と、協調フィルタリング技術、ランダム選択の三種類を、五百人ごとに分けたグループのそれぞれに対して実行したとする。 FIG. 3 shows an example of a recommended community. Users are grouped by the number of recommended methods, and one recommended method is executed for each group to create a list of recommended communities. Here, it is assumed that the concept-based technology, the collaborative filtering technology, and the random selection are executed for each group divided into 500 people.

利用者の一人であるユーザ１を例に取り、類似度の計算を説明する。この概念図を図４に示し、処理のフローを図５に示す。まず、ユーザ１は最初のログを取得する期間（過去属性データ取得期間）に、「野球」「サッカー」「バスケットボール」のコミュニティにアクセスしていたとする。この事実は過去属性データ取得期間のアクセスログから把握できる。これを前提にすると、推薦されたリストに表示されたコミュニティのうち（Ｓ２１１）、「野球」「サッカー」は既にユーザ１がアクセスしていたことがわかる（Ｓ２１２）。この場合、「野球」「サッカー」は再訪問を促すものであり、新規なものでも意外なものでもない。よって、新規性対象、意外性対象のいずれにも属さない（Ｓ２１３）。一方、「アイスホッケー」「リアルロボット」「ガーデニング」はまだユーザ１がアクセスしていないコミュニティであるとする（Ｓ２１２）。 Taking the user 1 who is one of the users as an example, the calculation of the similarity will be described. This conceptual diagram is shown in FIG. 4, and the flow of processing is shown in FIG. First, it is assumed that the user 1 accesses a community of “baseball”, “soccer”, and “basketball” during a period of acquiring the first log (past attribute data acquisition period). This fact can be grasped from the access log in the past attribute data acquisition period. Assuming this, it can be seen that among the communities displayed in the recommended list (S211), "baseball" and "soccer" have already been accessed by the user 1 (S212). In this case, “baseball” and “soccer” are urged to revisit and are neither new nor surprising. Therefore, it belongs to neither novelty object nor unexpected object (S213). On the other hand, it is assumed that “ice hockey”, “real robot”, and “gardening” are communities that the user 1 has not yet accessed (S212).

次に、提示された推薦属性データである推薦コミュニティのうち、推薦前は未アクセスであった３つのコミュニティのそれぞれについて、全利用者、又は抽出したある程度の数の利用者のアクセスログを前提として、既訪問であるコミュニティ群との類似度を計算する（Ｓ２１６）。ここでは既訪問のコミュニティ群が「野球」及び「サッカー」と「バスケットボール」の３つあり、この３つのコミュニティを１つの過去属性データとみなし、この群からの類似度を求めて、未アクセスコミュニティを分類する。 Next, among the recommended communities that are the recommended attribute data presented, the access logs of all users or a certain number of extracted users are assumed for each of the three communities that were not accessed before recommendation. The similarity with the visited community group is calculated (S216). Here, there are three visited community groups, “baseball”, “soccer”, and “basketball”. These three communities are regarded as one past attribute data, and the similarity from this group is obtained to determine whether the community is an unaccessed community. Classify.

その分類の概念図を図６に示す。まず分類に当たり、類似度について閾値を決定する。ここで採用する類似度の範囲のうち、その閾値より高い値を示す関係にあるコミュニティは類似度が高いため、表示されても意外性は無いものとする。例えばここでは、閾値を０．７とするが、コミュニティの数により適切な閾値は変化する。ここでは、「アイスホッケー」は類似度が０．８と閾値以上であるので（Ｓ２１７）、新規性対象として分類する（Ｓ２１８）。一方、閾値が０．８より低い「リアルロボット」と「ガーデニング」とについては、上記評価期間（Ｓ１０２〜Ｓ１０４）の間に、ユーザ１がアクセスしたか否かによって分類する（Ｓ２１９）。上記評価期間の間にユーザ１がアクセスしている「リアルロボット」は、推薦前にアクセスしたコミュニティと類似度が低いにも拘わらず、ユーザ１が新たに興味を持った意外性対象となる（Ｓ２２０）。一方、上記評価期間の間にユーザ１がアクセスしなかった「ガーデニング」については、興味を惹くことができなかったものであり、新規性対象でも意外性対象でもないとして該当なしとなる（Ｓ２２１）。 A conceptual diagram of the classification is shown in FIG. First, for classification, a threshold is determined for the similarity. Of the range of similarity used here, a community having a relationship higher than the threshold value has a high similarity, and it is assumed that there is no surprise even if displayed. For example, although the threshold value is 0.7 here, the appropriate threshold value changes depending on the number of communities. Here, since the degree of similarity of “ice hockey” is 0.8, which is equal to or greater than the threshold (S217), it is classified as a novelty target (S218). On the other hand, “real robot” and “gardening” whose threshold is lower than 0.8 are classified according to whether or not the user 1 has accessed during the evaluation period (S102 to S104) (S219). The “real robot” accessed by the user 1 during the evaluation period is an unexpected object that the user 1 is newly interested in even though the similarity with the community accessed before the recommendation is low ( S220). On the other hand, the “gardening” that the user 1 did not access during the evaluation period has been unable to attract interest and is not applicable because it is neither a novelty object nor an unexpected object (S221). .

以上で、推薦属性データの全てについて評価が完了する（Ｓ２１４）。これにより、推薦属性データの傾向が評価できる。ここでは、推薦された属性データ数が５で、うち「野球」「サッカー」「ガーデニング」が「該当なし」となり、「アイスホッケー」が新規性対象、「リアルロボット」が意外性対象であった。従って、ユーザ１に提示された推薦属性データ中、新規性対象の占める割合である新規性対象率は１／５＝２０（％）であり、意外性対象の占める割合である意外性対象率は１／５＝２０（％）となる。このユーザ１に与えた結果としては、一応の興味の拡大に繋がったことがわかる。 Thus, the evaluation for all the recommended attribute data is completed (S214). Thereby, the tendency of recommendation attribute data can be evaluated. Here, the recommended number of attribute data is 5, of which “baseball”, “soccer” and “gardening” are “not applicable”, “ice hockey” is a novelty object, and “real robot” is an unexpected object. . Accordingly, in the recommended attribute data presented to the user 1, the novelty target ratio that is the ratio of the novelty target is 1/5 = 20 (%), and the unexpectedness target ratio that is the ratio of the unexpected target is 1/5 = 20 (%). As a result given to the user 1, it can be seen that this led to an increase in interest.

このような評価を、各々の推薦手法を用いたユーザ全体について行い、さらにそれを全ての推薦手法についても行うことで、結果や傾向の相互比較が可能となる。すなわち、推薦手法１，２、３……を用いて個々のユーザに推薦した全ての推薦属性データについて分類、統計処理を行う。 Such an evaluation is performed for all users using each recommendation method, and further for all the recommendation methods, the results and trends can be compared with each other. That is, classification and statistical processing are performed on all recommended attribute data recommended to individual users using the recommendation methods 1, 2, 3,.

そのためのフローの全体像を図７に示す。フロー中、中央にある演算部１が、上記の図５における新規性対象と意外性対象との分類に対応する。まず、推薦手法１により提示された全ての推薦コミュニティ（推薦属性データ）について、ユーザ１から順に上記の演算部１の処理を行う（Ｓ２１１〜Ｓ２２２）。その上で、推薦手法１により推薦したユーザ（ここではユーザ１〜５００）の平均値を算出する（Ｓ２２３）。なお、図中、新規性対象率を「新規性」と、意外性対象率を「意外性」と略記する。これにより、推薦手法１により推薦したコミュニティについての数的処理と新規性及び意外性の分類がされた。この値は、推薦手法１で出力される推薦属性データの傾向を示すものである（Ｓ２３２）。 An overall flow diagram for this purpose is shown in FIG. In the flow, the calculation unit 1 in the center corresponds to the classification of novelty object and unexpected object in FIG. First, with respect to all recommended communities (recommended attribute data) presented by the recommendation method 1, the processing of the calculation unit 1 is performed in order from the user 1 (S211 to S222). Then, an average value of users recommended by the recommendation method 1 (here, users 1 to 500) is calculated (S223). In the figure, the novelty target rate is abbreviated as “novelty”, and the unexpectedness target rate is abbreviated as “unexpectedness”. As a result, the numerical process and the novelty and unexpectedness of the community recommended by the recommendation method 1 were classified. This value indicates the tendency of the recommended attribute data output by the recommendation method 1 (S232).

この処理を、続けて推薦手法２，３……とについても行い（Ｓ２３１〜Ｓ２３２）、それぞれの推薦手法による、新規性対象率、意外性対象率を算出する。これにより、図７に記載のように、推薦手法ごとの新規性対象率及び意外性対象率が比較できるようになる。例えば図７のような結果となった場合、推薦手法１は新規性と意外性のバランスがよく、推薦手法２は新規性が多いために堅実な運用に向いており、推薦手法３は意外性に特化していて拡大傾向に拍車を掛けたいときに向いている、といった評価ができる。 This process is also performed for the recommendation methods 2, 3... (S231 to S232), and the novelty object rate and the unexpected object rate are calculated by the respective recommendation methods. Thereby, as shown in FIG. 7, the novelty object rate and the unexpected object rate for each recommendation method can be compared. For example, when the result shown in FIG. 7 is obtained, the recommendation method 1 has a good balance between novelty and unexpectedness, the recommendation method 2 has a lot of novelty and is suitable for steady operation, and the recommendation method 3 is unexpected. It can be evaluated that it is suitable for when you want to spur the expansion trend.

このように、利用者（ユーザ）ごとの新規性対象率、意外性対象率を求め、それを推薦手法ごとにまとめて数的処理し、推薦手法間で対比することで、推薦手法の評価ができる。この全体構造を保持すれば、それ以外の点は変更が可能である。 As described above, the novelty target rate and the unexpected target rate for each user (user) are obtained, and are numerically processed for each recommended method, and the recommended methods are evaluated by comparing the recommended methods. it can. If this overall structure is maintained, other points can be changed.

例えば、上記の判断においては、アクセスしたコミュニティについて過去属性データを収集し、推薦属性データの意外性評価もアクセスの有無によって行っているが、これを、コミュニティへの登録について過去属性データを収集し、意外性評価も登録の有無について行っても同様のことができる。 For example, in the above judgment, past attribute data is collected for the accessed community, and the unexpectedness evaluation of the recommended attribute data is also performed based on the presence / absence of access. The same can be done for the unexpectedness evaluation.

また例えば、上記のフローでは推薦属性データのうち類似度が閾値以上のものは無条件に新規性対象としているが、意外性対象の判断と同様に、実際にアクセスしなかったものは「該当なし」としてもよい。類似度が高いからといって、実際にアクセスに寄与しなければ、推薦する意義がないからである。この場合、上記の演算部１のフローは図８のようになる。それぞれの推薦属性データについて、最大類似度を抽出引用した後（Ｓ２１６）、まず評価期間中にアクセスがあったか否かを判断し（Ｓ２１９Ａ）、アクセスがなかったものは、閾値とは関係なく「該当なし」となる（Ｓ２２１Ａ）。一方で、評価期間中にアクセスがあった推薦属性データ（推薦コミュニティ）については、類似度が閾値以上か否かを判断し（Ｓ２１７Ａ）、新規性対象であるか（Ｓ２１８Ａ）、意外性対象であるか（Ｓ２２０）を判断する。 In addition, for example, in the above flow, the recommendation attribute data having a similarity equal to or greater than the threshold value is unconditionally regarded as a novelty target, but as in the case of the unexpectedness target, the data that was not actually accessed is “not applicable It is good also as. This is because, even if the degree of similarity is high, there is no point in recommending it unless it actually contributes to access. In this case, the flow of the calculation unit 1 is as shown in FIG. After extracting and quoting the maximum similarity for each recommended attribute data (S216), it is first determined whether or not there was access during the evaluation period (S219A). None ”(S221A). On the other hand, for the recommended attribute data (recommended community) accessed during the evaluation period, it is determined whether or not the similarity is equal to or higher than a threshold (S217A), and is a novelty target (S218A). It is determined whether there is (S220).

さらに例えば、上記のフローでは推薦手法１〜３を異なるユーザグループに対して行っているが、推薦手法１〜３を全てのユーザに対して実施し、提供期間をずらして順に推薦属性データを示してもよい。ただしこの場合、推薦手法１による推薦属性データを見た利用者は、次の推薦手法２による推薦属性データに対して行動するときに、前に推薦された推薦属性データが記憶にあり、行動を左右されているため、提供順序によりある程度の誤差が生じる。 Further, for example, in the above flow, the recommendation methods 1 to 3 are performed for different user groups. However, the recommendation methods 1 to 3 are performed for all users, and the recommended attribute data are sequentially displayed by shifting the provision period. May be. However, in this case, when the user who sees the recommended attribute data by the recommendation method 1 acts on the recommended attribute data by the next recommendation method 2, the recommended attribute data previously recommended is stored in the memory, and the action is performed. Depending on the order of provision, some error occurs.

Claims

Based on one user's past action records, multiple recommendation attribute data presented to one user by one recommendation method,
Calculate the degree of similarity that is highly relevant to the historical attribute data group derived from the action records made by the user in the past,
Classifying the recommendation attribute data that is less than a predetermined threshold that has achieved an action on the recommendation attribute data until a predetermined time has elapsed from the start of presentation as a target of unexpectedness,
Classifying the recommended attribute data whose similarity is equal to or higher than a predetermined threshold, or the one that has achieved an action for the recommended attribute data from the start of presentation to a novelty target,
Finding the novelty target rate that is the ratio of the novelty target and the unexpectedness target rate that is the ratio of the unexpected target with respect to the total number of recommended attribute data,
A method for evaluating the presented content, which evaluates the degree of novelty and unexpectedness of the presented content.

A plurality of the past attribute data is regarded as one group, and the similarity to the group calculated for each of the recommended attribute data is classified into the novelty object and the unexpected object classification for the recommended attribute data. The presentation content evaluation method according to claim 1, wherein the presentation content is used as a similarity that is a criterion of the presentation.

For a plurality of users, the presentation content evaluation method according to claim 1 or 2 is executed, and for each of the plurality of users, the novelty target rate and the unexpected target rate are obtained.
A method for evaluating a recommendation method in which the degree of novelty and unexpectedness of the one recommendation method is evaluated by numerical evaluation of those values.

A method for evaluating a plurality of recommended methods, wherein the method for evaluating a recommended method according to claim 3 is executed for a plurality of recommended methods, and numerical evaluations of the methods are compared.