JP5667959B2

JP5667959B2 - Impact analysis method, impact analysis apparatus and program thereof

Info

Publication number: JP5667959B2
Application number: JP2011225009A
Authority: JP
Inventors: 妙佐藤; 伊藤　浩二; 浩二伊藤; 藤田　将成; 将成藤田; 小林　稔; 稔小林; 理塩野入
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2011-10-12
Filing date: 2011-10-12
Publication date: 2015-02-12
Anticipated expiration: 2031-10-12
Also published as: JP2013084208A

Description

この発明は、コンテンツ閲覧履歴等からユーザの他者からの影響度を分析する影響度分析方法、影響度分析装置及びそのプログラムに関する。 The present invention relates to an influence analysis method, an influence analysis apparatus, and a program for analyzing an influence from another user from a content browsing history or the like.

近年、ユーザの行動や状況に合わせて適切なサービス・コンテンツをレコメンドする技術が望まれている。このため、書籍通販サイトにて、サイト内での書籍情報閲覧履歴からユーザの興味を推定して書籍をレコメンドする等、履歴情報からユーザの趣味等を推定する技術が提案されている。 In recent years, there has been a demand for a technique for recommending appropriate services and contents in accordance with user actions and situations. For this reason, a technique for estimating a user's hobbies from history information has been proposed, such as estimating a user's interest from a book information browsing history in the site and recommending a book.

このような興味推定の方法において、各コンテンツに内容をサマライズするメタ情報が付与されていることを前提として、ユーザの操作履歴において出現する概念等の頻度からユーザの興味を推定する方法は、内容ベースフィルタリング手法（Content Based Filtering: CBF）で、特にメモリベース手法として研究が進められている。具体的に、内容ベースフィルタリング技術は、例えば特定ブランド（ブランドを示す情報を概念タグとして保持）の商品を閲覧した場合に、同じブランドの商品（同じ概念タグを保持）を提示する。この場合の、メモリベース手法は、過去に閲覧した履歴から、特定ブランドを頻繁に閲覧していれば、特定ブランドの商品を提示することとなる。単純な方法では、閲覧履歴により多く出現した概念タグに関連する商品を提示することとなる。このような手法においてタクソノミ（オントロジ）を利用した方法としては、例えば、非特許文献１の「操作履歴を用いた多面的嗜好把握方法の提案」がある。 In such an interest estimation method, assuming that meta information for summarizing the contents is given to each content, the method for estimating the user's interest from the frequency of concepts that appear in the user's operation history is Research is progressing especially as a memory-based technique in content filtering (CBF). Specifically, the content-based filtering technology presents products of the same brand (holding the same concept tag) when browsing products of a specific brand (holding information indicating the brand as a concept tag), for example. In this case, the memory-based method presents a product of a specific brand if the specific brand is frequently browsed from the history of browsing in the past. In a simple method, products related to concept tags that appear more frequently in the browsing history are presented. As a method using a taxonomy (ontology) in such a method, for example, there is “Proposal of a multifaceted preference grasping method using an operation history” in Non-Patent Document 1.

一方で、ユーザへの情報の推薦サービスを実現しようとするとき、ユーザの実際の社会生活や選択行動に基づく情報提供サービスが求められている。また、ユーザが社会生活を営む中で他者からの影響を受けること（非特許文献２）や、選択行動が生じるきっかけとして、ユーザ自身の持つ興味に加えて、他者からの影響が関わること（非特許文献３）が広く知られている。 On the other hand, when a service for recommending information to a user is to be realized, an information providing service based on the user's actual social life and selection behavior is required. In addition to being influenced by others while the user is engaged in social life (Non-Patent Document 2), and as an opportunity for selection behavior, in addition to the user's own interests, influence from others is involved. (Non-Patent Document 3) is widely known.

「操作履歴を用いた多面的嗜好把握方法の提案」，信学技報，vol. 110， no. 450， LOIS2010-76， pp. 67-72， 2011年3月"Proposal of multifaceted preference grasp method using operation history", IEICE Technical Report, vol. 110, no. 450, LOIS2010-76, pp. 67-72, March 2011 井上隆二、山下富美代著，「図解雑学社会心理学」，株式会社ナツメ社，2010年4月10日，pp. 12-13Ryuji Inoue, Fumiyo Yamashita, “Illustrative Trivia Social Psychology”, Natsume Inc., April 10, 2010, pp. 12-13 シーナ・アイエンガー著，櫻井祐子訳，「選択の科学」，株式会社文藝春秋，2010年11月15日，pp. 138-139Written by Sina Eiengar, translated by Yuko Sakurai, “Science of Choice”, Bunju Shunju, November 15, 2010, pp. 138-139

上述したように、ユーザの実際の社会生活や選択行動に近い情報推薦サービスを実現するためには、当該ユーザへの他ユーザからの影響を精度良く推定する必要がある。
この発明は上記事情に着目してなされたもので、その目的とするところは、コンテンツを閲覧する際のユーザの他者からの影響度を高精度に推定可能にする影響度分析方法、影響度分析装置及びそのプログラムを提供することにある。 As described above, in order to realize an information recommendation service close to a user's actual social life and selection behavior, it is necessary to accurately estimate the influence of the user from other users.
The present invention has been made paying attention to the above circumstances, and the object of the present invention is an influence analysis method and an influence degree which can estimate the influence degree of the user from the other when browsing the content with high accuracy. It is to provide an analyzer and a program thereof.

本発明は、上記目的を達成するために、以下のような手段を講じている。
本発明の第１の態様は、コンピュータによって、複数の概念についてユーザが他ユーザから受ける影響度スコアを体系化した概念体系を用いて前記ユーザへの影響度を分析する方法、装置及びプログラムであって、前記ユーザが複数のコンテンツを一覧として閲覧した第１のコンテンツリストと、前記第１のコンテンツリストからコンテンツの本体を閲覧した第２のコンテンツリストとをクラスタ化し、前記クラスタ毎に、前記第１のコンテンツリストのコンテンツの総数を第１の総数とし、前記第１のコンテンツリストにおいて前記他ユーザによりお薦め情報が付与されたコンテンツが出現する数を第１の出現数とし、前記第２のコンテンツリストのコンテンツの総数を第２の総数とし、前記第２のコンテンツリストにおいて前記お薦め情報が付与されたコンテンツが出現する数を第２の出現数としたとき、前記第１の総数、前記第１の出現数、及び前記第２の総数の条件下で、前記第２のコンテンツリストに前記お薦め情報が付与されたコンテンツの出現数が、前記第２の出現数以上となる第１の確率及び前記第２の出現数以下となる第２の確率を算出し、前記第１の確率及び前記第２の確率のうちのいずれか一方をもとに標準正規分布の累積分布関数の逆関数により特徴スコアを算出し、前記クラスタ毎に算出した特徴スコアを用いて前記概念に対する前記影響度スコアを更新する。 In order to achieve the above object, the present invention takes the following measures.
A first aspect of the present invention is a method, apparatus, and program for analyzing the degree of influence on a user by using a concept system that systematizes the degree of influence score that the user receives from other users for a plurality of concepts. Then, the first content list in which the user browses a plurality of contents as a list and the second content list in which the content body is browsed from the first content list are clustered, and for each cluster, the first content list The total number of contents in one content list is defined as a first total number, the number of appearances of content to which recommendation information is given by the other user in the first content list is defined as a first appearance number, and the second content The total number of contents in the list is set as the second total number, and the recommended information in the second content list is displayed. When the number of appearances of content to which the number is given is defined as the second appearance number, the second content list includes the first total number, the first appearance number, and the second total number. Calculating a first probability that the number of appearances of the content to which the recommendation information is given is equal to or greater than the second number of occurrences and a second probability that the number of appearances is equal to or less than the second number of occurrences; A feature score is calculated by an inverse function of a cumulative distribution function of a standard normal distribution based on any one of the second probabilities, and the influence score for the concept is calculated using the feature score calculated for each cluster. Update.

第１の態様によれば、お薦め情報が付与されたコンテンツの出現の希少性を利用してユーザへの他ユーザ（個人あるいは集団）からの影響度を高精度に推定できる影響度分析方法及び影響度分析装置を提供することができる。 According to the first aspect, the influence analysis method and the influence that can accurately estimate the influence from another user (individual or group) on the user using the scarcity of appearance of the content to which the recommended information is given A degree analyzer can be provided.

本発明の第２の態様は、前記第１の態様の前記算出ステップにおいて、前記第１のコンテンツリストにおいて前記お薦め情報が付与され、かつ所定の概念が出現するコンテンツの数を前記第１の出現数とし、前記第２のコンテンツリストにおいて前記お薦め情報が付与され、かつ前記所定の概念が出現するコンテンツの数を前記第２の出現数とする。
第２の態様によれば、他者からの影響度の推定において、各概念や複数の概念が属するジャンル毎の影響度を推定することが可能となる。 According to a second aspect of the present invention, in the calculation step of the first aspect, the number of contents in which the recommendation information is given in the first content list and a predetermined concept appears is the first appearance. The number of contents in which the recommendation information is given in the second content list and the predetermined concept appears is the second appearance number.
According to the second aspect, in the estimation of the degree of influence from others, it is possible to estimate the degree of influence for each concept or each genre to which a plurality of concepts belong.

本発明の第３の態様は、前記第１の態様の前記算出ステップにおいて、前記第１のコンテンツリストにおいて前記他ユーザの属する所定の集合により前記お薦め情報が付与されたコンテンツが出現する数を前記第１の出現数とし、前記第２のコンテンツリストにおいて前記所定の集合により前記お薦め情報が付与されたコンテンツが出現する数を前記第２の出現数とする。
第３の態様によれば、人間体系を用いたコンテンツ閲覧履歴分析において、コンテンツにお薦め情報を付与した個人だけでなく、組織やグループ属性についても、統計的な観点で合理的かつ的確に分析可能となる。また、特定の個人が付与するお薦め情報は少ないために、影響度分析を行うために十分な量の履歴が確保できない、という場合であっても、複数ユーザが属する所定の集合単位で影響度を分析することにより、十分な履歴を確保できる。 According to a third aspect of the present invention, in the calculation step of the first aspect, the number of appearances of the content to which the recommendation information is added by the predetermined set to which the other user belongs in the first content list is calculated. The number of appearances is the first appearance number, and the number of appearances of the content to which the recommendation information is given by the predetermined set in the second content list is the second appearance number.
According to the third aspect, in the content browsing history analysis using the human system, it is possible to analyze not only the individual who gave the recommended information to the content but also the organization and group attributes reasonably and accurately from a statistical viewpoint. It becomes. Even if it is difficult to secure a sufficient amount of history to perform an impact analysis because there is little recommended information given by a specific individual, the impact level can be set for a given set unit to which multiple users belong. By analyzing it, a sufficient history can be secured.

本発明の第４の態様は、前記第１乃至３のいずれかの態様において、前記クラスタ毎に、前記第１のコンテンツリストのコンテンツの総数を第１の総数と、前記第１のコンテンツリストにおいて前記概念が出現するコンテンツの数を第１の出現数と、前記第２のコンテンツリストのコンテンツの総数を第２の総数と、前記第２のコンテンツリストにおいて前記概念が出現するコンテンツの数を第２の出現数としたとき、前記第１の総数、前記第１の出現数、及び前記第２の総数の条件下で、前記第２のコンテンツリストに前記概念が出現するコンテンツの数が、前記第２の出現数以上となる第１の確率及び前記第２の出現数以下となる第２の確率を算出し、前記第１の確率及び前記第２の確率のうちのいずれか一方をもとに標準正規分布の累積分布関数の逆関数により特徴スコアを算出し、前記クラスタ毎に算出した特徴スコアを用いて前記概念に対する前記ユーザ興味スコアを更新し、前記概念が出現するコンテンツについて、当該コンテンツに出現する各概念の前記ユーザ興味スコア及び前記影響度スコアを用いて、当該コンテンツに対する評価スコアを算出することをさらに有する。 According to a fourth aspect of the present invention, in any one of the first to third aspects, the total number of contents in the first content list is the first total number and the first content list for each cluster. The number of contents in which the concept appears is the first number of appearances, the total number of contents in the second content list is the second total number, and the number of contents in which the concept appears in the second content list is the first number. When the number of appearances is 2, the number of contents in which the concept appears in the second content list under the conditions of the first total number, the first number of appearances, and the second total number is second appearance number or more to become the first probability and to calculate a second probability of the second number of occurrences below, based on either one of the first probability and the second probability Standard normal distribution A feature score is calculated by an inverse function of a distribution function, the user interest score for the concept is updated using the feature score calculated for each cluster, and for the content in which the concept appears, for each concept that appears in the content An evaluation score for the content is further calculated using the user interest score and the influence score.

第４の態様によれば、ユーザの興味とユーザへの他ユーザからの影響度を推定し、これらの推定結果を融合して、融合された推定結果に基づいて情報をパーソナライズし、ユーザへ情報を提示することが可能となる。ユーザの興味スコアと、ユーザが他ユーザから受ける影響を表す影響度スコアは、確率をスコア化したものであり、同次元で扱うことができる。従って、新規のコンテンツに対して、ユーザへ推薦を行うために、提示対象となるコンテンツに与えられている各概念と、他ユーザから付与されたお薦め情報に基づき、情報に対する評価スコアをスコア更新式を用いて算出する。各コンテンツに対して算出した評価スコアを用いて、ユーザへの提示対象コンテンツのソート等を行うことが可能となる。 According to the fourth aspect, the user's interest and the degree of influence from other users on the user are estimated, these estimation results are merged, information is personalized based on the merged estimation results, and information is sent to the user. Can be presented. The user's interest score and the influence score representing the influence that the user receives from other users are scored probabilities and can be handled in the same dimension. Therefore, in order to recommend new content to the user, a score update formula for an evaluation score for the information based on each concept given to the content to be presented and recommendation information given by other users Calculate using. It is possible to sort the content to be presented to the user using the evaluation score calculated for each content.

すなわちこの発明によれば、ユーザの興味とユーザへの他ユーザからの影響を考慮することで、ユーザに対して実際の社会生活や選択行動に近しい情報推薦サービスを可能にする影響度分析方法、影響度分析装置及びそのプログラムを提供することができる。 That is, according to the present invention, an influence analysis method that enables an information recommendation service that is close to the actual social life and selection behavior for the user by considering the user's interests and the influence of other users on the user, It is possible to provide an influence analysis apparatus and a program thereof.

本発明に係る影響度分析装置を用いた情報提示システムの全体構成図。1 is an overall configuration diagram of an information presentation system using an influence analysis apparatus according to the present invention. 図１の情報提示システムの各装置の機能構成を示すブロック図。The block diagram which shows the function structure of each apparatus of the information presentation system of FIG. 興味分析装置の処理概要を示す図。The figure which shows the process outline | summary of an interest analyzer. 興味の特徴スコア算出部の動作を説明するための模式図。The schematic diagram for demonstrating operation | movement of the feature score calculation part of interest. 特徴スコア算出処理の詳細を示す図。The figure which shows the detail of a characteristic score calculation process. 概念体系更新処理の詳細を示す図。The figure which shows the detail of a concept system update process. コンテンツに付与されるオススメタグを示す図。The figure which shows the recommendation tag provided to content. 一覧閲覧コンテンツリストと詳細閲覧コンテンツリストを示す図。The figure which shows a list browsing content list and a detailed browsing content list. 第１実施例に係る影響度分析処理を示す図。The figure which shows the influence analysis process which concerns on 1st Example. 第２実施例に係る影響度分析処理を示す図。The figure which shows the influence analysis process which concerns on 2nd Example. 第３実施例に係る影響度分析処理を示す図。The figure which shows the influence analysis process which concerns on 3rd Example. 第４実施例に係る影響度分析処理を示す図。The figure which shows the influence analysis process which concerns on 4th Example. コンテンツスコアリング処理を示す図。The figure which shows a content scoring process.

以下、図面を参照してこの発明に係る実施形態を説明する。
図１は本発明に係る影響度分析装置を用いた情報提示システムの全体構成図である。このシステムは、クライアント端末４００と、コンテンツサーバ５００と、興味分析装置１００と、影響度分析装置２００と、コンテンツスコアリング装置３００を備える。クライアント端末４００とコンテンツサーバ５００との間、及びコンテンツサーバ５００と興味分析装置１００との間、及び、コンテンツサーバ５００と影響度分析装置２００との間、及び、コンテンツサーバ５００とコンテンツスコアリング装置３００との間はそれぞれ通信ネットワークで接続される。ユーザは、クライアント端末４００での閲覧操作により、所望のコンテンツをコンテンツサーバ５００から取得し、取得したコンテンツをクライアント端末４００の画面に提示して閲覧する。 Embodiments according to the present invention will be described below with reference to the drawings.
FIG. 1 is an overall configuration diagram of an information presentation system using an influence analysis apparatus according to the present invention. This system includes a client terminal 400, a content server 500, an interest analysis device 100, an influence analysis device 200, and a content scoring device 300. Between the client terminal 400 and the content server 500, between the content server 500 and the interest analysis device 100, between the content server 500 and the influence analysis device 200, and between the content server 500 and the content scoring device 300. Are connected by a communication network. The user acquires desired content from the content server 500 by browsing operation on the client terminal 400, and presents the acquired content on the screen of the client terminal 400 for browsing.

クライアント端末４００は、ユーザ操作によるコンテンツ閲覧履歴を収集し、複数のコンテンツを一覧として閲覧した一覧閲覧コンテンツリスト（第１のコンテンツリスト）と、コンテンツ一覧からコンテンツの本体を閲覧した詳細閲覧コンテンツリスト（第２のコンテンツリスト）とをコンテンツサーバ５００に送信する。コンテンツサーバ５００は、この一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストを、通信ネットワークを介して、興味分析装置１００と影響度分析装置２００に転送する。 The client terminal 400 collects a content browsing history by a user operation, and browses a list browsing content list (first content list) in which a plurality of contents are browsed as a list, and a detailed browsing content list (in which a content body is browsed from the content list ( 2nd content list) is transmitted to the content server 500. The content server 500 transfers the list browsing content list and the detailed browsing content list to the interest analysis device 100 and the impact analysis device 200 via the communication network.

興味分析装置１００は、上記一覧閲覧コンテンツリストおよび詳細閲覧コンテンツリストをもとに、コンテンツに出現する各概念に対する興味の特徴スコア及びユーザ興味スコアを算出し、ユーザの興味を推定する。
影響度分析装置２００は、上記一覧閲覧コンテンツリストおよび詳細閲覧コンテンツリストをもとに、コンテンツに“オススメタグ”を付与した他ユーザ（個人あるいは集団）に対する影響度の特徴スコア及び影響度スコアを算出し、当該ユーザへの他ユーザ（個人あるいは集合）からの影響度を推定する。 The interest analysis device 100 calculates a feature score of interest and a user interest score for each concept appearing in the content based on the list browsing content list and the detailed browsing content list, and estimates the user's interest.
The influence analysis device 200 calculates the characteristic score and the influence score of the influence on other users (individuals or groups) who added “recommended tags” to the content based on the list browsing content list and the detailed browsing content list. Then, the degree of influence from other users (individuals or groups) on the user is estimated.

コンテンツスコアリング装置３００は、興味分析装置１００にユーザ興味スコアを問い合わせ、また、影響度分析装置２００にユーザへの他ユーザ（個人あるいは集団）からの影響度スコアを問い合わせ、コンテンツサーバ５００から受け取った「提示コンテンツリスト」から、ユーザの興味と他ユーザ（個人あるいは集団）からの影響度とを考慮してソートを行ったコンテンツのリスト（ソート済み提示コンテンツリスト）を生成し、コンテンツサーバ５００に送信する。 The content scoring apparatus 300 inquires of the interest analysis apparatus 100 about the user interest score, and inquires the influence analysis apparatus 200 about the influence degree score from another user (individual or group) to the user, and receives it from the content server 500. From the “presentation content list”, a list of sorted contents (sorted presented content list) in consideration of the user's interest and the influence from other users (individuals or groups) is generated and transmitted to the content server 500. To do.

図２は、図１に示す情報提示システムの各装置の機能構成を示すブロック図である。なお、図２における各処理部は、例えば、各装置のＣＰＵ（Central Processing Unit）とメモリ上で実行される制御プログラムにより実現する。
［興味分析装置］
興味分析装置１００は、履歴情報受信部１１０、興味の特徴スコア算出部１２０、興味概念体系更新処理部１３０、及び概念体系／ユーザ興味スコアデータベース１４０を備える。概念体系／ユーザ興味スコアデータベース１４０は、ルート概念ノードＩＤと、概念体系テーブルと、ユーザ興味スコアテーブルとを有する。 FIG. 2 is a block diagram showing a functional configuration of each device of the information presentation system shown in FIG. Each processing unit in FIG. 2 is realized by, for example, a CPU (Central Processing Unit) of each device and a control program executed on a memory.
[Interest analysis device]
The interest analysis apparatus 100 includes a history information receiving unit 110, an interest feature score calculation unit 120, an interest concept system update processing unit 130, and a concept system / user interest score database 140. The concept system / user interest score database 140 includes a root concept node ID, a concept system table, and a user interest score table.

ルート概念ノードＩＤとは、概念体系構造において最上位にある概念ノードＩＤである。システム内に１つだけ存在する。
概念体系テーブルは、自概念ＩＤ、親概念ＩＤリスト、及び子概念ＩＤリストを格納する。概念体系内の全ての自概念ＩＤは、親概念ＩＤ及び子概念ＩＤ（ただし、自概念が最下位の場合には子概念ＩＤは無し）と紐付けて保存されており、これにより概念構造が定義される。
ユーザ興味スコアテーブルは、概念ＩＤ、ユーザＩＤ（クライアント端末ＩＤ）、ＴｏｔａｌＺ（ユーザ興味スコア）の値を格納する。ＴｏｔａｌＺの定義及び算出方法は後述する。 The root concept node ID is a concept node ID at the highest level in the concept system structure. There is only one in the system.
The concept system table stores a self-concept ID, a parent concept ID list, and a child concept ID list. All the self-concept IDs in the concept system are stored in association with a parent concept ID and a child concept ID (however, if the self-concept is the lowest, there is no child concept ID). Defined.
The user interest score table stores values of concept ID, user ID (client terminal ID), and TotalZ (user interest score). The definition and calculation method of TotalZ will be described later.

図３は、興味分析装置１００の処理を示したものである。
履歴情報受信部１１０は、クライアント端末４００からの一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストを、コンテンツサーバ５００を介して受信する。一覧閲覧コンテンツリストとは、例えば、ユーザがコンテンツのタイトルのみを一覧で閲覧したコンテンツのリストである。詳細閲覧コンテンツリストとは、ユーザがコンテンツ本体の内容（詳細）を閲覧したコンテンツのリストである。例えば、図３において、一覧閲覧コンテンツリストには、コンテンツ１〜８が含まれ、詳細閲覧コンテンツリストには、コンテンツ１，３，４が含まれる。また、図３において、斜線パターンで示すコンテンツは、概念Ｂがコンテンツ１，６，７，８に出現することを示す。 FIG. 3 shows processing of the interest analysis device 100.
The history information receiving unit 110 receives the list browsing content list and the detailed browsing content list from the client terminal 400 via the content server 500. The list browsing content list is, for example, a list of content in which the user browses only the content titles in a list. The detailed browsing content list is a list of content that the user has viewed the content (details) of the content body. For example, in FIG. 3, the list browsing content list includes contents 1 to 8, and the detailed browsing content list includes contents 1, 3, and 4. In FIG. 3, the content indicated by the hatched pattern indicates that the concept B appears in the content 1, 6, 7, and 8.

興味の特徴スコア算出部１２０は、一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストを利用して概念選択の統計モデルにより各概念の興味の特徴スコア（後述するＺ値）を算出する。
図４（ａ）に分析パラメータ抽出処理の模式図を示す。例えば、５０個（＝Ｓ）のコンテンツが一覧表示されている中から、ユーザが１０個（＝a）のコンテンツの詳細を閲覧した場合を示す。ここで、一覧表示されている５０個のコンテンツのうち「野球」という概念が含まれている記事が１５個（＝N）あり、ユーザが閲覧した１０個のコンテンツのうち、「野球」という概念が含まれているコンテンツが５個（＝ｎ）あったことを示す。 The interest feature score calculation unit 120 calculates an interest feature score (Z value to be described later) of each concept using a statistical model of concept selection using the list browsing content list and the detailed browsing content list.
FIG. 4A shows a schematic diagram of the analysis parameter extraction process. For example, a case where the user browses details of 10 (= a) contents from a list of 50 (= S) contents is shown. Here, there are 15 articles (= N) that contain the concept of “baseball” among the 50 contents displayed in a list, and the concept of “baseball” among the 10 contents viewed by the user. This indicates that there are five (= n) contents including “”.

興味の特徴スコア算出部１２０は、上記分析パラメータＳ，ａ，Ｎ，ｎを利用して概念ＩＤ毎に特徴スコアＺを算出する。図５に興味の特徴スコア算出処理の詳細を示す。図５において、ｉは概念の識別子、ｊは、クラスタＩＤを示す。
ここで、クラスタとは、一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストに一意に付与される識別子である。別の時刻（時間帯）に表示した一覧閲覧コンテンツをユーザが閲覧した場合は、別のクラスタＩＤが付与される。なお、時刻以外の条件でクラスタＩＤを新たに付与する条件としては、一覧閲覧コンテンツリスト表示中に一定時間操作が無かった場合や、閲覧するユーザ（ユーザＩＤ）を切り替えた場合、一覧閲覧コンテンツリストに対して、コンテンツジャンル等を観点に絞り込み検索を掛けた場合、その他閲覧アプリケーションにおいて閲覧モードを切り替えた場合がある。 The interest feature score calculation unit 120 calculates the feature score Z for each concept ID using the analysis parameters S, a, N, and n. FIG. 5 shows details of the feature score calculation process of interest. In FIG. 5, i is a concept identifier, and j is a cluster ID.
Here, the cluster is an identifier uniquely assigned to the list browsing content list and the detailed browsing content list. When the user browses the list browsing content displayed at another time (time zone), another cluster ID is given. The conditions for newly assigning the cluster ID under conditions other than the time include when there is no operation for a certain period of time while the list browsing content list is displayed, or when the browsing user (user ID) is switched, the list browsing content list On the other hand, when a narrow search is performed from the viewpoint of the content genre or the like, the browsing mode may be switched in other browsing applications.

Ｈ１（第１の確率）は、一覧閲覧コンテンツリストに含まれる一覧閲覧コンテンツの総数Ｓ、一覧閲覧コンテンツのうち概念ｉが出現するコンテンツ数Ｎのとき、詳細閲覧コンテンツをａ個ランダム選択して閲覧した場合に、概念ｉが出現する詳細閲覧コンテンツの数がｎ以上となる累積確率である。Ｈ２（第２の確率）は、一覧閲覧コンテンツリストに含まれる一覧閲覧コンテンツの総数Ｓ、一覧閲覧コンテンツのうち概念ｉが出現するコンテンツ数Ｎのとき、詳細閲覧コンテンツをａ個ランダム選択して閲覧した場合に、概念ｉが出現する詳細閲覧コンテンツの数がｎ以下となる累積確率である。なお、本実施形態では、累積確率Ｈ１及びＨ２は、超幾何分布により求めるが、この手法に限定するものではない。他の分布の例としては、二項分布、正規分布が存在する。 When H1 (first probability) is the total number S of the list browsing contents included in the list browsing content list and the number N of the contents of the list browsing content where the concept i appears, a detailed browsing content is randomly selected and viewed. In this case, the cumulative probability that the number of detailed browsing contents in which the concept i appears is n or more. When H2 (second probability) is the total number S of the list browsing contents included in the list browsing content list and the number of contents N in which the concept i appears in the list browsing contents, a detailed browsing content is randomly selected and viewed. In this case, the cumulative probability that the number of detailed browsing contents in which the concept i appears is n or less. In the present embodiment, the cumulative probabilities H1 and H2 are obtained by the hypergeometric distribution, but are not limited to this method. Examples of other distributions include a binomial distribution and a normal distribution.

図４（ｂ）に示すように、例えば、上記の分析パラメータＳ、Ｎ、ａ、ｎを用いて、ユーザが閲覧した１０個のコンテンツのうち、「野球」という概念が含まれるコンテンツが５以上である確率が、「０．１２」であることを示す。ここで、「０．１２」は、累積確率Ｈ１の値に相当する。 As shown in FIG. 4B, for example, there are five or more contents including the concept of “baseball” among the ten contents browsed by the user using the analysis parameters S, N, a, and n described above. It is shown that the probability of being “0.12”. Here, “0.12” corresponds to the value of the cumulative probability H1.

なお、Ｈ２の値を使う例として、上記の分析パラメータでｎが０である場合を考える。この場合は、出現数が０以下の場合の確率を算出する。具体的には、図４（ｂ）において横軸が０の項目の値となるため「０．０２」となる。
そして、興味の特徴スコア算出部１２０は、図５に示すように、上記算出した累積確率Ｈ１及びＨ２を用いて、標準正規分布の累積分布関数の逆関数により興味の特徴スコアＺを算出する。図４（ｃ）に示すように、上記Ｈ１を累積確率とする標準正規分布の累積分布関数の逆関数により特徴スコアＺを求める。なお、累積確率としてＨ２を利用する場合には、標準正規分布の累積分布関数の逆関数の返値の符号を負にして特徴スコアＺを求める。この特徴スコアＺを用いて、後述する興味概念体系更新処理部１３０は、「野球」という概念に対するユーザ興味スコア（ＴｏｔａｌＺ）を求める。 As an example of using the value of H2, consider the case where n is 0 in the above analysis parameters. In this case, the probability when the number of appearances is 0 or less is calculated. Specifically, in FIG. 4B, the horizontal axis is the value of the item whose value is 0, so “0.02”.
Then, as shown in FIG. 5, the interest feature score calculation unit 120 calculates the interest feature score Z by the inverse function of the cumulative distribution function of the standard normal distribution using the calculated cumulative probabilities H1 and H2. As shown in FIG. 4C, the feature score Z is obtained by the inverse function of the cumulative distribution function of the standard normal distribution with H1 as the cumulative probability. When H2 is used as the cumulative probability, the feature score Z is obtained with the sign of the return value of the inverse function of the standard normal distribution cumulative distribution function being negative. Using this feature score Z, the interest concept system update processing unit 130 described later obtains a user interest score (TotalZ) for the concept of “baseball”.

興味の特徴スコア算出部１２０は、更新対象概念リストを生成し、興味概念体系更新処理部１３０に出力する。「更新対象概念リスト」とは、概念ＩＤ、前記で算出した特徴スコアＺ、及び重みｗのセットである。なお、この更新対象概念リストに出現する概念ＩＤが、次の概念体系更新処理で更新対象のノード（概念）となる。上位概念を追加した出現概念リスト内の概念ＩＤすべてについて、特徴スコアＺと重みｗを算出する。重みｗは、各クラスタＩＤにおいて概念毎に設定される値である。 The interest feature score calculation unit 120 generates an update target concept list and outputs it to the interest concept system update processing unit 130. The “update target concept list” is a set of a concept ID, the characteristic score Z calculated above, and a weight w. The concept ID appearing in the update target concept list becomes a node (concept) to be updated in the next concept system update process. The feature score Z and the weight w are calculated for all the concept IDs in the appearance concept list to which the superordinate concept is added. The weight w is a value set for each concept in each cluster ID.

なお、重みｗは、初期値ｗ＝１とし、ユーザの特徴的な操作等が有った場合に、以下のように値を変化させることができる。例えば、ユーザは、クライアント端末４００に提示されたコンテンツに対し、お気に入りコンテンツとしての登録や、コンテンツへの評価入力等ができる。クライアント端末４００が、このような操作履歴を興味分析装置１００に送信できる場合には以下の処理を行う。 Note that the weight w can be changed as follows when the initial value w = 1 and a user's characteristic operation is performed. For example, the user can register the content presented on the client terminal 400 as a favorite content, input an evaluation for the content, and the like. When the client terminal 400 can transmit such an operation history to the interest analysis apparatus 100, the following processing is performed.

興味の特徴スコア算出部１２０は、例えば、コンテンツがお気に入りに登録されたとき、そのコンテンツが含む全ての概念ＩＤについて重みｗをｗ＝１．５のように増加させる。その他にも、コンテンツ閲覧時刻、閲覧時の天気、気温、湿度、季節、曜日、休日、余暇かどうか、閲覧時のユーザ位置情報、スケジューラ、日記等から収集したイベント情報に応じて重みｗの値を変えることもできる。 For example, when the content is registered as a favorite, the interest feature score calculation unit 120 increases the weight w such that w = 1.5 for all concept IDs included in the content. In addition to the content browsing time, browsing weather, temperature, humidity, season, day of the week, holiday, leisure time, user location information at browsing, scheduler, diary, etc., the value of weight w Can also be changed.

興味概念体系更新処理部１３０は、上記特徴スコアを用いて概念体系における概念間の関係情報（上位概念及び下位概念）に基づいて各概念に対するユーザ興味スコアを更新する。概念体系のグラフに含まれるノードは概念を表し、リンクは概念間の関係を表す。ユーザ興味スコアは、概念体系における各概念に対応するノードの値として保持する。概念体系において、上位に位置するノードほど抽象的な概念を表し、下位に位置するノードほど具体的な概念を表す。概念体系及び概念ＩＤ（ノード毎に付与される識別子）は、サービス運用者等が事前に設計し定義するものとする。 The interest concept system update processing unit 130 updates the user interest score for each concept based on the relationship information (superordinate concept and subordinate concept) in the concept system using the feature score. Nodes included in the graph of the concept system represent concepts, and links represent relationships between concepts. The user interest score is held as a value of a node corresponding to each concept in the concept system. In the concept system, the nodes located at the higher level represent the abstract concept, and the nodes located at the lower level represent the specific concept. The concept system and concept ID (identifier assigned to each node) are designed and defined in advance by a service operator or the like.

図６に興味概念体系更新処理部１３０の処理の詳細を示す。興味概念体系更新処理部１３０は、コンテンツに出現した概念（出現概念）、及びこの出現概念の上位概念の概念ＩＤについて、図６に示す各概念ｉに対するユーザ興味スコア更新式を用いて、ユーザ興味スコアＴｏｔａｌＺ_ｉｎ，及びＸ_{ｉ（ｎ−１）}，Ｙ_{ｉ（ｎ−１）}の値を求め、概念体系／ユーザ興味スコアデータベース１４０の、クラスタＩＤに対応するユーザＩＤ（クライアント端末ＩＤ）のカラムに対応するユーザ興味スコアテーブルに格納されている各値を更新する。 FIG. 6 shows details of processing of the interest concept system update processing unit 130. The interest concept system update processing unit 130 uses the user interest score update formula for each concept i shown in FIG. 6 for the concept that has appeared in the content (appearance concept) and the concept ID of the superordinate concept of this appearance concept. The values of the scores TotalZ _in and X _{i (n−1)} , Y _{i (n−1)} are obtained, and the column of the user ID (client terminal ID) corresponding to the cluster ID in the conceptual system / user interest score database 140 is obtained. Each value stored in the corresponding user interest score table is updated.

ここで、Ｘ_{ｉ（ｎ−１）}は、各概念ＩＤ（ここでは識別子ｉで表現）に対する、過去の（前回までの）前記更新対象概念リストの重みｗの二乗の合計である。Ｙ_{ｉ（ｎ−１）}は、同様に各概念ＩＤ（ここでは識別子ｉで表現）に対する、過去の前記更新対象概念リストの重みｗと特徴スコアＺの乗算の合計である。 Here, X _{i (n−1)} is the sum of the squares of the weights w of the update target concept list in the past (up to the previous time) for each concept ID (represented by the identifier i here). Similarly, Y _{i (n−1)} is the sum of multiplication of the weight w of the past update target concept list and the feature score Z for each concept ID (represented by identifier i here).

図６において、ｎは、概念体系更新処理が何度目かを示す識別子である。ユーザ興味スコアＴｏｔａｌＺを求める一連の処理は、クラスタＩＤ単位で行なわれ、この一連の処理が行なわれる単位を１度と数えるとき、ｎはこの一連の処理が何度目に行なわれたものであるかを示す識別子である。ｉは、概念ＩＤの識別子である。Ｚ_ｉｎは、概念ｉの各更新処理に利用するＺ値である。なお、上記Ｚ_ｉｊは一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリスト毎のＺ値であり、Ｚ_ｉｊ∈Ｚ_ｉｎの関係である。重みｗ_ｉｎは、概念ｉの各更新処理に利用する重みである。上記重みｗと同じであり、上記興味の特徴スコア算出部１２０で設定したものと同様である。 In FIG. 6, n is an identifier indicating how many times the concept system update process is performed. A series of processes for obtaining the user interest score TotalZ is performed in units of cluster IDs. When the unit in which this series of processes is performed is counted once, n is the number of times this series of processes has been performed. Is an identifier. i is an identifier of a concept ID. Z _in is a Z value used for each update process of concept i. Note that Z _ij is a Z value for each of the list browsing content list and the detailed browsing content list, and has a relationship of Z _ij εZ _in . Weight w _in is the weight to be used in each process of updating the concept i. The weight w is the same as that set by the interesting feature score calculation unit 120.

例えば、ｗ_ｉｎは、お気に入り登録、他ユーザとの共有等の閲覧以外の特殊な操作をユーザが行った場合、及びコンテンツ閲覧時間（閲覧開始から終了までの間隔）、コンテンツと概念の関連度合い、コンテンツ閲覧時刻、閲覧時の天気・気温・湿度・季節・曜日・休日・余暇かどうか、閲覧時のユーザ位置情報、スケジューラ・日記等から収集したイベント情報に応じて値を変化させる。その他サービス利用者、サービス運用者が特に指定した場合にも変化させる。 For example, w _in the favorite registration, if the user a special operation other than the inspection of the public, such as with the other user has performed, and (interval to the end from the viewing start) content viewing time, related the degree of content and concepts, The value is changed in accordance with the content browsing time, weather / temperature / humidity / season / day of the week / holiday / leisure at the time of browsing, user location information at the time of browsing, event information collected from a scheduler / diary and the like. It is also changed when specified by other service users and service operators.

さらに、興味概念体系更新処理部１３０は、「更新対象概念リスト」の各概念ＩＤ（出現概念及び上位概念）の下位概念を抽出し、下位概念のノード値を更新する。下位概念の抽出では、「更新対象概念リスト」の各概念ＩＤについて、概念体系／ユーザ興味スコアデータベース１４０の概念体系テーブルを参照し、子概念ＩＤリストから概念ＩＤのリストを抽出し、さらに各子概念ＩＤリストの概念ＩＤについて概念体系テーブルを参照して子概念リストを抽出する処理を繰り返す。 Furthermore, the interest concept system update processing unit 130 extracts the subordinate concept of each concept ID (appearance concept and superordinate concept) in the “update target concept list”, and updates the node value of the subordinate concept. In the extraction of subordinate concepts, for each concept ID of the “update target concept list”, the concept system table of the concept system / user interest score database 140 is referred to, a list of concept IDs is extracted from the child concept ID list, and each child ID is further extracted. The process of extracting the child concept list with reference to the concept system table for the concept ID of the concept ID list is repeated.

下位概念の興味度の更新に利用する特徴スコアＺは、例えば、隣接した親ノードのうち特徴スコアの絶対値が最も大きい値を利用、最も近い上位ノードの値を利用、親ノードの値を平均、または確率結合した値とする。なお、「更新対象概念リスト」のうち、上記で更新済みの概念（コンテンツに出現した概念、及び上位概念）のユーザ興味スコアは更新しない。 The feature score Z used to update the interest level of the lower concept is, for example, the value having the largest absolute value of the feature score among the adjacent parent nodes, the value of the closest higher node, the average of the values of the parent nodes Or a probability-coupled value. In the “update target concept list”, the user interest score of the updated concept (the concept that appears in the content and the superordinate concept) is not updated.

［クライアント端末］
図２において、クライアント端末４００は、履歴収集部４１０、履歴情報送信部４２０、コンテンツ提示部４３０、コンテンツ要求送信部４４０、及びオススメタグ付与部４５０を備える。
コンテンツ要求送信部４４０は、ユーザの指示（入力）によりコンテンツサーバ５００に対して、コンテンツの提示要求を行う。具体的には、例えば、クライアント端末ＩＤ（もしくはユーザＩＤ）及び要求時刻を含むコンテンツ要求データをコンテンツサーバ５００に送信する。なお、要求時刻は、コンテンツサーバ５００において追加するようにしてもよい。クライアント端末ＩＤ（もしくはユーザＩＤ）は、端末（もしくはユーザ）毎に一意に付与される数字であって、後述する概念体系／ユーザ興味スコアデータベース１４０のユーザ興味スコアテーブルのユーザＩＤと一致するＩＤである。 [Client terminal]
2, the client terminal 400 includes a history collection unit 410, a history information transmission unit 420, a content presentation unit 430, a content request transmission unit 440, and a recommendation tag addition unit 450.
The content request transmission unit 440 makes a content presentation request to the content server 500 in accordance with a user instruction (input). Specifically, for example, content request data including a client terminal ID (or user ID) and a request time is transmitted to the content server 500. The request time may be added in the content server 500. The client terminal ID (or user ID) is a number uniquely assigned to each terminal (or user), and is an ID that matches the user ID of the user interest score table of the conceptual system / user interest score database 140 described later. is there.

履歴収集部４１０は、上述したように、ユーザの操作履歴を収集して一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストを作成する。履歴情報送信部４２０は、履歴収集部４１０により作成された一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストをコンテンツサーバ５００に送信する。 As described above, the history collection unit 410 collects user operation histories and creates a list browsing content list and a detailed browsing content list. The history information transmission unit 420 transmits the list browsing content list and the detailed browsing content list created by the history collection unit 410 to the content server 500.

オススメタグ付与部４５０は、クライアント端末上に表示されている一覧表示コンテンツリスト及び詳細閲覧コンテンツについて、ユーザが他者にコンテンツを推薦したい場合に、ユーザの操作入力によりコンテンツにオススメタグを付与する。コンテンツにオススメタグが付与された旨の情報（お薦め情報）は、オススメタグ入力部５７０に送信される。 The recommended tag adding unit 450 adds a recommended tag to the content by the user's operation input when the user wants to recommend the content to the other person regarding the list display content list and the detailed browsing content displayed on the client terminal. Information indicating that a recommended tag has been added to the content (recommended information) is transmitted to the recommended tag input unit 570.

他ユーザからの影響度の推定のための準備として、ユーザが他ユーザ（個人あるいは集団）からのオススメタグが見られるようにしておく。すなわち、他ユーザは、閲覧した情報を誰かに薦めたい場合に、オススメタグを付与できるようにし、タグが付与されたかどうかは、他のユーザから閲覧できるようにしておく。 As a preparation for estimating the degree of influence from other users, the user can see recommended tags from other users (individuals or groups). That is, when another user wants to recommend the browsed information to someone, a recommendation tag can be assigned, and whether or not a tag has been assigned can be browsed by another user.

図７に、コンテンツに付与されるオススメタグを示す。例えば、Ａさんが他ユーザへのオススメ対象として、「情報２」、「情報５」にオススメタグを付与すると、ＡＡさんの一覧閲覧コンテンツリストおよび詳細閲覧コンテンツリスト中の「情報２」、「情報５」にオススメタグ（Ａさんオススメ）が表示される。 FIG. 7 shows recommended tags assigned to the contents. For example, if Mr. A adds a recommendation tag to “information 2” and “information 5” as a recommendation target for other users, “information 2” and “information” in the list browsing content list and the detailed browsing content list of AA A recommended tag (Recommended by Mr. A) is displayed at “5”.

［コンテンツサーバ］
上記図２において、コンテンツサーバ５００は、コンテンツ送信処理部５１０、ソート済み提示コンテンツリスト受信部５２０、提示コンテンツリスト送信部５３０、提示コンテンツリスト入力部５４０、履歴情報転送部５５０、コンテンツ要求転送部５６０、オススメタグ入力部５７０、及びオススメタグ送信部５８０を備える。 [Content Server]
2, the content server 500 includes a content transmission processing unit 510, a sorted presentation content list reception unit 520, a presentation content list transmission unit 530, a presentation content list input unit 540, a history information transfer unit 550, and a content request transfer unit 560. The recommended tag input unit 570 and the recommended tag transmission unit 580 are provided.

履歴情報転送部５５０は、クライアント端末４００から受信した一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストを通信ネットワークを介して興味分析装置１００に転送する。
提示コンテンツリスト入力部５４０には、サービス運用者により、ユーザの利用するクライアント端末４００に提示するコンテンツを一覧にした提示コンテンツリストが入力される。提示コンテンツリスト送信部５３０は、上記入力された提示コンテンツリストをコンテンツスコアリング装置３００へ通信ネットワークを介して送信する。 The history information transfer unit 550 transfers the list browsing content list and the detailed browsing content list received from the client terminal 400 to the interest analysis device 100 via the communication network.
The presentation content list input unit 540 receives a presentation content list that lists the content to be presented to the client terminal 400 used by the user by the service operator. The presented content list transmission unit 530 transmits the input presented content list to the content scoring device 300 via the communication network.

提示コンテンツリストは、コンテンツＩＤ、概念ＩＤ／関連度リスト、コンテンツ本体、コンテンツ登録時刻、及びお薦め情報を有する。コンテンツＩＤは、各コンテンツに対してコンテンツサーバ５００にて付与される一意のＩＤである。概念ＩＤ／関連度リストは、コンテンツに出現する概念の概念ＩＤ及び当該概念とコンテンツと関連性の程度を示す値のセットが格納される。概念ＩＤ／関連度リストは、コンテンツ毎に予め設定されており、具体例としては、コンテンツ１（スポーツ記事）には、｛“野球”の概念ＩＤ=１，関連度＝０．５｝、｛“サッカー”の概念ＩＤ=２，関連度＝０．８｝、｛“ゴルフ”の概念ＩＤ=３、関連度＝０．６｝…のように、概念ＩＤと関連度のセットが格納される。お薦め情報は、コンテンツＩＤに紐づいて格納される。 The presented content list includes a content ID, a concept ID / relevance list, a content body, a content registration time, and recommendation information. The content ID is a unique ID assigned by the content server 500 to each content. The concept ID / relationship degree list stores a concept ID of a concept that appears in content and a set of values indicating the degree of relevance between the concept and the content. The concept ID / relevance degree list is set in advance for each content. As a specific example, content 1 (sports article) includes {“baseball” concept ID = 1, relevance = 0.5}, { A set of concept ID and degree of association is stored as “soccer” concept ID = 2, degree of association = 0.8}, {“golf” concept ID = 3, degree of association = 0.6}. . The recommended information is stored in association with the content ID.

なお、概念ＩＤは、概念体系／ユーザ興味スコアデータベース１４０に格納される値と一致する。関連度は、例えば、０から１までの値とし、大きいほど関連性が強いものとする。関連度は、サービス運用者がコンテンツ登録時に設定する値、若しくは別システムにより算出される値を利用する。 The concept ID matches the value stored in the concept system / user interest score database 140. For example, the relevance is a value from 0 to 1, and the larger the relevance, the stronger the relevance. As the relevance, a value set by the service operator at the time of content registration or a value calculated by another system is used.

ソート済み提示コンテンツリスト受信部５２０は、ソート済みコンテンツスコアリスト送信部３１０から提示コンテンツリストの一部又は全部をソートしたソート済み提示コンテンツリストとクライアント端末ＩＤ（もしくはユーザＩＤ）を受信する。コンテンツ送信処理部５１０は、ソート済み提示コンテンツリストをクライアント端末ＩＤ（もしくはユーザＩＤ）に該当するクライアント端末４００に送信する。
コンテンツ要求転送部５６０は、クライアント端末４００のコンテンツ要求送信部４４０からのコンテンツ提示要求であるコンテンツ要求データをコンテンツ評価処理部３２０に転送する。 The sorted presentation content list receiving unit 520 receives a sorted presentation content list and a client terminal ID (or user ID) obtained by sorting a part or all of the presentation content list from the sorted content score list transmission unit 310. The content transmission processing unit 510 transmits the sorted presentation content list to the client terminal 400 corresponding to the client terminal ID (or user ID).
The content request transfer unit 560 transfers content request data that is a content presentation request from the content request transmission unit 440 of the client terminal 400 to the content evaluation processing unit 320.

［影響度分析装置］
影響度分析装置２００は、履歴情報受信部２１０と、人間関係データベース２２０と、影響度の特徴スコア算出部２３０と、各人・グループに対する影響度の概念体系更新処理部２４０と、各人・グループに対する影響度スコアを持つ影響度概念体系データベース２５０を備える。 [Effect analysis device]
The influence analysis device 200 includes a history information receiving unit 210, a human relation database 220, an influence characteristic score calculation unit 230, an influence degree conceptual system update processing unit 240 for each person / group, and each person / group. There is provided an influence concept system database 250 having an influence degree score for.

履歴情報受信部２１０は、履歴情報転送部５５０から一覧閲覧コンテンツリストと詳細閲覧コンテンツリストと、各リストに付与されたオススメタグ情報を取得する。オススメタグの付与された一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストの例を図８に示す。 The history information receiving unit 210 acquires a list browsing content list, a detailed browsing content list, and recommended tag information assigned to each list from the history information transfer unit 550. An example of a list browsing content list and a detailed browsing content list to which recommended tags are attached is shown in FIG.

図８において、例えば、ＡＡさんは、クライアント端末４００−１において、５０個（＝Ｓ）のコンテンツが一覧表示されている中から、ユーザが１０個（＝ａ）のコンテンツの詳細を閲覧した場合を考える。ここで、一覧表示されている５０個のコンテンツのうち、「野球」という概念が含まれているコンテンツが１５個（＝Ｎ）、また、「Ａさんオススメ」がついたコンテンツが１０個（＝Ｍ）あり、そのうち「Ａさんオススメ」かつ「野球」が含まれるコンテンツが６個（＝Ｐ）である。そして、ユーザが閲覧した１０個のコンテンツのうち、「Ａさんオススメ」のついたコンテンツが４個（＝ｍ）、「野球」という概念が含まれているコンテンツが５個（＝ｎ）、そのうち、「Ａさんオススメ」かつ「野球」がついたコンテンツが４個（＝ｐ）であるものとする。 In FIG. 8, for example, when AA browses details of 10 (= a) contents from a list of 50 (= S) contents displayed on the client terminal 400-1. think of. Here, out of the 50 contents displayed in the list, 15 contents (= N) containing the concept of “baseball” and 10 contents with “Recommended by Mr. A” (= M), of which 6 (= P) content includes “Recommended by Mr. A” and “Baseball”. Of the 10 contents browsed by the user, 4 contents (= m) with “Mr. A's recommendation”, 5 contents (= n) containing the concept of “baseball”, of which Suppose that there are four (= p) contents with “Recommended by Mr. A” and “Baseball”.

ユーザが情報を閲覧する際、オススメタグが付与されたコンテンツについて閲覧率が上昇した場合には、ユーザのオススメタグを付与した他ユーザからの影響度は正（＋）であるものとみなす。また、オススメタグが付与された情報についての閲覧率が下降した場合には、ユーザのオススメタグを付与した他ユーザからの影響度は負（−）であるものとみなす。また、オススメタグが付与された情報についての閲覧率が変化しなかった場合には、ユーザへのオススメタグを付与した他ユーザからの影響度はないものとみなすことができる。 When the user browses the information, if the browsing rate increases for the content with the recommended tag, the degree of influence from other users to whom the user's recommended tag is added is considered to be positive (+). In addition, when the browsing rate for information to which a recommendation tag is assigned decreases, the degree of influence from other users to whom the user's recommendation tag is assigned is regarded as negative (−). Moreover, when the browsing rate about the information with which the recommendation tag was provided did not change, it can be considered that there is no influence from the other user who provided the recommendation tag to a user.

以下、図８の例における影響度分析処理について、各実施例にしたがって説明する。
（第１実施例）
図９は、第１実施例に係る影響度分析処理を示す図である。影響度の特徴スコア算出部２３０は、上記Ｓ，ａ，Ｍ，ｍを分析パラメータとして利用して、ＡＡさんへのＡさんからの影響度の特徴スコアＺ´を算出する。影響度の特徴スコアＺ´は、上記図５に示す算出式において、Ｎ，ｎをＭ，ｍに置き換えることで算出することができる。 Hereinafter, the influence analysis process in the example of FIG. 8 will be described according to each embodiment.
(First embodiment)
FIG. 9 is a diagram illustrating an influence analysis process according to the first embodiment. The influence characteristic score calculation unit 230 uses the above S, a, M, and m as analysis parameters to calculate an influence characteristic score Z ′ from Mr. A to Mr. AA. The characteristic score Z ′ of the influence degree can be calculated by replacing N and n with M and m in the calculation formula shown in FIG.

影響度の特徴スコアＺ´を求めるために、まず、累積確率Ｈ１、Ｈ２を求める。Ｈ１は、Ｓ個の一覧コンテンツのうち「Ａさんオススメ」が出現する数がＭのとき、詳細閲覧コンテンツをａ個ランダムに選択して閲覧した場合に、「Ａさんオススメ」が出現する詳細閲覧コンテンツがｍ個以上となる累積確率である。また、Ｈ２は、Ｓ個の一覧閲覧コンテンツのうち「Ａさんオススメ」が出現する数がＭのとき、詳細閲覧コンテンツをａ個ランダムに選択して閲覧した場合に、「Ａさんオススメ」が出現する詳細閲覧コンテンツがｍ個以下となる累積確率である。そして、Ｈ１、Ｈ２のうち０．５未満の方を採用し、標準正規分布の累積分布関数の逆関数により特徴スコアＺ´を求める。なお、累積確率としてＨ２を利用する場合には、標準正規分布の累積分布関数の逆関数の返値の符号を負にして特徴スコアとする。Ｈ１、Ｈ２のいずれも０．５以上だった場合には、特徴スコアは０とする。 In order to obtain the influence characteristic score Z ′, first, cumulative probabilities H1 and H2 are obtained. In H1, when the number of occurrences of “Recommended by Mr. A” among the S list contents is M, the detailed browsing in which “Recommended by Mr. A” appears when a detailed browsing content is randomly selected and viewed. This is the cumulative probability that the content is m or more. In addition, when the number of occurrences of “Recommended by Mr. A” out of the S list browsing contents is M, “Recommended by Mr. A” appears when a detailed browsing content is randomly selected and viewed. This is the cumulative probability that the number of detailed browsing contents to be m or less. Then, one of H1 and H2 that is less than 0.5 is adopted, and the feature score Z ′ is obtained by the inverse function of the cumulative distribution function of the standard normal distribution. When H2 is used as the cumulative probability, the sign of the return value of the inverse function of the standard normal distribution cumulative distribution function is set to be a negative characteristic score. If both H1 and H2 are 0.5 or more, the feature score is 0.

上記の場合（Ｓ＝５０、a＝１０、M=１０、ｍ＝４）、Ｈ１＝０．０９６６、Ｈ２＝０．９８１であり、特徴スコアはＺ´=１．３となる。
以上より「Ａさんオススメ」に対する影響度の特徴スコアが算出される。算出された特徴スコアを、各人・グループに対する影響度スコアを持つ影響度概念体系データベース２５０に渡す。各人・グループに対する影響度スコアを持つ影響度概念体系データベース２５０には、興味推定と同様の概念体系を構成する各概念に影響度スコアを持たせておく。 In the above case (S = 50, a = 10, M = 10, m = 4), H1 = 0.0966, H2 = 0.981, and the feature score is Z ′ = 1.3.
From the above, the characteristic score of the degree of influence on “Recommended by Mr. A” is calculated. The calculated feature score is passed to an influence concept system database 250 having an influence score for each person / group. In the influence degree concept system database 250 having an influence degree score for each person / group, an influence degree score is given to each concept constituting the same concept system as the interest estimation.

そして、第１実施例の場合、影響度について、各概念について同等とみなすため、Ａさんの影響度を示す概念体系のルート概念に影響度の特徴スコアを記憶させる。また、一覧コンテンツリストが更新され、ユーザの閲覧履歴も更新された場合には再度、影響度の特徴スコアを算出し、影響度スコアを更新させる。更新には、下記式（１）の影響度スコア更新式を用いる。第１実施例の場合、概念iはルート概念である。 In the case of the first embodiment, the influence score is stored in the root concept of the concept system indicating the influence degree of Mr. A in order to regard the influence degree as equivalent for each concept. When the list content list is updated and the user's browsing history is also updated, the influence feature score is calculated again, and the influence score is updated. For the update, the influence score update formula of the following formula (1) is used. In the case of the first embodiment, the concept i is a root concept.

なお、式（１）において、重みｗは、初期値ｗ＝１とし、コンテンツに関するユーザの操作履歴（お気に入りコンテンツとしての登録や、コンテンツへの評価入力等）、コンテンツの閲覧時間及び閲覧状況、並びにコンテンツとオススメタグとの関連度の少なくとも１つに応じて変化させることができる。ユーザ影響度スコアの算出に際し、閲覧時のユーザの状況や閲覧操作の特徴（お気に入り登録、長時間閲覧等）などを反映することで、ユーザの行動や状況に基づいてユーザ影響度スコアをさらに的確に求めることが可能となる。

In equation (1), the weight w is an initial value w = 1, the user's operation history regarding the content (registration as a favorite content, evaluation input to the content, etc.), content browsing time and browsing status, and It can be changed according to at least one of the relevance levels between the content and the recommended tag. In calculating the user influence score, the user influence score is more accurately based on the user's behavior and situation by reflecting the user's situation at the time of browsing and the characteristics of browsing operation (favorite registration, long-time browsing, etc.). It becomes possible to ask for.

また、ルート概念のスコアを子孫にあたる下位概念についても同値で与える。「Ａさんオススメ」に関する影響度のスコア更新が終了した後に、同一の一覧閲覧コンテンツに含まれる、Ａさん以外の「オススメタグ」を付与した人についても、上記と同様の方法で影響度に関する特徴スコアを算出し、上記式（１）を用いて影響度スコアの更新を行う。 Also, the root concept score is given to the subordinate concepts that are descendants with the same value. After the update of the impact score for “Recommended by Mr. A” is completed, the characteristics related to the impact level are also applied in the same way as above for those who have been given a “recommended tag” other than Mr. A included in the same list browsing content. The score is calculated, and the influence score is updated using the above formula (1).

すなわち、第１実施例によれば、ユーザの選択候補となる一覧閲覧コンテンツリストにおける各「オススメタグ」が出現するコンテンツ数（Ｍ）と、一覧閲覧コンテンツリストからユーザが選択した詳細閲覧コンテンツリストにおける各「オススメタグ」が出現するコンテンツ数（ｍ）とを算出し、このＭおよびｍを比較分析することで、各“「オススメタグ」の出現の希少性を考慮し、且つ選択候補のコンテンツに付与されている「オススメタグ」が、比較的に「選ばれる」という事象だけでなく、比較的に「選ばれない」という事象の出現数の特徴を活用することができるため、ユーザの他者からの影響度を高精度に推定することが可能となる。 That is, according to the first embodiment, the number of contents (M) in which each “recommended tag” appears in the list browsing content list that is a user selection candidate and the detailed browsing content list selected by the user from the list browsing content list. The number of contents (m) in which each “recommended tag” appears is calculated, and this M and m are compared and analyzed. Since the "recommended tag" that is given can utilize the characteristics of the number of occurrences of events that are relatively "not selected" as well as events that are relatively "selected", It is possible to estimate the degree of influence from a high accuracy.

またＨ１およびＨ２を超幾何分布により求めることで、ユーザが「オススメタグ」の付与されたコンテンツを偶然と比べて比較的“選ぶ”という特徴だけでなく、ユーザが「オススメタグ」が付与されたコンテンツを偶然と比べて比較的“選ばない”という特徴を活用して、精度良くユーザの影響度スコアを求めることが可能となる。 In addition, by determining H1 and H2 by hypergeometric distribution, the user is given not only the feature of “selecting” content with “recommended tag”, but also “recommended tag” by the user. It is possible to obtain the user's influence score with high accuracy by utilizing the feature that the content is relatively “not selected” compared to chance.

（第２実施例）
第２実施例では、ＡＡさんへのＡさんからの特定の概念（又は特定のジャンル）毎の影響度を推定する手法を説明する。図１０は、第２実施例に係る影響度分析処理を示す図である。影響度の特徴スコア算出部２３０は、分析パラメータＳ，ａ，Ｐ，ｐを利用してＡさんに対する影響度の特徴スコアＺ´を算出する。この場合の影響度の特徴スコアＺ´は、上記図５に示す算出式において、Ｎ，ｎをＰ，ｐに置き換えることで算出することができる。 (Second embodiment)
In the second embodiment, a method for estimating the degree of influence for each specific concept (or specific genre) from Mr. A to Mr. AA will be described. FIG. 10 is a diagram illustrating an influence analysis process according to the second embodiment. The influence degree feature score calculation unit 230 calculates the influence degree feature score Z ′ for Mr. A using the analysis parameters S, a, P, and p. The characteristic score Z ′ of the influence degree in this case can be calculated by replacing N and n with P and p in the calculation formula shown in FIG.

第２実施例の場合、Ｈ１は、一覧閲覧コンテンツＳ個のうち「Ａさんオススメ」かつ「野球」が同時についたコンテンツが出現する数がＰのとき、詳細閲覧コンテンツをａ個ランダムに選択して閲覧した場合に、「Ａさんオススメ」かつ「野球」が同時に出現する詳細閲覧コンテンツがｐ個以上となる累積確率である。また、Ｈ２は、一覧閲覧コンテンツのうち「Ａさんオススメ」かつ「野球」が同時についたコンテンツが出現する数がＰのとき、詳細閲覧コンテンツをａ個ランダムに選択して閲覧した場合に、「Ａさんオススメ」かつ「野球」が同時に出現する詳細閲覧コンテンツがｐ個以下となる累積確率である。そして、Ｈ１、Ｈ２のうち０．５未満の方を採用し、標準正規分布の累積分布関数の逆関数により特徴スコアＺ´を求める。なお、累積確率としてＨ２を利用する場合には、標準正規分布の累積分布関数の逆関数の返値の符号を負にして特徴スコアとする。Ｈ１、Ｈ２のいずれも０．５以上だった場合には、特徴スコアは０とする。
以上より、「Ａさんの野球に関するオススメ」に対する影響度の特徴スコアが算出される。 In the case of the second embodiment, H1 selects a detailed browsing content at random when P is the number of content that has “S recommendation A” and “baseball” at the same time among the S browsing content S. The cumulative probability that the number of detailed browsing contents in which “Recommended by Mr. A” and “Baseball” appear at the same time will be p or more. In addition, when the number of contents where “A-san's recommendation” and “baseball” are added at the same time appears among the list browsing contents is H2, when “a” detailed browsing contents are randomly selected and viewed, This is the cumulative probability that the number of detailed browsing contents in which “Mr. A's recommendation” and “baseball” appear simultaneously will be p or less. Then, one of H1 and H2 that is less than 0.5 is adopted, and the feature score Z ′ is obtained by the inverse function of the cumulative distribution function of the standard normal distribution. When H2 is used as the cumulative probability, the sign of the return value of the inverse function of the standard normal distribution cumulative distribution function is set to be a negative characteristic score. If both H1 and H2 are 0.5 or more, the feature score is 0.
From the above, the characteristic score of the degree of influence on “Mr. A's recommendation regarding baseball” is calculated.

加えて、「野球」の上位概念にあたる、例えば「スポーツ」のような特定のジャンルについて、「Ａさんオススメ」と「スポーツ」が同時に出現する数（＝ｑ）を用いて、詳細閲覧コンテンツをａ個ランダムに選択して閲覧した場合に、「Ａさんオススメ」かつ「スポーツ」が同時に出現する詳細閲覧コンテンツがｑ個以上となる累積確率（Ｈ１）と、詳細閲覧コンテンツをａ個ランダムに選択して閲覧した場合に、「Ａさんオススメ」かつ「スポーツ」が同時に出現する詳細閲覧コンテンツがｑ個以上となる累積確率（Ｈ２）を求める。そして、上記と同様に、「Ａさんのスポーツに関するオススメ」に対する影響度の特徴スコアを求める。 In addition, for a specific genre such as “sports”, which is a superordinate concept of “baseball”, the detailed browsing content is a using the number (= q) where “Mr. A's recommendation” and “sports” appear simultaneously. When browsing randomly selected items, select the cumulative probability (H1) that the number of detailed browsing contents that “Recommended by Mr. A” and “Sports” appear simultaneously is q or more, and a detailed browsing content. The cumulative probability (H2) that the number of detailed browsing contents in which “A-san's recommendation” and “sports” appear at the same time becomes q or more is obtained. Then, in the same manner as described above, the characteristic score of the degree of influence on “Mr. A's recommendation regarding sports” is obtained.

同様に、「スポーツ」の下位概念にあたる、例えば「サッカー」についても、「Ａさんオススメ」と「スポーツ」が同時に出現する数（＝ｑ）を用いて、「Ａさんのサッカーに関するオススメ」に対する影響度の特徴スコアも求める。
「Ａさんオススメ」に関する影響度のスコア更新が終了した後に、同一の一覧閲覧コンテンツに含まれる、Ａさん以外の「オススメタグ」を付与した人についても、上記と同様の方法で影響度に関する特徴スコアを算出し、上記式（１）を用いて影響度スコアの更新を行う。 Similarly, for “soccer”, which is a subordinate concept of “sports”, for example, using the number (= q) of “Mr. A's recommendation” and “Sports” appearing simultaneously, the impact on “Mr. A ’s recommendation about soccer” Also find the degree feature score.
After the update of the impact score for “Recommended by Mr. A” is completed, the characteristics related to the impact level are also applied in the same way as above for those who have been given a “recommended tag” other than Mr. A included in the same list browsing content. The score is calculated, and the influence score is updated using the above formula (1).

第２実施例によれば、「オススメタグ」が付与されたコンテンツについて「オススメタグ」が付与されていると同時に概念が付与されていることを利用し、影響度について、「オススメタグ」を付与した他ユーザに対しての各概念についての影響度の推定が可能である。すなわち、人に対する影響について１つのスコアで表現するだけでなく、ジャンルによる影響度の違い（例えば、ＡＡさんは、Ａさんに対してジャンルＸについて影響を受けるが、ジャンルYについて影響を受けない。）などを把握することも可能である。 According to the second embodiment, “recommended tag” is added to the content with “recommended tag”, and the “recommended tag” is added to the content using the concept at the same time. The degree of influence of each concept on other users can be estimated. In other words, the influence on the person is not only expressed by one score, but also the difference in the degree of influence depending on the genre (for example, Mr. AA is affected by the genre X but not the genre Y. ) Etc. can also be grasped.

（第３実施例）
図１１は、第３実施例に係る影響度分析処理を示す図である。第３実施例では、ＡＡさんへのグループＦからの影響度を推定する手法を説明する。つまり、お薦め情報を付与した他ユーザらがグループ属性や組織体系を有し、これらの関係について階層的な体系を作成できる場合（以後、前述の人から成る体系を、人間体系という）に、一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストのコンテンツに出現するオススメタグを付与した他ユーザの上位のグループ属性を人間体系から抽出し、上位のグループ属性が当該コンテンツに出現するものとみなして、上位のグループ属性に対するユーザの影響度スコアを更新することをさらに特徴とするものである。なお、上記人間体系の関係情報は、多数の前記他ユーザあるいはグループ属性が相互に関連を持つことでグラフ構造を持つことができる。グラフ構造を持つ場合には、各前記他者あるいはグループ属性の直接あるいは間接の上位属性を「親グループ」、直接あるいは間接の下位属性を「子グループ」と呼ぶ（個人の場合も所属するメンバーが一人のグループとみなす）。 (Third embodiment)
FIG. 11 is a diagram illustrating an influence analysis process according to the third embodiment. In the third example, a method for estimating the degree of influence from group F on Mr. AA will be described. In other words, when other users who have given recommendation information have group attributes and organizational systems and can create a hierarchical system for these relationships (hereinafter, the system consisting of people is referred to as a human system) The upper group attributes of other users who have been given recommended tags appearing in the contents of the browsing content list and the detailed browsing content list are extracted from the human system, and the higher group attributes are regarded as appearing in the content. It is further characterized in that the user's influence score for the attribute is updated. Note that the human system relationship information can have a graph structure when a number of other users or group attributes are related to each other. In the case of having a graph structure, the direct or indirect superior attribute of each other or group attribute is referred to as a “parent group”, and the direct or indirect superior attribute is referred to as a “child group” Considered a group).

人間体系データベース２２０は、一覧閲覧コンテンツリストに含まれるオススメタグ付与者の一覧（本例の場合、「Ａさん」、「Ｃさん」、「Ｄさん」及び「Ｅさん」）を、履歴情報受信部２１０から受け取って格納する。履歴情報受信部２１０は、人間体系データベース２２０から、オススメタグ付与者の上位の人間体系（集団あるいは属性など）を読み出す（本例の場合、「グループＦ」、「グループＧ」及び「グループＨ」）。履歴情報受信部２１０は、人間体系データベース２２０から読み出した親グループを加えて、影響度の特徴スコア算出部２３０にオススメタグ情報を渡す。 The human system database 220 receives a list of recommended tag granters included in the list browsing content list (in this example, “A-san”, “C-san”, “D-san”, and “E-san”) and received history information. Received from the unit 210 and stored. The history information receiving unit 210 reads the human system (group or attribute, etc.) higher than the recommended tag grantor from the human system database 220 (in this example, “group F”, “group G”, and “group H”). ). The history information receiving unit 210 adds the parent group read from the human system database 220 and passes the recommended tag information to the influence characteristic score calculating unit 230.

影響度の特徴スコア算出部２３０は、履歴情報受信部２１０からグループの情報が加えられたオススメタグ情報を受け取る。影響度の特徴スコア算出部２３０は、上記第１実施例と同様の方法で、「Ａさんオススメ」、「Ｃさんオススメ」、「Ｄさんオススメ」及び「Ｅさんオススメ」に関する影響度の特徴スコアを更新する。 The influence characteristic score calculator 230 receives the recommended tag information to which the group information is added from the history information receiver 210. The influence characteristic score calculation unit 230 is the same method as in the first embodiment, and the influence characteristic score for “Recommended by Mr. A”, “Recommended by Mr. C”, “Recommended by Mr. D”, and “Recommended by Mr. E”. Update.

さらに、親グループについての影響度スコアを求める。具体例として、「グループＦ」の影響度スコアを求める方法を示す。まず、「グループＦ」に所属する個人「Ａさん」、「Ｂさん」及び「Ｃさん」のオススメタグを含むコンテンツ（本例の場合、「Ａさんオススメ」「Ｃさんオススメ」の２個）を「グループＦオススメ」とみなし、「グループＦ」に所属する個人のオススメタグを含むコンテンツの合計数を用いて、前述の方法と同様に影響度の特徴スコアを求め、上記式（１）を用いて影響度スコアの更新を行う。 Further, an influence score for the parent group is obtained. As a specific example, a method of obtaining the influence score of “Group F” is shown. First, contents including recommended tags of individuals “Mr. A”, “Mr. B”, and “Mr. C” belonging to “Group F” (in this example, “Mr. A” and “Mr. C” are recommended) Is regarded as “Group F recommendation”, and the characteristic score of the influence degree is obtained in the same manner as described above by using the total number of contents including recommended tags of individuals belonging to “Group F”, and the above formula (1) is obtained. Use to update the impact score.

一覧閲覧コンテンツ（Ｓ個）のうち「グループＦオススメ」が出現する数がＲ個の場合に、詳細閲覧コンテンツ（ａ個）を選択したときの「グループＦオススメ」が出現する詳細閲覧コンテンツがｒ個であるものとする。このとき、影響度の特徴スコア算出部２３０は、分析パラメータＳ，ａ，Ｒ，ｒを利用して「グループＦ」に対する影響度の特徴スコアＺ´を算出する。この場合の影響度の特徴スコアＺ´は、上記図５に示す算出式において、Ｎ，ｎをＲ，ｒに置き換えることで算出することができる。 When the number of “Group F recommendation” that appears in the list browsing content (S) is R, the detailed browsing content in which “Group F recommendation” appears when the detailed browsing content (a) is selected is r. It shall be a piece. At this time, the influence characteristic score calculator 230 calculates the influence characteristic score Z ′ for “group F” using the analysis parameters S, a, R, and r. The characteristic score Z ′ of the influence degree in this case can be calculated by replacing N and n with R and r in the calculation formula shown in FIG.

さらに、学習対象となった親グループの下位に属する子グループ（「グループＦ」では、「Ａさん」、「Ｂさん」、「Ｃさん」）について、一覧閲覧コンテンツリストに含まれない子グループについて（本例の「グループＦ」では「Ｂさん」）については、「グループＦ」の特徴スコアを用いて、影響度スコアの更新を行う。 In addition, regarding child groups that belong to the lower level of the parent group to be learned (in “Group F”, “Mr. A”, “Mr. B”, “Mr. C”), the child groups that are not included in the list browsing content list (For “Group F” in this example, “Mr. B”), the influence score is updated using the characteristic score of “Group F”.

更新された影響度スコアは各人・グループに対する影響度スコアを持つ影響度概念体系データベース２５０内の各人、グループの影響度概念体系のルート概念に記憶させる。そして、ルート概念のスコアを子孫にあたる下位概念についても同値で与える。
（第４実施例）
図１２は、第４実施例に係る影響度分析処理を示す図である。人間関係データベース２２０は、上記第３実施例と同様のものとする。影響度の特徴スコア算出部２３０は人間関係データベース２２０から受け取ったオススメタグについて、上記第２実施例と同様の方法を用いて、オススメタグを付与した人（またはグループ）ついての、ジャンル毎の影響度スコアを更新する。 The updated impact score is stored in the root concept of the impact concept system of each person and group in the impact concept system database 250 having the impact score for each person / group. And the score of the root concept is given to the subordinate concepts corresponding to the descendants with the same value.
(Fourth embodiment)
FIG. 12 is a diagram illustrating an influence analysis process according to the fourth embodiment. The human relation database 220 is the same as that in the third embodiment. The feature score calculation unit 230 of the influence degree uses the same method as in the second embodiment for the recommended tag received from the human relationship database 220, and affects the genre for the person (or group) to which the recommended tag is assigned. Update degree score.

［コンテンツスコアリング装置］
コンテンツスコアリング装置３００は、ソート済みコンテンツスコアリスト送信部３１０、コンテンツ評価処理部３２０、コンテンツデータベース３３０、及び提示コンテンツリスト・オススメタグ受信部３４０を備える。図１３は、コンテンツスコアリング処理を示す図である。 [Content scoring device]
The content scoring device 300 includes a sorted content score list transmission unit 310, a content evaluation processing unit 320, a content database 330, and a presentation content list / recommended tag reception unit 340. FIG. 13 is a diagram showing content scoring processing.

コンテンツスコアリング装置３００は、提示コンテンツリスト送信部５３０から、提示対象のコンテンツを受信し、コンテンツをスコアリングし、スコアを元にコンテンツをソートし、ソート済み提示コンテンツリスト受信部５２０に、ソート済みデータを渡す。 The content scoring device 300 receives the content to be presented from the presented content list transmission unit 530, scores the content, sorts the content based on the score, and sorts the sorted content into the sorted presented content list receiving unit 520. Pass data.

具体的には、提示コンテンツリスト・オススメタグ受信部３４０が、提示コンテンツリスト送信部５３０から、提示対象コンテンツと、提示対象コンテンツの内容を示す概念体系と、提示対象コンテンツにユーザから付与された「オススメタグ」を受信し、コンテンツデータベース３３０に格納する。次に、コンテンツ評価処理部３２０は、各提示対象コンテンツ（例えば、評価コンテンツ（評価コンテンツ１には「概念Ｈ」、「概念Ｅ」、「Ａさんオススメ」、「Ｂさんオススメ」のタグが付いている。）について、コンテンツのオススメタグを付与した人からの影響度概念体系（「Ａさん」、「Ｂさん」の影響度概念体系）のうち、「概念Ｈ」スコアを用いて、下記式（２）により評価用影響度スコアを求める。「概念Ｅ」についても同様に行う。 Specifically, the presentation content list / recommended tag receiving unit 340 receives from the presentation content list transmission unit 530 a presentation target content, a conceptual system indicating the content of the presentation target content, and “ The “recommended tag” is received and stored in the content database 330. Next, the content evaluation processing unit 320 tags each presentation target content (for example, evaluation content (evaluation content 1 has “concept H”, “concept E”, “recommended A”), and “recommended B”). )), The following formula using the “concept H” score from the influence concept system (“Mr. A” and “Mr. B” influence concept system) from the person who gave the recommended tag of the content The evaluation impact score is obtained from (2), and the same applies to “concept E”.

その後、興味スコアを持つ概念体系のうちの「概念Ｈ」、「概念Ｅ」の持つスコアを用いて、下記式（３）により評価用興味スコアを求める。

そして、前述した評価用影響度スコアと評価用興味スコアを用いて、下記式（４）により評価用影響度スコアと評価用興味スコアを重ね合わせ、評価コンテンツ１の評価スコアを求める。

同様にして、すべての提示対象コンテンツについて、評価スコアを求め、評価スコアを元にコンテンツをソートする。ソート済のコンテンツを、ソート済コンテンツスコアリスト送信部３１０に渡す。ソート済コンテンツスコアリスト送信部３１０は、ソート済み提示コンテンツリスト受信部５２０にソート済コンテンツスコアリストを送信する。 Thereafter, an evaluation interest score is obtained by the following equation (3) using the scores of “concept H” and “concept E” in the concept system having the interest score.

Then, using the evaluation impact score and the evaluation interest score described above, the evaluation impact score and the evaluation interest score are superimposed by the following formula (4) to obtain the evaluation score of the evaluation content 1.

Similarly, evaluation scores are obtained for all the presentation target contents, and the contents are sorted based on the evaluation scores. The sorted content is passed to the sorted content score list transmission unit 310. The sorted content score list transmission unit 310 transmits the sorted content score list to the sorted presentation content list reception unit 520.

ソート済みコンテンツリスト受信部５２０は、ソート済みコンテンツリストを元に、コンテンツ送信処理部５１０にソートされたコンテンツを送り、クライアント端末４００−１のコンテンツ提示部４３０に送信する。
以上述べたように、上記実施形態では、ユーザ自身の興味と当該ユーザへの他ユーザからの影響を考慮することで、コンテンツを閲覧する際のユーザの他者からの影響度を高精度に推定することができる。これにより、ユーザに対して実際の社会生活や選択行動に近しい情報推薦サービスが実現可能となる。 The sorted content list receiving unit 520 sends the sorted content to the content transmission processing unit 510 based on the sorted content list, and transmits it to the content presentation unit 430 of the client terminal 400-1.
As described above, in the above embodiment, the degree of influence from the other user when browsing the content is estimated with high accuracy by considering the user's own interest and the influence of the user on the user. can do. This makes it possible to realize an information recommendation service that is close to the actual social life and selection behavior for the user.

なお、この発明は、上記実施形態そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、上記実施形態に開示されている複数の構成要素の適宜な組み合せにより種々の発明を形成できる。例えば、実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。さらに、異なる実施形態に亘る構成要素を適宜組み合せてもよい。 Note that the present invention is not limited to the above-described embodiment as it is, and can be embodied by modifying the constituent elements without departing from the scope of the invention in the implementation stage. Further, various inventions can be formed by appropriately combining a plurality of constituent elements disclosed in the embodiment. For example, some components may be deleted from all the components shown in the embodiment. Furthermore, you may combine suitably the component covering different embodiment.

１００…興味分析装置、２００…影響度分析装置、３００…コンテンツスコアリング装置、４００…クライアント端末、５００…コンテンツサーバ、１１０…履歴情報受信部、１２０…興味の特徴スコア算出部、１３０…興味概念体系更新処理部、１４０…概念体系／ユーザ興味スコアデータベース、１５０…提示コンテンツリスト受信部、１６０…コンテンツデータベース、１７０…コンテンツ評価処理部、１８０…ソート済みコンテンツスコアリスト送信部、２１０…履歴情報受信部、２２０…人間関係データベース、２３０…影響度の特徴スコア算出部、２４０…各人・グループに対する影響度の概念体系更新処理部、２５０…各人・グループに対する影響度スコアを持つ影響度概念体系データベース、３１０…ソート済みコンテンツスコアリスト送信部、３２０…コンテンツ評価処理部、３３０…コンテンツデータベース、３４０…提示コンテンツリスト・オススメタグ受信部、４１０…コンテンツ提示部、４２０…履歴収集部、４３０…履歴情報転送部、４４０…コンテンツ要求送信部、４５０…オススメタグ付与部、５１０…コンテンツ送信部、５２０…ソート済みコンテンツリスト受信部、５３０…履歴情報転送部、５４０…提示コンテンツリスト入力部、５５０…提示コンテンツリスト送信部、５６０…コンテンツ要求転送部、５７０…オススメタグ入力部、５８０…オススメタグ送信部。 DESCRIPTION OF SYMBOLS 100 ... Interest analysis apparatus, 200 ... Influence degree analysis apparatus, 300 ... Content scoring apparatus, 400 ... Client terminal, 500 ... Content server, 110 ... History information receiving part, 120 ... Interest characteristic score calculation part, 130 ... Interest concept System update processing unit, 140 ... Concept system / user interest score database, 150 ... Presentation content list reception unit, 160 ... Content database, 170 ... Content evaluation processing unit, 180 ... Sorted content score list transmission unit, 210 ... History information reception , 220 ... Human relation database, 230 ... Influence score characteristic score calculation part, 240 ... Concept system update processing part of influence level for each person / group, 250 ... Influence concept system having an influence score for each person / group Database, 310 ... sorted content Core list transmission unit 320 ... Content evaluation processing unit 330 ... Content database 340 ... Present content list / recommended tag reception unit 410 ... Content presentation unit 420 ... History collection unit 430 ... History information transfer unit 440 ... Content Request transmission unit, 450 ... recommended tag addition unit, 510 ... content transmission unit, 520 ... sorted content list reception unit, 530 ... history information transfer unit, 540 ... presentation content list input unit, 550 ... presentation content list transmission unit, 560 ... content request transfer unit, 570 ... recommended tag input unit, 580 ... recommended tag transmission unit.

Claims

A method of analyzing the degree of influence on a user by using a concept system that systematizes an influence score that a user receives from another user for a plurality of concepts,
Clustering a first content list in which the user has browsed a plurality of contents as a list and a second content list in which a content body is browsed from the first content list;
For each cluster, the total number of contents in the first content list is the first total number, and the number of appearances of content to which recommendation information is given by the other user in the first content list is the first number of appearances. When the total number of contents in the second content list is the second total number and the number of appearances of the content to which the recommendation information is added in the second content list is the second appearance number, Under the conditions of the total number of 1, the first number of appearances, and the second total number, the number of appearances of the content to which the recommendation information is added to the second content list is equal to or greater than the second number of appearances. comprising first probability and to calculate a second probability of the second number of occurrences below, the original standard normal distribution either one of the first probability and the second probability A calculation step for calculating a feature score by inverse function of the product distribution function,
And an update step of updating the influence score for the concept using a feature score calculated for each cluster.

In the calculating step, the recommendation information is given in the first content list and the number of contents in which a predetermined concept appears is the first appearance number, and the recommendation information is given in the second content list. The influence analysis method according to claim 1, wherein the number of contents in which the predetermined concept appears is set as the second appearance number.

In the calculating step, the number of appearances of the content to which the recommendation information is given by a predetermined set to which the other user belongs in the first content list is defined as the first appearance number, and the number of appearances in the second content list The impact analysis method according to claim 1, wherein the number of appearances of content to which the recommendation information is given by a predetermined set is set as the second appearance number.

For each cluster, the total number of contents in the first content list is the first total number, the number of contents in which the concept appears in the first content list is the first appearance number, and the second content When the total number of contents in the list is the second total number and the number of contents in which the concept appears in the second content list is the second appearance number, the first total number, the first appearance number, And under the condition of the second total number, a first probability that the number of contents in which the concept appears in the second content list is greater than or equal to the second occurrence number and less than or equal to the second occurrence number. and step second calculating a probability, for calculating a feature score by inverse function of the cumulative distribution function of the original standard normal distribution either one of said first probability and said second probability that,
Updating the user interest score for the concept using a feature score calculated for each cluster;
The content in which the concept appears further includes an evaluation step of calculating an evaluation score for the content using the user interest score and the influence score of each concept that appears in the content. 4. The influence analysis method according to any one of 1 to 3.

An apparatus for analyzing the degree of influence on a user using a concept system that systematizes an influence score that a user receives from other users for a plurality of concepts,
Means for clustering a first content list in which the user has browsed a plurality of contents as a list and a second content list in which a content body is browsed from the first content list;
For each cluster, the total number of contents in the first content list is the first total number, and the number of contents in which the concept to which recommendation information is given by the other user appears in the first content list is the first. The total number of contents in the second content list is the second total number, and the number of contents in which the concept to which the recommendation information is added in the second content list appears is the second number of appearances When the first total number, the first number of appearances, and the second total number are satisfied, the number of appearances of the content to which the recommendation information is added to the second content list is the first number. calculating a first probability and a second probability of the second number of occurrences below the second occurrence number or more, either one of the first probability and the second probability A calculating means for calculating a feature score by inverse function of the cumulative distribution function of the standard normal distribution bets,
An influence analyzing apparatus, comprising: an updating unit that updates the influence score for the concept using a feature score calculated for each cluster.

6. An influence analysis program for causing a computer to function as each means constituting the influence analysis apparatus according to claim 5.