JP2013196653A

JP2013196653A - Interest analysis method, device and program

Info

Publication number: JP2013196653A
Application number: JP2012066377A
Authority: JP
Inventors: Koji Ito; 浩二伊藤; Masanari Fujita; 将成藤田; Tae Sato; 妙佐藤
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2012-03-22
Filing date: 2012-03-22
Publication date: 2013-09-30
Anticipated expiration: 2032-03-22
Also published as: JP5668007B2

Abstract

PROBLEM TO BE SOLVED: To enable a user to obtain a relatively good recommendation result even in the case of using an information recommendation service for the first time.SOLUTION: A model of each attribute of the existing information user is created and stored on the basis of information representing a content selection history by the information user. Then, attribute information of a new information user is acquired, a model of an attribute of the existing information user coincident with an attribute of the new information user is selected from in the stored model group on the basis of the acquired attribute information, a model expressing the degree of interest in a prescribed concept of the new information user is created on the basis of the selected model, and the interest of the new information user is analyzed on the basis of the created model.

Description

この発明は、コンテンツ閲覧履歴等からユーザの興味を分析する興味分析方法、装置及びプログラムに関する。 The present invention relates to an interest analysis method, apparatus, and program for analyzing a user's interest from a content browsing history or the like.

ユーザの行動や状況に合わせて適切なサービス・コンテンツをレコメンドする技術が望まれている。そこで、情報利用者毎に適する情報を選択し、あるいは情報利用者毎に適する順番に並び替えて情報を表示する、情報推薦サービスが提案されている。例えば、情報が広告である場合は、情報利用者毎に適する広告を選択し、あるいは情報利用者毎に適する順番に並び替えて広告を表示する広告推薦サービスであり、情報がニュースである場合は、情報利用者毎に適するニュースを選択し、あるいは情報利用者毎に適する順番に並び替えてニュースを表示するニュース推薦サービスであり、情報が飲食店情報である場合は、情報利用者毎に適する飲食店情報を選択し、あるいは情報利用者毎に適する順番に並び替えて飲食店情報を表示する飲食店情報推薦サービスである。 There is a demand for a technique for recommending appropriate services and contents in accordance with user behavior and situations. In view of this, an information recommendation service has been proposed in which information suitable for each information user is selected or information is rearranged in an order suitable for each information user and displayed. For example, when the information is an advertisement, it is an advertisement recommendation service that displays an advertisement by selecting an advertisement suitable for each information user or rearranging in an order suitable for each information user, and when the information is news It is a news recommendation service that selects news that is appropriate for each information user, or displays the news in an order that is appropriate for each information user. If the information is restaurant information, it is appropriate for each information user. It is a restaurant information recommendation service that displays restaurant information by selecting restaurant information or rearranging in order suitable for each information user.

上記情報推薦サービスを実現するための方法として、各コンテンツに内容をサマライズする概念タグ（メタデータ）が付与されていることを前提としてユーザ履歴において出現する概念等の頻度からユーザの興味を推定する方法があり、この方法は内容ベースフィルタリング手法（Content Based Filtering：CBF）で、特にメモリベース手法として研究が進められている。 As a method for realizing the above information recommendation service, the user's interest is estimated from the frequency of the concept or the like appearing in the user history on the premise that the concept tag (metadata) for summarizing the contents is assigned to each content. There is a method, and this method is a content-based filtering method (Content Based Filtering: CBF).

具体的に、内容ベースフィルタリング技術とは、例えば特定ブランド（ブランドを示す情報を概念タグとして保持）の商品を閲覧した場合に、同じブランドの商品（同じ概念タグを保持）を提示する。この場合の、メモリベース手法は、過去に閲覧した履歴から、特定ブランドを頻繁に閲覧していれば、特定ブランドの商品を提示することとなる。単純な方法では、閲覧履歴により多く出現した概念タグに関連する商品を提示することとなる。このような手法においてタクソノミ（オントロジ）を利用した方法としては、例えば、非特許文献１の「操作履歴を用いた多面的嗜好把握方法の提案」がある。 Specifically, the content-based filtering technique presents products of the same brand (holding the same concept tag) when browsing products of a specific brand (holding information indicating the brand as a concept tag). In this case, the memory-based method presents a product of a specific brand if the specific brand is frequently browsed from the history of browsing in the past. In a simple method, products related to concept tags that appear more frequently in the browsing history are presented. As a method using a taxonomy (ontology) in such a method, for example, there is “Proposal of a multifaceted preference grasping method using an operation history” in Non-Patent Document 1.

「操作履歴を用いた多面的嗜好把握方法の提案」，信学技報，vol. 110， no. 450， LOIS2010-76， pp. 67-72， 2011年3月"Proposal of multifaceted preference grasp method using operation history", IEICE Technical Report, vol. 110, no. 450, LOIS2010-76, pp. 67-72, March 2011

ところが、非特許文献１に記載された技術には以下のような解決すべき課題があった。すなわち、情報利用者が初めて情報推薦サービスを利用する場合、あるいは殆ど利用したことが無い場合には、ユーザ履歴が全くないか或いは非常に少ない。このため、モデルの作成に必要なユーザ履歴が不足して適切なモデルが作成できず、この結果適切な推薦が行われない。これは、Cold Start問題と呼ばれ、以下のような不具合を生じる。すなわち、情報推薦サービス利用開始時は、ユーザ履歴が少なく適切な推薦が行われないため、情報利用者の満足度が上がりにくい。そのため、情報推薦サービスが利用されず、ユーザ履歴が溜らない。その結果、ユーザ履歴が少ないため適切な推薦が行われないというように負の連鎖に陥り、その結果サービスの利用を停止する情報推薦サービス利用者が増え、情報推薦サービスが普及しない。 However, the technique described in Non-Patent Document 1 has the following problems to be solved. That is, when the information user uses the information recommendation service for the first time, or when the information user has rarely used it, there is no or very little user history. For this reason, the user history necessary for creating the model is insufficient and an appropriate model cannot be created, and as a result, appropriate recommendation is not performed. This is called a Cold Start problem and causes the following problems. That is, when using the information recommendation service, since the user history is small and appropriate recommendation is not performed, the satisfaction of the information user is hardly increased. Therefore, the information recommendation service is not used and the user history is not accumulated. As a result, since the user history is small, an appropriate recommendation is not performed, and a negative chain occurs. As a result, the number of information recommendation service users who stop using the service increases, and the information recommendation service does not spread.

この発明は上記事情に着目してなされたもので、その目的とするところは、ユーザが初めて情報推薦サービスを利用する場合であっても、比較的良い推薦結果を得ることができる興味分析方法、装置及びプログラムを提供することにある。 This invention was made paying attention to the above circumstances, and its purpose is an interest analysis method capable of obtaining a relatively good recommendation result even when the user uses an information recommendation service for the first time, To provide an apparatus and a program.

上記目的を達成するためにこの発明の１つの観点は、コンピュータによって、概念出現の希少性を表す情報を利用して情報利用者の興味を推定し、その推定結果を用いて情報利用者毎に所定の概念に対する興味の度合いを表すモデルを作成し、この作成されたモデルを用いて上記情報利用者毎の興味を分析する方法にあって、上記情報利用者によるコンテンツの選択履歴を表す情報をもとに、当該既存の情報利用者の属性毎のモデルを作成して第１の記憶部に保存する。そして、新規の情報利用者の属性情報を取得し、この取得された属性情報をもとに当該新規の情報利用者の属性と合致する既存の情報利用者の属性のモデルを上記第１の記憶部から選択的に読み出し、この読み出されたモデルをもとに上記新規の情報利用者の所定の概念に対する興味の度合いを表すモデルを作成して、これを初期モデルとして第２の記憶部に保存する。そして、上記第２の記憶部に記憶されたモデルを初期モデルとして上記新規の情報利用者の興味分析を開始するようにしたものである。
したがって、ユーザが初めて情報推薦サービスを利用する場合であっても、比較的良い推薦結果を得ることができる。 In order to achieve the above object, one aspect of the present invention is that a computer estimates information user's interest using information representing the scarcity of concept appearance and uses the estimation result for each information user. In a method for creating a model representing the degree of interest in a predetermined concept and analyzing the interest of each information user using the created model, information representing a content selection history by the information user is obtained. Originally, a model for each attribute of the existing information user is created and stored in the first storage unit. Then, the attribute information of the new information user is acquired, and the model of the attribute of the existing information user that matches the attribute of the new information user based on the acquired attribute information is stored in the first memory. A model representing the degree of interest in the predetermined concept of the new information user is created based on the read model and selectively stored in the second storage unit as an initial model. save. Then, the interest analysis of the new information user is started using the model stored in the second storage unit as an initial model.
Therefore, even if the user uses the information recommendation service for the first time, a relatively good recommendation result can be obtained.

また、この発明の１つの観点は以下のような態様を備えることを特徴とする。
第１の態様は、上記既存の情報利用者の属性毎のモデルを作成する際に、当該既存の情報利用者の属性が下位概念に対応する第１の階層とこの第１の階層より上位の概念に対応する第２の階層とを有する場合に、上記第１の階層に対応する下位概念のモデルを作成し、この第１の階層について作成された下位概念のモデルをもとに上記第２の階層に対応する上位概念のモデルを作成するようにしたものである。
このようにすると、属性が階層構造を持つ場合に、下位概念の属性のモデルを用いて上位概念の属性のモデルが作成されるので、少ない計算量で階層構造を持つ属性のモデルを作成することが可能となる。 One aspect of the present invention is characterized by comprising the following aspects.
In the first aspect, when the model for each attribute of the existing information user is created, the attribute of the existing information user has a first hierarchy corresponding to a lower concept and a higher hierarchy than the first hierarchy. A second-level model corresponding to the concept, a lower-level concept model corresponding to the first level is created, and the second-level model is created based on the lower-level concept model created for the first level. The model of the superordinate concept corresponding to the hierarchy of is created.
In this way, when the attribute has a hierarchical structure, a model of the higher-level concept attribute is created using the model of the lower-level concept attribute. Is possible.

第２の態様は、コンピュータの処理負荷が予め設定した値以下となる閾値又は時間帯を設定し、前記処理負荷が前記設定された閾値以下となったとき又は前記設定された時間帯になったときに上記属性毎のモデルを作成する処理を実行するようにしたものである。
このようにすると、既存の情報利用者の属性毎のモデルを作成する処理は、コンピュータの処理負荷が少ない深夜時間帯などを選んで実施されるので、コンピュータの処理負荷の増加を招かずに実施することができる。 In the second aspect, a threshold value or a time zone in which the processing load of the computer is equal to or less than a preset value is set, and when the processing load is equal to or less than the set threshold value, or the preset time zone has been reached In some cases, processing for creating a model for each attribute is executed.
In this way, the process of creating a model for each attribute of an existing information user is performed by selecting a midnight time zone where the processing load on the computer is low, etc., so it does not increase the processing load on the computer. can do.

すなわちこの発明によれば、ユーザが初めて情報推薦サービスを利用する場合であっても、比較的良い推薦結果を得ることができる興味分析方法、装置及びプログラムを提供することができる。 That is, according to the present invention, it is possible to provide an interest analysis method, apparatus, and program capable of obtaining a relatively good recommendation result even when a user uses an information recommendation service for the first time.

この発明の一実施形態に係る興味分析装置を用いたシステムの全体構成図。1 is an overall configuration diagram of a system using an interest analysis device according to an embodiment of the present invention. 図１に示したクライアント端末、コンテンツサーバ及び興味分析装置の機能構成を示すブロック図。The block diagram which shows the function structure of the client terminal shown in FIG. 1, a content server, and an interest analysis apparatus. 図２に示した興味分析装置の処理の概要を示す図。The figure which shows the outline | summary of a process of the interest analysis apparatus shown in FIG. 図２に示したクライアント端末上でのコンテンツ閲覧操作の一例を示す図。The figure which shows an example of content browsing operation on the client terminal shown in FIG. 一覧閲覧コンテンツリストのデータ構成例を示す図。The figure which shows the data structural example of a list browsing content list. 詳細閲覧コンテンツのデータ構成例を示す図。The figure which shows the data structural example of detailed browsing content. 提示コンテンツリストのデータ構成例を示す図。The figure which shows the data structural example of a presentation content list. コンテンツデータベースの一例を示す図。The figure which shows an example of a content database. 概念体系／ユーザ興味スコアデータベースの一例を示す図。The figure which shows an example of a concept system / user interest score database. 履歴情報受信部の処理フローを示す図。The figure which shows the processing flow of a log | history information receiving part. 特徴スコア算出部の処理フローを示す図。The figure which shows the processing flow of a characteristic score calculation part. 分析パラメータリストのデータ構成例を示す図。The figure which shows the data structural example of an analysis parameter list. 特徴スコア算出部の動作を説明するための模式図。The schematic diagram for demonstrating operation | movement of a characteristic score calculation part. 特徴スコア算出処理の詳細を示す図。The figure which shows the detail of a characteristic score calculation process. 概念体系更新処理部の処理フローを示す図。The figure which shows the processing flow of a concept system update process part. 概念体系更新処理の詳細を示す図。The figure which shows the detail of a concept system update process. コンテンツ評価処理部の処理フローを示す図。The figure which shows the processing flow of a content evaluation process part. コンテンツスコアリストの一例を示す図。The figure which shows an example of a content score list. コンテンツ評価処理の詳細を示す図。The figure which shows the detail of a content evaluation process. コンテンツ要求データの一例を示す図。The figure which shows an example of content request data. 図２に示した興味分析装置による属性モデル作成処理の第１の例を示す図。The figure which shows the 1st example of the attribute model creation process by the interest analysis apparatus shown in FIG. 図２に示した興味分析装置による属性モデル作成処理の第２の例を示す図。The figure which shows the 2nd example of the attribute model creation process by the interest analysis apparatus shown in FIG. 図２に示した興味分析装置による初期モデル作成処理の例を示す図。The figure which shows the example of the initial model creation process by the interest analysis apparatus shown in FIG. この発明の原理説明に用いる概略図。Schematic used for explanation of the principle of the present invention. 属性ごとのモデル作成処理を説明するための図。The figure for demonstrating the model creation process for every attribute. ユーザ興味スコアの更新式の第１の例を示す図。The figure which shows the 1st example of the update formula of a user interest score. ユーザ興味スコアの更新式の第２の例を示す図。The figure which shows the 2nd example of the update formula of a user interest score. 属性ごとのモデルを用いた初期モデル作成処理の概要を説明するための図。The figure for demonstrating the outline | summary of the initial model creation process using the model for every attribute. 属性ごとのモデルを用いた初期モデル作成処理に使用する計算式の一例を示す図。The figure which shows an example of the calculation formula used for the initial model creation process using the model for every attribute. モデル更新処理の輻輳について説明するための図。The figure for demonstrating the congestion of a model update process. ユーザのモデルを利用した属性ごとのモデル更新処理の説明に用いる図。The figure used for description of the model update process for every attribute using a user's model. ユーザのモデルを利用した属性ごとのモデル更新処理に使用する計算式の一例を示す図。The figure which shows an example of the calculation formula used for the model update process for every attribute using a user's model. ユーザモデルを用いた階層構造を持つ属性ごとのモデル作成処理の説明に用いる図。The figure used for description of model creation processing for every attribute with a hierarchical structure using a user model. ユーザモデルを用いた階層構造を持つ属性ごとのモデル作成処理に使用する計算式の一例を示す図。The figure which shows an example of the calculation formula used for the model creation process for every attribute with a hierarchical structure using a user model.

［原理］
先ず、この発明の原理を図２４〜図３４を用いて説明する。
この発明は、概念出現の希少性を利用して情報利用者の興味を高精度に推定し、その推定結果を用いて情報利用者毎のモデルを作成し、この作成されたモデルを用いて情報を推薦する方法を前提とする。この前提とする方法では、情報利用者が選択候補情報一覧から情報を選択した履歴から情報利用者の興味を推定する場合に、情報利用者の選択候補となる情報一覧を定義し、そこからの情報選択における概念の出現数を分析することで、各概念の出現の希少性を考慮する。さらに、各概念が出現しない希少性も利用している。このため、情報利用者の興味を高精度に推定することが可能となる。以下、情報利用者に対して提示された選択候補情報一覧と、情報利用者が選択候補情報一覧から任意の情報を選択した履歴を情報選択履歴と呼ぶ。 [principle]
First, the principle of the present invention will be described with reference to FIGS.
This invention estimates the interest of an information user with high precision using the scarcity of concept appearance, creates a model for each information user using the estimation result, and uses this created model to create information Assuming a method of recommending In this premise method, when the information user estimates the information user's interest from the history of selecting the information from the selection candidate information list, an information list that is a selection candidate of the information user is defined, and from there By analyzing the number of occurrences of concepts in information selection, the rareness of each concept appears. Furthermore, the rarity where each concept does not appear is also used. For this reason, it becomes possible to estimate an interest of an information user with high accuracy. Hereinafter, the selection candidate information list presented to the information user and the history in which the information user has selected arbitrary information from the selection candidate information list are referred to as an information selection history.

この発明は、上記前提とする方法をさらに改良し、ユーザが初めて情報推薦サービスを利用する場合であっても、比較的良い推薦結果を得ることができるようにするものである。
図２４はこの改良した発明の概要を説明するための図である。図２４に示すように、この発明に係る興味分析装置は、新規ユーザＦに対して信頼性の高い情報を推薦するために以下に示す処理を実行する。 The present invention further improves the above-described method so that a relatively good recommendation result can be obtained even when the user uses an information recommendation service for the first time.
FIG. 24 is a diagram for explaining the outline of the improved invention. As shown in FIG. 24, the interest analysis apparatus according to the present invention executes the following process to recommend highly reliable information to the new user F.

(1) 既存ユーザ（例えば図２４ではユーザＡ〜Ｅ）の履歴を用いて、属性毎、例えば図２４に示した例では東京や千葉等の居住地毎、男性や女性等の性別毎のモデルを作成する。
(2) 新規ユーザＦが開示を許諾する一部の属性情報をシステムに開示する。
(3) 新規ユーザＦが開示した属性に合致したモデル、例えば図２４の例では東京のモデルと女性のモデルを選定する。
(4) 選定されたモデルを用いて新規ユーザＦの初期モデルを作成する。
(5) 上記作成された初期モデルを用いて新規ユーザＦがサービスの利用を開始し、その後利用を継続する。これにより、新規ユーザＦのモデルを東京かつ女性のステレオタイプを表す初期モデルから、新規ユーザＦ自身を表すモデルへと徐々に修正し、推薦精度を向上させる。 (1) Using the history of existing users (for example, users A to E in FIG. 24), for each attribute, for example in the example shown in FIG. 24, for each residence such as Tokyo and Chiba, and for each sex such as male and female Create
(2) A part of attribute information that the new user F permits to disclose is disclosed to the system.
(3) A model that matches the attribute disclosed by the new user F, for example, the Tokyo model and the female model are selected in the example of FIG.
(4) Create an initial model for the new user F using the selected model.
(5) The new user F starts using the service using the created initial model, and then continues using it. As a result, the model of the new user F is gradually corrected from the initial model representing the stereotype of Tokyo and women to the model representing the new user F itself, and the recommendation accuracy is improved.

このような処理を実行することで、この発明に係る興味分析装置によれば次のような作用効果が奏せられる。すなわち、適切な初期モデルを用いることにより、利用開始時から比較的良好な推薦精度が実現されて高い満足度が実現され、これにより情報推薦サービスの利用が促進される。このため、情報選択履歴の蓄積が進み、ユーザ自身の興味に対する適合度が上昇し、推薦精度がより一層向上する。この結果、サービスの利用を停止する情報推薦サービス利用者が減り、情報推薦サービスが普及する。 By executing such processing, the following effects can be obtained according to the interest analysis apparatus according to the present invention. That is, by using an appropriate initial model, relatively good recommendation accuracy is realized from the start of use, and high satisfaction is realized, thereby promoting the use of the information recommendation service. For this reason, accumulation of the information selection history proceeds, the degree of adaptation to the user's own interest increases, and the recommendation accuracy is further improved. As a result, the number of information recommendation service users who stop using the service is reduced, and the information recommendation service becomes widespread.

次に、図２４に示した一連の処理のうち、既存ユーザの履歴を用いて属性毎のモデルを作成する方法について図２５を用いて説明する。
１回の学習において、複数のコンテンツを一覧として閲覧した履歴と、コンテンツの本体を閲覧した履歴とを用いてユーザのモデルを更新する。ここで、ユーザ毎のモデルを、概念毎に算出されるユーザ興味スコア（TotalZ）の集合とする。概念毎のユーザ興味スコア（TotalZ）を求めるとき、先ず複数のコンテンツを一覧として閲覧した第１のコンテンツリストと、この第１のコンテンツリストからコンテンツの本体を閲覧した第２のコンテンツリストとをクラスタ化する。このクラスタ化されたコンテンツリストを履歴クラスタと呼ぶ。 Next, a method for creating a model for each attribute using the history of existing users in the series of processes shown in FIG. 24 will be described with reference to FIG.
In one learning, the user's model is updated using a history of browsing a plurality of contents as a list and a history of browsing the content body. Here, the model for each user is a set of user interest scores (TotalZ) calculated for each concept. When obtaining the user interest score (TotalZ) for each concept, first cluster a first content list in which a plurality of contents are browsed as a list and a second content list in which content bodies are browsed from the first content list. Turn into. This clustered content list is called a history cluster.

次に、この履歴クラスタ毎に、上記第１のコンテンツリストのコンテンツの総数を第１の総数、上記第１のコンテンツリストにおいて上記概念が出現するコンテンツの数を第１の出現数、上記第２のコンテンツリストのコンテンツの総数を第２の総数、上記第２のコンテンツリストにおいて上記概念が出現するコンテンツの数を第２の出現数とそれぞれ表したとき、これらの第１の総数、第１の出現数及び第２の総数の条件下で、上記第２のコンテンツリストに上記概念が出現するコンテンツの数が上記第２の出現数以上となる第１の確率と、上記第２の出現数以下となる第２の確率を算出する。 Next, for each history cluster, the total number of contents in the first content list is the first total number, the number of contents in which the concept appears in the first content list is the first appearance number, and the second When the total number of contents in the content list is expressed as a second total number and the number of contents in which the concept appears in the second content list is expressed as a second appearance number, these first total number, first A first probability that the number of contents in which the concept appears in the second content list is greater than or equal to the second occurrence number under the condition of the occurrence number and the second total number, and less than or equal to the second occurrence number The second probability is calculated.

そして、上記第１及び第２の確率をもとに標準正規分布の累積分布関数の逆関数により特徴スコア（Ｚ）を算出し、この履歴クラスタ毎に算出した特徴スコアを用いて上記概念に対する上記ユーザ興味スコアを計算し更新する。図２６は上記特徴スコアを用いてユーザ興味スコアを算出する式を示したものである。ここで、重みＷは、各履歴クラスタにおいて概念毎に設定される値である。 Based on the first and second probabilities, a feature score (Z) is calculated by an inverse function of the cumulative distribution function of the standard normal distribution, and the feature score calculated for each history cluster is used to calculate the feature score. Calculate and update user interest scores. FIG. 26 shows an equation for calculating the user interest score using the feature score. Here, the weight W is a value set for each concept in each history cluster.

次に、図２５に示すようにユーザのモデルの更新と同時に属性毎のモデルを更新する。例えば、男性のモデルを更新する場合は、属性が“男性”のユーザ（図２５に示す例ではユーザＡ、ユーザＢ、ユーザＥ）の履歴クラスタが得られた際に、ユーザＡのモデル、ユーザＢのモデル、ユーザＥのモデルを更新するのと同時に、男性のモデルも更新する。属性Ｃに合致するユーザＡの履歴クラスタを用いた属性Ｃの各概念i に対する興味スコア（TotalZ）の更新式を図２７に示す。 Next, as shown in FIG. 25, the model for each attribute is updated simultaneously with the updating of the user's model. For example, when updating a male model, when a history cluster of a user whose attribute is “male” (user A, user B, user E in the example shown in FIG. 25) is obtained, the model of user A, the user At the same time as updating the model of B and the model of user E, the male model is also updated. FIG. 27 shows a formula for updating the interest score (TotalZ) for each concept i of the attribute C using the history cluster of the user A that matches the attribute C.

続いて、属性毎のモデルを用いた初期モデルを作成する。図２８はその作成方法を説明するための図である。例えば、新規ユーザＦが“東京”、“女性”という属性を情報推薦システムに対して提示した場合、情報推薦システムは新規ユーザＦの属性に合致する東京のモデルと女性のモデルを図２９に示す式を用いて結合し、新規ユーザＦの初期モデルとして用いる。 Subsequently, an initial model using a model for each attribute is created. FIG. 28 is a diagram for explaining the creation method. For example, when the new user F presents the attributes “Tokyo” and “female” to the information recommendation system, the information recommendation system shows the Tokyo model and female model that match the attributes of the new user F in FIG. Combined using equations, used as an initial model for new user F.

しかし、図２５に示すように履歴クラスタが得られるたびに属性毎のモデルを更新する方法を用いると、例えば男性のモデルや女性のモデルのようにユーザ数に対する割合が高い属性のモデルを更新する場合には、サービス利用者数の増加に従い当該モデルの更新頻度が増加する。そして、その結果システムの負荷が上昇し、更新処理の遅延や更新漏れ、システム停止を招くことがある（図３０）。 However, if a method for updating a model for each attribute each time a history cluster is obtained as shown in FIG. 25, for example, a model with a high ratio to the number of users such as a male model or a female model is updated. In this case, the update frequency of the model increases as the number of service users increases. As a result, the load on the system increases, which may cause delays in update processing, omission of updates, and system shutdown (FIG. 30).

この課題は以下のようにすることで解決される。図３１は、図２５に示した属性毎のモデルを更新する方法をさらに改良した提案を示すものである。属性毎のモデルの更新は、当該属性に合致する各ユーザの履歴クラスタを用いる代わりに、ユーザ毎の学習結果を用いて行う。但し、ユーザ毎に学習回数が異なるため、各ユーザの興味スコアを同じ割合で確率結合すると、学習回数の差異が考慮されない。従って、各ユーザの興味スコアでなく、共に保存されているＸ、Ｙを用いて求める。Ｘ、Ｙは各ユーザの学習の回数に依存する。このため、これらも考慮することにより、ユーザ毎の学習回数の差異を考慮することができる。 This problem can be solved as follows. FIG. 31 shows a proposal obtained by further improving the method for updating the model for each attribute shown in FIG. The model update for each attribute is performed using the learning result for each user instead of using the history cluster of each user that matches the attribute. However, since the number of learnings is different for each user, if the interest scores of the users are stochastically combined at the same rate, the difference in the number of learnings is not considered. Accordingly, X and Y stored together are used instead of the interest score of each user. X and Y depend on the number of times each user learns. For this reason, the difference of the learning frequency for every user can also be considered by considering these.

属性Ｃに関連する全てのユーザについての、Ｙi ，Ｘi が定まっているとき、属性Ｃに関するモデルの概念i に関するｎ番目の興味スコア（TotalZ _Cin）と、Ｙ_Cin，Ｘ_Cinを更新する式を図３２に示す。
さらに、属性毎のモデルを階層的に作成する例を図３３に示す。例えば、関東のモデルが東京のモデルと千葉のモデルを子に持つ場合、図３４に示すように子属性のモデルのＸ_子属性i，Ｙ_子属性iを用いて、親属性のモデルのＸ_親属性i，Ｙ_親属性iと、興味スコア（TotalZ_親属性in）を求めることができる。 When Yi and Xi are determined for all users related to attribute C, the nth interest score (TotalZ _Cin ) related to the concept i of the model related to attribute C and the formula for updating Y _Cin and X _Cin are shown in the figure. 32.
Further, FIG. 33 shows an example of hierarchically creating a model for each attribute. For example, when a Kanto model has a Tokyo model and a Chiba model as children, as shown in FIG. 34, the X _{child attribute i} and Y _{child attribute i} of the _{child attribute} model are used, and the X _{parent of the} parent attribute model is used. _{Attribute i} , Y _{parent attribute i} , and interest score (TotalZ _{parent attribute in} ) can be obtained.

［実施形態］
以下、図面を参照してこの発明に係わる一実施形態を説明する。
（１）システム全体の構成
図１は、この発明の一実施形態に係る興味分析装置を用いたシステムの全体構成図である。このシステムは、クライアント端末２００と、コンテンツサーバ３００と、興味分析装置１００を備える。クライアント端末２００とコンテンツサーバ３００との間、及びコンテンツサーバ３００と興味分析装置１００との間はそれぞれ通信ネットワークで接続される。ユーザは、クライアント端末２００上での閲覧操作により、所望のコンテンツをコンテンツサーバ３００から取得し、取得したコンテンツをクライアント端末２００の画面に表示させ、閲覧する。 [Embodiment]
Hereinafter, an embodiment according to the present invention will be described with reference to the drawings.
(1) Overall Configuration of System FIG. 1 is an overall configuration diagram of a system using an interest analysis apparatus according to an embodiment of the present invention. This system includes a client terminal 200, a content server 300, and an interest analysis device 100. The client terminal 200 and the content server 300, and the content server 300 and the interest analysis device 100 are connected to each other via a communication network. The user acquires desired content from the content server 300 through a browsing operation on the client terminal 200, displays the acquired content on the screen of the client terminal 200, and browses.

クライアント端末２００は、ユーザ操作によるコンテンツ閲覧履歴を収集し、複数のコンテンツを一覧として閲覧した一覧閲覧コンテンツリスト（第１のコンテンツリスト）と、コンテンツの一覧からコンテンツの本体を閲覧した詳細閲覧コンテンツリスト（第２のコンテンツリスト）とをコンテンツサーバ３００に送信する。コンテンツサーバ３００は、この一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストを、通信ネットワークを介して興味分析装置１００に転送する。 The client terminal 200 collects a content browsing history by a user operation, browses a plurality of content as a list, a list browsing content list (first content list), and a detailed browsing content list that browses the content body from the content list (Second content list) is transmitted to the content server 300. The content server 300 transfers the list browsing content list and the detailed browsing content list to the interest analysis device 100 via the communication network.

興味分析装置１００は、この一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストをもとに、コンテンツに出現する各概念に対する特徴スコア及びユーザ興味スコアを算出し、ユーザの興味を推定する。興味分析装置１００は、このユーザ興味スコアに基づいて、コンテンツサーバ３００から受け取った「提示コンテンツリスト」から、ユーザの興味に合わせてソートを行ったコンテンツのリスト（ソート済み提示コンテンツリスト）を生成し、コンテンツサーバ３００に送信する。
また興味分析装置１００は、定期的にユーザの興味モデルを用いて属性の興味モデルを作成し、初期利用ユーザから属性情報の提示を受けた場合には、属性にあわせて初期モデルを作成する機能も備える。 The interest analysis apparatus 100 calculates a feature score and a user interest score for each concept appearing in the content based on the list browsing content list and the detailed browsing content list, and estimates the user's interest. Based on this user interest score, the interest analysis apparatus 100 generates a list of contents sorted according to the user's interest (sorted presented content list) from the “presentation content list” received from the content server 300. To the content server 300.
In addition, the interest analysis apparatus 100 periodically creates an interest model of an attribute using a user's interest model, and creates an initial model according to the attribute when receiving attribute information from an initial use user. Also equipped.

図２は、図１に示したクライアント端末２００、コンテンツサーバ３００及び興味分析装置１００の機能構成を示すブロック図である。
（２）興味分析装置の構成
興味分析装置１００は、履歴情報受信部１１０、特徴スコア算出部１２０、概念体系更新処理部１３０、概念体系／ユーザ興味スコアデータベース１４０、提示コンテンツリスト受信部１５０、コンテンツデータベース１６０、コンテンツ評価処理部１７０、及びソート済みコンテンツスコアリスト送信部１８０を備え、さらに属性興味スコア更新起動部１９０、属性興味スコア更新部１９５、属性興味スコアデータベース１９６、初期ユーザ興味スコア作成部１９７及びユーザ／属性情報データベース１９８を備えている。 FIG. 2 is a block diagram illustrating functional configurations of the client terminal 200, the content server 300, and the interest analysis apparatus 100 illustrated in FIG.
(2) Configuration of Interest Analysis Device The interest analysis device 100 includes a history information reception unit 110, a feature score calculation unit 120, a concept system update processing unit 130, a concept system / user interest score database 140, a presentation content list reception unit 150, content A database 160, a content evaluation processing unit 170, and a sorted content score list transmission unit 180; an attribute interest score update activation unit 190; an attribute interest score update unit 195; an attribute interest score database 196; and an initial user interest score creation unit 197. And a user / attribute information database 198.

図３は、興味分析装置１００の処理の概要を示すものである。
履歴情報受信部１１０は、クライアント端末２００からの一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストをコンテンツサーバ３００を介して受信する。一覧閲覧コンテンツリストとは、例えばユーザがコンテンツのタイトルのみを一覧で閲覧したコンテンツのリストである。詳細閲覧コンテンツリストとは、ユーザがコンテンツ本体の内容（詳細）を閲覧したコンテンツのリストである。例えば、図３において、一覧閲覧コンテンツリストには、コンテンツ１〜８が含まれ、詳細閲覧コンテンツリストには、コンテンツ１，３，４が含まれる。また図３において、斜線パターンで示すコンテンツは、概念Ｂがコンテンツ１，６，７，８に出現することを示す。 FIG. 3 shows an outline of processing of the interest analysis apparatus 100.
The history information receiving unit 110 receives the list browsing content list and the detailed browsing content list from the client terminal 200 via the content server 300. The list browsing content list is, for example, a list of content in which the user browses only the content titles in a list. The detailed browsing content list is a list of content that the user has viewed the content (details) of the content body. For example, in FIG. 3, the list browsing content list includes contents 1 to 8, and the detailed browsing content list includes contents 1, 3, and 4. In FIG. 3, the content indicated by the hatched pattern indicates that the concept B appears in the contents 1, 6, 7, and 8.

特徴スコア算出部１２０は、一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストを利用して概念選択の統計モデルにより各概念の特徴スコア（後述するＺ値）を算出する。
概念体系更新処理部１３０は、上記特徴スコアを用いて概念体系における概念間の関係情報（上位概念及び下位概念）に基づいて各概念に対するユーザ興味スコアを更新する。概念体系のグラフに含まれるノードは概念を表し、リンクは概念間の関係を表す。ユーザ興味スコアは、概念体系における各概念に対応するノードの値として保持する。概念体系において、上位に位置するノードほど抽象的な概念を表し、下位に位置するノードほど具体的な概念を表す。概念体系及び概念ＩＤ（ノード毎に付与される識別子）は、サービス運用者等が事前に設計し定義するものとする。 The feature score calculation unit 120 calculates a feature score (Z value to be described later) of each concept by a statistical model of concept selection using the list browsing content list and the detailed browsing content list.
The concept system update processing unit 130 updates the user interest score for each concept based on the relationship information (superordinate concept and subordinate concept) in the concept system using the feature score. Nodes included in the graph of the concept system represent concepts, and links represent relationships between concepts. The user interest score is held as a value of a node corresponding to each concept in the concept system. In the concept system, the nodes located at the higher level represent the abstract concept, and the nodes located at the lower level represent the specific concept. The concept system and concept ID (identifier assigned to each node) are designed and defined in advance by a service operator or the like.

コンテンツ評価処理部１７０は、評価コンテンツに出現する各概念のユーザ興味スコアを利用して確率結合によってコンテンツに対するユーザの評価スコアを算出する。図３の例では、コンテンツ１に出現する概念Ｅ，Ｆ，Ｄのユーザ興味スコアを用いてコンテンツ１の評価スコアを求めている。興味分析装置１００の詳しい動作説明は後述する。 The content evaluation processing unit 170 calculates the user's evaluation score for the content by probability combining using the user interest score of each concept appearing in the evaluation content. In the example of FIG. 3, the evaluation score of the content 1 is obtained using the user interest scores of the concepts E, F, and D that appear in the content 1. Detailed operation description of the interest analysis apparatus 100 will be described later.

属性興味スコア更新起動部１９０は、深夜時間帯などの負荷が少ない時間帯に定期的に属性興味スコアの更新処理を起動する。
属性興味スコアデータベース１９６は、後述する属性興味スコア更新部１９５により算出された属性のＸ，Ｙ，TotalZを記憶するために用いられる。
ユーザ／属性情報データベース１９８は、ユーザ情報とその属性情報を記憶するために用いられる。 The attribute interest score update activation unit 190 periodically activates an attribute interest score update process in a time zone with a low load such as a midnight time zone.
The attribute interest score database 196 is used to store X, Y, and TotalZ of attributes calculated by the attribute interest score update unit 195 described later.
The user / attribute information database 198 is used for storing user information and its attribute information.

属性興味スコア更新部１９５は、ユーザ／属性情報データベース１９８に記憶された属性情報を用いて、属性毎に当該属性に合致するユーザ群を特定し、この特定したユーザのＸ，Ｙをもとに図３２に示した式を用いて上記属性のＸ，Ｙ，TotalZを求め、この求めた結果を属性興味スコアデータベースに保存する処理機能を有する。 The attribute interest score update unit 195 uses the attribute information stored in the user / attribute information database 198 to specify a user group that matches the attribute for each attribute, and based on the X and Y of the specified user. 32 has a processing function for obtaining X, Y, and TotalZ of the attributes using the formula shown in FIG. 32 and storing the obtained results in the attribute interest score database.

また属性興味スコア更新部１９５は、ユーザ／属性情報データベース１９８に記憶された属性情報を用いて、属性の子属性群を特定し、この特定した属性のＸ，Ｙをもとに図３４に示した式を用いて上記属性のＸ，Ｙ，TotalZを求め、この求めた結果を属性興味スコアデータベース１９６に保存する処理機能も有する。なお、Ｘ，Ｙ，TotalZについては後述する。 Also, the attribute interest score update unit 195 specifies attribute child attribute groups using attribute information stored in the user / attribute information database 198, and the attribute interest score update unit 195 is shown in FIG. 34 based on the X and Y of the specified attributes. The processing function for obtaining X, Y, and TotalZ of the above-described attributes using the formulas and storing the obtained results in the attribute interest score database 196 is also provided. X, Y, and TotalZ will be described later.

初期ユーザ興味スコア作成部１９７は、クライアント端末１００からコンテンツサーバ３００を経由して転送された新規ユーザの属性に合致する属性のモデルを、属性興味スコアデータベース１９６から読み出す。そして、この読み出された属性のモデルを図２９に示す式を用いて確率結合し、その結果を初期利用ユーザの初期モデルとして概念体系／ユーザ興味スコアデータベース１４０に保存する機能を有する。 The initial user interest score creation unit 197 reads an attribute model that matches the attributes of the new user transferred from the client terminal 100 via the content server 300 from the attribute interest score database 196. Then, the read attribute model is stochastically coupled using the formula shown in FIG. 29, and the result is stored in the conceptual system / user interest score database 140 as an initial model of the initial use user.

（３）クライアント端末の構成
図２において、クライアント端末２００は、履歴収集部２１０、履歴情報送信部２２０、コンテンツ提示部２３０、コンテンツ要求送信部２４０及び属性提示部２５０を備える。
コンテンツ要求送信部２４０は、ユーザの指示（入力）によりコンテンツサーバ３００に対してコンテンツの提示要求を行う。具体的には、図２０に示すようなコンテンツ要求データをコンテンツサーバ３００に送信する。例えば、コンテンツ要求データは、クライアント端末ＩＤ（もしくはユーザＩＤ）及び要求時刻を有する。なお、要求時刻は、コンテンツサーバ３００において追加するようにしてもよい。クライアント端末ＩＤ（もしくはユーザＩＤ）は、端末（もしくはユーザ）毎に一意に付与される数字であって、後述する概念体系／ユーザ興味スコアデータベース１４０のユーザ興味スコアテーブルのユーザＩＤと一致するＩＤである。 (3) Configuration of Client Terminal In FIG. 2, the client terminal 200 includes a history collection unit 210, a history information transmission unit 220, a content presentation unit 230, a content request transmission unit 240, and an attribute presentation unit 250.
The content request transmission unit 240 makes a content presentation request to the content server 300 in accordance with a user instruction (input). Specifically, content request data as shown in FIG. 20 is transmitted to the content server 300. For example, the content request data has a client terminal ID (or user ID) and a request time. The request time may be added in the content server 300. The client terminal ID (or user ID) is a number uniquely assigned to each terminal (or user), and is an ID that matches the user ID of the user interest score table of the conceptual system / user interest score database 140 described later. is there.

図４は、クライアント端末２００上でのユーザによるコンテンツ閲覧操作の一例を示したものである。
コンテンツ提示部２３０は、コンテンツサーバ３００から受信したソート済み提示コンテンツリストをもとに、クライアント端末２００の表示画面サイズが許容する範囲でソート順の上位から一覧として表示を行う。 FIG. 4 shows an example of a content browsing operation by the user on the client terminal 200.
Based on the sorted presentation content list received from the content server 300, the content presentation unit 230 displays a list from the top of the sort order within the range allowed by the display screen size of the client terminal 200.

図４に示した例では、１０個のコンテンツ（コンテンツ１〜１０）が一覧表示されている。ユーザのフリック、スクロールバーの操作等で一覧によりソート順下位のコンテンツが表示することができる。このように実際にクライアント端末２００に表示されたコンテンツのリストを一覧閲覧コンテンツリストとする。つまり、ソート済み提示コンテンツリスト内のすべてのコンテンツがクライアント端末２００で表示されるとは限らないため、一覧閲覧コンテンツリストに含まれるとは限らない。ユーザがこの一覧から各コンテンツのタイトルをクリック操作等で選択すると、選択されたタイトルのコンテンツ（図４のコンテンツ３，５，６）の本体（詳細）を閲覧することができる。この詳細を閲覧したコンテンツを、詳細閲覧コンテンツリストに含む。 In the example shown in FIG. 4, ten contents (contents 1 to 10) are displayed in a list. The content in the lower order of the sort order can be displayed by the list by the user's flick, scroll bar operation or the like. The list of contents actually displayed on the client terminal 200 in this way is referred to as a list browsing content list. That is, not all the contents in the sorted presentation content list are displayed on the client terminal 200, and thus are not necessarily included in the list browsing content list. When the user selects a title of each content from this list by clicking or the like, the main body (details) of the content of the selected title (contents 3, 5, and 6 in FIG. 4) can be viewed. The content whose details are browsed is included in the detailed browsing content list.

履歴収集部２１０は、上述したように、ユーザの操作履歴を収集して一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストを作成する。履歴情報送信部２２０は、履歴収集部２１０により作成された一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストをコンテンツサーバ３００に送信する。 As described above, the history collection unit 210 collects user operation histories and creates a list browsing content list and a detailed browsing content list. The history information transmission unit 220 transmits the list browsing content list and the detailed browsing content list created by the history collection unit 210 to the content server 300.

図５に、上記図４の場合の一覧閲覧コンテンツリストのデータ構成例を示す。一覧閲覧コンテンツリストは、クラスタＩＤ、コンテンツＩＤ及び閲覧時刻を有する。クラスタとは、一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストに一意に付与される識別子（図５では“１”）である。別の時刻（時間帯）に表示した一覧閲覧コンテンツをユーザが閲覧した場合は、別のクラスタＩＤが付与される。なお、時刻以外の条件でクラスタＩＤを新たに付与する条件としては、一覧閲覧コンテンツリスト表示中に一定時間操作が無かった場合や、場所を移動した場合、気温や天候が変化した場合、同行者が変化した場合、閲覧するユーザ（ユーザＩＤ）を切り替えた場合、一覧閲覧コンテンツリストに対してコンテンツジャンル等を観点に絞り込み検索を掛けた場合、その他閲覧アプリケーションにおいて閲覧モードを切り替えた場合がある。コンテンツＩＤは、一覧閲覧コンテンツの各コンテンツに一意に付与された識別子であり、後述するコンテンツデータベース１６０が保持する値と一致するものとする。 FIG. 5 shows a data configuration example of the list browsing content list in the case of FIG. The list browsing content list has a cluster ID, a content ID, and a browsing time. The cluster is an identifier ("1" in FIG. 5) uniquely assigned to the list browsing content list and the detailed browsing content list. When the user browses the list browsing content displayed at another time (time zone), another cluster ID is given. The conditions for newly assigning the cluster ID under conditions other than the time include cases where there is no operation for a certain period of time while the list browsing content list is displayed, when the location is moved, when the temperature or weather changes, and the accompanying person May change, the browsing user (user ID) may be switched, the narrowed search may be applied to the list browsing content list from the viewpoint of the content genre, or the browsing mode may be switched in other browsing applications. The content ID is an identifier uniquely assigned to each content of the list browsing content, and is assumed to match a value held in a content database 160 described later.

図６は、上記図４の場合の詳細閲覧コンテンツリストのデータ構成例を示したものである。詳細閲覧コンテンツリストは、上記一覧閲覧コンテンツリストと同様に、クラスタＩＤ、コンテンツＩＤ及び閲覧時刻を有する。クラスタＩＤは、一覧閲覧コンテンツリストと同一の値とする（図６では“１”）。コンテンツＩＤ及び閲覧時刻は、詳細閲覧コンテンツリストでは、ユーザが一覧閲覧コンテンツから選択して詳細を閲覧したコンテンツ（図６ではコンテンツ３，５，６）の識別子及び当該コンテンツを閲覧した時刻となる。 FIG. 6 shows a data configuration example of the detailed browsing content list in the case of FIG. The detailed browsing content list has a cluster ID, a content ID, and a browsing time, like the list browsing content list. The cluster ID is set to the same value as the list browsing content list (“1” in FIG. 6). In the detailed browsing content list, the content ID and browsing time are the identifier of the content (contents 3, 5, and 6 in FIG. 6) that the user has selected from the browsing content and browsed the content and the time when the content was browsed.

属性提示部２５０は、この情報推薦サービスを初めて利用するユーザ、つまり新規ユーザ（初期利用ユーザ）が自己の属性情報を入力した場合に、この属性情報の入力を受け付けてコンテンツサーバ３００へ送信する機能を有する。 The attribute presenting unit 250 has a function of accepting input of this attribute information and transmitting it to the content server 300 when a user who uses this information recommendation service for the first time, that is, a new user (initial use user) inputs his / her attribute information. Have

（４）コンテンツサーバの構成
上記図２において、コンテンツサーバ３００は、コンテンツ送信処理部３１０、ソート済み提示コンテンツリスト受信部３２０、提示コンテンツリスト送信部３３０、提示コンテンツリスト入力部３４０、履歴情報転送部３５０、コンテンツ要求転送部３６０及び属性転送部３７０を備える。 (4) Configuration of Content Server In FIG. 2, the content server 300 includes a content transmission processing unit 310, a sorted presentation content list reception unit 320, a presentation content list transmission unit 330, a presentation content list input unit 340, and a history information transfer unit. 350, a content request transfer unit 360 and an attribute transfer unit 370.

履歴情報転送部３５０は、クライアント端末２００から受信した一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストを、通信ネットワークを介して興味分析装置１００に転送する。
提示コンテンツリスト入力部３４０には、サービス運用者により、ユーザの利用するクライアント端末２００に提示するコンテンツを一覧にした提示コンテンツリストが入力される。提示コンテンツリスト送信部３３０は、上記入力された提示コンテンツリストを興味分析装置１００へ通信ネットワークを介して送信する。 The history information transfer unit 350 transfers the list browsing content list and the detailed browsing content list received from the client terminal 200 to the interest analysis device 100 via the communication network.
The presentation content list input unit 340 receives a presentation content list that lists contents to be presented to the client terminal 200 used by the user by the service operator. The presented content list transmission unit 330 transmits the input presented content list to the interest analysis apparatus 100 via the communication network.

図７に、提示コンテンツリストのデータ構成例を示す。提示コンテンツリストは、コンテンツＩＤ、概念ＩＤ／関連度リスト、コンテンツ本体、及びコンテンツ登録時刻を有する。コンテンツＩＤは、各コンテンツに対してコンテンツサーバ３００にて付与される一意のＩＤである。概念ＩＤ／関連度リストは、コンテンツに出現する概念の概念ＩＤ及び当該概念とコンテンツと関連性の程度を示す値のセットが格納される。概念ＩＤ／関連度リストは、コンテンツ毎に予め設定されており、具体例としては、コンテンツ１（スポーツ記事）には、｛“野球”の概念ＩＤ=１，関連度＝０．５｝、｛“サッカー”の概念ＩＤ=２，関連度＝０．８｝、｛“ゴルフ”の概念ＩＤ=３、関連度＝０．６｝…のように、概念ＩＤと関連度のセットが格納される。 FIG. 7 shows a data configuration example of the presented content list. The presented content list has a content ID, a concept ID / relevance list, a content body, and a content registration time. The content ID is a unique ID assigned by the content server 300 to each content. The concept ID / relationship degree list stores a concept ID of a concept that appears in content and a set of values indicating the degree of relevance between the concept and the content. The concept ID / relevance degree list is set in advance for each content. As a specific example, content 1 (sports article) includes {“baseball” concept ID = 1, relevance = 0.5}, { A set of concept ID and degree of association is stored as “soccer” concept ID = 2, degree of association = 0.8}, {“golf” concept ID = 3, degree of association = 0.6}. .

なお、概念ＩＤは、概念体系／ユーザ興味スコアデータベース１４０に格納される値と一致する。関連度は、例えば、０から１までの値とし、大きいほど関連性が強いものとする。関連度は、サービス運用者がコンテンツ登録時に設定する値、若しくは別システムにより算出される値を利用する。 The concept ID matches the value stored in the concept system / user interest score database 140. For example, the relevance is a value from 0 to 1, and the larger the relevance, the stronger the relevance. As the relevance, a value set by the service operator at the time of content registration or a value calculated by another system is used.

ソート済み提示コンテンツリスト受信部３２０は、興味分析装置１００から提示コンテンツリストの一部又は全部をソートしたソート済み提示コンテンツリストとクライアント端末ＩＤ（もしくはユーザＩＤ）を受信する。コンテンツ送信処理部３１０は、ソート済み提示コンテンツリストをクライアント端末ＩＤ（もしくはユーザＩＤ）に該当するクライアント端末２００に送信する。 The sorted presentation content list receiving unit 320 receives a sorted presentation content list and a client terminal ID (or user ID) obtained by sorting a part or all of the presentation content list from the interest analysis device 100. The content transmission processing unit 310 transmits the sorted presentation content list to the client terminal 200 corresponding to the client terminal ID (or user ID).

コンテンツ要求転送部３６０は、クライアント端末２００のコンテンツ要求送信部２４０からのコンテンツ提示要求であるコンテンツ要求データ（図２０）を興味分析装置１００に転送する。
属性転送部３７０は、上記クライアント端末２００から送信された新規ユーザの属性情報を受信し、この受信した新規ユーザの属性情報を興味分析装置１００に転送する機能を有する。 The content request transfer unit 360 transfers content request data (FIG. 20), which is a content presentation request from the content request transmission unit 240 of the client terminal 200, to the interest analysis device 100.
The attribute transfer unit 370 has a function of receiving the new user attribute information transmitted from the client terminal 200 and transferring the received new user attribute information to the interest analysis apparatus 100.

（５）興味分析装置の動作
次に、興味分析装置１００の各部の動作について説明する。
（５−１）コンテンツデータベース１６０
図８にコンテンツデータベース１６０のデータ構造の一例を示す。コンテンツデータベース１６０は、コンテンツテーブルと、ユーザ履歴テーブルとを有する。
コンテンツテーブルは、コンテンツＩＤ、概念ＩＤ／関連度リスト、コンテンツ本体、及びコンテンツ登録時刻を格納する。提示コンテンツリスト受信部１５０で受信した値が格納される。 (5) Operation of Interest Analysis Device Next, the operation of each part of the interest analysis device 100 will be described.
(5-1) Content database 160
FIG. 8 shows an example of the data structure of the content database 160. The content database 160 has a content table and a user history table.
The content table stores a content ID, a concept ID / relevance list, a content body, and a content registration time. The value received by the presented content list receiving unit 150 is stored.

ユーザ履歴テーブルは、コンテンツＩＤ、ユーザＩＤ（クライアント端末ＩＤ）、詳細閲覧総数、詳細閲覧時刻、一覧閲覧総数、一覧閲覧時刻、及び一覧非表示フラグを格納する。詳細閲覧時刻は、詳細閲覧総数が０の場合はｎｕｌｌ、１以上であれば各閲覧の時系列による閲覧時刻のリストを格納する。一覧閲覧時刻は、一覧閲覧総数が０の場合はｎｕｌｌ、１以上であれば各閲覧の時系列による閲覧時刻のリストを格納する。一覧非表示フラグは、まだユーザにクライアント端末の画面上で一覧としても表示／視認していない場合はｆａｌｓｅ、一度でも閲覧した場合はｔｒｕｅを格納する。ユーザ履歴テーブルおいては、ユーザＩＤ毎に全コンテンツＩＤの値を保持する。詳細閲覧総数及び一覧閲覧総数は、上記クラスタＩＤで示される一覧閲覧コンテンツリストが多数受信された場合には過去の履歴の累計を格納する。 The user history table stores content ID, user ID (client terminal ID), detailed browsing total number, detailed browsing time, list browsing total number, list browsing time, and list non-display flag. The detailed browsing time is null when the total number of detailed browsing is 0, and if it is 1 or more, a list of browsing times in a time series of each browsing is stored. The list browsing time is null when the total number of browsing the list is 0, and if it is 1 or more, a list of browsing times according to the time series of each browsing is stored. The list non-display flag stores false when the user has not yet displayed / viewed as a list on the screen of the client terminal, and stores true when the list has been viewed even once. In the user history table, all content ID values are held for each user ID. As the detailed browsing total number and the list browsing total number, when a large number of list browsing content lists indicated by the cluster ID are received, a cumulative total of past histories is stored.

例えば、このユーザ履歴テーブルのデータを利用することで、ユーザの閲覧回数に応じて、コンテンツについて、今後の評価（コンテンツ評価処理部１７０での処理時）で評価スコアを下げるようにする。評価スコアの低減方法としては、あるコンテンツに対する閲覧回数をｋとしたとき、当該コンテンツの評価スコアをｋ＋１で割る、或いは評価スコアに重み（例えば０．９）のｋ乗を乗算するなどがある。この処理により、同じコンテンツの反復提示を興味との一致度を加味して低減することができるためユーザの推薦に対する満足度を向上することができる。 For example, by using the data of the user history table, the evaluation score of the content is lowered in the future evaluation (during processing in the content evaluation processing unit 170) according to the number of times the user browses. As a method for reducing the evaluation score, when the number of browsing for a certain content is k, the evaluation score of the content is divided by k + 1, or the evaluation score is multiplied by a weight (for example, 0.9) to the kth power. By this process, it is possible to reduce the repeated presentation of the same content in consideration of the degree of coincidence with the interest, so that the satisfaction with the user's recommendation can be improved.

（５−２）概念体系／ユーザ興味スコアデータベース１４０
図９に概念体系／ユーザ興味スコアデータベース１４０のデータ構造の一例を示す。概念体系／ユーザ興味スコア１６０は、ルート概念ノードＩＤと、概念体系テーブルと、ユーザ興味スコアテーブルとを有する。
ルート概念ノードＩＤとは、概念体系構造において最上位にある概念ノードＩＤであり、システム内に１つだけ存在する。 (5-2) Conceptual system / user interest score database 140
FIG. 9 shows an example of the data structure of the conceptual system / user interest score database 140. The concept system / user interest score 160 includes a root concept node ID, a concept system table, and a user interest score table.
The root concept node ID is the highest concept node ID in the concept system structure, and only one root node exists in the system.

概念体系テーブルは、自概念ＩＤ、親概念ＩＤリスト及び子概念ＩＤリストを格納する。概念体系内の全ての自概念ＩＤは、親概念ＩＤ及び子概念ＩＤ（ただし、自概念が最下位の場合には子概念ＩＤはなし）と紐付けて保存されており、これにより概念構造が定義される。
ユーザ興味スコアテーブルは、概念ＩＤ、ユーザＩＤ（クライアント端末ＩＤ）、TotalZ（ユーザ興味スコア）、Ｘ、及びＹの値を格納する。TotalZ、Ｘ及びＹの定義及び算出方法は後述する。 The concept system table stores a self-concept ID, a parent concept ID list, and a child concept ID list. All self-concept IDs in the concept system are stored in association with a parent concept ID and a child concept ID (however, if the self-concept is the lowest, there is no child concept ID), thereby defining the concept structure Is done.
The user interest score table stores values of concept ID, user ID (client terminal ID), TotalZ (user interest score), X, and Y. The definition and calculation method of TotalZ, X, and Y will be described later.

（５−３）提示コンテンツリスト受信部１５０
提示コンテンツリスト受信部１５０は、コンテンツサーバ３００から上記図７に示すような提示コンテンツリストを受信し、コンテンツデータベース１６０に保存する。 (5-3) Presented content list receiving unit 150
The presented content list receiving unit 150 receives the presented content list as shown in FIG. 7 from the content server 300 and stores it in the content database 160.

（５−４）履歴情報受信部１１０
図１０は、履歴情報受信部１１０の処理手順と処理内容を示すフローチャートである。
（ステップＳ１１：履歴データ受信）
履歴情報受信部１１０は、コンテンツサーバ３００の履歴情報転送部３５０から通信ネットワークを介してクライアント端末ＩＤ（もしくはユーザＩＤ）、一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストを受信し、特徴スコア算出部１２０へ出力する。 (5-4) History information receiving unit 110
FIG. 10 is a flowchart showing a processing procedure and processing contents of the history information receiving unit 110.
(Step S11: History data reception)
The history information receiving unit 110 receives the client terminal ID (or user ID), the list browsing content list, and the detailed browsing content list from the history information transfer unit 350 of the content server 300 via the communication network, and sends the client information to the feature score calculation unit 120. Output.

（５−５）特徴スコア算出部１２０
図１１は、特徴スコア算出部１２０の処理手順と処理内容を示すフローチャートである。特徴スコア算出部１２０には、履歴情報受信部１１０からクライアント端末ＩＤ（もしくはユーザＩＤ）、一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストが入力される。 (5-5) Feature score calculation unit 120
FIG. 11 is a flowchart showing a processing procedure and processing contents of the feature score calculation unit 120. A client terminal ID (or user ID), a list browsing content list, and a detailed browsing content list are input from the history information receiving unit 110 to the feature score calculation unit 120.

（ステップＳ１２：出現概念抽出）
特徴スコア算出部１２０は、詳細閲覧コンテンツリスト内の各コンテンツに出現する概念ＩＤをコンテンツデータベース１６０から抽出する。具体的には、図６の詳細閲覧コンテンツリストにおいて、各コンテンツＩＤに紐付けされている「概念ＩＤ」を図８のコンテンツデータベース１６０のコンテンツテーブルから検索する。 (Step S12: Appearance concept extraction)
The feature score calculation unit 120 extracts a concept ID that appears in each content in the detailed browsing content list from the content database 160. Specifically, the “concept ID” associated with each content ID in the detailed browsing content list of FIG. 6 is searched from the content table of the content database 160 of FIG.

また特徴スコア算出部１２０は、クラスタデータ｛クラスタＩＤ，一覧閲覧コンテンツリスト，詳細閲覧コンテンツリスト｝と、コンテンツＩＤ／概念ＩＤ関連づけリスト｛｛コンテンツＩＤ，｛関連づいている概念ＩＤ，…｝｝，…｝と、出現概念リスト｛概念ＩＤ｝とを生成する。ここで、「コンテンツＩＤ／概念ＩＤ関連付けリスト」とは、コンテンツＩＤをもとに検索された概念ＩＤのリストである。「出現概念リスト」とは、一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストに含まれる各コンテンツに出現する概念の概念ＩＤを全て列挙したものである。 The feature score calculation unit 120 also includes cluster data {cluster ID, list browsing content list, detailed browsing content list}, content ID / concept ID association list {{content ID, {related concept ID, ...}}, ...} and the appearance concept list {concept ID} are generated. Here, the “content ID / concept ID association list” is a list of concept IDs searched based on the content ID. The “appearance concept list” is a list of all concept IDs of concepts that appear in each content included in the list browsing content list and the detailed browsing content list.

（ステップＳ１３：出現概念抽出）
特徴スコア算出部１２０は、「出現概念リスト」の各概念ＩＤについて、図９の概念体系／ユーザ興味スコアデータベース１４０から上位概念を抽出し、上位概念の概念ＩＤを「出現概念リスト」及び「コンテンツＩＤ／概念ＩＤ関連づけリスト」に追加する。 (Step S13: Appearance concept extraction)
For each concept ID in the “appearance concept list”, the feature score calculation unit 120 extracts a superordinate concept from the concept system / user interest score database 140 of FIG. 9, and sets the superordinate concept IDs as “appearance concept list” and “content”. To the “ID / concept ID association list”.

具体的には、特徴スコア算出部１２０は、図９の概念体系テーブルを検索することにより、「出現概念リスト」の概念ＩＤが「自概念ＩＤ」と一致するものを読み出し、その「親概念ＩＤ」を抽出する。例えば、上記抽出された「親概念ＩＤ」が図９の概念体系テーブルの「自概念ＩＤ」と一致するものを探し、その「親概念ＩＤ」も上位概念として抽出する。そして、特徴スコア算出部１２０は、上位概念の概念ＩＤを抽出の元になった出現概念の概念ＩＤを有するコンテンツＩＤに関連づける。 Specifically, the feature score calculation unit 120 searches the concept system table of FIG. 9 to read out the one whose concept ID in the “appearance concept list” matches the “own concept ID”, and the “parent concept ID”. Is extracted. For example, a search is made for a case where the extracted “parent concept ID” matches the “own concept ID” in the concept system table of FIG. 9, and the “parent concept ID” is also extracted as a superordinate concept. Then, the feature score calculation unit 120 associates the concept ID of the superordinate concept with the content ID having the concept ID of the appearance concept that is the source of extraction.

すなわち、上記抽出された「上位概念」を「元になった概念ＩＤを持っていたコンテンツＩＤ」に対して上位概念が付与されていたと見なして、「出現概念リスト」「コンテンツＩＤ／概念ＩＤ関連づけリスト」に追加する。なお、概念体系階層におけるルート概念の抽出は除外する。 That is, the extracted “superior concept” is regarded as having been assigned a superordinate concept with respect to “content ID having the original concept ID”, and “appearance concept list” “content ID / concept ID association” Add to list. The extraction of the root concept in the concept system hierarchy is excluded.

（ステップＳ１４：分析パラメータ抽出）
特徴スコア算出部１２０は、「出現概念リスト」の各概念について出現数を算出し、特徴スコアの算出に必要な分析パラメータを抽出し、分析パラメータリストを生成する。
図１２に、分析パラメータリストのデータ構成例を示す。分析パラメータリストは、クラスタＩＤ毎に、一覧閲覧コンテンツリストのコンテンツ総数Ｓ（第１の総数）、詳細閲覧コンテンツリストのコンテンツ総数ａ（第２の総数）、クラスタＩＤに紐づいた出現概念リスト内の概念ＩＤ毎に算出するＮとｎがある。Ｎ（第１の出現数）は、一覧閲覧コンテンツリストにおいて当該概念ＩＤが付与されているコンテンツ数とする。ｎ（第２の出現数）は詳細閲覧コンテンツリストにおける当該概念ＩＤが付与されているコンテンツ数とする。なお、ステップＳ１３にて追加した上位概念も含めて出現概念リスト内の概念ＩＤすべてについて、Ｎとｎを算出する。 (Step S14: Analysis parameter extraction)
The feature score calculation unit 120 calculates the number of appearances for each concept in the “appearance concept list”, extracts analysis parameters necessary for calculating the feature score, and generates an analysis parameter list.
FIG. 12 shows a data configuration example of the analysis parameter list. The analysis parameter list includes, for each cluster ID, the total content S (first total) of the list browsing content list, the total content a (second total) of the detailed browsing content list, and the appearance concept list associated with the cluster ID. N and n are calculated for each concept ID. N (first appearance number) is the number of contents to which the concept ID is assigned in the list browsing content list. n (second appearance number) is the number of contents to which the concept ID is assigned in the detailed browsing content list. Note that N and n are calculated for all concept IDs in the appearance concept list including the superordinate concept added in step S13.

図１３（ａ）に分析パラメータ抽出処理の模式図を示す。例えば、５０個（＝Ｓ）のコンテンツが一覧表示されている中から、ユーザが１０個（＝ａ）のコンテンツの詳細を閲覧した場合を示す。ここで、一覧表示されている５０個のコンテンツのうち「野球」という概念が含まれている記事が１５個（＝Ｎ）あり、ユーザが閲覧した１０個のコンテンツのうち、「野球」という概念が含まれているコンテンツが５個（＝ｎ）あったことを示す。 FIG. 13A shows a schematic diagram of the analysis parameter extraction process. For example, the case where the user browses the details of 10 (= a) contents from a list of 50 (= S) contents is shown. Here, 15 articles (= N) containing the concept of “baseball” among the 50 contents displayed in a list are displayed, and the concept of “baseball” is included in 10 contents viewed by the user. This indicates that there are five (= n) contents including “”.

（ステップＳ１５：特徴スコア算出）
特徴スコア算出部１２０は、上記分析パラメータＳ，ａ，Ｎ，ｎを利用して概念ＩＤ毎に特徴スコアＺを算出する。図１４に特徴スコア算出処理の詳細を示す。図１４において、ｉは概念の識別子、ｊは、クラスタＩＤを示す。Ｈ１（第１の確率）は、一覧閲覧コンテンツリストに含まれる一覧閲覧コンテンツの総数Ｓ、一覧閲覧コンテンツのうち概念ｉが出現するコンテンツ数Ｎのとき、詳細閲覧コンテンツをａ個ランダム選択して閲覧した場合に、概念ｉが出現する詳細閲覧コンテンツの数がｎ以上となる累積確率である。Ｈ２（第２の確率）は、一覧閲覧コンテンツリストに含まれる一覧閲覧コンテンツの総数Ｓ、一覧閲覧コンテンツのうち概念ｉが出現するコンテンツ数Ｎのとき、詳細閲覧コンテンツをａ個ランダム選択して閲覧した場合に、概念ｉが出現する詳細閲覧コンテンツの数がｎ以下となる累積確率である。なお、本実施形態では、累積確率Ｈ１及びＨ２は、超幾何分布により求めるが、この手法に限定するものではない。他の分布の例としては、二項分布、正規分布が存在する。 (Step S15: feature score calculation)
The feature score calculation unit 120 calculates a feature score Z for each concept ID using the analysis parameters S, a, N, and n. FIG. 14 shows details of the feature score calculation process. In FIG. 14, i is a concept identifier, and j is a cluster ID. When H1 (first probability) is the total number S of the list browsing contents included in the list browsing content list and the number N of the contents of the list browsing content where the concept i appears, a detailed browsing content is randomly selected and viewed. In this case, the cumulative probability that the number of detailed browsing contents in which the concept i appears is n or more. When H2 (second probability) is the total number S of the list browsing contents included in the list browsing content list and the number of contents N in which the concept i appears in the list browsing contents, a detailed browsing content is randomly selected and viewed. In this case, the cumulative probability that the number of detailed browsing contents in which the concept i appears is n or less. In the present embodiment, the cumulative probabilities H1 and H2 are obtained by the hypergeometric distribution, but are not limited to this method. Examples of other distributions include a binomial distribution and a normal distribution.

図１３（ｂ）に示すように、例えば、上記の分析パラメータＳ、Ｎ、ａ、ｎを用いて、ユーザが閲覧した１０個のコンテンツのうち、「野球」という概念が含まれるコンテンツが５以上である確率が「０．１２」であることを示す。ここで、「０．１２」は、累積確率Ｈ１の値に相当する。
なお、Ｈ２の値を使う例として、上記の分析パラメータでｎが０である場合を考える。この場合は、出現数が０以下の場合の確率を算出する。具体的には、図１３（ｂ）において横軸が０の項目の値となるため「０．０２」となる。 As shown in FIG. 13B, for example, there are five or more contents including the concept of “baseball” among the ten contents viewed by the user using the analysis parameters S, N, a, and n described above. It is shown that the probability of being “0.12”. Here, “0.12” corresponds to the value of the cumulative probability H1.
As an example of using the value of H2, consider the case where n is 0 in the above analysis parameters. In this case, the probability when the number of appearances is 0 or less is calculated. Specifically, in FIG. 13B, the horizontal axis is the value of the item of 0, so “0.02”.

そして、特徴スコア算出部１２０は、図１４に示すように、上記算出した累積確率Ｈ１及びＨ２を用いて、標準正規分布の累積分布関数の逆関数により特徴スコアＺを算出する。図１３（ｃ）に示すように、上記Ｈ１を累積確率とする標準正規分布の累積分布関数の逆関数により特徴スコアＺを求める。なお、累積確率としてＨ２を利用する場合には、標準正規分布の累積分布関数の逆関数の返値の符号を負にして特徴スコアＺを求める。この特徴スコアＺを用いて、後述する概念体系更新処理部１３０は、「野球」という概念に対するユーザ興味スコアTotalZを求める。 Then, as shown in FIG. 14, the feature score calculation unit 120 calculates the feature score Z by the inverse function of the standard normal distribution cumulative distribution function using the calculated cumulative probabilities H1 and H2. As shown in FIG. 13C, the feature score Z is obtained by the inverse function of the cumulative distribution function of the standard normal distribution with H1 as the cumulative probability. When H2 is used as the cumulative probability, the feature score Z is obtained with the sign of the return value of the inverse function of the standard normal distribution cumulative distribution function being negative. Using this feature score Z, a concept system update processing unit 130 described later calculates a user interest score TotalZ for the concept of “baseball”.

特徴スコア算出部１２０は、更新対象概念リストを生成し、概念体系更新処理部１３０に出力する。「更新対象概念リスト」とは、概念ＩＤ、前記で算出した特徴スコアＺ、及び重みｗのセットである。なお、この更新対象概念リストに出現する概念ＩＤが、次の概念体系更新処理で更新対象のノード（概念）となる。上位概念を追加した出現概念リスト内の概念ＩＤすべてについて、特徴スコアＺと重みｗを算出する。重みｗは、各クラスタＩＤにおいて概念毎に設定される値である。
なお、重みｗは、初期値ｗ＝１とし、ユーザの特徴的な操作等が有った場合に、以下のように値を変化させることができる。例えば、クライアント端末２００において、ユーザに提示されたコンテンツについて、ユーザは、お気に入りコンテンツとして登録や、他ユーザへのお勧め、又はコンテンツへの評価入力ができる。 The feature score calculation unit 120 generates an update target concept list and outputs it to the concept system update processing unit 130. The “update target concept list” is a set of a concept ID, the characteristic score Z calculated above, and a weight w. The concept ID appearing in the update target concept list becomes a node (concept) to be updated in the next concept system update process. The feature score Z and the weight w are calculated for all the concept IDs in the appearance concept list to which the superordinate concept is added. The weight w is a value set for each concept in each cluster ID.
Note that the weight w can be changed as follows when the initial value w = 1 and a user's characteristic operation is performed. For example, with respect to the content presented to the user at the client terminal 200, the user can register as a favorite content, recommend to other users, or input an evaluation for the content.

クライアント端末２００が、このような閲覧操作以外の操作履歴を興味分析装置１００に送信できる場合には、以下の処理を行う。
すなわち、特徴スコア算出部１２０は、例えばコンテンツがお気に入りに登録されたとき、そのコンテンツが含む全ての概念ＩＤについて重みｗをｗ＝１．５のように増加させる。その他にも、コンテンツ閲覧時刻、閲覧時の天気、気温、湿度、季節、曜日、休日、余暇かどうか、閲覧時のユーザ位置情報、スケジューラ、日記等から収集したイベント情報に応じて重みｗの値を変えることもできる。 When the client terminal 200 can transmit an operation history other than the browsing operation to the interest analysis apparatus 100, the following processing is performed.
That is, for example, when a content is registered as a favorite, the feature score calculation unit 120 increases the weight w such that w = 1.5 for all concept IDs included in the content. In addition to the content browsing time, browsing weather, temperature, humidity, season, day of the week, holiday, leisure time, user location information at browsing, scheduler, diary, etc., the value of weight w Can also be changed.

（５−６）概念体系更新処理部１３０
図１５に、概念体系更新処理部１３０の処理フローを示す。概念体系更新処理部１３０には、特徴スコア算出部１２０から、クライアント端末ＩＤ（もしくはユーザＩＤ）及び更新対象概念リスト｛クラスタＩＤ，｛概念ＩＤ，特徴スコア＝Ｚ，重み＝ｗ｝，…｝が入力される。 (5-6) Conceptual system update processing unit 130
FIG. 15 shows a processing flow of the concept system update processing unit 130. The concept system update processing unit 130 receives from the feature score calculation unit 120 the client terminal ID (or user ID) and the update target concept list {cluster ID, {concept ID, feature score = Z, weight = w},. Entered.

（ステップＳ１６：概念ノード値更新）
概念体系更新処理部１３０は、「更新対象概念リスト」の各概念ＩＤのノード値を更新する。図１６に概念体系更新処理部１３０の処理の詳細を示す。概念体系更新処理部１３０は、コンテンツに出現した概念（出現概念）、及びこの出現概念の上位概念の概念ＩＤについて、図１６に示す各概念ｉに対するユーザ興味スコア更新式を用いて、ユーザ興味スコアTotalZ_ｉｎ，及びＸ_{ｉ（ｎ−１）}，Ｙ_{ｉ（ｎ−１）}の値を求め、図９の概念体系／ユーザ興味スコアデータベース１４０の、クラスタＩＤに対応する図１１のステップＳ１２に入力されたユーザＩＤ（クライアント端末ＩＤ）のカラムに対応するユーザ興味スコアテーブルに格納されている各値を更新する。 (Step S16: Concept node value update)
The concept system update processing unit 130 updates the node value of each concept ID in the “update target concept list”. FIG. 16 shows details of the processing of the conceptual system update processing unit 130. The concept system update processing unit 130 uses the user interest score update formula for each concept i shown in FIG. 16 for the concept that appeared in the content (appearance concept) and the concept ID of the superordinate concept of this appearance concept, and the user interest score. TotalZ _in , X _{i (n−1)} , Y _{i (n−1)} values are obtained and input to step S 12 of FIG. 11 corresponding to the cluster ID in the conceptual system / user interest score database 140 of FIG. Each value stored in the user interest score table corresponding to the user ID (client terminal ID) column is updated.

ここで、Ｘ_{ｉ（ｎ−１）}は、各概念ＩＤ（ここでは識別子ｉで表現）に対する、過去の（前回までの）前記更新対象概念リストの重みｗの二乗の合計である。Ｙ_{ｉ（ｎ−１）}は、同様に各概念ＩＤ（ここでは識別子ｉで表現）に対する、過去の前記更新対象概念リストの重みｗと特徴スコアＺの乗算の合計である。 Here, X _{i (n−1)} is the sum of the squares of the weights w of the update target concept list in the past (up to the previous time) for each concept ID (represented by the identifier i here). Similarly, Y _{i (n−1)} is the sum of multiplication of the weight w of the past update target concept list and the feature score Z for each concept ID (represented by identifier i here).

この、Ｘ，Ｙはユーザ興味スコア（TotalZ）計算過程における中間結果を保持することとなり、省メモリ／ストレージを優先させる場合、最低限では各ノードの変数としてTotalZ，Ｘ，Ｙの３つの実数値を保持することで実現可能である。省メモリ／ストレージを優先させない場合は、算出した各概念、各クラスタの特徴スコアＺをすべて保存することとなる。この場合は、Ｘ，Ｙの保存は不要となる。 These X and Y hold intermediate results in the user interest score (TotalZ) calculation process. When giving priority to memory saving / storage, at least three real values of TotalZ, X, and Y are used as variables for each node. It can be realized by holding. When priority is not given to memory saving / storage, all the calculated concept scores and feature scores Z of the respective clusters are stored. In this case, storage of X and Y is not necessary.

図１６において、ｎは、概念体系更新処理が何度目かを示す識別子である。ユーザ興味スコアTotalZを求める一連の処理は、クラスタＩＤ単位で行なわれ、この一連の処理が行なわれる単位を１度と数えるとき、ｎはこの一連の処理が何度目に行なわれたものであるかを示す識別子である。ｉは、概念ＩＤの識別子である。Ｚ_inは、概念ｉの各更新処理に利用するＺ値である。なお、上記Ｚ_ijは一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリスト毎のＺ値であり、Ｚ_ij∈Ｚ_inの関係である。重みｗ_inは、概念ｉの各更新処理に利用する重みである。上記重みｗと同じであり、上記特徴スコア算出部１２０で設定したものと同様である。 In FIG. 16, n is an identifier indicating how many times the concept system update process is performed. A series of processes for obtaining the user interest score TotalZ is performed in units of cluster IDs. When the unit in which this series of processes is performed is counted once, n is the number of times this series of processes has been performed. Is an identifier. i is an identifier of a concept ID. Z _in is a Z value used for each update process of concept i. The Z _ij is a Z value for each of the list browsing content list and the detailed browsing content list, and has a relationship of Z _ij εZ _in . Weight w _in is the weight to be used in each process of updating the concept i. The weight w is the same as that set by the feature score calculation unit 120.

例えば、ｗ_inは、お気に入り登録、他ユーザとの共有等の閲覧以外の特殊な操作をユーザが行った場合、及びコンテンツ閲覧時間（閲覧開始から終了までの間隔）、コンテンツと概念の関連度合い、コンテンツ閲覧時刻、閲覧時の天気・気温・湿度・季節・曜日・休日・余暇かどうか、閲覧時のユーザ位置情報、スケジューラ・日記等から収集したイベント情報に応じて値を変化させる。その他サービス利用者、サービス運用者が特に指定した場合にも変化させる。 For example, w _in the favorite registration, if the user a special operation other than the inspection of the public, such as with the other user has performed, and (interval to the end from the viewing start) content viewing time, related the degree of content and concepts, The value is changed in accordance with the content browsing time, weather / temperature / humidity / season / day of the week / holiday / leisure at the time of browsing, user location information at the time of browsing, event information collected from a scheduler / diary and the like. It is also changed when specified by other service users and service operators.

なお、一定期間過ぎた履歴の影響を低減させるため等のユーザ興味スコアの忘却は、最終更新時から現在の時刻までの時間間隔の閾値を超えた場合に、TotalZ，Ｘ，Ｙをそれぞれ減衰させることで実現する。
減衰の計算式の例を示す。例えば、ｋを減衰率（例えばｋ＝０．８）と設定し、以下のように算出することができる。
TotalZ（減衰後）＝ｋ×TotalZ（現在）
Ｘ（減衰後）＝ｋ²×Ｘ（現在）
Ｙ（減衰後）＝ｋ²×Ｙ（現在）
（ステップＳ１７：下位概念ノード値更新） Note that forgetting the user interest score, such as to reduce the influence of the history after a certain period of time, attenuates TotalZ, X, and Y when the threshold value of the time interval from the last update to the current time is exceeded. It will be realized.
An example of an attenuation calculation formula is shown. For example, k can be set as an attenuation rate (for example, k = 0.8), and can be calculated as follows.
TotalZ (after attenuation) = k x TotalZ (current)
X (after attenuation) = k ² × X (current)
Y (after attenuation) = k ² × Y (current)
(Step S17: Lower concept node value update)

さらに、概念体系更新処理部１３０は、「更新対象概念リスト」の各概念ＩＤ（出現概念及び上位概念）の下位概念を抽出し、下位概念のノード値を更新する。下位概念の抽出では、「更新対象概念リスト」の各概念ＩＤについて、図９に示す概念体系／興味度データベース１４０の概念体系テーブルを参照し、子概念ＩＤリストから概念ＩＤのリストを抽出し、さらに各子概念ＩＤリストの概念ＩＤについて概念体系テーブルを参照して子概念リストを抽出する処理を繰り返す。 Further, the concept system update processing unit 130 extracts the subordinate concept of each concept ID (appearance concept and superordinate concept) in the “update target concept list”, and updates the node value of the subordinate concept. In the extraction of the subordinate concepts, for each concept ID of the “update target concept list”, the concept system table of the concept system / interest degree database 140 shown in FIG. Further, the process of extracting the child concept list with reference to the concept system table for the concept ID of each child concept ID list is repeated.

下位概念の興味度の更新に利用する特徴スコアＺは、例えば、隣接した親ノードのうち特徴スコアの絶対値が最も大きい値を利用、最も近い上位ノードの値を利用、親ノードの値を平均、または確率結合した値とする。なお、「更新対象概念リスト」のうち、上記ステップＳ１６で更新済みの概念（コンテンツに出現した概念、及び上位概念）のユーザ興味スコアは更新しない。 The feature score Z used to update the interest level of the lower concept is, for example, the value having the largest absolute value of the feature score among the adjacent parent nodes, the value of the closest higher node, the average of the values of the parent nodes Or a probability-coupled value. In the “update target concept list”, the user interest scores of the concepts updated in step S16 (concepts that appear in the content and higher-level concepts) are not updated.

（５−７）コンテンツ評価処理部１７０
図１７はコンテンツ評価処理部１７０の処理手順と処理内容を示すフローチャートである。
コンテンツ評価処理部１７０には、コンテンツサーバ３００のコンテンツ要求転送部３６０からの通知を入力として、コンテンツデータベース１６０のコンテンツテーブルから図７に示すような形式の提示コンテンツリストを読み出して以下のコンテンツ評価処理を行う。 (5-7) Content evaluation processing unit 170
FIG. 17 is a flowchart showing the processing procedure and processing contents of the content evaluation processing unit 170.
The content evaluation processing unit 170 receives the notification from the content request transfer unit 360 of the content server 300, reads a presentation content list in the format shown in FIG. 7 from the content table of the content database 160, and performs the following content evaluation processing I do.

すなわち、コンテンツ要求転送部３６０からはクライアント端末ＩＤ（もしくはユーザＩＤ）を含む、図２０に示すようなコンテンツ要求データを受信する。また、上記提示コンテンツリストについては、サービス運用者もしくはサービス利用者（クライアント端末利用者）の事前設定により、過去何日以内に登録されたコンテンツのみを評価対象とするか（提示コンテンツリストに含めるか）を設定することができる。 That is, content request data as shown in FIG. 20 including the client terminal ID (or user ID) is received from the content request transfer unit 360. In addition, with regard to the above-mentioned presented content list, whether the content registered within the past number of days is to be evaluated based on the prior setting of the service operator or service user (client terminal user) (whether it is included in the presented content list) ) Can be set.

（ステップＳ２１：分析対象概念フィルタリング）
分析対象概念フィルタリングでは、サービス運用者又はサービス利用者が、事前設定や手動等で特に分析対象の概念ＩＤを指定した場合は、概念体系／興味度データベース１４０を参照し、指定された分析対象の概念ＩＤおよび下位の概念ＩＤのみを評価対象とする。コンテンツ評価処理部１７０は、入力された提示コンテンツリストが保持する概念ＩＤについて、事前にサービス運用者又はサービス利用者が設定した条件にしたがって分析対象外とし、「フィルタリング済みコンテンツリスト」を生成する。 (Step S21: Analysis object concept filtering)
In analysis target concept filtering, when a service operator or a service user designates a concept ID of an analysis target particularly in advance or manually, the concept system / interest degree database 140 is referred to, and the specified analysis target Only concept IDs and subordinate concept IDs are evaluated. The content evaluation processing unit 170 excludes the concept ID held in the input presentation content list from the analysis target according to the conditions set in advance by the service operator or the service user, and generates a “filtered content list”.

例えば、ユーザが、野球に関するコンテンツのレコメンドを求めた場合には、図９に示す概念体系テーブルを参照し、野球に対応する概念ＩＤの下位概念のみを分析対象とする。「フィルタリング済みコンテンツリスト」とは上記処理によって、各コンテンツＩＤに紐付けされている概念ＩＤを事前にサービス運用者又はユーザが設定した条件にしたがって削除したコンテンツリストである。「フィルタリング済みコンテンツリスト」は、上記図７の提示コンテンツリストと同じデータ構成である。 For example, when the user requests a recommendation for content related to baseball, the concept system table shown in FIG. 9 is referred to, and only the subordinate concepts of the concept ID corresponding to baseball are analyzed. The “filtered content list” is a content list obtained by deleting the concept ID associated with each content ID according to the conditions set in advance by the service operator or the user by the above processing. The “filtered content list” has the same data configuration as the presented content list of FIG.

（ステップＳ２２：スコア評価）
コンテンツ評価処理部１７０は、「フィルタリング済みコンテンツリスト」に含まれるコンテンツの評価スコアを算出し、図１８に示すようなコンテンツスコアリストを生成する。コンテンツスコアリストは、コンテンツＩＤ、評価スコア、コンテンツ本体及びコンテンツ登録時刻を有する。 (Step S22: Score evaluation)
The content evaluation processing unit 170 calculates the evaluation score of the content included in the “filtered content list” and generates a content score list as shown in FIG. The content score list has a content ID, an evaluation score, a content body, and a content registration time.

図１９に評価スコアの算出方法の一例を示す。例えば、図１９に示すコンテンツ評価式により、コンテンツｘに対する評価スコアＥｎｔｉｔｙＺ_xを概念ｉのユーザ興味スコアTotalZ_i、コンテンツｘと概念ｉとの関連度ｗ_i（もしくは、概念ｉの重要度）、及びコンテンツｘに出現する概念ＩＤの集合ｐを用いて算出することができる。なお、概念の識別子ｉは集合ｐ内の概念ＩＤに対応する。 FIG. 19 shows an example of a method for calculating the evaluation score. For example, according to the content evaluation formula shown in FIG. 19, the evaluation score EntityZ _x for the content x is the user interest score TotalZ _{i of} the concept i, the relevance w _i between the content x and the concept _i (or the importance of the concept i), and It can be calculated using a set p of concept IDs appearing in the content x. The concept identifier i corresponds to the concept ID in the set p.

図１９の算出で利用するユーザ興味スコア（TotalZ）は、各コンテンツに関連した概念ＩＤについて、概念体系／ユーザ興味スコアデータベース１４０のユーザ興味スコアテーブル（図９）から、クライアント端末ＩＤ（もしくは、ユーザＩＤ）をもとに読み出し利用する。図１９において、概念Ｋ、概念Ｂ及び概念Ｄが出現するコンテンツ１を評価コンテンツとした場合、概念Ｋ、概念Ｂ及び概念ＤのTotalZ，ｗを利用して評価スコアＥｎｔｉｔｙＺ_x ＝０．１８と算出できる。一方、概念Ｂのみが出現するコンテンツ２を評価コンテンツとした場合、概念ＢのTotalZ，ｗを利用して評価スコアＥｎｔｉｔｙＺ_x ＝−０．３と算出できる。評価スコアＥｎｔｉｔｙＺ_xの値が大きいコンテンツ１が優先して表示される。 The user interest score (TotalZ) used in the calculation of FIG. 19 is the client terminal ID (or user) from the user interest score table (FIG. 9) of the concept system / user interest score database 140 for the concept ID related to each content. Read out based on (ID). In FIG. 19, when the content 1 in which the concept K, the concept B, and the concept D appear is the evaluation content, the evaluation score EntityZ _x = 0.18 is calculated using the TotalZ, w of the concept K, the concept B, and the concept D. it can. On the other hand, when the content 2 in which only the concept B appears is the evaluation content, the evaluation score EntityZ _x = −0.3 can be calculated using the TotalZ, w of the concept B. The content 1 having a large evaluation score EntityZ _x is displayed with priority.

その他にも、評価スコアＥｎｔｉｔｙＺ_xは、以下の変形例１〜３の方法により求めることができる。
変形例１としては、ＥｎｔｉｔｙＺ_x=ＭＡＸ（TotalZ_i＊ｗ_i）により求める。ＭＡＸ（TotalZ_i＊ｗ_i ）は、ｉ∈ｐのTotalZ_i＊ｗ_iの最大値を返す関数とする。 In addition, the evaluation score EntityZ _x can be obtained by the method of Modifications 1 to 3 below.
As a first modification, it is obtained by EntityZ _x = MAX (TotalZ _i * w _i ). MAX (TotalZ _i * w _i ) is a function that returns the maximum value of TotalZ _i * w _i for i∈p.

変形例２としては、ＥｎｔｉｔｙＺ_xの値は、ＭＡＸ（TotalZ_i＊ｗ_i）の値が閾値を超えた場合には、ＭＡＸ（TotalZ_i＊ｗ_i ）の返り値とする。ＭＡＸ（TotalZ_i＊ｗ_i）はｉ∈ｐのTotalZ_i＊ｗ_iの最大値を返す関数とする。閾値を超えない場合は、図１９のコンテンツ評価式の結果をＥｎｔｉｔｙＺ_x とする。ＭＡＸ（）は、はｉ∈ｐのTotalZ_i＊ｗ_iで最大値を返す関数とする。閾値はサービス運用者が設定する値とする。 The second modification, the value of EntityZ _x When the value of the MAX (TotalZ _{_i} * w _i) exceeds the threshold value, the return value of _{_{MAX (TotalZ i * w i)}} . MAX (TotalZ _i * w _i ) is a function that returns the maximum value of TotalZ _i * w _i with i∈p. It does not exceed the threshold, and EntityZ _x the resulting content evaluation formula of Figure 19. MAX () is a function that returns the maximum value with TotalZ _i * w _i of i∈p. The threshold is a value set by the service operator.

変形例３としては、TotalZ_iが正の値のｉ∈ｐについてのみ取り出し、図１９のコンテンツ評価式で統合した値をＥｎｔｉｔｙＺ_xとする。
（ステップＳ２３：コンテンツをスコア順にソート）
コンテンツ評価処理部１７０は、コンテンツスコアリストに含まれるコンテンツを評価スコアＥｎｔｉｔｙＺ_xの降順にソートし、ソート済みコンテンツスコアリストをソート済コンテンツスコアリスト送信部１８０に出力する。 As a third modification example, only a value of iεp having a positive TotalZ _i is extracted, and a value integrated by the content evaluation formula of FIG. 19 is defined as EntityZ _x .
(Step S23: Sort contents in score order)
The content evaluation processing unit 170 sorts the content included in the content score list in descending order of the evaluation score EntityZ _x and outputs the sorted content score list to the sorted content score list transmission unit 180.

（５−８）ソート済みコンテンツスコアリスト送信部１８０
ソート済みコンテンツスコアリスト送信部１８０は、コンテンツ評価処理部１７０から入力されるソート済みコンテンツスコアリストとクライアント端末ＩＤ（もしくはユーザＩＤ）を、通信ネットワークを介してコンテンツサーバ３００に送信する。 (5-8) Sorted content score list transmission unit 180
The sorted content score list transmission unit 180 transmits the sorted content score list and client terminal ID (or user ID) input from the content evaluation processing unit 170 to the content server 300 via the communication network.

（５−９）属性興味スコア更新部１９５
属性興味スコア更新部１９５は、ユーザの属性毎に当該属性のモデルを、当該属性に合致する他のユーザのモデルを用いて作成する。
図２１は、興味分析装置１００による、属性に関するモデルを作成する処理の概要を示すもので、属性Ａのモデルを、当該属性Ａに合致するユーザＢ、ユーザＣのモデルを用いて作成する場合について例示したものである。属性興味スコア更新起動部１９０は、深夜時間帯などの興味分析装置１００の処理負荷が少ない時間帯に定期的に処理を起動する。処理が起動されると属性興味スコア更新部１９５は、先ずユーザ／属性情報データベース１９８に保存されている属性情報を用いて、属性Ａに合致するユーザ群、例えばユーザＢ及びユーザＣを特定する。次に、この特定したユーザのＸ，Ｙをもとに、図３２に示した式を用いて属性ＡのＸ，Ｙ，TotalZを算出する。そして、この属性ＡのＸ，Ｙ，TotalZの算出結果を属性興味スコアデータベース１９６に保存する。 (5-9) Attribute Interest Score Update Unit 195
The attribute interest score update unit 195 creates a model of the attribute for each user attribute using a model of another user that matches the attribute.
FIG. 21 shows an outline of processing for creating a model related to an attribute by the interest analysis apparatus 100, and a case where a model of attribute A is created using models of user B and user C that match the attribute A is shown. This is just an example. The attribute interest score update activation unit 190 periodically activates processing in a time zone where the processing load of the interest analysis device 100 is low, such as a midnight time zone. When the process is started, the attribute interest score update unit 195 first identifies a user group that matches the attribute A, for example, the user B and the user C, using the attribute information stored in the user / attribute information database 198. Next, X, Y, and TotalZ of the attribute A are calculated using the formula shown in FIG. 32 based on the identified X and Y of the user. Then, the calculation result of X, Y, and TotalZ of the attribute A is stored in the attribute interest score database 196.

図２２は、興味分析装置１００による、下位の属性のモデルを用いて上位の属性のモデルを作成する場合の処理の概要を示したもので、属性Ａのモデルを、当該属性Ａの子属性である属性Ｂ、属性Ｃのモデルを用いて作成する場合について例示している。属性興味スコア更新起動部１９０は、先に述べたように深夜時間帯などの装置の処理負荷が少ない時間帯に定期的に処理を起動する。処理が起動すると属性興味スコア更新部１９５は、ユーザ／属性情報データベース１９８に保存されている属性情報を用いて、属性Ａの子属性群、例えば属性Ｂ及び属性Ｃを特定する。次に、この特定した属性のＸ，Ｙをもとに図３４に示す式を用い、属性ＡのＸ，Ｙ，TotalZを算出する。そして、この属性ＡのＸ，Ｙ，TotalZの算出結果を属性興味スコアデータベース１９６に保存する。 FIG. 22 shows an overview of processing when the interest analysis apparatus 100 creates a higher-level attribute model using a lower-level attribute model. The attribute A model is represented by a child attribute of the attribute A. The case where it creates using the model of a certain attribute B and attribute C is illustrated. As described above, the attribute interest score update activation unit 190 periodically activates processing in a time zone where the processing load of the apparatus is low, such as a midnight time zone. When the process is started, the attribute interest score update unit 195 specifies a child attribute group of the attribute A, for example, the attribute B and the attribute C, using the attribute information stored in the user / attribute information database 198. Next, X, Y, and TotalZ of attribute A are calculated using the formula shown in FIG. 34 based on the specified attributes X and Y. Then, the calculation result of X, Y, and TotalZ of the attribute A is stored in the attribute interest score database 196.

図２３は、興味分析装置１００による、属性のモデルを用いた新規ユーザ（初期利用ユーザ）の初期モデルの作成に関する処理の概要を示したものである。
同図において、新規ユーザは自身のクライアント端末２００において、自らの属性と各属性を確率結合する比率Ｗを入力する。そうすると、クライアント端末２００の属性提示部２５０により、上記入力されたユーザ自身の属性と確率結合比率Ｗを表す情報が、コンテンツサーバ３００の属性転送部３７０を経由して興味分析装置１００に転送される。 FIG. 23 shows an outline of processing related to creation of an initial model of a new user (initial use user) using an attribute model by the interest analysis apparatus 100.
In the figure, a new user inputs his / her own attribute and a ratio W for probabilistically connecting each attribute in his / her client terminal 200. Then, the attribute presentation unit 250 of the client terminal 200 transfers the input information representing the user's own attribute and the probability coupling ratio W to the interest analysis device 100 via the attribute transfer unit 370 of the content server 300. .

興味分析装置１００では、初期ユーザ興味スコア作成部１９７が上記転送された属性に合致する属性のモデルを属性興味スコアデータベース１９６から読み出す。そして、図２９に示す式を用いて確率結合して初期利用ユーザに関する初期の興味スコアTotalZを計算し、この計算された初期の興味スコアTotalZを上記初期利用ユーザの初期モデルとして概念体系／ユーザ興味スコアデータベース１４０に記憶させる。 In the interest analysis apparatus 100, the initial user interest score creation unit 197 reads an attribute model that matches the transferred attribute from the attribute interest score database 196. Then, the initial interest score TotalZ for the initial use user is calculated by stochastic combination using the equation shown in FIG. 29, and the calculated initial interest score TotalZ is used as the initial model of the initial use user as a conceptual system / user interest. Store in the score database 140.

（６）実施形態の作用効果
以上詳述したように一実施形態によれば、以下のような作用効果が奏せられる。
(1) ユーザの選択候補となる一覧リストを定義し、そこからのコンテンツ選択における概念の出現数を分析することで、各概念の出現の希少性を考慮し、且つ一覧から選ばれない概念の履歴特徴を利用することができるため、ユーザの興味を高精度に推定することが可能となる。 (6) Effects of Embodiments As described above in detail, according to one embodiment, the following functions and effects can be achieved.
(1) By defining a list that can be selected by the user and analyzing the number of occurrences of the concept in content selection from that list, the rareness of each concept can be considered and the concept not selected from the list Since the history feature can be used, it is possible to estimate the user's interest with high accuracy.

(2) さらに、上記ステップＳ１５、ステップＳ１６に示したように、特徴スコアの算出やステユーザ興味スコアの算出に際し、閲覧時のユーザの状況や閲覧操作の特徴（お気に入り登録、長時間閲覧等）などを重み係数（重みｗ）を介して反映することができるため、ユーザ興味スコアをさらに精度良く求めることが可能となる。 (2) Further, as shown in steps S15 and S16 above, when calculating the feature score and the steer user interest score, the situation of the user at the time of browsing, the characteristics of the browsing operation (favorite registration, long-time browsing, etc.), etc. Can be reflected via the weighting coefficient (weight w), so that the user interest score can be obtained with higher accuracy.

(3) また、タクソノミ（オントロジ）等で定義された概念をメタタグとして付与したコンテンツ閲覧履歴分析において、概念出現の希少性を合理的に分析に反映することが難しかったため、従来はタクソノミ（オントロジ）構造の深さを一定する等によりコンテンツに付与する概念の抽象度を統一する等のオントロジ構造側の調整が必要があった。しかし、本実施形態では概念出現の希少性を考慮するオントロジ構造によるユーザ興味スコアの更新処理により上位概念が付与されたコンテンツと、下位概念が付与されたコンテンツの閲覧履歴を統合して分析可能となる。このため、分析に利用するタクソノミ（オントロジ）への制約低減し、タクソノミ（オントロジ）の維持・運用・管理コストを低減することが可能となる。
(4) さらに、ユーザ興味スコアを用いてコンテンツに対するユーザの評価スコアを算出することで、ユーザの興味に合ったコンテンツを推薦することが可能となる。 (3) In addition, in the content browsing history analysis where the concept defined by taxonomy (ontology) etc. was added as a meta tag, it was difficult to rationally reflect the rareness of concept appearance in the analysis. It was necessary to make adjustments on the ontology structure side, such as unifying the abstraction level of the concept to be given to the content by fixing the depth of the structure. However, in this embodiment, it is possible to analyze by integrating the content given the higher concept by the update process of the user interest score by the ontology structure considering the rareness of concept appearance and the browsing history of the content given the lower concept Become. For this reason, it is possible to reduce the restrictions on the taxonomy (ontology) used for the analysis and to reduce the maintenance, operation, and management costs of the taxonomy (ontology).
(4) Furthermore, by calculating the user's evaluation score for content using the user interest score, it is possible to recommend content that matches the user's interest.

(5) また、既存ユーザのモデルを用いて属性に関するモデルを作成し、新規ユーザ（初期利用ユーザ）が提示した自身の属性に合致したモデルを当該新規ユーザの初期モデルとして用いている。このため、ユーザが初めて情報推薦サービスを利用する場合であっても、比較的良い推薦結果を得ることができる。しかも、上記属性に関するモデルを既存ユーザのモデルを用いて確率結合することにより作成しているので、少ない計算量で属性毎のモデルを作成することができる。 (5) In addition, a model related to an attribute is created using a model of an existing user, and a model that matches the own attribute presented by the new user (initial use user) is used as the initial model of the new user. For this reason, even when the user uses the information recommendation service for the first time, a relatively good recommendation result can be obtained. In addition, since the model relating to the attribute is created by stochastic coupling using a model of an existing user, a model for each attribute can be created with a small amount of calculation.

(6) さらに、確率結合処理は装置の処理負荷が少ない深夜時間帯などを選んで実施しているので、装置の処理負荷の増加を招かずに実施できる利点がある。
(7) さらに、属性が階層構造を持つ場合に、下位の属性のモデルを用いて上位の属性のモデルを作成することにより、少ない計算量で階層構造を持つ属性のモデルを作成することができる。 (6) Furthermore, since the probability coupling process is performed by selecting a midnight time zone where the processing load of the apparatus is low, there is an advantage that it can be performed without increasing the processing load of the apparatus.
(7) Furthermore, when an attribute has a hierarchical structure, an attribute model having a hierarchical structure can be created with a small amount of calculation by creating a higher attribute model using a lower attribute model. .

［他の実施形態］
なお、この発明は上記実施形態に限定されるものではなく、例えば興味分析装置の構成、属性毎のモデル作成処理及び初期モデルの作成処理の処理手順と処理内容、属性の種類等についても、この発明の要旨を逸脱しない範囲で種々変形して実施可能である。 [Other Embodiments]
The present invention is not limited to the above embodiment. For example, the configuration of the interest analysis apparatus, the model creation process for each attribute and the processing procedure and processing contents of the initial model creation process, the type of attribute, etc. Various modifications can be made without departing from the scope of the invention.

要するにこの発明は、上記実施形態そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、上記実施形態に開示されている複数の構成要素の適宜な組み合せにより種々の発明を形成できる。例えば、実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。さらに、異なる実施形態に亘る構成要素を適宜組み合せてもよい。 In short, the present invention is not limited to the above-described embodiment as it is, and can be embodied by modifying the constituent elements without departing from the scope of the invention in the implementation stage. Further, various inventions can be formed by appropriately combining a plurality of constituent elements disclosed in the embodiment. For example, some components may be deleted from all the components shown in the embodiment. Furthermore, you may combine suitably the component covering different embodiment.

１００…興味分析装置、２００…クライアント端末、３００…コンテンツサーバ、１１０…履歴情報受信部、１２０…特徴スコア算出部、１３０…概念体系更新処理部、１４０…概念体系／ユーザ興味スコアデータベース、１５０…提示コンテンツリスト受信部、１６０…コンテンツデータベース、１７０…コンテンツ評価処理部、１８０…ソート済みコンテンツスコアリスト送信部、１９０…属性興味スコア更新起動部、１９５…属性興味スコア更新部、１９６…属性興味スコアデータベース、１９７…初期ユーザ興味スコア作成部、１９８…ユーザ／属性情報データベース、２１０…履歴収集部、２２０…履歴情報送信部、２３０…コンテンツ提示部、２４０…コンテンツ要求送信部、２５０…属性提示部、３１０…コンテンツ送信処理部、３２０…ソート済み提示コンテンツリスト受信部、３３０…提示コンテンツリスト送信部、３４０…提示コンテンツリスト入力部、３５０…履歴情報転送部、３６０…コンテンツ要求転送部、３７０…属性転送部。 DESCRIPTION OF SYMBOLS 100 ... Interest analysis apparatus, 200 ... Client terminal, 300 ... Content server, 110 ... History information receiving part, 120 ... Feature score calculation part, 130 ... Concept system update processing part, 140 ... Concept system / user interest score database, 150 ... Presented content list receiving unit, 160 ... content database, 170 ... content evaluation processing unit, 180 ... sorted content score list transmitting unit, 190 ... attribute interest score update starting unit, 195 ... attribute interest score updating unit, 196 ... attribute interest score Database, 197 ... initial user interest score creation section, 198 ... user / attribute information database, 210 ... history collection section, 220 ... history information transmission section, 230 ... content presentation section, 240 ... content request transmission section, 250 ... attribute presentation section 310 ... Content transmission processing , 320 ... it sorted provided content list receiving section, 330 ... provided content list transmitting section, 340 ... provided content list input unit, 350 ... History information transfer section, 360 ... content request transfer unit, 370 ... attribute transfer unit.

Claims

The computer uses the information representing the scarcity of concept appearances to estimate the interests of information users, and uses the estimation results to create a model that represents the degree of interest in a given concept for each information user. A method for analyzing the interest of each information user using a created model,
A first step of creating a model for each attribute of the existing information user based on information representing a content selection history by the information user and storing the model in a first storage unit;
A second process of acquiring attribute information of a new information user;
Based on the acquired attribute information, an existing information user attribute model that matches the attribute of the new information user is selectively read out from the first storage unit, and the read model is A model that represents the degree of interest in the predetermined concept of the new information user is created, and the created model is stored in a second storage unit as an initial model of the new information user. 3 process,
And a fourth step of analyzing the interest of the new information user using the model stored in the second storage unit as an initial model.

In the case where the attribute of the existing information user has a first hierarchy corresponding to a lower concept and a second hierarchy corresponding to a concept higher than the first hierarchy, the first process includes: A model of a subordinate concept corresponding to the first hierarchy is created, and a model of a superordinate concept corresponding to the second hierarchy is created based on the model of the subordinate concept created for the first hierarchy. The interest analysis method according to claim 1.

The first step sets a threshold value or a time zone in which the processing load of the computer is equal to or less than a preset value, and when the processing load becomes equal to or less than the set threshold value or the set time zone The interest analysis method according to claim 1 or 2, wherein a process of creating a model for each attribute is executed when the value becomes.

The information that represents the scarcity of concept appearance is used to estimate the interest of the information user, and the estimation result is used to create a model that represents the degree of interest in a given concept for each information user. An interest analysis device that analyzes the interest of each information user using a model,
Means for creating a model for each attribute of the existing information user based on the information representing the content selection history by the information user;
First storage means for storing the created model for each attribute of the existing information user;
Means for acquiring attribute information of a new information user;
Based on the acquired attribute information, a model of an existing information user attribute that matches the attribute of the new information user is selectively read out from the first storage means, and the read model is Means for creating a model that represents the degree of interest in the predetermined concept of the new information user based on;
Second storage means for storing the created model relating to the new information user as an initial model of the new information user;
An interest analysis apparatus comprising: means for analyzing the interest of the new information user based on the initial model stored in the second storage means.

The means for creating a model for each attribute has a first hierarchy in which the attributes of the existing information user correspond to a lower concept and a second hierarchy corresponding to a concept higher than the first hierarchy. Then, a lower concept model corresponding to the first hierarchy is created, and a higher concept model corresponding to the second hierarchy is created based on the lower concept model created for the first hierarchy. The interest analysis apparatus according to claim 4, wherein:

When the processing load of the interest analysis device is set to a threshold value or a time zone where the processing load is equal to or less than a preset value, and when the processing load is equal to or less than the set threshold value or when the set time zone is reached 6. The interest analysis apparatus according to claim 4, further comprising means for causing the means for creating a model for each attribute to execute a process for creating the model for each attribute.

The program which makes the said computer perform the process corresponding to the process with which the interest analysis method in any one of Claims 1 thru | or 3 is provided.