JP2011141666A

JP2011141666A - Device, method and program for recommending interest information

Info

Publication number: JP2011141666A
Application number: JP2010001255A
Authority: JP
Inventors: Makoto Nakatsuji; 真中辻; Akimichi Tanaka; 明通田中; Masashi Uchiyama; 匡内山
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2010-01-06
Filing date: 2010-01-06
Publication date: 2011-07-21
Anticipated expiration: 2030-01-06
Also published as: JP5320307B2

Abstract

PROBLEM TO BE SOLVED: To provide an interest information recommendation device clearly indicating a cluster from which a content recommended to an active user a is found. SOLUTION: The interest information recommendation device includes: a user interest similarity measurement instrument 10 measuring interest similarity of each user based on taxonomy in consideration of a class including a word inside interest information of each user, an interest degree of each instance, and topology that is a connection form of the class; a user interest correlation graph generator 20 setting each user as a node based on the user interest similarity, setting an edge having a weight based on the similarity as a score between the users, and generating a user interest correlation graph; a recommendation score calculator 30 randomly walking on the graph, and calculating a recommendation score of an interest target content; and a belonging cluster specification device 40 of a recommendation item clustering the graph, and creating a recommendation list including the recommendation item and background information of the item based on a result thereof. COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、協調フィルタリングに基づくコンテンツ推薦、その中でも、特にグラフベースの協調フィルタリングに関し、多様な興味を持つユーザヘの推薦精度向上のための、タクソノミに基づくユーザ興味マッチングアルゴリズムに関する。 The present invention relates to content recommendation based on collaborative filtering, and in particular to graph-based collaborative filtering, and more particularly to a user interest matching algorithm based on taxonomies to improve recommendation accuracy for users with various interests.

これまでのグラフベースの協調フィルタリング技術では、非特許文献１が知られている。これは、主に、ｓｐａｒｓｉｔｙｐｒｏｂｌｅｍ（スパース問題）という、ユーザのコンテンツ消費（利用）が少ない場合に、あまり良い推薦ができない、という問題に対し、直接的に近いユーザのみでなく、ユーザグラフ上で少し遠くにいるユーザの興味コンテンツまで追いかけて、推薦に用いることで、上記ｓｐａｒｓｉｔｙｐｒｏｂｌｅｍを改善している。 Non-patent document 1 is known as a conventional graph-based collaborative filtering technique. This is mainly due to the problem of sparseness problem (sparse problem) that the user can not make a good recommendation when content consumption (use) is small. By pursuing the content of interest of the user who is a little far away and using it for recommendation, the parity problem is improved.

それと同様のアプローチとして、近年、ランダムウォークを用いた手法（非特許文献２参照）がよく利用されるようになってきている。その手法では、アイテムの類似度に基づくアイテムグラフ（アイテムをノード、類似するアイテム間にエッジを持つ。エッジには類似度に基づくスコアを与える。）を最初に構築し、アクティブユーザａの持つ興味アイテムを起点とし、グラフ上をランダムウォークさせ、たどったユーザの持つ興味アイテムの出現回数と、エッジのスコアを基に、アイテムの推薦値を計算し、推薦値に基づきアイテムをランキングし、ユーザに表示する。 As a similar approach, a technique using a random walk (see Non-Patent Document 2) has been frequently used in recent years. In that method, an item graph based on item similarity (items are nodes, edges are between similar items. Edges are given a score based on similarity), and the interest of the active user a is first created. Based on the number of appearances of the item of interest that the user has followed and the edge score, the item's recommended value is calculated based on the item as the starting point, and the item's recommended value is ranked based on the recommended value. indicate.

この方法でも、ｓｐａｒｓｉｔｙｐｒｏｂｌｅｍに対応できることがわかっている（この非特許文献２によると、アイテムグラフに対し、手法を適用しているのは、アイテムの方が一般的に情報推薦システムを利用するユーザの数より少ないからである、と指摘している。ただ、現状のシステムでは反対の例も多い）。 It is known that this method can also handle the parity problem (according to this non-patent document 2, the method is applied to the item graph because the item generally uses the information recommendation system. It is pointed out that it is less than the number of (but the current system has many opposite cases).

しかし、上述の手法では、以下の２つの欠点がある。 However, the above-described method has the following two drawbacks.

（１）従来は、ユーザの興味の類似度に基づくユーザグラフの構築においては、ユーザの消費コンテンツのみを利用したユーザ間類似度計算を行っている。それだと、同様のコンテンツを消費（利用）しており、興味がある程度近くても、同一のコンテンツを両者が消費していない限り、類似度が０になることが多くでてくる。そのため、興味がある程度近くてもエッジが引かれない（もしくは、エッジのスコアが０）となる場合が多くなってしまう。そのため、アイテムグラフ上をランダムウォークする際に、興味がある程度近いにもかかわらず、すぐにはたどれないユーザが多く出てしまい、ランキングの精度を下げる。 (1) Conventionally, in the construction of the user graph based on the similarity of the user's interest, the similarity calculation between users using only the user's consumed content is performed. In that case, the same content is consumed (used), and even if the interest is close to some extent, the similarity often becomes 0 unless both the same content is consumed. Therefore, even if the interest is close to some extent, the edge is not drawn (or the edge score is 0) in many cases. Therefore, when a random walk is performed on the item graph, there are many users who do not follow immediately even though their interest is close to some extent, and the accuracy of ranking is lowered.

また、同じ原因に起因する結果として、同一のコンテンツを消費しているということは、かなり類似するユーザということになるため、そうしたユーザから発見できるのは、だいたい同じ傾向のコンテンツのみとなり、幅広い興味を持つユーザヘの推薦の精度が下がる。 In addition, as a result of the same cause, consuming the same content means that the users are quite similar, so only those content with the same tendency can be found from such users, and a wide range of interests can be found. The accuracy of the recommendation to users with is reduced.

まとめると、ユーザの多様な興味を、情報推薦システムが一部しか把握していないとし、さらに、残りの部分が、システムが把握しているユーザ興味と異なっているとすると、推薦がうまくいかなくなる。この問題は、ユーザの回りに、ユーザが興味を持ちうるアイテムを持つユーザが現れてこないという点でｓｐａｒｓｉｔｙｐｒｏｂｌｅｍと非常に近い問題である。 In summary, if the information recommendation system knows only a part of the user's various interests, and the remaining part is different from the user's interest that the system knows, the recommendation will not be successful. . This problem is very similar to the parity problem in that a user having an item that the user may be interested in does not appear around the user.

（２）従来は、ユーザグラフ上を縦横無尽にランダムウォークさせており、たどった先のユーザがアクティブユーザａとどういう関係にあるのかが把握できていない。そのため、どのようなユーザからどのようなコンテンツを発見できたかが、明確につかめないため、見つかったコンテンツの特性をつかみにくい。 (2) Conventionally, the user graph is randomly walked in the vertical and horizontal directions, and it is not possible to grasp the relationship between the followed user and the active user a. For this reason, it is difficult to grasp what kind of content has been found from which user, so it is difficult to grasp the characteristics of the found content.

また、特許文献１には、ユーザの興昧を反映するＷＥＢページとして、ＮＴＴレゾナントのｇｏｏブログやＮＴＴデータのｄｏｂｌｏｇなどのブログエントリは、従来の掲示板と異なり、特定の一人の人物が自身の感じた情報を書き綴っているため、この情報には、個人の興味をあらわす情報も多く含まれていることを利用し、このブログエントリに対し、各サービスプロバイダが提供するタクソノミ上のクラス（ジャンルなど）に分類を行うことで、ユーザの興味をタクソノミに沿って抽出することを可能とする技術が開示されている。 Also, in Patent Document 1, as a WEB page that reflects the user's interest, blog entries such as NTT Resonant's goo blog and NTT data doblog are different from conventional bulletin boards, and a specific individual person feels his own. This information is written and spelled, so this information contains a lot of information that expresses the interests of individuals, and this blog entry has a taxonomy class (genre etc.) provided by each service provider. ), A technique that enables the user's interest to be extracted along the taxonomy is disclosed.

ただしこの特許文献１では、タクソノミに沿ったユーザ興味は必ずしも、ブログから抽出するものに限定するというわけではなく、購買履歴や音楽試聴履歴からも抽出できる。 However, in Patent Document 1, the user interest along the taxonomy is not necessarily limited to that extracted from the blog, but can be extracted from the purchase history and the music audition history.

また、特許文献２には、ユーザの興味をタクソノミに沿って作成する際に、ユーザの興味対象（インスタンス、クラス）への思い入れの強さである興味度をも与える方法が開示されている。 Patent Document 2 discloses a method of giving an interest level, which is a strength of a user's interest in an object of interest (instance, class) when creating the user's interest along a taxonomy.

このような技術背景があり、ユーザ興味をタクソノミに沿って抽出する方法は与えられつつある。 With such a technical background, a method of extracting user interest along a taxonomy is being given.

さらに、特許文献３には、ユーザ間の興味の類似度を、タクソノミに沿ったユーザ興味の類似度から計算する方法が開示されている。 Further, Patent Literature 3 discloses a method of calculating the similarity of interest between users from the similarity of user interest along the taxonomy.

尚、本発明に関連する技術として、非特許文献３〜５に記載のものが公知となっている。 In addition, the thing of a nonpatent literature 3-5 is well-known as a technique relevant to this invention.

特開２００７−２２６４６８号公報JP 2007-226468 A 特開２００８−２７１４２号公報JP 2008-27142 A 特開２００８−２７１４３号公報JP 2008-27143 A

ＺＡＮＨＵＡＮＧ，ＨＳＩＮＣＨＵＮ，ａｎｄＤＡＮＩＥＬＺＥＮＧ，“Ａｐｐｌｙｉｎｇａｓｓｏｃｉａｔｉｖｅｒｅｔｒｉｅｖａｌｔｅｃｈｎｉｑｕｅｓｔｏａｌｌｅｖｉａｔｅｔｈｅｓｐａｒｓｉｔｙｐｒｏｂｌｅｍｉｎｃｏｌｌａｂｏｒａｔｉｖｅｆｉｌｔｅｒｉｎｇ”，ＡＣＭＴｒａｎｓａｃｔｉｏｎｓｏｎＩｎｆｏｒｍａｔｉｏｎＳｙｓｔｅｍｓ，Ｖｏｌ．２２，Ｎ０．１，Ｊａｎｕａｒｙ２００４，ｐｐ．１１６−１４２ZAN HUANG, HSINCHUN, and DANIEL ZENG, “Applying associate retrievable technologies to alleviate the inspirables in the CV”. 22, N0.1, January 2004, pp. 116-142 ＨｉｌｍｉＹｉｌｄｉｒｉｍａｎｄＭｕｋｋａｉＳ．Ｋｒｉｓｈｎａｍｏｏｒｔｈｙ，“ＡＲａｎｄｏｍＷａｌｋＭｅｔｈｏｄｆｏｒＡｌｌｅｖｉａｔｉｎｇｔｈｅＳｐａｒｓｉｔｙＰｒｏｂｌｅｍｉｎＣｏｌｌａｂｏｒａｔｉｖｅＦｉｌｔｅｒｉｎｇ”，ＳＩＧ−ＲＥＣ２００８Hilmi Yildirim and Mukakai S. Krishnamorthy, “A Random Walk Method for Allivating the Sparity Problem in Collaborative Filtering”, SIG-REC2008 ＦａｓｔａｌｇｏｒｉｔｈｍｆｏｒｄｅｔｅｃｔｉｎｇｃｏｍｍｕｎｉｔｙｓｔｒｕｃｔｕｒｅｉｎｎｅｔｗｏｒｋｓＥｘｐｏｒｔｂｙ：Ｍ．Ｅ．Ｊ．Ｎｅｗｍａｎ、インターネット＜ＵＲＬ：ｈｔｔｐ：／／ａｒｘｉｖ．ｏｒｇ／ａｂｓ／ｃｏｎｄ−ｍａｔ／０３０９５０８＞，[平成２１年１２月２２日検索]Fast algorithm for detecting community structure in networks Export by: M.M. E. J. et al. Newman, Internet <URL: http: // arxiv. org / abs / cond-mat / 3095098>, [retrieval on December 22, 2009] 中辻真、三好優、大塚祥広、“ブログデータに基づくユーザの興味オントロジ自動生成とコミュニティ形成支援手法の提案”、ＤＥＷＳ２００６２Ａ−ｉ７、２００６Makoto Nakajo, Yu Miyoshi, Yoshihiro Otsuka, “Proposal of Automatic Generation of User Interest Ontology Based on Blog Data and Supporting Community Formation”, DEWS 2006 2A-i7, 2006 増井俊之、“インターフェイスの街角（９３）−本棚演算”、ＵＮＩＸＭＡＧＡＺＩＮＥ２００５．１２Toshiyuki Masui, “City Corner of Interface (93)-Bookshelf Calculation”, UNIX MAGAZINE 2005.12

本発明では、前記技術背景をふまえ、上述した問題（１）、（２）に対応するべく、ユーザ興味グラフを作成し、そのグラフ上でランダムウォークを行い、また、グラフをクラスタリングし、アクティブユーザａに推薦されるコンテンツがどのようなクラスタから発見されたものであるかを、明示する装置、方法、プログラムを提供することを目的としている。 In the present invention, based on the technical background, a user interest graph is created and a random walk is performed on the graph in order to deal with the problems (1) and (2) described above. The object is to provide an apparatus, a method, and a program that clearly indicate from which cluster the content recommended for a is found.

本発明の特徴は次の２点である。
（１）過去に発明者が提案したタクソノミに沿ったユーザ興味抽出（特許文献１）、類似度計算（特許文献３）を補完する内容。上記手法を、タクソノミに沿ったユーザ興味類似度グラフの生成、およびグラフ上のランダムウォークヘの適用、それによるスパース問題への改善ができるという領域までカバーする構成とした。
（２）ユーザグラフをクラスタリングする新規方法を構築した。さらに、アクティブユーザａに対し推薦されるコンテンツが、グラフのクラスタリング結果を考えにいれると、どのようなクラスタから起因して出てきたものなのか、を明示する方法を構築した。 The features of the present invention are the following two points.
(1) Contents that complement user interest extraction (Patent Document 1) and similarity calculation (Patent Document 3) along the taxonomy proposed by the inventor in the past. The above-described method is configured to cover an area in which the user interest similarity graph along the taxonomy can be generated, the random walk on the graph can be applied, and the sparse problem can be improved thereby.
(2) A new method for clustering user graphs was constructed. Furthermore, a method has been constructed to clearly indicate what kind of cluster the content recommended for the active user a comes from when considering the clustering result of the graph.

すなわち本発明では、コンテンツのみでなく、コンテンツの分類知識であるタクソノミを利用してユーザ間の興味類似度を計算し、ユーザグラフを構成することで、これまでよりも、グラフ上のランダムウォークの精度をあげる。タクソノミを用いることで、同じインスタンスに興味を持つユーザのみでなく、同じクラスに興味を持つユーザ間でも、類似性を判定することができるため、特にＳｐａｒｓｉｔｙ問題に有効となる（問題（１）に対応）。さらに、ユーザグラフをクラスタリングした事前結果を利用し、自身の所属するクラスタからは発見できないけれども、ユーザが興味を持ちうる可能性の高いコンテンツを推薦可能とする（問題（２）に対応）。 In other words, in the present invention, the interest similarity between users is calculated using not only the content but also the taxonomy that is the classification knowledge of the content, and the user graph is constructed. Increase accuracy. By using a taxonomy, similarity can be determined not only for users who are interested in the same instance but also among users who are interested in the same class, so it is particularly effective for the Sparsity problem (problem (1)) Correspondence). Furthermore, using a prior result obtained by clustering the user graph, it is possible to recommend content that is highly likely to be of interest to the user although it cannot be found from the cluster to which the user graph belongs (corresponding to the problem (2)).

本発明の興味情報推薦装置は、各ユーザ毎に構築された興味情報を概念階層化したパーソナルオントロジを生成し、前記興味情報内の単語を含むクラス、インスタンス毎の興味度およびクラスの接続形態であるトポロジを考慮して、タクソノミに基づいて前記ユーザ毎の興味類似度を計測するユーザ興味類似度計測手段と、前記ユーザ興味類似度計測手段により計測されたユーザ毎のユーザ興味類似度に基づいて、ユーザをノードとし、類似度に基づく重みをスコアとしてもつエッジをユーザ間に設定してユーザ興味相関グラフを生成するユーザ興味相関グラフ生成手段と、前記ユーザ興味相関グラフ生成手段により生成されたグラフ上をランダムウォークし、たどったユーザの興味対象コンテンツを推薦対象リストに加え、前記各コンテンツのカウント数と、ユーザ間のエッジのスコアを基に前記興味対象コンテンツの推薦スコアを計算する推薦スコア計算手段と、前記ユーザ興味相関グラフ生成手段により生成されたグラフをクラスタリングし、該クラスタリング結果に基づいて、推薦アイテムおよびそのアイテムの背景情報を含む推薦リストを作成する推薦アイテムの所属クラスタ特定手段と、を備えたことを特徴としている。 An interest information recommendation device according to the present invention generates a personal ontology that conceptually hires interest information constructed for each user, and includes a class including words in the interest information, an interest degree for each instance, and a connection form of classes. Considering a certain topology, based on the user interest similarity measurer that measures the interest similarity for each user based on the taxonomy, and based on the user interest similarity measure for each user measured by the user interest similarity measurer User interest correlation graph generation means for generating a user interest correlation graph by setting an edge having a user as a node and a weight based on similarity as a score between the users, and a graph generated by the user interest correlation graph generation means Randomly walk up and add the content of interest of the user who followed the content to the recommendation target list. Clustering the recommendation score calculation means for calculating the recommendation score of the content of interest based on the count number of the user and the score of the edge between users, and the graph generated by the user interest correlation graph generation means. And a recommended item affiliation cluster specifying means for creating a recommended list including a recommended item and background information of the item.

（１）請求項１〜９に記載の発明によれば、特に、ユーザの興味を、例えば情報推薦システムが十分に把握していない段階において、推薦精度を向上させることができる。 (1) According to the inventions described in claims 1 to 9, the recommendation accuracy can be improved particularly in a stage where the user's interest is not sufficiently grasped by the information recommendation system, for example.

また、推薦アイテムが、あるユーザの所属するクラスタに存在しない（そのユーザにとって今までの興味とは外れたアイテムである）場合であっても、例えばそれがどのクラスタから発見されたかの情報（推薦アイテムの背景情報）とともに提示することができ、ユーザは、どのようなユーザからどのようなコンテンツを発見することができたかを明確につかむことができ、コンテンツの特性をつかみやすい。 Even if a recommended item does not exist in a cluster to which a user belongs (ie, an item that is not of interest to the user so far), for example, information on which cluster it was found from (recommended item) The user can clearly grasp what kind of content has been discovered by which user, and can easily grasp the characteristics of the content.

このため、従来のＡｍａｚｏｎなどで行なわれている協調フィルタリングのように、同様のインスタンスが並んだ推薦リストと異なり、各種別々のクラスタから出てきたインスタンスを、そのクラスタ毎に分離しながら推薦リストに混ぜ込むことにより、多様であるが、ユーザがそれぞれ興味を持つ確率が高いインスタンスの推薦リストを作成し、ユーザに提示することができる。
（２）請求項２、６に記載の発明によれば、タクソノミに沿って親クラス、子クラスから成るトポロジＴ₁を形成する共通クラス集合に対する興味一致度Ｓ（Ｔ₁）と、クラスとインスタンスから成るトポロジＴ₂を形成する共通クラス集合に対する興味一致度Ｓ（Ｔ₂）と、前記両トポロジＴ₁、Ｔ₂に対する重要度に応じた評価関数ｆ（Ｘ）とによってオントロジ間の近似度Ｓｏを求めているので、ユーザの興味を、例えば情報推薦システムが十分に把握していない段階においても、インスタンスのみならずクラスでも繋がるユーザがあればエッジが引かれる可能性があり、推薦誤差を低く抑えることができる。
（３）請求項３、７に記載の発明によれば、エッジを設定するためのユーザの人数Ｘを決定しているので、興味が類似するユーザ集合を正しく求めることが可能なユーザ興味相関グラフを生成することができる。
（４）請求項４、８に記載の発明によれば、今までとは異なり、推薦されてきた情報が、自分の所属するクラスタからのものではない、という意味で、自分の今までの興昧とはずれたインスタンスである、と明示しながら、推薦を行うことができるようになる。 For this reason, unlike the collaborative filtering performed in conventional Amazon, etc., unlike the recommendation list in which similar instances are arranged, the instances coming out of various separate clusters are separated into the recommendation list while being separated for each cluster. By mixing, it is possible to create a recommendation list of a variety of instances that the user has a high probability of being interested in and present to the user.
(2) According to the inventions of claims 2 and 6, the interest coincidence S (T ₁ ) for the common class set forming the topology T ₁ composed of the parent class and the child class along the taxonomy, the class and the instance The degree of approximation So between the ontology is obtained by the interest coincidence S (T ₂ ) for the common class set forming the topology T ₂ consisting of and the evaluation function f (X) according to the importance for the topologies T ₁ and T ₂ . Even if the information recommendation system does not fully grasp the user's interest, for example, if there is a user who connects not only with the instance but also with the class, the edge may be drawn, and the recommendation error is reduced. Can be suppressed.
(3) According to the inventions according to claims 3 and 7, since the number of users X for setting an edge is determined, a user interest correlation graph capable of correctly obtaining a user set having similar interests Can be generated.
(4) According to the inventions described in claims 4 and 8, unlike the past, the recommended information is not from the cluster to which the user belongs. It becomes possible to make a recommendation while clearly stating that the instance is out of ambiguity.

また、推薦インスタンスの発見基クラスタを明示することで、例えば、複数のクラスタからインスタンスが推薦されてきた場合には、推薦インスタンスそれぞれに対するクラスタを明示することができるため、複数インスタンスのクラスタがそれぞれ多岐に飛んでいたならば、それぞれ、出元が異なるという情報を付加させながら、ユーザに推薦を行うことができるようになる。 Also, by specifying the discovery base cluster of recommended instances, for example, when instances have been recommended from a plurality of clusters, the clusters for each recommended instance can be specified. If the user has jumped, the user can be recommended while adding information that the sources are different.

本発明の興味情報推薦装置の実施形態例を示す構成図。The block diagram which shows the example of embodiment of the interest information recommendation apparatus of this invention. 本発明の興味情報推薦方法の実施形態例を示すフローチャート。The flowchart which shows the embodiment of the interest information recommendation method of this invention. 本発明の実施形態例で利用するユーザ分布解析と興味オントロジ自動生成の手順を示す説明図。Explanatory drawing which shows the procedure of the user distribution analysis and interest ontology automatic generation which are utilized in the embodiment of this invention. 本発明の実施形態例で利用する特許文献３における近似度計測アルゴリズムの説明図。Explanatory drawing of the approximation measurement algorithm in patent document 3 utilized in the embodiment of this invention. 本発明の実施形態例における興味度の例を示す説明図。Explanatory drawing which shows the example of the degree of interest in the embodiment example of this invention. 本発明の実施形態例における興味度を考慮したオントロジ間の近似度計測アルゴリズムの説明図。Explanatory drawing of the approximation degree measurement algorithm between ontology in consideration of the degree of interest in the embodiment of this invention. 本発明の実施形態例における興味オントロジの近似度計測による実現サービスイメージの説明図。Explanatory drawing of the realization service image by the approximation degree measurement of interest ontology in the example embodiment of this invention. 図２のユーザ興味相関グラフ生成ステップの処理の詳細を示すフローチャート。The flowchart which shows the detail of a process of the user interest correlation graph production | generation step of FIG. 図２の推薦アイテムの所属クラスタ特定ステップの処理の詳細を示すフローチャート。The flowchart which shows the detail of a process of the affiliation cluster identification step of the recommendation item of FIG. 本発明により出力される推薦リストの一例を示す説明図。Explanatory drawing which shows an example of the recommendation list | wrist output by this invention. 本発明と従来技術との比較検証を表し、推薦システムのユーザ興味把握度に対する推薦誤差を示すグラフ。The graph which shows the comparison verification of this invention and a prior art, and shows the recommendation error with respect to the user interest grasp degree of a recommendation system.

以下、図面を参照しながら本発明の実施の形態を説明するが、本発明は下記の実施形態例に限定されるものではない。 Hereinafter, embodiments of the present invention will be described with reference to the drawings, but the present invention is not limited to the following embodiments.

図１は、本発明の一実施形態例に係る興味情報推薦装置が適用されるシステムの概略構成を示すブロック図である。１はインターネットやＩＰ−ＶＰＮ（ＩｎｔｅｒｎｅｔＰｒｏｔｏｃｏｌ−ＶｉｒｔｕａｌＰｒｉｖａｔｅＮｅｔｗｏｒｋ）などの通信網であり、この通信網１を介して端末２〜４およびサーバ５が接続されている。 FIG. 1 is a block diagram showing a schematic configuration of a system to which an interest information recommendation device according to an embodiment of the present invention is applied. Reference numeral 1 denotes a communication network such as the Internet or an IP-VPN (Internet Protocol-Virtual Private Network), and the terminals 2 to 4 and the server 5 are connected via the communication network 1.

端末２〜４は、ノート型パソコンあるいはデスクトップ型パソコンからなり、携帯電話端末やＰＤＡ（ＰｅｒｓｏｎａｌＤａｔａＡｓｓｉｓｔａｎｔ）などでもよい。また、サーバ５は、ブログプロバイダやＩＳＰ（ＩｎｔｅｒｎｅｔＳｅｒｖｉｃｅＰｒｏｖｉｄｅｒ）上に設置することができる。 The terminals 2 to 4 are notebook personal computers or desktop personal computers, and may be mobile phone terminals or PDAs (Personal Data Assistants). The server 5 can be installed on a blog provider or ISP (Internet Service Provider).

ここで、サーバ５には、端末２〜４にそれぞれ対応したブログサイト７〜９が設けられ、各ブログサイト７〜９には、ブログエントリ７ａ〜７ｎ、８ａ〜８ｎ、９ａ〜９ｎがそれぞれ保持されている。なお、ブログエントリ７ａ〜７ｎ、８ａ〜８ｎ、９ａ〜９ｎはブログにおける記事の最小単位を表し、日にち毎に設けることができる。また、サーバ５には雛形オントロジ６が保持され、雛形オントロジ６には、個人（ユーザ）の興味情報が概念階層化されたパーソナルオントロジの雛形が設けられている。 Here, the server 5 is provided with blog sites 7 to 9 corresponding to the terminals 2 to 4 respectively, and the blog entries 7a to 7n, 8a to 8n, and 9a to 9n are held in the blog sites 7 to 9, respectively. Has been. The blog entries 7a to 7n, 8a to 8n, and 9a to 9n represent minimum units of articles in the blog, and can be provided for each day. The server 5 holds a template ontology 6, and the template ontology 6 is provided with a personal ontology template in which personal (user) interest information is conceptually hierarchized.

なお、雛形オントロジ６は、ブログプロバイダ側で恣意的に作成することができる。例えば、ブログプロバイダが音楽に関するパーソナルオントロジを各端末２〜４のユーザに構築させたければ、音楽に関する雛形オントロジ６を構築すればよい。ここで、各端末２〜４のユーザの興味を細やかに表現するために、可能な限り細分化された網羅性の高い雛形オントロジ６を構築することが好ましい。また、雛形オントロジ６の実体は、オントロジ記述言語ＯＷＬなどのＸＭＬ言語で記述されたテキストファイル等、クラス・インスタンスツリーをＤＢ上で表現したものである。また、情報の整理の簡単化のため、インスタンスは最下位クラスにのみ分類してもよい。 The template ontology 6 can be arbitrarily created on the blog provider side. For example, if the blog provider wants the users of the terminals 2 to 4 to construct a personal ontology related to music, the template ontology 6 related to music may be constructed. Here, in order to express the interests of the users of the terminals 2 to 4 in detail, it is preferable to construct a template ontology 6 that is subdivided as much as possible and has high coverage. The entity of the template ontology 6 is a representation of a class / instance tree on the DB, such as a text file described in an XML language such as the ontology description language OWL. In order to simplify the organization of information, instances may be classified only into the lowest class.

さらにサーバ５には、ユーザ興味類似度計測手段としてのユーザ興味類似度計測器１０、ユーザ興味相関グラフ生成手段としてのユーザ興味相関グラフ生成器２０、推薦スコア計算手段としての推薦スコア計算器３０および推薦アイテムの所属クラスタ特定手段としての推薦アイテムの所属クラスタ特定器４０が設けられている。 Further, the server 5 includes a user interest similarity measurer 10 as a user interest similarity measurer, a user interest correlation graph generator 20 as a user interest correlation graph generator, a recommended score calculator 30 as a recommended score calculator, and A recommended item belonging cluster identification device 40 is provided as a recommended item belonging cluster identifying means.

これらユーザ興味類似度計測器１０、ユーザ興味相関グラフ生成器２０、推薦スコア計算器３０および推薦アイテムの所属クラスタ特定器４０で実行される各処理は図２のフローチャートに沿って行われ、後述する各機能は例えばコンピュータによって達成される。 Each process executed by the user interest similarity measuring device 10, the user interest correlation graph generator 20, the recommendation score calculator 30, and the recommended item affiliation cluster specifying device 40 is performed according to the flowchart of FIG. Each function is achieved by a computer, for example.

＜ユーザ興味類似度計測器１０（ステップＳ１００）＞
（１）ユーザ興味類似度計測器１０は、ユーザ興味、すなわち購買履歴、視聴履歴などのユーザとアイテムの関係の情報を入力とし、例えば特許文献１、３に記載のユーザの興味分布解析および興味オントロジ自動生成（図３〜図５）と、ユーザ毎の興味類似度の計測（図４〜図７）とを、以下のようにして行なう。 <User Interest Similarity Measuring Device 10 (Step S100)>
(1) The user interest similarity measurer 10 receives user interest, that is, information on the relationship between the user and the item such as purchase history and viewing history, and analyzes the interest distribution analysis and interest of the user described in Patent Documents 1 and 3, for example. Ontology automatic generation (FIGS. 3 to 5) and interest similarity measurement for each user (FIGS. 4 to 7) are performed as follows.

（１−１）まずユーザ分布解析と興味オントロジ自動生成手順を示す図３において、ブログの更新情報を集めて提供しているｐｉｎｇサーバ（図１のサーバ５）などを通じ収集した全ブログエントリ（ユーザＡ〜Ｘのエントリ集合）に対し形態素解析を行って、該エントリに頻出する単語を抽出しインデックスを作成する（ステップＳ１）。ここで、収集されたブログエントリは一意なユーザＩＤを持つとする。 (1-1) First, in FIG. 3 showing a user distribution analysis and an interest ontology automatic generation procedure, all blog entries (users) collected through a ping server (server 5 in FIG. 1) that collects and provides blog update information, etc. Morphological analysis is performed on the entry set of A to X), and words that frequently appear in the entry are extracted to create an index (step S1). Here, it is assumed that the collected blog entries have a unique user ID.

（１−２）次に全ブログエントリを雛形オントロジＯＨに対し分類する（ステップＳ２）。分類方法としては、あるエントリ内の記述に雛形オントロジＯＨのあるクラスＣ_iの名前属性があれば、そのエントリをＣ_iに分類し、また、Ｃ_iに所属するインスタンスＩ_i（∈Ｃ_i）の名前属性があれば、そのエントリをクラスＣ_iのインスタンスＩ_iに分類する。なお、同一のエントリが複数クラスに分類されても良い。例えば、図３において、エントリ内の記述に“Ｃｈａｒｌａｔａｎｓ”という文字列がある場合、そのエントリはクラス“Ｍａｄｃｈｅｓｔｅｒ”のインスタンス“Ｃｈａｒｌａｔａｎｓ”に分類される。 (1-2) Next, all blog entries are classified with respect to the template ontology OH (step S2). As the classification method, if there is a name attribute of the class C _i with stationery ontology OH to the description in a certain entry, classifies the entry to C _i, also, instance I _i belonging to C _{_i} (∈C _i) If there is a name attribute, classify the entry into an instance I _i of class C _i . Note that the same entry may be classified into a plurality of classes. For example, in FIG. 3, when there is a character string “Charlantans” in the description in the entry, the entry is classified into an instance “Charlantans” of the class “Machchester”.

（１−３）次に、雛形オントロジＯＨを形成する最下層クラスＣ_lの持つ各インスタンスに対し興味を持つユーザ数を計測する（ステップＳ３）。なお、クラスＣ_lのインスタンスＩ_lに興味を持つユーザ数を算出する際、同一ユーザが複数エントリにおいてインスタンスＩ_lを記述していたとしても、ユーザ数は１と計測する。次に上記計測を最下層クラスに対しても実施し、最下層クラスに興味を持つユーザ数を、最下層クラス配下の全インスタンスに興味を持つユーザ数と最下層クラスＣ_l自身に興味を持つユーザ数の総和で計測する。この場合も、同一ユーザが複数インスタンスに興味を持っていたり、最下層クラスとそのクラスに所属するインスタンスに同時に興味を持つとしても、ユーザ数は１と計測する。このようにして雛形オントロジＯＨを形成するクラスやインスタンスに興味を持つユーザ数をルートクラスまで再帰的に計測する事で，そのドメインに興味を持つユーザ分布を計測できる。 (1-3) Next, the number of users who are interested in each instance of the lowermost class C ₁ forming the template ontology OH is measured (step S3). When calculating the number of users interested in the instance I _l of the class C _l, the number of users is measured as 1 even if the same user describes the instance I _{l in a} plurality of entries. Then also performed on the lowermost class the measurement, the number of users who are interested in the lowest layer class, interested in the number of users and the lowermost class C _l itself interested in all instances of subordinate lowermost class Measure with the total number of users. Also in this case, even if the same user is interested in a plurality of instances or is interested in the lowermost class and instances belonging to the class at the same time, the number of users is measured as one. In this way, by recursively measuring the number of users interested in the classes and instances forming the template ontology OH up to the root class, the distribution of users interested in the domain can be measured.

（１−４）次に全エントリを分類した上で、分類結果をユーザＩＤ毎に整理し各ユーザＡ〜Ｘ毎の興味オントロジを生成する。 (1-4) Next, after classifying all entries, the classification result is arranged for each user ID, and an interest ontology for each user A to X is generated.

（２）次に図４を用いて、特許文献３に記載の類似度計測アルゴリズムについて説明する。 (2) Next, the similarity measurement algorithm described in Patent Document 3 will be described with reference to FIG.

尚、以下の説明では、あるオントロジに対する別のオントロジとの間の近似度を計測する場合、前者をソースオントロジ、後者をターゲットオントロジと呼ぶ。 In the following description, when the degree of approximation between an ontology and another ontology is measured, the former is called a source ontology and the latter is called a target ontology.

図４において、雛形オントロジＯＨの“ａ１”というクラスの直下には、“ｂ１”、“ｂ２”および“ｂ３”というクラスが存在し、“ｂ１”というクラスの直下には、“ｃ１”および“ｃ２”というクラスが存在し、“ｃ１”というクラスの直下には、“ｄ１”および“ｄ２”というクラスが存在しているものとする。 In FIG. 4, classes “b1”, “b2”, and “b3” exist immediately below the class “a1” of the template ontology OH, and “c1” and “c3” immediately below the class “b1”. It is assumed that a class called “c2” exists and classes “d1” and “d2” exist immediately below the class called “c1”.

また、“ｂ２”というクラスの直下には、“ｃ３”および“ｃ４”というクラスが存在し、“ｂ３”というクラスの直下には、“ｃ５”というクラスが存在しているものとする。 Also, it is assumed that classes “c3” and “c4” exist immediately below the class “b2”, and a class “c5” exists immediately below the class “b3”.

また、“ｄ１”というクラスには、“ｊ”および“ｋ”というインスタンスが存在し、“ｄ２”というクラスには、“ｌ”というインスタンスが存在し、“ｃ２”というクラスには、“ｍ”というインスタンスが存在し、“ｂ２”というクラスには、“ｎ”というインスタンスが存在し、“ｃ３”というクラスには、“ａ”、“ｅ”、“ｃ”、“ｆ”、“ｂ”、“ｄ”および“ｇ”というインスタンスが存在し、“ｃ４”というクラスには、“ｐ”、“ｑ”、“ｊ”および“ｈ”というインスタンスが存在しているものとする。 The class “d1” includes instances “j” and “k”, the class “d2” includes the instance “l”, and the class “c2” includes “m”. "B", the class "b2" has an instance "n", and the class "c3" has "a", "e", "c", "f", "b" ”,“ D ”, and“ g ”exist, and the class“ c4 ”includes instances“ p ”,“ q ”,“ j ”, and“ h ”.

そして、各ユーザのブログエントリに頻出する単語をそれぞれ抽出し、その単語を含むクラスまたはインスタンスおよびそれらの上位の全てのクラスを雛形オントロジＯＨからそれぞれ抽出することにより、パーソナルオントロジＯＡ、ＯＢが作成されたものとする。 Then, the words that frequently appear in each user's blog entry are extracted, and the classes or instances including the words and all the classes above them are extracted from the template ontology OH, thereby creating personal ontologies OA and OB. Shall be.

ここで、パーソナルオントロジＯＡの“ａ１というクラスの直下には、“ｂ１”および“ｂ２”というクラスが存在し、“ｂ１”というクラスの直下には、“ｃ１”および“ｃ２”というクラスが存在し、“ｃ１”というクラスの直下には、“ｄ１”というクラスが存在し、“ｂ２”というクラスの直下には、“ｃ３”および“ｃ４”というクラスが存在しているものとする。また、“ｄ１”というクラスには、“ｊ”および“ｋ”というインスタンスが存在し、“ｃ２”というクラスには、“ｍ”というインスタンスが存在し、“ｃ３”というクラスには、“ａ”、“ｃ”、“ｂ”および“ｄ”というインスタンスが存在し、“ｃ４”というクラスには、“ｑ”および“ｈ”というインスタンスが存在しているものとする。 Here, the classes “b1” and “b2” exist immediately below the class “a1” of the personal ontology OA, and the classes “c1” and “c2” exist immediately below the class “b1”. It is assumed that a class “d1” exists immediately below the class “c1”, and a class “c3” and “c4” exist immediately below the class “b2”. , The class “d1” includes instances “j” and “k”, the class “c2” includes the instance “m”, and the class “c3” includes “a”. , “C”, “b”, and “d” exist, and the class “c4” includes instances “q” and “h”.

また、パーソナルオントロジＯＢの“ａ１”というクラスの直下には、“ｂ１”、“ｂ２”および“ｂ３”というクラスが存在し、“ｂ１”というクラスの直下には、“ｃ１”というクラスが存在し、“ｃ１”というクラスの直下には、“ｄ２”というクラスが存在し、“ｂ２”というクラスの直下には、“ｃ３”および“ｃ４”というクラスが存在しているものとする。また、“ｄ２”というクラスには、“ｌ”というインスタンスが存在し、“ｂ２”というクラスには、“ｎ”というインスタンスが存在し、“ｃ３”というクラスには、“ａ”、“ｃ”、“ｅ”および“ｆ”というインスタンスが存在し、“ｃ４”というクラスには、“ｐ”および“ｊ”というインスタンスが存在しているものとする。 Also, in the personal ontology OB, there are “b1”, “b2”, and “b3” classes directly under the “a1” class, and there is a “c1” class directly under the “b1” class. Assume that a class “d2” exists immediately under the class “c1”, and a class “c3” and “c4” exist immediately below the class “b2”. The class “d2” has an instance “l”, the class “b2” has an instance “n”, and the class “c3” has “a” and “c”. ”,“ E ”, and“ f ”exist, and“ c4 ”class includes“ p ”and“ j ”.

（２−１）まず、雛形オントロジＯＨおよびパーソナルオントロジＯＡ、ＯＢ間で末端クラスを除く共通クラスを分析し、共通クラスを親クラスとした親子クラスからなるトポロジを抽出する。なお、図４の例では、末端クラスは、“ｄ１”、“ｄ２”、“ｃ３”および“ｃ４”とする。この結果、雛形オントロジＯＨおよびパーソナルオントロジＯＡ、ＯＢ間において、クラス“ａ１”を親クラスとした子クラス集合Ｇ１、クラス“ｂ１”を親クラスとした子クラス集合Ｇ２、クラス“ｃ１”を親クラスとした子クラス集合Ｇ３、クラス“ｂ２”を親クラスとした子クラス集合Ｇ４、クラス“ｂ３”を親クラスとした子クラス集合Ｇ５を抽出することができる。 (2-1) First, a common class excluding a terminal class is analyzed between the template ontology OH and the personal ontology OA and OB, and a topology composed of parent-child classes with the common class as a parent class is extracted. In the example of FIG. 4, the end classes are “d1”, “d2”, “c3”, and “c4”. As a result, a child class set G1 having the class “a1” as the parent class, a child class set G2 having the class “b1” as the parent class, and the class “c1” being the parent class between the template ontology OH and the personal ontologies OA and OB. Child class set G3, child class set G4 having class “b2” as a parent class, and child class set G5 having class “b3” as a parent class can be extracted.

なお、この共通クラスの分析は、同じクラスＩＤが雛形オントロジＯＨおよびパーソナルオントロジＯＡ、ＯＢ間に存在するかということだけを確認すればよい。このため、クラスの名前属性やインスタンス集合プロパティなどの近似度を計測する必要がなくなり、クラス間の対応関係を正確に維持しつつ、計算量を減らすことができる。 The analysis of the common class only needs to confirm whether the same class ID exists between the template ontology OH and the personal ontology OA, OB. For this reason, it is not necessary to measure the degree of approximation of class name attributes, instance set properties, etc., and the amount of calculation can be reduced while maintaining the correspondence between classes accurately.

（２−２）次に、パーソナルオントロジＯＡ、ＯＢ間で各トポロジを形成する子クラス集合Ｘ、Ｙ間の近似度を深さ優先で計算する。ここで、雛形オントロジＯＨを構成するクラスの子クラスの集合をＺとすると、子クラス集合Ｘ、Ｙ間の近似度は、｜Ｘ∩Ｙ｜／｜Ｚ｜にて求めることができる。そして、各トポロジの子クラス集合間の近似度を足し合わせることにより、パーソナルオントロジＯＡ、ＯＢ間でのトポロジの近似度Ｓ_Tを計測することができる。 (2-2) Next, the degree of approximation between the child class sets X and Y forming each topology between the personal ontology OA and OB is calculated with depth priority. Here, if the set of child classes of the classes constituting the template ontology OH is Z, the degree of approximation between the child class sets X and Y can be obtained by | X∩Y | / | Z |. By adding the approximation degree between the child class set for each topology, it is possible to measure the degree of approximation S _T personal ontology OA, between OB topology.

例えば、子クラス集合Ｇ１において、雛形オントロジＯＨには“ｂ１”、“ｂ２”および“ｂ３”という子クラスが含まれているので、子クラス集合Ｇ１における雛形オントロジＯＨの子クラス集合のメンバ数は３となる。また、子クラス集合Ｇ１において、パーソナルオントロジＯＡには“ｂ１”および“ｂ２”という子クラスが含まれ、パーソナルオントロジＯＢには“ｂ１”、“ｂ２”および“ｂ３”という子クラスが含まれているので、パーソナルオントロジＯＡ、ＯＢに共通に含まれている子クラスは“ｂ１”およびｂ２”だけとなり、子クラス集合Ｇ１におけるパーソナルオントロジＯＡ、ＯＢの子クラス集合の積集合のメンバ数は２となる。この結果、パーソナルオントロジＯＡ、ＯＢの子クラス集合Ｇ１間における近似度は２／３となる。 For example, in the child class set G1, since the template ontology OH includes child classes “b1”, “b2”, and “b3”, the number of members of the child class set of the template ontology OH in the child class set G1 is 3 In the child class set G1, the personal ontology OA includes child classes “b1” and “b2”, and the personal ontology OB includes child classes “b1”, “b2”, and “b3”. Therefore, the child classes that are commonly included in the personal ontologies OA and OB are only “b1” and “b2”, and the number of members of the product set of the child class sets of the personal ontology OA and OB in the child class set G1 is 2. As a result, the degree of approximation between the child ontology sets G1 of the personal ontology OA and OB is 2/3.

同様に、パーソナルオントロジＯＡ、ＯＢの子クラス集合Ｇ２間における近似度は１／２、子クラス集合Ｇ３間における近似度は０／２、子クラス集合Ｇ４間における近似度は２／２となる。この結果、パーソナルオントロジＯＡ、ＯＢ間でのトポロジの近似度Ｓ_Tは２／３＋１／２＋０／２＋２／２となる。 Similarly, the degree of approximation between the personal ontology OA and OB between the child class sets G2 is 1/2, the degree of approximation between the child class sets G3 is 0/2, and the degree of approximation between the child class sets G4 is 2/2. As a result, personal ontology OA, similarity S _T topology between OB becomes 2/3 + 1/2 + 0/2 + 2/2.

なお、パーソナルオントロジＯＡ、ＯＢ間でのトポロジの近似度の計算でも、雛形オントロジＯＨから割り振られたクラスＩＤを参照し、雛形オントロジＯＨの接続形態に沿ったものがパーソナルオントロジＯＡ、ＯＢ間に存在するかということだけを確認すればよい。例えば、パーソナルオントロジＯＡ、ＯＢにおけるａ１−ｂ１−ｃ１という接続形態は雛形オントロジＯＨの接続形態と同じであるかどうかは、“ａ１”、“ｂ１”および“ｃ１”というクラスがパーソナルオントロジＯＡ、ＯＢにて保持されているかどうかということを確認するだけで判断することができる。 In the calculation of the degree of topology approximation between the personal ontology OA and OB, the class ID assigned from the template ontology OH is referred to, and the one along the connection form of the template ontology OH exists between the personal ontology OA and OB. You only have to confirm whether you want to do it. For example, whether the connection form a1-b1-c1 in the personal ontology OA, OB is the same as the connection form of the template ontology OH depends on whether the classes “a1”, “b1”, and “c1” are the personal ontology OA, OB. It can be determined simply by confirming whether or not it is held at.

このため、パーソナルオントロジＯＡ、ＯＢ間のトポロジの一致度を確認するために、パーソナルオントロジＯＡ、ＯＢが持つクラスＩＤを調べるだけでよく、対応クラスを起点として、上下クラスにさらに対応クラスがあるかを調べる必要がなくなり、パーソナルオントロジＯＡ、ＯＢ間でのトポロジの近似度の計算量を減らすことができる。 For this reason, in order to confirm the degree of coincidence of the topology between the personal ontology OA and OB, it is only necessary to check the class ID possessed by the personal ontology OA and OB. Therefore, it is possible to reduce the calculation amount of the degree of approximation of the topology between the personal ontology OA and OB.

（２−３）次に、パーソナルオントロジＯＡ、ＯＢの共通クラス間の近似度を計算する。ここで、パーソナルオントロジＯＡ、ＯＢの共通クラス間の近似度を計算する場合、クラスに所属するインスタンス集合を用いることができる。すなわち、あるクラスＣ１において、ソースオントロジのインスタンス集合をｘ、ターゲットオントロジのインスタンス集合をｙ、雛形オントロジＯＨのインスタンス集合をｚとすると、パーソナルオントロジＯＡ、ＯＢの共通クラス間の近似度は、｜ｘ∩ｙ｜／｜ｚ｜にて求めることができる。そして、共通クラス間の近似度を足し合わせることにより、パーソナルオントロジＯＡ、ＯＢのクラス間の近似度Ｓ_Cを計測することができる。 (2-3) Next, the degree of approximation between the common classes of personal ontology OA and OB is calculated. Here, when calculating the degree of approximation between the common classes of the personal ontology OA and OB, an instance set belonging to the class can be used. That is, in a certain class C1, if the instance set of the source ontology is x, the instance set of the target ontology is y, and the instance set of the template ontology OH is z, the degree of approximation between the common classes of the personal ontology OA and OB is | x ∩y | / | z |. By adding the degree of approximation between the common class, it is possible to measure the degree of approximation S _C between personal ontology OA, the OB classes.

例えば、パーソナルオントロジＯＡ、ＯＢ間の共通クラス“ｂ２”において、パーソナルオントロジＯＡの共通クラス“ｂ２”におけるインスタンス集合Ｇ６には、インスタンス“ｎ”が存在し、パーソナルオントロジＯＢの共通クラス“ｂ２”におけるインスタンス集合Ｇ６には、インスタンス“ｎ”が存在し、雛形オントロジＯＨの共通クラス“ｂ２”にはインスタンス“ｎ”が存在している。この結果、雛形オントロジＯＨの共通クラス“ｂ２”のインスタンスのメンバ数は１、パーソナルオントロジＯＡ、ＯＢのインスタンス集合Ｇ６の積集合のメンバ数は１となり、“ｂ２”という共通クラス間の近似度は１／１となる。 For example, in the common class “b2” between the personal ontologies OA and OB, the instance set G6 in the common class “b2” of the personal ontology OA includes the instance “n”, and the common class “b2” of the personal ontology OB. An instance “n” exists in the instance set G6, and an instance “n” exists in the common class “b2” of the template ontology OH. As a result, the number of members of the common class “b2” of the template ontology OH is 1, the number of members of the product set of the instance set G6 of the personal ontologies OA and OB is 1, and the degree of approximation between the common classes “b2” is 1/1.

また、パーソナルオントロジＯＡ、ＯＢ間の共通クラス“ｃ３”において、パーソナルオントロジＯＡの共通クラス“ｃ３”におけるインスタンス集合Ｇ７には、インスタンス“ａ”、“ｃ”、“ｂ”および“ｄ”が存在し、パーソナルオントロジＯＢの共通クラス“ｃ３”におけるインスタンス集合Ｇ７には、インスタンス“ａ”、“ｃ”、“ｅ”および“ｆ”が存在し、雛形オントロジＯＨの共通クラス“ｃ３”にはインスタンス“ａ”、“ｅ”、“ｃ”、“ｆ”、“ｂ”、 “ｄ”および“ｇ”が存在している。 Further, in the common class “c3” between the personal ontology OA and OB, the instance set G7 in the common class “c3” of the personal ontology OA includes instances “a”, “c”, “b”, and “d”. In the instance set G7 in the common class “c3” of the personal ontology OB, instances “a”, “c”, “e”, and “f” exist, and in the common class “c3” of the template ontology OH “A”, “e”, “c”, “f”, “b”, “d” and “g” exist.

この結果、雛形オントロジＯＨの共通クラス“ｃ３”のインスタンスのメンバ数は７、パーソナルオントロジＯＡ、ＯＢのインスタンス集合Ｇ７の積集合のメンバ数は２となり、“ｃ３”という共通クラス間の近似度は２／７となる。従って、パーソナルオントロジＯＡ、ＯＢのクラス間の近似度Ｓ_Cは１／１＋２／７となる。 As a result, the number of members of the common class “c3” of the template ontology OH is 7, the number of members of the product set of the instance set G7 of the personal ontologies OA and OB is 2, and the degree of approximation between the common classes “c3” is 2/7. Accordingly, the degree of approximation S _C between the classes of the personal ontology OA and OB is 1/1 + 2/7.

なお、パーソナルオントロジＯＡ、ＯＢの共通クラス間の近似度の計算でも、雛形オントロジＯＨから割り振られたインスタンスＩＤがパーソナルオントロジＯＡ、ＯＢ間に存在するかということだけを確認すればよい。このため、パーソナルオントロジＯＡ、ＯＢのクラス間の近似度を計算するために、インスタンスの名前の一致性などによるインスタンスの対応関係を予め確認する必要がなくなり、計算量を減らすことができる。 Note that even in the calculation of the degree of approximation between the common classes of the personal ontology OA and OB, it is only necessary to confirm whether the instance ID allocated from the template ontology OH exists between the personal ontology OA and OB. For this reason, in order to calculate the degree of approximation between the classes of the personal ontology OA and OB, it is not necessary to confirm in advance the correspondence between instances based on the consistency of the names of instances, and the amount of calculation can be reduced.

（２−４）そして、パーソナルオントロジＯＡ、ＯＢにおけるトポロジの近似度Ｓ_Tおよびクラス間の近似度Ｓ_Cが求まると、トポロジとクラスに対する重要度に応じた評価関数ｆ（Ｘ）を用いることにより、以下の式（１）にてパーソナルオントロジＯＡ、ＯＢ間の近似度Ｓ_O（ＡＢ）を与えることができる。 (2-4) Then, when the topology approximation S _T and the class approximation S _C in the personal ontology OA, OB are obtained, the evaluation function f (X) corresponding to the importance for the topology and the class is used. The approximation S _O (AB) between the personal ontology OA and OB can be given by the following equation (1).

Ｓ_O（ＡＢ）＝Ｓ_T＋ｆ（Ｓ_C）…（１）
また、ユーザ毎の興味オントロジを構成するクラス・インスタンスに対する興味度合を示す興味度を導入することで，ユーザの決め細やかな興味を反映し、同じクラス・インスタンスに興味を持つユーザの中でも興味度の近いユーザ間でのコミュニティ形成を試みる。まず興味度の定義を与える。 S _O (AB) = S _T + f (S _C ) (1)
Also, by introducing an interest level that indicates the degree of interest in the class / instance that constitutes the interest ontology for each user, the user's detailed interest is reflected, and even among users who are interested in the same class / instance, Try to create a community between nearby users. First, the definition of interest is given.

（２−５−１）１エントリ当たりのユーザの興味度を１とする。 (2-5-1) The degree of interest of the user per entry is 1.

（２−５−２）あるエントリＥ_iに出現するユーザのクラスおよびインスタンスの種類をＮ（Ｅ_i）個とすると、そのエントリＥ_iにおけるユーザの各クラスおよびインスタンスの興味度は１／Ｎ（Ｅ_i）である。 (2-5-2) When the types of user classes and instances appearing in an entry E _i are N (E _i ), the degree of interest of each class and instance of the user in the entry E _i is 1 / N ( E _i ).

（２−５−３）オントロジ内の各インスタンスＩ_iに対する興味度は、ユーザの全蓄積エントリ集合をＥとすると、以下の式（２）で与えられる。 (2-5-3) The degree of interest for each instance I _i in the ontology is given by the following equation (2), where E is the total accumulated entry set of the user.

（２−５−４）オントロジ内の各クラスＣ_iに対する興味度は、ユーザの全蓄積エントリ集合をＥとすると、以下の式（３）で与えられる。 (2-5-4) The degree of interest for each class C _i in the ontology is given by the following equation (3), where E is the total accumulated entry set of the user.

（２−５−５）インスタンスの興味度は、そのインスタンスが所属するクラスの興味度に引き継がれ、子クラスの興味度は、親クラスの興味度に引き継がれる。 (2-5-5) The interest level of the instance is inherited by the interest level of the class to which the instance belongs, and the interest level of the child class is inherited by the interest level of the parent class.

図５は、図３、図４におけるクラスおよびインスタンスの興味度の一例を示す図である。図５において、ユーザＡの全蓄積エントリには、エントリＥ１、Ｅ２が含まれているものとする。そして、ユーザＡの興味オントロジＯＡには、“オルタナティブ”というクラスが存在し、“オルタナティブ”というクラスには、“Ｍａｄｃｈｅｓｔｅｒ”および“Ｓｈｏｅｇａｚｅ”という子クラスが存在しているものとする。また、オルタナティブ”というクラスには、“ｎｉｒｖａｎａ”というインスタンスが存在し、“Ｍａｄｃｈｅｓｔｅｒ”というクラスには、“ＮｅｗＯｒｄｅｒ”および“ＳｔｏｎｅＲｏｓｅｓ”というインスタンスが存在し、“Ｓｈｏｅｇａｚｅ”というクラスには、“ＭｙＢｌｏｏｄｙＶａｌｅｎｔｉｎｅ”および“Ｒｉｄｅ”というインスタンスが存在しているものとする。 FIG. 5 is a diagram showing an example of the degree of interest of the classes and instances in FIGS. In FIG. 5, it is assumed that entries A1 and E2 are included in all the stored entries of user A. In the interest ontology OA of the user A, a class called “alternative” exists, and a class called “alternative” has child classes called “Madchester” and “Shoegaze”. The class “alternative” has an instance “nirvana”, the class “Madchester” has instances “New Order” and “Stone Roses”, and the class “Shoegaze” has “ Assume that instances of “My Bloody Valentine” and “Ride” exist.

そして、エントリＥ１では、“ＳｔｏｎｅＲｏｓｅｓ”、“ＭｙＢｌｏｏｄｙＶａｌｅｎｔｉｎｅ”、“ｎｉｒｖａｎａ”および“ＮｅｗＯｒｄｅｒ”という４アーチストが出現し、エントリＥ２では、“ＭｙＢｌｏｏｄｙＶａｌｅｎｔｉｎｅ”という１アーチストが再度出現する上に、“Ｓｈｏｅｇａｚｅ”という１ジャンルと、“Ｒｉｄｅ”という１アーチストが出現している。 In entry E1, four artists “Stone Roses”, “My Bloody Valentine”, “nirvana”, and “New Order” appear, and in entry E2, one artist “My Bloody Valentine” appears again. , One genre of “Shoegaze” and one artist of “Ride” have appeared.

このため、“ＮｅｗＯｒｄｅｒ”というインスタンスの興味度は１／４、“ＳｔｏｎｅＲｏｓｅｓ”というインスタンスの興味度は１／４、“ＭｙＢｌｏｏｄｙＶａｌｅｎｔｉｎｅ”というインスタンスの興味度は１／４＋１／３＝７／１２、“Ｒｉｄｅ”というインスタンスの興味度は１／３、“ｎｉｒｖａｎａ”というインスタンスの興味度は１／４となる。 Therefore, the interest level of the instance “New Order” is 1/4, the interest level of the instance “Stone Roses” is 1/4, and the interest level of the instance “My Bloody Valentine” is 1/4 + 1/3 = 7 / 12. The interest level of the instance “Ride” is 1/3, and the interest level of the instance “nirvana” is 1/4.

また、“Ｍａｄｃｈｅｓｔｅｒ”というクラスの興味度は、そのクラスの配下の“ＮｅｗＯｒｄｅｒ”および“ＳｔｏｎｅＲｏｓｅｓ”というインスタンスの興味度を足し合わせることで求めることができ、１／４＋１／４＝１／２となる。また、“Ｓｈｏｅｇａｚｅ”というクラスの興味度は、そのクラスの配下の“ＭｙＢｌｏｏｄｙＶａｌｅｎｔｉｎｅ”および“Ｒｉｄｅ”というインスタンスの興味度を足し合わせることで求めることができ、７／１２＋１／３＋１／３＝５／４となる。さらに、“オルタナティブ”というクラスの興味度は、そのクラスの配下の“ｎｉｒｖａｎａ”というインスタンス、“Ｍａｄｃｈｅｓｔｅｒ”および“Ｓｈｏｅｇａｚｅ”というクラスの興味度を足し合わせることで求めることができ、１／２＋５／４＋１／４＝２となる。 The interest level of the class “Madchester” can be obtained by adding the interest levels of the instances “New Order” and “Stone Roses” under the class, and 1/4 + 1/4 = 1/2. It becomes. Further, the interest level of the class “Shoegaze” can be obtained by adding the interest levels of the instances “My Bloody Valentine” and “Ride” under the class, and 7/12 + 1/3 + 1/3 = 5 / 4. Furthermore, the interest level of the class “alternative” can be obtained by adding the interest levels of the instances “nirvana”, “Madchester”, and “Shoegaze” under the class, 1/2 + 5/4 + 1 / 4 = 2.

これにより、“ＭｙＢｌｏｏｄｙＶａｌｅｎｔｉｎｅ”というにアーチストついては、２つのエントリＥ１、Ｅ２に書き込みがなされているため、ユーザＡの興味が強いということを認識することができる。また、“Ｓｈｏｅｇａｚｅ”というにジャンルついては、そのクラス名もエントリＥ２に出現する上に、その配下のアーチストをエントリＥ１、Ｅ２において頻繁に語っているため、興味度を大きくすることができ、単純に配下にアーチストを多く持っていても、興味としては均等に扱われることを防止することができる。 As a result, the artist “My Bloody Valentine” has been written in the two entries E 1 and E 2, so that it can be recognized that the user A is highly interested. In addition, for the genre of “Shoegaze”, the class name also appears in entry E2, and the subordinate artists are frequently spoken in entries E1 and E2, so the degree of interest can be increased, and simply Even if there are many artists under them, it can be prevented that they are treated equally as interests.

（３）次に、クラス・インスタンス毎の興味度やクラスの接続形態であるトポロジをも考慮しオントロジ問の近似度を求める本発明の手法である近似度計測アルゴリズムについて図６とともに説明する。 (3) Next, an approximation measurement algorithm, which is a method of the present invention for obtaining the approximation of ontology questions in consideration of the interest for each class / instance and the topology that is the class connection form, will be described with reference to FIG.

図６において、ユーザＡの興味オントロジＯＡの“ａ１というクラスの直下には、“ｂ１”および“ｂ２”というクラスが存在し、“ｂ２”というクラスの直下には、“ｃ３”および“ｃ４”というクラスが存在しているものとする。また、“ｂ１”というクラスには、“ｍ”および“ｋ”というインスタンスが存在し、“ｂ２”というクラスには、“ｎ”というインスタンスが存在し、“ｃ３”というクラスには、“ａ”、“ｂ”および“ｃ”というインスタンスが存在し、“ｃ４”というクラスには、“ｇ”および“ｈ”というインスタンスが存在しているものとする。 In FIG. 6, the classes “b1” and “b2” exist immediately below the class “a1” of the interest ontology OA of the user A, and “c3” and “c4” exist immediately below the class “b2”. The class “b1” has instances “m” and “k”, and the class “b2” has an instance “n”. The class “c3” has instances “a”, “b”, and “c”, and the class “c4” has instances “g” and “h”. To do.

そして、オントロジＯＡにおいて、“ｃ３”というクラスの興味度は１、“ａ”というインスタンスの興味度は２、“ｂ”というインスタンスの興味度は５、“ｃ”というインスタンスの興味度は４であるとすると、“ｃ３”というクラスの配下の興味度は１＋２＋５＋４＝１２となる。 In the ontology OA, the interest degree of the class “c3” is 1, the interest degree of the instance “a” is 2, the interest degree of the instance “b” is 5, and the interest degree of the instance “c” is 4. If there is, the degree of interest under the class “c3” is 1 + 2 + 5 + 4 = 12.

また、オントロジＯＡにおいて、“ｃ４”というクラスの興味度は０、“ｇ”というインスタンスの興味度は２、“ｈ”というインスタンスの興味度は１であるとすると、“ｃ４”というクラスの配下の興味度は０＋２＋１＝３となる。 Also, in the ontology OA, if the interest level of the class “c4” is 0, the interest level of the instance “g” is 2, and the interest level of the instance “h” is 1, the subordinate of the class “c4” The degree of interest is 0 + 2 + 1 = 3.

また、オントロジＯＡにおいて、“ｂ２”というクラスの興味度は１、“ｎ”というインスタンスの興味度は２であるとすると、“ｂ２”というクラスの配下の興味度は１２＋３＋１＋２＝１８となる。 In the ontology OA, if the interest level of the class “b2” is 1, and the interest degree of the instance “n” is 2, the interest degree under the class “b2” is 12 + 3 + 1 + 2 = 18.

また、オントロジＯＡにおいて、“ｂ１”というクラスの興味度は０、“ｍ”というインスタンスの興味度は３、“ｋ”というインスタンスの興味度は２であるとすると、“ｂ１”というクラスの配下の興味度は０＋３＋２＝５となる。 Also, in the ontology OA, if the interest level of the class “b1” is 0, the interest degree of the instance “m” is 3, and the interest degree of the instance “k” is 2, the subordinate of the class “b1” The degree of interest is 0 + 3 + 2 = 5.

また、オントロジＯＡにおいて、“ａ１”というクラスの興味度は０であるとすると、“ａ１”というクラスの配下の興味度は５＋１８＝２３となる。また、ユーザＢの興味オントロジＯＢの“ａ１というクラスの直下には、“ｂ１”、“ｂ２”および“ｂ３”というクラスが存在し、“ｂ１”というクラスの直下には、“ｃ１”というクラスが存在し“ｂ２”というクラスの直下には、“ｃ３”および“ｃ４”というクラスが存在しているものとする。また、“ｃ１”というクラスには、“ｌ”というインスタンスが存在し、“ｂ２”というクラスには、“ｎ”というインスタンスが存在し、“ｂ３”というクラスには、“ｄ”というインスタンスが存在し、“ｃ３”というクラスには、“ａ”、“ｃ”および“ｅ”というインスタンスが存在し、“ｃ４”というクラスには、“ｐ”および“ｉ”というインスタンスが存在しているものとする。 In the ontology OA, if the interest level of the class “a1” is 0, the interest level under the class “a1” is 5 + 18 = 23. In addition, classes “b1”, “b2”, and “b3” exist immediately below the class “a1” of the user B's interest ontology OB, and a class “c1” exists immediately below the class “b1”. And a class “c3” and a class “c4” exist immediately under the class “b2”, and an instance “l” exists in the class “c1”. The class “b2” has an instance “n”, the class “b3” has an instance “d”, the class “c3” has “a”, “c”, and It is assumed that an instance “e” exists, and a class “c4” includes instances “p” and “i”.

そして、オントロジＯＢにおいて、“ｃ３”というクラスの興味度は０、“ａ”というインスタンスの興味度は３、“ｃ”というインスタンスの興味度は３、“ｅ”というインスタンスの興味度は３であるとすると、“ｃ３”というクラスの配下の興味度は３＋３＋３＝９となる。 In the ontology OB, the interest degree of the class “c3” is 0, the interest degree of the instance “a” is 3, the interest degree of the instance “c” is 3, and the interest degree of the instance “e” is 3. If there is, the degree of interest under the class “c3” is 3 + 3 + 3 = 9.

また、オントロジＯＢにおいて、“ｃ４”というクラスの興味度は１、“ｐ”というインスタンスの興味度は３、“ｉ”というインスタンスの興味度は２であるとすると、“ｃ４”というクラスの配下の興味度は１＋３＋２＝６となる。 Also, in the ontology OB, if the interest level of the class “c4” is 1, the interest degree of the instance “p” is 3, and the interest degree of the instance “i” is 2, the subordinate of the class “c4” Is 1 + 3 + 2 = 6.

また、オントロジＯＢにおいて、“ｂ３”というクラスの興味度は０、“ｄ”というインスタンスの興味度は２であるとすると、“ｂ３”というクラスの配下の興味度は０＋２＝２となる。 In the ontology OB, if the interest level of the class “b3” is 0 and the interest degree of the instance “d” is 2, the interest degree under the class “b3” is 0 + 2 = 2.

また、オントロジＯＢにおいて、“ｂ２”というクラスの興味度は０、“ｎ”というインスタンスの興味度は６であるとすると、“ｂ２”というクラスの配下の興味度は６＋９＋６＝２１となる。 Also, in the ontology OB, if the interest level of the class “b2” is 0 and the interest degree of the instance “n” is 6, the subordinate interest level of the class “b2” is 6 + 9 + 6 = 21.

また、オントロジＯＢにおいて、“ｂ１”というクラスの興味度は０であるとすると、“ｂ１”というクラスの配下の興味度は０＋３＝３となる。 In the ontology OB, if the degree of interest of the class “b1” is 0, the degree of interest under the class “b1” is 0 + 3 = 3.

また、オントロジＯＢにおいて、“ａ１”というクラスの興味度は０であるとすると、“ａ１”というクラスの配下の興味度は３＋２１＋２＝２６となる。 In the ontology OB, if the interest level of the class “a1” is 0, the interest level under the class “a1” is 3 + 21 + 2 = 26.

ここで、オントロジ間の興味一致度の計算に用いる用語を以下のように定義する。親クラスおよび子クラスからなるトポロジをＴ₁、クラスとインスタンスからなるトポロジをＴ₂とする。さらに、興味オントロジＯＡ、ＯＢの共通クラスをＣ_iと定義し、共通インスタンスをＩ_i∈Ｉと定義する。特に、トポロジＴ₁に所属するクラス集合をＣ（Ｔ₁）、トポロジＴ₂に所属するクラス集合をＣ（Ｔ₂）とする。また、共通クラスＣ_iに対する興味一致度をＩ（Ｃ_i）とし、共通インスタンスＩ_iに対する興味一致度をＩ（Ｉ_i）とし、共通クラスＣ_iの配下の興味一致度をＩ_t（Ｃ_i）とする。 Here, the terms used for calculating the degree of interest matching between ontology are defined as follows. Let T ₁ be the topology consisting of the parent class and child class, and T ₂ be the topology consisting of the class and instance. Furthermore, a common class of interest ontology OA and OB is defined as C _i, and a common instance is defined as I _i ∈I. In particular, a class set belonging to topology T ₁ is C (T ₁ ), and a class set belonging to topology T ₂ is C (T ₂ ). Further, the interest coincidence for the common class C _i is I (C _i ), the interest coincidence for the common instance I _i is I (I _i ), and the interest coincidence under the common class C _i is I _t (C _i). ).

（３−１）まず、興味オントロジＯＡ、ＯＢの共通クラスＣ_iを分析し、両トポロジＴ₁、Ｔ₂を形成する共通クラスＣ_iが存在することに注意しながら、トポロジＴ₁を形成する共通クラスＣ_iとトポロジＴ₂を形成する共通クラスＣ_iとを抽出する。例えば、図５の例では、共通クラスａ１、ｂ１、ｂ２はトポロジＴ₁を形成し、共通クラスｂ２、ｃ３、ｃ４はトポロジＴ₂を形成することができる。 (3-1) First, the common class C _i of the interest ontology OA and OB is analyzed, and the topology T ₁ is formed while paying attention to the presence of the common class C _i that forms both topologies T ₁ and T _2. extracts the common classes C _i to form a common class C _i and topology T _2. For example, in the example of FIG. 5, the common class a1, b1, b2 forms a topology T _1, the common class b2, c3, c4 can form a topology T _2.

（３−２）共通クラスＣ_iが興味オントロジＯＡ、ＯＢ間で共通インスタンスＩ_iを持つとすると、共通インスタンスＩ_iに対する興味一致度をＩ（Ｉ_i）は、興味オントロジＯＡ、ＯＢのインスタンスに対する興味度のうち小さい方とする。例えば、図６の例では、共通クラスｃ３の配下の共通インスタンスａに対する興味一致度は興味オントロジＯＡ側の２となる。 (3-2) Assuming that the common class C _i has a common instance I _i between the interest ontologies OA and OB, the interest coincidence I (I _i ) with respect to the common instance I _{i is obtained} for the instances of the interest ontologies OA and OB. The lesser of interest. For example, in the example of FIG. 6, the degree of interest coincidence with respect to the common instance a under the common class c3 is 2 on the interest ontology OA side.

（３−３）興味オントロジＯＡ、ＯＢにおける共通クラスＣ_iに対する興味一致度Ｉ（Ｃ_i）は、興味オントロジＯＡ、ＯＢのクラスに対する興味度のうち小さい方とすることができる。例えば、図６の例では、共通クラスｃ３に対する興味一致度は興味オントロジＯＢ側の０となる。 (3-3) The interest coincidence I (C _i ) for the common class C _i in the interest ontologies OA and OB can be the smaller of the interest degrees for the classes of the interest ontologies OA and OB. For example, in the example of FIG. 6, the interest coincidence with respect to the common class c3 is 0 on the interest ontology OB side.

（３−４）共通クラスＣ_i∈Ｃ（Ｔ₁）の配下の興味一致度Ｉ_t（Ｃ_i）は、興味オントロジＯＡ、ＯＢにおけるクラスＣ_iの配下の子クラスの積集合をＮ（Ｃ_i）、和集合をＵ（Ｃ_i）とすると、以下の式（４）で与えることができる。 (3-4) The interest coincidence I _t (C _i ) under the common class C _i ∈ C (T ₁ ) is the product set of the child classes under the class C _i in the interest ontologies OA and OB as N (C _i ), where the union is U (C _i ), it can be given by the following equation (4).

例えば図６の例では、共通クラスｂ２の配下の興味一致度Ｉ_t（Ｃ_i）は、（９＋３）／２＝６となる。 For example, in the example of FIG. 6, the interest coincidence I _t (C _i ) under the common class b2 is (9 + 3) / 2 = 6.

そして、以下の式（５）で示すように、トポロジＴ₁を形成する全ての共通クラスＣ_iについての興味一致度Ｉ_t（Ｃ_i）を足し合わせた値を、トポロジＴ₁を形成する共通クラス集合に対する興味一致度Ｓ（Ｔ₁）とすることができる。 Then, as shown in the following equation (5), all common classes C _i value obtained by adding an interest matching score I _t (C _i) for forming a topology T _1, to form a topology T ₁ common The interest coincidence S (T ₁ ) for the class set can be used.

（３−５）一方、共通クラスＣ_i∈Ｃ（Ｔ₂）の配下の興味一致度Ｉ_t（Ｃ_i）は、興味オントロジＯＡにおけるクラスＣ_iの配下のインスタンス集合をＩ_A（Ｃ_i）、興味オントロジＯＢにおけるクラスＣ_iの配下のインスタンス集合をＩ_B（Ｃ_i）とすると、以下の式（６）で与えることができる。 (3-5) On the other hand, the interest coincidence I _t (C _i ) under the common class C _i ∈ C (T ₂ ) is the instance set under the class C _i in the interest ontology OA as I _A (C _i ) If the instance set under the class C _i in the interest ontology OB is I _B (C _i ), it can be given by the following equation (6).

例えば図６の例では、共通クラスｃ３の配下の興味一致度Ｉ_t（Ｃ_i）は、（２＋０＋３＋０）／４＝５／４となる。 For example, in the example of FIG. 6, the interest coincidence I _t (C _i ) under the common class c3 is (2 + 0 + 3 + 0) / 4 = 5/4.

そして、以下の式（７）で示すように、トポロジＴ₂を形成する全ての共通クラスＣ_iについての興味一致度Ｉ_t（Ｃ_i）を足し合わせた値を、トポロジＴ₂を形成する共通クラス集合に対する興味一致度Ｓ（Ｔ₂）とすることができる。 Then, as shown by the following equation (7), all common classes C _i value obtained by adding an interest matching score I _t (C _i) for forming a topology T _2, to form a topology T ₂ Common The interest coincidence S (T ₂ ) for the class set can be used.

（３−６）そして、トポロジＴ₁、Ｔ₂に対する重要度に応じた評価関数をｆ（Ｘ）とすると、興味度を考慮した興味オントロジＯＡ、ＯＢ間の近似度は、
Ｓ_O（ＡＢ）＝Ｓ（Ｔ₁）＋ｆ（Ｓ（Ｔ₂））…（８）
で与えることができる。 (3-6) Then, assuming that the evaluation function corresponding to the importance with respect to the topologies T ₁ and T ₂ is f (X), the degree of approximation between the interest ontology OA and OB considering the degree of interest is
S _O (AB) = S (T ₁ ) + f (S (T ₂ )) (8)
Can be given in

これによって、前記抽出されたパーソナルオントロジのトポロジだけでなく、ユーザの興味度を考慮しながら、パーソナルオントロジ間の近似度を計測することにより、個人の興味の強さを考慮しながら単なる文字列ではなく概念間の一致性に基づいて、自分の嗜好に適合した情報を精度よく検索することが可能となる。 In this way, not only the extracted personal ontology topology but also the degree of interest of the user is taken into account, and the degree of approximation between the personal ontologies is measured, thereby taking into account the strength of the individual's interest and using only a character string. Instead, it is possible to search for information suitable for one's preference with high accuracy based on consistency between concepts.

図７は、本発明の一実施形態に係る興味オントロジの近似性を利用したコミュニティ形成方法を示す図である。 FIG. 7 is a diagram illustrating a community formation method using the approximation of interest ontology according to an embodiment of the present invention.

図７において、各ユーザＡ、ＢのブログエントリＰＡ、ＰＢを雛形オントロジに対してそれぞれ分類することにより、各ユーザＡ、Ｂの興味オントロジＫＡ、ＫＢがそれぞれ生成されたものとする（ステップＳ１１）。そして、各ユーザＡ、Ｂの興味オントロジＫＡ、ＫＢ間の近似度を計測し（ステップＳ１２）、近似度の高い興味オントロジＫＡ、ＫＢ間で共起するクラスやインスタンスを分析することで、トポロジが異なるにも関わらず興味を持つ可能性が高い情報を他のユーザのエントリを介して意外な情報としてユーザに推奨することができる（ステップＳ１３）。 In FIG. 7, it is assumed that the blog entries PA and PB of the users A and B are classified with respect to the template ontology, thereby generating the interest ontologies KA and KB of the users A and B, respectively (step S11). . Then, the degree of approximation between the interest ontologies KA and KB of the users A and B is measured (step S12), and the topology and the instance are analyzed by analyzing the classes and instances that co-occur between the interest ontologies KA and KB having a high degree of approximation. Although it is different, information that is highly likely to be of interest can be recommended to the user as unexpected information through the entry of another user (step S13).

例えば、各ユーザＡ、Ｂの興味オントロジＫＡ、ＫＢ間の近似度を計測することにより、“Ｍａｄｃｈｅｓｔｅｒ”などのクラスや“ＨａｐｐｙＭｏｎｄａｙｓ”などのインスタンスに興味を持つユーザは、“Ｇｌａｓｇｏｗ”というクラスや“ＴｅｅｎａｇｅＦａｎｃｌｕｂ”というインスタンスにも興味を持つ可能性が高いことが判る。 For example, by measuring the degree of approximation between the interest ontologies KA and KB of each user A and B, a user who is interested in a class such as “Madchester” or an instance such as “Happy Mondays” can have a class “Glasgow” It can be seen that there is a high possibility of being interested in an instance of “Teenage Fanclub”.

また、このような興味オントロジＫＡ、ＫＢをブログに適用することで、単純なキーワード検索ではなく、興味オントロジＫＡ、ＫＢ間の近似度に基づく意外なエントリ推薦によるコミュニティの形成を支援することができ、ユーザの興味を自然に広げることができる（ステップＳ１４）。 Moreover, by applying such interest ontologies KA and KB to a blog, it is possible to support the formation of a community based on an unexpected entry recommendation based on the degree of approximation between the interest ontologies KA and KB rather than a simple keyword search. The user's interest can be naturally expanded (step S14).

＜ユーザ興味相関グラフ生成器２０（ステップＳ２００）＞
（４）次にユーザ興味相関グラフ生成器２０は、前記計測されたユーザ興味類似度に基づいて、ユーザをノードとし興味の類似するユーザ間にエッジを引くことでユーザグラフ（ユーザ興味相関グラフ）を生成する。そのグラフ生成処理は図８のフローチャートに沿って実行される。 <User Interest Correlation Graph Generator 20 (Step S200)>
(4) Next, based on the measured user interest similarity, the user interest correlation graph generator 20 draws an edge between users having similar interests with the user as a node, thereby creating a user graph (user interest correlation graph). Is generated. The graph generation process is executed according to the flowchart of FIG.

（４−１）図８において、まずステップＳ２０１では、前記図３〜図７で述べた類似度計測法により計測されたユーザ毎の類似度を取得する。次にステップＳ２０２では、ユーザｕ_iから見て類似度の高いＸ人のユーザｕ_jとの間にリンクを引く。このパラメータＸの決定は重要である。ユーザＡと興味の近いユーザを、前述した、類似度計測法で計算した後、類似度が高いユーザ順に、ユーザの持つインスタンスの種類をカウントしていく。そして、ユニークなインスタンスの種類数を、類似度が高いユーザ集合で足しこんでいったときに、ある点で急激にインスタンスの種類が増えた場合に、ユーザＡとの興味が大きくずれたと考え、その手前のユーザまでを、興味が類似するユーザ集合（Ｘ人からなる）であると定義する。 (4-1) In FIG. 8, first, in step S201, the similarity for each user measured by the similarity measurement method described in FIGS. Next, in step S202, a link is drawn with X users u _j having a high degree of similarity when viewed from the user u _i . The determination of this parameter X is important. After calculating a user who is interested in user A by the similarity measurement method described above, the types of instances possessed by the user are counted in the order of the user having the highest similarity. And, when the number of unique instances is added to a set of users with high similarity, if the number of instance types suddenly increases at a certain point, it is considered that the interest in user A has greatly shifted, The user up to that point is defined as a user set (consisting of X persons) with similar interests.

（４−２）次にステップＳ２０３では、エッジに、ユーザｕ_iからみたｕ_jの類似度を正規化した値を重みとして与える。この正規化の方法は、ユーザｕ_iからみて、最も類似度が低いユーザの類似度をａとし、最も類似度が大きいユーザの類似度をｂとすると、ｕ_jとｕ_iの間のエッジの重みは、
｛ｓｃｏｒｅ（ｕ_j）−ａ｝／（ｂ−ａ）…（９）
で与えられる。 (4-2) Next, in step S203, a value obtained by normalizing the similarity of u _j viewed from the user u _i is given to the edge as a weight. This method of normalization is viewed from the user u _i, and the highest degree of similarity is low user similarity with a, when the highest similarity is larger user similarity is b, the edge between u _j and u _i The weight is
{Score (u _j ) −a} / (b−a) (9)
Given in.

＜推薦スコア計算器３０（ステップＳ３００）＞
（５）次に推薦スコア計算器３０は、前記ユーザ興味相関グラフ生成器２０により生成されたユーザグラフ上を、アクティブユーザａの持つ興味アイテムを起点としてランダムウォークさせ、たどったユーザの持つ興味アイテム（ユーザの興味対象コンテンツ）を推薦対象リストとし、そのコンテンツの出現回数と、エッジのスコアを基に、興味対象コンテンツの推薦スコアを計算する。 <Recommended Score Calculator 30 (Step S300)>
(5) Next, the recommendation score calculator 30 causes the user's interest generated by the active user a to randomly walk on the user graph generated by the user interest correlation graph generator 20 starting from the item of interest possessed by the active user a. The recommended target list is calculated based on the number of appearances of the content and the edge score.

前記ランダムウォークは、非特許文献２の３章に記載のランダムウォークに基づくグラフベースの協調フィルタリングを利用するものであるが、本実施形態例では、ランダムウォークにおけるマルコフ遷移行列の要素に、前記図３〜図７で述べたタクソノミに基づく類似度計算に基づいたユーザ興味の類似度を代入することで実施される。 The random walk uses graph-based collaborative filtering based on the random walk described in Chapter 3 of Non-Patent Document 2, but in the present embodiment, the elements of the Markov transition matrix in the random walk It implements by substituting the similarity of a user interest based on the similarity calculation based on the taxonomy described in FIGS.

＜推薦アイテムの所属クラスタ特定器４０（ステップＳ４００）＞
（６）次に推薦アイテムの所属クラスタ特定器４０は、前記ユーザ興味相関グラフ生成器２０により生成されたユーザグラフをクラスタリングし、該クラスタリング結果に基づいて、推薦アイテムおよびそのアイテムの背景情報を含む推薦リストを作成する。その処理は図９のフローチャートに沿って実行される。 <Recommended Item Affiliation Cluster Identification Device 40 (Step S400)>
(6) Next, the recommended item affiliation cluster identification unit 40 clusters the user graph generated by the user interest correlation graph generator 20, and includes the recommended item and background information of the item based on the clustering result. Create a recommendation list. The process is executed according to the flowchart of FIG.

（６−１）図９において、まずステップＳ４０１では、ユーザ興味相関グラフ生成器２０により生成されたユーザグラフを取得する。次にステップＳ４０２では、非特許文献３に記載の方法を、前記取得したユーザグラフに適用して、例えばＧＮ法によりクラスタリングを実施する。 (6-1) In FIG. 9, first, in step S401, the user graph generated by the user interest correlation graph generator 20 is acquired. Next, in step S402, the method described in Non-Patent Document 3 is applied to the acquired user graph, and clustering is performed by, for example, the GN method.

すなわち、ユーザグラフ内の全てのエッジについてｂｅｔｗｅｅｎｎｅｓｓ（クラスタ間のエッジは高い値となり、クラスタ内のエッジは低い値となる指標）を計算し、最も高いｂｅｔｗｅｅｎｎｅｓｓを持つエッジを取り除き、残りの全てのエッジについて再びｂｅｔｗｅｅｎｎｅｓｓを計算する手順を繰り返し行なう。 That is, for all edges in the user graph, betweenness (an index in which the inter-cluster edge has a high value and the in-cluster edge has a low value) is calculated, the edge having the highest betweenness is removed, and all the remaining edges Repeat the procedure for calculating betweenness again.

前記非特許文献に記載の方法ではクラスタリングを適用するグラフに対する重みを考慮していないが、本実施形態例におけるクラスタリングは、エッジに重みがあるグラフに対し適切なクラスタを生成するものである。 Although the method described in the non-patent document does not consider the weight for the graph to which clustering is applied, the clustering in the present embodiment generates an appropriate cluster for the graph having a weight at the edge.

また、本実施形態例のクラスタリングにおける新規点は、ユーザグラフのエッジ数を、前記図８のステップＳ２０２で説明したパラメータＸによって制御できる点である。 Further, the new point in the clustering of the present embodiment is that the number of edges of the user graph can be controlled by the parameter X described in step S202 of FIG.

（６−２）次にステップＳ４０３〜Ｓ４０６では、前記ステップＳ４０２のクラスタリングの結果を利用して推薦アイテムおよびそのアイテムの背景情報を生成し、それら推薦アイテムおよび背景情報を前記推薦スコア計算器３０により決定された推薦対象リストに反映させて推薦リストとして出力する。 (6-2) Next, in steps S403 to S406, a recommended item and background information of the item are generated using the result of clustering in step S402, and the recommended item and background information are generated by the recommended score calculator 30. The result is reflected in the determined recommendation target list and output as a recommendation list.

すなわち、ステップＳ４０３ではユーザ毎の所属クラスタを識別し、ステップＳ４０４では各クラスタの中心ユーザを識別し、ステップＳ４０５では各クラスタの持つクラスを識別し、ステップ４０６では推薦アイテムおよび背景情報を推薦リストに反映させ、出力する（ユーザに提示する）。 That is, the cluster to which each user belongs is identified in step S403, the central user of each cluster is identified in step S404, the class possessed by each cluster is identified in step S405, and the recommended item and background information are added to the recommendation list in step 406. Reflect and output (present to user).

前記推薦アイテムの背景情報提示の実施態様は次のとおりである。 The embodiment of the background information presentation of the recommended item is as follows.

すなわち、前記クラスタリング結果を利用し、ランダムホップした先で発見されたインスタンスが自分が所属するクラスタにも存在するものであるのか？もし、そうでないならば、どのクラスタから発見されたものなのか？を明示する。 That is, using the clustering result, does the instance found at the random hop destination also exist in the cluster to which it belongs? If not, from which cluster is it discovered? Is specified.

明示する方法としては、まず、（１）異なるクラスタからのみ発見されるものであるという情報を提示する。（２）どのクラスタからか、ということに関しては、発見基クラスタの中でもエッジを多く持つ中心ユーザ（もしくは、そのユーザの持つ、インスタンス集合）を例示することで、どういう情報かを明示する。さらに、そのクラスタの持つインスタンスセット全てを提示するのは、被推薦ユーザが受ける情報量が多すぎるため、そのクラスタが多く持つクラスの情報を提示する。 As an explicit method, first, (1) information indicating that it is found only from different clusters is presented. (2) As to which cluster it is from, the central user (or the instance set possessed by the user) having many edges in the discovery base cluster is exemplified to clearly indicate what information. Furthermore, since all the instance sets possessed by the cluster are presented with the amount of information received by the recommended user, the class information possessed by the cluster is presented.

図１０は、推薦アイテムの所属クラスタ特定器４０から出力される推薦リストの一例を示している。図１０において、「Ａｒｔｉｓｔａ〜ｄ」は推薦ランキング上位から下位のインスタンス（推薦コンテンツ）であり、()内の「Ａさんの持たない（又は持つ）クラス」は、当該推薦コンテンツが、自分が所属するクラスタにも存在するか否かを表し、「ａｎｄｃｌｕｓｔｅｒＣ２〜Ｃ４」はどのクラスタから発見されたかを表し、「中心ユーザＢ〜Ｅ」は発見されたクラスタの中でもエッジを多く持つ中心ユーザ名を表している。 FIG. 10 shows an example of a recommendation list output from the recommended item belonging cluster identification device 40. In FIG. 10, “Artist a to d” are instances (recommended content) from the top to the bottom of the recommended ranking, and “class A does not have (or has)” in () indicates that the recommended content is It indicates whether or not the cluster to which it belongs also, “and cluster C2 to C4” indicates from which cluster it was discovered, and “central users B to E” are central users with many edges among the discovered clusters Represents a name.

本発明の実施形態例のように、タクソノミに基づくユーザ興味を使うことによる、ｓｐａｒｓｉｔｙｐｒｏｂｌｅｍ（スパース問題）への効果は次のとおりである。 As in the embodiment of the present invention, the effect on the parity problem by using the user interest based on the taxonomy is as follows.

タクソノミに基づくユーザ興味を、特に、推薦システムにおいて、ユーザの興味をシステムが十分に把握していない段階（ｓｐａｒｓｉｔｙｐｒｏｂｌｅｍが発生しやすい状態ともいえるし、多様な興味を持つユーザのユーザ興味をシステムが一部しか把握できていない状態ともいえる）において、特に有効利用できることを、図１１のグラフを用いて示す。 It can be said that user interest based on taxonomy, particularly in the recommendation system, is a state where the system does not sufficiently grasp the user's interest (sparity problem is likely to occur) It can be said that it can be effectively used in a state in which only a part can be grasped) using the graph of FIG.

図１１は、ある情報推薦システムが把握しているユーザ興味を、システムが１／１０から９／１０まで把握している状況の１０種類に分け、それぞれにおいて、残り９／１０から１／１０まで、システムが被推薦ユーザに、適したコンテンツを推薦できているかどうかを計算したものであり、Ａｍａｚｏｎなどで使われている一般的な協調フィルタリング（非特許文献５参照）と、本発明のタクソノミに基づくユーザ興味、およびその類似度を基に、推薦を実施した場合との比較を行ったものである。 FIG. 11 divides user interests ascertained by a certain information recommendation system into 10 types of situations where the system grasps from 1/10 to 9/10, and in each of the remaining 9/10 to 1/10 The system calculates whether or not the user can recommend a suitable content to the recommended user. In general collaborative filtering (see Non-Patent Document 5) used in Amazon and the like, and the taxonomy of the present invention The comparison is made with the recommendation based on the user interest and the similarity degree.

図１１（ａ）は全ユーザ（３５００人）を対象とした場合、図１１（ｂ）は興味が少ないユーザ（２９００人）を対象とした場合の各比較検証を表し、実線は本発明による推薦誤差を示し、破線はＡｍａｚｏｎなどの従来技術による推薦誤差を示している
図１１（ａ）によると、ユーザ興味をシステムが把握できていない場合（Ｘ軸が小さい場合）において、特に、推薦誤差（平均絶対誤差ＭＡＥ）が小さいことがわかる。 FIG. 11A shows comparison verifications when all users (3500) are targeted, and FIG. 11B shows comparison verifications when less interested users (2900) are targeted, and the solid line indicates a recommendation according to the present invention. FIG. 11A shows a recommendation error (when the X-axis is small), in particular, a recommendation error (when the X-axis is small). It can be seen that the average absolute error MAE) is small.

また図１１（ｂ）によると、ユーザ興味の量が少ない（システムが把握できていない）ユーザに焦点をあてると、常にＭＡＥがよいことがわかる。 Further, according to FIG. 11B, it can be seen that the MAE is always good when focusing on a user who has a small amount of user interest (the system cannot be grasped).

このことより、本発明の手法は、ユーザ興味をシステムが十分に把握できていない場合に有効であることがわかる。 From this, it can be seen that the method of the present invention is effective when the system does not sufficiently grasp the user's interest.

次に、ユーザグラフにおけるランダムウォークを用いた推薦システムに対しても、本発明のタクソノミを用いたユーザ興味を用いて推薦精度を検証した。前記と同様に、あるシステムが把握しているユーザ興味をシステムが１／１０から９／１０まで把握している状況の１０種類に分け、それぞれにおいて、残り９／１０から１／１０まで、システムが被推薦ユーザに、適したコンテンツを推薦できているかどうかを計算した。 Next, the recommendation accuracy of the recommendation system using the random walk in the user graph was verified using the user interest using the taxonomy of the present invention. Similarly to the above, the user's interest that a certain system grasps is divided into 10 types of situations that the system grasps from 1/10 to 9/10, and in each of the remaining 9/10 to 1/10, the system Calculated whether it was possible to recommend suitable content to the recommended users.

インスタンスのみのマッチングを用いた従来手法では、特に、ユーザの興味をシステムが把握していない場合、ユーザ間を結ぶエッジの数が減る。一方、タクソノミを用いた場合は、インスタンスのみでなく、クラスでも繋がるユーザであれば、エッジが引かれる可能性がある。 In the conventional method using the instance-only matching, particularly when the system does not grasp the user's interest, the number of edges connecting the users is reduced. On the other hand, when a taxonomy is used, an edge may be drawn if the user is connected not only by an instance but also by a class.

検証結果では、グラフベースにおいても、特に、リスタートのパラメータを０．４から０．６程度にした（つまり、ほどほどにｒｅｓｔａｒｔｓする）場合で、かつ、ユーザ興味をシステムが、１／１０から４／１０程度しか把握できていない場合、本発明の手法の方が精度がいいことがわかった。さらに、前記グラフベースの場合よりも、精度（ＭＡＥを基にした）はさらによい。このことから、本発明の手法は、グラフベースにおいて、さらに、ユーザの潜在興味を発見するのに適していることがわかる。 As a result of the verification, even in the graph base, in particular, when the restart parameter is set to about 0.4 to 0.6 (that is, restarts moderately) and the user's interest is 1/10 to 4 When only about / 10 was grasped, it was found that the method of the present invention had better accuracy. Furthermore, the accuracy (based on MAE) is better than in the case of the graph base. From this, it can be seen that the method of the present invention is suitable for discovering a user's potential interest in the graph base.

また、本実施形態の興味情報推薦装置における各手段の一部もしくは全部の機能をコンピュータのプログラムで構成し、そのプログラムをコンピュータを用いて実行して本発明を実現することができること、本実施形態の興味情報推薦方法における手順をコンピュータのプログラムで構成し、そのプログラムをコンピュータに実行させることができることは言うまでもなく、コンピュータでその機能を実現するためのプログラムを、そのコンピュータが読み取り可能な記録媒体、例えばＦＤ（Ｆｌｏｐｐｙ（登録商標）Ｄｉｓｋ）や、ＭＯ（Ｍａｇｎｅｔｏ−Ｏｐｔｉｃａｌｄｉｓｋ）、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、メモリカード、ＣＤ（ＣｏｍｐａｃｔＤｉｓｋ）−ＲＯＭ、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｋ）−ＲＯＭ、ＣＤ−Ｒ、ＣＤ−ＲＷ、ＨＤＤ、リムーバブルディスクなどに記録して、保存したり、配布したりすることが可能である。また、上記のプログラムをインターネットや電子メールなど、ネットワークを通して提供することも可能である。 Also, the present invention can be realized by configuring some or all of the functions of each means in the interest information recommendation apparatus of the present embodiment with a computer program and executing the program using the computer. It is needless to say that the procedure in the interest information recommendation method can be configured by a computer program and the program can be executed by the computer, and the program for realizing the function by the computer can be read by the computer, For example, FD (Floppy (registered trademark) Disk), MO (Magneto-Optical disk), ROM (Read Only Memory), memory card, CD (Compact Disk) -ROM, DVD (Digital Versati) e Disk) -ROM, CD-R, CD-RW, HDD, and recorded in a removable disk, or stored, it is possible or distribute. It is also possible to provide the above program through a network such as the Internet or electronic mail.

今後の産業発展の基盤の一つである情報通信システムは、現在急激な成長の過程にあるが、膨大な量に拡大した情報通信によるアクセス可能な情報源を充分に活用出来ているかどうかが現在大問題になっている。本発明は、パーソナルオントロジを簡易に作成して自己の興味にマッチングする情報を個人毎が興味を持つ対象、およびその対象への個人の思い入れの強さまで考慮したパーソナライゼーションを行ったうえで、パーソナライズされた興味オントロジ間の近さに基づく情報推薦やコミュニティ形成を行うことができ、情報通信システムのもつ情報源から自己の興味にマッチングする情報を自動的にあるいは効率的に活用できるようになる可能性がある。 The information communication system, which is one of the foundations of future industrial development, is currently undergoing rapid growth, but whether or not the information source that can be accessed by the information communication that has been expanded to an enormous amount can be fully utilized. It has become a big problem. The present invention personalizes personal information after easily creating a personal ontology and personalizing information that matches the interests of the individual, taking into account the interest of each individual and the strength of the individual's feelings about the subject. It is possible to recommend information and create a community based on the proximity of interest interests that have been created, and to automatically or efficiently utilize information that matches your interests from information sources of the information communication system There is sex.

１…通信網
２〜４…端末
５…サーバ
６…雛形オントロジ
７〜９…ブログサイト
１０…ユーザ興味類似度計測器
２０…ユーザ興味相関グラフ生成器
３０…推薦スコア計算器
４０…推薦アイテムの所属クラスタ特定器 DESCRIPTION OF SYMBOLS 1 ... Communication network 2-4 ... Terminal 5 ... Server 6 ... Model ontology 7-9 ... Blog site 10 ... User interest similarity measuring device 20 ... User interest correlation graph generator 30 ... Recommended score calculator 40 ... Affiliation of recommended item Cluster identifier

Claims

A personal ontology that conceptually stratifies the interest information constructed for each user is generated, and the taxonomy is created in consideration of the class that includes the words in the interest information, the degree of interest for each instance, and the topology that is the connection form of the class. User interest similarity measurement means for measuring the interest similarity for each user based on;
Based on the user interest similarity for each user measured by the user interest similarity measurement means, a user interest correlation graph is generated by setting an edge between the users as a node and a weight based on the similarity as a score. A user interest correlation graph generating means,
Random walk on the graph generated by the user interest correlation graph generation means, add the content of interest of the user who has followed to the recommendation target list, based on the count number of each content and the score of the edge between the users A recommendation score calculating means for calculating a recommendation score of the content of interest;
Clustering the graph generated by the user interest correlation graph generating means, and based on the clustering result, a recommended item belonging cluster specifying means for creating a recommended list including recommended items and background information of the items;
An interest information recommendation device comprising:

The user interest similarity measuring means includes:
A topology T ₁ composed of a parent class and a child class of each generated user ontology and a topology T ₂ composed of classes and instances are defined, and a common class is analyzed among a plurality of users' personal ontology to analyze the topology T _1. And the common class forming topology T ₂ are extracted,
Based on the product set and union of the child classes under the common class of the extracted two users' personal ontology, the interest coincidence under the common class is calculated, and the interest coincidence in all common classes forming the topology T ₁ The sum of degrees is calculated, and the interest matching degree S (T ₁ ) for the common class set forming the topology T ₁ is calculated.
Of the extracted common classes of two users' personal ontology, a common class is subordinate based on an instance set under a common class in one user's personal ontology and an instance set under a common class in the other user's personal ontology. The interest coincidence S is calculated, the sum of the interest coincidence in all the common classes forming the topology T ₂ is obtained, and the interest coincidence S (T ₂ ) for the common class set forming the topology T ₂ is calculated.
The degree of approximation So ontology between the ontology is calculated based on the calculated interest coincidence S (T ₁ ) and S (T ₂ ) and the evaluation function f (X) corresponding to the importance for the topologies T ₁ and T ₂ . The interest information recommendation device according to claim 1, wherein (T ₁ ) + f (S (T ₂ )) is obtained.

The user interest correlation graph generation means includes:
Count the types of instances that each user has in order of the users with the highest interest similarity obtained by the user interest similarity measurement means, determine the counted number of users X immediately before the number of types of instances rapidly increases,
An edge is set between the X users uj having a high degree of similarity when viewed from a certain user ui,
The interest information recommendation device according to claim 1 or 2, wherein a value obtained by normalizing the similarity of the user uj viewed from the user ui is given to the edge as a weight.

The background information of the recommended item in the recommended list created by the recommended item belonging cluster specifying means is whether the user's content of interest added to the recommended target list also exists in the cluster to which the recommended item belongs. 4. The information according to claim 1, including information indicating which cluster has been found when it does not exist, and information on a central user having many edges in the discovery base cluster. 5. The interest information recommendation device described.

The user interest similarity measurer generates a personal ontology in which the interest information constructed for each user is conceptually hierarchized, and includes a class including words in the interest information, an interest degree for each instance, and a connection form of the classes. Considering the topology, a user interest similarity measurement step for measuring the interest similarity for each user based on the taxonomy,
The user interest correlation graph generation means sets an edge between users based on the user interest similarity for each user measured by the user interest similarity measurement means and having a user as a node and a weight based on the similarity as a score. A user interest correlation graph generating step for generating a user interest correlation graph;
The recommendation score calculation means randomly walks on the graph generated by the user interest correlation graph generation means, adds the content of interest of the user who has been traced to the recommendation target list, the count number of each content, and the edge between the users A recommendation score calculating step of calculating a recommendation score of the content of interest based on the score of
The recommended item affiliation cluster specifying unit clusters the graph generated by the user interest correlation graph generation unit, and creates a recommendation list including a recommended item and background information of the item based on the clustering result. Belonging cluster identification step,
An interest information recommendation method characterized by comprising:

The user interest similarity measurement step includes:
A topology T ₁ composed of a parent class and a child class of each generated user ontology and a topology T ₂ composed of classes and instances are defined, and a common class is analyzed among a plurality of users' personal ontology to analyze the topology T _1. Extracting a common class forming a topology and a common class forming a topology T ₂ ;
Based on the product set and union of the child classes under the common class of the extracted two users' personal ontology, the interest coincidence under the common class is calculated, and the interest coincidence in all common classes forming the topology T ₁ Calculating an interest matching degree S (T ₁ ) for the common class set forming the topology T ₁ by obtaining a sum of degrees;
Of the extracted common classes of two users' personal ontology, a common class is subordinate based on an instance set under a common class in one user's personal ontology and an instance set under a common class in the other user's personal ontology. the step of interest matching degree calculated by the total sum of the interest matching score at all common classes of forming the topology T _2, to calculate an interest coincidence degree S (T ₂₎ to a common class set forming a topology T ₂ of the When,
The degree of approximation So ontology between the ontology is calculated based on the calculated interest coincidence S (T ₁ ) and S (T ₂ ) and the evaluation function f (X) corresponding to the importance for the topologies T ₁ and T ₂ . The interest information recommendation method according to claim 5, further comprising a step of obtaining (T ₁ ) + f (S (T ₂ )).

The user interest correlation graph generation step includes:
Steps of counting the types of instances possessed by each user in descending order of the users' interest similarity obtained by the user interest similarity measurement means, and determining the counted number of users X immediately before the number of types of instances rapidly increases When,
Setting an edge with the X users uj having a high degree of similarity as viewed from a certain user ui;
Giving the edge a value obtained by normalizing the similarity of the user uj viewed from the user ui as a weight;
The interest information recommendation method according to claim 5 or 6, characterized by comprising:

The background information of the recommended item in the recommendation list created by the recommended item belonging cluster specifying step is whether or not the content of interest of the user added to the recommendation target list exists in the cluster to which the recommended item belongs. 8. The information according to any one of claims 5 to 7, including information indicating which cluster is found when it does not exist, and information on a central user having many edges in the discovery base cluster. The interest information recommendation method described.

An interest information recommendation program for causing a computer to function as each means according to any one of claims 1 to 4.