JP2013250672A

JP2013250672A - Interest analysis method

Info

Publication number: JP2013250672A
Application number: JP2012123669A
Authority: JP
Inventors: Koji Ito; 浩二伊藤; Masanari Fujita; 将成藤田; Tae Sato; 妙佐藤
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2012-05-30
Filing date: 2012-05-30
Publication date: 2013-12-12
Anticipated expiration: 2032-05-30
Also published as: JP5723830B2

Abstract

PROBLEM TO BE SOLVED: To accurately present information desired by a user at that point of time.SOLUTION: A feature score calculation part 120 acquires a first condition list in which a plurality of concepts as the selection candidates of narrowing-down conditions in a concept system are browsed as a list and a second condition list including the concept selected by a user from this, and searches analysis parameters S', N', a' and n' from the first condition list and the second condition list, and calculates first probability that the number of the concepts which appear in the second condition list becomes n' or more and second probability that the number of the concepts which appear in the second condition list becomes n' or less under the conditions of the analysis parameters S', N' and a', and calculates feature scores from the inverse functions of the cumulative distribution functions of standard normal distribution on the basis of the first probability and the second probability. A concept system update processing part 130 updates user interest scores with respect to the concept by using the calculated feature scores.

Description

この発明は、コンピュータネットワーク上において、情報提供者が情報利用者毎に適する情報を選択して提示する情報推薦サービスに用いられる興味分析方法に関する。 The present invention relates to an interest analysis method used in an information recommendation service in which an information provider selects and presents information suitable for each information user on a computer network.

情報提供者が情報利用者ごとに適する情報を選択し、あるいは情報利用者ごとに適する順番に並び替えて情報を表示する方法として、各コンテンツに内容をサマライズするメタ情報が付与されていることを前提として、ユーザ履歴において出現する概念等の頻度からユーザの興味を推定する方法は、内容ベースフィルタリング手法（Content Based Filtering：ＣＢＦ）で、特にメモリベース手法として研究が進められている。 Meta information that summarizes the content is added to each content as a method of displaying information by selecting information suitable for each information user or rearranging in a suitable order for each information user. As a premise, a method for estimating the user's interest from the frequency of the concept or the like appearing in the user history is a content-based filtering method (Content Based Filtering: CBF), and research is being conducted particularly as a memory-based method.

具体的に、内容ベースフィルタリング技術とは、例えば特定ブランド（ブランドを示す情報を概念タグとして保持）の商品を閲覧した場合に、同じブランドの商品（同じ概念タグを保持）を提示する。この場合のメモリベース手法は、過去に閲覧した履歴から特定ブランドを頻繁に閲覧していれば、特定ブランドの商品を提示することとなる。単純な方法では、閲覧履歴により多く出現した概念タグに関連する商品を提示することとなる。適合性フィードバックと統計的仮説検定を利用した方法としては、例えば、非特許文献１の「適合性フィードバックと統計的仮説検定を利用したニュース推薦手法の提案」がある。 Specifically, the content-based filtering technique presents products of the same brand (holding the same concept tag) when browsing products of a specific brand (holding information indicating the brand as a concept tag). The memory-based method in this case presents a product of a specific brand if the specific brand is frequently browsed from the history of browsing in the past. In a simple method, products related to concept tags that appear more frequently in the browsing history are presented. As a method using relevance feedback and statistical hypothesis testing, for example, Non-Patent Document 1 “Proposal of News Recommendation Method Using Relevance Feedback and Statistical Hypothesis Test” is available.

「適合性フィードバックと統計的仮説検定を利用したニュース推薦手法の提案」，第二種研究会資料，WI2-2012-19，pp.53-58，2012“Proposal of News Recommendation Method Using Relevance Feedback and Statistical Hypothesis Testing”, Type 2 Study Group Material, WI2-2012-19, pp.53-58, 2012

ところが、上記従来技術の手法において、大量かつ詳細な履歴が得られたとしても、ユーザの嗜好はその時々の気分によっても変化する場合があるため、過去の履歴のみからではユーザの興味を常に正確に推定することはできず、ユーザがその時点では望まない情報が提示される可能性がある。 However, even if a large amount of detailed history is obtained in the above-described conventional technique, the user's preference may change depending on the mood at that time. Information that the user does not want at that time may be presented.

この発明は上記事情に着目してなされたもので、その目的とするところは、ユーザがその時点で望む情報を精度良く提示するための興味分析方法を提供することにある。 The present invention has been made paying attention to the above circumstances, and an object thereof is to provide an interest analysis method for accurately presenting information desired by a user at that time.

本発明の第１の態様は、コンピュータによって複数の概念に対するユーザ興味スコアを体系化した概念体系を用いてユーザの興味を分析する方法であって、前記概念体系における絞り込み条件の選択候補となる複数の概念を一覧として閲覧した第１の条件リストと、前記第１の条件リストから前記ユーザにより選択された概念を含む第２の条件リストとを取得するステップと、前記第１の条件リストに含まれる概念の総数を第１の総数と、前記第１の条件リストにおいて前記ユーザにより選択された概念が出現する数を第１の出現数と、前記第２の条件リストに含まれる概念の総数を第２の総数と、前記第２の条件リストにおいて前記ユーザにより選択された概念が出現する数を第２の出現数としたとき、前記第１の総数、前記第１の出現数、及び前記第２の総数の条件下で、前記第２の条件リストに前記概念が出現する数が、前記第２の出現数以上となる第１の確率及び前記第２の出現数以下となる第２の確率を算出し、前記第１の確率及び前記第２の確率をもとに標準正規分布の累積分布関数の逆関数により特徴スコアを算出する算出ステップと、前記特徴スコアを用いて前記概念に対する前記ユーザ興味スコアを更新する更新ステップとを有するものである。 A first aspect of the present invention is a method for analyzing a user's interest using a concept system in which user interest scores for a plurality of concepts are systematized by a computer, and a plurality of candidates for selection of narrowing conditions in the concept system. A first condition list obtained by browsing the concepts as a list, a second condition list including the concept selected by the user from the first condition list, and included in the first condition list The total number of concepts to be generated is the first total number, the number of occurrences of the concept selected by the user in the first condition list is the first appearance number, and the total number of concepts included in the second condition list is When the second total number and the number of occurrences of the concept selected by the user in the second condition list are the second number of appearances, the first total number and the first number of appearances And under the condition of the second total number, the number of occurrences of the concept in the second condition list is a first probability that the number of occurrences is equal to or greater than the second number of occurrences, and a number that is equal to or less than the second number of occurrences. A calculation step of calculating a feature score by an inverse function of a cumulative distribution function of a standard normal distribution based on the first probability and the second probability, and the concept using the feature score And updating the user interest score for.

第１の態様によれば、ユーザが第１の条件リストの選択候補から絞り込み条件を選択した場合、絞り込み条件の指定について、偶然と比べて比較的“選ぶ”あるいは、“選ばない”程度を分析して特徴スコアを求め、この特徴スコアを用いてユーザの興味を推定するものであって、コンテンツ選択履歴による興味推定と併せて用いる事に拠り、ユーザが絞り込み条件を指定した場合であっても、ユーザの興味を合理的かつ的確に分析可能となる。 According to the first aspect, when the user selects a narrowing-down condition from the selection candidates in the first condition list, the specification of the narrowing-down condition is comparatively “selected” or “not selected” compared to chance. The feature score is obtained, and the user's interest is estimated using the feature score. Even if the user designates a narrowing condition based on the use of the feature score together with the interest estimation based on the content selection history, The user's interests can be analyzed reasonably and accurately.

本発明の第２の態様は、コンピュータによって複数の概念に対するユーザ興味スコアを体系化した概念体系を用いてユーザの興味を分析する方法であって、前記概念体系における除外条件の選択候補となる複数の概念を一覧として閲覧した第１の条件リストと、前記第１の条件リストから前記ユーザにより選択された第２の条件リストとを取得するステップと、前記第１の条件リストに含まれる概念の総数を第１の総数と、前記第１の条件リストにおいて前記ユーザにより選択された概念が出現する数を第１の出現数と、前記第２の条件リストに含まれる概念の総数を第２の総数と、前記第２の条件リストにおいて前記ユーザにより選択された概念が出現する数を第２の出現数としたとき、前記第１の総数、前記第１の出現数、及び前記第２の総数の条件下で、前記第２の条件リストに前記概念が出現する数が、前記第２の出現数以上となる第１の確率及び前記第２の出現数以下となる第２の確率を算出し、前記第１の確率及び前記第２の確率をもとに標準正規分布の累積分布関数の逆関数により特徴スコアを算出する算出ステップと、前記特徴スコアを用いて前記概念に対する前記ユーザ興味スコアを更新する更新ステップとを有するものである。 A second aspect of the present invention is a method for analyzing user interests using a concept system in which user interest scores for a plurality of concepts are systematized by a computer, and a plurality of candidates for selection of exclusion conditions in the concept system. A first condition list obtained by browsing the concepts as a list, a second condition list selected by the user from the first condition list, and a concept included in the first condition list The total number is the first total number, the number of occurrences of the concept selected by the user in the first condition list is the first appearance number, and the total number of concepts included in the second condition list is the second number. When the total number and the number of appearances of the concept selected by the user in the second condition list are defined as the second number of appearances, the first total number, the first number of appearances, and the second number A first probability that the number of occurrences of the concept in the second condition list is greater than or equal to the second occurrence number and a second probability that is less than or equal to the second occurrence number under the condition of the total number A calculation step of calculating a feature score by an inverse function of a cumulative distribution function of a standard normal distribution based on the first probability and the second probability, and the user interest score for the concept using the feature score And an update step for updating.

第２の態様によれば、ユーザが第１の条件リストの選択候補から除外条件を選択した場合、偶然と比べて比較的“選ぶ”あるいは、“選ばない”という特徴を活用し、ユーザの興味を推定するものであって、コンテンツ選択履歴による興味推定と併せて用いる事に拠り、ユーザが絞り込み条件を指定した場合であっても、ユーザの興味を合理的かつ的確に分析可能となる。 According to the second aspect, when the user selects an exclusion condition from the selection candidates in the first condition list, the user's interest is utilized by utilizing the feature of “select” or “not select” relatively compared to chance. The user's interest can be analyzed reasonably and accurately even when the user designates the narrowing-down condition by using it together with the interest estimation based on the content selection history.

本発明の第３の態様は、上記第１又は第２の態様において、前記算出ステップは、前記第２の条件リストに含まれる概念をもとに前記概念体系における選択経路を特定し、前記選択経路に発生する各選択に対応する各概念について前記特徴スコアを算出するものである。 According to a third aspect of the present invention, in the first or second aspect, the calculating step specifies a selection path in the concept system based on a concept included in the second condition list, and the selection The feature score is calculated for each concept corresponding to each selection occurring in the route.

第３の態様によれば、概念体系における上位概念からの絞り込み条件又は除外条件の選択経路を特定し、それらの各選択についても興味推定に利用することにより、興味推定精度を高めることが可能となる。 According to the third aspect, it is possible to increase the accuracy of interest estimation by specifying the selection path of the narrowing-down condition or exclusion condition from the superordinate concept in the concept system and using each of those selections for the interest estimation. Become.

本発明の第４の態様は、上記第１乃至第３のいずれかの態様において、前記更新ステップは、前記特徴スコアを用いて当該概念の下位概念のユーザ興味スコアを更新するものである。
第４の態様によれば、特徴スコアを用いて当該概念の下位概念のユーザ興味スコアを更新することで、選択した概念だけでなく、これらの下位概念についても合理的かつ的確に分析可能となる。 According to a fourth aspect of the present invention, in any one of the first to third aspects, the updating step updates a user interest score of a subordinate concept of the concept using the feature score.
According to the fourth aspect, by updating the user interest score of the subordinate concept of the concept using the feature score, not only the selected concept but also the subordinate concept can be analyzed reasonably and accurately. .

本発明の第５の態様は、上記第１乃至第４のいずれかの態様において、１つ以上の概念が出現するコンテンツについて、当該コンテンツに出現する各概念の前記ユーザ興味スコアを用いて、当該コンテンツに対するユーザの評価スコアを算出する評価ステップをさらに有するものである。
第５の態様によれば、算出されたユーザ興味スコアを用いてコンテンツに対するユーザの評価スコアを算出することで、ユーザの興味に合ったコンテンツを推薦することが可能となる。 According to a fifth aspect of the present invention, in any one of the first to fourth aspects described above, for the content in which one or more concepts appear, the user interest score of each concept that appears in the content is used. The method further includes an evaluation step of calculating a user evaluation score for the content.
According to the fifth aspect, it is possible to recommend content that matches the user's interest by calculating the user's evaluation score for the content using the calculated user interest score.

すなわちこの発明によれば、ユーザがその時点で望む情報を精度良く提示するための興味分析方法を提供することができる。 That is, according to the present invention, it is possible to provide an interest analysis method for accurately presenting information desired by a user at that time.

本発明に係る情報推薦システムの全体構成図。1 is an overall configuration diagram of an information recommendation system according to the present invention. 情報選択履歴のみを用いた情報推薦の課題を示す図。The figure which shows the subject of the information recommendation using only an information selection log | history. 情報選択履歴と明示的な条件指定を用いた情報推薦の概念を示す図。The figure which shows the concept of the information recommendation using information selection history and explicit condition designation. 図１に示す情報推薦システムの各装置の機能構成を示すブロック図。The block diagram which shows the function structure of each apparatus of the information recommendation system shown in FIG. コンテンツ要求データの一例を示す図。The figure which shows an example of content request data. クライアント端末上でのコンテンツ閲覧操作の一例を示す図。The figure which shows an example of content browsing operation on a client terminal. 一覧閲覧コンテンツリストのデータ構成例を示す図。The figure which shows the data structural example of a list browsing content list. 詳細閲覧コンテンツのデータ構成例を示す図。The figure which shows the data structural example of detailed browsing content. 条件一覧要求データの一例を示す図。The figure which shows an example of condition list request data. 条件選択履歴の収集方法を示す図。The figure which shows the collection method of condition selection log | history. 提示コンテンツリストのデータ構成例を示す図。The figure which shows the data structural example of a presentation content list. 閲覧履歴からユーザの興味を推定する場合の処理概要を示す図。The figure which shows the process outline in the case of estimating a user's interest from browsing history. 概念体系を用いた条件設定方法を示す図。The figure which shows the condition setting method using a conceptual system. 条件設定時の対象コンテンツの特定方法を示す図。The figure which shows the specific method of the target content at the time of condition setting. コンテンツデータベースの一例を示す図。The figure which shows an example of a content database. 概念体系／ユーザ興味スコアデータベースの一例を示す図。The figure which shows an example of a concept system / user interest score database. 履歴情報受信部の処理フローを示す図。The figure which shows the processing flow of a log | history information receiving part. 特徴スコア算出部の処理フローを示す図。The figure which shows the processing flow of a characteristic score calculation part. 分析パラメータリストのデータ構成例を示す図。The figure which shows the data structural example of an analysis parameter list. 特徴スコア算出部の動作を説明するための模式図。The schematic diagram for demonstrating operation | movement of a characteristic score calculation part. 特徴スコア算出処理の詳細を示す図。The figure which shows the detail of a characteristic score calculation process. 絞り込み条件選択時の希少性を用いた興味推定方法を示す図。The figure which shows the interest estimation method using the scarcity at the time of narrowing-down condition selection. 絞り込み条件選択経路の特定方法を示す図。The figure which shows the identification method of a narrowing-down condition selection path | route. 絞り込み条件指定時に行われた選択一覧を示す図。The figure which shows the selection list | wrist performed when narrowing-down conditions were designated. 絞り込み条件選択時の特徴スコアの計算結果を示す図。The figure which shows the calculation result of the characteristic score at the time of narrowing-down condition selection. 除外条件選択時の希少性を用いた興味推定方法を示す図。The figure which shows the interest estimation method using the rarity at the time of exclusion condition selection. 除外条件選択経路の特定方法を示す図。The figure which shows the identification method of an exclusion condition selection path | route. 除外条件指定時に行われた選択一覧を示す図。The figure which shows the selection list | wrist performed when exclusion conditions were designated. 除外条件選択時の特徴スコアの計算結果を示す図。The figure which shows the calculation result of the characteristic score at the time of exclusion condition selection. 除外条件選択時の特徴スコアの算出方法を示す図。The figure which shows the calculation method of the characteristic score at the time of exclusion condition selection. 概念体系更新処理部の処理フローを示す図。The figure which shows the processing flow of a concept system update process part. 概念体系更新処理の詳細を示す図。The figure which shows the detail of a concept system update process. 概念体系を用いた興味スコアの伝播処理を示す図。The figure which shows the propagation process of the interest score using a concept system. 絞り込み条件に関する全ての選択を確率結合した特徴スコアを示す図。The figure which shows the characteristic score which carried out the probability coupling | bonding of all the selections regarding a narrowing-down condition. 上位概念からの選択経路を特定しない場合の特徴スコアを示す図。The figure which shows the characteristic score when not selecting the selection path | route from a high-order concept. 除外条件に関する全ての選択を確率結合した特徴スコアを示す図。The figure which shows the characteristic score which stochastically combined all the selections regarding exclusion conditions. 上位概念からの選択経路を特定しない場合の特徴スコアを示す図。The figure which shows the characteristic score when not selecting the selection path | route from a high-order concept. 絞り込み条件に関する全ての選択を確率結合した特徴スコアを示す図。The figure which shows the characteristic score which carried out the probability coupling | bonding of all the selections regarding a narrowing-down condition. 上位概念からの選択経路を特定しない場合の特徴スコアを示す図。The figure which shows the characteristic score when not selecting the selection path | route from a high-order concept. コンテンツ評価処理部の処理フローを示す図。The figure which shows the processing flow of a content evaluation process part. コンテンツスコアリストの一例を示す図。The figure which shows an example of a content score list. コンテンツ評価処理の詳細を示す図。The figure which shows the detail of a content evaluation process. 条件設定時の興味学習に関する課題を示す図。The figure which shows the subject regarding the interest learning at the time of condition setting.

以下、図面を参照してこの発明に係る実施の形態について説明する。
図１は、本発明に係る情報推薦システムの全体構成図である。このシステムは、クライアント端末２００と、コンテンツサーバ３００と、興味分析装置１００とを備える。クライアント端末２００とコンテンツサーバ３００との間、及びコンテンツサーバ３００と興味分析装置１００との間はそれぞれ通信ネットワークで接続される。ユーザは、クライアント端末２００上での閲覧操作及び条件指定により、所望のコンテンツをコンテンツサーバ３００から取得し、取得したコンテンツをクライアント端末２００の画面に提示して閲覧することができる。 Embodiments according to the present invention will be described below with reference to the drawings.
FIG. 1 is an overall configuration diagram of an information recommendation system according to the present invention. This system includes a client terminal 200, a content server 300, and an interest analysis device 100. The client terminal 200 and the content server 300, and the content server 300 and the interest analysis device 100 are connected to each other via a communication network. The user can acquire desired content from the content server 300 by browsing operation and specifying a condition on the client terminal 200, and can display the acquired content on the screen of the client terminal 200 for browsing.

クライアント端末２００は、ユーザ操作によるコンテンツ閲覧履歴を収集し、複数のコンテンツを一覧として閲覧した一覧閲覧コンテンツリスト（第１のコンテンツリスト）と、コンテンツの一覧からコンテンツの本体を閲覧した詳細閲覧コンテンツリスト（第２のコンテンツリスト）とをコンテンツサーバ３００に送信する。コンテンツサーバ３００は、この一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストを、通信ネットワークを介して興味分析装置１００に転送する。 The client terminal 200 collects a content browsing history by a user operation, browses a plurality of content as a list, a list browsing content list (first content list), and a detailed browsing content list that browses the content body from the content list (Second content list) is transmitted to the content server 300. The content server 300 transfers the list browsing content list and the detailed browsing content list to the interest analysis device 100 via the communication network.

さらに、クライアント端末２００は、ユーザ操作による絞り込み条件あるいは除外条件指定履歴を収集し、ユーザが条件指定時に選択肢として閲覧した一覧閲覧条件リスト（第１の条件リスト）と、一覧閲覧条件リストから条件を選択した条件選択リスト（第２の条件リスト）とをコンテンツサーバ３００に送信する。コンテンツサーバ３００は、この一覧閲覧条件リスト及び条件選択リストを、通信ネットワークを介して興味分析装置１００に転送する。 Furthermore, the client terminal 200 collects a narrowing condition or exclusion condition designation history by a user operation, and selects a condition from the list browsing condition list (first condition list) browsed as an option when the user designates the condition and the list browsing condition list. The selected condition selection list (second condition list) is transmitted to the content server 300. The content server 300 transfers the list browsing condition list and the condition selection list to the interest analysis apparatus 100 via the communication network.

興味分析装置１００は、この一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストをもとに、コンテンツに出現する各概念に対する特徴スコアの算出及びユーザ興味スコアの更新を行い、ユーザの興味を推定する。また、条件指定があった場合は、興味分析装置１００は、一覧閲覧条件リスト及び条件選択リストをもとに、条件として選択した各概念に対する特徴スコアの算出及びユーザ興味スコアの更新を行い、ユーザの興味を推定する。 The interest analysis apparatus 100 calculates the feature score for each concept appearing in the content and updates the user interest score based on the list browsing content list and the detailed browsing content list, and estimates the user's interest. If there is a condition designation, the interest analysis device 100 calculates the feature score for each concept selected as the condition and updates the user interest score based on the list browsing condition list and the condition selection list, and the user interest score is updated. Estimate the interest.

興味分析装置１００は、上記ユーザ興味スコアに基づいて、コンテンツサーバ３００から受け取った「提示コンテンツリスト」から、ユーザの興味に合わせてソートを行ったコンテンツのリスト（ソート済み提示コンテンツリスト）を生成し、コンテンツサーバ３００に送信する。 The interest analysis device 100 generates a list of sorted contents (sorted presented content list) from the “presented content list” received from the content server 300 based on the user interest score, according to the user's interest. To the content server 300.

例えば、図２に示す事例では、日頃ラーメン店を選択する頻度が高いユーザが、その日に限ってはお酒を飲みたいとの希望を持つ場合、情報提供側は日頃の情報選択履歴に基づきユーザの興味を“ラーメン好き”と推定することになるため、ラーメン店を推薦する。この推薦結果はユーザの“お酒を飲みたい”という興味に合致していない。つまり、ユーザが希望する情報（この例の場合は飲み屋）を提示できない可能性がある。 For example, in the example shown in FIG. 2, when a user who frequently selects a ramen shop has a desire to drink alcohol only on that day, the information providing side uses the user's daily information selection history. I would recommend a ramen shop because I would presume that I am interested in ramen. This recommendation result does not match the user's interest to “drink”. That is, there is a possibility that the information desired by the user (in this example, a bar) cannot be presented.

そこで、本発明では、ユーザの情報選択履歴だけでなく、ユーザが示した明示的な条件指定も併せて考慮し、ユーザがその時点で望む情報を精度良く提示できるようにする。また、ユーザが明示的に行った条件指定の履歴もユーザの興味を推定するために有用な履歴である。従って、ユーザの閲覧履歴を用いた興味推定とユーザが行った条件指定履歴をと用いた興味推定を組み合わせて用いる事により、推薦精度を向上させる。 Therefore, in the present invention, not only the information selection history of the user but also the explicit condition specification indicated by the user is taken into consideration so that the information desired by the user can be presented with high accuracy. Further, the history of condition designation explicitly performed by the user is also a useful history for estimating the user's interest. Therefore, the recommendation accuracy is improved by combining the interest estimation using the user browsing history and the interest estimation using the condition designation history performed by the user.

例えば、図３に示すとおり、情報提供側は特定のユーザに関し「“全ジャンル”の中では“ラーメン”を選択する割合が高い」、「“お酒”の中では“ビール”を選択する割合が高い」、「“ビール”の中では、“エール”を選択する割合が高い」などの履歴に基づきユーザの興味を「“全ジャンル”の中では、“ラーメン好き”」、「“お酒”の中では“ビール”が好き」、「“ビール”の中では“エール”が好き」と推定する。この結果、ユーザによる“お酒を飲みたい”との明示的な条件指定を考慮し、推薦結果を“お酒”のみに限定した上、“お酒”の中でも特に“ビール”を、“ビール”の中でも、特に“エール”を推薦する事ができるため、ユーザが希望する情報（この例の場合は“ビール”、特に“エール”を提供する飲み屋）を提示することができる。 For example, as shown in FIG. 3, the information providing side “has a high ratio of selecting“ ramen ”among“ all genres ”” and “a ratio of selecting“ beer ”among“ alcohol ”for a specific user. Is “high”, “among“ beer ”,“ the rate of selecting “ale” is high ”,” and so on. "I like" beer "" and "I like" ale "in" beer "". As a result, in consideration of the explicit condition specification of “I want to drink alcohol” by the user, the recommendation result is limited to only “alcohol”, and “beer” in “alcohol” is especially “beer” Since “Ale” can be recommended in particular, information desired by the user (in this example, “beer”, in particular, a bar where “Ale” is provided) can be presented.

図４は、図１に示す情報推薦システムの各装置の機能構成を示すブロック図である。なお、図４における各部は、例えば、各装置のＣＰＵ（Central Processing Unit）とメモリ上で実行される制御プログラムにより実現することができる。以下、各装置の詳細について説明を行う。 FIG. 4 is a block diagram showing a functional configuration of each device of the information recommendation system shown in FIG. Each unit in FIG. 4 can be realized by, for example, a CPU (Central Processing Unit) of each device and a control program executed on a memory. Details of each device will be described below.

［クライアント端末］
図４において、クライアント端末２００は、履歴収集部２１０、履歴情報送信部２２０、コンテンツ提示部２３０、及びコンテンツ要求送信部２４０、一覧閲覧条件リスト要求送信部２５０、一覧閲覧条件リスト提示部２６０、閲覧条件選択履歴収集部２７０及び条件選択履歴送信部２８０を備える。 [Client terminal]
4, the client terminal 200 includes a history collection unit 210, a history information transmission unit 220, a content presentation unit 230, a content request transmission unit 240, a list browsing condition list request transmission unit 250, a list browsing condition list presentation unit 260, a browsing. A condition selection history collection unit 270 and a condition selection history transmission unit 280 are provided.

コンテンツ要求送信部２４０は、ユーザの指示（入力）によりコンテンツサーバ３００に対して、コンテンツの提示要求を行う。具体的には図５のようなコンテンツ要求データをコンテンツサーバ３００に送信する。例えば、コンテンツ要求データは、ユーザＩＤ（もしくは、クライアント端末ＩＤ）及び要求時刻を有する。なお、要求時刻は、コンテンツサーバ３００において追加するようにしてもよい。ユーザＩＤ（もしくは、クライアント端末ＩＤ）は、端末（もしくはユーザ）毎に一意に付与される数字あるいは文字列であって、後述する概念体系／ユーザ興味スコアデータベース１４０のユーザ興味スコアテーブルのユーザＩＤと一致するＩＤである。 The content request transmission unit 240 makes a content presentation request to the content server 300 in accordance with a user instruction (input). Specifically, content request data as shown in FIG. 5 is transmitted to the content server 300. For example, the content request data has a user ID (or client terminal ID) and a request time. The request time may be added in the content server 300. The user ID (or client terminal ID) is a number or character string that is uniquely assigned to each terminal (or user), and is a user ID in a user interest score table of the conceptual system / user interest score database 140 described later. The matching ID.

コンテンツ提示部２３０は、コンテンツサーバ３００から受信したソート済み提示コンテンツリストをもとに、クライアント端末２００の表示画面サイズが許容する範囲でソート順の上位から一覧として表示を行う。図６は、クライアント端末２００上でのユーザによるコンテンツ閲覧操作の一例を示したものである。 Based on the sorted presentation content list received from the content server 300, the content presentation unit 230 displays a list from the top of the sort order within the range allowed by the display screen size of the client terminal 200. FIG. 6 shows an example of a content browsing operation by the user on the client terminal 200.

図６の例では、１０個のコンテンツ（コンテンツ１〜１０）が一覧表示されている。ユーザのフリック、スクロールバーの操作等で一覧によりソート順下位のコンテンツを表示することができる。このように実際にクライアント端末２００に表示されたコンテンツのリストを一覧閲覧コンテンツリストとする。つまり、ソート済み提示コンテンツリスト内のすべてのコンテンツがクライアント端末２００で表示されるとは限らないため、一覧閲覧コンテンツリストに含まれるとは限らない。ユーザがこの一覧から各コンテンツのタイトルをクリック操作等で選択すると、選択されたタイトルのコンテンツ（図６のコンテンツ３，５，６）の本体（詳細）を閲覧することができる。この詳細を閲覧したコンテンツを、詳細閲覧コンテンツリストに含む。 In the example of FIG. 6, ten contents (contents 1 to 10) are displayed in a list. The content in the lower order of the sort order can be displayed by the list by the user's flick, scroll bar operation or the like. The list of contents actually displayed on the client terminal 200 in this way is referred to as a list browsing content list. That is, not all the contents in the sorted presentation content list are displayed on the client terminal 200, and thus are not necessarily included in the list browsing content list. When the user selects a title of each content from this list by clicking or the like, the main body (details) of the content of the selected title (contents 3, 5, and 6 in FIG. 6) can be viewed. The content whose details are browsed is included in the detailed browsing content list.

履歴収集部２１０は、上述したように、ユーザの操作履歴を収集して一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストを作成する。履歴情報送信部２２０は、履歴収集部２１０により作成された一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストをユーザＩＤ（もしくは、クライアント端末ＩＤ）と共にコンテンツサーバ３００に送信する。 As described above, the history collection unit 210 collects user operation histories and creates a list browsing content list and a detailed browsing content list. The history information transmission unit 220 transmits the list browsing content list and the detailed browsing content list created by the history collection unit 210 to the content server 300 together with the user ID (or client terminal ID).

図７に、上記図６の場合の一覧閲覧コンテンツリストのデータ構成例を示す。一覧閲覧コンテンツリストは、クラスタＩＤ、コンテンツＩＤ、及び閲覧時刻を有する。クラスタとは、一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストに一意に付与される識別子（図７では“１”）である。別の時刻（時間帯）に表示した一覧閲覧コンテンツをユーザが閲覧した場合は、別のクラスタＩＤが付与される。なお、時刻以外の条件でクラスタＩＤを新たに付与する条件としては、一覧閲覧コンテンツリスト表示中に一定時間操作が無かった場合や、閲覧するユーザ（ユーザＩＤ）を切り替えた場合、一覧閲覧コンテンツリストに対して、絞り込み条件、あるいは除外条件の指定により、コンテンツジャンル等を絞り込んだ場合、その他閲覧アプリケーションにおいて閲覧モードを切り替えた場合がある。コンテンツＩＤは、一覧閲覧コンテンツの各コンテンツに一意に付与された識別子であり、後述するコンテンツデータベース１６０が保持する値と一致するものとする。 FIG. 7 shows a data configuration example of the list browsing content list in the case of FIG. The list browsing content list has a cluster ID, a content ID, and a browsing time. The cluster is an identifier ("1" in FIG. 7) uniquely assigned to the list browsing content list and the detailed browsing content list. When the user browses the list browsing content displayed at another time (time zone), another cluster ID is given. The conditions for newly assigning the cluster ID under conditions other than the time include when there is no operation for a certain period of time while the list browsing content list is displayed, or when the browsing user (user ID) is switched, the list browsing content list On the other hand, when the content genre or the like is narrowed down by specifying narrowing conditions or exclusion conditions, the browsing mode may be switched in other browsing applications. The content ID is an identifier uniquely assigned to each content of the list browsing content, and is assumed to match a value held in a content database 160 described later.

図８は、上記図６の場合の詳細閲覧コンテンツリストのデータ構成例を示したものである。詳細閲覧コンテンツリストは、上記一覧閲覧コンテンツリストと同様に、クラスタＩＤ、コンテンツＩＤ、及び閲覧時刻を有する。クラスタＩＤは、一覧閲覧コンテンツリストと同一の値とする（図８では“１”）。コンテンツＩＤ及び閲覧時刻は、詳細閲覧コンテンツリストでは、ユーザが一覧閲覧コンテンツから選択して詳細を閲覧したコンテンツ（図８ではコンテンツ３，５，６）の識別子及び当該コンテンツを閲覧した時刻となる。 FIG. 8 shows an example of the data structure of the detailed browsing content list in the case of FIG. The detailed browsing content list has a cluster ID, a content ID, and a browsing time, like the list browsing content list. The cluster ID is the same value as the list browsing content list (“1” in FIG. 8). In the detailed browsing content list, the content ID and browsing time are the identifier of the content (contents 3, 5, and 6 in FIG. 8) that the user has selected from the browsing content and browsed the content and the time when the content was browsed.

一覧閲覧条件リスト要求送信部２５０は、ユーザの指示（入力）により、コンテンツサーバ３００に対して、一覧閲覧条件リストの要求を行う。具体的には図９のような条件一覧要求データをコンテンツサーバ３００に送信する。例えば、条件一覧要求データは、種別、パラメータ名、及びパラメータ値を有する。図９（ａ）に示すように、キーワード検索結果から条件を指定する場合の条件一覧要求データは、種別（＝検索）、パラメータ名（＝キーワード）、及びパラメータ値（＝スポーツ）を有する。図９（ｂ）に示すように、一覧から条件を指定する場合の条件一覧要求データは、種別（＝一覧）、パラメータ名（＝表示ルール）、及びパラメータ値（＝昇順）を有する。また、図９（ｃ）に示すように、概念体系から条件を指定する場合の条件一覧要求データは、種別（＝概念）、パラメータ名（＝親概念）、及びパラメータ値（＝スポーツ）を有する。一覧閲覧リスト要求データは、一覧閲覧条件リスト要求転送部３７０により、一覧閲覧条件リスト作成部１８５に送られる。 The list browsing condition list request transmission unit 250 requests the content server 300 for a list browsing condition list in accordance with a user instruction (input). Specifically, condition list request data as shown in FIG. 9 is transmitted to the content server 300. For example, the condition list request data has a type, a parameter name, and a parameter value. As shown in FIG. 9A, the condition list request data when a condition is specified from a keyword search result has a type (= search), a parameter name (= keyword), and a parameter value (= sport). As shown in FIG. 9B, the condition list request data when a condition is specified from the list has a type (= list), a parameter name (= display rule), and a parameter value (= ascending order). Further, as shown in FIG. 9C, the condition list request data when the condition is specified from the concept system has a type (= concept), a parameter name (= parent concept), and a parameter value (= sport). . The list browsing list request data is sent to the list browsing condition list creation unit 185 by the list browsing condition list request transfer unit 370.

一覧閲覧条件リスト提示部２６０は、コンテンツサーバ３００を介して興味分析装置１００から送られてくる提示用一覧閲覧条件リストを、クライアント端末２００の表示画面サイズが許容する範囲で一覧表示する。図１０は、クライアント端末２００上でのユーザによる条件閲覧操作の一例を示したものである。図１０（ａ）は、キーワード検索結果から条件を指定する場合、図１０（ｂ）は、一覧から条件を指定する場合を示す。ユーザは、提示された一覧閲覧条件リストの中から絞り込み条件、あるいは除外条件を選択する。なお、絞り込み条件を選択するか、あるいは除外条件を選択するかの指定は、例えば、選択する際のボタンを変更することで区別する。親概念から条件を指定する場合は、後述するように当該親概念の直接の下位概念を抽出した一覧閲覧条件リストから条件を指定する。 The list browsing condition list presenting unit 260 displays a list of presentation list browsing condition lists sent from the interest analysis apparatus 100 via the content server 300 within a range that the display screen size of the client terminal 200 allows. FIG. 10 shows an example of a condition browsing operation by the user on the client terminal 200. FIG. 10A shows a case where conditions are designated from the keyword search result, and FIG. 10B shows a case where conditions are designated from the list. The user selects a narrowing condition or an exclusion condition from the presented list browsing condition list. In addition, designation | designated of selecting a narrowing-down condition or an exclusion condition distinguishes by changing the button at the time of selection, for example. When a condition is specified from a parent concept, the condition is specified from a list browsing condition list obtained by extracting a direct subordinate concept of the parent concept as described later.

閲覧条件選択履歴収集部２７０は、提示された一覧閲覧条件リスト（第１の条件リスト）と、条件の一覧から選択した１つ以上の条件を含む条件選択リスト（第２の条件リスト）を収集する。興味分析装置１００から送られてくる提示用一覧閲覧条件リストに選択肢が多い場合は、クライアント端末２００の画面上に全ての選択肢を画面内に一覧閲覧条件リストとして表示できない場合もある。そのような場合は、画面上に表示されたもののみを一覧閲覧条件リストとみなすこともできる。 The browsing condition selection history collection unit 270 collects the presented list browsing condition list (first condition list) and a condition selection list (second condition list) including one or more conditions selected from the list of conditions. To do. If there are many choices in the presentation list browsing condition list sent from the interest analysis device 100, all options may not be displayed on the screen of the client terminal 200 as a list browsing condition list. In such a case, only what is displayed on the screen can be regarded as a list browsing condition list.

条件が階層構造である場合は、条件の設定に関し、選択経路に応じて複数の選択が発生する場合もある。その場合は、発生した選択毎に条件選択リストを送信する。あるいは、図１０に示すように、条件が階層構造を持たず、キーワード検索の結果やアルファベット順などにより並んだ一覧閲覧条件リストから、条件選択リストを選択する場合もある。そのような場合は単一の条件設定リストを送信する。
条件選択履歴送信部２８０は、閲覧条件選択履歴収集部２７０で収集された条件選択履歴をコンテンツサーバ３００を介して興味分析装置１００に送信する。 When the condition has a hierarchical structure, a plurality of selections may occur depending on the selected route with respect to setting the condition. In that case, a condition selection list is transmitted for each selection that occurs. Alternatively, as shown in FIG. 10, the condition selection list may be selected from a list browsing condition list arranged according to the result of keyword search or alphabetical order without the hierarchical structure of the condition. In such a case, a single condition setting list is transmitted.
The condition selection history transmission unit 280 transmits the condition selection history collected by the browsing condition selection history collection unit 270 to the interest analysis device 100 via the content server 300.

［コンテンツサーバ］
上記図４において、コンテンツサーバ３００は、コンテンツ送信処理部３１０、ソート済み提示コンテンツリスト受信部３２０、提示コンテンツリスト送信部３３０、提示コンテンツリスト入力部３４０、履歴情報転送部３５０、コンテンツ要求転送部３６０、一覧閲覧条件リスト要求転送部３７０、一覧閲覧条件リスト転送部３８０及び条件選択履歴転送部３９０を備える。 [Content Server]
4, the content server 300 includes a content transmission processing unit 310, a sorted presentation content list reception unit 320, a presentation content list transmission unit 330, a presentation content list input unit 340, a history information transfer unit 350, and a content request transfer unit 360. A list browsing condition list request transfer unit 370, a list browsing condition list transfer unit 380, and a condition selection history transfer unit 390.

履歴情報転送部３５０は、クライアント端末２００から受信した一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストを通信ネットワークを介して興味分析装置１００に転送する。
提示コンテンツリスト入力部３４０には、サービス運用者により、ユーザの利用するクライアント端末２００に提示するコンテンツを一覧にした提示コンテンツリストが入力される。提示コンテンツリスト送信部３３０は、上記入力された提示コンテンツリストを興味分析装置１００へ通信ネットワークを介して送信する。 The history information transfer unit 350 transfers the list browsing content list and the detailed browsing content list received from the client terminal 200 to the interest analysis device 100 via the communication network.
The presentation content list input unit 340 receives a presentation content list that lists contents to be presented to the client terminal 200 used by the user by the service operator. The presented content list transmission unit 330 transmits the input presented content list to the interest analysis apparatus 100 via the communication network.

図１１に、提示コンテンツリストのデータ構成例を示す。提示コンテンツリストは、コンテンツＩＤ、概念ＩＤ／関連度リスト、コンテンツ本体、及びコンテンツ登録時刻を有する。コンテンツＩＤは、各コンテンツに対してコンテンツサーバ３００にて付与される一意のＩＤである。概念ＩＤ／関連度リストは、コンテンツに出現する概念の概念ＩＤ及び当該概念とコンテンツとの関連性の程度を示す値のセットが格納される。概念ＩＤ／関連度リストは、コンテンツ毎に予め設定されており、具体例としては、コンテンツ１（スポーツ記事）には、｛“野球”の概念ＩＤ=１，関連度＝０．５｝、｛“サッカー”の概念ＩＤ=２，関連度＝０．８｝、｛“ゴルフ”の概念ＩＤ=３、関連度＝０．６｝…のように、概念ＩＤと関連度のセットが格納される。 FIG. 11 shows a data configuration example of the presented content list. The presented content list has a content ID, a concept ID / relevance list, a content body, and a content registration time. The content ID is a unique ID assigned by the content server 300 to each content. The concept ID / relationship degree list stores a concept ID of a concept that appears in the content and a set of values indicating the degree of relevance between the concept and the content. The concept ID / relevance degree list is set in advance for each content. As a specific example, content 1 (sports article) includes {“baseball” concept ID = 1, relevance = 0.5}, { A set of concept ID and degree of association is stored as “soccer” concept ID = 2, degree of association = 0.8}, {“golf” concept ID = 3, degree of association = 0.6}. .

なお、概念ＩＤは、概念体系／ユーザ興味スコアデータベース１４０に格納される値と一致する。関連度は、例えば、０から１までの値とし、大きいほど関連性が強いものとする。関連度は、サービス運用者がコンテンツ登録時に設定する値、若しくは別システムにより算出される値を利用する。 The concept ID matches the value stored in the concept system / user interest score database 140. For example, the relevance is a value from 0 to 1, and the larger the relevance, the stronger the relevance. As the relevance, a value set by the service operator at the time of content registration or a value calculated by another system is used.

ソート済み提示コンテンツリスト受信部３２０は、興味分析装置１００から提示コンテンツリストの一部又は全部をソートしたソート済み提示コンテンツリストとユーザＩＤ（もしくは、クライアント端末ＩＤ）を受信する。コンテンツ送信処理部３１０は、ソート済み提示コンテンツリストをユーザＩＤ（もしくは、クライアント端末ＩＤ）に該当するクライアント端末２００に送信する。 The sorted presentation content list receiving unit 320 receives a sorted presentation content list and a user ID (or client terminal ID) obtained by sorting a part or all of the presentation content list from the interest analysis device 100. The content transmission processing unit 310 transmits the sorted presentation content list to the client terminal 200 corresponding to the user ID (or client terminal ID).

コンテンツ要求転送部３６０は、クライアント端末２００のコンテンツ要求送信部２４０から送信されるコンテンツ要求データ（図５）を興味分析装置１００に転送する。
また、一覧閲覧条件リスト要求転送部３７０は、クライアント端末２００の一覧閲覧条件リスト要求送信部２５０から送信される条件一覧要求データ（図９）を興味分析装置１００に転送する。 The content request transfer unit 360 transfers the content request data (FIG. 5) transmitted from the content request transmission unit 240 of the client terminal 200 to the interest analysis device 100.
Further, the list browsing condition list request transfer unit 370 transfers the condition list request data (FIG. 9) transmitted from the list browsing condition list request transmission unit 250 of the client terminal 200 to the interest analysis device 100.

一覧閲覧条件リスト転送部３８０は、後述する興味分析装置１００の一覧閲覧条件リスト作成部１８５で作成される提示用一覧閲覧条件リストをクライアント端末２００に転送する。
条件選択履歴転送部３９０は、クライアント端末２００の条件選択履歴送信部２８０から送られてくる条件選択履歴を興味分析装置１００に転送する。 The list browsing condition list transfer unit 380 transfers the list browsing condition list for presentation created by the list browsing condition list creation unit 185 of the interest analysis apparatus 100 described later to the client terminal 200.
The condition selection history transfer unit 390 transfers the condition selection history sent from the condition selection history transmission unit 280 of the client terminal 200 to the interest analysis device 100.

［興味分析装置］
興味分析装置１００は、履歴情報受信部１１０、特徴スコア算出部１２０、概念体系更新処理部１３０、概念体系／ユーザ興味スコアデータベース１４０、提示コンテンツリスト受信部１５０、コンテンツデータベース１６０、コンテンツ評価処理部１７０、ソート済みコンテンツスコアリスト送信部１８０、一覧閲覧条件リスト作成部１８５及び条件選択履歴受信部１９０を備える。 [Interest analysis device]
The interest analysis apparatus 100 includes a history information reception unit 110, a feature score calculation unit 120, a concept system update processing unit 130, a concept system / user interest score database 140, a presented content list reception unit 150, a content database 160, and a content evaluation processing unit 170. The sorted content score list transmission unit 180, the list browsing condition list creation unit 185, and the condition selection history reception unit 190 are provided.

図１２は、コンテンツ閲覧履歴からユーザの興味を推定する場合の処理概要を示したものである。履歴情報受信部１１０は、クライアント端末２００からの一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストをコンテンツサーバ３００を介して受信する。一覧閲覧コンテンツリストとは、例えば、ユーザがコンテンツのタイトルのみを一覧で閲覧したコンテンツのリストである。詳細閲覧コンテンツリストとは、ユーザがコンテンツ本体の内容（詳細）を閲覧したコンテンツのリストである。例えば、図１２において、一覧閲覧コンテンツリストには、コンテンツ１〜８が含まれ、詳細閲覧コンテンツリストには、コンテンツ１，３，４が含まれる。また、図１２において、斜線パターンで示すコンテンツは、概念Ｂがコンテンツ１，６，７，８に出現することを示す。 FIG. 12 shows an outline of processing when estimating the user's interest from the content browsing history. The history information receiving unit 110 receives the list browsing content list and the detailed browsing content list from the client terminal 200 via the content server 300. The list browsing content list is, for example, a list of content in which the user browses only the content titles in a list. The detailed browsing content list is a list of content that the user has viewed the content (details) of the content body. For example, in FIG. 12, the list browsing content list includes contents 1 to 8, and the detailed browsing content list includes contents 1, 3, and 4. In FIG. 12, the content indicated by the hatched pattern indicates that the concept B appears in the contents 1, 6, 7, and 8.

特徴スコア算出部１２０は、一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストを利用して概念選択の統計モデルにより各概念の特徴スコア（後述するＺ値）を算出する。
概念体系更新処理部１３０は、上記特徴スコアを用いて概念体系における概念間の関係情報（上位概念及び下位概念）に基づいて各概念に対するユーザ興味スコアを更新する。概念体系のグラフに含まれるノードは概念を表し、リンクは概念間の関係を表す。ユーザ興味スコアは、概念体系における各概念に対応するノードの値として保持する。概念体系において、上位に位置するノードほど抽象的な概念を表し、下位に位置するノードほど具体的な概念を表す。概念体系及び概念ＩＤ（ノード毎に付与される識別子）は、サービス運用者等が事前に設計し定義するものとする。 The feature score calculation unit 120 calculates a feature score (Z value to be described later) of each concept by a statistical model of concept selection using the list browsing content list and the detailed browsing content list.
The concept system update processing unit 130 updates the user interest score for each concept based on the relationship information (superordinate concept and subordinate concept) in the concept system using the feature score. Nodes included in the graph of the concept system represent concepts, and links represent relationships between concepts. The user interest score is held as a value of a node corresponding to each concept in the concept system. In the concept system, the nodes located at the higher level represent the abstract concept, and the nodes located at the lower level represent the specific concept. The concept system and concept ID (identifier assigned to each node) are designed and defined in advance by a service operator or the like.

コンテンツ評価処理部１７０は、評価コンテンツに出現する各概念のユーザ興味スコアを利用して確率結合によってコンテンツに対するユーザの評価スコアを算出する。図１２の例では、コンテンツ１に出現する概念Ｅ，Ｆ，Ｄのユーザ興味スコアを用いて評価コンテンツ１の評価スコアを求めている。 The content evaluation processing unit 170 calculates the user's evaluation score for the content by probability combining using the user interest score of each concept appearing in the evaluation content. In the example of FIG. 12, the evaluation score of the evaluation content 1 is obtained using the user interest scores of the concepts E, F, and D that appear in the content 1.

さらに、条件指定があった場合は、例えば、図１３において、絞り込み条件として“お酒”を指定したり、“ワイン”を指定する事に拠り、ユーザの明示的な興味である絞り込み条件を指定する事ができ、例えば、図１４において“お酒”を絞り込み条件に指定した場合、当該概念を絞り込み条件とみなし、絞り込み条件から、少なくとも１つ以上のリンクを持つコンテンツＢ、Ｃ、Ｄ、Ｅを評価対象コンテンツとする。このとき、絞り込み条件以外の特徴スコアも評価に用いる。例えば、図１４において、”ラーメン”の特徴スコアをコンテンツＢの評価に用いる。 Furthermore, when conditions are specified, for example, in FIG. 13, specifying “sake” as a narrowing condition or specifying “wine” as a narrowing condition specifies the narrowing condition that is the user's explicit interest. For example, when “alcohol” is designated as a refinement condition in FIG. 14, the concept is regarded as a refinement condition, and contents B, C, D, E having at least one link are determined based on the refinement condition. Is the content to be evaluated. At this time, feature scores other than the narrowing-down conditions are also used for evaluation. For example, in FIG. 14, the characteristic score of “ramen” is used for the evaluation of the content B.

また、例えば、図１３において、除外条件候補一覧（“ワイン”、“ビール”、“ウィスキー”）の中から、除外条件として“ウィスキー”を選択する事により、除外条件を指定する事ができ、図１４において“ウィスキー”を除外条件に指定した場合、当該概念を除外条件とみなし、絞り込み条件からのリンクは評価に用いない。例えば、図１４において、除外条件であるウィスキーからコンテンツＥのリンクは評価に用いず、コンテンツＥは他に絞り込み概念からのリンクを持たないため、評価対象から除外する。あるいは、除外条件からのリンクを持つコンテンツを評価対象から除外する場合もある。この場合、コンテンツＤも評価対象から除外する。 Further, for example, in FIG. 13, by selecting “whiskey” as the exclusion condition from the candidate exclusion condition list (“wine”, “beer”, “whiskey”), the exclusion condition can be specified. In FIG. 14, when “whiskey” is designated as an exclusion condition, the concept is regarded as an exclusion condition, and the link from the narrowing-down condition is not used for evaluation. For example, in FIG. 14, a link from whiskey to content E, which is an exclusion condition, is not used for evaluation, and content E is excluded from the evaluation target because it does not have any other links from the refinement concept. Alternatively, content having a link from the exclusion condition may be excluded from the evaluation target. In this case, the content D is also excluded from the evaluation target.

以下、興味分析装置１００の各部の詳細について説明する。
（コンテンツデータベース１６０）
図１５にコンテンツデータベース１６０のデータ構造の一例を示す。コンテンツデータベース１６０は、コンテンツテーブルと、ユーザ履歴テーブルとを有する。コンテンツテーブルは、コンテンツＩＤ、概念ＩＤ／関連度リスト、コンテンツ本体、及びコンテンツ登録時刻を有し、提示コンテンツリスト受信部１５０で受信した各値が格納される。 Hereinafter, the detail of each part of the interest analysis apparatus 100 is demonstrated.
(Content database 160)
FIG. 15 shows an example of the data structure of the content database 160. The content database 160 has a content table and a user history table. The content table has a content ID, a concept ID / relevance list, a content body, and a content registration time, and stores each value received by the presented content list receiving unit 150.

ユーザ履歴テーブルは、コンテンツＩＤ、ユーザＩＤ（クライアント端末ＩＤ）、詳細閲覧総数、詳細閲覧時刻、一覧閲覧総数、一覧閲覧時刻、及び一覧非表示フラグを格納する。詳細閲覧時刻は、詳細閲覧総数が０の場合はｎｕｌｌ、１以上であれば各閲覧の時系列による閲覧時刻のリストを格納する。一覧閲覧時刻は、一覧閲覧総数が０の場合はｎｕｌｌ、１以上であれば各閲覧の時系列による閲覧時刻のリストを格納する。一覧非表示フラグは、まだユーザにクライアント端末の画面上で一覧としても表示／視認していない場合はｆａｌｓｅ、一度でも閲覧した場合はｔｒｕｅを格納する。ユーザ履歴テーブルおいては、ユーザＩＤ毎に全コンテンツＩＤの値を保持する。詳細閲覧総数及び一覧閲覧総数は、上記クラスタＩＤで示される一覧閲覧コンテンツリストが多数受信された場合には過去の履歴の累計を格納する。 The user history table stores content ID, user ID (client terminal ID), detailed browsing total number, detailed browsing time, list browsing total number, list browsing time, and list non-display flag. The detailed browsing time is null when the total number of detailed browsing is 0, and if it is 1 or more, a list of browsing times in a time series of each browsing is stored. The list browsing time is null when the total number of browsing the list is 0, and if it is 1 or more, a list of browsing times according to the time series of each browsing is stored. The list non-display flag stores false when the user has not yet displayed / viewed as a list on the screen of the client terminal, and stores true when the list has been viewed even once. In the user history table, all content ID values are held for each user ID. As the detailed browsing total number and the list browsing total number, when a large number of list browsing content lists indicated by the cluster ID are received, a cumulative total of past histories is stored.

例えば、このユーザ履歴テーブルのデータを利用することで、ユーザの閲覧回数に応じて、コンテンツについて、今後の評価（コンテンツ評価処理部１７０での処理時）で評価スコアを下げるようにする。評価スコアの低減方法としては、あるコンテンツに対する閲覧回数をｋとしたとき、当該コンテンツの評価スコアをｋ＋１で割る、或いは評価スコアに重み（例えば０．９）のｋ乗を乗算するなどがある。この処理により、同じコンテンツの反復提示を興味との一致度を加味して低減することができるためユーザの推薦に対する満足度を向上することができる。 For example, by using the data of the user history table, the evaluation score of the content is lowered in the future evaluation (during processing in the content evaluation processing unit 170) according to the number of times the user browses. As a method for reducing the evaluation score, when the number of browsing for a certain content is k, the evaluation score of the content is divided by k + 1, or the evaluation score is multiplied by a weight (for example, 0.9) to the kth power. By this process, it is possible to reduce the repeated presentation of the same content in consideration of the degree of coincidence with the interest, so that the satisfaction with the user's recommendation can be improved.

（概念体系／ユーザ興味スコア１４０）
図１６に概念体系／ユーザ興味スコアデータベース１４０のデータ構造の一例を示す。概念体系／ユーザ興味スコア１４０は、ルート概念ノードＩＤと、概念体系テーブルと、ユーザ興味スコアテーブルとを有する。ルート概念ノードＩＤとは、概念体系構造において最上位にある概念ノードＩＤである。システム内に１つだけ存在する。概念体系テーブルは、自概念ＩＤ、親概念ＩＤリスト、及び子概念ＩＤリストを格納する。概念体系内の全ての自概念ＩＤは、親概念ＩＤ及び子概念ＩＤ（ただし、自概念が最下位の場合には子概念ＩＤは無し）と紐付けて保存されており、これにより概念構造が定義される。ユーザ興味スコアテーブルは、概念ＩＤ、ユーザＩＤ（もしくは、クライアント端末ＩＤ）、ＴｏｔａｌＺ（ユーザ興味スコア）、Ｘ、及びＹの値を格納する。ＴｏｔａｌＺ、Ｘ、及びＹの定義及び算出方法は後述する。 (Conceptual system / user interest score 140)
FIG. 16 shows an example of the data structure of the conceptual system / user interest score database 140. The concept system / user interest score 140 includes a root concept node ID, a concept system table, and a user interest score table. The root concept node ID is a concept node ID at the highest level in the concept system structure. There is only one in the system. The concept system table stores a self-concept ID, a parent concept ID list, and a child concept ID list. All the self-concept IDs in the concept system are stored in association with a parent concept ID and a child concept ID (however, if the self-concept is the lowest, there is no child concept ID). Defined. The user interest score table stores values of concept ID, user ID (or client terminal ID), TotalZ (user interest score), X, and Y. The definition and calculation method of TotalZ, X, and Y will be described later.

（提示コンテンツリスト受信部１５０）
提示コンテンツリスト受信部１５０は、コンテンツサーバ３００から上記図１１のような提示コンテンツリストを受信し、コンテンツデータベース１６０に保存する。
（履歴情報受信部１１０）
図１７に、履歴情報受信部１１０の処理フローを示す。ステップＳ１１において、履歴情報受信部１１０は、クライアント端末２００から送信されるユーザＩＤ（もしくは、クライアント端末ＩＤ）、一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストをコンテンツサーバ３００を介して受信し、特徴スコア算出部１２０へ出力する。 (Presentation content list receiving unit 150)
The presented content list receiving unit 150 receives the presented content list as shown in FIG. 11 from the content server 300 and stores it in the content database 160.
(History information receiving unit 110)
FIG. 17 shows a processing flow of the history information receiving unit 110. In step S11, the history information receiving unit 110 receives the user ID (or client terminal ID), the list browsing content list, and the detailed browsing content list transmitted from the client terminal 200 via the content server 300, and calculates the characteristic score. To the unit 120.

（条件選択履歴受信部１９０）
条件選択履歴受信部１９０は、クライアント端末２００の条件選択履歴送信部２８０から送信される一覧閲覧条件リスト及び条件選択リストをコンテンツサーバ３００を介して受信し、特徴スコア算出部１２０へ出力する。また、条件選択履歴受信部１９０は、条件選択リストを一覧閲覧条件リスト作成部１８５及びコンテンツ評価処理部１７０へ出力する。 (Condition selection history receiving unit 190)
The condition selection history reception unit 190 receives the list browsing condition list and the condition selection list transmitted from the condition selection history transmission unit 280 of the client terminal 200 via the content server 300 and outputs them to the feature score calculation unit 120. Further, the condition selection history receiving unit 190 outputs the condition selection list to the list browsing condition list creating unit 185 and the content evaluation processing unit 170.

（一覧閲覧条件リスト作成部１８５）
一覧閲覧条件リスト作成部１８５は、コンテンツサーバ３００の一覧閲覧条件リスト要求転送部３７０から送られてくる一覧閲覧リスト要求データに基づいて提示用一覧閲覧条件リストを作成する。例えば、図９（ａ）のような条件検索キーワードが送られてきた場合は、当該キーワードに合致する検索結果を示す提示用一覧閲覧条件リストを作成する。図９（ｂ）のような条件一覧要求が送られてきた場合は、条件一覧を例えば名前順に並び替えた提示用一覧閲覧条件リストを作成する。図９（ｃ）のような親概念が送られてきた場合は、概念体系／ユーザ興味スコアデータベース１４０を参照して、当該親概念の直接の下位概念の一覧を抽出した提示用一覧閲覧条件リストを作成する。また、一覧閲覧条件リスト作成部１８５は、条件選択履歴受信部１９０から条件選択リストが入力されると、概念体系／ユーザ興味スコアデータベース１４０を参照して、条件選択リストで指定される条件に該当する下位概念を抽出した提示用一覧閲覧条件リストを作成する。作成された提示用一覧閲覧条件リストは、通信ネットワークを介してコンテンツサーバ３００に送信される。 (List browsing condition list creation unit 185)
The list browsing condition list creation unit 185 creates a list browsing condition list for presentation based on the list browsing condition request data sent from the list browsing condition list request transfer unit 370 of the content server 300. For example, when a conditional search keyword as shown in FIG. 9A is sent, a presentation list browsing condition list indicating a search result that matches the keyword is created. When a condition list request as shown in FIG. 9B is sent, a presentation list browsing condition list is created by rearranging the condition list in, for example, name order. When a parent concept as shown in FIG. 9C is sent, a presentation list browsing condition list obtained by extracting a list of direct subordinate concepts of the parent concept with reference to the concept system / user interest score database 140 Create In addition, when the condition selection list is input from the condition selection history receiving unit 190, the list browsing condition list creation unit 185 refers to the concept system / user interest score database 140 and corresponds to the condition specified in the condition selection list. A list browsing condition list for presentation in which subordinate concepts to be extracted are extracted is created. The created list browsing condition list for presentation is transmitted to the content server 300 via the communication network.

（特徴スコア算出部１２０）
特徴スコア算出部１２０は、一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストを利用して、概念選択の統計モデルにより各概念の特徴スコア（後述するＺ値）を算出する。また、特徴スコア算出部１２０は、条件選択履歴受信部１９０から入力される一覧閲覧条件リスト及び条件選択リストを利用して、条件として選択した各概念に対する特徴スコア（後述するＺ´値）を算出する。 (Feature score calculator 120)
The feature score calculation unit 120 uses the list browsing content list and the detailed browsing content list to calculate a feature score (Z value to be described later) of each concept using a statistical model of concept selection. In addition, the feature score calculation unit 120 calculates a feature score (Z ′ value described later) for each concept selected as a condition using the list browsing condition list and the condition selection list input from the condition selection history reception unit 190. To do.

（コンテンツ閲覧履歴からの特徴スコア算出処理）
図１８に、特徴スコア算出部１２０の処理フローを示す。特徴スコア算出部１２０には、履歴情報受信部１１０からユーザＩＤ（もしくは、クライアント端末ＩＤ）、一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリストが入力される。 (Feature score calculation processing from content browsing history)
FIG. 18 shows a processing flow of the feature score calculation unit 120. The feature score calculation unit 120 receives the user ID (or client terminal ID), the list browsing content list, and the detailed browsing content list from the history information receiving unit 110.

ステップＳ１２において、特徴スコア算出部１２０は、詳細閲覧コンテンツリスト内の各コンテンツに出現する概念ＩＤをコンテンツデータベース１６０から抽出する。具体的には、図８詳細閲覧コンテンツリストにおいて、各コンテンツＩＤに紐付けされている「概念ＩＤ」を図１５のコンテンツデータベース１６０のコンテンツテーブルから検索する。特徴スコア算出部１２０は、クラスタデータ｛クラスタＩＤ，一覧閲覧コンテンツリスト，詳細閲覧コンテンツリスト｝と、コンテンツＩＤ／概念ＩＤ関連づけリスト｛｛コンテンツＩＤ，｛関連づいている概念ＩＤ，…｝｝，…｝と、出現概念リスト｛概念ＩＤ｝とを生成する。「コンテンツＩＤ／概念ＩＤ関連付けリスト」とは、コンテンツＩＤをもとに検索された概念ＩＤのリストである。「出現概念リスト」とは、一覧閲覧コンテンツリスト、及び詳細閲覧コンテンツリストに含まれる各コンテンツに出現する概念の概念ＩＤを全て列挙したものである。 In step S <b> 12, the feature score calculation unit 120 extracts a concept ID that appears in each content in the detailed browsing content list from the content database 160. Specifically, the “concept ID” associated with each content ID in the detailed browsing content list of FIG. 8 is searched from the content table of the content database 160 of FIG. The feature score calculation unit 120 includes cluster data {cluster ID, list browsing content list, detailed browsing content list} and content ID / concept ID association list {{content ID, {related concept ID, ...}}, ... } And the appearance concept list {concept ID}. The “content ID / concept ID association list” is a list of concept IDs searched based on the content ID. The “appearance concept list” is a list of all concept IDs of concepts that appear in each content included in the list browsing content list and the detailed browsing content list.

ステップＳ１３において、特徴スコア算出部１２０は、「出現概念リスト」の各概念ＩＤについて、図１６の概念体系／ユーザ興味スコアデータベース１４０から上位概念を抽出し、上位概念の概念ＩＤを「出現概念リスト」及び「コンテンツＩＤ／概念ＩＤ関連づけリスト」に追加する。 In step S13, the feature score calculation unit 120 extracts, for each concept ID in the “appearance concept list”, a superordinate concept from the concept system / user interest score database 140 of FIG. And “content ID / concept ID association list”.

具体的には、特徴スコア算出部１２０は、「出現概念リスト」の概念ＩＤが、図１６の概念体系テーブルから「自概念ＩＤ」と一致するものを検索し、その「親概念ＩＤ」を抽出する。さらに、上記抽出された「親概念ＩＤ」が図１６の概念体系テーブルの「自概念ＩＤ」と一致するものをさがし、その「親概念ＩＤ」も上位概念として抽出し、さらにこの親概念ＩＤリストの概念ＩＤについて概念体系テーブルを参照して親概念リストを抽出する処理を繰り返す。そして、特徴スコア算出部１２０は、上位概念の概念ＩＤを抽出の元になった出現概念の概念ＩＤを有するコンテンツＩＤに関連づける。すなわち、上記抽出された「上位概念」を「元になった概念ＩＤを持っていたコンテンツＩＤ」に対して上位概念が付与されていたと見なして、「出現概念リスト」「コンテンツＩＤ／概念ＩＤ関連づけリスト」に追加する。なお、概念体系階層におけるルート概念の抽出は除外する。 Specifically, the feature score calculation unit 120 searches the concept system table in FIG. 16 for the concept ID of the “appearance concept list” that matches the “own concept ID”, and extracts the “parent concept ID”. To do. Further, the “parent concept ID” that is extracted matches the “self concept ID” in the concept system table of FIG. 16, and the “parent concept ID” is also extracted as a superordinate concept, and this parent concept ID list The process of extracting the parent concept list with reference to the concept system table with respect to the concept ID is repeated. Then, the feature score calculation unit 120 associates the concept ID of the superordinate concept with the content ID having the concept ID of the appearance concept that is the source of extraction. That is, the extracted “superior concept” is regarded as having been assigned a superordinate concept with respect to “content ID having the original concept ID”, and “appearance concept list” “content ID / concept ID association” Add to list. The extraction of the root concept in the concept system hierarchy is excluded.

ステップＳ１４において、特徴スコア算出部１２０は、「出現概念リスト」の各概念について出現数を算出し、特徴スコアの算出に必要な分析パラメータを抽出し、分析パラメータリストを生成する。
図１９に、分析パラメータリストのデータ構成例を示す。分析パラメータリストは、クラスタＩＤ毎に、一覧閲覧コンテンツリストのコンテンツ総数Ｓ（第１の総数）、詳細閲覧コンテンツリストのコンテンツ総数ａ（第２の総数）、クラスタＩＤに紐づいた出現概念リスト内の概念ＩＤ毎に算出するＮとｎがある。Ｎ（第１の出現数）は、一覧閲覧コンテンツリストにおいて当該概念ＩＤが付与されているコンテンツ数とする。ｎ（第２の出現数）は詳細閲覧コンテンツリストにおける当該概念ＩＤが付与されているコンテンツ数とする。なお、ステップＳ１３にて追加した上位概念も含めて出現概念リスト内の概念ＩＤすべてについて、Ｎとｎを算出する。 In step S14, the feature score calculation unit 120 calculates the number of appearances for each concept in the “appearance concept list”, extracts analysis parameters necessary for calculating the feature score, and generates an analysis parameter list.
FIG. 19 shows a data configuration example of the analysis parameter list. The analysis parameter list includes, for each cluster ID, the total content S (first total) of the list browsing content list, the total content a (second total) of the detailed browsing content list, and the appearance concept list associated with the cluster ID. N and n are calculated for each concept ID. N (first appearance number) is the number of contents to which the concept ID is assigned in the list browsing content list. n (second appearance number) is the number of contents to which the concept ID is assigned in the detailed browsing content list. Note that N and n are calculated for all concept IDs in the appearance concept list including the superordinate concept added in step S13.

図２０（ａ）に分析パラメータ抽出処理の模式図を示す。例えば、５０個（＝Ｓ）のコンテンツが一覧表示されている中から、ユーザが１０個（＝ａ）のコンテンツの詳細を閲覧した場合を示す。ここで、一覧表示されている５０個のコンテンツのうち「野球」という概念が含まれている記事が１５個（＝Ｎ）あり、ユーザが閲覧した１０個のコンテンツのうち、「野球」という概念が含まれているコンテンツが５個（＝ｎ）あったことを示す。 FIG. 20A shows a schematic diagram of the analysis parameter extraction process. For example, the case where the user browses the details of 10 (= a) contents from a list of 50 (= S) contents is shown. Here, 15 articles (= N) containing the concept of “baseball” among the 50 contents displayed in a list are displayed, and the concept of “baseball” is included in 10 contents viewed by the user. This indicates that there are five (= n) contents including “”.

ステップＳ１５において、特徴スコア算出部１２０は、上記分析パラメータＳ，ａ，Ｎ，ｎを利用して概念ＩＤ毎に特徴スコアＺを算出する。図２１に特徴スコア算出処理の詳細を示す。図２１において、ｉは概念の識別子、ｊは、クラスタＩＤを示す。Ｈ１（第１の確率）は、一覧閲覧コンテンツリストに含まれる一覧閲覧コンテンツの総数Ｓ、一覧閲覧コンテンツのうち概念ｉが出現するコンテンツ数Ｎのとき、詳細閲覧コンテンツをａ個ランダム選択して閲覧した場合に、概念ｉが出現する詳細閲覧コンテンツの数がｎ以上となる累積確率である。Ｈ２（第２の確率）は、一覧閲覧コンテンツリストに含まれる一覧閲覧コンテンツの総数Ｓ、一覧閲覧コンテンツのうち概念ｉが出現するコンテンツ数Ｎのとき、詳細閲覧コンテンツをａ個ランダム選択して閲覧した場合に、概念ｉが出現する詳細閲覧コンテンツの数がｎ以下となる累積確率である。なお、本実施形態では、累積確率Ｈ１及びＨ２は、超幾何分布により求めるが、この手法に限定するものではない。他の分布の例としては、二項分布、正規分布が存在する。 In step S15, the feature score calculation unit 120 calculates a feature score Z for each concept ID using the analysis parameters S, a, N, and n. FIG. 21 shows details of the feature score calculation process. In FIG. 21, i indicates a concept identifier, and j indicates a cluster ID. When H1 (first probability) is the total number S of the list browsing contents included in the list browsing content list and the number N of the contents of the list browsing content where the concept i appears, a detailed browsing content is randomly selected and viewed. In this case, the cumulative probability that the number of detailed browsing contents in which the concept i appears is n or more. When H2 (second probability) is the total number S of the list browsing contents included in the list browsing content list and the number of contents N in which the concept i appears in the list browsing contents, a detailed browsing content is randomly selected and viewed. In this case, the cumulative probability that the number of detailed browsing contents in which the concept i appears is n or less. In the present embodiment, the cumulative probabilities H1 and H2 are obtained by the hypergeometric distribution, but are not limited to this method. Examples of other distributions include a binomial distribution and a normal distribution.

図２０（ｂ）に示すように、例えば、上記の分析パラメータＳ、Ｎ、ａ、ｎを用いて、ユーザが閲覧した１０個のコンテンツのうち、「野球」という概念が含まれるコンテンツが５以上である確率が、「０．１２」であることを示す。ここで、「０．１２」は、累積確率Ｈ１の値に相当する。 As shown in FIG. 20B, for example, there are 5 or more contents including the concept of “baseball” among the 10 contents viewed by the user using the analysis parameters S, N, a, and n described above. It is shown that the probability of being “0.12”. Here, “0.12” corresponds to the value of the cumulative probability H1.

なお、Ｈ２の値を使う例として、上記の分析パラメータでｎが０である場合を考える。この場合は、出現数が０以下の場合の確率を算出する。具体的には、図２０（ｂ）において横軸が０の項目の値となるため「０．０２」となる。
そして、特徴スコア算出部１２０は、図２１に示すように、上記算出した累積確率Ｈ１及びＨ２を用いて、標準正規分布の累積分布関数の逆関数により特徴スコアＺを算出する。図２０（ｃ）に示すように、上記Ｈ１を累積確率とする標準正規分布の累積分布関数の逆関数により特徴スコアＺを求める。なお、累積確率としてＨ２を利用する場合には、標準正規分布の累積分布関数の逆関数の返値の符号を負にして特徴スコアＺを求める。この特徴スコアＺを用いて、後述する概念体系更新処理部１３０は、「野球」という概念に対するユーザ興味スコア（ＴｏｔａｌＺ）を求める。 As an example of using the value of H2, consider the case where n is 0 in the above analysis parameters. In this case, the probability when the number of appearances is 0 or less is calculated. Specifically, in FIG. 20B, since the horizontal axis is the value of the item of 0, “0.02”.
Then, as shown in FIG. 21, the feature score calculation unit 120 uses the calculated cumulative probabilities H1 and H2 to calculate the feature score Z by the inverse function of the standard normal distribution cumulative distribution function. As shown in FIG. 20C, the feature score Z is obtained by the inverse function of the cumulative distribution function of the standard normal distribution with H1 as the cumulative probability. When H2 is used as the cumulative probability, the feature score Z is obtained with the sign of the return value of the inverse function of the standard normal distribution cumulative distribution function being negative. Using this feature score Z, a concept system update processing unit 130, which will be described later, obtains a user interest score (TotalZ) for the concept of “baseball”.

特徴スコア算出部１２０は、更新対象概念リストを生成し、概念体系更新処理部１３０に出力する。「更新対象概念リスト」とは、概念ＩＤ、前記で算出した特徴スコアＺ、及び重みｗのセットである。なお、この更新対象概念リストに出現する概念ＩＤが、次の概念体系更新処理で更新対象のノード（概念）となる。上位概念を追加した出現概念リスト内の概念ＩＤすべてについて、特徴スコアＺと重みｗを算出する。重みｗは、各クラスタＩＤにおいて概念毎に設定される値である。 The feature score calculation unit 120 generates an update target concept list and outputs it to the concept system update processing unit 130. The “update target concept list” is a set of a concept ID, the characteristic score Z calculated above, and a weight w. The concept ID appearing in the update target concept list becomes a node (concept) to be updated in the next concept system update process. The feature score Z and the weight w are calculated for all the concept IDs in the appearance concept list to which the superordinate concept is added. The weight w is a value set for each concept in each cluster ID.

なお、重みｗは、初期値ｗ＝１とし、ユーザの特徴的な操作等が有った場合に、以下のように値を変化させることができる。例えば、クライアント端末２００において、ユーザに提示されたコンテンツについて、ユーザは、お気に入りコンテンツとして登録や、他ユーザへのお勧め、又はコンテンツへの評価入力ができる。クライアント端末２００が、このような閲覧操作以外の操作履歴を興味分析装置１００に送信できる場合には以下の処理を行う。 Note that the weight w can be changed as follows when the initial value w = 1 and a user's characteristic operation is performed. For example, with respect to the content presented to the user at the client terminal 200, the user can register as a favorite content, recommend to other users, or input an evaluation for the content. When the client terminal 200 can transmit an operation history other than the browsing operation to the interest analysis apparatus 100, the following processing is performed.

特徴スコア算出部１２０は、例えば、コンテンツがお気に入りに登録されたとき、そのコンテンツが含む全ての概念ＩＤについて重みｗをｗ＝１．５のように増加させる。その他にも、コンテンツ閲覧時刻、閲覧時の天気、気温、湿度、季節、曜日、休日、余暇かどうか、閲覧時のユーザ位置情報、スケジューラ、日記等から収集したイベント情報に応じて重みｗの値を変えることもできる。 For example, when the content is registered as a favorite, the feature score calculation unit 120 increases the weight w such that w = 1.5 for all the concept IDs included in the content. In addition to the content browsing time, browsing weather, temperature, humidity, season, day of the week, holiday, leisure time, user location information at browsing, scheduler, diary, etc., the value of weight w Can also be changed.

（条件として選択した各概念に対する特徴スコア算出処理）
また、特徴スコア算出部１２０は、条件選択履歴受信部１９０から入力される一覧閲覧条件リスト及び条件選択リストを利用して、条件として選択した各概念に対する特徴スコア（Ｚ´値）を算出する。 (Feature score calculation processing for each concept selected as a condition)
In addition, the feature score calculation unit 120 calculates a feature score (Z ′ value) for each concept selected as a condition using the list browsing condition list and the condition selection list input from the condition selection history reception unit 190.

（絞り込み条件が指定された場合）
例えば、図２２の概念体系において、ビール、あるいは赤ワインを絞り込み条件として指定した場合、“ワイン”配下の選択肢（“赤ワイン”、“白ワイン”、“ロゼ”）の中から、“赤ワイン”を絞り込み条件として選択する事が、偶然と比較しどの程度珍しいかの程度、あるいは、“洋酒”配下の選択肢（“ワイン”、“ビール”、“ウィスキー”）の中から、“ビール”を絞り込み条件として選択する事が、偶然と比較しどの程度珍しいかの程度を用いてユーザの興味スコアを推定することができる。 (When filtering conditions are specified)
For example, in the conceptual system of FIG. 22, when beer or red wine is specified as a narrowing condition, “red wine” is narrowed down from the choices under “wine” (“red wine”, “white wine”, “rose”). How rare it is to select as a condition, or “beer” from the choices under “Western sake” (“wine”, “beer”, “whiskey”) The user's interest score can be estimated using the degree to which the selection is unusual compared to chance.

図２３に示すように、絞り込み選択結果が“赤ワイン”の場合、選択経路は“お酒”→“洋酒”→“ワイン”→“赤ワイン”と特定される。絞り込み選択結果が“ビール”の場合、選択経路は“お酒”→“洋酒”→“ビール”と特定される。特徴スコア算出部１２０は、図２３のように特定された選択経路に従い、発生した選択を全て抽出する。図２４に示すように７つの選択履歴が得られ、“お酒”→“洋酒”→“ワイン”→“赤ワイン”から選択１〜４、“お酒”→“洋酒”→“ビール”から選択５〜７のが抽出される。図２４において、選択１は、選択候補“お酒”，“麺類”から選択結果として“お酒”が選択されたことを示している。 As shown in FIG. 23, when the narrowing selection result is “red wine”, the selection route is specified as “alcohol” → “western sake” → “wine” → “red wine”. When the narrowing selection result is “beer”, the selection route is specified as “alcohol” → “western sake” → “beer”. The feature score calculation unit 120 extracts all the selections that have occurred according to the selected selection path as shown in FIG. As shown in FIG. 24, seven selection histories are obtained. Selection is made from “alcohol” → “western sake” → “wine” → “red wine” 1-4, “alcohol” → “western sake” → “beer” 5-7 are extracted. In FIG. 24, selection 1 indicates that “alcohol” is selected as a selection result from selection candidates “alcohol” and “noodles”.

特徴スコア算出部１２０は、図２５に示すように、選択１〜７のそれぞれについて、分析パラメータＮ´，Ｓ´，ｎ´，ａ´を抽出し、絞り込み条件選択時の概念出現の希少性を用いた特徴スコアＺ´を算出する。例えば、図２４の選択１では、一覧閲覧条件リストとして選択候補“お酒”，“麺類”（Ｓ´＝２個）が提示されている中から、ユーザが“お酒”（ａ´＝１個）を条件選択リストとして選択した場合を示す。ここで、一覧閲覧条件リストには“お酒”（Ｎ´＝１個）が含まれており、条件選択リストに“お酒”（ｎ´＝１個）が含まれていることを示す。特徴スコアＺ´は、上記図１９に示す算出式において、Ｎ，Ｓ，ｎ，ａをＮ´，Ｓ´，ｎ´，ａ´に置き換えることで算出することができる。 As shown in FIG. 25, the feature score calculation unit 120 extracts the analysis parameters N ′, S ′, n ′, and a ′ for each of the selections 1 to 7, and determines the rarity of the concept appearance when selecting the narrowing conditions. The used feature score Z ′ is calculated. For example, in the selection 1 of FIG. 24, the selection candidates “alcohol” and “noodles” (S ′ = 2) are presented as the list browsing condition list, and the user selects “alcohol” (a ′ = 1). Is selected as the condition selection list. Here, “alcohol” (N ′ = 1) is included in the list browsing condition list, and “alcohol” (n ′ = 1) is included in the condition selection list. The characteristic score Z ′ can be calculated by replacing N, S, n, a with N ′, S ′, n ′, a ′ in the calculation formula shown in FIG.

（除外条件が指定された場合）
例えば、図２６の概念体系において、ウィスキーを除外条件として指定した場合、“洋酒”配下の選択肢（“ワイン”、“ビール”、“ウィスキー”）の中から、“ウィスキー”を除外条件として選択する事が、偶然と比較しどの程度珍しいかの程度を用いてユーザの興味スコアを推定することができる。 (When an exclusion condition is specified)
For example, in the conceptual system of FIG. 26, when whiskey is specified as an exclusion condition, “whiskey” is selected as an exclusion condition from the options under “Western sake” (“wine”, “beer”, “whiskey”). The user's interest score can be estimated using the degree to which things are unusual compared to chance.

図２７に示すように、除外選択結果が“ウィスキー”の場合、除外条件の選択経路は、“お酒”→“洋酒”→“ウィスキー”と特定される。特徴スコア算出部１２０は、図２８のように特定された選択経路に従い、上位概念から下位概念を選択するにあたり、発生した選択を全て抽出する。図２８に示すように３つの除外条件選択履歴（選択８〜１０）が得られる。 As shown in FIG. 27, when the exclusion selection result is “whiskey”, the selection path of the exclusion condition is specified as “alcohol” → “western sake” → “whiskey”. The feature score calculation unit 120 extracts all selections that have occurred when selecting a lower concept from a higher concept according to the selected route specified as shown in FIG. As shown in FIG. 28, three exclusion condition selection histories (selections 8 to 10) are obtained.

特徴スコア算出部１２０は、図２９に示すように選択８〜１０のそれぞれについて、分析パラメータＮ´，Ｓ´，ｎ´，ａ´を抽出し、除外条件選択時の概念出現の希少性を用いた特徴スコアＺ´を算出する。但し、与えられた履歴が除外条件選択履歴である場合は、図３０に示す式を用いて特徴スコアＺ´を計算する。つまり、コンテンツ選択履歴を用いて算出された特徴スコアと条件選択履歴を用いて算出された特徴スコアは、同じ興味スコアに統合することができる。 The feature score calculation unit 120 extracts the analysis parameters N ′, S ′, n ′, and a ′ for each of the selections 8 to 10 as shown in FIG. 29, and uses the rarity of the concept appearance when the exclusion condition is selected. The feature score Z ′ obtained is calculated. However, when the given history is the exclusion condition selection history, the feature score Z ′ is calculated using the formula shown in FIG. That is, the feature score calculated using the content selection history and the feature score calculated using the condition selection history can be integrated into the same interest score.

（概念体系更新処理部１３０）
図３１に、概念体系更新処理部１３０の処理フローを示す。概念体系更新処理部１３０には、特徴スコア算出部１２０から、ユーザＩＤ（もしくは、クライアント端末ＩＤ）及び更新対象概念リスト｛クラスタＩＤ，｛概念ＩＤ，特徴スコア＝Ｚ，重み＝ｗ｝，…｝が入力される。 (Concept system update processing unit 130)
FIG. 31 shows a processing flow of the conceptual system update processing unit 130. The concept system update processing unit 130 receives the user ID (or client terminal ID) and the update target concept list {cluster ID, {concept ID, feature score = Z, weight = w},. Is entered.

ステップＳ１６において、概念体系更新処理部１３０は、「更新対象概念リスト」の各概念ＩＤのノード値を更新する。図３２に概念体系更新処理部１３０の処理の詳細を示す。概念体系更新処理部１３０は、コンテンツに出現した概念（出現概念）、及びこの出現概念の上位概念の概念ＩＤについて、図３２に示す各概念ｉに対するユーザ興味スコア更新式を用いて、ユーザ興味スコアＴｏｔａｌＺ_ｉｎ，及びＸ_{ｉ（ｎ−１）}，Ｙ_{ｉ（ｎ−１）}の値を求める。そして、図１６の概念体系／ユーザ興味スコアデータベース１４０のユーザ興味スコアテーブルにおいて、概念ＩＤ及び図１８のステップＳ１２で入力されたユーザＩＤ（クライアント端末ＩＤ）に対応するカラムに格納されている各値（ＴｏｔａｌＺ，Ｘ，Ｙ）を更新する。 In step S <b> 16, the concept system update processing unit 130 updates the node value of each concept ID in the “update target concept list”. FIG. 32 shows details of the processing of the conceptual system update processing unit 130. The concept system update processing unit 130 uses the user interest score update formula for each concept i shown in FIG. 32 for the concept (appearance concept) that appeared in the content and the concept ID of the superordinate concept of this appearance concept, and the user interest score. TotalZ _in and the values of X _{i (n−1)} and Y _{i (n−1)} are obtained. Then, in the user interest score table of the conceptual system / user interest score database 140 in FIG. 16, each value stored in the column corresponding to the concept ID and the user ID (client terminal ID) input in step S12 in FIG. (TotalZ, X, Y) is updated.

ここで、Ｘ_{ｉ（ｎ−１）}は、各概念ＩＤ（ここでは識別子ｉで表現）に対する、過去の（前回までの）前記更新対象概念リストの重みｗの二乗の合計である。Ｙ_{ｉ（ｎ−１）}は、同様に各概念ＩＤ（ここでは識別子ｉで表現）に対する、過去の前記更新対象概念リストの重みｗと特徴スコアＺの乗算の合計である。 Here, X _{i (n−1)} is the sum of the squares of the weights w of the update target concept list in the past (up to the previous time) for each concept ID (represented by the identifier i here). Similarly, Y _{i (n−1)} is the sum of multiplication of the weight w of the past update target concept list and the feature score Z for each concept ID (represented by identifier i here).

この、Ｘ，Ｙはユーザ興味スコア（ＴｏｔａｌＺ）計算過程における中間結果を保持することとなり、省メモリ／ストレージを優先させる場合、最低限では各ノードの変数としてＴｏｔａｌＺ，Ｘ，Ｙの３つの実数値を保持することで実現可能である。省メモリ／ストレージを優先させない場合は、算出した各概念、各クラスタの特徴スコアＺをすべて保存することとなる。この場合は、Ｘ，Ｙの保存は不要となる。 X and Y hold intermediate results in the user interest score (TotalZ) calculation process. When prioritizing memory saving / storage, at least three real values of TotalZ, X, and Y are used as variables of each node. It can be realized by holding. When priority is not given to memory saving / storage, all the calculated concept scores and feature scores Z of the respective clusters are stored. In this case, storage of X and Y is not necessary.

図３２において、ｎは、概念体系更新処理が何度目かを示す識別子である。ユーザ興味スコアＴｏｔａｌＺを求める一連の処理は、クラスタＩＤ単位で行なわれ、この一連の処理が行なわれる単位を１度と数えるとき、ｎはこの一連の処理が何度目に行なわれたものであるかを示す識別子である。ｉは、概念ＩＤの識別子である。Ｚ_ｉｎは、概念ｉの各更新処理に利用するＺ値である。なお、上記Ｚ_ｉｊは一覧閲覧コンテンツリスト及び詳細閲覧コンテンツリスト毎のＺ値であり、Ｚ_ｉｊ∈Ｚ_ｉｎの関係である。重みｗ_ｉｎは、概念ｉの各更新処理に利用する重みである。上記重みｗと同じであり、上記特徴スコア算出部１２０で設定したものと同様である。 In FIG. 32, n is an identifier indicating how many times the concept system update process is performed. A series of processes for obtaining the user interest score TotalZ is performed in units of cluster IDs. When the unit in which this series of processes is performed is counted once, n is the number of times this series of processes has been performed. Is an identifier. i is an identifier of a concept ID. Z _in is a Z value used for each update process of concept i. Note that Z _ij is a Z value for each of the list browsing content list and the detailed browsing content list, and has a relationship of Z _ij εZ _in . Weight w _in is the weight to be used in each process of updating the concept i. The weight w is the same as that set by the feature score calculation unit 120.

例えば、ｗ_ｉｎは、お気に入り登録、他ユーザとの共有等の閲覧以外の特殊な操作をユーザが行った場合、及びコンテンツ閲覧時間（閲覧開始から終了までの間隔）、コンテンツと概念の関連度合い、コンテンツ閲覧時刻、閲覧時の天気・気温・湿度・季節・曜日・休日・余暇かどうか、閲覧時のユーザ位置情報、スケジューラ・日記等から収集したイベント情報に応じて値を変化させる。その他サービス利用者、サービス運用者が特に指定した場合にも変化させる。あるいは、概念（若しくは概念グループ）毎に値を変化させることもできる。これにより、例えば、サービス提供者側の判断で、特定の概念の学習を重視するといった設定が可能である。 For example, w _in the favorite registration, if the user a special operation other than the inspection of the public, such as with the other user has performed, and (interval to the end from the viewing start) content viewing time, related the degree of content and concepts, The value is changed according to the content browsing time, whether it is weather / temperature / humidity / season / day of the week / holiday / leisure at the time of browsing, user location information at the time of browsing, event information collected from scheduler / diary, etc. It is also changed when specified by other service users and service operators. Alternatively, the value can be changed for each concept (or concept group). Thereby, for example, a setting that places importance on learning of a specific concept can be made based on the judgment of the service provider.

なお、一定期間過ぎた履歴の影響を低減させるため等のユーザ興味スコアの忘却は、最終更新時から現在の時刻までの時間間隔の閾値を超えた場合に、ＴｏｔａｌＺ，Ｘ，Ｙをそれぞれ減衰させることで実現する。
減衰の計算式の例を示す。例えば、ｋを減衰率（例えばｋ＝０・８）と設定し、以下のように算出することができる。
ＴｏｔａｌＺ（減衰後）＝ｋ×ＴｏｔａｌＺ（現在）
Ｘ（減衰後）＝ｋ^２×Ｘ（現在）
Ｙ（減衰後）＝ｋ^２×Ｙ（現在）
さらに、ステップＳ１７において、概念体系更新処理部１３０は、「更新対象概念リスト」の各概念ＩＤ（出現概念及び上位概念）の下位概念を抽出し、下位概念のノード値を更新する。下位概念の抽出では、「更新対象概念リスト」の各概念ＩＤについて、図１６の概念体系／興味度データベース１４０の概念体系テーブルを参照し、子概念ＩＤリストから概念ＩＤのリストを抽出し、さらに各子概念ＩＤリストの概念ＩＤについて概念体系テーブルを参照して子概念リストを抽出する処理を繰り返す。 Note that the forgetting of the user interest score, such as to reduce the influence of the history after a certain period, attenuates TotalZ, X, and Y, respectively, when the threshold of the time interval from the last update time to the current time is exceeded. It will be realized.
An example of an attenuation calculation formula is shown. For example, k can be set as an attenuation rate (for example, k = 0 · 8), and can be calculated as follows.
TotalZ (after attenuation) = k × TotalZ (current)
X (after attenuation) = k ² × X (current)
Y (after attenuation) = k ² × Y (current)
Further, in step S <b> 17, the concept system update processing unit 130 extracts a subordinate concept of each concept ID (appearance concept and superordinate concept) in the “update target concept list”, and updates the node value of the subordinate concept. In the extraction of subordinate concepts, for each concept ID of the “update target concept list”, the concept system table of the concept system / interest degree database 140 in FIG. 16 is referred to, and a list of concept IDs is extracted from the child concept ID list. The process of extracting the child concept list with reference to the concept system table for the concept ID of each child concept ID list is repeated.

下位概念の興味度の更新に利用する特徴スコアＺは、例えば、隣接した親ノードのうち特徴スコアの絶対値が最も大きい値を利用、最も近い上位ノードの値を利用、親ノードの値を平均、または確率結合した値とする。なお、「更新対象概念リスト」のうち、上記ステップＳ１６で更新済みの概念（コンテンツに出現した概念、及び上位概念）のユーザ興味スコアは更新しない。
また、興味概念体系更新処理部１３０は、絞り込み条件または除外条件が指定された場合、特徴スコアＺ´を用いて概念体系における概念間の関係情報（上位概念及び下位概念）に基づいて各概念に対するユーザ興味スコアを更新する。 The feature score Z used to update the interest level of the lower concept is, for example, the value having the largest absolute value of the feature score among the adjacent parent nodes, the value of the closest higher node, the average of the values of the parent nodes Or a probability-coupled value. In the “update target concept list”, the user interest scores of the concepts updated in step S16 (concepts that appear in the content and higher-level concepts) are not updated.
In addition, when the narrowing-down condition or the exclusion condition is designated, the interested concept system update processing unit 130 applies the feature score Z ′ to each concept based on the relationship information (superordinate concept and subordinate concept) between the concepts in the conceptual system. Update user interest score.

（絞り込み条件が指定された場合）
興味概念体系更新処理部１３０は、図２５のように算出された各選択についての特徴スコアを、図３３に示すように子孫概念に伝播させる。例えば、“洋酒”について算出された特徴スコア（Ｚ´＝０．６７）を“洋酒”の子ノードにも確率結合により加える。子孫概念に特徴スコアを伝播させ、かつ絞り込み条件選択経路の特定を考慮し、絞り込み条件に関する全ての選択を確率結合すると、図３４に示す結果が得られる。あるいは、図２３に示すような条件選択経路を特定しない場合は、図３５に示す結果が得られる。なお、図３４，図３５中の値は、図１７に示すユーザ興味スコアを算出するために用いる（Ｚ，Ｘ，Ｙ）の値を表す。 (When filtering conditions are specified)
The interest concept system update processing unit 130 propagates the feature score for each selection calculated as shown in FIG. 25 to the descendant concept as shown in FIG. For example, the feature score (Z ′ = 0.67) calculated for “Western Sake” is also added to the child node of “Western Sake” by stochastic coupling. When the feature score is propagated to the descendant concept, and the selection of the narrowing condition selection path is considered and all selections regarding the narrowing condition are stochastically combined, the result shown in FIG. 34 is obtained. Alternatively, when the condition selection route as shown in FIG. 23 is not specified, the result shown in FIG. 35 is obtained. The values in FIGS. 34 and 35 represent the values of (Z, X, Y) used to calculate the user interest score shown in FIG.

（除外条件が指定された場合）
興味概念体系更新処理部１３０は、図２９のように計算された各選択についての特徴スコアを図３６に示すように子孫概念に伝播させる。例えば、“洋酒”について算出された特徴スコア（Ｚ´＝−０．６７）を“洋酒”の子ノードにも確率結合により加える。子孫概念に特徴スコアを伝播させ、かつ除外条件選択経路の特定を考慮し、除外条件に関する全ての選択を確率結合すると、図３６に示す結果が得られる。あるいは、図２７に示すような除外条件選択経路を特定しない場合は、図３７に示す結果が得られる。なお、図３６，図３７中の値は、図３２に示すユーザ興味スコアを算出するために用いる（Ｚ，Ｘ，Ｙ）の値を表す。
さらに、図３４に示す結果と図３６に示す結果を統合すると図３８に示す結果が得られる。図３５に示す結果と図３７に示す結果を統合すると図３９に示す結果を得ることができる。 (When an exclusion condition is specified)
The interest concept system update processing unit 130 propagates the feature score for each selection calculated as shown in FIG. 29 to the descendant concept as shown in FIG. For example, the feature score (Z ′ = − 0.67) calculated for “Western Sake” is also added to the child node of “Western Sake” by stochastic coupling. If the feature score is propagated to the descendant concept and the selection of the exclusion condition selection path is considered, and all the selections regarding the exclusion condition are stochastically combined, the result shown in FIG. 36 is obtained. Alternatively, when the exclusion condition selection route as shown in FIG. 27 is not specified, the result shown in FIG. 37 is obtained. The values in FIGS. 36 and 37 represent the values of (Z, X, Y) used to calculate the user interest score shown in FIG.
Furthermore, when the result shown in FIG. 34 and the result shown in FIG. 36 are integrated, the result shown in FIG. 38 is obtained. When the result shown in FIG. 35 and the result shown in FIG. 37 are integrated, the result shown in FIG. 39 can be obtained.

（コンテンツ評価処理部１７０）
図４０にコンテンツ評価処理部１７０の処理フローを示す。コンテンツ評価処理部１７０には、コンテンツサーバ３００のコンテンツ要求転送部３６０からの通知を入力として、コンテンツデータベース１６０のコンテンツテーブルから図１１のような形式の提示コンテンツリストを読み出して以下のコンテンツ評価処理を行う。コンテンツ要求転送部３６０からはユーザＩＤ（もしくは、クライアント端末ＩＤ）を含む、図５に示すようなコンテンツ要求データを受信する。また、上記提示コンテンツリストについては、サービス運用者もしくはサービス利用者（クライアント端末利用者）の事前設定により、過去何日以内に登録されたコンテンツのみを評価対象とするか（提示コンテンツリストに含めるか）を設定することができる。 (Content Evaluation Processing Unit 170)
FIG. 40 shows a processing flow of the content evaluation processing unit 170. The content evaluation processing unit 170 receives the notification from the content request transfer unit 360 of the content server 300, reads a presentation content list in the format shown in FIG. 11 from the content table of the content database 160, and performs the following content evaluation processing. Do. The content request transfer unit 360 receives content request data including a user ID (or client terminal ID) as shown in FIG. In addition, with regard to the above-mentioned presented content list, whether the content registered within the past number of days is to be evaluated based on the prior setting of the service operator or service user (client terminal user) (whether it is included in the presented content list) ) Can be set.

ステップＳ２１の分析対象概念フィルタリングでは、サービス運用者又はサービス利用者が、条件選択履歴受信部１９０や事前設定等で特に分析対象の概念ＩＤ、あるいは分析対象から除外する概念ＩＤを指定した場合は、概念体系／興味度データベース１４０を参照し、図１４に示すように、指定された分析対象の概念ＩＤおよび下位の概念ＩＤのみを評価対象とする。あるいは除外条件に指定された概念ＩＤおよび下位の概念ＩＤを評価対象から除外する。コンテンツ評価処理部１７０は、入力された提示コンテンツリストが保持する概念ＩＤについて、事前にサービス運用者又はサービス利用者が設定した条件にしたがってフィルタリングし、「フィルタリング済みコンテンツリスト」を生成する。 In the analysis target concept filtering in step S21, when the service operator or the service user designates a concept ID that is particularly an analysis target or a concept ID that is excluded from the analysis target in the condition selection history receiving unit 190 or presetting, With reference to the concept system / interest degree database 140, as shown in FIG. 14, only the concept ID and the lower-level concept ID of the specified analysis target are evaluated. Alternatively, the concept ID and lower concept ID specified in the exclusion condition are excluded from the evaluation target. The content evaluation processing unit 170 performs filtering according to the conditions set in advance by the service operator or service user with respect to the concept ID held by the input presentation content list, and generates a “filtered content list”.

例えば、ユーザが、野球に関するコンテンツのレコメンドを求めた場合には、図１６の概念体系テーブルを参照し、野球に対応する概念ＩＤの下位概念のみを分析対象とする。「フィルタリング済みコンテンツリスト」とは上記処理によって、各コンテンツＩＤに紐付けされている概念ＩＤを事前にサービス運用者又はユーザが設定した条件にしたがってフィルタリングしたコンテンツリストである。「フィルタリング済みコンテンツリスト」は、上記図１１の提示コンテンツリストと同じデータ構成である。 For example, when the user requests a recommendation for content related to baseball, the concept system table in FIG. 16 is referred to, and only the subordinate concepts of the concept ID corresponding to baseball are analyzed. The “filtered content list” is a content list obtained by filtering the concept ID linked to each content ID according to the conditions set in advance by the service operator or the user. The “filtered content list” has the same data configuration as the presented content list of FIG.

ステップＳ２２において、コンテンツ評価処理部１７０は、「フィルタリング済みコンテンツリスト」に含まれるコンテンツの評価スコアを算出し、図４１に示すようなコンテンツスコアリストを生成する。コンテンツスコアリストは、コンテンツＩＤ、評価スコア、コンテンツ本体、及びコンテンツ登録時刻を有する。 In step S22, the content evaluation processing unit 170 calculates an evaluation score of the content included in the “filtered content list”, and generates a content score list as shown in FIG. The content score list has a content ID, an evaluation score, a content body, and a content registration time.

図４２に評価スコアの算出方法の一例を示す。例えば、図４２に示すコンテンツ評価式により、コンテンツｘに対する評価スコアＥｎｔｉｔｙＺ_ｘを概念ｉのユーザ興味スコアＴｏｔａｌＺ_ｉ、コンテンツｘと概念ｉとの関連度ｗ_ｉ（もしくは、概念ｉの重要度）、及びコンテンツｘに出現する概念ＩＤの集合ｐを用いて算出することができる。なお、概念の識別子ｉは集合ｐ内の概念ＩＤに対応する。 FIG. 42 shows an example of a method for calculating the evaluation score. For example, according to the content evaluation formula shown in FIG. 42, the evaluation score EntityZ _x for the content x is changed to the user interest score TotalZ _{i of} the concept i, the relevance w _i between the content x and the concept _i (or the importance of the concept i), and It can be calculated using a set p of concept IDs appearing in the content x. The concept identifier i corresponds to the concept ID in the set p.

図４２の算出で利用するユーザ興味スコア（ＴｏｔａｌＺ）は、各コンテンツに関連した概念ＩＤについて、概念体系／ユーザ興味スコアデータベース１４０のユーザ興味スコアテーブル（図１６）から、ユーザＩＤ（もしくは、クライアント端末ＩＤ）をもとに読み出し利用する。図４２において、概念Ｋ、概念Ｂ及び概念Ｄが出現する評価コンテンツ１を評価コンテンツとした場合、概念Ｋ、概念Ｂ及び概念ＤのＴｏｔａｌＺ，ｗを利用して評価スコアＥｎｔｉｔｙＺ_{評価コンテンツ１}＝０．１８と算出できる。一方、概念Ｂのみが出現するコンテンツ２を評価コンテンツとした場合、概念ＢのＴｏｔａｌＺ，ｗを利用して評価スコアＥｎｔｉｔｙＺ_{評価コンテンツ２}＝−０．３と算出できる。評価スコアＥｎｔｉｔｙＺ_ｘの値が大きいコンテンツ１が優先して表示される。 The user interest score (TotalZ) used in the calculation of FIG. 42 is obtained from the user interest score table (FIG. 16) of the concept system / user interest score database 140 for the user ID (or client terminal) for the concept ID related to each content. Read out based on (ID). In FIG. 42, when the evaluation content 1 in which the concept K, the concept B, and the concept D appear is set as the evaluation content, the evaluation score EntityZ _{evaluation content 1} using the TotalZ, w of the concept K, the concept B, and the concept D = 0. 18 can be calculated. On the other hand, when the content 2 in which only the concept B appears is set as the evaluation content, the evaluation score EntityZ _{evaluation content 2} = −0.3 can be calculated using the TotalZ, w of the concept B. Content 1 with a large value of evaluation score EntityZ _x is displayed preferentially.

その他にも、評価スコアＥｎｔｉｔｙＺ_ｘは、以下の変形例１〜３の方法により求めることができる。
変形例１としては、ＥｎｔｉｔｙＺ_ｘ=ＭＡＸ（ＴｏｔａｌＺ_ｉ＊ｗ_ｉ）により求める。ＭＡＸ（ＴｏｔａｌＺ_ｉ＊ｗ_ｉ）は、ｉ∈ｐのＴｏｔａｌＺ_ｉ＊ｗ_ｉの最大値を返す関数とする。 In addition, the evaluation score EntityZ _x can be obtained by the following methods 1 to 3.
As a first modification, it is obtained by EntityZ _x = MAX (TotalZ _i * w _i ). MAX (TotalZ _i * w _i ) is a function that returns the maximum value of TotalZ _i * w _i for i∈p.

変形例２としては、ＥｎｔｉｔｙＺ_ｘの値は、ＭＡＸ（ＴｏｔａｌＺ_ｉ＊ｗ_ｉ）の値が閾値を超えた場合には、ＭＡＸ（ＴｏｔａｌＺ_ｉ＊ｗ_ｉ）の返り値とする。ＭＡＸ（ＴｏｔａｌＺ_ｉ＊ｗ_ｉ）はｉ∈ｐのＴｏｔａｌＺ_ｉ＊ｗ_ｉの最大値を返す関数とする。閾値を超えない場合は、図４２のコンテンツ評価式の結果をＥｎｔｉｔｙＺ_ｘとする。ＭＡＸ（）は、はｉ∈ｐのＴｏｔａｌＺ_ｉ＊ｗ_ｉで最大値を返す関数とする。閾値はサービス運用者が設定する値とする。 The second modification, the value of EntityZ _x When the value of the MAX (TotalZ _{_i} * _w _i) exceeds the threshold value, the return value of _{_{MAX (TotalZ i * w i)}} . MAX (TotalZ _i * w _i ) is a function that returns the maximum value of TotalZ _i * w _i for i∈p. If the threshold is not exceeded, the result of the content evaluation formula in FIG. 42 is set to EntityZ _x . MAX () is, is a function that returns the maximum value in TotalZ _{_i} * _w _i of i∈p. The threshold is a value set by the service operator.

変形例３としては、ＴｏｔａｌＺ_ｉが正の値のｉ∈ｐについてのみ取り出し、図４２のコンテンツ評価式で統合した値をＥｎｔｉｔｙＺ_ｘとする。
ステップＳ２３において、コンテンツ評価処理部１７０は、コンテンツスコアリストに含まれるコンテンツを評価スコアＥｎｔｉｔｙＺ_ｘの降順にソートし、ソート済みコンテンツスコアリストをソート済みコンテンツスコアリスト送信部１８０に出力する。 As a third modification example, only the value iεp where TotalZ _i is a positive value is extracted, and the value integrated by the content evaluation formula of FIG. 42 is EntityZ _x .
In step S23, the content evaluation processing unit 170 sorts the content included in the content score list in descending order of the evaluation score EntityZ _x , and outputs the sorted content score list to the sorted content score list transmission unit 180.

（ソート済みコンテンツスコアリスト送信部１８０）
ソート済みコンテンツスコアリスト送信部１８０は、コンテンツ評価処理部１７０から入力されるソート済みコンテンツスコアリストとユーザＩＤ（もしくは、クライアント端末ＩＤ）を通信ネットワークを介してコンテンツサーバ３００に送信する。 (Sorted content score list transmission unit 180)
The sorted content score list transmission unit 180 transmits the sorted content score list and the user ID (or client terminal ID) input from the content evaluation processing unit 170 to the content server 300 via the communication network.

以上述べたように、上記構成によれば、ユーザの選択候補となる一覧リストを定義し、そこからのコンテンツ選択における概念の出現数を分析することで、各概念の出現の希少性を考慮し、且つ一覧から選ばれない概念の履歴特徴を利用することができるため、ユーザの興味を高精度に推定することが可能となる。 As described above, according to the above configuration, the list of candidates for user selection is defined, and the number of appearances of concepts in content selection from there is analyzed, thereby taking into consideration the rare occurrence of each concept. In addition, since it is possible to use history features of concepts not selected from the list, it becomes possible to estimate the user's interest with high accuracy.

さらに、上記ステップＳ１５、ステップＳ１６、ステップＳ１７に示したように、特徴スコアの算出やユーザ興味スコアの算出に際し、閲覧時のユーザの状況や閲覧操作の特徴（お気に入り登録、長時間閲覧等）などを重み係数（重みｗ）を介して反映することができるため、ユーザ興味スコアをさらに精度良く求めることが可能となる。 Further, as shown in steps S15, S16, and S17, when calculating the feature score and the user interest score, the user's situation at the time of browsing, the characteristics of the browsing operation (favorite registration, long-time browsing, etc.), etc. Can be reflected via the weighting coefficient (weight w), so that the user interest score can be obtained with higher accuracy.

また、タクソノミ（オントロジ）等で定義された概念をメタタグとして付与したコンテンツ閲覧履歴分析において、概念出現の希少性を合理的に分析に反映することが難しかったため、従来はタクソノミ（オントロジ）構造の深さを一定する等によりコンテンツに付与する概念の抽象度を統一する等のオントロジ構造側の調整が必要があったが、本実施形態では概念出現の希少性を考慮するオントロジ構造によるユーザ興味スコアの更新処理により上位概念が付与されたコンテンツと、下位概念が付与されたコンテンツの閲覧履歴を統合して分析可能となるため、分析に利用するタクソノミ（オントロジ）への制約を低減し、タクソノミ（オントロジ）の維持・運用・管理コストを低減することが可能となる。 In addition, in the content browsing history analysis with the concept defined by taxonomies (ontologies) as meta tags, it was difficult to rationally reflect the rareness of concept appearances in the analysis. Although it was necessary to adjust the ontology structure side such as unifying the abstraction level of the concept to be given to the content by fixing the thickness, etc., in this embodiment, the user interest score of the ontology structure considering the rarity of concept appearance Since it is possible to analyze the content with the higher concept by the update process and the browsing history of the content with the lower concept, the restriction on the taxonomy (ontology) used for analysis is reduced, and the taxonomy (ontology) is reduced. ) Maintenance, operation, and management costs can be reduced.

さらに、ユーザ興味スコアを用いてコンテンツに対するユーザの評価スコアを算出することで、ユーザの興味に合ったコンテンツを推薦することが可能となる。
また、本実施形態では、ユーザの履歴を用いた興味推定とユーザが行った条件指定履歴を用いた興味推定を組合せ、ユーザがその時点で望む情報を精度良く提示する事ができる。 Furthermore, by calculating the user's evaluation score for content using the user interest score, it is possible to recommend content that matches the user's interest.
Moreover, in this embodiment, the interest estimation using the user's history and the interest estimation using the condition designation history performed by the user can be combined, and the information desired by the user at that time can be accurately presented.

例えば、図４３に示す例は、ユーザが“ビール”を絞り込み条件として指定した場合であるが、この場合、第１のコンテンツリスト、及び第２のコンテンツリスト共に全て“ビール”のコンテンツとなるため、第２のコンテンツリストが全て“ビール”に関するコンテンツである事は当然であり、ユーザが偶然と比べて比較的“エール”を選ぶ、あるいは、偶然と比べて比較的“ラガー”を選ばないという事実から、“エールが好き”、あるいは“ラガーが嫌い”という興味推定を行う事ができるが、一方、ユーザが絞り込み条件に、“ビール”を指定したとしても”ビール”が好きであるという興味は学習できない。 For example, the example shown in FIG. 43 is a case where the user designates “beer” as a narrowing condition. In this case, both the first content list and the second content list are “beer” content. Naturally, the second content list is all about “beer”, and the user chooses relatively “ale” compared to chance, or does not choose “lager” relatively compared to chance. From the facts, it is possible to estimate the interest of “I like ale” or “I don't like lager”, but I am interested in “beer” even if the user specifies “beer” as the filtering condition. Cannot learn.

この課題に対し、本実施形態では、ユーザが絞り込み条件を絞り込み条件の選択候補から選択した場合、絞り込み条件の指定について、偶然と比べて比較的“選ぶ”あるいは、“選ばない”程度を分析し、ユーザの興味を推定するものであって、コンテンツ選択履歴による興味推定と併せて用いる事に拠り、ユーザが絞り込み条件を指定した場合であっても、ユーザの興味を合理的かつ的確に分析可能となる。 In response to this problem, in the present embodiment, when the user selects a narrowing condition from the narrowing condition selection candidates, the specification of the narrowing condition is analyzed to the extent that it is relatively “selected” or “not selected” compared to chance. , Which estimates the user's interests, and can be used in conjunction with the interest estimation based on the content selection history, so that even if the user specifies the narrowing conditions, the user's interests can be analyzed rationally and accurately. It becomes.

例えば、図２２の概念体系において、ビール、あるいは赤ワインを絞り込み条件として指定した場合、“ワイン”配下の選択肢（“赤ワイン”、“白ワイン”、“ロゼ”）の中から、“赤ワイン”を絞り込み条件として選択する事が、偶然と比較しどの程度珍しいかの程度、あるいは、”洋酒”配下の選択肢（“ワイン”、“ビール”、“ウィスキー”）の中から、“ビール”を絞り込み条件として選択する事が、偶然と比較しどの程度珍しいかの程度を用いてユーザの興味スコアを推定することができ、コンテンツ閲覧履歴を用いた興味推定結果と併せて用いる事により興味推定の精度を高める事ができる。 For example, in the conceptual system of FIG. 22, when beer or red wine is specified as a narrowing condition, “red wine” is narrowed down from the choices under “wine” (“red wine”, “white wine”, “rose”). How rare it is to select as a condition compared to chance, or “Beer” as a narrowing condition from the choices under “Western Sake” (“Wine”, “Beer”, “Whisky”) The user's interest score can be estimated using the degree to which the selection is rare compared to chance, and the interest estimation accuracy is improved by using it together with the interest estimation result using the content browsing history. I can do things.

さらに、絞り込み選択結果について、概念体系における上位概念からの絞り込み条件選択経路を特定し、それらの選択についても興味推定に利用することにより、興味推定精度を高めることを目的とするものであって、例えば図２３において、“お酒”、“洋酒”、“ワイン”、“赤ワイン”という順に上位概念から選択を行った場合、“ワイン”配下の選択肢（“赤ワイン”、“白ワイン”、“ロゼ”）の中から、“赤ワイン”を絞り込み条件として選択した事実だけでなく、“お酒”配下の選択肢（“日本のお酒”、“洋酒”、“韓国のお酒”、“中国のお酒”）の中から、“洋酒”を選んだ事実などもさらに加えて分析することにより、興味推定精度を高めることができる。 Furthermore, with regard to the narrowing selection result, it is intended to increase the interest estimation accuracy by specifying the narrowing condition selection path from the superordinate concept in the concept system and using the selection for the interest estimation, For example, in FIG. 23, when selection is made from the upper concepts in the order of “alcohol”, “western sake”, “wine”, “red wine”, the options under “wine” (“red wine”, “white wine”, “rose” )), Not only the fact that “red wine” was selected as a narrowing condition, but also the choices under “Sake” (“Japanese liquor”, “Western liquor”, “Korean liquor”, “Chinese liquor” The interest estimation accuracy can be improved by further analyzing the fact that “Western Sake” was selected from “Sake”).

また、本実施形態では、ユーザが一覧閲覧条件リストから除外条件を選択した場合、偶然と比べて比較的“選ぶ”あるいは、“選ばない”という特徴を活用し、ユーザの興味を推定するものであって、コンテンツ選択履歴による興味推定と併せて用いる事に拠り、ユーザが除外条件を指定した場合であっても、ユーザの興味を合理的かつ的確に分析可能となる。 Further, in the present embodiment, when the user selects an exclusion condition from the list browsing condition list, the user's interest is estimated by utilizing the feature of “select” or “not select” relatively compared to chance. Thus, the user's interest can be analyzed reasonably and accurately even when the user designates an exclusion condition by using it together with the interest estimation based on the content selection history.

例えば、図２６の概念体系において、ウィスキーを除外条件として指定した場合、“洋酒”配下の選択肢（“ワイン”、“ビール”、“ウィスキー”）の中から、“ウィスキー”を除外条件として選択する事が、偶然と比較しどの程度珍しいかの程度を用いてユーザの興味スコアを推定することができ、コンテンツ閲覧履歴を用いた興味推定結果と併せて用いる事により興味推定の精度を高める事ができる。 For example, in the conceptual system of FIG. 26, when whiskey is specified as an exclusion condition, “whiskey” is selected as an exclusion condition from the options under “Western sake” (“wine”, “beer”, “whiskey”). It is possible to estimate the user's interest score by using how rare it is compared with chance, and to improve the accuracy of interest estimation by using it together with the result of interest estimation using content browsing history. it can.

さらに、除外条件選択結果について、概念体系における上位概念からの除外条件選択経路を特定し、それらの選択についても興味推定に利用することにより、興味推定精度を高めることを目的とするものであって、例えば図２７において、“お酒”、“洋酒”、“ウィスキーという順に上位概念から選択を行った場合、“洋酒”配下の選択肢（“ワイン”、“ビール”、“ウィスキー”）の中から、“ウィスキー”を除外条件として選択した事実だけでなく、“お酒”配下の選択肢（“日本のお酒”、“洋酒”、“韓国のお酒”、“中国のお酒”）の中から、“洋酒”を選んだ事実などもさらに加えて分析することにより、興味推定精度を高めることができる。 Furthermore, the exclusion condition selection result is intended to improve the accuracy of interest estimation by specifying the exclusion condition selection route from the superordinate concept in the concept system and using the selection for the interest estimation as well. For example, in FIG. 27, when the selection is made in the order of “alcohol”, “western sake”, and “whiskey”, from the options under “Western sake” (“wine”, “beer”, “whiskey”) In addition to the fact that “Whisky” was selected as an exclusion condition, among the choices under “Sake” (“Japanese Sake”, “Western Sake”, “Korean Sake”, “Chinese Sake”) Therefore, the interest estimation accuracy can be improved by further analyzing the fact that “Western sake” was selected.

また、本実施形態では、各選択について算出した特徴スコアを子概念群に同じく適用する。例えば、図３３の例において、洋酒に関する特徴スコアが０．６７と計算された場合、洋酒の子孫概念である“ワイン”や“赤ワイン”についても特徴スコアを同じく加算することにより、興味推定精度を高めることができる。 In the present embodiment, the feature score calculated for each selection is also applied to the child concept group. For example, in the example of FIG. 33, when the feature score for Western liquor is calculated to be 0.67, interest estimation accuracy is also increased by adding the feature scores for “wine” and “red wine” which are the concept of descendants of Western liquor. Can be increased.

なお、この発明は、上記実施形態そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、上記実施形態に開示されている複数の構成要素の適宜な組み合せにより種々の発明を形成できる。例えば、実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。さらに、異なる実施形態に亘る構成要素を適宜組み合せてもよい。 Note that the present invention is not limited to the above-described embodiment as it is, and can be embodied by modifying the constituent elements without departing from the scope of the invention in the implementation stage. Further, various inventions can be formed by appropriately combining a plurality of constituent elements disclosed in the embodiment. For example, some components may be deleted from all the components shown in the embodiment. Furthermore, you may combine suitably the component covering different embodiment.

１００…興味分析装置、２００…クライアント端末、３００…コンテンツサーバ、１１０…履歴情報受信部、１２０…特徴スコア算出部、１３０…概念体系更新処理部、１４０…概念体系／ユーザ興味スコアデータベース、１５０…提示コンテンツリスト受信部、１６０…コンテンツデータベース、１７０…コンテンツ評価処理部、１８０…ソート済みコンテンツスコアリスト送信部、１８５…一覧閲覧条件リスト作成部、１９０…条件選択履歴受信部、２１０…履歴収集部、２２０…履歴情報送信部、２３０…コンテンツ提示部、２４０…コンテンツ要求送信部、２５０…一覧閲覧条件リスト要求送信部、２６０…一覧閲覧条件リスト提示部、２７０…閲覧条件選択履歴収集部、２８０…条件選択履歴送信部、３１０…コンテンツ送信処理部、３２０…ソート済み提示コンテンツリスト受信部、３３０…提示コンテンツリスト送信部、３４０…提示コンテンツリスト入力部、３５０…履歴情報転送部、３６０…コンテンツ要求転送部、３７０…一覧閲覧条件リスト要求転送部、３８０…一覧閲覧条件リスト転送部、３９０…条件選択履歴転送部。 DESCRIPTION OF SYMBOLS 100 ... Interest analysis apparatus, 200 ... Client terminal, 300 ... Content server, 110 ... History information receiving part, 120 ... Feature score calculation part, 130 ... Concept system update processing part, 140 ... Concept system / user interest score database, 150 ... Presented content list receiving unit, 160 ... content database, 170 ... content evaluation processing unit, 180 ... sorted content score list transmitting unit, 185 ... list browsing condition list creating unit, 190 ... condition selection history receiving unit, 210 ... history collecting unit , 220 ... history information transmission unit, 230 ... content presentation unit, 240 ... content request transmission unit, 250 ... list browsing condition list request transmission unit, 260 ... list browsing condition list presentation unit, 270 ... browsing condition selection history collection unit, 280 ... Condition selection history transmission unit, 310 ... Content transmission processing 320 ... Sorted presentation content list reception unit 330 ... Presentation content list transmission unit 340 ... Presentation content list input unit 350 ... History information transfer unit 360 ... Content request transfer unit 370 ... List browsing condition list request transfer unit 380 ... List browsing condition list transfer unit, 390 ... Condition selection history transfer unit.

Claims

A method of analyzing a user's interest using a concept system in which user interest scores for a plurality of concepts are systematized by a computer,
Obtaining a first condition list in which a plurality of concepts as candidates for selection of narrowing conditions in the concept system are browsed as a list, and a second condition list including the concept selected by the user from the first condition list And steps to
The total number of concepts included in the first condition list is a first total number, the number of occurrences of the concept selected by the user in the first condition list is a first appearance number, and the second condition When the total number of concepts included in the list is the second total number and the number of occurrences of the concept selected by the user in the second condition list is the second appearance number, the first total number, the first number A first probability that the number of occurrences of the concept in the second condition list is greater than or equal to the second occurrence number under the condition of the occurrence number of 1 and the second total number, and the second occurrence Calculating a second probability that is less than or equal to a number, and calculating a feature score by an inverse function of a cumulative distribution function of a standard normal distribution based on the first probability and the second probability;
An updating step of updating the user interest score for the concept using the feature score.

A method of analyzing a user's interest using a concept system in which user interest scores for a plurality of concepts are systematized by a computer,
Obtaining a first condition list obtained by browsing a plurality of concepts as candidates for selection of exclusion conditions in the concept system, and a second condition list selected by the user from the first condition list; ,
The total number of concepts included in the first condition list is a first total number, the number of occurrences of the concept selected by the user in the first condition list is a first appearance number, and the second condition When the total number of concepts included in the list is the second total number and the number of occurrences of the concept selected by the user in the second condition list is the second appearance number, the first total number, the first number A first probability that the number of occurrences of the concept in the second condition list is greater than or equal to the second occurrence number under the condition of the occurrence number of 1 and the second total number, and the second occurrence Calculating a second probability that is less than or equal to a number, and calculating a feature score by an inverse function of a cumulative distribution function of a standard normal distribution based on the first probability and the second probability;
An updating step of updating the user interest score for the concept using the feature score.

The calculating step specifies a selection path in the concept system based on a concept included in the second condition list, and calculates the feature score for each concept corresponding to each selection occurring in the selection path. The interest analysis method according to claim 1, wherein the method is an interest analysis method.

The interest analysis method according to any one of claims 1 to 3, wherein the updating step updates a user interest score of a subordinate concept of the concept using the feature score.

The content further includes an evaluation step of calculating a user evaluation score for the content using the user interest score of each concept that appears in the content for the content in which one or more concepts appear. 5. The interest analysis method according to any one of items 1 to 4.