JP2014110004A

JP2014110004A - Information processor and information processing method and data structure

Info

Publication number: JP2014110004A
Application number: JP2012265491A
Authority: JP
Inventors: Yoshiaki Akazawa; 由章赤澤
Original assignee: Samsung R&D Institute Japan Co Ltd
Current assignee: Samsung R&D Institute Japan Co Ltd
Priority date: 2012-12-04
Filing date: 2012-12-04
Publication date: 2014-06-12
Anticipated expiration: 2032-12-04
Also published as: JP6074702B2

Abstract

PROBLEM TO BE SOLVED: To provide an information processor capable of easily updating a directed graph constituting a taste model.SOLUTION: An information processor 10 uses a taste model 151 having a data structure in which a determination node in a directed graph includes one attribute information among a plurality of attribute information included in program information and information indicating a determination formula for determining the degree of taste of a user related to the determination node, and a start node is the start point of a program for generating one or more rules for performing predetermined determination related to the taste of the user on the basis of a determination result based on the determination formula associated with the determination node following the start node, with which connection information to one or more determination nodes is associated. The information processor 10 includes a content evaluation part 120 for estimating the taste of the user by inputting the program information, and applying one or more rules generated on the basis of the taste model 151 to the program information.

Description

本発明は、ユーザの嗜好にあったコンテンツを推薦する際に用いて好適な情報処理装置及び情報処理方法並びにデータ構造に関する。 The present invention relates to an information processing apparatus, an information processing method, and a data structure that are suitable for use when recommending content that meets a user's preference.

動画等のコンテンツの視聴や録画を行う際に、コンテンツの内容を表す文字、コード等からなるメタ情報を用いて、ユーザの嗜好にあったコンテンツを自動的に抽出して推薦する技術がある。例えば、特許文献１には、ネットワークを介して他のユーザの嗜好度を表すデータ利用することで、本人の嗜好にあったコンテンツを自動的に選択可能とする技術が示されている。 There is a technique for automatically extracting and recommending content that suits the user's preference using meta-information including characters, codes, and the like representing the content when viewing or recording content such as a moving image. For example, Patent Literature 1 discloses a technology that enables the selection of content that suits the user's preference automatically by using data representing the degree of preference of another user via a network.

また、特許文献２には、ベイジアンネットワーク（ＢａｙｅｓｉａｎＮｅｔｗｏｒｋ）を用いて生成した嗜好モデルを利用してコンテンツの推薦するための技術が示されている。特許文献２に記載されている嗜好モデルは、複数のノードと、複数のノードの親子関係を示す接続情報とから表されている。ここで、複数のノードは、番組情報に含まれている各項目等に対応した確率変数を表す。また、親ノードと子ノードとの間の接続は、因果関係、すなわち原因と結果との関係を表す。これらの各ノード間の関係は、有向グラフで表される。さらに、親ノードと子ノードとの間の因果関係は、条件付き確率で表される。すなわち、親ノードの確率変数で求めた結果を条件として子ノードの確率変数が求められる。この各ノードが表す条件付き確率の値は、ユーザの各項目に対する視聴確率に対応している。なお、特許文献２のシステムでは、嗜好モデル学習部が、過去の一定期間の視聴履歴に基づいて嗜好モデル内の条件付き確率の値を計算する。 Patent Document 2 discloses a technique for recommending content using a preference model generated using a Bayesian Network. The preference model described in Patent Document 2 is represented by a plurality of nodes and connection information indicating a parent-child relationship between the plurality of nodes. Here, the plurality of nodes represent random variables corresponding to items included in the program information. Further, the connection between the parent node and the child node represents a causal relationship, that is, a relationship between the cause and the result. The relationship between each of these nodes is represented by a directed graph. Furthermore, the causal relationship between the parent node and the child node is expressed by a conditional probability. In other words, the child node random variable is obtained on the condition that the result obtained from the parent node random variable is used as a condition. The value of the conditional probability represented by each node corresponds to the viewing probability for each item of the user. In the system of Patent Document 2, the preference model learning unit calculates the value of the conditional probability in the preference model based on the viewing history for a certain period in the past.

特開２００３−１１４９０３号公報JP 2003-114903 A 特開２００８−２６２３９８号公報JP 2008-262398 A

特許文献１に記載されている技術では、他のユーザの嗜好の度合いを示すデータを利用することで、本人の嗜好を推定する際に利用可能な情報量を多くしている。しかし、ネットワーク接続を前提としているため、トラフィック量の増加や、オフライン環境での使用について一定の課題がある。 In the technique described in Patent Document 1, the amount of information that can be used when estimating the user's preference is increased by using data indicating the degree of preference of other users. However, since it is premised on a network connection, there are certain problems regarding an increase in traffic volume and use in an offline environment.

一方、特許文献２に記載されている技術では、例えば新しい項目を嗜好モデルに追加する処理に、次のような制約があるという課題がある。すなわち、特許文献２に記載されている嗜好モデルには、複数のノード間に親子関係が存在する。そのため、例えば、新たにノードを追加したり、削除したりすると、その追加したり削除したりしたノードを親ノードとする子ノードの内容も変更しなければならない。したがって、ノードの追加や削除の処理が過大になってしまうという課題がある。 On the other hand, in the technique described in Patent Document 2, there is a problem that the process of adding a new item to the preference model has the following restrictions, for example. That is, the preference model described in Patent Document 2 has a parent-child relationship between a plurality of nodes. Therefore, for example, when a new node is added or deleted, the contents of the child node whose parent node is the added or deleted node must be changed. Therefore, there is a problem that the process of adding or deleting nodes becomes excessive.

本発明は、上記の事情を考慮してなされたものであり、上記の課題を解決することができる情報処理装置及び情報処理方法並びにデータ構造を提供することを目的とする。 The present invention has been made in view of the above circumstances, and an object thereof is to provide an information processing apparatus, an information processing method, and a data structure that can solve the above-described problems.

上記課題を解決するため、本発明の情報処理装置は、複数のノードと、前記複数のノード間を方向性を有して接続する複数のエッジとで構成された有向グラフで表され、前記複数のノードが開始ノード又は判定ノードであり、前記判定ノードには、複数の属性情報を用いてコンテンツの属性を表すコンテンツ情報に含まれている前記複数の属性情報うちの１つの属性情報と、当該判定ノードに係るユーザの嗜好度合いを判定するための判定式を表す情報と、が対応づけられていて、前記開始ノードは、該開始ノードに後続する１又は複数の前記判定ノードに対応づけられている１又は複数の前記判定式による１又は複数の判定結果に基づいてユーザの嗜好に関する所定の判定を行うための１又は複数のルールを生成するプログラムの開始点であって、１又は複数の前記判定ノードへの接続情報が対応づけられている、データ構造を有する嗜好モデル、を記憶した記憶部と、前記コンテンツ情報を入力し、前記嗜好モデルに基づいて生成された１又は複数の前記ルールを該コンテンツ情報に対してあてはめることで、ユーザの嗜好を推定する推定部とを備える。 In order to solve the above-described problem, an information processing apparatus according to the present invention is represented by a directed graph including a plurality of nodes and a plurality of edges that connect the plurality of nodes with directionality, and the plurality of nodes The node is a start node or a determination node, and the determination node includes one attribute information among the plurality of attribute information included in the content information representing the attribute of the content using the plurality of attribute information, and the determination Information indicating a determination formula for determining a user's preference degree related to a node is associated with the start node, and the start node is associated with one or more determination nodes subsequent to the start node. At the starting point of a program that generates one or a plurality of rules for making a predetermined determination regarding the user's preference based on one or a plurality of determination results based on one or a plurality of the determination formulas A storage unit storing a preference model having a data structure associated with connection information to one or a plurality of the determination nodes, and the content information is input and generated based on the preference model And an estimation unit that estimates the user's preference by applying one or more of the rules to the content information.

また、本発明の他の情報処理装置は、前記判定ノードには、さらに、当該判定ノードに対応づけられている前記判定式の信頼性を表すための所定のパラメータが対応づけられていて、前記嗜好モデルに基づいて生成された前記ルールには、該ルールを生成する際に基準とした前記１又は複数の判定結果を求めた前記１又は複数の判定式の信頼性を表す前記１又は複数の所定のパラメータを用いて求めた値であって、該ルールの信頼度を表す値が設定されていて、前記推定部が、前記１又は複数のルールを前記コンテンツ情報に対してあてはめる際に、前記各ルールに設定されている前記各信頼度を表す値を利用することを特徴とする。 Further, in another information processing apparatus of the present invention, the determination node is further associated with a predetermined parameter for representing the reliability of the determination expression associated with the determination node. In the rule generated based on the preference model, the one or more representing the reliability of the one or more determination formulas obtained from the one or more determination results used as a reference when generating the rule A value obtained by using a predetermined parameter, and a value representing the reliability of the rule is set, and when the estimation unit applies the one or more rules to the content information, A value representing each reliability set in each rule is used.

また、本発明の他の情報処理装置は、前記嗜好モデルが、複数の前記コンテンツ情報に基づいて生成又は更新されたものであり、前記判定ノードに対応づけられている前記所定のパラメータが、前記嗜好モデルを生成又は更新する際に基準とされた前記複数のコンテンツ情報に当該判定ノードに対応づけられている前記属性情報が出現した頻度に応じたものであることを特徴とする。 In another information processing apparatus of the present invention, the preference model is generated or updated based on a plurality of pieces of the content information, and the predetermined parameter associated with the determination node is It is characterized in that it corresponds to the frequency of appearance of the attribute information associated with the determination node in the plurality of pieces of content information used as a reference when generating or updating a preference model.

また、本発明の他の情報処理装置は、前記推定部が、遺伝的アルゴリズムを用いて、前記嗜好モデルに対して進化処理を行いつつ、前記ユーザの嗜好を推定することを特徴とする。 Further, another information processing apparatus according to the present invention is characterized in that the estimation unit estimates the user's preference while performing an evolution process on the preference model using a genetic algorithm.

また、本発明の情報処理方法は、複数のノードと、前記複数のノード間を方向性を有して接続する複数のエッジとで構成された有向グラフで表され、前記複数のノードが開始ノード又は判定ノードであり、前記判定ノードには、複数の属性情報を用いてコンテンツの属性を表すコンテンツ情報に含まれている前記複数の属性情報うちの１つの属性情報と、当該判定ノードに係るユーザの嗜好度合いを判定するための判定式を表す情報と、が対応づけられていて、前記開始ノードは、該開始ノードに後続する１又は複数の前記判定ノードに対応づけられている１又は複数の前記判定式による１又は複数の判定結果に基づいてユーザの嗜好に関する所定の判定を行うための１又は複数のルールを生成するプログラムの開始点であって、１又は複数の前記判定ノードへの接続情報が対応づけられている、データ構造を有する嗜好モデル、を所定の記憶部に記憶し、推定部によって、前記コンテンツ情報を入力し、前記嗜好モデルに基づいて生成された１又は複数の前記ルールを該コンテンツ情報に対してあてはめることで、ユーザの嗜好を推定することを特徴とする。 The information processing method of the present invention is represented by a directed graph including a plurality of nodes and a plurality of edges that connect the plurality of nodes with directionality, and the plurality of nodes are start nodes or A determination node, wherein the determination node includes one attribute information of the plurality of attribute information included in the content information representing the attribute of the content using the plurality of attribute information, and the user of the determination node Information indicating a determination formula for determining the degree of preference, and the start node is associated with one or a plurality of the determination nodes subsequent to the start node. A starting point of a program for generating one or more rules for making a predetermined determination related to a user's preference based on one or more determination results based on a determination formula. A preference model having a data structure associated with connection information to the determination node is stored in a predetermined storage unit, the content information is input by the estimation unit, and generated based on the preference model The user's preference is estimated by applying one or a plurality of the rules to the content information.

また、本発明のデータ構造は、ユーザの嗜好モデルを表すためのデータ構造であって、複数のノードと、前記複数のノード間を方向性を有して接続する複数のエッジとで構成された有向グラフで表され、前記複数のノードが開始ノード又は判定ノードであり、前記判定ノードには、複数の属性情報を用いてコンテンツの属性を表すコンテンツ情報に含まれている前記複数の属性情報うちの１つの属性情報と、当該判定ノードに係るユーザの嗜好度合いを判定するための判定式を表す情報と、が対応づけられていて、前記開始ノードは、該開始ノードに後続する１又は複数の前記判定ノードに対応づけられている１又は複数の前記判定式による１又は複数の判定結果に基づいてユーザの嗜好に関する所定の判定を行うための１又は複数のルールを生成するプログラムの開始点であって、１又は複数の前記判定ノードへの接続情報が対応づけられている、ことを特徴とする。 The data structure of the present invention is a data structure for representing a user's preference model, and includes a plurality of nodes and a plurality of edges that connect the plurality of nodes with directionality. Represented by a directed graph, wherein the plurality of nodes are start nodes or determination nodes, and the determination node includes a plurality of pieces of attribute information included in the content information representing content attributes using the plurality of attribute information. One attribute information is associated with information representing a determination formula for determining the degree of preference of the user related to the determination node, and the start node is one or more of the one or more of the above-described start nodes One or more routes for making a predetermined determination regarding the user's preference based on one or more determination results based on one or more of the determination formulas associated with the determination node A starting point of the program that generates the connection information to one or more of the determining node is associated, characterized in that.

また、本発明の他のデータ構造は、さらに、当該判定ノードに対応づけられている前記判定式の信頼性を表すための所定のパラメータが対応づけられていることを特徴とする。 In addition, another data structure of the present invention is further characterized in that a predetermined parameter for representing the reliability of the determination formula associated with the determination node is associated.

本発明によれば、各判定ノードに対応づけられた情報と、他の判定ノードに対応づけられた情報とに親子関係は必要としない。よって、嗜好モデルに対する判定ノードの追加や削除の処理を容易に行うことができる。また、本発明の嗜好モデルを生成したり、更新したりするための情報は、番組情報等のコンテンツ情報から取得することができるので、上述したようなトラフィック量の増加や、オフライン環境での使用上の制約といった課題も生じない。 According to the present invention, there is no need for a parent-child relationship between information associated with each determination node and information associated with other determination nodes. Therefore, it is possible to easily add or delete determination nodes from the preference model. In addition, since the information for generating or updating the preference model of the present invention can be obtained from content information such as program information, the traffic volume is increased as described above, or used in an offline environment. There is no problem of the above restrictions.

本発明の一実施形態の構成を示したブロック図である。It is the block diagram which showed the structure of one Embodiment of this invention. 図１に示した嗜好モデル１５１の構成例を説明するための模式図である。It is a schematic diagram for demonstrating the structural example of the preference model 151 shown in FIG. 図２に示した有向グラフ１５１ａを表すデータの一例を説明するための説明図である。It is explanatory drawing for demonstrating an example of the data showing the directed graph 151a shown in FIG. 図２に示した判定ノード内容情報１５１ｂの構成例を説明するための模式図である。It is a schematic diagram for demonstrating the structural example of the determination node content information 151b shown in FIG. 図２に示した開始ノード１２１ａ１と判定ノード１２１ａ２の具体例を説明するための説明図である。FIG. 3 is an explanatory diagram for describing specific examples of a start node 121a1 and a determination node 121a2 illustrated in FIG. 2; 図５に示した各判定ノード１２１ａ２に対応づけられている判定ノード内容情報１５１ｂの具体例を説明するための説明図である。FIG. 6 is an explanatory diagram for describing a specific example of determination node content information 151b associated with each determination node 121a2 illustrated in FIG. 5; 図２に示した有向グラフ１５１ａから生成される可能性があるルールを説明するための説明図である。It is explanatory drawing for demonstrating the rule which may be produced | generated from the directed graph 151a shown in FIG. 図２に示した嗜好モデル１５１から生成されるルールの例を説明するための説明図である。It is explanatory drawing for demonstrating the example of the rule produced | generated from the preference model 151 shown in FIG. 図１に示した情報処理装置１０の動作例を説明するためのフローチャートである。3 is a flowchart for explaining an operation example of the information processing apparatus 10 illustrated in FIG. 1. 図９のステップＳ１０３で行われるコンテンツ評価処理の内容を説明するためのフローチャートである。It is a flowchart for demonstrating the content of the content evaluation process performed by FIG.9 S103. 図１０のステップＳ２０２で行われる淘汰処理の内容を説明するための説明図である。It is explanatory drawing for demonstrating the content of the wrinkle process performed by step S202 of FIG. 図１０のステップＳ２０４で行われる進化処理の１つである交叉処理の一例を説明するための説明図である。It is explanatory drawing for demonstrating an example of the crossover process which is one of the evolution processes performed by step S204 of FIG. 図１０のステップＳ２０４で行われる進化処理の１つである突然変異処理の一例を説明するための説明図である。It is explanatory drawing for demonstrating an example of the mutation process which is one of the evolution processes performed by step S204 of FIG. 図１０のステップＳ２０５で行われる嗜好推定処理の内容を説明するための説明図である。It is explanatory drawing for demonstrating the content of the preference estimation process performed by FIG.10 S205.

以下、図面を参照して本発明の一実施の形態について説明する。図１は、本発明の一実施形態としての情報処理システム（情報処理装置）１０の構成例を示したブロック図である。情報処理システム１０は、コンテンツ表示機器１００とコンテンツ制御機器２００とを備えている。また、図１に示した情報処理システム１０内の各構成は、例えば、１又は複数のＣＰＵ（中央処理装置）と、その周辺装置と、そのＣＰＵで実行されるプログラムとの組み合わせによって実現することができる。コンテンツ表示機器１００は、コンテンツ受信部１１０、コンテンツ出力部１１５、コンテンツ評価部１２０、推薦結果表示選択部１３０、コンテンツ閲覧ログ生成部１４０、嗜好モデル生成部１５０、及びトレーニング情報設定部１６０を備えている。 Hereinafter, an embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing a configuration example of an information processing system (information processing apparatus) 10 as an embodiment of the present invention. The information processing system 10 includes a content display device 100 and a content control device 200. Each configuration in the information processing system 10 illustrated in FIG. 1 is realized by a combination of one or a plurality of CPUs (central processing units), peripheral devices thereof, and programs executed by the CPUs. Can do. The content display device 100 includes a content reception unit 110, a content output unit 115, a content evaluation unit 120, a recommendation result display selection unit 130, a content browsing log generation unit 140, a preference model generation unit 150, and a training information setting unit 160. Yes.

コンテンツ表示機器１００は、例えば、ＴＶ（テレビジョン）、スマートフォン、携帯電話機やＰＣ（パーソナルコンピュータ）のような情報端末である。あるいは、コンテンツ表示機器１００は、ＴＶ等の情報端末と、セットトップボックスやビデオ録画機器等のコンテンツ受信・録画機器とを組み合わせたものであってもよい。 The content display device 100 is, for example, an information terminal such as a TV (television), a smartphone, a mobile phone, or a PC (personal computer). Alternatively, the content display device 100 may be a combination of an information terminal such as a TV and a content reception / recording device such as a set top box or a video recording device.

コンテンツ表示機器１００は、コンテンツ受信部１１０とコンテンツ出力部１１５とによって、コンテンツを受信して表示することが可能な機器である。コンテンツ受信部１１０は、推薦結果表示選択部１３０からの指示に従って受信するコンテンツを選択し、放送波やネットワーク等の所定の媒体を介してコンテンツを受信する。そして、コンテンツ受信部１１０は、受信したコンテンツを表す信号を、所定形式の映像信号や音声信号に変換する等してコンテンツ出力部１１５に対して出力する。コンテンツ出力部１１５は、ディスプレイやスピーカを有して構成されていて、入力された映像信号や音声信号に基づいて映像表示や音声出力を行う。また、コンテンツ受信部１１０は、電子番組情報等の、複数の属性情報を用いてコンテンツの属性を表すコンテンツ情報（以下、番組情報と呼ぶ）を受信して、コンテンツ評価部１２０へ出力する。 The content display device 100 is a device capable of receiving and displaying content by the content receiving unit 110 and the content output unit 115. The content receiving unit 110 selects content to be received in accordance with an instruction from the recommendation result display selection unit 130, and receives the content via a predetermined medium such as a broadcast wave or a network. Then, the content receiving unit 110 outputs a signal representing the received content to the content output unit 115 by converting it into a video signal or audio signal in a predetermined format. The content output unit 115 includes a display and a speaker, and performs video display and audio output based on the input video signal and audio signal. In addition, the content receiving unit 110 receives content information (hereinafter referred to as program information) representing content attributes using a plurality of attribute information such as electronic program information, and outputs the content information to the content evaluation unit 120.

コンテンツ評価部１２０（推定部）は、受信したコンテンツに対するユーザの嗜好（すなわちユーザの好み）を評価して、ユーザに推薦する番組情報の一覧情報を作成する。一方、嗜好モデル生成部１５０は、ユーザのコンテンツに関しての嗜好（好き嫌い）を表すモデルである、嗜好モデル１５１を生成・更新する。コンテンツ評価部１２０は、嗜好モデル生成部１５０が生成した嗜好モデル１５１を用いてコンテンツを評価し、ユーザの嗜好に合いそうだと推定した１又は複数の番組情報を、コンテンツの推薦結果として、推薦結果表示選択部１３０へ出力する。 The content evaluation unit 120 (estimation unit) evaluates the user's preference (that is, the user's preference) with respect to the received content, and creates program information list information recommended to the user. On the other hand, the preference model generation unit 150 generates / updates a preference model 151 that is a model representing a preference (likes and dislikes) regarding a user's content. The content evaluation unit 120 evaluates the content using the preference model 151 generated by the preference model generation unit 150, and uses one or more program information estimated to be suitable for the user's preference as a recommendation result of the content. The data is output to the display selection unit 130.

推薦結果表示選択部１３０は、コンテンツ評価部１２０の評価結果に基づいて、コンテンツの推薦結果を表示する。また、推薦結果表示選択部１３０は、トレーニング情報設定部１６０の指示に従って、嗜好モデル１５１のトレーニング用の番組情報を表示する。その際、推薦結果表示選択部１３０は、コンテンツ評価部１２０から受信したコンテンツの推薦結果を表す文字や画像、又はトレーニング情報設定部１６０から受信した番組情報を、コンテンツ受信部１１０を介してコンテンツ出力部１１５から出力する。また、推薦結果表示選択部１３０は、コンテンツ制御機器２００から受信した所定の制御信号に基づいて選択するコンテンツを決定し、決定した結果をコンテンツ受信部１１０へ出力する。ここで、コンテンツ受信部１１０は、上述したように、選択されたコンテンツを所定の媒体を介して受信し、所定の映像信号や音声信号に変換する等してコンテンツ出力部１１５から出力する。 The recommendation result display selection unit 130 displays the content recommendation result based on the evaluation result of the content evaluation unit 120. Further, the recommendation result display selection unit 130 displays the program information for training of the preference model 151 in accordance with the instruction of the training information setting unit 160. At this time, the recommendation result display selection unit 130 outputs, via the content receiving unit 110, the content or image representing the content recommendation result received from the content evaluation unit 120 or the program information received from the training information setting unit 160. Output from the unit 115. In addition, the recommendation result display selection unit 130 determines content to be selected based on a predetermined control signal received from the content control device 200, and outputs the determined result to the content reception unit 110. Here, as described above, the content receiving unit 110 receives the selected content via a predetermined medium, converts it into a predetermined video signal or audio signal, and outputs it from the content output unit 115.

コンテンツ閲覧ログ生成部１４０は、推薦結果表示選択部１３０が表示したコンテンツ推薦結果に対するコンテンツ制御部２１０による選択等の制御内容に基づいて、ユーザの嗜好に関するログを生成する。すなわち、コンテンツ閲覧ログ生成部１４０は、ユーザが表示や録画を指示したコンテンツの番組情報を複数、記録する。このコンテンツ閲覧ログ生成部１４０によって記録された番組情報は、嗜好モデル生成部１５０が嗜好モデル１５１を更新する際に、例えばユーザの好みにあったコンテンツを表すトレーニング用の番組情報として使用される。 The content browsing log generation unit 140 generates a log relating to user preferences based on control details such as selection by the content control unit 210 for the content recommendation result displayed by the recommendation result display selection unit 130. That is, the content browsing log generation unit 140 records a plurality of pieces of program information of content that the user has instructed to display or record. The program information recorded by the content browsing log generation unit 140 is used, for example, as training program information representing content that meets the user's preference when the preference model generation unit 150 updates the preference model 151.

トレーニング情報設定部１６０は、トレーニング用の番組情報を、推薦結果表示選択部１３０等を介してコンテンツ出力部１１５から表示する。トレーニング情報設定部１６０は、さらに、そのトレーニング番組情報の表示に対するユーザの操作に応じてコンテンツ制御機器２００から受信した所定の制御信号に基づき、当該番組情報に対してユーザの嗜好（“好き”、“嫌い”等）を設定し、設定した結果を番組情報とともに、嗜好モデル生成部１５０へ出力する。 The training information setting unit 160 displays training program information from the content output unit 115 via the recommendation result display selection unit 130 and the like. Further, the training information setting unit 160, based on a predetermined control signal received from the content control device 200 in response to a user operation for displaying the training program information, the user's preference (“like”, “I dislike” is set, and the set result is output to the preference model generation unit 150 together with the program information.

嗜好モデル生成部１５０は、予め用意されているトレーニング用の番組情報（すなわち好き嫌いを表す正解の評価結果を表す情報が予め対応づけられている番組情報）を用いて、あるいは、コンテンツ閲覧ログ生成部１４０が生成したログやトレーニング情報設定部１６０が設定したトレーニング用の番組情報を用いて、ユーザの嗜好モデル１５１を生成したり、更新したりする。この嗜好モデル１５１は、嗜好モデル生成部１５０内の所定の記憶部に記憶される。また、嗜好モデル生成部１５０は、嗜好モデル１５１を生成したり、更新したりする際に使用したトレーニング用の番組情報を、その好き嫌いの評価結果とともに、所定の記憶部に記憶する。 The preference model generation unit 150 uses pre-prepared training program information (that is, program information in which information indicating correct evaluation results indicating likes and dislikes is associated in advance) or a content browsing log generation unit The user's preference model 151 is generated or updated using the log generated by 140 or the program information for training set by the training information setting unit 160. The preference model 151 is stored in a predetermined storage unit in the preference model generation unit 150. In addition, the preference model generation unit 150 stores training program information used when the preference model 151 is generated or updated, together with the evaluation results of likes and dislikes, in a predetermined storage unit.

一方、コンテンツ制御機器２００は、ＴＶのリモコンや携帯電話機のタッチパネルやＰＣのマウスとして構成されている。コンテンツ制御機器２００は、操作部や信号送信部を有するコンテンツ制御部２１０を備えている。コンテンツ制御機器２００では、このコンテンツ制御部２１０が、ユーザの所定の操作を入力し、その入力結果を示す所定の制御信号を推薦結果表示部１３０やトレーニング情報設定部１６０に対して出力する。具体的には、コンテンツ制御部２１０は、ユーザのコンテンツに対する選択、表示指示等の操作や、トレーニング用の番組情報に対する“好き”、“嫌い”等のユーザの選択、設定指示等の操作情報を入力し、それらの操作の内容に応じた制御信号を出力する。なお、コンテンツ制御機器２００は、図１に示したようにコンテンツ表示機器１００とは別の装置として構成されていてもよいし、あるいは、コンテンツ表示機器１００と一体として構成されていてもよい。 On the other hand, the content control device 200 is configured as a TV remote control, a mobile phone touch panel, or a PC mouse. The content control device 200 includes a content control unit 210 having an operation unit and a signal transmission unit. In the content control device 200, the content control unit 210 inputs a user's predetermined operation and outputs a predetermined control signal indicating the input result to the recommendation result display unit 130 and the training information setting unit 160. Specifically, the content control unit 210 performs operation information such as selection and display instructions for the user's content, selection of the user such as “like” and “dislike” for the program information for training, and setting instructions. Input and output control signals according to the contents of those operations. Note that the content control device 200 may be configured as an apparatus different from the content display device 100 as illustrated in FIG. 1, or may be configured integrally with the content display device 100.

ここで、図２〜図８を参照して、図１に示した嗜好モデル１５１の構成例等について説明する。図２は、図１に示した嗜好モデル１５１の構成例を示した模式図である。嗜好モデル１５１は、ユーザのコンテンツに対する嗜好（好き、好き嫌い等）を典型的に表す型である。嗜好モデル１５１に基づいて、未知の番組情報に対するユーザの嗜好を推定する際に所定の判定処理を行うための複数のルールが生成される。この嗜好モデル１５１は、学習によって内容を更新することができるモデルであるが、初期のモデルは例えば次のようにして生成することができる。すなわち、予め用意された複数の番組情報に対して、予め“好き”、“嫌い”等の嗜好が設定されているもの、あるいは、ユーザによって新たに“好き”、“嫌い”等の嗜好を設定してもらったものを、トレーニング用の番組情報として用いることで生成することができる。ここで、予め用意する嗜好の設定がなされたトレーニング用の番組情報は、例えば、年代別、性別等に応じて複数用意しておくこともできる。また、嗜好モデル１５１自体も、ゼロから生成するのではなく、一定のトレーニングが行われた状態を初期状態として用意しておくこともできる。この初期モデルを作成する際に使用した番組情報は、嗜好モデル生成部１５０内の所定の記憶部に記憶しておく。 Here, with reference to FIGS. 2 to 8, a configuration example of the preference model 151 shown in FIG. 1 will be described. FIG. 2 is a schematic diagram showing a configuration example of the preference model 151 shown in FIG. The preference model 151 is a type that typically represents a user's preference (like, likes and dislikes) for content. Based on the preference model 151, a plurality of rules for performing a predetermined determination process when a user's preference for unknown program information is estimated is generated. The preference model 151 is a model whose contents can be updated by learning, but an initial model can be generated as follows, for example. In other words, a preference such as “like” or “dislike” is set in advance for a plurality of program information prepared in advance, or a new preference such as “like” or “dislike” is set by the user. What is requested can be generated by using it as program information for training. Here, a plurality of training program information for which preference settings are prepared in advance may be prepared in accordance with, for example, age group and sex. In addition, the preference model 151 itself is not generated from zero, but a state in which a constant training is performed can be prepared as an initial state. The program information used when creating the initial model is stored in a predetermined storage unit in the preference model generation unit 150.

図２に示したように、嗜好モデル１５１は、嗜好モデル１５１内の複数のノード１５１ａ１又は１５１ａ２間の接続関係を示す有向グラフ１５１ａを表す情報と、複数の判定ノード１５１ａ２に対応づけられている情報の内容をノード毎に示すデータである複数の判定ノード内容情報１５１ｂとで表現される。すなわち、嗜好モデル１５１は、複数のノード１５１ａ１又は１５１ａ２と、複数のノード１５１ａ１又は１５１ａ２間を方向性を有して接続する複数のエッジ１５１ａ３とで構成された有向グラフ１５１ａと、複数の判定ノード内容情報１５１ｂとから表されるデータ構造を有している。また、複数のノード１５１ａ１又は１５１ａ２は、開始ノード１５１ａ１と、判定ノード１５１ａ２とに分類される。これらの有向グラフ１５１ａの構造と、判定ノード１５１ａ２に対応づけられている情報とに基づいて、ユーザの嗜好に関する所定の判定を行うため複数のルールが生成される。 As illustrated in FIG. 2, the preference model 151 includes information indicating a directed graph 151 a indicating a connection relationship between the plurality of nodes 151 a 1 or 151 a 2 in the preference model 151 and information associated with the plurality of determination nodes 151 a 2. It is expressed by a plurality of pieces of determination node content information 151b which is data indicating the content for each node. That is, the preference model 151 includes a directed graph 151a including a plurality of nodes 151a1 or 151a2 and a plurality of edges 151a3 that connect the plurality of nodes 151a1 or 151a2 with directionality, and a plurality of pieces of determination node content information. 151b. The plurality of nodes 151a1 or 151a2 are classified into a start node 151a1 and a determination node 151a2. Based on the structure of the directed graph 151a and the information associated with the determination node 151a2, a plurality of rules are generated to make a predetermined determination regarding the user's preference.

開始ノード１５１ａ１は、ルールを作成するプログラム（あるいは計算手順）の開始地点を示し、判定ノード１５１ａ２につながるノードである。判定ノード１５１ａ２は、番組を表すための「属性情報」を一つ対応づけられたノードであり、このノード単体での「判定式」とその判定式の「信頼度に対応する所定のパラメータ」とが対応づけられている。このとき、番組を表すための属性情報とは、例えば“ドラマ”や“放送時刻”、“チャンネル”など、番組の特徴を表す情報、性質を表す情報等、番組に関する情報のことを指す。この場合において判定ノード１５１ａ２に対応づけられた判定式とは、当該判定ノードに対応づけられた属性情報を番組情報が含む場合に、すなわち“好き嫌い”の程度を表す値を出力する数式（あるいはプログラム）である、嗜好の度合いについて判定を下す（あるいは数値化する）ための数式や条件のことを指す。各判定ノードでは、この判定式によって、当該ノード単体で“好き”か“嫌い”かの度合いを算出することができる。信頼度に対応する所定のパラメータとは、判定式の信頼性を表現するためのパラメータであり、例えば、当該判定ノードに対応づけられた属性情報がトレーニング用の番組情報においてどのくらい出現しているかという頻度にもとづき算出される。この「信頼度に対応する所定のパラメータ」を用いることで、例えば頻度が多い場合に信頼度が高く、少ない場合に信頼度が低いというように仮定し、当該判定式から得た結果に相対的な重み付けをする等を行うことできる。嗜好モデル１５１は、この２種類のノードと有向グラフを使用して、番組の好き嫌いを判定するためのルールを表す論理式を生成する遺伝子構造を構築する。 The start node 151a1 indicates a start point of a program (or calculation procedure) for creating a rule, and is a node connected to the determination node 151a2. The determination node 151a2 is a node associated with one piece of “attribute information” for representing a program. The “determination formula” of this node alone and the “predetermined parameter corresponding to the reliability” of the determination formula are Are associated. At this time, the attribute information for representing the program refers to information related to the program, such as information representing the characteristics of the program, information representing the nature, such as “drama”, “broadcast time”, and “channel”. In this case, the determination expression associated with the determination node 151a2 is a mathematical expression (or a program that outputs a value indicating the degree of “like and dislike” when the program information includes attribute information associated with the determination node. ), Which is a mathematical expression or condition for determining (or digitizing) the degree of preference. Each determination node can calculate the degree of “like” or “dislike” for the node alone by using this determination formula. The predetermined parameter corresponding to the reliability is a parameter for expressing the reliability of the determination formula. For example, how much attribute information associated with the determination node appears in the training program information. Calculated based on frequency. By using this “predetermined parameter corresponding to the reliability”, for example, it is assumed that the reliability is high when the frequency is high and the reliability is low when the frequency is low, and relative to the result obtained from the determination formula. Weighting can be performed. The preference model 151 uses these two types of nodes and the directed graph to construct a gene structure that generates a logical expression that represents a rule for determining liking or dislike of a program.

なお、図２において、開始ノード１５１ａ１は、六角形の記号で示されていて、その記号内の数字“１”〜“３”は、各開始ノード１５１ａ１に設定されたＩＤ（識別符号）を表している。また、判定ノード１５１ａ２は、丸形の記号で示されていて、その記号内の数字“１”〜“８”は、各判定ノード１５１ａ２に設定されたＩＤを表している。 In FIG. 2, the start node 151a1 is indicated by a hexagonal symbol, and the numbers “1” to “3” in the symbol represent IDs (identification codes) set to the start nodes 151a1. ing. Further, the decision node 151a2 is indicated by a round symbol, and the numbers “1” to “8” in the symbol represent the ID set for each decision node 151a2.

図２に示した有向グラフ１５１ａにおける各開始ノード１５１ａ１、各判定ノード１５１ａ２及び各有向エッジ１５１ａ３との接続関係は、例えば、図３に示した２種類のテーブルで表すことができる。すなわち、図３（ａ）に示したように、各開始ノード１５１ａ１と、その各開始ノード１５１ａ１に直接接続されている１又は複数の判定ノード１５１ａ２との接続関係は、開始ノード毎に、開始ノードのＩＤと、その開始ノードに直接接続された１又は複数の接続先の判定ノードのＩＤとの対応付けを示すテーブルで表すことができる。図３（ａ）のテーブルは、例えば開始ノードＩＤが“１”の開始ノード１５１ａ１（以下、“開始ノード１”とも表記する）が、ＩＤが“４”の判定ノード１５１ａ２（以下、“判定ノード４”とも表記する）に、開始ノード１から判定ノード４への方向で直接接続されていることを示している。また、開始ノードＩＤが“２”の開始ノード１５１ａ１（以下、“開始ノード２”とも表記する（以下、同様））が、ＩＤが“２”、“５”及び“７”の３個の判定ノード１５１ａ２に、開始ノードから各判定ノードへの方向で直接接続されていることを示している。 The connection relationship between each start node 151a1, each determination node 151a2, and each directed edge 151a3 in the directed graph 151a illustrated in FIG. 2 can be represented by, for example, two types of tables illustrated in FIG. That is, as shown in FIG. 3A, the connection relationship between each start node 151a1 and one or more determination nodes 151a2 directly connected to each start node 151a1 is as follows. And the ID of one or more connection destination determination nodes directly connected to the start node can be represented by a table. In the table of FIG. 3A, for example, a start node 151a1 (hereinafter also referred to as “start node 1”) having a start node ID “1” is a determination node 151a2 (hereinafter “determination node”) having an ID “4”. 4 ") indicates that the connection is made directly in the direction from the start node 1 to the determination node 4. Further, the start node 151a1 having the start node ID “2” (hereinafter also referred to as “start node 2” (hereinafter the same)) has three determinations with IDs “2”, “5”, and “7”. This indicates that the node 151a2 is directly connected in the direction from the start node to each determination node.

一方、各判定ノード１５１ａ２間の接続関係は、図３（ｂ）に示したように、判定ノード毎に、判定ノードのＩＤと、その判定ノードに直接接続されている１又は複数の接続先の判定ノードのＩＤとの対応関係を示すテーブルで表すことができる。図３（ｂ）のテーブルは、例えば判定ノード１に、ＩＤが“３”の判定ノード１５１ａ２（以下、“判定ノード３”とも表記する（以下、同様））が判定ノード１から判定ノード３への方向に直接接続されていることを示している。また判定ノード２には、判定ノード２から他の判定ノードへと向かう方向では、他の判定ノードが接続されていないことを示している。また、判定ノード４には、判定ノード１と判定ノード８との２個の判定ノードが、判定ノード１から判定ノード１又は判定ノード８への方向に直接接続されていることを示している。 On the other hand, as shown in FIG. 3B, the connection relationship between each determination node 151a2 is the ID of the determination node and one or a plurality of connection destinations directly connected to the determination node. It can be represented by a table showing the correspondence with the ID of the determination node. In the table of FIG. 3B, for example, the decision node 1 has a decision node 151 a 2 with ID “3” (hereinafter also referred to as “determination node 3” (hereinafter, the same)) from the decision node 1 to the decision node 3. It shows that it is directly connected in the direction of. In addition, the determination node 2 indicates that no other determination node is connected in the direction from the determination node 2 to the other determination node. In addition, it is shown that two determination nodes, determination node 1 and determination node 8, are directly connected to determination node 4 in the direction from determination node 1 to determination node 1 or determination node 8.

なお、有向グラフ１５１の構成は、図３に示した２種類のテーブルで表すことができるほか、１種類のテーブルで表すこともできる。すなわち、ＩＤだけではなく、ノードの種類を表す情報を追加して用いることで、開始ノード毎又は判定ノード毎の区別ではなく開始ノードと判定ノードに共通のノード毎の区別として、各ノード間の接続関係を表すことができる。また、有向グラフ１５１内のノードは、上述したように開始ノードと判定ノードとの２種類を設けるのではなく、例えばノードをすべて１種類の判定ノードとして、判定ノードの一部にプログラム（あるいはルール）の開始点を表す情報を付加することで、判定ノードの一部に開始ノードの機能を持たせるようにすることもできる。 The configuration of the directed graph 151 can be represented by the two types of tables shown in FIG. In other words, by using not only the ID but also the information indicating the type of the node, it is possible not to distinguish between each start node or each decision node, but to distinguish between each node common to the start node and the decision node. A connection relationship can be expressed. Further, as described above, the nodes in the directed graph 151 are not provided with two types of nodes, i.e., a start node and a determination node. By adding information representing the starting point of the above, it is possible to give a part of the determination node the function of the starting node.

一方、図２に示した判定ノード内容情報１５１ｂは、例えば図４に示したようなデータ形式で構成することができる。図４に示した例では、判定ノード内容情報１５１ｂが、判定ノードＩＤを表すフィールド１５１ｂ１、属性情報を表すフィールド１５１ｂ２、判定式を表すフィールド１５１ｂ３、及び頻度情報を表すフィールド１５１ｂ４を有して構成されている。この例では、“頻度情報”が上述した“判定式の信頼度に対応する所定のパラメータ”に対応している。この“頻度情報”は、嗜好モデル１５１を生成又は更新する際に使用されたトレーニング用の複数の番組情報に、当該判定ノードに対応づけられている属性情報が出現した頻度を表す情報である。ただし、この頻度情報に代えて、頻度情報に対して所定の計算処理を行うことで判定式の信頼度を表す値を算出した結果を、判定ノード内容情報１５１ｂに含ませることもできる。例えば、各頻度の値を、全判定ノードの中で最も多い頻度の値で除した値（あるいはその値にさらに非線形に重み付けをした値）等を、“判定式の信頼度に対応する所定のパラメータ”とすることができる。“判定式の信頼度に対応する所定のパラメータ”は、例えば“０”〜“１”の実数として、“０”が信頼度が最も低く、“１”が信頼度が最も高いというような値に設定することができる。 On the other hand, the determination node content information 151b shown in FIG. 2 can be configured in a data format as shown in FIG. 4, for example. In the example illustrated in FIG. 4, the determination node content information 151b includes a field 151b1 indicating a determination node ID, a field 151b2 indicating attribute information, a field 151b3 indicating a determination expression, and a field 151b4 indicating frequency information. ing. In this example, “frequency information” corresponds to the “predetermined parameter corresponding to the reliability of the determination formula” described above. This “frequency information” is information indicating the frequency of appearance of attribute information associated with the determination node in a plurality of training program information used when the preference model 151 is generated or updated. However, instead of the frequency information, a result of calculating a value representing the reliability of the determination formula by performing a predetermined calculation process on the frequency information may be included in the determination node content information 151b. For example, a value obtained by dividing the value of each frequency by the value of the highest frequency among all the determination nodes (or a value obtained by further weighting the value in a non-linear manner), etc. Parameter ". The “predetermined parameter corresponding to the reliability of the judgment formula” is, for example, a value such that “0” has the lowest reliability and “1” has the highest reliability as real numbers “0” to “1”. Can be set to

なお、属性情報は、図４に示したように、属性情報の種類と内容との組み合わせによって表してもよいし、種類に関わらないキーワードによって表してもよいし、あるいは、両方の構成を混在させてもよい。また、頻度情報は、例えば図４に示したように、嗜好モデル１５１を生成、更新等する際のトレーニングにおいて、“好き”と設定された番組情報に含まれていた頻度と、“嫌い”と設定された番組情報に含まれていた頻度とに分けて含むものとすることができる。あるいは、頻度情報は、“好き嫌い”の設定とは無関係に、トレーニングで使用された番組情報に当該属性情報が出現した頻度を表すものとしてもよい。 As shown in FIG. 4, the attribute information may be represented by a combination of the type and content of the attribute information, may be represented by a keyword that is not related to the type, or both configurations are mixed. May be. Further, for example, as shown in FIG. 4, the frequency information includes the frequency included in the program information set as “like” and the “dislike” in training when the preference model 151 is generated and updated. It can be divided and included in the frequency included in the set program information. Alternatively, the frequency information may represent the frequency at which the attribute information appears in the program information used in the training regardless of the “like and dislike” setting.

また、判定式は、例えば次のような形式とすることができる。すなわち、判定式は、判定式の値（すなわち計算結果）が、例えば“−１”〜“１”の実数の範囲で、“−１”が“嫌い”の確率が最も高く、“１”が“好き”の確率が最も高いという値を出力するものとすることができる。判定式で用いる変数（すなわちパラメータ）としては、例えば、頻度情報を用いてもよいし、あるいは属性情報の種類や内容に応じて予め設定された値（すなわち定数）を用いてもよい。例えば、頻度情報をパラメータとして用いる場合、“好き”と“嫌い”の合計の頻度がゼロではなく、“好き”と設定された場合の頻度がゼロのときは（すなわち、すべて“嫌い”と設定されている場合）、判定式の値を“−１”とする。また、“嫌い”と設定された場合の頻度がゼロのときは（すなわち、すべて“好き”と設定されている場合）、判定式の値は“１”とする。そして、“好き”の頻度と“嫌い”の頻度がどちらもゼロでない場合は、中間の値となるように、判定式を内容を構成する。この場合、判定式は、１又は複数の計算式を用いて構成したり、プログラムにおける条件文（例えば「ｉｆ」文）等を用いて構成したりすることができる。 Further, the determination formula can be in the following format, for example. That is, the judgment formula has the highest probability that the value of the judgment formula (that is, the calculation result) is “-1” to “1”, for example, “−1” is “dislike”, and “1” is “1”. A value indicating that the probability of “like” is the highest can be output. As a variable (that is, parameter) used in the determination formula, for example, frequency information may be used, or a value (that is, a constant) set in advance according to the type and content of attribute information may be used. For example, when frequency information is used as a parameter, the total frequency of “like” and “dislike” is not zero, but when the frequency is set to “like” is zero (that is, all are set to “dislike”) The value of the judgment formula is “−1”. Further, when the frequency when “dislike” is set is zero (that is, when all “like” is set), the value of the determination formula is “1”. Then, when both the “like” frequency and the “dislike” frequency are not zero, the contents of the determination formula are configured so as to be an intermediate value. In this case, the determination formula may be configured using one or a plurality of calculation formulas, or may be configured using a conditional statement (for example, “if” statement) in the program.

次に、図２に示した有向グラフ１５１ａから、番組の好き嫌いを判定するためのルールを表す論理式を生成する手順について説明する。以下では、８個の判定ノードに対して図５に示した属性情報が対応づけられていることとして説明を行う。すなわち、判定ノード１には“ドラマ”が、判定ノード２には“サッカー”が、判定ノード３には“１３時”が、判定ノード４には“放送局Ａ”が、判定ノード５には“映画”が、判定ノード６には“俳優Ｂ”が、判定ノード７には“特番”が、そして、判定ノード８には“ニュース”が、それぞれ対応づけられている。また、各判定ノードの内容は図６に示した形式を有している。すなわち、各判定ノード１〜８には、当該ノード単体で“好き”か“嫌い”かの度合いを出力する判定式ｆ１〜ｆ８が対応づけられている。また、各判定ノード１〜８には信頼性を表すパラメータとして、この判定ノードに対応付けられている属性情報がトレーニング用の番組情報に出現した頻度に対応した値ｎ１ａ〜ｎ８ａ（“好き”の場合）とｎ１ｂ〜ｎ８ｂ（“嫌い”の場合）が設定されている。 Next, a description will be given of a procedure for generating a logical expression representing a rule for determining whether or not to like a program from the directed graph 151a shown in FIG. In the following description, it is assumed that the attribute information shown in FIG. 5 is associated with the eight determination nodes. That is, “drama” is determined for the determination node 1, “soccer” for the determination node 2, “13:00” for the determination node 3, “broadcast station A” for the determination node 4, and “Movie”, “Actor B” is associated with the decision node 6, “Special number” is associated with the decision node 7, and “News” is associated with the decision node 8. The contents of each determination node have the format shown in FIG. That is, each of the determination nodes 1 to 8 is associated with determination formulas f1 to f8 that output the degree of “like” or “dislike” of the node alone. Further, each of the determination nodes 1 to 8 has a value n1a to n8a ("like") corresponding to the frequency at which the attribute information associated with the determination node appears in the training program information as a parameter indicating reliability. ) And n1b to n8b (in the case of “dislike”) are set.

有向グラフ１５１ａからルールを抽出する際には、図７に示したように、開始ノード１〜３から、それぞれ、有向エッジ（すなわち矢印）で接続されている判定ノードをたどって各判定ノードにおける判定式による判定結果を組み合わせていく。各ルールは、開始ノードを開始点とするとともに、矢印の向きでたどることができる０、１又は複数の判定ノードのうちの任意の判定ノードを終了点とすることができる。吹き出し１５１ｃ内に、有向グラフ１５１ａから抽出される可能性のあるルールを示した。なお、ルールの抽出の際には、同じ判定ノードを２回以上含まないようにして、ルールの循環を避けている。例えば、図７で鎖線で囲って示した、開始ノード１を開始点として、判定ノード４、判定ノード１、判定ノード３、判定ノード７、及び判定ノード１をたどるルールは、同じ判定ノード１を２回含むので、抽出することができない。 When a rule is extracted from the directed graph 151a, as shown in FIG. 7, the determination at each determination node is performed by following the determination nodes connected by directed edges (that is, arrows) from the start nodes 1 to 3, respectively. Combine the judgment results by formula. Each rule can have a start node as a start point and an end point of any one of 0, 1, or a plurality of determination nodes that can be followed in the direction of the arrow. Rules that may be extracted from the directed graph 151a are shown in the balloon 151c. When extracting a rule, the same judgment node is not included twice or more to avoid rule circulation. For example, the rule that follows the determination node 4, the determination node 1, the determination node 3, the determination node 7, and the determination node 1 with the start node 1 indicated by a chain line in FIG. Since it is included twice, it cannot be extracted.

次に図８を用いて、抽出されたルールの概念図から、実際の論理式をどのように表現するかについて説明する。抽出されるルールの論理式は、“ｉｆＸ１ａｎｄＸ２ａｎｄＸ３ｔｈｅｎＹ”といった表現形式となる。このとき、Ｘｉがある判定ノードｉに対応づけられた番組の属性情報が、入力された番組情報に含まれていることを指す。Ｙは、判定ノードｉのすべての判定式ｆｉから得られる命令であり、“好き”か“嫌い”かの命令となる。ただし、このルールには、判定ノードｉが持つ“判定式の信頼度に対応する所定のパラメータ”から算出されるパラメータである、ルールの信頼度パラメータが設定される。 Next, how an actual logical expression is expressed from the conceptual diagram of the extracted rule will be described with reference to FIG. The logical expression of the extracted rule has an expression form such as “if X1 and X2 and X3 then Y”. At this time, this indicates that the attribute information of the program associated with a certain determination node i is included in the input program information. Y is an instruction obtained from all the determination expressions fi of the determination node i, and is an instruction of “like” or “dislike”. However, the rule reliability parameter, which is a parameter calculated from the “predetermined parameter corresponding to the reliability of the determination formula” of the determination node i, is set in this rule.

一例として、“放送局Ａ”と“ドラマ”と“１３時”が一つのルールとして接続されており、それぞれの判定ノード４、１及び３の判定式ｆ４、ｆ１及びｆ３を総合した結果が“好き”の確率が高かったルールを考える。例えば、判定式の総合判定を「（ｆ４＋ｆ１＋ｆ３）の値が大きいほど“好き”の確率が高い」とする。この場合に、判定式ｆ４、ｆ１、ｆ３がそれぞれ“−１”から“１”までの値を出力するものであって（ここでは“１”に近いほど“好き”の確率が高いとする）、例えば判定式ｆ４、ｆ１、ｆ３がすべて“１”に近い値であったとする。このとき、「（ｆ４＋ｆ１＋ｆ３）の値」が大きくなる（例えば所定の閾値より大きくなる）ので、ルールの命令は「“好き”」となる。この場合、開始ノード１と判定ノード４、１及び３から生成されるルールは、ある番組が“放送局Ａ”で“ドラマ”で“１３時”なら、“好き”であるという判定結果を返す論理式となる。また、ルールの信頼度パラメータは、各判定ノード４、１及び３の頻度情報ｎ４ａ、ｎ４ｂ、ｎ１ａ、ｎ１ｂ、ｎ３ａ及びｎ３ｂから設定することができる。例えば、頻度（あるいはそれを全判定ノードの最大頻度で除した値）の積算値や最大値、最小値あるいは平均値が大きいほど、信頼度が大きくなるようにして、ルールの信頼度パラメータを算出する。 As an example, “broadcasting station A”, “drama”, and “13:00” are connected as one rule, and the result obtained by combining the determination formulas f4, f1, and f3 of the respective determination nodes 4, 1, and 3 is “ Consider a rule with a high probability of “like”. For example, it is assumed that the overall determination of the determination formula is “the probability of“ like ”is higher as the value of (f4 + f1 + f3) is larger”. In this case, the determination formulas f4, f1, and f3 each output a value from “−1” to “1” (here, the closer to “1”, the higher the probability of “like”). For example, it is assumed that the determination formulas f4, f1, and f3 are all values close to “1”. At this time, since the “value of (f4 + f1 + f3)” becomes large (for example, becomes larger than a predetermined threshold value), the rule command becomes ““ like ””. In this case, the rule generated from the start node 1 and the determination nodes 4, 1 and 3 returns a determination result of “like” if a certain program is “Broadcasting station A”, “Drama” and “13:00”. It becomes a logical expression. Further, the rule reliability parameter can be set from the frequency information n4a, n4b, n1a, n1b, n3a, and n3b of the decision nodes 4, 1, and 3, respectively. For example, the reliability parameter of the rule is calculated so that the reliability increases as the integrated value, maximum value, minimum value, or average value of the frequency (or the value obtained by dividing it by the maximum frequency of all judgment nodes) increases. To do.

また、他の例として、“映画”と“俳優Ｂ”が一つのルールとして接続されており、それぞれの判定ノード５及び６の判定式ｆ５及びｆ６を総合した結果が“好き”の確率が高かったルールを考える。上記と同様に、判定式の総合判定を「（ｆ５＋ｆ６）
の値が大きいほど“好き”の確率が高い」として、判定式ｆ５、ｆ６がそれぞれ“−１”から“１”までの値を出力するものであって（ここでは“１”に近いほど“好き”の確率が高い）、例えば判定式ｆ５、ｆ６がすべて“１”に近い値であったとすると、（ｆ５＋ｆ６）が大きくなる（例えば所定の閾値より大きくなる）ので、ルールの命令は「“好き”」となる。この場合、開始ノード２と判定ノード５及び６から生成されるルールは、ある番組が“映画”で“俳優Ｂ”なら、“好き”であるという判定結果を返す論理式となる。また、このルールの信頼度パラメータは、各判定ノード５及び６の頻度情報ｎ５ａ、ｎ５ｂ、ｎ６ａ及びｎ６ｂから設定することができる。 As another example, “movie” and “actor B” are connected as one rule, and the result of combining the determination expressions f5 and f6 of the respective determination nodes 5 and 6 has a high probability of “like”. Think about the rules. Similar to the above, the overall judgment of the judgment formula is “(f5 + f6)”.
The determination formulas f5 and f6 each output a value from “−1” to “1” as the value of “I like” increases as the value of “is larger”. If the determination formulas f5 and f6 are all close to “1”, for example, (f5 + f6) becomes large (eg, becomes larger than a predetermined threshold value). I like "". In this case, the rule generated from the start node 2 and the determination nodes 5 and 6 is a logical expression that returns a determination result of “like” if a certain program is “movie” and “actor B”. Further, the reliability parameter of this rule can be set from the frequency information n5a, n5b, n6a, and n6b of each determination node 5 and 6.

さらに、他の例として、“映画”と“特番”が一つのルールとして接続されており、それぞれの判定ノード５及び７の判定式ｆ５及びｆ７を総合した結果が“嫌い”の確率が高かったルールを考える。上記と同様に、判定式の総合判定を「（ｆ５＋ｆ７）の値が大きいほど“好き”の確率が高い」として、判定式ｆ５、ｆ７がそれぞれ“−１”から“１”までの値を出力するものである（ここでは“１”に近いほど“好き”の確率が高く、“−１”に近いほど“嫌い”の確率が高い）とする。この場合には、例えば判定式ｆ５が“１”に近い値であったとしても、判定式ｆ７が“−１”に近い値であった場合、（ｆ５＋ｆ７）はあまり大きくならず（例えば所定の閾値より大きくならず）、ルールの命令は、“好き”の確率が低いとなる。ここでは“好き”の確率が低い場合を“嫌い”とすると、開始ノード３と判定ノード５及び７から生成されるルールは、ある番組が“映画”で“特番”なら、“嫌い”であるという判定結果を返す論理式となる。また、このルールの信頼度パラメータは、各判定ノード５及び７の頻度情報ｎ５ａ、ｎ５ｂ、ｎ７ａ及びｎ７ｂから設定することができる。 Furthermore, as another example, “movie” and “special number” are connected as one rule, and the result of combining the judgment expressions f5 and f7 of the respective judgment nodes 5 and 7 has a high probability of “dislike”. Think about the rules. Similarly to the above, the overall judgment of the judgment formula is set as “the probability of“ like ”is higher as the value of (f5 + f7) is larger”, and the judgment formulas f5 and f7 each output a value from “−1” to “1”. (Here, the closer to “1”, the higher the probability of “likes”, and the closer to “−1”, the higher the probability of “dislikes”). In this case, for example, even if the judgment formula f5 is a value close to “1”, if the judgment formula f7 is a value close to “−1”, (f5 + f7) is not so large (for example, a predetermined value) The rule command has a low probability of “like” (not greater than the threshold). Here, assuming that the case where the probability of “like” is low is “dislike”, the rule generated from the start node 3 and the determination nodes 5 and 7 is “dislike” if a certain program is “movie” and “special number”. This is a logical expression that returns a determination result. Further, the reliability parameter of this rule can be set from the frequency information n5a, n5b, n7a, and n7b of each determination node 5 and 7.

次に、図９から図１４を参照して、図１に示した情報処理システム１０の動作例について説明する。図９は、情報処理システム１０がユーザに対してユーザの嗜好にあった番組を推薦する際の処理の流れの一例を示したフローチャートである。また、図１０は、図９に示したコンテンツ評価（ステップＳ１０３）における処理の流れの一例を説明するためのフローチャートである。 Next, an exemplary operation of the information processing system 10 illustrated in FIG. 1 will be described with reference to FIGS. FIG. 9 is a flowchart illustrating an example of a processing flow when the information processing system 10 recommends a program that suits the user's preference to the user. FIG. 10 is a flowchart for explaining an example of the processing flow in the content evaluation (step S103) shown in FIG.

情報処理システム１０では、まず、嗜好モデル生成部１５０がトレーニング用の番組情報を用いて嗜好モデル１５１を生成する（ステップＳ１０１）。嗜好モデル生成部１５０は、例えば、トレーニング用の番組情報に含まれるすべての属性情報が、嗜好モデル１５１に含まれるように、開始ノードや判定ノードを有向グラフに適宜に追加する。また、追加した判定ノードに対応する判定ノード内容情報を作成する。有向グラフの生成の仕方については特に限定はない。本実施形態では、後述する遺伝的アルゴリズム、すなわち、淘汰処理や進化処理によって、ルールの取捨選択等が自動的に行われる。したがって有向グラフへのノードの追加は任意に行うことができる。 In the information processing system 10, first, the preference model generation unit 150 generates a preference model 151 using the program information for training (step S101). For example, the preference model generation unit 150 appropriately adds a start node and a determination node to the directed graph so that all the attribute information included in the program information for training is included in the preference model 151. Also, determination node content information corresponding to the added determination node is created. There is no particular limitation on the method of generating the directed graph. In the present embodiment, rule selection and the like are automatically performed by a genetic algorithm, which will be described later, that is, a wrinkle process and an evolution process. Therefore, a node can be arbitrarily added to the directed graph.

次に、コンテンツ受信部１１０が、番組情報を受信する（ステップＳ１０２）。次に、コンテンツ評価部１２０と嗜好モデル生成部１５０とが協調して、コンテンツ受信部１１０が受信した１又は複数の番組情報を、評価対象の未知の番組情報として、ユーザの嗜好を推定するための処理である、コンテンツ評価処理を行う（ステップＳ１０３）。このコンテンツ評価処理の内容については後述する。 Next, the content receiving unit 110 receives program information (step S102). Next, the content evaluation unit 120 and the preference model generation unit 150 cooperate with each other to estimate the user's preference using one or more pieces of program information received by the content reception unit 110 as unknown program information to be evaluated. The content evaluation process is performed (step S103). Details of the content evaluation process will be described later.

次に、推薦結果表示選択部１３０が、コンテンツ評価部１２０が“好き”と評価した番組情報を、推薦結果として表示する（ステップＳ１０４）。ここで、ユーザが所定の操作を行った場合（ステップＳ１０５で「ｙｅｓ」の場合）、トレーニング情報設定部１６０が、ユーザの操作に応じて、推薦結果として表示された番組情報のうち、指定された番組情報に対して“好き”又は“嫌い”の設定を行って、その設定結果を指定された番組情報とともに所定の記憶部に一時的に記憶する（ステップＳ１０６）。 Next, the recommendation result display selection unit 130 displays the program information evaluated as “like” by the content evaluation unit 120 as a recommendation result (step S104). Here, when the user performs a predetermined operation (in the case of “yes” in step S105), the training information setting unit 160 is designated in the program information displayed as the recommendation result according to the user's operation. The program information is set to “like” or “dislike”, and the setting result is temporarily stored in a predetermined storage unit together with the designated program information (step S106).

次に、ユーザが、推薦結果として示されたいずれかのコンテンツを選択すると（ステップＳ１０７で「ｙｅｓ」の場合）、推薦結果表示選択部１３０がコンテンツ受信部１１０に対して選択されたコンテンツの受信と表示の指示を出力する（ステップＳ１０８）。次に、推薦結果表示選択部１３０が選択されたコンテンツの番組情報を、コンテンツ閲覧ログ生成部１４０に対して出力し、コンテンツ閲覧ログ生成部１４０が入力された番組情報を記録する（ステップＳ１０９）。 Next, when the user selects any content indicated as the recommendation result (in the case of “yes” in step S107), the recommendation result display selection unit 130 receives the content selected by the content reception unit 110. And a display instruction are output (step S108). Next, the program information of the content selected by the recommendation result display selection unit 130 is output to the content browsing log generation unit 140, and the input program information is recorded by the content browsing log generation unit 140 (step S109). .

次に、嗜好モデル生成部１５０が、ステップＳ１０６でトレーニング情報設定部１６０が記憶した番組情報や、ステップＳ１０９でコンテンツ閲覧ログ生成部１４０が記録した番組情報に基づいて、新規の属性情報に対応するノードを嗜好モデル１５１に対して追加したり、既存の判定ノードの出現頻度を更新したりする処理を行うことで、嗜好モデル１５１を更新する（ステップＳ１１０）。以降、例えば所定のタイミングで、ステップＳ１０２からの処理が再度実行される。 Next, the preference model generation unit 150 responds to the new attribute information based on the program information stored by the training information setting unit 160 in step S106 and the program information recorded by the content browsing log generation unit 140 in step S109. The preference model 151 is updated by performing a process of adding a node to the preference model 151 or updating the appearance frequency of an existing determination node (step S110). Thereafter, the processing from step S102 is executed again at a predetermined timing, for example.

次に、図１０を参照して、ステップＳ１０３でのコンテンツ評価処理について説明する。まず、コンテンツ評価部１２０が、図７及び図８を参照して上記で説明したようにして、嗜好モデル１５１から“好き”を命令とする（すなわち“好き”と判定する）ルールの集合と“嫌い”を命令とする（すなわち“嫌い”と判定する）ルールの集合とを抽出する（ステップＳ２０１）。その際、コンテンツ評価部１２０は、嗜好モデル１５１から新たに生成したルールと、そのルールを生成する際に使用したノードの組み合わせとを表す情報を、ルール生成に関する履歴情報として保持する。そして、コンテンツ評価部１２０は、この履歴情報にすでに記録されていたノードの組み合わせ以外のノードの組み合わせが検知された場合に、新たなルールを抽出する処理を行う。したがって、ここでは、すでに抽出済みのノードの組み合わせについては、ルールの生成は行われない。 Next, the content evaluation processing in step S103 will be described with reference to FIG. First, as described above with reference to FIGS. 7 and 8, the content evaluation unit 120 sets a set of rules using “like” as a command from the preference model 151 (ie, determines “like”) and “ A set of rules having “dislike” as a command (that is, determining “dislike”) is extracted (step S201). At that time, the content evaluation unit 120 holds information indicating the rule newly generated from the preference model 151 and the combination of nodes used when generating the rule as history information regarding rule generation. Then, when a combination of nodes other than the combination of nodes already recorded in the history information is detected, the content evaluation unit 120 performs a process of extracting a new rule. Therefore, no rule is generated for a combination of nodes that have already been extracted.

次に、コンテンツ評価部１２０は、トレーニング用番組情報を用いてルールに対して淘汰処理を実行する（ステップＳ２０２）。本実施形態では、図７に示したような、有向グラフ１５１におけるすべてのノードの組み合わせについてルールの生成が可能であるとしている。そのため、生成したルールをすべて記憶していくとすると、大量のルールを保持することとなる。そこで、本実施形態では、次のような淘汰処理によって、進化計算の過程で大量に生成され続けるルールから、嗜好の推定において有効であるとはいえないルールを淘汰している。 Next, the content evaluation unit 120 executes a wrinkle process on the rule using the training program information (step S202). In the present embodiment, it is assumed that a rule can be generated for all combinations of nodes in the directed graph 151 as shown in FIG. Therefore, if all the generated rules are stored, a large number of rules are held. Therefore, in this embodiment, a rule that is not effective in estimating the preference is hesitant from a rule that continues to be generated in a large amount in the course of evolutionary computation by the following trap process.

図１１に淘汰処理の流れを模式図を用いて示した。本実施形態では、後述するステップＳ２０４において嗜好モデル１５１に対して遺伝的アルゴリズムで用いられているような進化処理を実行する。その進化処理の前後では嗜好モデル１５１が異なる世代のモデルである、と定義することができる。また、図１１に示したように、ある世代における嗜好モデル１５１から抽出されたルールの集合３０２は、“好き”という判定を下すルール群３０２ａと“嫌い”という判定を下すルール群３０２ｂとにグループ化できる。コンテンツ評価部１２０は、これらのルールを集合として一定数保持しておく。このときコンテンツ評価部１２０は、以下の手順で、ルールの淘汰処理を実行する。 FIG. 11 shows a flow of the wrinkle process using a schematic diagram. In the present embodiment, evolution processing as used in the genetic algorithm is executed on the preference model 151 in step S204 described later. Before and after the evolution process, the preference model 151 can be defined as a model of a different generation. Further, as shown in FIG. 11, the rule set 302 extracted from the preference model 151 in a certain generation is grouped into a rule group 302a that makes a determination of “like” and a rule group 302b that makes a determination of “dislike”. Can be The content evaluation unit 120 holds a certain number of these rules as a set. At this time, the content evaluation unit 120 executes a rule habit process according to the following procedure.

（ステップＳ３０１）まず、コンテンツ評価部１２０は、トレーニング用の番組情報３０１を入力とすると、番組情報３０１を各ルールにあてはめることで、すべてのルールにおいて“好き”か“嫌い”かの判定結果を算出する。 (Step S301) First, when the program information 301 for training is input, the content evaluation unit 120 applies the program information 301 to each rule, thereby obtaining a determination result of “like” or “dislike” in all the rules. calculate.

（ステップＳ３０２）次に、コンテンツ評価部１２０は、すべての判定結果を統計的に処理し（すなわち、例えば判定結果の合計値にもとづき）、その番組が“好き”か“嫌い”かの推薦結果３０３を最終的に決定する。このとき、各判定結果に対しては、そのルールの信頼度を掛け合わせることで、総合的な判定結果への影響度を正規化することができる。 (Step S302) Next, the content evaluation unit 120 statistically processes all the determination results (that is, based on the total value of the determination results, for example), and recommends whether the program is “like” or “dislike”. Finally, 303 is determined. At this time, the degree of influence on the overall determination result can be normalized by multiplying each determination result by the reliability of the rule.

（ステップＳ３０３）次に、コンテンツ評価部１２０は、推薦結果３０３とトレーニング用の番組情報にあらかじめ付与されている正解値３０４を比較して、一致するルールを良いルール、一致しないルールを悪いルールとして順位付けしていく。 (Step S303) Next, the content evaluation unit 120 compares the recommendation result 303 and the correct answer value 304 previously given to the program information for training, and determines that the rule that matches is a good rule and the rule that does not match is a bad rule. I will rank.

以上の操作（ステップＳ３０１〜３０３）を多数のトレーニング用番組情報に対して繰り返すことにより、下位に順位付けされたルールを淘汰し、上位に順位付けされたルールだけを残していく。 By repeating the above operation (steps S301 to S303) for a large number of pieces of training program information, the rules ranked lower are deceived, and only the rules ranked higher are left.

そして、コンテンツ評価部１２０は、このようなルールの淘汰処理（すなわちステップＳ２０２）を、ステップＳ２０４で実行する進化処理（あるいは進化計算）における世代交代処理を進めていく中で再帰的に進めることにより、高い推薦精度に寄与するルールだけを残すことができる。 Then, the content evaluation unit 120 recursively advances the rule selection process (that is, step S202) while performing the generation change process in the evolution process (or evolution calculation) executed in step S204. Only rules that contribute to high recommendation accuracy can be left.

ステップＳ２０２で複数のトレーニング用番組情報を用いて淘汰処理を実行した後は、ステップＳ２０３で、コンテンツ評価部１２０が、新たに出現したルールの個数が所定の閾値以下であるか否かを判定する。ここで、ルールが新たに出現する場合とは、次のステップＳ２０４での進化処理を実行した場合に発生する現象である。また、進化計算と淘汰処理が十分に進むと、図１１に示されるような淘汰処理をしても、推薦に寄与する新たなルールがあまり出現しなくなる。本システムでは、新しく出現したルールの個数が所定の閾値を下回る場合、学習処理が終わったと判定して進化計算を終了する。この構成によれば、後述するステップＳ２０５での推定処理では、未知の番組情報を入力として、学習が終わったルール群によってユーザの嗜好を判定するため、高い精度が期待される。 After executing the selection process using a plurality of training program information in step S202, in step S203, the content evaluation unit 120 determines whether or not the number of newly appearing rules is equal to or less than a predetermined threshold. . Here, a case where a rule newly appears is a phenomenon that occurs when the evolution process in the next step S204 is executed. Further, if the evolutionary calculation and the wrinkle processing sufficiently proceed, new rules that contribute to the recommendation do not appear much even if the wrinkle processing as shown in FIG. 11 is performed. In this system, when the number of newly appearing rules falls below a predetermined threshold value, it is determined that the learning process has ended, and the evolution calculation ends. According to this configuration, in the estimation process in step S205 described later, unknown program information is input, and the user's preference is determined based on the learned rule group. Therefore, high accuracy is expected.

ステップＳ２０４における、本実施形態の遺伝的アルゴリズムを用いた進化計算では、一般的な遺伝的アルゴリズムと同様に交叉処理や突然変異処理が行われる。これらの交叉処理や突然変異処理は、嗜好モデル生成部１５０によって行うことができる。以下、この２つの処理について説明する。 In the evolution calculation using the genetic algorithm of this embodiment in step S204, crossover processing and mutation processing are performed in the same manner as a general genetic algorithm. These crossover processing and mutation processing can be performed by the preference model generation unit 150. Hereinafter, these two processes will be described.

図１２に示すように、本実施形態の交叉処理では、両親となる２つの有効グラフ（すなわち遺伝的アルゴリズムにおける遺伝子）について、次のノードへの接続情報を保持した同じ判定ノード同士が交換される。結果として、新しい有効グラフ（遺伝子）は、２つの遺伝子の有向グラフを組み合わせた新しいグラフ構造となる。 As shown in FIG. 12, in the crossover process of the present embodiment, the same determination nodes holding connection information to the next node are exchanged for two effective graphs (that is, genes in the genetic algorithm) serving as parents. . As a result, the new effective graph (gene) has a new graph structure that combines the directed graphs of the two genes.

また、図１３（ａ）、（ｂ）に示すように、本実施形態の突然変異処理では、判定ノードの番号をランダムに変更するか、次の接続先を表すグラフの構成情報（図３（ｂ））をランダムに変更する。 Further, as shown in FIGS. 13A and 13B, in the mutation process of the present embodiment, the determination node number is changed at random or the configuration information of the graph representing the next connection destination (FIG. 3 ( b)) is changed randomly.

ステップＳ２０１〜Ｓ２０４では、ルールの集合の抽出と、淘汰処理及び進化処理を繰り返し実行することで、高い推薦精度に寄与するルールだけを残すことができる。そして、新しく出現したルールの個数が所定の閾値を下回る場合、ステップＳ２０３の判定結果が「ｎｏ」となる。 In steps S201 to S204, it is possible to leave only a rule that contributes to high recommendation accuracy by repeatedly extracting a set of rules and repeatedly executing the selection process and the evolution process. When the number of newly appearing rules is below a predetermined threshold, the determination result in step S203 is “no”.

次に、ステップＳ２０５では、コンテンツ評価部１２０が、嗜好モデル１５１に基づいて生成された各ルールを、入力された番組情報に対してあてはめることでユーザの嗜好を推定する。すなわち、コンテンツ評価部１２０は、図１４に示したように、未知の番組情報４０１を、ステップＳ２０１〜Ｓ２０４で選択された“好き”と判定を下すルール群４０２ａと“嫌い”と判定を下すルール群４０２ｂとを含むルールの集合４０２の各ルールにあてはめる（ステップＳ４０１）。次に、コンテンツ評価部１２０は、各ルールの判定結果を統計的に処理し、好き嫌いを判定し、推薦結果４０３（すなわち当該番組情報を推薦すべきか否かを示す情報）を決定する（ステップＳ４０２）。 Next, in step S205, the content evaluation unit 120 estimates the user's preference by applying each rule generated based on the preference model 151 to the input program information. That is, as shown in FIG. 14, the content evaluation unit 120 determines that the unknown program information 401 is determined to be “dislike” and the rule group 402 a that is determined as “like” selected in steps S <b> 201 to S <b> 204. The rule 402 is applied to each rule of the rule set 402 including the group 402b (step S401). Next, the content evaluation unit 120 statistically processes the determination results of each rule, determines likes and dislikes, and determines a recommendation result 403 (that is, information indicating whether or not the program information should be recommended) (step S402). ).

次に、図１０のステップＳ２０６では、コンテンツ評価部１２０が、コンテンツ評価処理を実行すべきすべての番組情報に対する評価が終了したか否かを判定する。評価すべき他の番組情報が残されている場合には、次の番組情報を対象としてステップＳ２０５の処理を再度実行する（ステップＳ２０６で「ｎｏ」からステップＳ２０５）。評価すべき他の番組情報が残されていない場合には、処理を終了する（ステップＳ２０６で「ｙｅｓ」の場合）。 Next, in step S206 of FIG. 10, the content evaluation unit 120 determines whether or not evaluation for all program information for which content evaluation processing should be executed has been completed. If other program information to be evaluated remains, the process of step S205 is executed again for the next program information (from “no” in step S206 to step S205). If there is no remaining program information to be evaluated, the process ends (in the case of “yes” in step S206).

以上のように、本実施形態によれば、各判定ノードに対応づけられた情報と、他の判定ノードに対応づけられた情報とに親子関係は必要としない。よって、嗜好モデルに対する判定ノードの追加や削除の処理を容易に行うことができる。また、本実施形態の嗜好モデルを生成したり、更新したりするための情報は、番組情報等のコンテンツ情報から取得することができるので、トラフィック量の増加や、オフライン環境での使用上の制約といった課題も生じない。 As described above, according to the present embodiment, there is no need for a parent-child relationship between information associated with each determination node and information associated with another determination node. Therefore, it is possible to easily add or delete determination nodes from the preference model. In addition, since the information for generating or updating the preference model according to the present embodiment can be acquired from content information such as program information, an increase in traffic volume and restrictions on use in an offline environment Such a problem does not occur.

また、本実施形態により、コンテンツに関するユーザの嗜好を数式の集合で表現できるようになり、これらの数式集合に未知のコンテンツを与えることで、好みを判定できるようになる。つまり、コンテンツに関するユーザの嗜好を数式としてモデル化できるため、結果の可視化性に優れている。 Further, according to the present embodiment, the user's preference regarding the contents can be expressed by a set of mathematical expressions, and the preference can be determined by giving unknown contents to these mathematical expression sets. In other words, the user's preference regarding the content can be modeled as a mathematical formula, so that the result is highly visible.

また、本実施形態のコンテンツ推薦システムは、大量の入力情報からユーザの嗜好を表現するために必要な情報を自動的に抽出しルール化できるため、大規模なコンテンツデータでも良好な推薦精度が期待される。また、遺伝的アルゴリズムを用いたコンテンツ推薦システムは、完全なオフライン環境でも動作可能であるため、ネットワークなど外部環境に接続できない機器においても有用である。 In addition, since the content recommendation system of this embodiment can automatically extract information necessary for expressing user preferences from a large amount of input information and create rules, it can be expected to have good recommendation accuracy even for large-scale content data. Is done. In addition, since the content recommendation system using the genetic algorithm can operate even in a completely offline environment, it is also useful for devices that cannot be connected to an external environment such as a network.

なお、本発明が特長とする点については例えば次のようにまとめることができる。本発明は、ＧＮＰ（ＧｅｎｅｔｉｃＮｅｔｗｏｒｋＰｒｏｇｒａｍｍｉｎｇ；遺伝的ネットワークプログラミング）と呼ばれるノードをネットワーク状に接続することでプログラムを自動生成する計算手法の一つを用いたものである、ということができる。ここで、以下の特長１と特長２は、推薦システムにＧＮＰアルゴリズムを適用する場合の独自の工夫点であり、特長３は推薦システム自体の背景とする技術との差異である。 The features of the present invention can be summarized as follows, for example. It can be said that the present invention uses one of calculation methods for automatically generating a program by connecting nodes called GNP (Genetic Network Programming) in a network. Here, Feature 1 and Feature 2 below are unique contrivances when applying the GNP algorithm to the recommendation system, and Feature 3 is a difference from the technology behind the recommendation system itself.

特長１：ＧＮＰアルゴリズムにおける各計算において、判定ノードに信頼度情報を入れた点。
ＧＮＰによる進化計算は複雑なルールを計算可能であるという利点のほかに、計算コストが膨大であるという欠点がある。したがってＧＮＰアルゴリズムを推薦システムに応用するためには、大量の判定ノードから有意なルールを、効率よく探索する方法が重要となる。本発明では、信頼度情報の導入により、トレーニングデータ（すなわちトレーニング用番組情報）で多く出現している属性情報を含む判定ノードが優先的にルールの一部に採用されやすいように工夫している。この信頼度の導入により、ルールの収束時間が大幅に高まり、ＧＮＰアルゴリズムが推薦システムでも現実時間で駆動できるようになった。 Feature 1: The reliability information is entered in the decision node in each calculation in the GNP algorithm.
The evolution calculation by GNP has a disadvantage that the calculation cost is enormous in addition to the advantage that a complicated rule can be calculated. Therefore, in order to apply the GNP algorithm to the recommendation system, a method for efficiently searching for significant rules from a large number of decision nodes is important. In the present invention, by introducing reliability information, a determination node including attribute information frequently appearing in training data (that is, training program information) is devised so as to be easily adopted as a part of the rule. . With the introduction of this reliability, the convergence time of the rule is greatly increased, and the GNP algorithm can be driven in real time even in the recommendation system.

特長２：推薦処理においても、判定ノードの信頼度情報を使って、ルールの影響度に正規化を加えた点。
本発明のシステムにより採用されたルール群を用いて推薦処理をする場合、それらすべてのルールによる判定結果を集計して、“好き”と“嫌い”の判定結果の多数決により決める方法が考えられる。実際に、提案システムもこの方式を採用している。しかし、ルールによっては、トレーニングデータの情報とは全く関連がなくても、偶然うまく判別できただけのものもある。このようなルールは、未知の番組情報を推薦するときに判定処理を失敗する原因となりうる。この問題を回避するため、特長１において導入した信頼度情報を、ルールを用いた“好き”と“嫌い”の判別処理においても、利用できるように工夫した。信頼度情報は、トレーニングデータにおける属性情報の出現頻度を表す。したがって、信頼度をもとに算出されたルールの信頼度は、いかにトレーニングデータと相関があるかを表すことになる。推薦処理において、あるルールが与える影響力をルールの信頼性で正規化することで、より高い推薦精度が確認された。 Feature 2: Even in the recommendation process, the degree of influence of the rule is normalized by using the reliability information of the judgment node.
When the recommendation process is performed using the rule group adopted by the system of the present invention, a method is conceivable in which the determination results based on all the rules are totaled and determined by majority decision of the “like” and “dislike” determination results. In fact, the proposed system also uses this method. However, depending on the rules, there are some that have been successfully discriminated accidentally even though they are not related to the training data information. Such a rule may cause the determination process to fail when recommending unknown program information. In order to avoid this problem, the reliability information introduced in Feature 1 has been devised so that it can be used in the “like” and “dislike” discrimination processing using rules. The reliability information represents the appearance frequency of attribute information in the training data. Therefore, the reliability of the rule calculated based on the reliability represents how correlated with the training data. In the recommendation process, a higher recommendation accuracy was confirmed by normalizing the influence of a rule with the reliability of the rule.

特長３：推薦システムにおいて、嗜好度（好き嫌い）の分解能が高い点。
ＧＮＰを用いた推薦システムそのものが、他の推薦システムと違って優位な点について述べる。その他の推薦システムで採用されているアルゴリズムの多くが、２クラス分類器であり、“好き”か“嫌い”にしか分類できない。それに比べ論理式で判定結果を出力する本手法は、Ｎクラスに分類できる。実際に本発明のシステムでは、“すごく好き”、”すごく嫌い”、“まあまあ好き”、“まあまあ嫌い”といった複数分類で結果を出力することができる。したがって、アプリケーションの要求によっては、“すごく好き”に加えて“まあまあ好き”といった意外性のある推薦結果を出力することができる。 Feature 3: The recommendation system has high resolution of preference (likes and dislikes).
The following describes the advantages of the recommendation system itself using GNP, unlike other recommendation systems. Many of the algorithms used in other recommendation systems are two-class classifiers that can only be classified as “like” or “dislike”. In contrast, this method of outputting the determination result by a logical expression can be classified into N classes. Actually, the system of the present invention can output the results in a plurality of categories such as “I like it very much”, “I like it a lot”, “I like it so much”, and “I like it so much”. Therefore, depending on the request of the application, an unexpected recommendation result such as “somewhat like” can be output in addition to “very like”.

１０情報処理装置
１００コンテンツ表示機器
１２０コンテンツ評価部
１３０推薦結果表示選択部
１４０コンテンツ閲覧ログ生成部
１５０嗜好モデル生成部
１５１嗜好モデル
１５１ａ有向グラフ
１５１ａ１開始ノード
１５１ａ２判定ノード
１５１ａ３エッジ
１５１ｂ判定ノード内容情報
２００コンテンツ制御機器
２１０コンテンツ制御部 DESCRIPTION OF SYMBOLS 10 Information processing apparatus 100 Content display apparatus 120 Content evaluation part 130 Recommendation result display selection part 140 Content browsing log production | generation part 150 Preference model production | generation part 151 Preference model 151a Directed graph 151a1 Start node 151a2 Judgment node 151a3 Edge 151b Judgment node content information 200 Content control Device 210 Content control unit

Claims

Represented by a directed graph composed of a plurality of nodes and a plurality of edges connecting the plurality of nodes with directionality,
The plurality of nodes are start nodes or decision nodes;
The determination node determines one attribute information of the plurality of attribute information included in the content information representing the attribute of the content using the plurality of attribute information, and a user's preference degree related to the determination node. Is associated with information representing a judgment formula for
The start node makes a predetermined determination regarding user preference based on one or more determination results based on one or more of the determination formulas associated with one or more of the determination nodes following the start node. A starting point of a program for generating one or a plurality of rules for connection with connection information to one or a plurality of the determination nodes,
A storage unit storing a preference model having a data structure;
An information processing apparatus comprising: an estimation unit that inputs the content information and estimates one's preference by applying one or more rules generated based on the preference model to the content information.

The determination node is further associated with a predetermined parameter for representing the reliability of the determination formula associated with the determination node.
The rule generated based on the preference model includes the one or more representing the reliability of the one or more determination formulas obtained from the one or more determination results used as a reference when generating the rule Is a value obtained using a predetermined parameter, and a value representing the reliability of the rule is set,
The said estimation part utilizes the value showing each said reliability set to each said rule, when applying the said 1 or several rule with respect to the said content information. Information processing device.

The preference model is generated or updated based on a plurality of the content information,
The attribute information associated with the determination node appears in the plurality of content information used as a reference when the predetermined parameter associated with the determination node generates or updates the preference model. The information processing apparatus according to claim 2, wherein the information processing apparatus corresponds to a frequency.

The information according to any one of claims 1 to 3, wherein the estimation unit estimates the user's preference while performing an evolution process on the preference model using a genetic algorithm. Processing equipment.

Represented by a directed graph composed of a plurality of nodes and a plurality of edges connecting the plurality of nodes with directionality,
The plurality of nodes are start nodes or decision nodes;
The determination node determines one attribute information of the plurality of attribute information included in the content information representing the attribute of the content using the plurality of attribute information, and a user's preference degree related to the determination node. Is associated with information representing a judgment formula for
The start node makes a predetermined determination regarding user preference based on one or more determination results based on one or more of the determination formulas associated with one or more of the determination nodes following the start node. A starting point of a program for generating one or a plurality of rules for connection with connection information to one or a plurality of the determination nodes,
Storing a preference model having a data structure in a predetermined storage unit;
Information processing characterized by inputting the content information by an estimation unit and estimating the user's preference by applying one or more rules generated based on the preference model to the content information Method.

A data structure for representing a user preference model,
Represented by a directed graph composed of a plurality of nodes and a plurality of edges connecting the plurality of nodes with directionality,
The plurality of nodes are start nodes or decision nodes;
The determination node determines one attribute information of the plurality of attribute information included in the content information representing the attribute of the content using the plurality of attribute information, and a user's preference degree related to the determination node. Is associated with information representing a judgment formula for
The start node makes a predetermined determination regarding user preference based on one or more determination results based on one or more of the determination formulas associated with one or more of the determination nodes following the start node. A starting point of a program for generating one or a plurality of rules for connection with connection information to one or a plurality of the determination nodes,
A data structure characterized by that.

The data structure according to claim 6, wherein the determination node is further associated with a predetermined parameter for representing the reliability of the determination formula associated with the determination node.