JP6069245B2

JP6069245B2 - Information processing apparatus, information processing apparatus control method, and program

Info

Publication number: JP6069245B2
Application number: JP2014055093A
Authority: JP
Inventors: 廣中地; 中田　和宏; 和宏中田
Original assignee: NEC Personal Computers Ltd
Current assignee: NEC Personal Computers Ltd
Priority date: 2014-03-18
Filing date: 2014-03-18
Publication date: 2017-02-01
Anticipated expiration: 2034-03-18
Also published as: JP2015177525A

Description

本発明は、情報処理装置、情報処理装置の制御方法、及びプログラムに関する。 The present invention relates to an information processing apparatus, a control method for the information processing apparatus, and a program.

近年、番組のジャンル等の情報を含む番組メタデータに基づいて、ユーザに番組を推薦するシステムやサービスが提案されている。これは、ユーザの番組視聴履歴を元に番組に対するユーザの嗜好を学習し、その学習結果に基づいてユーザの嗜好に合った番組を推薦するものである。 In recent years, systems and services that recommend programs to users based on program metadata including information such as program genres have been proposed. This learns the user's preference for the program based on the user's program viewing history, and recommends a program that matches the user's preference based on the learning result.

特許文献１には、第１計算部で計算された視聴履歴を参照し電子番組ガイド（ＥＰＧ：Electronic Program Guide）から抽出されたトピックに対するユーザの嗜好度合いを示す数値と、第２計算部で計算された電子番組ガイド（ＥＰＧ）と視聴履歴とに基づいてユーザの番組に対する嗜好度合いを示す数値とを用いて、ユーザに推薦するトピックとそれに対応する推薦番組とをユーザに提示する番組推薦装置が開示されている。 In Patent Document 1, a numerical value indicating a user's preference degree for a topic extracted from an electronic program guide (EPG) with reference to the viewing history calculated by the first calculation unit, and a calculation by the second calculation unit A program recommendation device for presenting a user with a topic recommended to the user and a recommended program corresponding to the topic using a numerical value indicating the degree of preference of the user based on the electronic program guide (EPG) and the viewing history. It is disclosed.

非特許文献１には、バラエティ番組や音楽番組だったら、ユーザが好んでいると思われる出演者によって選ぶとか、旅の番組だったら登場する地方で分析するとか、番組のジャンルだけでは分からない部分もカバーするようなテレビ番組のレコメンデーションのための技術が開示されている。 Non-Patent Document 1 shows that if it is a variety program or music program, it is selected by the performer that the user likes, or if it is a travel program, it is analyzed in the region where it appears, or the part that is not known only by the genre of the program A technique for recommending a television program that covers the above is also disclosed.

国際公開第２０１１／０６７８０８号International Publication No. 2011/067808

マイナビニュースニューストップエンタープライズネットトレンド、“インタビューＰＣならではの新ＴＶ生活を提案するソニー（１）−ｉＥＰＧでの自動番組推薦”、［online］、平成１５年１０月、［平成２６年１月２７日検索］、インターネット〈URL：http://news.mynavi.jp/news/2003/10/14/06.html〉Mynavi News News Top Enterprise Net Trends, “Interview: Proposing a New TV Life Unique to PCs (1) – Automatic Program Recommendation on iEPG”, [online], October 2003, [January 27, 2014 Search], Internet <URL: http://news.mynavi.jp/news/2003/10/14/06.html>

しかしながら、上記従来の技術では、ユーザ自身が認める嗜好（興味度）と、ユーザの視聴履歴を元に学習した嗜好（興味度）との双方を反映させた状態で番組をお勧めすることは行われていなかった。そのため、例えば、サッカー中継の番組説明文には、お気に入りのチーム名やそのチームに所属する選手名が含まれることがある。そうすると、当該チームに所属する選手のファンであっても、その選手が出演しているバラエティ番組やトーク番組は、ユーザの視聴履歴を元に学習した嗜好（興味度）には属しないため、たとえユーザ自身が認める嗜好（興味度）に属したとしてもユーザに対して推薦されないという問題がある。 However, in the above conventional technique, it is not recommended to recommend a program in a state that reflects both the preference (interest) recognized by the user and the preference (interest) learned based on the user's viewing history. It wasn't. Therefore, for example, a program description for a soccer broadcast may include a favorite team name or a player name belonging to the team. Then, even if it is a fan of a player who belongs to the team, the variety program or talk program in which the player appears does not belong to the preference (degree of interest) learned based on the user's viewing history. There is a problem that even if it belongs to the preference (degree of interest) recognized by the user, it is not recommended for the user.

また、上記特許文献１に記載された番組推薦装置では、何れの計算部においても、ユーザの視聴履歴を参照することによりユーザの嗜好度合いの高いトピックに対応する番組を推薦しているため、ユーザの嗜好を自動的に学習した興味度に対応することはできても、ユーザ自身が認める嗜好や、ユーザがマニュアルで設定する興味度に対して対応できないという問題がある。 Further, in the program recommendation device described in Patent Document 1, the user recommends a program corresponding to a topic having a high degree of user preference by referring to the user's viewing history in any calculation unit. Although the user's preference can be handled with the automatically learned interest degree, there is a problem that the user's own preference and the user's manually set interest degree cannot be handled.

また、非特許文献１に記載された技術では、ユーザに対して特にお勧めする番組をピンポイントで抽出することはできるが、レコメンデーションのやり方に融通性がなく、ユーザの嗜好、興味度の変動に対しフレキシブルに対応したお勧め番組を提示することができないという問題がある。 In addition, with the technology described in Non-Patent Document 1, it is possible to pinpoint a program that is particularly recommended for users, but the method of recommendation is not flexible, and the user's preference and interest level There is a problem that it is not possible to present a recommended program that flexibly responds to fluctuations.

そこで本発明は、上記従来の問題点に鑑みてなされたもので、コンテンツのジャンルに対する興味度とコンテンツの文字列（ワード、単語）に対する興味度とを合成することにより、ユーザの嗜好にフレキシブルに対応したコンテンツを推薦することが可能な情報処理装置、情報処理装置の制御方法、及びプログラムを提供することを目的とする。 Therefore, the present invention has been made in view of the above-described conventional problems, and by combining the degree of interest with respect to the content genre and the degree of interest with respect to the character string (word, word) of the content, it is possible to flexibly meet the user's preference. An object of the present invention is to provide an information processing apparatus capable of recommending corresponding content, a method for controlling the information processing apparatus, and a program.

上記課題を解決するため、請求項１に記載の本発明における情報処理装置は、コンテンツの内容に基づく分類であるジャンル毎に予めジャンル興味度を設定するジャンル興味度設定手段と、ユーザの視聴履歴を分析することにより得られた文字列に対する前記ユーザの興味度を推定する文字列興味度推定手段と、前記文字列興味度推定手段により推定された前記ユーザの興味度を、前記ジャンル興味度設定手段により設定された前記ジャンル興味度を用いて変換して得られたお勧め度に対し、前記ジャンル興味度が設定されたコンテンツがすべてのコンテンツに対して如何なる割合でお勧めされるかを決定するための閾値を設定する閾値設定手段と、を含むことを特徴とする。 In order to solve the above-described problem, the information processing apparatus according to the first aspect of the present invention includes a genre interest degree setting unit that sets a genre interest degree in advance for each genre, which is a classification based on the content, and a user's viewing history. a string interestingness estimation means for estimating the degree of interest of the user for the character string obtained by analyzing a degree of interest of the user estimated by the character string interestingness estimation means, wherein the genre degree of interest set Decide what proportion of content with the genre interest level recommended for all content with respect to the recommendation level obtained by conversion using the genre interest level set by means And a threshold value setting means for setting a threshold value for doing so.

また、請求項２に記載の本発明における情報処理装置は、請求項１に記載の情報処理装置において、前記文字列に対する前記ユーザの興味度は、前記ユーザの嗜好を反映した視聴履歴に基づいて機械学習により作成された文字列に対する興味度であることを特徴とする。 An information processing apparatus according to a second aspect of the present invention is the information processing apparatus according to the first aspect, wherein the degree of interest of the user with respect to the character string is based on a viewing history reflecting the user's preference. wherein the interest degree der Rukoto for string produced by machine learning.

さらに、請求項３に記載の本発明における情報処理装置は、請求項１又は２に記載の情報処理装置において、前記文字列に対する前記ユーザの興味度は、前記文字列に対する興味度と前記文字列の順位との関係のうち、少なくとも非線形領域を正規化することにより推定されることを特徴とする。 Furthermore, the information processing apparatus according to the present invention described in claim 3 is the information processing apparatus according to claim 1 or 2, wherein the user's degree of interest in the character string includes an interest degree in the character string and the character string. It is estimated that it is estimated by normalizing at least a non-linear area | region among the relationship with this order | rank .

また、請求項４に記載の本発明における情報処理装置は、請求項１から３の何れか１項に記載の情報処理装置において、前記得られたお勧め度に対し、前記閾値設定手段により設定された閾値に基づいて、所定のジャンル興味度が設定されたジャンルに属するコンテンツの、全ジャンルのコンテンツに占めるお勧め割合を推定する手段をさらに含むことを特徴とする。 An information processing apparatus according to a fourth aspect of the present invention is the information processing apparatus according to any one of the first to third aspects , wherein the threshold setting means sets the obtained recommendation level. The method further includes means for estimating a recommended ratio of content belonging to a genre for which a predetermined genre interest degree is set based on the set threshold to the content of all genres .

そして、上記課題を解決するために、請求項５に記載の本発明における情報処理装置の制御方法は、コンテンツの内容に基づく分類であるジャンル毎に予めジャンル興味度を設定するジャンル興味度設定工程と、ユーザの視聴履歴を分析することにより得られた文字列に対する前記ユーザの興味度を推定する文字列興味度推定工程と、前記文字列興味度推定工程により推定された前記ユーザの興味度を、前記ジャンル興味度設定工程により設定された前記ジャンル興味度を用いて変換して得られたお勧め度に対し、前記ジャンル興味度が設定されたコンテンツがすべてのコンテンツに対して如何なる割合でお勧めされるかを決定するための閾値を設定する閾値設定工程と、を含むことを特徴とする。 And in order to solve the said subject, the control method of the information processing apparatus in this invention of Claim 5 sets the genre interest degree beforehand for every genre which is a classification | category based on the content content, The genre interest degree setting process When a string interestingness estimating step of estimating the degree of interest of the user with respect to the resulting string by analyzing the viewing history of the user, the interest degree of the user estimated by the character string interestingness estimating step The content set with the genre interest level is in any ratio with respect to the recommended level obtained by converting the genre interest level set in the genre interest level setting step . And a threshold value setting step for setting a threshold value for determining whether to be recommended.

また、上記課題を解決するために、請求項６に記載の本発明におけるプログラムは、情報処理装置のコンピュータに、コンテンツの内容に基づく分類であるジャンル毎に予めジャンル興味度を設定するジャンル興味度設定処理と、ユーザの視聴履歴を分析することにより得られた文字列に対する前記ユーザの興味度を推定する文字列興味度推定処理と、前記文字列興味度推定処理により推定された前記ユーザの興味度を、前記ジャンル興味度設定処理により設定された前記ジャンル興味度を用いて変換して得られたお勧め度に対し、前記ジャンル興味度が設定されたコンテンツがすべてのコンテンツに対して如何なる割合でお勧めされるかを決定するための閾値を設定する閾値設定処理と、を実行させることを特徴とする。 In order to solve the above problem, the program according to the present invention described in claim 6 sets a genre interest degree in advance for each genre, which is a classification based on the content content, in the computer of the information processing apparatus. and setting process, and the string interestingness estimate processing for estimating the degree of interest of the user with respect to the resulting string by analyzing the viewing history of the user, interests of the user estimated by the character string interestingness estimation process The ratio of the content set with the genre interest level to all the content with respect to the recommendation level obtained by converting the degree using the genre interest level set by the genre interest level setting process And a threshold value setting process for setting a threshold value for determining whether or not it is recommended.

本発明によれば、コンテンツのジャンルに対する興味度とコンテンツの文字列（ワード、単語）に対する興味度とを合成することにより、ユーザの嗜好にフレキシブルに対応したコンテンツを推薦することが可能な情報処理装置、情報処理装置の制御方法、及びプログラムが得られる。 According to the present invention, information processing that can recommend content flexibly corresponding to a user's preference by combining the interest level with respect to the genre of the content and the interest level with respect to the character string (word, word) of the content. Apparatus, information processing apparatus control method, and program are obtained.

本発明の実施形態における情報処理装置を含む情報処理システム全体の構成を示す図である。It is a figure which shows the structure of the whole information processing system containing the information processing apparatus in embodiment of this invention. 本発明の実施形態における情報処理装置の全体構成について説明する概略ブロック図である。It is a schematic block diagram explaining the whole structure of the information processing apparatus in embodiment of this invention. 本発明の実施形態におけるＷｅｂサーバの全体構成について説明する概略ブロック図である。It is a schematic block diagram explaining the whole structure of the web server in embodiment of this invention. 本発明の実施形態における情報処理装置の機能ブロック図である。It is a functional block diagram of an information processor in an embodiment of the present invention. 本発明の実施形態における情報処理装置においてコンテンツのジャンル毎の興味度の設定について説明する図である。It is a figure explaining the setting of the interest degree for every genre of content in the information processing apparatus in the embodiment of the present invention. 本発明の実施形態における情報処理装置において視聴履歴に基づいて機械学習により作成した文字列に対する興味度と順位との関係を示す図である。It is a figure which shows the relationship between the interest degree with respect to the character string produced by the machine learning based on viewing history in the information processing apparatus in embodiment of this invention, and an order | rank. 図６で作成した機械学習により作成した文字列に対する興味度と順位との関係を正規化した図である。It is the figure which normalized the relationship between the degree of interest with respect to the character string created by the machine learning created in FIG. 本発明の実施形態における情報処理装置のお勧め度計算アルゴリズムについて説明する機能ブロック図である。It is a functional block diagram explaining the recommendation degree calculation algorithm of the information processing apparatus in the embodiment of the present invention. 本発明の実施形態における情報処理装置の動作について説明するフローチャートである。It is a flowchart explaining operation | movement of the information processing apparatus in embodiment of this invention. 本発明の実施形態における情報処理装置においてコンテンツのジャンル毎の興味度の離散値と、視聴履歴に基づいて機械学習により作成した文字列に対する興味度と順位との関係を正規化したものと、を演算したときの正規化済み文字列に対する興味度とお勧め度との関係を示す図である。In the information processing apparatus according to the embodiment of the present invention, the discrete value of the degree of interest for each genre of content, and the normalized relationship between the degree of interest and rank for the character string created by machine learning based on the viewing history, It is a figure which shows the relationship between the interest degree and recommendation degree with respect to the normalized character string when it calculates. 本発明の実施形態における情報処理装置においてコンテンツのジャンル毎の興味度の離散値と、視聴履歴に基づいて機械学習により作成した文字列に対する興味度と順位との関係を正規化したものと、を演算したときの正規化済み文字列に対する興味度とお勧め度との関係に対して閾値を設定したときの状態について説明する図である。In the information processing apparatus according to the embodiment of the present invention, the discrete value of the degree of interest for each genre of content, and the normalized relationship between the degree of interest and rank for the character string created by machine learning based on the viewing history, It is a figure explaining the state when a threshold value is set with respect to the relationship between the degree of interest and the recommendation degree with respect to the normalized character string when calculated. 本発明の実施形態における情報処理装置のお勧め度逆計算アルゴリズムについて説明する機能ブロック図である。It is a functional block diagram explaining the recommendation degree reverse calculation algorithm of the information processing apparatus in the embodiment of the present invention. 本発明の実施形態における情報処理装置においてコンテンツのジャンル毎の興味度の離散値と、視聴履歴に基づいて機械学習により作成した文字列に対する興味度と順位との関係を正規化したものと、を演算したときの正規化済み文字列に対する興味度とお勧め度との関係に対して設定された閾値から逆算するときの状態について説明する図である。In the information processing apparatus according to the embodiment of the present invention, the discrete value of the degree of interest for each genre of content, and the normalized relationship between the degree of interest and rank for the character string created by machine learning based on the viewing history, It is a figure explaining the state at the time of calculating backward from the threshold value set with respect to the relationship between the interest degree and recommendation degree with respect to the normalized character string when it calculates. 本発明の実施形態における情報処理装置においてコンテンツのジャンル毎の興味度の実数値と、視聴履歴に基づいて機械学習により作成した文字列に対する興味度と順位との関係を正規化したものと、を演算したときの正規化済み文字列に対する興味度とお勧め度との関係を示す図である。In the information processing apparatus according to the embodiment of the present invention, the real value of the degree of interest for each genre of content and the normalized relationship between the degree of interest and the ranking for the character string created by machine learning based on the viewing history, It is a figure which shows the relationship between the interest degree and recommendation degree with respect to the normalized character string when it calculates.

次に、本発明を実施するための形態について図面を参照して詳細に説明する。なお、各図中、同一又は相当する部分には同一の符号を付しており、その重複説明は適宜に簡略化乃至省略する。本発明の内容を簡潔に説明すると、コンテンツのジャンル毎に予め設定された設定興味度と、コンテンツの視聴により推定された推定興味度とを合成する合成手段と、合成により、推定興味度に対する設定興味度のジャンルに属するコンテンツのお勧め度を生成する生成手段と、任意の設定興味度を有するジャンルに属するコンテンツの、全ジャンルのコンテンツに占めるお勧め割合を決定するための閾値を、生成されたお勧め度に対して設定する設定手段と、を含むことにより、ユーザの嗜好にフレキシブルに対応したコンテンツを推薦することができるのである。 Next, embodiments for carrying out the present invention will be described in detail with reference to the drawings. In addition, in each figure, the same code | symbol is attached | subjected to the part which is the same or it corresponds, The duplication description is simplified thru | or abbreviate | omitted suitably. Briefly explaining the contents of the present invention, a setting means for setting the estimated interest level by combining the setting interest degree preset for each content genre and the estimated interest degree estimated by viewing the content, and the composition. A generating means for generating a recommendation level of content belonging to a genre of interest level and a threshold value for determining a recommended ratio of content belonging to a genre having an arbitrary set interest level in all genres are generated. In addition, by including setting means for setting the recommendation level, it is possible to recommend content that flexibly corresponds to the user's preference.

すなわち、ユーザが設定したコンテンツのジャンル毎の興味度に応じた非線形の数式で、自動的に学習した興味度の推定値を変換している。この変換において、自動的に学習した興味度の推定値の分布を制御することを目的としている。そして、ジャンルに対する興味度に対して平均的なお勧め度を生成する場合と比較して、ジャンルに対する興味度が設定されたコンテンツが、全ジャンルのコンテンツに対してどれくらいの割合でお勧めされるかを決定するために閾値を設けることとしている。要するに、どれくらいの重み付け（傾斜）をつけてジャンルに対する興味度毎にお勧め度を生成するかを決定するために閾値を設けることとしている。 That is, the estimated value of the degree of interest learned automatically is converted by a non-linear expression corresponding to the degree of interest for each genre of content set by the user. The purpose of this conversion is to control the distribution of the estimated values of the degree of interest learned automatically. And, what percentage of content with the genre interest level recommended for all genre content compared to the case of generating an average recommendation level for the genre interest level? In order to determine the threshold value, a threshold value is provided. In short, a threshold value is provided to determine how much weight (gradient) is added and a recommendation level is generated for each degree of interest in a genre.

まず、本発明の実施形態における情報処理装置を含む情報処理システム全体の構成について説明する。図１は、本発明の実施形態における情報処理装置を含む情報処理システム全体の構成を示す図である。図１を参照すると、本発明の実施形態における情報処理装置を含む情報処理システム１０は、例えば、パーソナルコンピュータ（以下、ＰＣともいう。）を一例とする情報処理装置２００が、インターネット等の広く公衆によって接続可能なネットワーク３００を介してＷｅｂサーバ４００に接続されている。また、ＰＣ２００は、放送局１００から送信される放送波や、Ｗｅｂサーバ４００から送出されるコンテンツを受信することができるようになっている。なお、情報処理装置２００には、ＰＣ以外に図示しないスマートフォン等の携帯情報端末も含まれることは勿論である。 First, the configuration of the entire information processing system including the information processing apparatus according to the embodiment of the present invention will be described. FIG. 1 is a diagram illustrating a configuration of an entire information processing system including an information processing apparatus according to an embodiment of the present invention. Referring to FIG. 1, an information processing system 10 including an information processing apparatus according to an embodiment of the present invention includes an information processing apparatus 200 exemplifying a personal computer (hereinafter also referred to as a PC), which is widely used by the public such as the Internet. Is connected to the Web server 400 via the connectable network 300. Further, the PC 200 can receive broadcast waves transmitted from the broadcasting station 100 and contents transmitted from the Web server 400. Of course, the information processing apparatus 200 includes a mobile information terminal such as a smartphone (not shown) in addition to the PC.

また、Ｗｅｂサーバ４００は、ＰＣ２００、及び図示しないスマートフォン等と、インターネット等のネットワーク３００を介して接続されている。そして、ＰＣ２００、及び図示しないスマートフォン等から発せられた任意のＷｅｂページに接続したい旨のアクセス要求に対して、ＵＲＬ（Uniform Resource Locator）で特定されたＷｅｂサーバ４００から提供されるＷｅｂページの閲覧が可能となる。なお、図１中には、インターネット等のネットワーク３００に対してＰＣ２００のみ接続されているが、情報処理装置は１台に限定されず、複数台接続されていることはいうまでも無い。 The Web server 400 is connected to the PC 200 and a smartphone (not shown) via a network 300 such as the Internet. Then, in response to an access request for connecting to an arbitrary Web page issued from the PC 200 or a smartphone (not shown), the Web page provided from the Web server 400 specified by a URL (Uniform Resource Locator) is browsed. It becomes possible. In FIG. 1, only the PC 200 is connected to the network 300 such as the Internet, but the number of information processing apparatuses is not limited to one, and it goes without saying that a plurality of information processing apparatuses are connected.

次に、本発明の実施形態における情報処理装置の全体構成について説明する。図２は、本発明の実施形態における情報処理装置の全体構成について説明する概略ブロック図である。ＰＣ２００は、ＴＶチューナ部２０１と、ネットワーク接続部２０５と、ＣＰＵ（Central Processing Unit）２０６と、ＲＯＭ（Read Only Memory）２０２と、ＲＡＭ（Random Access Memory）２０３と、ＨＤＤ（Hard Disk Drive）２０４と、表示部２０７と、入力部２０８と、電源部２０９とから構成される。 Next, the overall configuration of the information processing apparatus according to the embodiment of the present invention will be described. FIG. 2 is a schematic block diagram illustrating the overall configuration of the information processing apparatus according to the embodiment of the present invention. The PC 200 includes a TV tuner unit 201, a network connection unit 205, a CPU (Central Processing Unit) 206, a ROM (Read Only Memory) 202, a RAM (Random Access Memory) 203, an HDD (Hard Disk Drive) 204, , A display unit 207, an input unit 208, and a power supply unit 209.

ＴＶチューナ部２０１は、放送局１００（図１）から送信される地上デジタル、ＢＳ（Broadcasting Satellite）、及びＣＳ（Communications Satellite）放送をアンテナから受信し復調するものである。ネットワーク接続部２０５は、インターネットに代表されるネットワーク３００に接続され、ネットワーク３００とのインタフェースを図るものである。ＣＰＵ２０６は、ＰＣ２００全体の動作を制御するものであり、ＲＯＭ２０２に格納された制御プログラムをロードし、ＰＣ２００の動作によって得られた様々なデータをＲＡＭ２０３に展開するものである。ＨＤＤ２０４は、ＰＣ２００のアプリケーションソフトウェアプログラムを格納したり、ＴＶチューナ部２０１によって受信されたテレビ番組や、Ｗｅｂサーバ４００から送出されるコンテンツ（以下、これ等をまとめてコンテンツともいう。）を録画したりするものである。 The TV tuner unit 201 receives and demodulates terrestrial digital, BS (Broadcasting Satellite), and CS (Communications Satellite) broadcasts transmitted from the broadcast station 100 (FIG. 1) from an antenna. The network connection unit 205 is connected to a network 300 represented by the Internet and serves as an interface with the network 300. The CPU 206 controls the operation of the entire PC 200, loads a control program stored in the ROM 202, and develops various data obtained by the operation of the PC 200 in the RAM 203. The HDD 204 stores application software programs of the PC 200, records TV programs received by the TV tuner unit 201, and contents transmitted from the Web server 400 (hereinafter collectively referred to as contents). To do.

表示部２０７は、ＬＣＤ（Liquid Crystal Display）等で構成される表示画面であり、ＰＣ２００によって実行されたアプリケーションソフトウェアプログラムの結果やＴＶチューナ部２０１によって受信されたテレビ番組、及びＷｅｂサーバ４００から受信したコンテンツを表示するものである。入力部２０８は、キーボード、マウス、タッチパネル等、ユーザがＰＣ２００に対して指示を与えるものである。そして、電源部２０９は、ＰＣ２００に対してＡＣ（Alternative Current：交流）、又はＤＣ（Direct Current：直流）電源を与えるものである。 The display unit 207 is a display screen composed of an LCD (Liquid Crystal Display) or the like. The display unit 207 receives a result of an application software program executed by the PC 200, a TV program received by the TV tuner unit 201, and the Web server 400. The content is displayed. The input unit 208 is for a user to give an instruction to the PC 200 such as a keyboard, a mouse, and a touch panel. The power supply unit 209 supplies AC (Alternative Current: AC) or DC (Direct Current: DC) power to the PC 200.

次に、本発明の実施形態におけるＷｅｂサーバの全体構成について説明する。図３は、本発明の実施形態におけるＷｅｂサーバの全体構成について説明する概略ブロック図である。Ｗｅｂサーバ４００は、ネットワーク接続部４０３と、ＣＰＵ４０４と、ＲＯＭ４０１と、データベース部４０２と、表示部４０５と、操作部４０６と、電源部４０７とから構成される。 Next, the overall configuration of the Web server in the embodiment of the present invention will be described. FIG. 3 is a schematic block diagram illustrating the overall configuration of the Web server according to the embodiment of the present invention. The Web server 400 includes a network connection unit 403, a CPU 404, a ROM 401, a database unit 402, a display unit 405, an operation unit 406, and a power supply unit 407.

ネットワーク接続部４０３は、インターネットに代表されるネットワーク３００（図１）に接続され、ネットワーク３００とのインタフェースを図るものである。ＣＰＵ４０４は、Ｗｅｂサーバ４００全体の動作を制御するものであり、ＲＯＭ４０１に格納された制御プログラムをロードし、ＣＰＵ４０４の動作によって得られた様々なデータをデータベース部４０２に展開したり、後述するように、ユーザの嗜好に合致するコンテンツを送出したりするものである。表示部４０５は、ＬＣＤ等で構成される表示画面であり、Ｗｅｂサーバ４００によってデータベース部４０２に格納されたデータの格納状況等を表示するものである。操作部４０６は、キーボード、マウス、タッチパネル等、Ｗｅｂサーバ４００の保守者が、Ｗｅｂサーバ４００に対して指示を与えるものである。そして、電源部４０７は、Ｗｅｂサーバ４００に対してＡＣ又はＤＣ電源を与えるものである。 The network connection unit 403 is connected to a network 300 (FIG. 1) represented by the Internet, and serves as an interface with the network 300. The CPU 404 controls the overall operation of the Web server 400. The CPU 404 loads a control program stored in the ROM 401 and develops various data obtained by the operation of the CPU 404 in the database unit 402, as will be described later. The content that matches the user's preference is transmitted. The display unit 405 is a display screen configured by an LCD or the like, and displays a storage status of data stored in the database unit 402 by the Web server 400. The operation unit 406 is for a maintenance person of the Web server 400 to give an instruction to the Web server 400 such as a keyboard, a mouse, and a touch panel. The power supply unit 407 supplies AC or DC power to the Web server 400.

次に、本発明の実施形態における情報処理装置の機能ブロックについて説明する。図４は、本発明の実施形態における情報処理装置の機能ブロック図である。 Next, functional blocks of the information processing apparatus according to the embodiment of the present invention will be described. FIG. 4 is a functional block diagram of the information processing apparatus according to the embodiment of the present invention.

図４において、ＰＣ２００は、放送局１００（図１）から送信される放送波を受信する複数のチューナ部２１０、２１１、２１２、及び２１３と、チューナ部２１０、２１１、２１２、及び２１３により復調されたテレビ番組や、Ｗｅｂサーバ４００から送出されるコンテンツを記録し再生するするコンテンツ記録再生部２１４と、放送局１００から送信される電波の隙間を使って送信される電子番組ガイド（ＥＰＧ）を管理するＥＰＧ情報管理部２１７と、液晶（ＬＣＤ）等のディスプレイ２２１にテレビ番組やコンテンツを表示する動画表示処理部２１５と、を含んで構成されている。なお、チューナ部の数は、４個に限定されないことは勿論である。 In FIG. 4, the PC 200 is demodulated by a plurality of tuner units 210, 211, 212, and 213 that receive broadcast waves transmitted from the broadcast station 100 (FIG. 1) and the tuner units 210, 211, 212, and 213. A content recording / playback unit 214 that records and plays back TV programs and content sent from the Web server 400, and an electronic program guide (EPG) that is transmitted using a gap between radio waves transmitted from the broadcasting station 100 And an EPG information management unit 217 for displaying a television program and content on a display 221 such as a liquid crystal (LCD). Of course, the number of tuner sections is not limited to four.

また、ディスプレイ２２１には、ライブ放送されているテレビ番組の画面、録画されたテレビ番組の再生画面、録画されたテレビ番組の一覧表、Ｗｅｂサーバ４００から受信するコンテンツが表示されることは勿論であるが、後述するＥＰＧ情報管理部２１７により取得された、過去に放送された番組、現在放送されている番組、又は今後放送される予定の番組の番組表も表示される。また、テレビ番組とFacebook（登録商標）画面とを並べて表示したり、Facebook画面をテレビ画面のバックグラウンドで表示したりすることも可能である。 The display 221 displays a live TV program screen, a recorded TV program playback screen, a list of recorded TV programs, and content received from the Web server 400. However, the program table of the program broadcast in the past, the program currently broadcast, or the program scheduled to be broadcast in the future, acquired by the EPG information management unit 217 described later is also displayed. It is also possible to display a TV program and a Facebook (registered trademark) screen side by side, or display the Facebook screen in the background of the TV screen.

さらに、ＰＣ２００は、インターネット等の公衆に利用可能なネットワーク３００（図１）を介してＷｅｂサーバ４００に接続するためのネットワーク接続処理部２２０と、チューナ部２１０、２１１、２１２、及び２１３により受信されたテレビ番組や、Ｗｅｂサーバ４００から取得されたコンテンツに対するユーザの視聴履歴に基づいて、文字列（ワード、単語）を取得し記録する動画情報取得記録部２１９と、ＥＰＧ情報管理部２１７から送信される電子番組表、及び興味情報取得部２１８を介して設定されたコンテンツのジャンル毎の興味度と、受信されたユーザの視聴履歴に基づいて機械学習により作成した文字列（ワード、単語）に対する興味度と順位との関係を正規化したものとの間で所定の演算を実行し、お勧め度を解析する映像解析処理部２１６と、映像解析処理部２１６で解析されたお勧め度をディスプレイ２２１に表示する動画表示処理部２１５と、ＰＣ２００の動作を遠隔操作するリモコン２２２と、を含んで構成されている。 Further, the PC 200 is received by the network connection processing unit 220 for connecting to the Web server 400 via the network 300 (FIG. 1) available to the public such as the Internet, and the tuner units 210, 211, 212, and 213. Is transmitted from the EPG information management unit 217 and the moving image information acquisition recording unit 219 that acquires and records character strings (words, words) based on the user's viewing history of the TV program and the content acquired from the Web server 400. Interest in character strings (words, words) created by machine learning based on the degree of interest for each genre of content set through the electronic program guide and the interest information acquisition unit 218 and the viewing history of the received user Analyze the recommended degree by performing a predetermined operation between the normalized degree and rank The video analysis processing unit 216, a moving image display processing unit 215 that displays the recommended degree analyzed by the video analysis processing unit 216 on the display 221, and a remote controller 222 that remotely controls the operation of the PC 200 are configured. .

次に、本発明の実施形態における情報処理装置においてコンテンツのジャンル毎の興味度の設定について説明する。図５は、本発明の実施形態における情報処理装置においてコンテンツのジャンル毎の興味度の設定について説明する図である。 Next, setting of the degree of interest for each genre of content in the information processing apparatus according to the embodiment of the present invention will be described. FIG. 5 is a diagram illustrating setting of the degree of interest for each genre of content in the information processing apparatus according to the embodiment of the present invention.

図５に示すように、ＰＣ２００のユーザは、チューナ部２１０、２１１、２１２、及び２１３を介して受信されるテレビ番組について、番組ジャンル５０１と各番組ジャンルに対する興味度５０２とを設定することができる。番組ジャンル５０１は、ニュース／報道、スポーツ、情報／ワイドショー、ドラマ、音楽、バラエティ、映画、アニメ／特撮、ドキュメンタリー／教養、劇場／公演、趣味／教育、福祉といった１２ジャンルに分類されている。このジャンル分けは、ＡＲＩＢ（Association of Radio Industries and Business：社団法人電波産業界）で定められたジャンル分類にしたがっているが、これ以外にも任意のジャンル分類を用いることができる。また、興味度５０２は、興味なし（０）から興味あり（４）の５段階で設定することができる。 As shown in FIG. 5, the user of the PC 200 can set a program genre 501 and an interest level 502 for each program genre for television programs received via the tuner units 210, 211, 212, and 213. . The program genre 501 is classified into 12 genres such as news / report, sports, information / wide show, drama, music, variety, movie, animation / special effects, documentary / cultural, theater / performance, hobby / education, welfare. This genre classification follows the genre classification defined by ARIB (Association of Radio Industries and Business), but any other genre classification can be used. In addition, the degree of interest 502 can be set in five stages from not interested (0) to interested (4).

図５の例では、ニュース／報道は興味度１、スポーツは興味なし（０）、情報／ワイドショーは興味度１、ドラマは興味度２、音楽は興味あり（４）、バラエティは興味度２、映画は興味度３、アニメ・特撮は興味度３、ドキュメンタリー／教養は興味度２、劇場／公演は興味度３、趣味／教育は興味度１、福祉は興味なし（０）、にそれぞれ設定されている。この各番組ジャンルに対する興味度の設定は、ユーザがＰＣ２００を購入した直後、又はユーザがＰＣ２００を購入後、ある一定期間コンテンツを視聴した後に設定するようにしても良い。そして、この各番組ジャンルに対する興味度の設定は、表示部２０７に図５に示す画面が表示された状態で、興味情報取得部２１８（図４）に対してユーザがＰＣ２００の入力部２０８を用いて行うようにする。 In the example of FIG. 5, news / report coverage is interest level 1, sports is not interested (0), information / wide show is interest level 1, drama is interest level 2, music is interested (4), variety is interest level 2. , The interest level is 3 for movies, the interest level is 3 for animation and special effects, the interest level is 2 for documentary / cultural education, the interest level is 3 for theater / performance, the interest level is 1 for hobbies / education, and the interest level is not interested (0). Has been. The degree of interest for each program genre may be set immediately after the user purchases the PC 200 or after the user purchases the PC 200 and views the content for a certain period. Then, the interest level for each program genre is set by the user using the input unit 208 of the PC 200 to the interest information acquisition unit 218 (FIG. 4) while the screen shown in FIG. 5 is displayed on the display unit 207. To do.

次に、本発明の実施形態における情報処理装置において視聴履歴に基づいて機械学習により作成した文字列に対する興味度と順位との関係について説明する。図６は、本発明の実施形態における情報処理装置において視聴履歴に基づいて機械学習により作成した文字列に対する興味度と順位との関係を示す図である。 Next, the relationship between the degree of interest and the ranking for the character string created by machine learning based on the viewing history in the information processing apparatus according to the embodiment of the present invention will be described. FIG. 6 is a diagram showing the relationship between the degree of interest and the ranking for the character string created by machine learning based on the viewing history in the information processing apparatus according to the embodiment of the present invention.

図６に示すように、ユーザの嗜好を反映した視聴履歴に基づいて機械学習（マシンラーニング）により作成した文字列に対する興味度、換言すれば、視聴履歴に含まれる文字列（ワード、単語）を分析評価することにより得られた当該文字列（ワード、単語）に対する興味度を縦軸に、順位を横軸にとる。図６において、文字列に対する興味度が平均値にあるとき機械学習による順位は略網羅されるが、文字列に対する興味度が極端に大きくなるに連れて機械学習による順位は低下して飽和状態となり、文字列に対する興味度が極端に小さくなるに連れて機械学習による順位は増大して飽和状態となる。 As shown in FIG. 6, the degree of interest in the character string created by machine learning (machine learning) based on the viewing history reflecting the user's preference, in other words, the character string (word, word) included in the viewing history. The degree of interest for the character string (word, word) obtained by analyzing and evaluating is plotted on the vertical axis, and the rank is plotted on the horizontal axis. In FIG. 6, the ranking by machine learning is substantially covered when the degree of interest in the character string is an average value, but as the degree of interest in the character string becomes extremely large, the rank by machine learning decreases and becomes saturated. As the degree of interest in the character string becomes extremely small, the rank by machine learning increases and becomes saturated.

ここで、機械学習（マシンラーニング）について若干説明する。本実施形態では、お勧め度を生成する計算過程において、文字列による興味度推定アルゴリズムを利用している。文字列による興味度推定アルゴリズムは、電子番組ガイド（ＥＰＧ）から取得される番組の概要や人名といった文字列情報から、ユーザの興味度を推定するものである。この推定は、例えば、ＴＦ／ＩＤＦ（Term Frequency Inverse Document Frequency）、ナイーブベイズ、ベクトル空間法、サポートベクタマシン等、公知の機械学習（マシンラーニング）を用いて行っている。本実施形態では、図６に示した文字列に対する興味度と順位との関係のうち、非線形領域（飽和状態となっている領域）だけでなく、線形領域（文字列に対する興味度が略平均値にある非飽和領域）をも用いてコンテンツの推薦を行うことにより、推薦数の自由度を高めている。 Here, machine learning (machine learning) will be described briefly. In the present embodiment, an interest degree estimation algorithm based on a character string is used in a calculation process for generating a recommendation degree. The interest level estimation algorithm based on a character string estimates a user's interest level from character string information such as an outline of a program and a person's name acquired from an electronic program guide (EPG). This estimation is performed using known machine learning (machine learning) such as TF / IDF (Term Frequency Inverse Document Frequency), naive Bayes, vector space method, support vector machine, and the like. In the present embodiment, not only the nonlinear region (saturated region) but also the linear region (the degree of interest with respect to the character string is approximately the average value) of the relationship between the degree of interest and the ranking with respect to the character string illustrated in FIG. The degree of freedom of the number of recommendations is increased by recommending content using the non-saturated region in FIG.

次に、図６で作成した機械学習により作成した文字列に対する興味度と順位との関係を正規化したものについて説明する。図７は、図６で作成した機械学習により作成した文字列に対する興味度と順位との関係を正規化した図である。 Next, what normalized the relationship between the degree of interest and the ranking for the character string created by the machine learning created in FIG. 6 will be described. FIG. 7 is a diagram in which the relationship between the degree of interest and the rank for the character string created by the machine learning created in FIG. 6 is normalized.

図７において、図６における非線形曲線を線形式で正規化すると、図７の曲線Ｃに示すように、順位の平均値である０．５を中心として歪んでいる。本実施形態では、（０、１）の正規化を行うことにより、曲線Ｃを直線Ｄに近くなるように計算し変換している。そうすると、順位０．９の点を境として文字列に対する興味度が、約９対１に分割される。このように直線Ｄに近似しておけば、直線Ｄから超えた曲線Ｃの部分の長さは、全体に占める番組数と合い易くなる。 In FIG. 7, when the non-linear curve in FIG. 6 is normalized in a linear format, as shown by a curve C in FIG. In the present embodiment, the curve C is calculated and converted so as to be close to the straight line D by performing normalization of (0, 1). If it does so, the interest degree with respect to a character string will be divided | segmented into about 9 to 1 by the point of the order 0.9. When approximated to the straight line D in this way, the length of the portion of the curve C exceeding the straight line D can easily match the number of programs in the whole.

次に、本発明の実施形態における情報処理装置のお勧め度計算アルゴリズムについて説明する。図８は、本発明の実施形態における情報処理装置のお勧め度計算アルゴリズムについて説明する機能ブロック図である。 Next, the recommendation level calculation algorithm of the information processing apparatus in the embodiment of the present invention will be described. FIG. 8 is a functional block diagram for explaining the recommendation level calculation algorithm of the information processing apparatus according to the embodiment of the present invention.

図８に示すように、本実施形態の対象となるお勧め度計算アルゴリズム８０４は、１週間分の電子番組ガイド（ＥＰＧ）８０１と、図５で説明したジャンル興味度８０２と、閾値８０３とを入力とし、最終的に閾値以上のお勧め度付き番組リスト８０６を出力するものである。１週間分のＥＰＧ８０１は、ＥＰＧ情報管理部２１７（図４）によりテレビ放送の各番組の情報を格納したテーブルである。番組毎にタイトル、ジャンル、放送日時、放送局、その他のテキスト情報を保持する。 As shown in FIG. 8, the recommendation degree calculation algorithm 804 that is the object of this embodiment includes an electronic program guide (EPG) 801 for one week, the genre interest degree 802 described in FIG. 5, and a threshold value 803. As an input, a program list 806 with a recommendation level that is equal to or higher than a threshold is finally output. The EPG 801 for one week is a table in which information on each program of the television broadcast is stored by the EPG information management unit 217 (FIG. 4). The title, genre, broadcast date, broadcast station, and other text information are held for each program.

ジャンル興味度８０２は、図５において説明したように、明示的にユーザが各ジャンルに対して設定した興味度である。“ドラマ”、“ニュース／報道”、“バラエティ”等の各ジャンルに対して、興味度を段階的に設定可能なものとする。 The genre interest level 802 is an interest level explicitly set for each genre by the user as described in FIG. The degree of interest can be set in stages for each genre such as “drama”, “news / report”, and “variety”.

閾値８０３は、ジャンルに対する興味度毎に生成されたお勧め度に対する切り捨て位置（お勧め度の割合）を設定するものである。この閾値に関して、お勧め度計算アルゴリズム８０４と相関性を持たせることも本実施形態の特徴である。お勧め度計算アルゴリズム８０４は、本実施形態の主眼となる機能であり、お勧め度計算アルゴリズム８０４の計算過程において、図７で説明した文字列による興味度推定アルゴリズム８０５を利用する。 The threshold value 803 is used to set a cut-off position (recommendation rate ratio) with respect to the recommendation level generated for each interest level with respect to the genre. It is a feature of this embodiment that the threshold value is correlated with the recommendation degree calculation algorithm 804. The recommendation degree calculation algorithm 804 is a main function of this embodiment, and the interest degree estimation algorithm 805 based on the character string described with reference to FIG. 7 is used in the calculation process of the recommendation degree calculation algorithm 804.

なお、１週間分の電子番組ガイド（ＥＰＧ）８０１は、ＥＰＧ情報管理部２１７（図４）によって取得された後、ＲＡＭ２０３（図２）に格納され、ジャンル興味度８０２、及び閾値８０３は、入力部２０８を用いて入力された後、ＨＤＤ２０４に格納される。文字列による興味度推定アルゴリズム８０５、及びお勧め度計算アルゴリズム８０４は、ＣＰＵ２０６によって実行され、閾値以上のお勧め度付き番組リスト８０６は、ＨＤＤ２０４又はＲＡＭ２０３に格納される。 The electronic program guide (EPG) 801 for one week is acquired by the EPG information management unit 217 (FIG. 4) and then stored in the RAM 203 (FIG. 2), and the genre interest level 802 and the threshold value 803 are input. After being input using the unit 208, it is stored in the HDD 204. An interest level estimation algorithm 805 and a recommendation level calculation algorithm 804 based on a character string are executed by the CPU 206, and a program list 806 with a recommendation level equal to or higher than a threshold is stored in the HDD 204 or the RAM 203.

次に、本発明の実施形態における情報処理装置の動作について説明する。図９は、本発明の実施形態における情報処理装置の動作について説明するフローチャートである。 Next, the operation of the information processing apparatus in the embodiment of the present invention will be described. FIG. 9 is a flowchart for explaining the operation of the information processing apparatus according to the embodiment of the present invention.

本実施形態のお勧め度計算アルゴリズムは、特に興味度の正規化手順を有すること、ジャンル興味度による非線形変換を有することを特徴としている。また、足切りに用いる閾値設定も、本実施形態の特徴である。以下、動作の流れについて説明する。 The recommendation degree calculation algorithm of the present embodiment is characterized in that it particularly has an interest degree normalization procedure and a non-linear transformation based on genre interest degree. In addition, threshold setting used for cut off is also a feature of this embodiment. The operation flow will be described below.

図９において、まず、ステップ（以下、Ｓという。）９０１の処理では、文字列による番組の興味度が計算される。具体的には、図６で説明したような、視聴履歴に基づいて機械学習により作成した文字列に対する興味度と順位との関係が求められる。Ｓ９０２の処理では、コンテンツの数分ループしたか否かが判断される。コンテンツの数分ループしていない（Ｓ９０２：ＮＯ）と判断されると、コンテンツの数分ループするまで待機する。コンテンツの数分ループした（Ｓ９０２：ＹＥＳ）と判断されると、Ｓ９０３の処理へ移行する。 In FIG. 9, first, in the processing of step (hereinafter referred to as “S”) 901, the degree of interest of a program by a character string is calculated. Specifically, as described with reference to FIG. 6, the relationship between the degree of interest and the ranking for the character string created by machine learning based on the viewing history is obtained. In the process of S902, it is determined whether or not the number of contents has been looped. If it is determined that the number of contents is not looped (S902: NO), the process waits until the number of contents is looped. If it is determined that the number of contents has been looped (S902: YES), the process proceeds to S903.

Ｓ９０３の処理では、興味度の正規化が行われる。具体的には、図７で説明したような、図６で作成した機械学習により作成した文字列に対する興味度と順位との関係を正規化する処理である。興味度の正規化は、文字列による番組の興味度の推定値に対して、最大値を１、平均値を０．５、最小値を０とするような変換を行う。以後の計算の精度のためには、まず第１に、最大値、平均値、最小値は、直近のある期間の統計量を用いることである。例えば、ＴＦ／ＩＤＦやナイーブベイズを用いた演算の場合、理論的な最大／最小を規定することができないので、学習状況や番組表の状況に応じて、異なる統計量となる。状況に応じて最適に制御するためには、ある期間での統計量を用いることが望ましい。 In the process of S903, the interest level is normalized. Specifically, this is a process for normalizing the relationship between the degree of interest and the rank for the character string created by the machine learning created in FIG. 6 as described in FIG. In the normalization of the interest level, a conversion is performed such that the maximum value is 1, the average value is 0.5, and the minimum value is 0 with respect to the estimated value of the interest level of the program by the character string. For the accuracy of the subsequent calculations, first, the maximum value, the average value, and the minimum value are to use statistics for a certain period. For example, in the case of calculation using TF / IDF or naïve Bayes, the theoretical maximum / minimum cannot be defined, so that the statistic varies depending on the learning situation and the situation of the program guide. In order to optimally control according to the situation, it is desirable to use statistics over a certain period.

第２に、非線形な変換を用いて、正規化後の興味度の分布を一様分布に近づけることである。一様分布から大きく離れる場合、ジャンル興味度を用いた非線形変換による分布の制御が困難となるからである。 Secondly, the normalized interest distribution is brought close to a uniform distribution using a non-linear transformation. This is because if the distribution is far from the uniform distribution, it becomes difficult to control the distribution by non-linear transformation using the genre interest degree.

Ｓ９０４の処理では、コンテンツの数分ループしたか否かが判断される。コンテンツの数分ループしていない（Ｓ９０４：ＮＯ）と判断されると、コンテンツの数分ループするまで待機する。コンテンツの数分ループした（Ｓ９０４：ＹＥＳ）と判断されると、Ｓ９０５の処理へ移行する。 In the process of S904, it is determined whether or not the number of contents has been looped. If it is determined that the number of contents has not been looped (S904: NO), the process waits until the number of contents is looped. If it is determined that the number of contents has been looped (S904: YES), the process proceeds to S905.

Ｓ９０５の処理では、ジャンル興味度による非線形変換が行われる。ジャンル興味度による非線形変換は、図１０に示すように正規化された文字列による興味度を非線形に変換する操作である。入力、出力共に、０−１の範囲を領域とするが、ジャンル興味度により、分布が制御される。変換式のイメージを図１０に示す。図１０は、本発明の実施形態における情報処理装置においてコンテンツのジャンル毎の興味度の離散値と、視聴履歴に基づいて機械学習により作成した文字列に対する興味度と順位との関係を正規化したものと、を演算したときの正規化済み文字列に対する興味度とお勧め度との関係を示す図である。 In the process of S905, non-linear conversion based on the genre interest is performed. The non-linear conversion based on the genre interest is an operation for converting the interest based on the normalized character string non-linearly as shown in FIG. Although both the input and output ranges from 0 to 1, the distribution is controlled by the degree of genre interest. An image of the conversion formula is shown in FIG. FIG. 10 normalizes the relationship between the discrete value of the degree of interest for each genre of content in the information processing apparatus according to the embodiment of the present invention and the degree of interest for the character string created by machine learning based on the viewing history. It is a figure which shows the relationship between the degree of interest and the recommendation degree with respect to the normalized character string when calculating a thing.

ジャンル興味度は１から３で、数値が大きいほど興味があるものとする。図１０では、興味度３の変換式で生成されるお勧め度は高い方向に、興味度１の変換式で生成されるお勧め度は低い方向に、分布が移動することが分かる。また、興味度３の変換式で生成されるお勧め度は、正規化済み文字列に対する興味度が少しでも高くなればお勧め度は高くなり、興味度１の変換式で生成されるお勧め度は、正規化済み文字列に対する興味度が高くならない限り、お勧め度は高くならない。興味度１の変換式で生成されるお勧め度は、視聴履歴中に非常に興味度が高い文字列（ワード、単語）が書かれている番組に対しては高くなる傾向にある。したがって、ジャンル興味度が低いジャンルに属するコンテンツであっても、当該コンテンツに関連する派生コンテンツに対する視聴履歴に基づいて機械学習により作成された正規化文字列に対する興味度が高ければ、お勧め度が高くなるため、録画対象に成り易くなる。 The genre interest level is 1 to 3, and the greater the value, the more interested. In FIG. 10, it can be seen that the distribution moves in a direction in which the recommendation degree generated by the conversion formula of interest degree 3 is high and the recommendation degree generated by the conversion expression of interest degree 1 is low. In addition, the recommendation degree generated by the conversion formula of interest level 3 becomes higher if the interest level for the normalized character string becomes higher as much as possible, and the recommendation level generated by the conversion expression of interest level 1 The degree of recommendation is not high unless the degree of interest in the normalized character string is high. The recommendation level generated by the conversion formula of interest level 1 tends to be high for programs in which character strings (words, words) having a very high level of interest are written in the viewing history. Therefore, even if the content belongs to a genre with a low genre interest level, the recommendation level is high if the interest level for the normalized character string created by machine learning based on the viewing history for the derivative content related to the content is high. Since it becomes high, it becomes easy to become a recording target.

分布を制御するため、ジャンル興味度が１（興味度１の変換式）であっても、正規化済み文字列による興味度推定値が十分高い場合には、お勧め度として上位に入ることができる。このような変換式は、べき乗や、座標点（０、０）、（１、１）を通る円弧形状等、複数想定される。何れの場合であっても類似する効果を与えることが可能である。このように、文字列による興味度推定値を、ジャンルに対する興味度を用いて変換し、順位を振り直している。その際、文字列による興味度を正規化し、番組の属するジャンルに設定されている興味度に応じてさらにお勧め度を上下させているのである。 In order to control the distribution, even if the genre interest level is 1 (conversion formula of interest level 1), if the estimated interest level based on the normalized character string is sufficiently high, the recommendation level may be higher. it can. A plurality of such conversion expressions are assumed, such as a power and an arc shape passing through coordinate points (0, 0) and (1, 1). In any case, it is possible to give a similar effect. Thus, the interest level estimated value based on the character string is converted using the interest level with respect to the genre , and the order is reassigned. At that time, the degree of interest by the character string is normalized, and the recommendation degree is further increased or decreased according to the degree of interest set in the genre to which the program belongs.

図９に戻り、Ｓ９０６の処理では、コンテンツの数分ループしたか否かが判断される。コンテンツの数分ループしていない（Ｓ９０６：ＮＯ）と判断されると、コンテンツの数分ループするまで待機する。コンテンツの数分ループした（Ｓ９０６：ＹＥＳ）と判断されると、Ｓ９０７の処理へ移行する。 Returning to FIG. 9, in the processing of S <b> 906, it is determined whether or not the number of contents has been looped. If it is determined that the content is not looped (S906: NO), the process waits until the content is looped. If it is determined that the number of contents has been looped (S906: YES), the process proceeds to S907.

Ｓ９０７の処理では、お勧め度によるソートが行われる。具体的には、図１０に示すように、お勧め度の高い方から低い方へソートが行われ、お勧め度が生成される。Ｓ９０８の処理では、閾値による足切り処理が行われる。閾値設定方法のイメージを図１１に示す。図１１は、本発明の実施形態における情報処理装置においてコンテンツのジャンル毎の興味度の離散値と、視聴履歴に基づいて機械学習により作成した文字列に対する興味度と順位との関係を正規化したものと、を演算したときの正規化済み文字列に対する興味度とお勧め度との関係に対して閾値を設定したときの状態について説明する図である。 In the processing of S907, sorting is performed according to the recommendation level. Specifically, as shown in FIG. 10, sorting is performed from a higher recommendation level to a lower recommendation level to generate a recommendation level. In the process of S908, a cut-off process using a threshold value is performed. An image of the threshold setting method is shown in FIG. FIG. 11 normalizes the relationship between the discrete value of the degree of interest for each genre of content and the degree of interest and rank for a character string created by machine learning based on the viewing history in the information processing apparatus according to the embodiment of the present invention. It is a figure explaining the state when a threshold value is set with respect to the relationship between the degree of interest and the recommendation degree with respect to the normalized character string when the one is calculated.

各興味度に対応するコンテンツが、全ジャンルのコンテンツに対して平均的な割合で録画されるように閾値を設定する場合、正規化済みの文字列に対する興味度が０．５の場合の、各興味度における変換式の値を閾値に設定すれば良い。この場合、閾値は、閾値１、閾値２、閾値３のように設定する。同様に、例えば、各興味度に対応するコンテンツが、全ジャンルのコンテンツに対して３０％の割合で録画されるように閾値を設定する場合には、正規化済みの文字列に対する興味度が０．７の場合の、各興味度における変換式の値を閾値に設定すれば良い。この場合、閾値は、閾値１´、閾値２´、閾値３´のように設定する。Ｓ９０８の処理における閾値による足切りが行われた後、処理を終了する。 When setting the threshold value so that the content corresponding to each degree of interest is recorded at an average ratio to the content of all genres, each of the cases where the degree of interest in the normalized character string is 0.5 What is necessary is just to set the value of the conversion formula in an interest degree to a threshold value. In this case, the thresholds are set as threshold 1, threshold 2, and threshold 3. Similarly, for example, when the threshold value is set so that the content corresponding to each degree of interest is recorded at a rate of 30% with respect to the content of all genres, the degree of interest in the normalized character string is 0. In the case of .7, the value of the conversion formula for each degree of interest may be set as a threshold value. In this case, the threshold values are set as a threshold value 1 ′, a threshold value 2 ′, and a threshold value 3 ′. After the cut-off by the threshold in the process of S908 is performed, the process is terminated.

ここで、ユーザに対し閾値に対する意味をどのようにして提示するかについて若干説明する。具体的には、閾値を十番単位の切り番となるように設定しておき、例えば閾値が５０の場合には、画面イメージとして録画される番組と録画されない番組とに色分けして表示したり、録画されない番組はどれであり、全体的にいくつの番組が録画されるかをＧＵＩ（Graphical User Interface）で表示したりする等、閾値に対する意味を報知するユーザインタフェースは任意の形態を取り得る。 Here, how the meaning for the threshold value is presented to the user will be described briefly. Specifically, the threshold value is set to be a turn number in units of ten. For example, when the threshold value is 50, a program recorded as a screen image and a program not recorded are displayed in different colors. The user interface for informing the meaning of the threshold can take any form, such as displaying which program is not recorded and how many programs are recorded as a whole with a GUI (Graphical User Interface).

このように、本実施形態では、ユーザが設定したコンテンツのジャンル毎の興味度に応じた非線形の数式で、自動的に学習した興味度の推定値を変換している。この変換において、自動的に学習した興味度の推定値の分布を制御することを目的としている。そして、ジャンルに対する興味度に対して平均的なお勧め度を生成する場合と比較して、どれくらいの重み付け（傾斜）をつけてジャンルに対する興味度毎にお勧め度を生成するかを決定するために閾値を設けることとしている。 Thus, in this embodiment, the estimated value of the degree of interest learned automatically is converted by a non-linear mathematical expression corresponding to the degree of interest for each genre of content set by the user. The purpose of this conversion is to control the distribution of the estimated values of the degree of interest learned automatically. And, in order to determine how much weight (slope) to generate the recommendation degree for each degree of interest in the genre, compared to the case of generating the average recommendation degree for the interest degree in the genre A threshold is set.

次に、本発明の実施形態における情報処理装置のお勧め度逆計算アルゴリズムのシステム構成について説明する。図１２は、本発明の実施形態における情報処理装置のお勧め度逆計算アルゴリズムについて説明する機能ブロック図である。 Next, the system configuration of the recommended degree inverse calculation algorithm of the information processing apparatus according to the embodiment of the present invention will be described. FIG. 12 is a functional block diagram illustrating the recommended degree reverse calculation algorithm of the information processing apparatus according to the embodiment of this invention.

図８では、閾値を離散的に設定する方法について説明した。ここまでの説明を応用すれば、ユーザが閾値を連続値として設定した場合であっても、閾値に対応する各ジャンルの興味度の変換式を用いて、ジャンル毎にどれくらいの割合で録画されるかを推定することができる。図１２に示す機能ブロック図では、ジャンル興味度１２１と閾値１２２とを入力とし、お勧め度の逆算を行うアルゴリズムであるお勧め度逆算アルゴリズム１２３を用いてジャンル毎の推定録画割合１２４を提示することにより、設定された閾値の意味するところをユーザに対して提示することができる。 In FIG. 8, the method of setting the threshold values discretely has been described. Applying the description so far, even if the user sets the threshold value as a continuous value, the ratio is recorded for each genre using the interest level conversion formula for each genre corresponding to the threshold value. Can be estimated. In the functional block diagram shown in FIG. 12, the genre interest 121 and the threshold 122 are input, and the estimated recording ratio 124 for each genre is presented using the recommended degree reverse calculation algorithm 123 which is an algorithm for performing the reverse calculation of the recommended level. Thus, the meaning of the set threshold value can be presented to the user.

お勧め度を逆算するアルゴリズムのイメージを図１３に示す。図１３は、本発明の実施形態における情報処理装置においてコンテンツのジャンル毎の興味度の離散値と、視聴履歴に基づいて機械学習により作成した文字列に対する興味度と順位との関係を正規化したものと、を演算したときの正規化済み文字列に対する興味度とお勧め度との関係に対して設定された閾値から逆算するときの状態について説明する図である。 FIG. 13 shows an image of the algorithm for calculating the recommended degree. FIG. 13 normalizes the relationship between the discrete value of the degree of interest for each genre of content in the information processing apparatus according to the embodiment of the present invention and the degree of interest for the character string created by machine learning based on the viewing history. It is a figure explaining the state at the time of calculating backward from the threshold set with respect to the relationship between the degree of interest and the recommendation degree with respect to the normalized character string when the one is calculated.

コンテンツのジャンル毎の興味度と、文字列に対する興味度と順位との関係を正規化したものと、を演算したときに生成されたお勧め度に対して、ユーザによって連続値として設定された任意の閾値と、ジャンル興味度１から３の変換式との交点を求める。交点の正規化済み文字列に対する興味度が、それぞれ、０．４、０．７６、０．９６となったとする。その結果、ジャンル興味度３の番組は、全ジャンルの番組に対して６０％、ジャンル興味度２の番組は、全ジャンルの番組に対して２４％、ジャンル興味度１の番組は、全ジャンルの番組に対して４％録画されるものと推定される。 Arbitrary value set as a continuous value by the user for the recommendation level generated when computing the interest level for each genre of content and the normalized relationship between the level of interest and ranking for the character string And the intersection of the genre interest level 1 to 3 conversion formulas. Assume that the degrees of interest in the normalized character strings at the intersections are 0.4, 0.76, and 0.96, respectively. As a result, programs with a genre interest level of 3 are 60% for all genre programs, programs with a genre interest level of 2 are 24% for programs of all genres, and programs with a genre interest level of 1 are for all genres. It is estimated that 4% of the program is recorded.

また、上記実施形態では、ジャンル興味度が１、２、３といった離散値を取るという想定で説明した。ジャンル興味度については、実数値として表現することも可能である。ジャンル興味度毎の変換式について、パラメータを興味度に応じて変化する実数として表現すれば良い。 Moreover, in the said embodiment, it demonstrated on the assumption that a genre interest degree takes discrete values, such as 1, 2, and 3. The genre interest level can also be expressed as a real value. The conversion formula for each genre interest level may be expressed as a real number that changes according to the interest level.

具体的には、べき乗を用いる場合には、べき数を実数としてジャンル興味度から算出すれば良い。すなわち、ｙ＝ｘⁿを用いる場合には、ｎを興味度にしたがって加減すれば良い。要するに、ジャンル興味度３の場合にはｎ＝１／２、ジャンル興味度２の場合にはｎ＝１、ジャンル興味度１の場合にはｎ＝２のように、ｎを興味度に応じて連続値に設定すれば良い。 Specifically, when a power is used, the power may be calculated from the genre interest degree as a real number. That is, when y = x ⁿ is used, n may be adjusted according to the degree of interest. In short, n is 1/2 according to the genre interest degree 3, n = 1 when the genre interest degree 2 is n, and n = 2 when the genre interest degree 1 is n according to the interest degree. A continuous value may be set.

また、円弧を用いる場合には、中心点が上下どちらにあるかという点と、円弧の半径をジャンル興味度から算出するようにすれば良い。図１４に、本発明の実施形態における情報処理装置においてコンテンツのジャンル毎の興味度の実数値と、視聴履歴に基づいて機械学習により作成した文字列に対する興味度と順位との関係を正規化したものと、を演算したときの正規化済み文字列に対する興味度とお勧め度との関係を示す。図１４におけるＥ領域が、コンテンツのジャンル毎の興味度を実数とした場合の領域である。 Also, when using an arc, the point of whether the center point is above and below and the radius of the arc may be calculated from the genre interest. In FIG. 14, the relationship between the real value of the degree of interest for each genre of content and the degree of interest for the character string created by machine learning based on the viewing history is normalized in the information processing apparatus according to the embodiment of the present invention. The relationship between the degree of interest and the recommendation degree for the normalized character string when the one is calculated. The area E in FIG. 14 is an area when the degree of interest for each genre of content is a real number.

さらに、上記実施形態では、ジャンル興味度は、図５に示すようにユーザが明示的に設定するものとしていたが、異なる方式により学習するようにしても良い。例えば、自動的に学習するようにしても良い。図１４ではジャンル興味度を実数化できることを説明したが、ジャンル興味度として自動学習の値を用いても良い。特に、ユーザの意識と番組表の表現とがずれている場合に有効となる。さらに、お勧め度を録画目的以外の用途に用いることも可能である。 Further, in the above embodiment, the genre interest level is explicitly set by the user as shown in FIG. 5, but may be learned by a different method. For example, you may make it learn automatically. Although it has been explained in FIG. 14 that the genre interest level can be converted into a real number, an automatic learning value may be used as the genre interest level. This is particularly effective when the user's consciousness is different from the program guide expression. Furthermore, the recommendation level can be used for purposes other than recording purposes.

また、上記実施形態では、文字列（ワード、単語）に対する興味度推定値を正規化済み興味度として採用しているが、適当な判断手段を搭載することが可能であれば、画像データや音声データに対する興味度を、正規化済み興味度として採用することも可能である。さらに、各ジャンルに対する興味度をＷｅｂサーバから取得したりすることも可能である。 Further, in the above embodiment, the interest level estimated value for the character string (word, word) is adopted as the normalized interest level. However, if it is possible to mount an appropriate determination means, image data and audio The degree of interest in data can be adopted as the normalized degree of interest. Furthermore, it is possible to acquire the degree of interest for each genre from a Web server.

また、本実施形態では、テレビ番組を一例として説明しているが、同様な情報量が存在する他の分野に応用することが可能である。例えば、ＹｏｕＴｕｂｅ（登録商標）を一例とするＶＯＤ（Voice On Demand）やＷｅｂニュースといった、何らかのコンテンツがジャンル分けされており、ジャンル以外に興味度を取得することができ、ジャンル自体も別途興味度を評価できるものであれば、お勧め度を生成することが可能である。例えば、コンテンツに関するテキスト情報が一緒になっており、コンテンツに対する操作履歴に基づいて興味度又は好みを学習して行く。そして、テキスト情報の単語ベースでの学習に対して順位、得点を付与して行き、最終的に正規化されていれば良いのである。また、ジャンルに対する興味度が別途設定されており、ジャンルに対する興味度に対し、この正規化された重み付けを用いて変換すればお勧め度を生成することができる。 In the present embodiment, a television program is described as an example, but the present invention can be applied to other fields where a similar amount of information exists. For example, some contents such as VOD (Voice On Demand) and Web news, which are examples of YouTube (registered trademark), are classified into genres, and the degree of interest can be acquired in addition to the genre. If it can be evaluated, a recommendation degree can be generated. For example, text information related to the content is combined, and the degree of interest or preference is learned based on the operation history for the content. Then, the ranking and the score are assigned to the word-based learning of the text information, and it is only necessary to be normalized in the end. In addition, the degree of interest in the genre is set separately, and the degree of recommendation can be generated by converting the degree of interest in the genre using this normalized weight.

以上説明したように、本発明によれば、ジャンルに対する興味度と、文字列に対する興味度推定値の双方を合成したお勧めを行うことができる。そして、興味度が低いジャンルであっても、文字列に対する興味度推定値が特に高い場合には、お勧め度が高くなる。また、閾値を設定することにより、各ジャンルの録画割合の推定値という形で、提示することができる。 As described above, according to the present invention, it is possible to make a recommendation that combines both an interest level for a genre and an estimated interest level for a character string. Even if the genre has a low interest level, the recommendation level is high when the interest level estimation value for the character string is particularly high. Also, by setting a threshold value, it can be presented in the form of an estimated value of the recording ratio of each genre.

なお、図９に示した本発明の実施形態における情報処理装置（ＰＣ）２００を構成する各機能ブロックの各動作は、コンピュータ上のプログラムに実行させることもできる。すなわち、情報処理装置２００のＣＰＵ２０６が、ＲＯＭ２０２、ＲＡＭ２０３等から構成される記憶部に格納されたプログラムをロードし、プログラムの各処理ステップが順次実行されることによって行われる。 In addition, each operation | movement of each functional block which comprises the information processing apparatus (PC) 200 in embodiment of this invention shown in FIG. 9 can also be made to perform the program on a computer. That is, the processing is performed by the CPU 206 of the information processing apparatus 200 loading a program stored in a storage unit configured by the ROM 202, the RAM 203, and the like, and sequentially executing each processing step of the program.

以上説明してきたように、本発明によれば、コンテンツのジャンル毎に予め設定された設定興味度と、コンテンツの視聴により推定された推定興味度とを合成する合成手段と、合成により、推定興味度に対する設定興味度のジャンルに属するコンテンツのお勧め度を生成する生成手段と、任意の設定興味度を有するジャンルに属するコンテンツの、全ジャンルのコンテンツに占めるお勧め割合を決定するための閾値を、生成されたお勧め度に対して設定する設定手段と、を含むことにより、ユーザの嗜好にフレキシブルに対応したコンテンツを推薦することができるのである。 As described above, according to the present invention, the setting interest degree set in advance for each content genre and the estimated interest degree estimated by viewing the content are combined, and the combined interest is estimated by combining. Generating means for generating a recommendation degree of content belonging to the genre of the set interest degree with respect to the degree, and a threshold value for determining a recommended ratio of the content belonging to the genre having an arbitrary set interest degree in the content of all genres By including setting means for setting the generated recommendation level, it is possible to recommend content that flexibly corresponds to the user's preference.

以上、本発明の好適な実施の形態により本発明を説明した。ここでは特定の具体例を示して本発明を説明したが、特許請求の範囲に定義された本発明の広範囲な趣旨及び範囲から逸脱することなく、これら具体例に様々な修正及び変更が可能である。 The present invention has been described above by the preferred embodiments of the present invention. While the invention has been described with reference to specific embodiments thereof, various modifications and changes can be made to these embodiments without departing from the broader spirit and scope of the invention as defined in the claims. is there.

１０情報処理システム
１００放送局
１２１、８０２ジャンル興味度
１２２、８０３閾値
１２３お勧め度逆算アルゴリズム
１２４ジャンル毎の推定録画割合
２００情報処理装置（ＰＣ）
２０１ＴＶチューナ部
２０２、４０１ＲＯＭ
２０３ＲＡＭ
２０４ＨＤＤ
２０５、４０３ネットワーク接続部
２０６、４０４ＣＰＵ
２０７、４０５表示部
２０８入力部
２０９、４０７電源部
２１０、２１１、２１２、２１３チューナ部
２１４コンテンツ記録再生部
２１５動画表示処理部
２１６映像解析処理部
２１７ＥＰＧ情報管理部
２１８興味情報取得部
２１９動画情報取得記録部
２２０ネットワーク接続処理部
２２１ディスプレイ
２２２リモコン
３００ネットワーク
４００Ｗｅｂサーバ
４０２データベース部
４０６操作部
５０１番組ジャンル
５０２興味度
８０１１週間分のＥＰＧ
８０４お勧め度計算アルゴリズム
８０５文字列による興味度推定アルゴリズム
８０６閾値以上のお勧め度付き番組リスト DESCRIPTION OF SYMBOLS 10 Information processing system 100 Broadcasting station 121,802 Genre interest degree 122,803 Threshold value 123 Recommendation degree reverse calculation algorithm 124 Estimated recording ratio for every genre 200 Information processing apparatus (PC)
201 TV tuner 202, 401 ROM
203 RAM
204 HDD
205, 403 Network connection unit 206, 404 CPU
207, 405 Display unit 208 Input unit 209, 407 Power source unit 210, 211, 212, 213 Tuner unit 214 Content recording / playback unit 215 Video display processing unit 216 Video analysis processing unit 217 EPG information management unit 218 Interest information acquisition unit 219 Video information Acquisition recording unit 220 Network connection processing unit 221 Display 222 Remote control 300 Network 400 Web server 402 Database unit 406 Operation unit 501 Program genre 502 Interest 801 EPG for one week
804 Recommended degree calculation algorithm 805 Interest level estimation algorithm based on character string 806 Program list with recommended degree above threshold

Claims

A genre interest level setting means for setting a genre interest level in advance for each genre, which is a classification based on the content of the content;
Character string interest degree estimating means for estimating the user's interest degree with respect to the character string obtained by analyzing the user's viewing history;
The genre interest with respect to the recommendation degree obtained by converting the interest degree of the user estimated by the character string interest degree estimation means using the genre interest degree set by the genre interest degree setting means. A threshold setting means for setting a threshold for determining the proportion of recommended content for all content;
An information processing apparatus comprising:

The information processing apparatus according to claim 1, wherein the degree of interest of the user with respect to the character string is an interest degree with respect to a character string created by machine learning based on a viewing history reflecting the user's preference. .

The degree of interest of the user for the character string is estimated by normalizing at least a non-linear region of the relationship between the degree of interest for the character string and the rank of the character string. 2. The information processing apparatus according to 2.

Based on the threshold set by the threshold setting means, the recommended ratio of the content belonging to the genre for which the predetermined genre interest is set to the content of all genres is estimated with respect to the obtained recommendation level. The information processing apparatus according to claim 1, further comprising means.

A genre interest level setting step for setting a genre interest level in advance for each genre, which is a classification based on the content of the content;
A character string interest degree estimating step for estimating the user's interest degree with respect to the character string obtained by analyzing the user's viewing history;
The genre interest with respect to the recommendation degree obtained by converting the interest degree of the user estimated in the character string interest degree estimation step using the genre interest degree set in the genre interest degree setting step. And a threshold value setting step for setting a threshold value for determining at what ratio the set content is recommended for all contents.

In the computer of the information processing device,
A genre interest level setting process for setting a genre interest level for each genre, which is a classification based on the content,
A character string interest estimation process for estimating the user's interest in the character string obtained by analyzing the user's viewing history;
The genre interest with respect to the recommendation degree obtained by converting the interest degree of the user estimated by the character string interest degree estimation process using the genre interest degree set by the genre interest degree setting process. A threshold setting process for setting a threshold value for determining what ratio is recommended for all contents,
A program for running