JP2007058562A

JP2007058562A - Content classification device, content classification method, content classification program and recording medium

Info

Publication number: JP2007058562A
Application number: JP2005242965A
Authority: JP
Inventors: Keiko Hirukawa; 慶子蛭川; Akira Tsuruta; 彰鶴田
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2005-08-24
Filing date: 2005-08-24
Publication date: 2007-03-08

Abstract

<P>PROBLEM TO BE SOLVED: To classify content into groups without imposing any labor to classify content on a user. <P>SOLUTION: When meta information showing the contents of content is at least partially matched with the other meta information showing the contents of the other content which is different from the content, a content relevancy judging part 12 judges that the content and the other content are relevant to each other. A content classifying part 13 classifies the content and the other content whose relevancy is judged by the content relevancy judging means into the same group. Therefore, it is possible to classify the content into groups without imposing any labor to classify the content on a user. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、コンテンツを分類する手間をユーザに与えることなく、コンテンツをグループに自動的に分類できるコンテンツ分類装置、コンテンツ分類方法、コンテンツ分類プログラムおよび記録媒体に関する。 The present invention relates to a content classification device, a content classification method, a content classification program, and a recording medium that can automatically classify content into groups without giving users the trouble of classifying content.

放送されている番組を受信できる装置として、テレビ、ハードディスクレコーダ、およびパーソナルコンピュータが知られている。これらの装置は、受信した番組を、ハードディスクなどの所定の記録媒体に保存することによって、番組を録画する。以前の装置は、番組を録画する場合に、各番組のコンテンツを、それぞれ所定のファイルとして、ルートディレクトリに順次配列するだけであった。そのため、ユーザは、多くのコンテンツの中から所望のコンテンツを迅速に検索することができなかった。 Televisions, hard disk recorders, and personal computers are known as devices that can receive broadcast programs. These apparatuses record the program by storing the received program in a predetermined recording medium such as a hard disk. In the previous device, when recording a program, the contents of each program were simply arranged in the root directory sequentially as a predetermined file. Therefore, the user has not been able to quickly search for desired content from among many contents.

この課題を解決する技術を開示する文献として、特許文献１がある。この文献には、提供される情報を取得する取得手段と、前記取得手段により取得された情報の中から、記録する情報を選択する選択手段と、前記選択手段により選択された情報を記録する第１の記録手段と、前記選択手段により選択された情報の分類を決定する決定手段と、前記決定手段により決定された分類を記録する第２の記録手段とを含むことを特徴とする情報処理装置が開示されている。 There is Patent Document 1 as a document disclosing a technique for solving this problem. This document includes an acquisition unit that acquires provided information, a selection unit that selects information to be recorded from the information acquired by the acquisition unit, and a first unit that records information selected by the selection unit. An information processing apparatus comprising: one recording unit; a determination unit that determines a classification of the information selected by the selection unit; and a second recording unit that records the classification determined by the determination unit Is disclosed.

特許文献１の装置では、録画予約時に、放送番組とともに番組情報を取得する。取得した番組情報から番組属性を抽出し、この番組属性の中から所定の文字列をユーザに選択させる。選択された文字列をディレクトリ名としてディレクトリを作成し、番組をディレクトリ下に記録する。これにより、番組を分類する。さらに、選択された文字列のディレクトリが既に存在する場合は、既存のディレクトリの下に番組を記録する。 In the apparatus of Patent Document 1, program information is acquired together with a broadcast program at the time of recording reservation. A program attribute is extracted from the acquired program information, and a user selects a predetermined character string from the program attribute. A directory is created using the selected character string as a directory name, and a program is recorded under the directory. Thereby, the program is classified. Further, when the directory of the selected character string already exists, the program is recorded under the existing directory.

類似する技術は、特許文献２にも開示されている。
特開２０００−２８５６３８号公報（２０００年１０月１３日公開）特開２０００−２８５６４７号公報（２０００年１０月１３日公開） A similar technique is also disclosed in Patent Document 2.
JP 2000-285638 A (released on October 13, 2000) JP 2000-285647 A (released on October 13, 2000)

しかし、特許文献１および２に開示されている技術では、番組ごとに、ユーザにディレクトリ名とする文字列を選択させる手間がかかる。ディレクトリ名になり得る文字列として、番組属性の中に、ジャンル、曜日、タイトル、出演者、およびキーワードなど多くの選択肢がある。そのため、ユーザは、番組を予約するたびに、その中から選択しなければならない。 However, the techniques disclosed in Patent Documents 1 and 2 require time and effort for the user to select a character string as a directory name for each program. As a character string that can be a directory name, there are many options such as a genre, a day of the week, a title, a performer, and a keyword among program attributes. Therefore, every time a user reserves a program, the user has to select from among them.

また、ユーザは、適当な分類状況になるように、予約の段階でディレクトリ構成を考えておかなければならない。適当な分類状況とは、コンテンツが適度に分散している状況をいう。すなわち、ある一つのディレクトリにコンテンツが偏ったり、もしくはディレクトリの数がコンテンツの数に近くなりディレクトリの意味をなさなかったりすることがない状況である。また、番組を検索する際に、ディレクトリの中身が容易に推測できるようなディレクトリ名にしておかなければならない。コンテンツが少ないうちには適当であるディレクトリ構成も、コンテンツが増減するにしたがって、不適当になりうる。１つのディレクトリに、コンテンツが偏る可能性があるからである。したがって、ユーザは、番組を検索しやすくするために、大きな負担を要して分類構成を最適化する必要が生ずる。 In addition, the user must consider the directory structure at the reservation stage so as to obtain an appropriate classification situation. Appropriate classification status refers to a situation where content is moderately dispersed. That is, there is no situation where content is biased to a certain directory, or the number of directories is close to the number of content and does not make sense for the directory. Also, when searching for a program, the directory name must be such that the contents of the directory can be easily guessed. A directory structure that is appropriate while content is small can become inappropriate as content increases or decreases. This is because the content may be biased in one directory. Therefore, the user needs to optimize the classification configuration with a heavy burden in order to make it easy to search for programs.

さらに、ある特定のコンテンツは、ルートディレクトリに格納しておいた方が検索しやすい。たとえば、すぐに消去する予定のコンテンツや、録画したばかりで日付の新しいコンテンツや、ディレクトリを生成しても同じディレクトリ内のコンテンツ数が少ないコンテンツなどがそうである。したがって、全てのコンテンツを同じディレクトリに分類してしまうと、ユーザは検索時に必ず一度ディレクトリを選択しなければならず、使い勝手がよくない。 Furthermore, it is easier to search for certain specific contents stored in the root directory. For example, content that is scheduled to be erased immediately, content that has just been recorded and that has a new date, and content that has a small number of contents in the same directory even if a directory is created. Therefore, if all contents are classified into the same directory, the user must select the directory once at the time of search, which is not convenient.

タイトルなどの番組情報に基づいた場合、コンテンツを正しく分類できないこともある。これは、タイトルが省略されていたり、放送回数の表記やサブタイトルの表記に揺らぎが生ずる可能性があるからである。選択された文字列が、既存のディレクトリの中に存在すれば、新たにディレクトリは作成せず既存のディレクトリの下に記録する。しかし、逆に、存在しなければ、新たに選択された文字列のディレクトリを作成する。 When based on program information such as a title, the content may not be correctly classified. This is because the title may be omitted, or there may be fluctuations in the number of broadcasts and subtitles. If the selected character string exists in an existing directory, a new directory is not created and is recorded under the existing directory. However, conversely, if it does not exist, a directory of the newly selected character string is created.

たとえば、番組属性の中でタイトルが選択され、ディレクトリ名がタイトルであるディレクトリが作成されたとする。ここでユーザは、同じ番組を同じディレクトリに記録したいとする。次に、別の日に同じ番組を予約する際に、ユーザは、予約時に同じように番組属性の中からタイトルを選択する。このとき、前回予約したときと、今回予約したときのタイトルが同表記であればよい。しかし、番組情報のデータには表記のゆらぎがある場合がある。そのため、この場合は、既存のディレクトリとの対応がとれず、新たに別のディレクトリが作成されてしまう。したがって、番組が、ユーザの思惑通りに分類されない可能性がある。 For example, assume that a title is selected in the program attributes and a directory whose directory name is the title is created. Here, it is assumed that the user wants to record the same program in the same directory. Next, when reserving the same program on another day, the user selects a title from the program attributes in the same way at the time of reservation. At this time, the titles of the previous reservation and the current reservation may be the same notation. However, the program information data may have fluctuations in the notation. Therefore, in this case, the correspondence with the existing directory cannot be taken, and another new directory is created. Therefore, the program may not be classified according to the user's expectation.

本発明は上記の課題を解決するためになされたものであり、その目的は、コンテンツを分類する手間をユーザに与えることなく、コンテンツをグループに自動的に分類できるコンテンツ分類装置、コンテンツ分類方法、コンテンツ分類プログラムおよび記録媒体を提供することにある。 The present invention has been made to solve the above-described problems, and its purpose is to provide a content classification device, a content classification method, and a content classification method capable of automatically classifying content into groups without giving users the trouble of classifying content. To provide a content classification program and a recording medium.

本発明に係るコンテンツ分類装置は、上記の課題を解決するために、
コンテンツの内容を表す付加情報と、このコンテンツと異なる他のコンテンツの内容を表す他の付加情報とが少なくとも部分的に一致するとき、上記コンテンツと上記他のコンテンツとが互いに関連していると判定するコンテンツ関連性判定手段と、
上記コンテンツ関連性判定手段によって互いに関連していると判定される上記コンテンツおよび上記他のコンテンツを、同一のグループに分類するコンテンツ分類手段とを備えていることを特徴としている。 In order to solve the above problems, the content classification device according to the present invention provides:
It is determined that the content and the other content are related to each other when the additional information representing the content and the other additional information representing the content of other content different from the content are at least partially matched. Content relevance determining means for
Content classification means for classifying the content determined to be related to each other by the content relevance determination means and the other content into the same group is provided.

上記の構成によれば、コンテンツ分類装置では、コンテンツ関連性判定手段が、コンテンツの内容を表す付加情報と、他のコンテンツの内容を表す他の付加情報とを比較する。ここでいう「コンテンツ」とは、たとえば、放送されている所定の番組である。また、「付加情報」とは、番組の内容を表す、たとえば電子番組表である。 According to the above configuration, in the content classification device, the content relevance determination unit compares the additional information indicating the content details with other additional information indicating the details of other content. The “content” here is, for example, a predetermined program being broadcast. The “additional information” is, for example, an electronic program guide that represents the contents of a program.

コンテンツ分類装置は、付加情報を内部に含んでいるコンテンツを取得することもできるし、コンテンツに付加されている付加情報を取得することもできるし、コンテンツと付加情報とを別々に取得することもできる。いずれにせよ、コンテンツ分類装置は、何らかの手段によって取得した付加情報と、他の付加情報とを比較する。このとき、付加情報および他の付加情報とが、少なくとも部分的に一致するなら、コンテンツ関連性判定手段は、コンテンツと他のコンテンツとを、互いに関連していると判定する。 The content classification device can acquire content that includes additional information therein, can acquire additional information added to the content, or can acquire content and additional information separately. it can. In any case, the content classification device compares the additional information acquired by some means with other additional information. At this time, if the additional information and the other additional information match at least partially, the content relevance determining unit determines that the content and the other content are related to each other.

コンテンツ分類装置では、さらに、コンテンツ関連性判定手段によって互いに関連していると判定されたコンテンツおよび他のコンテンツを、コンテンツ分類手段が、同一のグループに分類する。たとえば、それぞれの電子番組表に、同じ出演者に関する情報が含まれていると判定されるとき、コンテンツ分類手段は、２つの番組を、その出演者に関するグループに分類する。 In the content classification device, the content classification unit further classifies the content determined to be related to each other by the content relevance determination unit and the other content into the same group. For example, when it is determined that the information related to the same performer is included in each electronic program guide, the content classification unit classifies the two programs into groups related to the performer.

このように、コンテンツ分類装置は、互いに関連していると判定する複数のコンテンツを、共通するグループに自動的に分類する。したがって、コンテンツ分類装置は、コンテンツをユーザに分類する手間を与えることなく、コンテンツをグループに自動的に分類できる効果を奏する。 In this way, the content classification device automatically classifies a plurality of contents determined to be related to each other into a common group. Therefore, the content classification device has an effect of automatically classifying content into groups without giving time and effort to classify content to users.

本発明に係るコンテンツ分類方法は、上記の課題を解決するために、
コンテンツの内容を表す付加情報と、このコンテンツと異なる他のコンテンツの内容を表す他の付加情報とが少なくとも部分的に一致するとき、上記コンテンツと上記他のコンテンツとが互いに関連していると判定するコンテンツ関連性判定ステップと、
上記コンテンツと上記他のコンテンツとが互いに関連していると判定されると、上記コンテンツと上記他のコンテンツとを同一のグループに分類するコンテンツ分類ステップとを含んでいることを特徴としている。 In order to solve the above problems, the content classification method according to the present invention provides:
It is determined that the content and the other content are related to each other when the additional information representing the content and the other additional information representing the content of other content different from the content are at least partially matched. A content relevance determining step,
When it is determined that the content and the other content are related to each other, a content classification step of classifying the content and the other content into the same group is included.

上記の構成によれば、本発明に係るコンテンツ分類装置と同様の効果を奏する。 According to said structure, there exists an effect similar to the content classification device based on this invention.

また、本発明に係るコンテンツ分類装置では、さらに、
上記付加情報は、上記コンテンツの内容を表すキーワードを含んでおり、
上記他の付加情報は、上記他のコンテンツの内容を表す他のキーワードを含んでおり、
上記コンテンツ関連性判定手段は、
上記キーワードと上記他のキーワードとが少なくとも部分的に一致すると判定するとき、上記コンテンツと上記他のコンテンツとが互いに関連していると判定することが好ましい。 In the content classification device according to the present invention,
The additional information includes a keyword representing the content content,
The other additional information includes other keywords representing the contents of the other content,
The content relevance determining means is
When it is determined that the keyword and the other keyword match at least partially, it is preferable to determine that the content and the other content are related to each other.

上記の構成によれば、コンテンツの内容を表す付加情報は、所定のキーワードを含んでいる。このキーワードは、コンテンツの内容を表している。付加情報は、たとえばキーワードとして、コンテンツのタイトルを表すテキストを含んでいる。さらに、他のコンテンツの内容を表す他の付加情報も、所定の他のキーワードを含んでいる。この他のキーワードは、他のコンテンツの内容を表している。他の付加情報は、たとえば他のキーワードとして、他のコンテンツのジャンルを表すテキストを含んでいる。 According to said structure, the additional information showing the content content contains the predetermined keyword. This keyword represents the content. The additional information includes text representing the title of the content, for example, as a keyword. Furthermore, other additional information representing the contents of other contents also includes other predetermined keywords. The other keywords represent the contents of other contents. Other additional information includes text representing the genre of other content as another keyword, for example.

ここで、コンテンツ関連性判定手段は、キーワードと他のキーワードとが、少なくとも部分的に一致するとき、コンテンツおよび他のコンテンツが、互いに関連していると判定する。これにより、コンテンツ分類手段は、キーワード同士が互いにマッチングしているコンテンツおよび他のコンテンツを、共通するグループに分類する。 Here, the content relevance determining unit determines that the content and the other content are related to each other when the keyword and the other keyword are at least partially matched. As a result, the content classification means classifies the content in which the keywords match each other and other content into a common group.

したがって、付加情報全体を判定しなくても、キーワードのみを判定することによって、コンテンツと他のコンテンツとが関連するか否かを判定できる。このように、コンテンツ分類装置は、コンテンツをグループに分類するときの処理速度を、より速めることができる効果を奏する。 Therefore, it is possible to determine whether the content and other content are related by determining only the keyword without determining the entire additional information. As described above, the content classification device has an effect of further increasing the processing speed when the content is classified into groups.

また、本発明に係るコンテンツ分類装置では、さらに、
上記コンテンツ関連性判定手段は、
上記キーワードと上記他のキーワードとの距離が、あらかじめ定められた閾値より小さいとき、上記コンテンツと上記他のコンテンツとが互いに関連していると判定することが好ましい。 In the content classification device according to the present invention,
The content relevance determining means is
When the distance between the keyword and the other keyword is smaller than a predetermined threshold, it is preferable to determine that the content and the other content are related to each other.

上記の構成によれば、コンテンツ関連性判定手段は、２つのキーワードの間の距離を算出する。このとき、たとえば、ＤＰマッチングによって、この距離を算出する。コンテンツ関連性判定手段は、算出した距離が、あらかじめ定められた閾値よりも小さいとき、コンテンツと他のコンテンツとが互いに関連していると判定する。これにより、コンテンツ分類手段は、キーワード間の距離が閾値よりも小さい複数のコンテンツを、共通するグループに分類する。したがって、コンテンツ分類装置は、より客観性の高い指標に基づき、コンテンツをグループに分類できる効果を奏する。 According to said structure, a content relevance determination means calculates the distance between two keywords. At this time, for example, this distance is calculated by DP matching. The content relevance determining unit determines that the content and the other content are related to each other when the calculated distance is smaller than a predetermined threshold. Thereby, the content classification means classifies a plurality of contents whose distance between keywords is smaller than a threshold value into a common group. Therefore, the content classification device has an effect of classifying content into groups based on a more objective index.

また、本発明に係るコンテンツ分類装置では、さらに、
上記同一のグループの名前は、上記キーワードおよび上記他のキーワードに共通している文字列であることが好ましい。 In the content classification device according to the present invention,
The name of the same group is preferably a character string common to the keyword and the other keyword.

上記の構成によれば、コンテンツ分類手段は、キーワードおよび他のキーワードに共通している文字列が名前に設定されているグループに、コンテンツおよび他のコンテンツを分類する。これにより、コンテンツとの関連が深いグループに、コンテンツを分類できる効果を奏する。 According to said structure, a content classification | category means classify | categorizes a content and another content into the group by which the character string common to a keyword and another keyword is set to the name. Thereby, there exists an effect which can classify | categorize a content into the group with a close relationship with a content.

また、本発明に係るコンテンツ分類装置では、さらに、
上記キーワードが属するカテゴリは、上記他のキーワードが属するカテゴリと同一であることが好ましい。 In the content classification device according to the present invention,
The category to which the keyword belongs is preferably the same as the category to which the other keyword belongs.

上記の構成によれば、コンテンツ関連性判定手段は、同一のカテゴリに属するキーワードおよび他のキーワードとが、少なくとも部分的に互いに一致するとき、コンテンツおよび他のコンテンツが互いに関連していると判定する。これにより、コンテンツ分類手段は、同一のカテゴリに属するキーワードがマッチングするコンテンツを、共通するグループに分類できる。したがって、コンテンツ分類装置は、コンテンツをグループに分類するときの精度を、より高くできる効果を奏する。 According to the above configuration, the content relevance determining unit determines that the content and the other content are related to each other when the keyword and the other keyword belonging to the same category at least partially match each other. . As a result, the content classification means can classify content matching keywords belonging to the same category into a common group. Therefore, the content classification device has an effect that the accuracy when the content is classified into groups can be further increased.

また、本発明に係るコンテンツ分類装置では、さらに、
上記コンテンツ関連性判定手段は、さらに、上記付加情報に含まれる、上記カテゴリと異なる他のカテゴリに属するキーワードと、上記他の付加情報に含まれる、上記他のカテゴリに属する他のキーワードとが、少なくとも部分的に一致するとき、上記コンテンツと上記他のコンテンツとが互いに関連していると判定することが好ましい。 In the content classification device according to the present invention,
The content relevance determination means further includes a keyword belonging to another category different from the category included in the additional information and another keyword belonging to the other category included in the other additional information. When at least partially matching, it is preferable to determine that the content and the other content are related to each other.

上記の構成によれば、あるカテゴリに属するキーワードによってマッチングする２つのコンテンツに関して、さらに、このカテゴリと異なる他のカテゴリに属するキーワードによって、再びマッチングするか否かを判定する。すなわち、異なるカテゴリに属する２種類のキーワードが互いにマッチングしているとき、コンテンツ関連性判定手段は、２つのコンテンツを互いに関連していると判定する。これによりコンテンツ分類手段は、これら２つのコンテンツを、共通するグループに分類する。すなわち、分類するコンテンツを絞りこむ。 According to the above configuration, regarding two contents matching by a keyword belonging to a certain category, it is further determined whether or not to match again by a keyword belonging to another category different from this category. That is, when two types of keywords belonging to different categories match each other, the content relevance determination unit determines that the two contents are related to each other. Thereby, the content classification means classifies these two contents into a common group. That is, the contents to be classified are narrowed down.

このように、コンテンツ分類装置は、特定のグループに過剰なコンテンツが分類されることを防止できる効果を奏する。 In this way, the content classification device has an effect of preventing excessive content from being classified into a specific group.

また、本発明に係るコンテンツ分類装置では、さらに、
上記キーワードは、より高い優先度が設定されているキーワードであることが好ましい。 In the content classification device according to the present invention,
The keyword is preferably a keyword for which a higher priority is set.

上記の構成によれば、コンテンツ関連性判定手段は、より高い優先度が設定されているキーワードを、コンテンツのマッチングに使用する。優先度が高いということは、他のキーワードよりも優先して、コンテンツのマッチングに使用されることが好ましいことを意味する。これにより、コンテンツ分類装置は、コンテンツを、コンテンツが分類されるべきより適切なグループに分類できる効果を奏する。 According to said structure, a content relevance determination means uses the keyword by which the higher priority was set for the content matching. A high priority means that it is preferably used for content matching in preference to other keywords. Thereby, the content classification device has an effect that the content can be classified into a more appropriate group into which the content should be classified.

また、本発明に係るコンテンツ分類装置では、さらに、
ユーザによる上記キーワードの選択を受け付るキーワード選択手段と、
ユーザによって選択された上記キーワードに、より高い優先度を設定する優先度設定手段とをさらに備えていることが好ましい。 In the content classification device according to the present invention,
Keyword selection means for accepting selection of the keyword by the user;
It is preferable to further include priority setting means for setting a higher priority for the keyword selected by the user.

上記の構成によれば、優先度設定手段は、ユーザが選択したキーワードに、ユーザが選択しなかったキーワードより高い優先度を設定する。ここで、ユーザが選択するキーワードは、ユーザが嗜好している可能性が高いキーワードである。これにより、コンテンツ関連性判定手段は、ユーザが嗜好している可能性が高いキーワードがマッチングする２つのコンテンツを、互いに関連していると判定する。したがって、コンテンツ分類装置は、コンテンツをグループに分類する分類構成を、ユーザが求めるものにより近づけることができる効果を奏する。 According to said structure, a priority setting means sets a higher priority to the keyword which the user selected than the keyword which the user did not select. Here, the keyword selected by the user is a keyword that is highly likely to be liked by the user. As a result, the content relevance determining unit determines that the two contents that match the keywords that are likely to be liked by the user are related to each other. Therefore, the content classification device has an effect that the classification configuration for classifying content into groups can be made closer to what the user wants.

また、本発明に係るコンテンツ分類装置では、さらに、
ユーザの選択した上記グループの名前を受け付るグループ名選択手段と、
上記グループ名選択手段が受け付た上記グループの名前を含んでいる上記キーワードに、より高い優先度を設定する優先度設定手段さらに備えていることが好ましい。 In the content classification device according to the present invention,
A group name selection means for receiving the name of the group selected by the user;
It is preferable to further include priority setting means for setting a higher priority for the keyword including the name of the group received by the group name selection means.

上記の構成によれば、優先度設定手段は、ユーザが選択したグループの名前を含んでいるキーワードに、より高い優先度を設定する。ここで、ユーザが選択する、グループの名前は、ユーザが嗜好している可能性が高い。これにより、コンテンツ関連性判定手段は、ユーザが嗜好している可能性が高い文字列を含んでいるキーワードによってマッチングされる２つのコンテンツを、同一のグループに分類する。したがって、コンテンツ分類装置は、コンテンツをグループに分類する分類構成を、ユーザが求めるものにより近づけることができる効果を奏する。 According to said structure, a priority setting means sets a higher priority to the keyword containing the name of the group which the user selected. Here, the name of the group selected by the user is likely to be preferred by the user. As a result, the content relevance determination unit classifies the two contents matched by the keyword including the character string that the user is likely to like into the same group. Therefore, the content classification device has an effect that the classification configuration for classifying content into groups can be made closer to what the user wants.

また、本発明に係るコンテンツ分類装置では、さらに、
ユーザに関する情報の入力を受け付るユーザ情報入力手段と、
上記ユーザ情報入力手段によって受け付けられた上記ユーザに関する情報に含まれている文字列を含んでいるキーワードに、より高い優先度を設定する優先度設定手段とをさらに備えていることが好ましい。 In the content classification device according to the present invention,
User information input means for receiving input of information about the user;
Preferably, the apparatus further includes priority setting means for setting a higher priority to a keyword including a character string included in the information related to the user received by the user information input means.

上記の構成によれば、優先度設定手段は、ユーザに関する情報に含まれている文字列を含んでいるキーワードに、より高い優先度を設定する。したがって、コンテンツ分類手段、ユーザに関する情報に関連が深いキーワードによってマッチングする２つのコンテンツを、同一のグループに分類する。 According to said structure, a priority setting means sets a higher priority to the keyword containing the character string contained in the information regarding a user. Therefore, the content classification means classifies two contents that match with keywords that are closely related to information about the user into the same group.

したがって、コンテンツ分類装置は、コンテンツをグループに分類する分類構成を、ユーザが求めるものにより近づけることができる。さらに、このような処理を、ユーザが嗜好しているキーワードまたはグループの名前を入力することなく、実現できる。 Therefore, the content classification device can make the classification configuration for classifying content into groups closer to what the user wants. Further, such processing can be realized without inputting the keyword or group name that the user likes.

また、本発明に係るコンテンツ分類装置では、さらに、
上記コンテンツ分類手段は、
より高い優先度が設定されているキーワードおよび他のキーワードに共通する文字列を、より上位の階層に位置する上記グループの名前に設定することが好ましい。 In the content classification device according to the present invention,
The content classification means
It is preferable to set a keyword having a higher priority and a character string common to other keywords to the name of the group located in a higher hierarchy.

上記の構成によれば、より高い優先度が設定されているキーワードおよび他のキーワードに共通する文字列を、コンテンツおよび他のコンテンツを分類するグループの名前に設定する。さらに、このグループを、より上位の階層に位置させる。したがって、ユーザにより関連が深いグループを、より上位に位置させるため、ユーザがコンテンツを検索しやすいグループ構成にすることができる効果を奏する。 According to said structure, the character string which is common to the keyword in which a higher priority is set, and another keyword is set to the name of the group which classifies content and other content. Further, this group is positioned in a higher hierarchy. Therefore, since the group more closely related to the user is positioned higher, it is possible to achieve a group configuration in which the user can easily search for content.

また、本発明に係るコンテンツ分類装置では、さらに、
階層構造のグループがあらかじめ存在するとき、
より上位の階層に位置するグループの名前を含んでいるキーワードが属するカテゴリと同一のカテゴリに属するキーワードに、より上位の優先度を設定する優先度設定手段をさらに備えていることが好ましい。 In the content classification device according to the present invention,
When a hierarchical group exists in advance,
It is preferable to further include priority setting means for setting a higher priority for a keyword belonging to the same category as the category to which the keyword including the name of the group located in a higher hierarchy belongs.

上記の構成によれば、優先度設定手段は、既存のグループの階層構造に基づき、この階層においてより上位に位置するグループの名前を含んでいるキーワードが属するカテゴリと同一のカテゴリに属するキーワードに、より高い優先度を設定する。したがって、コンテンツ分類装置は、コンテンツおよび他のコンテンツを分類するグループの階層構造を、既存のグループの階層構造に類似させることができる。これにより、グループの階層構造に統一感が生じるため、グループに分類されているコンテンツを検索しやすいグループ構成とすることができる効果を奏する。 According to the above configuration, the priority setting means, based on the hierarchical structure of the existing group, assigns the keyword belonging to the same category to the category to which the keyword including the name of the group located higher in this hierarchy belongs Set a higher priority. Therefore, the content classification device can resemble the hierarchical structure of groups that classify content and other content with the hierarchical structure of existing groups. Thereby, since a sense of unity occurs in the hierarchical structure of the group, there is an effect that it is possible to make a group configuration that makes it easy to search for contents classified into groups.

また、本発明に係るコンテンツ分類装置では、さらに、
ユーザによって指示された上記グループを削除するグループ削除手段をさらに備えていることが好ましい。 In the content classification device according to the present invention,
It is preferable to further include group deletion means for deleting the group designated by the user.

上記の構成によれば、コンテンツ分類装置は、ユーザが不要と感じるグループを削除できる。これにより、管理しているグループの構成を最適化できる効果を奏する。 According to said structure, the content classification device can delete the group which a user feels unnecessary. Thereby, there exists an effect which can optimize the structure of the group currently managed.

また、本発明に係るコンテンツ分類装置では、さらに、
ユーザによって指示された上記グループを削除するグループ削除手段をさらに備え、
上記グループ削除手段によって削除されたグループの名前を含んでいるキーワードの優先度を、より下位に設定する優先度設定手段とを備えていることが好ましい。 In the content classification device according to the present invention,
A group deleting means for deleting the group instructed by the user;
It is preferable to include priority setting means for setting the priority of the keyword including the name of the group deleted by the group deleting means to a lower level.

上記の構成によれば、コンテンツ分類装置は、ユーザが不要と感じるグループを削除できる。したがって、まず、管理しているグループの構成を最適化できる。コンテンツ分類装置では、さらに、優先度設定手段が、削除されたグループの名前を含んでいるキーワードの優先度を、より下位に設定する。 According to said structure, the content classification device can delete the group which a user feels unnecessary. Therefore, first, the configuration of the managed group can be optimized. In the content classification device, the priority setting unit further sets the priority of the keyword including the name of the deleted group to a lower level.

ここで、ユーザが削除を要求するグループは、ユーザが不要と感じるグループである。すなわち、このグループの名前は、ユーザにとって、特に重要ではない可能性が高い。このとき、コンテンツ分類装置は、ユーザが特に重要と感じないキーワードの優先度を、より低く設定する。したがって、コンテンツ分類手段は、コンテンツおよび他のコンテンツが互いに関連しているかを判定するとき、このキーワードを使用しにくくなる。 Here, the group that the user requests to delete is a group that the user feels unnecessary. That is, the name of this group is likely not particularly important to the user. At this time, the content classification apparatus sets a lower priority for keywords that the user does not feel particularly important. Therefore, it becomes difficult for the content classification means to use this keyword when determining whether the content and other content are related to each other.

これにより、コンテンツ分類装置は、コンテンツをグループに分類するときの構成が、ユーザが望まないものになることを防止できる効果を奏する。 Thereby, the content classification device has an effect of preventing the configuration when the content is classified into groups from being undesired by the user.

また、本発明に係るコンテンツ分類装置では、さらに、
上記コンテンツ分類手段による分類結果を所定のディスプレイに表示する表示手段をさらに備えていることが好ましい。 In the content classification device according to the present invention,
It is preferable to further include display means for displaying the result of classification by the content classification means on a predetermined display.

上記の構成によれば、表示手段は、コンテンツ分類手段がコンテンツをグループに分類した結果を、所定のディスプレイに表示する。たとえばコンテンツの新たな分類構成を、ディスプレイに表示する。これにより、ユーザは、コンテンツ分類装置が管理しているコンテンツの分類状況を、容易に把握できる効果を奏する。 According to the above configuration, the display unit displays the result of the content classification unit classifying the content into the group on the predetermined display. For example, a new classification configuration of content is displayed on the display. Thus, the user can easily grasp the content classification status managed by the content classification device.

ユーザが選択した上記キーワードを学習する嗜好学習部と、
上記嗜好学習部が学習した上記キーワードに基づき、ユーザにコンテンツを推薦するコンテンツ推薦手段とをさらに備えていることが好ましい。 A preference learning unit for learning the keyword selected by the user;
It is preferable to further include content recommendation means for recommending content to the user based on the keyword learned by the preference learning unit.

上記の構成によれば、嗜好学習部は、ユーザが選択したキーワードを、たとえば、ユーザが嗜好しているキーワードとして学習する。このとき、学習したキーワードを、たとえば、所定の嗜好情報データベースに記憶する。これにより、コンテンツ推薦手段は、嗜好学習部が学習したキーワードに基づき、このキーワードによる表されるコンテンツを、ユーザに推薦する。たとえば、所定のディスプレイに、このコンテンツに関する情報を表示することによって、コンテンツをユーザに推薦する。 According to said structure, a preference learning part learns the keyword which the user selected as a keyword which the user likes, for example. At this time, the learned keyword is stored in, for example, a predetermined preference information database. Accordingly, the content recommendation unit recommends the content represented by the keyword to the user based on the keyword learned by the preference learning unit. For example, the content is recommended to the user by displaying information on the content on a predetermined display.

このように、コンテンツ分類装置は、コンテンツのマッチングに使用するキーワードをユーザが選択するだけで、ユーザが嗜好している可能性が高いコンテンツをユーザに推薦できる効果を奏する。 In this way, the content classification device has an effect of recommending to the user content that is likely to be liked by the user simply by selecting the keyword used for content matching.

また、本発明に係るコンテンツ分類装置では、さらに、
ユーザの指示に基づき、上記グループに新たなコンテンツを追加するコンテンツ追加手段と、
上記コンテンツ追加手段によって上記新たなコンテンツが上記グループに追加されたとき、上記新たなコンテンツの内容を表すキーワードと、上記グループにあらかじめ含まれているコンテンツの内容を表すキーワードとの距離のうち、最大の距離を算出するキーワード距離測定手段と、
上記最大の距離があらかじめ定められた閾値よりも大きいとき、この最大の距離を新たな閾値に設定する閾値設定手段とをさらに備えていることが好ましい。 In the content classification device according to the present invention,
Content adding means for adding new content to the group based on user instructions;
When the new content is added to the group by the content adding means, the maximum of the distance between the keyword representing the content of the new content and the keyword representing the content of the content included in the group in advance Keyword distance measuring means for calculating the distance of
When the maximum distance is larger than a predetermined threshold, it is preferable to further include threshold setting means for setting the maximum distance as a new threshold.

上記の構成によれば、コンテンツ分類装置では、ユーザが指示したコンテンツを、コンテンツ追加手段がグループに追加する。さらに、キーワード距離測定手段は、新たなコンテンツの内容を表すキーワードと、このコンテンツが分類されるグループにあらかじめ含まれているコンテンツの内容を表すキーワードとの距離のうち、最大の距離を算出する。つぎに、キーワード距離測定手段が算出する最大の距離が、グループにあらかじめ定められた閾値よりも大きいとき、この最大の距離を、グループの新たな閾値に設定する。 According to the above configuration, in the content classification device, the content adding unit adds the content specified by the user to the group. Further, the keyword distance measuring means calculates the maximum distance among the distances between the keyword representing the content of the new content and the keyword representing the content of the content previously included in the group into which the content is classified. Next, when the maximum distance calculated by the keyword distance measuring means is larger than a threshold value predetermined for the group, this maximum distance is set as a new threshold value for the group.

ユーザが、あるコンテンツをグループに追加することをコンテンツ分類装置に指示するのは、グループに分類されるコンテンツの範囲を、より広くして欲しいからである可能性が高い。そこで、閾値設定手段は、あらかじめ定められた閾値を、より大きい値に変更することによって、より多くのコンテンツをこのグループに分類できるようにする。このように、コンテンツ分類装置は、ユーザが要求したコンテンツをグループに追加だけで、このグループに新たに追加されるコンテンツの範囲を自動的に広げることができる効果を奏する。 It is highly likely that the user instructs the content classification apparatus to add a certain content to the group because the range of the content classified into the group is desired to be wider. Therefore, the threshold value setting means changes a predetermined threshold value to a larger value so that more contents can be classified into this group. As described above, the content classification apparatus has an effect that the range of content newly added to the group can be automatically expanded only by adding the content requested by the user to the group.

また、本発明に係るコンテンツ分類装置では、さらに、
ユーザによって指示された上記コンテンツを、上記グループから取り除くコンテンツ削除手段と、
上記コンテンツ削除手段によって上記グループから上記コンテンツが取り除かれたとき、上記グループから任意に選択される第１コンテンツの内容を表すキーワードと、上記グループから任意に選択される、上記第１コンテンツと異なる第２コンテンツの内容を表す他のキーワードとの距離のうち、最大の距離を算出するキーワード距離測定手段と、
上記最大の距離が上記あらかじめ定められた閾値よりも小さいとき、この最大の距離を新たな閾値に設定する閾値設定手段とをさらに備えていることが好ましい。 In the content classification device according to the present invention,
Content deletion means for removing the content instructed by the user from the group;
When the content is removed from the group by the content deletion means, a keyword representing the content of the first content arbitrarily selected from the group and a first content different from the first content arbitrarily selected from the group A keyword distance measuring means for calculating the maximum distance among the distances to other keywords representing the contents of the two contents;
When the maximum distance is smaller than the predetermined threshold, it is preferable to further include threshold setting means for setting the maximum distance as a new threshold.

上記の構成によれば、コンテンツ分類装置では、コンテンツ削除手段が、ユーザが指示したコンテンツを、コンテンツ追加手段がグループから取り除く。さらに、キーワード距離測定手段は、コンテンツが取り除かれた後のグループから任意に選択される第１コンテンツの内容を表すキーワードと、同じく任意に選択される第２コンテンツの内容を表す他のキーワードとの距離のうち、最大の距離を算出する。つぎに、閾値設定手段が、算出された最大の距離が、あらかじめ定められた閾値よりも小さいとき、この最大の距離を、新たな閾値に設定する。 According to the above configuration, in the content classification device, the content deleting unit removes the content designated by the user from the group. Further, the keyword distance measuring means includes a keyword representing the content of the first content arbitrarily selected from the group after the content is removed and another keyword representing the content of the second content also arbitrarily selected. Of the distances, the maximum distance is calculated. Next, when the calculated maximum distance is smaller than a predetermined threshold, the threshold setting unit sets the maximum distance as a new threshold.

ユーザが、あるコンテンツをグループから取り除くことをコンテンツ分類装置に指示するのは、グループに分類するコンテンツの範囲を、より狭くして欲しいからである可能性が高い。そこで、閾値設定手段は、あらかじめ定められた閾値を、より小さい値に変更することによって、このグループに次に分類されるコンテンツの数がより少なくなるように、調整する。このように、コンテンツ分類装置は、ユーザが要求したコンテンツをグループから取り除くだけで、このグループに新たに追加されるコンテンツの範囲を自動的に狭めることができる効果を奏する。 It is highly likely that the user instructs the content classification apparatus to remove a certain content from the group because the range of the content to be classified into the group is desired to be narrower. Therefore, the threshold value setting means adjusts the predetermined threshold value to a smaller value so that the number of contents classified next in this group becomes smaller. As described above, the content classification device has an effect of automatically narrowing the range of content newly added to the group only by removing the content requested by the user from the group.

また、本発明に係るコンテンツ分類装置では、さらに、
ユーザが指示した新たなコンテンツを上記グループに追加するコンテンツ追加手段をさらに備え、
上記コンテンツ分類手段は、
上記コンテンツ追加手段によって上記グループに上記新たなコンテンツが追加されたとき、上記グループに含まれる各コンテンツの内容を表している上記キーワードに共通する文字列を、上記グループの新たな名前に設定することが好ましい。 In the content classification device according to the present invention,
A content adding means for adding new content designated by the user to the group;
The content classification means
When the new content is added to the group by the content adding means, a character string common to the keyword representing the content of each content included in the group is set as a new name of the group. Is preferred.

上記の構成によれば、コンテンツ追加手段は、ユーザが指示したコンテンツを、グループに追加する。これにより、コンテンツ分類手段は、新たなコンテンツが追加されるグループの名前を変更する。具体的には、コンテンツ分類手段は、グループに分類されている各コンテンツの内容を表す各キーワードに共通する文字列を、抽出し、この文字列を、グループの名前に設定する。これにより、グループに設定される名前は、グループに追加された新たなコンテンツのキーワードを反映したものとなる。 According to said structure, a content addition means adds the content which the user instruct | indicated to the group. Thereby, the content classification means changes the name of the group to which new content is added. Specifically, the content classification unit extracts a character string common to each keyword representing the content of each content classified into a group, and sets this character string as the name of the group. Thereby, the name set to the group reflects the keyword of the new content added to the group.

このように、コンテンツ分類装置は、グループに新たに追加されたコンテンツを反映した、より適切な名前をグループに設定できる効果を奏する。 In this way, the content classification device has an effect that a more appropriate name that reflects the content newly added to the group can be set for the group.

また、本発明に係るコンテンツ分類装置では、さらに、
ユーザによって指示された上記コンテンツを上記グループから取り除くコンテンツ削除手段をさらに備え、
上記コンテンツ分類手段は、
上記コンテンツ削除手段によって上記コンテンツが上記グループから取り除かれたとき、上記グループに新たに含まれる各コンテンツの内容を表す上記キーワードに共通する文字列を、上記グループの名前に新たに設定することが好ましい。 In the content classification device according to the present invention,
A content deleting means for removing the content instructed by the user from the group;
The content classification means
When the content is removed from the group by the content deletion unit, it is preferable to newly set a character string common to the keyword representing the content of each content newly included in the group as the name of the group .

上記の構成によれば、コンテンツ削除手段は、ユーザが指示したコンテンツを、グループから取り除く。これにより、コンテンツ分類手段は、グループの名前を変更する。具体的には、コンテンツ分類手段は、グループに残されている各コンテンツの内容を表す各キーワードに共通する文字列を抽出し、この文字列を、グループの名前に設定する。したがって、グループに設定される名前は、グループに残されているコンテンツを反映した、すなわちグループに取り除かれたコンテンツを反映しないものとなる。 According to said structure, a content deletion means removes the content which the user instruct | indicated from the group. Thereby, the content classification means changes the name of the group. Specifically, the content classification unit extracts a character string common to each keyword representing the content of each content remaining in the group, and sets this character string as the name of the group. Therefore, the name set for the group reflects the content remaining in the group, that is, does not reflect the content removed in the group.

このように、コンテンツ分類装置は、あるコンテンツが取り除かれた後にグループに残されているコンテンツを反映した、より適切な名前をグループに新たに設定できる効果を奏する。 As described above, the content classification apparatus has an effect that a more appropriate name can be newly set for a group reflecting content remaining in the group after a certain content is removed.

また、本発明に係るコンテンツ分類装置では、さらに、
所定の番組を上記コンテンツとして受信する番組受信手段と、
上記番組の内容を表す番組情報を上記付加情報として取得する番組情報取得手段とをさらに備えていることが好ましい。 In the content classification device according to the present invention,
Program receiving means for receiving a predetermined program as the content;
It is preferable to further include program information acquisition means for acquiring program information representing the contents of the program as the additional information.

上記の構成によれば、コンテンツ分類装置は、番組をグループに分類する手間をユーザに与えることなく、受信した番組をグループに自動的に分類できる効果を奏する。 According to the above configuration, the content classification device has an effect of automatically classifying a received program into a group without giving the user the trouble of classifying the program into a group.

また、本発明に係るコンテンツ分類装置では、さらに、
上記付加情報は、上記番組の内容を表す所定のキーワードを含んでおり、
上記他の付加情報は、上記他の番組の内容を表す所定の他のキーワードを含んでおり、
上記コンテンツ関連性判定手段は、
上記キーワードと上記他のキーワードとが少なくとも部分的に一致すると判定するとき、上記番組と上記他の番組とが互いに関連していると判定することが好ましい
上記の構成によれば、コンテンツ分類装置は、番組をより適切なグループに分類できる効果を奏する。 In the content classification device according to the present invention,
The additional information includes a predetermined keyword representing the content of the program,
The other additional information includes a predetermined other keyword representing the content of the other program,
The content relevance determining means is
When it is determined that the keyword and the other keyword are at least partially matched, it is preferable to determine that the program and the other program are related to each other. The program can be classified into a more appropriate group.

また、本発明に係るコンテンツ分類装置では、さらに、
ユーザが録画および視聴の少なくともいずれかを要求した上記番組を表す上記番組情報を所定の記録媒体に保存する番組情報保存手段と、
上記記録媒体に記憶されている上記番組情報に含まれている文字列を含んでいる上記キーワードに、より高い優先度を設定する優先度設定手段とをさらに備えていることが好ましい。 In the content classification device according to the present invention,
Program information storage means for storing the program information representing the program requested by the user for at least one of recording and viewing on a predetermined recording medium;
It is preferable to further include priority setting means for setting a higher priority for the keyword including the character string included in the program information stored in the recording medium.

上記の構成によれば、番組情報保存手段は、ユーザが録画および視聴の少なくともいずれかを要求した番組の表す番組情報を、所定の記録媒体に記憶する。すなわち番組情報保存手段は、ユーザが指定した番組の履歴を記録媒体に蓄積する。これにより優先度設定手段は、記録媒体に記憶されている番組情報に含まれている文字列を含んでいるキーワードに、より高い優先度を設定する。すなわち、優先度設定手段は、ユーザが録画する番組より、視聴する番組を表すキーワードに、より高い優先度を設定する。したがって、コンテンツ分類手段は、ユーザが普段親しんでいる番組を表すキーワードのマッチング結果に基づき、番組をグループに分類する。 According to said structure, a program information preservation | save means memorize | stores in the predetermined recording medium the program information which the program which the user requested | required at least any one of recording and viewing. That is, the program information storage means stores the history of the program designated by the user in the recording medium. As a result, the priority setting means sets a higher priority for the keyword including the character string included in the program information stored in the recording medium. That is, the priority setting means sets a higher priority to the keyword representing the program to be viewed than the program recorded by the user. Therefore, the content classifying unit classifies programs into groups based on keyword matching results representing programs that the user is usually familiar with.

これにより、コンテンツ分類装置では、ユーザが番組の嗜好に関する情報を入力しなくても、番組をユーザの嗜好にあわせてグループに自動的に分類できる効果を奏する
上記記録媒体に記憶されている上記番組情報に含まれている文字列を含んでいる上記キーワードを学習する嗜好学習部と、
上記嗜好学習部が学習した上記キーワードに基づき、ユーザに番組を推薦する番組推薦手段とをさらに備えていることが好ましい。 Thereby, in the content classification device, the program stored in the recording medium has an effect of automatically classifying the program into a group according to the user's preference without the user inputting information regarding the program preference. A preference learning unit for learning the keyword including the character string included in the information;
It is preferable to further include program recommendation means for recommending a program to the user based on the keyword learned by the preference learning unit.

上記の構成によれば、嗜好学習部は、ユーザが指定した番組の履歴に基づき、キーワードを学習する。このとき、学習したキーワードを、たとえば、所定の嗜好情報データベースに記憶する。これにより、番組推薦手段は、嗜好学習部が学習したキーワードに基づき、このキーワードによる表される番組を、ユーザに推薦する。たとえば、所定のディスプレイに、この番組について情報を表示することによって、番組をユーザに推薦する。 According to said structure, a preference learning part learns a keyword based on the log | history of the program designated by the user. At this time, the learned keyword is stored in, for example, a predetermined preference information database. Thereby, the program recommendation means recommends a program represented by the keyword to the user based on the keyword learned by the preference learning unit. For example, a program is recommended to the user by displaying information about the program on a predetermined display.

このように、コンテンツ分類装置は、ユーザが番組を視聴したり録画したりするだけで、ユーザが嗜好している可能性が高い番組をユーザに自動的に推薦できる効果を奏する。 As described above, the content classification device has an effect of automatically recommending to the user a program that is likely to be liked by the user only by viewing or recording the program.

ユーザによる上記キーワードの選択を受け付るキーワード選択手段と、
ユーザが選択した上記キーワードを学習する嗜好学習部と、
上記嗜好学習部が学習した上記キーワードに基づき、ユーザに番組を推薦する番組推薦手段とをさらに備えていることが好ましい。
ｘ上記の構成によれば、嗜好学習部は、ユーザが選択したキーワードを、たとえば、ユーザが嗜好しているキーワードとして学習する。このとき、学習したキーワードを、たとえば、所定の嗜好情報データベースに記憶する。これにより、番組推薦手段は、嗜好学習部が学習したキーワードに基づき、このキーワードによって表される番組を、ユーザに推薦する。たとえば、所定のディスプレイに、この番組に関する情報を表示することによって、番組をユーザに推薦する。 Keyword selection means for accepting selection of the keyword by the user;
A preference learning unit for learning the keyword selected by the user;
It is preferable to further include program recommendation means for recommending a program to the user based on the keyword learned by the preference learning unit.
x According to said structure, a preference learning part learns the keyword which the user selected as a keyword which the user likes, for example. At this time, the learned keyword is stored in, for example, a predetermined preference information database. Thereby, the program recommendation means recommends the program represented by the keyword to the user based on the keyword learned by the preference learning unit. For example, the program is recommended to the user by displaying information about the program on a predetermined display.

このように、コンテンツ分類装置は、番組のマッチングに使用するキーワードをユーザが選択するだけで、ユーザが嗜好している可能性が高い番組をユーザに自動的に推薦できる効果を奏する。 As described above, the content classification device has an effect of automatically recommending to the user a program that is likely to be liked by the user only by selecting the keyword used for program matching.

なお、上記コンテンツ分類装置は、コンピュータによって実現してもよい。この場合、コンピュータを上記各手段として動作させることにより上記コンテンツ分類装置をコンピュータにおいて実現するコンテンツ分類プログラム、およびそのコンテンツ分類プログラムを記録したコンピュータ読み取り可能な記録媒体も、本発明の範疇に入る。 The content classification apparatus may be realized by a computer. In this case, a content classification program for realizing the content classification device in a computer by operating the computer as each of the above means and a computer-readable recording medium recording the content classification program are also included in the scope of the present invention.

以上のように、本発明に係るコンテンツ分類装置は、互いに関連していると判定されたコンテンツおよび他のコンテンツとを、同一のグループに分類するコンテンツ分類手段を備えているため、コンテンツを分類する手間をユーザに与えることなく、コンテンツをグループに自動的に分類できる効果を奏する。 As described above, since the content classification device according to the present invention includes content classification means for classifying content determined to be related to each other and other content into the same group, the content classification device classifies the content. There is an effect that contents can be automatically classified into groups without giving users trouble.

本発明の一実施形態について、図１から図２１を参照して以下に説明する。
（コンテンツ分類装置１）
まず、本実施形態に係るコンテンツ分類装置１の構成について、図１を参照して以下に説明する。図１は、本発明の一実施形態に係るコンテンツ分類装置１の詳細を表すブロック図である。この図によれば、コンテンツ分類装置１は、入力部２、ディスプレイ４、コンテンツデータベース５、分類データベース６、およびコンテンツ管理部１０を備えている。
（入力部２）
入力部２は、所定のコンテンツの入力を受け付る。このコンテンツは、たとえば、ウェブページまたは電子書籍などである。入力部２は、さらに、このコンテンツの内容を表すメタ情報（付加情報）の入力を受け付る。入力部２は、入力されるコンテンツおよびメタ情報を、コンテンツ管理部１０に出力する。
（表示部３およびディスプレイ４）
表示部３は、コンテンツの分類に関する所定の情報をディスプレイ４に表示する。たとえば、現在のコンテンツの分類構成をディスプレイ４に表示する。また、コンテンツをグループに分類するときにユーザに選択させるキーワードの一覧を表示する。
（コンテンツデータベース５）
コンテンツデータベース５は、コンテンツ分類装置１に入力されるコンテンツを格納しているデータベースである。このデータベースは、さらに、コンテンツの内容を表すメタ情報も格納している。コンテンツデータベース５は、コンテンツ分類装置１において、所定の図示しないメモリに形成されている。
（分類データベース６）
分類データベース６は、コンテンツがどのグループに分類されているかを定義している表示用データを格納しているデータベースである。この分類データの詳細は、後述する。分類データベース６は、コンテンツ分類装置１において、所定の図示しないメモリに形成されている。
（コンテンツ管理部１０）
コンテンツ管理部１０は、入力部２を通じて入力されるコンテンツを、コンテンツデータベース５および分類データベース６において管理する。コンテンツ管理部１０は、その内部にサブモジュールとして、キーワード抽出部１１、コンテンツ関連性判定部１２（コンテンツ関連性判定手段）、コンテンツ分類部１３（コンテンツ分類手段）、優先度設定部１４（優先度設定手段）、グループ作成部１５、グループ削除部１６（グループ削除手段）、および嗜好学習部１７（嗜好学習手段）を備えている。 An embodiment of the present invention will be described below with reference to FIGS.
(Content classification device 1)
First, the configuration of the content classification device 1 according to the present embodiment will be described below with reference to FIG. FIG. 1 is a block diagram showing details of a content classification apparatus 1 according to an embodiment of the present invention. According to this figure, the content classification device 1 includes an input unit 2, a display 4, a content database 5, a classification database 6, and a content management unit 10.
(Input unit 2)
The input unit 2 receives input of predetermined content. This content is, for example, a web page or an electronic book. The input unit 2 further receives input of meta information (additional information) representing the contents. The input unit 2 outputs the input content and meta information to the content management unit 10.
(Display unit 3 and display 4)
The display unit 3 displays predetermined information related to content classification on the display 4. For example, the current content classification configuration is displayed on the display 4. In addition, a list of keywords to be selected by the user when the contents are classified into groups is displayed.
(Content database 5)
The content database 5 is a database that stores content input to the content classification device 1. The database further stores meta information representing the contents. The content database 5 is formed in a predetermined memory (not shown) in the content classification device 1.
(Classification database 6)
The classification database 6 is a database that stores display data that defines which group the content is classified into. Details of the classification data will be described later. The classification database 6 is formed in a predetermined memory (not shown) in the content classification apparatus 1.
(Content Management Unit 10)
The content management unit 10 manages content input through the input unit 2 in the content database 5 and the classification database 6. The content management unit 10 includes a keyword extraction unit 11, a content relevance determination unit 12 (content relevance determination unit), a content classification unit 13 (content classification unit), and a priority setting unit 14 (priority) as submodules. A setting unit), a group creation unit 15, a group deletion unit 16 (group deletion unit), and a preference learning unit 17 (preference learning unit).

次に、これらの各部材について説明する。
（キーワード抽出部１１）
キーワード抽出部１１は、コンテンツの内容を表すメタ情報から、所定のキーワードを抽出する。キーワード抽出部１１は、新たに入力されるコンテンツを表すメタ情報から、たとえば所定のテキストデータによって表されるキーワードを抽出できる。さらに、既存のグループに分類されているコンテンツを表すメタ情報からも、コンテンツの内容を表すキーワードを抽出する。このときキーワード抽出部１１は、コンテンツデータベース５に格納されているメタ情報から、キーワードを抽出する。 Next, each of these members will be described.
(Keyword extraction unit 11)
The keyword extraction unit 11 extracts a predetermined keyword from meta information representing the content content. The keyword extraction unit 11 can extract, for example, a keyword represented by predetermined text data from meta information representing newly input content. Further, a keyword representing the content content is extracted from the meta information representing the content classified into the existing group. At this time, the keyword extraction unit 11 extracts keywords from the meta information stored in the content database 5.

キーワード抽出部１１は、抽出したキーワードを、コンテンツ関連性判定部１２に出力する。
（コンテンツ関連性判定部１２）
コンテンツ関連性判定部１２は、コンテンツと、このコンテンツと異なる他のコンテンツとが、互いに関連しているかを判定する。具体的には、コンテンツ関連性判定部１２は、コンテンツの内容を表すメタ情報と、このコンテンツとは異なる他のコンテンツを表す他のメタ情報とが、少なくとも部分的に一致するか否かを判定する。一致すると判定するとき、コンテンツ関連性判定部１２は、コンテンツおよび他のコンテンツを互いに関連していると判定する。 The keyword extraction unit 11 outputs the extracted keyword to the content relevance determination unit 12.
(Content relevance determination unit 12)
The content relevance determination unit 12 determines whether the content and other content different from the content are related to each other. Specifically, the content relevance determination unit 12 determines whether or not the meta information representing the content and the other meta information representing other content different from the content match at least partially. To do. When determining that they match, the content relevance determination unit 12 determines that the content and other content are related to each other.

コンテンツ関連性判定部１２は、さらに、メタ情報から抽出したキーワードと、他のメタ情報から抽出した他のキーワードとが、少なくとも部分的に一致するときも、コンテンツと他のコンテンツとを、互いに関連していると判定する。すなわち、コンテンツ関連性判定部１２は、コンテンツの内容を表すキーワードのマッチングによって、２つのコンテンツが関連しているかを判定する。 The content relevance determination unit 12 further associates the content with the other content even when the keyword extracted from the meta information and the other keyword extracted from the other meta information at least partially match each other. It is determined that That is, the content relevance determination unit 12 determines whether two content items are related by matching keywords representing the content content.

なお、コンテンツ関連性判定部１２は、判定の結果を、コンテンツ分類部１３に分類する。
（コンテンツ分類部１３）
コンテンツ分類部１３は、コンテンツ関連性判定部１２による判定結果に基づき、コンテンツをグループに分類する。たとえば、コンテンツ分類装置１に新たに入力されるコンテンツを、作成済みのグループに分類する。また、新たに入力された複数のコンテンツを、新たに作成されるグループに分類することもできる。
（優先度設定部１４）
優先度設定部１４は、キーワードに優先度を設定する。この詳細は、後述する。
（グループ作成部１５）
グループ作成部１５は、コンテンツを分類するためのグループを作成する。グループ作成部１５は、２つのコンテンツのキーワードに共通する文字列を、グループの名前に設定することができる。この詳細は、後述する。
（グループ削除部１６）
グループ削除部１６は、ユーザが指定した既存のグループを削除する。すなわち、分類データベース６に記憶されている分類データから、ユーザが指定したグループに関する情報を削除する。
（嗜好学習部１７）
嗜好学習部１７は、ユーザが選択したキーワードを学習する。このとき、嗜好学習部１７は、学習したキーワードに関する情報を、図示しない嗜好情報データベースに記憶している。詳細は後述する。 The content relevance determination unit 12 classifies the determination result in the content classification unit 13.
(Content classification unit 13)
The content classification unit 13 classifies content into groups based on the determination result by the content relevance determination unit 12. For example, content newly input to the content classification device 1 is classified into a created group. In addition, a plurality of newly input contents can be classified into a newly created group.
(Priority setting unit 14)
The priority setting unit 14 sets a priority for the keyword. Details of this will be described later.
(Group creation unit 15)
The group creation unit 15 creates a group for classifying content. The group creation unit 15 can set a character string common to two content keywords as the name of the group. Details of this will be described later.
(Group deletion unit 16)
The group deletion unit 16 deletes an existing group designated by the user. That is, the information regarding the group designated by the user is deleted from the classification data stored in the classification database 6.
(Preference learning section 17)
The preference learning unit 17 learns the keyword selected by the user. At this time, the preference learning unit 17 stores information on the learned keywords in a preference information database (not shown). Details will be described later.

なお、コンテンツ分類装置１において、キーワード抽出部１１、コンテンツ関連性判定部１２、コンテンツ分類部１３、グループ作成部１５、およびグループ削除部１６は、それぞれの機能を併せて持った、一つの部材として実装されていてもよい。その意味では、たとえば、コンテンツ分類部１３がグループ作成部１５としての機能を有していてもよい。
（コンピュータシステム構成）
本実施の形態に係るコンテンツ分類装置１は、実質的には、コンピュータハードウェアと、そのコンピュータハードウェアにより実行されるプログラムと、コンピュータハードウェアに格納されるデータとにより実現される。 In the content classification device 1, the keyword extraction unit 11, the content relevance determination unit 12, the content classification unit 13, the group creation unit 15, and the group deletion unit 16 are combined into each other as a single member. May be implemented. In that sense, for example, the content classification unit 13 may have a function as the group creation unit 15.
(Computer system configuration)
The content classification device 1 according to the present embodiment is substantially realized by computer hardware, a program executed by the computer hardware, and data stored in the computer hardware.

そこで、コンテンツ分類装置１を実現するコンピュータシステム２００について、図２を参照して以下に説明する。図２は、コンテンツ分類装置１を実現するコンピュータシステム２００を示す図である。 A computer system 200 that implements the content classification device 1 will be described below with reference to FIG. FIG. 2 is a diagram illustrating a computer system 200 that implements the content classification apparatus 1.

この図によれば、コンピュータシステム２００は、光ディスクドライブ２０１および磁気ディスクドライブ２０２を有するコンピュータ２２０と、モニタ２０３と、リモコン２０４とを備えている。 According to this figure, the computer system 200 includes a computer 220 having an optical disk drive 201 and a magnetic disk drive 202, a monitor 203, and a remote control 204.

コンピュータ２２０は、光ディスクドライブ２０１および磁気ディスクドライブ２０２に加えて、リモコン２０４からの信号を受信するリモコンインタフェース（Ｉ／Ｆ）２０５を備えている。さらに、ＣＰＵ（中央処理装置）２０６を備えている。さらに、リモコンインタフェース２０５、ＣＰＵ２０６、光ディスクドライブ２０１および磁気ディスクドライブ２０２に接続されたバス２０７を備えている。さらに、バス２０７に接続され、ブートアッププログラム等を記憶する読出専用メモリ（ＲＯＭ）２０８を備えている。さらに、同じくバス２０７に接続され、プログラム命令、システムプログラム、および作業データ等を記憶するランダムアクセスメモリ（ＲＡＭ）２０９を備えている。 The computer 220 includes a remote control interface (I / F) 205 that receives a signal from the remote control 204 in addition to the optical disc drive 201 and the magnetic disc drive 202. Further, a CPU (Central Processing Unit) 206 is provided. Further, a bus 207 connected to the remote control interface 205, the CPU 206, the optical disk drive 201, and the magnetic disk drive 202 is provided. Furthermore, a read-only memory (ROM) 208 is provided which is connected to the bus 207 and stores a bootup program and the like. Further, a random access memory (RAM) 209 is also connected to the bus 207 and stores program instructions, system programs, work data, and the like.

図２には示さないが、コンピュータ２２０は、さらに、ローカルエリアネットワーク（ＬＡＮ）への接続を提供するネットワークアダプタボードを備えてもよい。 Although not shown in FIG. 2, the computer 220 may further include a network adapter board that provides a connection to a local area network (LAN).

コンピュータシステム２００に情報処理装置の機能を実現させるためのプログラム、入力情報はいずれも、光ディスクドライブ２０１に挿入される光ディスク２１０に記憶されている。または、磁気ディスクドライブ２０２に挿入される磁気ディスク２１１に記憶されている。この記録媒体に記憶されているコンテンツ分類プログラムは、ハードディスク２１２に転送される。または、コンテンツ分類プログラムは、図示しないネットワークを通じてコンピュータ２２０に送信され、ハードディスク２１２に記憶されてもよい。コンテンツ分類プログラムは、ＲＡＭ２０９にロードされることによって、実行される。なお、光ディスク２１０から、磁気ディスク２１１から、またはネットワークを介して、直接にＲＡＭ２０９にプログラムをロードしてもよい。 Both a program for causing the computer system 200 to realize the functions of the information processing apparatus and input information are stored in the optical disc 210 inserted into the optical disc drive 201. Alternatively, it is stored in the magnetic disk 211 inserted into the magnetic disk drive 202. The content classification program stored in this recording medium is transferred to the hard disk 212. Alternatively, the content classification program may be transmitted to the computer 220 through a network (not shown) and stored in the hard disk 212. The content classification program is executed by being loaded into the RAM 209. Note that the program may be loaded directly into the RAM 209 from the optical disk 210, from the magnetic disk 211, or via a network.

このコンテンツ分類プログラムは、コンピュータ２２０に、本実施の形態に係るコンテンツ分類装置１の機能を実現させる複数の命令を含んでいる。これら機能を実現させるのに必要な基本的機能のいくつかは、コンピュータ２２０上で動作するオペレーティングシステム（ＯＳ）またはサードパーティのプログラム、若しくはコンピュータ２２０にインストールされる各種ツールキットのモジュールにより提供される。したがって、このコンテンツ分類プログラムは、本実施形態に係るコンテンツ分類装置１の機能を実現するのに必要な機能の全てを必ずしも含まなくてよい。すなわちコンテンツ分類プログラムは、命令のうち、所望の結果が得られるように制御されたやり方で適切な機能または「ツール」を呼出すことにより、上記した情報処理装置の制御を実行する命令のみを含んでいればよい。コンピュータシステム２００の動作は周知であるので、ここでは繰り返さない。 This content classification program includes a plurality of instructions for causing the computer 220 to realize the functions of the content classification apparatus 1 according to the present embodiment. Some of the basic functions necessary to realize these functions are provided by operating system (OS) or third-party programs running on the computer 220 or modules of various toolkits installed on the computer 220. . Therefore, this content classification program does not necessarily include all the functions necessary for realizing the functions of the content classification device 1 according to the present embodiment. That is, the content classification program includes only an instruction for executing control of the information processing apparatus described above by calling an appropriate function or “tool” in a controlled manner so as to obtain a desired result. It only has to be. The operation of computer system 200 is well known and will not be repeated here.

なお、情報処理装置の機能を実現するためのプログラムが記憶されている記録媒体は、ＣＤ−ＲＯＭ（コンパクトディスク読取専用メモリ）、ＭＯ（Ｍａｇｎｅｔｏ−Ｏｐｔｉｃａｌｄｉｓｃ）、ＭＤ（ＭｉｎｉＤｉｓｃ）、およびＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ）等の光ディスク２１０、ＦＤ（フレキシブルディスク）およびハードディスク等の磁気ディスク２１１に限らない。磁気テープおよびカセットテープ等のテープ、ＩＣ（ＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）カードおよび光カード等のカード型記録媒体、並びにマスクＲＯＭ、ＥＰＲＯＭ（ＥｒａｓａｂｌｅＰｒｏｇｒａｍａｂｌｅＲＯＭ）、ＥＥＰＲＯＭ（ＥｌｅｃｔｒｉｃａｌｌｙＥｒａｓａｂｌｅＰｒｏｇｒａｍａｂｌｅＲＯＭ）、およびフラッシュＲＯＭ等の半導体メモリのいずれかでもよい。ただし、コンピュータシステム２００は、これら記録媒体からの読出を行なうための読出装置を実装する必要がある。
（コンテンツおよびメタ情報のデータ構造）
コンテンツ分類装置１が処理するコンテンツ、およびこのコンテンツの内容を表すメタ情報のデータ構造について、図３を参照して以下に説明する。図３は、コンテンツおよびメタ情報を表すコンテンツデータを説明する図である。この図によれば、コンテンツ分類装置１が処理するコンテンツデータは、コンテンツを表すコンテンツ領域と、メタ情報を表すメタ情報領域とが一体化された構造である。 Note that recording media storing programs for realizing the functions of the information processing apparatus are CD-ROM (Compact Disc Read Only Memory), MO (Magneto-Optical disc), MD (Mini Disc), and DVD ( The present invention is not limited to the optical disk 210 such as Digital Versatile Disc), and the magnetic disk 211 such as FD (flexible disk) and hard disk. Tapes such as magnetic tape and cassette tape, card-type recording media such as IC (Integrated Circuit) card and optical card, and semiconductors such as mask ROM, EPROM (Erasable Programmable ROM), EEPROM (Electrically Erasable Programmable ROM), and flash ROM It can be either memory. However, the computer system 200 needs to be equipped with a reading device for reading from these recording media.
(Data structure of content and meta information)
The data processed by the content classification device 1 and the data structure of meta information representing the content will be described below with reference to FIG. FIG. 3 is a diagram illustrating content data representing content and meta information. According to this figure, the content data processed by the content classification device 1 has a structure in which a content area representing content and a meta information area representing meta information are integrated.

コンテンツデータは、文字通り、コンテンツを構成するデータである。メタ情報は、このコンテンツの内容を表す、所定の情報である。図３に示すように、メタ情報は、所定のキーワードを含んでいる。各キーワードは、コンテンツの内容を表している。また、これらのキーワードは、それぞれ、所定のカテゴリに属している。たとえば、キーワード「コンテンツＡＢＣパート４」は、カテゴリ「タイトル」に属している。さらに、キーワード「田中一朗」は、カテゴリ「作成者」に属している。 The content data is literally data constituting the content. The meta information is predetermined information that represents the content. As shown in FIG. 3, the meta information includes a predetermined keyword. Each keyword represents the content. Each of these keywords belongs to a predetermined category. For example, the keyword “content ABC part 4” belongs to the category “title”. Further, the keyword “Ichiro Tanaka” belongs to the category “creator”.

キーワードが属するカテゴリの例として、たとえば、作成者、作成日、コンテンツサイズ、および内容説明がある。これらのカテゴリは、いずれも、コンテンツに関する特定の特徴を端的に表した所定の属性である。
（分類データベース６の詳細）
上述したように、コンテンツ分類装置１は、コンテンツのどのグループに分類するか否かを、分類データベース６を使用して管理している。そこで、この分類データベース６の詳細について、図４を参照して以下に説明する。図４は、分類データベース６の詳細を表す図である。 Examples of categories to which keywords belong include creator, creation date, content size, and content description. Each of these categories is a predetermined attribute that expresses a specific feature related to the content.
(Details of classification database 6)
As described above, the content classification device 1 uses the classification database 6 to manage which group of content is to be classified. The details of the classification database 6 will be described below with reference to FIG. FIG. 4 is a diagram showing details of the classification database 6.

この図によれば、分類データベース６は、コンテンツと、コンテンツが分類されているグループとの対応関係を表すデータベースである。すなわち、分類データベース６は、コンテンツのタイトルと、グループの名前とを互いに関連付け、格納している。さらに、グループごとに、グループの名前と、この名前が属するカテゴリとを関連付け、格納している。ここでいう、グループの名前が属する「カテゴリ」は、グループ作成部１５がグループの名前を決定するときに使用した、コンテンツの内容を表すキーワードが属するカテゴリと、同一である。
（グループ分類フラグ）
分類データベース６は、さらに、所定のグループ分類フラグを格納している。このグループ分類フラグは、あるコンテンツがあるグループに分類されているか否かを表している。グループ分類フラグは、分類データベース６において、コンテンツのタイトルと、このコンテンツが分類されているグループの名前とに関連付られている。 According to this figure, the classification database 6 is a database representing a correspondence relationship between contents and groups into which the contents are classified. That is, the classification database 6 stores the content title and the group name in association with each other. Furthermore, for each group, the name of the group and the category to which the name belongs are associated and stored. The “category” to which the group name belongs here is the same as the category to which the keyword representing the content content used when the group creating unit 15 determines the group name.
(Group classification flag)
The classification database 6 further stores a predetermined group classification flag. This group classification flag indicates whether or not a certain content is classified into a certain group. The group classification flag is associated with the title of the content and the name of the group into which the content is classified in the classification database 6.

グループ分類フラグが「１」のとき、コンテンツはグループに分類されている。たとえば、コンテンツのタイトルである「コンテンツＡＢＣ」と、グループの名前である「コンテンツＡＢＣ」とに関連付られているグループ分類フラグは、「１」である。すなわちコンテンツ「コンテンツＡＢＣ」は、グループ「コンテンツＡＢＣ」に分類されている。なお、グループ分類フラグが「０」のとき、コンテンツはグループに分類されていない。
（キーワードの格納）
分類データベース６は、さらに、あるコンテンツをあるグループに分類することを決定したときに使用されたキーワードを格納している。このキーワードは、分類データベース６において、コンテンツのタイトルと、このコンテンツが分類されているグループの名前とに関連付られている。分類データベース６に格納されているキーワードが属するカテゴリは、このキーワードが関連付られているグループの名前が属するカテゴリと同一である。すなわち、キーワードは、カテゴリが同一のグループに関連付られて格納されている。 When the group classification flag is “1”, the content is classified into groups. For example, the group classification flag associated with the content title “content ABC” and the group name “content ABC” is “1”. That is, the content “content ABC” is classified into the group “content ABC”. When the group classification flag is “0”, the content is not classified into groups.
(Keyword storage)
The classification database 6 further stores keywords used when it is decided to classify a certain content into a certain group. This keyword is associated in the classification database 6 with the title of the content and the name of the group into which the content is classified. The category to which the keyword stored in the classification database 6 belongs is the same as the category to which the name of the group to which this keyword is associated belongs. That is, the keywords are stored in association with groups having the same category.

図４に示す例では、タイトルがコンテンツＡＢＣ、コンテンツＡＢＣパート２、およびコンテンツＡＢＣパート３である３つのコンテンツが、いずれも、名前がコンテンツＡＢＣである１つのグループに分類されている。グループ「コンテンツＡＢＣ」の名前は、図４に示すように、カテゴリ「タイトル」に属している。そこで、コンテンツ分類装置１は、新たに入力されるコンテンツをこのグループに分類するか否かを、このコンテンツの内容を表すメタ情報に含まれているキーワードのうち、カテゴリ「タイトル」に属するキーワードをマッチングに使用することによって、決定する。すなわち、コンテンツ関連性判定部１２は、同一のカテゴリに属するキーワードを使用することによって、キーワードのマッチングを実行する。 In the example shown in FIG. 4, the three contents whose titles are content ABC, content ABC part 2 and content ABC part 3 are all classified into one group whose name is content ABC. The name of the group “content ABC” belongs to the category “title” as shown in FIG. Therefore, the content classification device 1 determines whether or not to classify newly input content into this group, and selects a keyword belonging to the category “title” from among keywords included in the meta information representing the content. Determine by using for matching. That is, the content relevance determination unit 12 performs keyword matching by using keywords belonging to the same category.

図４に示すように、各コンテンツ「データ１」、「データ２」、「情報１」、および「情報２」は、いずれも、グループ「山田」に分類されている。このグループの名前「山田」は、カテゴリ「作成者」に属している。そこでコンテンツ分類装置１は、新たに入力されるコンテンツをこのグループに分類するか否かを、このコンテンツの内容を表すメタ情報に含まれているキーワードのうち、カテゴリ「作成者」に属するキーワードをマッチングに使用することによって、決定する。
（キーワードの抽出）
コンテンツ分類装置１では、キーワード抽出部１１が、メタ情報から所定のキーワードを抽出する。さらに、他のメタ情報から所定の他のキーワードを抽出する。以下では、説明の便宜のため、キーワードという表現は、他のキーワードである意味も含むものとする。この点は、コンテンツ、メタ情報、番組についても同様である。 As shown in FIG. 4, each of the contents “data 1”, “data 2”, “information 1”, and “information 2” is classified into the group “Yamada”. The name “Yamada” of this group belongs to the category “Creator”. Therefore, the content classification device 1 determines whether or not to classify newly input content into this group, and selects keywords belonging to the category “creator” from among the keywords included in the meta information representing the content. Determine by using for matching.
(Keyword extraction)
In the content classification device 1, the keyword extraction unit 11 extracts a predetermined keyword from the meta information. Furthermore, predetermined other keywords are extracted from other meta information. In the following, for convenience of explanation, the expression “keyword” includes meanings that are other keywords. The same applies to content, meta information, and programs.

キーワード抽出部１１は、１つのメタ情報から、各カテゴリに属する全てのキーワードを抽出する。たとえば、図３に示すメタ情報から、キーワードを抽出するとき、キーワード抽出部１１は、タイトル、作成者、作成日、コンテンツサイズ、ジャンル、および内容説明のカテゴリに属するキーワードを、それぞれ、抽出する。 The keyword extraction unit 11 extracts all keywords belonging to each category from one piece of meta information. For example, when extracting keywords from the meta information shown in FIG. 3, the keyword extracting unit 11 extracts keywords belonging to the title, creator, creation date, content size, genre, and description category, respectively.

キーワード抽出部１１は、あるカテゴリに属するキーワードとして抽出した文字列から、さらに、単語などの、より短い部分文字列を抽出してもよい。たとえば、コンテンツの作成者は、１人だけではなく、複数人であることもある。複数の作成者によって作成されるコンテンツでは、その内容を表すメタ情報には、カテゴリ「作成者」に属するキーワードとして、「山田太郎、山田一朗」のように、各作成者をそれぞれ表す複数の文字列を含んでいる。すなわち、１つのカテゴリに、複数の情報が含まれる。このようなとき、キーワード抽出部１１は、１つのカテゴリに属する複数の情報を、それぞれ、１つのキーワードとして抽出すればよい。 The keyword extraction unit 11 may further extract a shorter partial character string such as a word from a character string extracted as a keyword belonging to a certain category. For example, there may be a plurality of content creators, not just one. In content created by multiple creators, the meta information representing the content includes multiple characters that represent each creator, such as “Taro Yamada, Ichiro Yamada” as keywords belonging to the category “creator”. Contains columns. That is, a plurality of information is included in one category. In such a case, the keyword extracting unit 11 may extract a plurality of pieces of information belonging to one category as one keyword.

キーワード抽出部１１は、さらに、各カテゴリに属する文字列、たとえば、カテゴリ「タイトル」に属する文字列、または、カテゴリ「内容説明」に属する文字列を、それぞれ、形態素解析によって、名詞のみや動詞のみなど、より短い部分文字列として抽出してもよい。 The keyword extraction unit 11 further converts only character strings belonging to the respective categories, for example, character strings belonging to the category “title” or character strings belonging to the category “contents description” to only nouns or only verbs by morphological analysis. For example, it may be extracted as a shorter partial character string.

このときコンテンツ関連性判定部１２は、抽出した各部分文字列を、それぞれ、１つのキーワードとして使用してもよい。なお、以下では、メタ情報から抽出する、各カテゴリに属する文字列を、そのまま、キーワードとして使用する例を説明する。
（コンテンツ関連性の判定）
コンテンツ分類装置１では、コンテンツ関連性判定部１２が、コンテンツ分類装置１に新たに入力されたコンテンツを、作成済みのグループに分類するか否かを判定する。このとき、コンテンツ関連性判定部１２は、入力されたコンテンツのメタ情報から抽出したキーワードと、既存のグループに含まれている各コンテンツのメタ情報から抽出したキーワードとを比較する。これにより、入力されたコンテンツと、グループに既存のコンテンツ（他のコンテンツ）とが、互いに関連するか否かを判定する。 At this time, the content relevance determination unit 12 may use each extracted partial character string as one keyword. In the following, an example will be described in which character strings belonging to each category extracted from meta information are used as keywords as they are.
(Determination of content relevance)
In the content classification device 1, the content relevance determination unit 12 determines whether or not the content newly input to the content classification device 1 is classified into a created group. At this time, the content relevance determination unit 12 compares the keyword extracted from the meta information of the input content with the keyword extracted from the meta information of each content included in the existing group. Thereby, it is determined whether the input content and the content existing in the group (other content) are related to each other.

このときコンテンツ関連性判定部１２は、既存のグループの名前が属するカテゴリと同一のカテゴリに属するキーワードを使用する。たとえば、グループの名前がカテゴリ「タイトル」に属するとき、入力されたコンテンツのメタ情報から抽出される、カテゴリ「タイトル」に属するキーワード（たとえば、コンテンツＡＢＣパート４）と、既存のグループに分類されている他のコンテンツのメタ情報から抽出される、カテゴリ「タイトル」に属するキーワード（たとえば、コンテンツＡＢＣパート２）とを比較する。コンテンツ関連性判定部１２は、これらのキーワードが互いにマッチングするとき、追加コンテンツを上記グループに分類することを決定する。 At this time, the content relevance determination unit 12 uses a keyword belonging to the same category as the category to which the name of the existing group belongs. For example, when the group name belongs to the category “title”, the keywords (for example, content ABC part 4) belonging to the category “title” extracted from the meta information of the input content and the existing group are classified. Compared with a keyword (for example, content ABC part 2) belonging to the category “title” extracted from the meta information of other content. When these keywords match each other, the content relevance determination unit 12 determines to classify the additional content into the group.

この決定に基づき、コンテンツ分類部１３は、コンテンツをグループに分類する。すなわち、追加するコンテンツのメタ情報から抽出される、グループのタイトルと、マッチングに使用したキーワードとを、分類データベース６に保存する。さらに、分類する先のグループの名前に、グループ分類フラグを立てる。
（距離の算出）
上述したように、コンテンツ関連性判定部１２は、新たに入力されたコンテンツの内容をキーワードと、既存のグループに分類されているコンテンツの内容を表すキーワードとが、少なくとも部分的に一致するとき、このコンテンツと他のコンテンツとを、互いに関連していると判定する。ここで、コンテンツ関連性判定部１２は、２つのキーワード間の距離が、あらかじめ定められた閾値よりも小さいとき、２つのコンテンツが互いに関連していると判定してもよい。 Based on this determination, the content classification unit 13 classifies the content into groups. That is, the group title extracted from the meta information of the content to be added and the keyword used for matching are stored in the classification database 6. Further, a group classification flag is set on the name of the group to be classified.
(Calculation of distance)
As described above, when the content relevance determination unit 12 at least partially matches the keyword of the content of the newly input content and the keyword indicating the content of the content classified into the existing group, It is determined that this content and other content are related to each other. Here, the content relevance determination unit 12 may determine that the two contents are related to each other when the distance between the two keywords is smaller than a predetermined threshold.

このとき、コンテンツ関連性判定部１２は、キーワードと、他のキーワードとの距離を算出する。さらに、この距離を、あらかじめ定められた閾値と比較する。この閾値は、コンテンツ分類装置１において、図示しないメモリにあらかじめ格納されている。コンテンツ関連性判定部１２は、算出した距離が、あらかじめ定められた閾値より小さいとき、比較した２つのキーワードを互いにマッチングすると判定する。２つのキーワードがマッチングしていると判定するとき、コンテンツ関連性判定部１２は、コンテンツと他のコンテンツとを、互いに関連していると判定する。
（ＤＰマッチング）
コンテンツ関連性判定部１２は、キーワードと、他のキーワードとの距離を、たとえば、ＤＰマッチングによって算出する。すなわち、このＤＰマッチングに基づき、キーワードと他のキーワードとがマッチングするか否かを判定する。このときコンテンツ関連性判定部１２は、キーワードを構成する文字の数を、文字数Ｎ１として算出する。さらに、他のキーワードを構成する文字の数を、文字数Ｎ２として算出する。 At this time, the content relevance determination unit 12 calculates the distance between the keyword and another keyword. Furthermore, this distance is compared with a predetermined threshold value. This threshold value is stored in advance in a memory (not shown) in the content classification device 1. When the calculated distance is smaller than a predetermined threshold, the content relevance determination unit 12 determines that the two compared keywords are matched with each other. When it is determined that the two keywords match, the content relevance determination unit 12 determines that the content and other content are related to each other.
(DP matching)
The content relevance determination unit 12 calculates the distance between the keyword and another keyword, for example, by DP matching. That is, based on this DP matching, it is determined whether or not the keyword matches another keyword. At this time, the content relevance determination unit 12 calculates the number of characters constituting the keyword as the number of characters N1. Further, the number of characters constituting another keyword is calculated as the number of characters N2.

コンテンツ関連性判定部１２は、ＤＰマッチングを使用するとき、キーワードと他のキーワードとの距離を、以下の式によって算出する。 When using DP matching, the content relevance determination unit 12 calculates the distance between a keyword and another keyword using the following formula.

（詳細例）
コンテンツ分類部１３が、コンテンツＡＢＣパート４をグループ「コンテンツＡＢＣ」に分類する例について、以下に説明する。 (Detailed example)
An example in which the content classification unit 13 classifies the content ABC part 4 into the group “content ABC” will be described below.

まず、コンテンツ関連性判定部１２は、コンテンツＡＢＣパート４を、グループ「コンテンツＡＢＣ」に分類するか否かを決定するとき、このグループの名前の属するカテゴリをチェックする。グループ「コンテンツＡＢＣ」の名前は、カテゴリ「タイトル」に属している。したがって、コンテンツ関連性判定部１２は、追加のコンテンツＡＢＣパート４の内容を表すメタ情報から、カテゴリ「タイトル」に属するキーワードを抽出する。コンテンツＡＢＣパート４のメタ情報では、カテゴリ「タイトル」に属するキーワードは、「コンテンツＡＢＣパート４」である。すなわち、キーワード抽出部１１は、キーワードとして、文字列「コンテンツＡＢＣパート４」を抽出する。 First, when determining whether to classify the content ABC part 4 into the group “content ABC”, the content relevance determination unit 12 checks the category to which the name of this group belongs. The name of the group “content ABC” belongs to the category “title”. Therefore, the content relevance determination unit 12 extracts keywords belonging to the category “title” from the meta information representing the content of the additional content ABC part 4. In the meta information of the content ABC part 4, the keyword belonging to the category “title” is “content ABC part 4”. That is, the keyword extraction unit 11 extracts the character string “content ABC part 4” as a keyword.

コンテンツ分類装置１において既存のグループ「コンテンツＡＢＣ」には、コンテンツＡＢＣ、コンテンツＡＢＣパート２、およびコンテンツＡＢＣパート３が分類されている。そこでキーワード抽出部１１は、コンテンツＡＢＣのメタ情報から、タイトルのキーワードとして、文字列「コンテンツＡＢＣ」を抽出する。さらに、コンテンツＡＢＣパート２のメタ情報から、タイトルのキーワードとして、文字列「コンテンツＡＢＣパート２」を抽出する。さらに、コンテンツＡＢＣパート３のメタ情報から、タイトルのキーワードとして、文字列「コンテンツＡＢＣパート３」を抽出する。
（算出距離の例）
コンテンツ関連性判定部１２は、キーワード「コンテンツＡＢＣパート４」と、グループ「コンテンツＡＢＣ」に含まれている各コンテンツのメタ情報から抽出される、タイトルのキーワードとの距離を、それぞれ、数式（１）に示すＤＰマッチングによって算出する。具体的には、キーワード「コンテンツＡＢＣパート４」とキーワード「コンテンツＡＢＣ」との距離を算出する。さらに、キーワード「コンテンツＡＢＣパート４」とキーワード「コンテンツＡＢＣパート２」との距離を算出する。さらに、キーワード「コンテンツＡＢＣパート４」とキーワード「コンテンツＡＢＣパート３」との距離を算出する。さらに、キーワード「コンテンツＡＢＣパート４」とグループの名前である文字列「コンテンツＡＢＣ」との距離を算出する。 In the content classification device 1, the content ABC, the content ABC part 2, and the content ABC part 3 are classified into the existing group “content ABC”. Therefore, the keyword extraction unit 11 extracts the character string “content ABC” as the title keyword from the meta information of the content ABC. Further, the character string “content ABC part 2” is extracted from the meta information of the content ABC part 2 as the keyword of the title. Further, the character string “content ABC part 3” is extracted from the meta information of the content ABC part 3 as a title keyword.
(Example of calculated distance)
The content relevance determination unit 12 sets the distance between the keyword “content ABC part 4” and the title keyword extracted from the meta information of each content included in the group “content ABC”, using the formula (1). ) By DP matching shown in FIG. Specifically, the distance between the keyword “content ABC part 4” and the keyword “content ABC” is calculated. Further, the distance between the keyword “content ABC part 4” and the keyword “content ABC part 2” is calculated. Further, the distance between the keyword “content ABC part 4” and the keyword “content ABC part 3” is calculated. Further, the distance between the keyword “content ABC part 4” and the character string “content ABC” which is the name of the group is calculated.

コンテンツ関連性判定部１２が算出する距離の例について、図５を参照して以下に説明する。図５は、コンテンツ関連性判定部１２が算出するキーワード間の距離の例を示す図である
この図によれば、コンテンツ関連性判定部１２は、キーワード「コンテンツＡＢＣパート４」と名前「コンテンツＡＢＣ」との距離を、０．３４５であると算出する。一方、キーワード「コンテンツＡＢＣパート４」と名前「コンテンツＡＢＣ」との距離を、０．３４５であると算出する。さらに、キーワード「コンテンツＡＢＣパート４」とキーワード「コンテンツＡＢＣパート２」との距離を、０．１５であると算出する。さらに、キーワード「コンテンツＡＢＣパート４」とキーワード「コンテンツＡＢＣパート３」との距離を、０．１５であると算出する。

（判定距離と基準最大距離との比較）
このように、コンテンツ関連性判定部１２は、入力されたコンテンツのメタ情報から抽出されるキーワードと、既存のグループの名前との距離を、０．３４５であると算出する。コンテンツ関連性判定部１２は、この距離を「判定距離」として算出する。一方、コンテンツ関連性判定部１２は、追加されるコンテンツのメタ情報から抽出されるキーワードと、既存のグループに含まれているコンテンツのメタ情報から抽出するキーワードとの距離のうち、もっとも大きい距離を、０．３４５であると算出する。コンテンツ関連性判定部１２は、この距離を「基準最大距離」として算出する。 An example of the distance calculated by the content relevance determination unit 12 will be described below with reference to FIG. FIG. 5 is a diagram illustrating an example of the distance between keywords calculated by the content relevance determining unit 12. According to this figure, the content relevance determining unit 12 includes the keyword “content ABC part 4” and the name “content ABC”. ”Is calculated to be 0.345. On the other hand, the distance between the keyword “content ABC part 4” and the name “content ABC” is calculated to be 0.345. Further, the distance between the keyword “content ABC part 4” and the keyword “content ABC part 2” is calculated to be 0.15. Further, the distance between the keyword “content ABC part 4” and the keyword “content ABC part 3” is calculated to be 0.15.

(Comparison between judgment distance and reference maximum distance)
As described above, the content relevance determination unit 12 calculates the distance between the keyword extracted from the input meta information of the content and the name of the existing group to be 0.345. The content relevance determination unit 12 calculates this distance as a “determination distance”. On the other hand, the content relevance determination unit 12 calculates the largest distance among the distances between the keyword extracted from the meta information of the content to be added and the keyword extracted from the meta information of the content included in the existing group. , 0.345. The content relevance determination unit 12 calculates this distance as a “reference maximum distance”.

上述したように、「判定距離」は、「基準最大距離」と同一である。すなわち、コンテンツ関連性判定部１２は、判定距離を基準最大距離以内であると算出する。これによりコンテンツ関連性判定部１２は、コンテンツＡＢＣパート４を、グループ「コンテンツＡＢＣ」に分類することを決定する。したがってコンテンツ分類部１３は、コンテンツＡＢＣパート４を、グループ「コンテンツＡＢＣ」に分類する。 As described above, the “determination distance” is the same as the “reference maximum distance”. That is, the content relevance determination unit 12 calculates the determination distance to be within the reference maximum distance. Thereby, the content relevance determination unit 12 determines to classify the content ABC part 4 into the group “content ABC”. Therefore, the content classification unit 13 classifies the content ABC part 4 into the group “content ABC”.

このように、判定距離が基準最大距離以内であると判定されるとき、コンテンツ分類部１３は、コンテンツをグループに追加する。すなわち、コンテンツ関連性判定部１２は、算出した距離があらかじめ定められた所定の閾値より小さいとき、キーワードと他のキーワードとをマッチングしていると判定している。
（分類データベース６の更新）
コンテンツ分類部１３は、グループに追加するコンテンツＡＢＣパート４のタイトルを、分類データベース６に追加する。すなわち、コンテンツＡＢＣパート４のメタ情報から抽出され、コンテンツのマッチングに使用された文字列「コンテンツＡＢＣパート４」を、グループ「コンテンツＡＢＣ」に関連付けて記憶する。さらに、グループ「コンテンツＡＢＣ」の欄に、コンテンツＡＢＣパート４がグループ「コンテンツＡＢＣ」に分類されていることを示すグループ分類フラグを立てる。すなわちこのグループ分類フラグを「１」に設定する。
（部分文字列の使用）
上述したように、コンテンツ分類装置１は、入力されたコンテンツのメタ情報から、カテゴリ「タイトル」に属するキーワード「コンテンツＡＢＣパート４」を抽出し、キーワードのマッチングに使用する。ここでコンテンツ関連性判定部１２は、文字列「コンテンツＡＢＣパート４」から、その部分文字列を抽出し、マッチングに使用してもよい。たとえば、コンテンツＡＢＣパート４から、部分文字列「コンテンツ」、または部分文字列「ＡＢＣ４」を抽出し、マッチングに使用してもよい。
（新たなグループの作成）
以上のように、コンテンツ分類装置１は、入力されるコンテンツを、既存のグループに分類することができる。これに加えて、コンテンツ分類装置１は、新たなグループを作成し、このグループにコンテンツを分類することもできる。このときコンテンツ分類装置１は、複数のコンテンツから、１つのコンテンツを抽出し、抽出したコンテンツと、他のコンテンツとが互いに関連するか否かを判定する。これにより、互いに関連するコンテンツを複数のコンテンツを抽出し、それらを、新たに作成する１つのグループに分類する。この処理の具体例について、以下に説明する。 Thus, when it is determined that the determination distance is within the reference maximum distance, the content classification unit 13 adds the content to the group. That is, the content relevance determination unit 12 determines that the keyword is matched with another keyword when the calculated distance is smaller than a predetermined threshold.
(Update of classification database 6)
The content classification unit 13 adds the title of the content ABC part 4 to be added to the group to the classification database 6. That is, the character string “content ABC part 4” extracted from the meta information of the content ABC part 4 and used for content matching is stored in association with the group “content ABC”. Further, a group classification flag indicating that the content ABC part 4 is classified into the group “content ABC” is set in the field of the group “content ABC”. That is, this group classification flag is set to “1”.
(Use substring)
As described above, the content classification device 1 extracts the keyword “content ABC part 4” belonging to the category “title” from the input meta information of the content, and uses it for keyword matching. Here, the content relevance determination unit 12 may extract the partial character string from the character string “content ABC part 4” and use it for matching. For example, the partial character string “content” or the partial character string “ABC4” may be extracted from the content ABC part 4 and used for matching.
(Create a new group)
As described above, the content classification device 1 can classify input content into existing groups. In addition to this, the content classification device 1 can also create a new group and classify the content into this group. At this time, the content classification device 1 extracts one content from a plurality of contents, and determines whether or not the extracted content and other content are related to each other. Thereby, a plurality of contents are extracted from the contents related to each other, and these are classified into one newly created group. A specific example of this process will be described below.

図６は、コンテンツ分類装置１が管理しているグループ、および各グループに分類されているコンテンツの例を表す図であり、（ａ）はグループの例を表す図であり、（ｂ）はコンテンツの例を表す図であり、（ｃ）はコンテンツをさらに詳細に説明する例を表す図である
図６（ａ）に示す例では、コンテンツ分類装置１は、２つのグループを作成している。すなわちグループ「コンテンツＡＢＣ」とグループ「山田」とである。グループ「コンテンツＡＢＣ」の作成日は、２００５年１月２１日である。グループ「コンテンツＡＢＣ」の名前は、カテゴリ「タイトル」に属している。すなわち、コンテンツ分類装置１は、カテゴリ「タイトル」に属するキーワードをマッチングに使用することによって、このグループを作成している。 FIG. 6 is a diagram illustrating an example of a group managed by the content classification device 1 and content classified into each group, (a) is a diagram illustrating an example of a group, and (b) is a content. FIG. 6C is a diagram illustrating an example for explaining the content in more detail. In the example illustrated in FIG. 6A, the content classification device 1 creates two groups. That is, the group “content ABC” and the group “Yamada”. The creation date of the group “content ABC” is January 21, 2005. The name of the group “content ABC” belongs to the category “title”. That is, the content classification device 1 creates this group by using keywords belonging to the category “title” for matching.

図６に示すように、グループ「山田」の作成日は、２００５年２月２８日である。また、グループ「山田」の名前は、カテゴリ「作成者」に属している。すなわち、コンテンツ分類装置１は、カテゴリ「作成者」に属するキーワードをマッチングに使用することによって、グループ「山田」をを作成している。
（コンテンツおよびグループの例）
図６（ｂ）に示すように、コンテンツ分類装置１は、コンテンツデータベース５に、６つのコンテンツを格納している。すなわち、３つのコンテンツ「ミュージックα」と、１つのコンテンツ「衝撃映像スクープ」と、１つのコンテンツ「歌謡リクエスト」と、１つコンテンツ「ミュージックＢ」とを格納している。 As shown in FIG. 6, the creation date of the group “Yamada” is February 28, 2005. The name of the group “Yamada” belongs to the category “Creator”. That is, the content classification device 1 creates the group “Yamada” by using keywords belonging to the category “creator” for matching.
(Example of content and group)
As shown in FIG. 6B, the content classification device 1 stores six contents in the content database 5. That is, three content “music α”, one content “shock video scoop”, one content “singing request”, and one content “music B” are stored.

コンテンツ分類装置１が３つコンテンツ「ミュージックα」をコンテンツデータベース５に追加した日付は、それぞれ、２００５年２月１４日、２００５年２月２１日、および２００５年２月２８日である。また、コンテンツ「衝撃映像スクープ」をコンテンツデータベース５に追加した日付は、２００５年２月２３日である。さらに、コンテンツ「歌謡リクエスト」をコンテンツデータベース５に追加した日付は、２００５年２月２５日である。さらに、コンテンツ「歌謡リクエスト」をコンテンツデータベース５に追加した日付は、２００５年３月３１日である。 The dates when the content classification device 1 added the three contents “music α” to the content database 5 are February 14, 2005, February 21, 2005, and February 28, 2005, respectively. The date when the content “impact video scoop” was added to the content database 5 is February 23, 2005. Furthermore, the date when the content “kayo request” is added to the content database 5 is February 25, 2005. Furthermore, the date on which the content “kayo request” is added to the content database 5 is March 31, 2005.

コンテンツ分類装置１は、コンテンツの内容を表すメタ情報も、各コンテンツと共に、それぞれ、コンテンツデータベース５に格納している。 The content classification device 1 also stores meta information representing the content content in the content database 5 together with each content.

図６（ｃ）に示す例では、ユーザは、作成日が２００５年２月１４日であるコンテンツ「ミュージックα」を選択する。ユーザが選択したコンテンツの情報は、入力部２が受け付け、コンテンツ管理部１０に出力する。これにより、キーワード抽出部１１は、コンテンツ「ミュージックα」のメタ情報から、キーワードを抽出する。すなわち、カテゴリ「タイトル」に属するキーワードとして、文字列「ミュージックα」を抽出する。さらに、カテゴリ「ジャンル」に属するキーワードとして、文字列「ミュージック」を抽出する。さらに、カテゴリ「作成者」に属するキーワードとして、文字列「田中一朗」を抽出する。 In the example shown in FIG. 6C, the user selects the content “music α” whose creation date is February 14, 2005. Information on the content selected by the user is received by the input unit 2 and output to the content management unit 10. As a result, the keyword extraction unit 11 extracts keywords from the meta information of the content “music α”. That is, the character string “music α” is extracted as a keyword belonging to the category “title”. Further, the character string “music” is extracted as a keyword belonging to the category “genre”. Further, the character string “Ichiro Tanaka” is extracted as a keyword belonging to the category “creator”.

キーワード抽出部１１は、他の５つのコンテンツのメタ情報からも、それぞれ、キーワードを抽出する。
（グループ「ミュージックα」の作成）
コンテンツ関連性判定部１２は、同一のカテゴリに属するキーワードを使用することによって、ＤＰマッチングを実行する。図６（ｃ）に示すように、カテゴリ「タイトル」に属するキーワード「ミュージックα」に、完全にマッチングするキーワード、すなわちキーワード「ミュージックα」が、３つある。すなわち、タイトルがミュージックαであるコンテンツが、３つある。したがって、コンテンツ関連性判定部１２は、これら３つのコンテンツを、互いに関連していると判定する。 The keyword extraction unit 11 also extracts keywords from the meta information of the other five contents.
(Create group “Music α”)
The content relevance determination unit 12 performs DP matching by using keywords belonging to the same category. As shown in FIG. 6C, the keyword “music α” belonging to the category “title” has three keywords that match perfectly, that is, the keyword “music α”. That is, there are three contents whose title is music α. Therefore, the content relevance determination unit 12 determines that these three contents are related to each other.

グループ作成部１５は、マッチングするキーワード、すなわち文字列「ミュージックα」を、作成するグループの名前に設定する。すなわち、グループ作成部１５は、名前がミュージックαであるグループを作成する。これにより、コンテンツ分類部１３は、３つのコンテンツ「ミュージックα」を、グループ作成部１５が作成したグループ「ミュージックα」に分類する。
（グループ「音楽」の作成）
図６（ｃ）に示す例では、さらに、カテゴリ「ジャンル」に属するキーワード「音楽」に完全にマッチングするキーワードが、５つある。すなわち、ジャンルが音楽であるコンテンツが、５つある。したがってコンテンツ関連性判定部１２は、これら５つのコンテンツを、互いに関連していると判定する。 The group creation unit 15 sets the matching keyword, that is, the character string “music α” as the name of the group to be created. That is, the group creation unit 15 creates a group whose name is music α. Accordingly, the content classification unit 13 classifies the three content “music α” into the group “music α” created by the group creation unit 15.
(Create group “music”)
In the example shown in FIG. 6C, there are further five keywords that perfectly match the keyword “music” belonging to the category “genre”. That is, there are five contents whose genre is music. Therefore, the content relevance determination unit 12 determines that these five contents are related to each other.

グループ作成部１５は、マッチングするキーワード、すなわち文字列「音楽」を、作成するグループの名前に設定する。すなわち、名前が音楽であるグループを作成する。コンテンツ分類部１３は、５つのコンテンツを、グループ作成部１５が作成したグループ「ミュージックα」に分類する。
（グループ「田中一朗」の作成）
図６（ｃ）に示す例では、さらに、カテゴリ「作成者」に属するキーワード「田中一朗」に完全にマッチングするキーワードが、２つある。すなわち、作成者が田中一朗であるコンテンツが、２つある。これにより、コンテンツ関連性判定部１２は、これら２つのコンテンツを、互いに関連していると判定する。 The group creating unit 15 sets the matching keyword, that is, the character string “music” as the name of the group to be created. That is, a group whose name is music is created. The content classification unit 13 classifies the five contents into the group “music α” created by the group creation unit 15.
(Created by the group “Ichiro Tanaka”)
In the example shown in FIG. 6C, there are two more keywords that completely match the keyword “Ichiro Tanaka” belonging to the category “creator”. That is, there are two contents whose creator is Ichiro Tanaka. Thereby, the content relevance determination unit 12 determines that these two contents are related to each other.

グループ作成部１５は、マッチングするキーワード、すなわち文字列「田中一朗」を、作成するグループの名前に設定する。すなわち、名前が田中一朗であるグループを作成する。コンテンツ分類部１３は、２つのコンテンツを、グループ作成部１５が作成したグループ「ミュージックα」に分類する。 The group creation unit 15 sets the matching keyword, that is, the character string “Ichiro Tanaka” as the name of the group to be created. That is, a group whose name is Ichiro Tanaka is created. The content classification unit 13 classifies the two contents into the group “music α” created by the group creation unit 15.

このように、コンテンツ分類部１３は、キーワードおよび他のキーワードに共通している文字列が名前に設定されているグループに、コンテンツおよび他のコンテンツを分類する。これにより、コンテンツとの関連が深いグループに、コンテンツを分類できる。
（閾値と比較することによって判定）
コンテンツ関連性判定部１２は、比較するキーワードと、他のキーワードとの距離が、あらかじめ定められた閾値よりも小さいとき、２つのコンテンツを互いに関連していると判定してもよい。たとえば、コンテンツ関連性判定部１２は、キーワード「ミュージックα」と、キーワード「ミュージックＢ」との距離を、ＤＰマッチングによって、０．２８であると算出する。このとき、あらかじめ定められた閾値を０．３５であるとする。この閾値は、図示しないメモリにあらかじめ記憶されている。 In this way, the content classification unit 13 classifies content and other content into groups in which character strings common to the keyword and other keywords are set as names. As a result, the content can be classified into groups that are closely related to the content.
(Determined by comparing with threshold)
The content relevance determination unit 12 may determine that two contents are related to each other when the distance between the keyword to be compared and another keyword is smaller than a predetermined threshold. For example, the content relevance determination unit 12 calculates the distance between the keyword “music α” and the keyword “music B” to be 0.28 by DP matching. At this time, it is assumed that the predetermined threshold is 0.35. This threshold value is stored in advance in a memory (not shown).

コンテンツ関連性判定部１２は、算出した距離（０．２８）と、図示しないメモリにあらかじめ記憶されている閾値（０．３５）とを比較することによって、キーワード「ミュージックα」とキーワード「ミュージックＢ」との距離が、あらかじめ定められた閾値よりも小さいことを判定する。これによりコンテンツ関連性判定部１２は、キーワード「ミュージックα」とキーワード「ミュージックＢ」とを、完全ではないが、互いにマッチングしていると判定する。したがって、タイトルがミュージックαであるコンテンツと、タイトルがミュージックＢであるコンテンツとを、互いに関連していると判定する。 The content relevance determination unit 12 compares the calculated distance (0.28) with a threshold value (0.35) stored in advance in a memory (not shown) to thereby determine the keyword “music α” and the keyword “music B”. Is determined to be smaller than a predetermined threshold. Thereby, the content relevance determination unit 12 determines that the keyword “music α” and the keyword “music B” are not perfect but match each other. Therefore, it is determined that the content whose title is music α and the content whose title is music B are related to each other.

これにより、コンテンツ分類部１３は、コンテンツ「ミュージックＢ」を、グループ「ミュージックα」に分類することができる。すなわち、キーワードが完全にマッチングする場合よりも、より多いコンテンツをグループに分類できる。
（部分文字列の利用）
上述したように、コンテンツ関連性判定部１２は、キーワード抽出部１１が抽出したキーワードを、そのまま使用できる。しかし、このとき、次のような問題が生ずる可能性がある。たとえば、キーワード抽出部１１がキーワードを抽出し、コンテンツ関連性判定部１２がマッチングするコンテンツを検索したとき、マッチングするコンテンツが、所定の数よりも少なくなる可能性がある。さらに、キーワード抽出部１１が抽出したキーワードが、必要以上に長く、マッチングに適さない場合もある。 Thereby, the content classification unit 13 can classify the content “music B” into the group “music α”. That is, more content can be classified into groups than when the keywords match perfectly.
(Use of substring)
As described above, the content relevance determination unit 12 can use the keyword extracted by the keyword extraction unit 11 as it is. However, the following problems may occur at this time. For example, when the keyword extraction unit 11 extracts keywords and the content relevance determination unit 12 searches for matching content, the matching content may be less than a predetermined number. Furthermore, the keyword extracted by the keyword extraction unit 11 may be longer than necessary and is not suitable for matching.

これらの問題を回避するため、キーワード抽出部１１は、抽出されたキーワードの少なくとも一部を、部分文字列としてさらに抽出してもよい。このときコンテンツ関連性判定部１２は、キーワード抽出部１１が抽出した部分文字列を、マッチングに使用すればよい。このとき、コンテンツ関連性判定部１２は、より短いキーワードを、コンテンツのマッチングに使用することになる。 In order to avoid these problems, the keyword extraction unit 11 may further extract at least a part of the extracted keywords as a partial character string. At this time, the content relevance determination unit 12 may use the partial character string extracted by the keyword extraction unit 11 for matching. At this time, the content relevance determination unit 12 uses a shorter keyword for content matching.

キーワード抽出部１１は、文字列「ミュージックα」から、文字列「ミュージック」および文字列「α」を、それぞれ部分文字列として抽出すればよい。さらに、コンテンツ関連性判定部１２は、これらの部分文字列を、それぞれマッチングに使用してもよい。 The keyword extraction unit 11 may extract the character string “music” and the character string “α” as partial character strings from the character string “music α”. Furthermore, the content relevance determination unit 12 may use these partial character strings for matching.

コンテンツ関連性判定部１２は、たとえば、テキスト「ミュージックα」およびテキスト「ミュージックＢ」を、互いに完全にマッチングしないと判定する。
このとき、キーワード抽出部１１は、「ミュージックα」から、部分文字列として、「ミュージック」および「α」を抽出する。さらに、「ミュージックＢ」から、部分文字列として、「ミュージック」および「β」を抽出する。コンテンツ関連性判定部１２は、キーワード抽出部１１が抽出したこれらの部分文字列を、キーワードとしてマッチングに使用する。このとき、コンテンツ関連性判定部１２は、「ミュージックα」から抽出される「ミュージック」と、「ミュージックＢ」から抽出される「ミュージック」とを、互いに完全にマッチングしていると判定する。
（部分文字列をグループの名前に設定する）
グループ作成部１５は、互いに完全にマッチングする部分文字列を、グループの名前に設定してもよい。すなわち、完全にマッチングする部分文字列を名前として有するグループを作成してもよい。たとえば、「ミュージックα」から抽出される「ミュージック」と、「ミュージックＢ」から抽出される「ミュージック」とは、完全にマッチングする。そこでグループ作成部１５は、文字列「ミュージック」を、作成するグループの名前に設定してもよい。
（部分文字列の抽出法）
キーワード抽出部１１は、所定のアルゴリズムに基づき、キーワードから部分文字列を抽出すればよい。たとえば、キーワードに含まれているひらがな、かたかな、漢字、記号など、異なる文字によって区切られる所定長の文字列を、部分文字列として抽出すればよい。または、キーワードとして使用する部分文字列をあらかじめ格納しているキーワード辞書を参照することによって、この辞書に含まれている部分文字列をキーワードから抽出してもよい。さらに、キーワード抽出部１１は、キーワードを形態素解析して、たとえば名詞のみを部分文字列として抽出してもよい。
（コンテンツをグループに分類しない場合）
コンテンツ分類部１３は、場合によっては、コンテンツデータベース５に記憶されているコンテンツのほとんど全てを、１つのグループに分類することもある。これでは、コンテンツをグループに分類する意味がない。そこでコンテンツ分類部１３は、コンテンツのキーワードがグループの名前にマッチングしている場合であっても、コンテンツをグループに分類しなくてもよい。すなわち、コンテンツが互いに関連していると判定されている場合であっても、グループ作成部１５は、これらのコンテンツを分類するためグループを、作成しなくてもよい。なお、グループ作成部１５は、コンテンツデータベース５に記憶されている全てのコンテンツのうち、新たに作成するグループに含むコンテンツが、どのくらいの割合を占めるかに応じて、そのグループを作成するか否かを決定してもよい。
（コンテンツを複数のグループへ分類）
コンテンツ関連性判定部１２は、入力されるコンテンツと、既存のグループにすでに分類されているコンテンツが、互いに関連しているかを判定してもよい。これにより、コンテンツ分類部１３は、１つのコンテンツを、異なる複数のグループに分類できる。
（キーワードの優先度）
上述したように、コンテンツ関連性判定部１２は、メタ情報に含まれているキーワードが互いにマッチングするとき、２つのコンテンツを互いに作成していると判定する。キーワードが互いにマッチングするとき、コンテンツ分類部１３は、これら２つのコンテンツを、共通するグループに分類する。ここでコンテンツ関連性判定部１２は、マッチングに使用するキーワードを、キーワードに設定されている優先度に基づき決定してもよい。 For example, the content relevance determination unit 12 determines that the text “music α” and the text “music B” do not completely match each other.
At this time, the keyword extraction unit 11 extracts “music” and “α” as partial character strings from “music α”. Further, “music” and “β” are extracted as partial character strings from “music B”. The content relevance determination unit 12 uses these partial character strings extracted by the keyword extraction unit 11 as keywords for matching. At this time, the content relevance determination unit 12 determines that “music” extracted from “music α” and “music” extracted from “music B” completely match each other.
(Set the substring to the name of the group)
The group creation unit 15 may set partial character strings that completely match each other as the group name. That is, you may create the group which has the partial character string which matches completely as a name. For example, “music” extracted from “music α” and “music” extracted from “music B” are completely matched. Therefore, the group creation unit 15 may set the character string “music” as the name of the group to be created.
(Substring extraction method)
The keyword extraction part 11 should just extract a partial character string from a keyword based on a predetermined algorithm. For example, a character string of a predetermined length separated by different characters such as hiragana, kana, kanji, and symbols included in the keyword may be extracted as a partial character string. Alternatively, a partial character string included in this dictionary may be extracted from the keyword by referring to a keyword dictionary in which partial character strings used as keywords are stored in advance. Further, the keyword extraction unit 11 may perform morphological analysis on the keyword and extract, for example, only a noun as a partial character string.
(When content is not classified into groups)
In some cases, the content classification unit 13 may classify almost all of the content stored in the content database 5 into one group. This makes no sense to classify content into groups. Therefore, the content classification unit 13 does not have to classify content into groups even when the keyword of the content matches the name of the group. That is, even if it is determined that the contents are related to each other, the group creating unit 15 does not have to create a group to classify these contents. It should be noted that the group creation unit 15 determines whether to create the group depending on how much of the content stored in the content database 5 is included in the newly created group. May be determined.
(Categorize content into multiple groups)
The content relevance determination unit 12 may determine whether the input content and the content already classified into the existing group are related to each other. Accordingly, the content classification unit 13 can classify one content into a plurality of different groups.
(Keyword priority)
As described above, when the keywords included in the meta information match each other, the content relevance determination unit 12 determines that the two contents are created with each other. When the keywords match each other, the content classification unit 13 classifies these two contents into a common group. Here, the content relevance determination unit 12 may determine the keyword used for matching based on the priority set for the keyword.

まず、作成されるグループの名前に設定できるキーワードとして、タイトルカテゴリに属するテキスト「ミュージック」と、作成者カテゴリに属するテキスト「田中」と、ジャンルカテゴリに属するテキスト「音楽」とがあるとする。このとき優先度設定部１４は、キーワード候補に設定する優先度を決定する。この優先度は、複数のキーワード候補のうち、どのキーワード候補をキーワードとして使用するかの指標となる値である。優先度設定部１４は、決定する優先度に関する情報を、コンテンツ関連性判定部１２に出力する。 First, as keywords that can be set as the name of the group to be created, it is assumed that there is a text “music” belonging to the title category, a text “Tanaka” belonging to the creator category, and a text “music” belonging to the genre category. At this time, the priority setting unit 14 determines the priority set for the keyword candidate. This priority is a value that serves as an index of which keyword candidate to use as a keyword among a plurality of keyword candidates. The priority setting unit 14 outputs information on the priority to be determined to the content relevance determination unit 12.

図６（ｂ）に示す例では、グループ作成部１５は、キーワード「ミュージック」が名前に設定されているグループを作成している。コンテンツ分類部１３は、このグループに、タイトルがミュージックαである３つのコンテンツと、タイトルがミュージックＢである１つのコンテンツとを、分類している。すなわち、グループ作成部１５は、ミュージックαおよびミュージックＢに完全にマッチングするテキスト「ミュージック」を抽出し、このテキストをグループの名前に設定している。 In the example shown in FIG. 6B, the group creation unit 15 creates a group in which the keyword “music” is set as the name. The content classifying unit 13 classifies the three contents whose title is music α and the one content whose title is music B into this group. That is, the group creating unit 15 extracts text “music” that perfectly matches music α and music B, and sets this text as the name of the group.

さらに、グループ作成部１５は、キーワード「田中」が名前に設定されているグループを作成している。コンテンツ分類部１３は、このグループに、作成者が田中一朗である１つのコンテンツと、作成者が田中四朗である２つのコンテンツと、作成者が田中五朗である１つのコンテンツとを、それぞれ分類している。すなわちキーワード抽出部１１は、田中一朗、田中四朗、および田中五朗に完全にマッチングするテキスト「田中」を抽出している。なお、この例では、田中一朗、田中四朗、および田中五朗から任意に選択される２つのキーワードの距離は、いずれも、あらかじめ定められた閾値よりも小さいとする。 Further, the group creation unit 15 creates a group in which the keyword “Tanaka” is set as the name. The content classification unit 13 classifies into this group one content whose creator is Ichiro Tanaka, two contents whose creator is Shiro Tanaka, and one content whose creator is Goro Tanaka. ing. That is, the keyword extraction unit 11 extracts the text “Tanaka” that perfectly matches Ichiro Tanaka, Shiro Tanaka, and Goro Tanaka. In this example, the distance between two keywords arbitrarily selected from Ichiro Tanaka, Shiro Tanaka, and Goro Tanaka is assumed to be smaller than a predetermined threshold.

優先度設定部１４は、ユーザが選択したキーワードに、より高い優先度を設定すればよい。または、コンテンツ分類装置１がキーワード候補から任意に選択したキーワード候補に、より高い優先度を設定してもよい。 The priority setting unit 14 may set a higher priority for the keyword selected by the user. Or you may set a higher priority to the keyword candidate arbitrarily selected from the keyword candidates by the content classification device 1.

コンテンツ分類装置１がユーザにキーワード候補を提供し、ユーザが選択したキーワード候補に、より高い優先度を設定する例について、図８を参照して以下に説明する。図８は、ユーザにキーワードを選択させるための表示画面を表す図である。
（ユーザがキーワードを選択する例）
この図によれば、表示部３は、キーワード候補として、ミュージック、田中、および音楽をディスプレイ４に表示している。さらに、各グループに現在分類されているコンテンツのタイトルを、同時に表示している。これにより表示部３は、各グループに分類されているコンテンツの名前を、ユーザに通知している。 An example in which the content classification device 1 provides keyword candidates to the user and a higher priority is set for the keyword candidate selected by the user will be described below with reference to FIG. FIG. 8 is a diagram illustrating a display screen for allowing the user to select a keyword.
(Example of user selecting a keyword)
According to this figure, the display unit 3 displays music, Tanaka, and music on the display 4 as keyword candidates. Furthermore, the titles of the contents currently classified into each group are displayed at the same time. Thereby, the display unit 3 notifies the user of the name of the content classified into each group.

このとき表示部３は、キーワード候補のテキストと共に、このテキストが属するカテゴリをディスプレイ４に表示してもよい。ユーザは、ディスプレイ４に表示されているキーワード候補から、作成したいグループの名前として、任意のキーワード候補を選択する。すなわちユーザは、１つのキーワード候補を選択する。または、複数のキーワード候補を選択してもよい。この例では、１つのキーワード候補として、ユーザは、ミュージックを選択している。すなわち表示部３は、ユーザが選択している「選択」ボタンを反転してディスプレイ４に表示している。
（優先度設定のルール）
上述したようにコンテンツ分類装置１は、あらかじめ定められたルールに基づき、キーワードの優先度を決定してもよい。このルールは、コンテンツ分類装置１において、図示しないメモリにあらかじめ記憶されている。このルールとして、たとえば、タイトルカテゴリに属するキーワードに、より高い優先度を設定し、ジャンルカテゴリに属するキーワードに、より低い優先度を設定する、というものがある。 At this time, the display unit 3 may display the category to which the text belongs on the display 4 together with the keyword candidate text. The user selects an arbitrary keyword candidate as the name of the group to be created from the keyword candidates displayed on the display 4. That is, the user selects one keyword candidate. Alternatively, a plurality of keyword candidates may be selected. In this example, the user selects music as one keyword candidate. That is, the display unit 3 inverts the “select” button selected by the user and displays it on the display 4.
(Priority setting rules)
As described above, the content classification device 1 may determine the priority of keywords based on a predetermined rule. This rule is stored in advance in a memory (not shown) in the content classification device 1. As this rule, for example, a higher priority is set for a keyword belonging to a title category, and a lower priority is set for a keyword belonging to a genre category.

優先度設定部１４は、キーワード候補の長さに基づき、設定する優先度を決定してもよい。このとき、より長いキーワード候補、すなわち、文字数がより多いキーワード候補に、より高い優先度を設定すればよい。または、優先度設定部１４は、キーワード候補が、あるグループの名前に設定されているとき、このグループに分類されているコンテンツを表すキーワードに設定されている優先度と、同一の優先度を、このキーワード候補に設定してもよい。このとき、より多くのコンテンツが分類されているグループの名前と同一であるキーワードに、より高い優先度を設定してもよい。 The priority setting unit 14 may determine the priority to be set based on the length of the keyword candidate. At this time, a higher priority may be set for a longer keyword candidate, that is, a keyword candidate having a larger number of characters. Or, when the keyword candidate is set to the name of a certain group, the priority setting unit 14 sets the same priority as the priority set to the keyword representing the content classified into this group, This keyword candidate may be set. At this time, a higher priority may be set for a keyword that is the same as the name of a group into which more content is classified.

優先度設定部１４は、ユーザによってキーワード候補の選択、またはコンテンツ分類装置１によってキーワード候補の選択に基づき、各キーワード候補に優先度を設定する優先度を決定する。たとえば、ユーザがキーワード候補を複数選択したとき、これら複数のキーワード候補に同一の優先度を設定すれば。
（キーワード学習に基づき優先度を設定する）
優先度設定部１４は、キーワードを選択するとき、嗜好学習部１７が学習したキーワードの情報を使用してもよい。上述したように、嗜好学習部１７ａは、ユーザが選択したキーワード候補およびグループ、さらに、ユーザが削除したグループを学習している。これにより優先度設定部１４は、ユーザが嗜好しているキーワードを、優先度をより高く設定するキーワードとして選択してもよい。または、優先度設定部１４は、ユーザが嗜好しないキーワード以外のキーワードを、優先度をより高く設定するキーワードとして選択してもよい。 The priority setting unit 14 determines the priority for setting the priority for each keyword candidate based on the selection of keyword candidates by the user or the selection of keyword candidates by the content classification device 1. For example, when the user selects a plurality of keyword candidates, the same priority is set for the plurality of keyword candidates.
(Set priority based on keyword learning)
The priority setting unit 14 may use the keyword information learned by the preference learning unit 17 when selecting a keyword. As described above, the preference learning unit 17a learns keyword candidates and groups selected by the user, and groups deleted by the user. Thereby, the priority setting part 14 may select the keyword which the user likes as a keyword which sets a priority higher. Or the priority setting part 14 may select keywords other than the keyword which a user does not like as a keyword which sets a priority higher.

ユーザは、入力部２を通じて、ユーザに関する情報を入力する。この情報として、ユーザは、たとえば、ユーザの名前、ユーザの所属などのプロファイル情報、ユーザの好みを表す情報を入力する。たとえば、ユーザは、音楽を嗜好していることを表す情報を、入力する。これにより、コンテンツ分類装置１は、この情報を、図示しないメモリに記憶し、管理している。 The user inputs information about the user through the input unit 2. As this information, for example, the user inputs profile information such as the user's name, the user's affiliation, and information indicating the user's preferences. For example, the user inputs information indicating that the user likes music. Thus, the content classification device 1 stores and manages this information in a memory (not shown).

優先度設定部１４は、コンテンツ分類装置１に記憶されているユーザ情報に基づき、たとえば、キーワード「ミュージック」の優先度を、他のキーワードの優先度よりもより高くする。すなわち、ユーザが嗜好しているキーワードにマッチングしているキーワードの優先度を、他のキーワードの優先度よりもより高くする。
（分類構成に基づき優先度を設定する例）
優先度設定部１４は、既存のグループの分類構成に基づき、キーワードの優先度を決定してもよい。これにより、グループ作成部１５は、作成するグループの分類構成を、すでに作成されている他のグループの分類構成に類似させることができる。この例について、図１２を参照して以下に説明する。図１２は、グループの階層構造を表す図である。 For example, the priority setting unit 14 sets the priority of the keyword “music” to be higher than the priority of other keywords based on the user information stored in the content classification device 1. That is, the priority of the keyword that matches the keyword that the user likes is set higher than the priority of other keywords.
(Example of setting priority based on classification configuration)
The priority setting unit 14 may determine the priority of the keyword based on the existing group classification configuration. Thereby, the group creation unit 15 can resemble the classification configuration of the group to be created with the classification configuration of another group that has already been created. This example will be described below with reference to FIG. FIG. 12 is a diagram illustrating a hierarchical structure of groups.

この図によれば、グループ作成部１５は、分類データベース６において、グループ「山田」を作成している。このグループの名前は、作成者カテゴリに属している。さらに、グループ作成部１５は、このグループの下に、グループ「情報」を作成している。このグループの名前は、タイトルカテゴリに属している。すなわちグループ作成部１５は、これら２つのグループを作成したとき、まず、コンテンツを、作成者のキーワードによってマッチングするコンテンツを分類するためグループを作成し、さらに、このグループに分類されるコンテンツのうち、タイトルのキーワードによってマッチングするコンテンツを分類するためグループを作成している。 According to this figure, the group creation unit 15 creates a group “Yamada” in the classification database 6. The name of this group belongs to the creator category. Further, the group creation unit 15 creates a group “information” under this group. The name of this group belongs to the title category. That is, when these two groups are created, the group creating unit 15 first creates a group for classifying content that matches the content by the creator's keyword, and among the contents classified into this group, A group is created to classify the matching content by the keyword of the title.

グループ作成部１５は、この分類構成に基づき、新たなコンテンツを分類するためグループを作成する。コンテンツ分類部１３は、グループ作成部１５が作成した分類構成のグループに、コンテンツを分類する。すなわち、グループ作成部１５は、まず、作成者のキーワードによってマッチングするコンテンツを分類するためグループを作成する。これにより、コンテンツ分類部１３は、作成者のキーワードによってマッチングするコンテンツを、このグループに分類する。つぎに、グループ作成部１５は、このグループに分類されるコンテンツのうち、タイトルのキーワードによってマッチングするコンテンツを分類するためグループを作成する。これにより、コンテンツ分類部１３は、タイトルのキーワードによってマッチングするコンテンツを、このグループに分類する。 The group creation unit 15 creates a group for classifying new content based on this classification configuration. The content classification unit 13 classifies the content into groups with the classification configuration created by the group creation unit 15. In other words, the group creating unit 15 first creates a group for classifying content that matches with the creator's keyword. As a result, the content classification unit 13 classifies content matched by the creator's keyword into this group. Next, the group creating unit 15 creates a group for classifying content matched by the keyword of the title among the contents classified into this group. As a result, the content classification unit 13 classifies the content matched by the keyword of the title into this group.

分類構成が類似するグループとは、グループの名前が属するカテゴリが、より上位のグループ同士で一致し、さらに、どのグループ同士で一致することを意味する。
（優先度の設定）
たとえば、優先度設定部１４は、キーワード候補として、作成者カテゴリに属する田中と、タイトルカテゴリに属するミュージックと、ジャンルカテゴリに属する音楽とを処理するとする。上述したように、すでに作成されているグループの分類構成では、作成者カテゴリに属する「山田」が名前に設定されているグループが、他のグループより上位に存在している。このとき優先度設定部１４は、キーワード候補「田中」に設定する優先度を、キーワード候補「ミュージックα」および「音楽」に設定する優先度よりもより高くする。 A group having a similar classification structure means that a category to which a group name belongs matches among higher groups, and further matches between groups.
(Priority setting)
For example, it is assumed that the priority setting unit 14 processes, as keyword candidates, Tanaka belonging to the creator category, music belonging to the title category, and music belonging to the genre category. As described above, in the group classification configuration that has already been created, a group in which “Yamada” belonging to the creator category is set as the name is present higher than other groups. At this time, the priority setting unit 14 sets the priority set for the keyword candidate “Tanaka” higher than the priority set for the keyword candidates “music α” and “music”.

優先度設定部１４は、すでに作成されているグループの分類構成に基づき、キーワードの優先度を決定したとき、表示部３は、より高い優先度が設定されているキーワードを、表示画面における一番上に表示すればよい。
（まとめ）
このようにコンテンツ管理部１０は、グループの分類構成を統一する。これにより、ユーザは、コンテンツを検索するとき、グループの分類構成を把握しやすい。したがって、コンテンツを検索しやすい。このように、優先度設定部１４は、既存のグループの分類構成に基づき、キーワード候補に設定する優先度を決定することは、ユーザにとって非常に有意義である。 When the priority setting unit 14 determines the priority of the keyword based on the group configuration already created, the display unit 3 selects the keyword with the higher priority on the display screen. Display it above.
(Summary)
Thus, the content management unit 10 unifies the group classification configuration. Thus, the user can easily grasp the group classification configuration when searching for content. Therefore, it is easy to search for content. As described above, it is very significant for the user that the priority setting unit 14 determines the priority to be set as the keyword candidate based on the existing group classification configuration.

以下ではタイトルカテゴリに属するキーワード「ミュージック」を、ユーザが選択する例を説明する。ユーザが「ミュージック」を選択することによって、優先度設定部１４は、「ミュージック」の優先度を、他のキーワードよりもより高くする。優先度設定部１４は、キーワードに設定する優先度の情報を、分類データベース６において更新する。 Hereinafter, an example in which the user selects the keyword “music” belonging to the title category will be described. When the user selects “music”, the priority setting unit 14 makes the priority of “music” higher than other keywords. The priority setting unit 14 updates the priority information set for the keyword in the classification database 6.

グループ作成部１５は、より高い優先度のキーワードが名前に設定されているグループを、優先して作成する。このときグループ作成部１５は、図４に示す、分類データベース６に格納されている分類データを更新する。たとえば、タイトルカテゴリに属するミュージックを名前に設定したグループを作成する。さらに、このグループに属するコンテンツに、フラグを特定する。更新したあとの分類データを、図７に示す。図７は、分類データベースの詳細を表す図である。 The group creation unit 15 preferentially creates a group in which a keyword with a higher priority is set in the name. At this time, the group creation unit 15 updates the classification data stored in the classification database 6 shown in FIG. For example, a group in which music belonging to the title category is set as a name is created. Further, a flag is specified for the content belonging to this group. FIG. 7 shows the classification data after the update. FIG. 7 is a diagram showing details of the classification database.

表示部３は、分類データベース６に格納されている分類データに基づき、ディスプレイ４に表示する情報を表す表示用データを作成する。表示部３は、作成した表示用データに基づき、ディスプレイ４に所定の情報を表示する。表示部３がディスプレイ４に表示する情報の例について、図１０を参照して以下に説明する。図１０は、グループおよびコンテンツを表示している表示画面の例を表す図である。 The display unit 3 creates display data representing information to be displayed on the display 4 based on the classification data stored in the classification database 6. The display unit 3 displays predetermined information on the display 4 based on the created display data. An example of information displayed on the display 4 by the display unit 3 will be described below with reference to FIG. FIG. 10 is a diagram illustrating an example of a display screen displaying groups and contents.

この図によれば、表示部３は、３つのグループを表示している。すなわち、グループ「コンテンツＡＢＣ」と、グループ「山田」と、グループ「ミュージック」とを表示している。さらに、これら３つのグループに分類されていないコンテンツを、２つ表示している。すなわち、コンテンツ「衝撃映像スクープ」と、コンテンツ「歌謡リクエスト」とを表示している。ユーザは、これら２つのコンテンツを、表示されているコンテンツから直接に選択できる。
（キーワードの学習）
ユーザが選択したキーワードは、ユーザの嗜好を表している可能性がある。そこで嗜好学習部１７は、ユーザが選択したキーワードを学習する。この例について、説明する。 According to this figure, the display unit 3 displays three groups. That is, the group “content ABC”, the group “Yamada”, and the group “music” are displayed. Further, two contents not classified into these three groups are displayed. That is, the content “shock video scoop” and the content “song request” are displayed. The user can select these two contents directly from the displayed contents.
(Keyword learning)
The keyword selected by the user may represent the user's preference. Therefore, the preference learning unit 17 learns the keyword selected by the user. This example will be described.

たとえば、ユーザは、タイトルカテゴリに属するミュージックと、作成者カテゴリに属する田中と、ジャンルカテゴリに属する音楽とを選択できるとする。ユーザは、これらのキーワードのうち、「音楽」を選択する。これにより嗜好学習部１７は、ユーザが選択したキーワード「音楽」を、ユーザが嗜好しているキーワードとして、図示しないメモリに記憶する。すなわち嗜好学習部１７は、キーワード「音楽」を学習する。これにより、コンテンツ分類装置１は、コンテンツデータベース５に確認されているコンテンツをグループに分類するとき、このキーワードを他のキーワードよりも優先して使用する。 For example, it is assumed that the user can select music belonging to the title category, Tanaka belonging to the creator category, and music belonging to the genre category. The user selects “music” from these keywords. Accordingly, the preference learning unit 17 stores the keyword “music” selected by the user in a memory (not shown) as a keyword that the user likes. That is, the preference learning unit 17 learns the keyword “music”. Thereby, when classifying the content confirmed in the content database 5 into a group, the content classification device 1 uses this keyword in preference to other keywords.

ユーザが「ミュージック」を選択したとき、優先度設定部１４は、音楽のジャンルのうち、タイトルがミュージックであるコンテンツをユーザが嗜好していると判断する。これにより、優先度設定部１４は、キーワード「ミュージックα」を図示しないメモリに格納する。
（優先度管理データベース）
キーワードに設定される優先度を管理するため優先度管理データベースの例について、図９を参照して以下に説明する。図９は、各キーワードに設定されている優先度の例を表す図である。ユーザがミュージックを選択したとき、優先度設定部１４は、キーワードが音楽であり、カテゴリがジャンルであるキーワードを追加する。さらに、このキーワードのポイントを、１とする。このポイントは、キーワードの優先度を表す。 When the user selects “music”, the priority setting unit 14 determines that the user likes the content whose title is music among the music genres. As a result, the priority setting unit 14 stores the keyword “music α” in a memory (not shown).
(Priority management database)
An example of a priority management database for managing priorities set for keywords will be described below with reference to FIG. FIG. 9 is a diagram illustrating an example of priorities set for each keyword. When the user selects music, the priority setting unit 14 adds a keyword whose keyword is music and whose category is genre. Furthermore, the point of this keyword is 1. This point represents the priority of the keyword.

ユーザが、音楽のジャンルのうち、タイトル「ミュージック」を選択したとき、優先度設定部１４は、ジャンルが音楽であり、キーワードがミュージックであり、カテゴリがタイトルであるキーワードを、追加する。さらに、このキーワードのポイントを、プラス１とする。また、図９に示す例では、ジャンルがミュージックであり、キーワードが田中であり、図示している作成者のキーワードも追加している。さらに、このキーワードのポイントを、マイナス１に設定している。 When the user selects the title “music” from among the music genres, the priority setting unit 14 adds a keyword whose genre is music, whose keyword is music, and whose category is title. Further, this keyword point is set to plus one. In the example shown in FIG. 9, the genre is music, the keyword is Tanaka, and the creator's keyword shown is also added. Further, the keyword point is set to minus one.

優先度設定部１４は、ユーザが選択したキーワードの情報を、優先度管理データベースに格納する。さらに、ユーザが入力した情報に、ユーザの嗜好に関する情報が含まれているとき、この情報も優先度管理データベースに格納してもよい。
（グループを削除する例）
コンテンツ分類装置１は、ユーザが指定したグループを、分類データベース６から削除できる。そこで、ユーザが指定したグループを、コンテンツ分類装置１が削除する例について、以下に説明する。 The priority setting unit 14 stores information on the keyword selected by the user in the priority management database. Furthermore, when the information input by the user includes information related to the user's preference, this information may also be stored in the priority management database.
(Example of deleting a group)
The content classification device 1 can delete the group designated by the user from the classification database 6. Therefore, an example in which the content classification device 1 deletes a group designated by the user will be described below.

ユーザは、ディスプレイ４に表示されている分類構成を見て、不要と感じるグループを削除することをコンテンツ分類装置１に指示する。グループ削除部１６は、ユーザが指示したグループを削除する。 The user looks at the classification configuration displayed on the display 4 and instructs the content classification apparatus 1 to delete a group that is felt unnecessary. The group deletion unit 16 deletes the group designated by the user.

ここで、ユーザが選択したキーワードにより高い優先度を設定されているとき、ユーザは、作成されるグループに特に不満を感じることはないだろう。しかし、グループ作成部１５は、キーワードの優先度を自動的に設定しているとき、ユーザにとって不要なキーワードを名前に設定したグループを作成している可能性がある。または、分類されているコンテンツをユーザが想像しにくいグループを作成している可能性もある。 Here, when a higher priority is set for the keyword selected by the user, the user will not feel particularly dissatisfied with the created group. However, when the keyword priority is automatically set, the group creation unit 15 may create a group in which a keyword unnecessary for the user is set as a name. Or, there is a possibility that a group that makes it difficult for the user to imagine the classified content is created.

たとえば、作成できるグループのキーワードとして、カテゴリ「タイトル」に属するキーワード「ミュージック」と、カテゴリ「作成者」に属するキーワード「田中」があるとする。これにより、グループ作成部１５が、グループ「ミュージック」およびグループ「田中」を作成したとする。ここで、ユーザは、グループ「田中」を不要であると判断する。たとえば、図１０に示す表示画面において、削除するグループとしてグループ「田中」を選択し、さらに、削除ボタンを押下する。 For example, it is assumed that there are a keyword “music” belonging to the category “title” and a keyword “Tanaka” belonging to the category “creator” as keywords of the group that can be created. Accordingly, it is assumed that the group creating unit 15 creates the group “music” and the group “Tanaka”. Here, the user determines that the group “Tanaka” is unnecessary. For example, in the display screen shown in FIG. 10, the group “Tanaka” is selected as the group to be deleted, and the delete button is further pressed.

コンテンツ分類装置１では、ユーザが指定した、削除するグループに関する情報を、入力部２が受け付け、コンテンツ管理部１０に出力する。コンテンツ管理部１０では、グループ削除部１６が、ユーザが指定したグループを、分類データベース６から削除する。また、嗜好学習部１７が、ユーザが指定したグループの名前を、ユーザが好まない文字列として、学習する。 In the content classification device 1, the input unit 2 receives information regarding a group to be deleted designated by the user and outputs the information to the content management unit 10. In the content management unit 10, the group deletion unit 16 deletes the group designated by the user from the classification database 6. Further, the preference learning unit 17 learns the name of the group designated by the user as a character string that the user does not like.

図９に示す例に基づき説明すると、ユーザがグループ「田中」を削除することを選択したとき、嗜好学習部１７は、ジャンルが音楽であり、キーワードが田中であり、カテゴリが作成者であるキーワードのポイントを、マイナス１として、優先度管理データベースに記憶する。なお、優先度管理データベースは、ポイントがマイナスであるキーワードを、ユーザが好まないキーワードとして格納している。
（グループを再編成する例）
さらに、コンテンツ分類装置１は、ユーザが指定したコンテンツを、既存のグループから取り除くことができる。また、ユーザが指定したコンテンツを、既存のグループに追加できる。これらの例について、次に説明する。 Referring to the example shown in FIG. 9, when the user selects to delete the group “Tanaka”, the preference learning unit 17 uses the keyword that the genre is music, the keyword is Tanaka, and the category is creator. Is stored in the priority management database as minus one. The priority management database stores keywords with negative points as keywords that the user does not like.
(Example of group reorganization)
Furthermore, the content classification device 1 can remove the content specified by the user from the existing group. In addition, content specified by the user can be added to an existing group. These examples will be described next.

コンテンツ分類部１３がグループに分類した複数のコンテンツの分類構成を、ユーザは、自分が意図した分類構成でないと感じることがある。このとき、ユーザは、グループに分類されている、不要なコンテンツを、グループから削除したいと考える。さらに、グループに分類されていないコンテンツを、このグループに分類したいと考えることもある。コンテンツ分類装置１は、ユーザによるこれらの要求を、次に示す方法によって実現する。 The user may feel that the classification configuration of a plurality of contents classified into groups by the content classification unit 13 is not the classification configuration intended by the user. At this time, the user wants to delete unnecessary contents classified into the group from the group. Furthermore, content that is not classified into a group may want to be classified into this group. The content classification device 1 realizes these requests by the user by the following method.

コンテンツ分類装置１では、表示部３が、現在の分類構成をディスプレイ４に表示する。この表示をみたユーザは、ディスプレイ４に表示されているコンテンツの中から、グループから取り除きたいコンテンツを入力部２を通じて選択する。入力部２は、ユーザが選択したコンテンツ「情報」を受け付け、受け付けたコンテンツに関する情報をコンテンツ管理部１０に出力する。コンテンツ分類部１３が、ユーザが選択したコンテンツを、そのコンテンツが分類されているグループから取り除く。 In the content classification device 1, the display unit 3 displays the current classification configuration on the display 4. The user who sees this display selects the content to be removed from the group from the contents displayed on the display 4 through the input unit 2. The input unit 2 receives the content “information” selected by the user and outputs information related to the received content to the content management unit 10. The content classification unit 13 removes the content selected by the user from the group in which the content is classified.

あるグループから取り除いたコンテンツが、そのグループとは異なる他のどのグループにも分類されていないとき、コンテンツ分類部１３は、このコンテンツをルートディレクトリに格納する。表示部３は、表示画面において、このコンテンツがルートディレクトリに格納されていることを表示する。また、コンテンツ分類部１３は、グループから削除するコンテンツの情報を、分類データベース６から削除する。
（コンテンツをグループに追加する）
ユーザは、あるコンテンツをグループに追加したいとき、ディスプレイ４に表示されている表示画面において、追加したいコンテンツを選択し、グループ追加メニューを押下する。これにより、表示部３は、ディスプレイ４に、ユーザが選択したコンテンツを分類できるグループの一覧を表示する。ユーザは、表示されている一覧の中から、コンテンツを分類したいグループを選択する。 When the content removed from a certain group is not classified into any other group different from the group, the content classification unit 13 stores this content in the root directory. The display unit 3 displays on the display screen that this content is stored in the root directory. Further, the content classification unit 13 deletes the information on the content to be deleted from the group from the classification database 6.
(Add content to group)
When the user wants to add a certain content to the group, the user selects the content to be added on the display screen displayed on the display 4 and presses the group addition menu. Thereby, the display unit 3 displays a list of groups that can classify the content selected by the user on the display 4. The user selects a group into which content is to be classified from the displayed list.

コンテンツ分類部１３は、ユーザが選択したグループに、ユーザが選択したコンテンツを追加する。このとき分類データベース６において、追加するコンテンツの情報を追加する。具体的には、追加するグループにグループ分類フラグを立てる。さらに、追加するグループの名前が属するカテゴリをチェックし、追加するカテゴリのメタ情報から、このカテゴリに属するキーワードを抽出する。また、メタ情報から抽出されたキーワードを、分類データベース６に追加する。 The content classification unit 13 adds the content selected by the user to the group selected by the user. At this time, content information to be added is added in the classification database 6. Specifically, a group classification flag is set for the group to be added. Further, the category to which the name of the group to be added belongs is checked, and keywords belonging to this category are extracted from the meta information of the category to be added. In addition, keywords extracted from the meta information are added to the classification database 6.

このように、コンテンツ分類装置１は、ユーザの指定に基づき、グループに分類されているコンテンツの構成を変更する。このときコンテンツ分類装置１は、グループにあらかじめ定められている、キーワードのマッチング閾値を、新たなグループの分類構成を反映する新たな値に変更してもよい。または、あるグループに新たに分類されるコンテンツ同士によって、キーワードのマッチングを新たに実行し、グループの名前を、新たなコンテンツを反映する名前に変更してもよい。
（閾値を変更する例）
まず、コンテンツ分類装置１が、キーワードの閾値を変更する例について説明する。コンテンツ分類装置１は、距離に関する閾値を、グループごとに、分類データベース６に格納している。すなわち分類データベース６は、図４に示すように、グループの名前、グループ分類フラグ、マッチングするキーワードに加えて、グループにあらかじめ設定されている閾値を格納している。 As described above, the content classification device 1 changes the configuration of the content classified into groups based on the designation by the user. At this time, the content classification device 1 may change the keyword matching threshold predetermined for the group to a new value that reflects the new group classification configuration. Alternatively, keyword matching may be newly executed between contents newly classified into a certain group, and the name of the group may be changed to a name that reflects the new content.
(Example of changing the threshold)
First, an example in which the content classification device 1 changes the keyword threshold will be described. The content classification device 1 stores a threshold related to distance in the classification database 6 for each group. That is, as shown in FIG. 4, the classification database 6 stores a threshold value set in advance for a group in addition to a group name, a group classification flag, and a matching keyword.

上述したように、グループ削除部１６は、ユーザが指定したコンテンツを、このコンテンツが分類されているグループから取り除く。これにより、コンテンツ分類部１３は、取り除かれたコンテンツのキーワードと、グループに残されているコンテンツのキーワードとの最小の距離であるＬ１を算出する。さらに、このグループに残されているコンテンツのキーワード同士の距離のうち、最大の距離であるＬ２を算出する。 As described above, the group deletion unit 16 removes the content designated by the user from the group in which the content is classified. As a result, the content classification unit 13 calculates L1 which is the minimum distance between the removed content keyword and the content keyword remaining in the group. Furthermore, L2 which is the maximum distance among the distances between the keywords of the content remaining in this group is calculated.

ここで、コンテンツ分類部１３は、算出したＬ１がＬ２よりも大きいとき、このグループに設定されている閾値を、Ｌ２に変更する。なお、コンテンツ関連性判定部１２は、キーワードのマッチングを実行するとき、グループを作成したとき、または、グループにコンテンツを追加したときにおいて、キーワードのマッチングに使用したカテゴリと同一のカテゴリに属するキーワードを使用する。 Here, when the calculated L1 is larger than L2, the content classification unit 13 changes the threshold set for this group to L2. It should be noted that the content relevance determination unit 12 selects keywords belonging to the same category as the category used for keyword matching when performing keyword matching, when creating a group, or when adding content to the group. use.

グループに不要なコンテンツが分類されていると判断したため、ユーザは、この不要なコンテンツを、グループから取り除くように、コンテンツ分類装置１に要求する。さらに、Ｌ１がＬ２よりも大きいことは、キーワードのマッチングに使用した閾値が大きすぎて、本来は分類されるべきではない余計なコンテンツが、グループに分類されたことを意味する。そこで優先度設定部１４が、グループの閾値をより適切な値に変更する。これにより、コンテンツ分類部１３は、つぎにコンテンツをグループに次に分類するとき、余計なコンテンツをグループに分類しないようにする。 Since it is determined that unnecessary content is classified in the group, the user requests the content classification device 1 to remove the unnecessary content from the group. Further, L1 being larger than L2 means that an excessive content that should not be classified because the threshold used for keyword matching is too large is classified into a group. Therefore, the priority setting unit 14 changes the group threshold to a more appropriate value. As a result, the content classification unit 13 prevents the unnecessary content from being classified into groups when the content is next classified into groups.

上述したように、グループ作成部１５は、グループ「ミュージック」を作成している。グループ「ミュージック」には、タイトルが「ミュージックα」である３つのコンテンツと、タイトルがミュージックＢである１つのコンテンツとが、分類されている。図１１に、これらのコンテンツのキーワード同士の距離を表す。図１１は、コンテンツを取り除く前および後において、グループに分類できるコンテンツの範囲を表す図である。 As described above, the group creation unit 15 creates a group “music”. In the group “Music”, three contents whose title is “Music α” and one content whose title is Music B are classified. FIG. 11 shows the distance between the keywords of these contents. FIG. 11 is a diagram showing a range of contents that can be classified into groups before and after removing the contents.

ここで、ユーザが、コンテンツ「ミュージックＢ」をグループ「ミュージック」から取り除くことを、コンテンツ分類装置１に指示したとする。これにより、グループ削除部１６は、コンテンツ「ミュージックＢ」をグループ「ミュージック」から取り除く。このとき、コンテンツ「ミュージックＢ」が取り除かれているグループ「ミュージック」には、コンテンツ「ミュージックα」が３つ、残されている。これらの残されている３つのコンテンツでは、キーワード間の距離は、いずれも「０」と算出され、Ｌ１よりも小さい。このときコンテンツ分類部１３は、Ｌ２を「０」と算出する。Ｌ１がＬ２と同一であるため、コンテンツ関連性判定部１２は、このグループの閾値を、「０」に変更する。 Here, it is assumed that the user instructs the content classification apparatus 1 to remove the content “Music B” from the group “Music”. As a result, the group deletion unit 16 removes the content “Music B” from the group “Music”. At this time, three contents “music α” are left in the group “music” from which the content “music B” is removed. In these three remaining contents, the distance between keywords is calculated as “0”, which is smaller than L1. At this time, the content classification unit 13 calculates L2 as “0”. Since L1 is the same as L2, the content relevance determination unit 12 changes the threshold value of this group to “0”.

図１１には、コンテンツ「ミュージックＢ」がグループから取り除かれる前における、グループに分類されるコンテンツの範囲と、コンテンツが取り除かれる後における、グループに分類されるコンテンツの範囲とを表している。仮にコンテンツ「ミュージックＢ」がコンテンツ分類装置１に新たに入力されたとしても、コンテンツ分類部１３は、このコンテンツをグループ「ミュージックα」には分類しない。グループの閾値が「０」であるため、コンテンツ分類装置１は、グループ「ミュージックα」には、キーワードが完全にマッチングするコンテンツ、すなわちコンテンツ「ミュージックα」のみを分類する。 FIG. 11 shows the range of content classified into a group before the content “Music B” is removed from the group and the range of content classified into a group after the content is removed. Even if the content “music B” is newly input to the content classification device 1, the content classification unit 13 does not classify the content into the group “music α”. Since the threshold value of the group is “0”, the content classification device 1 classifies only the content whose keyword completely matches, that is, the content “music α”, into the group “music α”.

キーワード「ミュージックγ」と、キーワード「ミュージックα」との距離が、仮にＬ１だとすれば、コンテンツ分類装置１は、ミュージックＢが取り除かれている前のグループ「ミュージック」は、コンテンツ「ミュージックγ」を分類できる。しかし、コンテンツ「ミュージックγ」が取り除かれた後のグループ「ミュージックα」には、コンテンツ「ミュージックγ」を分類できない。 If the distance between the keyword “music γ” and the keyword “music α” is L1, the content classification device 1 determines that the group “music” before the music B is removed is the content “music γ”. Can be classified. However, the content “music γ” cannot be classified into the group “music α” after the content “music γ” is removed.

Ｌ１とＬ２とが同一であるとき、コンテンツ関連性判定部１２は、閾値を変更せず、そのままの値としておいてもよい。しかし、コンテンツ分類部１３は、１つのグループから、複数のコンテンツを、一度に取り除くこともありえる。このとき、コンテンツ関連性判定部１２は、グループに残されているコンテンツのキーワード同士の距離のうち、Ｌ２の次に大きいＬ２’を算出する。このとき、算出したＬ２’を、グループの新たな閾値に設定してもよい。さらに、コンテンツ分類部１３は、グループに残されているコンテンツのうち、他のコンテンツとの距離が全てＬ２（Ｌ１）であるキーワードに関連するコンテンツが存在するとき、このコンテンツをグループから取り除いていてもよい。 When L1 and L2 are the same, the content relevance determination unit 12 may leave the threshold value as it is without changing the threshold value. However, the content classification unit 13 may remove a plurality of contents from one group at a time. At this time, the content relevance determination unit 12 calculates L2 ', which is the second largest after L2, among the distances between the keywords of the content remaining in the group. At this time, the calculated L2 'may be set as a new threshold for the group. Furthermore, the content classification unit 13 removes the content from the group when there is content related to the keyword whose distance from the other content is all L2 (L1) among the content remaining in the group. Also good.

上述したように、グループ作成部１５は、名前が「ミュージック」であるグループを作成した。グループ「ミュージック」には、コンテンツ「ミュージックα」と、コンテンツ「ミュージックＢ」とが分類されている。コンテンツ分類部１３は、ユーザの指定に基づき、コンテンツ「ミュージックＢ」をグループ「ミュージック」から取り除く。このとき、グループに残されているコンテンツに、コンテンツ「ミュージックＢ」が存在するなら、Ｌ１とＬ２とが同一になる。すなわち、キーワード「ミュージックＢ」と、キーワード「ミュージックα」との距離は、Ｌ２である。このＬ２は、Ｌ１と同一である。 As described above, the group creation unit 15 created a group whose name is “music”. In the group “music”, the content “music α” and the content “music B” are classified. The content classification unit 13 removes the content “Music B” from the group “Music” based on the user's designation. At this time, if the content “music B” exists in the content remaining in the group, L1 and L2 are the same. That is, the distance between the keyword “music B” and the keyword “music α” is L2. This L2 is the same as L1.

これにより、コンテンツ分類部１３は、コンテンツ「ミュージックＢ」を、グループ「ミュージック」から取り除く。さらに、コンテンツ関連性判定部１２は、このグループの閾値を、Ｌ２よりも小さい「０」に変更する。このようにして、コンテンツ分類装置１は、ユーザが指定したコンテンツに基づき、このコンテンツと異なる他のコンテンツをも、グループから自動的に取り除くことができる。
（コンテンツをグループに追加する例）
つぎに、ユーザが指定したコンテンツを、コンテンツ分類部１３が所定のグループに追加する例について、以下に説明する。 As a result, the content classification unit 13 removes the content “Music B” from the group “Music”. Furthermore, the content relevance determination unit 12 changes the threshold value of this group to “0” smaller than L2. In this way, the content classification device 1 can automatically remove other content different from this content from the group based on the content specified by the user.
(Example of adding content to a group)
Next, an example in which the content classification unit 13 adds content specified by the user to a predetermined group will be described below.

ユーザが指定した、既存のグループに分類されていないコンテンツを、コンテンツ分類部１３がグループから取り除いたとき、コンテンツ関連性判定部１２は、グループに追加されたコンテンツのキーワードと、グループにあらかじめ分類されているコンテンツのキーワードとの距離のうち、最大の距離であるＬ３を算出する。さらに、算出したＬ３と、このグループにあらかじめ設定されている閾値Ｍとを比較する。Ｌ３が閾値Ｍよりも大きいとき、コンテンツ関連性判定部１２は、このグループの閾値を算出したＬ３に変更する。このときコンテンツ関連性判定部１２は、グループの名前が属するカテゴリと同一のカテゴリに属するキーワードを、マッチングに使用する。 When the content classification unit 13 removes content specified by the user that is not classified into an existing group from the group, the content relevance determination unit 12 classifies the content keyword added to the group and the group in advance. L3 which is the maximum distance among the distances to the keyword of the content being calculated is calculated. Further, the calculated L3 is compared with a threshold value M set in advance for this group. When L3 is larger than the threshold value M, the content relevance determination unit 12 changes the threshold value of this group to the calculated L3. At this time, the content relevance determination unit 12 uses a keyword belonging to the same category as the category to which the group name belongs for matching.

ユーザが、あるコンテンツをグループに追加することをコンテンツ分類装置１に指示するのは、グループに分類するコンテンツの範囲を、より広くして欲しいからである可能性が高い。そこで、コンテンツ関連性判定部１２は、算出したＬ３が、あらかじめ定められた閾値Ｍよりも大きいとき、グループに設定する閾値をより大きくする。これにより、グループの名前にマッチングするコンテンツのキーワードの範囲を、より広くする。したがって、コンテンツ分類装置１に新たなコンテンツが追加されたとき、グループに分類されているコンテンツのキーワードと、追加されたコンテンツのキーワードとの距離が、Ｌ３よりも以下であれば、コンテンツ分類部１３は、このコンテンツを、グループに追加できる。
（分類構成の変更）
コンテンツ関連性判定部１２は、あらかじめ定められた閾値を新たな値に変更したとき、このグループに分類できるコンテンツを、コンテンツデータベース５から検索してもよい。このときコンテンツ分類部１３は、検索したコンテンツのキーワードと、グループに分類されているコンテンツのキーワードとの距離を算出する。このとき算出される距離が、グループに新たに設定された閾値よりも小さいとき、コンテンツ分類部１３は、コンテンツデータベース５から検索したコンテンツを、グループに追加する。 It is highly likely that the user instructs the content classification apparatus 1 to add a certain content to the group because the range of the content to be classified into the group is desired to be wider. Therefore, when the calculated L3 is greater than the predetermined threshold M, the content relevance determination unit 12 increases the threshold set for the group. This broadens the range of content keywords that match the group name. Therefore, when a new content is added to the content classification device 1, if the distance between the keyword of the content classified into the group and the keyword of the added content is less than L3, the content classification unit 13 Can add this content to a group.
(Changing the classification structure)
The content relevance determination unit 12 may search the content database 5 for content that can be classified into this group when the predetermined threshold is changed to a new value. At this time, the content classification unit 13 calculates the distance between the keyword of the searched content and the keyword of the content classified into the group. When the distance calculated at this time is smaller than the threshold newly set for the group, the content classification unit 13 adds the content searched from the content database 5 to the group.

このように、コンテンツ分類装置１は、グループから１つのコンテンツを取り除くか、またはグループに１つのコンテンツを追加することの少なくともいずれかを実行することによって、このグループに分類されていない他のコンテンツの分類構成も、適宜、変更できる。 As described above, the content classification device 1 removes one content from the group or adds one content to the group, thereby executing another content not classified in this group. The classification configuration can also be changed as appropriate.

グループにコンテンツを追加するとき、コンテンツ分類部１３は、分類データベース６を更新する。すなわち、まず、分類データベース６において、追加するコンテンツをこのグループに関連付る。さらに、追加したコンテンツに、このグループに分類されていることを表すグループ分類フラグを立てる。さらに、追加したコンテンツのキーワードのうち、グループの名前が属するカテゴリと同一のカテゴリに属するキーワードを抽出し、分類データベース６に追加する。
（グループ名の変更）
コンテンツ分類装置１は、グループに分類するコンテンツの数を変更するとき、このグループに現在分類されているコンテンツのキーワード同士のマッチングを実行することによって、より適切なキーワードを抽出し、このグループの名前に設定できる。この例について、以下に説明する。 When adding content to the group, the content classification unit 13 updates the classification database 6. That is, first, the content to be added is associated with this group in the classification database 6. Further, a group classification flag indicating that the content is classified into this group is set on the added content. Further, from the added content keywords, keywords belonging to the same category as the category to which the group name belongs are extracted and added to the classification database 6.
(Rename group)
When the content classification device 1 changes the number of contents to be classified into a group, the content classification apparatus 1 extracts a more appropriate keyword by executing matching between keywords of the content currently classified into this group, and the name of this group Can be set. This example will be described below.

以下では、コンテンツ分類装置１が、ユーザの指定したコンテンツをグループから取り除いたあとに、グループの名前を変更する例について説明する。コンテンツ分類部１３は、グループは残されているコンテンツに関するキーワード同士によりマッチングする文字列が存在するか否かを判定する。マッチングする文字列が存在すると判定するとき、グループ作成部１５は、この文字列を抽出し、現在のグループの名前に追加する。このとき、グループ作成部１５は、完全にマッチングするときに、マッチングした文字列を、グループの名前に追加する。したがって、グループ作成部１５は、グループに、より長い名前を設定する。 Hereinafter, an example will be described in which the content classification device 1 changes the name of the group after removing the content designated by the user from the group. The content classification unit 13 determines whether there is a character string that matches the keywords related to the content that remains in the group. When determining that there is a matching character string, the group creating unit 15 extracts this character string and adds it to the name of the current group. At this time, the group creation unit 15 adds the matched character string to the name of the group when complete matching is performed. Therefore, the group creation unit 15 sets a longer name for the group.

コンテンツ分類装置１は、グループの名前が属するカテゴリと同一のカテゴリに属するキーワードを、マッチングに使用する。１つのコンテンツをグループから取り除くことによって、グループに残されているコンテンツのキーワードをマッチングしたとき、より長い文字列がマッチングすることがある。このときコンテンツ分類装置１は、グループの名前を、マッチングした文字列に変更すればよい。 The content classification device 1 uses a keyword belonging to the same category as the category to which the group name belongs for matching. By removing one content from a group, a longer character string may be matched when a keyword of content remaining in the group is matched. At this time, the content classification device 1 may change the name of the group to a matched character string.

たとえば、グループ作成部１５は、名前が「ミュージック」であるグループを作成している。グループ「ミュージック」には、コンテンツ「ミュージックα」およびコンテンツ「ミュージックＢ」が分類されている。ここで、コンテンツ分類部１３は、ユーザが指定したミュージックＢを、グループ「ミュージック」から取り除く。このとき、コンテンツ関連性判定部１２は、グループに残されているコンテンツ「ミュージックα」のキーワードである「ミュージックα」を、マッチングに使用する。 For example, the group creation unit 15 creates a group whose name is “music”. In the group “Music”, the content “Music α” and the content “Music B” are classified. Here, the content classification unit 13 removes the music B designated by the user from the group “music”. At this time, the content relevance determination unit 12 uses “music α”, which is a keyword of the content “music α” remaining in the group, for matching.

これによりキーワード抽出部１１は、現在のグループの名前である「ミュージック」よりも長い文字列である「ミュージックα」を、キーワードから抽出する。したがってコンテンツ分類部１３は、グループの名前を、「ミュージック」から、「ミュージックα」に変更する。 As a result, the keyword extraction unit 11 extracts “music α”, which is a character string longer than “music”, which is the name of the current group, from the keyword. Therefore, the content classification unit 13 changes the name of the group from “music” to “music α”.

なお、グループ作成部１５は、既存のグループの名前に、新たに抽出された文字列を、所定の記号を介して連結させてもよい。具体的には、「ミュージック」に、「α」を、「−」を介して接続し、グループの名前を、「ミュージック−α」に設定すればよい。これにより、元のグループの名前と、新たに抽出した文字とのつながりが不自然な場合であっても、より自然な名前をグループに設定できる。さらに、ユーザは、このようなグループの名前を見ることによって、グループの名前が決定されたときに、複数のキーワードが使用されたことを知ることができる。具体的には、このグループに、キーワードとして、「ミュージック」および「α」が関連づけられていることを知る。 The group creating unit 15 may connect a newly extracted character string to a name of an existing group via a predetermined symbol. Specifically, “α” is connected to “music” via “−”, and the group name is set to “music-α”. Thereby, even when the connection between the name of the original group and the newly extracted character is unnatural, a more natural name can be set for the group. Further, the user can know that a plurality of keywords are used when the name of the group is determined by looking at the name of the group. Specifically, it is known that “music” and “α” are associated with this group as keywords.

グループからコンテンツを取り除く前と、後とにおいて、コンテンツ関連性判定部１２は、グループに分類されているコンテンツのキーワード同士の距離の平均値を算出してもよい。このとき、コンテンツをグループから取り除いた後において算出した平均値が、取り除く前において算出した平均値よりも大きいとき、グループに残されているコンテンツのキーワード同士を、コンテンツを取り除く前のグループに分類されているコンテンツのキーワード同士よりも、長い文字がマッチングしている可能性が高いと判定する。そこで、このとき、コンテンツ関連性判定部１２は、コンテンツのキーワードに、新たにマッチングする文字が存在するか否かを判定してもよい。 Before and after removing the content from the group, the content relevance determination unit 12 may calculate an average value of the distances between the keywords of the content classified into the group. At this time, when the average value calculated after removing the content from the group is larger than the average value calculated before removing the content, the keywords of the content remaining in the group are classified into the group before removing the content. It is determined that there is a higher possibility that long characters match than the keywords of the content in question. Therefore, at this time, the content relevance determination unit 12 may determine whether there is a newly matching character in the content keyword.

コンテンツ関連性判定部１２は、ユーザが指定した、グループに現在分類されていないコンテンツを、グループに追加したとき、追加したコンテンツのキーワードの部分文字列が、グループの名前にマッチングするか否かを判定してもよい。マッチングすると判定するなら、グループ作成部１５は、グループの名前を変更せず、そのままにすればよい。このときコンテンツ関連性判定部１２は、グループの名前が属するカテゴリと同一のカテゴリに属するキーワードを、マッチングに使用すればよい。ここでいうマッチングとは、あらかじめマッチングすることを意味する。 The content relevance determination unit 12 determines whether or not the partial character string of the keyword of the added content matches the name of the group when content specified by the user and not currently classified into the group is added to the group. You may judge. If it is determined to match, the group creating unit 15 may leave the group name unchanged. At this time, the content relevance determination unit 12 may use a keyword belonging to the same category as the category to which the group name belongs for matching. Matching here means matching in advance.

コンテンツのキーワードの部分文字列と、グループの名前とがマッチングしないとき、コンテンツ関連性判定部１２は、グループに新たに分類されたコンテンツのキーワードが、共通して、コンテンツを追加する前におけるグループの名前の部分文字列とマッチングするか否かを判定してもよい。マッチングすると判定するとき、グループ作成部１５は、グループの名前を、グループの名前にマッチングする部分文字列に変更すればよい。 When the partial character string of the content keyword and the name of the group do not match, the content relevance determination unit 12 shares the keyword of the content newly classified into the group in common before adding the content. It may be determined whether or not it matches a partial character string of the name. When determining that matching is to be performed, the group creating unit 15 may change the group name to a partial character string that matches the group name.

コンテンツ関連性判定部１２は、互いにマッチングするキーワードが存在しないことを判定するとき、コンテンツのキーワードのうち、他のカテゴリに属するキーワード同士を比較することによって、マッチングするキーワードが他に存在するか否かを判定してもよい。存在すると判定するとき、グループ作成部１５は、新たにマッチングした文字列を、グループの名前に設定してもよい。 When the content relevance determination unit 12 determines that there are no matching keywords, the content relevance determination unit 12 compares the keywords belonging to other categories among the content keywords to determine whether there are other matching keywords. It may be determined. When determining that it exists, the group creation unit 15 may set a newly matched character string as the name of the group.

上述したように、コンテンツ分類装置１は、グループの名前を変更するとき、さらに、分類データベース６において管理しているグループの名前も、変更する。
（番組分類装置１ａ）
以上に、図１に示すコンテンツ分類装置１の詳細を説明した。つぎに、コンテンツとして番組を、さらに、メタ情報として番組情報を取得し、取得した番組情報に基づき、受信した番組を自動的にグループに分類する番組分類装置１ａについて、説明する。この番組分類装置１ａは、たとえばハードディスクレコーダのような、番組情報を取得する機能、および番組を録画する機能を有している。さらに、ユーザに適すると判定した番組を、ユーザに推薦する機能も有している。 As described above, when changing the name of the group, the content classification device 1 also changes the name of the group managed in the classification database 6.
(Program classification device 1a)
The details of the content classification apparatus 1 shown in FIG. 1 have been described above. Next, a program classification apparatus 1a that acquires a program as contents and further acquires program information as meta information and automatically classifies the received programs into groups based on the acquired program information will be described. The program classification device 1a has a function of acquiring program information and a function of recording a program, such as a hard disk recorder. Further, it has a function of recommending to the user a program determined to be suitable for the user.

まず番組分類装置１ａの詳細について、図１３を参照して以下に説明する。図１３は、本発明の一実施形態に係る番組分類装置１ａの詳細を表すブロック図である。 First, details of the program classification device 1a will be described below with reference to FIG. FIG. 13 is a block diagram showing details of the program classification device 1a according to the embodiment of the present invention.

この図によれば、番組分類装置１ａは、入力部２ａ、ディスプレイ４ａ、番組データベース５ａ、分類データベース６ａ、通信アンテナ７ａ、番組管理部１０ａを備えている。 According to this figure, the program classification device 1a includes an input unit 2a, a display 4a, a program database 5a, a classification database 6a, a communication antenna 7a, and a program management unit 10a.

さらに、番組管理部１０ａは、キーワード抽出部１１ａ（キーワード抽出手段）、番組関連性判定部１２ａ、番組分類部１３ａ（番組分類手段）、優先度設定部１４ａ（優先度設定手段）、グループ作成部１５ａ、グループ削除部１６ａ（グループ削除手段）、および嗜好学習部１７ａ（嗜好学習手段）、および番組推薦部１８ａ（番組推薦手段）を備えている。 Further, the program management unit 10a includes a keyword extraction unit 11a (keyword extraction unit), a program relevance determination unit 12a, a program classification unit 13a (program classification unit), a priority setting unit 14a (priority setting unit), and a group creation unit. 15a, a group deletion unit 16a (group deletion unit), a preference learning unit 17a (preference learning unit), and a program recommendation unit 18a (program recommendation unit).

これら部材のうち、図１に示すコンテンツ分類装置１に備えている部材と同様の機能を有するものについては、その説明を省略する。番組分類装置１ａは、コンテンツとして番組を処理する。さらに、メタ情報として、番組情報を処理する。
（番組受信部８ａ）
番組受信部８ａは、放送されている番組を受信し、番組データベース５ａに格納する。この番組を表す番組情報について、図１４を参照して以下に説明する。図１４は、番組情報の一例を表す図である。この図にしめすように、番組情報は、番組ごとに、あるカテゴリに属するキーワードを、テキスト情報として格納している。たとえば、カテゴリとして、ジャンル、タイトル、放送局名、出演者、および内容説明がある。 Among these members, those having the same functions as those provided in the content classification device 1 shown in FIG. The program classification device 1a processes a program as content. Further, program information is processed as meta information.
(Program receiver 8a)
The program receiving unit 8a receives a broadcast program and stores it in the program database 5a. Program information representing this program will be described below with reference to FIG. FIG. 14 is a diagram illustrating an example of program information. As shown in this figure, the program information stores keywords belonging to a certain category as text information for each program. For example, the category includes genre, title, broadcast station name, performer, and content description.

番組受信部８ａは、任意の日数の番組の番組情報を、特定の時間帯において、放送局から受信し、番組データベース５ａに格納する。番組受信部８ａは、デジタル衛星放送などの放送システムにしたがって放送される番組を受信するとき、この番組に付加されている番組情報を同時に受信できる。なお、番組受信部８ａは、インターネットなどを通じて、番組とは別に、番組情報を受信してもよい。 The program receiving unit 8a receives program information of a program of an arbitrary number of days from a broadcasting station in a specific time zone, and stores it in the program database 5a. When receiving a program broadcast according to a broadcasting system such as digital satellite broadcasting, the program receiving unit 8a can simultaneously receive program information added to the program. The program receiving unit 8a may receive program information separately from the program through the Internet or the like.

ユーザは、入力部２ａを通じて、番組の録画に関する情報を入力する。表示部３ａは、ディスプレイ４ａに、録画した番組をグループに分類して表示する。番組分類部１３ａは、ユーザが指定した番組を、既存のグループに分類する。このとき、番組の内容を表す番組情報を使用する。 The user inputs information related to program recording through the input unit 2a. The display unit 3a classifies and displays the recorded programs on the display 4a. The program classification unit 13a classifies programs designated by the user into existing groups. At this time, program information representing the contents of the program is used.

番組管理部１０ａは、番組をグループに分類するとき、分類データベース６ａに格納されている番組情報を使用してもよい。このとき、ユーザは、番組を特定する番組ＩＤ（番組識別情報）を入力する。番組管理部１０ａは、入力される番組ＩＤに基づき、番組データベース５ａにおいてユーザが指定した番組を特定する。番組管理部１０ａは、特定した番組の番組情報に基づき、番組をグループに分類する。 The program management unit 10a may use program information stored in the classification database 6a when classifying programs into groups. At this time, the user inputs a program ID (program identification information) that identifies the program. The program management unit 10a specifies a program designated by the user in the program database 5a based on the input program ID. The program management unit 10a classifies the programs into groups based on the program information of the specified programs.

なお、番組分類装置１ａが番組をグループに分類するときの処理は、番組分類装置１ａがコンテンツをグループに分類するときの処理と、実質的に同一であるため、詳細な説明を省略する。
（番組推薦部１８ａ）
番組推薦部１８ａは、ユーザに番組を推薦する。このとき、ユーザは、入力部２ａを通じて、番組の推薦要求を入力する。番組推薦部１８ａは、図示しない嗜好情報データベースに格納されている、ユーザの嗜好に関する情報に基づき、ユーザに推薦する番組を、番組データベース５ａから特定する。 Note that the processing when the program classification device 1a classifies the program into groups is substantially the same as the processing when the program classification device 1a classifies the content into groups, and thus detailed description thereof is omitted.
(Program recommendation unit 18a)
The program recommendation unit 18a recommends a program to the user. At this time, the user inputs a program recommendation request through the input unit 2a. The program recommendation unit 18a specifies a program recommended to the user from the program database 5a based on information related to the user's preference stored in a preference information database (not shown).

表示部３ａは、ディスプレイ４ａに、番組推薦部１８ａが特定した、ユーザに推薦する番組に関する情報を表示する。 The display unit 3a displays on the display 4a information related to the program recommended to the user specified by the program recommendation unit 18a.

ユーザは、キーワードの優先度を決定するとき、ディスプレイ４ａに表示されている、複数のキーワードの中から、１つのキーワードを選択する。たとえば各キーワード「女優Ａ」、「俳優Ｂ」、および「女優Ｃ」のうち、「俳優Ｂ」を選択する。これによりグループ作成部１５ａは、名前が俳優Ｂであるグループを作成する。番組分類部１３ａは、番組をこのグループに分類する。 When determining the priority of a keyword, the user selects one keyword from a plurality of keywords displayed on the display 4a. For example, “Actor B” is selected from the keywords “Actress A”, “Actor B”, and “Actress C”. Thereby, the group creation unit 15a creates a group whose name is the actor B. The program classification unit 13a classifies programs into this group.

このとき嗜好学習部１７ａは、キーワード「女優Ｂ」を、ユーザが嗜好しているキーワードとして、嗜好情報データベースに格納する。番組推薦部１８ａは、嗜好情報データベースに格納されているキーワード「俳優Ｂ」を抽出し、この文字列をキーワードとして含んでいる番組情報によって表される番組を、ユーザに推薦する。
（推薦する番組の表示）
表示部３ａは、番組推薦部１８ａが抽出した番組を表示するため分類データを作成し、作成した分類データに基づき、ユーザに推薦する番組に関する情報をディスプレイ４ａに表示する。たとえば、図１５に示す表示画面を、ディスプレイ４ａに表示する。 At this time, the preference learning unit 17a stores the keyword “actress B” in the preference information database as a keyword that the user likes. The program recommendation unit 18a extracts the keyword “Actor B” stored in the preference information database, and recommends the program represented by the program information including this character string as a keyword to the user.
(Display recommended programs)
The display unit 3a creates classification data for displaying the program extracted by the program recommendation unit 18a, and displays information on the program recommended to the user on the display 4a based on the created classification data. For example, the display screen shown in FIG. 15 is displayed on the display 4a.

このように、嗜好学習部１７ａは、番組をグループに分類するときにユーザが選択した情報を学習している。番組推薦部１８ａは嗜好学習部１７ａ学習した情報に基づき、ユーザが嗜好する番組を推測し、ユーザに推薦する。これにより、番組分類装置１ａは、ユーザの嗜好をユーザがあらかじめ入力しなくても、ユーザに関連の深い番組を推薦できる。
（録画または視聴に基づく学習）
嗜好学習部１７ａは、ユーザが録画する番組を、ユーザが嗜好している番組であるとして学習してもよい。このとき、嗜好学習部１７ａは、ユーザが録画するように指定した番組の番組情報から、キーワードを抽出し、抽出したキーワードを、好みのキーワードとして嗜好情報データベースに格納すればよい。 In this manner, the preference learning unit 17a learns information selected by the user when classifying programs into groups. Based on the information learned by the preference learning unit 17a, the program recommendation unit 18a estimates a program that the user likes and recommends it to the user. Thereby, the program classification device 1a can recommend a program closely related to the user without the user inputting the user's preference in advance.
(Learning based on recording or viewing)
The preference learning unit 17a may learn that the program recorded by the user is a program that the user likes. At this time, the preference learning part 17a should just extract a keyword from the program information of the program designated to be recorded by the user, and store the extracted keyword in the preference information database as a favorite keyword.

嗜好学習部１７ａは、ユーザが視聴する番組を、ユーザが嗜好している番組であるとして学習してもよい。このとき嗜好学習部１７ａは、ユーザが視聴している番組すなわち視聴番組の番組情報からキーワードを抽出し、抽出したキーワードを、好みのキーワードとして嗜好情報データベースに格納してもよい。 The preference learning unit 17a may learn that the program viewed by the user is a program that the user likes. At this time, the preference learning unit 17a may extract a keyword from the program that the user is viewing, that is, the program information of the viewing program, and store the extracted keyword in the preference information database as a favorite keyword.

視聴番組または録画番組に基づき、ユーザの嗜好を学習するとき、嗜好学習部１７ａは、ユーザが視聴する番組の番組情報、または録画する番組の番組情報に非常に高い割合で含まれているキーワードを、好みのキーワードとして学習してもよい。 When learning the user's preference based on the viewing program or the recorded program, the preference learning unit 17a includes keywords included in the program information of the program viewed by the user or the program information of the program to be recorded at a very high rate. You may learn as a favorite keyword.

優先度設定部１４ａは、図９に示すように、キーワードに優先度を設定してもよい。すなわち、ユーザが番組を視聴または録画するごとに、この番組の番組情報に含まれているキーワードのポイントを増やす。このとき、番組推薦部１８ａは、ポイントがより高い番組を、ユーザがより好んでいる番組であると判定する。 As shown in FIG. 9, the priority setting unit 14a may set a priority for a keyword. That is, every time a user views or records a program, the keyword points included in the program information of the program are increased. At this time, the program recommendation unit 18a determines that a program with a higher point is a program that the user likes more.

ユーザが視聴するごとに、キーワードのポイントを増加するとき、視聴する回数がより多い番組のキーワードに、より高い優先度が設定され、有利になる。すなわち番組推薦部１８ａは、ポイントがより高いキーワードを抽出し、この抽出したキーワードを含んでいる番組をユーザに推薦する。 When the number of keyword points is increased each time a user views, a higher priority is set for a keyword of a program that is viewed more frequently, which is advantageous. That is, the program recommendation unit 18a extracts a keyword having a higher point, and recommends a program including the extracted keyword to the user.

このように嗜好学習部１７ａは、ユーザが視聴した番組の履歴、または録画した番組の履歴に基づき、ユーザの嗜好を学習する。このとき嗜好学習部１７ａは、さらに、番組をグループに選択するときに、ユーザが選択したキーワードを、ユーザの嗜好する情報として使用することによって、ユーザが嗜好する情報を、より精度よく学習できる。 In this way, the preference learning unit 17a learns the user's preference based on the history of the program viewed by the user or the history of the recorded program. At this time, the preference learning unit 17a can learn the information preferred by the user more accurately by using the keyword selected by the user as information preferred by the user when selecting a program as a group.

たとえば、ユーザが、出演者カテゴリに属するキーワード「俳優Ａ」および「俳優Ｂ」を含んでいる番組情報が表す番組を、複数回、録画したとする。この番組Ａのタイトルは「番組Ａ」であり、ジャンルはドラマである。嗜好学習部１７ａは、ユーザがこの番組Ａの録画を指示するごとに、タイトルの番組Ａと、ジャンルのドラマと、出演者の俳優Ａおよび俳優Ｂとに、それぞれ、ポイントを加算する。 For example, it is assumed that the user has recorded the program represented by the program information including the keywords “Actor A” and “Actor B” belonging to the performer category a plurality of times. The title of the program A is “program A” and the genre is drama. Each time the user instructs recording of the program A, the preference learning unit 17a adds points to the title program A, the genre drama, and actors A and B of the performers.

嗜好学習部１７ａは、番組および番組のキーワードに設定されている優先度を、どちらも同じようにより高くする。これにより、番組推薦部１８ａは、俳優Ａをキーワードとして含んでいる番組、および俳優Ｂをキーワードとして含んでいる番組とを、どちらもユーザに推薦する。 The preference learning unit 17a increases the priority set for the program and the keyword of the program in the same manner. Thereby, the program recommendation unit 18a recommends both a program including the actor A as a keyword and a program including the actor B as a keyword to the user.

ここで、ユーザが、俳優Ａを好んでいるが、俳優Ｂを好んでいないとする。このとき、俳優Ａが出演する番組は、ユーザが好んでいない番組である可能性が高い。しかし、ユーザが録画した番組の履歴、および視聴した番組の履歴に基づくなら、番組情報が複数のキーワードを含んでいるとき、ユーザがどのキーワードを好んでいるのかを嗜好学習部１７ａが学習ことは、困難である。
（キーワードの優先度と番組の推薦）
「番組Ａ」をキーワードとして含む複数の番組が、コンテンツデータベース５に格納されているとき、これらの番組のうち、番組Ａをユーザが選択し、グループに分類するように番組分類装置１ａに指示したとする。このとき、ユーザが選択できるキーワード候補には、「俳優Ａ」と、「俳優Ｂ」と、番組Ａのタイトルと、番組Ａのジャンル（ドラマ）とがある。このとき、ユーザは、たとえば、キーワード候補として、俳優Ｂを選択する。これにより、優先度設定部１４ａは、「俳優Ｂ」のポイントを、「俳優Ａ」のポイントよりもより高くする。番組分類装置１ａでは、ユーザが俳優Ｂを選択したときに、ポイントに加算される値が、あらかじめ決定されている。 Here, it is assumed that the user likes the actor A but does not like the actor B. At this time, the program in which the actor A appears is likely to be a program that the user does not like. However, based on the history of the program recorded by the user and the history of the program viewed, the preference learning unit 17a learns which keyword the user likes when the program information includes a plurality of keywords. ,Have difficulty.
(Keyword priority and program recommendation)
When a plurality of programs including “program A” as a keyword are stored in the content database 5, the user selects the program A from these programs and instructs the program classification apparatus 1 a to classify it into a group. And At this time, keyword candidates that can be selected by the user include “actor A”, “actor B”, the title of program A, and the genre (drama) of program A. At this time, the user selects actor B as a keyword candidate, for example. Thereby, the priority setting unit 14a makes the point of “Actor B” higher than the point of “Actor A”. In the program classification device 1a, when the user selects the actor B, a value to be added to the points is determined in advance.

ユーザがキーワード候補として「俳優Ｂ」を選択することによって、嗜好学習部１７ａは、俳優Ｂのポイントを、俳優Ａのポイントよりもより高くできる。このとき、番組分類装置１ａは、ポイントがより高い番組を、ユーザに推薦するとする。このとき、番組推薦部１８ａは、俳優Ｂをキーワードとして含んでいる番組情報が表す番組を、俳優Ａをキーワードとして含んでいる番組情報が表す番組よりも、ユーザに優先的に推薦する。これにより、俳優Ａが出演する番組をユーザに推薦することを、抑えることができる。 When the user selects “actor B” as a keyword candidate, the preference learning unit 17a can make the actor B's point higher than the actor A's point. At this time, the program classification device 1a recommends a program with a higher point to the user. At this time, the program recommendation unit 18a preferentially recommends the program represented by the program information including the actor B as a keyword to the user over the program represented by the program information including the actor A as a keyword. Thereby, it can suppress recommending to the user the program in which actor A appears.

このように、嗜好学習部１７ａは、ユーザが録画した番組の履歴、およびユーザが視聴した番組の履歴に加え、番組分類装置１ａが番組を分類するときにユーザが指定したキーワードを学習する。したがって、番組推薦部１８ａは、学習した情報に基づき番組を推薦することによって、番組をユーザにより精度よく推薦できる。
（表示画面の例）
コンテンツ分類装置１が番組をグループに分類するときにディスプレイ４ａに表示する表示画面の例について、図１６を参照して以下に説明する。図１６は、グループに分類する番組をユーザが選択するときに、ディスプレイ４ａに表示する表示画面の例を表す図であり、（ａ）は、録画した番組の一覧を表示している表示画面であり、（ｂ）はユーザが番組Ａを選択している表示画面を表す図であり、（ｃ）は作成するグループをユーザに選択させるための表示画面を表す図であり、（ｄ）はユーザが選択した番組をユーザが選択したグループに分類したことを表示する表示画面を表す図である。 In this way, the preference learning unit 17a learns the keyword specified by the user when the program classification device 1a classifies the program in addition to the history of the program recorded by the user and the history of the program viewed by the user. Therefore, the program recommendation unit 18a can recommend the program with higher accuracy by recommending the program based on the learned information.
(Example of display screen)
An example of a display screen displayed on the display 4a when the content classification device 1 classifies programs into groups will be described below with reference to FIG. FIG. 16 is a diagram illustrating an example of a display screen displayed on the display 4a when the user selects a program to be classified into groups. FIG. 16A is a display screen displaying a list of recorded programs. (B) is a diagram showing a display screen on which the user has selected program A, (c) is a diagram showing a display screen for allowing the user to select a group to be created, and (d) is a diagram showing the user It is a figure showing the display screen which displays that the program selected by was classified into the group which the user selected.

この図によれば、まず、表示部３ａは、ディスプレイ４ａに、録画した番組の一覧を表示する。このとき、各番組の番組情報から、番組のキーワードを抽出し、同時に表示している。たとえば、番組のタイトル、録画日時、ジャンル、出演者、内容説明を、表示している。 According to this figure, first, the display unit 3a displays a list of recorded programs on the display 4a. At this time, the keyword of the program is extracted from the program information of each program and displayed simultaneously. For example, the program title, recording date / time, genre, performer, and description of contents are displayed.

このとき、ユーザが分類メニューを選択することによって、表示部３ａは、図１６（ｂ）に示す表示画面をディスプレイ４ａに表示する。ユーザは、この表示画面に表示されている番組から、グループに分類したい番組を選択する。図１６（ｂ）の例では、ユーザは「番組Ａ」を選択している。 At this time, when the user selects the classification menu, the display unit 3a displays the display screen shown in FIG. 16B on the display 4a. The user selects a program to be classified into a group from the programs displayed on the display screen. In the example of FIG. 16B, the user has selected “program A”.

キーワード抽出部１１ａは、ユーザが選択した番組Ａの番組情報から、グループの名前に設定できるキーワードを複数抽出する。これにより、表示部３ａは、図１６（ｃ）に示す表示画面をディスプレイ４ａに表示する。ユーザは、この表示画面に選択されているキーワードの中から、グループの名前に設定したいキーワードを選択する。図１６（ｃ）の例では、ユーザは、キーワード「俳優Ｂ」を選択している。 The keyword extraction unit 11a extracts a plurality of keywords that can be set as the group name from the program information of the program A selected by the user. Thereby, the display part 3a displays the display screen shown in FIG.16 (c) on the display 4a. The user selects a keyword to be set as the group name from the keywords selected on the display screen. In the example of FIG. 16C, the user has selected the keyword “actor B”.

ユーザがキーワードを選択することによって、表示部３ａは、図１６（ｄ）に示す表示画面を、ディスプレイ４ａに表示する。すなわち、表示部３ａは、番組Ａをグループ「俳優Ｂ」に分類したことを示す表示画面を表示する。
（カテゴリが異なる２つのキーワードを使用する）
以下では、１つの番組の番組情報が、キーワードとして、番組のタイトル、番組のおよび番組の内容説明を含んでいる例について、説明する。番組分類装置１ａは、複数の番組を処理する。これら複数の番組をそれぞれ表す複数の番組情報は、それぞれ、キーワードとしてプロ野球のチームを示す「チームＡｌを含んでいる。これにより、番組関連性判定部１２ａは、チームＡをキーワードとしてマッチングを判定することによって、このキーワードを含んでいる番組情報が表す番組が、複数マッチングすることを判定する。 When the user selects a keyword, the display unit 3a displays a display screen shown in FIG. 16D on the display 4a. That is, the display unit 3a displays a display screen indicating that the program A is classified into the group “actor B”.
(Use two keywords with different categories)
Hereinafter, an example in which program information of one program includes a program title, a program description, and a program content description as keywords will be described. The program classification device 1a processes a plurality of programs. Each of the plurality of program information representing each of the plurality of programs includes “team Al” indicating a professional baseball team as a keyword. Thereby, the program relevance determination unit 12a determines matching using the team A as a keyword. By doing so, it is determined that a plurality of programs represented by the program information including the keyword are matched.

番組関連性判定部１２ａは、さらに、キーワード「チームＡ」によってマッチングする複数の番組の中から、別のキーワードによってマッチングするコンテンツを絞り込む。このとき、これらの番組を表す複数の番組情報から、他のカテゴリ、たとえば番組のジャンルに属するキーワードを、それぞれ抽出する。グループ作成部１５ａは、ジャンルから抽出したキーワードに共通する文字列が名前に設定されるグループを作成する。 The program relevance determination unit 12a further narrows down contents to be matched by another keyword from among a plurality of programs matched by the keyword “Team A”. At this time, keywords belonging to other categories, for example, program genres, are extracted from a plurality of pieces of program information representing these programs. The group creation unit 15a creates a group in which a character string common to the keywords extracted from the genre is set as the name.

このように番組関連性判定部１２ａは、キーワード「チームＡ」がマッチングし、さらに、キーワード「野球」がマッチングする番組を、互いに関連していると判定する。 As described above, the program relevance determination unit 12a determines that the programs matched by the keyword “team A” and further matched by the keyword “baseball” are related to each other.

これによりグループ作成部１５ａは、番組のジャンルが「野球」であり、かつ番組の内容説明が「チームＡ」であるグループを作成する。このときグループ作成部１５ａは、名前が「チームＡ」であるグループを作成すればよい。または、グループ作成部１５ａは、名前が「野球−チームＡ」であるグループを作成してもよい。すなわち、グループ作成部１５ａは、マッチングに使用したキーワードのいずれか１つを名前に設定したグループを作成すればよい。または、マッチングに使用した各キーワードを、ハイフンなどの所定の記号によって連結したテキストを名前に設定したグループを作成してもよい。 Thereby, the group creation unit 15a creates a group whose program genre is “baseball” and whose program description is “team A”. At this time, the group creating unit 15a may create a group whose name is “Team A”. Alternatively, the group creation unit 15a may create a group whose name is “baseball-team A”. That is, the group creation unit 15a may create a group in which any one of keywords used for matching is set as a name. Or you may create the group which set the text which connected each keyword used for matching by predetermined symbols, such as a hyphen, to the name.

コンテンツ分類部１３は、作成されたグループに、キーワード「チームＡ」およびキーワード「野球」の両方によってマッチングする番組のみを分類する。したがって、チームＡのみによってマッチングする番組、および野球のみによってマッチングする番組を、いずれも、このグループには分類しない。さらに、グループ作成部１５ａは、複数の番組が、チームＡのみによってマッチングするときには、チームＡが名前に設定されるグループを作成しない。
（グループを作成しない例）
番組分類装置１ａを使用するユーザは、番組を予約するとき、予約する番組を分類するグループを指定しなくてもよい。一方、従来技術の装置では、ユーザは、番組を予約するごとに、どのグループに分類するか否かを指定する必要がある。すなわち、番組分類装置１ａは、ユーザにグループを指定させることなく、番組をグループに自動的に分類できる。 The content classification unit 13 classifies only programs that match the created group with both the keyword “team A” and the keyword “baseball”. Therefore, neither a program matching only by team A nor a program matching only by baseball is classified into this group. Furthermore, when a plurality of programs are matched only by team A, the group creation unit 15a does not create a group in which team A is set as a name.
(Example of not creating a group)
When a user who uses the program classification device 1a reserves a program, it does not have to specify a group for classifying the program to be reserved. On the other hand, in the prior art apparatus, the user needs to specify which group to classify each time a program is reserved. In other words, the program classification device 1a can automatically classify programs into groups without allowing the user to designate groups.

さらに、番組分類装置１ａは、番組を表す番組情報に含まれている、番組の内容を表すキーワードに基づき、番組を既存のグループに分類する。または、新たなグループを作成し、作成したグループに番組を分類する。すなわち、グループ作成部１５ａは、番組を分類するため必要なグループのみを作成する。したがって、作成したグループに分類されていない番組を、ルートディレクトリに配置する。これにより、ユーザは、グループに分類されている番組を、グループを介することなく直接に検索できる。 Further, the program classification device 1a classifies programs into existing groups based on keywords representing program contents included in program information representing programs. Alternatively, a new group is created and programs are classified into the created group. That is, the group creation unit 15a creates only a group necessary for classifying programs. Therefore, programs that are not classified into the created group are placed in the root directory. Thereby, the user can directly search for programs classified into groups without going through the groups.

全ての番組をグループに分類するなら、ユーザは、番組を選択するとき、この番組が分類されているグループを、必ず一度は指定する必要がある。コンテンツ分類装置１では、ユーザは、その手順を省くことができ、番組を直接に選択できる。たとえば、ユーザが録画したあとすぐに削除する番組は、グループに分類する必要がない。そこで、コンテンツ分類装置１は、このような番組をグループに分類しない。
（複数のグループに番組を分類する例）
番組分類装置１ａは、１つの番組を、異なる複数のグループに分類できる。このとき、番組分類装置１ａは、番組を表す複数のキーワードをマッチングに使用することによって、あるキーワードによってマッチングする場合、このキーワードが名前に設定されるグループに、この番組を分類し、他のキーワードによってマッチングする場合、このキーワードが名前に設定されている他のグループに、この番組を分類する。このときユーザは、複数のグループから、１つの番組を検索できる。したがって、１つの番組が１つのグループに分類されている場合に比べ、番組を検索しやすい。 If all programs are classified into groups, the user must always designate a group into which the programs are classified once when selecting a program. In the content classification device 1, the user can omit the procedure and can directly select a program. For example, it is not necessary to classify programs that are deleted immediately after being recorded by the user. Therefore, the content classification device 1 does not classify such programs into groups.
(Example of classifying programs into multiple groups)
The program classification device 1a can classify one program into a plurality of different groups. At this time, the program classification device 1a uses a plurality of keywords representing programs for matching, and when matching by a certain keyword, the program classification device 1a classifies the program into a group in which the keyword is set as a name, When matching by, this program is classified into other groups in which this keyword is set in the name. At this time, the user can search for one program from a plurality of groups. Therefore, it is easier to search for programs than when one program is classified into one group.

コンテンツ分類装置１は、コンテンツデータベース５に現在格納している複数の番組から、それぞれキーワードを抽出し、これらのキーワードに比較する文字が名前に設定されるグループを作成する。これにより、番組分類装置１ａはコンテンツデータベース５に現在格納している番組を分類するために最適なグループを作成できる。
（従来例の問題）
一方、従来例で、ユーザが、映画番組を予約するとき、ジャンルの項目で「映画」というグループに、番組を格納することを指定したとする。その後、映画番組を複数予約し、同じように「映画」というグループに格納していくと、「映画」のグループの中に番組がたくさん格納されることになる。従来例では、ユーザが、曜日などの項目も予約時に選択しておけば、装置は、「映画」のグループの中でさらに「曜日」についての小グループを作成する。しかし、そうでない場合は、装置は、「映画」グループの中にたくさん番組を格納する。したがって、ユーザは番組を検索しにくい。 The content classification device 1 extracts keywords from a plurality of programs currently stored in the content database 5 and creates groups in which characters to be compared with these keywords are set in names. As a result, the program classification device 1a can create an optimal group for classifying the programs currently stored in the content database 5.
(Problem of conventional example)
On the other hand, in the conventional example, when a user reserves a movie program, the user specifies that the program is stored in the group “movie” in the item of genre. Thereafter, if a plurality of movie programs are reserved and stored in the group “movie” in the same manner, many programs are stored in the group “movie”. In the conventional example, if an item such as a day of the week is selected by the user at the time of reservation, the apparatus creates a small group for “day of the week” in the “movie” group. However, if this is not the case, the device stores many programs in the “movie” group. Therefore, it is difficult for the user to search for a program.

逆に、ユーザが、全部の項目（タイトル、曜日、時間など）を選んでしまうと、装置は、「映画」のグループの中の番組が少ないうちも「映画」から「曜日」というように、まずグループを選択しなければならない。そのため、検索の手続きが増える。このように、ユーザが予約時に項目を選択する方法では、予約時には「映画」のグループに入る番組がどれだけ増えるかがわからない。したがって、ユーザが、予約時に的確に項目を選択するのは困難である。
（本願装置の有利な点）
番組分類装置１ａは、「映画」のグループの中の番組が増えて、ユーザが「映画」の中で小グループを作りたいときにも、対応できる。「映画」の中のある番組をユーザが選択すると、番組分類装置１ａは、「映画」の中で選択された番組と番組データで共通するキーワードをもつ番組を抽出し、グループを作成する。番組データには出演者、映画監督などの情報がある。そこで、たとえばユーザが同じ監督の映画をよく録画していれば、番組分類装置１ａは、その監督の映画のグループを作成することができる。したがって、番組分類装置１ａは、分類データベース６に格納している番組にあわせて、グループを作成することができる。 Conversely, if the user selects all items (title, day of the week, time, etc.), the device will change from “movie” to “day of the week” while there are few programs in the “movie” group. You must first select a group. This increases the number of search procedures. As described above, in the method in which the user selects an item at the time of reservation, it is not possible to know how much more programs will enter the “movie” group at the time of reservation. Therefore, it is difficult for the user to select an item accurately at the time of reservation.
(Advantages of the present device)
The program classification device 1a can cope with a case where the number of programs in the “movie” group increases and the user wants to create a small group in the “movie”. When a user selects a certain program in “movie”, the program classification device 1a extracts a program having a keyword common to the program selected in the “movie” and the program data, and creates a group. The program data includes information such as performers and movie directors. Therefore, for example, if the user often records movies of the same director, the program classification device 1a can create a group of movies of the director. Therefore, the program classification device 1a can create a group in accordance with the programs stored in the classification database 6.

番組分類装置１ａは、キーワードをマッチングするとき、キーワード間で文字が一致する部分を抽出することによって、キーワード同士の距離を算出する。算出した距離が、閾値より近いかどうかによってマッチングするかどうかを決める。このようにすることによって、キーワード間の表記のゆらぎに対応できる。 When matching a keyword, the program classification device 1a calculates a distance between the keywords by extracting a portion where characters match between the keywords. Whether or not to match is determined by whether or not the calculated distance is closer than a threshold value. By doing so, it is possible to deal with fluctuations in the notation between keywords.

たとえば、シリーズ番組などの番組タイトルには、「刑事サスペンスドラマ１」、「刑事サスペンスドラマ２」のようにシリーズ番号が付与されることがある。このような場合でも、番組分類装置１ａは、これらのグループのタイトルを、同タイトルとみなすことによって、比較するタイトルでグループを作成することができる。さらに、番組情報に含まれている番組タイトルには、省略された表記や、表記のゆらぎがあるが、番組分類装置１ａは、このような場合にも対応できる。 For example, a series number such as “criminal suspense drama 1” or “criminal suspense drama 2” may be given to a program title such as a series program. Even in such a case, the program classification device 1a can create groups with titles to be compared by regarding the titles of these groups as the same titles. Further, the program title included in the program information includes omitted notations and notation fluctuations, but the program classification device 1a can cope with such cases.

キーワードが互いにマッチングするかどうかは、キーワード間の距離の閾値に応じて決まる。閾値を小さくするほど、共通する文字がより多いキーワードがマッチングする。番組分類装置１ａは、作成したグループに対して、ユーザが実行する修正に応じて、閾値を自動的に変更する。たとえば、タイトルが「刑事サスペンスドラマ」として互いに一致する番組を分類しているグループの例を説明する。このグループに、「サスペンス劇場」というタイトルの番組が、キーワード間の距離が閾値内だったために分類されるとする。 Whether or not keywords match each other depends on a threshold value of the distance between keywords. As the threshold value is decreased, keywords with more common characters are matched. The program classification device 1a automatically changes the threshold for the created group in accordance with the correction performed by the user. For example, an example of a group that classifies programs that match each other with the title “criminal suspense drama” will be described. It is assumed that a program titled “Suspense Theater” is classified into this group because the distance between keywords is within the threshold.

この場合、そのグループから、ユーザの指定した、「サスペンス劇場」というタイトルの番組を取り除いたとき、番組分類装置１ａは、そのグループの閾値を、この番組が残される前において閾値よりも、小さくする。これにより、番組分類装置１ａは、「サスペンス劇場」というタイトルの番組が、そのグループに再び分類されることを防止できる。 In this case, when the program titled “Suspense Theater” designated by the user is removed from the group, the program classification device 1a makes the threshold of the group smaller than the threshold before the program is left. . Thereby, the program classification device 1a can prevent the program titled “Suspense Theater” from being reclassified into the group.

番組を分類するとき、作成できるグループが複数あるなら、番組分類装置１ａは、ユーザの指定したグループを作成できる。番組分類装置１ａは、ユーザが選択したグループのキーワードを、ユーザが好んでいるキーワードとして学習する。これにより、番組分類装置１ａは、学習したキーワードに基づき、ユーザに番組を推薦できる。 When a program is classified, if there are a plurality of groups that can be created, the program classification device 1a can create a group designated by the user. The program classification device 1a learns the keywords of the group selected by the user as keywords that the user likes. Thereby, the program classification | category apparatus 1a can recommend a program to a user based on the learned keyword.

一方、ユーザの嗜好に関する情報をあらかじめ記憶するなら、作成可能なグループからユーザが１つのグループを選択したとき、番組分類装置１ａはこの情報を使用する。これにより、コンテンツ分類装置１は、ユーザが嗜好しているキーワードを名前として有するグループを作成できる。
（コンテンツ分類装置１が実行する各処理の流れ）
以上に、コンテンツ分類装置１および番組分類装置１ａの詳細について説明した。つぎに、コンテンツ分類装置１が実行する処理の流れについて、図１７から図２１に示す各フローチャートを参照して以下に説明する。 On the other hand, if information related to user preferences is stored in advance, when the user selects one group from the groups that can be created, the program classification device 1a uses this information. Thereby, the content classification device 1 can create a group having the keyword that the user likes as a name.
(Flow of each process executed by the content classification device 1)
The details of the content classification device 1 and the program classification device 1a have been described above. Next, the flow of processing executed by the content classification device 1 will be described below with reference to the flowcharts shown in FIGS.

まず、コンテンツ分類装置１が、入力されるコンテンツを既存のグループに追加する例について、図１７を参照して以下に説明する。図１７は、入力されたコンテンツをコンテンツ分類装置１がグループに分類するときの処理の流れを表すフローチャートである。 First, an example in which the content classification device 1 adds input content to an existing group will be described below with reference to FIG. FIG. 17 is a flowchart showing the flow of processing when the content classification device 1 classifies the input content into groups.

ユーザは、コンテンツ分類装置１にコンテンツを入力する（ステップＳ１１）。キーワード抽出部１１は、入力されるコンテンツの内容を表すメタ情報から、所定のキーワードを抽出する（ステップＳ１２）。コンテンツ関連性判定部１２は、キーワード抽出部１１が抽出したキーワード同士がマッチングするか否かを判定する（ステップＳ１３）。キーワードがマッチングするとき、グループ作成部１５は、コンテンツをグループに分類する（ステップＳ１４）。グループ作成部１５は、さらに、分類データベース６において、分類データを更新する。すなわち、追加したコンテンツに関する情報を、分類データベース６に格納する。さらに、コンテンツを分類したグループに、グループ分類フラグを立てる。 The user inputs content into the content classification device 1 (step S11). The keyword extraction unit 11 extracts a predetermined keyword from the meta information representing the content of the input content (Step S12). The content relevance determination unit 12 determines whether or not the keywords extracted by the keyword extraction unit 11 match each other (step S13). When the keywords match, the group creation unit 15 classifies the contents into groups (step S14). The group creation unit 15 further updates the classification data in the classification database 6. That is, information regarding the added content is stored in the classification database 6. Further, a group classification flag is set for the group into which the contents are classified.

コンテンツ分類装置１が、ユーザの指定したグループを、コンテンツ分類装置１が削除するときの処理の流れについて、図１８を参照して以下に説明する。図１８は、ユーザが指示したグループをコンテンツ分類装置１が削除し、さらに、グループの名前を変更するときの処理の流れを表すフローチャートである。この図によれば、ユーザは、グループを削除することを要求する指示を、コンテンツ分類装置１に入力する（ステップＳ２１）。グループ削除部１６は、ユーザが指示したグループを削除する（ステップＳ２２）。嗜好学習部１７は、グループ削除部１６が削除したグループの名前を学習する（ステップＳ２４）。さらに、コンテンツ管理部１０は、分類データベース６において、分類データを更新する。 The flow of processing when the content classification device 1 deletes the group specified by the user will be described below with reference to FIG. FIG. 18 is a flowchart showing the flow of processing when the content classification device 1 deletes a group designated by the user and further changes the name of the group. According to this figure, the user inputs an instruction for requesting deletion of a group to the content classification apparatus 1 (step S21). The group deletion unit 16 deletes the group designated by the user (step S22). The preference learning unit 17 learns the name of the group deleted by the group deletion unit 16 (step S24). Further, the content management unit 10 updates the classification data in the classification database 6.

コンテンツ分類装置１が、キーワードを学習するときの処理の流れについて、図１９を参照して以下に説明する。図１９は、ユーザまたはコンテンツ分類装置１が選択したキーワードにコンテンツ分類装置１がより高い優先度を設定し、さらに、ユーザが選択したキーワードをコンテンツ分類装置１が学習するときの処理の流れを表す図である。 The flow of processing when the content classification device 1 learns keywords will be described below with reference to FIG. FIG. 19 shows a flow of processing when the content classification device 1 sets a higher priority to the keyword selected by the user or the content classification device 1 and the content classification device 1 learns the keyword selected by the user. FIG.

この図によれば、ユーザまたはコンテンツ分類装置１が、グループに分類するコンテンツを選択する（ステップＳ３１）。キーワード抽出部１１は、選択されたコンテンツの内容を表すメタ情報から、所定のキーワードを抽出する（ステップＳ３２）コンテンツ関連性判定部１２は、抽出されたキーワードに設定されている優先度に基づき、マッチングに使用するキーワードを決定する（ステップＳ３３）。コンテンツ関連性判定部１２は、複数のキーワードがマッチングするか否かを判定する（ステップＳ３４）。嗜好学習部１７は、ユーザがキーワードを選択したのか否かを判定する（）。ステップＳ３５において判定結果が「真」であるとき（Ｙｅｓ）、嗜好学習部１７は、ユーザが選択したキーワードを学習する（ステップＳ３６）。さらに、コンテンツ管理部１０は、分類データベース６において、分類データを更新する。 According to this figure, the user or content classification device 1 selects content to be classified into groups (step S31). The keyword extraction unit 11 extracts a predetermined keyword from the meta information representing the content of the selected content (Step S32). The content relevance determination unit 12 is based on the priority set for the extracted keyword. A keyword used for matching is determined (step S33). The content relevance determination unit 12 determines whether or not a plurality of keywords match (step S34). The preference learning unit 17 determines whether or not the user has selected a keyword (). When the determination result is “true” in step S35 (Yes), the preference learning unit 17 learns the keyword selected by the user (step S36). Further, the content management unit 10 updates the classification data in the classification database 6.

コンテンツ分類装置１が、グループにあらかじめ定められた閾値を、より適切な値に変更するときの処理の流れについて、図２０を参照して以下に説明する。図２０は、グループにあらかじめ定められた閾値をコンテンツ分類装置１が変更するときの処理の流れを表すフローチャートである。 The flow of processing when the content classification device 1 changes the threshold predetermined for the group to a more appropriate value will be described below with reference to FIG. FIG. 20 is a flowchart showing the flow of processing when the content classification device 1 changes the threshold value predetermined for the group.

この図によれば、ユーザは、グループの閾値を変更する要求をコンテンツ分類装置１に指示する（）。コンテンツ関連性判定部１２は、グループに分類されているコンテンツのキーワードをマッチングに使用することによって、キーワード間の距離を算出し、グループの閾値を変更する。さらに、必要があれば、グループ作成部１５は、グループの名前を変更する（ステップＳ４３）。さらに、コンテンツ管理部１０は、分類データベース６において、分類データを更新する。
コンテンツ分類装置１が、コンテンツの現在の分類構成をディスプレイ４に表示するときの処理の流れについて、図２１を参照して以下に説明する。図２１は、ユーザの指示に応じてコンテンツ分類装置１がディスプレイ４にコンテンツの分類状況を表示するときの処理の流れを表すフローチャートである。 According to this figure, the user instructs the content classification device 1 to make a request to change the group threshold (). The content relevance determination unit 12 calculates the distance between the keywords by using the keywords of the content classified into the group for matching, and changes the threshold value of the group. Furthermore, if necessary, the group creation unit 15 changes the name of the group (step S43). Further, the content management unit 10 updates the classification data in the classification database 6.
The flow of processing when the content classification device 1 displays the current classification configuration of content on the display 4 will be described below with reference to FIG. FIG. 21 is a flowchart showing the flow of processing when the content classification device 1 displays the content classification status on the display 4 in accordance with a user instruction.

この図によれば、ユーザは、コンテンツおよびグループをディスプレイ４に表示することを、コンテンツ分類装置１に要求する（ステップＳ５１）。これにより、表示部３は、分類データベース６にアクセスすることによって、グループの現在の分類構成を取得し、ディスプレイ４に表示する（ステップＳ５２）。 According to this figure, the user requests the content classification apparatus 1 to display the content and group on the display 4 (step S51). Thereby, the display part 3 acquires the present classification structure of a group by accessing the classification database 6, and displays it on the display 4 (step S52).

また、本発明は、以下に示す構成のものとしても実現できる。
（第１の構成）
複数のコンテンツとコンテンツメタ情報を記憶する記憶手段を有し、記憶手段に記憶されたコンテンツを選択する選択手段と、選択されたコンテンツのコンテンツメタ情報と共通するコンテンツメタ情報をもつコンテンツを探索する手段と、前記共通するコンテンツメタ情報をもつコンテンツによって複数のコンテンツを分類することを特徴とする情報処理装置。
（第２の構成）
第１の構成において、コンテンツメタ情報からキーワードを抽出するキーワード抽出手段をもち、キーワードのマッチングが可能かどうかによって、コンテンツメタ情報が共通するかどうかを判断することを特徴とする情報処理装置。
（第３の構成）
第２の構成において、マッチング可能かどうかは、キーワード間で文字が一致する部分を抽出し、キーワード間の距離を算出し、ある任意の閾値より近い距離にあるキーワード同士はマッチすると判断することを特徴とする情報処理装置。
（第４の構成）
第３の構成において、一旦マッチング可能としてコンテンツを分類したのち、ユーザによって分類の修正が行われた場合、閾値の修正を行うことを特徴とする情報処理装置。
（第５の構成）
第２の構成において、複数のコンテンツメタ情報に共通に含まれるキーワードのうち、優先度の高いキーワードによって分類することを特徴とする情報処理装置。
（第６の構成）
第５の構成において、ユーザがキーワードを選択する手段をもち、キーワードの優先度をユーザの選択結果によって決定することを特徴とする情報処理装置。
（第７の構成）
第５の構成において、ユーザ情報記憶手段をもち、キーワードの優先度をユーザ情報によって決定することを特徴とする情報処理装置。
（第８の構成）
第５の構成において、キーワードの優先度を他のコンテンツの分類構成によって決定することを特徴とする情報処理装置。
（第９の構成）
第２の構成において、あるキーワード１が複数のコンテンツのコンテンツメタ情報でマッチングされる場合、同時に別のキーワード２でもマッチング可能なときのみ、キーワード１を分類に利用できるといった、キーワードに条件を付けることを特徴とする情報処理装置。
（第１０の構成）
第２の構成において、表示手段をもつ情報処理装置。
（第１１の構成）
第２の構成において、嗜好学習手段をもち、分類に利用されたキーワードを嗜好キーワードとして学習することを特徴とする情報処理装置。
（第１２の構成）
第２の構成において、生成された分類を取消できることを特徴とする情報処理装置。
（第１３の構成）
第１２の構成において、取り消された分類を生成したキーワードを非嗜好キーワードとして学習することを特徴とする情報処理装置。
（第１４の構成）
第２の構成において、コンテンツを分類したのち、ユーザによって分類の修正が行われた場合、分類の名称を変更することを特徴とする情報処理装置。
（第１５の構成）
第１の構成において、番組取得手段と、番組情報取得手段と、番組を分類する手段をもち、分類状況を表示する表示手段をもつことを特徴とする情報処理装置。
（第１６の構成）
第１５の構成において、履歴格納手段をもち、履歴から嗜好を学習する嗜好学習手段をもち、学習した嗜好によって、番組情報に共通に含まれるキーワードの優先度を決定することを特徴とする情報処理装置。
（第１７の構成）
第１６の構成において、学習した嗜好情報でもって番組推薦することを特徴とする情報処理装置。
（第１８の構成）
複数のコンテンツとコンテンツメタ情報を記憶するステップを有し、記憶手段に記憶されたコンテンツを選択するステップと、選択されたコンテンツのコンテンツメタ情報と共通するコンテンツメタ情報をもつコンテンツを探索するステップと、前記共通するコンテンツメタ情報によって複数のコンテンツを分類することを特徴とする情報処理方法。
（第１９の構成）
第１８の構成を実現するプログラム。
（第２０の構成）
第１８の構成のプログラムを格納した記録媒体。 Further, the present invention can also be realized with the following configuration.
(First configuration)
A storage unit that stores a plurality of contents and content meta information; a selection unit that selects content stored in the storage unit; and a search for content having content meta information common to the content meta information of the selected content And an information processing apparatus that classifies a plurality of contents according to content having the common content meta information.
(Second configuration)
In the first configuration, an information processing apparatus having keyword extracting means for extracting a keyword from content meta information, and determining whether or not the content meta information is common depending on whether the keyword can be matched.
(Third configuration)
In the second configuration, whether matching is possible is determined by extracting a portion where characters match between keywords, calculating a distance between keywords, and determining that keywords that are closer than a certain threshold match each other. A characteristic information processing apparatus.
(Fourth configuration)
In the third configuration, the information processing apparatus is characterized in that, after content is classified as being capable of being matched, the threshold is corrected when the classification is corrected by the user.
(Fifth configuration)
An information processing apparatus according to the second configuration, wherein keywords are classified according to a keyword having a high priority among keywords commonly included in a plurality of content meta information.
(Sixth configuration)
An information processing apparatus according to the fifth configuration, characterized in that the user has means for selecting a keyword, and the priority of the keyword is determined based on the selection result of the user.
(Seventh configuration)
In the fifth configuration, an information processing apparatus having user information storage means and determining keyword priorities based on user information.
(Eighth configuration)
The information processing apparatus according to the fifth configuration, wherein the priority of the keyword is determined by a classification configuration of other contents.
(Ninth configuration)
In the second configuration, when a keyword 1 is matched with content meta information of a plurality of contents, a condition is added to the keyword such that the keyword 1 can be used for classification only when matching with another keyword 2 is possible at the same time. An information processing apparatus characterized by the above.
(Tenth configuration)
An information processing apparatus having display means in the second configuration.
(Eleventh configuration)
In the second configuration, an information processing apparatus having preference learning means and learning a keyword used for classification as a preference keyword.
(Twelfth configuration)
In the second configuration, an information processing apparatus capable of canceling a generated classification.
(13th configuration)
In the twelfth configuration, an information processing apparatus that learns a keyword that generates a canceled classification as a non-preferred keyword.
(14th configuration)
In the second configuration, after classifying content, when the classification is corrected by the user, the name of the classification is changed.
(15th configuration)
An information processing apparatus having a program acquisition means, a program information acquisition means, and a means for classifying a program, and a display means for displaying a classification status in the first configuration.
(Sixteenth configuration)
In the fifteenth configuration, the information processing has a history storage unit, has a preference learning unit that learns a preference from a history, and determines the priority of keywords commonly included in program information based on the learned preference apparatus.
(17th configuration)
The information processing apparatus according to the sixteenth configuration, wherein the program is recommended with the learned preference information.
(18th configuration)
A step of storing a plurality of contents and content meta information, a step of selecting content stored in the storage means, and a step of searching for content having content meta information common to the content meta information of the selected content; An information processing method, wherein a plurality of contents are classified according to the common content meta information.
(19th configuration)
A program for realizing the eighteenth configuration.
(20th configuration)
A recording medium storing a program having an eighteenth configuration.

本発明は、テレビ、ハードディスクレコーダ、パーソナルコンピュータなど、番組を受信し、受信した番組を所定のグループに自動的に分類する装置として利用できるる。 The present invention can be used as an apparatus that receives a program and automatically classifies the received program into a predetermined group, such as a television, a hard disk recorder, and a personal computer.

本発明の一実施形態に係るコンテンツ分類装置の詳細を表すブロック図である。It is a block diagram showing the detail of the content classification apparatus which concerns on one Embodiment of this invention. コンテンツ分類装置を実現するコンピュータシステムを示す図である。It is a figure which shows the computer system which implement | achieves a content classification apparatus. コンテンツおよびメタ情報を表すコンテンツデータを説明する図である。It is a figure explaining the content data showing a content and meta information. 分類データベースの詳細を表す図である。It is a figure showing the detail of a classification database. コンテンツ関連性判定部が算出するキーワード間の距離の例を示す図である。It is a figure which shows the example of the distance between the keywords which a content relevance determination part calculates. コンテンツ分類装置が管理しているグループ、および各グループに分類されているコンテンツの例を表す図である。It is a figure showing the example of the group which the content classification apparatus manages, and the content classified into each group. 分類データベースの詳細を表す図である。It is a figure showing the detail of a classification database. ユーザにキーワードを選択させるための表示画面を表す図である。It is a figure showing the display screen for making a user select a keyword. 各キーワードに設定されている優先度の例を表す図である。It is a figure showing the example of the priority set to each keyword. グループおよびコンテンツを表示している表示画面の例を表す図である。It is a figure showing the example of the display screen which is displaying the group and the content. コンテンツを取り除く前および後において、グループに分類できるコンテンツの範囲を表す図である。It is a figure showing the range of the content which can be classified into a group before and after removing a content. グループの階層構造を表す図である。It is a figure showing the hierarchical structure of a group. 本発明の一実施形態に係る番組分類装置の詳細を表すブロック図である。It is a block diagram showing the detail of the program classification | category apparatus which concerns on one Embodiment of this invention. 番組情報の一例を表す図である。It is a figure showing an example of program information. ディスプレイに表示されている、ユーザに推薦する番組の一覧を表す図である。It is a figure showing the list of the programs recommended to a user currently displayed on the display. グループに分類する番組をユーザが選択するときに、ディスプレイに表示する表示画面の例を表す図であり、（ａ）は、録画した番組の一覧を表示している表示画面である、（ｂ）はユーザが番組Ａを選択している表示画面を表す図であり、（ｃ）は作成するグループをユーザに選択させるための表示画面を表す図であり、（ｄ）はユーザが選択した番組をユーザが選択したグループに分類したことを表示する表示画面を表す図である。It is a figure showing the example of the display screen displayed on a display when a user selects the program classified into a group, (a) is a display screen which displays the list of the recorded programs, (b) Is a diagram showing a display screen on which the user has selected program A, (c) is a diagram showing a display screen for allowing the user to select a group to be created, and (d) is a program selected by the user. It is a figure showing the display screen which displays that it classified into the group which the user selected. 入力されたコンテンツをコンテンツ分類装置がグループに分類するときの処理の流れを表すフローチャートである。It is a flowchart showing the flow of a process when a content classification device classify | categorizes the input content into a group. ユーザが指示したグループをコンテンツ分類装置が削除し、さらに、グループの名前を変更するときの処理の流れを表すフローチャートである。It is a flowchart showing the flow of processing when a content classification device deletes a group designated by a user and further changes the name of the group. ユーザまたはコンテンツ分類装置が選択したキーワードにコンテンツ分類装置がより高い優先度を設定し、さらに、ユーザが選択したキーワードをコンテンツ分類装置が学習するときの処理の流れを表す図である。It is a figure showing the flow of a process when a content classification device sets a higher priority to the keyword which the user or the content classification device selected, and also the content classification device learns the keyword which the user selected. グループにあらかじめ定められた閾値をコンテンツ分類装置が変更するときの処理の流れを表すフローチャートである。It is a flowchart showing the flow of a process when a content classification apparatus changes the threshold value predetermined for the group. ユーザの指示に応じてコンテンツ分類装置がディスプレイにコンテンツの分類状況を表示するときの処理の流れを表すフローチャートである。It is a flowchart showing the flow of a process when a content classification apparatus displays the classification status of a content on a display according to a user's instruction | indication.

Explanation of symbols

１コンテンツ分類装置
２入力部（キーワード選択手段、ユーザ情報入力手段、グループ名選択手段）
３表示部
４ディスプレイ
５コンテンツデータベース
６分類データベース
１０コンテンツ管理部
１１キーワード抽出部（キーワード抽出手段）
１２コンテンツ関連性判定部（コンテンツ関連性判定手段、キーワード距離測定手段）
１３コンテンツ分類部（コンテンツ分類手段）
１４優先度設定部（優先度設定手段）
１５グループ作成部
１６グループ削除部（コンテンツ削除手段）
１７嗜好学習部（嗜好学習手段）
１ａ番組分類装置
２ａ入力部（キーワード選択手段、ユーザ情報入力手段、グループ名選択手段）
３ａ表示部
４ａディスプレイ
５ａ番組データベース
６ａ分類データベース
７ａ通信アンテナ
８ａ番組受信部（番組受信手段）
１０ａ番組管理部
１１ａキーワード抽出部（キーワード抽出手段）
１２ａ番組関連性判定部（コンテンツ関連性判定手段）
１３ａ番組分類部（コンテンツ分類手段）
１４ａ優先度設定部（優先度設定手段）
１５ａグループ作成部
１６ａグループ削除部（グループ削除手段）
１７ａ嗜好学習部（嗜好学習手段）
１８ａ番組推薦部（嗜好学習手段）
1 content classification device 2 input unit (keyword selection means, user information input means, group name selection means)
3 Display unit 4 Display 5 Content database 6 Classification database 10 Content management unit 11 Keyword extraction unit (keyword extraction means)
12 Content relevance determination unit (content relevance determination means, keyword distance measurement means)
13 Content classification section (content classification means)
14 Priority setting section (priority setting means)
15 Group creation section 16 Group deletion section (content deletion means)
17 Preference learning part (preference learning means)
1a Program classification device 2a input unit (keyword selection means, user information input means, group name selection means)
3a display unit 4a display 5a program database 6a classification database 7a communication antenna 8a program receiving unit (program receiving means)
10a Program management unit 11a Keyword extraction unit (keyword extraction means)
12a Program relevance determining unit (content relevance determining means)
13a Program classification part (content classification means)
14a Priority setting unit (priority setting means)
15a Group creation unit 16a Group deletion unit (group deletion means)
17a Preference learning unit (preference learning means)
18a Program recommendation section (preference learning means)

Claims

It is determined that the content and the other content are related to each other when the additional information representing the content and the other additional information representing the content of other content different from the content are at least partially matched. Content relevance determining means for
A content classification apparatus comprising: content classification means for classifying the content determined to be related to each other by the content relevance determination means and the other content into the same group.

The additional information includes a keyword representing the content content,
The other additional information includes other keywords representing the contents of the other content,
The content relevance determining means is
2. The content classification apparatus according to claim 1, wherein when it is determined that the keyword and the other keyword are at least partially matched, it is determined that the content and the other content are related to each other. .

The content relevance determining means is
The content according to claim 2, wherein when the distance between the keyword and the other keyword is smaller than a predetermined threshold, it is determined that the content and the other content are related to each other. Classification device.

The content classification apparatus according to claim 2, wherein the name of the same group is a character string common to the keyword and the other keyword.

The content classification apparatus according to claim 2, wherein the category to which the keyword belongs is the same as the category to which the other keyword belongs.

The content relevance determination means further includes a keyword belonging to another category different from the category included in the additional information and another keyword belonging to the other category included in the other additional information. 6. The content classification apparatus according to claim 5, wherein the content classification device determines that the content and the other content are related to each other when at least partially matching.

The content classification apparatus according to claim 2, wherein the keyword is a keyword having a higher priority.

Keyword selection means for accepting selection of the keyword by the user;
The content classification apparatus according to claim 2, further comprising priority setting means for setting a higher priority for the keyword selected by the user.

A group name selection means for receiving the name of the group selected by the user;
3. The content classification apparatus according to claim 2, further comprising priority setting means for setting a higher priority for the keyword including the name of the group received by the group name selection means. .

User information input means for receiving input of information about the user;
Claims further comprising priority setting means for setting a higher priority to a keyword including a character string included in the information related to the user received by the user information input means. Item 3. The content classification device according to Item 2.

The content classification means
8. The content classification apparatus according to claim 7, wherein a character string common to a keyword set with a higher priority and another keyword is set to the name of the group located in a higher hierarchy.

When a hierarchical group exists in advance,
Claims further comprising priority setting means for setting a higher priority to a keyword belonging to the same category as a category to which a keyword including the name of a group located in a higher hierarchy belongs. Item 8. The content classification device according to Item 7.

The content classification apparatus according to claim 1, further comprising group deletion means for deleting the group designated by the user.

A group deleting means for deleting the group instructed by the user;
8. The content classification apparatus according to claim 7, further comprising priority setting means for setting a priority of a keyword including the name of the group deleted by the group deletion means to a lower level.

2. The content classification apparatus according to claim 1, further comprising display means for displaying a result of classification by the content classification means on a predetermined display.

A preference learning unit for learning the keyword selected by the user;
The content classification device according to claim 2, further comprising content recommendation means for recommending content to a user based on the keyword learned by the preference learning unit.

Content adding means for adding new content to the group based on user instructions;
When the new content is added to the group by the content adding means, the maximum of the distance between the keyword representing the content of the new content and the keyword representing the content of the content included in the group in advance Keyword distance measuring means for calculating the distance of
4. The content classification apparatus according to claim 3, further comprising threshold setting means for setting the maximum distance as a new threshold when the maximum distance is larger than a predetermined threshold.

Content deletion means for removing the content instructed by the user from the group;
When the content is removed from the group by the content deletion means, a keyword representing the content of the first content arbitrarily selected from the group and a first content different from the first content arbitrarily selected from the group A keyword distance measuring means for calculating the maximum distance among the distances to other keywords representing the contents of the two contents;
4. The content classification apparatus according to claim 3, further comprising threshold setting means for setting the maximum distance as a new threshold when the maximum distance is smaller than the predetermined threshold. .

A content adding means for adding new content designated by the user to the group;
The content classification means
When the new content is added to the group by the content adding means, a character string common to the keyword representing the content of each content included in the group is set as a new name of the group. The content classification device according to claim 3.

A content deleting means for removing the content instructed by the user from the group;
The content classification means
When the content is removed from the group by the content deletion means, a character string common to the keyword representing the content of each content newly included in the group is newly set as the name of the group The content classification device according to claim 2.

Program receiving means for receiving a predetermined program as the content;
The content classification device according to claim t, further comprising program information acquisition means for acquiring program information representing the contents of the program as the additional information.

The additional information includes a predetermined keyword representing the content of the program,
The other additional information includes a predetermined other keyword representing the content of the other program,
The content relevance determining means is
The content classification apparatus according to claim 21, wherein when it is determined that the keyword and the other keyword are at least partially matched, it is determined that the program and the other program are related to each other. .

Program information storage means for storing the program information representing the program requested by the user for at least one of recording and viewing on a predetermined recording medium;
The apparatus further comprises priority setting means for setting a higher priority to the keyword including the character string included in the program information stored in the recording medium. 22. The content classification device according to 22.

A preference learning unit for learning the keyword including a character string included in the program information stored in the recording medium;
The content classification apparatus according to claim 23, further comprising program recommendation means for recommending a program to a user based on the keyword learned by the preference learning unit.

Keyword selection means for accepting selection of the keyword by the user;
A preference learning unit for learning the keyword selected by the user;
The content classification apparatus according to claim 22, further comprising program recommendation means for recommending a program to a user based on the keyword learned by the preference learning unit.

It is determined that the content and the other content are related to each other when the additional information representing the content and the other additional information representing the content of other content different from the content are at least partially matched. A content relevance determining step,
A content classification comprising a content classification step of classifying the content and the other content into the same group when it is determined that the content and the other content are related to each other Method.

A content classification program for operating the content classification device according to any one of claims 1 to 25 for causing a computer to function as each of the means.

A computer-readable recording medium on which the content classification program according to claim 27 is recorded.