JP2012173813A

JP2012173813A - Content classification device, content classification method, and content classification program

Info

Publication number: JP2012173813A
Application number: JP2011032624A
Authority: JP
Inventors: Masaaki Makino; 正明牧野
Original assignee: JVCKenwood Corp
Current assignee: JVCKenwood Corp
Priority date: 2011-02-17
Filing date: 2011-02-17
Publication date: 2012-09-10
Anticipated expiration: 2031-02-17
Also published as: JP5578368B2

Abstract

PROBLEM TO BE SOLVED: To reduce a deviation in a number of contents among groups when classifying a large number of contents into a plurality of groups on the basis of a feature amount represented as an index.SOLUTION: A content classification device for grouping a plurality of contents into a predetermined number of groups comprises a group creation section that obtains a feature amount represented as an index for each content to be classified, calculates a representative value of feature amounts for each group on the basis of the feature amounts of contents belonging to the group, when the plurality of contents are grouped into a larger number of groups than the predetermined number of groups, and performs combining process of combining groups having adjacent representative values into one group. The group creation section repeats the combining process until the number of groups equals the predetermined number.

Description

本発明は、多数のコンテンツを、指標化された特徴量に基づいて複数のグループに分類するコンテンツ分類装置、コンテンツ分類方法およびコンテンツ分類プログラムに関する。 The present invention relates to a content classification device, a content classification method, and a content classification program for classifying a large number of contents into a plurality of groups based on indexed feature amounts.

近年、画像、音楽、映像等のデジタルコンテンツを、ハードディスクドライブ等の大容量記憶装置に格納し、利用することが広く行なわれている。それに伴って、多数のコンテンツの中から所望のコンテンツを選び出す検索技術や、コンテンツを自動的に分類・整理する技術に対する要望が高まっており、種々の技術が開発されている。 In recent years, digital contents such as images, music, and videos have been widely stored and used in a large-capacity storage device such as a hard disk drive. Along with this, there is an increasing demand for a search technology for selecting desired content from a large number of content and a technology for automatically classifying and organizing content, and various technologies have been developed.

例えば、特許文献１には、音響信号を解析して得られる特徴量を基に、それぞれの楽曲データを印象空間と呼ばれる座標軸上に配置することで、楽曲データの検索を行なう装置が開示されている。 For example, Patent Document 1 discloses an apparatus for searching for music data by arranging each music data on a coordinate axis called an impression space based on a feature amount obtained by analyzing an acoustic signal. Yes.

特許文献１に記載された技術によれば、楽曲データの特徴量を印象空間上の点に変換し、その点からのユークリッド距離が近い楽曲を順次候補曲とすることで、ユーザの所望する楽曲のグループを得ることができる。また、ある楽曲をグループの中心とし、その楽曲からユークリッド距離が近い楽曲を順次候補曲とすることで、楽曲の類似検索を行なうこともできる。なお、楽曲データの特徴量を収集するための技術としては、例えば、特許文献２が挙げられる。 According to the technique described in Patent Document 1, the music data desired by the user is obtained by converting the feature amount of the music data into a point on the impression space and sequentially selecting music having a short Euclidean distance from that point as candidate music. You can get a group of. Further, similarity search of music can be performed by setting a music as the center of a group and sequentially selecting music having a short Euclidean distance from the music as candidate music. An example of a technique for collecting the feature amount of music data is Patent Document 2.

特開２００２−２７８５４７号公報JP 2002-278547 A 特開平６−２９０５７４号公報JP-A-6-290574

ところで、大容量記憶装置に格納された多数のコンテンツを、特徴量にしたがって複数のグループに分類し、その分類されたグループ単位で再生を行ないたいという要求がある。例えば、コンテンツとして楽曲データを想定した場合、カフェなどで、多種多様の楽曲の中から、ゆったりとした曲調の楽曲を選んで再生し続けたいといった状況である。 By the way, there is a demand to classify a large number of contents stored in a large-capacity storage device into a plurality of groups according to the feature amount, and to reproduce the classified groups. For example, when music data is assumed as content, it is a situation where a user wants to continue to select and play music with a relaxed music tone from a wide variety of music at a cafe or the like.

特徴量にしたがって楽曲の分類を行なう場合、ある座標上の１点からユークリッド距離が所定値以内の楽曲をグループ化するといった、文献１に記載の方法を取ると、印象座標軸上に楽曲が密な空間と疎な空間とが存在するため、各々のグループに属する楽曲数に大きな偏りが生じる場合がある。例えば、あるグループには１曲しか楽曲が分類されていないが、別のグループでは３０曲の楽曲が分類されている、といったことが起こり得る。このグループ間の要素数の不均衡は、クラスタの平均を用い、与えられたクラスタ数Ｋ個に分類するＫ平均法のような従来のアルゴリズムを用いた場合にも、同様に起こり得る。 When the music is classified according to the feature amount, if the method described in Document 1 is used, such as grouping music whose Euclidean distance is within a predetermined value from one point on a certain coordinate, the music is dense on the impression coordinate axis. Since there are spaces and sparse spaces, there may be a large bias in the number of songs belonging to each group. For example, it may happen that only one song is classified in one group, but 30 songs are classified in another group. This imbalance in the number of elements between groups can occur in the same manner when using a conventional algorithm such as a K-average method that uses the average of clusters and classifies it into a given number K of clusters.

そこで本発明は、多数のコンテンツを、指標化された特徴量に基づいて複数のグループに分類する際に、グループ間のコンテンツ数の偏りを小さくすることを目的とする。 In view of the above, an object of the present invention is to reduce a deviation in the number of contents between groups when a large number of contents are classified into a plurality of groups based on indexed feature amounts.

上記課題を解決するため、本発明の第１の態様であるコンテンツ分類装置は、複数のコンテンツを所定数のグループへとグループ分けするコンテンツ分類装置において、分類対象のコンテンツそれぞれにおける指標化された特徴量を取得し、前記複数のコンテンツが前記所定数のグループよりも多い数のグループにグループ分けされている状態から、グループに属するコンテンツの特徴量に基づいて各グループの特徴量の代表値を算出し、その代表値が近いグループ同士を１のグループへと結合する結合処理を行なうグループ作成部を備え、前記グループ作成部は、前記結合処理を、グループ数が前記所定数となるまで繰り返すことを特徴とする。
ここで、前記特徴量は、分類対象のコンテンツそれぞれにおける指標化された特徴量ベクトルであり、前記グループの代表値は、前記グループに属するコンテンツの特徴量ベクトルの重心とすることができる。
このとき、前記グループ作成部は、未結合のグループの内の、前記重心の距離が近いグループ同士を結合することができる。
また、前記グループ作成部は、未結合のグループが１つ残っている場合には、前記代表値に基づいていずれかの結合済のグループと結合させることができる。
また、作成グループ数の候補を、前記分類対象のコンテンツ数に基づいて算出し、候補となった作成グループ数の選択を受け付けるグループ数設定部をさらに備え、前記所定数は、前記グループ数設定部が受け付けた作成グループ数とすることができる。
あるいは、作成グループ数の指定を受け付けるグループ数設定部をさらに備え、前記所定数は、指定された前記作成グループ数よりも多い数とするようにしてもよい。
このとき、前記グループ作成部は、グループ数が前記所定数となった後、前記所定数と作成グループ数との差に相当する数のグループを選択し、選択されたグループに属する各コンテンツを、選択されなかったグループのいずれかに振り分けることができる。
上記課題を解決するため、本発明の第２の態様であるコンテンツ分類方法は、複数のコンテンツを所定数のグループへとグループ分けするコンテンツ分類方法において、分類対象のコンテンツそれぞれにおける指標化された特徴量を取得するステップと、前記複数のコンテンツが前記所定数のグループよりも多い数のグループにグループ分けされている状態から、グループに属するコンテンツの特徴量に基づいて各グループの特徴量の代表値を算出するステップと、前記代表値が近いグループ同士を１のグループへと結合する結合処理を行なうグループ作成ステップと、を有し、前記グループステップにおいて、前記結合処理を、グループ数が前記所定数となるまで繰り返すことを特徴とする。
上記課題を解決するため、本発明の第３の態様であるコンテンツ分類プログラムは、複数のコンテンツを所定数のグループへとグループ分けする処理をコンピュータに実行させるコンテンツ分類プログラムにおいて、コンピュータに、分類対象のコンテンツそれぞれにおける指標化された特徴量を取得するステップと、前記複数のコンテンツが前記所定数のグループよりも多い数のグループにグループ分けされている状態から、グループに属するコンテンツの特徴量に基づいて各グループの特徴量の代表値を算出するステップと、前記代表値が近いグループ同士を１のグループへと結合する結合処理を行なうグループ作成ステップと、を実行させ、前記グループステップは、前記結合処理を、グループ数が前記所定数となるまで繰り返すステップであることを特徴とする。 In order to solve the above-mentioned problem, the content classification device according to the first aspect of the present invention is a content classification device that groups a plurality of contents into a predetermined number of groups. And obtaining a representative value of the feature amount of each group based on the feature amount of the content belonging to the group from the state in which the plurality of contents are grouped into a larger number of groups than the predetermined number of groups. And a group creating unit that performs a joining process for joining groups having similar representative values to one group, and the group creating unit repeats the joining process until the number of groups reaches the predetermined number. Features.
Here, the feature amount is an indexed feature amount vector in each content to be classified, and the representative value of the group can be a center of gravity of the feature amount vector of the content belonging to the group.
At this time, the group creation unit can join groups having a short distance of the center of gravity among unjoined groups.
In addition, when one unjoined group remains, the group creating unit can join with any joined group based on the representative value.
In addition, a candidate for the number of creation groups is calculated based on the number of contents to be classified, and further includes a group number setting unit that accepts selection of the number of creation groups that are candidates, and the predetermined number is the group number setting unit Can be the number of created groups accepted.
Alternatively, a group number setting unit that receives designation of the number of created groups may be further provided, and the predetermined number may be larger than the designated number of created groups.
At this time, after the number of groups reaches the predetermined number, the group creation unit selects a number of groups corresponding to the difference between the predetermined number and the number of created groups, and each content belonging to the selected group is Can be assigned to any of the groups that were not selected.
In order to solve the above-described problem, the content classification method according to the second aspect of the present invention is a content classification method in which a plurality of contents are grouped into a predetermined number of groups. A representative value of the feature amount of each group based on the feature amount of the content belonging to the group from the state in which the plurality of contents are grouped into a larger number of groups than the predetermined number of groups And a group creation step for performing a joining process for joining the groups having similar representative values together into one group. In the group step, the joining process is performed when the number of groups is the predetermined number. It repeats until it becomes.
In order to solve the above problem, a content classification program according to a third aspect of the present invention is a content classification program that causes a computer to execute a process of grouping a plurality of contents into a predetermined number of groups. Obtaining the indexed feature amount in each of the contents, and based on the feature amount of the content belonging to the group from the state in which the plurality of contents are grouped into more groups than the predetermined number of groups Calculating a representative value of the feature value of each group, and a group creating step for performing a combining process for combining the groups having similar representative values into one group, wherein the group step includes the combining step Repeat the process until the number of groups reaches the predetermined number. Characterized in that it is a flop.

本発明によれば、複数のコンテンツを、指標化された特徴量に基づいて複数のグループに分類する際に、グループ間のコンテンツ数の偏りを小さくすることができる。 According to the present invention, when a plurality of contents are classified into a plurality of groups based on the indexed feature amounts, the deviation in the number of contents between groups can be reduced.

本発明の第１実施形態に係る楽曲分類装置の構成を示すブロック図である。It is a block diagram which shows the structure of the music classification apparatus which concerns on 1st Embodiment of this invention. 楽曲分類装置と外部記憶装置と再生装置とが回線により接続された状態を示す図である。It is a figure which shows the state by which the music classification | category apparatus, the external storage device, and the reproducing | regenerating apparatus were connected by the line. 特徴量ベクトル格納部の格納形態の一例を示す図である。It is a figure which shows an example of the storage form of a feature-value vector storage part. グループ格納部の格納形態の一例を示す図である。It is a figure which shows an example of the storage form of a group storage part. 楽曲分類装置の動作について説明するフローチャートである。It is a flowchart explaining operation | movement of a music classification apparatus. 作成する楽曲グループ数の選択を受け付けるための画面例である。It is an example of a screen for accepting selection of the number of music groups to be created. 楽曲分類の概要を説明するための図である。It is a figure for demonstrating the outline | summary of a music classification. 第１実施形態における楽曲分類処理の詳細な手順について説明するフローチャートである。It is a flowchart explaining the detailed procedure of the music classification process in 1st Embodiment. 代表値である重心の算出処理について説明する図である。It is a figure explaining the calculation process of the gravity center which is a representative value. 未結合の楽曲グループの結合処理を説明する図である。It is a figure explaining the joining process of an unjoined music group. 楽曲グループ間の距離算出の別例を説明する図である。It is a figure explaining another example of the distance calculation between music groups. 本発明の第２実施形態に係る楽曲分類装置の構成を示すブロック図である。It is a block diagram which shows the structure of the music classification apparatus which concerns on 2nd Embodiment of this invention. 第２実施形態における楽曲グループ作成処理の詳細な手順について説明するフローチャートである。It is a flowchart explaining the detailed procedure of the music group creation process in 2nd Embodiment. 楽曲グループの分解処理について説明するフローチャートである。It is a flowchart explaining the decomposition | disassembly process of a music group. 楽曲グループの分解処理について説明する図である。It is a figure explaining the decomposition | disassembly process of a music group.

本発明の実施の形態について図面を参照して詳細に説明する。本実施形態では、複数のグループに分類するコンテンツとして楽曲データを例に説明するが、本発明は、楽曲データに限られず、映像データ、画像データ、ソフトウェア、その他のコンテンツを分類対象とすることができる。
＜第１実施形態＞ Embodiments of the present invention will be described in detail with reference to the drawings. In this embodiment, music data is described as an example of content classified into a plurality of groups. However, the present invention is not limited to music data, and video data, image data, software, and other content may be classified. it can.
<First Embodiment>

まず、本発明の第１実施形態について説明する。図１は、本発明のコンテンツ分類装置の第１実施形態である楽曲分類装置の構成を示すブロック図である。本図に示すように、楽曲分類装置１００は、ユーザインタフェース部１１０、楽曲グループ数設定部１２０、楽曲グループ作成部１３０、格納部１４０を備えて構成される。 First, a first embodiment of the present invention will be described. FIG. 1 is a block diagram showing a configuration of a music classification device which is a first embodiment of the content classification device of the present invention. As shown in the figure, the music classification apparatus 100 includes a user interface unit 110, a music group number setting unit 120, a music group creation unit 130, and a storage unit 140.

楽曲分類装置１００は、ＣＰＵ、メモリ、インタフェース、入出力装置等を備えたパーソナルコンピュータ等の一般的な情報処理装置を用いて構成することができる。すなわち、情報処理装置は、上述の各機能ブロックを実現するためのコンピュータプログラムを実行することにより、楽曲分類装置１００として機能することができる。ただし、楽曲分類装置１００は、専用のハードウェアを用いて構成するようにしてもよい。 The music classification apparatus 100 can be configured using a general information processing apparatus such as a personal computer equipped with a CPU, a memory, an interface, an input / output device, and the like. That is, the information processing apparatus can function as the music classification apparatus 100 by executing a computer program for realizing the above-described functional blocks. However, the music classification apparatus 100 may be configured using dedicated hardware.

楽曲分類装置１００は、例えば、図２に示すように外部記憶装置２００、再生装置３００と回線４０を介して接続され、使用される。外部記憶装置２００には、楽曲データ格納部２１０が構成され、多数の楽曲データが格納されている。楽曲データ格納部２１０は、データベース機能を備えており、デジタルコンテンツである楽曲データ本体と、その識別子である楽曲ＩＤを管理している。楽曲分類装置１００は、楽曲データ格納部２１０に楽曲データが格納された楽曲を分類対象とする。 The music classification device 100 is used by being connected to an external storage device 200 and a playback device 300 via a line 40 as shown in FIG. The external storage device 200 includes a music data storage unit 210 and stores a large number of music data. The music data storage unit 210 has a database function, and manages a music data body that is digital content and a music ID that is an identifier thereof. The music classification apparatus 100 sets the music whose music data is stored in the music data storage unit 210 as a classification target.

再生装置３００は、外部記憶装置２００の楽曲データ格納部２１０に格納された楽曲データを再生することができる音響装置である。回線４０は、バスやＡＴＡＰＩ、ＬＡＮ等の通信回線である。再生装置３００は、楽曲分類装置１００によって生成された分類情報にしたがって、外部記憶装置２００に格納された楽曲データから、どの楽曲データを再生するかを決定して再生する。なお、楽曲分類装置１００、外部記憶装置２００、再生装置３００は、それぞれ独立した機器として構成してもよいし、これらの機能をまとめた機器として構成してもよい。 The playback device 300 is an acoustic device that can play back music data stored in the music data storage unit 210 of the external storage device 200. The line 40 is a communication line such as a bus, ATAPI, or LAN. The playback device 300 determines and plays back which song data is to be played from the song data stored in the external storage device 200 in accordance with the classification information generated by the song classification device 100. Note that the music classification device 100, the external storage device 200, and the playback device 300 may be configured as independent devices, or may be configured as devices that combine these functions.

図１において、格納部１４０は、ハードディスクやメモリ等の高速アクセスが可能な記憶領域であり、特徴量ベクトル格納部１４１と楽曲グループ格納部１４２とが構成されている。 In FIG. 1, the storage unit 140 is a storage area such as a hard disk or a memory that can be accessed at high speed, and includes a feature vector storage unit 141 and a music group storage unit 142.

特徴量ベクトル格納部１４１は、外部記憶装置２００の楽曲データ格納部２１０に格納された楽曲データ毎の特徴量ベクトルを格納する。特徴量ベクトルは、楽曲の特徴を示す複数の項目について、それぞれの項目を指標化した値を要素とするベクトルである。 The feature vector storage unit 141 stores a feature vector for each piece of music data stored in the music data storage unit 210 of the external storage device 200. The feature vector is a vector whose elements are values obtained by indexing each item of a plurality of items indicating the features of the music.

楽曲の特徴を示す項目は、例えば、平均音圧、低域平均音圧、テンポ、拍子等とすることができる。これらの項目の値は、人為的に設定したり、外部からデータを取得してもよいし、あるいは、特許文献２に記載されたような方法で楽曲の音響信号に基づいて数値化することもできる。各項目の値は、音圧、テンポ等の値をそのまま用いるのではなく、所定の範囲に正規化されるように指標化して特徴量ベクトル格納部１４１に格納することが望ましい。また、特徴量は、楽曲から受ける印象や、楽器編成、ジャンル等を数値化したものであってもよい。 The items indicating the characteristics of the music can be, for example, average sound pressure, low frequency average sound pressure, tempo, time signature, and the like. The values of these items may be set artificially, data may be acquired from the outside, or may be quantified based on the acoustic signal of the music by a method as described in Patent Document 2. it can. It is desirable that the value of each item is stored in the feature vector storage unit 141 after being indexed so as to be normalized to a predetermined range, rather than using values such as sound pressure and tempo as they are. The feature amount may be a numerical value of an impression received from a music piece, musical instrument organization, genre, and the like.

図３は、特徴量ベクトル格納部１４１の格納形態の一例を示している。本図の例では、楽曲ＩＤと複数の項目を有する特徴量ベクトルとが対応付けられている。また、楽曲ＩＤにより、分類対象の楽曲が、外部記憶装置２００の楽曲データ格納部２１０に格納された楽曲データと関連付けられている。 FIG. 3 shows an example of the storage form of the feature vector storage unit 141. In the example of this figure, music ID and the feature-value vector which has several items are matched. Further, the music to be classified is associated with the music data stored in the music data storage unit 210 of the external storage device 200 by the music ID.

楽曲グループ格納部１４２は、楽曲グループを識別する識別子である楽曲グループＩＤと、その楽曲グループに属する１つ以上の楽曲の楽曲ＩＤとを対応付けて格納する。図４は、楽曲グループ格納部１４２の格納形態の一例を示す図である。本図の例では、楽曲グループＩＤが「Ｇｒｏｕｐ１」である楽曲グループに、「楽曲ＩＤ１」「楽曲ＩＤ３」の楽曲が属しており、楽曲グループＩＤが「Ｇｒｏｕｐ２」である楽曲グループに、「楽曲ＩＤ２」「楽曲ＩＤ４」「楽曲ＩＤ６」の楽曲が属していることを示している。なお、楽曲グループ格納部１４２は、グループ作成処理において随時更新される。また、楽曲グループ格納部１４２は、楽曲分類処理においてその楽曲グループが結合済か否かを示す結合済フラグを有している。 The song group storage unit 142 stores a song group ID, which is an identifier for identifying a song group, and a song ID of one or more songs belonging to the song group in association with each other. FIG. 4 is a diagram illustrating an example of a storage form of the music group storage unit 142. In the example shown in the figure, the music group having the music group ID “Group 1” belongs to the music group “Music ID 1” and “Music ID 3”, and the music group ID “Group 2” is assigned to the music group “Music ID 2”. "Music ID 4" and "Music ID 6" indicate that the music belongs. The music group storage unit 142 is updated as needed in the group creation process. The music group storage unit 142 also has a combined flag indicating whether or not the music group has been combined in the music classification process.

ユーザインタフェース部１１０は、マウス、キーボード、モニタ等の入出力装置を介して、ユーザからの操作を受け付けたり、ユーザに情報を提供する機能部である。 The user interface unit 110 is a functional unit that receives an operation from a user or provides information to the user via an input / output device such as a mouse, a keyboard, or a monitor.

楽曲グループ数設定部１２０は、ユーザインタフェース部１１０を介したユーザからの操作に基づいて、作成する楽曲グループ数を設定する。第１実施形態においては、分類対象の楽曲数に応じて、作成可能な楽曲グループ数の候補をユーザに提示し、ユーザが候補の楽曲グループ数の中から選択することで、作成する楽曲グループ数が設定される。すなわち、第１実施形態においては、作成される楽曲グループ数に一定の制約が課せられている。 The music group number setting unit 120 sets the number of music groups to be created based on an operation from the user via the user interface unit 110. In the first embodiment, the number of song groups that can be created is presented to the user according to the number of songs to be classified, and the number of song groups to be created is selected by the user from the candidate song group numbers. Is set. In other words, in the first embodiment, certain restrictions are imposed on the number of music groups to be created.

楽曲グループ作成部１３０は、特徴量ベクトル格納部１４１に特徴量ベクトルが格納された楽曲を、楽曲グループ数設定部１２０が設定した数の楽曲グループに分類する。このとき、各楽曲グループには、ほぼ同じ数の楽曲が含まれるように分類処理を行なう。 The music group creation unit 130 classifies the music whose feature vector is stored in the feature vector storage unit 141 into the number of music groups set by the music group number setting unit 120. At this time, classification processing is performed so that each music group includes approximately the same number of music.

楽曲グループ作成部１３０は、距離算出部１３１、楽曲グループ結合部１３２、代表値算出部１３３を備えている。代表値算出部１３３は、楽曲グループの代表値を算出する。ここでは、代表値は、特徴量ベクトルの項目毎に、楽曲グループに属している楽曲の指標値を平均して得られる重心を用いるものとする。距離算出部１３１は、楽曲グループ間で代表値同士の距離を算出する。代表値として重心を用いた場合は、両重心が示す座標間のユークリッド距離を算出する。楽曲グループ結合部１３２は、距離が近い２つの楽曲グループ同士を結合して新たな楽曲グループを作成する。 The music group creation unit 130 includes a distance calculation unit 131, a music group combination unit 132, and a representative value calculation unit 133. The representative value calculation unit 133 calculates a representative value of the music group. Here, as the representative value, the center of gravity obtained by averaging the index values of the music pieces belonging to the music group is used for each item of the feature vector. The distance calculation unit 131 calculates the distance between representative values between music groups. When the centroid is used as the representative value, the Euclidean distance between the coordinates indicated by both centroids is calculated. The music group combining unit 132 combines two music groups that are close to each other to create a new music group.

次に、第１実施形態における楽曲分類装置１００の動作について図５のフローチャートを参照して説明する。まず、楽曲グループ数設定部１２０が、格納部１４０の特徴量ベクトル格納部１４１を参照して、分類対象の楽曲数を取得する（Ｓ１１）。分類対象の楽曲数は、特徴量ベクトル格納部１４１に格納されている楽曲ＩＤの数をカウントすることにより取得することができる。 Next, the operation of the music classification apparatus 100 in the first embodiment will be described with reference to the flowchart of FIG. First, the music group number setting unit 120 refers to the feature vector storage unit 141 of the storage unit 140 and acquires the number of music pieces to be classified (S11). The number of songs to be classified can be acquired by counting the number of song IDs stored in the feature vector storage unit 141.

そして、取得した楽曲数に基づいて作成可能なグループ数を算出する（Ｓ１２）。第１実施形態においては、距離の近い未結合の２つの楽曲グループ同士を結合していくため、分類対象の楽曲数をＮとすると、作成可能な楽曲グループ数Ｇｎは、［数１］で求められる値となる。ここで、ｎは０以上の整数である。
Then, the number of groups that can be created is calculated based on the acquired number of songs (S12). In the first embodiment, two unjoined music groups that are close to each other are joined together. Therefore, if the number of songs to be classified is N, the number Gn of song groups that can be created is obtained by [Equation 1]. Value. Here, n is an integer of 0 or more.

具体的な数値例を示すと、分類対象の楽曲数Ｎを４０とすると、楽曲グループ数Ｇｎの取り得る値は、［数１］より、Ｇｎ＝｛４０，２０，１０，５，２，１，０｝となる。 As a specific numerical example, assuming that the number N of music pieces to be classified is 40, the possible value of the music group number Gn is Gn = {40, 20, 10, 5, 2, 1 from [Equation 1]. , 0}.

作成可能な楽曲グループ数を算出すると、楽曲グループ数設定部１２０は、ユーザインタフェース部１１０を介して、ユーザから楽曲グループ数の指定を受け付ける（Ｓ１３）。楽曲グループ数の指定は、算出された作成可能な楽曲グループ数をユーザに提示して、その中から選択を受け付けるようにする。 When the number of music groups that can be created is calculated, the music group number setting unit 120 receives designation of the number of music groups from the user via the user interface unit 110 (S13). For the designation of the number of music groups, the calculated number of music groups that can be created is presented to the user, and a selection is accepted from these.

図６（ａ）は、作成可能なグループ数を提示して、選択を受け付けるための画面例を示している。ここでは、楽曲グループ作成数として意味を持たない１と０とを除いて提示している。ユーザは、本画面において希望する楽曲グループ数にチェックを入力することで、作成する楽曲グループ数を指定することができる。 FIG. 6A shows an example of a screen for presenting the number of groups that can be created and accepting selection. Here, 1 and 0, which have no meaning as the number of music group creations, are excluded. The user can designate the number of music groups to be created by inputting a check for the desired number of music groups on this screen.

なお、作成する楽曲グループ数の選択に代えて、楽曲グループに含める最小楽曲数の選択をユーザから受け付けることで、間接的に楽曲グループ数の指定を受け付けるようにしてもよい。第１実施形態において、各楽曲グループに分類される最小の数ｇｎは、楽曲グループ数をＧｎ、分類対象の楽曲数をＮとすると、［数２］で求めることができる。
Note that instead of selecting the number of music groups to be created, the user may receive a selection of the number of music groups indirectly by receiving selection of the minimum number of music pieces to be included in the music group from the user. In the first embodiment, the minimum number gn classified into each music group can be obtained by [Equation 2] where Gn is the number of music groups and N is the number of music to be classified.

すなわち、各楽曲グループに分類される最小の楽曲数ｇｎは、２の冪乗であるので、２の冪乗の値のみが選択可能であるようにユーザに提示する。このとき、分類対象の楽曲数の半数以下である最小楽曲数を候補として提示するようにする。図６（ｂ）は、各楽曲グループに分類される最小の楽曲数の候補を提示して、選択を受け付けるための画面例を示している。本画面で、各楽曲グループに分類される最小の楽曲数ｇｎが選択されると、楽曲グループ数設定部１２０は、［数３］にしたがって、作成する楽曲グループ数Ｇｎを設定する。
That is, since the minimum number of music pieces gn classified into each music group is a power of 2, it is presented to the user so that only a power value of 2 can be selected. At this time, the minimum number of music pieces that is equal to or less than half of the number of music pieces to be classified is presented as a candidate. FIG. 6B shows an example of a screen for presenting a candidate for the minimum number of music pieces classified into each music group and accepting the selection. When the minimum number of songs gn classified into each song group is selected on this screen, the song group number setting unit 120 sets the number of song groups Gn to be created according to [Equation 3].

グループ数の指定を受け付けると、楽曲グループ作成部１３０が、特徴量ベクトル格納部１４１に格納された特徴量ベクトルに基づいて、指定された楽曲グループ数に楽曲を分類する（Ｓ１４）。 When the designation of the number of groups is received, the music group creation unit 130 classifies the music into the designated number of music groups based on the feature vector stored in the feature vector storage unit 141 (S14).

ここで、楽曲分類の概要について具体例を用いて説明する。図７は、楽曲分類の概要を説明するための図であり、８つの楽曲を２つの楽曲グループに分類する場合を例にしている。説明を簡単にするため、特徴量ベクトルは、項目が２つの２次元ベクトルであるとする。 Here, an outline of music classification will be described using a specific example. FIG. 7 is a diagram for explaining the outline of the music classification, and an example in which eight music pieces are classified into two music groups. In order to simplify the description, it is assumed that the feature vector is a two-dimensional vector having two items.

図７（ａ）は、分類対象の８つの楽曲について、２次元の特徴量ベクトルを２次元座標空間に配置したものであり、白丸が各楽曲の特徴量ベクトルの座標を示している。まず、楽曲グループ作成部１３０は、楽曲グループ格納部１４２に、楽曲と同数の楽曲グループを作成し、それぞれの楽曲グループに１つずつ楽曲に対応した楽曲ＩＤを格納する。この場合、楽曲グループの代表値である重心は、特徴量ベクトルの座標と等しい。 FIG. 7 (a) shows the eight music pieces to be classified in which two-dimensional feature vectors are arranged in a two-dimensional coordinate space, and white circles indicate the coordinates of the feature vector of each song. First, the music group creation unit 130 creates the same number of music groups as the music in the music group storage unit 142, and stores a music ID corresponding to the music for each music group. In this case, the center of gravity, which is a representative value of the music group, is equal to the coordinates of the feature vector.

次に、楽曲グループ作成部１３０の楽曲グループ結合部１３２は、代表値である重心のユークリッド距離の近い楽曲グループ同士でペアを作り、そのペアを結合する処理を行なう。図７（ｂ）は、ユークリッド距離の近い楽曲グループ同士でペアを作成した様子を示したものである。点線枠は、楽曲グループを示したものであり、１つの楽曲グループは１つの楽曲を含んでいる。実線で結ばれたものが、楽曲グループのペアを示している。 Next, the music group combining unit 132 of the music group creating unit 130 forms a pair of music groups having a Euclidean distance of the center of gravity, which is a representative value, and performs processing for combining the pairs. FIG. 7B shows a state in which a pair is created between music groups having a short Euclidean distance. A dotted line frame indicates a music group, and one music group includes one music. What is connected with a solid line indicates a pair of music groups.

そして、楽曲グループのペアを、新たな１つの楽曲グループとして結合し、楽曲グループ格納部１４２を更新する。このとき、結合された２つの楽曲グループは、楽曲グループ格納部１４２から削除する。図７（ｃ）は、楽曲グループのペアを結合して、新たな１つの楽曲グループとした様子を示している。また、図７（ｃ）の黒丸は、楽曲グループの代表値である重心を示している。 Then, the music group pair is combined as a new music group, and the music group storage unit 142 is updated. At this time, the combined two music groups are deleted from the music group storage unit 142. FIG. 7C shows a state in which a pair of music groups is combined into a new music group. Further, the black circle in FIG. 7C indicates the center of gravity which is a representative value of the music group.

次に、楽曲グループ作成部１３０の楽曲グループ結合部１３２は、再度、代表値である重心の距離が近い楽曲グループ同士でペアを作り、そのペアを結合する処理を行なう。図８（ｄ）は、重心間のユークリッド距離の近い楽曲グループ同士でペアを作成した様子を示している。楽曲グループの重心である黒丸を結ぶ実線は、楽曲グループのペアを示している。これらのペアを結合して、再び新たな楽曲グループを作成し、結合された２つの楽曲グループを削除して楽曲グループ格納部１４２を更新する。楽曲グループが結合された結果、楽曲グループ数は２となり、作成する楽曲グループ数と同数になるため、楽曲グループ作成部１３０は、結合処理を終了する。このとき２つの楽曲グループには、４つずつ同数の楽曲が含まれ、偏りが生じない。 Next, the music group combining unit 132 of the music group creating unit 130 again forms a pair of music groups having a distance of the center of gravity, which is a representative value, and performs a process of combining the pairs. FIG. 8D shows a state in which a pair is created between music groups having a short Euclidean distance between the centroids. A solid line connecting the black circles, which are the center of gravity of the music group, indicates a pair of music groups. These pairs are combined to create a new music group again, delete the two combined music groups, and update the music group storage unit 142. As a result of combining the music groups, the number of music groups becomes 2, which is the same as the number of music groups to be created, so the music group creation unit 130 ends the combining process. At this time, the two music groups include the same number of music pieces by four, and there is no bias.

以上、楽曲グループ作成部１３０が８つの楽曲を２つの楽曲グループに分類する処理の概要を、具体例を用いて説明した。このように、本実施形態では、楽曲グループ作成部１３０が、代表値である重心のユークリッド距離の近い楽曲グループを結合する処理を繰り返して最終的な楽曲グループを作成するので、それぞれの楽曲グループには、［数２］で算出される数以上の楽曲が含められることになる。このため、楽曲グループ間の楽曲数の偏りを小さくすることができる。また、ユークリッド距離の近い２つの楽曲グループ同士を結合していくため、楽曲グループには、特徴量が似た楽曲が集まるようになる。 The outline of the process in which the music group creation unit 130 classifies eight music pieces into two music groups has been described above using a specific example. As described above, in the present embodiment, the music group creation unit 130 creates a final music group by repeating the process of combining the music groups with the Euclidean distance of the center of gravity, which is a representative value. Will include more than the number of songs calculated in [Equation 2]. For this reason, the deviation of the number of music between music groups can be reduced. In addition, since two music groups having a short Euclidean distance are joined together, songs having similar feature quantities gather in the music group.

次に、楽曲グループ作成部１３０が行なうステップ（Ｓ１４）の楽曲分類処理の詳細な手順について図８のフローチャートを参照して説明する。以下では、Ｎ個の楽曲が分類対象であり、作成する楽曲グループ数としてＧｎが指定されているものとする。 Next, the detailed procedure of the music classification process of step (S14) performed by the music group creation unit 130 will be described with reference to the flowchart of FIG. In the following, it is assumed that N pieces of music are to be classified and Gn is designated as the number of music groups to be created.

楽曲分類処理では、まず初期楽曲グループを作成する（Ｓ１４１１）。初期楽曲グループは、上述のように分類対象の楽曲数Ｎと同数の楽曲グループを楽曲グループ格納部１４２に作成し、それぞれの楽曲グループに１つずつ楽曲に対応した楽曲ＩＤを格納する。このため、初期楽曲グループが作成された時点で、楽曲グループ数Ｍは、楽曲数Ｎと等しい。 In the music classification process, first, an initial music group is created (S1411). As described above, the initial music group creates the same number of music groups as the number N of music to be classified in the music group storage unit 142, and stores the music ID corresponding to the music one by one in each music group. For this reason, the number of music groups M is equal to the number of music N when the initial music group is created.

そして、楽曲グループ作成部１３０は、作成された楽曲グループ数Ｍが、指定された楽曲グループ数Ｇｎと等しくなるまで、以下に説明するグループ結合処理を繰り返し、作成された楽曲グループ数Ｍが、指定された楽曲グループ数Ｇｎと等しくなると、処理を終了する（Ｓ１４１２）。 Then, the music group creation unit 130 repeats the group combining process described below until the created music group number M is equal to the designated music group number Gn, and the created music group number M is designated. When it becomes equal to the number of music groups Gn, the processing is terminated (S1412).

グループ結合処理では、まず、各楽曲グループの代表値である重心を計算する（Ｓ１４１３）。重心は、楽曲グループに属する楽曲毎の特徴量ベクトルに対して、ベクトル要素毎に平均を算出したものであり、特徴量ベクトルと同一の次元数を有するベクトルである。重心の算出は、代表値算出部１３３が行なう。 In the group joining process, first, the center of gravity, which is a representative value of each music group, is calculated (S1413). The center of gravity is a vector obtained by calculating an average for each vector element with respect to a feature quantity vector for each piece of music belonging to the music group, and is a vector having the same number of dimensions as the feature quantity vector. The representative value calculation unit 133 calculates the center of gravity.

代表値算出部１３３は、楽曲グループ格納部１４２から、重心を算出する楽曲グループについて、その楽曲グループに属するすべての楽曲ＩＤを取得し、さらに、取得した楽曲ＩＤの特徴量ベクトルを、特徴量ベクトル格納部１４１からそれぞれ取得して、重心を計算する。ｊ番目の楽曲グループに属する楽曲ＩＤの個数をＫ、ｉ番目の楽曲ＩＤの特徴量ベクトルをＦｋとすると、ｊ番目の楽曲グループの重心Ｇｊは、以下の［数４］で求めることができる。
The representative value calculation unit 133 acquires all the song IDs belonging to the song group for the song group for which the center of gravity is calculated from the song group storage unit 142, and further obtains the feature quantity vector of the obtained song ID as the feature quantity vector. Each is acquired from the storage unit 141 and the center of gravity is calculated. If the number of music IDs belonging to the j-th music group is K and the feature vector of the i-th music ID is Fk, the center Gj of the j-th music group can be obtained by the following [Equation 4].

ここで、ある楽曲グループに、３つの楽曲ＩＤが登録されている場合に、その楽曲グループの重心を求める具体例を示す。特徴量ベクトルは２次元であって、それぞれＦ１＝｛１，５｝、Ｆ２＝｛２，１｝、Ｆ３＝｛６，３｝とし、求める重心をＧ０とする。このとき、重心Ｇ０は、［数４］より、Ｇ０＝｛１＋２＋６，５＋１＋３｝／３＝｛９／３，９／３｝＝｛３，３｝と求められる。図９は、Ｆ１、Ｆ２、Ｆ３、およびＧ０を、それぞれ２次元座標上に示した図である。本図では、白丸が各特徴量ベクトルの座標位置であり、黒丸が代表値である重心を示している。なお、分類において重視したい項目に重みを付けて重心を求めるようにしてもよい。 Here, a specific example in which the center of gravity of a music group is obtained when three music IDs are registered in a music group. The feature vector is two-dimensional, and F1 = {1,5}, F2 = {2,1}, F3 = {6,3}, respectively, and the calculated center of gravity is G0. At this time, the center of gravity G0 is obtained from [Equation 4] as G0 = {1 + 2 + 6, 5 + 1 + 3} / 3 = {9/3, 9/3} = {3, 3}. FIG. 9 is a diagram showing F1, F2, F3, and G0 on two-dimensional coordinates, respectively. In the figure, white circles indicate the coordinate positions of the feature vectors, and black circles indicate the center of gravity, which is a representative value. Note that the center of gravity may be obtained by weighting items to be emphasized in classification.

次に、楽曲グループ作成部１３０は、ループ処理「ループＡ」を開始する（Ｓ１４１４）。ループＡは、開始値１のループカウンタｋが、ループを繰り返す毎に１ずつ増加し、終了値Ｍになるまで、終端ステップである（Ｓ１４２０）までの処理を繰り返す。終了値Ｍは、そのループ開始時における楽曲グループ格納部１４２に格納された楽曲グループの数である。 Next, the music group creation unit 130 starts the loop process “Loop A” (S1414). In the loop A, the loop counter k having the start value 1 is incremented by 1 every time the loop is repeated, and the process up to the termination step (S1420) is repeated until the loop counter k reaches the end value M. The end value M is the number of music groups stored in the music group storage unit 142 at the start of the loop.

ループＡ内において、楽曲グループ作成部１３０は、楽曲グループ格納部１４２に格納されているｋ番目の楽曲グループ（以降、「楽曲グループｋ」と称する）が、既に結合処理済みかどうかを、結合済フラグを参照して判断する（Ｓ１４１５）。結合処理済であるか、既に削除されている場合は（Ｓ１４１５：Ｙｅｓ）、ループＡの終端ステップ（Ｓ１４２０）に飛び、次のループに進む。 In the loop A, the music group creation unit 130 determines whether the k-th music group stored in the music group storage unit 142 (hereinafter referred to as “music group k”) has already been combined. The determination is made with reference to the flag (S1415). If it has been combined or has already been deleted (S1415: Yes), the process jumps to the loop A termination step (S1420) and proceeds to the next loop.

まだ結合処理を行なっていない場合には（Ｓ１４１５：Ｎｏ）、楽曲グループ作成部１３０の距離算出部１３１が、他の未結合の楽曲グループを対象に、それぞれの楽曲グループとの距離を計算する（Ｓ１４１６）。楽曲グループとの距離は、上述のように楽曲グループの代表値である重心同士のユークリッド距離とすることができる。ただし、マハラノビスの汎距離等の他の距離を用いるようにしてもよい。 If the joining process has not yet been performed (S1415: No), the distance calculation unit 131 of the song group creation unit 130 calculates the distance to each song group for other unjoined song groups ( S1416). As described above, the distance to the music group can be the Euclidean distance between the centroids that are representative values of the music group. However, other distances such as Mahalanobis general distance may be used.

楽曲グループｋと他の未結合の楽曲グループとの距離をそれぞれ計算すると、楽曲グループ作成部１３０は、計算された距離のうち最短距離である未結合の楽曲グループを選択し、結合対象として特定する（Ｓ１４１７）。結合対象となった楽曲グループを「楽曲グループｓ」と称する。 When the distances between the music group k and the other uncoupled music groups are calculated, the music group creation unit 130 selects the uncoupled music group that is the shortest distance from the calculated distances and identifies it as a combination target. (S1417). The music group to be combined is referred to as “music group s”.

次に、楽曲グループ作成部１３０の楽曲グループ結合部１３２が、楽曲グループｋと、楽曲グループｓとを結合し、新しい楽曲グループを作成する（Ｓ１４１８）。新しい楽曲グループには、結合元の２つの楽曲グループに属していたすべての楽曲ＩＤを割り当てる。 Next, the music group combining unit 132 of the music group creating unit 130 combines the music group k and the music group s to create a new music group (S1418). All music IDs that belonged to the two music groups of the combination source are assigned to the new music group.

具体例を示すと、楽曲グループｋに属する楽曲ＩＤが、｛楽曲ＩＤｋ１，楽曲ＩＤｋ２｝の２曲で、結合対象となった楽曲グループｓに属する楽曲ＩＤが、｛楽曲ＩＤｓ１，楽曲ＩＤｓ２｝の２曲である場合、結合後の新しい楽曲グループに属する楽曲ＩＤは、｛楽曲ＩＤｓ１，楽曲ＩＤｓ２，楽曲ＩＤｋ１，楽曲ＩＤｋ２｝の４曲となる。 As a specific example, two music IDs belonging to the music group k are {music ID k1, music ID k2}, and two music IDs belonging to the music group s to be combined are {music IDs1, music IDs2}. If it is a song, the song IDs belonging to the new song group after the combination are four songs of {song IDs1, song IDs2, song IDk1, song IDk2}.

そして、楽曲グループ結合部１３２は、楽曲グループ格納部１４２から、楽曲グループｋと、楽曲グループｓのデータを削除し、新たに作成した楽曲グループのデータを格納する。このとき、新たに作成した楽曲グループに対して、結合済みかどうかを判別する結合済フラグが結合済を示すように更新する。（Ｓ１４１９）。 Then, the music group combination unit 132 deletes the data of the music group k and the music group s from the music group storage unit 142 and stores the data of the newly created music group. At this time, the newly created music group is updated so that the combined flag for determining whether or not it has been combined indicates combined. (S1419).

ループＡの終端ステップにおいて、楽曲グループ作成部１３０は、ループカウンタｋを１増加させ、ループＡを繰り返す（Ｓ１４２０）。このとき、ループカウンタｋがＭに達していたならば、ループＡを終了する。 In the end step of Loop A, the music group creation unit 130 increments the loop counter k by 1 and repeats Loop A (S1420). At this time, if the loop counter k has reached M, the loop A is terminated.

ループＡの終了後、楽曲グループ結合部１３２は、結合済フラグが結合済を示していない楽曲グループを、その楽曲グループから最も近い距離にある楽曲グループと結合させる（Ｓ１４２１）。すなわち、結合対象とならなかった、あるいは結合対象が残っていなかった１つの楽曲グループについて、（Ｓ１４１６）の距離算出処理と、（Ｓ１４１７）の結合対象特定処理と、（Ｓ１４１８）の楽曲グループ結合処理を行なう。このとき、結合対象となる楽曲グループは、結合フラグが結合済を示している楽曲グループであってもよい。 After the end of Loop A, the music group combining unit 132 combines a music group whose combined flag does not indicate combined with a music group that is closest to the music group (S1421). That is, with respect to one music group that has not become a join target or a join target does not remain, a distance calculation process in (S1416), a join target specifying process in (S1417), and a song group join process in (S1418) To do. At this time, the music group to be combined may be a music group whose combined flag indicates combined.

図１０は、このときの処理を具体的に説明する図である。図１０（ａ）は、３つの楽曲グループを示している。３つの楽曲グループの楽曲グループＩＤはそれぞれ楽曲グループα、楽曲グループβ、楽曲グループγであり、このうち、楽曲グループαと楽曲グループβは結合処理が行なわれ、楽曲グループγは結合処理を行なっていないものとする。 FIG. 10 is a diagram for specifically explaining the processing at this time. FIG. 10A shows three music groups. The music group IDs of the three music groups are the music group α, the music group β, and the music group γ, respectively. Of these, the music group α and the music group β are combined, and the music group γ is performing the combining process. Make it not exist.

楽曲グループγは結合処理を行なっていないため、楽曲グループαか、楽曲グループβのどちらかと結合させる。そこで、楽曲グループγの重心との距離が近い重心を有する楽曲グループが楽曲グループβであった場合、楽曲グループ結合部１３２は、楽曲グループγと、楽曲グループβとの結合処理を行なう。図１０（ｂ）は、楽曲グループγと楽曲グループβとを結合し、新たに楽曲グループδを作成した結果を示している。 Since the music group γ has not been combined, it is combined with either the music group α or the music group β. Therefore, when the music group having the center of gravity close to the center of the music group γ is the music group β, the music group combining unit 132 performs a process of combining the music group γ and the music group β. FIG. 10B shows a result of newly creating a music group δ by combining the music group γ and the music group β.

この処理により、すべての楽曲グループが他のいずれかの楽曲グループと結合され、新たなＭ個の楽曲グループが作成されたため、新たな楽曲グループすべての結合済フラグを解除する（Ｓ１４２２）。 As a result of this processing, all the music groups are combined with any other music group, and new M music groups have been created, so the combined flags of all the new music groups are canceled (S1422).

以上の処理を、楽曲グループ数Ｍが、指定された楽曲グループ数Ｇｎになるまで繰り返すようにする（Ｓ１４１２）。これにより、Ｎ個の楽曲が、偏ることなくＧｎ個のグループに分類されることになる。 The above processing is repeated until the music group number M reaches the designated music group number Gn (S1412). Thereby, the N pieces of music are classified into Gn groups without being biased.

なお、距離算出部１３１で算出する楽曲グループ間の距離について、楽曲グループの重心間の距離ではなく、異なる楽曲グループに属する楽曲間の最短距離を、楽曲グループ間の距離としてもよい。図１１を参照して、この方法による楽曲グループ間の距離の算出についての具体例を説明する。図１１の２つの点線枠は楽曲グループであり、２つの楽曲グループに、それぞれ白丸で示されている２つの楽曲ＩＤが属しているものとする。 In addition, about the distance between the music groups calculated by the distance calculation unit 131, the shortest distance between music pieces belonging to different music groups may be used as the distance between the music groups instead of the distance between the centroids of the music groups. With reference to FIG. 11, a specific example of calculating the distance between music groups by this method will be described. Two dotted frames in FIG. 11 are music groups, and two music IDs indicated by white circles belong to the two music groups, respectively.

この場合、異なる楽曲グループに属する楽曲ＩＤ間の距離を考えると、４つの距離を算出することができる。すなわち、図中の距離Ｄ１，距離Ｄ２，距離Ｄ３，距離Ｄ４である。本例では、これらの４つの距離のうち、距離の最も短いものを、楽曲グループ間の距離とする。例えば、距離Ｄ２が最も距離が短い場合、距離Ｄ２を求める楽曲グループ間の距離とする。 In this case, considering the distance between music IDs belonging to different music groups, four distances can be calculated. That is, distance D1, distance D2, distance D3, and distance D4 in the figure. In this example, the shortest distance among these four distances is set as the distance between music groups. For example, when the distance D2 is the shortest, the distance D2 is the distance between music groups for which the distance D2 is obtained.

また、特徴量ベクトル格納部１４１に格納された特徴量ベクトルは、主成分分析などの手法を用いることで、特徴量ベクトルの要素を要約し、特徴量ベクトルの次元数を減らすようにしてもよい。 In addition, the feature quantity vector stored in the feature quantity vector storage unit 141 may be summarized by summarizing the elements of the feature quantity vector and reducing the number of dimensions of the feature quantity vector by using a method such as principal component analysis. .

また、作成した楽曲グループから、さらに、決まった曲順で再生を行なうためのプレイリストを作成してもよい。例えば、プレイリストとする楽曲グループを決定し、その楽曲グループに属する楽曲をプレイリストの曲目とし、曲順は、楽曲グループの重心から近い特徴量ベクトルを有する楽曲順に決定するといった処理を行なうことで、プレイリストを作成することができる。もちろん、プレイリストの曲順はランダムに決定してもよい。 Further, a playlist for reproduction in a predetermined song order may be created from the created song group. For example, by determining a music group to be a playlist, setting the music belonging to the music group as the music of the playlist, and determining the music order in the order of music having a feature vector close to the center of the music group. You can create a playlist. Of course, the order of songs in the playlist may be determined randomly.

以上説明したように、第１実施形態の楽曲分類装置１００によれば、未結合の２つの楽曲グループ同士を結合していくことで新たな楽曲グループを作成するため、楽曲グループに属する楽曲数の偏りが小さい楽曲グループを作成することができる。このとき、楽曲グループに属する楽曲の特徴量ベクトルに基づいて求められた距離の近い楽曲グループ同士を結合するため、音楽的特徴の似通った楽曲グループを作成することができる。
＜第２実施形態＞ As described above, according to the music classification apparatus 100 of the first embodiment, since a new music group is created by joining two unjoined music groups, the number of songs belonging to the music group A music group with a small bias can be created. At this time, the music groups having similar musical characteristics can be created because the music groups having a short distance obtained based on the feature vector of the music belonging to the music group are combined.
Second Embodiment

次に、本発明の第２実施形態について説明する。第１実施形態における楽曲分類装置１００では、作成できる楽曲グループ数に、［数１］に示したような制約があったが、第２実施形態における楽曲分類装置は、任意の数の楽曲グループを作成でき、なおかつ各楽曲グループに属する楽曲の数も、大きく偏らないように楽曲グループを作成することができる。 Next, a second embodiment of the present invention will be described. In the music classification apparatus 100 according to the first embodiment, the number of music groups that can be created is limited as shown in [Equation 1]. However, the music classification apparatus according to the second embodiment uses an arbitrary number of music groups. A music group can be created so that the number of music pieces that can be created and that belong to each music group is not largely biased.

図１２は、本発明の第２実施形態に係る楽曲分類装置の構成を示すブロック図である。第１実施形態と同じブロックについては同じ符号を付し、説明を省略する。本図に示すように第２実施形態に係る楽曲分類装置１００ａは、ユーザインタフェース部１１０、楽曲グループ数設定部１２０ａ、楽曲グループ作成部１３０ａ、格納部１４０を備えて構成される。 FIG. 12 is a block diagram showing a configuration of a music classification apparatus according to the second embodiment of the present invention. The same blocks as those in the first embodiment are denoted by the same reference numerals, and description thereof is omitted. As shown in the figure, the music classification apparatus 100a according to the second embodiment includes a user interface unit 110, a music group number setting unit 120a, a music group creation unit 130a, and a storage unit 140.

楽曲グループ数設定部１２０ａは、ユーザインタフェース部１１０を介したユーザからの操作に基づいて、作成する楽曲グループ数を設定する。第２実施形態においては、ユーザから任意の楽曲グループ数の指定を受け付けることで、作成する楽曲グループ数が設定される。もちろん、楽曲グループ数は、分類対象の楽曲の数を超えないように設定する。 The music group number setting unit 120 a sets the number of music groups to be created based on an operation from the user via the user interface unit 110. In the second embodiment, the number of music groups to be created is set by accepting designation of an arbitrary number of music groups from the user. Of course, the number of music groups is set so as not to exceed the number of music to be classified.

楽曲グループ作成部１３０ａは、第１実施形態の楽曲グループ作成部１３０と同じブロックである距離算出部１３１、楽曲グループ結合部１３２、代表値算出部１３３に加え、結合した楽曲グループを分解する楽曲グループ分解部１３４を備えている。 The music group creation unit 130a adds to the distance calculation unit 131, the music group combination unit 132, and the representative value calculation unit 133, which are the same blocks as the music group creation unit 130 of the first embodiment, and the music group that decomposes the combined music group A decomposition unit 134 is provided.

次に、第２実施形態における楽曲グループ作成部１３０ａが行なう楽曲分類処理の詳細な手順について図１３のフローチャートを参照して説明する。以下では、Ｎ個の楽曲が分類対象であり、作成する楽曲グループ数としてＩが指定されているものとする。上述のように、楽曲グループ数Ｉは任意の数とすることができる。このため、図６（ａ）の楽曲グループ数選択画面に代えて、楽曲グループ数を直接入力する画面を用いてユーザからの指定を受け付けるものとする。 Next, the detailed procedure of the music classification process performed by the music group creation unit 130a in the second embodiment will be described with reference to the flowchart of FIG. In the following, it is assumed that N pieces of music are to be classified, and I is designated as the number of music groups to be created. As described above, the number of music groups I can be an arbitrary number. For this reason, it replaces with the music group number selection screen of Fig.6 (a), and the designation | designated from a user shall be received using the screen which inputs a music group number directly.

第２実施形態においても、楽曲分類処理では、まず初期楽曲グループを作成する（Ｓ１４３１）。初期楽曲グループは、第１実施形態と同様に、分類対象の楽曲数Ｎと同数の楽曲グループを楽曲グループ格納部１４２に作成し、それぞれの楽曲グループに１つずつ楽曲に対応した楽曲ＩＤを格納する。 Also in the second embodiment, in the music classification process, first, an initial music group is created (S1431). As in the first embodiment, the initial music group creates the same number of music groups as the number N of music to be classified in the music group storage unit 142, and stores the music ID corresponding to the music one by one in each music group. To do.

ただし、第２実施形態では、楽曲数と作成する楽曲グループ数との制約がないため、初期状態として、楽曲数Ｎと同数の楽曲グループを作成するのに限られず、楽曲数Ｎよりも少ない数の楽曲グループを作成するようにしてもよい。この場合、初期状態において、複数の楽曲ＩＤを格納した楽曲グループが作成されることになる。例えば、ユーザが意図的に特定の楽曲を同じ楽曲グループに含めたい場合には、初期状態において同じ楽曲グループにその楽曲の楽曲ＩＤを格納しておけばよい。 However, in the second embodiment, since there is no restriction between the number of songs and the number of song groups to be created, it is not limited to creating the same number of song groups as the number of songs N as an initial state. You may make it produce the music group of. In this case, in an initial state, a music group storing a plurality of music IDs is created. For example, when a user intentionally wants to include a specific song in the same song group, the song ID of that song may be stored in the same song group in the initial state.

そして、結合カウンタｃに０を設定する（Ｓ１４３２）。ここで、結合カウンタｃは、一連の楽曲グループ結合処理を行なった回数を示す変数であり、０以上の整数である。初期状態では、一連の楽曲グループ結合処理はまだ行なわれていないため、０が設定される。 Then, 0 is set to the combination counter c (S1432). Here, the combination counter c is a variable indicating the number of times a series of music group combination processing has been performed, and is an integer of 0 or more. In the initial state, since a series of music group combination processing has not yet been performed, 0 is set.

次に、楽曲グループ作成部１３０ａは、結合カウンタｃが、所定の条件を満たすまで、以下に説明するグループ結合処理を繰り返し、結合カウンタｃが、所定の条件を満たすと、（Ｓ１４４５）の楽曲グループ分解処理に進む（Ｓ１４１２）。ここで、所定の条件は、［数５］に示す条件である。
Next, the music group creation unit 130a repeats the group combining process described below until the combination counter c satisfies a predetermined condition, and when the combination counter c satisfies the predetermined condition, the music group of (S1445). The process proceeds to decomposition processing (S1412). Here, the predetermined condition is the condition shown in [Formula 5].

［数５］において、右辺は、一連の楽曲グループ結合処理をｃ＋１回行なった場合の楽曲グループ数を示している。すなわち、一連の楽曲グループ結合処理をもう一回行なった場合に作成される楽曲グループ数であり、［数５］の条件は、楽曲グループ数Ｍが、作成する楽曲グループ数Ｉを下回る１回手前まで一連の楽曲グループ結合処理を繰り返すことを意味している。 In [Formula 5], the right side indicates the number of music groups when a series of music group combination processing is performed c + 1 times. That is, it is the number of song groups created when the series of song group combination processing is performed once more, and the condition of [Equation 5] is one time before the number of song groups M is less than the number of song groups I to be created. This means that a series of music group combination processing is repeated.

なお、分類対象の楽曲数Ｎが１００である場合、一連の楽曲グループ結合処理が０回のとき、楽曲グループ数は１００となり、一連の楽曲グループ結合処理が１回のとき、楽曲グループ数は５０となり、一連の楽曲グループ結合処理が２回のとき、楽曲グループ数は２５となり、一連の楽曲グループ結合処理が３回のとき、楽曲グループ数は１２となっていく。 When the number N of songs to be classified is 100, the number of music groups is 100 when the series of music group combination processing is 0, and the number of music groups is 50 when the series of music group combination processing is 1 time. Thus, when the series of music group combination processing is two times, the number of music groups is 25, and when the series of music group combination processing is three times, the number of music groups is 12.

例えば、作成する楽曲グループ数Ｉが２０であり、分類対象の楽曲数Ｎが１００である場合、右辺は、ｃ＝０，１，２の順に、５０，２５，１２となる。ｃ＝２、つまり一連の楽曲グループ結合処理を２回行なうと、Ｉ＝２０＞１２となり、［数５］が成り立つことになる。 For example, when the number I of music groups to be created is 20 and the number N of music pieces to be classified is 100, the right side is 50, 25, 12 in the order of c = 0, 1, 2. When c = 2, that is, when a series of music group combination processing is performed twice, I = 20> 12, and [Equation 5] is established.

第２実施形態のグループ結合処理においても、各楽曲グループの代表値である重心を計算し（Ｓ１４３４）、楽曲グループ作成部１３０ａが、ループ処理「ループＡ」を開始する（Ｓ１４３５）。 Also in the group joining process of the second embodiment, the center of gravity, which is the representative value of each music group, is calculated (S1434), and the music group creation unit 130a starts the loop process “loop A” (S1435).

ループＡ内において、楽曲グループ作成部１３０ａは、楽曲グループｋが既に結合処理済みかどうかを判断する（Ｓ１４３６）。結合処理済であるか、既に削除されている場合は（Ｓ１４３６：Ｙｅｓ）、ループＡの終端ステップ（Ｓ１４４１）に飛び、次のループに進む。 In the loop A, the music group creation unit 130a determines whether or not the music group k has already been combined (S1436). If it has been combined or has been deleted (S1436: Yes), the process jumps to the loop A termination step (S1441) and proceeds to the next loop.

まだ結合処理を行なっていない場合には（Ｓ１４３６：Ｎｏ）、楽曲グループ作成部１３０ａの距離算出部１３１が、他の未結合の楽曲グループを対象に、それぞれの楽曲グループとの距離を計算する（Ｓ１４３７）。楽曲グループｋと他の未結合の楽曲グループとの距離をそれぞれ計算すると、楽曲グループ作成部１３０ａは、最短距離である未結合の楽曲グループｓを選択し、結合対象として特定する（Ｓ１４３８）。 If the joining process has not yet been performed (S1436: No), the distance calculation unit 131 of the song group creation unit 130a calculates the distance to each song group for other unjoined song groups ( S1437). When the distance between the music group k and another uncoupled music group is calculated, the music group creation unit 130a selects the uncoupled music group s that is the shortest distance and identifies it as a combination target (S1438).

次に、楽曲グループ作成部１３０ａの楽曲グループ結合部１３２が、楽曲グループｋと、楽曲グループｓとを結合し、新しい楽曲グループを作成する（Ｓ１４３９）そして、楽曲グループ格納部１４２から、楽曲グループｋと、楽曲グループｓのデータを削除し、新たに作成した楽曲グループのデータを格納する。このとき、新たに作成した楽曲グループには、結合済みかどうかを判別する結合済フラグが結合済を示すように更新する。（Ｓ１４４０）。 Next, the song group combining unit 132 of the song group creating unit 130a combines the song group k and the song group s to create a new song group (S1439). From the song group storage unit 142, the song group k Then, the music group s data is deleted and the newly created music group data is stored. At this time, the newly created music group is updated so that a combined flag for determining whether or not it has been combined indicates combined. (S1440).

ループＡの終端ステップにおいて、楽曲グループ作成部１３０は、ループカウンタｋを１増加させ、ループＡを繰り返す（Ｓ１４４１）。このとき、ループカウンタｋがＭに達していたならば、ループＡを終了する。ループＡの終了後、楽曲グループ結合部１３２は、結合済フラグが結合済を示していない楽曲グループを、その楽曲グループから最も近い距離にある楽曲グループと結合させる（Ｓ１４４２）。 In the end step of Loop A, the music group creation unit 130 increments the loop counter k by 1 and repeats Loop A (S1441). At this time, if the loop counter k has reached M, the loop A is terminated. After the end of Loop A, the music group combining unit 132 combines a music group whose combined flag does not indicate combined with a music group that is closest to the music group (S1442).

この処理により、すべての楽曲グループが他のいずれかの楽曲グループと結合され、新たなＭ個の楽曲グループが作成されたため、新たな楽曲グループすべての結合済フラグを解除する（Ｓ１４４３）。そして、結合カウンタｃを１増加させて（Ｓ１４４４）、所定条件を満たすか否かの判定（Ｓ１４３３）に戻る。 As a result of this processing, all music groups are combined with any other music group, and new M music groups have been created, so the combined flag for all new music groups is canceled (S1443). Then, the combination counter c is incremented by 1 (S1444), and the process returns to the determination (S1433) as to whether or not a predetermined condition is satisfied.

上述のように、結合カウンタｃが所定の条件を満たすと（Ｓ１４１２：Ｙｅｓ）、（Ｓ１４４５）の楽曲グループ分解処理に進む。図１４のフローチャートを参照して、楽曲グループ作成部１３０ａの楽曲グループ分解部１３４が行なう楽曲グループ分解処理について説明する。 As described above, when the combination counter c satisfies the predetermined condition (S1412: Yes), the process proceeds to the music group disassembly process of (S1445). With reference to the flowchart of FIG. 14, the music group decomposition process performed by the music group decomposition unit 134 of the music group creation unit 130a will be described.

楽曲グループ分解処理は、作成される楽曲グループの数が、指定された楽曲グループ数Ｉと等しくなるように、必要数の楽曲グループを削除し、削除された楽曲グループに属していた楽曲ＩＤを、他の楽曲グループに再登録する処理である。 The song group disassembly process deletes the required number of song groups so that the number of song groups to be created is equal to the designated song group number I, and the song IDs belonging to the deleted song groups are This is a process of re-registering with another music group.

まず、楽曲グループ分解部１３４は、一連の楽曲グループ結合処理で作成された楽曲グループ数Ｍから、指定された作成すべき楽曲グループ数Ｉを引いて、減少させる楽曲グループ数Ｌを設定する（Ｓ２０１）。 First, the music group disassembling unit 134 sets the music group number L to be decreased by subtracting the designated music group number I to be created from the music group number M created by a series of music group combination processes (S201). ).

次に、楽曲グループ格納部１４２に格納されているすべての楽曲グループについて、楽曲グループの代表値である重心を算出し（Ｓ２０２）、算出された重心とその楽曲グループに属する楽曲の特徴量ベクトルとの距離をそれぞれ求め、その平均距離（以下「楽曲平均距離」と称す）を算出する（Ｓ２０３）。 Next, for all the music groups stored in the music group storage unit 142, a centroid that is a representative value of the tune group is calculated (S202), and the calculated centroid and a feature vector of the music belonging to the tune group are calculated. Are calculated, and the average distance (hereinafter referred to as “music average distance”) is calculated (S203).

楽曲グループの重心の特徴量ベクトルを｛ｇ１，…，ｇｘ｝、楽曲グループに属するＫ個の楽曲データのうち、ｋ番目の楽曲データの特徴量ベクトルを｛ｆｋ１，・・・ｆｋｘ｝とすると、楽曲平均距離Ｄは、［数６］によって求めることができる。なお、［数６］で求められる楽曲平均距離はユークリッド距離であるが、他の距離、例えばマハラノビスの汎距離などを用いてもよい。
If the feature quantity vector of the center of gravity of the music group is {g1,..., Gx}, and the K music data belonging to the music group is {fk1,. The music average distance D can be obtained by [Equation 6]. In addition, although the music average distance calculated | required by [Equation 6] is Euclidean distance, you may use other distance, for example, the Mahalanobis generalized distance, etc.

次に、楽曲グループ分解部１３４は、楽曲平均距離の大きい順に、Ｌ個の楽曲グループを選択する（Ｓ２０４）。そして、選択されたＬ個の楽曲グループに属するそれぞれの楽曲を、選択されたＬ個以外の楽曲グループのうち、最も距離の近い楽曲グループに移動させる（Ｓ２０５）。すなわち、楽曲グループの移動先は、移動対象の楽曲に対応する特徴量ベクトルと、選択されたＬ個以外の楽曲グループの代表値である重心との距離をそれぞれ計算し、最も距離の近い楽曲グループとする。 Next, the music group decomposing unit 134 selects L music groups in descending order of the average music distance (S204). Then, the music pieces belonging to the selected L music groups are moved to the music group having the closest distance among the music groups other than the selected L music groups (S205). That is, the movement destination of the music group calculates the distance between the feature vector corresponding to the music to be moved and the center of gravity, which is the representative value of the selected music group other than L, and the music group with the closest distance And

なお、選択されたＬ個の楽曲グループに属する楽曲と、選択されたＬ個以外の楽曲グループとの距離の計算は、楽曲の移動前にすべて行なうようにしてもよいし、楽曲を移動させる度に、移動先の楽曲グループの代表値である重心を逐次更新するようにしてもよい。 The calculation of the distance between the music belonging to the selected L music groups and the music group other than the selected L music groups may be performed before the music is moved, or each time the music is moved. In addition, the center of gravity, which is the representative value of the destination music group, may be updated sequentially.

選択されたＬ個の楽曲グループに属する楽曲すべての移動を終えると、楽曲グループ分解部１３４は、選択されたＬ個の楽曲グループすべてを、楽曲グループ格納部１４２から削除して（Ｓ２０６）、楽曲グループ分解処理を終了する。 When the movement of all the songs belonging to the selected L music groups is finished, the music group disassembling unit 134 deletes all the selected L music groups from the music group storage unit 142 (S206), The group disassembly process ends.

ここで、楽曲グループ分解部１３４の処理について、作成する楽曲グループ数Ｉを４、分類対象の楽曲数Ｎを１０、一連の楽曲グループ結合処理で作成された楽曲グループ数Ｍを５とした場合の例を、図１５を参照して説明する。 Here, regarding the processing of the music group disassembling unit 134, the number of music groups I to be created is 4, the number of music N to be classified is 10, and the number of music groups M created by a series of music group combining processes is 5. An example will be described with reference to FIG.

図１５（ａ）は、一連の楽曲グループ結合処理後の状態を示している。図１５（ａ）の白丸は楽曲であり、それぞれ特徴量ベクトルに基づいて配置されている。点線枠は、楽曲グループを示しており、点線枠中のＡ，Ｂ，Ｃ，Ｄ，Ｅは、それぞれの楽曲グループの楽曲グループＩＤを示している。 FIG. 15A shows a state after a series of music group joining processes. The white circles in FIG. 15A are music pieces, which are arranged based on feature quantity vectors. A dotted line frame indicates a music group, and A, B, C, D, and E in the dotted line frame indicate a music group ID of each music group.

これら５個の楽曲グループから、楽曲平均距離の大きいものを、（Ｍ−Ｉ）個、すなわち（５−４）＝１個選択する。図１５（ｂ）は、楽曲グループＡが最も楽曲平均距離が大きかったとして、楽曲グループＡを分解対象楽曲グループとして選択した様子を示している。 From these five music groups, (M−I), that is, (5−4) = 1, are selected having a large music average distance. FIG. 15B shows a state in which the music group A is selected as the disassembly target music group, assuming that the music group A has the longest music average distance.

次に、楽曲グループＡに属している楽曲データを、別の楽曲グループに移動させる。図１５（ｃ）は、楽曲グループＡに属している２つの楽曲をａ１、ａ２とし、それぞれの楽曲から最も近い重心を有する楽曲グループを矢印で示している。すなわち、楽曲ａ１は楽曲グループＢと、楽曲ａ２は楽曲グループＣと近いことを示している。図１５（ｄ）は、楽曲グループＡを削除し、楽曲グループＢに楽曲ａ１を、楽曲グループＣに楽曲ａ２を移動させた最終的な楽曲グループを示している。 Next, the music data belonging to the music group A is moved to another music group. FIG. 15C shows two music pieces belonging to the music group A as a1 and a2, and the music group having the center of gravity closest to each music piece is indicated by an arrow. That is, the music a1 is close to the music group B and the music a2 is close to the music group C. FIG. 15D shows a final music group in which the music group A is deleted, the music a1 is moved to the music group B, and the music a2 is moved to the music group C.

以上説明したように、第２実施形態における楽曲分類装置１００ａでは、第１実施形態における楽曲分類装置１００の楽曲グループ間の楽曲数の偏りが小さくなるという効果に加え、任意の個数の楽曲グループを作成することができるという効果を奏する。 As described above, in the music classification device 100a according to the second embodiment, in addition to the effect that the deviation in the number of music between the music groups of the music classification device 100 according to the first embodiment is reduced, an arbitrary number of music groups is added. There is an effect that it can be created.

１００…楽曲分類装置、１００ａ…楽曲分類装置、１１０…ユーザインタフェース部、１２０…楽曲グループ数設定部、１２０ａ…楽曲グループ数設定部、１３０…楽曲グループ作成部、１３０ａ…楽曲グループ作成部、１３１…距離算出部、１３２…楽曲グループ結合部、１３３…代表値算出部、１３４…楽曲グループ分解部、１４０…格納部、１４１…特徴量ベクトル格納部、１４２…楽曲グループ格納部、２００…外部記憶装置、２１０…楽曲データ格納部、３００…再生装置 DESCRIPTION OF SYMBOLS 100 ... Music classification apparatus, 100a ... Music classification apparatus, 110 ... User interface part, 120 ... Music group number setting part, 120a ... Music group number setting part, 130 ... Music group creation part, 130a ... Music group creation part, 131 ... Distance calculation unit 132 ... Music group combination unit 133 133 Representative value calculation unit 134 ... Music group decomposition unit 140 140 Storage unit 141 Feature vector storage unit 142 Music group storage unit 200 External storage device 210 ... Music data storage unit 300 ... Playback device

Claims

In a content classification apparatus that groups a plurality of contents into a predetermined number of groups,
Based on the feature amount of the content belonging to the group from the state in which the indexed feature amount in each content to be classified is acquired and the plurality of contents are grouped into a larger number of groups than the predetermined number of groups A group creating unit that calculates a representative value of the feature amount of each group and performs a joining process for joining groups having similar representative values together into one group,
The content classification apparatus, wherein the group creation unit repeats the combining process until the number of groups reaches the predetermined number.

The feature amount is an indexed feature amount vector in each content to be classified,
The content classification apparatus according to claim 1, wherein the representative value of the group is a centroid of a feature vector of content belonging to the group.

The group creation unit
The content classification apparatus according to claim 2, wherein groups that are close to each other in the center of gravity are combined among unjoined groups.

The group creation unit
4. The method according to claim 1, wherein when one unjoined group remains, the group is joined with any joined group based on the representative value. Content classification device.

A candidate for the number of creation groups is calculated based on the number of contents to be classified, and further includes a group number setting unit that accepts selection of the number of creation groups that are candidates.
The content classification apparatus according to claim 1, wherein the predetermined number is the number of created groups received by the group number setting unit.

It further includes a group number setting unit that accepts designation of the number of created groups,
The content classification apparatus according to claim 1, wherein the predetermined number is larger than the designated number of created groups.

The group creation unit
After the number of groups reaches the predetermined number, a number of groups corresponding to the difference between the predetermined number and the number of created groups is selected, and each content belonging to the selected group is selected as one of the unselected groups. 7. The content classification apparatus according to claim 6, wherein distribution is performed.

In a content classification method for grouping a plurality of contents into a predetermined number of groups,
Obtaining indexed feature values for each content to be classified;
Calculating a representative value of a feature amount of each group based on a feature amount of the content belonging to the group from a state in which the plurality of contents are grouped into a larger number of groups than the predetermined number of groups;
A group creating step for performing a joining process for joining the groups having similar representative values into one group;
Have
In the group step, the combination process is repeated until the number of groups reaches the predetermined number.

In a content classification program for causing a computer to execute a process of grouping a plurality of contents into a predetermined number of groups,
On the computer,
Obtaining indexed feature values for each content to be classified;
Calculating a representative value of a feature amount of each group based on a feature amount of the content belonging to the group from a state in which the plurality of contents are grouped into a larger number of groups than the predetermined number of groups;
A group creating step for performing a joining process for joining the groups having similar representative values into one group;
And execute
The content grouping program characterized in that the group step is a step of repeating the combining process until the number of groups reaches the predetermined number.