JP2005107718A

JP2005107718A - Database retrieval device

Info

Publication number: JP2005107718A
Application number: JP2003338393A
Authority: JP
Inventors: Masahiro Yoshida; 昌弘吉田
Original assignee: Sanyo Electric Co Ltd
Current assignee: Sanyo Electric Co Ltd
Priority date: 2003-09-29
Filing date: 2003-09-29
Publication date: 2005-04-21

Abstract

<P>PROBLEM TO BE SOLVED: To provide a database retrieval device realizing retrieval which is much more suitable for an individual. <P>SOLUTION: The device is provided with a first means for inputting user information for specifying a classification pattern to a user in addition to an impression value at every sensitivity word pair at the time of retrieval, a second means calculating a sensitivity spatial coordinate value corresponding to the impression value at every sensitivity word pair, which are inputted by the first means, a third means calculating a spatial distance between the sensitivity spatial coordinate value calculated by the second means and the sensitivity spatial coordinate value registered in a database as an index, and a fourth means selecting a content whose spatial distance calculated by the third means is the shortest. The third means uses only index information corresponding to the classification pattern specified by user information inputted by the first means as an object of calculation of the spatial distance. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

この発明は、音楽、映像等のデータベース検索装置に関する。 The present invention relates to a database search apparatus for music, video and the like.

感性語によって音楽を検索するシステムが提案されている（「音楽感性空間を用いた感性語による音楽データベース検索システム」情報処理学会論文誌 Vol.42 No12 ２００１）。この従来の検索システムでは、データベース中の曲をマッピングするための検索空間をＳＤ法ならびに因子分析により生成する。また、データベースに新たな曲を登録する際には、ＧＡとニューラルネットワークにより構成された自動インデクシングシステムにより検索空間へのマッピングを行う。検索の際には、８つの感性語対の度合い（１〜７）をニューラルネットワークに入力することにより、ニューラルネットワークはそれらの入力に対応する感性空間中の座標を出力するので、検索システムはその出力座標値からユークリッド距離の最も近い曲から順番に検索候補としてユーザに提示を行う。 A system for searching music by sensitivity words has been proposed ("Music database search system by sensitivity words using music sensitivity space", Information Processing Society of Japan Vol.42 No12 2001). In this conventional search system, a search space for mapping songs in a database is generated by the SD method and factor analysis. In addition, when a new song is registered in the database, mapping to a search space is performed by an automatic indexing system constituted by a GA and a neural network. At the time of search, by inputting the degree (1-7) of eight sensitivity word pairs to the neural network, the neural network outputs the coordinates in the sensitivity space corresponding to those inputs. The search candidates are presented to the user in order from the song with the closest Euclidean distance from the output coordinate value.

なお、感性語対として、「明るい−暗い」、「重い−軽い」、「かたい−やわらかい」、「安定−不安定」、「澄んだ−濁った」、「滑らか−歯切れのよい」、「激しい−穏やか」および「厚い−薄い」の８種類が使用されている。 The sensitivity word pairs are “bright-dark”, “heavy-light”, “hard-soft”, “stable-unstable”, “clear-cloudy”, “smooth-crisp”, “ Eight types are used: severe-mild and thick-thin.

ところで、音楽、映像に対するユーザの感性も、性別、年齢層によって異なるものと考えられるが、上記従来の検索システムでは、この点が考慮されていない。
特開２００１−３０６５８０号公報「音楽感性空間を用いた感性語による音楽データベース検索システム」情報処理学会論文誌 Vol.42 No12 By the way, although it is thought that the user's sensitivity with respect to a music and an image also changes with sex and age groups, this point is not considered in the said conventional search system.
JP 2001-306580 A "Music database retrieval system by sensitivity word using music sensitivity space" IPSJ Journal Vol.42 No12

この発明は、より個人に適合した検索が可能となるデータベース検索装置を提供することを目的とする。 An object of the present invention is to provide a database search device that enables a search that is more personalized.

請求項１に記載の発明は、データベースに登録される各コンテンツ毎に、そのコンテンツに対して求められた感性空間座標値がそのコンテンツのインデックスとしてデータベースに登録されており、検索時にはユーザによって入力された各感性語対毎の印象値に基づいて、入力された各感性語対毎の印象値に対応する感性空間座標値が算出され、算出された感性空間座標値との空間的距離が最も短いコンテンツを検索するデータベース検索装置において、性別および／または年齢によって分類される分類パターン毎に、各コンテンツの感性空間座標値が求められて、インデックス情報としてデータべースに登録されており、検索時において、各感性語対毎の印象値の他に、分類パターンを特定するためのユーザ情報をユーザに入力させるための第１手段、第１手段によって入力された各感性語対毎の印象値に対応する感性空間座標値を算出する第２手段、第２手段によって算出された感性空間座標値とデータベースにインデックスとして登録されている感性空間座標値との間の空間距離の計算する第３手段、ならびに第３手段によって算出された空間距離が最も短いコンテンツを選択する第４手段を備えており、第３手段は、空間距離を計算する際に、第１手段によって入力されたユーザ情報によって特定される分類パターンに対応したインデックス情報のみを空間距離の算出対象として使用することを特徴とする。 According to the first aspect of the present invention, for each content registered in the database, the Kansei space coordinate value obtained for the content is registered in the database as an index of the content, and is input by the user during the search. Based on the impression value for each sensitivity word pair, the sensitivity space coordinate value corresponding to the input impression value for each sensitivity word pair is calculated, and the spatial distance from the calculated sensitivity space coordinate value is the shortest In a database search device for searching for content, the sensitivity space coordinate value of each content is obtained for each classification pattern classified by gender and / or age, and is registered in the database as index information. In order to make the user input user information for specifying the classification pattern in addition to the impression value for each sensitivity word pair The first means, the second means for calculating the sensitivity space coordinate value corresponding to the impression value for each sensitivity word pair input by the first means, and the sensitivity space coordinate value calculated by the second means and registered in the database as an index A third means for calculating a spatial distance between the sentimental spatial coordinate values, and a fourth means for selecting the content having the shortest spatial distance calculated by the third means. The third means comprises: When calculating the spatial distance, only index information corresponding to the classification pattern specified by the user information input by the first means is used as a spatial distance calculation target.

請求項２に記載の発明は、データベースに登録される各コンテンツ毎に、そのコンテンツに対して求められた感性空間座標値がそのコンテンツのインデックスとしてデータベースに登録されており、検索時にはユーザによって指定された感性空間座標値との空間的距離が最も短いコンテンツを検索するデータベース検索装置において、性別および／または年齢によって分類される分類パターン毎に、各コンテンツの感性空間座標値が求められて、インデックス情報としてデータべースに登録されており、検索時において、感性空間座標値の他に、分類パターンを特定するためのユーザ情報をユーザに入力させるための第１手段、第１手段によって入力された感性空間座標値とデータベースにインデックスとして登録されている感性空間座標値との間の空間距離の計算する第２手段、ならびに第２手段によって算出された空間距離が最も短いコンテンツを選択する第３手段を備えており、第２手段は、空間距離を計算する際に、第１手段によって入力されたユーザ情報によって特定される分類パターンに対応したインデックス情報のみを空間距離の算出対象として使用することを特徴とする。 According to the second aspect of the present invention, for each content registered in the database, the Kansei space coordinate value obtained for the content is registered in the database as an index of the content, and is specified by the user at the time of search. In a database search device that searches for content having the shortest spatial distance from the emotional space coordinate value, the emotional space coordinate value of each content is obtained for each classification pattern classified by sex and / or age, and index information is obtained. And is input by the first means and the first means for allowing the user to input user information for specifying the classification pattern in addition to the sensitivity space coordinate value at the time of the search. Kansei space coordinate values and Kansei space coordinate values registered as indexes in the database A second means for calculating the spatial distance between the second means, and a third means for selecting the content having the shortest spatial distance calculated by the second means. When the second means calculates the spatial distance, Only the index information corresponding to the classification pattern specified by the user information input by the first means is used as a spatial distance calculation target.

この発明によれば、より個人に適合した検索が可能となる。 According to the present invention, it is possible to search more suited to an individual.

以下、図面を参照して、この発明を音楽データベース検索システムに適用した場合の実施例について説明する。 DESCRIPTION OF THE PREFERRED EMBODIMENTS Embodiments of the present invention applied to a music database search system will be described below with reference to the drawings.

〔１〕音楽データベース検索システムの構成についての説明 [1] Explanation of the configuration of the music database search system

図１は、音楽データベース検索システムの構成を示している。 FIG. 1 shows the configuration of a music database search system.

音楽データベース検索システムは、音楽感性空間生成部１００と検索システム部２００とから構成されている。音楽感性空間生成部１００は、検索空間となる音楽感性空間を生成する。検索システム部２００は、入力された曲印象に適合した曲を音楽データベースから検索し、その曲を出力する。 The music database search system includes a music sensitivity space generation unit 100 and a search system unit 200. The music sensitivity space generation unit 100 generates a music sensitivity space as a search space. The search system unit 200 searches the music database for a song that matches the input song impression, and outputs the song.

〔２〕音楽感性空間生成部１００についての説明 [2] Description of the music sensitivity space generation unit 100

音楽感性空間生成部１００は、図１に示すように、聴取実験によって音楽感性空間を生成する初期音楽感性空間生成部１０１と、データベースに追加したい曲の音楽感性空間座標値を自動的に算出するための自動インデクシング部１０２とを備えている。 As shown in FIG. 1, the music sensitivity space generation unit 100 automatically calculates an initial music sensitivity space generation unit 101 that generates a music sensitivity space by a listening experiment, and music sensitivity space coordinate values of a song to be added to the database. And an automatic indexing unit 102.

〔２−１〕初期音楽感性空間生成部１０１についての説明 [2-1] Description of the initial music sensitivity space generation unit 101

初期音楽感性空間生成部１０１は、聴取実験によって得られた曲印象値を用いて因子分析を行う因子分析部１１１と、データベース部１１２とを備えている。 The initial music sensation space generation unit 101 includes a factor analysis unit 111 that performs factor analysis using a song impression value obtained by a listening experiment, and a database unit 112.

聴取実験について説明する。被験者として、３０歳未満の男性、３０歳以上の男性、３０歳未満の女性、３０歳以上の女性、各１０名に対して、ポップスを１００曲提示し、各感性語対に対して７段階評価を行ってもらった。 A listening experiment will be described. As subjects, 100 pops were presented to 10 males under 30 years old, males over 30 years old, females under 30 years old, females over 30 years old, each with 7 levels for each sensitivity word pair. We had you evaluate.

感性語対は、次の４種類とした。
（１）明るい−暗い
（２）重い−軽い
（３）安定−不安定
（４）力強い−弱々しい The following four types of sensitivity word pairs were used.
(1) bright-dark (2) heavy-light (3) stable-unstable (4) powerful-weak

この実験結果から、性別、年齢別の４グループ（３０歳未満の男性、３０歳以上の男性、３０歳未満の女性、３０歳以上の女性）毎に、各曲それぞれについて、各感性語対毎の曲印象値を算出した。各感性語対毎の曲印象値は、その感性語対の評価値の平均値で表される。 From this experimental result, for each song, for each sex word pair, for each group of 4 groups by sex and age (male under 30 years old, male over 30 years old, female under 30 years old, female over 30 years old). The song impression value was calculated. The music impression value for each sensitivity word pair is represented by an average value of the evaluation values of the sensitivity word pairs.

因子分析部１１１は、各グループ毎に、各曲の各感性語対毎の曲印象値を多変量とする因子分析を行い、各曲毎に、各因子を軸とした、曲印象の心理的な相関を表す空間（音楽感性空間）の座標データを得る。このように、音楽感性空間は、複数の因子軸を有する空間である。この例では、因子は３種類あり、因子軸も３種類あるものとする。 The factor analysis unit 111 performs, for each group, a factor analysis in which the song impression value for each sensitivity word pair of each song is multivariate, and for each song, the psychological impression of the song impression with each factor as an axis. Coordinate data of a space (musical sensation space) representing a strong correlation is obtained. Thus, the music sensitivity space is a space having a plurality of factor axes. In this example, it is assumed that there are three types of factors and three types of factor axes.

データベース部１１２には、各曲の楽曲データが登録されるとともに、各グループ毎に、各曲の感性空間中の座標データが各曲のインデックス情報として登録される。 In the database unit 112, music data of each music is registered, and coordinate data in the sensitivity space of each music is registered as index information of each music for each group.

〔２−２〕自動インデクシング部１０２についての説明 [2-2] Description of the automatic indexing unit 102

自動インデクシング部１０２は、新規登録曲を解析して、テンポ、リズム等の物理的特徴量を抽出する物理的特徴量抽出部１２１と、物理的特徴量抽出部１２１によって抽出された物理的特徴量を各感性語対毎の曲印象値に変換する物理的特徴量／曲印象値変換部１２２と、物理的特徴量／曲印象値変換部１２２によって得られた各感性語対毎の曲印象値を音楽感性空間座標値に変換する曲印象値／音楽感性空間座標変換部１２３とを備えている。 The automatic indexing unit 102 analyzes a newly registered song and extracts physical feature amounts such as tempo and rhythm, and a physical feature amount extracted by the physical feature amount extraction unit 121. Is converted into a song impression value for each sensitivity word pair, and a music impression value for each sensitivity word pair obtained by the physical feature value / music impression value conversion unit 122. Is converted into a music sensitivity space coordinate value, and a music impression space / music sensitivity space coordinate conversion unit 123 is provided.

自動インデクシング部１０２によって得られた新規登録曲の音楽感性空間座標値は、新規登録曲の曲データとともに、データベース部１１２に登録される。 The music sensitivity space coordinate value of the newly registered song obtained by the automatic indexing unit 102 is registered in the database unit 112 together with the song data of the newly registered song.

図２は、自動インデクシング部１０２の構成をより具体的に示している。 FIG. 2 shows the configuration of the automatic indexing unit 102 more specifically.

物理的特徴量／曲印象値変換部１２２は、４種類の感性語対それぞれに対応する回帰分析部１２２ａ〜１２２ｄから構成されている。曲印象値／音楽感性空間座標変換部１２３は、ニューラルネットワークから構成されている。 The physical feature quantity / musical impression value conversion unit 122 includes regression analysis units 122a to 122d corresponding to four types of sensitivity word pairs. The music impression value / musical sensitivity space coordinate conversion unit 123 includes a neural network.

物理的特徴量／曲印象値変換部１２２内の各回帰分析部１２２ａ〜１２２ｄには、それぞれに対応する感性語対に関係した複数の物理的特徴量が入力される。各回帰分析部１２２ａ〜１２２ｄに入力される物理特徴量の種類は予め定められている。 A plurality of physical feature quantities related to the corresponding sensitivity word pairs are input to the regression analysis units 122 a to 122 d in the physical feature quantity / musical impression value conversion unit 122. The types of physical feature values input to the regression analysis units 122a to 122d are determined in advance.

各回帰分析部１２２ａ〜１２２ｄは、次式（１）に基づいて、物理的特徴量を対応する感性語対の曲印象値に変換する。 Each regression analysis part 122a-122d converts a physical feature-value into the music impression value of the corresponding sensitivity word pair based on following Formula (1).

上記式（１）において、ＯＵＴは、当該回帰分析部の出力値であり、当該回帰分析部に対応する感性語対に関する曲印象値の推定値を表している。Ｉｎ〔ｋ〕は、当該回帰分析部に入力するｋ番目の物理的特徴量を表している。Ｗ〔ｋ〕は、Ｉｎ〔ｋ〕に対する重み係数である。Ｗ〔０〕は、重み係数である。 In the above equation (1), OUT is an output value of the regression analysis unit, and represents an estimated value of the music impression value related to the sensitivity word pair corresponding to the regression analysis unit. In [k] represents the k-th physical feature amount input to the regression analysis unit. W [k] is a weighting coefficient for In [k]. W [0] is a weighting coefficient.

上記式（１）の各重み係数Ｗ〔ｋ〕，Ｗ〔０〕は、上述した聴取実験結果と、それに用いられた各曲の物理的特徴量に基づいて、性別、年齢別の４グループ毎に予め求められている。１つのグループに対する重み係数Ｗ〔ｋ〕，Ｗ〔０〕は、次のようにして決定される。当該グループに対する聴取実験結果、つまり、各曲毎に求められた感性語対毎の曲印象値を教師信号（出力値ＯＵＴ）とする。また、各曲毎に物理的特徴量抽出部１２１によって抽出された物理的特徴量を入力値Ｉｎとする。そして、全曲それぞれに対する回帰分析結果が、その曲に対する教師信号に最も近い値となるような重み係数Ｗ〔ｋ〕，Ｗ〔０〕を算出する。 The weighting factors W [k] and W [0] in the above formula (1) are based on the results of the above-mentioned listening experiment and the physical features of each song used for each group of 4 groups by sex and age. Is required in advance. The weighting factors W [k] and W [0] for one group are determined as follows. A listening experiment result for the group, that is, a song impression value for each sensitivity word pair obtained for each song is used as a teacher signal (output value OUT). Further, the physical feature amount extracted by the physical feature amount extraction unit 121 for each song is set as the input value In. Then, the weighting coefficients W [k] and W [0] are calculated so that the regression analysis results for all the songs become values closest to the teacher signal for the songs.

曲印象値／音楽感性空間座標変換部１２３を構成するニューラルネットワークの学習について説明する。ニューラルネットワークは、性別、年齢別の４グループ毎に学習が行われ、性別、年齢別の４グループ毎に内部状態が決定されている。学習時の入力信号は、上述した聴取実験結果、つまり、各曲毎に求められた感性語対毎の曲印象値である。学習時の教師信号は、因子分析部１１１によって得られた各曲の音楽感性空間座標値（各因子軸の座標値）である。 Learning of the neural network constituting the music impression value / musical sensitivity space coordinate conversion unit 123 will be described. In the neural network, learning is performed for every four groups according to sex and age, and the internal state is determined for every four groups according to sex and age. The input signal at the time of learning is the result of the above-described listening experiment, that is, the music impression value for each sensitivity word pair obtained for each music. The teacher signal at the time of learning is a music sensitivity space coordinate value (coordinate value of each factor axis) of each song obtained by the factor analysis unit 111.

〔３〕検索システム部２００についての説明 [3] Description of the search system unit 200

検索システム部２００は、図１に示すように、ユーザ入力部２０１、曲印象値／音楽感性空間座標変換部２０２、検索部２０３および検索結果出力部２０４とからなる。 As shown in FIG. 1, the search system unit 200 includes a user input unit 201, a song impression value / musical sensitivity space coordinate conversion unit 202, a search unit 203, and a search result output unit 204.

ユーザは、ユーザ入力部２０１に対して、性別、年齢等のユーザデータ、検索したい曲に関する各感性語対毎の曲印象値（７段階）、各感性語対の重要度情報等を入力する。ユーザ入力部２０１に入力された性別、年齢等のユーザデータおよび選択したい曲に関する各感性語対毎の曲印象値は、曲印象値／音楽感性空間座標変換部２０２に与えられる。曲印象値／音楽感性空間座標変換部２０２としては、自動インデクシング部１０２内の曲印象値／音楽感性空間座標変換部１２３を構成するニューラルネットワークがそのまま用いられる。 The user inputs user data such as gender and age, song impression values for each sensitivity word pair (seven levels), importance level information of each sensitivity word pair, and the like on the user input unit 201. User data such as sex and age input to the user input unit 201 and the song impression value for each sensitivity word pair regarding the song to be selected are given to the song impression value / music sensitivity space coordinate conversion unit 202. As the music impression value / musical sensitivity space coordinate conversion unit 202, a neural network constituting the music impression value / musical sensitivity space coordinate conversion unit 123 in the automatic indexing unit 102 is used as it is.

曲印象値／音楽感性空間座標変換部２０２は、ニューラルネットワークの内部状態をユーザ入力部２０１から与えられたユーザデータに対応するグループの内部状態に設定した後、ユーザ入力部２０１から与えられた各感性語対毎の曲印象値をニューラルネットワークに入力することにより、それに対応する音楽感性空間座標値を取得する。曲印象値／音楽感性空間座標変換部２０２は、取得した音楽感性空間座標値を検索部２０３に与える。 The music impression value / musical sensitivity space coordinate conversion unit 202 sets the internal state of the neural network to the internal state of the group corresponding to the user data given from the user input unit 201, and then each of the given values given from the user input unit 201. The music impression space coordinate value corresponding to the sensitivity word pair is acquired by inputting the music impression value for each sensitivity word pair to the neural network. The music impression value / music sensitivity space coordinate conversion unit 202 gives the acquired music sensitivity space coordinate value to the search unit 203.

検索部２０３には、ユーザ入力部２０１に入力された各感性語対の重要度情報も与えられる。まず、各感性語対の重要度情報が与えられない場合の、検索部２０３の処理について説明する。 The search unit 203 is also given importance level information of each sensitivity word pair input to the user input unit 201. First, the processing of the search unit 203 when importance level information of each sensitivity word pair is not given will be described.

データベース部１１２に登録されている曲の、ユーザデータで指定されるグループに対応する音楽感性空間座標値を（Ｘ１〔ｉ〕，Ｘ２〔ｉ〕，Ｘ３〔ｉ〕）とする。ｉは曲番号である。また、ユーザが入力した選択したい曲に関する各感性語対毎の曲印象値が、曲印象値／音楽感性空間座標変換部２０２によって音楽感性空間座標に変換された値を、（ａ１，ａ２，ａ３）とする。 The music sensitivity space coordinate values corresponding to the group specified by the user data of the music registered in the database unit 112 are defined as (X1 [i], X2 [i], X3 [i]). i is a song number. Also, values obtained by converting the music impression value for each sensitivity word pair input by the user into music sensitivity space coordinates by the music impression value / music sensitivity space coordinate conversion unit 202 are (a1, a2, a3). ).

検索部２０３は、次式（２）に基づいて、各登録曲毎に空間距離Ｌを算出し、この空間距離Ｌが最も小さい曲を選択する。検索結果出力部２０４は、検索部２０３によって選択された曲の曲データをデータベース部１１２から読み出して出力する。 The search unit 203 calculates a spatial distance L for each registered song based on the following formula (2), and selects a song having the smallest spatial distance L. The search result output unit 204 reads the song data of the song selected by the search unit 203 from the database unit 112 and outputs it.

L ＝（X1〔ｉ〕−a1）²＋（X2〔ｉ〕−a2）²＋（X3〔ｉ〕−a3）² …（２） L = (X1 [i] -a1) ² + (X2 [i] -a2) ² + (X3 [i] -a3) ² (2)

次に、各感性語対の重要度情報が与えられる場合の、検索部２０３の処理について説明する。 Next, processing of the search unit 203 when importance level information of each sensitivity word pair is given will be described.

ユーザは、４種類の感性語対（明るい−暗い，重い−軽い，安定−不安定，力強い−弱々しい）に対する重要度を考慮して、各感性語対に対する重要度情報を入力することが可能である。各感性語対（明るい−暗い，重い−軽い，安定−不安定，力強い−弱々しい）の重要度情報を｛ｐ１，ｐ２，ｐ３，ｐ４｝で表すことにする。ｐ_n（ｎ＝１，２，３，４）は、１≦ｐ_n≦４の範囲の値であり、１が最も重要度が高く、数値が大きくなるほど重要度が低くなる。全ての感性語対を同等に扱う場合には、全ての感性語対に対する重要度情報を１に設定する。 The user can input importance information for each sensitivity word pair in consideration of the importance for the four types of sensitivity word pairs (bright-dark, heavy-light, stable-unstable, strong-weak). Is possible. The importance level information of each sensitivity word pair (bright-dark, heavy-light, stable-unstable, strong-weak) is represented by {p1, p2, p3, p4}. p _n (n = 1, 2, 3, 4) is a value in the range of 1 ≦ p _n ≦ 4, with 1 being the most important, and the greater the numerical value, the lower the importance. When all sensitivity word pairs are handled equally, importance level information for all sensitivity word pairs is set to 1.

表１は、各感性語対の各因子（因子１，因子２，因子３）への寄与率（因子負荷）を示している。 Table 1 shows the contribution rate (factor load) of each sensitivity word pair to each factor (factor 1, factor 2, factor 3).

各感性語対の各因子（因子１，因子２，因子３）への寄与率が表１に示すような場合には、各因子（因子１，因子２，因子３）の重みＷ１，Ｗ２，Ｗ３を次式（３）に基づいて算出する。なお、各因子（因子１，因子２，因子３）の重みＷ１，Ｗ２，Ｗ３は、予め算出されて、検索部２０３に保持されている。 When the contribution rate of each sensitivity word pair to each factor (factor 1, factor 2, factor 3) is as shown in Table 1, the weights W1, W2, and weights of each factor (factor 1, factor 2, factor 3) W3 is calculated based on the following equation (3). The weights W1, W2, and W3 of each factor (factor 1, factor 2, factor 3) are calculated in advance and held in the search unit 203.

W1＝(0.14 ＋0.17＋0.96＋0.34)
÷(0.14 ×p1＋0.17×p2＋0.96×p3＋0.34×p4)
W2＝(0.69 ＋0.01＋0.12＋0.91)
÷(0.69 ×p1＋0.01×p2＋0.12×p3＋0.91×p4)
W3＝( 0.64 ＋0.96＋0.00＋0.01)
÷( 0.64×p1＋0.96×p2＋0.00×p3＋0.01×p4) …（３） W1 = (0.14 +0.17 +0.96 +0.34)
÷ (0.14 x p1 + 0.17 x p2 + 0.96 x p3 + 0.34 x p4)
W2 = (0.69 +0.01 +0.12 +0.91)
÷ (0.69 x p1 + 0.01 x p2 + 0.12 x p3 + 0.91 x p4)
W3 = (0.64 + 0.96 + 0.00 + 0.01)
÷ (0.64 x p1 + 0.96 x p2 + 0.00 x p3 + 0.01 x p4) (3)

なお、各因子（因子１，因子２，因子３）の重みＷ１，Ｗ２，Ｗ３を次式（４）に基づいて算出してもよい。 The weights W1, W2, and W3 of each factor (factor 1, factor 2, factor 3) may be calculated based on the following equation (4).

W1＝ 0.14 ×(1/p1)＋0.17×(1/p2)＋0.96×(1/p3)＋0.34×(1/p4)
W2＝ 0.69 ×(1/p1)＋0.01×(1/p2)＋0.12×(1/p3)＋0.91×(1/p4)
W3＝ 0.64 ×(1/p1)＋0.96×(1/p2)＋0.00×(1/p3)＋0.01×(1/p4) …（４） W1 = 0.14 x (1 / p1) + 0.17 x (1 / p2) + 0.96 x (1 / p3) + 0.34 x (1 / p4)
W2 = 0.69 x (1 / p1) + 0.01 x (1 / p2) + 0.12 x (1 / p3) + 0.91 x (1 / p4)
W3 = 0.64 x (1 / p1) + 0.96 x (1 / p2) + 0.00 x (1 / p3) + 0.01 x (1 / p4)… (4)

検索部２０３は、次式（５）に基づいて、各登録曲毎に空間距離Ｌを算出し、この空間距離Ｌが最も小さい曲を選択する。検索結果出力部２０４は、検索部２０３によって選択された曲の曲データをデータベース部１１２から読み出して出力する。 The search unit 203 calculates a spatial distance L for each registered song based on the following equation (5), and selects a song having the smallest spatial distance L. The search result output unit 204 reads the song data of the song selected by the search unit 203 from the database unit 112 and outputs it.

L ＝W1×（X1〔ｉ〕−a1）²＋W2×（X2〔ｉ〕−a2）²＋W3×（X3〔ｉ〕−a3）² …（５） L = W1 × (X1 [i] −a1) ² + W2 × (X2 [i] −a2) ² + W3 × (X3 [i] −a3) ² (5)

なお、検索部２０３は、次式（６）に基づいて、各登録曲毎に空間距離Ｌを算出し、この空間距離Ｌが最も小さい曲を選択するようにしてもよい。 The search unit 203 may calculate the spatial distance L for each registered song based on the following equation (6), and may select a song having the smallest spatial distance L.

L ＝W1×α1 ×（X1〔ｉ〕−a1）²＋W2×α2 ×（X2〔ｉ〕−a2）²
＋W3×α3 ×（X3〔ｉ〕−a3）² …（６） L = W1 × α1 × (X1 [i] −a1) ² + W2 × α2 × (X2 [i] −a2) ²
+ W3 × α3 × (X3 [i] −a3) ² (6)

上記式（６）においては、各因子には、上記重みＷ１，Ｗ２，Ｗ３の他に、重みα１，α２，α３が付加されている。αｎ（ただし、ｎ＝１，２，３）は、Ｘｎ〔ｉ〕（ただし、ｎ＝１，２，３）とａn （ただし、ｎ＝１，２，３）との符号によって、ｋ１またはｋ２を取る。ｋ１≧ｋ２＞０である。 In the above equation (6), weights α1, α2, and α3 are added to each factor in addition to the weights W1, W2, and W3. αn (where n = 1, 2, 3) is k1 or k2 depending on the sign of Xn [i] (where n = 1, 2, 3) and an (where n = 1, 2, 3). I take the. k1 ≧ k2> 0.

つまり、Ｘｎ〔ｉ〕とａn とが異符号の場合には（Ｘｎ〔ｉ〕とａn とが因子軸の原点を挟んで互いに反対側に存在する場合には）、それらの距離をより大きくするために、αｎ＝ｋ１とされる。Ｘｎ〔ｉ〕とａn とが同符号の場合には、αｎ＝ｋ２とされる。 That is, when Xn [i] and an have different signs (when Xn [i] and an are on opposite sides of the origin of the factor axis), the distance between them is increased. Therefore, αn = k1. When Xn [i] and an have the same sign, αn = k2.

上記実施例では、性別、年齢別によって分けられた複数のグループ毎に、各登録曲の音楽感性空間座標が求められているが、性別、年齢別、音楽ジャンル別によって分けられた複数のパターン毎に、各登録曲の音楽感性空間座標を求めるようにしてもよい。 In the above embodiment, music sensitivity space coordinates of each registered song are obtained for each of a plurality of groups divided by sex and age, but for each of a plurality of patterns divided by sex, age, and music genre. In addition, the music sensitivity space coordinates of each registered song may be obtained.

上記実施例では、ユーザ入力部２０１には、各感性語対毎の曲印象値が入力されているが、感性空間座標値（各因子毎の座標値）を直接入力させるようにしてもよい。この場合には、重要度情報としては、各因子毎の重要度情報を直接入力させるようにしてもよい。 In the above-described embodiment, the music impression value for each sensitivity word pair is input to the user input unit 201, but the sensitivity space coordinate values (coordinate values for each factor) may be directly input. In this case, importance information for each factor may be directly input as importance information.

この発明は、カーステレオやホームオーディオ、ＰＣ等の音楽再生機器に利用できる。また、この発明は、音楽配信サービス等の検索ソフトに利用できる。 The present invention can be used for music playback devices such as car stereos, home audio, and PCs. The present invention can also be used for search software such as a music distribution service.

また、この発明は、音楽データベースの他映像データベースの検索にも利用できる。つまり、絵画やカタログなどのデータベースの検索装置として利用できる。 The present invention can also be used for searching video databases as well as music databases. That is, it can be used as a search device for databases such as pictures and catalogs.

音楽データベース検索システムの構成を示すブロック図である。It is a block diagram which shows the structure of a music database search system. 自動インデクシング部１０２の構成をより具体的に示すブロック図である。2 is a block diagram showing more specifically the configuration of an automatic indexing unit 102. FIG.

Explanation of symbols

１００音楽感性空間生成部
２００検索システム部
１０１初期音楽感性空間生成部
１０２自動インデクシング部
１１１因子分析部
１１２データベース部
１２１物理的特徴量抽出部
１２２物理的特徴量／曲印象値変換部
１２３曲印象値／音楽感性空間座標変換部
２０１ユーザ入力部
２０２曲印象値／音楽感性空間座標変換部
２０３検索部
２０４検索結果出力部 DESCRIPTION OF SYMBOLS 100 Music sensitivity space generation part 200 Search system part 101 Initial music sensitivity space generation part 102 Automatic indexing part 111 Factor analysis part 112 Database part 121 Physical feature-value extraction part 122 Physical feature-value / music impression value conversion part 123 Music impression value / Music sensitivity space coordinate conversion unit 201 User input unit 202 Music impression value / Music sensitivity space coordinate conversion unit 203 Search unit 204 Search result output unit

Claims

For each content registered in the database, the Kansei space coordinate value obtained for the content is registered in the database as an index of the content, and the impression value for each sensitivity word pair input by the user at the time of search In the database search device for searching the content having the shortest spatial distance from the calculated emotional space coordinate value, the emotional space coordinate value corresponding to the input impression value for each sensitivity word pair is calculated based on
For each classification pattern classified by gender and / or age, the Kansei space coordinate value of each content is obtained and registered in the database as index information.
A first means for allowing a user to input user information for specifying a classification pattern in addition to an impression value for each sensitivity word pair at the time of search;
Second means for calculating a sensitivity space coordinate value corresponding to an impression value for each sensitivity word pair input by the first means;
A third means for calculating a spatial distance between the emotional space coordinate value calculated by the second means and the emotional space coordinate value registered as an index in the database; and the spatial distance calculated by the third means is the shortest. A fourth means of selecting content,
The third means uses only the index information corresponding to the classification pattern specified by the user information input by the first means as the spatial distance calculation target when calculating the spatial distance. apparatus.

For each content registered in the database, the Kansei space coordinate value obtained for the content is registered in the database as an index of the content, and the spatial space with the Kansei space coordinate value specified by the user at the time of search In a database search device that searches for content with the shortest distance,
For each classification pattern classified by gender and / or age, the Kansei space coordinate value of each content is obtained and registered in the database as index information.
A first means for allowing a user to input user information for specifying a classification pattern in addition to the sensitivity space coordinate value at the time of retrieval;
A second means for calculating a spatial distance between the emotional space coordinate value input by the first means and the emotional space coordinate value registered as an index in the database; and the spatial distance calculated by the second means is the shortest. A third means for selecting content,
The second means uses the index information corresponding to the classification pattern specified by the user information input by the first means as the spatial distance calculation target when calculating the spatial distance. apparatus.