JP2008210382A

JP2008210382A - Music data processor

Info

Publication number: JP2008210382A
Application number: JP2008032848A
Authority: JP
Inventors: Takayasu Miki; 孝保三木; Yukiko Yamamoto; 祐規子山本; Takashi Takeda; 享司竹田; Junichi Tagawa; 潤一田川
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2008-02-14
Filing date: 2008-02-14
Publication date: 2008-09-11

Abstract

PROBLEM TO BE SOLVED: To provide a music data processor that allows a user to grasp the content of a piece of music visually and intuitively, based on a feature amount extracted from the sound signal of the music.

SOLUTION: The music data processor comprises: a sound signal input means 61 for inputting the sound signal of a piece of music; a feature amount extraction means 62 for extracting a predetermined feature amount from the sound signal; a character imparting means 63 for selecting, based on the feature amount, a two-dimensional or three-dimensional character shape or character motion that expresses the features of the music; and a character storage means 66 for storing the feature amount and the character shape and/or character motion in association with each other. The predetermined feature amount is thus extracted from the sound signal of the music, and a character characterizing that music is imparted using the extracted feature amount, so that the user can grasp the content of the music visually.

COPYRIGHT: (C)2008, JPO&INPIT

Description

The present invention relates to a music data processing apparatus that lets a user grasp the content of a piece of music visually and intuitively when searching for and playing back music, or when manipulating music data.

When a user searches for a favorite song on an MD player, a DVD player, or in a music database that holds a large amount of music on a hard disk, the user typically searches with the song title or singer name as a keyword, or picks a song that seems to match his or her taste from a list of text information such as song titles and singer names. From text information such as the song title and singer name alone, however, it is difficult to know the content of the song itself, for example whether its tempo is fast or slow, or whether it is intense or gentle. Preparing more text information beyond the song title and singer name as bibliographic information would help at selection time, but preparing and registering that bibliographic information requires separate work. Software that uses a database of music bibliographic information such as CDDB attempts to improve operability and convenience by inserting bibliographic information automatically, but with text information alone it takes time to grasp and understand the content of a song, and it is hard to form an intuitive image of that content.

In contrast, if a still image expressing the content or atmosphere of a song could be displayed in association with that song, the user could grasp the content more easily from the image information in addition to the text information, which would make it easier to classify and search for the desired songs. At present, however, preparing a still image that matches the content of each song is cumbersome, and even when several representative still images expressing such content are prepared, it is difficult to match them to songs appropriately.

There are also existing techniques that display a character or the like while a song is played. Some of them display characters at random regardless of the song content, while others display them using song information such as the title or lyrics as keywords. With the former, the content of the song cannot be grasped; the latter requires separate, time-consuming work to register the song information.

An object of the present invention is to propose means for assigning to a song a still image that suits it, based on the content of the song's acoustic signal, and thereby to let the user easily search for or select a desired song from a large amount of music data. A further object is to propose means for animating a character that suits the song, based on the content of the song's acoustic signal, so that during playback the user can grasp the content of the song visually as well as aurally.

The music data processing apparatus according to claim 1 comprises: acoustic signal input means for inputting an acoustic signal of a song; feature amount extraction means for extracting a predetermined feature amount from the acoustic signal; character imparting means for selecting, based on the feature amount, the shape and/or motion of a two-dimensional or three-dimensional character that expresses the features of the song; and character storage means for storing the feature amount in association with the character shape and/or character motion.

The music data processing apparatus according to claim 2 comprises: acoustic signal input means for inputting an acoustic signal of a song at certain intervals; feature amount extraction means for extracting a predetermined feature amount from the acoustic signal; character imparting means for selecting the shape and/or motion of a two-dimensional or three-dimensional character in conjunction with changes in the feature amount; and character storage means for storing the feature amount in association with the character shape and/or character motion.

The invention according to claim 3 is the music data processing apparatus according to claim 1 or 2, wherein the feature amount is extracted either when the song is registered or when the song is played back.

The invention according to claim 4 is the music data processing apparatus according to claim 3, wherein feature amount extraction at playback time is either performed in parallel with playback of the song or performed in advance, before playback of the song.

The invention according to claim 5 is the music data processing apparatus according to claim 1 or 2, wherein the feature amount extraction means extracts the feature amount of the song's acoustic signal from a region formed by any combination of the whole song, one part of the song, and multiple parts of the song.

The invention according to claim 6 is the music data processing apparatus according to claim 1 or 2, wherein the acoustic signal input means inputs a compressed acoustic signal of the song.

The invention according to claim 7 is the music data processing apparatus according to claim 1 or 2, further comprising data management means for managing the song, the feature amount, and the character shape and/or character motion in association with one another.

The invention according to claim 8 is the music data processing apparatus according to claim 1 or 2, wherein the character storage means stores at least one of the character shape and the character motion in association with the feature amount.

According to the invention of claim 1, a predetermined feature amount is extracted from the acoustic signal of a song, and a character that characterizes the song is imparted using that feature amount, so that the user can grasp the content of the song visually.

According to the invention of claim 2, a predetermined feature amount is extracted from the acoustic signal of a song repeatedly, and the changes in that feature amount are used to vary the shape and/or motion of the character that characterizes the song, so that the user can grasp the content of the song visually.

According to the invention of claim 3, the timing of feature amount extraction can be chosen between song registration and song playback.

According to the invention of claim 4, even when raising the accuracy of feature amount extraction takes a certain amount of time, a feature amount extracted in advance can be used as an initial value, so that the feature amount can be used with high accuracy.

According to the invention of claim 5, because the region used for feature amount extraction is selectable, the user can easily tell from the character's shape and motion which songs have features close to those of an intro, an ending, or a favorite phrase.

According to the invention of claim 6, characters can be associated according to feature amounts extracted from the acoustic signal not only for linear PCM digital audio as used on music CDs, but also for compressed audio data such as AAC, MP3, and WMA.

According to the invention of claim 7, data such as the song's acoustic signal, the feature amount, and the character shape and/or motion can be stored and managed.

According to the invention of claim 8, either the shape or the motion of the character can be held fixed.

When a list of songs is displayed, a thumbnail still image assigned on the basis of each song's feature amounts is shown with the song, much as a list of video data is shown as thumbnails, so that the user can grasp the content of a song or group of songs visually and intuitively.

(Embodiment 1)
Embodiment 1 of the present invention is described below with reference to the drawings. FIG. 1 is a block diagram showing the overall configuration of the music search apparatus according to Embodiment 1. In FIG. 1, 11 denotes acoustic signal input means, 12 feature amount extraction means, 13 thumbnail assigning means, 14 data management means, 15 browsing means, 16 browse requirement input means, and 17 bibliographic information input means.

The operation of the music search apparatus configured as above is described below with reference to FIG. 1. The apparatus consists broadly of a music data registration unit 111 that registers the acoustic signal of a target song and its accompanying data, data management means 14 that manages the registered music data, and a music browsing unit 112 that browses the managed data for the song the user wants.

First, the music data registration unit 111 is outlined. The data management means 14 records, for each song, the acoustic signal and the accompanying information described below in association with each other, so that they can be searched and referenced. To begin, the acoustic signal input means 11 registers the acoustic signal of a song input for registration in the data management means 14 and also outputs it to the downstream feature amount extraction means 12 so that accompanying information can be generated. If the input acoustic signal is analog, the acoustic signal input means 11 digitizes it before passing it downstream. If the acoustic signal is compressed, the compressed data is registered in the data management means 14, then decompressed, and the decompressed data is output to the feature amount extraction means 12.

Next, the feature amount extraction means 12 extracts from the input acoustic signal several feature amounts representing its physical characteristics, registers them in the data management means 14 as accompanying information, and outputs them to the downstream thumbnail assigning means 13. The thumbnail assigning means 13 generates a thumbnail still image from the input feature amounts and registers it in the data management means 14 as accompanying information. Alongside this procedure, the bibliographic information input means 17 registers the entered bibliographic information, such as song title, singer name, and genre name, in the data management means 14 as accompanying information.

The user enters a singer name, genre name, or the like as a keyword through the browse requirement input means 16. The data management means 14 searches the data it manages using that keyword, extracts the matching candidate songs, and outputs the result to the browsing means 15. For that result, the browsing means 15 displays a list of the thumbnail still images associated with the songs together with bibliographic information such as song titles. From the displayed list, the user selects the desired song through the browse requirement input means 16, based on the thumbnail still images and the bibliographic information. The browse requirement input means 16 causes the data management means 14 to output the acoustic signal of the selected song to the browsing means 15, which plays the song.

The feature amounts extracted by the feature amount extraction means 12 include the spectral change degree P1 (the degree of spectral change between frames), the average number of sounds P2 (how frequently sounds are produced in the song), the sounding aperiodicity P3 (the degree of non-periodicity of the sounds produced in the song), and the beat period P4 (the duration corresponding to a quarter note of the song). In addition to these four, parameters such as the beat period ratio P5, the beat intensity P6, and the beat intensity ratio P7 may also be used as feature amounts. The methods for calculating these feature amounts are described in detail in Japanese Patent Application No. 2001-082150. When extracting feature amounts from a target song, the extraction range may be a region formed by any combination of the whole song, one part of the song, and multiple parts of the song.
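The choice of extraction range can be illustrated with a minimal Python sketch. It assumes the decoded signal is a plain sequence of samples and that each part is given as a (start, end) pair in seconds; the function name `select_region` and its parameters are hypothetical and do not come from the patent.

```python
from typing import List, Sequence, Tuple


def select_region(samples: Sequence[float],
                  ranges: Sequence[Tuple[float, float]],
                  sample_rate: int) -> List[float]:
    """Concatenate the samples inside each (start_s, end_s) range, in order."""
    total = len(samples)
    region: List[float] = []
    for start_s, end_s in ranges:
        # Convert seconds to sample indices and clamp them to the signal.
        start = max(0, int(start_s * sample_rate))
        end = min(total, int(end_s * sample_rate))
        if start < end:
            region.extend(samples[start:end])
    return region


# Toy signal: 10 seconds at 4 samples per second (assumed values).
signal = [float(i) for i in range(40)]
whole = select_region(signal, [(0.0, 10.0)], 4)
intro_and_ending = select_region(signal, [(0.0, 1.0), (9.0, 10.0)], 4)
```

Passing a single range that covers the song gives the whole-song case, while several ranges give the multiple-parts case; feature extraction would then run on the returned samples.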

Next, the operation of the thumbnail assigning means is described. The thumbnail assigning means receives the N feature values P1 to PN extracted in the preceding stage and generates a thumbnail still image from them. Two examples of thumbnail still image generation are described: generation of a two-dimensional or three-dimensional shape still image, and generation of a color still image.

To generate a two-dimensional or three-dimensional shape still image, M of the N feature amounts are selected in advance and a graph of the M values is generated. The graph may be a still image rendered as a pie chart or bar chart, or a still image that expresses the M-dimensional values as a two-dimensional or three-dimensional shape.
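As a rough illustration of the bar chart variant, the following Python sketch turns M feature values into a small bitmap. It assumes the values have already been scaled to the range 0 to 1; the function name `bar_chart_thumbnail`, the bitmap representation, and the default sizes are illustrative assumptions, not details given in the patent.

```python
from typing import List, Sequence


def bar_chart_thumbnail(values: Sequence[float], height: int = 8,
                        bar_width: int = 2) -> List[List[int]]:
    """Render values in [0, 1] as a bar chart bitmap (1 = filled, 0 = empty)."""
    # Clamp each value and convert it to a bar height in pixels.
    heights = [round(min(max(v, 0.0), 1.0) * height) for v in values]
    rows: List[List[int]] = []
    for row in range(height):       # row 0 is the top of the image
        level = height - row        # a bar is drawn here if it reaches this level
        line: List[int] = []
        for h in heights:
            line.extend([1 if h >= level else 0] * bar_width)
        rows.append(line)
    return rows


bitmap = bar_chart_thumbnail([0.0, 0.5, 1.0], height=4, bar_width=1)
```

A real implementation would write the bitmap to an image format; the nested list is used here only to keep the sketch free of external libraries.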

To generate a color still image, M of the N feature amounts are selected in advance and the M values are converted into a color space. (Equation 1) is an example that converts three predetermined feature amounts into a color space value using the three primary colors.
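(Equation 1) is not reproduced in this text, so the following Python sketch shows only one plausible way to map three feature amounts onto the three primary colors. It assumes a linear scaling of each feature between a lower and upper bound onto an 8-bit channel; the function name `features_to_rgb` and the bounds are hypothetical.

```python
from typing import Sequence, Tuple


def features_to_rgb(features: Sequence[float],
                    bounds: Sequence[Tuple[float, float]]) -> Tuple[int, int, int]:
    """Map three feature values to an (R, G, B) triple of 0-255 integers."""
    if len(features) != 3 or len(bounds) != 3:
        raise ValueError("exactly three features and three bounds are required")
    channels = []
    for value, (low, high) in zip(features, bounds):
        # Assumed linear normalization, clamped to the valid range.
        ratio = (value - low) / (high - low)
        ratio = min(max(ratio, 0.0), 1.0)
        channels.append(round(ratio * 255))
    return (channels[0], channels[1], channels[2])


color = features_to_rgb([0.0, 0.2, 2.0], [(0.0, 1.0), (0.0, 1.0), (0.0, 1.0)])
```

The thumbnail would then be a still image filled with the resulting color; the actual conversion in the patent may differ from this linear form.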

The beat period P4 represents the duration of a quarter note of the song and is therefore a numerical expression of the song's tempo. Accordingly, (Equation 2) may be used with the beat period P4 so that the still image is displayed in a redder color for songs with a fast tempo and in a bluer color for songs with a slow tempo.
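(Equation 2) is likewise not reproduced here. The Python sketch below is one assumed realization of the described behavior: it interpolates linearly between pure red and pure blue as the beat period grows. The function name `tempo_color` and the period bounds (0.3 s, or 200 beats per minute, and 1.0 s, or 60 beats per minute) are illustrative assumptions.

```python
from typing import Tuple


def tempo_color(beat_period_s: float,
                fast_period_s: float = 0.3,
                slow_period_s: float = 1.0) -> Tuple[int, int, int]:
    """Return (R, G, B): red for short beat periods, blue for long ones."""
    span = slow_period_s - fast_period_s
    # 0.0 means as fast as the fast bound, 1.0 means as slow as the slow bound.
    slowness = (beat_period_s - fast_period_s) / span
    slowness = min(max(slowness, 0.0), 1.0)
    red = round(255 * (1.0 - slowness))
    blue = round(255 * slowness)
    return (red, 0, blue)


fast_song = tempo_color(0.3)   # quarter note of 0.3 s
slow_song = tempo_color(1.0)   # quarter note of 1.0 s
```

Because the beat period is the quarter-note duration, the tempo in beats per minute is 60 divided by the period, which is why a shorter period maps to the red end.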

Through the above operation, the thumbnail assigning means 13 generates a thumbnail still image for the song, and the data management means 14 manages it in association with the song. Bibliographic information such as the song title, singer name, and genre name is also entered from the bibliographic information input means 17, and the data management means 14 manages it in association with the song. Data such as the song's acoustic signal, bibliographic information, feature amounts, and thumbnail still image are associated with one another and managed by the data management means 14 in the form of records. FIG. 3 shows an example of the record format managed by the data management means 14. Reference numeral 32 shows the content of the feature amounts, and 31 shows a record in which the bibliographic information and feature amounts are placed, as accompanying information 1, ahead of the song's acoustic signal. Reference numeral 33 is accompanying information 2, which contains the thumbnail still image data. Accompanying information 2 (33) and the acoustic signal of record 31 are linked to each other through a song ID assigned uniquely to each song.
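The record layout described for FIG. 3 can be sketched in Python as two record types joined by the song ID. The class and field names (`SongRecord`, `ThumbnailRecord`, `DataStore`) are hypothetical, and the in-memory dictionaries stand in for whatever storage the data management means actually uses.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class SongRecord:
    """Record 31: accompanying information 1 placed ahead of the acoustic signal."""
    song_id: int
    title: str
    singer: str
    genre: str
    features: List[float]   # content 32: P1, P2, ...
    audio: bytes


@dataclass
class ThumbnailRecord:
    """Accompanying information 2 (33): thumbnail still image data."""
    song_id: int
    image: bytes


class DataStore:
    """Toy stand-in for the data management means, keyed by song ID."""

    def __init__(self) -> None:
        self.songs: Dict[int, SongRecord] = {}
        self.thumbnails: Dict[int, ThumbnailRecord] = {}

    def register(self, song: SongRecord, thumbnail: ThumbnailRecord) -> None:
        if song.song_id != thumbnail.song_id:
            raise ValueError("song and thumbnail must share one song ID")
        self.songs[song.song_id] = song
        self.thumbnails[thumbnail.song_id] = thumbnail

    def thumbnail_for(self, song_id: int) -> bytes:
        return self.thumbnails[song_id].image


store = DataStore()
store.register(
    SongRecord(1, "Example Song", "Example Singer", "rock",
               [0.4, 0.7, 0.1, 0.5], b"PCM"),
    ThumbnailRecord(1, b"IMG"),
)
```

Keeping the thumbnail in a separate record means it can be replaced or shared without rewriting the audio record, which is one reason to link the two by ID rather than embed one in the other.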

Next, when the user enters, for example, a genre name as a search keyword through the browse requirement input means 16, the browse requirement input means 16 instructs the data management means 14 to search the music data using that keyword. The data management means 14 outputs the search result, a list of the songs in the specified genre, to the browsing means 15 together with the bibliographic information and thumbnail still image associated with each song and the song ID shown in 31, and the browsing means 15 displays the bibliographic information and thumbnail still images associated with the songs. The user selects the desired song from the displayed list of bibliographic information and thumbnail still images through the browse requirement input means 16, which instructs the data management means 14 to search by the song ID of the selected song. The data management means 14 outputs the acoustic signal data of the song with that ID to the browsing means 15, and the browsing means 15 plays the song.
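The two-step browse flow (search by keyword, then fetch audio by song ID) might look like the following Python sketch. The catalog contents, field names, and the functions `search_by_genre` and `fetch_audio` are invented for illustration only.

```python
from typing import Dict, List, Tuple

Song = Dict[str, object]

# Hypothetical catalog keyed by song ID.
CATALOG: Dict[int, Song] = {
    1: {"title": "Song A", "genre": "rock", "thumbnail": "a.png", "audio": b"AAA"},
    2: {"title": "Song B", "genre": "jazz", "thumbnail": "b.png", "audio": b"BBB"},
    3: {"title": "Song C", "genre": "rock", "thumbnail": "c.png", "audio": b"CCC"},
}


def search_by_genre(catalog: Dict[int, Song], genre: str) -> List[Tuple[int, str, str]]:
    """Step 1: return (song ID, title, thumbnail) for every song in the genre."""
    return [(song_id, str(song["title"]), str(song["thumbnail"]))
            for song_id, song in sorted(catalog.items())
            if song["genre"] == genre]


def fetch_audio(catalog: Dict[int, Song], song_id: int) -> bytes:
    """Step 2: return the acoustic signal data for the selected song ID."""
    audio = catalog[song_id]["audio"]
    if not isinstance(audio, bytes):
        raise TypeError("audio must be stored as bytes")
    return audio


listing = search_by_genre(CATALOG, "rock")   # shown to the user as a list
chosen_id = listing[0][0]                    # the user picks the first entry
audio = fetch_audio(CATALOG, chosen_id)      # handed to playback
```

Only the lightweight listing data travels in the first step; the audio is fetched once the user has chosen, which matches the order of operations in the text.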

Embodiment 1 has been described above as an apparatus that realizes the above operation, but the means described may instead be implemented as a whole as a program that realizes the operation on a computer. In that case, the program realizing the music data registration unit 111 and the data management means may run on a server, while the music browsing unit 112 is a program that runs on a terminal computer connected via the Internet. Alternatively, the program realizing the data management means may run on the server, while the programs realizing the music data registration unit 111 and the music browsing unit 112 run, via the Internet, on the same computer or on different computers.

(Embodiment 2)
Embodiment 2 of the present invention is described below with reference to the drawings. FIG. 2 is a block diagram showing the overall configuration of the music search apparatus according to Embodiment 2. In FIG. 2, 11 denotes acoustic signal input means, 12 feature amount extraction means, 23 thumbnail assigning means, 24 data management means, 25 browsing means, 26 browse requirement input means, and 27 still image input means.

The operation of the music search apparatus configured as above is described below with reference to FIG. 2. The apparatus consists broadly of a music data registration unit 211 that registers the acoustic signal of a target song and its accompanying data, data management means 24 that manages the registered music data, and a music browsing unit 212 that browses the managed data for the song the user wants. The apparatus reuses the means shown as blocks in Embodiment 1 and replaces some of them with new functions, as described below.

The acoustic signal input means 11 and the feature amount extraction means 12 operate in the same way as in Embodiment 1 and output the feature amounts extracted from the song to the thumbnail assigning means 23. If a thumbnail still image is input from the still image input means 27 at the same time, the thumbnail assigning means 23 outputs that thumbnail still image and the feature amounts to the data management means 24, which manages them in association with the song's acoustic signal data.

If no thumbnail still image is input from the still image input means 27, the thumbnail assigning means 23 uses the feature amounts P extracted from the input song as a key to search the feature amounts PX in the accompanying information managed by the data management means 24. Using (Equation 3), it calculates the Euclidean distance between the feature amounts and takes its reciprocal L as the value representing the similarity.
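(Equation 3) is not reproduced in this text, but the description (the reciprocal of the Euclidean distance between P and PX) can be sketched in Python as follows. The function name `similarity` is hypothetical, and returning infinity for identical feature vectors is an assumption, since the text does not say how a zero distance is handled.

```python
import math
from typing import Sequence


def similarity(p: Sequence[float], px: Sequence[float]) -> float:
    """Return L, the reciprocal of the Euclidean distance between p and px."""
    if len(p) != len(px):
        raise ValueError("feature vectors must have the same length")
    distance = math.sqrt(sum((a - b) ** 2 for a, b in zip(p, px)))
    if distance == 0.0:
        # Assumption: identical features are treated as maximally similar.
        return math.inf
    return 1.0 / distance


example = similarity([0.0, 0.0], [3.0, 4.0])   # distance is 5
```

A larger L therefore means a closer match, which is why the next step looks for the largest value.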

The thumbnail assigning means 23 finds the feature amounts PXm with the largest similarity value Lm among those calculated, and retrieves the thumbnail still image data managed in association with PXm. It then outputs the thumbnail still image, the feature amounts P, and the similarity Lm to the data management means 24 in association with the song, and the data management means 24 manages them together. Reference numeral 34 in FIG. 3 shows an example of accompanying information 3, a record that holds thumbnail still image data and similarity data.
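The selection of the most similar registered song and the reuse of its thumbnail can be sketched in Python as below. The function name `nearest_thumbnail`, the list-of-pairs input, and the file names are illustrative assumptions; the similarity is the reciprocal Euclidean distance described in the text, with infinity assumed for a zero distance.

```python
import math
from typing import List, Sequence, Tuple


def nearest_thumbnail(p: Sequence[float],
                      registered: List[Tuple[Sequence[float], str]]
                      ) -> Tuple[str, float]:
    """Return (thumbnail, Lm) for the registered features closest to p."""
    if not registered:
        raise ValueError("no registered songs to compare against")
    best_thumbnail = ""
    best_similarity = -1.0
    for px, thumbnail in registered:
        distance = math.dist(p, px)                 # Euclidean distance
        sim = math.inf if distance == 0.0 else 1.0 / distance
        if sim > best_similarity:
            best_thumbnail, best_similarity = thumbnail, sim
    return best_thumbnail, best_similarity


registered = [([0.0, 0.0], "calm.png"), ([10.0, 10.0], "intense.png")]
thumbnail, lm = nearest_thumbnail([9.0, 10.0], registered)
```

The returned Lm would be stored with the new song, as in accompanying information 3, so that the browsing side can later show how close the match was.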

The user requests a list display of the music data through the browse requirement input means 26. On that instruction, the data management means 24 outputs the search result, a list of song IDs, thumbnail still image data, feature amounts, and similarities, to the browsing means 25, which displays the list of thumbnail still images.

When displaying a thumbnail still image, the browsing means 25 may adjust its display brightness or hue based on the similarity data associated with it, thereby expressing the degree of similarity among songs that have been assigned the same thumbnail still image. Although the similarity above is calculated as a Euclidean distance over all the feature amounts associated with each song, it may instead be calculated as a Euclidean distance over any combination of feature amounts. The similarity may also be a value other than a distance, such as a simple difference of values or a value calculated with coefficients that emphasize particular feature amounts. Furthermore, the bibliographic information input means of Embodiment 1 may be added to Embodiment 2 so that bibliographic information can be entered, and the browsing means 25 may display bibliographic information such as song titles together with the thumbnail still images. FIG. 4 shows an example of a list display presented by the browsing means 25: 41 is the browse screen and 42 is an example of a thumbnail still image.
As in Embodiment 1, the whole may be implemented as a program that realizes the operation of Embodiment 2 on a computer, and that program may be split into a part that runs on a server and parts that run, via the Internet, on one or more computers.
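Two of the variations mentioned for the browsing means, emphasizing particular feature amounts with coefficients and varying display brightness with similarity, can be sketched in Python. Both functions are assumptions for illustration: the patent gives no formula for the weighting or for the brightness mapping, and the names `weighted_similarity` and `brightness` and the 0.5 brightness floor are invented here.

```python
import math
from typing import Sequence


def weighted_similarity(p: Sequence[float], px: Sequence[float],
                        weights: Sequence[float]) -> float:
    """Reciprocal of a weighted Euclidean distance; a weight of 0 ignores a feature."""
    if not (len(p) == len(px) == len(weights)):
        raise ValueError("p, px and weights must have the same length")
    distance = math.sqrt(sum(w * (a - b) ** 2
                             for a, b, w in zip(p, px, weights)))
    return math.inf if distance == 0.0 else 1.0 / distance


def brightness(sim: float, floor: float = 0.5) -> float:
    """Map a similarity in [0, inf] to a display brightness in [floor, 1]."""
    # Assumed mapping: sim / (1 + sim) grows from 0 toward 1 as sim increases.
    closeness = 1.0 if math.isinf(sim) else sim / (1.0 + sim)
    return floor + (1.0 - floor) * closeness


only_first = weighted_similarity([0.0, 0.0], [3.0, 4.0], [1.0, 0.0])
```

With this mapping, a song whose features exactly match the thumbnail's source song is shown at full brightness, and less similar songs are dimmed toward the floor value without disappearing.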

(Embodiment 3)
Embodiment 3 of the present invention is described below with reference to the drawings. FIG. 5 is a block diagram showing the overall configuration of the music data processing apparatus according to Embodiment 3. In FIG. 5, 51 denotes acoustic signal input means, 52 feature amount extraction means, 53 character imparting means, 54 data management means, 55 display operation means, and 56 character storage means.

The operation of the music data processing apparatus configured as above is described below with reference to FIG. 5. The apparatus consists broadly of a music data registration unit 511 that registers the acoustic signal of a target song and its accompanying data, data management means 54 that manages the registered music data, and display operation means 55 that displays and animates, from the managed data, the character corresponding to the features of the song. First, the music data registration unit 511 is outlined. The data management means 54 records, for each song, the acoustic signal and the accompanying information described below in association with each other, so that they can be searched and referenced.

First, the acoustic signal input means 51 registers the acoustic signal of the music input for registration in the data management means 54 and also outputs it to the subsequent feature amount extraction means 52 so that accompanying information can be generated. If the input acoustic signal is an analog signal, the acoustic signal input means 51 digitizes it before outputting it to the subsequent stage. If the input is a compressed acoustic signal, the compressed data is registered in the data management means 54, and the data is then decompressed and the decompressed data is output to the feature amount extraction means 52.

Next, the feature amount extraction means 52 extracts from the input acoustic signal several feature amounts representing its physical characteristics, registers them in the data management means 54 as accompanying information, and outputs them to the subsequent character imparting means 53. As in Embodiment 1, the feature amounts include the spectrum change degree P1, the average number of sounded notes P2, the sounding non-periodicity P3, and the beat period P4. In addition to these four, parameters such as the beat period ratio P5, the beat intensity P6, and the beat intensity ratio P7 may also be used as feature amounts. When extracting feature amounts from the target music, the extraction range may be a region consisting of any combination of the whole of the music, a part of the music, and a plurality of parts of the music.

Next, the character imparting means 53 will be described. The character imparting means 53 receives the N feature amount values P1 to PN extracted in the preceding stage, selects from the subsequent character storage means 56 the shape and motion of the two-dimensional or three-dimensional character associated with the stored feature amount closest in distance to the input, and registers the selection in the data management means 54 as accompanying information. An example of character selection is described below.
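The nearest-feature selection described above can be sketched as a simple nearest-neighbor lookup. The sketch below assumes a Euclidean distance and an in-memory list standing in for the character storage means; the names `CharacterEntry` and `select_character`, and the example shapes and motions, are hypothetical and are not taken from the specification.

```python
import math
from typing import NamedTuple, Sequence, Tuple

class CharacterEntry(NamedTuple):
    """One stored association: reference feature amounts -> shape and motion."""
    features: Tuple[float, ...]   # reference values for P1..PN
    shape: str                    # hypothetical shape identifier
    motion: str                   # hypothetical motion identifier

def select_character(features: Sequence[float],
                     store: Sequence[CharacterEntry]) -> CharacterEntry:
    """Return the stored entry whose feature amounts are closest to `features`."""
    if not store:
        raise ValueError("character store is empty")
    # math.dist computes the Euclidean distance between two points.
    return min(store, key=lambda entry: math.dist(features, entry.features))

# Example store with two illustrative characters.
STORE = [
    CharacterEntry((0.1, 0.2, 0.1, 0.8), "round", "sway"),
    CharacterEntry((0.9, 0.8, 0.7, 0.3), "spiky", "jump"),
]
chosen = select_character((0.8, 0.9, 0.6, 0.35), STORE)
```

In this example `chosen` is the "spiky"/"jump" entry, because its reference feature amounts lie closer to the input than those of the other entry.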

The character storage means 56 holds, in advance, associations between feature amounts of music and character shapes and character motions. The association may be made for each parameter of a specific or arbitrary combination of feature amounts, or with statistics of the N values P1 to PN. When feature amounts are input to the character imparting means 53, it selects from the character storage means 56, by the method described above, the character shape and character motion associated with the stored feature amount closest in distance to the feature amount of the music. For the tempo of the character's motion, the beat period P4 may be used as the time parameter that specifies the motion cycle; because the beat period P4 represents the duration of a quarter note of the music, the character then moves in time with the tempo of the corresponding music. Other feature amounts may likewise be used as motion parameters of the character so that the content of the music is expressed through a greater variety of movements. By the above means, the character imparting means 53 specifies the shape, motion, and motion cycle of the character and registers them in the data management means 54.
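How a beat period can drive the motion cycle may be illustrated with the short sketch below. It assumes that P4 is expressed in seconds per quarter note, which the specification does not state, and the function names and the `beats_per_cycle` parameter are hypothetical.

```python
def tempo_bpm(beat_period_s: float) -> float:
    """Tempo in quarter notes per minute for a beat period given in seconds."""
    return 60.0 / beat_period_s

def motion_phase(t: float, beat_period_s: float, beats_per_cycle: int = 1) -> float:
    """Position within the motion cycle at playback time t, in [0.0, 1.0).

    One cycle of the character's motion is assumed to span
    `beats_per_cycle` quarter notes.
    """
    cycle = beat_period_s * beats_per_cycle
    return (t % cycle) / cycle
```

With a beat period of 0.5 s the tempo is 120 quarter notes per minute, and a motion spanning one beat is halfway through its cycle 0.25 s after each beat.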

When associating the feature amounts of music with character shapes or motions, the shapes and motions may be determined from users' subjective impressions of the music, obtained for example through sensory evaluation experiments using the SD (semantic differential) method as described in Japanese Patent Application No. 2001-082150. Characters determined in this way match the content of the music more closely.

Through the above operation, the character imparting means 53 selects a character for the music, and the data management means 54 manages it in association with the music. Data such as the acoustic signal of the music, the feature amounts, the character shape, and the character motion are associated with one another and managed by the data management means 54 in the form of records. FIG. 7 shows an example of the record format managed by the data management means 54. Reference numeral 72 denotes the contents of the feature amounts, and 73 denotes the character information. Reference numeral 71 denotes a record in which the feature amounts and the character information are arranged as accompanying information ahead of the acoustic signal of the music. Reference numeral 74 denotes an example of character storage, where x is a feature amount value. Besides the example 74, several music records may be registered in the character storage means in advance. Reference numeral 75 denotes a list of character shape information, and 76 denotes a list of character motion information.
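The record layout described above, with feature amounts and character information kept as accompanying information alongside the acoustic signal, might be modeled as in the following sketch. The field names, the integer identifiers for shape and motion, and the dictionary-backed `DataManager` class are assumptions made for illustration; FIG. 7 is not reproduced here and may differ in detail.

```python
from dataclasses import dataclass
from typing import Dict, List

@dataclass
class MusicRecord:
    """One record: accompanying information followed by the acoustic signal."""
    features: List[float]   # feature amounts P1..PN (item 72)
    shape_id: int           # character shape (part of item 73), hypothetical id
    motion_id: int          # character motion (part of item 73), hypothetical id
    motion_period: float    # motion cycle, e.g. derived from the beat period
    audio: bytes            # acoustic signal, possibly compressed

class DataManager:
    """Minimal stand-in for the data management means: register and look up."""

    def __init__(self) -> None:
        self._records: Dict[str, MusicRecord] = {}

    def register(self, song_id: str, record: MusicRecord) -> None:
        self._records[song_id] = record

    def lookup(self, song_id: str) -> MusicRecord:
        return self._records[song_id]
```

At reproduction time, a player would call `lookup` with the song identifier and pass the shape, motion, and motion period to whatever component animates the character.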

At the time of music reproduction, the data management means 54 extracts the relevant music information from the data it manages and outputs the result to the display operation means 55. Based on the result obtained from the data management means 54, the display operation means 55 animates the character associated with the music. In the example above, the data of a piece of music is stored by the data management means 54 once it has been registered and is read out at the time of reproduction; alternatively, the feature amounts may be extracted at each reproduction, without using the data management means, and the character associated with them may be displayed and animated. When extracting feature amounts at each reproduction, the accuracy of the feature amounts may be improved by using previously extracted feature amounts as initial values. As in Embodiment 1, the whole may be configured as a program that realizes the operation of Embodiment 3 on a computer, or the program may be divided into a part executed on a server and parts executed, via the Internet, on one or more computers.

(Embodiment 4)
Embodiment 4 of the present invention will be described below with reference to the drawings. FIG. 6 is a block diagram showing the overall configuration of the music data processing apparatus according to Embodiment 4 of the present invention. In FIG. 6, 61 denotes an acoustic signal input means, 62 a feature amount extraction means, 63 a character imparting means, 64 a data management means, 65 a display operation means, and 66 a character storage means.

The operation of the music data processing apparatus configured as described above will now be described with reference to FIG. 6. The apparatus broadly consists of a music data registration unit 611 that registers the acoustic signal of a target piece of music together with its accompanying data, the data management means 64 that manages the registered music data, and the display operation means 65 that displays and animates, from the managed data, a character corresponding to the features of the music. The apparatus uses the means shown as blocks in Embodiment 3, with some of them replaced by new functions. The details are described below.

The acoustic signal input means 61 outputs the acoustic signal of the music to the feature amount extraction means 62 and the data management means 64 at times T (T1, T2, ..., Tn, ...) spaced at a fixed interval ΔT. The feature amount extraction means 62 extracts several feature amounts representing the physical characteristics of the input acoustic signal at time Tn, registers them in the data management means 64 as accompanying information, and outputs them to the subsequent character imparting means 63. For the feature amounts extracted at each time T, the character imparting means 63 selects from the subsequent character storage means 66 the shape and/or motion of the two-dimensional or three-dimensional character associated with the stored feature amount closest in distance, and registers the selection in the data management means 64 as accompanying information together with the time Tn. At the time of music reproduction, the data management means 64 extracts the relevant music information from the data it manages and outputs the result to the display operation means 65. Based on the result obtained from the data management means 64, the display operation means 65 animates the character associated with the music.
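Delivering the acoustic signal at times T1, T2, ... spaced by a fixed interval can be sketched as splitting a sample sequence into consecutive windows. The sketch assumes a digitized signal held as a Python list with a known sample rate; the function name and parameters are hypothetical, and the feature extraction that would run on each window is left out.

```python
def split_into_intervals(samples, sample_rate, delta_t):
    """Yield (start_time_s, chunk) pairs, one per interval of length delta_t.

    The last chunk may be shorter than the others if the signal length
    is not a multiple of the interval.
    """
    step = int(round(sample_rate * delta_t))
    if step <= 0:
        raise ValueError("delta_t is too short for this sample rate")
    for start in range(0, len(samples), step):
        yield start / sample_rate, samples[start:start + step]
```

Each yielded chunk would then be passed to feature extraction and character selection, and the start time would be stored with the selected character.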

Through the above operation, the character imparting means 63 selects a character for the music, and the data management means 64 manages it in association with the music. Data such as the acoustic signal of the music, the feature amounts, the character shape, and the character motion at each time T are associated with one another and managed by the data management means 64 in the form of records. In the example above, the data of a piece of music is stored by the data management means 64 once it has been registered and is read out at the time of reproduction; alternatively, the feature amounts may be extracted at each reproduction, without using the data management means, and the character associated with them may be displayed and animated. Also, in the example above, the data is registered in the data management means 64 at every time T regardless of whether the character shape or motion has changed; alternatively, only when the shape or motion at time Tn differs from that at the immediately preceding time Tn-1 may the changed data alone be registered in the data management means 64. Shortening the interval ΔT makes the character follow the content of the music more closely. As in Embodiment 1, the whole may be configured as a program that realizes the operation of Embodiment 4 on a computer, or the program may be divided into a part executed on a server and parts executed, via the Internet, on one or more computers.
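The option of registering only changed data can be sketched as follows: consecutive entries with the same shape and motion are collapsed, and the character in effect at any playback time is recovered by looking up the most recent change. The tuple layout `(time, shape, motion)` and both function names are assumptions made for this illustration.

```python
import bisect

def keep_changes(timeline):
    """Drop entries whose shape and motion equal those of the previous entry.

    `timeline` is a list of (time_s, shape, motion) tuples in time order.
    """
    changes = []
    previous = None
    for t, shape, motion in timeline:
        if (shape, motion) != previous:
            changes.append((t, shape, motion))
            previous = (shape, motion)
    return changes

def character_at(changes, t):
    """Return the (shape, motion) in effect at playback time t."""
    times = [entry[0] for entry in changes]
    i = bisect.bisect_right(times, t) - 1
    if i < 0:
        raise ValueError("t precedes the first registered entry")
    return changes[i][1:]

timeline = [
    (0.0, "round", "sway"),
    (1.0, "round", "sway"),
    (2.0, "round", "jump"),
    (3.0, "spiky", "jump"),
    (4.0, "spiky", "jump"),
]
changes = keep_changes(timeline)
```

Here five sampled entries reduce to three stored ones, and `character_at(changes, 2.5)` still yields the "round" shape with the "jump" motion.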

The music data processing apparatus according to the present invention can automatically animate a two-dimensional or three-dimensional character in accordance with the features of a piece of music and display it in conjunction with reproduction of the music. As a result, the content of a favorite piece of music can be enjoyed visually as well as aurally, so the apparatus can be applied to entertainment terminals in the home, in theaters, or in mobile spaces such as the interior of a car.

FIG. 1 is a block diagram showing the schematic configuration of the music search apparatus according to Embodiment 1 of the present invention.
FIG. 2 is a block diagram showing the schematic configuration of the music search apparatus according to Embodiment 2 of the present invention.
FIG. 3 is a diagram showing an example of a record managed by the data management means according to Embodiments 1 and 2 of the present invention.
FIG. 4 is a diagram showing an example of a screen presented by the browsing means according to Embodiment 2 of the present invention.
FIG. 5 is a block diagram showing the schematic configuration of the music data processing apparatus according to Embodiment 3 of the present invention.
FIG. 6 is a block diagram showing the schematic configuration of the music data processing apparatus according to Embodiment 4 of the present invention.
FIG. 7 is a diagram showing an example of a record managed by the data management means according to Embodiments 3 and 4 of the present invention, and an example of character information storage.

Explanation of symbols

11, 51, 61 Acoustic signal input means
12, 52, 62 Feature amount extraction means
13, 23 Thumbnail assigning means
14, 24, 54, 64 Data management means
15, 25 Browsing means
16, 26 Browse requirement input means
17 Bibliographic information input means
27 Still image input means
111, 211, 511, 611 Music data registration unit
112, 212 Music data browsing unit
31 Accompanying information 1
32 Feature amount
33 Accompanying information 2
34 Accompanying information 3
41 Browse screen
42 Thumbnail still image
53, 63 Character imparting means
55, 65 Display operation means
56, 66 Character storage means
71 Music record
72 Feature amount
73 Character information
74 Character information storage example
75 Character shape information
76 Character motion information

Claims

1. A music data processing apparatus comprising: an acoustic signal input means for inputting an acoustic signal of a piece of music; a feature amount extraction means for extracting a predetermined feature amount from the acoustic signal; a character imparting means for selecting, based on the feature amount, a shape and/or motion of a two-dimensional or three-dimensional character that expresses a feature of the music; and a character storage means for storing the feature amount in association with the shape and/or motion of the character.

2. A music data processing apparatus comprising: an acoustic signal input means for inputting an acoustic signal of a piece of music at fixed intervals; a feature amount extraction means for extracting a predetermined feature amount from the acoustic signal; a character imparting means for selecting a shape and/or motion of a two-dimensional or three-dimensional character in conjunction with changes in the feature amount; and a character storage means for storing the feature amount in association with the shape and/or motion of the character.

3. The music data processing apparatus according to claim 1, wherein the feature amount extraction means extracts the feature amount at the time of music registration or at the time of music reproduction.

4. The music data processing apparatus according to claim 3, wherein the extraction by the feature amount extraction means is performed in parallel with reproduction of the music or is performed in advance before reproduction of the music.

5. The music data processing apparatus according to claim 1 or claim 2, wherein the feature amount extraction means extracts the feature amount of the acoustic signal of the music from a region consisting of any combination of the whole of the music, a part of the music, and a plurality of parts of the music.

6. The music data processing apparatus according to claim 1, wherein the acoustic signal input means inputs a compressed acoustic signal of the music.

7. The music data processing apparatus according to claim 1 or claim 2, further comprising a data management means for managing the music, the feature amount, and the shape and/or motion of the character in association with one another.

8. The music data processing apparatus according to claim 1 or claim 2, wherein the character storage means stores at least one of a character shape and a character motion in association with the feature amount.