JP2002024232A

JP2002024232A - Device and method for music retrieval

Info

Publication number: JP2002024232A
Application number: JP2000208945A
Authority: JP
Inventors: Takuichi Nishimura; 拓一西村; Hiroki Hashiguchi; 博樹橋口; Ryuichi Oka; 隆一岡
Original assignee: REAL WORLD COMPUTING PARTNERSH; Real World Computing Partnership
Current assignee: REAL WORLD COMPUTING PARTNERSH; Real World Computing Partnership
Priority date: 2000-07-10
Filing date: 2000-07-10
Publication date: 2002-01-25
Anticipated expiration: 2020-07-10
Also published as: JP3764033B2

Abstract

PROBLEM TO BE SOLVED: To provide a device and a method for music retrieval which can simplify user's operation and retrieval processing regarding music. SOLUTION: In a music database structuring part 1010, variation patterns of absolute scales of music are registered and a relative variation pattern as part of music is inputted as a query to a music retrieval device. A pattern retrieval processing part 1020 retrieves a range in the variation pattern of an absolute scale having the relative variation pattern by a continuous DP method.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、クエリーの音声の
特徴と類似した特徴を有する楽曲を、音楽データベース
の中から検索する楽曲検索装置および方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a music search apparatus and a music search method for searching a music database for music having characteristics similar to those of a query voice.

【０００２】[0002]

【従来の技術】連続ＤＰ（たとえば、岡隆一、“連続Ｄ
Ｐを用いた連続音声認識”，音響学会音声研資料，Ｓ７
８−２０，ｐｐ，１４５−１５２（１９７８−０６））
は、ジェスチャ認識や音声認識に用いられてきた基本的
なパターンマッチ処理法の一つである。音声の要求に基
づいて楽曲を検索する技術として、特願平１１−１２０
１９８号に示される装置がある。この提案では、入力音
声の示す音階と楽曲の音階との差分をマッチングするこ
とにより楽曲を検索している。2. Description of the Related Art Continuous DP (for example, Ryuichi Oka, "Continuous D
Continuous Speech Recognition Using P ”, Acoustic Society of Japan, S7
8-20, pp, 145-152 (1978-06))
Is one of the basic pattern matching processing methods used for gesture recognition and voice recognition. Japanese Patent Application No. Hei 11-120 discloses a technique for searching for music based on voice requests.
No. 198. In this proposal, music is searched for by matching the difference between the scale indicated by the input voice and the scale of the music.

【０００３】また、“ハミング歌唱を手掛りとするメロ
ディイ検索”，蔭山，高島，信学論（Ｄ−II，ｖｏｌ．
Ｊ７７−Ｄ−II，ｎｏ．８，ｐｐ．１５４３−１５５
１，１９９４），“ＷＷＷ上での歌声による曲検索シス
テム”，園田，後藤，村岡，信学論（Ｄ−II，ｖｏｌ．
Ｊ８２−Ｄ−II，ｎｏ．４，ｐｐ．７２１−７３１，１
９９９）では、音高と音長に対して連続ＤＰの適用によ
る楽曲検索を行っている。[0003] Also, "Melody Search Using Clues to Humming Singing", Kageyama, Takashima, IEICE (D-II, vol.
J77-D-II, no. 8, pp. 1543-155
1, 1994), "Song search system by singing voice on WWW", Sonoda, Goto, Muraoka, IEICE (D-II, vol.
J82-D-II, no. 4, pp. 721-731,1
999), a music search is performed by applying the continuous DP to the pitch and the pitch.

【０００４】連続ＤＰを以下に説明しておく。[0004] The continuous DP will be described below.

【０００５】クエリーであるＴフレームの時系列データ
を、Ｎ次元特徴ベクトル（ｚ（τ，１），…，ｚ（τ，
Ｎ））の系列としてThe time-series data of a T frame as a query is converted into N-dimensional feature vectors (z (τ, 1),..., Z (τ,
N))

【０００６】[0006]

【数１】 (Equation 1)

【０００７】と表し、同様にデータベースに登録された
１つの楽曲の特徴ベクトル系列を[0007] Similarly, the feature vector sequence of one music piece registered in the database is

【０００８】[0008]

【数２】 (Equation 2)

【０００９】と表す。また、（ｚ（τ，１），…，ｚ
（τ，Ｎ））と（ｕ（ｔ，１），…，ｕ（ｔ，Ｎ））の
との局所距離をｄ（ｔ，τ）と表記する。局所距離の計
算式の一例を以下に示す。[0009] Also, (z (τ, 1),..., Z
The local distance between (τ, N)) and (u (t, 1),..., U (t, N)) is denoted as d (t, τ). An example of the calculation formula of the local distance is shown below.

【００１０】[0010]

【数３】 (Equation 3)

【００１１】ここで、データベース、クエリーの時間軸
をそれぞれｔ，τと区別している。さらに、点（ｔ，
τ）を終点としたクエリーとデータベース上の楽曲との
間の特徴ベクトルに関する累積距離をＳ（ｔ，τ）と表
す。連続ＤＰではＳ（ｔ，τ）を以下のような漸化式で
更新する。初期条件（ｔ＝０）；Here, the time axes of the database and the query are distinguished from each other by t and τ. Further, the point (t,
The cumulative distance of the feature vector between the query ending at τ) and the music on the database is represented as S (t, τ). In the continuous DP, S (t, τ) is updated by the following recurrence formula. Initial conditions (t = 0);

【００１２】[0012]

【数４】 (Equation 4)

【００１３】[0013]

【外１】 [Outside 1]

【００１４】[0014]

【数５】 (Equation 5)

【００１５】[0015]

【数６】 (Equation 6)

【００１６】[0016]

【数７】 (Equation 7)

【００１７】この漸化式では、図１に示す３個の局所パ
スのうちで累積距離が最小となる値が選択されている。
このため、クエリー全体との累積距離Ｓ（ｔ，Ｔ）は、
データベースの時間方向の伸縮が１／２〜２倍であると
したときの最小の累積距離となっている。連続ＤＰの出
力Ａ（ｔ）、すなわち、クエリーとデータベース上の１
つの楽曲の特徴ベクトルの累積距離は、重みの和３Ｔで
正規化してIn this recurrence formula, a value that minimizes the cumulative distance is selected from the three local paths shown in FIG.
Thus, the cumulative distance S (t, T) from the entire query is
This is the minimum cumulative distance when the expansion and contraction of the database in the time direction is 1/2 to 2 times. Output A (t) of continuous DP, ie, query and 1 on database
The cumulative distance between the feature vectors of two songs is normalized by the sum of weights 3T

【００１８】[0018]

【数８】 (Equation 8)

【００１９】と表される。このような計算をデータベー
ス上の全ての楽曲の特徴ベクトルについて行い、最小の
累積距離をまた、検索時にメモリに記憶しておくデータ
は、## EQU1 ## Such calculation is performed for the feature vectors of all songs on the database, and the minimum cumulative distance is also stored in the memory at the time of search.

【００２０】[0020]

【外２】 [Outside 2]

【００２１】についてＳ（ｔ，τ），Ｓ（ｔ，２，
τ），Ｓ（ｔ−２，τ），ｄ（ｔ，τ），ｄ（ｔ−１，
τ）だけでよいので、楽曲検索装置はデータベースのフ
レーム数に依存せず、わずか５Ｔのメモリ量で検索が可
能である。S (t, τ), S (t, 2,
τ), S (t−2, τ), d (t, τ), d (t−1,
τ), the music search device can search with only 5T of memory without depending on the number of frames in the database.

【００２２】[0022]

【発明が解決しようとする課題】従来の楽曲検索方法で
は、楽曲データベース登録される楽曲の特徴およびクエ
リーとして使用される楽曲の一部の特徴は共に音階の相
対変化、具体的には、たとえば、最初の音階から２音階
上がり次に１音階下がるというような相対変化としてい
た。このため、従来では楽曲の波形から直接的に得られ
るの絶対音階を示す楽曲データからさらに処理を行い、
相対音階を求める処理を別に用意しなければならならな
い。この付加的な処理は一般の楽曲について良好に行な
われる保証はない不具合があった。In the conventional music search method, both the characteristics of the music registered in the music database and a part of the characteristics of the music used as a query are relative changes in musical scale, specifically, for example, The relative change was such that the musical scale went up two musical scales from the initial musical scale and then went down one musical scale. For this reason, conventionally, further processing is performed from music data indicating an absolute scale directly obtained from the waveform of the music,
A process for finding the relative scale must be prepared separately. There is a problem that this additional processing is not guaranteed to be performed well for general music.

【００２３】そこで、本発明の目的は、楽曲に関するユ
ーザの操作および検索処理を簡素化することができる楽
曲検索装置および方法を提供することにある。SUMMARY OF THE INVENTION It is therefore an object of the present invention to provide a music search apparatus and a music search method that can simplify a user's operation and search processing for music.

【００２４】[0024]

【課題を解決するための手段】本発明は、楽曲データベ
ースの特徴は絶対音階の時間的変化とし、絶対音階の時
間的変化とクエリーの時間的な相対変化とを比較して、
楽曲の検索を行なう。このような着想に基づき、上記目
的を達成するために、請求項１の発明は、予め楽曲デー
タベースに登録された楽曲の第１の特徴と、クエリーと
して使用される楽曲の一部の第２の特徴とを比較し、第
２の特徴と最も類似する楽曲の範囲を検索する楽曲検索
装置において、前記第１の特徴を時間軸方向に沿った楽
曲の絶対音階群のパワー系列で表し、前記第２の特徴を
時間軸方向に沿った楽曲の相対音階で表し、前記第２の
特徴を入力する入力手段と、前記第１の特徴と前記入力
手段から入力された前記第２の特徴とを連続ＤＰの手法
により比較して第２の特徴に類似する楽曲の範囲を検索
する楽曲検索手段とを具えたことを特徴とする。According to the present invention, the feature of a music database is a temporal change of an absolute scale, and the temporal change of the absolute scale is compared with the temporal relative change of a query.
Search for songs. Based on such an idea, in order to achieve the above object, the invention of claim 1 provides a first feature of music registered in advance in a music database and a second characteristic of a part of music used as a query. In a music search device for comparing a feature with a second feature and searching for a range of a music most similar to the second feature, the first feature is represented by a power sequence of an absolute scale group of the music along a time axis direction, The second feature is represented by the relative scale of the music along the time axis direction, and the input means for inputting the second feature, and the first feature and the second feature input from the input means are continuously displayed. Music search means for searching for a range of music similar to the second feature by comparison using the DP method.

【００２５】請求項２の発明は、請求項１に記載の楽曲
検索装置において、前記入力手段は音声を入力し、入力
した音声を分析して前記第２の特徴を取得することを特
徴とする。According to a second aspect of the present invention, in the music search apparatus according to the first aspect, the input means inputs a voice and analyzes the input voice to acquire the second feature. .

【００２６】請求項３の発明は、請求項１に記載の楽曲
検索装置において、前記データベースには複数の楽曲に
関する第１の特徴が登録されており、前記複数の楽曲の
第１の特徴と前記第２の特徴を比較することを特徴とす
る。According to a third aspect of the present invention, in the music search apparatus according to the first aspect, a first feature relating to a plurality of songs is registered in the database, and the first feature of the plurality of songs and the first feature of the plurality of songs are registered. A feature is to compare the second feature.

【００２７】請求項４の発明は、請求項１に記載の楽曲
検索装置において、前記楽曲検索手段により範囲が検索
された楽曲を第１の特徴に基づき再生出力する再生出力
手段をさらに具えたことを特徴とする。According to a fourth aspect of the present invention, in the music search apparatus according to the first aspect, the music search apparatus further includes a reproduction output unit that reproduces and outputs the music whose range has been searched by the music search unit based on a first characteristic. It is characterized by.

【００２８】請求項５の発明は、請求項１に記載の楽曲
検索装置において、所定時間の間隔を有する２つの音階
の差分値を時間軸上の異なる時刻について複数組取得
し、当該取得した複数組の差分値を並べた系列で前記第
２の特徴を表すことを特徴とする。According to a fifth aspect of the present invention, in the music search apparatus according to the first aspect, a plurality of sets of difference values of two scales having a predetermined time interval are acquired at different times on a time axis, and the acquired plurality of sets are acquired. The second feature is represented by a series in which a set of difference values are arranged.

【００２９】請求項６の発明は、予め楽曲データベース
に登録された楽曲の第１の特徴と、クエリーとして使用
される楽曲の一部の第２の特徴とを比較し、第２の特徴
と最も類似する楽曲の範囲を検索する楽曲検索方法にお
いて、前記第１の特徴を時間軸方向に沿った楽曲の絶対
音階群のパワー系列で表し、前記第２の特徴を時間軸方
向に沿った楽曲の相対音階で表し、前記第２の特徴を楽
曲検索装置に入力する入力し、前記楽曲検索装置では、
前記第１の特徴と前記入力手段から入力された前記第２
の特徴とを連続ＤＰの手法により比較して第２の特徴に
類似する楽曲の範囲を検索することを特徴とする。According to a sixth aspect of the present invention, a first feature of a song registered in the song database in advance is compared with a second feature of a portion of the song used as a query, and the second feature is In a music search method for searching a range of similar music, the first feature is represented by a power sequence of an absolute scale group of the music along the time axis direction, and the second feature is expressed by a music sequence along the time axis direction. The second feature is represented by a relative scale, and the second feature is input to a music search device.
The first feature and the second input from the input means;
And searching for a range of music similar to the second feature by comparing the feature with the second feature.

【００３０】請求項７の発明は、請求項６に記載の楽曲
検索方法において、前記楽曲検索装置に音声を入力し、
当該入力した音声を前記楽曲検索装置により分析して前
記第２の特徴を取得することを特徴とする。According to a seventh aspect of the present invention, in the music search method according to the sixth aspect, a voice is input to the music search device,
The input voice is analyzed by the music search device to acquire the second feature.

【００３１】請求項８の発明は、請求項６に記載の楽曲
検索方法において、前記データベースには複数の楽曲に
関する第１の特徴が登録されており、前記楽曲検索装置
は前記複数の楽曲の第１の特徴と前記第２の特徴を比較
することを特徴とする。According to an eighth aspect of the present invention, in the music search method according to the sixth aspect, a first feature relating to a plurality of music pieces is registered in the database, and the music search device stores the first feature of the plurality of music pieces. The first feature is compared with the second feature.

【００３２】請求項９の発明は、請求項６に記載の楽曲
検索方法において、前記楽曲検索装置は範囲が検索され
た楽曲を第１の特徴に基づき再生出力することを特徴と
する。According to a ninth aspect of the present invention, in the music search method according to the sixth aspect, the music search device reproduces and outputs the music whose range has been searched based on the first characteristic.

【００３３】請求項１０の発明は、請求項６に記載の楽
曲検索方法において、所定時間の間隔を有する２つの音
階の差分値を時間軸上の異なる時刻について複数組取得
し、当該取得した複数組の差分値を並べた系列で前記第
２の特徴を表すことを特徴とするAccording to a tenth aspect of the present invention, in the music search method according to the sixth aspect, a plurality of sets of difference values between two scales having a predetermined time interval are acquired at different times on the time axis. The second feature is represented by a series in which a set of difference values are arranged.

【００３４】[0034]

【発明の実施形態】以下、図面を参照して、本発明の実
施形態を詳細に説明する。Embodiments of the present invention will be described below in detail with reference to the drawings.

【００３５】最初に本発明を適用した楽曲検索法の原理
について説明する。First, the principle of the music search method to which the present invention is applied will be described.

【００３６】クエリーおよびデータベースに登録され、
クエリーの比較（距離計算）の対象とされる楽曲の特徴
パターンが、従来では主旋律の変化パターン、すなわ
ち、スカラー量の時系列（相対的な音階の変化）で表す
のに対し、本実施形態ではデータベースに登録される特
徴パターンが、横軸が時間軸、縦軸が音階軸により定ま
るパワースペクトルの周波数（絶対音階）軸を有する時
空間パターンを使用する点が新規特徴である。換言すれ
ば、本実施形態では時空間パターン上にいくつも存在し
ている絶対音階の変化パターンの中から、クエリー（検
索の対象の楽曲の一部）の音階変化系列に一致する部分
を見出すことに新規特徴がある。このような処理を実行
するために本実施形態でも上述の連続ＤＰの手法を使用
する。ただし、従来では２次元的な累積距離の計算を行
なうのに対し、本実施形態では３次元に拡張した累積距
離の計算を行なう。この３次元の連続ＤＰを最初に説明
する。Registered in the query and database,
In the present embodiment, the feature pattern of a musical piece targeted for query comparison (distance calculation) is conventionally represented by a change pattern of a main melody, that is, a time series of scalar amounts (changes in relative scales). A new feature is that a feature pattern registered in the database uses a spatiotemporal pattern having a frequency (absolute scale) axis of a power spectrum whose horizontal axis is defined by a time axis and whose vertical axis is defined by a scale axis. In other words, in the present embodiment, a part that matches the musical scale change series of the query (part of the music to be searched) is found from the absolute musical scale change patterns that exist on the spatiotemporal pattern. Has a new feature. In order to execute such a process, this embodiment also uses the above-described continuous DP method. However, while the conventional two-dimensional cumulative distance is calculated, the present embodiment calculates the three-dimensional cumulative distance. This three-dimensional continuous DP will be described first.

【００３７】音階の総数をＮとし、クエリーである音声
区間長をＴ、フレームの音階差分データを｛ｑ（τ）｜
τ＝１，…，Ｔ｝とする。ここで、音階差分データは、
入力音声の主たる音階が抽出された後、その時間差分に
よって定まるものである。時間差分とは、所定時間の間
隔を有する２つの音階の差分値であり、該差分値を時間
軸上の異なる時刻についての差分値を複数組取得し、当
該取得した複数組の差分値を並べた系列で楽曲の音階の
変化量を表す。Assuming that the total number of scales is N, the voice section length as a query is T, and the scale difference data of the frame is ｛q (τ) |
Let τ = 1,..., T｝. Here, the scale difference data is
After the main scale of the input voice is extracted, it is determined by the time difference. The time difference is a difference value between two scales having an interval of a predetermined time. A plurality of sets of difference values at different times on the time axis are obtained from the difference values, and the obtained plurality of sets of difference values are arranged. Represents the amount of change in the scale of the musical composition.

【００３８】したがって、Therefore,

【００３９】[0039]

【外３】 [Outside 3]

【００４０】としてよい。一方、予め登録される複数組
の楽曲の特徴ベクトルの集合（以下、単にデータベース
と称することもある）についての分析特徴である、ベク
トルの時系列パターンをMay be used. On the other hand, a time-series pattern of a vector, which is an analysis feature of a set of feature vectors of a plurality of music pieces registered in advance (hereinafter sometimes simply referred to as a database), is used.

【００４１】[0041]

【数９】 (Equation 9)

【００４２】とする。ここで、データベース、クエリー
の時間軸をそれぞれｔ，τと区別する。It is assumed that Here, the time axes of the database and the query are distinguished from t and τ, respectively.

【００４３】参照の区間時系列データを区間の始終端に
依存しない対象の時系列データ中に、最も類似した区間
を見出すために本実施形態では連続ＤＰを使用する。た
だし、ｂ（ｔ，ｘ）はｔフレーム、音階ｘのパワーを表
す。参照データ（クエリーの特徴）をスカラー変化量の
時系列パターンとして捉え、検索データ（楽曲の特徴）
を時空間パターンとするように連続ＤＰを拡張する。こ
のような連続ＤＰの手法を本願明細書では、Ｍｏｄｅｌ
ｄｒｉｖｅｎｐａｔｈ連続ＤＰ（ｍｐ−連続ＤＰ）
と呼ぶことにする。In the present embodiment, a continuous DP is used to find the most similar section in the target section time-series data that does not depend on the start and end of the section. Here, b (t, x) represents the power of t scales and scale x. Reference data (characteristics of query) is captured as a time-series pattern of scalar variation, and search data (characteristics of music)
Is extended so that is a spatio-temporal pattern. In this specification, such a continuous DP method is referred to as Model
driven path continuous DP (mp-continuous DP)
I will call it.

【００４４】ｍｐ−連続ＤＰにおいて、累積すべき局所
距離を定めるが、ここでは局所累積距離を、データベー
スの時刻ｔ、および音階ｘに対してｄ（ｘ，ｔ）と表記
する。局所距離の計算式の一例を以下に示す。In the mp-continuous DP, a local distance to be accumulated is determined. Here, the local accumulated distance is expressed as d (x, t) with respect to time t and scale x in the database. An example of the calculation formula of the local distance is shown below.

【００４５】局所距離ｄ（ｘ，ｔ）には、時刻ｔでのｘ
に対する最大パワーＭ（ｔ）＝ｍａｘｂ（ｔ，ｘ）で
正規化される量The local distance d (x, t) includes x at time t
Power M (t) = max b (t, x)

【００４６】[0046]

【数１０】 (Equation 10)

【００４７】などを使用することができる。Can be used.

【００４８】さらに、点（ｘ，τ，ｔ）を終点としたク
エリーとデータベースとの累積距離をＳ（ｘ，τ，ｔ）
で表す。ｍｐ−連続ＤＰではＳ（ｘ，τ，ｔ）を以下の
ような漸化式で更新する。Further, the cumulative distance between the query ending at the point (x, τ, t) and the database is represented by S (x, τ, t).
Expressed by In the mp-continuous DP, S (x, τ, t) is updated by the following recurrence formula.

【００４９】[0049]

【数１１】Ｓ(x,1,t) ＝ 3d(x,t) (11)S (x, 1, t) = 3d (x, t) (11)

【００５０】[0050]

【数１２】 (Equation 12)

【００５１】境界条件は以下で与えられる。The boundary conditions are given below.

【００５２】[0052]

【数１３】 (Equation 13)

【００５３】この漸化式では、連続ＤＰ同様、クエリー
全体との累積距離Ｓ（ｘ，τ，ｔ）は、データベースの
時間方向の伸縮が１／２〜２倍であるとしたときの最小
の累積距離となっている。In this recurrence formula, like the continuous DP, the cumulative distance S (x, τ, t) to the entire query is the minimum when the expansion and contraction of the database in the time direction is 1/2 to 2 times. It is a cumulative distance.

【００５４】ｍｐ−連続ＤＰの出力Ａ（ｔ，ｘ）は、重
みの和３Ｔで正規化してThe output A (t, x) of the mp-continuous DP is normalized by the weight sum 3T.

【００５５】[0055]

【数１４】 [Equation 14]

【００５６】と定める。Is determined.

【００５７】このとき、検出結果はデータベースの時刻
t^*と音階ｘ^*であり、以下の式で求められる。At this time, the detection result is the time of the database.
t ^* and scale x ^*, which are obtained by the following equations.

【００５８】[0058]

【数１５】 (Equation 15)

【００５９】ここで、ａｒｇは、Ｓ（ｘ，τ，ｔ）の引
数（ｔ，ｘ）を返す関数である。Here, arg is a function that returns the argument (t, x) of S (x, τ, t).

【００６０】このような距離計算を１つの登録の楽曲に
ついて行なうと、距離計算結果の最小値を持つ部分が、
１つの楽曲の中のクエリーと最も類似する部分となる。
また、全ての楽曲について同様の距離計算を行ない、そ
れぞれ得られる最小値の中の最もも小さい値を持つ楽曲
部分が最終的な計算結果となる。When such a distance calculation is performed for one registered music piece, the portion having the minimum value of the distance calculation result is
This is the part that is most similar to the query in one song.
The same distance calculation is performed for all songs, and the song portion having the smallest value among the obtained minimum values is the final calculation result.

【００６１】このような楽曲検索方法を適用して楽曲検
索を行なうための楽曲検索装置の機能構成を図１に示
す。FIG. 1 shows a functional configuration of a music search apparatus for performing a music search by applying such a music search method.

【００６２】図１において、１０００はクエリーとして
使用する楽曲の一部分から特徴パターンを取得するクエ
リー取得部である。クエリー取得部では、マイクなどの
音響装置から得られる音声信号に対して、ＦＦＴなどの
通常のスペクトル分析を行い、基本周波数、それに対応
する音階を得た後、クエリーとなる音階差分の系列を生
成する。音階差分の系列は所定時間の間隔を有する２つ
の音階の差分値を異なる時刻について複数組取得したも
のである。In FIG. 1, reference numeral 1000 denotes a query acquisition unit for acquiring a characteristic pattern from a part of a music piece used as a query. The query acquisition unit performs a normal spectrum analysis such as FFT on an audio signal obtained from an audio device such as a microphone to obtain a fundamental frequency and a corresponding scale, and then generates a sequence of scale differences serving as a query. I do. The series of scale differences is obtained by acquiring a plurality of sets of difference values of two scales having a predetermined time interval at different times.

【００６３】１０１０は複数組の楽曲の各々の特徴パタ
ーンを記憶し、データベースの形態で記憶した楽曲デー
タベース構築部である。データベース構築部では、コン
パクトディスク（ＣＤ）などの楽曲から、スペクトル分
析を行い、各音階に対するパワースペクトルを得た後、
それらを時空間パターンとしてハードディスクなどの記
録媒体に保存しておく。もちろん、楽曲の入力を繰り返
すことでデータベースを拡大することができる。Reference numeral 1010 denotes a music database construction unit which stores feature patterns of a plurality of sets of music and stores them in the form of a database. The database construction unit performs spectrum analysis from music such as compact discs (CDs) and obtains power spectra for each scale.
These are stored as a spatiotemporal pattern on a recording medium such as a hard disk. Of course, the database can be expanded by repeating the input of music.

【００６４】１０２０は楽曲データベース上の特徴パタ
ーンとクエリーの特徴パターンとの間との間の距離計算
を行なってクエリーに最も類似する楽曲部分を検索する
パターン検索処理部である。距離計算には上述のｍｐ−
連続ＤＰが使用される。Reference numeral 1020 denotes a pattern search processing unit for calculating a distance between a feature pattern on a music database and a feature pattern of a query to search for a music part most similar to the query. For the distance calculation, the mp-
Continuous DP is used.

【００６５】クエリー入力である音声（歌、ハミング）
の特徴パターンは音階差分データとして表され、楽曲デ
ータベース構築部１０１０に登録された楽曲の特徴パタ
ーン、すなわち、時空間パターンで表される。Voice (song, humming) as query input
Is represented as scale difference data, and is represented by a feature pattern of the music registered in the music database construction unit 1010, that is, a spatiotemporal pattern.

【００６６】１０３０はパターン検索処理部１０２０の
検索結果から、クエリーに最も類似する部分を有する楽
曲を楽曲データベース構築部１０１０から検出し、検出
した楽曲の先頭部分から演奏を行なう楽曲演奏部であ
る。Reference numeral 1030 denotes a music performance section for detecting a music piece having a part most similar to the query from the search result of the pattern search processing section 1020 from the music database construction section 1010 and playing the music from the head of the detected music piece.

【００６７】パターン検索処理部１０２０は、音階差分
の系列と時空間パターンを基に、上述のｍｐ−連続ＤＰ
の計算手順にしたがって、累積距離Ｓ（ｘ，τ，ｔ）、
Ｓ（ｘ，Ｔ，ｔ）を計算し、時空間パターンの中から音
階差分の系列と最も類似した部分の終端（ｔ^*，ｘ^*）を
見つけ出す処理を行う。このとき、出力時刻ｔ^*からク
エリーの長さＴだけ時間を遡れば、入力音声に適合した
楽曲部分の頭出しが可能となる。この頭だし処理（楽曲
およびその先頭部分の検出）を検索楽曲演奏部１０３０
で行い、演奏（出力する）する。The pattern search processing unit 1020 performs the above-described mp-continuous DP
According to the calculation procedure of, the cumulative distance S (x, τ, t),
S (x, T, t) is calculated, and a process of finding the end (t ^* , x ^* ) of the part most similar to the sequence of scale differences from the spatiotemporal pattern is performed. At this time, if the time is traced back from the output time t ^* by the length of the query T, it is possible to find the beginning of the music portion suitable for the input voice. This start-up processing (detection of music and its head) is performed by a search music performance unit 1030
And perform (output).

【００６８】このような機能構成を実際に実現するため
の楽曲検索装置の一例を図２を参照して次に説明する。
図２は楽曲検索装置のシステム構成を示す。楽曲検索装
置として市販のパーソナルコンピュータやワークステー
ションを使用できるので、説明は簡単に留める。An example of a music search apparatus for actually realizing such a functional configuration will be described below with reference to FIG.
FIG. 2 shows a system configuration of the music search device. Since a commercially available personal computer or workstation can be used as the music search device, the description will be briefly omitted.

【００６９】図２において、ＣＰＵ１０はシステムメモ
リ５０にローディングされた楽曲検索プログラム（後
述）を実行して、本発明に係る楽曲検索処理を実行す
る。ハードディスク（ＨＤ）２０は上記楽曲検索プログ
ラムおよびパターン検索処理で使用するパターンデータ
ベース、本実施形態では検索の対象となる複数組の楽曲
の時空間パターンを蓄積（記憶）したデータベースを保
存している。In FIG. 2, the CPU 10 executes a music search program (described later) loaded into the system memory 50 to execute a music search process according to the present invention. The hard disk (HD) 20 stores a pattern database used in the music search program and the pattern search process, and in this embodiment, a database storing (storing) spatiotemporal patterns of a plurality of sets of music to be searched.

【００７０】Ｉ／Ｏ８０は、検索結果の始端時刻に一致
するデータベース上の特徴パターンのアドレス位置を検
出し、その位置を開始位置として１つの楽曲全体の特徴
パターンをハードディスク２０のデータベースから読み
出してスピーカ３０に出力する。スピーカ３０は出力さ
れた特徴パターンを音響信号に変換して再生出力する。The I / O 80 detects the address position of the feature pattern on the database that coincides with the start time of the search result, reads the feature pattern of one entire music from the database on the hard disk 20 with the position as the start position, and reads out the speaker. Output to 30. The speaker 30 converts the output characteristic pattern into an audio signal and reproduces and outputs the audio signal.

【００７１】入力インターフェース（Ｉ／Ｏ）４０はマ
イク７０から、検索対象（クエリー）の音声を入力す
る。システムメモリ５０はＣＰＵ１０が実行する楽曲検
索プログラム、ＣＰＵ１０に対する入出力データを一時
記憶する。入力装置６０はマウスなどのポインティング
デバイス（以下、マスク）を有しＣＰＵ１０に対する動
作指示や情報入力を行う。The input interface (I / O) 40 inputs the voice of the search target (query) from the microphone 70. The system memory 50 temporarily stores a music search program executed by the CPU 10 and input / output data for the CPU 10. The input device 60 has a pointing device (hereinafter, a mask) such as a mouse and performs an operation instruction and information input to the CPU 10.

【００７２】図３および図４は上記楽曲検索プログラム
の内容を示す。図３はメイン処理手順を示し、図４は図
３内の局所距離および累積距離の計算処理の詳細を示
す。図３，図４を参照して楽曲検索処理を説明する。FIGS. 3 and 4 show the contents of the music search program. FIG. 3 shows the main processing procedure, and FIG. 4 shows details of the calculation processing of the local distance and the accumulated distance in FIG. The music search processing will be described with reference to FIGS.

【００７３】入力装置６０からの指示に応じてＣＰＵ１
０はＨＤ２０から図３の検索プログラムをシステムメモ
リ５０はローディングし、実行を開始する。In response to an instruction from the input device 60, the CPU 1
At 0, the system memory 50 loads the search program of FIG.

【００７４】図３において、ユーザは、入力装置６０か
らクエリー、この場合、マイク７０から検索対象となる
クエリーの音声の入力を指示する。この指示に応じて、
ＣＰＵ１０はマイクから入出力インターフェース４０を
介して音声を入力し、図１のクエリー取得部１０００と
しての処理を行って音声信号から取得した特徴パターン
をシステムメモリ５０に書き込む。In FIG. 3, the user instructs to input a query from the input device 60, in this case, to input a voice of a query to be searched from the microphone 70. In response to this instruction,
The CPU 10 inputs a voice from a microphone via the input / output interface 40, performs processing as the query obtaining unit 1000 in FIG. 1, and writes the characteristic pattern obtained from the voice signal into the system memory 50.

【００７５】また、ＣＰＵ１０は楽曲検索処理で使用す
るデータの初期設定を行う（ステップＳ１０）。これに
より（１１）式で説明した初期条件、（１３）式で説明
した境界条件が設定される。次にＣＰＵ１０はハードデ
ィスク２０上の（楽曲）データベースを検索すべく、デ
ータベース内の最初のフレーム番号としてｔ＝０を初期
設定し、最適パスの始端位置（ｔ₀，ｘ₀）＝（０，０）
を設定する。Further, the CPU 10 performs initial setting of data used in the music search processing (step S10). As a result, the initial condition described in the equation (11) and the boundary condition described in the equation (13) are set. Next, the CPU 10 initializes t = 0 as the first frame number in the database in order to search the (song) database on the hard disk 20, and the starting position (t ₀ , x ₀ ) of the optimal path is ( ₀ , ₀ ). )
Set.

【００７６】次に、ＣＰＵ１０は、クエリーとして入力
した複数枚（τ）の音階差分とデータベースの時空間パ
ターンの局所距離および累積距離を（１２）式に従って
計算する（ステップＳ３０）。局所距離計算および累積
距離計算については後で図４により詳細に説明する。Next, the CPU 10 calculates the local distances and the cumulative distances of the plurality of musical scales (τ) inputted as the query and the spatiotemporal pattern of the database according to the equation (12) (step S30). The local distance calculation and the cumulative distance calculation will be described later in detail with reference to FIG.

【００７７】さらに、ＣＰＵ１０は計算された累積距離
について、所定期間内の累積距離の極大値検出処理を行
い、その極大値を持つ最適パスの始端位置および終端位
置を検索結果、換言すれば、クエリーに類似する楽曲の
範囲（１つの楽曲の中の類似する一部分）を検索結果と
してシステムメモリ５０に出力する（記憶する）（ステ
ップＳ４０）。Further, the CPU 10 performs a process of detecting the maximum value of the cumulative distance within a predetermined period for the calculated cumulative distance, and searches for the start position and the end position of the optimum path having the maximum value, in other words, the query Is output (stored) to the system memory 50 as a search result for a range of music similar to (a similar part in one music) (step S40).

【００７８】この後、ステップＳ５０で変数ｔがデータ
ベースの最終フレーム番号に到達するまで、変数ｔの値
をインクリメント（１づつ更新すること）して（ステッ
プＳ６０）、局所距離計算および累積距離計算等を繰り
返すと、これにより１つの楽曲の中のクエリーに最も類
似する範囲が検出される。これをデータベース上の複数
の楽曲（特徴パターンの形態）について上記処理を実行
し、累積距離が最も小さいものが検索結果となる。最終
的に、データベース上の楽曲の中で最も類似する個所が
検出され、その個所を含む楽曲の始端部も検出される。
このときのＣＰＵ１１が図１のパターン検索処理部１０
２０および楽曲演奏部１０３０の楽曲先端部検出部とし
て機能する。Thereafter, the value of the variable t is incremented (updated one by one) until the variable t reaches the last frame number of the database in step S50 (step S60), and the local distance calculation and the cumulative distance calculation are performed. Is repeated, thereby detecting the range most similar to the query in one song. The above process is executed for a plurality of music pieces (feature pattern forms) on the database, and the one with the smallest cumulative distance is the search result. Eventually, the most similar location in the music on the database is detected, and the beginning of the music including that location is also detected.
At this time, the CPU 11 executes the pattern search processing unit 10 shown in FIG.
20 and functions as a music tip detection unit of the music performance unit 1030.

【００７９】図３のステップＳ３０の局所距離計算およ
び累積距離計算の詳細を図４に示す。図４において、デ
ータベース上のｔの時空間パターンとクエリーの１番目
の音階差分との間の局所距離を計算すべくクエリー側の
フレーム番号τが０に初期設定（ステップＳ１００）さ
れ、さらに１にインクリメントされる（Ｓ１１０）。FIG. 4 shows details of the local distance calculation and the cumulative distance calculation in step S30 in FIG. In FIG. 4, the frame number τ on the query side is initially set to 0 (step S100) to calculate the local distance between the spatiotemporal pattern of t on the database and the first scale difference of the query, and further to 1 It is incremented (S110).

【００８０】次にｔ，τ，ｘに対して、局所距離ｄ
（ｘ，ｔ）およびＳ（ｘ，τ，ｔ）の計算が上述の数式
により行われる。音階ｘは総音階数Ｎに達する（Ｓ１７
０）まで、インクリメントされる（Ｓ１２０）が、到達
後は、τのインクリメント（Ｓ１１０）とともに再度、
音階番号１（ｘ＝１）に初期化（Ｓ１１５に続きＳ１２
０）される。引続き、τのインクリメントがデータベー
スの末端Ｗに達するまで続けられるが、このｘ，τのイ
ンクリメントの最中は、常に、現時点での累積距離Ｓ
（ｘ，τ，ｔ）と、それまでの累積距離の最大値ＳＭＡ
Ｘの比較が行われることによって、順次ＳＭＡＸも単調
非減少に更新されていく。Next, for t, τ, and x, the local distance d
The calculation of (x, t) and S (x, τ, t) is performed by the above-described equations. The scale x reaches the total number of scales N (S17)
0), but is incremented (S120), but after reaching, together with the increment of τ (S110),
Initialized to scale number 1 (x = 1) (S12 following S115)
0). Subsequently, the increment of τ is continued until the end W of the database is reached. During the increment of x, τ, the accumulated distance S at the current time is always determined.
(X, τ, t) and the maximum value SMA of the cumulative distance up to that point
As the comparison of X is performed, the SMAX is also sequentially updated to be monotonically non-decreasing.

【００８１】累積距離計算のすべての手順は、τ＝Ｔ、
ｘ＝Ｎになったとき、そのときのＳＭＡＸの値と音階ｘ
を返り値として出力し、終了する。図４の処理が終了す
ると、図３のステップＳ４０に戻る。ここで、τの値が
クエリーの最後尾の値Ｔに、ｘの値が総音階数Ｎに到達
した時が、現在のｔにおける最適パスの終端位置となっ
ている。このｔの値の累積距離に帯同されている最適パ
スの始端位置および累積距離の値からなるデータセット
（３つのデータを１組とするデータ）がシステムメモリ
５０に一時記憶する。All procedures for calculating the cumulative distance are as follows: τ = T,
When x = N, the value of SMAX and scale x at that time
Is output as a return value, and the processing ends. When the process in FIG. 4 ends, the process returns to step S40 in FIG. Here, when the value of τ reaches the value T at the end of the query and the value of x reaches the total number of scales N, it is the end position of the optimal path at the current t. A data set (a set of three data) consisting of the start position of the optimal path and the value of the cumulative distance, which is associated with the cumulative distance of the value of t, is temporarily stored in the system memory 50.

【００８２】本実施形態では、累積計算Ｓ（ｘ，τ，
ｔ）を計算する際、複数のデータセット全てをシステム
メモリ５０に記憶すると多量のメモリ領域が必要となる
ので、ｔ番目（今回）のデータセットとｔ−１番目のデ
ータ（前回）だけを記憶するメモリ領域だけを用意し、
ｔの値が更新されるごとに今回得られたデータセットを
前回の記憶領域に移す。このため、これまで記憶されて
いたデータセットは新たなデータセットにより上書き記
憶により消去される。In this embodiment, the cumulative calculation S (x, τ,
When calculating t), storing all of the plurality of data sets in the system memory 50 requires a large amount of memory area. Therefore, only the t-th (current) data set and the (t-1) -th data (previous) are stored. Prepare only the memory area to
Each time the value of t is updated, the data set obtained this time is moved to the previous storage area. For this reason, the data set stored so far is erased by overwriting with a new data set.

【００８３】図３のステップＳ４０の極大値検出処理で
は、上述の累積距離計算により、τの値がクエリーの最
後尾の位置まで到達すると、最適パスの始端位置、終端
位置および累積距離のデータセットが得られる。図３の
繰り返し計算でｔの値がインクリメントされ、これまで
に得られた仮の極大値と、今回得られた累積距離の値の
大小関係が判定され、今回得られた累積距離が仮の極大
値よりも大きい場合には、仮の極大値が今回の累積値で
更新される。In the local maximum value detection processing of step S40 in FIG. 3, when the value of τ reaches the end position of the query by the above-described cumulative distance calculation, the data set of the start position, end position, and cumulative distance of the optimal path Is obtained. The value of t is incremented by the iterative calculation of FIG. 3, and the magnitude relationship between the temporary maximum value obtained so far and the value of the accumulated distance obtained this time is determined. If the value is larger than the value, the temporary maximum value is updated with the current accumulated value.

【００８４】[0084]

【実施例】具体的な計算事例を図５を参照して説明す
る。人が歌った音律から図ａのクエリー取得処理後の、
クエリー列を（ｑ（１），ｑ（２），ｑ（３），ｑ
（４））＝（２，１，−１，０）とする。例えばレ（＋
２）ミ（＋１）ファ（−１）ミ（＋０）ミの音律で人が
歌った場合などがこれに当る。ＣＤなどの楽曲から、図
１のデータベース構築部１０１０を通じて得られた時系
列パターンに対する局所距離を図５で５×１０の行列と
して表現している。行列の行インデックスは音階（ここ
では、ｘ₁，…，ｘ₅の５音階）を表し、列インデックス
ｔ₁，…，ｔ₁₀はデータベース内の時刻あるいはフレー
ム番号を表している。さらに、各要素は局所距離に対応
し、例えば、ｘ₃行ｔ₆列目の要素１は、ｄ（ｘ₃，ｔ₆）
＝１を表している。DESCRIPTION OF THE PREFERRED EMBODIMENTS A specific calculation example will be described with reference to FIG. After the query acquisition processing of Fig.
The query sequence is expressed as (q (1), q (2), q (3), q
(4)) = (2, 1, −1, 0). For example,
2) This corresponds to a case where a person sings according to the temperament of mi (+1) fa (-1) mi (+0) mi. The local distance with respect to the time-series pattern obtained from the music such as a CD through the database construction unit 1010 in FIG. 1 is expressed as a 5 × 10 matrix in FIG. Row index scale matrix (here, x _1, ..., x 5 scale of ₅₎ represents a column index t _1, ..., t ₁₀ represents a time or frame number in the database. Further, each element corresponds to a local distance. For example, element 1 at x ₃ row and t ₆ column is d (x ₃ , t ₆ )
= 1.

【００８５】累積距離計算（１１）式、（１２）式によ
れば、ｔ＝ｔ₁のとき、Calculation of Cumulative Distance According to equations (11) and (12), when t = t ₁ ,

【００８６】[0086]

【数１６】Ｓ(x₁ ,1 ,t₁ )＝Ｓ(x₂ ,1 ,t₁ )＝…＝Ｓ(x₁₀ ,1 ,t₁ )＝３×0.５ (16) である。（１５）式の値を得るために上の累積距離の最
大値をとる必要があるが、この場合全て同じ値（１．
５）なので、音階の最初値ｘ₁をとる。楽曲検索では、
時間ｔが重要なので、音階の最初値をとっても問題はな
い。また、ｍｐ−連続ＤＰの出力（１４）式の値は、以
下で与えられる。S (x ₁ , 1, t ₁ ) = S (x ₂ , 1, t ₁ ) =... = S (x ₁₀ , 1, t ₁ ) = 3 × 0.5 (16) In order to obtain the value of the expression (15), it is necessary to take the maximum value of the above cumulative distance.
5) So, take the first value x ₁ of the scale. In music search,
Since time t is important, there is no problem in taking the first value of the scale. The value of the output (14) of the mp-continuous DP is given by:

【００８７】[0087]

【数１７】Ａ(x₁ ,1 ,t₁ )＝Ａ(x₂ ,1 ,t₁ )＝…＝Ａ(x₁₀ ,1 ,t₁ )＝1.5/(3T)＝0.125 (1 7) ｔ₁，…，ｔ₁₀においても累積距離計算（１１）式、
（１２）式に基づいて計算し、ｍｐ−連続ＤＰの出力を
求めると、ｔ₇のとき、累積距離の最大値は、Ｓ（ｘ₃，
４，ｔ₇）＝１２をとり、ｔ₇が所望の楽曲の末端とな
る。したがって、始端は４時点戻ったｔ₃であり、図５
の矢印で記されているときの階差は、クエリーと一致し
ていることが分かる。したがって、これらの矢印で決ま
る道順が、所望の楽曲の始端から末端までとなってい
る。A (x ₁ , 1, t ₁ ) = A (x ₂ , 1, t ₁ ) =... = A (x ₁₀ , 1, t ₁ ) = 1.5 / (3T) = 0.125 (17) Also at t ₁ ,..., t ₁₀ , cumulative distance calculation formula (11),
(12) calculated based on formula, when obtaining the output of mp- continuous DP, when t _7, the maximum value of the cumulative distance is, S (x _3,
4, taking the t _₇₎ = 12, t ₇ is the end of the desired music. Therefore, the starting point is t ₃ , which is four times back, and FIG.
It can be seen that the difference when marked with the arrow is consistent with the query. Therefore, the route determined by these arrows is from the beginning to the end of the desired music.

【００８８】上述の実施形態の他に次の形態を実施でき
る。１）上述の実施形態では、楽曲の特徴パターンをデータ
ベースの形態で登録しているが、楽曲の音響信号を登録
して、検索ごとに音響信号から楽曲の特徴パターンを抽
出してもよい。２）クエリーとして使用する楽曲の特徴パターンそのも
のを楽曲検索装置に入力してもよい。３）上述の実施形態では音声を含む楽曲の演奏をマイク
で入力で入力し、フーリエ解析し、その中の音声特徴を
示すパワースペクトラム成分を取り出し、その成分を時
間的にならべることで、楽曲データベースに登録する楽
曲の絶対的音階（本発明の第１の特徴）を作成する。こ
れにより演奏などの雑音成分が取り除かれるという効果
をも奏する。雑音のない絶対音階を得られる場合には、
絶対音階を他の形態で表してもよい。The following embodiment can be carried out in addition to the above embodiment. 1) In the above embodiment, the characteristic pattern of the music is registered in the form of a database. However, the acoustic signal of the music may be registered, and the characteristic pattern of the music may be extracted from the acoustic signal for each search. 2) The feature pattern itself of the music used as a query may be input to the music search device. 3) In the above-described embodiment, the performance of a music piece including voice is input by a microphone, Fourier analysis is performed, and a power spectrum component indicating a voice feature is taken out of the input, and the components are arranged in time, thereby obtaining a music database. An absolute scale (a first feature of the present invention) of a musical composition to be registered is created. This also has the effect of removing noise components such as performances. If you can get absolute scale without noise,
The absolute scale may be represented in other forms.

【００８９】[0089]

【発明の効果】以上、説明したように、本発明では、人
が歌った音声などの楽曲の絶対音階の時間変化を特徴と
しているので、従来のように別途楽曲の絶対音階のデー
タから相対音階の特徴を取り出してデータベースに登録
する必要がない。このため、楽曲データベースへの登録
が簡素化され、検索も絶対音階と相対音階の双方を取り
扱わなくてもよいので、簡素化される。As described above, the present invention is characterized by the temporal change of the absolute scale of a song such as a voice sung by a person. There is no need to take out the features and register them in the database. For this reason, the registration in the music database is simplified, and the search does not have to deal with both the absolute scale and the relative scale.

【００９０】さらには、クエリーとして与える楽曲の一
部分が１つ楽曲のどの範囲にあるかを検索したり、楽曲
の一部分から楽曲を楽曲データベースの中から探したり
多様な用途に本発明を適用することができる。Further, the present invention can be applied to various applications, such as searching for a range of a music piece to be given as a query within a single music piece, searching for a music piece from a music piece part in a music database, and so on. Can be.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明実施形態の機能構成を示すブロック図で
ある。FIG. 1 is a block diagram illustrating a functional configuration of an embodiment of the present invention.

【図２】本発明実施形態のシステム構成を示すブロック
図である。FIG. 2 is a block diagram illustrating a system configuration according to an embodiment of the present invention.

【図３】本発明実施形態の検出処理手順を示すフローチ
ャートである。FIG. 3 is a flowchart illustrating a detection processing procedure according to the embodiment of the present invention.

【図４】本発明実施形態の局所距離および累積距離の計
算処理の詳細を示すフローチャートである。FIG. 4 is a flowchart illustrating details of a calculation process of a local distance and a cumulative distance according to the embodiment of the present invention.

【図５】本発明実施例を説明するための説明図である。FIG. 5 is an explanatory diagram for explaining an embodiment of the present invention.

[Explanation of symbols]

１０ＣＰＵ２０ＨＤ３０スピーカ４０Ｉ／Ｏ５０システムメモリ６０入力装置７０マイク８０Ｉ／Ｏ 10 CPU 20 HD 30 Speaker 40 I / O 50 System memory 60 Input device 70 Microphone 80 I / O

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｇ１０Ｌ 15/12 Ｇ１０Ｌ 3/00 ５３１Ｎ 15/02 ５３３Ａ // Ｇ１０Ｌ 101:065 9/16 ３０１Ａ (72)発明者橋口博樹東京都千代田区東神田２−５−12 龍角散ビル８階技術研究組合新情報処理開発機構内 (72)発明者岡隆一東京都千代田区東神田２−５−12 龍角散ビル８階技術研究組合新情報処理開発機構内Ｆターム(参考） 5B075 ND14 NK06 PP03 PP07 PP12 PQ04 PR06 QM08 5D015 AA06 FF04 HH07 ──────────────────────────────────────────────────続き Continued on the front page (51) Int.Cl. ⁷ Identification symbol FI Theme coat ゛ (Reference) G10L 15/12 G10L 3/00 531N 15/02 533A // G10L 101: 065 9/16 301A (72) Invention Person Hiroki Hashiguchi Ryukakusan 2F-12 Higashikanda, Chiyoda-ku, Tokyo Building 8th floor, Technology Research Association, New Information Processing Development Machine premises Inventor Ryuichi Oka 2-5-12 Higashikanda, Chiyoda-ku, Tokyo Building 8F Technology Research Association New Information Processing Development Facility F-Term (Reference) 5B075 ND14 NK06 PP03 PP07 PP12 PQ04 PR06 QM08 5D015 AA06 FF04 HH07

Claims

[Claims]

1. A first feature of a song registered in a song database in advance and a second feature of a portion of a song used as a query are compared, and a range of a song most similar to the second feature is compared. The first feature is represented by a power sequence of an absolute scale group of musical scales along the time axis direction, and the second characteristic is represented by a relative musical scale along the time axis direction. And input means for inputting the second feature, and the second feature input from the first feature and the input means.
A music search device comprising: a music search unit that searches for a range of music that is similar to the second feature by comparing the feature with the second feature by a continuous DP technique.

2. The music search device according to claim 1, wherein said input means inputs a voice, and acquires the second feature by acoustically analyzing the input voice.

3. The music search device according to claim 1, wherein a first feature of the plurality of songs is registered in the database, and the first feature and the second feature of the plurality of songs are registered. A music search device characterized by comparison.

4. The music search device according to claim 1, further comprising: playback output means for playing back and outputting the music whose range has been searched by said music search means based on a first characteristic. Search device.

5. The music search device according to claim 1, wherein a plurality of sets of difference values of two scales having a predetermined time interval are obtained at different times on a time axis, and the obtained plurality of sets of difference values are obtained. A music search device wherein the second feature is represented by an arranged sequence.

6. A first feature of a song registered in a song database in advance and a second feature of a portion of a song used as a query are compared, and a range of a song most similar to the second feature is compared. In the song search method, the first feature is represented by a power sequence of an absolute scale group of musical scales along the time axis direction, and the second feature is represented by a relative musical scale along the time axis direction. Expressing and inputting the second feature to be input to the music search device, wherein the music search device compares the first feature and the second feature input from the input means by a continuous DP technique. A music range similar to the second feature.

7. The music search method according to claim 6, wherein a voice is input to said music search device, and said input voice is acoustically analyzed by said music search device to acquire said second feature. A featured music search method.

8. The music search method according to claim 6, wherein a first feature related to a plurality of songs is registered in the database, and the song search device stores the first feature of the plurality of songs and the first feature of the plurality of songs. A music search method characterized by comparing a second characteristic.

9. The music search method according to claim 6, wherein the music search device reproduces and outputs the music whose range has been searched based on a first characteristic.

10. The music search method according to claim 6, wherein a plurality of sets of difference values of two scales having a predetermined time interval are acquired at different times on the time axis, and the acquired plurality of sets of difference values are obtained. A music search method, wherein the second feature is represented by an arranged sequence.