JPH05127668A

JPH05127668A - Automatic transcription device

Info

Publication number: JPH05127668A
Application number: JP29167891A
Authority: JP
Inventors: Shigeaki Komatsu; 慈明小松
Original assignee: Brother Industries Ltd
Current assignee: Brother Industries Ltd
Priority date: 1991-11-07
Filing date: 1991-11-07
Publication date: 1993-05-25

Abstract

PURPOSE:To provide the automatic transcription device with high precision by taking an analysis so that frequency resolution is high in a low-frequency range and time resolution is high in a high-frequency range. CONSTITUTION:A CPU performs a frequency analyzing process for a music signal, which is sampled and stored in a RAM by an A/D conversion device, by wavelet conversion (S1). Then the output value of the wavelet conversion is inputted and a fundamental frequency component is extracted (S2); and note information such as MIDI codes is generated according to the frequency channel, intensity, start time, and end time of the fundamental frequency component and outputted on a display (S3).

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、音響的な楽曲信号に対
して周波数分析を行なう自動採譜装置に係わり、特にそ
の周波数分析処理に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an automatic music transcription device for performing frequency analysis on an acoustic music signal, and more particularly to its frequency analysis processing.

【０００２】[0002]

【従来の技術】従来、この種の自動採譜装置は図１に示
すように構成されていた。オーディオ・アンプ１は演奏
された楽曲信号を入力とし、この信号を適切な電圧値に
増幅する。ローパス・フィルター２はオーディオ・アン
プ１により増幅された信号における５．５ｋＨｚ以下の
周波数成分のみを通過させることにより、標本化時の折
返し歪を抑えている。Ａ／Ｄ変換装置３は、ローパス・
フィルター通過信号を、サンプリング周波数１２ｋＨ
ｚ、１６ビットのディジタル信号に変換する。Ｉ／Ｏポ
ート４はＣＰＵ５とＡ／Ｄ変換装置３、ディスプレイ８
とを接続している。ＣＰＵ５は楽曲信号データの周波数
分析処理、基本周波数成分の抽出処理、符号化処理等を
行ない、ＲＡＭ６、及びＲＯＭ７に接続されている。前
記ＲＡＭ６には、Ａ／Ｄ変換装置３により標本化された
楽曲信号データ、ＣＰＵ５により処理された周波数分析
結果等が格納されるエリアが用意されている。前記ＲＯ
Ｍ７には、周波数分析ロジック、基本周波数成分抽出ロ
ジック、符号化処理ロジック等が格納されている。前記
ディスプレイ８は処理結果等の表示を行なう。2. Description of the Related Art Conventionally, this type of automatic music transcription device has been constructed as shown in FIG. The audio amplifier 1 receives the played music signal as an input and amplifies this signal to an appropriate voltage value. The low-pass filter 2 suppresses aliasing distortion at the time of sampling by passing only the frequency component of 5.5 kHz or less in the signal amplified by the audio amplifier 1. The A / D converter 3 is a low-pass
Sampling frequency of 12 kHz for the filtered signal
z, 16-bit digital signal is converted. I / O port 4 is CPU 5, A / D converter 3, display 8
And are connected. The CPU 5 performs frequency analysis processing of music signal data, extraction processing of basic frequency components, encoding processing, etc., and is connected to the RAM 6 and the ROM 7. The RAM 6 is provided with an area for storing the music signal data sampled by the A / D converter 3 and the frequency analysis result processed by the CPU 5. The RO
M7 stores a frequency analysis logic, a fundamental frequency component extraction logic, an encoding processing logic, and the like. The display 8 displays processing results and the like.

【０００３】以下、従来例の作動について図２、図４を
参照して説明する。The operation of the conventional example will be described below with reference to FIGS. 2 and 4.

【０００４】図２は従来例により行なわれる処理を示す
フローチャートである。FIG. 2 is a flowchart showing the processing performed by the conventional example.

【０００５】ＣＰＵ５は始めに、Ａ／Ｄ変換装置３によ
り標本化されＲＡＭ６に格納されている楽曲信号データ
ｓ_{i:i=0,1,・・・,N-1}に対して、２５ｍｓｅｃ毎に短時間
フーリエ分析により周波数分析処理を行なう（Ｓ１）。First, the CPU 5 receives the music signal data s _{i: i = 0, 1, ..., N-1} sampled by the A / D converter 3 and stored in the RAM 6 every 25 msec. Frequency analysis processing is performed by short-time Fourier analysis (S1).

【０００６】ここで、周波数分析処理（Ｓ１）について
図４を参照し説明する。まず、ＣＰＵ５は楽曲信号デー
タに対する観測位置を示すポインタｈを初期化する（ｈ
＝０：Ｓ４１）。次に、ＲＡＭ６に格納されている楽曲
信号データを入力とし、短時間フーリエ分析を行う。短
時間フーリエ分析は、数１のような離散フーリエ変換に
より行なわれる。The frequency analysis process (S1) will be described with reference to FIG. First, the CPU 5 initializes a pointer h indicating an observation position for the music signal data (h
= 0: S41). Next, the music signal data stored in the RAM 6 is input, and short-time Fourier analysis is performed. The short-time Fourier analysis is performed by the discrete Fourier transform as shown in equation 1.

【０００７】[0007]

【数１】 [Equation 1]

【０００８】このとき、離散フーリエ変換の周波数分解
能は各周波数帯域において均一であり数１のＬに比例
し、時間分解能はＬに反比例することが一般に知られて
いる。実際の処理においては、ＣＰＵ５は数１を直接計
算するのではなく、高速フーリエ変換によりＸ
_{k:k=0,1,・・・、L-1}を求めている（Ｓ４３）。At this time, it is generally known that the frequency resolution of the discrete Fourier transform is uniform in each frequency band and is proportional to L of the equation 1 and the time resolution is inversely proportional to L. In the actual processing, the CPU 5 does not directly calculate the equation 1, but uses the fast Fourier transform to calculate X.
_{k: k = 0,1, ..., L-1} is obtained (S43).

【０００９】次に、ＣＰＵ５は離散フーリエ変換の出力
に対し、数２により対数パワー・スペクトルＰ
_{k:k-0,1,・・・、L-1}を算出する（Ｓ４４）。Next, the CPU 5 outputs the logarithmic power spectrum P to the output of the discrete Fourier transform according to the equation (2).
_{k: k-0,1, ..., L-1} are calculated (S44).

【００１０】[0010]

【数２】 [Equation 2]

【００１１】次に、ＣＰＵ５は、対数パワー・スペクト
ルのピーク値を検出する。そして、ピーク値に対応する
周波数を、一般的な４４０Ｈｚを基準とする平均律音階
の音階番号に変換し、この音階番号、及び、ピーク値を
ＲＡＭ６に格納する（Ｓ４５）。Next, the CPU 5 detects the peak value of the logarithmic power spectrum. Then, the frequency corresponding to the peak value is converted into the scale number of the equal temperament scale based on the general 440 Hz, and the scale number and the peak value are stored in the RAM 6 (S45).

【００１２】次に、ＣＰＵ５は、前記ポインタｈを３０
０ポイント分（２５ｍｓｅｃ）インクリメントし、処理
を（Ｓ４２）に戻す（Ｓ４６）。Next, the CPU 5 sets the pointer h to 30
It is incremented by 0 point (25 msec), and the process is returned to (S42) (S46).

【００１３】次に、ＣＰＵ５は前記ポインタｈと楽曲信
号データのサンプル数Ｎとを比較し、ｈ＜Ｎの判断が
「ＹＥＳ」の場合、以上で説明した（Ｓ４３〜Ｓ４６）
の処理を繰り返し、判断が「ＮＯ」である場合周波数分
析処理（Ｓ１）を終了する（Ｓ４２）。Next, the CPU 5 compares the pointer h with the sample number N of the music signal data, and if h <N is "YES", it is explained above (S43 to S46).
When the judgment is “NO”, the frequency analysis processing (S1) is ended (S42).

【００１４】次に、ＣＰＵ５は、前記ＲＡＭ６に格納さ
れている、スペクトルのピークに対応する音階番号、及
び、そのピーク値を入力とし、基本周波数成分の抽出を
行なう。基本周波数成分の抽出方法としては、例えば、
特開平３−９４０７２号公報に示される方法等を使用す
る（Ｓ２）。Next, the CPU 5 inputs the scale number corresponding to the peak of the spectrum stored in the RAM 6 and the peak value thereof, and extracts the fundamental frequency component. As a method of extracting the fundamental frequency component, for example,
The method disclosed in Japanese Patent Laid-Open No. 3-94072 is used (S2).

【００１５】次に、ＣＰＵ５は、基本周波数成分の音階
番号、強さ、始端時刻、終端時刻から、ＭＩＤＩコード
等の音符情報を作成し結果をディスプレイ８に出力する
（Ｓ３）。Next, the CPU 5 creates musical note information such as a MIDI code from the scale number, strength, start time and end time of the fundamental frequency component and outputs the result to the display 8 (S3).

【００１６】[0016]

【発明が解決しようとする課題】自動採譜装置において
は、楽曲信号を処理し、楽器音の基本周波数を同定する
ことが目的となる。代表的な平均律音階は、オクターブ
間隔を対数的に１２等分したものであり、周波数分析処
理においては、１２分の１オクターブの周波数分解能が
要求される。つまり、低周波数領域では、高い周波数分
解能が必要とされるが、高周波数領域では、あまり高い
周波数分解能は必要とされない。しかし、楽曲の特性と
して一般的に、高周波数領域では高い時間分解能が要求
されことが多い。SUMMARY OF THE INVENTION In an automatic transcription apparatus, it is an object to process a music signal and identify a fundamental frequency of a musical instrument sound. A typical equal tempered scale is an octave interval logarithmically divided into 12 equal parts, and a frequency analysis process requires a frequency resolution of 1/12 octave. That is, high frequency resolution is required in the low frequency region, but not very high frequency resolution in the high frequency region. However, as a characteristic of music, generally, a high time resolution is often required in a high frequency region.

【００１７】しかし、上記従来の自動採譜装置の周波数
分析処理においては、数１の様な離散フーリエ変換によ
る短時間スペクトル分析が使われているが、前記したよ
うに離散フーリエ変換の周波数分解能は数１のＬに比例
し、時間分解能はＬに反比例し、それらは各周波数帯域
において均一であることが知られている。したがって、
低周波数領域における周波数分解能を上げようとする
と、高周波数領域での時間分解能が不足することにな
る。逆に、高周波数領域における時間分解能を上げよう
とすると、低周波数領域での周波数分解能が不足してし
まうという欠点があった。However, in the frequency analysis processing of the above-mentioned conventional automatic transcription apparatus, short-time spectrum analysis by discrete Fourier transform as shown in equation 1 is used, but the frequency resolution of discrete Fourier transform is several as described above. It is known that it is proportional to L of 1 and the temporal resolution is inversely proportional to L, and they are uniform in each frequency band. Therefore,
When trying to increase the frequency resolution in the low frequency region, the time resolution in the high frequency region becomes insufficient. On the other hand, there is a drawback in that the frequency resolution in the low frequency region becomes insufficient when trying to increase the time resolution in the high frequency region.

【００１８】本発明は、上述した問題点を解決するもの
で、低周波数領域では周波数分解能が高く、高周波数領
域では時間分解能が高くなるような分析方法を用いるこ
とにより、精度の高い自動採譜装置を提供することを目
的としている。The present invention solves the above-mentioned problems, and by using an analysis method in which the frequency resolution is high in the low frequency region and the time resolution is high in the high frequency region, a highly accurate automatic transcription device is provided. Is intended to provide.

【００１９】[0019]

【課題を解決するための手段】この目的を達成するため
に本発明は、音響的な信号データを入力とし周波数分析
を行う周波数分析手段と、周波数分析結果から基本周波
数成分を抽出する基本周波数成分抽出手段と、基本周波
数成分の音程、強さ、始端時刻、終端時刻等からＭＩＤ
Ｉコード等の音符情報を出力する符号化手段とを備え、
前記周波数分析手段を、ウェーブレット変換により構成
した。To achieve this object, the present invention provides a frequency analysis means for performing frequency analysis with acoustic signal data as input, and a fundamental frequency component for extracting a fundamental frequency component from the frequency analysis result. From the extraction means and the pitch, strength, start time, end time, etc. of the fundamental frequency component, MID
An encoding means for outputting note information such as an I-code,
The frequency analysis means is constructed by wavelet transform.

【００２０】また、前記ウェーブレット変換の出力の中
心周波数を、演奏高度を基準とする平均律音階の各周波
数に等しくなるようにしてもよい。Further, the center frequency of the output of the wavelet transform may be set to be equal to each frequency of the equal tempered scale based on the performance altitude.

【００２１】また、前記ウェーブレット変換の基本ウェ
ーブレット関数としてガボール関数を用いてもよい。A Gabor function may be used as the basic wavelet function of the wavelet transform.

【００２２】[0022]

【作用】上記の構成を有する本発明の周波数分析手段
は、音響的な信号を入力としウェーブレット変換により
周波数分析を行う。基本周波数成分抽出手段は周波数分
析結果から基本周波数成分を抽出する。符号化手段は基
本周波数成分の音程、強さ、始端時刻、終端時刻等から
ＭＩＤＩコード等の音符情報を出力する。The frequency analyzing means of the present invention having the above-mentioned structure receives the acoustic signal as an input and performs the frequency analysis by the wavelet transform. The fundamental frequency component extracting means extracts the fundamental frequency component from the frequency analysis result. The encoding means outputs note information such as MIDI code from the pitch, strength, start time, end time, etc. of the fundamental frequency component.

【００２３】[0023]

【実施例】以下、本発明を具体化した一実施例を図面を
参照して説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings.

【００２４】図１は本実施例による自動採譜装置のブロ
ック図である。本実施例を構成するオーディオ・アンプ
１は演奏された楽曲信号を入力とし、この信号を適切な
電圧値に増幅する。ローパス・フィルター２はオーディ
オ・アンプ１により増幅された信号における５．５ｋＨ
ｚ以下の周波数成分のみを通過させることにより、標本
化時の折返し歪を抑えている。Ａ／Ｄ変換装置３はロー
パス・フィルター通過信号を、サンプリング周波数１２
ｋＨｚ、１６ビットのディジタル信号に変換する。Ｉ／
Ｏポート４はＣＰＵ５と、Ａ／Ｄ変換装置３、ディスプ
レイ８とを接続している。ＣＰＵ５は楽曲信号データの
周波数分析処理、基本周波数成分の抽出処理、符号化処
理等を行ない、ＲＡＭ６、及びＲＯＭ７に接続されてい
る。前記ＲＡＭ６にはＡ／Ｄ変換装置３により標本化さ
れた楽曲信号データ、ＣＰＵ５により処理された周波数
分析結果等が格納されるエリアが用意されている。前記
ＲＯＭ７には周波数分析ロジック、基本周波数抽出ロジ
ック、符号化処理ロジック等が格納されている。前記デ
ィスプレイ８は処理結果等の表示を行なう。FIG. 1 is a block diagram of an automatic music transcription device according to this embodiment. The audio amplifier 1 constituting the present embodiment receives the played music signal as an input and amplifies this signal to an appropriate voltage value. The low-pass filter 2 is 5.5 kHz in the signal amplified by the audio amplifier 1.
The aliasing distortion at the time of sampling is suppressed by passing only the frequency components equal to or lower than z. The A / D converter 3 converts the low-pass filtered signal into the sampling frequency 12
Converted to a 16-bit digital signal of kHz. I /
The O port 4 connects the CPU 5, the A / D conversion device 3, and the display 8. The CPU 5 performs frequency analysis processing of music signal data, extraction processing of basic frequency components, encoding processing, etc., and is connected to the RAM 6 and the ROM 7. The RAM 6 is provided with an area for storing the music signal data sampled by the A / D converter 3 and the frequency analysis result processed by the CPU 5. The ROM 7 stores a frequency analysis logic, a fundamental frequency extraction logic, an encoding processing logic and the like. The display 8 displays processing results and the like.

【００２５】以下、本実施例の作動について図２、図３
を参照して説明する。The operation of this embodiment will be described below with reference to FIGS.
Will be described.

【００２６】図２は本実施例により行なわれる処理を示
すフローチャートである。FIG. 2 is a flow chart showing the processing performed by this embodiment.

【００２７】ＣＰＵ５は始めに、Ａ／Ｄ変換装置３によ
り標本化されＲＡＭ６に格納されている楽曲信号データ
ｓ_{i:i=0,1,・・・,N-1}に対してウェーブレット変換による
周波数分析処理を行なう（Ｓ１）。First, the CPU 5 performs frequency conversion by wavelet transformation on the music signal data s _{i: i = 0,1, ..., N-1} sampled by the A / D converter 3 and stored in the RAM 6. Analysis processing is performed (S1).

【００２８】ここで、ウェーブレット変換について説明
する。１次元の信号ｆ_(t)のウェーブレット変換Ｆ_(a,b)
は数３で定義されている。Here, the wavelet transform will be described. Wavelet transform F _{(a, b) of} one-dimensional signal f _(t ₎
Is defined by Equation 3.

【００２９】[0029]

【数３】 [Equation 3]

【００３０】ここで、Ψは基本ウェーブレット関数と呼
ばれる関数である。ウェーブレット変換は周波数に対す
るスケールを表わすパラメータａと、時刻を表わすパラ
メータｂを動かすことによりΨから作り出される関数の
族と解析の対象となる関数ｆ_(t)との内積として定義さ
れる。Ψ^*はΨの複素共役である。本実施例ではａ，ｂ
を数４、数５で定義し、ディジタル化された入力信号ｓ
_iに対する離散化されたウェーブレット変換Ｓ_m,nを数６
で定義する。Here, Ψ is a function called a basic wavelet function. The wavelet transform is defined as an inner product of a parameter a representing a scale with respect to frequency and a family of functions created from Ψ by moving a parameter b representing time and a function f _(t) to be analyzed. Ψ ^* is the complex conjugate of Ψ. In this embodiment, a, b
Is defined by Equations 4 and 5, and the digitized input signal s
_The discretized wavelet transform S _{m, n} for _i is given by
Define in.

【００３１】[0031]

【数４】 [Equation 4]

【００３２】[0032]

【数５】 [Equation 5]

【００３３】[0033]

【数６】 [Equation 6]

【００３４】ここで，１／Ｔ_sは前記サンプリング周波
数（本実施例では１２０００）である。ｍ、ｎはそれぞ
れ周波数、観測位置（時刻）に関するパラメータであ
り、今後本明細書中において周波数チャンネル番号、観
測位置と呼ぶ。周波数チャンネル番号ｍは小さいほど高
周波数領域を、大きいほど低周波数領域を表わす。ま
た、基本ウェーブレット関数Ψ_(t)としてはラプラシア
ンガウシアン関数、聴覚系インパルス応答を利用したも
のなどが知られているが、本実施例では時間と周波数に
関する不確定性が最小であり、この意味で時間−周波数
空間において最も局在性が良いとされるガボール関数を
使用している。ガボール関数は数７で定義される。Here, 1 / T _s is the sampling frequency (12000 in this embodiment). m and n are parameters relating to frequency and observation position (time), respectively, and will be referred to as frequency channel number and observation position in the present specification. The smaller the frequency channel number m, the higher the frequency range, and the larger the frequency channel number m, the lower the frequency range. Further, as the basic wavelet function Ψ _(t) , a Laplacian-Gaussian function, one using an auditory system impulse response, etc. are known, but in this embodiment, uncertainty about time and frequency is minimum, and in this sense, The Gabor function, which is said to have the best localization in the time-frequency space, is used. The Gabor function is defined by Equation 7.

【００３５】[0035]

【数７】 [Equation 7]

【００３６】数６、数７より時間分解能はａに反比例
し、ａが小さい（高周波数領域）ほど時間分解能は高く
なり、ａが大さい（低周波数領域）ほど低くなることが
分かる。また、ガボール関数のフーリエ変換Ψ₍ω₎は数
８のようになり、ａ^-1/2・Ψ_((t-b)/a)のフーリエ変換
がａ^1/2・Ψ₍ａω₎・ｅｘｐ［−ｉωｂ］となることか
ら、周波数分解能はａに比例し、ａが大さい（低周波数
領域）ほど周波数分解能は高くなり、ａが小さい（高周
波数領域）ほど低くなることが分かる。From Equations 6 and 7, it is understood that the time resolution is inversely proportional to a, that is, the smaller a (high frequency region), the higher the time resolution, and the larger a (low frequency region), the lower. In addition, the Fourier transform Ψ ₍ ω ₎ of the Gabor function is as shown in ^Equation 8, and the Fourier transform of a ⁻¹ / ² · Ψ _{((tb) / a)} is a ^1/2 · Ψ ₍ aω ₎ · exp [− iωb], the frequency resolution is proportional to a, and it can be seen that the larger a is (low frequency region), the higher the frequency resolution, and the smaller a is (high frequency region), the lower.

【００３７】[0037]

【数８】 [Equation 8]

【００３８】以上説明したように、数６を計算すること
により低周波数領域では周波数分解能が高く、高周波数
領域では時間分解能が高くなる様な分析を行うことがで
きる。As described above, by calculating equation 6, it is possible to perform analysis such that the frequency resolution is high in the low frequency region and the time resolution is high in the high frequency region.

【００３９】数７において、ω_pは最も高い周波数チャ
ンネル（ｍ＝０）に対する出力の中心周波数に２πをか
けたものであり、本実施例ではω_p＝２π×５５８６Ｈ
ｚとした。これにより各周波数チャンネルの出力の中心
周波数が、一般的な４４０Ｈｚを基準とする平均律音階
の周波数に程等しくなるようにすることができる。ま
た、周波数空間での半値幅は（２ω_p／ｒ）で与えら
れ、本実施例ではｒ＝３０としている。In Expression 7, ω _p is the output center frequency for the highest frequency channel (m = 0) multiplied by 2π, and in this embodiment ω _p = 2π × 5586H
z. As a result, the center frequency of the output of each frequency channel can be made approximately equal to the frequency of the equal temperament scale based on the general 440 Hz. Further, the half width in the frequency space is given by (2ω _p / r), and in this embodiment, r = 30.

【００４０】また、本実施例において、Ｍ＝７２とし
た。これにより８７Ｈｚ〜５５８６Ｈｚまでの周波数帯
域の分析を行うことができる。また、ＮはＲＡＭ６に格
納されている楽曲信号データのサンプル数である。In the present embodiment, M = 72. This enables analysis of the frequency band from 87 Hz to 5586 Hz. N is the number of samples of the music signal data stored in the RAM 6.

【００４１】ここで、図３を使い周波数分析処理（Ｓ
１）の作動について説明する。まず始めに、ＣＰＵ５は
周波数チャンネル番号ｍの初期化を行う（ｍ＝０：Ｓ３
１）。次に、ＣＰＵ５は観測位置ｎの初期化を行う（ｎ
＝０：Ｓ３３）。次に、ＣＰＵ５は前記数６によりＳ
_m,nを算出する（Ｓ３５）。次に、ＣＰＵ５は数８によ
りＳ_m,nを対数化し、ＲＡＭに格納する（Ｓ３６）。Here, the frequency analysis process (S
The operation of 1) will be described. First, the CPU 5 initializes the frequency channel number m (m = 0: S3).
1). Next, the CPU 5 initializes the observation position n (n
= 0: S33). Next, the CPU 5 uses S by the above equation 6.
_{m and n} are calculated (S35). Next, the CPU 5 logarithmizes S _{m, n} by the equation 8 and stores it in the RAM (S36).

【００４２】[0042]

【数９】 [Equation 9]

【００４３】次に、ＣＰＵ５は観測位置ｎをインクリメ
ントし、処理を（Ｓ３４）に戻す（Ｓ３７）。ＣＰＵ５
は観測位置ｎと楽曲信号データのサンプル数Ｎとを比較
し、ｎ＜Ｎの判断が「ＹＥＳ」なら以上で説明した（Ｓ
３５〜Ｓ３７）の処理を繰り返し、「ＮＯ」なら処理を
（Ｓ３８）に移す（Ｓ３４）。（Ｓ３４）において判断
が「ＮＯ」である場合、ＣＰＵ５は周波数チャンネル番
号ｍをインクリメントし処理を（Ｓ３２）に戻す（Ｓ３
８）。ＣＰＵ５は、周波数チャンネル番号ｍとＭとを比
較し、ｍ≦Ｍの判断が「ＹＥＳ」なら以上で説明した
（Ｓ３３〜Ｓ３８）の処理を繰り返し、「ＮＯ」なら周
波数分析処理（Ｓ１）を終了する（Ｓ３２）。Next, the CPU 5 increments the observation position n and returns the processing to (S34) (S37). CPU5
Compares the observation position n with the sample number N of the music signal data, and if the judgment of n <N is “YES”, it is explained above (S
The processes of 35 to S37 are repeated, and if “NO”, the process proceeds to (S38) (S34). When the determination is (NO) in (S34), the CPU 5 increments the frequency channel number m and returns the process to (S32) (S3).
8). The CPU 5 compares the frequency channel numbers m and M, and repeats the above-described processing of (S33 to S38) if the determination of m ≦ M is “YES”, and ends the frequency analysis processing (S1) if “NO”. Yes (S32).

【００４４】次に、ＣＰＵ５は前記ＲＡＭ６に格納され
ているウェーブレット変換の出力値を入力とし、基本周
波数成分の抽出を行なう。基本周波数成分の抽出方法と
しては、例えば特開平３−９４０７２号公報に示される
方法等を使用する（Ｓ２）。Next, the CPU 5 receives the output value of the wavelet transform stored in the RAM 6 as an input, and extracts the fundamental frequency component. As a method of extracting the fundamental frequency component, for example, the method disclosed in Japanese Patent Laid-Open No. 3-94072 is used (S2).

【００４５】次に、ＣＰＵ５は基本周波数成分の周波数
チャンネル番号、強さ、始端時刻、終端時刻等から、Ｍ
ＩＤＩコード等の音符情報を作成しディスプレイ８に出
力する（Ｓ３）。Next, the CPU 5 determines M from the frequency channel number, strength, start time, end time, etc. of the basic frequency component.
Note information such as an IDI code is created and output to the display 8 (S3).

【００４６】本発明は、以上詳述した構成に限定される
ものではなく、その主旨を逸脱しない範囲において種々
の変更を加えることができる。The present invention is not limited to the configuration described in detail above, and various modifications can be made without departing from the spirit of the invention.

【００４７】[0047]

【発明の効果】以上説明したことから明かなように、本
発明の自動採譜装置によれば、音響的な信号を入力とし
周波数分析を行う周波数分析手段と、周波数分析結果か
ら基本周波数成分を抽出する基本周波数成分抽出手段
と、基本周波数成分の音程、強さ、始端時刻、終端時刻
等からＭＩＤＩコード等の音符情報を出力する符号化手
段とを備えた自動採譜装置において、前記周波数分析手
段をウェーブレット変換で実現したことにより、低周波
数領域では高い周波数分解能が要求され、高周波数領域
では高い時間分解能が要求されるような、楽曲信号の採
譜処理に適した周波数分析を行うことができる。As is apparent from the above description, according to the automatic music transcription apparatus of the present invention, the frequency analysis means for performing frequency analysis with an acoustic signal as an input, and the fundamental frequency component extracted from the frequency analysis result. In the automatic music transcription device, the frequency analysis means is provided with a fundamental frequency component extracting means and a coding means for outputting musical note information such as MIDI code from the pitch, strength, start time, end time, etc. of the fundamental frequency component. By implementing the wavelet transform, it is possible to perform frequency analysis suitable for music notation processing, which requires high frequency resolution in the low frequency region and high time resolution in the high frequency region.

[Brief description of drawings]

【図１】図１は、本発明の一実施例によるブロック構成
図である。FIG. 1 is a block diagram according to an embodiment of the present invention.

【図２】図２は、本発明の一実施例による自動採譜装置
のフローチャートである。FIG. 2 is a flowchart of an automatic music transcription device according to an embodiment of the present invention.

【図３】図３は、周波数分析処理のフローチャートであ
る。FIG. 3 is a flowchart of frequency analysis processing.

【図４】図４は、従来例の周波数分析処理のフローチャ
ートである。FIG. 4 is a flowchart of a conventional frequency analysis process.

[Explanation of symbols]

１オーディオ・アンプ２ローパス・フィルター３Ａ／Ｄ変換装置５ＣＰＵ８ディスプレイ 1 Audio amplifier 2 Low pass filter 3 A / D converter 5 CPU 8 Display

Claims

[Claims]

1. A frequency analysis means for inputting acoustic signal data to perform a frequency analysis, a fundamental frequency component extraction means for extracting a fundamental frequency component from the frequency analysis result, a pitch, a strength, and a start time of the fundamental frequency component. , From the end time, etc. to MI
An automatic music transcription device comprising an encoding means for outputting musical note information such as a DI code, wherein the frequency analysis means is constituted by a wavelet transform.

2. The automatic music transcription device according to claim 1, wherein the center frequency of the output of the wavelet transform is equal to each frequency of the equal tempered scale with the performance altitude as a reference.

3. The automatic music transcription device according to claim 1, wherein the basic wavelet function of the wavelet transform is a Gabor function.