JPH04261591A

JPH04261591A - Automatic music scoreing device

Info

Publication number: JPH04261591A
Application number: JP1143291A
Authority: JP
Inventors: Seiko Ishikawa; 石川　せい子
Original assignee: Brother Industries Ltd
Current assignee: Brother Industries Ltd
Priority date: 1991-01-07
Filing date: 1991-01-07
Publication date: 1992-09-17
Anticipated expiration: 2014-03-17
Also published as: JP2871120B2

Abstract

PURPOSE:To satisfactorily extract the basic frequency by extracting a basic frequency candidate from a power spectrum obtained by executing a frequency analysis at every prescribed time with respect to data. CONSTITUTION:A signal fetching part 21 executes an A/D conversion of music, and sets it to a digital signal handled in a computer. Also, a frequency analysis processing part 22 calculates a power spectrum in the frequency direction within a prescribed time by executing a frequency analysis at every certain prescribed time with respected to data fetched by the signal fetching part 21. Subsequently, a basic frequency candidate extracting part 23 extracts a basic frequency from the power spectrum.

Description

[Detailed description of the invention]

【０００１】0001

【産業上の利用分野】本発明は、音楽を楽譜もしくは楽
譜に相当する符号に変換する自動採譜装置に係り、さら
に詳細には、実際に鳴っている音である基本周波数の抽
出のための構成に関する。[Field of Industrial Application] The present invention relates to an automatic score transcription device that converts music into a musical score or a code corresponding to a musical score, and more specifically, the present invention relates to an automatic score transcription device that converts music into a musical score or a code corresponding to a musical score. Regarding.

【０００２】0002

【従来の技術】従来、複数の楽器によって演奏された音
楽の採譜は、音楽的知識を有する採譜者によって行われ
てきた。また、採譜を人が行うのでなく装置により自動
的に行うものでは、単音からなる音楽を採譜したり、あ
るいは鍵盤の押された情報から採譜を行う装置が提案さ
れている。2. Description of the Related Art Conventionally, music played by a plurality of musical instruments has been transcribed by a transcriber with musical knowledge. In addition, in the case where music is automatically transcribed by a device rather than by a person, there have been proposed devices that transcribe music consisting of single notes or that transcribe music based on information about keyboard presses.

【０００３】0003

【発明が解決しようとする課題】しかしながら、従来の
採譜装置では、楽器の種類や楽器数に制約があり、採譜
の範囲が限定され、一般的な音楽の演奏をそのまま採譜
できないという問題があった。また、複数の楽器によっ
て演奏された音楽をＡ／Ｄ変換した信号をＦＦＴ等によ
り周波数分析しただけでは、各楽器の倍音が多数観測さ
れ、どの音が基本周波数であるのか容易に判断できない
という問題があった。本発明は、上述した問題点を解決
するもので、音楽をＡ／Ｄ変換した信号を周波数分析し
、その結果から倍音を除去し、基本周波数のみを取り出
すことによって、楽器の種類や楽器数に制約を受けない
自動採譜装置を提供することを目的とする。[Problems to be Solved by the Invention] However, with conventional music transcription devices, there are restrictions on the types and number of instruments, which limits the range of music transcription, and there is a problem in that it is not possible to transcribe general musical performances as they are. . Another problem is that if you simply frequency-analyze the A/D-converted signal of music played by multiple instruments using FFT, etc., many harmonics of each instrument will be observed, making it impossible to easily determine which sound is the fundamental frequency. was there. The present invention solves the above-mentioned problems by frequency-analyzing the A/D-converted music signal, removing overtones from the result, and extracting only the fundamental frequency. The purpose is to provide an automatic score transcription device that is not subject to restrictions.

【０００４】0004

【課題を解決するための手段】上記目的を達成するため
に請求項１の発明は、音楽信号を楽譜もしくは楽譜に相
当する符号に変換する採譜装置において、音楽信号を取
り込みＡ／Ｄ変換する信号取り込み部と、前記信号取り
込み部で取り込んだデータに対して、一定時間毎に周波
数解析を行うことにより、一定時間内での周波数方向の
パワー・スペクトルを計算する周波数解析処理部と、前
記パワー・スペクトルより基本周波数候補を抽出する基
本周波数候補抽出部とを備えたものである。請求項２の
発明は、上記の基本周波数候補抽出部が、パワー・スペ
クトルのピークからパワーのしきい値を計算するパワー
のしきい値計算手段と、前記パワー・スペクトルのピー
クのうちの一つが演奏音の基本周波数かどうかを、それ
より高い周波数の他のピークのなかで元のピークの倍音
となっているピークと、元のピークと、両者の間にある
ピークの周波数とパワーに基づいて判定する基本周波数
らしさ判定手段と、前記パワー・スペクトルのピークの
うちの一つがそれより低い周波数の他のピークの中のど
れかの倍音となっているかを検索する基本周波数検索手
段と、前記基本周波数検索手段で検索されたピークの倍
音系列が、パワー・スペクトルのピークの中にどの程度
含まれているかどうかを算出する倍音系列含有度計算手
段と、前記パワー・スペクトルのピークのうちの一つが
他のピークの倍音であると判断された時に、倍音とされ
た方のピークをパワー・スペクトルから除去する倍音除
去手段とからなるものである。Means for Solving the Problems In order to achieve the above object, the invention of claim 1 provides a score recording device that converts a music signal into a musical score or a code corresponding to the musical score, in which a music signal is taken in and A/D converted. an acquisition unit; a frequency analysis processing unit that calculates a power spectrum in the frequency direction within a certain time by performing frequency analysis on the data acquired by the signal acquisition unit at certain time intervals; and a fundamental frequency candidate extraction section that extracts fundamental frequency candidates from the spectrum. The invention according to claim 2 is characterized in that the fundamental frequency candidate extracting unit includes power threshold calculation means for calculating a power threshold from the peak of the power spectrum, and Determines whether a performance note has the fundamental frequency based on the frequency and power of the peak that is a harmonic of the original peak among other peaks with higher frequencies, the original peak, and the peaks between the two. fundamental frequency retrieval means for determining whether one of the peaks of the power spectrum is a harmonic of any of the other peaks having a lower frequency; a harmonic series content calculation means for calculating to what extent the harmonic series of the peak searched by the frequency search means is included in the peaks of the power spectrum; The overtone removing means removes the peak determined to be an overtone from the power spectrum when the peak is determined to be an overtone of another peak.

【０００５】[0005]

【作用】請求項１の構成によれば、信号取り込み部は、
音楽をＡ／Ｄ変換し、計算機内で扱えるデジタル信号に
する。周波数解析処理部は、信号取り込み部で取り込ん
だデータに対して、ある一定時間毎に周波数解析を行う
ことにより、前記一定時間内での周波数方向のパワー・
スペクトルを計算する。基本周波数候補抽出部はこのパ
ワー・スペクトルより基本周波数候補を抽出する。請求
項２に記載した基本周波数候補抽出部を構成する各手段
は以下のような作用をする。パワーのしきい値計算手段
は、パワー・スペクトルのピークからパワーのしきい値
を計算する。基本周波数らしさ判定手段は、パワー・ス
ペクトルのピークのうちの一つが基本周波数かどうかを
、それより高い周波数の他のピークのなかで元のピーク
の倍音となっているピークと、元のピークと、両者の間
にあるピークの周波数とパワーとから判定する。基本周
波数検索手段は、前記パワー・スペクトルのピークのう
ちの一つが、それより低い周波数の他のピークの中のど
れかの倍音となっているかを検索する。倍音系列含有度
計算手段は、前記基本周波数検索手段で検索されたピー
クの倍音系列が、パワー・スペクトルのピークの中にど
の程度含まれているかどうかを算出する。倍音除去手段
は、前記パワー・スペクトルのピークのうちの一つが他
のピークの倍音であると判断された場合に、倍音とされ
た方のピークをパワー・スペクトルから除去する。[Operation] According to the structure of claim 1, the signal acquisition section:
Converts music from A/D to a digital signal that can be handled within a computer. The frequency analysis processing section performs frequency analysis on the data acquired by the signal acquisition section at certain fixed time intervals, thereby calculating the power/power in the frequency direction within the fixed time period.
Calculate the spectrum. The fundamental frequency candidate extraction section extracts fundamental frequency candidates from this power spectrum. Each means constituting the fundamental frequency candidate extracting section described in claim 2 operates as follows. The power threshold calculation means calculates a power threshold from the peak of the power spectrum. The fundamental frequency similarity determining means determines whether one of the peaks in the power spectrum is a fundamental frequency or not, and compares it with a peak that is a harmonic of the original peak among other peaks with higher frequencies, and a peak that is a harmonic of the original peak. , based on the frequency and power of the peak between the two. The fundamental frequency search means searches whether one of the peaks of the power spectrum is a harmonic of any of the other peaks having a lower frequency. The harmonic series content calculating means calculates to what extent the harmonic series of the peak searched by the fundamental frequency searching means is included in the peaks of the power spectrum. The overtone removing means removes the peak determined to be an overtone from the power spectrum when it is determined that one of the peaks of the power spectrum is an overtone of another peak.

【０００６】[0006]

【実施例】以下、本発明を具体化した一実施例を図面を
参照して説明する。図１は本発明による自動採譜装置の
ブロック図である。本装置は、音楽信号が入力されるオ
ーディオ・アンプ１と、ローパス・フィルター２と、Ａ
／Ｄ変換装置３と、Ｉ／Ｏポート４と、基本周波数候補
を抽出処理するＣＰＵ５と、ＲＡＭ６と、ＲＯＭ７と、
抽出した結果を表示するデイスプレイ８とから構成され
ている。DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment embodying the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram of an automatic score transcription apparatus according to the present invention. This device includes an audio amplifier 1 to which a music signal is input, a low-pass filter 2, and an A
/D conversion device 3, I/O port 4, CPU 5 for extracting fundamental frequency candidates, RAM 6, ROM 7,
It is composed of a display 8 that displays the extracted results.

【０００７】次に、上記図１に図２、図３を加えて、本
自動採譜装置により行われる基本周波数抽出処理のため
の機能構成を説明する。本装置は、機能構成要素として
、信号取り込み部２１と高速フーリエ変換（ＦＦＴ）処
理部２２と基本周波数候補抽出部２３とからなる。信号
取り込み部２１は、オーディオ・アンプ１、ローパス・
フィルタ２、Ａ／Ｄ変換装置３、Ｉ／Ｏポート４、ＣＰ
Ｕ５等により構成される。入力された音楽信号はオーデ
ィオ・アンプ１により増幅され、この増幅された信号は
、ローパス・フィルタ２に入力され、例えば、５．５ｋ
Ｈｚ以下の周波数成分のみが通過し、標本化時の折返し
歪みを抑えている。この出力信号は、Ａ／Ｄ変換装置３
により１２ｋＨｚ、１６ｂｉｔで標本化される。標本化
されたデータは、Ｉ／Ｏポート４を介し、ＣＰＵ５に取
り込まれ、ＲＡＭ６に記憶される。Next, by adding FIGS. 2 and 3 to FIG. 1 above, the functional configuration for fundamental frequency extraction processing performed by the automatic score transcription apparatus will be explained. This device includes a signal acquisition section 21, a fast Fourier transform (FFT) processing section 22, and a fundamental frequency candidate extraction section 23 as functional components. The signal acquisition unit 21 includes an audio amplifier 1, a low-pass
Filter 2, A/D converter 3, I/O port 4, CP
Consists of U5 etc. The input music signal is amplified by the audio amplifier 1, and this amplified signal is input to the low-pass filter 2, for example, 5.5k.
Only frequency components below Hz pass through, suppressing aliasing distortion during sampling. This output signal is transmitted to the A/D converter 3
The signal is sampled at 12 kHz and 16 bits. The sampled data is taken into the CPU 5 via the I/O port 4 and stored in the RAM 6.

【０００８】ＦＦＴ処理部２２は、ＣＰＵ５、ＲＡＭ６
等でなり、ＣＰＵ５はＲＡＭ６より標本化されたデータ
を読み出し、２５ｍｓｅｃ毎を１フレームとし、１フレ
ーム毎に８５．３ｍｓｅｃハミング窓を掛けた後、ＦＦ
Ｔ分析により対数パワー・スペクトルが算出される。次
に、ＣＰＵ５は、算出された対数パワー・スペクトルか
ら放射線内挿処理によりピーク周波数を求める。図５は
以上のようにして求めたピーク・スペクトルを、時間軸
を横軸に、ピーク周波数を鍵盤番号に変換して縦軸にと
り、強度を濃淡で示したものである。同図は倍音除去前
のデータである。基本周波数候補抽出部２３は、ＣＰＵ
５でなり、図３のようにパワーのしきい値計算手段３１
と、基本周波数らしさ判定手段３２と、基本周波数検索
手段３３と、倍音系列含有度計算手段３４と、倍音除去
手段３５とから構成されている。[0008] The FFT processing section 22 includes a CPU 5 and a RAM 6.
etc., the CPU 5 reads the sampled data from the RAM 6, takes every 25 msec as one frame, multiplies an 85.3 msec Hamming window for each frame, and then reads the sampled data from the FF
A logarithmic power spectrum is calculated by T analysis. Next, the CPU 5 determines the peak frequency from the calculated logarithmic power spectrum by radiation interpolation processing. FIG. 5 shows the peak spectrum obtained as described above, with the time axis on the horizontal axis and the peak frequency converted into a keyboard number on the vertical axis, and the intensity shown in shading. The figure shows data before overtone removal. The fundamental frequency candidate extracting unit 23 uses the CPU
5, and the power threshold calculation means 31 as shown in FIG.
, a fundamental frequency likelihood determining means 32 , a fundamental frequency searching means 33 , an overtone series content calculating means 34 , and an overtone removing means 35 .

【０００９】基本周波数候補抽出部２３では、ＦＦＴ処
理部２２の各フレーム毎に処理を進める。いま、あるフ
レームに対して、ＦＦＴ処理部２２で求めたスペクトル
のピークがＮ個あるとし、以下、図４のフローチャート
に従って説明する。図４は、基本周波数候補抽出部２３
での処理を、ある１フレーム分について示したものであ
る。Ｎ個のピークについては、各々周波数Ｆとパワーの
値Ｐが周波数によってソートされてＲＡＭ６に記憶され
ている。このＮ個のピークのパワーの値から、パワーに
よるしきい値Ｓｐを計算する（ステップ＃１）。Ｎ個の
ピークについては、周波数の高い方から低い方へ、順に
処理を進める。Ｎ個のピークの、周波数の高い方からｎ
番目のピークの周波数をＦ（ｎ）、パワーをＰ（ｎ）と
する。Ｆ（ｎ）が基本周波数であるかどうかを、Ｆ（ｎ
）よりも高い周波数Ｆ（ｍ）およびパワーＰ（ｍ）（１
≦ｍ＜ｎ）から判定する（＃２，＃３）。この判定は以
下のようにして行う。まず、Ｆ（ｍ）の中にＦ（ｎ）の
第Ｘ次倍音（Ｘ≧２）となっているＦ（ｘ）があるかど
うか探す。Ｆ（ｘ）が見つかった場合、Ｆ（ｎ）とＦ（
ｘ）の関係は、Ｆ（ｎ）が基本周波数で、Ｆ（ｘ）が第
Ｘ次倍音となっている場合（これをケース１とする）と
、Ｆ（ｎ）、Ｆ（ｘ）がより低い、両者の公約数となっ
ている周波数の、それぞれれＬ次（Ｌ≧２）、Ｌ×Ｘ次
倍音となっている場合（これをケース２とする）とがあ
り得る。[0009] The fundamental frequency candidate extracting section 23 advances the processing for each frame of the FFT processing section 22. Now, it is assumed that there are N peaks in the spectrum obtained by the FFT processing unit 22 for a certain frame, and the following description will be made according to the flowchart of FIG. FIG. 4 shows the fundamental frequency candidate extraction unit 23
This figure shows the processing for one frame. Regarding the N peaks, the frequency F and power value P are sorted by frequency and stored in the RAM 6. A power threshold Sp is calculated from the power values of these N peaks (step #1). Regarding the N peaks, processing is performed in order from the highest frequency to the lowest frequency. Of N peaks, n from the highest frequency
Let the frequency of the th peak be F(n) and the power be P(n). Check whether F(n) is the fundamental frequency by F(n
) higher frequency F(m) and power P(m)(1
Judgment is made from ≦m<n) (#2, #3). This determination is made as follows. First, a search is made to see if there is an F(x) in F(m) that is the X-th harmonic (X≧2) of F(n). If F(x) is found, then F(n) and F(
x) relationship is when F(n) is the fundamental frequency and F(x) is the There may be a case (this is referred to as case 2) where the low frequency is an L-order (L≧2) or L×X-order overtone of a frequency that is a common divisor of both.

【００１０】このどちらの場合であるかを判断するため
に、Ｆ（ｘ）とＦ（ｎ）の間に存在する周波数Ｆ（ｙ）
、およびパワーＰ（ｙ）（ｘ＜ｙ＜ｎ）について、以下
のような処理をする。Ｐ（ｙ）＞（Ｐ（ｘ）＋Ｐ（ｎ）
）／２であれば、Ｆ（ｙ）とＦ（ｘ）、Ｆ（ｎ）の周波
数差を計算する。いま、Ｆｘｙ＝Ｆ（ｘ）−Ｆ（ｙ）、
Ｆｙｎ＝Ｆ（ｙ）−Ｆ（ｎ）とする。ＦｘｙとＦｙｎの
小さい方をＦ０とする。Ｆ０がＦ（ｎ）、Ｆ（ｘ）を両
方とも倍音とする場合は、前述のケース２である可能性
が高いので、Ｆ（ｎ）は基本周波数でないと判断する。また、Ｆ（ｘ）が見つからなかった場合もＦ（ｎ）は基
本周波数でないと判断する。これ以外の場合、すなわち
Ｆ（ｍ）の中に存在するＦ（ｎ）の第Ｘ次倍音（Ｘ≧２
）となっている全てのＦ（ｘ）について、Ｆ（ｘ）とＦ
（ｎ）の間に存在するＦ（ｙ）から計算された全てのＦ
０の中に、Ｆ（ｎ）、Ｆ（ｘ）を両方とも倍音とするも
のがひとつもない場合、Ｆ（ｎ）は基本周波数であると
判断される。In order to determine which of these cases is the case, the frequency F(y) that exists between F(x) and F(n) is
, and power P(y) (x<y<n), the following processing is performed. P(y)>(P(x)+P(n)
)/2, calculate the frequency difference between F(y), F(x), and F(n). Now, Fxy=F(x)-F(y),
Let Fyn=F(y)−F(n). The smaller of Fxy and Fyn is set as F0. If F0 has both F(n) and F(x) as overtones, the above-mentioned case 2 is likely, and therefore F(n) is determined not to be the fundamental frequency. Also, if F(x) is not found, it is determined that F(n) is not the fundamental frequency. In other cases, that is, the Xth harmonic of F(n) existing in F(m) (X≧2
) for all F(x), F(x) and F
All F calculated from F(y) existing between (n)
If there is no one in which both F(n) and F(x) are overtones, F(n) is determined to be the fundamental frequency.

【００１１】以上により、Ｐ（ｎ）＞ＳｐかつＦ（ｎ）
が基本周波数であると判断された場合は（＃４，＃５で
ＹＥＳ）、そのピークは基本周波数であるとして次のピ
ークに処理を移す（＃６，＃７）。そうでない場合は、
Ｆ（ｎ）よりも低い周波数Ｆ（ｐ）（ｎ＜ｐ≦Ｎ）の中
に、Ｆ（ｎ）を第Ｚ次倍音（Ｚ≧２）とするＦ（ｚ）が
あるか探す（＃８，＃９）。Ｆ（ｚ）が見つかった場合
には、Ｆ（ｚ）のＺ次までの倍音系列をＦ（ｑ）（ｚ≦
ｑ≦ｎ）の中で探す。Ｆ（ｑ）の中にＺ個含まれるはず
のＦ（ｚ）の倍音系列のうち、実際にＦ（ｑ）の中で見
つかった個数をＷ個とし、倍音系列含有度Ｗ／Ｚを計算
する（＃１０）。Ｗ／Ｚがあるしきい値以上の場合には
（＃１１でＹＥＳ）、Ｆ（ｎ）はＦ（ｚ）の倍音である
として除去する（＃１２）。条件を満たすＦ（ｚ）が見
つからなかった場合には、Ｆ（ｎ）を除去しない。なお
、Ｆ（ｎ）を除去した場合でも以降の処理に支障をきた
すことのないように、Ｆ（ｎ）にピークが存在したとい
う情報は残しておく。以上の処理をＮ個のピーク全てに
ついて行ったら、次のフレームに処理を移す。ＦＦＴ処
理部２２でパワー・スペクトルを算出した全てのフレー
ムについて以上の処理を終了した時、基本周波数候補抽
出部２３の処理が終了する。From the above, P(n)>Sp and F(n)
If it is determined that is the fundamental frequency (YES in #4, #5), that peak is determined to be the fundamental frequency, and processing is shifted to the next peak (#6, #7). If not,
Search for F(z) with F(n) as the Z-th harmonic (Z≧2) among the frequencies F(p) (n<p≦N) lower than F(n) (#8 , #9). If F(z) is found, the overtone series up to the Zth order of F(z) is expressed as F(q)(z≦
Search within q≦n). Of the Z overtone series of F(z) that should be included in F(q), let W be the number actually found in F(q), and calculate the overtone series content W/Z. (#10). If W/Z is above a certain threshold (YES in #11), F(n) is considered to be an overtone of F(z) and is removed (#12). If F(z) that satisfies the conditions is not found, F(n) is not removed. Note that even if F(n) is removed, the information that a peak exists in F(n) is left so as not to interfere with subsequent processing. After the above processing is performed for all N peaks, processing is moved to the next frame. When the FFT processing section 22 completes the above processing for all frames for which power spectra have been calculated, the processing of the fundamental frequency candidate extraction section 23 ends.

【００１２】図６は、図５のデータに基本周波数候補抽
出部２３による倍音除去の処理を施した後のデータであ
る。図６では、図５に比べて、倍音除去により不要なデ
ータが除去されていることが分かる。本発明は上記の実
施例に限定するものではなく、その趣旨を逸脱しない範
囲において種々の変更を加えることができる。例えば、
本実施例においては、ＦＦＴ処理部２２のフレーム毎に
基本周波数候補抽出部２３の処理を行ったが、予め近接
するフレーム間でパワー・スペクトルの安定部が続く区
間を一つの分析区間として設定し、その分析区間毎に以
降の処理を行うことも可能である。FIG. 6 shows the data after the data shown in FIG. 5 has been subjected to overtone removal processing by the fundamental frequency candidate extraction section 23. In FIG. 6, compared to FIG. 5, it can be seen that unnecessary data has been removed by overtone removal. The present invention is not limited to the above embodiments, and various changes can be made without departing from the spirit thereof. for example,
In this embodiment, the fundamental frequency candidate extracting section 23 processes each frame of the FFT processing section 22, but the section in which the stable part of the power spectrum continues between adjacent frames is set in advance as one analysis section. , it is also possible to perform subsequent processing for each analysis section.

【００１３】[0013]

【発明の効果】以上のように本発明によれば、Ａ／Ｄ変
換して取り込んだ音楽信号を周波数解析し、その結果か
ら倍音を除去することによって、実際に鳴っている音の
基本周波数のみを取り出すようにしているので、楽器の
種類や楽器数に制約を受けずに基本周波数を抽出するこ
とができ、採譜することが容易となる。As described above, according to the present invention, by frequency-analyzing the A/D-converted music signal and removing overtones from the result, only the fundamental frequency of the sound actually being played can be obtained. , the fundamental frequency can be extracted without being restricted by the type of instrument or the number of instruments, making it easy to transcribe.

[Brief explanation of the drawing]

【図１】　　本発明の一実施例による自動採譜装置のブ
ロック図である。FIG. 1 is a block diagram of an automatic score transcription apparatus according to an embodiment of the present invention.

【図２】　　自動採譜装置の基本周波数抽出までの機能
構成図である。FIG. 2 is a functional configuration diagram of the automatic score transcription device up to fundamental frequency extraction.

【図３】　　基本周波数候補抽出部２３の構成図である
。FIG. 3 is a configuration diagram of a fundamental frequency candidate extraction unit 23.

【図４】　　基本周波数候補抽出部２３における処理を
示すフローチャートである。FIG. 4 is a flowchart showing processing in the fundamental frequency candidate extracting unit 23.

【図５】　　ＦＦＴ処理部２２により算出された、倍音
除去処理前のパワー・スペクトルの説明図である。FIG. 5 is an explanatory diagram of a power spectrum calculated by the FFT processing unit 22 before overtone removal processing.

【図６】　　図５に基本周波数候補抽出部２による倍音
除去処理を施した後のパワー・スペクトルの説明図であ
る。6 is an explanatory diagram of the power spectrum after the overtone removal process is performed by the fundamental frequency candidate extraction unit 2 in FIG. 5. FIG.

[Explanation of symbols]

３　　Ａ／Ｄ変換装置５　　ＣＰＵ６　　ＲＡＭ２１　　信号取り込み部２２　　ＦＦＴ処理部２３　　基本周波数候補抽出部３１　　パワーのしきい値計算手段３２　　基本周波数らしさ判定手段３３　　基本周波数検索手段３４　　倍音系列含有度計算手段３５　　倍音除去手段 3 A/D conversion device 5 CPU 6 RAM 21 Signal acquisition section 22 FFT processing section 23 Fundamental frequency candidate extraction unit 31 Power threshold calculation means 32 Fundamental frequency likeness determination means 33 Fundamental frequency search means 34 Overtone series content calculation means 35 Overtone removal means

Claims

[Claims]

Claim 1. A score transcription device that converts a music signal into a musical score or a code corresponding to the musical score, which includes a signal acquisition section that takes in the music signal and performs A/D conversion, and a signal acquisition section that converts the music signal into an A/D converter, and a signal acquisition section that converts the music signal into a musical score or a code corresponding to the musical score. a frequency analysis processing unit that calculates a power spectrum in the frequency direction within a certain period of time by performing frequency analysis at each time; and a fundamental frequency candidate extraction unit that extracts fundamental frequency candidates from the power spectrum. An automatic music transcription device featuring:

2. The fundamental frequency candidate extraction section includes a power threshold calculation means for calculating a power threshold from a peak of a power spectrum, and one of the peaks of the power spectrum is a fundamental frequency candidate of a performance sound. The fundamental frequency, which is determined based on the frequency and power of the peak that is a harmonic of the original peak among other peaks at higher frequencies, the original peak, and the peaks between them. similarity determining means, fundamental frequency searching means for searching whether one of the peaks of the power spectrum is a harmonic of any of the other peaks having a lower frequency, and the fundamental frequency searching means a harmonic series content calculating means for calculating to what extent the harmonic series of the searched peak is included in the peaks of the power spectrum; 2. The automatic score transcription apparatus according to claim 1, further comprising an overtone removing means for removing the peak determined to be an overtone from the power spectrum when the peak is determined to be an overtone.