JP2005234303A

JP2005234303A - Performance sound decision apparatus and performance sound decision program and recording medium with the program recorded therein

Info

Publication number: JP2005234303A
Application number: JP2004044587A
Authority: JP
Inventors: Ren Sumida; 錬澄田; Naoyuki Tanaka; 直行田中
Original assignee: Kawai Musical Instrument Manufacturing Co Ltd
Current assignee: Kawai Musical Instrument Manufacturing Co Ltd
Priority date: 2004-02-20
Filing date: 2004-02-20
Publication date: 2005-09-02
Anticipated expiration: 2024-02-20
Also published as: JP4492932B2

Abstract

<P>PROBLEM TO BE SOLVED: To reduce an attack detecting delay of a performance sound of a piano inputted from a microphone in a performance sound decision apparatus, a performance sound decision program and a recording medium which are suitable to detect the attack (rise of a sound of a musical instrument) as basic information of musical sound decision. <P>SOLUTION: A spectrum generating part 12 calculates a power spectrum by performing the FFT processing of digital data of the performance sound of the piano played according to a playing instruction for every FFT window. An attack detecting part 13 decides whether there is the attack of the performance sound or not depending on whether the power of the calculated power spectrum is increased for more than a scheduled value from that of the calculation of the last time. A spectrum comparing part 14 decides whether a distance between the calculated power spectrum and that of a pitch which is included in the playing instruction among power spectra for every pitch in which power spectra calculated after the decision of the presence of the attack are preliminarily stored is near or not. Moreover, the apparatus is equipped with a means which shifts mutual windows so that front and rear FFT windows are partially overlapped with each other in the FFT processing. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、自然楽器の演奏音判定システムに関し、特に、楽音判定の基礎情報としてのアタック（楽器音の立ち上がり）を検出するのに好適な演奏音判定装置、演奏音判定プログラムおよび記録媒体に関する。 The present invention relates to a performance sound determination system for a natural musical instrument, and more particularly to a performance sound determination apparatus, a performance sound determination program, and a recording medium suitable for detecting an attack (rising of a musical instrument sound) as basic information for musical sound determination.

例えば、特公平２−７０５号公報に開示されているように、鍵盤楽器の演奏を独習するための押鍵指示手段を有する電子楽器が知られている。この電子楽器では、記憶された演奏情報が表示回路に順次読み出され、鍵盤の各鍵に対応して設けられた表示ランプがこの演奏情報に従って点灯される。演奏者による操作鍵はキースキャン回路で検出され、この操作鍵の音高が表示ランプで指示された演奏情報の音高と一致した場合に次の演奏情報が表示回路に読み出される。 For example, as disclosed in Japanese Patent Publication No. 2-705, an electronic musical instrument having a key pressing instruction unit for self-studying the performance of a keyboard instrument is known. In this electronic musical instrument, stored performance information is sequentially read out to a display circuit, and a display lamp provided corresponding to each key of the keyboard is turned on according to this performance information. An operation key by the performer is detected by a key scanning circuit, and when the pitch of the operation key matches the pitch of the performance information indicated by the display lamp, the next performance information is read out to the display circuit.

しかし、このような押鍵指示手段は、キースキャン回路を備えた電子楽器では容易に実現できるが、キーススキャン回路を持たないアコースティックピアノのような自然楽器では実現が困難である。自然楽器にキースキャン回路を設けることが考えられるが、キースキャン回路に必要なキースイッチや表示ランプを鍵毎に後付けする作業が繁雑であり、手軽には行えない。 However, such a key pressing instruction means can be easily realized with an electronic musical instrument provided with a key scan circuit, but is difficult to realize with a natural musical instrument such as an acoustic piano without a key scan circuit. Although it is conceivable to provide a key scan circuit for a natural musical instrument, the work of retrofitting key switches and display lamps necessary for the key scan circuit for each key is complicated and cannot be performed easily.

キースキャン回路を用いる代わりに、演奏による楽音の周波数分析により操作鍵を検出することが考えられる。例えば、特公平１−５１２００公報に記載された表示装置では、音声信号の周波数分析結果に基づいて判定された音声信号の音高を、ディスプレイ上の五線譜に音符として表示する。 Instead of using a key scan circuit, it is conceivable to detect an operation key by frequency analysis of musical sounds by performance. For example, in the display device described in Japanese Patent Publication No. 1-51200, the pitch of the audio signal determined based on the frequency analysis result of the audio signal is displayed as a musical note on a staff score on the display.

しかし、ピアノ音のような倍音成分を多く含む楽音の音高検出では、押鍵の倍音成分つまり高調波成分によって音高を誤判断してしまったり、オクターブ間隔の二つの和音と該和音の下の単音との判別（例えば、Ｃ３とＣ４の和音とＣ３単音との判別）ができなかったりすることがある。これらは互いに倍音構成が似ているからである。 However, when detecting the pitch of a musical tone that contains a lot of harmonic components such as piano sounds, the pitch may be misjudged by the harmonic component of the key press, that is, the harmonic component, or two chords with an octave interval and (For example, discrimination between a chord of C3 and C4 and a C3 single tone) may not be possible. This is because these harmonic structures are similar to each other.

また、同時に複数の基音とその倍音を発生する楽器の演奏に対応したＭＩＤＩコードを生成する装置が提案されている（特許第２８９０８３１号公報）。この装置では、演奏楽音の波形データから基音に対応した周波数成分の波形データを抽出し、かつそのエンベロープデータを検出する手段を備える一方、楽器が発生する各基音毎にそのエンベロープ波形に対応して基音エンベロープデータを記憶している記憶手段を備え、検出されたエンベロープデータと記憶されている基音エンベロープデータとを相互相関によって比較し、楽器音を判定する。この装置では、構成が非常に複雑であって、特に、ソフトウェアでエンベロープデータの比較や楽器音の判定をリアルタイムで処理するのが困難である。 In addition, an apparatus for generating a MIDI code corresponding to the performance of a musical instrument that generates a plurality of fundamental sounds and their harmonics simultaneously has been proposed (Japanese Patent No. 2890831). This device is provided with means for extracting waveform data of frequency components corresponding to the fundamental tone from the waveform data of the performance music sound and detecting the envelope data, while corresponding to the envelope waveform for each fundamental tone generated by the instrument. A storage means for storing the fundamental tone envelope data is provided, and the detected envelope data and the stored fundamental tone envelope data are compared with each other by cross-correlation to determine the instrument sound. This apparatus has a very complicated configuration, and in particular, it is difficult to process envelope data comparison and instrument sound determination in real time with software.

特許第２８０６０４７号公報には、シンセサイザで種々の楽器音を種々の音程で順次組合わせて作成した信号と入力信号とを比較して楽音から採譜する装置が提案されている。しかし、この装置では、和音も含めたすべての組み合わせの音と比較する必要があるので、計算時間が膨大になるという問題がある。
特公平２−７０５号公報特公平１−５１２００公報特許第２８９０８３１号公報特許第２８０６０４７号公報 Japanese Patent No. 2880647 proposes a device that compares a signal created by sequentially combining various instrument sounds with various pitches by a synthesizer with an input signal, and recording from musical sounds. However, in this apparatus, since it is necessary to compare with all combinations of sounds including chords, there is a problem that calculation time becomes enormous.
Japanese Patent Publication No. 2-705 Japanese Patent Publication No. 1-51200 Japanese Patent No. 2890831 Japanese Patent No. 2880647

上述のように、キースキャン手段を有していないアコースティックピアノでは、指示通りに押鍵できたかどうかの判定ができないし、マイクロフォンから入力された楽音から押鍵の正確さをリアルタイムで判定するのは容易ではないという問題点があった。 As described above, in an acoustic piano that does not have a key scanning means, it is not possible to determine whether or not the key has been depressed as instructed, and it is not possible to determine the accuracy of the key depression in real time from the musical sound input from the microphone. There was a problem that it was not easy.

そこで、本出願人は、各音高のパワースペクトルを単音モデルとして記憶しておき、押鍵指示に従って演奏された音のパワースペクトルとこの単音モデルとを比較し、両者のパワースペクトルの距離が予定の範囲内の場合に押鍵指示を次に進める方法を提案した（特願２００３−１３４３７２号）。この方法は、演奏音からピッチ等のパラメータを検出するのと比較して計算が単純で、しかも、演奏しようとする楽器を実際に弾いて、その音に基づいてモデルを作成できるので、調律のずれ具合にかかわらず演奏音が正しいかどうかを判定することができる。 Therefore, the applicant stores the power spectrum of each pitch as a single-tone model, compares the power spectrum of the sound played according to the key-pressing instruction with this single-tone model, and determines the distance between the two power spectra. In this case, a method has been proposed in which a key pressing instruction is advanced (Japanese Patent Application No. 2003-134372). This method is simpler to calculate than parameters such as pitch from the performance sound, and can play a musical instrument to be played and create a model based on that sound. Whether or not the performance sound is correct can be determined regardless of the deviation.

この方法では、アタックのパワースペクトル同士を比較して演奏音が正しいかどうかを判断するのが好ましい。アタック検出は、マイクロフォンからの入力音に対する高速フーリエ変換（ＦＦＴ）を行い、検出されたパワースペクトルが、前回検出されたパワースペクトルに対してどの程度増加しているかによって行うことができる。 In this method, it is preferable to compare the power spectra of the attacks to determine whether or not the performance sound is correct. Attack detection can be performed by performing Fast Fourier Transform (FFT) on the input sound from the microphone, and how much the detected power spectrum is increased with respect to the power spectrum detected last time.

ＦＦＴは所定のＦＦＴ窓の長さ毎に区切って実施され、当該窓時間でアタックが検出されない場合は、次の窓時間内で再度ＦＦＴを実施する。しかし、ＦＦＴの窓時間は長いので、このように窓の長さ毎に区切って処理を行っていたのではアタックの検出が遅れてしまう。特に、アルペジオのように複数音を素速く演奏する場合にはアタック検出が間に合わない。例えば、アコースティックピアノの最高音の場合、音の持続時間が約４０ｍ秒程度しかないが、１１０２５Ｈｚで１０２４ポイントのＦＦＴでは窓時間が約９３ｍ秒となり、アタックの瞬間を捉えることができない。 The FFT is performed for each predetermined FFT window length, and when an attack is not detected within the window time, the FFT is performed again within the next window time. However, since the FFT window time is long, the detection of an attack is delayed if processing is performed by dividing each window length in this way. In particular, when multiple sounds are played quickly like an arpeggio, attack detection is not in time. For example, in the case of the highest sound of an acoustic piano, the duration of the sound is only about 40 milliseconds, but in the case of an FFT of 1024 points at 11025 Hz, the window time is about 93 milliseconds, and the moment of attack cannot be captured.

本発明は、上記課題に鑑み、ピアノ等、自然楽器が指示通りに弾けたかどうかを判定する際に必要なアタックの検出を確実に行うことができる自然楽器の演奏音判定システムを提供することを目的とする。 In view of the above-described problems, the present invention provides a performance sound determination system for a natural instrument that can reliably detect an attack necessary for determining whether a natural instrument such as a piano has been played as instructed. Objective.

上記の課題を解決し、目的を達成するための本発明は、演奏指示に従って演奏された自然楽器演奏音のデジタルデータに対してＦＦＴ窓毎にＦＦＴ処理をして該デジタルデータのパワースペクトルを計算するＦＦＴ手段と、前記各ＦＦＴ処理毎に計算されたパワースペクトルのパワーが前回計算時より予定以上増大したか否かによって該自然楽器演奏音のアタックがあったか否かを判断するアタック検出手段と、前記アタックがあったと判定された以降に算出されたパワースペクトルと予め記憶されている自然楽器音の音高毎のパワースペクトルのうち前記演奏指示に含まれる音高のものとの距離が予定のしきい値以内であるか否かを判断する比較手段と、前記ＦＦＴ手段のＦＦＴ窓を、前後のＦＦＴ窓同士が部分的に重複するように設定する手段とを具備した点に第１の特徴がある。 In order to solve the above-described problems and achieve the object, the present invention calculates the power spectrum of the digital data by performing FFT processing for each FFT window on the digital data of the natural musical instrument performance sound performed according to the performance instruction. FFT means for performing an attack detection means for determining whether or not there is an attack of the natural musical instrument performance sound based on whether or not the power of the power spectrum calculated for each FFT processing has increased more than expected from the previous calculation time; The distance between the power spectrum calculated after it is determined that the attack has occurred and the power spectrum for each pitch of the natural musical instrument sound stored in advance from the pitch included in the performance instruction is scheduled. The comparison means for determining whether or not the threshold value is within the threshold value and the FFT window of the FFT means are set so that the preceding and following FFT windows partially overlap each other. There is a first feature in that and means.

また、本発明は、前記演奏指示を、予め選択された曲の演奏情報に従ってコンピュータの表示装置に表示するとともに、前記距離が予定のしきい値以内であるときに、前記演奏情報に従って次の演奏指示を表示する押鍵指示手段を具備した点に第２の特徴がある。 Further, the present invention displays the performance instruction on a display device of a computer in accordance with performance information of a preselected song, and when the distance is within a predetermined threshold, the next performance is in accordance with the performance information. A second feature is that a key pressing instruction means for displaying an instruction is provided.

また、本発明は、前記ＦＦＴ窓同士の重複期間を決定するシフト時間を予め設定しておき、後続の前記ＦＦＴ手段および前記アタック検出手段の処理が開始される時間までに、先行する前記ＦＦＴ手段および前記アタック検出手段の処理が終了しない場合に前記シフト時間を拡大する手段をさらに備えた点に第３の特徴がある。 In the present invention, a shift time for determining an overlap period between the FFT windows is set in advance, and the preceding FFT means is processed by the time when processing of the subsequent FFT means and attack detection means is started. A third feature is that the apparatus further includes means for extending the shift time when the processing of the attack detection means does not end.

また、本発明は、前記演奏情報に基づいて、演奏速度が速い所定のフレーズを予め検出する高速部検出手段手段と、前記フレーズに対応する演奏指示の演奏音に対しては、前記アタック検出手段による処理を省略するとともに、前記比較手段による判断を、アタックの有無に拘わらず、演奏指示後の入力音について実行する手段を備えた点に第４の特徴がある。 Further, the present invention provides a high-speed unit detecting means for detecting a predetermined phrase having a high performance speed based on the performance information in advance, and the attack detecting means for a performance sound of a performance instruction corresponding to the phrase. There is a fourth feature in that the processing by the comparison means is omitted, and the judgment by the comparison means is executed for the input sound after the performance instruction regardless of the presence or absence of the attack.

第１の特徴によれば、演奏指示に従って演奏された楽器音のパワースペクトルの増大量が予定以上であったときにアタックがあったと判断される。そして、この楽器演奏音のパワースペクトルが予め記憶されている自然楽器音の音高毎のパワースペクトルのうち前記演奏指示に含まれる音高のものと一致したときに指示どおりに演奏されたと判定されて演奏指示が進行される。 According to the first feature, it is determined that there has been an attack when the amount of increase in the power spectrum of the musical instrument sound performed in accordance with the performance instruction is greater than or equal to a predetermined amount. Then, when the power spectrum of the musical instrument performance sound matches the one of the pitches included in the performance instruction among the power spectra of the natural musical instrument sounds stored in advance, it is determined that the musical instrument performance sound was played as instructed. The performance instruction is advanced.

特に、アタックの判断ではＦＦＴ窓が部分的に重複するように設定されるので、先のＦＦＴ窓に関する計算に続く次のＦＦＴ窓に関する計算は、入力データがＦＦＴ窓の長さの分だけ新たに入力されなくても、ＦＦＴ窓を部分的に重複させるためにシフトさせた時間分のデータが先の計算後に入力されていれば実行できる。したがって、先行するＦＦＴ窓の直後に現れているアタックは、次にＦＦＴ窓の長さ分のデータが入力されるまで待たなくても、先のＦＦＴ窓に関する計算に続けて短時間内に実行される次のＦＦＴ窓に関する計算で検出される。 In particular, since the FFT window is set so as to partially overlap in the attack determination, the calculation related to the next FFT window following the calculation related to the previous FFT window is newly performed by the amount of the length of the FFT window. Even if it is not input, it can be executed if data for the time shifted to partially overlap the FFT window is input after the previous calculation. Therefore, an attack appearing immediately after the preceding FFT window is executed within a short time following the calculation related to the previous FFT window, without waiting for the next input of data corresponding to the FFT window length. Detected in the next FFT window.

第２の特徴によれば、前記演奏指示は、コンピュータの表示装置に表示され、演奏音のパワースペクトルと予め記憶されている自然楽器音のパワースペクトルとが略一致したときに演奏指示が進行される。 According to the second feature, the performance instruction is displayed on a display device of a computer, and the performance instruction is advanced when the power spectrum of the performance sound substantially matches the power spectrum of the natural musical instrument sound stored in advance. The

第３の特徴によれば、コンピュータの能力に応じて、ＦＦＴ窓の重複時間を決定するシフト時間を調節することができるので、コンピュータの能力内でできるだけ高速でアタックの有無を判断することができる。 According to the third feature, since the shift time for determining the overlap time of the FFT window can be adjusted according to the capability of the computer, it is possible to determine whether or not there is an attack as fast as possible within the capability of the computer. .

アルペジオ等、演奏速度が速い演奏ではアタックの有無を判断してからパワースペクトルの比較を行ったのでは、演奏者の速い演奏に判定が追いつかない。第４の特徴によれば、このような速い演奏においては、アタックの有無を判断することなくパワースペクトルの一致を判断することができる。 For performances such as arpeggios, where the performance speed is high, comparing the power spectrum after determining the presence or absence of an attack does not catch up with the player's fast performance. According to the fourth feature, in such a fast performance, it is possible to determine the coincidence of power spectra without determining the presence or absence of an attack.

以下、図面を参照して本発明を詳細に説明する。図１は、本発明の楽音判定装置の一実施形態に係るアコースティックピアノ（以下、特に電子ピアノと区別しない場合は、単に「ピアノ」と呼ぶ）のレッスンシステムのブロック構成図である。このレッスンシステムでは、予め準備された演奏情報に基づいてピアノの鍵もしくは音高の指示、および押鍵タイミングの指示を含む押鍵指示を表示し、演奏者はこの押鍵指示に従って順次演奏を進めていく。具体的処理は、モデル作成フェーズとレッスンフェーズとからなる。モデル作成フェーズでは、レッスンに先立ち、レッスンに使用するピアノの各鍵の発生音のパワースペクトルを単音モデルとして記憶する。そして、レッスンモードでは、押鍵指示のための表示を行うとともに（押鍵指示は、図１２に関して後述する）、押鍵指示に従って行われる演奏音のパワースペクトルと前記単音モデルとを比較し、両者のパワースペクトルが予定の距離内にあれば押鍵指示を先に進める。 Hereinafter, the present invention will be described in detail with reference to the drawings. FIG. 1 is a block diagram of a lesson system of an acoustic piano (hereinafter, simply referred to as “piano” unless otherwise distinguished from an electronic piano) according to an embodiment of a musical tone determination apparatus of the present invention. In this lesson system, key press instructions including piano key or pitch instructions and key press timing instructions are displayed based on pre-prepared performance information, and the performer sequentially advances the performance according to the key press instructions. To go. Specific processing consists of a model creation phase and a lesson phase. In the model creation phase, prior to the lesson, the power spectrum of the generated sound of each piano key used in the lesson is stored as a single tone model. In the lesson mode, a key press instruction is displayed (the key press instruction will be described later with reference to FIG. 12), and the power spectrum of the performance sound performed according to the key press instruction is compared with the single tone model. If the power spectrum is within the planned distance, the key pressing instruction is advanced.

図１に示すレッスンシステムは、ＣＰＵ１、ＲＯＭ２、ＲＡＭ３、音源装置４、スピーカ５、Ａ／Ｄ変換器６、入力装置（キーボードやマウス）７、および表示装置８を含むパーソナルコンピュータで要部を構成することができる。表示装置８としては液晶ディスプレイやブラウン管等、パーソナルコンピュータの処理結果を表示するための周知の表示手段を使用することができる。パーソナルコンピュータには外部記憶装置９が接続され、かつＡ／Ｄ変換器６を介してマイクロフォン１０が接続される。マイクロフォン１０はピアノＰの発生音を採取するために設けられるものであり、ピアノＰ内に配置されるのが望ましい。 The lesson system shown in FIG. 1 includes a personal computer including a CPU 1, ROM 2, RAM 3, sound source device 4, speaker 5, A / D converter 6, input device (keyboard or mouse) 7, and display device 8. can do. As the display device 8, a known display means for displaying the processing result of the personal computer, such as a liquid crystal display or a cathode ray tube, can be used. An external storage device 9 is connected to the personal computer, and a microphone 10 is connected via the A / D converter 6. The microphone 10 is provided for collecting the sound generated by the piano P, and is preferably arranged in the piano P.

ＣＰＵ１は、音量測定部１１、スペクトル作成部１２，アタック検出部１３、スペクトル比較部１４、および押鍵指示部１５を要部機能として備える。音量測定部１１は、マイクロフォン１０から入力された音のレベル（音量）を検出する。スペクトル作成部１２は、マイクロフォン９およびＡ／Ｄ変換器６を通じて入力されたデジタル楽音信号からパワースペクトルを得るＦＦＴ機能を有する。ＦＦＴによって得られたパワースペクトルはＲＡＭ３に記憶される。アタック検出部１３は、検出されたパワースペクトルに基づいてアタックを検出する。スペクトル比較部１４はモデル作成フェーズおよびレッスンフェーズにおいてスペクトル作成部１２でそれぞれ作成されたパワースペクトルの距離を比較して演奏が押鍵指示通り行われたかどうかを判断する。押鍵指示部１５は、演奏情報に従って表示装置８への押鍵指示表示のための表示データを作成する。表示データは入力装置７から入力される演奏開始指示に応答して作成され表示装置８に入力される。 The CPU 1 includes a volume measuring unit 11, a spectrum creating unit 12, an attack detecting unit 13, a spectrum comparing unit 14, and a key pressing instruction unit 15 as main functions. The volume measuring unit 11 detects the level (volume) of the sound input from the microphone 10. The spectrum creating unit 12 has an FFT function for obtaining a power spectrum from a digital musical tone signal input through the microphone 9 and the A / D converter 6. The power spectrum obtained by the FFT is stored in the RAM 3. The attack detection unit 13 detects an attack based on the detected power spectrum. The spectrum comparison unit 14 compares the distances of the power spectra created by the spectrum creation unit 12 in the model creation phase and the lesson phase, and determines whether or not the performance has been performed according to the key depression instruction. The key pressing instruction unit 15 creates display data for displaying a key pressing instruction on the display device 8 according to the performance information. The display data is created in response to a performance start instruction input from the input device 7 and input to the display device 8.

図２は、レッスンで表示装置８に押鍵指示を表示したり伴奏をしたりするのに使用される演奏情報のフォーマット例である。演奏情報には、少なくともイベントデータ、およびイベントデータの読み出しタイミングを指示するタイミングデータとが記憶されている。イベントデータはノートナンバを含むノートオンデータおよびノートオフデータからなる。タイミングデータは、例えば、一つのイベント終了から次のイベント発生までの時間情報として設定される。イベントデータとタイミングデータは、図示のように、アドレス進行に従って記憶される。この演奏情報は、ＲＡＭ３もしくは外部記憶装置９に記憶しておくことができる。 FIG. 2 is a format example of performance information used for displaying a key pressing instruction on the display device 8 and performing accompaniment during a lesson. The performance information stores at least event data and timing data for instructing the reading timing of the event data. The event data is composed of note-on data and note-off data including a note number. The timing data is set as time information from the end of one event to the next event occurrence, for example. Event data and timing data are stored according to address progression as shown. This performance information can be stored in the RAM 3 or the external storage device 9.

図３は、モデル作成フェーズに関するＣＰＵ１の処理を示すフローチャートである。ステップＳ１では、曲の選択を促す画面を表示装置８に表示する。選曲されたならば（ステップＳ２肯定）、ステップＳ３に進んで、選ばれた曲の演奏情報を走査してその曲の音域を検出する。この音域で単音モデルを作成するためである。単音モデル作成する音域は、選曲に従って行うのでなくてもよい。例えば、演奏者が直接音域を入力してもよいし、選曲に代えて技量のレベルを選択できるように画面表示してもよい（図９に関して後述）。また、演奏者が直接音域を入力する際の参考となるように選曲された曲の音域を表示してもよい（図１０に関して後述）。 FIG. 3 is a flowchart showing the processing of the CPU 1 regarding the model creation phase. In step S <b> 1, a screen for prompting selection of music is displayed on the display device 8. If a song is selected (Yes at step S2), the process proceeds to step S3, and the performance information of the selected song is scanned to detect the range of the song. This is because a single tone model is created in this range. The tone range for creating a single tone model does not have to be in accordance with music selection. For example, the performer may input the range directly, or the screen may be displayed so that the level of skill can be selected instead of the music selection (described later with reference to FIG. 9). Further, the musical range of the selected music may be displayed for reference when the performer directly inputs the musical range (described later with reference to FIG. 10).

ステップＳ４では、収録する単音モデルの音高を決定する。最初は上記音域中の最低音を収録音高として決定する。ステップＳ５では、単音モデル収録用の押鍵指示（レッスンフェーズでの押鍵指示と区別して、「モデル押鍵指示」という）として表示装置８に収録音高を表示する（図１１に関して後述する）。モデル押鍵指示は、表示装置８によらず音源装置４を使って音高に対応した楽音データを作成し、スピーカ５で発音させるようにしてもよい。また、発音による指示は表示装置８による表示と併せて行ってもよい。 In step S4, the pitch of the single tone model to be recorded is determined. First, the lowest pitch in the above range is determined as the recorded pitch. In step S5, the recorded pitch is displayed on the display device 8 as a key pressing instruction for single-tone model recording (referred to as “model key pressing instruction” in distinction from the key pressing instruction in the lesson phase) (described later with reference to FIG. 11). . For the model key pressing instruction, the musical tone data corresponding to the pitch may be created using the sound source device 4 without using the display device 8 and may be generated by the speaker 5. Further, the instruction by pronunciation may be performed together with the display by the display device 8.

ステップＳ６では、音量測定部１１での音量測定を開始する。音量測定は離鍵指示まで継続される。ステップＳ７で音量がアタック検出のためのしきい値を超えているか否かが判断され、この判断が肯定ならばアタックを検出したとしてステップＳ８に進む。ステップＳ８では、ＦＦＴによりパワースペクトルを計算し、このパワースペクトルおよびパワースペクトル計算時の音量をＲＡＭ３に一時的に保存する。 In step S6, the sound volume measurement by the sound volume measuring unit 11 is started. Volume measurement continues until a key release instruction is issued. In step S7, it is determined whether or not the sound volume exceeds a threshold for attack detection. If this determination is affirmative, it is determined that an attack has been detected, and the process proceeds to step S8. In step S8, a power spectrum is calculated by FFT, and the power spectrum and the volume at the time of power spectrum calculation are temporarily stored in the RAM 3.

ステップＳ９では、表示装置８に離鍵を促す指示をする。離鍵指示は表示装置１１での表示またはスピーカ５による音もしくは音声の指示であってもよい。ステップＳ１０では、音量が無音検出のためのしきい値以下になったかどうかをを判断する。音量がこのしきい値以下になれば実質的に無音と判断され、ステップＳ１１に進む。 In step S9, the display device 8 is instructed to release the key. The key release instruction may be a display on the display device 11 or a sound or voice instruction from the speaker 5. In step S10, it is determined whether or not the volume is equal to or lower than a threshold value for detecting silence. If the volume falls below this threshold, it is determined that the sound is substantially silent, and the process proceeds to step S11.

ステップＳ１１では、モデル押鍵指示から離鍵指示までの音量の最大値が許容範囲に入っているかどうかが判断される。最大音量が許容範囲にない場合は、極端に強くまたは極端に弱く弾かれたためと判断され、単音モデルとして適当でないので、再度押鍵を要求するためにステップＳ５に進む。アタックを検出したとみなしたときの音量から判断するのではなく、測定した音量の最大値によって押鍵強さが適当であるかどうかを判断するのは、アタック時点で音量が最大になるとは限らず、通常はアタック検出後に音量は最大値となるからである。なお、押鍵強さは、音量によって判断するのに代えて振幅の最大値によって判断してもよい。 In step S11, it is determined whether the maximum value of the volume from the model key press instruction to the key release instruction is within the allowable range. If the maximum volume is not within the allowable range, it is determined that the sound has been played extremely strongly or extremely weakly and is not suitable as a single-tone model, so the process proceeds to step S5 to request a key press again. Rather than judging from the sound volume when it is considered that an attack has been detected, judging whether or not the key-pressing strength is appropriate based on the maximum value of the measured sound volume does not necessarily mean that the sound volume is maximum at the time of the attack. This is because the sound volume normally reaches the maximum value after detecting the attack. Note that the key pressing strength may be determined by the maximum value of the amplitude instead of determining by the volume.

正しく押鍵されたと判断されたならば、ステップＳ１２に進み、ステップＳ８で一時的に保存したパワースペクトルおよび音量をこの音高の単音モデルとして対応づけてＲＡＭ３もしくは外部記憶装置９に記憶する。 If it is determined that the key has been correctly pressed, the process proceeds to step S12, and the power spectrum and the volume temporarily stored in step S8 are stored in the RAM 3 or the external storage device 9 in correspondence with the single tone model of the pitch.

ステップＳ１１が否定の場合は、押鍵強さを変えて弾くようにメッセージを表示してからステップＳ５に進むようにしてもよい。例えば、許容範囲に対して音量が大きいときは「もっと弱く弾いてください」と表示し、許容範囲に対して音量が小さいときは「もっと強く弾いてください」と表示する。 If step S11 is negative, a message may be displayed so that the key-pressing strength is changed and then the process proceeds to step S5. For example, when the volume is higher than the allowable range, “Please play weaker” is displayed, and when the volume is lower than the allowable range, “Play more strongly” is displayed.

ステップＳ１３では、ステップＳ３で決定された音域の最終音（最高音）の単音モデルを作成したかどうかを判断し、最終音の単音モデルを作成したのでなければ、ステップＳ１４で、音高を１段階高くした後、ステップＳ５に進む。ステップＳ１４では、音高を音階に従って直上の音高に上げるのではなく、白鍵および黒鍵について、それぞれ個別に連続して押鍵できるように音高を上げていくのがよい。音階に従って順に音高を上げていくと白鍵と黒鍵を交互に弾く場合が生じ、モデル押鍵指示と対応しない鍵を弾く誤りが起こりやすいからである。 In step S13, it is determined whether or not a single tone model of the final tone (highest tone) determined in step S3 has been created. If no single tone model of the final tone has been created, the pitch is set to 1 in step S14. After increasing the level, go to step S5. In step S14, it is preferable to raise the pitch so that the white key and the black key can be pressed individually separately, instead of raising the pitch to the pitch immediately above according to the scale. This is because if the pitch is raised in order according to the scale, the white key and the black key may be played alternately, and an error in playing a key that does not correspond to the model key pressing instruction is likely to occur.

最終音まで単音モデルを作成したならば、ステップＳ１３は肯定となって、このモデル作成フェーズの処理を終了する。 If a single tone model is created up to the final sound, step S13 becomes affirmative and the process of this model creation phase is terminated.

図４は、レッスンフェーズに関するＣＰＵ１の処理を示すフローチャートである。ステップＳ２１では、ステップＳ１２で記憶した単音モデルを読み込む。ここでは最新に記憶された単音モデル、もしくは前回のレッスンで使用した単音モデルを自動的に読み込んでもよいし、記憶されている単音モデルを表示装置８に表示して演奏者が選択したものを読み込んでもよい。ステップＳ２２では、レッスンする曲の選択を促す画面を表示装置８に表示する。この画面は、ＲＡＭ３もしくは外部記憶装置９に予め記憶されているレッスン曲データに基づいて曲の一覧表として作成される。記憶されているレッスン曲のうち演奏者の技量レベルに応じた曲のみを表示するようにしてもよい。 FIG. 4 is a flowchart showing the processing of the CPU 1 regarding the lesson phase. In step S21, the single tone model stored in step S12 is read. Here, the latest memorized single note model or the single note model used in the previous lesson may be automatically read, or the memorized single note model is displayed on the display device 8 and the one selected by the performer is read. But you can. In step S22, a screen for prompting selection of a song to be lesson is displayed on the display device 8. This screen is created as a song list based on lesson song data stored in advance in the RAM 3 or the external storage device 9. Of the stored lesson songs, only the songs corresponding to the skill level of the performer may be displayed.

ステップＳ２３では、レッスン曲が選ばれたかどうかが判断され、レッスンする曲が選曲されたならば（ステップＳ２３肯定）、ステップＳ２４に進んで、選ばれた曲の演奏情報をＲＡＭ３もしくは外部記憶装置９から読み出す。 In step S23, it is determined whether or not a lesson song is selected. If a song to be lesson is selected (Yes in step S23), the process proceeds to step S24, and the performance information of the selected song is stored in the RAM 3 or the external storage device 9. Read from.

ステップＳ２５では、選ばれた曲の演奏情報中のすべての音の高さが、選択された単音モデルの音域内に含まれているかどうかを判断する。この判断が肯定ならば、ステップＳ２６に進むが、否定であれば、改めて選曲を促すためにステップＳ２２に戻る。なお、選曲に戻ってもよいし、単音モデルの選択をやり直すようにしてもよい。 In step S25, it is determined whether or not all the pitches in the performance information of the selected song are included in the range of the selected single-tone model. If this determination is affirmative, the process proceeds to step S26. If negative, the process returns to step S22 to prompt the user to select a new song. Note that it may be possible to return to the music selection, or to select the single tone model again.

ステップＳ２６では、選ばれたレッスン曲の最初の楽音の音高を、変数としてＲＡＭ３にセットする。そして、ステップＳ２７ではこの音高に対応する鍵を弾くことができるように指示（押鍵指示）を行う。押鍵指示は、詳細を後述するように表示装置８に鍵盤もしくは楽譜を表示することによって行う。ステップＳ２８では、単音モデルのうちから押鍵指示の音高に対応するものを読み出す。 In step S26, the pitch of the first musical tone of the selected lesson song is set in the RAM 3 as a variable. In step S27, an instruction (key pressing instruction) is given so that the key corresponding to the pitch can be played. The key pressing instruction is performed by displaying a keyboard or a score on the display device 8 as will be described in detail later. In step S28, one corresponding to the pitch of the key depression instruction is read out from the single tone model.

ステップＳ２９では、スペクトル作成部１２の機能により、マイクロフォン１０からの入力音のパワースペクトルを算出する。ステップＳ３０では、アタックが既に検出されているか否かを判断する。通常アタックが検出されるのは、音が立ち上がる瞬間のみであり、必ずしも再びアタック検出時のパワースペクトルと単音モデルとが一致するとは限らない。したがって、アタックが一度検出された後、同じ楽音が発生されている間は、このステップＳ３０の判断によって２度目以降のアタック検出を回避する。最初はこのステップＳ３０の判断は否定であり、２度目以降に判断が肯定となり、ステップＳ３１に進んでアタック検出を行う。アタック検出はパワースペクトルのパワー増加分が予定量を超えたか否かによって判断する。アタックが検出されたならばステップＳ３２が肯定となり、ステップＳ３３でアタック検出フラグをセットしてステップＳ３４に進む。 In step S29, the power spectrum of the input sound from the microphone 10 is calculated by the function of the spectrum creating unit 12. In step S30, it is determined whether or not an attack has already been detected. The normal attack is detected only at the moment when the sound rises, and the power spectrum at the time of the attack detection does not always match the single tone model. Therefore, after the attack is detected once, while the same musical tone is being generated, the second and subsequent attack detections are avoided by the determination in step S30. Initially, the determination in step S30 is negative, the determination is affirmative after the second time, and the process proceeds to step S31 to perform attack detection. Attack detection is determined by whether or not the power increase in the power spectrum exceeds a predetermined amount. If an attack is detected, step S32 becomes affirmative, an attack detection flag is set in step S33, and the process proceeds to step S34.

ステップＳ３４では、スペクトル比較部１４の機能により、ステップＳ２９で計算されたパワースペクトルと単音モデルとが一致しているかどうかを判断する。この判断は、単音モデルのパワースペクトルとステップＳ２９で計算されたパワースペクトルとの距離の合計によって行う。つまり各周波数成分毎に互いの距離を算出し、その合計がしきい値を超えているか否かによって両スペクトルが一致しているか否かを判断する。両パワースペクトルの単純比較でもよいし、互いの音量の違いに応じて正規化したパワースペクトル同士の比較であってもよい。また、ユークリッド距離を使用するのがよい。ユークリッド距離によれば、二つの値の差が２乗されて、その結果、該差が強調され、二つの値の一致の判別が容易だからである。 In step S34, the function of the spectrum comparison unit 14 determines whether or not the power spectrum calculated in step S29 matches the single tone model. This determination is made based on the total distance between the power spectrum of the single tone model and the power spectrum calculated in step S29. That is, the mutual distance is calculated for each frequency component, and it is determined whether or not both spectra match depending on whether or not the sum exceeds a threshold value. A simple comparison of both power spectra may be used, or a comparison of power spectra normalized according to the difference in volume between each other may be used. The Euclidean distance should be used. This is because, according to the Euclidean distance, the difference between the two values is squared, and as a result, the difference is emphasized, and it is easy to determine whether the two values match.

なお、パワースペクトルの比較に際して、スペクトル作成部１２から入力されるパワースペクトルのレベルが、単音モデルのパワースペクトルのレベルに対してどれだけ不足しているかを各周波数成分毎に算出し、その不足分の合計を算出するようにしてもよい。ピアノ等、残響が多く残る楽器には、この方が好都合である。 When comparing the power spectra, the amount of power spectrum input from the spectrum creating unit 12 is calculated for each frequency component to determine how much the power spectrum level is lower than the power spectrum level of the single tone model. May be calculated. This is more convenient for instruments such as pianos that have a lot of reverberation.

両パワースペクトルが一致していない場合は、ステップＳ２９に戻る。両パワースペクトルが一致していた場合はステップＳ３５に進んでアタック検出フラグをリセットする。 If the two power spectra do not match, the process returns to step S29. If the two power spectra match, the process proceeds to step S35 to reset the attack detection flag.

ステップＳ３６では、演奏情報の最後の楽音について単音モデルとの一致が終了したか否かが判断される。最後の楽音についての単音モデルとの比較処理が終了していない場合は、ステップＳ３７において、演奏情報の次の楽音データの音高を、変数としてＲＡＭ３にセットし、ステップＳ２７に進む。 In step S36, it is determined whether or not the last musical tone of the performance information has been matched with the single tone model. If the comparison processing with the single tone model for the last musical tone is not completed, the pitch of the musical tone data next to the performance information is set as a variable in the RAM 3 in step S37, and the process proceeds to step S27.

最後の楽音についての前記比較処理が終了した場合は、ステップＳ３８に進んでもう一度同じ曲についてレッスンを行うかどうかを演奏者に選択させる表示をしたり、ステップＳ３９に進んで別の曲のレッスンを行うかどうかを演奏者に選択させる表示をしたりする。 When the comparison processing for the last musical tone is completed, the process proceeds to step S38 to display whether or not to perform the lesson for the same song again, or to proceed to step S39 to select a lesson for another song. Display the screen to let the player choose whether to do it.

ステップＳ３８が肯定の場合はステップＳ２６に進むし、ステップＳ３９が肯定ならばステップＳ２２に進む。ステップＳ３６およびステップＳ３９が否定ならば、レッスンフェーズを終了する。 If step S38 is positive, the process proceeds to step S26, and if step S39 is positive, the process proceeds to step S22. If step S36 and step S39 are negative, the lesson phase ends.

次に、アタック検出をより確実に行うことができる処理について説明する。図５は、入力データの入力タイミングとＦＦＴ窓の位置とを示す図である。アタック検出は、マイクロフォン１０から入力された音のデータに対して、このように区切られたＦＦＴ窓毎に行われる。 Next, processing that can more reliably perform attack detection will be described. FIG. 5 is a diagram showing the input timing of input data and the position of the FFT window. Attack detection is performed on the sound data input from the microphone 10 for each of the FFT windows thus divided.

図６は、入力データの入力タイミングと処理タイミングとの関係を示す図である。例えば、入力データがタイミングｔ０から蓄積開始されて、タイミングｔ１でＦＦＴに必要な分が蓄積されると、その入力データに関して計算が開始され、タイミングｔ２で計算が終了してアタックか否かの判断がなされる。そして、アタックが検出されなければ、タイミングｔ１〜t３間のデータが採取されるのを待ってタイミングｔ３で再び計算が開始され、タイミングｔ４で計算が終了してアタックか否かの判断がなされる。この計算タイミングによれば、例えば、タイミングｔ１とｔ２との間で生じたアタックは、タイミングｔ４において、つまり実際のアタック時から、ほぼＦＦＴ窓の長さ分遅れた時点で検出される。 FIG. 6 is a diagram illustrating a relationship between input timing of input data and processing timing. For example, when input data is started to be accumulated from timing t0 and the amount necessary for FFT is accumulated at timing t1, calculation is started for the input data, and the calculation is terminated at timing t2 to determine whether or not the attack is an attack. Is made. If no attack is detected, the calculation is started again at timing t3 after waiting for data to be collected between timings t1 to t3, and the calculation is completed at timing t4 to determine whether or not the attack is being performed. . According to this calculation timing, for example, an attack that occurs between timings t1 and t2 is detected at timing t4, that is, at a point that is approximately delayed by the length of the FFT window from the time of the actual attack.

そこで、次のようにして、このようなアタック検出の遅れを解消する。図７は、アタック検出の遅れを解消できる入力データの入力タイミングと処理タイミングとの関係を示す図である。図７において、タイミングｔ１で、タイミングｔ０からタイミングｔ１の間に入力されたデータに対してＦＦＴ処理を開始し、さらにタイミングｔ１からから時間ΔＬずらした時点ｔ２０で、その時点ｔ２０直前のＦＦＴ窓時間に相当する時間Ｔの入力データに対してＦＦＴ処理を開始する。時間ΔＬは予め設定される値で、例えば、ＦＦＴ窓の１／４の長さに設定しておく。このＦＦＴ処理に基づくアタック有無は計算開始から処理時間ΔＴ経過後のタイミングｔ２１で検出される。つまり、タイミングｔ１の直後に生じたアタックがタイミングｔ２１で検出される。同様にタイミングｔ０から時間２ΔＬ、３ΔＬ…、のように、順次、ＦＦＴ窓を予定のシフト時間ΔＬずつずらした時間の入力データについてＦＦＴ処理を行う。したがって、図６に関して説明した例のようにタイミングｔ４でアタック検出される処理よりも短時間でアタック検出することができる。 Therefore, such a delay in attack detection is eliminated as follows. FIG. 7 is a diagram showing the relationship between the input timing of input data that can eliminate the delay in attack detection and the processing timing. In FIG. 7, the FFT processing is started on the data input between the timing t0 and the timing t1 at the timing t1, and the FFT window time immediately before the timing t20 at the time t20 shifted by the time ΔL from the timing t1. FFT processing is started for input data at time T corresponding to. The time ΔL is a preset value, and is set to, for example, a quarter length of the FFT window. The presence or absence of an attack based on this FFT processing is detected at timing t21 after the processing time ΔT has elapsed since the start of calculation. That is, an attack that occurs immediately after timing t1 is detected at timing t21. Similarly, the FFT processing is performed on the input data at the time when the FFT window is sequentially shifted by the predetermined shift time ΔL, such as time 2ΔL, 3ΔL. Therefore, the attack can be detected in a shorter time than the process in which the attack is detected at the timing t4 as in the example described with reference to FIG.

図８は、ＦＦＴ窓をずらしてＦＦＴ処理を行ってアタック検出遅れを解消する図７の処理に係る入力データの入力タイミングとＦＦＴ窓の位置とを示す図である。 FIG. 8 is a diagram showing the input timing of the input data and the position of the FFT window according to the processing of FIG. 7 in which the FFT processing is performed by shifting the FFT window to eliminate the attack detection delay.

なお、ＣＰＵ１の処理能力によって、ＦＦＴ処理に時間がかかる場合、つまりタイミングｔ１から開始される処理時間ΔＴが、ずらした時間（シフト時間）ΔＬより長い場合は、シフトΔＬ時間を処理時間ΔＴと同じか、処理時間ΔＴより長くなるように延長する。一方、処理時間ΔＴが、シフト時間ΔＬより長い場合は、シフトΔＬ時間を処理時間ΔＴに合わせて短縮することもできる。こうして、ＦＦＴ処理の時間に応じてできるだけ素速くアタック検出することができる。 If the FFT processing takes time depending on the processing capability of the CPU 1, that is, if the processing time ΔT started from the timing t1 is longer than the shifted time (shift time) ΔL, the shift ΔL time is the same as the processing time ΔT. Alternatively, it is extended to be longer than the processing time ΔT. On the other hand, when the processing time ΔT is longer than the shift time ΔL, the shift ΔL time can be shortened in accordance with the processing time ΔT. Thus, attack detection can be performed as quickly as possible in accordance with the FFT processing time.

また、１回のＦＦＴ処理で１度しかアタック検出できないので、一つのＦＦＴ窓内で２回アタックがあるような場合、例えば、アルペジオのように速いフレーズでは、後の音のアタックを検出することができない。この場合、押鍵指示が進行しないことになる。そこで、このような速いフレーズのときはアタック検出をスキップして処理を先に進めることができる。アルペジオ等であるか否かは予め演奏情報を先読みして判断する処理ルーチンを設けておくことによって検出することができる。つまり演奏情報を予め読み取って、アルペジオなど演奏速度が速いフレーズの位置（発生タイミング）を予め記憶しておく。そして、このフレーズの発生タイミングでの楽音に対する処理では、図４のステップＳ２９でパワースペクトルを算出したならば、アタックの検出如何にかかわらず、直ちに単音モデルとの比較（ステップＳ３４）に移行する。 In addition, since an attack can be detected only once in one FFT process, when there is an attack twice in one FFT window, for example, in a fast phrase such as an arpeggio, an attack of a later sound is detected. I can't. In this case, the key pressing instruction does not proceed. Therefore, for such a fast phrase, the attack detection can be skipped and the process can proceed. Whether it is an arpeggio or the like can be detected by providing a processing routine for pre-determining performance information in advance. That is, the performance information is read in advance, and the position (occurrence timing) of a phrase having a high performance speed such as an arpeggio is stored in advance. In the process for the musical sound at the phrase generation timing, if the power spectrum is calculated in step S29 in FIG. 4, the process immediately proceeds to the comparison with the single tone model (step S34) regardless of whether an attack is detected.

アタックの有無を判断するのに使用されるパワースペクトルの増加分のしきい値は、モデル作成フェーズにパワースペクトルと併せて記憶した音量に応じて連動させてもよい。また、アタック検出処理（ステップＳ３１，Ｓ３２）を繰り返してもアタックが検出されない場合は、しきい値設定画面を表示装置８に表示して演奏者にしき値の調節を促してもよい。 The threshold value for increasing the power spectrum used to determine whether or not there is an attack may be linked according to the volume stored in the model creation phase together with the power spectrum. If an attack is not detected even after repeating the attack detection process (steps S31 and S32), a threshold setting screen may be displayed on the display device 8 to prompt the player to adjust the threshold value.

アタックを検出した後に、パワースペクトル中に次に弾くべき音のピークが立っているかチェックして、余分な音ではアタックを検出しないようにすることもできる。 After detecting the attack, it is possible to check whether the peak of the next sound to be played is standing in the power spectrum, and to prevent the attack from being detected with an extra sound.

図９は、技量レベルに応じた音域を表示画面上で設定するための表示装置８上の表示例を示す図である。図９には、鍵盤図形Ｋを表示し、この鍵盤図形Ｋ上に、技量レベルを示す「入門」、「初級」、「中級」および「上級」の文字と、各技量レベルのそれぞれに対応した音域を矢印で表示する。この各技量レベルのいずれかを選択すれば、各レベルに対応した図示の範囲の音域が設定される。例えば、この画面の技量レベルを示す各文字上にカーソルを合わせて指示を入力すれば、各レベルに対応した図示の範囲がレッスン音域として設定されるようにする。 FIG. 9 is a diagram showing a display example on the display device 8 for setting a sound range corresponding to the skill level on the display screen. In FIG. 9, a keyboard figure K is displayed, and on this keyboard figure K, “Introductory”, “Beginner”, “Intermediate”, and “Advanced” indicating the skill level correspond to each skill level. The range is displayed with arrows. If any one of these skill levels is selected, the range of the illustrated range corresponding to each level is set. For example, if an instruction is input by placing the cursor on each character indicating the skill level on this screen, the illustrated range corresponding to each level is set as the lesson range.

図１０は、音域設定の際の入力の参考になるように表示される画面の例を示す図である。図１０において、表示された鍵盤図形Ｋの鍵のうち、白鍵および黒鍵とは異なる色で示した鍵１６が、選曲されたレッスン曲に含まれる音高に対応する鍵である。図では着色できないので、鍵１６には、他の黒鍵および白鍵と区別するため粗いドット模様を付した。演奏者はこの音高の表示を参考に、これらの鍵１６のすべてを含むように音域を設定することができる。例えば、任意の二つの鍵を表示画面上で指示し、これらの鍵をそれぞれ最低音および最高音とする音高の範囲をレッスン音域として設定するとよい。 FIG. 10 is a diagram illustrating an example of a screen that is displayed so as to be a reference for input when setting a sound range. In FIG. 10, among the keys of the displayed keyboard figure K, a key 16 shown in a color different from the white key and the black key is a key corresponding to the pitch included in the selected lesson song. Since it cannot be colored in the figure, the key 16 is given a rough dot pattern to distinguish it from other black keys and white keys. The performer can set the range to include all of these keys 16 with reference to the pitch display. For example, any two keys may be indicated on the display screen, and a range of pitches where these keys are the lowest and highest sounds, respectively, may be set as the lesson range.

図１１は、モデル押鍵指示の表示例を示す図である。図１１において、表示装置８に表示された鍵盤図形Ｋには、他の鍵と区別できるように色もしくは模様を付けた鍵１７によって、単音モデル作成のための押鍵すべき鍵盤位置を示している。演奏者はこの表示された鍵盤図形Ｋ上の鍵１７に従って実際のピアノの鍵を押すことができる。なお、図中の白鍵上に示したマーク１８は押鍵位置を認識しやすくするためのものである。このマーク１８で示された鍵と同じ音高に対応する実際のピアノの鍵に、単音モデル作成に先だってシール等を付しておくことにより、表示された鍵盤図形Ｋと実際のピアノの鍵盤との対応がつきやすくなる。 FIG. 11 is a diagram illustrating a display example of a model key pressing instruction. In FIG. 11, the keyboard figure K displayed on the display device 8 indicates the keyboard position to be pressed for creating a single tone model by the key 17 having a color or pattern so that it can be distinguished from other keys. Yes. The performer can press an actual piano key according to the key 17 on the displayed keyboard figure K. Note that the mark 18 shown on the white key in the drawing is for easily recognizing the key pressing position. By attaching a seal or the like to the actual piano key corresponding to the same pitch as the key indicated by the mark 18 prior to the creation of the single tone model, the displayed keyboard figure K and the actual piano keyboard It becomes easy to deal with.

図１２は、レッスンフェーズにおける押鍵指示の一例を示す図である。同図ではピアノロールつまりスクロール表示方式で押鍵指示をしている。表示装置８の表示画面８１には鍵盤図形Ｋが表示されていて、鍵盤図形Ｋの上方には押鍵指示マークＭが表示される。表示画面８１は上下が時間軸であり、押鍵指示マークＭと鍵盤図形Ｋとの距離は、押鍵指示マークＭの下方の鍵を押す時までの時間に対応する。つまり、鍵盤図形Ｋにより近い押鍵指示マークＭの下方にある鍵は、より早い時期に弾くべき鍵である。押鍵指示マークＭは、押鍵すべき鍵に対応する位置に、音符長さに対応した長さで表示され、演奏音および単音モデルのパワースペクトルが互いに一致すると、下方にスクロールされる。現在押鍵されているべき鍵はマークｍで示される。小節線ＢＬで示すように、この例では２小節分の押鍵指示が同時に一画面上に表示されているが、同時に表示する画面の大きさつまり表示される演奏情報の範囲は任意である。このように演奏情報に基づく押鍵指示マークと鍵盤図形とを同時に表示し、押鍵指示マークをスクロール表示する演奏指示装置は、例えば、特開平９−３０５１７１号公報に開示されている。 FIG. 12 is a diagram illustrating an example of a key pressing instruction in the lesson phase. In the figure, the key is pressed by a piano roll, that is, a scroll display method. A keyboard figure K is displayed on the display screen 81 of the display device 8, and a key pressing instruction mark M is displayed above the keyboard figure K. The display screen 81 has a time axis at the top and bottom, and the distance between the key pressing instruction mark M and the keyboard figure K corresponds to the time until the key below the key pressing instruction mark M is pressed. That is, the key below the key pressing instruction mark M closer to the keyboard figure K is a key to be played earlier. The key depression instruction mark M is displayed at a position corresponding to the key to be depressed, with a length corresponding to the note length, and is scrolled downward when the power spectrum of the performance sound and the single tone model match each other. The key to be currently pressed is indicated by a mark m. As indicated by the bar line BL, in this example, the key pressing instructions for two bars are simultaneously displayed on one screen, but the size of the screen to be displayed simultaneously, that is, the range of the performance information to be displayed is arbitrary. A performance instruction apparatus that simultaneously displays a key depression instruction mark and a keyboard figure based on performance information and scrolls the key depression instruction mark is disclosed in, for example, Japanese Patent Application Laid-Open No. 9-305171.

なお、押鍵指示の表示形式は、鍵盤図形に対してマークをスクロールする方式に限らず、楽譜形式、つまり五線譜上に表示された音符をスクロールする方式であってもよい。また、ピアノＰの鍵盤に沿って各鍵毎に対応する位置にＬＥＤ（発光ダイオード）を配した表示器を設置し、押鍵すべき鍵に対応するＬＥＤを点灯させて押鍵指示を表示するものであってもよい。 The display format of the key pressing instruction is not limited to the method of scrolling the mark with respect to the keyboard figure, but may be a score format, that is, a method of scrolling the notes displayed on the staff. In addition, an indicator with LEDs (light emitting diodes) arranged at positions corresponding to each key along the keyboard of the piano P is installed, and the LED corresponding to the key to be pressed is lit to display the key pressing instruction. It may be a thing.

図３および図４に示したフローチャートや変形例に係る処理はＣＰＵ１で実行可能なようにプログラムされ、ＣＤやＲＯＭ等、周知の記録媒体に記録して提供されることができる。 The processes according to the flowcharts and modifications shown in FIGS. 3 and 4 are programmed so as to be executed by the CPU 1 and can be provided by being recorded on a known recording medium such as a CD or a ROM.

上記実施形態では、単音モデルを現にレッスンに使用するピアノＰの演奏音に基づいて作成し、この単音モデルとピアノＰの演奏音との比較を行った。これによって、ピアノＰが仮に調律が正しく行われていない場合であっても、同じピアノＰの演奏音同士の比較で判断するので、押鍵指示通りに弾けたかどうか判断が確実である。 In the above embodiment, a single tone model is created based on the performance sound of the piano P that is actually used for the lesson, and the single tone model and the performance sound of the piano P are compared. As a result, even if the piano P is not tuned correctly, it is determined by comparing the performance sounds of the same piano P, so that it is surely determined whether or not the piano P can be played according to the key pressing instruction.

しかし、本発明は、これに限らず、他のピアノの演奏音から別途作成した単音モデルを外部記憶装置９から読み込んでピアノＰの演奏音のパワースペクトルと比較する場合にも適用できる。 However, the present invention is not limited to this, and can also be applied to a case where a single tone model created separately from the performance sound of another piano is read from the external storage device 9 and compared with the power spectrum of the performance sound of the piano P.

また、本発明は、ピアノの押鍵判定に限らず、オルガン、アコーディオン、ハーモニカ、リコーダ、ギター等、各種自然楽器の演奏音判定に適用することができる。 Further, the present invention is not limited to piano key depression determination, and can be applied to performance sound determination of various natural musical instruments such as an organ, an accordion, a harmonica, a recorder, and a guitar.

本発明の一実施形態に係るピアノのレッスンシステムの構成を示すブロック図である。It is a block diagram which shows the structure of the lesson system of the piano which concerns on one Embodiment of this invention. 演奏情報のフォーマット例を示す図である。It is a figure which shows the example of a format of performance information. モデル作成フェーズに関するＣＰＵの処理を示すフローチャートである。It is a flowchart which shows the process of CPU regarding a model creation phase. レッスンフェーズに関するＣＰＵ１の処理を示すフローチャートである。It is a flowchart which shows the process of CPU1 regarding a lesson phase. 入力データの入力タイミングとＦＦＴ窓の位置との関連を示す図である。It is a figure which shows the relationship between the input timing of input data, and the position of an FFT window. 入力データの入力タイミングと処理タイミングとの関係を示す図である。It is a figure which shows the relationship between the input timing of input data, and a processing timing. アタック検出の遅れを解消できる入力データの入力タイミングと処理タイミングとの関係を示す図（その２）である。FIG. 10 is a diagram (part 2) illustrating a relationship between input timing of input data and processing timing that can eliminate a delay in attack detection; アタック検出の遅れを解消できる例に係る入力データの入力タイミングとＦＦＴ窓の位置との関連を示す図である。It is a figure which shows the relationship between the input timing of the input data which concerns on the example which can eliminate the delay of attack detection, and the position of an FFT window. 技量レベルに応じた音域を表示画面上で設定するための表示装置上の表示例を示す図である。It is a figure which shows the example of a display on a display apparatus for setting the sound range according to a skill level on a display screen. 音域設定の際の入力の参考になるように表示される画面の例を示す図である。It is a figure which shows the example of the screen displayed so that it may become reference of the input in the range setting. モデル押鍵指示の表示例を示す図である。It is a figure which shows the example of a display of a model key press instruction | indication. レッスンフェーズにおける押鍵指示の一例を示す図である。It is a figure which shows an example of the key pressing instruction | indication in a lesson phase.

Explanation of symbols

１…ＣＰＵ、２…ＲＯＭ、４…音源装置、５…スピーカ、６…Ａ／Ｄ変換器、７…入力装置、８…表示装置、９…外部記憶装置、１０…マイクロフォン、１１…音量測定部、１２…スペクトル作成部、１３…アタック検出部、１４…スペクトル比較部、１５…押鍵指示部
DESCRIPTION OF SYMBOLS 1 ... CPU, 2 ... ROM, 4 ... Sound source device, 5 ... Speaker, 6 ... A / D converter, 7 ... Input device, 8 ... Display device, 9 ... External storage device, 10 ... Microphone, 11 ... Volume measurement part 12 ... Spectrum creation unit, 13 ... Attack detection unit, 14 ... Spectrum comparison unit, 15 ... Key press instruction unit

Claims

FFT means for calculating the power spectrum of the digital data by performing FFT processing for each FFT window on the digital data of the natural musical instrument performance sound played in accordance with the performance instruction;
Attack detection means for determining whether or not there is an attack of the natural musical instrument performance sound based on whether or not the power of the power spectrum calculated for each FFT processing has increased more than expected from the previous calculation time;
The power spectrum calculated after it is determined that the attack has occurred is stored in advance, and the distance from the pitch included in the performance instruction among the power spectra for each pitch of the natural musical instrument sound is scheduled. A comparison means for determining whether the value is within the threshold;
A performance sound judging device comprising: means for shifting the FFT window of the FFT means so that the front and rear FFT windows partially overlap each other.

The key press for displaying the performance instruction on the display device of the computer according to the performance information of the pre-selected song and displaying the next performance instruction according to the performance information when the distance is within a predetermined threshold. 2. The performance sound determination apparatus according to claim 1, further comprising an instruction means.

A shift time for determining an overlap period between the FFT windows is set in advance, and the FFT means and attack detection means of the preceding FFT means and attack detection means are started by the time when processing of the subsequent FFT means and attack detection means is started. The performance sound determination device according to claim 1, further comprising means for extending the shift time when the processing is not completed.

Based on the performance information, high-speed part detecting means for detecting a predetermined phrase having a high performance speed in advance;
For the performance sound of the performance instruction corresponding to the phrase, the processing by the attack detection means is omitted,
4. The performance sound determination apparatus according to claim 1, further comprising means for executing the determination by the comparison means for an input sound after a performance instruction regardless of whether there is an attack.

FFT means for calculating the power spectrum of the digital data by performing FFT processing for each FFT window on the digital data of the natural musical instrument performance sound played in accordance with the performance instruction;
Attack detection means for determining whether or not there is an attack of the natural musical instrument performance sound based on whether or not the power of the power spectrum calculated for each FFT processing has increased more than expected from the previous calculation time;
The power spectrum calculated after it is determined that the attack has occurred is stored in advance, and the distance from the pitch included in the performance instruction among the power spectra for each pitch of the natural musical instrument sound is scheduled. A comparison means for determining whether the value is within the threshold;
A performance sound determination program for causing a computer to function as means for shifting the FFT windows of the FFT means so that the front and rear FFT windows partially overlap each other.

The key press for displaying the performance instruction on the display device of the computer according to the performance information of the pre-selected song and displaying the next performance instruction according to the performance information when the distance is within a predetermined threshold. 6. The performance sound determination program according to claim 5, further comprising a program for causing a computer to function as the instruction means.

A shift time for determining an overlap period between the FFT windows is set in advance, and the FFT means and attack detection means of the preceding FFT means and attack detection means are started by the time when processing of the subsequent FFT means and attack detection means is started. The performance sound determination program according to claim 5 or 6, further comprising a program for causing a computer to function as means for extending the shift time when processing is not completed.

Based on the performance information, high-speed part detecting means for detecting a predetermined phrase having a high performance speed in advance;
For the performance sound of the performance instruction corresponding to the phrase, the processing by the attack detection means is omitted,
The performance sound determination program according to any one of claims 5 to 7, further comprising a program for causing the computer to function as a means for executing the determination by the comparison means on the input sound after the performance instruction regardless of whether or not there is an attack. .

A computer-readable recording medium on which the performance sound determination program according to any one of claims 5 to 8 is recorded.