JP4665664B2

JP4665664B2 - Sequence data generation apparatus and sequence data generation program

Info

Publication number: JP4665664B2
Application number: JP2005242208A
Authority: JP
Inventors: 隼也村上
Original assignee: Yamaha Corp
Current assignee: Yamaha Corp
Priority date: 2005-08-24
Filing date: 2005-08-24
Publication date: 2011-04-06
Anticipated expiration: 2025-08-24
Also published as: JP2007057751A

Description

この発明は、楽音に連動して素子を駆動するのに用いるシーケンスデータを生成する装置およびプログラムに関する。 The present invention relates to an apparatus and a program for generating sequence data used for driving elements in conjunction with musical sounds.

携帯電話など、楽曲再生機能を持った電子機器では、楽曲再生の際、アミューズメント性を高めるために、楽曲のノート（音符）のタイミングに合わせてＬＥＤを点滅させ、あるいはバイブレータを振動させる場合がある。ここで、電子機器がＭＩＤＩデータなどの演奏データによって音源を駆動して楽曲再生を行うものである場合には、演奏データに含まれるノートオンやノートオフの指示の情報をＬＥＤの点滅等の制御に用いることができた。しかし、最近の携帯電話などでは、このように音源の駆動により楽曲再生を行うのではなく、演奏を録音することにより得られたオーディオストリームデータを再生することにより楽曲再生を行う機種も提供されている。この場合、オーディオストリームデータは、楽音波形を表すデータであって、ノートオンやノートオフのタイミングを直接表すデータではないので、これをＬＥＤの点滅制御等に用いることはできない。従って、このようなオーディオストリームデータを用いて楽曲再生を行う装置において、ＬＥＤの点滅制御等を行うためには、楽曲のノートのタイミングを示すデータを別途用意する必要がある。従来、オーディオストリームデータからノートのタイミングの抽出を行う技術として、特許文献１に開示された技術があった。この技術は、オーディオストリームデータを一定時間長のフレームに分割し、フレーム毎に周波数解析を行ってスペクトルを求め、フレーム間におけるスペクトルのピークの不連続性を検出することによりノートのタイミングを検出するものである。
特開２００３−５７４４号公報 In an electronic device having a music playback function such as a mobile phone, in order to improve amusement when playing music, the LED may blink or the vibrator may be vibrated in accordance with the timing of music notes (musical notes). . Here, in the case where the electronic device is to play a music piece by driving a sound source with performance data such as MIDI data, control of blinking of LED or the like is performed on note-on and note-off instruction information included in the performance data. Could be used. However, recent cellular phones and the like also provide a model in which music playback is performed by playing back audio stream data obtained by recording a performance instead of playing music by driving a sound source in this way. Yes. In this case, since the audio stream data is data representing a musical sound waveform and not directly representing note-on or note-off timing, it cannot be used for LED blinking control or the like. Therefore, in an apparatus that reproduces music using such audio stream data, in order to perform LED blinking control and the like, it is necessary to separately prepare data indicating the timing of music notes. Conventionally, there has been a technique disclosed in Patent Document 1 as a technique for extracting note timing from audio stream data. This technology divides audio stream data into frames of a certain time length, performs frequency analysis for each frame to obtain a spectrum, and detects the timing of notes by detecting discontinuity of the spectrum peak between frames. Is.
JP 2003-5744 A

ところで、上述した特許文献１に開示された技術は、スペクトルのピークの不連続性に基づいてノートのタイミングを検出するので、時間経過に伴って音高が明確に変化する楽音波形のオーディオストリームデータでないと、オーディオストリームデータ自体からノートのタイミングを検出するのは困難であるという問題があった。 By the way, since the technique disclosed in Patent Document 1 described above detects the timing of a note based on the discontinuity of the spectrum peak, the sound stream audio stream data whose pitch changes clearly with the passage of time. Otherwise, there is a problem that it is difficult to detect the timing of the note from the audio stream data itself.

この発明は、以上説明した事情に鑑みてなされたものであり、処理対象が打楽器音などのような音高の変化が明確でない楽音波形のオーディオストリームデータである場合であっても、ノートのタイミングを検出し、ＬＥＤやバイブレータなどの素子を駆動するシーケンスデータを生成することができるシーケンスデータ生成装置およびシーケンスデータ生成プログラムを提供することを目的とする。 The present invention has been made in view of the circumstances described above, and even when the processing target is music stream audio stream data whose pitch change is not clear, such as percussion instrument sound, the timing of the notes An object of the present invention is to provide a sequence data generation device and a sequence data generation program that can detect sequence and generate sequence data for driving elements such as LEDs and vibrators.

この発明は、楽曲またはその一部の波形を示すオーディオストリームデータの開始点から終了点までの各部の周波数特性を順次求める周波数解析手段と、周波数帯域および振幅域を指定する範囲指定手段と、前記範囲指定手段により指定された周波数帯域および振幅域に前記周波数解析手段によって順次求められる周波数特性におけるパワースペクトルのピークが含まれる期間を検出し、この検出結果に基づき、楽音の発生タイミングを示すシーケンスデータを生成するシーケンスデータ生成手段とを具備することを特徴とするシーケンスデータ生成装置およびコンピュータを前記各手段として機能させるシーケンスデータ生成プログラムを提供する。
かかる発明によれば、範囲指定手段により適切な周波数帯域および振幅域を指定することにより、周波数解析手段により順次求められるオーディオストリームデータの周波数特性から楽音の発生タイミングを検出し、シーケンスデータを生成することができる。 The present invention provides a frequency analysis means for sequentially obtaining frequency characteristics of each part from a start point to an end point of audio stream data indicating a waveform of a music piece or a part thereof, a range specifying means for specifying a frequency band and an amplitude range, Sequence data indicating the generation timing of a musical tone based on the detection result of detecting a period in which the frequency spectrum sequentially obtained by the frequency analysis means includes frequency spectrum and amplitude range designated by the range designation means. And a sequence data generation program for causing a computer to function as each means.
According to this invention, by designating appropriate frequency band and amplitude range by the range designating means, the tone generation timing is detected from the frequency characteristics of the audio stream data sequentially obtained by the frequency analyzing means, and sequence data is generated. be able to.

以下、図面を参照し、この発明の実施の形態を説明する。
図１はこの発明の一実施形態であるシーケンスデータ生成装置の構成を示すブロック図である。このシーケンスデータ生成装置は、パーソナルコンピュータなどのコンピュータに対し、オーディオストリームデータからシーケンスデータを生成するシーケンスデータ生成プログラムをインストールしたものである。 Embodiments of the present invention will be described below with reference to the drawings.
FIG. 1 is a block diagram showing a configuration of a sequence data generating apparatus according to an embodiment of the present invention. This sequence data generation apparatus is obtained by installing a sequence data generation program for generating sequence data from audio stream data in a computer such as a personal computer.

図１において、ＣＰＵ１は、このシーケンスデータ生成装置の各部を制御する制御中枢である。ＲＯＭ２は、ローダなど、このシーケンスデータ生成装置の基本的な動作を制御するための制御プログラムを記憶した読み出し専用メモリである。表示部３は、装置の動作状態やユーザに対するメッセージなどを表示するための装置である。操作部４は、ユーザからコマンドや各種の情報を受け取るための手段であり、キーボードやマウスなどの各種の操作子により構成されている。Ｉ／Ｆ（インタフェース）群５は、ネットワークを介して他の装置との間でデータ通信を行うためのネットワークインタフェースや、磁気ディスク、ＣＤ−ＲＯＭなどの外部記憶媒体との間でデータの授受を行うためのドライバなどにより構成されている。ＨＤＤ（ハードディスク装置）６は、各種のプログラムやデータベースなどの情報を記憶するための不揮発性記憶装置である。ＲＡＭ７は、ＣＰＵ１によってワークエリアとして使用される揮発性メモリである。ＣＰＵ１は、操作部４を介して与えられる指令に従い、ＨＤＤ６内のプログラムをＲＡＭ７にロードして実行する。サウンドシステム８は、ＣＰＵ１による制御の下、音声を出力する装置である。 In FIG. 1, a CPU 1 is a control center that controls each part of the sequence data generating apparatus. The ROM 2 is a read-only memory that stores a control program for controlling the basic operation of the sequence data generating device such as a loader. The display unit 3 is a device for displaying an operation state of the device and a message for the user. The operation unit 4 is a means for receiving commands and various types of information from the user, and includes various types of operators such as a keyboard and a mouse. The I / F (interface) group 5 exchanges data with a network interface for performing data communication with other devices via a network, and with an external storage medium such as a magnetic disk or a CD-ROM. It consists of a driver for performing. The HDD (hard disk device) 6 is a non-volatile storage device for storing information such as various programs and databases. The RAM 7 is a volatile memory used as a work area by the CPU 1. The CPU 1 loads a program in the HDD 6 into the RAM 7 and executes it in accordance with a command given via the operation unit 4. The sound system 8 is a device that outputs sound under the control of the CPU 1.

ＨＤＤ６に記憶される情報として、シーケンスデータ生成プログラムとその処理対象であるオーディオストリームデータがある。ここで、オーディオストリームデータは、楽曲演奏の際の楽音波形をサンプリングして符号化することにより得られるデータである。また、シーケンスデータ生成プログラムは、このオーディオストリームデータから楽音の発音期間を示すノート情報を抽出し、一連のノート情報により構成されるシーケンスデータを生成するプログラムである。好ましい態様において、シーケンスデータ生成プログラムおよびオーディオストリームデータは、例えばインターネット内のサイトからＩ／Ｆ群５の中の適当なものを介してダウンロードされ、ＨＤＤ６にインストールされる。また、他の態様において、シーケンスデータ生成プログラムおよびオーディオストリームデータは、ＣＤ−ＲＯＭ、ＭＤなどのコンピュータ読み取り可能な記憶媒体に記憶された状態で取引される。この態様では、Ｉ／Ｆ群５の中の適当なものを介して記憶媒体からシーケンスデータ生成プログラムまたはオーディオストリームデータが読み出され、ＨＤＤ６にインストールされる。 Information stored in the HDD 6 includes a sequence data generation program and audio stream data to be processed. Here, the audio stream data is data obtained by sampling and encoding a musical sound waveform during music performance. The sequence data generation program is a program for extracting note information indicating a tone generation period from the audio stream data and generating sequence data including a series of note information. In a preferred embodiment, the sequence data generation program and the audio stream data are downloaded from a site in the Internet via an appropriate one in the I / F group 5 and installed in the HDD 6. In another aspect, the sequence data generating program and the audio stream data are traded in a state stored in a computer-readable storage medium such as a CD-ROM or MD. In this aspect, the sequence data generation program or audio stream data is read from the storage medium via an appropriate one in the I / F group 5 and installed in the HDD 6.

図２はシーケンスデータ生成プログラムの処理内容を示すブロック図である。この図に示すように、シーケンスデータ生成プログラムは、窓掛け１１、ＦＦＴ（高速フーリエ変換）１２、パワースペクトル表示１３、矩形指定によるパラメータ設定１４、矩形内ピーク検出１５および発音時間累算１６の各処理を含む。窓掛け１１では、処理対象であるオーディオストリームデータを一定時間長のフレームに分割し、フレーム毎に窓関数を乗算する。ＦＦＴ１２では、フレーム毎に、窓関数の乗じられたオーディオストリームデータにＦＦＴを施し、オーディオストリームデータの周波数特性を求める。そして、パワースペクトル表示１３では、横軸を周波数軸、縦軸を振幅軸として、ＦＦＴ１２において求めた各フレームの周波数特性の絶対値であるパワースペクトルを表示部３に表示する。 FIG. 2 is a block diagram showing the processing contents of the sequence data generation program. As shown in this figure, the sequence data generation program includes each of windowing 11, FFT (Fast Fourier Transform) 12, power spectrum display 13, parameter setting 14 by rectangular designation, peak detection 15 within rectangle, and sounding time accumulation 16. Includes processing. In the windowing 11, the audio stream data to be processed is divided into frames having a fixed time length, and a window function is multiplied for each frame. In the FFT 12, for each frame, the audio stream data multiplied by the window function is subjected to FFT to obtain the frequency characteristics of the audio stream data. In the power spectrum display 13, the power spectrum that is the absolute value of the frequency characteristic of each frame obtained in the FFT 12 is displayed on the display unit 3 with the horizontal axis as the frequency axis and the vertical axis as the amplitude axis.

図３はこのパワースペクトル表示１３の処理により表示部３に表示されるパワースペクトルの例を示している。打楽器音などの波形は、図示の例のように、広い周波数範囲に分布したパワースペクトルを含んでいるが、それらのパワースペクトルの中には発音開始から発音終了までの間に振幅が大きく推移する特徴的なパワースペクトルがある。ユーザは、表示部３におけるパワースペクトル表示を確認し、そのような特徴的なパワースペクトルが存在する周波数帯域と振幅域を見つける。矩形指定によるパラメータ設定１４では、パワースペクトルの表示された表示部３の表示画面上において、上記の特徴的なパワースペクトルの存在する矩形領域の始点Ｐ１および終点Ｐ２を操作部４におけるマウスなどのポインティングデバイスの操作によりユーザに指定させる。そして、矩形指定によるパラメータ設定１４では、図３に示すように、操作部４の操作によって指定された矩形領域をパワースペクトルと重ね表示するとともに、その矩形領域の横辺によって示される周波数帯域と縦辺によって示される振幅域をパラメータとして受け取る。 FIG. 3 shows an example of a power spectrum displayed on the display unit 3 by the processing of the power spectrum display 13. A waveform such as a percussion instrument sound includes a power spectrum distributed over a wide frequency range, as shown in the example in the figure, but the amplitude of the power spectrum changes greatly from the start of sound generation to the end of sound generation. There is a characteristic power spectrum. The user confirms the power spectrum display on the display unit 3 and finds a frequency band and an amplitude area where such a characteristic power spectrum exists. In the parameter setting 14 by specifying a rectangle, the start point P1 and the end point P2 of the rectangular area where the characteristic power spectrum exists are displayed on the display screen of the display unit 3 on which the power spectrum is displayed by pointing the operation unit 4 with a mouse or the like. Let the user specify by operating the device. Then, in the parameter setting 14 by rectangle designation, as shown in FIG. 3, the rectangular area designated by the operation of the operation unit 4 is superimposed on the power spectrum, and the frequency band and the vertical direction indicated by the horizontal side of the rectangular area are displayed. The amplitude range indicated by the edge is received as a parameter.

矩形内ピーク検出１５および発音時間累算１６の各処理は、オーディオストリームデータのパワースペクトルからノート情報を抽出し、シーケンスデータを生成する処理を構成している。これらの処理は、矩形指定によるパラメータ設定１４が実行され、周波数帯域および振幅域が設定されることにより実行可能となる。矩形内ピーク検出１５では、ＦＦＴ１２により得られたパワースペクトルのピークの中に、パラメータ設定１４により得られた周波数帯域および振幅域の範囲内に属するものがあるか否かをフレーム毎に判定する。この判定がフレーム毎に繰り返されることにより、楽音の発音期間の開始と終了が求められる。そして、発音時間累算１６では、この周波数帯域幅および振幅域の範囲内にパワースペクトルのピークが含まれる時間を累算する。この累算により、楽音の発音期間の長さが求められる。そして、楽音の発音期間の開始時刻および長さを示すノート情報が得られ、シーケンスデータが構成される。 Each process of the detection of the peak in rectangle 15 and the sounding time accumulation 16 constitutes a process of extracting note information from the power spectrum of the audio stream data and generating sequence data. These processes can be executed by executing parameter setting 14 by specifying a rectangle and setting a frequency band and an amplitude band. In the in-rectangular peak detection 15, it is determined for each frame whether any of the power spectrum peaks obtained by the FFT 12 belongs to the frequency band and amplitude range obtained by the parameter setting 14. By repeating this determination for each frame, the start and end of the tone generation period are obtained. In sound generation time accumulation 16, the time during which the peak of the power spectrum is included in the frequency bandwidth and amplitude range is accumulated. By this accumulation, the length of the tone generation period is obtained. Then, note information indicating the start time and length of the tone generation period of the musical tone is obtained, and sequence data is constructed.

次に本実施形態の動作を説明する。図４は、シーケンスデータ生成プログラムにおいて図２における矩形指定によるパラメータ設定１４に相当するルーチンの処理内容を示すフローチャートである。また、図５は、シーケンスデータ生成プログラムにおいて図２における矩形内ピーク検出１５および発音時間累算１６に相当するルーチンの処理内容を示すフローチャートである。ユーザが操作部４の操作によりシーケンスデータ生成プログラムの起動を指示すると、ＣＰＵ１は、同プログラムをＨＤＤ６からＲＡＭ７にロードして実行する。このシーケンスデータ生成プログラムの実行過程において、ＣＰＵ１は、まず、図４に示すルーチンの実行を開始する。まず、ＣＰＵ１は、パワースペクトル表示ウィンドウがアクティブか否かを判断し（ステップＳ１０１）。この判断の結果が「ＮＯ」である場合、ＣＰＵ１は、同判断を繰り返す。ここで、操作部４の操作により、ＨＤＤ６内の所望のオーディオストリームデータの選択が指示されると、ＣＰＵ１は、シーケンスデータ生成プログラムに従い、そのオーディオストリームデータをＨＤＤ６からＲＡＭ７内のワークエリアにロードし、このワークエリア内のオーディストリームデータについて図２における窓掛け１１、ＦＦＴ１２、パワースペクトル表示１３を実行する。これによりパワースペクトルが表示部３に表示される。この際、ＲＡＭ７のワークエリアにロードされたオーディオストリームデータをサウンドシステム８により再生し、処理対象をユーザに聴かせるようにしてもよい。 Next, the operation of this embodiment will be described. FIG. 4 is a flowchart showing the processing contents of a routine corresponding to the parameter setting 14 by rectangle designation in FIG. 2 in the sequence data generation program. FIG. 5 is a flowchart showing the processing contents of a routine corresponding to the in-rectangular peak detection 15 and the sounding time accumulation 16 in FIG. 2 in the sequence data generation program. When the user instructs activation of the sequence data generation program by operating the operation unit 4, the CPU 1 loads the program from the HDD 6 to the RAM 7 and executes it. In the execution process of the sequence data generation program, the CPU 1 first starts executing the routine shown in FIG. First, the CPU 1 determines whether or not the power spectrum display window is active (step S101). When the result of this determination is “NO”, the CPU 1 repeats the same determination. Here, when selection of desired audio stream data in the HDD 6 is instructed by the operation of the operation unit 4, the CPU 1 loads the audio stream data from the HDD 6 to the work area in the RAM 7 according to the sequence data generation program. The windowing 11, FFT 12, and power spectrum display 13 in FIG. 2 are executed for the audio stream data in this work area. As a result, the power spectrum is displayed on the display unit 3. At this time, the audio stream data loaded in the work area of the RAM 7 may be reproduced by the sound system 8 so that the user can listen to the processing target.

パワースペクトルの表示の態様には各種のものが考えられる。第１の態様では、横軸を周波数、縦軸を振幅とするパワースペクトルのグラフをフレーム毎に順次表示する。この態様では、ユーザは、パワースペクトルのグラフが時間経過に伴って変化してゆくのを観察することができる。なお、矩形指定によるパラメータ設定１４におけるユーザの操作を容易にするため、矩形指定によるパラメータ設定１４が完了するまでの間、オーディオストリームデータの最後のフレームのパワースペクトルの表示が終わった後、再び最初のフレームに戻り、窓掛け１１、ＦＦＴ１２、パワースペクトル表示１３を繰り返し実行するようにしてもよい。第２の態様では、フレーム毎に得られるパワースペクトルのグラフの画像をＲＡＭ７内のビデオメモリエリアに重ね書きし、ビデオメモリエリア内の画像を表示部３に表示する。この態様では、ユーザは、全フレームについて重ね合わせたパワースペクトルのグラフを表示部３の表示画面上において観察することができる。 Various types of display of the power spectrum are conceivable. In the first aspect, a graph of a power spectrum with the horizontal axis representing frequency and the vertical axis representing amplitude is sequentially displayed for each frame. In this aspect, the user can observe the power spectrum graph changing over time. In order to facilitate the user's operation in the parameter setting 14 by the rectangle designation, after the display of the power spectrum of the last frame of the audio stream data is completed until the parameter setting 14 by the rectangle designation is completed, Returning to this frame, the windowing 11, the FFT 12, and the power spectrum display 13 may be repeatedly executed. In the second aspect, the image of the power spectrum graph obtained for each frame is overwritten in the video memory area in the RAM 7, and the image in the video memory area is displayed on the display unit 3. In this aspect, the user can observe on the display screen of the display unit 3 a graph of the power spectrum superimposed for all frames.

表示部３にパワースペクトルが表示されると、ユーザは、パワースペクトルを観察する。そして、時間経過に伴って振幅が大きく変化するような特徴的なパワースペクトルがある場合、ユーザは、操作部４のマウスのドラッグ操作により、その特徴的なパワースペクトルのピークを囲む矩形を指定する。以下の処理は、このマウス操作による矩形入力を受け付けるための処理である。 When the power spectrum is displayed on the display unit 3, the user observes the power spectrum. When there is a characteristic power spectrum whose amplitude greatly changes with time, the user designates a rectangle surrounding the peak of the characteristic power spectrum by dragging the mouse of the operation unit 4. . The following processing is processing for accepting rectangular input by this mouse operation.

まず、パワースペクトル表示が行われ、ステップＳ１０１の判断結果が「ＹＥＳ」となってステップＳ１０２に進むと、ＣＰＵ１は、操作部４におけるマウスボタンが押されたか否かを判断し、肯定的な判断結果が得られるまで同判断を繰り返す。マウスボタンが押され、ステップＳ１０２の判断結果が「ＹＥＳ」になると、ＣＰＵ１は、既に矩形が表示部３に表示されているか否かを判断する（ステップＳ１０３）。この判断結果が「ＮＯ」である場合、ＣＰＵ１は、マウスボタンが押されたときの表示画面上のカーソルの位置座標を始点Ｐ１の位置座標として記憶する（ステップＳ１０６）。次にＣＰＵ１は、マウスボタンが離されたか否かを判断し（ステップＳ１０７）、肯定的な判断結果が得られるまで同判断を繰り返す。 First, the power spectrum is displayed, and when the determination result in step S101 is “YES” and the process proceeds to step S102, the CPU 1 determines whether or not the mouse button on the operation unit 4 has been pressed, and makes a positive determination. The same judgment is repeated until a result is obtained. When the mouse button is pressed and the determination result in step S102 is “YES”, the CPU 1 determines whether or not a rectangle is already displayed on the display unit 3 (step S103). When the determination result is “NO”, the CPU 1 stores the position coordinate of the cursor on the display screen when the mouse button is pressed as the position coordinate of the start point P1 (step S106). Next, the CPU 1 determines whether or not the mouse button has been released (step S107), and repeats the same determination until a positive determination result is obtained.

そして、マウスボタンが離されて、ステップＳ１０７の判断結果が「ＹＥＳ」になると、ＣＰＵ１は、マウスボタンが離されたときの表示画面上のカーソルの位置座標を終点Ｐ２の位置座標として記憶する（ステップＳ１０８）。次にＣＰＵ１は始点Ｐ１と終点Ｐ２を結ぶ直線が対角線となる矩形を、パワースペクトルと重ね合わせて表示部３に表示する（ステップＳ１０９）。そして、ＣＰＵ１は、表示された矩形の縦辺に相当する振幅域と横辺に相当する周波数帯域幅を求め、ノート情報の抽出対象となる領域を特定するパラメータとして設定し（ステップＳ１１０）、処理をステップＳ１０２に戻す。 When the mouse button is released and the determination result in step S107 is “YES”, the CPU 1 stores the position coordinate of the cursor on the display screen when the mouse button is released as the position coordinate of the end point P2 ( Step S108). Next, the CPU 1 displays a rectangle whose diagonal is a straight line connecting the start point P1 and the end point P2 on the display unit 3 so as to overlap with the power spectrum (step S109). Then, the CPU 1 obtains an amplitude region corresponding to the vertical side of the displayed rectangle and a frequency bandwidth corresponding to the horizontal side, and sets it as a parameter for specifying the region from which the note information is to be extracted (step S110). Is returned to step S102 .

本実施形態においてユーザは、以上のようにして表示部３に矩形が表示された状態において、マウス操作により矩形を変更することができる。以下、その場合の動作を説明する。まず、矩形が表示された状態においてマウスボタンが押されると、ステップＳ１０２を介してステップＳ１０３に進んだとき、このステップＳ１０３の判断結果が「ＹＥＳ」となり、処理はステップＳ１０４に進む。ステップＳ１０４において、ＣＰＵ１は、マウスが押されたときのカーソル位置が表示されている矩形の辺上にあるか否かを判断する。この判断結果が「ＮＯ」である場合、ＣＰＵ１は、表示部３における矩形の表示を消去し（ステップＳ１０５）、マウスが押されたときのカーソル位置の座標を始点Ｐ１の座標として記憶する（ステップＳ１０６）。その後、マウスが離されたときに、上述と同様、ステップＳ１０７〜Ｓ１１０を順次実行して矩形を確定させ、矩形の縦辺に相当する振幅域と矩形の横辺に相当する周波数帯域をパラメータとして設定する。このように、始点Ｐ１として、表示されている矩形の辺上でない位置を指すマウス操作が行われた場合には、既に表示されている矩形が消去され、新たな矩形入力として取り扱われる。 In the present embodiment, the user can change the rectangle by operating the mouse while the rectangle is displayed on the display unit 3 as described above. The operation in that case will be described below. First, when the mouse button is pressed in a state where a rectangle is displayed, when the process proceeds to step S103 via step S102, the determination result in step S103 is “YES”, and the process proceeds to step S104. In step S104, the CPU 1 determines whether or not the cursor position when the mouse is pressed is on the side of the displayed rectangle. When the determination result is “NO”, the CPU 1 deletes the rectangular display on the display unit 3 (step S105) and stores the coordinates of the cursor position when the mouse is pressed as the coordinates of the start point P1 (step S105). S106). Thereafter, when the mouse is released, as in the case described above, steps S107 to S110 are sequentially executed to fix the rectangle, and the amplitude range corresponding to the vertical side of the rectangle and the frequency band corresponding to the horizontal side of the rectangle are used as parameters. Set. As described above, when a mouse operation indicating a position that is not on the side of the displayed rectangle is performed as the start point P1, the already displayed rectangle is deleted and treated as a new rectangle input.

一方、表示されている矩形の辺上にカーソルを位置させてマウスボタンが押されたときには、ステップＳ１０２およびステップＳ１０３を介してステップＳ１０４に進んだとき、ステップＳ１０４の判断結果が「ＹＥＳ」となり、ステップＳ１１１に進む。このステップＳ１１１において、ＣＰＵ１は、マウスボタンが離されたか否かを判断し、肯定的な判断結果が得られるまで同判断を繰り返す。そして、マウスが離されてステップＳ１１１の判断結果が「ＹＥＳ」になると、ＣＰＵ１は、ステップＳ１０４においてマウスによって指示されていることを確認した矩形の辺を、マウスのドラッグ量に応じて移動させ、この辺の移動後の矩形を表示部３に表示する（ステップＳ１１２）。そして、処理はステップＳ１１０に進み、ＣＰＵ１は、表示された矩形の縦辺に相当する振幅域と横辺に相当する周波数帯域幅を求め、ノート情報の抽出対象となる領域を特定するパラメータとして設定し（ステップＳ１１０）、処理をステップＳ１０２に戻す。 On the other hand, when the cursor is positioned on the side of the displayed rectangle and the mouse button is pressed, when the process proceeds to step S104 via step S102 and step S103, the determination result in step S104 is “YES”. Proceed to step S111. In step S111, the CPU 1 determines whether or not the mouse button has been released, and repeats the same determination until a positive determination result is obtained. Then, when the mouse is released and the determination result in step S111 becomes “YES”, the CPU 1 moves the side of the rectangle confirmed by the mouse in step S104 according to the drag amount of the mouse, The rectangle after the movement of this side is displayed on the display unit 3 (step S112). Then, the process proceeds to step S110, and the CPU 1 obtains an amplitude region corresponding to the vertical side of the displayed rectangle and a frequency bandwidth corresponding to the horizontal side, and sets it as a parameter for specifying the region from which the note information is extracted. (Step S110), and the process returns to Step S102 .

以上のようにして、矩形の縦辺に相当する振幅域と、矩形の横辺に相当する周波数帯域が設定されると、図５に示すルーチンの実行が可能な状態となる。そして、ユーザが操作部４の操作により、シーケンスデータの生成の指示を入力すると、ＣＰＵ１は、図５に示すルーチンを実行する。このルーチンでは、オーディオストリームデータをフレーム単位で処理してノートイベントを含むシーケンスデータを生成する。まず、ＣＰＵ１は、初期化処理を実行し、ノートオンフラグＮＯＮを“０”に、フレームカウンタＦＣＮＴを「０」に、ノートオン期間カウンタＮＬを「０」に設定する（ステップＳ２０１）。ここで、ノートオンフラグＮＯＮは、現在処理しているフレームがノートオン期間（楽音の発音期間）のものかノートオフ期間のものかを示すフラグであり、ノートオン期間は“１”、ノートオフ期間は“０”とされる。また、フレームカウンタＦＣＮＴは、オーディオストリームデータのフレーム単位での処理を進める際に処理したフレーム数を計数するカウンタである。このカウンタのカウント値は、オーディオストリームデータの開始点を基準とした時刻情報として用いられる。ノートオン期間カウンタＮＬは、ノートオン期間が始まってから終了するまでのフレーム数を計数するカウンタである。 When the amplitude range corresponding to the vertical side of the rectangle and the frequency band corresponding to the horizontal side of the rectangle are set as described above, the routine shown in FIG. 5 can be executed. When the user inputs an instruction to generate sequence data by operating the operation unit 4, the CPU 1 executes a routine shown in FIG. In this routine, the audio stream data is processed in units of frames to generate sequence data including note events. First, the CPU 1 executes an initialization process, and sets the note-on flag NON to “0”, the frame counter FCNT to “0”, and the note-on period counter NL to “0” (step S201). Here, the note-on flag NON is a flag indicating whether the currently processed frame is in the note-on period (musical sound generation period) or the note-off period. The period is set to “0”. The frame counter FCNT is a counter that counts the number of frames processed when processing the audio stream data in units of frames. The count value of this counter is used as time information based on the start point of the audio stream data. The note-on period counter NL is a counter that counts the number of frames from the start of the note-on period to the end thereof.

ステップＳ２０１の初期化処理が終了すると、ＣＰＵ１は、ＲＡＭ７のワークエリアから１フレーム分のオーディオストリームデータを読み出し、そのフレームについて図２における窓掛け１１、ＦＦＴ１２およびパワースペクトル表示１３の各処理を実行する（ステップＳ２０２）。次にＣＰＵ１は、ＦＦＴ１２により得られたパワースペクトルのピークの中に、現在設定されている矩形領域の周波数帯域および振幅域に収まるものがあるか否かを判断する（ステップＳ２０３）。この判断結果が「ＮＯ」である場合、ＣＰＵ１は、ノートオンフラグＮＯＮが“０”であるか否かを判断する（ステップＳ２０４）。そして、ステップＳ２０４の判断結果が「ＹＥＳ」である場合、ＣＰＵ１は、フレームカウンタＦＣＮＴを「１」だけ増加させる（ステップＳ２１０）。次にＣＰＵ１は、現在のフレームが処理対象であるオーディオストリームデータの最後のフレームか否かを判断し（ステップＳ２１１）、この判断結果が「ＮＯ」である場合には処理をステップＳ２０２に戻す。以後、ステップＳ２０２、Ｓ２０３、Ｓ２０４、Ｓ２１０、Ｓ２１１の各処理が繰り返される。 When the initialization process of step S201 is completed, the CPU 1 reads out audio stream data for one frame from the work area of the RAM 7, and executes each process of the windowing 11, the FFT 12, and the power spectrum display 13 in FIG. (Step S202). Next, the CPU 1 determines whether there are any peaks in the power spectrum obtained by the FFT 12 that fall within the frequency band and amplitude area of the currently set rectangular area (step S203). When the determination result is “NO”, the CPU 1 determines whether or not the note-on flag NON is “0” (step S204). If the determination result in step S204 is “YES”, the CPU 1 increases the frame counter FCNT by “1” (step S210). Next, the CPU 1 determines whether or not the current frame is the last frame of the audio stream data to be processed (step S211). If the determination result is “NO”, the process returns to step S202. Thereafter, the processes of steps S202, S203, S204, S210, and S211 are repeated.

この間、ステップＳ２０２の処理対象となるフレームが楽曲のノートオフ期間に属するものからノートオン期間に属するものに移行してゆくと、ステップＳ２０２において得られるパワースペクトルの振幅が大きくなってゆく。そして、ステップＳ２０２において得られるパワースペクトルのピークが現在設定されている矩形領域内に入ると、ステップＳ２０３の判断結果が「ＹＥＳ」になり、ＣＰＵ１の処理はステップＳ２０７に進む。このステップＳ２０７においてＣＰＵ１は、ノートオンフラグＮＯＮが“１”か否かを判断する。この判断結果が「ＮＯ」である場合、すなわち、ノートオフ期間からノートオン期間への切り換わりがあった場合には、ＣＰＵ１は、ノートオンフラグＮＯＮを“１”とし、その時点におけるフレームカウンタＦＣＮＴのカウント値をノートオン開始時刻データＦＳとして保存し、ノートオン期間カウンタＮＬを「０」に初期化する（ステップＳ２０８）。次にＣＰＵ１は、ノートオン期間カウンタＮＬを「１」だけ増加させ（ステップＳ２０９）、上述したステップＳ２１０およびＳ２１１を順次実行し、ステップＳ２１１の判断結果が「ＮＯ」である場合には処理をステップＳ２０２に戻す。 In the meantime, when the frame to be processed in step S202 shifts from that belonging to the note-off period to that belonging to the note-on period, the amplitude of the power spectrum obtained in step S202 increases. When the peak of the power spectrum obtained in step S202 falls within the currently set rectangular area, the determination result in step S203 is “YES”, and the process of the CPU 1 proceeds to step S207. In step S207, the CPU 1 determines whether or not the note-on flag NON is “1”. When the determination result is “NO”, that is, when there is a switch from the note-off period to the note-on period, the CPU 1 sets the note-on flag NON to “1” and the frame counter FCNT at that time point. Is stored as note-on start time data FS, and a note-on period counter NL is initialized to “0” (step S208). Next, the CPU 1 increments the note-on period counter NL by “1” (step S209) and sequentially executes the above-described steps S210 and S211. If the determination result in step S211 is “NO”, the process is stepped. Return to S202.

この結果、処理はステップＳ２０２およびＳ２０３を介してステップＳ２０７に進む。このとき、ノートオンフラグＮＯＮは“１”となっているため、ステップＳ２０７の判断結果は「ＹＥＳ」となる。従って、ＣＰＵ１は、ステップＳ２０９、Ｓ２１０およびＳ２１１を順次実行し、ステップＳ２１１の判断結果が「ＮＯ」である場合には処理をステップＳ２０２に戻す。以後、ステップＳ２０２、Ｓ２０３、Ｓ２０７、Ｓ２０９、Ｓ２１０、Ｓ２１１の各処理が繰り返される。 As a result, the process proceeds to step S207 via steps S202 and S203. At this time, since the note-on flag NON is “1”, the determination result in step S207 is “YES”. Therefore, the CPU 1 sequentially executes steps S209, S210, and S211. If the determination result in step S211 is “NO”, the process returns to step S202. Thereafter, steps S202, S203, S207, S209, S210, and S211 are repeated.

この間、ステップＳ２０２の処理対象となるフレームが楽曲のノートオン期間に属するものからノートオフ期間に属するものに移行してゆくと、ステップＳ２０２において得られるパワースペクトルの振幅が小さくなってゆく。そして、ステップＳ２０２において得られるパワースペクトルのピークが現在設定されている矩形領域外に出ると、ステップＳ２０３の判断結果が「ＮＯ」になり、ＣＰＵ１の処理はステップＳ２０４に進む。このステップＳ２０４においてＣＰＵ１は、ノートオンフラグＮＯＮが“０”か否かを判断する。このときノートオンフラグＮＯＮは“１”となっているため、ステップＳ２０４の判断結果は「ＮＯ」となり、ＣＰＵ１の処理はステップＳ２０５に進む。このステップＳ２０５においてＣＰＵ１は、ノートオンフラグＮＯＮを“０”とする。次にステップＳ２０６に進み、ＣＰＵ１は、その時点におけるノートオン開始時刻データＦＳおよびノートオン期間カウンタＮＬをノート情報としてＲＡＭ７内に設定されたシーケンスデータの格納エリアに格納する（ステップＳ２０６）。次にＣＰＵ１は、ステップＳ２１０およびＳ２１１を順次実行し、ステップＳ２１１の判断結果が「ＮＯ」である場合には処理をステップＳ２０２に戻す。
以後、同様の動作が繰り返され、ステップＳ２１１の判断結果が「ＹＥＳ」となったときこのルーチンは終了する。 In the meantime, when the frame to be processed in step S202 shifts from that belonging to the note-on period to that belonging to the note-off period, the amplitude of the power spectrum obtained in step S202 decreases. If the peak of the power spectrum obtained in step S202 goes outside the currently set rectangular area, the determination result in step S203 is “NO”, and the process of the CPU 1 proceeds to step S204. In step S204, the CPU 1 determines whether or not the note-on flag NON is “0”. Since the note-on flag NON is “1” at this time, the determination result in step S204 is “NO”, and the process of the CPU 1 proceeds to step S205. In step S205, the CPU 1 sets the note-on flag NON to “0”. Next, proceeding to step S206, the CPU 1 stores the note-on start time data FS and the note-on period counter NL at that time in the sequence data storage area set in the RAM 7 as note information (step S206). Next, the CPU 1 sequentially executes steps S210 and S211. If the determination result in step S211 is “NO”, the process returns to step S202.
Thereafter, the same operation is repeated, and this routine ends when the determination result in step S211 is "YES".

以上の処理により、ノート情報を時系列的に並べたシーケンスデータがＲＡＭ７内に得られる。このシーケンスデータは元のオーディオストリームデータと共に携帯電話などの楽曲再生機能を持った電子機器のユーザに供給され、オーディオストリームデータは楽曲の再生に、シーケンスデータはＬＥＤやバイブレータの駆動制御に用いられる。オーディオデータおよびシーケンスデータは、Ｉ／Ｆ群５の中の適切な装置により記憶媒体に格納してユーザに供給してもよいし、Ｉ／Ｆ群５の中の適切な装置を介してネットワーク内のサーバに送信し、そこからユーザに配信するようにしてもよい。 Through the above processing, sequence data in which note information is arranged in time series is obtained in the RAM 7. This sequence data is supplied together with the original audio stream data to a user of an electronic device having a music playback function such as a mobile phone, the audio stream data is used for playback of music, and the sequence data is used for driving control of LEDs and vibrators. Audio data and sequence data may be stored in a storage medium by an appropriate device in the I / F group 5 and supplied to the user, or may be supplied to the user via an appropriate device in the I / F group 5. May be transmitted to the server and distributed to the user from there.

以上のように、本実施形態によれば、周波数解析によりオーディオストリームデータのパワースペクトルを求め、指定された周波数帯域および振幅域内にパワースペクトルのピークが収まる期間を検出し、この検出結果に基づいてシーケンスデータを生成するようにしたので、打楽器演奏音など音高の変化が明確でない楽音の発音タイミングを示すシーケンスデータをオーディオストリームデータから生成することができる。また、本実施形態によれば、周波数解析により求めたパワースペクトルを、横軸を周波数、縦軸を振幅として表示し、この表示画面上において所望の周波数帯域および振幅域をマウスなどのポインティングデバイスにより指定させるようにしたので、ユーザは、所望の楽音の種類に特有な特徴的なパワースペクトルの発生する周波数帯域および振幅域を見つけ、その領域を簡単な操作により指定することができる。 As described above, according to the present embodiment, the power spectrum of the audio stream data is obtained by frequency analysis, the period during which the peak of the power spectrum falls within the designated frequency band and amplitude range is detected, and based on this detection result Since the sequence data is generated, it is possible to generate sequence data indicating the tone generation timing of a musical tone whose pitch change is not clear, such as a percussion instrument performance sound, from the audio stream data. Further, according to the present embodiment, the power spectrum obtained by frequency analysis is displayed with the horizontal axis as frequency and the vertical axis as amplitude, and a desired frequency band and amplitude range are displayed on this display screen by a pointing device such as a mouse. Since the designation is made, the user can find a frequency band and an amplitude range in which a characteristic power spectrum peculiar to the type of desired musical sound is generated, and can designate the range by a simple operation.

以上、この発明の一実施形態について説明したが、この発明にはこれ以外にも他の実施形態が考えられる。例えば次の通りである。
（１）上記実施形態では、マウスの操作により矩形領域をユーザに指定させたため、振幅域の上限および下限の両方が得られ、パワースペクトルのピークが指定された周波数帯域内にあり、かつ、振幅域の上限および下限の間に収まっている期間を検出した。しかし、振幅域の上限は無視し、パワースペクトルのピークが指定された周波数帯域内にあり、かつ、振幅域の下限以上である期間を検出し、これを楽音の発音期間として扱ってもよい。 Although one embodiment of the present invention has been described above, other embodiments are possible for the present invention. For example:
(1) In the above embodiment, since the rectangular area is designated by the user by operating the mouse, both the upper limit and the lower limit of the amplitude area are obtained, the peak of the power spectrum is within the specified frequency band, and the amplitude A period that falls between the upper and lower limits of the range was detected. However, the upper limit of the amplitude range may be ignored, and a period in which the peak of the power spectrum is within the specified frequency band and is equal to or greater than the lower limit of the amplitude range may be detected and treated as a tone generation period.

（２）上記実施形態では、楽音の発音開始時刻を示す情報と発音の持続時間を示す情報を含むシーケンスデータを生成したが、楽音の発音開始時刻を示す情報を含み、発音の持続時間を示す情報を含まないシーケンスデータを生成してもよい。あるいは、シーケンスデータに従ってＬＥＤやバイブレータの駆動を行う携帯電話などの電子機器において、シーケンスデータ中の楽音の発音開始時刻を示す情報のみを利用してＬＥＤ等の駆動開始タイミングを制御し、駆動の持続時間は例えばユーザが操作子の操作により自由に設定するようにしてもよい。 (2) In the above embodiment, the sequence data including the information indicating the tone generation start time of the musical tone and the information indicating the duration of the pronunciation is generated. However, the sequence data includes the information indicating the pronunciation start time of the tone and indicates the duration of the pronunciation Sequence data that does not include information may be generated. Alternatively, in an electronic device such as a mobile phone that drives an LED or a vibrator according to sequence data, the drive start timing of the LED or the like is controlled by using only the information indicating the sound generation start time in the sequence data, and the drive is continued. For example, the user may freely set the time by operating the operator.

（３）上記実施形態において、窓掛け１１、ＦＦＴ１２、パワースペクトル表示１３の高速実行が可能な場合には、処理対象のオーディオストリームデータをサウンドシステム８に送って楽曲を再生させ、これに同期させて、オーディオストリームデータの各フレームについての窓掛け１１、ＦＦＴ１２、パワースペクトル表示１３を順次実行させるようにしてもよい。この態様によれば、ユーザは、サウンドシステム８により再生される楽曲を聴き、例えばドラム音などの所望の楽音が発音されるのに合わせて大きく動く特徴的なパワースペクトルを発見し、矩形領域の指定を行うことができる。 (3) In the above embodiment, when the windowing 11, the FFT 12, and the power spectrum display 13 can be executed at high speed, the audio stream data to be processed is sent to the sound system 8 to reproduce the music and synchronize with it. Thus, the windowing 11, the FFT 12, and the power spectrum display 13 for each frame of the audio stream data may be sequentially executed. According to this aspect, the user listens to the music reproduced by the sound system 8, finds a characteristic power spectrum that moves greatly as a desired musical sound such as a drum sound is generated, and the like. You can specify.

（４）矩形領域の指定を行わせるために利用するオーディオストリームデータとシーケンスデータを生成するために利用するオーディオストリームデータは別のデータであってもよい。すなわち、矩形指定によるパラメータ設定１４を容易に行わせるために、処理対象のオーディオストリームデータとは別に、ドラム音、シンバル音といった各種の楽音の単発音のオーディオストリームデータを用意する。そして、これらのうちユーザによって指定されたものについて窓掛け１１、ＦＦＴ１２、パワースペクトル表示１３および矩形指定によるパラメータ設定１４を実行する。そして、矩形領域が定まった場合には、所望の楽曲のオーディオストリームデータを処理対象とし、窓掛け１１、ＦＦＴ１２、パワースペクトル表示１３、矩形内ピーク検出１５および発音時間累算１６の各処理を実行するのである。 (4) The audio stream data used for specifying the rectangular area and the audio stream data used for generating the sequence data may be different data. That is, in order to easily perform parameter setting 14 by specifying a rectangle, a single tone audio stream data of various musical sounds such as a drum sound and a cymbal sound is prepared separately from the audio stream data to be processed. Then, the windowing 11, the FFT 12, the power spectrum display 13, and the parameter setting 14 by the rectangle designation are executed for those designated by the user. When the rectangular area is determined, the audio stream data of the desired music is processed, and the windowing 11, FFT 12, power spectrum display 13, in-rectangular peak detection 15, and pronunciation time accumulation 16 are executed. To do.

（５）上記実施形態では、１個の矩形領域を指定するようにしたが、矩形領域を複数指定することを認め、矩形領域毎に、パワースペクトルのピークが収まる期間の検出および検出結果に基づくシーケンスデータの生成を行うようにしてもよい。この態様によれば、複数種類の楽音を含む楽曲のオーディオストリームデータが処理対象となっており、かつ、楽音の種類により、特徴的なパワースペクトルの現れる周波数帯域が異なるような場合に、それらの楽音の種類毎に矩形領域を設定し、各楽音に対応したシーケンスデータを生成することができる。この場合において、シーケンスデータを利用する携帯電話等において、例えば第１の楽音のシーケンスデータにより赤色ＬＥＤの点滅制御を行い、第２の楽音のシーケンスデータにより青色ＬＥＤの点滅制御を行うようにしてもよい。
（６）窓掛け１１およびＦＦＴ１２は、ＣＰＵ１がＤＳＰ（デジタル信号プロセッサ）などの他の装置に実行させるようにしてもよい。 (5) In the above embodiment, one rectangular area is specified. However, it is recognized that a plurality of rectangular areas are specified, and detection of a period in which the peak of the power spectrum falls within each rectangular area and based on the detection result. Sequence data may be generated. According to this aspect, when the audio stream data of a song including a plurality of types of musical sounds is a processing target and the frequency band in which a characteristic power spectrum appears varies depending on the types of musical sounds, A rectangular area can be set for each type of musical sound, and sequence data corresponding to each musical sound can be generated. In this case, in a cellular phone or the like using sequence data, for example, the blinking control of the red LED is performed by the sequence data of the first musical tone, and the blinking control of the blue LED is performed by the sequence data of the second musical tone. Good.
(6) The windowing 11 and the FFT 12 may be executed by the CPU 1 by another device such as a DSP (digital signal processor).

この発明の一実施形態であるシーケンスデータ生成装置の構成を示すブロック図である。It is a block diagram which shows the structure of the sequence data generation apparatus which is one Embodiment of this invention. 同実施形態におけるシーケンスデータ生成プログラムの処理内容を示す図である。It is a figure which shows the processing content of the sequence data generation program in the embodiment. 同実施形態におけるパワースペクトルの表示例を示す図である。It is a figure which shows the example of a display of the power spectrum in the same embodiment. 同実施形態における矩形指定によるパラメータ設定のためのルーチンの処理内容を示すフローチャートである。It is a flowchart which shows the processing content of the routine for the parameter setting by the rectangle designation | designated in the embodiment. 同実施形態における矩形内ピーク検出および発音時間累算のためのルーチンの処理内容を示すフローチャートである。It is a flowchart which shows the processing content of the routine for the detection of the peak in a rectangle and the sounding time accumulation in the same embodiment.

Explanation of symbols

１……ＣＰＵ、２……ＲＯＭ、３……表示部、４……操作部、５……Ｉ／Ｆ群、６……ＨＤＤ、７……ＲＡＭ、８……サウンドシステム、１１……窓掛け、１２……ＦＦＴ、１３……パワースペクトル表示、１４……矩形指定によるパラメータ設定、１５……矩形内ピーク検出、１６……発音時間累算。 1 ... CPU, 2 ... ROM, 3 ... display unit, 4 ... operation unit, 5 ... I / F group, 6 ... HDD, 7 ... RAM, 8 ... sound system, 11 ... window Multiplication, 12... FFT, 13... Power spectrum display, 14... Parameter setting by rectangle specification, 15... Peak detection within rectangle, 16.

Claims

Frequency analysis means for sequentially obtaining the frequency characteristics of each part from the start point to the end point of the audio stream data indicating the waveform of the music piece or a part thereof;
A range specifying means for specifying a region consisting of the frequency band and amplitude range according to operation of the operation unit performed by the user,
A period in which the peak of the power spectrum in the frequency characteristics sequentially obtained by the frequency analysis means is included in a region composed of the frequency band and the amplitude range designated by the range designation means is detected, and the generation timing of the musical sound is determined based on the detection result. And a sequence data generating means for generating sequence data representing the sequence data.

The frequency analysis means sequentially obtains the frequency characteristics of the audio stream data for each frame,
The sequence data generating means detects a frame including a peak of a power spectrum in a frequency characteristic sequentially obtained for each frame by the frequency analyzing means in a region composed of a frequency band and an amplitude range specified by the range specifying means, The sequence data generating apparatus according to claim 1, wherein sequence data indicating a musical sound generation timing is generated by setting a frame including the peak as a note-on period.

The range specifying means specifies a region composed of a frequency band and an amplitude region having only a lower limit without an upper limit in accordance with an operation of the operation unit. Sequence data generator.

The range designating unit designates one or a plurality of regions consisting of a frequency band and an amplitude region according to an operation of an operation unit performed by a user,
The sequence data generation means includes a peak of a power spectrum in the frequency characteristics sequentially obtained by the frequency analysis means for each area when a plurality of areas consisting of frequency bands and amplitude areas are designated by the range designation means. 4. The sequence data generation device according to claim 1, wherein a plurality of sequence data each indicating a tone generation timing is generated based on the detection result. .

Comprising display means for displaying a power spectrum in a frequency characteristic obtained by the frequency analysis means,
5. The pointing device according to claim 1, wherein the range designating unit includes a pointing device that designates a region composed of the frequency band and the amplitude region on a display screen of the power spectrum in the display unit. The sequence data generation device according to claim.

6. The sequence data generation apparatus according to claim 5, wherein the display means sequentially displays power spectra sequentially obtained by the frequency analysis means.

6. The sequence data generation apparatus according to claim 5, wherein the display means displays a power spectrum sequentially obtained by the frequency analysis means.

The frequency analysis means sequentially obtains the frequency characteristics of the first audio stream data,
The display means displays a power spectrum in the frequency characteristics of the first audio stream data obtained by the frequency analysis means;
In accordance with the operation of the operation unit performed by the user by the range specifying unit, the region consisting of the frequency band and the amplitude region is specified on the display screen of the power spectrum of the first audio stream data in the display unit,
The frequency analysis means sequentially obtains the frequency characteristics of each part from the start point to the end point of the second audio stream data,
The sequence data generating means includes a peak of the power spectrum in the frequency characteristic of the second audio stream data sequentially obtained by the frequency analyzing means in the region consisting of the frequency band and the amplitude range designated by the range designating means. 8. The sequence data generating apparatus according to claim 5, wherein a period is detected, and sequence data indicating a musical sound generation timing is generated based on the detection result.

Comprising a reproducing means for reproducing the audio stream data as sound,
The frequency analysis means obtains a frequency characteristic of the audio stream data in synchronization with the reproduction of the audio stream data by the reproduction means, and the display means displays a power spectrum in the frequency characteristic. The sequence data generation device according to any one of claims 1 to 8.

Computer
Frequency analysis means for sequentially obtaining frequency characteristics of each part from the start point to the end point of the audio stream data indicating the waveform of the music piece or a part thereof;
A range specifying means for specifying a region composed of a frequency band and an amplitude region according to an operation of the operation unit performed by the user;
A period in which the peak of the power spectrum in the frequency characteristics sequentially obtained by the frequency analysis means is included in a region composed of the frequency band and the amplitude range designated by the range designation means is detected, and the generation timing of the musical sound is determined based on the detection result. A sequence data generation program that functions as a sequence data generation unit that generates sequence data indicating