WO2017037920A1 - Musical-piece analysis device, musical-piece analysis method, and musical-piece analysis program - Google Patents

Musical-piece analysis device, musical-piece analysis method, and musical-piece analysis program Download PDF

Info

Publication number
WO2017037920A1
WO2017037920A1 PCT/JP2015/075077 JP2015075077W WO2017037920A1 WO 2017037920 A1 WO2017037920 A1 WO 2017037920A1 JP 2015075077 W JP2015075077 W JP 2015075077W WO 2017037920 A1 WO2017037920 A1 WO 2017037920A1
Authority
WO
WIPO (PCT)
Prior art keywords
sound data
fft
execution interval
data
time
Prior art date
Application number
PCT/JP2015/075077
Other languages
French (fr)
Japanese (ja)
Inventor
四郎 鈴木
Original Assignee
Pioneer DJ株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Pioneer DJ株式会社 filed Critical Pioneer DJ株式会社
Priority to JP2017537159A priority Critical patent/JP6549234B2/en
Priority to EP15903039.4A priority patent/EP3346468B1/en
Priority to PCT/JP2015/075077 priority patent/WO2017037920A1/en
Publication of WO2017037920A1 publication Critical patent/WO2017037920A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/45Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of analysis window
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use

Definitions

  • the present invention relates to a music analysis device, a music analysis method, and a music analysis program.
  • An object of the present invention is to provide a music analysis apparatus, a music analysis method, and a music analysis program capable of reducing the analysis time regardless of the time of sound data.
  • the music analysis apparatus is Playback time detection means for detecting the playback time of the input sound data; Execution interval setting means for setting an execution interval of FFT (Fast Fourier Transform) according to the reproduction time detected by the reproduction time detection means; Sound data analysis means for executing FFT and analyzing input sound data based on the FFT execution interval set by the execution interval setting means; It is characterized by having.
  • FFT Fast Fourier Transform
  • the music analysis apparatus is Data length detecting means for detecting the data length of the input sound data; Execution interval setting means for setting an FFT execution interval in accordance with the data length detected by the data length detection means; Sound data analysis means for executing FFT and analyzing input sound data based on the FFT execution interval set by the execution interval setting means; It is characterized by having.
  • the music analysis method includes: The procedure to detect the playback time of the input sound data, A procedure for setting the FFT execution interval according to the detected playback time; A procedure for performing FFT and analyzing input sound data based on the set FFT execution interval; It is characterized by performing.
  • the music analysis program is: The procedure to detect the playback time of the input sound data, A procedure for setting the FFT execution interval according to the detected playback time; A procedure for performing FFT and analyzing input sound data based on the set FFT execution interval; Is executed by a computer.
  • the block diagram which shows the music analysis apparatus which concerns on embodiment of this invention.
  • FIG. 1 shows a music analysis apparatus 1 according to this embodiment.
  • the music analysis device 1 analyzes digitized sound data SD such as input PCM data, determines a key of the sound data SD, and determines the sound data input on the display screen of a display device or the like. This is a device for displaying as a key display KD.
  • the music analysis apparatus 1 is configured as a software application that is activated by a general computer, a portable information terminal equipped with an OS (Operation System), and includes a reproduction time detection means 2, a sound data determination means 3, a sound data duplication means 4, and a sound.
  • Data analysis means 5, execution interval setting means 6, and key determination means 7 are provided.
  • the playback time detection means 2 detects the playback time of the input sound data SD. Specifically, the reproduction time detection means 2 detects the reproduction time by counting the number of samplings from the beginning to the end of the sound data SD. When the reproduction time is detected, the reproduction time detection unit 2 outputs the detected reproduction time to the sound data determination unit 3 and the execution interval setting unit 6.
  • the sound data determination means 3 determines whether or not the input sound data SD has a predetermined reproduction time or more based on the reproduction time detected by the reproduction time detection means 2. Specifically, it is determined whether or not the sound data SD has a reproduction time that can be analyzed by sound data analysis means 5 described later. Whether or not analysis is possible is determined by whether or not the reproduction time of the sound data SD has a time longer than the shortest time necessary for the window function applied to the sound data analysis means 5.
  • the time window length of the window function is determined by the sampling frequency of the sound data SD, the lower limit frequency to be detected, and the frequency resolution. For example, in the case of a regular 4/4 time tune of BPM200, one beat is 300 msec and a sixteenth note is 75 msec. When performing FFT and analyzing the low-pitched sound data SD of 27.5 Hz corresponding to A0 of the scale, data of 1.2 s is required at the minimum. When the sound data determination unit 3 determines that the sound data SD is less than the predetermined reproduction time, the sound data determination unit 3 outputs a determination result to that effect to the sound data duplication unit 4.
  • the sound data duplicating unit 4 duplicates the input sound data SD based on the determination result of the sound data determining unit 3, and continues the data so that the predetermined time or more is reached. Specifically, as shown in FIG. 2, the sound data duplicating unit 4 duplicates the input sound data SD of the reproduction time t1, and pastes the duplicate data CD behind the sound data SD so as to be continuous. , Continuous sound data SD ′ having a reproduction time t2 is generated.
  • the sound data duplicating unit 4 duplicates the duplicate data CD of the sound data SD, and the reproduction time t2 is 1.2 seconds or longer.
  • the duplication is repeated until the sound data SD ′ is obtained. It should be noted that the duplication only needs to be performed so that the reproduction time t2 can be analyzed by the sound data analysis means 5, and the number N of duplications may not be an integer.
  • the sound data replicating unit 4 outputs to the sound data analyzing unit 5 the sound data SD ′ that has been duplicated for a predetermined time or more.
  • the sound data analysis means 5 performs frequency spectrum analysis of the sound data SD and SD ′.
  • the analysis method is performed by analysis using FFT.
  • the analysis method is not particularly limited, and for example, analysis using DCT (Discrete Cosine transform), analysis using a time axis, analysis using a signal level, volume feeling, and attack feeling may be used. it can.
  • a Hamming window HMW that is a window function is usually applied to the FFT.
  • the Hamming window HMW is applied to weaken the signal intensity amplification at both ends on the time axis during the FFT execution time and reduce the influence of the discontinuous portion of the sampled waveform seam during the FFT execution.
  • the analysis period T0 is set to a 0.7 (70%) value of the Hamming window HMW.
  • the Hamming window HMW is applied.
  • the present invention is not limited to this, and a Hanning window, a flap top window, or the like may be used as the window function.
  • the execution interval setting means 6 sets the FFT execution interval by the sound data analysis means 5 based on the reproduction time detected by the reproduction time detection means 2. Specifically, as shown in FIG. 4, the execution interval setting means 6 sets the execution start time of the second FFT2 as the execution interval TI after the first execution of FFT1. Thereafter, in the present embodiment, the third FFT3 is started after time 2TI, and the FFTs are sequentially executed.
  • the execution interval TI is set according to the reproduction time of the sound data SD and SD ′. For example, in the case of long sound data SD with a reproduction time of 30 seconds or more, the execution interval setting means 6 increases the execution interval TI as shown in FIG. 5, and in the case of short sound data SD with a reproduction time of less than 30 s, As shown in FIG. 6, the execution interval TI is reduced.
  • the lower limit value of the execution interval TI is set to an execution interval TI such that the analysis periods T0 of FFT1, FFT2,.
  • the execution interval setting means 6 outputs the set execution interval TI to the sound data analysis means 5 described above.
  • the sound data analysis unit 5 repeatedly executes FFT based on the execution interval TI, and outputs the analysis result to the key determination unit 7 every time FFT is executed.
  • the key determination unit 7 performs key determination of the sound data SD and SD ′ based on the analysis result output from the sound data analysis unit 5. Specifically, the key determination means 7 stores a total of 24 reference frequencies of a total of 24 kinds of scales, that is, a minor key and a major key having each sound as a root for each of 12 scales per octave. The key determination means 7 adds the analysis results input at every execution interval TI in the time axis direction, and selects a reference frequency close to a frequency with a strong signal strength from the obtained sum, as shown in FIG. Next, the signal strength of each scale is obtained. Next, as shown in FIG.
  • the key determination means 7 rearranges and normalizes the signals in the order of strong signal strength, selects several scales with strong signal strength, and selects the keys of the sound data SD, SD ′. Determine.
  • the key determination means 7 displays the key determination result of the sound data SD, SD ′ as a key display KD on the computer display or the screen of the portable terminal.
  • Step S1 when a user of a computer or a portable terminal selects the music analysis device 1 on the screen, starts the program, and selects the sound data SD to be analyzed, the sound data SD is input to the music analysis device 1 ( Procedure S1).
  • the reproduction time detecting means 2 detects the reproduction time of the sound data SD (step S2).
  • the sound data determination means 3 determines whether or not the reproduction time of the sound data SD is equal to or longer than a predetermined time (step S3).
  • the sound data duplicating means 4 duplicates the sound data SD (step S4) and pastes the duplicate data CD on the sound data SD to obtain continuous sound.
  • Data SD ′ is generated.
  • the execution interval setting means 6 reproduces the sound data SD, SD ′. Is set to the execution interval TI of the sound data analysis means 5 (step S6).
  • the sound data analysis means 5 repeats FFT based on the set execution interval TI, and performs frequency spectrum analysis of the sound data SD and SD ′ (step S7).
  • the sound data analysis means 5 determines whether or not the sound data SD, SD ′ has ended (step S8). If it is determined that the sound data SD, SD ′ has ended, the analysis result is output to the key determination means. .
  • the key determination means 7 performs key determination on the sound data SD and SD ′ based on the analysis result (step S9).
  • the key determination means 7 outputs the sound data SD and SD ′ keys as determination results on the display screen of the computer or portable terminal (step S10).
  • the music analysis apparatus 1 includes the sound data duplicating means 4, even if the sound data SD is very short, the sound data SD ′ can be made longer than a predetermined time by duplication. Regardless of the time, the sound data analysis means 5 can perform FFT and perform frequency spectrum analysis to perform key determination of the sound data SD and SD ′. As a result, a variety of sound data SD can be used in a DJ-related device regardless of the length of playback time of the sound data SD, so that a DJ with high performance can be provided.
  • the frequency analysis apparatus 1 includes the reproduction time detection unit 2 and the execution interval setting unit 6, the FFT execution interval TI by the sound data analysis unit 5 can be changed according to the reproduction time of the sound data SD. . Therefore, in the case of the sound data SD having a short reproduction time, the analysis interval of the sound data SD can be improved by shortening the execution interval TI and increasing the number of executions of FFT. On the other hand, in the case of sound data SD having a long reproduction time, the analysis time can be shortened by increasing the execution interval TI of the execution interval TI and reducing the number of FFT executions during reproduction of the sound data SD. In the case of long sound data SD, the number of executions of FFT tends to be relatively small and the analysis tends to be rough. However, since the number of executions sufficient for key determination and the like can be obtained, satisfactory results can be obtained. Can be obtained.
  • the music analysis apparatus 1 performs key determination of the sound data SD, but the present invention is not limited to this, and may be used as a music analysis apparatus that performs key and scale determination.
  • the execution interval setting unit 6 sets the execution interval TI based on the reproduction time of the sound data SD and SD ′.
  • the execution interval may be set based on the data length of the input sound data. In addition, other configurations may be adopted as long as the object of the present invention can be achieved.

Abstract

A musical-piece analysis device (1) is equipped with: a playback-interval detection means (2) for detecting the playback interval of inputted sound data; an execution-period setting means (6) for setting the execution period for a Fast Fourier Transform (FFT), according to the playback interval detected by the playback-interval detection means (2); and a sound data analysis means (5) for executing the FFT and analyzing the inputted sound data, on the basis of the FFT execution period set by the execution period setting means (6).

Description

楽曲解析装置、楽曲解析方法、及び楽曲解析プログラムMusic analysis apparatus, music analysis method, and music analysis program
 本発明は、楽曲解析装置、楽曲解析方法、及び楽曲解析プログラムに関する。 The present invention relates to a music analysis device, a music analysis method, and a music analysis program.
 従来、楽曲等の音データから自動で拍やテンポ、楽曲のキー、スケール等の楽曲を解析する技術が知られている(例えば、特許文献1参照)。
 音データの解析は、ある程度の長さの音データをサンプリングし、入力された波形を、例えば、FFT(Fast Fourier Transform)を用いて解析することにより行われる。
 このような音データの解析は、DJ関連機器の分野においても、BPM(Beats Per Minute)楽曲に関連する技術として利用されている。
 DJ関連機器でこのような音データの解析を利用すれば、解析されたテンポ、キー、スケール等により、曲と曲との間を違和感なくつなぐことができるため、パフォーマンスの高いDJを提供することができる。
2. Description of the Related Art Conventionally, a technique for automatically analyzing music such as beats, tempos, music keys, and scales from sound data such as music is known (see, for example, Patent Document 1).
The sound data is analyzed by sampling sound data of a certain length and analyzing the input waveform using, for example, FFT (Fast Fourier Transform).
Such analysis of sound data is also used as a technology related to BPM (Beats Per Minute) music in the field of DJ-related equipment.
By using such sound data analysis with DJ-related equipment, it is possible to connect songs with a sense of incongruity using the analyzed tempo, key, scale, etc., and provide a DJ with high performance Can do.
特開2010-97084号公報JP 2010-97084 A
 しかしながら、従来、FFTの実行間隔は固定されているため、長時間の音データを解析しようとすると、FFTの実行回数が多くなり、解析に時間を要するという課題がある。 However, since the FFT execution interval is conventionally fixed, there is a problem in that when the sound data for a long time is analyzed, the number of times the FFT is executed increases and the analysis takes time.
 本発明の目的は、音データの時間によらず、解析時間を短縮することのできる楽曲解析装置及び楽曲解析方法、及び楽曲解析プログラムを提供することにある。 An object of the present invention is to provide a music analysis apparatus, a music analysis method, and a music analysis program capable of reducing the analysis time regardless of the time of sound data.
 本発明に係る楽曲解析装置は、
 入力された音データの再生時間を検出する再生時間検出手段と、
 前記再生時間検出手段で検出された再生時間に応じて、FFT(Fast Fourier Transform)の実行間隔を設定する実行間隔設定手段と、
 前記実行間隔設定手段で設定されたFFTの実行間隔に基づいて、FFTを実行し、入力された音データの解析を行う音データ解析手段と、
を備えていることを特徴とする。
The music analysis apparatus according to the present invention is
Playback time detection means for detecting the playback time of the input sound data;
Execution interval setting means for setting an execution interval of FFT (Fast Fourier Transform) according to the reproduction time detected by the reproduction time detection means;
Sound data analysis means for executing FFT and analyzing input sound data based on the FFT execution interval set by the execution interval setting means;
It is characterized by having.
 本発明に係る楽曲解析装置は、
 入力された音データのデータ長を検出するデータ長検出手段と、
 前記データ長検出手段で検出されたデータ長に応じて、FFTの実行間隔を設定する実行間隔設定手段と、
 前記実行間隔設定手段で設定されたFFTの実行間隔に基づいて、FFTを実行し、入力された音データの解析を行う音データ解析手段と、
を備えていることを特徴とする。
The music analysis apparatus according to the present invention is
Data length detecting means for detecting the data length of the input sound data;
Execution interval setting means for setting an FFT execution interval in accordance with the data length detected by the data length detection means;
Sound data analysis means for executing FFT and analyzing input sound data based on the FFT execution interval set by the execution interval setting means;
It is characterized by having.
 本発明に係る楽曲解析方法は、
 入力された音データの再生時間を検出する手順と、
 検出された再生時間に応じて、FFTの実行間隔を設定する手順と、
 設定されたFFTの実行間隔に基づいて、FFTを実行し、入力された音データの解析を行う手順と、
を実行することを特徴とする。
The music analysis method according to the present invention includes:
The procedure to detect the playback time of the input sound data,
A procedure for setting the FFT execution interval according to the detected playback time;
A procedure for performing FFT and analyzing input sound data based on the set FFT execution interval;
It is characterized by performing.
 本発明に係る楽曲解析プログラムは、
 入力された音データの再生時間を検出する手順と、
 検出された再生時間に応じて、FFTの実行間隔を設定する手順と、
 設定されたFFTの実行間隔に基づいて、FFTを実行し、入力された音データの解析を行う手順と、
をコンピュータに実行させることを特徴とする。
The music analysis program according to the present invention is:
The procedure to detect the playback time of the input sound data,
A procedure for setting the FFT execution interval according to the detected playback time;
A procedure for performing FFT and analyzing input sound data based on the set FFT execution interval;
Is executed by a computer.
本発明の実施形態に係る楽曲解析装置を示すブロック図。The block diagram which shows the music analysis apparatus which concerns on embodiment of this invention. 前記実施形態における音データの複製を説明するための模式図。The schematic diagram for demonstrating duplication of the sound data in the said embodiment. 前記実施形態における窓関数を説明するための模式図。The schematic diagram for demonstrating the window function in the said embodiment. 前記実施形態におけるFFTの実行間隔を説明するための模式図。The schematic diagram for demonstrating the execution interval of FFT in the said embodiment. 前記実施形態における再生時間の長い音データの実行間隔を説明するための模式図。The schematic diagram for demonstrating the execution interval of the sound data with a long reproduction time in the said embodiment. 前記実施形態における再生時間の短い音データの実行間隔を説明するための模式図。The schematic diagram for demonstrating the execution space | interval of sound data with a short reproduction time in the said embodiment. 前記実施形態におけるFFT実行後のキー判定を説明するための模式図。The schematic diagram for demonstrating the key determination after FFT execution in the said embodiment. 前記実施形態におけるFFT実行後のキー判定を説明するための模式図。The schematic diagram for demonstrating the key determination after FFT execution in the said embodiment. 前記実施形態における楽曲解析方法を説明するためのフローチャート。The flowchart for demonstrating the music analysis method in the said embodiment.
 以下、本発明の実施の一形態について説明する。
 図1には、本実施形態に係る楽曲解析装置1が示されている。この楽曲解析装置1は、入力されたPCMデータ等のデジタル化された音データSDを解析し、その音データSDのキーを判定して、表示装置等の表示画面上に入力された音データのキー表示KDとして表示させる装置である。
 楽曲解析装置1は、一般的なコンピュータ、OS(Operation System)を搭載した携帯情報端末で起動するソフトウェアアプリケーションとして構成され、再生時間検出手段2、音データ判定手段3、音データ複製手段4、音データ解析手段5、実行間隔設定手段6、及びキー判定手段7を備える。
Hereinafter, an embodiment of the present invention will be described.
FIG. 1 shows a music analysis apparatus 1 according to this embodiment. The music analysis device 1 analyzes digitized sound data SD such as input PCM data, determines a key of the sound data SD, and determines the sound data input on the display screen of a display device or the like. This is a device for displaying as a key display KD.
The music analysis apparatus 1 is configured as a software application that is activated by a general computer, a portable information terminal equipped with an OS (Operation System), and includes a reproduction time detection means 2, a sound data determination means 3, a sound data duplication means 4, and a sound. Data analysis means 5, execution interval setting means 6, and key determination means 7 are provided.
 再生時間検出手段2は、入力された音データSDの再生時間を検出する。具体的には、再生時間検出手段2は、音データSDの始めから最後までのサンプリングの回数をカウントすることにより、再生時間を検出する。再生時間を検出したら、再生時間検出手段2は、音データ判定手段3及び実行間隔設定手段6に、検出した再生時間を出力する。 The playback time detection means 2 detects the playback time of the input sound data SD. Specifically, the reproduction time detection means 2 detects the reproduction time by counting the number of samplings from the beginning to the end of the sound data SD. When the reproduction time is detected, the reproduction time detection unit 2 outputs the detected reproduction time to the sound data determination unit 3 and the execution interval setting unit 6.
 音データ判定手段3は、再生時間検出手段2で検出された再生時間に基づいて、入力された音データSDが所定の再生時間以上を有するか否かを判定する。具体的には、音データSDが、後述する音データ解析手段5で解析できるだけの再生時間を有するか否かを判定する。
 解析の可否の判定は、音データ解析手段5に適用される窓関数に必要な最短の時間以上の時間を、音データSDの再生時間が有しているか否かで判定される。
The sound data determination means 3 determines whether or not the input sound data SD has a predetermined reproduction time or more based on the reproduction time detected by the reproduction time detection means 2. Specifically, it is determined whether or not the sound data SD has a reproduction time that can be analyzed by sound data analysis means 5 described later.
Whether or not analysis is possible is determined by whether or not the reproduction time of the sound data SD has a time longer than the shortest time necessary for the window function applied to the sound data analysis means 5.
 窓関数の時間窓長は、音データSDのサンプリング周波数、検出する下限周波数、周波数分解能により決定される。
 例えば、BPM200の通常の4/4拍子の楽曲の場合、1拍は300msecとなり、16分音符は75msecとなる。FFTを実行して、音階のA0にあたる27.5Hzの低音の音データSDの解析を行う場合、最低でも1.2sのデータが必要である。
 音データ判定手段3は、音データSDが所定の再生時間未満であると判定したら、その旨の判定結果を、音データ複製手段4に出力する。
The time window length of the window function is determined by the sampling frequency of the sound data SD, the lower limit frequency to be detected, and the frequency resolution.
For example, in the case of a regular 4/4 time tune of BPM200, one beat is 300 msec and a sixteenth note is 75 msec. When performing FFT and analyzing the low-pitched sound data SD of 27.5 Hz corresponding to A0 of the scale, data of 1.2 s is required at the minimum.
When the sound data determination unit 3 determines that the sound data SD is less than the predetermined reproduction time, the sound data determination unit 3 outputs a determination result to that effect to the sound data duplication unit 4.
 音データ複製手段4は、音データ判定手段3の判定結果に基づいて、入力された音データSDを複製し、前述した所定の時間以上となるように連続させる。具体的には、音データ複製手段4は、図2に示されるように、入力された再生時間t1の音データSDを複製し、複製データCDを、音データSDの後ろに貼り付けて連続させ、再生時間t2の連続した音データSD’を生成する。 The sound data duplicating unit 4 duplicates the input sound data SD based on the determination result of the sound data determining unit 3, and continues the data so that the predetermined time or more is reached. Specifically, as shown in FIG. 2, the sound data duplicating unit 4 duplicates the input sound data SD of the reproduction time t1, and pastes the duplicate data CD behind the sound data SD so as to be continuous. , Continuous sound data SD ′ having a reproduction time t2 is generated.
 例えば、音データSDの再生時間t1が前述した例における1.2secより短い場合、音データ複製手段4は、音データSDの複製データCDを複製して、再生時間t2が1.2sec以上の連続した音データSD’となるまで複製を繰り返す。
 なお、複製は、再生時間t2が音データ解析手段5で解析できるだけの再生時間であればよく、複製の回数Nは整数でなくてもよい。
 音データ複製手段4は、複製により、所定の時間以上とされた音データSD’を音データ解析手段5に出力する。
For example, when the reproduction time t1 of the sound data SD is shorter than 1.2 seconds in the above-described example, the sound data duplicating unit 4 duplicates the duplicate data CD of the sound data SD, and the reproduction time t2 is 1.2 seconds or longer. The duplication is repeated until the sound data SD ′ is obtained.
It should be noted that the duplication only needs to be performed so that the reproduction time t2 can be analyzed by the sound data analysis means 5, and the number N of duplications may not be an integer.
The sound data replicating unit 4 outputs to the sound data analyzing unit 5 the sound data SD ′ that has been duplicated for a predetermined time or more.
 音データ解析手段5は、音データSD、SD’の周波数スペクトル解析を行う。解析方法は、本実施形態では、FFTを用いて解析で行っている。但し、解析方法は、特に限定されるわけでなく、例えば、DCT(Discrete Cosine transform)を用いた解析や、時間軸による解析や、信号レベルによる解析、音量感、アタック感による解析を用いることができる。
 FFTには、図3に示すように、通常、窓関数であるハミング窓HMWが適用されている。ハミング窓HMWは、FFTの実行時間中の時間軸上の両端の信号強度の増幅を弱め、FFT実行時に、サンプリングされた波形の継ぎ目の不連続部分の影響を少なくするために適用されている。
The sound data analysis means 5 performs frequency spectrum analysis of the sound data SD and SD ′. In this embodiment, the analysis method is performed by analysis using FFT. However, the analysis method is not particularly limited, and for example, analysis using DCT (Discrete Cosine transform), analysis using a time axis, analysis using a signal level, volume feeling, and attack feeling may be used. it can.
As shown in FIG. 3, a Hamming window HMW that is a window function is usually applied to the FFT. The Hamming window HMW is applied to weaken the signal intensity amplification at both ends on the time axis during the FFT execution time and reduce the influence of the discontinuous portion of the sampled waveform seam during the FFT execution.
 このため、FFTを実行したデータの時間軸上の両端部は、信号強度が弱すぎるので、解析データとして使用することができない。
 そこで、本実施形態では、解析可能なデータとして、FFT実行時間のうち信号強度が弱められない解析期間T0の信号強度を用いて、周波数スペクトル解析を行っている。解析期間T0は、任意に定めうるが、本実施形態では、ハミング窓HMWの0.7(70%)値で設定している。
 なお、本実施形態では、ハミング窓HMWを適用しているが、これに限らず、ハニング窓、フラップトップ窓等を窓関数として用いてもよい。
For this reason, since the signal intensity is too weak at both ends on the time axis of the data on which FFT has been executed, it cannot be used as analysis data.
Therefore, in the present embodiment, frequency spectrum analysis is performed using the signal intensity of the analysis period T0 during which the signal intensity is not weakened in the FFT execution time as the data that can be analyzed. Although the analysis period T0 can be arbitrarily determined, in this embodiment, the analysis period T0 is set to a 0.7 (70%) value of the Hamming window HMW.
In the present embodiment, the Hamming window HMW is applied. However, the present invention is not limited to this, and a Hanning window, a flap top window, or the like may be used as the window function.
 実行間隔設定手段6は、再生時間検出手段2で検出された再生時間に基づいて、音データ解析手段5によるFFTの実行間隔を設定する。
 具体的には、実行間隔設定手段6は、図4に示されるように、1回目にFFT1が実行された後、2回目のFFT2の実行開始の時間を、実行間隔TIとして設定する。以後、本実施形態では、3回目のFFT3は、時間2TI後に開始され、順次FFTが実行される。
The execution interval setting means 6 sets the FFT execution interval by the sound data analysis means 5 based on the reproduction time detected by the reproduction time detection means 2.
Specifically, as shown in FIG. 4, the execution interval setting means 6 sets the execution start time of the second FFT2 as the execution interval TI after the first execution of FFT1. Thereafter, in the present embodiment, the third FFT3 is started after time 2TI, and the FFTs are sequentially executed.
 実行間隔TIの設定は、音データSD、SD’の再生時間によって設定される。
 実行間隔設定手段6は、例えば、再生時間が30sec以上の長い音データSDの場合、図5に示されるように、実行間隔TIを大きくし、再生時間が30s未満の短い音データSDの場合、図6に示されるように、実行間隔TIを小さくする。実行間隔TIの下限値は、FFT1、FFT2・・・の解析期間T0が連続するような実行間隔TIとする。
 実行間隔設定手段6は、設定した実行間隔TIを前述した音データ解析手段5に出力する。
 音データ解析手段5は、実行間隔TIに基づいて、FFTを繰り返し実行し、FFTの実行の度に解析結果をキー判定手段7に出力する。
The execution interval TI is set according to the reproduction time of the sound data SD and SD ′.
For example, in the case of long sound data SD with a reproduction time of 30 seconds or more, the execution interval setting means 6 increases the execution interval TI as shown in FIG. 5, and in the case of short sound data SD with a reproduction time of less than 30 s, As shown in FIG. 6, the execution interval TI is reduced. The lower limit value of the execution interval TI is set to an execution interval TI such that the analysis periods T0 of FFT1, FFT2,.
The execution interval setting means 6 outputs the set execution interval TI to the sound data analysis means 5 described above.
The sound data analysis unit 5 repeatedly executes FFT based on the execution interval TI, and outputs the analysis result to the key determination unit 7 every time FFT is executed.
 キー判定手段7は、音データ解析手段5から出力された解析結果に基づいて、音データSD、SD’のキー判定を行う。
 具体的には、キー判定手段7は、オクターブあたり12の音階それぞれについて、各音をルートとするマイナーキー及びメジャーキーの2種類、計24種類の音階の基準周波数を記憶している。
 キー判定手段7は、実行間隔TI毎に入力される解析結果を時間軸方向に合算し、得られた合算値から、信号強度の強い周波数に近い基準周波数を選択し、図7に示されるように、各音階の信号強度を求める。
 次に、キー判定手段7は、図8に示されるように、信号強度の強い順番に並べ替え、正規化し、信号強度の強いいくつかの音階を選択して、音データSD、SD’のキーを判定する。
 キー判定手段7は、音データSD、SD’のキー判定結果をキー表示KDとして、コンピュータのディスプレイや、携帯端末の画面上に表示する。
The key determination unit 7 performs key determination of the sound data SD and SD ′ based on the analysis result output from the sound data analysis unit 5.
Specifically, the key determination means 7 stores a total of 24 reference frequencies of a total of 24 kinds of scales, that is, a minor key and a major key having each sound as a root for each of 12 scales per octave.
The key determination means 7 adds the analysis results input at every execution interval TI in the time axis direction, and selects a reference frequency close to a frequency with a strong signal strength from the obtained sum, as shown in FIG. Next, the signal strength of each scale is obtained.
Next, as shown in FIG. 8, the key determination means 7 rearranges and normalizes the signals in the order of strong signal strength, selects several scales with strong signal strength, and selects the keys of the sound data SD, SD ′. Determine.
The key determination means 7 displays the key determination result of the sound data SD, SD ′ as a key display KD on the computer display or the screen of the portable terminal.
 次に、前述した構成を備えた楽曲解析装置1による音データSDのキー判定を、図9に示されるフローチャートに基づいて、説明する。
 まず、コンピュータや携帯端末のユーザーが、楽曲解析装置1を画面上で選択し、プログラムを起動し、解析対象である音データSDを選択すると、楽曲解析装置1に音データSDが入力される(手順S1)。
 音データSDが入力されたら、再生時間検出手段2は、音データSDの再生時間を検出する(手順S2)。
 音データ判定手段3は、音データSDの再生時間が所定の時間以上であるかどうかを判定する(手順S3)。
Next, key determination of the sound data SD by the music analysis apparatus 1 having the above-described configuration will be described based on the flowchart shown in FIG.
First, when a user of a computer or a portable terminal selects the music analysis device 1 on the screen, starts the program, and selects the sound data SD to be analyzed, the sound data SD is input to the music analysis device 1 ( Procedure S1).
When the sound data SD is input, the reproduction time detecting means 2 detects the reproduction time of the sound data SD (step S2).
The sound data determination means 3 determines whether or not the reproduction time of the sound data SD is equal to or longer than a predetermined time (step S3).
 音データSDの再生時間が所定の時間未満であると判定されたら、音データ複製手段4は、音データSDを複製し(手順S4)複製データCDを音データSDに貼り付けて、連続した音データSD’を生成する。
 はじめから音データSDの再生時間が所定の時間以上であるか、音データSD’の再生時間が所定の時間以上となった場合、実行間隔設定手段6は、音データSD、SD’の再生時間に基づいて、音データ解析手段5の実行間隔TIを設定する(手順S6)。
If it is determined that the reproduction time of the sound data SD is less than the predetermined time, the sound data duplicating means 4 duplicates the sound data SD (step S4) and pastes the duplicate data CD on the sound data SD to obtain continuous sound. Data SD ′ is generated.
When the reproduction time of the sound data SD from the beginning is equal to or longer than the predetermined time or when the reproduction time of the sound data SD ′ is equal to or longer than the predetermined time, the execution interval setting means 6 reproduces the sound data SD, SD ′. Is set to the execution interval TI of the sound data analysis means 5 (step S6).
 音データ解析手段5は、設定された実行間隔TIに基づいて、FFTを繰り返し、音データSD、SD’の周波数スペクトル解析を行う(手順S7)。
 音データ解析手段5は、音データSD、SD’が終了したか否かを判定し(手順S8)、音データSD、SD’が終了したと判定されたら、解析結果をキー判定手段に出力する。
 キー判定手段7は、解析結果に基づいて、音データSD、SD’のキー判定を行う(手順S9)。
 キー判定手段7は、判定結果である音データSD、SD’のキーを、コンピュータや携帯端末の表示画面上に出力する(手順S10)。
The sound data analysis means 5 repeats FFT based on the set execution interval TI, and performs frequency spectrum analysis of the sound data SD and SD ′ (step S7).
The sound data analysis means 5 determines whether or not the sound data SD, SD ′ has ended (step S8). If it is determined that the sound data SD, SD ′ has ended, the analysis result is output to the key determination means. .
The key determination means 7 performs key determination on the sound data SD and SD ′ based on the analysis result (step S9).
The key determination means 7 outputs the sound data SD and SD ′ keys as determination results on the display screen of the computer or portable terminal (step S10).
 このような本実施形態によれば、以下のような効果がある。
 楽曲解析装置1が音データ複製手段4を備えていることにより、非常に短い音データSDであっても、複製により所定時間以上の音データSD’とすることができるため、音データSDの再生時間によらず、音データ解析手段5により、FFTを実行して周波数スペクトル解析を行って、音データSD、SD’のキー判定を行うことができる。
 これにより、音データSDの再生時間の長短によらず、DJ関連機器で様々な音データSDを利用できるので、パフォーマンスの高いDJを提供することができる。
According to this embodiment, there are the following effects.
Since the music analysis apparatus 1 includes the sound data duplicating means 4, even if the sound data SD is very short, the sound data SD ′ can be made longer than a predetermined time by duplication. Regardless of the time, the sound data analysis means 5 can perform FFT and perform frequency spectrum analysis to perform key determination of the sound data SD and SD ′.
As a result, a variety of sound data SD can be used in a DJ-related device regardless of the length of playback time of the sound data SD, so that a DJ with high performance can be provided.
 楽曲解析装置1が再生時間検出手段2及び実行間隔設定手段6を備えていることにより、音データSDの再生時間に応じて、音データ解析手段5によるFFTの実行間隔TIを変更することができる。従って、再生時間の短い音データSDの場合は、実行間隔TIを短くして、FFTの実行回数を増やして、音データSDの解析精度を向上させることができる。
 一方、再生時間の長い音データSDの場合、実行間隔TIの実行間隔TIを長くして、音データSDの再生中のFFTの実行回数を減らして、解析時間を短縮することができる。なお、長い音データSDの場合、FFTの実行回数が相対的に少なくなり、解析が粗くなる傾向があるが、キー判定等の用途には充分な実行回数が得られるため、支障なく良好な結果を得ることができる。
Since the music analysis apparatus 1 includes the reproduction time detection unit 2 and the execution interval setting unit 6, the FFT execution interval TI by the sound data analysis unit 5 can be changed according to the reproduction time of the sound data SD. . Therefore, in the case of the sound data SD having a short reproduction time, the analysis interval of the sound data SD can be improved by shortening the execution interval TI and increasing the number of executions of FFT.
On the other hand, in the case of sound data SD having a long reproduction time, the analysis time can be shortened by increasing the execution interval TI of the execution interval TI and reducing the number of FFT executions during reproduction of the sound data SD. In the case of long sound data SD, the number of executions of FFT tends to be relatively small and the analysis tends to be rough. However, since the number of executions sufficient for key determination and the like can be obtained, satisfactory results can be obtained. Can be obtained.
 なお、本発明は、前述の実施形態に限定されるものではなく、以下に示すような変形をも含むものである。
 前述した実施形態では、楽曲解析装置1は、音データSDのキー判定を行うものであったが、本発明はこれに限らず、キー及びスケール判定を行う楽曲解析装置として利用してもよい。
 また、前述した実施形態では、実行間隔設定手段6は、音データSD、SD’の再生時間に基づいて、実行間隔TIを設定していたが、本発明はこれに限られない。入力される音データのデータ長に基づいて、実行間隔を設定してもよい。
 その他、本発明の目的を達成できる範囲で他の構成を採用してもよい。
In addition, this invention is not limited to the above-mentioned embodiment, The modification as shown below is also included.
In the embodiment described above, the music analysis apparatus 1 performs key determination of the sound data SD, but the present invention is not limited to this, and may be used as a music analysis apparatus that performs key and scale determination.
In the above-described embodiment, the execution interval setting unit 6 sets the execution interval TI based on the reproduction time of the sound data SD and SD ′. However, the present invention is not limited to this. The execution interval may be set based on the data length of the input sound data.
In addition, other configurations may be adopted as long as the object of the present invention can be achieved.
 1…楽曲解析装置、2…再生時間検出手段、3…音データ判定手段、4…音データ複製手段、5…音データ解析手段、6…実行間隔設定手段、7…キー判定手段、CD…複製データ、HMW…ハミング窓、KD…キー表示、S1…手順、S2…手順、S3…手順、S4…手順、S6…手順、S7…手順、S8…手順、S9…手順、S10…手順、SD…音データ、T0…解析期間、t1…再生時間、t2…再生時間、TI…実行間隔 DESCRIPTION OF SYMBOLS 1 ... Music analysis apparatus, 2 ... Reproduction time detection means, 3 ... Sound data determination means, 4 ... Sound data duplication means, 5 ... Sound data analysis means, 6 ... Execution interval setting means, 7 ... Key judgment means, CD ... Duplication Data, HMW ... Hamming window, KD ... Key display, S1 ... Procedure, S2 ... Procedure, S3 ... Procedure, S4 ... Procedure, S6 ... Procedure, S7 ... Procedure, S8 ... Procedure, S9 ... Procedure, S10 ... Procedure, SD ... Sound data, T0 ... analysis period, t1 ... reproduction time, t2 ... reproduction time, TI ... execution interval

Claims (6)

  1.  入力された音データの再生時間を検出する再生時間検出手段と、
     前記再生時間検出手段で検出された再生時間に応じて、FFT(Fast Fourier Transform)の実行間隔を設定する実行間隔設定手段と、
     前記実行間隔設定手段で設定されたFFTの実行間隔に基づいて、FFTを実行し、入力された音データの解析を行う音データ解析手段と、
    を備えていることを特徴とする楽曲解析装置。
    Playback time detection means for detecting the playback time of the input sound data;
    Execution interval setting means for setting an execution interval of FFT (Fast Fourier Transform) according to the reproduction time detected by the reproduction time detection means;
    Sound data analysis means for executing FFT and analyzing input sound data based on the FFT execution interval set by the execution interval setting means;
    A music analysis apparatus comprising:
  2.  入力された音データのデータ長を検出するデータ長検出手段と、
     前記データ長検出手段で検出されたデータ長に応じて、FFTの実行間隔を設定する実行間隔設定手段と、
     前記実行間隔設定手段で設定されたFFTの実行間隔に基づいて、FFTを実行し、入力された音データの解析を行う音データ解析手段と、
    を備えていることを特徴とする楽曲解析装置。
    Data length detecting means for detecting the data length of the input sound data;
    Execution interval setting means for setting an FFT execution interval in accordance with the data length detected by the data length detection means;
    Sound data analysis means for executing FFT and analyzing input sound data based on the FFT execution interval set by the execution interval setting means;
    A music analysis apparatus comprising:
  3.  請求項1又は請求項2に記載の楽曲解析装置において、
     前記実行間隔設定手段は、
     入力された音データの再生時間又はデータ長が、所定の時間又は所定のデータ長よりも長い場合、FFTの実行間隔を通常のFFTの実行間隔よりも長くとり、
     入力された音データの再生時間又はデータ長が所定の時間又は所定のデータ長よりも短い場合、FFTの実行間隔を通常のFFTの実行間隔よりも短くとること、
    を特徴とする楽曲解析装置。
    In the music analysis device according to claim 1 or 2,
    The execution interval setting means includes:
    When the playback time or data length of the input sound data is longer than the predetermined time or the predetermined data length, the FFT execution interval is set longer than the normal FFT execution interval,
    When the reproduction time or data length of the input sound data is shorter than the predetermined time or the predetermined data length, the FFT execution interval should be shorter than the normal FFT execution interval;
    A music analysis device characterized by this.
  4.  請求項3に記載の楽曲解析装置において、
     前記音データ解析手段は、前記FFTの実行結果の解析可能なデータについて解析を行い、
     前記実行間隔設定手段は、
     前記FFTの解析可能なデータが時間軸上で連続するように、実行間隔を設定することを特徴とする楽曲解析装置。
    In the music analysis device according to claim 3,
    The sound data analyzing means analyzes data that can be analyzed as an execution result of the FFT,
    The execution interval setting means includes:
    An execution interval is set so that the FFT analyzable data is continuous on a time axis.
  5.  入力された音データの再生時間を検出する手順と、
     検出された再生時間に応じて、FFTの実行間隔を設定する手順と、
     設定されたFFTの実行間隔に基づいて、FFTを実行し、入力された音データの解析を行う手順と、
    を実行することを特徴とする楽曲解析方法。
    The procedure to detect the playback time of the input sound data,
    A procedure for setting the FFT execution interval according to the detected playback time;
    A procedure for performing FFT and analyzing input sound data based on the set FFT execution interval;
    The music analysis method characterized by performing.
  6.  入力された音データの再生時間を検出する手順と、
     検出された再生時間に応じて、FFTの実行間隔を設定する手順と、
     設定されたFFTの実行間隔に基づいて、FFTを実行し、入力された音データの解析を行う手順と、
    をコンピュータに実行させることを特徴とする楽曲解析プログラム。
     
    The procedure to detect the playback time of the input sound data,
    A procedure for setting the FFT execution interval according to the detected playback time;
    A procedure for performing FFT and analyzing input sound data based on the set FFT execution interval;
    The music analysis program which makes a computer execute.
PCT/JP2015/075077 2015-09-03 2015-09-03 Musical-piece analysis device, musical-piece analysis method, and musical-piece analysis program WO2017037920A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
JP2017537159A JP6549234B2 (en) 2015-09-03 2015-09-03 Music analysis apparatus, music analysis method, and music analysis program
EP15903039.4A EP3346468B1 (en) 2015-09-03 2015-09-03 Musical-piece analysis device, musical-piece analysis method, and musical-piece analysis program
PCT/JP2015/075077 WO2017037920A1 (en) 2015-09-03 2015-09-03 Musical-piece analysis device, musical-piece analysis method, and musical-piece analysis program

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2015/075077 WO2017037920A1 (en) 2015-09-03 2015-09-03 Musical-piece analysis device, musical-piece analysis method, and musical-piece analysis program

Publications (1)

Publication Number Publication Date
WO2017037920A1 true WO2017037920A1 (en) 2017-03-09

Family

ID=58186787

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2015/075077 WO2017037920A1 (en) 2015-09-03 2015-09-03 Musical-piece analysis device, musical-piece analysis method, and musical-piece analysis program

Country Status (3)

Country Link
EP (1) EP3346468B1 (en)
JP (1) JP6549234B2 (en)
WO (1) WO2017037920A1 (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008040283A (en) * 2006-08-09 2008-02-21 Kawai Musical Instr Mfg Co Ltd Chord name detecting device and program for chord name detection
JP2013235050A (en) * 2012-05-07 2013-11-21 Sony Corp Information processing apparatus and method, and program

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3741106B2 (en) * 2003-02-21 2006-02-01 ヤマハ株式会社 Musical sound waveform analysis method and musical sound waveform analysis synthesis method
JP4767691B2 (en) * 2005-07-19 2011-09-07 株式会社河合楽器製作所 Tempo detection device, code name detection device, and program

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008040283A (en) * 2006-08-09 2008-02-21 Kawai Musical Instr Mfg Co Ltd Chord name detecting device and program for chord name detection
JP2013235050A (en) * 2012-05-07 2013-11-21 Sony Corp Information processing apparatus and method, and program

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3346468A4 *

Also Published As

Publication number Publication date
EP3346468A4 (en) 2019-04-24
JPWO2017037920A1 (en) 2018-06-14
EP3346468B1 (en) 2021-11-03
JP6549234B2 (en) 2019-07-24
EP3346468A1 (en) 2018-07-11

Similar Documents

Publication Publication Date Title
US8344234B2 (en) Tempo detecting device and tempo detecting program
US8901406B1 (en) Selecting audio samples based on excitation state
CN108292499A (en) Skill determining device and recording medium
WO2019043797A1 (en) Song analysis device and song analysis program
JP2015079151A (en) Music discrimination device, discrimination method of music discrimination device, and program
JP5395399B2 (en) Mobile terminal, beat position estimating method and beat position estimating program
JP6920445B2 (en) Music analysis device and music analysis program
JP2008065153A (en) Musical piece structure analyzing method, program and device
WO2017037920A1 (en) Musical-piece analysis device, musical-piece analysis method, and musical-piece analysis program
WO2017037919A1 (en) Musical-piece analysis device, musical-piece analysis method, and musical-piece analysis program
JP6263383B2 (en) Audio signal processing apparatus, audio signal processing apparatus control method, and program
JP6263382B2 (en) Audio signal processing apparatus, audio signal processing apparatus control method, and program
JP4632136B2 (en) Music tempo extraction method, apparatus and program
WO2021059701A1 (en) Musical composition analysis device, musical composition analysis method, and musical composition analysis program
JP6286933B2 (en) Apparatus, method, and program for estimating measure interval and extracting feature amount for the estimation
JP6946442B2 (en) Music analysis device and music analysis program
JP6168649B2 (en) Code detection apparatus and program
WO2024034115A1 (en) Audio signal processing device, audio signal processing method, and program
JP6847242B2 (en) Music analysis device and music analysis program
JP7175395B2 (en) Music structure analysis device and music structure analysis program
JP2009186762A (en) Beat timing information generation device and program
JPWO2019026236A1 (en) Music analysis device and music analysis program
JP6464853B2 (en) Audio playback apparatus and audio playback program
EP2043089B1 (en) Method and device for humanizing music sequences
JP2000163068A (en) Waveform data processor

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15903039

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2017537159

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2015903039

Country of ref document: EP