WO2017037920A1

WO2017037920A1 - Musical-piece analysis device, musical-piece analysis method, and musical-piece analysis program

Info

Publication number: WO2017037920A1
Application number: PCT/JP2015/075077
Authority: WO
Inventors: 四郎鈴木
Original assignee: ＰｉｏｎｅｅｒＤＪ株式会社
Priority date: 2015-09-03
Filing date: 2015-09-03
Publication date: 2017-03-09
Also published as: EP3346468A4; JPWO2017037920A1; EP3346468B1; JP6549234B2; EP3346468A1

Abstract

A musical-piece analysis device (1) is equipped with: a playback-interval detection means (2) for detecting the playback interval of inputted sound data; an execution-period setting means (6) for setting the execution period for a Fast Fourier Transform (FFT), according to the playback interval detected by the playback-interval detection means (2); and a sound data analysis means (5) for executing the FFT and analyzing the inputted sound data, on the basis of the FFT execution period set by the execution period setting means (6).

Description

Music analysis apparatus, music analysis method, and music analysis program

The present invention relates to a music analysis device, a music analysis method, and a music analysis program.

2. Description of the Related Art Conventionally, a technique for automatically analyzing music such as beats, tempos, music keys, and scales from sound data such as music is known (see, for example, Patent Document 1).
The sound data is analyzed by sampling sound data of a certain length and analyzing the input waveform using, for example, FFT (Fast Fourier Transform).
Such analysis of sound data is also used as a technology related to BPM (Beats Per Minute) music in the field of DJ-related equipment.
By using such sound data analysis with DJ-related equipment, it is possible to connect songs with a sense of incongruity using the analyzed tempo, key, scale, etc., and provide a DJ with high performance Can do.

JP 2010-97084 A

However, since the FFT execution interval is conventionally fixed, there is a problem in that when the sound data for a long time is analyzed, the number of times the FFT is executed increases and the analysis takes time.

An object of the present invention is to provide a music analysis apparatus, a music analysis method, and a music analysis program capable of reducing the analysis time regardless of the time of sound data.

The music analysis apparatus according to the present invention is
Playback time detection means for detecting the playback time of the input sound data;
Execution interval setting means for setting an execution interval of FFT (Fast Fourier Transform) according to the reproduction time detected by the reproduction time detection means;
Sound data analysis means for executing FFT and analyzing input sound data based on the FFT execution interval set by the execution interval setting means;
It is characterized by having.

The music analysis apparatus according to the present invention is
Data length detecting means for detecting the data length of the input sound data;
Execution interval setting means for setting an FFT execution interval in accordance with the data length detected by the data length detection means;
Sound data analysis means for executing FFT and analyzing input sound data based on the FFT execution interval set by the execution interval setting means;
It is characterized by having.

The music analysis method according to the present invention includes:
The procedure to detect the playback time of the input sound data,
A procedure for setting the FFT execution interval according to the detected playback time;
A procedure for performing FFT and analyzing input sound data based on the set FFT execution interval;
It is characterized by performing.

The music analysis program according to the present invention is:
The procedure to detect the playback time of the input sound data,
A procedure for setting the FFT execution interval according to the detected playback time;
A procedure for performing FFT and analyzing input sound data based on the set FFT execution interval;
Is executed by a computer.

The block diagram which shows the music analysis apparatus which concerns on embodiment of this invention. The schematic diagram for demonstrating duplication of the sound data in the said embodiment. The schematic diagram for demonstrating the window function in the said embodiment. The schematic diagram for demonstrating the execution interval of FFT in the said embodiment. The schematic diagram for demonstrating the execution interval of the sound data with a long reproduction time in the said embodiment. The schematic diagram for demonstrating the execution space | interval of sound data with a short reproduction time in the said embodiment. The schematic diagram for demonstrating the key determination after FFT execution in the said embodiment. The schematic diagram for demonstrating the key determination after FFT execution in the said embodiment. The flowchart for demonstrating the music analysis method in the said embodiment.

Hereinafter, an embodiment of the present invention will be described.
FIG. 1 shows a music analysis apparatus 1 according to this embodiment. The music analysis device 1 analyzes digitized sound data SD such as input PCM data, determines a key of the sound data SD, and determines the sound data input on the display screen of a display device or the like. This is a device for displaying as a key display KD.
The music analysis apparatus 1 is configured as a software application that is activated by a general computer, a portable information terminal equipped with an OS (Operation System), and includes a reproduction time detection means 2, a sound data determination means 3, a sound data duplication means 4, and a sound. Data analysis means 5, execution interval setting means 6, and key determination means 7 are provided.

The playback time detection means 2 detects the playback time of the input sound data SD. Specifically, the reproduction time detection means 2 detects the reproduction time by counting the number of samplings from the beginning to the end of the sound data SD. When the reproduction time is detected, the reproduction time detection unit 2 outputs the detected reproduction time to the sound data determination unit 3 and the execution interval setting unit 6.

The sound data determination means 3 determines whether or not the input sound data SD has a predetermined reproduction time or more based on the reproduction time detected by the reproduction time detection means 2. Specifically, it is determined whether or not the sound data SD has a reproduction time that can be analyzed by sound data analysis means 5 described later.
Whether or not analysis is possible is determined by whether or not the reproduction time of the sound data SD has a time longer than the shortest time necessary for the window function applied to the sound data analysis means 5.

The time window length of the window function is determined by the sampling frequency of the sound data SD, the lower limit frequency to be detected, and the frequency resolution.
For example, in the case of a regular 4/4 time tune of BPM200, one beat is 300 msec and a sixteenth note is 75 msec. When performing FFT and analyzing the low-pitched sound data SD of 27.5 Hz corresponding to A0 of the scale, data of 1.2 s is required at the minimum.
When the sound data determination unit 3 determines that the sound data SD is less than the predetermined reproduction time, the sound data determination unit 3 outputs a determination result to that effect to the sound data duplication unit 4.

The sound data duplicating unit 4 duplicates the input sound data SD based on the determination result of the sound data determining unit 3, and continues the data so that the predetermined time or more is reached. Specifically, as shown in FIG. 2, the sound data duplicating unit 4 duplicates the input sound data SD of the reproduction time t1, and pastes the duplicate data CD behind the sound data SD so as to be continuous. , Continuous sound data SD ′ having a reproduction time t2 is generated.

For example, when the reproduction time t1 of the sound data SD is shorter than 1.2 seconds in the above-described example, the sound data duplicating unit 4 duplicates the duplicate data CD of the sound data SD, and the reproduction time t2 is 1.2 seconds or longer. The duplication is repeated until the sound data SD ′ is obtained.
It should be noted that the duplication only needs to be performed so that the reproduction time t2 can be analyzed by the sound data analysis means 5, and the number N of duplications may not be an integer.
The sound data replicating unit 4 outputs to the sound data analyzing unit 5 the sound data SD ′ that has been duplicated for a predetermined time or more.

The sound data analysis means 5 performs frequency spectrum analysis of the sound data SD and SD ′. In this embodiment, the analysis method is performed by analysis using FFT. However, the analysis method is not particularly limited, and for example, analysis using DCT (Discrete Cosine transform), analysis using a time axis, analysis using a signal level, volume feeling, and attack feeling may be used. it can.
As shown in FIG. 3, a Hamming window HMW that is a window function is usually applied to the FFT. The Hamming window HMW is applied to weaken the signal intensity amplification at both ends on the time axis during the FFT execution time and reduce the influence of the discontinuous portion of the sampled waveform seam during the FFT execution.

For this reason, since the signal intensity is too weak at both ends on the time axis of the data on which FFT has been executed, it cannot be used as analysis data.
Therefore, in the present embodiment, frequency spectrum analysis is performed using the signal intensity of the analysis period T0 during which the signal intensity is not weakened in the FFT execution time as the data that can be analyzed. Although the analysis period T0 can be arbitrarily determined, in this embodiment, the analysis period T0 is set to a 0.7 (70%) value of the Hamming window HMW.
In the present embodiment, the Hamming window HMW is applied. However, the present invention is not limited to this, and a Hanning window, a flap top window, or the like may be used as the window function.

The execution interval setting means 6 sets the FFT execution interval by the sound data analysis means 5 based on the reproduction time detected by the reproduction time detection means 2.
Specifically, as shown in FIG. 4, the execution interval setting means 6 sets the execution start time of the second FFT2 as the execution interval TI after the first execution of FFT1. Thereafter, in the present embodiment, the third FFT3 is started after time 2TI, and the FFTs are sequentially executed.

The execution interval TI is set according to the reproduction time of the sound data SD and SD ′.
For example, in the case of long sound data SD with a reproduction time of 30 seconds or more, the execution interval setting means 6 increases the execution interval TI as shown in FIG. 5, and in the case of short sound data SD with a reproduction time of less than 30 s, As shown in FIG. 6, the execution interval TI is reduced. The lower limit value of the execution interval TI is set to an execution interval TI such that the analysis periods T0 of FFT1, FFT2,.
The execution interval setting means 6 outputs the set execution interval TI to the sound data analysis means 5 described above.
The sound data analysis unit 5 repeatedly executes FFT based on the execution interval TI, and outputs the analysis result to the key determination unit 7 every time FFT is executed.

The key determination unit 7 performs key determination of the sound data SD and SD ′ based on the analysis result output from the sound data analysis unit 5.
Specifically, the key determination means 7 stores a total of 24 reference frequencies of a total of 24 kinds of scales, that is, a minor key and a major key having each sound as a root for each of 12 scales per octave.
The key determination means 7 adds the analysis results input at every execution interval TI in the time axis direction, and selects a reference frequency close to a frequency with a strong signal strength from the obtained sum, as shown in FIG. Next, the signal strength of each scale is obtained.
Next, as shown in FIG. 8, the key determination means 7 rearranges and normalizes the signals in the order of strong signal strength, selects several scales with strong signal strength, and selects the keys of the sound data SD, SD ′. Determine.
The key determination means 7 displays the key determination result of the sound data SD, SD ′ as a key display KD on the computer display or the screen of the portable terminal.

Next, key determination of the sound data SD by the music analysis apparatus 1 having the above-described configuration will be described based on the flowchart shown in FIG.
First, when a user of a computer or a portable terminal selects the music analysis device 1 on the screen, starts the program, and selects the sound data SD to be analyzed, the sound data SD is input to the music analysis device 1 ( Procedure S1).
When the sound data SD is input, the reproduction time detecting means 2 detects the reproduction time of the sound data SD (step S2).
The sound data determination means 3 determines whether or not the reproduction time of the sound data SD is equal to or longer than a predetermined time (step S3).

If it is determined that the reproduction time of the sound data SD is less than the predetermined time, the sound data duplicating means 4 duplicates the sound data SD (step S4) and pastes the duplicate data CD on the sound data SD to obtain continuous sound. Data SD ′ is generated.
When the reproduction time of the sound data SD from the beginning is equal to or longer than the predetermined time or when the reproduction time of the sound data SD ′ is equal to or longer than the predetermined time, the execution interval setting means 6 reproduces the sound data SD, SD ′. Is set to the execution interval TI of the sound data analysis means 5 (step S6).

The sound data analysis means 5 repeats FFT based on the set execution interval TI, and performs frequency spectrum analysis of the sound data SD and SD ′ (step S7).
The sound data analysis means 5 determines whether or not the sound data SD, SD ′ has ended (step S8). If it is determined that the sound data SD, SD ′ has ended, the analysis result is output to the key determination means. .
The key determination means 7 performs key determination on the sound data SD and SD ′ based on the analysis result (step S9).
The key determination means 7 outputs the sound data SD and SD ′ keys as determination results on the display screen of the computer or portable terminal (step S10).

According to this embodiment, there are the following effects.
Since the music analysis apparatus 1 includes the sound data duplicating means 4, even if the sound data SD is very short, the sound data SD ′ can be made longer than a predetermined time by duplication. Regardless of the time, the sound data analysis means 5 can perform FFT and perform frequency spectrum analysis to perform key determination of the sound data SD and SD ′.
As a result, a variety of sound data SD can be used in a DJ-related device regardless of the length of playback time of the sound data SD, so that a DJ with high performance can be provided.

Since the music analysis apparatus 1 includes the reproduction time detection unit 2 and the execution interval setting unit 6, the FFT execution interval TI by the sound data analysis unit 5 can be changed according to the reproduction time of the sound data SD. . Therefore, in the case of the sound data SD having a short reproduction time, the analysis interval of the sound data SD can be improved by shortening the execution interval TI and increasing the number of executions of FFT.
On the other hand, in the case of sound data SD having a long reproduction time, the analysis time can be shortened by increasing the execution interval TI of the execution interval TI and reducing the number of FFT executions during reproduction of the sound data SD. In the case of long sound data SD, the number of executions of FFT tends to be relatively small and the analysis tends to be rough. However, since the number of executions sufficient for key determination and the like can be obtained, satisfactory results can be obtained. Can be obtained.

In addition, this invention is not limited to the above-mentioned embodiment, The modification as shown below is also included.
In the embodiment described above, the music analysis apparatus 1 performs key determination of the sound data SD, but the present invention is not limited to this, and may be used as a music analysis apparatus that performs key and scale determination.
In the above-described embodiment, the execution interval setting unit 6 sets the execution interval TI based on the reproduction time of the sound data SD and SD ′. However, the present invention is not limited to this. The execution interval may be set based on the data length of the input sound data.
In addition, other configurations may be adopted as long as the object of the present invention can be achieved.

DESCRIPTION OF SYMBOLS 1 ... Music analysis apparatus, 2 ... Reproduction time detection means, 3 ... Sound data determination means, 4 ... Sound data duplication means, 5 ... Sound data analysis means, 6 ... Execution interval setting means, 7 ... Key judgment means, CD ... Duplication Data, HMW ... Hamming window, KD ... Key display, S1 ... Procedure, S2 ... Procedure, S3 ... Procedure, S4 ... Procedure, S6 ... Procedure, S7 ... Procedure, S8 ... Procedure, S9 ... Procedure, S10 ... Procedure, SD ... Sound data, T0 ... analysis period, t1 ... reproduction time, t2 ... reproduction time, TI ... execution interval

Claims

Playback time detection means for detecting the playback time of the input sound data;
Execution interval setting means for setting an execution interval of FFT (Fast Fourier Transform) according to the reproduction time detected by the reproduction time detection means;
Sound data analysis means for executing FFT and analyzing input sound data based on the FFT execution interval set by the execution interval setting means;
A music analysis apparatus comprising:
Data length detecting means for detecting the data length of the input sound data;
Execution interval setting means for setting an FFT execution interval in accordance with the data length detected by the data length detection means;
Sound data analysis means for executing FFT and analyzing input sound data based on the FFT execution interval set by the execution interval setting means;
A music analysis apparatus comprising:
In the music analysis device according to claim 1 or 2,
The execution interval setting means includes:
When the playback time or data length of the input sound data is longer than the predetermined time or the predetermined data length, the FFT execution interval is set longer than the normal FFT execution interval,
When the reproduction time or data length of the input sound data is shorter than the predetermined time or the predetermined data length, the FFT execution interval should be shorter than the normal FFT execution interval;
A music analysis device characterized by this.
In the music analysis device according to claim 3,
The sound data analyzing means analyzes data that can be analyzed as an execution result of the FFT,
The execution interval setting means includes:
An execution interval is set so that the FFT analyzable data is continuous on a time axis.
The procedure to detect the playback time of the input sound data,
A procedure for setting the FFT execution interval according to the detected playback time;
A procedure for performing FFT and analyzing input sound data based on the set FFT execution interval;
The music analysis method characterized by performing.
The procedure to detect the playback time of the input sound data,
A procedure for setting the FFT execution interval according to the detected playback time;
A procedure for performing FFT and analyzing input sound data based on the set FFT execution interval;
The music analysis program which makes a computer execute.