CN107180636B

CN107180636B - Audio data acquisition method based on Fourier transform of android system

Info

Publication number: CN107180636B
Application number: CN201710357361.5A
Authority: CN
Inventors: 鲁霖
Original assignee: Shenzhen Xinzhongxin Technology Co Ltd
Current assignee: Shenzhen Xinzhongxin Technology Co Ltd
Priority date: 2017-05-19
Filing date: 2017-05-19
Publication date: 2020-12-18
Anticipated expiration: 2037-05-19
Also published as: CN107180636A

Abstract

The invention relates to an audio data acquisition method based on Fourier transform of an Android system, which is characterized in that Android MediaPlayer APIs are used for instantiating a Visualizer to bind a music player; initializing a Visualizer; converting the continuous signal (waveform) intensity in time domain into discrete frequency domain signal (frequency spectrum); calculating each frequency, namely squaring the real part of the FFT and the corresponding imaginary part, adding the squared parts, and then squaring, namely squaring and modulus; calculating a weighted average value for the frequency domain data of each segment; the result of the calculation is compared with the spectrum of the file. The invention has the beneficial effects that: the method solves the problem that the information of the Android audio file obtained in the prior art is less; fourier transform and the weighted average are combined, and the calculation method is accurate and reliable, so that more audio data can be acquired on the Android platform.

Description

Audio data acquisition method based on Fourier transform of android system

Technical Field

The utility model relates to an tall and erect system multimedia technology field of ann, in particular to audio data acquisition method based on tall and erect system Fourier transform of ann.

Background

At present, the android system platform is increasingly rich in functions, but the native android system multimedia framework has limited support for capturing and encoding audio in multiple formats, and has few analysis interfaces for audio files, such as: there is no interface for extracting spectral, pitch, intensity, and timbre information in the audio signal. The diversified requirements of the users cannot be met.

Disclosure of Invention

The technical problem to be solved by the invention is as follows: the method for acquiring the audio data based on the Fourier transform of the android system solves the problem that the existing method for acquiring the audio file information of the android system is few in the related art, can effectively acquire parameters such as frequency spectrum, volume and the like of the audio file, is simple, has accurate results, and is a trend of wide application in industries such as acquiring the audio file of the android system at present.

In order to solve the technical problem, the invention provides an audio data acquisition method based on the Fourier transform of an android system, which is characterized in that a Visualizer is instantiated by using MediaPlayer APIs in the android system to bind a music player; converting the continuous signal intensity in the time domain into a discrete frequency domain signal; further, the continuous signal is a waveform signal; the discrete frequency domain signal is a frequency spectrum signal; further, square modulus: calculating each frequency, squaring the real part of FFT and the corresponding imaginary part, adding the squared parts, and squaring; calculating a weighted average value for the frequency domain data of each segment; comparing the calculated result with the frequency spectrum of the file;

further optimizing, the Android Fourier transform-based audio data acquisition method comprises the following working steps:

a. instantiating a Visualizer by using Android media Player APIs, and binding a music player;

b. initializing a Visualizer;

c. converting the continuous signal (waveform) intensity in time domain into discrete frequency domain signal (frequency spectrum);

d. calculating each frequency, namely squaring the real part of the FFT and the corresponding imaginary part, adding the squared parts, and then squaring, namely squaring and modulus;

e. calculating a weighted average value for the frequency domain data of each segment;

f. the result of the calculation is compared with the spectrum of the file.

Further optimized, the parameter SessionId is obtained through the object of MediaPlayer.

Further optimization, the calculated weighted average can reflect the actual frequency and volume. The error rate is adopted for evaluating the extraction of the audio data, the extracted beat number is compared with the actual beat number of the file, the time error is less than 50ms, and the error rate is not more than 5% through actual measurement of a plurality of pieces of music.

Error rate =

*100%

Further optimized, the butterfly transform twiddle factor of the fast Fourier transform algorithm is

The cosine term (real part) in the decomposed complex form is the same as the cosine term of the FFT algorithm, and the sign bit of the sine term (imaginary part) is just as same as the sine term of the FFT algorithmIn contrast, the algorithm implements a table of sine and cosine terms in FFT _ HANDLE.

Extracting N frequencies according to a frequency domain in a certain continuous interval to obtain N sampling points, wherein the inverse discrete Fourier transform formula of each sampling point is as follows:

after the technical scheme is adopted, the invention has the beneficial effects that:

compared with the prior art, the method and the device solve the problem that the information of the Android audio file obtained in the prior art is less; fourier transform and the weighted average are combined, and the calculation method is accurate and reliable, so that more audio data can be acquired on the Android platform.

Drawings

FIG. 1 is a flow chart of the audio data acquisition method based on the Fourier transform of the android system

Detailed Description

The invention will now be described in detail with reference to the accompanying figure 1, which shows an embodiment of the invention, but is not intended to limit the invention thereto.

As shown in fig. 1, in an audio data acquisition method based on the fourier transform of the android system, a Visualizer is instantiated by MediaPlayer APIs in the android system, and a music player is bound; converting the continuous signal intensity in the time domain into a discrete frequency domain signal; further, the continuous signal is a waveform signal; the discrete frequency domain signal is a frequency spectrum signal; further, square modulus: calculating each frequency, squaring the real part of FFT and the corresponding imaginary part, adding the squared parts, and squaring; calculating a weighted average value for the frequency domain data of each segment; comparing the calculated result with the frequency spectrum of the file;

the Android Fourier transform-based audio data acquisition method comprises the following working steps:

b. initializing a Visualizer;

f. the result of the calculation is compared with the spectrum of the file.

The implementation mode mainly comprises the following steps:

s101, instantiating a Visualizer by using Android media Player APIs to bind a music player;

in an embodiment of the invention, instantiate a Visualizer:

mVisualizer = new Visualizer(playBinder.getAudioSessionId());

the parameter SessionId can be obtained by the object of MediaPlayer.

S102, initializing a Visualizer;

in an embodiment of the invention, the length of the music content to be converted is then set, which in professional terms is a sample, which is typically an exponential multiple of 2, e.g. 64,128,256,512,1024. In analyzing signals with dense spectra, we require that the smaller the frequency resolution Δ f, the better, and that the sampling frequency fs must be kept large to contain all the frequency components that are destined for analysis. Meanwhile, the length of the sampling data can be increased without reducing the sampling rate while the resolution is improved. This means that the operation rate and analysis efficiency will be reduced. In the API interface document, it is said that array [0] is the minimum (128) and array [1] is the maximum (1024), where 128 is set by comparison:

mVisualizer.setCaptureSize(Visualizer.getCaptureSizeRange()[0])。

s103, converting the continuous signal (waveform) intensity in the time domain into discrete frequency domain signal (frequency spectrum);

in the embodiment of the present invention, a listener is set for the revitalizer, and there are 4 parameters in total, the first is the listener, the second unit is mhz, which indicates the acquisition frequency, the third is whether to acquire the waveform, and the fourth is whether to acquire the frequency:

mVisualizer.setDataCaptureListener(mVisualizerListener, Visualizer.getMaxCaptureRate() / 2, false, true);

the mVisualizerListener table listens to the function, and the anonymous internal class implements the interface, which needs to implement two functions.

mVisualizer.setEnabled(true)

After the above parameter setting is completed, the audio data analysis function is started.

S104, calculating each frequency, namely squaring the real part of the FFT and the corresponding imaginary part, adding the squared real part and the corresponding imaginary part, and then squaring, namely squaring and modulus;

the butterfly transform twiddle factor of the inverse fast Fourier transform algorithm is

The cosine term (real part) in the decomposed complex form is the same as the cosine term of the FFT algorithm, and the sign bit of the sine term (imaginary part) is just opposite to the sine term of the FFT algorithm, so the algorithm implementation can still use the sine term and cosine term table in FFT _ HANDLE.

Now, assuming that N frequencies are extracted according to the frequency domain in a certain continuous interval to obtain N sampling points, the inverse discrete fourier transform formula of each sampling point is:

in an embodiment of the invention, data related to fast fourier transform is collected:

public void onFftDataCapture(Visualizer visualizer,byte[] fft, int samplingRate)

the real part and the corresponding imaginary part are squared and then added and then squared:

byte[] model = new byte[fft.length / 2 + 1];

model[0] = (byte) Math.abs(fft[1]);

int j = 1;

for (int i = 2; i < fft.length; ) {

model[j] = (byte) Math.hypot(fft[i], fft[i + 1]);

i += 2;

j++;

}

s105, calculating a weighted average value for the frequency domain data of each section;

in the embodiment of the present invention, in order to avoid the frequency roll-off of the high frequency part, the frequency domain data needs to be pre-weighted to smooth the audio data. And finding a central frequency point in each band, uniformly taking two frequency points from the central frequency point to the left and the right respectively, and adding the central frequency point to acquire values of 19 frequency points for calculation. The calculation method is to give different weights to the 19 points, the weight of the middle point is the highest, and the weights are sequentially reduced towards two sides, and then the weighted average of the 19 points is calculated.

In the embodiment of the present invention, the weights assigned to these 19 points are: the central point 1, two points adjacent to the central point are 0.9, and so on, the distance from the central point is different by one point, the value of the weight is different by 0.1, and the two points calculated to the outermost edge are 0.1.

S106, comparing the calculated result with the frequency spectrum of the file.

The weighted average thus calculated is more reflective of the actual frequency and volume magnitudes. The extraction of the audio data is evaluated by adopting an error rate, for example, the extracted beat number is compared with the actual file beat number (the time error is less than 50 ms), and the error rate is actually measured on a plurality of pieces of music and is not more than 5%.

Error rate =

*100%

Converting the audio signal in the time domain into the frequency domain for analysis: acquiring frequency characteristics of a dialing audio; acquiring the volume of an audio file; and performing gain or attenuation calculation on the specific frequency, and finally converting the result into a time domain, thereby realizing the modification of the tone color of the audio.

In the embodiment of the present invention, the weighted average value calculated in this way is more reflective of the actual frequency and volume level.

It will be appreciated by those skilled in the art that the present invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The embodiments disclosed above are therefore to be considered in all respects as illustrative and not restrictive. All changes which come within the scope of or equivalence to the invention are intended to be embraced therein.

Claims

1. An audio data acquisition method based on Fourier transform of an android system is characterized in that: instantiating a Visualizer by using MediaPlayer APIs in the android system, and binding a music player; converting continuous signal intensity in a time domain into discrete frequency domain signals, setting a monitor for a visualizer, wherein the monitor has 4 parameters, the first parameter is a monitor, the second unit is millihertz and represents the acquisition frequency, the third parameter is whether to acquire a waveform, and the fourth parameter is whether to acquire the frequency; further, the continuous signal is a waveform signal; the discrete frequency domain signal is a frequency spectrum signal; further, square modulus: calculating each frequency, squaring the real part of FFT and the corresponding imaginary part, adding the squared parts, and squaring; calculating a weighted average value of the frequency domain data of each section, performing pre-weighting processing on the frequency domain data to smooth the audio data in order to avoid frequency range dropping of a high-frequency part, finding a central frequency point in each band, uniformly and respectively taking two frequency points from the central frequency point to the left and the right, adding the central frequency point to acquire values of 19 frequency points for calculation, wherein the calculation method is that the 19 points are endowed with different weights, the weight of the middle point is the highest, the weights are sequentially reduced towards the two sides, and then calculating the weighted average value of the 19 points; the result of the calculation is compared with the spectrum of the file.

2. The android system Fourier transform-based audio data acquisition method of claim 1, wherein: the Android Fourier transform-based audio data acquisition method comprises the following working steps:

b. initializing a Visualizer;

f. the result of the calculation is compared with the spectrum of the file.

3. Audio data acquisition method based on android system Fourier transform is characterized in that: the parameter SessionId is obtained by the object of MediaPlayer.

4. The android system Fourier transform-based audio data acquisition method according to claim 2, wherein: the calculated weighted average value can reflect the actual frequency and volume; the error rate is adopted for evaluating the extraction of the audio data, the extracted beat number is compared with the actual beat number of the file, the time error is less than 50ms, and the error rate is not more than 5% by actually measuring a plurality of pieces of music;

5. the android system Fourier transform-based audio data acquisition method according to claim 2, wherein: the butterfly transform twiddle factor of the inverse fast Fourier transform algorithm is

The cosine term (real part) in the decomposed complex form is the same as the cosine term of the FFT algorithm, and the sign of the sine term (imaginary part)The bit is just opposite to the sine item of the FFT algorithm, and the algorithm is realized by using a sine item and cosine item table in FFT _ HANDLE;