CN104036788B - The acoustic fidelity identification method of audio file and device - Google Patents

The acoustic fidelity identification method of audio file and device Download PDF

Info

Publication number
CN104036788B
CN104036788B CN201410235733.3A CN201410235733A CN104036788B CN 104036788 B CN104036788 B CN 104036788B CN 201410235733 A CN201410235733 A CN 201410235733A CN 104036788 B CN104036788 B CN 104036788B
Authority
CN
China
Prior art keywords
audio data
target
channel
audio file
sound
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410235733.3A
Other languages
Chinese (zh)
Other versions
CN104036788A (en
Inventor
田彪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Taile Culture Technology Co ltd
Original Assignee
Beijing Yinzhibang Culture Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Yinzhibang Culture Technology Co Ltd filed Critical Beijing Yinzhibang Culture Technology Co Ltd
Priority to CN201410235733.3A priority Critical patent/CN104036788B/en
Publication of CN104036788A publication Critical patent/CN104036788A/en
Application granted granted Critical
Publication of CN104036788B publication Critical patent/CN104036788B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Stereophonic System (AREA)

Abstract

The present invention provides acoustic fidelity identification method and the device of a kind of audio file.The embodiment of the present invention is by obtaining target audio file to be identified, and then according to described target audio file, obtain at least one in the time domain waveform feature of described target audio file and the frequency domain spectral line characteristic of described target audio file, make it possible to according at least one in described time domain waveform feature and described frequency domain spectral line characteristic, the tonequality identifying described target audio file is the first tonequality or the second tonequality, described first tonequality is higher than described second tonequality, so, the audio file of real high tone quality can be provided a user with, allow users to appreciate the audio file of real high tone quality.

Description

Sound quality identification method and device for audio file
[ technical field ] A method for producing a semiconductor device
The present invention relates to audio processing technologies, and in particular, to a method and an apparatus for identifying sound quality of an audio file.
[ background of the invention ]
The sound quality of an audio file refers to the fidelity of original audio data after compression processing. The audio file with high tone quality can completely recover the original audio data without causing any distortion; and the audio file with low tone quality can not completely restore the original audio data, thereby causing partial distortion. Currently, some conversion techniques have emerged to convert audio files of low sound quality into pseudo high sound quality audio files. In fact, the sound quality of such a pseudo high-quality audio file is the same as that of the audio file before conversion, and does not belong to the true high-quality. After obtaining the pseudo high-quality audio files through some music applications, the user cannot enjoy the real high-quality audio files at all, which may affect the brand images of the music applications and even cause legal disputes.
Therefore, it is an urgent problem to provide a user with a real audio file with high quality so that the user can enjoy the real audio file with high quality and effectively recognize the quality of the audio file.
[ summary of the invention ]
Aspects of the present invention provide a method and an apparatus for recognizing sound quality of an audio file, so as to realize sound quality recognition of the audio file.
In one aspect of the present invention, a method for identifying a sound quality of an audio file is provided, including:
acquiring a target audio file to be identified;
according to the target audio file, at least one of a time domain waveform characteristic of the target audio file and a frequency domain spectral line characteristic of the target audio file is obtained;
according to the time domain waveform characteristics and at least one of the frequency domain spectral line characteristics, identifying that the tone quality of the target audio file is a first tone quality or a second tone quality, wherein the first tone quality is higher than the second tone quality.
The above aspect and any possible implementation manner further provide an implementation manner, where the obtaining, according to the target audio file, at least one of a time-domain waveform characteristic of the target audio file and a frequency-domain spectral line characteristic of the target audio file includes:
determining the number of channels of the target audio file;
decoding the data blocks of the target audio file to obtain original audio data;
and obtaining the audio data of the sound channel corresponding to each sound channel according to the number of the sound channels and the original audio data.
The above-mentioned aspect and any possible implementation manner further provide an implementation manner, where the identifying, according to at least one of the time-domain waveform feature and the frequency-domain spectral line feature, the sound quality of the target audio file as a first sound quality or a second sound quality includes:
if the number of the sound channels is larger than or equal to 2, obtaining first sound channel audio data and second sound channel audio data corresponding to at least two sound channels according to the sound channel audio data corresponding to each sound channel;
adding the first channel audio data and the second channel audio data to obtain mixed channel audio data;
if the mixed channel audio data is greater than or equal to the first channel audio data/N or the second channel audio data/M, identifying the tone quality of the target audio file as the first tone quality;
if the mixed channel audio data is smaller than the first channel audio data/N or the second channel audio data/M, identifying the tone quality of the target audio file as the second tone quality; wherein,
n is a number greater than 1; m is a number greater than 1.
The above-mentioned aspect and any possible implementation manner further provide an implementation manner, where the identifying, according to at least one of the time-domain waveform feature and the frequency-domain spectral line feature, the sound quality of the target audio file as a first sound quality or a second sound quality includes:
if the difference value between every two of the values of the target sound channel audio data with the continuously specified number is smaller than or equal to a first amplitude threshold value, identifying the sound quality of the target audio file as the second sound quality, wherein the target sound channel audio data comprises sound channel audio data corresponding to any sound channel in the sound channel audio data corresponding to each sound channel; or
And if the difference value of the values of the two continuous target channel audio data is greater than or equal to a second amplitude threshold value and the signs of the values of the two continuous target channel audio data are opposite, identifying that the tone quality of the target audio file is the second tone quality, wherein the target channel audio data comprises channel audio data corresponding to any channel in the channel audio data corresponding to each channel.
The above-mentioned aspect and any possible implementation manner further provide an implementation manner, where after obtaining the channel audio data corresponding to each channel according to the number of channels and the original audio data, the method further includes:
performing frame processing on target channel audio data to obtain at least one frame of audio data, wherein the target channel audio data comprises channel audio data corresponding to any channel in the channel audio data corresponding to each channel;
and performing frequency domain transformation processing on the at least one frame of audio data to obtain frequency domain data corresponding to each frame of audio data.
The above-mentioned aspect and any possible implementation manner further provide an implementation manner, where the identifying, according to at least one of the time-domain waveform feature and the frequency-domain spectral line feature, the sound quality of the target audio file as a first sound quality or a second sound quality includes:
according to the frequency domain data corresponding to each frame of audio data, obtaining the energy component of the frequency domain data corresponding to each frame of audio data at each frequency point;
and if the difference value between every two frequency domain data corresponding to each frame of audio data in the energy components at least one same frequency point is smaller than or equal to the energy threshold value, identifying the tone quality of the target audio file as the second tone quality.
The above-described aspect and any possible implementation manner further provide an implementation manner, before the obtaining of the target audio file to be identified, further including:
acquiring format parameters of the candidate audio files;
determining the candidate audio file as the target audio file according to the format parameter; or identifying the tone quality of the candidate audio file as the second tone quality.
The above aspect and any possible implementation further provide an implementation, wherein the format parameter includes at least one of a compression format, a sampling rate, a sampling depth, and a code rate.
In another aspect of the present invention, there is provided an apparatus for recognizing a sound quality of an audio file, including:
the acquisition unit is used for acquiring a target audio file to be identified;
the characteristic unit is used for acquiring at least one of the time domain waveform characteristic of the target audio file and the frequency domain spectral line characteristic of the target audio file according to the target audio file;
the identification unit is used for identifying the tone quality of the target audio file as first tone quality or second tone quality according to at least one of the time domain waveform characteristics and the frequency domain spectral line characteristics, and the first tone quality is higher than the second tone quality.
The above aspects and any possible implementations further provide an implementation, the feature unit being specifically configured to
Determining the number of channels of the target audio file;
decoding the data blocks of the target audio file to obtain original audio data; and
and obtaining the audio data of the sound channel corresponding to each sound channel according to the number of the sound channels and the original audio data.
The above-mentioned aspects and any possible implementation further provide an implementation in which the identification unit is specifically configured to
If the number of the sound channels is larger than or equal to 2, obtaining first sound channel audio data and second sound channel audio data corresponding to at least two sound channels according to the sound channel audio data corresponding to each sound channel;
adding the first channel audio data and the second channel audio data to obtain mixed channel audio data; and
if the mixed channel audio data is greater than or equal to the first channel audio data/N or the second channel audio data/M, identifying the tone quality of the target audio file as the first tone quality;
if the mixed channel audio data is smaller than the first channel audio data/N or the second channel audio data/M, identifying the tone quality of the target audio file as the second tone quality; wherein,
n is a number greater than 1; m is a number greater than 1.
The above-mentioned aspects and any possible implementation further provide an implementation in which the identification unit is specifically configured to
If the difference value between every two of the values of the target sound channel audio data with the continuously specified number is smaller than or equal to a first amplitude threshold value, identifying the sound quality of the target audio file as the second sound quality, wherein the target sound channel audio data comprises sound channel audio data corresponding to any sound channel in the sound channel audio data corresponding to each sound channel; or
And if the difference value of the values of the two continuous target channel audio data is greater than or equal to a second amplitude threshold value and the signs of the values of the two continuous target channel audio data are opposite, identifying that the tone quality of the target audio file is the second tone quality, wherein the target channel audio data comprises channel audio data corresponding to any channel in the channel audio data corresponding to each channel.
The above-mentioned aspects and any possible implementation further provide an implementation, and the feature unit is further configured to
Performing frame processing on target channel audio data to obtain at least one frame of audio data, wherein the target channel audio data comprises channel audio data corresponding to any channel in the channel audio data corresponding to each channel; and
and performing frequency domain transformation processing on the at least one frame of audio data to obtain frequency domain data corresponding to each frame of audio data.
The above-mentioned aspects and any possible implementation further provide an implementation in which the identification unit is specifically configured to
According to the frequency domain data corresponding to each frame of audio data, obtaining the energy component of the frequency domain data corresponding to each frame of audio data at each frequency point; and
and if the difference value between every two frequency domain data corresponding to each frame of audio data in the energy components at least one same frequency point is smaller than or equal to the energy threshold value, identifying the tone quality of the target audio file as the second tone quality.
The above-mentioned aspect and any possible implementation manner further provide an implementation manner, where the identification unit is further configured to
Acquiring format parameters of the candidate audio files; and
determining the candidate audio file as the target audio file according to the format parameter; or identifying the tone quality of the candidate audio file as the second tone quality.
The above aspect and any possible implementation further provide an implementation, wherein the format parameter includes at least one of a compression format, a sampling rate, a sampling depth, and a code rate.
According to the technical scheme, at least one of the time domain waveform characteristic of the target audio file and the frequency domain spectral line characteristic of the target audio file is obtained by obtaining the target audio file to be identified according to the target audio file, so that the tone quality of the target audio file can be identified to be the first tone quality or the second tone quality according to the at least one of the time domain waveform characteristic and the frequency domain spectral line characteristic, and the first tone quality is higher than the second tone quality.
In addition, the technical scheme provided by the invention is simple to operate, and can effectively improve the efficiency of tone quality identification of the audio file.
[ description of the drawings ]
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed in the embodiments or the prior art descriptions will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and those skilled in the art can also obtain other drawings according to the drawings without inventive labor.
Fig. 1 is a flowchart illustrating a method for recognizing a sound quality of an audio file according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a time-domain waveform of original audio data, i.e., target channel audio data in the embodiment corresponding to FIG. 1;
FIG. 3 is a schematic diagram of another time-domain waveform of the original audio data, i.e., the target channel audio data in the embodiment corresponding to FIG. 1;
FIG. 4 is a schematic energy spectrum diagram of frequency domain data corresponding to the starting audio data, i.e., the target channel audio data in the embodiment corresponding to FIG. 1;
fig. 5 is a schematic structural diagram of an apparatus for recognizing sound quality of an audio file according to another embodiment of the present invention.
[ detailed description ] embodiments
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, are within the scope of the present invention.
It should be noted that the terminal according to the embodiment of the present invention may include, but is not limited to, a mobile phone, a Personal Digital Assistant (PDA), a wireless handheld device, a wireless netbook, a portable Computer, a Personal Computer (PC), an MP3 player, an MP4 player, and the like.
In addition, the term "and/or" herein is only one kind of association relationship describing an associated object, and means that there may be three kinds of relationships, for example, a and/or B, which may mean: a exists alone, A and B exist simultaneously, and B exists alone. In addition, the character "/" herein generally indicates that the former and latter related objects are in an "or" relationship.
Fig. 1 is a flowchart illustrating a method for recognizing a sound quality of an audio file according to an embodiment of the present invention, as shown in fig. 1.
101. And acquiring a target audio file to be identified.
The target Audio file may include Audio files in various encoding formats in the prior art, such as a Moving Picture Experts Group (MPEG) layer 3(MPEG layer-3, MP3) format Audio file, a wma (windows Media Audio) format Audio file, an Advanced Audio Coding (AAC) format Audio file, a Lossless Audio compression Coding (FLAC) or APE format Audio file, and the like, which is not particularly limited in this embodiment.
102. And acquiring at least one of the time domain waveform characteristic of the target audio file and the frequency domain spectral line characteristic of the target audio file according to the target audio file.
The time-domain waveform characteristics of the target audio file may include, but are not limited to, amplitude information of the original audio data.
The original audio data is a digital signal converted from an audio signal, and for example, the audio signal is sampled, quantized and encoded to obtain Pulse Code Modulation (PCM) data, which can be obtained by parsing a data block of a target audio file.
The frequency-domain spectral line characteristics of the target audio file may include, but are not limited to, spectral information of the original audio data.
103. According to the time domain waveform characteristics and at least one of the frequency domain spectral line characteristics, identifying that the tone quality of the target audio file is a first tone quality or a second tone quality, wherein the first tone quality is higher than the second tone quality.
The execution main bodies 101 to 103 may be processing devices, and may be located in a local Application (App), for example, hundredth music, or may also be located in a server on a network side, or may also be partially located in the local Application, and another portion is located in the server on the network side.
It should be understood that the application may be an application installed on the terminal (native app), or may also be a web page of a browser on the terminal (webAPP), as long as an objective existence form of processing of the audio data can be implemented, which is not limited in this embodiment.
Like this, through acquireing the target audio file of treating discernment, and then according to the target audio file obtains the time domain waveform characteristics of target audio file with at least one item in the frequency domain spectral line characteristic of target audio file makes can be according to time domain waveform characteristics with at least one item in the frequency domain spectral line characteristic is discerned the tone quality of target audio file is first tone quality or second tone quality, first tone quality is higher than second tone quality, like this, just can provide the audio file of real high tone quality to the user, makes the user can appreciate the audio file of real high tone quality.
Optionally, in a possible implementation manner of this embodiment, before 101, the processing device may further obtain format parameters of the candidate audio file. Then, the processing device may determine the candidate audio file as the target audio file according to the format parameter; or identifying the tone quality of the candidate audio file as the second tone quality.
Wherein the format parameter may include, but is not limited to, at least one of a compression format, a sampling rate, a sampling depth, and a code rate.
The compression format is a compression method in which original audio data is compressed by a program, such as MP3 format, WMA format, AAC format, FLAC format, APE format, or the like.
The sampling rate, also referred to as sampling speed or sampling frequency, defines the number of samples per second that are extracted from a continuous signal and constitute a discrete signal, which is expressed in hertz (Hz).
The sampling depth, which means that the value of a sample point is represented by a few bits, determines the number of bits of the value of each sample point, for example, 8 bits (bit), 16 bits or 24 bits, etc.
The code rate is the number of bits processed per unit time, and the unit is bits per second (bps).
Specifically, the processing device may specifically parse a frame header of the candidate audio file to obtain a format parameter of the candidate audio file.
For example, if the sampling depth is 8 bits, the tone quality of the candidate audio file is identified as the second tone quality; and if the sampling depth is 16 bits, determining the candidate audio file as the target audio file.
Or, for another example, if the sampling rate is less than 44100Hz, identifying the tone quality of the candidate audio file as the second tone quality; and if the sampling rate is greater than or equal to 44100Hz, determining the candidate audio file as the target audio file.
Or, for another example, the compression format is MP3, and the bitrate is less than 320 kilobits per second (kbps), identifying the timbre of the candidate audio file as the second timbre; the compression format is MP3, and the code rate is greater than or equal to 320kbps, and the candidate audio file is determined as the target audio file.
Therefore, by acquiring the format parameters of the candidate audio file, the tone quality of the candidate audio file can be identified as the second tone quality in advance according to the format parameters, so that the candidate audio file does not need to be used as a target audio file for further identification, and the efficiency of tone quality identification of the audio file can be effectively improved.
In addition, since the candidate audio file does not need to be decoded, and the format parameters of the candidate audio file can be obtained only by analyzing the frame header, the efficiency of sound quality identification of the audio file can be further improved.
Optionally, in a possible implementation manner of this embodiment, in 102, the processing device may specifically determine the number of channels of the target audio file, and decode the data blocks of the target audio file to obtain the original audio data. Then, the processing device may obtain channel audio data corresponding to each channel according to the number of channels and the original audio data. For a detailed description of the parsing method and the decoding method, reference may be made to related contents in the prior art, and details are not repeated here.
For example, the processing device may specifically parse a frame header of the target audio file to determine the number of channels of the target audio file.
Or for another example, the processing device specifically parses the file header of the target audio file to determine the number of channels of the target audio file.
For another example, the processing device may further parse other portions of the target audio file to determine the number of channels of the target audio file, which is not particularly limited in this embodiment.
Or for another example, the processing device may specifically obtain the number of channels of the target audio file from the configuration file.
It is to be understood that, without a fixed order, the two steps of "determining the number of channels of the target audio file" and "decoding the data blocks of the target audio file to obtain the original audio data" may be performed first, and then the step of "decoding the data blocks of the target audio file to obtain the original audio data" may be performed first, or the step of "decoding the data blocks of the target audio file to obtain the original audio data" may be performed first, and then the step of "determining the number of channels of the target audio file" may be performed, or both of these steps may be performed simultaneously, which is not particularly limited in this embodiment.
Accordingly, in a possible implementation manner of this embodiment, in 103, if the number of channels is greater than or equal to 2, the processing device may obtain, according to the channel audio data corresponding to each channel, first channel audio data and second channel audio data corresponding to at least two channels, and further add the first channel audio data and the second channel audio data to obtain mixed channel audio data.
If the mixed channel audio data is greater than or equal to the first channel audio data/N or the second channel audio data/M, the processing device may identify the sound quality of the target audio file as the first sound quality. Wherein N is a number greater than 1; m is a number greater than 1.
If the mixed channel audio data is smaller than the first channel audio data/N or the second channel audio data/M, the processing device may identify the sound quality of the target audio file as the second sound quality; wherein N is a number greater than 1; m is a number greater than 1.
Accordingly, in a possible implementation manner of this embodiment, in 103, if the difference between two of the values of the successively specified number (e.g. 3) of target channel audio data is smaller than or equal to the first amplitude threshold, and the corresponding waveform in this case may be as shown in fig. 2, then the processing device may identify the sound quality of the target audio file as the second sound quality. The target channel audio data may be channel audio data corresponding to any one channel, which is not particularly limited in this embodiment. In fig. 2, the abscissa represents time and the ordinate represents amplitude.
Accordingly, in a possible implementation manner of this embodiment, in 103, if the difference between the values of the two consecutive target channel audio data is greater than or equal to the second amplitude threshold and the signs of the values of the two consecutive target channel audio data are opposite, and the corresponding waveforms may be as shown in fig. 3, then the processing device may identify that the sound quality of the target audio file is the second sound quality. The target channel audio data may be channel audio data corresponding to any one channel, which is not particularly limited in this embodiment. In fig. 3, the abscissa represents time, and the ordinate represents amplitude.
Optionally, in a possible implementation manner of this embodiment, in 102, after obtaining the channel audio data corresponding to each channel, the processing device may further perform frame division processing on target channel audio data to obtain at least one frame of audio data, where the target channel audio data includes channel audio data corresponding to any channel in the channel audio data corresponding to each channel. Then, the processing device may perform frequency domain transform processing on the at least one frame of audio data to obtain frequency domain data corresponding to each frame of audio data. The target channel audio data may be channel audio data corresponding to any one channel, which is not particularly limited in this embodiment.
In particular, the frequency domain Transform process may include, but is not limited to, a Fast Fourier Transform (FFT).
For example, the processing device may perform framing processing on the target channel audio data at intervals of 20ms with 50% of data overlap between adjacent frames to obtain at least one frame of audio data. Then, the processing device may perform FFT processing on the at least one frame of audio data to obtain frequency domain data, denoted as a, corresponding to each frame of audio datai,j(ii) a Wherein i represents the number of frequency points, j represents the number of frames, Ai,jRepresenting the frequency domain data of the jth frame at the ith frequency point.
Accordingly, in a possible implementation manner of this embodiment, in 103, the processing device may specifically obtain, according to the frequency domain data corresponding to each frame of audio data, an energy component at each frequency point of the frequency domain data corresponding to each frame of audio data. If the difference between every two of the energy components of the frequency domain data corresponding to each frame of audio data at least one same frequency point is less than or equal to the energy threshold, and the energy spectrum corresponding to this situation may be as shown in fig. 4, then the processing device may identify the sound quality of the target audio file as the second sound quality. In fig. 4, the abscissa represents time, the ordinate represents frequency, and the color of each point represents energy.
For example, the frequency domain data corresponding to each frame of audio data obtained by the processing device is recorded as Ai,jObtaining the energy component E of the frequency domain data corresponding to each frame of audio data at each frequency pointi,j(ii) a Wherein i represents the number of frequency points, j represents the number of frames, Ei,jRepresenting the energy component of the jth frame at the ith frequency point.
In this embodiment, through obtaining the target audio file of treating discernment, and then according to the target audio file, obtain the time domain waveform characteristic of target audio file with at least one item in the frequency domain spectral line characteristic of target audio file makes can be according to time domain waveform characteristic with at least one item in the frequency domain spectral line characteristic is discerned the tone quality of target audio file is first tone quality or second tone quality, first tone quality is higher than second tone quality, like this, just can provide the audio file of real high tone quality to the user, makes the user can appreciate the audio file of real high tone quality.
In addition, the technical scheme provided by the invention is simple to operate, and can effectively improve the efficiency of tone quality identification of the audio file.
It should be noted that, for simplicity of description, the above-mentioned method embodiments are described as a series of acts or combination of acts, but those skilled in the art will recognize that the present invention is not limited by the order of acts, as some steps may occur in other orders or concurrently in accordance with the invention. Further, those skilled in the art should also appreciate that the embodiments described in the specification are preferred embodiments and that the acts and modules referred to are not necessarily required by the invention.
In the foregoing embodiments, the descriptions of the respective embodiments have respective emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.
Fig. 5 is a schematic structural diagram of a sound quality recognition apparatus for audio files according to another embodiment of the present invention, as shown in fig. 5. The sound quality recognition apparatus of an audio file of the present embodiment may include an acquisition unit 51, a feature unit 52, and a recognition unit 53. Wherein,
an obtaining unit 51, configured to obtain a target audio file to be identified.
The target Audio file may include Audio files in various encoding formats in the prior art, such as a Moving Picture Experts Group (MPEG) layer 3(MPEG layer-3, MP3) format Audio file, a wma (windows Media Audio) format Audio file, an Advanced Audio Coding (AAC) format Audio file, a Lossless Audio compression Coding (FLAC) or APE format Audio file, and the like, which is not particularly limited in this embodiment.
A feature unit 52, configured to obtain at least one of a time-domain waveform feature of the target audio file and a frequency-domain spectral line feature of the target audio file according to the target audio file.
The time-domain waveform characteristics of the target audio file may include, but are not limited to, amplitude information of the original audio data.
The original audio data is a digital signal converted from an audio signal, and for example, the audio signal is sampled, quantized and encoded to obtain Pulse Code Modulation (PCM) data, which can be obtained by parsing a data block of a target audio file.
The frequency-domain spectral line characteristics of the target audio file may include, but are not limited to, spectral information of the original audio data.
And the identifying unit 53 is used for identifying that the tone quality of the target audio file is a first tone quality or a second tone quality according to at least one of the time domain waveform characteristics and the frequency domain spectral line characteristics, wherein the first tone quality is higher than the second tone quality.
It should be noted that the sound quality recognition device for the audio file provided in this embodiment may be a processing device, may be located in a local Application (App), for example, hundredth music, or may also be located in a server on the network side, or may also be located in a part of the local Application, and another part is located in the server on the network side.
It should be understood that the application may be an application installed on the terminal (native app), or may also be a web page of a browser on the terminal (webAPP), as long as an objective existence form of processing of the audio data can be implemented, which is not limited in this embodiment.
Like this, acquire the target audio file of treating discernment through the acquisition element, and then by the characteristic cell basis the target audio file obtains the time domain waveform characteristics of target audio file with at least one item in the frequency domain spectral line characteristic of target audio file for the identification element can be according to time domain waveform characteristics with at least one item in the frequency domain spectral line characteristic is discerned the tone quality of target audio file is first tone quality or second tone quality, first tone quality is higher than second tone quality, like this, just can provide the audio file of real high tone quality to the user, makes the user can appreciate the audio file of real high tone quality.
Optionally, in a possible implementation manner of this embodiment, the identifying unit may be further configured to obtain a format parameter of the candidate audio file; determining the candidate audio file as the target audio file according to the format parameter; or identifying the tone quality of the candidate audio file as the second tone quality.
Wherein the format parameter may include, but is not limited to, at least one of a compression format, a sampling rate, a sampling depth, and a code rate.
The compression format is a compression method in which original audio data is compressed by a program, such as MP3 format, WMA format, AAC format, FLAC format, APE format, or the like.
The sampling rate, also referred to as sampling speed or sampling frequency, defines the number of samples per second that are extracted from a continuous signal and constitute a discrete signal, which is expressed in hertz (Hz).
The sampling depth, which means that the value of a sample point is represented by a few bits, determines the number of bits of the value of each sample point, for example, 8 bits (bit), 16 bits or 24 bits, etc.
The code rate is the number of bits processed per unit time, and the unit is bits per second (bps).
Specifically, the identifying unit 53 may specifically parse a frame header of the candidate audio file to obtain a format parameter of the candidate audio file.
For example, if the sampling depth is 8 bits, the tone quality of the candidate audio file is identified as the second tone quality; and if the sampling depth is 16 bits, determining the candidate audio file as the target audio file.
Or, for another example, if the sampling rate is less than 44100Hz, identifying the tone quality of the candidate audio file as the second tone quality; and if the sampling rate is greater than or equal to 44100Hz, determining the candidate audio file as the target audio file.
Or, for another example, the compression format is MP3, and the bitrate is less than 320 kilobits per second (kbps), identifying the timbre of the candidate audio file as the second timbre; the compression format is MP3, and the code rate is greater than or equal to 320kbps, and the candidate audio file is determined as the target audio file.
Therefore, the format parameters of the candidate audio files are obtained through the identification unit, and then the tone quality of the candidate audio files can be identified as the second tone quality in advance according to the format parameters, so that the candidate audio files do not need to be used as target audio files for further identification, and the efficiency of tone quality identification of the audio files can be effectively improved.
In addition, since the candidate audio file does not need to be decoded, and the format parameters of the candidate audio file can be obtained only by analyzing the frame header, the efficiency of sound quality identification of the audio file can be further improved.
Optionally, in a possible implementation manner of this embodiment, the feature unit 52 may be specifically configured to determine the number of channels of the target audio file; decoding the data blocks of the target audio file to obtain original audio data; and obtaining the audio data of the sound channel corresponding to each sound channel according to the number of the sound channels and the original audio data. For a detailed description of the parsing method and the decoding method, reference may be made to related contents in the prior art, and details are not repeated here.
For example, the feature unit 52 may specifically parse a frame header of the target audio file to determine the number of channels of the target audio file.
Or for another example, the feature unit 52 specifically parses the file header of the target audio file to determine the number of channels of the target audio file.
For another example, the feature unit 52 may further parse other portions of the target audio file to determine the number of channels of the target audio file, which is not particularly limited in this embodiment.
Or for another example, the feature unit 52 may specifically obtain the number of channels of the target audio file from a configuration file.
Accordingly, in a possible implementation manner of this embodiment, the identifying unit 53 may be specifically configured to, if the number of channels is greater than or equal to 2, obtain, according to the channel audio data corresponding to each channel, first channel audio data and second channel audio data corresponding to at least two channels; adding the first channel audio data and the second channel audio data to obtain mixed channel audio data; if the mixed channel audio data is greater than or equal to the first channel audio data/N or the second channel audio data/M, identifying the tone quality of the target audio file as the first tone quality; if the mixed channel audio data is smaller than the first channel audio data/N or the second channel audio data/M, identifying the tone quality of the target audio file as the second tone quality; wherein N is a number greater than 1; m is a number greater than 1.
Accordingly, in a possible implementation manner of this embodiment, the identifying unit 53 may be specifically configured to identify the sound quality of the target audio file as the second sound quality if a difference between two of values of a specified number (e.g., 3) of target channel audio data is smaller than or equal to a first amplitude threshold, where the target channel audio data includes channel audio data corresponding to any channel in the channel audio data corresponding to each channel. The corresponding waveform for this case can be as shown in fig. 2. The target channel audio data may be channel audio data corresponding to any one channel, which is not particularly limited in this embodiment.
Accordingly, in a possible implementation manner of this embodiment, the identifying unit 53 may be specifically configured to identify that the sound quality of the target audio file is the second sound quality if the difference between the values of two consecutive target channel audio data is greater than or equal to a second amplitude threshold and the signs of the values of the two consecutive target channel audio data are opposite, where the target channel audio data includes channel audio data corresponding to any channel in the channel audio data corresponding to each channel. The corresponding waveform for this case can be as shown in fig. 3. The target channel audio data may be channel audio data corresponding to any one channel, which is not particularly limited in this embodiment.
Optionally, in a possible implementation manner of this embodiment, the feature unit 52 may be further configured to perform frame division processing on target channel audio data to obtain at least one frame of audio data, where the target channel audio data includes channel audio data corresponding to any channel in channel audio data corresponding to each channel; and performing frequency domain transformation processing on the at least one frame of audio data to obtain frequency domain data corresponding to each frame of audio data. The target channel audio data may be channel audio data corresponding to any one channel, which is not particularly limited in this embodiment.
In particular, the frequency domain Transform process may include, but is not limited to, a Fast Fourier Transform (FFT).
For example, the feature unit 52 may perform framing processing on the target channel audio data at intervals of 20ms with 50% data overlap between adjacent frames to obtain at least one frame of audio data. Then, the feature unit 52 may perform FFT on the at least one frame of audio data to obtain frequency domain data, denoted as a, corresponding to each frame of audio datai,j(ii) a Wherein i represents the number of frequency points, j represents the number of frames, Ai,jRepresenting the frequency domain data of the jth frame at the ith frequency point.
Accordingly, in a possible implementation manner of this embodiment, the identifying unit 53 may be specifically configured to obtain, according to the frequency domain data corresponding to each frame of audio data, an energy component at each frequency point of the frequency domain data corresponding to each frame of audio data; and if the difference value between every two frequency domain data corresponding to each frame of audio data in the energy components at least one same frequency point is smaller than or equal to the energy threshold value, identifying the tone quality of the target audio file as the second tone quality. The corresponding energy spectrum for this case can be seen in fig. 4.
For example, the identification unit 53 records as a frequency domain data corresponding to each frame of the obtained audio datai,jObtaining the energy component E of the frequency domain data corresponding to each frame of audio data at each frequency pointi,j(ii) a Wherein i represents the number of frequency points, j represents the number of frames, Ei,jRepresenting the energy component of the jth frame at the ith frequency point.
In this embodiment, acquire the target audio file of treating discernment through the acquisition element, and then by the characteristic cell basis the target audio file, obtain the time domain waveform characteristic of target audio file with at least one item in the frequency domain spectral line characteristic of target audio file for the identification element can be according to time domain waveform characteristic with at least one item in the frequency domain spectral line characteristic is discerned the tone quality of target audio file is first tone quality or second tone quality, first tone quality is higher than second tone quality, like this, just can provide the audio file of real high tone quality to the user, makes the user can appreciate the audio file of real high tone quality.
In addition, the technical scheme provided by the invention is simple to operate, and can effectively improve the efficiency of tone quality identification of the audio file.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the embodiments provided in the present invention, it should be understood that the disclosed system, apparatus and method may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units is only one logical division, and other divisions may be realized in practice, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, or in a form of hardware plus a software functional unit.
The integrated unit implemented in the form of a software functional unit may be stored in a computer readable storage medium. The software functional unit is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, an audio processing engine, or a network device) or a processor (processor) to execute some steps of the methods according to the embodiments of the present invention. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (12)

1. A method for recognizing the sound quality of an audio file is characterized by comprising the following steps:
acquiring a target audio file to be identified;
according to the target audio file, at least one of a time domain waveform characteristic of the target audio file and a frequency domain spectral line characteristic of the target audio file is obtained;
according to at least one of the time domain waveform characteristics and the frequency domain spectral line characteristics, identifying the tone quality of the target audio file as a first tone quality or a second tone quality, wherein the first tone quality is higher than the second tone quality; wherein,
the obtaining at least one of a time-domain waveform characteristic of the target audio file and a frequency-domain spectral line characteristic of the target audio file according to the target audio file includes:
determining the number of channels of the target audio file;
decoding the data blocks of the target audio file to obtain original audio data;
obtaining channel audio data corresponding to each channel according to the number of the channels and the original audio data;
after obtaining the channel audio data corresponding to each channel according to the number of channels and the original audio data, the method further includes:
performing frame processing on target channel audio data to obtain at least one frame of audio data, wherein the target channel audio data comprises channel audio data corresponding to any channel in the channel audio data corresponding to each channel;
and performing frequency domain transformation processing on the at least one frame of audio data to obtain frequency domain data corresponding to each frame of audio data.
2. The method of claim 1, wherein identifying the sound quality of the target audio file as a first sound quality or a second sound quality based on at least one of the time-domain waveform characteristics and the frequency-domain spectral line characteristics comprises:
if the number of the sound channels is larger than or equal to 2, obtaining first sound channel audio data and second sound channel audio data corresponding to at least two sound channels according to the sound channel audio data corresponding to each sound channel;
adding the first channel audio data and the second channel audio data to obtain mixed channel audio data;
if the mixed channel audio data is greater than or equal to the first channel audio data/N or the second channel audio data/M, identifying the tone quality of the target audio file as the first tone quality;
if the mixed channel audio data is smaller than the first channel audio data/N or the second channel audio data/M, identifying the tone quality of the target audio file as the second tone quality; wherein,
n is a number greater than 1; m is a number greater than 1.
3. The method of claim 1, wherein identifying the sound quality of the target audio file as a first sound quality or a second sound quality based on at least one of the time-domain waveform characteristics and the frequency-domain spectral line characteristics comprises:
if the difference value between every two of the values of the target sound channel audio data with the continuously specified number is smaller than or equal to a first amplitude threshold value, identifying the sound quality of the target audio file as the second sound quality, wherein the target sound channel audio data comprises sound channel audio data corresponding to any sound channel in the sound channel audio data corresponding to each sound channel; or
And if the difference value of the values of the two continuous target channel audio data is greater than or equal to a second amplitude threshold value and the signs of the values of the two continuous target channel audio data are opposite, identifying that the tone quality of the target audio file is the second tone quality, wherein the target channel audio data comprises channel audio data corresponding to any channel in the channel audio data corresponding to each channel.
4. The method of claim 1, wherein identifying the sound quality of the target audio file as a first sound quality or a second sound quality based on at least one of the time-domain waveform characteristics and the frequency-domain spectral line characteristics comprises:
according to the frequency domain data corresponding to each frame of audio data, obtaining the energy component of the frequency domain data corresponding to each frame of audio data at each frequency point;
and if the difference value between every two frequency domain data corresponding to each frame of audio data in the energy components at least one same frequency point is smaller than or equal to the energy threshold value, identifying the tone quality of the target audio file as the second tone quality.
5. The method according to any one of claims 1 to 4, wherein before the obtaining the target audio file to be identified, the method further comprises:
acquiring format parameters of the candidate audio files;
determining the candidate audio file as the target audio file according to the format parameter; or identifying the tone quality of the candidate audio file as the second tone quality.
6. The method of claim 5, wherein the format parameter comprises at least one of a compression format, a sampling rate, a sampling depth, and a code rate.
7. An apparatus for recognizing a sound quality of an audio file, comprising:
the acquisition unit is used for acquiring a target audio file to be identified;
the characteristic unit is used for acquiring at least one of the time domain waveform characteristic of the target audio file and the frequency domain spectral line characteristic of the target audio file according to the target audio file;
the identification unit is used for identifying the tone quality of the target audio file as a first tone quality or a second tone quality according to at least one of the time domain waveform characteristics and the frequency domain spectral line characteristics, wherein the first tone quality is higher than the second tone quality; wherein,
the characteristic unit, in particular for
Determining the number of channels of the target audio file;
decoding the data blocks of the target audio file to obtain original audio data; and
obtaining channel audio data corresponding to each channel according to the number of the channels and the original audio data;
the characteristic unit is also used for
Performing frame processing on target channel audio data to obtain at least one frame of audio data, wherein the target channel audio data comprises channel audio data corresponding to any channel in the channel audio data corresponding to each channel; and
and performing frequency domain transformation processing on the at least one frame of audio data to obtain frequency domain data corresponding to each frame of audio data.
8. Device according to claim 7, characterized in that the identification unit is, in particular, adapted to
If the number of the sound channels is larger than or equal to 2, obtaining first sound channel audio data and second sound channel audio data corresponding to at least two sound channels according to the sound channel audio data corresponding to each sound channel;
adding the first channel audio data and the second channel audio data to obtain mixed channel audio data; and
if the mixed channel audio data is greater than or equal to the first channel audio data/N or the second channel audio data/M, identifying the tone quality of the target audio file as the first tone quality;
if the mixed channel audio data is smaller than the first channel audio data/N or the second channel audio data/M, identifying the tone quality of the target audio file as the second tone quality; wherein,
n is a number greater than 1; m is a number greater than 1.
9. Device according to claim 7, characterized in that the identification unit is, in particular, adapted to
If the difference value between every two of the values of the target sound channel audio data with the continuously specified number is smaller than or equal to a first amplitude threshold value, identifying the sound quality of the target audio file as the second sound quality, wherein the target sound channel audio data comprises sound channel audio data corresponding to any sound channel in the sound channel audio data corresponding to each sound channel; or
And if the difference value of the values of the two continuous target channel audio data is greater than or equal to a second amplitude threshold value and the signs of the values of the two continuous target channel audio data are opposite, identifying that the tone quality of the target audio file is the second tone quality, wherein the target channel audio data comprises channel audio data corresponding to any channel in the channel audio data corresponding to each channel.
10. Device according to claim 7, characterized in that the identification unit is, in particular, adapted to
According to the frequency domain data corresponding to each frame of audio data, obtaining the energy component of the frequency domain data corresponding to each frame of audio data at each frequency point; and
and if the difference value between every two frequency domain data corresponding to each frame of audio data in the energy components at least one same frequency point is smaller than or equal to the energy threshold value, identifying the tone quality of the target audio file as the second tone quality.
11. The apparatus according to any one of claims 7 to 10, wherein the identification unit is further adapted to identify the user
Acquiring format parameters of the candidate audio files; and
determining the candidate audio file as the target audio file according to the format parameter; or identifying the tone quality of the candidate audio file as the second tone quality.
12. The apparatus of claim 11, wherein the format parameter comprises at least one of a compression format, a sampling rate, a sampling depth, and a code rate.
CN201410235733.3A 2014-05-29 2014-05-29 The acoustic fidelity identification method of audio file and device Active CN104036788B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410235733.3A CN104036788B (en) 2014-05-29 2014-05-29 The acoustic fidelity identification method of audio file and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410235733.3A CN104036788B (en) 2014-05-29 2014-05-29 The acoustic fidelity identification method of audio file and device

Publications (2)

Publication Number Publication Date
CN104036788A CN104036788A (en) 2014-09-10
CN104036788B true CN104036788B (en) 2016-10-05

Family

ID=51467534

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410235733.3A Active CN104036788B (en) 2014-05-29 2014-05-29 The acoustic fidelity identification method of audio file and device

Country Status (1)

Country Link
CN (1) CN104036788B (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105047200A (en) * 2015-07-21 2015-11-11 重庆邮电大学 FPGA-based FLAC hardware decoder and decoding method
CN105050021B (en) * 2015-08-05 2019-02-22 Oppo广东移动通信有限公司 Earphone sound quality detection method, system and terminal
CN105719661B (en) * 2016-01-29 2019-06-11 西安交通大学 A kind of stringed musical instrument performance sound quality automatic distinguishing method
CN106228994B (en) * 2016-07-26 2019-02-26 广州酷狗计算机科技有限公司 A kind of method and apparatus detecting sound quality
CN107895571A (en) * 2016-09-29 2018-04-10 亿览在线网络技术(北京)有限公司 Lossless audio file identification method and device
CN107886956B (en) * 2017-11-13 2020-12-11 广州酷狗计算机科技有限公司 Audio recognition method and device and computer storage medium
CN108111908A (en) * 2017-12-25 2018-06-01 深圳Tcl新技术有限公司 Audio quality determines method, equipment and computer readable storage medium
CN111554320A (en) * 2020-03-31 2020-08-18 紫光云技术有限公司 Audio stream Fourier analysis method based on Windows platform

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1213135A (en) * 1997-08-26 1999-04-07 三星电子株式会社 High quality audio encoding/decoding apparatus and digital versatile disc
CN1777891A (en) * 2003-04-24 2006-05-24 皇家飞利浦电子股份有限公司 Parameterized temporal feature analysis
CN1802696A (en) * 2003-06-05 2006-07-12 松下电器产业株式会社 Sound quality adjusting apparatus and sound quality adjusting method
EP1691348A1 (en) * 2005-02-14 2006-08-16 Ecole Polytechnique Federale De Lausanne Parametric joint-coding of audio sources
CN101479787A (en) * 2006-09-29 2009-07-08 Lg电子株式会社 Method for encoding and decoding object-based audio signal and apparatus thereof
CN101645265A (en) * 2008-08-05 2010-02-10 中兴通讯股份有限公司 Method and device for identifying audio category in real time
CN101762320A (en) * 2009-12-18 2010-06-30 深圳市万兴软件有限公司 Method for drawing audio waveform under MAC desktop and system thereof
CN102253987A (en) * 2011-07-01 2011-11-23 中山大学 Method and system for sequencing network MP3 (moving picture experts group audio layer-3) tone qualities
CN102510541A (en) * 2011-12-30 2012-06-20 Tcl数码科技(深圳)有限责任公司 Multi-screen interaction video and audio content switching method and media player
CN102568470A (en) * 2012-01-11 2012-07-11 广州酷狗计算机科技有限公司 Acoustic fidelity identification method and system for audio files
CN103262159A (en) * 2010-10-05 2013-08-21 华为技术有限公司 Method and apparatus for encoding/decoding multichannel audio signal

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20110049068A (en) * 2009-11-04 2011-05-12 삼성전자주식회사 Method and apparatus for encoding/decoding multichannel audio signal

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1213135A (en) * 1997-08-26 1999-04-07 三星电子株式会社 High quality audio encoding/decoding apparatus and digital versatile disc
CN1777891A (en) * 2003-04-24 2006-05-24 皇家飞利浦电子股份有限公司 Parameterized temporal feature analysis
CN1802696A (en) * 2003-06-05 2006-07-12 松下电器产业株式会社 Sound quality adjusting apparatus and sound quality adjusting method
EP1691348A1 (en) * 2005-02-14 2006-08-16 Ecole Polytechnique Federale De Lausanne Parametric joint-coding of audio sources
CN101479787A (en) * 2006-09-29 2009-07-08 Lg电子株式会社 Method for encoding and decoding object-based audio signal and apparatus thereof
CN101645265A (en) * 2008-08-05 2010-02-10 中兴通讯股份有限公司 Method and device for identifying audio category in real time
CN101762320A (en) * 2009-12-18 2010-06-30 深圳市万兴软件有限公司 Method for drawing audio waveform under MAC desktop and system thereof
CN103262159A (en) * 2010-10-05 2013-08-21 华为技术有限公司 Method and apparatus for encoding/decoding multichannel audio signal
CN102253987A (en) * 2011-07-01 2011-11-23 中山大学 Method and system for sequencing network MP3 (moving picture experts group audio layer-3) tone qualities
CN102510541A (en) * 2011-12-30 2012-06-20 Tcl数码科技(深圳)有限责任公司 Multi-screen interaction video and audio content switching method and media player
CN102568470A (en) * 2012-01-11 2012-07-11 广州酷狗计算机科技有限公司 Acoustic fidelity identification method and system for audio files

Also Published As

Publication number Publication date
CN104036788A (en) 2014-09-10

Similar Documents

Publication Publication Date Title
CN104036788B (en) The acoustic fidelity identification method of audio file and device
US9875752B2 (en) Voice profile management and speech signal generation
TWI480855B (en) Extraction and matching of characteristic fingerprints from audio signals
CN109147805B (en) Audio tone enhancement based on deep learning
CN101421780B (en) Method and device for encoding and decoding time-varying signal
CN104718572B (en) Audio coding method and device, audio-frequency decoding method and device and the multimedia device using this method and device
CN109147806B (en) Voice tone enhancement method, device and system based on deep learning
CA2717584A1 (en) Method and apparatus for processing an audio signal
CN112767954B (en) Audio encoding and decoding method, device, medium and electronic equipment
TWI459377B (en) Electronic device, apparatus, method and computer program product for coding and decoding a transient frame
CN101521010B (en) Coding and decoding method for voice frequency signals and coding and decoding device
MX2014004797A (en) Lossless energy encoding method and apparatus, audio encoding method and apparatus, lossless energy decoding method and apparatus, and audio decoding method and apparatus.
CN111816197B (en) Audio encoding method, device, electronic equipment and storage medium
KR20120121895A (en) Method and apparatus for processing an audio signal
US9886962B2 (en) Extracting audio fingerprints in the compressed domain
CN110176241B (en) Signal encoding method and apparatus, and signal decoding method and apparatus
CN104064191A (en) Audio mixing method and device
CN104882146B (en) The processing method and processing device of audio promotion message
EP2929531B1 (en) Method of encoding and decoding audio signal and apparatus for encoding and decoding audio signal
CN105283915B (en) Digital watermark embedding device and method and digital watermark detecting device and method
CN113035213B (en) Digital audio watermark detection method and device
CN105336327B (en) The gain control method of voice data and device
CN104715756A (en) Audio data processing method and device
CN113112993B (en) Audio information processing method and device, electronic equipment and storage medium
CN108877816B (en) QMDCT coefficient-based AAC audio frequency recompression detection method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20160321

Address after: 100027 Haidian District, Qinghe Qinghe East Road, No. 23, building two, floor 2108, No., No. 18

Applicant after: BEIJING YINZHIBANG CULTURE TECHNOLOGY Co.,Ltd.

Address before: 100085 Beijing, Haidian District, No. ten on the street Baidu building, No. 10

Applicant before: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY Co.,Ltd.

C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20220414

Address after: 518057 3305, floor 3, building 1, aerospace building, No. 51, Gaoxin South ninth Road, high tech Zone community, Yuehai street, Nanshan District, Shenzhen, Guangdong

Patentee after: Shenzhen Taile Culture Technology Co.,Ltd.

Address before: 2108, floor 2, building 23, No. 18, anningzhuang East Road, Qinghe, Haidian District, Beijing 100027

Patentee before: BEIJING YINZHIBANG CULTURE TECHNOLOGY Co.,Ltd.