CN115767373A - Audio processing method, device, equipment and computer readable storage medium - Google Patents

Audio processing method, device, equipment and computer readable storage medium Download PDF

Info

Publication number
CN115767373A
CN115767373A CN202211372536.7A CN202211372536A CN115767373A CN 115767373 A CN115767373 A CN 115767373A CN 202211372536 A CN202211372536 A CN 202211372536A CN 115767373 A CN115767373 A CN 115767373A
Authority
CN
China
Prior art keywords
data
sound effect
target
audio
sampling
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211372536.7A
Other languages
Chinese (zh)
Inventor
高岩峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Mobile Communications Group Co Ltd
MIGU Culture Technology Co Ltd
Original Assignee
China Mobile Communications Group Co Ltd
MIGU Culture Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Mobile Communications Group Co Ltd, MIGU Culture Technology Co Ltd filed Critical China Mobile Communications Group Co Ltd
Priority to CN202211372536.7A priority Critical patent/CN115767373A/en
Publication of CN115767373A publication Critical patent/CN115767373A/en
Pending legal-status Critical Current

Links

Images

Abstract

The embodiment of the invention relates to the technical field of audio processing, and discloses an audio processing method, an audio processing device, audio processing equipment and a computer readable storage medium, wherein the method comprises the following steps: acquiring sampling data and non-sound effect data, wherein the sampling data comprises a target sound effect; correcting the impact response of the sampled data according to the non-sound effect data, and calculating to obtain sound effect parameters corresponding to the target sound effect according to the corrected impact response; and carrying out audio processing on the original audio data according to the sound effect parameters to obtain target audio data added with the target sound effect. Through the mode, the embodiment of the invention realizes that the sound effect parameters corresponding to the target sound effect are loaded on the original audio data through a simple audio processing process to obtain the target audio data added with the target sound effect, and reduces the influence of the recording equipment and the audio line on the sampled data through correcting the impact response, so that the finally obtained sound effect of the target audio data is consistent with the target sound effect.

Description

Audio processing method, device, equipment and computer readable storage medium
Technical Field
The embodiment of the invention relates to the technical field of audio processing, in particular to an audio processing method, an audio processing device, audio processing equipment and a computer readable storage medium.
Background
At present, the purpose of loading sound effect parameters on target audio data is achieved, so that the target audio data added with a target sound effect is obtained, professionals are required to judge through hearing, sound effect modules such as a balance device and a reverberator are used, experimental parameters are calculated manually, the realization process is complex, and the time period is long when the audio processing is completed.
Disclosure of Invention
In view of the foregoing problems, embodiments of the present invention provide an audio processing method, an audio processing apparatus, an audio processing device, and a computer-readable storage medium, which are used to solve the problems in the prior art that an implementation process is complex and it takes a long time period to complete audio processing.
According to an aspect of an embodiment of the present invention, there is provided an audio processing method, including:
acquiring sampling data and non-sound effect data, wherein the sampling data comprises a target sound effect;
correcting the impact response of the sampled data according to the non-sound effect data, and calculating to obtain sound effect parameters corresponding to the target sound effect according to the corrected impact response;
and carrying out audio processing on the original audio data according to the sound effect parameters to obtain target audio data added with the target sound effect.
In an alternative mode, the correcting the impulse response of the sampled data according to the null data includes:
calculating the sampling offset of the sound-effect-free data and the sampling data;
calculating to obtain a sound effect-free impact response according to the sound effect-free data;
cutting the non-sound effect impact response according to the sampling offset to obtain the impact response of recording interference;
and correcting the impact response according to the impact response of the recording interference to obtain the corrected impact response.
In an optional manner, before performing the audio processing on the original audio data according to the sound-effect parameters, the method further includes:
acquiring the sampling rate of the original audio data;
and carrying out sampling rate conversion on the sound effect parameters according to the sampling rate of the original audio data to obtain the sound effect parameters consistent with the sampling rate of the original audio data.
In an optional manner, the audio processing is performed on the original audio data according to the sound effect parameters to obtain target audio data added with the target sound effect, specifically:
performing Fourier transform on the original audio data and the sound effect parameters respectively to obtain the Fourier transform of the original audio data and the Fourier transform of the sound effect parameters;
acquiring Fourier transform of target audio data according to Fourier transform of the original audio data and Fourier transform of the sound effect parameters;
and performing inverse Fourier transform on the Fourier transform of the target audio data to obtain the target audio data added with the target sound effect.
In an optional manner, the acquiring the sample data and the null data includes:
recording a target sound effect and the sweep frequency data to obtain the sampling data;
recording the sweep frequency data to obtain the soundless data;
the sampling data comprises frequency sweeping data and a corresponding target sound effect; the sweep frequency data is audio data comprising audio in a preset frequency range; the silence data comprises frequency sweep data.
In an alternative mode, the original audio data is audio data subjected to slicing processing.
According to another aspect of the embodiments of the present invention, there is provided an audio processing apparatus including:
the sampling module is used for acquiring sampling data and soundless data;
the sound effect parameter calculation module is used for correcting the impact response of the sampled data according to the non-sound effect data and calculating to obtain a sound effect parameter corresponding to the target sound effect according to the corrected impact response;
and the sound effect loading module is used for carrying out audio processing on original audio data according to the sound effect parameters to obtain target audio data added with the target sound effect.
In an optional manner, the sound effect parameter calculating module is further configured to:
calculating the sampling offset of the sound-effect-free data and the sampling data;
calculating to obtain a sound effect-free impact response according to the sound effect-free data;
cutting the non-sound effect impact response according to the sampling offset to obtain the impact response of recording interference;
and correcting the impact response according to the impact response of the recording interference to obtain the corrected impact response.
According to another aspect of the embodiments of the present invention, there is provided an audio processing apparatus including: the system comprises a processor, a memory, a communication interface and a communication bus, wherein the processor, the memory and the communication interface complete mutual communication through the communication bus;
the memory is used for storing at least one executable instruction, and the executable instruction enables the processor to execute the operation of the audio processing method.
According to a further aspect of the embodiments of the present invention, there is provided a computer-readable storage medium having at least one executable instruction stored therein, which when executed on an audio processing apparatus, causes the audio processing apparatus to perform the operations of the audio processing method described above.
According to the embodiment of the invention, the sound effect parameter is obtained by calculation according to the sampling data and the non-sound effect data, the sound effect parameter corresponding to the target sound effect is loaded on the original audio data, the target audio data added with the target sound effect is obtained, a sound effect technology of balance and reverberation is not needed, the audio processing process is simple and convenient, a professional is not needed to complete the sound effect parameter calculation process of subsequent audio processing through auditory judgment, the audio processing period is short, and the influence of the recording equipment and the audio line on the sampling data is reduced through correcting the impact response, so that the finally obtained sound effect of the target audio data is consistent with the target sound effect.
Further, by performing fragment processing on the target audio data, real-time audio processing on the target audio data can be realized.
The foregoing description is only an overview of the technical solutions of the embodiments of the present invention, and the embodiments of the present invention can be implemented according to the content of the description in order to make the technical means of the embodiments of the present invention more clearly understood, and the detailed description of the present invention is provided below in order to make the foregoing and other objects, features, and advantages of the embodiments of the present invention more clearly understandable.
Drawings
The drawings are only for purposes of illustrating embodiments and are not to be construed as limiting the invention. Also, like reference numerals are used to refer to like parts throughout the drawings. In the drawings:
fig. 1 is a flow chart of an audio processing method provided by an embodiment of the invention;
fig. 2 is a schematic structural diagram of an audio processing apparatus according to an embodiment of the present invention;
FIG. 3 is a schematic structural diagram of a sound effect device according to an embodiment of the present invention.
Detailed Description
Exemplary embodiments of the present invention will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the invention are shown in the drawings, it should be understood that the invention can be embodied in various forms and should not be limited to the embodiments set forth herein.
First, the inventive concepts for implementing the present application are further elaborated: at present, the sound effect of a competitor needs to be realized, a professional needs to judge through hearing, sound effect modules such as balance and reverberator are used, a sound effect is completed through manual experiment parameters, and the time of several weeks is generally needed. The existing scheme has at least three problems: firstly, the realization speed is slow, and the development of a sound effect period is long; secondly, the current scheme requires professionals to judge through hearing, mainly simulates balance and reverberation, other effects are difficult to distinguish, and the accuracy of sound effect realization cannot be guaranteed under the condition of manpower consumption; thirdly, under the condition that the technology for realizing the sound effect of the competitive products is not provided, and the sound effect is lower than that of the competitive products, the sound effect of the competitive products cannot be simulated by the conventional scheme.
Fig. 1 shows a flow diagram of an audio processing method, performed by a computing device, provided by an embodiment of the invention. Wherein, the computing device may be a personal computer, a palm computer, a mobile phone, etc., as shown in fig. 1, the method includes the following steps:
step 110: and acquiring sampling data and non-sound effect data, wherein the sampling data comprises a target sound effect.
In an embodiment of the present invention, the acquiring of the sampling data and the null data includes: recording a target sound effect and the sweep frequency data to obtain the sampling data; recording the sweep frequency data to obtain the non-sound effect data; the sampling data comprises frequency sweeping data and a corresponding target sound effect; the frequency sweep data is audio data comprising audio in a preset frequency range; the non-acoustic data includes sweep frequency data.
The sampling data is audio data and adopts wav format. In the embodiment of the invention, the sweep frequency data is audio data comprising audio in a preset frequency range, and the preset frequency range comprises 20Hz-20000Hz. The sweep frequency data is a two-channel file, the left channel data and the right channel data are kept consistent, the length of processing time is reduced as far as possible under the condition of ensuring precision, and the duration of the sweep frequency data in the embodiment of the invention can be 19-21 seconds.
In the embodiment of the invention, a target sound effect and the sweep frequency data are recorded to obtain the sampling data; and recording the sweep frequency data to obtain the soundless data. Specifically, under the condition that the sound effect is started on target sound effect original equipment and application, frequency sweep data are played and recorded to obtain sampling data, and the recording method can adopt a mode of recording in an operating system or a mode of connecting an earphone socket and a microphone socket of a computer; and under the condition of not starting the target sound effect, playing the sweep frequency data through an API of an operating system, and recording to obtain non-sound effect data.
Step 120: and correcting the impact response of the sampling data according to the non-sound effect data, and calculating to obtain sound effect parameters corresponding to the target sound effect according to the corrected impact response.
Wherein, according to the impact response of no sound effect data to the sampling data carries on the correction, according to the impact response after correcting, calculate and get the sound effect parameter that the said goal sound effect corresponds, include: acquiring non-sound effect data; calculating the sampling offset of the sound-effect-free data and the sampling data; calculating to obtain a sound effect-free impact response according to the sound effect-free data; cutting the non-sound effect impact response according to the sampling offset to obtain the impact response of recording interference; and correcting the impact response according to the impact response of the recording interference to obtain the corrected impact response.
In the embodiment of the invention, the impulse response is calculated according to the sampling data. Specifically, data of sweep frequency data of sampling data is recorded as In, the sampling data is Out, the data In of the sweep frequency data includes sweep frequency data left channel data Inl and sweep frequency data right channel data Inr, and the sampling data Out includes sampling audio data left channel data Outl and sampling audio data right channel data Outr. The impulse response comprises a first left channel impulse response and a first right channel impulse response; calculating an impulse response of the sampled data, comprising: a first left channel impulse response and a first right channel impulse response of the sampled data are calculated.
Calculating a Fourier transform of the first left channel impulse response:
Hl=FFT(Inl)/FFT(Outl);
wherein, FFT (Inl) represents fourier transform of the left channel data Inl of the sweep data, and FFT (Outl) represents fourier transform of the left channel data Outl of the sample data.
Calculating a Fourier transform of the first right channel impulse response:
Hl=FFT(Inr)/FFT(Outr);
here, FFT (Inr) represents fourier transform of sweep data right channel data Inr, and FFT (Outr) represents fourier transform of sample data right channel data Outr.
In the embodiment of the invention, the sampling offset of the sound-effect-free data and the sampling data is calculated. Specifically, aligning the non-sound effect data with the sampling data, and calculating the offset of the non-sound effect data and the sampling data according to the following formula:
Figure BDA0003925631630000061
wherein, x is the data sequence of the sampling data, y is the data sequence of the silent data, and d is the serial number of the data in the sequence. And traversing all values of d, and when the value of R (d) is maximum, taking d as the offset.
In the embodiment of the invention, the non-sound effect impact response is calculated according to the non-sound effect data. Specifically, the sound-free impact response comprises a third left channel impact response and a third right channel impact response; recording the sound-effect-free frequency sweep data as In ', the sound-effect-free data as Out', the sound-effect-free frequency sweep data In 'comprising sound-effect-free frequency sweep data left channel data In' l and sound-effect-free frequency sweep data right channel data In 'r, the sound-effect-free data Out' comprising sound-effect-free left channel data Out 'l and sound-effect-free right channel data Out' r, and calculating to obtain Fourier transform of third left channel impact response:
Hnl=FFT(In′l)/FFT(Out′l);
wherein, FFT (In 'l) represents Fourier transform of the left channel data of the soundless sweep data, and FFT (Out' l) represents Fourier transform of the left channel data of the soundless sweep data.
Calculating a Fourier transform of the third right channel impulse response:
Hnl=FFT(In′r)/FFT(Out′r);
wherein, FFT (In 'r) represents the Fourier transform of the sound effect free sweep data right channel data, and FFT (Out' r) represents the Fourier transform of the sound effect free sweep data right channel data.
In the embodiment of the invention, the non-sound effect impact response is cut according to the sampling offset, so that the impact response of recording interference is obtained. Specifically, the shift amount is used to cut Hnl and Hnr, and the impulse response of the recording interference aligned with Hl and Hr is obtained, and the impulse response of the recording interference includes a fourth left channel impulse response Hn 'l and a fourth right channel impulse response Hn' r. Specifically, the impulse response is an integer data sequence, for example, 44100 samples of one second monaural audio data correspond to the first sample number 0, the second sample number 1, and so on. And clipping Hnl and Hnl by using the offset, namely deleting samples with serial numbers smaller than or equal to the offset.
In the embodiment of the invention, the impact response is corrected according to the impact response of the recording interference, and the corrected impact response is obtained. Specifically, the corrected impulse response includes a second left channel impulse response Hel and a second right channel impulse response Her; the second left channel impulse response is calculated by the following formula:
Hel=IFFT(FFT(Hl)/FFT(Hn′l));
where IFFT () represents an inverse fourier transform, FFT (Hl) represents a fourier transform of the first left channel impulse response Hl, and FFT (Hn 'l) represents a fourier transform of the fourth left channel impulse response Hn' l.
Wherein the second right channel impulse response is obtained by the following formula:
Her=IFFT(FFT(Hr)/FFT(Hn′r));
where IFFT () represents an inverse fourier transform, FFT (Hr) represents a fourier transform of the first right channel impulse response Hr, and FFT (Hn 'r) represents a fourier transform of the fourth right channel impulse response Hn' r.
In the embodiment of the present invention, the impact response is an array, where m (e.g. 10000) number elements before the impact response are intercepted from the array and recorded as the sound effect parameter P, where P is recorded by using floating point number, the maximum value is 1, the minimum value is-1,P includes a left channel sound effect parameter Pl and a right channel sound effect parameter Pr, where Pl is intercepted by Hel, and Pr is intercepted by Her; the Pl and Pr are PCM audio formats. It can be understood that the number of elements of the left channel sound effect parameter Pl and the right channel sound effect parameter Pr is the same.
Step 130: and carrying out audio processing on the original audio data according to the sound effect parameters to obtain target audio data added with the target sound effect.
In the embodiment of the invention, the audio processing is carried out on the original audio data according to the sound effect parameters to obtain the target audio data added with the target sound effect. Specifically, fourier transform is respectively carried out on the sound effect parameters and the original audio data to obtain the Fourier transform of the sound effect parameters and the Fourier transform of the original audio data; acquiring Fourier transform of target audio data according to Fourier transform of the original audio data and Fourier transform of the sound effect parameters; and carrying out inverse Fourier transform on the Fourier transform of the target audio data to obtain the target audio data added with the target sound effect.
In an embodiment of the present invention, before performing audio processing on original audio data according to the sound effect parameter, the method further includes: acquiring the sampling rate of the original audio data; and carrying out sampling rate conversion on the sound effect parameters according to the sampling rate of the original audio data to obtain the sound effect parameters consistent with the sampling rate of the original audio data. Specifically, the impact response is related to the sampling rate, the same sound effect is realized, different sampling rates are used when the sampling data are recorded, and the obtained results are different. The impulse response calculated from the sampled data can only be processed for audio data of the same sampling rate. In this embodiment of the present invention, taking the original audio data as the song file as an example, before step 130, the method further includes: and acquiring the sampling rate of the song file, and obtaining the sound effect parameter T (P) of the converted sampling rate by using an algorithm of converting the audio sampling rate for the sound effect parameter, wherein the T (P) is the same as the sampling rate of the song file, and the T (P) comprises a left channel sound effect parameter T (Pl) of the converted sampling rate and a right channel sound effect parameter T (Pr) of the converted sampling rate.
In this embodiment of the present invention, the original audio data is audio data subjected to slice processing. Taking the original audio data as the song file as an example, the song file is sliced, that is, the song is divided into a plurality of data segments, and the length of each data segment is not required to be enforced. One of the data segments is denoted as S, which is data in PCM format, and includes a left channel data segment Sl and a right channel data segment Sr. The audio processing is carried out on the original audio data according to the sound effect parameters to obtain target audio data added with the target sound effect, and the method comprises the following steps: fourier transform is carried out on the data section S to obtain F (S), fourier transform is carried out on the sound effect parameter T (P) of the converted sampling rate to obtain FT (P), and Fourier transform of the song data added with the target sound effect is obtained through calculation:
F(E)=F(S)*FT(P);
and E represents song data of the sound effect parameter corresponding to the added target sound effect, and the song data E added with the target sound effect is obtained by performing inverse Fourier transform on F (E).
According to the embodiment of the invention, the sound effect parameters are calculated according to the sampling data and the non-sound effect data, the sound effect parameters corresponding to the target sound effect are loaded on the original audio data, the target audio data added with the target sound effect is obtained, the sound effect technology of balance and reverberation is not needed, the audio processing process is simple and convenient, the sound effect parameter calculation process of the subsequent audio processing is not needed to be completed by a professional through auditory judgment, the period of audio processing is short, and the influence of the sound recording equipment and the audio line on the sampling data is reduced through correcting the impact response, so that the finally obtained sound effect of the target audio data is consistent with the target sound effect.
Further, by performing fragment processing on the target audio data, real-time audio processing on the target audio data can be realized.
Fig. 2 shows a schematic structural diagram of an audio processing apparatus according to an embodiment of the present invention, and as shown in fig. 2, the apparatus 200 includes: a sampling module 210, a sound effect parameter calculation module 220 and an effect loading module 230.
The sampling module 210 is configured to obtain sampling data and null data; the sound effect parameter calculation module 220 is configured to correct an impact response of the sampled data according to the non-sound effect data, and calculate a sound effect parameter corresponding to the target sound effect according to the corrected impact response; the sound effect loading module 230 is configured to perform audio processing on the original audio data according to the sound effect parameters, so as to obtain target audio data added with the target sound effect.
In an optional manner, the sound effect parameter calculating module 220 is further configured to: calculating the sampling offset of the sound-effect-free data and the sampling data; calculating to obtain a sound effect-free impact response according to the sound effect-free data; cutting the non-sound effect impact response according to the sampling offset to obtain the impact response of recording interference; and correcting the impact response according to the impact response of the recording interference to obtain the corrected impact response.
In an optional manner, before performing the audio processing on the original audio data according to the sound-effect parameters, the method further includes: acquiring the sampling rate of the original audio data; and carrying out sampling rate conversion on the sound effect parameters according to the sampling rate of the original audio data to obtain the sound effect parameters consistent with the sampling rate of the original audio data.
In an optional manner, the audio processing is performed on original audio data according to the sound effect parameter to obtain target audio data added with the target sound effect, and the specific steps are as follows: respectively carrying out Fourier transform on the original audio data and the sound effect parameters to obtain the Fourier transform of the original audio data and the Fourier transform of the sound effect parameters; acquiring Fourier transform of target audio data according to Fourier transform of the original audio data and Fourier transform of the sound effect parameters; and carrying out inverse Fourier transform on the Fourier transform of the target audio data to obtain the target audio data added with the target sound effect.
In an optional manner, the acquiring the sample data and the null data includes: recording a target sound effect and the sweep frequency data to obtain the sampling data; recording the sweep frequency data to obtain the non-sound effect data; the sampling data comprises frequency sweeping data and a corresponding target sound effect; the sweep frequency data is audio data comprising audio in a preset frequency range; the non-acoustic data includes sweep frequency data.
In an alternative mode, the original audio data is audio data subjected to slicing processing.
According to the embodiment of the invention, the sound effect parameters are calculated according to the sampling data and the non-sound effect data, the sound effect parameters corresponding to the target sound effect are loaded on the original audio data, the target audio data added with the target sound effect is obtained, the sound effect technology of balance and reverberation is not needed, the audio processing process is simple and convenient, the sound effect parameter calculation process of the subsequent audio processing is not needed to be completed by a professional through auditory judgment, the period of audio processing is short, and the influence of the sound recording equipment and the audio line on the sampling data is reduced through correcting the impact response, so that the finally obtained sound effect of the target audio data is consistent with the target sound effect.
Further, by performing fragment processing on the target audio data, real-time audio processing on the target audio data can be realized.
Fig. 3 is a schematic structural diagram of an audio processing device according to an embodiment of the present invention, and the specific embodiment of the present invention does not limit the specific implementation of the audio processing device. As shown in fig. 3, the audio processing apparatus may include: a processor (processor) 402, a Communications Interface 404, a memory 406, and a Communications bus 408.
Wherein: the processor 402, communication interface 404, and memory 406 communicate with each other via a communication bus 408. A communication interface 404 for communicating with network elements of other devices, such as clients or other servers. The processor 402 is configured to execute the program 410, and may specifically perform the relevant steps in the embodiment of the audio processing method described above.
In particular, program 410 may include program code comprising computer-executable instructions.
The processor 402 may be a central processing unit CPU, or an Application Specific Integrated Circuit ASIC (Application Specific Integrated Circuit), or one or more Integrated circuits configured to implement an embodiment of the present invention. The audio processing device includes one or more processors, which may be the same type of processor, such as one or more CPUs; or may be different types of processors such as one or more CPUs and one or more ASICs.
And a memory 406 for storing a program 410. Memory 406 may comprise high-speed RAM memory, and may also include non-volatile memory (non-volatile memory), such as at least one disk memory.
The program 410 may specifically be invoked by the processor 402 to cause the audio processing device to perform the following operations:
acquiring sampling data and non-sound effect data; correcting the impact response of the sampled data according to the non-sound effect data, and calculating to obtain sound effect parameters corresponding to the target sound effect according to the corrected impact response; and carrying out audio processing on the original audio data according to the sound effect parameters to obtain target audio data added with the target sound effect.
In an alternative mode, the correcting the impulse response of the sampled data according to the null data includes: calculating the sampling offset of the sound-effect-free data and the sampling data; calculating to obtain a sound effect-free impact response according to the sound effect-free data; cutting the non-sound effect impact response according to the sampling offset to obtain the impact response of recording interference; and correcting the impact response according to the impact response of the recording interference to obtain the corrected impact response.
In an optional manner, before performing audio processing on the original audio data according to the sound effect parameters, the method further includes: acquiring the sampling rate of the original audio data; and carrying out sampling rate conversion on the sound effect parameters according to the sampling rate of the original audio data to obtain the sound effect parameters consistent with the sampling rate of the original audio data.
In an optional manner, the audio processing is performed on the original audio data according to the sound effect parameters to obtain target audio data added with the target sound effect, specifically: respectively carrying out Fourier transform on the original audio data and the sound effect parameters to obtain the Fourier transform of the original audio data and the Fourier transform of the sound effect parameters; acquiring Fourier transform of target audio data according to Fourier transform of the original audio data and Fourier transform of the sound effect parameters; and carrying out inverse Fourier transform on the Fourier transform of the target audio data to obtain the target audio data added with the target sound effect.
In an optional manner, the acquiring the sample data and the null data includes: recording a target sound effect and the sweep frequency data to obtain the sampling data; recording the sweep frequency data to obtain the non-sound effect data; the sampling data comprises frequency sweeping data and a corresponding target sound effect; the sweep frequency data is audio data comprising audio in a preset frequency range; the non-acoustic data includes sweep frequency data.
In an alternative mode, the original audio data is audio data subjected to slicing processing.
According to the embodiment of the invention, the sound effect parameters are calculated according to the sampling data and the non-sound effect data, the sound effect parameters corresponding to the target sound effect are loaded on the original audio data, the target audio data added with the target sound effect is obtained, the sound effect technology of balance and reverberation is not needed, the audio processing process is simple and convenient, the sound effect parameter calculation process of the subsequent audio processing is not needed to be completed by a professional through auditory judgment, the period of audio processing is short, and the influence of the sound recording equipment and the audio line on the sampling data is reduced through correcting the impact response, so that the finally obtained sound effect of the target audio data is consistent with the target sound effect.
Further, by performing fragment processing on the target audio data, real-time audio processing on the target audio data can be realized.
An embodiment of the present invention provides a computer-readable storage medium, where the storage medium stores at least one executable instruction, and when the executable instruction is executed on an audio processing device, the audio processing device executes an audio processing method in any method embodiment described above.
The executable instructions may be specifically configured to cause the audio processing device to perform the following operations:
acquiring sampling data and soundless data; the sampling data comprises frequency sweeping data and a corresponding target sound effect; the frequency sweep data is audio data comprising audio in a preset frequency range; the non-sound effect data comprises frequency sweep data; correcting the impact response of the sampled data according to the non-sound effect data, and calculating to obtain sound effect parameters corresponding to the target sound effect according to the corrected impact response; and carrying out audio processing on the original audio data according to the sound effect parameters to obtain target audio data added with the target sound effect.
In an optional manner, the acquiring the sample data and the null data includes: recording a target sound effect and the sweep frequency data to obtain the sampling data; and recording the sweep frequency data to obtain the unvoiced data.
In an alternative mode, the correcting the impulse response of the sampled data according to the null data includes: calculating the sampling offset of the sound-effect-free data and the sampling data; calculating to obtain a sound effect-free impact response according to the sound effect-free data; cutting the non-sound effect impact response according to the sampling offset to obtain the impact response of recording interference; and correcting the impact response according to the impact response of the recording interference to obtain the corrected impact response.
In an optional manner, after the sound effect parameter corresponding to the target sound effect is obtained through calculation, the method further includes: acquiring the sampling rate of the original audio data; and carrying out sampling rate conversion on the sound effect parameters according to the sampling rate of the original audio data to obtain the sound effect parameters consistent with the sampling rate of the original audio data.
In an optional manner, the audio processing is performed on original audio data according to the sound effect parameter to obtain target audio data added with the target sound effect, and the specific steps are as follows: respectively calculating Fourier transform of the original audio data and the sound effect parameters to obtain the Fourier transform of the original audio data and the Fourier transform of the sound effect parameters; calculating to obtain Fourier transform of target audio data according to Fourier transform of the original audio data and Fourier transform of the sound effect parameters; and carrying out inverse Fourier transform on the Fourier transform of the target audio data to obtain the target audio data added with the target sound effect.
In an alternative mode, the original audio data is audio data subjected to slicing processing.
According to the embodiment of the invention, the sound effect parameters are calculated according to the sweep frequency data, the sound effect parameters are loaded on the target audio data, the target audio data added with the target sound effect is obtained, the sound effect technology of balance and reverberation is not needed, the audio processing process is simple and convenient, the sound effect parameter calculation process of subsequent audio processing is not needed to be completed by professionals through auditory judgment, the period of audio processing is short, and the influence of recording equipment and audio lines on the sampled data is reduced by correcting impact response, so that the sound effect finally obtained by the target audio data is consistent with the target sound effect.
Further, by performing fragment processing on the target audio data, real-time audio processing on the target audio data can be realized.
An embodiment of the present invention provides an audio processing apparatus, configured to execute the audio processing method.
Embodiments of the present invention provide a computer program that can be invoked by a processor to cause an audio processing device to perform an audio processing method in any of the above method embodiments.
Embodiments of the present invention provide a computer program product comprising a computer program stored on a computer-readable storage medium, the computer program comprising program instructions that, when run on a computer, cause the computer to perform the audio processing method of any of the method embodiments described above.
The algorithms or displays presented herein are not inherently related to any particular computer, virtual system, or other apparatus. Various general purpose systems may also be used with the teachings herein. The required structure for constructing such a system will be apparent from the description above. In addition, embodiments of the present invention are not directed to any particular programming language. It is appreciated that a variety of programming languages may be used to implement the teachings of the present invention as described herein, and any descriptions of specific languages are provided above to disclose the best mode of the invention.
In the description provided herein, numerous specific details are set forth. It is understood, however, that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description.
Similarly, it should be appreciated that in the foregoing description of exemplary embodiments of the invention, various features of the embodiments of the invention are sometimes grouped together in a single embodiment, figure, or description thereof for the purpose of streamlining the invention and aiding in the understanding of one or more of the various inventive aspects. However, the disclosed method should not be interpreted as reflecting an intention that: that the invention as claimed requires more features than are expressly recited in each claim.
Those skilled in the art will appreciate that the modules in the device in an embodiment may be adaptively changed and disposed in one or more devices different from the embodiment. The modules or units or components in the embodiments may be combined into one module or unit or component, and may be divided into a plurality of sub-modules or sub-units or sub-components. All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and all of the processes or elements of any method or apparatus so disclosed, may be combined in any combination, except combinations where at least some of such features and/or processes or elements are mutually exclusive. Each feature disclosed in this specification (including any accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention may be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The usage of the words first, second and third, etcetera do not indicate any ordering. These words may be interpreted as names. The steps in the above embodiments should not be construed as limiting the order of execution unless specified otherwise.

Claims (10)

1. A method of audio processing, the method comprising:
acquiring sampling data and non-sound effect data, wherein the sampling data comprises a target sound effect;
correcting the impact response of the sampled data according to the non-sound effect data, and calculating to obtain sound effect parameters corresponding to the target sound effect according to the corrected impact response;
and carrying out audio processing on the original audio data according to the sound effect parameters to obtain target audio data added with the target sound effect.
2. The method of claim 1, wherein said correcting the impulse response of the sampled data based on non-sound valid data comprises:
calculating the sampling offset of the sound-effect-free data and the sampling data;
calculating to obtain a sound effect-free impact response according to the sound effect-free data;
cutting the non-sound effect impact response according to the sampling offset to obtain the impact response of recording interference;
and correcting the impact response according to the impact response of the recording interference to obtain the corrected impact response.
3. The method of claim 1, wherein prior to audio processing the raw audio data according to the sound-effect parameters, the method further comprises:
acquiring the sampling rate of the original audio data;
and carrying out sampling rate conversion on the sound effect parameters according to the sampling rate of the original audio data to obtain the sound effect parameters consistent with the sampling rate of the original audio data.
4. The method according to claim 1, wherein the audio processing is performed on original audio data according to the sound effect parameters to obtain target audio data added with the target sound effect, specifically:
respectively carrying out Fourier transform on the original audio data and the sound effect parameters to obtain the Fourier transform of the original audio data and the Fourier transform of the sound effect parameters;
acquiring Fourier transform of target audio data according to Fourier transform of the original audio data and Fourier transform of the sound effect parameters;
and carrying out inverse Fourier transform on the Fourier transform of the target audio data to obtain the target audio data added with the target sound effect.
5. The method of claim 1, wherein the obtaining sample data and silence data comprises:
recording a target sound effect and the sweep frequency data to obtain the sampling data;
recording the sweep frequency data to obtain the non-sound effect data;
the sampling data comprises frequency sweep data; the sweep frequency data is audio data comprising audio in a preset frequency range; the non-acoustic data includes sweep frequency data.
6. The method of claim 1, wherein the raw audio data is sliced audio data.
7. An audio processing apparatus, characterized in that the apparatus comprises:
a sampling module: the system comprises a data acquisition module, a data processing module and a data processing module, wherein the data acquisition module is used for acquiring sampling data and non-sound effect data, and the sampling data comprises a target sound effect;
a sound effect parameter calculation module: the system comprises a data acquisition module, a data processing module and a data processing module, wherein the data acquisition module is used for acquiring sampling data of a target sound effect;
a sound effect loading module: and the audio processing module is used for carrying out audio processing on the original audio data according to the sound effect parameters to obtain target audio data added with the target sound effect.
8. The apparatus of claim 7, wherein the sound-effect parameter calculating module is further configured to:
calculating the sampling offset of the sound-effect-free data and the sampling data;
calculating to obtain a sound effect-free impact response according to the sound effect-free data;
cutting the non-sound effect impact response according to the sampling offset to obtain the impact response of recording interference;
and correcting the impact response according to the impact response of the recording interference to obtain the corrected impact response.
9. An audio processing device, comprising: the system comprises a processor, a memory, a communication interface and a communication bus, wherein the processor, the memory and the communication interface complete mutual communication through the communication bus;
the memory is configured to store at least one executable instruction that causes the processor to perform the operations of the audio processing method of any of claims 1-6.
10. A computer-readable storage medium having stored therein at least one executable instruction that, when executed on an audio processing device, causes the audio processing device to perform operations of the audio processing method of any one of claims 1-6.
CN202211372536.7A 2022-11-03 2022-11-03 Audio processing method, device, equipment and computer readable storage medium Pending CN115767373A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211372536.7A CN115767373A (en) 2022-11-03 2022-11-03 Audio processing method, device, equipment and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211372536.7A CN115767373A (en) 2022-11-03 2022-11-03 Audio processing method, device, equipment and computer readable storage medium

Publications (1)

Publication Number Publication Date
CN115767373A true CN115767373A (en) 2023-03-07

Family

ID=85356171

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211372536.7A Pending CN115767373A (en) 2022-11-03 2022-11-03 Audio processing method, device, equipment and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN115767373A (en)

Similar Documents

Publication Publication Date Title
CN110503976B (en) Audio separation method and device, electronic equipment and storage medium
EP3338461B1 (en) Microphone array signal processing system
CN111383646B (en) Voice signal transformation method, device, equipment and storage medium
WO2022012195A1 (en) Audio signal processing method and related apparatus
CN104134444B (en) A kind of song based on MMSE removes method and apparatus of accompanying
CN111031463B (en) Microphone array performance evaluation method, device, equipment and medium
CN111192594B (en) Method for separating voice and accompaniment and related product
WO2013138122A2 (en) Automatic realtime speech impairment correction
CN110738980A (en) Singing voice synthesis model training method and system and singing voice synthesis method
CN111210850B (en) Lyric alignment method and related product
CN111640411A (en) Audio synthesis method, device and computer readable storage medium
CN114333865A (en) Model training and tone conversion method, device, equipment and medium
CN109074814B (en) Noise detection method and terminal equipment
WO2021036046A1 (en) Sound separating method and apparatus, and electronic device
CN112992190B (en) Audio signal processing method and device, electronic equipment and storage medium
JP2000069597A (en) Method for measuring impulse response
CN112652290A (en) Method for generating reverberation audio signal and training method of audio processing model
CN115767373A (en) Audio processing method, device, equipment and computer readable storage medium
CN107919136B (en) Digital voice sampling frequency estimation method based on Gaussian mixture model
JP2004274234A (en) Reverberation eliminating method for sound signal, apparatus therefor, reverberation eliminating program for sound signal and recording medium with record of the program
CN111596261B (en) Sound source positioning method and device
CN113593604A (en) Method, device and storage medium for detecting audio quality
JP5169584B2 (en) Impulse response processing device, reverberation imparting device and program
CN111885474A (en) Microphone testing method and device
CN112420004A (en) Method and device for generating songs, electronic equipment and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination