WO2022143522A1

WO2022143522A1 - Audio signal processing method and apparatus, and electronic device

Info

Publication number: WO2022143522A1
Application number: PCT/CN2021/141628
Authority: WO
Inventors: 倪忠
Original assignee: 维沃移动通信有限公司
Priority date: 2020-12-31
Filing date: 2021-12-27
Publication date: 2022-07-07
Also published as: CN112969130A

Abstract

The present application discloses an audio signal processing method and apparatus, and an electronic device. The audio signal processing method comprises: acquiring a low frequency signal in a first audio signal, the low frequency signal being an audio signal of which the frequency is within a preset frequency range, and the low frequency signal comprising M frequency points; determining M probability values respectively according to energy values of the M frequency points, each probability value being used for indicating the probability that there is a voice signal at the corresponding frequency point; on the basis of the M probability values, determining a target noise reduction parameter, the target noise reduction parameter being used for representing a noise reduction amount of an electronic device performing noise reduction processing on the audio signal; and on the basis of the target noise reduction parameter, performing noise reduction processing on the low frequency signal.

Description

Audio signal processing method, device and electronic device

This application claims the priority of the Chinese patent application with the application number 202011628024.3 and the application title "Audio Signal Processing Method, Apparatus and Electronic Equipment" filed with the State Intellectual Property Office on December 31, 2020, the entire contents of which are incorporated by reference in in this application.

technical field

The present application belongs to the field of communication technologies, and in particular relates to an audio signal processing method, apparatus and electronic device.

Background technique

At present, during a voice call between an electronic device and other electronic devices, if the noise in the audio signal collected by the microphone of the electronic device is strong (for example, the noise is non-stationary strong noise), the electronic device can first perform the audio signal on the audio signal. Noise reduction processing with a larger amount of noise reduction is performed to reduce noise in the audio signal, and then the processed audio signal is sent to other electronic devices, so that the call quality of the voice call can be improved.

However, since there may be a voice signal in the audio signal, and in the case of strong noise in the audio signal, the electronic device will still perform noise reduction processing with a large amount of noise reduction on the audio signal, so it may be In some cases, the electronic device performs noise reduction processing with a large noise reduction amount on the voice signal, which may result in a decrease in the clarity of the voice signal after the noise reduction processing.

In this way, the effect of noise reduction processing on the audio signal by the electronic device is poor.

SUMMARY OF THE INVENTION

The purpose of the embodiments of the present application is to provide an audio signal processing method, apparatus and electronic device, which can solve the problem that the effect of noise reduction processing on the audio signal by the electronic device is poor.

In order to solve the above technical problems, this application is implemented as follows:

In a first aspect, an embodiment of the present application provides an audio signal processing method, the method includes: acquiring a low-frequency signal in a first audio signal; the low-frequency signal is: an audio signal whose frequency is within a preset frequency range, the low-frequency signal M frequency points are included; M is a positive integer; M probability values are respectively determined according to the energy values of the M frequency points; each probability value is used to indicate the probability that the corresponding frequency point has a voice signal; , determine the target noise reduction parameter; the target noise reduction parameter is used to represent: the noise reduction amount that the electronic device performs noise reduction processing on the audio signal; based on the target noise reduction parameter, the low frequency signal is subjected to noise reduction processing.

In a second aspect, an embodiment of the present application provides an audio signal processing apparatus, where the audio signal processing apparatus includes: an acquisition module, a determination module, and a processing module. Wherein, the acquisition module is used for acquiring a low frequency signal in the first audio signal; the low frequency signal is an audio signal whose frequency is within a preset frequency range, and the low frequency signal includes M frequency points; M is a positive integer. The determination module is used to determine M probability values respectively according to the energy values of the M frequency points; each probability value is used to indicate the probability that the corresponding frequency point has a speech signal; and based on the M probability values, determine the target noise reduction parameter; the target noise reduction parameter is used to characterize: the amount of noise reduction performed by the audio signal processing apparatus for noise reduction processing on the audio signal. The processing module is configured to perform noise reduction processing on the low frequency signal based on the target noise reduction parameter determined by the determination module.

In a third aspect, embodiments of the present application provide an electronic device, the electronic device includes a processor, a memory, and a program or instruction stored on the memory and executable on the processor, the program or instruction being The processor implements the steps of the method according to the first aspect when executed.

In a fourth aspect, an embodiment of the present application provides a readable storage medium, where a program or an instruction is stored on the readable storage medium, and when the program or instruction is executed by a processor, the steps of the method according to the first aspect are implemented .

In a fifth aspect, an embodiment of the present application provides a chip, the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is configured to run a program or an instruction, and implement the first aspect the method described.

In this embodiment of the present application, the electronic device may acquire a low-frequency signal in the first audio signal (the low-frequency signal includes M frequency points), and according to the energy values of the M frequency points, respectively determine M probability values (each The probability values are respectively used to indicate the probability that the corresponding frequency point has a speech signal), so that the electronic device can determine the noise reduction parameters corresponding to the noise reduction processing of the low frequency signal by the electronic device based on the M probability values, and based on the noise reduction parameters , perform noise reduction processing on the low-frequency signal. Since the noise in the audio signal is strong, the electronic device can determine the probability value of the presence of a voice signal at each frequency point according to the energy value of each frequency point of the low-frequency signal, and based on the existence of the voice signal at each frequency point The probability value determines the noise reduction parameters corresponding to the noise reduction processing performed by the electronic device on the low-frequency signal, that is, the electronic device can determine different noise-reduction parameters according to whether there is a voice signal in the low-frequency signal, so as to perform different noise reduction on the low-frequency signal. Instead of directly using a larger noise reduction parameter to perform a larger amount of noise reduction processing on the low-frequency signal, it is possible to avoid the situation where the clarity of the speech signal after the noise reduction process is reduced. , which can improve the effect of noise reduction processing on audio signals by electronic devices.

Description of drawings

1 is one of schematic diagrams of an audio signal processing method provided by an embodiment of the present application;

2 is a second schematic diagram of an audio signal processing method provided by an embodiment of the present application;

3 is a third schematic diagram of an audio signal processing method provided by an embodiment of the present application;

4 is a fourth schematic diagram of an audio signal processing method provided by an embodiment of the present application;

5 is a schematic structural diagram of an audio signal processing apparatus provided by an embodiment of the present application;

6 is a schematic structural diagram of an electronic device provided by an embodiment of the present application;

FIG. 7 is a schematic diagram of hardware of an electronic device provided by an embodiment of the present application.

Detailed ways

The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative work fall within the protection scope of the present application.

The terms "first", "second" and the like in the description and claims of the present application are used to distinguish similar objects, and are not used to describe a specific order or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances so that the embodiments of the present application can be practiced in sequences other than those illustrated or described herein, and distinguish between "first", "second", etc. The objects are usually of one type, and the number of objects is not limited. For example, the first object may be one or more than one. In addition, "and/or" in the description and claims indicates at least one of the connected objects, and the character "/" generally indicates that the associated objects are in an "or" relationship.

The audio signal processing method provided by the embodiments of the present application will be described in detail below with reference to the accompanying drawings through specific embodiments and application scenarios thereof.

The audio signal processing method provided in the embodiment of the present application can be applied to a scenario where an electronic device conducts a voice call with other electronic devices.

Assuming that the user conducts a voice call with the electronic device 2 through the electronic device 1 , during the voice call, the microphone of the electronic device 1 can collect audio signals and send the audio signals to the electronic device 2 . If the audio signal includes a non-stationary strong noise signal, the electronic device 1 can perform noise reduction processing on the audio signal first, and then send the processed audio signal to the electronic device 2 . In the related art, the electronic device 1 can determine the noise reduction parameter 1 corresponding to the intensity of the noise signal according to the intensity of the noise signal, and based on the noise reduction parameter 1, through a signal processing (digital signal processing, DSP) noise reduction algorithm, the microphone The collected audio signal is subjected to noise reduction processing, so as to send the processed audio signal to the electronic device 2 . However, since there may be voice brainwashing in the audio signal, the electronic device 1 will still perform noise reduction processing on the audio signal collected by the microphone through the DSP noise reduction algorithm based on the noise reduction parameter 1, which may cause noise reduction. In some cases, the electronic device 1 performs noise reduction processing with a relatively large amount of noise reduction on the voice signal, and therefore, the intelligibility of the voice signal may be degraded.

However, in this embodiment of the present application, the electronic device 1 may first divide the audio signal into a low-frequency signal (that is, an audio signal whose frequency is within a preset frequency range) and a high-frequency signal (that is, an audio signal whose frequency is outside the preset frequency range) ), and then according to the energy values of multiple frequency points of the low-frequency signal, respectively determine multiple probability values, each probability value is used to indicate the probability of the existence of a voice signal at its corresponding frequency point; then based on the multiple probability values, determine The electronic device 1 performs noise reduction parameters 2 for noise reduction processing on the low-frequency signal, and, based on the noise reduction parameters 2, performs noise reduction processing on the low-frequency signals. Next, the electronic device 1 can perform noise reduction processing on the high frequency signal based on the noise reduction parameter 3 (for example, the default noise reduction parameter), so that the electronic device 1 can The audio signal is synthesized and the synthesized audio signal is output. It can be understood that the electronic device 1 can determine different noise reduction parameters according to whether there is a speech signal in the low frequency signal, so as to perform noise reduction processing with different noise reduction amounts on the low frequency signal, instead of directly using a larger noise reduction parameter for the noise reduction. The low-frequency signal is subjected to noise reduction processing with a larger amount of noise reduction, so that the reduction in the clarity of the speech signal can be avoided.

FIG. 1 shows a flowchart of an audio signal processing method provided by an embodiment of the present application. As shown in FIG. 1 , the audio signal processing method provided by the embodiment of the present application may include the following steps 101 to 104 .

Step 101: The audio signal processing apparatus acquires the low frequency signal in the first audio signal.

Optionally, in this embodiment of the present application, when the audio signal processing apparatus displays the interface of the target application, the user can input the identifier (for example, the avatar) of the target contact in the interface, so that the audio signal processing apparatus can The conversation page of the target contact is displayed, so that the user can input in the conversation page of the target contact, so that the audio signal processing apparatus and other audio signal processing apparatuses (that is, the audio signal processing apparatuses corresponding to the target contact) establish a voice connected, and the first audio signal is collected through the microphone of the audio signal processing device, and the first audio signal is detected. In the case of detecting that there is a strong noise signal (for example, a non-stationary strong noise signal) in the first audio signal, the audio signal processing apparatus may divide the first audio signal into a low frequency signal and a high frequency signal, and obtain the low frequency signal .

Optionally, in this embodiment of the present application, the above-mentioned target application may be a call type application.

Optionally, in this embodiment of the present application, when the audio signal processing apparatus detects that the intensity of the noise signal in the first audio signal is greater than or equal to a preset intensity, the audio signal processing apparatus may divide the first audio signal into low-frequency signal and high-frequency signal, and obtain the low-frequency signal.

In the embodiment of the present application, the above-mentioned low-frequency signal is an audio signal whose frequency is within a preset frequency range, the low-frequency signal includes M frequency points, and M is a positive integer.

It should be noted that "low frequency signal" and "high frequency signal" are two relative concepts, and the frequency of the low frequency signal is lower than the frequency of the high frequency signal. It can be understood that the first audio signal includes at least two sub-band signals (the sub-band signal refers to a signal corresponding to a certain frequency interval), the frequency ranges corresponding to the at least two sub-band signals do not overlap, and the low-frequency signal may be the at least two sub-band signals. One sub-band signal in the two sub-band signals, and the high-frequency signal may be the other sub-band signal in the at least two sub-band signals.

Optionally, in this embodiment of the present application, when the audio signal processing apparatus detects that there is a strong noise signal in the first audio signal, the audio signal processing apparatus may use a preset algorithm to echo the first audio signal. processing to cancel the echo signal in the first audio signal.

Optionally, in this embodiment of the present application, the foregoing preset algorithm may specifically be: an adaptive echo cancellation algorithm.

It should be noted that, for the description of the adaptive echo cancellation algorithm, reference may be made to the specific description in the related art, which is not repeated in this embodiment of the present application.

Further optionally, in the embodiment of the present application, the audio signal processing apparatus may detect the echo-processed first audio signal, and determine an audio signal whose frequency is within a preset frequency range to obtain a low-frequency signal.

Step 102: The audio signal processing apparatus determines M probability values respectively according to the energy values of the M frequency points.

Optionally, in the embodiment of the present application, the audio signal processing apparatus may detect the low-frequency signal to obtain the energy value of each frequency point of the low-frequency signal, so as to obtain the energy value of M frequency points.

In the embodiment of the present application, each of the above-mentioned M probability values is respectively used to indicate the probability that a speech signal exists at its corresponding frequency point.

It can be understood that the higher the probability value corresponding to a frequency point (for example, the closer to 1), the higher the probability that there is a speech signal in the frequency point, that is, the more similar the frequency point is to the speech signal; a frequency point The lower the corresponding probability value (for example, the closer to 0), the lower the probability that the speech signal exists in the one frequency point can be considered that the one frequency point is more like a noise signal.

Optionally, in this embodiment of the present application, for each probability value in the M probability values, the audio signal processing apparatus may input the energy value of a frequency point into the input variable of the target neural network to obtain a probability value ( mask) to obtain M probability values (ie, M masks).

Further optionally, in the embodiment of the present application, the above-mentioned target neural network may specifically be: a deep learning neural network. The target neural network can be a deep neural network obtained by training the deep neural network to be trained by using a plurality of simulated audio signals as a training set.

Step 103: The audio signal processing apparatus determines the target noise reduction parameter based on the M probability values.

In the embodiment of the present application, the above target noise reduction parameter is used to represent: the noise reduction amount by which the audio signal processing apparatus performs noise reduction processing on the audio signal.

Optionally, in this embodiment of the present application, the above noise reduction processing may include at least one of the following: DSP noise reduction algorithm processing, and residual echo suppression processing.

Optionally, in this embodiment of the present application, the audio signal processing apparatus may determine a probability value from the M probability values, and determine the target noise reduction parameter according to the one probability value; or, the audio signal processing apparatus may determine The average probability value of the M probability values (for example, the target probability value in the following embodiments) is determined as the target noise reduction parameter according to the average probability value.

Further optionally, in this embodiment of the present application, the audio signal processing apparatus may, according to the strength of the noise signal in the first audio signal, determine a default drop value corresponding to the strength of the noise signal from at least one first correspondence. noise parameter, and then according to whether a determined probability value (or average probability value) satisfies a preset condition, based on the default noise reduction parameter, a noise reduction parameter is obtained, and the noise reduction parameter is determined as the target noise reduction parameter. The at least one first correspondence is respectively: a correspondence between at least one intensity and at least one default noise reduction parameter.

Exemplarily, the above-mentioned preset condition may specifically be: a probability value (or an average probability value) determined by the audio signal processing apparatus is greater than or equal to a preset threshold value.

In the case that a determined probability value (or an average probability value) satisfies a preset condition, the audio signal processing apparatus may use a first algorithm to obtain a noise reduction parameter based on a determined default noise reduction parameter, and use the determined noise reduction parameter. A noise reduction parameter, determined as the target noise reduction parameter.

In the case that the determined one probability value (or the average probability value) does not satisfy the preset condition, the audio signal processing apparatus may use the second algorithm to obtain another noise reduction parameter based on the determined one default noise reduction parameter, and The other noise reduction parameter is determined as the target noise reduction parameter.

It should be noted that, the description of the first algorithm and the second algorithm will be specifically described in the following embodiments of the present application, and the embodiments of the present application will not be repeated here.

Step 104: The audio signal processing apparatus performs noise reduction processing on the low frequency signal based on the target noise reduction parameter.

Optionally, in the embodiment of the present application, the audio signal processing apparatus may use a DSP noise reduction algorithm (or an echo suppression algorithm) to perform noise reduction processing on the low frequency signal based on the target noise reduction parameter.

It can be understood that the audio signal processing device can use the target noise reduction parameter as the noise suppression amount control factor, and use it in the DSP noise reduction algorithm (or echo suppression algorithm) to suppress the noise signal in the low frequency signal, so as to reduce the low frequency signal. deal with.

In the embodiment of the present application, in the case where the noise signal in the audio signal is strong, if there is a voice signal in the audio signal, the audio signal processing apparatus will perform noise reduction processing with a large amount of noise reduction on the audio signal, thereby reducing the noise. As a result, the voiced speech in the speech signal is damaged, and the intelligibility of the speech signal is decreased. Therefore, the audio signal processing apparatus can acquire the audio signal (ie, the low-frequency signal) corresponding to the voiced speech in the audio signal (ie, the first audio signal), and determine that the low-frequency signal exists in the low-frequency signal according to the energy values of multiple frequency points of the low-frequency signal. The probability value of the speech signal, and based on the probability value, determine the noise reduction parameter corresponding to the noise reduction process performed by the audio signal processing device on the low frequency signal, so as to avoid damage to the voiced speech in the noise reduction processed speech signal.

In the audio signal processing method provided by the embodiment of the present application, the audio signal processing apparatus may acquire the low-frequency signal in the first audio signal (the low-frequency signal includes M frequency points), and determine M respectively according to the energy values of the M frequency points. probability values (each probability value is used to indicate the probability of the existence of speech signals at its corresponding frequency point), so that the audio signal processing device can determine the corresponding frequency of noise reduction processing performed by the audio signal processing device on the low-frequency signal based on the M probability values. Noise reduction parameters, and based on the noise reduction parameters, noise reduction processing is performed on the low-frequency signal. Because when the noise signal in the audio signal is strong, the audio signal processing apparatus can determine the probability value of the existence of the speech signal at each frequency point according to the energy value of each frequency point of the low-frequency signal, and based on the existence of each frequency point The probability value of the voice signal determines the noise reduction parameters corresponding to the noise reduction processing performed by the audio signal processing device on the low-frequency signal, that is, the audio signal processing device can determine different noise reduction parameters according to whether there is a voice signal in the low-frequency signal. The low-frequency signal is subjected to noise reduction processing with different noise reduction amounts, instead of directly using a larger noise reduction parameter to perform noise reduction processing with a larger noise reduction amount on the low-frequency signal. Therefore, noise reduction of the voice signal after noise reduction processing can be avoided. In the case where the definition is lowered, the effect of noise reduction processing performed on the audio signal by the audio signal processing apparatus can be improved.

Optionally, in the embodiment of the present application, with reference to FIG. 1 , as shown in FIG. 2 , after the above step 104 , the audio signal processing method provided by the embodiment of the present application may further include the following step 201 .

Step 201: The audio signal processing apparatus performs synthesis processing on the low-frequency signal after noise reduction processing and the first signal to obtain a target audio signal, and outputs the target audio signal.

In the embodiment of the present application, the above-mentioned first signal is obtained based on a high-frequency signal in the first audio signal, and the high-frequency signal is an audio signal whose frequency is outside the preset frequency range.

Further optionally, in this embodiment of the present application, the above-mentioned first signal may specifically be obtained by performing noise reduction processing on the high-frequency signal by an audio signal processing apparatus.

It can be seen from this that, because the audio signal processing apparatus can divide the first audio signal into a low-frequency signal and a high-frequency signal, and respectively perform noise reduction processing with different noise reduction amounts on the low-frequency signal and the high-frequency signal, and, after the noise reduction processing The low-frequency signal and the high-frequency signal after the noise reduction process are synthesized and processed. Therefore, the voiced voice can be damaged in the voice signal after the noise reduction process, so as to avoid the reduction of the clarity of the voice signal after the noise reduction process. , in this way, the effect of noise reduction processing performed on the audio signal by the audio signal processing apparatus can be improved.

Optionally, in the embodiment of the present application, with reference to FIG. 2 , as shown in FIG. 3 , before the above step 201 , the audio signal processing method provided by the embodiment of the present application may further include the following step 301 .

Step 301: The audio signal processing apparatus acquires the high frequency signal, and based on the third noise reduction parameter, performs noise reduction processing on the high frequency signal to obtain the first signal.

Further optionally, in this embodiment of the present application, when the audio signal processing apparatus detects that the intensity of the noise signal in the first audio signal is greater than or equal to the preset intensity, the audio signal processing apparatus may classify the first audio signal into are the low-frequency signal and the high-frequency signal, and obtain the high-frequency signal.

Further optionally, in the embodiment of the present application, the audio signal processing apparatus may detect the echo-processed first audio signal, and determine the audio signal whose frequency is outside the preset frequency range, so as to obtain the high frequency signal.

Further optionally, in this embodiment of the present application, the third noise reduction parameter may specifically be: a default noise reduction parameter.

Further optionally, in this embodiment of the present application, the audio signal processing apparatus may, according to the strength of the noise signal in the first audio signal, determine a default drop value corresponding to the strength of the noise signal from at least one first correspondence. noise parameter, and the one default noise reduction parameter is determined as the third noise reduction parameter, so that the audio signal processing apparatus can perform noise reduction processing on the high frequency signal based on the third noise reduction parameter.

Further optionally, in the embodiment of the present application, the audio signal processing apparatus may perform residual echo suppression processing on the high-frequency signal based on the third noise reduction parameter to obtain the first signal.

It should be noted that, the execution order of step 301 and step 102 is not limited in this embodiment of the present application.

In a possible implementation manner, the audio signal processing apparatus may perform step 301 first, and then perform step 102, that is, the audio signal processing apparatus may first acquire the high-frequency signal, and based on the third noise reduction parameter, perform a A noise reduction process is performed to obtain a first signal, and then M probability values are respectively determined according to the energy values of the M frequency points.

In another possible implementation manner, the audio signal processing apparatus may first perform step 102, and then perform step 301, that is, the audio signal processing apparatus may first determine M probability values respectively according to the energy values of the M frequency points, Then, the high frequency signal is acquired, and based on the third noise reduction parameter, the high frequency signal is subjected to noise reduction processing to obtain the first signal.

In another possible implementation manner, the audio signal processing apparatus may perform step 301 and step 102 at the same time, that is, the audio signal processing apparatus may acquire the high-frequency signal, and reduce the high-frequency signal based on the third noise reduction parameter. Noise processing is performed, and at the same time as the first signal is obtained, M probability values are respectively determined according to the energy values of the M frequency points.

In the embodiment of the present application, the voice in the high-frequency signal is unvoiced voice, and the unvoiced voice is damaged, which will not cause a decrease in the clarity of the voice signal. Therefore, the audio signal processing apparatus can obtain the audio signal (that is, the first audio signal ) corresponding to the audio signal (that is, the high-frequency signal) of the unvoiced speech, and the high-frequency signal is subjected to noise reduction processing based on the default noise reduction parameters.

It can be seen from this that when the noise signal in the audio signal is strong, the audio processing device can use a larger noise reduction parameter to perform noise reduction processing with a larger amount of noise reduction on the audio signal (ie high-frequency signal) corresponding to the unvoiced speech. Therefore, , it is possible to avoid the reduction of the clarity of the noise-reduced speech signal, and to reduce the noise signal in the noise-reduced audio signal, so that the effect of the audio signal processing apparatus for noise reduction processing on the audio signal can be improved.

Optionally, in this embodiment of the present application, with reference to FIG. 1 , as shown in FIG. 4 , the foregoing step 103 may be specifically implemented by the following

steps

103 a and 103 b.

Step 103a, the audio signal processing apparatus determines the target probability value according to the M probability values.

In the embodiment of the present application, the above target probability value is: the average value of M probability values.

Further optionally, in this embodiment of the present application, the audio signal processing apparatus may determine the target probability value by using an average value algorithm according to the M probability values.

It should be noted that, for the description of the average value algorithm, reference may be made to the specific description in the related art, which is not repeated in this embodiment of the present application.

Step 103b, the audio signal processing apparatus determines the target noise reduction parameter based on the target probability value.

Further optionally, in this embodiment of the present application, the audio signal processing apparatus may determine the target noise reduction parameter according to whether the target probability value satisfies a preset condition.

It can be understood that the audio signal processing apparatus can determine whether there is a speech signal in the low-frequency signal according to the target probability value, so as to determine different noise reduction parameters to perform noise reduction processing with different noise reduction amounts on the low-frequency signal.

It can be seen from this that the audio signal processing device can determine the target noise reduction parameter according to the average value of M probability values, determine whether there is a speech signal in the low frequency signal, determine different noise reduction parameters, and perform different noise reduction amounts on the low frequency signal. Noise reduction processing, therefore, it is possible to avoid the situation that the clarity of the voice signal after the noise reduction processing is lowered, and thus, the effect of the noise reduction processing performed on the audio signal by the audio signal processing apparatus can be improved.

Optionally, in this embodiment of the present application, the foregoing step 103b may be specifically implemented by the following step 103b1 or step 103b2.

Step 103b1 , when the target probability value is greater than or equal to a preset threshold, the audio signal processing apparatus determines the first noise reduction parameter as the target noise reduction parameter.

Further optionally, in this embodiment of the present application, the audio signal processing apparatus may adopt a first algorithm, obtain a first noise reduction parameter according to a determined default noise reduction parameter, and determine the first noise reduction parameter as the target noise reduction parameter. noise parameter.

Specifically, the above-mentioned first algorithm may specifically be C ₁ =A+B ₁ . Wherein, C1 is the first noise reduction parameter, A is a default noise reduction parameter determined by the audio signal processing apparatus, and B1 is the first preset noise reduction parameter.

It can be understood that if the target probability value is greater than or equal to the preset threshold (that is, the target probability value satisfies the preset condition), it can be considered that there is a voice signal in the low-frequency signal. Therefore, the audio signal processing device can lower a determined default noise reduction. parameter to avoid damage to the voiced speech in the first audio signal, thereby avoiding the reduction of the clarity of the speech signal after noise reduction.

It can be seen from this that, when it is determined that there is a speech signal in the low-frequency signal, the audio signal processing apparatus can lower a determined default noise reduction parameter, so as to use a smaller noise reduction parameter to perform a smaller noise reduction on the low-frequency signal. Noise reduction processing, therefore, it is possible to avoid the reduction of the clarity of the speech signal after the noise reduction processing, and reduce the noise signal in the audio signal after the noise reduction processing, so that the audio signal processing device can improve the noise reduction processing of the audio signal. Effect.

Step 103b2: In the case that the target probability value is smaller than the preset threshold, the audio signal processing apparatus determines the second noise reduction parameter as the target noise reduction parameter.

Further optionally, in this embodiment of the present application, the audio signal processing apparatus may use a second algorithm to obtain a second noise reduction parameter according to a determined default noise reduction parameter, and determine the second noise reduction parameter as the target noise reduction parameter. noise parameter.

Specifically, the above-mentioned second algorithm may specifically be C ₂ =A+B ₂ . Wherein, C2 is the second noise reduction parameter, A is a default noise reduction parameter determined by the audio signal processing apparatus, and B2 is the second preset noise reduction parameter.

It can be understood that if the target probability value is less than the preset threshold (that is, the target probability value does not meet the preset conditions), it can be considered that there is no speech signal in the low-frequency signal (that is, there is only noise signal in the low-frequency signal), therefore, the audio signal processing The device may increase a determined default noise reduction parameter to reduce the noise signal in the noise reduction processed audio signal.

It can be seen from this that when it is determined that there is no speech signal in the low-frequency signal, the audio signal processing apparatus can increase a determined default noise reduction parameter, so as to use a larger noise reduction parameter to perform a larger amount of noise reduction on the low-frequency signal. Therefore, the noise signal in the audio signal after the noise reduction process can be reduced, and thus, the effect of the audio signal processing apparatus on the noise reduction process can be improved.

It should be noted that, for the audio signal processing method provided by the embodiments of the present application, the execution body may be the audio signal processing apparatus in the above embodiment, or a control module in the audio signal processing apparatus for executing the audio signal processing method. In the embodiments of the present application, an audio signal processing method performed by an audio signal processing apparatus is used as an example to describe the apparatus for the audio signal processing method provided by the embodiments of the present application.

FIG. 5 shows a possible schematic structural diagram of the audio signal processing apparatus involved in the embodiment of the present application. As shown in FIG. 5 , the audio signal processing apparatus 60 may include: an acquisition module 61 , a determination module 62 and a processing module 63 .

The acquisition module 61 is used to acquire a low frequency signal in the first audio signal; the low frequency signal is an audio signal whose frequency is within a preset frequency range, and the low frequency signal includes M frequency points; M is a positive integer. The determination module 62 is used to determine M probability values respectively according to the energy values of the M frequency points; each probability value is used to indicate the probability that the corresponding frequency point has a voice signal; and based on the M probability values, determine the target drop. Noise parameter; the target noise reduction parameter is used to represent: the amount of noise reduction performed by the audio signal processing apparatus for noise reduction processing on the audio signal. The processing module 63 is configured to perform noise reduction processing on the low frequency signal based on the target noise reduction parameter determined by the determination module 62 .

In a possible implementation manner, the above determination module 62 is specifically configured to determine a target probability value according to M probability values; the target probability value is: the average value of the M probability values; and based on the target probability value, determine the target Noise reduction parameters.

In a possible implementation manner, the above determination module 62 is configured to determine the first noise reduction parameter as the target noise reduction parameter when the target probability value is greater than or equal to the preset threshold; or, when the target probability value is less than In the case of a preset threshold, the second noise reduction parameter is determined as the target noise reduction parameter.

In a possible implementation manner, the above-mentioned processing module 63 is further configured to perform synthesis processing on the low-frequency signal after noise reduction processing and the first signal to obtain a target audio signal, and output the target audio signal. The above-mentioned first signal is obtained based on a high-frequency signal in the first audio signal, and the high-frequency signal is an audio signal whose frequency is outside the preset frequency range.

In a possible implementation manner, the obtaining module 61 is further configured to obtain a high-frequency signal, and perform noise reduction processing on the high-frequency signal based on the third noise reduction parameter to obtain the first signal.

In the audio signal processing apparatus provided by the embodiment of the present application, when the noise signal in the audio signal is strong, the audio signal processing apparatus can respectively determine that there is a voice signal at each frequency point according to the energy value of each frequency point of the low frequency signal. The probability value of , and based on the probability value of the existence of speech signals at each frequency point, determine the noise reduction parameters corresponding to the noise reduction processing performed by the audio signal processing device on the low frequency signal, that is, the audio signal processing device can be based on whether there is speech in the low frequency signal. signal, and determine different noise reduction parameters to perform noise reduction processing with different noise reduction amounts on the low-frequency signal, instead of directly using larger noise reduction parameters to perform noise reduction processing on the low-frequency signal with a larger amount of noise reduction. Therefore, , it is possible to avoid the situation that the clarity of the speech signal after the noise reduction processing is lowered, and thus, the effect of the noise reduction processing performed on the audio signal by the audio signal processing apparatus can be improved.

The audio signal processing apparatus in this embodiment of the present application may be an apparatus, or may be a component, an integrated circuit, or a chip in a terminal. The apparatus may be a mobile electronic device or a non-mobile electronic device. Illustratively, the mobile electronic device may be a mobile phone, a tablet computer, a notebook computer, a palmtop computer, an in-vehicle electronic device, a wearable device, an ultra-mobile personal computer (UMPC), a netbook, or a personal digital assistant (personal digital assistant). assistant, PDA), etc., non-mobile electronic devices can be servers, network attached storage (Network Attached Storage, NAS), personal computer (personal computer, PC), television (television, TV), teller machine or self-service machine, etc., this application Examples are not specifically limited.

The audio signal processing apparatus in the embodiment of the present application may be an apparatus having an operating system. The operating system may be an Android (Android) operating system, an iOS operating system, or other possible operating systems, which are not specifically limited in the embodiments of the present application.

The audio signal processing apparatus provided in this embodiment of the present application can implement each process implemented by the method embodiments in FIG. 1 to FIG. 4 , and to avoid repetition, details are not repeated here.

Optionally, as shown in FIG. 6 , an embodiment of the present application further provides an electronic device 70, including a processor 72, a memory 71, and a program or instruction stored on the memory 71 and executable on the processor 72, When the program or instruction is executed by the processor 72, each process of the above-mentioned audio signal processing method embodiments can be realized, and the same technical effect can be achieved. To avoid repetition, details are not repeated here.

It should be noted that the electronic devices in the embodiments of the present application include the aforementioned mobile electronic devices and non-mobile electronic devices.

FIG. 7 is a schematic diagram of a hardware structure of an electronic device implementing an embodiment of the present application.

The electronic device 100 includes but is not limited to: a radio frequency unit 101, a network module 102, an audio output unit 103, an input unit 104, a sensor 105, a display unit 106, a user input unit 107, an interface unit 108, a memory 109, and a processor 110, etc. part.

Those skilled in the art can understand that the electronic device 100 may also include a power source (such as a battery) for supplying power to various components, and the power source may be logically connected to the processor 110 through a power management system, so as to manage charging, discharging, and power management through the power management system. consumption management and other functions. The structure of the electronic device shown in FIG. 7 does not constitute a limitation on the electronic device. The electronic device may include more or less components than the one shown, or combine some components, or arrange different components, which will not be repeated here. .

Wherein, the processor 110 is configured to acquire a low-frequency signal in the first audio signal; the low-frequency signal is: an audio signal with a frequency within a preset frequency range, the low-frequency signal includes M frequency points; M is a positive integer; For the energy values of the M frequency points, M probability values are respectively determined; each probability value is used to indicate the probability that the corresponding frequency point has a speech signal; based on the M probability values, the target noise reduction parameter is determined; the target noise reduction parameter It is used to characterize: the noise reduction amount that the electronic device performs noise reduction processing on the audio signal; and, based on the target noise reduction parameter, performs noise reduction processing on the low frequency signal.

In the electronic device provided by the embodiment of the present application, when the noise in the audio signal is strong, the electronic device can determine the probability value of the existence of the speech signal at each frequency point according to the energy value of each frequency point of the low-frequency signal, and Determine the noise reduction parameters corresponding to the noise reduction processing performed by the electronic device on the low-frequency signal based on the probability value of the presence of a voice signal at each frequency point, that is, the electronic device can determine different noise reduction parameters according to whether there is a voice signal in the low-frequency signal, In order to perform noise reduction processing with different noise reduction amounts on the low-frequency signal, instead of directly using a larger noise reduction parameter to perform noise reduction processing with a larger noise reduction amount on the low-frequency signal, it is possible to avoid noise reduction processing. In the case where the clarity of the signal is reduced, the effect of noise reduction processing on the audio signal by the electronic device can be improved.

Optionally, in this embodiment of the present application, the processor 110 is specifically configured to determine a target probability value according to M probability values; the target probability value is: the average value of the M probability values; and based on the target probability value, determine the target Noise reduction parameters.

It can be seen from this that the electronic device can determine the target noise reduction parameter according to the average value of M probability values, determine whether there is a speech signal in the low frequency signal, determine different noise reduction parameters, and perform noise reduction with different noise reduction amounts on the low frequency signal. Therefore, it is possible to avoid the situation that the clarity of the voice signal after the noise reduction processing is lowered, and thus, the effect of the noise reduction processing on the audio signal by the electronic device can be improved.

Optionally, in this embodiment of the present application, the processor 110 is specifically configured to determine the first noise reduction parameter as the target noise reduction parameter when the target probability value is greater than or equal to the preset threshold; When the value is less than the preset threshold, the second noise reduction parameter is determined as the target noise reduction parameter.

It can be seen from this that when it is determined that there is a speech signal in the low-frequency signal, the electronic device can lower a determined default noise reduction parameter, so as to use a smaller noise reduction parameter to perform noise reduction with a smaller amount of noise reduction on the low-frequency signal. Therefore, the reduction of the clarity of the noise-reduced speech signal can be avoided, and the noise signal in the noise-reduced audio signal can be reduced. In this way, the effect of noise reduction processing on the audio signal by the electronic device can be improved.

It can be seen from this that, when it is determined that there is no speech signal in the low-frequency signal, the electronic device can increase a determined default noise reduction parameter, so as to use a larger noise reduction parameter to reduce the low frequency signal by a larger amount of noise reduction. Therefore, the noise signal in the audio signal after the noise reduction processing can be reduced, and thus, the effect of the noise reduction processing on the audio signal by the electronic device can be improved.

Optionally, in this embodiment of the present application, the processor 110 is further configured to perform synthesis processing on the low-frequency signal after noise reduction processing and the first signal to obtain a target audio signal, and output the target audio signal.

The above-mentioned first signal is obtained based on a high-frequency signal in the first audio signal, and the high-frequency signal is an audio signal whose frequency is outside the preset frequency range.

It can be seen from this that since the electronic device can divide the first audio signal into a low-frequency signal and a high-frequency signal, and perform noise reduction processing with different noise reduction amounts on the low-frequency signal and the high-frequency signal respectively, and The signal and the high-frequency signal after noise reduction processing are synthesized and processed. Therefore, the voiced voice can be damaged in the voice signal after noise reduction processing, so as to avoid the situation that the clarity of the voice signal after noise reduction processing is reduced. , which can improve the effect of noise reduction processing on audio signals by electronic devices.

Optionally, in this embodiment of the present application, the processor 110 is further configured to acquire a high-frequency signal, and perform noise reduction processing on the high-frequency signal based on the third noise reduction parameter to obtain the first signal.

It can be seen from this that when the noise signal in the audio signal is strong, the audio processing device can use a larger noise reduction parameter to perform noise reduction processing with a larger amount of noise reduction on the audio signal (ie high-frequency signal) corresponding to the unvoiced speech. Therefore, , which can avoid the reduction of the clarity of the noise-reduced speech signal, and reduce the noise signal in the noise-reduced audio signal, so that the effect of noise reduction processing on the audio signal by the electronic device can be improved.

It should be understood that, in this embodiment of the present application, the input unit 104 may include a graphics processing unit (graphics processing unit, GPU) 1041 and a microphone 1042. Such as camera) to obtain still pictures or video image data for processing. The display unit 106 may include a display panel 1061, which may be configured in the form of a liquid crystal display, an organic light emitting diode, or the like. The user input unit 107 includes a touch panel 1071 and other input devices 1072 . The touch panel 1071 is also called a touch screen. The touch panel 1071 may include two parts, a touch detection device and a touch controller. Other input devices 1072 may include, but are not limited to, physical keyboards, function keys (such as volume control keys, switch keys, etc.), trackballs, mice, and joysticks, which are not described herein again. Memory 109 may be used to store software programs as well as various data including, but not limited to, application programs and operating systems. The processor 110 may integrate an application processor and a modem processor, wherein the application processor mainly processes an operating system, a user interface, and an application program, and the like, and the modem processor mainly processes wireless communication. It can be understood that, the above-mentioned modulation and demodulation processor may not be integrated into the processor 110 .

The embodiments of the present application further provide a readable storage medium, where a program or an instruction is stored on the readable storage medium, and when the program or instruction is executed by a processor, each process of the above-mentioned audio signal processing method embodiment can be achieved, and can achieve The same technical effect, in order to avoid repetition, will not be repeated here.

Wherein, the processor is the processor in the electronic device described in the foregoing embodiments. The readable storage medium includes a computer-readable storage medium, such as computer read-only memory (read-only memory, ROM), random access memory (random access memory, RAM), magnetic disk or optical disk, etc.

An embodiment of the present application further provides a chip, where the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is configured to run a program or an instruction to implement the above-mentioned embodiment of the audio signal processing method and can achieve the same technical effect, in order to avoid repetition, it will not be repeated here.

It should be understood that the chip mentioned in the embodiments of the present application may also be referred to as a system-on-chip, a system-on-chip, a system-on-a-chip, or a system-on-a-chip, or the like.

It should be noted that, herein, the terms "comprising", "comprising" or any other variation thereof are intended to encompass non-exclusive inclusion, such that a process, method, article or device comprising a series of elements includes not only those elements, It also includes other elements not expressly listed or inherent to such a process, method, article or apparatus. Without further limitation, an element qualified by the phrase "comprising a..." does not preclude the presence of additional identical elements in a process, method, article or apparatus that includes the element. In addition, it should be noted that the scope of the methods and apparatus in the embodiments of the present application is not limited to performing the functions in the order shown or discussed, but may also include performing the functions in a substantially simultaneous manner or in the reverse order depending on the functions involved. To perform functions, for example, the described methods may be performed in an order different from that described, and various steps may also be added, omitted, or combined. Additionally, features described with reference to some examples may be combined in other examples.

From the description of the above embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by means of software plus a necessary general hardware platform, and of course hardware can also be used, but in many cases the former is better implementation. Based on this understanding, the technical solution of the present application can be embodied in the form of a software product in essence or in a part that contributes to the prior art, and the computer software product is stored in a storage medium (such as ROM/RAM, magnetic disk, CD-ROM), including several instructions to make a terminal (which may be a mobile phone, a computer, a server, or a network device, etc.) execute the methods described in the various embodiments of this application.

The embodiments of the present application have been described above in conjunction with the accompanying drawings, but the present application is not limited to the above-mentioned specific embodiments, which are merely illustrative rather than restrictive. Under the inspiration of this application, without departing from the scope of protection of the purpose of this application and the claims, many forms can be made, which all fall within the protection of this application.

Claims

An audio signal processing method, wherein the method comprises:

Acquire a low-frequency signal in the first audio signal; the low-frequency signal is an audio signal whose frequency is within a preset frequency range, the low-frequency signal includes M frequency points; M is a positive integer;

According to the energy values of the M frequency points, M probability values are respectively determined; each probability value is respectively used to indicate the probability that a speech signal exists at its corresponding frequency point;

Based on the M probability values, a target noise reduction parameter is determined; the target noise reduction parameter is used to represent: the noise reduction amount that the electronic device performs noise reduction processing on the audio signal;

Based on the target noise reduction parameter, noise reduction processing is performed on the low frequency signal.
The method according to claim 1, wherein the determining the target noise reduction parameter based on the M probability values comprises:

According to the M probability values, a target probability value is determined; the target probability value is: the average value of the M probability values;

Based on the target probability value, the target noise reduction parameter is determined.
The method according to claim 2, wherein the determining the target noise reduction parameter based on the target probability value comprises:

In the case that the target probability value is greater than or equal to a preset threshold, determine the first noise reduction parameter as the target noise reduction parameter; or,

In the case that the target probability value is smaller than the preset threshold, the second noise reduction parameter is determined as the target noise reduction parameter.
The method according to claim 1, wherein after performing noise reduction processing on the low frequency signal based on the target noise reduction parameter, the method further comprises:

Synthesize the low-frequency signal after the noise reduction process and the first signal to obtain a target audio signal, and output the target audio signal;

The first signal is obtained based on a high-frequency signal in the first audio signal, and the high-frequency signal is an audio signal whose frequency is outside the preset frequency range.
The method according to claim 4, wherein, before the synthesizing the low-frequency signal after the noise reduction processing and the first signal, the method further comprises:

The high frequency signal is acquired, and based on the third noise reduction parameter, noise reduction processing is performed on the high frequency signal to obtain the first signal.
An audio signal processing device, wherein the audio signal processing device comprises: an acquisition module, a determination module and a processing module;

The acquisition module is configured to acquire a low-frequency signal in the first audio signal; the low-frequency signal is an audio signal whose frequency is within a preset frequency range, and the low-frequency signal includes M frequency points; M is a positive integer;

The determining module is configured to determine M probability values respectively according to the energy values of the M frequency points; each probability value is respectively used to indicate the probability that the corresponding frequency point has a speech signal; and based on the M probability values value, to determine the target noise reduction parameter; the target noise reduction parameter is used to represent: the noise reduction amount of the audio signal processing device performing noise reduction processing on the audio signal;

The processing module is configured to perform noise reduction processing on the low-frequency signal based on the target noise reduction parameter determined by the determination module.
The audio signal processing apparatus according to claim 6, wherein the determining module is specifically configured to determine a target probability value according to the M probability values; the target probability value is: an average of the M probability values and determining the target noise reduction parameter based on the target probability value.
The audio signal processing apparatus according to claim 7, wherein the determining module is configured to determine the first noise reduction parameter as the target noise reduction when the target probability value is greater than or equal to a preset threshold parameter; or, when the target probability value is smaller than a preset threshold, determine a second noise reduction parameter as the target noise reduction parameter.
The audio signal processing apparatus according to claim 6, wherein the processing module is further configured to perform synthesis processing on the low-frequency signal after noise reduction processing and the first signal to obtain a target audio signal, and output the target audio signal ;

The first signal is obtained based on a high-frequency signal in the first audio signal, and the high-frequency signal is an audio signal whose frequency is outside the preset frequency range.
The audio signal processing apparatus according to claim 9, wherein the acquisition module is further configured to acquire the high-frequency signal, and perform noise reduction processing on the high-frequency signal based on a third noise reduction parameter, to obtain the the first signal.
An electronic device, comprising a processor, a memory, and a program or instruction stored on the memory and executable on the processor, the program or instruction being executed by the processor to achieve as claimed in claim 1 Steps of the audio signal processing method described in any one of to 5.
A readable storage medium, wherein a program or an instruction is stored on the readable storage medium, and when the program or instruction is executed by a processor, the audio signal processing method according to any one of claims 1 to 5 is implemented. step.
A chip, wherein the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is used for running a program or an instruction to implement the method described in any one of claims 1 to 5 The steps of an audio signal processing method.
A computer program product, wherein the computer program product is stored in a non-volatile storage medium, the computer program product being executed by at least one processor to implement the method as claimed in any one of claims 1 to 5 The steps of an audio signal processing method.
An electronic device comprising the steps of the electronic device being configured to perform the audio signal processing method as claimed in any one of claims 1 to 5.