WO2022143522A1 - Audio signal processing method and apparatus, and electronic device

Info

Publication number
WO2022143522A1
WO2022143522A1 PCT/CN2021/141628 CN2021141628W WO2022143522A1 WO 2022143522 A1 WO2022143522 A1 WO 2022143522A1 CN 2021141628 W CN2021141628 W CN 2021141628W WO 2022143522 A1 WO2022143522 A1 WO 2022143522A1
Authority
WO
WIPO (PCT)
Prior art keywords
noise reduction
audio signal
signal
frequency
target
Prior art date
Application number
PCT/CN2021/141628
Other languages
French (fr)
Chinese (zh)
Inventor
倪忠
Original Assignee
维沃移动通信有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 维沃移动通信有限公司
Publication of WO2022143522A1

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04R: LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00: Circuits for transducers, loudspeakers or microphones
    • H04R3/04: Circuits for transducers, loudspeakers or microphones for correcting frequency response

Definitions

  • the above noise reduction processing may include at least one of the following: DSP noise reduction algorithm processing, and residual echo suppression processing.
  • the audio signal processing apparatus may use the second algorithm to obtain another noise reduction parameter based on the determined default noise reduction parameter, and determine that other noise reduction parameter as the target noise reduction parameter.
  • the audio signal processing apparatus can divide the first audio signal into a low-frequency signal and a high-frequency signal, perform noise reduction processing with different noise reduction amounts on the low-frequency signal and the high-frequency signal, and then synthesize the noise-reduced low-frequency signal and the noise-reduced high-frequency signal. Damage to the voiced speech in the noise-reduced voice signal can therefore be avoided, so that the clarity of the voice signal is not reduced; in this way, the effect of the noise reduction processing performed on the audio signal by the audio signal processing apparatus can be improved.

Abstract

The present application discloses an audio signal processing method and apparatus, and an electronic device. The audio signal processing method comprises: acquiring a low frequency signal in a first audio signal, the low frequency signal being an audio signal of which the frequency is within a preset frequency range, and the low frequency signal comprising M frequency points; determining M probability values respectively according to energy values of the M frequency points, each probability value being used for indicating the probability that there is a voice signal at the corresponding frequency point; on the basis of the M probability values, determining a target noise reduction parameter, the target noise reduction parameter being used for representing a noise reduction amount of an electronic device performing noise reduction processing on the audio signal; and on the basis of the target noise reduction parameter, performing noise reduction processing on the low frequency signal.

Description

Audio signal processing method, apparatus, and electronic device
This application claims priority to Chinese patent application No. 202011628024.3, titled "Audio signal processing method, apparatus, and electronic device" and filed with the China National Intellectual Property Administration on December 31, 2020, the entire contents of which are incorporated herein by reference.
Technical Field
The present application belongs to the field of communication technologies, and in particular relates to an audio signal processing method, an audio signal processing apparatus, and an electronic device.
Background
At present, during a voice call between an electronic device and another electronic device, if the audio signal collected by the microphone of the electronic device contains strong noise (for example, strong non-stationary noise), the electronic device can first perform noise reduction processing with a large noise reduction amount on the audio signal to reduce the noise in it, and then send the processed audio signal to the other electronic device, thereby improving the quality of the voice call.
However, the audio signal may also contain a voice signal. When the noise in the audio signal is strong, the electronic device still applies noise reduction processing with a large noise reduction amount to the audio signal, and therefore to the voice signal it contains, which may reduce the clarity of the voice signal after noise reduction.
As a result, the noise reduction processing that the electronic device performs on the audio signal is not very effective.
Summary
The purpose of the embodiments of the present application is to provide an audio signal processing method, an audio signal processing apparatus, and an electronic device that can solve the problem that the noise reduction processing performed by an electronic device on an audio signal is not very effective.
To solve the above technical problem, the present application is implemented as follows:
In a first aspect, an embodiment of the present application provides an audio signal processing method. The method includes: acquiring a low-frequency signal in a first audio signal, where the low-frequency signal is an audio signal whose frequency is within a preset frequency range, the low-frequency signal includes M frequency points, and M is a positive integer; determining M probability values respectively according to the energy values of the M frequency points, where each probability value indicates the probability that a voice signal exists at the corresponding frequency point; determining a target noise reduction parameter based on the M probability values, where the target noise reduction parameter represents the noise reduction amount with which the electronic device performs noise reduction processing on an audio signal; and performing noise reduction processing on the low-frequency signal based on the target noise reduction parameter.
In a second aspect, an embodiment of the present application provides an audio signal processing apparatus that includes an acquisition module, a determination module, and a processing module. The acquisition module is configured to acquire a low-frequency signal in a first audio signal, where the low-frequency signal is an audio signal whose frequency is within a preset frequency range, the low-frequency signal includes M frequency points, and M is a positive integer. The determination module is configured to determine M probability values respectively according to the energy values of the M frequency points, where each probability value indicates the probability that a voice signal exists at the corresponding frequency point, and to determine a target noise reduction parameter based on the M probability values, where the target noise reduction parameter represents the noise reduction amount with which the audio signal processing apparatus performs noise reduction processing on an audio signal. The processing module is configured to perform noise reduction processing on the low-frequency signal based on the target noise reduction parameter determined by the determination module.
In a third aspect, an embodiment of the present application provides an electronic device that includes a processor, a memory, and a program or instructions stored in the memory and executable on the processor, where the program or instructions, when executed by the processor, implement the steps of the method according to the first aspect.
In a fourth aspect, an embodiment of the present application provides a readable storage medium storing a program or instructions that, when executed by a processor, implement the steps of the method according to the first aspect.
In a fifth aspect, an embodiment of the present application provides a chip that includes a processor and a communication interface coupled to the processor, where the processor is configured to run a program or instructions to implement the method according to the first aspect.
In the embodiments of the present application, the electronic device can acquire the low-frequency signal in the first audio signal (the low-frequency signal includes M frequency points) and determine M probability values respectively according to the energy values of the M frequency points (each probability value indicates the probability that a voice signal exists at the corresponding frequency point). Based on the M probability values, the electronic device can determine the noise reduction parameter with which it performs noise reduction processing on the low-frequency signal, and perform noise reduction processing on the low-frequency signal based on that parameter. When the noise in the audio signal is strong, the electronic device determines, from the energy value of each frequency point of the low-frequency signal, the probability that a voice signal exists at that frequency point, and determines the noise reduction parameter for the low-frequency signal from these probabilities. In other words, the electronic device can determine different noise reduction parameters according to whether a voice signal exists in the low-frequency signal and apply correspondingly different noise reduction amounts, instead of directly applying a large noise reduction parameter to the low-frequency signal. A reduction in the clarity of the voice signal after noise reduction can therefore be avoided, which improves the effect of the noise reduction processing performed by the electronic device on the audio signal.
Brief Description of the Drawings
FIG. 1 is a first schematic diagram of the audio signal processing method provided by an embodiment of the present application;
FIG. 2 is a second schematic diagram of the audio signal processing method provided by an embodiment of the present application;
FIG. 3 is a third schematic diagram of the audio signal processing method provided by an embodiment of the present application;
FIG. 4 is a fourth schematic diagram of the audio signal processing method provided by an embodiment of the present application;
FIG. 5 is a schematic structural diagram of the audio signal processing apparatus provided by an embodiment of the present application;
FIG. 6 is a schematic structural diagram of the electronic device provided by an embodiment of the present application;
FIG. 7 is a schematic hardware diagram of the electronic device provided by an embodiment of the present application.
Detailed Description
The technical solutions in the embodiments of the present application are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, but not all, of the embodiments of the present application. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments in the present application without creative effort fall within the protection scope of the present application.
The terms "first", "second", and the like in the description and claims of the present application are used to distinguish similar objects, not to describe a specific order or sequence. Terms used in this way are interchangeable where appropriate, so that the embodiments of the present application can be implemented in orders other than those illustrated or described herein. Objects distinguished by "first", "second", and the like are usually of one type, and their number is not limited; for example, there may be one first object or more than one. In addition, "and/or" in the description and claims denotes at least one of the connected objects, and the character "/" generally indicates an "or" relationship between the objects before and after it.
The audio signal processing method provided by the embodiments of the present application is described in detail below with reference to the accompanying drawings, through specific embodiments and their application scenarios.
The audio signal processing method provided by the embodiments of the present application can be applied to a scenario in which an electronic device conducts a voice call with another electronic device.
Suppose a user conducts a voice call with electronic device 2 through electronic device 1. During the call, the microphone of electronic device 1 collects an audio signal, and electronic device 1 sends it to electronic device 2. If the audio signal includes a strong non-stationary noise signal, electronic device 1 can first perform noise reduction processing on the audio signal and then send the processed audio signal to electronic device 2. In the related art, electronic device 1 determines a noise reduction parameter 1 corresponding to the intensity of the noise signal and, based on noise reduction parameter 1, applies a digital signal processing (DSP) noise reduction algorithm to the audio signal collected by the microphone before sending the processed audio signal to electronic device 2. However, a voice signal may be present in the audio signal, and electronic device 1 still applies the DSP noise reduction algorithm based on noise reduction parameter 1. The voice signal may therefore be subjected to noise reduction with a relatively large noise reduction amount, which may reduce its clarity.
In the embodiments of the present application, by contrast, electronic device 1 can first divide the audio signal into a low-frequency signal (the audio signal whose frequency is within a preset frequency range) and a high-frequency signal (the audio signal whose frequency is outside the preset frequency range). It then determines multiple probability values from the energy values of multiple frequency points of the low-frequency signal, each probability value indicating the probability that a voice signal exists at the corresponding frequency point. Based on these probability values, it determines a noise reduction parameter 2 for the low-frequency signal and performs noise reduction processing on the low-frequency signal based on noise reduction parameter 2. Next, electronic device 1 can perform noise reduction processing on the high-frequency signal based on a noise reduction parameter 3 (for example, a default noise reduction parameter), synthesize the noise-reduced low-frequency signal with the noise-reduced high-frequency signal, and output the synthesized audio signal. In other words, electronic device 1 can determine different noise reduction parameters according to whether a voice signal exists in the low-frequency signal and apply correspondingly different noise reduction amounts to the low-frequency signal, instead of directly applying a large noise reduction parameter. A reduction in the clarity of the voice signal can thus be avoided.
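The split, per-band processing, and synthesis described above can be sketched as follows. This is only an illustration under stated assumptions: one frame is represented as a list of frequency-bin magnitudes, the preset frequency range (here 0 to 1000 Hz) is a hypothetical value that the application does not specify, and a plain per-band gain stands in for noise reduction with "noise reduction parameter 2" and "noise reduction parameter 3". The names `split_bands` and `process_frame` are invented for this sketch.

```python
from typing import List, Sequence, Tuple


def split_bands(freqs_hz: Sequence[float],
                low_band_hz: Tuple[float, float]) -> Tuple[List[int], List[int]]:
    """Return (low_indices, high_indices): bins inside / outside the preset range."""
    lo, hi = low_band_hz
    low_idx = [i for i, f in enumerate(freqs_hz) if lo <= f <= hi]
    high_idx = [i for i, f in enumerate(freqs_hz) if not (lo <= f <= hi)]
    return low_idx, high_idx


def process_frame(freqs_hz: Sequence[float],
                  magnitudes: Sequence[float],
                  low_band_hz: Tuple[float, float],
                  low_gain: float,
                  high_gain: float) -> List[float]:
    """Apply one gain to the low band and another to the high band, then merge.

    The gains are placeholders (assumptions) for the band-specific noise
    reduction; the application does not define the processing as a plain gain.
    """
    if len(freqs_hz) != len(magnitudes):
        raise ValueError("freqs_hz and magnitudes must have the same length")
    low_idx, high_idx = split_bands(freqs_hz, low_band_hz)
    out = [0.0] * len(magnitudes)
    for i in low_idx:   # low-frequency signal: adaptive parameter
        out[i] = magnitudes[i] * low_gain
    for i in high_idx:  # high-frequency signal: default parameter
        out[i] = magnitudes[i] * high_gain
    return out          # synthesized frame, bins back in original order
```

Because every bin falls into exactly one of the two index lists, the merged output has the same length and bin order as the input frame.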
FIG. 1 shows a flowchart of an audio signal processing method provided by an embodiment of the present application. As shown in FIG. 1, the audio signal processing method provided by the embodiment of the present application may include the following steps 101 to 104.
Step 101: The audio signal processing apparatus acquires the low-frequency signal in the first audio signal.
Optionally, in the embodiments of the present application, when the audio signal processing apparatus displays an interface of a target application, the user can perform an input on the identifier (for example, the avatar) of a target contact in that interface, so that the apparatus displays the conversation page of the target contact. The user can then perform an input on the conversation page, so that the apparatus establishes a voice connection with another audio signal processing apparatus (that is, the one corresponding to the target contact), collects the first audio signal through its microphone, and examines the first audio signal. When it detects a strong noise signal (for example, a strong non-stationary noise signal) in the first audio signal, the apparatus can divide the first audio signal into a low-frequency signal and a high-frequency signal and acquire the low-frequency signal.
Optionally, in the embodiments of the present application, the target application may be a call application.
Optionally, in the embodiments of the present application, when the audio signal processing apparatus detects that the intensity of the noise signal in the first audio signal is greater than or equal to a preset intensity, the apparatus can divide the first audio signal into a low-frequency signal and a high-frequency signal and acquire the low-frequency signal.
In the embodiments of the present application, the low-frequency signal is an audio signal whose frequency is within a preset frequency range; the low-frequency signal includes M frequency points, and M is a positive integer.
It should be noted that "low-frequency signal" and "high-frequency signal" are two relative concepts: the frequency of the low-frequency signal is lower than that of the high-frequency signal. It can be understood that the first audio signal includes at least two sub-band signals (a sub-band signal is the signal corresponding to a certain frequency interval) whose frequency ranges do not overlap; the low-frequency signal can be one of the at least two sub-band signals, and the high-frequency signal can be another of them.
Optionally, in the embodiments of the present application, when the audio signal processing apparatus detects a strong noise signal in the first audio signal, the apparatus can use a preset algorithm to perform echo cancellation on the first audio signal, so as to eliminate the echo signal in it.
Optionally, in the embodiments of the present application, the preset algorithm may specifically be an adaptive echo cancellation algorithm.
It should be noted that, for a description of the adaptive echo cancellation algorithm, reference may be made to the related art; details are not repeated here.
Further optionally, in the embodiments of the present application, the audio signal processing apparatus can examine the first audio signal after echo cancellation and identify the audio signal whose frequency is within the preset frequency range, so as to acquire the low-frequency signal.
Step 102: The audio signal processing apparatus determines M probability values respectively according to the energy values of the M frequency points.
Optionally, in the embodiments of the present application, the audio signal processing apparatus can analyze the low-frequency signal to obtain the energy value of each of its frequency points, thereby obtaining the energy values of the M frequency points.
In the embodiments of the present application, each of the M probability values indicates the probability that a voice signal exists at the corresponding frequency point.
It can be understood that the higher the probability value for a frequency point (for example, the closer it is to 1), the more likely it is that a voice signal exists at that frequency point, that is, the more the frequency point resembles a voice signal; the lower the probability value (for example, the closer it is to 0), the less likely it is that a voice signal exists at that frequency point, that is, the more the frequency point resembles a noise signal.
Optionally, in the embodiments of the present application, for each of the M probability values, the audio signal processing apparatus can feed the energy value of one frequency point to a target neural network as its input variable to obtain one probability value (a mask), thereby obtaining M probability values (that is, M masks).
Further optionally, in the embodiments of the present application, the target neural network may specifically be a deep learning neural network. It may be a deep neural network obtained by training a to-be-trained deep neural network with multiple simulated audio signals as the training set.
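To make the shape of this mapping concrete (M energy values in, M values between 0 and 1 out), the following sketch uses a simple stand-in for the trained network. It is not the neural network of the application: it assumes a per-bin noise-floor estimate is available and passes the energy-to-noise ratio in dB through a logistic function. The function name, the `noise_floor` input, and the `slope` parameter are all hypothetical.

```python
import math
from typing import List, Sequence


def speech_presence_probabilities(energies: Sequence[float],
                                  noise_floor: Sequence[float],
                                  slope: float = 1.0) -> List[float]:
    """Map M bin energies to M values in (0, 1), one mask per frequency point.

    Stand-in for the trained network (an assumption of this sketch): the higher
    a bin's energy relative to its noise-floor estimate, the closer to 1.
    """
    if len(energies) != len(noise_floor):
        raise ValueError("energies and noise_floor must have the same length")
    eps = 1e-12  # guards against log(0) and division by zero
    probs = []
    for energy, noise in zip(energies, noise_floor):
        ratio_db = 10.0 * math.log10(max(energy, eps) / max(noise, eps))
        probs.append(1.0 / (1.0 + math.exp(-slope * ratio_db)))
    return probs
```

A bin whose energy equals its noise-floor estimate maps to 0.5; bins well above the floor approach 1 and bins well below it approach 0, matching the interpretation of the probability values given above.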
Step 103: The audio signal processing apparatus determines the target noise reduction parameter based on the M probability values.
In the embodiments of the present application, the target noise reduction parameter represents the noise reduction amount with which the audio signal processing apparatus performs noise reduction processing on an audio signal.
Optionally, in the embodiments of the present application, the noise reduction processing may include at least one of the following: DSP noise reduction algorithm processing, and residual echo suppression processing.
Optionally, in the embodiments of the present application, the audio signal processing apparatus can select one probability value from the M probability values and determine the target noise reduction parameter according to that probability value; alternatively, the apparatus can determine the average of the M probability values (for example, the target probability value in the embodiments below) and determine the target noise reduction parameter according to that average probability value.
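The two options above (use one of the M probability values, or use their average) can be sketched as a small reduction function. The text does not say which single value would be selected, so the `"max"` mode below is purely an assumption for illustration; only the `"mean"` mode corresponds directly to the average probability value described above.

```python
from statistics import fmean
from typing import Sequence


def reduce_probabilities(probs: Sequence[float], mode: str = "mean") -> float:
    """Collapse M per-bin probabilities into the single value used in step 103.

    mode="mean": average of the M values (the average probability value).
    mode="max":  one selected value; choosing the maximum is an assumption.
    """
    if not probs:
        raise ValueError("at least one probability value is required (M >= 1)")
    if mode == "mean":
        return fmean(probs)
    if mode == "max":
        return max(probs)
    raise ValueError(f"unknown mode: {mode!r}")
```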
Further optionally, in the embodiments of the present application, the audio signal processing apparatus can determine, from at least one first correspondence and according to the intensity of the noise signal in the first audio signal, the default noise reduction parameter corresponding to that intensity. Depending on whether the determined probability value (or average probability value) satisfies a preset condition, the apparatus then derives a noise reduction parameter from the default noise reduction parameter and determines it as the target noise reduction parameter. Each first correspondence is a correspondence between one intensity and one default noise reduction parameter.
Exemplarily, the preset condition may specifically be that the probability value (or average probability value) determined by the audio signal processing apparatus is greater than or equal to a preset threshold.
If the determined probability value (or average probability value) satisfies the preset condition, the audio signal processing apparatus can use a first algorithm to derive a noise reduction parameter from the determined default noise reduction parameter, and determine that noise reduction parameter as the target noise reduction parameter.
If the determined probability value (or average probability value) does not satisfy the preset condition, the audio signal processing apparatus can use a second algorithm to derive another noise reduction parameter from the determined default noise reduction parameter, and determine that other noise reduction parameter as the target noise reduction parameter.
It should be noted that the first algorithm and the second algorithm are described in detail in the following embodiments of the present application and are not elaborated here.
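The lookup-then-branch logic above can be sketched as follows. Everything numeric here is hypothetical: the intensity boundaries, the default parameters (treated as attenuation in dB), and the 0.5 threshold are made-up values. The first and second algorithms are not defined in this part of the text, so they appear as replaceable callables; the defaults (halve the default parameter when a voice signal is likely, keep it unchanged otherwise) are placeholders chosen only so the sketch runs, not the algorithms of the application.

```python
from bisect import bisect_right
from typing import Callable

# Hypothetical "first correspondence": lower bound of a noise-intensity range (dB)
# mapped to a default noise reduction parameter (dB of attenuation).
INTENSITY_BOUNDS_DB = [0.0, 40.0, 60.0, 80.0]
DEFAULT_PARAMS_DB = [0.0, 6.0, 12.0, 18.0]


def default_param(noise_db: float) -> float:
    """Look up the default parameter for the range containing noise_db."""
    idx = bisect_right(INTENSITY_BOUNDS_DB, noise_db) - 1
    return DEFAULT_PARAMS_DB[max(idx, 0)]  # clamp intensities below the first bound


def target_param(noise_db: float,
                 probability: float,
                 threshold: float = 0.5,
                 first_algorithm: Callable[[float], float] = lambda d: 0.5 * d,
                 second_algorithm: Callable[[float], float] = lambda d: d) -> float:
    """Choose the algorithm by the preset condition (probability >= threshold)."""
    base = default_param(noise_db)
    if probability >= threshold:       # preset condition satisfied
        return first_algorithm(base)   # placeholder for the first algorithm
    return second_algorithm(base)      # placeholder for the second algorithm
```

With these placeholder values, `target_param(65.0, 0.8)` looks up a default of 12.0 and returns 6.0, while `target_param(65.0, 0.2)` returns the default 12.0 unchanged.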
Step 104: The audio signal processing apparatus performs noise reduction processing on the low-frequency signal based on the target noise reduction parameter.
Optionally, in the embodiments of the present application, the audio signal processing apparatus can perform noise reduction processing on the low-frequency signal based on the target noise reduction parameter by using a DSP noise reduction algorithm (or an echo suppression algorithm).
It can be understood that the audio signal processing apparatus can use the target noise reduction parameter as the control factor for the amount of noise suppression in the DSP noise reduction algorithm (or the echo suppression algorithm), so as to suppress the noise signal in the low-frequency signal and thereby perform noise reduction processing on the low-frequency signal.
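One common way for a parameter to act as a control factor for the amount of noise suppression is to let it cap the attenuation a suppressor may apply. The sketch below assumes that interpretation: it uses a generic magnitude spectral-subtraction gain with a gain floor derived from the parameter, expressed in dB. This is a textbook-style suppressor chosen for illustration, not the DSP noise reduction algorithm of the application, and the function name and the per-bin noise estimate are assumptions.

```python
from typing import List, Sequence


def suppress(magnitudes: Sequence[float],
             noise_magnitudes: Sequence[float],
             max_attenuation_db: float) -> List[float]:
    """Spectral-subtraction gain, limited by a floor set from the parameter.

    max_attenuation_db plays the role of the noise reduction parameter here
    (an assumption): a larger value permits deeper suppression per bin.
    """
    if len(magnitudes) != len(noise_magnitudes):
        raise ValueError("inputs must have the same length")
    floor = 10.0 ** (-max_attenuation_db / 20.0)  # smallest allowed gain
    out = []
    for mag, noise in zip(magnitudes, noise_magnitudes):
        if mag <= 0.0:
            out.append(0.0)
            continue
        gain = max(floor, 1.0 - noise / mag)  # never attenuate below the floor
        out.append(mag * gain)
    return out
```

With a smaller parameter the floor rises, so bins dominated by noise are attenuated less, which also limits how much a voice component in those bins can be damaged.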
本申请实施例中,在音频信号中的噪声信号较强的情况下,若音频信号中存在语音信号,则音频信号处理装置会对该音频信号进行较大降噪量的降噪处理,从而会导致该语音信号中、浊音语音受损,进而导致语音信号的清晰度下降。因此,音频信号处理装置可以获取音频信号(即第一音频信号)中的浊音语音对应的音频信号(即低频信号),并根据该低频信号的多个频点的能量值,确定低频信号中存在语音信号的概率值,并基于该概率值,确定音频信号处理装置对该低频信号进行降噪处理对应的降噪参数,从而可以避免降噪处理后的语音信号中、浊音语音受损。In the embodiment of the present application, in the case where the noise signal in the audio signal is strong, if there is a voice signal in the audio signal, the audio signal processing apparatus will perform noise reduction processing with a large amount of noise reduction on the audio signal, thereby reducing the noise. As a result, the voiced speech in the speech signal is damaged, and the intelligibility of the speech signal is decreased. Therefore, the audio signal processing apparatus can acquire the audio signal (ie, the low-frequency signal) corresponding to the voiced speech in the audio signal (ie, the first audio signal), and determine that the low-frequency signal exists in the low-frequency signal according to the energy values of multiple frequency points of the low-frequency signal. The probability value of the speech signal, and based on the probability value, determine the noise reduction parameter corresponding to the noise reduction process performed by the audio signal processing device on the low frequency signal, so as to avoid damage to the voiced speech in the noise reduction processed speech signal.
In the audio signal processing method provided by this embodiment of the present application, the audio signal processing apparatus may acquire the low-frequency signal in the first audio signal (the low-frequency signal includes M frequency points) and determine M probability values from the energy values of the M frequency points, each probability value indicating the probability that a speech signal is present at its corresponding frequency point. Based on the M probability values, the apparatus may determine the noise reduction parameter with which it performs noise reduction on the low-frequency signal, and then perform noise reduction on the low-frequency signal based on that parameter. When the noise signal in the audio signal is strong, the apparatus can thus determine, from the energy value of each frequency point of the low-frequency signal, the probability that a speech signal is present at that frequency point, and use these probabilities to determine the noise reduction parameter. In other words, the apparatus selects different noise reduction parameters depending on whether a speech signal is present in the low-frequency signal, and applies a correspondingly different amount of noise reduction, rather than directly applying a large noise reduction parameter and a large amount of noise reduction. A loss of clarity in the noise-reduced speech signal can therefore be avoided, which improves the effect of the noise reduction that the audio signal processing apparatus performs on the audio signal.
Optionally, in this embodiment of the present application, with reference to FIG. 1 and as shown in FIG. 2, after step 104 the audio signal processing method provided by this embodiment may further include the following step 201.
Step 201: The audio signal processing apparatus synthesizes the noise-reduced low-frequency signal and a first signal to obtain a target audio signal, and outputs the target audio signal.
In this embodiment of the present application, the first signal is obtained based on the high-frequency signal in the first audio signal, the high-frequency signal being the audio signal whose frequency is outside the preset frequency range.
Further optionally, in this embodiment of the present application, the first signal may specifically be obtained by the audio signal processing apparatus performing noise reduction on the high-frequency signal.
It can be seen that the audio signal processing apparatus can divide the first audio signal into a low-frequency signal and a high-frequency signal, apply different amounts of noise reduction to the low-frequency signal and the high-frequency signal, and synthesize the noise-reduced low-frequency signal with the noise-reduced high-frequency signal. Damage to the voiced speech in the noise-reduced speech signal can therefore be avoided, which prevents a loss of clarity in the noise-reduced speech signal and improves the effect of the noise reduction that the audio signal processing apparatus performs on the audio signal.
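The split, per-band processing and synthesis described above can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the method of the application: it assumes the signal is already available as frequency bins, it treats each noise reduction parameter as a flat attenuation in dB, and the names `db_to_gain` and `denoise_and_merge` are hypothetical. An actual DSP noise reduction or residual echo suppression algorithm would use the parameter as a control factor rather than as a flat gain.

```python
from typing import List, Sequence, Tuple


def db_to_gain(reduction_db: float) -> float:
    """Convert an attenuation in dB into a linear amplitude gain."""
    return 10.0 ** (-reduction_db / 20.0)


def denoise_and_merge(
    bin_freqs_hz: Sequence[float],
    spectrum: Sequence[complex],
    low_range_hz: Tuple[float, float],
    low_reduction_db: float,
    high_reduction_db: float,
) -> List[complex]:
    """Attenuate the low band and the high band by different amounts, then merge.

    Bins whose frequency lies inside low_range_hz form the low-frequency
    signal; all other bins form the high-frequency signal. The flat gain
    is an assumption standing in for a real noise suppression stage.
    """
    lo, hi = low_range_hz
    low_gain = db_to_gain(low_reduction_db)
    high_gain = db_to_gain(high_reduction_db)
    merged = []
    for freq, value in zip(bin_freqs_hz, spectrum):
        in_low_band = lo <= freq <= hi
        merged.append(value * (low_gain if in_low_band else high_gain))
    return merged
```

Keeping both bands in one bin-ordered list makes the synthesis step trivial here; a time-domain implementation would instead filter, process and sum the two band signals.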
Optionally, in this embodiment of the present application, with reference to FIG. 2 and as shown in FIG. 3, before step 201 the audio signal processing method provided by this embodiment may further include the following step 301.
Step 301: The audio signal processing apparatus acquires the high-frequency signal and, based on a third noise reduction parameter, performs noise reduction on the high-frequency signal to obtain the first signal.
Further optionally, in this embodiment of the present application, when the audio signal processing apparatus detects that the strength of the noise signal in the first audio signal is greater than or equal to a preset strength, it may divide the first audio signal into the low-frequency signal and the high-frequency signal, and acquire the high-frequency signal.
Further optionally, in this embodiment of the present application, the audio signal processing apparatus may examine the echo-processed first audio signal and identify the audio signal whose frequency is outside the preset frequency range, thereby acquiring the high-frequency signal.
Further optionally, in this embodiment of the present application, the third noise reduction parameter may specifically be a default noise reduction parameter.
Further optionally, in this embodiment of the present application, the audio signal processing apparatus may, according to the strength of the noise signal in the first audio signal, determine from at least one first correspondence the default noise reduction parameter that corresponds to that noise strength, and determine that default noise reduction parameter as the third noise reduction parameter, so that the apparatus can perform noise reduction on the high-frequency signal based on the third noise reduction parameter.
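One way to picture the lookup of a default noise reduction parameter from the first correspondences is the sketch below. The table layout (a minimum noise strength paired with a parameter), the numeric values, the units and the name `default_parameter` are all assumptions made for illustration; the application does not specify how the correspondences are stored.

```python
from typing import Sequence, Tuple

# Hypothetical first correspondences: (minimum noise strength, default parameter).
# The values and units are illustrative only.
FIRST_CORRESPONDENCES = [(40.0, 6.0), (55.0, 12.0), (70.0, 18.0)]


def default_parameter(
    noise_strength: float,
    correspondences: Sequence[Tuple[float, float]] = FIRST_CORRESPONDENCES,
) -> float:
    """Return the default parameter of the highest tier the noise strength reaches."""
    chosen = None
    for min_strength, parameter in sorted(correspondences):
        if noise_strength >= min_strength:
            chosen = parameter
    if chosen is None:
        raise ValueError("noise strength is below every tier in the table")
    return chosen
```

With this hypothetical table, a noise strength of 60.0 falls in the second tier, so the call returns 12.0, which would then serve as the third noise reduction parameter for the high-frequency signal.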
Further optionally, in this embodiment of the present application, the audio signal processing apparatus may perform residual echo suppression on the high-frequency signal based on the third noise reduction parameter to obtain the first signal.
It should be noted that this embodiment of the present application does not limit the order in which step 301 and step 102 are performed.
In one possible implementation, the audio signal processing apparatus may perform step 301 first and then step 102. That is, the apparatus may first acquire the high-frequency signal and perform noise reduction on it based on the third noise reduction parameter to obtain the first signal, and then determine the M probability values from the energy values of the M frequency points.
In another possible implementation, the audio signal processing apparatus may perform step 102 first and then step 301. That is, the apparatus may first determine the M probability values from the energy values of the M frequency points, and then acquire the high-frequency signal and perform noise reduction on it based on the third noise reduction parameter to obtain the first signal.
In yet another possible implementation, the audio signal processing apparatus may perform step 301 and step 102 at the same time. That is, while acquiring the high-frequency signal and performing noise reduction on it based on the third noise reduction parameter to obtain the first signal, the apparatus may determine the M probability values from the energy values of the M frequency points.
In this embodiment of the present application, the speech in the high-frequency signal is unvoiced speech, and damage to unvoiced speech does not lower the clarity of the speech signal. Therefore, the audio signal processing apparatus may acquire the part of the audio signal (that is, the first audio signal) that corresponds to unvoiced speech (that is, the high-frequency signal) and perform noise reduction on the high-frequency signal based on the default noise reduction parameter.
It can be seen that, when the noise signal in the audio signal is strong, the audio signal processing apparatus can apply a larger noise reduction parameter, and hence a larger amount of noise reduction, to the audio signal corresponding to unvoiced speech (that is, the high-frequency signal). The noise signal in the noise-reduced audio signal is therefore lowered without reducing the clarity of the noise-reduced speech signal, which improves the effect of the noise reduction that the audio signal processing apparatus performs on the audio signal.
Optionally, in this embodiment of the present application, with reference to FIG. 1 and as shown in FIG. 4, step 103 may specifically be implemented by the following steps 103a and 103b.
Step 103a: The audio signal processing apparatus determines a target probability value according to the M probability values.
In this embodiment of the present application, the target probability value is the average of the M probability values.
Further optionally, in this embodiment of the present application, the audio signal processing apparatus may determine the target probability value from the M probability values by using an averaging algorithm.
It should be noted that, for a description of the averaging algorithm, reference may be made to the related art; details are not repeated in this embodiment of the present application.
Step 103b: The audio signal processing apparatus determines the target noise reduction parameter based on the target probability value.
Further optionally, in this embodiment of the present application, the audio signal processing apparatus may determine the target noise reduction parameter according to whether the target probability value satisfies a preset condition.
It can be understood that the audio signal processing apparatus may determine, from the target probability value, whether a speech signal is present in the low-frequency signal, and may accordingly determine different noise reduction parameters so as to apply different amounts of noise reduction to the low-frequency signal.
It can be seen that the audio signal processing apparatus can use the average of the M probability values to determine whether a speech signal is present in the low-frequency signal and to determine the target noise reduction parameter accordingly, so that different amounts of noise reduction are applied to the low-frequency signal. A loss of clarity in the noise-reduced speech signal can therefore be avoided, which improves the effect of the noise reduction that the audio signal processing apparatus performs on the audio signal.
Optionally, in this embodiment of the present application, step 103b may specifically be implemented by the following step 103b1 or step 103b2.
Step 103b1: When the target probability value is greater than or equal to a preset threshold, the audio signal processing apparatus determines a first noise reduction parameter as the target noise reduction parameter.
Further optionally, in this embodiment of the present application, the audio signal processing apparatus may use a first algorithm to obtain the first noise reduction parameter from the determined default noise reduction parameter, and determine the first noise reduction parameter as the target noise reduction parameter.
Specifically, the first algorithm may be $C_1 = A + B_1$, where $C_1$ is the first noise reduction parameter, $A$ is the default noise reduction parameter determined by the audio signal processing apparatus, and $B_1$ is a first preset noise reduction parameter.
It can be understood that if the target probability value is greater than or equal to the preset threshold (that is, the target probability value satisfies the preset condition), a speech signal can be considered to be present in the low-frequency signal. The audio signal processing apparatus may therefore lower the determined default noise reduction parameter to avoid damaging the voiced speech in the first audio signal, and thus avoid a loss of clarity in the noise-reduced speech signal.
It can be seen that, when it is determined that a speech signal is present in the low-frequency signal, the audio signal processing apparatus can lower the determined default noise reduction parameter and apply a smaller noise reduction parameter, and hence a smaller amount of noise reduction, to the low-frequency signal. A loss of clarity in the noise-reduced speech signal is therefore avoided while the noise signal in the noise-reduced audio signal is still lowered, which improves the effect of the noise reduction that the audio signal processing apparatus performs on the audio signal.
Step 103b2: When the target probability value is less than the preset threshold, the audio signal processing apparatus determines a second noise reduction parameter as the target noise reduction parameter.
Further optionally, in this embodiment of the present application, the audio signal processing apparatus may use a second algorithm to obtain the second noise reduction parameter from the determined default noise reduction parameter, and determine the second noise reduction parameter as the target noise reduction parameter.
Specifically, the second algorithm may be $C_2 = A + B_2$, where $C_2$ is the second noise reduction parameter, $A$ is the default noise reduction parameter determined by the audio signal processing apparatus, and $B_2$ is a second preset noise reduction parameter.
It can be understood that if the target probability value is less than the preset threshold (that is, the target probability value does not satisfy the preset condition), no speech signal can be considered to be present in the low-frequency signal (that is, the low-frequency signal contains only a noise signal). The audio signal processing apparatus may therefore raise the determined default noise reduction parameter to lower the noise signal in the noise-reduced audio signal.
It can be seen that, when it is determined that no speech signal is present in the low-frequency signal, the audio signal processing apparatus can raise the determined default noise reduction parameter and apply a larger noise reduction parameter, and hence a larger amount of noise reduction, to the low-frequency signal. The noise signal in the noise-reduced audio signal can therefore be lowered, which improves the effect of the noise reduction that the audio signal processing apparatus performs on the audio signal.
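The choice between steps 103b1 and 103b2 can be written as a short sketch. It assumes that the first preset parameter B1 is negative (so that C1 = A + B1 lowers the default) and that the second preset parameter B2 is positive (so that C2 = A + B2 raises it); the application states only the direction of each adjustment, and the function name and numeric values here are hypothetical.

```python
def select_target_parameter(
    target_probability: float,
    default_parameter: float,
    threshold: float,
    b1: float,
    b2: float,
) -> float:
    """Return C1 = A + B1 when speech is likely present, otherwise C2 = A + B2."""
    if target_probability >= threshold:
        # Step 103b1: speech is considered present, so use the first parameter.
        return default_parameter + b1
    # Step 103b2: only noise is considered present, so use the second parameter.
    return default_parameter + b2
```

For example, with A = 12.0, B1 = -6.0, B2 = 3.0 and a threshold of 0.5, a target probability of 0.7 yields 6.0 and a target probability of 0.3 yields 15.0.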
It should be noted that the audio signal processing method provided by the embodiments of the present application may be executed by the audio signal processing apparatus of the above embodiments, or by a control module in that apparatus for executing the audio signal processing method. In the embodiments of the present application, the audio signal processing apparatus is described by taking the case in which the apparatus executes the audio signal processing method as an example.
FIG. 5 shows a possible schematic structural diagram of the audio signal processing apparatus involved in the embodiments of the present application. As shown in FIG. 5, the audio signal processing apparatus 60 may include an acquisition module 61, a determination module 62 and a processing module 63.
The acquisition module 61 is configured to acquire the low-frequency signal in the first audio signal; the low-frequency signal is the audio signal whose frequency is within the preset frequency range, and it includes M frequency points, M being a positive integer. The determination module 62 is configured to determine M probability values from the energy values of the M frequency points, each probability value indicating the probability that a speech signal is present at its corresponding frequency point, and to determine the target noise reduction parameter based on the M probability values; the target noise reduction parameter represents the amount of noise reduction that the audio signal processing apparatus applies to the audio signal. The processing module 63 is configured to perform noise reduction on the low-frequency signal based on the target noise reduction parameter determined by the determination module 62.
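The work of the determination module 62 can be sketched end to end as below. This section does not state how an energy value is mapped to a speech-presence probability, so that mapping is passed in as a caller-supplied function; `energy_to_probability`, `determine_target_parameter` and all numeric values are assumptions for illustration, not part of the application.

```python
from statistics import fmean
from typing import Callable, Sequence


def determine_target_parameter(
    energies: Sequence[float],
    energy_to_probability: Callable[[float], float],
    default_parameter: float,
    threshold: float,
    b1: float,
    b2: float,
) -> float:
    """Map M bin energies to M probabilities, average them, and pick a parameter."""
    if not energies:
        raise ValueError("M must be a positive integer")
    # One probability value per frequency point.
    probabilities = [energy_to_probability(energy) for energy in energies]
    # The target probability value is the average of the M probability values.
    target_probability = fmean(probabilities)
    # A mean at or above the threshold is treated as speech being present.
    adjustment = b1 if target_probability >= threshold else b2
    return default_parameter + adjustment
```

Injecting the mapping keeps the sketch honest about what is unspecified: any estimator that returns a value in [0, 1] per frequency point can be substituted without changing the averaging or threshold logic.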
In one possible implementation, the determination module 62 is specifically configured to determine a target probability value according to the M probability values, the target probability value being the average of the M probability values, and to determine the target noise reduction parameter based on the target probability value.
In one possible implementation, the determination module 62 is configured to determine the first noise reduction parameter as the target noise reduction parameter when the target probability value is greater than or equal to the preset threshold, or to determine the second noise reduction parameter as the target noise reduction parameter when the target probability value is less than the preset threshold.
In one possible implementation, the processing module 63 is further configured to synthesize the noise-reduced low-frequency signal and the first signal to obtain a target audio signal, and to output the target audio signal. The first signal is obtained based on the high-frequency signal in the first audio signal, the high-frequency signal being the audio signal whose frequency is outside the preset frequency range.
In one possible implementation, the acquisition module 61 is further configured to acquire the high-frequency signal and, based on the third noise reduction parameter, perform noise reduction on the high-frequency signal to obtain the first signal.
With the audio signal processing apparatus provided by this embodiment of the present application, when the noise signal in the audio signal is strong, the apparatus can determine, from the energy value of each frequency point of the low-frequency signal, the probability that a speech signal is present at that frequency point, and use these probabilities to determine the noise reduction parameter with which it performs noise reduction on the low-frequency signal. In other words, the apparatus selects different noise reduction parameters depending on whether a speech signal is present in the low-frequency signal, and applies a correspondingly different amount of noise reduction, rather than directly applying a large noise reduction parameter and a large amount of noise reduction. A loss of clarity in the noise-reduced speech signal can therefore be avoided, which improves the effect of the noise reduction that the audio signal processing apparatus performs on the audio signal.
The audio signal processing apparatus in the embodiments of the present application may be a standalone apparatus, or may be a component, an integrated circuit or a chip in a terminal. The apparatus may be a mobile electronic device or a non-mobile electronic device. By way of example, the mobile electronic device may be a mobile phone, a tablet computer, a notebook computer, a palmtop computer, an in-vehicle electronic device, a wearable device, an ultra-mobile personal computer (UMPC), a netbook or a personal digital assistant (PDA), and the non-mobile electronic device may be a server, a network attached storage (NAS) device, a personal computer (PC), a television (TV), a teller machine or a self-service machine; the embodiments of the present application do not specifically limit this.
The audio signal processing apparatus in the embodiments of the present application may be an apparatus having an operating system. The operating system may be the Android operating system, the iOS operating system or another possible operating system; the embodiments of the present application do not specifically limit this.
The audio signal processing apparatus provided by the embodiments of the present application can implement each process implemented by the method embodiments of FIG. 1 to FIG. 4. To avoid repetition, details are not repeated here.
Optionally, as shown in FIG. 6, an embodiment of the present application further provides an electronic device 70, which includes a processor 72, a memory 71, and a program or instructions stored in the memory 71 and executable on the processor 72. When executed by the processor 72, the program or instructions implement each process of the above audio signal processing method embodiments and achieve the same technical effect. To avoid repetition, details are not repeated here.
It should be noted that the electronic devices in the embodiments of the present application include the mobile electronic devices and non-mobile electronic devices described above.
FIG. 7 is a schematic diagram of the hardware structure of an electronic device implementing an embodiment of the present application.
The electronic device 100 includes, but is not limited to, a radio frequency unit 101, a network module 102, an audio output unit 103, an input unit 104, a sensor 105, a display unit 106, a user input unit 107, an interface unit 108, a memory 109 and a processor 110.
Those skilled in the art will understand that the electronic device 100 may further include a power supply (such as a battery) that supplies power to the components. The power supply may be logically connected to the processor 110 through a power management system, so that charging, discharging and power consumption are managed through the power management system. The structure shown in FIG. 7 does not limit the electronic device: the electronic device may include more or fewer components than shown, combine certain components, or arrange the components differently. Details are not repeated here.
The processor 110 is configured to: acquire the low-frequency signal in the first audio signal, the low-frequency signal being the audio signal whose frequency is within the preset frequency range and including M frequency points, M being a positive integer; determine M probability values from the energy values of the M frequency points, each probability value indicating the probability that a speech signal is present at its corresponding frequency point; determine the target noise reduction parameter based on the M probability values, the target noise reduction parameter representing the amount of noise reduction that the electronic device applies to the audio signal; and perform noise reduction on the low-frequency signal based on the target noise reduction parameter.
With the electronic device provided by this embodiment of the present application, when the noise in the audio signal is strong, the electronic device can determine, from the energy value of each frequency point of the low-frequency signal, the probability that a speech signal is present at that frequency point, and use these probabilities to determine the noise reduction parameter with which it performs noise reduction on the low-frequency signal. In other words, the electronic device selects different noise reduction parameters depending on whether a speech signal is present in the low-frequency signal, and applies a correspondingly different amount of noise reduction, rather than directly applying a large noise reduction parameter and a large amount of noise reduction. A loss of clarity in the noise-reduced speech signal can therefore be avoided, which improves the effect of the noise reduction that the electronic device performs on the audio signal.
Optionally, in this embodiment of the present application, the processor 110 is specifically configured to determine a target probability value according to the M probability values, the target probability value being the average of the M probability values, and to determine the target noise reduction parameter based on the target probability value.
It can be seen that the electronic device can use the average of the M probability values to determine whether a speech signal is present in the low-frequency signal and to determine the target noise reduction parameter accordingly, so that different amounts of noise reduction are applied to the low-frequency signal. A loss of clarity in the noise-reduced speech signal can therefore be avoided, which improves the effect of the noise reduction that the electronic device performs on the audio signal.
Optionally, in this embodiment of the present application, the processor 110 is specifically configured to determine a first noise reduction parameter as the target noise reduction parameter when the target probability value is greater than or equal to a preset threshold, or to determine a second noise reduction parameter as the target noise reduction parameter when the target probability value is less than the preset threshold.
It follows that, when it is determined that a speech signal is present in the low-frequency signal, the electronic device can lower a determined default noise reduction parameter, so that a smaller noise reduction parameter, and hence a smaller amount of noise reduction, is applied to the low-frequency signal. This avoids a loss of clarity in the speech signal after noise reduction processing while still reducing the noise in the processed audio signal, which improves the effect of the noise reduction processing that the electronic device performs on the audio signal.
Likewise, when it is determined that no speech signal is present in the low-frequency signal, the electronic device can raise the determined default noise reduction parameter, so that a larger noise reduction parameter, and hence a larger amount of noise reduction, is applied to the low-frequency signal. This reduces the noise in the processed audio signal, which improves the effect of the noise reduction processing that the electronic device performs on the audio signal.
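The threshold rule above can be sketched as below. The text says only that the default parameter is lowered when speech is judged present and raised otherwise; the multiplicative adjustment, the `step` fraction, and the default `threshold` of 0.5 are assumptions made for illustration, and the function name is hypothetical.

```python
def select_noise_reduction_parameter(target_prob, default_param, threshold=0.5, step=0.2):
    """Choose the target noise reduction parameter from the target probability.

    target_prob >= threshold: speech is treated as present, so the default
    parameter is lowered (the "first" noise reduction parameter).
    target_prob <  threshold: speech is treated as absent, so the default
    parameter is raised (the "second" noise reduction parameter).
    """
    if target_prob >= threshold:
        return default_param * (1.0 - step)
    return default_param * (1.0 + step)
```

Note that a target probability value exactly equal to the threshold selects the first (smaller) parameter, matching the "greater than or equal to" condition in the text.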
Optionally, in this embodiment of the present application, the processor 110 is further configured to synthesize the noise-reduced low-frequency signal with a first signal to obtain a target audio signal, and to output the target audio signal.
The first signal is obtained based on a high-frequency signal in the first audio signal, the high-frequency signal being an audio signal whose frequency lies outside the preset frequency range.
It follows that, because the electronic device can divide the first audio signal into a low-frequency signal and a high-frequency signal, apply different amounts of noise reduction to the low-frequency signal and the high-frequency signal respectively, and then synthesize the noise-reduced low-frequency signal with the noise-reduced high-frequency signal, damage to voiced speech in the noise-reduced speech signal can be avoided. This in turn avoids a loss of clarity in the speech signal after noise reduction processing, which improves the effect of the noise reduction processing that the electronic device performs on the audio signal.
Optionally, in this embodiment of the present application, the processor 110 is further configured to acquire the high-frequency signal and to perform noise reduction processing on the high-frequency signal based on a third noise reduction parameter to obtain the first signal.
It follows that, when the noise in the audio signal is strong, the audio processing apparatus can apply a larger noise reduction parameter, and hence a larger amount of noise reduction, to the audio signal corresponding to unvoiced speech (that is, the high-frequency signal). This avoids a loss of clarity in the speech signal after noise reduction processing while reducing the noise in the processed audio signal, which improves the effect of the noise reduction processing that the electronic device performs on the audio signal.
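The band split, per-band noise reduction, and synthesis described in the preceding paragraphs can be sketched on a list of spectral values. This is an illustration under several assumptions, not the method of the application: each noise reduction parameter is interpreted as an attenuation in dB, a flat per-band gain stands in for the actual noise reduction algorithm, and the preset frequency range is taken to be everything at or below a hypothetical `cutoff_hz`. The names `denoise_and_synthesize` and `bin_hz` are also hypothetical.

```python
def denoise_and_synthesize(spectrum, bin_hz, cutoff_hz, low_param_db, high_param_db):
    """Attenuate the low and high bands separately and return one spectrum.

    spectrum      : spectral values, one per frequency point (real or complex)
    bin_hz        : frequency spacing between adjacent points
    cutoff_hz     : assumed upper edge of the preset (low) frequency range
    low_param_db  : target noise reduction parameter for the low band
    high_param_db : third noise reduction parameter, for the high band
    """
    low_gain = 10.0 ** (-low_param_db / 20.0)
    high_gain = 10.0 ** (-high_param_db / 20.0)
    output = []
    for index, value in enumerate(spectrum):
        in_low_band = index * bin_hz <= cutoff_hz
        # Synthesis here is simply writing both bands back into one spectrum.
        output.append(value * (low_gain if in_low_band else high_gain))
    return output
```

For example, with `low_param_db=6.0` and `high_param_db=20.0`, points in the low band keep about half of their amplitude while points in the high band keep one tenth, reflecting the smaller noise reduction amount applied where voiced speech is expected.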
It should be understood that, in this embodiment of the present application, the input unit 104 may include a graphics processing unit (GPU) 1041 and a microphone 1042. The graphics processing unit 1041 processes image data of still pictures or video obtained by an image capture device (such as a camera) in a video capture mode or an image capture mode. The display unit 106 may include a display panel 1061, which may be configured in the form of a liquid crystal display, an organic light-emitting diode, or the like. The user input unit 107 includes a touch panel 1071 and other input devices 1072. The touch panel 1071, also called a touch screen, may include two parts: a touch detection device and a touch controller. The other input devices 1072 may include, but are not limited to, a physical keyboard, function keys (such as volume control keys and switch keys), a trackball, a mouse, and a joystick, which are not described further here. The memory 109 may be used to store software programs and various data, including but not limited to application programs and an operating system. The processor 110 may integrate an application processor and a modem processor, where the application processor mainly handles the operating system, user interface, and application programs, and the modem processor mainly handles wireless communication. It can be understood that the modem processor may also not be integrated into the processor 110.
An embodiment of the present application further provides a readable storage medium storing a program or instructions which, when executed by a processor, implement each process of the above audio signal processing method embodiment and achieve the same technical effect. To avoid repetition, details are not repeated here.
The processor is the processor in the electronic device described in the above embodiment. The readable storage medium includes a computer-readable storage medium, such as a computer read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
An embodiment of the present application further provides a chip. The chip includes a processor and a communication interface coupled to the processor, and the processor is configured to run a program or instructions to implement each process of the above audio signal processing method embodiment and achieve the same technical effect. To avoid repetition, details are not repeated here.
It should be understood that the chip mentioned in the embodiments of the present application may also be called a system-level chip, a system chip, a chip system, or a system-on-chip.
It should be noted that, herein, the terms "comprise", "include", and any other variants thereof are intended to cover a non-exclusive inclusion, so that a process, method, article, or apparatus comprising a series of elements includes not only those elements but also other elements not expressly listed, or elements inherent to such a process, method, article, or apparatus. Without further limitation, an element qualified by the phrase "comprising a ..." does not preclude the presence of additional identical elements in the process, method, article, or apparatus that includes the element. In addition, the scope of the methods and apparatuses in the embodiments of the present application is not limited to performing functions in the order shown or discussed; depending on the functions involved, functions may also be performed substantially simultaneously or in the reverse order. For example, the described methods may be performed in an order different from that described, and various steps may be added, omitted, or combined. Furthermore, features described with reference to certain examples may be combined in other examples.
From the description of the above embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by software together with a necessary general-purpose hardware platform, or by hardware, although in many cases the former is the better implementation. Based on this understanding, the technical solution of the present application, in essence or in the part that contributes to the prior art, can be embodied in the form of a computer software product. The computer software product is stored in a storage medium (such as a ROM/RAM, a magnetic disk, or an optical disc) and includes several instructions that cause a terminal (which may be a mobile phone, a computer, a server, a network device, or the like) to execute the methods described in the embodiments of the present application.
The embodiments of the present application have been described above with reference to the accompanying drawings, but the present application is not limited to the specific embodiments described, which are merely illustrative rather than restrictive. Under the teaching of the present application, those of ordinary skill in the art can devise many further forms without departing from the purpose of the present application and the scope protected by the claims, all of which fall within the protection of the present application.

Claims (15)

  1. An audio signal processing method, wherein the method comprises:
    acquiring a low-frequency signal in a first audio signal, the low-frequency signal being an audio signal whose frequency is within a preset frequency range, the low-frequency signal comprising M frequency points, M being a positive integer;
    determining M probability values respectively according to energy values of the M frequency points, each probability value indicating the probability that a speech signal is present at its corresponding frequency point;
    determining a target noise reduction parameter based on the M probability values, the target noise reduction parameter characterizing the noise reduction amount with which an electronic device performs noise reduction processing on an audio signal; and
    performing noise reduction processing on the low-frequency signal based on the target noise reduction parameter.
  2. The method according to claim 1, wherein the determining a target noise reduction parameter based on the M probability values comprises:
    determining a target probability value according to the M probability values, the target probability value being the average of the M probability values; and
    determining the target noise reduction parameter based on the target probability value.
  3. The method according to claim 2, wherein the determining the target noise reduction parameter based on the target probability value comprises:
    in a case where the target probability value is greater than or equal to a preset threshold, determining a first noise reduction parameter as the target noise reduction parameter; or
    in a case where the target probability value is less than the preset threshold, determining a second noise reduction parameter as the target noise reduction parameter.
  4. The method according to claim 1, wherein after the performing noise reduction processing on the low-frequency signal based on the target noise reduction parameter, the method further comprises:
    synthesizing the noise-reduced low-frequency signal with a first signal to obtain a target audio signal, and outputting the target audio signal;
    wherein the first signal is obtained based on a high-frequency signal in the first audio signal, the high-frequency signal being an audio signal whose frequency is outside the preset frequency range.
  5. The method according to claim 4, wherein before the synthesizing the noise-reduced low-frequency signal with the first signal, the method further comprises:
    acquiring the high-frequency signal, and performing noise reduction processing on the high-frequency signal based on a third noise reduction parameter to obtain the first signal.
  6. An audio signal processing apparatus, wherein the audio signal processing apparatus comprises an acquisition module, a determination module, and a processing module;
    the acquisition module is configured to acquire a low-frequency signal in a first audio signal, the low-frequency signal being an audio signal whose frequency is within a preset frequency range, the low-frequency signal comprising M frequency points, M being a positive integer;
    the determination module is configured to determine M probability values respectively according to energy values of the M frequency points, each probability value indicating the probability that a speech signal is present at its corresponding frequency point, and to determine a target noise reduction parameter based on the M probability values, the target noise reduction parameter characterizing the noise reduction amount with which the audio signal processing apparatus performs noise reduction processing on an audio signal;
    the processing module is configured to perform noise reduction processing on the low-frequency signal based on the target noise reduction parameter determined by the determination module.
  7. The audio signal processing apparatus according to claim 6, wherein the determination module is specifically configured to determine a target probability value according to the M probability values, the target probability value being the average of the M probability values, and to determine the target noise reduction parameter based on the target probability value.
  8. The audio signal processing apparatus according to claim 7, wherein the determination module is configured to determine a first noise reduction parameter as the target noise reduction parameter in a case where the target probability value is greater than or equal to a preset threshold, or to determine a second noise reduction parameter as the target noise reduction parameter in a case where the target probability value is less than the preset threshold.
  9. The audio signal processing apparatus according to claim 6, wherein the processing module is further configured to synthesize the noise-reduced low-frequency signal with a first signal to obtain a target audio signal, and to output the target audio signal;
    wherein the first signal is obtained based on a high-frequency signal in the first audio signal, the high-frequency signal being an audio signal whose frequency is outside the preset frequency range.
  10. The audio signal processing apparatus according to claim 9, wherein the acquisition module is further configured to acquire the high-frequency signal and to perform noise reduction processing on the high-frequency signal based on a third noise reduction parameter to obtain the first signal.
  11. An electronic device, comprising a processor, a memory, and a program or instructions stored in the memory and executable on the processor, wherein the program or instructions, when executed by the processor, implement the steps of the audio signal processing method according to any one of claims 1 to 5.
  12. A readable storage medium, wherein the readable storage medium stores a program or instructions which, when executed by a processor, implement the steps of the audio signal processing method according to any one of claims 1 to 5.
  13. A chip, wherein the chip comprises a processor and a communication interface, the communication interface is coupled to the processor, and the processor is configured to run a program or instructions to implement the steps of the audio signal processing method according to any one of claims 1 to 5.
  14. A computer program product, wherein the computer program product is stored in a non-volatile storage medium and is executed by at least one processor to implement the steps of the audio signal processing method according to any one of claims 1 to 5.
  15. An electronic device, wherein the electronic device is configured to perform the steps of the audio signal processing method according to any one of claims 1 to 5.
PCT/CN2021/141628 2020-12-31 2021-12-27 Audio signal processing method and apparatus, and electronic device WO2022143522A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011628024.3A CN112969130A (en) 2020-12-31 2020-12-31 Audio signal processing method and device and electronic equipment
CN202011628024.3 2020-12-31

Publications (1)

Publication Number Publication Date
WO2022143522A1 true WO2022143522A1 (en) 2022-07-07

Family

ID=76271618

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/141628 WO2022143522A1 (en) 2020-12-31 2021-12-27 Audio signal processing method and apparatus, and electronic device

Country Status (2)

Country Link
CN (1) CN112969130A (en)
WO (1) WO2022143522A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112969130A (en) * 2020-12-31 2021-06-15 维沃移动通信有限公司 Audio signal processing method and device and electronic equipment
CN113556654B (en) * 2021-07-16 2022-11-22 RealMe重庆移动通信有限公司 Audio data processing method and device and electronic equipment

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102938254A (en) * 2012-10-24 2013-02-20 中国科学技术大学 Voice signal enhancement system and method
CN104424954A (en) * 2013-08-20 2015-03-18 华为技术有限公司 Noise estimation method and device
US20150127331A1 (en) * 2013-11-07 2015-05-07 Continental Automotive Systems, Inc. Speech probability presence modifier improving log-mmse based noise suppression performance
CN110875054A (en) * 2018-08-31 2020-03-10 阿里巴巴集团控股有限公司 Far-field noise suppression method, device and system
CN111477243A (en) * 2020-04-16 2020-07-31 维沃移动通信有限公司 Audio signal processing method and electronic equipment
CN112969130A (en) * 2020-12-31 2021-06-15 维沃移动通信有限公司 Audio signal processing method and device and electronic equipment

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2000072305A2 (en) * 1999-05-19 2000-11-30 Noisecom Aps A method and apparatus for noise reduction in speech signals
CN1991976A (en) * 2005-12-31 2007-07-04 潘建强 Phoneme based voice recognition method and system
US9721580B2 (en) * 2014-03-31 2017-08-01 Google Inc. Situation dependent transient suppression
CN108074582B (en) * 2016-11-10 2021-08-06 电信科学技术研究院 Noise suppression signal-to-noise ratio estimation method and user terminal
CN108831508A (en) * 2018-06-13 2018-11-16 百度在线网络技术(北京)有限公司 Voice activity detection method, device and equipment
CN109616139B (en) * 2018-12-25 2023-11-03 平安科技(深圳)有限公司 Speech signal noise power spectral density estimation method and device
CN110827858B (en) * 2019-11-26 2022-06-10 思必驰科技股份有限公司 Voice endpoint detection method and system
CN110890104B (en) * 2019-11-26 2022-05-03 思必驰科技股份有限公司 Voice endpoint detection method and system
CN111899752B (en) * 2020-07-13 2023-01-10 紫光展锐(重庆)科技有限公司 Noise suppression method and device for rapidly calculating voice existence probability, storage medium and terminal

Also Published As

Publication number Publication date
CN112969130A (en) 2021-06-15

Similar Documents

Publication Publication Date Title
WO2022143522A1 (en) Audio signal processing method and apparatus, and electronic device
CN111524498B (en) Filtering method and device and electronic equipment
CN108595431B (en) Voice interaction text error correction method, device, terminal and storage medium
US10553236B1 (en) Multichannel noise cancellation using frequency domain spectrum masking
US11349525B2 (en) Double talk detection method, double talk detection apparatus and echo cancellation system
CN109756818B (en) Dual-microphone noise reduction method and device, storage medium and electronic equipment
CN108806713B (en) Method and device for detecting double-speech state
JP7218391B2 (en) NOISE REDUCTION METHOD, APPARATUS, ELECTRONIC DEVICE, STORAGE MEDIUM, AND PROGRAM FOR IN-VEHICLE ENVIRONMENT
US20200265848A1 (en) Method for recovering audio signals, terminal and storage medium
WO2022160715A1 (en) Voice signal processing method and electronic device
WO2020252629A1 (en) Residual acoustic echo detection method, residual acoustic echo detection device, voice processing chip, and electronic device
US20240046947A1 (en) Speech signal enhancement method and apparatus, and electronic device
CN112532958A (en) Image processing method, image processing device, electronic equipment and readable storage medium
CN112015365A (en) Volume adjustment method and device and electronic equipment
WO2024041512A1 (en) Audio noise reduction method and apparatus, and electronic device and readable storage medium
CN112289336B (en) Audio signal processing method and device
WO2024051676A1 (en) Model training method and apparatus, electronic device, and medium
CN113160846A (en) Noise suppression method and electronic device
CN110890104B (en) Voice endpoint detection method and system
CN112399302A (en) Audio playing method and device of wearable audio playing device
CN110992975A (en) Voice signal processing method and device and terminal
CN115662409B (en) Voice recognition method, device, equipment and storage medium
CN114333912B (en) Voice activation detection method, device, electronic equipment and storage medium
CN113766385B (en) Earphone noise reduction method and device
CN116095565A (en) Audio signal processing method, device, electronic equipment and readable storage medium

Legal Events

Date Code Title Description
121 Ep: the EPO has been informed by WIPO that EP was designated in this application (Ref document number: 21914258; Country of ref document: EP; Kind code of ref document: A1)
NENP Non-entry into the national phase (Ref country code: DE)
122 Ep: PCT application non-entry in European phase (Ref document number: 21914258; Country of ref document: EP; Kind code of ref document: A1)