CN113973250A

CN113973250A - Noise suppression method and device and auxiliary listening earphone

Info

Publication number: CN113973250A
Application number: CN202111249997.0A
Authority: CN
Inventors: 李倩; 江源; 伍星强
Original assignee: Bestechnic Shanghai Co Ltd
Current assignee: Bestechnic Shanghai Co Ltd
Priority date: 2021-10-26
Filing date: 2021-10-26
Publication date: 2022-01-25
Anticipated expiration: 2041-10-26
Also published as: CN113973250B

Abstract

The present disclosure relates to a noise suppression method, a noise suppression device and an auxiliary listening earphone, wherein the noise suppression method is used for noise suppression of the auxiliary listening earphone, and comprises: acquiring an environmental audio signal; after the environmental audio signal is processed by an analysis filter bank, dividing the environmental audio signal into a plurality of first sub-band parts according to frequency bands; down-sampling the environment audio signal and transforming the environment audio signal to a frequency domain to obtain a plurality of second sub-band parts, wherein the frequency band interval of the plurality of second sub-band parts is less than or equal to the minimum frequency band interval of the plurality of first sub-band parts; determining the voice existence probability of each second sub-band part; determining the gain of each second subband portion based on the probability of speech presence in that second subband portion; processing each first subband portion based on the gain of each second subband portion; and synthesizing each processed first sub-band portion. The method can achieve good noise reduction effect while realizing low-delay auxiliary listening, and effectively improves the use experience of auxiliary listening earphones.

Description

Noise suppression method and device and auxiliary listening earphone

Technical Field

The present disclosure relates to signal processing technologies, and in particular, to a noise suppression method and apparatus, and an auxiliary listening headphone.

Background

When the existing PSAP earphone amplifies environment sounds, a voice signal and an environment noise signal are amplified simultaneously, but sometimes a user has communication requirements, and at the moment, external environment noise influences the speech intelligibility and hinders normal communication.

The assistant listens to the particularity of earphone usage scene, has caused the contradiction of making an uproar and assisting to listen, and the assistant listens to the earphone on the one hand and need let in the environment for example the sound of interlocutor through assisting listening the earphone to require low time delay, on the other hand, the assistant listens the earphone and still need let the noise reduction in the environment, plays the effect of making an uproar. In practical use, the noise in the environment and the voice of the interlocutor exist at the same time, but noise reduction is very difficult at this time, and if the noise is not reduced, the user feels that the environmental noise is too large, so that communication is influenced.

Disclosure of Invention

The method and the device for suppressing the noise and the auxiliary listening earphone are provided, and the noise reduction effect of the auxiliary listening earphone is achieved.

In a first aspect, an embodiment of the present disclosure provides a noise suppression method for noise suppression of an auxiliary listening headphone, where the noise suppression method includes the following steps: acquiring an environmental audio signal; after the environment audio signal is processed by an analysis filter bank, dividing the environment audio signal into a plurality of first sub-band parts according to frequency bands; down-sampling and transforming the environment audio signal to a frequency domain to obtain a plurality of second sub-band portions, wherein the frequency band interval of the plurality of second sub-band portions is less than or equal to the minimum frequency band interval of the plurality of first sub-band portions; determining the voice existence probability of each second sub-band part; determining the gain of each second subband portion based on the probability of speech presence in that second subband portion; processing each first subband portion based on the gain of each second subband portion; and synthesizing each processed first sub-band portion.

In a second aspect, embodiments of the present disclosure also provide a noise suppression apparatus, including a memory and a processor; the memory is configured to store a computer program, and the processor is configured to implement the steps of the noise suppression method according to the embodiments of the present disclosure when executing the computer program.

In a third aspect, embodiments of the present disclosure also provide a noise suppression component, including a processor configured to: down-sampling an environment audio signal acquired by a feedforward microphone of an auxiliary listening earphone and converting the environment audio signal into a frequency domain to obtain a plurality of second sub-band parts, wherein the frequency band intervals of the plurality of second sub-band parts are less than or equal to the minimum frequency band interval between the first sub-band parts processed by a first filter bank of the auxiliary listening earphone; determining the voice existence probability of each second sub-band part; determining the gain of each second subband portion based on the probability of speech presence in that second subband portion; each first subband unit is processed based on the gain of each second subband unit.

In a fourth aspect, an embodiment of the present disclosure further provides an auxiliary listening headphone, including a feedforward microphone, a first filter bank, a second filter bank, and the aforementioned noise suppression component: the feedforward microphone configured to acquire an ambient audio signal; the first filter bank is configured to divide the ambient audio signal into a plurality of first sub-band portions according to frequency bands; the second filter bank synthesizes the first subband portions processed by the noise suppression module.

By using the noise suppression method and device and the auxiliary listening earphone according to the embodiments of the present disclosure, the gain of each second subband portion is determined by determining the existence probability of the voices in the plurality of second subband portions in the frequency domain, so as to determine the gain of the second subband portion, and adjust the gain of the first subband portion corresponding to the hardware loop of the auxiliary listening earphone by using the gain of the second subband portion, thereby achieving good noise reduction effect while performing auxiliary listening, and effectively improving the use experience of the auxiliary listening earphone.

Drawings

In the drawings, which are not necessarily drawn to scale, like reference numerals may describe similar components in different views. Like reference numerals having letter suffixes or different letter suffixes may represent different instances of similar components. The drawings illustrate various embodiments generally by way of example and not by way of limitation, and together with the description and claims serve to explain the disclosed embodiments. Such embodiments are illustrative, and are not intended to be exhaustive or exclusive embodiments of the present apparatus or method.

Fig. 1 shows a signal processing flow diagram of a PSAP headset according to the present disclosure;

FIG. 2 illustrates a basic flow diagram of a noise suppression method according to the present disclosure;

FIG. 3 illustrates a signal processing flow diagram of a noise suppression method according to the present disclosure;

fig. 4 illustrates a flow chart for determining the gain for each first subband according to the noise suppression method of the present disclosure.

Detailed Description

For a better understanding of the technical aspects of the present disclosure, reference is made to the following detailed description taken in conjunction with the accompanying drawings. Embodiments of the present disclosure are described in further detail below with reference to the figures and the detailed description, but the present disclosure is not limited thereto.

The use of "first," "second," and similar terms in this disclosure are not intended to indicate any order, quantity, or importance, but rather are used for distinction. The word "comprising" or "comprises", and the like, means that the element preceding the word covers the element listed after the word, and does not exclude the possibility that other elements are also covered.

The embodiment of the present disclosure provides a noise suppression method, which is used for noise suppression of a hearing-assisted PSAP (public safety access point) earphone, where fig. 1 shows a schematic diagram of signal processing of the PSAP earphone, as shown in fig. 1, after an ambient audio signal x (n) passes through an analysis filter bank of the hearing-assisted earphone, the ambient audio signal x (n) is divided into a plurality of subband parts according to frequency bands, the plurality of subband parts may respectively correspond to a multiplier (Gain0-Gain n) behind the subband parts, for example, one multiplier may correspondingly process one frequency band, a signal processed by the multiplier enters a corresponding dynamic range control DRC, such as a signal DRC contained in fig. 1 (DRC0-DRCN), and finally, after passing through a synthesis filter bank, a synthesized signal y (n) is output. Due to the fact that the noise analysis module is high in calculation complexity and large in circuit scale, noise reduction of the auxiliary listening earphone is difficult to achieve directly on a hardware structure, and due to the fact that the noise reduction method of adding the noise analysis module in the hardware structure is adopted, a data buffer structure is added to an existing channel, more extra processing delay is introduced, user experience is affected, and the noise reduction method cannot be applied to a scene needing low delay of auxiliary listening. As shown in fig. 2, the noise suppression method proposed by the present disclosure starts with step S201, obtaining an ambient audio signal, for example, the ambient audio signal may include a speech signal and an ambient noise signal. After the environmental audio signal is processed by the analysis filter bank in step S202, the environmental audio signal is divided into a plurality of first sub-band portions according to frequency bands. That is, the analysis filter bank of the PSAP headset hardware loop may be directly utilized to divide the ambient audio signal into a plurality of first sub-band portions according to frequency band.

The ambient audio signal is then down-sampled and transformed to the frequency domain in step S203 to obtain a plurality of second sub-band portions, wherein the frequency band interval of the plurality of second sub-band portions is less than or equal to the minimum frequency band interval of the plurality of first sub-band portions. As shown in fig. 3, down-sampling and transforming the environmental audio signal to the frequency domain may be accomplished by a software algorithm, such as FFT transformation of the environmental audio signal, where the frequency band interval of the down-sampled second sub-band portions is less than or equal to the minimum frequency band interval of the first sub-band portions, and by this arrangement, it can be ensured that the second sub-band portions can be completely mapped onto the first sub-band portions, so that any segment of the environmental audio signal is not missed, and the integrity of the signal is ensured.

In step S204, the voice existence probability of each second sub-band portion is determined, for example, the voice existence probability of each frequency band may be determined based on the second sub-band portions of the respective frequency bands. The second sub-band portion of each frequency band uniquely identifies the corresponding speech presence probability.

The gain of each second sub-band portion is determined in step S205 based on the probability of speech presence in that second sub-band portion. For example, the gain of the second sub-band portion with the speech existence probability of 0 may be set to 0, so that noise in the frequency band can be directly filtered out, and the gain of the second sub-band portion may be set according to the magnitude of the speech existence probability in other frequency bands in which speech exists.

Finally, in step S206, each first subband unit is processed based on the gain of each second subband unit; and synthesizing each processed first sub-band portion. The gain determined by the software algorithm can be mapped into a hardware loop through the method disclosed by the invention, then the noise reduction processing can be realized through a multiplier in the hardware loop, and the signal after noise reduction is output after the synthesis through a subsequent synthesis filter.

In some embodiments, the frequency band is divided into a plurality of first subband parts by analysis filter processing, then gain adjustment of each subband is carried out, DRC processing and synthesis filter synthesis processing are completed through a hardware path of the auxiliary earphone, and the sampling frequency of the hardware path is greater than or equal to 96 kHz. Specific hardware paths can also refer to 3, and the hardware paths can include an analysis filter, a multiplier (Gain0-Gain N), a dynamic range control (DRC0-DRCN), and a synthesis filter, for example, the multiplier has a function of completing Gain adjustment according to each subband Gain obtained by a software algorithm. By setting the hardware path of the auxiliary listening earphone to work in a high sampling rate (more than or equal to 96kHz) mode, the path delay caused by the introduction of a down-sampling filter due to data down-sampling is avoided. And because the hardware path does not need software participation, and other data buffer structures which can introduce delay do not exist on the hardware path, the delay of the auxiliary earphone is greatly reduced by the method.

In some embodiments, as shown in fig. 4, processing each first subband portion based on the gain of each second subband portion may specifically include: in step S401, a first frequency band of each first sub-band portion is determined, and a second frequency band of each second sub-band portion is determined. For example, in fig. 3, the first frequency band obtained by dividing the environmental audio signal into a plurality of first sub-band portions according to the frequency band by the analysis filter bank of the hardware loop is 500, 1000,2000,4000 1000,2000,4000 …, and the second frequency band obtained by FFT transformation of the second sub-band portion by the software algorithm may be 125, 250, 375, 500, 625, 750, 875, 1000, 1125, …, which is only an example, and the specific sampling frequency band may be set according to the actual needs.

In step S402, the selectable gain of each first sub-band portion is determined based on the gain of each second sub-band portion and the inclusion relationship between each second frequency band and each first frequency band. For example, in an example where there is a correspondence between the 125, 250, 375, 500 frequency bands of the second sub-band portion and the 500 frequency bands of the first sub-band portion, the gain of the 125, 250, 375, 500 frequency bands may be used as the selectable gain of the 500 frequency bands of the first sub-band portion.

The gain of each first sub-band portion is determined in step S403 based on the selectable gain of that first sub-band portion. The gain of one frequency band may be selected, for example, from the selectable gains of 125, 250, 375, 500 frequency bands, as the gain of the 500 frequency bands of the first sub-band portion. In this way, the gain of each frequency band in the hardware loop can be determined, so that the purpose of noise reduction is further realized through the gain of each first frequency band in the process of signal processing.

Determining the gains of the first sub-band portions based on the selectable gains of the first sub-band portions specifically comprises: the maximum of the selectable gains of each first sub-band portion is determined as the gain of that first sub-band portion. As a specific way to determine the gain, a maximum value of the selectable gains corresponding to the second frequency band may be determined as the gain of the first sub-band portion, for example, assuming that the second frequency band of the FFT is: 125, 250, 375, 500, 625, 750, 875, 1000, 1125, 1250,...,8000. The gain corresponding to each second frequency band is: g0, G1, G2, G3, G4, G5, G6, G7, G8, G9. The first frequency band of each first sub-band portion is: 500, 1000,2000,4000, 8000. The gain of each first sub-band portion is G ' 0, G ' 1, G ' 2, G ' 3, G ' 4, then:

G’0＝max{G0，G1，G2，G3}；

G’1＝max{G4，G5，G6，G7}；

G’2＝max{G8，G9，G10，G11，G12，G13，G14，G15}；

G’3＝max{G16，G13，G14，G15，...，G31}；

G’4＝max{G32，G33，G34，G35，...，G63}。

the mode of selecting the maximum value can ensure that each first sub-band part achieves the best definition retention effect of voice and the proper filtering effect of environmental noise, thereby realizing the purpose of noise reduction while assisting listening. The software algorithm utilized by the method disclosed by the invention can obtain the effect of low time delay even under a higher sampling rate, so that the defect of high time delay and complexity caused by noise reduction realized by a hardware or software method is overcome. The noise reduction method is particularly suitable for PSAP earphones, on one hand, low time delay can be brought, on the other hand, good noise reduction and filtering effects can be achieved through the method, and the use experience of the auxiliary listening earphones is improved.

In some embodiments, the speech existence probability of each second sub-band portion is a posterior speech existence probability, and determining the speech existence probability of each second sub-band portion specifically includes: determining the a posteriori speech presence probability based on the a priori speech presence probability, the a priori signal-to-noise ratio and an intermediate variable represented by the a priori signal-to-noise ratio and the a posteriori signal-to-noise ratio of the second sub-band portions such that the a posteriori speech presence probability increases with increasing a priori signal-to-noise ratio and/or intermediate variable.

For example, as a specific example, the speech existence probability p (k) of each second subband portion satisfies:

where q (k) represents the prior speech presence probability, typically taking the value 0.5, ξ (k) represents the prior signal-to-noise ratio for frequency bin k, and υ (k) represents an intermediate variable that, in some embodiments, increases as the prior signal-to-noise ratio and/or the posterior signal-to-noise ratio increases. For example, ν (k) ═ γ (k) ξ (k)/(ξ (k) +1), γ (k) denotes the posterior signal-to-noise ratio. Xi (k) ═ alpha_pG²(k,l-1)|Y(k,l-1)|²+(1-α_p) max [ gamma (k, l) -1,0 ], where l denotes the current frame, l-1 denotes the previous frame, α_pIs constant between 0 and 1.

Representing the posterior signal-to-noise ratio of frequency band k, | Y (k) & ltnon calculation |)²Representing the signal power and λ (k) representing the noise power.

An exemplary iterative computation of the noise power λ (k) satisfies:

λ(k,l)＝α_powλ(k,l-1)+(1-α_pow)(1-p(k))|Y(k,l)|²

α_powis a constant between 0 and 1, where l represents the current frame and l-1 represents the previous frame.

In some embodiments, determining the gain of each second sub-band portion based on the probability of speech presence in the second sub-band portion comprises: the gain of each second sub-band is determined so as to increase as the posterior speech existence probability and the intermediate spectral gain of the second sub-band increase, based on the posterior speech existence probability, the intermediate spectral gain, and the lower gain limit in the absence of speech of the second sub-band. For example, the gain of each frequency band is determined, and the calculation method satisfies the following conditions:

wherein G is_minIs a constant, representing the lower noise reduction gain limit when no speech is present, and is at least 0. A is a constant typically having a value of 1/2.

Representing the intermediate spectral gain.

Satisfies the following conditions:

wherein the content of the first and second substances,

in order to be a function of the chi-square distribution,

is a confluent hyper-geometric function.

The noise reduction gain calculation method is only one embodiment listed in the present invention, and the specific noise reduction gain calculation method can also be obtained by using various single-channel or multi-channel microphone noise reduction schemes, such as a single-channel-based DNN method, an OMLSA method, an MMSE noise reduction method based on stationary noise estimation, and a multi-channel-based MVDR and DNN method.

PSAP earphone pursues low time delay hearing aid effect, if adopt general processing means, add the module of making an uproar before PSAP passageway, send PSAP passageway again after the noise reduction treatment, can to a great extent increase the passageway time delay, influence the naturalness of hearing aid effect. However, if the noise is not reduced, the user feels that the environmental noise is too large, and the communication is influenced. The noise suppression method disclosed by the invention can estimate the noise in the data coming from the external environment in real time under a low sampling rate, analyze the noise distribution condition and determine the gain of a frequency band, thereby controlling the gain of each sub-band on a PSAP (public safety access point) hardware channel (on a high sampling rate) in real time, reducing the noise and ensuring that a user can have a good use experience under different scenes.

The embodiment of the present disclosure further provides a noise suppression device, which includes a memory and a processor; the memory is configured to store a computer program, and the processor is configured to implement the steps of the noise suppression method according to the embodiments of the present disclosure when executing the computer program.

Embodiments of the present disclosure also provide a noise suppression component, comprising a processor configured to: down-sampling an environment audio signal acquired by a feedforward microphone of an auxiliary listening earphone and converting the environment audio signal into a frequency domain to obtain a plurality of second sub-band parts, wherein the frequency band intervals of the plurality of second sub-band parts are less than or equal to the minimum frequency band interval between the first sub-band parts processed by a first filter bank of the auxiliary listening earphone; determining the voice existence probability of each second sub-band part; determining the gain of each second subband portion based on the probability of speech presence in that second subband portion; each first subband unit is processed based on the gain of each second subband unit. In some embodiments the processor may be a processing device that includes one or more general purpose processing devices, such as a microprocessor, Central Processing Unit (CPU), Graphics Processing Unit (GPU), etc. More specifically, the processor may be a Complex Instruction Set Computing (CISC) microprocessor, Reduced Instruction Set Computing (RISC) microprocessor, Very Long Instruction Word (VLIW) microprocessor, running other instructionsA processor of a set or a processor executing a combination of instruction sets. The processor may also be one or more special-purpose processing devices such as an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA), a Digital Signal Processor (DSP), a system on a chip (SoC), or the like. As will be appreciated by those skilled in the art, in some embodiments, the processor may be a special purpose processor rather than a general purpose processor. The processor may include one or more known processing devices, such as a Pentium (TM), Core (TM), Xeon (TM) or Itanium (TM) family of microprocessors manufactured by Intel corporation, Turion (TM), Athlon (TM), Sempron (TM), Opteron (TM), FX (TM) manufactured by AMD corporation^TM、Phenom^TMA family of microprocessors or any of a variety of processors manufactured by Sun Microsystems. The processor may also include a graphics processing unit, such as from Nvidia corporation

GPU series, GMA manufactured by Intel, Iris, or Radon, manufactured by AMD. The processor may also include an accelerated processing unit such as the desktop A-4(6, 8) series manufactured by AMD, Inc., or the Xeon Phi (TM) series manufactured by Intel, Inc. The disclosed embodiments are not limited to any type of processor or processor circuit that is otherwise configured to satisfy the processing of the ambient audio signal. In addition, the terms "processor" or "image processor" may include more than one processor, e.g., a multi-core design or multiple processors, each of which has a multi-core design.

In some embodiments, the processor is further configured to determine a first frequency band of each first sub-band portion and to determine a second frequency band of each second sub-band portion; determining the selectable gain of each first sub-band part based on the gain of each second sub-band part and the inclusion relation between each second frequency band and each first frequency band; the gain of each first sub-band portion is determined based on the selectable gain of that first sub-band portion.

In some embodiments, the processor is further configured to determine the maximum of the selectable gains of each first sub-band portion as the gain of that first sub-band portion.

In some embodiments, the intermediate variable increases with an increase in the a priori signal-to-noise ratio and/or the a posteriori signal-to-noise ratio.

The embodiment of the present disclosure further provides an auxiliary listening earphone, including a feedforward microphone, a first filter bank, a second filter bank and the aforementioned noise suppression component: the feedforward microphone configured to acquire an ambient audio signal; the first filter bank is configured to divide the ambient audio signal into a plurality of first sub-band portions according to frequency bands; the second filter bank synthesizes the first subband portions processed by the noise suppression module. Wherein the first filter bank may be an analysis filter bank of a PSAP headset hardware loop and the second filter bank may be a synthesis filter bank of the PSAP headset hardware loop. The auxiliary hearing earphone can map the gain determined by the software algorithm into the hardware loop, and then can realize noise reduction processing through a multiplier in the hardware loop, and output a signal after noise reduction after synthesis through a subsequent synthesis filter, has high response speed, can meet the requirement of auxiliary hearing and simultaneously play a low-delay effect, and can effectively improve the use experience of a user.

In this document, the terms "a" or "an" are used, as is common in patent documents, to include one or more than one, independent of any other instances or usages of "at least one" or "one or more. Herein, unless otherwise indicated, the term "or" is used to refer to a non-exclusive or such that "a or B" includes "a but not B", "B but not a" and "a and B". In this document, the terms "including" and "in which" are used as the plain-english equivalents of the respective terms "comprising" and "in which". Furthermore, in the following claims, the terms "comprising" and "including" are intended to be open-ended, i.e., an apparatus, system, device, article, composition, formulation, or process that comprises elements other than those listed in a claim below as a matter of claim. Furthermore, in the following claims, the terms "first," "second," and "third," etc. are used merely as labels, and are not intended to impose numerical requirements on their objects.

The exemplary methods described herein may be machine or computer-implemented, at least in part. Some examples may include a computer-readable medium or machine-readable medium encoded with instructions operable to configure an electronic device to perform a method as described in the above examples. An implementation of such a method may include software code, such as microcode, assembly language code, higher level language code, or the like. Various programs or program modules may be created using various software programming techniques. For example, program segments or program modules may be designed using Java, Python, C + +, assembly language, or any known programming language. One or more of such software portions or modules may be integrated into a computer system and/or computer-readable medium. Such software code may include computer readable instructions for performing various methods. The software code may form part of a computer program product or a computer program module. Further, in one example, the software code can be tangibly stored on one or more volatile, non-transitory, or non-volatile tangible computer-readable media, such as during execution or at other times. Examples of such tangible computer-readable media may include, but are not limited to, hard disks, removable magnetic disks, removable optical disks (e.g., compact disks and digital video disks), magnetic cassettes, memory cards or sticks, Random Access Memories (RAMs), Read Only Memories (ROMs), and the like.

Moreover, although exemplary embodiments have been described herein, the scope thereof includes any and all embodiments based on the disclosure with equivalent elements, modifications, omissions, combinations (e.g., of various embodiments across), adaptations or alterations. The elements of the claims are to be interpreted broadly based on the language employed in the claims and not limited to examples described in the present specification or during the prosecution of the application, which examples are to be construed as non-exclusive. It is intended, therefore, that the specification and examples be considered as exemplary only, with a true scope and spirit being indicated by the following claims and their full scope of equivalents.

The above description is intended to be illustrative and not restrictive. For example, the above-described examples (or one or more versions thereof) may be used in combination with each other. For example, other embodiments may be used by those of ordinary skill in the art upon reading the above description. In addition, in the foregoing detailed description, various features may be grouped together to streamline the disclosure. This should not be interpreted as an intention that a disclosed feature not claimed is essential to any claim. Rather, the subject matter of the present disclosure may lie in less than all features of a particular disclosed embodiment. Thus, the following claims are hereby incorporated into the detailed description as examples or embodiments, with each claim standing on its own as a separate embodiment, and it is contemplated that these embodiments may be combined with each other in various combinations or permutations. The scope of the invention should be determined with reference to the appended claims, along with the full scope of equivalents to which such claims are entitled.

The above embodiments are only exemplary embodiments of the present disclosure, and are not intended to limit the present invention, the scope of which is defined by the claims. Various modifications and equivalents may be made thereto by those skilled in the art within the spirit and scope of the present disclosure, and such modifications and equivalents should be considered to be within the scope of the present invention.

Claims

1. A noise suppression method for noise suppression of an auxiliary listening headphone, the noise suppression method comprising the steps of:

acquiring an environmental audio signal;

after the environment audio signal is processed by an analysis filter bank, dividing the environment audio signal into a plurality of first sub-band parts according to frequency bands;

down-sampling and transforming the environment audio signal to a frequency domain to obtain a plurality of second sub-band portions, wherein the frequency band interval of the plurality of second sub-band portions is less than or equal to the minimum frequency band interval of the plurality of first sub-band portions;

determining the voice existence probability of each second sub-band part;

determining the gain of each second subband portion based on the probability of speech presence in that second subband portion;

processing each first subband portion based on the gain of each second subband portion; and

and synthesizing the processed first sub-band parts.

2. The method of noise suppression according to claim 1, wherein processing each first subband portion based on the gain of each second subband portion specifically comprises:

determining a first frequency band of each first sub-band portion and determining a second frequency band of each second sub-band portion;

determining the selectable gain of each first sub-band part based on the gain of each second sub-band part and the inclusion relation between each second frequency band and each first frequency band;

the gain of each first sub-band portion is determined based on the selectable gain of that first sub-band portion.

3. A method of noise suppression according to claim 2 wherein determining the gain of each first subband portion based on the selectable gains of that first subband portion comprises: the maximum of the selectable gains of each first sub-band portion is determined as the gain of that first sub-band portion.

4. The noise suppression method according to claim 1, wherein the speech existence probability of each second subband portion is an a posteriori speech existence probability, and the determining the speech existence probability of each second subband portion specifically comprises: determining the a posteriori speech presence probability based on the a priori speech presence probability, the a priori signal-to-noise ratio and an intermediate variable represented by the a priori signal-to-noise ratio and the a posteriori signal-to-noise ratio of the second sub-band portions such that the a posteriori speech presence probability increases with increasing a priori signal-to-noise ratio and/or intermediate variable.

5. The noise suppression method according to claim 4, characterized in that the intermediate variable increases with an increase of the a priori signal-to-noise ratio and/or the a posteriori signal-to-noise ratio.

6. The noise suppressing method according to claim 1, wherein the band is divided into a plurality of first subband parts by the analysis filter processing and each subband gain is adjusted, the DRC processing and the synthesis filter synthesis processing are performed by a hardware path of the headphone, and a sampling frequency of an operation of the hardware path is 96kHz or more.

7. A noise suppression apparatus comprising a memory and a processor; wherein the memory is adapted to store a computer program, and the processor is adapted to carry out the steps of the noise suppression method according to any one of claims 1 to 6 when executing the computer program.

8. A noise suppression component, comprising a processor configured to:

down-sampling an environment audio signal acquired by a feedforward microphone of an auxiliary listening earphone and converting the environment audio signal into a frequency domain to obtain a plurality of second sub-band parts, wherein the frequency band intervals of the plurality of second sub-band parts are less than or equal to the minimum frequency band interval between the first sub-band parts processed by a first filter bank of the auxiliary listening earphone;

determining the voice existence probability of each second sub-band part;

each first subband unit is processed based on the gain of each second subband unit.

9. A secondary listening headset comprising a feed forward microphone, a first filter bank, a second filter bank and a noise suppression component as claimed in claim 8:

the feedforward microphone configured to acquire an ambient audio signal;

the first filter bank is configured to divide the ambient audio signal into a plurality of first sub-band portions according to frequency bands;

the second filter bank synthesizes the first subband portions processed by the noise suppression module.