WO2007117785A2 - Adjustable noise suppression system - Google Patents

Adjustable noise suppression system Download PDF

Info

Publication number
WO2007117785A2
WO2007117785A2 PCT/US2007/063044 US2007063044W WO2007117785A2 WO 2007117785 A2 WO2007117785 A2 WO 2007117785A2 US 2007063044 W US2007063044 W US 2007063044W WO 2007117785 A2 WO2007117785 A2 WO 2007117785A2
Authority
WO
WIPO (PCT)
Prior art keywords
noise
signal
output
gain
input
Prior art date
Application number
PCT/US2007/063044
Other languages
French (fr)
Other versions
WO2007117785A3 (en
Inventor
Lucio F. Pessoa
Roman A. Dyba
David B. Melles
Original Assignee
Freescale Semiconductor Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Freescale Semiconductor Inc. filed Critical Freescale Semiconductor Inc.
Priority to EP07757694A priority Critical patent/EP2008379B1/en
Publication of WO2007117785A2 publication Critical patent/WO2007117785A2/en
Publication of WO2007117785A3 publication Critical patent/WO2007117785A3/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation

Definitions

  • This invention relates in general to data communication, and more specifically to techniques and apparatus for suppressing noise in a signal in a communication system.
  • High-level background noise in a wired or wireless telecommunications channel degrades in-band signaling and lowers the perceived voice quality of speech signals.
  • noise suppressors or noise reducers, are used to reduce the degradation caused by the background noise and to improve the signal-to- noise ratio (SNR) of noisy signals.
  • SNR signal-to- noise ratio
  • Spectral weighting means that different spectral regions of the mixed signal of speech and noise are attenuated or modified with different gain factors. The goal is to obtain a speech signal that contains less noise than the original speech signal. At the same time, the speech quality must remain substantially intact with a minimal distortion of the original speech.
  • Spectral weighting is typically performed in the frequency domain using the well- known Fourier transform.
  • Voice activity detectors are used to determine whether current signal samples represent predominantly voice or noise.
  • Energy estimators and signal-to-noise ratio estimators are used to calculate a factor that is then used to modify the level of a frequency-domain signal.
  • the signal to noise ratio is a measure of signal strength (e.g., voice strength) relative to background noise.
  • the frequency-domain signal as modified is then converted back to the time-domain.
  • noise suppressors One problem with noise suppressors is that the level of suppression can be too high or too low under various different conditions. Additionally, a noise suppressor that operates in the frequency domain, like the spectral weighting filter, can leave artifacts in the output signal, such as musical noise, jet engine roar, running water, or the like.
  • FIG. 1 depicts, in a simplified and representative form, a high-level block diagram of a communications system having voice enhancement devices connected through a communication channel in accordance with one or more embodiments;
  • FIG. 2 is a more detailed representative block diagram of a voice enhancement device in accordance with one or more embodiments
  • FIG. 3 depicts a block diagram of a noise suppressor system in accordance with one or more embodiments
  • FIG. 4 shows a more detailed block diagram of a post-filtering analyzer that can be used in conjunction with the FIG. 3 noise suppressor system in accordance with one or more embodiments;
  • FIG. 5 depicts a more detailed block diagram of a minimum gain adapter that can be used in conjunction with the FIG. 3 noise suppressor system in accordance with one or more embodiments.
  • FIG. 6 shows a high-level flowchart of processes executed by a noise suppressor system that can be used in conjunction with the FIG. 2 voice enhancement device in accordance with one or more embodiments.
  • the present disclosure concerns noise suppression in voice enhancement devices. More particularly various inventive concepts and principles embodied in methods and apparatus may be used for adjusting a minimum overall gain, i.e., level of noise suppression, in a noise suppression system in a voice enhancement device.
  • voice enhancement device of particular interest may vary widely, one embodiment may advantageously be used in a wireless communication system or a wireless networking system, such as a cellular wireless network. Additionally, the inventive concepts and principles taught herein can be advantageously applied to wired communications systems, such as a telephone system.
  • FIG. 1 there is depicted, in a simplified and representative form, a high-level block diagram of communications system 100 having voice enhancement devices
  • Voice enhancement devices 102 and 104 are generally devices for processing, filtering, and conditioning a voice signal to improve the voice quality and sound clarity of wireless and wired signals before they are transmitted through a communication network, such as communication network 106.
  • Communication network 106 can be a wired or wireless communication network.
  • signals e.g., voice signals v(n) 108 and v' ⁇ n) 110 or the like are combined, respectively, with noise signals d(n) 112 and d' ⁇ n) 114, which are shown at adders 116 and 118, to produce input signals x(n) 120 and x' ⁇ n) 122.
  • Noise signals 112 and 114 include the effects of ambient sounds 103 and 105 (i.e., sounds that surround the user who is the source of the voice signal), respectively, in addition to any noise or distortion caused by the equipment or the environment, such as the acoustics of the microphones, electronic interference or any electronic processing of the signal before voice signals 108 and 110 are input into voice enhancement devices 102 and 104.
  • Ambient sounds 103 and 105 can include, for example, road and wind noise in a car, motor or machine noises, construction site noises, background music, background conversations, and the like.
  • Voice enhancement devices 102 and 104 produce output signals y(n) 124 and y' ⁇ n) 126, respectively. Output signals 124 and 126 are then sent through communication network 106 where they are output as received signals r(n) 130 and r'(n) 128, respectively. Received signals 128 and 130 can be delayed, and can have missing packets, and other anomalies due to propagation through the communication network.
  • Received signals 128 and 130 can also be processed by voice enhancement devices 102 and 104, and output as received signals, e.g., voice signals z'( «) l32 and z(n) 134, respectively. Received voice signals 132 and 134 can then be output by a speaker or headphone for the user to hear.
  • Voice enhancement device 102 can include echo canceller 202, which produces an output signal e(n) 204 that is input into noise suppressor system 206.
  • Noise suppressor system 206 produces an output signal s(n) 208, which can be input into automatic level control 210.
  • the output of automatic level control 210 is output signal 124.
  • Echo canceller 202 is generally known and receives input signal 120, and receive signal 128, and processes the signals to remove unwanted echo signals. Such echo signals can come from electrical mismatches or from acoustical coupling between a speaker and microphone, and the echo typically affects input signal 120 by an additive echo signal that depends on the received signal 128. Thus, output signal 204 from echo canceller 202 is expected to have a reduced echo signal level.
  • Noise suppressor system 206 receives signal 204 as an input signal for processing and suppressing noise.
  • the output of noise suppressor system 206 is signal 208.
  • Noise suppressor system 206 can be implemented using one of several known processes and systems as modified and improved in accordance with one or more of the inventive concepts and principles discussed and disclosed herein.
  • One such process and system uses the noise suppression algorithm described in telecommunications standard IS— 127, which is known as the Enhanced Variable Rate Coder (EVRC) standard published by the Telecommunications Industry Association (TIA), Arlington, Virginia, 22201-3834, USA. This algorithm is also similar to the noise suppression system disclosed in U.S. Pat. No. 5,659,622 issued to Ashley.
  • EVRC Enhanced Variable Rate Coder
  • Automatic level control 210 is generally known and operates to adjust the volume of input signal 208 to produce output signal 124. Automatic level control 210 analyzes the volume level of received signal 128 when processing input signal 208 and makes level control adjustments based upon the level of the received signal 128. For example, if received signal 128 is large, automatic level control 210 may not make any level control adjustments. Automatic level control 210 may also need to estimate the ratio of input signal 208 to received signal 128 in order to increase the level of output signal 124.
  • voice enhancement device 102 Other components or functions that can be included in voice enhancement device 102 include, for example, an acoustic echo suppressor, a tone indicator/detector, a selective - band filter, and the like.
  • Noise suppressor system 206 includes noise suppressor 302 (which can also be called a noise reduction processor) and noise suppressor controller 304, which controls a minimum overall gain setting of noise suppressor 302 using a post- filtering analyzer that analyzes time-domain data.
  • noise suppressor 302 which can also be called a noise reduction processor
  • noise suppressor controller 304 which controls a minimum overall gain setting of noise suppressor 302 using a post- filtering analyzer that analyzes time-domain data.
  • Noise suppressor 302 receives input signal 204 into frequency-domain converter 310.
  • Frequency-domain converter 310 converts the time-domain input signal 204 into a frequency-domain signal.
  • This frequency-domain conversion can include high-pass filtering, pre-emphasis filtering, windowing, and a fast Fourier transform (FFT) operation.
  • the high- pass filtering can be represented by the equation (see IS- 127 for filter coefficient values):
  • the pre-emphasis filtering can be represented by the equation:
  • the windowing operation can use a trapezoidal window with 10 ms frames, 3 ms overlapping, and 3 ms zero-padding, which results in a 16 ms data frame that is then processed though a standard FFT operation to generate a frequency-domain signal, G m (k) .
  • the frequency-domain signal G m ⁇ k) can include one or more signals representing frequency ranges, or frequency bands, or channels, of the input signal.
  • the input signal is subdivided into sixteen channels (or sub-bands) of frequency-domain data corresponding to sixteen frequency ranges.
  • the frequency-domain signal G m ⁇ k) is coupled to an input of energy estimator 312, which estimates the energy in each of the one or more channels of the current frame (m) of the frequency-domain signal using the following equation:
  • noise update indicator 3114 The output of energy estimator 312 is coupled to an input of noise update indicator 314, which produces a noise indicator signal u(n) 316 (which may also be known as an
  • Noise indicator signal u(n) 316 indicates whether the current frame is noise data or voice data.
  • the process of classifying noise or voice data is a function of a voice metric calculation and spectral deviation estimator, which is explained in detail within IS-
  • Noise indicator signal u(n) 316 is set to one (i.e., whenever the current frame is regarded as noise, and it is used to control the periods of time when noise estimator 318 is actively estimating noise.
  • the output of energy estimator 312 is also coupled to an input of noise estimator 318, and signal to noise ratio (SNR) estimator 320.
  • Noise estimator 318 estimates noise energy in each of the one or more channels and performs calculations similar to energy estimator 312.
  • E N (m,i) Max ⁇ 0.065,0.9E N (m -l,i) + 0 ⁇ E(m,i) ⁇
  • ⁇ q (i) is equal to ⁇ (i) or equal to one, depending on the noise update decision (see IS-127).
  • SNR estimator 320 has outputs that provide SNR estimates to noise update indicator 314 and gain calculator 322. The SNR estimates are used in noise update indicator 314 to classify samples as either noise or voice in response to voice metric estimates (see IS- 127).
  • gain calculator 322 receives the estimates and calculates a gain for each of the one or more channels according to the formula:
  • ⁇ ⁇ is the total overall gain of 16 channel bands
  • ⁇ ⁇ ⁇ m is the unconstrained total overall gain
  • ⁇ mm is the minimum overall gain represented by the minimum overall gain control signal f mn (m) 328 (which is fixed at -13 dB in the prior art).
  • the minimum overall gain is not a fixed constant — Y mm ijn) can be advantageously set as a function of time on a frame -by-frame basis under the control of noise suppressor controller 304, which performs a post- filtering analysis to calculate a new minimum overall gain.
  • filtered signal H m (k) is converted back into the time-domain by time- domain converter 326 (which can, for example, use a 16 ms Inverse Fast Fourier Transform (IFFT) operator), which produces noise-reduced output signal s(n) 208.
  • Time-domain converter 326 can also include a de-emphasis filter having the equation:
  • noise suppressor controller 304 is coupled to input signal 204 and output signal 208 of noise suppressor 302.
  • Post- filtering analyzer 330 receives input signal 204 and output signal 208, which are both time- domain signals. By examining both the input and the output signals of noise suppressor 302, post-filtering analyzer 330 can calculate an SNR improvement signal SNRl ⁇ m) 332 for each frame of noise, where such noise frames are indicated by signal u(m) 334.
  • Noise indicator signal 316 can also be used in noise suppressor controller 304 in order to simplify and synchronize the process of distinguishing between noise and voice signals.
  • minimum gain adapter 336 can compare SNRI ⁇ i) 332 to SNR improvement reference signal SNRl REp im) 340 (which is one of control signals 338) to produce new minimum overall gain signal ⁇ mm ⁇ m) 328.
  • the value represented by the SNR improvement reference signal 340 may also be known as a target SNR improvement.
  • minimum gain adapter 336 can use a least mean squares (LMS) algorithm to calculate new minimum overall gain signal 328 to control noise suppressor 302 in a way that will reduce the difference between the SNR improvement 332 and the SNR improvement reference 340 (in a mean squared sense).
  • LMS least mean squares
  • Post-filtering analyzer 330 receives input signal 204, output signal 208, and noise indicator signal 316 to produce SNR improvement signal 332 and noise frame indicator signal 334.
  • Input signal 204 is coupled to down sampler 402, which down samples the digital signal at a rate R 1 .
  • Ri can be 1/8 rate, which outputs every eighth sample.
  • the output of 402 is coupled to absolute value squared 404, which takes the absolute value of the sample and squares it.
  • the purpose of 404 is to compute an instantaneous energy signal.
  • the output of 404 is coupled to low pass filter 406 for averaging-out noise fluctuations affecting the output of 404.
  • H LPF (z) ⁇ - ⁇ , O ⁇ a ⁇ l. l -az
  • noise indicator signal 316 (which is a binary signal indicating a noise sample) is down-sampled at the same rate, R 1 , which is also the rate used at 402.
  • R 1 which is also the rate used at 402.
  • the binary output of down sampler 408 and the output of low pass filter 406 are multiplied together at multiplier 410.
  • the output of 408 is also subtracted from 1 at adder 412, and the result is coupled to one input of multiplier 418.
  • the other input of multiplier 418 is coupled to the output of delay 424, which is the output of adder 420 that has been delayed by one sample at rate R 1 .
  • the output of multiplier 418 is coupled to one input of adder 420, while the other input is coupled to the output of multiplier 410.
  • the output of adder 420 is a signal, P e (R ⁇ n) 422, corresponding to an estimated noise power of the input signal 204.
  • the output of multiplier 444 is input to adder 434, which outputs an updated accumulation of estimated noise power when a noise sample is input, and outputs the previously accumulated estimated noise power when a voice sample is input.
  • the other input to adder 434 is the previously accumulated noise estimate delayed by one sample at the rate R 1 , as determined at adder 426 and multiplier 428.
  • signal P 5 (R 1 U) 430 corresponds to estimated noise power in output signal 208.
  • the signal to noise ratio improvement signal SNRl(m) 332 is calculated by further down sampling these signals at rate R 2 , as shown by down samplers 446 and 448.
  • rate R 2 is equal to the frame rate divided by Ri (i.e., RrR 2 equals the frame rate).
  • Noise indicator signal 316 (after being down sampled by down sampler 408) is also down sampled at rate R 2 by down sampler 456, which outputs noise frame indicator signal u(m) 334. Notice that both outputs 332 and 334 from post-filtering analyzer 330 are provided at a frame rate.
  • the signals 422 and 430 are down sampled, they are input into logarithmic calculators 450 and 452.
  • the output of logarithmic calculators 450 and 452 are input into adder 454, which calculates the SNR improvement SNRl(m) 332 in decibels for noise suppressor 302.
  • the SNRl(m) 332 signal is the difference between the estimated noise in input signal 204 and the estimated noise in output signal 208.
  • post-filtering analyzer 330 calculates signal-to-noise ratios of input signal 204 and output signal 208 using time-domain data to produce SNR improvement signal 332 that indicates the signal-to-noise ratio improvement of noise suppressor 302. These time- domain measurements are then used to compute minimum overall gain control signal 328 (at a frame rate), which controls a noise suppression process performed in the frequency-domain.
  • Minimum gain adapter 336 receives SNR improvement signal 332 and SNR improvement reference signal 340 and computes a difference between the two at adder 502, i.e., an error signal.
  • Noise frame indicator signal u(m) 334 is input into multiplier 504, where it is multiplied by the step size ⁇ 506 for correcting the error signal output by adder 502.
  • the error signal output by 502 is input into multiplier 508 where it is multiplied by the error correction step size from multiplier 504, if the frame is a noise frame.
  • the output of multiplier 508 is input into adder 510, where minimum overall gain control signal 328 from the previous frame, which has been delayed by 512, is added.
  • delay block 512 can be replaced by a multi-frame delay.
  • the output of adder 510 is input into maximum signal processor 514, which does not allow the signal to fall below lower gain limit ⁇ L 516.
  • the output of maximum signal processor 514 is input into minimum signal processor 518, which does not allow the signal to pass above maximum gain ⁇ H 520.
  • the output of minimum signal processor 518 is minimum overall gain control signal 328.
  • 514 and 518 place lower and upper limits on minimum overall gain control signal 328 (which can be viewed as a projection onto a convex set operator).
  • the resulting minimum overall gain adaptation is then given by the equation:
  • / mm (m) Min ⁇ Max ⁇ mm (m - 1) + ⁇ u(m)[SNRI(m) - SNRI ⁇ (m)], ⁇ L ⁇ , ⁇ H ⁇ .
  • Minimum overall gain control signal 328 is output for each frame, and can vary frame-by-frame, or by any other ratio of frames, e.g., every 3 rd frame (in which case the above update equation would be based on ⁇ mm (m -3) ).
  • SNR improvement reference signal 340 can be fixed at a desired level.
  • SNR improvement reference signal 340 can be set in the range between -30 dB and 0 dB.
  • SNR improvement reference signal 340 can vary over time.
  • the SNR reference level can be adjusted depending upon the characteristics of input signal 204 (e.g., whether input signal 204 is voice, noise, signaling tone, etc .).
  • the step size ⁇ 506 can also be adjusted in order to increase or decrease the minimum overall gain adaptation speed.
  • other adaptive algorithms may also be used to adjust minimum overall gain signal 328.
  • FIG. 6 there is depicted a high-level flowchart 600 of exemplary processes executed by portions of a noise suppressor system, such as noise suppressor system 206, which is shown in voice enhancement device 102 of FIG. 2, or executed by another similar apparatus, in accordance with one or more embodiments.
  • the process begins at 602, and thereafter passes to 604 wherein the process initializes the minimum overall gain f mn (m). This can be implemented by setting minimum overall gain control signal 328 to a preselected value (e.g., at -13 dB).
  • the process determines whether the minimum gain adaptation process is enabled, as shown at 606.
  • the process determines whether a new minimum overall gain value is available, as illustrated at 608. If the new minimum overall gain value is available, the process sets the current minimum overall gain value to the new minimum overall gain value, as depicted at 610. This process can be implemented by comparing a current minimum overall gain in a noise reduction processor to a new value for the minimum overall gain, and replacing the current minimum overall gain with the new minimum overall gain when the values are different.
  • the process passes to 612, wherein the process determines if new frames are available. If new frames are available, voice signal processing continues, and the process iteratively returns to 606.
  • the process receives new frames of input and output signals as depicted at 614, wherein the signals are time-domain signals input into, and output from, the noise suppressor, such as noise suppressor 302 in FIG. 3.
  • the new frames of input and output signals correspond to input signal e(n) 204 and output signal s(n) 208, which are shown in FIGS. 2, 3 and 4.
  • the process determines whether the update flag u(n) is set to indicate a noise sample, as illustrated at 616.
  • the update flag u(n) can be implemented with noise indicator signal 316, as shown in FIG. 3 as the output of noise update indicator 314.
  • Noise indicator signal 316 is a binary signal that, when set, indicates that a sample currently being processed is noise.
  • the process estimates a new SNR improvement for the new signal frame, as illustrated at 618.
  • the process of estimating a new SNR improvement can be implemented in the time-domain according to the process described and illustrated in FIG. 4, wherein SNRl ⁇ m) 332 is computed.
  • the process updates the minimum overall gain f mn (m), as depicted at 620.
  • This process can be implemented as described and illustrated in FIG. 5, wherein SNRl ⁇ m) 332 and SNRl mF ⁇ tn) 340 are used to compute a minimum overall gain control signal 328 that sets a new minimum overall gain f mn (m) in gain calculator 322 of noise suppressor 302 shown in FIG. 3.
  • the process passes to 612 to determine whether new frames are available. If new frames are available, the process iteratively returns to 606 to begin the process again for the new frame of data. If there are no new frames available, the process terminates at 622. The process can terminate when, for example, a telephone call ends and there are no new frames of voice data to process.
  • the minimum overall gain of the noise suppressor is not a fixed value, which can restrict the ability of the noise suppressor to further improve the SNR.
  • the method and system described herein can provide a larger minimum overall gain value, which may be needed in case multiple noise suppressors are connected in cascade.
  • one or more embodiments provide for adjusting the noise suppressor in order to deliver some target SNR improvement, regardless of the statistical characteristics of the noise signal.
  • the use of a time- varying SNR reference signal is capable of handling different signal conditions (e.g., emphasizing voice segments of input signal 204, if voice encoding is required).
  • SNR improvement i.e., noise suppression level
  • SNR improvement i.e., noise suppression level

Abstract

Methods and corresponding systems for suppressing noise in an input signal (204) include setting a minimum overall gain in a noise reduction processor (206) for processing a first frame of data associated with the input signal (204). In response to a new minimum overall gain being set, the minimum overall gain in the noise reduction processor (206) is replaced with the new minimum overall gain, and a second frame of data associated with the input signal (204) is processed to suppress noise using the new minimum overall gain. The new minimum overall gain can be a function of the input signal (204) or an output signal (208) of the noise reduction processor (206). The new minimum overall gain can correspond to a difference between an estimated signal-to-noise ratio (SNR) improvement that is calculated using time-domain data and a target SNR improvement.

Description

ADJUSTABLE NOISE SUPPRESSION SYSTEM
FIELD OF THE INVENTION
[0001] This invention relates in general to data communication, and more specifically to techniques and apparatus for suppressing noise in a signal in a communication system.
BACKGROUND OF THE INVENTION
[0002] High-level background noise in a wired or wireless telecommunications channel degrades in-band signaling and lowers the perceived voice quality of speech signals. To ensure quality of service in voice-band transmission, noise suppressors, or noise reducers, are used to reduce the degradation caused by the background noise and to improve the signal-to- noise ratio (SNR) of noisy signals.
[0003] Many popular noise reduction/suppression algorithms use the principles of spectral weighting. Spectral weighting means that different spectral regions of the mixed signal of speech and noise are attenuated or modified with different gain factors. The goal is to obtain a speech signal that contains less noise than the original speech signal. At the same time, the speech quality must remain substantially intact with a minimal distortion of the original speech.
[0004] Spectral weighting is typically performed in the frequency domain using the well- known Fourier transform. Voice activity detectors are used to determine whether current signal samples represent predominantly voice or noise. Energy estimators and signal-to-noise ratio estimators are used to calculate a factor that is then used to modify the level of a frequency-domain signal. The signal to noise ratio is a measure of signal strength (e.g., voice strength) relative to background noise. The frequency-domain signal as modified is then converted back to the time-domain.
[0005] One problem with noise suppressors is that the level of suppression can be too high or too low under various different conditions. Additionally, a noise suppressor that operates in the frequency domain, like the spectral weighting filter, can leave artifacts in the output signal, such as musical noise, jet engine roar, running water, or the like.
BRIEF DESCRIPTION OF THE DRAWINGS
[0006] The accompanying figures, wherein like reference numerals refer to identical or functionally similar elements throughout the separate views and which together with the detailed description below are incorporated in and form part of the specification, serve to further illustrate various embodiments and to explain various principles and advantages, all in accordance with the present invention.
[0007] FIG. 1 depicts, in a simplified and representative form, a high-level block diagram of a communications system having voice enhancement devices connected through a communication channel in accordance with one or more embodiments;
[0008] FIG. 2 is a more detailed representative block diagram of a voice enhancement device in accordance with one or more embodiments;
[0009] FIG. 3 depicts a block diagram of a noise suppressor system in accordance with one or more embodiments;
[0010] FIG. 4 shows a more detailed block diagram of a post-filtering analyzer that can be used in conjunction with the FIG. 3 noise suppressor system in accordance with one or more embodiments;
[0011] FIG. 5 depicts a more detailed block diagram of a minimum gain adapter that can be used in conjunction with the FIG. 3 noise suppressor system in accordance with one or more embodiments; and
[0012] FIG. 6 shows a high-level flowchart of processes executed by a noise suppressor system that can be used in conjunction with the FIG. 2 voice enhancement device in accordance with one or more embodiments. DETAILED DESCRIPTION
[0013] In overview, the present disclosure concerns noise suppression in voice enhancement devices. More particularly various inventive concepts and principles embodied in methods and apparatus may be used for adjusting a minimum overall gain, i.e., level of noise suppression, in a noise suppression system in a voice enhancement device.
[0014] While the voice enhancement device of particular interest may vary widely, one embodiment may advantageously be used in a wireless communication system or a wireless networking system, such as a cellular wireless network. Additionally, the inventive concepts and principles taught herein can be advantageously applied to wired communications systems, such as a telephone system.
[0015] The instant disclosure is provided to further explain, in an enabling fashion, the best modes, at the time of the application, of making and using various embodiments in accordance with the present invention. The disclosure is further offered to enhance an understanding and appreciation for the inventive principles and advantages thereof, rather than to limit the invention in any manner. The invention is defined solely by the appended claims, including any amendments made during the pendency of this application, and all equivalents of those claims as issued.
[0016] It is further understood that the use of relational terms, if any, such as first and second, top and bottom, and the like, are used solely to distinguish one entity or action from another without necessarily requiring or implying any such actual relationship or order between such entities or actions.
[0017] Much of the inventive functionality and many of the inventive principles are best implemented with, or in, integrated circuits (ICs), including possibly application specific ICs, or ICs with integrated processing controlled by embedded software or firmware. It is expected that one of ordinary skill, when guided by the concepts and principles disclosed herein, will be readily capable of generating such software instructions and programs and ICs with minimal experimentation — notwithstanding possibly significant effort and many design choices motivated by, for example, available time, current technology, and economic considerations. Therefore, in the interest of brevity and minimization of any risk of obscuring the principles and concepts according to the present invention, further discussion of such software and ICs, if any, will be limited to the essentials with respect to the principles and concepts of the various embodiments.
[0018] Referring to FIG. 1, there is depicted, in a simplified and representative form, a high-level block diagram of communications system 100 having voice enhancement devices
102 and 104 connected through communication network (or communication channel) 106 in accordance with one or more embodiments. Voice enhancement devices 102 and 104 are generally devices for processing, filtering, and conditioning a voice signal to improve the voice quality and sound clarity of wireless and wired signals before they are transmitted through a communication network, such as communication network 106. Communication network 106 can be a wired or wireless communication network.
[0019] When a telephone, radio, or cell phone is used, signals, e.g., voice signals v(n) 108 and v'{n) 110 or the like are combined, respectively, with noise signals d(n) 112 and d'{n) 114, which are shown at adders 116 and 118, to produce input signals x(n) 120 and x'{n) 122. Noise signals 112 and 114 include the effects of ambient sounds 103 and 105 (i.e., sounds that surround the user who is the source of the voice signal), respectively, in addition to any noise or distortion caused by the equipment or the environment, such as the acoustics of the microphones, electronic interference or any electronic processing of the signal before voice signals 108 and 110 are input into voice enhancement devices 102 and 104. Ambient sounds 103 and 105 can include, for example, road and wind noise in a car, motor or machine noises, construction site noises, background music, background conversations, and the like.
[0020] Voice enhancement devices 102 and 104 produce output signals y(n) 124 and y'{n) 126, respectively. Output signals 124 and 126 are then sent through communication network 106 where they are output as received signals r(n) 130 and r'(n) 128, respectively. Received signals 128 and 130 can be delayed, and can have missing packets, and other anomalies due to propagation through the communication network.
[0021] Received signals 128 and 130 can also be processed by voice enhancement devices 102 and 104, and output as received signals, e.g., voice signals z'(«) l32 and z(n) 134, respectively. Received voice signals 132 and 134 can then be output by a speaker or headphone for the user to hear. [0022] With reference now to FIG. 2, there is depicted a more detailed representative block diagram of a voice enhancement device in accordance with one or more embodiments. Voice enhancement device 102 can include echo canceller 202, which produces an output signal e(n) 204 that is input into noise suppressor system 206. Noise suppressor system 206 produces an output signal s(n) 208, which can be input into automatic level control 210. The output of automatic level control 210 is output signal 124.
[0023] Echo canceller 202 is generally known and receives input signal 120, and receive signal 128, and processes the signals to remove unwanted echo signals. Such echo signals can come from electrical mismatches or from acoustical coupling between a speaker and microphone, and the echo typically affects input signal 120 by an additive echo signal that depends on the received signal 128. Thus, output signal 204 from echo canceller 202 is expected to have a reduced echo signal level.
[0024] Noise suppressor system 206 receives signal 204 as an input signal for processing and suppressing noise. The output of noise suppressor system 206 is signal 208. Noise suppressor system 206 can be implemented using one of several known processes and systems as modified and improved in accordance with one or more of the inventive concepts and principles discussed and disclosed herein. One such process and system uses the noise suppression algorithm described in telecommunications standard IS— 127, which is known as the Enhanced Variable Rate Coder (EVRC) standard published by the Telecommunications Industry Association (TIA), Arlington, Virginia, 22201-3834, USA. This algorithm is also similar to the noise suppression system disclosed in U.S. Pat. No. 5,659,622 issued to Ashley. Note that one of the initial weighting rules proposed for audio noise reduction was that of spectral subtraction [see, S. F. Boll, Suppression of Acoustic Noise in Speech Using Spectral Subtraction, IEEE Trans, on Acoust. Speech, and Sign. Proc, Vol.ASSP-27, No.2., April 1979, pp.113-120]. One of its versions is the magnitude spectral subtraction. Although the noise level can be reduced by the spectral subtraction, its direct application poses a disadvantage, as the processed signal may sound unnatural, and processing may cause an effect known as "musical noise."
[0025] The components in noise suppressor system 206, its operation, and various inventive concepts and principles, are discussed in greater detail below. [0026] Automatic level control 210 is generally known and operates to adjust the volume of input signal 208 to produce output signal 124. Automatic level control 210 analyzes the volume level of received signal 128 when processing input signal 208 and makes level control adjustments based upon the level of the received signal 128. For example, if received signal 128 is large, automatic level control 210 may not make any level control adjustments. Automatic level control 210 may also need to estimate the ratio of input signal 208 to received signal 128 in order to increase the level of output signal 124.
[0027] Other components or functions that can be included in voice enhancement device 102 include, for example, an acoustic echo suppressor, a tone indicator/detector, a selective - band filter, and the like.
[0028] With reference now to FIG. 3, there is depicted a block diagram representation of a noise suppressor system, such as noise suppressor system 206 or another similar system, in accordance with one or more embodiments. Noise suppressor system 206 includes noise suppressor 302 (which can also be called a noise reduction processor) and noise suppressor controller 304, which controls a minimum overall gain setting of noise suppressor 302 using a post- filtering analyzer that analyzes time-domain data.
[0029] Noise suppressor 302 receives input signal 204 into frequency-domain converter 310. Frequency-domain converter 310 converts the time-domain input signal 204 into a frequency-domain signal. This frequency-domain conversion can include high-pass filtering, pre-emphasis filtering, windowing, and a fast Fourier transform (FFT) operation. The high- pass filtering can be represented by the equation (see IS- 127 for filter coefficient values):
Figure imgf000008_0001
[0030] The pre-emphasis filtering can be represented by the equation:
HPE(z) = l -0.Sz -1
[0031] The windowing operation can use a trapezoidal window with 10 ms frames, 3 ms overlapping, and 3 ms zero-padding, which results in a 16 ms data frame that is then processed though a standard FFT operation to generate a frequency-domain signal, Gm(k) . [0032] The frequency-domain signal Gm{k) can include one or more signals representing frequency ranges, or frequency bands, or channels, of the input signal. In one embodiment, the input signal is subdivided into sixteen channels (or sub-bands) of frequency-domain data corresponding to sixteen frequency ranges.
[0033] The frequency-domain signal Gm{k) is coupled to an input of energy estimator 312, which estimates the energy in each of the one or more channels of the current frame (m) of the frequency-domain signal using the following equation:
1 W ,2
E (m,i) = Y G (k)\
E(m,ϊ) =Max {0.0625, 0A5E(m -l,i) + 0.55Ec(m,i)}
[0034] The output of energy estimator 312 is coupled to an input of noise update indicator 314, which produces a noise indicator signal u(n) 316 (which may also be known as an
"update_flag"). Noise indicator signal u(n) 316 indicates whether the current frame is noise data or voice data. The process of classifying noise or voice data is a function of a voice metric calculation and spectral deviation estimator, which is explained in detail within IS-
127. Noise indicator signal u(n) 316 is set to one (i.e.,
Figure imgf000009_0001
whenever the current frame is regarded as noise, and it is used to control the periods of time when noise estimator 318 is actively estimating noise.
[0035] The output of energy estimator 312 is also coupled to an input of noise estimator 318, and signal to noise ratio (SNR) estimator 320. Noise estimator 318 estimates noise energy in each of the one or more channels and performs calculations similar to energy estimator 312. The output of noise estimator 318 can be represented by the following formula (for noise frames, i.e. having update flag = 1):
EN(m,i) = Max{0.065,0.9EN(m -l,i) + 0ΛE(m,i)}
[0036] SNR estimator 320 receives energy estimates from energy estimator 312 and noise estimates from noise estimator 318, and produces SNR estimates for each of the one or more channels. These channel SNR estimates can be represented by the formula:
Figure imgf000010_0001
σq " (i) =Max{6,σq(i)}
Where σq(i) is equal to σ (i) or equal to one, depending on the noise update decision (see IS-127).
[0037] SNR estimator 320 has outputs that provide SNR estimates to noise update indicator 314 and gain calculator 322. The SNR estimates are used in noise update indicator 314 to classify samples as either noise or voice in response to voice metric estimates (see IS- 127).
[0038] With the noise estimates and the SNR estimates calculated for the frame, gain calculator 322 receives the estimates and calculates a gain for each of the one or more channels according to the formula:
r(i) =Min{\,WdB(l)l20 ] ydB(i) = 0.39(σq " (i) -6) + yτ γτ = Max{γmmτ(m)}
rτ(m) = -101og10 ∑EN(m,i)
.J=O
Where γτ is the total overall gain of 16 channel bands, γτ{m) is the unconstrained total overall gain and γmm is the minimum overall gain represented by the minimum overall gain control signal fmn(m) 328 (which is fixed at -13 dB in the prior art). Thus, the minimum overall gain is not a fixed constant — Ymmijn) can be advantageously set as a function of time on a frame -by-frame basis under the control of noise suppressor controller 304, which performs a post- filtering analysis to calculate a new minimum overall gain.
[0039] The gains for each of the channels output by gain calculator 322 are used in gain modifier 324 to modify the frequency-domain signal Gm{k) to produce a filtered frequency- domain signal Hm(k) , which may also be known as a noise-reduced signal spectrum. [0040] Finally, filtered signal Hm(k) is converted back into the time-domain by time- domain converter 326 (which can, for example, use a 16 ms Inverse Fast Fourier Transform (IFFT) operator), which produces noise-reduced output signal s(n) 208. Time-domain converter 326 can also include a de-emphasis filter having the equation:
Figure imgf000011_0001
[0041] To produce minimum overall gain control signal 328, noise suppressor controller 304 is coupled to input signal 204 and output signal 208 of noise suppressor 302. Post- filtering analyzer 330 receives input signal 204 and output signal 208, which are both time- domain signals. By examining both the input and the output signals of noise suppressor 302, post-filtering analyzer 330 can calculate an SNR improvement signal SNRl{m) 332 for each frame of noise, where such noise frames are indicated by signal u(m) 334. Noise indicator signal 316 can also be used in noise suppressor controller 304 in order to simplify and synchronize the process of distinguishing between noise and voice signals.
[0042] Once the SNR improvement signal SNRl{m) 334 has been calculated, minimum gain adapter 336 can compare SNRIψi) 332 to SNR improvement reference signal SNRl REpim) 340 (which is one of control signals 338) to produce new minimum overall gain signal γmm{m) 328. The value represented by the SNR improvement reference signal 340 may also be known as a target SNR improvement. In one embodiment, minimum gain adapter 336 can use a least mean squares (LMS) algorithm to calculate new minimum overall gain signal 328 to control noise suppressor 302 in a way that will reduce the difference between the SNR improvement 332 and the SNR improvement reference 340 (in a mean squared sense).
[0043] Referring now to FIG. 4, there is depicted a high-level schematic representation of a post-filtering analyzer that can be used in conjunction with the FIG. 3 noise suppressor system 206 in accordance with one or more embodiments. Post-filtering analyzer 330 receives input signal 204, output signal 208, and noise indicator signal 316 to produce SNR improvement signal 332 and noise frame indicator signal 334. [0044] Input signal 204 is coupled to down sampler 402, which down samples the digital signal at a rate R1. In one embodiment, Ri can be 1/8 rate, which outputs every eighth sample. The output of 402 is coupled to absolute value squared 404, which takes the absolute value of the sample and squares it. The purpose of 404 is to compute an instantaneous energy signal. The output of 404 is coupled to low pass filter 406 for averaging-out noise fluctuations affecting the output of 404. In one embodiment, low pass filter 406 operates according to the equation, where, in one embodiment, a = 0.96875:
HLPF(z) = ^-^, O < a < l. l -az
[0045] At down sampler 408, noise indicator signal 316 (which is a binary signal indicating a noise sample) is down-sampled at the same rate, R1, which is also the rate used at 402. The binary output of down sampler 408 and the output of low pass filter 406 are multiplied together at multiplier 410.
[0046] The output of 408 is also subtracted from 1 at adder 412, and the result is coupled to one input of multiplier 418. The other input of multiplier 418 is coupled to the output of delay 424, which is the output of adder 420 that has been delayed by one sample at rate R1. The output of multiplier 418 is coupled to one input of adder 420, while the other input is coupled to the output of multiplier 410. The output of adder 420 is a signal, Pe(Rλn) 422, corresponding to an estimated noise power of the input signal 204.
[0047] In a similar estimated noise power calculation for the output signal 208, input signal 208 is down sampled at rate Ri at down sampler 438. Then, at 440, the absolute value of the signal is squared, and the result is passed through low pass filter 442, which is similar to low pass filter 406. The output of low pass filter 442 is coupled to multiplier 444, wherein it is multiplied by the output of down sampler 408. Since the output of down sampler 408 indicates the presence of a noise signal 316, the output of multiplier 444 is equal to zero when voice is present in a sample of signal 204. The output of multiplier 444 corresponds to estimated noise power in signal 208 when signal 316 indicates a noise sample.
[0048] The output of multiplier 444 is input to adder 434, which outputs an updated accumulation of estimated noise power when a noise sample is input, and outputs the previously accumulated estimated noise power when a voice sample is input. The other input to adder 434 is the previously accumulated noise estimate delayed by one sample at the rate R1, as determined at adder 426 and multiplier 428. Thus, signal P5(R1U) 430 corresponds to estimated noise power in output signal 208.
[0049] After the noise power has been estimated in the input and output signals 204 and 208 of noise suppressor 302, as represented by P6(R1Jt) 422 and P^R^n) 430, respectively, the signal to noise ratio improvement signal SNRl(m) 332 is calculated by further down sampling these signals at rate R2, as shown by down samplers 446 and 448. In one embodiment, rate R2 is equal to the frame rate divided by Ri (i.e., RrR2 equals the frame rate). Noise indicator signal 316 (after being down sampled by down sampler 408) is also down sampled at rate R2 by down sampler 456, which outputs noise frame indicator signal u(m) 334. Notice that both outputs 332 and 334 from post-filtering analyzer 330 are provided at a frame rate.
[0050] After the signals 422 and 430 are down sampled, they are input into logarithmic calculators 450 and 452. The output of logarithmic calculators 450 and 452 are input into adder 454, which calculates the SNR improvement SNRl(m) 332 in decibels for noise suppressor 302. The SNRl(m) 332 signal is the difference between the estimated noise in input signal 204 and the estimated noise in output signal 208.
[0051] Note that post-filtering analyzer 330 calculates signal-to-noise ratios of input signal 204 and output signal 208 using time-domain data to produce SNR improvement signal 332 that indicates the signal-to-noise ratio improvement of noise suppressor 302. These time- domain measurements are then used to compute minimum overall gain control signal 328 (at a frame rate), which controls a noise suppression process performed in the frequency-domain.
[0052] Turning now to FIG. 5, there is depicted a high-level block diagram of a minimum gain adapter that can be used in conjunction with the FIG. 3 noise suppressor system in accordance with one or more embodiments. Minimum gain adapter 336 receives SNR improvement signal 332 and SNR improvement reference signal 340 and computes a difference between the two at adder 502, i.e., an error signal. Noise frame indicator signal u(m) 334 is input into multiplier 504, where it is multiplied by the step size μ 506 for correcting the error signal output by adder 502. The error signal output by 502 is input into multiplier 508 where it is multiplied by the error correction step size from multiplier 504, if the frame is a noise frame. [0053] The output of multiplier 508 is input into adder 510, where minimum overall gain control signal 328 from the previous frame, which has been delayed by 512, is added. In alternative embodiments, delay block 512 can be replaced by a multi-frame delay. The output of adder 510 is input into maximum signal processor 514, which does not allow the signal to fall below lower gain limit γL 516. The output of maximum signal processor 514 is input into minimum signal processor 518, which does not allow the signal to pass above maximum gain γH 520. The output of minimum signal processor 518 is minimum overall gain control signal 328. Thus, 514 and 518 place lower and upper limits on minimum overall gain control signal 328 (which can be viewed as a projection onto a convex set operator). The resulting minimum overall gain adaptation is then given by the equation:
/mm (m) = Min{Max{γmm (m - 1) + μu(m)[SNRI(m) - SNRI^ (m)], γL}, γH}.
[0054] Minimum overall gain control signal 328 is output for each frame, and can vary frame-by-frame, or by any other ratio of frames, e.g., every 3rd frame (in which case the above update equation would be based on χmm(m -3) ). In some embodiments, SNR improvement reference signal 340 can be fixed at a desired level. For example, SNR improvement reference signal 340 can be set in the range between -30 dB and 0 dB. Alternatively, SNR improvement reference signal 340 can vary over time. For example, the SNR reference level can be adjusted depending upon the characteristics of input signal 204 (e.g., whether input signal 204 is voice, noise, signaling tone, etc .). Furthermore, the step size μ 506 can also be adjusted in order to increase or decrease the minimum overall gain adaptation speed. Alternatively, other adaptive algorithms may also be used to adjust minimum overall gain signal 328. In one embodiment, the step size can be set to μ = 1/8 .
[0055] Referring now to the operation of the noise suppressor system, in FIG. 6 there is depicted a high-level flowchart 600 of exemplary processes executed by portions of a noise suppressor system, such as noise suppressor system 206, which is shown in voice enhancement device 102 of FIG. 2, or executed by another similar apparatus, in accordance with one or more embodiments. As illustrated, the process begins at 602, and thereafter passes to 604 wherein the process initializes the minimum overall gain fmn(m). This can be implemented by setting minimum overall gain control signal 328 to a preselected value (e.g., at -13 dB). [0056] Next, the process determines whether the minimum gain adaptation process is enabled, as shown at 606. If the minimum gain adaptation is not enabled, the process determines whether a new minimum overall gain value is available, as illustrated at 608. If the new minimum overall gain value is available, the process sets the current minimum overall gain value to the new minimum overall gain value, as depicted at 610. This process can be implemented by comparing a current minimum overall gain in a noise reduction processor to a new value for the minimum overall gain, and replacing the current minimum overall gain with the new minimum overall gain when the values are different.
[0057] After the new minimum overall gain value has been set, or after it has been determined that there is no new value, the process passes to 612, wherein the process determines if new frames are available. If new frames are available, voice signal processing continues, and the process iteratively returns to 606.
[0058] If, at 606, the process determines that the minimum overall gain adaptation process is enabled, the process receives new frames of input and output signals as depicted at 614, wherein the signals are time-domain signals input into, and output from, the noise suppressor, such as noise suppressor 302 in FIG. 3. The new frames of input and output signals correspond to input signal e(n) 204 and output signal s(n) 208, which are shown in FIGS. 2, 3 and 4.
[0059] After receiving new frames of data, the process determines whether the update flag u(n) is set to indicate a noise sample, as illustrated at 616. The update flag u(n) can be implemented with noise indicator signal 316, as shown in FIG. 3 as the output of noise update indicator 314. Noise indicator signal 316 is a binary signal that, when set, indicates that a sample currently being processed is noise.
[0060] If the update flag (noise indicator signal) u(n) is set, the process estimates a new SNR improvement for the new signal frame, as illustrated at 618. The process of estimating a new SNR improvement can be implemented in the time-domain according to the process described and illustrated in FIG. 4, wherein SNRl{m) 332 is computed.
[0061] After estimating the SNR improvement, the process updates the minimum overall gain fmn(m), as depicted at 620. This process can be implemented as described and illustrated in FIG. 5, wherein SNRl{m) 332 and SNRl mF{tn) 340 are used to compute a minimum overall gain control signal 328 that sets a new minimum overall gain fmn(m) in gain calculator 322 of noise suppressor 302 shown in FIG. 3.
[0062] After calculating and updating a new minimum overall gain at 620, the process passes to 612 to determine whether new frames are available. If new frames are available, the process iteratively returns to 606 to begin the process again for the new frame of data. If there are no new frames available, the process terminates at 622. The process can terminate when, for example, a telephone call ends and there are no new frames of voice data to process.
[0063] It should be apparent to those skilled in the art that the method and system described herein provides a number of improvements over the prior art. First, the minimum overall gain of the noise suppressor is not a fixed value, which can restrict the ability of the noise suppressor to further improve the SNR. Second, the method and system described herein can provide a larger minimum overall gain value, which may be needed in case multiple noise suppressors are connected in cascade. Third, one or more embodiments provide for adjusting the noise suppressor in order to deliver some target SNR improvement, regardless of the statistical characteristics of the noise signal. Fourth, the use of a time- varying SNR reference signal is capable of handling different signal conditions (e.g., emphasizing voice segments of input signal 204, if voice encoding is required).
[0064] Experiments with the method and system described herein have shown that the minimum overall gain has an average behavior of a near-linear relationship with respect to
SNR improvement (i.e., noise suppression level), thus enabling a quite simple and low-cost control mechanism for achieving a target SNR improvement, as disclosed above. Persons skilled in the art frequently regard the use of SNR as a non-preferred method for noise suppression because it may also affect voiced segments of the signal. The method and system described herein can remove this limitation, as the disclosed minimum gain adapter (see 336 in FIGS. 3 and 5) may use any arbitrary target SNR improvement function of time.
[0065] The above described functions and structures can be implemented in one or more integrated circuits. For example, many or all of the functions can be implemented in the signal and data processing circuitry that is suggested by the block diagrams and schematic diagrams shown in FIGS. 1-5. [0066] The processes, apparatus, and systems, discussed above, and the inventive principles thereof are intended to produce a more effective noise suppression system. By changing and adapting the minimum overall gain, a noise suppressor can more aggressively suppress noise in parts of the speech data stream while being less aggressive in other parts of the data stream. Additional effectiveness is gained when the correction of a frequency- domain process is computed in the time-domain, as the actual output signal from the noise suppressor is processed by a post-filtering analyzer, which can be used to adjust the noise suppressor to achieve noise suppression performance according to a selected SNR improvement.
[0067] This disclosure is intended to explain how to fashion and use various embodiments in accordance with the invention, rather than to limit the true, intended, and fair scope and spirit thereof. The foregoing description is not intended to be exhaustive or to limit the invention to the precise form disclosed. Modifications or variations are possible in light of the above teachings. The embodiment(s) were chosen and described to provide the best illustration of the principles of the invention and its practical application, and to enable one of ordinary skill in the art to utilize the invention in various embodiments and with various modifications as are suited to the particular use contemplated. All such modifications and variations are within the scope of the invention as determined by the appended claims, as may be amended during the pendency of this application for patent, and all equivalents thereof, when interpreted in accordance with the breadth to which they are fairly, legally, and equitably entitled.

Claims

Claims:
What is claimed is:
1 1. A method of suppressing noise in an input signal comprising:
2 setting a minimum overall gain in a noise reduction processor for processing a first
3 frame of data associated with the input signal;
4 replacing, in response to a new minimum overall gain being set, the minimum overall
5 gain in the noise reduction processor with the new minimum overall gain; and e processing a second frame of data associated with the input signal to suppress noise
7 using the new minimum overall gain.
1 2. The method of suppressing noise according to claim 1 wherein the replacing the
2 minimum overall gain comprises replacing the minimum overall gain with the new
3 minimum overall gain, wherein the new minimum overall gain is a function of one or
4 more of the input signal and an output signal of the noise reduction processor.
1 3. The method for suppressing noise according to claim 1 comprising:
2 outputting from the noise reduction processor a noise indicator; and
3 calculating the new minimum overall gain using the input signal, an output signal, the
4 noise indicator, and a reference signal.
1 4. The method for suppressing noise according to claim 1 wherein the replacing the
2 minimum overall gain comprises:
3 estimating, using time domain data, a signal to noise ratio (SNR) improvement of the
4 noise reduction processor;
5 computing the new minimum overall gain corresponding to a difference between a e target SNR improvement and the estimated SNR improvement; and
7 replacing the minimum overall gain in the noise reduction processor with the new β minimum overall gain.
1 5. The method for suppressing noise according to claim 4 wherein the computing the new
2 minimum overall gain comprises computing the new minimum overall gain using a
3 least mean squares (LMS) algorithm and the difference between the target SNR
4 improvement and the estimated SNR improvement.
1 6. The method for suppressing noise according to claim 4 comprising updating the target
2 SNR improvement.
1 7. The method for suppressing noise according to claim 4 comprising initially setting the
2 minimum overall gain to a negative value of the target SNR improvement.
1 8. A system for suppressing noise in an input signal comprising:
2 a frequency domain converter adapted to convert the input signal to a frequency domain
3 signal;
4 a noise estimator adapted to estimate a noise level in the frequency domain signal;
5 a gain calculator adapted to calculate a gain based upon the estimated noise level and a e minimum gain control signal, wherein the minimum gain control signal varies 7 with a desired level of noise suppression; β a gain adjuster adapted to change the amplitude of the frequency domain signal based
9 upon the gain to produce a filtered signal; and 0 a time domain converter adapted to convert the filtered signal to an output signal in a 1 time domain.
1 9. The system for suppressing noise according to claim 8 wherein the minimum gain
2 control signal is responsive to a signal to noise ratio (SNR) of the input signal, an SNR
3 of the output signal, and a target SNR.
10. The system for suppressing noise according to claim 9 further comprising a noise indicator having an input coupled to the frequency domain signal, an input coupled to the SNR of the input signal, and having a noise indicator output signal responsive to a sample of the input signal being noise.
11. The system for suppressing noise according to claim 8 further comprising: a post-filter analyzer coupled to the input signal and the output signal, for producing an improvement signal; and a minimum gain adapter coupled to the improvement signal and a reference signal for producing the minimum gain control signal.
12. The system for suppressing noise according to claim 11 wherein the improvement signal is an SNR improvement signal responsive to a difference between an SNR of the input signal and an SNR of the output signal, and wherein the reference signal is an SNR target signal.
13. The system for suppressing noise according to claim 11 wherein the improvement signal is responsive to the noise indicator output signal.
14. A noise suppression device having adjustable noise suppression comprising: a noise suppressor having a noise suppressor input, a noise suppressor output, a noise indicator output, and a minimum gain control input; and a noise suppressor controller having inputs coupled to the noise suppressor input, the noise suppressor output, and the noise indicator output, and having an output for outputting a minimum gain control signal, wherein the minimum gain control signal is coupled to the minimum gain control input, wherein the noise suppressor is adapted to have a minimum gain controlled by the minimum gain control signal.
15. The noise suppression device according to claim 14 wherein the noise suppressor comprises: a frequency domain converter coupled to the noise suppressor input; a gain modifier coupled to an output of the frequency domain converter; a time domain converter having an input coupled to a gain modifier output, and an output coupled to the noise suppressor output; and a gain calculator having an input coupled to the minimum gain control signal, and an output coupled to the gain modifier and adapted to control the gain modifier in response to the minimum gain control signal.
16. The noise suppression device according to claim 15 wherein the noise suppressor comprises: an energy estimator having an input coupled to the output of the frequency domain converter; a noise estimator having an input coupled to an output of the energy estimator; and a signal-to-noise ratio (SNR) estimator having an input coupled to the output of the energy estimator, and an output coupled to an input of the gain calculator.
17. The noise suppression device according to claim 14 wherein the noise suppressor controller comprises: a post-filter analyzer having inputs coupled to the noise suppressor input, and the noise suppressor output, and having an improvement signal output; and a minimum gain adapter having an input coupled to the improvement signal, an input coupled to a reference signal, and an output for outputting the minimum gain control signal
18. The noise suppression device according to claim 17 wherein the post-filter analyzer has an input coupled to the noise indicator output.
19. The noise suppression device according to claim 17 wherein the minimum gain adapter has an input coupled to the noise indicator output.
20. The noise suppression device according to claim 14 comprising: an echo canceller having an output coupled to the noise suppressor input; and a level controller having an input coupled to the noise suppressor output.
PCT/US2007/063044 2006-04-07 2007-03-01 Adjustable noise suppression system WO2007117785A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP07757694A EP2008379B1 (en) 2006-04-07 2007-03-01 Adjustable noise suppression system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/400,458 US7555075B2 (en) 2006-04-07 2006-04-07 Adjustable noise suppression system
US11/400,458 2006-04-07

Publications (2)

Publication Number Publication Date
WO2007117785A2 true WO2007117785A2 (en) 2007-10-18
WO2007117785A3 WO2007117785A3 (en) 2008-05-08

Family

ID=38575241

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2007/063044 WO2007117785A2 (en) 2006-04-07 2007-03-01 Adjustable noise suppression system

Country Status (3)

Country Link
US (1) US7555075B2 (en)
EP (1) EP2008379B1 (en)
WO (1) WO2007117785A2 (en)

Families Citing this family (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7349841B2 (en) * 2001-03-28 2008-03-25 Mitsubishi Denki Kabushiki Kaisha Noise suppression device including subband-based signal-to-noise ratio
US8949120B1 (en) 2006-05-25 2015-02-03 Audience, Inc. Adaptive noise cancelation
KR20080111290A (en) * 2007-06-18 2008-12-23 삼성전자주식회사 System and method of estimating voice performance for recognizing remote voice
US8326617B2 (en) * 2007-10-24 2012-12-04 Qnx Software Systems Limited Speech enhancement with minimum gating
US8015002B2 (en) * 2007-10-24 2011-09-06 Qnx Software Systems Co. Dynamic noise reduction using linear model fitting
US8606566B2 (en) * 2007-10-24 2013-12-10 Qnx Software Systems Limited Speech enhancement through partial speech reconstruction
US8355511B2 (en) * 2008-03-18 2013-01-15 Audience, Inc. System and method for envelope-based acoustic echo cancellation
US8606573B2 (en) * 2008-03-28 2013-12-10 Alon Konchitsky Voice recognition improved accuracy in mobile environments
US8521530B1 (en) 2008-06-30 2013-08-27 Audience, Inc. System and method for enhancing a monaural audio signal
US8914282B2 (en) * 2008-09-30 2014-12-16 Alon Konchitsky Wind noise reduction
US20100082339A1 (en) * 2008-09-30 2010-04-01 Alon Konchitsky Wind Noise Reduction
KR101539268B1 (en) * 2008-12-22 2015-07-24 삼성전자주식회사 Apparatus and method for noise suppress in a receiver
JP5787126B2 (en) 2009-11-06 2015-09-30 日本電気株式会社 Signal processing method, information processing apparatus, and signal processing program
US9838784B2 (en) 2009-12-02 2017-12-05 Knowles Electronics, Llc Directional audio capture
US8718290B2 (en) * 2010-01-26 2014-05-06 Audience, Inc. Adaptive noise reduction using level cues
US9008329B1 (en) 2010-01-26 2015-04-14 Audience, Inc. Noise reduction using multi-feature cluster tracker
TWI459828B (en) * 2010-03-08 2014-11-01 Dolby Lab Licensing Corp Method and system for scaling ducking of speech-relevant channels in multi-channel audio
US8473287B2 (en) 2010-04-19 2013-06-25 Audience, Inc. Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system
US9343073B1 (en) * 2010-04-20 2016-05-17 Knowles Electronics, Llc Robust noise suppression system in adverse echo conditions
US9378754B1 (en) 2010-04-28 2016-06-28 Knowles Electronics, Llc Adaptive spatial classifier for multi-microphone systems
US8831937B2 (en) * 2010-11-12 2014-09-09 Audience, Inc. Post-noise suppression processing to improve voice quality
CN103200140B (en) * 2012-01-06 2016-05-11 微思泰(北京)信息技术有限公司 A kind of interference cancelling system and method based on preequalization
US9173025B2 (en) 2012-02-08 2015-10-27 Dolby Laboratories Licensing Corporation Combined suppression of noise, echo, and out-of-location signals
US8712076B2 (en) 2012-02-08 2014-04-29 Dolby Laboratories Licensing Corporation Post-processing including median filtering of noise suppression gains
FR3002679B1 (en) * 2013-02-28 2016-07-22 Parrot METHOD FOR DEBRUCTING AN AUDIO SIGNAL BY A VARIABLE SPECTRAL GAIN ALGORITHM HAS DYNAMICALLY MODULABLE HARDNESS
US9831898B2 (en) * 2013-03-13 2017-11-28 Analog Devices Global Radio frequency transmitter noise cancellation
US9536540B2 (en) 2013-07-19 2017-01-03 Knowles Electronics, Llc Speech signal separation and synthesis based on auditory scene analysis and speech modeling
GB201401689D0 (en) * 2014-01-31 2014-03-19 Microsoft Corp Audio signal processing
US9516165B1 (en) * 2014-03-26 2016-12-06 West Corporation IVR engagements and upfront background noise
US9978388B2 (en) 2014-09-12 2018-05-22 Knowles Electronics, Llc Systems and methods for restoration of speech components
CN104980337B (en) * 2015-05-12 2019-11-22 腾讯科技(深圳)有限公司 A kind of performance improvement method and device of audio processing
US9820042B1 (en) 2016-05-02 2017-11-14 Knowles Electronics, Llc Stereo separation and directional suppression with omni-directional microphones
EP3312838A1 (en) * 2016-10-18 2018-04-25 Fraunhofer Gesellschaft zur Förderung der Angewand Apparatus and method for processing an audio signal
KR102327441B1 (en) * 2019-09-20 2021-11-17 엘지전자 주식회사 Artificial device
US11094328B2 (en) * 2019-09-27 2021-08-17 Ncr Corporation Conferencing audio manipulation for inclusion and accessibility
US20230154481A1 (en) * 2021-11-17 2023-05-18 Beacon Hill Innovations Ltd. Devices, systems, and methods of noise reduction

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6862567B1 (en) 2000-08-30 2005-03-01 Mindspeed Technologies, Inc. Noise suppression in the frequency domain by adjusting gain according to voicing parameters

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5659622A (en) * 1995-11-13 1997-08-19 Motorola, Inc. Method and apparatus for suppressing noise in a communication system
US5933495A (en) * 1997-02-07 1999-08-03 Texas Instruments Incorporated Subband acoustic noise suppression
US5937377A (en) * 1997-02-19 1999-08-10 Sony Corporation Method and apparatus for utilizing noise reducer to implement voice gain control and equalization
US6233549B1 (en) * 1998-11-23 2001-05-15 Qualcomm, Inc. Low frequency spectral enhancement system and method
US6366880B1 (en) * 1999-11-30 2002-04-02 Motorola, Inc. Method and apparatus for suppressing acoustic background noise in a communication system by equaliztion of pre-and post-comb-filtered subband spectral energies
JP4282227B2 (en) * 2000-12-28 2009-06-17 日本電気株式会社 Noise removal method and apparatus
US7617099B2 (en) * 2001-02-12 2009-11-10 FortMedia Inc. Noise suppression by two-channel tandem spectrum modification for speech signal in an automobile
WO2003001173A1 (en) * 2001-06-22 2003-01-03 Rti Tech Pte Ltd A noise-stripping device
US7283956B2 (en) * 2002-09-18 2007-10-16 Motorola, Inc. Noise suppression
US7454332B2 (en) * 2004-06-15 2008-11-18 Microsoft Corporation Gain constrained noise suppression

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6862567B1 (en) 2000-08-30 2005-03-01 Mindspeed Technologies, Inc. Noise suppression in the frequency domain by adjusting gain according to voicing parameters

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP2008379A4

Also Published As

Publication number Publication date
EP2008379A2 (en) 2008-12-31
US20070237271A1 (en) 2007-10-11
WO2007117785A3 (en) 2008-05-08
EP2008379A4 (en) 2010-09-22
US7555075B2 (en) 2009-06-30
EP2008379B1 (en) 2012-06-27

Similar Documents

Publication Publication Date Title
US7555075B2 (en) Adjustable noise suppression system
JP4423300B2 (en) Noise suppressor
KR100851716B1 (en) Noise suppression based on bark band weiner filtering and modified doblinger noise estimate
EP2143204B1 (en) Automatic volume and dynamic range adjustment for mobile audio devices
EP1252796B1 (en) System and method for dual microphone signal noise reduction using spectral subtraction
JP3568922B2 (en) Echo processing device
US5544250A (en) Noise suppression system and method therefor
EP2244254B1 (en) Ambient noise compensation system robust to high excitation noise
US8521530B1 (en) System and method for enhancing a monaural audio signal
US8085930B2 (en) Communication system
EP0790599B1 (en) A noise suppressor and method for suppressing background noise in noisy speech, and a mobile station
JP4836720B2 (en) Noise suppressor
WO2008121436A1 (en) Method and apparatus for quickly detecting a presence of abrupt noise and updating a noise estimate
WO2006001960A1 (en) Comfort noise generator using modified doblinger noise estimate
JP2014232331A (en) System and method for adaptive intelligent noise suppression
JP4018571B2 (en) Speech enhancement device
KR20070085729A (en) Noise reduction and comfort noise gain control using bark band weiner filter and linear attenuation
US7260209B2 (en) Methods and apparatus for improving voice quality in an environment with noise
EP2987314B1 (en) Echo suppression
JP2004341339A (en) Noise restriction device
CN110136734B (en) Method and audio noise suppressor for reducing musical artifacts using nonlinear gain smoothing
WO2017196382A1 (en) Enhanced de-esser for in-car communication systems
JP2003044087A (en) Device and method for suppressing noise, voice identifying device, communication equipment and hearing aid
Premananda et al. Speech enhancement to overcome the effect of near-end noise in mobile phones using psychoacoustics
JP2003517761A (en) Method and apparatus for suppressing acoustic background noise in a communication system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07757694

Country of ref document: EP

Kind code of ref document: A2

WWE Wipo information: entry into national phase

Ref document number: 2007757694

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE