WO2011133405A1 - Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system - Google Patents
Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system Download PDFInfo
- Publication number
- WO2011133405A1 WO2011133405A1 PCT/US2011/032578 US2011032578W WO2011133405A1 WO 2011133405 A1 WO2011133405 A1 WO 2011133405A1 US 2011032578 W US2011032578 W US 2011032578W WO 2011133405 A1 WO2011133405 A1 WO 2011133405A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- noise
- sub
- signal
- band signal
- level
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B15/00—Suppression or limitation of noise or interference
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers
- H04R3/002—Damping circuit arrangements for transducers, e.g. motional feedback circuits
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02087—Noise filtering the noise being separate speech, e.g. cocktail party
Definitions
- the present invention relates generally to audio processing, and more particularly to adaptive noise reduction of an audio signal.
- the stationary noise suppression system will always provide an output noise that is a fixed amount lower than the input noise.
- the noise suppression is in the range of 12-13 decibels (dB).
- the noise suppression is fixed to this conservative level in order to avoid producing speech loss distortion, which will be apparent with higher noise suppression.
- SNR signal-to-noise ratios
- an enhancement filter may be derived based on an estimate of a noise spectrum.
- One common enhancement filter is the Wiener filter.
- the enhancement filter is typically configured to minimize certain mathematical error quantities, without taking into account a user's perception.
- a certain amount of speech degradation is introduced as a side effect of the signal enhancement which suppress noise.
- speech components that are lower in energy than the noise typically end up being suppressed by the enhancement filter, which results in a modification of the output speech spectrum that is perceived as speech distortion.
- This speech degradation will become more severe as the noise level rises and more speech components are attenuated by the enhancement filter. That is, as the SNR gets lower, typically more speech components are buried in noise or interpreted as noise, and thus there is more resulting speech loss distortion. This introduces more speech loss distortion and speech degradation.
- the present technology provides adaptive noise reduction of an acoustic signal using a sophisticated level of control to balance the tradeoff between speech loss distortion and noise reduction.
- the energy level of a noise component in a sub- band signal of the acoustic signal is reduced based on an estimated signal-to-noise ratio of the sub-band signal, and further on an estimated threshold level of speech distortion in the sub-band signal.
- the energy level of the noise component in the sub-band signal may be reduced to no less than a residual noise target level.
- a target level may be defined as a level at which the noise component ceases to be perceptible.
- a method for reducing noise within an acoustic signal as described herein includes receiving an acoustic signal and separating the acoustic signal into a plurality of sub-band signals. A reduction value is then applied to a sub-band signal in the plurality of sub-band signals to reduce an energy level of a noise component in the sub-band signal. The reduction value is based on an estimated signal-to-noise ratio of the sub-band signal, and further based on an estimated threshold level of speech loss distortion in the sub-band signal.
- a system for reducing noise within an acoustic signal as described herein includes a frequency analysis module stored in memory and executed by a processor to receive an acoustic signal and separate the acoustic signal into a plurality of sub- band signals.
- the system also includes a noise reduction module stored in memory and executed by a processor to apply a reduction value to a sub-band signal in the plurality of sub-band signals to reduce an energy level of a noise component in the sub-band signal.
- the reduction value is based on an estimated signal-to-noise ratio of the sub-band signal, and further based on an estimated threshold level of speech loss distortion in the sub-band signal.
- a computer readable storage medium as described herein has embodied thereon a program executable by a processor to perform a method for reducing noise within an acoustic signal as described above.
- Figure 1 is an illustration of an environment in which embodiments of the present technology may be used.
- Figure 2 is a block diagram of an exemplary audio device.
- Figure 3 is a block diagram of an exemplary audio processing system.
- Figure 4 is a block diagram of an exemplary mask generator module.
- Figure 5 is an illustration of exemplary look-up tables for maximum suppression values.
- Figure 6 illustrates exemplary suppression values for different levels of speech loss distortion.
- Figure 7 is an illustration of the final gain lower bound across the sub- bands.
- Figure 8 is a flowchart of an exemplary method for performing noise reduction for an acoustic signal.
- Figure 9 is a flowchart of an exemplary method for performing noise suppression for an acoustic signal.
- the present technology provides adaptive noise reduction of an acoustic signal using a sophisticated level of control to balance the tradeoff between speech loss distortion and noise reduction.
- Noise reduction may be performed by applying reduction values (e.g., subtraction values and/or multiplying gain masks) to corresponding sub-band signals of the acoustic signal, while also limiting the speech loss distortion introduced by the noise reduction to an acceptable threshold level.
- the reduction values and thus noise reduction performed can vary across sub-band signals.
- the noise reduction may be based upon the characteristics of the individual sub-band signals, as well as by the perceived speech loss distortion introduced by the noise reduction.
- the noise reduction may be performed to jointly optimize noise reduction and voice quality in an audio signal.
- the present technology provides a lower bound (i.e., lower threshold) for the amount of noise reduction performed in a sub-band signal.
- the noise reduction lower bound serves to limit the amount of speech loss distortion within the sub-band signal. As a result, a large amount of noise reduction may be performed in a sub- band signal when possible.
- the noise reduction may be smaller when conditions such as an unacceptably high speech loss distortion do not allow for a large amount of noise reduction.
- Noise reduction performed by the present system may be in the form of noise suppression and/or noise cancellation.
- the present system may generate reduction values applied to primary acoustic sub-band signals to achieve noise reduction.
- the reduction values may be implemented as a gain mask multiplied with sub-band signals to suppress the energy levels of noise components in the sub- band signals.
- the multiplicative process is referred to as multiplicative noise suppression.
- the reduction values can be derived as a lower bound for the amount of noise cancellation performed in a sub-band signal by subtracting a noise reference sub-band signal from the mixture sub-band signal.
- the present system may reduce the energy level of the noise component in the sub-band to no less than a residual noise target level.
- the residual noise target level may be fixed or slowly time-varying, and in some embodiments is the same for each sub-band signal.
- the residual noise target level may for example be defined as a level at which the noise component ceases to be audible or perceptible, or below a self-noise level of a microphone used to capture the acoustic signal.
- the residual noise target level may be below a noise gate of a component such as an internal AGC noise gate or baseband noise gate within a system used to perform the noise reduction techniques described herein.
- Some prior art systems invoke a generalized side-lobe canceller.
- the generalized side-lobe canceller is used to identify desired signals and interfering signals included by a received signal.
- the desired signals propagate from a desired location and the interfering signals propagate from other locations.
- the interfering signals are subtracted from the received signal with the intention of cancelling the interference. This subtraction can also introduce speech loss distortion and speech degradation.
- Embodiments of the present technology may be practiced on any audio device that is configured to receive and/or provide audio such as, but not limited to, cellular phones, phone handsets, headsets, and conferencing systems. While some embodiments of the present technology will be described in reference to operation on a cellular phone, the present technology may be practiced on any audio device.
- FIG. 1 is an illustration of an environment in which embodiments of the present technology may be used.
- a user may act as an audio (speech) source 102 to an audio device 104.
- the exemplary audio device 104 includes two microphones: a primary microphone 106 relative to the audio source 102 and a secondary microphone 106 relative to the audio source 102 and a secondary microphone 106 relative to the audio source 102 and a secondary microphone 106 relative to the audio source 102 and a secondary microphones.
- the audio device 104 may include a single microphone.
- the audio device 104 may include more than two microphones, such as for example three, four, five, six, seven, eight, nine, ten or even more microphones.
- the primary microphonel06 and secondary microphone 108 may be omni-directional microphones. Alternatively embodiments may utilize other forms of microphones or acoustic sensors. [0028] While the microphones 106 and 108 receive sound (i.e. acoustic signals) from the audio source 102, the microphones 106 and 108 also pick up noise 110.
- the noise 110 may include any sounds from one or more locations that differ from the location of audio source 102, and may include reverberations and echoes.
- the noise 110 may be stationary, non-stationary, and/or a combination of both stationary and non- stationary noise.
- Some embodiments may utilize level differences (e.g. energy differences) between the acoustic signals received by the two microphones 106 and 108. Because the primary microphone 106 is much closer to the audio source 102 than the secondary microphone 108, the intensity level is higher for the primary microphone 106, resulting in a larger energy level received by the primary microphone 106 during a speech/voice segment, for example.
- level differences e.g. energy differences
- the level difference may then be used to discriminate speech and noise in the time-frequency domain. Further embodiments may use a combination of energy level differences and time delays to discriminate speech. Based on binaural cue encoding, speech signal extraction or speech enhancement may be performed.
- FIG. 2 is a block diagram of an exemplary audio device 104.
- the audio device 104 includes a receiver 200, a processor 202, the primary microphone 106, an optional secondary microphone 108, an audio processing system 210, and an output device 206.
- the audio device 104 may include further or other components necessary for audio device 104 operations.
- the audio device 104 may include fewer components that perform similar or equivalent functions to those depicted in Figure 2.
- Processor 202 may execute instructions and modules stored in a memory (not illustrated in Figure 2) in the audio device 104 to perform functionality described herein, including noise suppression for an acoustic signal.
- Processor 202 may include hardware and software implemented as a processing unit, which may process floating point operations and other operations for the processor 202.
- the exemplary receiver 200 is an acoustic sensor configured to receive a signal from a communications network.
- the receiver 200 may include an antenna device.
- the signal may then be forwarded to the audio processing system 210 to reduce noise using the techniques described herein, and provide an audio signal to the output device 206.
- the present technology may be used in one or both of the transmit and receive paths of the audio device 104.
- the audio processing system 210 is configured to receive the acoustic signals from an acoustic source via the primary microphone 106 and secondary microphone 108 and process the acoustic signals. Processing may include performing noise reduction within an acoustic signal.
- the audio processing system 210 is discussed in more detail below.
- the primary and secondary microphones 106, 108 may be spaced a distance apart in order to allow for detection of an energy level difference between them.
- the acoustic signals received by primary microphone 106 and secondary microphone 108 may be converted into electrical signals (i.e. a primary electrical signal and a secondary electrical signal).
- the electrical signals may themselves be converted by an analog-to-digital converter (not shown) into digital signals for processing in accordance with some embodiments.
- the acoustic signal received by the primary microphone 106 is herein referred to as the primary acoustic signal
- the acoustic signal received from by the secondary microphone 108 is herein referred to as the secondary acoustic signal.
- the primary acoustic signal and the secondary acoustic signal may be processed by the audio processing system 210 to produce a signal with an improved signal-to-noise ratio. It should be noted that embodiments of the technology described herein may be practiced utilizing only the primary microphone 106.
- the output device 206 is any device which provides an audio output to the user.
- the output device 206 may include a speaker, an earpiece of a headset or handset, or a speaker on a conference device.
- a beamforming technique may be used to simulate forwards-facing and backwards- facing directional microphones.
- the level difference may be used to discriminate speech and noise in the time-frequency domain which can be used in noise reduction.
- FIG. 3 is a block diagram of an exemplary audio processing system 210 for performing noise reduction as described herein.
- the audio processing system 210 is embodied within a memory device within audio device 104.
- the audio processing system 210 may include a frequency analysis module 302, a feature extraction module 304, a source inference engine module 306, mask generator module 308, noise canceller (NPNS) module 310, modifier module 312, and reconstructor module 314.
- the mask generator module 308 in conjunction with the modifier module 312 and the noise canceller module 310 is also referred to herein as a noise reduction module.
- Audio processing system 210 may include more or fewer components than illustrated in Figure 3, and the functionality of modules may be combined or expanded into fewer or additional modules. Exemplary lines of communication are illustrated between various modules of Figure 3, and in other figures herein. The lines of communication are not intended to limit which modules are communicatively coupled with others, nor are they intended to limit the number of and type of signals communicated between modules.
- acoustic signals received from the primary microphone 106 and second microphone 108 are converted to electrical signals, and the electrical signals are processed through frequency analysis module 302.
- the frequency analysis module 302 takes the acoustic signals and mimics the frequency analysis of the cochlea (e.g., cochlear domain), simulated by a filter bank.
- the frequency analysis module 302 separates each of the primary and secondary acoustic signals into two or more frequency sub-band signals.
- a sub-band signal is the result of a filtering operation on an input signal, where the bandwidth of the filter is narrower than the bandwidth of the signal received by the frequency analysis module 302.
- a sub-band analysis on the acoustic signal determines what individual frequencies are present in each sub-band of the complex acoustic signal during a frame (e.g. a predetermined period of time). For example, the length of a frame may be 4 ms, 8 ms, or some other length of time. In some embodiments there may be no frame at all.
- the results may include sub-band signals in a fast cochlea transform (FCT) domain.
- FCT fast cochlea transform
- the sub-band frame signals are provided from frequency analysis module 302 to an analysis path sub-system 320 and to a signal path sub-system 330.
- the analysis path sub-system 320 may process the signal to identify signal features, distinguish between speech components and noise components of the sub-band signals, and generate a signal modifier.
- the signal path sub-system 330 is
- the modification may reduce noise and to preserve the desired speech components in the sub-band signals.
- Signal path sub-system 330 includes NPNS module 310 and modifier module 312.
- NPNS module 310 receives sub-band frame signals from frequency analysis module 302.
- NPNS module 310 may subtract (i.e., cancel) noise component from one or more sub-band signals of the primary acoustic signal.
- NPNS module 310 may ouptut sub-band estimates of noise components in the primary signal and sub-band estimates of speech components in the form of noise-subtracted sub-band signals.
- NPNS module 310 may be implemented in a variety of ways. In some embodiments, NPNS module 310 may be implemented with a single NPNS module. Alternatively, NPNS module 310 may include two or more NPNS modules, which may be arranged for example in a cascaded fashion.
- NPNS module 310 can provide noise cancellation for two-microphone configurations, for example based on source location, by utilizing a subtractive algorithm. It can also be used to provide echo cancellation. Since noise and echo cancellation can usually be achieved with little or no voice quality degradation, processing performed by NPNS module 310 may result in an increased SNR in the primary acoustic signal received by subsequent post-filtering and multiplicative stages. The amount of noise cancellation performed may depend on the diffuseness of the noise source and the distance between microphones. These both contribute towards the coherence of the noise between the microphones, with greater coherence resulting in better cancellation.
- the feature extraction module 304 of the analysis path sub-system 320 receives the sub-band frame signals derived from the primary and secondary acoustic signals provided by frequency analysis module 302.
- Feature extraction module 304 receives the output of NPNS module 310 and computes frame energy estimations of the sub-band signals, inter-microphone level difference (ILD) between the primary acoustic signal and the secondary acoustic signal, self-noise estimates for the primary and second microphones.
- Feature extraction module 304 may also compute other monaural or binaural features which may be required by other modules, such as pitch estimates and cross-correlations between microphone signals.
- the feature extraction module 304 may both provide inputs to and process outputs from NPNS module 310.
- Feature extraction module 304 may compute energy levels for the sub- band signals of the primary and secondary acoustic signal and an inter-microphone level difference (ILD) from the energy levels.
- the ILD may be determined by an ILD module within feature extraction module 304.
- Source inference engine module 306 may process the frame energy estimations to compute noise estimates and may derive models of the noise and speech in the sub-band signals.
- Source inference engine module 306 adaptively estimates attributes of the acoustic sources, such as their energy spectra of the output signal of the NPNS module 310. The energy spectra attribute may be used to generate a multiplicative mask in mask generator module 308.
- the source inference engine module 306 may receive the ILD from the feature extraction module 304 and track the ILD probability distributions or
- clusters of the target audio source 102, background noise and optionally echo.
- the classification boundary or dominance threshold is used to classify the signal as speech if the SNR is sufficiently positive or as noise if the SNR is sufficiently negative. This classification may be determined per sub-band and time-frame as a dominance mask, and output by a cluster tracker module to a noise estimator module within the source inference engine module 306.
- the cluster tracker module may generate a noise/speech classification signal per sub-band and provide the classification to NPNS module 310.
- the classification is a control signal indicating the differentiation between noise and speech.
- NPNS module 310 may utilize the classification signals to estimate noise in received microphone energy estimate signals.
- the results of cluster tracker module may be forwarded to the noise estimate module within the source inference engine module 306.
- a current noise estimate along with locations in the energy spectrum where the noise may be located are provided for processing a noise signal within audio processing system 210.
- Source inference engine module 306 may include a noise estimate module which may receive a noise/speech classification control signal from the cluster tracker module and the output of NPNS module 310 to estimate the noise N(t,w).
- the noise estimate determined by noise estimate module is provided to mask generator module 308.
- mask generator module 308 receives the noise estimate output of NPNS module 310 and an output of the cluster tracker module.
- the noise estimate module in the source inference engine module 306 may include an ILD noise estimator, and a stationary noise estimator.
- the noise estimates are combined with a max() operation, so that the noise suppression performance resulting from the combined noise estimate is at least that of the individual noise estimates.
- the ILD noise estimate is derived from the dominance mask and NPNS module 310 output signal energy.
- the mask generator module 308 receives models of the sub-band speech components and noise components as estimated by the source inference engine module 306. Noise estimates of the noise spectrum for each sub-band signal may be subtracted out of the energy estimate of the primary spectrum to infer a speech spectrum. Mask generator module 308 may determine a gain mask for the sub-band signals of the primary acoustic signal and provide the gain mask to modifier module 312. The modifier module 312 multiplies the gain masks to the noise-subtracted sub- band signals of the primary acoustic signal output by the NPNS module 310.
- Applying the mask reduces energy levels of noise components in the sub-band signals of the primary acoustic signal and performs noise reduction.
- the values of the gain mask output from mask generator module 308 are time and sub-band signal dependent and optimize noise reduction on a per sub-band basis.
- the noise reduction may be subject to the constraint that the speech loss distortion complies with a tolerable threshold limit.
- the threshold limit may be based on many factors, such as for example a voice quality optimized suppression (VQOS) level.
- VQOS level is an estimated maximum threshold level of speech loss distortion in the sub-band signal introduced by the noise reduction.
- the VQOS is tunable and takes into account the properties of the sub-band signal, thereby providing full design flexibility for system and acoustic designers.
- a lower bound for the amount of noise reduction performed in a sub-band signal is determined subject to the VQOS threshold, thereby limiting the amount of speech loss distortion of the sub-band signal. As a result, a large amount of noise reduction may be performed in a sub-band signal when possible. The noise reduction may be smaller when conditions such as unacceptably high speech loss distortion do not allow for the large amount of noise reduction.
- the energy level of the noise component in the sub-band signal may be reduced to no less than a residual noise target level.
- the residual noise target level may be fixed or slowly time- varying.
- the residual noise target level is the same for each sub-band signal.
- Such a target level may for example be a level at which the noise component ceases to be audible or perceptible, or below a self-noise level of a microphone used to capture the primary acoustic signal.
- the residual noise target level may be below a noise gate of a component such as an internal AGC noise gate or baseband noise gate within a system implementing the noise reduction techniques described herein.
- Reconstructor module 314 may convert the masked frequency sub-band signals from the cochlea domain back into the time domain.
- the conversion may include adding the masked frequency sub-band signals and phase shifted signals.
- the conversion may include multiplying the masked frequency sub- band signals with an inverse frequency of the cochlea channels.
- the synthesized acoustic signal may be output to the user via output device 206 and/or provided to a codec for encoding.
- additional post-processing of the synthesized time domain acoustic signal may be performed.
- comfort noise generated by a comfort noise generator may be added to the synthesized acoustic signal prior to providing the signal to the user.
- Comfort noise may be a uniform constant noise that is not usually discernable to a listener (e.g., pink noise). This comfort noise may be added to the synthesized acoustic signal to enforce a threshold of audibility and to mask low-level non-stationary output noise components.
- the comfort noise level may be chosen to be just above a threshold of audibility and may be settable by a user.
- the mask generator module 308 may have access to the level of comfort noise in order to generate gain masks that will suppress the noise to a level at or below the comfort noise.
- the system of FIG. 3 may process several types of signals processed by an audio device. The system may be applied to acoustic signals received via one or more microphones. The system may also process signals, such as a digital Rx signal, received through an antenna or other connection.
- FIG. 4 is an exemplary block diagram of the mask generator module 308.
- the mask generator module 308 may include a Wiener filter module 400, mask smoother module 402, signal-to-noise (SNR) ratio estimator module 404, VQOS mapper module 406, residual noise target suppressor (RNTS) estimator module 408, and a gain moderator module 410.
- Mask generator module 308 may include more or fewer components than those illustrated in Figure 4, and the functionality of modules may be combined or expanded into fewer or additional modules.
- the Wiener filter module 400 calculates Wiener filter gain mask values, ⁇ t, ⁇ )) , for each sub-band signal of the primary acoustic signal.
- the gain mask values may be based on the noise and speech short-term power spectral densities during time frame t and sub-band signal index ⁇ . This can be represented mathematically as:
- Ps is the estimated power spectral density of speech in the sub-band signal ⁇ of the primary acoustic signal during time frame t.
- Pn is the estimated power spectral density of the noise in the sub-band signal ⁇ of the primary acoustic signal during time frame t.
- Pn may be calculated by source inference engine module 306. Ps may be computed mathematically as:
- a s is the forgetting factor of a 1 st order recursive IIR filter or leaky integrator.
- P y is the power spectral density of the primary acoustic signal output by the NPNS module 310 as described above.
- the Wiener filter gain mask values, G wf ⁇ t, CO) derived from the speech and noise estimates may not be optimal from a perceptual sense. That is, the Wiener filter may typically be configured to minimize certain mathematical error quantities, without taking into account a user's perception of any resulting speech distortion. As a result, a certain amount of speech distortion may be introduced as a side effect of noise suppression using the Wiener filter gain mask values.
- speech components that are lower in energy than the noise typically end up being suppressed by the noise suppressor, which results in a modification of the output speech spectrum that is perceived as speech distortion.
- This speech degradation will become more severe as the noise level rises and more speech components are attenuated by the noise suppressor. That is, as the SNR gets lower, typically more speech components are buried in noise or interpreted as noise, and thus there is more resulting speech loss distortion.
- determining an initial gain value based on the speech and noise PSD may be utilized.
- the Wiener gain values may be lower bounded using a perceptually- derived gain lower bound, G lb (t, 60) :
- G n (t, 60) is the noise suppression mask
- G lb (t, 60) is a complex function of the instantaneous SNR in that sub-band signal, frequency, power and VQOS level.
- the gain lower bound is derived utilizing both the VQOS mapper module 406 and the RNTS estimator module 408 as discussed below.
- Wiener filter module 400 may also includes a global voice activity detector (VAD), and a sub-band VAD for each sub-band or "VAD mask".
- VAD global voice activity detector
- sub-band VAD mask can be used by mask generator module 308, e.g. within the mask smoother module 402, and outside of the mask generator module 308, e.g. an Automatic Gain Control (AGC).
- AGC Automatic Gain Control
- VAD(t) (n(t) > n l ) - (n(t) ⁇ n 2 )
- g l is a gain threshold
- ⁇ ⁇ and n 2 are thresholds on the number of sub-bands where the VAD mask must indicate active speech
- n l > n 2 .
- VAD and VAD mask are derived from the wiener filter gain, they are independent of the gain lower bound and VQOS level. This is advantageous, for example, in obtaining similar AGC behavior even as the amount of noise suppression varies.
- the SNR estimator module 404 receives energy estimations of a noise component and speech component in a particular sub-band and calculates the SNR per sub-band signal of the primary acoustic signal.
- the calculated per sub-band SNR is provided to and used by VQOS mapper module 406 and RNTS estimator module 408 to compute the perceptually-derived gain lower bound as described below.
- the SNR estimator module 404 calculates instantaneous SNR as the ratio of long-term peak speech energy, P s (t, CO) , to the instantaneous noise energy, P n (t, CO) :
- P s (t, CO) can be determined using one or more of mechanisms based upon the input instantaneous speech power estimate and noise power estimate P n (t, CO) .
- the mechanisms may include a peak speech level tracker, average speech energy in the highest x dB of the speech signal's dynamic range, reset the speech level tracker after sudden drop in speech level, e.g. after shouting, apply lower bound to speech estimate at low frequencies (which may be below the fundamental component of the talker), smooth speech power and noise power across sub-bands, and add fixed biases to the speech power estimates and SNR so that they match the correct values for a set of oracle mixtures.
- the SNR estimator module 404 can also calculate a global SNR (across all sub-band signals). This may be useful in other modules within the system 210, or may be configured as an output API of the OS for controlling other functions of the audio device 104.
- the VQOS mapper module 406 determines the minimum gain lower bound for each sub-band signal, G lb (t, CO) .
- the minimum gain lower bound is subject to the constraint that the introduced perceptual speech loss distortion should be no more than a tolerable threshold level as determined by the specified VQOS level.
- the maximum suppression value (inverse of G lb (t, CO) ), varies across the sub-band signals and is determined based on the frequency and SNR of each sub-band signal, and the VQOS level.
- the minimum gain lower bound for each sub-band signal can be represented mathematically as: G lb (t, CO) ⁇ f(VQOS, CO, SNR(t, CO) ) .
- the VQOS level defines the maximum tolerable speech loss distortion.
- the VQOS level can be selectable or tunable from among a number of threshold levels of speech distortion.
- the VQOS level takes into account the properties of the primary acoustic signal and provides full design flexibility for systems and acoustic designers.
- the minimum gain lower bound for each sub-band signal, G lb (t, CO) is determined using look-up tables stored in memory in the audio device 104.
- the look-up tables can be generated empirically using subjective speech quality assessment tests. For example, listeners can rate the level of speech loss distortion (VQOS level) of audio signals for various suppression levels and signal-to- noise ratios. These ratings can then be used to generate the look-up tables as a subjective measure of audio signal quality.
- Alternative techniques such as the use of objective measures for estimating audio signal quality using computerized techniques, may also be used to generate the look-up tables in some embodiments.
- the levels of speech loss distortion may be defined as:
- VQOS level 0 corresponds to zero suppression, so it is effectively a bypass of the noise suppressor.
- the look-up tables for VQOS levels between the above identified levels, such as VQOS level 5 between VQOS levels 4 and 6, can be determined by interpolation between the levels.
- the levels of speech distortion may also extend beyond excessive speech distortion. Since VQOS level 10 represents excessive speech distortion in the above example, each level higher than 10 may be represented as a fixed number of dB extra noise suppression, such as 3 dB.
- Figure 5 is an illustration of exemplary look-up tables for maximum suppression values (inverse of minimum G lb (t, 0)) ) for VQOS levels of 2, 4, 6, 8 and
- the tables indicate the maximum achievable suppression value before a certain level of speech distortion is obtained, as indicated by the title of each table illustrated in Figure 5. For example, for a signal-to-noise ratio of 18 dB, a sub-band center frequency of 0.5 kHz, and VQOS level 2, the maximum achievable suppression value is about 18 dB.As the suppression value is increased above 18 dB, the speech distortion is more than "No perceptible speech distortion.” As described above, the values in the look-up tables can be determined empirically, and can vary from embodiment to embodiment.
- the VQOS mapper module 406 is based on a perceptual model that maintains the speech loss distortion below some tolerable threshold level whilst at the same time maximizing the amount of suppression across SNRs and noise types. As a result, a large amount of noise suppression may be performed in a sub- band signal when possible. The noise suppression may be smaller when conditions such as unacceptably high speech loss distortion do not allow for the large amount of noise reduction.
- the RNTS estimator module 408 determines the final gain lower bound, G lb (t, O)) .
- the minimum gain lower bound , G lb (t, O)) provided by the VQOS mapper module 406 is subject to the constraint that the energy level of the noise component in each sub-band signal is reduced to no less than a residual noise target level (RNTL).
- RNTL residual noise target level
- minimum gain lower bound provided by the VQOS mapper module 406 may be lower than necessary to render the residual noise below the RNTL.
- the RNTS estimator module 408 limits the minimum gain lower bound, thereby backing off on the suppression and the resulting speech loss distortion. For example, a first value for the gain lower bound may be determined based exclusively on the estimated SNR and the VQOS level. A second value for the gain lower bound may be determined based on reducing the energy level of the noise component in the sub-band signal to the RNTL. The final GLB, G lb (t, CO) , can then be determined by selecting the smaller of the two suppression values.
- the final gain lower bound can be further limited so that the maximum suppression applied does not result in the noise being reduced if the energy level P n (t, 0)) of the noise component is below the energy level P mtl (t, 0)) of the RNTL.
- the final gain lower bound is unity.
- the final gain lower bound can be represented mathematically as:
- the residual noise may be audible, since the gain lower bound is generally lower bounded to avoid excessive speech loss distortion, as discussed above with respect to the VQOS mapper module 406.
- the residual noise may be rendered completely inaudible; in fact the minimum gain lower bound provided by the VQOS mapper module 406 may be lower than necessary to render the noise inaudible.
- using the minimum gain lower bound provided by the VQOS mapper module 406 may result in more speech loss distortion than is necessary to achieve the objective that the residual noise is below the RNTL.
- the RNTS estimator module 408 limits the minimum GLB, thereby backing off on the suppression.
- the choice of RNTL depends on the objective of the system.
- the RNTL may be static or adaptive, frequency dependent or a scalar, or computed at calibration time or settable through optional device dependent parameters or application program interface (API).
- API application program interface
- the RNTL is the same for each sub-band signal.
- the RNTL may for example be defined as a level at which the noise component ceases to be perceptible, or below a self-noise level energy estimate Pmsn of the primary microphone 106 used to capture the primary acoustic signal device.
- the self-noise level energy estimate can be pre-calibrated or derived by the feature extraction module 304.
- the RNTL may be below a noise gate of a component such as an internal AGC noise gate or baseband noise gate within a system used to perform the noise reduction techniques described herein.
- the residual noise is "whitened", i.e. it has a smoother and more constant magnitude spectrum over time, so that is sounds less annoying and more like comfort noise.
- the "whitening" effect results in less modulation over time being introduced. If the codec is receiving residual noise which is modulating a lot over time, the codec may incorrectly identify and encode some of the residual noise as speech, resulting in audible bursts of noise being injected into the noise reduced signal.
- the reduction in modulation over time also reduces the amount of MIPS needed to encode the signal, which saves power.
- the reduction in modulation over time further results in less bits per frame for the encoded signal, which also reduces the power needed to transmit the encoded signal and effectively increases network capacity used for a network carrying the encoded signal.
- Figure 6 illustrates exemplary suppression values as a function of sub- band SNR for different VQOS levels.
- exemplary suppression values are illustrated for sub-band signals having center frequencies of 0.2 kHz, 1 kHz and 5 kHz respectively.
- the exemplary suppression values are the inverse of the final gain lower bound, G lb (t, CO) as output from residual noise target suppressor estimator module 408.
- the sloped dashed lines labeled RNTS in each plot in Figure 6 indicate the minimum suppression necessary to place the residual noise for each sub-band signal below a given residual noise target level.
- the residual noise target level in this particular example is spectrally flat.
- the solid lines are the actual suppression values for each sub-band signal as determined by residual noise target suppressor estimator module 408 .
- the dashed lines extending from the solid lines and above the lines labeled RNTS show the suppression values for each sub-band signal in the absence of the residual noise target level constraint imposed by RNTS estimator module 408.
- the suppression value in the illustrated example would be about 48 dB for a VQOS level of 2, an SNR of 24 dB, and a sub-band center frequency of 0.2 kHz.
- the final suppression value is about 26 dB.
- suppression at high SNR values is bounded by residual noise target level imposed by the RNTS estimator module 408.
- moderate SNR values relatively high suppression can be applied before reaching the acceptable speech loss distortion threshold level.
- the suppression is largely bounded by the speech loss distortion introduced by the noise reduction, so the suppression is relatively small.
- Figure 7 is an illustration of the final gain lower bound, G lb (t, CO) across the sub-bands, for an exemplary input speech power spectrum 700, noise power spectrum 710, and RNTL 720.
- the final gain lower bound at frequency fl is limited to a suppression value less than that necessary to reduce the noise power 710 to the RNTL 720.
- the residual noise power at fl is above the RNTL 720.
- the final gain lower bound at frequency f2 results in a suppression of the noise power 710 down to the RNTL 720, and thus is limited by the residual noise target suppressor estimator module 408 using the techniques described above.
- the noise power 710 is less than the RNTL 720.
- the final gain lower bound is unity so that no suppression is applied and the noise power 710 is not changed.
- the wiener gain values from the wiener filter module 400 are also provided to the optional mask smoother module 402.
- the mask smoother module 402 performs temporal smoothing of the wiener gain values, which helps to reduce the musical noise.
- the wiener gain values may change quickly (e.g. from one frame to the next) and speech and noise estimates can vary greatly between each frame.
- the use of the wiener gain values may result in artifacts (e.g. discontinuities, blips, transients, etc.). Therefore, optional filter smoothing may be performed in the mask smoother module 402 to temporally smooth the Wiener gain values.
- the gain moderator module 410 then maintains a limit, or lower bounds, the smoothed wiener gain values and the gain lower bound provided by the residual noise target suppressor estimator module 408. This is done to moderate the mask so that it does not severely distort speech. This can be represented mathematically as:
- G n (t, ⁇ ) ma ⁇ ( ⁇ , ⁇ G lb (t, ⁇ ))
- the final gain lower bound for each sub-band signal is then provided from the gain moderator module 410 to the modifier module 312.
- the modifier module 312 multiplies the gain lower bounds with the noise- subtracted sub-band signals of the primary acoustic signal (output by the NPNS module 310). This multiplicative process reduces energy levels of noise components in the sub-band signals of the primary acoustic signal, thereby resulting in noise reduction.
- Figure 8 is a flowchart of an exemplary method for performing noise reduction of an acoustic signal. Each step of Figure 8 may be performed in any order, and the method of Figure 8 may include additional or fewer steps than those illustrated.
- acoustic signals are received by the primary microphone 106 and a secondary microphone 108.
- the acoustic signals are converted to digital format for processing.
- acoustic signals are received from more or fewer than two microphones.
- Frequency analysis is then performed on the acoustic signals in step 804 to separate the acoustic signals into sub-band signals.
- the frequency analysis may utilize a filter bank, or for example a discrete Fourier transform or discrete cosine transform.
- step 806 energy spectrums for the sub-band signals of the acoustic signals received at both the primary and second microphones are computed. Once the energy estimates are calculated, inter-microphone level differences (ILD) are computed in step 808. In one embodiment, the ILD is calculated based on the energy estimates (i.e. the energy spectrum) of both the primary and secondary acoustic signals.
- ILD inter-microphone level differences
- Step 810 includes analyzing the received energy estimates and, if available, the ILD to distinguish speech from noise in an acoustic signal.
- noise estimate for each sub-band signal is based on the primary acoustic signal received at the primary microphone 106.
- the noise estimate may be based on the current energy estimate for the sub-band signal of the primary acoustic signal received from the primary microphone 106 and a previously computed noise estimate.
- the noise estimation may be frozen or slowed down when the ILD increases, according to exemplary embodiments.
- step 813 noise cancellation is performed.
- step 814 noise cancellation is performed.
- the noise suppressed acoustic signal may then be output to the user in step 816.
- the digital acoustic signal is converted to an analog signal for output.
- the output may be via a speaker, earpieces, or other similar devices, for example.
- Figure 9 is a flowchart of an exemplary method for performing noise suppression for an acoustic signal. Each step of Figure 9 may be performed in any order, and the method of Figure 9 may include additional or fewer steps than those illustrated. [0097]
- the Wiener filter gain for each sub-band signal is computed at step 900.
- the estimated signal-to-noise ratio of each sub-band signal within the primary acoustic signal is computed at step 901.
- the SNR may be the instantaneous SNR, represented as the ratio of long-term peak speech energy to the instantaneous noise energy.
- the minimum gain lower bound, G lb (t, CO) , for each sub-band signal may be determined based on the estimated SNR for each sub-band signal at step 902.
- the minimum gain lower bound is determined such that the introduced perceptual speech loss distortion is no more than a tolerable threshold level.
- the tolerable threshold level may be determined by the specified VQOS level or based on some other criteria.
- the final gain lower bound is determined for each sub-band signal.
- the final gain lower bound may be determined by limiting the minimum gain lower bounds.
- the final gain lower bound is subject to the constraint that the energy level of the noise component in each sub-band signal is reduced to no less than a residual noise target level.
- step 906 the maximum of final gain lower bound and the Wiener filter gain for each sub-band signal is multiplied by the corresponding noise-subtracted sub-band signals of the primary acoustic signal output by the NPNS module 310.
- the multiplication reduces the level of noise in the noise-subtracted sub-band signals, resulting in noise reduction.
- the masked sub-band signals of the primary acoustic signal are converted back into the time domain.
- Exemplary conversion techniques apply an inverse frequency of the cochlea channel to the masked sub-band signals in order to synthesize the masked sub-band signals.
- additional post-processing may also be performed, such as applying comfort noise.
- the comfort noise is applied via an adder.
- Noise reduction techniques described herein implement the reduction values as gain masks which are multiplied to the sub-band signals to suppress the energy levels of noise components in the sub-band signals. This process is referred to as multiplicative noise suppression.
- the noise reduction techniques described herein can also or alternatively be utilized in subtractive noise cancellation process.
- the reduction values can be derived to provide a lower bound for the amount of noise cancellation performed in a sub-band signal, for example by controlling the value of the cross-fade between an optionally noise cancelled sub-band signal and the original noisy primary sub-band signals.
- This subtractive noise cancellation process can be carried out for example in NPNS module 310.
- the above described modules may be included as instructions that are stored in a storage media such as a machine readable medium (e.g., computer readable medium). These instructions may be retrieved and executed by the processor 202 to perform the functionality discussed herein. Some examples of instructions include software, program code, and firmware. Some examples of storage media include memory devices and integrated circuits.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Circuit For Audible Band Transducer (AREA)
- Soundproofing, Sound Blocking, And Sound Damping (AREA)
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| FI20126083A FI20126083L (fi) | 2010-04-19 | 2011-04-14 | Menetelmä kohinanvaimennuksen ja äänenlaadun yhteiseksi optimoimiseksi yksi- tai monimikrofonijärjestelmässä |
| JP2013506188A JP2013525843A (ja) | 2010-04-19 | 2011-04-14 | 単一又は複数のマイクを備えるシステムにおける雑音低減と音声品質を共に最適化する方法 |
| KR1020127027238A KR20130061673A (ko) | 2010-04-19 | 2011-04-14 | 모노 또는 다중 마이크로폰 시스템 내의 잡음 제거 및 음성 품질을 공동으로 최적화하는 방법 |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US32576410P | 2010-04-19 | 2010-04-19 | |
| US61/325,764 | 2010-04-19 | ||
| US12/832,901 | 2010-07-08 | ||
| US12/832,901 US8473287B2 (en) | 2010-04-19 | 2010-07-08 | Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2011133405A1 true WO2011133405A1 (en) | 2011-10-27 |
Family
ID=44788878
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2011/032578 Ceased WO2011133405A1 (en) | 2010-04-19 | 2011-04-14 | Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system |
Country Status (6)
| Country | Link |
|---|---|
| US (4) | US8473287B2 (https=) |
| JP (1) | JP2013525843A (https=) |
| KR (1) | KR20130061673A (https=) |
| FI (1) | FI20126083L (https=) |
| TW (1) | TW201207845A (https=) |
| WO (1) | WO2011133405A1 (https=) |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8473285B2 (en) | 2010-04-19 | 2013-06-25 | Audience, Inc. | Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system |
| CN104246877A (zh) * | 2012-04-23 | 2014-12-24 | 高通股份有限公司 | 用于音频信号处理的系统和方法 |
| US9343056B1 (en) | 2010-04-27 | 2016-05-17 | Knowles Electronics, Llc | Wind noise detection and suppression |
| US9431023B2 (en) | 2010-07-12 | 2016-08-30 | Knowles Electronics, Llc | Monaural noise suppression based on computational auditory scene analysis |
| US9437180B2 (en) | 2010-01-26 | 2016-09-06 | Knowles Electronics, Llc | Adaptive noise reduction using level cues |
| US9438992B2 (en) | 2010-04-29 | 2016-09-06 | Knowles Electronics, Llc | Multi-microphone robust noise suppression |
| US9830899B1 (en) | 2006-05-25 | 2017-11-28 | Knowles Electronics, Llc | Adaptive noise cancellation |
Families Citing this family (81)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20110125497A1 (en) * | 2009-11-20 | 2011-05-26 | Takahiro Unno | Method and System for Voice Activity Detection |
| US9838784B2 (en) | 2009-12-02 | 2017-12-05 | Knowles Electronics, Llc | Directional audio capture |
| US8798290B1 (en) | 2010-04-21 | 2014-08-05 | Audience, Inc. | Systems and methods for adaptive signal equalization |
| US8798992B2 (en) * | 2010-05-19 | 2014-08-05 | Disney Enterprises, Inc. | Audio noise modification for event broadcasting |
| US9558755B1 (en) | 2010-05-20 | 2017-01-31 | Knowles Electronics, Llc | Noise suppression assisted automatic speech recognition |
| US8768406B2 (en) * | 2010-08-11 | 2014-07-01 | Bone Tone Communications Ltd. | Background sound removal for privacy and personalization use |
| US10353495B2 (en) | 2010-08-20 | 2019-07-16 | Knowles Electronics, Llc | Personalized operation of a mobile device using sensor signatures |
| US9772815B1 (en) | 2013-11-14 | 2017-09-26 | Knowles Electronics, Llc | Personalized operation of a mobile device using acoustic and non-acoustic information |
| EP2686846A4 (en) * | 2011-03-18 | 2015-04-22 | Nokia Corp | DEVICE FOR AUDIO SIGNAL PROCESSING |
| US8972251B2 (en) * | 2011-06-07 | 2015-03-03 | Qualcomm Incorporated | Generating a masking signal on an electronic device |
| EP2590165B1 (en) | 2011-11-07 | 2015-04-29 | Dietmar Ruwisch | Method and apparatus for generating a noise reduced audio signal |
| US9159964B2 (en) | 2012-09-25 | 2015-10-13 | Front Edge Technology, Inc. | Solid state battery having mismatched battery cells |
| US9640194B1 (en) | 2012-10-04 | 2017-05-02 | Knowles Electronics, Llc | Noise suppression for speech processing based on machine-learning mask estimation |
| US9424859B2 (en) * | 2012-11-21 | 2016-08-23 | Harman International Industries Canada Ltd. | System to control audio effect parameters of vocal signals |
| US9330677B2 (en) | 2013-01-07 | 2016-05-03 | Dietmar Ruwisch | Method and apparatus for generating a noise reduced audio signal using a microphone array |
| US9536540B2 (en) | 2013-07-19 | 2017-01-03 | Knowles Electronics, Llc | Speech signal separation and synthesis based on auditory scene analysis and speech modeling |
| CN104347067B (zh) | 2013-08-06 | 2017-04-12 | 华为技术有限公司 | 一种音频信号分类方法和装置 |
| US9508345B1 (en) | 2013-09-24 | 2016-11-29 | Knowles Electronics, Llc | Continuous voice sensing |
| US9781106B1 (en) | 2013-11-20 | 2017-10-03 | Knowles Electronics, Llc | Method for modeling user possession of mobile device for user authentication framework |
| US9953634B1 (en) | 2013-12-17 | 2018-04-24 | Knowles Electronics, Llc | Passive training for automatic speech recognition |
| JP6337519B2 (ja) * | 2014-03-03 | 2018-06-06 | 富士通株式会社 | 音声処理装置、雑音抑圧方法、およびプログラム |
| US9437188B1 (en) | 2014-03-28 | 2016-09-06 | Knowles Electronics, Llc | Buffered reprocessing for multi-microphone automatic speech recognition assist |
| US9500739B2 (en) | 2014-03-28 | 2016-11-22 | Knowles Electronics, Llc | Estimating and tracking multiple attributes of multiple objects from multi-sensor data |
| US10446168B2 (en) * | 2014-04-02 | 2019-10-15 | Plantronics, Inc. | Noise level measurement with mobile devices, location services, and environmental response |
| US9807725B1 (en) | 2014-04-10 | 2017-10-31 | Knowles Electronics, Llc | Determining a spatial relationship between different user contexts |
| AU2015246661A1 (en) * | 2014-04-17 | 2016-12-01 | Cirrus Logic International Semiconductor Limited | Retaining binaural cues when mixing microphone signals |
| WO2016033364A1 (en) | 2014-08-28 | 2016-03-03 | Audience, Inc. | Multi-sourced noise suppression |
| DE112015004185T5 (de) | 2014-09-12 | 2017-06-01 | Knowles Electronics, Llc | Systeme und Verfahren zur Wiederherstellung von Sprachkomponenten |
| US9712915B2 (en) | 2014-11-25 | 2017-07-18 | Knowles Electronics, Llc | Reference microphone for non-linear and time variant echo cancellation |
| DE102015201073A1 (de) | 2015-01-22 | 2016-07-28 | Sivantos Pte. Ltd. | Verfahren und Vorrichtung zur Rauschunterdrückung basierend auf Inter-Subband-Korrelation |
| WO2016123560A1 (en) | 2015-01-30 | 2016-08-04 | Knowles Electronics, Llc | Contextual switching of microphones |
| US9401158B1 (en) | 2015-09-14 | 2016-07-26 | Knowles Electronics, Llc | Microphone signal fusion |
| US10297269B2 (en) | 2015-09-24 | 2019-05-21 | Dolby Laboratories Licensing Corporation | Automatic calculation of gains for mixing narration into pre-recorded content |
| US10403259B2 (en) | 2015-12-04 | 2019-09-03 | Knowles Electronics, Llc | Multi-microphone feedforward active noise cancellation |
| US9779716B2 (en) | 2015-12-30 | 2017-10-03 | Knowles Electronics, Llc | Occlusion reduction and active noise reduction based on seal quality |
| US9830930B2 (en) | 2015-12-30 | 2017-11-28 | Knowles Electronics, Llc | Voice-enhanced awareness mode |
| US20170195811A1 (en) | 2015-12-30 | 2017-07-06 | Knowles Electronics Llc | Audio Monitoring and Adaptation Using Headset Microphones Inside User's Ear Canal |
| WO2017127646A1 (en) | 2016-01-22 | 2017-07-27 | Knowles Electronics, Llc | Shared secret voice authentication |
| US9812149B2 (en) | 2016-01-28 | 2017-11-07 | Knowles Electronics, Llc | Methods and systems for providing consistency in noise reduction during speech and non-speech periods |
| US10157629B2 (en) * | 2016-02-05 | 2018-12-18 | Brainchip Inc. | Low power neuromorphic voice activation system and method |
| US10923132B2 (en) | 2016-02-19 | 2021-02-16 | Dolby Laboratories Licensing Corporation | Diffusivity based sound processing method and apparatus |
| US9820042B1 (en) | 2016-05-02 | 2017-11-14 | Knowles Electronics, Llc | Stereo separation and directional suppression with omni-directional microphones |
| US10249305B2 (en) * | 2016-05-19 | 2019-04-02 | Microsoft Technology Licensing, Llc | Permutation invariant training for talker-independent multi-talker speech separation |
| FR3056813B1 (fr) * | 2016-09-29 | 2019-11-08 | Dolphin Integration | Circuit audio et procede de detection d'activite |
| EP3312838A1 (en) | 2016-10-18 | 2018-04-25 | Fraunhofer Gesellschaft zur Förderung der Angewand | Apparatus and method for processing an audio signal |
| WO2018148095A1 (en) | 2017-02-13 | 2018-08-16 | Knowles Electronics, Llc | Soft-talk audio capture for mobile devices |
| US10224053B2 (en) * | 2017-03-24 | 2019-03-05 | Hyundai Motor Company | Audio signal quality enhancement based on quantitative SNR analysis and adaptive Wiener filtering |
| EP3428918B1 (en) * | 2017-07-11 | 2020-02-12 | Harman Becker Automotive Systems GmbH | Pop noise control |
| US11263522B2 (en) | 2017-09-08 | 2022-03-01 | Analog Devices, Inc. | Analog switched-capacitor neural network |
| US10096311B1 (en) | 2017-09-12 | 2018-10-09 | Plantronics, Inc. | Intelligent soundscape adaptation utilizing mobile devices |
| US20190090052A1 (en) | 2017-09-20 | 2019-03-21 | Knowles Electronics, Llc | Cost effective microphone array design for spatial filtering |
| US10339949B1 (en) | 2017-12-19 | 2019-07-02 | Apple Inc. | Multi-channel speech enhancement |
| US10957337B2 (en) | 2018-04-11 | 2021-03-23 | Microsoft Technology Licensing, Llc | Multi-microphone speech separation |
| CN109003622B (zh) * | 2018-09-11 | 2021-06-04 | 广州小鹏汽车科技有限公司 | 降噪处理方法、装置、收音机及交通工具 |
| CN110517679B (zh) * | 2018-11-15 | 2022-03-08 | 腾讯科技(深圳)有限公司 | 一种人工智能的音频数据处理方法及装置、存储介质 |
| US10771887B2 (en) * | 2018-12-21 | 2020-09-08 | Cisco Technology, Inc. | Anisotropic background audio signal control |
| US11170799B2 (en) * | 2019-02-13 | 2021-11-09 | Harman International Industries, Incorporated | Nonlinear noise reduction system |
| US10964314B2 (en) * | 2019-03-22 | 2021-03-30 | Cirrus Logic, Inc. | System and method for optimized noise reduction in the presence of speech distortion using adaptive microphone array |
| EP3764358B1 (en) | 2019-07-10 | 2024-05-22 | Analog Devices International Unlimited Company | Signal processing methods and systems for beam forming with wind buffeting protection |
| EP3764660B1 (en) | 2019-07-10 | 2023-08-30 | Analog Devices International Unlimited Company | Signal processing methods and systems for adaptive beam forming |
| EP3764664B1 (en) | 2019-07-10 | 2025-02-12 | Analog Devices International Unlimited Company | Signal processing methods and systems for beam forming with microphone tolerance compensation |
| EP3764360B1 (en) | 2019-07-10 | 2024-05-01 | Analog Devices International Unlimited Company | Signal processing methods and systems for beam forming with improved signal to noise ratio |
| EP3764359B1 (en) | 2019-07-10 | 2024-08-28 | Analog Devices International Unlimited Company | Signal processing methods and systems for multi-focus beam-forming |
| US11587575B2 (en) * | 2019-10-11 | 2023-02-21 | Plantronics, Inc. | Hybrid noise suppression |
| US11238853B2 (en) | 2019-10-30 | 2022-02-01 | Comcast Cable Communications, Llc | Keyword-based audio source localization |
| CN113038318B (zh) * | 2019-12-25 | 2022-06-07 | 荣耀终端有限公司 | 一种语音信号处理方法及装置 |
| TWI760676B (zh) * | 2020-01-07 | 2022-04-11 | 瑞昱半導體股份有限公司 | 具有抗噪機制的音訊播放裝置及方法 |
| KR102726759B1 (ko) | 2020-02-10 | 2024-11-06 | 삼성전자 주식회사 | 음질 개선 방법 및 이를 이용한 전자 장치 |
| US20230267947A1 (en) * | 2020-07-31 | 2023-08-24 | Dolby Laboratories Licensing Corporation | Noise reduction using machine learning |
| US11053017B1 (en) * | 2020-08-20 | 2021-07-06 | Kitty Hawk Corporation | Rotor noise reduction using signal processing |
| CN112289333B (zh) * | 2020-12-25 | 2021-04-13 | 北京达佳互联信息技术有限公司 | 语音增强模型的训练方法和装置及语音增强方法和装置 |
| US12112741B2 (en) * | 2021-02-18 | 2024-10-08 | Microsoft Technology Licensing, Llc | System and method for data augmentation and speech processing in dynamic acoustic environments |
| CN113409813B (zh) * | 2021-05-26 | 2023-06-06 | 北京捷通华声科技股份有限公司 | 语音分离方法及装置 |
| US12028684B2 (en) | 2021-07-30 | 2024-07-02 | Starkey Laboratories, Inc. | Spatially differentiated noise reduction for hearing devices |
| CN114420152B (zh) * | 2021-12-08 | 2025-07-08 | 深圳市东微智能科技股份有限公司 | 声音频带调整方法、装置、设备及存储介质 |
| US12579984B2 (en) * | 2022-01-20 | 2026-03-17 | Microsoft Technology Licensing, Llc. | Data augmentation system and method for multi-microphone systems |
| US20230230581A1 (en) * | 2022-01-20 | 2023-07-20 | Nuance Communications, Inc. | Data augmentation system and method for multi-microphone systems |
| US12243514B2 (en) * | 2022-01-20 | 2025-03-04 | Microsoft Technology Licensing, Llc | Data augmentation system and method for multi-microphone systems |
| US12456456B2 (en) | 2022-01-20 | 2025-10-28 | Microsoft Technology Licensing, Llc | Data augmentation system and method for multi-microphone systems |
| CN118843905A (zh) * | 2022-03-23 | 2024-10-25 | 三菱电机株式会社 | 噪声去除装置和方法 |
| WO2025215471A1 (en) * | 2024-04-09 | 2025-10-16 | Cochlear Limited | Distortion reduction in noise cancellation systems |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20040047474A1 (en) * | 2002-04-25 | 2004-03-11 | Gn Resound A/S | Fitting methodology and hearing prosthesis based on signal-to-noise ratio loss data |
| US20090012783A1 (en) * | 2007-07-06 | 2009-01-08 | Audience, Inc. | System and method for adaptive intelligent noise suppression |
| US20090220107A1 (en) * | 2008-02-29 | 2009-09-03 | Audience, Inc. | System and method for providing single microphone noise suppression fallback |
| US20090323982A1 (en) * | 2006-01-30 | 2009-12-31 | Ludger Solbach | System and method for providing noise suppression utilizing null processing noise subtraction |
| US20100067710A1 (en) * | 2008-09-15 | 2010-03-18 | Hendriks Richard C | Noise spectrum tracking in noisy acoustical signals |
Family Cites Families (317)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US3581122A (en) | 1967-10-26 | 1971-05-25 | Bell Telephone Labor Inc | All-pass filter circuit having negative resistance shunting resonant circuit |
| US3989897A (en) | 1974-10-25 | 1976-11-02 | Carver R W | Method and apparatus for reducing noise content in audio signals |
| US4630304A (en) | 1985-07-01 | 1986-12-16 | Motorola, Inc. | Automatic background noise estimator for a noise suppression system |
| US4811404A (en) | 1987-10-01 | 1989-03-07 | Motorola, Inc. | Noise suppression system |
| US4910779A (en) | 1987-10-15 | 1990-03-20 | Cooper Duane H | Head diffraction compensated stereo system with optimal equalization |
| IL84948A0 (en) | 1987-12-25 | 1988-06-30 | D S P Group Israel Ltd | Noise reduction system |
| US4991166A (en) | 1988-10-28 | 1991-02-05 | Shure Brothers Incorporated | Echo reduction circuit |
| US5027306A (en) | 1989-05-12 | 1991-06-25 | Dattorro Jon C | Decimation filter as for a sigma-delta analog-to-digital converter |
| US5050217A (en) | 1990-02-16 | 1991-09-17 | Akg Acoustics, Inc. | Dynamic noise reduction and spectral restoration system |
| US5103229A (en) | 1990-04-23 | 1992-04-07 | General Electric Company | Plural-order sigma-delta analog-to-digital converters using both single-bit and multiple-bit quantization |
| JPH0566795A (ja) | 1991-09-06 | 1993-03-19 | Gijutsu Kenkyu Kumiai Iryo Fukushi Kiki Kenkyusho | 雑音抑圧装置とその調整装置 |
| JP3279612B2 (ja) | 1991-12-06 | 2002-04-30 | ソニー株式会社 | 雑音低減装置 |
| JP3176474B2 (ja) | 1992-06-03 | 2001-06-18 | 沖電気工業株式会社 | 適応ノイズキャンセラ装置 |
| JP2508574B2 (ja) | 1992-11-10 | 1996-06-19 | 日本電気株式会社 | 多チャンネルエコ―除去装置 |
| US5408235A (en) | 1994-03-07 | 1995-04-18 | Intel Corporation | Second order Sigma-Delta based analog to digital converter having superior analog components and having a programmable comb filter coupled to the digital signal processor |
| US5544250A (en) | 1994-07-18 | 1996-08-06 | Motorola | Noise suppression system and method therefor |
| JP3307138B2 (ja) | 1995-02-27 | 2002-07-24 | ソニー株式会社 | 信号符号化方法及び装置、並びに信号復号化方法及び装置 |
| US5828997A (en) | 1995-06-07 | 1998-10-27 | Sensimetrics Corporation | Content analyzer mixing inverse-direction-probability-weighted noise to input signal |
| US5809463A (en) | 1995-09-15 | 1998-09-15 | Hughes Electronics | Method of detecting double talk in an echo canceller |
| US5687104A (en) | 1995-11-17 | 1997-11-11 | Motorola, Inc. | Method and apparatus for generating decoupled filter parameters and implementing a band decoupled filter |
| FI100840B (fi) | 1995-12-12 | 1998-02-27 | Nokia Mobile Phones Ltd | Kohinanvaimennin ja menetelmä taustakohinan vaimentamiseksi kohinaises ta puheesta sekä matkaviestin |
| US5819217A (en) | 1995-12-21 | 1998-10-06 | Nynex Science & Technology, Inc. | Method and system for differentiating between speech and noise |
| US5937060A (en) | 1996-02-09 | 1999-08-10 | Texas Instruments Incorporated | Residual echo suppression |
| US5774562A (en) | 1996-03-25 | 1998-06-30 | Nippon Telegraph And Telephone Corp. | Method and apparatus for dereverberation |
| JP3325770B2 (ja) | 1996-04-26 | 2002-09-17 | 三菱電機株式会社 | ノイズ低減回路及びノイズ低減装置及びノイズ低減方法 |
| US5701350A (en) | 1996-06-03 | 1997-12-23 | Digisonix, Inc. | Active acoustic control in remote regions |
| US5825898A (en) | 1996-06-27 | 1998-10-20 | Lamar Signal Processing Ltd. | System and method for adaptive interference cancelling |
| US5796819A (en) | 1996-07-24 | 1998-08-18 | Ericsson Inc. | Echo canceller for non-linear circuits |
| US5806025A (en) | 1996-08-07 | 1998-09-08 | U S West, Inc. | Method and system for adaptive filtering of speech signals using signal-to-noise ratio to choose subband filter bank |
| US5887032A (en) | 1996-09-03 | 1999-03-23 | Amati Communications Corp. | Method and apparatus for crosstalk cancellation |
| JPH10124088A (ja) | 1996-10-24 | 1998-05-15 | Sony Corp | 音声帯域幅拡張装置及び方法 |
| US5963651A (en) | 1997-01-16 | 1999-10-05 | Digisonix, Inc. | Adaptive acoustic attenuation system having distributed processing and shared state nodal architecture |
| JP3328532B2 (ja) | 1997-01-22 | 2002-09-24 | シャープ株式会社 | デジタルデータの符号化方法 |
| US5933495A (en) | 1997-02-07 | 1999-08-03 | Texas Instruments Incorporated | Subband acoustic noise suppression |
| US6104993A (en) | 1997-02-26 | 2000-08-15 | Motorola, Inc. | Apparatus and method for rate determination in a communication system |
| US6151397A (en) | 1997-05-16 | 2000-11-21 | Motorola, Inc. | Method and system for reducing undesired signals in a communication environment |
| TW392416B (en) | 1997-08-18 | 2000-06-01 | Noise Cancellation Tech | Noise cancellation system for active headsets |
| US6122384A (en) | 1997-09-02 | 2000-09-19 | Qualcomm Inc. | Noise suppression system and method |
| JP4132154B2 (ja) | 1997-10-23 | 2008-08-13 | ソニー株式会社 | 音声合成方法及び装置、並びに帯域幅拡張方法及び装置 |
| US6343267B1 (en) | 1998-04-30 | 2002-01-29 | Matsushita Electric Industrial Co., Ltd. | Dimensionality reduction for speaker normalization and speaker and environment adaptation using eigenvoice techniques |
| US6549586B2 (en) | 1999-04-12 | 2003-04-15 | Telefonaktiebolaget L M Ericsson | System and method for dual microphone signal noise reduction using spectral subtraction |
| US6160265A (en) | 1998-07-13 | 2000-12-12 | Kensington Laboratories, Inc. | SMIF box cover hold down latch and box door latch actuating mechanism |
| US6240386B1 (en) | 1998-08-24 | 2001-05-29 | Conexant Systems, Inc. | Speech codec employing noise classification for noise compensation |
| US6539355B1 (en) | 1998-10-15 | 2003-03-25 | Sony Corporation | Signal band expanding method and apparatus and signal synthesis method and apparatus |
| US6011501A (en) | 1998-12-31 | 2000-01-04 | Cirrus Logic, Inc. | Circuits, systems and methods for processing data in a one-bit format |
| US6453287B1 (en) | 1999-02-04 | 2002-09-17 | Georgia-Tech Research Corporation | Apparatus and quality enhancement algorithm for mixed excitation linear predictive (MELP) and other speech coders |
| US6381570B2 (en) | 1999-02-12 | 2002-04-30 | Telogy Networks, Inc. | Adaptive two-threshold method for discriminating noise from speech in a communication signal |
| US6377915B1 (en) | 1999-03-17 | 2002-04-23 | Yrp Advanced Mobile Communication Systems Research Laboratories Co., Ltd. | Speech decoding using mix ratio table |
| SE514948C2 (sv) | 1999-03-29 | 2001-05-21 | Ericsson Telefon Ab L M | Förfarande och anordning för att reducera överhörning |
| US6490556B2 (en) | 1999-05-28 | 2002-12-03 | Intel Corporation | Audio classifier for half duplex communication |
| US20010044719A1 (en) | 1999-07-02 | 2001-11-22 | Mitsubishi Electric Research Laboratories, Inc. | Method and system for recognizing, indexing, and searching acoustic signals |
| US6453284B1 (en) | 1999-07-26 | 2002-09-17 | Texas Tech University Health Sciences Center | Multiple voice tracking system and method |
| US6480610B1 (en) | 1999-09-21 | 2002-11-12 | Sonic Innovations, Inc. | Subband acoustic feedback cancellation in hearing aids |
| US7054809B1 (en) | 1999-09-22 | 2006-05-30 | Mindspeed Technologies, Inc. | Rate selection method for selectable mode vocoder |
| US6326912B1 (en) | 1999-09-24 | 2001-12-04 | Akm Semiconductor, Inc. | Analog-to-digital conversion using a multi-bit analog delta-sigma modulator combined with a one-bit digital delta-sigma modulator |
| US6594367B1 (en) | 1999-10-25 | 2003-07-15 | Andrea Electronics Corporation | Super directional beamforming design and implementation |
| US6473733B1 (en) | 1999-12-01 | 2002-10-29 | Research In Motion Limited | Signal enhancement for voice coding |
| TW510143B (en) | 1999-12-03 | 2002-11-11 | Dolby Lab Licensing Corp | Method for deriving at least three audio signals from two input audio signals |
| US6934387B1 (en) | 1999-12-17 | 2005-08-23 | Marvell International Ltd. | Method and apparatus for digital near-end echo/near-end crosstalk cancellation with adaptive correlation |
| GB2357683A (en) | 1999-12-24 | 2001-06-27 | Nokia Mobile Phones Ltd | Voiced/unvoiced determination for speech coding |
| US6757395B1 (en) | 2000-01-12 | 2004-06-29 | Sonic Innovations, Inc. | Noise reduction apparatus and method |
| US7076315B1 (en) | 2000-03-24 | 2006-07-11 | Audience, Inc. | Efficient computation of log-frequency-scale digital filter cascade |
| US20010046304A1 (en) | 2000-04-24 | 2001-11-29 | Rast Rodger H. | System and method for selective control of acoustic isolation in headsets |
| JP2001318694A (ja) | 2000-05-10 | 2001-11-16 | Toshiba Corp | 信号処理装置、信号処理方法および記録媒体 |
| US7346176B1 (en) | 2000-05-11 | 2008-03-18 | Plantronics, Inc. | Auto-adjust noise canceling microphone with position sensor |
| US6377637B1 (en) | 2000-07-12 | 2002-04-23 | Andrea Electronics Corporation | Sub-band exponential smoothing noise canceling system |
| US20070233479A1 (en) | 2002-05-30 | 2007-10-04 | Burnett Gregory C | Detecting voiced and unvoiced speech using both acoustic and nonacoustic sensors |
| US8254617B2 (en) | 2003-03-27 | 2012-08-28 | Aliphcom, Inc. | Microphone array with rear venting |
| US6782253B1 (en) | 2000-08-10 | 2004-08-24 | Koninklijke Philips Electronics N.V. | Mobile micro portal |
| ATE318465T1 (de) | 2000-08-11 | 2006-03-15 | Koninkl Philips Electronics Nv | Verfahren und anordnung zur synchronisierung eines sigma-delta-modulators |
| JP3566197B2 (ja) | 2000-08-31 | 2004-09-15 | 松下電器産業株式会社 | 雑音抑圧装置及び雑音抑圧方法 |
| US6804203B1 (en) | 2000-09-15 | 2004-10-12 | Mindspeed Technologies, Inc. | Double talk detector for echo cancellation in a speech communication system |
| US6859508B1 (en) | 2000-09-28 | 2005-02-22 | Nec Electronics America, Inc. | Four dimensional equalizer and far-end cross talk canceler in Gigabit Ethernet signals |
| US7472059B2 (en) | 2000-12-08 | 2008-12-30 | Qualcomm Incorporated | Method and apparatus for robust speech classification |
| US20020128839A1 (en) | 2001-01-12 | 2002-09-12 | Ulf Lindgren | Speech bandwidth extension |
| US20020097884A1 (en) | 2001-01-25 | 2002-07-25 | Cairns Douglas A. | Variable noise reduction algorithm based on vehicle conditions |
| US6990196B2 (en) | 2001-02-06 | 2006-01-24 | The Board Of Trustees Of The Leland Stanford Junior University | Crosstalk identification in xDSL systems |
| US7617099B2 (en) | 2001-02-12 | 2009-11-10 | FortMedia Inc. | Noise suppression by two-channel tandem spectrum modification for speech signal in an automobile |
| US7006636B2 (en) | 2002-05-24 | 2006-02-28 | Agere Systems Inc. | Coherence-based audio coding and synthesis |
| WO2002093561A1 (de) | 2001-05-11 | 2002-11-21 | Siemens Aktiengesellschaft | Verfahren zur erweiterung der bandbreite eines schmalbandig gefilterten sprachsignals, insbesondere eines von einem telekommunikationsgerät gesendeten sprachsignals |
| US6675164B2 (en) | 2001-06-08 | 2004-01-06 | The Regents Of The University Of California | Parallel object-oriented data mining system |
| CN1326415C (zh) | 2001-06-26 | 2007-07-11 | 诺基亚公司 | 对音频信号进行代码变换的方法、码变换器、网元、无线通信网和通信系统 |
| US6876859B2 (en) | 2001-07-18 | 2005-04-05 | Trueposition, Inc. | Method for estimating TDOA and FDOA in a wireless location system |
| CA2354808A1 (en) | 2001-08-07 | 2003-02-07 | King Tam | Sub-band adaptive signal processing in an oversampled filterbank |
| US6895375B2 (en) | 2001-10-04 | 2005-05-17 | At&T Corp. | System for bandwidth extension of Narrow-band speech |
| US6988066B2 (en) | 2001-10-04 | 2006-01-17 | At&T Corp. | Method of bandwidth extension for narrow-band speech |
| JP3858668B2 (ja) * | 2001-11-05 | 2006-12-20 | 日本電気株式会社 | ノイズ除去方法及び装置 |
| DE60202881T2 (de) | 2001-11-29 | 2006-01-19 | Coding Technologies Ab | Wiederherstellung von hochfrequenzkomponenten |
| US7042934B2 (en) | 2002-01-23 | 2006-05-09 | Actelis Networks Inc. | Crosstalk mitigation in a modem pool environment |
| US8098844B2 (en) | 2002-02-05 | 2012-01-17 | Mh Acoustics, Llc | Dual-microphone spatial noise suppression |
| US8942387B2 (en) | 2002-02-05 | 2015-01-27 | Mh Acoustics Llc | Noise-reducing directional microphone array |
| US7171008B2 (en) | 2002-02-05 | 2007-01-30 | Mh Acoustics, Llc | Reducing noise in audio systems |
| US7050783B2 (en) | 2002-02-22 | 2006-05-23 | Kyocera Wireless Corp. | Accessory detection system |
| CA2420989C (en) | 2002-03-08 | 2006-12-05 | Gennum Corporation | Low-noise directional microphone system |
| US7590250B2 (en) | 2002-03-22 | 2009-09-15 | Georgia Tech Research Corporation | Analog audio signal enhancement system using a noise suppression algorithm |
| WO2003083828A1 (en) | 2002-03-27 | 2003-10-09 | Aliphcom | Nicrophone and voice activity detection (vad) configurations for use with communication systems |
| GB2387008A (en) | 2002-03-28 | 2003-10-01 | Qinetiq Ltd | Signal Processing System |
| US7072834B2 (en) | 2002-04-05 | 2006-07-04 | Intel Corporation | Adapting to adverse acoustic environment in speech processing using playback training data |
| US7065486B1 (en) | 2002-04-11 | 2006-06-20 | Mindspeed Technologies, Inc. | Linear prediction based noise suppression |
| US7190665B2 (en) | 2002-04-19 | 2007-03-13 | Texas Instruments Incorporated | Blind crosstalk cancellation for multicarrier modulation |
| CN1647156B (zh) | 2002-04-22 | 2010-05-26 | 皇家飞利浦电子股份有限公司 | 参数编码方法、参数编码器、用于提供音频信号的设备、解码方法、解码器、用于提供解码后的多声道音频信号的设备 |
| US7319959B1 (en) | 2002-05-14 | 2008-01-15 | Audience, Inc. | Multi-source phoneme classification for noise-robust automatic speech recognition |
| US7257231B1 (en) | 2002-06-04 | 2007-08-14 | Creative Technology Ltd. | Stream segregation for stereo signals |
| US7242762B2 (en) | 2002-06-24 | 2007-07-10 | Freescale Semiconductor, Inc. | Monitoring and control of an adaptive filter in a communication system |
| EP1523717A1 (en) | 2002-07-19 | 2005-04-20 | BRITISH TELECOMMUNICATIONS public limited company | Method and system for classification of semantic content of audio/video data |
| CA2399159A1 (en) | 2002-08-16 | 2004-02-16 | Dspfactory Ltd. | Convergence improvement for oversampled subband adaptive filters |
| JP4155774B2 (ja) | 2002-08-28 | 2008-09-24 | 富士通株式会社 | エコー抑制システム及び方法 |
| AU2003252143A1 (en) | 2002-08-29 | 2004-03-19 | Bae Systems Information And Electronic Systems Integration, Inc. | Method for separating interferering signals and computing arrival angles |
| US7574352B2 (en) | 2002-09-06 | 2009-08-11 | Massachusetts Institute Of Technology | 2-D processing of speech |
| US6917688B2 (en) | 2002-09-11 | 2005-07-12 | Nanyang Technological University | Adaptive noise cancelling microphone system |
| US7283956B2 (en) | 2002-09-18 | 2007-10-16 | Motorola, Inc. | Noise suppression |
| US20040114677A1 (en) | 2002-09-27 | 2004-06-17 | Ehud Langberg | Method and system for reducing interferences due to handshake tones |
| US7657427B2 (en) | 2002-10-11 | 2010-02-02 | Nokia Corporation | Methods and devices for source controlled variable bit-rate wideband speech coding |
| US7003099B1 (en) | 2002-11-15 | 2006-02-21 | Fortmedia, Inc. | Small array microphone for acoustic echo cancellation and noise suppression |
| US20040105550A1 (en) | 2002-12-03 | 2004-06-03 | Aylward J. Richard | Directional electroacoustical transducing |
| US7359504B1 (en) | 2002-12-03 | 2008-04-15 | Plantronics, Inc. | Method and apparatus for reducing echo and noise |
| US7162420B2 (en) | 2002-12-10 | 2007-01-09 | Liberato Technologies, Llc | System and method for noise reduction having first and second adaptive filters |
| US20060160581A1 (en) | 2002-12-20 | 2006-07-20 | Christopher Beaugeant | Echo suppression for compressed speech with only partial transcoding of the uplink user data stream |
| KR100477699B1 (ko) | 2003-01-15 | 2005-03-18 | 삼성전자주식회사 | 양자화 잡음 분포 조절 방법 및 장치 |
| GB2397990A (en) | 2003-01-31 | 2004-08-04 | Mitel Networks Corp | Echo cancellation/suppression and double-talk detection in communication paths |
| US7949522B2 (en) | 2003-02-21 | 2011-05-24 | Qnx Software Systems Co. | System for suppressing rain noise |
| US7895036B2 (en) | 2003-02-21 | 2011-02-22 | Qnx Software Systems Co. | System for suppressing wind noise |
| GB2398913B (en) | 2003-02-27 | 2005-08-17 | Motorola Inc | Noise estimation in speech recognition |
| WO2004084182A1 (en) | 2003-03-15 | 2004-09-30 | Mindspeed Technologies, Inc. | Decomposition of voiced speech for celp speech coding |
| EP1473964A3 (en) | 2003-05-02 | 2006-08-09 | Samsung Electronics Co., Ltd. | Microphone array, method to process signals from this microphone array and speech recognition method and system using the same |
| US7577084B2 (en) | 2003-05-03 | 2009-08-18 | Ikanos Communications Inc. | ISDN crosstalk cancellation in a DSL system |
| GB2401744B (en) | 2003-05-14 | 2006-02-15 | Ultra Electronics Ltd | An adaptive control unit with feedback compensation |
| JP4212591B2 (ja) | 2003-06-30 | 2009-01-21 | 富士通株式会社 | オーディオ符号化装置 |
| ATE487332T1 (de) | 2003-07-11 | 2010-11-15 | Cochlear Ltd | Verfahren und einrichtung zur rauschverminderung |
| US7289554B2 (en) | 2003-07-15 | 2007-10-30 | Brooktree Broadband Holding, Inc. | Method and apparatus for channel equalization and cyclostationary interference rejection for ADSL-DMT modems |
| JP2007502054A (ja) | 2003-08-07 | 2007-02-01 | ケラン インコーポレイテッド | クロストークキャンセルのための方法とシステム |
| US7245767B2 (en) | 2003-08-21 | 2007-07-17 | Hewlett-Packard Development Company, L.P. | Method and apparatus for object identification, classification or verification |
| US7516067B2 (en) | 2003-08-25 | 2009-04-07 | Microsoft Corporation | Method and apparatus using harmonic-model-based front end for robust speech recognition |
| US7099821B2 (en) | 2003-09-12 | 2006-08-29 | Softmax, Inc. | Separation of target acoustic signals in a multi-transducer arrangement |
| CA2452945C (en) | 2003-09-23 | 2016-05-10 | Mcmaster University | Binaural adaptive hearing system |
| US20050075866A1 (en) | 2003-10-06 | 2005-04-07 | Bernard Widrow | Speech enhancement in the presence of background noise |
| US7461003B1 (en) | 2003-10-22 | 2008-12-02 | Tellabs Operations, Inc. | Methods and apparatus for improving the quality of speech signals |
| AU2003274864A1 (en) | 2003-10-24 | 2005-05-11 | Nokia Corpration | Noise-dependent postfiltering |
| US7672693B2 (en) | 2003-11-10 | 2010-03-02 | Nokia Corporation | Controlling method, secondary unit and radio terminal equipment |
| US7725314B2 (en) | 2004-02-16 | 2010-05-25 | Microsoft Corporation | Method and apparatus for constructing a speech filter using estimates of clean speech and noise |
| EP1719114A2 (en) | 2004-02-18 | 2006-11-08 | Philips Intellectual Property & Standards GmbH | Method and system for generating training data for an automatic speech recogniser |
| DE602004004242T2 (de) | 2004-03-19 | 2008-06-05 | Harman Becker Automotive Systems Gmbh | System und Verfahren zur Verbesserung eines Audiosignals |
| CN1947171B (zh) | 2004-04-28 | 2011-05-04 | 皇家飞利浦电子股份有限公司 | 自适应波束形成器、旁瓣抑制器、自动语音通信设备 |
| US8712768B2 (en) | 2004-05-25 | 2014-04-29 | Nokia Corporation | System and method for enhanced artificial bandwidth expansion |
| US7254535B2 (en) | 2004-06-30 | 2007-08-07 | Motorola, Inc. | Method and apparatus for equalizing a speech signal generated within a pressurized air delivery system |
| US7383179B2 (en) | 2004-09-28 | 2008-06-03 | Clarity Technologies, Inc. | Method of cascading noise reduction algorithms to avoid speech distortion |
| US20060089836A1 (en) | 2004-10-21 | 2006-04-27 | Motorola, Inc. | System and method of signal pre-conditioning with adaptive spectral tilt compensation for audio equalization |
| US8170879B2 (en) | 2004-10-26 | 2012-05-01 | Qnx Software Systems Limited | Periodic signal enhancement system |
| US7469155B2 (en) | 2004-11-29 | 2008-12-23 | Cisco Technology, Inc. | Handheld communications device with automatic alert mode selection |
| GB2422237A (en) | 2004-12-21 | 2006-07-19 | Fluency Voice Technology Ltd | Dynamic coefficients determined from temporally adjacent speech frames |
| US8873768B2 (en) | 2004-12-23 | 2014-10-28 | Motorola Mobility Llc | Method and apparatus for audio signal enhancement |
| US7561627B2 (en) | 2005-01-06 | 2009-07-14 | Marvell World Trade Ltd. | Method and system for channel equalization and crosstalk estimation in a multicarrier data transmission system |
| US8170221B2 (en) | 2005-03-21 | 2012-05-01 | Harman Becker Automotive Systems Gmbh | Audio enhancement system and method |
| BRPI0608270A2 (pt) | 2005-04-01 | 2009-10-06 | Qualcomm Inc | sistemas, métodos e equipamento para filtragem anti-dispersão |
| US8249861B2 (en) | 2005-04-20 | 2012-08-21 | Qnx Software Systems Limited | High frequency compression integration |
| US7813931B2 (en) | 2005-04-20 | 2010-10-12 | QNX Software Systems, Co. | System for improving speech quality and intelligibility with bandwidth compression/expansion |
| US8280730B2 (en) | 2005-05-25 | 2012-10-02 | Motorola Mobility Llc | Method and apparatus of increasing speech intelligibility in noisy environments |
| US8311819B2 (en) | 2005-06-15 | 2012-11-13 | Qnx Software Systems Limited | System for detecting speech with background voice estimates and noise estimates |
| US20070005351A1 (en) | 2005-06-30 | 2007-01-04 | Sathyendra Harsha M | Method and system for bandwidth expansion for voice communications |
| EP1923866B1 (en) | 2005-08-11 | 2014-01-01 | Asahi Kasei Kabushiki Kaisha | Sound source separating device, speech recognizing device, portable telephone, sound source separating method, and program |
| KR101116363B1 (ko) | 2005-08-11 | 2012-03-09 | 삼성전자주식회사 | 음성신호 분류방법 및 장치, 및 이를 이용한 음성신호부호화방법 및 장치 |
| US20070041589A1 (en) | 2005-08-17 | 2007-02-22 | Gennum Corporation | System and method for providing environmental specific noise reduction algorithms |
| US8326614B2 (en) | 2005-09-02 | 2012-12-04 | Qnx Software Systems Limited | Speech enhancement system |
| DK1760696T3 (en) | 2005-09-03 | 2016-05-02 | Gn Resound As | Method and apparatus for improved estimation of non-stationary noise to highlight speech |
| US20070053522A1 (en) | 2005-09-08 | 2007-03-08 | Murray Daniel J | Method and apparatus for directional enhancement of speech elements in noisy environments |
| CA2621940C (en) | 2005-09-09 | 2014-07-29 | Mcmaster University | Method and device for binaural signal enhancement |
| JP4742226B2 (ja) | 2005-09-28 | 2011-08-10 | 国立大学法人九州大学 | 能動消音制御装置及び方法 |
| EP1772855B1 (en) | 2005-10-07 | 2013-09-18 | Nuance Communications, Inc. | Method for extending the spectral bandwidth of a speech signal |
| US7813923B2 (en) | 2005-10-14 | 2010-10-12 | Microsoft Corporation | Calibration based beamforming, non-linear adaptive filtering, and multi-sensor headset |
| WO2007049644A1 (ja) | 2005-10-26 | 2007-05-03 | Nec Corporation | エコー抑圧方法及び装置 |
| US7546237B2 (en) | 2005-12-23 | 2009-06-09 | Qnx Software Systems (Wavemakers), Inc. | Bandwidth extension of narrowband speech |
| US8345890B2 (en) | 2006-01-05 | 2013-01-01 | Audience, Inc. | System and method for utilizing inter-microphone level differences for speech enhancement |
| US8032369B2 (en) | 2006-01-20 | 2011-10-04 | Qualcomm Incorporated | Arbitrary average data rates for variable rate coders |
| US8194880B2 (en) | 2006-01-30 | 2012-06-05 | Audience, Inc. | System and method for utilizing omni-directional microphones for speech enhancement |
| EP1827002A1 (en) | 2006-02-22 | 2007-08-29 | Alcatel Lucent | Method of controlling an adaptation of a filter |
| CN101385386B (zh) | 2006-03-03 | 2012-05-09 | 日本电信电话株式会社 | 混响除去装置和混响除去方法 |
| US7555075B2 (en) | 2006-04-07 | 2009-06-30 | Freescale Semiconductor, Inc. | Adjustable noise suppression system |
| US8180067B2 (en) | 2006-04-28 | 2012-05-15 | Harman International Industries, Incorporated | System for selectively extracting components of an audio input signal |
| US8949120B1 (en) | 2006-05-25 | 2015-02-03 | Audience, Inc. | Adaptive noise cancelation |
| US8150065B2 (en) | 2006-05-25 | 2012-04-03 | Audience, Inc. | System and method for processing an audio signal |
| US8934641B2 (en) | 2006-05-25 | 2015-01-13 | Audience, Inc. | Systems and methods for reconstructing decomposed audio signals |
| US8204253B1 (en) | 2008-06-30 | 2012-06-19 | Audience, Inc. | Self calibration of audio device |
| US20070299655A1 (en) | 2006-06-22 | 2007-12-27 | Nokia Corporation | Method, Apparatus and Computer Program Product for Providing Low Frequency Expansion of Speech |
| JP5249207B2 (ja) | 2006-06-23 | 2013-07-31 | ジーエヌ リザウンド エー/エス | 適応性のある方向性信号処理を備えた補聴装置 |
| JP5435204B2 (ja) | 2006-07-03 | 2014-03-05 | 日本電気株式会社 | 雑音抑圧の方法、装置、及びプログラム |
| JP4836720B2 (ja) * | 2006-09-07 | 2011-12-14 | 株式会社東芝 | ノイズサプレス装置 |
| US7587056B2 (en) | 2006-09-14 | 2009-09-08 | Fortemedia, Inc. | Small array microphone apparatus and noise suppression methods thereof |
| KR101061132B1 (ko) | 2006-09-14 | 2011-08-31 | 엘지전자 주식회사 | 다이알로그 증폭 기술 |
| DE102006051071B4 (de) * | 2006-10-30 | 2010-12-16 | Siemens Audiologische Technik Gmbh | Pegelabhängige Geräuschreduktion |
| CN101197592B (zh) | 2006-12-07 | 2011-09-14 | 华为技术有限公司 | 远端串扰抵消方法、装置及信号发送装置和信号处理系统 |
| CN101197798B (zh) | 2006-12-07 | 2011-11-02 | 华为技术有限公司 | 信号处理系统、芯片、外接卡、滤波、收发装置及方法 |
| EP1933303B1 (en) | 2006-12-14 | 2008-08-06 | Harman/Becker Automotive Systems GmbH | Speech dialog control based on signal pre-processing |
| US20080152157A1 (en) | 2006-12-21 | 2008-06-26 | Vimicro Corporation | Method and system for eliminating noises in voice signals |
| US7783478B2 (en) | 2007-01-03 | 2010-08-24 | Alexander Goldin | Two stage frequency subband decomposition |
| US7986794B2 (en) | 2007-01-11 | 2011-07-26 | Fortemedia, Inc. | Small array microphone apparatus and beam forming method thereof |
| TWI465121B (zh) | 2007-01-29 | 2014-12-11 | Audience Inc | 利用全方向麥克風改善通話的系統及方法 |
| US8103011B2 (en) | 2007-01-31 | 2012-01-24 | Microsoft Corporation | Signal detection using multiple detectors |
| JP5401760B2 (ja) | 2007-02-05 | 2014-01-29 | ソニー株式会社 | ヘッドフォン装置、音声再生システム、音声再生方法 |
| JP4882773B2 (ja) | 2007-02-05 | 2012-02-22 | ソニー株式会社 | 信号処理装置、信号処理方法 |
| US8060363B2 (en) | 2007-02-13 | 2011-11-15 | Nokia Corporation | Audio signal encoding |
| EP1962559A1 (en) | 2007-02-21 | 2008-08-27 | Harman Becker Automotive Systems GmbH | Objective quantification of auditory source width of a loudspeakers-room system |
| US8195454B2 (en) | 2007-02-26 | 2012-06-05 | Dolby Laboratories Licensing Corporation | Speech enhancement in entertainment audio |
| US20080208575A1 (en) | 2007-02-27 | 2008-08-28 | Nokia Corporation | Split-band encoding and decoding of an audio signal |
| US7925502B2 (en) | 2007-03-01 | 2011-04-12 | Microsoft Corporation | Pitch model for noise estimation |
| KR100905585B1 (ko) | 2007-03-02 | 2009-07-02 | 삼성전자주식회사 | 음성신호의 대역폭 확장 제어 방법 및 장치 |
| US7912567B2 (en) | 2007-03-07 | 2011-03-22 | Audiocodes Ltd. | Noise suppressor |
| EP1970900A1 (en) | 2007-03-14 | 2008-09-17 | Harman Becker Automotive Systems GmbH | Method and apparatus for providing a codebook for bandwidth extension of an acoustic signal |
| CN101266797B (zh) | 2007-03-16 | 2011-06-01 | 展讯通信(上海)有限公司 | 语音信号后处理滤波方法 |
| JP5260561B2 (ja) | 2007-03-19 | 2013-08-14 | ドルビー ラボラトリーズ ライセンシング コーポレイション | 知覚モデルを使用した音声の強調 |
| US8005238B2 (en) | 2007-03-22 | 2011-08-23 | Microsoft Corporation | Robust adaptive beamforming with enhanced noise suppression |
| US7873114B2 (en) | 2007-03-29 | 2011-01-18 | Motorola Mobility, Inc. | Method and apparatus for quickly detecting a presence of abrupt noise and updating a noise estimate |
| US8180062B2 (en) | 2007-05-30 | 2012-05-15 | Nokia Corporation | Spatial sound zooming |
| US8982744B2 (en) | 2007-06-06 | 2015-03-17 | Broadcom Corporation | Method and system for a subband acoustic echo canceller with integrated voice activity detection |
| JP4455614B2 (ja) | 2007-06-13 | 2010-04-21 | 株式会社東芝 | 音響信号処理方法及び装置 |
| US8428275B2 (en) | 2007-06-22 | 2013-04-23 | Sanyo Electric Co., Ltd. | Wind noise reduction device |
| US8140331B2 (en) | 2007-07-06 | 2012-03-20 | Xia Lou | Feature extraction for identification and classification of audio signals |
| US20090012786A1 (en) | 2007-07-06 | 2009-01-08 | Texas Instruments Incorporated | Adaptive Noise Cancellation |
| US7817808B2 (en) | 2007-07-19 | 2010-10-19 | Alon Konchitsky | Dual adaptive structure for speech enhancement |
| US8189766B1 (en) | 2007-07-26 | 2012-05-29 | Audience, Inc. | System and method for blind subband acoustic echo cancellation postfiltering |
| US7856353B2 (en) | 2007-08-07 | 2010-12-21 | Nuance Communications, Inc. | Method for processing speech signal data with reverberation filtering |
| US20090043577A1 (en) | 2007-08-10 | 2009-02-12 | Ditech Networks, Inc. | Signal presence detection using bi-directional communication data |
| EP2026597B1 (en) | 2007-08-13 | 2009-11-11 | Harman Becker Automotive Systems GmbH | Noise reduction by combined beamforming and post-filtering |
| US8032365B2 (en) | 2007-08-31 | 2011-10-04 | Tellabs Operations, Inc. | Method and apparatus for controlling echo in the coded domain |
| WO2009035614A1 (en) | 2007-09-12 | 2009-03-19 | Dolby Laboratories Licensing Corporation | Speech enhancement with voice clarity |
| ATE501506T1 (de) | 2007-09-12 | 2011-03-15 | Dolby Lab Licensing Corp | Spracherweiterung mit anpassung von geräuschpegelschätzungen |
| US8073125B2 (en) | 2007-09-25 | 2011-12-06 | Microsoft Corporation | Spatial audio conferencing |
| US8954324B2 (en) | 2007-09-28 | 2015-02-10 | Qualcomm Incorporated | Multiple microphone voice activity detector |
| DE602007008429D1 (de) | 2007-10-01 | 2010-09-23 | Harman Becker Automotive Sys | Effiziente Audiosignalverarbeitung im Subbandbereich, Verfahren, Vorrichtung und dazugehöriges Computerprogramm |
| JP4339929B2 (ja) | 2007-10-01 | 2009-10-07 | パナソニック株式会社 | 音源方向検知装置 |
| US8107631B2 (en) | 2007-10-04 | 2012-01-31 | Creative Technology Ltd | Correlation-based method for ambience extraction from two-channel audio signals |
| US20090095804A1 (en) | 2007-10-12 | 2009-04-16 | Sony Ericsson Mobile Communications Ab | Rfid for connected accessory identification and method |
| US8046219B2 (en) * | 2007-10-18 | 2011-10-25 | Motorola Mobility, Inc. | Robust two microphone noise suppression system |
| JPWO2009051197A1 (ja) | 2007-10-19 | 2011-03-03 | 日本電気株式会社 | エコー抑圧方法及び装置 |
| US8606566B2 (en) | 2007-10-24 | 2013-12-10 | Qnx Software Systems Limited | Speech enhancement through partial speech reconstruction |
| EP2058803B1 (en) | 2007-10-29 | 2010-01-20 | Harman/Becker Automotive Systems GmbH | Partial speech reconstruction |
| EP2058804B1 (en) | 2007-10-31 | 2016-12-14 | Nuance Communications, Inc. | Method for dereverberation of an acoustic signal and system thereof |
| EP2058797B1 (en) | 2007-11-12 | 2011-05-04 | Harman Becker Automotive Systems GmbH | Discrimination between foreground speech and background noise |
| KR101444100B1 (ko) | 2007-11-15 | 2014-09-26 | 삼성전자주식회사 | 혼합 사운드로부터 잡음을 제거하는 방법 및 장치 |
| US20090150144A1 (en) | 2007-12-10 | 2009-06-11 | Qnx Software Systems (Wavemakers), Inc. | Robust voice detector for receive-side automatic gain control |
| US8175291B2 (en) | 2007-12-19 | 2012-05-08 | Qualcomm Incorporated | Systems, methods, and apparatus for multi-microphone based speech enhancement |
| EP2232704A4 (en) | 2007-12-20 | 2010-12-01 | Ericsson Telefon Ab L M | APPARATUS AND METHOD FOR NOISE SUPPRESSION |
| US8143620B1 (en) * | 2007-12-21 | 2012-03-27 | Audience, Inc. | System and method for adaptive classification of audio sources |
| GB0800891D0 (en) | 2008-01-17 | 2008-02-27 | Cambridge Silicon Radio Ltd | Method and apparatus for cross-talk cancellation |
| US8554551B2 (en) | 2008-01-28 | 2013-10-08 | Qualcomm Incorporated | Systems, methods, and apparatus for context replacement by audio level |
| US8223988B2 (en) | 2008-01-29 | 2012-07-17 | Qualcomm Incorporated | Enhanced blind source separation algorithm for highly correlated mixtures |
| DE102008039330A1 (de) | 2008-01-31 | 2009-08-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Berechnen von Filterkoeffizienten zur Echounterdrückung |
| US20090220197A1 (en) | 2008-02-22 | 2009-09-03 | Jeffrey Gniadek | Apparatus and fiber optic cable retention system including same |
| US8355511B2 (en) | 2008-03-18 | 2013-01-15 | Audience, Inc. | System and method for envelope-based acoustic echo cancellation |
| US8374854B2 (en) | 2008-03-28 | 2013-02-12 | Southern Methodist University | Spatio-temporal speech enhancement technique based on generalized eigenvalue decomposition |
| US20090248411A1 (en) | 2008-03-28 | 2009-10-01 | Alon Konchitsky | Front-End Noise Reduction for Speech Recognition Engine |
| US8275136B2 (en) | 2008-04-25 | 2012-09-25 | Nokia Corporation | Electronic device speech enhancement |
| US8131541B2 (en) | 2008-04-25 | 2012-03-06 | Cambridge Silicon Radio Limited | Two microphone noise reduction system |
| US9197181B2 (en) | 2008-05-12 | 2015-11-24 | Broadcom Corporation | Loudness enhancement system and method |
| US8831936B2 (en) | 2008-05-29 | 2014-09-09 | Qualcomm Incorporated | Systems, methods, apparatus, and computer program products for speech signal processing using spectral contrast enhancement |
| US20090315708A1 (en) | 2008-06-19 | 2009-12-24 | John Walley | Method and system for limiting audio output in audio headsets |
| US9253568B2 (en) | 2008-07-25 | 2016-02-02 | Broadcom Corporation | Single-microphone wind noise suppression |
| US20100027799A1 (en) | 2008-07-31 | 2010-02-04 | Sony Ericsson Mobile Communications Ab | Asymmetrical delay audio crosstalk cancellation systems, methods and electronic devices including the same |
| EP2151822B8 (en) | 2008-08-05 | 2018-10-24 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for processing an audio signal for speech enhancement using a feature extraction |
| US8923529B2 (en) | 2008-08-29 | 2014-12-30 | Biamp Systems Corporation | Microphone array system and method for sound acquisition |
| US8392181B2 (en) | 2008-09-10 | 2013-03-05 | Texas Instruments Incorporated | Subtraction of a shaped component of a noise reduction spectrum from a combined signal |
| ATE552690T1 (de) | 2008-09-19 | 2012-04-15 | Dolby Lab Licensing Corp | Upstream-signalverarbeitung für client- einrichtungen in einem drahtlosen kleinzellen- netz |
| US8583048B2 (en) | 2008-09-25 | 2013-11-12 | Skyphy Networks Limited | Multi-hop wireless systems having noise reduction and bandwidth expansion capabilities and the methods of the same |
| US20100082339A1 (en) | 2008-09-30 | 2010-04-01 | Alon Konchitsky | Wind Noise Reduction |
| US20100094622A1 (en) | 2008-10-10 | 2010-04-15 | Nexidia Inc. | Feature normalization for speech and audio processing |
| US8218397B2 (en) | 2008-10-24 | 2012-07-10 | Qualcomm Incorporated | Audio source proximity estimation using sensor array for noise reduction |
| US8724829B2 (en) | 2008-10-24 | 2014-05-13 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for coherence detection |
| US8111843B2 (en) | 2008-11-11 | 2012-02-07 | Motorola Solutions, Inc. | Compensation for nonuniform delayed group communications |
| US8243952B2 (en) | 2008-12-22 | 2012-08-14 | Conexant Systems, Inc. | Microphone array calibration method and apparatus |
| DK2211339T3 (en) | 2009-01-23 | 2017-08-28 | Oticon As | listening System |
| JP4892021B2 (ja) | 2009-02-26 | 2012-03-07 | 株式会社東芝 | 信号帯域拡張装置 |
| JP5127754B2 (ja) * | 2009-03-24 | 2013-01-23 | 株式会社東芝 | 信号処理装置 |
| US8359195B2 (en) * | 2009-03-26 | 2013-01-22 | LI Creative Technologies, Inc. | Method and apparatus for processing audio and speech signals |
| US8320852B2 (en) * | 2009-04-21 | 2012-11-27 | Samsung Electronic Co., Ltd. | Method and apparatus to transmit signals in a communication system |
| US8144890B2 (en) | 2009-04-28 | 2012-03-27 | Bose Corporation | ANR settings boot loading |
| US8184822B2 (en) | 2009-04-28 | 2012-05-22 | Bose Corporation | ANR signal processing topology |
| US8611553B2 (en) | 2010-03-30 | 2013-12-17 | Bose Corporation | ANR instability detection |
| US8071869B2 (en) | 2009-05-06 | 2011-12-06 | Gracenote, Inc. | Apparatus and method for determining a prominent tempo of an audio work |
| JP5169986B2 (ja) | 2009-05-13 | 2013-03-27 | 沖電気工業株式会社 | 電話装置、エコーキャンセラ及びエコーキャンセルプログラム |
| US8160265B2 (en) | 2009-05-18 | 2012-04-17 | Sony Computer Entertainment Inc. | Method and apparatus for enhancing the generation of three-dimensional sound in headphone devices |
| US8737636B2 (en) | 2009-07-10 | 2014-05-27 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for adaptive active noise cancellation |
| US7769187B1 (en) | 2009-07-14 | 2010-08-03 | Apple Inc. | Communications circuits for electronic devices and accessories |
| US8571231B2 (en) | 2009-10-01 | 2013-10-29 | Qualcomm Incorporated | Suppressing noise in an audio signal |
| US20110099010A1 (en) | 2009-10-22 | 2011-04-28 | Broadcom Corporation | Multi-channel noise suppression system |
| US8244927B2 (en) | 2009-10-27 | 2012-08-14 | Fairchild Semiconductor Corporation | Method of detecting accessories on an audio jack |
| US8340278B2 (en) | 2009-11-20 | 2012-12-25 | Texas Instruments Incorporated | Method and apparatus for cross-talk resistant adaptive noise canceller |
| US8526628B1 (en) | 2009-12-14 | 2013-09-03 | Audience, Inc. | Low latency active noise cancellation system |
| US8848935B1 (en) | 2009-12-14 | 2014-09-30 | Audience, Inc. | Low latency active noise cancellation system |
| US8385559B2 (en) | 2009-12-30 | 2013-02-26 | Robert Bosch Gmbh | Adaptive digital noise canceller |
| US9008329B1 (en) | 2010-01-26 | 2015-04-14 | Audience, Inc. | Noise reduction using multi-feature cluster tracker |
| US8718290B2 (en) | 2010-01-26 | 2014-05-06 | Audience, Inc. | Adaptive noise reduction using level cues |
| FI20125814A7 (fi) | 2010-01-26 | 2012-10-26 | Audience Inc | Adaptiivinen kohinanpoisto tasomerkkien avulla |
| US8700391B1 (en) | 2010-04-01 | 2014-04-15 | Audience, Inc. | Low complexity bandwidth expansion of speech |
| EP2556608A4 (en) | 2010-04-09 | 2017-01-25 | DTS, Inc. | Adaptive environmental noise compensation for audio playback |
| US8958572B1 (en) | 2010-04-19 | 2015-02-17 | Audience, Inc. | Adaptive noise cancellation for multi-microphone systems |
| US8606571B1 (en) | 2010-04-19 | 2013-12-10 | Audience, Inc. | Spatial selectivity noise reduction tradeoff for multi-microphone systems |
| US8473287B2 (en) | 2010-04-19 | 2013-06-25 | Audience, Inc. | Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system |
| US8538035B2 (en) | 2010-04-29 | 2013-09-17 | Audience, Inc. | Multi-microphone robust noise suppression |
| US8781137B1 (en) | 2010-04-27 | 2014-07-15 | Audience, Inc. | Wind noise detection and suppression |
| FI20126106A7 (fi) | 2010-04-29 | 2013-01-29 | Audience Inc | Voimakas monen mikrofonin kohinanvaimennus |
| US9245538B1 (en) | 2010-05-20 | 2016-01-26 | Audience, Inc. | Bandwidth enhancement of speech signals assisted by noise reduction |
| US8447595B2 (en) | 2010-06-03 | 2013-05-21 | Apple Inc. | Echo-related decisions on automatic gain control of uplink speech signal in a communications device |
| US8515089B2 (en) | 2010-06-04 | 2013-08-20 | Apple Inc. | Active noise cancellation decisions in a portable audio device |
| US8447596B2 (en) | 2010-07-12 | 2013-05-21 | Audience, Inc. | Monaural noise suppression based on computational auditory scene analysis |
| FI20135038A7 (fi) | 2010-07-12 | 2013-04-12 | Audience Inc | Monauraalinen kohinanvaimennus laskennallisen kuulomaisema-analyysin perusteella |
| US8719475B2 (en) | 2010-07-13 | 2014-05-06 | Broadcom Corporation | Method and system for utilizing low power superspeed inter-chip (LP-SSIC) communications |
| US8761410B1 (en) | 2010-08-12 | 2014-06-24 | Audience, Inc. | Systems and methods for multi-channel dereverberation |
| US8611552B1 (en) | 2010-08-25 | 2013-12-17 | Audience, Inc. | Direction-aware active noise cancellation system |
| US8447045B1 (en) | 2010-09-07 | 2013-05-21 | Audience, Inc. | Multi-microphone active noise cancellation system |
| US9049532B2 (en) | 2010-10-19 | 2015-06-02 | Electronics And Telecommunications Research Instittute | Apparatus and method for separating sound source |
| US8682006B1 (en) | 2010-10-20 | 2014-03-25 | Audience, Inc. | Noise suppression based on null coherence |
| US8311817B2 (en) | 2010-11-04 | 2012-11-13 | Audience, Inc. | Systems and methods for enhancing voice quality in mobile device |
| CN102486920A (zh) | 2010-12-06 | 2012-06-06 | 索尼公司 | 音频事件检测方法和装置 |
| US9229833B2 (en) | 2011-01-28 | 2016-01-05 | Fairchild Semiconductor Corporation | Successive approximation resistor detection |
| US9107023B2 (en) | 2011-03-18 | 2015-08-11 | Dolby Laboratories Licensing Corporation | N surround |
| US9049281B2 (en) | 2011-03-28 | 2015-06-02 | Conexant Systems, Inc. | Nonlinear echo suppression |
| JP5817366B2 (ja) | 2011-09-12 | 2015-11-18 | 沖電気工業株式会社 | 音声信号処理装置、方法及びプログラム |
| US8737188B1 (en) | 2012-01-11 | 2014-05-27 | Audience, Inc. | Crosstalk cancellation systems and methods |
-
2010
- 2010-07-08 US US12/832,901 patent/US8473287B2/en active Active
-
2011
- 2011-04-14 JP JP2013506188A patent/JP2013525843A/ja active Pending
- 2011-04-14 WO PCT/US2011/032578 patent/WO2011133405A1/en not_active Ceased
- 2011-04-14 KR KR1020127027238A patent/KR20130061673A/ko not_active Ceased
- 2011-04-14 FI FI20126083A patent/FI20126083L/fi not_active Application Discontinuation
- 2011-04-19 TW TW100113589A patent/TW201207845A/zh unknown
-
2012
- 2012-03-19 US US13/424,189 patent/US8473285B2/en active Active
-
2013
- 2013-05-07 US US13/888,796 patent/US9143857B2/en not_active Expired - Fee Related
-
2015
- 2015-09-10 US US14/850,911 patent/US9502048B2/en active Active
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20040047474A1 (en) * | 2002-04-25 | 2004-03-11 | Gn Resound A/S | Fitting methodology and hearing prosthesis based on signal-to-noise ratio loss data |
| US20090323982A1 (en) * | 2006-01-30 | 2009-12-31 | Ludger Solbach | System and method for providing noise suppression utilizing null processing noise subtraction |
| US20090012783A1 (en) * | 2007-07-06 | 2009-01-08 | Audience, Inc. | System and method for adaptive intelligent noise suppression |
| US20090220107A1 (en) * | 2008-02-29 | 2009-09-03 | Audience, Inc. | System and method for providing single microphone noise suppression fallback |
| US20100067710A1 (en) * | 2008-09-15 | 2010-03-18 | Hendriks Richard C | Noise spectrum tracking in noisy acoustical signals |
Cited By (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9830899B1 (en) | 2006-05-25 | 2017-11-28 | Knowles Electronics, Llc | Adaptive noise cancellation |
| US9437180B2 (en) | 2010-01-26 | 2016-09-06 | Knowles Electronics, Llc | Adaptive noise reduction using level cues |
| US8473285B2 (en) | 2010-04-19 | 2013-06-25 | Audience, Inc. | Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system |
| US9143857B2 (en) | 2010-04-19 | 2015-09-22 | Audience, Inc. | Adaptively reducing noise while limiting speech loss distortion |
| US9502048B2 (en) | 2010-04-19 | 2016-11-22 | Knowles Electronics, Llc | Adaptively reducing noise to limit speech distortion |
| US9343056B1 (en) | 2010-04-27 | 2016-05-17 | Knowles Electronics, Llc | Wind noise detection and suppression |
| US9438992B2 (en) | 2010-04-29 | 2016-09-06 | Knowles Electronics, Llc | Multi-microphone robust noise suppression |
| US9431023B2 (en) | 2010-07-12 | 2016-08-30 | Knowles Electronics, Llc | Monaural noise suppression based on computational auditory scene analysis |
| CN104246877A (zh) * | 2012-04-23 | 2014-12-24 | 高通股份有限公司 | 用于音频信号处理的系统和方法 |
| CN104246877B (zh) * | 2012-04-23 | 2017-05-10 | 高通股份有限公司 | 用于音频信号处理的系统和方法 |
Also Published As
| Publication number | Publication date |
|---|---|
| US20160064009A1 (en) | 2016-03-03 |
| US20130251170A1 (en) | 2013-09-26 |
| US9502048B2 (en) | 2016-11-22 |
| US9143857B2 (en) | 2015-09-22 |
| KR20130061673A (ko) | 2013-06-11 |
| FI20126083A7 (fi) | 2012-10-18 |
| JP2013525843A (ja) | 2013-06-20 |
| US20120179461A1 (en) | 2012-07-12 |
| US8473287B2 (en) | 2013-06-25 |
| US8473285B2 (en) | 2013-06-25 |
| US20110257967A1 (en) | 2011-10-20 |
| FI20126083L (fi) | 2012-10-18 |
| TW201207845A (en) | 2012-02-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9502048B2 (en) | Adaptively reducing noise to limit speech distortion | |
| US9438992B2 (en) | Multi-microphone robust noise suppression | |
| US8606571B1 (en) | Spatial selectivity noise reduction tradeoff for multi-microphone systems | |
| TWI463817B (zh) | 可適性智慧雜訊抑制系統及方法 | |
| US8521530B1 (en) | System and method for enhancing a monaural audio signal | |
| US9558755B1 (en) | Noise suppression assisted automatic speech recognition | |
| US9076456B1 (en) | System and method for providing voice equalization | |
| US9185487B2 (en) | System and method for providing noise suppression utilizing null processing noise subtraction | |
| US8447596B2 (en) | Monaural noise suppression based on computational auditory scene analysis | |
| US8143620B1 (en) | System and method for adaptive classification of audio sources | |
| US8958572B1 (en) | Adaptive noise cancellation for multi-microphone systems | |
| US9699554B1 (en) | Adaptive signal equalization | |
| US8761410B1 (en) | Systems and methods for multi-channel dereverberation | |
| US10262673B2 (en) | Soft-talk audio capture for mobile devices | |
| US9343073B1 (en) | Robust noise suppression system in adverse echo conditions | |
| WO2011094232A1 (en) | Adaptive noise reduction using level cues |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 11772464 Country of ref document: EP Kind code of ref document: A1 |
|
| ENP | Entry into the national phase |
Ref document number: 2013506188 Country of ref document: JP Kind code of ref document: A |
|
| ENP | Entry into the national phase |
Ref document number: 20127027238 Country of ref document: KR Kind code of ref document: A |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 20126083 Country of ref document: FI |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 11772464 Country of ref document: EP Kind code of ref document: A1 |