US9576590B2 - Noise adaptive post filtering - Google Patents
Noise adaptive post filtering Download PDFInfo
- Publication number
- US9576590B2 US9576590B2 US14/375,639 US201214375639A US9576590B2 US 9576590 B2 US9576590 B2 US 9576590B2 US 201214375639 A US201214375639 A US 201214375639A US 9576590 B2 US9576590 B2 US 9576590B2
- Authority
- US
- United States
- Prior art keywords
- signal
- noise ratio
- filter
- audio signal
- post
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active, expires
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0364—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/15—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
Definitions
- the present application relates to a noise adaptive post filtering, and in particular, but not exclusively to a noise adaptive post filtering for use in speech or speech like audio.
- AMR adaptive multi-rate
- Post-processing can further be used to overcome quantisation noise generated by low bit rate speech encoders.
- Post-processing can be typically implemented in the form of post-filtering. In other words filtering the decoded speech signal with an adaptive filter in order to reduce the effects of environmental noise and enhancing the perceptual quality of the speech.
- Embodiments attempt to address the above problem.
- a method comprising: estimating a signal to noise ratio value for an audio signal; generating a post-filter comprising at least one of: a first formant frequency filter and a second formant frequency filter, wherein the post-filter is dependent on the signal to noise ratio value for the audio signal.
- the post-filter may be configured to move energy of the audio signal to higher frequencies.
- Generating a post-filter comprising a first formant frequency filter may comprise generating a first formant frequency parameter configured to attenuate the first formant frequency components of the audio signal dependent on the signal to noise ratio value for the audio signal.
- Generating a post-filter formant frequency parameter dependent on the signal to noise ratio value for the audio signal may comprise: comparing the signal to noise ratio value for the audio signal against a first signal to noise ratio threshold value; generating a maximum post-filter first formant frequency parameter value dependent on the signal to noise ratio value for the audio signal being greater than the signal to noise ratio threshold value; and generating a second post-filter first formant frequency parameter value dependent on the signal to noise ratio value for the audio signal being less than the signal to noise ratio threshold value.
- Generating a second post-filter first formant frequency parameter value dependent on the signal to noise ratio value for the audio signal being less than the signal to noise ratio threshold value may comprise: comparing the signal to noise ratio value for the audio signal against a second signal to noise ratio threshold value, wherein the second signal to noise ratio threshold value is lower than the first signal to noise ratio threshold value; setting the second post-filter first formant frequency parameter value to at least one of: a minimum post-filter first formant frequency parameter value when the signal to noise ratio value for the audio signal is equal to or less than the second signal to noise ratio threshold value, and an interpolated value between the minimum post-filter first formant frequency parameter value and the maximum post-filter first formant frequency parameter value when the signal to noise ratio value for the audio signal is greater than the second signal to noise ratio threshold value but less than the first signal to noise ratio threshold value.
- Setting the second post-filter first formant frequency parameter value to an interpolated value may comprise setting to at least one of: a linearly interpolated value; and a non-linearly interpolated value.
- the method may further comprise generating a post-filter neutralization factor dependent on the signal to noise ratio for the audio signal.
- Generating a post-filter neutralization factor dependent on the signal to noise ratio for the audio signal may comprise: comparing the signal to noise ratio value for the audio signal against a first signal to noise ratio threshold value; generating a minimum post-filter neutralization factor dependent on the signal to noise ratio value for the audio signal being less than the signal to noise ratio threshold value; and generating a second post-filter neutralization factor dependent on the signal to noise ratio value for the audio signal being greater than the signal to noise ratio threshold value.
- Generating a second post-filter neutralization factor dependent on the signal to noise ratio value for the audio signal being greater than the signal to noise ratio threshold value may comprise: generating a maximum post-filter neutralization factor when the signal to noise ratio value for the audio signal is greater than a third signal to noise ratio threshold, the third signal to noise ratio threshold being greater than the first signal to noise ratio threshold; and generating an interpolated post-filter neutralization factor between the maximum post-filter neutralization factor and the minimum post-filter neutralization factor when the signal to noise ratio value for the audio signal is greater than the first signal to noise ratio threshold and less than the third signal to noise ratio threshold.
- Generating an interpolated post-filter neutralization factor may comprise generating: a linear interpolation; and a non-linear interpolation.
- Generating a post-filter comprising the second formant frequency filter may comprises generating a formant frequency parameter configured to amplify the second formant frequency component of the audio signal relative to the first formant frequency dependent on the signal to noise ratio value for the audio signal.
- the method may further comprise estimating the second formant frequency.
- the method may further comprise estimating the first formant frequency.
- Estimating a signal to noise ratio value for an audio signal may comprise at least one of: generating a smoothed signal to noise ratio: and low pass filtering an estimated signal to noise ratio over at least two frames of the audio signal.
- An apparatus comprising at least one processor and at least one memory including computer code for one or more programs, the at least one memory and the computer code configured to with the at least one processor to cause the apparatus to at least perform: estimating a signal to noise ratio value for an audio signal; generating a post-filter comprising at least one of: a first formant frequency filter and a second formant frequency filter, wherein the post-filter is dependent on the signal to noise ratio value for the audio signal.
- the post-filter may be configured to move energy of the audio signal to higher frequencies.
- Generating a post-filter comprising a first formant frequency filter may cause the apparatus to perform generating a first formant frequency parameter configured to attenuate the first formant frequency components of the audio signal dependent on the signal to noise ratio value for the audio signal.
- Generating a post-filter formant frequency parameter dependent on the signal to noise ratio value for the audio signal may cause the apparatus to perform: comparing the signal to noise ratio value for the audio signal against a first signal to noise ratio threshold value; generating a maximum post-filter first formant frequency parameter value dependent on the signal to noise ratio value for the audio signal being greater than the signal to noise ratio threshold value; and generating a second post-filter first formant frequency parameter value dependent on the signal to noise ratio value for the audio signal being less than the signal to noise ratio threshold value.
- Generating a second post-filter first formant frequency parameter value dependent on the signal to noise ratio value for the audio signal being less than the signal to noise ratio threshold value may cause the apparatus to perform: comparing the signal to noise ratio value for the audio signal against a second signal to noise ratio threshold value, wherein the second signal to noise ratio threshold value is lower than the first signal to noise ratio threshold value; setting the second post-filter first formant frequency parameter value to at least one of: a minimum post-filter first formant frequency parameter value when the signal to noise ratio value for the audio signal is equal to or less than the second signal to noise ratio threshold value, and an interpolated value between the minimum post-filter first formant frequency parameter value and the maximum post-filter first formant frequency parameter value when the signal to noise ratio value for the audio signal is greater than the second signal to noise ratio threshold value but less than the first signal to noise ratio threshold value.
- the interpolated value may comprise at least one of: a linearly interpolated value; and a non-linearly interpolated value.
- the apparatus may be caused to perform generating a post-filter neutralization factor dependent on the signal to noise ratio for the audio signal.
- Generating a post-filter neutralization factor dependent on the signal to noise ratio for the audio signal may cause the apparatus to perform: comparing the signal to noise ratio value for the audio signal against a first signal to noise ratio threshold value; generating a minimum post-filter neutralization factor dependent on the signal to noise ratio value for the audio signal being less than the signal to noise ratio threshold value; and generating a second post-filter neutralization factor dependent on the signal to noise ratio value for the audio signal being greater than the signal to noise ratio threshold value.
- Generating a second post-filter neutralization factor dependent on the signal to noise ratio value for the audio signal being greater than the signal to noise ratio threshold value may cause the apparatus to perform: generating a maximum post-filter neutralization factor when the signal to noise ratio value for the audio signal is greater than a third signal to noise ratio threshold, the third signal to noise ratio threshold being greater than the first signal to noise ratio threshold; and generating an interpolated post-filter neutralization factor between the maximum post-filter neutralization factor and the minimum post-filter neutralization factor when the signal to noise ratio value for the audio signal is greater than the first signal to noise ratio threshold and less than the third signal to noise ratio threshold.
- Generating an interpolated post-filter neutralization factor may cause the apparatus to perform: a linear interpolation; and a non-linear interpolation.
- Generating a post-filter comprising the second formant frequency filter may cause the apparatus to perform generating a formant frequency parameter configured to amplify the second formant frequency component of the audio signal relative to the first formant frequency dependent on the signal to noise ratio value for the audio signal.
- the apparatus may be caused to perform estimating the second formant frequency.
- the apparatus may be caused to perform estimating the first formant frequency.
- Estimating a signal to noise ratio value for an audio signal may cause the apparatus to perform at least one of: generating a smoothed signal to noise ratio: and low pass filtering an estimated signal to noise ratio over at least two frames of the audio signal.
- An apparatus comprising: a signal to noise estimator configured to estimate a signal to noise ratio value for an audio signal; a post-filter generator configured to generate a post-filter comprising at least one of: a first formant frequency filter and a second formant frequency filter, wherein the post-filter is dependent on the signal to noise ratio value for the audio signal.
- the post-filter may be configured to move energy of the audio signal to higher frequencies.
- the post-filter generator may comprise a first formant filter generator configured to generate a first formant frequency parameter configured to attenuate the first formant frequency components of the audio signal dependent on the signal to noise ratio value for the audio signal.
- the first formant filter generator may comprise: a comparator configured to compare the signal to noise ratio value for the audio signal against a first signal to noise ratio threshold value; a maximum parameter determiner configured to generate a maximum post-filter first formant frequency parameter value dependent on the signal to noise ratio value for the audio signal being greater than the signal to noise ratio threshold value; and a second parameter determiner configured to generate a second post-filter first formant frequency parameter value dependent on the signal to noise ratio value for the audio signal being less than the signal to noise ratio threshold value.
- the second parameter determiner may comprise: a second parameter comparator configured to compare the signal to noise ratio value for the audio signal against a second signal to noise ratio threshold value, wherein the second signal to noise ratio threshold value is lower than the first signal to noise ratio threshold value; a parameter setter configured to set the second post-filter first formant frequency parameter value to at least one of: a minimum post-filter first formant frequency parameter value when the signal to noise ratio value for the audio signal is equal to or less than the second signal to noise ratio threshold value, and an interpolated value between the minimum post-filter first formant frequency parameter value and the maximum post-filter first formant frequency parameter value when the signal to noise ratio value for the audio signal is greater than the second signal to noise ratio threshold value but less than the first signal to noise ratio threshold value.
- the interpolated value may comprise at least one of: a linearly interpolated value; and a non-linearly interpolated value.
- the apparatus may further comprise a post-filter neutralization factor generator dependent on the signal to noise ratio for the audio signal.
- the post-filter neutralization factor generator may comprise: a comparator configured to compare the signal to noise ratio value for the audio signal against a first signal to noise ratio threshold value; and a factor generator configured to generate a minimum post-filter neutralization factor dependent on the signal to noise ratio value for the audio signal being less than the signal to noise ratio threshold value, and a second factor dependent on the signal to noise ratio value for the audio signal being greater than the signal to noise ratio threshold value.
- the factor generator may be configured to generate: a maximum post-filter neutralization factor when the signal to noise ratio value for the audio signal is greater than a third signal to noise ratio threshold, the third signal to noise ratio threshold being greater than the first signal to noise ratio threshold; and an interpolated post-filter neutralization factor between the maximum post-filter neutralization factor and the minimum post-filter neutralization factor when the signal to noise ratio value for the audio signal is greater than the first signal to noise ratio threshold and less than the third signal to noise ratio threshold.
- the interpolated post-filter neutralization factor may comprise: a linear interpolation; and a non-linear interpolation.
- the post-filter generator may comprise a second formant frequency parameter generator, the second formant frequency parameter configured to amplify the second formant frequency component of the audio signal relative to the first formant frequency dependent on the signal to noise ratio value for the audio signal.
- the apparatus may comprise a second formant frequency estimator configured to estimate the second formant frequency.
- the apparatus may comprise a first formant frequency estimator configured to estimate the first formant frequency.
- the signal to noise ratio estimator may comprise at least one of: a smoothed signal to noise ratio estimator configured to generate a smoothed signal to noise ratio: and a low pass filtered signal to noise ratio estimator configured to low pass filter an estimated signal to noise ratio over at least two frames of the audio signal.
- An apparatus comprising: means for estimating a signal to noise ratio value for an audio signal; means for generating a post-filter comprising at least one of: a first formant frequency filter and a second formant frequency filter, wherein the post-filter is dependent on the signal to noise ratio value for the audio signal.
- the post-filter may be configured to move energy of the audio signal to higher frequencies.
- the means for generating a post-filter comprising a first formant frequency filter may comprise means for generating a first formant frequency parameter configured to attenuate the first formant frequency components of the audio signal dependent on the signal to noise ratio value for the audio signal.
- the means for generating a post-filter formant frequency parameter dependent on the signal to noise ratio value for the audio signal may comprise: means for comparing the signal to noise ratio value for the audio signal against a first signal to noise ratio threshold value; means for generating a maximum post-filter first formant frequency parameter value dependent on the signal to noise ratio value for the audio signal being greater than the signal to noise ratio threshold value; and means for generating a second post-filter first formant frequency parameter value dependent on the signal to noise ratio value for the audio signal being less than the signal to noise ratio threshold value.
- the means for generating a second post-filter first formant frequency parameter value dependent on the signal to noise ratio value for the audio signal being less than the signal to noise ratio threshold value may comprise: means for comparing the signal to noise ratio value for the audio signal against a second signal to noise ratio threshold value, wherein the second signal to noise ratio threshold value is lower than the first signal to noise ratio threshold value; and means for setting the second post-filter first formant frequency parameter value to at least one of: a minimum post-filter first formant frequency parameter value when the signal to noise ratio value for the audio signal is equal to or less than the second signal to noise ratio threshold value, and an interpolated value between the minimum post-filter first formant frequency parameter value and the maximum post-filter first formant frequency parameter value when the signal to noise ratio value for the audio signal is greater than the second signal to noise ratio threshold value but less than the first signal to noise ratio threshold value.
- the means for setting the second post-filter first formant frequency parameter value to an interpolated value may comprise means for setting to at least one of: a linearly interpolated value; and a non-linearly interpolated value.
- the apparatus may further comprise means for generating a post-filter neutralization factor dependent on the signal to noise ratio for the audio signal.
- the means for generating a post-filter neutralization factor dependent on the signal to noise ratio for the audio signal may comprise: means for comparing the signal to noise ratio value for the audio signal against a first signal to noise ratio threshold value; means for generating a minimum post-filter neutralization factor dependent on the signal to noise ratio value for the audio signal being less than the signal to noise ratio threshold value; and means for generating a second post-filter neutralization factor dependent on the signal to noise ratio value for the audio signal being greater than the signal to noise ratio threshold value.
- the means for generating a second post-filter neutralization factor dependent on the signal to noise ratio value for the audio signal being greater than the signal to noise ratio threshold value may comprise: means for generating a maximum post-filter neutralization factor when the signal to noise ratio value for the audio signal is greater than a third signal to noise ratio threshold, the third signal to noise ratio threshold being greater than the first signal to noise ratio threshold; and means for generating an interpolated post-filter neutralization factor between the maximum post-filter neutralization factor and the minimum post-filter neutralization factor when the signal to noise ratio value for the audio signal is greater than the first signal to noise ratio threshold and less than the third signal to noise ratio threshold.
- the means for generating an interpolated post-filter neutralization factor may comprise means for generating: a linear interpolation; and a non-linear interpolation.
- the means for generating a post-filter comprising the second formant frequency filter may comprise means for generating a formant frequency parameter configured to amplify the second formant frequency component of the audio signal relative to the first formant frequency dependent on the signal to noise ratio value for the audio signal.
- the apparatus may further comprise means for estimating the second formant frequency.
- the apparatus may further comprise means for estimating the first formant frequency.
- the means for estimating a signal to noise ratio value for an audio signal may comprise at least one of: means for generating a smoothed signal to noise ratio: and means for low pass filtering an estimated signal to noise ratio over at least two frames of the audio signal.
- An electronic device may comprise apparatus as described above.
- a chipset may comprise apparatus as described above.
- FIG. 1 shows schematically an electronic device employing some embodiments of the application
- FIG. 2 shows schematically an audio post processor apparatus according to some embodiments
- FIG. 3 shows schematically the operation of the audio post processor apparatus according to some embodiments
- FIG. 4 shows schematically the operation of the audio post processor formant filter noise adaptation according to some embodiments
- FIG. 5 shows a graphical representation of a spectral speech sample with various levels of filtering according to some embodiments.
- FIG. 6 shows a graphical representation of further spectral speech samples with various levels of filtering according to some embodiments.
- FIG. 1 shows a schematic block diagram of an exemplary electronic device or apparatus 10 , which may incorporate a noise adaptive post filtering apparatus according to an embodiment of the application.
- the apparatus 10 may for example be a mobile terminal or user equipment of a wireless communication system.
- the apparatus 10 may be an audio-video device such as video camera, a Television (TV) receiver, audio recorder or audio player such as a mp3 recorder/player, a media recorder (also known as a mp4 recorder/player), or any computer suitable for the processing of audio signals.
- an audio-video device such as video camera, a Television (TV) receiver, audio recorder or audio player such as a mp3 recorder/player, a media recorder (also known as a mp4 recorder/player), or any computer suitable for the processing of audio signals.
- TV Television
- mp3 recorder/player such as a mp3 recorder/player
- media recorder also known as a mp4 recorder/player
- the electronic device or apparatus 10 in some embodiments comprises a microphone 11 , which is linked via an analogue-to-digital converter (ADC) 14 to a processor 21 .
- the processor 21 is further linked via a digital-to-analogue (DAC) converter 32 to loudspeakers 33 .
- the processor 21 is further linked to a transceiver (RX/TX) 13 , to a user interface (UI) 15 and to a memory 22 .
- the processor 21 can in some embodiments be configured to execute various program codes.
- the implemented program codes in some embodiments comprise noise adaptive post filtering code as described herein.
- the implemented program codes 23 can in some embodiments be stored for example in the memory 22 for retrieval by the processor 21 whenever needed.
- the memory 22 could further provide a section 24 for storing data, for example data that has been encoded in accordance with the application.
- the noise adaptive post filtering code in embodiments can be implemented in hardware or firmware.
- the user interface 15 enables a user to input commands to the electronic device 10 , for example via a keypad, and/or to obtain information from the electronic device 10 , for example via a display.
- a touch screen may provide both input and output functions for the user interface.
- the apparatus 10 in some embodiments comprises a transceiver 13 suitable for enabling communication with other apparatus, for example via a wireless communication network.
- a user of the apparatus 10 for example can use the microphone 11 for inputting speech or other audio signals that are to be transmitted to some other apparatus or that are to be stored in the data section 24 of the memory 22 .
- the analogue-to-digital converter (ADC) 14 in some embodiments converts the input analogue audio signal into a digital audio signal and provides the digital audio signal to the processor 21 .
- the microphone 11 can comprise an integrated microphone and ADC function and provide digital audio signals directly to the processor for processing.
- the processor 21 in such embodiments then processes the digital audio signal according to any suitable encoding process, for example a suitable adaptable multi-rate (AMR) coding or codec.
- AMR adaptable multi-rate
- the resulting bit stream can in some embodiments be provided to the transceiver 13 for transmission to another apparatus.
- the coded audio data in some embodiments can be stored in the data section 24 of the memory 22 , for instance for a later transmission or for a later presentation by the same apparatus 10 .
- the apparatus 10 in some embodiments can also receive a bit stream with correspondingly encoded data from another apparatus via the transceiver 13 .
- the processor 21 may execute decoding program code stored in the memory 22 .
- the processor 21 in such embodiments decodes the received data.
- the processor 21 in some embodiments can be configured to apply noise adaptive post-filtering as described herein, and provide the signal output to a digital-to-analogue converter 32 .
- the digital-to-analogue converter 32 converts the signal into analogue audio data and can in some embodiments output the analogue audio via the loudspeakers 33 .
- Execution of the decoding and noise adaptive post filtering program code in some embodiments can be triggered by an application called by the user via the user interface 15 .
- the received encoded data in some embodiments can also be stored instead of an immediate presentation via the loudspeakers 33 in the data section 24 of the memory 22 , for instance for later decoding, noise adaptive post filtering and presentation or decoding and forwarding to still another apparatus.
- FIG. 2 the schematic structures described in FIG. 2 , and the method steps shown in FIGS. 3 and 4 represent only a part of the operation of an audio codec and specifically noise adaptive post filtering apparatus or method as exemplarily shown implemented in the apparatus shown in FIG. 1 .
- the concept of the application is to improve the intelligibility of mobile phone speech in severe noise conditions. High levels of environmental noise can corrupt the audio signal containing speech and produce poor quality outputs.
- the post-processing apparatus is configured too post-filter the audio signal so to attenuate the first formant and enhance the second formant adaptively according to the estimated noise level. In such a way the acoustic cues in higher frequencies are raised above the noise level.
- FIG. 2 an example post-processing apparatus is shown according to some embodiments of the application. Furthermore with respect to FIGS. 3 and 4 the operation of the post processing apparatus according to FIG. 2 is shown in further detail.
- the post processing apparatus comprises a signal formatter 102 configured to receive the input narrowband signal S nb and format the signal into a form suitable for post processing.
- the signal formatter comprises a pre-emphasis filter 101 .
- the output of the pre emphasis filter is passed to the framer/windower 103
- step 201 The operation of performing the pre emphasis filtering is shown in FIG. 3 by step 201 .
- the signal formatter 102 can comprise a framer/windower 103 .
- the framer/windower 103 can in some embodiments be configured to receive the audio signal and frame/window the audio signal into a suitable series of windowed or framed time samples. It would be understood that in some embodiments the audio signal is processed into separate frames of approximately 20 ms with a sampling frequency of 8 kHz. However it would be understood that any suitable frame length, sampling and overlap can be implemented. In some embodiments the framer/windower 103 is configured to extract the frames from the audio signal using a rectangular window. However any suitable windowing function can be used in some other embodiments. For example in some embodiments a regular hamming window can be used.
- the output of the framer/windower 103 can be passed to a signal analyser 104 .
- step 203 The operation of windowing the audio signal is shown in FIG. 3 by step 203 .
- the framer/windower 103 can be configured in some embodiments to output the filtered windowed audio signal to the signal analyser 104 .
- the post processor apparatus comprises a signal analyser 104 .
- the signal analyser 104 can be configured to analyse the signal to produce input or control values for the post-processor 106 .
- the signal analyser 104 comprises an energy estimator 105 .
- the energy estimator 105 is configured to determine the energy of the framed audio signal.
- the energy can be calculated using any suitable energy estimation method. For example in some embodiments the squares of the absolute sample values are summed or averaged over a frame to generate a frame audio signal energy value.
- step 205 The operation of estimating the energy of the frame is shown in FIG. 3 by step 205 .
- the signal analyser 104 comprises a voice activity detector 107 .
- the voice activity detector 107 can in some embodiments determine a gradient index for a frame (n) using the following expression
- the voice activity detector 107 can then be configured in some embodiments to classify whether the frame is voiced where the gradient index value (X GI ) is lower than a determined limit (GI Limit ) and the frame energy is above a predefined limit (E Limit ).
- the threshold or defined values can be determined by testing known speech material.
- the voice activity detector 107 can be configured to determine whether the current frame is voiced and pass this information onto the post-processor 106 . In some embodiments the voice activity detector 107 is configured to control the operation of the post processor dependent on the analysis of the current frame.
- step 207 The operation of analysing the current frame is voiced is shown in FIG. 3 by step 207 .
- the voice activity detector can be configured to control the post-processor to operate in a smoothing mode.
- the smoothing mode occurs where the post filter coefficients are interpolated from frame to frame.
- the smoothing mode is shown in FIG. 3 by step 213 .
- the post processor 106 is configured to post filter the audio signals from the signal formatter 102 .
- the signal analyser 104 comprises a signal to noise estimator 115 the signal to noise estimator 115 can determine a signal to noise level according to any suitable method.
- a noise estimation is performed as shown in FIG. 4 step 301 and then this value is compared against the signal energy value to determine a signal to noise estimation value.
- step 303 The determination of the signal to noise estimation is shown in FIG. 4 by step 303 .
- the signal to noise estimator 115 can further comprise a SNR smoothing filter.
- the smoothing filter can be any suitable smoothing filter.
- the SNR and/or smoothed SNR values can in some embodiments be used to control or determine the formant filters as described herein.
- step 305 The determination of a smoothed SNR value is shown in FIG. 4 by step 305 .
- the post processor apparatus comprises a post processor part 106 .
- the post processor part 106 is configured to determine the formant frequency estimates, determine the post filtering structure and apply and further post-processing on the audio signal.
- the post-processor part 106 can therefore in some embodiments be configured to receive the output of the signal formatter 102 in the form of the audio signal frames and furthermore the output from the signal analyser 104 in the form of the voice activity analysis and signal to noise estimation parameters.
- the post processor part 106 comprises a formant estimator 109 .
- the formant estimator 109 is configured to estimate the linear prediction coefficients of the frame.
- the linear prediction (LP) coefficients of the frame can in some embodiments be calculated by a 10 th order linear prediction.
- the formant frequencies can be estimated by picking the peaks of the linear prediction spectrum.
- a conventional post filter structure H enh (z) can be based on the determined linear prediction coefficients according to the following expression:
- H enh ⁇ ( z ) 1 - P ⁇ ( z 0.9 ) 1 - P ⁇ ( z 0.99 ) , where P(z) is the linear prediction polynomial.
- the formant estimator 109 can determine the amplitude response of the post-filter H enh (z) using a 256 sample Fast Fourier Transform (FFT). The formant determiner 109 then in some embodiments can locate the first 3 peaks and compare the determined peaks to the formant locations for a previous frame. The peaks which are closest to the formants of the previous frame can then be selected. Where none of the peaks are determined to be close enough the values of the previous frame can be used instead.
- FFT Fast Fourier Transform
- the estimated frequencies of the formants can be determined to the change at most 50 Hz between consecutive frames. In some embodiments the change can be more than or less than 50 Hz.
- the formant frequencies can be determined according to long term analysis of voice patterns, for example the formant frequencies can be determined by computing the averages of the first 2 formant frequencies.
- the formant frequencies can be predetermined, stored in memory and recovered by the formant estimator 109 .
- the formant estimator 109 is optional or configured to simply supply to the formant filter generator the constant values.
- the voice activity detector can be optional and the post-filter applied to all frames using the formant values and with the formant filter parameter r 1 dependent on the estimated signal to noise ratio.
- step 211 The operation of estimating the formant frequencies is shown in FIG. 3 by step 211 .
- the post processor 106 comprises a formant filter determiner/modifier 111 .
- the formant filter determiner/modifier 111 is configured in some embodiments to determine a filter structure where the first 2 formants are manipulated according to the determined signal to noise ratio values.
- the filters H 1 (z) and H 2 (z), which can also be referred to as the formant frequency filters can in some embodiments have the following transfer function structure
- the formant filter determiner/modifier 111 can be configured to modify formant parameters such that dependent on the signal to noise ratio the value of r 1 is within the range 0 to 0.9 (in other words attenuating the first formant) and r 2 is within the range 0.9 to 1 (in other words amplifying the second formant).
- the formant filter determiner/modifier 111 can be configured to receive suitable values of the formant locations ⁇ 1 and ⁇ 2 from the formant estimator 109 . As described herein these formant locations can be estimated and therefore variable or constant, for example predetermined values.
- the formant filter determiner/modifier 111 can be configured to determine a first set of r values where the signal to noise ratio is good or ‘optimal’.
- the formant filter determiner/modifier 111 can be configured to receive the signal to noise ratio and compare the signal to noise ratio (or smoothed signal to noise ratio) against a determined noise threshold or thresholds.
- a single noise threshold of 0 dB is used however it would be understood that in some embodiments other threshold values can be used.
- the formant filter determiner/modifier 111 can in some embodiments be configured to perform two stages of adaptation dependent on the level of the background noise.
- step 307 The operation of determining whether the signal to noise ratio (or smoothed signal to noise ratio) is greater than a determined noise threshold is shown in FIG. 4 by step 307 .
- the formant filter determiner/modifier 111 can be configured firstly to determine the value of r 1 dependent on the signal to noise ratio, and specifically whether the SNR (or smoothed SNR) is greater than the threshold value.
- the value of r 2 is also set to the ‘optimal’ value for the r 2 value.
- the ‘optimal’ value of r 2 is 0.93.
- the formant filter determiner/modifier 111 can be configured in some embodiments where the SNR (or smoothed SNR) is greater than the threshold value to perform a neutralisation of the post-filter where the signal to noise ratio (or smoothed SNR) is above the threshold.
- the neutralisation of the post-filter can be performed in some embodiments by moving the poles and zeros of the cascade of the two formant filters gradually closer to the origin z-plane. This can be expressed as
- H NA ⁇ ( z ) H 1 ⁇ ( z a ) ⁇ H 2 ⁇ ( z a ) ⁇ H TILT ⁇ ( z ) , where the neutralisation can be controlled by the factor ⁇ .
- the factor ⁇ can be interpolated linearly between 1 and 0 dependent on the SNR (or smoothed SNR) changes from 0 dB to 10 dB.
- the post filter obtained at 10 dB would in these embodiments produce a nearly flat amplitude response and would produce an almost inaudible processing effect.
- step 309 The operation of computing the value of a and setting the r 1 value to 0.46 when the signal to noise ratio is above the threshold is shown in FIG. 4 by step 309 .
- the formant filter determiner/modifier 111 can be configured in some embodiments to modify the value of r 1 to be moved closer to a minimum value.
- the formant filter determiner/modifier 111 can be configured to set the value according to a linear interpolation method.
- r 2 is also set with a value of 0.93.
- a frame by frame smoothing of the r 1 (and a) values can be implemented so that there are no sudden drastic changes in the frequency response of the post-filter.
- the formant filter determiner/modifier 111 can be configured to set the factor ⁇ to 1.
- step 311 The determination of the r 1 (and r 2 ) value and the setting ⁇ to 1 when the SNR is less than the threshold is shown in FIG. 4 by step 311 .
- the Formant filter determiner/modifier 111 can then be configured to construct the formant filters H 1 (z) and H 2 (z) using the determined r 1 , r 2 and ⁇ values.
- step 313 The operation of generating the formant filters is shown in FIG. 4 by step 313 .
- the post processor 106 comprised a tilt filter 117 .
- the tilt filter (H TILT (z)) is a filter configured to compensate for the possible spectral tilt in the processed speech caused by the cascade of the two formant filters.
- the tilt filter can in some embodiments be a first order low pass filter according to the following expression:
- H TILT ⁇ ( z ) 1 1 - ⁇ ⁇ ⁇ z - 1 , where ⁇ is computed from a first order linear prediction analysis of the cascade of the formant filters.
- step 315 The construction of the tilt filter is shown in FIG. 4 by step 315 .
- the post processor part comprises an interpolator 113 .
- the filter coefficients can be interpolated between frames to avoid generating audio artefacts caused by sudden transitions between consecutive frames in embodiments where the filter parameters are determined by non-smoothed signal to noise ratio estimation.
- the prevention of audio artefacts can be controlled by the use of smoothing to the signal to noise ratio estimation, in some embodiments by the smoothing of filter parameters from frame to frame.
- the interpolator 113 can be configured to perform interpolation every 20 th sample.
- the coefficients of the formant and tilt filters can be transformed to the line spectral frequency (LSF) domain and the interpolated linearly.
- the transformation to the line spectral frequencies is performed in some embodiments to ensure that the filter remains stable even though its coefficients change.
- the filter coefficients for a sub frame of 20 samples can be obtained according to the following expression:
- a sf ( 1 - i N - 1 ) ⁇ a cf + i N - 1 ⁇ a nf , where a sf denotes the subframe coefficients, a cf the coefficients of the current frame and a nt those of the next frame.
- the length of the frame is N and the starting index of subframe inside the larger frame is I, where i is less or equal to 0 but greater than or equal to N ⁇ 1.
- both the numerator and denominator coefficients of the subframe filter can be interpolated separately.
- step 219 The operation of interpolation is shown in FIG. 3 by step 219 .
- the post processor part 206 can then apply the combination of the formant and tilt filters to generate a post-filter output.
- step 317 The operation of post-filtering the audio signal is shown in FIG. 4 by step 317 .
- step 215 Furthermore the operation of generating the post-filter is shown in FIG. 3 by step 215 .
- the post processor part 106 comprises an adaptive gain controller 119 .
- the adaptive gain controller 119 can be configured to adjust the energy of the processed signal to correspond to that of the ordinary speech signal.
- the speech frames can be processed in 5 ms subframes with the scaling factor determined according to the following expression:
- the values of ⁇ (n) can in some embodiments be calculated for every sample and used to smooth the changes between samples.
- step 221 The operation of performing adaptive gain control is shown in FIG. 3 and in FIG. 4 by step 221 .
- a two formant frequency filter is configured and generated with parameters dependent on the signal to noise ratio of the input audio signal.
- the concept can be seen as generating a filter which is configured to move the audio signal energy from lower frequencies to higher frequencies. It would be understood that in some embodiments this can be achieved by other implementations such as a second or higher formant frequency filter which is configured to amplify the ‘filtered’ formant frequencies relative to earlier formant frequencies.
- a second or higher formant frequency filter which is configured to amplify the ‘filtered’ formant frequencies relative to earlier formant frequencies.
- more than two formants can be filtered dependent on the signal to noise ratio such that the higher formant frequency components are amplified relative to at least one lower formant frequency component.
- embodiments of the application operating within a codec within an apparatus 10
- the invention as described below may be implemented as part of any audio (or speech) codec, including any variable rate/adaptive rate audio (or speech) codec.
- embodiments of the application may be implemented in an audio codec which may implement audio coding over fixed or wired communication paths.
- user equipment may comprise an audio codec such as those described in embodiments of the application above.
- user equipment is intended to cover any suitable type of wireless user equipment, such as mobile telephones, portable data processing devices or portable web browsers.
- PLMN public land mobile network
- elements of a public land mobile network may also comprise audio codecs as described above.
- the various embodiments of the application may be implemented in hardware or special purpose circuits, software, logic or any combination thereof.
- some aspects may be implemented in hardware, while other aspects may be implemented in firmware or software which may be executed by a controller, microprocessor or other computing device, although the invention is not limited thereto.
- firmware or software which may be executed by a controller, microprocessor or other computing device, although the invention is not limited thereto.
- While various aspects of the application may be illustrated and described as block diagrams, flow charts, or using some other pictorial representation, it is well understood that these blocks, apparatus, systems, techniques or methods described herein may be implemented in, as non-limiting examples, hardware, software, firmware, special purpose circuits or logic, general purpose hardware or controller or other computing devices, or some combination thereof.
- At least some embodiments may be an apparatus comprising at least one processor and at least one memory including computer program code the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus at least to perform: estimating a signal to noise ratio value for an audio signal; generating a post-filter comprising at least one of: a first formant frequency filter and a second formant frequency filter, wherein the post-filter is dependent on the signal to noise ratio value for the audio signal.
- any blocks of the logic flow as in the Figures may represent program steps, or interconnected logic circuits, blocks and functions, or a combination of program steps and logic circuits, blocks and functions.
- At least some embodiments may be a computer-readable medium encoded with instructions that, when executed by a computer perform: estimating a signal to noise ratio value for an audio signal; generating a post-filter comprising at least one of: a first formant frequency filter and a second formant frequency filter, wherein the post-filter is dependent on the signal to noise ratio value for the audio signal.
- the memory may be of any type suitable to the local technical environment and may be implemented using any suitable data storage technology, such as semiconductor-based memory devices, magnetic memory devices and systems, optical memory devices and systems, fixed memory and removable memory.
- the data processors may be of any type suitable to the local technical environment, and may include one or more of general purpose computers, special purpose computers, microprocessors, digital signal processors (DSPs), application specific integrated circuits (ASIC), gate level circuits and processors based on multi-core processor architecture, as non-limiting examples.
- Embodiments of the application may be practiced in various components such as integrated circuit modules.
- the design of integrated circuits is by and large a highly automated process. Complex and powerful software tools are available for converting a logic level design into a semiconductor circuit design ready to be etched and formed on a semiconductor substrate.
- Programs such as those provided by Synopsys, Inc. of Mountain View, Calif. and Cadence Design, of San Jose, Calif. automatically route conductors and locate components on a semiconductor chip using well established rules of design as well as libraries of pre-stored design modules.
- the resultant design in a standardized electronic format (e.g., Opus, GDSII, or the like) may be transmitted to a semiconductor fabrication facility or “fab” for fabrication.
- a standardized electronic format e.g., Opus, GDSII, or the like
- circuitry refers to all of the following:
- circuitry applies to all uses of this term in this application, including any claims.
- circuitry would also cover an implementation of merely a processor (or multiple processors) or portion of a processor and its (or their) accompanying software and/or firmware.
- circuitry would also cover, for example and if applicable to the particular claim element, a baseband integrated circuit or applications processor integrated circuit for a mobile phone or similar integrated circuit in server, a cellular network device, or other network device.
Abstract
Description
H(z)=1+α1 z −1,
where a1 is a first order high pass filter coefficient.
where Nk is the frame size and s is the audio signal.
where P(z) is the linear prediction polynomial.
H pf(z)=H 1(z)H 2(z).
where the frequencies of the formants (in radians) is denoted by θi and the values of ri control whether the formants are amplified or supressed as well as the degree of the modification.
where the neutralisation can be controlled by the factor α. In some embodiments the factor α can be interpolated linearly between 1 and 0 dependent on the SNR (or smoothed SNR) changes from 0 dB to 10 dB. The post filter obtained at 10 dB would in these embodiments produce a nearly flat amplitude response and would produce an almost inaudible processing effect.
where μ is computed from a first order linear prediction analysis of the cascade of the formant filters.
where asf denotes the subframe coefficients, acf the coefficients of the current frame and ant those of the next frame. The length of the frame is N and the starting index of subframe inside the larger frame is I, where i is less or equal to 0 but greater than or equal to
where s(n) is the received or input signal and spf(n) is the post filtered signal.
s sc(n)=β(n)s pf(n),
where β(n)=0.9β(n−1)+0.1γ. The values of β(n) can in some embodiments be calculated for every sample and used to smooth the changes between samples.
-
- (a) hardware-only circuit implementations (such as implementations in only analog and/or digital circuitry) and
- (b) to combinations of circuits and software (and/or firmware), such as: (i) to a combination of processor(s) or (ii) to portions of processor(s)/software (including digital signal processor(s)), software, and memory(ies) that work together to cause an apparatus, such as a mobile phone or server, to perform various functions and
- (c) to circuits, such as a microprocessor(s) or a portion of a microprocessor(s), that require software or firmware for operation, even if the software or firmware is not physically present.
Claims (18)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/IB2012/050866 WO2013124712A1 (en) | 2012-02-24 | 2012-02-24 | Noise adaptive post filtering |
Publications (2)
Publication Number | Publication Date |
---|---|
US20150142425A1 US20150142425A1 (en) | 2015-05-21 |
US9576590B2 true US9576590B2 (en) | 2017-02-21 |
Family
ID=49005074
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/375,639 Active 2032-04-16 US9576590B2 (en) | 2012-02-24 | 2012-02-24 | Noise adaptive post filtering |
Country Status (2)
Country | Link |
---|---|
US (1) | US9576590B2 (en) |
WO (1) | WO2013124712A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20190035414A1 (en) * | 2017-07-27 | 2019-01-31 | Harman Becker Automotive Systems Gmbh | Adaptive post filtering |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150162014A1 (en) * | 2013-12-06 | 2015-06-11 | Qualcomm Incorporated | Systems and methods for enhancing an audio signal |
EP2887350B1 (en) * | 2013-12-19 | 2016-10-05 | Dolby Laboratories Licensing Corporation | Adaptive quantization noise filtering of decoded audio data |
EP3252766B1 (en) | 2016-05-30 | 2021-07-07 | Oticon A/s | An audio processing device and a method for estimating a signal-to-noise-ratio of a sound signal |
DE15727008T1 (en) | 2014-06-13 | 2017-11-16 | Retune DSP ApS | MULTI-BAND NOISE REDUCTION SYSTEM AND METHOD FOR DIGITAL AUDIO SIGNALS |
EP3107097B1 (en) * | 2015-06-17 | 2017-11-15 | Nxp B.V. | Improved speech intelligilibility |
US10861478B2 (en) | 2016-05-30 | 2020-12-08 | Oticon A/S | Audio processing device and a method for estimating a signal-to-noise-ratio of a sound signal |
US11483663B2 (en) | 2016-05-30 | 2022-10-25 | Oticon A/S | Audio processing device and a method for estimating a signal-to-noise-ratio of a sound signal |
US10433076B2 (en) | 2016-05-30 | 2019-10-01 | Oticon A/S | Audio processing device and a method for estimating a signal-to-noise-ratio of a sound signal |
US11227622B2 (en) * | 2018-12-06 | 2022-01-18 | Beijing Didi Infinity Technology And Development Co., Ltd. | Speech communication system and method for improving speech intelligibility |
CN111739549B (en) * | 2020-08-17 | 2020-12-08 | 北京灵伴即时智能科技有限公司 | Sound optimization method and sound optimization system |
CN112151047B (en) * | 2020-09-27 | 2022-08-05 | 桂林电子科技大学 | Real-time automatic gain control method applied to voice digital signal |
Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6233552B1 (en) * | 1999-03-12 | 2001-05-15 | Comsat Corporation | Adaptive post-filtering technique based on the Modified Yule-Walker filter |
US6584441B1 (en) * | 1998-01-21 | 2003-06-24 | Nokia Mobile Phones Limited | Adaptive postfilter |
US20040143439A1 (en) * | 2000-04-17 | 2004-07-22 | At & T Corp. | Pseudo-cepstral adaptive short-term post-filters for speech coders |
US20050091046A1 (en) * | 2003-10-24 | 2005-04-28 | Broadcom Corporation | Method for adaptive filtering |
US6985855B2 (en) * | 1998-05-26 | 2006-01-10 | Koninklijke Philips Electronics N.V. | Transmission system with improved speech decoder |
US20060116874A1 (en) * | 2003-10-24 | 2006-06-01 | Jonas Samuelsson | Noise-dependent postfiltering |
US20060270467A1 (en) | 2005-05-25 | 2006-11-30 | Song Jianming J | Method and apparatus of increasing speech intelligibility in noisy environments |
EP2116997A1 (en) | 2007-03-02 | 2009-11-11 | Panasonic Corporation | Audio decoding device and audio decoding method |
US20090281800A1 (en) | 2008-05-12 | 2009-11-12 | Broadcom Corporation | Spectral shaping for speech intelligibility enhancement |
US20100004927A1 (en) | 2008-07-02 | 2010-01-07 | Fujitsu Limited | Speech sound enhancement device |
US20100088092A1 (en) * | 2007-03-05 | 2010-04-08 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and Arrangement for Controlling Smoothing of Stationary Background Noise |
US20110125507A1 (en) * | 2008-07-18 | 2011-05-26 | Dolby Laboratories Licensing Corporation | Method and System for Frequency Domain Postfiltering of Encoded Audio Data in a Decoder |
US20110125491A1 (en) | 2009-11-23 | 2011-05-26 | Cambridge Silicon Radio Limited | Speech Intelligibility |
-
2012
- 2012-02-24 WO PCT/IB2012/050866 patent/WO2013124712A1/en active Application Filing
- 2012-02-24 US US14/375,639 patent/US9576590B2/en active Active
Patent Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6584441B1 (en) * | 1998-01-21 | 2003-06-24 | Nokia Mobile Phones Limited | Adaptive postfilter |
US6985855B2 (en) * | 1998-05-26 | 2006-01-10 | Koninklijke Philips Electronics N.V. | Transmission system with improved speech decoder |
US6233552B1 (en) * | 1999-03-12 | 2001-05-15 | Comsat Corporation | Adaptive post-filtering technique based on the Modified Yule-Walker filter |
US20040143439A1 (en) * | 2000-04-17 | 2004-07-22 | At & T Corp. | Pseudo-cepstral adaptive short-term post-filters for speech coders |
US20050091046A1 (en) * | 2003-10-24 | 2005-04-28 | Broadcom Corporation | Method for adaptive filtering |
US20060116874A1 (en) * | 2003-10-24 | 2006-06-01 | Jonas Samuelsson | Noise-dependent postfiltering |
US20060270467A1 (en) | 2005-05-25 | 2006-11-30 | Song Jianming J | Method and apparatus of increasing speech intelligibility in noisy environments |
EP2116997A1 (en) | 2007-03-02 | 2009-11-11 | Panasonic Corporation | Audio decoding device and audio decoding method |
US8554548B2 (en) * | 2007-03-02 | 2013-10-08 | Panasonic Corporation | Speech decoding apparatus and speech decoding method including high band emphasis processing |
US20100088092A1 (en) * | 2007-03-05 | 2010-04-08 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and Arrangement for Controlling Smoothing of Stationary Background Noise |
US20090281800A1 (en) | 2008-05-12 | 2009-11-12 | Broadcom Corporation | Spectral shaping for speech intelligibility enhancement |
US20100004927A1 (en) | 2008-07-02 | 2010-01-07 | Fujitsu Limited | Speech sound enhancement device |
US20110125507A1 (en) * | 2008-07-18 | 2011-05-26 | Dolby Laboratories Licensing Corporation | Method and System for Frequency Domain Postfiltering of Encoded Audio Data in a Decoder |
US20110125491A1 (en) | 2009-11-23 | 2011-05-26 | Cambridge Silicon Radio Limited | Speech Intelligibility |
Non-Patent Citations (18)
Title |
---|
"3rd Generation Partnership Project;Technical Specification Group Services and System Aspects;Mandatory Speech Codec speech processing functions;Adaptive Multi-Rate (AMR) speech codec; Transcoding functions(Release 8)", 3GPP TS 26.090, V8.0.0, Dec. 2008, pp. 1-55. |
Chen et al., "Adaptive Post-filtering for Quality Enhancement of Coded Speech", IEEE Transactions on Speech and Audio Processing, vol. 3, No. 1, Jan. 1995, pp. 59-71. |
Chen et al., "Perceptual Postfilter Estimation for Low Bit Rate Speech Coders Using Gaussian Mixture Models", Proceedings of the Interspeech, Sep. 4-8, 2005, 4 pages. |
Grancharov et al., "Generalized Postfilter for Speech Quality Enhancement", IEEE Transactions on Audio, Speech, and Language Processing, vol. 16, No. 1, Jan. 2008, pp. 57-64. |
Hall et al., Intelligibility and Listener Preference of Telephone Speech in the Presence of Babble Noise, the Journal of the Acoustical Society of America, vol. 127, No. 1, 2010, pp. 280-285. |
International Search Report received for corresponding Patent Cooperation Treaty Application No. PCT/IB2012/050866 , dated Feb. 7, 2013, 5 pages. |
Jayant, "Adaptive Post-Filtering of ADPCM Speech", The Bell System Technical Journal, vol. 60, No. 5, May-Jun. 1981, pp. 707-717. |
Jokinen E. "Adaptive post-filtering of speech in mobile communications", Aalto University Library 2010, p. 1-71. Retrieved from Internet . |
Jokinen E. "Adaptive post-filtering of speech in mobile communications", Aalto University Library 2010, p. 1-71. Retrieved from Internet <URL: https://aaltodoc.aalto.fi/bitstream/handle/123456789/3279/urn100278.pdf?sequence=1>. |
Laaksonen et al., "Artificial Bandwidth Expansion Method to Improve Intelligibility and Quality of AMR-Coded Narrowband Speech", IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, Mar. 18-23, 2005, pp. 809-812. |
Mustapha et al., An Adaptive Post-Filtering Technique Based on the Modified Yule-Walker Filter, IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, Mar. 15-19, 1999, pp. 197-200. |
Niederjohn et al., "The Enhancement of Speech Intelligibility in High Noise Levels by High-Pass Filtering Followed by Rapid Amplitude Compression", IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 24, No. 4, Aug. 1976, pp. 277-282. |
Pulakka et al., "Bandwidth Extension of Telephone Speech Using a Filter Bank Implementation for Highband Mel Spectrum", 18th European Signal Processing Conference, Aug. 23-27, 2010, pp. 979-983. |
Skowronski et al., "Applied Principles of Clear and Lombard Speech for Automated Intelligibility Enhancement in Noisy Environments", Speech Communication, vol. 48, No. 5, May 2006, pp. 549-558. |
Tang et al., "Energy Reallocation Strategies for Speech Enhancement in Known Noise Conditions", Proceedings of Interspeech, 2010, 4 Pages. |
Thomas et al., "The Intelligibility of Filtered-Clipped Speech in Noise", Journal of the Audio Engineering Society, vol. 18, No. 3, Jun. 1, 1970, pp. 299-302. |
Vainio et al., "Developing a speech Intelligibility Test Based on Measuring Speech Reception Thresholds in Noise for English and Finnish", The Journal of the Acoustical Society of America, vol. 118, No. 3, 2005, pp. 1742-1750. |
Yoo et al., Speech Signal Modification to Increase Intelligibility in Noisy Environments, the Journal of the Acoustical Society of America , vol. 122, No. 2, 2007, pp. 1138-1149. |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20190035414A1 (en) * | 2017-07-27 | 2019-01-31 | Harman Becker Automotive Systems Gmbh | Adaptive post filtering |
Also Published As
Publication number | Publication date |
---|---|
US20150142425A1 (en) | 2015-05-21 |
WO2013124712A1 (en) | 2013-08-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9576590B2 (en) | Noise adaptive post filtering | |
JP6147744B2 (en) | Adaptive speech intelligibility processing system and method | |
US8571231B2 (en) | Suppressing noise in an audio signal | |
JP4376489B2 (en) | Frequency domain post-filtering method, apparatus and recording medium for improving the quality of coded speech | |
US7454335B2 (en) | Method and system for reducing effects of noise producing artifacts in a voice codec | |
US10043533B2 (en) | Method and device for boosting formants from speech and noise spectral estimation | |
US20160189707A1 (en) | Speech processing | |
JP6373873B2 (en) | System, method, apparatus and computer readable medium for adaptive formant sharpening in linear predictive coding | |
US20080312916A1 (en) | Receiver Intelligibility Enhancement System | |
US20110125490A1 (en) | Noise suppressor and voice decoder | |
JP2008065090A (en) | Noise suppressing apparatus | |
US11335355B2 (en) | Estimating noise of an audio signal in the log2-domain | |
US9633667B2 (en) | Adaptive audio signal filtering | |
US20140019125A1 (en) | Low band bandwidth extended | |
EP2828853B1 (en) | Method and system for bias corrected speech level determination | |
US20150071463A1 (en) | Method and apparatus for filtering an audio signal | |
US9928841B2 (en) | Method of packet loss concealment in ADPCM codec and ADPCM decoder with PLC circuit | |
US20230154479A1 (en) | Low cost adaptation of bass post-filter | |
WO2023172609A1 (en) | Method and audio processing system for wind noise suppression |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: NOKIA CORPORATION, FINLAND Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:SJOEBERG, JARI;MYLLYLAE, VILLE;JOKINEN, EMMA JOHANNA;AND OTHERS;SIGNING DATES FROM 20130205 TO 20130207;REEL/FRAME:034179/0953 |
|
AS | Assignment |
Owner name: NOKIA TECHNOLOGIES OY, FINLAND Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:NOKIA CORPORATION;REEL/FRAME:038315/0141 Effective date: 20150116 |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
AS | Assignment |
Owner name: PROVENANCE ASSET GROUP LLC, CONNECTICUT Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:NOKIA TECHNOLOGIES OY;NOKIA SOLUTIONS AND NETWORKS BV;ALCATEL LUCENT SAS;REEL/FRAME:043877/0001 Effective date: 20170912 Owner name: NOKIA USA INC., CALIFORNIA Free format text: SECURITY INTEREST;ASSIGNORS:PROVENANCE ASSET GROUP HOLDINGS, LLC;PROVENANCE ASSET GROUP LLC;REEL/FRAME:043879/0001 Effective date: 20170913 Owner name: CORTLAND CAPITAL MARKET SERVICES, LLC, ILLINOIS Free format text: SECURITY INTEREST;ASSIGNORS:PROVENANCE ASSET GROUP HOLDINGS, LLC;PROVENANCE ASSET GROUP, LLC;REEL/FRAME:043967/0001 Effective date: 20170913 |
|
AS | Assignment |
Owner name: NOKIA US HOLDINGS INC., NEW JERSEY Free format text: ASSIGNMENT AND ASSUMPTION AGREEMENT;ASSIGNOR:NOKIA USA INC.;REEL/FRAME:048370/0682 Effective date: 20181220 |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 4 |
|
AS | Assignment |
Owner name: PROVENANCE ASSET GROUP LLC, CONNECTICUT Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:CORTLAND CAPITAL MARKETS SERVICES LLC;REEL/FRAME:058983/0104 Effective date: 20211101 Owner name: PROVENANCE ASSET GROUP HOLDINGS LLC, CONNECTICUT Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:CORTLAND CAPITAL MARKETS SERVICES LLC;REEL/FRAME:058983/0104 Effective date: 20211101 Owner name: PROVENANCE ASSET GROUP LLC, CONNECTICUT Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:NOKIA US HOLDINGS INC.;REEL/FRAME:058363/0723 Effective date: 20211129 Owner name: PROVENANCE ASSET GROUP HOLDINGS LLC, CONNECTICUT Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:NOKIA US HOLDINGS INC.;REEL/FRAME:058363/0723 Effective date: 20211129 |
|
AS | Assignment |
Owner name: RPX CORPORATION, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:PROVENANCE ASSET GROUP LLC;REEL/FRAME:059352/0001 Effective date: 20211129 |
|
AS | Assignment |
Owner name: BARINGS FINANCE LLC, AS COLLATERAL AGENT, NORTH CAROLINA Free format text: PATENT SECURITY AGREEMENT;ASSIGNOR:RPX CORPORATION;REEL/FRAME:063429/0001 Effective date: 20220107 |