CN111091846B - Noise reduction method and echo cancellation system applying same - Google Patents
Noise reduction method and echo cancellation system applying same
- Publication number
- CN111091846B (application CN201911364046.0A)
- Authority
- CN
- China
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02082—Noise filtering the noise being echo, reverberation of the speech
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
Abstract
The invention belongs to the technical field of echo cancellation, and particularly relates to a noise reduction method and an echo cancellation system applying the same. The noise reduction method comprises the following steps: acquiring a spur threshold pow_threshold according to the far-end sound signal; and attenuating data w in the estimated sound signal that is smaller than the spur threshold pow_threshold. In this technical solution, the spur threshold pow_threshold obtained from the far-end sound signal gives a better indication of whether data in the estimated sound signal contains residual spur noise, so that the corresponding data w can be attenuated to remove the spur noise.
Description
Technical Field
The invention belongs to the technical field of echo cancellation, and particularly relates to a noise reduction method and an echo cancellation system applying the same.
Background
The basic principle of current acoustic echo cancellation is as follows: an adaptive filter is used to identify the parameters of an unknown echo channel ω; a far-end signal model is established based on the correlation between the loudspeaker signal and the multipath echoes it generates, so as to simulate the echo path; and the filter is adjusted by the adaptive NLMS algorithm so that its impulse response approximates the real echo path. The estimated echo is then subtracted from the signal received by the microphone, which achieves echo cancellation. However, the preliminary result computed by the NLMS algorithm often still contains some spur noise.
The invention patent application with publication No. CN105791611A, published on July 20, 2016, discloses an echo cancellation method and device. It detects whether the residual signal meets a preset output condition and, when the condition is not met, multiplies the residual signal by a first attenuation factor to obtain the output signal. The electronic device can thus attenuate the output further when it detects that the residual signal still contains a strong echo. This addresses the problem that, because the estimated signal obtained from the far-end signal by the NLMS algorithm is inaccurate, the residual signal obtained by subtracting the estimated signal from the near-end signal still contains a strong echo that degrades channel quality. However, that scheme must evaluate the correlation between the far-end sound signal and the near-end sound signal to judge whether the near-end signal contains an echo, so the algorithm is computationally complex and places high demands on the processor.
Disclosure of Invention
In order to solve the above technical problem, the present invention provides a noise reduction method, including:
acquiring a spur threshold pow_threshold according to the far-end sound signal;
attenuating data w in the estimated sound signal that is smaller than the spur threshold pow_threshold.
In the above technical solution, the spur threshold pow_threshold obtained from the far-end sound signal gives a better indication of whether data in the estimated sound signal contains residual spur noise, so that the corresponding data w can be attenuated to remove the spur noise.
Further, attenuating the data w in the estimated sound signal that is smaller than the spur threshold pow_threshold means: let w = w × coef.
Further, acquiring the spur threshold pow_threshold according to the far-end sound signal includes: calculating the far-end sound maximum energy far_max_pow from the far-end sound signal; and adjusting a base spur threshold pow_base_threshold according to far_max_pow to obtain the spur threshold pow_threshold. Different spur thresholds are thus applied to far-end sound signals of different intensities when determining which data in the estimated signal should be attenuated to remove noise, which avoids distorting the estimated sound signal.
Preferably, adjusting the base spur threshold pow_base_threshold according to the far-end sound maximum energy far_max_pow to obtain the spur threshold pow_threshold means: if far_max_pow is greater than a first threshold, pow_threshold equals k times pow_base_threshold; otherwise, pow_threshold equals pow_base_threshold.
Further, calculating the far-end sound maximum energy far_max_pow from the far-end sound signal includes: calculating the energy of each frame in the far-end sound signal; and taking the maximum of these frame energies as the far-end sound maximum energy far_max_pow.
Preferably, the attenuation coefficient coef = 2^(ln(far_max_pow/near_max_pow)), where near_max_pow is the near-end sound maximum energy.
Further, obtaining the near-end sound maximum energy near_max_pow includes: calculating the energy of each frame in the near-end sound signal; and taking the maximum of these frame energies as the near-end sound maximum energy near_max_pow.
Preferably, the estimated sound is an output signal of an automatic echo cancellation process based on the far-end sound and the near-end sound.
Preferably, the automatic echo cancellation process uses an NLMS algorithm to perform echo cancellation.
The invention also provides an echo cancellation system, characterized in that it uses the noise reduction method according to any one of the above.
The invention has the following beneficial effects:
The far-end sound signal, the near-end sound signal, and the estimated signal output by the automatic echo cancellation processing are combined in the calculation to further remove spurs from the estimated signal; the spur suppression method has low computational complexity and suppresses spurs well.
Drawings
Fig. 1 is a schematic application diagram of an echo cancellation system according to an embodiment of the present invention.
Detailed Description
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. Unless otherwise defined, all terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. It will be further understood that the conventional terms should be interpreted as having a meaning that is consistent with their meaning in the relevant art and this disclosure. The present disclosure is to be considered as an example of the invention and is not intended to limit the invention to the specific embodiments.
Example one
The basic principle of current acoustic echo cancellation is as follows: an adaptive filter is used to identify the parameters of an unknown echo channel ω; a far-end signal model is established based on the correlation between the loudspeaker signal and the multipath echoes it generates, so as to simulate the echo path; and the filter is adjusted by the adaptive NLMS algorithm so that its impulse response approximates the real echo path. The estimated echo is then subtracted from the signal received by the microphone, which achieves echo cancellation. However, the preliminary result computed by the NLMS algorithm often still contains some spur noise.
Spur noise is particularly likely to occur when the near-end sound is loud and the far-end sound is quiet, and it is often correlated with the near-end sound. This embodiment provides an echo cancellation system that performs noise reduction on the signal output by the NLMS algorithm to eliminate the spur noise.
As shown in Fig. 1, the echo cancellation system of this embodiment can be applied to two electronic devices (a first electronic device and a second electronic device) in a call with each other, and is used to cancel echo signals during their communication. During the call, the electronic device inputs a far-end voice signal and a near-end voice signal into its Automatic Echo Cancellation (AEC) module; the far-end voice signal is the voice signal sent by the other electronic device in the call, and the near-end voice signal is the voice signal collected by the device's own microphone. In this embodiment, the AEC module uses the normalized least mean square (NLMS) adaptive filtering algorithm to perform automatic echo cancellation on the input near-end voice signal according to the input far-end voice signal, and outputs the resulting estimated voice signal out to the noise reduction module. After noise reduction, the estimated voice signal out is transmitted to the other electronic device in the call.
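The AEC stage described above can be illustrated with a minimal NLMS sketch in Python. This is not code from the patent: the function name `nlms_echo_cancel`, the filter length `taps`, the step size `mu`, and the regularization term `eps` are all hypothetical choices made for illustration, and the synthetic signals stand in for real loudspeaker and microphone data.

```python
import random

def nlms_echo_cancel(far, near, taps=8, mu=0.5, eps=1e-6):
    """Subtract an NLMS estimate of the echo from the near-end signal.

    far  : far-end (loudspeaker) samples
    near : near-end (microphone) samples, same length as far
    Returns the residual, i.e. the estimated sound signal `out`.
    taps, mu and eps are illustrative values, not taken from the patent.
    """
    weights = [0.0] * taps   # current estimate of the echo path
    history = [0.0] * taps   # most recent far-end samples, newest first
    out = []
    for far_sample, mic_sample in zip(far, near):
        history = [far_sample] + history[:-1]
        echo_est = sum(w * x for w, x in zip(weights, history))
        err = mic_sample - echo_est                    # residual after subtraction
        norm = sum(x * x for x in history) + eps       # input power normalization
        weights = [w + mu * err * x / norm for w, x in zip(weights, history)]
        out.append(err)
    return out

# Synthetic check: the microphone hears only a delayed, halved copy of the far end.
random.seed(0)
far = [random.uniform(-1.0, 1.0) for _ in range(2000)]
near = [0.0] + [0.5 * s for s in far[:-1]]
residual = nlms_echo_cancel(far, near)
```

In this noise-free toy case the residual shrinks toward zero as the filter converges. With real speech and double-talk the residual typically retains some artifacts, which is the situation the noise reduction module is meant to handle.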
The noise reduction module performs noise reduction on the estimated voice signal output by the automatic echo cancellation processing so as to further eliminate spur noise. The noise reduction method mainly comprises the following steps:
First, a spur threshold pow_threshold is obtained from the far-end sound signal. The noise reduction module stores a base spur threshold pow_base_threshold, set either by system default or by the user; in this embodiment pow_base_threshold may default to 300, or may be set by the user in a range of 300-. The noise reduction module calculates the far-end sound maximum energy far_max_pow from the far-end sound signal, and adjusts the base spur threshold pow_base_threshold based on far_max_pow to determine the spur threshold pow_threshold:
if far_max_pow is greater than the first threshold, pow_threshold equals k times pow_base_threshold;
otherwise, pow_threshold equals pow_base_threshold.
The first threshold is a system default value stored in the noise reduction module, or may be set empirically by the user; in this embodiment it is the system default of 1000. Likewise, k is a system default value greater than 1 stored in the noise reduction module, or an empirical value greater than 1 set by the user; in this embodiment it is the system default of 2. Therefore, with the base spur threshold pow_base_threshold = 300 in this embodiment: if the far-end sound maximum energy far_max_pow is 2000, the spur threshold pow_threshold = pow_base_threshold × 2 = 600; if far_max_pow is 800, the spur threshold pow_threshold = pow_base_threshold = 300.
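The threshold rule above can be sketched in a few lines of Python. The function name `spur_threshold` and the parameter name `first_threshold` are hypothetical; the default values (300, 1000, 2) are the embodiment's defaults as stated in the text.

```python
def spur_threshold(far_max_pow, pow_base_threshold=300, first_threshold=1000, k=2):
    """Return pow_threshold for a given far-end maximum energy.

    Defaults follow the embodiment: base threshold 300, first threshold 1000, k = 2.
    """
    if far_max_pow > first_threshold:
        # Loud far end: raise the threshold to k times the base value.
        return k * pow_base_threshold
    # Quiet far end: keep the base threshold unchanged.
    return pow_base_threshold

print(spur_threshold(2000))  # 600, matching the first example in the text
print(spur_threshold(800))   # 300, matching the second example in the text
```

Because the comparison is strictly "greater than", a far-end maximum energy exactly equal to the first threshold keeps the base value.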
Second, data w in the estimated sound signal that is smaller than the spur threshold pow_threshold is attenuated. The noise reduction module compares each data value in the estimated sound signal out output by the AEC module with pow_threshold. If the value is smaller than the spur threshold, the data is regarded as spur data or background-noise data and is attenuated according to the formula w = w × coef; if the value is greater than the spur threshold, no attenuation is performed. The attenuation coefficient coef is calculated by the formula:
coef=2^(ln(far_max_pow/near_max_pow))
where far_max_pow is the far-end sound maximum energy calculated from the far-end sound signal, and
near_max_pow is the near-end sound maximum energy calculated from the near-end sound signal.
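The attenuation step can be sketched as follows. The function names are hypothetical, and two points are assumptions rather than statements from the source: the sketch compares the magnitude `abs(w)` of each signed sample against the threshold (the text only says "the value of the data"), and it assumes both maximum energies are strictly positive so that the logarithm is defined. Note that 2^(ln r) is the same as r^(ln 2), roughly r^0.693, so the coefficient falls below 1 exactly when far_max_pow is smaller than near_max_pow.

```python
import math

def attenuation_coef(far_max_pow, near_max_pow):
    """coef = 2^(ln(far_max_pow / near_max_pow)); both inputs assumed > 0."""
    return 2.0 ** math.log(far_max_pow / near_max_pow)

def attenuate_spurs(out, pow_threshold, coef):
    """Scale samples below the spur threshold by coef; leave the rest untouched.

    Assumption: the comparison uses the sample magnitude abs(w).
    """
    return [w * coef if abs(w) < pow_threshold else w for w in out]

# Illustrative values: quiet far end, loud near end.
coef = attenuation_coef(500.0, 4000.0)
cleaned = attenuate_spurs([120, -250, 5000, 40], 600, coef)
```

The source does not say how the case far_max_pow ≥ near_max_pow is handled, where the formula gives a coefficient of 1 or more; the sketch applies the formula unchanged, and an implementation might choose to cap the coefficient at 1.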
Preferably, far_max_pow, the far-end sound maximum energy, is calculated in this embodiment as follows:
(a) Calculate the energy of each frame in the far-end sound signal. This embodiment calculates the far-end sound maximum energy from the far-end sound signal within the most recent 100 ms. The far-end sound signal is divided into frames of 20 ms each (one frame being one processing unit), giving 5 frames in total. Each frame is sampled at an 8 kHz sampling rate, and each sample is of type short, ranging from -32768 to 32767.
The far-end sound energy of each frame is calculated as far_pow = sqrt(mean(w(0:end)^2)), yielding 5 far-end sound energy values.
(b) Take the maximum of the frame energies of the far-end sound signal as the far-end sound maximum energy far_max_pow. In this embodiment, that is the maximum among the energies of the 5 frames within 100 ms.
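Steps (a) and (b) can be sketched in Python as below. The names `frame_energy`, `max_energy`, and `FRAME_LEN` are hypothetical. The frame length of 160 samples follows from the 20 ms frame and 8 kHz sampling rate given in the text; the sketch assumes the signal is a non-empty list of samples and simply uses whatever is available if fewer than 5 full frames exist.

```python
import math

FRAME_LEN = 160  # 20 ms at an 8 kHz sampling rate

def frame_energy(frame):
    """Per-frame energy as defined in the text: sqrt(mean(w^2))."""
    return math.sqrt(sum(s * s for s in frame) / len(frame))

def max_energy(signal, frame_len=FRAME_LEN, num_frames=5):
    """Maximum frame energy over the most recent num_frames frames (100 ms)."""
    recent = signal[-frame_len * num_frames:]
    frames = [recent[i:i + frame_len] for i in range(0, len(recent), frame_len)]
    return max(frame_energy(f) for f in frames)

# Four silent frames followed by one frame at a constant level of 1000.
far_signal = [0] * (4 * FRAME_LEN) + [1000] * FRAME_LEN
far_max_pow = max_energy(far_signal)
```

The same function applies unchanged to the near-end signal to obtain near_max_pow.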
Similarly, near_max_pow, the near-end sound maximum energy, is calculated in this embodiment as follows:
(a) Calculate the energy of each frame in the near-end sound signal. This embodiment calculates the near-end sound maximum energy from the near-end sound signal within the most recent 100 ms of the call. The near-end sound signal is divided into frames of 20 ms each (one frame being one processing unit), giving 5 frames in total. Each frame is sampled at an 8 kHz sampling rate, and each sample is of type short, ranging from -32768 to 32767.
The near-end sound energy of each frame is calculated as near_pow = sqrt(mean(w(0:end)^2)), yielding 5 near-end sound energy values.
(b) Take the maximum of the frame energies of the near-end sound signal as the near-end sound maximum energy near_max_pow. In this embodiment, that is the maximum among the energies of the 5 frames within 100 ms.
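During a live call the "most recent 100 ms" window moves forward with every new frame. One way to maintain it, shown here purely as an illustrative sketch, is a fixed-length queue of the last 5 frame energies. The class name `MaxEnergyTracker` and its `push` method are hypothetical and do not come from the patent.

```python
import math
from collections import deque

class MaxEnergyTracker:
    """Keep the energies of the last num_frames frames and report their maximum."""

    def __init__(self, num_frames=5):
        # deque with maxlen drops the oldest energy automatically.
        self._energies = deque(maxlen=num_frames)

    def push(self, frame):
        """Add one 20 ms frame of samples and return the current maximum energy."""
        energy = math.sqrt(sum(s * s for s in frame) / len(frame))
        self._energies.append(energy)
        return max(self._energies)

# One tracker per direction: one for near_max_pow, one for far_max_pow.
near_tracker = MaxEnergyTracker()
near_max_pow = near_tracker.push([100] * 160)
```

Each call to `push` costs one pass over the frame plus a maximum over at most 5 values, which is in keeping with the low computational cost the description aims for.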
Although embodiments of the present invention have been described, various changes or modifications may be made by one of ordinary skill in the art within the scope of the appended claims.
Claims (6)
1. A method of noise reduction, comprising:
acquiring a spur threshold pow_threshold according to the far-end sound signal;
attenuating data w in the estimated sound signal that is less than the spur threshold pow_threshold;
wherein acquiring the spur threshold pow_threshold according to the far-end sound signal includes:
calculating the far-end sound maximum energy far_max_pow from the far-end sound signal;
adjusting a base spur threshold pow_base_threshold according to the far-end sound maximum energy far_max_pow to obtain the spur threshold pow_threshold; wherein adjusting the base spur threshold pow_base_threshold according to the far-end sound maximum energy far_max_pow to obtain the spur threshold pow_threshold means:
if far_max_pow is greater than a first threshold, pow_threshold equals k times pow_base_threshold; otherwise, pow_threshold equals pow_base_threshold;
the estimated sound signal is the output signal of automatic echo cancellation processing based on the far-end sound and the near-end sound.
2. The noise reduction method according to claim 1, wherein attenuating the data w in the estimated sound signal that is less than the spur threshold pow_threshold means:
let w = w × coef;
coef=2^(ln(far_max_pow/near_max_pow));
wherein near_max_pow is the near-end sound maximum energy.
3. The noise reduction method according to claim 1, wherein calculating the far-end sound maximum energy far_max_pow from the far-end sound signal comprises:
calculating the energy of each frame in the far-end sound signal;
and taking the maximum of the frame energies of the far-end sound signal as the far-end sound maximum energy far_max_pow.
4. The noise reduction method according to claim 2, wherein obtaining the near-end sound maximum energy near_max_pow comprises:
calculating the energy of each frame in the near-end sound signal;
and taking the maximum of the frame energies of the near-end sound signal as the near-end sound maximum energy near_max_pow.
5. The noise reduction method according to claim 1, characterized in that:
the automatic echo cancellation processing uses the NLMS algorithm to perform echo cancellation.
6. An echo cancellation system, characterized by:
using the noise reduction method according to any one of claims 1 to 5.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911364046.0A CN111091846B (en) | 2019-12-26 | 2019-12-26 | Noise reduction method and echo cancellation system applying same |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911364046.0A CN111091846B (en) | 2019-12-26 | 2019-12-26 | Noise reduction method and echo cancellation system applying same |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111091846A (en) | 2020-05-01 |
CN111091846B (en) | 2022-07-26 |
Family
ID=70397375
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911364046.0A (Active) CN111091846B (en) | 2019-12-26 | 2019-12-26 | Noise reduction method and echo cancellation system applying same |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111091846B (en) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1953060A (en) * | 2006-11-24 | 2007-04-25 | 北京中星微电子有限公司 | Echo elimination device for microphone and method thereof |
CN101268624A (en) * | 2005-09-20 | 2008-09-17 | 艾利森电话股份有限公司 | Method and test signal for measuring speech intelligibility |
CN102708874A (en) * | 2011-03-03 | 2012-10-03 | 微软公司 | Noise adaptive beamforming for microphone arrays |
CN103888107A (en) * | 2014-03-21 | 2014-06-25 | 天地融科技股份有限公司 | Data decoding method |
CN104395957A (en) * | 2012-04-30 | 2015-03-04 | 创新科技有限公司 | A universal reconfigurable echo cancellation system |
CN104980600A (en) * | 2014-04-02 | 2015-10-14 | 想象技术有限公司 | Auto-tuning Of Non-linear Processor Threshold |
CN105791611A (en) * | 2016-02-22 | 2016-07-20 | 腾讯科技(深圳)有限公司 | Echo cancellation method and device |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130332156A1 (en) * | 2012-06-11 | 2013-12-12 | Apple Inc. | Sensor Fusion to Improve Speech/Audio Processing in a Mobile Device |
GB2527865B (en) * | 2014-10-30 | 2016-12-14 | Imagination Tech Ltd | Controlling operational characteristics of an acoustic echo canceller |
Non-Patent Citations (2)
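Title |
---|
"Statistical analysis of doubletalk detection for calibration and performance evaluation"; James D. Gordy et al.; IEEE Transactions on Audio, Speech, and Language Processing; 2007-02-20; Vol. 15, No. 3; full text * |
"Research and Implementation of a Video Intercom System Based on WebRTC" (基于WebRTC的可视对讲系统的研究与实现); He Qingshan (何青山); China Master's Theses Full-text Database (中国优秀硕士学位论文全文数据库); 2016-12-31; full text * |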
Also Published As
Publication number | Publication date |
---|---|
CN111091846A (en) | 2020-05-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9443528B2 (en) | Method and device for eliminating echoes | |
CN110225214B (en) | Method, attenuation unit, system and medium for attenuating a signal | |
CN110838300B (en) | Echo cancellation processing method and processing system | |
CN110149453B (en) | Gain control system and method for dynamically tuning an echo canceller | |
CN106713570B (en) | Echo cancellation method and device | |
CN109716743B (en) | Full duplex voice communication system and method | |
US9319783B1 (en) | Attenuation of output audio based on residual echo | |
CN109273019B (en) | Method for double-talk detection for echo suppression and echo suppression | |
CN105577961A (en) | Automatic tuning of a gain controller | |
CN110956975B (en) | Echo cancellation method and device | |
CN110995951B (en) | Echo cancellation method, device and system based on double-end sounding detection | |
US6785382B2 (en) | System and method for controlling a filter to enhance speakerphone performance | |
CN106571147A (en) | Method for suppressing acoustic echo of network telephone | |
US8064966B2 (en) | Method of detecting a double talk situation for a “hands-free” telephone device | |
CN110634496A (en) | Double-talk detection method and device, computer equipment and storage medium | |
CN111524532A (en) | Echo suppression method, device, equipment and storage medium | |
CN111970610B (en) | Echo path detection method, audio signal processing method and system, storage medium, and terminal | |
CN111091846B (en) | Noise reduction method and echo cancellation system applying same | |
WO2019169272A1 (en) | Enhanced barge-in detector | |
KR101055793B1 (en) | Acoustic Echo Cancellation Using Segment Conditions in the Frequency Domain | |
CN111970410B (en) | Echo cancellation method and device, storage medium and terminal | |
CN113225442B (en) | Method and device for eliminating echo | |
KR101394504B1 (en) | Apparatus and method for adaptive noise processing | |
Wada et al. | Towards robust acoustic echo cancellation during double-talk and near-end background noise via enhancement of residual echo | |
WO2022202012A1 (en) | Echo suppressing device, echo suppressing method, and echo suppressing program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||