CN111091846B - Noise reduction method and echo cancellation system applying same - Google Patents

Noise reduction method and echo cancellation system applying same Download PDF

Info

Publication number
CN111091846B
CN111091846B CN201911364046.0A CN201911364046A CN111091846B CN 111091846 B CN111091846 B CN 111091846B CN 201911364046 A CN201911364046 A CN 201911364046A CN 111091846 B CN111091846 B CN 111091846B
Authority
CN
China
Prior art keywords
pow
threshold
far
end sound
max
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911364046.0A
Other languages
Chinese (zh)
Other versions
CN111091846A (en
Inventor
江亨湖
周鑫
刘友华
丁卓群
李晨煕
伍仁库
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201911364046.0A priority Critical patent/CN111091846B/en
Publication of CN111091846A publication Critical patent/CN111091846A/en
Application granted granted Critical
Publication of CN111091846B publication Critical patent/CN111091846B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02082Noise filtering the noise being echo, reverberation of the speech

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)

Abstract

The invention belongs to the technical field of echo cancellation, and particularly relates to a noise reduction method and an echo cancellation system applying the same. The noise reduction method comprises the following steps: acquiring a burr threshold pow _ threshold according to the far-end sound signal; data w in the estimated sound signal smaller than the spur threshold pow threshold are attenuated. In the above technical solution, the spur threshold value pow _ threshold obtained based on the far-end sound signal can better evaluate whether the data in the estimated sound signal contains residual spur noise, so as to weaken the corresponding data w to remove the spur noise.

Description

Noise reduction method and echo cancellation system applying same
Technical Field
The invention belongs to the technical field of echo cancellation, and particularly relates to a noise reduction method and an echo cancellation system applying the same.
Background
The basic principles of current acoustic echo cancellation are: the method comprises the steps of using a self-adaptive filter to carry out parameter identification on an unknown echo channel omega, establishing a far-end signal model based on the correlation between a loudspeaker signal and generated multi-channel echoes, simulating an echo path, and adjusting through a self-adaptive algorithm NLMS to enable the impulse response of the echo channel to be approximate to a real echo path. Then, the signal received by the microphone is subtracted by the estimated value, and the echo cancellation function can be realized. However, after the acoustic echo cancellation uses the NLMS algorithm to calculate the preliminary effect, there is often some spur noise.
The invention patent application of application publication No. CN105791611A, 2016, 7, 20 discloses an echo cancellation method and device. Whether the participating signals meet the preset output condition or not is detected, and when the participating signals do not meet the preset output condition, the residual signals are multiplied by the first attenuation factor to obtain output signals, so that the electronic equipment can further attenuate and output when detecting that the residual signals still contain stronger echo signals, and the problem that the residual signals obtained by subtracting the estimation signals from the near-end signals still contain stronger echo signals and influence the channel quality due to the fact that the estimation signals obtained by estimating the far-end signals by the NLMS algorithm are inaccurate is solved. However, in the technical scheme, the correlation between the far-end sound signal and the near-end sound needs to be evaluated to judge whether the near-end signal has an echo signal, so that the algorithm is complex to calculate, and the calculation requirement on the processor is high.
Disclosure of Invention
In order to solve the above technical problem, the present invention provides a noise reduction method, including:
acquiring a burr threshold pow _ threshold according to the far-end sound signal;
data w in the estimated sound signal smaller than the spur threshold pow threshold are attenuated.
In the above technical solution, the burr threshold pow _ threshold obtained based on the far-end sound signal can better evaluate whether the data in the sound signal contains residual burr noise, so as to weaken the corresponding data w to remove the burr noise.
Further, the data w in the impairment estimation sound signal smaller than the spur threshold pow _ threshold refers to: let w = w × coef.
Further, the obtaining the spur threshold pow _ threshold according to the far-end sound signal includes: calculating the maximum energy far _ max _ pow of the far-end sound according to the far-end sound signal; and adjusting a base burr threshold pow _ base _ threshold according to the far-end sound maximum energy far _ max _ pow to obtain the burr threshold pow _ threshold. Therefore, different burr thresholds are adopted for the far-end sound signals with different intensities to determine the data needing to be weakened to remove noise in the estimation signal, and the estimation sound signal distortion is avoided.
Preferably, the adjusting the base spike threshold value pow _ base _ threshold according to the far-end sound maximum energy far _ max _ pow to obtain the spike threshold value pow _ threshold means: pow _ threshold equals k times pow _ base _ threshold if far _ max _ pow is greater than the first threshold; otherwise, pow _ threshold equals pow _ base _ threshold.
Further, the calculating the far-end sound maximum energy far _ max _ pow according to the far-end sound signal includes: calculating the energy of each frame signal in the far-end sound signal; and taking the maximum value in the energy of each frame signal of the far-end sound signal as the maximum far-end sound energy far _ max _ pow.
Preferably, the reduction coefficient coef =2^ (ln (far _ max _ pow/near _ max _ pow)); wherein near _ max _ pow refers to the maximum energy of the near-end sound.
Further, the obtaining of the near-end sound maximum energy near _ max _ pow includes: calculating the energy of each frame signal in the near-end sound signal; and taking the maximum value in the energy of each frame signal of the near-end sound signal as the maximum energy near _ max _ pow of the far-end sound.
Preferably, the estimated sound is an output signal of an automatic echo cancellation process based on the far-end sound and the near-end sound.
Preferably, the automatic echo cancellation process uses an NLMS algorithm to perform echo cancellation.
The invention also provides an echo cancellation system, which is characterized in that: the denoising method according to any one of the above.
The invention has the following beneficial effects:
the far-end sound signal, the near-end sound signal and the estimation signal after the automatic echo cancellation processing are combined for calculation, the estimation signal is further subjected to burr cancellation, and the burr suppression method has low calculation complexity and good suppression effect on burrs.
Drawings
Fig. 1 is a schematic application diagram of an echo cancellation system according to an embodiment of the present invention.
Detailed Description
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. Unless otherwise defined, all terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. It will be further understood that the conventional terms should be interpreted as having a meaning that is consistent with their meaning in the relevant art and this disclosure. The present disclosure is to be considered as an example of the invention and is not intended to limit the invention to the specific embodiments.
Example one
The basic principle of current acoustic echo cancellation is: the method comprises the steps of using a self-adaptive filter to carry out parameter identification on an unknown echo channel omega, establishing a far-end signal model based on the correlation between a loudspeaker signal and generated multi-channel echoes, simulating an echo path, and adjusting through a self-adaptive algorithm NLMS to enable the impulse response of the echo channel to be approximate to a real echo path. Then, the estimated value is subtracted from the signal received by the microphone, and the echo cancellation function can be realized. However, after the acoustic echo cancellation uses the NLMS algorithm to calculate the preliminary effect, there is often some spur noise.
The glitch noise is particularly likely to occur when the near-end sound is large and the far-end sound is small, and the glitch noise is often correlated with the near-end sound. The embodiment provides an echo cancellation system, which can perform noise reduction processing on a signal after an NLMS algorithm to eliminate spur noise.
As shown in fig. 1, the echo cancellation system of the present embodiment can be applied to two electronic devices (a first electronic device and a second electronic device) talking with each other, and is used for canceling an echo signal during communication of the electronic devices. In the conversation process, the electronic equipment inputs a far-end voice signal and a near-end voice signal into an Automatic Echo Cancellation (AEC) module of the electronic equipment, the far-end voice signal is a voice signal sent by another electronic equipment in conversation with the electronic equipment, and the near-end voice signal is a voice signal collected by a microphone of the electronic equipment. In this embodiment, the AEC module performs automatic echo cancellation processing on the input near-end speech signal according to the input far-end speech signal by using a normalized least mean square adaptive filtering (NLMS) algorithm, outputs an estimated speech signal out to the noise reduction module after performing automatic echo cancellation processing on the input near-end speech signal, and transmits the estimated speech signal out to another electronic device in conversation with the electronic device after performing noise reduction processing.
The noise reduction module is mainly used for carrying out noise reduction processing on the estimated voice signal output after the automatic echo elimination processing so as to further eliminate the burr noise. The noise reduction method mainly comprises the following steps:
first, a spur threshold pow _ threshold is obtained from the far-end sound signal. The noise reduction module stores a base spike threshold value pow _ base _ threshold set by a system default or a user, and the pow _ base _ threshold in this embodiment may be set to 300 by the system default, or may be set by the user in a range of 300-. The noise reduction module calculates a far-end sound maximum energy far _ max _ pow according to the far-end sound signal, and adjusts a base burr threshold pow _ base _ threshold based on the calculated far-end sound maximum energy far _ max _ pow to determine a burr threshold pow _ threshold:
if far _ max _ pow is greater than the first threshold, pow _ threshold is equal to k times pow _ base _ threshold;
otherwise, pow _ threshold equals pow _ base _ threshold.
The first threshold is a default value of the system stored in the noise reduction module, or the first threshold may be set empirically by the user. For example, the first threshold in this embodiment is a system default value of 1000. K is a system default value greater than 1 stored in the noise reduction module, or may be an empirical value greater than 1 set by the user. For example, k in the present embodiment is a system default value of 2. Therefore, when the base spur threshold pow _ base _ threshold =300 in the present embodiment: if the far-end sound maximum energy far _ max _ pow is 2000, the spur threshold pow _ threshold = pow _ base _ threshold × 2= 600; if the far-end sound maximum energy far _ max _ pow is 800, the spur threshold pow _ threshold = pow _ base _ threshold = 300.
Second, data w in the estimated sound signal that is less than the spur threshold pow _ threshold is attenuated. The noise reduction module compares each data in the estimated sound signal out output by the AEC module with pow _ threshold, if the value of the data is smaller than the burr threshold, the data is regarded as burr data or background noise data, and the data w is subjected to noise reduction processing according to the formula w = w \ coef; if the value of the data is greater than the glitch threshold then no puncturing is performed. The attenuation coefficient coef is calculated by the formula:
coef=2^(ln(far_max_pow/near_max_pow))
far _ max _ pow is the maximum energy of the far-end sound calculated from the far-end sound signal;
near _ max _ pow is the near-end sound maximum energy calculated from the near-end sound signal.
Preferably, the method for calculating the maximum energy far _ max _ pow of the far-end sound in this embodiment is as follows:
(a) and calculating the energy of each frame signal in the far-end sound signal. The present embodiment calculates the far-end sound maximum energy based on the far-end sound signal within the last 100 ms. In this embodiment, the far-end sound signal is divided into a total of 5 frames at every 20ms as one frame (i.e., one processing unit). Each frame of the far-end sound signal is sampled (8 k sampling rate), and the data type of each sampling point is short type and ranges from-32768 to 32768.
The far-end sound energy far _ pow = sqrt (mean (w (0: end) ^2) of each frame is calculated to obtain 5 far-end sound energy values.
(b) The maximum value of the energy of each frame signal of the far-end sound signal is taken as the far-end sound maximum energy far _ max _ pow. For example, in the present embodiment, the maximum value among the energies of 5 frame signals within 100ms is used as the far-end sound maximum energy far _ max _ pow.
Similarly, the method for calculating the maximum energy far _ max _ pow of the far-end sound in this embodiment is as follows:
(a) the energy of each frame signal in the near-end sound signal is calculated. In the present embodiment, the near-end sound maximum energy is calculated from the near-end sound signal within the last 100ms during the call. In this embodiment, a total of 5 frames of near-end sound signals are divided into one frame (i.e., one processing unit) every 20 ms. Each frame of the near-end sound signal is sampled (8 k sampling rate), and the data type of each sampling point is short type and ranges from-32768 to 32768.
Near-end sound energy near _ pow = sqrt (mean (w (0: end) ^2) of each frame, 5 near-end sound energy values are calculated.
(b) The maximum value among the energies of the respective frame signals of the near-end sound signal is taken as the near-end sound maximum energy near _ max _ pow. For example, in the present embodiment, the maximum value among the energies of 5 frame signals within 100ms is taken as the near-end sound maximum energy near _ max _ pow.
Although embodiments of the present invention have been described, various changes or modifications may be made by one of ordinary skill in the art within the scope of the appended claims.

Claims (6)

1. A method of noise reduction, comprising:
acquiring a burr threshold pow _ threshold according to the far-end sound signal;
attenuating data w in the estimated sound signal that is less than the spur threshold pow threshold;
the obtaining of the spur threshold value pow _ threshold according to the far-end sound signal includes:
calculating the maximum energy far _ max _ pow of the far-end sound according to the far-end sound signal;
adjusting a base burr threshold pow _ base _ threshold according to the far-end sound maximum energy far _ max _ pow to obtain the burr threshold pow _ threshold; the adjusting a base spur threshold value pow _ base _ threshold according to the far-end sound maximum energy far _ max _ pow to obtain the spur threshold value pow _ threshold means:
pow _ threshold equals k times pow _ base _ threshold if far _ max _ pow is greater than the first threshold; otherwise, pow _ threshold is equal to pow _ base _ threshold;
the estimated sound signal is an output signal that is subjected to automatic echo cancellation processing based on the far-end sound and the near-end sound.
2. A method of reducing noise according to claim 1, wherein the data w in the de-emphasized estimated acoustic signal that is less than the spur threshold pow _ threshold is:
let w = w × coef;
coef=2^(ln(far_max_pow/near_max_pow));
wherein near _ max _ pow refers to the maximum energy of the near-end sound.
3. The method of claim 1, wherein the calculating the far-end sound maximum energy far _ max _ pow according to the far-end sound signal comprises:
calculating the energy of each frame signal in the far-end sound signal;
and taking the maximum value in the energy of each frame signal of the far-end sound signal as the maximum far-end sound energy far _ max _ pow.
4. A method of noise reduction according to claim 2, wherein said obtaining of the near-end sound maximum energy near _ max _ pow comprises:
calculating the energy of each frame signal in the near-end sound signal;
and taking the maximum value in the energy of each frame signal of the near-end sound signal as the near-end sound maximum energy near _ max _ pow.
5. A method of noise reduction according to claim 1, characterized by:
the automatic echo cancellation process adopts NLMS algorithm to perform echo cancellation process.
6. An echo cancellation system, characterized by:
-using the noise reduction method according to any of claims 1 to 5.
CN201911364046.0A 2019-12-26 2019-12-26 Noise reduction method and echo cancellation system applying same Active CN111091846B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911364046.0A CN111091846B (en) 2019-12-26 2019-12-26 Noise reduction method and echo cancellation system applying same

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911364046.0A CN111091846B (en) 2019-12-26 2019-12-26 Noise reduction method and echo cancellation system applying same

Publications (2)

Publication Number Publication Date
CN111091846A CN111091846A (en) 2020-05-01
CN111091846B true CN111091846B (en) 2022-07-26

Family

ID=70397375

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911364046.0A Active CN111091846B (en) 2019-12-26 2019-12-26 Noise reduction method and echo cancellation system applying same

Country Status (1)

Country Link
CN (1) CN111091846B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1953060A (en) * 2006-11-24 2007-04-25 北京中星微电子有限公司 Echo elimination device for microphone and method thereof
CN101268624A (en) * 2005-09-20 2008-09-17 艾利森电话股份有限公司 Method and test signal for measuring speech intelligibility
CN102708874A (en) * 2011-03-03 2012-10-03 微软公司 Noise adaptive beamforming for microphone arrays
CN103888107A (en) * 2014-03-21 2014-06-25 天地融科技股份有限公司 Data decoding method
CN104395957A (en) * 2012-04-30 2015-03-04 创新科技有限公司 A universal reconfigurable echo cancellation system
CN104980600A (en) * 2014-04-02 2015-10-14 想象技术有限公司 Auto-tuning Of Non-linear Processor Threshold
CN105791611A (en) * 2016-02-22 2016-07-20 腾讯科技(深圳)有限公司 Echo cancellation method and device

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130332156A1 (en) * 2012-06-11 2013-12-12 Apple Inc. Sensor Fusion to Improve Speech/Audio Processing in a Mobile Device
GB2527865B (en) * 2014-10-30 2016-12-14 Imagination Tech Ltd Controlling operational characteristics of an acoustic echo canceller

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101268624A (en) * 2005-09-20 2008-09-17 艾利森电话股份有限公司 Method and test signal for measuring speech intelligibility
CN1953060A (en) * 2006-11-24 2007-04-25 北京中星微电子有限公司 Echo elimination device for microphone and method thereof
CN102708874A (en) * 2011-03-03 2012-10-03 微软公司 Noise adaptive beamforming for microphone arrays
CN104395957A (en) * 2012-04-30 2015-03-04 创新科技有限公司 A universal reconfigurable echo cancellation system
CN103888107A (en) * 2014-03-21 2014-06-25 天地融科技股份有限公司 Data decoding method
CN104980600A (en) * 2014-04-02 2015-10-14 想象技术有限公司 Auto-tuning Of Non-linear Processor Threshold
CN105791611A (en) * 2016-02-22 2016-07-20 腾讯科技(深圳)有限公司 Echo cancellation method and device

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"Statistical analysis of doubletalk detection for calibration and performance evaluation";James D. Gordy等;《 IEEE Transactions on Audio, Speech, and Language Processing》;20070220;第15卷(第3期);全文 *
"基于WebRTC的可视对讲系统的研究与实现";何青山;《中国优秀硕士学位论文全文数据库》;20161231;全文 *

Also Published As

Publication number Publication date
CN111091846A (en) 2020-05-01

Similar Documents

Publication Publication Date Title
US9443528B2 (en) Method and device for eliminating echoes
CN110225214B (en) Method, attenuation unit, system and medium for attenuating a signal
CN110838300B (en) Echo cancellation processing method and processing system
CN110149453B (en) Gain control system and method for dynamically tuning an echo canceller
CN106713570B (en) Echo cancellation method and device
CN109716743B (en) Full duplex voice communication system and method
US9319783B1 (en) Attenuation of output audio based on residual echo
CN109273019B (en) Method for double-talk detection for echo suppression and echo suppression
CN105577961A (en) Automatic tuning of a gain controller
CN110956975B (en) Echo cancellation method and device
CN110995951B (en) Echo cancellation method, device and system based on double-end sounding detection
US6785382B2 (en) System and method for controlling a filter to enhance speakerphone performance
CN106571147A (en) Method for suppressing acoustic echo of network telephone
US8064966B2 (en) Method of detecting a double talk situation for a “hands-free” telephone device
CN110634496A (en) Double-talk detection method and device, computer equipment and storage medium
CN111524532A (en) Echo suppression method, device, equipment and storage medium
CN111970610B (en) Echo path detection method, audio signal processing method and system, storage medium, and terminal
CN111091846B (en) Noise reduction method and echo cancellation system applying same
WO2019169272A1 (en) Enhanced barge-in detector
KR101055793B1 (en) Acoustic Echo Cancellation Using Segment Conditions in the Frequency Domain
CN111970410B (en) Echo cancellation method and device, storage medium and terminal
CN113225442B (en) Method and device for eliminating echo
KR101394504B1 (en) Apparatus and method for adaptive noise processing
Wada et al. Towards robust acoustic echo cancellation during double-talk and near-end background noise via enhancement of residual echo
WO2022202012A1 (en) Echo suppressing device, echo suppressing method, and echo suppressing program

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant