CN112750447A - Method for removing wind noise - Google Patents
Method for removing wind noise Download PDFInfo
- Publication number
- CN112750447A CN112750447A CN202011504399.9A CN202011504399A CN112750447A CN 112750447 A CN112750447 A CN 112750447A CN 202011504399 A CN202011504399 A CN 202011504399A CN 112750447 A CN112750447 A CN 112750447A
- Authority
- CN
- China
- Prior art keywords
- noise
- wind noise
- signal
- removing wind
- signals
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 30
- 238000001228 spectrum Methods 0.000 claims description 9
- 238000001914 filtration Methods 0.000 claims description 8
- 238000009432 framing Methods 0.000 claims description 3
- 230000005236 sound signal Effects 0.000 claims description 2
- 230000001131 transforming effect Effects 0.000 claims description 2
- 238000010586 diagram Methods 0.000 description 2
- XUIMIQQOPSSXEZ-UHFFFAOYSA-N Silicon Chemical compound [Si] XUIMIQQOPSSXEZ-UHFFFAOYSA-N 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 229910052710 silicon Inorganic materials 0.000 description 1
- 239000010703 silicon Substances 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
The invention relates to a method for removing wind noise, on one hand, frame level channel energy is used for selecting, signals with small noise interference of each channel are selected, and compromise caused by beam forming is avoided; on the other hand, the low-frequency band noise component is corrected by using the PR, and the noise interference is removed more accurately.
Description
Technical Field
The invention relates to the technical field of voice recognition, in particular to a method for removing wind noise.
Background
In outdoor voice product application, wind noise is often generated easily, the wind noise is often very strong in energy, a voice spectrum structure can be submerged, and the voice listening feeling and the intelligibility are greatly reduced.
Disclosure of Invention
The invention provides a method for removing wind noise, which can solve the technical problem of 2mic wind noise.
The technical scheme for solving the technical problems is as follows:
according to an aspect of an embodiment of the present invention, there is provided a method of removing wind noise, including:
firstly, transforming the input first path of signal Y1(t) and second path of signal Y2(t) to frequency domains Y1(l, k) and Y2(l, k) through a framing/windowing/FFT module;
secondly, calculating the minimum value of the frequency spectrums of the two paths of signals and the phase angle of the first path of signals through a frame-level signal selection module to obtain selected signals;
secondly, calculating a difference sum ratio by using a module PR (l, k);
thirdly, estimating steady-state noise V (l, k) through a steady-state noise tracking module;
fourthly, selectively correcting noise through a noise correction module;
a fifth step of determining a final noise estimate based on the noise stage PR (l, k) values;
and sixthly, filtering by using a filter through a wiener to obtain the audio signal with the wind noise removed.
Preferably, the calculating of the minimum value of the frequency spectrums of the two paths of signals is as follows: i Ys (l, k) | min (| Y1(l, k) |, | Y2(l, k) |).
Preferably, the phase angle of the first path of signal is: θ (l, k) ═ angle (Y1(l, k)).
Preferably, the fourth step comprises: when the value of the signal stage PR (l, k) is lower than a specific value, noise correction is not carried out; when the signal phase PR (l, k) value is higher than a specific value, noise correction is performed.
Preferably, the particular value is 0.1667.
Preferably, the fifth step comprises, when the first condition is satisfied, reestimating the noise and obtaining a final noise estimate; when the first condition is not satisfied, noise correction is not performed.
Preferably, the first condition is that the spectrum is below the 3kHz band interval and PR (l, k) is greater than the signal deviation 0.1667.
wherein phiss(l, k) are estimated using the DD method.
Therefore, on one hand, frame level channel energy competition selection is used, signals with small noise interference of each channel are selected, and compromise caused by beam forming is avoided; on the other hand, by correcting the low-band noise component using PR, noise interference is more accurately removed.
Drawings
Fig. 1 is a schematic flow chart of a method for removing wind noise according to an embodiment of the present invention.
Fig. 2 is a waveform diagram of 2mic actual data collected outside the electric vehicle according to the embodiment of the invention.
Fig. 3 is a waveform diagram of data processed by the method for removing wind noise according to the embodiment of the invention in fig. 2.
Detailed Description
The principles and features of this invention are described below in conjunction with the following drawings, which are set forth by way of illustration only and are not intended to limit the scope of the invention.
Wind noise suppression is classified into single-channel and multi-channel methods, and the multi-channel method generally uses beam forming and then cascading post-filtering. In actual speech there are usually two problems, one of which is: unlike the situation that the energy of signals received indoors is roughly equivalent, the noise energy received by two mics is sometimes greatly different due to extremely unbalanced and stable environment outdoors, for example, one mics has good tone quality, and the other mics has extremely strong noise, and at this time, the mixed signal obtained by using the beam forming technology is inferior to the channel with better quality in the original signal; the second step is as follows: the filter form commonly used in the post-filtering technique isWhereinFor a priori signal-to-noise ratio, λ (l, k)) When the wind noise is a fast-changing unsteady signal, a noise tracking algorithm cannot effectively track the wind noise, and a compensation mode is proposedWherein PR (l, k) ═ Φdiff(l,k)/φsum(l,k).,
Ydiff(l,k)=Yi(l,k)-Yj(l,k),Ysum(l,k)=Yi(l,k)+Yj(l,k),
Φdiff(l,k)=E{|Ydiff(l,k)|2},Φsum(l,k)=E{|Ysum(l,k)|2}.
Because wind noise is mainly distributed at low frequency, coherence among channels is weak, difference and energy ratio of wind noise signals are high, and the low frequency correlation of voice is strong, the difference and energy ratio is low, the method can be used as a compensation form of wind noise post-filtering, more noise is removed during wind noise, but PR is associated with wind noise, accurate estimation of the wind noise ratio cannot be obtained, only an empirical value is obtained, rho in a formula is also set through experience, and performance is not stable under different environments.
To solve the above problem, a method for removing wind noise according to an embodiment of the present invention is shown in fig. 1. The method comprises the following specific steps:
firstly, converting two input signals Y1(t) and Y2(t) into frequency domains Y1(l, k) and Y2(l, k) through a framing/windowing/FFT module, wherein l is a frame and k is a frequency point;
secondly, calculating two paths of signal spectrum minimum values | Ys (l, k) | min (| Y1(l, k) |, | Y2(l, k) |) and phase angle θ (l, k) | angle (Y1(l, k)) of the first path of signal through a frame-level signal selection module to obtain the selected signal Ys (l, k) | exp (i · θ (l, k)), wherein
Second, the difference sum ratio is calculated. The sum of differences ratio is calculated using a module PR (l, k), where PR (l, k) is Φ diff (l, k)/Φ sum (l, k).
And thirdly, estimating the steady-state noise V (l, k) by a steady-state noise tracking module.
The module can be the module described in rain Martin, noise power sensitivity estimation based on optimal smoothing and minimum statistics, IEEE Trans, Speech and Audio Processing,9(5), 504 and 512, July 2001, which will not be described herein.
And fourthly, selectively correcting the noise through a noise correction module.
The microphone usually cannot achieve ideal consistency, i.e. the difference of the signals is not 0, the energy deviation of the silicon microphone is generally within 1dB, the polar microphone is within 3dB, the signal phase PR (l, k) calculated according to the polar microphone is at most 0.1667, and no noise correction should be made when PR (l, k) is lower than the specific value, so as to avoid damaging the voice.
In some embodiments, the particular value may be set to 0.2.
In a fifth step, a final noise estimate is determined based on the noise stage PR (l, k) values.
The noise stage PR (l, k) value may be greater than 1, and if greater than 1 occurs, the constraint is forced to be 1, and in addition, since wind noise is concentrated below 3kHz, the spectrum above 3kHz does not need to be corrected for noise. In summary, in the frequency band interval below 3kHz, the noise is reestimated when PR (l, k) is greater than the signal deviation of 0.1667Final noise estimation
Sixth, wiener filtering is used by the filter:
wherein phiss(l, k) are estimated using the DD method.
According to the scheme, the problem of delay alignment between mics is not considered, errors caused by small low-frequency delay deviation are not large when the mics are small, but if the mics are large, the errors are increased, so that the PR value of the signal is increased, and noise overestimation is caused. Therefore, in some embodiments, a delay estimation module may be added in the previous stage to align the signals between mics.
Therefore, on one hand, frame level channel energy competition selection is used, signals with small noise interference of each channel are selected, and compromise caused by beam forming is avoided; on the other hand, by correcting the low-band noise component using PR, noise interference is more accurately removed. From a comparison of fig. 2 and fig. 3, it can be seen that noise interference can be more accurately removed by using the method for removing wind noise of the present invention.
The method for removing wind noise provided by the embodiment of the invention can be realized in the form of a software functional module, can be sold or used as an independent product, and can be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
While the invention has been described with reference to specific embodiments, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.
Claims (10)
1. A method of removing wind noise, comprising:
firstly, transforming the input first path of signal Y1(t) and second path of signal Y2(t) to frequency domains Y1(l, k) and Y2(l, k) through a framing/windowing/FFT module;
secondly, calculating the minimum value of the frequency spectrums of the two paths of signals and the phase angle of the first path of signals through a frame-level signal selection module to obtain selected signals;
secondly, calculating a difference sum ratio by using a module PR (l, k);
thirdly, estimating steady-state noise V (l, k) through a steady-state noise tracking module;
fourthly, selectively correcting noise through a noise correction module;
a fifth step of determining a final noise estimate based on the noise stage PR (l, k) values;
and sixthly, filtering by using a filter through a wiener to obtain the audio signal with the wind noise removed.
2. The method of removing wind noise according to claim 1,
the minimum value of the frequency spectrums of the two paths of signals is calculated as follows: i Ys (l, k) | min (| Y1(l, k) |, | Y2(l, k) |).
3. The method of removing wind noise according to claim 2,
the phase angle of the first path of signal is as follows: θ (l, k) ═ angle (Y1(l, k)).
5. The method of removing wind noise according to claim 1,
the fourth step includes:
when the value of the signal stage PR (l, k) is lower than a specific value, noise correction is not carried out;
when the signal phase PR (l, k) value is higher than a specific value, noise correction is performed.
6. The method of removing wind noise according to claim 5,
the particular value is 0.1667.
7. The method of removing wind noise according to claim 1,
fifthly, when the first condition is met, reestimating the noise and obtaining the final noise estimation;
when the first condition is not satisfied, noise correction is not performed.
8. The method of removing wind noise according to claim 7,
the first condition is that the spectrum is below the 3kHz band interval and PR (l, k) is greater than the signal deviation 0.1667.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011504399.9A CN112750447B (en) | 2020-12-17 | 2020-12-17 | Method for removing wind noise |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011504399.9A CN112750447B (en) | 2020-12-17 | 2020-12-17 | Method for removing wind noise |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112750447A true CN112750447A (en) | 2021-05-04 |
CN112750447B CN112750447B (en) | 2023-01-24 |
Family
ID=75648559
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011504399.9A Active CN112750447B (en) | 2020-12-17 | 2020-12-17 | Method for removing wind noise |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112750447B (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030147538A1 (en) * | 2002-02-05 | 2003-08-07 | Mh Acoustics, Llc, A Delaware Corporation | Reducing noise in audio systems |
WO2007059255A1 (en) * | 2005-11-17 | 2007-05-24 | Mh Acoustics, Llc | Dual-microphone spatial noise suppression |
US20120207325A1 (en) * | 2011-02-10 | 2012-08-16 | Dolby Laboratories Licensing Corporation | Multi-Channel Wind Noise Suppression System and Method |
CN103348686A (en) * | 2011-02-10 | 2013-10-09 | 杜比实验室特许公司 | System and method for wind detection and suppression |
US20160134984A1 (en) * | 2014-11-12 | 2016-05-12 | Cypher, Llc | Determining noise and sound power level differences between primary and reference channels |
CN109845289A (en) * | 2016-10-21 | 2019-06-04 | 诺基亚技术有限公司 | Detect the presence of wind noise |
-
2020
- 2020-12-17 CN CN202011504399.9A patent/CN112750447B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030147538A1 (en) * | 2002-02-05 | 2003-08-07 | Mh Acoustics, Llc, A Delaware Corporation | Reducing noise in audio systems |
WO2007059255A1 (en) * | 2005-11-17 | 2007-05-24 | Mh Acoustics, Llc | Dual-microphone spatial noise suppression |
US20120207325A1 (en) * | 2011-02-10 | 2012-08-16 | Dolby Laboratories Licensing Corporation | Multi-Channel Wind Noise Suppression System and Method |
CN103348686A (en) * | 2011-02-10 | 2013-10-09 | 杜比实验室特许公司 | System and method for wind detection and suppression |
US20160134984A1 (en) * | 2014-11-12 | 2016-05-12 | Cypher, Llc | Determining noise and sound power level differences between primary and reference channels |
CN109845289A (en) * | 2016-10-21 | 2019-06-04 | 诺基亚技术有限公司 | Detect the presence of wind noise |
Non-Patent Citations (1)
Title |
---|
乔健等: "抑制风噪声的频点离散值加权GCC-PHAT时延估计算法", 《电子技术应用》, no. 03, 6 March 2018 (2018-03-06) * |
Also Published As
Publication number | Publication date |
---|---|
CN112750447B (en) | 2023-01-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP3454206B2 (en) | Noise suppression device and noise suppression method | |
JP3453898B2 (en) | Method and apparatus for reducing noise of audio signal | |
US8521530B1 (en) | System and method for enhancing a monaural audio signal | |
JP5805365B2 (en) | Noise estimation apparatus and method, and noise reduction apparatus using the same | |
CN108922554B (en) | LCMV frequency invariant beam forming speech enhancement algorithm based on logarithmic spectrum estimation | |
RU2127454C1 (en) | Method for noise suppression | |
US9113241B2 (en) | Noise removing apparatus and noise removing method | |
KR100304666B1 (en) | Speech enhancement method | |
Lin et al. | Adaptive noise estimation algorithm for speech enhancement | |
US20140025374A1 (en) | Speech enhancement to improve speech intelligibility and automatic speech recognition | |
CN110634500B (en) | Method for calculating prior signal-to-noise ratio, electronic device and storage medium | |
CN110782914B (en) | Signal processing method and device, terminal equipment and storage medium | |
JPH07306695A (en) | Method of reducing noise in sound signal, and method of detecting noise section | |
WO2010052749A1 (en) | Noise suppression device | |
CN112530451A (en) | Speech enhancement method based on denoising autoencoder | |
CN112802490B (en) | Beam forming method and device based on microphone array | |
US9177566B2 (en) | Noise suppression method and apparatus | |
CN112750447B (en) | Method for removing wind noise | |
CN113257268B (en) | Noise reduction and single-frequency interference suppression method combining frequency tracking and frequency spectrum correction | |
CN111933169B (en) | Voice noise reduction method for secondarily utilizing voice existence probability | |
CN114242104A (en) | Method, device and equipment for voice noise reduction and storage medium | |
CN112565124B (en) | Distortion-free signal processing method based on overlapping windowing, storage medium and system | |
Gui et al. | Adaptive subband Wiener filtering for speech enhancement using critical-band gammatone filterbank | |
CN112533120B (en) | Beam forming method and device based on dynamic compression of noisy speech signal magnitude spectrum | |
CN110931038B (en) | Voice enhancement method, device, equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |