CN112750447B - Method for removing wind noise - Google Patents
Method for removing wind noise Download PDFInfo
- Publication number
- CN112750447B CN112750447B CN202011504399.9A CN202011504399A CN112750447B CN 112750447 B CN112750447 B CN 112750447B CN 202011504399 A CN202011504399 A CN 202011504399A CN 112750447 B CN112750447 B CN 112750447B
- Authority
- CN
- China
- Prior art keywords
- noise
- wind noise
- signal
- value
- removing wind
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 27
- 238000001914 filtration Methods 0.000 claims description 8
- 238000001228 spectrum Methods 0.000 claims description 8
- 238000009432 framing Methods 0.000 claims description 3
- 230000005236 sound signal Effects 0.000 claims description 2
- 238000010586 diagram Methods 0.000 description 2
- XUIMIQQOPSSXEZ-UHFFFAOYSA-N Silicon Chemical compound [Si] XUIMIQQOPSSXEZ-UHFFFAOYSA-N 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 229910052710 silicon Inorganic materials 0.000 description 1
- 239000010703 silicon Substances 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
The invention relates to a method for removing wind noise, on one hand, frame-level channel energy is used for selecting, and signals with small noise interference of each channel are selected, so that compromise caused by beam forming is avoided; on the other hand, the low-frequency band noise component is corrected by using the PR, so that the noise interference is removed more accurately.
Description
Technical Field
The invention relates to the technical field of voice recognition, in particular to a method for removing wind noise.
Background
In outdoor voice product application, wind noise is often generated easily, the wind noise is often very strong in energy, a voice spectrum structure can be submerged, and the voice listening feeling and the intelligibility are greatly reduced.
Disclosure of Invention
The invention provides a method for removing wind noise, which can solve the technical problem of 2mic wind noise.
The technical scheme for solving the technical problems is as follows:
according to an aspect of an embodiment of the present invention, there is provided a method of removing wind noise, including:
firstly, transforming a first path of input signal Y1 (t) and a second path of input signal Y2 (t) to a frequency domain Y1 (l, k), Y2 (l, k) through a framing/windowing/FFT module;
secondly, calculating the minimum value of the frequency spectrums of the two paths of signals and the phase angle of the first path of signals through a frame-level signal selection module to obtain selected signals;
secondly, calculating a difference sum ratio by using a module PR (l, k);
thirdly, estimating steady-state noise V (l, k) through a steady-state noise tracking module;
step four, selectively correcting noise through a noise correcting module;
a fifth step of determining a final noise estimate based on the noise stage PR (l, k) value;
and sixthly, filtering by using a filter through a wiener to obtain the audio signal with the wind noise removed.
Preferably, the calculating of the minimum value of the frequency spectrums of the two paths of signals is as follows: i Ys (l, k) | = min (| Y1 (l, k) |, | Y2 (l, k) |).
Preferably, the phase angle of the first path of signal is: θ (l, k) = angle (Y1 (l, k)).
Preferably, after said obtainingThe signals of (a) are: ys (l, k) = | Ys (l, k) |. Exp (i · θ (l, k)), wherein
Preferably, the fourth step comprises: when the value of the signal stage PR (l, k) is lower than a specific value, noise correction is not carried out; when the signal phase PR (l, k) value is higher than a specific value, noise correction is performed.
Preferably, the particular value is 0.1667.
Preferably, the fifth step comprises, when the first condition is met, reevaluating the noise and obtaining a final noise estimate; when the first condition is not satisfied, noise correction is not performed.
Preferably, the first condition is that the spectrum is below the 3kHz band interval and PR (l, k) is greater than the signal deviation 0.1667.
wherein phi ss (l, k) are estimated using a decision directed DD method.
Therefore, on one hand, frame level channel energy competition selection is used, signals with small noise interference of each channel are selected, and compromise caused by beam forming is avoided; on the other hand, by correcting the low-band noise component using PR, noise interference is more accurately removed.
Drawings
Fig. 1 is a schematic flow chart of a method for removing wind noise according to an embodiment of the present invention.
Fig. 2 is a waveform diagram of outdoor 2mic actual data of the electric vehicle according to the embodiment of the invention.
Fig. 3 is a waveform diagram of data processed by the method for removing wind noise according to the embodiment of the invention in fig. 2.
Detailed Description
The principles and features of this invention are described below in conjunction with the following drawings, which are set forth to illustrate, but are not to be construed to limit the scope of the invention.
Wind noise suppression is classified into single-channel and multi-channel methods, and the multi-channel method generally uses beam forming and then cascading post-filtering. In actual speech there are usually two problems, one of which is: unlike the situation that the energy of signals received indoors is roughly equivalent, the noise energy received by two mics is sometimes greatly different due to extremely unbalanced and stable environment outdoors, for example, one mics has good tone quality, and the other mics has extremely strong noise, and at this time, the mixed signal obtained by using the beam forming technology is inferior to the channel with better quality in the original signal; the second step is as follows: a commonly used form of filter for post-filtering techniques isWhereinFor the prior snr, λ (l, k) =1 is the standard wiener filter, and since the wind noise is a fast-varying unsteady signal, the noise tracking algorithm usually cannot track effectively, and a compensation form has been proposedWherein PR (l, k) = Φ diff (l,k)/Φ sum (l,k).,
Ydiff(l,k)=Yi(l,k)-Yj(l,k),Ysum(l,k)=Yi(l,k)+Yj(l,k),
Φdiff(l,k)=E{|Ydiff(l,k)| 2 },Φsum(l,k)=E{|Ysum(l,k)| 2 }.
Because wind noise is mainly distributed at low frequency, coherence among channels is weak, difference and energy ratio of wind noise signals are high, and the low frequency correlation of voice is strong, the difference and energy ratio is low, the method can be used as a compensation form of wind noise post-filtering, more noise is removed during wind noise, but PR is associated with wind noise, accurate estimation of the wind noise ratio cannot be obtained, only an empirical value is obtained, rho in a formula is also set through experience, and performance is not stable under different environments.
To solve the above problem, a method for removing wind noise according to an embodiment of the present invention is shown in fig. 1. The method comprises the following specific steps:
firstly, converting two input signals Y1 (t) and Y2 (t) into frequency domains Y1 (l, k) and Y2 (l, k) through a framing/windowing/FFT module, wherein l is a frame and k is a frequency point;
secondly, calculating two paths of signal spectrum minimum values | Ys (l, k) | = min (| Y1 (l, k) |, | Y2 (l, k) |) and a phase angle θ (l, k) = angle (Y1 (l, k)) of the first path of signal through a frame level signal selection module to obtain a selected signal Ys (l, k) = | Ys (l, k) | exp (i · θ (l, k)), wherein
Second, the difference sum ratio is calculated. The difference sum ratio is calculated using a module PR (l, k), where PR (l, k) = Φ diff (l, k)/Φ sum (l, k).
And thirdly, estimating the steady-state noise V (l, k) through a steady-state noise tracking module.
The module can use the module described in Rainer Martin, noise power estimation based on optimal smoothing and minimum statistics, IEEE Trans, speech and Audio Processing,9 (5): 504-512, july 2001, which will not be described in detail herein.
And fourthly, selectively correcting the noise through a noise correction module.
The microphone usually cannot achieve ideal consistency, i.e. the difference of the signals is not 0, the energy deviation of the silicon microphone is generally within 1dB, the polar microphone is within 3dB, the signal phase PR (l, k) calculated according to the polar microphone is at most 0.1667, and no noise correction should be made when PR (l, k) is lower than the specific value, so as to avoid damaging the voice.
In some embodiments, the particular value may be set to 0.2.
In a fifth step, a final noise estimate is determined based on the noise stage PR (l, k) values.
The noise stage PR (l, k) value may be greater than 1, and if greater than 1 occurs, the constraint is forced to be 1, and in addition, since wind noise is concentrated below 3kHz, the spectrum above 3kHz does not need to be corrected for noise. In summary, in the frequency band interval below 3kHz, when PR (l, k) is larger than the signal deviation 0.1667, the noise is reestimatedFinal noise estimation
Sixth, wiener filtering is used by the filter:
wherein phi ss (l, k) are estimated using a decision directed DD method.
According to the scheme, the problem of delay alignment between mics is not considered, errors caused by small low-frequency delay deviation are not large when the mics are small, but if the mics are large, the errors are increased, so that the PR value of the signal is increased, and noise overestimation is caused. Therefore, in some embodiments, a delay estimation module may be added in the previous stage to align the inter-mic signals.
Therefore, on one hand, frame-level channel energy match selection is used, signals with small noise interference of each channel are selected, and compromise caused by beam forming is avoided; on the other hand, by correcting the low-band noise component using PR, noise interference is more accurately removed. From the comparison between fig. 2 and fig. 3, it can be seen that the noise interference can be more accurately removed by using the method for removing wind noise of the present invention.
The method for removing wind noise provided by the embodiment of the invention can be realized in the form of a software functional module, can be sold or used as an independent product, and can be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
While the invention has been described with reference to specific embodiments, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.
Claims (7)
1. A method of removing wind noise, comprising:
firstly, converting a first path of input signals Y1 (t) and a second path of input signals Y2 (t) into frequency domains Y1 (l, k) and Y2 (l, k) through a framing/windowing/FFT module, wherein l is a frame and k is a frequency point;
secondly, calculating the minimum value of the frequency spectrums of the two paths of signals through a frame level signal selection module: i Ys (l, k) | = min (| Y1 (l, k) |, | Y2 (l, k) |), and the phase angle of the first signal: θ (l, k) = angle (Y1 (l, k)), a signal selected by comparison is obtained: ys (l, k) = | Ys (l, k) |. Exp (i · θ (l, k)), where
Third, the sum of differences is calculated using a module PR (l, k), where PR (l, k) = Φ diff (l, k)/Φ sum (l, k), ydiff (l, k) = Yi (l, k) -Yj (l, k), ysum (l, k) = Yi (l, k) + Yj (l, k), Φ diff (l, k) = E { | Ydiff (l, k) } 2 },Φsum(l,k)=E{|Ysum(l,k)| 2 };
Fourthly, estimating steady-state noise V (l, k) through a steady-state noise tracking module;
fifthly, selectively correcting noise through a noise correction module;
a sixth step of determining a final noise estimate based on the noise stage PR (l, k) value;
and seventhly, filtering by using a filter through a wiener to obtain the audio signal with the wind noise removed.
2. The method of removing wind noise according to claim 1,
the fifth step includes:
when the value of the signal stage PR (l, k) is lower than a specific value, noise correction is not carried out;
when the value of the signal period PR (l, k) is higher than a certain value, noise correction is performed.
3. The method of removing wind noise according to claim 2,
the particular value is 0.1667.
4. The method of removing wind noise according to claim 1,
the sixth step comprises, when the first condition is satisfied, reestimating the noise and obtaining a final noise estimate;
when the first condition is not satisfied, noise correction is not performed.
5. The method of removing wind noise according to claim 4,
the first condition is that the spectrum is below the 3kHz band interval and PR (l, k) is greater than the signal deviation 0.1667.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011504399.9A CN112750447B (en) | 2020-12-17 | 2020-12-17 | Method for removing wind noise |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011504399.9A CN112750447B (en) | 2020-12-17 | 2020-12-17 | Method for removing wind noise |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112750447A CN112750447A (en) | 2021-05-04 |
CN112750447B true CN112750447B (en) | 2023-01-24 |
Family
ID=75648559
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011504399.9A Active CN112750447B (en) | 2020-12-17 | 2020-12-17 | Method for removing wind noise |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112750447B (en) |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7171008B2 (en) * | 2002-02-05 | 2007-01-30 | Mh Acoustics, Llc | Reducing noise in audio systems |
WO2007059255A1 (en) * | 2005-11-17 | 2007-05-24 | Mh Acoustics, Llc | Dual-microphone spatial noise suppression |
US9357307B2 (en) * | 2011-02-10 | 2016-05-31 | Dolby Laboratories Licensing Corporation | Multi-channel wind noise suppression system and method |
CN105792071B (en) * | 2011-02-10 | 2019-07-05 | 杜比实验室特许公司 | The system and method for detecting and inhibiting for wind |
US10127919B2 (en) * | 2014-11-12 | 2018-11-13 | Cirrus Logic, Inc. | Determining noise and sound power level differences between primary and reference channels |
GB2555139A (en) * | 2016-10-21 | 2018-04-25 | Nokia Technologies Oy | Detecting the presence of wind noise |
-
2020
- 2020-12-17 CN CN202011504399.9A patent/CN112750447B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN112750447A (en) | 2021-05-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9502048B2 (en) | Adaptively reducing noise to limit speech distortion | |
US20200342887A1 (en) | Microphone array-based target voice acquisition method and device | |
US9558755B1 (en) | Noise suppression assisted automatic speech recognition | |
US9431023B2 (en) | Monaural noise suppression based on computational auditory scene analysis | |
KR100304666B1 (en) | Speech enhancement method | |
JP3454206B2 (en) | Noise suppression device and noise suppression method | |
US20140025374A1 (en) | Speech enhancement to improve speech intelligibility and automatic speech recognition | |
JPH08221094A (en) | Method and device for reducing noise in voice signals | |
Mirsamadi et al. | Causal speech enhancement combining data-driven learning and suppression rule estimation. | |
CN103632677A (en) | Method and device for processing voice signal with noise, and server | |
CN110782914A (en) | Signal processing method and device, terminal equipment and storage medium | |
CN113539285A (en) | Audio signal noise reduction method, electronic device, and storage medium | |
CN112530451A (en) | Speech enhancement method based on denoising autoencoder | |
Naik et al. | A literature survey on single channel speech enhancement techniques | |
US9373341B2 (en) | Method and system for bias corrected speech level determination | |
CN107045874B (en) | Non-linear voice enhancement method based on correlation | |
CN114242104A (en) | Method, device and equipment for voice noise reduction and storage medium | |
CN112634927B (en) | Short wave channel voice enhancement method | |
US8700391B1 (en) | Low complexity bandwidth expansion of speech | |
US9177566B2 (en) | Noise suppression method and apparatus | |
CN112750447B (en) | Method for removing wind noise | |
Rao et al. | Speech enhancement using sub-band cross-correlation compensated Wiener filter combined with harmonic regeneration | |
CN111933169B (en) | Voice noise reduction method for secondarily utilizing voice existence probability | |
CN110168640A (en) | For enhancing the device and method for needing component in signal | |
CN110444222B (en) | Voice noise reduction method based on information entropy weighting |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |