CN112750447B - Method for removing wind noise - Google Patents

Method for removing wind noise Download PDF

Info

Publication number
CN112750447B
CN112750447B CN202011504399.9A CN202011504399A CN112750447B CN 112750447 B CN112750447 B CN 112750447B CN 202011504399 A CN202011504399 A CN 202011504399A CN 112750447 B CN112750447 B CN 112750447B
Authority
CN
China
Prior art keywords
noise
wind noise
signal
value
removing wind
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011504399.9A
Other languages
Chinese (zh)
Other versions
CN112750447A (en
Inventor
关海欣
梁家恩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Unisound Intelligent Technology Co Ltd
Xiamen Yunzhixin Intelligent Technology Co Ltd
Original Assignee
Unisound Intelligent Technology Co Ltd
Xiamen Yunzhixin Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Unisound Intelligent Technology Co Ltd, Xiamen Yunzhixin Intelligent Technology Co Ltd filed Critical Unisound Intelligent Technology Co Ltd
Priority to CN202011504399.9A priority Critical patent/CN112750447B/en
Publication of CN112750447A publication Critical patent/CN112750447A/en
Application granted granted Critical
Publication of CN112750447B publication Critical patent/CN112750447B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The invention relates to a method for removing wind noise, on one hand, frame-level channel energy is used for selecting, and signals with small noise interference of each channel are selected, so that compromise caused by beam forming is avoided; on the other hand, the low-frequency band noise component is corrected by using the PR, so that the noise interference is removed more accurately.

Description

Method for removing wind noise
Technical Field
The invention relates to the technical field of voice recognition, in particular to a method for removing wind noise.
Background
In outdoor voice product application, wind noise is often generated easily, the wind noise is often very strong in energy, a voice spectrum structure can be submerged, and the voice listening feeling and the intelligibility are greatly reduced.
Disclosure of Invention
The invention provides a method for removing wind noise, which can solve the technical problem of 2mic wind noise.
The technical scheme for solving the technical problems is as follows:
according to an aspect of an embodiment of the present invention, there is provided a method of removing wind noise, including:
firstly, transforming a first path of input signal Y1 (t) and a second path of input signal Y2 (t) to a frequency domain Y1 (l, k), Y2 (l, k) through a framing/windowing/FFT module;
secondly, calculating the minimum value of the frequency spectrums of the two paths of signals and the phase angle of the first path of signals through a frame-level signal selection module to obtain selected signals;
secondly, calculating a difference sum ratio by using a module PR (l, k);
thirdly, estimating steady-state noise V (l, k) through a steady-state noise tracking module;
step four, selectively correcting noise through a noise correcting module;
a fifth step of determining a final noise estimate based on the noise stage PR (l, k) value;
and sixthly, filtering by using a filter through a wiener to obtain the audio signal with the wind noise removed.
Preferably, the calculating of the minimum value of the frequency spectrums of the two paths of signals is as follows: i Ys (l, k) | = min (| Y1 (l, k) |, | Y2 (l, k) |).
Preferably, the phase angle of the first path of signal is: θ (l, k) = angle (Y1 (l, k)).
Preferably, after said obtainingThe signals of (a) are: ys (l, k) = | Ys (l, k) |. Exp (i · θ (l, k)), wherein
Figure GDA0003954389380000021
Preferably, the fourth step comprises: when the value of the signal stage PR (l, k) is lower than a specific value, noise correction is not carried out; when the signal phase PR (l, k) value is higher than a specific value, noise correction is performed.
Preferably, the particular value is 0.1667.
Preferably, the fifth step comprises, when the first condition is met, reevaluating the noise and obtaining a final noise estimate; when the first condition is not satisfied, noise correction is not performed.
Preferably, the first condition is that the spectrum is below the 3kHz band interval and PR (l, k) is greater than the signal deviation 0.1667.
Preferably, said reestimating noise is implemented as:
Figure GDA0003954389380000022
the final noise estimate is implemented as:
Figure GDA0003954389380000023
preferably, the wiener filtering is implemented as:
Figure GDA0003954389380000024
wherein phi ss (l, k) are estimated using a decision directed DD method.
Therefore, on one hand, frame level channel energy competition selection is used, signals with small noise interference of each channel are selected, and compromise caused by beam forming is avoided; on the other hand, by correcting the low-band noise component using PR, noise interference is more accurately removed.
Drawings
Fig. 1 is a schematic flow chart of a method for removing wind noise according to an embodiment of the present invention.
Fig. 2 is a waveform diagram of outdoor 2mic actual data of the electric vehicle according to the embodiment of the invention.
Fig. 3 is a waveform diagram of data processed by the method for removing wind noise according to the embodiment of the invention in fig. 2.
Detailed Description
The principles and features of this invention are described below in conjunction with the following drawings, which are set forth to illustrate, but are not to be construed to limit the scope of the invention.
Wind noise suppression is classified into single-channel and multi-channel methods, and the multi-channel method generally uses beam forming and then cascading post-filtering. In actual speech there are usually two problems, one of which is: unlike the situation that the energy of signals received indoors is roughly equivalent, the noise energy received by two mics is sometimes greatly different due to extremely unbalanced and stable environment outdoors, for example, one mics has good tone quality, and the other mics has extremely strong noise, and at this time, the mixed signal obtained by using the beam forming technology is inferior to the channel with better quality in the original signal; the second step is as follows: a commonly used form of filter for post-filtering techniques is
Figure GDA0003954389380000031
Wherein
Figure GDA0003954389380000032
For the prior snr, λ (l, k) =1 is the standard wiener filter, and since the wind noise is a fast-varying unsteady signal, the noise tracking algorithm usually cannot track effectively, and a compensation form has been proposed
Figure GDA0003954389380000033
Wherein PR (l, k) = Φ diff (l,k)/Φ sum (l,k).,
Ydiff(l,k)=Yi(l,k)-Yj(l,k),Ysum(l,k)=Yi(l,k)+Yj(l,k),
Φdiff(l,k)=E{|Ydiff(l,k)| 2 },Φsum(l,k)=E{|Ysum(l,k)| 2 }.
Because wind noise is mainly distributed at low frequency, coherence among channels is weak, difference and energy ratio of wind noise signals are high, and the low frequency correlation of voice is strong, the difference and energy ratio is low, the method can be used as a compensation form of wind noise post-filtering, more noise is removed during wind noise, but PR is associated with wind noise, accurate estimation of the wind noise ratio cannot be obtained, only an empirical value is obtained, rho in a formula is also set through experience, and performance is not stable under different environments.
To solve the above problem, a method for removing wind noise according to an embodiment of the present invention is shown in fig. 1. The method comprises the following specific steps:
firstly, converting two input signals Y1 (t) and Y2 (t) into frequency domains Y1 (l, k) and Y2 (l, k) through a framing/windowing/FFT module, wherein l is a frame and k is a frequency point;
secondly, calculating two paths of signal spectrum minimum values | Ys (l, k) | = min (| Y1 (l, k) |, | Y2 (l, k) |) and a phase angle θ (l, k) = angle (Y1 (l, k)) of the first path of signal through a frame level signal selection module to obtain a selected signal Ys (l, k) = | Ys (l, k) | exp (i · θ (l, k)), wherein
Figure GDA0003954389380000034
Second, the difference sum ratio is calculated. The difference sum ratio is calculated using a module PR (l, k), where PR (l, k) = Φ diff (l, k)/Φ sum (l, k).
And thirdly, estimating the steady-state noise V (l, k) through a steady-state noise tracking module.
The module can use the module described in Rainer Martin, noise power estimation based on optimal smoothing and minimum statistics, IEEE Trans, speech and Audio Processing,9 (5): 504-512, july 2001, which will not be described in detail herein.
And fourthly, selectively correcting the noise through a noise correction module.
The microphone usually cannot achieve ideal consistency, i.e. the difference of the signals is not 0, the energy deviation of the silicon microphone is generally within 1dB, the polar microphone is within 3dB, the signal phase PR (l, k) calculated according to the polar microphone is at most 0.1667, and no noise correction should be made when PR (l, k) is lower than the specific value, so as to avoid damaging the voice.
In some embodiments, the particular value may be set to 0.2.
In a fifth step, a final noise estimate is determined based on the noise stage PR (l, k) values.
The noise stage PR (l, k) value may be greater than 1, and if greater than 1 occurs, the constraint is forced to be 1, and in addition, since wind noise is concentrated below 3kHz, the spectrum above 3kHz does not need to be corrected for noise. In summary, in the frequency band interval below 3kHz, when PR (l, k) is larger than the signal deviation 0.1667, the noise is reestimated
Figure GDA0003954389380000041
Final noise estimation
Figure GDA0003954389380000042
Sixth, wiener filtering is used by the filter:
Figure GDA0003954389380000043
wherein phi ss (l, k) are estimated using a decision directed DD method.
According to the scheme, the problem of delay alignment between mics is not considered, errors caused by small low-frequency delay deviation are not large when the mics are small, but if the mics are large, the errors are increased, so that the PR value of the signal is increased, and noise overestimation is caused. Therefore, in some embodiments, a delay estimation module may be added in the previous stage to align the inter-mic signals.
Therefore, on one hand, frame-level channel energy match selection is used, signals with small noise interference of each channel are selected, and compromise caused by beam forming is avoided; on the other hand, by correcting the low-band noise component using PR, noise interference is more accurately removed. From the comparison between fig. 2 and fig. 3, it can be seen that the noise interference can be more accurately removed by using the method for removing wind noise of the present invention.
The method for removing wind noise provided by the embodiment of the invention can be realized in the form of a software functional module, can be sold or used as an independent product, and can be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
While the invention has been described with reference to specific embodiments, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (7)

1. A method of removing wind noise, comprising:
firstly, converting a first path of input signals Y1 (t) and a second path of input signals Y2 (t) into frequency domains Y1 (l, k) and Y2 (l, k) through a framing/windowing/FFT module, wherein l is a frame and k is a frequency point;
secondly, calculating the minimum value of the frequency spectrums of the two paths of signals through a frame level signal selection module: i Ys (l, k) | = min (| Y1 (l, k) |, | Y2 (l, k) |), and the phase angle of the first signal: θ (l, k) = angle (Y1 (l, k)), a signal selected by comparison is obtained: ys (l, k) = | Ys (l, k) |. Exp (i · θ (l, k)), where
Figure FDA0003958013370000011
Third, the sum of differences is calculated using a module PR (l, k), where PR (l, k) = Φ diff (l, k)/Φ sum (l, k), ydiff (l, k) = Yi (l, k) -Yj (l, k), ysum (l, k) = Yi (l, k) + Yj (l, k), Φ diff (l, k) = E { | Ydiff (l, k) } 2 },Φsum(l,k)=E{|Ysum(l,k)| 2 };
Fourthly, estimating steady-state noise V (l, k) through a steady-state noise tracking module;
fifthly, selectively correcting noise through a noise correction module;
a sixth step of determining a final noise estimate based on the noise stage PR (l, k) value;
and seventhly, filtering by using a filter through a wiener to obtain the audio signal with the wind noise removed.
2. The method of removing wind noise according to claim 1,
the fifth step includes:
when the value of the signal stage PR (l, k) is lower than a specific value, noise correction is not carried out;
when the value of the signal period PR (l, k) is higher than a certain value, noise correction is performed.
3. The method of removing wind noise according to claim 2,
the particular value is 0.1667.
4. The method of removing wind noise according to claim 1,
the sixth step comprises, when the first condition is satisfied, reestimating the noise and obtaining a final noise estimate;
when the first condition is not satisfied, noise correction is not performed.
5. The method of removing wind noise according to claim 4,
the first condition is that the spectrum is below the 3kHz band interval and PR (l, k) is greater than the signal deviation 0.1667.
6. The method of removing wind noise according to claim 4,
the reestimated noise is implemented as:
Figure FDA0003958013370000021
wherein l is a frame and k isFrequency point
The final noise estimate is implemented as:
Figure FDA0003958013370000022
7. the method of removing wind noise according to any one of claims 1 to 6,
the wiener filtering is implemented as:
Figure FDA0003958013370000023
where l is the frame, k is the frequency point, phi ss (l, k) are estimated using a decision directed DD method.
CN202011504399.9A 2020-12-17 2020-12-17 Method for removing wind noise Active CN112750447B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011504399.9A CN112750447B (en) 2020-12-17 2020-12-17 Method for removing wind noise

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011504399.9A CN112750447B (en) 2020-12-17 2020-12-17 Method for removing wind noise

Publications (2)

Publication Number Publication Date
CN112750447A CN112750447A (en) 2021-05-04
CN112750447B true CN112750447B (en) 2023-01-24

Family

ID=75648559

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011504399.9A Active CN112750447B (en) 2020-12-17 2020-12-17 Method for removing wind noise

Country Status (1)

Country Link
CN (1) CN112750447B (en)

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7171008B2 (en) * 2002-02-05 2007-01-30 Mh Acoustics, Llc Reducing noise in audio systems
WO2007059255A1 (en) * 2005-11-17 2007-05-24 Mh Acoustics, Llc Dual-microphone spatial noise suppression
US9357307B2 (en) * 2011-02-10 2016-05-31 Dolby Laboratories Licensing Corporation Multi-channel wind noise suppression system and method
CN105792071B (en) * 2011-02-10 2019-07-05 杜比实验室特许公司 The system and method for detecting and inhibiting for wind
US10127919B2 (en) * 2014-11-12 2018-11-13 Cirrus Logic, Inc. Determining noise and sound power level differences between primary and reference channels
GB2555139A (en) * 2016-10-21 2018-04-25 Nokia Technologies Oy Detecting the presence of wind noise

Also Published As

Publication number Publication date
CN112750447A (en) 2021-05-04

Similar Documents

Publication Publication Date Title
US9502048B2 (en) Adaptively reducing noise to limit speech distortion
US20200342887A1 (en) Microphone array-based target voice acquisition method and device
US9558755B1 (en) Noise suppression assisted automatic speech recognition
US9431023B2 (en) Monaural noise suppression based on computational auditory scene analysis
KR100304666B1 (en) Speech enhancement method
JP3454206B2 (en) Noise suppression device and noise suppression method
US20140025374A1 (en) Speech enhancement to improve speech intelligibility and automatic speech recognition
JPH08221094A (en) Method and device for reducing noise in voice signals
Mirsamadi et al. Causal speech enhancement combining data-driven learning and suppression rule estimation.
CN103632677A (en) Method and device for processing voice signal with noise, and server
CN110782914A (en) Signal processing method and device, terminal equipment and storage medium
CN113539285A (en) Audio signal noise reduction method, electronic device, and storage medium
CN112530451A (en) Speech enhancement method based on denoising autoencoder
Naik et al. A literature survey on single channel speech enhancement techniques
US9373341B2 (en) Method and system for bias corrected speech level determination
CN107045874B (en) Non-linear voice enhancement method based on correlation
CN114242104A (en) Method, device and equipment for voice noise reduction and storage medium
CN112634927B (en) Short wave channel voice enhancement method
US8700391B1 (en) Low complexity bandwidth expansion of speech
US9177566B2 (en) Noise suppression method and apparatus
CN112750447B (en) Method for removing wind noise
Rao et al. Speech enhancement using sub-band cross-correlation compensated Wiener filter combined with harmonic regeneration
CN111933169B (en) Voice noise reduction method for secondarily utilizing voice existence probability
CN110168640A (en) For enhancing the device and method for needing component in signal
CN110444222B (en) Voice noise reduction method based on information entropy weighting

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant