CN1991980A - Method for removing background noise in voice signal - Google Patents

Method for removing background noise in voice signal Download PDF

Info

Publication number
CN1991980A
CN1991980A CNA2005101374510A CN200510137451A CN1991980A CN 1991980 A CN1991980 A CN 1991980A CN A2005101374510 A CNA2005101374510 A CN A2005101374510A CN 200510137451 A CN200510137451 A CN 200510137451A CN 1991980 A CN1991980 A CN 1991980A
Authority
CN
China
Prior art keywords
voice signal
frequency band
ground unrest
attenuation coefficient
gamma
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2005101374510A
Other languages
Chinese (zh)
Other versions
CN100565672C (en
Inventor
黄泰惠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Industrial Technology Research Institute ITRI
Original Assignee
Industrial Technology Research Institute ITRI
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Industrial Technology Research Institute ITRI filed Critical Industrial Technology Research Institute ITRI
Priority to CNB2005101374510A priority Critical patent/CN100565672C/en
Publication of CN1991980A publication Critical patent/CN1991980A/en
Application granted granted Critical
Publication of CN100565672C publication Critical patent/CN100565672C/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Circuit For Audible Band Transducer (AREA)

Abstract

A method of eliminating background noise in voice signal includes steps as following. Firstly, the attenuation coefficient of frequency band i is defined, then it is brute-force filtered based on the attenuation coefficient of adjacent frequency band to calculate the forward attenuation coefficient and backward attenuation coefficient. Then the forward attenuation coefficient and backward attenuation coefficient are linear combined to calculate the smooth attenuation coefficient of frequency band i. then the voice spectrum estimating value is calculated with the smooth attenuation coefficient. At last, the voice signal that the background noise is eliminated can be gained after the Fourier inversion.

Description

Remove the method for ground unrest in the voice signal
Technical field
The present invention relates to the method for ground unrest in a kind of removal voice signal (background noise), and be particularly related to the audio signal processing method that a kind of attenuation coefficient to each frequency band (attenuation factor) is made The disposal of gentle filter.
Background technology
User satisfaction investigation result according to osophone shows that the osophone user often has the complaint of " the excessive amplification of environmental noise is made us feeling tired " and " hear but and can not hear clearly ".Therefore, the noise in the removal signal is worn comfort level with raising becomes one of important topic of present research and development digital deaf-aid technology.Though the method for the ground unrest in some removal voice signals can obviously improve signal to noise ratio (S/N ratio) (signal to noise ratio at present, abbreviate SNR as), but but, even the subsidiary fluency that produce extra noise (be loosely referred to as musical noise) or destroy voice also not obvious to the improvement of voice identification.
Ground unrest disturbs and to be a kind of time domain (time domain) Waveform Superimposed Action, and the noise voice signal that receives at first can be expressed as y[n]=x[n]+w[n], x[n wherein] represent undisturbed voice signal, w[n] then represent ground unrest.
Traditional removal noise method can be expressed as X ^ [ i ] = γ [ i ] Y [ i ] , Y[i wherein] be noise voice signal y[n] through belonging to the part of frequency band i after the fast fourier transform (fast Fourier transform), i ∈ [0, N-1], N is a number of frequency bands, | Y[i] | expression noise voice signal y[n] at the amplitude of frequency band i, and the attenuation coefficient of the above-mentioned amplitude of γ [i] expression.
Traditional attenuation coefficient computing method are γ [ i ] = | D [ i ] | 2 | Y [ i ] | 2 , Wherein | D [ i ] | 2 = | Y [ i ] | 2 - α | W [ i ] | 2 , if | Y [ i ] | 2 ≥ α 1 - β | W [ i ] | 2 β | Y [ i ] | 2 , elsewhere | W[i] 2Be the energy of ground unrest at frequency band i, α and β are default coefficient.So, calculate X ^ [ i ] = γ [ i ] Y [ i ] Afterwards, then right Do inverse fourier transform (inverse Fourier transform), can obtain removing the voice signal behind the ground unrest.
Voice signal has correlativity between adjacent frequency band, yet as mentioned above, classic method is not utilized this point, and traditional amplitude damping factor is in each frequency band separate computations.So classic method should have improved space.
Summary of the invention
The purpose of this invention is to provide a kind of method of removing ground unrest in the voice signal, but this method can improve speech quality and the identification of removing behind the ground unrest.
For reaching above-mentioned and other purpose, the present invention proposes a kind of method of removing ground unrest in the voice signal, comprises the following steps.At first, the attenuation coefficient of definition frequency band i γ [ i ] = | D [ i ] | 2 | Y [ i ] | 2 , Wherein | D [ i ] | 2 = | Y [ i ] | 2 - α | W [ i ] | 2 , if | Y [ i ] | 2 ≥ α 1 - β | W [ i ] | 2 β | Y [ i ] | 2 , elsewhere | Y[i] | 2Be the energy of noise voice signal at frequency band i, | W[i] | 2Be the energy of ground unrest at frequency band i, i ∈ [0, N-1], N is a number of frequency bands, α and β are default coefficient.Calculate the forward attenuation coefficient γ of frequency band i then f[i] ≡ γ [i]=λ fγ [i]+(1-λ f) γ [i-1], wherein λ fBe default coefficient.Calculate the reverse attenuation coefficient gamma of frequency band i then b[i]=λ bγ b[i]+(1-λ b) γ b[i-1], wherein γ b[i]=γ [N-1-i], λ bBe default coefficient.Then calculate the level and smooth attenuation coefficient of frequency band i γ ^ [ i ] = λ c · γ ‾ f [ i ] + ( 1 - λ c ) γ ‾ b [ N - 1 - i ] ) , λ wherein cBe default coefficient.Then according to level and smooth attenuation coefficient computing voice frequency spectrum estimated value X ^ [ i ] = γ ^ [ i ] Y [ i ] . At last, will Make inverse fourier transform, obtain removing the voice signal behind the ground unrest.
The method of ground unrest in the above-mentioned removal voice signal, in one embodiment, γ [1]=γ [0], and γ ‾ b [ - 1 ] = γ [ N - 1 ] .
Described according to preferred embodiment of the present invention, the method of ground unrest is to utilize the relevance of voice signal between nearby frequency bands that attenuation coefficient is made The disposal of gentle filter in the above-mentioned removal voice signal, replacing traditional amplitude damping factor, but experimental result proof the method can improve speech quality and the identification of removing behind the ground unrest.
For above and other objects of the present invention, feature and advantage can be become apparent, the present invention's cited below particularly preferred embodiment, and conjunction with figs. are described in detail below.
Description of drawings
Fig. 1 is for removing the method flow diagram of ground unrest in the voice signal according to an embodiment of the invention.
Fig. 2 is the attenuation coefficient contrast figure of conventional art and one embodiment of the invention.
The main element description of symbols
110~160: flow chart step
Embodiment
By the voice spectrum behind the resulting removal noise of classic method is independently to be calculated by each frequency band, but the method that the present invention proposes then is to utilize the dependence relation between frequency band to handle the identification of back voice to improve denoising.
Below explanation please refer to Fig. 1.Fig. 1 is for removing the method flow diagram of ground unrest in the voice signal according to an embodiment of the invention.At first, define the attenuation coefficient of each frequency band in step 110.The number of frequency bands of supposing present embodiment is N, i ∈ [0, N-1], and then the attenuation coefficient of frequency band i is defined as γ [ i ] = | D [ i ] | 2 | Y [ i ] | 2 , Wherein | D [ i ] | 2 = | Y [ i ] | 2 - α | W [ i ] | 2 , if | Y [ i ] | 2 ≥ α 1 - β | W [ i ] | 2 β | Y [ i ] | 2 , elsewhere | Y[i] | 2Be the energy of the initial noise voice signal that receives at frequency band i, | W[i] | 2Be the energy of ground unrest at frequency band i, α and β are default coefficient.
Behind the definition attenuation coefficient, utilize first order IIR (infinite impulse response) wave filter q[n in step 120]=λ p[n]+(1-λ) q[n-1] the attenuation coefficient γ [i] of frequency band i is made Filtering Processing, to calculate the forward attenuation coefficient γ of frequency band i f[i].The computing formula of present embodiment is γ f[i] ≡ γ [i]=λ fγ [i]+(1-λ f) γ [i-1], wherein λ fBe default coefficient.Calculate as can be known through simple, forward attenuation coefficient γ f[i] calculates to γ [i] according to γ [0].
Next, utilize above-mentioned first order IIR filtering device that the attenuation coefficient γ [i] that the frequency band order is inverted is made Filtering Processing, to calculate the reverse attenuation coefficient gamma of frequency band i in step 130 b[i].The computing formula of present embodiment is γ b[i]=λ bγ b[i]+(1-λ b) γ b[i-1], wherein γ b[i]=γ [N-1-i], λ bBe default coefficient.Calculate as can be known the reverse attenuation coefficient gamma through simple b[i] calculates to γ [N-1] according to γ [N-1-i].
In above-mentioned difierence equation computing, starting condition is γ [1]=γ [0], and γ b[1]=γ [N-1].
Next, will forward do linear combination to calculate the level and smooth attenuation coefficient of frequency band i in step 140 with reverse filtering result
Figure A20051013745100081
The computing formula of present embodiment is γ ^ [ i ] = λ c · γ ‾ f [ i ] + ( 1 - λ c ) γ ‾ b [ N - 1 - i ] ) , λ wherein cBe default coefficient.Voice spectrum estimated value after step 150 is calculated smoothing processing then X ^ [ i ] = γ ^ [ i ] Y [ i ] , At last, will in step 160 Make inverse fourier transform, just can obtain removing the voice signal behind the ground unrest.
Fig. 2 is the attenuation coefficient contrast figure of conventional art and present embodiment, and its transverse axis is a frequency band number, and its longitudinal axis is the attenuation coefficient value.Fig. 2 is set at λ fbc=0.5, except the broken line that is marked as conventional art, all the other broken lines are all the data of present embodiment.Can find by Fig. 2, merge and forward to reach reverse result and make the attenuation coefficient of each frequency band can be subjected to the influence of its left and right sides band attenuation coefficient and adjust its value, therefore can reach the purpose of utilizing dependence relation adjustment band attenuation coefficient between frequency band.
Below the experimental result of explanation present embodiment at first is the experiment about the syllable discrimination power.This experiment is to train Chinese syllable-based hmm with 18 male sex and 11 women at the indoor clean speech database of respectively reading 120 Chinese names of peace and quiet.As for ground unrest, be that this clean speech database is added operation room noise, white noise, people's acoustic noise and factory noise respectively, wherein every kind of noise is synthesized into by waveform according to signal to noise ratio (S/N ratio) 20dB, 15dB, 10dB, 5dB and 0dB respectively.With each voice archives of this noise speech database, do the removal noise processed with the method for present embodiment, do automatic syllable identification with the clean speech model then, obtain following result.Following each experimental data all is four kinds of noises and five kinds of signal to noise ratio (S/N ratio)s, the mean value of 20 kinds of combinations altogether.
The syllable discrimination power experimental data of table 1, present embodiment
The λ value 1.0 0.7 0.6 0.55 0.5 0.45 0.4
Syllable accuracy (%) 41.8 44.8 45.6 45.8 46.1 46.2 45.9
λ in this experiment fb=λ, λ=1 o'clock level and smooth attenuation coefficient Equal traditional attenuation coefficient γ [i], so λ=1 o'clock 41.8% is exactly the experimental data of classic method.On the other hand, not doing the syllable accuracy of removing noise fully is 32.9%.As shown in Table 1, the method for present embodiment can improve the discrimination power of removing behind the noise really, o'clock can reach the highest discrimination power 46.2% in λ=0.45.
Second experiment is the result who comes the comparison distinct methods with the speech quality sense of hearing amount of commenting (perceptualevaluation of speech quality abbreviates PESQ as) of measuring the tonequality quality.PESQ mark scope is [0,4], and wherein 4 is the undistorted highest score of tonequality.Experimental result is as shown in the table.
Speech quality behind table 2, the removal ground unrest is measured
The λ value 1.0 0.5
The PESQ mark 2.44 2.45
Same, λ in this experiment fb=λ, 2.44 of λ=1 o'clock is the PESQ mark of classic method.On the other hand, not doing the mark of removing noise fully is 2.08.As shown in Table 2, the method for present embodiment can improve the speech quality of removing behind the ground unrest really.
Though the present invention is the inspiration that is subjected to digital deaf-aid, application of the present invention is not limited to digital deaf-aid, also can be applied to other field, and for example the digital recording of recording pen and so on is used.
In sum, the method of ground unrest in the removal voice signal that the present invention proposes, be to utilize the relevance of voice signal between nearby frequency bands that attenuation coefficient is made The disposal of gentle filter, replacing traditional amplitude damping factor, but experimental result proof said method can improve speech quality and the identification of removing behind the ground unrest.
Though the present invention discloses as above with preferred embodiment; right its is not in order to limit the present invention; any person of ordinary skill in the field; without departing from the spirit and scope of the present invention; when can doing a little change and improvement, so protection scope of the present invention is as the criterion when looking the claim person of defining.

Claims (12)

1. method of removing ground unrest in the voice signal is characterized in that comprising the following steps: to define the attenuation coefficient of frequency band i γ [ i ] = | D [ i ] | 2 | Y [ i ] | 2 , Wherein | D [ i ] | 2 = | Y [ i ] | 2 - α | W [ i ] | 2 , if [ Y [ i ] | 2 ≥ α 1 - β | W [ i ] | 2 β | Y [ i ] | 2 , elsewhere , | Y[i] | 2Be the energy of noise voice signal at frequency band i, | W[i] | 2Be the energy of ground unrest at frequency band i, i ∈ [0, N-1], N is a number of frequency bands, α and β are default coefficient;
Calculate the forward attenuation coefficient γ of frequency band i to γ [i] according to γ [0] f[i];
Calculate the reverse attenuation coefficient gamma of frequency band i to γ [N-1] according to γ [N-1-i] b[i];
According to γ f[i] and γ b[i], the level and smooth attenuation coefficient of calculating frequency band i
Computing voice frequency spectrum estimated value X ^ [ i ] = γ ^ [ i ] Y [ i ] ; And
Will
Figure A2005101374510002C5
Make inverse fourier transform, obtain removing the voice signal behind the ground unrest.
2. according to the method for ground unrest in the described removal voice signal of claim 1, it is characterized in that γ f[i] ≡ γ [i]=λ fγ [i]+(1-λ f) γ [i-1], λ fBe default coefficient.
3. according to the method for ground unrest in the described removal voice signal of claim 2, it is characterized in that γ [1]=γ [0].
4. according to the method for ground unrest in the described removal voice signal of claim 2, it is characterized in that λ fBe 0.5.
5. according to the method for ground unrest in the described removal voice signal of claim 1, it is characterized in that γ b[i]=λ bγ b[i]+(1-λ b) γ b[i-1], γ b[i]=γ [N-1-i], λ bBe default coefficient.
6. according to the method for ground unrest in the described removal voice signal of claim 5, it is characterized in that γ b[1]=γ [N-1].
7. according to the method for ground unrest in the described removal voice signal of claim 5, it is characterized in that λ bBe 0.5.
8. according to the method for ground unrest in the described removal voice signal of claim 1, it is characterized in that γ ^ [ i ] = λ c · γ ‾ f [ i ] + ( 1 - λ c ) γ ‾ b [ N - 1 - i ] ) , λ cBe default coefficient.
9. the method for ground unrest is characterized in that λ in the described according to Claim 8 removal voice signal cBe 0.5.
10. method of removing ground unrest in the voice signal is characterized in that comprising the following steps: to define the attenuation coefficient of frequency band i γ [ i ] = | D [ i ] | 2 | Y [ i ] | 2 , Wherein | D [ i ] | 2 = | Y [ i ] | 2 - α | W [ i ] | 2 , if | Y [ i ] | 2 ≥ α 1 - β | W [ i ] | 2 β | Y [ i ] | 2 , elsewhere , | Y[i] | 2Be the energy of noise voice signal at frequency band i, | W[i] | 2Be the energy of ground unrest at frequency band i, i ∈ [0, N-1], N is a number of frequency bands, α and β are default coefficient;
Calculate the forward attenuation coefficient γ of frequency band i f[i] ≡ γ [i]=λ fγ [i]+(1-λ f) γ [i-1], wherein λ fBe default coefficient;
Calculate the reverse attenuation coefficient gamma of frequency band i b[i]=λ bγ b[i]+(1-λ b) γ b[i-1], wherein γ b[i]=γ [N-1-i], λ bBe default coefficient;
Calculate the level and smooth attenuation coefficient of frequency band i γ ^ [ i ] = λ c · γ ‾ f [ i ] + ( 1 - λ c ) γ ‾ b [ N - 1 - i ] ) , λ wherein cBe default coefficient;
Computing voice frequency spectrum estimated value X ^ [ i ] = γ ^ [ i ] Y [ i ] ; And
Will Make inverse fourier transform, obtain removing the voice signal behind the ground unrest.
11. the method according to ground unrest in the described removal voice signal of claim 10 is characterized in that γ [1]=γ [0], and γ b[1]=γ [N-1].
12. the method according to ground unrest in the described removal voice signal of claim 10 is characterized in that λ fbc=0.5.
CNB2005101374510A 2005-12-30 2005-12-30 Remove the method for ground unrest in the voice signal Expired - Fee Related CN100565672C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2005101374510A CN100565672C (en) 2005-12-30 2005-12-30 Remove the method for ground unrest in the voice signal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2005101374510A CN100565672C (en) 2005-12-30 2005-12-30 Remove the method for ground unrest in the voice signal

Publications (2)

Publication Number Publication Date
CN1991980A true CN1991980A (en) 2007-07-04
CN100565672C CN100565672C (en) 2009-12-02

Family

ID=38214192

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2005101374510A Expired - Fee Related CN100565672C (en) 2005-12-30 2005-12-30 Remove the method for ground unrest in the voice signal

Country Status (1)

Country Link
CN (1) CN100565672C (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102968230A (en) * 2012-11-07 2013-03-13 江苏美琪威电子科技有限公司 Method for eliminating noise of capacitive touch screen and capacitive touch screen
CN102341852B (en) * 2009-01-06 2013-11-20 斯凯普公司 Filtering speech
CN106911993A (en) * 2015-12-23 2017-06-30 Gn瑞声达A/S hearing device with sound pulse suppression
CN109063165A (en) * 2018-08-15 2018-12-21 深圳市诺信连接科技有限责任公司 A kind of ERP file polling management system
CN111383654A (en) * 2020-04-07 2020-07-07 东莞市凌毅电子商务有限公司 Method for eliminating environmental noise interference on audio indicator lamp

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102341852B (en) * 2009-01-06 2013-11-20 斯凯普公司 Filtering speech
CN102968230A (en) * 2012-11-07 2013-03-13 江苏美琪威电子科技有限公司 Method for eliminating noise of capacitive touch screen and capacitive touch screen
CN102968230B (en) * 2012-11-07 2017-07-28 江苏美琪威电子科技有限公司 Method for eliminating noise of capacitive touch screen and capacitive touch screen
CN106911993A (en) * 2015-12-23 2017-06-30 Gn瑞声达A/S hearing device with sound pulse suppression
CN106911993B (en) * 2015-12-23 2021-06-08 Gn瑞声达A/S Hearing device with sound pulse suppression
US11350224B2 (en) 2015-12-23 2022-05-31 Gn Hearing A/S Hearing device with suppression of sound impulses
CN109063165A (en) * 2018-08-15 2018-12-21 深圳市诺信连接科技有限责任公司 A kind of ERP file polling management system
CN109063165B (en) * 2018-08-15 2022-04-19 深圳市诺信连接科技有限责任公司 ERP file query management system
CN111383654A (en) * 2020-04-07 2020-07-07 东莞市凌毅电子商务有限公司 Method for eliminating environmental noise interference on audio indicator lamp

Also Published As

Publication number Publication date
CN100565672C (en) 2009-12-02

Similar Documents

Publication Publication Date Title
Zhao et al. Analyzing noise robustness of MFCC and GFCC features in speaker identification
CN101976566B (en) Voice enhancement method and device applying same
CN102982801B (en) Phonetic feature extracting method for robust voice recognition
EP3040991B1 (en) Voice activation detection method and device
EP2905779B1 (en) System and method for dynamic residual noise shaping
US6173258B1 (en) Method for reducing noise distortions in a speech recognition system
US6523003B1 (en) Spectrally interdependent gain adjustment techniques
JP5302968B2 (en) Speech improvement with speech clarification
EP2031583B1 (en) Fast estimation of spectral noise power density for speech signal enhancement
CN1416564A (en) Noise reduction appts. and method
CN110120225A (en) A kind of audio defeat system and method for the structure based on GRU network
CN102144258B (en) Method and apparatus to facilitate determining signal bounding frequencies
EP1386313B1 (en) Speech enhancement device
JP3205560B2 (en) Method and apparatus for determining tonality of an audio signal
Arslan Modified wiener filtering
CN1991980A (en) Method for removing background noise in voice signal
CN109297583B (en) Method for evaluating time-varying noise loudness of double-ear abnormal sound in automobile
US20030191640A1 (en) Method for extracting voice signal features and related voice recognition system
Kim et al. Robust speech recognition using a small power boosting algorithm
US20070150270A1 (en) Method for removing background noise in a speech signal
Kauppinen et al. Improved noise reduction in audio signals using spectral resolution enhancement with time-domain signal extrapolation
CN109346097B (en) Speech enhancement method based on Kullback-Leibler difference
Krishnamoorthy et al. Enhancement of noisy speech by spectral subtraction and residual modification
Gamliel et al. Perceptual time varying linear prediction model for speech applications
Umesh et al. The speech scale

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20091202

Termination date: 20211230