JP2003131689A

JP2003131689A - Noise removing method and device

Info

Publication number: JP2003131689A
Application number: JP2001327839A
Authority: JP
Inventors: Akihiko Sugiyama; 昭彦杉山; Masanori Kato; 正徳加藤; Masahiro Serizawa; 芹沢　　昌宏
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2001-10-25
Filing date: 2001-10-25
Publication date: 2003-05-09

Abstract

PROBLEM TO BE SOLVED: To provide a noise removing method and device capable of obtaining enhanced speech of excellent subjective sound quality without generating noise in the output signal. SOLUTION: The device has a windowing unit 22 that applies windowing to the signal samples taken from two adjacent frames of the inverse Fourier transform output.

Description

Detailed Description of the Invention

[0001]

[Technical Field of the Invention] The present invention relates to a noise removing method and device, and more particularly to a noise removing method and device for removing noise superimposed on a speech signal.

[0002]

[Prior Art] A noise suppressor is a known technique for removing noise superimposed on a speech signal. A noise suppressor estimates the power spectrum of the noise component from the input signal transformed into the frequency domain by a cosine transform, a Fourier transform, or the like, and suppresses the noise mixed into the speech signal by subtracting the estimated power spectrum from the input signal.

[0003] By detecting silent sections of the speech and updating the power spectrum of the noise component during them, the technique can also be applied to the suppression of non-stationary noise.

[0004] One such noise suppressor is the method described in IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol. 32, No. 6, pp. 1109-1121, December 1984 (hereinafter "Reference 1").

[0005] The method described in Reference 1 is known as the minimum mean-square error short-time spectral amplitude method.

[0006] FIG. 25 is a block diagram showing a schematic configuration of the noise suppressor described in Reference 1 and elsewhere.

[0007] A degraded speech signal, in which a speech signal and noise are mixed, is supplied to the input terminal 11 as a sequence of sample values. The degraded speech signal is supplied to the frame division unit 1 and divided into frames of $K/2$ samples each, where $K$ is an even number.

[0008] The degraded speech signal divided into frames is supplied to the windowing unit 2, where it is multiplied by the window function $w(t)$. For the input signal $y_n(t)$ ($t = 0, 1, \ldots, K/2-1$) of the $n$-th frame, the signal $\bar{y}_n(t)$ windowed with $w(t)$ is given by Equation 1.

[0009]

[Equation 1]

$$\bar{y}_n(t) = w(t)\, y_n(t)$$

It is also common practice to window two consecutive frames with partial overlap. Assuming an overlap length of 50% of the frame length, the output of the windowing unit 2 is $\bar{y}_n(t)$ ($t = 0, 1, \ldots, K-1$), obtained from Equation 2 for $t = 0, 1, \ldots, K/2-1$.

[0010]

[Equation 2]

$$\bar{y}_n(t) = w(t)\, y_{n-1}(t), \qquad \bar{y}_n(t + K/2) = w(t + K/2)\, y_n(t)$$

A symmetric window function is used for real-valued signals. The window function is also designed so that, with the suppression coefficient set to 1, the input and output signals match apart from computational error. This means that $w(t) + w(t + K/2) = 1$.

[0011] The following description takes as its example the case in which two consecutive frames are windowed with 50% overlap.

[0012] As $w(t)$, for example, the Hanning window shown in Equation 3 can be used.

[0013]

[Equation 3]

$$w(t) = \begin{cases} 0.5 + 0.5 \cos\left(\dfrac{\pi (t - K/2)}{K/2}\right), & 0 \le t < K \\ 0, & \text{otherwise} \end{cases}$$

The windowed output $\bar{y}_n(t)$ is supplied to the Fourier transform unit 3 and converted into the degraded speech spectrum $Y_n(k)$. The degraded speech spectrum $Y_n(k)$ is separated into phase and amplitude: the degraded speech phase spectrum $\arg Y_n(k)$ is supplied to the inverse Fourier transform unit 9, and the degraded speech amplitude spectrum $|Y_n(k)|$ is supplied to the voice detection unit 4, the multiplexed multiplier 16, and the multiplexed multiplier 17.
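The 50% overlap windowing and the condition $w(t) + w(t + K/2) = 1$ can be checked numerically. The following is a minimal Python sketch, not part of the patent: the function names and the frame length K = 8 are illustrative assumptions, and the window is the standard Hanning form, assumed here to be what Equation 3 denotes.

```python
import math

def hanning_window(K):
    """Hanning window of length K (K even), assumed standard form."""
    return [0.5 + 0.5 * math.cos(math.pi * (t - K / 2) / (K / 2))
            for t in range(K)]

def windowed_frame(prev_half, cur_half, w):
    """Join two consecutive K/2-sample frames and multiply by the window."""
    joined = list(prev_half) + list(cur_half)
    return [wt * yt for wt, yt in zip(w, joined)]

K = 8  # illustrative frame length, not taken from the patent
w = hanning_window(K)

# Overlap condition: w(t) + w(t + K/2) for t = 0 .. K/2 - 1
overlap_sums = [w[t] + w[t + K // 2] for t in range(K // 2)]
```

Every entry of `overlap_sums` equals 1 up to rounding error, which is the property that lets overlap-added frames reproduce the input when no suppression is applied.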

[0014] The voice detection unit 4 detects the presence or absence of speech based on the degraded speech amplitude spectrum $|Y_n(k)|$ and passes a voice detection flag, set according to the result, to the estimated noise calculation unit 51.

[0015] The multiplexed multiplier 17 calculates the degraded speech power spectrum from the supplied degraded speech amplitude spectrum $|Y_n(k)|$ and passes it to the estimated noise calculation unit 51 and the per-frequency signal-to-noise ratio (hereinafter "SNR") calculation unit 6.

[0016] The estimated noise calculation unit 51 estimates the noise power spectrum using the voice detection flag, the degraded speech power spectrum, and the count value supplied from the counter 13, and passes it to the per-frequency SNR calculation unit 6 as the estimated noise power spectrum.

[0017] The per-frequency SNR calculation unit 6 calculates the SNR for each frequency using the input degraded speech power spectrum and estimated noise power spectrum, and supplies it as the a posteriori SNR to the estimated a priori SNR calculation unit 7 and the noise suppression coefficient generation unit 8.

[0018] The estimated a priori SNR calculation unit 7 estimates the a priori SNR using the input a posteriori SNR and the suppression coefficient supplied from the noise suppression coefficient generation unit 8, and feeds it back to the noise suppression coefficient generation unit 8 as the estimated a priori SNR.

[0019] The noise suppression coefficient generation unit 8 generates a noise suppression coefficient using the a posteriori SNR and the estimated a priori SNR supplied as inputs, feeds it back to the estimated a priori SNR calculation unit 7 as the suppression coefficient, and at the same time passes it to the multiplexed multiplier 16.

[0020] The multiplexed multiplier 16 weights the degraded speech amplitude spectrum $|Y_n(k)|$ supplied from the Fourier transform unit 3 by the suppression coefficient $\bar{G}_n(k)$ supplied from the noise suppression coefficient generation unit 8 to obtain the enhanced speech amplitude spectrum $|\bar{X}_n(k)|$, and passes it to the inverse Fourier transform unit 9. $|\bar{X}_n(k)|$ is given by Equation 4.

[0021]

[Equation 4]

$$|\bar{X}_n(k)| = \bar{G}_n(k)\, |Y_n(k)|$$

The inverse Fourier transform unit 9 multiplies the enhanced speech amplitude spectrum $|\bar{X}_n(k)|$ supplied from the multiplexed multiplier 16 by the degraded speech phase spectrum $\arg Y_n(k)$ supplied from the Fourier transform unit 3 to obtain the enhanced speech $\bar{X}_n(k)$. That is, it performs the operation shown in Equation 5.

[0022]

[Equation 5]

$$\bar{X}_n(k) = |\bar{X}_n(k)| \cdot e^{j \arg Y_n(k)}$$

An inverse Fourier transform is applied to the resulting enhanced speech $\bar{X}_n(k)$, and the result is passed to the frame synthesis unit 10 as a time-domain sample sequence $\bar{X}_n(t)$ ($t = 0, 1, \ldots, K-1$) in which one frame consists of $K$ samples.
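The weighting of the amplitude spectrum and its recombination with the degraded phase can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the patent's implementation: the function name `enhance_spectrum` and the sample values are hypothetical, and the phase is applied as the unit-magnitude factor $e^{j \arg Y_n(k)}$, which is one reading of "multiplying by the phase spectrum".

```python
import cmath

def enhance_spectrum(Y, G):
    """Scale each bin's magnitude by its suppression coefficient and
    keep the phase of the degraded spectrum unchanged."""
    X = []
    for y_k, g_k in zip(Y, G):
        magnitude = g_k * abs(y_k)               # |X| = G * |Y|
        phase = cmath.phase(y_k)                 # arg Y
        X.append(cmath.rect(magnitude, phase))   # |X| * exp(j * arg Y)
    return X

# Hypothetical four-bin spectrum and suppression coefficients
Y = [3 + 4j, -2 + 0j, 0 - 1j, 1 + 1j]
G = [0.5, 1.0, 0.0, 0.25]
X = enhance_spectrum(Y, G)
```

Because only the magnitude is modified, a coefficient of 1 leaves a bin untouched and a coefficient of 0 removes it entirely.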

[0023] The frame synthesis unit 10 takes $K/2$ samples from each of two adjacent frames of $\bar{X}_n(t)$, overlaps them, and obtains the enhanced speech $\hat{X}_n(t)$ by Equation 6. The resulting enhanced speech $\hat{X}_n(t)$ ($t = 0, 1, \ldots, K-1$) is passed to the output terminal 12 as the output of the frame synthesis unit 10.
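The overlap-add performed by the frame synthesis unit can be sketched as follows. This is a minimal Python illustration, not the patent's implementation: the helper names, the test signal, and K = 8 are assumptions, the window is the standard Hanning form, and all suppression coefficients are taken to be 1 so that the Fourier transform stages can be skipped.

```python
import math

def hanning_window(K):
    """Hanning window of length K (K even), assumed standard form."""
    return [0.5 + 0.5 * math.cos(math.pi * (t - K / 2) / (K / 2))
            for t in range(K)]

def overlap_add(prev_frame, cur_frame):
    """Add the second half of the previous K-sample frame to the first
    half of the current one, giving K/2 output samples."""
    half = len(cur_frame) // 2
    return [prev_frame[t + half] + cur_frame[t] for t in range(half)]

K = 8
half = K // 2
w = hanning_window(K)
signal = [float(i % 5) - 2.0 for i in range(5 * half)]
blocks = [signal[i:i + half] for i in range(0, len(signal), half)]

# Windowed K-sample frames built from consecutive K/2-sample blocks.
# With every suppression coefficient equal to 1 they pass through unchanged.
frames = [[w[t] * s for t, s in enumerate(blocks[n - 1] + blocks[n])]
          for n in range(1, len(blocks))]

# Each output should reproduce the block shared by two adjacent frames.
outputs = [overlap_add(frames[n - 1], frames[n]) for n in range(1, len(frames))]
```

Here `outputs[i]` matches `blocks[i + 1]` up to rounding error, which illustrates why the window is designed so that its two overlapping halves sum to 1.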

[0024]

[Equation 6]

$$\hat{X}_n(t) = \bar{X}_{n-1}(t + K/2) + \bar{X}_n(t)$$

Reference 1 does not describe in detail how to implement the voice detection unit. However, an implementation example is known from the Proceedings of the Acoustical Society of Japan, March 2000, pp. 321-322 (hereinafter "Reference 2"), so the method shown in Reference 2 is described below as the conventional method.

[0025] FIG. 26 is a block diagram showing the configuration of the voice detection unit 4 of FIG. 25. The voice detection unit 4 includes a threshold storage unit 401, a comparison unit 402, a multiplier 404, a logarithm calculation unit 405, a power calculation unit 406, a weighted addition unit 407, a weight storage unit 408, and a logical NOT circuit 409.

[0026] The degraded speech amplitude spectrum supplied from the Fourier transform unit 3 of FIG. 25 is supplied to the power calculation unit 406. The power calculation unit 406 calculates the sum of the power $|Y_n(k)|^2$ of the degraded speech amplitude spectrum over $k = 0$ to $K-1$ and passes it to the logarithm calculation unit 405.

[0027] The logarithm calculation unit 405 takes the logarithm of the input degraded speech spectral power and passes it to the multiplier 404.

[0028] The multiplier 404 multiplies the supplied logarithm by a constant to obtain the degraded speech power $Q_n$ (901) and supplies it to the comparison unit 402 and the weighted addition unit 407. That is, the degraded speech power $Q_n$ of the $n$-th frame is given by Equation 7.

[0029]

[Equation 7]

Note that the voice detection unit described in Reference 2 obtains $Q_n$ according to Equation 8, using the time-domain samples $\bar{y}_n(t)$.

[0030]

[Equation 8]

However, as shown for example in Theory of Digital Signal Processing, Corona Publishing, 1985, pp. 75-76 (hereinafter "Reference 3"), the equivalence of Equation 8 and Equation 7 is known as Parseval's identity.
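Parseval's identity can be verified numerically with a short Python sketch. This is an illustration only: the naive `dft` helper, the sample values, and the 1/K normalization are assumptions tied to the unnormalized DFT convention used here, and the exact constants of Equations 7 and 8 are not reproduced.

```python
import cmath

def dft(x):
    """Naive unnormalized discrete Fourier transform of a sequence."""
    K = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / K)
                for t in range(K))
            for k in range(K)]

# Hypothetical windowed frame of K = 8 time-domain samples
y = [0.5, -1.0, 2.0, 0.25, -0.75, 1.5, 0.0, -2.0]
Y = dft(y)

time_energy = sum(v * v for v in y)                 # sum over t of y(t)^2
freq_energy = sum(abs(c) ** 2 for c in Y) / len(y)  # (1/K) * sum over k of |Y(k)|^2
```

The two energies agree up to rounding error, so a frame power computed from the spectrum and one computed from the time-domain samples differ at most by a fixed scale factor, which a logarithm turns into a constant offset.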

[0031] The threshold $TH_n$ is supplied to the comparison unit 402 from the threshold storage unit 401. The comparison unit 402 compares the output of the multiplier 404 with the threshold $TH_n$ and outputs, as the voice detection flag, "1" indicating speech when $TH_n > Q_n$ and "0" indicating silence when $TH_n \le Q_n$.

[0032] The output of the comparison unit 402 is supplied externally as the voice detection flag, which is the output of the voice detection unit 4, and at the same time is supplied to the logical NOT circuit 409. The output of the logical NOT circuit 409 is supplied to the weighted addition unit 407 as the weighted addition unit control signal 905.

[0033] The weighted addition unit 407 is also supplied with the threshold from the threshold storage unit 401 and the weight 903 from the weight storage unit 408. The weighted addition unit 407 selectively updates the threshold 902 supplied from the threshold storage unit 401 based on the weighted addition unit control signal 905 and feeds it back to the threshold storage unit 401 as the updated threshold 904.

[0034] The updated threshold $TH_n$ is obtained by weighted addition of the threshold $TH_{n-1}$ and the degraded speech power 901, using the weight 903 supplied from the weight storage unit 408. The updated threshold $TH_n$ is calculated only when the weighted addition unit control signal 905, which is the output of the logical NOT circuit 409, equals "1". That is, the threshold $TH_n$ is updated only during silence. The updated threshold 904 obtained by the update is fed back to the threshold storage unit 401.

[0035] FIG. 27 is a block diagram showing the configuration of the power calculation unit 406 of FIG. 26. The power calculation unit 406 includes a separation unit 4061, multipliers $4062_0$ to $4062_{K-1}$, and an adder 4063.

[0036] The degraded speech amplitude spectrum supplied in multiplexed form from the Fourier transform unit 3 of FIG. 25 is separated into $K$ per-frequency samples in the separation unit 4061, and these are supplied to the multipliers $4062_0$ to $4062_{K-1}$, respectively.

[0037] The multipliers $4062_0$ to $4062_{K-1}$ each square their input signal and pass it to the adder 4063. The adder 4063 calculates and outputs the sum of its input signals.

[0038] FIG. 28 is a block diagram showing the configuration of the weighted addition unit 407 of FIG. 26. The weighted addition unit 407 includes multipliers 4071 and 4073, a constant multiplier 4075, and adders 4072 and 4074.

[0039] The weighted addition unit 407 receives as inputs the degraded speech power 901 from the multiplier 404 of FIG. 26, the threshold 902 from the threshold storage unit 401 of FIG. 26, the weight 903 from the weight storage unit 408 of FIG. 26, and the weighted addition unit control signal 905 from the logical NOT circuit 409 of FIG. 26.

[0040] The weight 903, which has the value $\beta$, is passed to the constant multiplier 4075 and the multiplier 4073. The constant multiplier 4075 multiplies its input by $-1$ and passes the resulting $-\beta$ to the adder 4074. The other input of the adder 4074 is supplied with 1, so the output of the adder 4074 is their sum, $1 - \beta$. This value is supplied to the multiplier 4071, where it is multiplied by the other input, the degraded speech power $Q_n$, and the product $(1 - \beta) Q_n$ is passed to the adder 4072.

[0041] Meanwhile, the multiplier 4073 multiplies $\beta$, supplied as the weight 903, by the threshold 902, and the product $\beta\, TH_{n-1}$ is passed to the adder 4072. The adder 4072 outputs the sum of $\beta\, TH_{n-1}$ and $(1 - \beta) Q_n$ as the updated threshold 904.

[0042] The updated threshold $TH_n$ is calculated only when the weighted addition unit control signal 905 equals "1". That is, the function of the weighted addition unit is to update the threshold $TH_{n-1}$ to obtain $TH_n$ during silence, which can be expressed by Equation 9.

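[0043]

[Equation 9]

$$TH_n = \begin{cases} \beta\, TH_{n-1} + (1 - \beta)\, Q_n, & \text{during silence} \\ TH_{n-1}, & \text{otherwise} \end{cases}$$

Here, $\beta$ is the value of the weight 903.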
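The selective threshold update carried out by the weighted addition unit can be sketched in Python as follows. This is a minimal sketch under stated assumptions: the function name `update_threshold`, its argument names, and the numeric values are hypothetical, and holding the previous threshold when the control signal is 0 is inferred from the statement that the update runs only when the signal equals "1".

```python
def update_threshold(th_prev, q, beta, control):
    """Return TH_n from TH_{n-1}, the frame power Q_n, the weight beta,
    and the weighted addition unit control signal (1 = update)."""
    if control == 1:
        # Weighted addition: beta * TH_{n-1} + (1 - beta) * Q_n
        return beta * th_prev + (1.0 - beta) * q
    # Control signal is 0: keep the stored threshold unchanged
    return th_prev

# Hypothetical values for illustration
updated = update_threshold(th_prev=10.0, q=20.0, beta=0.9, control=1)
held = update_threshold(th_prev=10.0, q=20.0, beta=0.9, control=0)
```

A value of `beta` close to 1 makes the threshold track the frame power slowly, while a smaller value makes it follow the most recent frames more closely.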

[0044] FIG. 29 is a block diagram showing the configuration of the multiplexed multiplier 17 of FIG. 25. The multiplexed multiplier 17 includes multipliers $1701_0$ to $1701_{K-1}$, separation units 1702 and 1703, and a multiplexing unit 1704. The degraded speech amplitude spectrum supplied in multiplexed form from the Fourier transform unit 3 of FIG. 25 is separated into $K$ per-frequency samples in the separation units 1702 and 1703, and these are supplied to the multipliers $1701_0$ to $1701_{K-1}$, respectively.

[0045] The multipliers $1701_0$ to $1701_{K-1}$ each square their input signal and pass it to the multiplexing unit 1704. The multiplexing unit 1704 multiplexes its input signals and outputs them as the degraded speech power spectrum.

[0046] FIG. 30 is a block diagram showing the configuration of the estimated noise calculation unit 51 of FIG. 25. The estimated noise calculation unit 51 includes a separation unit 502, a multiplexing unit 503, and per-frequency estimated noise calculation units $514_0$ to $514_{K-1}$.

[0047] The voice detection flag supplied from the voice detection unit 4 of FIG. 25 and the count value supplied from the counter 13 of FIG. 25 are passed to the per-frequency estimated noise calculation units $514_0$ to $514_{K-1}$.

[0048] The degraded speech power spectrum supplied from the multiplexed multiplier 17 of FIG. 25 is passed to the separation unit 502. The separation unit 502 separates the degraded speech power spectrum, supplied in multiplexed form, into components corresponding to the $K$ frequencies and passes them to the per-frequency estimated noise calculation units $514_0$ to $514_{K-1}$.

[0049] The per-frequency estimated noise calculation units $514_0$ to $514_{K-1}$ calculate the noise power spectrum using the degraded speech power spectrum supplied from the separation unit 502 and pass it to the multiplexing unit 503. The calculation of the noise power spectrum is controlled by the count value and the value of the voice detection flag, and is executed only when a predetermined condition is satisfied.

[0050] The multiplexing unit 503 multiplexes the $K$ supplied noise power spectrum values and outputs them as the estimated noise power spectrum.

[0051] FIG. 31 is a block diagram showing the configuration of the per-frequency estimated noise calculation unit 514 of FIG. 30. The noise estimation described in Reference 2 updates the noise estimate in silent sections, and uses as the noise estimate the instantaneous values of the estimated noise averaged by a recursive filter.

[0052] By contrast, the noise estimation described in IEEE Transactions on Speech and Audio Processing, Vol. 6, No. 3, pp. 287-292, May 1998 (hereinafter "Reference 4") is stated to average the instantaneous values of the estimated noise before use.

[0053] This suggests implementing the averaging with a transversal filter (a configuration using a shift register) instead of a recursive one. Because both implementations are functionally equivalent, the method described in Reference 4 is explained here.

[0054] The per-frequency estimated noise calculation unit 514 includes an update determination unit 521, a register length storage unit 5041, a switch 5044, a shift register 5045, an adder 5046, a minimum value selection unit 5047, a division unit 5048, and a counter 5049.

[0055] The switch 5044 is supplied with the per-frequency degraded speech power spectrum from the separation unit 502 of FIG. 30. When the switch 5044 is closed, the per-frequency degraded speech power spectrum is passed to the shift register 5045.

[0056] As described later, the shift register 5045 shifts the stored values of its internal registers to the adjacent registers in response to the control signal supplied from the update determination unit 521. The shift register length equals the value stored in the register length storage unit 5941, described later. All register outputs of the shift register 5045 are supplied to the adder 5046.

[0057] The adder 5046 adds all the supplied register outputs and passes the sum to the division unit 5048.

[0058] Meanwhile, the update determination unit 521 is supplied with the count value and the voice detection flag. The update determination unit 521 always outputs "1" until the count value reaches a preset value; after that, it outputs "1" when the voice detection flag is "0" (silence) and "0" otherwise. The output is passed to the counter 5049, the switch 5044, and the shift register 5045.

[0059] The switch 5044 closes the circuit when the signal supplied from the update determination unit 521 is "1" and opens it when the signal is "0".

[0060] The counter 5049 increments its count value when the signal supplied from the update determination unit 521 is "1" and leaves it unchanged when the signal is "0".

[0061] When the signal supplied from the update determination unit is "1", the shift register 5045 takes in one sample of the signal supplied from the switch 5044 and at the same time shifts the stored values of its internal registers to the adjacent registers.

[0062] The minimum value selection unit 5047 is supplied with the output of the counter 5049 and the output of the register length storage unit 5941. It selects the smaller of the supplied count value and register length and passes it to the division unit 5048.

[0063] The division unit 5048 divides the sum of the per-frequency degraded speech power spectrum supplied from the adder 5046 by the smaller of the count value and the register length, and outputs the quotient as the per-frequency estimated noise power spectrum $\lambda_n(k)$. Letting $B_n(k)$ ($n = 0, 1, \ldots, N-1$) be the sample values of the degraded speech power spectrum stored in the shift register 5045, $\lambda_n(k)$ is given by Equation 10.

[0064]

[Equation 10]

$$\lambda_n(k) = \frac{1}{N} \sum_{n=0}^{N-1} B_n(k)$$

Here, $N$ is the smaller of the count value and the register length. Because the count value starts at zero and increases monotonically, the division is at first performed by the count value and later by the register length. Dividing by the register length amounts to taking the average of the values stored in the shift register.

[0065] At first, not enough values are stored in the shift register 5045, so the division is by the number of registers that actually hold values. The number of registers that actually hold values equals the count value while the count value is smaller than the register length, and equals the register length once the count value exceeds it.
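The shift-register averaging for a single frequency bin can be sketched in Python as follows. This is a hedged illustration, not the patent's circuit: the class name `BinNoiseEstimator`, the use of `collections.deque` as the shift register, and returning `None` before any sample has been stored are all assumptions made for the example.

```python
from collections import deque

class BinNoiseEstimator:
    """Running noise power estimate for one frequency bin."""

    def __init__(self, register_length):
        # Bounded deque standing in for the shift register: appending to
        # a full deque discards the oldest stored value.
        self.register = deque(maxlen=register_length)
        self.register_length = register_length
        self.count = 0  # stands in for the counter

    def step(self, power, update):
        """Process one frame; `update` is the update determination output."""
        if update == 1:
            self.register.append(power)  # switch closed: take in one sample
            self.count += 1
        n = min(self.count, self.register_length)  # minimum value selection
        if n == 0:
            return None  # assumption: no estimate before the first update
        return sum(self.register) / n  # adder followed by division

est = BinNoiseEstimator(register_length=3)
trace = [est.step(4.0, 1), est.step(2.0, 1), est.step(100.0, 0),
         est.step(6.0, 1), est.step(10.0, 1)]
```

In this trace the third frame is skipped because its update signal is 0, so the large value 100.0 never enters the average; once the register is full, each new sample displaces the oldest one.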

[0066] FIG. 32 is a block diagram showing the configuration of the update determination unit 521 of FIG. 31. The update determination unit 521 includes a logical NOT circuit 5202, a comparison unit 5203, a threshold storage unit 5204, and a logical OR unit 5211.

[0067] The count value supplied from the counter 13 of FIG. 25 is passed to the comparison unit 5203. The threshold output by the threshold storage unit 5204 is also passed to the comparison unit 5203. The comparison unit 5203 compares the supplied count value with the threshold and passes "1" to the logical OR unit 5211 when the count value is smaller than the threshold and "0" when the count value is larger than the threshold.

[0068] Meanwhile, the supplied voice detection flag is passed to the logical NOT circuit 5202. The logical NOT circuit 5202 computes the logical negation of its input and passes it to the logical OR unit 5211. That is, it passes "0" in voiced sections, where the voice detection flag is "1", and "1" in silent sections, where the voice detection flag is "0".

[0069] As a result, the output of the logical OR unit 5211 is "1" in silent sections, where the voice detection flag is "0", or when the count value is smaller than the threshold. This closes the switch of FIG. 31 and causes the counter 5049 to count up.
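The update determination logic reduces to a comparison, a negation, and an OR, which can be sketched in Python as follows. The function name `update_decision` and the example values are hypothetical, and treating a count equal to the threshold as "not smaller" is an assumption, since the text only covers the smaller and larger cases.

```python
def update_decision(count, threshold, voice_flag):
    """Return 1 when the noise estimate should be updated, else 0."""
    # Comparison: 1 while the count is still below the threshold
    # (assumption: equality is treated as not smaller).
    startup = 1 if count < threshold else 0
    # Logical NOT of the voice detection flag: 1 during silence
    silent = 1 - voice_flag
    # Logical OR of the two conditions
    return startup | silent

during_startup = update_decision(count=2, threshold=5, voice_flag=1)
silent_later = update_decision(count=7, threshold=5, voice_flag=0)
voiced_later = update_decision(count=7, threshold=5, voice_flag=1)
```

The start-up term forces updates for the first few frames regardless of the voice detection flag, so the noise estimate is populated before voice detection begins to gate it.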

[0070] FIG. 33 is a block diagram showing the configuration of the per-frequency SNR calculation unit 6 of FIG. 25. The per-frequency SNR calculation unit 6 includes division units $601_0$ to $601_{K-1}$, separation units 602 and 603, and a multiplexing unit 604.

[0071] The degraded speech power spectrum supplied from the multiplexed multiplier 17 of FIG. 25 is passed to the separation unit 602. The estimated noise power spectrum supplied from the estimated noise calculation unit 51 of FIG. 25 is passed to the separation unit 603. The degraded speech power spectrum is separated in the separation unit 602, and the estimated noise power spectrum in the separation unit 603, into $K$ samples corresponding to the frequency components, which are supplied to the division units $601_0$ to $601_{K-1}$, respectively.

[0072] The division units $601_0$ to $601_{K-1}$ divide the supplied degraded speech power spectrum by the estimated noise power spectrum according to Equation 11 to obtain the per-frequency SNR $\gamma_n(k)$, and pass it to the multiplexing unit 604.

[0073]

[Equation 11]

$$\gamma_n(k) = \frac{|Y_n(k)|^2}{\lambda_n(k)}$$

Here, $\lambda_n(k)$ is the estimated noise power spectrum. The multiplexing unit 604 multiplexes the $K$ per-frequency SNRs it receives and outputs them as the a posteriori SNR.

【００７４】FIG. 34 is a block diagram showing the configuration of the estimated a priori SNR calculation unit 7 of FIG. 25. The estimated a priori SNR calculation unit 7 includes a multiplex range-limiting processing unit 701, an a posteriori SNR storage unit 702, a suppression coefficient storage unit 703, multiplex multiplication units 704 and 705, a weight storage unit 706, a multiplex weighted addition unit 707, and an adder 708.

【００７５】The a posteriori SNR γ_n(k) (k = 0, 1, ..., K-1) supplied from the frequency-specific SNR calculation unit 6 of FIG. 25 is passed to the a posteriori SNR storage unit 702 and to the adder 708.

【００７６】The a posteriori SNR storage unit 702 stores the a posteriori SNR γ_n(k) of the n-th frame and passes the a posteriori SNR γ_{n-1}(k) of the (n-1)-th frame to the multiplex multiplication unit 705.

【００７７】The suppression coefficient G_n(k) bar (k = 0, 1, ..., K-1) supplied from the noise suppression coefficient generation unit 8 of FIG. 25 is passed to the suppression coefficient storage unit 703. The suppression coefficient storage unit 703 stores the suppression coefficient G_n(k) bar of the n-th frame and passes the suppression coefficient G_{n-1}(k) bar of the (n-1)-th frame to the multiplex multiplication unit 704.

【００７８】The multiplex multiplication unit 704 squares the supplied G_{n-1}(k) bar to obtain G²_{n-1}(k) bar and passes it to the multiplex multiplication unit 705.

【００７９】The multiplex multiplication unit 705 multiplies G²_{n-1}(k) bar by γ_{n-1}(k) for k = 0, 1, ..., K-1 to obtain G²_{n-1}(k) bar γ_{n-1}(k), and passes the result to the multiplex weighted addition unit 707 as the past estimated SNR 922.

【００８０】The configurations of the multiplex multiplication units 704 and 705 are the same as that of the multiplex multiplication unit 17 described with reference to FIG. 29.

【００８１】The value -1 is supplied to the other terminal of the adder 708, and the sum γ_n(k) - 1 is passed to the multiplex range-limiting processing unit 701.

【００８２】The multiplex range-limiting processing unit 701 applies the range-limiting operator P[·] to the sum γ_n(k) - 1 supplied from the adder 708 and passes the result, P[γ_n(k) - 1], to the multiplex weighted addition unit 707 as the instantaneous estimated SNR 921. Here, P[x] is defined by Equation 12.

【００８３】[0083]

【数１２】[Equation 12] The multiplex weighted addition unit 707 is also supplied with the weight 923 from the weight storage unit 706. Using the supplied instantaneous estimated SNR 921, past estimated SNR 922, and weight 923, the multiplex weighted addition unit 707 obtains the estimated a priori SNR 924. Letting α denote the weight 923 and ξ_n(k) hat denote the estimated a priori SNR, ξ_n(k) hat is calculated using Equation 13.

【００８４】[0084]

【数１３】[Equation 13] Here, the initial value is set as G²_{-1}(k) bar γ_{-1}(k) = 1.
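The weighted addition of the instantaneous and past estimated SNRs can be sketched as below. The images of Equations 12 and 13 are not reproduced in this text, so the sketch rests on assumptions: P[x] is taken as max(x, 0), which matches the later description of the maximum value selection against zero; the weight α is assumed to multiply the past estimate and (1 - α) the instantaneous estimate, as in the usual decision-directed form; and the default α = 0.98 is a hypothetical value, not one stated in the source. All function names are illustrative.

```python
def range_limit(x):
    """Range-limiting operator P[x], assumed here to be max(x, 0)."""
    return x if x > 0.0 else 0.0


def past_estimated_snr(prev_gain, prev_gamma):
    """Past estimated SNR 922: squared previous gain times previous a posteriori SNR.

    Passing None for either argument stands for the first frame, where the
    text sets the product to 1 for every bin (bin count taken from the other list).
    """
    if prev_gain is None or prev_gamma is None:
        known = prev_gain if prev_gain is not None else prev_gamma
        if known is None:
            raise ValueError("bin count unknown: supply at least one list")
        return [1.0] * len(known)
    return [g * g * gm for g, gm in zip(prev_gain, prev_gamma)]


def estimate_a_priori_snr(gamma_now, past_estimate, alpha=0.98):
    """Weighted sum of past estimate and instantaneous estimate P[gamma - 1].

    Which term alpha multiplies is an assumption; see the lead-in.
    """
    return [
        alpha * past + (1.0 - alpha) * range_limit(g - 1.0)
        for g, past in zip(gamma_now, past_estimate)
    ]
```

With α = 0.5, a bin with γ_n(k) = 3 and a past estimate of 1 yields 0.5 · 1 + 0.5 · 2 = 1.5, while a bin with γ_n(k) = 0.5 is clipped by the range limiter and yields 0.5.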

【００８５】FIG. 35 is a block diagram showing the configuration of the multiplex range-limiting processing unit 701 of FIG. 34. The multiplex range-limiting processing unit 701 includes a constant storage unit 7011, maximum value selection units 7012_0 to 7012_{K-1}, a separation unit 7013, and a multiplexing unit 7014.

【００８６】The separation unit 7013 is supplied with γ_n(k) - 1 from the adder 708 of FIG. 34. The separation unit 7013 separates the supplied γ_n(k) - 1 into K frequency-specific components and supplies them to the maximum value selection units 7012_0 to 7012_{K-1}.

【００８７】Zero is supplied from the constant storage unit 7011 to the other input of each of the maximum value selection units 7012_0 to 7012_{K-1}. The maximum value selection units 7012_0 to 7012_{K-1} compare γ_n(k) - 1 with zero and pass the larger value to the multiplexing unit 7014. This maximum value selection is equivalent to executing Equation 12. The multiplexing unit 7014 multiplexes these values and outputs them.

【００８８】FIG. 36 is a block diagram showing the configuration of the multiplex weighted addition unit 707 of FIG. 34. The multiplex weighted addition unit 707 includes weighted addition units 7071_0 to 7071_{K-1}, separation units 7072 and 7074, and a multiplexing unit 7075.

【００８９】The separation unit 7072 is supplied with P[γ_n(k) - 1] as the instantaneous estimated SNR 921 from the multiplex range-limiting processing unit 701 of FIG. 34. The separation unit 7072 separates P[γ_n(k) - 1] into K frequency-specific components and passes them to the weighted addition units 7071_0 to 7071_{K-1} as the frequency-specific instantaneous estimated SNRs 921_0 to 921_{K-1}.

【００９０】The separation unit 7074 is supplied with G²_{n-1}(k) bar γ_{n-1}(k) as the past estimated SNR 922 from the multiplex multiplication unit 705 of FIG. 34. The separation unit 7074 separates G²_{n-1}(k) bar γ_{n-1}(k) into K frequency-specific components and passes them to the weighted addition units 7071_0 to 7071_{K-1} as the past frequency-specific estimated SNRs 922_0 to 922_{K-1}.

【００９１】The weight 923 is also supplied to the weighted addition units 7071_0 to 7071_{K-1}. The weighted addition units 7071_0 to 7071_{K-1} execute the weighted addition expressed by Equation 13 and pass the frequency-specific estimated a priori SNRs 924_0 to 924_{K-1} to the multiplexing unit 7075.

【００９２】The multiplexing unit 7075 multiplexes the frequency-specific estimated a priori SNRs 924_0 to 924_{K-1} and outputs the result as the estimated a priori SNR 924.

【００９３】The operation and configuration of the weighted addition units 7071_0 to 7071_{K-1} are the same as those of the weighted addition unit 407 described with reference to FIG. 28, except that the weighted addition is always calculated.

【００９４】FIG. 37 is a block diagram showing the configuration of the noise suppression coefficient generation unit 8 of FIG. 25. The noise suppression coefficient generation unit 8 includes suppression coefficient search units 801_0 to 801_{K-1}, separation units 802 and 803, and a multiplexing unit 804.

【００９５】The separation unit 802 is supplied with the a posteriori SNR from the frequency-specific SNR calculation unit 6 of FIG. 25. The separation unit 802 separates the supplied a posteriori SNR into K frequency-specific components and passes them to the suppression coefficient search units 801_0 to 801_{K-1}.

【００９６】The separation unit 803 is supplied with the estimated a priori SNR from the estimated a priori SNR calculation unit 7 of FIG. 25. The separation unit 803 separates the supplied estimated a priori SNR into K frequency-specific components and passes them to the suppression coefficient search units 801_0 to 801_{K-1}.

【００９７】The suppression coefficient search units 801_0 to 801_{K-1} look up the suppression coefficient corresponding to the supplied a posteriori SNR and estimated a priori SNR, and pass the result to the multiplexing unit 804.

【００９８】The multiplexing unit 804 multiplexes the supplied suppression coefficients and outputs them.

【００９９】FIG. 38 is a block diagram showing the configuration of the suppression coefficient search units 801_0 to 801_{K-1} of FIG. 37. Each suppression coefficient search unit 801 includes a suppression coefficient table 8011 and address conversion units 8012 and 8013.

【０１００】The address conversion unit 8012 is supplied with the frequency-specific a posteriori SNR from the separation unit 802 of FIG. 37. The address conversion unit 8012 converts the supplied frequency-specific a posteriori SNR into the corresponding address and passes it to the suppression coefficient table 8011.

【０１０１】The address conversion unit 8013 is supplied with the frequency-specific estimated a priori SNR from the separation unit 803 of FIG. 37. The address conversion unit 8013 converts the supplied frequency-specific estimated a priori SNR into the corresponding address and passes it to the suppression coefficient table 8011.

【０１０２】The suppression coefficient table 8011 outputs, as the frequency-specific suppression coefficient, the suppression coefficient stored in the area corresponding to the addresses supplied from the address conversion unit 8012 and the address conversion unit 8013.
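The address conversion and table lookup can be illustrated with the sketch below. The source does not state how an SNR is mapped to an address or what values the table holds, so everything concrete here is hypothetical: the uniform decibel grid (`LO_DB`, `HI_DB`, `STEPS`), the function names, and in particular the placeholder gain rule `placeholder_gain`, which is a simple Wiener-style rule used only to fill the table and is not claimed to be the rule used in the patent.

```python
import math

# Hypothetical quantization grid for both SNR axes (not from the source).
LO_DB, HI_DB, STEPS = -20.0, 30.0, 51


def snr_to_address(snr):
    """Map a linear SNR to a table index on a uniform dB grid (address conversion)."""
    snr_db = 10.0 * math.log10(max(snr, 1e-12))
    snr_db = min(max(snr_db, LO_DB), HI_DB)  # clamp to the table range
    return round((snr_db - LO_DB) / (HI_DB - LO_DB) * (STEPS - 1))


def placeholder_gain(gamma, xi):
    """Placeholder rule xi / (1 + xi); ignores gamma. Stands in for the real rule."""
    return xi / (1.0 + xi)


def build_table(gain_rule):
    """Precompute gains for every (a posteriori, a priori) grid pair."""
    grid = [10.0 ** ((LO_DB + i * (HI_DB - LO_DB) / (STEPS - 1)) / 10.0)
            for i in range(STEPS)]
    return [[gain_rule(g, x) for x in grid] for g in grid]


def look_up_gain(table, gamma, xi):
    """Return the stored suppression coefficient for one frequency bin."""
    return table[snr_to_address(gamma)][snr_to_address(xi)]
```

Precomputing the table trades memory for speed: the per-bin work at run time reduces to two quantizations and one indexed read, which is presumably why the description uses a table rather than evaluating a gain formula directly.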

【０１０３】[0103]

【発明が解決しようとする課題】[Problems to Be Solved by the Invention] In the conventional technique, however, enhanced speech is obtained by overlap-adding signal samples taken from two adjacent frames of the time-domain signal produced by the inverse Fourier transform. Meanwhile, the window function applied to the time-domain signal before the Fourier transform is designed so that the input is reproduced at the output when no noise suppression is performed.

【０１０４】Consequently, when the samples to be overlap-added are suppressed with different suppression coefficient values in adjacent frames, a discontinuity arises in the signal samples at the frame boundary, and the resulting noise in the output signal degrades the subjective sound quality.

【０１０５】An object of the present invention is therefore to provide a noise removing method and apparatus that yield enhanced speech of excellent subjective sound quality without generating noise in the output signal.

【０１０６】[0106]

【課題を解決するための手段】[Means for Solving the Problems] To solve the above problem, the noise removing method of the present invention converts an input signal into a frequency-domain signal, obtains a signal-to-noise ratio based on the frequency-domain signal, determines a suppression coefficient based on the signal-to-noise ratio, weights the frequency-domain signal based on the suppression coefficient, and converts the weighted frequency-domain signal into a time-domain signal to obtain an output signal from which noise has been removed. In doing so, a first windowing is applied to the input signal before it is converted into the frequency-domain signal, and a second windowing is applied to the time-domain signal to obtain the output signal.

【０１０７】Another noise removing method of the present invention converts an input signal into a frequency-domain signal, obtains estimated noise based on the frequency-domain signal, subtracts a value corresponding to the estimated noise from the frequency-domain signal to obtain enhanced speech in the frequency domain, and converts the enhanced speech into a time-domain signal to obtain an output signal from which noise has been removed. In doing so, a first windowing is applied to the input signal before it is converted into the frequency-domain signal, and a second windowing is applied to the time-domain signal to obtain the output signal.

【０１０８】Furthermore, the noise removing apparatus of the present invention includes: a first windowing processing unit that applies windowing to an input signal; a transform unit that converts the input signal windowed by the first windowing processing unit into a frequency-domain signal; a signal-to-noise ratio calculation unit that obtains a signal-to-noise ratio based on the amplitude component of the frequency-domain signal produced by the transform unit; a suppression coefficient generation unit that generates a suppression coefficient based on the signal-to-noise ratio obtained by the signal-to-noise ratio calculation unit; a multiplication unit that weights the amplitude component of the frequency-domain signal based on the suppression coefficient generated by the suppression coefficient generation unit; an inverse transform unit that converts the amplitude component of the frequency-domain signal weighted by the multiplication unit, together with the phase component of the frequency-domain signal produced by the transform unit, into a time-domain signal; and a second windowing processing unit that applies windowing to the time-domain signal produced by the inverse transform unit.

【０１０９】The present invention is also characterized in that enhanced speech is obtained by windowing the inverse Fourier transform outputs taken from two adjacent frames and then overlap-adding them. More specifically, it includes a windowing processing unit for windowing the signal samples taken from two adjacent frames of the inverse Fourier transform output.

【０１１０】In other words, by windowing the inverse Fourier transform outputs that make up two adjacent frames before overlap-adding them, the present invention improves the continuity of the signal samples at frame boundaries and prevents noise.

【０１１１】[0111]

【発明の実施の形態】[Embodiments of the Invention] Embodiments of the present invention will be described below with reference to the drawings.

【０１１２】(Embodiment 1) FIG. 1 is a block diagram showing a schematic configuration of the noise removing apparatus according to Embodiment 1 of the present invention. The noise removing apparatus shown in FIG. 1 differs from the noise removing apparatus shown in FIG. 25 in that it has a windowing processing unit 22. In FIG. 1, parts similar to those shown in FIG. 25 are given the same reference numerals. The operation is described below with a focus on this difference.

【０１１３】The windowing processing unit 22 multiplies X_n(t) bar supplied from the inverse Fourier transform unit 9 by the window function h(t) and passes the product, h(t) X_n(t) bar, to the frame synthesis unit 10.

【０１１４】The frame synthesis unit 10 takes K/2 samples from each of two adjacent frames of h(t) X_n(t) bar, overlaps them, and obtains the enhanced speech X_n(t) hat by Equation 14.

【０１１５】[0115]

【数１４】[Equation 14] The resulting enhanced speech X_n(t) hat (t = 0, 1, ..., K/2-1) is passed to the output terminal 12 as the output of the frame synthesis unit 10. When the overlap is M samples rather than 50% and the frame length is L samples (M < L), the enhanced speech X_n(t) hat is obtained by Equation 15.

【０１１６】[0116]

【数１５】[Equation 15] The frame division unit 1 is modified accordingly.

【０１１７】As already described, a symmetric window function is used for real-valued signals. The window function is also designed so that, when the suppression coefficient is set to 1, the input signal and the output signal match apart from computational error.

【０１１８】Any window function satisfying these conditions can be used as h(t). One example is the square root of the Hanning window (root Hanning window). Other window functions that satisfy these conditions also exist and may be used instead.
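The reconstruction condition stated above can be checked numerically with the following sketch. It assumes a periodic Hanning definition, 0.5 - 0.5·cos(2πt/K), a 50% overlap, and the same root Hanning window for both the first and second windowing. The Fourier transform, suppression, and inverse transform are replaced by a single scalar `gain`, which is a simplification made here for illustration: with a gain of 1 the transform pair is an identity, so omitting it does not affect the check. Function names are illustrative.

```python
import math


def root_hann(K):
    """Square root of a periodic Hanning window of length K."""
    return [math.sqrt(0.5 - 0.5 * math.cos(2.0 * math.pi * t / K)) for t in range(K)]


def window_overlap_add(signal, K, gain=1.0):
    """Window, process, window again, and overlap-add with 50% overlap.

    `gain` stands in for the whole frequency-domain stage (an assumption).
    """
    h = root_hann(K)
    hop = K // 2
    out = [0.0] * len(signal)
    for start in range(0, len(signal) - K + 1, hop):
        # First windowing, applied before the (omitted) Fourier transform.
        frame = [h[t] * signal[start + t] for t in range(K)]
        # Stand-in for Fourier transform, suppression, inverse transform.
        frame = [gain * v for v in frame]
        # Second windowing followed by overlap-add of adjacent frames.
        for t in range(K):
            out[start + t] += h[t] * frame[t]
    return out
```

Because h(t)² + h(t + K/2)² = 1 for this window, every sample covered by two frames is reproduced when `gain` is 1. The first and last K/2 samples are covered by only one frame and are therefore attenuated in this sketch.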

【０１１９】(Embodiment 2) FIG. 2 is a block diagram showing a schematic configuration of the noise removing apparatus according to Embodiment 2 of the present invention. FIG. 2 is the same as FIG. 1 except for the estimated noise calculation unit 5, the weighted degraded speech calculation unit 14, and the suppression coefficient correction unit 15, and like parts are given the same reference numerals.

【０１２０】The configuration shown in FIG. 2 adds the windowing processing unit 22 to the configuration described in "IEICE Technical Report, DSP, April 2000, pp. 53-60" (hereinafter referred to as "Document 5").

【０１２１】Unlike the method described in Document 1, the method described in Document 5 estimates the noise power spectrum using a weighted degraded speech spectrum, which yields an accurate noise estimate. The operation is described below with a focus on these differences.

【０１２２】FIG. 3 is a block diagram showing the configuration of the weighted degraded speech calculation unit 14 of FIG. 2. The weighted degraded speech calculation unit 14 includes an estimated noise storage unit 1401, a frequency-specific SNR calculation unit 1402, a multiplex nonlinear processing unit 1405, and a multiplex multiplication unit 1404.

【０１２３】The estimated noise storage unit 1401 stores the estimated noise power spectrum supplied from the estimated noise calculation unit 5 of FIG. 2 and outputs the estimated noise power spectrum stored one frame earlier to the frequency-specific SNR calculation unit 1402.

【０１２４】The frequency-specific SNR calculation unit 1402 obtains the SNR for each frequency using the estimated noise power spectrum supplied from the estimated noise storage unit 1401 and the degraded speech power spectrum supplied from the multiplex multiplication unit 17 of FIG. 2, and outputs it to the multiplex nonlinear processing unit 1405.

【０１２５】The multiplex nonlinear processing unit 1405 obtains a weighting coefficient vector using the SNR supplied from the frequency-specific SNR calculation unit 1402 and outputs the weighting coefficient vector to the multiplex multiplication unit 1404.

【０１２６】The multiplex multiplication unit 1404 calculates, for each frequency, the product of the degraded speech power spectrum supplied from the multiplex multiplication unit 17 of FIG. 2 and the weighting coefficient vector supplied from the multiplex nonlinear processing unit 1405, and outputs the weighted degraded speech power spectrum to the estimated noise calculation unit 5 of FIG. 2.

【０１２７】The configuration of the frequency-specific SNR calculation unit 1402 is the same as that of the frequency-specific SNR calculation unit 6 described with reference to FIG. 33. The configuration of the multiplex multiplication unit 1404 is the same as that of the multiplex multiplication unit 17 described with reference to FIG. 29.

【０１２８】FIG. 4 is a block diagram showing the configuration of the multiplex nonlinear processing unit 1405 of FIG. 3. The multiplex nonlinear processing unit 1405 includes a separation unit 1495, nonlinear processing units 1485_0 to 1485_{K-1}, and a multiplexing unit 1475.

【０１２９】The separation unit 1495 separates the SNR supplied from the frequency-specific SNR calculation unit 1402 of FIG. 3 into frequency-specific SNRs and outputs them to the nonlinear processing units 1485_0 to 1485_{K-1}.

【０１３０】Each of the nonlinear processing units 1485_0 to 1485_{K-1} has a nonlinear function that outputs a real value according to its input value.

【０１３１】FIG. 5 shows an example of the nonlinear function used by the nonlinear processing units 1485_0 to 1485_{K-1}. With f_1 as the input value, the output value f_2 of the nonlinear function shown in FIG. 5 is given by Equation 16.

【０１３２】[0132]

【数１６】[Equation 16] The nonlinear processing units 1485_0 to 1485_{K-1} process the frequency-specific SNR supplied from the separation unit 1495 with the nonlinear function to obtain weighting coefficients, and output them to the multiplexing unit 1475. That is, the nonlinear processing units 1485_0 to 1485_{K-1} output a weighting coefficient between 1 and 0 according to the SNR: 1 when the SNR is small and 0 when it is large.

【０１３３】The multiplexing unit 1475 multiplexes the weighting coefficients output from the nonlinear processing units 1485_0 to 1485_{K-1} and outputs the weighting coefficient vector to the multiplex multiplication unit 1404.

【０１３４】The weighting coefficient by which the multiplex multiplication unit 1404 of FIG. 3 multiplies the degraded speech power spectrum depends on the SNR: the larger the SNR, that is, the larger the speech component contained in the degraded speech, the smaller the weighting coefficient.
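The SNR-dependent weighting can be sketched as below. The image of Equation 16 and FIG. 5 are not reproduced in this text, so the shape used here is an assumption: a piecewise-linear function that is 1 below a lower SNR threshold, 0 above an upper threshold, and linear in between, which is consistent with the statement that the weight is 1 for small SNR and 0 for large SNR. The thresholds `low` and `high`, the `floor` guard, and the function names are hypothetical.

```python
def snr_weight(snr, low=1.0, high=10.0):
    """Weight in [0, 1]: 1 for snr <= low, 0 for snr >= high, linear in between."""
    if snr <= low:
        return 1.0
    if snr >= high:
        return 0.0
    return (high - snr) / (high - low)


def weighted_degraded_power(degraded_power, prev_noise_power,
                            low=1.0, high=10.0, floor=1e-12):
    """Weight each bin of the degraded speech power spectrum by its SNR.

    prev_noise_power is the estimated noise power stored one frame earlier.
    Bins dominated by speech (high SNR) contribute little to the noise update.
    """
    weighted = []
    for y, lam in zip(degraded_power, prev_noise_power):
        snr = y / max(lam, floor)  # per-bin SNR
        weighted.append(snr_weight(snr, low, high) * y)  # per-bin product
    return weighted
```

For instance, a bin whose power is half the previous noise estimate passes through unchanged, while a bin at twenty times the noise estimate is zeroed and so cannot inflate the noise estimate.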

【０１３５】The degraded speech power spectrum is generally used to update the estimated noise. Weighting that spectrum according to the SNR reduces the influence of the speech component it contains and allows more accurate noise estimation.

【０１３６】Although an example using a nonlinear function to calculate the weighting coefficient has been shown, functions of the SNR in other forms, such as a linear function or a higher-order polynomial, can also be used.

【０１３７】FIG. 6 is a block diagram showing the configuration of the estimated noise calculation unit 5 of FIG. 2. The estimated noise calculation unit 5 of FIG. 2 differs from the estimated noise calculation unit 51 shown in FIG. 25 in that it has a separation unit 505 and in that the frequency-specific estimated noise calculation units 514_0 to 514_{K-1} are replaced with frequency-specific estimated noise calculation units 504_0 to 504_{K-1}. In FIG. 6, parts similar to those shown in FIG. 30 are given the same reference numerals. The operation is described below with a focus on these differences.

【０１３８】The separation unit 505 separates the weighted degraded speech power spectrum supplied from the weighted degraded speech calculation unit 14 of FIG. 2 into frequency-specific weighted degraded speech power spectra and outputs them to the frequency-specific estimated noise calculation units 504_0 to 504_{K-1}.

【０１３９】The frequency-specific estimated noise calculation units 504_0 to 504_{K-1} calculate the frequency-specific estimated noise power spectrum from the frequency-specific degraded speech power spectrum supplied from the separation unit 502, the frequency-specific weighted degraded speech power spectrum supplied from the separation unit 505, the speech detection flag supplied from the speech detection unit 4 of FIG. 2, and the count value supplied from the counter 13 of FIG. 2, and output it to the multiplexing unit 503.

【０１４０】The multiplexing unit 503 multiplexes the frequency-specific estimated noise power spectra supplied from the frequency-specific estimated noise calculation units 504_0 to 504_{K-1} and outputs the estimated noise power spectrum to the frequency-specific SNR calculation unit 6 and the weighted degraded speech calculation unit 14 of FIG. 2.

【０１４１】FIG. 7 is a block diagram showing the configuration of the frequency-specific estimated noise calculation units 504_0 to 504_{K-1} of FIG. 6. They differ from the frequency-specific estimated noise calculation unit 514 shown in FIG. 30 in that they have an estimated noise storage unit 5942, in that the update determination unit 521 is replaced with an update determination unit 520, and in that the input to the switch 5044 is changed from the frequency-specific degraded speech power spectrum to the frequency-specific weighted degraded speech power spectrum. In FIG. 7, parts similar to those shown in FIG. 31 are given the same reference numerals.

【０１４２】The frequency-specific estimated noise calculation units 504_0 to 504_{K-1} of FIG. 6 may instead be configured as described later with reference to FIG. 9.

【０１４３】These differences arise because the frequency-specific estimated noise calculation units 504_0 to 504_{K-1} use the weighted degraded speech power spectrum rather than the degraded speech power spectrum to calculate the estimated noise, and use the estimated noise and the degraded speech power spectrum to decide whether to update the estimated noise.

【０１４４】The estimated noise storage unit 5942 stores the frequency-specific estimated noise power spectrum supplied from the division unit 5048 and outputs the frequency-specific estimated noise power spectrum stored one frame earlier to the update determination unit 520.

【０１４５】FIG. 8 is a block diagram showing the configuration of the update determination unit 520 of FIG. 7. The update determination unit 520 of FIG. 7 differs from the update determination unit 521 shown in FIG. 31 in that the logical OR calculation unit 5211 is replaced with a logical OR calculation unit 5201 and in that it has a comparison unit 5205, a threshold storage unit 5206, and a threshold calculation unit 5207. In FIG. 8, parts similar to those shown in FIG. 32 are given the same reference numerals. The operation is described below with a focus on these differences.

[0146] The threshold calculation unit 5207 calculates a value that depends on the frequency-dependent estimated noise power spectrum supplied from the estimated noise storage unit 5942 of FIG. 7, and outputs the result to the threshold storage unit 5206 as a threshold. The simplest approach is to set the threshold to a constant multiple of the frequency-dependent estimated noise power spectrum. The threshold can also be calculated using a higher-order polynomial or a nonlinear function.

[0147] The threshold storage unit 5206 stores the threshold output from the threshold calculation unit 5207, and outputs the threshold stored one frame earlier to the comparison unit 5205.

[0148] The comparison unit 5205 compares the threshold supplied from the threshold storage unit 5206 with the frequency-dependent degraded speech power spectrum supplied from the separation unit 502 of FIG. 6. It outputs "1" to the OR calculation unit 5201 if the frequency-dependent degraded speech power spectrum is smaller than the threshold, and "0" if it is larger. In other words, whether the degraded speech signal is noise is determined on the basis of the magnitude of the estimated noise power spectrum.

[0149] The OR calculation unit 5201 calculates the logical OR of the output value of the comparison unit 5203, the output value of the logical NOT circuit 5202, and the output value of the comparison unit 5205, and outputs the result to the switch 5044, the shift register 5045, and the counter 5049 of FIG. 7.

[0150] In this way, the update determination unit 520 outputs "1" not only in the initial state and in silent sections but also in voiced sections when the degraded speech power is small. That is, the estimated noise is updated. Because the threshold is calculated for each frequency, the estimated noise can be updated frequency by frequency.
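By way of illustration only, the update decision described for the update determination unit 520 might be sketched as follows. The function name, the `scale` constant, and the two boolean inputs standing in for the initial-state and silent-section conditions are assumptions made for this sketch; the threshold uses the simplest option mentioned above, a constant multiple of the estimated noise of the previous frame.

```python
def update_flags(degraded_power, prev_noise_power, initial_state, silent_frame, scale=2.0):
    """Return one 0/1 update flag per frequency bin (illustrative sketch).

    degraded_power   : degraded speech power per frequency for the current frame
    prev_noise_power : estimated noise power per frequency from the previous frame
    initial_state    : True while the estimator is still in its initial state (assumed input)
    silent_frame     : True when the frame is judged to be a silent section (assumed input)
    scale            : hypothetical constant used to form the threshold
    """
    flags = []
    for y_pow, noise_pow in zip(degraded_power, prev_noise_power):
        threshold = scale * noise_pow      # threshold calculation unit 5207
        low_power = y_pow < threshold      # comparison unit 5205 outputs "1"
        # OR calculation unit 5201: any one condition is enough to trigger an update
        flags.append(1 if (initial_state or silent_frame or low_power) else 0)
    return flags
```

With these assumed values, a bin whose power is below twice the previous noise estimate is updated even in a voiced frame, while a bin with much higher power is left unchanged.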

[0151] In FIG. 7, let CNT be the count value of the counter 5049 and N be the register length of the shift register 5045. Let B_n(k) (n = 0, 1, ..., N-1) be the frequency-dependent weighted degraded speech power spectra stored in the shift register 5045. The frequency-dependent estimated noise power spectrum λ_n(k) output from the division unit 5048 is then given by Equation 17.

【０１５２】[0152]

[Equation 17]

That is, λ_n(k) is the average of the frequency-dependent weighted degraded speech power spectra stored in the shift register 5045. The average can also be calculated using a weighted addition unit (a recursive filter).
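As a rough sketch of the averaging just described, the following keeps the most recent N weighted degraded speech power values for one frequency and averages them. The class and method names are hypothetical, and dividing by the smaller of CNT and N is an assumption inferred from the presence of the counter 5049 and the minimum value selection unit 5047; Equation 17 itself is not reproduced in this text.

```python
from collections import deque


class ShiftRegisterNoiseEstimator:
    """Moving-average noise estimate for a single frequency bin (illustrative)."""

    def __init__(self, register_length):
        self.register = deque(maxlen=register_length)  # shift register 5045
        self.count = 0                                 # counter 5049
        self.noise = 0.0                               # last estimate, held when no update occurs

    def step(self, weighted_power, update):
        """Feed one frame; `update` is the 0/1 output of the update determination unit."""
        if update:
            self.register.append(weighted_power)       # oldest value drops out when full
            self.count += 1
            # Assumed divisor: min(CNT, N), so early frames average only what is stored
            divisor = min(self.count, self.register.maxlen)
            self.noise = sum(self.register) / divisor
        return self.noise
```

When `update` is false the register is left untouched and the previous estimate is returned, which mirrors the switch 5044 blocking the input.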

[0153] FIG. 9 is a block diagram showing an example configuration of the frequency-dependent estimated noise calculation units 504₀ to 504_K-1 of FIG. 6. In this configuration, the frequency-dependent estimated noise calculation units 504₀ to 504_K-1 of FIG. 6 have a weighted addition unit 5071 and a weight storage unit 5072 in place of the shift register 5045, the adder 5046, the minimum value selection unit 5047, the division unit 5048, the counter 5049, and the register length storage unit 5941 of FIG. 7. In FIG. 9, parts identical to those shown in FIG. 6 are given the same reference numerals.

[0154] The weighted addition unit 5071 calculates the frequency-dependent estimated noise using the frequency-dependent estimated noise power spectrum of the previous frame supplied from the estimated noise storage unit 5942, the frequency-dependent weighted degraded speech power spectrum supplied from the switch 5044, and the weight output from the weight storage unit 5072, and outputs it to the multiplexing unit 503.

[0155] That is, letting δ be the weight stored in the weight storage unit 5072 and |Y_n(k)|² with an overbar be the frequency-dependent weighted degraded speech power spectrum, the frequency-dependent estimated noise power spectrum λ_n(k) output from the weighted addition unit 5071 is given by Equation 18.

【０１５６】[0156]

[Equation 18]

The configuration of the weighted addition unit 5071 is the same as that of the weighted addition unit 407 described with reference to FIG. 28, except that the weighted addition is always performed.
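A minimal sketch of a weighted addition of this kind is given below. Equation 18 is not reproduced in this text, so the form used here, a first-order recursive average in which δ weights the previous estimate and 1 - δ weights the new weighted degraded speech power, is an assumption, as is the function name.

```python
def recursive_noise_update(prev_noise, weighted_power, delta):
    """One step of an assumed first-order recursive average for one frequency bin.

    prev_noise     : estimated noise power of the previous frame
    weighted_power : weighted degraded speech power of the current frame
    delta          : weight from the weight storage unit 5072, assumed to lie in [0, 1]
    """
    return delta * prev_noise + (1.0 - delta) * weighted_power


# Example: repeated updates pull the estimate toward a steady input level of 2.0
noise = 10.0
for _ in range(50):
    noise = recursive_noise_update(noise, 2.0, 0.9)
```

Compared with the shift-register average, this form needs only one stored value per frequency, at the cost of an exponentially decaying rather than rectangular averaging window.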

[0157] FIG. 10 is a block diagram showing the configuration of the suppression coefficient correction unit 15 of FIG. 2. The suppression coefficient correction unit 15 corrects the suppression coefficient according to the SNR in order to prevent the degradation in sound quality caused by residual noise, which arises from insufficient suppression when the SNR is low, and by speech distortion, which arises from excessive suppression when the SNR is high.

[0158] As an example of the correction, when the SNR is low a correction value is applied to the suppression coefficient to suppress residual noise, and when the SNR is high a lower limit is set on the suppression coefficient to prevent speech distortion. The suppression coefficient correction unit 15 has frequency-dependent suppression coefficient correction units 1501₀ to 1501_K-1, separation units 1502 and 1503, and a multiplexing unit 1504.

[0159] The separation unit 1502 separates the estimated a priori SNR supplied from the estimated a priori SNR calculation unit 7 of FIG. 2 into frequency-dependent components, and outputs them to the respective frequency-dependent suppression coefficient correction units 1501₀ to 1501_K-1.

[0160] The separation unit 1503 separates the suppression coefficient supplied from the suppression coefficient generation unit 8 of FIG. 2 into frequency-dependent components, and outputs them to the respective frequency-dependent suppression coefficient correction units 1501₀ to 1501_K-1.

[0161] Each of the frequency-dependent suppression coefficient correction units 1501₀ to 1501_K-1 calculates a frequency-dependent corrected suppression coefficient from the frequency-dependent estimated a priori SNR supplied from the separation unit 1502 and the frequency-dependent suppression coefficient supplied from the separation unit 1503, and outputs it to the multiplexing unit 1504.

[0162] The multiplexing unit 1504 multiplexes the frequency-dependent corrected suppression coefficients supplied from the frequency-dependent suppression coefficient correction units 1501₀ to 1501_K-1, and outputs the result as the corrected suppression coefficient to the multiple multiplication unit 16 and the estimated a priori SNR calculation unit 7.

[0163] FIG. 11 is a block diagram showing the configuration of the frequency-dependent suppression coefficient correction units 1501₀ to 1501_K-1 included in the suppression coefficient correction unit 15 of FIG. 2. Each frequency-dependent suppression coefficient correction unit 1501 has a maximum value selection unit 1591, a suppression coefficient lower limit storage unit 1592, a threshold storage unit 1593, a comparison unit 1594, a switch 1595, a correction value storage unit 1596, and a multiplier 1597.

[0164] The suppression coefficient lower limit storage unit 1592 supplies the stored lower limit of the suppression coefficient to the maximum value selection unit 1591.

[0165] The maximum value selection unit 1591 compares the frequency-dependent suppression coefficient supplied from the separation unit 1503 of FIG. 10 with the suppression coefficient lower limit supplied from the suppression coefficient lower limit storage unit 1592, and outputs the larger value to the switch 1595. The suppression coefficient is therefore never smaller than the lower limit stored in the suppression coefficient lower limit storage unit 1592, which prevents the speech distortion caused by excessive suppression.

[0166] The comparison unit 1594 compares the threshold supplied from the threshold storage unit 1593 with the frequency-dependent estimated a priori SNR supplied from the separation unit 1502 of FIG. 10. It supplies "0" to the switch 1595 if the frequency-dependent estimated a priori SNR is larger than the threshold, and "1" if it is smaller.

[0167] The switch 1595 outputs the signal supplied from the maximum value selection unit 1591 to the multiplier 1597 when the output value of the comparison unit 1594 is "1", and to the multiplexing unit 1504 of FIG. 10 when the output value of the comparison unit 1594 is "0". That is, the suppression coefficient is corrected when the frequency-dependent estimated a priori SNR is smaller than the threshold. In this way, the amount of residual noise can be reduced without excessively suppressing the speech component.

[0168] The multiplier 1597 calculates the product of the output value of the switch 1595 and the output value of the correction value storage unit 1596, and outputs the result to the multiplexing unit 1504 of FIG. 10. Because the aim is to reduce the suppression coefficient, the correction value is normally smaller than 1, although other values may be used depending on the purpose.
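The per-frequency correction described for the units 1591 to 1597 can be sketched as below. The function name and the default values for the lower limit, the SNR threshold, and the correction value are hypothetical; the text only states that the correction value is normally smaller than 1. The case in which the SNR exactly equals the threshold is not specified in the text and is treated here as "no correction".

```python
def correct_suppression_coefficient(gain, prior_snr,
                                    lower_limit=0.1, snr_threshold=1.0, correction=0.5):
    """Correct one frequency's suppression coefficient (illustrative sketch).

    gain      : frequency-dependent suppression coefficient
    prior_snr : frequency-dependent estimated a priori SNR
    The three keyword defaults are placeholder values, not values from the source.
    """
    # Maximum value selection unit 1591: never go below the stored lower limit
    limited = max(gain, lower_limit)
    # Comparison unit 1594 outputs "1" when the SNR is below the threshold
    if prior_snr < snr_threshold:
        # Switch 1595 routes the value to the multiplier 1597
        return limited * correction
    # Otherwise the switch 1595 passes the value straight to the multiplexing unit
    return limited
```

In this sketch the lower limit is applied before the multiplication, following the order of the blocks in the description, so a corrected coefficient can end up below the lower limit when the SNR is low.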

[0169] In this embodiment, the corrected suppression coefficient, rather than the uncorrected suppression coefficient, is supplied to the multiple multiplication unit 16 and the estimated a priori SNR calculation unit 7.

[0170] FIG. 12 is a block diagram showing the configuration of a noise suppression coefficient generation unit 81, which is a modification of the noise suppression coefficient generation unit 8 of FIG. 2. The noise suppression coefficient generation unit 81 has a gain function value calculation unit 811, a generalized likelihood ratio calculation unit 812, a speech presence probability storage unit 813, and a suppression coefficient calculation unit 814.

[0171] The noise suppression coefficient generation unit 81 shown in FIG. 12 differs from the noise suppression coefficient generation unit 8 of FIG. 2, which obtains the suppression coefficient by lookup, in that it computes the suppression coefficient from the supplied estimated a priori SNR and a posteriori SNR. The method of calculating the suppression coefficient is described below on the basis of the formulas given in Reference 1.

[0172] Let n be the frame number and k the frequency number. Let γ_n(k) be the frequency-dependent a posteriori SNR supplied from the frequency-dependent SNR calculation unit 6 of FIG. 2, and let ξ̂_n(k) be the frequency-dependent estimated a priori SNR supplied from the estimated a priori SNR calculation unit 7 of FIG. 2. Further, let
η_n(k) = ξ̂_n(k) / q
v_n(k) = (η_n(k) γ_n(k)) / (1 + η_n(k))

[0173] The gain function value calculation unit 811 calculates, for each frequency, an MMSE STSA gain function value such as that described in Reference 1, on the basis of the a posteriori SNR γ_n(k) supplied from the frequency-dependent SNR calculation unit 6 of FIG. 2, the estimated a priori SNR ξ̂_n(k) supplied from the estimated a priori SNR calculation unit 7 of FIG. 2, and the speech presence probability q supplied from the speech presence probability storage unit 813, and outputs it to the suppression coefficient calculation unit 814.

[0174] The MMSE STSA gain function value G_n(k) for each frequency is given by Equation 19.

【０１７５】[0175]

[Equation 19]

Here, I₀(z) is the zeroth-order modified Bessel function and I₁(z) is the first-order modified Bessel function. Modified Bessel functions are described in "Mathematical Dictionary, Iwanami Shoten, 1985, page 374.G" (Reference 6).

[0176] The generalized likelihood ratio calculation unit 812 calculates a generalized likelihood ratio for each frequency on the basis of the a posteriori SNR γ_n(k) supplied from the frequency-dependent SNR calculation unit 6 of FIG. 2, the estimated a priori SNR ξ̂_n(k) supplied from the estimated a priori SNR calculation unit 7 of FIG. 2, and the speech presence probability q supplied from the speech presence probability storage unit 813, and outputs it to the suppression coefficient calculation unit 814.

[0177] The generalized likelihood ratio Λ_n(k) for each frequency is given by Equation 20.

【０１７８】[0178]

[Equation 20]

The suppression coefficient calculation unit 814 calculates a suppression coefficient for each frequency from the MMSE STSA gain function value G_n(k) supplied from the gain function value calculation unit 811 and the generalized likelihood ratio Λ_n(k) supplied from the generalized likelihood ratio calculation unit 812, and outputs it to the suppression coefficient correction unit 15 of FIG. 2.

[0179] The suppression coefficient Ḡ_n(k) for each frequency is given by Equation 21.

【０１８０】[0180]

[Equation 21]

Instead of calculating the SNR for each frequency, it is also possible to obtain an SNR common to a band made up of a plurality of frequencies and to use that.
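For orientation, the calculation performed by the units 811, 812 and 814 might look like the sketch below. Equations 19 to 21 are not reproduced in this text, so the sketch uses the widely published MMSE STSA gain form together with the definitions of η_n(k) and v_n(k) given above. The prior factor q / (1 - q) in the likelihood ratio and the combination Λ / (1 + Λ) · G are assumptions and may differ from the source equations. All function names are hypothetical, and the Bessel functions are evaluated by a truncated power series that is only suitable for moderate arguments.

```python
import math


def bessel_i0(z, terms=60):
    """Zeroth-order modified Bessel function, truncated power series."""
    half_sq = (z / 2.0) ** 2
    total, term = 0.0, 1.0
    for m in range(terms):
        total += term
        term *= half_sq / ((m + 1) ** 2)
    return total


def bessel_i1(z, terms=60):
    """First-order modified Bessel function, truncated power series."""
    half_sq = (z / 2.0) ** 2
    total, term = 0.0, z / 2.0
    for m in range(terms):
        total += term
        term *= half_sq / ((m + 1) * (m + 2))
    return total


def suppression_coefficient(prior_snr, post_snr, q):
    """Suppression coefficient for one frequency; requires post_snr > 0 and 0 < q < 1."""
    eta = prior_snr / q                          # eta_n(k), as defined in the text
    v = eta * post_snr / (1.0 + eta)             # v_n(k), as defined in the text
    # Gain function value (unit 811), standard MMSE STSA form
    gain = (math.sqrt(math.pi) / 2.0) * (math.sqrt(v) / post_snr) * math.exp(-v / 2.0) * (
        (1.0 + v) * bessel_i0(v / 2.0) + v * bessel_i1(v / 2.0))
    # Generalized likelihood ratio (unit 812); the prior factor is an ASSUMPTION
    ratio = (q / (1.0 - q)) * math.exp(v) / (1.0 + eta)
    # Suppression coefficient (unit 814); this combination is an ASSUMPTION
    return ratio / (1.0 + ratio) * gain
```

A production implementation would more likely use a numerical library's scaled Bessel routines to avoid overflow at high SNR; the series above is kept only so that the sketch is self-contained.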

[0181] FIG. 13 is a block diagram showing an example configuration of a frequency-dependent SNR calculation unit 61, which is a modification of the frequency-dependent SNR calculation unit 6 of FIG. 2. The frequency-dependent SNR calculation unit 61 of FIG. 13 differs from the frequency-dependent SNR calculation unit 6 shown in FIG. 2 in that the band-specific SNR calculation unit 61 has band-specific power calculation units 611 and 612. In FIG. 13, parts identical to those shown in FIG. 33 are given the same reference numerals.

[0182] The band-specific power calculation unit 611 calculates the power of each band from the frequency-dependent degraded speech power spectrum supplied from the separation unit 602, and outputs it to the division units 601₀ to 601_K-1.

[0183] The band-specific power calculation unit 612 calculates the power of each band from the frequency-dependent estimated noise power spectrum supplied from the separation unit 603, and outputs it to the division units 601₀ to 601_K-1.

[0184] FIG. 14 is a block diagram showing the configuration of the band-specific power calculation unit 611 of FIG. 13. An example is described here in which the spectrum is divided equally into M bands, each of bandwidth L, where L and M are natural numbers satisfying K = LM.

[0185] The band-specific SNR calculation unit 61 has adders 6110₀ to 6110_M-1. The frequency-dependent degraded speech power spectra 910₀ to 910_K-1 (910₀ to 910_ML-1) supplied from the separation unit 602 of FIG. 13 are each passed to the one of the adders 6110₀ to 6110_M-1 that corresponds to its frequency.

[0186] For example, the frequency numbers corresponding to band number 0 are 0 to L-1, so the frequency-dependent degraded speech power spectra 910₀ to 910_L-1 are passed to the adder 6110₀. Likewise, the frequency numbers corresponding to band number 1 are L to 2L-1, so the frequency-dependent degraded speech power spectra 910_L to 910_2L-1 are passed to the adder 6110₁.

[0187] Each of the adders 6110₀ to 6110_M-1 calculates the sum of the frequency-dependent degraded speech power spectra supplied to it, and the band-specific degraded speech power spectra 911₀ to 911_ML-1 (911₀ to 911_K-1) are output to the division units 601₀ to 601_K-1 of FIG. 13. The result calculated by each of the adders 6110₀ to 6110_M-1 is output as the band-specific degraded speech power spectrum for every frequency belonging to its band number.

[0188] For example, the result of the adder 6110₀ is output as the band-specific degraded speech power spectra 911₀ to 911_L-1, and the result of the adder 6110₁ is output as the band-specific degraded speech power spectra 911_L to 911_2L-1. The band-specific power calculation unit 612 has the same configuration and operation as the band-specific power calculation unit 611.
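The equal-width band summation described for the adders 6110₀ to 6110_M-1 can be illustrated as follows. The function name is hypothetical; the input is a list of K per-frequency power values and the band width L must divide K, as required by K = LM.

```python
def band_power(spectrum, band_width):
    """Sum power over equal-width bands and repeat each sum for every frequency in its band.

    spectrum   : K per-frequency power values (e.g. degraded speech power)
    band_width : L, the number of frequencies per band; K must be a multiple of L
    Returns a list of K values, one band-specific power per frequency.
    """
    if band_width <= 0 or len(spectrum) % band_width != 0:
        raise ValueError("K must be a positive multiple of the band width L")
    result = []
    for start in range(0, len(spectrum), band_width):
        total = sum(spectrum[start:start + band_width])  # one adder per band
        result.extend([total] * band_width)              # same value for all L frequencies
    return result
```

For instance, with K = 4 and L = 2, the input [1.0, 2.0, 3.0, 4.0] yields [3.0, 3.0, 7.0, 7.0], so each downstream division unit still receives one value per frequency.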

[0189] Although an example of equal division into a plurality of bands has been shown here, other band division methods can also be used, such as division into critical bands as described in "Hearing and Speech, The Institute of Electronics, Information and Communication Engineers, 1980, pages 115-118" (Reference 7), or division into octave bands as described in "Multirate Digital Signal Processing, 1983, Prentice-Hall Inc., USA" (hereinafter referred to as "Reference 8").

[0190] (Embodiment 3) FIG. 15 is a block diagram showing a schematic configuration of a noise removal device according to Embodiment 3 of the present invention. The noise removal device shown in FIG. 15 differs from that of FIG. 2 in that the estimated noise calculation unit 5 is replaced by an estimated noise calculation unit 52, and in that the weighted degraded speech calculation unit 14 is absent. In FIG. 15, parts identical to those shown in FIG. 2 are given the same reference numerals. The operation is described below with a focus on these differences.

[0191] FIG. 16 is a block diagram showing the configuration of the estimated noise calculation unit 52 of FIG. 15. The estimated noise calculation unit 52 of FIG. 15 differs from the estimated noise calculation unit 5 of FIG. 2 in that the frequency-dependent estimated noise calculation units 504₀ to 504_K-1 are replaced by frequency-dependent estimated noise calculation units 506₀ to 506_K-1, and in that the estimated noise calculation unit 52 does not take the weighted degraded speech power spectrum as an input signal. In FIG. 16, parts identical to those shown in FIG. 6 are given the same reference numerals.

[0192] This input is absent because, whereas the frequency-dependent estimated noise calculation units 504₀ to 504_K-1 require the frequency-dependent weighted degraded speech power spectrum as an input signal, the estimated noise calculation units 506₀ to 506_K-1 do not.

[0193] FIG. 17 is a block diagram showing the configuration of the frequency-dependent estimated noise calculation units 506₀ to 506_K-1 of FIG. 16. The frequency-dependent estimated noise calculation units 506₀ to 506_K-1 of FIG. 16 differ from the frequency-dependent estimated noise calculation units 504₀ to 504_K-1 of FIG. 6 in that the frequency-dependent estimated noise calculation unit 506 does not take the frequency-dependent weighted degraded speech power spectrum as an input signal, and in that it has a division unit 5041, a nonlinear processing unit 5042, and a multiplier 5043. In FIG. 17, parts identical to those shown in FIG. 7 are given the same reference numerals. The operation is described below with a focus on these differences.

[0194] The division unit 5041 divides the frequency-dependent degraded speech power spectrum supplied from the separation unit 502 of FIG. 16 by the estimated noise power spectrum of the previous frame supplied from the estimated noise storage unit 5942, and outputs the result to the nonlinear processing unit 5042. The nonlinear processing unit 5042, which has the same configuration and function as the nonlinear processing unit 1485 shown in FIG. 4, calculates a weighting coefficient according to the output value of the division unit 5041 and outputs it to the multiplier 5043.

[0195] The multiplier 5043 calculates the product of the frequency-dependent degraded speech power spectrum supplied from the separation unit 502 of FIG. 15 and the weighting coefficient supplied from the nonlinear processing unit 5042, and outputs it to the switch 5044. The output signal of the multiplier 5043 is equivalent to the frequency-dependent weighted degraded speech power spectrum in the frequency-dependent estimated noise calculation unit 504 of FIG. 5. In other words, the frequency-dependent weighted degraded speech power spectrum can also be calculated inside the frequency-dependent estimated noise calculation unit 506, so the weighted degraded speech calculation unit 14 can be omitted.
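The internal weighting path formed by the division unit 5041, the nonlinear processing unit 5042 and the multiplier 5043 might be sketched as follows. The shape of the nonlinear function is not given in this part of the text, so `example_weight` is a purely hypothetical choice (weight 1 at low ratios, falling linearly to 0 at high ratios), and its breakpoints are placeholder values.

```python
def example_weight(ratio, low=1.0, high=10.0):
    """Hypothetical nonlinear function: 1 below `low`, 0 above `high`, linear in between."""
    if ratio <= low:
        return 1.0
    if ratio >= high:
        return 0.0
    return (high - ratio) / (high - low)


def weighted_degraded_power(degraded_power, prev_noise_power, weight_fn=example_weight):
    """Weighted degraded speech power for one frequency bin (illustrative sketch).

    degraded_power   : degraded speech power of the current frame
    prev_noise_power : estimated noise power of the previous frame (must be > 0)
    weight_fn        : stand-in for the nonlinear processing unit 5042
    """
    ratio = degraded_power / prev_noise_power   # division unit 5041
    weight = weight_fn(ratio)                   # nonlinear processing unit 5042
    return degraded_power * weight              # multiplier 5043
```

Passing the nonlinear function as a parameter keeps the sketch independent of whatever characteristic the nonlinear processing unit actually implements.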

[0196] (Embodiment 4) FIG. 18 is a block diagram showing a schematic configuration of a noise removal device according to Embodiment 4 of the present invention. The noise removal device shown in FIG. 18 differs from the noise removal device shown in FIG. 2 in that it has an estimated a priori SNR calculation unit 71. In FIG. 18, parts identical to those shown in FIG. 2 are given the same reference numerals. The operation is described below with a focus on this difference.

[0197] FIG. 19 is a block diagram showing the configuration of the estimated a priori SNR calculation unit 71 of FIG. 18. The estimated a priori SNR calculation unit 71 of FIG. 18 has an estimated noise storage unit 712, an enhanced speech power spectrum storage unit 713, a frequency-dependent SNR calculation unit 715, and a multiple multiplication unit 716 in place of the a posteriori SNR storage unit 702, the suppression coefficient storage unit 703, and the multiple multiplication units 705 and 704 of the estimated a priori SNR calculation unit 7 of FIG. 34. In FIG. 19, parts identical to those shown in FIG. 34 are given the same reference numerals.

[0198] Here, the estimated a priori SNR calculation unit 71 takes the enhanced speech amplitude spectrum and the estimated noise power spectrum as input signals in place of the suppression coefficient.

[0199] The multiple multiplication unit 716 squares the enhanced speech amplitude spectrum supplied from the multiple multiplication unit 16 of FIG. 18 for each frequency to obtain the enhanced speech power spectrum, and outputs it to the enhanced speech power spectrum storage unit 713. The configuration of the multiple multiplication unit 716 is the same as that of the multiple multiplication unit 17 described with reference to FIG. 29.

[0200] The enhanced speech power spectrum storage unit 713 stores the enhanced speech power spectrum supplied from the multiple multiplication unit 716, and outputs the enhanced speech power spectrum supplied one frame earlier to the frequency-dependent SNR calculation unit 715. The configuration of the frequency-dependent SNR calculation unit 715 is the same as that of the frequency-dependent SNR calculation unit 6 described with reference to FIG. 33.

[0201] The estimated noise storage unit 712 stores the estimated noise power spectrum supplied from the estimated noise calculation unit 5 of FIG. 18, and outputs the estimated noise power spectrum supplied one frame earlier to the frequency-dependent SNR calculation unit 715.

[0202] The frequency-dependent SNR calculation unit 715 calculates, for each frequency, the SNR between the enhanced speech power spectrum supplied from the enhanced speech power spectrum storage unit 713 and the estimated noise power spectrum supplied from the estimated noise storage unit 712, and outputs it to the multiple weighted addition unit 707.
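As an illustration of the one-frame-delayed SNR path formed by the units 712, 713, 715 and 716, a small sketch follows. The class name and the initial contents of the two stores (zero enhanced speech power and unit noise power) are assumptions made only so that the first frame produces a defined output.

```python
class PreviousFrameSnr:
    """Per-frequency SNR of the previous frame's enhanced speech to its noise estimate."""

    def __init__(self, num_bins):
        self.prev_enhanced_power = [0.0] * num_bins  # storage unit 713 (initial value assumed)
        self.prev_noise_power = [1.0] * num_bins     # storage unit 712 (initial value assumed)

    def step(self, enhanced_amplitude, noise_power):
        """Return the SNR from the stored frame, then store the current frame."""
        # Frequency-dependent SNR calculation unit 715: uses values stored one frame earlier
        snr = [s / n for s, n in zip(self.prev_enhanced_power, self.prev_noise_power)]
        # Multiple multiplication unit 716: square the amplitude spectrum bin by bin
        self.prev_enhanced_power = [a * a for a in enhanced_amplitude]
        self.prev_noise_power = list(noise_power)
        return snr
```

Each call therefore returns the ratio formed from the previous call's inputs, which is the quantity handed on to the multiple weighted addition unit in the description.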

[0203] The output signal of the frequency-dependent SNR calculation unit 715 is equivalent to the output signal of the multiple multiplication unit 705 of FIG. 34. In this embodiment, therefore, the estimated a priori SNR calculation unit 7 can be replaced by the estimated a priori SNR calculation unit 71.

[0204] (Embodiment 5) FIG. 20 is a block diagram showing a schematic configuration of a noise removal device according to Embodiment 5 of the present invention. The noise removal device shown in FIG. 20 differs from the noise removal device shown in FIG. 2 in that the estimated noise calculation unit 5 is replaced by the estimated noise calculation unit 52 and the estimated a priori SNR calculation unit 7 is replaced by the estimated a priori SNR calculation unit 71, and in that the weighted degraded speech calculation unit 14 is absent.

[0205] The configuration and operation of the estimated noise calculation unit 52 are the same as those of the estimated noise calculation unit described in Embodiment 3. The estimated a priori SNR calculation unit 71 is the same as the estimated a priori SNR calculation unit described in Embodiment 4.

【０２０６】（実施形態６）図２１は、本発明の実施形
態６のノイズ除去装置の模式的な構成を示すブロック図
である。図２１のノイズ除去装置は、図２に示したノイ
ズ除去装置に対して、推定雑音計算部５が推定雑音計算
部５３で置換されている点と、音声検出部４が存在しな
い点とが相違する。すなわち、図２１のノイズ除去装置
は、雑音の推定に音声検出部を必要としない。以下、こ
れらの相違点を中心に動作を説明する。(Sixth Embodiment) FIG. 21 is a block diagram showing a schematic configuration of a noise removing device according to a sixth embodiment of the present invention. The noise removing device of FIG. 21 differs from the noise removing device shown in FIG. 2 in that the estimated noise calculation unit 5 is replaced by the estimated noise calculation unit 53 and in that the voice detection unit 4 is absent. That is, the noise removing device of FIG. 21 does not need a voice detection unit for noise estimation. The operation is described below with a focus on these differences.

【０２０７】図２２は、図２１の推定雑音計算部５３の
構成を示すブロック図である。図２１の推定雑音計算部
５３は、図６に示した推定雑音計算部５に対して、周波
数別推定雑音計算部５０４₀〜５０４_K-1が周波数別推定
雑音計算部５０８₀〜５０８_K-1に置換されている点と、
推定雑音計算部５３が入力信号に音声検出フラグを有し
ていない点とが相違する。FIG. 22 is a block diagram showing the configuration of the estimated noise calculation unit 53 of FIG. 21. The estimated noise calculation unit 53 of FIG. 21 differs from the estimated noise calculation unit 5 shown in FIG. 6 in that the frequency-specific estimated noise calculation units 504₀ to 504_K-1 are replaced by the frequency-specific estimated noise calculation units 508₀ to 508_K-1 and in that the estimated noise calculation unit 53 has no voice detection flag among its input signals.

【０２０８】図２３は、図２２の周波数別推定雑音計算
部５０８₀〜５０８_K-1の構成を示すブロック図である。
図２２の周波数別推定雑音計算部５０８₀〜５０８
_K-1は、図７に示した周波数別推定雑音計算部５０４に
対して、更新判定部５２０が更新判定部５２２に置換さ
れている点と、周波数別推定雑音計算部５０８₀〜５０
８_K-1が入力に音声検出フラグを有していない点とが相
違する。FIG. 23 is a block diagram showing the configuration of the frequency-specific estimated noise calculation units 508₀ to 508_K-1 of FIG. 22. The frequency-specific estimated noise calculation units 508₀ to 508_K-1 of FIG. 22 differ from the frequency-specific estimated noise calculation unit 504 shown in FIG. 7 in that the update determination unit 520 is replaced by the update determination unit 522 and in that the units 508₀ to 508_K-1 have no voice detection flag as an input.

【０２０９】図２４は、図２３の更新判定部５２２の構
成を示すブロック図である。図２３の更新判定部５２２
は、図８に示した更新判定部５２０に対して、論理和計
算部５２０１が論理和計算部５２２１に置換されている
点と、更新判定部５２２が論理否定回路５２０２を有し
ていない点と、入力信号に音声検出フラグを有していな
い点とが相違する。すなわち、更新判定部５２２は、推
定雑音の更新に音声検出フラグを用いていない。この点
が、図８の更新判定部５２０と異なる。FIG. 24 is a block diagram showing the configuration of the update determination unit 522 of FIG. 23. The update determination unit 522 of FIG. 23 differs from the update determination unit 520 shown in FIG. 8 in that the logical sum calculation unit 5201 is replaced by the logical sum calculation unit 5221, in that the update determination unit 522 has no logical negation circuit 5202, and in that it has no voice detection flag among its input signals. That is, the update determination unit 522 does not use the voice detection flag for updating the estimated noise. This is the point of difference from the update determination unit 520 of FIG. 8.

【０２１０】論理和計算部５２２１は、比較部５２０５
の出力値と比較部５２０３の出力値の論理和を計算し、
計算結果を図２３のスイッチ５０４４、シフトレジスタ
５０４５及びカウンタ５０４９に出力する。すなわち、
更新判定部５２２は、カウント値が予め設定された値に
到達するまでは常に“１”を出力し、到達した後は、劣
化音声パワーが閾値よりも小さいときに“１”を出力す
る。The logical sum calculation unit 5221 calculates the logical sum of the output value of the comparison unit 5205 and the output value of the comparison unit 5203, and outputs the result to the switch 5044, the shift register 5045, and the counter 5049 of FIG. 23. That is, the update determination unit 522 always outputs “1” until the count value reaches a preset value and, after that, outputs “1” when the deteriorated speech power is smaller than the threshold value.
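
The decision described in this paragraph can be sketched in Python as follows. This is only a minimal illustration: the function and parameter names are hypothetical, and the assignment of the count comparison to comparison unit 5203 is an assumption inferred from the text, which states only that comparison unit 5205 judges whether the deteriorated speech is noise.

```python
def update_decision(count, count_limit, degraded_power, threshold):
    """Return 1 when the noise estimate should be updated, else 0."""
    # Assumed role of comparison unit 5203: true until the counter
    # reaches the preset value (initial start-up period).
    still_initializing = count < count_limit
    # Role of comparison unit 5205: true when the deteriorated speech power
    # in this frequency bin is below the threshold (treated as noise).
    looks_like_noise = degraded_power < threshold
    # Logical sum calculation unit 5221: OR of the two comparison results.
    return 1 if (still_initializing or looks_like_noise) else 0


print(update_decision(3, 10, 5.0, 1.0))   # counter has not reached the preset value
print(update_decision(10, 10, 0.5, 1.0))  # after start-up, power below threshold
print(update_decision(10, 10, 5.0, 1.0))  # after start-up, power above threshold
```

No voice detection flag appears among the inputs, which is the point of the sixth embodiment: the per-frequency power comparison takes over that role.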

【０２１１】図８で説明した通り、比較部５２０５は劣
化音声信号が雑音であるか否かの判定を行っている。す
なわち、比較部５２０５は各周波数毎に音声検出を行っ
ているといえる。したがって、音声検出フラグを入力に
有しない更新判定部を実現することが可能となる。As described with reference to FIG. 8, the comparison unit 5205 determines whether or not the deteriorated voice signal is noise. That is, it can be said that the comparison unit 5205 performs voice detection for each frequency. Therefore, it is possible to realize an update determination unit that does not have a voice detection flag as an input.

【０２１２】以上、本発明の各実施形態では、ノイズ除
去の方式として、最小平均２乗誤差短時間スペクトル振
幅法を用いる場合を例に説明したが、その他の方法にも
適用することができる。As described above, in each of the embodiments of the present invention, the case of using the minimum mean square error short-time spectrum amplitude method has been described as an example of the noise removal method, but it can be applied to other methods.

【０２１３】例えば「1979年12月、プロシーディングス
・オブ・ザ・アイ・イー・イー・イー、第67巻、第12号
(PROCEEDINGS OF THE IEEE,VOL.67,NO.12,PP.1586-160
4,DEC,1979)、1586〜1604ページ」(文献９)に記載され
ているウィーナーフィルタ法や、「1979年4月、アイ・
イー・イー・イー・トランザクションズ・オン・アクー
スティクス・スピーチ・アンド・シグナル・プロセシン
グ、第27巻、第2号(IEEETRANSACTIONS ON ACOUSTICS,SP
EECH,AND SIGNAL PROCESSING,VOL.27,NO.2,PP.113-120,
APR,1979)、113〜120ページ」(以下、「文献１０」と称
する。)に記載されているスペクトル減算法などであ
る。Examples are the Wiener filter method described in "Proceedings of the IEEE, Vol. 67, No. 12, pp. 1586-1604, December 1979" (Reference 9) and the spectral subtraction method described in "IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol. 27, No. 2, pp. 113-120, April 1979" (hereinafter referred to as "Reference 10").

【０２１４】図２１は文献１０に記載されているスペク
トル減算法の概略動作の説明図である。図２１におい
て、多重乗算部１６を多重減算部に、雑音抑圧係数生成
部８を雑音抑圧量計算部に、抑圧係数補正部１５を抑圧
量補正部に置き換えれば、スペクトル減算法による動作
を実現することができる。FIG. 21 is an explanatory diagram of the general operation of the spectral subtraction method described in Reference 10. In FIG. 21, replacing the multiplex multiplication unit 16 with a multiplex subtraction unit, the noise suppression coefficient generation unit 8 with a noise suppression amount calculation unit, and the suppression coefficient correction unit 15 with a suppression amount correction unit realizes operation by the spectral subtraction method.

【０２１５】多重減算部において、補正された雑音抑圧
量を劣化音声振幅スペクトルから減算し、得られた結果
を逆フーリエ変換することによって、強調音声を得るこ
とができる。ここでは、ＳＮＲを計算してから、ＳＮＲ
に基づいて雑音抑圧量を計算する例について説明した
が、推定雑音計算部５３で得られた推定雑音を、直接劣
化音声振幅スペクトルから減算することもできる。In the multiplex subtraction unit, the emphasized speech can be obtained by subtracting the corrected noise suppression amount from the deteriorated speech amplitude spectrum and applying the inverse Fourier transform to the result. An example has been described here in which the SNR is calculated first and the noise suppression amount is then calculated from the SNR, but the estimated noise obtained by the estimated noise calculation unit 53 can also be subtracted directly from the deteriorated speech amplitude spectrum.
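
The direct-subtraction variant mentioned at the end of this paragraph can be sketched in Python as follows. This is a simplified illustration under stated assumptions: a naive DFT stands in for the Fourier transform units, the phase of the deteriorated speech is reused unchanged, and negative amplitudes are floored at zero. The flooring and all function names are hypothetical choices, not details given in this section.

```python
import cmath
import math


def dft(x):
    """Naive discrete Fourier transform (adequate for a short demo frame)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spec):
    """Naive inverse DFT; returns the real part of each sample."""
    n = len(spec)
    return [sum(spec[k] * cmath.exp(2j * math.pi * k * t / n)
                for k in range(n)).real / n
            for t in range(n)]


def spectral_subtraction(frame, noise_amplitude):
    """Subtract an estimated noise amplitude from each bin, keep the phase."""
    enhanced = []
    for value, noise in zip(dft(frame), noise_amplitude):
        # Flooring at zero is an illustrative assumption that prevents
        # negative amplitudes after subtraction.
        amplitude = max(abs(value) - noise, 0.0)
        enhanced.append(cmath.rect(amplitude, cmath.phase(value)))
    return idft(enhanced)


# Hypothetical four-sample frame and a flat noise amplitude estimate.
frame = [0.0, 1.0, 0.0, -1.0]
print(spectral_subtraction(frame, [0.5] * 4))
```

With a noise estimate of zero in every bin the function returns the input frame up to rounding error, which is a convenient sanity check.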

【０２１６】[0216]

【発明の効果】以上説明したように、本発明は、隣接す
る２フレームを構成する逆フーリエ変換出力を窓がけ処
理してから重ね合わせ加算するので、フレーム境界にお
ける信号サンプルの連続性が改善され、雑音を防止する
ことができる。As described above, the present invention applies windowing to the inverse Fourier transform outputs that form two adjacent frames and then overlap-adds them, so the continuity of signal samples at frame boundaries is improved and noise can be prevented.
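
A minimal Python sketch of windowing followed by overlap-add is given below. It assumes a sine window applied both before and after processing with 50% frame overlap, so that the squared windows of adjacent frames sum to 1. The window shape, frame length, and function names are illustrative assumptions, and the `process` callback merely stands in for the Fourier transform, suppression, and inverse Fourier transform stages.

```python
import math


def sine_window(n):
    """Window whose square sums to 1 across two frames at 50% overlap."""
    return [math.sin(math.pi * (i + 0.5) / n) for i in range(n)]


def overlap_add(signal, frame_len, process=lambda frame: frame):
    """Window each frame, process it, window it again, and overlap-add."""
    hop = frame_len // 2
    window = sine_window(frame_len)
    out = [0.0] * len(signal)
    for start in range(0, len(signal) - frame_len + 1, hop):
        # First windowing, applied before the frequency-domain processing.
        frame = [signal[start + i] * window[i] for i in range(frame_len)]
        # Placeholder for FFT, suppression, and inverse FFT (identity here).
        frame = process(frame)
        # Second windowing, then overlap-add into the output buffer.
        for i in range(frame_len):
            out[start + i] += frame[i] * window[i]
    return out


x = [math.sin(0.3 * n) for n in range(64)]
y = overlap_add(x, frame_len=16)
# Samples 8..55 are covered by two frames and should match the input.
max_err = max(abs(a - b) for a, b in zip(x[8:56], y[8:56]))
print(max_err < 1e-12)
```

With the identity callback (equivalent to a suppression coefficient of 1 in every bin), the interior of the output matches the input, and because each frame is tapered to near zero at its edges, adjacent frames join without a jump at the boundary.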

[Brief description of drawings]

【図１】本発明の実施形態１のノイズ除去装置の模式的
な構成を示すブロック図である。FIG. 1 is a block diagram showing a schematic configuration of a noise removal device according to a first exemplary embodiment of the present invention.

【図２】本発明の実施形態２のノイズ除去装置の模式的
な構成を示すブロック図である。FIG. 2 is a block diagram showing a schematic configuration of a noise removal device according to a second exemplary embodiment of the present invention.

【図３】図２の重みつき劣化音声計算部１４の構成を示
すブロック図である。FIG. 3 is a block diagram showing the configuration of the weighted deteriorated speech calculation unit 14 of FIG. 2.

【図４】図３の多重非線形処理部１４０５の構成を示す
ブロック図である。FIG. 4 is a block diagram showing the configuration of the multiplex nonlinear processing unit 1405 of FIG. 3.

【図５】非線形処理部１４８５₀〜１４８５_K-1等から出
力される非線形関数の例を示す図である。FIG. 5 is a diagram showing an example of the nonlinear function output from the nonlinear processing units 1485₀ to 1485_K-1 and the like.

【図６】図２の推定雑音計算部５の構成を示すブロック
図である。FIG. 6 is a block diagram showing the configuration of the estimated noise calculation unit 5 of FIG. 2.

【図７】図６の周波数別推定雑音計算部５０４₀〜５０
４_K-1の構成を示すブロック図である。FIG. 7 is a block diagram showing the configuration of the frequency-specific estimated noise calculation units 504₀ to 504_K-1 of FIG. 6.

【図８】図７の更新判定部５２０の構成を示すブロック
図である。FIG. 8 is a block diagram showing the configuration of the update determination unit 520 of FIG. 7.

【図９】図６の周波数別推定雑音計算部５０４₀〜５０
４_K-1の構成例を示すブロック図である。FIG. 9 is a block diagram showing a configuration example of the frequency-specific estimated noise calculation units 504₀ to 504_K-1 of FIG. 6.

【図１０】図２の抑圧係数補正部１５の構成を示すブロ
ック図である。FIG. 10 is a block diagram showing the configuration of the suppression coefficient correction unit 15 of FIG. 2.

【図１１】図２の抑圧係数補正部１５に含まれる周波数
別抑圧係数補正部１５０１₀〜１５０１_K-1の構成を示す
ブロック図である。FIG. 11 is a block diagram showing the configuration of the frequency-specific suppression coefficient correction units 1501₀ to 1501_K-1 included in the suppression coefficient correction unit 15 of FIG. 2.

【図１２】図２の雑音抑圧係数生成部８の変形例である
雑音抑圧係数生成部８１の構成を示すブロック図であ
る。FIG. 12 is a block diagram showing the configuration of a noise suppression coefficient generation unit 81, which is a modified example of the noise suppression coefficient generation unit 8 of FIG. 2.

【図１３】図２の周波数別ＳＮＲ計算部６の変形例であ
る周波数別ＳＮＲ計算部６１の構成例を示すブロック図
である。FIG. 13 is a block diagram showing a configuration example of a frequency-specific SNR calculation unit 61, which is a modified example of the frequency-specific SNR calculation unit 6 of FIG. 2.

【図１４】図１３の帯域別パワー計算部６１１の構成を
示すブロック図である。FIG. 14 is a block diagram showing the configuration of the band-specific power calculation unit 611 of FIG. 13.

【図１５】本発明の実施形態３のノイズ除去装置の模式
的な構成を示すブロック図である。FIG. 15 is a block diagram showing a schematic configuration of a noise removal device according to a third exemplary embodiment of the present invention.

【図１６】図１５の推定雑音計算部５２の構成を示すブ
ロック図である。FIG. 16 is a block diagram showing the configuration of the estimated noise calculation unit 52 of FIG. 15.

【図１７】図１６の周波数別推定雑音計算部５０６₀〜
５０６_K-1の構成を示すブロック図である。FIG. 17 is a block diagram showing the configuration of the frequency-specific estimated noise calculation units 506₀ to 506_K-1 of FIG. 16.

【図１８】本発明の実施形態４のノイズ除去装置の模式
的な構成を示すブロック図である。FIG. 18 is a block diagram showing a schematic configuration of a noise removal device according to a fourth exemplary embodiment of the present invention.

【図１９】図１８の推定先天的ＳＮＲ計算部７１の構成
を示すブロック図である。FIG. 19 is a block diagram showing the configuration of the estimated a priori SNR calculation unit 71 of FIG. 18.

【図２０】本発明の実施形態５のノイズ除去装置の模式
的な構成を示すブロック図である。FIG. 20 is a block diagram showing a schematic configuration of a noise removing device according to a fifth embodiment of the present invention.

【図２１】本発明の実施形態６のノイズ除去装置の模式
的な構成を示すブロック図である。FIG. 21 is a block diagram showing a schematic configuration of a noise removing device according to a sixth embodiment of the present invention.

【図２２】図２１の推定雑音計算部５３の構成を示すブ
ロック図である。FIG. 22 is a block diagram showing the configuration of the estimated noise calculation unit 53 of FIG. 21.

【図２３】図２２の周波数別推定雑音計算部５０８₀〜
５０８_K-1の構成を示すブロック図である。FIG. 23 is a block diagram showing the configuration of the frequency-specific estimated noise calculation units 508₀ to 508_K-1 of FIG. 22.

【図２４】図２３の更新判定部５２２の構成を示すブロ
ック図である。FIG. 24 is a block diagram showing the configuration of the update determination unit 522 of FIG. 23.

【図２５】従来の技術のノイズ・サプレッサの模式的な
構成を示すブロック図である。FIG. 25 is a block diagram showing a schematic configuration of a conventional noise suppressor.

【図２６】図２５の音声検出部４の構成を示すブロック
図である。FIG. 26 is a block diagram showing the configuration of the voice detection unit 4 of FIG. 25.

【図２７】図２６のパワー計算部４０６の構成を示すブ
ロック図である。FIG. 27 is a block diagram showing the configuration of the power calculation unit 406 of FIG. 26.

【図２８】図２６の重みつき加算部４０７の構成を示す
ブロック図である。FIG. 28 is a block diagram showing the configuration of the weighted addition unit 407 of FIG. 26.

【図２９】図２５の多重化乗算部１７の構成を示すブロ
ック図である。FIG. 29 is a block diagram showing the structure of the multiplexing multiplication unit 17 of FIG. 25.

【図３０】図２５の推定雑音計算部５１の構成を示すブ
ロック図である。FIG. 30 is a block diagram showing the configuration of the estimated noise calculation unit 51 of FIG. 25.

【図３１】図３０の周波数別推定雑音計算部５１４の構
成を示すブロック図である。FIG. 31 is a block diagram showing the configuration of the frequency-specific estimated noise calculation unit 514 of FIG. 30.

【図３２】図３１の更新判定部５２１の構成を示すブロ
ック図である。FIG. 32 is a block diagram showing the configuration of the update determination unit 521 of FIG. 31.

【図３３】図２５の周波数別ＳＮＲ計算部６の構成を示
すブロック図である。FIG. 33 is a block diagram showing a configuration of a frequency-specific SNR calculation unit 6 of FIG. 25.

【図３４】図２５の推定先天的ＳＮＲ計算部７の構成を
示すブロック図である。FIG. 34 is a block diagram showing the configuration of the estimated a priori SNR calculation unit 7 of FIG. 25.

【図３５】図３４の多重値域限定処理部７０１の構成を
示すブロック図である。FIG. 35 is a block diagram showing the configuration of the multiple value range limitation processing unit 701 of FIG. 34.

【図３６】図３４の多重重みつき加算部７０７の構成を
示すブロック図である。FIG. 36 is a block diagram showing the configuration of the multiple weighted addition unit 707 of FIG. 34.

【図３７】図２５の雑音抑圧係数生成部８の構成を示す
ブロック図である。FIG. 37 is a block diagram showing the configuration of the noise suppression coefficient generation unit 8 in FIG. 25.

【図３８】図３７の抑圧係数検索部８０１₀〜８０１_K-1
の構成を示すブロック図である。FIG. 38 is a block diagram showing the configuration of the suppression coefficient search units 801₀ to 801_K-1 of FIG. 37.

[Explanation of symbols]

１フレーム分割部２，２２窓がけ処理部３フーリエ変換部４音声検出部５，５１，５２，５３推定雑音計算部６，６１，７１５，１４０２周波数別ＳＮＲ計算部７，７１推定先天的ＳＮＲ計算部８，８１雑音抑圧係数生成部９逆フーリエ変換部１０フレーム合成部１１入力端子１２出力端子１３，５０４９カウンタ１４重みつき劣化音声計算部１５抑圧係数補正部１６，１７，７０４，７０５，７１６，１４０４多重
乗算部４０１，１５９３，５２０４，５２０６閾値記憶部４０２，１５９４，５２０３，５２０５比較部４０４，４０７５定数乗算器４０５対数計算部４０６パワー計算部４０７，５０７１，７０７１₀〜７０７１_K-1 重みつき
加算部４０８，７０６，５０７２重み記憶部４０９，５２０２論理否定回路５０２，５０５，６０２，６０３分離部５０３，６０４，８０４，１４７多重化部５０４₀〜５０４_K-1,５０６₀〜５０６_K-1，５０７，５
０８₀〜５０８_K-1，５１４₀〜５１４_K-1 周波数別推定
雑音計算部５２０，５２１，５２２更新判定部６０１₀〜６０１_K-1,５０４１，５０４８除算部６１１，６１２周波数別パワー計算部７０１多重値域限定処理部７０２後天的ＳＮＲ記憶部７０３抑圧係数記憶部７０７多重重みつき加算部７０８，４０６３，４０７２，４０７４，５０４６，６
１１０₀〜６１１０_M-1加算器７１２，１４０１，５９４２推定雑音記憶部７１３強調音声パワースペクトル記憶部８０１₀〜８０１_K-1 抑圧係数検索部８１１ゲイン関数値計算部８１２一般化尤度比計算部８１３音声存在確率記憶部８１４抑圧係数計算部９０１劣化音声パワー９０２閾値９０３，９２３重み９０４更新閾値９０５重みつき加算部制御信号９１０₀〜９１０_K-1，９１０₀〜９１０_ML-1 周波数別
劣化音声パワースペクトル９１１₀〜９１１_K-1，９１１₀〜９１１_ML-1 帯域別劣
化音声パワースペクトル９２１瞬時推定ＳＮＲ９２１₀〜９２１_K-1 周波数別瞬時推定ＳＮＲ９２２過去の推定ＳＮＲ９２２₀〜９２２_K-1 過去の周波数別推定ＳＮＲ９２４推定先天的ＳＮＲ９２４₀〜９２４_K-1 周波数別推定先天的ＳＮＲ１４０５多重非線形処理部１４８５₀〜１４８５_K-1,５０４２非線形処理部１５０１₀〜１５０１_K-1 周波数別抑圧係数補正部１５９１，７０１２₀〜７０１２_K-1 最大値選択部１５９２抑圧係数下限値記憶部１５９５，５０４４スイッチ１５９６修正量記憶部１５９７，１７０１₀〜１７０１_K-1,４０６２₀〜４０６
２_K-1,４０７１，４０７３，５０４３乗算器５０４５シフトレジスタ５０４７最小値選択部５２０１，５２１１，５２２１論理和計算部５２０７閾値計算部５９４１レジスタ長記憶部７０１１定数記憶部８０１１抑圧係数テーブル８０１２，８０１３アドレス変換部
1: frame division unit
2, 22: windowing processing unit
3: Fourier transform unit
4: voice detection unit
5, 51, 52, 53: estimated noise calculation unit
6, 61, 715, 1402: frequency-specific SNR calculation unit
7, 71: estimated a priori SNR calculation unit
8, 81: noise suppression coefficient generation unit
9: inverse Fourier transform unit
10: frame synthesis unit
11: input terminal
12: output terminal
13, 5049: counter
14: weighted deteriorated speech calculation unit
15: suppression coefficient correction unit
16, 17, 704, 705, 716, 1404: multiplex multiplication unit
401, 1593, 5204, 5206: threshold storage unit
402, 1594, 5203, 5205: comparison unit
404, 4075: constant multiplier
405: logarithm calculation unit
406: power calculation unit
407, 5071, 7071₀ to 7071_K-1: weighted addition unit
408, 706, 5072: weight storage unit
409, 5202: logical negation circuit
502, 505, 602, 603: separation unit
503, 604, 804, 147: multiplexing unit
504₀ to 504_K-1, 506₀ to 506_K-1, 507, 508₀ to 508_K-1, 514₀ to 514_K-1: frequency-specific estimated noise calculation unit
520, 521, 522: update determination unit
601₀ to 601_K-1, 5041, 5048: division unit
611, 612: frequency-specific power calculation unit
701: multiple value range limitation processing unit
702: a posteriori SNR storage unit
703: suppression coefficient storage unit
707: multiple weighted addition unit
708, 4063, 4072, 4074, 5046, 6110₀ to 6110_M-1: adder
712, 1401, 5942: estimated noise storage unit
713: emphasized speech power spectrum storage unit
801₀ to 801_K-1: suppression coefficient search unit
811: gain function value calculation unit
812: generalized likelihood ratio calculation unit
813: speech presence probability storage unit
814: suppression coefficient calculation unit
901: deteriorated speech power
902: threshold
903, 923: weight
904: update threshold
905: weighted addition unit control signal
910₀ to 910_K-1, 910₀ to 910_ML-1: frequency-specific deteriorated speech power spectrum
911₀ to 911_K-1, 911₀ to 911_ML-1: band-specific deteriorated speech power spectrum
921: instantaneous estimated SNR
921₀ to 921_K-1: frequency-specific instantaneous estimated SNR
922: past estimated SNR
922₀ to 922_K-1: past frequency-specific estimated SNR
924: estimated a priori SNR
924₀ to 924_K-1: frequency-specific estimated a priori SNR
1405: multiplex nonlinear processing unit
1485₀ to 1485_K-1, 5042: nonlinear processing unit
1501₀ to 1501_K-1: frequency-specific suppression coefficient correction unit
1591, 7012₀ to 7012_K-1: maximum value selection unit
1592: suppression coefficient lower-limit value storage unit
1595, 5044: switch
1596: correction amount storage unit
1597, 1701₀ to 1701_K-1, 4062₀ to 4062_K-1, 4071, 4073, 5043: multiplier
5045: shift register
5047: minimum value selection unit
5201, 5211, 5221: logical sum calculation unit
5207: threshold calculation unit
5941: register length storage unit
7011: constant storage unit
8011: suppression coefficient table
8012, 8013: address conversion unit

フロントページの続き (72)発明者芹沢昌宏東京都港区芝五丁目７番１号日本電気株式会社内Ｆターム(参考） 5D015 EE05 Continuation of front page: (72) Inventor: Masahiro Serizawa, c/o NEC Corporation, 5-7-1 Shiba, Minato-ku, Tokyo. F-term (reference): 5D015 EE05

Claims

[Claims]

1. A noise removing method in which an input signal is converted into a frequency domain signal, a signal-to-noise ratio is obtained based on the frequency domain signal, a suppression coefficient is determined based on the signal-to-noise ratio, the frequency domain signal is weighted based on the suppression coefficient, and the weighted frequency domain signal is converted into a time domain signal to obtain a noise-removed output signal, wherein the input signal is converted into the frequency domain signal after a first windowing is applied to it, and the output signal is obtained by applying a second windowing to the time domain signal.

2. The noise removing method according to claim 1, wherein the first window function used in the first windowing and the second window function used in the second windowing have the characteristic that the input signal and the output signal match when the suppression coefficient is set to 1.

3. The noise removing method, wherein the first window function used in the first windowing and the second window function used in the second windowing are equal to each other.

4. The noise removing method according to claim 1, wherein the frequency domain signal is weighted using a corrected suppression coefficient obtained by correcting the suppression coefficient.

5. The noise removing method according to claim 1, 2 or 3, wherein the suppression coefficient is determined based on a corrected signal-to-noise ratio obtained by correcting the signal-to-noise ratio.

6. The noise removing method according to claim 1, 2, 3, 4, or 5, wherein a weighted frequency domain signal is obtained by weighting the frequency domain signal, an estimated noise is obtained using the weighted frequency domain signal, and the signal-to-noise ratio is obtained using the estimated noise and the frequency domain signal.

7. A noise removing method in which an input signal is converted into a frequency domain signal, an estimated noise is obtained based on the frequency domain signal, a value corresponding to the estimated noise is subtracted from the frequency domain signal to obtain an emphasized speech in the frequency domain, and the emphasized speech is converted into a time domain signal to obtain a noise-removed output signal, wherein the input signal is converted into the frequency domain signal after a first windowing is applied to it, and the output signal is obtained by applying a second windowing to the time domain signal.

8. The noise removing method according to claim 7, wherein the frequency domain signal is weighted to obtain a weighted frequency domain signal, and the estimated noise is obtained using the weighted frequency domain signal.

9. The noise removing method according to claim 6, wherein when the weighted frequency domain signal is obtained by weighting the frequency domain signal, the weight is obtained using a signal-to-noise ratio.

10. The noise removing method according to claim 6, wherein, when the frequency domain signal is weighted to obtain the weighted frequency domain signal, a weight is obtained using a signal-to-noise ratio, the weight is processed by a non-linear function to obtain a correction weight, and the frequency domain signal is weighted using the correction weight.

11. A noise removing device comprising: a first windowing processing unit that performs windowing processing on an input signal; a conversion unit that converts the input signal windowed by the first windowing processing unit into a frequency domain signal; a signal-to-noise ratio calculation unit that obtains a signal-to-noise ratio based on the amplitude component of the frequency domain signal converted by the conversion unit; a suppression coefficient generation unit that generates a suppression coefficient based on the signal-to-noise ratio; a multiplication unit that weights the amplitude component of the frequency domain signal based on the suppression coefficient generated by the suppression coefficient generation unit; an inverse transform unit that transforms the amplitude component of the frequency domain signal weighted by the multiplication unit and the phase component of the frequency domain signal converted by the conversion unit into a time domain signal; and a second windowing processing unit that performs windowing processing on the time domain signal transformed by the inverse transform unit.

12. The noise removing device according to claim 11, wherein the first window function used for windowing processing in the first windowing processing unit and the second window function used for windowing processing in the second windowing processing unit have the characteristic that the input signal and the output signal match when the suppression coefficient is set to 1.

13. The noise removing device according to claim 11 or 12, wherein the first window function used for windowing processing in the first windowing processing unit and the second window function used for windowing processing in the second windowing processing unit are equal to each other.

14. The noise removing device according to claim 11, 12, or 13, further comprising a suppression coefficient correction unit that corrects the suppression coefficient, wherein the multiplication unit weights the amplitude component of the frequency domain signal using the corrected suppression coefficient obtained by the suppression coefficient correction unit.

15. The noise removing device according to claim 14, wherein the suppression coefficient generation unit includes a corrected signal-to-noise ratio calculation unit that corrects the signal-to-noise ratio to obtain a corrected signal-to-noise ratio.

16. The noise removing device according to claim 11, 12, 13, 14, or 15, wherein the signal-to-noise ratio calculation unit includes an estimated noise calculation unit that estimates noise using the amplitude component of the frequency domain signal, and calculates the signal-to-noise ratio based on the estimation result of the estimated noise calculation unit.

17. The noise removing device according to claim 16, wherein the estimated noise calculation unit includes a weighted deteriorated speech calculation unit that weights the amplitude component of the frequency domain signal to obtain a weighted amplitude component.

18. The noise removing device, wherein the weighted deteriorated speech calculation unit includes a non-linear processing unit that processes the signal-to-noise ratio by a non-linear function to obtain a weighting coefficient.