JP2003140700A

JP2003140700A - Method and device for noise removal

Info

Publication number: JP2003140700A
Application number: JP2001339156A
Authority: JP
Inventors: Akihiko Sugiyama; 昭彦杉山
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2001-11-05
Filing date: 2001-11-05
Publication date: 2003-05-16
Anticipated expiration: 2021-11-05
Also published as: JP3858668B2

Abstract

PROBLEM TO BE SOLVED: To provide a noise removal device and method that can obtain enhanced speech of superior quality. SOLUTION: The device has an injected noise calculation unit 55 that calculates the noise to be injected from a degraded speech power spectrum and an estimated noise power spectrum, two adders 56 and 57 that add the obtained noise to the degraded speech power spectrum and the estimated noise power spectrum, and a noise suppression coefficient generation unit 8 that determines a suppression coefficient from the noise-added degraded speech power spectrum and estimated noise power spectrum. The device further has a windowing processing unit 22 that applies a window to signal samples extracted from two adjacent frames of the inverse Fourier transform output.

Description

Detailed Description of the Invention

[0001]

Field of the Invention: The present invention relates to a noise removal method and device, and more particularly to a noise removal method and device for removing noise superimposed on a desired speech signal.

[0002]

Description of the Related Art: A noise removal device (noise suppressor) removes noise superimposed on a desired speech signal. It estimates the power spectrum of the noise component from the input signal after conversion from the time domain to the frequency domain, and subtracts this estimated power spectrum from the input signal, thereby suppressing the noise mixed into the desired speech signal. By detecting non-speech (silent) sections and updating the power spectrum of the noise component there, the device can also be applied to the suppression of non-stationary noise. One such noise removal device is the scheme described in IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol. 32, No. 6, pp. 1109-1121, December 1984 (Document 1), which is known as the minimum mean-square error short-time spectral amplitude method. FIG. 48 shows the configuration of the noise removal device described in Document 1.

[0003]

A degraded speech signal (a signal in which the desired speech signal and noise are mixed) is supplied to input terminal 11 as a sequence of time-domain sample values. The degraded speech signal samples are supplied to the frame division unit 1 and divided into frames of K/2 samples each, where K is an even number not less than 2. The framed degraded speech samples are supplied to the windowing processing unit 2, where they are multiplied by a window function w(t). For the input signal y_n(t) (t = 0, 1, ..., K/2−1) of the n-th frame, the signal ȳ_n(t) windowed by w(t) is given by Equation (1).

[0004]

[Equation 1]
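The equation image is not reproduced in this text. From the description in paragraph [0003] (the frame samples are multiplied by the window function), Equation (1) presumably has the following form; this is a reconstruction from the surrounding text, not the original typesetting.

```latex
\bar{y}_n(t) = w(t)\, y_n(t), \qquad t = 0, 1, \ldots, K/2 - 1
```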

[0005]

It is also common practice to window two consecutive frames with part of them overlapped. Assuming an overlap length of 50% of the frame length, for t = 0, 1, ..., K/2−1, the signal ȳ_n(t) (t = 0, 1, ..., K/2−1) obtained by Equation (2) is the output of the windowing processing unit 2.

[0006]

[Equation 2]

[0007]

For real-valued signals, a symmetric window function is used. The window function is also designed so that, when the suppression coefficient described later is set to 1, the input signal and the output signal match apart from computational error. This means that w(t) + w(t + K/2) = 1. The description below continues with the example in which two consecutive frames are windowed with 50% overlap. As the window function w(t), the Hanning window shown in Equation (3) can be used, for example.

[0008]

[Equation 3]
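The overlap condition w(t) + w(t + K/2) = 1 can be checked numerically. The following is a minimal sketch that assumes a periodic Hann window of length K, w(t) = 0.5 − 0.5 cos(2πt/K). The exact form of Equation (3) is not reproduced in this text, so this formula and the function name `hann_window` are assumptions made for illustration.

```python
import numpy as np

def hann_window(K: int) -> np.ndarray:
    """Periodic Hann window of even length K (assumed form, not Equation (3) itself)."""
    if K < 2 or K % 2 != 0:
        raise ValueError("K must be an even number not less than 2")
    t = np.arange(K)
    return 0.5 - 0.5 * np.cos(2.0 * np.pi * t / K)

K = 8
w = hann_window(K)
# Overlap condition from the text: w(t) + w(t + K/2) = 1 for t = 0 .. K/2 - 1
overlap_sum = w[: K // 2] + w[K // 2 :]
```

With this window the two overlapping halves sum to one at every sample, which is what allows the output to match the input when the suppression coefficient is 1.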

[0009]

The windowed output ȳ_n(t) is supplied to the Fourier transform unit 3 and converted into the degraded speech spectrum (frequency-domain signal) Y_n(k). The degraded speech spectrum Y_n(k) is separated into phase and amplitude: the degraded speech phase spectrum arg Y_n(k) is supplied to the inverse Fourier transform unit 9, and the degraded speech amplitude spectrum |Y_n(k)| is supplied to the speech detection unit 4, the multiplexed multiplier 16, and the multiplexed multiplier 17.

[0010]

The speech detection unit 4 detects the presence or absence of speech based on the degraded speech amplitude spectrum |Y_n(k)| and passes a speech detection flag determined by the result to the estimated noise calculation unit 51. The multiplexed multiplier 17 squares the supplied degraded speech amplitude spectrum |Y_n(k)| frequency by frequency and passes the result, as the degraded speech power spectrum, to the estimated noise calculation unit 51 and the per-frequency SNR (signal-to-noise ratio) calculation unit 6. The estimated noise calculation unit 51 uses the speech detection flag, the degraded speech power spectrum, and the count value supplied from the counter 13 to estimate the power spectrum of the noise (second noise) contained in the degraded speech amplitude spectrum, and passes it to the per-frequency SNR calculation unit 6 as the estimated noise power spectrum. The per-frequency SNR calculation unit 6 divides the input degraded speech power spectrum by the estimated noise power spectrum at each frequency and supplies the result, as the a posteriori SNR, to the estimated a priori SNR calculation unit 7 and the noise suppression coefficient generation unit 8. The a posteriori SNR is an estimate of the ratio of the noisy pre-enhancement speech to the noise.

[0011]

The estimated a priori SNR calculation unit 7 uses the input a posteriori SNR and the suppression coefficient Ḡ_n(k) supplied from the noise suppression coefficient generation unit 8 (described later) to estimate the a priori SNR, which represents the true speech-to-noise ratio, and feeds it back to the noise suppression coefficient generation unit 8 as the estimated a priori SNR. The noise suppression coefficient generation unit 8 generates a noise suppression coefficient from the a posteriori SNR and the estimated a priori SNR supplied as inputs, and returns it as the suppression coefficient Ḡ_n(k) to the estimated a priori SNR calculation unit 7 while also passing it to the multiplexed multiplier 16. The multiplexed multiplier 16 weights the degraded speech amplitude spectrum |Y_n(k)| supplied from the Fourier transform unit 3 by the suppression coefficient Ḡ_n(k) supplied from the noise suppression coefficient generation unit 8 to obtain the enhanced speech amplitude spectrum |X̄_n(k)|, and passes it to the inverse Fourier transform unit 9. |X̄_n(k)| is given by Equation (4).

[0012]

[Equation 4]
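The equation image is not reproduced in this text. Since paragraph [0011] describes weighting the degraded speech amplitude spectrum by the suppression coefficient, Equation (4) presumably reads as follows; this is a reconstruction from the description.

```latex
|\bar{X}_n(k)| = \bar{G}_n(k)\, |Y_n(k)|, \qquad k = 0, 1, \ldots, K - 1
```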

[0013]

The inverse Fourier transform unit 9 multiplies the enhanced speech amplitude spectrum |X̄_n(k)| supplied from the multiplexed multiplier 16 by the degraded speech phase spectrum arg Y_n(k) supplied from the Fourier transform unit 3 to obtain the enhanced speech spectrum X̄_n(k). That is, it executes Equation (5).

[0014]

[Equation 5]
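The amplitude weighting and the recombination with the degraded speech phase can be sketched as follows. The sketch reads "multiplying by the phase spectrum" as multiplying by the unit-magnitude factor exp(j·arg Y_n(k)); that reading, the function name `enhance_spectrum`, and the example gains are assumptions. The gain is passed in by the caller because the suppression coefficient generation is described elsewhere.

```python
import numpy as np

def enhance_spectrum(Y: np.ndarray, G: np.ndarray) -> np.ndarray:
    """Weight the amplitude of Y by gain G and reattach the phase of Y."""
    amplitude = G * np.abs(Y)            # enhanced amplitude spectrum
    phase = np.angle(Y)                  # degraded speech phase, arg Y_n(k)
    return amplitude * np.exp(1j * phase)

Y = np.fft.fft(np.array([1.0, -2.0, 0.5, 3.0]))
X_unity = enhance_spectrum(Y, np.ones(4))        # gain 1 leaves the spectrum unchanged
X_half = enhance_spectrum(Y, np.full(4, 0.5))    # gain 0.5 halves every amplitude
```

Only the amplitude is modified; the phase of the noisy input is reused as is.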

[0015]

It then applies an inverse Fourier transform to the resulting enhanced speech spectrum X̄_n(k) and passes the result to the frame synthesis unit 10 as a time-domain sample value sequence (time-domain signal) x̄_n(t) (t = 0, 1, ..., K−1) in which one frame consists of K samples. The frame synthesis unit 10 takes K/2 samples from each of two adjacent frames of x̄_n(t), overlaps them, and obtains the enhanced speech x̂_n(t) (t = 0, 1, ..., K/2−1) according to Equation (6). The resulting enhanced speech x̂_n(t) is passed to output terminal 12 as the output of the frame synthesis unit 10.

[0016]

[Equation 6]
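The whole analysis and synthesis chain with a suppression coefficient of 1 can be sketched as below. Equations (2) and (6) are not reproduced in this text, so the sketch assumes that each windowed block pairs the previous and the current K/2-sample frame, and that the output is x̂_n(t) = x̄_{n−1}(t + K/2) + x̄_n(t). The Hann window form and the name `analysis_synthesis` are likewise assumptions.

```python
import numpy as np

def analysis_synthesis(y: np.ndarray, K: int) -> np.ndarray:
    """Window, transform, inverse-transform and overlap-add with unit gain."""
    half = K // 2
    w = 0.5 - 0.5 * np.cos(2.0 * np.pi * np.arange(K) / K)  # assumed Hann window
    frames = y.reshape(-1, half)             # frames of K/2 samples
    prev_in = np.zeros(half)                 # previous input frame
    prev_tail = np.zeros(half)               # second half of the previous output block
    out = []
    for frame in frames:
        block = np.concatenate([prev_in, frame]) * w   # 50% overlapped windowing
        spectrum = np.fft.fft(block)                   # suppression gain of 1: unchanged
        x_bar = np.fft.ifft(spectrum).real
        out.append(prev_tail + x_bar[:half])           # overlap-add of adjacent blocks
        prev_in, prev_tail = frame, x_bar[half:]
    return np.concatenate(out)

K = 8
y = np.arange(1.0, 25.0)                     # 24 samples = 6 frames of K/2
x_hat = analysis_synthesis(y, K)
```

Under these assumptions the output reproduces the input delayed by one frame of K/2 samples, which illustrates why the window must satisfy w(t) + w(t + K/2) = 1.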

[0017]

Next, the configuration and operation of each unit of the noise removal device shown in FIG. 48 are described further. Document 1 does not disclose in detail how to implement the speech detection unit. However, an implementation example is known from the Proceedings of the Acoustical Society of Japan, March 2000, pp. 321-322 (Document 2), so the scheme shown in Document 2 is described below as the conventional method. FIG. 49 is a block diagram showing the configuration of the speech detection unit 4 in FIG. 48. The speech detection unit 4 has a threshold storage unit 401, a comparison unit 402, a multiplier 404, a logarithm calculation unit 405, a power calculation unit 406, a weighted addition unit 407, a weight storage unit 408, and a logical NOT circuit 409.

[0018]

The degraded speech amplitude spectrum supplied from the Fourier transform unit 3 in FIG. 48 is supplied to the power calculation unit 406. The power calculation unit 406 computes the sum of the power |Y_n(k)|² of the degraded speech amplitude spectrum over k = 0 to K−1 and passes it to the logarithm calculation unit 405. The logarithm calculation unit 405 takes the logarithm of the input degraded speech spectral power |Y_n(k)|² and passes it to the multiplier 404. The multiplier 404 multiplies the supplied logarithm by a constant (for example, 10) to obtain the degraded speech power Q_n, and supplies it to the comparison unit 402 and the weighted addition unit 407. That is, the degraded speech power Q_n of the n-th frame is given by Equation (7).

[0019]

[Equation 7]
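The equation image is not reproduced in this text. Following the chain of power summation, logarithm, and constant multiplication described in paragraph [0018], Equation (7) presumably has the form below. The base-10 logarithm is an assumption, and the factor 10 is the example value given in the text.

```latex
Q_n = 10 \log_{10} \left( \sum_{k=0}^{K-1} |Y_n(k)|^2 \right)
```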

[0020]

Note that the speech detection unit disclosed in Document 2 obtains Q_n according to Equation (8), using the time-domain samples ȳ_n(t).

[0021]

[Equation 8]

[0022]

However, as described in, for example, Theory of Digital Signal Processing, Corona Publishing, 1985, pp. 75-76 (Document 3), the equivalence of Equation (8) and Equation (7) is known as Parseval's identity.
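Parseval's identity can be illustrated numerically. The sketch below uses NumPy's unnormalized forward DFT, for which the frequency-domain energy is K times the time-domain energy; whether the transform in the patent includes a normalization factor is not stated, so the constant offset shown here is a property of this convention only. The random test frame is a stand-in, not data from the source.

```python
import numpy as np

rng = np.random.default_rng(0)
K = 16
y_bar = rng.standard_normal(K)               # stand-in for one windowed frame
Y = np.fft.fft(y_bar)                        # unnormalized forward DFT

time_energy = np.sum(y_bar ** 2)
freq_energy = np.sum(np.abs(Y) ** 2)

# With NumPy's unnormalized DFT the two energies differ by the factor K,
# which becomes a constant offset of 10*log10(K) dB after the logarithm.
q_time = 10.0 * np.log10(time_energy)
q_freq = 10.0 * np.log10(freq_energy)
offset_db = q_freq - q_time
```

Because the offset is the same for every frame, a detector based on either quantity behaves identically once the threshold is shifted accordingly.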

[0023]

The threshold TH_n is supplied from the threshold storage unit 401 to the comparison unit 402. The comparison unit 402 compares the output Q_n of the multiplier 404 with the threshold TH_n, and outputs “1”, indicating speech, when TH_n > Q_n and “0”, indicating non-speech, when TH_n ≤ Q_n. The output of the comparison unit 402 is supplied to the outside as the speech detection flag, which is the output of the speech detection unit 4, and at the same time to the logical NOT circuit 409. The output of the logical NOT circuit 409 is supplied to the weighted addition unit 407 as the weighted addition unit control signal 905. The weighted addition unit 407 is also supplied with the threshold (TH_{n-1}) 902 from the threshold storage unit 401 and the weight 903 from the weight storage unit 408.

[0024]

The weighted addition unit 407 selectively updates the threshold (TH_{n-1}) 902 supplied from the threshold storage unit 401 based on the weighted addition unit control signal 905. The updated threshold TH_n is obtained by a weighted addition of the threshold (TH_{n-1}) 902 and the degraded speech power (Q_n) 901, using the weight 903 supplied from the weight storage unit 408. The updated threshold TH_n is computed only when the weighted addition unit control signal 905, which is the output of the logical NOT circuit 409, equals “1”. That is, the threshold TH_{n-1} is updated to TH_n only during non-speech. The updated threshold TH_n is fed back to the threshold storage unit 401 as the updated threshold 904.

[0025]

FIG. 50 is a block diagram showing the configuration of the power calculation unit 406 included in the speech detection unit 4 shown in FIG. 49. The power calculation unit 406 has a separation unit 4061, K multipliers 4062_0 to 4062_{K-1}, and an adder 4063. The degraded speech amplitude spectrum |Y_n(k)|, supplied in multiplexed form from the Fourier transform unit 3 in FIG. 48, is separated by the separation unit 4061 into K per-frequency samples, which are supplied to the multipliers 4062_0 to 4062_{K-1}, respectively. Each of the multipliers 4062_0 to 4062_{K-1} squares its input signal and passes it to the adder 4063. The adder 4063 computes and outputs the sum of its input signals.

[0026]

FIG. 51 is a block diagram showing the configuration of the weighted addition unit 407 included in the speech detection unit 4 shown in FIG. 49. The weighted addition unit 407 has multipliers 4071 and 4073, a constant multiplier 4075, and adders 4072 and 4074. Its inputs are the degraded speech power (Q_n) 901 from the multiplier 404 in FIG. 49, the threshold (TH_{n-1}) 902 from the threshold storage unit 401 in FIG. 49, the weight 903 from the weight storage unit 408 in FIG. 49, and the weighted addition unit control signal 905 from the logical NOT circuit 409 in FIG. 49.

[0027]

The weight 903, which has the value β, is passed to the constant multiplier 4075 and the multiplier 4073. The constant multiplier 4075 multiplies its input by −1 and supplies the resulting −β as one input of the adder 4074. The other input of the adder 4074 is 1, so the output of the adder 4074 is their sum, 1−β. The value 1−β is supplied as one input of the multiplier 4071, where it is multiplied by the other input, the degraded speech power (Q_n) 901, and the product (1−β)Q_n is passed to the adder 4072.

[0028]

Meanwhile, the multiplier 4073 multiplies β, supplied as the weight 903, by the threshold (TH_{n-1}) 902, and the product βTH_{n-1} is passed to the adder 4072. The adder 4072 outputs the sum of βTH_{n-1} and (1−β)Q_n as the updated threshold (TH_n) 904. The updated threshold TH_n is computed only when the weighted addition unit control signal 905 equals “1”. In other words, the function of the weighted addition unit 407 is to update the threshold TH_{n-1} to obtain TH_n during non-speech, and it can be expressed by Equation (9).

[0029]

[Equation 9]
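The equation image is not reproduced in this text, but the adder and multiplier description implies the update TH_n = βTH_{n-1} + (1−β)Q_n, applied only when the control signal is “1”. A minimal sketch of that selective update follows; the function name `update_threshold` and the numeric values are hypothetical, and keeping the previous threshold during speech is inferred from the statement that the update happens only during non-speech.

```python
def update_threshold(th_prev: float, q: float, beta: float, speech_flag: int) -> float:
    """Return TH_n given TH_{n-1}, the frame power Q_n, weight beta and the speech flag."""
    control = 1 - speech_flag                # logical NOT circuit 409
    if control == 1:                         # non-speech: weighted addition
        return beta * th_prev + (1.0 - beta) * q
    return th_prev                           # speech: threshold is kept

th_silence = update_threshold(th_prev=40.0, q=50.0, beta=0.9, speech_flag=0)
th_speech = update_threshold(th_prev=40.0, q=50.0, beta=0.9, speech_flag=1)
```

A β close to 1 makes the threshold track the non-speech power slowly, while a smaller β lets it follow changes more quickly.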

[0030]

The multiplexed multiplier 17 in FIG. 48 is described next. FIG. 52 is a block diagram showing the configuration of the multiplexed multiplier 17. The multiplexed multiplier 17 has K multipliers 1701_0 to 1701_{K-1}, separation units 1702 and 1703, and a multiplexing unit 1704. The degraded speech amplitude spectrum, supplied in multiplexed form from the Fourier transform unit 3 in FIG. 48, is separated into K per-frequency samples in the separation units 1702 and 1703, and these are supplied to the multipliers 1701_0 to 1701_{K-1}, respectively. Each of the multipliers 1701_0 to 1701_{K-1} squares its input signal and passes it to the multiplexing unit 1704. The multiplexing unit 1704 multiplexes its input signals and outputs them as the degraded speech power spectrum.

[0031]

The estimated noise calculation unit 51 in FIG. 48 is described next. FIG. 53 is a block diagram showing the configuration of the estimated noise calculation unit 51. The estimated noise calculation unit 51 has a separation unit 502, a multiplexing unit 503, and K per-frequency estimated noise calculation units 514_0 to 514_{K-1}. The speech detection flag supplied from the speech detection unit 4 in FIG. 48 and the count value supplied from the counter 13 in FIG. 48 are passed to the per-frequency estimated noise calculation units 514_0 to 514_{K-1}. The degraded speech power spectrum supplied from the multiplexed multiplier 17 in FIG. 48 is passed to the separation unit 502.

[0032]

The separation unit 502 separates the degraded speech power spectrum, supplied in multiplexed form, into components corresponding to the K frequencies and passes them to the per-frequency estimated noise calculation units 514_0 to 514_{K-1}, respectively. The per-frequency estimated noise calculation units 514_0 to 514_{K-1} compute the noise power spectrum from the degraded speech power spectrum supplied from the separation unit 502 and pass it to the multiplexing unit 503. The computation of the noise power spectrum is controlled by the count value and the value of the speech detection flag, and is executed only when a predetermined condition is satisfied. The multiplexing unit 503 multiplexes the K supplied noise power spectrum values and outputs them as the estimated noise power spectrum.

[0033]

FIG. 54 is a block diagram showing the configuration of the per-frequency estimated noise calculation unit 514 included in the estimated noise calculation unit 51 shown in FIG. 53. The noise estimation disclosed in Document 2 updates the noise estimate during non-speech sections, and uses as the noise estimate the instantaneous estimated noise averaged by a recursive filter. On the other hand, the noise estimation disclosed in IEEE Transactions on Speech and Audio Processing, Vol. 6, No. 3, pp. 287-292, May 1998 (Document 4) is described as averaging the instantaneous values of the estimated noise. This suggests implementing the averaging with a transversal filter (a configuration using a shift register) instead of a recursive one. Since both implementations are functionally equivalent, the method disclosed in Document 4 is described here.

[0034]

The per-frequency estimated noise calculation unit 514 has an update determination unit 521, a register length storage unit 5941, a switch 5044, a shift register 5045, an adder 5046, a minimum value selection unit 5047, a division unit 5048, and a counter 5049. The switch 5044 is supplied with the per-frequency degraded speech power spectrum from the separation unit 502 in FIG. 53. When the switch 5044 closes the circuit, the per-frequency degraded speech power spectrum is passed to the shift register 5045. The shift register 5045 shifts the values stored in its internal registers to the adjacent registers according to the control signal supplied from the update determination unit 521. The shift register length equals the value stored in the register length storage unit 5941, described later. All register outputs of the shift register 5045 are supplied to the adder 5046. The adder 5046 adds all the supplied register outputs and passes the sum to the division unit 5048.

[0035]

Meanwhile, the update determination unit 521 is supplied with the count value and the speech detection flag. The update determination unit 521 always outputs “1” until the count value reaches a preset value; after that, it outputs “1” when the speech detection flag is “0” (non-speech) and “0” otherwise. It passes this output as a control signal to the counter 5049, the switch 5044, and the shift register 5045. The switch 5044 closes the circuit when the control signal supplied from the update determination unit 521 is “1” and opens it when the signal is “0”. The counter 5049 increments its count value when the control signal supplied from the update determination unit 521 is “1” and leaves it unchanged when the signal is “0”. When the signal supplied from the update determination unit 521 is “1”, the shift register 5045 takes in one signal sample supplied from the switch 5044 and, at the same time, shifts the values stored in its internal registers to the adjacent registers.

[0036]

The minimum value selection unit 5047 is supplied with the output of the counter 5049 and the output of the register length storage unit 5941. The minimum value selection unit 5047 selects the smaller of the supplied count value and register length and passes it to the division unit 5048. The division unit 5048 divides the sum of the per-frequency degraded speech power spectrum supplied from the adder 5046 by the smaller of the count value and the register length, and outputs the quotient as the per-frequency estimated noise power spectrum λ_n(k). Letting B_n(k) (n = 0, 1, ..., N−1) be the sample values of the degraded speech power spectrum stored in the shift register 5045, λ_n(k) is given by Equation (10).

[0037]

[Equation 10]
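The equation image is not reproduced in this text. Given that the division unit divides the sum of the stored samples B_n(k) by N, Equation (10) presumably has the following form. This is a reconstruction; the source uses n both as the frame index of λ_n(k) and as the index of the stored samples, and that notation is kept here.

```latex
\lambda_n(k) = \frac{1}{N} \sum_{n=0}^{N-1} B_n(k)
```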

[0038]

Here, N is the smaller of the count value and the register length. Because the count value starts at zero and increases monotonically, the division is initially by the count value and later by the register length. Meanwhile, the number of registers that actually hold a value equals the count value while the count value is smaller than the register length, and equals the register length once the count value exceeds it. The sum of the per-frequency degraded speech power spectrum supplied from the adder 5046 is therefore divided by the number of registers that actually hold a value. When the count value is larger than the register length, this amounts to taking the average of the values stored in the shift register 5045. The result of this operation is the per-frequency estimated noise power spectrum.
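The shift-register averaging for a single frequency can be sketched as follows. The class name `PerFrequencyNoiseEstimator`, the use of a bounded deque as the shift register, and the return value of 0.0 before any sample has been stored are assumptions made for illustration; the control signal is passed in by the caller.

```python
from collections import deque

class PerFrequencyNoiseEstimator:
    """Moving average of stored power samples for one frequency bin (sketch)."""

    def __init__(self, register_length: int) -> None:
        self.register_length = register_length
        self.registers = deque(maxlen=register_length)  # shift register 5045
        self.count = 0                                  # counter 5049

    def step(self, power: float, update: int) -> float:
        """Process one frame; `update` is the control signal from unit 521."""
        if update == 1:                      # switch 5044 closed
            self.registers.appendleft(power) # take one sample in, shift the rest
            self.count += 1
        n = min(self.count, self.register_length)       # minimum value selection 5047
        if n == 0:
            return 0.0                       # assumption: no estimate before first sample
        return sum(self.registers) / n       # adder 5046 and division unit 5048

est = PerFrequencyNoiseEstimator(register_length=3)
outputs = [est.step(p, u) for p, u in [(2.0, 1), (4.0, 1), (100.0, 0), (6.0, 1), (8.0, 1)]]
```

In this run the third frame is skipped because its control signal is 0, so the large value 100.0 never enters the average, and from the fourth accepted sample onward the oldest value is shifted out.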

[0039]

FIG. 55 is a block diagram showing the configuration of the update determination unit 521 included in the per-frequency estimated noise calculation unit 514 shown in FIG. 54. The update determination unit 521 has a logical NOT circuit 5202, a comparison unit 5203, a threshold storage unit 5204, and a logical OR calculation unit 5211. The count value supplied from the counter 13 in FIG. 48 is passed to the comparison unit 5203. The threshold output by the threshold storage unit 5204 is also passed to the comparison unit 5203. The comparison unit 5203 compares the supplied count value with the threshold and passes “1” to the logical OR calculation unit 5211 when the count value is smaller than the threshold and “0” when the count value is larger than the threshold.

[0040]

Meanwhile, the supplied speech detection flag is passed to the logical NOT circuit 5202. The logical NOT circuit 5202 computes the logical negation of its input signal and passes it to the logical OR calculation unit 5211. That is, it passes “0” to the logical OR calculation unit 5211 in speech sections, where the speech detection flag is “1”, and “1” in non-speech sections, where the speech detection flag is “0”. As a result, the output of the logical OR calculation unit 5211 is “1” during non-speech sections, where the speech detection flag is “0”, or when the count value is smaller than the threshold; this closes the switch 5044 in FIG. 54 and increments the counter 5049.
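The update decision reduces to a two-input OR. A minimal sketch follows; the function name `update_decision` and the example values are hypothetical. The text does not say what the comparison outputs when the count value equals the threshold, so treating that case as “0” is an assumption.

```python
def update_decision(count: int, count_threshold: int, speech_flag: int) -> int:
    """Control signal of the update determination unit: 1 means 'update'."""
    below_threshold = 1 if count < count_threshold else 0   # comparison unit 5203
    not_speech = 1 - speech_flag                             # logical NOT circuit 5202
    return below_threshold | not_speech                      # logical OR unit 5211

# (count, speech_flag) pairs: start-up during speech, later speech, later non-speech
signals = [update_decision(c, 5, f) for c, f in [(0, 1), (10, 1), (10, 0)]]
```

During start-up the estimate is updated regardless of the speech flag; afterwards it is updated only in non-speech frames.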

【００４１】図４８における周波数別ＳＮＲ計算部６に
ついて説明する。図５６は、周波数別ＳＮＲ計算部６の
構成を示すブロック図である。周波数別ＳＮＲ計算部６
は、Ｋ個の除算部６０１₀ 〜６０１_K-1 、分離部６０
２，６０３、多重化部６０４を有する。図４８における
多重乗算部１７から供給される劣化音声パワースペクト
ルは、分離部６０２に伝達される。図４８における推定
雑音計算部５１から供給される推定雑音パワースペクト
ルは、分離部６０３に伝達される。劣化音声パワースペ
クトルは分離部６０２において、推定雑音パワースペク
トルは分離部６０３において、それぞれ周波数成分に対
応したＫサンプルに分離され、それぞれ除算部６０１₀
〜６０１_K-1 に供給される。除算部６０１₀ 〜６０１
_K-1 では、式（１１）に従って、供給された劣化音声パ
ワースペクトル｜Ｙ_n(ｋ）｜²を推定雑音パワースペク
トルλ_n(ｋ）で除算して周波数別ＳＮＲγ_n(ｋ）を求
め、多重化部６０４に伝達する。多重化部６０４は、伝
達されたＫ個の周波数別ＳＮＲγ _n(ｋ）を多重化して、
後天的ＳＮＲとして出力する。The frequency-dependent SNR calculation unit 6 in FIG. 48 will be described. FIG. 56 is a block diagram showing the configuration of the frequency-dependent SNR calculation unit 6. The frequency-dependent SNR calculation unit 6 includes K division units 601_0 to 601_{K-1}, separation units 602 and 603, and a multiplexing unit 604. The degraded speech power spectrum supplied from the multiple multiplication unit 17 in FIG. 48 is transmitted to the separation unit 602. The estimated noise power spectrum supplied from the estimated noise calculation unit 51 in FIG. 48 is transmitted to the separation unit 603. The degraded speech power spectrum is separated in the separation unit 602, and the estimated noise power spectrum in the separation unit 603, into K samples corresponding to the frequency components, and these samples are supplied to the division units 601_0 to 601_{K-1}, respectively. In accordance with equation (11), the division units 601_0 to 601_{K-1} divide the supplied degraded speech power spectrum |Y_n(k)|² by the estimated noise power spectrum λ_n(k) to obtain the frequency-dependent SNR γ_n(k), and transmit it to the multiplexing unit 604. The multiplexing unit 604 multiplexes the K frequency-dependent SNRs γ_n(k) and outputs them as the a posteriori SNR.

【００４２】[0042]

【数１１】 [Equation 11]
$$\gamma_n(k) = \frac{|Y_n(k)|^2}{\lambda_n(k)} \qquad (11)$$
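The per-frequency division performed by the K division units can be sketched as follows. This is an illustrative sketch only; the function name `a_posteriori_snr` and the small `floor` guard against division by zero are assumptions that do not appear in the description.

```python
def a_posteriori_snr(noisy_power, noise_power, floor=1e-12):
    """Per-bin ratio |Y_n(k)|^2 / lambda_n(k) for k = 0, ..., K-1."""
    if len(noisy_power) != len(noise_power):
        raise ValueError("both spectra must have the same number of bins K")
    # One division per frequency bin; `floor` (hypothetical) avoids
    # dividing by a zero noise estimate.
    return [y / max(lam, floor) for y, lam in zip(noisy_power, noise_power)]
```

The list returned here plays the role of the multiplexed output of the multiplexing unit 604.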

【００４３】図４８における推定先天的ＳＮＲ計算部７
について説明する。図５７は、推定先天的ＳＮＲ計算部
７の構成を示すブロック図である。推定先天的ＳＮＲ計
算部７は、多重値域限定処理部７０１、後天的ＳＮＲ記
憶部７０２、抑圧係数記憶部７０３、多重乗算部７０
４，７０５、重み記憶部７０６、多重重みつき加算部７
０７、加算器７０８を有する。図４８における周波数別
ＳＮＲ計算部６から供給される後天的ＳＮＲγ_n(ｋ）
（ｋ＝０，１，....，Ｋ−１）は、加算器７０８の一方
の端子と、後天的ＳＮＲ記憶部７０２に伝達される。後
天的ＳＮＲ記憶部７０２は、第ｎフレームにおける後天
的ＳＮＲγ_n(ｋ）を記憶すると共に、第ｎ−１フレーム
における後天的ＳＮＲγ_n-1(ｋ）を多重乗算部７０５に
伝達する。The estimated a priori SNR calculation unit 7 in FIG. 48 will be described. FIG. 57 is a block diagram showing the configuration of the estimated a priori SNR calculation unit 7. The estimated a priori SNR calculation unit 7 includes a multiple range limiting processing unit 701, an a posteriori SNR storage unit 702, a suppression coefficient storage unit 703, multiple multiplication units 704 and 705, a weight storage unit 706, a multiple weighted addition unit 707, and an adder 708. The a posteriori SNR γ_n(k) (k = 0, 1, ..., K−1) supplied from the frequency-dependent SNR calculation unit 6 in FIG. 48 is transmitted to one terminal of the adder 708 and to the a posteriori SNR storage unit 702. The a posteriori SNR storage unit 702 stores the a posteriori SNR γ_n(k) of the n-th frame and transmits the a posteriori SNR γ_{n-1}(k) of the (n−1)-th frame to the multiple multiplication unit 705.

【００４４】図４８における雑音抑圧係数生成部８から
供給される抑圧係数Ｇ_n(ｋ）バー（ｋ＝０，１，....，
Ｋ−１）は、抑圧係数記憶部７０３に伝達される。抑圧
係数記憶部７０３は、第ｎフレームにおける抑圧係数Ｇ
_n(ｋ）バーを記憶すると共に、第ｎ−１フレームにおけ
る抑圧係数Ｇ_n-1(ｋ）バーを多重乗算部７０４に伝達す
る。多重乗算部７０４は、供給されたＧ_n-1(ｋ）バーを
２乗してＧ² _n-1（ｋ）バーを求め、多重乗算部７０５に
伝達する。多重乗算部７０５は、Ｇ² _n-1（ｋ）バーとγ
_n-1(ｋ）をｋ＝０，１，....，Ｋ−１に対して乗算して
Ｇ² _n-1（ｋ）バーγ_n-1(ｋ）を求め、その結果を多重重
みつき加算部７０７に過去の推定ＳＮＲ９２２として伝
達する。多重乗算部７０４及び７０５の構成は、既に図
５２を用いて説明した多重乗算部１７に等しいので、詳
細な説明は省略する。The suppression coefficient Ḡ_n(k) (G with an overbar; k = 0, 1, ..., K−1) supplied from the noise suppression coefficient generation unit 8 in FIG. 48 is transmitted to the suppression coefficient storage unit 703. The suppression coefficient storage unit 703 stores the suppression coefficient Ḡ_n(k) of the n-th frame and transmits the suppression coefficient Ḡ_{n-1}(k) of the (n−1)-th frame to the multiple multiplication unit 704. The multiple multiplication unit 704 squares the supplied Ḡ_{n-1}(k) to obtain Ḡ²_{n-1}(k) and transmits it to the multiple multiplication unit 705. The multiple multiplication unit 705 multiplies Ḡ²_{n-1}(k) by γ_{n-1}(k) for k = 0, 1, ..., K−1 to obtain Ḡ²_{n-1}(k)γ_{n-1}(k), and transmits the result to the multiple weighted addition unit 707 as the past estimated SNR 922. The configurations of the multiple multiplication units 704 and 705 are the same as that of the multiple multiplication unit 17 already described with reference to FIG. 52, so a detailed description is omitted.

【００４５】加算器７０８の他方の端子には−１が供給
されており、加算結果γ_n(ｋ）−１が多重値域限定処理
部７０１に伝達される。多重値域限定処理部７０１は、
加算器７０８から供給された加算結果γ_n(ｋ）−１に値
域限定演算子Ｐ［・］による演算を施し、その結果であ
るＰ［γ_n(ｋ）−１］を多重重みつき加算部７０７に瞬
時推定ＳＮＲ９２１として伝達する。ただし、Ｐ［ｘ］
は式（１２）で定められる。The value −1 is supplied to the other terminal of the adder 708, and the addition result γ_n(k) − 1 is transmitted to the multiple range limiting processing unit 701. The multiple range limiting processing unit 701 applies the range limiting operator P[·] to the addition result γ_n(k) − 1 supplied from the adder 708, and transmits the result P[γ_n(k) − 1] to the multiple weighted addition unit 707 as the instantaneous estimated SNR 921. Here, P[x] is defined by equation (12).

【００４６】[0046]

【数１２】 [Equation 12]
$$P[x] = \begin{cases} x, & x > 0 \\ 0, & \text{otherwise} \end{cases} \qquad (12)$$

【００４７】多重重みつき加算部７０７には、また、重
み記憶部７０６から重み９２３が供給されている。多重
重みつき加算部７０７は、これらの供給された瞬時推定
ＳＮＲ９２１、過去の推定ＳＮＲ９２２、重み９２３を
用いて推定先天的ＳＮＲ９２４を求める。重み９２３を
αとし、ξ_n(ｋ）ハットを推定先天的ＳＮＲとすると、
ξ_n(ｋ）ハットは、式（１３）によって計算される。こ
こに、右辺第１項の初期値（ｎ＝０）を、γ_-1（ｋ）Ｇ
² _-1(ｋ）バー＝１とする。The multiple weighted addition unit 707 is also supplied with a weight 923 from the weight storage unit 706. The multiple weighted addition unit 707 obtains an estimated a priori SNR 924 from the supplied instantaneous estimated SNR 921, past estimated SNR 922, and weight 923. Let α denote the weight 923 and ξ̂_n(k) (ξ with a hat) the estimated a priori SNR; then ξ̂_n(k) is calculated by equation (13). The initial value (n = 0) of the first term on the right-hand side is set to γ_{-1}(k)Ḡ²_{-1}(k) = 1.

【００４８】[0048]

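【数１３】 [Equation 13]
$$\hat{\xi}_n(k) = \alpha\,\gamma_{n-1}(k)\,\bar{G}^2_{n-1}(k) + (1-\alpha)\,P[\gamma_n(k) - 1] \qquad (13)$$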
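The estimation of the a priori SNR described in the preceding paragraphs can be sketched as below. The sketch assumes that equation (13) is a weighted sum in which the weight α multiplies the past estimated SNR Ḡ²_{n-1}(k)γ_{n-1}(k) and (1 − α) multiplies the instantaneous estimated SNR P[γ_n(k) − 1]; the function names and the default `alpha=0.98` are hypothetical.

```python
def range_limit(x):
    """Range limiting operator P[x]: pass positive values, clamp the rest to 0."""
    return x if x > 0.0 else 0.0


def estimate_a_priori_snr(gamma_now, gamma_prev, gain_prev, alpha=0.98):
    """Weighted sum of the past and instantaneous SNR estimates per bin.

    gamma_now  : a posteriori SNR of frame n
    gamma_prev : a posteriori SNR of frame n-1
    gain_prev  : suppression coefficients of frame n-1
    alpha      : weight 923 (the default value is an assumption)
    """
    estimates = []
    for g_now, g_prev, gain in zip(gamma_now, gamma_prev, gain_prev):
        past = gain * gain * g_prev            # past estimated SNR 922
        instant = range_limit(g_now - 1.0)     # instantaneous estimated SNR 921
        estimates.append(alpha * past + (1.0 - alpha) * instant)
    return estimates
```

For the first frame, the caller would pass values for `gamma_prev` and `gain_prev` whose product `gain**2 * gamma` equals 1, matching the initial condition stated above.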

【００４９】図５８は、図５７に示した推定先天的ＳＮ
Ｒ計算部７に含まれる多重値域限定処理部７０１の構成
を示すブロック図である。多重値域限定処理部７０１
は、定数記憶部７０１１、Ｋ個の最大値選択部７０１２
₀ 〜７０１２_K-1 、分離部７０１３、多重化部７０１４
を有する。分離部７０１３には、図５７における加算器
７０８から、γ_n(ｋ）−１が供給される。分離部７０１
３は、供給されたγ_n(ｋ）−１をＫ個の周波数別成分に
分離し、それぞれ最大値選択部７０１２₀ 〜７０１２
_K-1 の一方の入力に供給する。最大値選択部７０１２₀
〜７０１２_K-1の他方の入力には、定数記憶部７０１１
からゼロが供給されている。最大値選択部７０１２₀ 〜
７０１２_K-1 は、γ_n(ｋ）−１をゼロと比較し、大きい
方の値を多重化部７０１４へ伝達する。この最大値選択
演算は、式（１２）を実行することに相当する。多重化
部７０１４は、これらの値を多重化して出力する。FIG. 58 is a block diagram showing the configuration of the multiple range limiting processing unit 701 included in the estimated a priori SNR calculation unit 7 shown in FIG. 57. The multiple range limiting processing unit 701 includes a constant storage unit 7011, K maximum value selection units 7012_0 to 7012_{K-1}, a separation unit 7013, and a multiplexing unit 7014. The separation unit 7013 is supplied with γ_n(k) − 1 from the adder 708 in FIG. 57. The separation unit 7013 separates the supplied γ_n(k) − 1 into K frequency-dependent components and supplies them to one input of the maximum value selection units 7012_0 to 7012_{K-1}, respectively. The other input of the maximum value selection units 7012_0 to 7012_{K-1} is supplied with zero from the constant storage unit 7011. Each of the maximum value selection units 7012_0 to 7012_{K-1} compares γ_n(k) − 1 with zero and transmits the larger value to the multiplexing unit 7014. This maximum value selection is equivalent to evaluating equation (12). The multiplexing unit 7014 multiplexes these values and outputs them.

【００５０】図５９は、図５７に示した推定先天的ＳＮ
Ｒ計算部７に含まれる多重重みつき加算部７０７の構成
を示すブロック図である。多重重みつき加算部７０７
は、Ｋ個の重みつき加算部７０７１₀ 〜７０７１_K-1 、
分離部７０７２，７０７４、多重化部７０７５を有す
る。FIG. 59 is a block diagram showing the configuration of the multiple weighted addition unit 707 included in the estimated a priori SNR calculation unit 7 shown in FIG. 57. The multiple weighted addition unit 707 includes K weighted addition units 7071_0 to 7071_{K-1}, separation units 7072 and 7074, and a multiplexing unit 7075.

【００５１】分離部７０７２には、図５７における多重
値域限定処理部７０１から、Ｐ［γ _n(ｋ）−１］が瞬時
推定ＳＮＲ９２１として供給される。分離部７０７２
は、Ｐ［γ_n(ｋ）−１］をＫ個の周波数別成分に分離
し、周波数別瞬時推定ＳＮＲ９２１₀ 〜９２１_K-1 とし
て、それぞれ重みつき加算部７０７１₀ 〜７０７１_K-1
に伝達する。分離部７０７４には、図５７における多重
乗算部７０５から、Ｇ² _n-1（ｋ）バーγ_n-1(ｋ）が過去
の推定ＳＮＲ９２２として供給される。分離部７０７４
は、Ｇ² _n-1（ｋ）バーγ_n-1(ｋ）をＫ個の周波数別成分
に分離し、過去の周波数別推定ＳＮＲ９２２₀ 〜９２２
_K-1 として、それぞれ重みつき加算部７０７１₀ 〜７０
７１_K-1 に伝達する。一方、重みつき加算部７０７１₀
〜７０７１_K- ₁ には、重み９２３も供給される。重みつ
き加算部７０７１₀ 〜７０７１_K-1 は、式（１３）によ
って表される重みつき加算を実行し、周波数別推定先天
的ＳＮＲ９２４₀ 〜９２４_K-1 を多重化部７０７５に伝
達する。多重化部７０７５は、周波数別推定先天的ＳＮ
Ｒ９２４₀ 〜９２４_K-1 を多重化し、推定先天的ＳＮＲ
９２４として出力する。重みつき加算部７０７１₀ 〜７
０７１_K-1 の構成と動作は、既に図５１を用いて説明し
た重みつき加算部４０７と等しいので、詳細な説明は省
略する。但し、重みつき加算の計算は常に行なわれる。The separation unit 7072 is supplied with P[γ_n(k) − 1] as the instantaneous estimated SNR 921 from the multiple range limiting processing unit 701 in FIG. 57. The separation unit 7072 separates P[γ_n(k) − 1] into K frequency-dependent components and transmits them to the weighted addition units 7071_0 to 7071_{K-1} as the frequency-dependent instantaneous estimated SNRs 921_0 to 921_{K-1}, respectively. The separation unit 7074 is supplied with Ḡ²_{n-1}(k)γ_{n-1}(k) as the past estimated SNR 922 from the multiple multiplication unit 705 in FIG. 57. The separation unit 7074 separates Ḡ²_{n-1}(k)γ_{n-1}(k) into K frequency-dependent components and transmits them to the weighted addition units 7071_0 to 7071_{K-1} as the past frequency-dependent estimated SNRs 922_0 to 922_{K-1}, respectively. The weight 923 is also supplied to the weighted addition units 7071_0 to 7071_{K-1}. The weighted addition units 7071_0 to 7071_{K-1} perform the weighted addition expressed by equation (13) and transmit the frequency-dependent estimated a priori SNRs 924_0 to 924_{K-1} to the multiplexing unit 7075. The multiplexing unit 7075 multiplexes the frequency-dependent estimated a priori SNRs 924_0 to 924_{K-1} and outputs the result as the estimated a priori SNR 924. The configuration and operation of the weighted addition units 7071_0 to 7071_{K-1} are the same as those of the weighted addition unit 407 already described with reference to FIG. 51, so a detailed description is omitted. Note, however, that the weighted addition is always performed.

【００５２】図４８における雑音抑圧係数生成部８につ
いて説明する。図６０は、雑音抑圧係数生成部８の構成
を示すブロック図である。雑音抑圧係数生成部８は、Ｋ
個の抑圧係数検索部８０１₀ 〜８０１_K-1 、分離部８０
２，８０３、多重化部８０４を有する。分離部８０２に
は、図４８における周波数別ＳＮＲ計算部６から後天的
ＳＮＲが供給される。分離部８０２は、供給された後天
的ＳＮＲをＫ個の周波数別成分に分離し、それぞれ抑圧
係数検索部８０１₀ 〜８０１_K-1 に伝達する。分離部８
０３には、図４８における推定先天的ＳＮＲ計算部７か
ら推定先天的ＳＮＲが供給される。分離部８０３は、供
給された推定先天的ＳＮＲをＫ個の周波数別成分に分離
し、それぞれ抑圧係数検索部８０１₀ 〜８０１_K-1 に伝
達する。抑圧係数検索部８０１₀ 〜８０１_K-1 は、供給
された後天的ＳＮＲと推定先天的ＳＮＲに対応した抑圧
係数を検索し、検索結果を多重化部８０４に伝達する。
多重化部８０４は、供給された抑圧係数を多重化して出
力する。The noise suppression coefficient generation unit 8 in FIG. 48 will be described. FIG. 60 is a block diagram showing the configuration of the noise suppression coefficient generation unit 8. The noise suppression coefficient generation unit 8 includes K suppression coefficient search units 801_0 to 801_{K-1}, separation units 802 and 803, and a multiplexing unit 804. The separation unit 802 is supplied with the a posteriori SNR from the frequency-dependent SNR calculation unit 6 in FIG. 48. The separation unit 802 separates the supplied a posteriori SNR into K frequency-dependent components and transmits them to the suppression coefficient search units 801_0 to 801_{K-1}, respectively. The separation unit 803 is supplied with the estimated a priori SNR from the estimated a priori SNR calculation unit 7 in FIG. 48. The separation unit 803 separates the supplied estimated a priori SNR into K frequency-dependent components and transmits them to the suppression coefficient search units 801_0 to 801_{K-1}, respectively. The suppression coefficient search units 801_0 to 801_{K-1} search for the suppression coefficients corresponding to the supplied a posteriori SNR and estimated a priori SNR, and transmit the search results to the multiplexing unit 804. The multiplexing unit 804 multiplexes the supplied suppression coefficients and outputs them.

【００５３】図６１は、図６０に示した雑音抑圧係数生
成部８に含まれる抑圧係数検索部８０１₀ 〜８０１_K-1
の構成を示すブロック図である。抑圧係数検索部８０１
は、抑圧係数テーブル８０１１、アドレス変換部８０１
２，８０１３を有する。アドレス変換部８０１２には、
図６０における分離部８０２から、周波数別後天的ＳＮ
Ｒが供給される。アドレス変換部８０１２は、供給され
た周波数別後天的ＳＮＲを対応したアドレスに変換し、
抑圧係数テーブル８０１１に伝達する。アドレス変換部
８０１３には、図６０における分離部８０３から、周波
数別推定先天的ＳＮＲが供給される。アドレス変換部８
０１３は、供給された周波数別推定先天的ＳＮＲを対応
したアドレスに変換し、抑圧係数テーブル８０１１に伝
達する。抑圧係数テーブル８０１１は、アドレス変換部
８０１２とアドレス変換部８０１３から供給されたアド
レスに対応した領域に格納されている抑圧係数を、周波
数別抑圧係数として出力する。ここでは、特定の統計モ
デルに従う背景雑音を仮定して導出した抑制係数が用い
られている。FIG. 61 is a block diagram showing the configuration of the suppression coefficient search units 801_0 to 801_{K-1} included in the noise suppression coefficient generation unit 8 shown in FIG. 60. The suppression coefficient search unit 801 includes a suppression coefficient table 8011 and address conversion units 8012 and 8013. The address conversion unit 8012 is supplied with the frequency-dependent a posteriori SNR from the separation unit 802 in FIG. 60. The address conversion unit 8012 converts the supplied frequency-dependent a posteriori SNR into a corresponding address and transmits it to the suppression coefficient table 8011. The address conversion unit 8013 is supplied with the frequency-dependent estimated a priori SNR from the separation unit 803 in FIG. 60. The address conversion unit 8013 converts the supplied frequency-dependent estimated a priori SNR into a corresponding address and transmits it to the suppression coefficient table 8011. The suppression coefficient table 8011 outputs, as the frequency-dependent suppression coefficient, the suppression coefficient stored in the area corresponding to the addresses supplied from the address conversion units 8012 and 8013. Here, suppression coefficients derived by assuming background noise that follows a specific statistical model are used.
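The table search described above, with two address conversions feeding one two-dimensional table, might be sketched as follows. Everything numeric here is an assumption made for illustration: the dB grid, the clamping behaviour, and in particular `placeholder_gain`, which is a simple Wiener-style stand-in and not the suppression coefficient rule of the description.

```python
import math

# Hypothetical address grid: SNR quantised in 1 dB steps from -20 dB to 40 dB.
SNR_MIN_DB, SNR_MAX_DB, STEP_DB = -20.0, 40.0, 1.0
N_ADDR = int((SNR_MAX_DB - SNR_MIN_DB) / STEP_DB) + 1


def to_address(snr):
    """Address conversion: map a linear SNR to a clamped table index."""
    db = 10.0 * math.log10(max(snr, 1e-12))
    idx = int(round((db - SNR_MIN_DB) / STEP_DB))
    return min(max(idx, 0), N_ADDR - 1)


def build_table(gain_rule):
    """Precompute gains for every (a posteriori, a priori) address pair."""
    def linear(i):
        return 10.0 ** ((SNR_MIN_DB + i * STEP_DB) / 10.0)
    return [[gain_rule(linear(i), linear(j)) for j in range(N_ADDR)]
            for i in range(N_ADDR)]


def placeholder_gain(gamma, xi):
    # Stand-in rule for illustration only; it ignores gamma.
    return xi / (1.0 + xi)


TABLE = build_table(placeholder_gain)


def lookup_gain(gamma, xi):
    """Suppression coefficient for one bin, read from the precomputed table."""
    return TABLE[to_address(gamma)][to_address(xi)]
```

Precomputing the table trades memory for run-time cost, which is presumably why a search rather than a direct evaluation is described; the table contents depend entirely on the statistical model chosen.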

【００５４】[0054]

【発明が解決しようとする課題】このように、従来のノ
イズ除去装置及び方法では、特定の統計モデルに従う背
景雑音を仮定して導出した抑圧係数を用いて雑音抑圧を
行なっていたため、その統計モデルに従わない雑音を効
果的に除去することができなかった。このため、十分高
い強調音声の品質を達成できなかった。また、従来のノ
イズ除去装置及び方法では、逆フーリエ変換して得られ
た時間領域信号の隣接する２フレームから取り出した信
号サンプルを重ね合わせ加算することによって、強調音
声を得ていた。一方、フーリエ変換前に時間領域信号に
かける窓関数は、雑音抑圧処理を行なわないときに、入
力が出力において再現されるように設計されていた。こ
のため、重ね合わせ加算の対象となった信号サンプル
が、隣接するフレームにおいて異なった抑圧係数値で抑
圧されると、フレーム境界において信号サンプルに不連
続性を生じ、出力信号に発生する雑音によって音質が劣
化してしまっていた。As described above, the conventional noise removing apparatus and method suppress noise using suppression coefficients derived by assuming background noise that follows a specific statistical model, and therefore cannot effectively remove noise that does not follow that model. For this reason, a sufficiently high quality of the emphasized speech could not be achieved. Further, in the conventional noise removing apparatus and method, the emphasized speech is obtained by overlap-adding the signal samples extracted from two adjacent frames of the time domain signal obtained by the inverse Fourier transform. Meanwhile, the window function applied to the time domain signal before the Fourier transform was designed so that the input is reproduced at the output when no noise suppression is performed. Consequently, when the signal samples subject to overlap-addition are suppressed with different suppression coefficient values in adjacent frames, discontinuities arise in the signal samples at frame boundaries, and the resulting noise in the output signal degrades the sound quality.

【００５５】以上のように従来のノイズ除去装置及び方
法には、優れた音質の強調音声を得ることができないと
いう問題があった。本発明はこのような課題を解決する
ためになされたものであり、その目的は、優れた音質の
強調音声を得ることができるノイズ除去装置及び方法を
提供することにある。As described above, the conventional noise removing apparatus and method have a problem that it is not possible to obtain an emphasized voice having excellent sound quality. The present invention has been made to solve such a problem, and an object of the present invention is to provide a noise removing apparatus and method capable of obtaining an emphasized voice with excellent sound quality.

【００５６】[0056]

【課題を解決するための手段】このような目的を達成す
るために、本発明のノイズ除去方法は、入力信号に基づ
いて擬似的な雑音を生成し、この擬似的な雑音を注入し
て得られた抑圧係数を用いることを特徴とする。抑圧係
数を定めるときに上述した擬似的な雑音を注入すること
により、特定の統計モデルに従う背景雑音を仮定して導
出した抑圧係数を、入力信号に応じて補正することがで
きる。In order to achieve such an object, the noise removing method of the present invention is characterized by generating pseudo noise based on an input signal and using a suppression coefficient obtained by injecting this pseudo noise. By injecting the pseudo noise when determining the suppression coefficient, a suppression coefficient derived by assuming background noise that follows a specific statistical model can be corrected according to the input signal.

【００５７】より具体的には、本発明のノイズ除去方法
は、入力信号を周波数領域信号に変換し、この周波数領
域信号に基づいて擬似的な第１の雑音を計算し、この第
１の雑音を周波数領域信号に付加し、第１の雑音を付加
した周波数領域信号を用いて信号対雑音比を求め、この
信号対雑音比に基づいて抑圧係数を定め、この抑圧係数
を用いて周波数領域信号を重みづけし、この重みづけし
た周波数領域信号を時間領域信号に変換することによっ
て、入力信号からノイズを除去した出力信号を得ること
を特徴とする。More specifically, the noise removing method of the present invention is characterized by converting an input signal into a frequency domain signal, calculating pseudo first noise based on the frequency domain signal, adding the first noise to the frequency domain signal, obtaining a signal-to-noise ratio using the frequency domain signal to which the first noise has been added, determining a suppression coefficient based on this signal-to-noise ratio, weighting the frequency domain signal using this suppression coefficient, and converting the weighted frequency domain signal into a time domain signal, thereby obtaining an output signal in which noise has been removed from the input signal.

【００５８】このノイズ除去方法において、周波数領域
信号に対する第１の雑音の付加を、入力信号の性質に応
じて選択的に行なってもよい。これにより、例えば抑圧
係数の導出に用いられた統計モデルに従わない雑音を含
む信号が入力された場合だけ第１の雑音を付加し、抑圧
係数の補正を選択的に行うことができる。ここで、入力
信号の性質として、信号の定常性を用いてもよい。言う
なれば、信号の性質、例えば平均パワーやスペクトル形
状等が、時間と共にどの程度変化するかを基準として、
第１の雑音の付加を行ってもよい。信号の定常性として
は、入力信号の振幅がゼロとなるゼロ交叉の数を用いて
もよいし、このゼロ交差の数と相関を示す前記周波数領
域信号の高域電力を用いてもよい。In this noise removing method, the first noise may be added to the frequency domain signal selectively, depending on the nature of the input signal. This makes it possible, for example, to add the first noise, and thus correct the suppression coefficient, only when a signal containing noise that does not follow the statistical model used for deriving the suppression coefficient is input. Here, the stationarity of the signal may be used as the nature of the input signal. In other words, the first noise may be added based on how much the characteristics of the signal, such as average power and spectral shape, change with time. As a measure of the stationarity of the signal, the number of zero crossings, at which the amplitude of the input signal becomes zero, may be used, or the high-band power of the frequency domain signal, which correlates with the number of zero crossings, may be used.
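The zero-crossing count mentioned above as a stationarity measure can be computed per frame as in the following sketch. The function name and the handling of exact zeros (they are skipped, and the sign of the last non-zero sample is kept) are assumptions; how the count is compared against a threshold is not specified in this passage and is left to the caller.

```python
def zero_crossings(frame):
    """Count sign changes between consecutive non-zero samples of one frame."""
    count = 0
    prev = 0.0
    for x in frame:
        if x == 0.0:
            continue  # assumption: exact zeros do not count on their own
        if prev != 0.0 and (x > 0.0) != (prev > 0.0):
            count += 1
        prev = x
    return count
```

A frame that alternates in sign at every sample gives the maximum count, while a frame of constant sign gives zero.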

【００５９】また、入力信号を変換した周波数領域信号
に基づいて周波数領域信号に含まれる第２の雑音を推定
し、この第２の雑音と周波数領域信号とを用いて第１の
雑音のパワーを定めるようにしてもよい。また、入力信
号を変換した周波数領域信号に基づいて周波数領域信号
に含まれる第２の雑音を推定し、この第２の雑音と周波
数領域信号とを用いて第１の雑音を計算し、この第１の
雑音と周波数領域信号との和、及び第１の雑音と第２の
雑音との和を用いて信号対雑音比を求めるようにしても
よい。ここで、入力信号を変換した周波数領域信号を重
みづけし、この重みづけした周波数領域信号に基づいて
第２の雑音を推定するようにしてもよい。Further, second noise contained in the frequency domain signal may be estimated based on the frequency domain signal obtained by converting the input signal, and the power of the first noise may be determined using the second noise and the frequency domain signal. Also, the second noise contained in the frequency domain signal may be estimated based on the frequency domain signal obtained by converting the input signal, the first noise may be calculated using the second noise and the frequency domain signal, and the signal-to-noise ratio may be obtained using the sum of the first noise and the frequency domain signal and the sum of the first noise and the second noise. Here, the frequency domain signal obtained by converting the input signal may be weighted, and the second noise may be estimated based on the weighted frequency domain signal.
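As a rough illustration of obtaining the signal-to-noise ratio from the two sums described above, the sketch below adds the same injected (first) noise to both the noisy spectrum and the estimated (second) noise before dividing. How the first noise itself is calculated is not shown; the function name, the argument layout, and the `floor` guard are assumptions.

```python
def snr_with_injection(noisy_power, noise_power, injected, floor=1e-12):
    """Per-bin ratio (signal + first noise) / (second noise + first noise)."""
    result = []
    for y, lam, d in zip(noisy_power, noise_power, injected):
        numerator = y + d                   # first noise added to the signal
        denominator = max(lam + d, floor)   # first noise added to the second noise
        result.append(numerator / denominator)
    return result
```

With zero injected noise this reduces to the plain ratio; with positive injected noise and positive spectra, the ratio moves toward 1, which changes the suppression coefficient selected from it.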

【００６０】また、本発明のノイズ除去方法は、入力信
号を周波数領域信号に変換し、この周波数領域信号を用
いて信号対雑音比を求め、この信号対雑音比を周波数領
域信号に基づいて補正し、この補正した信号対雑音比に
基づいて抑圧係数を定め、この抑圧係数を用いて周波数
領域信号を重みづけし、この重みづけした周波数領域信
号を時間領域信号に変換することによって、入力信号か
らノイズを除去した出力信号を得ることを特徴とする。Further, the noise removing method of the present invention is characterized by converting an input signal into a frequency domain signal, obtaining a signal-to-noise ratio using this frequency domain signal, correcting this signal-to-noise ratio based on the frequency domain signal, determining a suppression coefficient based on the corrected signal-to-noise ratio, weighting the frequency domain signal using this suppression coefficient, and converting the weighted frequency domain signal into a time domain signal, thereby obtaining an output signal in which noise has been removed from the input signal.

【００６１】このノイズ除去方法において、信号対雑音
比の補正を、入力信号の性質に応じて選択的に行なって
もよい。これにより、例えば抑圧係数の導出に用いられ
た統計モデルに従わない雑音を含む信号が入力された場
合だけ信号対雑音比を補正し、抑圧係数の補正を選択的
に行うことができる。ここで、入力信号の性質として、
信号の定常性を用いてもよい。言うなれば、信号の性
質、例えば平均パワーやスペクトル形状等が、時間と共
にどの程度変化するかを基準として、信号対雑音比の補
正を行ってもよい。信号の定常性としては、入力信号の
振幅がゼロとなるゼロ交叉の数を用いてもよいし、この
ゼロ交差の数と相関を示す前記周波数領域信号の高域電
力を用いてもよい。In this noise removing method, the signal-to-noise ratio may be corrected selectively, depending on the nature of the input signal. This makes it possible, for example, to correct the signal-to-noise ratio, and thus the suppression coefficient, only when a signal containing noise that does not follow the statistical model used for deriving the suppression coefficient is input. Here, the stationarity of the signal may be used as the nature of the input signal. In other words, the signal-to-noise ratio may be corrected based on how much the characteristics of the signal, such as average power and spectral shape, change with time. As a measure of the stationarity of the signal, the number of zero crossings, at which the amplitude of the input signal becomes zero, may be used, or the high-band power of the frequency domain signal, which correlates with the number of zero crossings, may be used.

【００６２】また、入力信号を変換した周波数領域信号
に基づいて周波数領域信号に含まれる雑音を推定し、こ
の雑音と周波数領域信号とを用いて信号対雑音比の補正
量を定めるようにしてもよい。また、入力信号を変換し
た周波数領域信号に基づいて周波数領域信号に含まれる
雑音を推定し、この雑音及び信号対雑音比を用いて加算
信号を求め、この加算信号と周波数領域信号との和、及
び加算信号と雑音との和を用いて信号対雑音比を再計算
することによって信号対雑音比の補正を行なうようにし
てもよい。ここで、入力信号を変換した周波数領域信号
を重みづけし、この重みづけした周波数領域信号に基づ
いて雑音を推定するようにしてもよい。Further, noise contained in the frequency domain signal may be estimated based on the frequency domain signal obtained by converting the input signal, and the correction amount of the signal-to-noise ratio may be determined using this noise and the frequency domain signal. Also, noise contained in the frequency domain signal may be estimated based on the frequency domain signal obtained by converting the input signal, an addition signal may be obtained using this noise and the signal-to-noise ratio, and the signal-to-noise ratio may be corrected by recalculating it using the sum of the addition signal and the frequency domain signal and the sum of the addition signal and the noise. Here, the frequency domain signal obtained by converting the input signal may be weighted, and the noise may be estimated based on the weighted frequency domain signal.

【００６３】また、上述したノイズ除去方法において、
周波数領域信号に基づいて抑圧係数を補正し、この補正
した抑圧係数を用いて周波数領域信号を重みづけするよ
うにしてもよい。これにより、信号対雑音比が低いとき
に抑圧不足により発生する残留雑音や、信号対雑音比が
高いときに過度の抑圧で発生する音声の歪みによる音質
劣化を防ぐことができる。また、上述したノイズ除去方
法において、周波数領域信号を変換した時間領域信号に
窓がけ処理を施してもよい。Further, in the above-described noise removing method, the suppression coefficient may be corrected based on the frequency domain signal, and the frequency domain signal may be weighted using the corrected suppression coefficient. This prevents the degradation in sound quality caused by residual noise arising from insufficient suppression when the signal-to-noise ratio is low, and by speech distortion arising from excessive suppression when the signal-to-noise ratio is high. In addition, in the above-described noise removing method, windowing may be applied to the time domain signal obtained by converting the frequency domain signal.

【００６４】また、本発明のノイズ除去方法は、入力信
号を周波数領域信号に変換し、この周波数領域信号に基
づいて周波数領域信号に含まれる第２の雑音を推定し、
その一方で周波数領域信号に基づいて擬似的な第１の雑
音を計算し、この第１の雑音を第２の雑音に付加した雑
音に対応した値を周波数領域信号から差し引いて周波数
領域の強調音声を求め、この強調音声を時間領域信号に
変換することによって、入力信号からノイズを除去した
出力信号を得ることを特徴とする。Further, the noise removing method of the present invention is characterized by converting an input signal into a frequency domain signal, estimating second noise contained in the frequency domain signal based on this frequency domain signal, calculating pseudo first noise based on the frequency domain signal, subtracting from the frequency domain signal a value corresponding to the noise obtained by adding the first noise to the second noise so as to obtain emphasized speech in the frequency domain, and converting this emphasized speech into a time domain signal, thereby obtaining an output signal in which noise has been removed from the input signal.
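The subtraction-based variant described above can be sketched as follows. The passage only says that a value corresponding to the sum of the first and second noise is subtracted; subtracting the sum itself, and clamping negative results to `floor`, are assumptions made to keep the sketch concrete.

```python
def subtract_noise(noisy_power, estimated_noise, injected_noise, floor=0.0):
    """Per-bin subtraction of (second noise + first noise) from the signal."""
    enhanced = []
    for y, lam, d in zip(noisy_power, estimated_noise, injected_noise):
        total_noise = lam + d  # first noise added to the second noise
        # Assumption: negative results are clamped to `floor`.
        enhanced.append(max(y - total_noise, floor))
    return enhanced
```

The returned values correspond to the emphasized speech in the frequency domain, which would then be converted back to a time domain signal.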

【００６５】このノイズ除去方法において、第２の雑音
に対する第１の雑音の付加を、入力信号の性質に応じて
選択的に行なってもよい。これにより、例えば抑圧係数
の導出に用いられた統計モデルに従わない雑音を含む信
号が入力された場合だけ第１の雑音を付加し、強調音声
の補正を選択的に行うことができる。ここで、入力信号
の性質として、信号の定常性を用いてもよい。言うなれ
ば、信号の性質、例えば平均パワーやスペクトル形状等
が、時間と共にどの程度変化するかを基準として、第１
の雑音の付加を行ってもよい。信号の定常性としては、
入力信号の振幅がゼロとなるゼロ交叉の数を用いてもよ
いし、このゼロ交差の数と相関を示す前記周波数領域信
号の高域電力を用いてもよい。In this noise removing method, the first noise may be added to the second noise selectively, depending on the nature of the input signal. This makes it possible, for example, to add the first noise, and thus correct the emphasized speech, only when a signal containing noise that does not follow the statistical model used for deriving the suppression coefficient is input. Here, the stationarity of the signal may be used as the nature of the input signal. In other words, the first noise may be added based on how much the characteristics of the signal, such as average power and spectral shape, change with time. As a measure of the stationarity of the signal, the number of zero crossings, at which the amplitude of the input signal becomes zero, may be used, or the high-band power of the frequency domain signal, which correlates with the number of zero crossings, may be used.

【００６６】また、第１の雑音のパワーを、周波数領域
信号と第２の雑音とを用いて定めるようにしてもよい。
また、入力信号を変換した周波数領域信号を重みづけ
し、この重みづけした周波数領域信号に基づいて第２の
雑音を推定するようにしてもよい。ここで、入力信号を
変換した周波数領域信号を用いて信号対雑音比を求め、
この信号対雑音比を用いて重みを求め、この重みを用い
て周波数領域信号を重みづけするようにしてもよい。こ
れにより、周波数領域信号に含まれる音声成分の影響を
小さくし、第２の雑音の推定より高精度に行うことがで
きる。例えば、入力信号を変換した周波数領域信号を用
いて信号対雑音比を求め、この信号対雑音比を非線形処
理関数によって処理して重みを求め、この重みを用いて
周波数領域信号を重みづけするようにしてもよい。ま
た、上述したノイズ除去方法において、周波数領域の強
調音声を変換した時間領域信号に窓がけ処理を施しても
よい。The power of the first noise may be determined using the frequency domain signal and the second noise. Further, the frequency domain signal obtained by converting the input signal may be weighted, and the second noise may be estimated based on the weighted frequency domain signal. Here, a signal-to-noise ratio may be obtained using the frequency domain signal obtained by converting the input signal, a weight may be obtained using this signal-to-noise ratio, and the frequency domain signal may be weighted using this weight. This reduces the influence of the speech component contained in the frequency domain signal and allows the second noise to be estimated with higher accuracy. For example, a signal-to-noise ratio may be obtained using the frequency domain signal obtained by converting the input signal, this signal-to-noise ratio may be processed by a non-linear processing function to obtain a weight, and the frequency domain signal may be weighted using this weight. In addition, in the above-described noise removing method, windowing may be applied to the time domain signal obtained by converting the emphasized speech in the frequency domain.

【００６７】また、本発明のノイズ除去方法は、周波数
領域の強調音声を変換した時間領域信号に窓がけ処理を
施すことを特徴とする。周波数領域の強調音声を変換し
た時間領域信号の隣接する２フレームを重ね合わせ加算
する場合に、重ね合わせ加算の対象となった信号サンプ
ルが各フレームにおいて異なった抑圧係数値で抑圧され
たとしても、各フレームを窓がけ処理してフレーム境界
における信号サンプルの振幅を小さくすることによっ
て、フレーム境界における信号サンプルの連続性を改善
することができる。Further, the noise removing method of the present invention is characterized in that windowing processing is applied to the time domain signal obtained by converting the emphasized speech in the frequency domain. When two adjacent frames of the time domain signal obtained by converting the emphasized speech in the frequency domain are superposed and added, even if the signal sample to be superposed and added is suppressed by different suppression coefficient values in each frame, The continuity of the signal samples at the frame boundaries can be improved by windowing each frame to reduce the amplitude of the signal samples at the frame boundaries.
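The effect of windowing each frame before overlap-addition can be illustrated with the sketch below. It models only the synthesis side: the analysis window applied before the Fourier transform is not included, and the sine-shaped `sine_window` is a hypothetical choice, since this passage does not specify the window shape.

```python
import math


def sine_window(n):
    """Hypothetical window that tapers toward zero at both frame edges."""
    return [math.sin(math.pi * (i + 0.5) / n) for i in range(n)]


def overlap_add(frames, hop, window=None):
    """Overlap-add equally sized frames, optionally windowing each one first."""
    n = len(frames[0])
    out = [0.0] * (hop * (len(frames) - 1) + n)
    for f, frame in enumerate(frames):
        for i, x in enumerate(frame):
            w = window[i] if window is not None else 1.0
            out[f * hop + i] += w * x
    return out


# Two frames suppressed with different gains (1.0 and 0.5), 50 % overlap.
frames = [[1.0] * 4, [0.5] * 4]
plain = overlap_add(frames, hop=2)
tapered = overlap_add(frames, hop=2, window=sine_window(4))
```

In this toy case the step at the point where the second frame starts is smaller in `tapered` than in `plain`, because the second frame enters with a small amplitude instead of its full value.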

【００６８】より具体的には、本発明のノイズ除去方法
は、入力信号を周波数領域信号に変換し、この周波数領
域信号を用いて信号対雑音比を求め、この信号対雑音比
に基づいて抑圧係数を定め、この抑圧係数を用いて周波
数領域信号を重みづけし、この重みづけした周波数領域
信号を時間領域信号に変換し、この時間領域信号に窓が
け処理を施すことを特徴とすことによって、入力信号か
らノイズを除去した出力信号を得ることを特徴とする。More specifically, the noise removing method of the present invention is characterized by converting an input signal into a frequency domain signal, obtaining a signal-to-noise ratio using this frequency domain signal, determining a suppression coefficient based on this signal-to-noise ratio, weighting the frequency domain signal using this suppression coefficient, converting the weighted frequency domain signal into a time domain signal, and applying windowing to this time domain signal, thereby obtaining an output signal in which noise has been removed from the input signal.

【００６９】また、本発明のノイズ除去方法は、入力信
号を周波数領域信号に変換し、この周波数領域信号に基
づいて周波数領域信号に含まれる第２の雑音を推定し、
この第２の雑音に対応した値を周波数領域信号から差し
引いて周波数領域の強調音声を求め、この強調音声を時
間領域信号に変換し、この時間領域信号に窓がけ処理を
施すことによって入力信号からノイズを除去した出力信
号を得ることを特徴とする。Further, the noise removing method of the present invention is characterized by converting an input signal into a frequency domain signal, estimating second noise contained in the frequency domain signal based on this frequency domain signal, subtracting a value corresponding to the second noise from the frequency domain signal to obtain emphasized speech in the frequency domain, converting this emphasized speech into a time domain signal, and applying windowing to this time domain signal, thereby obtaining an output signal in which noise has been removed from the input signal.

【００７０】また、本発明のノイズ除去装置は、入力信
号に窓がけ処理を施して出力する第１の窓がけ処理部
と、この第１の窓がけ処理部により窓がけ処理された入
力信号を周波数領域信号に変換し，振幅成分と位相成分
に分離して出力する変換部と、周波数領域信号の振幅成
分に基づいて周波数領域信号に含まれる第２の雑音を推
定して出力する推定雑音計算部と、第２の雑音と周波数
領域信号の振幅成分を用いて擬似的な第１の雑音を計算
して出力する注入雑音計算部と、第１の雑音と周波数領
域信号の振幅成分を加算して出力する第１の加算器と、
第１の雑音と第２の雑音を加算して出力する第２の加算
器と、第１の加算器の出力信号と第２の加算器の出力信
号とを受けて第１の信号対雑音比を求めて出力する第１
の信号対雑音比計算部と、第１の信号対雑音比に基づい
て抑圧係数を定めて出力する抑圧係数生成部と、抑圧係
数を用いて周波数領域信号の振幅成分を重みづけして出
力する第１の乗算部と、この第１の乗算部により重みづ
けされた周波数領域信号の振幅成分と周波数領域信号の
位相成分を時間領域信号に変換して出力する逆変換部
と、時間領域信号に窓がけ処理を施して出力する第２の
窓がけ処理部とを少なくとも具備することを特徴とす
る。Further, the noise removing apparatus of the present invention is characterized by comprising at least: a first windowing processing unit that applies windowing to an input signal and outputs the result; a conversion unit that converts the input signal windowed by the first windowing processing unit into a frequency domain signal, separates it into an amplitude component and a phase component, and outputs them; an estimated noise calculation unit that estimates second noise contained in the frequency domain signal based on the amplitude component of the frequency domain signal and outputs it; an injected noise calculation unit that calculates pseudo first noise using the second noise and the amplitude component of the frequency domain signal and outputs it; a first adder that adds the first noise and the amplitude component of the frequency domain signal and outputs the sum; a second adder that adds the first noise and the second noise and outputs the sum; a first signal-to-noise ratio calculation unit that receives the output signal of the first adder and the output signal of the second adder, obtains a first signal-to-noise ratio, and outputs it; a suppression coefficient generation unit that determines a suppression coefficient based on the first signal-to-noise ratio and outputs it; a first multiplication unit that weights the amplitude component of the frequency domain signal using the suppression coefficient and outputs the result; an inverse conversion unit that converts the amplitude component of the frequency domain signal weighted by the first multiplication unit and the phase component of the frequency domain signal into a time domain signal and outputs it; and a second windowing processing unit that applies windowing to the time domain signal and outputs the result.

【００７１】ここで、注入雑音計算部は、入力信号が入
力され，入力信号の振幅がゼロとなるゼロ交叉の数を計
算し，その計算結果に応じた制御信号を出力するゼロ交
叉計算部と、このゼロ交叉計算部から入力された制御信
号によって第１の雑音を選択的にゼロに設定するスイッ
チとを含む構成としてもよい。また、注入雑音計算部
は、変換部から入力された周波数領域信号の振幅成分の
高域電力を計算し，その計算結果に応じた制御信号を出
力する高域電力計算部と、この高域電力計算部から入力
された制御信号によって第１の雑音を選択的にゼロに設
定するスイッチとを含む構成としてもよい。Here, the injection noise calculation unit may include a zero-crossing calculation unit that receives the input signal, counts the zero crossings at which the amplitude of the input signal becomes zero, and outputs a control signal according to the count, and a switch that selectively sets the first noise to zero according to the control signal supplied from the zero-crossing calculation unit. Alternatively, the injection noise calculation unit may include a high-band power calculation unit that calculates the high-band power of the amplitude component of the frequency-domain signal supplied from the conversion unit and outputs a control signal according to the result, and a switch that selectively sets the first noise to zero according to the control signal supplied from the high-band power calculation unit.
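The zero-crossing variant can be illustrated with a minimal Python sketch. It is not taken from the specification: the function names, the `max_crossings` parameter, and in particular the direction of the decision (noise is zeroed when the count exceeds the limit) are assumptions made for illustration, since the text states only that the control signal depends on the count.

```python
def count_zero_crossings(frame):
    """Count sign changes between consecutive nonzero samples of one frame."""
    crossings = 0
    prev = 0.0
    for x in frame:
        if x == 0.0:
            continue  # exact zeros carry no sign; wait for the next nonzero sample
        if prev != 0.0 and (x > 0.0) != (prev > 0.0):
            crossings += 1
        prev = x
    return crossings


def gate_injection_noise(frame, injection_noise, max_crossings):
    """Pass the injection noise through, or replace it with zeros.

    ASSUMPTION: the switch zeroes the noise when the zero-crossing count
    exceeds max_crossings. The source does not state the direction or the
    value of this decision rule.
    """
    if count_zero_crossings(frame) > max_crossings:
        return [0.0] * len(injection_noise)
    return list(injection_noise)
```

The high-band power variant would have the same shape, with the count replaced by a power measure computed from the amplitude component.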

【００７２】また、上述したノイズ除去装置は、周波数
領域信号の振幅成分を重みづけし，得られた重みつき振
幅成分を推定雑音計算部に出力し，推定雑音計算部に重
みつき振幅成分に基づいて第２の雑音を推定させる重み
つき劣化音声計算部を更に具備するものであってもよ
い。ここで、重みつき劣化音声計算部は、周波数領域信
号の振幅成分を用いて第２の信号対雑音比を計算して出
力する第２の信号対雑音比計算部と、この第２の信号対
雑音比計算部から入力された第２の信号対雑音比を非線
形関数によって処理して重みを求め出力する非線形処理
部と、この非線形処理部から入力された重みを用いて周
波数領域信号の振幅成分を重みづけし，推定雑音計算部
に出力する第２の乗算部とを含む構成としてもよい。The noise removing apparatus described above may further comprise a weighted degraded speech calculation unit that weights the amplitude component of the frequency-domain signal and outputs the resulting weighted amplitude component to the estimated noise calculation unit, so that the estimated noise calculation unit estimates the second noise from the weighted amplitude component. Here, the weighted degraded speech calculation unit may include a second signal-to-noise ratio calculation unit that calculates a second signal-to-noise ratio from the amplitude component of the frequency-domain signal and outputs it, a nonlinear processing unit that processes the second signal-to-noise ratio supplied from the second signal-to-noise ratio calculation unit with a nonlinear function to obtain a weight and outputs it, and a second multiplication unit that weights the amplitude component of the frequency-domain signal with the weight supplied from the nonlinear processing unit and outputs the result to the estimated noise calculation unit.

【００７３】また、上述したノイズ除去装置は、抑圧係
数生成部から入力された抑圧係数を，周波数領域信号に
基づいて補正して第１の乗算部に出力し，第１の乗算部
に補正した抑圧係数を用いて周波数領域信号の振幅成分
を重みづけさせる抑圧係数補正部を更に具備するもので
あってもよい。The noise removing apparatus described above may further comprise a suppression coefficient correction unit that corrects the suppression coefficient supplied from the suppression coefficient generation unit based on the frequency-domain signal and outputs it to the first multiplication unit, so that the first multiplication unit weights the amplitude component of the frequency-domain signal with the corrected suppression coefficient.

【００７４】また、本発明のノイズ除去装置は、入力信
号に窓がけ処理を施して出力する第１の窓がけ処理部
と、この第１の窓がけ処理部により窓がけ処理された入
力信号を周波数領域信号に変換し，振幅成分と位相成分
に分離して出力する変換部と、周波数領域信号の振幅成
分を用いて第１の信号対雑音比を求めて出力する第１の
信号対雑音比計算部と、周波数領域信号の振幅成分に基
づいて周波数領域信号に含まれる雑音を推定して出力す
る推定雑音計算部と、雑音と周波数領域信号の振幅成分
を用いて第１の信号対雑音比を補正し，補正信号対雑音
比として出力する信号対雑音比補正部と、補正信号対雑
音比に基づいて抑圧係数を定めて出力する抑圧係数生成
部と、抑圧係数を用いて周波数領域信号の振幅成分を重
みづけして出力する第１の乗算部と、この第１の乗算部
により重みづけされた周波数領域信号の振幅成分と周波
数領域信号の位相成分を時間領域信号に変換して出力す
る逆変換部と、時間領域信号に窓がけ処理を施す第２の
窓がけ処理部とを少なくとも具備することを特徴とす
る。Further, the noise removing apparatus of the present invention comprises at least: a first windowing processing unit that applies windowing to an input signal and outputs the result; a conversion unit that converts the input signal windowed by the first windowing processing unit into a frequency-domain signal and outputs its amplitude component and phase component separately; a first signal-to-noise ratio calculation unit that obtains a first signal-to-noise ratio from the amplitude component of the frequency-domain signal and outputs it; an estimated noise calculation unit that estimates noise contained in the frequency-domain signal from the amplitude component of the frequency-domain signal and outputs it; a signal-to-noise ratio correction unit that corrects the first signal-to-noise ratio using the noise and the amplitude component of the frequency-domain signal and outputs the result as a corrected signal-to-noise ratio; a suppression coefficient generation unit that determines a suppression coefficient based on the corrected signal-to-noise ratio and outputs it; a first multiplication unit that weights the amplitude component of the frequency-domain signal with the suppression coefficient and outputs the result; an inverse transformation unit that converts the amplitude component weighted by the first multiplication unit and the phase component of the frequency-domain signal into a time-domain signal and outputs it; and a second windowing processing unit that applies windowing to the time-domain signal.

【００７５】ここで、信号対雑音比補正部は、入力信号
が入力され，入力信号の振幅がゼロとなるゼロ交叉の数
を計算し，その計算結果に応じた制御信号を出力する判
定部と、この判定部から入力された制御信号によって補
正信号対雑音比を選択的に補正前の第１の信号対雑音比
と同じ値に設定するスイッチとを含む構成としてもよ
い。また、信号対雑音比補正部は、変換部から入力され
た周波数領域信号の振幅成分の高域電力を計算し，その
計算結果に応じた制御信号を出力する判定部と、この判
定部から入力された制御信号によって補正信号対雑音比
を選択的に補正前の第１の信号対雑音比と同じ値に設定
するスイッチとを含む構成としてもよい。Here, the signal-to-noise ratio correction unit may include a determination unit that receives the input signal, counts the zero crossings at which the amplitude of the input signal becomes zero, and outputs a control signal according to the count, and a switch that, according to the control signal supplied from the determination unit, selectively sets the corrected signal-to-noise ratio to the same value as the uncorrected first signal-to-noise ratio. Alternatively, the signal-to-noise ratio correction unit may include a determination unit that calculates the high-band power of the amplitude component of the frequency-domain signal supplied from the conversion unit and outputs a control signal according to the result, and a switch that, according to the control signal supplied from the determination unit, selectively sets the corrected signal-to-noise ratio to the same value as the uncorrected first signal-to-noise ratio.

【００７６】また、上述したノイズ除去装置は、周波数
領域信号の振幅成分を重みづけし，得られた重みつき振
幅成分を推定雑音計算部に出力し，推定雑音計算部に重
みつき振幅成分に基づいて雑音を推定させる重みつき劣
化音声計算部を更に具備するものであってもよい。ここ
で、重みつき劣化音声計算部は、周波数領域信号の振幅
成分を用いて第２の信号対雑音比を計算して出力する第
２の信号対雑音比計算部と、この第２の信号対雑音比計
算部から入力された第２の信号対雑音比を非線形関数に
よって処理して重みを求め出力する非線形処理部と、こ
の非線形処理部から入力された重みを用いて周波数領域
信号の振幅成分を重みづけし，推定雑音計算部に出力す
る第２の乗算部とを含む構成としてもよい。The noise removing apparatus described above may further comprise a weighted degraded speech calculation unit that weights the amplitude component of the frequency-domain signal and outputs the resulting weighted amplitude component to the estimated noise calculation unit, so that the estimated noise calculation unit estimates the noise from the weighted amplitude component. Here, the weighted degraded speech calculation unit may include a second signal-to-noise ratio calculation unit that calculates a second signal-to-noise ratio from the amplitude component of the frequency-domain signal and outputs it, a nonlinear processing unit that processes the second signal-to-noise ratio supplied from the second signal-to-noise ratio calculation unit with a nonlinear function to obtain a weight and outputs it, and a second multiplication unit that weights the amplitude component of the frequency-domain signal with the weight supplied from the nonlinear processing unit and outputs the result to the estimated noise calculation unit.

【００７７】また、上述したノイズ除去装置は、抑圧係
数生成部から入力された抑圧係数を，周波数領域信号に
基づいて補正して第１の乗算部に出力し、第１の乗算部
に補正した抑圧係数を用いて周波数領域信号の振幅成分
を重みづけさせる抑圧係数補正部を更に具備するもので
あってもよい。The noise removing apparatus described above may further comprise a suppression coefficient correction unit that corrects the suppression coefficient supplied from the suppression coefficient generation unit based on the frequency-domain signal and outputs it to the first multiplication unit, so that the first multiplication unit weights the amplitude component of the frequency-domain signal with the corrected suppression coefficient.

【００７８】[0078]

【発明の実施の形態】以下、図面を参照して、本発明の
実施の形態について詳細に説明する。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described in detail below with reference to the drawings.

【００７９】（第１の実施の形態）図１は、本発明のノ
イズ除去装置の第１の実施の形態の全体構成を示すブロ
ック図である。このノイズ除去装置と、図４８に示した
従来のノイズ除去装置とは、窓がけ処理部２２、注入雑
音計算部５５、加算器５６，５７を除いて同一である。
この同一部分については同一符号を付している。以下、
上述の相違点を中心に詳細に説明する。(First Embodiment) FIG. 1 is a block diagram showing the overall configuration of a first embodiment of the noise removing apparatus of the present invention. This apparatus is identical to the conventional noise removing apparatus shown in FIG. 48 except for the windowing processing unit 22, the injection noise calculation unit 55, and the adders 56 and 57. Identical parts carry the same reference numerals. The description below focuses on these differences.

【００８０】窓がけ処理部２２は、逆フーリエ変換部９
から供給された時間領域サンプル値系列ｘ_n(ｔ）バーに
窓関数ｈ（ｔ）を乗算し、積であるｈ（ｔ）ｘ_n(ｔ）バ
ーをフレーム合成部１０に伝達する。フレーム合成部１
０は、ｈ（ｔ）ｘ_n(ｔ）バーの隣接する２フレームから
Ｋ／２サンプルずつを取り出して重ね合わせ、式（１
４）によって、強調音声ｘ_n(ｔ）ハット（ｔ＝０，
１，....，Ｋ／２−１）を得る。得られた強調音声ｘ
_n(ｔ）ハットが、フレーム合成部１０の出力として、出
力端子１２に伝達される。The windowing processing unit 22 multiplies the time-domain sample sequence x̄_n(t) supplied from the inverse Fourier transform unit 9 by the window function h(t) and passes the product h(t)x̄_n(t) to the frame synthesis unit 10. The frame synthesis unit 10 takes K/2 samples from each of two adjacent frames of h(t)x̄_n(t), overlaps them, and obtains the enhanced speech x̂_n(t) (t = 0, 1, ..., K/2-1) by equation (14). The resulting enhanced speech x̂_n(t) is passed to the output terminal 12 as the output of the frame synthesis unit 10.

【００８１】[0081]

【数１４】 [Equation 14]

【００８２】オーバラップが、５０％ではなく、Ｍサン
プルで、フレーム長がＬサンプル（Ｍ＜Ｌ）の場合は、
式（１５）によって、強調音声ｘ_n(ｔ）ハットを得る。
これに合わせて、フレーム分割部も修正する。When the overlap is M samples rather than 50% and the frame length is L samples (M < L), the enhanced speech x̂_n(t) is obtained by equation (15). The frame division unit is modified accordingly.

【００８３】[0083]

【数１５】 [Equation 15]

【００８４】すでに述べたように、実数信号に対して
は、左右対称窓関数が用いられる。また、窓関数は、抑
圧係数を１に設定したときの入力信号と出力信号が計算
誤差を除いて一致するように設計される。これらの条件
を満たすいかなる窓関数であっても、ｗ（ｔ）、ｈ
（ｔ）として使用することができる。その一例として、
ハニング窓を開平した関数（ルートハニング窓）を挙げ
ることができる。他にもこれらの条件を満たす窓関数は
存在するが、詳細は省略する。隣接する２フレームを構
成するｘ_n-1(ｔ）バーとｘ_n(ｔ）バーが各フレームにお
いて異なった抑圧係数値で抑圧されたとしても、ｘ
_n-1(ｔ）バーとｘ_n(ｔ）バーのそれぞれに上述した窓関
数ｈ（ｔ）を乗算してフレーム境界におけるｘ_n-1(ｔ）
バーとｘ_n(ｔ）バーの振幅を小さくすることによって、
フレーム境界における連続性を改善し、雑音の発生を低
減することができる。よって、雑音による音質劣化を抑
制し、優れた音質の強調音声を得ることができる。As already noted, a symmetric window function is used for real-valued signals. The window function is also designed so that, with the suppression coefficient set to 1, the input and output signals agree apart from computational error. Any window functions satisfying these conditions can be used as w(t) and h(t). One example is the square root of the Hanning window (root Hanning window). Other window functions satisfy these conditions as well, but the details are omitted. Even when x̄_n-1(t) and x̄_n(t), which form two adjacent frames, have been suppressed with different suppression coefficient values, multiplying each of them by the window function h(t) reduces their amplitudes at the frame boundary. This improves continuity at the frame boundary and reduces the generation of noise. Degradation of sound quality due to such noise is therefore suppressed, and enhanced speech of excellent quality is obtained.
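The reconstruction condition can be checked numerically with a minimal Python sketch. It is not the specification's equation (14), which is not reproduced in this text. It assumes a periodic root Hanning window sin(πt/K) for both w(t) and h(t), a 50% overlap, and a scalar per-frame gain standing in for the Fourier transform, suppression, and inverse transform. The names `root_hann`, `overlap_add`, and `frame_gains` are hypothetical.

```python
import math


def root_hann(K):
    """Square root of the periodic Hanning window, which equals sin(pi*t/K)."""
    return [math.sin(math.pi * t / K) for t in range(K)]


def overlap_add(signal, K, frame_gains=None):
    """Window, scale, window again, and overlap-add frames with 50% overlap.

    frame_gains[n] is a stand-in for the suppression applied to frame n;
    None means a gain of 1 for every frame.
    """
    hop = K // 2
    w = root_hann(K)  # first window w(t), applied before the transform
    h = root_hann(K)  # second window h(t), applied after the inverse transform
    out = [0.0] * len(signal)
    for n, start in enumerate(range(0, len(signal) - K + 1, hop)):
        gain = 1.0 if frame_gains is None else frame_gains[n]
        for t in range(K):
            out[start + t] += h[t] * gain * w[t] * signal[start + t]
    return out
```

With all gains equal to 1, every sample covered by two frames is reproduced, because sin²(πt/K) + sin²(π(t + K/2)/K) = 1. When adjacent frames use different gains, the output in the overlap region is a smooth blend of the two gains rather than a step at the frame boundary.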

【００８５】注入雑音計算部５５は、それぞれ多重乗算
部１７及び推定雑音計算部５１から供給された劣化音声
パワースペクトル及び推定雑音パワースペクトルを用い
て、注入すべき擬似的な雑音（第１の雑音）を計算し、
加算器５６及び５７に伝達する。加算器５６は、推定雑
音計算部５１から供給された推定雑音パワースペクトル
に注入雑音計算部５５で得られた注入雑音を加算し、そ
の和を周波数別ＳＮＲ計算部６に伝達する。加算器５７
は、多重乗算部１７から供給された劣化音声パワースペ
クトルに注入雑音計算部５５で得られた注入雑音を加算
し、その和を周波数別ＳＮＲ計算部６に伝達する。The injection noise calculation unit 55 uses the degraded speech power spectrum supplied from the multiplex multiplication unit 17 and the estimated noise power spectrum supplied from the estimated noise calculation unit 51 to calculate the pseudo noise to be injected (the first noise), and passes it to the adders 56 and 57. The adder 56 adds the injection noise obtained by the injection noise calculation unit 55 to the estimated noise power spectrum supplied from the estimated noise calculation unit 51 and passes the sum to the frequency-specific SNR calculation unit 6. The adder 57 adds the same injection noise to the degraded speech power spectrum supplied from the multiplex multiplication unit 17 and likewise passes the sum to the frequency-specific SNR calculation unit 6.

【００８６】図２は、注入雑音計算部５５の構成を示す
ブロック図である。注入雑音計算部５５は、ＳＮＲ計算
部５５１、しきい値計算部５５２、注入レベル計算部５
５３を有する。図１における多重乗算部１７から供給さ
れた劣化音声パワースペクトルは、ＳＮＲ計算部５５１
に伝達される。図１における推定雑音計算部５１から供
給された推定雑音パワースペクトルは、ＳＮＲ計算部５
５１及びしきい値計算部５５２に伝達される。ＳＮＲ計
算部５５１で得られたＳＮＲとしきい値計算部５５２で
得られたしきい値は、注入レベル計算部５５３に供給さ
れる。注入レベル計算部５５３では、供給されたＳＮＲ
としきい値に応じて、注入すべき雑音レベルを計算し、
そのレベルに対応した信号を注入雑音として出力する。FIG. 2 is a block diagram showing the configuration of the injection noise calculation unit 55. The injection noise calculation unit 55 has an SNR calculation unit 551, a threshold calculation unit 552, and an injection level calculation unit 553. The degraded speech power spectrum supplied from the multiplex multiplication unit 17 in FIG. 1 is passed to the SNR calculation unit 551. The estimated noise power spectrum supplied from the estimated noise calculation unit 51 in FIG. 1 is passed to the SNR calculation unit 551 and the threshold calculation unit 552. The SNR obtained by the SNR calculation unit 551 and the threshold obtained by the threshold calculation unit 552 are supplied to the injection level calculation unit 553, which calculates the noise level to be injected according to the supplied SNR and threshold and outputs a signal corresponding to that level as the injection noise.

【００８７】注入すべき雑音をＷ_n(ｋ）とすれば、Ｗ
_n(ｋ）はＳＮＲが大きいほど小さい値をとるように設定
される。このようなＳＮＲとＷ_n(ｋ）の関係として、Ｓ
ＮＲが第１のしきい値ＴＨ₁よりも大きいときに第１の
値Ｗ₁をとり、ＳＮＲが第２のしきい値ＴＨ₂（＜ＴＨ
₁）よりも小さいときに第２の値Ｗ₂（＞Ｗ₁）をと
り、ＳＮＲが第１のしきい値ＴＨ₁と第２のしきい値Ｔ
Ｈ₂の中間の値をとるときには、ＳＮＲに対応してＷ
_n(ｋ）が小さくなるような関数を考えることができる。
最も簡単な例は、図３に示すように、ＳＮＲが第１のし
きい値ＴＨ₁と第２のしきい値ＴＨ₂の中間の値をとる
ときには、第１の値Ｗ₁から第２の値Ｗ₂まで、直線的
に変化する関数である。If the noise to be injected is denoted W_n(k), W_n(k) is set so that it takes a smaller value as the SNR increases. One such relation between the SNR and W_n(k) is a function that takes a first value W₁ when the SNR is larger than a first threshold TH₁, takes a second value W₂ (> W₁) when the SNR is smaller than a second threshold TH₂ (< TH₁), and decreases with increasing SNR when the SNR lies between TH₁ and TH₂. The simplest example, shown in FIG. 3, is a function that changes linearly between the first value W₁ and the second value W₂ while the SNR lies between the first threshold TH₁ and the second threshold TH₂.
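The piecewise-linear relation described for FIG. 3 can be written as a short Python sketch. The function and parameter names are hypothetical, and the text does not say whether the SNR is on a linear or logarithmic scale, so the sketch treats it as a plain number.

```python
def injection_level(snr, th1, th2, w1, w2):
    """Injection noise level W as a function of SNR.

    Returns w1 for snr >= th1, w2 for snr <= th2, and a straight line
    between the two values in between. Requires th2 < th1 and w1 < w2,
    so that W decreases as the SNR increases.
    """
    if not (th2 < th1 and w1 < w2):
        raise ValueError("expected th2 < th1 and w1 < w2")
    if snr >= th1:
        return w1
    if snr <= th2:
        return w2
    fraction = (snr - th2) / (th1 - th2)  # 0 at th2, 1 at th1
    return w2 + fraction * (w1 - w2)
```

For example, with th1 = 8, th2 = 2, w1 = 0.1, and w2 = 0.7, an SNR of 5 lies halfway between the thresholds and yields a level halfway between the two values.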

【００８８】第１と第２のしきい値ＴＨ₁，ＴＨ₂は独
立に決定することができるが、第２のしきい値ＴＨ₂を
第１のしきい値ＴＨ₁の定数倍に設定し、計算の簡略化
をはかることもできる。同様に、独立に決定することが
できるＷ_n(ｋ）の第１と第２の値Ｗ₁，Ｗ₂も第２の値
Ｗ₂を第１の値Ｗ₁の定数倍に設定することができる。
また、Ｗ_n(ｋ）の第１と第２の値Ｗ₁，Ｗ₂は、推定雑
音のレベルに対応して決定することができる。推定雑音
レベルが高い時はＷ_n(ｋ）の第１と第２の値Ｗ ₁，Ｗ₂
を小さくし、低い時は大きくする。このようにＷ_n(ｋ）
の第１と第２の値Ｗ₁，Ｗ₂を設定することで、同じＳ
ＮＲの値に対して、推定雑音レベルが高い時ほど容易に
小さなＷ_n(ｋ）が設定できる。この場合、注入レベル計
算部５５３に推定雑音パワースペクトルを供給する構成
とすることは、言うまでもない。The first and second thresholds TH₁ and TH₂ can be determined independently, but the second threshold TH₂ may also be set to a constant multiple of the first threshold TH₁ to simplify the calculation. Similarly, the first and second values W₁ and W₂ of W_n(k), which can be determined independently, may be set so that the second value W₂ is a constant multiple of the first value W₁. The values W₁ and W₂ can also be determined according to the estimated noise level: they are made smaller when the estimated noise level is high and larger when it is low. Setting W₁ and W₂ in this way makes it easy to obtain a smaller W_n(k) for the same SNR value as the estimated noise level rises. In this case, needless to say, the estimated noise power spectrum is also supplied to the injection level calculation unit 553.

【００８９】さらに、しきい値ＴＨ₁，ＴＨ₂も、推定
雑音のレベルに対応して決定することができる。推定雑
音レベルが高い時はしきい値ＴＨ₁，ＴＨ₂を小さく
し、低い時は大きくする。このようにしきい値ＴＨ₁，
ＴＨ₂を設定することで、同じＳＮＲの値に対して、推
定雑音レベルが高い時ほど容易に小さなＷ_n(ｋ）が設定
できる。推定雑音レベルが高い時ほどＷ_n(ｋ）を小さく
する理由は、推定雑音レベルが高い時には、従来の抑圧
係数がほぼ適切であり、雑音注入による抑圧係数の補正
量が小さいからである。この結果、本来の抑圧量が小さ
く、残留する雑音が知覚されやすいときに、中程度の振
幅を有した成分を相対的に大きく抑圧することができ、
主観音質の改善を達成することができる。Furthermore, the thresholds TH₁ and TH₂ can also be determined according to the estimated noise level: they are made smaller when the estimated noise level is high and larger when it is low. Setting the thresholds in this way makes it easy to obtain a smaller W_n(k) for the same SNR value as the estimated noise level rises. W_n(k) is made smaller at higher estimated noise levels because the conventional suppression coefficient is then already nearly appropriate, so only a small correction of the suppression coefficient by noise injection is needed. As a result, when the original amount of suppression is small and residual noise is easily perceived, components of moderate amplitude can be suppressed relatively strongly, which improves subjective sound quality.
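One way to make the thresholds and levels depend on the estimated noise level is sketched below in Python. The text states only the direction of the dependence (smaller values at higher noise levels) and that TH₂ and W₂ may be constant multiples of TH₁ and W₁. The scaling formula, every default value, and all names in this sketch are assumptions for illustration only.

```python
def injection_parameters(noise_level, base_th1=8.0, th_ratio=0.25,
                         base_w1=0.1, w_ratio=4.0, ref_level=1.0):
    """Return (th1, th2, w1, w2) that shrink as the estimated noise level rises.

    ASSUMPTION: scale = ref_level / (ref_level + noise_level) is a
    hypothetical monotonically decreasing mapping. The source gives no
    formula for this dependence.
    """
    if noise_level < 0.0:
        raise ValueError("noise_level must be non-negative")
    scale = ref_level / (ref_level + noise_level)
    th1 = base_th1 * scale
    th2 = th_ratio * th1  # constant multiple of th1, with th_ratio < 1
    w1 = base_w1 * scale
    w2 = w_ratio * w1     # constant multiple of w1, with w_ratio > 1
    return th1, th2, w1, w2
```

Tying TH₂ to TH₁ and W₂ to W₁ by fixed ratios leaves one threshold and one level to compute per update, which is the simplification the text mentions.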

【００９０】これまでの説明では、注入すべき雑音をＷ
_n(ｋ）としており、各周波数成分に対して異なった雑音
を注入する例について説明した。実際、注入雑音計算部
５５に供給される劣化音声パワースペクトル及び推定雑
音パワースペクトルは、全周波数成分に対応した値が多
重化されている。従って、ＳＮＲ計算部５５１で得られ
たＳＮＲとしきい値計算部５５２で得られたしきい値の
数は、周波数成分の数に対応している。しかし、これら
のＳＮＲとしきい値を、すべての周波数成分に対して共
通に設定しても良い。In the description so far, the noise to be injected has been written W_n(k), and an example in which different noise is injected into each frequency component has been described. Indeed, the degraded speech power spectrum and the estimated noise power spectrum supplied to the injection noise calculation unit 55 carry multiplexed values for all frequency components, so the numbers of SNRs obtained by the SNR calculation unit 551 and of thresholds obtained by the threshold calculation unit 552 correspond to the number of frequency components. However, the SNR and the threshold may instead be set in common for all frequency components.

【００９１】一例として、劣化音声パワースペクトル及
び推定雑音パワースペクトルを、全周波数成分に対して
加算して総和をとり、それらの比を共通ＳＮＲとし、ま
た、推定雑音パワースペクトルの平均値を用いてしきい
値を求めることができる。その際には、ＳＮＲ計算部５
５１及びしきい値計算部５５２では、各周波数成分に対
応した値を分離してから個々の値を用いてＳＮＲとしき
い値を計算する代わりに、前記総和と平均値を用いて、
全周波数成分に対して共通のＳＮＲとしきい値を計算す
ることになる。これらの値が、周波数別ＳＮＲ計算部６
に伝達される。As an example, the degraded speech power spectrum and the estimated noise power spectrum can each be summed over all frequency components, the ratio of the two sums can be used as a common SNR, and the threshold can be obtained from the average value of the estimated noise power spectrum. In that case, instead of separating the values for the individual frequency components and calculating an SNR and a threshold from each of them, the SNR calculation unit 551 and the threshold calculation unit 552 use the sums and the average value to calculate an SNR and a threshold common to all frequency components. These values are passed to the frequency-specific SNR calculation unit 6.
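The common SNR and threshold can be sketched in Python as follows. The ratio of sums and the use of the mean noise power follow the paragraph above. The mapping from mean noise power to a threshold is not given in the text, so it is passed in as a hypothetical callable `threshold_from_level`.

```python
def common_snr_and_threshold(speech_power, noise_power, threshold_from_level):
    """Compute one SNR and one threshold shared by all frequency components.

    speech_power and noise_power hold one power value per frequency bin.
    threshold_from_level is a caller-supplied (hypothetical) function that
    maps the mean estimated noise power to a threshold.
    """
    if len(speech_power) != len(noise_power) or not noise_power:
        raise ValueError("spectra must be non-empty and of equal length")
    noise_sum = sum(noise_power)
    if noise_sum <= 0.0:
        raise ValueError("total noise power must be positive")
    snr = sum(speech_power) / noise_sum          # ratio of the two sums
    mean_noise = noise_sum / len(noise_power)    # average noise power
    return snr, threshold_from_level(mean_noise)
```

Compared with the per-frequency calculation, this needs one division and one threshold evaluation per frame instead of one per frequency component.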

【００９２】周波数別ＳＮＲ計算部６では、式（１１）
の代わりに、式（１６）によって、周波数別ＳＮＲγ
_n(ｋ）を計算する。The frequency-specific SNR calculation unit 6 calculates the frequency-specific SNR γ_n(k) by equation (16) instead of equation (11).

【００９３】[0093]

【数１６】 [Equation 16]

【００９４】式（１６）を参照すると、ＳＮＲ＞０の領
域では、｜Ｙ_n(ｋ）｜² ＞λ_n(ｋ）なので、雑音注入時
のＳＮＲγ_n(ｋ）は本来の値よりも小さくなるように修
正される。一方、文献１を参照すると、ＳＮＲに対する
抑圧係数の特性は、図４に示すように、ＳＮＲに対応し
て漸増した後、あるＳＮＲの値において急増し、再び漸
増から飽和をたどる。このため、雑音注入によってγ
_n(ｋ）の値が小さくなると、上記抑圧係数値が急変する
近傍のＳＮＲに対して、相対的に抑圧係数減少効果が大
きくなる。従って、そのようなＳＮＲに対応した周波数
成分、具体的には中程度の振幅を有した成分が、相対的
に大きく抑圧されることになる。このため、音声よりは
振幅が小さいが無視できない程度の背景雑音の一部がよ
り強く抑圧され、強調音声において雑音として知覚され
にくくなる。よって、実際の背景雑音に対して、十分高
い品質の強調音声を得ることができる。Referring to equation (16), in the region where SNR > 0 we have |Y_n(k)|² > λ_n(k), so the SNR γ_n(k) with noise injection is corrected to a value smaller than its original value. Meanwhile, according to Reference 1, the suppression coefficient as a function of the SNR, as shown in FIG. 4, first increases gradually with the SNR, then rises sharply at a certain SNR value, and finally increases gradually again until it saturates. Consequently, when noise injection reduces the value of γ_n(k), the reduction of the suppression coefficient is relatively large for SNRs near the point where the suppression coefficient changes sharply. Frequency components corresponding to such SNRs, specifically components of moderate amplitude, are therefore suppressed relatively strongly. Part of the background noise whose amplitude is smaller than that of speech but not negligible is thus suppressed more strongly and becomes less perceptible as noise in the enhanced speech. Enhanced speech of sufficiently high quality is therefore obtained in the presence of actual background noise.
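The effect of noise injection on the SNR can be checked with a small Python sketch. Equation (16) is not reproduced in this text. The form used here, (|Y_n(k)|² + W_n(k)) / (λ_n(k) + W_n(k)), is inferred from the description of the adders 56 and 57 and should be read as an assumption, as should the function name.

```python
def snr_with_injection(speech_power, noise_power, injected):
    """SNR after adding the same injected noise power to both terms.

    ASSUMED form: (speech_power + injected) / (noise_power + injected),
    inferred from the adder description rather than quoted from equation (16).
    """
    return (speech_power + injected) / (noise_power + injected)


# Where the speech power exceeds the noise power, injection lowers the ratio.
original = 4.0 / 1.0
corrected = snr_with_injection(4.0, 1.0, 1.0)  # (4 + 1) / (1 + 1) = 2.5
print(original, corrected)
```

For W > 0, the inequality (Y + W)/(λ + W) < Y/λ reduces to λ < Y, which matches the statement that the corrected SNR is smaller than the original wherever |Y_n(k)|² > λ_n(k).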

【００９５】（第２の実施の形態）図５は、本発明のノ
イズ除去装置の第２の実施の形態の全体構成を示すブロ
ック図である。このノイズ除去装置は、図１に示したノ
イズ除去装置が具備する注入雑音計算部５５、加算器５
６，５７の代わりに、ＳＮＲ補正部６５を具備するもの
である。以下、これらの相違点を中心に詳細に説明す
る。(Second Embodiment) FIG. 5 is a block diagram showing the overall configuration of a second embodiment of the noise removing apparatus of the present invention. This apparatus has an SNR correction unit 65 in place of the injection noise calculation unit 55 and the adders 56 and 57 of the noise removing apparatus shown in FIG. 1. The description below focuses on these differences.

【００９６】ＳＮＲ補正部６５には、多重乗算部１７、
推定雑音計算部５１、及び周波数別ＳＮＲ計算部６か
ら、それぞれ劣化音声パワースペクトル、推定雑音パワ
ースペクトル、及び後天的ＳＮＲが供給されている。Ｓ
ＮＲ補正部６５からは、補正後天的ＳＮＲが推定先天的
ＳＮＲ計算部７及び雑音抑圧係数生成部８に供給され
る。すなわち、図１に示したノイズ除去装置では、雑音
を注入した劣化音声パワースペクトルと雑音を注入した
推定雑音パワースペクトルを用いて、後天的ＳＮＲを計
算していたのに対して、図５に示したノイズ除去装置で
は、劣化音声パワースペクトルと推定雑音パワースペク
トルを用いて計算した注入雑音を用いて、計算した後天
的ＳＮＲを補正する。The SNR correction unit 65 is supplied with the degraded speech power spectrum, the estimated noise power spectrum, and the a posteriori SNR from the multiplex multiplication unit 17, the estimated noise calculation unit 51, and the frequency-specific SNR calculation unit 6, respectively. The SNR correction unit 65 supplies the corrected a posteriori SNR to the estimated a priori SNR calculation unit 7 and the noise suppression coefficient generation unit 8. That is, whereas the noise removing apparatus shown in FIG. 1 calculates the a posteriori SNR from the noise-injected degraded speech power spectrum and the noise-injected estimated noise power spectrum, the noise removing apparatus shown in FIG. 5 corrects an already calculated a posteriori SNR using injection noise calculated from the degraded speech power spectrum and the estimated noise power spectrum.

【００９７】図５におけるＳＮＲ補正部６５について、
さらに説明する。図６は、ＳＮＲ補正部６５の一構成例
を示すブロック図である。ＳＮＲ補正部６５は、Ｋ個の
補正ＳＮＲ計算部６５４₀ 〜６５４_K-1 、分離部６５
１、６５２、６５３、多重化部６５５を有する。分離部
６５１には、図５における周波数別ＳＮＲ計算部６から
後天的ＳＮＲが供給される。分離部６５１は、供給され
た後天的ＳＮＲをＫ個の周波数別成分に分離し、それぞ
れ補正ＳＮＲ計算部６５４₀ 〜６５４_K-1 に伝達する。
分離部６５２には、図５における多重乗算部１７から劣
化音声パワースペクトルが供給される。分離部６５２
は、供給された劣化音声パワースペクトルをＫ個の周波
数別成分に分離し、それぞれ補正ＳＮＲ計算部６５４₀
〜６５４_K-1 に伝達する。分離部６５３には、図５にお
ける推定雑音計算部５１から推定雑音パワースペクトル
が供給される。分離部６５３は、供給された推定雑音パ
ワースペクトルをＫ個の周波数別成分に分離し、それぞ
れ補正ＳＮＲ計算部６５４₀ 〜６５４_K-1 に伝達する。
補正ＳＮＲ計算部６５４₀ 〜６５４_K-1 は、供給された
劣化音声パワースペクトルと推定雑音パワースペクトル
に対応した補正を後天的ＳＮＲに加え、補正後天的ＳＮ
Ｒを多重化部６５５に伝達する。多重化部６５５は、供
給された補正後天的ＳＮＲを多重化して出力する。The SNR correction unit 65 in FIG. 5 is described further. FIG. 6 is a block diagram showing one configuration example of the SNR correction unit 65. The SNR correction unit 65 has K corrected SNR calculation units 654₀ to 654_K-1, separation units 651, 652, and 653, and a multiplexing unit 655. The separation unit 651 receives the a posteriori SNR from the frequency-specific SNR calculation unit 6 in FIG. 5, separates it into K frequency components, and passes them to the corrected SNR calculation units 654₀ to 654_K-1, respectively. The separation unit 652 receives the degraded speech power spectrum from the multiplex multiplication unit 17 in FIG. 5, separates it into K frequency components, and passes them to the corrected SNR calculation units 654₀ to 654_K-1, respectively. The separation unit 653 receives the estimated noise power spectrum from the estimated noise calculation unit 51 in FIG. 5, separates it into K frequency components, and passes them to the corrected SNR calculation units 654₀ to 654_K-1, respectively. The corrected SNR calculation units 654₀ to 654_K-1 apply to the a posteriori SNR a correction corresponding to the supplied degraded speech power spectrum and estimated noise power spectrum, and pass the corrected a posteriori SNR to the multiplexing unit 655. The multiplexing unit 655 multiplexes the supplied corrected a posteriori SNRs and outputs the result.

【００９８】図７は、図６に示したＳＮＲ補正部６５に
含まれる補正ＳＮＲ計算部６５４₀〜６５４_K-1 の構成
を示すブロック図である。補正ＳＮＲ計算部６５４は、
しきい値計算部６５４１、注入雑音計算部６５４２、加
算器６５４３，６５４４、除算部６５４５を有する。FIG. 7 is a block diagram showing the configuration of the corrected SNR calculation units 654₀ to 654_K-1 included in the SNR correction unit 65 shown in FIG. 6. The corrected SNR calculation unit 654 has a threshold calculation unit 6541, an injection noise calculation unit 6542, adders 6543 and 6544, and a division unit 6545.

【００９９】しきい値計算部６５４１には、図６におけ
る分離部６５３から推定雑音パワースペクトルが供給さ
れており、図２におけるしきい値計算部５５２と同様の
動作によってしきい値を計算し、注入雑音計算部６５４
２に伝達する。注入雑音計算部６５４２には、図６にお
ける分離部６５１から後天的ＳＮＲも供給されており、
図２における注入レベル計算部５５３と同様の動作によ
って注入すべき擬似的な雑音（第１の雑音，加算信号）
を計算し、加算器６５４３及び６５４４に伝達する。加
算器６５４３には、図６における分離部６５３から推定
雑音パワースペクトルも供給されており、注入雑音計算
部６５４２から供給された雑音との加算結果を除算部６
５４５に伝達する。加算器６５４４には、図６における
分離部６５２から劣化音声パワースペクトルも供給され
ており、注入雑音計算部６５４２から供給された雑音と
の加算結果を除算部６５４５に伝達する。除算部６５４
５は、加算器６５４３の出力と加算器６５４４の出力か
ら求めた商を、補正後天的ＳＮＲとして出力する。The threshold calculation unit 6541 receives the estimated noise power spectrum from the separation unit 653 in FIG. 6, calculates a threshold by the same operation as the threshold calculation unit 552 in FIG. 2, and passes it to the injection noise calculation unit 6542. The injection noise calculation unit 6542 also receives the a posteriori SNR from the separation unit 651 in FIG. 6, calculates the pseudo noise to be injected (the first noise, or addition signal) by the same operation as the injection level calculation unit 553 in FIG. 2, and passes it to the adders 6543 and 6544. The adder 6543 also receives the estimated noise power spectrum from the separation unit 653 in FIG. 6 and passes its sum with the noise supplied from the injection noise calculation unit 6542 to the division unit 6545. The adder 6544 also receives the degraded speech power spectrum from the separation unit 652 in FIG. 6 and passes its sum with the noise supplied from the injection noise calculation unit 6542 to the division unit 6545. The division unit 6545 outputs the quotient obtained from the output of the adder 6543 and the output of the adder 6544 as the corrected a posteriori SNR.
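The data flow of FIGS. 6 and 7 can be sketched in Python as follows. The threshold calculation unit 6541 and the injection noise calculation unit 6542 are collapsed into one caller-supplied function `level_fn`, which is hypothetical. The direction of the quotient (speech sum divided by noise sum) is also an assumption, since the text says only that a quotient is formed from the two adder outputs.

```python
def corrected_snr_unit(snr, speech_power, noise_power, level_fn):
    """One corrected SNR calculation unit for a single frequency component.

    level_fn(snr, noise_power) stands in for units 6541 and 6542 and
    returns the injection noise power for this component (hypothetical).
    """
    injected = level_fn(snr, noise_power)
    noise_sum = noise_power + injected    # role of adder 6543
    speech_sum = speech_power + injected  # role of adder 6544
    # ASSUMPTION: the division unit 6545 forms speech_sum / noise_sum.
    return speech_sum / noise_sum


def correct_snr_per_bin(snrs, speech_powers, noise_powers, level_fn):
    """Run K independent units and collect their outputs in order.

    The list comprehension plays the roles of the separation units
    651 to 653 (per-component inputs) and the multiplexing unit 655.
    """
    return [corrected_snr_unit(g, y, lam, level_fn)
            for g, y, lam in zip(snrs, speech_powers, noise_powers)]
```

Because each unit evaluates `level_fn` on its own inputs, every frequency component can receive a different injection noise level.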

【０１００】図８は、ＳＮＲ補正部６５の他の構成例を
示すブロック図である。この構成例では、ＳＮＲとしき
い値を、すべての周波数成分に対して共通に設定してい
る。このため、図６に示した構成例と比較すると、新た
に平均値計算部６６１，６６３、注入雑音計算部６６２
を有し、また補正ＳＮＲ計算部６５４₀ 〜６５４_K-1を
置き換える形で補正ＳＮＲ計算部６６４₀ 〜６６４_K-1
を有している。FIG. 8 is a block diagram showing another configuration example of the SNR correction unit 65. In this example, the SNR and the threshold are set in common for all frequency components. Compared with the configuration example shown in FIG. 6, it therefore additionally has average value calculation units 661 and 663 and an injection noise calculation unit 662, and has corrected SNR calculation units 664₀ to 664_K-1 in place of the corrected SNR calculation units 654₀ to 654_K-1.

【０１０１】平均値計算部６６１は、分離部６５１から
供給された後天的ＳＮＲγ_n(ｋ）のｋに関する平均を求
め、注入雑音計算部６６２へ伝達する。従って、注入雑
音計算部６６２へ伝達される値は、一つとなる。一方、
平均値計算部６６３は、分離部６５３から供給された推
定雑音パワースペクトルλ_n(ｋ）のｋに関する平均を求
め、しきい値計算部６５４１へ伝達する。しきい値計算
部６５４１は、すでに説明した動作によってしきい値を
求め、注入雑音計算部６６２へ伝達する。注入雑音計算
部６６２は、図７における注入雑音計算部６５４２と同
じ手順で注入すべき擬似的な雑音（第１の雑音，加算信
号）を計算し、補正ＳＮＲ計算部６６４ ₀ 〜６６４_K-1
へ伝達する。図６に示した構成例と異なり、補正ＳＮＲ
計算部６６４₀ 〜６６４_K-1 へ伝達される注入雑音は、
すべて同じ値である。The average value calculation unit 661 obtains the average over k of the a posteriori SNR γ_n(k) supplied from the separation unit 651 and passes it to the injection noise calculation unit 662. A single value is therefore passed to the injection noise calculation unit 662. Meanwhile, the average value calculation unit 663 obtains the average over k of the estimated noise power spectrum λ_n(k) supplied from the separation unit 653 and passes it to the threshold calculation unit 6541. The threshold calculation unit 6541 obtains a threshold by the operation already described and passes it to the injection noise calculation unit 662. The injection noise calculation unit 662 calculates the pseudo noise to be injected (the first noise, or addition signal) by the same procedure as the injection noise calculation unit 6542 in FIG. 7 and passes it to the corrected SNR calculation units 664₀ to 664_K-1. Unlike the configuration example shown in FIG. 6, the injection noise passed to the corrected SNR calculation units 664₀ to 664_K-1 has the same value for all of them.

【０１０２】図９は、図８に示したＳＮＲ補正部６６に
含まれる補正ＳＮＲ計算部６６４₀〜６６４_K-1 の構成
を示すブロック図である。補正ＳＮＲ計算部６６４は、
注入雑音計算部６６２から供給された注入雑音を、推定
雑音パワースペクトル及び劣化音声パワースペクトルに
加算し、両者の商を求めてから、補正後天的ＳＮＲとし
て出力する。より具体的には、次のとおりである。すな
わち、注入雑音計算部６６２で計算された注入雑音は、
加算器６５４３及び６５４４に伝達される。加算器６５
４３には、図８における分離部６５３から推定雑音パワ
ースペクトルも供給されており、注入雑音計算部６６２
から供給された雑音との加算結果を除算部６５４５に伝
達する。加算器６５４４には、図８における分離部６５
２から劣化音声パワースペクトルも供給されており、注
入雑音計算部６５４２から供給された雑音との加算結果
を除算部６５４５に伝達する。除算部６５４５は、加算
器６５４３の出力と加算器６５４４の出力から求めた商
を、補正後天的ＳＮＲとして出力する。FIG. 9 is a block diagram showing the configuration of the corrected SNR calculation units 664₀ to 664_K-1 included in the SNR correction unit 66 shown in FIG. 8. The corrected SNR calculation unit 664 adds the injection noise supplied from the injection noise calculation unit 662 to the estimated noise power spectrum and to the degraded speech power spectrum, takes the quotient of the two sums, and outputs it as the corrected a posteriori SNR. More specifically, the injection noise calculated by the injection noise calculation unit 662 is passed to the adders 6543 and 6544. The adder 6543 also receives the estimated noise power spectrum from the separation unit 653 in FIG. 8 and passes its sum with the noise supplied from the injection noise calculation unit 662 to the division unit 6545. The adder 6544 also receives the degraded speech power spectrum from the separation unit 652 in FIG. 8 and passes its sum with the noise supplied from the injection noise calculation unit 6542 to the division unit 6545. The division unit 6545 outputs the quotient obtained from the output of the adder 6543 and the output of the adder 6544 as the corrected a posteriori SNR.

【０１０３】図８，図９に示した構成例では、補正ＳＮ
Ｒ計算部６６４₀ 〜６６４_K-1 に対して注入雑音計算部
６６２としきい値計算部６５４１を共通化することによ
って、補正ＳＮＲ計算部６６４₀ 〜６６４_K-1 のすべて
に注入雑音計算部としきい値計算部を設ける必要がなく
なるので、構成を簡素化することができる。In the configuration example shown in FIGS. 8 and 9, the injection noise calculation unit 662 and the threshold calculation unit 6541 are shared by the corrected SNR calculation units 664₀ to 664_K-1. It is therefore unnecessary to provide an injection noise calculation unit and a threshold calculation unit in every one of the corrected SNR calculation units 664₀ to 664_K-1, so the configuration can be simplified.

【０１０４】以上のようにしてＳＮＲ補正部６５，６６
で後天的ＳＮＲを補正し、その結果得られた補正後後天
的ＳＮＲを用いて抑圧係数を定めることによって、図１
に示したノイズ除去装置と同様に、実際の背景雑音に対
して十分高い品質の強調音声を得ることができる。As described above, the SNR correction units 65 and 66 correct the acquired SNR, and the suppression coefficient is determined using the resulting corrected acquired SNR. As with the noise eliminator shown in FIG. 1, this makes it possible to obtain emphasized speech of sufficiently high quality against actual background noise.

【０１０５】（第３の実施の形態）図１０は、本発明の
ノイズ除去装置の第３の実施の形態の全体構成を示すブ
ロック図である。このノイズ除去装置は、図１に示した
ノイズ除去装置において、注入雑音計算部５５を注入雑
音計算部５８で置換した構成になっている。以下、この
相違点を中心に詳細に説明する。図１０に示すノイズ除
去装置では、入力信号の性質に応じて、選択的に雑音注
入を適用する。このため、入力信号の性質を評価するた
めに、フレーム分割部１の出力である時間領域の劣化音
声信号が、注入雑音計算部５８に供給されている。(Third Embodiment) FIG. 10 is a block diagram showing the overall configuration of a third embodiment of the noise eliminator of the present invention. This noise removing device has a configuration in which the injection noise calculating unit 55 is replaced with an injection noise calculating unit 58 in the noise removing device shown in FIG. Hereinafter, this difference will be mainly described in detail. The noise eliminator shown in FIG. 10 selectively applies noise injection according to the nature of the input signal. Therefore, in order to evaluate the characteristics of the input signal, the time-domain degraded speech signal output from the frame division unit 1 is supplied to the injection noise calculation unit 58.

【０１０６】図１１は、図１０における注入雑音計算部
５８の構成を示すブロック図である。図２に示した注入
雑音計算部５５とは、ゼロ交叉計算部５８１とスイッチ
５８２をさらに具備する点が異なっている。フレーム分
割部１の出力である時間領域の劣化音声信号は、ゼロ交
叉計算部５８１に供給されている。ゼロ交叉計算部５８
１には、ＳＮＲ計算部５５１からＳＮＲが、しきい値計
算部５５２からしきい値が、それぞれ供給されている。
ゼロ交叉計算部５８１では、供給された劣化音声信号の
振幅がゼロとなるゼロ交叉を計数する。同時に、ＳＮＲ
としきい値から、ＳＮＲが前記第２のしきい値ＴＨ₂よ
り小さいか否かを評価する。ＳＮＲが前記第２のしきい
値ＴＨ₂より小さいときだけ、前記ゼロ交叉の数を過去
の数フレームに渡って平均化する。すなわち、劣化音声
が無音と判定したときだけ、平均値を求める。このよう
にして得られた平均値を第３のしきい値と比較し、平均
値の方が大きいときに“１”を、それ以外の場合は
“０”を、制御信号としてスイッチ５８２に伝達する。
第３のしきい値は、予め定めておくこともできるし、動
作途中で変更することもできる。FIG. 11 is a block diagram showing the configuration of the injection noise calculation unit 58 in FIG. 10. It differs from the injection noise calculation unit 55 shown in FIG. 2 in that it further includes a zero crossing calculation unit 581 and a switch 582. The time-domain deteriorated speech signal output from the frame division unit 1 is supplied to the zero crossing calculation unit 581. The zero crossing calculation unit 581 is also supplied with the SNR from the SNR calculation unit 551 and with the threshold from the threshold calculation unit 552. The zero crossing calculation unit 581 counts the zero crossings at which the amplitude of the supplied deteriorated speech signal becomes zero. At the same time, it evaluates from the SNR and the threshold whether the SNR is smaller than the second threshold TH₂. Only when the SNR is smaller than the second threshold TH₂ is the number of zero crossings averaged over the past several frames. That is, the average is obtained only when the deteriorated speech is judged to be silent. The average thus obtained is compared with a third threshold, and "1" is transmitted to the switch 582 as a control signal when the average is larger, and "0" otherwise. The third threshold can be set in advance or changed during operation.
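As an illustration only, the behavior described for the zero crossing calculation unit 581 might be sketched in Python as follows. Several details are assumptions not stated in the text: zero crossings are counted as sign changes between consecutive nonzero samples, the average is taken over the most recent `history` frames judged silent, and the last average is held while the input is not silent. All names are hypothetical.

```python
from collections import deque


def count_zero_crossings(frame):
    """Count sign changes between consecutive nonzero samples of one frame."""
    crossings = 0
    previous = None
    for sample in frame:
        if sample == 0:
            continue  # assumption: exact zeros do not count by themselves
        if previous is not None and (sample > 0) != (previous > 0):
            crossings += 1
        previous = sample
    return crossings


class ZeroCrossingGate:
    """Produce the 1/0 control signal for the switch from frames and SNR."""

    def __init__(self, snr_threshold, zc_threshold, history=4):
        self.snr_threshold = snr_threshold    # plays the role of TH2
        self.zc_threshold = zc_threshold      # plays the role of the third threshold
        self.counts = deque(maxlen=history)   # counts from recent silent frames
        self.average = 0.0

    def update(self, frame, snr):
        count = count_zero_crossings(frame)
        if snr < self.snr_threshold:          # frame judged silent
            self.counts.append(count)
            self.average = sum(self.counts) / len(self.counts)
        return 1 if self.average > self.zc_threshold else 0
```

Holding the previous average during speech is one possible reading of "averaged only when silent"; an implementation could equally reset or freeze the control signal in that case.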

【０１０７】スイッチ５８２には、注入レベル計算部５
５３からは注入雑音が、０と共に供給されている。スイ
ッチ５８２は、ゼロ交叉計算部５８１から制御信号とし
て“１”が供給されたときは注入レベル計算部５５３か
ら供給された注入雑音を、“０”が供給されたときは０
を選択し、注入雑音として出力する。従って、ゼロ交叉
の数の平均値が第３のしきい値より大きい場合のみに、
注入レベル計算部５５３からの注入雑音が、図１０にお
ける加算器５６，５７に供給されることになる。ゼロ交
叉の数は、非定常な信号ほど多くなることが知られてい
るので、非定常性が一定以上の信号に対してだけ、雑音
注入を実行し、抑圧係数の補正を行うことができる。The switch 582 is supplied with the injection noise from the injection level calculation unit 553, together with 0. The switch 582 selects the injection noise supplied from the injection level calculation unit 553 when "1" is supplied as the control signal from the zero crossing calculation unit 581, selects 0 when "0" is supplied, and outputs the selection as the injection noise. Therefore, the injection noise from the injection level calculation unit 553 is supplied to the adders 56 and 57 in FIG. 10 only when the average number of zero crossings is greater than the third threshold. Since the number of zero crossings is known to increase as a signal becomes more non-stationary, noise injection and the resulting correction of the suppression coefficient can be performed only for signals whose non-stationarity exceeds a certain level.

【０１０８】（第４の実施の形態）図１２は、本発明の
ノイズ除去装置の第４の実施の形態の全体構成を示すブ
ロック図である。このノイズ除去装置は、図１０に示し
たノイズ除去装置において、注入雑音計算部５８を注入
雑音計算部５９で置換した構成になっている。以下、こ
の相違点を中心に詳細に説明する。(Fourth Embodiment) FIG. 12 is a block diagram showing the overall structure of a fourth embodiment of the noise eliminator of the present invention. This noise removing device has a configuration in which the injection noise calculating unit 58 in the noise removing device shown in FIG. 10 is replaced with an injection noise calculating unit 59. Hereinafter, this difference will be mainly described in detail.

【０１０９】図１２に示すノイズ除去装置では、入力信
号の性質に応じて選択的に雑音注入を適用する点で、図
１０に示したノイズ除去装置と同じである。しかし、フ
レーム分割部１の出力である時間領域の劣化音声信号
が、注入雑音計算部５９に供給されていない。その理由
は、図１０に示したノイズ除去装置とは異なり、入力信
号の性質を評価するために、時間領域の劣化音声信号を
用いないためである。その代わりに、劣化音声パワース
ペクトルを用いる。図１０に示したノイズ除去装置で
は、フレーム当たりのゼロ交叉の数を用いて信号の非定
常性を評価していたが、ゼロ交叉の数と高周波領域（高
域）におけるパワースペクトルには相関があることが知
られているので、ゼロ交叉の数に代えて劣化音声パワー
スペクトルを用いることができる。The noise eliminator shown in FIG. 12 is the same as the noise eliminator shown in FIG. 10 in that noise injection is selectively applied according to the nature of the input signal. However, the time-domain deteriorated speech signal output from the frame division unit 1 is not supplied to the injection noise calculation unit 59. The reason is that, unlike the noise eliminator shown in FIG. 10, the time-domain deteriorated speech signal is not used to evaluate the nature of the input signal. Instead, the deteriorated speech power spectrum is used. The noise eliminator shown in FIG. 10 evaluates the non-stationarity of the signal using the number of zero crossings per frame. Since the number of zero crossings is known to be correlated with the power spectrum in the high-frequency region (high band), the deteriorated speech power spectrum can be used in place of the number of zero crossings.

【０１１０】図１３は、図１２における注入雑音計算部
５９の構成を示すブロック図である。図１１に示した注
入雑音計算部５８との違いは、ゼロ交叉計算部５８１が
高域電力計算部５９１に置換されていることである。高
域電力計算部５９１には、ＳＮＲ計算部５５１と共に、
劣化音声パワースペクトルが供給されている。高域電力
計算部５９１は、劣化音声パワースペクトル｜Ｙ_n(ｋ）
｜² のうち、ｋが基準値ｋ_THよりも大きいものの総和を
とる。基準値ｋ_THは、総和をとることによって、上述し
た劣化音声信号のゼロ交叉の数に対応する高域電力が得
られるように、劣化音声信号その他の条件に応じて設定
される。この結果、前記ゼロ交叉の数に対応する高域電
力が得られるので、この高域電力を第４のしきい値と比
較した結果を用いて、図１１に示した注入雑音計算部５
８と同様にスイッチ５８２を制御することができる。す
なわち、高域電力の値によって、注入レベル計算部５５
３から供給された注入雑音と０を選択し、注入雑音とし
て出力する。FIG. 13 is a block diagram showing the configuration of the injection noise calculation unit 59 in FIG. 12. It differs from the injection noise calculation unit 58 shown in FIG. 11 in that the zero crossing calculation unit 581 is replaced with a high-frequency power calculation unit 591. The high-frequency power calculation unit 591, together with the SNR calculation unit 551, is supplied with the deteriorated speech power spectrum. The high-frequency power calculation unit 591 sums those components of the deteriorated speech power spectrum |Y_n(k)|² for which k is greater than a reference value k_TH. The reference value k_TH is set according to the deteriorated speech signal and other conditions so that the sum yields a high-frequency power corresponding to the number of zero crossings of the deteriorated speech signal described above. Because a high-frequency power corresponding to the number of zero crossings is obtained in this way, the switch 582 can be controlled, as in the injection noise calculation unit 58 shown in FIG. 11, using the result of comparing this high-frequency power with a fourth threshold. That is, depending on the value of the high-frequency power, either the injection noise supplied from the injection level calculation unit 553 or 0 is selected and output as the injection noise.

【０１１１】なお、劣化音声パワースペクトル｜Ｙ
_n(ｋ）｜² のうち、ｋが基準値ｋ_THよりも大きいものを
重みづけして総和をとり、高域電力を求めるようにして
もよい。また、第４のしきい値は、予め定めておくこと
もできるし、動作途中で変更することもできる。Note that the high-frequency power may instead be obtained by weighting those components of the deteriorated speech power spectrum |Y_n(k)|² for which k is greater than the reference value k_TH before summing them. The fourth threshold can be set in advance or changed during operation.
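A short Python sketch of the high-frequency power computation and the resulting switch control follows. It is illustrative only. The text says that the switch is controlled by the result of comparing the high-frequency power with the fourth threshold but does not state the direction; the sketch assumes, by analogy with the zero crossing case, that noise is injected when the power exceeds the threshold. Function and parameter names are hypothetical.

```python
def high_band_power(power_spectrum, k_th, weights=None):
    """Sum |Y_n(k)|^2 over bins with k > k_th, optionally weighted per bin."""
    total = 0.0
    for k, power in enumerate(power_spectrum):
        if k > k_th:
            weight = 1.0 if weights is None else weights[k]
            total += weight * power
    return total


def select_injection_noise(power_spectrum, k_th, fourth_threshold,
                           injection_noise, weights=None):
    """Return the injection noise or 0, depending on the high-band power.

    Assumption: injection is enabled when the power exceeds the threshold.
    """
    power = high_band_power(power_spectrum, k_th, weights)
    return injection_noise if power > fourth_threshold else 0.0
```

Passing `weights=None` gives the plain sum of paragraph [0110]; supplying a weight list gives the weighted variant of paragraph [0111].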

【０１１２】（第５の実施の形態）図１４は、本発明の
ノイズ除去装置の第５の実施の形態の全体構成を示すブ
ロック図である。このノイズ除去装置は、図５に示した
ノイズ除去装置において、ＳＮＲ補正部６５をＳＮＲ補
正部６７で置換した構成になっている。以下、この相違
点を中心に詳細に説明する。図１４に示すノイズ除去装
置では、図１０に示したノイズ除去装置と同様に、入力
信号の性質に応じて、選択的に雑音注入を適用する。こ
のため、入力信号の性質を評価するために、フレーム分
割部１の出力である時間領域の劣化音声信号が、ＳＮＲ
補正部６７に供給されている。(Fifth Embodiment) FIG. 14 is a block diagram showing the overall configuration of a fifth embodiment of the noise eliminator of the present invention. This noise eliminator has a configuration in which the SNR correction unit 65 in the noise eliminator shown in FIG. 5 is replaced with an SNR correction unit 67. Hereinafter, this difference will be mainly described in detail. Like the noise eliminator shown in FIG. 10, the noise eliminator shown in FIG. 14 selectively applies noise injection according to the nature of the input signal. Therefore, in order to evaluate the nature of the input signal, the time-domain deteriorated speech signal output from the frame division unit 1 is supplied to the SNR correction unit 67.

【０１１３】図１５は、図１４におけるＳＮＲ補正部６
７の構成例を示すブロック図である。図８に示したＳＮ
Ｒ補正部６５の構成例とは、注入雑音計算部６６２が注
入雑音計算部６７２に置換されている点において異な
る。注入雑音計算部６６２とは異なり、注入雑音計算部
６７２には、入力信号の性質を評価するために、フレー
ム分割部１の出力である時間領域の劣化音声信号が供給
されている。FIG. 15 is a block diagram showing a configuration example of the SNR correction unit 67 in FIG. 14. It differs from the configuration example of the SNR correction unit 65 shown in FIG. 8 in that the injection noise calculation unit 662 is replaced with an injection noise calculation unit 672. Unlike the injection noise calculation unit 662, the injection noise calculation unit 672 is supplied with the time-domain deteriorated speech signal output from the frame division unit 1 in order to evaluate the nature of the input signal.

【０１１４】図１６は、注入雑音計算部６７２の構成例
を示すブロック図である。注入雑音計算部６７２は、注
入レベル計算部６７２１、スイッチ６７２２、判定部６
７２３を有する。注入レベル計算部６７２１と判定部６
７２３には、図１５における平均値計算部６６１から後
天的ＳＮＲが、また図１５におけるしきい値計算部６５
４１からしきい値が、供給されている。判定部６７２３
にはさらに、劣化音声信号が供給されている。注入レベ
ル計算部６７２１は、図２における注入レベル計算部５
５３と同様の動作により、注入レベルを求め、スイッチ
６７２２に伝達する。判定部６７２３は、前記劣化音声
信号、前記後天的ＳＮＲ、前記しきい値を受け、入力信
号の性質に応じた、スイッチ６７２２の制御信号を発生
する。FIG. 16 is a block diagram showing a configuration example of the injection noise calculation unit 672. The injection noise calculation unit 672 includes an injection level calculation unit 6721, a switch 6722, and a determination unit 6723. The injection level calculation unit 6721 and the determination unit 6723 are supplied with the acquired SNR from the average value calculation unit 661 in FIG. 15 and with the threshold from the threshold calculation unit 6541 in FIG. 15. The determination unit 6723 is further supplied with the deteriorated speech signal. The injection level calculation unit 6721 obtains the injection level by the same operation as the injection level calculation unit 553 in FIG. 2 and transmits it to the switch 6722. The determination unit 6723 receives the deteriorated speech signal, the acquired SNR, and the threshold, and generates a control signal for the switch 6722 according to the nature of the input signal.

【０１１５】ここで、判定部６７２３は、さらに、無音
区間検出部６７２３１、ゼロ交叉計算部６７２３２、比
較部６７２３３から構成される。無音区間検出部６７２
３１は、前記後天的ＳＮＲと前記しきい値を受け、ＳＮ
Ｒが前記第２のしきい値ＴＨ ₂より小さいときに“１”
を、それ以外の場合は“０”を、ゼロ交叉計算部６７２
３２に伝達する。すなわち、劣化音声が無音と判定され
ると“１”を、それ以外の場合は“０”をゼロ交叉計算
部６７２３２に伝達することになる。ゼロ交叉計算部６
７２３２は、供給された劣化音声信号の振幅がゼロとな
るゼロ交叉を計数し、無音区間検出部６７２３１から
“１”を受けたときだけ、前記ゼロ交叉の数を過去の数
フレームに渡って平均化する。このようにして得られた
平均値は、比較部６７２３３に伝達される。比較部６７
２３３は、供給された前記ゼロ交叉の平均値を前記第３
のしきい値と比較し、平均値の方が大きいときに“１”
を、それ以外の場合は“０”を、制御信号としてスイッ
チ６７２２に伝達する。Here, the determination unit 6723 is further composed of a silent section detection unit 67231, a zero crossing calculation unit 67232, and a comparison unit 67233. The silent section detection unit 67231 receives the acquired SNR and the threshold, and transmits "1" to the zero crossing calculation unit 67232 when the SNR is smaller than the second threshold TH₂, and "0" otherwise. That is, it transmits "1" to the zero crossing calculation unit 67232 when the deteriorated speech is judged to be silent, and "0" otherwise. The zero crossing calculation unit 67232 counts the zero crossings at which the amplitude of the supplied deteriorated speech signal becomes zero and, only when it receives "1" from the silent section detection unit 67231, averages the number of zero crossings over the past several frames. The average thus obtained is transmitted to the comparison unit 67233. The comparison unit 67233 compares the supplied average number of zero crossings with the third threshold, and transmits "1" to the switch 6722 as a control signal when the average is larger, and "0" otherwise.

【０１１６】スイッチ６７２２は、判定部６７２３の比
較部６７２３３から“１”が供給されたときは注入レベ
ル計算部６７２１から供給された注入雑音を、“０”が
供給されたときは０を選択し、注入雑音として出力す
る。すなわち、スイッチ６７２２の動作は図１１におけ
るスイッチ５８２の動作に等しく、非定常性が一定以上
の信号に対してだけ、雑音注入を実行し、抑圧係数の補
正を行うことができる。The switch 6722 selects the injection noise supplied from the injection level calculation unit 6721 when "1" is supplied from the comparison unit 67233 of the determination unit 6723, selects 0 when "0" is supplied, and outputs the selection as the injection noise. That is, the operation of the switch 6722 is the same as that of the switch 582 in FIG. 11: noise injection and the correction of the suppression coefficient are performed only for signals whose non-stationarity exceeds a certain level.

【０１１７】（第６の実施の形態）図１７は、本発明の
ノイズ除去装置の第６の実施の形態の全体構成を示すブ
ロック図である。このノイズ除去装置は、図１４に示し
たノイズ除去装置において、ＳＮＲ補正部６７をＳＮＲ
補正部６８で置換した構成になっている。以下、この相
違点を中心に詳細に説明する。(Sixth Embodiment) FIG. 17 is a block diagram showing the overall configuration of a sixth embodiment of the noise eliminator of the present invention. This noise eliminator has a configuration in which the SNR correction unit 67 in the noise eliminator shown in FIG. 14 is replaced with an SNR correction unit 68. Hereinafter, this difference will be mainly described in detail.

【０１１８】図１７に示すノイズ除去装置では、入力信
号の性質に応じて、選択的に雑音注入を適用する。その
際、図１４に示したノイズ除去装置とは異なり、時間領
域の劣化音声信号の代わりに劣化音声パワースペクトル
を用いて、入力信号の性質を評価する。すなわち、フレ
ーム当たりのゼロ交叉数で信号の非定常性を評価してい
た第５の実施の形態と異なり、高周波領域（高域）にお
ける劣化音声パワースペクトルを用いて信号の非定常性
を評価する。このため、フレーム分割部１の出力である
時間領域の劣化音声信号が、ＳＮＲ補正部６８に供給さ
れていない。図１８は、図１７におけるＳＮＲ補正部６
８の構成例を示すブロック図である。図１５に示したＳ
ＮＲ補正部６７との違いは、注入雑音計算部６７２が注
入雑音計算部６８２に置換されていることである。In the noise eliminator shown in FIG. 17, noise injection is selectively applied according to the nature of the input signal. Unlike the noise eliminator shown in FIG. 14, however, the nature of the input signal is evaluated using the deteriorated speech power spectrum instead of the time-domain deteriorated speech signal. That is, unlike the fifth embodiment, in which the non-stationarity of the signal is evaluated by the number of zero crossings per frame, the non-stationarity of the signal is evaluated using the deteriorated speech power spectrum in the high-frequency region (high band). For this reason, the time-domain deteriorated speech signal output from the frame division unit 1 is not supplied to the SNR correction unit 68. FIG. 18 is a block diagram showing a configuration example of the SNR correction unit 68 in FIG. 17. It differs from the SNR correction unit 67 shown in FIG. 15 in that the injection noise calculation unit 672 is replaced with an injection noise calculation unit 682.

【０１１９】図１９は、注入雑音計算部６８２の構成例
を示すブロック図である。図１６に示した注入雑音計算
部６７２との違いは、ゼロ交叉計算部６７２３２が高域
電力計算部６８２３２に置換されていることである。高
域電力計算部６８２３２には、無音区間計算部６７２３
１の出力信号と共に、劣化音声パワースペクトルが供給
されている。高域電力計算部６８２３２は、図１３にお
ける高域電力計算部５９１と同様の動作によって、劣化
音声パワースペクトル｜Ｙ_n(ｋ）｜² のうち、ｋが基準
値ｋ_THよりも大きいものの総和をとって、高域電力を求
める。この高域電力は、比較部６７２３３に伝達され
る。比較部６７２３３は、この高域電力を前記第４のし
きい値と比較した結果を用いて、スイッチ６７２２の制
御信号を発生する。すなわち、高域電力の値によって、
注入レベル計算部６７２１から供給された注入雑音と０
を選択し、注入雑音として出力する。FIG. 19 is a block diagram showing a configuration example of the injection noise calculation unit 682. It differs from the injection noise calculation unit 672 shown in FIG. 16 in that the zero crossing calculation unit 67232 is replaced with a high-frequency power calculation unit 68232. The high-frequency power calculation unit 68232 is supplied with the deteriorated speech power spectrum together with the output signal of the silent section calculation unit 67231. By the same operation as the high-frequency power calculation unit 591 in FIG. 13, the high-frequency power calculation unit 68232 obtains the high-frequency power by summing those components of the deteriorated speech power spectrum |Y_n(k)|² for which k is greater than the reference value k_TH. This high-frequency power is transmitted to the comparison unit 67233. The comparison unit 67233 generates the control signal for the switch 6722 using the result of comparing the high-frequency power with the fourth threshold. That is, depending on the value of the high-frequency power, either the injection noise supplied from the injection level calculation unit 6721 or 0 is selected and output as the injection noise.

【０１２０】（第７の実施の形態）図２０は、本発明の
ノイズ除去装置の第７の実施の形態の全体構成を示すブ
ロック図である。このノイズ除去装置と図１に示したノ
イズ除去装置とは、推定雑音計算部５、重みつき劣化音
声計算部１４及び抑圧係数補正部１５を除いて同一であ
る。図２０に示すノイズ除去装置の構成は、窓がけ処理
部２２及び注入雑音計算部５８を除けば、「２０００年
４月、電子情報通信学会技術研究報告、ＤＳＰ、５３〜
６０ページ」（文献５）に開示されたものに等しい。文
献５に開示された方法は、文献１に開示された従来の方
法とは異なり、重みつき劣化音声スペクトルを用いて、
雑音のパワースペクトルを推定することによって、正確
な推定雑音を得ることができる。以下、これらの相違点
を中心に詳細に説明する。(Seventh Embodiment) FIG. 20 is a block diagram showing the overall configuration of a seventh embodiment of the noise eliminator of the present invention. This noise eliminator is identical to the noise eliminator shown in FIG. 1 except for the estimated noise calculation unit 5, the weighted deteriorated speech calculation unit 14, and the suppression coefficient correction unit 15. Apart from the windowing processing unit 22 and the injection noise calculation unit 58, the configuration of the noise eliminator shown in FIG. 20 is the same as that disclosed in "IEICE Technical Report, DSP, pages 53 to 60, April 2000" (Reference 5). Unlike the conventional method disclosed in Reference 1, the method disclosed in Reference 5 estimates the noise power spectrum using a weighted deteriorated speech spectrum, and can thereby obtain an accurate noise estimate. Hereinafter, these differences will be mainly described in detail.

【０１２１】まず、図２０における重みつき劣化音声計
算部１４について説明する。図２１は、重みつき劣化音
声計算部１４の構成を示すブロック図である。重みつき
劣化音声計算部１４は、推定雑音記憶部１４０１、周波
数別ＳＮＲ計算部１４０２、多重非線形処理部１４０
５、及び多重乗算部１４０４を有する。推定雑音記憶部
１４０１は、図２０における推定雑音計算部５から供給
される推定雑音パワースペクトルを記憶し、１フレーム
前に記憶された推定雑音パワースペクトルを周波数別Ｓ
ＮＲ計算部１４０２へ出力する。周波数別ＳＮＲ計算部
１４０２は、推定雑音記憶部１４０１から供給される推
定雑音パワースペクトルと、図２０における多重乗算部
１７から供給される劣化音声パワースペクトルを用い
て、ＳＮＲを各周波数毎に求め、多重非線形処理部１４
０５に出力する。多重非線形処理部１４０５は、周波数
別ＳＮＲ計算部１４０２から供給されるＳＮＲを用いて
重み係数ベクトルを計算し、重み係数ベクトルを多重乗
算部１４０４に出力する。多重乗算部１４０４は、図２
０における多重乗算部１７から供給される劣化音声パワ
ースペクトルと、多重非線形処理部１４０５から供給さ
れる重み係数ベクトルの積を周波数毎に計算し、重みつ
き劣化音声パワースペクトルを図２０における推定雑音
計算部５に出力する。First, the weighted deteriorated speech calculation unit 14 in FIG. 20 will be described. FIG. 21 is a block diagram showing the configuration of the weighted deteriorated speech calculation unit 14. The weighted deteriorated speech calculation unit 14 includes an estimated noise storage unit 1401, a frequency-based SNR calculation unit 1402, a multiple nonlinear processing unit 1405, and a multiplex multiplication unit 1404. The estimated noise storage unit 1401 stores the estimated noise power spectrum supplied from the estimated noise calculation unit 5 in FIG. 20, and outputs the estimated noise power spectrum stored one frame earlier to the frequency-based SNR calculation unit 1402. The frequency-based SNR calculation unit 1402 obtains the SNR for each frequency using the estimated noise power spectrum supplied from the estimated noise storage unit 1401 and the deteriorated speech power spectrum supplied from the multiplex multiplication unit 17 in FIG. 20, and outputs it to the multiple nonlinear processing unit 1405. The multiple nonlinear processing unit 1405 calculates a weighting coefficient vector using the SNR supplied from the frequency-based SNR calculation unit 1402, and outputs the weighting coefficient vector to the multiplex multiplication unit 1404. The multiplex multiplication unit 1404 calculates, for each frequency, the product of the deteriorated speech power spectrum supplied from the multiplex multiplication unit 17 in FIG. 20 and the weighting coefficient vector supplied from the multiple nonlinear processing unit 1405, and outputs the resulting weighted deteriorated speech power spectrum to the estimated noise calculation unit 5 in FIG. 20.
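The signal path of FIG. 21 can be sketched in Python as below. This is a simplified illustration, not the patented implementation: the per-frequency SNR is assumed to be the ratio of deteriorated speech power to the noise estimate stored one frame earlier, and the weighting function is passed in as a parameter because its exact form is defined separately. The class and method names are hypothetical.

```python
class WeightedDegradedSpeech:
    """Weight the deteriorated speech power spectrum by a function of SNR."""

    def __init__(self, initial_noise, weight_fn):
        # Plays the role of storage unit 1401: noise estimate of the previous frame.
        self.previous_noise = list(initial_noise)
        self.weight_fn = weight_fn

    def weigh(self, degraded_spectrum):
        """Return the weighted spectrum for the current frame."""
        weighted = []
        for power, noise in zip(degraded_spectrum, self.previous_noise):
            # Assumed per-frequency SNR (role of unit 1402).
            snr = power / noise if noise > 0 else float("inf")
            # Weight from the SNR (unit 1405), then per-bin product (unit 1404).
            weighted.append(self.weight_fn(snr) * power)
        return weighted

    def store(self, noise_estimate):
        """Keep the new noise estimate for use in the next frame."""
        self.previous_noise = list(noise_estimate)
```

In use, `weigh` would be called once per frame, the result passed to the noise estimator, and the estimator's output handed back through `store`.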

【０１２２】周波数別ＳＮＲ計算部１４０２の構成は、
既に図５６を用いて説明した周波数別ＳＮＲ計算部６に
等しいので、詳細な説明は省略する。また、多重乗算部
１４０４の構成は、既に図５２を用いて説明した多重乗
算部１７に等しいので、詳細な説明は省略する。よって
次に、図２１における多重非線形処理部１４０５の構成
と動作について詳しく説明する。The configuration of the frequency-based SNR calculation unit 1402 is the same as that of the frequency-based SNR calculation unit 6 already described with reference to FIG. 56, so a detailed description is omitted. Likewise, the configuration of the multiplex multiplication unit 1404 is the same as that of the multiplex multiplication unit 17 already described with reference to FIG. 52, so a detailed description is omitted. The configuration and operation of the multiple nonlinear processing unit 1405 in FIG. 21 are therefore described in detail next.

【０１２３】図２２は、重みつき劣化音声計算部１４に
含まれる多重非線形処理部１４０５の構成を示すブロッ
ク図である。多重非線形処理部１４０５は、分離部１４
９５、Ｋ個の非線形処理部１４８５₀ 〜１４８５_K-1 、
及び多重化部１４７５を有する。分離部１４９５は、図
２１における周波数別ＳＮＲ計算部１４０２から供給さ
れるＳＮＲを周波数別のＳＮＲに分離し、非線形処理部
１４８５₀ 〜１４８５_K- ₁ に出力する。非線形処理部１
４８５₀ 〜１４８５_K-1 は、それぞれ入力値に応じた実
数値を出力する非線形関数を有する。図２３に、非線形
関数の例を示す。ｆ₁ を入力値としたとき、図２３に示
される非線形関数の出力値ｆ₂ は、式（１７）で与えら
れる。FIG. 22 is a block diagram showing the configuration of the multiple nonlinear processing unit 1405 included in the weighted deteriorated speech calculation unit 14. The multiple nonlinear processing unit 1405 includes a separation unit 1495, K nonlinear processing units 1485₀ to 1485_K-1, and a multiplexing unit 1475. The separation unit 1495 separates the SNR supplied from the frequency-based SNR calculation unit 1402 in FIG. 21 into per-frequency SNRs and outputs them to the nonlinear processing units 1485₀ to 1485_K-1. Each of the nonlinear processing units 1485₀ to 1485_K-1 has a nonlinear function that outputs a real value according to its input value. FIG. 23 shows an example of the nonlinear function. With f₁ as the input value, the output value f₂ of the nonlinear function shown in FIG. 23 is given by Equation (17).

【０１２４】[0124]

【数１７】 [Equation 17]

【０１２５】非線形処理部１４８５₀ 〜１４８５_K-1
は、分離部１４９５から供給される周波数別ＳＮＲを、
上述した非線形関数によって処理して重み係数を求め、
多重化部１４７５に出力する。すなわち、非線形処理部
１４８５₀ 〜１４８５_K-1 は、ＳＮＲに応じた１から０
までの重み係数を出力する。ＳＮＲが小さい時は１を、
大きい時は０を出力する。多重化部１４７５は、非線形
処理部１４８５₀ 〜１４８５_K-1 から出力された重み係
数を多重化し、その結果得られた重み係数ベクトルを図
２１における多重乗算部１４０４に出力する。The nonlinear processing units 1485₀ to 1485_K-1 process the per-frequency SNRs supplied from the separation unit 1495 with the nonlinear function described above to obtain weighting coefficients, and output them to the multiplexing unit 1475. That is, the nonlinear processing units 1485₀ to 1485_K-1 output a weighting coefficient between 1 and 0 according to the SNR: 1 when the SNR is small and 0 when it is large. The multiplexing unit 1475 multiplexes the weighting coefficients output from the nonlinear processing units 1485₀ to 1485_K-1, and outputs the resulting weighting coefficient vector to the multiplex multiplication unit 1404 in FIG. 21.
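Equation (17) is not reproduced in this text, so the following Python sketch does not claim to be that equation. It only illustrates one function with the stated property (output 1 for small SNR, 0 for large SNR, values in between otherwise), using a linear ramp between two breakpoints. The breakpoints `low` and `high` and the function name are assumptions introduced for illustration.

```python
def snr_weight(snr, low, high):
    """Map an SNR to a weight in [0, 1]: 1 below `low`, 0 above `high`.

    Assumption: a linear ramp is used between the two breakpoints.
    """
    if low >= high:
        raise ValueError("low must be smaller than high")
    if snr <= low:
        return 1.0
    if snr >= high:
        return 0.0
    return (high - snr) / (high - low)


def weight_vector(snr_per_bin, low, high):
    """Apply the weight function to every frequency bin (multiplexing step)."""
    return [snr_weight(snr, low, high) for snr in snr_per_bin]
```

For example, with `low=1.0` and `high=3.0`, an SNR of 2.0 lies halfway along the ramp and receives a weight of 0.5.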

【０１２６】このように、図２１における多重乗算部１
４０４で劣化音声パワースペクトルと乗算される重み係
数は、ＳＮＲに応じた値になっており、ＳＮＲが大きい
程、すなわち劣化音声に含まれる音声成分が大きい程、
重み係数の値は小さくなる。推定雑音の更新には一般に
劣化音声パワースペクトルが用いられるが、推定雑音の
更新に用いる劣化音声パワースペクトルに対して、ＳＮ
Ｒに応じた重みづけを行うことで、劣化音声パワースペ
クトルに含まれる音声成分の影響を小さくすることがで
き、より精度の高い雑音推定を行うことができる。な
お、重み係数の計算に非線形関数を用いた例を示した
が、非線形関数以外にも線形関数や高次多項式など、他
の形で表されるＳＮＲの関数を用いることも可能であ
る。As described above, the weighting coefficient by which the multiplex multiplication unit 1404 in FIG. 21 multiplies the deteriorated speech power spectrum takes a value according to the SNR: the larger the SNR, that is, the larger the speech component contained in the deteriorated speech, the smaller the weighting coefficient. The deteriorated speech power spectrum is generally used to update the estimated noise. By weighting the deteriorated speech power spectrum used for this update according to the SNR, the influence of the speech component contained in it can be reduced, and more accurate noise estimation can be performed. Although an example using a nonlinear function to calculate the weighting coefficient has been shown, other functions of the SNR, such as a linear function or a higher-order polynomial, can also be used.

【０１２７】次に、図２０における推定雑音計算部５に
ついて説明する。図２４は、推定雑音計算部５の構成を
示すブロック図である。この推定雑音計算部５と図５３
に示した推定雑音計算部５１とは、分離部５０５が存在
することと、周波数別推定雑音計算部５１４₀ 〜５１４
_K-1 が周波数別推定雑音計算部５０４₀ 〜５０４_K-1に
置換されていることを除いて同一である。以下、これら
の相違点を中心に詳細に説明する。Next, the estimated noise calculation unit 5 in FIG. 20 will be described. FIG. 24 is a block diagram showing the configuration of the estimated noise calculation unit 5. The estimated noise calculation unit 5 is identical to the estimated noise calculation unit 51 shown in FIG. 53 except that a separation unit 505 is present and that the frequency-based estimated noise calculation units 514₀ to 514_K-1 are replaced with frequency-based estimated noise calculation units 504₀ to 504_K-1. Hereinafter, these differences will be mainly described in detail.

【０１２８】分離部５０５は、図２０における重みつき
劣化音声計算部１４から供給される重みつき劣化音声パ
ワースペクトルを、周波数別の重みつき劣化音声パワー
スペクトルに分離し、それぞれ周波数別推定雑音計算部
５０４₀ 〜５０４_K-1 に出力する。周波数別推定雑音計
算部５０４₀ 〜５０４_K-1 は、分離部５０２から供給さ
れる周波数別劣化音声パワースペクトル、分離部５０５
から供給される周波数別重みつき劣化音声パワースペク
トル、図２０における音声検出部４から供給される音声
検出フラグ、及び図２０におけるカウンタ１３から供給
されるカウント値から周波数別推定雑音パワースペクト
ルを計算し、多重化部５０３へ出力する。多重化部５０
３は、周波数別推定雑音計算部５０４₀ 〜５０４_K-1 か
ら供給される周波数別推定雑音パワースペクトルを多重
化し、その結果得られた推定雑音パワースペクトルを図
２０における加算器５６と注入雑音計算部５８と重みつ
き劣化音声計算部１４へ出力する。周波数別推定雑音計
算部５０４₀ 〜５０４_K-1の構成と動作の詳細な説明
は、図２５〜図２７を参照しながら行う。The separation unit 505 separates the weighted deteriorated speech power spectrum supplied from the weighted deteriorated speech calculation unit 14 in FIG. 20 into per-frequency weighted deteriorated speech power spectra, and outputs them to the frequency-based estimated noise calculation units 504₀ to 504_K-1, respectively. The frequency-based estimated noise calculation units 504₀ to 504_K-1 calculate the per-frequency estimated noise power spectrum from the per-frequency deteriorated speech power spectrum supplied from the separation unit 502, the per-frequency weighted deteriorated speech power spectrum supplied from the separation unit 505, the voice detection flag supplied from the voice detection unit 4 in FIG. 20, and the count value supplied from the counter 13 in FIG. 20, and output it to the multiplexing unit 503. The multiplexing unit 503 multiplexes the per-frequency estimated noise power spectra supplied from the frequency-based estimated noise calculation units 504₀ to 504_K-1, and outputs the resulting estimated noise power spectrum to the adder 56, the injection noise calculation unit 58, and the weighted deteriorated speech calculation unit 14 in FIG. 20. The configuration and operation of the frequency-based estimated noise calculation units 504₀ to 504_K-1 will be described in detail with reference to FIGS. 25 to 27.

【０１２９】図２５は、図２４に示した推定雑音計算部
５に含まれる周波数別推定雑音計算部５０４₀ 〜５０４
_K-1 の第１の構成例を示すブロック図である。図５４に
示した周波数別推定雑音計算部５１４との相違点は、周
波数別推定雑音計算部５０４ ₀ 〜５０４_K-1 が推定雑音
記憶部５９４２を有すること、更新判定部５２１が更新
判定部５２０に置換されていること、及びスイッチ５０
４４への入力が周波数別劣化音声パワースペクトルから
周波数別重みつき劣化音声パワースペクトルに置換され
ていることである。周波数別推定雑音計算部５０４₀ 〜
５０４_K-1 は、推定雑音の計算に劣化音声パワースペク
トルではなく重みつき劣化音声パワースペクトルを用い
ており、また、推定雑音の更新判定に、推定雑音と劣化
音声パワースペクトルを用いているため、これらの相違
点が発生する。推定雑音記憶部５９４２は、除算部５０
４８から供給される周波数別推定雑音パワースペクトル
を記憶し、１フレーム前に記憶された周波数別推定雑音
パワースペクトルを更新判定部５２０に出力する。更新
判定部５２０の構成と動作の詳細な説明は、図２６を参
照しながら行う。FIG. 25 is a block diagram showing a first configuration example of the frequency-based estimated noise calculation units 504₀ to 504_K-1 included in the estimated noise calculation unit 5 shown in FIG. 24. They differ from the frequency-based estimated noise calculation unit 514 shown in FIG. 54 in that they have an estimated noise storage unit 5942, in that the update determination unit 521 is replaced with an update determination unit 520, and in that the input to the switch 5044 is changed from the per-frequency deteriorated speech power spectrum to the per-frequency weighted deteriorated speech power spectrum. These differences arise because the frequency-based estimated noise calculation units 504₀ to 504_K-1 use the weighted deteriorated speech power spectrum, rather than the deteriorated speech power spectrum, to calculate the estimated noise, and use the estimated noise and the deteriorated speech power spectrum to decide whether to update the estimated noise. The estimated noise storage unit 5942 stores the per-frequency estimated noise power spectrum supplied from the division unit 5048, and outputs the per-frequency estimated noise power spectrum stored one frame earlier to the update determination unit 520. The configuration and operation of the update determination unit 520 will be described in detail with reference to FIG. 26.

【０１３０】図２６は、図２５に示した周波数別推定雑
音計算部５０４₀ 〜５０４_K-1 に含まれる更新判定部５
２０の構成を示すブロック図である。図５５に示した更
新判定部５２１との相違点は、論理和計算部５２１１が
論理和計算部５２０１に置換されていることと、更新判
定部５２０が比較部５２０５、閾値記憶部５２０６及び
閾値計算部５２０７を有することである。以下、これら
の相違点を中心に詳細な動作を説明する。閾値計算部５
２０７は、図２５における推定雑音記憶部５９４２から
供給される周波数別推定雑音パワースペクトルに応じた
値を計算し、閾値として閾値記憶部５２０６に出力す
る。最も簡単な閾値の計算方法は、周波数別推定雑音パ
ワースペクトルの定数倍である。その他に、高次多項式
や非線形関数を用いて閾値を計算することも可能であ
る。FIG. 26 is a block diagram showing the configuration of the update determination unit 520 included in the frequency-based estimated noise calculation units 504₀ to 504_K-1 shown in FIG. 25. It differs from the update determination unit 521 shown in FIG. 55 in that the logical sum calculation unit 5211 is replaced with a logical sum calculation unit 5201, and in that the update determination unit 520 has a comparison unit 5205, a threshold storage unit 5206, and a threshold calculation unit 5207. The detailed operation is described below, focusing on these differences. The threshold calculation unit 5207 calculates a value according to the per-frequency estimated noise power spectrum supplied from the estimated noise storage unit 5942 in FIG. 25, and outputs it to the threshold storage unit 5206 as the threshold. The simplest way to calculate the threshold is to take a constant multiple of the per-frequency estimated noise power spectrum. Alternatively, the threshold can be calculated using a higher-order polynomial or a nonlinear function.

【０１３１】閾値記憶部５２０６は、閾値計算部５２０
７から出力された閾値を記憶し、１フレーム前に記憶さ
れた閾値を比較部５２０５へ出力する。比較部５２０５
は、閾値記憶部５２０６から供給される閾値と図２４に
おける分離部５０２から供給される周波数別劣化音声パ
ワースペクトルを比較し、周波数別劣化音声パワースペ
クトルが閾値よりも小さければ“１”を、大きければ
“０”を論理和計算部５２０１に出力する。すなわち、
推定雑音パワースペクトルの大きさをもとに、劣化音声
信号が雑音であるか否かを判別している。論理和計算部
５２０１は、比較部５２０３の出力値、論理否定回路５
２０２の出力値、及び比較部５２０５の出力値の論理和
を計算し、計算結果を図２５におけるスイッチ５０４
４、シフトレジスタ５０４５及びカウンタ５０４９に出
力する。The threshold storage unit 5206 stores the threshold output from the threshold calculation unit 5207 and outputs the threshold stored one frame earlier to the comparison unit 5205. The comparison unit 5205 compares the threshold supplied from the threshold storage unit 5206 with the frequency-specific deteriorated speech power spectrum supplied from the separation unit 502 in FIG. 24, and outputs “1” to the logical sum calculation unit 5201 if the frequency-specific deteriorated speech power spectrum is smaller than the threshold and “0” if it is larger. In other words, it judges whether the deteriorated speech signal is noise on the basis of the magnitude of the estimated noise power spectrum. The logical sum calculation unit 5201 calculates the logical sum (OR) of the output of the comparison unit 5203, the output of the logical NOT circuit 5202, and the output of the comparison unit 5205, and outputs the result to the switch 5044, the shift register 5045, and the counter 5049 in FIG. 25.

【０１３２】従って、初期状態や無音区間だけでなく、
有音区間でも劣化音声パワーが小さい場合には、更新判
定部５２０は“１”を出力する。すなわち、推定雑音の
更新が行われる。閾値の計算は各周波数毎に行われるた
め、各周波数毎に推定雑音の更新を行うことができる。Therefore, the update determination unit 520 outputs “1” not only in the initial state and in silent sections but also in voiced sections where the deteriorated speech power is small. That is, the estimated noise is updated. Because the threshold is calculated for each frequency, the estimated noise can be updated frequency by frequency.
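The update decision for one frequency bin can be sketched as follows. This is a minimal illustration, not the patent's circuit: the function name, the `scale` constant, and the `other_flags` argument (standing in for the outputs of the comparison unit 5203 and the logical NOT circuit 5202, whose inputs are described elsewhere) are assumptions, and the one-frame delay of the threshold storage unit 5206 is left to the caller.

```python
def update_flag(degraded_power, prev_noise_estimate, other_flags=(0, 0), scale=2.0):
    """Return 1 when the noise estimate of this frequency bin should be updated.

    scale and other_flags are hypothetical names used only for this sketch.
    """
    # Threshold calculation unit 5207, simplest rule: a constant multiple of
    # the estimated noise power of the previous frame.
    threshold = scale * prev_noise_estimate
    # Comparison unit 5205: "1" if the deteriorated speech power is below the threshold.
    below_threshold = 1 if degraded_power < threshold else 0
    # Logical sum unit 5201: OR of this flag with the two other decision flags.
    return int(bool(below_threshold) or any(other_flags))
```

Because the function is evaluated independently for each bin, a bin with low power can trigger an update even while other bins in the same frame contain speech.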

【０１３３】図２５において、ＣＮＴをカウンタ５０４
９のカウント値、Ｎをシフトレジスタ５０４５のレジス
タ長とする。そして、Ｂ_n(ｋ）（ｎ＝０，１，....，Ｎ
−１）をシフトレジスタ５０４５に蓄積されている周波
数別重みつき劣化音声パワースペクトルとする。このと
き、除算部５０４８から出力される周波数別推定雑音パ
ワースペクトルλ_n(ｋ）は、式（１８）で与えられる。In FIG. 25, let CNT be the count value of the counter 5049 and N be the register length of the shift register 5045. Let B_n(k) (n = 0, 1, ..., N-1) be the frequency-specific weighted deteriorated speech power spectra stored in the shift register 5045. The frequency-specific estimated noise power spectrum λ_n(k) output from the division unit 5048 is then given by Equation (18).

【０１３４】[0134]

【数１８】 [Equation 18]

【０１３５】すなわち、λ_n(ｋ）はシフトレジスタ５０
４５に蓄積されている周波数別重みつき劣化音声パワー
スペクトルの平均値となる。平均値の計算は、重みつき
加算部（巡回型フィルタ）を用いて行うことも可能であ
る。次に、図２７を参照しながら、λ_n(ｋ）の計算に重
みつき加算部を用いる構成例について説明する。That is, λ_n(k) is the average of the frequency-specific weighted deteriorated speech power spectra stored in the shift register 5045. The average can also be calculated with a weighted addition unit (recursive filter). A configuration example that uses a weighted addition unit to calculate λ_n(k) is described next with reference to FIG. 27.
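Equation (18) is not reproduced in this text, but the description of it as the average of the values held in the shift register suggests a sliding-window mean. The sketch below shows one way this could work for a single frequency bin. Dividing by min(CNT, N) while the register is still filling is an assumption inferred from the presence of the counter 5049 and the minimum value selection unit 5047; the class and method names are hypothetical.

```python
from collections import deque


class SlidingNoiseEstimate:
    """Sliding-window average of weighted deteriorated speech power for one bin."""

    def __init__(self, register_length):
        self.register_length = register_length           # N
        self.register = deque(maxlen=register_length)    # shift register 5045
        self.count = 0                                    # counter 5049 (CNT)

    def update(self, weighted_power):
        # Called only when the update determination unit outputs "1".
        self.register.append(weighted_power)  # oldest value drops out when full
        self.count += 1
        # Assumed role of the minimum value selection unit 5047:
        # divide by the number of valid entries, at most N.
        divisor = min(self.count, self.register_length)
        return sum(self.register) / divisor               # division unit 5048
```

This form needs N stored values per bin, which is the memory cost that the recursive alternative of FIG. 27 avoids.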

【０１３６】図２７は、図２４に示した推定雑音計算部
５に含まれる周波数別推定雑音計算部５０４₀ 〜５０４
_K-1 の第２の構成例を示すブロック図である。図２５に
示した周波数別推定雑音計算部５０４におけるシフトレ
ジスタ５０４５、加算器５０４６、最小値選択部５０４
７、除算部５０４８、カウンタ５０４９、レジスタ長記
憶部５９４１、最小値選択部５０４７の代わりに、周波
数別推定雑音計算部５０７は、重みつき加算部５０７
１、重み記憶部５０７２を有する。FIG. 27 is a block diagram showing a second configuration example of the frequency-specific estimated noise calculation units 504_0 to 504_{K-1} included in the estimated noise calculation unit 5 shown in FIG. 24. In place of the shift register 5045, adder 5046, minimum value selection unit 5047, division unit 5048, counter 5049, and register length storage unit 5941 of the frequency-specific estimated noise calculation unit 504 shown in FIG. 25, the frequency-specific estimated noise calculation unit 507 has a weighted addition unit 5071 and a weight storage unit 5072.

【０１３７】重みつき加算部５０７１は、推定雑音記憶
部５９４２から供給される１フレーム前の周波数別推定
雑音パワースペクトル、スイッチ５０４４から供給され
る周波数別重みつき劣化音声パワースペクトル及び重み
記憶部５０７２から出力される重みを用いて、周波数別
推定雑音を計算し、図２４における多重化部５０３へ出
力する。すなわち、重み記憶部５０７２が記憶する重み
をδ、周波数別重みつき劣化音声パワースペクトルを｜
Ｙ_n(ｋ）｜² バーとしたとき、重みつき加算部５０７１
から出力される周波数別推定雑音パワースペクトルλ
_n(ｋ）は、式（１９）で与えられる。The weighted addition unit 5071 calculates the frequency-specific estimated noise from the frequency-specific estimated noise power spectrum of one frame earlier supplied from the estimated noise storage unit 5942, the frequency-specific weighted deteriorated speech power spectrum supplied from the switch 5044, and the weight output from the weight storage unit 5072, and outputs it to the multiplexing unit 503 in FIG. 24. That is, when δ is the weight stored in the weight storage unit 5072 and |Y_n(k)|² bar is the frequency-specific weighted deteriorated speech power spectrum, the frequency-specific estimated noise power spectrum λ_n(k) output from the weighted addition unit 5071 is given by Equation (19).

【０１３８】[0138]

【数１９】 [Equation 19]

【０１３９】重みつき加算部５０７１の構成は、既に図
５１を用いて説明した重みつき加算部４０７に等しいの
で、詳細な説明は省略する。但し、重みつき加算の計算
は常に行なわれる。The structure of the weighted addition unit 5071 is the same as that of the weighted addition unit 407 already described with reference to FIG. 51, and therefore detailed description thereof will be omitted. However, the weighted addition calculation is always performed.
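Equation (19) is not reproduced in this text. A weighted addition of the previous estimate and the new weighted deteriorated speech power is commonly written as a first-order recursive average, and the sketch below assumes that form, λ_n(k) = δ·λ_{n-1}(k) + (1 - δ)·|Y_n(k)|² bar. Which of the two terms carries δ is an assumption, as are the function name and the default weight.

```python
def recursive_noise_update(prev_estimate, weighted_power, delta=0.9):
    """One step of a first-order recursive (leaky) average for one frequency bin.

    Assumed form: delta weights the previous estimate and (1 - delta) the new input.
    """
    return delta * prev_estimate + (1.0 - delta) * weighted_power
```

Compared with the shift-register average, this needs only the previous estimate per bin, at the cost of an exponentially decaying rather than rectangular averaging window.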

【０１４０】次に、図２０における抑圧係数補正部１５
について説明する。図２８は、図２０における抑圧係数
補正部１５の構成を示すブロック図である。ＳＮＲが低
いときに抑圧不足により発生する残留雑音や、ＳＮＲが
高いときに過度の抑圧で発生する音声の歪みによる音質
劣化を防ぐために、抑圧係数補正部１５は、ＳＮＲに応
じた抑圧係数の補正を行なう。補正の例として、ＳＮＲ
が低いときには抑圧係数に修正値を加えて残留雑音を抑
圧し、ＳＮＲが高いときには抑圧係数に下限値を設定し
て音声の歪みを防止することができる。抑圧係数補正部
１５は、Ｋ個の周波数別抑圧係数補正部１５０１₀ 〜１
５０１_K-1 、分離部１５０２，１５０３及び多重化部１
５０４を有する。Next, the suppression coefficient correction unit 15 in FIG. 20 is described. FIG. 28 is a block diagram showing the configuration of the suppression coefficient correction unit 15 in FIG. 20. To prevent the sound quality degradation caused by residual noise, which results from insufficient suppression when the SNR is low, and by speech distortion, which results from excessive suppression when the SNR is high, the suppression coefficient correction unit 15 corrects the suppression coefficient according to the SNR. For example, when the SNR is low, a correction value is applied to the suppression coefficient to suppress residual noise, and when the SNR is high, a lower limit is set on the suppression coefficient to prevent speech distortion. The suppression coefficient correction unit 15 has K frequency-specific suppression coefficient correction units 1501_0 to 1501_{K-1}, separation units 1502 and 1503, and a multiplexing unit 1504.

【０１４１】分離部１５０２は、図２０における推定先
天的ＳＮＲ計算部７から供給される推定先天的ＳＮＲを
周波数別成分に分離し、それぞれ周波数別抑圧係数補正
部１５０１₀ 〜１５０１_K-1 に出力する。分離部１５０
３は、図２０における抑圧係数生成部８から供給される
抑圧係数を周波数別成分に分離し、それぞれ周波数別抑
圧係数補正部１５０１₀ 〜１５０１_K-1 に出力する。周
波数別抑圧係数補正部１５０１₀ 〜１５０１_K-1 は、分
離部１５０２から供給される周波数別推定先天的ＳＮＲ
と、分離部１５０３から供給される周波数別抑圧係数か
ら、周波数別補正抑圧係数を計算し、多重化部１５０４
へ出力する。多重化部１５０４は、周波数別抑圧係数補
正部１５０１₀ 〜１５０１_K-1 から供給される周波数別
補正抑圧係数を多重化し、補正抑圧係数として図２０に
おける多重乗算部１６と推定先天的ＳＮＲ計算部７へ出
力する。The separation unit 1502 separates the estimated a priori SNR supplied from the estimated a priori SNR calculation unit 7 in FIG. 20 into frequency-specific components and outputs them to the frequency-specific suppression coefficient correction units 1501_0 to 1501_{K-1}. The separation unit 1503 separates the suppression coefficient supplied from the suppression coefficient generation unit 8 in FIG. 20 into frequency-specific components and outputs them to the frequency-specific suppression coefficient correction units 1501_0 to 1501_{K-1}. The frequency-specific suppression coefficient correction units 1501_0 to 1501_{K-1} calculate frequency-specific corrected suppression coefficients from the frequency-specific estimated a priori SNR supplied from the separation unit 1502 and the frequency-specific suppression coefficients supplied from the separation unit 1503, and output them to the multiplexing unit 1504. The multiplexing unit 1504 multiplexes the frequency-specific corrected suppression coefficients supplied from the frequency-specific suppression coefficient correction units 1501_0 to 1501_{K-1} and outputs the result as the corrected suppression coefficient to the multiplex multiplication unit 16 and the estimated a priori SNR calculation unit 7 in FIG. 20.

【０１４２】図２９は、図２８に示した抑圧係数補正部
１５に含まれる周波数別抑圧係数補正部１５０１₀ 〜１
５０１_K-1 の構成を示すブロック図である。周波数別抑
圧係数補正部１５０１は、最大値選択部１５９１、抑圧
係数下限値記憶部１５９２、閾値記憶部１５９３、比較
部１５９４、スイッチ１５９５、修正値記憶部１５９６
及び乗算器１５９７を有する。比較部１５９４は、閾値
記憶部１５９３から供給される閾値と、図２８における
分離部１５０２から供給される周波数別推定先天的ＳＮ
Ｒを比較し、周波数別推定先天的ＳＮＲが閾値よりも大
きければ“０”を、小さければ“１”をスイッチ１５９
５に供給する。FIG. 29 is a block diagram showing the configuration of the frequency-specific suppression coefficient correction units 1501_0 to 1501_{K-1} included in the suppression coefficient correction unit 15 shown in FIG. 28. The frequency-specific suppression coefficient correction unit 1501 has a maximum value selection unit 1591, a suppression coefficient lower limit storage unit 1592, a threshold storage unit 1593, a comparison unit 1594, a switch 1595, a correction value storage unit 1596, and a multiplier 1597. The comparison unit 1594 compares the threshold supplied from the threshold storage unit 1593 with the frequency-specific estimated a priori SNR supplied from the separation unit 1502 in FIG. 28, and supplies “0” to the switch 1595 if the frequency-specific estimated a priori SNR is larger than the threshold and “1” if it is smaller.

【０１４３】スイッチ１５９５は、図２８における分離
部１５０３から供給される周波数別抑圧係数を、比較部
１５９４の出力値が“１”のとき乗算器１５９７に出力
し、比較部１５９４の出力値が“０”のとき、最大値選
択部１５９１に直接供給する。乗算器１５７９は、スイ
ッチ１５９５の出力値と修正値記憶部１５９６の出力値
との積を計算し、計算結果を最大値選択部１５９１に供
給する。抑圧係数値を小さくするため、修正値は１より
小さい値が普通であるが、目的によってはこの限りでは
ない。このように、周波数別推定先天的ＳＮＲが閾値よ
りも小さいときに、抑圧係数の補正を行なう。ＳＮＲが
小さい場合に抑圧係数の補正を行なうことで、音声成分
を過剰に抑圧することなく、残留雑音量を減らすことが
できる。The switch 1595 outputs the frequency-specific suppression coefficient supplied from the separation unit 1503 in FIG. 28 to the multiplier 1597 when the output of the comparison unit 1594 is “1”, and supplies it directly to the maximum value selection unit 1591 when the output of the comparison unit 1594 is “0”. The multiplier 1597 calculates the product of the output of the switch 1595 and the output of the correction value storage unit 1596 and supplies the result to the maximum value selection unit 1591. Because the purpose is to reduce the suppression coefficient, the correction value is normally smaller than 1, although other values may be used depending on the purpose. In this way, the suppression coefficient is corrected when the frequency-specific estimated a priori SNR is smaller than the threshold. Correcting the suppression coefficient when the SNR is small reduces the amount of residual noise without excessively suppressing the speech component.

【０１４４】抑圧係数下限値記憶部１５９２は、記憶し
ている抑圧係数の下限値を、最大値選択部１５９１に供
給する。最大値選択部１５９１は、スイッチ１５９５又
は乗算器１５９７から供給される信号と、抑圧係数下限
値記憶部１５９２から供給される抑圧係数下限値を比較
し、大きい方の値を周波数別補正抑圧係数として、図２
８における多重化部１５０４に出力する。これにより、
抑圧係数は抑圧係数下限値記憶部１５９２が記憶する下
限値よりも必ず大きい値になる。従って、過度の抑圧に
より発生する音声の歪みを防ぐことができる。なお、図
１、図５、図１０、図１２、図１４、図１７に示したノ
イズ除去装置では、抑圧係数が多重乗算部１６と推定先
天的ＳＮＲ計算部７へ供給されていたが、図２０に示し
たノイズ除去装置では、抑圧係数に代わって補正抑圧係
数が供給されている。The suppression coefficient lower limit storage unit 1592 supplies the stored lower limit of the suppression coefficient to the maximum value selection unit 1591. The maximum value selection unit 1591 compares the signal supplied from the switch 1595 or the multiplier 1597 with the suppression coefficient lower limit supplied from the suppression coefficient lower limit storage unit 1592, and outputs the larger value to the multiplexing unit 1504 in FIG. 28 as the frequency-specific corrected suppression coefficient. As a result, the suppression coefficient is never smaller than the lower limit stored in the suppression coefficient lower limit storage unit 1592, which prevents the speech distortion caused by excessive suppression. In the noise removal devices shown in FIGS. 1, 5, 10, 12, 14, and 17, the suppression coefficient is supplied to the multiplex multiplication unit 16 and the estimated a priori SNR calculation unit 7; in the noise removal device shown in FIG. 20, the corrected suppression coefficient is supplied instead.
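The per-frequency correction described for FIG. 29 amounts to a conditional scaling followed by a floor. A minimal sketch, assuming illustrative values for the threshold, correction value, and lower limit (the patent does not give numbers here, and the function and parameter names are hypothetical):

```python
def correct_gain(gain, prior_snr, snr_threshold=1.0, correction=0.5, gain_floor=0.1):
    """Correct one frequency-specific suppression coefficient.

    snr_threshold, correction and gain_floor are placeholder values for this sketch.
    """
    # Comparison unit 1594 outputs "1" when the estimated a priori SNR is
    # below the threshold; the switch 1595 then routes the gain to the multiplier 1597.
    if prior_snr < snr_threshold:
        gain = gain * correction
    # Maximum value selection unit 1591: never go below the stored lower limit.
    return max(gain, gain_floor)
```

For example, `correct_gain(0.8, 5.0)` leaves a high-SNR coefficient unchanged, while a low-SNR coefficient is scaled down and then clamped at `gain_floor`.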

【０１４５】次に、図２０における雑音抑圧係数生成部
８について説明する。図６０を用いて説明したように、
抑圧係数は、供給された推定先天的ＳＮＲと後天的ＳＮ
Ｒから検索で求めることができるが、演算で求めること
もできる。以下、文献１に記載されている計算式をもと
に、抑圧係数の計算方法と共に、雑音抑圧係数生成部８
の他の構成例について説明する。図３０は、図２０にお
ける雑音抑圧係数生成部８の他の構成例を示すブロック
図である。雑音抑圧係数生成部８１は、ＭＭＳＥＳＴ
ＳＡゲイン関数値計算部８１１、一般化尤度比計算部８
１２、音声存在確率記憶部８１３、及び抑圧係数計算部
８１４を有する。Next, the noise suppression coefficient generation unit 8 in FIG. 20 is described. As explained with reference to FIG. 60, the suppression coefficient can be obtained by a lookup based on the supplied estimated a priori SNR and a posteriori SNR, but it can also be obtained by calculation. The calculation of the suppression coefficient, together with another configuration example of the noise suppression coefficient generation unit 8, is described below on the basis of the formulas given in Document 1. FIG. 30 is a block diagram showing another configuration example of the noise suppression coefficient generation unit 8 in FIG. 20. The noise suppression coefficient generation unit 81 has an MMSE STSA gain function value calculation unit 811, a generalized likelihood ratio calculation unit 812, a speech presence probability storage unit 813, and a suppression coefficient calculation unit 814.

【０１４６】フレーム番号をｎ、周波数番号をｋとし、
γ_n(ｋ）を図２０における周波数別ＳＮＲ計算部６から
供給される周波数別後天的ＳＮＲ、ξ_n(ｋ）ハットを図
２０における推定先天的ＳＮＲ計算部７から供給される
周波数別推定先天的ＳＮＲとする。また、η_n(ｋ）＝ξ
_n(ｋ）ハット／ｑ、ｖ_n(ｋ）＝（η_n(ｋ）γ_n(ｋ））／
（１＋η_n(ｋ））とする。ＭＭＳＥＳＴＳＡゲイン関
数値計算部８１１は、図２０における周波数別ＳＮＲ計
算部６から供給される後天的ＳＮＲγ_n(ｋ）、図２０に
おける推定先天的ＳＮＲ計算部７から供給される推定先
天的ＳＮＲξ_n(ｋ）ハット及び音声存在確率記憶部８１
３から供給される音声存在確率ｑをもとに、各周波数毎
にＭＭＳＥＳＴＳＡゲイン関数値を計算し、抑圧係数計
算部８１４に出力する。各周波数毎のＭＭＳＥＳＴＳ
Ａゲイン関数値Ｇ_n(ｋ）は、式（２０）で与えられる。Let n be the frame number and k the frequency number. Let γ_n(k) be the frequency-specific a posteriori SNR supplied from the frequency-specific SNR calculation unit 6 in FIG. 20, and let ξ_n(k) hat be the frequency-specific estimated a priori SNR supplied from the estimated a priori SNR calculation unit 7 in FIG. 20. Further, let η_n(k) = ξ_n(k) hat / q and v_n(k) = (η_n(k) γ_n(k)) / (1 + η_n(k)). The MMSE STSA gain function value calculation unit 811 calculates the MMSE STSA gain function value for each frequency from the a posteriori SNR γ_n(k) supplied from the frequency-specific SNR calculation unit 6 in FIG. 20, the estimated a priori SNR ξ_n(k) hat supplied from the estimated a priori SNR calculation unit 7 in FIG. 20, and the speech presence probability q supplied from the speech presence probability storage unit 813, and outputs it to the suppression coefficient calculation unit 814. The MMSE STSA gain function value G_n(k) for each frequency is given by Equation (20).

【０１４７】[0147]

【数２０】 [Equation 20]
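The image for Equation (20) is missing from this text. Assuming it follows the standard MMSE short-time spectral amplitude gain of the Ephraim-Malah estimator, which appears to be the 1984 IEEE Transactions paper cited in the background section, and using the symbols v_n(k) and γ_n(k) defined above, it would read as follows. This is a reconstruction under that assumption, not a transcription of the patent's equation.

```latex
G_n(k) = \frac{\sqrt{\pi}}{2}\,
         \frac{\sqrt{v_n(k)}}{\gamma_n(k)}\,
         \exp\!\left(-\frac{v_n(k)}{2}\right)
         \left[\bigl(1 + v_n(k)\bigr)\, I_0\!\left(\frac{v_n(k)}{2}\right)
               + v_n(k)\, I_1\!\left(\frac{v_n(k)}{2}\right)\right]
\tag{20}
```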

【０１４８】ここに、Ｉ₀(ｚ）は０次変形ベッセル関
数、Ｉ₁(ｚ）は１次変形ベッセル関数である。変形ベッ
セル関数については、「１９８５年、数学辞典、岩波書
店、３７４．Ｇページ」（文献６）に記載されている。
一般化尤度比計算部８１２は、図２０における周波数別
ＳＮＲ計算部６から供給される後天的ＳＮＲγ_n(ｋ）、
図２０における推定先天的ＳＮＲ計算部７から供給され
る推定先天的ＳＮＲξ_n(ｋ）ハット及び音声存在確率記
憶部８１３から供給される音声存在確率ｑをもとに、周
波数毎に一般化尤度比を計算し、抑圧係数計算部８１４
に出力する。周波数毎の一般化尤度比Λ_n(ｋ）は、式
（２１）で与えられる。Here, I_0(z) is the zeroth-order modified Bessel function and I_1(z) is the first-order modified Bessel function. Modified Bessel functions are described in “Mathematics Dictionary, Iwanami Shoten, 1985, p. 374.G” (Document 6). The generalized likelihood ratio calculation unit 812 calculates the generalized likelihood ratio for each frequency from the a posteriori SNR γ_n(k) supplied from the frequency-specific SNR calculation unit 6 in FIG. 20, the estimated a priori SNR ξ_n(k) hat supplied from the estimated a priori SNR calculation unit 7 in FIG. 20, and the speech presence probability q supplied from the speech presence probability storage unit 813, and outputs it to the suppression coefficient calculation unit 814. The generalized likelihood ratio Λ_n(k) for each frequency is given by Equation (21).

【０１４９】[0149]

【数２１】 [Equation 21]
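The image for Equation (21) is also missing. In the Ephraim-Malah formulation the generalized likelihood ratio is a prior-odds factor times exp(v)/(1 + η). Taking q to be the speech presence probability, as the text states, and keeping η_n(k) = ξ_n(k) hat / q, the prior odds become q / (1 - q). Both the overall form and this odds factor are assumptions; the original paper expresses the factor through the probability of speech absence instead.

```latex
\Lambda_n(k) = \frac{q}{1 - q}\,
               \frac{\exp\bigl(v_n(k)\bigr)}{1 + \eta_n(k)}
\tag{21}
```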

【０１５０】抑圧係数計算部８１４は、ＭＭＳＥＳＴ
ＳＡゲイン関数値計算部８１１から供給されるＭＭＳＥ
ＳＴＳＡゲイン関数値Ｇ_n(ｋ）と一般化尤度比計算部
８１２から供給される一般化尤度比Λ_n(ｋ）から周波数
毎に抑圧係数を計算し、図２０における抑圧係数補正部
１５へ出力する。周波数毎の抑圧係数Ｇ_n(ｋ）バーは、
式（２２）で与えられる。The suppression coefficient calculation unit 814 calculates the suppression coefficient for each frequency from the MMSE STSA gain function value G_n(k) supplied from the MMSE STSA gain function value calculation unit 811 and the generalized likelihood ratio Λ_n(k) supplied from the generalized likelihood ratio calculation unit 812, and outputs it to the suppression coefficient correction unit 15 in FIG. 20. The suppression coefficient for each frequency, G_n(k) bar, is given by Equation (22).

【０１５１】[0151]

【数２２】 [Equation 22]
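Equations (20) to (22) are not reproduced in this text. The sketch below shows how the three units 811, 812, and 814 could be chained for one frequency bin, assuming the standard Ephraim-Malah forms: the MMSE STSA gain for G, a likelihood ratio Λ = q/(1 - q) · exp(v)/(1 + η), and a final coefficient G bar = Λ/(1 + Λ) · G. All three forms, the function names, and the truncated power series used for the modified Bessel functions are assumptions made for illustration.

```python
import math


def bessel_i(order, z, terms=60):
    """Modified Bessel function of the first kind (integer order) by power series.

    Adequate for the moderate arguments used in this sketch.
    """
    return sum(
        (z / 2.0) ** (2 * m + order) / (math.factorial(m) * math.factorial(m + order))
        for m in range(terms)
    )


def suppression_gain(post_snr, prior_snr, q):
    """Suppression coefficient for one bin; requires post_snr > 0 and 0 < q < 1."""
    eta = prior_snr / q                      # eta_n(k) as defined in the text
    v = eta * post_snr / (1.0 + eta)         # v_n(k) as defined in the text
    # Unit 811: MMSE STSA gain (assumed form of Equation (20)).
    gain = (
        (math.sqrt(math.pi) / 2.0)
        * (math.sqrt(v) / post_snr)
        * math.exp(-v / 2.0)
        * ((1.0 + v) * bessel_i(0, v / 2.0) + v * bessel_i(1, v / 2.0))
    )
    # Unit 812: generalized likelihood ratio (assumed form of Equation (21)).
    likelihood_ratio = (q / (1.0 - q)) * math.exp(v) / (1.0 + eta)
    # Unit 814: weight the gain by the implied speech presence posterior
    # (assumed form of Equation (22)).
    return likelihood_ratio / (1.0 + likelihood_ratio) * gain
```

In a real implementation a library Bessel routine and a guard against overflow in `exp(v)` for very high SNR would be preferable to the plain series used here.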

【０１５２】周波数別にＳＮＲを計算する代わりに、複
数の周波数から構成される帯域に共通なＳＮＲを求め
て、これを用いることも可能である。よって次に、図２
０における周波数別ＳＮＲ計算部６の他の構成例とし
て、帯域毎にＳＮＲを計算する例について説明する。図
３１は、周波数別ＳＮＲ計算部６の他の構成例を示すブ
ロック図である。図５６に示した周波数別ＳＮＲ計算部
６との相違点は、帯域別ＳＮＲ計算部６１が帯域別パワ
ー計算部６１１，６１２を有することである。帯域別パ
ワー計算部６１１は、分離部６０２から供給される周波
数別劣化音声パワースペクトルをもとに帯域別のパワー
を計算し、除算部６０１₀ 〜６０１_K-1 へ出力する。ま
た、帯域別パワー計算部６１２は、分離部６０３から供
給される周波数別推定雑音パワースペクトルをもとに帯
域別のパワーを計算し、除算部６０１₀ 〜６０１_K-1 へ
出力する。Instead of calculating the SNR for each frequency, an SNR common to a band composed of a plurality of frequencies can be obtained and used. As another configuration example of the frequency-specific SNR calculation unit 6 in FIG. 20, an example in which the SNR is calculated for each band is described next. FIG. 31 is a block diagram showing another configuration example of the frequency-specific SNR calculation unit 6. It differs from the frequency-specific SNR calculation unit 6 shown in FIG. 56 in that the band-specific SNR calculation unit 61 has band-specific power calculation units 611 and 612. The band-specific power calculation unit 611 calculates the power of each band from the frequency-specific deteriorated speech power spectrum supplied from the separation unit 602 and outputs it to the division units 601_0 to 601_{K-1}. Likewise, the band-specific power calculation unit 612 calculates the power of each band from the frequency-specific estimated noise power spectrum supplied from the separation unit 603 and outputs it to the division units 601_0 to 601_{K-1}.

【０１５３】図３２は、帯域別ＳＮＲ計算部６１に含ま
れる帯域別パワー計算部６１１の構成を示すブロック図
である。ここでは、帯域幅ＬをもつＭ個の帯域に等分割
する例を説明する。ここに、ＬとＭは、Ｋ＝ＬＭの関係
を満たす自然数であるとする。帯域別ＳＮＲ計算部６１
は、Ｍ個の加算器６１１０₀〜６１１０_M-1を有する。図
３１における分離部６０２から供給される周波数別劣化
音声パワースペクトル９１０₀ 〜９１０_K-1 （９１０₀
〜９１０_ML-1）は、各周波数に対応した加算器６１１０
₀ 〜６１１０_M-1 へそれぞれ伝達される。例えば、帯域
番号０に対応する周波数番号は０からＬ−１なので、周
波数別劣化音声パワースペクトル９１０ ₀〜９１０_L-1
は加算器６１１０₀へ伝達される。また、帯域番号１に
対応する周波数番号はＬから２Ｌ−１なので、周波数別
劣化音声パワースペクトル９１０ _L〜９１０_2L-1は加算
器６１１０₁へ伝達される。FIG. 32 is a block diagram showing the configuration of the band-specific power calculation unit 611 included in the band-specific SNR calculation unit 61. An example of equal division into M bands, each with bandwidth L, is described here, where L and M are natural numbers satisfying K = LM. The band-specific power calculation unit 611 has M adders 6110_0 to 6110_{M-1}. The frequency-specific deteriorated speech power spectra 910_0 to 910_{K-1} (910_0 to 910_{ML-1}) supplied from the separation unit 602 in FIG. 31 are each delivered to the adder among 6110_0 to 6110_{M-1} that corresponds to their frequency. For example, the frequency numbers corresponding to band number 0 are 0 to L-1, so the frequency-specific deteriorated speech power spectra 910_0 to 910_{L-1} are delivered to the adder 6110_0. Similarly, the frequency numbers corresponding to band number 1 are L to 2L-1, so the frequency-specific deteriorated speech power spectra 910_L to 910_{2L-1} are delivered to the adder 6110_1.

【０１５４】加算器６１１０₀ 〜６１１０_M-1 は、供給
された周波数別劣化音声パワースペクトルの総和をそれ
ぞれ計算し、帯域別劣化音声パワースペクトル９１１₀
〜９１１_ML-1（９１１₀ 〜９１１_K-1 ）を図３１におけ
る除算部６０１₀ 〜６０１_K- ₁ へ出力する。各加算器の
計算結果は、それぞれの帯域番号に応じた周波数毎に帯
域別劣化音声パワースペクトルとして出力される。例え
ば、加算器６１１０₀の計算結果は、帯域別劣化音声パ
ワースペクトル９１１₀ 〜９１１_L-1 として出力され
る。また、加算器６１１０₁ の計算結果は、帯域別劣化
音声パワースペクトル９１１_L 〜９１１_2L-1として出力
される。帯域別パワー計算部６１２の構成と動作は帯域
別パワー計算部６１１と等価であるので、その説明は省
略する。The adders 6110_0 to 6110_{M-1} each calculate the sum of the frequency-specific deteriorated speech power spectra supplied to them and output the band-specific deteriorated speech power spectra 911_0 to 911_{ML-1} (911_0 to 911_{K-1}) to the division units 601_0 to 601_{K-1} in FIG. 31. The result of each adder is output as the band-specific deteriorated speech power spectrum for every frequency belonging to its band number. For example, the result of the adder 6110_0 is output as the band-specific deteriorated speech power spectra 911_0 to 911_{L-1}, and the result of the adder 6110_1 is output as the band-specific deteriorated speech power spectra 911_L to 911_{2L-1}. The configuration and operation of the band-specific power calculation unit 612 are equivalent to those of the band-specific power calculation unit 611, so their description is omitted.
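The equal-width band summation of FIG. 32 can be sketched as below: each group of L consecutive bins is summed by one adder, and the sum is written back to all L bins of that band so that the K division units still receive K values. The function name and the list-based representation are assumptions made for illustration.

```python
def band_powers(power_spectrum, band_width):
    """Replace each bin by the total power of its band (K = L * M bins, M bands)."""
    k = len(power_spectrum)
    if band_width <= 0 or k % band_width != 0:
        raise ValueError("the number of bins K must equal L * M")
    result = []
    for start in range(0, k, band_width):
        # One adder 6110_m: sum of the L bins that belong to band m.
        total = sum(power_spectrum[start:start + band_width])
        # The same band power is output for every frequency in the band.
        result.extend([total] * band_width)
    return result
```

Applying the same function to the estimated noise power spectrum and dividing the two outputs bin by bin gives an SNR that is constant within each band.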

【０１５５】なお、ここでは複数の帯域に等分割する例
を示したが、「１９８０年、聴覚と音声、電子情報通信
学会、１１５〜１１８ページ」（文献７）に記載されて
いる臨界帯域に分割する方法、「１９８３年、マルチレ
ート・ディジタル・シグナル・プロセシング（Multirat
e Digital Signal Processing），１９８３，Prentice-
Hall Inc.，USA」（文献８）に記載されているオクター
ブ帯域に分割する方法など、他の帯域分割方法を用いる
ことも可能である。Although an example of equal division into a plurality of bands is shown here, other band division methods can also be used, such as division into the critical bands described in “Hearing and Speech, The Institute of Electronics, Information and Communication Engineers, 1980, pp. 115-118” (Document 7) or division into the octave bands described in “Multirate Digital Signal Processing, Prentice-Hall Inc., USA, 1983” (Document 8).

【０１５６】（第８の実施の形態）図３３は、本発明の
ノイズ除去装置の第８の実施の形態の全体構成を示すブ
ロック図である。図２０に示したノイズ除去装置との相
違点は、注入雑音計算部５８、加算器５６，５７が、Ｓ
ＮＲ補正部６７に置換されていることである。図２０と
図３３の関係は、図１と図５の関係及び図１０と図１４
の関係に等しく、ＳＮＲ補正部６７については図１５及
び１４を参照して説明したので、図３３に示したノイズ
除去装置に関する詳細な説明は省略する。(Eighth Embodiment) FIG. 33 is a block diagram showing the overall configuration of an eighth embodiment of the noise removal device of the present invention. It differs from the noise removal device shown in FIG. 20 in that the injection noise calculation unit 58 and the adders 56 and 57 are replaced with the SNR correction unit 67. The relationship between FIG. 20 and FIG. 33 is the same as the relationship between FIG. 1 and FIG. 5 and the relationship between FIG. 10 and FIG. 14, and the SNR correction unit 67 has already been described with reference to FIGS. 15 and 14, so a detailed description of the noise removal device shown in FIG. 33 is omitted.

【０１５７】（第９の実施の形態）図３４は、本発明の
ノイズ除去装置の第９の実施の形態の全体構成を示すブ
ロック図である。図２０に示したノイズ除去装置との相
違点は、推定雑音計算部５が推定雑音計算部５２に置換
されていること、及び重みつき劣化音声計算部１４が存
在しないことである。以下、これらの相違点を中心に詳
細に説明する。(Ninth Embodiment) FIG. 34 is a block diagram showing the entire structure of a ninth embodiment of the noise removing apparatus of the present invention. The difference from the noise removal apparatus shown in FIG. 20 is that the estimated noise calculation unit 5 is replaced by the estimated noise calculation unit 52 and that the weighted deteriorated speech calculation unit 14 does not exist. Hereinafter, these differences will be mainly described in detail.

【０１５８】図３５は、図３４における推定雑音計算部
５２の構成を示すブロック図である。図２４に示した推
定雑音計算部５との相違点は、周波数別推定雑音計算部
５０４₀ 〜５０４_K-1 が周波数別推定雑音計算部５０６
₀ 〜５０６_K-1 に置換されていることと、推定雑音計算
部５２が入力信号に重みつき劣化音声パワースペクトル
を有しないことである。これは、周波数別推定雑音計算
部５０４₀ 〜５０４_K- ₁ が入力信号に周波数別重みつき
劣化音声パワースペクトルを必要とするのに対して、推
定雑音計算部５０６₀ 〜５０６_K-1 は、入力信号に周波
数別重みつき劣化音声パワースペクトルを必要としない
ためである。以下、図３６を参照しながら、相違点であ
る周波数別推定雑音計算部５０６₀ 〜５０６_K-1 の構成
と動作を詳細に説明する。FIG. 35 is a block diagram showing the configuration of the estimated noise calculation unit 52 in FIG. 34. It differs from the estimated noise calculation unit 5 shown in FIG. 24 in that the frequency-specific estimated noise calculation units 504_0 to 504_{K-1} are replaced with the frequency-specific estimated noise calculation units 506_0 to 506_{K-1}, and in that the estimated noise calculation unit 52 does not take the weighted deteriorated speech power spectrum as an input signal. This is because the frequency-specific estimated noise calculation units 504_0 to 504_{K-1} require the frequency-specific weighted deteriorated speech power spectrum as an input signal, whereas the frequency-specific estimated noise calculation units 506_0 to 506_{K-1} do not. The configuration and operation of the frequency-specific estimated noise calculation units 506_0 to 506_{K-1}, which constitute the difference, are described in detail below with reference to FIG. 36.

【０１５９】図３６は、図３５に示した推定雑音計算部
５２に含まれる周波数別推定雑音計算部５０６₀ 〜５０
６_K-1 の構成を示すブロック図である。図２５に示した
周波数別推定雑音計算部５０４との相違点は、周波数別
推定雑音計算部５０６が、入力信号に周波数別重みつき
劣化音声パワースペクトルを有していないことと、除算
部５０４１、非線形処理部５０４２、及び乗算器５０４
３を有していることである。以下、これらの相違点を中
心に詳細に説明する。FIG. 36 is a block diagram showing the configuration of the frequency-specific estimated noise calculation units 506_0 to 506_{K-1} included in the estimated noise calculation unit 52 shown in FIG. 35. It differs from the frequency-specific estimated noise calculation unit 504 shown in FIG. 25 in that the frequency-specific estimated noise calculation unit 506 does not take the frequency-specific weighted deteriorated speech power spectrum as an input signal, and in that it has a division unit 5041, a nonlinear processing unit 5042, and a multiplier 5043. The description below focuses on these differences.

【０１６０】除算部５０４１は、図３５における分離部
５０２から供給される周波数別劣化音声パワースペクト
ルを、推定雑音記憶部５９４２から供給される１フレー
ム前の推定雑音パワースペクトルで除算し、除算結果を
非線形処理部５０４２に出力する。図２２に示した非線
形処理部１４８５と同一の構成と機能を有する非線形処
理部５０４２は、除算部５０４１の出力値に応じた重み
係数を計算し、乗算器５０４３に出力する。乗算器５０
４３は、図３５における分離部５０２から供給される周
波数別劣化音声パワースペクトルと非線形処理部５０４
２から供給される重み係数の積を計算し、スイッチ５０
４４へ出力する。The division unit 5041 divides the frequency-specific deteriorated speech power spectrum supplied from the separation unit 502 in FIG. 35 by the estimated noise power spectrum of one frame earlier supplied from the estimated noise storage unit 5942, and outputs the quotient to the nonlinear processing unit 5042. The nonlinear processing unit 5042, which has the same configuration and function as the nonlinear processing unit 1485 shown in FIG. 22, calculates a weighting coefficient according to the output of the division unit 5041 and outputs it to the multiplier 5043. The multiplier 5043 calculates the product of the frequency-specific deteriorated speech power spectrum supplied from the separation unit 502 in FIG. 35 and the weighting coefficient supplied from the nonlinear processing unit 5042, and outputs it to the switch 5044.

【０１６１】乗算器５０４３の出力信号は、図２５に示
した周波数別推定雑音計算部５０４における周波数別重
みつき劣化音声パワースペクトルと等価である。すなわ
ち、周波数別重みつき劣化音声パワースペクトルは、周
波数別推定雑音計算部５０６の内部において計算するこ
とも可能である。従って、図３４に示したノイズ除去装
置では、重みつき劣化音声計算部１４を省略することが
可能となる。The output signal of the multiplier 5043 is equivalent to the frequency-specific weighted deteriorated speech power spectrum in the frequency-specific estimated noise calculation unit 504 shown in FIG. 25. In other words, the frequency-specific weighted deteriorated speech power spectrum can also be calculated inside the frequency-specific estimated noise calculation unit 506. The weighted deteriorated speech calculation unit 14 can therefore be omitted from the noise removal device shown in FIG. 34.

【０１６２】（第１０の実施の形態）図３７は、本発明
のノイズ除去装置の第１０の実施の形態の全体構成を示
すブロック図である。図３４に示したノイズ除去装置と
の相違点は、注入雑音計算部５８、加算器５６，５７
が、ＳＮＲ補正部６７に置換されていることである。図
３４と図３７の関係は、図１と図５の関係、図１０と図
１４の関係、及び図２０と図３３の関係に等しく、ＳＮ
Ｒ補正部６７については図１５及び１４を参照して説明
したので、図３７に示したノイズ除去装置に関する詳細
な説明は省略する。(Tenth Embodiment) FIG. 37 is a block diagram showing the overall configuration of a tenth embodiment of the noise removal device of the present invention. It differs from the noise removal device shown in FIG. 34 in that the injection noise calculation unit 58 and the adders 56 and 57 are replaced with the SNR correction unit 67. The relationship between FIG. 34 and FIG. 37 is the same as the relationships between FIG. 1 and FIG. 5, between FIG. 10 and FIG. 14, and between FIG. 20 and FIG. 33, and the SNR correction unit 67 has already been described with reference to FIGS. 15 and 14, so a detailed description of the noise removal device shown in FIG. 37 is omitted.

【０１６３】（第１１の実施の形態）図３８は、本発明
のノイズ除去装置の第１１の実施の形態の全体構成を示
すブロック図である。図２０に示したノイズ除去装置と
は、推定先天的ＳＮＲ計算部７１を除いて同一であるの
で、以下、この相違点を中心に詳細に説明する。図３９
は、図３８における推定先天的ＳＮＲ計算部７１の構成
を示すブロック図である。図５７に示した推定先天的Ｓ
ＮＲ計算部７は後天的ＳＮＲ記憶部７０２、抑圧係数記
憶部７０３、多重乗算部７０５，７０４を有するのに対
し、推定先天的ＳＮＲ計算部７１はこれらの代わりに、
推定雑音記憶部７１２、強調音声パワースペクトル記憶
部７１３、周波数別ＳＮＲ計算部７１５、多重乗算部７
１６を有する。また、推定先天的ＳＮＲ計算部７は、入
力信号に抑圧係数を有するが、推定先天的ＳＮＲ計算部
７１は、抑圧係数の代わりに強調音声振幅スペクトルと
推定雑音パワースペクトルを入力信号に有する。以下、
推定先天的ＳＮＲ計算部７と７１との間に存在するこれ
らの相違点を中心に、詳細に説明する。(Eleventh Embodiment) FIG. 38 is a block diagram showing the overall configuration of an eleventh embodiment of the noise removal device of the present invention. It is identical to the noise removal device shown in FIG. 20 except for the estimated a priori SNR calculation unit 71, so the description below focuses on this difference. FIG. 39 is a block diagram showing the configuration of the estimated a priori SNR calculation unit 71 in FIG. 38. Whereas the estimated a priori SNR calculation unit 7 shown in FIG. 57 has an a posteriori SNR storage unit 702, a suppression coefficient storage unit 703, and multiplex multiplication units 705 and 704, the estimated a priori SNR calculation unit 71 instead has an estimated noise storage unit 712, an enhanced speech power spectrum storage unit 713, a frequency-specific SNR calculation unit 715, and a multiplex multiplication unit 716. In addition, the estimated a priori SNR calculation unit 7 takes the suppression coefficient as an input signal, whereas the estimated a priori SNR calculation unit 71 takes the enhanced speech amplitude spectrum and the estimated noise power spectrum as input signals instead. The description below focuses on these differences between the estimated a priori SNR calculation units 7 and 71.

【０１６４】多重乗算部７１６は、図３８における多重
乗算部１６から供給される強調音声振幅スペクトル｜Ｘ
_n(ｋ）｜バー＝Ｇ_n(ｋ）バー・｜Ｙ_n(ｋ）｜を周波数毎
に２乗して強調音声パワースペクトルを求め、強調音声
パワースペクトル記憶部７１３に出力する。多重乗算部
７１６の構成は、既に図５２を用いて説明した多重乗算
部１７に等しいので、詳細な説明は省略する。強調音声
パワースペクトル記憶部７１３は、多重乗算部７１６か
ら供給される強調音声パワースペクトルを記憶し、１フ
レーム前に供給された強調音声パワースペクトルを周波
数別ＳＮＲ計算部７１５へ出力する。推定雑音記憶部７
１２は、図３８における推定雑音計算部５から供給され
る推定雑音パワースペクトルλ_n(ｋ）を記憶し、１フレ
ーム前に供給された推定音声パワースペクトルを周波数
別ＳＮＲ計算部７１５へ出力する。The multiplex multiplication unit 716 squares, for each frequency, the enhanced speech amplitude spectrum |X_n(k)| bar = G_n(k) bar · |Y_n(k)| supplied from the multiplex multiplication unit 16 in FIG. 38 to obtain the enhanced speech power spectrum, and outputs it to the enhanced speech power spectrum storage unit 713. The configuration of the multiplex multiplication unit 716 is the same as that of the multiplex multiplication unit 17 already described with reference to FIG. 52, so a detailed description is omitted. The enhanced speech power spectrum storage unit 713 stores the enhanced speech power spectrum supplied from the multiplex multiplication unit 716 and outputs the enhanced speech power spectrum supplied one frame earlier to the frequency-specific SNR calculation unit 715. The estimated noise storage unit 712 stores the estimated noise power spectrum λ_n(k) supplied from the estimated noise calculation unit 5 in FIG. 38 and outputs the estimated noise power spectrum supplied one frame earlier to the frequency-specific SNR calculation unit 715.

[0165] The frequency-specific SNR calculation unit 715 calculates, for each frequency, the SNR between the enhanced speech power spectrum $\bar{G}_{n-1}^{2}(k)\,|Y_{n-1}(k)|^{2}$ supplied from the enhanced speech power spectrum storage unit 713 and the estimated noise power spectrum $\lambda_{n-1}(k)$ supplied from the estimated noise storage unit 712, and outputs the result to the multiple weighted addition unit 707. The configuration of the frequency-specific SNR calculation unit 715 is the same as that of the frequency-specific SNR calculation unit 6 already described with reference to FIG. 56, so a detailed description is omitted. By the relationship in equation (11), the output of the frequency-specific SNR calculation unit 715, $\bar{G}_{n-1}^{2}(k)\,|Y_{n-1}(k)|^{2} / \lambda_{n-1}(k)$, is equivalent to $\gamma_{n-1}(k)\,\bar{G}_{n-1}^{2}(k)$, the output of the multiple multiplication unit 705 in FIG. 57. The estimated a priori SNR calculation unit 7 included in the noise removal device shown in FIG. 20 can therefore be replaced with the estimated a priori SNR calculation unit 71.
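The equivalence stated in this paragraph can be checked in one line. The following is only a sketch: it assumes that equation (11), which is not reproduced in this part of the description, defines the a posteriori SNR as $\gamma_n(k) = |Y_n(k)|^2 / \lambda_n(k)$. Under that assumption, the output of the frequency-specific SNR calculation unit 715 becomes:

```latex
\frac{\bar{G}_{n-1}^{2}(k)\,|Y_{n-1}(k)|^{2}}{\lambda_{n-1}(k)}
  = \bar{G}_{n-1}^{2}(k)\cdot\frac{|Y_{n-1}(k)|^{2}}{\lambda_{n-1}(k)}
  = \gamma_{n-1}(k)\,\bar{G}_{n-1}^{2}(k)
```

The right-hand side is the quantity attributed to the multiple multiplication unit 705 in FIG. 57, which is why the two calculation units can be interchanged.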

[0166] (Twelfth Embodiment) FIG. 40 is a block diagram showing the overall configuration of a twelfth embodiment of the noise removal device of the present invention. It differs from the noise removal device shown in FIG. 38 in that the injection noise calculation unit 58 and the adders 56 and 57 are replaced with the SNR correction unit 67. The relationship between FIG. 38 and FIG. 40 is the same as that between FIG. 1 and FIG. 5, between FIG. 10 and FIG. 14, between FIG. 20 and FIG. 33, and between FIG. 34 and FIG. 37. Because the SNR correction unit 67 has already been described with reference to FIGS. 15 and 14, a detailed description of the noise removal device shown in FIG. 40 is omitted.

[0167] (Thirteenth Embodiment) FIG. 41 is a block diagram showing the overall configuration of a thirteenth embodiment of the noise removal device of the present invention. It differs from the noise removal device shown in FIG. 20 in that the estimated noise calculation unit 5 is replaced with the estimated noise calculation unit 52, the estimated a priori SNR calculation unit 7 is replaced with the estimated a priori SNR calculation unit 71, and the weighted degraded speech calculation unit 14 is absent. The configuration and operation of the estimated noise calculation unit 52 are as described with reference to FIGS. 35 and 36, and those of the estimated a priori SNR calculation unit 71 are as described with reference to FIG. 39. The noise removal device shown in FIG. 41 therefore provides a function equivalent to that of the noise removal device shown in FIG. 20.

[0168] (Fourteenth Embodiment) FIG. 42 is a block diagram showing the overall configuration of a fourteenth embodiment of the noise removal device of the present invention. It differs from the noise removal device shown in FIG. 41 in that the injection noise calculation unit 58 and the adders 56 and 57 are replaced with the SNR correction unit 67. The relationship between FIG. 41 and FIG. 42 is the same as that between FIG. 1 and FIG. 5, between FIG. 10 and FIG. 14, between FIG. 20 and FIG. 33, between FIG. 34 and FIG. 37, and between FIG. 38 and FIG. 40. Because the SNR correction unit 67 has already been described with reference to FIGS. 15 and 14, a detailed description of the noise removal device shown in FIG. 42 is omitted.

[0169] (Fifteenth Embodiment) FIG. 43 is a block diagram showing the overall configuration of a fifteenth embodiment of the noise removal device of the present invention. It differs from the noise removal device shown in FIG. 20 in that the estimated noise calculation unit 5 is replaced with the estimated noise calculation unit 53 and the voice detection unit 4 is absent. In other words, noise estimation in this configuration does not require a voice detection unit. The description below focuses on these differences. FIG. 44 is a block diagram showing the configuration of the estimated noise calculation unit 53 in FIG. 43. It differs from the estimated noise calculation unit 5 shown in FIG. 24 in that the frequency-specific estimated noise calculation units $504_0$ to $504_{K-1}$ are replaced with the frequency-specific estimated noise calculation units $508_0$ to $508_{K-1}$ and the estimated noise calculation unit 53 does not take a voice detection flag as an input signal. The configuration and operation of the frequency-specific estimated noise calculation units $508_0$ to $508_{K-1}$ are described in detail with reference to FIG. 45.

[0170] FIG. 45 is a block diagram showing the configuration of the frequency-specific estimated noise calculation units $508_0$ to $508_{K-1}$ included in the estimated noise calculation unit 53 shown in FIG. 44. They differ from the frequency-specific estimated noise calculation unit 504 shown in FIG. 25 in that the update determination unit 520 is replaced with the update determination unit 522 and that $508_0$ to $508_{K-1}$ do not take a voice detection flag as an input signal. FIG. 46 is a block diagram showing the configuration of the update determination unit 522 included in the frequency-specific estimated noise calculation unit 508 shown in FIG. 45. It differs from the update determination unit 520 shown in FIG. 26 in that the logical sum calculation unit 5201 is replaced with the logical sum calculation unit 5221, the update determination unit 522 has no logical negation circuit 5202, and it takes no voice detection flag as an input signal. That is, unlike the update determination unit 520 shown in FIG. 26, the update determination unit 522 does not use the voice detection flag to update the estimated noise.

[0171] The logical sum calculation unit 5221 calculates the logical sum (OR) of the output value of the comparison unit 5205 and the output value of the comparison unit 5203, and outputs the result to the switch 5044, the shift register 5045, and the counter 5049 in FIG. 45. That is, the update determination unit 522 always outputs "1" until the count value reaches a preset value; after that, it outputs "1" when the degraded speech power is smaller than the threshold. As described with reference to FIG. 26, the comparison unit 5205 determines whether the degraded speech signal is noise. The comparison unit 5205 can thus be said to perform voice detection for each frequency. Consequently, an update determination unit and an estimated noise calculation unit that do not take a voice detection flag as an input signal can be realized.
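The decision rule of the update determination unit 522 can be sketched for a single frequency as follows. This is a minimal illustration, not the circuit itself: the function and parameter names are hypothetical, the assignment of the counter comparison to the comparison unit 5203 is an assumption, and the way the threshold is obtained is left outside the sketch.

```python
def update_decision(count, count_limit, degraded_power, threshold):
    """Sketch of the update determination for one frequency bin.

    All names are hypothetical. Returns 1 when the estimated noise
    should be updated and 0 otherwise.
    """
    # Counter comparison (assumed to correspond to comparison unit 5203):
    # true until the count value reaches the preset value.
    warming_up = count < count_limit
    # Power comparison (role of comparison unit 5205): true when the
    # degraded speech power is below the threshold, i.e. looks like noise.
    looks_like_noise = degraded_power < threshold
    # Logical sum (OR) of the two comparison results, as in unit 5221.
    return 1 if (warming_up or looks_like_noise) else 0
```

For example, `update_decision(3, 10, 50.0, 1.0)` returns 1 because the count has not yet reached the limit, while `update_decision(10, 10, 50.0, 1.0)` returns 0 because neither condition holds. No voice detection flag appears among the inputs.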

[0172] (Sixteenth Embodiment) FIG. 47 is a block diagram showing the overall configuration of a sixteenth embodiment of the noise removal device of the present invention. It differs from the noise removal device shown in FIG. 43 in that the injection noise calculation unit 58 and the adders 56 and 57 are replaced with the SNR correction unit 67. The relationship between FIG. 43 and FIG. 47 is the same as that between FIG. 1 and FIG. 5, between FIG. 10 and FIG. 14, between FIG. 20 and FIG. 33, between FIG. 34 and FIG. 37, between FIG. 38 and FIG. 40, and between FIG. 41 and FIG. 42. Because the SNR correction unit 67 has already been described with reference to FIGS. 15 and 14, a detailed description of the noise removal device shown in FIG. 47 is omitted.

[0173] For FIGS. 20, 33, 34, 37, 38, 40 to 43, and 47 as well, selective noise injection that uses the degraded speech power spectrum instead of the degraded speech signal is possible, corresponding to the relationship between FIG. 10 and FIG. 12 and between FIG. 14 and FIG. 17. Because the configuration is obvious, the details are omitted.

[0174] All the embodiments described so far assume the minimum mean square error short-time spectral amplitude method as the noise removal scheme, but the invention can also be applied to other methods. Examples include the Wiener filter method disclosed in Proceedings of the IEEE, Vol. 67, No. 12, pp. 1586-1604, Dec. 1979 (Reference 9) and the spectral subtraction method disclosed in IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol. 27, No. 2, pp. 113-120, Apr. 1979 (Reference 10). A description of detailed configuration examples for these methods is omitted.

[0175] For the general operation of the spectral subtraction method disclosed in Reference 10, FIG. 43 and FIG. 47 can serve as a reference, for example. In FIG. 43 and FIG. 47, operation based on spectral subtraction is obtained by replacing the multiple multiplication unit 16 with a multiple subtraction unit, the noise suppression coefficient generation unit 8 with a noise suppression amount calculation unit, and the suppression coefficient correction unit 15 with a suppression amount correction unit. The multiple subtraction unit subtracts the corrected noise suppression amount from the degraded speech amplitude spectrum, and the result is inverse Fourier transformed to obtain the enhanced speech. The example described here calculates the SNR first and then calculates the noise suppression amount from the SNR, but the estimated noise obtained by the estimated noise calculation unit 53 can also be subtracted directly from the degraded speech amplitude spectrum.
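The subtraction step performed by the multiple subtraction unit can be sketched as follows. This is an illustration under stated assumptions rather than the configuration of Reference 10: the function names are hypothetical, the floor that keeps amplitudes non-negative is not specified in the text, and taking the square root of the estimated noise power to obtain an amplitude in the direct variant is an assumption made for unit consistency.

```python
import math


def spectral_subtraction(degraded_amplitude, suppression_amount, floor=0.0):
    """Subtract a per-frequency suppression amount from the degraded
    speech amplitude spectrum.

    The floor is an assumption of this sketch; the description does not
    say how negative results are handled.
    """
    return [
        max(y - s, floor)
        for y, s in zip(degraded_amplitude, suppression_amount)
    ]


def direct_subtraction(degraded_amplitude, estimated_noise_power):
    """Direct variant: use the estimated noise itself as the amount.

    Converting noise power to amplitude with a square root is an
    assumption made here, not something stated in the description.
    """
    noise_amplitude = [math.sqrt(p) for p in estimated_noise_power]
    return spectral_subtraction(degraded_amplitude, noise_amplitude)
```

With amplitudes `[3.0, 1.0]` and suppression amounts `[1.0, 2.0]`, `spectral_subtraction` returns `[2.0, 0.0]`: the second bin would go negative and is held at the floor. The resulting amplitudes would then be combined with the original phase and inverse Fourier transformed, which is not shown here.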

【０１７６】[0176]

[Effects of the Invention] As described above, the present invention generates pseudo noise based on the input signal and uses a suppression coefficient obtained by injecting this pseudo noise. Injecting the pseudo noise when the suppression coefficient is determined corrects, according to the input signal, a suppression coefficient that was derived assuming background noise following a specific statistical model, so that noise that does not follow that model can also be removed effectively. Enhanced speech of sufficiently high quality can therefore be obtained for any background noise.
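How injection changes the SNR that drives the suppression coefficient can be sketched as follows. The sketch follows the abstract and claim 7, where the same injected noise is added to both the degraded speech power spectrum and the estimated noise power spectrum before the SNR is taken. The function name is hypothetical, the SNR is assumed to be the simple ratio of the two spectra, and the injected noise is taken as given rather than computed.

```python
def snr_with_injection(degraded_power, noise_power, injected_power):
    """Per-frequency SNR after adding the same pseudo noise to both
    the degraded speech power and the estimated noise power.

    injected_power is an input here; how it is derived from the input
    signal is not modeled. Assumes noise_power[k] + injected_power[k]
    is positive for every k.
    """
    return [
        (y + z) / (lam + z)
        for y, lam, z in zip(degraded_power, noise_power, injected_power)
    ]
```

With a degraded speech power of 10 and an estimated noise power of 1, the SNR is 10 when nothing is injected and 5.5 when a pseudo noise of power 1 is injected. The suppression coefficient determined from this SNR changes accordingly, which is the correction mechanism this paragraph refers to.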

[0177] In addition, the present invention applies windowing to the time-domain signal obtained by converting the frequency-domain enhanced speech. When two adjacent frames of that time-domain signal are overlap-added, the signal samples being overlap-added may have been suppressed with different suppression coefficient values in each frame. Even so, windowing each frame to reduce the amplitude of the signal samples at the frame boundaries improves the continuity of the signal samples at those boundaries. This prevents noise from being generated and reduces the degradation of sound quality caused by such noise.
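The windowing and overlap-add of adjacent time-domain frames can be sketched as follows. This is a minimal illustration only: the Hann window, the function names, and the hop parameter are assumptions, since this paragraph does not name a window shape or overlap ratio, and any windowing applied before the Fourier transform is not modeled.

```python
import math


def hann_window(length):
    """Periodic Hann window; the window shape is an assumption."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / length)
            for i in range(length)]


def windowed_overlap_add(frames, hop):
    """Window each time-domain frame, then overlap-add with the given hop.

    frames: list of equal-length lists of samples (one per frame).
    hop: number of samples by which successive frames are shifted.
    """
    length = len(frames[0])
    window = hann_window(length)
    output = [0.0] * (hop * (len(frames) - 1) + length)
    for index, frame in enumerate(frames):
        start = index * hop
        for i, sample in enumerate(frame):
            # The window tapers each frame toward zero at its edges, so a
            # gain mismatch between frames is attenuated at the boundary.
            output[start + i] += window[i] * sample
    return output
```

For two constant frames of length 4 with a hop of 2, the overlapped samples sum to approximately 1, so a steady signal passes through the overlap region without a step at the frame boundary.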

[Brief description of drawings]

FIG. 1 is a block diagram showing the overall configuration of a first embodiment of the noise removal device of the present invention.

FIG. 2 is a block diagram showing a first configuration of the injection noise calculation unit included in the noise removal device shown in FIG. 1.

FIG. 3 is a diagram showing an example of the relationship between SNR and injected noise.

FIG. 4 is a diagram showing an example of the characteristic of the suppression coefficient with respect to SNR.

FIG. 5 is a block diagram showing the overall configuration of a second embodiment of the noise removal device of the present invention.

FIG. 6 is a block diagram showing a first configuration of the SNR correction unit included in the noise removal device shown in FIG. 5.

FIG. 7 is a block diagram showing the configuration of the corrected SNR calculation unit included in the SNR correction unit shown in FIG. 6.

FIG. 8 is a block diagram showing a second configuration of the SNR correction unit.

FIG. 9 is a block diagram showing the configuration of the corrected SNR calculation unit included in the SNR correction unit shown in FIG. 8.

FIG. 10 is a block diagram showing the overall configuration of a third embodiment of the noise removal device of the present invention.

FIG. 11 is a block diagram showing a second configuration of the injection noise calculation unit.

FIG. 12 is a block diagram showing the overall configuration of a fourth embodiment of the noise removal device of the present invention.

FIG. 13 is a block diagram showing a third configuration of the injection noise calculation unit.

FIG. 14 is a block diagram showing the overall configuration of a fifth embodiment of the noise removal device of the present invention.

FIG. 15 is a block diagram showing a third configuration of the SNR correction unit.

FIG. 16 is a block diagram showing a fourth configuration of the injection noise calculation unit.

FIG. 17 is a block diagram showing the overall configuration of a sixth embodiment of the noise removal device of the present invention.

FIG. 18 is a block diagram showing a fourth configuration of the SNR correction unit.

FIG. 19 is a block diagram showing a fifth configuration of the injection noise calculation unit.

FIG. 20 is a block diagram showing the overall configuration of a seventh embodiment of the noise removal device of the present invention.

FIG. 21 is a block diagram showing the configuration of the weighted degraded speech calculation unit included in the noise removal device shown in FIG. 20.

FIG. 22 is a block diagram showing the configuration of the multiple nonlinear processing unit included in the weighted degraded speech calculation unit shown in FIG. 21.

FIG. 23 is a diagram showing an example of the nonlinear function in the nonlinear processing unit.

FIG. 24 is a block diagram showing a first configuration of the estimated noise calculation unit included in the noise removal device shown in FIG. 20.

FIG. 25 is a block diagram showing a first configuration of the frequency-specific estimated noise calculation unit included in the estimated noise calculation unit shown in FIG. 24.

FIG. 26 is a block diagram showing the configuration of the update determination unit included in the frequency-specific estimated noise calculation unit shown in FIG. 25.

FIG. 27 is a block diagram showing a second configuration of the frequency-specific estimated noise calculation unit.

FIG. 28 is a block diagram showing the configuration of the suppression coefficient correction unit included in the noise removal device shown in FIG. 20.

FIG. 29 is a block diagram showing the configuration of the frequency-specific suppression coefficient correction unit included in the suppression coefficient correction unit shown in FIG. 28.

FIG. 30 is a block diagram showing a second configuration of the noise suppression coefficient generation unit.

FIG. 31 is a block diagram showing a second configuration of the frequency-specific SNR calculation unit.

FIG. 32 is a block diagram showing the configuration of the band-specific power calculation unit included in the frequency-specific SNR calculation unit shown in FIG. 31.

FIG. 33 is a block diagram showing the overall configuration of an eighth embodiment of the noise removal device of the present invention.

FIG. 34 is a block diagram showing the overall configuration of a ninth embodiment of the noise removal device of the present invention.

FIG. 35 is a block diagram showing a second configuration of the estimated noise calculation unit.

FIG. 36 is a block diagram showing the configuration of the frequency-specific estimated noise calculation unit included in the estimated noise calculation unit shown in FIG. 35.

FIG. 37 is a block diagram showing the overall configuration of a tenth embodiment of the noise removal device of the present invention.

FIG. 38 is a block diagram showing the overall configuration of an eleventh embodiment of the noise removal device of the present invention.

FIG. 39 is a block diagram showing the configuration of the estimated a priori SNR calculation unit included in the noise removal device shown in FIG. 38.

FIG. 40 is a block diagram showing the overall configuration of a twelfth embodiment of the noise removal device of the present invention.

FIG. 41 is a block diagram showing the overall configuration of a thirteenth embodiment of the noise removal device of the present invention.

FIG. 42 is a block diagram showing the overall configuration of a fourteenth embodiment of the noise removal device of the present invention.

FIG. 43 is a block diagram showing the overall configuration of a fifteenth embodiment of the noise removal device of the present invention.

FIG. 44 is a block diagram showing a third configuration of the estimated noise calculation unit.

FIG. 45 is a block diagram showing the configuration of the frequency-specific estimated noise calculation unit included in the estimated noise calculation unit shown in FIG. 44.

FIG. 46 is a block diagram showing the configuration of the update determination unit included in the frequency-specific estimated noise calculation unit shown in FIG. 45.

FIG. 47 is a block diagram showing the overall configuration of a sixteenth embodiment of the noise removal device of the present invention.

FIG. 48 is a block diagram showing the overall configuration of a conventional noise removal device.

FIG. 49 is a block diagram showing the configuration of the voice detection unit included in the conventional noise removal device.

FIG. 50 is a block diagram showing the configuration of the power calculation unit included in the voice detection unit shown in FIG. 49.

FIG. 51 is a block diagram showing the configuration of the weighted addition unit included in the voice detection unit shown in FIG. 49.

FIG. 52 is a block diagram showing the configuration of the multiple multiplication unit included in the conventional noise removal device.

FIG. 53 is a block diagram showing the configuration of the estimated noise calculation unit included in the conventional noise removal device.

FIG. 54 is a block diagram showing the configuration of the frequency-specific estimated noise calculation unit included in the estimated noise calculation unit shown in FIG. 53.

FIG. 55 is a block diagram showing the configuration of the update determination unit included in the frequency-specific estimated noise calculation unit shown in FIG. 54.

FIG. 56 is a block diagram showing the configuration of the frequency-specific SNR calculation unit included in the conventional noise removal device.

FIG. 57 is a block diagram showing the configuration of the estimated a priori SNR calculation unit included in the conventional noise removal device.

FIG. 58 is a block diagram showing the configuration of the multiple range limitation processing unit included in the estimated a priori SNR calculation unit shown in FIG. 57.

FIG. 59 is a block diagram showing the configuration of the multiple weighted addition unit included in the estimated a priori SNR calculation unit shown in FIG. 57.

FIG. 60 is a block diagram showing the configuration of the noise suppression coefficient generation unit included in the conventional noise removal device.

FIG. 61 is a block diagram showing the configuration of the suppression coefficient search unit included in the noise suppression coefficient generation unit shown in FIG. 60.

[Explanation of symbols]

1 ... frame division unit
2, 22 ... windowing processing unit
3 ... Fourier transform unit
4 ... voice detection unit
5, 51, 52, 53 ... estimated noise calculation unit
6, 61, 715, 1402 ... frequency-specific SNR calculation unit
7, 71 ... estimated a priori SNR calculation unit
8, 81 ... noise suppression coefficient generation unit
9 ... inverse Fourier transform unit
10 ... frame synthesis unit
11 ... input terminal
12 ... output terminal
13, 5049 ... counter
14 ... weighted degraded speech calculation unit
15 ... suppression coefficient correction unit
16, 17, 704, 705, 716, 1404 ... multiple multiplication unit
55, 58, 59, 662, 672, 682, 6542 ... injection noise calculation unit
56, 57, 708, 4063, 4072, 4074, 5046, $6110_0$ to $6110_{M-1}$, 6543, 6544 ... adder
65, 66, 67, 68 ... SNR correction unit
401, 1593, 5204, 5206 ... threshold storage unit
402, 1594, 5203, 5205, 67233 ... comparison unit
404, 4075 ... constant multiplier
405 ... logarithm calculation unit
406 ... power calculation unit
407, 5071, $7071_0$ to $7071_{K-1}$ ... weighted addition unit
408, 706, 5072 ... weight storage unit
409, 5202 ... logical negation circuit
502, 505, 602, 603, 802, 803, 1495, 1502, 1503, 1702, 1703, 4061, 503, 604, 655, 804, 1475, 1504, 1704, 6115, 7014, 7075 ... multiplexing unit
$504_0$ to $504_{K-1}$, $506_0$ to $506_{K-1}$, 507, $508_0$ to $508_{K-1}$, $514_0$ to $514_{K-1}$ ... frequency-specific estimated noise calculation unit
520, 521, 522 ... update determination unit
551 ... SNR calculation unit
552, 6541 ... threshold calculation unit
553, 6721 ... injection level calculation unit
581, 67232 ... zero-crossing calculation unit
582, 1595, 5044, 6722 ... switch
591, 68232 ... high-frequency power calculation unit
$601_0$ to $601_{K-1}$, 5041, 5048, 6545 ... division unit
611, 612 ... frequency-specific power calculation unit
651, 652, 653, 6111, 7013, 7072, 7074 ... separation unit
$654_0$ to $654_{K-1}$, $664_0$ to $664_{K-1}$ ... corrected SNR calculation unit
661, 663 ... average value calculation unit
701 ... multiple range limitation processing unit
702 ... a posteriori SNR storage unit
703 ... suppression coefficient storage unit
707 ... multiple weighted addition unit
712, 1401, 5942 ... estimated noise storage unit
713 ... enhanced speech power spectrum storage unit
$801_0$ to $801_{K-1}$ ... suppression coefficient search unit
811 ... MMSE STSA gain function value calculation unit
812 ... generalized likelihood ratio calculation unit
813 ... speech presence probability storage unit
814 ... suppression coefficient calculation unit
901 ... degraded speech power
902 ... threshold
903, 923 ... weight
904 ... update threshold
905 ... weighted addition unit control signal
$910_0$ to $910_{K-1}$, $910_0$ to $910_{ML-1}$ ... frequency-specific degraded speech power spectrum
$911_0$ to $911_{K-1}$, $911_0$ to $911_{ML-1}$ ... band-specific degraded speech power spectrum
921 ... instantaneous estimated SNR
$921_0$ to $921_{K-1}$ ... frequency-specific instantaneous estimated SNR
922 ... past estimated SNR
$922_0$ to $922_{K-1}$ ... past frequency-specific estimated SNR
924 ... estimated a priori SNR
$924_0$ to $924_{K-1}$ ... frequency-specific estimated a priori SNR
1405 ... multiple nonlinear processing unit
$1485_0$ to $1485_{K-1}$, 5042 ... nonlinear processing unit
$1501_0$ to $1501_{K-1}$ ... frequency-specific suppression coefficient correction unit
1591, $7012_0$ to $7012_{K-1}$ ... maximum value selection unit
1592 ... suppression coefficient lower limit storage unit
1596 ... correction amount storage unit
1597, $1701_0$ to $1701_{K-1}$, $4062_0$ to $4062_{K-1}$, 4071, 4073, 5043 ... multiplier
5045 ... shift register
5047 ... minimum value selection unit
5201, 5211, 5221 ... logical sum calculation unit
5207 ... threshold calculation unit
5941 ... register length storage unit
6723, 6823 ... determination unit
7011 ... constant storage unit
8011 ... suppression coefficient table
8012, 8013 ... address conversion unit
67231 ... silent section detection unit

Claims

[Claims]

1. A noise removing method in which an input signal is converted into a frequency domain signal, a signal-to-noise ratio is obtained using the frequency domain signal, a suppression coefficient is determined based on the signal-to-noise ratio, the frequency domain signal is weighted using the suppression coefficient, and the weighted frequency domain signal is converted into a time domain signal to obtain an output signal in which noise has been removed from the input signal, characterized in that pseudo first noise is calculated based on the frequency domain signal obtained by converting the input signal, and the signal-to-noise ratio is obtained after the first noise is added to the frequency domain signal.

2. The noise removing method according to claim 1, wherein the addition of the first noise is selectively performed according to the property of the input signal.

3. The noise removing method according to claim 2, wherein the stationarity of the signal is used as the property of the input signal.

4. The noise removing method according to claim 3, wherein a number of zero crossings at which the amplitude of the input signal becomes zero is used as the stationarity of the signal.

5. The noise removing method according to claim 3, wherein the high frequency power of the frequency domain signal obtained by converting the input signal is used as the stationarity of the signal.

6. The noise removing method according to claim 1, wherein second noise included in the frequency domain signal is estimated based on the frequency domain signal obtained by converting the input signal, and the power of the first noise is determined using the second noise and the frequency domain signal.

7. The noise removing method according to claim 1, wherein second noise included in the frequency domain signal is estimated based on the frequency domain signal obtained by converting the input signal, the first noise is calculated using the second noise and the frequency domain signal, and the signal-to-noise ratio is obtained using the sum of the first noise and the frequency domain signal and the sum of the first noise and the second noise.

8. The noise removing method according to claim 6, wherein the frequency domain signal obtained by converting the input signal is weighted, and the second noise is estimated based on the weighted frequency domain signal.

9. A noise removing method in which an input signal is converted into a frequency domain signal, a signal-to-noise ratio is obtained using this frequency domain signal, a suppression coefficient is determined based on this signal-to-noise ratio, the frequency domain signal is weighted using this suppression coefficient, and the weighted frequency domain signal is converted into a time domain signal to obtain an output signal in which noise has been removed from the input signal, characterized in that the signal-to-noise ratio is corrected based on the frequency domain signal obtained by converting the input signal, and the suppression coefficient is determined based on the corrected signal-to-noise ratio.

10. The noise removing method according to claim 9, wherein the correction of the signal-to-noise ratio is selectively performed according to the property of the input signal.

11. The noise removing method according to claim 10, wherein the stationarity of the signal is used as the property of the input signal.

12. The noise removing method according to claim 11, wherein the number of zero crossings at which the amplitude of the input signal becomes zero is used as the stationarity of the signal.

13. The noise removing method according to claim 11, wherein the high frequency power of the frequency domain signal obtained by converting the input signal is used as the stationarity of the signal.

14. The noise removing method according to claim 9, wherein noise included in the frequency domain signal is estimated based on the frequency domain signal obtained by converting the input signal, and a correction amount of the signal-to-noise ratio is determined using the noise and the frequency domain signal.

15. The noise removing method according to claim 9, wherein noise included in the frequency domain signal is estimated based on the frequency domain signal obtained by converting the input signal, an addition signal is obtained using the noise and the signal-to-noise ratio, and the signal-to-noise ratio is corrected by recalculating it using the sum of the addition signal and the frequency domain signal and the sum of the addition signal and the noise.

16. The noise removing method according to claim 14, wherein the frequency domain signal obtained by converting the input signal is weighted, and the noise is estimated based on the weighted frequency domain signal.

17. The noise removing method according to claim 1, wherein the suppression coefficient is corrected based on the frequency domain signal, and the frequency domain signal is weighted using the corrected suppression coefficient.

18. The noise removing method according to claim 1, wherein the time domain signal obtained by converting the frequency domain signal is subjected to windowing processing.

19. A noise removing method in which an input signal is converted into a frequency domain signal, second noise included in the frequency domain signal is estimated based on the frequency domain signal, a value corresponding to the second noise is subtracted from the frequency domain signal to obtain enhanced speech in the frequency domain, and the enhanced speech is converted into a time domain signal to obtain an output signal in which noise has been removed from the input signal, characterized in that a pseudo first noise is calculated based on the frequency domain signal obtained by converting the input signal, and a value corresponding to the noise obtained by adding the first noise to the second noise is subtracted from the frequency domain signal to obtain the enhanced speech in the frequency domain.

20. The noise removing method according to claim 19, wherein the addition of the first noise is selectively performed according to the property of the input signal.

21. The noise removing method according to claim 20, wherein the stationarity of the signal is used as the property of the input signal.

22. The noise removing method according to claim 21, wherein the number of zero crossings at which the amplitude of the input signal becomes zero is used as the stationarity of the signal.

23. The noise removing method according to claim 21, wherein the high frequency power of the frequency domain signal obtained by converting the input signal is used as the stationarity of the signal.

24. The noise removing method according to claim 19, wherein the power of the first noise is determined using the frequency domain signal and the second noise.

25. The noise removing method according to claim 19, wherein the frequency domain signal obtained by converting the input signal is weighted, and the second noise is estimated based on the weighted frequency domain signal.

26. The noise removing method according to claim 25, wherein a signal-to-noise ratio is obtained using the frequency domain signal obtained by converting the input signal, a weight is obtained using the signal-to-noise ratio, and the frequency domain signal is weighted using this weight.

27. The noise removing method according to claim 25, wherein a signal-to-noise ratio is obtained using the frequency domain signal obtained by converting the input signal, the signal-to-noise ratio is processed by a non-linear function to obtain a weight, and the frequency domain signal is weighted using this weight.

28. The noise removing method according to claim 19, wherein windowing processing is performed on the time domain signal obtained by converting the enhanced speech in the frequency domain.

29. A noise removing method in which an input signal is converted into a frequency domain signal, a signal-to-noise ratio is obtained using this frequency domain signal, a suppression coefficient is determined based on this signal-to-noise ratio, the frequency domain signal is weighted using this suppression coefficient, and the weighted frequency domain signal is converted into a time domain signal to obtain an output signal in which noise has been removed from the input signal, characterized in that the time domain signal obtained by converting the frequency domain signal is subjected to windowing processing.

30. A noise removing method in which an input signal is converted into a frequency domain signal, second noise included in the frequency domain signal is estimated based on the frequency domain signal, a value corresponding to the second noise is subtracted from the frequency domain signal to obtain enhanced speech in the frequency domain, and the enhanced speech is converted into a time domain signal to obtain an output signal in which noise has been removed from the input signal, characterized in that the time domain signal obtained by converting the enhanced speech in the frequency domain is subjected to windowing processing.

31. A noise removing device comprising at least: a first windowing processing unit that performs windowing processing on an input signal and outputs the result; a conversion unit that converts the input signal windowed by the first windowing processing unit into a frequency domain signal, separates it into an amplitude component and a phase component, and outputs them; an estimated noise calculation unit that estimates and outputs second noise included in the frequency domain signal based on the amplitude component of the frequency domain signal; an injection noise calculation unit that calculates and outputs a pseudo first noise using the second noise and the amplitude component of the frequency domain signal; a first adder that adds the first noise and the amplitude component of the frequency domain signal and outputs the result; a second adder that adds the first noise and the second noise and outputs the result; a first signal-to-noise ratio calculation unit that receives the output signal of the first adder and the output signal of the second adder, and obtains and outputs a first signal-to-noise ratio; a suppression coefficient generation unit that determines and outputs a suppression coefficient based on the first signal-to-noise ratio; a first multiplication unit that weights the amplitude component of the frequency domain signal using the suppression coefficient and outputs the result; an inverse conversion unit that converts the amplitude component of the frequency domain signal weighted by the first multiplication unit and the phase component of the frequency domain signal into a time domain signal and outputs it; and a second windowing processing unit that performs windowing processing on the time domain signal and outputs the result.

32. The noise removing device according to claim 31, wherein the injection noise calculation unit comprises: a zero crossing calculation unit that receives the input signal, calculates the number of zero crossings at which the amplitude of the input signal becomes zero, and outputs a control signal according to the calculation result; and a switch that selectively sets the first noise to zero according to the control signal input from the zero crossing calculation unit.

33. The noise removing device according to claim 31, wherein the injection noise calculation unit comprises: a high band power calculation unit that calculates the high frequency power of the amplitude component of the frequency domain signal input from the conversion unit and outputs a control signal according to the calculation result; and a switch that selectively sets the first noise to zero according to the control signal input from the high band power calculation unit.

34. The noise removing device according to claim 31, further comprising a weighted deteriorated speech calculation unit that weights the amplitude component of the frequency domain signal and outputs the obtained weighted amplitude component to the estimated noise calculation unit, thereby causing the estimated noise calculation unit to estimate the second noise based on the weighted amplitude component.

35. The noise removing device according to claim 34, wherein the weighted deteriorated speech calculation unit comprises: a second signal-to-noise ratio calculation unit that calculates and outputs a second signal-to-noise ratio using the amplitude component of the frequency domain signal; a non-linear processing unit that processes the second signal-to-noise ratio input from the second signal-to-noise ratio calculation unit with a non-linear function to obtain a weight, and outputs the weight; and a second multiplication unit that weights the amplitude component of the frequency domain signal using the weight input from the non-linear processing unit and outputs the result to the estimated noise calculation unit.

36. The noise removing device according to claim 31, further comprising a suppression coefficient correction unit that corrects the suppression coefficient input from the suppression coefficient generation unit based on the frequency domain signal and outputs the corrected suppression coefficient to the first multiplication unit, thereby causing the first multiplication unit to weight the amplitude component of the frequency domain signal using the corrected suppression coefficient.

37. A noise removing device comprising at least: a first windowing processing unit that performs windowing processing on an input signal and outputs the result; a conversion unit that converts the input signal windowed by the first windowing processing unit into a frequency domain signal, separates it into an amplitude component and a phase component, and outputs them; a first signal-to-noise ratio calculation unit that calculates and outputs a first signal-to-noise ratio using the amplitude component of the frequency domain signal; an estimated noise calculation unit that estimates and outputs noise included in the frequency domain signal based on the amplitude component of the frequency domain signal; a signal-to-noise ratio correction unit that corrects the first signal-to-noise ratio using the noise and the amplitude component of the frequency domain signal and outputs the result as a corrected signal-to-noise ratio; a suppression coefficient generation unit that determines and outputs a suppression coefficient based on the corrected signal-to-noise ratio; a first multiplication unit that weights the amplitude component of the frequency domain signal using the suppression coefficient and outputs the result; an inverse conversion unit that converts the amplitude component of the frequency domain signal weighted by the first multiplication unit and the phase component of the frequency domain signal into a time domain signal and outputs it; and a second windowing processing unit that performs windowing processing on the time domain signal.

38. The noise removing device according to claim 37, wherein the signal-to-noise ratio correction unit comprises: a determination unit that receives the input signal, calculates the number of zero crossings at which the amplitude of the input signal becomes zero, and outputs a control signal according to the calculation result; and a switch that, according to the control signal input from the determination unit, selectively sets the corrected signal-to-noise ratio to the same value as the first signal-to-noise ratio before correction.

39. The noise removing device according to claim 37, wherein the signal-to-noise ratio correction unit comprises: a determination unit that calculates the high frequency power of the amplitude component of the frequency domain signal input from the conversion unit and outputs a control signal according to the calculation result; and a switch that, according to the control signal input from the determination unit, selectively sets the corrected signal-to-noise ratio to the same value as the first signal-to-noise ratio before correction.

40. The noise removing device according to claim 37, further comprising a weighted deteriorated speech calculation unit that weights the amplitude component of the frequency domain signal and outputs the obtained weighted amplitude component to the estimated noise calculation unit, thereby causing the estimated noise calculation unit to estimate the noise based on the weighted amplitude component.

41. The noise removing device according to claim 40, wherein the weighted deteriorated speech calculation unit comprises: a second signal-to-noise ratio calculation unit that calculates and outputs a second signal-to-noise ratio using the amplitude component of the frequency domain signal; a non-linear processing unit that processes the second signal-to-noise ratio input from the second signal-to-noise ratio calculation unit with a non-linear function to obtain a weight, and outputs the weight; and a second multiplication unit that weights the amplitude component of the frequency domain signal using the weight input from the non-linear processing unit and outputs the result to the estimated noise calculation unit.

42. The noise removing device according to claim 37, further comprising a suppression coefficient correction unit that corrects the suppression coefficient input from the suppression coefficient generation unit based on the frequency domain signal and outputs the corrected suppression coefficient to the first multiplication unit, thereby causing the first multiplication unit to weight the amplitude component of the frequency domain signal using the corrected suppression coefficient.
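
Illustrative note (not part of the claims). The noise-injection step recited in claims 1, 7 and 31 can be sketched per frequency bin as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: the function names, the use of power values, the rule `alpha * min(Y, N)` for the pseudo first noise, the Wiener-like gain `1 - 1/SNR`, and the gain floor are all hypothetical choices made only for illustration. The claims require only that the first noise be computed from the frequency domain signal and the estimated (second) noise, and that the signal-to-noise ratio be formed from the two sums.

```python
def zero_crossings(frame):
    """Count sign changes between consecutive time-domain samples.

    Stands in for the 'number of zero crossings' used as a stationarity
    measure; how the count gates injection is not fixed here.
    """
    return sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))


def injected_noise(y_pow, n_pow, alpha=0.5):
    """Hypothetical rule for the pseudo first noise: alpha * min(Y, N) per bin."""
    return [alpha * min(y, n) for y, n in zip(y_pow, n_pow)]


def suppression_gains(y_pow, n_pow, inject=True, alpha=0.5, floor=0.1):
    """Per-bin gains from the SNR of the noise-injected spectra.

    y_pow: degraded-speech power per bin; n_pow: estimated (second) noise power.
    With inject=False the first noise is set to zero, as a switch would do.
    """
    if inject:
        p = injected_noise(y_pow, n_pow, alpha)
    else:
        p = [0.0] * len(y_pow)
    gains = []
    for y, n, q in zip(y_pow, n_pow, p):
        # SNR from (signal + first noise) over (second noise + first noise).
        snr = (y + q) / (n + q) if (n + q) > 0 else float("inf")
        # Assumed Wiener-like mapping from SNR to a suppression coefficient.
        gain = 1.0 - 1.0 / snr if snr > 0 else 0.0
        gains.append(max(floor, gain))
    return gains


def weight_amplitudes(amplitudes, gains):
    """Weight the amplitude component with the suppression coefficients."""
    return [a * g for a, g in zip(amplitudes, gains)]
```

In this sketch, injecting the same first noise into both the numerator and the denominator pulls the estimated signal-to-noise ratio toward 1, so bins with a high ratio receive somewhat stronger suppression than they would without injection. A caller could derive the `inject` flag from `zero_crossings` or from high-band power, but the threshold and its direction would be further assumptions.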