JPH09258791A

JPH09258791A - Noise suppressing device

Info

Publication number: JPH09258791A
Application number: JP8068228A
Authority: JP
Inventors: Ikuyo Masuda; 郁代増田; Kiyoaki Aikawa; 清明相川
Original assignee: ATR Advanced Telecommunications Research Institute International
Current assignee: ATR Advanced Telecommunications Research Institute International
Priority date: 1996-03-25
Filing date: 1996-03-25
Publication date: 1997-10-03
Anticipated expiration: 2016-03-25
Also published as: JP2880683B2

Abstract

PROBLEM TO BE SOLVED: To provide a noise suppressing device that can suppress noise by predicting the property of a signal from a preceding part of extracted signals even if the property of noise is unknown. SOLUTION: An input voice signal inputted from an input part 1 is intensified in high frequency by a preprocessing part 2, and amplitude spectrum is detected by a spectrum detecting part 3 and stored as measured spectrum in a first storage part 4. Spectral envelope at a following time point is predicted from the measured spectrum by a spectrum predicting part 5 on the basis of the time change locus of spectral envelope. The predicted spectrum and the measured spectrum are compared in a spectrum comparing part 6, and a signal not included in the predicted spectrum is extracted. The spectrum is determined by a spectrum determining part 7, and after being stored in a second storage part 8, the spectrum is Fourier-transformed by an output part 9 so as to be converted into a sound signal and outputted.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】この発明は雑音抑制装置に関
し、特に、スペクトルの変化軌跡から次時点のスペクト
ルを予測し、実測されたスペクトルと予測スペクトルと
を比較し、出力となるスペクトルを決定して雑音を抑制
するような雑音抑制装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a noise suppressor, and more particularly to predicting a spectrum at a next time point from a spectrum change locus, comparing an actually measured spectrum with a predicted spectrum, and determining an output spectrum. The present invention relates to a noise suppression device that suppresses noise.

【０００２】[0002]

【従来の技術】従来より音声認識装置において入力され
た音声信号に含まれている雑音を抑制するために、種々
の雑音抑制装置が提案されている。その一例として、特
開平１−１１８９００号公報に雑音抑圧装置が示されて
いる。この雑音抑圧装置は、音声信号を入力するための
マイクロフォンと、雑音信号を入力するためのマイクロ
フォンとを設け、これらのマイクロフォンの出力をバン
ドパスフィルタバンクに入力して帯域分割し、この帯域
分割された各信号成分に対し、２つのマイクロフォン出
力の位相の違いを求め、位相の違いに従って補正した信
号により帯域の雑音除去を行なうようにしたものであ
る。2. Description of the Related Art Conventionally, various noise suppression devices have been proposed in order to suppress noise contained in a voice signal input in a voice recognition device. As an example thereof, Japanese Patent Laid-Open No. 1-118900 discloses a noise suppression device. This noise suppressor is provided with a microphone for inputting a voice signal and a microphone for inputting a noise signal, and the outputs of these microphones are input to a bandpass filter bank for band division, and the band division is performed. For each signal component, the phase difference between the two microphone outputs is obtained, and the band noise is removed by the signal corrected according to the phase difference.

【０００３】[0003]

【発明が解決しようとする課題】上述の雑音抑圧装置
は、１つのマイクロフォンに入力されたモノラル信号に
複数の音響信号が混在するときには、信号の分離ができ
ないという欠点がある。The above noise suppressing device has a drawback in that the signals cannot be separated when a plurality of acoustic signals coexist in a monaural signal input to one microphone.

【０００４】雑音抑圧装置の他の例として、雑音の周波
数特性を推定して適応型櫛形フィルタの特性を定め、フ
ィルタリングにより信号を抽出する技術もあるが、雑音
が定常雑音でなければ、その周波数特性の推定が困難で
あるという問題がある。As another example of the noise suppressing device, there is a technique of estimating the frequency characteristic of noise to determine the characteristic of the adaptive comb filter and extracting the signal by filtering. There is a problem that it is difficult to estimate the characteristics.

【０００５】さらに、音声の調波生（周期性）やその他
のパラメントリックな特性を用いて分離する技術もある
が、雑音のパワーが大きく、分析によるパラメータの抽
出が困難な時点の音響信号を分離できないという問題点
がある。Further, there is a technique of separating by using harmonics (periodicity) of voice and other paramentic characteristics, but an acoustic signal at a time when it is difficult to extract parameters by analysis due to large noise power. There is a problem that cannot be separated.

【０００６】それゆえに、この発明の主たる目的は、モ
ノラル信号にも対応でき、雑音の性質が未知であっても
信号の性質が抽出できた前の部分から予測することによ
り信号の抽出を可能にした雑音抑制装置を提供すること
である。Therefore, the main object of the present invention is to deal with monaural signals, and enable signal extraction by predicting from the previous portion where the signal characteristics could be extracted even if the noise characteristics are unknown. Another object of the present invention is to provide a noise suppressing device.

【０００７】[0007]

【課題を解決するための手段】請求項１に係る発明は、
入力音声信号の雑音を抑制する雑音抑制装置であって、
入力音声信号のスペクトルの変化軌跡から次時点のスペ
クトル包絡を予測するスペクトル予測手段を備え、スペ
クトル予測手段によって予測されたスペクトルに含まれ
ない信号を抑制することによって雑音を抑制するように
したものである。The invention according to claim 1 is
A noise suppression device for suppressing noise of an input voice signal,
It is provided with a spectrum predicting unit that predicts a spectrum envelope at the next time point from a spectrum change trajectory of an input voice signal, and suppresses noise by suppressing a signal that is not included in the spectrum predicted by the spectrum predicting unit. is there.

【０００８】請求項２に係る発明は、請求項１の発明に
加えて、さらに入力音声信号のスペクトルを検出するス
ペクトル検出手段と、検出された実スペクトルを記憶す
る記憶手段を備え、スペクトル予測手段は、記憶手段に
記憶されている実スペクトルの変化軌跡から次時点のス
ペクトル包絡を予測し、さらに予測された予測スペクト
ルと記憶されている実スペクトルとを比較し、出力とな
るスペクトルを決定するスペクトル決定手段を備えて構
成される。According to a second aspect of the invention, in addition to the first aspect of the invention, a spectrum detecting means for detecting the spectrum of the input voice signal and a storage means for storing the detected real spectrum are provided, and the spectrum predicting means is provided. Is a spectrum for predicting the spectrum envelope at the next time point from the change trajectory of the real spectrum stored in the storage means, comparing the predicted spectrum and the stored real spectrum, and determining the spectrum to be the output. It is provided with a determining means.

【０００９】請求項３に係る発明では、請求項２の発明
に加えて、さらに入力音声信号の周期性を検出する周期
性検出手段を設け、スペクトル決定手段は周期性検出手
段によって検出された同期していない周波数チャネルの
信号を抑制する。According to a third aspect of the invention, in addition to the second aspect of the invention, a periodicity detecting means for detecting the periodicity of the input audio signal is further provided, and the spectrum determining means is the synchronization detected by the periodicity detecting means. Suppress the signal of the frequency channel that is not done.

【００１０】請求項４に係る発明では、請求項２または
３の発明に加えて、さらにピッチ周波数を予測するピッ
チ周波数予測手段を設け、スペクトル決定手段はピッチ
周波数予測手段によって予測されたピッチ周波数の高次
倍音以外の信号を抑制する。According to the invention of claim 4, in addition to the invention of claim 2 or 3, pitch frequency predicting means for predicting the pitch frequency is further provided, and the spectrum determining means determines the pitch frequency predicted by the pitch frequency predicting means. Signals other than higher harmonics are suppressed.

【００１１】[0011]

【発明の実施の形態】図１はこの発明の一実施形態を示
すブロック図である。まず、図１を参照して構成につい
て説明する。入力部１から音声入力信号が入力され、前
処理部２に与えられる。前処理部２はその入力音声信号
の高域を強調し、窓長４０（ｍｓｅｃ）のハミング窓で
２０（ｍｓｅｃ）毎に分析する。この分析結果はスペク
トル検出部３に与えられる。スペクトル検出部３はサン
プリング周波数８ｋＨｚにて振幅スペクトルをサンプル
して実測スペクトル包絡を検出し、第１の記憶部４に記
憶させる。FIG. 1 is a block diagram showing an embodiment of the present invention. First, the configuration will be described with reference to FIG. A voice input signal is input from the input unit 1 and given to the preprocessing unit 2. The pre-processing unit 2 emphasizes the high frequency band of the input audio signal and analyzes it every 20 (msec) with a Hamming window having a window length of 40 (msec). The result of this analysis is given to the spectrum detector 3. The spectrum detection unit 3 samples the amplitude spectrum at a sampling frequency of 8 kHz to detect the measured spectrum envelope, and stores it in the first storage unit 4.

【００１２】スペクトル予測部５はＡＲフィルタを用い
て、第１の記憶部４に記憶されている実測スペクトルの
変化軌跡から次時点のスペクトル包絡を予測し、その予
測結果をスペクトル比較部６に与える。スペクトル比較
部６には第１の記憶部４から実測スペクトルが与えられ
ており、スペクトル比較部６は実測スペクトルと予測ス
ペクトルとを比較し、実測スペクトルの振幅が予測スペ
クトルの振幅よりも大きいときは予測スペクトルの振幅
を出力し、実測スペクトルの振幅が予測スペクトルの振
幅よりも小さいときには、実測スペクトルの振幅値を出
力する。The spectrum predicting unit 5 uses the AR filter to predict the spectrum envelope at the next point in time from the change trajectory of the measured spectrum stored in the first storage unit 4, and gives the prediction result to the spectrum comparing unit 6. . The spectrum comparison unit 6 is provided with the measured spectrum from the first storage unit 4. The spectrum comparison unit 6 compares the measured spectrum with the predicted spectrum, and when the amplitude of the measured spectrum is larger than the amplitude of the predicted spectrum, The amplitude of the predicted spectrum is output, and when the amplitude of the measured spectrum is smaller than the amplitude of the predicted spectrum, the amplitude value of the measured spectrum is output.

【００１３】スペクトル比較部６から出力された振幅値
はスペクトル決定部７に与えられ、スペクトル決定部７
は入力された振幅値によりスペクトルを決定し、第２の
記憶部８に記憶した後、出力部９に出力する。The amplitude value output from the spectrum comparison unit 6 is given to the spectrum determination unit 7, and the spectrum determination unit 7
Determines the spectrum from the input amplitude value, stores it in the second storage unit 8, and then outputs it to the output unit 9.

【００１４】次に、図１に示した実施の形態の動作につ
いて説明する。入力信号が前処理部で前処理された後、
スペクトル検出部３において、１２８点ＦＦＴ（サンプ
リング周波数８ｋＨｚ）により振幅スペクトルが算出さ
れ、１２８点から８１９２点へアップサンプリングす
る。スペクトルの形状を積分スペクトル逆関数を用いて
表わす。この関数は正規化されたスペクトルの原点から
の積分値がある指定された例となる周波数として求めら
れる。積分値は等間隔に５０分割とし、各積分値Ｉ_kを
与える周波数ｖ_kを求める。ここで、振幅スペクトルを
Ｓ（ｖ）とすると、Next, the operation of the embodiment shown in FIG. 1 will be described. After the input signal is pre-processed by the pre-processing unit,
In the spectrum detection unit 3, an amplitude spectrum is calculated by a 128-point FFT (sampling frequency 8 kHz) and upsampling is performed from 128 points to 8192 points. The shape of the spectrum is represented by the inverse integral spectrum function. This function is determined as the specified example frequency with the integral value from the origin of the normalized spectrum. The integral value is divided into 50 at equal intervals, and the frequency v _k giving each integral value I _k is _obtained . Here, if the amplitude spectrum is S (v),

【００１５】[0015]

【数１】 [Equation 1]

【００１６】となる。ここで、時刻ｔにおいて積分値Ｉ
_k（ｋ＝１〜５０）を与える周波数をｖ_k（ｔ）とす
る。## EQU1 ## Here, at time t, the integrated value I
Let v _k (t) be the frequency that gives _k (k = 1 to 50).

【００１７】第１の記憶部４はスペクトル検出部３で検
出されたすべての時刻ｔにおけるＳ（ｖ），ｖ_k（ｔ）
を記憶する。The first storage unit 4 stores S (v), v _k (t) at all times t detected by the spectrum detection unit 3.
Is stored.

【００１８】スペクトル予測部５はＡＲフィルタを用い
てスペクトル包絡の予測を行なう。実測スペクトルの積
分値Ｉ_kを与える周波数ｖ_k（ｔ）を入力したとき、Ａ
Ｒフィルタを通ったスペクトル（以下、これをFiltered
スペクトルと称する）の積分値Ｉ_kを与える周波数ｕ_k
（ｔ）は次の第（２）式で求められる。ここで、Ｇ，α
₁，α₂は定数である。たとえば、Ｇ＝０．９９，α１
＝−０．１６，α２＝０．０２などである。ｕ_k（ｔ）
は過去の追跡軌跡からの外分成分と、現入力との内分に
よって決まる。The spectrum predictor 5 predicts the spectrum envelope using an AR filter. When the frequency v _k (t) that gives the integral value I _k of the measured spectrum is input, A
The spectrum that passed through the R filter (hereinafter, this is Filtered
Frequency u _k giving the integral value I _k of the spectrum)
(T) is calculated by the following equation (2). Where G, α
₁ and α ₂ are constants. For example, G = 0.99, α1
= −0.16, α2 = 0.02 and so on. u _k (t)
Is determined by the external component from the past tracking trajectory and the internal component of the current input.

【００１９】ｕ_k（ｔ）＝Ｇｖ_k（ｔ）−α₁ｕ_k（ｔ−１）−α₂ｕ_k（ｔ−２） …（２）さらに、時刻ｔ＋１における積分値Ｉ_kを与える周波数
ｖ_k（ｔ）を予測するために、外分の働きを用いる。２
次の伝達関数を用いる場合、時刻ｔのｕ_k（ｔ）と時刻
ｔ−１のｕ_k（ｔ−１）により時刻ｔ＋１での周波数ｕ
_k（ｔ＋１）を予測する。後ほど逆フィルタに通す都合
上、時刻ｔ＋１での実測値ｖ_k（ｔ＋１）が必要なの
で、便宜上時刻ｔの実測値ｖ_k（ｔ）を時刻ｔ＋１の仮
入力とする。時刻ｔ＋１の周波数ｕ_k（ｔ＋１）は次の
第（３）式で求められる。U _k (t) = Gv _k (t) -α ₁ u _k (t-1) -α ₂ u _k (t-2) (2) Further, the frequency that gives the integrated value I _k at time t + 1 The external work is used to predict v _k (t). Two
When the following transfer function is used, the frequency u at time t + 1 is calculated by u _k (t) at time t and u _k (t-1) at time t-1.
Predict _k (t + 1). Since the actual measurement value v _k (t + 1) at time t + 1 is required for the sake of passing through an inverse filter later, the actual measurement value v _k (t) at time t is provisionally input at time t + 1 for convenience. The frequency u _k (t + 1) at time t + 1 is calculated by the following expression (3).

【００２０】ｕ_k（ｔ＋１）＝Ｇｖ_k（ｔ）−α₁ｕ_k（ｔ）−α₂ｕ_k（ｔ−１） …（３）予測値ｕ_k（ｔ＋１）は、Filteredスペクトルの積分値
Ｉ_kを与える周波数である。予測されたFilteredスペク
トルのｕ_k（ｔ）を先程予測に用いたＡＲフィルタの逆
フィルタに通してｒ_k（ｔ）を得る。ｒ_k（ｔ）は次の
第（４）式で実現される。U _k (t + 1) = Gv _k (t) −α ₁ u _k (t) −α ₂ u _k (t−1) (3) The predicted value u _k (t + 1) is the integrated value of the Filtered spectrum. It is the frequency that gives I _k . The u _k (t) of the predicted Filtered spectrum is passed through an inverse filter of the AR filter used for the prediction to obtain r _k (t). r _k (t) is realized by the following equation (4).

【００２１】ｒ_k（ｔ）＝ｕ_k（ｔ）＋α₁ｒ_k（ｔ−１）＋α₂ｒ_k（ｔ−２） …（４）時刻ｔ＋１の予測スペクトルは、ｒ_k（ｔ＋１）および
ｒ_k（ｔ）およびｒ_k（ｔ−１）と、それらの周波数に
おけるスペクトルの振幅値により推定できる。予測スペ
クトルを算出する際、仮に入力を入れたことによる誤差
や変化方向の転換点付近の予測ずれによる信号の欠落を
防ぐために、予測スペクトルに、近傍関数としてガウス
関数を重畳する。R _k (t) = u _k (t) + α ₁ r _k (t-1) + α ₂ r _k (t-2) (4) The prediction spectrum at time t + 1 is r _k (t + 1) and r _It can be estimated by _k (t) and r _k (t-1) and the amplitude value of the spectrum at those frequencies. When the prediction spectrum is calculated, a Gaussian function is superimposed on the prediction spectrum as a neighborhood function in order to prevent errors due to provisional input and signal loss due to prediction deviation near the turning point in the direction of change.

【００２２】スペクトル比較部６は第１の記憶部４から
与えられる実測スペクトルの振幅がスペクトル予測部５
で予測された予測スペクトルの振幅より大きいときは予
測スペクトルの振幅を出力し、実測スペクトルの振幅が
予測スペクトルの振幅より小さいときは実測スペクトル
の振幅値を出力する。そして、スペクトル決定部７は与
えられた予測スペクトルまたは実測スペクトルの振幅値
によりスペクトルを決定し、決定したすべての時刻にお
けるスペクトルを第２の記憶部８に記憶させる。そし
て、出力部９は第２の記憶部８に記憶されたすべての時
刻におけるスペクトルをフーリエ逆変換し、音響信号に
変換して出力する。In the spectrum comparison unit 6, the amplitude of the actually measured spectrum given from the first storage unit 4 is the spectrum prediction unit 5.
If the amplitude of the predicted spectrum is larger than the amplitude of the predicted spectrum, the amplitude of the predicted spectrum is output, and if the amplitude of the measured spectrum is smaller than the amplitude of the predicted spectrum, the amplitude value of the measured spectrum is output. Then, the spectrum determination unit 7 determines the spectrum based on the amplitude value of the given predicted spectrum or the measured spectrum, and stores the spectra at all the determined times in the second storage unit 8. Then, the output unit 9 performs the inverse Fourier transform on the spectra at all times stored in the second storage unit 8, converts the spectrum into an acoustic signal, and outputs the acoustic signal.

【００２３】図２はこの発明の第２の実施形態を示す図
である。この実施形態は、前処理部２の出力とスペクト
ル決定部１１との間にチャネル間同期性検出部１０を設
けたものであって、それ以外の構成は図１と同じであ
る。チャネル間同期性検出部１０は実測スペクトルの振
幅に対して時間と周波数について相互相関をとり、ピッ
チ周波数および各周波数チャネルの同期性を検出する。
チャネル間の相互相関値が定められたしきい値以上のと
き、相互相関をとったチャネル同士は同期していいもの
として検出する。なお、しきい値は、たとえば０．９７
とするがこの値に限定されるものではない。FIG. 2 is a diagram showing a second embodiment of the present invention. In this embodiment, an inter-channel synchronism detecting section 10 is provided between the output of the pre-processing section 2 and the spectrum determining section 11, and the rest of the configuration is the same as in FIG. The inter-channel synchronism detecting section 10 detects the pitch frequency and the synchronism of each frequency channel by cross-correlating the amplitude of the actually measured spectrum with respect to time and frequency.
When the cross-correlation value between the channels is equal to or more than a predetermined threshold value, the channels having the cross-correlation are detected as being good in synchronization. The threshold value is 0.97, for example.
However, the value is not limited to this value.

【００２４】スペクトル決定部１１はスペクトル比較部
６の出力とチャネル間同期性検出部１０で検出された同
期性とを用いてスペクトルを決定する。The spectrum determining section 11 determines a spectrum using the output of the spectrum comparing section 6 and the synchronism detected by the inter-channel synchronism detecting section 10.

【００２５】図３はこの発明の第３の実施形態を示すブ
ロック図である。この実施形態は、ピッチ周波数を予測
し、予測されたピッチ周波数の高次倍音以外の信号を抑
制するようにしたものである。このために、前処理部２
とスペクトル決定部１７との間にはピッチ周波数実測部
１２と第３の記憶部１３とピッチ周波数予測部１４とピ
ッチ周波数比較・決定部１５と倍音推定部１６とが設け
られる。FIG. 3 is a block diagram showing a third embodiment of the present invention. In this embodiment, the pitch frequency is predicted and signals other than the higher harmonics of the predicted pitch frequency are suppressed. For this purpose, the preprocessing unit 2
A pitch frequency measuring unit 12, a third storage unit 13, a pitch frequency predicting unit 14, a pitch frequency comparing / determining unit 15, and an overtone estimating unit 16 are provided between the spectrum determining unit 17 and the spectrum determining unit 17.

【００２６】ピッチ周波数実測部１２は前処理部２で処
理された実測スペクトルから、同期しているチャネルの
自己相関量の極大値をとる時間間隔の逆数としてピッチ
周波数を実測する。そして、同じピッチ周波数検出視標
を求め、自己相関値の極大値が定められたしきい値より
も大きく求められた周波数がピッチ周波数として適当な
値である場合は、ピッチ周波数検出視標を１とし、それ
以外は０とする。ピッチ周波数実測部１２で実測された
ピッチ周波数をすべての時刻ｔについて第３の記憶部１
３に記憶する。The pitch frequency measuring unit 12 measures the pitch frequency as the reciprocal of the time interval at which the maximum value of the autocorrelation amount of the synchronized channels is obtained from the measured spectrum processed by the preprocessing unit 2. Then, the same pitch frequency detection target is obtained, and when the frequency obtained when the maximum value of the autocorrelation value is larger than the predetermined threshold value is an appropriate value as the pitch frequency, the pitch frequency detection target is set to 1 And 0 otherwise. The pitch frequency measured by the pitch frequency measurement unit 12 is set to the third storage unit 1 for all times t.
3 is stored.

【００２７】第３の記憶部１３に記憶されたピッチ周波
数に基づいて、ピッチ周波数予測部１４はスペクトル包
絡の場合と同様に２次のＡＲモデルを用いて予測を行な
う。このとき、ＡＲモデルのパラメータはスペクトル包
絡の予測に用いたフィルタのパラメータと違っていても
よい。ピッチ周波数比較・決定部１５はピッチ周波数予
測部１４で予測されたピッチ周波数の予測値に対して、
第３の記憶部１３から与えられるピッチ周波数の実測値
がたとえばその１０％以内であるとき、ピッチ周波数と
してピッチ周波数実測部１２で実測されたピッチ周波数
と決定し、それ以外のときは逆フィルタの出力値をピッ
チ周波数と決定する。倍音推定部１６はピッチ周波数比
較・決定部１５で求められたピッチの倍音関係のある周
波数を決定してスペクトル決定部１７に与える。On the basis of the pitch frequency stored in the third storage unit 13, the pitch frequency prediction unit 14 performs prediction using the secondary AR model as in the case of the spectrum envelope. At this time, the parameters of the AR model may be different from the parameters of the filter used to predict the spectral envelope. The pitch frequency comparison / determination unit 15 uses the pitch frequency prediction value predicted by the pitch frequency prediction unit 14,
When the measured value of the pitch frequency given from the third storage unit 13 is within 10%, for example, it is determined as the pitch frequency actually measured by the pitch frequency measurement unit 12, and in other cases, it is determined by the inverse filter. The output value is determined as the pitch frequency. The harmonic overtone estimation unit 16 determines a frequency having a harmonic overtone relation of the pitch obtained by the pitch frequency comparison / determination unit 15 and gives it to the spectrum determination unit 17.

【００２８】スペクトル決定部１７はスペクトル比較部
６の出力と第３の記憶部１３の出力のうち、ピッチ周波
数検出視標と倍音推定部１６の出力によりスペクトルを
決定する。The spectrum determining section 17 determines a spectrum from the outputs of the spectrum comparing section 6 and the third storage section 13 based on the outputs of the pitch frequency detecting target and the overtone estimating section 16.

【００２９】図４はこの発明の第４の実施形態を示す図
である。この実施形態は、図２に示したチャネル間同期
検出部１０を図３に示した実施形態に加えたものであ
る。すなわち、チャネル間同期検出部１０の同期検出出
力をピッチ周波数実測部１２に与えるとともに、スペク
トル決定部１７に与える。スペクトル決定部１７にはス
ペクトル比較部６の出力とチャネル間同期検出部１０の
出力と第３の記憶部１３に記憶されているピッチ周波数
検出視標と倍音推定部１６の出力が与えられる。FIG. 4 is a diagram showing a fourth embodiment of the present invention. In this embodiment, the inter-channel synchronization detector 10 shown in FIG. 2 is added to the embodiment shown in FIG. That is, the synchronization detection output of the inter-channel synchronization detection unit 10 is given to the pitch frequency measurement unit 12 and the spectrum determination unit 17. The spectrum determination unit 17 is supplied with the output of the spectrum comparison unit 6, the output of the inter-channel synchronization detection unit 10, the output of the pitch frequency detection target stored in the third storage unit 13, and the output of the overtone estimation unit 16.

【００３０】スペクトル決定部１７はスペクトル比較部
６の出力と第３の記憶部１３の出力のうちのピッチ周波
数検出視標と倍音推定部１６の出力とによりスペクトル
を決定する。ピッチ周波数と調波関係にある周波数、か
つ予測スペクトルの最大ピーク周波数における振幅の時
間変化と同期している周波数の信号を抽出信号として選
択する。このとき、たとえ採用された周波数がスペクト
ル包絡の連続性の条件を満たしていなくても採用する。
また、検出されたピッチの周波数が予測値を逆フィルタ
に通した値から大きく離れたとき、同期性を用いること
なく、スペクトル包絡の連続性の条件を満たす周波数の
うち、予測されたピッチ周波数を逆フィルタに通した値
と調波関係にある周波数のスペクトルのみを選択する。
これにより、スペクトル包絡の予測ミスや途中で消滅ま
たは出現することがある高次フォルマントの取りこぼし
を防ぐことができる。The spectrum determining section 17 determines a spectrum based on the output of the spectrum comparing section 6 and the output of the pitch frequency detecting target and the overtone estimating section 16 among the outputs of the third storing section 13. A signal having a frequency that is in a harmonic relationship with the pitch frequency and that is synchronized with the time change of the amplitude at the maximum peak frequency of the predicted spectrum is selected as the extraction signal. At this time, even if the adopted frequency does not satisfy the continuity condition of the spectral envelope, it is adopted.
In addition, when the frequency of the detected pitch greatly deviates from the value obtained by passing the predicted value through the inverse filter, the predicted pitch frequency is selected from among the frequencies satisfying the continuity of the spectrum envelope without using synchronism. Select only the spectrum of frequencies that have a harmonic relationship with the value passed through the inverse filter.
As a result, it is possible to prevent misprediction of the spectrum envelope and omission of higher-order formants that may disappear or appear in the middle.

【００３１】なお、前述の各実施例では前処理部２で高
域強調を行なうようにしたが、特に高域強調をしなくて
も実現できる。また、時間分割はハミング窓でなくても
よく、窓長さも４０ｍｓｅｃに限ることなく、分析間隔
も２０ｍｓｅｃに限ることはない。さらに、スペクトル
検出部３において、検出手段としてＦＦＴに限ることな
く、たとえばＬＰＣ，ケプストラム，狭帯域フィルタな
どを用いることにより推定してもよい。サンプリング周
波数や分析辞次数も任意の値に選んでもよい。さらに、
検出されるスペクトルは振幅スペクトルに限ることな
く、パワースペクトルであってもよい。さらに、スペク
トル形状の表現として積分逆関数を使うことなく、ケプ
ストラムのパラメータなどを使用してもよい。In each of the above-described embodiments, the high frequency region is emphasized by the preprocessing unit 2, but it can be realized without particularly enhancing the high frequency region. The time division does not have to be the Hamming window, the window length is not limited to 40 msec, and the analysis interval is not limited to 20 msec. Further, in the spectrum detection unit 3, the detection means is not limited to the FFT, but may be estimated by using, for example, an LPC, a cepstrum, a narrow band filter or the like. The sampling frequency and analysis order may be selected as desired. further,
The detected spectrum is not limited to the amplitude spectrum but may be the power spectrum. Further, the cepstrum parameter or the like may be used without using the integral inverse function as the expression of the spectrum shape.

【００３２】さらに、スペクトル予測後において、ＡＲ
フィルタのパラメータは他の値であってもよく、また予
測法は２次のＡＲフィルタに限ることはない。Furthermore, after spectrum prediction, AR
The parameters of the filter may have other values, and the prediction method is not limited to the second-order AR filter.

【００３３】[0033]

【発明の効果】以上のように、この発明によれば、スペ
クトル包絡の時間的変化軌跡に基づいて、次時点のスペ
クトル包絡を予測し、予測スペクトルに含まれない信号
を抑制して雑音を抑制するようにしたので、雑音の性質
が未知であっても、雑音のパワーが非常に大きく、その
時点では信号の特徴が得られなくても、信号の性質が抽
出できた前の部分から予測することができるので、雑音
を抑制して信号の抽出が可能となる。As described above, according to the present invention, the spectrum envelope at the next point in time is predicted based on the temporal change trajectory of the spectrum envelope, and signals not included in the predicted spectrum are suppressed to suppress noise. Therefore, even if the nature of the noise is unknown, the power of the noise is very large, and even if the characteristics of the signal cannot be obtained at that time, prediction is performed from the part before the characteristics of the signal could be extracted. Therefore, it is possible to suppress noise and extract a signal.

[Brief description of drawings]

【図１】この発明の第１の実施形態を示す図である。FIG. 1 is a diagram showing a first embodiment of the present invention.

【図２】この発明の第２の実施形態を示す図であって、
チャネル間同期検出部を設けた例を示す。FIG. 2 is a diagram showing a second embodiment of the present invention,
An example in which an inter-channel synchronization detector is provided is shown.

【図３】この発明の第３の実施形態を示す図であって、
ピッチ周波数を実測して比較する例を示す。FIG. 3 is a diagram showing a third embodiment of the present invention,
An example of actually measuring and comparing pitch frequencies will be shown.

【図４】この発明の第４の実施形態を示す図であって、
チャネル間同期検出部とピッチ周波数実測部と比較部と
を設けた例を示す。FIG. 4 is a diagram showing a fourth embodiment of the present invention,
An example in which an inter-channel synchronization detection unit, a pitch frequency measurement unit and a comparison unit are provided will be shown.

[Explanation of symbols]

１入力部２前処理部３スペクトル検出部４第１の記憶部５スペクトル予測部６スペクトル比較部７，１１，１７スペクトル決定部８第２の記憶部９出力部１０チャネル間同期検出部１２ピッチ周波数実測部１３第３の記憶部１４ピッチ周波数予測部１５ピッチ周波数比較・決定部１６倍音推定部 1 Input Section 2 Pre-Processing Section 3 Spectrum Detection Section 4 First Storage Section 5 Spectrum Prediction Section 6 Spectrum Comparison Section 7, 11, 17 Spectrum Determining Section 8 Second Storage Section 9 Output Section 10 Inter-Channel Synchronization Detection Section 12 Pitch Frequency measurement unit 13 Third storage unit 14 Pitch frequency prediction unit 15 Pitch frequency comparison / determination unit 16 Overtone estimation unit

Claims

[Claims]

1. A noise suppressing device for suppressing noise of an input voice signal, comprising: a spectrum predicting unit for predicting a spectrum envelope at a next time point from a change trajectory of a spectrum of the input voice signal. A noise suppression device, characterized in that noise is suppressed by suppressing signals not included in the generated spectrum.

2. A spectrum detecting means for detecting a spectrum of the input voice signal, a storage means for storing an actual spectrum detected by the spectrum detecting means, and the spectrum predicting means being stored in the storage means. The spectrum envelope at the next time point is predicted from the change trajectory of the real spectrum, and the predicted spectrum predicted by the spectrum prediction means is compared with the real spectrum stored in the storage means to determine an output spectrum. The noise suppression device according to claim 1, further comprising:

3. Further comprising a periodicity detecting means for detecting the periodicity of the input audio signal, wherein the spectrum determining means suppresses a signal of an unsynchronized frequency channel detected by the periodicity detecting means. The noise suppression device according to claim 2, wherein:

4. A pitch frequency predicting means for predicting a pitch frequency is further included, and the spectrum determining means suppresses signals other than higher harmonics of the pitch frequency predicted by the pitch frequency predicting means. The noise suppressing device according to claim 2 or 3.