JP5115952B2

JP5115952B2 - Noise suppression device and noise suppression method

Info

Publication number: JP5115952B2
Application number: JP2007071688A
Authority: JP
Inventors: 造田邉; 利博古川
Original assignee: Tokyo University of Science
Current assignee: Tokyo University of Science
Priority date: 2007-03-19
Filing date: 2007-03-19
Publication date: 2013-01-09
Anticipated expiration: 2027-03-19
Also published as: JP2008236270A

Description

The present invention relates to a noise suppression device and a noise suppression method based on a Kalman filter.

Removing unnecessary information from received information in which it is mixed with desired information (that is, information corrupted by additive noise or the like) and extracting only the desired information is an important technology in fields such as speech, wireless communication, imaging, and attitude control, and it has been actively researched and developed in recent years.

For example, known noise suppression methods in the speech field include methods using a single microphone and methods using a microphone array composed of a plurality of microphones.

However, in the method using a microphone array, the number of microphones inevitably increases in proportion to the number of noise signals, which increases cost. For this reason, development of noise suppression methods using a single microphone is currently the mainstream.

The following algorithms are known for conventional noise suppression methods that use only a single microphone.

The ANC (adaptive noise canceller) algorithm described in Non-Patent Document 1 reduces the noise signal by exploiting the periodicity of the speech signal.

Non-Patent Document 2 describes a noise suppression algorithm based on linear prediction. This algorithm requires neither pitch estimation nor prior knowledge of the noise power spectrum or the mean direction of the noise.

Separately from the above algorithms, Non-Patent Document 3 proposes a noise suppression algorithm based on a Kalman filter. This algorithm models the speech signal as an autoregressive (AR) process. It estimates the coefficients of the autoregressive model (hereinafter "AR coefficients") and performs noise suppression based on a Kalman filter that uses the estimated AR coefficients. Because it needs to estimate AR coefficients, this algorithm differs essentially from the other algorithms described above.

Many algorithms based on the Kalman filter normally operate in two stages. That is, such an algorithm first estimates the AR coefficients and then performs noise suppression based on a Kalman filter that uses the estimated AR coefficients.
Non-Patent Document 1: J.R. Deller, J.G. Proakis, J.H.L. Hansen, "Discrete-Time Processing of Speech Signals," Macmillan Press, 1993
Non-Patent Document 2: A. Kawamura, K. Fujii, Y. Itoh and Y. Fukui, "A Noise Reduction Method Based on Linear Prediction Analysis," IEICE Trans. Fundamentals, vol.J85-A, no.4, pp.415-423, May 2002
Non-Patent Document 3: W. Kim and H. Ko, "Noise Variance Estimation for Kalman Filtering of Noise Speech," IEICE Trans. Inf. & Syst., vol.E84-D, no.1, pp.155-160, Jan 2001

However, the known algorithm described in Non-Patent Document 1 requires accurate estimation of the pitch period of the speech signal. This algorithm therefore has the problem that its noise suppression capability is degraded by the additive noise.

In this respect, the algorithm described in Non-Patent Document 2 can suppress noise without accurate estimation of the pitch period of the speech signal. It also has the advantages that its principle is simple and its amount of computation can be kept small. However, its noise suppression capability depends on characteristics of the input speech signal such as periodicity and linearity. In other words, because its noise suppression capability involves parameters that depend on the speech signal, there is a limit to its practical use.

The algorithm described in Non-Patent Document 3 has a strong noise suppression capability and is particularly suited to acoustic applications in which high sound quality is desired.

On the other hand, because this algorithm requires AR coefficients, its noise suppression performance (that is, the performance of the Kalman filter algorithm) depends heavily on the estimation accuracy of the AR coefficients. If the AR coefficients are not estimated accurately, not only may the noise not be removed completely, but in some cases part of the speech signal may be removed along with the noise. Either effect can degrade the sound quality of the noise-suppressed speech signal.

In general, accurate estimation of the AR coefficients is difficult. In noise removal, for example, accurate estimation of the AR coefficients depends on the clear speech signal. This means that the speech signal must be known in advance, which makes real-time processing difficult. Even if some technique made it possible to estimate the AR coefficients accurately in real time, the additional processing would unavoidably increase the amount of computation.

The present invention has been made in view of these points, and its object is to provide a simple Kalman-filter-based noise suppression device and noise suppression method that do not require estimation of AR coefficients and have a high noise suppression capability.

The noise suppression device of the present invention comprises acquisition means for acquiring information in which unnecessary information is mixed with desired information, and extraction means for removing the unnecessary information from the acquired information and extracting the desired information using only a Kalman filter, wherein the Kalman filter is configured so that the coefficients of an autoregressive model are not used in the observation equation of the state space model.

The noise suppression method of the present invention comprises an acquisition step of acquiring information in which unnecessary information is mixed with desired information, and an extraction step of removing the unnecessary information from the acquired information and extracting the desired information using only a Kalman filter, wherein the Kalman filter is configured so that the coefficients of an autoregressive model are not used in the observation equation of the state space model.

According to the present invention, the noise suppression capability can be improved with a simple configuration that uses a Kalman filter but does not require estimation of AR coefficients.

Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

(Embodiment 1)
FIG. 1 is a block diagram showing the configuration of a noise suppression device according to Embodiment 1 of the present invention.

The noise suppression device 100 shown in FIG. 1 has an input unit 110, a sampling unit 120, an A/D conversion unit 130, a buffer 140, a noise suppression processing unit 150, and an output unit 160.

The input unit 110 receives an observation signal. The observation signal is the sum of the clear signal from the signal source and an additive noise signal. The input unit 110 applies input processing to the received analog observation signal, for example, and outputs the result to the sampling unit 120. The input processing is, for example, band limiting or automatic gain control.

The sampling unit 120 samples the input analog observation signal at a predetermined sampling frequency (for example, 16 kHz) and outputs the sampled signal to the A/D conversion unit 130. The A/D conversion unit 130 A/D-converts the amplitude values of the sampled observation signal at a predetermined resolution (for example, 8 bits) and sends the result to the buffer 140. The buffer 140 outputs signal frames (blocks) of a predetermined number of samples N to the noise suppression processing unit 150.
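The path from the A/D conversion unit 130 through the buffer 140 can be illustrated with a minimal Python sketch. The helper names `quantize` and `frames`, the uniform quantizer, the full-scale value, and the choice to drop an incomplete final frame are all assumptions made for illustration; the patent specifies only an example resolution (8 bits) and a frame length N.

```python
def quantize(x, bits=8, full_scale=1.0):
    """Uniformly quantize an amplitude to a signed integer code (assumed quantizer)."""
    levels = 2 ** (bits - 1)                     # 128 steps per polarity for 8 bits
    code = int(round(x / full_scale * levels))
    return max(-levels, min(levels - 1, code))   # clip to the representable range


def frames(samples, n):
    """Split a sample sequence into consecutive frames of n samples.

    An incomplete final frame is dropped (an assumption of this sketch).
    """
    return [samples[i:i + n] for i in range(0, len(samples) - n + 1, n)]
```

Each frame returned by `frames` would correspond to one block of N samples handed to the noise suppression processing unit 150.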

The noise suppression processing unit 150 is the characteristic component of the present invention and incorporates a Kalman filter that does not use AR coefficients. The conventional noise suppression method based on a Kalman filter (see Non-Patent Document 3; hereinafter "the conventional method") estimates the AR coefficients by linear prediction and then runs a Kalman filter that uses the result. In contrast, the noise suppression method of the present invention (hereinafter "the present method") achieves noise suppression using only a Kalman filter. To this end, the present method constructs the state equation using only the clear signal from the signal source, and constructs the observation equation using that clear signal and the additive noise signal. The Kalman-filter-based noise suppression processing performed by the noise suppression processing unit 150 is described in detail below.

The observation signal r(n) input to the noise suppression processing unit 150 contains an additive noise signal v(n) in addition to the clear signal d(n) from the signal source (for example, a speech signal), and satisfies the following equation (1).

Here, n is the time index of the device. In the discrete time series generated by the sampling unit 120, time n is the n-th sample time counted from the processing start time, which is taken as time 0.

In the conventional modeling method based on an AR process (the conventional method), the signal d(n) is assumed to be modeled by the following equation (2) using AR coefficients.

Here, α_k(n) is the AR coefficient at time n, K is the order of the AR model, and e(n) is the prediction error (modeling error) that arises when the signal d(n) is modeled by the K-th order AR process of equation (2).

As is well known, the conventional method presupposes that the additive noise signal v(n) is zero-mean white noise. In other words, in the conventional method the signal d(n) and the additive noise signal v(n) are uncorrelated; that is, the following equation (3) is satisfied.

Here, E[·] denotes the ensemble average.
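The equation images for (1) to (3) are not reproduced in this text. Under the definitions given above, they presumably take a form like the following; this is a reconstruction offered as an assumption, and the sign and index conventions in (2) and the exact statement of (3) may differ from the published equations.

```latex
\begin{align}
r(n) &= d(n) + v(n) \tag{1} \\
d(n) &= \sum_{k=1}^{K} \alpha_k(n)\, d(n-k) + e(n) \tag{2} \\
E\left[ d(n)\, v(l) \right] &= 0 \tag{3}
\end{align}
```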

The AR coefficients are estimated by an AR coefficient estimation algorithm. The most important point about the conventional method is that it requires accurate estimation of the AR coefficients in order to achieve high-performance noise suppression with the Kalman filter. Because the noise suppression capability of the Kalman filter therefore depends heavily on the estimation accuracy of the AR coefficients, it is easy to see that the capability degrades considerably when the estimate is inaccurate.

In the present method, the state space model of the Kalman filter is constructed without using AR coefficients: the state equation is constructed using only the clear signal from the signal source, and the observation equation is constructed using that clear signal and the additive noise signal.

The construction of the state space model in the present method is described below. To simplify notation, a K×1 signal vector x_p(n) is first defined by the following equation (4). The subscript "p" indicates a quantity introduced by the present invention.

Next, a model of the signal d(n) is constructed in the same manner as equation (2). Although AR coefficients are not used, an unknown K×K state transition matrix Φ_p(n+1) and a K×1 driving noise vector δ_p(n+1) are introduced as in the conventional method. The state equation of the state space model of the present embodiment (described in terms of the signal vector) then takes the form of the following equation (5).

For equation (5) to hold as an equality without using AR coefficients, the K×K state transition matrix Φ_p(n+1) and the K×1 driving noise vector δ_p(n+1) must satisfy the following equations (6) and (7).

Therefore, equation (5) can be rewritten as the following equation (8).

The elements of the K×K state transition matrix Φ_p(n+1) given by equation (6) are only 0 and 1, and in particular the first row is entirely 0. This is one of the features of the state space model of the present method. It should also be noted that, as is clear from the above, no precondition is imposed on the driving noise vector δ_p(n+1) in deriving equations (5), (6), and (7). That is, the driving noise vector δ_p(n+1) may be colored. The reason is explained later.
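The equation images for (4) to (7) are not reproduced in this text. One form that satisfies every property stated above (a K×1 signal vector, a transition matrix containing only 0 and 1 with an all-zero first row, and a state equation that holds as an equality without AR coefficients) is the following. It is a reconstruction under those stated properties, not the published equations; equation (8) would then be equation (5) with these explicit forms substituted.

```latex
\begin{align}
x_p(n) &= \left[ d(n),\ d(n-1),\ \dots,\ d(n-K+1) \right]^{T} \tag{4} \\
x_p(n+1) &= \Phi_p(n+1)\, x_p(n) + \delta_p(n+1) \tag{5} \\
\Phi_p(n+1) &=
\begin{bmatrix}
0 & 0 & \cdots & 0 & 0 \\
1 & 0 & \cdots & 0 & 0 \\
0 & 1 & \cdots & 0 & 0 \\
\vdots & & \ddots & & \vdots \\
0 & 0 & \cdots & 1 & 0
\end{bmatrix} \tag{6} \\
\delta_p(n+1) &= \left[ d(n+1),\ 0,\ \dots,\ 0 \right]^{T} \tag{7}
\end{align}
```

In this form the matrix only shifts the stored samples down by one position, and the newest sample d(n+1) enters entirely through the driving noise vector, which is why no whiteness assumption on δ_p(n+1) is needed for the equality to hold.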

In the Kalman filter algorithm of the present method, a K×1 observation vector y_p(n+1) is defined by the following equation (9) in order to avoid singular value decomposition.

Therefore, from equations (1) and (9), the following equation (10) is derived as the observation equation of the state space model of the present method.

Here, M_p is a K×K state transition matrix and ε_p is a K×1 additive noise vector, defined by the following equations (11) and (12), respectively.

From the above, equations (5) and (10) have been obtained as the state equation and the observation equation, respectively, of the state space model used in the Kalman filter algorithm of the present method.
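The state space model described above can be checked numerically with a short Python sketch. The explicit forms used here are assumptions, since the equation images are missing: the signal vector stacks the K most recent samples, Φ_p is a shift matrix with an all-zero first row, δ_p(n+1) carries only d(n+1), and M_p is taken to be the identity so that the observation vector stacks the K most recent observations. The sample values and helper names are hypothetical.

```python
def signal_vector(sig, n, K):
    """[sig(n), sig(n-1), ..., sig(n-K+1)], with zeros before time 0."""
    return [sig[n - i] if n - i >= 0 else 0.0 for i in range(K)]


def transition_matrix(K):
    """K x K matrix of 0s and 1s: all-zero first row, ones on the subdiagonal."""
    return [[1.0 if row == col + 1 else 0.0 for col in range(K)] for row in range(K)]


def mat_vec(A, x):
    """Matrix-vector product for plain nested lists."""
    return [sum(a * b for a, b in zip(row, x)) for row in A]


K = 3
d = [0.5, -1.0, 2.0, 1.5, -0.5]        # hypothetical clear signal
v = [0.1, 0.2, -0.1, 0.0, 0.3]         # hypothetical additive noise
r = [di + vi for di, vi in zip(d, v)]  # observation signal, r(n) = d(n) + v(n)

n = 3
phi = transition_matrix(K)
delta = [d[n + 1]] + [0.0] * (K - 1)   # driving noise: only the newest sample

# State equation: x_p(n+1) = Phi_p x_p(n) + delta_p(n+1)
x_next = [a + b for a, b in zip(mat_vec(phi, signal_vector(d, n, K)), delta)]

# Observation equation with M_p assumed to be the identity:
# y_p(n+1) = x_p(n+1) + eps_p(n+1)
y_next = [a + b for a, b in zip(x_next, signal_vector(v, n + 1, K))]
```

With these definitions `x_next` reproduces the stacked clear samples at time n+1 and `y_next` reproduces the stacked observations, without any AR coefficient appearing.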

FIG. 2 is a block diagram illustrating the state space model of the noise suppression device according to the present embodiment.

In FIG. 2, 500 is the signal vector x_p(n) at time n, 501 is the signal vector x_p(n+1) at time n+1, 502 is the observation vector y_p(n) at time n, 503 is the additive noise vector ε_p(n) at time n, 504 is the driving noise vector δ_p(n+1) at time n+1, 505 is the state transition matrix Φ_p, and 506 is the state transition matrix M_p.

As can be understood from FIG. 2, the Kalman filter algorithm of the present method can be executed by the same procedure as a conventional Kalman filter algorithm even though the driving noise is colored (the reason is given later). Signal estimation based on the Kalman filter algorithm of the present method is performed according to the flowcharts described below.

FIG. 3 is a flowchart illustrating the signal estimation procedure based on the Kalman filter algorithm in the noise suppression device according to the present embodiment.

First, the initial value x_e(0|0) of the estimated signal vector, the initial value P(0|0) of the estimation error covariance matrix, and the value of the covariance Rε_p(n)[i, j] of the additive noise vector (a scalar quantity) are set as shown in the following equation (13) (ST100).

Here, I is the identity matrix, and σ_v² is the noise variance of the additive noise signal v(n), which is assumed to be known. If the additive noise signal v(n) is zero-mean white noise, σ_v² is given by the following equation (14).

Here, N is the predetermined number of samples.
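The image of equation (14) is not reproduced in this text. For zero-mean white noise, a natural reading is the mean-square value over N samples, which the following Python sketch computes. Both that reading and the function name `noise_variance` are assumptions, as is the availability of a noise-only segment v(0), ..., v(N-1) from which to compute it.

```python
def noise_variance(v):
    """Mean-square value of a zero-mean noise segment v(0), ..., v(N-1)."""
    n = len(v)
    return sum(x * x for x in v) / n
```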

Next, the Kalman filter algorithm is executed (ST101). Its processing procedure is described in detail later.

Next, it is determined whether time n has reached the predetermined number of samples N (ST102). If it has not (ST102: NO), the process returns to step ST101 and the Kalman filter algorithm is repeated. If it has (ST102: YES), the series of processing ends.

In the present method, the state space model is set up without using AR coefficients. The AR coefficient estimation step required by the conventional method can therefore be eliminated, and signal estimation becomes possible in a single stage. This is one of the major features of the present invention.

FIG. 4 is a flowchart showing the processing of the Kalman filter algorithm of FIG. 3.

Here, the start time is n+1. In the flow below, the estimated signal vector x_pe(n+1|n) at time n+1 is estimated from the observation signal r(n+1) and the observation vector y_p(n+1) at time n+1 and the estimation error covariance matrix P(n|n) at time n, and at the same time the estimation error covariance matrix P(n+1|n+1) is updated.

First, the value of the covariance Rδ_p(n)[i, j] of the driving noise vector is calculated using the following equation (15) (ST200).

Next, the estimation error covariance matrix P(n+1|n) is calculated and updated (ST201). The update equation for P(n+1|n) is the following equation (16). Here and below, the subscripts "pe" and "e" indicate estimated values.

Expanding equation (16) gives the following equation (17).

Attention is now turned to the third and fourth terms of equation (17). These terms contain the ensemble average of the estimated signal vector x_pe(n|n) and the driving noise vector δ_p(n|n). Therefore, when x_pe(n|n) and δ_p(n|n) are not uncorrelated, in other words when the driving noise vector δ_p(n|n) is colored, the third and fourth terms do not become zero. Whether the Kalman filter algorithm of the present method can be applied with the same procedure as a conventional Kalman filter algorithm when the driving noise is colored thus depends on how strongly the third and fourth terms affect the update of the estimation error covariance matrix P(n+1|n) given by equation (16).

Attention is therefore focused on the third term of equation (17). The third term can be written as the following equation (18).

Here, with the substitution given by the following equation (19), the following equation (20) is obtained.

Taking equation (5) and the following equation (21) into account, equation (18) can be rewritten as the following equation (22), where the quantity involved is given by the following equation (23).

Here, d_e(n) is the estimated signal at time n.

The first term of equation (23) is an ensemble average of true values, and the second term is an ensemble average of estimated values. Their difference is therefore small, and e(l+1) is negligible. Accordingly, the update equation (16) for the estimation error covariance matrix P(n+1|n) can be rewritten as the following equation (24). It follows that, even when the driving noise is colored, the same Kalman filter algorithm procedure as in the conventional method can be applied to the state space model of the present method (the state equation and observation equation) formed by equations (5) and (10).

Next, the Kalman gain K_p(n+1) is calculated (ST202). The Kalman gain K_p(n+1) is given by the following equation (25).

From these values, the estimated signal vector x_pe(n+1|n) is calculated first (ST203). It is given by the following equation (26).

Then the estimated signal vector x_pe(n+1|n+1) is calculated (ST204). It is given by the following equation (27).

Finally, the estimation error covariance matrix P(n+1|n+1) is calculated and updated (ST205), and the flow ends. P(n+1|n+1) is given by the following equation (28).
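The equation images for (24) to (28) are not reproduced in this text. Since the passage states that the conventional Kalman filter procedure applies once the cross terms are neglected, steps ST201 to ST205 presumably correspond to the textbook Kalman recursion written in this document's symbols, as below. This is the standard form, given as an assumption about the published equations; the time arguments of the two noise covariances are omitted because they cannot be confirmed from the text.

```latex
\begin{align}
P(n+1|n) &= \Phi_p\, P(n|n)\, \Phi_p^{T} + R_{\delta_p} \tag{24} \\
K_p(n+1) &= P(n+1|n)\, M_p^{T} \left[ M_p\, P(n+1|n)\, M_p^{T} + R_{\varepsilon_p} \right]^{-1} \tag{25} \\
x_{pe}(n+1|n) &= \Phi_p\, x_{pe}(n|n) \tag{26} \\
x_{pe}(n+1|n+1) &= x_{pe}(n+1|n) + K_p(n+1) \left[ y_p(n+1) - M_p\, x_{pe}(n+1|n) \right] \tag{27} \\
P(n+1|n+1) &= \left[ I - K_p(n+1)\, M_p \right] P(n+1|n) \tag{28}
\end{align}
```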

The noise suppression processing unit 150 outputs the estimated signal obtained by the above procedure and algorithm to the output unit 160. The output unit 160 is composed of, for example, a speaker, a display, communication means, or a storage device.

The problem with the conventional Kalman filter, which requires estimation of AR coefficients, is that the capability of the Kalman filter algorithm depends on the estimation accuracy of the AR coefficients. In contrast, the present method is executed with the Kalman filter algorithm alone and therefore does not depend on the estimation accuracy of AR coefficients.

The Kalman filter algorithm of the present method can also be executed by a further improved method (hereinafter "the improved method"), which is described below.

The state equation of the Kalman filter algorithm of the improved method is identical to equation (5) above. The observation equation, however, is rewritten as the following equation (29). The observation equation (29) of the improved method is obtained by extracting some of the elements of the observation equation (10) of the present method.

Here, y'_p(n+1), m_p^T, and ε'_p(n+1) are the observation signal, the state transition vector, and the noise signal of the improved method, respectively, and satisfy the following equation (30).
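The equation images for (29) and (30) are not reproduced in this text. Since the improved observation equation is described as an extract of the vector observation equation, one consistent reading is that it keeps only the row corresponding to the newest sample. The following reconstruction rests on that assumption and may differ from the published equations.

```latex
\begin{align}
y'_p(n+1) &= m_p^{T}\, x_p(n+1) + \varepsilon'_p(n+1) \tag{29} \\
y'_p(n+1) &= r(n+1), \qquad
m_p = \left[ 1,\ 0,\ \dots,\ 0 \right]^{T}, \qquad
\varepsilon'_p(n+1) = v(n+1) \tag{30}
\end{align}
```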

One major feature of the improved method, as can be seen by comparing the observation equation (10) of the present method with the observation equation (29) of the improved method, is that the observation vector y_p(n+1) and the noise vector ε_p(n+1) of the method before improvement (the present method) are replaced by scalars, and the state transition matrix M_p is replaced by a vector. The improved method can therefore clearly be executed with a smaller amount of computation. It should also be noted that, as in the present method, no precondition is imposed on the driving noise vector δ_p(n+1) in the improved method. That is, the driving noise vector δ_p(n+1) may be colored in the improved method as well.

That is, writing out again in matrix form the estimation error covariance matrix P(n+1|n) given by equation (17) yields the following equation (31).

Here and in the equations below, the diagonally hatched portions of the matrices are the portions that are affected when the driving noise is colored.

The Kalman filter algorithm of the improved method differs from that of the present method shown in FIG. 4 in the formulas used to calculate the Kalman gain K_p(n+1) (ST202) and the estimated signal vector x_pe(n+1|n+1) (ST204).

That is, the Kalman gain K_p(n+1) is calculated by the following equation (32), where the following equation (33) holds when the covariance of the noise signal ε'_p(n+1) is σ_v².

The estimated signal vector x_pe(n+1|n+1) is calculated by the following equation (34).

Looking at the first element of x_pe(n+1|n+1) in equation (34), it is not hatched (that is, it is not affected by the colored noise), so an estimate of the clear signal can be obtained even when the driving noise is colored. The improved method can therefore also be executed with a conventional Kalman filter algorithm, as can the present method. As already stated, the improved method requires less computation than the present method. However, whereas the present method uses the observation vector y_p(n+1), the improved method uses the observation signal y'_p(n+1), which is a scalar. Because of this difference in size, the present method, which can make more active use of past observations, clearly has the higher noise suppression capability.
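The improved method can be sketched in Python as a one-stage filter with a scalar observation. Several points are assumptions of this sketch rather than the patent's equations, whose images are missing: the shift-matrix form of Φ_p, m_p = [1, 0, ..., 0]^T, zero initial state with identity initial covariance, neglect of the colored-noise cross terms, and a driving noise variance estimated per frame as observed power minus noise power. The function name `improved_kalman` is hypothetical.

```python
def improved_kalman(r, K, sigma_v2):
    """Scalar-observation Kalman filter sketch; returns one estimate per sample."""
    # Assumed driving-noise variance: observed power minus noise power, floored at 0.
    r_delta = max(sum(s * s for s in r) / len(r) - sigma_v2, 0.0)

    x = [0.0] * K                                                       # assumed x_pe(0|0) = 0
    P = [[1.0 if i == j else 0.0 for j in range(K)] for i in range(K)]  # assumed P(0|0) = I
    out = []
    for obs in r:
        # Prediction: Phi_p shifts the state down by one; its first row is all zeros.
        x_pred = [0.0] + x[:-1]
        # P(n+1|n) = Phi P Phi^T + R_delta: shift P down-right, R_delta enters at [0][0].
        P_pred = [[0.0] * K for _ in range(K)]
        for i in range(1, K):
            for j in range(1, K):
                P_pred[i][j] = P[i - 1][j - 1]
        P_pred[0][0] = r_delta

        # Gain with m_p = [1, 0, ..., 0]^T: first column of P_pred over a scalar.
        denom = P_pred[0][0] + sigma_v2
        k = [P_pred[i][0] / denom if denom > 0.0 else 0.0 for i in range(K)]

        # Filtering with the scalar innovation obs - m_p^T x_pred.
        innov = obs - x_pred[0]
        x = [x_pred[i] + k[i] * innov for i in range(K)]
        # P(n+1|n+1) = (I - k m_p^T) P(n+1|n)
        P = [[P_pred[i][j] - k[i] * P_pred[0][j] for j in range(K)] for i in range(K)]
        out.append(x[0])                 # first element is the estimate of d(n+1)
    return out
```

No matrix inversion is needed because the innovation is a scalar, which is the source of the computational saving described above. Under the assumptions of this sketch, the first-element estimate reduces to the observation scaled by r_delta / (r_delta + sigma_v2); a faithful implementation of the published equations may behave differently.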

以上をまとめると、本発明における上記二つの手法（本手法および改良手法）は、カルマンフィルタに必要な状態空間モデルの観測方程式を変更することによって、演算量を大幅に減少させることが可能である。より具体的には、本発明における上記二つの手法では、ＡＲ係数の推定を必要としないため、その演算量は従来手法に比べて少ない。また、改良手法では、演算に用いる変数のサイズが本手法に比べて小さいため、その演算量は本手法に比べて少ない。すなわち、演算量に関して、
改良手法＜本手法＜従来手法
である。 In summary, the above two methods in the present invention (the present method and the improved method) can greatly reduce the amount of calculation by changing the observation equation of the state space model necessary for the Kalman filter. More specifically, the above two methods in the present invention do not require estimation of the AR coefficient, and therefore the amount of calculation is less than that of the conventional method. Further, in the improved method, the size of the variable used for the calculation is smaller than that of the present method, so that the amount of calculation is smaller than that of the present method. That is, regarding the amount of calculation,
Improved method <this method <conventional method.

また、本発明における上記二つの手法を、例えば、半導体集積回路や半導体回路などのハードウエアとして実施する場合、また、パーソナルコンピュータなどで実行可能なソフトウエアとして実施する場合のいずれにおいても、その構成は従来手法よりも単純化される。したがって、本発明における二つの手法を用いれば、回路規模やプログラム量が大幅に低減できるであろうことは明らかである。 Whether the above two methods of the present invention are implemented as hardware, such as a semiconductor integrated circuit or a semiconductor circuit, or as software executable on a personal computer or the like, the configuration is simpler than that of the conventional method. It is therefore clear that using the two methods of the present invention can greatly reduce the circuit scale and the program size.

また、本発明における上記二つの手法は、カルマンフィルタに必要な状態空間モデルの観測方程式を変更することによって、雑音抑圧能力を大幅に向上させることが可能である。より具体的には、本発明における上記二つの手法では、ＡＲ係数の推定を必要としないため、雑音抑圧能力がＡＲ係数の推定精度に依存する従来手法に比べて、雑音抑圧能力が高い。また、改良手法では、演算に用いる変数のサイズが本手法に比べて小さいため、過去の観測量を積極的に用いていない。したがって、その雑音抑圧能力は本手法に比べて低い。すなわち、雑音抑圧能力に関して、
従来手法＜改良手法＜本手法
である。 In addition, the above two methods in the present invention can greatly improve the noise suppression capability by changing the observation equation of the state space model necessary for the Kalman filter. More specifically, since the above two methods in the present invention do not require estimation of AR coefficients, the noise suppression capability is higher than the conventional methods in which the noise suppression capability depends on the estimation accuracy of the AR coefficient. In the improved method, since the size of the variable used for the calculation is smaller than that of the present method, the past observation amount is not actively used. Therefore, its noise suppression capability is low compared to this method. That is, regarding noise suppression capability,
Conventional method <improved method <this method.

例えば、音響分野において、音質を若干落としても（ただし、従来手法よりは高い）、演算速度を速くしたい場合に、改良手法は有効である。 For example, in the acoustic field, the improved technique is effective when it is desired to increase the calculation speed even if the sound quality is slightly reduced (but higher than the conventional technique).

ＡＲ係数を用いない本発明における上記二つの手法は、状態方程式の駆動雑音が有色信号となっており、従来のカルマンフィルタの仮定、つまり、駆動雑音が白色信号であるという仮定に反している。しかしながら、本発明の状態空間モデルにおける状態方程式と観測方程式の構造的な性質により、駆動雑音が白色信号である従来のカルマンフィルタアルゴリズムと全く同じアルゴリズムで実行可能であり、その理由は上記の通りである。また、本発明の改良手法においても、同様に実行可能であることは上記より明らかである。 In the above two methods of the present invention, which do not use AR coefficients, the driving noise of the state equation is a colored signal, which violates the assumption of the conventional Kalman filter that the driving noise is a white signal. However, owing to the structural properties of the state equation and the observation equation in the state space model of the present invention, the methods can be executed with exactly the same algorithm as the conventional Kalman filter algorithm for white driving noise, for the reasons given above. It is also clear from the above that the improved method of the present invention can be executed in the same way.

本発明における上記二つの方法は、ＡＲ係数を使わないようにカルマンフィルタを構成する。言い換えると、カルマンフィルタのみで実行可能な雑音抑圧方法であるため、従来の雑音抑圧能力の問題を解決していることは明らかである。 In the above two methods in the present invention, the Kalman filter is configured not to use the AR coefficient. In other words, since it is a noise suppression method that can be executed only by the Kalman filter, it is clear that the conventional problem of noise suppression capability is solved.

（実施の形態２）
実施の形態２は、実施の形態１に示す雑音抑圧装置を音声に適用した場合である。 (Embodiment 2)
The second embodiment is a case where the noise suppression apparatus shown in the first embodiment is applied to speech.

図５は、本発明の実施の形態２に係る雑音抑圧装置の構成を示すブロック図である。 FIG. 5 is a block diagram showing the configuration of the noise suppression apparatus according to Embodiment 2 of the present invention.

図５に示す雑音抑圧装置２００は、本実施の形態の雑音抑圧処理を実行できるパーソナルコンピュータ２１０、マイクロホン２２０、サンプリング部１２０、およびＡ／Ｄ変換部１３０を有する。 The noise suppression apparatus 200 shown in FIG. 5 includes a personal computer 210 capable of executing the noise suppression processing of the present embodiment, a microphone 220, a sampling unit 120, and an A/D conversion unit 130.

パーソナルコンピュータ２１０は、操作装置２１１、ディスプレイ２１２、バスインタフェース２１３、記録装置２１４、主記憶メモリ２１５、および中央演算装置２１６を有する。操作装置２１１は、典型的にはキーボートやマウスなどであるが、音声認識装置などを用いてもよい。使用者は、操作装置２１１を用い、ディスプレイ２１２で確認をしながらコンピュータを操作できる。 The personal computer 210 includes an operation device 211, a display 212, a bus interface 213, a recording device 214, a main storage memory 215, and a central processing unit 216. The operation device 211 is typically a keyboard or a mouse, but a voice recognition device or the like may be used. The user can operate the computer while confirming on the display 212 using the operation device 211.

パーソナルコンピュータ２１０において本実施の形態の雑音抑圧処理を実行させるプログラムソフトウエアは、記録装置２１４に格納されていてもよいし、バスインタフェース２１３を介して外部からダウンロードされてきてもよい。記録装置２１４は、典型的にはハードディスク装置であるが、ＣＤ−ＲＯＭ装置やＤＶＤ装置、フラッシュメモリなどの可搬性のあるものであってもよい。また、それらの組み合わせであってもよい。 The program software that causes the personal computer 210 to execute the noise suppression processing of the present embodiment may be stored in the recording device 214 or may be downloaded from outside via the bus interface 213. The recording device 214 is typically a hard disk device, but may be a portable medium such as a CD-ROM device, a DVD device, or a flash memory. A combination of these may also be used.

サンプリング部１２０およびＡ／Ｄ変換部１３０は、パーソナルコンピュータ２１０内部に格納された内蔵カード（ボード）であってもよいし、バスインタフェース２１３を経由して接続された外部設置型機器であってもよい。 The sampling unit 120 and the A/D conversion unit 130 may be internal cards (boards) housed in the personal computer 210, or may be external devices connected via the bus interface 213.

マイクロホン２２０からの観測音声信号は、サンプリング部１２０に入力される。サンプリング部１２０は、所定のサンプリング周波数（例えば、１６ｋＨｚ）で、入力されたアナログの観測音声信号をサンプリング処理し、Ａ／Ｄ変換部１３０に出力する。Ａ／Ｄ変換部１３０は、サンプリングされた観測音声信号の振幅値を所定の分解能（例えば、８ｂｉｔ）でＡ／Ｄ変換処理し、一時格納する。Ａ／Ｄ変換部１３０は、所定のサンプリング数Ｎの音声フレーム単位で、ディジタル化した観測音声信号をパーソナルコンピュータ２１０のバスインタフェース２１３に出力する。 The observation audio signal from the microphone 220 is input to the sampling unit 120. The sampling unit 120 performs sampling processing on the input analog observation voice signal at a predetermined sampling frequency (for example, 16 kHz), and outputs it to the A / D conversion unit 130. The A / D conversion unit 130 performs A / D conversion processing on the amplitude value of the sampled observation audio signal with a predetermined resolution (for example, 8 bits), and temporarily stores it. The A / D conversion unit 130 outputs the digitized observation audio signal to the bus interface 213 of the personal computer 210 in units of a predetermined sampling number N of audio frames.
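
The quantization and framing steps described above can be sketched as follows. This is a minimal illustration, not the device's actual processing: the signed 8-bit code range, the input range of -1.0 to 1.0, the dropping of a trailing partial frame, and the function names `quantize_8bit` and `split_into_frames` are all assumptions.

```python
def quantize_8bit(samples):
    """Map amplitudes in [-1.0, 1.0] to signed 8-bit codes (-128..127).
    The signed range is an assumption; the text only states 8 bits."""
    codes = []
    for s in samples:
        s = max(-1.0, min(1.0, s))  # clip out-of-range amplitudes
        codes.append(max(-128, min(127, round(s * 128))))
    return codes


def split_into_frames(codes, frame_len):
    """Group the code sequence into complete frames of frame_len samples.
    A trailing partial frame is dropped in this sketch."""
    n_frames = len(codes) // frame_len
    return [codes[i * frame_len:(i + 1) * frame_len] for i in range(n_frames)]
```

Each frame returned by `split_into_frames` corresponds to one block of N samples handed to the personal computer 210 for noise suppression.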

パーソナルコンピュータ２１０はバスインタフェース２１３に出力された観測音声信号を一時、主記憶メモリ２１５に格納し、その後、所定の音声フレーム（サンプリング数）単位で、雑音抑圧処理を施した上で、再度主記憶メモリ２１５に格納する。雑音抑圧処理は、主記憶メモリ２１５や記録装置２１４に格納されたソフトウエアをバスインタフェース２１３経由で中央演算装置２１６に呼び出し、実行させることで行われる。 The personal computer 210 temporarily stores the observed speech signal output to the bus interface 213 in the main memory 215, then applies noise suppression processing in units of a predetermined speech frame (number of samples) and stores the result in the main memory 215 again. The noise suppression processing is performed by loading the software stored in the main memory 215 or the recording device 214 into the central processing unit 216 via the bus interface 213 and executing it.

パーソナルコンピュータ２１０は、使用者の操作により、処理を実行したり、中断、終了させたりする。また使用者の操作により、処理した推定音声信号を外部に出力してもよい。 The personal computer 210 executes, interrupts, or terminates the processing according to the user's operation. The processed estimated speech signal may also be output to the outside according to the user's operation.

次に、本実施の形態における音声の雑音抑圧処理について、図面を参照しつつ説明する。 Next, speech noise suppression processing in the present embodiment will be described with reference to the drawings.

本実施の形態における音声の雑音抑圧の性能評価の目的で音声波形の数値シミュレーションを行った。 For the purpose of evaluating the performance of speech noise suppression in this embodiment, a numerical simulation of speech waveforms was performed.

図６は、本実施の形態における音声波形シミュレーションの第１の例の結果を示す図であり、図７は、本実施の形態における音声波形シミュレーションの第２の例の結果を示す図である。また、図８は、本実施の形態における音声波形シミュレーションの第３の例の結果を示す図であり、図９は、本実施の形態における音声波形シミュレーションの第４の例の結果を示す図である。 FIG. 6 shows the result of the first example of the speech waveform simulation in the present embodiment, and FIG. 7 shows the result of the second example. FIG. 8 shows the result of the third example of the speech waveform simulation in the present embodiment, and FIG. 9 shows the result of the fourth example.

日本人成人男性および女性のオリジナル音声信号（クリアな音声信号）は、無響室において、１６ｋＨｚのサンプリングレートでサンプリング、ディジタル化した。本例では、二つの音声信号サンプルを検討した。二つの音声信号サンプルは、（Ａ−１）図６（Ａ）に示す無声区間のないクリアな音声信号、（Ａ−２）図７（Ａ）に示す無声区間をもつクリアな音声信号である。それぞれの音声信号サンプルは、以後、音声（Ａ−１）および音声（Ａ−２）と参照する。 The original speech signals (clear speech signals) of Japanese adult male and female speakers were sampled and digitized in an anechoic chamber at a sampling rate of 16 kHz. In this example, two speech signal samples were examined: (A-1) a clear speech signal without an unvoiced section, shown in FIG. 6(A), and (A-2) a clear speech signal with an unvoiced section, shown in FIG. 7(A). These samples are hereinafter referred to as speech (A-1) and speech (A-2).

雑音信号は、人工的な付加雑音信号であり、本例では、二つの雑音信号サンプルを検討した。二つの雑音信号サンプルは、（Ｂ−１）図６（Ｂ）および図７（Ｂ）に示す付加白色ガウス雑音、（Ｂ−２）図８（Ｂ）および図９（Ｂ）に示す付加有色バブル雑音である。それぞれの雑音信号サンプルは、以後、雑音（Ｂ−１）および雑音（Ｂ−２）と参照される。 The noise signals are artificial additive noise signals, and two noise signal samples were examined in this example: (B-1) additive white Gaussian noise, shown in FIG. 6(B) and FIG. 7(B), and (B-2) additive colored babble noise, shown in FIG. 8(B) and FIG. 9(B). These samples are hereinafter referred to as noise (B-1) and noise (B-2).

雑音信号の雑音分散σ_ｖ ^２は既知として次の式（３５）で表されるものとする。すなわち、

である。ここで、Ｌは全音声信号サンプル数である。 The noise variance σ _v ² of the noise signal is assumed to be expressed by the following equation (35). That is,

Here, L is the total number of speech signal samples.

また、信号雑音比ＳＮＲ_ｉｎを式（３６）で定義する。

Further, the signal to noise ratio SNR _in is defined by Expression (36).
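
Equations (35) and (36) are not reproduced in this text. As a hedged illustration, the sketch below uses commonly used definitions: the noise variance as the mean power of a zero-mean noise sequence over L samples, and the input SNR as the ratio of speech energy to noise energy in decibels. These definitions and the function names are assumptions and may differ from the patent's exact formulas.

```python
import math


def noise_variance(v):
    """Mean power of a zero-mean noise sequence v of length L (assumed form)."""
    return sum(x * x for x in v) / len(v)


def snr_db(d, v):
    """10*log10 of speech energy over noise energy (assumed form of SNR_in)."""
    return 10.0 * math.log10(sum(x * x for x in d) / sum(x * x for x in v))


def scale_noise_to_snr(d, v, target_db):
    """Return v scaled so that snr_db(d, scaled_v) equals target_db."""
    energy_d = sum(x * x for x in d)
    energy_v = sum(x * x for x in v)
    gain = math.sqrt(energy_d / (energy_v * 10 ** (target_db / 10.0)))
    return [gain * x for x in v]
```

A helper such as `scale_noise_to_snr` is one way to prepare observed signals at a chosen input SNR before adding the noise to the clear speech.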

従来手法と本発明の方法（本手法）とによる雑音抑圧の結果を、音声（Ａ−１）および音声（Ａ−２）と雑音（Ｂ−１）の組み合わせ、ならびに音声（Ａ−１）および音声（Ａ−２）と雑音（Ｂ−２）の組み合わせからなる観測音声信号の信号波形から比較した。 The noise suppression results of the conventional method and of the method of the present invention (the present method) were compared using the waveforms of observed speech signals formed by combining speech (A-1) and speech (A-2) with noise (B-1), and by combining speech (A-1) and speech (A-2) with noise (B-2).

図６（Ｃ）と図７（Ｃ）は、それぞれ音声（Ａ−１）と雑音（Ｂ−１）、音声（Ａ−２）と雑音（Ｂ−１）からなる観測音声信号波形である。 FIGS. 6(C) and 7(C) show the observed speech signal waveforms consisting of speech (A-1) plus noise (B-1) and speech (A-2) plus noise (B-1), respectively.

図６（Ｄ）と図７（Ｄ）は、それぞれ音声（Ａ−１）と雑音（Ｂ−１）、音声（Ａ−２）と雑音（Ｂ−１）の合成波形に対して従来手法で雑音抑圧を行った後の推定音声信号波形である。 FIGS. 6(D) and 7(D) show the estimated speech signal waveforms obtained by applying the conventional method to the combined waveforms of speech (A-1) plus noise (B-1) and speech (A-2) plus noise (B-1), respectively.

一方、図６（Ｅ）と図７（Ｅ）は、それぞれ音声（Ａ−１）と雑音（Ｂ−１）、音声（Ａ−２）と雑音（Ｂ−１）の合成波形に対して本発明の方法で雑音抑圧を行った後の推定音声信号波形である。 On the other hand, FIGS. 6(E) and 7(E) show the estimated speech signal waveforms obtained by applying the method of the present invention to the combined waveforms of speech (A-1) plus noise (B-1) and speech (A-2) plus noise (B-1), respectively.

図６（Ｄ）と図７（Ｄ）に対して図６（Ａ）と図７（Ａ）に示すクリアな音声信号とを比較すると、従来手法による雑音抑圧では、雑音抑圧後に推定音声信号の振幅が小さくなっており、クリアな音声信号が抑圧されていることがわかる。加えて、従来手法による雑音抑圧では、サンプリング数の増加とともに、雑音抑圧後の推定音声信号の波形が、図６（Ａ）と図７（Ａ）に示すクリアな音声信号の波形から変形していく。 Comparing FIGS. 6(D) and 7(D) with the clear speech signals shown in FIGS. 6(A) and 7(A) shows that, with noise suppression by the conventional method, the amplitude of the estimated speech signal is reduced after suppression, meaning that the clear speech signal itself is suppressed. In addition, with the conventional method, the waveform of the estimated speech signal deviates further from the clear speech waveforms of FIGS. 6(A) and 7(A) as the number of samples increases.

さらに、従来手法の雑音抑圧では、無声区間を有する音声（Ａ−２）に対して、推定音声信号が抑圧されるだけでなく、無声区間においてオリジナルの雑音信号と異なる雑音が観察されている。これは、従来手法では、無声区間では信号ｄ（ｎ）がゼロであるにもかかわらず、式（２）でＡＲ係数を求めようとするため、ＡＲ係数の値は発散し、不安定な状態を与えるからと推測される。 Furthermore, with noise suppression by the conventional method, for speech (A-2), which has an unvoiced section, not only is the estimated speech signal suppressed, but noise different from the original noise signal is also observed in the unvoiced section. This is presumably because the conventional method tries to obtain the AR coefficients from equation (2) even though the signal d(n) is zero in the unvoiced section, so the AR coefficient values diverge and the filter becomes unstable.

また、このことから、雑音信号が有色の場合、従来手法の適用は困難であろうことは容易に推測される。 From this, it can be easily estimated that application of the conventional method will be difficult when the noise signal is colored.

従来手法とは対照的に、図６（Ｅ）および図７（Ｅ）に示す本発明の雑音抑圧では、雑音抑圧後の推定音声信号の波形が、図６（Ａ）と図７（Ａ）に示すクリアな音声信号の波形と非常に似通っている。 In contrast to the conventional method, in the noise suppression of the present invention shown in FIGS. 6 (E) and 7 (E), the waveform of the estimated speech signal after noise suppression is shown in FIGS. 6 (A) and 7 (A). It is very similar to the waveform of a clear audio signal shown in

次に、図８（Ｄ）と図９（Ｄ）に対して図８（Ａ）と図９（Ａ）に示すクリアな音声信号とを比較すると、従来手法による雑音抑圧では、雑音（Ｂ−２）を含む観測音声信号に対して、非常に劣った結果を与えていることがわかる。これは、従来手法では、有色雑音である雑音（Ｂ−２）を含んだ観測音声信号に対してＡＲ係数を正確に推定することが困難であるためである。 Next, comparing FIGS. 8(D) and 9(D) with the clear speech signals shown in FIGS. 8(A) and 9(A) shows that noise suppression by the conventional method gives very poor results for observed speech signals containing noise (B-2). This is because it is difficult for the conventional method to estimate the AR coefficients accurately from an observed speech signal that contains noise (B-2), which is colored noise.

一方、本発明の雑音抑圧方法では、雑音（Ｂ−２）の場合も雑音（Ｂ−１）の場合と同程度の雑音抑圧が達成されている。 On the other hand, in the noise suppression method of the present invention, the same level of noise suppression is achieved in the case of noise (B-2) as in the case of noise (B-1).

すなわち、上述のように、本発明の雑音抑圧方法は、白色、有色雑音、無声区間の有無に関わらず有効である。これは、本発明の雑音抑圧方法の大きな特徴の一つである。 That is, as described above, the noise suppression method of the present invention is effective regardless of whether the noise is white or colored and whether or not unvoiced sections are present. This is one of the major features of the noise suppression method of the present invention.

本発明の雑音抑制方法が有色の付加雑音にも適用できることは、既述のカルマンアルゴリズムにおける推定誤差の共分散行列Ｐ（ｎ＋１｜ｎ）の更新式（２３）の導出過程からも明らかであるが、上述の本実施の形態の数値シミュレーションは、このことを支持している。 The fact that the noise suppression method of the present invention can also be applied to colored additive noise is obvious from the process of deriving the update equation (23) of the covariance matrix P (n + 1 | n) of the estimation error in the Kalman algorithm described above. The numerical simulation of the present embodiment described above supports this.

次に、雑音抑制性能を数値的に比較するため、雑音抑圧性能ＳＮＲ_ｏｕｔを式（３７）で定義し、その数値シミュレーションを行った。

ここで、ｄ_ｅｉ（ｎ）は推定音声信号を表す。 Next, in order to numerically compare the noise suppression performance, the noise suppression performance SNR _out was defined by Expression (37), and the numerical simulation was performed.

Here, d _ei (n) represents the estimated speech signal.
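
Equation (37) is not reproduced in this text. The sketch below assumes a commonly used form of output SNR: the energy of the clear speech signal divided by the energy of the estimation error, in decibels. The function name `snr_out_db` and the handling of a zero error are assumptions for illustration.

```python
import math


def snr_out_db(d, d_est):
    """10*log10( sum d(n)^2 / sum (d(n) - d_est(n))^2 ) in dB (assumed form)."""
    if len(d) != len(d_est):
        raise ValueError("signals must have the same length")
    signal_energy = sum(x * x for x in d)
    error_energy = sum((x - y) ** 2 for x, y in zip(d, d_est))
    if error_energy == 0.0:
        return math.inf  # perfect estimate
    return 10.0 * math.log10(signal_energy / error_energy)
```

Under this definition, a larger value means the estimated speech signal is closer to the clear speech signal.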

表１は、本実施の形態における雑音抑圧性能の数値シミュレーションの一例の結果を示す表であり、音声（Ａ−１）と雑音（Ｂ−１）の組み合わせにおける、幾つかのＳＮＲ_ｉｎ、Ｋ（従来手法においては式（２）におけるＡＲ係数の次数、本発明では状態遷移行列Φ_ｐの次数）の値における、従来手法と本発明の方法の雑音抑圧性能ＳＮＲ_ｏｕｔを比較して示している。 Table 1 shows the result of one example of the numerical simulation of noise suppression performance in the present embodiment. For the combination of speech (A-1) and noise (B-1), it compares the noise suppression performance SNR_out of the conventional method and of the method of the present invention for several values of SNR_in and K (the order of the AR coefficients in equation (2) for the conventional method, and the order of the state transition matrix Φ_p for the present invention).

表２は、本実施の形態における雑音抑圧性能の数値シミュレーションの他の例の結果を示す表であり、音声（Ａ−１）と雑音（Ｂ−２）の組み合わせにおける、幾つかのＳＮＲ_ｉｎ、Ｋの値における、従来手法と本発明の方法の雑音抑圧性能ＳＮＲ_ｏｕｔを比較して示している。 Table 2 shows the result of another example of the numerical simulation of noise suppression performance in the present embodiment. For the combination of speech (A-1) and noise (B-2), it compares the noise suppression performance SNR_out of the conventional method and of the method of the present invention for several values of SNR_in and K.

表１および表２を参照すると、本発明の方法は、すべてのＳＮＲ_ｉｎ、Ｋの値において、従来の方法に比べて雑音抑圧能力を改善していることがわかる。 Referring to Tables 1 and 2, it can be seen that the method of the present invention improves the noise suppression capability compared to the conventional method at all SNR _in and K values.

特に、表２に示す有色雑音の場合には、従来手法は非常に劣った結果を与えているのにも関わらず、本発明の方法は表１に示す白色雑音の場合と同程度の結果を示している。すなわち、本発明の雑音抑圧方法は、白色雑音、有色雑音両者に効果的で雑音の性質に堅牢な雑音抑圧方法であるといえる。 In particular, in the case of the colored noise shown in Table 2, the method of the present invention gives the same result as that of the white noise shown in Table 1, although the conventional method gives a very inferior result. Show. That is, it can be said that the noise suppression method of the present invention is a noise suppression method that is effective for both white noise and colored noise and is robust in noise characteristics.

また、表１および表２に見られるように、本発明の方法では、Ｋの値に対して雑音抑圧性能ＳＮＲ_ｏｕｔは安定であり、Ｋの値の増加に伴い増加する傾向にある。対照的に従来の方法では、表１に見られるように、Ｋの値に対して雑音抑圧性能ＳＮＲ_ｏｕｔは不安定である。これは、従来手法では、最適なＫの値、つまりＡＲ係数の次数を決定することが困難であることを意味している。 Further, as seen in Tables 1 and 2, in the method of the present invention, the noise suppression performance SNR _out is stable with respect to the value of K, and tends to increase as the value of K increases. In contrast, in the conventional method, as shown in Table 1, the noise suppression performance SNR _out is unstable with respect to the value of K. This means that it is difficult to determine the optimum value of K, that is, the order of the AR coefficient, in the conventional method.

ＡＲ係数推定を必要とする従来手法において最も問題になることは、ＡＲ係数の次数の正確な推定は一般に困難であることである。なぜなら、ＡＲ係数の次数の正確な推定は，例えば雑音除去であれば、クリアな音声信号に依存しているからである。 What is most problematic in the conventional method that requires AR coefficient estimation is that accurate estimation of the order of the AR coefficient is generally difficult. This is because accurate estimation of the order of the AR coefficient depends on a clear speech signal if, for example, noise removal is performed.

このことは、クリアな音声信号が既知でなければならないことを意味しているため、リアルタイム処理は困難となる。ＡＲ係数の次数が正確でない場合には、カルマンフィルタアルゴリズの性能が劣化することは容易に想像可能である。また、何らかの手法でリアルタイムに推定することが可能となったとしても、処理が増加することより演算量などの問題を避けることは不可能である。 This means that a clear audio signal must be known, making real-time processing difficult. If the order of the AR coefficient is not accurate, it can be easily imagined that the performance of the Kalman filter algorithm deteriorates. Even if it becomes possible to estimate in real time by some method, it is impossible to avoid problems such as the amount of computation due to the increase in processing.

次に、本実施の形態に係る推定音声信号の音声品質を評価するためにリスニングテストによる主観的評価を行った。 Next, in order to evaluate the voice quality of the estimated voice signal according to the present embodiment, a subjective evaluation by a listening test was performed.

音声品質評価に用いたオリジナル音声信号とオリジナル雑音信号は第２の実施の形態と同一であり、その説明は省略する。雑音信号は、異なるＳＮＲ_ｉｎ（０、５、および１０［ｄＢ］）で音声信号に加えた。 The original voice signal and the original noise signal used for the voice quality evaluation are the same as those in the second embodiment, and a description thereof will be omitted. The noise signal was added to the audio signal at different SNR _in (0, 5, and 10 [dB]).

音声品質評価は、ＡＣＲ（絶対範疇評価）に基づいた５段階ＭＯＳ（平均オピニオン値）を用いたリスニングテストにより行った。５０人の聴取者が雑音抑圧により得られた推定音声信号のうち幾つかを評価した。各々の聴取者は、ポイント１からポイント５を決定する。ポイント５が最良である。 The speech quality was evaluated by a listening test using a five-point MOS (mean opinion score) based on ACR (absolute category rating). Fifty listeners evaluated several of the estimated speech signals obtained by noise suppression. Each listener assigned a score from 1 to 5, with 5 being the best.
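
As a small illustration of the scoring described above, the sketch below averages listener ratings into a mean opinion score. The function name `mean_opinion_score` and the input checks are assumptions; the ratings in any usage would come from the listening test, not from this code.

```python
def mean_opinion_score(ratings):
    """Average a list of integer ACR ratings, each from 1 (worst) to 5 (best)."""
    if not ratings:
        raise ValueError("at least one rating is required")
    if any(r not in (1, 2, 3, 4, 5) for r in ratings):
        raise ValueError("ratings must be integers from 1 to 5")
    return sum(ratings) / len(ratings)
```

For the test described here, one such average would be computed per method and per SNR_in condition over the 50 listeners.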

図１０は、本実施の形態における雑音抑圧後の音声品質の主観的評価結果の一つの例を示す図であり、音声（Ａ−１）と雑音（Ｂ−１）の組み合わせにおける、従来手法と本発明の方法とリスニングテストの結果を比較して示している。 FIG. 10 shows one example of the subjective evaluation results of speech quality after noise suppression in the present embodiment. It compares the listening test results of the conventional method and of the method of the present invention for the combination of speech (A-1) and noise (B-1).

また、図１１は、本実施の形態における雑音抑圧後の音声品質の主観的評価結果の他の例を示す図であり、音声（Ａ−１）と雑音（Ｂ−２）の組み合わせにおける、従来手法と本発明の方法とリスニングテストの結果を比較して示している。 FIG. 11 shows another example of the subjective evaluation results of speech quality after noise suppression in the present embodiment. It compares the listening test results of the conventional method and of the method of the present invention for the combination of speech (A-1) and noise (B-2).

図１０および図１１から、本発明の方法で推定した音声信号のスコアは、すべてのＳＮＲ_ｉｎ値において従来手法のスコアより高いことがわかる。特にその差は、音声（Ａ−１）と雑音（Ｂ−２）の組み合わせに対して大きい。 FIGS. 10 and 11 show that the score of the speech signal estimated by the method of the present invention is higher than that of the conventional method at all SNR_in values. The difference is especially large for the combination of speech (A-1) and noise (B-2).

以上より、本発明の雑音抑圧方法は、音声信号の音声品質を犠牲にすることのない、白色雑音、有色雑音に効果的な優れた雑音抑圧方法であるといえる。 From the above, it can be said that the noise suppression method of the present invention is an excellent noise suppression method effective for white noise and colored noise without sacrificing the voice quality of the voice signal.

（実施の形態３）
本発明に係る雑音抑圧装置は、雑音抑圧以外の用途にも応用可能である。以下、その一つの応用例について説明する。 (Embodiment 3)
The noise suppression device according to the present invention can be applied to uses other than noise suppression. Hereinafter, one application example will be described.

図１２は、本発明の雑音抑圧方法、つまりＡＲ係数を必要としないカルマンフィルタが適用されたマルチキャリア受信装置の構成を示すブロック図である。 FIG. 12 is a block diagram showing a configuration of a multicarrier receiver to which the noise suppression method of the present invention, that is, a Kalman filter that does not require an AR coefficient is applied.

図１２に示すマルチキャリア受信装置３００は、主に、検波部３０１、ＧＩ（ガードインターバル）除去部３０２、チャネル等化部３０３、チャネル推定部３０４、ＦＦＴ部３０５、復調部３０６、および復号部３０７を有する。 The multicarrier receiving apparatus 300 shown in FIG. 12 mainly includes a detection unit 301, a GI (guard interval) removal unit 302, a channel equalization unit 303, a channel estimation unit 304, an FFT unit 305, a demodulation unit 306, and a decoding unit 307.

本実施の形態において、チャネル推定部３０４は、本発明の状態空間モデルを用いたカルマンフィルタに基づいたチャネル推定を実行できるように構成されている。より具体的には、チャネル推定部３０４は、サンプリング部１２０、Ａ／Ｄ変換部１３０、バッファ１４０、雑音抑圧処理部１５０、出力部１６０を有する構成をとる。 In the present embodiment, the channel estimation unit 304 is configured to be able to execute channel estimation based on a Kalman filter using the state space model of the present invention. More specifically, the channel estimation unit 304 has a configuration including a sampling unit 120, an A / D conversion unit 130, a buffer 140, a noise suppression processing unit 150, and an output unit 160.

検波部３０１は伝送路上で周波数選択性フェージング等の影響を受けた受信信号を、中間周波数で直交検波し、ＧＩ除去部３０２に出力する。ＧＩ除去部３０２は、受信信号のガードインターバルを除去し、シンボル単位に連なった信号をチャネル等化部３０３と、チャネル推定部３０４に出力する。 The detection unit 301 performs quadrature detection at the intermediate frequency on the received signal, which has been affected by frequency selective fading and the like on the transmission path, and outputs the result to the GI removal unit 302. The GI removal unit 302 removes the guard interval from the received signal and outputs the resulting sequence of symbols to the channel equalization unit 303 and the channel estimation unit 304.

チャネル推定部３０４に入力された信号は、入力部１１０に出力される。入力部は信号に所定の入力信号処理を施し、サンプリング部１２０に出力する。サンプリング部１２０は、所定のサンプリング周波数（例えば１６ｋＨｚ）で、入力されたアナログの受信信号をサンプリング処理し、Ａ／Ｄ変換部１３０に出力する。Ａ／Ｄ変換部１３０は、サンプリングされた受信信号の振幅値を所定の分解能（例えば８ｂｉｔ）でＡ／Ｄ変換処理し、バッファ１４０に送る。バッファ１４０は所定のサンプリング数Ｎの受信信号フレーム（ブロック）を雑音抑圧処理部１５０に出力する。 The signal input to the channel estimation unit 304 is output to the input unit 110. The input unit performs predetermined input signal processing on the signal and outputs the signal to the sampling unit 120. The sampling unit 120 samples the input analog reception signal at a predetermined sampling frequency (for example, 16 kHz) and outputs the sampled analog reception signal to the A / D conversion unit 130. The A / D converter 130 subjects the amplitude value of the sampled received signal to A / D conversion processing at a predetermined resolution (for example, 8 bits), and sends it to the buffer 140. The buffer 140 outputs a received signal frame (block) having a predetermined sampling number N to the noise suppression processing unit 150.

雑音抑圧処理部１５０は、カルマンフィルタに基づいたチャネル推定をサブキャリア全体、または各サブキャリアについて行い、推定された（サブ）チャネルゲインを出力部１６０に出力する。出力部１６０は、入力された推定チャネルゲインをチャネル等化部３０３に出力する。雑音抑圧処理部１５０での処理の詳細については後述する。 The noise suppression processing unit 150 performs channel estimation based on the Kalman filter either for all subcarriers together or for each subcarrier, and outputs the estimated (sub)channel gain to the output unit 160. The output unit 160 outputs the estimated channel gain to the channel equalization unit 303. Details of the processing in the noise suppression processing unit 150 are described later.

チャネル等化部３０３は、入力された推定チャネルゲインを用いて、ＧＩ除去部３０２から入力した信号の同期検波を行い、結果をＦＦＴ部３０５に出力する。ＦＦＴ部３０５は、フーリエ変換処理を行い、受信信号を各サブキャリア信号成分に分離して、復調部３０６に出力する。復調部３０６は信号の復調処理を行い、結果を復号部３０７に出力する。復号部３０７は、信号の復号を行い、デジタルデータを出力する。 Channel equalization section 303 performs synchronous detection of the signal input from GI removal section 302 using the input estimated channel gain, and outputs the result to FFT section 305. FFT section 305 performs a Fourier transform process, separates the received signal into subcarrier signal components, and outputs the result to demodulation section 306. Demodulation section 306 performs signal demodulation processing and outputs the result to decoding section 307. The decoding unit 307 decodes the signal and outputs digital data.
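
The order of the stages described above (GI removal, channel equalization, then FFT) can be sketched as follows. This is a simplified illustration under stated assumptions: a single flat complex channel gain is applied to the whole symbol, a plain DFT stands in for the FFT unit, and all function names are hypothetical. It is not the receiver's actual implementation.

```python
import cmath


def remove_guard_interval(symbol, gi_len):
    """Drop the first gi_len samples (the guard interval) of one received symbol."""
    return symbol[gi_len:]


def equalize(samples, channel_gain):
    """Coherent detection with one complex gain estimate h:
    multiply by conj(h) / |h|^2, which equals dividing by h."""
    h = channel_gain
    return [s * h.conjugate() / (abs(h) ** 2) for s in samples]


def dft(samples):
    """Plain O(N^2) discrete Fourier transform, standing in for the FFT unit."""
    n = len(samples)
    return [sum(samples[t] * cmath.exp(-2j * cmath.pi * k * t / n)
                for t in range(n))
            for k in range(n)]


def receive_symbol(symbol, gi_len, channel_gain):
    """GI removal -> channel equalization -> Fourier transform."""
    body = remove_guard_interval(symbol, gi_len)
    return dft(equalize(body, channel_gain))
```

In this sketch `channel_gain` plays the role of the estimated channel gain supplied by the channel estimation unit 304; demodulation and decoding are omitted.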

本実施の形態では、まず、フェージングチャンネルを既述の状態空間モデルで表現する。マルチキャリア通信において、第ｋサブキャリアのみに着目すると、その受信信号ｙ_ｋ（ｔ）は、次の式（３８）

となる。ここでｎは時刻、Ｎは総キャリア数、Ｓ_ｋ（ｔ）、Ｈ_ｋ（ｔ）は第ｋキャリアの送信信号、チャネルゲインである。 In this embodiment, first, the fading channel is expressed by the state space model described above. In multicarrier communication, when attention is focused only on the k-th subcarrier, the received signal y _k (t) is expressed by the following equation (38).

Here, n is the time, N is the total number of carriers, and S_k(t) and H_k(t) are the transmission signal and the channel gain of the k-th carrier, respectively.

表記を容易にするために、Ｎ×１次チャネルゲインベクトルｈ_p（ｎ）を次の式（３９）で定義する。

ここで、Ｋは後述の状態遷移行列の次数である。 To simplify the notation, the N×1 channel gain vector h_p(n) is defined by the following equation (39).

Here, K is the order of a state transition matrix described later.

本実施の形態では、本発明の状態空間モデルを用いてカルマンフィルタに基づき、チャネルゲインの推定値を適応的に求める。状態空間モデルの状態ベクトルｘ_ｐ（ｎ＋１）を次の式（４０）で定義する。

In the present embodiment, the estimated value of the channel gain is adaptively obtained based on the Kalman filter using the state space model of the present invention. The state vector x _p (n + 1) of the state space model is defined by the following equation (40).

伝送路特性の時間変動を表す状態方程式は、次の式（４１）で記述される。

また、観測方程式は、次の式（４２）で記述される。

ここで、δ_ｐ（ｎ＋１）は駆動雑音ベクトル、ε_ｐ（ｎ＋１）は雑音信号ベクトル、Ｃ_ｐ、Ｄ_ｐは状態遷移行列、Ｇは定数行列である。 The state equation representing the time variation of the transmission path characteristic is described by the following equation (41).

The observation equation is described by the following equation (42).

Here, δ _p (n + 1) is a drive noise vector, ε _p (n + 1) is a noise signal vector, C _p and D _p are state transition matrices, and G is a constant matrix.

駆動雑音ベクトルδ_ｐ（ｎ＋１）、雑音信号ベクトルε_ｐ（ｎ＋１）、状態遷移行列Ｃ_ｐ、Ｄ_ｐ、定数行列Ｇは以下の式（４３）から式（４８）で定義される。

ここで、Ｉ_Ｎ×Ｎ、０_Ｎ×ＮはＮ×Ｎの単位行列、零行列である。 The drive noise vector δ _p (n + 1), the noise signal vector ε _p (n + 1), the state transition matrices C _p and D _p , and the constant matrix G are defined by the following formulas (43) to (48).

Here, I _{N × N} and 0 _{N × N} are N × N unit matrices and zero matrices.

上記の定式化は、従来Vector Kalmanと呼ばれているチャネルゲイン推定法に、本発明の状態空間モデルを適用したものである。フェージングチャネル間の相関だけでなく、サブチャネル間の相関も利用しているため良い推定精度を有する。しかしながら、例えば、式（４０）の状態ベクトルｘ_ｐ（ｎ＋１）のサイズはＫ×Ｎであり、計算量が増大するといった問題がある。 The above formulation applies the state space model of the present invention to the channel gain estimation method conventionally called Vector Kalman. Because it uses not only the correlation between fading channels but also the correlation between subchannels, it achieves good estimation accuracy. However, the state vector x_p(n+1) of equation (40), for example, has size K×N, so the amount of computation becomes large.

この問題を改良したものが、従来Per Subcarrier Kalmanと呼ばれている手法である。この方法は、上記方法をサブキャリア単位に分割して処理を行うものである。本発明の状態空間モデルを適用した場合、上記、式（４０）から式（４８）をサブキャリアｋ単位に分割することで所望の式が得られる。 The technique conventionally called Per Subcarrier Kalman addresses this problem. It performs the above processing separately for each subcarrier. When the state space model of the present invention is applied, the desired equations are obtained by splitting equations (40) to (48) per subcarrier k.

サブキャリア単位への分割は、状態遷移行列Ｃ_ｐを例にとると、式（４２）において、Ｎ×Ｎの単位行列、零行列であるＩ_Ｎ×Ｎ、０_Ｎ×Ｎを、スカラー量である１，０に置き換えることに相当する。したがって、行列のサイズは１／Ｎになり、計算量を低減することができる。無線伝送の場合には、音響分野とは異なり、推定される信号の品質よりも処理速度が要求される。そのため、本方法はより実用的である。 Taking the state transition matrix C_p as an example, splitting per subcarrier corresponds to replacing the N×N identity and zero matrices I_{N×N} and 0_{N×N} in equation (42) with the scalars 1 and 0. The matrix size is therefore reduced to 1/N, which lowers the amount of computation. In wireless transmission, unlike in the acoustic field, processing speed is required more than the quality of the estimated signal. This method is therefore more practical.
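
The effect of replacing I_{N×N} and 0_{N×N} with 1 and 0 can be illustrated with a Kronecker product. Equations (43) to (48) are not reproduced in this text, so the subdiagonal shift pattern used below for the K×K structure is an assumption, as are the function names; only the size relationship is the point of the sketch.

```python
def kron(a, b):
    """Kronecker product of two matrices given as lists of lists."""
    return [[a_ij * b_kl for a_ij in a_row for b_kl in b_row]
            for a_row in a for b_row in b]


def identity(n):
    """n x n identity matrix."""
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]


def shift_matrix(k):
    """K x K matrix with ones on the first subdiagonal (assumed pattern)."""
    return [[1 if i == j + 1 else 0 for j in range(k)] for i in range(k)]


def block_transition(k, n):
    """Replace each 1/0 entry of the K x K pattern by I_NxN / 0_NxN."""
    return kron(shift_matrix(k), identity(n))
```

With `n = 1` the block matrix collapses to the K×K pattern itself, so each dimension shrinks by a factor of N compared with the joint formulation.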

次に、本実施の形態におけるチャネル推定精度の比較結果の一例について、図面を参照しつつ説明する。 Next, an example of a comparison result of channel estimation accuracy in the present embodiment will be described with reference to the drawings.

チャネル推定精度の比較検討は数値シミュレーションにより行った。数値シミュレーションの条件は次のように設定した。送信フレームは６４のシンボルからなり、そのうち４シンボルがパイロットシンボル、６０シンボルがデータシンボルである。総送信フレーム数は２００、総送信データシンボル数は１２０００である．評価量はＮＭＳＥ（Normalized Mean Square Error）を用いた。 The channel estimation accuracy was compared by numerical simulation under the following conditions. A transmission frame consists of 64 symbols, of which 4 are pilot symbols and 60 are data symbols. The total number of transmission frames is 200, and the total number of transmitted data symbols is 12000. NMSE (Normalized Mean Square Error) was used as the evaluation metric.
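
The frame arithmetic above and the evaluation metric can be checked with a short sketch. The NMSE form used here (squared estimation error divided by the squared magnitude of the true gains) is the common definition and is an assumption, since the text does not give the formula; the function and constant names are also hypothetical.

```python
def nmse(h_true, h_est):
    """Sum of squared estimation errors divided by the sum of squared
    magnitudes of the true channel gains (assumed definition)."""
    num = sum(abs(a - b) ** 2 for a, b in zip(h_true, h_est))
    den = sum(abs(a) ** 2 for a in h_true)
    return num / den


# Frame structure stated in the simulation conditions.
SYMBOLS_PER_FRAME = 64
PILOT_SYMBOLS = 4
FRAMES = 200

data_symbols_per_frame = SYMBOLS_PER_FRAME - PILOT_SYMBOLS
total_data_symbols = FRAMES * data_symbols_per_frame
```

The totals agree with the stated conditions: 60 data symbols per frame and 12000 data symbols over 200 frames.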

図１３は、本実施の形態におけるチャネル推定精度数値シミュレーションの一つの例の結果を示す図であり、ｆ_ＤＴ＝０.０４５、τ_ｍａｘ／（ＮＴ_ｓ）＝０.０８μｓにおける、信号雑音比ＳＮＲに対するＮＭＳＥ特性を示している。ここでｆ_Ｄは最大ドップラー周波数、ＴはＯＦＤＭシンボル周期、Ｔ_ｓはサンプリング間隔、τ_ｍａｘは最大遅延スプレッドである。図１３から、本発明の方法の推定精度が従来手法に比べ向上していることがわかる。これは、本発明の方法では、ＡＲ係数の推定誤差によるチャネルの推定精度の劣化が軽減できたためと考えられる。また、ｆ_Ｄが１８０Ｈｚより大きい場合、本発明の方法の性能向上がより大きなものとなることは明らかである。 FIG. 13 shows the result of one example of the numerical simulation of channel estimation accuracy in the present embodiment: the NMSE characteristic versus the signal-to-noise ratio SNR at f_D T = 0.045 and τ_max/(NT_s) = 0.08 μs. Here, f_D is the maximum Doppler frequency, T is the OFDM symbol period, T_s is the sampling interval, and τ_max is the maximum delay spread. FIG. 13 shows that the estimation accuracy of the method of the present invention is improved over that of the conventional method. This is presumably because the method of the present invention reduces the degradation of channel estimation accuracy caused by AR coefficient estimation errors. It is also clear that the performance improvement of the method of the present invention becomes larger when f_D exceeds 180 Hz.

FIG. 14 shows the result of another example of the numerical simulation of channel estimation accuracy in the present embodiment, namely the NMSE characteristic with respect to the maximum Doppler frequency f_D at SNR = 20 dB and τ_max/(NT_s) = 0.08 μs. This result shows that the method of the present invention performs better than the conventional method because its NMSE does not depend on the maximum Doppler frequency f_D. In other words, the effectiveness of the method of the present invention is confirmed even in an environment with severe fading fluctuation, where the influence of inter-channel interference becomes serious.

The present invention is not limited to the embodiments described above.

The noise suppression device of the present invention can extract a clear speech signal (necessary information) from a speech signal containing noise (obtained information). One possible embodiment is a noise removal device used as preprocessing for a speech recognition device, which is indispensable for car navigation devices and the like.

In the image field, a clear, deblurred image (necessary information) can be extracted from an image that has been blurred for some reason (obtained information), so the invention can be used as an image processing device.

Furthermore, it goes without saying that the present invention is applicable to communication and signal processing in general wherever a combination of modeling by an AR process and a Kalman filter algorithm has conventionally been used.

In the medical field, examining the condition of a pregnant woman's fetus has conventionally required expensive equipment that individuals cannot purchase, as well as a high level of expertise. According to the present invention, unnecessary sounds (unnecessary information) can be removed from the maternal heart sounds, fetal heart sounds, and other sounds obtained from the pregnant woman's body (obtained information), so that only the fetal heart sounds (necessary information) are extracted. The health of the fetus can then be easily checked from its heart sounds at home, without a hospital visit.

Each functional element used in the description of the above embodiments is realized, for example, as an integrated circuit. These elements may each be implemented as an individual chip, or some or all of them may be integrated into a single chip. An FPGA (Field Programmable Gate Array), which can be programmed after the integrated circuit is manufactured, or a reconfigurable processor, whose circuits can be reconfigured, may also be used.

Furthermore, the above embodiments are not limited to hardware and may be implemented in software, and vice versa. A combination of hardware and software may also be used.

The noise suppression device and noise suppression method according to the present invention are useful as a noise suppression device and noise suppression method that use a Kalman filter yet improve noise suppression capability with a simple configuration, without requiring estimation of AR coefficients.

FIG. 1: Block diagram showing the configuration of the noise suppression device according to Embodiment 1 of the present invention
FIG. 2: Block diagram explaining the state space model of the noise suppression device according to the present embodiment
FIG. 3: Flowchart explaining the signal estimation procedure based on the Kalman filter algorithm of the noise suppression device according to the present embodiment
FIG. 4: Flowchart showing the processing of the Kalman filter algorithm of FIG. 3
FIG. 5: Block diagram showing the configuration of the noise suppression device according to Embodiment 2 of the present invention
FIG. 6: Diagram showing the result of the first example of the speech waveform numerical simulation in the present embodiment
FIG. 7: Diagram showing the result of the second example of the speech waveform numerical simulation in the present embodiment
FIG. 8: Diagram showing the result of the third example of the speech waveform numerical simulation in the present embodiment
FIG. 9: Diagram showing the result of the fourth example of the speech waveform numerical simulation in the present embodiment
FIG. 10: Diagram showing one example of the subjective evaluation results of speech quality after noise suppression in the present embodiment
FIG. 11: Diagram showing another example of the subjective evaluation results of speech quality after noise suppression in the present embodiment
FIG. 12: Block diagram showing the configuration of a multicarrier receiver to which the noise suppression method of the present invention is applied
FIG. 13: Diagram showing the result of one example of the channel estimation accuracy numerical simulation in the present embodiment
FIG. 14: Diagram showing the result of another example of the channel estimation accuracy numerical simulation in the present embodiment

Explanation of symbols

100, 200 Noise suppression device
110 Input unit
120 Sampling unit
130 A/D conversion unit
140 Buffer
150 Noise suppression processing unit
160 Output unit
210 Personal computer
211 Operation device
212 Display
213 Bus interface
214 Recording device
215 Main memory
216 Central processing unit
220 Microphone
300 Multicarrier receiver
301 Detection unit
302 GI removal unit
303 Channel equalization unit
304 Channel estimation unit
305 FFT unit
306 Demodulation unit
307 Decoding unit
500, 501 Signal vector
502 Observation vector
503 Additive noise signal vector
504 Driving noise vector
505 State transition matrix
506 State transition matrix

Claims

A noise suppression device that estimates desired information from only observation information in which noise is mixed with the desired information, the device comprising:
an acquisition means for acquiring the observation information; and
an extraction means for removing the noise from the acquired observation information and extracting the desired information using a Kalman filter that uses colored noise as a driving source,
wherein the Kalman filter is configured not to use the coefficients of an autoregressive model in the state equation of the state space model.

The noise suppression device according to claim 1, wherein the extraction means comprises:
a first correlation calculation unit that calculates, for observation information at time n only, a first correlation value matrix of the estimation error when the state quantity of the system at time n + 1, which includes the desired information, is estimated from information up to time n;
a weighting coefficient calculation unit that calculates, for observation information at time n only, using the first correlation value matrix of the estimation error calculated by the first correlation calculation unit, a weighting coefficient matrix that defines the relationship among the optimum estimated value vector of the state quantity at that time based on information up to time n + 1, the optimum estimated value vector of the state quantity at time n + 1 based on information up to time n, and the estimation error vector of the observed quantity including the observation information;
a first optimum estimated value calculation unit that calculates, for observation information at time n only, a first optimum estimated value vector of the state quantity at time n + 1 based on information up to time n;
a second optimum estimated value calculation unit that calculates, for observation information at time n only, using the weighting coefficient matrix calculated by the weighting coefficient calculation unit, a second optimum estimated value vector of the state quantity at that time based on information up to time n + 1; and
a second correlation calculation unit that calculates, for observation information at time n only, a second correlation value matrix of the estimation error when the state quantity at that time is estimated from information up to time n + 1.

The noise suppression device according to claim 2, wherein:
the first correlation calculation unit calculates the first correlation value matrix of the estimation error using a predetermined state transition matrix, a given element value of the covariance of the driving source vector, and the second correlation value matrix of the estimation error that is either given or was previously calculated by the second correlation calculation unit;
the weighting coefficient calculation unit calculates the weighting coefficient matrix using the first correlation value matrix of the estimation error calculated by the first correlation calculation unit, a given observation transition matrix, and a given element value of the covariance of the noise vector;
the first optimum estimated value calculation unit calculates the first optimum estimated value vector of the state quantity using the state transition matrix and the second optimum estimated value vector of the state quantity that is either given or was previously calculated by the second optimum estimated value calculation unit;
the second optimum estimated value calculation unit calculates the second optimum estimated value vector of the state quantity using the first optimum estimated value vector of the state quantity calculated by the first optimum estimated value calculation unit, the weighting coefficient matrix calculated by the weighting coefficient calculation unit, the observation transition matrix, and the observed quantity at time n + 1 only; and
the second correlation calculation unit calculates the second correlation value matrix of the estimation error using the weighting coefficient matrix calculated by the weighting coefficient calculation unit, the observation transition matrix, and the first correlation value matrix of the estimation error calculated by the first correlation calculation unit.

A noise suppression method for estimating desired information from only observation information in which noise is mixed with the desired information, the method comprising:
an acquisition step of acquiring the observation information; and
an extraction step of removing the noise from the acquired observation information and extracting the desired information using a Kalman filter that uses colored noise as a driving source,
wherein the Kalman filter is configured not to use the coefficients of an autoregressive model in the state equation of the state space model.

The noise suppression method according to claim 4, wherein the extraction step comprises:
a first correlation calculation step of calculating, for observation information at time n only, a first correlation value matrix of the estimation error when the state quantity of the system at time n + 1, which includes the desired information, is estimated from information up to time n;
a weighting coefficient calculation step of calculating, for observation information at time n only, using the first correlation value matrix of the estimation error calculated in the first correlation calculation step, a weighting coefficient matrix that defines the relationship among the optimum estimated value vector of the state quantity at that time based on information up to time n + 1, the optimum estimated value vector of the state quantity at time n + 1 based on information up to time n, and the estimation error vector of the observed quantity including the observation information;
a first optimum estimated value calculation step of calculating, for observation information at time n only, a first optimum estimated value vector of the state quantity at time n + 1 based on information up to time n;
a second optimum estimated value calculation step of calculating, for observation information at time n only, using the weighting coefficient matrix calculated in the weighting coefficient calculation step, a second optimum estimated value vector of the state quantity at that time based on information up to time n + 1; and
a second correlation calculation step of calculating, for observation information at time n only, a second correlation value matrix of the estimation error when the state quantity at that time is estimated from information up to time n + 1.

The noise suppression method according to claim 5, wherein:
the first correlation calculation step calculates the first correlation value matrix of the estimation error using a predetermined state transition matrix, a given element value of the covariance of the driving source vector, and the second correlation value matrix of the estimation error that is either given or was previously calculated in the second correlation calculation step;
the weighting coefficient calculation step calculates the weighting coefficient matrix using the first correlation value matrix of the estimation error calculated in the first correlation calculation step, a given observation transition matrix, and a given element value of the covariance of the noise vector;
the first optimum estimated value calculation step calculates the first optimum estimated value vector of the state quantity using the state transition matrix and the second optimum estimated value vector of the state quantity that is either given or was previously calculated in the second optimum estimated value calculation step;
the second optimum estimated value calculation step calculates the second optimum estimated value vector of the state quantity using the first optimum estimated value vector of the state quantity calculated in the first optimum estimated value calculation step, the weighting coefficient matrix calculated in the weighting coefficient calculation step, the observation transition matrix, and the observed quantity at time n + 1 only; and
the second correlation calculation step calculates the second correlation value matrix of the estimation error using the weighting coefficient matrix calculated in the weighting coefficient calculation step, the observation transition matrix, and the first correlation value matrix of the estimation error calculated in the first correlation calculation step.

A noise suppression program for estimating desired information from only observation information in which noise is mixed with the desired information, the program causing a computer to execute:
an acquisition step of acquiring the observation information; and
an extraction step of removing the noise from the acquired observation information and extracting the desired information using a Kalman filter that uses colored noise as a driving source,
wherein the Kalman filter is configured not to use the coefficients of an autoregressive model in the state equation of the state space model.

The noise suppression program according to claim 7, wherein the extraction step comprises:
a first correlation calculation step of calculating, for observation information at time n only, a first correlation value matrix of the estimation error when the state quantity of the system at time n + 1, which includes the desired information, is estimated from information up to time n;
a weighting coefficient calculation step of calculating, for observation information at time n only, using the first correlation value matrix of the estimation error calculated in the first correlation calculation step, a weighting coefficient matrix that defines the relationship among the optimum estimated value vector of the state quantity at that time based on information up to time n + 1, the optimum estimated value vector of the state quantity at time n + 1 based on information up to time n, and the estimation error vector of the observed quantity including the observation information;
a first optimum estimated value calculation step of calculating, for observation information at time n only, a first optimum estimated value vector of the state quantity at time n + 1 based on information up to time n;
a second optimum estimated value calculation step of calculating, for observation information at time n only, using the weighting coefficient matrix calculated in the weighting coefficient calculation step, a second optimum estimated value vector of the state quantity at that time based on information up to time n + 1; and
a second correlation calculation step of calculating, for observation information at time n only, a second correlation value matrix of the estimation error when the state quantity at that time is estimated from information up to time n + 1.

The noise suppression program according to claim 8, wherein:
the first correlation calculation step calculates the first correlation value matrix of the estimation error using a predetermined state transition matrix, a given element value of the covariance of the driving source vector, and the second correlation value matrix of the estimation error that is either given or was previously calculated in the second correlation calculation step;
the weighting coefficient calculation step calculates the weighting coefficient matrix using the first correlation value matrix of the estimation error calculated in the first correlation calculation step, a given observation transition matrix, and a given element value of the covariance of the noise vector;
the first optimum estimated value calculation step calculates the first optimum estimated value vector of the state quantity using the state transition matrix and the second optimum estimated value vector of the state quantity that is either given or was previously calculated in the second optimum estimated value calculation step;
the second optimum estimated value calculation step calculates the second optimum estimated value vector of the state quantity using the first optimum estimated value vector of the state quantity calculated in the first optimum estimated value calculation step, the weighting coefficient matrix calculated in the weighting coefficient calculation step, the observation transition matrix, and the observed quantity at time n + 1 only; and
the second correlation calculation step calculates the second correlation value matrix of the estimation error using the weighting coefficient matrix calculated in the weighting coefficient calculation step, the observation transition matrix, and the first correlation value matrix of the estimation error calculated in the first correlation calculation step.
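
The five calculation steps recited in the claims follow the shape of a standard Kalman filter recursion, which might be sketched as below. This is a generic textbook sketch, not the specific filter of the invention: the function name `kalman_step`, the symbols `Phi`, `M`, `Q`, `R`, and the numerical values are hypothetical and chosen only for illustration, and the particular state transition matrix, observation transition matrix and colored driving source of the invention are not reproduced here.

```python
import numpy as np


def kalman_step(x_post, P_post, y_next, Phi, M, Q, R):
    """One recursion from time n to time n + 1 (generic sketch, names assumed).

    x_post, P_post : second optimum estimate and second correlation matrix
                     from the previous recursion (or given initial values)
    y_next         : observed quantity at time n + 1
    Phi, M         : state transition and observation transition matrices
    Q, R           : covariances of the driving source and of the noise
    """
    # First correlation calculation: error correlation of the prediction.
    P_pred = Phi @ P_post @ Phi.T + Q
    # Weighting coefficient calculation (Kalman gain).
    K = P_pred @ M.T @ np.linalg.inv(M @ P_pred @ M.T + R)
    # First optimum estimate: state at n + 1 from information up to n.
    x_pred = Phi @ x_post
    # Second optimum estimate: state at n + 1 from information up to n + 1.
    x_new = x_pred + K @ (y_next - M @ x_pred)
    # Second correlation calculation: error correlation after the update.
    P_new = (np.eye(len(x_post)) - K @ M) @ P_pred
    return x_new, P_new, K


# Arbitrary illustrative values (not taken from the patent).
Phi = np.eye(2)
M = np.array([[1.0, 0.0]])
Q = 0.1 * np.eye(2)
R = np.array([[1.0]])
x_new, P_new, K = kalman_step(np.zeros(2), np.eye(2), np.array([1.0]), Phi, M, Q, R)
```

Each recursion takes the previous second optimum estimate and second correlation matrix as inputs, which corresponds to the "given or previously calculated" wording in the claims.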