JP4395195B1

JP4395195B1 - Echo canceller

Info

Publication number: JP4395195B1
Application number: JP2009052680A
Authority: JP
Inventors: 晶大山口
Original assignee: YUGENGAISYA CEPSTRUM
Current assignee: YUGENGAISYA CEPSTRUM
Priority date: 2009-03-05
Filing date: 2009-03-05
Publication date: 2010-01-06
Anticipated expiration: 2029-03-05
Also published as: JP2010206731A

Abstract

【課題】擬似エコー生成用フィルタの演算量を削減すること。
【解決手段】適応フィルタ１０６ａは、有限個の非零の音源パルス列である線形予測残差ベクトルε（ｔ）をタップ係数Ｈ（ｔ）で演算して白色化擬似エコーｒｗ（ｔ）を生成する。逆フィルタ１０６ｂは、ディジタル送信音声信号ｙ（ｔ）を白色化する。加算器１０６ｃは、白色化送信音声信号ｙｗ（ｔ）から白色化擬似エコーｒｗ（ｔ）を減算して白色化残差信号ｅｗ（ｔ）を得る。擬似エコー生成用フィルタ１０６ｄは、適応フィルタ１０６ａから出力されたタップ係数Ｈ（ｔ）で線形予測残差ベクトルε（ｔ）を演算して白色化擬似エコーｒｗ’（ｔ）を生成する。合成フィルタ１０６ｅは、白色化擬似エコーｒｗ’（ｔ）を、線形予測係数Ｌ（ｔ）を用いて合成し、擬似エコーｒ（ｔ）を生成する。加算器１０６ｆは、ディジタル送信音声信号ｙ（ｔ）から擬似エコーｒ（ｔ）を減算して残差信号ｅ（ｔ）を得る。
【選択図】図１A calculation amount of a filter for generating a pseudo echo is reduced.
An adaptive filter 106a generates a whitened pseudo echo rw (t) by calculating a linear prediction residual vector ε (t), which is a finite number of non-zero excitation pulse trains, with a tap coefficient H (t). . The inverse filter 106b whitens the digital transmission audio signal y (t). The adder 106c subtracts the whitened pseudo echo rw (t) from the whitened transmission audio signal yw (t) to obtain a whitened residual signal ew (t). The pseudo echo generation filter 106d calculates a linear prediction residual vector ε (t) using the tap coefficient H (t) output from the adaptive filter 106a to generate a whitened pseudo echo rw ′ (t). The synthesis filter 106e synthesizes the whitened pseudo echo rw ′ (t) using the linear prediction coefficient L (t) to generate the pseudo echo r (t). The adder 106f subtracts the pseudo echo r (t) from the digital transmission voice signal y (t) to obtain a residual signal e (t).
[Selection] Figure 1

Description

本発明は、スピーカから出力された音声がマイクロホンに帰還されることによって発生するエコーを消去するためのエコーキャンセラに関する。 The present invention relates to an echo canceller for canceling an echo generated when a sound output from a speaker is fed back to a microphone.

図１は、従来のエコーキャンセラの構成を示すブロック図である。図１において、符号化ディジタル信号（受信信号）は、音声復号回路（ＳＰＤＥＣ）１１でディジタル受信音声信号ｘ（ｔ）に復号され、ディジタル・アナログ変換器（ＤＡＣ）１２でアナログ受信音声信号に変換され、スピーカ１３から音声として出力される。また、マイクロホン１４に入力されたアナログ送信音声信号は、アナログ・ディジタル変換器（ＡＤＣ）１５でディジタル送信音声信号ｙ（ｔ）に変換され、音声符号化回路（ＳＰＣＯＤ）１７で符号化ディジタル信号（送信信号）に変換される。その際、スピーカ１３から出力された音声がマイクロホン１４に帰還されると、エコーが発生する。エコーが発生すると、自分の話した声がスピーカから遅れて聞こえるので、大変耳障りである。また、ハンズフリー電話等においては、エコーの中の特定の周波数成分が増幅されて、ハウリング（発振現象）が起きることもある。 FIG. 1 is a block diagram showing a configuration of a conventional echo canceller. In FIG. 1, an encoded digital signal (received signal) is decoded into a digital received voice signal x (t) by a voice decoding circuit (SPDEC) 11 and converted into an analog received voice signal by a digital / analog converter (DAC) 12. Then, it is output as sound from the speaker 13. The analog transmission voice signal input to the microphone 14 is converted into a digital transmission voice signal y (t) by an analog / digital converter (ADC) 15 and encoded digital signal (SPCOD) 17 by a voice encoding circuit (SPCOD) 17. (Transmission signal). At that time, when the sound output from the speaker 13 is fed back to the microphone 14, an echo is generated. When echoes occur, the voice you spoke is heard behind the speaker, which is very annoying. In hands-free telephones and the like, a specific frequency component in an echo is amplified and howling (oscillation phenomenon) may occur.

エコーキャンセラ１６は、エコー及びハウリングを防止するための回路であり、離散時刻ｔにおけるディジタル受信音声信号ｘ（ｔ）をタップ係数Ｈ（ｔ）で演算して擬似エコーｒ（ｔ）を生成する適応フィルタ１６ａと、アナログ・ディジタル変換器１５から出力されるディジタル送信音声信号ｙ（ｔ）から擬似エコーｒ（ｔ）を差し引いて残差信号ｅ（ｔ）を出力する加算器１６ｂで構成される。残差信号ｅ（ｔ）は、送信回線に出力されると共に、適応フィルタ１６ａにタップ係数更新用の信号として与えられる。 The echo canceller 16 is a circuit for preventing echo and howling. The echo canceller 16 is adapted to generate a pseudo echo r (t) by calculating a digital reception voice signal x (t) at a discrete time t with a tap coefficient H (t). The filter 16a and an adder 16b that subtracts the pseudo echo r (t) from the digital transmission voice signal y (t) output from the analog / digital converter 15 and outputs a residual signal e (t). The residual signal e (t) is output to the transmission line and given to the adaptive filter 16a as a tap coefficient update signal.

マイクロホン１４には、音声ｓ（ｔ）の他、背景雑音ｎ（ｔ）やスピーカ１３からのエコーｄ（ｔ）が入力され、これらがアナログ・ディジタル変換器１５でディジタル送信音声信号ｙ（ｔ）に変換される。ここで、エコーｄ（ｔ）のエコー経路のインパルス応答をｍ次のＦＩＲ（有限インパルス応答）フィルタで近似できると仮定すると、離散時刻ｔにおける適応フィルタ１６ａのタップ係数Ｈ（ｔ）と、適応フィルタ１６ａに入力されるディジタル受信音声信号ｘ（ｔ）は、それぞれ次の式（１）、（２）のように表される。但し、Ｔはベクトルの転置を表す。
Ｈ（ｔ）＝［ｈ１（ｔ），ｈ２（ｔ），…，ｈｍ（ｔ）］^Ｔ・・・（１）
ｘ（ｔ）＝［ｘ（ｔ），ｘ（ｔ−１），…，ｘ（ｔ−ｍ＋１）］^Ｔ・・・（２） In addition to the sound s (t), the background noise n (t) and the echo d (t) from the speaker 13 are input to the microphone 14, and these are output by the analog / digital converter 15 to the digital transmission sound signal y (t). Is converted to Assuming that the impulse response of the echo path of the echo d (t) can be approximated by an mth-order FIR (finite impulse response) filter, the tap coefficient H (t) of the adaptive filter 16a at the discrete time t, and the adaptive filter The digital reception voice signal x (t) input to 16a is expressed by the following equations (1) and (2), respectively. However, T represents transposition of a vector.
H (t) = [h1 (t), h2 (t),..., Hm (t)] ^T (1)
x (t) = [x (t), x (t−1),..., x (t−m + 1)] ^T (2)

これにより、擬似エコーｒ（ｔ）と残差信号ｅ（ｔ）は、次の（３）、（４）のように表される。
ｒ（ｔ）＝Ｈ（ｔ）^ＴＸ（ｔ）・・・（３）
ｅ（ｔ）＝ｙ（ｔ）−ｒ（ｔ）・・・（４） Thereby, the pseudo echo r (t) and the residual signal e (t) are expressed as the following (3) and (4).
r (t) = H (t) ^TX (t) (3)
e (t) = y (t) -r (t) (4)

即ち、残差信号ｅ（ｔ）は、ディジタル送信音声信号ｙ（ｔ）から擬似エコーｒ（ｔ）を差し引いて、ディジタル送信音声信号ｙ（ｔ）に含まれるエコー成分を抑圧した信号である。 That is, the residual signal e (t) is a signal obtained by subtracting the pseudo echo r (t) from the digital transmission voice signal y (t) and suppressing the echo component included in the digital transmission voice signal y (t).

タップ係数Ｈ（ｔ）の更新は、一般に次の式（５）のように表される。なお、式（５）中のμはタップ係数の収束速度を調整するステップサイズであり、更新ベクトルΔＨ（ｔ）は適応アルゴリズムの種類によって異なる。
Ｈ（ｔ＋１）＝Ｈ（ｔ）＋μΔＨ（ｔ）・・・（５） The update of the tap coefficient H (t) is generally expressed as the following equation (5). Note that μ in the equation (5) is a step size for adjusting the convergence speed of the tap coefficient, and the update vector ΔH (t) varies depending on the type of the adaptive algorithm.
H (t + 1) = H (t) + μΔH (t) (5)

ところが、図１に示したエコーキャンセラでは、適応フィルタ１６ａに入力されるディジタル受信音声信号ｘ（ｔ）が、音声のような有色信号（スペクトル分布が一様でない信号）の場合、収束速度が著しく低下するという課題がある。 However, in the echo canceller shown in FIG. 1, when the digital received speech signal x (t) input to the adaptive filter 16a is a colored signal such as speech (a signal with a nonuniform spectrum distribution), the convergence speed is remarkably high. There is a problem of lowering.

その解決策として、逆フィルタによって入力信号を白色信号（スペクトル分布が一様な信号）に変換して収束速度を高める方法が提案されている（特許文献１）。 As a solution, a method has been proposed in which an input signal is converted into a white signal (a signal having a uniform spectral distribution) by an inverse filter to increase the convergence speed (Patent Document 1).

特許文献１のエコーキャンセラは、マイクロホンに入力された自己の音声を白色化する第１の逆フィルタと、スピーカから再生される相手の音声を白色化する第２の逆フィルタとを有し、適応フィルタが、２つの逆フィルタから出力された白色信号を用いて擬似エコーを生成する。 The echo canceller of Patent Document 1 includes a first inverse filter that whitens the voice of its own input to the microphone, and a second inverse filter that whitens the voice of the other party reproduced from the speaker. The filter generates a pseudo echo using the white signal output from the two inverse filters.

また、非特許文献１には、音声復号回路を備えたディジタル通信装置に設けられるエコーキャンセラにおいて、音声符号化技術として線形予測係数及び線形予測残差ベクトルを利用する残差励振線形予測（「Residual Excited Linear Predictive (RELP)」）を用い、受信音声信号を白色化した信号を利用してフィルタのタップ係数を求めること（Figure 1 参照）に代えて、線形予測残差ベクトルに基づいて擬似エコー生成用フィルタによりフィルタのタップ係数を求めること（Figure 2 参照）が示されている（第１３３４頁、「III. ECHO CANCELLATION」及び「IV. VOICE CODEC AND ECHO CANCELLER INTEGRATION」参照）。 Further, Non-Patent Document 1 discloses a residual excitation linear prediction (“Residual” using a linear prediction coefficient and a linear prediction residual vector as a speech encoding technique in an echo canceller provided in a digital communication apparatus including a speech decoding circuit. Excited Linear Predictive (RELP) ”), instead of obtaining the tap coefficient of the filter using the whitened signal of the received speech signal (see Figure 1), pseudo echo generation based on the linear prediction residual vector (See Figure 2 on page 1334, "III. ECHO CANCELLATION" and "IV. VOICE CODEC AND ECHO CANCELLER INTEGRATION").

非特許文献１のエコーキャンセラは、白色化された線形予測合成音源信号が適応フィルタの入力になるので、受信音声信号を白色化するための逆フィルタを追加する必要がなくなり、特許文献１のエコーキャンセラに比べて、構成を簡略化することができ、収束速度を大幅に向上させることができる。また、非特許文献１のエコーキャンセラは、マルチパルス符号化等を用いて音源の情報圧縮を行った場合、適応フィルタへの入力信号が疎なパルス列となり、値０のサンプルに対する演算を省略することができるため、特許文献１のエコーキャンセラに比べて、適応フィルタの演算量を削減することができる。 The echo canceller of Non-Patent Document 1 eliminates the need to add an inverse filter for whitening the received speech signal because the whitened linear prediction synthesized sound source signal is input to the adaptive filter. Compared to a canceller, the configuration can be simplified and the convergence speed can be greatly improved. Further, the echo canceller of Non-Patent Document 1 omits the calculation for the sample of the value 0 when the input signal to the adaptive filter becomes a sparse pulse train when the information compression of the sound source is performed using multi-pulse encoding or the like. Therefore, the calculation amount of the adaptive filter can be reduced as compared with the echo canceller disclosed in Patent Document 1.

特開２００２−９４４１９号公報JP 2002-94419 A

WILSON P.J. et al.，‘An integrated voice codec and echo canceller implemented in a single DSP processor’， IEEE International Conference on Acoustics, Speech, and Signal Processing，米国，IEEE，１９８６年４月，vol.11，pp.1333-1336WILSON PJ et al., 'An integrated voice codec and echo canceller implemented in a single DSP processor', IEEE International Conference on Acoustics, Speech, and Signal Processing, USA, IEEE, April 1986, vol.11, pp.1333 -1336

しかしながら、上記非特許文献１に開示された従来のエコーキャンセラは、擬似エコー生成用フィルタへの入力信号が合成音声であるので擬似エコー生成用フィルタの演算量を削減することができない。 However, the conventional echo canceller disclosed in Non-Patent Document 1 cannot reduce the calculation amount of the pseudo echo generation filter because the input signal to the pseudo echo generation filter is synthesized speech.

本発明はかかる点に鑑みてなされたものであり、フィルタの演算量を削減することができるエコーキャンセラを提供することを目的とする。 The present invention has been made in view of such a point, and an object thereof is to provide an echo canceller capable of reducing the amount of calculation of a filter.

本発明のエコーキャンセラは、線形予測分析によって得られた線形予測係数および線形予測残差ベクトルを受信し、前記線形予測係数により合成フィルタを駆動し、前記線形予測残差ベクトルを合成して受信音声信号を得る音声復号回路を備えたディジタル通信装置に設けられるエコーキャンセラであって、送信音声信号を白色化した信号と前記線形予測残差ベクトルとに基づいてフィルタのタップ係数を求めるタップ係数設定手段と、前記タップ係数、前記線形予測係数および前記線形予測残差ベクトルを用いて擬似エコーを生成し、前記擬似エコーを用いて前記送信音声信号に含まれるエコーを抑圧するエコー除去手段と、を具備する構成を採る。 The echo canceller of the present invention receives a linear prediction coefficient and a linear prediction residual vector obtained by linear prediction analysis, drives a synthesis filter with the linear prediction coefficient, synthesizes the linear prediction residual vector, and receives received speech. An echo canceller provided in a digital communication apparatus having a speech decoding circuit for obtaining a signal, wherein the tap coefficient setting means obtains a filter tap coefficient based on a signal obtained by whitening a transmission speech signal and the linear prediction residual vector And echo removing means for generating a pseudo echo using the tap coefficient, the linear prediction coefficient, and the linear prediction residual vector, and suppressing an echo included in the transmission voice signal using the pseudo echo. The structure to do is taken.

本発明によれば、マルチパルス符号化等を用いて音源の情報圧縮を行った場合、擬似エコー生成用フィルタへの入力信号も疎なパルス列となり、値０のサンプルに対する演算を省略することができるため、従来のエコーキャンセラに比べて、フィルタの演算量を削減することができる。 According to the present invention, when sound source information compression is performed using multi-pulse coding or the like, the input signal to the pseudo-echo generation filter is also a sparse pulse train, and the calculation for the sample of value 0 can be omitted. Therefore, the calculation amount of the filter can be reduced as compared with the conventional echo canceller.

従来のエコーキャンセラを含むディジタル通信装置の構成を示すブロック図Block diagram showing the configuration of a digital communication apparatus including a conventional echo canceller 本発明の実施の形態１に係るエコーキャンセラを含むディジタル通信装置の構成を示すブロック図1 is a block diagram showing a configuration of a digital communication apparatus including an echo canceller according to Embodiment 1 of the present invention. 本発明の実施の形態１に係る合成フィルタの一例を示す図The figure which shows an example of the synthetic | combination filter which concerns on Embodiment 1 of this invention. 本発明の実施の形態１に係る逆フィルタの一例を示す図The figure which shows an example of the inverse filter which concerns on Embodiment 1 of this invention 従来のエコーキャンセラの各フィルタの入力信号の状態を示す図The figure which shows the state of the input signal of each filter of the conventional echo canceller 本発明の実施の形態１に係るエコーキャンセラの各フィルタの入力信号の状態を示す図The figure which shows the state of the input signal of each filter of the echo canceller which concerns on Embodiment 1 of this invention. 本発明の実施の形態２に係るエコーキャンセラを含むディジタル通信装置の構成を示すブロック図FIG. 3 is a block diagram showing a configuration of a digital communication apparatus including an echo canceller according to Embodiment 2 of the present invention. 本発明の実施の形態２に係るエコーキャンセラの各フィルタの入力信号の状態を示す図The figure which shows the state of the input signal of each filter of the echo canceller which concerns on Embodiment 2 of this invention.

以下、本発明の実施の形態について図面を参照して詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

（実施の形態１）
図２は、本発明の実施の形態１に係るエコーキャンセラを含むディジタル通信装置の構成を示すブロック図である。 (Embodiment 1)
FIG. 2 is a block diagram showing a configuration of the digital communication apparatus including the echo canceller according to Embodiment 1 of the present invention.

図２に示すように、ディジタル通信装置は、音声復号回路１０１と、ディジタル・アナログ変換器（ＤＡＣ）１０２と、スピーカ１０３と、マイクロホン１０４と、アナログ・ディジタル変換器（ＡＤＣ）１０５と、エコーキャンセラ１０６と、音声符号化回路１０７と、から主に構成される。 As shown in FIG. 2, the digital communication apparatus includes a speech decoding circuit 101, a digital / analog converter (DAC) 102, a speaker 103, a microphone 104, an analog / digital converter (ADC) 105, and an echo canceller. 106 and a speech encoding circuit 107.

音声復号回路１０１は、デマルチプレクサ１０１ａと、音源信号発生回路１０１ｂと、合成フィルタ１０１ｃと、を有する。エコーキャンセラ１０６は、適応フィルタ１０６ａと、逆フィルタ１０６ｂと、加算器１０６ｃと、擬似エコー生成用フィルタ１０６ｄと、合成フィルタ１０６ｅと、加算器１０６ｆと、を有する。音声符号化回路１０７は、分析フィルタ１０７ａと、音源特性分析回路１０７ｂと、マルチプレクサ１０７ｃと、を有する。 The audio decoding circuit 101 includes a demultiplexer 101a, a sound source signal generation circuit 101b, and a synthesis filter 101c. The echo canceller 106 includes an adaptive filter 106a, an inverse filter 106b, an adder 106c, a pseudo echo generation filter 106d, a synthesis filter 106e, and an adder 106f. The speech encoding circuit 107 includes an analysis filter 107a, a sound source characteristic analysis circuit 107b, and a multiplexer 107c.

ここで、携帯電話システムや、テレビ電話システムでは、データ伝送量を削減するために、音源の情報量を圧縮する音源符号化処理および圧縮された音源の情報量を伸長する音源復号処理が必ず行われる。音源符号化処理は、ディジタル送信音声信号を１０〜２０ｍｓ程度のフレームに区切り、フレーム毎に線形予測分析を行い、線形予測係数（ＬＰＣ）と線形予測残差信号を求め、線形予測係数と線形予測残差信号とを個別に符号化する処理である。また、音源復号処理は、線形予測係数により合成フィルタを駆動し、線形予測残差ベクトルを合成してディジタル受信音声信号を得る処理である。なお、線形予測係数は、音声のスペクトラム包絡を表すパラメータである。また、線形予測残差ベクトルとは、線形予測残差信号を区分化して伝送した信号を元の連続したデータストリームに戻したものである。 Here, in a cellular phone system or a videophone system, in order to reduce the amount of data transmission, excitation encoding processing for compressing the information amount of the excitation and excitation decoding processing for expanding the information amount of the compressed excitation are always performed. Is called. In the sound source encoding process, the digital transmission speech signal is divided into frames of about 10 to 20 ms, linear prediction analysis is performed for each frame, linear prediction coefficients (LPC) and linear prediction residual signals are obtained, and linear prediction coefficients and linear prediction are calculated. This is a process of individually encoding the residual signal. The sound source decoding process is a process of obtaining a digital received speech signal by driving a synthesis filter with a linear prediction coefficient and synthesizing a linear prediction residual vector. Note that the linear prediction coefficient is a parameter representing the spectrum envelope of speech. The linear prediction residual vector is a signal obtained by segmenting and transmitting a linear prediction residual signal and returning it to the original continuous data stream.

通信相手の装置から送信された符号化ディジタル信号（受信信号）は、線形予測残差ベクトルを符号化した音源特性パラメータと、線形予測係数とから構成され、音声復号回路１０１に入力される。 An encoded digital signal (received signal) transmitted from a communication partner apparatus is composed of a sound source characteristic parameter obtained by encoding a linear prediction residual vector and a linear prediction coefficient, and is input to the speech decoding circuit 101.

音声復号回路１０１は、符号化ディジタル信号（受信信号）を復号し、ディジタル受信音声信号ｘ（ｔ）を得る。デマルチプレクサ１０１ａは、符号化ディジタル信号（受信信号）を、線形予測係数Ｌ（ｔ）と音源特性パラメータとに分離する。音源信号発生回路１０１ｂは、音源特性パラメータを復号し、線形予測残差ベクトルε（ｔ）を得る。合成フィルタ１０１ｃは、線形予測残差ベクトルε（ｔ）を、線形予測係数Ｌ（ｔ）を用いて合成し、ディジタル受信音声信号ｘ（ｔ）を得る。なお、合成フィルタ１０１ｃには、通常、ＰＡＲＣＯＲ（partial auto-correlation coefficient）係数を用いた全極型の格子型フィルタまたはＬＳＰ（line spectrum pair）パラメータを用いたＬＳＰ合成フィルタが用いられる。 The voice decoding circuit 101 decodes the encoded digital signal (received signal) to obtain a digital received voice signal x (t). The demultiplexer 101a separates the encoded digital signal (received signal) into a linear prediction coefficient L (t) and a sound source characteristic parameter. The sound source signal generation circuit 101b decodes the sound source characteristic parameter to obtain a linear prediction residual vector ε (t). The synthesis filter 101c synthesizes the linear prediction residual vector ε (t) using the linear prediction coefficient L (t) to obtain a digital received speech signal x (t). The synthesis filter 101c is usually an all-pole type lattice filter using PARCOR (partial auto-correlation coefficient) coefficients or an LSP synthesis filter using LSP (line spectrum pair) parameters.

ディジタル・アナログ変換器１０２は、音声復号回路１０１から出力されたディジタル受信音声信号ｘ（ｔ）をアナログ受信音声信号に変換する。アナログ受信音声信号は、スピーカ１０３から音声として出力される。 The digital / analog converter 102 converts the digital reception voice signal x (t) output from the voice decoding circuit 101 into an analog reception voice signal. The analog received audio signal is output as audio from the speaker 103.

アナログ・ディジタル変換器１０５は、マイクロホン１０４に入力されたアナログ送信音声信号を、ディジタル送信音声信号ｙ（ｔ）に変換する。なお、マイクロホン１０４には、音声ｓ（ｔ）の他、背景雑音ｎ（ｔ）やスピーカ１０３からのエコーｄ（ｔ）が入力される。 The analog / digital converter 105 converts the analog transmission voice signal input to the microphone 104 into a digital transmission voice signal y (t). In addition to the sound s (t), the background noise n (t) and the echo d (t) from the speaker 103 are input to the microphone 104.

エコーキャンセラ１０６は、アナログ・ディジタル変換器１０５から出力されたディジタル送信音声信号ｙ（ｔ）から擬似エコーｒ（ｔ）を減算して残差信号ｅ（ｔ）を得る。なお、エコーキャンセラ１０６内部の各構成の詳細な説明は後述する。 The echo canceller 106 subtracts the pseudo echo r (t) from the digital transmission voice signal y (t) output from the analog / digital converter 105 to obtain a residual signal e (t). A detailed description of each component inside the echo canceller 106 will be described later.

音声符号化回路１０７は、エコーキャンセラ１０６から出力された残差信号ｅ（ｔ）を符号化ディジタル信号（送信信号）に変換する。分析フィルタ１０７ａは、残差信号ｅ（ｔ）に対して線形予測分析を行ない、線形予測係数と線形予測残差信号を得る。音源特性分析回路１０７ｂは、ピッチ抽出処理とマルチパス符号化、または、ベクトル量子化等を行うことにより、線形予測残差信号を符号化して音源特性パラメータを得る。線形予測残差信号を符号化することにより、データ伝送量を圧縮することができる。マルチプレクサ１０７ｃは、線形予測係数と音源特性パラメータを多重して符号化ディジタル信号（送信信号）を得る。なお、分析フィルタ１０７ａには、通常、全零型の格子型フィルタが用いられる。 The voice encoding circuit 107 converts the residual signal e (t) output from the echo canceller 106 into an encoded digital signal (transmission signal). The analysis filter 107a performs linear prediction analysis on the residual signal e (t) to obtain a linear prediction coefficient and a linear prediction residual signal. The sound source characteristic analysis circuit 107b encodes the linear prediction residual signal to obtain a sound source characteristic parameter by performing pitch extraction processing and multi-pass encoding, vector quantization, or the like. By encoding the linear prediction residual signal, the data transmission amount can be compressed. The multiplexer 107c multiplexes the linear prediction coefficient and the sound source characteristic parameter to obtain a coded digital signal (transmission signal). The analysis filter 107a is usually an all-zero lattice filter.

符号化ディジタル信号（送信信号）は、通信相手の装置に送信され、通信相手の装置において音声合成に用いられる。 The encoded digital signal (transmission signal) is transmitted to the communication partner device and used for speech synthesis in the communication partner device.

なお、図２中のデマルチプレクサ１０１ａ、音源信号発生回路１０１ｂ、合成フィルタ１０１ｃ、分析フィルタ１０７ａ、音源特性分析回路１０７ｂおよびマルチプレクサ１０７ｃは、音声圧縮伸長処理のために必要なものであり、本実施の形態のエコーキャンセラを用いるか否かにかかわらず、元々、ディジタル通信装置に備えられているものである。 It should be noted that the demultiplexer 101a, the sound source signal generation circuit 101b, the synthesis filter 101c, the analysis filter 107a, the sound source characteristic analysis circuit 107b, and the multiplexer 107c in FIG. Regardless of whether or not the form of echo canceller is used, the digital communication apparatus is originally provided.

次に、エコーキャンセラ１０６内部の各構成について、詳細に説明する。 Next, each component inside the echo canceller 106 will be described in detail.

適応フィルタ１０６ａ及び擬似エコー生成用フィルタ１０６ｄには、音源信号発生回路１０１ｂから出力された線形予測残差ベクトルε（ｔ）が入力される。マルチパス符号化等を用いて音源情報を圧縮した場合、線形予測残差ベクトルε（ｔ）は、有限個の非零の音源パルス列（ごく少数のサンプルのみが０以外の値を有する音源パルス列）となる。 The linear prediction residual vector ε (t) output from the sound source signal generation circuit 101b is input to the adaptive filter 106a and the pseudo echo generation filter 106d. When the excitation information is compressed using multi-pass coding or the like, the linear prediction residual vector ε (t) is a finite number of non-zero excitation pulse sequences (excitation pulse sequences in which only a small number of samples have values other than 0). It becomes.

適応フィルタ１０６ａは、離散時刻ｔにおける線形予測残差ベクトルε（ｔ）をタップ係数Ｈ（ｔ）で演算して白色化擬似エコーｒｗ（ｔ）を生成する。また、適応フィルタ１０６ａは、加算器１０６ｃから出力された白色化残差信号ｅｗ（ｔ）を最適値にするようにタップ係数Ｈ（ｔ）の更新を行い、更新後のタップ係数Ｈ（ｔ）を擬似エコー生成用フィルタ１０６ｄに出力する。 The adaptive filter 106a calculates the linear prediction residual vector ε (t) at the discrete time t with the tap coefficient H (t) to generate the whitened pseudo echo rw (t). The adaptive filter 106a updates the tap coefficient H (t) so that the whitening residual signal ew (t) output from the adder 106c becomes an optimum value, and the updated tap coefficient H (t). Is output to the pseudo echo generation filter 106d.

このように、適応フィルタ１０６ａの入力信号を、有限個の非零の音源パルス列である線形予測残差ベクトルε（ｔ）とすることにより、値０のサンプルについての演算を省略することができるので、白色化された受信音声信号を適応フィルタの入力信号とする場合（特許文献１）に比べて、適応フィルタ１０６ａの演算量を低減することができる。 As described above, by using the linear prediction residual vector ε (t), which is a finite number of non-zero excitation pulse trains, as the input signal of the adaptive filter 106a, it is possible to omit the calculation for the zero-value sample. Compared with the case where the whitened received audio signal is used as the input signal of the adaptive filter (Patent Document 1), the calculation amount of the adaptive filter 106a can be reduced.

例えば、サンプリング周波数１０ｋＨｚ、有声音の基本周波数１００Ｈｚ／基本周期１０ｍｓ（基本周期内のサンプル数１００）、一周期分の音源信号を１０個の非零のパルスの組み合わせとした場合を考える。この場合、フィルタのタップ長Ｎに対してＮのオーダーの演算量を必要とする適応アルゴリズム用いた時と比較して、演算量が１０／１００＝１／１０に削減される。また、Ｎ^２のオーダーの演算量を必要とする適応アルゴリズムでは、演算量が（１０／１００）^２＝１／１００に削減される。 For example, consider a case where the sampling frequency is 10 kHz, the basic frequency of voiced sound is 100 Hz / the basic period is 10 ms (the number of samples in the basic period is 100), and the sound source signal for one period is a combination of ten non-zero pulses. In this case, the amount of calculation is reduced to 10/100 = 1/10 compared to when an adaptive algorithm that requires an amount of calculation in the order of N with respect to the tap length N of the filter is used. Further, in the adaptive algorithm which requires the calculation amount of the order of ^{N 2,} the amount of computation is reduced to ^(10/100) 2 = 1/100.

なお、適応フィルタ１０６ａの適応アルゴリズムには、一般的にＬＭＳ（最小平均二乗）アルゴリズム、ＮＬＭＳ（Normalized ＬＭＳ）アルゴリズム、射影法、ＲＬＳ（逐次最小二乗）アルゴリズム等が用いられる。これらは、新たな信号のサンプル値が入力されるたびに逐次演算を行って徐々にタップ係数が最適値に収束していく適応アルゴリズムである。また、上記のように演算量を大幅に削減することができるので、適応フィルタ１０６ａの適応アルゴリズムに、有限長の入出力信号を用いたブロック処理を行い、一度の演算で最適なタップ係数を求める最小二乗法を用いることができる。 In general, an LMS (Minimum Mean Square) algorithm, an NLMS (Normalized LMS) algorithm, a projection method, an RLS (Sequential Least Square) algorithm, or the like is used as an adaptive algorithm of the adaptive filter 106a. These are adaptive algorithms that perform sequential calculation each time a new signal sample value is input and gradually converge the tap coefficient to an optimum value. In addition, since the amount of calculation can be greatly reduced as described above, block processing using an input / output signal of a finite length is performed on the adaptive algorithm of the adaptive filter 106a, and the optimum tap coefficient is obtained by a single calculation. A least squares method can be used.

逆フィルタ１０６ｂは、合成フィルタ１０１ｃの逆特性を有し、アナログ・ディジタル変換器１０５から出力されたディジタル送信音声信号ｙ（ｔ）を、線形予測係数Ｌ（ｔ）を用いて白色化し、白色化送信音声信号ｙｗ（ｔ）を得る。なお、逆フィルタ１０６ｂには、合成フィルタ１０１ｃが全零型の格子型フィルタである場合には全極型の格子型フィルタが用いられ、合成フィルタ１０１ｃがＬＳＰ合成フィルタである場合には全極型のＬＳＰパラメータを用いたフィルタが用いられる。 The inverse filter 106b has the inverse characteristics of the synthesis filter 101c, and whitens the white speech signal y (t) output from the analog / digital converter 105 using the linear prediction coefficient L (t). A transmission voice signal yw (t) is obtained. The inverse filter 106b uses an all-pole lattice filter when the synthesis filter 101c is an all-zero lattice filter, and an all-pole type when the synthesis filter 101c is an LSP synthesis filter. A filter using the LSP parameter is used.

加算器１０６ｃは、逆フィルタ１０６ｂから出力された白色化送信音声信号ｙｗ（ｔ）から、適応フィルタ１０６ａから出力された白色化擬似エコーｒｗ（ｔ）を減算して白色化残差信号ｅｗ（ｔ）を得る。 The adder 106c subtracts the whitened pseudo echo rw (t) output from the adaptive filter 106a from the whitened transmission audio signal yw (t) output from the inverse filter 106b, and then the whitened residual signal ew (t )

擬似エコー生成用フィルタ１０６ｄは、単純なＦＩＲフィルタであり、音源信号発生回路１０１ｂから出力された線形予測残差ベクトルε（ｔ）を、適応フィルタ１０６ａから出力されたタップ係数Ｈ（ｔ）で演算して白色化擬似エコーｒｗ’（ｔ）を生成する。 The pseudo echo generation filter 106d is a simple FIR filter, and calculates the linear prediction residual vector ε (t) output from the sound source signal generation circuit 101b using the tap coefficient H (t) output from the adaptive filter 106a. Thus, the whitened pseudo echo rw ′ (t) is generated.

合成フィルタ１０６ｅは、白色化擬似エコーｒｗ’（ｔ）を、線形予測係数Ｌ（ｔ）を用いて合成し、擬似エコーｒ（ｔ）を生成する。 The synthesis filter 106e synthesizes the whitened pseudo echo rw '(t) using the linear prediction coefficient L (t) to generate the pseudo echo r (t).

加算器１０６ｆは、アナログ・ディジタル変換器１０５から出力されたディジタル送信音声信号ｙ（ｔ）から、合成フィルタ１０６ｅから出力された擬似エコーｒ（ｔ）を減算して残差信号ｅ（ｔ）を得る。これにより、ディジタル送信音声信号ｙ（ｔ）に含まれるエコー成分を抑圧することができる。 The adder 106f subtracts the pseudo echo r (t) output from the synthesis filter 106e from the digital transmission speech signal y (t) output from the analog / digital converter 105 to obtain a residual signal e (t). obtain. Thereby, the echo component contained in the digital transmission voice signal y (t) can be suppressed.

次に、合成フィルタ１０１ｃおよび逆フィルタ１０６ｂの一例について説明する。 Next, an example of the synthesis filter 101c and the inverse filter 106b will be described.

図３は、合成フィルタ１０１ｃがＬＳＰパラメータを用いたＬＳＰ合成フィルタである場合の一例を示す図であり、図４は、合成フィルタ１０１ｃに図３に示すＬＳＰ合成フィルタを用いた場合の逆フィルタ１０６ｂを示す図である。 FIG. 3 is a diagram illustrating an example when the synthesis filter 101c is an LSP synthesis filter using LSP parameters, and FIG. 4 is an inverse filter 106b when the LSP synthesis filter illustrated in FIG. 3 is used as the synthesis filter 101c. FIG.

図３および図４は、次数が６次の例である。図３および図４中のｚ^−１は、サンプル遅延を表し、Ｃ_ｉは、以下の式（６）によりＬＳＰパラメータω_ｉから求めたフィルタ係数である。

3 and 4 are examples of the sixth order. In FIG. 3 and FIG. 4, z ⁻¹ represents a sample delay, and C _i is a filter coefficient obtained from the LSP parameter ω _i by the following equation (6).

図３に示す合成フィルタ１０１ｃは、以下の式（７）の伝達係数Ｈ_Ｓ（ｚ）を有する。また、図４に示す逆フィルタ１０６ｂは、以下の式（８）の伝達係数Ｈ_Ａ（ｚ）を有する。なお、式（７）、（８）の定数Ｐはフィルタの次数であり、式（７）、（８）は次数Ｐが偶数の場合を示す。

The synthesis filter 101c shown in FIG. 3 has a transfer coefficient H _S (z) of the following equation (7). Further, the inverse filter 106b shown in FIG. 4 has a transfer coefficient H _A (z) of the following equation (8). Note that the constant P in the equations (7) and (8) is the order of the filter, and the equations (7) and (8) show the case where the order P is an even number.

合成フィルタ１０１ｃの伝達係数Ｈ_Ｓ（ｚ）と逆フィルタ１０６ｂの伝達係数Ｈ_Ａ（ｚ）とは、以下の式（９）の関係を有する。

The transmission coefficient of the transmission coefficient _H S (z) and inverse filter 106b of synthesis filter 101c _H A (z), have a relationship of the following equation (9).

次に、図５を用いて、本発明の実施の形態１に係るエコーキャンセラの各フィルタの入力信号の状態について、従来技術と対比して説明する。 Next, the state of the input signal of each filter of the echo canceller according to Embodiment 1 of the present invention will be described using FIG. 5 in comparison with the prior art.

図５Ａは、従来（非特許文献１）のエコーキャンセラの各フィルタの入力信号の状態を示す図である。図５Ｂは、本実施の形態に係るエコーキャンセラの各フィルタの入力信号の状態を示す図である。 FIG. 5A is a diagram illustrating a state of an input signal of each filter of a conventional echo canceller (Non-Patent Document 1). FIG. 5B is a diagram showing a state of an input signal of each filter of the echo canceller according to the present embodiment.

図５Ａ及び図５Ｂに示すように、音源信号発生回路にて、情報量が圧縮された音源特性パラメータを復号すると線形予測残差ベクトルが得られる。この線形予測残差ベクトルは、白色信号であり、かつ、ごく少数のサンプルのみが０以外の値を有する音源パルス列（疎なパルス列）である。また、線形予測係数により合成フィルタを駆動して線形予測残差ベクトルを合成すると、合成音声（受信音声信号）が得られる。 As shown in FIGS. 5A and 5B, when a sound source characteristic parameter whose information amount is compressed is decoded by the sound source signal generation circuit, a linear prediction residual vector is obtained. This linear prediction residual vector is a white signal and a sound source pulse train (sparse pulse train) in which only a small number of samples have a value other than 0. Further, when a synthesis filter is driven by a linear prediction coefficient to synthesize a linear prediction residual vector, synthesized speech (received speech signal) is obtained.

図５Ａに示すように、非特許文献１のエコーキャンセラでは、適応フィルタへの入力信号が疎なパルス列であるので適応フィルタの演算量を削減することができる。しかしながら、非特許文献１のエコーキャンセラでは、擬似エコー生成用フィルタへの入力信号が合成音声であるので擬似エコー生成用フィルタの演算量を削減することができない。 As shown in FIG. 5A, in the echo canceller of Non-Patent Document 1, since the input signal to the adaptive filter is a sparse pulse train, the calculation amount of the adaptive filter can be reduced. However, in the echo canceller of Non-Patent Document 1, since the input signal to the pseudo echo generation filter is synthesized speech, the amount of calculation of the pseudo echo generation filter cannot be reduced.

一方、図５Ｂに示すように、本実施の形態に係るエコーキャンセラでは、適応フィルタ１０６ａ及び擬似エコー生成用フィルタ１０６ｄのいずれの入力信号も疎なパルス列であり、次数が大きい（タップ長が長い）擬似エコー生成用フィルタ１０６ｄの処理の後に、次数が小さい（タップ長が短い）合成フィルタ１０６ｅの処理を行っている。したがって、本実施の形態に係るエコーキャンセラでは、適応フィルタ１０６ａ及び擬似エコー生成用フィルタ１０６ｄの演算量を削減することができる。 On the other hand, as shown in FIG. 5B, in the echo canceller according to the present embodiment, both the input signals of the adaptive filter 106a and the pseudo echo generation filter 106d are sparse pulse trains, and the order is large (the tap length is long). After the process of the pseudo echo generation filter 106d, the process of the synthesis filter 106e having a small order (short tap length) is performed. Therefore, in the echo canceller according to the present embodiment, the calculation amount of the adaptive filter 106a and the pseudo echo generation filter 106d can be reduced.

なお、本実施の形態のエコーキャンセラは、非特許文献１のエコーキャンセラに対して合成フィルタ１０６ｅが増えることになるが、合成フィルタ１０６ｅが追加されたことによる演算量の増加よりも、擬似エコー生成用フィルタ１０６ｄにおいて低減される演算量の方がはるかに大きい。これは、通常、消去すべきエコーの継続時間、すなわち擬似エコー生成用フィルタ１０６ｄのタップ長が非常に大きいので、擬似エコー生成用フィルタ１０６ｄのフィルタ次数は合成フィルタ１０６ｅのものに比べて非常に大きくなるからである。例えば、サンプリング周波数が１０ｋＨｚであれば、エコー長が５０ｍｓの擬似エコー生成用フィルタ１０６ｄのフィルタ次数は５００次であるのに対し、合成フィルタ１０６ｅのフィルタ次数は１０〜１８次程度である。 Note that the echo canceller of the present embodiment has an increased number of synthesis filters 106e compared to the echo canceller of Non-Patent Document 1, but the generation of pseudo echoes rather than an increase in the amount of computation due to the addition of the synthesis filter 106e. The amount of calculation reduced in the filter 106d for use is much larger. This is because, usually, the duration of the echo to be canceled, that is, the tap length of the pseudo echo generation filter 106d is very large, so that the filter order of the pseudo echo generation filter 106d is much larger than that of the synthesis filter 106e. Because it becomes. For example, if the sampling frequency is 10 kHz, the filter order of the pseudo-echo generation filter 106d having an echo length of 50 ms is 500th, whereas the filter order of the synthesis filter 106e is about 10-18.

このように、本実施の形態によれば、マルチパルス符号化等を用いて音源の情報圧縮を行った場合、擬似エコー生成用フィルタへの入力信号も疎なパルス列となり、擬似エコー生成用フィルタにおいて値０のサンプルに対する演算を省略することができるため、従来のエコーキャンセラに比べてフィルタの演算量を削減することができる。 Thus, according to the present embodiment, when sound source information compression is performed using multi-pulse encoding or the like, the input signal to the pseudo echo generation filter also becomes a sparse pulse train, and in the pseudo echo generation filter Since the calculation for the sample having the value 0 can be omitted, the calculation amount of the filter can be reduced as compared with the conventional echo canceller.

（実施の形態２）
図６は、本発明の実施の形態２に係るエコーキャンセラを含むディジタル通信装置の構成を示すブロック図である。なお、図６において、図２と共通する構成部分には、図２と同一符号を付し、その詳しい説明を省略する。 (Embodiment 2)
FIG. 6 is a block diagram showing a configuration of a digital communication apparatus including an echo canceller according to Embodiment 2 of the present invention. In FIG. 6, the same components as those in FIG. 2 are denoted by the same reference numerals as those in FIG. 2, and detailed description thereof is omitted.

図６に示すディジタル通信装置は、エコーキャンセラ２０６の内部構成が、図２のディジタル通信装置のエコーキャンセラ１０６のものと異なる。エコーキャンセラ２０６は、エコーキャンセラ１０６に対して、擬似エコー生成用フィルタ１０６ｄ及び合成フィルタ１０６ｅを削除し、合成フィルタ２０６ａを追加した構成を採る。 In the digital communication apparatus shown in FIG. 6, the internal configuration of the echo canceller 206 is different from that of the echo canceller 106 of the digital communication apparatus in FIG. The echo canceller 206 has a configuration in which the pseudo echo generation filter 106d and the synthesis filter 106e are deleted from the echo canceller 106, and a synthesis filter 206a is added.

合成フィルタ２０６ａは、適応フィルタ１０６ａで生成された白色化擬似エコーｒｗ（ｔ）を、線形予測係数Ｌ（ｔ）を用いて合成し、擬似エコーｒ（ｔ）を生成する。 The synthesis filter 206a synthesizes the whitened pseudo echo rw (t) generated by the adaptive filter 106a using the linear prediction coefficient L (t) to generate a pseudo echo r (t).

図７は、本実施の形態に係るエコーキャンセラの各フィルタの入力信号の状態を示す図である。 FIG. 7 is a diagram showing the state of the input signal of each filter of the echo canceller according to the present embodiment.

図７に示すように、本実施の形態に係るエコーキャンセラでは、適応フィルタ１０６ａへの入力信号が疎なパルス列であるので適応フィルタ１０６ａの演算量を削減することができる。また、本実施の形態に係るエコーキャンセラでは、次数が小さい（タップ長が短い）合成フィルタ２０６ａにより、適応フィルタ１０６ａから出力された白色化擬似エコーを用いて擬似エコーを生成するので、非特許文献１のエコーキャンセラに対してフィルタの演算量を削減することができる。 As shown in FIG. 7, in the echo canceller according to the present embodiment, since the input signal to the adaptive filter 106a is a sparse pulse train, the calculation amount of the adaptive filter 106a can be reduced. Further, in the echo canceller according to the present embodiment, the pseudo filter is generated using the whitened pseudo echo output from the adaptive filter 106a by the synthesis filter 206a having a small order (short tap length). The calculation amount of the filter can be reduced with respect to one echo canceller.

このように、本実施の形態によれば、マルチパルス符号化等を用いて音源の情報圧縮を行った場合、適応フィルタにおいて疎なパルス列を用いたフィルタ演算により得られた白色化擬似エコーを用いて擬似エコーを生成することができるので、従来のエコーキャンセラに比べてフィルタの演算量を削減することができる。 Thus, according to the present embodiment, when sound source information compression is performed using multi-pulse coding or the like, the whitened pseudo echo obtained by the filter operation using a sparse pulse train is used in the adaptive filter. Therefore, the amount of calculation of the filter can be reduced as compared with the conventional echo canceller.

なお、実施の形態１と実施の形態２とは、次数が小さい合成フィルタを追加することにより、次数が大きいすべてのフィルタ（適応フィルタ、擬似エコー生成用フィルタ）への入力信号を線形予測残差ベクトルとすることができ、マルチパス符号化等を用いて音源情報を圧縮する場合に、フィルタの演算量を削減することができるという共通の効果を有する。 In the first and second embodiments, a synthesis filter having a small order is added, so that input signals to all the filters (adaptive filter and pseudo echo generation filter) having a large order are converted into linear prediction residuals. It can be a vector, and has a common effect that the amount of calculation of the filter can be reduced when the sound source information is compressed using multi-pass coding or the like.

以下、非特許文献１、実施の形態１及び実施の形態２の各方式について、エコーキャンセラの演算量の比較を行う。ここで、適応フィルタおよび擬似エコー生成用フィルタのタップ長をＮ，線形予測分析合成次数をＰ次とする。また、適応フィルタの演算量はタップ長のｋ倍（すなわちｋＮ）、擬似エコー生成用フィルタの演算量はタップ長と等しいＮ、線形予測合成フィルタ及び逆フィルタの演算量は次数と等しいＰとする。また、マルチパルス符号化により線形予測合成音源の情報圧縮をした場合の圧縮比をαとする。なお、一周期Ｍサンプル分の音声を合成する時の合成音源信号のパルス数はαＭ個となる。 Hereinafter, the calculation amount of the echo canceller is compared for each method of Non-Patent Document 1, Embodiment 1 and Embodiment 2. Here, the tap length of the adaptive filter and the pseudo echo generation filter is N, and the linear prediction analysis synthesis order is P order. The calculation amount of the adaptive filter is k times the tap length (ie, kN), the calculation amount of the pseudo echo generation filter is N equal to the tap length, and the calculation amounts of the linear prediction synthesis filter and the inverse filter are P equal to the order. . In addition, the compression ratio when the information compression of the linear predictive synthesized sound source is performed by multipulse coding is α. Note that the number of pulses of the synthesized sound source signal when synthesizing speech for one period of M samples is αM.

＜非特許文献１＞
適応フィルタ αｋＮ
擬似エコー生成用フィルタＮ
逆フィルタＰ
−−−−−−−−−−−−−−−−−−−−−−−−−−
合計（αｋ＋１）Ｎ＋Ｐ <Non-Patent Document 1>
Adaptive filter αkN
Pseudo echo generation filter N
Inverse filter P
-------------------------
Total (αk + 1) N + P

＜実施の形態１＞
適応フィルタ１０６ａ αｋＮ
擬似エコー生成用フィルタ１０６ｄ αＮ
逆フィルタ１０６ｂＰ
合成フィルタ１０６ｅＰ
−−−−−−−−−−−−−−−−−−−−−−−−−−
合計（αｋ＋α）Ｎ＋２Ｐ <Embodiment 1>
Adaptive filter 106a αkN
Pseudo echo generation filter 106d αN
Inverse filter 106b P
Synthesis filter 106e P
-------------------------
Total (αk + α) N + 2P

＜実施の形態２＞
適応フィルタ１０６ａ αｋＮ
逆フィルタ１０６ｂＰ
合成フィルタ２０６ａＰ
−−−−−−−−−−−−−−−−−−−−−−−−−−
合計 αｋＮ＋２Ｐ <Embodiment 2>
Adaptive filter 106a αkN
Inverse filter 106b P
Synthesis filter 206a P
-------------------------
Total αkN + 2P

例えば、タップ長Ｎ＝５００，次数Ｐ＝１６，音源情報圧縮比α＝０．１，係数ｋ＝３とすると、演算量は以下の結果となる。なお、ｋの値は、使用する適応アルゴリズムにより異なる。ｋ＝３はＮＬＳＭアルゴリズムを使用した場合の値である。 For example, when the tap length N = 500, the order P = 16, the sound source information compression ratio α = 0.1, and the coefficient k = 3, the calculation amount is as follows. Note that the value of k varies depending on the adaptive algorithm used. k = 3 is a value when the NLSM algorithm is used.

＜非特許文献１＞
演算量＝（αｋ＋１）Ｎ＋Ｐ
＝（０．１×３＋１）×５００＋１６
＝６６６ <Non-Patent Document 1>
Calculation amount = (αk + 1) N + P
= (0.1 × 3 + 1) × 500 + 16
= 666

＜実施の形態１＞
演算量＝（αｋ＋α）Ｎ＋２Ｐ
＝（０．１×３＋０．１）×５００＋２×１６
＝２３２ <Embodiment 1>
Calculation amount = (αk + α) N + 2P
= (0.1 × 3 + 0.1) × 500 + 2 × 16
= 232

＜実施の形態２＞
演算量＝αｋＮ＋２Ｐ
＝０．１×３×５００＋２×１６
＝１８２ <Embodiment 2>
Calculation amount = αkN + 2P
= 0.1 × 3 × 500 + 2 × 16
= 182

上記の結果から明らかなように、実施の形態１及び実施の形態２の方式の方が、非特許文献１よりも演算量を大幅に削減することができる。また、実施の形態２の方式が最も演算量が少ないことが分かる。 As is clear from the above results, the amount of calculation can be significantly reduced in the methods of the first and second embodiments than in the non-patent document 1. It can also be seen that the method of Embodiment 2 has the least amount of computation.

なお、本発明は、図２における分析フィルタ１０７ａの線形予測係数出力と合成フィルタ１０１ｃの線形予測係数入力とを短絡し、音源特性分析回路１０７ｂの音源特性パラメータ出力と音源信号発生回路１０１ｂの音源特性パラメータ入力とを短絡した構成の拡声装置のハウリングキャンセラにも適用することができる。 In the present invention, the linear prediction coefficient output of the analysis filter 107a and the linear prediction coefficient input of the synthesis filter 101c in FIG. 2 are short-circuited, the sound source characteristic parameter output of the sound source characteristic analysis circuit 107b and the sound source characteristic of the sound source signal generation circuit 101b. The present invention can also be applied to a howling canceller of a loudspeaker having a configuration in which a parameter input is short-circuited.

本発明は、双方向通信システム（無線電話、有線電話、インターホン、ＴＶ会議システム等）のエコーキャンセラ、ハウリングキャンセラ、拡声装置のハウリングキャンセラ、補聴器のハウリングキャンセラ等に用いるに好適である。 The present invention is suitable for use in an echo canceller, a howling canceller, a howling canceller for a loudspeaker, a howling canceller for a hearing aid, etc. in a bidirectional communication system (wireless telephone, wired telephone, interphone, video conference system, etc.).

１０１音声復号回路
１０１ａデマルチプレクサ
１０１ｂ音源信号発生回路
１０１ｃ合成フィルタ
１０２ディジタル・アナログ変換器
１０３スピーカ
１０４マイクロホン
１０５アナログ・ディジタル変換器
１０６、２０６エコーキャンセラ
１０６ａ適応フィルタ
１０６ｂ逆フィルタ
１０６ｃ加算器
１０６ｄ擬似エコー生成用フィルタ
１０６ｅ、２０６ａ合成フィルタ
１０６ｆ加算器
１０７音声符号化回路
１０７ａ分析フィルタ
１０７ｂ音源特性分析回路
１０７ｃマルチプレクサ
２０６ａ合成フィルタ
DESCRIPTION OF SYMBOLS 101 Speech decoding circuit 101a Demultiplexer 101b Sound source signal generation circuit 101c Synthesis filter 102 Digital / analog converter 103 Speaker 104 Microphone 105 Analog / digital converter 106, 206 Echo canceller 106a Adaptive filter 106b Inverse filter 106c Adder 106d For pseudo echo generation Filters 106e and 206a Synthesis filter 106f Adder 107 Speech encoding circuit 107a Analysis filter 107b Sound source characteristic analysis circuit 107c Multiplexer 206a Synthesis filter

Claims

Speech decoding that receives a linear prediction coefficient and a linear prediction residual vector obtained by linear prediction analysis, drives a first synthesis filter with the linear prediction coefficient, and synthesizes the linear prediction residual vector to obtain a received speech signal An echo canceller provided in a digital communication device provided with a circuit,
An adaptive filter that computes the linear prediction residual vector with a tap coefficient to generate a first whitened pseudo echo, and updates the tap coefficient according to a given whitened residual signal;
An inverse filter having a reverse characteristic to the first synthesis filter and using the linear prediction coefficient to whiten a transmission voice signal and obtain a whitened transmission voice signal;
A first adder for subtracting the first whitened pseudo echo from the whitened transmission audio signal to obtain a whitened residual signal;
A pseudo-echo generation filter that calculates the linear prediction residual vector using the tap coefficient updated by the adaptive filter to generate a second whitened pseudo-echo;
A second synthesis filter for synthesizing the second whitened pseudo echo using the linear prediction coefficient to generate a pseudo echo;
A second adder that subtracts the pseudo echo from the transmission voice signal to obtain a residual signal;
An echo canceller comprising:

Speech decoding that receives a linear prediction coefficient and a linear prediction residual vector obtained by linear prediction analysis, drives a first synthesis filter with the linear prediction coefficient, and synthesizes the linear prediction residual vector to obtain a received speech signal An echo canceller provided in a digital communication device provided with a circuit,
An adaptive filter that computes the linear prediction residual vector with a tap coefficient to generate a whitened pseudo echo, and updates the tap coefficient according to a given whitened residual signal;
An inverse filter having a reverse characteristic to the first synthesis filter and using the linear prediction coefficient to whiten a transmission voice signal and obtain a whitened transmission voice signal;
A first adder that subtracts the whitened pseudo echo from the whitened transmission audio signal to obtain a whitened residual signal;
A second synthesis filter that has the same characteristics as the first synthesis filter, synthesizes the whitened pseudo echo using the linear prediction coefficient, and generates a pseudo echo;
A second adder that subtracts the pseudo echo from the transmission voice signal to obtain a residual signal;
An echo canceller comprising:

The echo canceller according to claim 1 or 2 , wherein the linear prediction residual vector is a finite number of non-zero excitation pulse trains.