JP2001273000A

JP2001273000A - Adaptive noise suppressing speech encoder

Info

Publication number: JP2001273000A
Application number: JP2000082697A
Authority: JP
Inventors: Shigeru Hosoi; 茂細井; Sachio Aoyama; 左千男青山; Shogo Iizuka; 捷吾飯塚; Kazuki Hoshino; 一樹星野
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2000-03-23
Filing date: 2000-03-23
Publication date: 2001-10-05

Abstract

PROBLEM TO BE SOLVED: To provide an adaptive noise suppressing speech encoder which detects a noise level during current talk, makes it possible to eliminate the noise or reduce an influence thereof, and also prevents a signal processing time from increasing by integrating the speech encoder and a noise suppressor in one body and further simplifying the software. SOLUTION: An adaptive noise suppressing speech encoder without a malfunction due to noise can be obtained by paying attention to a feature of an auto- correlation function between a speech signal and noise, subtracting a noise component from the auto-correlation calculated by an auro-correlation function calculating circuit 1303, obtaining an auto-correlation function of only the speech signal, creating a speech encoding parameter by a prediction coefficient calculating circuit 102 using the value, and further eliminating the noise by an internal operation of the speech encoder by a block A for eliminating the noise during talking.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、有線、無線の電話
（携帯電話や自動車電話）等の通信技術分野に適用可能
な適応型雑音抑圧音声符号化装置に関し、特に、現在通
話中の雑音レベルを検出し、それを除去する、またはそ
の影響を軽減することを可能とするとともに、音声符号
化装置と雑音抑圧装置を一体化して、ハードウエアを簡
略化し、信号処理時間の増大を防止するよう構成したも
のである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an adaptive noise suppression speech coding apparatus applicable to communication technology fields such as wired and wireless telephones (mobile telephones and automobile telephones). To reduce or reduce the effect thereof, and to integrate the speech encoder and the noise suppressor to simplify the hardware and prevent an increase in signal processing time. It is composed.

【０００２】[0002]

【従来の技術】近年、様々な環境の中で音声を入力情報
とした装置が使われるようになつてきたため、騒音環境
の中でも使用できることが重要となつてきた。携帯電話
や自動車電話等もその１例である。またIC化技術の進歩
により、このような装置にDSP(デジタルシグナルプロセ
ツサ)を用いて、かなり高度のデジタル信号処理技術を
用いて、音声信号に重畳した周囲雑音を低減する方法が
可能となつてきた。以下、従来の雑音抑圧装置につい
て、図面を参照しながら説明する。2. Description of the Related Art In recent years, devices using voice as input information have been used in various environments, and it has become important to be able to use the devices even in a noisy environment. Cellular phones and car phones are also examples. Advances in IC technology have made it possible to reduce the ambient noise superimposed on the audio signal by using a DSP (Digital Signal Processor) in such a device, and by using advanced digital signal processing technology. Have been. Hereinafter, a conventional noise suppression device will be described with reference to the drawings.

【０００３】従来の雑音抑圧装置に関しては、その方法
が、1979年4月に発表された、「IEEE Transaction on A
coustics, speech, and Signal Processing，vol. ASSP
-27， NO.2,pp, 113-120」" Supression of Acoustic N
oise Speech Using Spectral Subtraction "に、スペク
トルサブトラクション法として、開示されている。この
スペクトルサブトラクション法について、図２５に示す
従来の第１の雑音抑圧装置を用いて簡単に説明する。図
２５は従来の第１の雑音抑圧装置の構成を示したブロッ
ク図である。図２５において、901は切り出し部、902は
フーリエ分析部、903は雑音スペクトル平均算出部、904
は雑音スペクトルメモリ、905は雑音信号判定部、906は
振幅引算部、907は半波整流部、908は逆フーリエ変換
部、909は波形再生部である。以上のような構成要素か
らなる従来の雑音抑圧装置について、その動作を図２６
に示すフローチャートを用いて以下に説明する。[0003] Regarding the conventional noise suppressor, the method is described in "IEEE Transaction on A" published in April 1979.
coustics, speech, and Signal Processing, vol. ASSP
-27, NO.2, pp, 113-120 "" Supression of Acoustic N
oise Speech Using Spectral Subtraction "as a spectrum subtraction method. This spectrum subtraction method will be briefly described using a conventional first noise suppression device shown in FIG. Fig. 25 is a block diagram showing a configuration of a noise suppression device of Fig. 25. In Fig. 25, reference numeral 901 denotes a cutout unit, 902 denotes a Fourier analysis unit, 903 denotes a noise spectrum average calculation unit, and 904.
Is a noise spectrum memory, 905 is a noise signal determination unit, 906 is an amplitude subtraction unit, 907 is a half-wave rectification unit, 908 is an inverse Fourier transform unit, and 909 is a waveform reproduction unit. The operation of the conventional noise suppressor composed of the above components is shown in FIG.
This will be described below using the flowchart shown in FIG.

【０００４】ステップ1001として、入力された音声に対
して、切り出し部901において、後でフーリエ分析する
際に必要な周波数分解精度が得られる窓をかけて、10ms
程度の周期で20ms程度の区間の一定長のフレームにおけ
る波形を切り出す。このように連続する波形から一定区
間の波形を切り出すため窓の条件として、後の合成処理
で合成して接続したときに不連続な波形にならず、か
つ、フーリエ分析の精度を確保でき、元の波形が得られ
るように、50%ずつオーバーラツプさせた分析窓を選択
する。この窓関数として、下記のハニング窓が用いられ
る。ここで、フーリエ分析のデータ数をNとすると、窓
掛け後のn番目のデータに対する重みずけW(n)は式(１)
に示すように表される。In step 1001, the input speech is cut by a window for obtaining a frequency resolution accuracy required for Fourier analysis later in the cutout unit 901 for 10 ms.
A waveform in a fixed-length frame of a section of about 20 ms is cut out at a cycle of about. As a condition of the window for cutting out a waveform of a certain section from a continuous waveform in this manner, a discontinuous waveform does not occur when combined and connected in a later combining process, and the accuracy of Fourier analysis can be secured. Select an analysis window that overlaps by 50% so that the waveform of The following Hanning window is used as this window function. Here, assuming that the number of data of the Fourier analysis is N, the weighting W (n) for the n-th data after windowing is expressed by Equation (1).
It is expressed as shown below.

【０００５】 W ( n ) = 0.5-0.5＊cos[2nπ/ (N-1)] （１）W (n) = 0.5−0.5 * cos [2nπ / (N−1)] (1)

【０００６】ステップ1002として、フーリエ部902にお
いて、分析すべきフレームの音声波形を、フーリエ分析
で周波数分析(FFT分析)して、この分析結果が実数部と
虚数部から構成されるため、これらを振幅スペクトルと
位相スペクトルとに分解する。振幅スペクトルは、雑音
スペクトルメモリ904、雑音信号判定部905および雑音引
き算部906に出力される。一方、位相スペクトルは、音
声波形の再合成のため、逆フーリエ変換部908に出力さ
れる。In step 1002, the Fourier section 902 performs frequency analysis (FFT analysis) on the speech waveform of the frame to be analyzed by Fourier analysis, and the analysis result is composed of a real part and an imaginary part. Decompose into an amplitude spectrum and a phase spectrum. The amplitude spectrum is output to noise spectrum memory 904, noise signal determination section 905, and noise subtraction section 906. On the other hand, the phase spectrum is output to inverse Fourier transform section 908 for resynthesizing the speech waveform.

【０００７】ステップ1003として、雑音信号判定部905
において、入力された信号が音声信号に雑音信号が重畳
した信号であるか、雑音のみの信号であるかを判定す
る。この判定は、入力信号のパワー情報に基づいて行
い、パワーの大きいところでは、音声の有る音声区間と
し、パワーの小さいところでは、音声のない雑音区間と
する。この音声の”あり”、”なし” の情報を雑音ス
ペクトルメモリ904に出力する。In step 1003, a noise signal determination unit 905
In, it is determined whether the input signal is a signal in which a noise signal is superimposed on a voice signal or a signal including only noise. This determination is made based on the power information of the input signal. A high-power portion is determined to be a voiced voice section, and a low-power portion is determined to be a no-voice noise section. The information “present” and “absent” of this voice is output to the noise spectrum memory 904.

【０００８】ステップ1004として、雑音スペクトルメモ
リにおいて、過去数10msの区間の雑音スペクトルを記憶
している。雑音区間においては、フーリエ分析部902か
らの振幅スペクトルを記憶する。また音声区間において
は、雑音スペクトルメモリ904にフーリエ分析部902から
の振幅スペクトルを記憶しない。In step 1004, the noise spectrum in the past several tens of ms is stored in the noise spectrum memory. In the noise section, the amplitude spectrum from Fourier analysis section 902 is stored. In the voice section, the noise spectrum memory 904 does not store the amplitude spectrum from the Fourier analysis unit 902.

【０００９】このため、常に雑音スペクトルメモリ904
に最新の雑音スペクトルを記憶していることになる。ス
テップ1005として、雑音スペクトル平均算定部903で
は、雑音スペクトルメモリ904に記憶されている過去数1
0フレームの振幅スペクトルの平均スペクトルを、雑音
の推定量の振幅情報として計算し、音声發声の直前の雑
音の振幅スペクトル、すなわち雑音スペクトルとする。
このように雑音スペクトルを計算するために用いるメモ
リ量は、フーリエ分析で分析される、窓長個＊記憶フレ
ーム数となる。For this reason, the noise spectrum memory 904 is always used.
Means that the latest noise spectrum is stored. In step 1005, the noise spectrum average calculating unit 903 calculates the past number 1 stored in the noise spectrum memory 904.
The average spectrum of the amplitude spectrum of the 0 frame is calculated as the amplitude information of the estimated amount of noise, and is set as the amplitude spectrum of the noise immediately before speech utterance, that is, the noise spectrum.
Thus, the amount of memory used to calculate the noise spectrum is the number of window lengths * the number of storage frames analyzed by Fourier analysis.

【００１０】ステップ1006として、振幅引算部906で
は、フーリエ分析部902からの雑音の重畳した音声信号
の各周波数に対応するした振幅から、雑音スペクトル平
均算出部903の各周波数に対する雑音の振幅を、各々の
周波数について引算することによつて、音声のみの振幅
を計算する。At step 1006, the amplitude subtraction unit 906 calculates the noise amplitude for each frequency of the noise spectrum average calculation unit 903 from the amplitude corresponding to each frequency of the voice signal on which the noise from the Fourier analysis unit 902 is superimposed. , The amplitude of the voice only is calculated by subtraction for each frequency.

【００１１】ステップ1007として、半波整流部907で
は、振幅引算部906での引き算の結果、振幅が負になる
周波数について、その振幅を、ゼロにする。ステップ10
08として、逆フーリエ変換部908では、半波整流部907か
らの各周波数に対応する振幅と、フーリエ分析部902か
らの各信号に対応する位相とから、雑音が除かれた、音
声波形を合成することにより、１フレーム分の雑音のな
い音声波形を生成する。In step 1007, the half-wave rectification unit 907 sets the amplitude of the frequency at which the amplitude is negative as a result of the subtraction in the amplitude subtraction unit 906 to zero. Step 10
As 08, the inverse Fourier transform unit 908 synthesizes a voice waveform from which noise is removed from the amplitude corresponding to each frequency from the half-wave rectification unit 907 and the phase corresponding to each signal from the Fourier analysis unit 902. By doing so, a noise-free speech waveform for one frame is generated.

【００１２】ステップ1009として、波形再生部909で
は、逆フーリエ変換部908からの１フレーム分の雑音の
ない音声波形を、切り出し部における切り出し処理の逆
の処理を行つて、１フレームずつ、１フレームの音声波
形をオーバーラツプさせ、加算して元の連続波形を出力
する。In step 1009, the waveform reproducing unit 909 performs a reverse process of the clipping process in the clipping unit on the one-frame noise-free speech waveform from the inverse Fourier transform unit 908, and performs frame-by-frame processing for one frame. Are overlapped and added to output the original continuous waveform.

【００１３】以上説明した装置（または方法）は、雑音
の重畳した音声信号をデジタル化して、窓関数をかけて
一定フレームずつ取出し、FFT（高速フーリエ変換）し
て離散化された周波数領域の信号とし、一方、音声の無
い期間の雑音レベルを、予めフレーム単位で、同じくFF
T（高速フーリエ変換）して離散化された周波数領域の
信号として予め記憶装置に記憶しておき、その値を各周
波数ごとに差し引くと言う構成であるが、別の装置（方
法）も従来例として考えられている。The above-described apparatus (or method) digitizes an audio signal on which noise is superimposed, applies a window function to extract the audio signal at a constant frame rate, and performs FFT (Fast Fourier Transform) to discretize the signal in the frequency domain. On the other hand, the noise level during the period when there is no voice is previously
In this configuration, a signal is stored in a storage device in advance as a signal in the frequency domain that has been discretized by T (Fast Fourier Transform), and the value is subtracted for each frequency. However, another device (method) is also a conventional example. Is considered as.

【００１４】そのような方法の従来の第２の雑音抑圧装
置の構成を示すブロック図を図２７に示す。図２７にお
いて、図示番号901乃至905、ならびに図示番号908、909
は図２５に示すものと同一のため省略する。図示番号11
01は各チャンネルS/N測定部、1102は各チャンネル利得
制御部、1103は各チャンネル別可変利得増幅器である。FIG. 27 is a block diagram showing the configuration of a second conventional noise suppression device using such a method. 27, illustrated numbers 901 to 905, and illustrated numbers 908 and 909
Are the same as those shown in FIG. Drawing number 11
01 is an S / N measurement unit for each channel, 1102 is a gain control unit for each channel, and 1103 is a variable gain amplifier for each channel.

【００１５】以上のような構成要素からなる従来の第２
の雑音抑圧装置について、その動作を図２８に示すフロ
ーチャートを用いて以下に説明する。ステップ1001乃至
ステップ1005は、図２５ですでに説明した動作を行う。[0015] The second conventional system comprising the above components is described.
The operation of the noise suppressor described above will be described below with reference to the flowchart shown in FIG. Steps 1001 to 1005 perform the operation already described with reference to FIG.

【００１６】次にステップ1104では、各チャンネルS/N
測定部では、周波数領域に変換された各周波数ごとに
（周波数は離散的な値を取る）現在の雑音を含む音声信
号のレベルと、ステップ1005で雑音スペクトル・メモリ
904に貯えられた各周波数ごとの雑音レベルを取出し信
号対雑音比（S/N比）を計算し、ステップ1105では、各
チャンネル利得制御部1102で、その値を用いて（信号対
雑音比（S/N比）をそのまま用いるのではなく、ある換
算値を用いる場合もある。）利得制御の値として、各チ
ャンネル別可変利得制御部1103に渡す。Next, in step 1104, each channel S / N
In the measuring section, for each frequency converted to the frequency domain (the frequency takes a discrete value), the level of the current speech signal including noise is stored in the noise spectrum memory in step 1005.
The noise level of each frequency stored in 904 is taken out, and a signal-to-noise ratio (S / N ratio) is calculated. In step 1105, each channel gain control unit 1102 uses the value to obtain a signal-to-noise ratio (S / N ratio). Instead of using the S / N ratio as it is, a certain converted value may be used.) The gain control value is passed to the variable gain control unit 1103 for each channel.

【００１７】ステップ1106では、各チャンネル別可変利
得制御部からの値を用いて各チャンネル別に利得を変え
て雑音を含む音声信号を増幅する。すなわち比の良いチ
ャンネルでは、利得を大にし、比の悪いチャンネルでは
利得を小にすることにより雑音を抑圧することが可能と
なる。なる実際は各周波数ごとに利得を変えて制御する
のは、数が多くなるので、周波数を数ヶずつグループに
分けチャンネルとし、チャンネル別に制御している。ス
テップ1008、1009の動作は図２５で説明した。In step 1106, the audio signal containing noise is amplified by changing the gain for each channel using the value from the variable gain control unit for each channel. That is, it is possible to suppress noise by increasing the gain in a channel having a good ratio and decreasing the gain in a channel having a poor ratio. Actually, since controlling the gain by changing the frequency for each frequency increases the number, the frequency is divided into groups by several groups and the channels are controlled. The operations of steps 1008 and 1009 have been described with reference to FIG.

【００１８】以上に説明した、従来の雑音抑圧装置は、
過去に音声がないと判断したフレームの時の雑音レベル
を周波数帯域毎に記憶しておき、音声が有ると判断して
いる現在のフレームの雑音を含む音声信号を周波数領域
に変換し各周波数帯域ごとに、記憶してある雑音のレベ
ルを差し引く、または、各周波数毎にS/N比を計算し各
周波数毎に増幅器の利得を制御するというものであり、
現在通話しているフレームにおける雑音レベルに適応し
て、雑音を低減するものではない。また、従来の雑音抑
圧装置は、雑音抑圧装置を現在携帯電話で使用されてい
る音声符号化装置と一体化して、ハードウエアの簡易化
をはかると言うものではない。The conventional noise suppressor described above is
The noise level of a frame determined to have no voice in the past is stored for each frequency band, and a voice signal including noise of the current frame determined to have voice is converted to a frequency domain, and each frequency band is For each, subtract the stored noise level, or calculate the S / N ratio for each frequency and control the gain of the amplifier for each frequency,
It does not reduce noise by adapting to the noise level in the frame currently being talked to. Further, the conventional noise suppressor is not intended to simplify the hardware by integrating the noise suppressor with a speech encoder currently used in mobile phones.

【００１９】本発明は、音声信号の中に含まれる雑音の
量を推定するのに、雑音を含む入力信号の自己相関関数
を計算し、自己相関関数の有する特徴を利用して、現在
の音声信号の中に含まれる雑音の量を推定する。ところ
がこれを使用するデジタル方式（TDMA方式、または、CD
MA方式）携帯電話では、雑音抑圧装置の後には、音声符
号化装置が接続される。そして音声符号化装置では、や
はり入力信号の自己相関関数を計算し符号化のパラメー
タの算出系に利用している。According to the present invention, to estimate the amount of noise contained in a speech signal, an autocorrelation function of an input signal containing noise is calculated, and a characteristic of the current speech is utilized by utilizing characteristics of the autocorrelation function. Estimate the amount of noise contained in the signal. However, the digital method using this (TDMA method or CD
(MA system) In a mobile phone, a speech coding device is connected after the noise suppression device. In the speech coding apparatus, the autocorrelation function of the input signal is calculated and used in a coding parameter calculation system.

【００２０】このことに着目すると雑音抑圧装置と音声
符号化装置は、一体化できるはずであり、それにより、
雑音抑圧装置の採用による、回路の複雑化や消費電力の
増加を防げるはずである。そのためには現在、デジタル
方式携帯電話に使用されている音声符号化装置の内容を
十分に把握し、その上で一体化した装置を考案するのが
良いと考えられる。Focusing on this, the noise suppression device and the speech coding device should be able to be integrated, whereby
The adoption of the noise suppression device should prevent the circuit from becoming complicated and increasing power consumption. For that purpose, it is considered good to fully understand the contents of the voice encoding device currently used in the digital mobile phone, and then to devise an integrated device.

【００２１】現在のデジタル方式携帯電話に使用されて
いる音声符号化装置は各種のものがあるが、その殆どが
CELP(Code-Excited Linear Prediction ；符号励振線形
予測)と呼ばれるものである。従つて以下に先行技術と
して、最も広く採用されている、ACELP( Algebraic CEL
P; 代数CELP)と称されるものについて以下に説明する。
説明には、GSM方式携帯電話の規格書である以下の資料
を利用する。「GSM Global System For Moble Communications」Euro
pean Telecommunication Standard ETS 300726 March 1
997“Digital cellular telecommunications system;En
hanced Full Rate speech transcoding」(GSM 06.06)”There are various types of speech coding devices used in current digital cellular phones, but most of them are
This is called CELP (Code-Excited Linear Prediction). Therefore, ACELP (Algebraic CEL), which is most widely adopted as the prior art, is described below.
P; algebra CELP) is described below.
For the explanation, the following materials which are the standard documents of the GSM mobile phone are used. `` GSM Global System For Moble Communications '' Euro
pean Telecommunication Standard ETS 300 726 March 1
997 “Digital cellular telecommunications system; En
hanced Full Rate speech transcoding "(GSM 06.06)"

【００２２】ACELP音声符号化装置の簡易ブロック図を
図２９に示す。図２９において、図示番号1201は固定コ
ードブック、1202は増幅器、1203は適応コードブック、
1204は増幅器、1205は加算器、1206は線形予測合成回
路、1207は聴覚修正回路である。FIG. 29 is a simplified block diagram of the ACELP speech coding apparatus. In FIG. 29, the reference numbers 1201 are a fixed codebook, 1202 is an amplifier, 1203 is an adaptive codebook,
1204 is an amplifier, 1205 is an adder, 1206 is a linear prediction synthesis circuit, and 1207 is an auditory correction circuit.

【００２３】以下、図２９を用いて、ACELP音声符号化
装置の概略の動作を説明する。固定コードブック1201
は、振幅１の正負のランダムインパルス列である、適応
コードブック1203は一定周期の繰り返しパルス列であ
る。それらは増幅器1202及び増幅器1204でそれぞれ異な
るゲイン倍される。ここまでは人間の音声発生メカニズ
ムの音源部分に相当する。The general operation of the ACELP speech coding apparatus will be described below with reference to FIG. Fixed codebook 1201
Is a positive / negative random impulse train having an amplitude of 1. The adaptive codebook 1203 is a repetitive pulse train having a constant period. They are multiplied by different gains in the amplifiers 1202 and 1204, respectively. So far, it corresponds to the sound source part of the human voice generation mechanism.

【００２４】そして、両者は加算器1205に加えられ一緒
になつて線形予測合成回路1206に入力されその出力は音
声信号となる。ここまでのところは、人間の音声発生メ
カニズムの音道部分に相当する。すなわち喉や口、を経
由して音声が外部に出るところまでである。Then, both of them are added to the adder 1205, and they are input together to the linear predictive synthesis circuit 1206, and the output is a voice signal. So far, it corresponds to the sound path portion of the human voice generation mechanism. That is, the sound is output to the outside via the throat and the mouth.

【００２５】そして、線形予測合成回路1206の出力は聴
覚修正回路1207に入力される。その出力は人間の音声感
知特性を考慮して修正された音声信号となる。なお音声
符号化装置から、受信側の復号化装置に送られるのは、
固定コードブック1201、増幅器1202、適応コードブック
1203、増幅器1204、線形予測合成回路1206のパラメータ
である。The output of the linear prediction synthesis circuit 1206 is input to the auditory correction circuit 1207. The output is an audio signal modified in consideration of the human voice sensing characteristics. Note that what is sent from the speech encoding device to the decoding device on the receiving side is
Fixed codebook 1201, amplifier 1202, adaptive codebook
1203, amplifier 1204, and linear prediction synthesis circuit 1206.

【００２６】次にACELP音声符号化装置の詳細機能ブロ
ック図を図３０に示す。図３０において、図示番号1301
は前処理回路、1302は窓掛け回路、1303は自己相関関数
計算回路、1304は予測係数算出回路、1305は係数変換回
路、1306は係数量子化回路、1307はサブフレーム係数内
挿回路、1308Aはサブフレーム係数内挿回路（量子化
前）、1308Bはインパルス応答計算回路、1309は４サブ
フレーム重みずけ音声計算回路、1310Aは開ループピッ
チ遅れ発見回路、1310Bは開ループピッチゲイン発見回
路、1311Aは適用コードブック、1311Bはピッチゲイン調
整回路、1311Cはインパルス応答回路、1311Dは誤差計算
回路、1311Eは誤差最小化回路、1312は最適遅れとゲイ
ン発見回路、1313はピッチゲイン量子化回路、1314は適
用コードブック寄与度計算回路、1316は固定コードブッ
ク、1317Aは固定コードブックゲイン調整回路、1317Bは
インパルス応答回路、1317Cは誤差計算回路、1317Dは誤
差最小化回路、1317Eは固定コードブック最適値発見回
路、1318は固定コードブックゲイン量子化回路、1319は
励振計算回路、1320は最適フィルタ記憶回路である。Next, FIG. 30 shows a detailed functional block diagram of the ACELP speech coding apparatus. Referring to FIG.
Is a preprocessing circuit, 1302 is a windowing circuit, 1303 is an autocorrelation function calculation circuit, 1304 is a prediction coefficient calculation circuit, 1305 is a coefficient conversion circuit, 1306 is a coefficient quantization circuit, 1307 is a subframe coefficient interpolation circuit, and 1308A is Subframe coefficient interpolation circuit (before quantization), 1308B is an impulse response calculation circuit, 1309 is a 4-subframe weighted speech calculation circuit, 1310A is an open loop pitch delay detection circuit, 1310B is an open loop pitch gain detection circuit, 1311A Is an application codebook, 1311B is a pitch gain adjustment circuit, 1311C is an impulse response circuit, 1311D is an error calculation circuit, 1311E is an error minimization circuit, 1312 is an optimum delay and gain finding circuit, 1313 is a pitch gain quantization circuit, 1314 is Applied codebook contribution calculation circuit, 1316 is a fixed codebook, 1317A is a fixed codebook gain adjustment circuit, 1317B is an impulse response circuit, 1317C is an error calculation circuit, and 1317D is an error minimization circuit. 1313E, a fixed codebook optimum value finding circuit, 1318, a fixed codebook gain quantization circuit, 1319, an excitation calculation circuit, and 1320, an optimum filter storage circuit.

【００２７】なお、図２９、図３０で“回路“と言う名
称を使用したが、実際にはこれらは、現在ではデジタル
シグナルプロセッサ（DSP）と呼ばれる１チツプの半導
体集積回路であってコンピュータとプログラム記憶装置
で構成されているが、説明の便宜上、上記の名前を使用
する。Although the name "Circuit" is used in FIGS. 29 and 30, it is actually a one-chip semiconductor integrated circuit which is now called a digital signal processor (DSP). Although it is constituted by a storage device, the above names are used for convenience of explanation.

【００２８】以下に図３０を用いて、ACELP音声符号化
装置の概略の動作を説明する。The general operation of the ACELP speech coding apparatus will be described below with reference to FIG.

【００２９】[前処理回路1301]前処理回路1301は携帯電
話のマイクロホンが音声信号を電気信号として出力した
ものを増幅し、低域濾波器を通し、低域濾波器の遮断周
波数の２倍の周波数でサンプリングし、A/D変換器を通
しデジタル信号として音声符号化装置に供給する働きを
する。具体的な数値を１例として上げると、低域濾波器
の遮断周波数は４kHz、サンプリング周波数は８kHz、A/
D変換器は１サンプルを13bitsのデジタル信号とする。
このデジタル信号は窓掛け回路1302に入力される。[Pre-processing circuit 1301] The pre-processing circuit 1301 amplifies the output of the audio signal of the microphone of the portable telephone as an electric signal, passes through a low-pass filter, and is twice the cutoff frequency of the low-pass filter. It works by sampling at a frequency and supplying it as a digital signal to an audio encoder through an A / D converter. Taking specific values as an example, the cutoff frequency of the low-pass filter is 4 kHz, the sampling frequency is 8 kHz, and the A /
The D converter converts one sample into a 13-bit digital signal.
This digital signal is input to the windowing circuit 1302.

【００３０】[窓掛け回路1302]窓掛け回路1302で、デジ
タル信号に窓掛け関数を掛けて取り出す状況を図３１に
示す。すなわち音声信号の１フレームは20msであるが、
それを４等分し、１サブフレーム５msの４サブフレーム
で構成する。そして、実際にデータとして取り込むのは
第２サブフレームと第４サブフレームであり、信号の予
測係数の計算は、これらのサブフレームについて計算し
第１サブフレームと第３サブフレームについてはこれら
の計算結果を用いて、内挿計算により求める。第２サブ
フレームと第４サブフレームに掛ける窓関数のかたち
は、図３１に示すように異なる形をしている。取り込む
データは30msの期間で、240サンプル有ることは両者同
じである。[Windowing circuit 1302] FIG. 31 shows a situation where the windowing circuit 1302 multiplies a digital signal by a windowing function and extracts the digital signal. That is, one frame of the audio signal is 20 ms,
It is divided into four equal parts and is composed of four subframes of 5 ms per subframe. The data actually taken in as the data are the second subframe and the fourth subframe. The prediction coefficient of the signal is calculated for these subframes, and the calculation is performed for the first and third subframes. Using the results, it is determined by interpolation. The shapes of the window functions applied to the second subframe and the fourth subframe have different shapes as shown in FIG. The data to be fetched is in the period of 30 ms, and it is the same that there are 240 samples.

【００３１】[自己相関関数計算回路1303]自己相関関数
計算回路1303は、取り込んだ音声信号に冗長度が有るこ
とを利用して、過去のサンプルから現在のサンプルを予
測する。しかし完全には出来ないので誤差が残る。窓の
掛けられた音声信号を、s’(n)をとすると予測値s”(n)
は下式(２)で表される。ｓ”(n)=−Σ_k=0 ^k=10a_k s’(n-k) （２）すなわち過去10サンプル点のデータで予測している。予
測誤差e(n)は下式(３)で表わされる。ｅ(n)=s’(n)−ｓ”(n)=Σ_k=0 ^k=10a_k s’(n-k) ；ｋ=0,-,-,10；a₀=1 （３）ここでａ₁，ａ₂，−，−，ａ₁₀ を下記の式(４)及び式
（５）が最小になるように選べば良い。ｅ(n)＊ｅ(n)=Σ_k=0 ^k=10a_k s’(n-k)＊Σ_j=0 ^j=10a_j s’(n-ｊ) （４）ｅ(n)＊ｅ(n)=Σ_k=0 ^k=10Σ_j=0 ^j=10a_ka_js’(n-k)＊ s’(n-ｊ) （５）ここで、音声信号が窓を掛けて取り込んだ、短い時間内
では、定常性を仮定できるとすると、s’(n-k)＊ s’(n
-ｊ)は、(n-ｊ)-(n-k)=(k−ｊ)にのみ依ることになり、
下式(６)による自己相関関数で表わされる。ｅ(n)＊ｅ(n)=Σ_k=0 ^k=10Σ_j=0 ^j=10a_ka_j R(k-ｊ) （６）ここで、下式(７)が成立している。 R(k-j)= s’(n-k)＊ s’(n-ｊ) （７）厳密にいえば、ｎを取り込んだ全サンプル値で計算し平
均値をとるのである。[Autocorrelation Function Calculation Circuit 1303] The autocorrelation function calculation circuit 1303 predicts a current sample from a past sample by utilizing the fact that the taken audio signal has redundancy. However, since it cannot be done completely, an error remains. Assuming that the windowed audio signal is s' (n), a predicted value s "(n)
Is represented by the following equation (2). s ″ (n) = − Σ _{k = 0} ^{k = 10} a _k s ′ (nk) (2) That is, prediction is performed using data at the past 10 sample points. The prediction error e (n) is expressed by the following equation (3). . represented by e (n) = s '( n) -s "(n) = Σ k = 0 k = 10 a k s'(nk); k = 0, -, -, 10; a 0 = 1 ( 3) Here, a ₁ , a ₂ , −, −, and a ₁₀ may be selected so that the following equations (4) and (5) are minimized. e (n) * e (n ) = Σ k = 0 k = 10 a k s '(nk) * Σ j = 0 j = 10 a j s' (n-j) (4) e (n) * e (n) = Σ _{k = 0} ^{k = 10} Σ _{j = 0} ^{j = 10} a _k a _j s '(nk) * s' (n-j) (5) Here, the audio signal was captured through a window , S '(nk) * s' (n
-j) depends only on (n-j)-(nk) = (k-j),
It is represented by an autocorrelation function according to the following equation (6). e (n) * e (n) = Σ _{k = 0} ^{k = 10} Σ _{j = 0} ^{j = 10} a _k a _j R (k-j) (6) Here, the following equation (7) holds. . R (kj) = s '(nk) * s' (n-j) (7) Strictly speaking, it is calculated with all sample values in which n has been taken in and averaged.

【００３２】[予測係数算出回路1304]予測係数算出回路
1304は、式(６)を最小にする、ａ₁，ａ₂，-，-，ａ₁₀
を選ぶことにより予測係数を算出する。その方法は、式
(６)式のａ₁，ａ₂，-，-，ａ ₁₀ を、それぞれで偏微分
し、その結果をゼロとする値を選べば良い。計算過程は
省略するがその結果は下記の式(８)が得られる。[Prediction coefficient calculation circuit 1304] Prediction coefficient calculation circuit
1304 minimizes equation (6), a₁, A_Two,-,-, A_Ten
The prediction coefficient is calculated by selecting. The method is
A in equation (6)₁, A_Two,-,-, A _Ten And the partial differential
Then, a value that makes the result zero may be selected. The calculation process is
Although omitted, the result is given by the following equation (8).

【数１】 (Equation 1)

【００３３】上式(８)は、レビンソン・ダービンアル
ゴリズムで解くことが出来るので、最適な予測係数
ａ₁，ａ₂，-，-，ａ₁₀ が、自己相関関数を用いて求ま
ることになる。Since the above equation (8) can be solved by the Levinson-Durbin algorithm, the optimum prediction coefficients a ₁ , a ₂ ,-,-, and a ₁₀ are obtained using the autocorrelation function.

【００３４】[係数変換回路1305]係数変換回路1305は、
この様にして計算された最適な予測係数ａ₁，ａ₂，-，
-，ａ₁₀ は時間領域のパラメータ（インパルス応答の
係数）であるが、これを周波数領域のパラメータであ
る、ライン・スペクトル・ペア（ＬＳＰ）と呼ばれる別
の係数に以下のようにして、変換する。すでに説明した
ように、予測誤差は式(３)で表わされた。[Coefficient conversion circuit 1305] The coefficient conversion circuit 1305
The optimal prediction coefficients a ₁ , a ₂ ,-,
-, but a ₁₀ is a parameter in the time domain (the coefficient of the impulse response), which is a parameter of the frequency domain, the following manner to another factor, called line spectrum pair (LSP), is converted . As described above, the prediction error was expressed by equation (3).

【００３５】これをｚ変換すると、次式（９）が得られ
る。 E(z)= S(z)+Σ_k=1 ^k=10 a_kS(z)z^-k ； k=1,2,3,-,-,10 （９）そして、予測推定回路の伝達関数として、A(z)が下式(10)で定義される。 A(ｚ) = E(z)/S(z)= 1 +Σ_k=1 ^k=10 a_kz^-k ；k=1,2,3,-,-,10 (10) A(ｚ)を用いて、以下の関数(11)、(12)を定義する。 P(z)= A(z)−z^-11A(z^-1) (11) Q(z)= A(z)＋z^-11A(z^-1) （12） P(z)、Q(z)はそれぞれ、(1−z^-1)、(1＋z^-1)で割り切れ
る。そして残部は、zの多項式として表される。そして
この多項式は、zの単位円上に根を有する。従つて、ｚ =e^sT=ｅ^σ+jωT=e ^jωT (13) 上式で、T=1とおくと下式（14）を得る。 (ｚ＋z^-1)/2 =( e ^jω＋ e ^-jω )/2 = cosω(14) となり、この多項式はｃosωに関する５ヶの根をもち、
次式(15)、(16)で表わされる。 P(z)=(1−z^-1)Π_i=1 ⁱ⁼⁵(1−2 z^-1cos(ωi)+z^-2 ) ；ｉ=1,-,-,5 (15) Q(z)=(1＋z^-1)Π_i=1 ⁱ⁼⁵(1−2 z^-1cos(βｉ)+z^-2 ) ；ｉ=1,-,-,5 (16) この、ωiとβｉ合計10ヶの根がライン・スペクトル・
ペア（ＬＳＰ）と呼ばれるものである。When this is z-transformed, the following equation (9) is obtained. E (z) = S (z) + Σ _{k = 1} ^{k = 10} a _k S (z) z ^-k ; k = 1,2,3,-,-, 10 (9) A (z) is defined by the following equation (10) as a function. A (z) = E (z) / S (z) = 1 + Σ _{k = 1} ^{k = 10} a _k z ^-k ; k = 1,2,3,-,-, 10 (10) A (z) Are used to define the following functions (11) and (12). P (z) = A (z) −z ⁻¹¹ A (z ⁻¹ ) (11) Q (z) = A (z) + z ⁻¹¹ A (z ⁻¹ ) (12) P (z), Q ( z) is divisible by (1−z ⁻¹ ) and (1 + z ⁻¹ ), respectively. The remainder is expressed as a polynomial in z. And this polynomial has a root on the unit circle of z. Accordance connexion, at ^{^{z = e sT = e σ +}} jωT = e jωT (13) the above equation is obtained T = 1 far and the following expression (14). (z + z ⁻¹ ) / 2 = (e ^jω + e ^−jω ) / 2 = cosω (14), and this polynomial has five roots related to cosω,
It is expressed by the following equations (15) and (16). P (z) = (1−z ⁻¹ ) Π _{i = 1} ^{i = 5} (1-2 z ⁻¹ cos (ωi) + z ⁻² ); i = 1, −, −, 5 (15) Q ( z) = (1 + z ^-1 ) Π _{i = 1} ^{i = 5} (1-2 z ^-1 cos (βi) + z ^-2 ); i = 1,-,-, 5 (16) This is the sum of ωi and βi 10 roots are line spectrum
This is called a pair (LSP).

【００３６】この、ライン・スペクトル・ペア（ＬＳ
Ｐ）は以下のような利点が有ることが知られている。 (ａ)パラメターの比較だけでシステムの安定判別が出来
る。 (ｂ)周波数軸のパラメータであるため、パラメータの動
きとスペクトル包絡の動きが類似している。すなわちパ
ラメータの補間特性がよい。 (ｃ)スペクトルひずみに対するパラメータの感度がほぼ
一定である。This line spectrum pair (LS
P) is known to have the following advantages. (a) The stability of the system can be determined only by comparing the parameters. (b) Since it is a parameter on the frequency axis, the movement of the parameter is similar to the movement of the spectrum envelope. That is, the parameter interpolation characteristics are good. (c) The sensitivity of the parameter to spectral distortion is almost constant.

【００３７】[係数量子化回路1306]係数量子化回路1306
は、この様にして得られた、ＬＳＰを無線系で受信側に
伝送するために、適切なビツト数に変換する。[Coefficient quantization circuit 1306] Coefficient quantization circuit 1306
Converts the LSP thus obtained into an appropriate number of bits in order to transmit the LSP to the receiving side in a wireless system.

【００３８】[サブフレーム係数内挿回路1307]サブフレ
ーム係数内挿回路1307は、以上に説明した如くにして計
算された、第２サブフレームと第４サブフレームのLSP
の値を用いて、第１と第３サブフレームのLSPパラメー
タを下式(17)、(18)により、求める。 LSP（フレーム :ｍ；サブフレーム : 1） =0.5＊LSP（フレーム：(ｍ-1)；サフ゛フレーム:4）＋0.5＊ＬＳＰ（フレーム:(ｍ)；サフ゛フレーム:2） (17) LSP（フレーム :ｍ；サブフレーム : 3）=0.5＊LSPフレーム:(ｍ)；サフ゛フレーム:2）｝＋0.5＊ＬＳＰフレーム:(ｍ)；サフ゛フレーム:4）｝ (18) さらに、これらの値を用いて線形予測回路の伝達関数で
あるA’(z)を得る。Aに「 ’ 」を付けたのは、次に述
べる量子化をする前のLSPを用いて求めたA(z)と区別す
るためである。[Subframe coefficient interpolation circuit 1307] The subframe coefficient interpolation circuit 1307 calculates the LSPs of the second and fourth subframes calculated as described above.
Are used to determine the LSP parameters of the first and third subframes by the following equations (17) and (18). LSP (frame: m; subframe: 1) = 0.5 * LSP (frame: (m-1); subframe: 4) + 0.5 * LSP (frame: (m); subframe: 2) (17) LSP (frame: m; subframe: 3) = 0.5 * LSP frame: (m); subframe: 2) {+0.5 * LSP frame: (m); subframe: 4) (18) Using these values, A '(z) which is a transfer function of the linear prediction circuit is obtained. The reason why “′” is added to A is to distinguish it from A (z) obtained by using the LSP before quantization described below.

【００３９】[サブフレーム係数内挿回路（量子化前）1
308A]サブフレーム係数内挿回路（量子化前）1308は無
線系に伝送するための量子化を実施する前のLSPを用い
て、第１と第３サブフレームのLSPパラメータを同様に
して求め、それを用いて、線形予測回路の伝達関数であ
るA(z)を得る。[Subframe coefficient interpolation circuit (before quantization) 1
308A] The subframe coefficient interpolation circuit (before quantization) 1308 similarly obtains the LSP parameters of the first and third subframes using the LSP before performing quantization for transmission to the wireless system, Using it, the transfer function A (z) of the linear prediction circuit is obtained.

【００４０】[インパルス応答計算回路1308B]インパル
ス応答計算回路1308Bは、サブフレーム係数内挿回路
（量子化前）1308Aで得られた、線形予測回路の伝達関
数であるA(z)とサブフレーム係数内挿回路1307で得られ
た量子化後の線形予測回路の伝達関数であるA’(z)とを
用いて、人間の聴覚補正のされた線形予測合成回路のイ
ンパルス応答である、ｈ(ｎ)を求める。ｈ(ｎ)のｚ変換
は下式(19)で表される。 H（ｚ）＝(１/A’(z))・(Ｗ(ｚ))＝{A(ｚ/γ₁)}/{ A’(z)・A(ｚ/γ₂)} (19) 通常、γ₁＝0.9、γ₂＝0.6に選ばれる。H(ｚ)とA’(z)
とは逆数関係にある。A’(z)は入力音声信号から予測可
能な分を差し引き、残差を出力する。残差は予測が適切
に行われるとランダムパルスとなる。これに対してH
(ｚ)は残差すなわちランダムパルスを入力して音声信号
を出力し、その出力に人間の聴覚特性を補正したもので
あることを示している。ｈ(ｎ)を求める詳細方法は省略
するが、ｈ(ｎ)は上記のH(ｚ)の逆ｚ変換で求まる。こ
のｈ(ｎ)は適応コードブックと固定コードブックのサー
チに必要なものである。[Impulse Response Calculation Circuit 1308B] The impulse response calculation circuit 1308B includes A (z), which is a transfer function of the linear prediction circuit, obtained by the subframe coefficient interpolation circuit (before quantization) 1308A and the subframe coefficient Using the transfer function A '(z) of the quantized linear prediction circuit obtained by the interpolation circuit 1307, h (n) which is the impulse response of the linear prediction synthesis circuit corrected for human hearing. ). The z-transform of h (n) is represented by the following equation (19). H (z) = (1 / A ′ (z)) · (W (z)) = {A (z / γ ₁ )} / {A ′ (z) · A (z / γ ₂ )} (19) Usually, γ ₁ = 0.9 and γ ₂ = 0.6 are selected. H (z) and A '(z)
Is in a reciprocal relationship. A ′ (z) subtracts a predictable amount from the input audio signal and outputs a residual. The residual becomes a random pulse when prediction is properly performed. H
(z) indicates that a residual, that is, a random pulse is input, an audio signal is output, and the output is obtained by correcting human auditory characteristics. Although a detailed method of obtaining h (n) is omitted, h (n) is obtained by the above-described inverse z-transform of H (z). This h (n) is necessary for searching the adaptive codebook and the fixed codebook.

【００４１】[４サブフレーム重みずけ音声計算回路130
9]４サブフレーム重みずけ音声計算回路1309は窓掛けを
する前のデジタル音声信号４サブフレーム（すなわち音
声信号１フレーム）分の音声に人間の聴覚特性を考慮し
た伝達関数のフィルターを通す。人間の聴覚特性を考慮
した伝達関数のフィルターは下式(20)で表わされる。Ｗ(ｚ)= {A(ｚ/γ₁)} /{A(ｚ/γ₂)} (20) ここで、γ₁=0.9、γ₂=0.6である。すなわちサブフレー
ム係数内挿回路（量子化前）1308AでA(z)が判明したの
でＷ(ｚ)も求まることになる。[4 Subframe Weighted Speech Calculation Circuit 130
9] The 4-subframe weighted audio calculation circuit 1309 passes the audio of the 4 subframes (that is, one audio signal) of the digital audio signal before windowing through a filter of a transfer function in consideration of human auditory characteristics. The filter of the transfer function considering human auditory characteristics is expressed by the following equation (20). W (z) = {A (z / γ ₁ )} / {A (z / γ ₂ )} (20) Here, γ ₁ = 0.9 and γ ₂ = 0.6. That is, since A (z) is found by the subframe coefficient interpolation circuit (before quantization) 1308A, W (z) is also obtained.

【００４２】[開ループピッチ遅れ発見回路1310A]、[開
ループピッチゲイン発見回路1310B]開ループピッチ遅れ
発見回路1310A、開ループピッチゲイン発見回路1310B
は、10msに１回、すなわち、音声信号の１フレームであ
る20msに２回、ピッチの周期すなわち遅れ時間とゲイン
を推定する。開ループによるピッチ遅れとゲイン発見の
方法は、以下の関係式によって表される。 Rlag(open) = Σ_n=0 ⁿ⁼⁷⁹sw(n)sw(n-k) ;n=0,-,-,79 (21) この値は、次の３つの範囲に分けて計算される。（第３範囲；ｉ=３）： k= 18,-,-,-,35 (22) （第２範囲；ｉ=２）： k= 36,-,-,-,71 (23) （第１範囲；ｉ=１）： k= 72,-,-,-,143 (24)[Open loop pitch delay finding circuit 1310A], [Open loop pitch gain finding circuit 1310B] Open loop pitch delay finding circuit 1310A, open loop pitch gain finding circuit 1310B
Estimates the pitch cycle, ie, the delay time and the gain, once every 10 ms, that is, twice every 20 ms, which is one frame of the audio signal. The method of finding pitch delay and gain by open loop is represented by the following relational expression. Rlag (open) = Σ _{n = 0} ^{n = 79} sw (n) sw (nk); n = 0, −, −, 79 (21) This value is calculated in the following three ranges. (3rd range; i = 3): k = 18,-,-,-, 35 (22) (2nd range; i = 2): k = 36,-,-,-, 71 (23) (23rd) 1 range; i = 1): k = 72,-,-,-, 143 (24)

【００４３】その結果、Rlag(open)を最大にする、３つ
の範囲の中での３ヶの値の中からピッチ遅れとピッチゲ
イン最適値がRlag(open)を最大にする値として選ばれ、
遅れ時間は適応コードブック1311A、ピッチゲインはピ
ッチゲイン調整回路1311Bに夫々引き渡される。As a result, a pitch delay and a pitch gain optimum value are selected as values for maximizing Rlag (open) from among three values in three ranges for maximizing Rlag (open),
The delay time is passed to the adaptive codebook 1311A, and the pitch gain is passed to the pitch gain adjustment circuit 1311B.

【００４４】[適応コードブック1311A]、[ピッチゲイン
調整回路1311B] 適応コードブック1311Aは、音声信号のピッチ成分を表
すインパルスのコードブックであり適応的にピッチ遅れ
時間を記憶しパルス列を発生する。ピッチゲイン調整回
路1311Bはピッチゲインを記憶しかつ調整する機能を有
する。まず適応コードブック1311Aとピッチゲイン調整
回路1311Bの初期値は、上記の開ループピッチ遅れ発見
回路1310Aと開ループピッチゲイン発見回路1310Bで求め
られたピッチ遅れ時間とピッチゲインがそれぞれ、適応
コードブック1311Aと、ピッチゲイン調整回路1311Bで使
用される。[Adaptive Codebook 1311A], [Pitch Gain Adjustment Circuit 1311B] The adaptive codebook 1311A is a codebook of impulses representing a pitch component of a voice signal, and adaptively stores a pitch delay time and generates a pulse train. The pitch gain adjustment circuit 1311B has a function of storing and adjusting the pitch gain. First, the initial values of the adaptive codebook 1311A and the pitch gain adjustment circuit 1311B are the pitch delay time and the pitch gain obtained by the open loop pitch delay finding circuit 1310A and the open loop pitch gain finding circuit 1310B, respectively, as the adaptive codebook 1311A. Are used in the pitch gain adjustment circuit 1311B.

【００４５】[インパルス応答回路1311C]インパルス応
答回路1311Cはインパルス応答計算回路1308Bで求めたｈ
(ｎ)と適応コードブック1311A、ピッチゲイン調整回路1
311Bにより合成されたピッチゲインパルスとを畳み込み
積分して重みずけ音声信号に相当する合成信号に変換し
それを誤差計算回路1311Dに入力する。[Impulse response circuit 1311C] The impulse response circuit 1311C obtains h obtained by the impulse response calculation circuit 1308B.
(n), adaptive codebook 1311A, pitch gain adjustment circuit 1
The pitch gain pulse synthesized by 311B is convoluted and integrated to convert it into a synthesized signal corresponding to a weighted audio signal, which is input to an error calculation circuit 1311D.

【００４６】[誤差計算回路1311D]誤差計算回路1311D
は、４サブフレーム重みずけ音声計算回路1309で計算さ
れた、重みずけ音声信号と上記インパルス応答回路1311
Cの出力である重みずけ音声信号に相当する合成信号の
差すなわち誤差を計算し誤差最小化回路1311Eに出力す
る。[Error Calculation Circuit 1311D] Error Calculation Circuit 1311D
Is a weighted voice signal calculated by the 4-subframe weighted voice calculation circuit 1309 and the impulse response circuit 1311
The difference of the synthesized signal corresponding to the weighted audio signal which is the output of C, that is, the error is calculated and output to the error minimizing circuit 1311E.

【００４７】[誤差最小化回路1311E]誤差最小化回路131
1Eは誤差計算回路1311Dで計算された誤差の二乗平均値
を求め、その値を最小化するように適応コードブック13
11Aとピッチゲイン調整回路1311Bを制御する。その詳細
は省略するが、前述の開ループピッチ遅れ発見回路1310
Aと開ループピッチゲイン発見回路1310Bで求められたピ
ッチ遅れ時間とピッチゲインの周辺を精密にサーチす
る。[Error minimizing circuit 1311E] Error minimizing circuit 131
1E finds the mean square value of the error calculated by the error calculation circuit 1311D, and adapts the adaptive code book 13 so as to minimize the value.
11A and the pitch gain adjustment circuit 1311B. Although the details are omitted, the open loop pitch delay finding circuit 1310 described above is used.
A and the periphery of the pitch delay time and pitch gain obtained by the open loop pitch gain finding circuit 1310B are precisely searched.

【００４８】[最適遅れとゲイン発見回路1312]最適遅れ
とゲイン発見回路1312は誤差最小化回路1311Eの制御の
結果として最適な遅れ時間とゲイン値を発見する。[Optimal delay and gain finding circuit 1312] The optimal delay and gain finding circuit 1312 finds the optimum delay time and gain value as a result of the control of the error minimizing circuit 1311E.

【００４９】こうして求められた、適応コードブックの
値とピッチゲインとは夫々記憶され、それを利用して次
の、適応コードブックとピッチゲインの最適値がサーチ
されるので、動作が効率よく行われサーチに必要な遅れ
時間はサブフレーム；40ヶ分(200ｍｓ)以下に押さえら
れている。The value of the adaptive codebook and the pitch gain thus obtained are stored respectively, and the optimal values of the next adaptive codebook and the pitch gain are searched by using them, so that the operation can be performed efficiently. The delay time required for the search is suppressed to 40 subframes (200 ms) or less.

【００５０】[ピッチゲイン量子化回路1313]ピッチゲイ
ン量子化回路1313は、最適遅れとゲイン発見回路1312で
見出された最適ピッチゲインを、無線系を通じて伝送す
るために、適切なビツト数で量子化する。[Pitch Gain Quantizing Circuit 1313] The pitch gain quantizing circuit 1313 quantizes the optimum pitch gain found by the optimum delay and gain finding circuit 1312 with an appropriate number of bits in order to transmit the optimum pitch gain through a radio system. Become

【００５１】[適用コードブック寄与度計算回路1314]適
用コードブック寄与度計算回路1314は最適遅れとゲイン
発見回路1312で求められた値によるインパルス列を用い
て得られる誤差信号を誤差計算回路1311Dの出力として
誤差計算回路1317Cに出力する。[Applied Codebook Contribution Calculation Circuit 1314] The applied codebook contribution calculation circuit 1314 converts the error signal obtained by using the impulse train based on the value obtained by the optimum delay and gain finding circuit 1312 into the error calculation circuit 1311D. It outputs to the error calculation circuit 1317C as an output.

【００５２】[固定コードブック1316]固定コードブック
1316は、５つのトラツクに分けられた全体で10ヶの単パ
ルスの組み合わせで作られているそれらのパルスは、
“＋１”と“−１”の値を取ることが出来る。サブフレ
ームは、サンプル点として、40有るので各トラツクは８
サンプル点であり、各パルスの位置はトラツク毎に３ビ
ツトで表される。その状況は、図３２に示す。[Fixed Code Book 1316] Fixed Code Book
1316, those pulses made of a combination of a total of 10 single pulses divided into 5 tracks,
It can take the values of "+1" and "-1". There are 40 subframes as sample points, so each track is 8
This is a sample point, and the position of each pulse is represented by 3 bits for each track. The situation is shown in FIG.

【００５３】[固定ゲイン調整回路1317A]固定ゲイン調
整回路1317Aは固定コードブック1316が発生した１サブ
フレームににつき８ケのパルスの振幅を調整する回路で
有りその出力をインパルス応答回路1317Bに渡す。[Fixed Gain Adjustment Circuit 1317A] The fixed gain adjustment circuit 1317A is a circuit for adjusting the amplitude of eight pulses per subframe generated by the fixed codebook 1316, and outputs its output to the impulse response circuit 1317B.

【００５４】[インパルス応答回路1317B]インパルス応
答回路1317Bはインパルス応答計算回路1308Bで求めたｈ
(ｎ)と固定コードブック1316、固定ゲイン調整回路1317
Aにより合成された固定コードインパルスとを畳み込み
積分して重みずけ音声信号に相当する合成信号に変換し
それを誤差計算回路1317Cに入力する。[Impulse Response Circuit 1317B] The impulse response circuit 1317B obtains h obtained by the impulse response calculation circuit 1308B.
(n), fixed codebook 1316, fixed gain adjustment circuit 1317
The fixed code impulse synthesized by A is convoluted and integrated to convert it into a synthesized signal corresponding to a weighted audio signal, which is input to an error calculation circuit 1317C.

【００５５】[誤差計算回路1317C]誤差計算回路1317C
は、インパルス応答回路1315により、誤差最小化回路13
11Eで最小化された誤差信号を重みずけ音声信号に変換
した信号と、インパルス応答回路1317Bにより、固定コ
ードブック1316、固定ゲイン調整回路1317Aにより合成
された固定コードインパルスを、重みずけ音声信号に相
当する合成信号に変換した信号との差を計算しその値を
誤差最小化回路1317Dに出力する。[Error Calculation Circuit 1317C] Error Calculation Circuit 1317C
Is the error minimizing circuit 13 by the impulse response circuit 1315.
A signal obtained by converting the error signal minimized in 11E into a weighted audio signal, and a fixed code impulse synthesized by a fixed codebook 1316 and a fixed gain adjustment circuit 1317A by an impulse response circuit 1317B, a weighted audio signal Is calculated, and the difference is output to the error minimizing circuit 1317D.

【００５６】[誤差最小化回路1317D]誤差最小化回路131
7Dは誤差計算回路1317Cで計算された誤差の二乗平均値
を求め、その値を最小化するように固定コードブック13
1６と固定ゲイン調整回路131７Aを制御する。その詳細
は省略する。[Error minimizing circuit 1317D] Error minimizing circuit 131
7D calculates the mean square value of the error calculated by the error calculation circuit 1317C, and fixes the fixed code book 13 so as to minimize the value.
16 and the fixed gain adjustment circuit 1317A. The details are omitted.

【００５７】[固定コードブック最適値発見回路1317E]
固定コードブック最適値発見回路1317Eは誤差最小化回
路1317Dが誤差の二乗平均値を最小化するように固定コ
ードブック1316と固定ゲイン調整回路1317Aを制御した
結果として固定コードブックの最適値と固定ゲインの最
適値を発見する。[Fixed codebook optimum value finding circuit 1317E]
The fixed codebook optimum value finding circuit 1317E controls the fixed codebook 1316 and the fixed gain adjustment circuit 1317A so that the error minimizing circuit 1317D minimizes the mean square value of the error. To find the optimal value of.

【００５８】[固定コードブックゲイン量子化回路1318]
固定コードブックゲイン量子化回路1318は固定コードブ
ックのゲインを無線伝送に必要な適切なビツト数で量子
化する。[Fixed codebook gain quantization circuit 1318]
The fixed codebook gain quantization circuit 1318 quantizes the gain of the fixed codebook with an appropriate number of bits necessary for wireless transmission.

【００５９】[励振計算回路1319]励振計算回路1319は今
までに説明した事柄により、作られた適応コードブック
ならびにピッチゲインの最適値と、固定コードブックな
らびに固定ゲインの最適値により、サブフレーム毎の合
成音声の励振パルス信号を下式(25)により作成する。ｕ(n)= g_pv(n)+g_a C (n) ; n=0,1,2,-,-,-,39 (25) ここで、u(n)は励振ベクトルであり、v(n)は適応コード
ブックの励振ベクトルであり、g_pは、そのゲインであ
り、C(n)は固定コードブックの励振ベクトルであり、g_a
は、そのゲインである。[Excitation Calculation Circuit 1319] The excitation calculation circuit 1319 calculates the optimum values of the adaptive codebook and the pitch gain and the optimum values of the fixed codebook and the fixed gain for each sub-frame. The excitation pulse signal of the synthesized speech is created by the following equation (25). u (n) = g _p v (n) + g _a C (n); n = 0,1,2,-,-,-, 39 (25) where, u (n) is an excitation vector, v (n) is the excitation vector of the adaptive codebook, g _p is its gain, C (n) is the excitation vector of the fixed codebook, g _a
Is the gain.

【００６０】[最適フィルタ記憶回路回路1320]最適フィ
ルタ記憶回路回路1320は、図２９の線形予測合成回路の
パラメータやその初期条件を次の動作に備えて記憶して
おく。ＡＣＥＬＰ音声符号化装置により、符号化データ
として、受信側に送られるデータは以下である。 (１)ライン・スペクトル・ペア (２)ピッチの遅れ時間 (３)ピッチのゲイン (４)固定コードブックのアドレス (５)固定コードブックのゲイン[Optimal Filter Storage Circuit 1320] The optimum filter storage circuit 1320 stores the parameters of the linear prediction synthesis circuit of FIG. 29 and its initial conditions in preparation for the next operation. The data transmitted to the receiving side as encoded data by the ACELP audio encoding device is as follows. (1) Line spectrum pair (2) Delay time of pitch (3) Gain of pitch (4) Address of fixed codebook (5) Gain of fixed codebook

【００６１】[0061]

【発明が解決しようとする課題】しかしながら、以上に
説明した雑音抑圧装置は音声から除去される雑音は、過
去の雑音であり、現在通話中の雑音ではないので、通話
中に雑音が増減した場合には対応できていないという問
題点を有していた。However, in the above-described noise suppressor, the noise removed from the speech is past noise, not noise during the current call, so that the noise is reduced during the call. There was a problem that was not able to respond to.

【００６２】また、携帯電話には、音声符号化装置が使
用されており、かなり複雑な、信号処理を行っている
が、さらに雑音抑圧装置を追加することは、ハードウエ
アが、一層、複雑化することとなり、消費電流の増加、
信号処理に伴う遅延時間の増加等の問題点を有してい
た。Further, a mobile phone uses a speech coding apparatus, and performs considerably complicated signal processing. However, adding a noise suppression apparatus further increases the hardware complexity. Increase current consumption,
There were problems such as an increase in delay time due to signal processing.

【００６３】本発明は、このような問題点を解決し、現
在通話中の雑音レベルを検出し、それを除去する、また
はその影響を軽減することを可能とするとともに、音声
符号化装置と雑音抑圧装置を一体化して、ハードウエア
を簡略化し、信号処理時間の増大を防止する適応型雑音
抑圧音声符号化装置を提供することを第１の目的とす
る。The present invention solves such a problem, detects a noise level during a current call, removes the noise level, or reduces the influence of the noise level. It is a first object of the present invention to provide an adaptive noise-suppressed speech coding apparatus that integrates a suppression apparatus, simplifies hardware, and prevents an increase in signal processing time.

【００６４】また、雑音を含む音声から、雑音を除去し
た符号化情報を受信側に送ることにより受信側では雑音
の低減された音声の再生を可能とするだけでなく、音声
符号化装置内部の動作においても雑音が除去されるの
で、雑音のレベルが大になつても音声符号化装置が正常
に動作しなくなることはない。すなわち音声符号化装置
に過大な雑音が加わつても誤動作することの無い適応型
雑音抑圧音声符号化装置を提供することを第２の目的と
する。Further, by transmitting coded information from which noise has been removed to the receiving side, the receiving side can not only reproduce the noise-reduced sound, but also reproduce the noise inside the voice coder. Since noise is also removed during operation, even when the noise level becomes large, the speech encoding apparatus does not malfunction. That is, it is a second object of the present invention to provide an adaptive noise suppression speech coding apparatus which does not malfunction even when excessive noise is added to the speech coding apparatus.

【００６５】[0065]

【課題を解決するための手段】本発明の請求項１に記載
の発明は、周囲雑音の重畳された音声信号をアナログ・
デジタル変換器でデジタル信号に変換する手段、それを
一定長区間のデータとして取り出す手段、取り出した信
号の自己相関関数R(0),R(1),R(2),-,-,R(M)を計算する
手段、取り出した信号が雑音のみの区間か又は雑音の重
畳している音声信号区間かを判定する判定手段、前記判
定手段が雑音のみの区間と判定した場合は、その区間の
自己相関関数R(0),R(1),R(2),-,-,R(M)の値をRn_m(0),Rn
_m(1)、Rn _m(2)、-、-、Rn_m(M)として記憶する雑音自己相
関関数記憶手段、自己相関関数R(0),R(1),R(2),-,-,R
(M)の値に基づき第１の予測手段の定数を決める手段、
第１の予測手段の伝達関数の逆数のインパルス応答を計
算する第１のインパルス応答計算手段、第１の固定コー
ドブック、第１の固定コードゲイン調整手段、前記第１
の固定コードゲイン調整手段を経由した前記第１の固定
コードブックの出力パルス列と前記第１のインパルス応
答計算手段が計算したインパルス応答との畳み込み積分
出力を得る第１の畳み込み積分手段、前記第１の畳み込
み積分手段の出力と雑音との誤差を計算する第１の誤差
計算手段、前記誤差の二乗平均値を計算しその誤差を最
小化するように前記第１の固定ブックの出力コードと前
記第１の固定コードゲイン調整手段のゲインを制御する
第１の制御手段、最小化達成後に前記第１のインパルス
応答の定数と初期状態を記憶する第１の雑音状態記憶手
段を備え、更に、前記判定手段が雑音の重畳している音
声信号区間であると判定した場合は、前記雑音自己相関
関数記憶手段に記憶されたRn_m(0),Rn_m(1)、Rn_m(2)、-、
-、Rn_m(M)の値を読み出しそれらの値と、前記自己相関
関数計算手段が計算したR(0),R(1),R(2),-,-,R(M)のあ
る関数として決まる値Ｋを雑音の時間変動率として計算
する手段、このＫの値を用いて雑音を含まない音声信号
のみの自己相関関数Rs(0),Rs(1),Rs(2),-,-,Rs(M)とし
て、R(0)−KRn_m(0),R(1)−KRn_m(1),R(2)−KRn_m(2),-,-,
R(M)−KRn_m(M)を計算する手段、前記Rs(0),Rs(1),Rs
(2),-,-,Rs(M)を用いて第２の予測手段の定数を決める
手段、第２の予測手段の伝達関数の逆数のインパルス応
答を計算する第２のインパルス応答計算手段、第２の固
定コードブック、第２の固定コードゲイン調整手段、前
記第２の固定コードゲイン調整手段を経由した前記第２
の固定コードブックの出力パルス列と前記第２のインパ
ルス応答との畳み込み積分出力を得る第２の畳み込み積
分手段、前記第２の畳み込み積分手段の出力と雑音の重
畳している音声信号から雑音の除去された信号との誤差
を計算する第２の誤差計算手段、前記誤差の二乗平均値
を計算しその誤差を最小化するように前記第２の固定ブ
ックの出力コードと前記第２の固定コードゲイン調整手
段のゲインを制御する第２の制御手段、最小化を達成
後、前記第２のインパルス応答の定数と初期状態を記憶
する第２の音声信号状態記憶手段を備えるようにして構
成される。Means for Solving the Problems According to the first aspect of the present invention.
Of the present invention converts an audio signal on which ambient noise is superimposed into an analog signal.
A means for converting to a digital signal with a digital converter,
Means to extract as data of fixed length section, extracted signal
Calculate the autocorrelation function R (0), R (1), R (2),-,-, R (M) of the signal
Means, the extracted signal is a section with only noise,
Judging means for judging whether the audio signal section is folded,
If the determination means determines that the section is only noise,
Let the autocorrelation function R (0), R (1), R (2),-,-, R (M) be the value of Rn_m(0), Rn
_m(1), Rn _m(2),-,-, Rn_mNoise self-phase stored as (M)
Function storage means, autocorrelation function R (0), R (1), R (2),-,-, R
Means for determining a constant of the first prediction means based on the value of (M);
Measuring the impulse response of the reciprocal of the transfer function of the first predicting means
First impulse response calculating means for calculating
Book, first fixed code gain adjusting means,
The first fixed via the fixed code gain adjusting means of
A codebook output pulse train and the first impulse response
Convolution with the impulse response calculated by the answer calculation means
First convolution means for obtaining an output, said first convolution means
First error for calculating the error between the output of the integration means and the noise
Calculating means for calculating a mean square value of the error and calculating the error
The output code of the first fixed book and the previous
The gain of the first fixed code gain adjusting means is controlled.
First control means, said first impulse after minimization is achieved
A first noise state storage means for storing a response constant and an initial state
A step, wherein the determination means further includes a sound on which noise is superimposed.
If it is determined that the voice signal section, the noise autocorrelation
Rn stored in function storage means_m(0), Rn_m(1), Rn_m(2),-,
-, Rn_mRead the values of (M) and those values and the autocorrelation
R (0), R (1), R (2),-,-, R (M) calculated by the function calculation means
Calculates the value K determined as a function of
A noise-free audio signal using the value of K
Only the autocorrelation functions Rs (0), Rs (1), Rs (2),-,-, Rs (M)
And R (0) −KRn_m(0), R (1) −KRn_m(1), R (2) −KRn_m(2),-,-,
R (M) −KRn_mMeans for calculating (M), said Rs (0), Rs (1), Rs
(2) Determine the constant of the second prediction means using-,-, Rs (M)
Means, the impulse response of the inverse of the transfer function of the second predictor
Second impulse response calculation means for calculating the answer,
Constant code book, second fixed code gain adjustment means, before
The second fixed code gain adjusting means;
Output pulse train of the fixed codebook and the second impulse.
Second convolution product to obtain convolution integral output with Lus response
The output of the second convolution integrator and the weight of the noise.
Error between the demultiplexed audio signal and the denoised signal
Second error calculating means for calculating the mean square value of the error
And the second fixed block so as to minimize the error.
Output code and the second fixed code gain adjusting means.
Second control means to control stage gain, achieve minimization
After that, the constant and the initial state of the second impulse response are stored.
A second audio signal state storage means for
Is done.

【００６６】以上のように音声符号化装置を構成し、動
作させることにより音声符号化装置の内部動作において
雑音を消去し、信号対雑音比の改善された状態での動作
を実現し、音声符号化装置の雑音による誤動作を防止す
る効果を有する。By constructing and operating the speech encoding apparatus as described above, noise is eliminated in the internal operation of the speech encoding apparatus, and operation with an improved signal-to-noise ratio is realized. This has the effect of preventing a malfunction due to noise of the digitizing device.

【００６７】また、請求項２に記載の発明は、請求項１
記載の音声符号化装置において、前記判定手段が雑音の
重畳している音声区間と判定した場合は、前記第１の誤
差を最小化する制御手段の入力として、前記第２の誤差
計算手段の出力を用いるよう接続し、これにより第１の
誤差計算手段の動作の安定化をはかるものである。The invention described in claim 2 is the same as that in claim 1.
In the above-described speech encoding apparatus, when the decision unit decides that the speech section has noise superimposed thereon, the output of the second error calculation unit is used as an input to the control unit that minimizes the first error. Is used to stabilize the operation of the first error calculating means.

【００６８】また、請求項３に記載の発明は、請求項１
または請求項２記載の適応型雑音抑圧音声符号化装置に
おいて、前記第１の記憶手段に記憶されたRn_m(0),Rn
_m(1),Rn _m(2),-,-,Rn_m(M)の値を読み出し、それらの値
と、前記自己相関関数計算手段が雑音の重畳している音
声信号の値として計算したR(0),R(1),R(2),-,-,R(M)の
ある関数として定まる雑音の時間変動率値Ｋとして、R
(ｋ),Rn_m(ｋ)（k=1,2,-,-,M）は夫々正または負の値を
取り、その正負と更にR(ｋ),Rn_m(ｋ)の大小関係を考慮
して、R(ｋ),Rn_m(ｋ)（k=1,2,-,-,M）を複数の状態に分
類し、各状態で異なるＫの計算式で各状態毎に計算した
Ｋの最小値から、さらに最小のＫの値を選びそれをＫの
値とするようにした。The third aspect of the present invention is the first aspect of the present invention.
Or the adaptive noise-suppressed speech coding apparatus according to claim 2.
The Rn stored in the first storage means_m(0), Rn
_m(1), Rn _m(2),-,-, Rn_mRead the values of (M) and read those values
And the sound on which noise is superimposed by the autocorrelation function calculating means.
R (0), R (1), R (2),-,-, R (M)
As a time variation value K of the noise determined as a certain function, R
(k), Rn_m(k) (k = 1,2,-,-, M) is a positive or negative value, respectively.
And take the positive and negative and R (k), Rn_mConsider the magnitude relation of (k)
And R (k), Rn_m(k) (k = 1,2,-,-, M) divided into multiple states
And calculated for each state using a different K formula for each state
From the minimum value of K, select the minimum value of K
Value.

【００６９】このようにＫを選定することで、現在通話
中の雑音に適応して、雑音を除去することが可能にな
る。By selecting K in this way, it is possible to remove noise in accordance with the noise during the current call.

【００７０】また、請求項４に記載の発明は、請求項３
記載の音声符号化装置において、雑音の時間変動率値Ｋ
として、R(ｋ),Rn_m(ｋ)（k=1,2,-,-,M）は夫々正または
負の値を取り、その正負と更にR(ｋ),Rn_m(ｋ)の大小関
係を考慮して、R(ｋ),Rn_m(ｋ)（k=1,2,-,-,M）を分類す
る場合に、R(ｋ),Rn_m(ｋ)（k=1,2,-,-,M）の正負関係に
より４ヶの状態に分類し、更にR(ｋ),Rn_m(ｋ)（k=1,2,
-,-,M）が同符号の場合には、R(ｋ),Rn_m(ｋ)の大小関係
により２分類し、合計６ヶの状態に分類し、各状態で異
なるＫの計算式で各状態毎に計算したＫの最小値から、
さらに最小のＫの値を選びそれをＫの値とする。The invention described in claim 4 is the same as the invention described in claim 3.
In the speech coding apparatus described above, the time variation rate value K of the noise
R (k), Rn _m (k) (k = 1, 2,-,-, M) take positive or negative values, respectively, and the positive and negative values of R (k), Rn _m (k) taking into account the magnitude _{relation, R (k), Rn m} (k) (k = 1,2, -, -, M) when _{classifying, R (k), Rn m} (k) (k = 1 , 2, -, -, and classified into the four states by positive and negative relationship M), further _{R (k), Rn m (} k) (k = 1,2,
-,-, M) have the same sign, R (k), Rn _m (k) are classified into two categories according to the magnitude relationship, and classified into a total of six states. From the minimum value of K calculated for each state,
Further, the minimum value of K is selected and set as the value of K.

【００７１】このようにＫを選定することで、現在通話
中の雑音に適応して、雑音を除去することが可能にな
る。By selecting K in this manner, it is possible to remove noise in accordance with the noise during the current call.

【００７２】また、請求項５に記載の発明は、請求項４
記載の適応型雑音抑圧音声符号化装置において、R(ｋ),
Rn_m(ｋ)（k=1,2,-,-,M）を６ヶの状態に分類し、各状態
で異なるＫの計算式としては以下の表１に記載されるＫ
を用いるようにした請求項４記載の適応型雑音抑圧音声
符号化装置。The invention described in claim 5 is the same as the invention in claim 4.
In the adaptive noise-suppressed speech coding apparatus as described, R (k),
Rn _m (k) (k = 1,2,-,-, M) is classified into six states, and the calculation formula of K different in each state is K shown in Table 1 below.
5. The adaptive noise-suppressed speech coding apparatus according to claim 4, wherein

【表１】 [Table 1]

【００７３】このようにＫを選定することで、現在通話
中の雑音に適応して、雑音を除去することが可能にな
る。By selecting K in this way, it becomes possible to remove noise in accordance with the noise during the current call.

【００７４】また、請求項６に記載の発明は、請求項１
または請求項２記載の適応型雑音抑圧音声符号化装置に
おいて、前記第１の記憶手段に記憶されたRn_m(0),Rn
_m(1),Rn _m(2),-,-,Rn_m(M)の値を読み出し、それらの値
と、前記自己相関関数計算手段が雑音の重畳している音
声信号の値として計算したR(0),R(1),R(2),-,-,R(M)の
ある関数として定まる雑音の時間変動率値Ｋとしては、
k=1,2,-,-,Mについて、以下に示す式によってＫを計算
しその最小値をＫとして選定するようにしたものであ
る。Ｋ＝{ｂ/ａ}[1−{1−（ca/b²）}^1/2] (58) ここで、a、b、cは以下の値とする。 a＝{R²n_m(0)−R²n_m(k)} (54) b＝{R (0)＊Rn_m(0)−R (k)＊Rn_m(k)} (55) c＝{R² (0)−R² (k)} (56) このようにＫを選定することで、現在通話中の雑音に適
応して、雑音を除去することが可能になる。The invention described in claim 6 is the first invention.
Or the adaptive noise-suppressed speech coding apparatus according to claim 2.
The Rn stored in the first storage means_m(0), Rn
_m(1), Rn _m(2),-,-, Rn_mRead the values of (M) and read those values
And the sound on which noise is superimposed by the autocorrelation function calculating means.
R (0), R (1), R (2),-,-, R (M)
As a time variation value K of noise determined as a certain function,
Calculate K for k = 1,2,-,-, M by the following formula
And the minimum value is selected as K.
You. K = {b / a} [1- {1- (ca / b^Two)}^1/2(58) Here, a, b, and c have the following values. a = {R^Twon_m(0) −R^Twon_m(k)} (54) b = {R (0) * Rn_m(0) −R (k) * Rn_m(k)} (55) c = {R^Two (0) −R^Two (k)} (56) By selecting K in this way, the
Accordingly, noise can be removed.

【００７５】また、請求項７に記載の発明は、請求項１
または請求項２記載の適応型雑音抑圧音声符号化装置に
おいて、音声信号のピッチ周期Ｌにおける自己相関関数
R(L)を計算する手段、その値を記憶する手段を更に備
え、前記雑音自己相関関数記憶手段から読み出されたRn
_m(0),Rn_m(1),Rn_m(2),-,-,Rn_m(M)の値と、前記記憶手段
から読み出されたピッチ周期の自己相関関数R(L)の値
と、前記自己相関関数計算手段が雑音の重畳している音
声信号の自己相関関数として計算したR(0) ,R(1) ,R
(2),-,-,R(M)の値とのある関数として決まる値Ｋを雑音
の時間変動率として定めたものである。The invention described in claim 7 is the first invention.
3. The adaptive noise suppression speech coding apparatus according to claim 2, wherein the autocorrelation function of the speech signal at a pitch period L is provided.
Means for calculating R (L), further comprising means for storing the value, Rn read from the noise autocorrelation function storage means
_m (0), Rn _m (1), Rn _m (2),-,-, the value of Rn _m (M) and the value of the autocorrelation function R (L) of the pitch period read from the storage means R (0), R (1), R calculated as the autocorrelation function of the voice signal on which the noise is superimposed by the autocorrelation function calculation means.
(2) A value K determined as a certain function with the values of-,-, and R (M) is defined as a time variation rate of noise.

【００７６】このようにＫの選定にピッチ周期Ｌを用い
た自己相関関数R(L)による推定を加えることにより、現
在通話中の雑音に適応して、精度良く雑音を除去するこ
とが可能になる。As described above, by adding the estimation based on the autocorrelation function R (L) using the pitch period L to the selection of K, it is possible to adapt to the noise during the current call and accurately remove the noise. Become.

【００７７】また、請求項８に記載の発明は、請求項７
記載の適応型雑音抑圧音声符号化装置において、雑音の
時間変動率Ｋとして、R(ｋ),Rn_m(ｋ)（k=1,2,-,-,M）は
夫々正または負の値を取り、その正負と更にR(ｋ),Rn
_m(ｋ)の大小関係を考慮して、R(ｋ),Rn_m(ｋ)（k=1,2,-,
-,M）を複数の状態に分類し、各状態で異なるＫの計算
式で各状態毎に計算したＫの最小値から、さらに最小の
Ｋの値を選んだ値と、ピッチ周期の自己相関関数R(L)と
のある関数として決まる値のどちらか小の値を雑音の時
間変動率Ｋとしたものである。The invention according to claim 8 is the same as the invention according to claim 7.
In adaptive noise reduced speech coding apparatus according as the time variation rate K of the _{noise, R (k), Rn m} (k) (k = 1,2, -, -, M) is respectively positive or negative value And the sign plus and minus R (k), Rn
Considering the magnitude relation of _m (k), R (k), Rn _m (k) (k = 1, 2,-,
-, M) is classified into a plurality of states, and the autocorrelation of the pitch period and the value obtained by further selecting the minimum K value from the minimum value of K calculated for each state using a different K formula for each state The smaller value of the function R (L) and a value determined as a certain function is the noise time variation rate K.

【００７８】このようにＫの選定にピッチ周期Ｌを用い
た自己相関関数R(L)による推定を加えることにより、現
在通話中の雑音に適応して、精度良く雑音を除去するこ
とが可能になる。As described above, by adding the estimation based on the autocorrelation function R (L) using the pitch period L to the selection of K, it is possible to remove the noise with high accuracy by adapting to the noise during the current call. Become.

【００７９】また、請求項９に記載の発明は、請求項８
記載の適応型雑音抑圧音声符号化装置において、雑音の
時間変動率Ｋとして、R(ｋ),Rn_m(ｋ)（k=1,2,-,-,M）は
夫々正または負の値を取り、その正負と更にR(ｋ),Rn
_m(ｋ)の大小関係を考慮して、R(ｋ),Rn_m(ｋ)（k=1,2,-,
-,M）を複数の状態に分類する場合に、R(ｋ),Rn_m(ｋ)
（k=1,2,-,-,M）の正負関係により４ヶの状態に分類
し、更にR(ｋ),Rn_m(ｋ)（k=1,2,-,-,M）が同符号の場合
には、R(ｋ),Rn_m(ｋ)の大小関係により２分類し、合計
６ヶの状態に分類し、各状態で異なるＫの計算式で各状
態毎に計算したＫの最小値から、さらに最小のＫの値を
選んだ値と、ピッチ周期の自己相関関数R(L)により定ま
る値のどちらか小の値を雑音の時間変動率Ｋとしたもの
である。The invention described in claim 9 is the same as the invention described in claim 8
In adaptive noise reduced speech coding apparatus according as the time variation rate K of the _{noise, R (k), Rn m} (k) (k = 1,2, -, -, M) is respectively positive or negative value And the sign plus and minus R (k), Rn
Considering the magnitude relation of _m (k), R (k), Rn _m (k) (k = 1, 2,-,
-, M) are classified into multiple states, R (k), Rn _m (k)
(K = 1,2, -, - , M) are classified into the four states by positive and negative relationship, further _{R (k), Rn m (} k) (k = 1,2, -, -, M) is In the case of the same code, two classifications are made according to the magnitude relation of R (k) and Rn _m (k), the classification is made into a total of six states, and K calculated for each state by a different K formula for each state. The smaller of the value of K and the value determined by the autocorrelation function R (L) of the pitch period, whichever is smaller, is defined as the time variation rate K of the noise.

【００８０】このようにＫの選定にピッチ周期Ｌを用い
た自己相関関数R(L)による推定を加えることにより、現
在通話中の雑音に適応して、精度良く雑音を除去するこ
とが可能になる。As described above, by adding the estimation based on the autocorrelation function R (L) using the pitch period L to the selection of K, it is possible to remove the noise with high accuracy by adapting to the noise during the current call. Become.

【００８１】また、請求項１０に記載の発明は、請求項
９記載の適応型雑音抑圧音声符号化装置において、雑音
の時間変動率Ｋとして、R(ｋ),Rn_m(ｋ)（k=1,2,-,-,M）
は夫々正または負の値を取り、その正負と更にR(ｋ),Rn
_m(ｋ)の大小関係を考慮して、R(ｋ),Rn_m(ｋ)（k=1,2,-,
-,M）を複数の状態に分類する場合に、６ヶの状態に分
類し、各状態で異なるＫの計算式としては請求項５に示
す表１に記載されたものを用い、またピッチ周期の自己
相関関数R(L)により定まる値としては、以下に示す式に
よってＫ_Lを計算した上でその最小値を求め、Ｋ及びＫ_L
の値のどちらか小の値を雑音の時間変動率Ｋとするよう
にしたものである。Ｋ_L＝ [Ｒ(0)− R(L)] / Rn_m(0) (67)[0081] Further, an invention according to claim 10, in the adaptive noise suppression speech coding apparatus according to claim 9, as a time variation rate K of the _{noise, R (k), Rn m} (k) (k = 1,2,-,-, M)
Takes positive or negative values, respectively, and the positive and negative values and R (k), Rn
Considering the magnitude relation of _m (k), R (k), Rn _m (k) (k = 1, 2,-,
-, M) are classified into a plurality of states, and are classified into six states, and the calculation formula of K different in each state uses a formula described in Table 1 shown in claim 5, and a pitch period. As a value determined by the autocorrelation function R (L) of, the minimum value is calculated after calculating K _L by the following equation, and K and K _L
Is set as the time variation rate K of the noise. _{K L = [R (0)} - R (L)] / Rn m (0) (67)

【００８２】このようにＫの選定にピッチ周期Ｌを用い
た自己相関関数R(L)による推定を加えることにより、現
在通話中の雑音に適応して、精度良く雑音を除去するこ
とが可能になる。As described above, by adding the estimation based on the autocorrelation function R (L) using the pitch period L to the selection of K, it is possible to remove the noise with high accuracy by adapting to the noise during the current call. Become.

【００８３】また、請求項１１の発明は、請求項６また
は請求項７記載の適応型雑音抑圧音声符号化装置におい
て、雑音の時間変動率Ｋの値としては、請求項６に記載
した式より定まるＫの値と、請求項７に記載した、ピッ
チ周期の自己相関関数R(L)のある関数として定まるＫ_L
の値とのいずれか小なる値をＫの値として選定したもの
である。According to an eleventh aspect of the present invention, in the adaptive noise-suppressed speech coding apparatus according to the sixth or seventh aspect, the value of the time variation rate K of the noise is calculated according to the equation described in the sixth aspect. The value of K determined and the value of K _L determined as a function of the autocorrelation function R (L) of the pitch period according to claim 7.
Is selected as the value of K, whichever is smaller.

【００８４】このようにＫの選定にピッチ周期Ｌを用い
た自己相関関数R(L)による推定を加えることにより、現
在通話中の雑音に適応して、精度良く雑音を除去するこ
とが可能になる。As described above, by adding the estimation based on the autocorrelation function R (L) using the pitch period L to the selection of K, it is possible to remove the noise with high accuracy by adapting to the noise during the current call. Become.

【００８５】また、請求項１２の発明は、請求項１１記
載の適応型雑音抑圧音声符号化装置において、ピッチ周
期の自己相関関数R(L)により定まる値としては、請求項
１０に記載した式を用いるようにしたものである。According to a twelfth aspect of the present invention, in the adaptive noise suppression speech coder according to the eleventh aspect, the value determined by the autocorrelation function R (L) of the pitch period is expressed by the following expression. Is used.

【００８６】このようにＫの選定にピッチ周期Ｌを用い
た自己相関関数R(L)による推定を加えることにより、現
在通話中の雑音に適応して、精度良く雑音を除去するこ
とが可能になる。As described above, by adding the estimation based on the autocorrelation function R (L) using the pitch period L to the selection of K, it is possible to remove the noise with high accuracy by adapting to the noise during the current call. Become.

【００８７】また、請求項１３の発明は、請求項１また
は請求項２に記載の適応型雑音抑圧音声符号化装置にお
いて、前記判定手段が雑音の重畳している音声信号区間
であると判定した場合には、一つ前の一定区間長のデー
タで制御された前記第１の固定コードゲイン調整手段の
ゲインの最終値を用いて決まる値を雑音の時間変動率Ｋ
として計算する手段を更に備え、そのＫを用いて雑音を
含まない音声信号のみの自己相関関数、Rs(0),Rs(1),Rs
(2),-,-,Rs(M)をR(0)−KRn_m(0),R(1)−KRn_m(1),R(2)−K
Rn_m(2),-,-,R(M)−KRn_m(M)として定めようにしたもので
ある。According to a thirteenth aspect of the present invention, in the adaptive noise-suppressed speech coding apparatus according to the first or second aspect, the determining means determines that the section is a speech signal section in which noise is superimposed. In this case, the value determined using the final value of the gain of the first fixed code gain adjusting means controlled by the data of the immediately preceding fixed section length is changed to the noise time variation rate K.
, Rs (0), Rs (1), Rs
(2), -, -, Rs (M) and _{R (0) -KRn m (0} ), R (1) -KRn m (1), R (2) -K
_{Rn m (2), -,} -, it is obtained as defined as _{R (M) -KRn m (M} ).

【００８８】以上のように音声符号化装置を構成し、動
作させることにより音声符号化装置の内部動作において
雑音を消去し、信号対雑音比の改善された状態での動作
を実現し、音声符号化装置の雑音による誤動作を防止す
る効果を有する。The speech coding apparatus is constructed and operated as described above, thereby eliminating noise in the internal operation of the speech coding apparatus, realizing an operation with an improved signal-to-noise ratio, and implementing the speech coding apparatus. This has the effect of preventing a malfunction due to noise of the digitizing device.

【００８９】また、請求項１４の発明は、請求項１３記
載の適応型雑音抑圧音声符号化装置において、雑音の時
間変動率Ｋとしては、一つ前の一定区間長のデータで制
御された前記第１の固定コードゲイン調整手段のゲイン
の最終値をg_a’とすると、下式を用いてＫの値を求める
ようにした請求項１３記載の適応型雑音抑圧音声符号化
装置。Ｋ＝{g_a’}² Y /{Rn_m(0)+Σ_k=1 ^k=Ma_k Rｎ_m(k)} (80) ここで、Yは、固定コードブックの全サンプル数に対す
る実際のパルス数の比であり、a_kは線形予測回路の予測
係数である。Further, according to a fourteenth aspect of the present invention, in the adaptive noise suppression speech coder according to the thirteenth aspect, the time variation rate K of the noise is controlled by data of a preceding fixed section length. 14. The adaptive noise-suppressed speech coding apparatus according to claim 13, wherein the value of K is obtained by using the following equation, assuming that the final value of the gain of the first fixed code gain adjusting means is g _a '. K = {g _a '} ² Y / {R _nm (0) + Σ _{k = 1} ^{k = M} _ak R _nm (k)} (80) where Y is the actual number of samples in the fixed codebook. , And a _k is a prediction coefficient of the linear prediction circuit.

【００９０】以上のように音声符号化装置を構成し、動
作させることにより音声符号化装置の内部動作において
雑音を消去し、信号対雑音比の改善された状態での動作
を実現し、音声符号化装置の雑音による誤動作を防止す
る効果を有する。The speech coding apparatus is constructed and operated as described above, thereby eliminating noise in the internal operation of the speech coding apparatus, realizing an operation with an improved signal-to-noise ratio, and implementing the speech coding apparatus. This has the effect of preventing a malfunction due to noise of the digitizing device.

【００９１】[0091]

【発明の実施の形態】以下、本発明の実施の形態につい
て、図１から図２４を用いて説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS Embodiments of the present invention will be described below with reference to FIGS.

【００９２】（第１の実施の形態）本発明の第１の実施
の形態は、請求項１、３、４、５及び６の発明に対応す
るものであり、音声符号化装置としては、ACELP型のも
ので説明するが、音声信号の短期間予測に信号の自己相
関関数を使用する方式のすべての音声符号化装置に適用
できるものである。(First Embodiment) The first embodiment of the present invention corresponds to the first, third, fourth, fifth and sixth aspects of the present invention. The present invention will be described with reference to a type, but the present invention can be applied to all speech coding apparatuses that use a signal autocorrelation function for short-term prediction of a speech signal.

【００９３】図１は、請求項１に記載の発明を説明する
ものであり、第１の実施形態に係る適応型雑音抑圧音声
符号化装置または方法の機能ブロックを示す図である。
図１において図示番号が1301乃至1320までのものは図３
０に示すものと同一であり、すでに説明ずみのため省略
する。図示番号101A,101B,102A〜102Ｆ,103,105,106,11
6,117A〜117E,118〜120までのものが、本発明を実現す
るために追加されたものである。図示番号101Aは音声/
雑音区間判定回路、101Bは音声/雑音切替SW制御回路、1
02A〜102Fは音声/雑音切替SW、103は自己相関関数記憶
回路、105は雑音変化率推定回路、106は自己相関関数推
定回路である。FIG. 1 is a block diagram for explaining the first aspect of the present invention, and is a diagram showing functional blocks of an adaptive noise suppression speech coding apparatus or method according to the first embodiment.
In FIG. 1, the reference numerals 1301 to 1320 correspond to FIG.
This is the same as that shown in FIG. Illustrated numbers 101A, 101B, 102A to 102F, 103, 105, 106, 11
Items 6,117A to 117E and 118 to 120 are added to realize the present invention. Illustration number 101A is voice /
Noise section judgment circuit, 101B is voice / noise switch SW control circuit, 1
02A to 102F are voice / noise switching switches, 103 is an autocorrelation function storage circuit, 105 is a noise change rate estimation circuit, and 106 is an autocorrelation function estimation circuit.

【００９４】図示番号116、117A〜117E、118〜120は、
夫々を1316,1317A〜1317E,1318〜1320と読み替え、既に
説明した図３０における同一番号のものと同じ名称と機
能を有するものであるが、本発明で図３０とは異なる働
きをするために追加されたものである。すなわち図示番
号116は固定コードブック、117Aは固定コードブックゲ
イン調整回路、117Bはインパルス応答回路、117Cは誤差
計算回路、117Dは誤差最小化回路、117Eは固定コードブ
ック最適値発見回路、118は固定コードブックゲイン量
子化回路、119は励振計算回路、120は最適フィルタ記憶
回路である。The illustrated numbers 116, 117A to 117E and 118 to 120
30 are replaced with 1316, 1317A to 1317E and 1318 to 1320, respectively, and have the same names and functions as those of the same numbers in FIG. 30 described above, but have been added in the present invention in order to function differently from FIG. It was done. That is, the illustrated number 116 is a fixed codebook, 117A is a fixed codebook gain adjustment circuit, 117B is an impulse response circuit, 117C is an error calculation circuit, 117D is an error minimization circuit, 117E is a fixed codebook optimal value finding circuit, and 118 is fixed. A codebook gain quantization circuit, 119 is an excitation calculation circuit, and 120 is an optimum filter storage circuit.

【００９５】以上のように構成された第１の実施の形態
に係る適応型雑音抑圧音声符号化装置について、その動
作を以下に説明する。The operation of the adaptive noise suppression speech coder according to the first embodiment configured as described above will be described below.

【００９６】[前処理回路1301]、[窓掛け回路1302]、
[自己相関関数計算回路1303]従来例図３０で既に説明済
みであるが、これらの回路により、マイクロホンから
の、周囲雑音を含む入力信号は、アナログ信号からデジ
タル信号に変換され、窓を掛けられ取り込まれ、自己相
関関数が計算される。[Preprocessing circuit 1301], [Windowing circuit 1302],
[Autocorrelation Function Calculation Circuit 1303] Conventional Example As already described with reference to FIG. 30, the input signal including the ambient noise from the microphone is converted from an analog signal to a digital signal by these circuits and windowed. The autocorrelation function is calculated.

【００９７】[音声/雑音区間判定回路101A]取り込んだ
区間の信号が、雑音のみか又は音声を有する区間かを判
別する回路であるがその方法は公知の各種の方法があ
る。本発明では、自己相関関数計算回路1303で、取り込
んだ信号の自己相関関数が計算されているので、これら
をR(0),R(1),-,-,R(M)とすると、R(0)は取り込んだ信号
の電力を表しているのでこの値を用いて、R(0)がある一
定値を超えている場合は音声信号の存在する期間とし、
一定値を超えない場合は、音声信号のない雑音のみの期
間と判定するものとする。[Speech / Noise Section Judging Circuit 101A] A circuit for discriminating whether a signal of a fetched section is only a noise section or a section having speech, there are various known methods. In the present invention, the autocorrelation function calculation circuit 1303 calculates the autocorrelation function of the acquired signal.If these are R (0), R (1),-,-, R (M), R Since (0) represents the power of the captured signal, using this value, if R (0) exceeds a certain value, the period during which the audio signal is present,
If the value does not exceed a certain value, it is determined that the period is only a noise without a sound signal.

【００９８】[音声/雑音切替SW制御回路101B]、[音声/
雑音切替SW102A]〜[音声/雑音切替SW102Ｆ] 音声/雑音切替SW制御回路101Bは、音声/雑音区間判定回
路101Aにより音声信号が存在する区間か又は雑音のみの
区間かの判定信号に基づき、音声/雑音切替SW102A〜音
声/雑音切替SW102Ｆまでを制御する。本発明では、音声
信号が存在する区間か又は雑音のみの区間かによつて各
手段の動作の役割が異なるので、それを音声/雑音切替S
W102A〜音声/雑音切替SW102Fにより切り替えるのであ
り、その切り替え制御信号を音声/雑音切替SW制御回路1
01Bにより供給する。[Audio / Noise Switching SW Control Circuit 101B], [Audio / Noise Switching SW Control Circuit 101B]
Noise switch SW102A] to [voice / noise switch SW102F] The voice / noise switch SW control circuit 101B performs voice / noise determination based on a determination signal indicating whether a voice signal exists or is only a noise by the voice / noise interval determination circuit 101A. / Noise switch SW102A to voice / noise switch SW102F are controlled. In the present invention, the role of the operation of each means is different depending on whether a section where a voice signal exists or a section where only noise is present.
Switching is performed by W102A to voice / noise switch SW102F, and the switching control signal is transmitted to the voice / noise switch SW control circuit 1.
Supplied by 01B.

【００９９】このように同一の回路を切り替えて使用す
ることで回路規模の増大を防ぐ効果を有する。音声/雑
音切替SW102A〜音声/雑音切替SW102Ｆはコモン接点CとS
接点、V接点を有しており、音声信号が存在する区間に
はコモン接点CはS接点と接続され、雑音のみの区間には
コモン接点CはV接点と接続される。以下の説明は最初に
雑音のみの区間と判定された場合について進める。この
場合の図１、図２の音声/雑音切替SW102A〜音声/雑音切
替SW102Ｆの各コモン接点CがはV接点と接続されるてい
る。雑音のみの区間の動作では使用されない部分を省い
て図３に示す。以下、図３を用いて説明をする。As described above, switching the same circuit for use has an effect of preventing an increase in circuit scale. Voice / noise switch SW102A to voice / noise switch SW102F are common contacts C and S
A common contact C is connected to an S contact in a section where an audio signal is present, and a common contact C is connected to a V contact in a section where only noise is present. The following description will proceed with a case where it is initially determined that the section is only a noise section. In this case, each of the common contacts C of the voice / noise switch SW102A to the voice / noise switch SW102F in FIGS. 1 and 2 is connected to the V contact. FIG. 3 shows a portion that is not used in the operation in the section including only noise. Hereinafter, description will be made with reference to FIG.

【０１００】[音声/雑音切替SW102A]、[自己相関関数記
憶回路103] 従来例図３０で説明した自己相関関数計算回路1303で計
算された雑音区間の自己相関関数R(0),R(1),R(2),-,-,R
(M)を、音声/雑音切替SW102Aを介して記憶装置に、Rn
_m(0),R(1)n_m,R(2)n_m,-,-,Rn_m(M)として記憶する。[Speech / Noise Switching SW102A], [Autocorrelation Function Storage Circuit 103] Conventional Example The autocorrelation functions R (0), R (1) in the noise section calculated by the autocorrelation function calculation circuit 1303 described in FIG. ), R (2),-,-, R
(M) to the storage device via the voice / noise switch SW102A, and Rn
_{m (0), R (1} ) n m, R (2) n m, -, -, it is stored as Rn _{m (M).}

【０１０１】[音声/雑音切替SW102B]、[予測係数算出回
路1304] 従来例図３０で説明した自己相関関数計算回路1303で計
算された雑音の自己相関関数が、音声/雑音切替SW102B
により予測係数算出回路1304に入力される。予測係数算
出回路1304は既に説明した、レビンソン・ダービンア
ルゴリズムを用いて雑音の予測係数を求める。[Speech / Noise Switching SW 102B], [Predictive Coefficient Calculation Circuit 1304] Conventional Example The noise / auto-correlation function calculated by the auto-correlation function calculation circuit 1303 described with reference to FIG.
Is input to the prediction coefficient calculation circuit 1304. The prediction coefficient calculation circuit 1304 obtains a noise prediction coefficient by using the Levinson-Durbin algorithm described above.

【０１０２】[係数変換回路1305]係数変換回路1305は、
このようにして計算された最適な予測係数ａ₁，ａ₂，
-，-，ａ₁₀ を周波数領域のパラメータである、ライン
・スペクトル・ペア（ＬＳＰ）と呼ばれる係数に変換す
る。[Coefficient conversion circuit 1305] The coefficient conversion circuit 1305
The optimal prediction coefficients a ₁ , a ₂ ,
-, -, it converts the a ₁₀ is a frequency domain parameter, the coefficient called a line spectrum pair (LSP).

【０１０３】[サブフレーム係数内挿回路（量子化前）1
308A]サブフレーム係数内挿回路（量子化前）1308Aは量
子化前の第２と第４サブフレームのLSPを用いて第１と
第３サブフレームのLSPを内挿して求め、それらを用い
て各サブフレームの線形予測回路の伝達関数であるA(z)
を得る。[Subframe coefficient interpolation circuit (before quantization) 1
308A] The subframe coefficient interpolation circuit (before quantization) 1308A obtains the LSPs of the first and third subframes by using the LSPs of the second and fourth subframes before quantization, and uses them. A (z), which is the transfer function of the linear prediction circuit for each subframe
Get.

【０１０４】[４サブフレーム重みずけ音声計算回路130
9]４サブフレーム重みずけ音声計算回路1309は窓掛けを
する前の雑音４サブフレーム（すなわち１フレーム）分
の雑音に人間の聴覚特性を考慮した伝達関数のフィルタ
ーを通す。人間の聴覚特性を考慮した伝達関数のフィル
ターは既述した式(20)で表わされる。Ｗ(ｚ)= {A(ｚ/γ₁)} /{A(ｚ/γ₂)} (20) ここで、γ₁=0.9、γ₂=0.6である。すなわちサブフレー
ム係数内挿回路（量子化前）1308AでA(z)が判明したの
でＷ(ｚ)も求まる。[4 Subframe Weighted Speech Calculation Circuit 130
9] The four-subframe weighted speech calculation circuit 1309 passes a noise of four subframes (that is, one frame) before windowing through a filter of a transfer function in consideration of human auditory characteristics. The filter of the transfer function in consideration of the human auditory characteristics is represented by the above-described equation (20). W (z) = {A (z / γ ₁ )} / {A (z / γ ₂ )} (20) Here, γ ₁ = 0.9 and γ ₂ = 0.6. That is, since A (z) has been found by the subframe coefficient interpolation circuit (before quantization) 1308A, W (z) is also obtained.

【０１０５】[係数量子化回路1306]係数量子化回路1306
は、このようにして得られた、ＬＳＰを適切なビツト数
に変換する。[Coefficient quantization circuit 1306] Coefficient quantization circuit 1306
Converts the LSP thus obtained into an appropriate number of bits.

【０１０６】[サブフレーム係数内挿回路1307]サブフレ
ーム係数内挿回路1307は、以上に説明した如くにして計
算された、第２サブフレームと第４サブフレームのLSP
の値を用いて、第１と第３サブフレームのLSPパラメー
タを求める。さらに、これらの値を用いて線形予測回路
の伝達関数である)A’(z)を得る。サブフレーム係数内
挿回路（量子化前）1308Aで作られたフレームの線形予
測回路の伝達関数であるA(z)と区別するためA’(z)とし
ている。[Subframe coefficient interpolation circuit 1307] The subframe coefficient interpolation circuit 1307 calculates the LSP of the second subframe and the fourth subframe calculated as described above.
, LSP parameters of the first and third subframes are obtained. Further, using these values, a transfer function) A ′ (z) of the linear prediction circuit is obtained. A '(z) is used to distinguish it from A (z) which is a transfer function of a linear prediction circuit for a frame made by a subframe coefficient interpolation circuit (before quantization) 1308A.

【０１０７】[インパルス応答計算回路1308B]、[音声/
雑音切替SW102D]、[音声/雑音切替SW102F] インパルス応答計算回路1308Bは、サブフレーム係数内
挿回路（量子化前）1308Aで得られた、線形予測合成回
路の伝達関数であるA(z)とサブフレーム係数内挿回路13
07で得られたA’(z)とを用いて、人間の聴覚補正のなさ
れた線形予測合成回路のインパルス応答であるｈ(ｎ)を
求め,音声/雑音切替SW102D、SW102Fを経由して、ｈ(ｎ)
のｚ変換は既述した式(19)で表される。 H(ｚ)＝(１/A’(z))・(Ｗ(ｚ))＝{A(ｚ/γ₁)} /{ A’(z)・A(ｚ/γ₂)} (19) H(ｚ)とA’(z)とは逆数関係にある。A’(z)は入力音声
信号から予測可能な分を差し引き、残差を出力する。残
差は予測が適切に行われるとランダムパルスとなる。こ
れに対してH(ｚ)は残差すなわちランダムパルスを入力
して音声信号を出力し、その出力に人間の聴覚特性を補
正したものであることを示している。ｈ(ｎ)を求める詳
細方法は省略するが、ｈ(ｎ)は上記のH(ｚ)の逆ｚ変換
で求まる。[Impulse response calculation circuit 1308B], [voice /
Noise switch SW102D], [Speech / noise switch SW102F] The impulse response calculation circuit 1308B includes the transfer function A (z) of the linear prediction synthesis circuit obtained by the subframe coefficient interpolation circuit (before quantization) 1308A. Subframe coefficient interpolation circuit 13
Using A ′ (z) obtained in 07, h (n), which is an impulse response of the linear prediction synthesis circuit subjected to human auditory correction, is obtained, and via voice / noise switching SW102D and SW102F, h (n)
Is represented by equation (19) described above. H (z) = (1 / A ′ (z)) · (W (z)) = {A (z / γ ₁ )} / {A ′ (z) · A (z / γ ₂ )} (19) H (z) and A ′ (z) have an inverse relationship. A ′ (z) subtracts a predictable amount from the input audio signal and outputs a residual. The residual becomes a random pulse when prediction is properly performed. On the other hand, H (z) indicates that a residual, that is, a random pulse is input, an audio signal is output, and the output is obtained by correcting human auditory characteristics. Although a detailed method of obtaining h (n) is omitted, h (n) is obtained by the above-described inverse z-transform of H (z).

【０１０８】[音声/雑音切替SW102C]、[音声/雑音切替S
W102E]、[固定コードブック116]固定コードブック116
は、５つのトラツクに分けられた全体で10ヶの単パルス
の組み合わせで作られている。それらのパルスは、“＋
１”と“−１”の値を取ることが出来る。固定コードブ
ック116は雑音に相当するパルス列を発生する。[Voice / Noise Switching SW102C], [Voice / Noise Switching S
W102E], [Fixed codebook 116] Fixed codebook 116
Is composed of a total of 10 single pulses divided into 5 tracks. The pulses are "+
It can take the values of "1" and "-1". The fixed codebook 116 generates a pulse train corresponding to noise.

【０１０９】[固定ゲイン調整回路117A]固定ゲイン調整
回路117Aは固定コードブック1316が発生した１サブフレ
ームににつき10ケのパルスの振幅を調整する回路であ
り、その出力をインパルス応答回路117Bに渡す。[Fixed Gain Adjustment Circuit 117A] The fixed gain adjustment circuit 117A is a circuit for adjusting the amplitude of 10 pulses per subframe generated by the fixed codebook 1316, and passes its output to the impulse response circuit 117B. .

【０１１０】[インパルス応答回路117B]インパルス応答
回路117Bはインパルス応答計算回路1308Bで求めたｈ
(ｎ)と固定ゲイン調整回路1317Aを経由した固定コード
ブック1316の出力パルス列を畳み込み積分して重みずけ
音声信号に相当する合成信号に変換しそれを誤差計算回
路117Cに入力する。[Impulse Response Circuit 117B] The impulse response circuit 117B obtains h obtained by the impulse response calculation circuit 1308B.
The output pulse train of (n) and the fixed codebook 1316 via the fixed gain adjustment circuit 1317A is convoluted and integrated to be converted into a synthesized signal corresponding to a weighted audio signal, and is input to the error calculation circuit 117C.

【０１１１】[誤差計算回路117C]インパルス応答回路11
7Bの出力は誤差計算回路117Cに入力され、誤差計算回路
117C のもう一方には4サブフレーム重みずけ音声計算回
路1309により重みずけされた雑音がも、音声/雑音切替S
W102C、SW102Eを経由して入力される。両者の差が計算
されその値が、誤差最小化回路117Dに入力される。[Error Calculation Circuit 117C] Impulse Response Circuit 11
The output of 7B is input to the error calculation circuit 117C, and the error calculation circuit
On the other side of 117C, the noise weighted by the 4-subframe weighted voice calculation circuit 1309 is also the voice / noise switch S.
Input via W102C and SW102E. The difference between the two is calculated, and the value is input to the error minimizing circuit 117D.

【０１１２】[誤差最小化回路117D]誤差最小化回路117D
は誤差計算回路117Cで計算された誤差の二乗平均値を求
め、その値を最小化するように固定コードブック116と
固定ゲイン調整回路117Aを制御する。[Error minimizing circuit 117D] Error minimizing circuit 117D
Calculates the root-mean-square value of the error calculated by the error calculation circuit 117C, and controls the fixed codebook 116 and the fixed gain adjustment circuit 117A to minimize the value.

【０１１３】[固定コードブック最適値発見回路117E]固
定コードブック最適値発見回路117Eは、誤差最小化回路
117Dが誤差の二乗平均値を最小化するように固定コード
ブック116と固定ゲイン調整回路117Aを制御した結果と
して固定コードブックの最適値と固定ゲインの最適値を
発見する。[Fixed Codebook Optimal Value Finding Circuit 117E] The fixed codebook optimal value finding circuit 117E is an error minimizing circuit.
As a result of controlling the fixed codebook 116 and the fixed gain adjustment circuit 117A so that 117D minimizes the mean square value of the error, the optimum value of the fixed codebook and the optimum value of the fixed gain are found.

【０１１４】[固定コードブックゲイン量子化回路118]
固定コードブックゲイン量子化回路118は固定コードブ
ックのゲインを適切なビツト数で量子化する。[Fixed Codebook Gain Quantization Circuit 118]
The fixed codebook gain quantization circuit 118 quantizes the gain of the fixed codebook with an appropriate number of bits.

【０１１５】[励振計算回路119]励振計算回路119は今ま
でに説明した事柄により作られた固定コードブックなら
びに固定ゲインの最適値により、サブフレーム毎の雑音
の励振パルス信号を下式(26)により作成する。ｕ(n)=g_a C (n) ; n=0,1,2,-,-,-,39 (26) ここで、u(n)は励振ベクトルであり、C(n)は固定コード
ブックの励振ベクトルであり、g_aは、そのゲインであ
る。[Excitation Calculation Circuit 119] The excitation calculation circuit 119 converts the noise excitation pulse signal for each sub-frame into the following equation (26) by using the fixed codebook and the optimum value of the fixed gain created by the above-described matters. Create by u (n) = g _a C (n); n = 0,1,2,-,-,-, 39 (26) where u (n) is an excitation vector and C (n) is a fixed code The book's excitation vector, g _a is its gain.

【０１１６】[最適フィルタ記憶回路120]最適フィルタ
記憶回路120はインパルス応答計算回路1308Bで得られ
た、線形予測合成回路のインパルス応答であるｈ(ｎ）
とその初期条件を次の動作に備えて記憶しておく。[Optimal filter storage circuit 120] The optimal filter storage circuit 120 is h (n) which is the impulse response of the linear prediction synthesis circuit obtained by the impulse response calculation circuit 1308B.
And its initial conditions are stored in preparation for the next operation.

【０１１７】以上で雑音のみの区間での本発明の装置の
動作説明を完了し、次に音声信号が存在する区間の動作
を説明する。この場合は音声/雑音切替SW102A〜SW102Ｆ
はコモン接点CとS接点が接続される。この場合の状態を
図４、図５に示す。以下図４、図５を用いて音声信号が
存在する区間の動作を説明する。The operation of the apparatus of the present invention in the section where only the noise is present has been described above, and the operation in the section where the voice signal is present will be described. In this case, voice / noise switch SW102A ~ SW102F
Is connected to the common contact C and S contact. The state in this case is shown in FIGS. Hereinafter, an operation in a section in which an audio signal exists will be described with reference to FIGS.

【０１１８】[自己相関関数計算回路1303]従来例図３０
で説明したように自己相関関数計算回路1303で計算され
た音声信号が存在する区間の自己相関関数は、雑音変化
率推定回路105、そして自己相関関数推定回路106へ入力
される。雑音のみの区間でも、自己相関関数計算回路13
03で計算された自己相関関数は、雑音変化率推定回路10
5、そして自己相関関数推定回路106へ入力されている
が、雑音のみの区間では雑音変化率推定回路105、そし
て自己相関関数推定回路106は不動作状態である。[Autocorrelation Function Calculation Circuit 1303] Conventional Example FIG.
As described above, the autocorrelation function in the section where the speech signal exists calculated by the autocorrelation function calculation circuit 1303 is input to the noise change rate estimation circuit 105 and the autocorrelation function estimation circuit 106. Autocorrelation function calculation circuit 13
The autocorrelation function calculated in 03 is the noise change rate estimation circuit 10
5, and input to the autocorrelation function estimating circuit 106, but in the section of only noise, the noise change rate estimating circuit 105 and the autocorrelation function estimating circuit 106 are inactive.

【０１１９】[音声/雑音切替SW102A]、[自己相関関数記
憶回路103] 雑音区間に記憶された雑音の自己相関関数Rn_m (0),R(1)
n_m,R(2)n_m,-,-,Rn_m (M)が読み出されて雑音変化率推定
回路105、そして自己相関関数推定回路106へ入力され
る。[Speech / Noise Switching SW102A], [Autocorrelation Function Storage Circuit 103] The autocorrelation function Rn _m (0), R (1) of the noise stored in the noise section
_{n m, R (2) n} m, -, -, noise change ratio estimating circuit 105 Rn _m (M) are read out, and are input to the autocorrelation function estimation circuit 106.

【０１２０】[雑音変化率推定回路105]前述したように
自己相関関数計算回路1303から雑音を含む音声信号の自
己相関関数R(0),R(1),R(2),-,-,R(M)が入力され、自己
相関関数記憶回路103からは、雑音区間の自己相関関数R
n_m (0),R(1)n_m,R(2)n_m,-,-,Rn_m (M)が入力される。[Noise change rate estimation circuit 105] As described above, the autocorrelation function R (0), R (1), R (2),-,-, R (M) is input, and the autocorrelation function R
n _m (0), R (1) _nm , R (2) _nm ,-,-, R _nm (M) are input.

【０１２１】音声信号に雑音が重畳した信号区間の音声
信号のみ、および雑音のみの自己相関関数、Ｒｓ(0)、
Ｒｎ(0)、Ｒｓ(ｋ)、Ｒｎ(ｋ) ；ｋ=1 ,2,-,-,-,M は
未知であるが次式(27)、(28)、(29)が成立する。 R(0)=Ｒｓ(0) +Ｒｎ(0) ；（但し、Ｒｓ(0)、Ｒｎ(0)は正） (27) Ｒｓ(0) ＞｜Ｒｓ(ｋ)｜；ｋ=1,2,-,-,-,M (28) Ｒｎ(0) ＞｜Ｒｎ(ｋ)｜；ｋ=1,2,-,-,-,M (29)The autocorrelation function of only the speech signal in the signal section in which noise is superimposed on the speech signal and the noise only, Rs (0),
Rn (0), Rs (k), Rn (k); k = 1, 2,-,-,-, M are unknown, but the following equations (27), (28) and (29) hold. R (0) = Rs (0) + Rn (0); (however, Rs (0) and Rn (0) are positive) (27) Rs (0)> | Rs (k) |; k = 1,2 ,-,-,-, M (28) Rn (0)> | Rn (k) |; k = 1,2,-,-,-, M (29)

【０１２２】この性質を利用して、Rs(0)を推定するこ
とが出来る。以下に数ヶの例を示す。＜例１＞Ｒｓ(0)=function｛｜R(1)｜,｜R(2)｜,-,-,-,｜R(M)｜｝ (30) function｛｝は、｛｝の中の値のある関数として
定まることを意味する。又｜R(1)｜は絶対値をとること
を意味する。具体的には以下の方法が考えられる。＜例１−ａ＞Ｒｓ(0)= Max｛｜R(1)｜,｜R(2)｜,-,-,-,｜R(M)｜｝ (31) 従つて、Ｒｓ(0)が推定されるとRn(0)は下式(32)により
推定できる。Ｒｎ(0)=Ｒ(0)−Ｒｓ(0) (32) そして雑音変化率は下式(33)で表される。 K=Ｒｎ(0) ／Rn_m (0) (33)Using this property, Rs (0) can be estimated. Below are some examples. <Example 1> Rs (0) = function ｛| R (1) |, | R (2) |,-,-,-, | R (M) |｝ (30) function ｛｝ is in {｛ Is defined as a function with the value of | R (1) | means taking an absolute value. Specifically, the following method is conceivable. <Example 1-a> Rs (0) = Max ｛| R (1) |, | R (2) |,-,-,-, | R (M) |｝ (31) Therefore, Rs (0) Is estimated, Rn (0) can be estimated by the following equation (32). Rn (0) = R (0) -Rs (0) (32) The noise change rate is expressed by the following equation (33). K = Rn (0) / Rn _m (0) (33)

【０１２３】音声信号に含まれる雑音の統計的性質は音
声信号が存在しない区間の時と同じと考えられるとす
る。そうすると下式(34)が成立する。 Rn(k) =Ｋ＊Ｒn_m (k) ；k＝1,2-,-,-,M (34)It is assumed that the statistical properties of the noise included in the audio signal are considered to be the same as in the section where no audio signal exists. Then, the following equation (34) is established. Rn (k) = K * Rn m (k); k = 1,2 -, -, -, M (34)

【０１２４】Ｋの値は慎重に推定しないと雑音を効果的
に除去出来ないだけでなく、回路が発振するなど不安定
になることがある。より精密な推定として以下の方法が
考えられる。＜例２＞ Rs(0)= function{R(1), R(2),-,-, R(M),Rn_m(0), R(1)n_m,-,-,Rn_m (M)} (35) ＜例２−ａ＞具体例としてＫの値は下式(36)とする。Ｋ＝Ｍin(K₁,K₂,K_3a,K_3b,Ｋ_4a,Ｋ_4b) (36) Ｍinは最小値をとることを意味し、K₁,K₂,K_3a,K_3b,
Ｋ_4a,Ｋ_4bは以下の方法で決められる。＜K₁,K₂,K_3a,K_3b,Ｋ_4a,Ｋ_4bの決め方＞Ｒ（k）、Ｒn
_m(k)； k=1,2，-，-，Mは既知であり、それぞれ正また
は負の値をとるので、正と負、負と正、正と正、負と負
の４通りの組み合わせがある。更に正と正、負と負の場
合について、|Ｒ(k)|＞|Ｒn_m(k)|と、|Ｒ(k)|＜｜Ｒn
_m(k)|を区別すると全部で以下の６通りになる。 (１) Ｒ(k)は正, Ｒn_m(k)は負；k=1,2,-,-,M (37) (２) Ｒ(k)は負, Ｒn_m(k)は正；k=1,2,-,-,M (38) (３a)Ｒ(k)は正, Ｒn_m(k)は正：Ｒ(k)＞Ｒn_m(k)；k=1,2,-,-,M (39) (３b)Ｒ(k)は正, Ｒn_m(k)は正：Ｒ(k)＜Ｒn_m(k)；k=1,2,-,-,M (40) (４a)Ｒ(k)は負, Ｒn_m(k)は負：|Ｒ(k)|＞|Ｒn_m(k)|；k=1,2,-,-,M (41) (４b)Ｒ(k)は負, Ｒn_m(k)は負：|Ｒ(k)|＜|Ｒn_m(k)|；k=1,2,-,-,M (42)If the value of K is not carefully estimated, not only can noise not be effectively removed, but also the circuit may become unstable due to oscillation. The following method is considered as a more accurate estimation. <Example 2> Rs (0) = function {R (1), R (2), -, -, R (M), Rn m (0), R (1) n m, -, -, Rn m ( M)} (35) <Example 2-a> As a specific example, the value of K is represented by the following equation (36). K = Min (K ₁ , K ₂ , K _3a , K _3b , K _4a , K _4b ) (36) Min means taking the minimum value, and K ₁ , K ₂ , K _3a , K _3b ,
K _4a and K _4b are determined by the following method. _{_{<K 1, K 2, K}} 3a, K 3b, K 4a, how to determine the _{K 4b> R (k),} Rn
_m (k); k = 1, 2,-,-, and M are known and take positive or negative values, respectively. Therefore, there are four types of positive and negative, negative and positive, positive and positive, and negative and negative. There are combinations. For a more positive and positive, if negative and negative, | R (k) |> | Rn m (k) | with, | R (k) | < | Rn
_m (k) | is divided into the following six types in total. (1) R (k) is positive, Rn _m (k) is negative; k = 1,2, -, - , M (37) (2) R (k) is negative, Rn _m (k) is positive; k = 1,2, -, -, M (38) (3a) R (k) is positive, Rn _m (k) is _{positive: R (k)> Rn m} (k); k = 1,2, - , -, M (39) ( 3b) R (k) is positive, Rn _m (k) is _{positive: R (k) <Rn m} (k); k = 1,2, -, -, M (40) (4a) R (k) is negative, Rn _m (k) is negative: | R (k) |> | Rn m (k) |; k = 1,2, -, -, M (41) (4b) R (k) is negative, Rn _m (k) is negative: | R (k) | < | Rn m (k) |; k = 1,2, -, -, M (42)

【０１２５】まず、(１)Ｒ(k)は正,Ｒn_m(k)は負；k=1,
2,-,-,Mの場合について考える。未知のＲs(0）、Ｒs(k)
はK₁が正しく選ばれたとすると以下の式(43)で表され
る。Ｒ(ｋ) =Ｒｓ(ｋ)−K₁｜Ｒn_m(k)｜ (43) Ｒ(0) =Ｒｓ(0)＋K₁Ｒn_m(0) (44) 式(28)、式(29)の関係を利用すると以下の式(45)が成立
する。Ｒ (0)−K₁Ｒn_m(0)＞ = R(ｋ)＋K₁｜Ｒn_m(k)｜ (45)First, (1) R (k) is positive, R _nm (k) is negative; k = 1,
Consider the case of 2,-,-, M. Unknown Rs (0), Rs (k)
Is given by the following equation (43) if K ₁ is correctly selected. R (k) = Rs (k) −K ₁ | R _nm (k) | (43) R (0) = Rs (0) + K ₁ R _nm (0) (44) Equations (28) and (29) By using the relationship, the following equation (45) is established. _{_{R (0) -K 1 Rn m}} (0)> = R (k) + K 1 | Rn m (k) | (45)

【０１２６】なお上記において、＞ =は大または等しい
ことを表す。K₁の最大値として等号が成立するとすると
下式（46）が成立する。Ｒ (0)−K₁Ｒn_m(0) = R(ｋ)＋K₁｜Ｒn_m(k)｜ (46)In the above,> = means greater or equal. The following equation (46) is satisfied when the equality holds the maximum value of K _1. _{_{R (0) -K 1 Rn m}} (0) = R (k) + K 1 | Rn m (k) | (46)

【０１２７】従つてK₁が下式(47)により求まる K₁={Ｒ(0)−R(ｋ)}/{Ｒn_m(0)＋｜Ｒn_m(k)｜} (47) 実際には、(１)Ｒ(k)は正,Ｒn_m(k)は負；k=1,2,-,-,M
を満たす、すべてのｋについて計算しその最小の値をK₁
とする。[0127] accordance connexion K ₁ is obtained by the following equation _{(47) K 1 = {R} (0) -R (k)} / {Rn m (0) + | Rn m (k) |} (47) actually Is (1) R (k) is positive, R _nm (k) is negative; k = 1,2,-,-, M
Is calculated for all k that satisfies and the minimum value is K ₁
And

【０１２８】同様の計算を上記の他の５通りについて実
施すると表(１)を得る。When the same calculation is performed for the other five cases, Table (1) is obtained.

【表２】＜例２−ｂ＞もう一つの具体例として以下の例を説明す
る。 R(0)=Ｒｓ(0) + ＫRn_m (0) (48) R(k)=Ｒｓ(k) + ＫRn_m (k) ｋ=1,2,-,-,-,M (49) Ｒｓ(0) ＞＝｜Ｒｓ(ｋ)｜ｋ=1,2,-,-,-,M (50) R(0)−ＫRn_m (0) ＞＝｜R(k)−ＫRn_m (k)｜ (51) 式(51)の両辺を二乗すると下式を得る。 {R(0)−ＫRn_m (0)}² ＞＝{R(k)−ＫRn_m (k)}² (52) 等号が成立すとして整理すると下式を得る。ａＫ²−２ｂＫ＋ｃ＝０ (53) ここでa、b、cは以下の値とする。 a＝{Ｒ²ｎ_m(0)−R²n_m(k)} (54) b＝{R (0)＊Ｒｎ_m(0)−R (k)＊Rn_m(k)} (55) c＝{Ｒ² (0)−R² (k)} (56)[Table 2] <Example 2-b> The following example is described as another specific example. R (0) = Rs (0 ) + KRn m (0) (48) R (k) = Rs (k) + KRn m (k) k = 1,2, -, -, -, M (49) Rs (0)> = | Rs (k) | k = 1,2,-,-,-, M (50) R (0) −KRn _m (0)> = | R (k) −KRn _m (k) (51) By squaring both sides of equation (51), the following equation is obtained. {R (0) −KR _nm (0)} ² > = {R (k) −KR _nm (k)} ² (52) By rearranging assuming that the equality holds, the following equation is obtained. aK ² -2bK + c = 0 (53) Here, a, b, and c have the following values. a = {R ² _nm (0) −R ² _nm (k)} (54) b = {R (0) * R _nm (0) −R (k) * R _nm (k)} (55) c = {R ² (0) −R ² (k)} (56)

【０１２９】上式(54)〜(56)を用いてＫを求めると下式
(57)、(58)を得る。Ｋ₁＝{ｂ/ａ}[1＋{1−（ca/b²）}^1/2] (57) Ｋ＝{ｂ/ａ}[1−{1−（ca/b²）}^1/2] (58) Ｋの値として、Ｋ₁より小である(58)を選定する。When K is obtained using the above equations (54) to (56), the following equation is obtained.
(57) and (58) are obtained. K ₁ = {b / a} [1+ {1− (ca / b ² )} ^1/2 ] (57) K = {b / a} [1− {1− (ca / b ² )} ^1/2 ] (58) as the value of K, selecting a is smaller than K ₁ (58).

【０１３０】なお、以上の如き各種の方法を用いて求め
たＫの値に経験的に求められる安全係数（１より小）を
かけてＫとする方法も本発明に含まれる。以上の如き方
法で求められたKの値は、自己相関関数推定回路106に渡
される。The present invention also includes a method of multiplying the value of K obtained by using any of the various methods described above by an empirically obtained safety coefficient (smaller than 1) to obtain K. The value of K obtained by the above method is passed to the autocorrelation function estimation circuit 106.

【０１３１】[自己相関関数推定回路106]自己相関関数
推定回路106は、雑音変化率推定回路105から渡されたＫ
の値を用いて、現在の音声信号に混在している雑音の自
己相関関数を以下により推定する。 Rｎ(j)=K＊ Rｎ_m (j) ; j=1,2,-,-,M (59)[Autocorrelation function estimating circuit 106] The autocorrelation function estimating circuit 106 receives the K
Is used to estimate the autocorrelation function of the noise mixed in the current speech signal as follows. Rn (j) = K * Rn m (j); j = 1,2, -, -, M (59)

【０１３２】次に、音声信号のみの自己相関関数、Rs
(0), Rs(1),Rs(2),-,-,Rs(M)を下式(60)により計算す
る。 Rｓ(ｊ)=R(ｊ)−K＊ Rｎ_m (j) ; j=1,2,-,-,M (60) 以上により得られたＲｓ(0)、Rｓ(ｊ)の値を、音声/雑
音切替SW102Bを経由して予測係数算出回路1304に渡す。Next, the autocorrelation function of only the audio signal, Rs
(0), Rs (1), Rs (2),-,-, Rs (M) are calculated by the following equation (60). Rs (j) = R (j ) -K * Rn m (j); j = 1,2, -, -, Rs obtained by M (60) or (0), the value of Rs (j), The signal is passed to the prediction coefficient calculation circuit 1304 via the voice / noise switch SW102B.

【０１３３】[音声/雑音切替SW102B]、[予測係数算出回
路1304] 予測係数算出回路1304は、雑音を含まない音声信号のみ
の自己相関関数を用いてレビンソン・ダービンアルゴ
リズムで、最適な予測係数ａ₁，ａ₂，-，-，ａ₁ ₀ を求
める。[Speech / Noise Switching SW102B], [Prediction Coefficient Calculation Circuit 1304] The prediction coefficient calculation circuit 1304 uses the autocorrelation function of only the noise-free audio signal to obtain the optimum prediction coefficient a by the Levinson-Durbin algorithm. _{_{1, a 2, -, -}} , seeking a ₁ _0.

【０１３４】[係数変換回路1305]係数変換回路1305は、
このようにして計算された最適な予測係数ａ₁，ａ₂，
-，-，ａ₁₀は時間領域のパラメータ（インパルス応答の
係数）であり、これを周波数領域のパラメータである、
ライン・スペクトル・ペア（ＬＳＰ）と呼ばれる別の係
数に変換する。[Coefficient conversion circuit 1305] The coefficient conversion circuit 1305
The optimal prediction coefficients a ₁ , a ₂ ,
-, -, a ₁₀ is a parameter in the time domain (the coefficient of the impulse response), which is a parameter of the frequency domain,
Convert to another coefficient called a line spectrum pair (LSP).

【０１３５】[サブフレーム係数内挿回路（量子化前）1
308A]サブフレーム係数内挿回路（量子化前）1308Aは量
子化を実施する前の第２と第４サブフレームのLSPパラ
メータを用いて、第１と第３サブフレームのLSPパラメ
ータを求め、それを用いて、線形予測回路の伝達関数で
あるA(z)を得る。[Subframe coefficient interpolation circuit (before quantization) 1
308A] The subframe coefficient interpolation circuit (before quantization) 1308A obtains the LSP parameters of the first and third subframes using the LSP parameters of the second and fourth subframes before performing the quantization, Is used to obtain A (z) which is a transfer function of the linear prediction circuit.

【０１３６】[４サブフレーム重みずけ音声計算回路130
9]４サブフレーム重みずけ音声計算回路1309は窓掛けを
する前のデジタル音声信号４サブフレーム（すなわち音
声信号１フレーム）分の音声に人間の聴覚特性を考慮し
た伝達関数のフィルターを通す。人間の聴覚特性を考慮
した伝達関数のフィルターは既述した式(20)で表わされ
る。Ｗ(ｚ)= {A(ｚ/γ₁)} /{A(ｚ/γ₂)} (20)[4 Subframe Weighted Speech Calculation Circuit 130
9] The 4-subframe weighted audio calculation circuit 1309 passes the audio of the 4 subframes (that is, one audio signal) of the digital audio signal before windowing through a filter of a transfer function in consideration of human auditory characteristics. The filter of the transfer function in consideration of the human auditory characteristics is represented by the above-described equation (20). W (z) = {A (z / γ ₁ )} / {A (z / γ ₂ )} (20)

【０１３７】[係数量子化回路1306]係数量子化回路1306
は、このようにして得られた、ＬＳＰを適切なビツト数
に変換する。[Coefficient quantization circuit 1306] Coefficient quantization circuit 1306
Converts the LSP thus obtained into an appropriate number of bits.

【０１３８】[サブフレーム係数内挿回路1307]サブフレ
ーム係数内挿回路1307(量子化後)は、第２サブフレーム
と第４サブフレームのLSPの値を用いて、第１と第３サ
ブフレームのLSPパラメータを求める。さらに、これら
の値を用いて線形予測回路の伝達関数であるA’(z)を得
る。[Subframe coefficient interpolation circuit 1307] The subframe coefficient interpolation circuit 1307 (after quantization) uses the LSP values of the second subframe and the fourth subframe to calculate the first and third subframes. Find the LSP parameter of. Further, using these values, A '(z) which is a transfer function of the linear prediction circuit is obtained.

【０１３９】[音声/雑音切替SW102C]、[開ループピッチ
遅れ発見回路1310A]、[開ループピッチゲイン発見回路1
310B] 開ループピッチ遅れ発見回路1310A、開ループピッチゲ
イン発見回路1310Bは、10msに１回、すなわち、音声信
号の１フレームである20msに２回、ピッチの周期すなわ
ち遅れ時間とゲインを推定する。ピッチ遅れとピッチゲ
イン最適値がRlag(open)を最大にする値として選ばれ、
遅れ時間は適応コードブック1311A、ピッチゲインはピ
ッチゲイン調整回路1311Bに夫々引き渡される。[Voice / Noise Switching SW102C], [Open Loop Pitch Delay Finding Circuit 1310A], [Open Loop Pitch Gain Finding Circuit 1]
310B] The open loop pitch delay finding circuit 1310A and the open loop pitch gain finding circuit 1310B estimate the pitch period, that is, the delay time and the gain once every 10 ms, that is, twice every 20 ms, which is one frame of the audio signal. Pitch delay and pitch gain optimum values are selected as values that maximize Rlag (open),
The delay time is passed to the adaptive codebook 1311A, and the pitch gain is passed to the pitch gain adjustment circuit 1311B.

【０１４０】[インパルス応答計算回路1308B]インパル
ス応答計算回路1308Bは、サブフレーム係数内挿回路
（量子化前）1308Aで得られた、線形予測回路の伝達関
数であるA(z)とサブフレーム係数内挿回路1307（量子化
後）で得られたA’(z)とを用いて、人間の聴覚補正のさ
れた線形予測合成回路のインパルス応答である、ｈ(ｎ)
を求める。ｈ(ｎ)のｚ変換は既述した式(19)で表され
る。 H(ｚ)＝(１/A’(z))・(Ｗ(ｚ))＝{A(ｚ/γ₁)} /{ A’(z)・A(ｚ/γ₂)} (19) このｈ(ｎ)は適応コードブックと固定コードブックのサ
ーチに必要なものである。[Impulse Response Calculation Circuit 1308B] The impulse response calculation circuit 1308B includes A (z), which is the transfer function of the linear prediction circuit, obtained by the subframe coefficient interpolation circuit (before quantization) 1308A, and the subframe coefficient. Using A ′ (z) obtained by the interpolation circuit 1307 (after quantization), h (n), which is an impulse response of a linear prediction synthesis circuit corrected for human hearing,
Ask for. The z-transform of h (n) is represented by the above-described equation (19). H (z) = (1 / A ′ (z)) · (W (z)) = {A (z / γ ₁ )} / {A ′ (z) · A (z / γ ₂ )} (19) This h (n) is necessary for searching the adaptive codebook and the fixed codebook.

【０１４１】[適応コードブック1311A]、[ピッチゲイン
調整回路1311B] 適応コードブック1311Aとピッチゲイン調整回路1311Bの
初期値は、上記の開ループピッチ遅れ発見回路1310Aと
開ループピッチゲイン発見回路1310Bで求められたピッ
チ遅れ時間とピッチゲインがそれぞれ、適応コードブッ
ク1311Aと、ピッチゲイン調整回路1311Bで使用される。[Adaptive Codebook 1311A], [Pitch Gain Adjustment Circuit 1311B] The initial values of the adaptive codebook 1311A and pitch gain adjustment circuit 1311B are obtained by the open loop pitch delay finding circuit 1310A and open loop pitch gain finding circuit 1310B. The obtained pitch delay time and pitch gain are used in adaptive codebook 1311A and pitch gain adjustment circuit 1311B, respectively.

【０１４２】[音声/雑音切替SW102D]、[インパルス応答
回路1311C] インパルス応答回路1311Cはインパルス応答計算回路130
8Bで求めたｈ(ｎ)とピッチゲイン調整回路1311Bを経由
した適応コードブック1311Aの出力パルスとを畳み込み
積分して重みずけ音声信号に相当する合成信号に変換し
それを誤差計算回路1311Dに出力する。[Voice / Noise Switching SW102D], [Impulse Response Circuit 1311C] The impulse response circuit 1311C is an impulse response calculation circuit 130.
8 (b) and the output pulse of the adaptive codebook 1311A via the pitch gain adjustment circuit 1311B are convoluted and integrated to convert them into a synthesized signal corresponding to a weighted audio signal, which is then sent to an error calculation circuit 1311D. Output.

【０１４３】[誤差計算回路1311D]誤差計算回路1311D
は、４サブフレーム重みずけ音声計算回路1309で計算さ
れた、重みずけ音声信号と上記インパルス応答回路1311
Cの出力である重みずけ音声信号に相当する合成信号の
差すなわち誤差を計算し誤差最小化回路1311Eに出力す
る。[Error Calculation Circuit 1311D] Error Calculation Circuit 1311D
Is a weighted voice signal calculated by the 4-subframe weighted voice calculation circuit 1309 and the impulse response circuit 1311
The difference of the synthesized signal corresponding to the weighted audio signal which is the output of C, that is, the error is calculated and output to the error minimizing circuit 1311E.

【０１４４】[誤差最小化回路1311E]誤差最小化回路131
1Eは誤差計算回路1311Dで計算された誤差の二乗平均値
を求め、その値を最小化するように適応コードブック13
11Aとピッチゲイン調整回路1311Bを制御する。前述の開
ループピッチ遅れ発見回路1310Aと開ループピッチゲイ
ン発見回路1310Bで求められたピッチ遅れ時間とピッチ
ゲインの周辺を精密にサーチする。[Error minimizing circuit 1311E] Error minimizing circuit 131
1E finds the mean square value of the error calculated by the error calculation circuit 1311D, and adapts the adaptive code book 13 so as to minimize the value.
11A and the pitch gain adjustment circuit 1311B. The vicinity of the pitch delay time and the pitch gain obtained by the open loop pitch delay finding circuit 1310A and the open loop pitch gain finding circuit 1310B are precisely searched.

【０１４５】[最適遅れとゲイン発見回路1312]最適遅れ
とゲイン発見回路1312は誤差最小化回路1311Eの制御の
結果として最適な遅れ時間とゲイン値を発見する。求め
られた、適応コードブックの値とピッチゲインとは夫々
記憶され、それを利用して次の、適応コードブックとピ
ッチゲインの最適値がサーチされる。[Optimal delay and gain finding circuit 1312] The optimal delay and gain finding circuit 1312 finds the optimum delay time and gain value as a result of the control of the error minimizing circuit 1311E. The obtained values of the adaptive codebook and the pitch gain are respectively stored, and the optimum values of the next adaptive codebook and the pitch gain are searched using the stored values.

【０１４６】[ピッチゲイン量子化回路1313]ピッチゲイ
ン量子化回路1313は、最適遅れとゲイン発見回路1312で
見出された最適ピッチゲインを、適切なビツト数で量子
化する。[Pitch gain quantization circuit 1313] The pitch gain quantization circuit 1313 quantizes the optimum pitch gain found by the optimum delay and gain finding circuit 1312 with an appropriate number of bits.

【０１４７】[適用コードブック寄与度計算回路1314]適
用コードブック寄与度計算回路1314は最適遅れとゲイン
発見回路1312で求められた値によるインパルス列を用い
て得られた誤差計算回路1311Dの誤差信号を、音声/雑音
切替SW102Eを経由して誤差計算回路117Cに出力する。[Applied Codebook Contribution Calculation Circuit 1314] The applied codebook contribution calculation circuit 1314 is an error signal of the error calculation circuit 1311D obtained by using the impulse train based on the value obtained by the optimum delay and gain finding circuit 1312. Is output to the error calculation circuit 117C via the voice / noise switch SW102E.

【０１４８】[固定コードブック116]固定コードブック1
16は、５つのトラツクに分けられた全体で10ヶの単パル
スの組み合わせで作られ、パルスは、“＋１”と“−
１”の値を取ることが出来る。これらのパルスの初期値
は雑音区間の時の動作で設定された最適値となつてい
る。[Fixed Code Book 116] Fixed Code Book 1
16 is made up of a combination of a total of 10 single pulses divided into 5 tracks, with the pulses being “+1” and “−”.
The initial value of these pulses is the optimum value set in the operation in the noise section.

【０１４９】[固定ゲイン調整回路117A]固定ゲイン調整
回路117Aは固定コードブック116が発生した１サブフレ
ームににつき８ケのパルスの振幅を調整する回路であり
その出力をインパルス応答回路117Bに渡す。初期値は雑
音区間の時の動作で設定された最適値となつている。[Fixed Gain Adjustment Circuit 117A] The fixed gain adjustment circuit 117A is a circuit for adjusting the amplitude of eight pulses per subframe generated by the fixed codebook 116, and passes its output to the impulse response circuit 117B. The initial value is the optimum value set in the operation in the noise section.

【０１５０】[最適フィルタ記憶回路120]最適フィルタ
記憶回路120は、インパルス応答計算回路1308Bで得られ
た、人間の聴覚補正のされた線形予測合成回路のインパ
ルス応答であるｈ(ｎ)を次の動作に備えて記憶してお
く。最適フィルタ記憶回路回路120は雑音区間の動作で
記憶した線形予測合成回路のインパルス応答ｈ(ｎ)とそ
の初期条件をインパルス応答回路117Bに出力する。[Optimal filter storage circuit 120] The optimum filter storage circuit 120 stores the following impulse response (h (n)) of the linear prediction synthesis circuit corrected by the human auditory sense obtained by the impulse response calculation circuit 1308B. It is stored in preparation for the operation. The optimum filter storage circuit 120 outputs the impulse response h (n) of the linear prediction synthesis circuit stored in the operation in the noise section and its initial condition to the impulse response circuit 117B.

【０１５１】[インパルス応答回路117B]インパルス応答
回路117Bは前記最適フィルタ記憶回路120から出力され
た人間の聴覚補正のされた線形予測合成回路のインパル
ス応答であるｈ(ｎ)に対応するインパルス応答であるｈ
ｖ(ｎ)（但し、サフィクスvは雑音を表す）と固定コー
ドブック116、固定ゲイン調整回路117Aにより合成され
た固定コードパルス列とを畳み込み積分して重みずけ雑
音に相当する合成信号に変換しそれを誤差計算回路117C
に入力する。[Impulse Response Circuit 117B] The impulse response circuit 117B is an impulse response corresponding to h (n), which is the impulse response of the linear predictive synthesis circuit with human auditory correction output from the optimal filter storage circuit 120. A h
v (n) (where the suffix v represents noise) and the fixed codebook 116 and the fixed code pulse train synthesized by the fixed gain adjustment circuit 117A are convoluted and integrated to convert into a synthesized signal corresponding to weighted noise. Error calculation circuit 117C
To enter.

【０１５２】[誤差計算回路117C]誤差計算回路117Cは、
誤差最小化回路1311Eで誤差の二乗平均誤差が最小化さ
れた状態での誤差計算回路1311Dでの誤差信号が入力さ
れている。もう一方の入力には、固定コードブック11
6、固定ゲイン調整回路117Aにより合成された固定コー
ドパルス列を、インパルス応答回路117Bにより重みずけ
雑音の合成信号に変換された信号との差を計算しその値
を誤差最小化回路117Dに出力する。[Error Calculation Circuit 117C] The error calculation circuit 117C
The error signal from the error calculation circuit 1311D in a state where the root mean square error of the error is minimized by the error minimization circuit 1311E is input. The other input is a fixed codebook 11
6.Calculate the difference between the fixed code pulse train synthesized by the fixed gain adjustment circuit 117A and the signal converted to the synthesized signal of weighted noise by the impulse response circuit 117B, and output the value to the error minimization circuit 117D. .

【０１５３】[誤差最小化回路117D]誤差最小化回路117D
は誤差計算回路117Cで計算された誤差の二乗平均値を求
め、その値を最小化するように固定コードブック116と
固定ゲイン調整回路117Aを制御する。その方法の詳細は
省略するが、ここでの動作により雑音成分は大きく除去
され、一方音声信号成分はそのまま残り、誤差計算回路
117Cでの残留成分は誤差計算回路1317Cに入力される。[Error minimizing circuit 117D] Error minimizing circuit 117D
Calculates the root-mean-square value of the error calculated by the error calculation circuit 117C, and controls the fixed codebook 116 and the fixed gain adjustment circuit 117A to minimize the value. Although the details of the method are omitted, the noise component is largely removed by the operation here, while the audio signal component remains as it is, and the error calculation circuit
The residual component at 117C is input to error calculation circuit 1317C.

【０１５４】[固定コードブック最適値発見回路117E]固
定コードブック最適値発見回路117Eは誤差最小化回路11
7Dが誤差の二乗平均値を最小化するように固定コードブ
ック116と固定ゲイン調整回路117Aを制御した結果とし
て固定コードブックの最適値と固定ゲインの最適値を発
見する。これらの値は夫々雑音に相当する値である。[Fixed Codebook Optimal Value Finding Circuit 117E] The fixed codebook optimal value finding circuit 117E is an error minimizing circuit 11.
As a result of controlling the fixed codebook 116 and the fixed gain adjustment circuit 117A so that 7D minimizes the root mean square value of the error, the optimum value of the fixed codebook and the optimum value of the fixed gain are found. Each of these values is a value corresponding to noise.

【０１５５】[固定コードブックゲイン量子化回路118]
固定コードブックゲイン量子化回路118は固定コードブ
ックのゲインを適切なビツト数で量子化する。[Fixed Codebook Gain Quantization Circuit 118]
The fixed codebook gain quantization circuit 118 quantizes the gain of the fixed codebook with an appropriate number of bits.

【０１５６】[励振計算回路119]励振計算回路119は今ま
でに説明したようにして作られた固定コードブックなら
びに固定ゲインの最適値を、サブフレーム毎の合成雑音
の励振パルス信号として下式(61)により作成する。ｕ(n)= g_a C (n) ; n=0,1,2,-,-,-,39 (61) ここで、u(n)は励振ベクトルで、C(n)は固定コードブッ
クの励振ベクトル、g_aはそのゲインである。すべて雑音
に相当するものである。[Excitation Calculation Circuit 119] The excitation calculation circuit 119 uses the fixed codebook and the optimum value of the fixed gain created as described above as the excitation pulse signal of the synthesized noise for each sub-frame as Created according to 61). u (n) = g _a C (n); n = 0,1,2,-,-,-, 39 (61) where u (n) is an excitation vector and C (n) is a fixed codebook , The excitation vector, g _a is its gain. All correspond to noise.

【０１５７】[最適フィルタ記憶回路120]最適フィルタ
記憶回路120は雑音に対応する線形予測合成回路のイン
パルス応答であるｈ(ｎ)とその初期条件を次の動作に備
えて記憶しておく。[Optimal filter storage circuit 120] The optimal filter storage circuit 120 stores h (n), which is the impulse response of the linear predictive synthesis circuit corresponding to noise, and its initial condition in preparation for the next operation.

【０１５８】[固定コードブック1316]固定コードブック
1316は、５つのトラツクに分けられた全体で10ヶの単パ
ルスの組み合わせで作られているそれらのパルスは、
“＋１”と“−１”の値を取ることが出来る。[Fixed Code Book 1316] Fixed Code Book
1316, those pulses made of a combination of a total of 10 single pulses divided into 5 tracks,
It can take the values of "+1" and "-1".

【０１５９】[固定ゲイン調整回路1317A]固定ゲイン調
整回路1317Aは固定コードブック1316が発生した１サブ
フレームににつき10ケのパルスの振幅を調整する回路で
有りその出力をインパルス応答回路1317Bに渡す。[Fixed Gain Adjustment Circuit 1317A] The fixed gain adjustment circuit 1317A is a circuit for adjusting the amplitude of 10 pulses per subframe generated by the fixed codebook 1316, and outputs its output to the impulse response circuit 1317B.

【０１６０】[インパルス応答回路1317B]インパルス応
答回路1317Bはインパルス応答計算回路1308Bで求めたｈ
(ｎ)と、固定ゲイン調整回路1317Aを経由した固定コー
ドブック1316の出力パルス列とを畳み込み積分した重み
ずけ音声信号に相当する合成信号に変換しそれを誤差計
算回路1317Cに入力する。[Impulse Response Circuit 1317B] The impulse response circuit 1317B obtains h obtained by the impulse response calculation circuit 1308B.
(n) and the output pulse train of the fixed codebook 1316 via the fixed gain adjustment circuit 1317A are converted into a synthesized signal corresponding to a weighted audio signal obtained by convolution and integration, and input to the error calculation circuit 1317C.

【０１６１】[誤差計算回路1317C]誤差計算回路117Cの
誤差信号出力と、前記インパルス応答回路1317Bの出力
との差を計算しその値を誤差最小化回路1317Dに出力す
る。[Error Calculation Circuit 1317C] The difference between the error signal output of the error calculation circuit 117C and the output of the impulse response circuit 1317B is calculated, and the value is output to the error minimization circuit 1317D.

【０１６２】[誤差最小化回路1317D]誤差最小化回路131
7Dは誤差計算回路1317Cで計算された誤差の二乗平均値
を求め、その値を最小化するように固定コードブック13
16と固定ゲイン調整回路1317Aを制御する。[Error minimizing circuit 1317D] Error minimizing circuit 131
7D calculates the mean square value of the error calculated by the error calculation circuit 1317C, and fixes the fixed code book 13 so as to minimize the value.
16 and the fixed gain adjustment circuit 1317A.

【０１６３】[固定コードブック最適値発見回路1317E]
固定コードブック最適値発見回路1317Eは誤差最小化回
路1317Dが誤差の二乗平均値を最小化するように固定コ
ードブック1316と固定ゲイン調整回路1317Aを制御した
結果として固定コードブックの最適値と固定ゲインの最
適値を発見する。[Fixed codebook optimum value finding circuit 1317E]
The fixed codebook optimum value finding circuit 1317E controls the fixed codebook 1316 and the fixed gain adjustment circuit 1317A so that the error minimizing circuit 1317D minimizes the mean square value of the error. To find the optimal value of.

【０１６４】[固定コードブックゲイン量子化回路1318]
固定コードブックゲイン量子化回路1318は固定コードブ
ックのゲインを無線伝送に必要な適切なビツト数で量子
化する。[Fixed codebook gain quantization circuit 1318]
The fixed codebook gain quantization circuit 1318 quantizes the gain of the fixed codebook with an appropriate number of bits necessary for wireless transmission.

【０１６５】[励振計算回路1319]励振計算回路1319は今
までに説明した事柄により、作られた適応コードブック
ならびにピッチゲインの最適値と、固定コードブックな
らびに固定ゲインの最適値により、サブフレーム毎の合
成音声の励振パルス信号を既述の式(25)により作成す
る。ｕ(n)= g_pv(n)+g_a C (n) ; n=0,1,2,-,-,-,39 (25) ここで、u(n)は励振ベクトルであり、v(n)は適応コード
ブックの励振ベクトルであり、g_pは、そのゲインであ
り、C(n)は固定コードブックの励振ベクトルであり、g_a
は、そのゲインである。C(n)とg_aは雑音の除去された音
声に対応するものである。[Excitation Calculation Circuit 1319] The excitation calculation circuit 1319 calculates the optimum values of the adaptive codebook and the pitch gain and the optimum values of the fixed codebook and the fixed gain for each sub-frame. The excitation pulse signal of the synthesized speech is generated by the above-described equation (25). u (n) = g _p v (n) + g _a C (n); n = 0,1,2,-,-,-, 39 (25) where, u (n) is an excitation vector, v (n) is the excitation vector of the adaptive codebook, g _p is its gain, C (n) is the excitation vector of the fixed codebook, g _a
Is the gain. C (n) and g _a correspond to speech from which noise has been removed.

【０１６６】[最適フィルタ記憶回路1320]最適フィルタ
記憶回路1320は音声信号にに対応する線形予測合成回路
のインパルス応答であるｈ(ｎ)とその初期条件を次の動
作に備えて記憶しておく。[Optimal filter storage circuit 1320] The optimal filter storage circuit 1320 stores h (n), which is the impulse response of the linear prediction / synthesis circuit corresponding to the audio signal, and its initial condition in preparation for the next operation. .

【０１６７】以上のように本発明の第１の実施の形態に
よれば、音声符号化装置に存在する音声信号の短期間予
測係数計算回路、係数変換回路、係数量子化回路を、雑
音を取り除いた音声信号の自己相関関数Rs(0),Rs(1),Rs
(2),-,-,Rs(M)を用いて動作させることにより、周囲雑
音を低減可能な音声符号化装置を可能とするものであ
る。以上説明したようにして、音声信号に現在含まれて
いる雑音の量に適応して、それを除去する、適応型音声
符号化装置または方法が実現できることが明らかになっ
た。As described above, according to the first embodiment of the present invention, the short-term prediction coefficient calculation circuit, coefficient conversion circuit, and coefficient quantization circuit of the speech signal existing in the speech coding apparatus are used to remove noise. Autocorrelation functions Rs (0), Rs (1), Rs
(2) By operating using-,-, Rs (M), a speech coding apparatus capable of reducing ambient noise is made possible. As described above, it has become clear that an adaptive speech coding apparatus or method can be realized that adapts to the amount of noise currently contained in the speech signal and removes it.

【０１６８】以上のように本発明の第１の実施形態は、
GSM携帯電話の技術標準書に記載されているACELP音声符
号化装置に忠実に適用したものである。本発明の目的の
１つは、雑音を含む音声信号を音声符号化装置に入力す
ると、雑音レベルが増大した場合、音声符号化装置が正
しく動作しなくなり、信号対雑音比を更に悪化する恐れ
があるので、音声符号化装置の内部動作において、雑音
を消去し音声符号化装置が安定に動作し信号対雑音比の
悪化を防止することにある。As described above, the first embodiment of the present invention
This is a faithful application to the ACELP speech coding device described in the GSM mobile phone technical standard. One of the objects of the present invention is that when a speech signal containing noise is input to a speech encoding device, the speech encoding device does not operate properly when the noise level increases, and the signal-to-noise ratio may be further deteriorated. Therefore, it is an object of the present invention to eliminate noise in the internal operation of the speech coding apparatus, to operate the speech coding apparatus stably, and to prevent deterioration of the signal-to-noise ratio.

【０１６９】本発明の原理を分かりやすく説明するため
に、雑音を含む音声信号区間の動作説明図である図４、
図５を、特許の請求範囲とは関係の無い部分を削除して
簡略化した図を、図６として示す。以下、図６を用いて
説明する。To explain the principle of the present invention in an easy-to-understand manner, FIG.
FIG. 6 is a simplified diagram of FIG. 5 in which parts irrelevant to the claims are deleted. Hereinafter, description will be made with reference to FIG.

【０１７０】各部分を大きく、ブロックＡ、ブロック
Ｂ、ブロックＣ、ブロックＤと括ってある。そして夫々
のブロックの、出力を夫々出力Ａ、出力Ｂ、出力Ｃ、出
力Ｄと示してある。Each part is broadly grouped into blocks A, B, C and D. The output of each block is shown as output A, output B, output C, and output D, respectively.

【０１７１】ブロックＡは、入力信号から雑音成分を除
去する機能を有してをり、出力Ａには雑音成分の除去さ
れた信号が出力される。ブロックＢは、入力信号から音
声信号のピッチ成分を除去する機能を有しており、出力
Ｂにはピッチ成分の除去された信号が出力される。ブロ
ックＣは、入力信号から音声信号の予測可能な成分を除
去する機能を有しており、出力Ｃには音声信号の予測可
能な成分の除去された信号が出力される。そしてブロッ
クＤは、入力信号に含まれる雑音を推定する機能等を有
しており、出力Ｄには人間の聴覚を考慮して補正された
雑音を含む音声信号が出力される。The block A has a function of removing a noise component from an input signal, and a signal from which a noise component has been removed is output to an output A. The block B has a function of removing the pitch component of the audio signal from the input signal, and the output B outputs a signal from which the pitch component has been removed. The block C has a function of removing a predictable component of the audio signal from the input signal, and the output C outputs a signal from which the predictable component of the audio signal has been removed. The block D has a function of estimating noise included in the input signal and the like, and an output D outputs a sound signal including noise corrected in consideration of human hearing.

【０１７２】従来の音声符号化装置には、ブロックＡは
ないので入力信号から雑音成分を除去する機能は有して
おらず、従って、ブロックＣの出力Ｃには入力信号に含
まれる雑音がそのまま現れる。そのため雑音が大になる
と誤差最小化回路1317Dが動作不能に陥る危険性があ
る。Since the conventional speech coding apparatus does not have the block A, it does not have a function of removing a noise component from the input signal. Therefore, the noise contained in the input signal is directly output to the output C of the block C. appear. Therefore, when the noise becomes large, there is a risk that the error minimizing circuit 1317D becomes inoperable.

【０１７３】本発明は、ブロックＡが入力信号から雑音
成分を除去する機能を有しており、出力Ａには雑音成分
が出力されないので、その結果としてブロックＣの出力
Ｃには雑音が現れない。そのため誤差最小化回路1317D
が動作不能に陥る危険性はなく、本発明の音声符号化装
置は入力信号に雑音が含まれていてもその影響を受けず
に動作可能となる。According to the present invention, the block A has a function of removing a noise component from an input signal, and no noise component is output at the output A. As a result, no noise appears at the output C of the block C. . Therefore error minimizing circuit 1317D
Is not in danger of becoming inoperable, and the speech coding apparatus of the present invention can operate without being affected even if the input signal contains noise.

【０１７４】図６で明らかなようにブロックＢ、ブロッ
クＡ、ブロックＣの順に配列してあるが、これをブロッ
クＡ、ブロックＢ、ブロックＣの順に配置することも出
来る。この場合の本発明の原理を分かりやすく説明する
図を、図７に示す。動作の内容は既に説明した事柄から
から自明であり省略する。この場合の長所は、ピッチ成
分を除去する機能を有するブロックＢの入力信号には雑
音の取り除かれた音声信号が入力されるのでピッチ成分
の最適値を探す動作が雑音に妨害されずに行える点にあ
る。As is clear from FIG. 6, the blocks are arranged in the order of block B, block A and block C, but they can be arranged in the order of block A, block B and block C. FIG. 7 is a diagram for easily explaining the principle of the present invention in this case. The contents of the operation are self-evident from the already described matters and will not be described. An advantage of this case is that since an audio signal from which noise has been removed is input to the input signal of the block B having a function of removing the pitch component, the operation of searching for the optimum value of the pitch component can be performed without being disturbed by the noise. It is in.

【０１７５】ピッチ成分を除去するブロックＢは、音声
符号化装置としては重要であるが、請求項１には含んで
いない。その理由はそれを含んで記述するとあまりにも
長文となり意味が不明確になることと、音声符号化装置
がピッチ成分を予測する部分を有することは公知で不要
だからである。従つて図７に示す内容も本発明に含まれ
ることを指摘するにとどめる。The block B for removing the pitch component is important as a speech coding apparatus, but is not included in the first aspect. The reason for this is that if it is described in such a manner, the sentence will be too long and the meaning will be unclear, and it is publicly known and unnecessary that the speech coding apparatus has a part for predicting the pitch component. Therefore, it is only pointed out that the contents shown in FIG. 7 are also included in the present invention.

【０１７６】（第２の実施の形態）本発明の第２の実施
の形態は、請求項２の発明を説明するためのものであ
り、図８、図１１、図１２、図１３、図１４、図１５を
用いて説明する。(Second Embodiment) A second embodiment of the present invention is for describing the second aspect of the present invention, and is shown in FIGS. 8, 11, 12, 13, and 14. This will be described with reference to FIG.

【０１７７】図８は本発明の第２の実施形態の発明原理
を分かりやすく説明する図であり、図１３は雑音のみ存
在する区間の動作を説明する詳細図、図１４、図１５は
雑音を含む音声信号の区間の動作を説明する詳細図、図
１１、図１２は雑音のみの区間と雑音を含む音声信号の
区間の動作を音声/雑音切替SWを用いて統合した図であ
る。図１１、図１２で図１、図２に新しく追加されたも
のは、音声/雑音切替SW102Gと、音声/雑音切替SW102Hの
みである。FIG. 8 is a diagram for explaining the principle of the second embodiment of the present invention in an easy-to-understand manner. FIG. 13 is a detailed diagram for explaining the operation in a section where only noise exists, and FIG. 14 and FIG. FIG. 11 and FIG. 12 are diagrams illustrating the operation of a section of an audio signal including noise, and FIGS. 11 and 12 are diagrams in which the operation of the section of an audio signal including noise and the operation of the section of an audio signal including noise are integrated using a voice / noise switch. In FIGS. 11 and 12, what is newly added to FIGS. 1 and 2 is only the voice / noise switch SW102G and the voice / noise switch SW102H.

【０１７８】図６又は図７で明らかなようにブロック
Ａ、ブロックＢの出力は、夫々次のブロックに入力され
ると共に、夫々の誤差最小化回路に入力されている。こ
れと異なり図８ではブロックＡ、ブロックＢの出力は、
夫々次のブロックに入力されるだけで、夫々の誤差最小
化回路には入力されない。夫々の誤差最小化回路には最
終のブロックであるブロックＣの出力Ｃが入力されてい
る。As is clear from FIG. 6 or FIG. 7, the outputs of the blocks A and B are input to the next block and to the respective error minimizing circuits. In contrast to this, in FIG. 8, the outputs of blocks A and B are:
Each is only input to the next block, and is not input to each error minimizing circuit. The output C of the final block, block C, is input to each error minimizing circuit.

【０１７９】このように構成することにより、ブロック
Ａ、ブロックＢの誤差最小化回路にも、最終のブロック
であるブロックＣの誤差最小化回路と同じ出力Ｃが入力
されるので誤差最小化動作に不要な成分が除去されるこ
とになり、極めて安定にかつ迅速な動作が可能となる。
動作の詳細は第１の実施の形態の説明から自明であり省
略する。With this configuration, the same output C as the error minimizing circuit of the final block, block C, is input to the error minimizing circuits of blocks A and B, so that the error minimizing operation can be performed. Unnecessary components are removed, and extremely stable and quick operation becomes possible.
The details of the operation are obvious from the description of the first embodiment, and will not be repeated.

【０１８０】（第３の実施の形態）本発明の第３の実施
の形態は、請求項７、８、９、１０、１１、１２に対応
するものであり、音声信号のピッチ周期Ｌに相当する遅
延時間の自己相関関数R(L)を雑音の時間変動率の推定に
追加して使用する。(Third Embodiment) A third embodiment of the present invention corresponds to claims 7, 8, 9, 10, 11, and 12, and corresponds to a pitch period L of an audio signal. The autocorrelation function R (L) of the delay time is used in addition to the estimation of the time variation rate of the noise.

【０１８１】図１６、図１７、図１８、図１９は請求項
７、８、９、１０、１１、１２に記載の発明を説明する
ものであり、本発明の第３の実施形態の適応型雑音抑圧
音声符号化装置の機能ブロック図である。図１６、図１
７は夫々第１の実施形態の図１、図２、図４、図５に対
応し、図１８、図１９は雑音が音声に重畳している区間
の動作を説明するためのものである。FIG. 16, FIG. 17, FIG. 18, and FIG. 19 explain the invention according to claims 7, 8, 9, 10, 11, and 12, and show an adaptive type of the third embodiment of the present invention. It is a functional block diagram of a noise suppression speech coding device. 16 and FIG.
7 corresponds to FIGS. 1, 2, 4 and 5 of the first embodiment, respectively, and FIGS. 18 and 19 are for explaining the operation in a section where noise is superimposed on the voice.

【０１８２】図１６、図１７において、図１、図２に新
しく追加されたのはピッチ自己相関関数記憶回路107の
みである。以上のように構成された適応型雑音抑圧音声
符号化装置について、その動作を、図１６、図１７、図
１８、図１９を用いて説明する。In FIGS. 16 and 17, only the pitch autocorrelation function storage circuit 107 is newly added to FIGS. The operation of the adaptive noise suppression speech coding apparatus configured as described above will be described with reference to FIGS. 16, 17, 18, and 19. FIG.

【０１８３】雑音のみの区間の動作については第１の実
施の形態で説明した図３の場合と全く同様のため省略す
る。The operation in the noise-only section is exactly the same as in the case of FIG. 3 described in the first embodiment, and will not be described.

【０１８４】音声信号が存在する区間の動作を説明す
る。この場合は図１６、図１７で音声/雑音切替SW102A
〜SW102Fはコモン接点CとS接点が接続される。この場合
の状態を図１８、図１９に示す。The operation in a section where a voice signal is present will be described. In this case, the voice / noise switch SW102A is used in FIGS.
The common contact C and the S contact are connected to SW102F. The state in this case is shown in FIGS.

【０１８５】[自己相関関数記憶回路103]、[音声/雑音
切替SW102A]、[自己相関関数記憶回路103] 以上の動作は第１の実施形態で説明したものと同じであ
る。[Autocorrelation Function Storage Circuit 103], [Speech / Noise Switching SW 102A], [Autocorrelation Function Storage Circuit 103] The above operations are the same as those described in the first embodiment.

【０１８６】[ピッチ自己相関関数記憶回路107]ピッチ
自己相関関数記憶回路107は、一つ前のサブフレームに
おける、開ループピッチ遅れ発見回路1310A、開ループ
ピッチゲイン発見回路1310Bの動作において、ループピ
ッチ遅れとピッチゲインの発見は、既述の関係式、Rlag
(open) = Σ_n=0 ⁿ⁼⁷⁹sw(n)sw(n-k) ;n=0,-,-,79
(21)に基づいて最大値を記憶しておき、その値
をR(L)として雑音変化率推定回路105、自己相関関数推
定回路106に渡す。ピッチ自己相関関数R(L)の一例を図
２０に示す。[Pitch autocorrelation function storage circuit 107] The pitch autocorrelation function storage circuit 107 stores the loop pitch in the operation of the open loop pitch delay finding circuit 1310A and the open loop pitch gain finding circuit 1310B in the immediately preceding subframe. The discovery of delay and pitch gain is based on the previously described relational expression, Rlag
(open) = Σ _{n = 0} ^{n = 79} sw (n) sw (nk); n = 0,-,-, 79
The maximum value is stored based on (21), and the value is passed to the noise change rate estimation circuit 105 and the autocorrelation function estimation circuit 106 as R (L). FIG. 20 shows an example of the pitch autocorrelation function R (L).

【０１８７】[雑音変化率推定回路105]、[自己相関関数
推定回路106] 従来例図３０で説明した自己相関関数計算回路1303から
雑音を含む音声信号の自己相関関数R(0),R(1),R(2),-,
-,R(M)が入力され、自己相関関数記憶回路103からは、
雑音区間の自己相関関数Rn_m (0),R(1)n_m,R(2)n_m,-,-,Rn
_m (M)が入力され、更にピッチ自己相関関数記憶回路107
からピッチ自己相関関数R(L)が入力される。[Noise Change Rate Estimation Circuit 105] and [Autocorrelation Function Estimation Circuit 106] Conventional Example The autocorrelation function R (0), R (0) of the speech signal containing noise is obtained from the autocorrelation function calculation circuit 1303 described with reference to FIG. 1), R (2),-,
-, R (M) is input, and from the autocorrelation function storage circuit 103,
Autocorrelation function Rn _m (0), R (1) _nm , R (2) _nm ,-,-, Rn
_m (M) is input, and the pitch autocorrelation function storage circuit 107
, A pitch autocorrelation function R (L) is input.

【０１８８】音声信号に雑音が重畳した信号区間の音声
信号のみ、および雑音のみの自己相関関数、Ｒｓ(0)、
Ｒｎ(0)、Ｒｓ(ｋ)、Ｒｎ(ｋ) ；ｋ=1,,2,-,-,M は未
知である。Ｒｓ(0) ＞ R(L) (62) 厳密に言うと式(28)のＲｓ(0)はR(0)とすべきだが遅延
時間Ｌのところで雑音の自己相関関数はゼロと考えてよ
いので実用上差し支えない。これらの性質を利用して、
Rs(0)を推定することが出来る。以下に数ヶの例を示
す。The autocorrelation function of only the speech signal in the signal section in which noise is superimposed on the speech signal and the noise only, Rs (0),
Rn (0), Rs (k), Rn (k); k = 1, 2, 2,-,-, M is unknown. Rs (0)> R (L) (62) Strictly speaking, Rs (0) in equation (28) should be R (0), but the autocorrelation function of noise at delay time L may be considered to be zero. Therefore, there is no problem in practical use. Utilizing these properties,
Rs (0) can be estimated. Below are some examples.

【０１８９】＜例A＞Ｒs(0)= function｛｜R(1)｜,｜R(2)｜,-,-,-,｜R(M)｜, R(L)｝ (63) なおfunction｛｝は、｛｝の中の値のある関数とし
て定まることを意味する。又｜R(1)｜は絶対値をとるこ
とを意味する。具体的には以下の方法が考えられる。＜例A−ａ＞Ｒs(0)= Max｛｜R(1)｜,｜R(2)｜,-,-,-,｜R(M)｜,R(L)｝ (64) 従つて、Ｒｓ(0)が推定されるとRn(0)は既述の式(32)に
より推定できる。Ｒｎ(0)=Ｒ(0)−Ｒｓ(0) (32) そして雑音変化率は既述の式(33)で表される。 K=Ｒｎ(0) ／Rn_m (0) (33) 音声信号に含まれる雑音の統計的性質は音声信号が存在
しない区間の時と同じと考えられるとする。そうすると
既述の式(34)が成立する。 Rn(k) =Ｋ＊Ｒn_m (k) ；k＝1，2-,-,-,M (34) より精密な推定として以下の方法が考えられる。＜例B＞Ｒs(0)= function{R(1),R(2),-,-,R(M),Rn_m(0),R(1)n_m,-,-,Rn_m(M),R(L)｝(65) ＜例B−ａ＞具体例としてＫの値は下式(66)とする。Ｋ＝Ｍin(K₁,K₂,K_3a,K_3b,Ｋ_4a,Ｋ_4b,K_L) (66) Ｍinは最小値をとることを意味し、K₁,K₂,K_3a,K_3b,
Ｋ_4a,Ｋ_4b,K_Lは以下の方法で決められる。<Example A> Rs (0) = function ｛| R (1) |, | R (2) |,-,-,-, | R (M) |, R (L)｝ (63) function ｛｝ means to be defined as a function with a value in｝｛. | R (1) | means taking an absolute value. Specifically, the following method is conceivable. <Example A-a> Rs (0) = Max ｛| R (1) |, | R (2) |,-,-,-, | R (M) |, R (L)｝ (64) , Rs (0) is estimated, then Rn (0) can be estimated by the aforementioned equation (32). Rn (0) = R (0) -Rs (0) (32) The noise change rate is expressed by the above-described equation (33). Statistical properties of noise included in the K = Rn (0) / Rn m (0) (33) the audio signal is to be considered the same as when the section having no audio signal. Then, the aforementioned equation (34) is established. Rn (k) = K * Rn m (k); k = 1,2 -, -, -, the following methods can be considered as a precise estimation than M (34). <Example B> Rs (0) = function {R (1), R (2), -, -, R (M), Rn m (0), R (1) n m, -, -, Rn m ( M), R (L)｝ (65) <Example Ba> As a specific example, the value of K is expressed by the following equation (66). K = Min (K ₁ , K ₂ , K _3a , K _3b , K _4a , K _4b , K _L ) (66) Min means taking the minimum value, and K ₁ , K ₂ , K _3a , K _3b ,
K _4a, K _4b, K _L is determined by the following method.

【０１９０】＜K₁,K₂,K_3a,K_3b,Ｋ_4a,Ｋ_4b,K_Lの決め方＞
K₁,K₂,K_3a,K_3b,Ｋ_4a,Ｋ_4bの決め方は第１の実施の形態
で説明済みのためK_Lの決め方のみ説明する。ピッチ周期
の自己相関関数R(L)により定まる値としては、Ｋ_L＝ [Ｒ(0)− R(L)] / Rn_m(0) (67) を採用し、どちらか小の値を雑音の時間変動率Ｋとす
る。<How to determine K ₁ , K ₂ , K _3a , K _3b , K _4a , K _4b , and K _L >
_{_{_{K 1, K 2, K 3a}}} , K 3b, K 4a, how to determine the K _4b will be described only method of determining the K _L for detailed above in the first embodiment. The value determined by the pitch period autocorrelation function _{R (L), K L =} [R (0) - R (L)] / Rn m (0) (67) employs a noise either small value Is assumed to be the time variation rate K.

【０１９１】＜例B−ｂ＞もう一つの具体例として以下
の例を説明する。＜例２−ｂ＞において説明した方法に
よってＫを求め、その値とピッチ周期の自己相関関数R
(L)により定まる下記の値の、どちらか小の値を雑音の
時間変動率Ｋとする方法がある。Ｋ_L＝ [Ｒ(0)− R(L)] / Rn_m(0) (67) 自己相関関数推定回路106は以上の如き方法で求められ
たＫの値を用いて、現在の音声信号に混在している雑音
の自己相関関数を以下により推定する。 Rｎ(j)=K＊ Rｎ_m (j) ; j=1,2,-,-,M (59) 次に、音声信号のみの自己相関関数、Rs(0), Rs(1),Rs
(2),-,-,Rs(M)を下式により計算する。 Rｓ(ｊ)=R(ｊ)−K＊ Rｎ_m (j) ; j=1,2,-,-,M (60) 以上により得られたＲｓ(0)、Rｓ(ｊ)の値を、音声/雑
音切替SW102Bを経由して予測係数算出回路1304に渡す。<Example Bb> The following example will be described as another specific example. K is obtained by the method described in <Example 2-b>, and its value and the autocorrelation function R of the pitch period are calculated.
There is a method of setting the smaller one of the following values determined by (L) as the time variation rate K of noise. K _L = [R (0) −R (L)] / Rn _m (0) (67) The autocorrelation function estimating circuit 106 uses the value of K obtained by the above method to calculate the current speech signal. The autocorrelation function of the mixed noise is estimated as follows. Rn (j) = K * Rn m (j); j = 1,2, -, -, M (59) Next, the autocorrelation function of the speech signal only, Rs (0), Rs ( 1), Rs
(2) Calculate-,-, Rs (M) by the following formula. Rs (j) = R (j ) -K * Rn m (j); j = 1,2, -, -, Rs obtained by M (60) or (0), the value of Rs (j), The signal is passed to the prediction coefficient calculation circuit 1304 via the voice / noise switch SW102B.

【０１９２】[音声/雑音切替SW102B]、[予測係数算出回
路1304] これ以後の動作は第１の実施の形態と全く同一のため省
略する。[Speech / Noise Switching SW 102B], [Prediction Coefficient Calculation Circuit 1304] The subsequent operations are exactly the same as in the first embodiment, and will not be described.

【０１９３】以上のように本発明の第３の実施形態によ
れば、第１の実施形態、または第２の実施形態の発明に
おいて、音声符号化装置に存在する音声信号の短期間予
測係数計算回路、係数変換回路、係数量子化回路を、雑
音を取り除いた音声信号の自己相関関数Rs(0),Rs(1),Rs
(2),-,-,Rs(M)を用いて動作させることにより、周囲雑
音を低減可能な音声符号化装置を可能とするものであ
る。As described above, according to the third embodiment of the present invention, in the first or second embodiment, the short-term prediction coefficient calculation of the speech signal existing in the speech encoding device is performed. Circuit, coefficient conversion circuit, coefficient quantization circuit, the autocorrelation function Rs (0), Rs (1), Rs
(2) By operating using-,-, Rs (M), a speech coding apparatus capable of reducing ambient noise is made possible.

【０１９４】以上説明したようにして、音声信号に現在
含まれている雑音の量に適応して、それを除去する、適
応型音声符号化装置または方法が実現できることが明ら
かになった。As described above, it has become clear that an adaptive speech coding apparatus or method can be realized which adapts to the amount of noise currently contained in the speech signal and removes it.

【０１９５】（第４の実施の形態）図２１、図２２、図
２３、図２４は請求項１３に記載の発明を説明するもの
であり、図２１、図２２は第４の実施形態を示すもの
で、第１の実施形態に対応する適応型雑音抑圧音声符号
化装置の、雑音を含む音声信号区間の動作を説明する機
能ブロック図であり、図２３、図２４は第４の実施形態
を示すもので、第２の実施形態に対応する適応型雑音抑
圧音声符号化装置の、雑音を含む音声信号区間の動作を
説明する機能ブロック図である。(Fourth Embodiment) FIGS. 21, 22, 23 and 24 illustrate the invention according to claim 13, and FIGS. 21 and 22 show a fourth embodiment. FIG. 23 is a functional block diagram illustrating an operation of a speech signal section including noise of the adaptive noise suppression speech coding apparatus corresponding to the first embodiment. FIGS. 23 and 24 illustrate the fourth embodiment. FIG. 9 is a functional block diagram illustrating an operation of a speech signal section including noise of the adaptive noise suppression speech coder according to the second embodiment.

【０１９６】図２１、図２２、図２３、図２４におい
て、すべての図示番号は、第１の実施形態で使用した図
１、図２と同一であり、すでに説明済みのため省略す
る。In FIG. 21, FIG. 22, FIG. 23, and FIG. 24, all the reference numerals are the same as those in FIG. 1 and FIG. 2 used in the first embodiment, and will not be described because they have already been described.

【０１９７】以上のように構成された第１の実施形態に
対応する適応型雑音抑圧音声符号化装置について、その
動作を図２１、図２２を用いて説明する。大部分の動作
は図１、図２、図３、図４、図５を用いて説明した第１
の実施形態と同一であり、以下は相違点についてのみ説
明する。The operation of the adaptive noise suppression speech coder according to the first embodiment configured as described above will be described with reference to FIGS. 21 and 22. Most of the operations are performed using the first operation described with reference to FIGS. 1, 2, 3, 4, and 5.
This embodiment is the same as that of the first embodiment, and only the differences will be described below.

【０１９８】雑音のみの区間の動作は、図３を用いて既
に説明したものと同じである。次に音声信号が存在する
区間の動作を図２１、図２２を用いて説明する。The operation in the noise-only section is the same as that already described with reference to FIG. Next, an operation in a section in which a voice signal is present will be described with reference to FIGS.

【０１９９】[自己相関関数記憶回路103]自己相関関数
記憶回路103で計算された音声信号が存在する区間の自
己相関関数は自己相関関数推定回路106へ入力される。[Autocorrelation function storage circuit 103] The autocorrelation function in the section where the voice signal exists calculated by the autocorrelation function storage circuit 103 is input to the autocorrelation function estimation circuit 106.

【０２００】[音声/雑音切替SW102A]、[自己相関関数記
憶回路103] 雑音区間に記憶された雑音の自己相関関数Rn_m(0),R(1)n
_m,R(2)n_m,-,-,Rn_m(M)が読み出されて自己相関関数推定
回路106へ入力される。[Speech / Noise Switching SW102A], [Autocorrelation Function Storage Circuit 103] The autocorrelation function Rn _m (0), R (1) n of the noise stored in the noise section
_{_{m, R (2) n m}} , -, -, Rn m (M) is input read and the autocorrelation function estimation circuit 106.

【０２０１】[雑音変化率推定回路105]一つ前の一定区
間長のデータで制御された第１の固定コードゲイン調整
手段117Aのゲインの最終値が、雑音変化率推定回路105
に入力される。この値のある関数として定まる値を雑音
の時間変動率Ｋとして決める。[Noise change rate estimating circuit 105] The final value of the gain of the first fixed code gain adjusting means 117A controlled by the data of the immediately preceding fixed section length is the noise change rate estimating circuit 105.
Is input to A value determined as a function of this value is determined as a time variation rate K of noise.

【０２０２】[自己相関関数推定回路106]前述の自己相
関関数計算回路1303から雑音を含む音声信号の自己相関
関数R(0),R(1),R(2),-,-,R(M)が入力され、自己相関関
数記憶回路103からは、雑音区間の自己相関関数Rn_m(0),
R(1)n_m,R(2)n_m,-,-,Rn_m(M)が入力される。[Autocorrelation function estimating circuit 106] The autocorrelation function calculating circuit 1303 outputs the autocorrelation functions R (0), R (1), R (2),-,-, R ( M) is inputted, and the autocorrelation function Rn _m (0),
_{R (1) n m, R} (2) n m, -, -, Rn m (M) are input.

【０２０３】これらの値と雑音変化率推定回路105が決
めたＫの値を用いて音声信号のみの自己相関関数、Ｒｓ
(0)、Ｒｎ(0)、Ｒｓ(j)、Ｒｎ(ｊ)；j=1,2,-,-,-,Mを、 Rｓ(j) = R(j)−Ｋ＊Ｒn_m (j) ；ｊ＝1，2-,-,-,M (60) として推定する。Using these values and the value of K determined by the noise change rate estimation circuit 105, the autocorrelation function of only the speech signal, Rs
(0), Rn (0) , Rs (j), Rn (j); j = 1,2, -, -, -, and M, Rs (j) = R (j) -K * Rn m (j ); Estimate as j = 1,2-,-,-, M (60).

【０２０４】以上により得られたＲｓ(0)、Rｓ(j)の値
を、音声/雑音切替SW102Bを経由して予測係数算出回路1
304に渡す。The values of Rs (0) and Rs (j) obtained as described above are input to the prediction coefficient calculation circuit 1 via the voice / noise switch SW102B.
Pass to 304.

【０２０５】[音声/雑音切替SW102B]、[予測係数算出回
路1304] 予測係数算出回路1304は、雑音を含まない音声信号のみ
の自己相関関数を用いてレビンソン・ダービン・アルゴ
リズムで、最適な予測係数ａ₁,ａ₂,-，-，ａ₁₀を求め
る。以下の動作は、第１の実施形態で説明したものと同
一のため省略する。[Speech / Noise Switching SW102B], [Prediction Coefficient Calculation Circuit 1304] The prediction coefficient calculation circuit 1304 employs the Levinson-Durbin algorithm using the autocorrelation function of only the noise-free speech signal to obtain the optimum prediction coefficient. a ₁ , a ₂ ,-,-, a ₁₀ are obtained. The following operation is the same as that described in the first embodiment, and a description thereof will be omitted.

【０２０６】次に、第４の実施形態において第２の実施
形態に対応する適応型雑音抑圧音声符号化装置の、雑音
を含む音声信号区間の動作を、図２３、図２４を用いて
説明するが、図２１、図２２を用いて説明した、第４の
実施形態における第１の実施形態に対応する適応型雑音
抑圧音声符号化装置の動作の説明と同一のため省略す
る。Next, the operation of the adaptive noise suppression speech coder of the fourth embodiment corresponding to the second embodiment in the speech signal section containing noise will be described with reference to FIGS. 23 and 24. However, since it is the same as the description of the operation of the adaptive noise suppression speech coding apparatus corresponding to the first embodiment in the fourth embodiment described with reference to FIGS.

【０２０７】次に雑音変化率推定回路105の動作説明
で、“一つ前の一定区間長のデータで制御された第１の
固定コードゲイン調整手段117Aのゲインの最終値が、雑
音変化率推定回路105に入力され、この値のある関数と
して定まる値を雑音の時間変動率Ｋとして決める。”と
したが具体的にどのようにして決めるかを説明する。Next, the operation of the noise change rate estimating circuit 105 is described as follows: "The final value of the gain of the first fixed code gain adjusting means 117A controlled by the data of the immediately preceding fixed section length is the noise change rate estimating circuit The value which is input to the circuit 105 and is determined as a function of this value is determined as the time variation rate K of noise. "

【０２０８】予測誤差e(n)は既述の式(３)で表わされ
る。ｅ(n)=s'(n)−ｓ"(n)=Σa_k s'(n-k) ；k=,0,1,2,3,-,-,,10 ；a₀=1 （３）ここでいま、ｅ(n)=Δu(n) (68) とおく。u(n)はインパルスでありその振幅は１とする。
Δは正負の値を取る。ｅ(n)＊ｅ(n)=Δ²＝Σ_k=0 ^k=Ma_k s’(n-k)＊ Σ_j=0 ^j=Ma_j s’(n-ｊ) ＝Σ_k=0 ^k=MΣ_j=0 ^j=Ma_k a_js’(n-k)＊s’(n-ｊ) ＝Σ_k=0 ^k=MΣ_j=0 ^j=Ma_k a_j R(k-ｊ) (69) ｅ(n)＊ｅ(n)＝Δ²＝Σ_k=0 ^k=Ma_kΣ_j=0 ^j=M a_j R(k-ｊ) (70) 一方、既述の式（８）より下式(71)が成立する。 Σ_j=0 ^j=M a_j R(k-ｊ)＝−R(k) (71) 式（71）を式（70）に代入すると下式(72)を得る。ｅ(n)＊ｅ(n)＝Δ²＝Σ_k=0 ^k=Ma_k R(k)＝{R(0)+Σ_k=1 ^k=Ma_k R(k)} (72) 従つて、下式(73)を得る。ｅ(n)＝Δu’(n)＝{R(0)+Σ_k=1 ^k=Ma_k R(k)}^1/2u’(n) (73) すなわち、インパルスu’(n)のゲインとしては、 Δ＝ {R(0)+Σ_k=1 ^k=Ma_k R(k)}^1/2 (74)The prediction error e (n) is represented by the above-mentioned equation (3). e (n) = s '( n) -s "(n) = Σa k s'(nk); k =, 0,1,2,3 ,, - ,, 10; a 0 = 1 (3) Here, let e (n) = Δu (n) (68), where u (n) is an impulse and its amplitude is 1.
Δ takes a positive or negative value. e (n) * e (n ) = Δ 2 = Σ k = 0 k = M a k s '(nk) * Σ j = 0 j = M a j s' (n-j) = Σ k = 0 k ^{= M} Σ _{j = 0} ^{j = M} a _k a _j s '(nk) * s' (n-j) = Σ _{k = 0} ^{k = M} Σ _{j = 0} ^{j = M} a _k a _j R (k-j ) (69) e (n) * e (n) = Δ ² = Σ _{k = 0} ^{k = M} _ak Σ _{j = 0} ^{j = M} a _j R (k-j) (70) From equation (8), the following equation (71) holds. Σ _{j = 0} ^{j = M} a _j R (k−j) = − R (k) (71) By substituting equation (71) into equation (70), the following equation (72) is obtained. e (n) * e (n ) = Δ 2 = Σ k = 0 k = M a k R (k) = {R (0) + Σ k = 1 k = M a k R (k)} (72) Therefore, the following equation (73) is obtained. e (n) = Δu '( n) = {R (0) + Σ k = 1 k = M a k R (k)} 1/2 u' (n) (73) i.e., the impulse u '(n) Δ = {R (0) + Σ _{k = 1} ^{k =} _Mak R (k)} ^1/2 (74)

【０２０９】ここで、固定コードブック116は40サンプ
ル点に対して10ヶのパルスを出すことを考えると、u’
(n)の有する電力の1/4の電力を有しているのであるから
その補正をする必要がある。固定ゲイン調整回路117Aの
ゲインについては既に下式(61)を記述してある。ｕ(n)= g_a C (n) ; n=0,1,2,-,-,-,39 (61) 以上のことを考慮すると下式(75)を得る。 g_a=2＊{R(0)+Σ_k=1 ^k=Ma_k R(k)}^1/2 (75)Here, considering that the fixed codebook 116 issues 10 pulses for 40 sample points, u ′
Since it has 1/4 of the power of (n), it is necessary to correct it. The following equation (61) has already been described for the gain of the fixed gain adjustment circuit 117A. u (n) = g _a C (n); n = 0,1,2,-,-,-, 39 (61) In consideration of the above, the following equation (75) is obtained. g _a = 2 * {R (0) + Σ _{k = 1} ^{k = M} a _k R (k)} ^1/2 (75)

【０２１０】雑音のみの区間から、雑音を含む音声信号
の区間に移行した際、最初に固定ゲイン調整回路117Aに
設定されるのは既述の式(63)に示される値である。 g_a=2＊{Rn_m(0)+Σ_k=1 ^k=Ma_k Rｎ_m(k)}^1/2 (76) そして最適化動作が完了した際には、雑音を含む音声信
号の区間における雑音に適応した別の値となり下式(77)
が成立する。 g_a’＝2＊{KRn_m(0)+Σ_k=1 ^k=Ma_k KRｎ_m(k)}^1/2 (77) すなわち、下式(78)、(79)が成立する。 K＝{g_a’/ g_a}² (78) K＝{g_a’}² /[4＊{Rn_m(0)+Σ_k=1 ^k=Ma_k Rｎ_m(k)}] (79) 固定コードブック116の全サンプル数に対する実際のパ
ルス数の比をYとすれば下式(80)を得る。 K＝{g_a’}² Y /{Rn_m(0)+Σ_k=1 ^k=Ma_k Rｎ_m(k)} (80) このようにして、雑音変化率Ｋを決めることができる。When the section shifts from the section containing only noise to the section containing an audio signal containing noise, the value set in the fixed gain adjustment circuit 117A first is the value shown in the aforementioned equation (63). g _a = 2 * in _{{Rn m (0) + Σ} k = 1 k = M a k Rn m (k)} 1/2 (76) , and optimized operation is completed, the audio signal containing noise It becomes another value adapted to the noise in the section and the following equation (77)
Holds. g _a '= 2 * {KR nm _m (0) + Σ _{k = 1} ^{k = M} _ak KR nm _m (k)} ^1/2 (77) That is, the following equations (78) and (79) hold. K = {g _a '/ g _a } ² (78) K = {g _a '} ² / [4 * {Rn _m (0) + Σ _{k = 1} ^{k = M} _ak R _nm (k)}] ( 79) If the ratio of the actual number of pulses to the total number of samples in the fixed codebook 116 is Y, the following equation (80) is obtained. K = {g _a '} ² Y / {R _nm (0) + Σ _{k = 1} ^{k = M} _ak R _nm (k)} (80) In this way, the noise change rate K can be determined.

【０２１１】以上説明したようにして、音声信号に現在
含まれている雑音の量に適応して、それを除去する、適
応型音声符号化装置または方法が実現できることが明ら
かになった。As described above, it has become clear that an adaptive speech coding apparatus or method can be realized which adapts to the amount of noise currently contained in the speech signal and removes it.

【０２１２】[0212]

【発明の効果】以上のように本発明は、音声信号の自己
相関関数と、雑音の自己相関関数が夫々有している特徴
と、音声信号と雑音が重畳している信号の自己相関関数
の特徴を用いて信号処理を行い、現在通話中のフレーム
に含まれている雑音のレベルに適応して、雑音抑圧動作
を行うと共に、この雑音を含む音声信号の自己相関関数
は、音声符号化装置で、音声の符号化処理の過程で使用
されるもなので、雑音抑圧装置と音声符号化装置とを一
体化するとともに、音声符号化装置と同様の機能を有す
る雑音符号化手段を加えることにより、音声符号化装置
に過大な雑音が加わつても誤動作することのない、通話
中の雑音に適応した適応型雑音抑圧音声符号化装置を実
現することができるという効果を有する。As described above, according to the present invention, the autocorrelation function of the speech signal and the autocorrelation function of the noise have different characteristics and the autocorrelation function of the signal in which the speech signal and the noise are superimposed. The signal processing is performed using the features, the noise suppression operation is performed in accordance with the level of the noise included in the currently-talking frame, and the autocorrelation function of the voice signal including the noise is calculated by the voice coding apparatus. In addition, since it is also used in the process of speech encoding processing, by integrating the noise suppression device and the speech encoding device, and by adding noise encoding means having the same function as the speech encoding device, There is an effect that it is possible to realize an adaptive noise suppression speech coding apparatus which does not malfunction even when excessive noise is added to the speech coding apparatus and is adapted to noise during a call.

[Brief description of the drawings]

【図１】本発明の第１の実施形態の適応型雑音抑圧音声
符号化装置の機能ブロック図の左半部、FIG. 1 is a left half of a functional block diagram of an adaptive noise suppression speech coding apparatus according to a first embodiment of the present invention;

【図２】本発明の第１の実施形態の適応型雑音抑圧音声
符号化装置の機能ブロック図の右半部、FIG. 2 is a right half part of a functional block diagram of the adaptive noise suppression speech coding apparatus according to the first embodiment of the present invention;

【図３】本発明の第１の実施形態の適応型雑音抑圧音声
符号化装置の機能ブロック図（雑音区間の動作説明
図）、FIG. 3 is a functional block diagram (an operation explanatory diagram of a noise section) of the adaptive noise suppression speech coding apparatus according to the first embodiment of the present invention;

【図４】本発明の第１の実施形態の適応型雑音抑圧音声
符号化装置の機能ブロック図（雑音を含む音声区間の動
作説明図）の左半部、FIG. 4 is a left half part of a functional block diagram (an operation explanatory diagram of a speech section including noise) of the adaptive noise suppression speech coder according to the first embodiment of the present invention;

【図５】本発明の第１の実施形態の適応型雑音抑圧音声
符号化装置の機能ブロック図（雑音を含む音声区間の動
作説明図）の右半部、FIG. 5 is a functional block diagram of the adaptive noise-suppressed speech coding apparatus according to the first embodiment of the present invention (an operation explanatory diagram of a speech section including noise);

【図６】本発明の第１の実施形態の適応型雑音抑圧音声
符号化装置の簡略機能ブロック図、FIG. 6 is a simplified functional block diagram of the adaptive noise suppression speech coding apparatus according to the first embodiment of the present invention;

【図７】本発明の第１の実施形態の適応型雑音抑圧音声
符号化装置の簡略機能ブロック図（別方式）、FIG. 7 is a simplified functional block diagram (another method) of the adaptive noise suppression speech coding apparatus according to the first embodiment of the present invention;

【図８】本発明の第２の実施形態の適応型雑音抑圧音声
符号化装置の簡略機能ブロック図、FIG. 8 is a simplified functional block diagram of an adaptive noise suppression speech coder according to a second embodiment of the present invention;

【図９】音声信号の自己相関関数のグラフ、FIG. 9 is a graph of an autocorrelation function of an audio signal,

【図１０】本発明の第１の実施形態における雑音変化率
Ｋの計算根拠説明図、FIG. 10 is a diagram illustrating a calculation basis of a noise change rate K according to the first embodiment of the present invention;

【図１１】本発明の第２の実施形態の適応型雑音抑圧音
声符号化装置の機能ブロック図の左半部、FIG. 11 is a left half of a functional block diagram of an adaptive noise suppression speech coding apparatus according to a second embodiment of the present invention;

【図１２】本発明の第２の実施形態の適応型雑音抑圧音
声符号化装置の機能ブロック図の右半部、FIG. 12 is a right half of a functional block diagram of an adaptive noise suppression speech coder according to a second embodiment of the present invention;

【図１３】本発明の第２の実施形態の適応型雑音抑圧音
声符号化装置の機能ブロック図（雑音区間の動作説明
図）、FIG. 13 is a functional block diagram of an adaptive noise-suppressed speech coding apparatus according to a second embodiment of the present invention (operational explanatory diagram of a noise section);

【図１４】本発明の第２の実施形態の適応型雑音抑圧音
声符号化装置の機能ブロック図（雑音を含む音声区間の
動作説明図）の左半部、FIG. 14 is a left half part of a functional block diagram (an operation explanatory diagram of a speech section including noise) of the adaptive noise suppression speech coding apparatus according to the second embodiment of the present invention;

【図１５】本発明の第２の実施形態の適応型雑音抑圧音
声符号化装置の機能ブロック図（雑音を含む音声区間の
動作説明図）の右半部、FIG. 15 is a functional block diagram (an operation explanatory diagram of a speech section including noise) of the adaptive noise suppression speech coder according to the second embodiment of the present invention;

【図１６】本発明の第３の実施形態の適応型雑音抑圧音
声符号化装置の機能ブロック図の左半部、FIG. 16 is a left half of a functional block diagram of an adaptive noise suppression speech coder according to a third embodiment of the present invention;

【図１７】本発明の第３の実施形態の適応型雑音抑圧音
声符号化装置の機能ブロック図の右半部、FIG. 17 is a right half of a functional block diagram of an adaptive noise suppression speech coder according to a third embodiment of the present invention;

【図１８】本発明の第３の実施形態の適応型雑音抑圧音
声符号化装置の機能ブロック図（雑音を含む音声区間の
動作説明図）の左半部、FIG. 18 is a left half part of a functional block diagram (an operation explanatory diagram of a speech section including noise) of the adaptive noise suppression speech coder according to the third embodiment of the present invention;

【図１９】本発明の第３の実施形態の適応型雑音抑圧音
声符号化装置の機能ブロック図（雑音を含む音声区間の
動作説明図）の右半部、FIG. 19 is a right half part of a functional block diagram (an operation explanatory diagram of a speech section including noise) of the adaptive noise suppression speech coding apparatus according to the third embodiment of the present invention;

【図２０】ピッチ周期を考慮した、音声信号の自己相関
関数のグラフ、FIG. 20 is a graph of an autocorrelation function of an audio signal in consideration of a pitch period;

【図２１】本発明の第４の実施形態の第１の実施形態に
対応する適応型雑音抑圧音声符号化装置の機能ブロック
図（雑音を含む音声区間の動作説明図）の左半部、FIG. 21 is a functional block diagram of an adaptive noise-suppressed speech coding apparatus corresponding to the first embodiment of the fourth embodiment of the present invention (an operation explanatory diagram of a speech section including noise);

【図２２】本発明の第４の実施形態の第１の実施形態に
対応する適応型雑音抑圧音声符号化装置の機能ブロック
図（雑音を含む音声区間の動作説明図）の右半部、FIG. 22 is a right half part of a functional block diagram (operation explanatory diagram of a speech section including noise) of the adaptive noise suppression speech coding apparatus corresponding to the first embodiment of the fourth embodiment of the present invention;

【図２３】本発明の第４の実施形態の第２の実施形態に
対応する適応型雑音抑圧音声符号化装置の機能ブロック
図（雑音を含む音声区間の動作説明図）の左半部、FIG. 23 is a left half part of a functional block diagram (operation explanatory diagram of a speech section including noise) of an adaptive noise suppression speech coder corresponding to the second embodiment of the fourth embodiment of the present invention;

【図２４】本発明の第４の実施形態の第２の実施形態に
対応する適応型雑音抑圧音声符号化装置の機能ブロック
図（雑音を含む音声区間の動作説明図）の右半部、FIG. 24 is a right half part of a functional block diagram (an operation explanatory diagram of a speech section including noise) of the adaptive noise suppression speech coder corresponding to the second embodiment of the fourth embodiment of the present invention;

【図２５】従来の第１の雑音抑圧装置の構成を示すブロ
ック図、FIG. 25 is a block diagram showing a configuration of a conventional first noise suppression device.

【図２６】従来の第１の雑音抑圧装置の動作フローチャ
ート、FIG. 26 is an operation flowchart of a conventional first noise suppression device;

【図２７】従来の第２の雑音抑圧装置の構成を示すブロ
ック図、FIG. 27 is a block diagram showing a configuration of a second conventional noise suppression device.

【図２８】従来の第２の雑音抑圧装置の動作フローチャ
ート、FIG. 28 is an operation flowchart of a second conventional noise suppression device;

【図２９】従来のACELP音声符号化装置の簡易ブロック
図、FIG. 29 is a simplified block diagram of a conventional ACELP speech coding apparatus;

【図３０】従来のACELP音声符号化装置の詳細機能ブロ
ック図、FIG. 30 is a detailed functional block diagram of a conventional ACELP speech coding apparatus;

【図３１】従来の音声信号の窓掛け取り込み図、FIG. 31 is a diagram of a conventional audio signal windowing capture,

【図３２】従来のACELP音声符号化装置のコードブック
構成表である。FIG. 32 is a codebook configuration table of a conventional ACELP speech encoding device.

[Explanation of symbols]

101A 音声/雑音区間判定回路 101B 声/雑音切替SW制御回路 102A 音声/雑音切替SW 102B 音声/雑音切替SW 102C 音声/雑音切替SW 102D 音声/雑音切替SW 102E 音声/雑音切替SW 102F 音声/雑音切替SW 103 自己相関関数記憶回路 105 雑音変化率推定回路 106 自己相関関数推定回路 107 ピッチ自己相関関数記憶回路 116 固定コードブック 117A 固定ゲイン調整回路 117B インパルス応答回路 117C 誤差計算回路 117D 誤差最小化回路 117E 固定コードブック最適値発見回路 118 固定コードブックゲイン量子化回路 119 励振計算回路 120 最適フィルタ記憶回路回路 901 切り出し部 902 フーリエ分析部 903 雑音スペクトル平均算出部 904 雑音スペクトルメモリ 905 雑音信号判定部 906 振幅引算部 907 半波整流部 908 逆フーリエ変換部 909 波形再生部 1001 ステップ窓掛け 1002 ステップ FFT 分析 1003 ステップ音声区間検知 1004 ステップ雑音記憶 1005 ステップ平均スペクトル計算 1006 ステップ雑音引算部 1007 ステップ負係数ゼロ化 1008 ステップ IFFT 1009 ステップ波形合成 1101 各チャンネルS/N測定部 1102 各チャンネル利得制御部 1103 各チャンネル別可変利得増幅器 1104 ステップ各チャンネルS/N測定 1105 ステップ各チャンネル利得制御 1106 ステップ各チャンネル別可変利得増幅 1201 固定コードブック 1202 増幅器 1203 適応コードブック 1204 増幅器 1205 加算器 1206 線形予測合成回路 1207 聴覚修正回路 1301 前処理回路 1302 窓掛け回路 1303 自己相関関数計算回路 1304 予測係数算出回路 1305 係数変換回路 1306 係数量子化回路 1307 サブフレーム係数内挿回路 1308A サブフレーム係数内挿回路（量子化前） 1308B インパルス応答計算回路 1309 ４サブフレーム重みずけ音声計算回路 1310A 開ループピッチ遅れ発見回路 1310B 開ループピッチゲイン発見回路 1311A 適応コードブック 1311B ピッチゲイン調整回路 1311C インパルス応答回路 1311D 誤差計算回路 1311E 誤差最小化回路 1312 最適遅れとゲイン発見回路 1313 ピッチとゲイン量子化回路 1314 適用コードブック寄与度計算回路 1316 固定コードブック 1317A 固定ゲイン調整回路 1317B インパルス応答回路 1317C 誤差計算回路 1317D 誤差最小化回路 1317E 固定コードブック最適値発見回路 1318 固定コードブックゲイン量子化回路 1319 励振計算回路 1320 最適フィルタ記憶回路 101A voice / noise section determination circuit 101B voice / noise switch SW control circuit 102A voice / noise switch SW 102B voice / noise switch SW 102C voice / noise switch SW 102D voice / noise switch SW 102E voice / noise switch SW 102F voice / noise switch SW 103 Autocorrelation function storage circuit 105 Noise change rate estimation circuit 106 Autocorrelation function estimation circuit 107 Pitch autocorrelation function storage circuit 116 Fixed codebook 117A Fixed gain adjustment circuit 117B Impulse response circuit 117C Error calculation circuit 117D Error minimization circuit 117E Fixed Codebook optimal value discovery circuit 118 Fixed codebook gain quantization circuit 119 Excitation calculation circuit 120 Optimal filter storage circuit circuit 901 Extraction unit 902 Fourier analysis unit 903 Noise spectrum average calculation unit 904 Noise spectrum memory 905 Noise signal determination unit 906 Amplitude subtraction Section 907 Half-wave rectification section 908 Inverse Fourier transform section 909 Waveform reproduction section 1001 Step windowing 1002 step FFT analysis 1003 steps Step Voice section detection 1004 steps Noise storage 1005 steps Average spectrum calculation 1006 steps Noise subtraction unit 1007 steps Negative coefficient zeroization 1008 steps IFFT 1009 steps Waveform synthesis 1101 S / N measurement unit for each channel 1102 Gain control unit for each channel 1103 Each channel Variable gain amplifier 1104 step S / N measurement for each channel 1105 step Gain control for each channel 1106 step Variable gain amplification for each channel 1201 Fixed codebook 1202 Amplifier 1203 Adaptive codebook 1204 Amplifier 1205 Adder 1206 Linear prediction synthesis circuit 1207 Hearing correction circuit 1301 Preprocessing circuit 1302 Windowing circuit 1303 Autocorrelation function calculation circuit 1304 Prediction coefficient calculation circuit 1305 Coefficient conversion circuit 1306 Coefficient quantization circuit 1307 Subframe coefficient interpolation circuit 1308A Subframe coefficient interpolation circuit (before quantization) 1308B Impulse response Calculation circuit 1309 4 subframes without weight Voice calculation circuit 1310A Open loop pitch delay detection circuit 1310B Open loop pitch gain detection circuit 1311A Adaptive codebook 1311B Pitch gain adjustment circuit 1311C Impulse response circuit 1311D Error calculation circuit 1311E Error minimization circuit 1312 Optimal delay and gain detection circuit 1313 Pitch and gain Quantization circuit 1314 Applied codebook contribution calculation circuit 1316 Fixed codebook 1317A Fixed gain adjustment circuit 1317B Impulse response circuit 1317C Error calculation circuit 1317D Error minimization circuit 1317E Fixed codebook optimal value finding circuit 1318 Fixed codebook gain quantization circuit 1319 Excitation calculation circuit 1320 Optimal filter storage circuit

───────────────────────────────────────────────────── フロントページの続き (72)発明者飯塚捷吾神奈川県横浜市港北区綱島東四丁目３番１号松下通信工業株式会社内 (72)発明者星野一樹東京都目黒区三田一丁目４番４号恵比寿ビュータワー3102号室Ｆターム(参考） 5D045 CA01 9A001 GG01 HH15 ──────────────────────────────────────────────────続き Continuing on the front page (72) Shogo Iizuka 3-1-4 Tsunashimahigashi, Kohoku-ku, Yokohama-shi, Kanagawa Prefecture Inside Matsushita Communication Industrial Co., Ltd. (72) Kazuki Hoshino 4-1-1 Mita, Meguro-ku, Tokyo No.4 Ebisu View Tower Room 3102 F term (reference) 5D045 CA01 9A001 GG01 HH15

Claims

[Claims]

1. A means for converting an audio signal on which ambient noise is superimposed into a digital signal by an analog-to-digital converter, a means for extracting it as data of a fixed length section, an autocorrelation function R (0), R (1), means for calculating R (2),-,-, R (M), determination means for determining whether the extracted signal is a section of only noise or a voice signal section in which noise is superimposed, If the determination means determines that the section is only noise, the value of the autocorrelation function R (0), R (1), R (2),-,-, R (M) in that section is Rn
_m (0), Rn _m (1), Rn _m (2),-,-, noise autocorrelation function storage means to store as Rn _m (M), autocorrelation function R (0), R (1), R
(2) means for determining constants of the first predicting means based on the values of-,-, R (M), first impulse response calculating means for calculating an impulse response of a reciprocal of a transfer function of the first predicting means , A first fixed codebook, a first fixed code gain adjusting means,
A first convolution integrator for obtaining a convolution integral output of the output pulse train of the first fixed codebook via the first fixed code gain adjuster and the impulse response calculated by the first impulse response calculator; The first
A first error calculating means for calculating an error between the output of the convolution integrator and noise, calculating a mean square value of the error, and an output code of the first fixed book so as to minimize the error. A first control means for controlling the gain of the first fixed code gain adjustment means, a first noise state storage means for storing a constant of the first impulse response and an initial state after the minimization is achieved, and If it is determined that the means is a voice signal section in which noise is superimposed, Rn _m (0), Rn _m (1), Rn _m (2),-, stored in the noise autocorrelation function storage means -, The values of Rn _m (M) are read out and there are R (0), R (1), R (2),-,-, R (M) calculated by the autocorrelation function calculation means. Means for calculating a value K determined as a function as a time variation rate of noise;
R (0) -KRn as the autocorrelation function Rs (0), Rs (1), Rs (2),-,-, Rs (M) of only the speech signal without noise using the value of
_m (0), R (1) −KRn _m (1), R (2) −KRn _m (2), −, −, R (M) −KRn _m (M)
, Rs (0), Rs (1), Rs (2),-,-, Rs (M) to determine the constant of the second predicting means, and transmission of the second predicting means. Second to calculate the impulse response of the reciprocal of the function
Impulse response calculation means, a second fixed codebook,
Second fixed code gain adjusting means, second convolution integral for obtaining a convolution integrated output of the output pulse train of the second fixed codebook via the second fixed code gain adjusting means and the second impulse response Means, said second
Second error calculating means for calculating an error between the output of the convolution integrator and the signal from which noise has been removed from the speech signal on which noise is superimposed, and calculating the mean square value of the error and minimizing the error As described above, the second control means for controlling the output code of the second fixed book and the gain of the second fixed code gain adjustment means, after achieving the minimization, sets the constant and the initial state of the second impulse response. Adaptive noise suppression speech coding configured to include a second speech signal state storage means for storing and adapted to adapt to ambient noise during a call and reduce the influence of ambient noise to encode a speech signal. apparatus.

2. The method according to claim 1, wherein when the determining unit determines that the voice section has noise superimposed thereon, the output of the second error calculating unit is used as an input of the control unit that minimizes the first error. 2. The adaptive noise-suppressed speech coding apparatus according to claim 1, wherein said adaptive noise suppression speech coding apparatus is connected to stabilize the operation of said first error calculating means.

3. Rn stored in said first storage means.
_m (0), Rn _m (1), Rn _m (2),-,-, Rn _m (M) are read out, and those values and the voice in which the autocorrelation function calculating means is superimposed with noise R (0), R (1), R (2),-,-, R calculated as signal values
As a time variation rate value K of the noise determined as a function of (M), R (k) , Rn m (k) (k = 1,2, -, -, M) takes the respective positive or negative value, further R (k) and its sign, in view of the magnitude of _{Rn m (k), R (} k), Rn m (k) (k = 1,2, -, -, M) into a plurality of states 3. The method according to claim 1, wherein the classification is performed, and a minimum value of K is further selected from the minimum value of K calculated for each state using a different calculation formula of K for each state, and is set as the value of K. Adaptive noise suppression speech coding apparatus.

4. The noise time variation value K is represented by R (k), Rn
_m (k) (k = 1,2,-,-, M) takes a positive or negative value, respectively, and considering the magnitude relationship between the positive and negative and R (k), Rn _m (k), R
(k), Rn _m (k) (k = 1,2,-,-, M)
(k), Rn _m (k) (k = 1, 2,-,-, M) are classified into four states according to the positive / negative relationship, and further R (k), Rn _m (k) (k = 1, When (2,-,-, M) has the same sign, R (k) and Rn _m (k) are classified into two categories according to the magnitude relation, and classified into a total of six states. 4. The adaptive noise-suppressed speech coding apparatus according to claim 3, wherein a minimum value of K is further selected from the minimum value of K calculated for each state by the equation, and the selected value is used as the value of K.

5. The adaptive noise-suppressed speech coding apparatus according to claim 4, wherein R (k), Rn _m (k) (k = 1, 2,-,-, M) are classified into six states. 5. The adaptive noise-suppressed speech coding apparatus according to claim 4, wherein a calculation formula of K different in each state is K as shown in Table 1 below. [Table 1]

6. The method of claim 1 or the adaptive noise suppression speech encoding apparatus according to claim 2, wherein the first Rn _m stored in the storage means _{(0), Rn m (1} ), Rn m (2 ),-,-, Rn _m (M) are read out, and these values and R (0), R calculated by the autocorrelation function calculation means as a value of a voice signal on which noise is superimposed.
(1), R (2),-,-, R (M) As a time variation value K of noise determined as a certain function, for k = 1, 2,-,-, M, 3. The adaptive noise-suppressed speech coding apparatus according to claim 1, wherein K is calculated and the minimum value is selected as K. K = {b / a} [1− {1− (ca / b ² )} ^1/2 ] Here, a, b, and c have the following values. a = {R ² _nm (0) −R ² _nm (k)} b = {R (0) * R _nm (0) −R (k) * R _nm (k)} c = {R ² ( 0) −R ² (k)}

7. The apparatus further comprises means for calculating an autocorrelation function R (L) in a pitch period L of a voice signal, and means for storing the value, wherein Rn _m (0) read from the noise autocorrelation function storage means. ), Rn _m (1), Rn _m (2),-,-, the value of Rn _m (M), the value of the autocorrelation function R (L) of the pitch period read from the storage means, R (0), R (1), R calculated by the autocorrelation function calculation means as the autocorrelation function of the speech signal with noise superimposed
3. The adaptive noise-suppressed speech coding apparatus according to claim 1, wherein a value K determined as a certain function with a value of (2),-,-, R (M) is determined as a time variation rate of noise.

8. The adaptive noise-suppressed speech coding apparatus according to claim 7, wherein R (k), Rn
_m (k) (k = 1,2,-,-, M) takes a positive or negative value, respectively, and considering the magnitude relationship between the positive and negative and R (k), Rn _m (k), R
(k), Rn _m (k) (k = 1,2,-,-, M) are classified into multiple states,
A value determined as a function of a value obtained by further selecting the minimum K value from the minimum value of K calculated for each state using a different K calculation formula in each state, and a pitch period autocorrelation function R (L). 8. The adaptive noise-suppressed speech coding apparatus according to claim 7, wherein a smaller one of the values is a time variation rate K of noise.

9. The adaptive noise-suppressed speech coding apparatus according to claim 8, wherein R (k), Rn
_m (k) (k = 1,2,-,-, M) takes a positive or negative value, respectively, and considering the magnitude relationship between the positive and negative and R (k), Rn _m (k), R
_{(k), Rn m (k} ) (k = 1,2, -, -, M) when classifying the plurality of _{states, R (k), Rn m} (k) (k = 1,2, - , -, classified into four states by positive and negative relationship M), further _{R (k), Rn m (} k) (k = 1,2, -,
-, M) have the same sign, R (k), Rn _m (k) are classified into two states according to the magnitude relation, and classified into a total of six states. Either a value obtained by further selecting the minimum value of K from the minimum value of K calculated for each time or a value determined by the autocorrelation function R (L) of the pitch period is a smaller value as the time variation rate K of noise. An adaptive noise-suppressed speech coding apparatus according to claim 8.

10. The adaptive noise-suppressed speech coding apparatus according to claim 9, wherein R (k),
Rn _m (k) (k = 1,2,-,-, M) takes a positive or negative value, respectively.
Considering the magnitude relationship between the positive and negative and R (k), Rn _m (k), R
When (k), Rn _m (k) (k = 1,2,-,-, M) are classified into a plurality of states, they are classified into six states. It used those described in Table 1 shown in claim 5, and as the value determined by the pitch period autocorrelation function R (L), the minimum value calculated in terms of calculating the K _L by a formula shown below, adaptive noise suppression speech encoding apparatus according to claim 9, wherein either the small value was set as the time variation rate K of the noise values of K and K _L. K _L = [R (0) −R (L)] / Rn _m (0)

11. The value of the time variation rate K of the noise is a value of K determined by the expression described in claim 6, and a function of the autocorrelation function R (L) of the pitch period described in claim 7. adaptive noise reduced speech coding apparatus of any claim 6 or claim 7, wherein selecting smaller becomes the value as the value of K and the value of K _L defined as.

12. The adaptive noise suppression speech coding apparatus according to claim 11, wherein the expression defined in claim 10 is used as a value determined by an autocorrelation function R (L) of a pitch period. 12. The adaptive noise-suppressed speech coding apparatus according to claim 11.

13. When the determining means determines that the signal is a voice signal section in which noise is superimposed, the first fixed code gain adjusting means controlled by the data of the immediately preceding fixed section length. Means for calculating a value determined by using the final value of the gain as a time variation rate K of noise, and using the K, an autocorrelation function of only a noise-free speech signal;
(0), Rs (1), Rs (2),-,-, Rs (M) is R (0) −KRn _m (0), R (1) −KRn
3. The adaptive noise-suppressed speech code according to claim 1, wherein _m (1), R (2) −KRn _m (2), −, −, R (M) −KRn _m (M). Device.

14. The adaptive noise suppression speech coding apparatus according to claim 13, wherein the time variation rate K of the noise is the first fixed code gain adjustment controlled by data of a preceding fixed section length. The value of K is obtained by using the following equation, where g _a ′ is the final value of the gain of the means.
An adaptive noise-suppressed speech coding apparatus according to claim 1. K = {g _a ′} ² Y / {R _nm (0) + Σ _{k = 1} ^{k = M} _ak R _nm (k)} where Y is the actual number of pulses relative to the total number of samples in the fixed codebook. And a _k is a prediction coefficient of the linear prediction circuit.