JPH09297598A

JPH09297598A - Voice coding/decoding device

Info

Publication number: JPH09297598A
Application number: JP8113975A
Authority: JP
Inventors: Hiroyuki Ebara; 原宏幸江
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1996-05-08
Filing date: 1996-05-08
Publication date: 1997-11-18
Anticipated expiration: 2016-05-08
Also published as: JP3824706B2

Abstract

PROBLEM TO BE SOLVED: To make the same sound source vector obtainable on both the coder side and the decoder side even immediately after returning from a transmission line error and to mitigate the error of the sound source vector occurring on both the coder side and the decoder side occurring immediately after the trans mission line error. SOLUTION: A searching scope limiter 109 judges whether a transmission line error is generated in the just previous frame or subframe from a transmission line error monitoring code 110 inputted from the decoder side. When the error is judged to be generated in the just previous frame or subframe, the portion which is generated in the just previous frame in the sound source signals generated in the past and stored in an adaption code book 113 is excluded from the searching scope and subjected to searching which uses an adaption code book to output the searching scope of the book to an error minimizing means 108 so as to select the optimum code vector. When the transmission line error continues, the vector gain of the adation code book 113 is nullified or switched to another fixed code book.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、ＣＥＬＰ型の音声
符号化／復号化装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a CELP type voice encoding / decoding device.

【０００２】[0002]

【従来の技術】近年、ディジタル移動通信の需要の増加
により音声符号化の低ビットレート化が必要とされてお
り、数々の音声符号化装置が開発されている。その中
で、ＣＥＬＰ方式は、音声信号を声道情報と音源情報に
分離し、声道情報を線形予測係数から構成されるディジ
タルフィルタにより表現し、音源情報を数百〜千種類程
度の波形パターンから構成されている音源符号帳を用い
てベクトル量子化するもので、低ビットレート（４ｋｂ
／ｓ〜８ｋｂ／ｓ）においても高品質の音声を実現でき
る方式として広く用いられている。2. Description of the Related Art In recent years, the demand for digital mobile communications has increased, and it has become necessary to reduce the bit rate of voice encoding, and various voice encoding devices have been developed. Among them, in the CELP method, a voice signal is separated into vocal tract information and sound source information, the vocal tract information is expressed by a digital filter composed of linear prediction coefficients, and the sound source information is made into a waveform pattern of several hundred to several thousand types. Vector quantization using an excitation codebook composed of
/ S to 8 kb / s) is widely used as a method capable of realizing high quality voice.

【０００３】ＣＥＬＰ方式の音源は、適応符号帳と固定
符号帳（確率的符号帳と雑音符号帳）の２種類の符号帳
から選ばれる音源ベクトルから構成される。このうち、
適応符号帳は、音源信号（特に母音部）に含まれる周期
的成分を表現するもので、過去に合成した音源信号波形
を蓄えたものである。一方、固定符号帳は、音源信号か
ら周期的成分を取り除いた後のランダムな波形（音源信
号のランダム成分）を表現するために予め容易されるも
のである。固定符号帳は、乱数によって作成されたもの
や、多数の音声データを用いて学習して作成したもの、
パルス列によって構成されるものなど、多くの種類のも
のが提案され、用いられているが、ＣＥＬＰ方式の音声
符号化装置においては、過去に生成した音源信号を適応
符号帳として用いるため、伝送路誤りが生じると誤りか
ら復帰した後も、誤り時に生成した音源信号が適応符号
帳として保存されているため、誤りの影響が伝播すると
いう問題を有する。A CELP system excitation is composed of excitation vectors selected from two types of codebooks, an adaptive codebook and a fixed codebook (probabilistic codebook and random codebook). this house,
The adaptive codebook expresses the periodic component contained in the excitation signal (particularly the vowel part), and stores the excitation signal waveform synthesized in the past. On the other hand, the fixed codebook is facilitated in advance to represent a random waveform (random component of the excitation signal) after removing the periodic component from the excitation signal. The fixed codebook is created by random numbers, or is created by learning using a large number of voice data,
Many types, such as those composed of pulse trains, have been proposed and used. However, in a CELP type speech encoding device, since the excitation signal generated in the past is used as an adaptive codebook, a transmission path error is generated. When the error occurs, even after recovering from the error, since the excitation signal generated at the time of the error is stored as the adaptive codebook, there is a problem that the influence of the error propagates.

【０００４】以下にＣＥＬＰ方式に基づく従来の音声符
号化装置における適応符号帳探索部について説明する。
図５は一般的なＣＥＬＰ型音声符号化装置を示したもの
である。図５において、入力音声信号１は、前処理器２
によって波形整形された後、線形予測分析器３および加
算器４に出力される。線形予測分析器３は、前処理後の
入力音声信号を用いて線形予測分析を行い、線形予測係
数を合成フィルタ５に出力する。合成フィルタ５は、加
算器６から入力した音源信号と線形予測分析器３から入
力した線形予測係数とを用いて音声合成を行い、加算器
４に出力する。加算器４は、合成フィルタから入力した
合成信号と前処理器２から入力した前処理後の入力音声
信号との誤差を算出し、聴覚重み付け器７に出力する。
聴覚重み付け器７は、誤差信号に聴覚重み付けを行い、
誤差最小化手段８に出力する。誤差最小化手段８は、聴
覚重み付け器７から入力した聴覚重み付け誤差が最小と
なるように、固定符号ベクトル、適応符号ベクトル、固
定符号ベクトル利得、適応符号ベクトル利得を決定す
る。固定符号ベクトルは、固定符号帳９の中から選択さ
れ、固定符号ベクトル利得乗算器１０に出力される。固
定符号ベクトル利得乗算器１０は、固定符号帳９から出
力された固定符号ベクトルに固定符号ベクトル利得を乗
じて、加算器６に出力する。適応符号ベクトルは、適応
符号帳１１の中から選択され、適応符号ベクトル利得乗
算器１２に出力される。適応符号ベクトル利得乗算器１
２は、適応符号帳１１から出力された適応符号ベクトル
に適応符号ベクトル利得を乗じて、加算器６に出力す
る。加算器６は、固定符号ベクトル利得乗算器１０と適
応符号ベクトル利得乗算器１２から出力されたそれぞれ
のベクトルの加算を行い、音源ベクトルとして合成フィ
ルタ５に出力する。誤差最小となる固定符号ベクトル、
適応符号ベクトル、固定符号ベクトル利得、適応符号ベ
クトル利得の組み合わせによって加算器６によって生成
された音源ベクトルは、過去の音源信号をバッファリン
グしている適応符号帳に新しく付け加えられる。そし
て、この誤差最小となる音源ベクトルを生成する適応符
号ベクトル、固定符号ベクトル、適応符号ベクトル利
得、固定符号ベクトル利得の情報が復号器側に伝送され
る。The adaptive codebook search unit in the conventional speech coding apparatus based on the CELP method will be described below.
FIG. 5 shows a general CELP type speech coder. In FIG. 5, the input voice signal 1 is the preprocessor 2
The waveform is shaped by and then output to the linear prediction analyzer 3 and the adder 4. The linear prediction analyzer 3 performs a linear prediction analysis using the preprocessed input speech signal and outputs a linear prediction coefficient to the synthesis filter 5. The synthesis filter 5 performs speech synthesis using the sound source signal input from the adder 6 and the linear prediction coefficient input from the linear prediction analyzer 3, and outputs the speech to the adder 4. The adder 4 calculates an error between the combined signal input from the combining filter and the preprocessed input audio signal input from the preprocessor 2, and outputs the error to the perceptual weighting unit 7.
The perceptual weighter 7 perceptually weights the error signal,
It outputs to the error minimizing means 8. The error minimization means 8 determines a fixed code vector, an adaptive code vector, a fixed code vector gain, and an adaptive code vector gain so that the auditory weighting error input from the auditory weighting device 7 is minimized. The fixed code vector is selected from the fixed code book 9 and output to the fixed code vector gain multiplier 10. The fixed code vector gain multiplier 10 multiplies the fixed code vector output from the fixed code book 9 by the fixed code vector gain, and outputs the product to the adder 6. The adaptive code vector is selected from the adaptive codebook 11 and output to the adaptive code vector gain multiplier 12. Adaptive code vector gain multiplier 1
2 multiplies the adaptive code vector output from the adaptive codebook 11 by the adaptive code vector gain, and outputs the product to the adder 6. The adder 6 adds the respective vectors output from the fixed code vector gain multiplier 10 and the adaptive code vector gain multiplier 12, and outputs the addition to the synthesis filter 5 as the excitation vector. Fixed code vector with minimum error,
The excitation vector generated by the adder 6 by the combination of the adaptive code vector, the fixed code vector gain, and the adaptive code vector gain is newly added to the adaptive codebook buffering the past excitation signal. Then, information on the adaptive code vector, the fixed code vector, the adaptive code vector gain, and the fixed code vector gain that generate the excitation vector that minimizes this error is transmitted to the decoder side.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、上記従
来のＣＥＬＰ型音声符号化装置では、伝送路誤りが生じ
た場合、適応符号帳にバッファリングされている内容
が、符号器側と復号器側で異なってしまい、伝送路誤り
から復帰した後も誤りの影響を大きく受けてしまうとい
う問題を有していた。However, in the above-mentioned conventional CELP-type speech coding apparatus, when a transmission path error occurs, the contents buffered in the adaptive codebook are displayed on the encoder side and the decoder side. There is a problem in that they are different and are greatly affected by the error even after recovering from the transmission line error.

【０００６】本発明は、上記従来の問題を解決するもの
であり、伝送路誤りから復帰した直後でも符号器側と復
号器側で同一の音源ベクトルを得られるようにし、また
伝送路誤りから復帰した直後に生じる符号器側と復号器
側で生成される音源ベクトルの誤差を緩和することので
きる音声符号化／復号化装置を提供することを目的とす
る。The present invention solves the above-mentioned conventional problems, and enables an encoder side and a decoder side to obtain the same excitation vector immediately after recovering from a transmission path error, and recovering from a transmission path error. An object of the present invention is to provide a speech encoding / decoding device capable of mitigating an error between excitation vectors generated on the encoder side and the decoder side immediately after the operation.

【０００７】[0007]

【課題を解決するための手段】本発明は、上記目的を達
成するために、伝送路誤りの発生を監視するための情報
が復号器側から符号器側に送られ、伝送路誤りが発生し
たと判定される場合には、誤りの発生した次のフレーム
またサブフレームにおける適応符号帳探索の探索範囲を
制限するようにしたものである。また、連続して伝送路
誤りが発生した場合は、適応帳を用いずに、固定符号帳
のみによる符号化処理を伝送路誤りが解消されるまで続
けるようにしたものである。さらに、伝送路誤りを生じ
たときに生成された適応符号帳の使用を回避し、伝送路
誤りがないときに生成された適応符号帳を用いるように
したものである。According to the present invention, in order to achieve the above object, information for monitoring occurrence of a transmission line error is sent from a decoder side to an encoder side, and a transmission line error occurs. If it is determined that the search range of the adaptive codebook search in the next frame or subframe in which an error has occurred is limited. Further, when transmission line errors occur continuously, the coding process using only the fixed codebook is continued until the transmission line error is eliminated without using the adaptive book. Furthermore, the use of the adaptive codebook generated when a transmission line error occurs is avoided, and the adaptive codebook generated when there is no transmission line error is used.

【０００８】[0008]

【発明の実施の形態】本発明の請求項１に記載の発明
は、復号器側から伝送路誤り情報を受け取った符号器
が、直前のフレームまたはサブフレームに伝送路誤りが
生じたかを判断し、伝送路誤りが生じた場合には、直前
のフレームまたはサブフレームで生成した部分を適応符
号帳の探索範囲から除外する適応符号帳探索範囲限定手
段を備えたものであり、伝送路誤り解消直後のフレーム
またはサブフレームにおいても、符号器側と復号器側で
同一の音源ベクトルを生成することが可能となる。BEST MODE FOR CARRYING OUT THE INVENTION According to the first aspect of the present invention, an encoder which receives transmission path error information from a decoder side judges whether a transmission path error has occurred in the immediately preceding frame or subframe. When a transmission path error occurs, the adaptive codebook search range limiting means for excluding the portion generated in the immediately preceding frame or subframe from the search range of the adaptive codebook is provided. It is possible to generate the same excitation vector on the encoder side and the decoder side also in the frame or subframe.

【０００９】本発明の請求項２に記載の発明は、直前の
数フレームまたは数サブフレームに渡って伝送路誤りが
生じていると判断した場合、その連続して伝送路誤りが
生じたフレームまたはサブフレームで生成した適応符号
ベクトルの利得を零にして、音源ベクトルを固定符号帳
のみから生成するものであり、伝送路誤り解消直後のフ
レームまたはサブフレームにおいても、符号器側と復号
器側で同一の音源ベクトルを生成することが可能とな
る。According to the second aspect of the present invention, when it is determined that the transmission path error has occurred over the immediately preceding several frames or several subframes, the frames or the frames in which the transmission path error has occurred continuously or The gain of the adaptive code vector generated in the subframe is set to zero, and the excitation vector is generated only from the fixed codebook.Even in the frame or subframe immediately after the transmission path error is resolved, the encoder side and the decoder side have It is possible to generate the same sound source vector.

【００１０】本発明の請求項３に記載の発明は、直前の
数フレームまたは数サブフレームに渡って伝送路誤りが
生じていると判断した場合は、その連続して伝送路誤り
が生じたフレームまたはサブフレームで生成した適応符
号符号帳を固定符号帳に切り替えて、音源ベクトルを固
定符号帳のみから生成する手段を備えたものであり、伝
送路誤り解消直後のフレームまたはサブフレームにおい
ても、符号器側と復号器側で同一の音源ベクトルを生成
することが可能となる。According to the third aspect of the present invention, when it is determined that the transmission path error has occurred over the immediately preceding several frames or several subframes, the frames in which the transmission path error has occurred successively. Alternatively, the adaptive code codebook generated in the subframe is switched to the fixed codebook, and means for generating the excitation vector from only the fixed codebook is provided. It is possible to generate the same excitation vector on the device side and the decoder side.

【００１１】本発明の請求項４に記載の発明は、伝送路
誤りが生じた後、復号器が、正常な情報を請求項２また
は３記載の符号器側から受け取った場合、適応符号帳の
代わりに固定符号帳を用いて音声合成を行なうものであ
り、伝送路誤り解消直後のフレームまたはサブフレーム
においても、符号器側と復号器側で同一の音源ベクトル
を生成することが可能となる。According to a fourth aspect of the present invention, when a decoder receives normal information from the encoder side according to the second or third aspect after a transmission line error has occurred, the adaptive codebook Instead, the fixed codebook is used for speech synthesis, and the same excitation vector can be generated on the encoder side and the decoder side even in the frame or subframe immediately after the transmission path error is resolved.

【００１２】本発明の請求項５に記載の発明は、伝送路
誤りが生じた後、正常な情報を符号器側から受け取った
場合、復号器が、受け取ったピッチ音源が誤りフレーム
またはサブフレームで合成された励振音源情報を含むか
否かをチェックし、含まない場合は受け取った情報を用
いてそのまま適応符号帳からピッチ音源を生成し、含む
場合は受け取った情報をそのまま用いずにピッチ音源を
生成するものであり、符号器側と復号器側で得られる音
源ベクトルの誤差が大きくなることを避けることが可能
となる。According to a fifth aspect of the present invention, when normal information is received from the encoder side after a transmission path error has occurred, the decoder determines that the received pitch sound source is an error frame or subframe. It is checked whether the synthesized excitation sound source information is included, and if it is not included, the pitch information is generated from the adaptive codebook using the received information as it is, and if it is included, the pitch sound source is used without using the received information as it is. Since it is generated, it is possible to avoid a large error between excitation vectors obtained on the encoder side and the decoder side.

【００１３】（実施の形態１）以下、本発明の実施の形
態について、図面を参照しながら説明する。図１は本発
明の第１の実施の形態におけるＣＥＬＰ型音声符号化装
置の構成を示すものである。図１において、１０１は入
力音声信号、１０２は入力音声信号１０１を入力として
前処理後の入力音声信号を線形予測器１０３と加算器１
０４に出力する前処理器、１０３は前処理後の入力音声
信号を入力として線形予測分析を行い、線形予測係数を
合成フィルタ１０５に出力する線形予測分析器、１０４
は前処理後の音声信号と合成フィルタ１０５の出力信号
とを入力として差分信号を算出し、聴覚重み付け器１０
７に出力する加算器、１０５は加算器１０６から出力さ
れた音源ベクトルと線形予測分析器１０３から出力され
た線形予測係数とを入力として音声信号の合成を行なう
合成フィルタ、１０６は固定符号ベクトル利得乗算器１
１２と適応符号ベクトル利得乗算器１１４から出力され
るそれぞれのベクトルを加算して合成フィルタ１０５に
出力する加算器、１０７は加算器１０４から出力された
誤差信号を入力として聴覚的な重み付けを行い、誤差最
小化手段１０８に出力する聴覚重み付け器、１０８は聴
覚重み付け器１０７から出力された聴覚重み付けの誤差
パワーが最小となるような固定符号ベクトル、適応符号
ベクトル利得、適応符号ベクトル利得の組み合わせを、
探索範囲限定器１０９から出力された探索範囲に基づい
て決定する誤差最小化手段、１０９は伝送路誤り監視信
号を入力とし、誤差最小化手段１０８による適応符号等
の探索範囲を決定して誤差最小化手段１０８に出力する
探索範囲限定器、１１０は伝送路誤りの発生を検出する
ための伝送路誤り監視信号、１１１は固定符号ベクトル
利得乗算器１１２に出力する予め定められた数の固定符
号ベクトルを格納する固定符号帳、１１２は固定符号帳
１１１から出力された固定符号ベクトルに固定ベクトル
利得を乗じて加算器１０６に出力する固定符号ベクトル
利得乗算器、１１３は加算器１０６から出力された過去
の音源ベクトル（誤差最小化手段１０８によって最終的
に決定されたもの）のバッファからなり、バッファに格
納された信号列の一部を切り出して適応符号ベクトルと
して適応符号ベクトル利得乗算器１１４に出力する適応
符号帳、１１４は適応符号帳１１３から出力された適応
符号ベクトルに適応符号ベクトル利得を乗じて加算器１
０６に出力する適応符号ベクトル利得乗算器である。(Embodiment 1) Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 shows the configuration of a CELP type speech coding apparatus according to the first embodiment of the present invention. In FIG. 1, 101 is an input speech signal, 102 is an input speech signal 101, and the input speech signal after pre-processing is a linear predictor 103 and an adder 1.
104 is a preprocessor for outputting to 04, 103 is a linear prediction analyzer for inputting the input speech signal after preprocessing, and outputting a linear prediction coefficient to the synthesis filter 105;
Receives the pre-processed voice signal and the output signal of the synthesis filter 105 as input, calculates a difference signal, and the auditory weighting device 10
7 is an adder, 105 is a synthesizing filter for synthesizing a speech signal with the excitation vector output from the adder 106 and the linear prediction coefficient output from the linear prediction analyzer 103 as inputs, and 106 is a fixed code vector gain Multiplier 1
12 and an adder that outputs the respective vectors output from the adaptive code vector gain multiplier 114 and outputs the result to the synthesis filter 105, 107 performs auditory weighting using the error signal output from the adder 104 as an input, The perceptual weighting unit that outputs the error to the error minimization unit 108 includes a combination of a fixed code vector, an adaptive code vector gain, and an adaptive code vector gain that minimizes the perceptual weighting error power output from the perceptual weighting unit 107.
Error minimization means for deciding on the basis of the search range output from the search range limiter 109, 109 receives the transmission path error monitoring signal as an input, and the error minimization means 108 decides the search range for adaptive codes and the like to minimize the error. The search range limiter output to the conversion means 108, 110 a transmission path error monitoring signal for detecting the occurrence of a transmission path error, and 111 a fixed number of fixed code vectors output to the fixed code vector gain multiplier 112. , 112 is a fixed code vector gain multiplier that outputs the fixed code vector output from the fixed code book 111 by a fixed vector gain to the adder 106, and 113 is the past output from the adder 106. Of the sound source vector (finally determined by the error minimization means 108) of Adaptive codebook for outputting the adaptive code vector gain multiplier 114 as an adaptive code vector by cutting out part, 114 denotes an adder multiplied by the adaptive code vector gain to the adaptive code vector outputted from adaptive codebook 113 1
It is an adaptive code vector gain multiplier for outputting to 06.

【００１４】以上のように構成されたＣＥＬＰ型音声符
号化ー装置について、以下にその動作を説明する。図１
において、入力音声信号１０１は、定められたサンプル
数からなるディジタル信号であり、音声符号化処理は、
この定められたサンプル数の音声信号毎に行なわれる。
この定められたサンプル数の音声信号ブロックをフレー
ムまたはサブフレームと呼ぶ。入力音声信号１０１は、
前処理器１０２により帯域制限や利得調整が行なわれ
る。この前処理後の音声信号を用いて、線形予測分析器
１０３は、公知の線形予測分析を行い、線形予測係数を
算出する。合成フィルタ１０５は、線形予測分析器１０
３で算出された線形予測係数を用いてフィルタを構成
し、加算器６から出力されてくる音源ベクトルにフィル
タ処理を行なって音声を行なう。加算器１０４は、前処
理後の入力音声信号と合成フィルタ１０５によって合成
された音声信号との差分信号を計算する。聴覚重み付け
器１０７は、加算器１０４によって算出された差分信号
に聴覚的な重み付けを行い、誤差最小化手段１０８に出
力する。この聴覚的な重み付けは、一般的には、線形予
測分析器１０３で算出された線形予測係数と聴覚重み付
け係数を用いた線形予測フィルタを縦続接続したフィル
タを用いて行なわれる。誤差最小化手段１０８は、聴覚
重み付け後の差分信号（誤差信号）のパワーが最小とな
るように、合成フィルタ１０５に入力される音源ベクト
ルを、固定符号ベクトルと固定符号ベクトル利得と適応
符号ベクトルと適応符号ベクトル利得の組み合わせを変
えることによって調整する。一般的には、初めに適応符
号帳１１３から最適な適応符号ベクトルを取り出して、
乗算器１１４で適応符号ベクトル利得と乗算して加算器
１０６への出力を決定し、続いて固定符号帳１１１の中
から適応符号ベクトルと組み合わせた時に最適となる固
定符号ベクトルを取り出して、乗算器１１２で固定符号
ベクトル利得と乗算して加算器１０６への出力を決定す
る。探索範囲限定器１０９は、適応符号帳１１３の中か
ら最適な適応符号ベクトルを取り出すときに、適応符号
帳１１３の探索範囲を限定するものである。探索範囲限
定器１０９は、探索範囲限定器１０９に入力される伝送
路誤り監視信号１１０から、直前のフレームまたはサブ
フレームに伝送路誤りが生じたかを判定する。そして、
直前のフレームまたはサブフレームで伝送路誤りが生じ
たと判定した場合には、適応符号帳１１３に格納されて
いる過去に生成した音源信号のうち、直前のフレームで
生成した部分を探索範囲から外して適応符号帳探索を行
い、最適な符号ベクトルを選択するように、適応符号帳
１１３の探索範囲を誤差最小化手段１０８に出力する。
連続して直前のフレームまたはサブフレームに伝送路誤
りが生じたと判定されている場合は、適応符号帳１１３
に格納されている過去に生成した音源信号のうち、連続
した直前のフレームで生成した部分を探索範囲から外し
て適応符号帳探索を行なうように適応符号帳探索範囲を
決定し、誤差最小化手段１０８に出力する。しかしなが
ら、伝送路誤りの連続が長時間に渡ることによって、適
応符号帳１１３に格納されている音源符号帳の全てが探
索範囲から除外されてしまうような場合は、適応符号ベ
クトル利得を零にして、音源ベクトルを固定符号ベクト
ルのみから生成するように、誤差最小化手段１０８の探
索範囲を決定する。The operation of the CELP type speech coding apparatus configured as described above will be described below. FIG.
In, the input speech signal 101 is a digital signal consisting of a predetermined number of samples, and the speech coding process is
This is performed for each audio signal of the predetermined number of samples.
The audio signal block having the predetermined number of samples is called a frame or subframe. The input audio signal 101 is
Bandwidth limitation and gain adjustment are performed by the preprocessor 102. The linear prediction analyzer 103 performs publicly known linear prediction analysis using the audio signal after the pre-processing, and calculates a linear prediction coefficient. The synthesis filter 105 includes the linear prediction analyzer 10
A filter is formed by using the linear prediction coefficient calculated in step 3, and the sound source vector output from the adder 6 is filtered to give a voice. The adder 104 calculates a difference signal between the preprocessed input audio signal and the audio signal synthesized by the synthesis filter 105. The perceptual weighter 107 perceptually weights the difference signal calculated by the adder 104, and outputs the weighted difference signal to the error minimizing means 108. This perceptual weighting is generally performed using a filter in which a linear prediction filter calculated by the linear prediction analyzer 103 and a linear prediction filter using the perceptual weighting coefficient are connected in series. The error minimization means 108 converts the excitation vector input to the synthesis filter 105 into a fixed code vector, a fixed code vector gain, and an adaptive code vector so that the power of the difference signal (error signal) after the auditory weighting is minimized. Adjust by changing the combination of adaptive code vector gains. Generally, first, the optimum adaptive code vector is extracted from the adaptive codebook 113,
The multiplier 114 multiplies the adaptive code vector gain to determine the output to the adder 106, and then extracts the optimum fixed code vector when combined with the adaptive code vector from the fixed code book 111, and the multiplier At 112, the fixed code vector gain is multiplied to determine the output to the adder 106. The search range limiter 109 limits the search range of the adaptive codebook 113 when extracting the optimum adaptive code vector from the adaptive codebook 113. The search range limiter 109 determines from the transmission line error monitor signal 110 input to the search range limiter 109 whether a transmission line error has occurred in the immediately preceding frame or subframe. And
When it is determined that the transmission path error has occurred in the immediately preceding frame or subframe, the part generated in the immediately preceding frame among the excitation signals generated in the past stored in the adaptive codebook 113 is removed from the search range. The adaptive codebook search is performed, and the search range of the adaptive codebook 113 is output to the error minimizing means 108 so as to select the optimum code vector.
If it is determined that a transmission path error has occurred consecutively in the immediately preceding frame or subframe, the adaptive codebook 113
The adaptive codebook search range is determined so as to perform the adaptive codebook search by removing the part generated in the immediately preceding continuous frame from the excitation signal stored in the past stored in the To 108. However, if all of the excitation codebooks stored in adaptive codebook 113 are excluded from the search range due to a long series of transmission path errors, the adaptive code vector gain is set to zero. The search range of the error minimizing means 108 is determined so that the excitation vector is generated only from the fixed code vector.

【００１５】音声符号化装置を以上のように構成した場
合、復号化装置には、符号化装置に伝送路誤り監視信号
１１０を伝送する手段を付加する必要があるが、符号化
装置における復号処理は従来のものと全く同じものにな
るため、従来のものをそのまま用いることが可能であ
る。なお、伝送路誤り監視信号１１０としては、予め定
められた信号を一定時間間隔（１フレーム分の符号化パ
ラメータを伝送する時間間隔より短い）で送信するもの
などが考えられ、この場合、探索範囲限定器２０９で
は、予め定められた信号と異なる信号を受け取った場合
に、その時送信したフレームの符号化情報に伝送路誤り
が発生したと判断する。When the speech coding apparatus is configured as described above, it is necessary to add means for transmitting the transmission path error monitoring signal 110 to the decoding apparatus, but the decoding processing in the coding apparatus is required. Is the same as the conventional one, the conventional one can be used as it is. Note that the transmission path error monitoring signal 110 may be a signal that is transmitted at a predetermined time interval (shorter than the time interval for transmitting the coding parameter for one frame), or the like. In this case, the search range is When the limiter 209 receives a signal different from the predetermined signal, it determines that a transmission path error has occurred in the encoded information of the frame transmitted at that time.

【００１６】このように、上記第１の実施の形態によれ
ば、復号器側から伝送路誤り情報を受け取った符号器
が、直前のフレームまたはサブフレームに伝送路誤りが
生じたかを判断し、伝送路誤りが生じた場合には、直前
のフレームまたはサブフレームで生成した部分を適応符
号帳の探索範囲から除外する探索範囲限定器１０９を備
えたものであり、伝送路誤り解消直後のフレームまたは
サブフレームにおいても、符号器側と復号器側で同一の
音源ベクトルを生成することが可能となる。As described above, according to the first embodiment, the encoder that receives the transmission path error information from the decoder side determines whether or not a transmission path error has occurred in the immediately preceding frame or subframe, When a transmission path error occurs, a search range limiter 109 that excludes the portion generated in the immediately preceding frame or subframe from the search range of the adaptive codebook is provided. Also in the subframe, it is possible to generate the same excitation vector on the encoder side and the decoder side.

【００１７】（実施の形態２）次に、本発明の第２の実
施の形態について図２を参照しながら説明する。図２に
おいて、２０１は入力音声信号、２０２は入力音声信号
２０１を入力として前処理後の入力音声信号を線形予測
分析器２０３と加算器２０４に出力する前処理器、２０
３は前処理後の入力音声信号を入力として線形予測分析
を行い、線形予測係数を合成フィルタ２０５に出力する
線形予測分析器、２０４は前処理後の音声信号と合成フ
ィルタ２０５の出力信号とを入力として差分信号を算出
し、聴覚重み付け器２０７に出力する加算器、２０５は
加算器２０６から出力された音源ベクトルと線形予測分
析器２０３から出力された線形予測係数とを入力として
音声信号の合成を行なう合成フィルタ、２０６は固定符
号ベクトル利得乗算器２１２と適応符号ベクトル利得乗
算器２１６から出力されるそれぞれのベクトルを加算し
て合成フィルタ２０５に出力する加算器、２０７は加算
器２０４から出力された誤差信号を入力として聴覚的な
重み付けを行い、誤差最小化手段２０８に出力する聴覚
重み付け器、２０８は聴覚重み付け器２０７から出力さ
れた聴覚重み付け後の誤差パワーが最小となるような固
定符号ベクトル、適応符号ベクトル、固定符号ベクトル
利得、適応符号ベクトル利得の組み合わせを、探索範囲
限定器２０９から出力された探索範囲に基づいて決定す
る誤差最小化手段、２０９は伝送路誤り監視信号２１０
を入力とし、誤差最小化手段２０８による適応符号帳２
１４の探索範囲を決定して誤差最小化手段２０８および
符号帳選択器２１５に出力する探索範囲限定器、２１０
は伝送路誤りの発生を検出するための伝送路誤り監視信
号、２１１は固定符号ベクトルを固定符号ベクトル利得
乗算器２１２に出力する予め定められた数の固定符号ベ
クトルを格納する固定符号帳、２１２は固定符号帳２１
１から出力された固定符号ベクトルに固定符号ベクトル
利得を乗じて加算器２０６に出力する固定符号ベクトル
利得乗算器、２１３は固定符号ベクトルを符号帳選択器
２１５に出力する予め定められた数の固定符号ベクトル
を格納する固定符号帳、２１４は加算器２０６から出力
された過去の音源ベクトル（誤差最小化手段２０８によ
って最終的に決定されたもの）のバッファからなり、バ
ッファに格納された信号列の一部を切り出して適応符号
ベクトルとして符号帳選択器２１５に出力する適応符号
帳、２１５は探索範囲限定器２０９から探索範囲情報を
入力し、固定符号帳２１３と適応符号帳２１４からそれ
ぞれ入力したベクトルのうち一方のみを選択して符号ベ
クトル利得乗算器２１６へ出力する符号帳選択器、２１
６は符号帳選択器２１５から出力された符号ベクトルに
符号ベクトル利得を乗算して加算器２０６に出力する符
号ベクトル利得乗算器である。(Second Embodiment) Next, a second embodiment of the present invention will be described with reference to FIG. In FIG. 2, 201 is an input speech signal, 202 is a preprocessor for inputting the input speech signal 201, and outputting the preprocessed input speech signal to a linear prediction analyzer 203 and an adder 204, 20
Reference numeral 3 denotes a linear prediction analyzer that performs linear prediction analysis using the preprocessed input speech signal as an input and outputs linear prediction coefficients to the synthesis filter 205. Reference numeral 204 denotes the preprocessed speech signal and the output signal of the synthesis filter 205. An adder that calculates a difference signal as an input and outputs the difference signal to the perceptual weighting unit 207, and 205 synthesizes a speech signal with the sound source vector output from the adder 206 and the linear prediction coefficient output from the linear prediction analyzer 203 as inputs. 207 is an output from the adder 204. 207 is an adder that adds the vectors output from the fixed code vector gain multiplier 212 and the adaptive code vector gain multiplier 216 and outputs the added vector to the synthesis filter 205. The perceptual weighting device 20 which receives the error signal as an input, performs perceptual weighting, and outputs it to the error minimizing means 208, Is a combination of a fixed code vector, an adaptive code vector, a fixed code vector gain, and an adaptive code vector gain output from the perceptual weighting unit 207 that minimizes the error power after the perceptual weighting is output from the search range limiter 209. Error minimization means 209 for determining the transmission path error monitoring signal 210
As an input, the adaptive codebook 2 by the error minimizing means 208
A search range limiter 210 which determines the search ranges of 14 and outputs them to the error minimization means 208 and the codebook selector 215.
Is a transmission line error monitoring signal for detecting the occurrence of a transmission line error, 211 is a fixed codebook for storing a fixed number of fixed code vectors for outputting a fixed code vector to the fixed code vector gain multiplier 212, 212 Is the fixed codebook 21
The fixed code vector gain multiplier 213 that multiplies the fixed code vector output from 1 by the fixed code vector gain and outputs the fixed code vector to the adder 206. The fixed code vector gain multiplier 213 outputs the fixed code vector to the codebook selector 215. A fixed codebook for storing code vectors, and 214 is a buffer of past excitation vectors (finally determined by the error minimization means 208) output from the adder 206, and stores the signal sequence stored in the buffer. The adaptive codebook 215, which cuts out a part and outputs it as an adaptive code vector to the codebook selector 215, inputs search range information from the search range limiter 209 and inputs vectors from the fixed codebook 213 and the adaptive codebook 214, respectively. A codebook selector that selects only one of them and outputs it to the code vector gain multiplier 216;
A code vector gain multiplier 6 multiplies the code vector output from the code book selector 215 by a code vector gain and outputs the result to the adder 206.

【００１８】以上のように構成されたＣＥＬＰ型音声符
号化装置について、以下にその動作を説明する。図２に
おいて、入力音声信号２０１は、定められたサンプル数
からなるディジタル信号であり、音声符号化処理は、こ
の定められたサンプル数の音声信号毎に行なわれる。こ
の定められたサンプル数の音声信号ブロックをフレーム
またはサブフレームと呼ぶ。入力音声信号２０１は、前
処理器２０２により帯域制限や利得調整が行なわれる。
この前処理後の音声信号を用いて、線形予測分析器２０
３は、公知の線形予測分析を行い、線形予測係数を算出
する。合成フィルタ２０５は、線形予測分析器２０３で
算出された線形予測係数を用いてフィルタを構成し、加
算器２０６から出力されてくる音源ベクトルにフィルタ
処理を行なって音声合成を行なう。加算器２０４は、前
処理後の入力音声信号と合成フィルタ２０５によって合
成された音声信号との差分信号を計算する。聴覚重み付
け器２０７は、加算器２０４によって算出された差分信
号に聴覚的な重み付けを行ない、誤差最小化手段２０８
に出力する。この聴覚的な重み付けは、一般的には、線
形予測分析器２０３で算出された線形予測係数と聴覚重
み付け係数を用いた線形予測フィルタを縦続接続したフ
ィルタを用いて行なわれる。誤差最小化手段２０８は、
聴覚重み付けの後の差分信号（誤差信号）のパワーが最
小となるように、合成フィルタ２０５に入力される音源
ベクトルを、固定符号ベクトルと固定符号ベクトル利得
と適応符号ベクトルと適応符号ベクトル利得の組み合わ
せを変えることによって調整する。一般的には、初めに
適応符号帳２１４から最適な適応符号ベクトルを取り出
して、乗算器２１６で適応符号ベクトル利得と乗算して
加算器２０６への出力を決定し、続いて固定符号帳２１
１の中から適応ベクトルと組合わせた時に最適となる固
定符号ベクトルを取り出して、乗算器２１２で固定符号
ベクトル利得と乗算して加算器２０６への出力を決定す
る。探索範囲限定器２０９は、適応符号帳２１４の中か
ら最適な適応符号ベクトルを取り出すときに、適応符号
帳２１４の探索範囲を限定するものである。探索範囲限
定器２０９は、探索範囲限定器２０９に入力される伝送
路誤り監視信号２１０から直前のフレームまたはサブフ
レームに伝送路誤りが生じたかを判定する。そして、直
前のフレームまたはサブフレームで伝送路誤りが生じた
と判定した場合には、適応符号帳２１４に格納されてい
る過去に生成した音源信号のうち、直前のフレームで生
成した部分を探索範囲から外して適応符号帳探索を行な
い、最適な符号ベクトルを選択するように、適応符号帳
２１４の探索範囲を誤差最小化手段２０８および符号帳
選択器２１５に出力する。連続して直前のフレームまた
はサブフレームに伝送路誤りが生じたと判定した場合に
は、適応符号帳２１４に格納されている過去に生成した
音源信号のうち、連続した直前のフレームで生成した部
分を探索範囲から外して適応符号帳探索を行なうように
適応符号帳探索範囲を決定し、誤差最小化手段２０８に
出力する。しかしながら、伝送誤りの連続が長時間に渡
ることによって適応符号帳２１４に格納されている音源
符号帳の全てが探索範囲から除外されてしまう場合は、
適応符号帳２１４を用いずに固定符号帳２１３を用いて
音源ベクトルを生成するように、符号帳選択器２１５と
誤差最小化手段２０８に探索範囲を出力する。符号帳選
択器２１５は、入力された探索範囲が固定符号帳探索を
示す内容となっている場合には、固定符号帳２１３から
の入力される符号ベクトルを符号ベクトル利得乗算器２
１６に出力する。The operation of the CELP type speech coder constructed as above will be described below. In FIG. 2, the input voice signal 201 is a digital signal having a predetermined number of samples, and the voice encoding process is performed for each voice signal having the predetermined number of samples. The audio signal block having the predetermined number of samples is called a frame or subframe. The input audio signal 201 is band-limited and gain-adjusted by the preprocessor 202.
Using the speech signal after this pre-processing, the linear prediction analyzer 20
3 performs a known linear prediction analysis to calculate a linear prediction coefficient. The synthesis filter 205 forms a filter by using the linear prediction coefficient calculated by the linear prediction analyzer 203, and filters the sound source vector output from the adder 206 to perform speech synthesis. The adder 204 calculates a differential signal between the preprocessed input audio signal and the audio signal synthesized by the synthesis filter 205. The perceptual weighting unit 207 perceptually weights the difference signal calculated by the adder 204, and the error minimizing means 208 is provided.
Output to This perceptual weighting is generally performed using a cascaded filter of linear prediction filters using the linear prediction coefficient calculated by the linear prediction analyzer 203 and the perceptual weighting coefficient. The error minimizing means 208 is
The excitation vector input to the synthesis filter 205 is a combination of a fixed code vector, a fixed code vector gain, an adaptive code vector, and an adaptive code vector gain so that the power of the difference signal (error signal) after perceptual weighting is minimized. Adjust by changing. In general, first, the optimum adaptive code vector is extracted from the adaptive code book 214, multiplied by the adaptive code vector gain in the multiplier 216 to determine the output to the adder 206, and then the fixed code book 21.
A fixed code vector that is optimal when combined with an adaptive vector is taken out of 1 and multiplied by the fixed code vector gain in the multiplier 212 to determine the output to the adder 206. The search range limiter 209 limits the search range of the adaptive codebook 214 when extracting the optimum adaptive code vector from the adaptive codebook 214. The search range limiter 209 determines from the transmission line error monitor signal 210 input to the search range limiter 209 whether a transmission line error has occurred in the immediately preceding frame or subframe. Then, when it is determined that a transmission path error has occurred in the immediately preceding frame or subframe, of the excitation signals generated in the past stored in adaptive codebook 214, the portion generated in the immediately preceding frame is extracted from the search range. The adaptive codebook search is performed by removing the search range, and the search range of the adaptive codebook 214 is output to the error minimizing means 208 and the codebook selector 215 so that the optimum code vector is selected. When it is determined that a transmission path error has occurred in the immediately preceding frame or subframe in succession, the part generated in the immediately preceding continuous frame in the excitation signal generated in the past stored in the adaptive codebook 214 is detected. The adaptive codebook search range is determined so as to perform the adaptive codebook search outside the search range, and output to the error minimizing means 208. However, when all the excitation codebooks stored in the adaptive codebook 214 are excluded from the search range due to the continuous transmission error for a long time,
The search range is output to the codebook selector 215 and the error minimizing means 208 so that the fixed codebook 213 is used instead of the adaptive codebook 214 to generate the excitation vector. When the input search range has a content indicating a fixed codebook search, the codebook selector 215 calculates the code vector input from the fixed codebook 213 by using the code vector gain multiplier 2
Output to 16.

【００１９】音声符号化装置を以上のように構成した場
合、復号化装置には、適応符号帳と固定符号帳のどちら
か一方を選択する手段が必要となる。簡単な方法として
は、どちらの符号帳を用いているのかを示す情報を符号
化装置側で付加して復号化装置側へ伝送すればよい。こ
のためにビットを割くことが不可能な場合には、伝送路
誤り監視手段を付加して、過去に連続した伝送路誤りが
発生していた場合に、固定符号帳と適応符号帳の切り替
えを行なう必要がある。When the speech coding apparatus is configured as described above, the decoding apparatus needs a means for selecting either the adaptive codebook or the fixed codebook. As a simple method, information indicating which codebook is used may be added on the encoding device side and transmitted to the decoding device side. For this reason, if it is impossible to allocate bits, a transmission line error monitoring means is added to switch between the fixed codebook and the adaptive codebook when continuous transmission line errors have occurred in the past. I need to do it.

【００２０】なお、伝送路誤り監視信号２１０として
は、予め定められた信号を一定時間間隔（１フレーム分
の符号化パラメータを伝送する時間間隔より短い）で送
信するものなどが考えられ、この場合、探索範囲限定器
２０９では、予め定められた信号と異なる信号を受け取
った場合に、その時送信したフレームの符号化情報に伝
送路誤りが発生したと判断する。As the transmission path error monitoring signal 210, it is conceivable that a predetermined signal is transmitted at a constant time interval (shorter than the time interval for transmitting the coding parameter for one frame). In this case, When a signal different from a predetermined signal is received, search range limiter 209 determines that a transmission path error has occurred in the coded information of the frame transmitted at that time.

【００２１】このように、上記第２の実施の形態によれ
ば、誤りフレーム後の正常フレームにおいて、符号化装
置の音源ベクトルと復号化装置の音源ベクトルとの間に
歪みを生じることなく、同一の音源ベクトルが得られる
ようにすることができる。また、適応符号帳をキャンセ
ルして、適応符号ベクトルに割り当てられた情報量を使
用しない上記第１の実施の形態よりも音質を向上させる
ことができる。As described above, according to the second embodiment, in the normal frame after the error frame, the excitation vector of the encoding device and the excitation vector of the decoding device are the same without distortion. Can be obtained. Further, the adaptive codebook can be canceled to improve the sound quality as compared with the first embodiment in which the information amount assigned to the adaptive code vector is not used.

【００２２】（実施の形態３）次に、本発明の第３の実
施の形態における音声復号化装置について説明する。図
３は音声復号化装置の適応符号帳に格納されている音源
波形を示したものであり、３０１は適応符号帳に格納さ
れている音源波形、３０２は伝送路誤りによって正しく
復号されなかったフレームで生成された音源波形の部
分、Ｐｉは伝送路誤りのあったフレームの直後に符号化
装置から伝送された正常フレームのラグ値、Ｖｐｉはラ
グ値Ｐｉに基づいて適応符号帳から切り出された適応符
号ベクトルの区間、ＮＶｐｉはこれから音源波形を生成
する区間（現在のフレームまたはサブフレーム）を示し
ている。(Embodiment 3) Next, a speech decoding apparatus according to a third embodiment of the present invention will be described. FIG. 3 shows excitation waveforms stored in the adaptive codebook of the speech decoding apparatus, 301 is an excitation waveform stored in the adaptive codebook, and 302 is a frame that was not correctly decoded due to a transmission path error. , Pi is the lag value of the normal frame transmitted from the encoder immediately after the frame with the transmission path error, and Vpi is the adaptation cut out from the adaptive codebook based on the lag value Pi. The code vector section, NVpi, indicates the section (current frame or subframe) in which the excitation waveform is to be generated.

【００２３】図３において、符号化装置から伝送された
ラグ値Ｐｉによって表される適応符号ベクトル（区間Ｖ
ｐｉ）は、伝送路誤りによって正しく復号されなかった
フレームの音源ベクトルを含んでしまうため、波形歪み
が大きくなる。そこで、このように直前のフレームに伝
送路誤りなどがあった場合には、符号化装置から伝送さ
れたラグ値Ｐｉの整数倍のピッチ（ｎＰｉ）を用いて適
応符号ベクトルを生成する。このときの整数ｎは、誤っ
た情報によって生成された音源部分３０２を含まないた
めに必要な整数の最小値であり、図３においては、ｎ＝
３となり、Ｖｐ３が適応符号ベクトルとして切り出され
る。In FIG. 3, the adaptive code vector (interval V) represented by the lag value Pi transmitted from the encoder is shown.
Since pi) includes the excitation vector of the frame that was not correctly decoded due to the transmission path error, the waveform distortion becomes large. Therefore, if there is a transmission path error in the immediately preceding frame, an adaptive code vector is generated using a pitch (nPi) that is an integral multiple of the lag value Pi transmitted from the encoding device. The integer n at this time is the minimum value of the integers necessary for not including the sound source portion 302 generated by incorrect information, and in FIG. 3, n =
3, and Vp3 is cut out as an adaptive code vector.

【００２４】また、図４は図３の直後のフレームまたは
サブフレームにおける復号化装置の適応符号帳の音源波
形を示している。このとき、符号化装置から伝送された
ラグ値Ｐｉ＋１に基づいて切り出される適応符号ベクト
ルＶｐｉ＋１は、まだ誤りフレームにおいて生成した音
源波形を含んでいる。これを避けるためには、ｎＰｉ＋
１のｎ＝４として、Ｖｐ４を適応符号ベクトルとして用
いればよい。ただし、図４に示すような場合には、Ｖｐ
ｉ＋１に含まれる誤りフレームにおいて生成した音源波
形を含む割合が低いため、Ｖｐ４を用いずにＶｐｉ＋１
を用いても良いが、その場合はＶｐｉ＋１に含まれる誤
りフレームにおいて生成した音源波形を含む割合による
場合分けを行なう必要がある。FIG. 4 shows the excitation waveform of the adaptive codebook of the decoding device in the frame or subframe immediately after FIG. At this time, the adaptive code vector Vpi + 1 cut out based on the lag value Pi + 1 transmitted from the coding apparatus still includes the excitation waveform generated in the error frame. To avoid this, nPi +
Vp4 may be used as the adaptive code vector when n = 4 of 1. However, in the case shown in FIG. 4, Vp
Since the ratio of the source waveform generated in the error frame included in i + 1 is low, Vpi + 1 is used without using Vp4.
May be used, but in that case, it is necessary to perform case classification based on the ratio including the sound source waveform generated in the error frame included in Vpi + 1.

【００２５】なお、このような整数倍ピッチを用いる手
法が有効となるのは、ピッチ周期がはっきりした有声部
においてであり、符号化装置から伝送された適応符号ベ
クトル利得が１．０に近い値の時、または誤りフレーム
より前の数フレームにおけるラグ値の変化が小さく、誤
りフレーム直後の正常フレームにおけるラグ値と等しい
かほぼ等しい場合である。また、過去の誤り発生時に生
成した部分を避けて適応符号ベクトルを適応符号帳から
切り出すため、復号化装置の適応符号帳に格納される音
源波形は、符号化装置の適応符号帳よりも長時間格納す
る必要がある。It is to be noted that such a method using an integer multiple pitch is effective in a voiced part where the pitch period is clear, and the adaptive code vector gain transmitted from the encoder is close to 1.0. Or the change in the lag value in several frames before the error frame is small and equal to or almost equal to the lag value in the normal frame immediately after the error frame. Further, since the adaptive code vector is cut out from the adaptive codebook while avoiding the part generated when an error has occurred in the past, the excitation waveform stored in the adaptive codebook of the decoding device has a longer time than the adaptive codebook of the coding device. Must be stored.

【００２６】このように、上記第３の実施の形態によれ
ば、適応符号ベクトルが有効に働く部分において、誤り
フレーム後の正常フレームにおける適応符号ベクトルの
歪みを抑えることが可能となる。As described above, according to the third embodiment, the distortion of the adaptive code vector in the normal frame after the error frame can be suppressed in the portion where the adaptive code vector works effectively.

【００２７】[0027]

【発明の効果】以上のように、本発明は、ＣＥＬＰ型音
声符号化／復号化装置において、伝送路誤りから復帰し
た直後でも、符号器側と復号器側で同一の音源ベクトル
が得られ、また、伝送路誤りから復帰した直後に生じる
符号器側と復号器側で生成される音源ベクトルの誤差を
緩和することができる優れた音声符号化／復号化装置を
実現できるものである。As described above, according to the present invention, in the CELP type speech encoding / decoding device, the same excitation vector can be obtained on the encoder side and the decoder side even immediately after recovering from the transmission path error. Further, it is possible to realize an excellent speech coding / decoding device capable of mitigating the error between the excitation vector generated on the encoder side and the error generated on the decoder side immediately after the recovery from the transmission path error.

[Brief description of drawings]

【図１】本発明の第１の実施の形態における音声符号化
装置の構成を示すブロック図FIG. 1 is a block diagram showing a configuration of a speech encoding apparatus according to a first embodiment of the present invention.

【図２】本発明の第２の実施の形態における音声符号化
装置の構成を示すブロック図FIG. 2 is a block diagram showing a configuration of a speech encoding apparatus according to a second embodiment of the present invention.

【図３】本発明の第３の実施の形態における音声符号化
装置の適応符号帳の模式図FIG. 3 is a schematic diagram of an adaptive codebook of a speech encoding apparatus according to a third embodiment of the present invention.

【図４】本発明の第３の実施の形態における音声符号化
装置の適応符号帳の模式図FIG. 4 is a schematic diagram of an adaptive codebook of a speech encoding apparatus according to a third embodiment of the present invention.

【図５】一般的なＣＥＬＰ音声符号化装置の構成を示す
ブロック図FIG. 5 is a block diagram showing the configuration of a general CELP speech coding apparatus.

[Explanation of symbols]

１０４加算器１０６加算器１１２固定符号ベクトル利得乗算器１１４適応符号ベクトル利得乗算器２０４加算器２０６加算器２１２固定符号ベクトル利得乗算器２１６符号ベクトル利得乗算器３０１適応符号等音源波形３０２誤り発生フレームにおいて生成された適応符号
帳音源波形区間104 adder 106 adder 112 fixed code vector gain multiplier 114 adaptive code vector gain multiplier 204 adder 206 adder 212 fixed code vector gain multiplier 216 code vector gain multiplier 301 adaptive code equal excitation waveform 302 error in frame Generated adaptive codebook excitation waveform section

Claims

[Claims]

1. An encoder that receives transmission path error information from a decoder example determines whether a transmission path error has occurred in the immediately preceding frame or subframe, and if a transmission path error has occurred, the immediately preceding frame is detected. Alternatively, a speech coding / decoding device including adaptive codebook search range limiting means for excluding a portion generated in a subframe from the search range of the adaptive codebook.

2. When it is determined that a transmission path error has occurred over the immediately preceding several frames or several subframes, the gain of the adaptive code vector generated in the frame or subframe in which the transmission path error has occurred continuously. 2. The speech coding / decoding apparatus according to claim 1, wherein the excitation vector is generated only from the fixed codebook by setting 0 to zero.

3. When it is determined that a transmission path error has occurred over the immediately preceding several frames or several subframes, the adaptive codebook generated by the frames or subframes in which the transmission path error has occurred continuously. 2. The speech coding / decoding apparatus according to claim 1, further comprising means for switching from the fixed codebook to a fixed codebook and generating an excitation vector from the fixed codebook only.

4. When a decoder receives normal information from a coder according to claim 2 or 3 after a channel error has occurred, a fixed codebook is used instead of an adaptive codebook for speech synthesis. A voice encoding / decoding device for performing.

5. When normal information is received from the encoder side after a transmission path error has occurred, the decoder determines whether the received pitch excitation includes excitation excitation information combined in an error frame or subframe. A speech coding / decoding device that checks whether or not, and if not included, generates a pitch excitation from the adaptive codebook using the received information as it is, and if it does, generates a pitch excitation without using the received information as it is.