JP2001228888A

JP2001228888A - Speech-encoding device, speech decoding device and code word-arraying method

Info

Publication number: JP2001228888A
Application number: JP2000040127A
Authority: JP
Inventors: Hirohisa Tazaki; 裕久田崎
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2000-02-17
Filing date: 2000-02-17
Publication date: 2001-08-24
Anticipated expiration: 2020-02-17
Also published as: JP3808270B2

Abstract

PROBLEM TO BE SOLVED: To solve the problem where the reproduction quality of input speeches is deteriorated drastically when a speech decoding device makes erroneous recognition mode information by superposition of a transmission error on a speech code, while a distortion minimization section 7 of a speech encoding device selects optimum code information to minimize the power of an auditory sense weighting difference signal. SOLUTION: The storage sequence of the gain code words of gain code books 52 and 66 (or 57 and 71) is permuted, in correspondence with the sequence of the evaluation values related to the gain code words of other gain code books 57 and 71 (or 52 and 66).

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、ディジタル音声
信号の情報量を圧縮する音声符号化装置、その音声符号
化装置などにより生成された音声符号を復号化してディ
ジタル音声信号を再生する音声復号化装置、その音声符
号化装置や音声復号化装置により使用されるベクトル符
号帳中の符号語の格納順序を更新して、音声符号に重畳
するビット誤りへの耐性を改善する符号語配列方法に関
するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an audio encoding apparatus for compressing the information amount of a digital audio signal, and an audio decoding apparatus for decoding an audio code generated by the audio encoding apparatus and reproducing the digital audio signal. Codeword arrangement method for updating the storage order of codewords in a vector codebook used by a device, its speech encoding device or speech decoding device and improving resistance to bit errors superimposed on speech code It is.

【０００２】[0002]

【従来の技術】従来の音声符号化装置の多くは、入力音
声をスペクトル包絡情報と音源に分けて、フレーム単位
で各々を符号化して音声符号を生成する構成を採用して
いる。一方、従来の音声復号化装置は、その音声符号を
復号化して、合成フィルタによってスペクトル包絡情報
と音源を合成することにより、復号音声を生成する構成
を採用している。また、様々な様態を有する音声信号と
背景雑音信号の両方の品質を高めるため、複数の符号化
モードを用意して、符号化モードを切り換えながら符号
化を行う方式（マルチモード符号化方式）を採用するも
のもある。2. Description of the Related Art Many conventional speech coding apparatuses adopt a configuration in which an input speech is divided into spectral envelope information and a sound source, and each is encoded in frame units to generate a speech code. On the other hand, the conventional speech decoding device adopts a configuration in which a decoded speech is generated by decoding the speech code and synthesizing the spectrum envelope information and the sound source by a synthesis filter. Also, in order to improve the quality of both the speech signal and the background noise signal having various aspects, a method of preparing a plurality of encoding modes and performing encoding while switching the encoding mode (multi-mode encoding method) is proposed. Some are employed.

【０００３】図１５は例えば文献「Ｈ．Ｔａｓａｋ
ｉ、”Ｈｉｇｈｌｅｖｅｌｄｅｓｃｒｉｐｔｉｏｎ
ｏｆＭｉｔｓｕｂｉｓｈｉ４−ｋｂｉｔ／ｓｓ
ｐｅｅｃｈｃｏｄｅｒ”、ＩＴＵＴｅｌｅｃｏｍｍ
ｕｎｉｃａｔｉｏｎＳｔａｎｄａｒｄｉｚａｔｉｏｎ
Ｓｅｃｔｏｒ、ＳｔｕｄｙＧｒｏｕｐ１６、Ｑｕ
ｅｓｔｉｏｎ１９−２１／１６Ｒａｐｐｏｒｔｅｕ
ｒＭｅｅｔｉｎｇ、Ｎｏ．ＡＣ−９９−０１６（１
９９９年９月）」に示された従来の音声符号化装置を示
す構成図である。FIG. 15 shows, for example, a document “H. Tasak”.
i, "High level description
of Mitsubishi 4-kbit / s s
"peech coder", ITU Telecomm
UNICATION STANDARDIZATION
Sector, Study Group 16, Qu
estion 19-21 / 16 Rapporteu
r Meeting, no. AC-99-016 (1
FIG. 9 is a configuration diagram showing a conventional speech encoding device shown in FIG.

【０００４】図において、１は入力音声に重畳している
背景雑音を抑圧する雑音抑圧処理を実行するとともに、
入力音声の直流成分をカットする低域阻止フィルタ処理
を実行する前処理部、２は前処理部１による前処理後の
入力音声を分析して、音声のスペクトル包絡情報である
線スペクトル対（以下、ＬＳＰという）を求めるスペク
トル分析部、３はスペクトル分析部２により求められた
ＬＳＰを符号化して、そのＬＳＰ符号を多重化部２０に
出力するとともに、そのＬＳＰを量子化して、量子化後
のＬＳＰ（ＬＳＰ符号を復号化した結果と同じ）を合成
フィルタ４のフィルタ係数（線形予測係数）に変換し、
そのフィルタ係数を合成フィルタ４と聴覚重み付け部６
に出力するスペクトル符号化部である。In FIG. 1, reference numeral 1 denotes a noise suppression process for suppressing background noise superimposed on input speech,
The pre-processing unit 2 that executes the low-frequency rejection filter processing for cutting the DC component of the input voice analyzes the input voice that has been pre-processed by the pre-processing unit 1, and analyzes a line spectrum pair (hereinafter, referred to as spectral envelope information of the voice) , LSP), encodes the LSP obtained by the spectrum analysis unit 2, outputs the LSP code to the multiplexing unit 20, quantizes the LSP, and performs quantization. LSP (same as the result of decoding the LSP code) is converted into a filter coefficient (linear prediction coefficient) of the synthesis filter 4,
The filter coefficient is combined with the synthesis filter 4 and the auditory weighting unit 6
Is a spectrum encoding unit that outputs the data to

【０００５】４はスペクトル符号化部３が出力するフィ
ルタ係数を用いて、切換スイッチ１９により選択された
仮の音源に対するフィルタリング処理を実行し、仮の合
成音を生成する合成フィルタ、５は合成フィルタ４によ
り生成された合成音と前処理部１による前処理後の入力
音声との差信号を出力する減算器、６はスペクトル符号
化部３が出力するフィルタ係数に基づいて聴覚重み付け
フィルタ係数を算出し、その聴覚重み付けフィルタ係数
を用いて、減算器５が出力する差信号に対する聴覚重み
付けフィルタ処理を実行して聴覚重み付け差信号を出力
する聴覚重み付け部である。[0005] Reference numeral 4 denotes a synthesis filter for executing a filtering process on the tentative sound source selected by the changeover switch 19 using the filter coefficient output from the spectrum encoding unit 3 to generate a tentative synthesized sound, and 5 denotes a synthesis filter. A subtracter for outputting a difference signal between the synthesized speech generated by the preprocessor 4 and the input speech after the preprocessing by the preprocessor 1; and 6 calculates an auditory weighting filter coefficient based on the filter coefficient output by the spectrum encoder 3. A hearing weighting unit that performs a hearing weighting filter process on the difference signal output by the subtractor 5 using the hearing weighting filter coefficient, and outputs a hearing weighted difference signal.

【０００６】７は聴覚重み付け部６が出力する聴覚重み
付け差信号のパワーを計算し、そのパワーの最小化を図
るため、インデックス（ゲイン符号、駆動音源符号、適
応音源符号）及び符号化モードを示すモード情報を逐次
更新する歪み最小化部、８，９は歪み最小化部７による
更新後のインデックスに対応する符号語を出力する符号
帳を有し、その符号語から仮の音源を生成する音源復号
化部である。Reference numeral 7 denotes an index (gain code, drive excitation code, adaptive excitation code) and an encoding mode for calculating the power of the perceptual weighting difference signal output from the perceptual weighting section 6 and minimizing the power. Distortion minimizing units 8 and 9 for sequentially updating mode information have codebooks for outputting codewords corresponding to indexes updated by distortion minimizing unit 7, and a sound source for generating a temporary sound source from the codewords It is a decoding unit.

【０００７】１０は過去の音源を所定長記憶し、歪み最
小化部７から適応音源符号を受けると、その適応音源符
号に対応する過去の音源を周期的に繰り返す時系列ベク
トルである適応符号ベクトルを出力する適応音源符号
帳、１１は非雑音的な複数の時系列ベクトルである駆動
符号ベクトルを格納し、歪み最小化部７から駆動音源符
号を受けると、その駆動音源符号に対応する駆動符号ベ
クトルを出力する駆動音源符号帳、１２はゲインに関す
る符号語（ゲイン値を示す語）を格納し、歪み最小化部
７からゲイン符号を受けると、そのゲイン符号に対応す
るゲイン値を出力するゲイン符号帳、１３はゲイン符号
帳１２が出力するゲイン値を適応音源符号帳１０が出力
する適応符号ベクトルに乗算する乗算器、１４はゲイン
符号帳１２が出力するゲイン値を駆動音源符号帳１１が
出力する駆動符号ベクトルに乗算する乗算器、１５は乗
算器１３の乗算結果と乗算器１４の乗算結果を加算し、
その加算結果（仮の音源）を出力する加算器である。An adaptive code vector 10 stores a predetermined length of a past excitation and, when receiving an adaptive excitation code from the distortion minimizing unit 7, periodically repeats the past excitation corresponding to the adaptive excitation code. An adaptive excitation codebook 11 that outputs a non-noise driving code vector, which is a non-noise plurality of time-series vectors, receives a driving excitation code from the distortion minimizing unit 7 and outputs a driving code corresponding to the driving excitation code. A driving excitation codebook 12 that outputs a vector stores a code word (a word indicating a gain value) related to the gain, and receives a gain code from the distortion minimizing unit 7 and outputs a gain value corresponding to the gain code. Codebook, 13 is a multiplier for multiplying the adaptive code vector output from adaptive excitation codebook 10 by the gain value output from gain codebook 12, and 14 is output from gain codebook 12. Multiplier gain value excitation code book 11 is multiplied by the driving code vector outputted, 15 adds the multiplication result of the multiplication result and the multiplier 14 of the multiplier 13,
The adder outputs the result of the addition (temporary sound source).

【０００８】１６は雑音的な複数の時系列ベクトルであ
る駆動符号ベクトルを格納し、歪み最小化部７から駆動
音源符号を受けると、その駆動音源符号に対応する駆動
符号ベクトルを出力する駆動音源符号帳、１７はゲイン
に関する符号語（ゲイン値を示す語）を格納し、歪み最
小化部７からゲイン符号を受けると、そのゲイン符号に
対応するゲイン値を出力するゲイン符号帳、１８はゲイ
ン符号帳１７が出力するゲイン値を駆動音源符号帳１６
が出力する駆動符号ベクトルに乗算し、その乗算結果
（仮の音源）を出力する乗算器である。Reference numeral 16 stores a driving code vector, which is a plurality of noise-like time-series vectors, and upon receiving a driving excitation code from the distortion minimizing unit 7, outputs a driving code vector corresponding to the driving excitation code. A code book 17 stores a code word (a word indicating a gain value) relating to gain, and receives a gain code from the distortion minimizing unit 7 and outputs a gain value corresponding to the gain code. The gain value output by the codebook 17 is
Is a multiplier that multiplies the driving code vector output by the controller and outputs the multiplication result (temporary sound source).

【０００９】１９は歪み最小化部７からモード情報を受
けると、そのモード情報にしたがって音源復号化部８が
出力する仮の音源又は音源復号化部９が出力する仮の音
源を選択し、その選択した仮の音源を合成フィルタ４に
与える切換スイッチ、２０はスペクトル符号化部３によ
り符号化されたＬＳＰ符号と、歪み最小化部７による更
新後のインデックス及びモード情報とを多重化して音声
符号を生成し、その音声符号を出力する多重化部であ
る。When the mode information 19 is received from the distortion minimizing unit 7, the temporary source selected by the source decoding unit 8 or the temporary source output by the source decoding unit 9 is selected according to the mode information. A changeover switch for giving the selected tentative sound source to the synthesis filter 4. A multiplexing unit 20 multiplexes the LSP code coded by the spectrum coding unit 3 with the index and mode information updated by the distortion minimizing unit 7 to generate a voice code. And a multiplexing unit that outputs the speech code.

【００１０】図１６は上記文献に示された従来の音声復
号化装置を示す構成図であり、図において、２１は音声
符号化装置により多重化されたＬＳＰ符号とインデック
スとモード情報とを分離する分離部、２２，２３は分離
部２１により分離されたインデックスに対応する符号語
を出力する符号帳を有し、その符号語から音源を生成す
る音源復号化部である。FIG. 16 is a block diagram showing a conventional speech decoding apparatus disclosed in the above-mentioned literature. In the figure, reference numeral 21 denotes an LSP code multiplexed by the speech encoding apparatus, an index and mode information. The separation units 22, 23 have a codebook that outputs a codeword corresponding to the index separated by the separation unit 21, and are excitation excitation decoding units that generate an excitation from the codeword.

【００１１】２４は過去の音源を所定長記憶し、分離部
２１から適応音源符号を受けると、その適応音源符号に
対応する過去の音源を周期的に繰り返す時系列ベクトル
である適応符号ベクトルを出力する適応音源符号帳、２
５は非雑音的な複数の時系列ベクトルである駆動符号ベ
クトルを格納し、分離部２１から駆動音源符号を受ける
と、その駆動音源符号に対応する駆動符号ベクトルを出
力する駆動音源符号帳、２６はゲインに関する符号語
（ゲイン値を示す語）を格納し、分離部２１からゲイン
符号を受けると、そのゲイン符号に対応するゲイン値を
出力するゲイン符号帳、２７はゲイン符号帳２６が出力
するゲイン値を適応音源符号帳２４が出力する適応符号
ベクトルに乗算する乗算器、２８はゲイン符号帳２６が
出力するゲイン値を駆動音源符号帳２５が出力する駆動
符号ベクトルに乗算する乗算器、２９は乗算器２７の乗
算結果と乗算器２８の乗算結果を加算し、その加算結果
（仮の音源）を出力する加算器である。Reference numeral 24 stores a past excitation of a predetermined length, and upon receiving an adaptive excitation code from the separation unit 21, outputs an adaptive code vector which is a time series vector that periodically repeats the past excitation corresponding to the adaptive excitation code. Adaptive excitation codebook, 2
Reference numeral 5 denotes a driving excitation codebook that stores a driving code vector that is a plurality of non-noise time-series vectors and, when receiving a driving excitation code from the separation unit 21, outputs a driving code vector corresponding to the driving excitation code. Is a gain codebook that stores a code word (a word indicating a gain value) related to the gain and receives a gain code from the separation unit 21 and outputs a gain value corresponding to the gain code. A multiplier for multiplying the gain value by the adaptive code vector output from the adaptive excitation codebook 24; a multiplier 28 for multiplying the gain value output by the gain codebook 26 by the driving code vector output by the driving excitation codebook 25; Is an adder that adds the multiplication result of the multiplier 27 and the multiplication result of the multiplier 28 and outputs the addition result (temporary sound source).

【００１２】３０は雑音的な複数の時系列ベクトルであ
る駆動符号ベクトルを格納し、分離部２１から駆動音源
符号を受けると、その駆動音源符号に対応する駆動符号
ベクトルを出力する駆動音源符号帳、３１はゲインに関
する符号語（ゲイン値を示す語）を格納し、分離部２１
からゲイン符号を受けると、そのゲイン符号に対応する
ゲイン値を出力するゲイン符号帳、３２はゲイン符号帳
３１が出力するゲイン値を駆動音源符号帳３０が出力す
る駆動符号ベクトルに乗算し、その乗算結果（仮の音
源）を出力する乗算器である。Reference numeral 30 denotes a driving excitation codebook which stores a plurality of driving code vectors, which are noise-like time series vectors, and outputs a driving code vector corresponding to the driving excitation code when receiving the driving excitation code from the separating unit 21. , 31 store a code word relating to the gain (word indicating the gain value).
, A gain codebook that outputs a gain value corresponding to the gain code, and 32 multiplies the gain value output by the gain codebook 31 with the driving code vector output by the driving excitation codebook 30, This is a multiplier that outputs a multiplication result (temporary sound source).

【００１３】３３は分離部２１からモード情報を受ける
と、そのモード情報にしたがって音源復号化部２２が出
力する仮の音源又は音源復号化部２３が出力する仮の音
源を選択し、その選択した仮の音源を合成フィルタ３５
に与える切換スイッチ、３４は分離部２１が出力するＬ
ＳＰ符号を復号化し、その復号結果を合成フィルタ３５
のフィルタ係数（線形予測係数）に変換して、そのフィ
ルタ係数を合成フィルタ３５と後処理部３６に出力する
スペクトル復号化部、３５はスペクトル復号化部３４が
出力するフィルタ係数を用いて、切換スイッチ３３によ
り選択された仮の音源に対するフィルタリング処理を実
行し、仮の合成音を生成する合成フィルタ、３６はスペ
クトル復号化部３４が出力するフィルタ係数等に基づい
て合成フィルタ３５により生成された合成音に対する音
声強調処理などの後処理を実行し、入力音声の再生結果
（出力音声）を出力する後処理部である。Upon receiving the mode information from the separating section 21, the 33 selects a temporary excitation output from the excitation decoding section 22 or a temporary excitation output from the excitation decoding section 23 in accordance with the mode information. Temporary sound source is synthesized by filter 35
, And a switch 34 that outputs L
The SP code is decoded, and the decoded result is output to the synthesis filter 35.
, And outputs the filter coefficients to the synthesis filter 35 and the post-processing unit 36. The spectrum decoding unit 35 uses the filter coefficients output from the spectrum decoding unit 34 to perform switching. A synthesis filter that performs a filtering process on the tentative sound source selected by the switch 33 to generate a tentative synthesized sound, and a synthesis filter 36 generated by the synthesis filter 35 based on the filter coefficient output from the spectrum decoding unit 34 and the like. This is a post-processing unit that performs post-processing such as voice emphasis processing on the sound and outputs a reproduction result (output sound) of the input sound.

【００１４】次に動作について説明する。従来の音声符
号化装置及び音声復号化装置は、５〜５０ｍｓ程度を１
フレームとして、フレーム単位に処理を実行する。Next, the operation will be described. The conventional speech encoding device and speech decoding device require about 5 to 50 ms for one.
The processing is executed for each frame as a frame.

【００１５】まず、音声符号化装置の前処理部１は、入
力音声を受けると、その入力音声に重畳している背景雑
音を抑圧する雑音抑圧処理を実行するとともに、入力音
声の直流成分をカットする低域阻止フィルタ処理を実行
する。スペクトル分析部２は、前処理部１が入力音声に
対する前処理を実行すると、前処理後の入力音声を分析
して、音声のスペクトル包絡情報であるＬＳＰを求め
る。First, upon receiving an input speech, the pre-processing unit 1 of the speech encoding device executes a noise suppression process for suppressing background noise superimposed on the input speech, and cuts a DC component of the input speech. To perform low-pass rejection filter processing. When the pre-processing unit 1 performs pre-processing on the input voice, the spectrum analysis unit 2 analyzes the input voice after the pre-processing and obtains LSP which is the spectrum envelope information of the voice.

【００１６】そして、スペクトル符号化部３は、スペク
トル分析部２により求められたＬＳＰを符号化して、そ
のＬＳＰ符号を多重化部２０に出力する。また、そのＬ
ＳＰを量子化して、量子化後のＬＳＰを合成フィルタ４
のフィルタ係数に変換し、そのフィルタ係数を合成フィ
ルタ４と聴覚重み付け部６に出力する。The spectrum encoding unit 3 encodes the LSP obtained by the spectrum analysis unit 2 and outputs the LSP code to the multiplexing unit 20. Also, its L
The SP is quantized, and the LSP after the quantization is combined with the synthesis filter 4.
, And outputs the filter coefficient to the synthesis filter 4 and the auditory weighting unit 6.

【００１７】合成フィルタ４は、スペクトル符号化部３
からフィルタ係数を受けると、そのフィルタ係数を用い
て、切換スイッチ１９により選択された仮の音源に対す
るフィルタリング処理を実行し、仮の合成音を生成す
る。仮の音源の生成処理は後述する。減算器５は、合成
フィルタ４が合成音を生成すると、その合成音と前処理
部１による前処理後の入力音声との差信号を出力し、聴
覚重み付け部６は、スペクトル符号化部３が出力するフ
ィルタ係数に基づいて聴覚重み付けフィルタ係数を算出
し、その聴覚重み付けフィルタ係数を用いて、減算器５
が出力する差信号に対する聴覚重み付けフィルタ処理を
実行して聴覚重み付け差信号を出力する。The synthesis filter 4 includes a spectrum encoding unit 3
When the filter coefficient is received from, a filtering process is performed on the temporary sound source selected by the changeover switch 19 using the filter coefficient to generate a temporary synthesized sound. The process of generating a temporary sound source will be described later. When the synthesis filter 4 generates a synthesized sound, the subtracter 5 outputs a difference signal between the synthesized sound and the input voice after the pre-processing by the pre-processing unit 1, and the auditory weighting unit 6 determines that the spectrum encoding unit 3 An auditory weighting filter coefficient is calculated based on the filter coefficient to be output, and the subtractor 5 is used by using the auditory weighting filter coefficient.
Performs an auditory weighting filter process on the difference signal output by the controller and outputs an auditory weighted difference signal.

【００１８】歪み最小化部７は、インデックス及び符号
化モードを逐次更新することにより、聴覚重み付け部６
が出力する聴覚重み付け差信号のパワーの最小化を図
る。即ち、インデックスとモード情報を適宜選択して、
音源復号化部８，９と切換スイッチ１９に出力する毎
に、その聴覚重み付け差信号のパワーを計算し、その計
算結果であるパワーが最も小さくなるインデックスとモ
ード情報の組合せを検索する。そして、聴覚重み付け差
信号のパワーが最小になるインデックスとモード情報が
求まると、そのインデックスとモード情報を多重化部２
０に出力する。ただし、音源復号化部９には適応音源符
号帳が内蔵されていないので、第二の符号化モードを示
すモード情報を出力する場合には、適応音源符号を出力
しない。The distortion minimizing section 7 sequentially updates the index and the encoding mode, thereby obtaining the auditory weighting section 6.
Minimizing the power of the auditory weighting difference signal output by. That is, the index and the mode information are appropriately selected,
Each time the signals are output to the sound source decoding units 8 and 9 and the changeover switch 19, the power of the perceptual weighting difference signal is calculated, and the result of the calculation is searched for the combination of the index and mode information that minimizes the power. When the index and the mode information that minimize the power of the auditory weighting difference signal are determined, the index and the mode information are multiplexed by the multiplexing unit 2.
Output to 0. However, since excitation excitation decoding section 9 does not include an adaptive excitation codebook, it does not output an adaptive excitation code when outputting mode information indicating the second encoding mode.

【００１９】音源復号化部８，９は、歪み最小化部７か
らインデックスを受けると、そのインデックスに応じて
仮の音源を生成する。具体的には、まず、音源復号化部
８の適応音源符号帳１０は、過去の音源を所定長記憶
し、歪み最小化部７から適応音源符号を受けると、その
適応音源符号に対応する過去の音源を周期的に繰り返す
時系列ベクトルを適応符号ベクトルとして出力する。な
お、適応音源符号帳１０は、歪み最小化部７がインデッ
クス及びモード情報を選択した後で、そのインデックス
及びモード情報に対して、切換スイッチ１９が出力した
仮の音源を選択して出力すると、その仮の音源を最終的
な音源として記憶する。Upon receiving the index from distortion minimizing section 7, excitation decoding sections 8 and 9 generate a temporary excitation according to the index. More specifically, first, adaptive excitation codebook 10 of excitation decoding section 8 stores a past excitation of a predetermined length, and receives an adaptive excitation code from distortion minimizing section 7, and stores the past excitation corresponding to the adaptive excitation code. Is output as an adaptive code vector. Note that adaptive excitation codebook 10 selects the temporary excitation output from changeover switch 19 in response to the index and mode information after distortion minimizing section 7 selects the index and mode information. The temporary sound source is stored as a final sound source.

【００２０】音源復号化部８の駆動音源符号帳１１は、
非雑音的な複数の時系列ベクトルである駆動符号ベクト
ルを格納し、歪み最小化部７から駆動音源符号を受ける
と、その駆動音源符号に対応する駆動符号ベクトルを出
力する。ただし、駆動音源符号帳１１は、予め、各時系
列ベクトルを複数のパルス位置と極性で表現する代数的
音源テーブルを備えることにより、歪み最小化部７が出
力する駆動音源符号に基づいて代数的音源を生成し、そ
の代数的音源を駆動符号ベクトルとして出力するように
してもよい。Driving excitation codebook 11 of excitation decoding section 8 has:
When a driving code vector, which is a plurality of non-noise time-series vectors, is stored and a driving excitation code is received from the distortion minimizing unit 7, a driving code vector corresponding to the driving excitation code is output. However, driving excitation codebook 11 is provided with an algebraic excitation table in which each time-series vector is represented by a plurality of pulse positions and polarities in advance, so that algebraic excitation based on the driving excitation code output from distortion minimizing section 7 is provided. A sound source may be generated, and the algebraic sound source may be output as a driving code vector.

【００２１】そして、ゲイン符号帳１２がゲイン符号に
対応するゲイン値を出力すると、適応音源符号帳１０か
ら出力された適応符号ベクトルと駆動音源符号帳１１か
ら出力された駆動符号ベクトルは、乗算器１３，１４に
よりゲイン値が乗算され、加算器１５により乗算器１
３，１４の乗算結果が相互に加算される。When the gain codebook 12 outputs a gain value corresponding to the gain code, the adaptive code vector output from the adaptive excitation codebook 10 and the driving code vector output from the driving excitation codebook 11 are multiplied by a multiplier. The gain value is multiplied by 13 and 14, and the multiplier 1 is multiplied by the adder 15.
The multiplication results of 3, 14 are added to each other.

【００２２】一方、音源復号化部９の駆動音源符号帳１
６は、雑音的な複数の時系列ベクトルである駆動符号ベ
クトルを格納し、歪み最小化部７から駆動音源符号を受
けると、その駆動音源符号に対応する駆動符号ベクトル
を出力する。ただし、駆動音源符号帳１６は、予め、各
時系列ベクトルを複数のパルス位置と極性で表現する代
数的音源テーブルを備えることにより、歪み最小化部７
が出力する駆動音源符号に基づいて代数的音源を生成
し、その代数的音源を駆動符号ベクトルとして出力する
ようにしてもよい。そして、ゲイン符号帳１７がゲイン
符号に対応するゲイン値を出力すると、駆動音源符号帳
１６から出力された駆動符号ベクトルは、乗算器１８に
よりゲイン値が乗算される。On the other hand, driving excitation codebook 1 of excitation decoding section 9
Numeral 6 stores a driving code vector which is a plurality of noise-like time-series vectors, and upon receiving a driving excitation code from distortion minimizing section 7, outputs a driving code vector corresponding to the driving excitation code. However, the driving excitation codebook 16 is provided with an algebraic excitation table in which each time-series vector is expressed in advance by a plurality of pulse positions and polarities.
May generate an algebraic excitation based on the driving excitation code output by the controller, and output the algebraic excitation as a driving code vector. When the gain codebook 17 outputs a gain value corresponding to the gain code, the driving code vector output from the driving excitation codebook 16 is multiplied by the gain value by the multiplier 18.

【００２３】このようにして、音源復号化部８の加算器
１５から仮の音源が出力され、音源復号化部９の乗算器
１８から仮の音源が出力されると、切換スイッチ１９
は、歪み最小化部７が出力するモード情報にしたがって
音源復号化部８が出力する仮の音源又は音源復号化部９
が出力する仮の音源の何れか一方を選択し、その選択し
た仮の音源を合成フィルタ４に与える。In this way, when the provisional excitation is output from adder 15 of excitation decoding section 8 and the temporary excitation is output from multiplier 18 of excitation decoding section 9, changeover switch 19
Is a temporary excitation or excitation decoding section 9 output by the excitation decoding section 8 according to the mode information output by the distortion minimizing section 7.
Selects one of the tentative sound sources output by, and gives the selected tentative sound source to the synthesis filter 4.

【００２４】多重化部２０は、スペクトル符号化部３に
より符号化されたＬＳＰ符号と、歪み最小化部７による
更新後のインデックス及びモード情報（聴覚重み付け差
信号のパワーが最小となるインデックス及びモード情
報）とを多重化して音声符号を生成し、その音声符号を
出力する。The multiplexing unit 20 includes the LSP code coded by the spectrum coding unit 3 and the index and mode information updated by the distortion minimizing unit 7 (the index and mode in which the power of the auditory weighting difference signal is minimized). ) To generate a speech code and output the speech code.

【００２５】次に、音声復号化装置の分離部２１は、音
声符号化装置から出力された音声符号を入力すると、そ
の音声符号に含まれているＬＳＰ符号とインデックスと
モード情報とを分離する。Next, upon inputting the speech code output from the speech encoding device, the separation unit 21 of the speech decoding device separates the LSP code, index, and mode information contained in the speech code.

【００２６】音源復号化部２２，２３は、分離部２１か
らインデックスを受けると、そのインデックスに応じて
仮の音源を生成する。具体的には、まず、音源復号化部
２２の適応音源符号帳２４は、過去の音源を所定長記憶
し、分離部２１から適応音源符号を受けると、その適応
音源符号に対応する過去の音源を周期的に繰り返す時系
列ベクトルを適応符号ベクトルとして出力する。なお、
適応音源符号帳２４は、切換スイッチ３３が仮の音源を
選択して出力すると、その仮の音源を最終的な音源とし
て記憶する。Upon receiving the index from separation section 21, excitation decoding sections 22 and 23 generate a temporary excitation according to the index. Specifically, first, adaptive excitation codebook 24 of excitation decoding section 22 stores past excitations for a predetermined length, and receives an adaptive excitation code from demultiplexing section 21, and receives past excitations corresponding to the adaptive excitation code. Is output as an adaptive code vector. In addition,
When the changeover switch 33 selects and outputs a temporary excitation, the adaptive excitation codebook 24 stores the temporary excitation as a final excitation.

【００２７】音源復号化部２２の駆動音源符号帳２５
は、非雑音的な複数の時系列ベクトルである駆動符号ベ
クトルを格納し、分離部２１から駆動音源符号を受ける
と、その駆動音源符号に対応する駆動符号ベクトルを出
力する。ただし、駆動音源符号帳２５は、予め、各時系
列ベクトルを複数のパルス位置と極性で表現する代数的
音源テーブルを備えることにより、分離部２１が出力す
る駆動音源符号に基づいて代数的音源を生成し、その代
数的音源を駆動符号ベクトルとして出力するようにして
もよい。Driving excitation codebook 25 of excitation decoding section 22
Stores a driving code vector, which is a plurality of non-noise time-series vectors, and outputs a driving code vector corresponding to the driving excitation code when receiving the driving excitation code from the separation unit 21. However, the driving excitation codebook 25 includes an algebraic excitation table in which each time-series vector is represented in advance by a plurality of pulse positions and polarities, so that the algebraic excitation is output based on the driving excitation code output from the separation unit 21. Alternatively, the algebraic excitation may be generated and output as a driving code vector.

【００２８】そして、ゲイン符号帳２６がゲイン符号に
対応するゲイン値を出力すると、適応音源符号帳２４か
ら出力された適応符号ベクトルと駆動音源符号帳２５か
ら出力された駆動符号ベクトルは、乗算器２７，２８に
よりゲイン値が乗算され、加算器２９により乗算器２
７，２８の乗算結果が相互に加算される。When the gain codebook 26 outputs a gain value corresponding to the gain code, the adaptive code vector output from the adaptive excitation codebook 24 and the driving code vector output from the driving excitation codebook 25 are multiplied by a multiplier. The gain value is multiplied by 27 and 28, and the multiplier 2 is
The multiplication results of 7, 28 are added to each other.

【００２９】一方、音源復号化部２３の駆動音源符号帳
３０は、雑音的な複数の時系列ベクトルである駆動符号
ベクトルを格納し、分離部２１から駆動音源符号を受け
ると、その駆動音源符号に対応する駆動符号ベクトルを
出力する。ただし、駆動音源符号帳３０は、予め、各時
系列ベクトルを複数のパルス位置と極性で表現する代数
的音源テーブルを備えることにより、分離部２１が出力
する駆動音源符号に基づいて代数的音源を生成し、その
代数的音源を駆動符号ベクトルとして出力するようにし
てもよい。そして、ゲイン符号帳３１がゲイン符号に対
応するゲイン値を出力すると、駆動音源符号帳３０から
出力された駆動符号ベクトルは、乗算器３２によりゲイ
ン値が乗算される。On the other hand, driving excitation codebook 30 of excitation decoding section 23 stores a driving code vector which is a plurality of noise-like time series vectors, and receives driving excitation code from separation section 21, and receives the driving excitation code. Is output. However, the driving excitation codebook 30 is provided with an algebraic excitation table in which each time-series vector is expressed by a plurality of pulse positions and polarities in advance, so that the algebraic excitation is output based on the driving excitation code output from the separation unit 21. Alternatively, the algebraic excitation may be generated and output as a driving code vector. When the gain codebook 31 outputs a gain value corresponding to the gain code, the driving code vector output from the driving excitation codebook 30 is multiplied by the gain value by the multiplier 32.

【００３０】このようにして、音源復号化部２２の加算
器２９から仮の音源が出力され、音源復号化部２３の乗
算器３２から仮の音源が出力されると、切換スイッチ３
３は、分離部２１が出力するモード情報にしたがって音
源復号化部２２が出力する仮の音源又は音源復号化部２
３が出力する仮の音源の何れか一方を選択し、その選択
した仮の音源を合成フィルタ３５に与える。As described above, when the provisional excitation is output from adder 29 of excitation decoding section 22 and the temporary excitation is output from multiplier 32 of excitation decoding section 23, changeover switch 3
Reference numeral 3 denotes a temporary excitation or excitation decoding section 2 output by the excitation decoding section 22 according to the mode information output by the separation section 21.
3 selects one of the tentative sound sources output, and gives the selected tentative sound source to the synthesis filter 35.

【００３１】スペクトル復号化部３４は、分離部２１が
ＬＳＰ符号を出力すると、そのＬＳＰ符号を復号化し、
その復号結果を合成フィルタ３５のフィルタ係数に変換
して、そのフィルタ係数を合成フィルタ３５と後処理部
３６に出力する。合成フィルタ３５は、スペクトル復号
化部３４からフィルタ係数を受けると、そのフィルタ係
数を用いて、切換スイッチ３３により選択された仮の音
源に対するフィルタリング処理を実行し、仮の合成音を
生成する。後処理部３６は、スペクトル復号化部３４が
出力するフィルタ係数等に基づいて合成フィルタ３５に
より生成された合成音に対する音声強調処理などの後処
理を実行し、入力音声の再生結果（出力音声）を出力す
る。When the separating section 21 outputs the LSP code, the spectrum decoding section 34 decodes the LSP code,
The decoding result is converted into a filter coefficient of the synthesis filter 35, and the filter coefficient is output to the synthesis filter 35 and the post-processing unit 36. Upon receiving the filter coefficient from the spectrum decoding unit 34, the synthesis filter 35 performs a filtering process on the temporary sound source selected by the changeover switch 33 using the filter coefficient, and generates a temporary synthesized sound. The post-processing unit 36 performs post-processing such as voice enhancement processing on the synthesized sound generated by the synthesis filter 35 based on the filter coefficients and the like output from the spectrum decoding unit 34, and reproduces the input sound (output sound). Is output.

【００３２】ここで、図１７は従来の音声符号化装置及
び音声復号化装置により使用されるゲイン符号帳の一例
を示す説明図である。特に、図１７（ａ）はゲイン符号
帳１２，２６の一例を示し、図１７（ｂ）はゲイン符号
帳１７，３１の一例を示している。FIG. 17 is an explanatory diagram showing an example of a gain codebook used by a conventional speech encoding apparatus and speech decoding apparatus. In particular, FIG. 17A shows an example of the gain codebooks 12 and 26, and FIG. 17B shows an example of the gain codebooks 17 and 31.

【００３３】この例の場合、各ゲイン符号帳は１２８個
のゲイン符号語を格納している。ただし、ゲイン符号帳
１２，２６に格納されているゲイン符号語は、適応符号
ベクトルと駆動符号ベクトルに乗じる２個のゲイン値の
組を示す符号語から構成され、ゲイン符号帳１７，３１
に格納されているゲイン符号語は、駆動符号ベクトルに
乗じる１個のゲイン値を示す符号語から構成されてい
る。インデックスと評価値順位は、各ゲイン符号帳内に
実際には格納されていないものであるが、説明の便宜の
ため記載している。インデックスは上の符号語から順番
に０から１２７の値となっている。評価値はゲイン符号
語のパワー（２乗和）の値である。例えば、インデック
スが「１」の符号語のパワーの順位は「１０２」であ
る。In this example, each gain codebook stores 128 gain codewords. However, the gain codewords stored in the gain codebooks 12 and 26 are composed of codewords indicating a set of two gain values that are multiplied by the adaptive code vector and the drive code vector.
Is composed of a codeword indicating one gain value by which the driving code vector is multiplied. Although the index and the evaluation value rank are not actually stored in each gain codebook, they are described for convenience of explanation. The index has a value of 0 to 127 in order from the upper code word. The evaluation value is the value of the power (sum of squares) of the gain codeword. For example, the power ranking of a codeword having an index of “1” is “102”.

【００３４】各ゲイン符号帳の動作としては、あるゲイ
ン符号を入力すると、そのゲイン符号に一致するインデ
ックス位置に格納しているゲイン符号語を出力する。各
ゲイン符号帳に格納されているゲイン符号語は、学習用
音声とその符号化音声との歪みが小さくなるように学習
して作成される。そして、音声符号を伝送する際の符号
誤りによる出力音声の劣化を最小限に抑えるため、適切
にゲイン符号語の並べ換えが行われる。As an operation of each gain codebook, when a certain gain code is input, a gain codeword stored at an index position corresponding to the gain code is output. The gain codeword stored in each gain codebook is created by learning so that distortion between the learning speech and the encoded speech is reduced. Then, in order to minimize the deterioration of the output voice due to a code error when transmitting the voice code, the gain codeword is appropriately rearranged.

【００３５】例えば、ゲイン符号に１ビット誤りを実際
に与えたときに生じる劣化の大きさの期待値を計算し、
さらに、ランダムに選択した２つのゲイン符号語を交換
したときに生じる劣化の大きさの期待値を計算し、前者
の期待値と比べて後者の期待値が減少するときに実際に
ゲイン符号語の格納順序を交換する。この作業を期待値
の減少が微小になるまで繰り返す。従来の音声符号化装
置及び音声復号化装置は、このようなゲイン符号語の並
べ換えが行われたゲイン符号帳を使用している。For example, an expected value of the magnitude of the degradation that occurs when a one-bit error is actually given to the gain code is calculated,
Furthermore, an expected value of the magnitude of the degradation that occurs when two randomly selected gain codewords are exchanged is calculated, and when the expected value of the latter is reduced as compared with the expected value of the former, the gain codeword of the gain is actually calculated. Swap the storage order. This operation is repeated until the decrease in the expected value becomes very small. Conventional speech encoding devices and speech decoding devices use a gain codebook in which such gain codewords are rearranged.

【００３６】[0036]

【発明が解決しようとする課題】従来の音声符号化装置
及び音声復号化装置は以上のように構成されているの
で、音声符号化装置の歪み最小化部７が、聴覚重み付け
差信号のパワーが最小化するように最適なモード情報を
選択するが、音声符号に伝送路誤りが重畳して、音声復
号化装置がモード情報を誤認すると、入力音声の再生品
質が大きく劣化する課題があった。また、符号誤りによ
る劣化を最小限に抑えるため、各符号帳毎に符号語の並
べ換えを実施しているが、モード情報が誤認される場合
があることを考慮した並べ換えを実施していないため、
モード情報の誤りに対する耐性を高めることができない
課題があった。Since the conventional speech coding apparatus and speech decoding apparatus are constructed as described above, the distortion minimizing section 7 of the speech coding apparatus uses the power of the auditory weighting difference signal. Although the optimal mode information is selected so as to minimize it, if the transmission path error is superimposed on the audio code and the audio decoding device mistakenly recognizes the mode information, there is a problem that the reproduction quality of the input audio is significantly deteriorated. In addition, in order to minimize deterioration due to code errors, codewords are rearranged for each codebook.However, since rearrangement is not performed in consideration that mode information may be erroneously recognized,
There has been a problem that it is not possible to increase the resistance to mode information errors.

【００３７】具体的には、ゲイン符号帳１２，２６にお
ける符号語の並べ換えと、ゲイン符号帳１７，３１にお
ける符号語の並べ換えを無関係に実施しているため、ゲ
イン符号語のパワー（評価値）の順位に着目すると、図
１７に示すように、ゲイン符号帳１２，２６とゲイン符
号帳１７，３１間の相関関係が全くなくなっている。こ
のため、例えば、インデックスが「０」のゲイン符号を
復号する場合、モード情報を誤認して、本来第一の符号
化モードが選択されるところを第二の符号化モードが選
択されると、評価値順位が「４１」のゲイン値ではな
く、「１２１」のゲイン値が選択される。これにより、
出力音声の振幅が大きく変化し、局所的な大劣化を引き
起こすことになる。More specifically, since the rearrangement of the codewords in the gain codebooks 12 and 26 and the rearrangement of the codewords in the gain codebooks 17 and 31 are performed independently, the power (evaluation value) of the gain codeword is obtained. Paying attention to the rank of, as shown in FIG. 17, there is no correlation between the gain codebooks 12 and 26 and the gain codebooks 17 and 31 at all. For this reason, for example, when decoding a gain code having an index of “0”, if the mode information is erroneously recognized and the second coding mode is selected instead of the originally selected first coding mode, The gain value of “121” is selected instead of the gain value of the evaluation value rank of “41”. This allows
The amplitude of the output voice changes greatly, causing large local degradation.

【００３８】この発明は上記のような課題を解決するた
めになされたもので、モード情報を誤認しても、音声の
再生品質の劣化を抑制することができる音声符号化装
置、音声復号化装置及び符号語配列方法を得ることを目
的とする。The present invention has been made in order to solve the above-described problems, and a speech encoding apparatus and a speech decoding apparatus capable of suppressing deterioration of speech reproduction quality even if mode information is erroneously recognized. And a code word arrangement method.

【００３９】[0039]

【課題を解決するための手段】この発明に係る音声符号
化装置は、複数の符号帳が他の符号帳の符号語に関する
評価値の順位と相応して、符号語の格納順序が並び換え
られているようにしたものである。In the speech coding apparatus according to the present invention, the storage order of codewords of a plurality of codebooks is rearranged in accordance with the order of evaluation values for codewords of another codebook. It is as if it were.

【００４０】この発明に係る音声符号化装置は、符号語
に関する評価値として、その符号語のパワー又は平均振
幅を用いるようにしたものである。The speech coding apparatus according to the present invention uses the power or the average amplitude of the code word as the evaluation value for the code word.

【００４１】この発明に係る音声符号化装置は、複数の
符号帳が音源ゲインを出力する符号帳であるようにした
ものである。[0041] In the speech coding apparatus according to the present invention, the plurality of codebooks are codebooks for outputting excitation gains.

【００４２】この発明に係る音声符号化装置は、複数の
符号帳間の対応する各符号語に関する評価値の偏差の合
計値が最小となるように、複数の符号帳の符号語の格納
順序が並び換えられているようにしたものである。According to the speech encoding apparatus of the present invention, the storage order of the codewords of the plurality of codebooks is adjusted so that the total value of the deviations of the evaluation values of the corresponding codewords among the plurality of codebooks is minimized. It is made to be rearranged.

【００４３】この発明に係る音声符号化装置は、符号語
から音源を生成して、その音源から合成音を生成する場
合、その合成音に関する期待値を評価値として取り扱う
ようにしたものである。In the speech coding apparatus according to the present invention, when a sound source is generated from a codeword and a synthesized sound is generated from the sound source, an expected value relating to the synthesized sound is treated as an evaluation value.

【００４４】この発明に係る音声符号化装置は、インデ
ックスをマッピングするマッピング手段を有し、少なく
とも１以上の符号帳がマッピング後のインデックスに対
応する符号語を出力することにより、複数の符号帳の符
号語の格納順序を予め評価値の順位を基準にして更新す
ることなく、更新後の格納順序と等価な状態を構築する
ようにしたものである。The speech coding apparatus according to the present invention has mapping means for mapping an index, and at least one or more codebooks output a codeword corresponding to the index after the mapping, thereby forming a plurality of codebooks. A state equivalent to the updated storage order is constructed without updating the storage order of the codewords in advance based on the rank of the evaluation value.

【００４５】この発明に係る音声符号化装置は、インデ
ックスをマッピングするマッピング手段を有し、少なく
とも１以上の符号帳がマッピング後のインデックスに対
応する符号語を出力することにより、複数の符号帳の符
号語の格納順序を予め評価値の偏差の合計値が最小とな
るように更新することなく、更新後の格納順序と等価な
状態を構築するようにしたものである。The speech coding apparatus according to the present invention has a mapping means for mapping an index, and at least one or more codebooks output a codeword corresponding to the index after the mapping, so that a plurality of codebooks are output. A state equivalent to the storage order after the update is constructed without updating the storage order of the codewords in advance so that the total value of the deviations of the evaluation values is minimized.

【００４６】この発明に係る音声復号化装置は、複数の
符号帳が他の符号帳の符号語に関する評価値の順位と相
応して、符号語の格納順序が並び換えられているように
したものである。A speech decoding apparatus according to the present invention is arranged such that a plurality of codebooks are rearranged in a storage order of codewords corresponding to the order of evaluation values of codewords of another codebook. It is.

【００４７】この発明に係る音声復号化装置は、符号語
に関する評価値として、その符号語のパワー又は平均振
幅を用いるようにしたものである。The speech decoding apparatus according to the present invention uses the power or average amplitude of a code word as an evaluation value for the code word.

【００４８】この発明に係る音声復号化装置は、複数の
符号帳が音源ゲインを出力する符号帳であるようにした
ものである。[0048] In the speech decoding apparatus according to the present invention, the plurality of codebooks are codebooks for outputting excitation gains.

【００４９】この発明に係る音声復号化装置は、複数の
符号帳間の対応する各符号語に関する評価値の偏差の合
計値が最小となるように、複数の符号帳の符号語の格納
順序が並び換えられているようにしたものである。According to the speech decoding apparatus of the present invention, the storage order of the codewords of the plurality of codebooks is set so that the total value of the deviations of the evaluation values of the corresponding codewords among the plurality of codebooks is minimized. It is made to be rearranged.

【００５０】この発明に係る音声復号化装置は、符号語
から音源を生成して、その音源から合成音を生成する場
合、その合成音に関する期待値を評価値として取り扱う
ようにしたものである。In the speech decoding apparatus according to the present invention, when a sound source is generated from a codeword and a synthesized sound is generated from the sound source, an expected value relating to the synthesized sound is treated as an evaluation value.

【００５１】この発明に係る音声復号化装置は、インデ
ックスをマッピングするマッピング手段を有し、少なく
とも１以上の符号帳がマッピング後のインデックスに対
応する符号語を出力することにより、複数の符号帳の符
号語の格納順序を予め評価値の順位を基準にして更新す
ることなく、更新後の格納順序と等価な状態を構築する
ようにしたものである。The speech decoding apparatus according to the present invention has mapping means for mapping an index, and at least one or more codebooks output a codeword corresponding to the mapped index, so that a plurality of codebooks are output. A state equivalent to the updated storage order is constructed without updating the storage order of the codewords in advance based on the rank of the evaluation value.

【００５２】この発明に係る音声復号化装置は、インデ
ックスをマッピングするマッピング手段を有し、少なく
とも１以上の符号帳がマッピング後のインデックスに対
応する符号語を出力することにより、複数の符号帳の符
号語の格納順序を予め評価値の偏差の合計値が最小とな
るように更新することなく、更新後の格納順序と等価な
状態を構築するようにしたものである。The speech decoding apparatus according to the present invention has mapping means for mapping an index, and at least one or more codebooks output a codeword corresponding to the mapped index, thereby obtaining a plurality of codebooks. A state equivalent to the storage order after the update is constructed without updating the storage order of the codewords in advance so that the total value of the deviations of the evaluation values is minimized.

【００５３】この発明に係る符号語配列方法は、各符号
帳の符号語に関する評価値を調査し、他の符号帳の符号
語に関する評価値の順位と相応して、少なくとも１以上
の符号帳の符号語の格納順序を並び換えるようにしたも
のである。The codeword arranging method according to the present invention examines the evaluation values of the codewords of each codebook, and, according to the order of the evaluation values of the codewords of the other codebooks, at least one or more codebooks. The storage order of the codewords is rearranged.

【００５４】この発明に係る符号語配列方法は、符号語
に関する評価値として、その符号語のパワー又は平均振
幅を用いるようにしたものである。In the code word arranging method according to the present invention, the power or the average amplitude of the code word is used as the evaluation value for the code word.

【００５５】この発明に係る符号語配列方法は、複数の
符号帳が音源ゲインを出力する符号帳であるようにした
ものである。In the codeword arrangement method according to the present invention, the plurality of codebooks are codebooks for outputting excitation gains.

【００５６】この発明に係る符号語配列方法は、複数の
符号帳間の対応する各符号語に関する評価値の偏差の合
計値を計算し、その合計値が減少して最小化するまで、
少なくとも１以上の符号帳の符号語の格納順序を更新す
るようにしたものである。The code word arranging method according to the present invention calculates the sum of the deviations of the evaluation values of the corresponding code words among a plurality of codebooks, and calculates the difference until the sum is reduced and minimized.
The storage order of at least one or more codebooks is updated.

【００５７】この発明に係る符号語配列方法は、符号語
から音源を生成して、その音源から合成音を生成する場
合、その合成音に関する期待値を評価値として取り扱う
ようにしたものである。In the code word arranging method according to the present invention, when a sound source is generated from a code word and a synthesized sound is generated from the sound source, an expected value relating to the synthesized sound is treated as an evaluation value.

【００５８】[0058]

【発明の実施の形態】以下、この発明の実施の一形態を
説明する。実施の形態１．図１はこの発明の実施の形態１による音
声符号化装置を示す構成図であり、図において、４１は
入力音声に重畳している背景雑音を抑圧する雑音抑圧処
理を実行するとともに、入力音声の直流成分をカットす
る低域阻止フィルタ処理を実行する前処理部、４２は前
処理部４１による前処理後の入力音声を分析して、音声
のスペクトル包絡情報である線スペクトル対（以下、Ｌ
ＳＰという）を求めるスペクトル分析部、４３はスペク
トル分析部４２により求められたＬＳＰを符号化して、
そのＬＳＰ符号を多重化部６０に出力するとともに、そ
のＬＳＰを量子化して、量子化後のＬＳＰ（ＬＳＰ符号
を復号化した結果と同じ）を合成フィルタ４４のフィル
タ係数（線形予測係数）に変換し、そのフィルタ係数を
合成フィルタ４４と聴覚重み付け部４６に出力するスペ
クトル符号化部である。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS One embodiment of the present invention will be described below. Embodiment 1 FIG. FIG. 1 is a block diagram showing a speech encoding apparatus according to Embodiment 1 of the present invention. In the figure, reference numeral 41 denotes a noise suppression process for suppressing background noise superimposed on input speech, A pre-processing unit that executes a low-frequency rejection filter process for cutting a DC component analyzes an input voice that has been pre-processed by the pre-processing unit 41, and obtains a line spectrum pair (hereinafter, L) that is spectral envelope information of the voice.
SP), and 43 encodes the LSP determined by the spectrum analyzer 42,
The LSP code is output to the multiplexing unit 60, the LSP is quantized, and the quantized LSP (same as the result of decoding the LSP code) is converted into a filter coefficient (linear prediction coefficient) of the synthesis filter 44. And a spectrum encoding unit that outputs the filter coefficients to the synthesis filter 44 and the auditory weighting unit 46.

【００５９】４４はスペクトル符号化部４３が出力する
フィルタ係数を用いて、切換スイッチ５９により選択さ
れた仮の音源に対するフィルタリング処理を実行し、仮
の合成音を生成する合成フィルタ、４５は合成フィルタ
４４により生成された合成音と前処理部４１による前処
理後の入力音声との差信号を出力する減算器、４６はス
ペクトル符号化部４３が出力するフィルタ係数に基づい
て聴覚重み付けフィルタ係数を算出し、その聴覚重み付
けフィルタ係数を用いて、減算器４５が出力する差信号
に対する聴覚重み付けフィルタ処理を実行して聴覚重み
付け差信号を出力する聴覚重み付け部である。Reference numeral 44 denotes a synthesis filter that performs a filtering process on the temporary sound source selected by the changeover switch 59 using the filter coefficients output from the spectrum encoding unit 43 to generate a temporary synthesized sound, and 45 denotes a synthesis filter. A subtractor that outputs a difference signal between the synthesized sound generated by the pre-processing unit 41 and the input voice that has been pre-processed by the pre-processing unit 41; A hearing weighting unit that performs a hearing weighting filter process on the difference signal output by the subtracter 45 using the hearing weighting filter coefficient, and outputs a hearing weighted difference signal.

【００６０】４７は聴覚重み付け部４６が出力する聴覚
重み付け差信号のパワーを計算し、そのパワーの最小化
を図るため、インデックス（ゲイン符号、駆動音源符
号、適応音源符号）及び符号化モードを示すモード情報
を逐次更新する歪み最小化部、４８，４９は歪み最小化
部４７による更新後のインデックスに対応する符号語を
出力する符号帳を有し、その符号語から仮の音源を生成
する音源復号化部である。Reference numeral 47 denotes an index (gain code, driving excitation code, adaptive excitation code) and an encoding mode for calculating the power of the perceptual weighting difference signal output from the perceptual weighting unit 46 and minimizing the power. Distortion minimizing units 48 and 49 for sequentially updating mode information have a codebook for outputting a codeword corresponding to the index updated by the distortion minimizing unit 47, and a sound source for generating a temporary sound source from the codeword. It is a decoding unit.

【００６１】５０は過去の音源を所定長記憶し、歪み最
小化部４７から適応音源符号を受けると、その適応音源
符号に対応する過去の音源を周期的に繰り返す時系列ベ
クトルである適応符号ベクトルを出力する適応音源符号
帳、５１は非雑音的な複数の時系列ベクトルである駆動
符号ベクトルを格納し、歪み最小化部４７から駆動音源
符号を受けると、その駆動音源符号に対応する駆動符号
ベクトルを出力する駆動音源符号帳、５２はゲインに関
する符号語（ゲイン値を示す語）を格納し、歪み最小化
部４７からゲイン符号を受けると、そのゲイン符号に対
応するゲイン値を出力するゲイン符号帳、５３はゲイン
符号帳５２が出力するゲイン値を適応音源符号帳５０が
出力する適応符号ベクトルに乗算する乗算器、５４はゲ
イン符号帳５２が出力するゲイン値を駆動音源符号帳５
１が出力する駆動符号ベクトルに乗算する乗算器、５５
は乗算器５３の乗算結果と乗算器５４の乗算結果を加算
し、その加算結果（仮の音源）を出力する加算器であ
る。Reference numeral 50 denotes an adaptive code vector which is a time series vector which stores a past excitation of a predetermined length and receives the adaptive excitation code from the distortion minimizing section 47, and periodically repeats the past excitation corresponding to the adaptive excitation code. An adaptive excitation codebook 51 that outputs a driving code vector that is a plurality of non-noise time-series vectors, receives a driving excitation code from the distortion minimizing unit 47, and outputs a driving code corresponding to the driving excitation code. A driving excitation codebook 52 for outputting a vector stores a codeword relating to the gain (a word indicating a gain value), and receives a gain code from the distortion minimizing section 47 to output a gain value corresponding to the gain code. Codebook 53 is a multiplier for multiplying the gain value output from gain codebook 52 by the adaptive code vector output from adaptive excitation codebook 50, and 54 is Driving the gain value to force excitation codebook 5
A multiplier for multiplying the driving code vector output by 1; 55
Is an adder that adds the multiplication result of the multiplier 53 and the multiplication result of the multiplier 54 and outputs the addition result (temporary sound source).

【００６２】５６は雑音的な複数の時系列ベクトルであ
る駆動符号ベクトルを格納し、歪み最小化部４７から駆
動音源符号を受けると、その駆動音源符号に対応する駆
動符号ベクトルを出力する駆動音源符号帳、５７はゲイ
ンに関する符号語（ゲイン値を示す語）を格納し、歪み
最小化部４７からゲイン符号を受けると、そのゲイン符
号に対応するゲイン値を出力するゲイン符号帳、５８は
ゲイン符号帳５７が出力するゲイン値を駆動音源符号帳
５６が出力する駆動符号ベクトルに乗算し、その乗算結
果（仮の音源）を出力する乗算器である。Reference numeral 56 stores a driving code vector which is a plurality of noise-like time-series vectors, and upon receiving a driving excitation code from the distortion minimizing section 47, outputs a driving code vector corresponding to the driving excitation code. A codebook 57 stores a codeword (a word indicating a gain value) related to the gain, and receives a gain code from the distortion minimizing unit 47 and outputs a gain value corresponding to the gain code. A multiplier that multiplies the gain value output from the codebook 57 by the driving code vector output from the driving excitation codebook 56 and outputs the multiplication result (temporary excitation).

【００６３】５９は歪み最小化部４７からモード情報を
受けると、そのモード情報にしたがって音源復号化部４
８が出力する仮の音源又は音源復号化部４９が出力する
仮の音源を選択し、その選択した仮の音源を合成フィル
タ４４に与える切換スイッチである。なお、前処理部４
１，スペクトル分析部４２，スペクトル符号化部４３，
合成フィルタ４４，減算器４５，聴覚重み付け部４６，
歪み最小化部４７，音源復号化部４８，４９及び切換ス
イッチ５９から符号化手段が構成されている。６０はス
ペクトル符号化部４３により符号化されたＬＳＰ符号
と、歪み最小化部４７による更新後のインデックス及び
モード情報とを多重化して音声符号を生成し、その音声
符号を出力する多重化部（多重化手段）である。When the mode information 59 is received from the distortion minimizing section 47, the excitation decoding section 4 receives the mode information in accordance with the mode information.
A switch for selecting a temporary sound source output by 8 or a temporary sound source output by the sound source decoding unit 49 and providing the selected temporary sound source to the synthesis filter 44. The pre-processing unit 4
1, spectrum analysis unit 42, spectrum encoding unit 43,
Synthesis filter 44, subtractor 45, auditory weighting unit 46,
Encoding means is composed of the distortion minimizing section 47, the excitation decoding sections 48 and 49, and the changeover switch 59. A multiplexing unit 60 generates a speech code by multiplexing the LSP code encoded by the spectrum encoding unit 43 and the index and mode information updated by the distortion minimizing unit 47, and outputs the speech code. Multiplexing means).

【００６４】図２はこの発明の実施の形態１による音声
復号化装置を示す構成図であり、図において、６１は音
声符号化装置により多重化されたＬＳＰ符号とインデッ
クスとモード情報とを分離する分離部（分離手段）、６
２，６３は分離部６１により分離されたインデックスに
対応する符号語を出力する符号帳を有し、その符号語か
ら音源を生成する音源復号化部である。FIG. 2 is a block diagram showing a speech decoding apparatus according to Embodiment 1 of the present invention. In the figure, reference numeral 61 demultiplexes an LSP code, an index, and mode information multiplexed by the speech encoding apparatus. Separation unit (separation means), 6
Reference numerals 2 and 63 denote a sound source decoding unit that has a codebook that outputs a codeword corresponding to the index separated by the separation unit 61 and generates a sound source from the codeword.

【００６５】６４は過去の音源を所定長記憶し、分離部
６１から適応音源符号を受けると、その適応音源符号に
対応する過去の音源を周期的に繰り返す時系列ベクトル
である適応符号ベクトルを出力する適応音源符号帳、６
５は非雑音的な複数の時系列ベクトルである駆動符号ベ
クトルを格納し、分離部６１から駆動音源符号を受ける
と、その駆動音源符号に対応する駆動符号ベクトルを出
力する駆動音源符号帳、６６はゲインに関する符号語
（ゲイン値を示す語）を格納し、分離部６１からゲイン
符号を受けると、そのゲイン符号に対応するゲイン値を
出力するゲイン符号帳、６７はゲイン符号帳６６が出力
するゲイン値を適応音源符号帳６４が出力する適応符号
ベクトルに乗算する乗算器、６８はゲイン符号帳６６が
出力するゲイン値を駆動音源符号帳６５が出力する駆動
符号ベクトルに乗算する乗算器、６９は乗算器６７の乗
算結果と乗算器６８の乗算結果を加算し、その加算結果
（仮の音源）を出力する加算器である。Reference numeral 64 stores a past excitation of a predetermined length, and upon receiving an adaptive excitation code from the separation unit 61, outputs an adaptive code vector which is a time series vector that periodically repeats the past excitation corresponding to the adaptive excitation code. Adaptive excitation codebook, 6
Reference numeral 5 denotes a driving excitation codebook that stores a driving code vector that is a plurality of non-noise time-series vectors and, when receiving a driving excitation code from the separation unit 61, outputs a driving code vector corresponding to the driving excitation code. Is a gain codebook that stores a code word (a word indicating a gain value) related to the gain and receives a gain code from the separation unit 61 and outputs a gain value corresponding to the gain code. A multiplier that multiplies the gain value by the adaptive code vector output by the adaptive excitation codebook 64; 68, a multiplier that multiplies the gain value output by the gain codebook 66 by the driving code vector output by the driving excitation codebook 65; Is an adder that adds the multiplication result of the multiplier 67 and the multiplication result of the multiplier 68 and outputs the addition result (temporary sound source).

【００６６】７０は雑音的な複数の時系列ベクトルであ
る駆動符号ベクトルを格納し、分離部６１から駆動音源
符号を受けると、その駆動音源符号に対応する駆動符号
ベクトルを出力する駆動音源符号帳、７１はゲインに関
する符号語（ゲイン値を示す語）を格納し、分離部６１
からゲイン符号を受けると、そのゲイン符号に対応する
ゲイン値を出力するゲイン符号帳、７２はゲイン符号帳
７１が出力するゲイン値を駆動音源符号帳７０が出力す
る駆動符号ベクトルに乗算し、その乗算結果（仮の音
源）を出力する乗算器である。Reference numeral 70 denotes a driving excitation codebook that stores a driving code vector, which is a plurality of noise-like time-series vectors, and outputs a driving code vector corresponding to the driving excitation code when receiving the driving excitation code from the separation unit 61. , 71 store a code word (a word indicating a gain value) relating to the gain, and
, A gain codebook that outputs a gain value corresponding to the gain code, 72 multiplies the gain value output by the gain codebook 71 with the driving code vector output by the driving excitation codebook 70, This is a multiplier that outputs a multiplication result (temporary sound source).

【００６７】７３は分離部６１からモード情報を受ける
と、そのモード情報にしたがって音源復号化部６２が出
力する仮の音源又は音源復号化部６３が出力する仮の音
源を選択し、その選択した仮の音源を合成フィルタ７５
に与える切換スイッチ、７４は分離部６１が出力するＬ
ＳＰ符号を復号化し、その復号結果を合成フィルタ７５
のフィルタ係数（線形予測係数）に変換して、そのフィ
ルタ係数を合成フィルタ７５と後処理部７６に出力する
スペクトル復号化部である。Upon receiving the mode information from the separating section 61, the 73 selects a temporary excitation output from the excitation decoding section 62 or a temporary excitation output from the excitation decoding section 63 according to the mode information. Synthesizing filter 75 for temporary sound source
, A switch 74 provided to the L
The SP code is decoded and the decoded result is
And a spectrum decoding unit that converts the filter coefficients to a synthesis filter 75 and a post-processing unit 76.

【００６８】７５はスペクトル復号化部７４が出力する
フィルタ係数を用いて、切換スイッチ７３により選択さ
れた仮の音源に対するフィルタリング処理を実行し、仮
の合成音を生成する合成フィルタ、７６はスペクトル復
号化部７４が出力するフィルタ係数等に基づいて合成フ
ィルタ７５により生成された合成音に対する音声強調処
理などの後処理を実行し、入力音声の再生結果（出力音
声）を出力する後処理部である。なお、音源復号化部６
２，６３，切換スイッチ７３，スペクトル復号化部７
４，合成フィルタ７５及び後処理部７６から復号化手段
が構成されている。図３はこの発明の実施の形態１によ
る符号語配列方法を示すフローチャートである。Reference numeral 75 denotes a synthesis filter that performs a filtering process on the temporary sound source selected by the changeover switch 73 using the filter coefficients output from the spectrum decoding unit 74 to generate a temporary synthesized sound, and 76 denotes a spectrum decoding filter. A post-processing unit that performs post-processing such as voice emphasis processing on the synthesized sound generated by the synthesis filter 75 based on the filter coefficients and the like output by the conversion unit 74, and outputs the reproduction result (output sound) of the input sound. . The sound source decoding unit 6
2, 63, changeover switch 73, spectrum decoding section 7
4. Decoding means is composed of the synthesis filter 75 and the post-processing unit 76. FIG. 3 is a flowchart showing a codeword arrangement method according to Embodiment 1 of the present invention.

【００６９】次に動作について説明する。従来の音声符
号化装置及び音声復号化装置は、５〜５０ｍｓ程度を１
フレームとして、フレーム単位に処理を実行する。Next, the operation will be described. The conventional speech encoding device and speech decoding device require about 5 to 50 ms for one.
The processing is executed for each frame as a frame.

【００７０】まず、音声符号化装置の前処理部４１は、
入力音声を受けると、その入力音声に重畳している背景
雑音を抑圧する雑音抑圧処理を実行するとともに、入力
音声の直流成分をカットする低域阻止フィルタ処理を実
行する。スペクトル分析部４２は、前処理部４１が入力
音声に対する前処理を実行すると、前処理後の入力音声
を分析して、音声のスペクトル包絡情報であるＬＳＰを
求める。First, the pre-processing unit 41 of the speech encoding device
Upon receiving the input voice, the CPU executes noise suppression processing for suppressing background noise superimposed on the input voice, and also executes low-frequency rejection filter processing for cutting a DC component of the input voice. When the pre-processing unit 41 performs pre-processing on the input voice, the spectrum analysis unit 42 analyzes the input voice after the pre-processing and obtains LSP, which is spectrum envelope information of the voice.

【００７１】そして、スペクトル符号化部４３は、スペ
クトル分析部４２により求められたＬＳＰを符号化し
て、そのＬＳＰ符号を多重化部６０に出力する。また、
そのＬＳＰを量子化して、量子化後のＬＳＰを合成フィ
ルタ４４のフィルタ係数に変換し、そのフィルタ係数を
合成フィルタ４４と聴覚重み付け部４６に出力する。Then, spectrum encoding section 43 encodes the LSP obtained by spectrum analysis section 42 and outputs the LSP code to multiplexing section 60. Also,
The LSP is quantized, the LSP after the quantization is converted into a filter coefficient of the synthesis filter 44, and the filter coefficient is output to the synthesis filter 44 and the auditory weighting unit 46.

【００７２】合成フィルタ４４は、スペクトル符号化部
４３からフィルタ係数を受けると、そのフィルタ係数を
用いて、切換スイッチ５９により選択された仮の音源に
対するフィルタリング処理を実行し、仮の合成音を生成
する。仮の音源の生成処理は後述する。減算器４５は、
合成フィルタ４４が合成音を生成すると、その合成音と
前処理部４１による前処理後の入力音声との差信号を出
力し、聴覚重み付け部４６は、スペクトル符号化部４３
が出力するフィルタ係数に基づいて聴覚重み付けフィル
タ係数を算出し、その聴覚重み付けフィルタ係数を用い
て、減算器４５が出力する差信号に対する聴覚重み付け
フィルタ処理を実行して聴覚重み付け差信号を出力す
る。Upon receiving the filter coefficients from the spectrum encoding unit 43, the synthesis filter 44 performs a filtering process on the temporary sound source selected by the changeover switch 59 using the filter coefficients to generate a temporary synthesized sound. I do. The process of generating a temporary sound source will be described later. The subtractor 45
When the synthesis filter 44 generates a synthesized sound, a difference signal between the synthesized sound and the input voice after preprocessing by the preprocessing unit 41 is output.
Calculates an auditory weighting filter coefficient on the basis of the filter coefficient output by the above, and performs an auditory weighting filter process on the difference signal output by the subtractor 45 using the auditory weighting filter coefficient to output an auditory weighting difference signal.

【００７３】歪み最小化部４７は、インデックス及び符
号化モードを逐次更新することにより、聴覚重み付け部
４６が出力する聴覚重み付け差信号のパワーの最小化を
図る。即ち、インデックスとモード情報を適宜選択し
て、音源復号化部４８，４９と切換スイッチ５９に出力
する毎に、その聴覚重み付け差信号のパワーを計算し、
その計算結果であるパワーが最も小さくなるインデック
スとモード情報の組合せを検索する。そして、聴覚重み
付け差信号のパワーが最小になるインデックスとモード
情報が求まると、そのインデックスとモード情報を多重
化部６０に出力する。ただし、音源復号化部４９には適
応音源符号帳が内蔵されていないので、第二の符号化モ
ードを示すモード情報を出力する場合には、適応音源符
号を出力しない。The distortion minimizing section 47 attempts to minimize the power of the perceptual weighting difference signal output from the perceptual weighting section 46 by sequentially updating the index and the coding mode. That is, each time the index and the mode information are appropriately selected and output to the excitation decoding sections 48 and 49 and the changeover switch 59, the power of the perceptual weighting difference signal is calculated,
As a result of the calculation, a combination of an index and mode information with the smallest power is searched. Then, when the index and the mode information that minimize the power of the auditory weighting difference signal are obtained, the index and the mode information are output to the multiplexing unit 60. However, since excitation excitation decoding section 49 does not include an adaptive excitation codebook, it does not output an adaptive excitation code when outputting mode information indicating the second encoding mode.

【００７４】音源復号化部４８，４９は、歪み最小化部
４７からインデックスを受けると、そのインデックスに
応じて仮の音源を生成する。具体的には、まず、音源復
号化部４８の適応音源符号帳５０は、過去の音源を所定
長記憶し、歪み最小化部４７から適応音源符号を受ける
と、その適応音源符号に対応する過去の音源を周期的に
繰り返す時系列ベクトルを適応符号ベクトルとして出力
する。なお、適応音源符号帳５０は、歪み最小化部４７
がインデックス及びモード情報を選択した後で、そのイ
ンデックス及びモード情報に対して、切換スイッチ５９
が出力した仮の音源を選択して出力すると、その仮の音
源を最終的な音源として記憶する。Upon receiving the index from distortion minimizing section 47, excitation decoding sections 48 and 49 generate a temporary excitation according to the index. More specifically, first, adaptive excitation codebook 50 of excitation decoding section 48 stores a past excitation of a predetermined length, and receives an adaptive excitation code from distortion minimizing section 47, and stores the past excitation corresponding to the adaptive excitation code. Is output as an adaptive code vector. Note that adaptive excitation codebook 50 includes distortion minimizing section 47.
After selecting the index and mode information, the changeover switch 59
When the selected temporary sound source is output, the temporary sound source is stored as the final sound source.

【００７５】音源復号化部４８の駆動音源符号帳５１
は、非雑音的な複数の時系列ベクトルである駆動符号ベ
クトルを格納し、歪み最小化部４７から駆動音源符号を
受けると、その駆動音源符号に対応する駆動符号ベクト
ルを出力する。ただし、駆動音源符号帳５１は、予め、
各時系列ベクトルを複数のパルス位置と極性で表現する
代数的音源テーブルを備えることにより、歪み最小化部
４７が出力する駆動音源符号に基づいて代数的音源を生
成し、その代数的音源を駆動符号ベクトルとして出力す
るようにしてもよい。Driven excitation codebook 51 of excitation decoding section 48
Stores a driving code vector, which is a plurality of non-noise time-series vectors, and upon receiving a driving excitation code from the distortion minimizing unit 47, outputs a driving code vector corresponding to the driving excitation code. However, the driving excitation codebook 51 has a
By providing an algebraic excitation table that expresses each time-series vector by a plurality of pulse positions and polarities, an algebraic excitation is generated based on the driving excitation code output by the distortion minimizing unit 47, and the algebraic excitation is driven. You may make it output as a code vector.

【００７６】そして、ゲイン符号帳５２がゲイン符号に
対応するゲイン値を出力すると、適応音源符号帳５０か
ら出力された適応符号ベクトルと駆動音源符号帳５１か
ら出力された駆動符号ベクトルは、乗算器５３，５４に
よりゲイン値が乗算され、加算器５５により乗算器５
３，５４の乗算結果が相互に加算される。When gain codebook 52 outputs a gain value corresponding to the gain code, adaptive code vector output from adaptive excitation codebook 50 and drive code vector output from drive excitation codebook 51 are multiplied by a multiplier. The gain value is multiplied by 53 and 54, and the multiplier 5 is multiplied by an adder 55.
The 3,54 multiplication results are added to each other.

【００７７】一方、音源復号化部４９の駆動音源符号帳
５６は、雑音的な複数の時系列ベクトルである駆動符号
ベクトルを格納し、歪み最小化部４７から駆動音源符号
を受けると、その駆動音源符号に対応する駆動符号ベク
トルを出力する。ただし、駆動音源符号帳５６は、予
め、各時系列ベクトルを複数のパルス位置と極性で表現
する代数的音源テーブルを備えることにより、歪み最小
化部４７が出力する駆動音源符号に基づいて代数的音源
を生成し、その代数的音源を駆動符号ベクトルとして出
力するようにしてもよい。そして、ゲイン符号帳５７が
ゲイン符号に対応するゲイン値を出力すると、駆動音源
符号帳５６から出力された駆動符号ベクトルは、乗算器
５８によりゲイン値が乗算される。On the other hand, driving excitation codebook 56 of excitation decoding section 49 stores a driving code vector, which is a plurality of noise-like time-series vectors, and receives driving excitation code from distortion minimizing section 47 to drive the excitation codebook. A driving code vector corresponding to the excitation code is output. However, the driving excitation codebook 56 is provided with an algebraic excitation table in which each time-series vector is represented by a plurality of pulse positions and polarities in advance, so that an algebraic excitation code output from the distortion minimizing unit 47 is obtained. A sound source may be generated, and the algebraic sound source may be output as a driving code vector. When the gain codebook 57 outputs a gain value corresponding to the gain code, the driving code vector output from the driving excitation codebook 56 is multiplied by the gain value by the multiplier 58.

【００７８】このようにして、音源復号化部４８の加算
器５５から仮の音源が出力され、音源復号化部４９の乗
算器５８から仮の音源が出力されると、切換スイッチ５
９は、歪み最小化部４７が出力するモード情報にしたが
って音源復号化部４８が出力する仮の音源又は音源復号
化部４９が出力する仮の音源の何れか一方を選択し、そ
の選択した仮の音源を合成フィルタ４４に与える。As described above, when the provisional excitation is output from adder 55 of excitation decoding section 48 and the temporary excitation is output from multiplier 58 of excitation decoding section 49, changeover switch 5
9 selects either the temporary excitation output by the excitation decoding section 48 or the temporary excitation output by the excitation decoding section 49 in accordance with the mode information output by the distortion minimizing section 47, and selects the selected temporary excitation. Is given to the synthesis filter 44.

【００７９】多重化部６０は、スペクトル符号化部４３
により符号化されたＬＳＰ符号と、歪み最小化部４７に
よる更新後のインデックス及びモード情報（聴覚重み付
け差信号のパワーが最小となるインデックス及びモード
情報）とを多重化して音声符号を生成し、その音声符号
を出力する。The multiplexing section 60 includes a spectrum encoding section 43
Is multiplexed with the index and mode information (index and mode information that minimizes the power of the auditory weighting difference signal) updated by the distortion minimizing unit 47 to generate a speech code. Outputs speech codes.

【００８０】次に、音声復号化装置の分離部６１は、音
声符号化装置から出力された音声符号を入力すると、そ
の音声符号に含まれているＬＳＰ符号とインデックスと
モード情報とを分離する。Next, upon input of the speech code output from the speech encoding device, the separation unit 61 of the speech decoding device separates the LSP code, index, and mode information contained in the speech code.

【００８１】音源復号化部６２，６３は、分離部６１か
らインデックスを受けると、そのインデックスに応じて
仮の音源を生成する。具体的には、まず、音源復号化部
６２の適応音源符号帳６４は、過去の音源を所定長記憶
し、分離部６１から適応音源符号を受けると、その適応
音源符号に対応する過去の音源を周期的に繰り返す時系
列ベクトルを適応符号ベクトルとして出力する。なお、
適応音源符号帳６４は、切換スイッチ７３が仮の音源を
選択して出力すると、その仮の音源を最終的な音源とし
て記憶する。Upon receiving the index from separation section 61, excitation decoding sections 62 and 63 generate a temporary excitation according to the index. Specifically, first, adaptive excitation codebook 64 of excitation decoding section 62 stores a past excitation at a predetermined length and, when receiving an adaptive excitation code from separation section 61, outputs a past excitation corresponding to the adaptive excitation code. Is output as an adaptive code vector. In addition,
When the changeover switch 73 selects and outputs a temporary excitation, the adaptive excitation codebook 64 stores the temporary excitation as the final excitation.

【００８２】音源復号化部６２の駆動音源符号帳６５
は、非雑音的な複数の時系列ベクトルである駆動符号ベ
クトルを格納し、分離部６１から駆動音源符号を受ける
と、その駆動音源符号に対応する駆動符号ベクトルを出
力する。ただし、駆動音源符号帳６５は、予め、各時系
列ベクトルを複数のパルス位置と極性で表現する代数的
音源テーブルを備えることにより、分離部６１が出力す
る駆動音源符号に基づいて代数的音源を生成し、その代
数的音源を駆動符号ベクトルとして出力するようにして
もよい。Driving excitation codebook 65 of excitation decoding section 62
Stores a driving code vector, which is a plurality of non-noise time-series vectors, and upon receiving a driving excitation code from the separation unit 61, outputs a driving code vector corresponding to the driving excitation code. However, the driving excitation codebook 65 is provided with an algebraic excitation table in which each time-series vector is represented by a plurality of pulse positions and polarities in advance, so that the algebraic excitation is output based on the driving excitation code output by the separation unit 61. Alternatively, the algebraic excitation may be generated and output as a driving code vector.

【００８３】そして、ゲイン符号帳６６がゲイン符号に
対応するゲイン値を出力すると、適応音源符号帳６４か
ら出力された適応符号ベクトルと駆動音源符号帳６５か
ら出力された駆動符号ベクトルは、乗算器６７，６８に
よりゲイン値が乗算され、加算器６９により乗算器６
７，６８の乗算結果が相互に加算される。When gain codebook 66 outputs a gain value corresponding to the gain code, the adaptive code vector output from adaptive excitation codebook 64 and the driving code vector output from driving excitation codebook 65 are multiplied by a multiplier. The gain value is multiplied by 67 and 68, and the multiplier 6 is multiplied by an adder 69.
The 7,68 multiplication results are added to each other.

【００８４】一方、音源復号化部６３の駆動音源符号帳
７０は、雑音的な複数の時系列ベクトルである駆動符号
ベクトルを格納し、分離部６１から駆動音源符号を受け
ると、その駆動音源符号に対応する駆動符号ベクトルを
出力する。ただし、駆動音源符号帳７０は、予め、各時
系列ベクトルを複数のパルス位置と極性で表現する代数
的音源テーブルを備えることにより、分離部６１が出力
する駆動音源符号に基づいて代数的音源を生成し、その
代数的音源を駆動符号ベクトルとして出力するようにし
てもよい。そして、ゲイン符号帳７１がゲイン符号に対
応するゲイン値を出力すると、駆動音源符号帳７０から
出力された駆動符号ベクトルは、乗算器７２によりゲイ
ン値が乗算される。On the other hand, driving excitation codebook 70 of excitation decoding section 63 stores a driving excitation vector which is a plurality of noise-like time-series vectors, and receives driving excitation code from separation section 61, and receives the excitation excitation code. Is output. However, the driving excitation codebook 70 is provided with an algebraic excitation table in which each time-series vector is expressed by a plurality of pulse positions and polarities in advance, so that the algebraic excitation is output based on the driving excitation code output by the separation unit 61. Alternatively, the algebraic excitation may be generated and output as a driving code vector. When the gain codebook 71 outputs a gain value corresponding to the gain code, the driving code vector output from the driving excitation codebook 70 is multiplied by the gain value by the multiplier 72.

【００８５】このようにして、音源復号化部６２の加算
器６９から仮の音源が出力され、音源復号化部６３の乗
算器７２から仮の音源が出力されると、切換スイッチ７
３は、分離部６１が出力するモード情報にしたがって音
源復号化部６２が出力する仮の音源又は音源復号化部６
３が出力する仮の音源の何れか一方を選択し、その選択
した仮の音源を合成フィルタ７５に与える。As described above, when the provisional excitation is output from adder 69 of excitation decoding section 62 and the temporary excitation is output from multiplier 72 of excitation decoding section 63, changeover switch 7
3 is a temporary excitation or excitation decoding unit 6 output by the excitation decoding unit 62 according to the mode information output by the separation unit 61.
3 selects one of the tentative sound sources output, and gives the selected tentative sound source to the synthesis filter 75.

【００８６】スペクトル復号化部７４は、分離部６１が
ＬＳＰ符号を出力すると、そのＬＳＰ符号を復号化し、
その復号結果を合成フィルタ７５のフィルタ係数に変換
して、そのフィルタ係数を合成フィルタ７５と後処理部
７６に出力する。合成フィルタ７５は、スペクトル復号
化部７４からフィルタ係数を受けると、そのフィルタ係
数を用いて、切換スイッチ７３により選択された仮の音
源に対するフィルタリング処理を実行し、仮の合成音を
生成する。後処理部７６は、スペクトル復号化部７４が
出力するフィルタ係数等に基づいて合成フィルタ７５に
より生成された合成音に対する音声強調処理などの後処
理を実行し、入力音声の再生結果（出力音声）を出力す
る。When the separating section 61 outputs the LSP code, the spectrum decoding section 74 decodes the LSP code,
The decoding result is converted into a filter coefficient of the synthesis filter 75, and the filter coefficient is output to the synthesis filter 75 and the post-processing unit. Upon receiving the filter coefficient from the spectrum decoding unit 74, the synthesis filter 75 performs a filtering process on the temporary sound source selected by the changeover switch 73 using the filter coefficient, and generates a temporary synthetic sound. The post-processing unit 76 performs post-processing such as voice enhancement processing on the synthesized sound generated by the synthesis filter 75 based on the filter coefficients and the like output from the spectrum decoding unit 74, and reproduces the input sound (output sound). Is output.

【００８７】ここで、図４は音声符号化装置及び音声復
号化装置により使用されるゲイン符号帳の一例を示す説
明図である。特に、図４（ａ）はゲイン符号帳５２，６
６の一例を示し、図４（ｂ）はゲイン符号帳５７，７１
の一例を示している。FIG. 4 is an explanatory diagram showing an example of the gain codebook used by the speech encoding device and the speech decoding device. Particularly, FIG. 4A shows the gain codebooks 52 and 6.
FIG. 4B shows an example of gain codebooks 57 and 71.
An example is shown.

【００８８】この例の場合、各ゲイン符号帳は１２８個
のゲイン符号語を格納している。ただし、ゲイン符号帳
５２，６６に格納されているゲイン符号語は、適応符号
ベクトルと駆動符号ベクトルに乗じる２個のゲイン値の
組を示す符号語から構成され、ゲイン符号帳５７，７１
に格納されているゲイン符号語は、駆動符号ベクトルに
乗じる１個のゲイン値を示す符号語から構成されてい
る。インデックスと評価値順位は、各ゲイン符号帳内に
実際には格納されていないものであるが、説明の便宜の
ため記載している。インデックスは上の符号語から順番
に０から１２７の値となっている。評価値はゲイン符号
語のパワー（２乗和）の値である（評価値としては、ゲ
イン符号語のパワーに限るものではなく、ゲイン符号語
の平均振幅などでもよい）。例えば、インデックスが
「１」の符号語のパワーの順位は「１０２」である。In this example, each gain codebook stores 128 gain codewords. However, the gain codewords stored in the gain codebooks 52 and 66 are composed of codewords indicating sets of two gain values that are multiplied by the adaptive code vector and the driving code vector, and the gain codebooks 57 and 71
Is composed of a codeword indicating one gain value by which the driving code vector is multiplied. Although the index and the evaluation value rank are not actually stored in each gain codebook, they are described for convenience of explanation. The index has a value of 0 to 127 in order from the upper code word. The evaluation value is the value of the power (sum of squares) of the gain codeword (the evaluation value is not limited to the power of the gain codeword, but may be the average amplitude of the gain codeword). For example, the power ranking of a codeword having an index of “1” is “102”.

【００８９】各ゲイン符号帳の動作としては、あるゲイ
ン符号を入力すると、そのゲイン符号に一致するインデ
ックス位置に格納しているゲイン符号語を出力する。各
ゲイン符号帳に格納されているゲイン符号語は、学習用
音声とその符号化音声との歪みが小さくなるように学習
して作成される。そして、音声符号を伝送する際の符号
誤りによる出力音声の劣化を最小限に抑えるため、適切
にゲイン符号語の並べ換えが行われる。As an operation of each gain codebook, when a certain gain code is input, a gain codeword stored at an index position corresponding to the gain code is output. The gain codeword stored in each gain codebook is created by learning so that distortion between the learning speech and the encoded speech is reduced. Then, in order to minimize the deterioration of the output voice due to a code error when transmitting the voice code, the gain codeword is appropriately rearranged.

【００９０】例えば、ゲイン符号に１ビット誤りを実際
に与えたときに生じる劣化の大きさの期待値を計算し、
さらに、ランダムに選択した２つのゲイン符号語を交換
したときに生じる劣化の大きさの期待値を計算し、前者
の期待値と比べて後者の期待値が減少するときに実際に
ゲイン符号語の格納順序を交換する。この作業を期待値
の減少が微小になるまで繰り返す。For example, the expected value of the magnitude of the degradation that occurs when a one-bit error is actually given to the gain code is calculated,
Furthermore, an expected value of the magnitude of the degradation that occurs when two randomly selected gain codewords are exchanged is calculated, and when the expected value of the latter is reduced as compared with the expected value of the former, the gain codeword of the gain is actually calculated. Swap the storage order. This operation is repeated until the decrease in the expected value becomes very small.

【００９１】ただし、ゲイン符号帳５７，７１に格納さ
れているゲイン符号語については、各ゲイン符号語のパ
ワーを調査し、そのパワーを基準にして、ゲイン符号帳
５７，７１に格納されているゲイン符号語の格納順序を
更新する。即ち、ゲイン符号帳５７，７１に格納されて
いるゲイン符号語のパワーをそれぞれ調査すると（ステ
ップＳＴ１）、既にゲイン符号語の並べ換えを完了して
いるゲイン符号帳５２，６６に格納されているゲイン符
号語のパワーの順位と同じ順番になるように、ゲイン符
号帳５７，７１に格納されているゲイン符号語の格納順
序を並べ換える処理を実行する（ステップＳＴ２）。However, with respect to the gain codewords stored in the gain codebooks 57 and 71, the power of each gain codeword is checked and stored in the gain codebooks 57 and 71 based on the power. Update the storage order of gain codewords. That is, when the powers of the gain codewords stored in the gain codebooks 57 and 71 are checked (step ST1), the gains stored in the gain codebooks 52 and 66 in which the rearrangement of the gain codewords has already been completed. A process of rearranging the storage order of the gain codewords stored in the gain codebooks 57 and 71 so as to be in the same order as the power order of the codewords is executed (step ST2).

【００９２】図４の各ゲイン符号帳は既に並べ換えが完
了したものである。ゲイン符号帳５２，６６では、例え
ば、インデックスが「０」に対応するゲイン符号語のパ
ワー（評価値）順位が「４１」であるので、ゲイン符号
帳５７，７１ではパワー（評価値）順位が「４１」のゲ
イン符号語が「０」のインデックスに対応するように格
納されている。インデックスが「１」以降のゲイン符号
語についても同様にして、格納順序が並び換えられる。Each gain codebook in FIG. 4 has already been rearranged. In the gain codebooks 52 and 66, for example, the power (evaluation value) rank of the gain codeword corresponding to the index “0” is “41”. The gain codeword of “41” is stored so as to correspond to the index of “0”. Similarly, the storage order is rearranged for gain codewords whose index is “1” or later.

【００９３】図５は多重化部６０から出力される音声符
号の一例を示す説明図である。多重化部６０では、ＬＳ
Ｐ符号，モード情報，ゲイン符号，駆動音源符号及び適
応音源符号を多重化して（ただし、適応音源符号は第一
の符号化モードの場合に限り多重化の対象に含められ
る）、音声符号を生成するが、この実施の形態１では、
符号化モードが第一の符号化モードであっても、第二の
符号化モードであっても、ゲイン符号の符号化ビット数
と、ゲイン符号の多重化位置とが変化しないように音声
符号を生成している。FIG. 5 is an explanatory diagram showing an example of a speech code output from the multiplexing section 60. In the multiplexing unit 60, LS
P code, mode information, gain code, driving excitation code and adaptive excitation code are multiplexed (however, the adaptive excitation code is included in the multiplexing target only in the first coding mode) to generate a speech code. However, in the first embodiment,
Whether the encoding mode is the first encoding mode or the second encoding mode, the audio code is encoded so that the number of encoded bits of the gain code and the multiplexing position of the gain code do not change. Has been generated.

【００９４】ここで、音声符号化装置が第一の符号化モ
ードで符号化して生成した音声符号（図５（ａ）を参
照）に伝送誤りが重畳することにより、その音声符号が
図５（ｂ）に示すように変化した場合を想定する。この
場合、音声復号化装置は、符号化モードが第二の符号化
モードであると誤認して、入力音声の復号化処理を実施
するが、上述したように、符号化モードが第一の符号化
モードであっても、第二の符号化モードであっても、ゲ
イン符号の符号化ビット数と、ゲイン符号の多重化位置
とが変化しないように音声符号を生成しているので、モ
ード情報に伝送誤りが生じても、音声復号化装置はゲイ
ン符号の値を正確に認識することができる。図５の例で
は、モード情報の誤認の有無に拘わらず、ゲイン符号の
値が２になる。Here, the transmission error is superimposed on the speech code (see FIG. 5 (a)) generated by the speech coding apparatus in the first coding mode, so that the speech code is changed as shown in FIG. It is assumed that the state changes as shown in b). In this case, the speech decoding device erroneously recognizes that the encoding mode is the second encoding mode and performs the decoding process of the input speech, but as described above, the encoding mode is the first encoding mode. In the coding mode and the second coding mode, the speech code is generated so that the number of bits of the gain code and the multiplexing position of the gain code do not change. Therefore, even if a transmission error occurs, the speech decoding apparatus can accurately recognize the value of the gain code. In the example of FIG. 5, the value of the gain code is 2 irrespective of whether or not the mode information is erroneously recognized.

【００９５】したがって、音声復号化装置におけるゲイ
ン符号帳６６，７１は、モード情報に伝送誤りが生じて
も、同一値のゲイン符号に対応するゲイン符号語（ゲイ
ン値）を出力することができる。また、ゲイン符号帳６
６，７１に格納されているゲイン符号語は、上述したよ
うに、パワー値順位が同じ順番になるように並べ換えら
れているので、モード情報に伝送誤りが生じても、同一
値のゲイン符号を入力できれば、出力するゲイン値の大
きさが極端に変化することはない。Therefore, even if a transmission error occurs in the mode information, the gain codebooks 66 and 71 in the speech decoding apparatus can output a gain codeword (gain value) corresponding to the same gain code. Also, gain codebook 6
As described above, the gain code words stored in 6, 6 and 71 are rearranged so that the power value order is the same. Therefore, even if a transmission error occurs in mode information, a gain code having the same value is used. If it can be input, the magnitude of the output gain value will not change drastically.

【００９６】以上で明らかなように、この実施の形態１
によれば、ゲイン符号帳５２，６６（または５７，７
１）のゲイン符号語の格納順序が、他のゲイン符号帳５
７，７１（または５２，６６）のゲイン符号語に関する
評価値の順位と相応して、並び換えられているように構
成したので、音声符号化装置においては、伝送誤りが発
生して、音声復号化装置がモード情報を誤認しても、音
声の再生品質の劣化を抑制することが可能な音声符号を
生成することができる効果を奏する。一方、音声復号化
装置においては、モード情報を誤認しても、音声の再生
品質の劣化を抑制することができる効果を奏する。As is apparent from the above, the first embodiment
According to the above, the gain codebooks 52, 66 (or 57, 7
The storage order of the gain codeword of 1) is different from that of the other gain codebooks 5.
Since the rearrangement is performed in accordance with the order of the evaluation values of the gain codewords of 7, 71 (or 52, 66), the speech encoding apparatus generates a transmission error and performs speech decoding. This makes it possible to generate a speech code capable of suppressing the deterioration of the reproduction quality of the speech even if the coding device misidentifies the mode information. On the other hand, in the audio decoding device, even if the mode information is erroneously recognized, it is possible to suppress the deterioration of the audio reproduction quality.

【００９７】また、この実施の形態１によれば、ゲイン
符号語に関する評価値として、そのゲイン符号語のパワ
ー又は平均振幅を用いるように構成したので、音声符号
化装置においては、伝送誤りが発生して、音声復号化装
置がモード情報を誤認しても、音声復号化装置により再
生される音声のパワーや振幅が大きく劣化することのな
い音声符号を生成することができる効果を奏する。一
方、音声復号化装置においては、モード情報を誤認して
も、音声のパワーや振幅の大きな劣化を招くことなく、
音声を再生することができる効果を奏する。According to the first embodiment, the power or the average amplitude of the gain codeword is used as the evaluation value for the gain codeword. As a result, even if the speech decoding device mistakenly recognizes the mode information, it is possible to generate a speech code that does not significantly degrade the power and amplitude of the speech reproduced by the speech decoding device. On the other hand, in the audio decoding device, even if the mode information is erroneously recognized, the power and amplitude of the audio are not significantly deteriorated.
It has the effect of being able to reproduce audio.

【００９８】さらに、この実施の形態１によれば、ゲイ
ン符号帳５２，６６，５７，７１が音源ゲイン（ゲイン
値）を出力する符号帳であるように構成したので、音声
符号化装置においては、伝送誤りが発生して、音声復号
化装置がモード情報を誤認しても、音声復号化装置によ
り再生される音声のゲイン値が大きく劣化することのな
い音声符号を生成することができる効果を奏する。一
方、音声復号化装置においては、モード情報を誤認して
も、音声のゲイン値の大きな劣化を招くことなく、音声
を再生することができる効果を奏する。Further, according to the first embodiment, gain codebooks 52, 66, 57, and 71 are configured to be codebooks that output excitation gains (gain values). Even if a transmission error occurs and the speech decoding device misidentifies the mode information, it is possible to generate a speech code that does not significantly degrade the gain value of the speech reproduced by the speech decoding device. Play. On the other hand, in the audio decoding device, even if the mode information is erroneously recognized, there is an effect that the audio can be reproduced without causing significant deterioration of the audio gain value.

【００９９】実施の形態２．図６はこの発明の実施の形
態２による音声符号化装置を示す構成図であり、図にお
いて、図１と同一符号は同一または相当部分を示すので
説明を省略する。８１はスペクトル符号化部４３により
量子化されたＬＳＰからモード情報を決定する音源モー
ド選択部、８２は音源モード選択部８１からモード情報
を受けると、そのモード情報にしたがって駆動音源符号
帳５１が出力する駆動符号ベクトル又は駆動音源符号帳
５６が出力する駆動符号ベクトルを選択するとともに、
ゲイン符号帳５２が出力するゲイン値又はゲイン符号帳
５７が出力するゲイン値を選択する切換スイッチであ
る。Embodiment 2 FIG. 6 is a block diagram showing a speech encoding apparatus according to Embodiment 2 of the present invention. In the figure, the same reference numerals as in FIG. 1 denote the same or corresponding parts, and a description thereof will be omitted. Reference numeral 81 denotes an excitation mode selection unit that determines mode information from the LSP quantized by the spectrum encoding unit 43, and 82 receives mode information from the excitation mode selection unit 81, and the driving excitation codebook 51 outputs according to the mode information. While selecting the driving code vector to be output or the driving code vector output by the driving excitation codebook 56,
A changeover switch for selecting a gain value output by the gain codebook 52 or a gain value output by the gain codebook 57.

【０１００】８３は切換スイッチ８２により選択された
ゲイン値を適応音源符号帳５０が出力する適応符号ベク
トルに乗算する乗算器、８４は切換スイッチ８２により
選択されたゲイン値を切換スイッチ８２により選択され
た駆動符号ベクトルに乗算する乗算器、８５は乗算器８
３の乗算結果と乗算器８４の乗算結果を加算し、その加
算結果（仮の音源）を出力する加算器である。なお、音
源モード選択部８１，切換スイッチ８２，乗算器８３，
８４及び加算器８５は符号化手段を構成する。A multiplier 83 multiplies the adaptive code vector output from the adaptive excitation codebook 50 by the gain value selected by the changeover switch 82, and a multiplier 84 selects the gain value selected by the changeover switch 82 by the changeover switch 82. A multiplier for multiplying the driving code vector obtained by the driving,
This is an adder that adds the multiplication result of No. 3 and the multiplication result of the multiplier 84 and outputs the addition result (temporary sound source). Note that a sound source mode selection unit 81, a changeover switch 82, a multiplier 83,
The 84 and the adder 85 constitute an encoding unit.

【０１０１】図７はこの発明の実施の形態２による音声
復号化装置を示す構成図であり、図において、図２と同
一符号は同一または相当部分を示すので説明を省略す
る。９１はスペクトル復号化部７４により量子化された
ＬＳＰからモード情報を決定する音源モード選択部、９
２は音源モード選択部９１からモード情報を受けると、
そのモード情報にしたがって駆動音源符号帳６５が出力
する駆動符号ベクトル又は駆動音源符号帳７０が出力す
る駆動符号ベクトルを選択するとともに、ゲイン符号帳
６６が出力するゲイン値又はゲイン符号帳７１が出力す
るゲイン値を選択する切換スイッチである。FIG. 7 is a block diagram showing a speech decoding apparatus according to Embodiment 2 of the present invention. In the figure, the same reference numerals as in FIG. 2 denote the same or corresponding parts, and a description thereof will be omitted. Reference numeral 91 denotes a sound source mode selection unit for determining mode information from the LSP quantized by the spectrum decoding unit 74;
2 receives the mode information from the sound source mode selection unit 91,
According to the mode information, the drive code vector output from the drive excitation codebook 65 or the drive code vector output from the drive excitation codebook 70 is selected, and the gain value output from the gain codebook 66 or the gain codebook 71 is output. A changeover switch for selecting a gain value.

【０１０２】９３は切換スイッチ９２により選択された
ゲイン値を適応音源符号帳６４が出力する適応符号ベク
トルに乗算する乗算器、９４は切換スイッチ９２により
選択されたゲイン値を切換スイッチ９２により選択され
た駆動符号ベクトルに乗算する乗算器、９５は乗算器９
３の乗算結果と乗算器９４の乗算結果を加算し、その加
算結果（仮の音源）を出力する加算器である。なお、音
源モード選択部９１，切換スイッチ９２，乗算器９３，
９４及び加算器９５は復号化手段を構成する。Reference numeral 93 denotes a multiplier for multiplying the adaptive code vector output from the adaptive excitation codebook 64 by the gain value selected by the changeover switch 92, and 94 denotes a gain value selected by the changeover switch 92 by the changeover switch 92. Multiplier for multiplying the driving code vector by
This is an adder that adds the multiplication result of No. 3 and the multiplication result of the multiplier 94 and outputs the addition result (temporary sound source). Note that a tone generator mode selector 91, a changeover switch 92, a multiplier 93,
The adder 94 and the adder 95 constitute a decoding unit.

【０１０３】次に動作について説明する。上記実施の形
態１では、切換スイッチ５９（または７３）が音源復号
化部４８（または６２）が出力する仮の音源又は音源復
号化部４９（または６３）が出力する仮の音源を選択し
て、その選択した仮の音源を合成フィルタ４４（または
７５）に出力するものについて示したが、図６及び図７
に示すように、切換スイッチ８２（または９２）が駆動
音源符号帳５１（または６５）の駆動符号ベクトル又は
駆動音源符号帳５６（または７０）の駆動符号ベクトル
を選択して乗算器８３（または９３）に出力するととも
に、ゲイン符号帳５２（または６６）のゲイン値又はゲ
イン符号帳５７（または７１）のゲイン値を選択して乗
算器８４（または９４）に出力し、加算器８５（または
９５）が乗算器８３（または９３）の乗算結果と乗算器
８４（または９４）の乗算結果を加算し、その加算結果
を仮の音源として合成フィルタ４４（または７５）に出
力するようにしてもよい。この場合でも、上記実施の形
態１と同様の効果を奏することができる。Next, the operation will be described. In the first embodiment, the changeover switch 59 (or 73) selects the temporary sound source output from the sound source decoding unit 48 (or 62) or the temporary sound source output from the sound source decoding unit 49 (or 63). , The selected temporary sound source is output to the synthesis filter 44 (or 75).
As shown in (5), the changeover switch 82 (or 92) selects the driving code vector of the driving excitation codebook 51 (or 65) or the driving code vector of the driving excitation codebook 56 (or 70) and the multiplier 83 (or 93). ) And the gain value of the gain codebook 52 (or 66) or the gain value of the gain codebook 57 (or 71) is selected and output to the multiplier 84 (or 94), and the adder 85 (or 95) is selected. ) May add the multiplication result of the multiplier 83 (or 93) and the multiplication result of the multiplier 84 (or 94), and output the addition result to the synthesis filter 44 (or 75) as a temporary sound source. . In this case, the same effect as in the first embodiment can be obtained.

【０１０４】ただし、ゲイン符号帳５７，７１に格納さ
れているゲイン符号語は、ゲイン符号帳５２，６６に格
納されているゲイン符号語と同様に、適応符号ベクトル
と駆動符号ベクトルに乗じる２個のゲイン値の組を示す
符号語から構成されているものとする。However, the gain codewords stored in the gain codebooks 57 and 71 are the same as the gain codewords stored in the gain codebooks 52 and 66. Is assumed to be composed of a code word indicating a set of gain values of.

【０１０５】なお、上記実施の形態１では、各ゲイン符
号帳のゲイン符号語の格納順序を並べ換えるものについ
て示したが、これに限るものではなく、パワー符号帳や
ＬＳＰ符号帳などのベクトル符号帳についても、モード
毎に異なる符号帳を使用する構成であれば、各符号語の
パワーや振幅を評価値として、その順位が一致するよう
に並び換えられた符号帳を使用する構成も可能である。In the first embodiment, the order in which the gain codewords are stored in each gain codebook is rearranged. However, the present invention is not limited to this, and vector code such as a power codebook or an LSP codebook may be used. As for the book, if the configuration uses a different codebook for each mode, it is also possible to use a codebook that is rearranged so that its order matches the power or amplitude of each codeword as an evaluation value. is there.

【０１０６】また、２つの符号帳の評価値順位について
は、順位の差が小さい範囲であれば、完全に一致してい
なくてもよく、同様の効果を奏することができる。ま
た、上記の方法で２つの符号帳の評価値順位を一致させ
た後に、２つの符号帳中の符号語を同時に並べ換えて、
ゲイン符号にビット誤りが重畳したときの劣化を最小限
に抑制するなど、様々な方法で並び換えを行うことが可
能である。The evaluation value ranking of the two codebooks does not need to completely match as long as the difference between the rankings is small, and the same effect can be obtained. Also, after matching the evaluation value ranks of the two codebooks by the above method, the codewords in the two codebooks are rearranged simultaneously,
The rearrangement can be performed by various methods, such as minimizing deterioration when a bit error is superimposed on the gain code.

【０１０７】実施の形態３．上記実施の形態１では、ゲ
イン符号帳５２，６６及びゲイン符号帳５７，７１に格
納されているゲイン符号語の格納順序を図４に示すよう
に並べ換えるものについて示したが、図８に示すように
並べ換えるようにしてもよい。Embodiment 3 In the first embodiment, the storage order of the gain codewords stored in the gain codebooks 52 and 66 and the gain codebooks 57 and 71 is rearranged as shown in FIG. 4, but is shown in FIG. May be rearranged as follows.

【０１０８】具体的には、ゲイン符号帳５２，６６には
１２８個のゲイン符号語を格納し、ゲイン符号帳５７，
７１には２５６個のゲイン符号語を格納する。ゲイン符
号帳５２，６６に格納されているゲイン符号語は、適応
符号ベクトルと駆動符号ベクトルに乗じる２個のゲイン
値の組を示す符号語から構成され、ゲイン符号帳５７，
７１に格納されているゲイン符号語は、駆動符号ベクト
ルに乗じる１個のゲイン値を示す符号語から構成されて
いる。More specifically, gain codebooks 52 and 66 store 128 gain codewords, and gain codebooks 57 and
The 71 stores 256 gain codewords. The gain codewords stored in the gain codebooks 52 and 66 are composed of codewords indicating sets of two gain values that are multiplied by the adaptive code vector and the driving code vector.
The gain codeword stored in 71 is composed of a codeword indicating one gain value by which the drive code vector is multiplied.

【０１０９】インデックスと、インデックスの上位７ビ
ットの値と、評価値順位とは、各ゲイン符号帳内に実際
には格納されていないものであるが、説明の便宜のため
記載している。インデックスは上の符号語から順番に０
から１２７の値、または、０から２５５の値となってい
る。インデックスの上位７ビットの値は、例えば、イン
デックスが「０」又は「１」の場合に「０」となり、イ
ンデックスが「２」又は「３」の場合に「１」となるよ
うに、２つずつが同じ値を持っている。Although the index, the value of the upper 7 bits of the index, and the evaluation value rank are not actually stored in each gain codebook, they are described for convenience of explanation. The index is 0 from the codeword above
To 127 or a value from 0 to 255. The value of the upper 7 bits of the index is, for example, two such that the index is “0” when the index is “0” or “1” and “1” when the index is “2” or “3”. Each have the same value.

【０１１０】評価値はゲイン符号語のパワー（２乗和）
の値であり（評価値としては、ゲイン符号語のパワーに
限るものではなく、ゲイン符号語の平均振幅などでもよ
い）、ゲイン符号帳５２，６６については、各ゲイン符
号語の評価値順位が示されている。ゲイン符号帳５７，
７１については、インデックスの上位７ビットが同じ値
である２つのゲイン符号語における評価値の平均値に関
する順位が評価値平均順位として示されている。The evaluation value is the power (sum of squares) of the gain codeword.
(The evaluation value is not limited to the power of the gain codeword, but may be the average amplitude of the gain codeword.) For the gain codebooks 52 and 66, the evaluation value ranking of each gain codeword is It is shown. Gain codebook 57,
Regarding 71, the rank regarding the average of the evaluation values in the two gain codewords in which the upper 7 bits of the index have the same value is shown as the evaluation value average rank.

【０１１１】各ゲイン符号帳の動作としては、あるゲイ
ン符号を入力すると、そのゲイン符号に一致するインデ
ックス位置に格納しているゲイン符号語を出力する。各
ゲイン符号帳に格納されているゲイン符号語は、学習用
音声とその符号化音声との歪みが小さくなるように学習
して作成される。そして、ゲイン符号帳５２，６６につ
いては、音声符号を伝送する際の符号誤りによる出力音
声の劣化を最小限に抑えるため、適切にゲイン符号語の
並べ換えを行う。As an operation of each gain codebook, when a certain gain code is input, a gain codeword stored at an index position corresponding to the gain code is output. The gain codeword stored in each gain codebook is created by learning so that distortion between the learning speech and the encoded speech is reduced. Then, for the gain codebooks 52 and 66, the gain codewords are appropriately rearranged in order to minimize the deterioration of the output voice due to a code error when transmitting the voice code.

【０１１２】例えば、ゲイン符号に１ビット誤りを実際
に与えたときに生じる劣化の大きさの期待値を計算し、
さらに、ランダムに選択した２つのゲイン符号語を交換
したときに生じる劣化の大きさの期待値を計算し、前者
の期待値と比べて後者の期待値が減少するときに実際に
ゲイン符号語の格納順序を交換する。この作業を期待値
の減少が微小になるまで繰り返す。For example, the expected value of the magnitude of the degradation that occurs when a one-bit error is actually given to the gain code is calculated,
Furthermore, an expected value of the magnitude of the degradation that occurs when two randomly selected gain codewords are exchanged is calculated, and when the expected value of the latter is reduced as compared with the expected value of the former, the gain codeword of the gain is actually calculated. Swap the storage order. This operation is repeated until the decrease in the expected value becomes very small.

【０１１３】ゲイン符号帳５７，７１については、最初
に、ゲイン符号帳５２，６６と同様に、音声符号を伝送
する際の符号誤りによる出力音声の劣化を最小限に抑え
るために、適切にゲイン符号語の並べ換えを行う。次
に、その時点でインデックスの上位７ビットが同じ値と
なる２つのゲイン符号語を対とする。そして、各ゲイン
符号語対のパワーの平均値を求め、ゲイン符号帳５７，
７１におけるパワーの平均値の順位を調べて、既にゲイ
ン符号語の並べ換えが完了しているゲイン符号帳５２，
６６のゲイン符号語のパワー順位と同じ順番になるよう
に、ゲイン符号帳５７，７１中のゲイン符号語対を並べ
換える。First, the gain codebooks 57 and 71 are appropriately gained in the same manner as the gain codebooks 52 and 66 in order to minimize the deterioration of the output voice due to a code error when transmitting the voice code. Reorder codewords. Next, two gain codewords whose upper 7 bits of the index have the same value at that time are paired. Then, the average value of the power of each gain codeword pair is obtained, and the gain codebook 57,
By examining the rank of the average value of the power in the gain codebook 71, the gain codebooks 52,
The gain codeword pairs in the gain codebooks 57 and 71 are rearranged so as to be in the same order as the power order of the 66 gain codewords.

【０１１４】図８の各ゲイン符号帳は既に並べ換えが完
了したものである。ゲイン符号帳５２，６６では、例え
ば、インデックスが「０」に対応するゲイン符号語のパ
ワー（評価値）順位が「４１」であるので、ゲイン符号
帳５７，７１ではパワー（評価値）平均順位が「４１」
のゲイン符号語対が「０」のインデックスに対応するよ
うに格納されている。インデックスが「１」以降のゲイ
ン符号語についても同様にして、格納順序が並び換えら
れる。Each gain codebook in FIG. 8 has already been rearranged. In the gain codebooks 52 and 66, for example, the power (evaluation value) rank of the gain codeword corresponding to the index “0” is “41”. Is "41"
Are stored so as to correspond to the index of “0”. Similarly, the storage order is rearranged for gain codewords whose index is “1” or later.

【０１１５】図９は多重化部６０から出力される音声符
号の一例を示す説明図である。多重化部６０では、ＬＳ
Ｐ符号，モード情報，ゲイン符号，駆動音源符号及び適
応音源符号を多重化して（ただし、適応音源符号は第一
の符号化モードの場合に限り多重化の対象に含められ
る）、音声符号を生成するが、この実施の形態３では、
符号化モードが第一の符号化モードの場合はゲイン符号
の符号化ビット数が「７」であり、第二の符号化モード
の場合はゲイン符号の符号化ビット数が「８」である。
ただし、第一の符号化モードにおけるゲイン符号７ビッ
トと、第二の符号化モードにおけるゲイン符号の上位７
ビットの多重化位置が一致するように音声符号を生成し
ている。FIG. 9 is an explanatory diagram showing an example of a speech code output from the multiplexing section 60. In the multiplexing unit 60, LS
P code, mode information, gain code, driving excitation code and adaptive excitation code are multiplexed (however, the adaptive excitation code is included in the multiplexing target only in the first coding mode) to generate a speech code. However, in the third embodiment,
When the encoding mode is the first encoding mode, the number of encoded bits of the gain code is “7”, and when the encoding mode is the second encoding mode, the number of encoded bits of the gain code is “8”.
However, the gain code 7 bits in the first coding mode and the upper 7 bits of the gain code in the second coding mode
The speech code is generated such that the bit multiplexing positions match.

【０１１６】ここで、音声符号化装置が第一の符号化モ
ードで符号化して生成した音声符号（図９（ａ）を参
照）に伝送誤りが重畳することにより、その音声符号が
図９（ｂ）に示すように変化した場合を想定する。この
場合、音声復号化装置は、符号化モードが第二の符号化
モードであると誤認して、入力音声の復号化処理を実施
するが、上述したように、第一の符号化モードにおける
ゲイン符号７ビットと、第二の符号化モードにおけるゲ
イン符号の上位７ビットの多重化位置が一致するように
音声符号を生成しているので、モード情報に伝送誤りが
生じても、音声復号化装置はゲイン符号の値を正確に認
識することができる。Here, a transmission error is superimposed on a speech code (see FIG. 9A) generated by the speech coding apparatus in the first coding mode, so that the speech code becomes It is assumed that the state changes as shown in b). In this case, the speech decoding apparatus erroneously recognizes that the encoding mode is the second encoding mode, and performs the decoding process on the input speech. However, as described above, the gain in the first encoding mode is Since the audio code is generated such that the multiplexing position of the 7-bit code and the upper 7 bits of the gain code in the second encoding mode match, even if a transmission error occurs in the mode information, the audio decoding device Can accurately recognize the value of the gain code.

【０１１７】即ち、符号化モードが第一の符号化モード
であるため、図９（ａ）に示すように、ゲイン符号の値
が「１」であり、モード情報に伝送誤りがなければ、ゲ
イン符号帳６６のインデックスが「１」であるゲイン符
号語（評価値順位が「１０２」の符号語）を用いて復号
処理を行う。しかし、モード情報に伝送誤りが生じる
と、符号化モードが第二の符号化モードであると誤認す
るが、この実施の形態３では、誤認の有無に拘わらず、
ゲイン符号帳７１のインデックスの上位７ビットが
「１」であるゲイン符号語を用いて復号処理を行うこと
になる。具体的には、図９（ｂ）に示すように、ゲイン
符号の次のビットが「０」であるため、インデックスが
「２」（＝１×２＋０）であるゲイン符号語（評価値順
位が「１０２」の符号語）を用いて復号処理を行うこと
になる。That is, since the encoding mode is the first encoding mode, as shown in FIG. 9A, if the value of the gain code is “1” and there is no transmission error in the mode information, the gain The decoding process is performed using a gain codeword whose index in the codebook 66 is “1” (a codeword whose evaluation value rank is “102”). However, if a transmission error occurs in the mode information, the coding mode is erroneously recognized as the second coding mode. In the third embodiment, regardless of the presence or absence of the erroneous recognition,
The decoding process is performed using the gain codeword whose upper 7 bits of the index of the gain codebook 71 is “1”. Specifically, as shown in FIG. 9B, since the next bit of the gain code is “0”, the gain codeword whose index is “2” (= 1 × 2 + 0) (evaluation value rank is The decoding process is performed using the code word “102”).

【０１１８】したがって、音声復号化装置におけるゲイ
ン符号帳６６，７１は、モード情報に伝送誤りが生じて
も、評価値順位と評価値平均順位が一致又は略一致する
ゲイン符号に対応するゲイン符号語（ゲイン値）を出力
することができるので、モード情報に伝送誤りが生じて
も、出力するゲイン値の大きさが極端に変化することは
ない。Therefore, even if a transmission error occurs in the mode information, the gain codebooks 66 and 71 in the speech decoding apparatus store the gain codeword corresponding to the gain code whose evaluation value rank and the evaluation value average rank match or almost match. (Gain value) can be output, so that even if a transmission error occurs in the mode information, the magnitude of the output gain value does not extremely change.

【０１１９】これにより、上記実施の形態１と同様の効
果を奏することができる。なお、この実施の形態３で
は、図１の音声符号化装置及び図２の音声復号化装置に
適用するものについて示したが、上記実施の形態２のよ
うに、図６の音声符号化装置及び図７の音声復号化装置
に適用するようにしてもよい。As a result, the same effects as in the first embodiment can be obtained. Although the third embodiment has been described with reference to the case where the present invention is applied to the speech encoding apparatus of FIG. 1 and the speech decoding apparatus of FIG. 2, as in the second embodiment, the speech encoding apparatus of FIG. The present invention may be applied to the audio decoding device of FIG.

【０１２０】実施の形態４．図１０はこの発明の実施の
形態４による符号語配列方法が適用する符号語配列装置
を示す構成図であり、図において、１０１は駆動音源符
号帳５１，６５に相当する駆動音源符号帳、１０２は駆
動音源符号帳５６，７０に相当する駆動音源符号帳、１
０３，１０４は合成フィルタ、１０５は距離計算部、１
０６は距離計算部１０５の計算結果（評価値の偏差の合
計値）が減少して最小化するまで、駆動音源符号帳１０
２に格納されている符号語である駆動符号ベクトルの格
納順序を更新する符号語入れ換え部である。Embodiment 4 FIG. 10 is a configuration diagram showing a codeword arrangement apparatus to which a codeword arrangement method according to Embodiment 4 of the present invention is applied. In the figure, reference numeral 101 denotes a driving excitation codebook corresponding to the driving excitation codebooks 51 and 65; Are driving excitation codebooks equivalent to driving excitation codebooks 56 and 70,
03 and 104 are synthesis filters, 105 is a distance calculation unit, 1
Reference numeral 06 denotes the excitation codebook 10 until the calculation result of the distance calculation unit 105 (the total value of the deviations of the evaluation values) decreases and is minimized.
2 is a codeword exchanging unit that updates the storage order of the drive code vector, which is the codeword stored in 2.

【０１２１】次に動作について説明する。駆動音源符号
帳１０１については、上記実施の形態１におけるゲイン
符号帳５２等と同様に、音声符号を伝送する際の符号誤
りによる出力音声の劣化を最小限に抑えるために、予め
適切な符号語の並べ換えを実施する。また、駆動音源符
号帳１０１，１０２を使用して、多くの学習用の音声信
号を入力とする音声符号化処理を実施し、各フレーム毎
に、合成フィルタ１０３，１０４のためのフィルタ係
数、駆動音源符号語、ゲイン値を学習用データとして、
別途蓄積する。Next, the operation will be described. As for the excitation codebook 101, as in the case of the gain codebook 52 in the first embodiment, in order to minimize the deterioration of the output speech due to a code error when transmitting the speech code, an appropriate codeword Perform the rearrangement. Further, using the driving excitation codebooks 101 and 102, a speech encoding process is performed using a large number of learning speech signals as inputs, and a filter coefficient for the synthesis filters 103 and 104 is calculated for each frame. The excitation codeword and gain value are used as learning data.
Accumulate separately.

【０１２２】まず、距離計算部１０５は、上記学習用デ
ータに含まれる各フレーム毎の駆動音源符号を駆動音源
符号帳１０１，１０２に出力し、フィルタ係数を合成フ
ィルタ１０３，１０４に出力する。First, distance calculation section 105 outputs the driving excitation code for each frame included in the learning data to driving excitation codebooks 101 and 102, and outputs the filter coefficients to synthesis filters 103 and 104.

【０１２３】駆動音源符号帳１０１は、距離計算部１０
５から駆動音源符号を受けると、その駆動音源符号に対
応する駆動符号ベクトルを出力し、合成フィルタ１０３
は、距離計算部１０５から出力されたフィルタ係数を用
いて、その駆動符号ベクトルに対する合成フィルタリン
グを実施して第一の合成音を生成する。駆動音源符号帳
１０２は、距離計算部１０５から駆動音源符号を受ける
と、その駆動音源符号に対応する駆動符号ベクトルを出
力し、合成フィルタ１０４は、距離計算部１０５から出
力されたフィルタ係数を用いて、その駆動符号ベクトル
に対する合成フィルタリングを実施して第二の合成音を
生成する。Driving excitation codebook 101 includes distance calculation section 10
5 receives a driving excitation code, outputs a driving code vector corresponding to the driving excitation code,
Uses the filter coefficient output from the distance calculation unit 105 to perform synthesis filtering on the driving code vector to generate a first synthesized sound. Upon receiving the driving excitation code from distance calculating section 105, driving excitation codebook 102 outputs a driving code vector corresponding to the driving excitation code, and synthesis filter 104 uses the filter coefficient output from distance calculating section 105. Then, synthesis filtering is performed on the driving code vector to generate a second synthesized sound.

【０１２４】距離計算部１０５は、合成フィルタ１０３
により生成された第一の合成音と合成フィルタ１０４に
より生成された第二の合成音との距離をフレーム毎に計
算し、全フレームの距離値を合計して、その合計距離を
符号語入れ換え部１０６に出力する。The distance calculation unit 105 includes the synthesis filter 103
Is calculated for each frame, the distance between the first synthesized sound generated by the above and the second synthesized sound generated by the synthesis filter 104 is summed, and the total distance is calculated by the code word replacing unit. Output to 106.

【０１２５】符号語入れ換え部１０６は、距離計算部１
０５が合計距離を出力すると、その合計距離を記憶す
る。ここまでが図１０の符号語配列装置の初期化処理で
ある。続いて行われる繰返し処理は以下の通りである。The code word exchange unit 106 is provided with the distance calculation unit 1
When 05 outputs the total distance, the total distance is stored. Up to this point is the initialization processing of the code word array device of FIG. Subsequent repetitive processing is as follows.

【０１２６】符号語入れ換え部１０６は、ランダムに選
択した２つの駆動音源符号に対応する駆動音源符号帳１
０２の符号語の入れ換えを実施する。距離計算部１０５
は、再度、上記学習用データに含まれる各フレーム毎の
駆動音源符号を駆動音源符号帳１０１と符号語の入れ換
えが行われた駆動音源符号帳１０２に出力し、フィルタ
係数を合成フィルタ１０３，１０４に出力する。The codeword exchanging section 106 generates the driving excitation codebook 1 corresponding to the two driving excitation codes selected at random.
The codeword of 02 is exchanged. Distance calculator 105
Outputs the driving excitation code for each frame included in the learning data to the driving excitation codebook 101 and the driving excitation codebook 102 in which codewords have been exchanged again, and converts the filter coefficients into synthesis filters 103 and 104. Output to

【０１２７】そして、距離計算部１０５は、同様にして
生成された第一の合成音と第二の合成音を合成フィルタ
１０３，１０４から入力し、第一の合成音と第二の合成
音との距離をフレーム毎に計算し、全フレームの距離値
を合計して、その合計距離を符号語入れ換え部１０６に
出力する。The distance calculation section 105 receives the first synthesized sound and the second synthesized sound generated in the same manner from the synthesis filters 103 and 104, and outputs the first synthesized sound and the second synthesized sound. Is calculated for each frame, the distance values of all the frames are summed, and the total distance is output to the codeword replacing unit 106.

【０１２８】符号語入れ換え部１０６は、距離計算部１
０５から合計距離を受けると、その合計距離と、予め記
憶しておいた合計距離とを比較する。合計距離が減少し
ている場合には、今回入力した合計距離を新たに記憶
し、合計距離が減少していない場合には、前回の符号語
の入れ換えを元に戻す処理を実施する。そして、上記繰
返し処理の最初に戻る。ここまでの繰返し処理を合計距
離の減少が少なくなるまで繰り返し、駆動音源符号帳１
０２の符号語の並び換えを完了する。[0128] The code word exchanging unit 106 includes the distance calculating unit 1
When the total distance is received from 05, the total distance is compared with the total distance stored in advance. If the total distance has decreased, the total distance inputted this time is newly stored, and if the total distance has not decreased, processing for restoring the previous replacement of the code word is performed. Then, the process returns to the beginning of the repetition processing. The repetition processing up to this point is repeated until the decrease in the total distance is reduced, and driving excitation codebook 1
The rearrangement of the code word 02 is completed.

【０１２９】以上で明らかなように、この実施の形態４
によれば、距離計算部１０５により計算された合計距離
が減少して最小化するまで、駆動音源符号帳１０２の符
号語の入れ換えを実施するように構成したので、音声符
号化装置においては、伝送誤りが発生して、音声復号化
装置がモード情報を誤認しても、所定の評価値に関する
劣化の期待値が小さくなり、その結果、音声の再生品質
の劣化を抑制することが可能な音声符号を生成すること
ができる効果を奏する。一方、音声復号化装置において
は、モード情報を誤認しても、所定の評価値に関する劣
化の期待値が小さくなり、その結果、音声の再生品質の
劣化を抑制することができる効果を奏する。また、符号
化時と復号化時に異なる駆動音源符号帳が使用された場
合でも、所定評価値に関する劣化の期待値が低い復号結
果を与えることができるベクトル符号帳が得られる効果
も奏する。As is clear from the above, this embodiment 4
According to the configuration described above, the codewords of the driving excitation codebook 102 are exchanged until the total distance calculated by the distance calculation unit 105 is reduced and minimized. Even if an error occurs and the speech decoding device mistakenly recognizes the mode information, the expected value of the degradation related to the predetermined evaluation value is reduced, and as a result, the speech code capable of suppressing the degradation of the speech reproduction quality. Is produced. On the other hand, in the audio decoding device, even if the mode information is erroneously recognized, the expected value of the deterioration with respect to the predetermined evaluation value is reduced, and as a result, there is an effect that the deterioration of the sound reproduction quality can be suppressed. Further, even when different excitation codebooks are used at the time of encoding and at the time of decoding, a vector codebook that can provide a decoding result with a low expected value of deterioration related to the predetermined evaluation value is obtained.

【０１３０】なお、この実施の形態４では、距離計算部
１０５における距離としては、２つの合成音におけるサ
ンプル毎の値の差の２乗和、聴覚重み付けを行った２つ
の合成音におけるサンプル毎の値の差の２乗和、２つの
合成音のパワー差など様々なものを適用することができ
る。また、ここでは、駆動音源符号帳１０１，１０２に
関する並び換えについて説明したが、ゲイン符号帳、Ｌ
ＳＰ符号帳などの他の符号帳についても、複数備えてモ
ード切換を実施する場合には、同様な逐次交換処理によ
って符号語を並べ換えるようにしてもよい。In the fourth embodiment, as the distance in distance calculating section 105, the sum of squares of the difference between the values of the two synthesized sounds and the weight of the per-sample of the two synthesized sounds subjected to auditory weighting are used. Various things such as the sum of squares of the difference between the values and the power difference between two synthesized sounds can be applied. Also, here, the rearrangement of the driving excitation codebooks 101 and 102 has been described.
When a plurality of other codebooks such as the SP codebook are provided and the mode is switched, codewords may be rearranged by a similar sequential exchange process.

【０１３１】実施の形態５．図１１はこの発明の実施の
形態５による音声符号化装置を示す構成図であり、図１
２はこの発明の実施の形態５による音声復号化装置を示
す構成図である。図において、図１及び図２と同一符号
は同一または相当部分を示すので説明を省略する。１１
１は歪み最小化部４７による更新後のゲイン符号をマッ
ピングし、マッピング後のゲイン符号をゲイン符号帳５
７に出力するマッピング部、１１２は分離部６１により
分離されたゲイン符号をマッピングし、マッピング後の
ゲイン符号をゲイン符号帳７１に出力するマッピング部
である。なお、マッピング部１１１，１１２はマッピン
グ手段を構成している。Embodiment 5 FIG. FIG. 11 is a configuration diagram showing a speech encoding apparatus according to Embodiment 5 of the present invention.
FIG. 2 is a configuration diagram showing a speech decoding apparatus according to Embodiment 5 of the present invention. In the drawings, the same reference numerals as those in FIGS. 1 and 2 denote the same or corresponding parts, and a description thereof will not be repeated. 11
1 maps the gain code after updating by the distortion minimizing section 47 and maps the mapped gain code to the gain codebook 5.
7 is a mapping unit that maps the gain code separated by the separation unit 61 and outputs the mapped gain code to the gain codebook 71. Note that the mapping units 111 and 112 constitute mapping means.

【０１３２】次に動作について説明する。まず、音声符
号化装置のマッピング部１１１は、歪み最小化部４７か
らゲイン符号を受けると、所定のルールにしたがって写
像処理を実施し、そのゲイン符号に対応する写像ゲイン
符号（マッピング後のゲイン符号）をゲイン符号帳５７
に出力する。ただし、この実施の形態５におけるゲイン
符号帳５７は、上記実施の形態１におけるゲイン符号帳
５７のような評価順位を基準とするゲイン符号語の並べ
換えが実施されていないものとする。即ち、ゲイン符号
帳５７の格納順序が図１７（ｂ）に示す通りであるとす
る。Next, the operation will be described. First, upon receiving the gain code from the distortion minimizing unit 47, the mapping unit 111 of the speech coding apparatus performs a mapping process according to a predetermined rule, and performs a mapping gain code corresponding to the gain code (a gain code after mapping). ) To the gain codebook 57
Output to However, in the gain codebook 57 according to the fifth embodiment, it is assumed that the rearrangement of the gain codewords based on the evaluation order as in the gain codebook 57 according to the first embodiment is not performed. That is, it is assumed that the storage order of the gain codebook 57 is as shown in FIG.

【０１３３】ゲイン符号帳５７は、マッピング部１１１
から写像ゲイン符号を受けると、その写像ゲイン符号に
一致するインデックス位置に格納されているゲイン符号
語を出力する。ただし、この実施の形態５では、上記実
施の形態１におけるゲイン符号語と同様の並べ換え結果
を得ることができるように、マッピング部１１１は、図
１３に示すようなマッピング用テーブルを備えている。The gain codebook 57 includes a mapping unit 111
When receiving the mapping gain code from, it outputs the gain codeword stored at the index position corresponding to the mapping gain code. However, in the fifth embodiment, mapping section 111 has a mapping table as shown in FIG. 13 so that the same reordering result as the gain codeword in the first embodiment can be obtained.

【０１３４】例えば、図１３のマッピング用テーブルの
場合、マッピング部１１１が「０」のゲイン符号を入力
すると、「１」の写像ゲイン符号を出力する。これによ
り、ゲイン符号帳５７は、「１」の写像ゲイン符号に対
応する評価値順位が「４１」のゲイン値を出力すること
になる（図１７（ｂ）を参照）。したがって、上記実施
の形態１におけるゲイン符号帳５７が出力するゲイン値
と同一のゲイン値が得られる。なお、音声符号化装置の
その他の動作は上記実施の形態１と同様であるため説明
を省略する。For example, in the case of the mapping table of FIG. 13, when the mapping unit 111 inputs a gain code of “0”, it outputs a mapping gain code of “1”. As a result, the gain codebook 57 outputs a gain value whose evaluation value rank corresponding to the mapping gain code of “1” is “41” (see FIG. 17B). Therefore, the same gain value as the gain value output by gain codebook 57 in the first embodiment is obtained. Note that the other operations of the speech encoding apparatus are the same as those in the first embodiment, and a description thereof will not be repeated.

【０１３５】次に、音声復号化装置のマッピング部１１
２は、分離部６１からゲイン符号を受けると、所定のル
ールにしたがって写像処理を実施し、そのゲイン符号に
対応する写像ゲイン符号（マッピング後のゲイン符号）
をゲイン符号帳７１に出力する。ただし、この実施の形
態５におけるゲイン符号帳７１は、上記実施の形態１に
おけるゲイン符号帳７１のような評価順位を基準とする
ゲイン符号語の並べ換えが実施されていないものとす
る。即ち、ゲイン符号帳７１の格納順序が図１７（ｂ）
に示す通りであるとする。Next, the mapping unit 11 of the speech decoding apparatus
2 receives a gain code from the separation unit 61, performs mapping processing according to a predetermined rule, and maps a gain code corresponding to the gain code (gain code after mapping).
Is output to the gain codebook 71. However, in the gain codebook 71 according to the fifth embodiment, it is assumed that the rearrangement of the gain codeword based on the evaluation order as in the gain codebook 71 according to the first embodiment is not performed. That is, the storage order of the gain codebook 71 is as shown in FIG.
Is assumed to be as shown in FIG.

【０１３６】ゲイン符号帳７１は、マッピング部１１２
から写像ゲイン符号を受けると、その写像ゲイン符号に
一致するインデックス位置に格納されているゲイン符号
語を出力する。ただし、この実施の形態５では、上記実
施の形態１におけるゲイン符号語と同様の並べ換え結果
を得ることができるように、マッピング部１１２は、図
１３に示すようなマッピング用テーブルを備えている。The gain codebook 71 has a mapping unit 112
When receiving the mapping gain code from, it outputs the gain codeword stored at the index position corresponding to the mapping gain code. However, in the fifth embodiment, mapping section 112 has a mapping table as shown in FIG. 13 so that the same rearrangement result as the gain codeword in the first embodiment can be obtained.

【０１３７】例えば、図１３のマッピング用テーブルの
場合、マッピング部１１２が「０」のゲイン符号を入力
すると、「１」の写像ゲイン符号を出力する。これによ
り、ゲイン符号帳７１は、「１」の写像ゲイン符号に対
応する評価値順位が「４１」のゲイン値を出力すること
になる（図１７（ｂ）を参照）。したがって、上記実施
の形態１におけるゲイン符号帳７１が出力するゲイン値
と同一のゲイン値が得られる。なお、音声復号化装置の
その他の動作は上記実施の形態１と同様であるため説明
を省略する。For example, in the case of the mapping table of FIG. 13, when the mapping unit 112 inputs a gain code of “0”, it outputs a mapping gain code of “1”. As a result, the gain codebook 71 outputs a gain value whose evaluation value rank corresponding to the mapping gain code of “1” is “41” (see FIG. 17B). Therefore, the same gain value as the gain value output by gain codebook 71 in the first embodiment is obtained. Note that the other operations of the speech decoding device are the same as those in the first embodiment, and a description thereof will not be repeated.

【０１３８】以上で明らかなように、この実施の形態５
によれば、ゲイン符号をマッピングし、マッピング後の
ゲイン符号をゲイン符号帳５７，７１に出力するマッピ
ング部１１１，１１２を設けるように構成したので、ゲ
イン符号語の格納順序を予め評価値を基準にして更新す
ることなく、更新後の格納順序と等価な状態を構築する
ことができる効果を奏する。As is clear from the above, the fifth embodiment
According to the configuration, the mapping units 111 and 112 for mapping the gain codes and outputting the gain codes after the mapping to the gain codebooks 57 and 71 are provided. This makes it possible to construct a state equivalent to the storage order after the update without updating.

【０１３９】また、マッピング部１１１，１１２の写像
を複数用意して、音声符号に重畳する誤り条件に最適な
写像を使用するようにした場合には、メモリ量を大きく
増やすことなく、幅広い誤り条件下で品質劣化の少ない
音声符号化装置と音声復号化装置が得られる効果を奏す
る。When a plurality of mappings of the mapping units 111 and 112 are prepared and an optimum mapping is used for an error condition to be superimposed on a speech code, a wide range of error conditions can be used without greatly increasing the memory amount. There is an effect that an audio encoding device and an audio decoding device with low quality degradation can be obtained below.

【０１４０】なお、この実施の形態５では、ゲイン符号
帳５７，７１の前段に限りマッピング部１１１，１１２
を設けるものについて示したが、ゲイン符号帳５２，６
６の前段にもマッピング部１１１，１１２を設けるよう
にしてもよい。また、ゲイン符号帳５２，６６の前段に
限りマッピング部１１１，１１２を設けるようにしても
よい。In the fifth embodiment, mapping sections 111 and 112 are provided only in the preceding stage of gain codebooks 57 and 71.
Is shown, but the gain codebooks 52, 6
The mapping units 111 and 112 may also be provided at a stage preceding the step # 6. Further, mapping sections 111 and 112 may be provided only in the preceding stage of gain codebooks 52 and 66.

【０１４１】また、ゲイン符号帳以外の符号帳の前段に
マッピング部１１１，１１２を導入する構成も可能であ
るし、図６の音声符号化装置及び図７の音声復号化装置
におけるゲイン符号帳の前段にマッピング部１１１，１
１２を導入する構成も可能である。It is also possible to adopt a configuration in which mapping sections 111 and 112 are introduced at the preceding stage of a codebook other than the gain codebook, and the gain codebook in the speech encoding apparatus shown in FIG. 6 and the speech decoding apparatus shown in FIG. The mapping unit 111, 1
A configuration in which 12 is introduced is also possible.

【０１４２】さらに、ここで導入したマッピング部１１
１，１１２の写像を固定とせず、音声符号に対して外部
で適用される誤り訂正符号の条件に従って、複数の写像
を切り換えて使用する構成も可能である。例えば、モー
ド情報が強く保護されている場合には、ゲイン符号帳５
７，７１を単独でビット誤りに強いように設計した写像
を適用し、モード情報の保護が弱い場合には、これまで
説明してきた方法によってモード情報を誤ったときの劣
化を抑制するように写像を設計すればよい。Further, the mapping unit 11 introduced here
A configuration is also possible in which the mapping of 1,112 is not fixed, and a plurality of mappings are switched and used according to the conditions of an error correction code externally applied to the speech code. For example, if the mode information is strongly protected, the gain codebook 5
If mappings designed to be strong against bit errors are applied independently to each other, if the protection of the mode information is weak, the mapping is performed so as to suppress the deterioration when the mode information is erroneous by the method described above. Can be designed.

【０１４３】図１４は２つの写像を切り換えて使用する
場合の２つのマッピング用テーブルを示す説明図であ
る。第一のマッピング用テーブル（図１４（ａ）を参
照）は、モード情報が強く保護されている場合に使用す
るものであり、ゲイン符号帳５７，７１が既に単独でビ
ット誤りに強いように設計しておくことで、写像によっ
て符号が変化しないようになっている。第二のマッピン
グ用テーブル（図１４（ｂ）を参照）は、モード情報の
保護が弱い場合に使用するものであり、図１３のマッピ
ング用テーブルと同じものである。なお、第一のマッピ
ング用テーブルは省略して、写像を行うか否かを切り換
える方法でも構わない。FIG. 14 is an explanatory diagram showing two mapping tables when two mappings are switched and used. The first mapping table (see FIG. 14A) is used when the mode information is strongly protected, and is designed so that the gain codebooks 57 and 71 are already alone and resistant to bit errors. By doing so, the sign is not changed by the mapping. The second mapping table (see FIG. 14B) is used when the protection of the mode information is weak, and is the same as the mapping table in FIG. The first mapping table may be omitted, and a method of switching whether or not to perform mapping may be used.

【０１４４】[0144]

【発明の効果】以上のように、この発明によれば、複数
の符号帳が他の符号帳の符号語に関する評価値の順位と
相応して、符号語の格納順序が並び換えられているよう
に構成したので、伝送誤りが発生して、音声復号化装置
がモード情報を誤認しても、音声の再生品質の劣化を抑
制することが可能な音声符号を生成することができる効
果がある。As described above, according to the present invention, the storage order of codewords in a plurality of codebooks is rearranged in accordance with the order of evaluation values for codewords in other codebooks. Thus, even if a transmission error occurs and the speech decoding device mistakenly recognizes the mode information, there is an effect that a speech code capable of suppressing deterioration of speech reproduction quality can be generated.

【０１４５】この発明によれば、符号語に関する評価値
として、その符号語のパワー又は平均振幅を用いるよう
に構成したので、伝送誤りが発生して、音声復号化装置
がモード情報を誤認しても、音声復号化装置により再生
される音声のパワーや振幅が大きく劣化することのない
音声符号を生成することができる効果がある。According to the present invention, since the power or the average amplitude of the code word is used as the evaluation value for the code word, a transmission error occurs, and the speech decoding apparatus misidentifies the mode information. Also, there is an effect that it is possible to generate a speech code without significantly deteriorating the power and amplitude of the speech reproduced by the speech decoding device.

【０１４６】この発明によれば、複数の符号帳が音源ゲ
インを出力する符号帳であるように構成したので、伝送
誤りが発生して、音声復号化装置がモード情報を誤認し
ても、音声復号化装置により再生される音声のゲイン値
が大きく劣化することのない音声符号を生成することが
できる効果がある。According to the present invention, since the plurality of codebooks are configured to be codebooks for outputting excitation gains, even if a transmission error occurs and the speech decoding device misidentifies mode information, the There is an effect that it is possible to generate a speech code that does not greatly degrade the gain value of the speech reproduced by the decoding device.

【０１４７】この発明によれば、複数の符号帳間の対応
する各符号語に関する評価値の偏差の合計値が最小とな
るように、複数の符号帳の符号語の格納順序が並び換え
られているように構成したので、伝送誤りが発生して、
音声復号化装置がモード情報を誤認しても、所定の評価
値に関する劣化の期待値が小さくなり、その結果、音声
の再生品質の劣化を抑制することが可能な音声符号を生
成することができる効果がある。According to the present invention, the storage order of the codewords of the plurality of codebooks is rearranged so that the total value of the deviations of the evaluation values of the corresponding codewords among the plurality of codebooks is minimized. Transmission error occurs,
Even if the speech decoding device mistakenly recognizes the mode information, the expected value of the degradation related to the predetermined evaluation value becomes small, and as a result, it is possible to generate a speech code capable of suppressing the degradation of the speech reproduction quality. effective.

【０１４８】この発明によれば、符号語から音源を生成
して、その音源から合成音を生成する場合、その合成音
に関する期待値を評価値として取り扱うように構成した
ので、音声の再生品質の劣化を抑制することが可能な音
声符号を生成することができる効果がある。According to the present invention, when a sound source is generated from a code word and a synthesized sound is generated from the sound source, an expected value relating to the synthesized sound is treated as an evaluation value. There is an effect that a speech code capable of suppressing deterioration can be generated.

【０１４９】この発明によれば、インデックスをマッピ
ングするマッピング手段を有し、少なくとも１以上の符
号帳がマッピング後のインデックスに対応する符号語を
出力することにより、複数の符号帳の符号語の格納順序
を予め評価値の順位を基準にして更新することなく、更
新後の格納順序と等価な状態を構築するように構成した
ので、事前にゲイン符号語の格納順序を更新する処理が
不要になる効果がある。According to the present invention, there is provided a mapping means for mapping an index, and at least one or more codebooks output a codeword corresponding to the mapped index, thereby storing codewords of a plurality of codebooks. A configuration equivalent to the storage order after the update is constructed without updating the order in advance based on the rank of the evaluation value, so that it is not necessary to update the storage order of the gain codewords in advance. effective.

【０１５０】この発明によれば、インデックスをマッピ
ングするマッピング手段を有し、少なくとも１以上の符
号帳がマッピング後のインデックスに対応する符号語を
出力することにより、複数の符号帳の符号語の格納順序
を予め評価値の偏差の合計値が最小となるように更新す
ることなく、更新後の格納順序と等価な状態を構築する
ように構成したので、事前にゲイン符号語の格納順序を
更新する処理が不要になる効果がある。According to the present invention, there is provided a mapping means for mapping an index, and at least one or more codebooks output a codeword corresponding to the mapped index, thereby storing codewords of a plurality of codebooks. Since the order is configured so that a state equivalent to the storage order after the update is constructed without updating the sum of the deviations of the evaluation values in advance, the storage order of the gain codeword is updated in advance. There is an effect that processing becomes unnecessary.

【０１５１】この発明によれば、複数の符号帳が他の符
号帳の符号語に関する評価値の順位と相応して、符号語
の格納順序が並び換えられているように構成したので、
モード情報を誤認しても、音声の再生品質の劣化を抑制
することができる効果がある。According to the present invention, since the plurality of codebooks are arranged such that the storage order of the codewords is rearranged in accordance with the order of the evaluation values regarding the codewords of the other codebooks,
Even if the mode information is erroneously recognized, there is an effect that the deterioration of the sound reproduction quality can be suppressed.

【０１５２】この発明によれば、符号語に関する評価値
として、その符号語のパワー又は平均振幅を用いるよう
に構成したので、モード情報を誤認しても、音声のパワ
ーや振幅の大きな劣化を招くことなく、音声を再生する
ことができる効果がある。According to the present invention, since the power or the average amplitude of the code word is used as the evaluation value relating to the code word, even if the mode information is erroneously recognized, the power and the amplitude of the voice are greatly deteriorated. There is an effect that the sound can be reproduced without the need.

【０１５３】この発明によれば、複数の符号帳が音源ゲ
インを出力する符号帳であるように構成したので、モー
ド情報を誤認しても、音声のゲイン値の大きな劣化を招
くことなく、音声を再生することができる効果がある。According to the present invention, since the plurality of codebooks are configured to be codebooks for outputting the excitation gain, even if the mode information is erroneously recognized, the voice gain value is not significantly deteriorated without causing significant deterioration of the voice gain value. There is an effect that can be reproduced.

【０１５４】この発明によれば、複数の符号帳間の対応
する各符号語に関する評価値の偏差の合計値が最小とな
るように、複数の符号帳の符号語の格納順序が並び換え
られているように構成したので、モード情報を誤認して
も、所定の評価値に関する劣化の期待値が小さくなり、
その結果、音声の再生品質の劣化を抑制することができ
る効果がある。According to the present invention, the storage order of the codewords of the plurality of codebooks is rearranged so that the total value of the deviations of the evaluation values of the corresponding codewords among the plurality of codebooks is minimized. Configuration, even if the mode information is erroneously recognized, the expected value of the deterioration with respect to the predetermined evaluation value decreases,
As a result, there is an effect that deterioration of the sound reproduction quality can be suppressed.

【０１５５】この発明によれば、符号語から音源を生成
して、その音源から合成音を生成する場合、その合成音
に関する期待値を評価値として取り扱うように構成した
ので、音声の再生品質の劣化を抑制することができる効
果がある。According to the present invention, when a sound source is generated from a codeword and a synthesized sound is generated from the sound source, an expected value relating to the synthesized sound is treated as an evaluation value. There is an effect that deterioration can be suppressed.

【０１５６】この発明によれば、インデックスをマッピ
ングするマッピング手段を有し、少なくとも１以上の符
号帳がマッピング後のインデックスに対応する符号語を
出力することにより、複数の符号帳の符号語の格納順序
を予め評価値の順位を基準にして更新することなく、更
新後の格納順序と等価な状態を構築するように構成した
ので、事前にゲイン符号語の格納順序を更新する処理が
不要になる効果がある。According to the present invention, there is provided a mapping means for mapping an index, and at least one or more codebooks output a codeword corresponding to the mapped index, thereby storing codewords of a plurality of codebooks. A configuration equivalent to the storage order after the update is constructed without updating the order in advance based on the rank of the evaluation value, so that it is not necessary to update the storage order of the gain codewords in advance. effective.

【０１５７】この発明によれば、インデックスをマッピ
ングするマッピング手段を有し、少なくとも１以上の符
号帳がマッピング後のインデックスに対応する符号語を
出力することにより、複数の符号帳の符号語の格納順序
を予め評価値の偏差の合計値が最小となるように更新す
ることなく、更新後の格納順序と等価な状態を構築する
ように構成したので、事前にゲイン符号語の格納順序を
更新する処理が不要になる効果がある。According to the present invention, there is provided a mapping means for mapping an index, and at least one or more codebooks output a codeword corresponding to the mapped index, thereby storing codewords of a plurality of codebooks. Since the order is configured so that a state equivalent to the storage order after the update is constructed without updating the sum of the deviations of the evaluation values in advance, the storage order of the gain codeword is updated in advance. There is an effect that processing becomes unnecessary.

【０１５８】この発明によれば、各符号帳の符号語に関
する評価値を調査し、他の符号帳の符号語に関する評価
値の順位と相応して、少なくとも１以上の符号帳の符号
語の格納順序を並び換えるように構成したので、伝送誤
りが発生して、音声復号化装置がモード情報を誤認して
も、音声の再生品質の劣化を抑制することができる符号
帳が得られる効果がある。According to the present invention, the evaluation values for the codewords of each codebook are examined, and at least one or more codewords of the codebook are stored in accordance with the order of the evaluation values for the codewords of the other codebooks. Since the order is rearranged, even if a transmission error occurs and the speech decoding device mistakenly recognizes the mode information, it is possible to obtain a codebook capable of suppressing deterioration of speech reproduction quality. .

【０１５９】この発明によれば、符号語に関する評価値
として、その符号語のパワー又は平均振幅を用いるよう
に構成したので、音声のパワーや振幅の大きな劣化を招
くことなく、音声を再生することができる符号帳が得ら
れる効果がある。According to the present invention, since the power or the average amplitude of the code word is used as the evaluation value relating to the code word, it is possible to reproduce the sound without causing significant deterioration in the power or amplitude of the sound. This has the effect of obtaining a codebook that can be used.

【０１６０】この発明によれば、複数の符号帳が音源ゲ
インを出力する符号帳であるように構成したので、音声
のゲイン値の大きな劣化を招くことなく、音声を再生す
ることができる符号帳が得られる効果がある。According to the present invention, since the plurality of codebooks are configured to be codebooks that output excitation gains, a codebook capable of reproducing a sound without causing significant deterioration of a sound gain value. The effect is obtained.

【０１６１】この発明によれば、複数の符号帳間の対応
する各符号語に関する評価値の偏差の合計値を計算し、
その合計値が減少して最小化するまで、少なくとも１以
上の符号帳の符号語の格納順序を更新するように構成し
たので、モード情報を誤認しても、所定の評価値に関す
る劣化の期待値が小さくなり、その結果、音声の再生品
質の劣化を抑制することができる符号帳が得られる効果
がある。According to the present invention, the total value of the deviations of the evaluation values for the corresponding codewords among a plurality of codebooks is calculated,
Since the storage order of at least one or more codebooks is updated until the total value decreases and is minimized, even if the mode information is erroneously recognized, the expected value of deterioration with respect to a predetermined evaluation value is obtained. Is reduced, and as a result, there is an effect of obtaining a codebook capable of suppressing deterioration of the reproduction quality of audio.

【０１６２】この発明によれば、符号語から音源を生成
して、その音源から合成音を生成する場合、その合成音
に関する期待値を評価値として取り扱うように構成した
ので、音声の再生品質の劣化を抑制することができる符
号帳が得られる効果がある。According to the present invention, when a sound source is generated from a code word and a synthesized sound is generated from the sound source, an expected value relating to the synthesized sound is treated as an evaluation value. There is an effect of obtaining a codebook capable of suppressing deterioration.

[Brief description of the drawings]

【図１】この発明の実施の形態１による音声符号化装
置を示す構成図である。FIG. 1 is a configuration diagram showing a speech encoding device according to Embodiment 1 of the present invention.

【図２】この発明の実施の形態１による音声復号化装
置を示す構成図である。FIG. 2 is a configuration diagram showing a speech decoding device according to Embodiment 1 of the present invention.

【図３】この発明の実施の形態１による符号語配列方
法を示すフローチャートである。FIG. 3 is a flowchart showing a codeword arrangement method according to the first embodiment of the present invention.

【図４】音声符号化装置及び音声復号化装置により使
用されるゲイン符号帳の一例を示す説明図である。FIG. 4 is an explanatory diagram illustrating an example of a gain codebook used by a speech encoding device and a speech decoding device.

【図５】多重化部から出力される音声符号の一例を示
す説明図である。FIG. 5 is an explanatory diagram showing an example of a speech code output from a multiplexing unit.

【図６】この発明の実施の形態２による音声符号化装
置を示す構成図である。FIG. 6 is a configuration diagram illustrating a speech encoding device according to a second embodiment of the present invention.

【図７】この発明の実施の形態２による音声復号化装
置を示す構成図である。FIG. 7 is a configuration diagram showing a speech decoding apparatus according to Embodiment 2 of the present invention.

【図８】ゲイン符号帳の一例を示す説明図である。FIG. 8 is an explanatory diagram showing an example of a gain codebook.

【図９】多重化部から出力される音声符号の一例を示
す説明図である。FIG. 9 is an explanatory diagram illustrating an example of a speech code output from a multiplexing unit.

【図１０】この発明の実施の形態４による符号語配列
方法が適用する符号語配列装置を示す構成図である。FIG. 10 is a configuration diagram showing a codeword arrangement device to which a codeword arrangement method according to a fourth embodiment of the present invention is applied.

【図１１】この発明の実施の形態５による音声符号化
装置を示す構成図である。FIG. 11 is a configuration diagram illustrating a speech encoding device according to a fifth embodiment of the present invention.

【図１２】この発明の実施の形態５による音声復号化
装置を示す構成図である。FIG. 12 is a configuration diagram showing a speech decoding apparatus according to Embodiment 5 of the present invention.

【図１３】マッピング用テーブルを示す説明図であ
る。FIG. 13 is an explanatory diagram showing a mapping table.

【図１４】マッピング用テーブルを示す説明図であ
る。FIG. 14 is an explanatory diagram showing a mapping table.

【図１５】従来の音声符号化装置を示す構成図であ
る。FIG. 15 is a configuration diagram showing a conventional speech encoding device.

【図１６】従来の音声復号化装置を示す構成図であ
る。FIG. 16 is a configuration diagram showing a conventional speech decoding device.

【図１７】従来の音声符号化装置及び音声復号化装置
により使用されるゲイン符号帳の一例を示す説明図であ
る。FIG. 17 is an explanatory diagram showing an example of a gain codebook used by a conventional speech encoding device and speech decoding device.

[Explanation of symbols]

４１前処理部（符号化手段）、４２スペクトル分析
部（符号化手段）、４３スペクトル符号化部（符号化
手段）、４４合成フィルタ（符号化手段）、４５減
算器（符号化手段）、４６聴覚重み付け部（符号化手
段）、４７歪み最小化部（符号化手段）、４８音源
復号化部（符号化手段）、４９音源復号化部（符号化
手段）、５０適応音源符号帳、５１駆動音源符号
帳、５２ゲイン符号帳、５３乗算器、５４乗算器、
５５加算器、５６駆動音源符号帳、５７ゲイン符
号帳、５８乗算器、５９切換スイッチ（符号化手
段）、６０多重化部（多重化手段）、６１分離部
（分離手段）、６２音源復号化部（復号化手段）、６
３音源復号化部（復号化手段）、６４適応音源符号
帳、６５駆動音源符号帳、６６ゲイン符号帳、６７
乗算器、６８乗算器、６９加算器、７０駆動音
源符号帳、７１ゲイン符号帳、７２乗算器、７３
切換スイッチ（復号化手段）、７４スペクトル復号化
部（復号化手段）、７５合成フィルタ（復号化手
段）、７６後処理部（復号化手段）、８１音源モード
選択部（符号化手段）、８２切換スイッチ（符号化手
段）、８３乗算器（符号化手段）、８４乗算器（符号
化手段）、８５加算器（符号化手段）、９１音源モ
ード選択部（復号化手段）、９２切換スイッチ（復号
化手段）、９３乗算器（復号化手段）、９４乗算器
（復号化手段）、９５加算器（復号化手段）、１０１
駆動音源符号帳、１０２駆動音源符号帳、１０３合
成フィルタ、１０４合成フィルタ、１０５距離計算
部、１０６符号語入れ換え部、１１１マッピング部
（マッピング手段）、１１２マッピング部（マッピン
グ手段）。41 preprocessing unit (encoding unit), 42 spectrum analysis unit (encoding unit), 43 spectrum encoding unit (encoding unit), 44 synthesis filter (encoding unit), 45 subtractor (encoding unit), 46 Auditory weighting unit (encoding unit), 47 distortion minimizing unit (encoding unit), 48 sound source decoding unit (encoding unit), 49 sound source decoding unit (encoding unit), 50 adaptive excitation codebook, 51 driving Excitation codebook, 52 gain codebook, 53 multiplier, 54 multiplier,
55 adder, 56 driving excitation codebook, 57 gain codebook, 58 multiplier, 59 changeover switch (encoding means), 60 multiplexing section (multiplexing means), 61 demultiplexing section (separating means), 62 excitation decoding Part (decoding means), 6
3 Excitation decoder (decoding means), 64 adaptive excitation codebook, 65 driving excitation codebook, 66 gain codebook, 67
Multiplier, 68 multiplier, 69 adder, 70 excitation codebook, 71 gain codebook, 72 multiplier, 73
Changeover switch (decoding means), 74 spectrum decoding section (decoding means), 75 synthesis filter (decoding means), 76 post-processing section (decoding means), 81 sound source mode selection section (coding means), 82 Changeover switch (encoding means), 83 multiplier (encoding means), 84 multiplier (encoding means), 85 adder (encoding means), 91 sound source mode selection section (decoding means), 92 changeover switch (encoding means) Decoding means), 93 multiplier (decoding means), 94 multiplier (decoding means), 95 adder (decoding means), 101
Driving excitation codebook, 102 Driving excitation codebook, 103 synthesis filter, 104 synthesis filter, 105 distance calculation unit, 106 codeword replacement unit, 111 mapping unit (mapping unit), 112 mapping unit (mapping unit).

Claims

[Claims]

1. A codebook corresponding to mode information is selected from a plurality of codebooks that output a codeword corresponding to an index, and input speech is converted for each frame using a codeword output by the codebook. In a speech encoding apparatus provided with an encoding unit for encoding and a multiplexing unit for multiplexing an encoding result of the encoding unit into a bit string, the plurality of codebooks are related to codewords of another codebook. A speech coding apparatus, wherein the storage order of codewords is rearranged in accordance with the order of evaluation values.

2. The speech encoding apparatus according to claim 1, wherein the power or the average amplitude of the code word is used as the evaluation value for the code word.

3. The codebook according to claim 1, wherein the plurality of codebooks are codebooks that output excitation gains.
A speech encoding device according to claim 1.

4. A codebook corresponding to mode information is selected from a plurality of codebooks that output a codeword corresponding to an index, and input speech is converted for each frame using a codeword output by the codebook. In a speech coding apparatus comprising coding means for coding, and multiplexing means for multiplexing the coding result of the coding means into a bit string, an evaluation value for each code word corresponding to the plurality of codebooks Wherein the storage order of the codewords of the plurality of codebooks is rearranged such that the total value of the deviations of the codebooks becomes minimum.

5. A method according to claim 1, wherein when a sound source is generated from a code word and a synthesized sound is generated from the sound source, an expected value relating to the synthesized sound is treated as an evaluation value. The speech encoding device according to claim 1.

6. A mapping means for mapping an index, wherein at least one or more codebooks output a codeword corresponding to the mapped index, thereby preliminarily evaluating the storage order of the codewords of the plurality of codebooks. 2. The speech coding apparatus according to claim 1, wherein a state equivalent to the storage order after the update is constructed without updating based on the order of the values.

7. A storage unit for mapping an index, wherein at least one or more codebooks output codewords corresponding to the index after the mapping, thereby preliminarily evaluating the storage order of the codewords of the plurality of codebooks. The speech encoding apparatus according to claim 4, wherein a state equivalent to the storage order after the update is constructed without updating the sum of the deviations of the values to be a minimum.

8. A separating means for separating an index from an encoding result multiplexed into a bit string, and an arbitrary codebook among a plurality of codebooks for outputting a codeword corresponding to the index separated by the separating means. And a decoding means for decoding the encoding result using the codeword output by the codebook, wherein the plurality of codebooks are codes of another codebook. A speech decoding apparatus, wherein the storage order of codewords is rearranged in accordance with the order of evaluation values for words.

9. The speech decoding apparatus according to claim 8, wherein the power or the average amplitude of the code word is used as the evaluation value for the code word.

10. The speech decoding apparatus according to claim 8, wherein the plurality of codebooks are codebooks that output excitation gains.

11. A separating means for separating an index from an encoding result multiplexed into a bit string, and an arbitrary codebook among a plurality of codebooks for outputting a codeword corresponding to the index separated by the separating means. And a decoding unit that decodes the encoded result using the codeword output by the codebook.
The speech order, wherein the storage order of the codewords of the plurality of codebooks is rearranged so that the total value of the deviations of the evaluation values for the corresponding codewords among the plurality of codebooks is minimized. Decryption device.

12. A method according to claim 8, wherein when a sound source is generated from a code word and a synthesized sound is generated from the sound source, an expected value related to the synthesized sound is treated as an evaluation value. The speech decoding device according to claim 1.

13. A mapping means for mapping an index, wherein at least one or more codebooks output codewords corresponding to the mapped index, thereby preliminarily evaluating the storage order of the codewords of the plurality of codebooks. 9. The speech decoding apparatus according to claim 8, wherein a state equivalent to the storage order after the update is constructed without updating based on the order of the values.

14. A method for mapping indices, wherein at least one or more codebooks output codewords corresponding to the indices after mapping, thereby pre-evaluating the storage order of codewords in a plurality of codebooks. 12. The speech decoding apparatus according to claim 11, wherein a state equivalent to the storage order after the update is constructed without updating the sum of the deviations of the values to be a minimum.

15. When a plurality of codebooks for outputting a codeword corresponding to an index are mounted on a speech encoding device or a speech decoding device, an evaluation value relating to a codeword of each codebook is examined, and another codebook is examined. A codeword arrangement method for rearranging the storage order of codewords of at least one or more codebooks in accordance with the order of evaluation values of the codewords.

16. The codeword arrangement method according to claim 15, wherein the power or the average amplitude of the codeword is used as the evaluation value for the codeword.

17. The codeword arrangement method according to claim 15, wherein the plurality of codebooks are codebooks that output excitation gains.

18. When a plurality of codebooks for outputting codewords corresponding to an index are mounted on a speech coding apparatus or a speech decoding apparatus, deviations of evaluation values of the codebooks corresponding to the plurality of codebooks are provided. A code word arrangement method for calculating the total value of the code words and updating the storage order of the code words of at least one or more code books until the total value decreases and minimizes.

19. The code according to claim 15, wherein, when a sound source is generated from the code word and a synthesized sound is generated from the sound source, an expected value related to the synthesized sound is treated as an evaluation value. Word alignment method.