JPH05108096A

JPH05108096A - Vector drive type speech encoding device

Info

Publication number: JPH05108096A
Application number: JP3271543A
Authority: JP
Inventors: Tatsuo Inoue; 健生井上; Hiroyuki Hirai; 啓之平井; Hideji Nishida; 秀治西田; Shozo Sugishita; 正蔵杉下
Original assignee: Sanyo Electric Co Ltd
Current assignee: Sanyo Electric Co Ltd
Priority date: 1991-10-18
Filing date: 1991-10-18
Publication date: 1993-04-30

Abstract

PURPOSE:To improve the quality of a reproduced speech even when the bit rate is low. CONSTITUTION:A 1st sound source which uses the components of the pitch vector of an adaptive code book 12a and the components of the code vector of a noise code book 14a, a 2nd sound source which uses only the components of the code vector, and a 3rd sound source which uses only the components of the pitch vector are generated as driving sound sources so as to minimize the error between a weighted input signal and a reproduced signal. For the purpose, an adaptive bit assignment part 22a adaptively assigns bit information to the index of the pitch vector and a multiplier 16a, and the index of the code vector and a multiplier 16b to vary the search ranges of the indexes of the code books and the numbers of quantized bits of the gains of the multipliers. Then a driving sound source signal determination part 18 determines the best driving sound source and the gain of the driving sound source.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、ベクトル駆動型音声
符号化装置に関し、特に入力音声をたとえば４．８ｋｂ
／ｓ程度の低ビットレートで符号化するベクトル駆動型
音声符号化装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a vector drive type speech coding apparatus, and more particularly to an input speech of 4.8 kb.
The present invention relates to a vector-driven type audio encoding device that encodes at a low bit rate of about / s.

【０００２】[0002]

【従来の技術】音声信号を再現性よく低ビットレートで
符号化するための効果的な方法として、ＣＥＬＰ（Code
Excited Linear Prediction）符号化方式によるベクト
ル駆動型符号化装置が提案されている。この方式の詳細
については、ＩＣＡＳＳＰ−８５ｐｐ．９３７〜９４０
に掲載されたＢ．Ｓ．Ａｔａｌ氏らによる“Code Excit
ed Linear Prediction（ＣＥＬＰ）：High QualitySpee
ch at Very Low Bit Rates.”，信学技報ＣＳ８９−６
４に掲載された谷口氏らによる“主軸ベクトル駆動音声
符号化方式”，あるいは１９９１年電子情報通信学会春
期全国大会論文集Ａ−２１３に掲載された谷口氏らによ
る“ＣＥＬＰ符号化における適応符号帳の改良”におい
て説明されている。2. Description of the Related Art As an effective method for encoding a voice signal with good reproducibility and a low bit rate, CELP (Code
A vector-driven encoder based on the Excited Linear Prediction) encoding method has been proposed. For details of this method, see ICASSP-85pp. 937-940
B. S. “Code Excit” by Atal et al.
ed Linear Prediction (CELP): High QualitySpee
ch at Very Low Bit Rates. ”, IEICE Technical Report CS89-6
4 "Spindle Vector Driven Speech Coding" by Taniguchi et al., Or "Adaptive Codebook for CELP Coding by Taniguchi et al. Improvements ”.

【０００３】従来のＣＥＬＰ符号化方式による音声符号
化装置の一例を図３および図４に示す。この従来の音声
符号化装置１は、図３に示す符号化装置１ａおよび図４
に示す復号化装置１ｂを含む。符号化装置１ａは適応コ
ードブック２ａおよび雑音コードブック３ａを有し、復
号化装置１ｂは適応コードブック２ｂおよび雑音コード
ブック３ｂを有する。そして、適応コードブック２ａお
よび２ｂはそれぞれ同じ音声パターンを記録しており、
雑音コードブック３ａおよび３ｂもそれぞれ同じ音声パ
ターンを記録している。An example of a conventional speech coding apparatus based on the CELP coding method is shown in FIGS. 3 and 4. This conventional speech coding apparatus 1 includes a coding apparatus 1a shown in FIG. 3 and a coding apparatus 1 shown in FIG.
The decoding device 1b shown in FIG. The coding device 1a has an adaptive codebook 2a and a noise codebook 3a, and the decoding device 1b has an adaptive codebook 2b and a noise codebook 3b. The adaptive codebooks 2a and 2b respectively record the same voice pattern,
The noise codebooks 3a and 3b also record the same voice pattern.

【０００４】図３に示す符号化装置１ａにおいては、予
めインデックス数が定められた適応コードブック２ａか
ら抽出されたピッチベクトル（Ｐ）およびこれも予めイ
ンデックス数が決められた雑音コードブック３ａから抽
出されたコードベクトル（Ｃ）に対して、それぞれのゲ
イン（ピッチ：ｂ，コード：ｇ）を乗じて加え合わせた
駆動音源信号（ｂＰ＋ｇＣ）が生成される。符号化の過
程では、駆動音源信号を重み付け合成フィルタ４ａを通
して得られる再生信号（ｂＡＰ＋ｇＡＣ）と重み付け入
力信号（ＡＸ）との間の誤差信号（Ｅ）の電力を誤差電
力評価部５において評価関数とし、コードブックの探索
が行われる。このときのコードブック探索のためのイン
デックスのビット数はコードブックのインデックス数に
合わせて予め定められている。そして、誤差信号の電力
を最小とする駆動音源信号が最適駆動音源信号（ｂ₀Ｐ
₀＋ｇ₀Ｃ₀）として決定される。このとき、この最適
駆動音源信号をフレーム遅延部を介して、入力信号をた
とえば７．５ｍｓ毎に区切った１フレーム分遅延させた
後適応コードブック２に帰還させることによって適応コ
ードブック２ａの内容が更新される。In the encoding device 1a shown in FIG. 3, the pitch vector (P) extracted from the adaptive codebook 2a having a predetermined number of indexes and also the noise vector 3a having a predetermined number of indexes are extracted. A driving sound source signal (bP + gC) is generated by multiplying the generated code vector (C) by each gain (pitch: b, code: g) and adding them. In the encoding process, the power of the error signal (E) between the reproduction signal (bAP + gAC) obtained by passing the driving excitation signal through the weighting synthesis filter 4a and the weighted input signal (AX) is used as an evaluation function in the error power evaluation unit 5. , A codebook search is performed. The number of bits of the index for the codebook search at this time is predetermined according to the number of codebook indexes. The driving sound source signal that minimizes the power of the error signal is the optimum driving sound source signal (b ₀ P
₀ + g ₀ C ₀ ). At this time, the contents of the adaptive codebook 2a are delayed by feeding back the optimum driving sound source signal to the adaptive codebook 2 through the frame delay unit by delaying the input signal by one frame divided for example every 7.5 ms. Will be updated.

【０００５】図４に示す復号化装置１ｂへは、適応コー
ドブック２ａから抽出されたピッチベクトル（Ｐ₀）の
インデックスおよびゲイン（ｂ₀），雑音コードブック
３ａから抽出されたコードベクトル（Ｃ₀）のインデッ
クスおよびゲイン（ｇ₀）ならびに重み付け合成フィル
タ４のフィルタ係数が伝送路７を通じて送られる。そし
て、復号化装置１ｂでは、それに基づいて最適駆動音源
信号（ｂ₀Ｐ₀＋ｇ₀Ｃ₀）を作成し、重み付け合成フ
ィルタ４ｂを介して再生信号（ｂ₀ＡＰ₀＋ｇ ₀Ａ
Ｃ₀）を出力する。The decoding device 1b shown in FIG.
Pitch vector (P₀)of
Index and gain (b₀), Noise Codebook
Code vector (C₀) Index
And gain (g₀) And weighted composite fill
The filter coefficient of the filter 4 is sent through the transmission line 7. That
Then, in the decoding device 1b, the optimum driving sound source is
Signal (b₀P₀+ G₀C₀) And create a weighted composite
Playback signal (b₀AP₀+ G ₀A
C₀) Is output.

【０００６】[0006]

【発明が解決しようとする課題】従来技術において、適
応コードブック２ａから抽出されたピッチベクトルはピ
ッチ周期を再生することを仮定したものであり、有声音
源を想定している。したがって、有声音に対する効果は
大きいが無声音などの非周期的な音声に対しては効果が
少ない。これに対して、雑音コードブック３ａから抽出
されたコードベクトルは非周期的な無声音源を想定した
ものであり、有声音などの周期的な音声に対しては効果
が少ない。したがって、転送レートが４．８ｋｂ／ｓ程
度である場合には再生音質は十分満足できるものではな
い。In the prior art, it is assumed that the pitch vector extracted from the adaptive codebook 2a reproduces a pitch period, and a voiced sound source is assumed. Therefore, the effect is great for voiced sounds, but less effective for non-periodic sounds such as unvoiced sounds. On the other hand, the code vector extracted from the noise codebook 3a is based on the assumption of an aperiodic unvoiced sound source, and has little effect on a periodic voice such as a voiced sound. Therefore, when the transfer rate is about 4.8 kb / s, the reproduced sound quality is not sufficiently satisfactory.

【０００７】それゆえに、この発明の主たる目的は、た
とえば４．８ｋｂ／ｓ程度の低ビットレートにおいても
再生音質を向上できる、ベクトル駆動型音声符号化装置
を提供することである。Therefore, a main object of the present invention is to provide a vector drive type audio encoding device capable of improving reproduced sound quality even at a low bit rate of, for example, about 4.8 kb / s.

【０００８】[0008]

【課題を解決するための手段】この発明は、駆動音源と
して、適応コードブックから抽出されるピッチベクトル
の成分と雑音コードブックから抽出されるコードベクト
ルの成分とを使用する第１の音源と、雑音コードブック
のコードベクトルの成分のみを使用する第２の音源と、
適応コードブックのピッチベクトルの成分のみを使用す
る第３の音源とを作成する駆動音源作成手段、第１，第
２および第３の音源に応じて情報量を適応コードブック
のピッチベクトルのインデックスおよびゲインならびに
雑音コードブックのコードベクトルのインデックスおよ
びゲインに適応的に割り当てる適応ビット割当手段、駆
動音源を重み付け合成手段に通して得られる再生信号と
重み付け入力信号とを比較する比較手段、および比較手
段による比較結果に応じて第１，第２および第３の音源
のいずれかを決定する駆動音源決定手段を備える、ベク
トル駆動型音声符号化装置である。According to the present invention, a first sound source that uses, as a driving sound source, a pitch vector component extracted from an adaptive codebook and a code vector component extracted from a noise codebook, A second sound source that uses only the code vector components of the noise codebook;
A driving sound source creating means for creating a third sound source that uses only the components of the pitch vector of the adaptive codebook, and an information amount according to the first, second and third sound sources, and an index of the pitch vector of the adaptive codebook and By adaptive bit allocation means for adaptively allocating to gain and code vector index and gain of the noise codebook, comparison means for comparing the reproduction signal obtained by passing the driving sound source through the weighting synthesis means, and the weighting input signal, and the comparison means It is a vector-driven speech encoding apparatus including a drive excitation determination unit that determines any one of the first, second, and third excitations according to the comparison result.

【０００９】[0009]

【作用】駆動音源作成手段が、駆動音源として適応コー
ドブックから抽出されるピッチベクトルの成分と雑音コ
ードブックのコードベクトルの成分との両方を使用する
第１の状態の音源と、駆動音源として雑音コードブック
から抽出されるコードベクトルの成分のみを使用する第
２の状態の音源と、駆動音源として適応コードブックか
ら抽出されるピッチベクトルの成分のみを使用する第３
の状態の音源とを作成する。そのために、適応ビット割
当手段が最適駆動音源が第１，第２および第３の音源の
各状態に合うように、各コードブックに与えるインデッ
クスのビット情報量とゲインの量子化ビット数の割り当
て方を変える。適応ビット割当手段は、第１の音源の場
合には、所定のビット情報量を適応コードブックのピッ
チベクトルのインデックスおよびそのゲインならびに雑
音コードブックのコードベクトルのインデックスおよび
そのゲインに割り当てる。第２の音源の場合には、駆動
音源として適応コードブックのピッチベクトルの成分は
使われないので、第１の状態でそのピッチベクトルおよ
びゲインに割り当てていたビット情報量を雑音コードブ
ックのコードベクトルのインデックスおよびゲインのみ
に割り当てる。これによって、雑音コードブックのコー
ドベクトルのインデックスおよびゲインとに割り当てら
れるビット情報量は多くなり、第２の音源を抽出する際
のインデックスの探索範囲が広がり、ゲインを量子化す
る量子化ビット数が変動して、最適な第２の駆動音源が
作られる。第３の音源の場合には、駆動音源として雑音
コードブックのコードベクトルの成分は使われないの
で、第１の状態でそれに割り当てていた情報量を適応コ
ードブックのピッチベクトルのインデックスおよびゲイ
ンのみに割り当てる。これによって、適応コードブック
のピッチベクトルのインデックスとゲインに割り当てら
れる情報量は多くなり、第１の音源を抽出する際のイン
デックスの探索範囲が広がり、ゲインを量子化する量子
化ビット数が変動して、最適な第３の音源を作成する。
このようにビット情報量の割り当て方を変え、再生信号
と重み付け入力信号との誤差電力が最小となる状態にお
いて駆動音源を作成することによって最適な駆動音源を
得ることができる。The driving sound source creating means uses the sound source in the first state in which both the pitch vector component extracted from the adaptive codebook and the code vector component of the noise codebook are used as the driving sound source, and the noise is used as the driving sound source. A sound source in the second state that uses only the components of the code vector extracted from the codebook, and a third that uses only the components of the pitch vector extracted from the adaptive codebook as the driving sound source.
Create a sound source in the state of. Therefore, the adaptive bit allocation means allocates the bit information amount of the index and the quantization bit number of the gain given to each codebook so that the optimum driving sound source matches each state of the first, second and third sound sources. change. In the case of the first sound source, the adaptive bit allocation means allocates a predetermined bit information amount to the pitch vector index of the adaptive codebook and its gain and the code vector index of the noise codebook and its gain. In the case of the second sound source, since the pitch vector component of the adaptive codebook is not used as the driving sound source, the bit information amount assigned to the pitch vector and the gain in the first state is used as the code vector of the noise codebook. Assign only to the index and gain of. This increases the amount of bit information assigned to the index and gain of the code vector of the noise codebook, widens the search range of the index when extracting the second sound source, and reduces the number of quantization bits for quantizing the gain. It fluctuates and an optimum second driving sound source is created. In the case of the third sound source, since the code vector component of the noise codebook is not used as the driving sound source, the information amount assigned to it in the first state is used only as the index and gain of the pitch vector of the adaptive codebook. assign. This increases the amount of information assigned to the pitch vector index and gain of the adaptive codebook, widens the search range of the index when extracting the first sound source, and changes the number of quantization bits for quantizing the gain. Then, an optimal third sound source is created.
In this way, the optimum driving sound source can be obtained by changing the allocation method of the bit information amount and creating the driving sound source in the state where the error power between the reproduction signal and the weighted input signal is minimized.

【００１０】[0010]

【発明の効果】この発明によれば、音源が有声音源であ
るか無声音源であるかに拘わらず最適な駆動音源を作る
ことができるので、たとえば４．８ｋｂ／ｓ程度の低ビ
ットレートにおける再生音質が従来と比較して向上す
る。この発明の上述の目的，その他の目的，特徴および
利点は、図面を参照して行う以下の実施例の詳細な説明
から一層明らかとなろう。According to the present invention, an optimum driving sound source can be produced regardless of whether the sound source is a voiced sound source or an unvoiced sound source, and therefore reproduction at a low bit rate of, for example, about 4.8 kb / s. The sound quality is improved compared to the conventional one. The above-mentioned objects, other objects, features and advantages of the present invention will become more apparent from the following detailed description of the embodiments with reference to the drawings.

【００１１】[0011]

【実施例】図１および図２を参照して、この実施例のベ
クトル駆動型音声符号化装置１０は符号化装置１０ａお
よび復号化装置１０ｂを含む。符号化装置１０ａはピッ
チベクトルを記録する適応コードブック１２ａおよびコ
ードベクトルを記録する雑音コードブック１４ａを有
し、適応コードブック１２ａから抽出されたピッチベク
トル（Ｐ）は乗算器２０ａに入力され、雑音コードブッ
ク１４ａから抽出されたコードベクトル（Ｃ）は乗算器
２０ｂに入力される。乗算器１６ａでは、ピッチベクト
ル（Ｐ）と駆動音源信号決定部１８より与えられるゲイ
ン（ｂ）とが乗算され、乗算器１６ｂでは、コードベク
トル（Ｃ）と駆動音源信号決定部１８より与えられるゲ
イン（ｇ）とが乗算される。そして、乗算器１６ａおよ
び１６ｂからの出力が加算器２０において加算され、駆
動音源信号（ｂＰ＋ｇＣ）が生成される。1 and 2, a vector driven speech coder 10 of this embodiment includes a coder 10a and a decoder 10b. The encoding device 10a has an adaptive codebook 12a for recording a pitch vector and a noise codebook 14a for recording a code vector, and the pitch vector (P) extracted from the adaptive codebook 12a is input to a multiplier 20a to generate noise. The code vector (C) extracted from the codebook 14a is input to the multiplier 20b. The multiplier 16a multiplies the pitch vector (P) by the gain (b) given by the driving sound source signal determining unit 18, and the multiplier 16b multiplies the code vector (C) by the gain given by the driving sound source signal determining unit 18. (G) is multiplied. Then, the outputs from the multipliers 16a and 16b are added in the adder 20 to generate the driving sound source signal (bP + gC).

【００１２】適応コードブック１２ａ，雑音コードブッ
ク１４ａ，乗算器１６ａおよび乗算器１６ｂには適応ビ
ット割当部２２ａから予め決められたビット情報量が与
えられる。この適応ビット割当部２２ａは駆動音源とし
て適応コードブックのピッチベクトル（Ｐ）の成分と雑
音コードブックのコードベクトル（Ｃ）の成分との両方
を使用する状態ａ，駆動音源として雑音コードブックの
コードベクトル（Ｃ）の成分のみを使用する状態ｂおよ
び駆動音源として適応コードブックのピッチベクトル
（Ｐ）の成分のみを使用する状態ｃのそれぞれの状態に
よってビット情報量の割り当て方を変化させる。これに
よって、適応コードブック１２ａおよび雑音コードブッ
ク１４ａのインデックスの探索範囲は変動し、また、乗
算器１６ａおよび１６ｂのそれぞれの量子化ビット数が
変動する。The adaptive codebook 12a, the noise codebook 14a, the multiplier 16a and the multiplier 16b are provided with a predetermined bit information amount from the adaptive bit allocation unit 22a. The adaptive bit allocation unit 22a uses both the pitch vector (P) component of the adaptive codebook and the code vector (C) component of the noise codebook as the driving sound source, and the code of the noise codebook as the driving sound source. The bit information allocation method is changed depending on the state b in which only the component of the vector (C) is used and the state c in which only the component of the pitch vector (P) of the adaptive codebook is used as the driving sound source. As a result, the search ranges of the indexes of the adaptive codebook 12a and the noise codebook 14a change, and the quantization bit numbers of the multipliers 16a and 16b also change.

【００１３】加算器２０から出力された駆動音源信号は
重み付け合成フィルタ２４ａを介して再生信号（ｂＡＰ
＋ｇＡＣ）となって比較器２６に入力される。比較器２
６ではこの再生信号と重み付け入力信号（ＡＸ）とを比
較しその誤差を誤差電力（Ｅ）として駆動音源信号決定
部１８に入力する。駆動音源信号決定部１８はこの誤差
電力が最小になるように前述の状態ａ〜ｃを選び、適応
コードブック１２ａと雑音コードブック１４ａとの中か
らそれぞれピッチベクトルとコードベクトルとを１つず
つまたはどちらか一方のみを選び出し、また、それぞれ
またはどちらか一方に乗じるゲインを決定する。そし
て、誤差電力を最小とする駆動音源信号が最適駆動音源
信号として決定され、これがフレーム遅延部２８ａを介
して適応コードブック１２ａに帰還され、適応コードブ
ック１２ａの内容が更新される。The driving sound source signal output from the adder 20 is reproduced through the weighting synthesis filter 24a (bAP).
+ GAC) and is input to the comparator 26. Comparator 2
At 6, the reproduced signal is compared with the weighted input signal (AX), and the error is input to the drive sound source signal determination unit 18 as error power (E). The driving sound source signal determination unit 18 selects the above states a to c so that this error power is minimized, and selects one pitch vector and one code vector from the adaptive codebook 12a and the noise codebook 14a, respectively. Only one of them is selected, and the gain to be multiplied to each or either one is determined. Then, the driving sound source signal that minimizes the error power is determined as the optimum driving sound source signal, and this is fed back to the adaptive codebook 12a via the frame delay unit 28a, and the contents of the adaptive codebook 12a are updated.

【００１４】駆動音源を決める際に、適応コードブック
１２ａのピッチベクトル（Ｐ）の成分に対するゲイン
（ｂ）が「０」でなく、かつ雑音コードブック１４ａの
コードベクトル（Ｃ）の成分に対するゲイン（ｇ）も
「０」でない場合には、状態ａで符号化するのが最適で
ある。この場合、予め決められたビット情報量が適応ビ
ット割当部２２ａにより適応コードブック１２ａのピッ
チベクトル（Ｐ）のインデックスおよびゲイン（ｂ）な
らびに雑音コードブック１４ａのコードベクトル（Ｃ）
のインデックスおよびゲイン（ｇ）に割り当てられる。
そして、適応コードブック１２ａから抽出されたピッチ
ベクトル（Ｐ）および雑音コードブック１４ａから抽出
されたコードベクトル（Ｃ）に対し、乗算器１６ａおよ
び１６ｂでそれぞれのゲインを乗じて加算器２０で加え
合わせ駆動音源信号（ｂＰ＋ｇＣ）を生成する。In determining the driving sound source, the gain (b) for the pitch vector (P) component of the adaptive codebook 12a is not "0" and the gain (b) for the code vector (C) component of the noise codebook 14a ( If g) is also not "0", it is optimal to code in state a. In this case, the predetermined bit information amount is determined by the adaptive bit allocation unit 22a as the index and gain (b) of the pitch vector (P) of the adaptive codebook 12a and the code vector (C) of the noise codebook 14a.
Assigned to the index and gain (g).
Then, the pitch vector (P) extracted from the adaptive codebook 12a and the code vector (C) extracted from the noise codebook 14a are multiplied by respective gains in the multipliers 16a and 16b and added in the adder 20. A driving sound source signal (bP + gC) is generated.

【００１５】符号化の過程では、駆動音源信号に重み付
け合成フィルタ２４ａを介して得られる再生信号（ｂＡ
Ｐ＋ｇＡＣ）と重み付け入力信号（ＡＸ）とを比較器２
６で比較する。そして、比較結果として出力される誤差
信号（Ｅ）の電力を評価関数として駆動音源信号決定部
１８がこの誤差信号（Ｅ）の電力が最小となるように、
適応コードブック１２ａおよび雑音コードブック１４ａ
を探索して最適なピッチベクトル（Ｐ₀）および最適な
コードベクトル（Ｃ₀）を決定し、さらに最適なピッチ
ベクトルのゲイン（ｂ₀）および最適なコードベクトル
のゲイン（ｇ₀）を決定する。そして、これらから駆動
音源信号が作成され、最適駆動音源信号Ａ（ｂ₀Ｐ₀＋
ｇ₀Ｃ₀）として決定される。In the encoding process, the drive signal is reproduced signal (bA) obtained through the weighting synthesis filter 24a.
P + gAC) and the weighted input signal (AX) are compared 2
Compare with 6. Then, the drive sound source signal determination unit 18 uses the power of the error signal (E) output as the comparison result as an evaluation function so that the power of the error signal (E) is minimized.
Adaptive codebook 12a and noise codebook 14a
To determine the optimum pitch vector (P ₀ ) and the optimum code vector (C ₀ ), and further to determine the optimum pitch vector gain (b ₀ ) and the optimum code vector gain (g ₀ ). .. Then, a driving sound source signal is created from these, and the optimum driving sound source signal A (b ₀ P ₀ +
g ₀ C ₀ ).

【００１６】駆動音源を決める際、ピッチベクトルのゲ
イン（ｂ）が「０」であり、かつコードベクトルのゲイ
ン（ｇ）が「０」でない場合には状態ｂで符号化するの
が最適である。この場合、適応ビット割当部２２ａによ
り、状態ａのとき適応コードブック１２ａのピッチベク
トル（Ｐ）のインデックスおよびゲイン（ｂ）とに割り
当てていたビット情報量を「０」にし、その分を雑音コ
ードブック１４ａのコードベクトル（Ｃ）のインデック
スおよびゲイン（ｇ）に割り当てる。これによって、雑
音コードブック１４ａの探索範囲は広がる。そして、雑
音コードブック１４ａより抽出されたコードベクトル
（Ｃ）に対して、乗算器１６ｂでゲイン（ｇ）を乗じて
駆動音源信号（ｇＣ）を生成する。When determining the driving sound source, when the gain (b) of the pitch vector is "0" and the gain (g) of the code vector is not "0", it is optimal to code in the state b. .. In this case, the adaptive bit allocation unit 22a sets the bit information amount allocated to the index and the gain (b) of the pitch vector (P) of the adaptive codebook 12a in the state a to “0”, and the corresponding amount is set to the noise code. It is assigned to the index and gain (g) of the code vector (C) of the book 14a. This expands the search range of the noise codebook 14a. Then, the multiplier 16b multiplies the code vector (C) extracted from the noise codebook 14a by the gain (g) to generate the driving sound source signal (gC).

【００１７】符号化の過程では、駆動音源信号に重み付
け合成フィルタ２４ａを介して得られる再生信号（ｇＡ
Ｃ）と重み付け入力信号（ＡＸ）とを比較器２６で比較
する。そして、比較結果として出力される誤差信号
（Ｅ）の電力を評価関数として駆動音源信号決定部１８
がこの誤差信号（Ｅ）の電力が最小となるように、雑音
コードブック１４ａを探索して最適なコードベクトル
（Ｃ₁）を決定し、さらに、最適なコードベクトルのゲ
イン（ｇ₁）を決定する。そして、これらから駆動音源
信号が作成され、最適駆動音源信号Ｂ（ｇ₁Ｃ₁）とし
て決定される。In the encoding process, a reproduction signal (gA) obtained by the weighting synthesis filter 24a is added to the driving sound source signal.
C) and the weighted input signal (AX) are compared by the comparator 26. Then, the driving sound source signal determination unit 18 uses the power of the error signal (E) output as the comparison result as an evaluation function.
So as to minimize the power of this error signal (E), the noise codebook 14a is searched to determine the optimum code vector (C ₁ ) and further the optimum code vector gain (g ₁ ) is determined. To do. Then, a driving sound source signal is created from these, and is determined as the optimum driving sound source signal B (g ₁ C ₁ ).

【００１８】駆動音源を決める際、ピッチベクトルのゲ
イン（ｂ）が「０」でなく、かつコードベクトルのゲイ
ン（ｇ）が「０」である場合には状態ｃで符号化するの
が最適である。この場合、適応ビット割当手段２２によ
り、状態ａのとき雑音コードブック１４ａのコードベク
トル（Ｃ）のインデックスおよびゲイン（ｇ）に割り当
てていた情報量を「０」にし、その分を適応コードブッ
ク１２ａのピッチベクトル（Ｐ）のインデックスとゲイ
ン（ｂ）に割り当てる。これによって、適応コードブッ
ク１２ａの探索範囲は広がる。そして、適応コードブッ
ク１２ａより抽出されたピッチベクトル（Ｐ）に対し
て、乗算器１６ａでゲイン（ｂ）を乗じて駆動音源信号
（ｂＰ）を生成する。When determining the driving sound source, when the pitch vector gain (b) is not "0" and the code vector gain (g) is "0", it is optimal to code in the state c. is there. In this case, the adaptive bit allocation means 22 sets the amount of information allocated to the index and the gain (g) of the code vector (C) of the noise codebook 14a in the state a to "0", and the corresponding amount is adapted codebook 12a. Of the pitch vector (P) and the gain (b). This widens the search range of the adaptive codebook 12a. Then, the multiplier 16a multiplies the pitch vector (P) extracted from the adaptive codebook 12a by the gain (b) to generate the driving sound source signal (bP).

【００１９】符号化の過程では、駆動音源信号に重み付
け合成フィルタ２４ａを介して得られる再生信号（ｂＡ
Ｐ）と重み付け入力信号（ＡＸ）とを比較器２６で比較
する。そして、比較結果として出力される誤差信号
（Ｅ）の電力を評価関数として駆動音源信号決定部１８
がこの誤差信号（Ｅ）の電力が最小となるように、適応
コードブック１２ａを探索して最適なピッチベクトル
（Ｐ₂）を決定し、さらに、最適なピッチベクトルのゲ
イン（ｂ₂）を決定する。そして、これらから駆動音源
信号が作成され、最適駆動音源信号Ｃ（ｂ₂Ｐ₂）とし
て決定される。In the encoding process, a reproduction signal (bA) obtained by the weighting synthesis filter 24a is added to the drive excitation signal.
P) and the weighted input signal (AX) are compared by the comparator 26. Then, the driving sound source signal determination unit 18 uses the power of the error signal (E) output as the comparison result as an evaluation function.
So as to minimize the power of this error signal (E), the adaptive codebook 12a is searched to determine the optimum pitch vector (P ₂ ) and further the optimum pitch vector gain (b ₂ ) is determined. To do. Then, a driving sound source signal is created from these, and is determined as the optimum driving sound source signal C (b ₂ P ₂ ).

【００２０】このようにして決定された最適駆動音源信
号Ａ〜Ｃの中で、より誤差信号の電力が最小になるもの
を最終的に伝送路３０を通じて復号化装置１０ｂに送
り、最適駆動音源信号Ｙとして用いる。最適駆動音源信
号Ａが最適駆動音源信号Ｙとして用いられる場合には、
適応コードブック１２ａから適応コードブック１２ｂに
最適なピッチベクトル（Ｐ₀）が送られ、雑音コードブ
ック１４ａから雑音コードブック１４ｂに最適なコード
ベクトル（Ｃ₀）が送られる。また、乗算器１６ａから
乗算器１６ｃに最適なピッチベクトル（Ｐ₀）のゲイン
（ｂ₀）が送られ、乗算器１６ｂから乗算器１６ｄに最
適なコードベクトル（Ｃ₀）のゲイン（ｇ₀）が送られ
る。さらに、重み付け合成フィルタ２４ａからは重み付
け合成フィルタ２４ｂにフィルタ係数が送られる。そし
て、適応ビット割当部２２ｂが、乗算器１６ｃおよび１
６ｄから入力されるゲインから状態ａであることを判断
し、適応コードブック１２ｂおよび雑音コードブック１
４ｂに対してＰ₀およびＣ₀が抽出されるようにビット
情報を与える。そして、Ｐ₀，Ｃ₀，ｂ₀およびｇ₀か
ら最適駆動音源信号Ｙ（ｂ₀Ｐ₀＋ｇ₀Ｃ₀）が作成さ
れ、重み付け合成フィルタ２４ｂを経て再生信号（Ａ
Ｙ）が得られる。Among the optimum driving excitation signals A to C thus determined, the one having the smallest error signal power is finally sent to the decoding device 10b through the transmission line 30 to obtain the optimum driving excitation signal. Used as Y. When the optimum driving sound source signal A is used as the optimum driving sound source signal Y,
The optimum code vector (P ₀ ) is sent from the adaptive codebook 12a to the adaptive codebook 12b, and the optimum code vector (C ₀ ) is sent from the noise codebook 14a to the noise codebook 14b. The gain of the multiplier 16a from the multiplier 16c to the optimum pitch vector (P _₀₎ (b ₀₎ are sent, the gain of the optimal code vector (C ₀₎ to the multiplier 16d from multiplier 16b (g ₀₎ Will be sent. Further, the filter coefficients are sent from the weighting synthesis filter 24a to the weighting synthesis filter 24b. Then, the adaptive bit allocation unit 22b causes the multipliers 16c and 1
It is determined from the gain input from 6d that the state is a, and the adaptive codebook 12b and the noise codebook 1
Bit information is given so that P ₀ and C ₀ are extracted for 4b. _{_{Then, P 0, C 0, b}} 0 and g ₀ optimal excitation signal _{_{Y (b 0 P 0 + g}} 0 C 0) is created from, through the weighting synthesis filter 24b reproduced signal (A
Y) is obtained.

【００２１】最適駆動音源信号Ｂが最適駆動音源信号Ｙ
として用いられる場合には、雑音コードブック１４ａか
ら雑音コードブック１４ｂに最適なコードベクトル（Ｃ
₁）が送られる。また、乗算器１６ｂから乗算器１６ｄ
に最適なコードベクトル（Ｃ ₁）のゲイン（ｇ₁）が送
られる。さらに、重み付け合成フィルタ２４ａから重み
付け合成フィルタ２４ｂにフィルタ係数が送られる。そ
して、適応ビット割当部２２ｂが、乗算器１６ｃおよび
１６ｄから入力されるゲインから状態ｂであることを判
断し、雑音コードブック１４ｂに対してＣ₁が抽出され
るようにビット情報を与える。そして、Ｃ₁およびｇ₁
から最適駆動音源信号Ｙ（ｇ₁Ｃ₁）が作成され、重み
付け合成フィルタ２４ｂを経て再生信号（ＡＹ）が得ら
れる。The optimum driving sound source signal B is the optimum driving sound source signal Y.
Noise codebook 14a?
Optimal code vector (C
₁) Is sent. In addition, the multipliers 16b to 16d
Code vector (C ₁) Gain (g₁) Sent
Be done. In addition, the weighting synthesis filter 24a
The filter coefficient is sent to the attached synthesis filter 24b. So
Then, the adaptive bit allocation unit 22b causes the multiplier 16c and
From the gain input from 16d, determine that it is in state b.
Turn off and C for noise codebook 14b₁Is extracted
To give the bit information. And C₁And g₁
From the optimum driving sound source signal Y (g₁C₁) Is created and weights
The reproduction signal (AY) is obtained through the attachment synthesis filter 24b.
Be done.

【００２２】最適駆動音源信号Ｃが最適駆動音源信号Ｙ
として用いられる場合には、雑音コードブック１２ａか
ら雑音コードブック１２ｂに最適なピッチベクトル（Ｐ
₂）が送られる。また、乗算器１６ａから乗算器１６ｃ
に最適なピッチベクトル（Ｐ ₂）のゲイン（ｂ₂）が送
られる。さらに、重み付け合成フィルタ２４ａから重み
付け合成フィルタ２４ｂにフィルタ係数が送られる。そ
して、適応ビット割当部２２ｂが、乗算器１６ｃおよび
１６ｄから入力されるゲインから状態ｃであることを判
断し、雑音コードブック１２ｂに対してＰ₂が抽出され
るようにビット情報を与える。そして、Ｐ₂およびｂ₂
から最適駆動音源信号Ｙ（ｂ₂Ｐ₂）が作成され、重み
付け合成フィルタ２４ｂを経て再生信号（ＡＹ）が得ら
れる。The optimum driving sound source signal C is the optimum driving sound source signal Y.
Noise codebook 12a?
Optimal pitch vector (P
₂) Is sent. In addition, the multipliers 16a to 16c
Optimum pitch vector (P ₂) Gain (b₂) Sent
Be done. In addition, the weighting synthesis filter 24a
The filter coefficient is sent to the attached synthesis filter 24b. So
Then, the adaptive bit allocation unit 22b causes the multiplier 16c and
From the gain input from 16d, determine that it is in state c.
No, P for noise codebook 12b₂Is extracted
To give the bit information. And P₂And b₂
From the optimum driving sound source signal Y (b₂P₂) Is created and weights
The reproduction signal (AY) is obtained through the attachment synthesis filter 24b.
Be done.

【００２３】なお、復号化装置１０ｂにおいても最適駆
動音源信号Ｙはフレーム遅延部２８ｂを介して適応コー
ドブック１２ｂに帰還され、適応コードブック１２ｂの
記録内容が適応コードブック１２ａの記録内容と同じに
なるように更新される。また、上述の実施例では、ベク
トル駆動型音声符号化装置について述べたが、同様の方
式を用いてコンピュータシミュレーションによって符号
化および復号化を行うことも可能である。この場合、駆
動音源決定部１８，適応ビット割当部２２ａおよび適応
ビット割当部２２ｂの動作はソフトウェアによって行わ
れ、他の構成部の動作はハードウェアあるいはソフトウ
ェアによって行われる。したがって、駆動音源信号決定
に利用される誤差信号は電力に限られるものではない。
また、この場合には、符号化の前段で重み付け入力信号
をアナログ／ディジタル変換する必要があることはいう
までもない。Also in the decoding device 10b, the optimum driving excitation signal Y is fed back to the adaptive codebook 12b via the frame delay unit 28b, and the recorded content of the adaptive codebook 12b becomes the same as the recorded content of the adaptive codebook 12a. Will be updated. Further, in the above-mentioned embodiment, the vector-driven type speech coding apparatus has been described, but it is also possible to perform coding and decoding by computer simulation using the same method. In this case, the operation of the driving sound source determination unit 18, the adaptive bit allocation unit 22a, and the adaptive bit allocation unit 22b is performed by software, and the operation of other components is performed by hardware or software. Therefore, the error signal used for determining the driving sound source signal is not limited to the power.
In this case, needless to say, it is necessary to perform analog / digital conversion on the weighted input signal before the encoding.

[Brief description of drawings]

【図１】この発明の一実施例のベクトル駆動型音声符号
化装置の符号化装置を示すブロック図である。FIG. 1 is a block diagram showing an encoding device of a vector-driven speech encoding device according to an embodiment of the present invention.

【図２】この発明の一実施例のベクトル駆動型音声符号
化装置の復号化装置を示すブロック図である。FIG. 2 is a block diagram showing a decoding device of a vector-driven voice encoding device according to an embodiment of the present invention.

【図３】従来のベクトル駆動型音声符号化装置の符号化
装置を示すブロック図である。FIG. 3 is a block diagram showing an encoding device of a conventional vector-driven speech encoding device.

【図４】従来のベクトル駆動型音声符号化装置の復号化
装置を示すブロック図である。FIG. 4 is a block diagram showing a decoding device of a conventional vector-driven voice encoding device.

[Explanation of symbols]

１０ …ベクトル駆動型音声符号化装置１０ａ …符号化装置１０ｂ …復号化装置１２ａ，１２ｂ …適応コードブック１４ａ，１４ｂ …雑音コードブック１６ａ，１６ｂ，１６ｃ，１６ｄ …乗算器１８ …駆動音源信号決定部２０ …加算器２２ａ，２２ｂ …適応ビット割当部２４ａ，２４ｂ …重み付け合成フィルタ２６ …比較器 10 ... Vector-driven speech coding device 10a ... Encoding device 10b ... Decoding device 12a, 12b ... Adaptive codebook 14a, 14b ... Noise codebook 16a, 16b, 16c, 16d ... Multiplier 18 ... Driving excitation signal determination unit 20 ... Adder 22a, 22b ... Adaptive bit allocation part 24a, 24b ... Weighting synthesis filter 26 ... Comparator

───────────────────────────────────────────────────── フロントページの続き (72)発明者杉下正蔵大阪府守口市京阪本通２丁目18番地三洋電機株式会社内 ─────────────────────────────────────────────────── ─── Continuation of the front page (72) Inventor Shozo Sugishita 2-18 Keihan Hondori, Moriguchi City, Osaka Sanyo Electric Co., Ltd.

Claims

[Claims]

1. A first sound source using a pitch vector component extracted from an adaptive codebook and a code vector component extracted from a noise codebook as driving sound sources, and a code vector of the noise codebook. Driving sound source creating means for creating a second sound source that uses only components and a third sound source that uses only components of the pitch vector of the adaptive codebook, according to the first, second, and third sound sources Adaptive bit allocation means for adaptively allocating the amount of information to the pitch vector index and gain of the adaptive codebook and the code vector index and gain of the noise codebook, and obtaining the driving sound source through a weighting synthesis means. Comparing means for comparing the reproduced signal to be reproduced with the weighted input signal, and the comparing means. Comparison said first according to the result, a drive sound source determination means for determining one of the second and third sound source vector driven speech coding apparatus.