JP3292227B2

JP3292227B2 - Code-excited linear predictive speech coding method and decoding method thereof

Info

Publication number: JP3292227B2
Application number: JP32862294A
Authority: JP
Inventors: 一則間野
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1994-12-28
Filing date: 1994-12-28
Publication date: 2002-06-17
Anticipated expiration: 2017-06-17
Also published as: JPH08185198A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】この発明は適応符号帳から選択し
た過去の駆動音源ベクトルをピッチ周期で繰り返すベク
トルと、雑音符号帳から選択した雑音ベクトルとを重み
付け合成し、その合成ベクトルで線形予測合成フィルタ
を駆動して音声合成し、この合成音声信号の入力音声信
号に対する歪みが最小になるように、適応符号帳及び雑
音符号帳からの各ベクトルの選択を行って入力音声信号
を符号化する符号励振線形予測音声符号化方法、及びこ
の符号化方法による復号化出力を復号する復号化方法に
関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention weights and synthesizes a vector that repeats a past excitation vector selected from an adaptive codebook at a pitch cycle and a noise vector selected from a noise codebook, and performs linear prediction synthesis using the synthesized vector. A code for encoding the input speech signal by selecting each vector from the adaptive codebook and the noise codebook so that the filter is driven to perform speech synthesis, and the distortion of the synthesized speech signal with respect to the input speech signal is minimized. The present invention relates to an excitation linear predictive speech coding method and a decoding method for decoding a decoded output by the coding method.

【０００２】[0002]

【従来の技術】ディジタル移動通信や、音声蓄積サービ
スでは、電波や記憶媒体の効率的利用を図るために、種
々の高能率音声符号化法が用いられている。例えば、符
号励振線形予測符号化（ＣｏｄｅＥｘｃｉｔｅｄＬ
ｉｎｅａｒＰｒｅｄｉｃｔｉｏｎ，ＣＥＬＰ）、ベク
トル和励振線形予測符号化（ＶｅｃｔｏｒＳｕｍＥ
ｘｃｉｔｅｄＬｉｎｅａｒＰｒｅｄｉｃｔｉｏｎ，
ＶＳＥＬＰ）といった音声符号化法が知られている。そ
れぞれの技術については、Ｍ．Ｒ．Ｓｃｈｒｏｅｄｅｒ
ａｎｄＢ．Ｓ．Ａｔａｌ：“Ｃｏｄｅ−Ｅｘｃｉｔ
ｅｄＬｉｎｅａｒＰｒｅｄｉｃｔｉｏｎ（ＣＥＬ
Ｐ）：Ｈｉｇｈ−ｑｕａｌｉｔｙＳｐｅｅｃｈａｔ
ＶｅｒｙＬｏｗＢｉｔＲａｔｅｓ”，Ｐｒｏ
ｃ．ＩＣＡＳＳＰ−８５，２５．１．１，ｐｐ．９３７
−９４０，（１９８５年）、および、Ｉ．Ａ．Ｇｅｒｓ
ｏｎａｎｄＭ．Ａ．Ｊａｓｉｕｋ：“Ｖｅｃｔｏｒ
ＳｕｍＥｘｃｉｔｅｄＬｉｎｅａｒＰｒｅｄｉｃ
ｔｉｏｎ（ＶＳＥＬＰ）ＳｐｅｅｃｈＣｏｄｉｎｇ
ａｔ８ｋｂｐｓ”，Ｐｒｏｃ．ＩＣＡＳＳＰ−９０，
Ｓ９．３，ｐｐ．４６１−４６４，（１９９０年）に述
べられている。2. Description of the Related Art In digital mobile communications and voice storage services, various high-efficiency voice coding methods are used in order to efficiently use radio waves and storage media. For example, code excitation linear prediction coding (Code Excited L)
inner Prediction, CELP), Vector Sum Excited Linear Prediction Coding (Vector Sum E)
xcited Linear Prediction,
VSELP) is known. For each technology, see M.E. R. Schroeder
and B. S. Atal: "Code-Excit
ed Linear Prediction (CEL
P): High-quality Speech at
Very Low Bit Rates ”, Pro
c. ICASSP-85, 25.1.1, pp. 937
-940, (1985); A. Gers
on and M. A. Jasiuk: "Vector
SumExcited Linear Predic
Tion (VSELP) Speech Coding
at 8 kbps ", Proc. ICASP-90,
S9.3, pp. 461-464 (1990).

【０００３】これらは、５ｍｓから５０ｍｓ程度を１フ
レームとし、過去の励振信号からなるピッチ適応符号帳
中の一つの適応符号ベクトルと、あらかじめ蓄積してお
いた固定的な雑音又はパルス列からなる雑音符号帳の雑
音符号ベクトルとの重み付き和を励振信号とする。この
励振信号を線形予測合成フィルタに通した合成波形と入
力音声との聴覚重み付き波形歪みを最小とするように、
適応符号，雑音符号，利得符号を決定する。[0003] In these, one frame is composed of about 5 ms to 50 ms, and one adaptive code vector in a pitch adaptive codebook composed of past excitation signals and a noise code composed of a fixed noise or a pulse train stored in advance. The weighted sum with the noise code vector of the book is used as the excitation signal. To minimize the auditory weighted waveform distortion between the input signal and the synthesized waveform obtained by passing the excitation signal through the linear prediction synthesis filter,
Determine the adaptive code, noise code, and gain code.

【０００４】図５に従来法として、ＣＥＬＰ符号化法の
基本的な構成を示す。まず、入力端子１からの入力音声
信号がディジタル信号として入力され、この入力音声信
号は線形予測分析部２において音声の線形予測分析が行
われ、更に量子化され、予測係数Ａとして符号化出力の
一部とされると共に合成フィルタ３の係数として設定さ
れる。適応符号帳４には、直前の過去の合成フィルタ３
への駆動入力として使用された励振音源ベクトルが蓄え
られ、この適応符号帳４中のベクトルを切出し合成フィ
ルタ３の合成波形歪みが最小となるように、適応符号帳
４の符号Ｌに対応するピッチ周期Ｔ（Ｌ）で繰り返して
適応符号ベクトルとする。つまり図６Ａに示すように適
応符号帳４に蓄えられている過去の励振信号Ｓpからピ
ッチ周期Ｔ（Ｌ）だけ切り出し、これを繰り返し、つま
り周期化して適応符号ベクトルＶa とし、このようにし
て得られる各種ピッチ周期Ｔ（Ｌ）の適応符号ベクトル
Ｖa 中の合成音声の歪みが最小となるものを選択する。FIG. 5 shows a basic configuration of a CELP coding method as a conventional method. First, the input audio signal from the input terminal 1 is inputted as a digital signal, the input audio signal in the linear prediction analyzer 2 are linear prediction analysis of the speech is done, be further quantized, the coded output as prediction coefficients A It is set as a part and set as a coefficient of the synthesis filter 3. The adaptive codebook 4 includes the immediately preceding synthesis filter 3
The excitation source vector used as a driving input to the adaptive codebook 4 is stored, the vector in the adaptive codebook 4 is cut out, and the pitch corresponding to the code L of the adaptive codebook 4 is minimized so that the synthesized waveform distortion of the synthesis filter 3 is minimized. An adaptive code vector is repeated at a period T (L). That is, as shown in FIG. 6A, the past excitation signal Sp stored in the adaptive codebook 4 is cut out by the pitch period T (L), and this is repeated, that is, is cycled to obtain an adaptive code vector Va. In the adaptive code vector Va of various pitch periods T (L), the one that minimizes the distortion of the synthesized speech is selected.

【０００５】雑音符号帳５には複数の雑音又はパルス列
からなる雑音符号ベクトルが予め格納されており、この
雑音符号ベクトルの中から適応符号ベクトルとともに合
成波形歪み最小となるベクトルを選択する。最後に、乗
算器７，８でそれぞれ選択した適応符号ベクトル、選択
した雑音符号ベクトルに対し、それぞれ利得を乗算して
重み付けし、この重み付けされた両ベクトルを加算器９
で加算して合成フィルタ３に対する励振信号とし、乗算
器７，８の各利得符号Ｇ０，Ｇ１も、波形歪み最小とな
るように最適化される。合成フィルタ３からの合成波形
と入力音声波形との聴覚重み付きの歪みが歪み計算部１
０で求められ、この歪みが最小になるように符号帳検索
部１１で適応符号帳４、雑音符号帳５の各ベクトルの選
択、乗算器７，８に与える各利得を制御し、その結果と
しての適応符号帳４の選択符号（ピッチ符号）Ｌと、雑
音符号帳５の選択符号Ｃと、乗算器７，８に利得符号Ｇ
０，Ｇ１と、前記量子化された予測係数Ａとを入力音声
の符号化出力とする。この符号化はフレーム単位（５〜
５０ミリ秒程度）ごとに行われる。[0005] A noise code vector composed of a plurality of noises or pulse trains is stored in the noise codebook 5 in advance, and a vector having the minimum combined waveform distortion is selected from the noise code vectors together with the adaptive code vector. Lastly, the adaptive code vector selected by the multipliers 7 and 8 and the selected noise code vector are multiplied by gains and weighted, and the weighted vectors are added to the adder 9.
, And the gain signals G0 and G1 of the multipliers 7 and 8 are also optimized to minimize the waveform distortion. The distortion with the auditory weight between the synthesized waveform from the synthesis filter 3 and the input speech waveform is calculated by the distortion calculator 1.
0, and the codebook search unit 11 controls the selection of each vector of the adaptive codebook 4 and the noise codebook 5 and the gain applied to the multipliers 7 and 8 so that this distortion is minimized. , The selection code (pitch code) L of the adaptive codebook 4, the selection code C of the noise codebook 5, and the gain code G
0, G1 and the quantized prediction coefficient A are coded output of input speech. This encoding is performed on a frame basis (5 to 5).
(About 50 milliseconds).

【０００６】雑音符号帳５は、適応符号ベクトルで表現
しきれなかった励振信号の残差波形を、さらに符号化す
るものである。この雑音符号帳５の構成には、当初、８
ｋｂｉｔ／ｓ程度の符号化では、ガウス雑音などランダ
ムな時系列ベクトルを生成し、それを用いていた。しか
し、４ｋｂｉｔ／ｓ以下の低ビットレート化では、ベク
トルあたりのビット割当を減らしたり、ベクトルの次元
数を増やしたりする必要があり、この場合には、雑音符
号帳で表現する残差信号は、必ずしもランダムなもので
はなく、ピッチ周期性やパルスを多く含んだ信号となる
ので、雑音符号帳の表現の多様化が重要になってきてい
る。The noise codebook 5 further encodes the residual waveform of the excitation signal that could not be represented by the adaptive code vector. The configuration of the random codebook 5 initially includes 8
In encoding at about kbit / s, a random time-series vector such as Gaussian noise is generated and used. However, to reduce the bit rate to 4 kbit / s or less, it is necessary to reduce the bit allocation per vector or increase the number of dimensions of the vector. In this case, the residual signal expressed by the noise codebook is Since the signal is not necessarily random but contains a large number of pitch periodicity and pulses, diversification of the expression of the random codebook is becoming important.

【０００７】雑音励振源の多様化技術として、ピッチ同
期雑音励振源符号化方法（ＰｉｔｃｈＳｙｎｃｈｏｒ
ｎｏｕｓＩｎｎｏｖａｔｉｏｎ−ＣＥＬＰ：ＰＳＩ−
ＣＥＬＰ）がある。この技術については、三樹，守谷，
間野，大室：“ピッチ同期雑音励振源をもつＣＥＬＰ符
号化（ＰＳＩ−ＣＥＬＰ）”，電子情報通信学会論文
誌，Ａ，Ｖｏｌ．，Ｊ７７−Ａ，Ｎｏ．３，ｐｐ．３１
４−３２４（１９９４年３月）に述べられている。これ
は、図６Ｂに示すように適応符号帳４から得られるピッ
チ周期Ｔ（Ｌ）を用いて、適応符号ベクトルＶa と同様
に、雑音符号帳５から選択した固定雑音ベクトルＶscを
先頭からＴ（Ｌ）だけ取り出し、これを繰り返し、つま
り周期化して雑音符号化ベクトルＶs として用いる。な
お適応符号帳４でピッチが得られない場合は雑音ベクト
ルの周期化は行わない。As a technique for diversifying noise excitation sources, a pitch synchronous noise excitation source coding method (Pitch Syncor)
nous Innovation-CELP: PSI-
CELP). About this technology, Miki, Moriya,
Mano, Omuro: “CELP coding with pitch synchronous noise excitation source (PSI-CELP)”, IEICE Transactions, A, Vol. , J77-A, No. 3, pp. 31
4-324 (March 1994). This uses a pitch period T (L) obtained from the adaptive codebook 4 as shown in FIG. 6B, and adds a fixed noise vector Vsc selected from the noise codebook 5 from the top to T (L) in the same manner as the adaptive code vector Va. L), and this is repeated, that is, periodicized and used as the noise coded vector Vs. If the pitch cannot be obtained by the adaptive codebook 4, the noise vector is not periodized.

【０００８】さらに、ＰＳＩ−ＣＥＬＰでは、雑音符号
帳は、大規模なデータベースからの学習によって求めて
いる。この技術については、守谷，三樹，大室，間野：
“ＣＥＬＰ符号化における励振符号帳の学習”，電子情
報通信学会論文誌，Ａ，Ｖｏｌ．，Ｊ７７−Ａ，Ｎｏ．
３，ｐｐ．４８５−４９３（１９９４年３月）に述べら
れている。これは、一般化Ｌｌｏｙｄアルゴリズムに基
づくベクトル量子化器設計法を用い、実際の符号化時の
聴感重み付き波形歪みを最小にする尺度を用いる手法で
ある。[0008] In PSI-CELP, the noise codebook is obtained by learning from a large-scale database. About this technology, Moriya, Miki, Omuro, Mano:
“Exercise Codebook Learning in CELP Coding”, IEICE Transactions, A, Vol. , J77-A, No.
3, pp. 485-493 (March 1994). This is a method that uses a vector quantizer design method based on a generalized Lloyd algorithm and uses a measure that minimizes auditory weighted waveform distortion during actual encoding.

【０００９】これらの雑音符号帳は、ＣＥＬＰの場合に
は、常に固定されたものであり、また、ＰＳＩ−ＣＥＬ
Ｐにおいてもピッチ周期化を行なう前の符号帳は、ピッ
チ周期に関係なく常に同じ雑音符号帳から雑音ベクトル
を生成する構成になっている。[0009] These noise codebooks are always fixed in the case of CELP, and PSI-CEL
In P, the codebook before pitch periodization is configured to always generate a noise vector from the same noise codebook regardless of the pitch period.

【００１０】[0010]

【発明が解決しようとする課題】適応符号ベクトルにピ
ッチ周期性がある場合には、ピッチピーク位置周辺に大
きなパワの残差が偏って分布する。しかし、従来技術で
は、雑音符号帳自体は、固定されたものであり、適応符
号ベクトルのピッチ周期にあわせて雑音符号ベクトルを
ピッチ周期化する手法を適用しても、雑音符号ベクトル
に含まれているピッチ周期性が適応符号ベクトルのピッ
チ周期性と異なる場合には、符号化効率を十分あげるこ
とができない。When the adaptive code vector has pitch periodicity, large power residuals are distributed around the pitch peak position. However, in the related art, the random codebook itself is fixed, and even if a method of making the random code vector pitch-period in accordance with the pitch period of the adaptive code vector is included in the random code vector. If the pitch periodicity is different from the pitch periodicity of the adaptive code vector, the coding efficiency cannot be sufficiently improved.

【００１１】この発明の目的は、新たな伝送情報を増や
さずに、各フレームに適合した雑音符号ベクトルを多く
含むような適応化機能を雑音符号帳にもたせて励振音源
を多様化し、高品質音声を再生する低ビットレート音声
符号化方法及びその復号化方法を提供することにある。An object of the present invention is to provide a noise codebook with an adaptation function that includes a large number of noise code vectors suitable for each frame without increasing new transmission information, thereby diversifying excitation sources and providing high-quality speech. To provide a low bit rate audio encoding method for reproducing audio and a decoding method thereof.

【００１２】[0012]

【課題を解決するための手段】この発明の符号化方法
は、適応符号ベクトルのピッチ周期と対応する複数個の
雑音符号帳を用意し、適応符号帳で選択されたピッチ周
期に対応して雑音符号帳を選択し、その選択した雑音符
号帳を用いて雑音符号ベクトルの選択を行う。この発明
の復号化方法はこの発明の符号化方法と同様に複数の雑
音符号帳を用意し、適応符号帳で選択されたピッチ周期
に対応した雑音符号帳を選択し、その選択した雑音符号
帳から雑音符号ベクトルを選択する。According to the encoding method of the present invention, a plurality of noise codebooks corresponding to a pitch period of an adaptive code vector are prepared, and a noise code corresponding to a pitch period selected by the adaptive codebook is prepared. A codebook is selected, and a random code vector is selected using the selected random codebook. The decoding method of the present invention prepares a plurality of noise codebooks in the same manner as the encoding method of the present invention, selects a noise codebook corresponding to the pitch period selected by the adaptive codebook, and selects the selected noise codebook. From the random code vector.

【００１３】[0013]

【実施例】図１に提案された音声符号化方法の実施例を
示し、図５と同じ番号の部分は同一のものを示してい
る。図１は、図５と比較して複数の雑音符号帳５₁ 〜５
_Nが用意され、適応符号帳４からのピッチ周期符号Ｌに
より切換器１５が制御されて雑音符号帳５₁ 〜５_Nの１
つが選択される点が異なる。雑音符号帳５₁ 〜５_Nは、
例えば図２Ａに示す各Ｌの範囲毎のデータをもとにベク
トル量子化器設計によって作られる。この時、ＬはＴ
（Ｌ）の小さい順に番号付けられているとする。これに
より、雑音符号帳５₁ 〜５_Nには、ピッチ周期に対応し
た雑音符号ベクトルを格納しておくことができる。例え
ば多数の学習音声を逆フィルタに通して、適応符号帳４
に格納される成分を取り去り、その残差信号中の、ピッ
チ周期Ｔ（Ｌ）がＴ（ｌ₀ ）とＴ（ｌ₁ ）との範囲の学
習音声と対応するものを集め、これを雑音符号の学習デ
ータとして複数の代表雑音符号ベクトルを作成し、これ
を雑音符号帳５₁ とし、前記残差信号中の、ピッチ周期
Ｔ（Ｌ）がＴ（ｌ₁ ）とＴ（ｌ₂ ）との範囲の学習音声
と対応するものを集め、これを雑音符号の学習データと
して複数の代表雑音符号ベクトルを作成し、これを雑音
符号帳５₂ とし、以下同様に他の雑音符号帳を作成す
る。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS FIG. 1 shows an embodiment of the proposed speech coding method, and the same reference numerals in FIG. 5 denote the same parts. 1, a plurality of random codebook 5 ₁ to 5 in comparison with FIG. 5
_N are provided, the first noise codebook 5 ₁ to 5 _N switching unit 15 by the pitch period codes L from the adaptive codebook 4 is controlled
The difference is that one is selected. Noise codebook 5 ₁ to 5 _N is
For example, it is created by vector quantizer design based on the data for each range of L shown in FIG. 2A. At this time, L is T
It is assumed that numbers are assigned in ascending order of (L). Thus, the noise codebook 5 ₁ to 5 _N can be stored the random code vector corresponding to the pitch period. For example, a large number of learning voices are passed through an inverse filter to generate an adaptive codebook 4
Are collected, and those in the residual signal corresponding to the learning speech whose pitch period T (L) is in the range of T (l ₀ ) and T (l ₁ ) are collected, and are collected as noise codes. of creating a plurality of representative noise code vector as the learned data, which was used as a random codebook 5 _1, in the residual signal, a pitch period T (L) is T (l ₁₎ and T (l ₂₎ and the collect those corresponding range of training speech, which creates a plurality of representative noise code vector as a noise code training data, which was used as a random codebook 5 _2, similarly create other noise codebook below.

【００１４】この符号化装置により入力音声を符号化す
るには、従来と同様に入力音声の線形予測分析を行い、
量子化された予測係数Ａを求めて合成フィルタ３の係数
とし、適応符号帳４のベクトルから、合成波形歪み最小
となるように、ピッチ周期Ｔ（Ｌ）で周期化した適応符
号ベクトルを求める。この求めたベクトルと対応する符
号Ｌに基づいて切換器１５が、図２Ａの対応表により、
雑音符号帳５₁〜５_Nから使用する雑音符号帳の番号を
決定し、対応する雑音符号帳５_iを選択する。こうして
選択された雑音符号帳５_iから、まず、乗算器７，８の
利得符号Ｇ０，Ｇ１を最適な値をとれるとして、波形歪
み最小となる雑音符号ベクトルＣを決める。その後、決
定した適応符号ベクトルと雑音符号ベクトルＣに関し
て、利得符号Ｇ０，Ｇ１を、波形歪み最小となるように
量子化する。その他は図５に示した場合と同一である。In order to encode the input speech by this encoding device, linear prediction analysis of the input speech is performed in the same manner as in the prior art.
An quantized prediction coefficient A is obtained and used as a coefficient of the synthesis filter 3, and an adaptive code vector periodicized by the pitch period T (L) is obtained from the vector of the adaptive codebook 4 so as to minimize the synthesized waveform distortion. On the basis of the obtained vector and the code L corresponding to the obtained vector, the switch 15 uses the correspondence table of FIG.
Determining the number of codebook to be used from the noise codebook 5 ₁ to 5 _N, selects the corresponding codebook 5 _i. From the noise codebook 5 _i selected in this way, first, assuming that the gain codes G0 and G1 of the multipliers 7 and 8 can take optimal values, the noise code vector C with the minimum waveform distortion is determined. Thereafter, with respect to the determined adaptive code vector and noise code vector C, the gain codes G0 and G1 are quantized so as to minimize the waveform distortion. Others are the same as those shown in FIG.

【００１５】図２Ｂに、提案された音声復号化方法の実
施例を示し、図１に示した符号化方法で符号化された符
号を復号化する場合であって、図１中の適応符号帳４、
雑音符号帳５₁ 〜５_Nとそれぞれ同一の適応符号帳２
４、雑音符号帳２５₁ 〜２５_Nとが設けられ、入力符号
Ａ，Ｌ，Ｃ，Ｇ０，Ｇ１を復号化する。その入力された
量子化予測係数Ａが合成フィルタ２３の係数として設定
され、乗算器２７，２８はそれぞれ利得符号Ｇ０，Ｇ１
で決まる利得が設定される。適応符号帳２４に図１の場
合と同様に直前の過去の合成フィルタ２３への入力とし
て使用された励振音源ベクトルが蓄えられ、この適応符
号帳２４中のベクトルから、入力符号Ｌで決まるピッチ
周期Ｔ（Ｌ）で周期化した適応符号ベクトルが作られて
乗算器２７へ出力される。また入力符号Ｌの値に基づい
て、切換器２６に制御されて雑音符号帳２５₁ 〜２５_N
から使用する雑音符号帳を選択し、その選択した雑音符
号帳の中から、入力符号Ｃの雑音符号ベクトルを選択し
て乗算器２８へ出力する。乗算器２７，２８の出力は加
算器２９で加算されて合成フィルタ２３に駆動信号とし
て供給され、合成フィルタ２３の出力合成音声はポスト
フィルタ３１でフォルマント強調、ピッチ強調が実施さ
れて最終的な音声出力が得られる。FIG. 2B shows an embodiment of the proposed speech decoding method, in which a code coded by the coding method shown in FIG. 1 is decoded. 4,
Noise codebook 5 ₁ to 5 _N and same adaptive each codebook 2
4. The random codebooks 25 _{1 to} 25 _N are provided to decode the input codes A, L, C, G0, and G1. The input quantized prediction coefficient A is set as a coefficient of the synthesis filter 23, and the multipliers 27 and 28 output gain codes G0 and G1 respectively.
Is set. As in the case of FIG. 1, the excitation source vector used as an input to the immediately preceding past synthesis filter 23 is stored in the adaptive codebook 24, and the pitch period determined by the input code L from the vector in the adaptive codebook 24 is stored. An adaptive code vector periodicized by T (L) is generated and output to the multiplier 27. Further, based on the value of the input code L, it is controlled by the switch 26 to control the noise codebooks 25 _{1 to} 25 _N
, A noise codebook to be used is selected, and a noise code vector of the input code C is selected from the selected noise codebook and output to the multiplier 28. The outputs of the multipliers 27 and 28 are added by an adder 29 and supplied as a drive signal to a synthesis filter 23. The output synthesized voice of the synthesis filter 23 is subjected to formant emphasis and pitch emphasis by a post filter 31 and the final voice The output is obtained.

【００１６】図３に請求項１の音声符号化方法の実施例
を示し、図１と対応する部分に同一番号を付けてある。
この装置は従来の技術の項中で述べたＰＳＩ−ＣＥＬＰ
にこの発明を適用した場合で、適応符号帳４には、ピッ
チ周期性の無い場合に、直前の過去の励振音源ベクトル
からなる符号ベクトル以外に、予め固定した雑音ベクト
ルが格納されている固定符号帳４ｂが設けられ、また、
切換器６１から出力される雑音符号ベクトルをピッチ周
期化する周期化制御部３２が設けられている。雑音符号
帳５₁ 〜５_Nの各符号帳は、図１の雑音符号帳設計と同
様に作られるが、この中には、適応符号ベクトルにピッ
チ周期性が無い場合に対応する雑音符号帳も含まれてい
る。FIG. 3 shows an embodiment of the speech encoding method according to claim 1 , in which parts corresponding to those in FIG. 1 are given the same numbers.
This device is the PSI-CELP described in the section of the prior art.
In the case where the present invention is applied, the adaptive codebook 4 has a fixed code in which, when there is no pitch periodicity, a noise vector fixed in advance is stored in addition to the code vector including the immediately preceding excitation source vector. A book 4b is provided,
A period control section 32 is provided to make the pitch cycle of the noise code vector output from the switch 61. Each codebook codebook 5 ₁ to 5 _N, which is made in the same manner as the noise codebook design of FIG. 1, therein is also random codebook corresponding to the case where there is no pitch periodicity to the adaptive code vector include.

【００１７】入力音声は図１の場合と同様に線形予測分
析され、量子化され、その量子化予測係数Ａが合成フィ
ルタ３の係数となる。適応符号帳４では、合成波形歪み
最小となるように、適応符号帳４ａのベクトルあるい
は、固定符号帳４ｂの中の固定的な雑音ベクトルのいず
れかが選択され、この選択されたベクトルを示す符号Ｌ
には、ピッチ周期を示す符号と固定符号帳４ｂの番号を
示す符号が含まれる。The input speech is subjected to linear prediction analysis and quantization as in the case of FIG. 1, and the quantized prediction coefficient A becomes a coefficient of the synthesis filter 3. In the adaptive codebook 4, either the vector of the adaptive codebook 4a or the fixed noise vector in the fixed codebook 4b is selected so as to minimize the synthesized waveform distortion, and a code indicating the selected vector is selected. L
Includes a code indicating the pitch period and a code indicating the number of the fixed codebook 4b.

【００１８】切換器６１では、この符号Ｌの値に基づい
て図２Ａに示す対応表により、使用する雑音符号帳の番
号を決定し、その雑音符号帳を選択する。例えば、ここ
で、雑音符号帳５₁〜５_N-1は、符号Ｌがピッチ周期性
のある場合の符号帳とし、雑音符号帳５_Nはピッチ周期
性のない符号Ｌに対応するものとする。こうして選択さ
れた雑音符号帳の格納ベクトルに対して、ピッチ周期化
制御部３２において、符号Ｌがピッチ周期性のある符号
の場合には、入力されている雑音ベクトルを図６Ｂに示
したようにピッチ周期Ｔ（Ｌ）で繰り返して、周期化さ
れた雑音ベクトルを生成する。この操作により、雑音ベ
クトルとして、符号Ｌに対応するピッチ周期性をもつ雑
音符号帳の格納ベクトルを対応するピッチ周期で周期化
することにより、量子化歪みを少なくする可能性の高い
ベクトル候補を多く生成することができる。符号Ｌがピ
ッチ周期性のない場合には、雑音ベクトルは周期化しな
い。つまり雑音符号帳５_Nが選択され、その雑音符号帳
５_Nから選択した雑音符号ベクトルは周期化しないで乗
算器８へ供給する。その他は図１の場合と同一である。The switch 61 determines the number of the noise codebook to be used from the correspondence table shown in FIG. 2A based on the value of the code L, and selects the noise codebook. For example, where the noise codebook 5 _₁ ~5 _N-1 is the codebook when the sign L is a pitch periodicity, the noise codebook 5 _N shall correspond to the code L no pitch periodicity . When the code L is a code having a pitch periodicity with respect to the storage vector of the noise codebook selected in this way, the pitch noise control unit 32 converts the input noise vector as shown in FIG. By repeating with a pitch period T (L), a periodic noise vector is generated. By this operation, as a noise vector, the storage vector of the noise codebook having the pitch periodicity corresponding to the code L is periodicized at the corresponding pitch period, thereby increasing the number of vector candidates likely to reduce the quantization distortion. Can be generated. If the code L has no pitch periodicity, the noise vector is not periodic. That codebook 5 _N is selected, and supplies a random code vector selected from the noise code book 5 _N is to the multiplier 8 without periodic. Others are the same as those in FIG.

【００１９】図４に、請求項２の音声復号化方法の実施
例を示し、図２Ｂと対応する部分に同一符号を付けてあ
る。この復号化法は図３に示した符号化方法による符号
を復号化する場合である。従って適応符号帳４として適
応符号帳４ａと固定符号帳４ｂとそれぞれ同一の適応符
号帳２４ａと固定符号帳２４ｂとよりなり、ピッチ周期
化制御部３３が切換器２６の出力側に設けられる。FIG. 4 shows an embodiment of the speech decoding method according to claim 2 , in which the same reference numerals are given to the parts corresponding to those in FIG. 2B. This decoding method is for decoding a code by the encoding method shown in FIG. Therefore, the adaptive codebook 4 is composed of the same adaptive codebook 24a and fixed codebook 24b as the adaptive codebook 4a and the fixed codebook 4b, respectively, and the pitch period control unit 33 is provided on the output side of the switch 26.

【００２０】入力量子化された予測係数Ａが合成フィル
タ２３の係数として設定され、適応符号帳２４では入力
符号Ｌに基づいて、適応符号帳２４ａからの適応符号ベ
クトルあるいは、固定符号帳２４ｂからの固定的な雑音
ベクトルのいずれかが選択され、切換器２６では、この
符号Ｌの値に基づいて、図２Ａに示す対応表により、使
用する雑音符号帳の番号を決定し、対応する雑音符号帳
を選択する。こうして選択された雑音符号帳の格納ベク
トルに対して、ピッチ周期化制御部３３において、符号
Ｌがピッチ周期性のある符号の場合には、選択されてい
る雑音ベクトルを図６Ｂに示したように、ピッチ周期で
繰り返して、周期化された雑音ベクトルを生成する。符
号Ｌがピッチ周期性のない場合には、雑音ベクトルは周
期化しない。そして、適応符号ベクトルと雑音符号ベク
トルに乗算器２７，２８での利得符号Ｇ０，Ｇ１と対応
する利得を乗じ、合成フィルタ２３に通す。その合成フ
ィルタ２３の出力をポストフィルタ３１でフォルマント
強調、ピッチ強調を実施し、最終的な音声出力を得る。The input quantized prediction coefficient A is set as a coefficient of the synthesis filter 23. In the adaptive codebook 24, based on the input code L, the adaptive code vector from the adaptive codebook 24a or the adaptive codebook 24b from the fixed codebook 24b. One of the fixed noise vectors is selected, and the switch 26 determines the number of the noise codebook to be used from the correspondence table shown in FIG. Select With respect to the storage vector of the noise codebook thus selected, in the case where the code L is a code having a pitch periodicity, the selected noise vector is stored in the pitch period control unit 33 as shown in FIG. 6B. , And a pitch cycle to generate a periodic noise vector. If the code L has no pitch periodicity, the noise vector is not periodic. Then, the adaptive code vector and the noise code vector are multiplied by gains corresponding to the gain codes G0 and G1 in the multipliers 27 and 28, and are passed through the synthesis filter 23. The output of the synthesis filter 23 is subjected to formant emphasis and pitch emphasis by the post filter 31 to obtain a final audio output.

【００２１】[0021]

【発明の効果】以上のように、この発明は、複数の雑音
符号帳を備え、適応符号帳で得られるピッチ周期、ある
いは、ピッチ周期性の有無によって雑音符号帳を切り換
えるので、入力音声の特徴に適応した雑音符号ベクトル
を選択することができる。さらに、雑音符号帳切換えに
は適応符号帳の選択したピッチ周期を用いるため新たな
情報を伝送する必要はなく、伝送情報を増やさずに、雑
音符号帳を適応化させて符号化効率を高め、低ビットレ
ートでも高品質な音声符号化が可能になる。As described above, the present invention comprises a plurality of random codebooks and switches between random codebooks depending on the pitch period obtained by the adaptive codebook or the presence or absence of pitch periodicity. Can be selected. Furthermore, it is not necessary to transmit new information because the pitch period selected by the adaptive codebook is used for switching the noise codebook, and without increasing the transmission information, the noise codebook is adapted to increase the coding efficiency, High quality speech coding is possible even at a low bit rate.

[Brief description of the drawings]

【図１】提案された音声符号化装置の例を示すブロック
図。FIG. 1 is a block diagram showing an example of a proposed speech encoding device.

【図２】Ａはこの発明の実施例による適応符号帳からの
符号Ｌと切換える雑音符号帳との対応を示す図、Ｂは提
案された音声復号化装置の例を示すブロック図である。[2] A is a diagram showing the correspondence between the noise codebook for switching the sign L from the adaptive codebook according to an embodiment of the invention, B is Hisage
It is a block diagram which shows the example of the devised audio | voice decoding apparatus.

【図３】請求項１の発明の実施例を適用した音声符号化
装置の例を示すブロック図。FIG. 3 is a block diagram showing an example of a speech encoding apparatus to which the embodiment of the first invention is applied.

【図４】請求項２の発明の実施例を適用した音声復号化
装置の例を示すブロック図。FIG. 4 is a block diagram showing an example of a speech decoding apparatus to which the embodiment of the invention of claim 2 is applied.

【図５】従来の符号励振線形予測音声符号化方法を用い
た符号化装置を示すブロック図。FIG. 5 is a block diagram showing a coding apparatus using a conventional code excitation linear prediction speech coding method.

【図６】Ａは適応符号帳の格納ベクトルから適応符号ベ
クトルを作成する様子を示す図、Ｂは雑音符号ベクトル
をピッチで周期化する様子を示す図である。FIG. 6A is a diagram illustrating a state in which an adaptive code vector is created from a vector stored in an adaptive codebook, and FIG. 6B is a diagram illustrating a state in which a random code vector is periodicized at a pitch.

Claims

(57) [Claims]

1. A linear predictive synthesis filter is driven by a weighted sum of a vector selected from an adaptive codebook that repeats a past excitation vector at a pitch cycle and a vector selected from a noise codebook stored in advance. In the code-excitation linear predictive speech coding method for synthesizing a speech signal, selecting a vector from each of the codebooks so as to minimize distortion of the synthesized speech signal with respect to the input speech signal, and encoding the input speech signal. Removes the above adaptive codebook vector components from the training speech
Learned with signals within a predetermined pitch period in the signal
A plurality of random codebooks storing the random codebooks stored therein , and selecting one random codebook from the plurality of random codebooks in accordance with the pitch period of the vector selected in the adaptive codebook; and selects the vector from the noise codebook, said at a pitch period vector selected in the adaptive codebook
Periodize the vector selected from the random codebook and calculate the weight
Adds the sum, and the vector selected in the above adaptive codebook has no pitch period
If this is the case, select the multiple random codebooks
The selected vector from the selected random codebook
A code excitation linear predictive speech coding method characterized by not performing the above-mentioned periodization.

2. A vector is selected from an adaptive codebook that repeats a past excitation vector at a pitch cycle in accordance with an input code and a noise codebook that has been stored in advance. In a code excitation linear predictive speech decoding method for decoding a speech signal by driving a predictive synthesis filter, a vector component of the adaptive codebook is removed from a learning speech.
Learned with signals within a predetermined pitch period in the signal
Multiple noise codebooks containing random code vectors
A plurality of random codebooks are selected from the plurality of random codebooks according to the pitch period selected by the adaptive codebook , and the vector is selected from the selected random codebooks. Above with the pitch period of the selected vector
Periodize the vector selected from the random codebook and calculate the weight
Adds the sum, and the vector selected in the above adaptive codebook has no pitch period
If this is the case, select the multiple random codebooks
The selected vector from the selected random codebook
A code-excited linear predictive speech decoding method characterized by not performing the above-mentioned periodization.