JP3275247B2

JP3275247B2 - Audio encoding / decoding method

Info

Publication number: JP3275247B2
Application number: JP11764691A
Authority: JP
Inventors: 聡三樹; 健弘守谷; 一則間野
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1991-05-22
Filing date: 1991-05-22
Publication date: 2002-04-15
Anticipated expiration: 2017-04-15
Also published as: JPH04344699A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】この発明は雑音符号帳を用い、符
号駆動線形予測符号化、ベクトル和駆動線形予測符号化
に適用され、音声の信号系列を少ない情報量でデジタル
符号化する高能率音声符号化方法、その復号化方法に関
する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention is applied to code-driven linear predictive coding and vector-sum driven linear predictive coding using a noise codebook, and is a highly efficient speech for digitally encoding a speech signal sequence with a small amount of information. The present invention relates to an encoding method and a decoding method thereof.

【０００２】[0002]

【従来の技術】現在、音声を高能率に符号化する方法と
して、原音声をフレームと呼ばれる５〜５０ms程度の一
定間隔の区間に分割し、その１フレームの音声を周波数
スペクトルの包絡形状と、その包絡形状に対応する線形
フィルタを駆動するための駆動音源信号という２つの情
報に分離し、それぞれを符号化することが提案されてい
る。その場合、駆動音源信号を符号化する方法として、
駆動音源信号を音声の基本周波数（ピッチ周期）に対応
すると考えられる周期成分と、それ以外の成分（言い換
えれば非周期成分）とに分離して符号化する方法が知ら
れている。この駆動音源情報の符号化法として符号駆動
線形予測符号化（Code-Excited Linear Prediction Cod
ing:CELP）およびベクトル和駆動線形予測符号化(Vecto
r Sum Excited Linear Prodiction Coding:VSELP）法が
ある。それぞれの技術については、M.R.Schroeder and
B.S.Atal : "Code-ExcitedLinear Prediction(CELP) :H
igh-quality Speech at Very Low Bit Rates", Proc.IC
ASSP'85,25.1.1,pp.937-940,1985 、およびI.A.Gerson
and M.A.Jasiuk :"Vector Sum Excited Linear Predict
ion (VSELP) Speech Coding at 8 kbps", Proc. ICASS
P'90,S9.3,pp.461-464,1990、に述べられている。2. Description of the Related Art At present, as a method for encoding speech with high efficiency, an original speech is divided into sections called frames, which have a constant interval of about 5 to 50 ms, and the speech of one frame is divided into an envelope shape of a frequency spectrum and It has been proposed to separate the information into two pieces of information, that is, a drive excitation signal for driving a linear filter corresponding to the envelope shape, and encode each of them. In that case, as a method of encoding the drive excitation signal,
There has been known a method of separating and encoding a drive excitation signal into a periodic component considered to correspond to a fundamental frequency (pitch cycle) of a voice and another component (in other words, an aperiodic component). Code-Excited Linear Prediction Cod (Code-Excited Linear Prediction Cod)
ing: CELP) and vector sum-driven linear prediction coding (Vecto
r Sum Excited Linear Prodiction Coding (VSELP) method. About each technology, MRSchroeder and
BSAtal: "Code-ExcitedLinear Prediction (CELP): H
igh-quality Speech at Very Low Bit Rates ", Proc.IC
ASSP'85, 25.1.1, pp. 937-940, 1985, and IAGerson
and MAJasiuk: "Vector Sum Excited Linear Predict
ion (VSELP) Speech Coding at 8 kbps ", Proc. ICASS
P'90, S9.3, pp. 461-464, 1990.

【０００３】これらの符号化方法は、図９に示すよう
に、入力端子１１に入力された原音声について音声分析
部１２において、その周波数スペクトルの包絡形状を表
すパラメータが計算される。この分析には通常、線形予
測法が用いられる。その線形予測パラメータは線形予測
パラメータ符号化部１３で符号化され、その符号化出力
は分岐され、線形予測パラメータ復号化部１４で復号化
され、その復号化された線形予測パラメータが線形予測
合成フィルタ１５のフィルタ係数として設定される。In these encoding methods, as shown in FIG. 9, a parameter representing an envelope shape of a frequency spectrum is calculated in a voice analysis unit 12 for an original voice input to an input terminal 11. Usually, a linear prediction method is used for this analysis. The linear prediction parameter is encoded by a linear prediction parameter encoding unit 13, the encoded output is branched, and decoded by a linear prediction parameter decoding unit 14, and the decoded linear prediction parameter is converted to a linear prediction synthesis filter. It is set as 15 filter coefficients.

【０００４】適応符号帳１６において直前の過去の駆動
音源ベクトルをある周期（ピッチ周期）に相当する長さ
で切り出し、その切り出したベクトルをフレームの長さ
になるまで繰り返し、音声の周期成分と対応する時系列
符号ベクトルの候補が出力される。また雑音符号帳１
７，１８から音声の非周期成分と対応する時系列符号ベ
クトルの候補が出力される。雑音符号帳１７，１８は図
１０に示すように通常白色ガウス性雑音を基調とし、１
フレーム分の長さの各種の符号ベクトルが入力音声とは
独立にあらかじめ記憶されている。In the adaptive codebook 16, the immediately preceding past excitation vector is cut out at a length corresponding to a certain period (pitch period), and the cut-out vector is repeated until the length of the frame becomes equal to the frame length. Is output. Noise codebook 1
7, 18 output time-series code vector candidates corresponding to the non-periodic components of the voice. The noise codebooks 17 and 18 are usually based on white Gaussian noise as shown in FIG.
Various code vectors of the length of the frame are stored in advance independently of the input speech.

【０００５】適応符号帳１６，雑音符号帳１７，１８か
らの各時系列ベクトルの候補は重みつき加算部１９にお
いて、それぞれ乗算部２１₁，２１₂，２１₃で重みｇ
₁，ｇ₂，ｇ₃が乗算され、これら乗算出力は加算部２
２で加算される。この加算出力は駆動音源ベクトルとし
て線形予測合成フィルタ１５へ供給され、合成フィルタ
１５から合成（再生）音声が出力される。この合成音声
の入力端子１１からの原音声に対する歪みが距離計算部
２３で計算され、その計算結果に応じて符号帳検索部２
４により、適応符号帳１６における切り出し長さをかえ
た候補が選択され、かつ雑音符号帳１７，１８から他の
符号ベクトルが選択され、さらに重みつき加算部１９の
重みｇ₁，ｇ₂，ｇ₃が変更され、距離計算部２３で計
算された歪みが最小になるようにされる。歪み最小とな
ったときの適応符号帳１６の切り出し長を示す周期符号
と、雑音符号帳１７，１８の各符号ベクトルを示す雑音
符号と、重みｇ₁，ｇ₂，ｇ₃を示す重み符号と、線形
予測パラメータ符号とが符号化出力として出力され、伝
送または蓄積される。The time series vector candidates from the adaptive codebook 16 and the noise codebooks 17 and 18 are weighted by an adder 19 in multipliers 21 ₁ , 21 ₂ and 21 ₃ , respectively.
₁ , g ₂ , and g ₃ are multiplied, and the output of these multiplications is
It is added by two. This added output is supplied to the linear prediction synthesis filter 15 as a drive excitation vector, and the synthesis filter 15 outputs a synthesized (reproduced) voice. The distortion of the synthesized speech with respect to the original speech from the input terminal 11 is calculated by the distance calculation unit 23, and the codebook search unit 2 is operated in accordance with the calculation result.
4, a candidate having a different cutout length in the adaptive codebook 16 is selected, another code vector is selected from the noise codebooks 17 and 18, and the weights g ₁ , g ₂ , and g of the weighted addition unit 19 are further selected. ₃ is changed so that the distortion calculated by the distance calculation unit 23 is minimized. A periodic code indicating the cut-out length of the adaptive codebook 16 when the distortion is minimized, a noise code indicating each code vector of the noise codebooks 17 and 18, and a weight code indicating weights g ₁ , g ₂ and g _3. , Linear prediction parameter codes are output as encoded outputs and transmitted or stored.

【０００６】復号化は図１１に示すように入力された線
形予測パラメータ符号が線形予測パラメータ復号化部２
６で復号化され、その予測パラメータが線形予測合成フ
ィルタ２７にフィルタ係数として設定される。それまで
に得られた直前の過去の駆動音源ベクトルと、入力され
た周期符号とを用いて適応符号帳２８からその周期で過
去の駆動音源ベクトルを切り出し、これをフレーム分繰
り返した時系列符号ベクトルが出力され、また入力され
た雑音符号が示す符号ベクトルが雑音符号帳２９，３１
からそれぞれ時系列ベクトルとして読み出される。これ
ら時系列ベクトルは重みつき加算部３２で入力された重
み符号に応じて、それぞれ重み付けがなされた後、加算
され、その加算出力が駆動音源ベクトルとして合成フィ
ルタ２７へ供給され、合成フィルタ２７から再生音声が
得られる。[0006] In the decoding, as shown in FIG.
6 and the prediction parameters are set as filter coefficients in the linear prediction synthesis filter 27. A time-series code vector obtained by cutting out the previous driving excitation vector from the adaptive codebook 28 in the cycle using the immediately preceding past driving excitation vector obtained up to that time and the input periodic code, and repeating this for the number of frames. Are output, and the code vector indicated by the input noise code is a random codebook 29, 31.
Are read out as time-series vectors. These time-series vectors are weighted in accordance with the weighting code input by the weighted addition unit 32, and then added, and the added output is supplied to the synthesis filter 27 as a drive excitation vector, and reproduced from the synthesis filter 27. Voice is obtained.

【０００７】雑音符号帳２９，３１は符号化に用いられ
た雑音符号帳１７，１８と同一のものとされる。雑音符
号帳は１個のみ、あるいはさらに多くのものが用いられ
ることもある。符号駆動線形予測符号化においては、雑
音符号帳には、候補となるべきすべての符号ベクトルが
直接記憶されてある。つまり、候補となるべき符号ベク
トルの数がＮならば、雑音符号帳に記憶されている符号
ベクトルの数もＮである。The random codebooks 29 and 31 are the same as the random codebooks 17 and 18 used for encoding. Only one or more noise codebooks may be used. In code-driven linear predictive coding, all code vectors to be candidates are directly stored in a random codebook. That is, if the number of code vectors to be candidates is N, the number of code vectors stored in the noise codebook is also N.

【０００８】ベクトル和駆動線形予測符号化では、雑音
符号帳は図１２に示すように、記憶されているすべての
符号ベクトル（基本ベクトルと呼ぶ）が同時に読み出さ
れ、乗算部３３₁〜３３_Mでそれぞれ雑音符号帳用復号
器３４により＋１または−１が乗算され、その乗算出力
が加算されて出力符号ベクトルとして出力される。従っ
て、各基本ベクトルに乗算する＋１，−１の組み合わせ
により、出力符号ベクトルの数は２^Mとなり、歪みが最
小となるようにこの２^Mの出力符号ベクトルの１つが選
択される。[0008] In the vector sum excited linear predictive coding, the noise codebook, as shown in FIG. 12, (referred to as a basic vector) all code vectors stored are read out simultaneously, multiplying unit 33 ₁ ~ 33 _M Are multiplied by +1 or −1 by the noise codebook decoder 34, and the multiplied outputs are added and output as an output code vector. Therefore, the number of output code vectors becomes 2 ^M by a combination of +1 and −1 by which each basic vector is multiplied, and one of the 2 ^M output code vectors is selected so as to minimize distortion.

【０００９】重みつき加算部１９での重みは、周期検索
時（適応符号帳１６の検索時）および符号ベクトル検索
時（雑音符号帳１７，１８の検索時）に論理的に最適に
定まるものをスカラー量子化する方法と、重み用符号帳
を持ち、これの検索を行って歪みが最小となるものを定
める方法とがある。The weight in the weighted adder 19 is determined logically optimally at the time of periodic search (when searching the adaptive codebook 16) and at the time of code vector search (when searching the noise codebooks 17 and 18). There is a method of performing scalar quantization and a method of having a codebook for weight and performing a search on the codebook to determine a codebook with minimum distortion.

【００１０】[0010]

【発明が解決しようとする課題】ところが、これらの従
来の方法では、駆動音源信号の周期性が前フレームの成
分のみに限定されるため、周期性の表現力が弱く、再生
音声がざらざらして滑らかさに欠けるという欠点を有し
ていた。この発明の目的は従来、前フレームに関する成
分のみで表現されていた駆動音源の周期性の表現力を強
化し、再生された音声をより滑らかに正確に表現する方
法を提供することにある。However, in these conventional methods, since the periodicity of the driving sound source signal is limited to only the components of the previous frame, the expressiveness of the periodicity is weak, and the reproduced sound is rough. It had the disadvantage of lacking smoothness. An object of the present invention is to provide a method for enhancing the expressiveness of the periodicity of a driving sound source, which has been conventionally expressed only with a component related to the previous frame, and more smoothly and accurately expressing reproduced sound.

【００１１】[0011]

【課題を解決するための手段】この発明によれば、音声
の周期性の表現力を強化するため、従来周期性をもたな
かった雑音符号帳から出力される符号ベクトルの一部ま
たは全部、あるいは出力される符号ベクトルの成分の一
部、もしくは複数の雑音符号帳の一部に適応符号帳の出
力時系列符号ベクトルの周期性と同一の周期性をもたせ
る。According to the present invention, in order to enhance the expressiveness of the periodicity of speech, a part or all of the code vector output from the noise codebook which has not had the periodicity conventionally, Alternatively, a part of a component of the output code vector or a part of a plurality of noise codebooks has the same periodicity as the periodicity of the output time-series code vector of the adaptive codebook.

【００１２】[0012]

【実施例】まずこの発明をＣＥＬＰ（符号駆動線形予測
符号化）の符号化部に適用した場合を示す。全体構成は
図９と同じであり、従来と同様にまず、適応符号帳を用
いて、前フレームの駆動音源ベクトルから、対象フレー
ムの（ピッチ周期に対応すると考えられる）基本周期検
索およびそれに基づいた駆動音源周期成分を作成する。
続いて、雑音符号帳の検索を行うが、この発明では雑音
符号帳の符号ベクトルを周期化する。つまり図１に示す
ように、雑音符号帳１７から１つの符号ベクトルを、基
本周期検索で得られた基本周期Ｌの長さ分３６を切り出
す。この切り出しは符号ベクトルの最初から後ろに向け
て長さＬ分切り出す方法と、最後から前に向けて長さＬ
分切り出す方法とがあるが、図には最初から切り出す方
法を示している。ａに示すように、その切り出し部分３
６をフレーム長に達するまで何度も繰り返し配列して、
周期性符号ベクトルを作成して出力符号ベクトルとす
る。それを雑音符号帳１７中のすべての符号ベクトルに
ついて行い、その中で、合成フィルタに通した再生音声
と原音声間の距離が最小になるものを、最適符号ベクト
ルとする。その後の各駆動音源成分の重みの決定は従来
の技術と同様に行う。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS First, the case where the present invention is applied to a coding unit of CELP (Code Driven Linear Predictive Coding) will be described. The overall configuration is the same as that of FIG. 9. As in the conventional case, first, using the adaptive codebook, a basic period search (which is considered to correspond to the pitch period) of the target frame from the driving excitation vector of the previous frame and a search based on it Create a driving sound source periodic component.
Subsequently, a search for the random codebook is performed. In the present invention, the code vector of the random codebook is made periodic. That is, as shown in FIG. 1, one code vector is cut out from the noise codebook 17 by a length 36 of the basic period L obtained by the basic period search. This extraction is performed by a method of extracting a length L from the beginning of the code vector to the rear, and a method of extracting the length L from the end to the front.
There is a method of cutting out the image, and the figure shows the method of cutting out from the beginning. As shown in FIG.
6 is repeatedly arranged until the frame length is reached,
A periodic code vector is created and used as an output code vector. This is performed for all the code vectors in the noise codebook 17, and among those, the one that minimizes the distance between the reproduced sound passed through the synthesis filter and the original sound is defined as the optimum code vector. Subsequent determination of the weight of each driving sound source component is performed in the same manner as in the related art.

【００１３】複数の雑音符号帳を使う場合、その雑音符
号帳のうちの一部の雑音符号帳を図１に示した方法によ
って周期性を持たせ、符号ベクトルを出力させ、残りの
雑音符号帳は非周期性のまゝ用いてもよい。この構成を
とった例を図２に示す。この場合は雑音符号帳１７は周
期性を付けて符号ベクトルを出力し、雑音符号帳１８は
非周期性のまゝ出力している。これによって、駆動音源
における周期成分と非周期成分との自由度比を、周期化
する雑音符号帳と周期化しない雑音符号帳との個数を適
当に振り分けることによって任意に設定でき、その比を
最適に近づけることが可能になる。When a plurality of random codebooks are used, some of the random codebooks are provided with periodicity by the method shown in FIG. 1, code vectors are output, and the remaining random codebooks are output. May be used as non-periodic. FIG. 2 shows an example of this configuration. In this case, the random codebook 17 outputs a code vector with periodicity, and the random codebook 18 outputs a non-periodic code vector. This makes it possible to arbitrarily set the degree of freedom between the periodic component and the non-periodic component in the driving excitation by appropriately allocating the number of noise codebooks to be periodicized and the number of noise codebooks not to be periodicized. Can be approached.

【００１４】さらに、ＣＥＬＰの符号化部において１個
の雑音符号帳内の一部の符号ベクトルのみを、上記方法
によって周期化させ、残りを非周期性のまゝにしてもよ
い。例えば図３に示すように、雑音符号帳１７中の符号
ベクトル１〜Ｎ_Sはそれぞれ周期性をもたせて出力し、
その他の符号ベクトルＮ_S＋１〜Ｎについては非周期性
のまゝ出力する。この構成によれば、あるフレーム用の
駆動音源信号として、周期化処理された符号ベクトル
と、周期化処理されていない符号ベクトルとのどちらが
適するかを、従来方法とまったく同じ符号帳検索法によ
って、符号ベクトル検索と同時に自動的に決定すること
ができる。つまり、周期成分と非周期成分との自由度比
を各フレームごとに変化させてそれを最適に近づけるこ
とが可能になる。Further, in the coding section of the CELP, only a part of the code vectors in one noise codebook may be made periodic by the above-described method, and the rest may be made non-periodic. For example, as shown in FIG. 3, reference numeral vectors 1 to N _S in the noise codebook 17 outputs each remembering periodicity,
The other code vectors N _S +1 to N are output as non-periodic. According to this configuration, as the drive excitation signal for a certain frame, which of the periodicized code vector and the non-periodicized code vector is suitable is determined by the same codebook search method as the conventional method, It can be automatically determined at the same time as the code vector search. In other words, it is possible to change the degree of freedom between the periodic component and the non-periodic component for each frame and make it closer to the optimum.

【００１５】また、図１および図２に示した周期性の付
与はＶＳＥＬＰ（ベクトル和駆動線形予測符号化）にお
ける雑音符号帳についても同様に適応可能である。次
に、ＶＳＥＬＰによる符号化手法にこの発明を適用した
例を示す。図４に示すように、Ｍ個の基本ベクトルのう
ちあらかじめ決めたものについては、前記方法で周期化
して出力し、その他は非周期性のまゝ出力し、これら周
期化された符号ベクトルと、非周期性の符号ベクトルと
についてそれぞれ＋１または−１が乗算された後、加算
されて出力符号ベクトルとされる。乗算部３３₁〜３３
_Mに対する符号の変更は従来と同様に行って出力符号ベ
クトルの最適化を行う。このようにしてある雑音符号帳
における基本ベクトルの一部のみに周期性をもたせ、残
りをそのまゝ非周期性にすることによって、周期性の基
本ベクトルと非周期性の基本ベクトルの本数比、つまり
駆動音源ベクトルの周期成分と非周期成分との自由度比
を任意に設定でき、比を最適に近づけることが可能にな
る。この比はあらかじめ設定する。The periodicity shown in FIGS. 1 and 2 can be similarly applied to a noise codebook in VSELP (Vector Sum Driven Linear Predictive Coding). Next, an example in which the present invention is applied to an encoding method based on VSELP will be described. As shown in FIG. 4, a predetermined one of the M basic vectors is output in a periodic manner by the above-described method, and the others are output in a non-periodic manner. Each of the non-periodic code vectors is multiplied by +1 or -1, and then added to obtain an output code vector. Multiplying section 33 _to 333
_The code change for _M is performed in the same manner as in the prior art, and the output code vector is optimized. In this way, only a part of the basic vectors in the random codebook are given a periodicity, and the rest are made aperiodic as they are, whereby the ratio of the number of the periodic basic vector to the aperiodic basic vector, That is, the degree of freedom between the periodic component and the aperiodic component of the driving sound source vector can be set arbitrarily, and the ratio can be made closer to the optimum. This ratio is set in advance.

【００１６】またこの方法によれば、最適符号ベクトル
検索の後、符号ベクトルの周期化成分（周期化を行った
基本ベクトルのみをＶＳＥＬＰ方式により重みつき加算
する）と非周期化成分（同じく周期化を行っていない基
本ベクトルのみをＶＳＥＬＰ方式により重みつき加算す
る）とを分離することが可能である。そこで例えば図５
に示すように最適符号ベクトル検索後の各駆動音源成分
の重み符号化において、出力される１つの符号ベクトル
内の周期成分と非周期成分とを別々の重みをつけること
ができる。基本ベクトル１〜Ｍ_Sについては周期性を与
え、これらに±１を乗算したものを加算部３７で加算
し、残りの基本ベクトルＭ_S＋１〜Ｍについては非周期
性のまゝ±を乗算したものを加算部３８で加算し、これ
ら加算部３７，３８の出力をそれぞれ乗算部２１₂′，
２１₂″で重みｇ₂′，ｇ₂″を乗算して加算部２２へ
供給するようにする。この場合、まず最適符号ベクトル
の決定を行い、その後、上記手法で符号ベクトル内の周
期成分と非周期成分とに分けて、それぞれに最適な重み
の検索を行う。このようにしてフレームごとに駆動音源
ベクトルの周期成分と非周期成分との成分比を変えて、
そのフレームに最適な値にすることが可能である。According to this method, after the search for the optimal code vector, the periodic component of the code vector (weighted addition of only the periodicized basic vector by the VSELP method) and the non-periodic component (also the periodic Is weighted and added only by the VSELP method). So, for example, FIG.
As shown in (1), in the weight coding of each drive excitation component after the search for the optimal code vector, the periodic component and the aperiodic component in one output code vector can be given different weights. Given periodicity basic vector 1 to M _S, are added in the addition portion 37 multiplied by the ± 1 to, for the remaining basis vectors M _S + 1 to M multiplied by the a orゝ± aperiodic Are added by an adder 38, and the outputs of the adders 37 and 38 are respectively multiplied by a multiplier 21 ₂ ′,
The weights g ₂ ′ and g ₂ ″ are multiplied by 21 ₂ ″ and supplied to the addition unit 22. In this case, the optimum code vector is determined first, and thereafter, the optimum weight is searched for for each of the periodic component and the non-periodic component in the code vector by the above-described method. In this way, by changing the component ratio between the periodic component and the aperiodic component of the driving sound source vector for each frame,
It is possible to set an optimum value for the frame.

【００１７】図６に示すように、例えば各４つの符号ベ
クトルからなる副雑音符号帳３９_1,３９₂からそれぞれ
１つ選択した各符号ベクトルをそれぞれ基本ベクトルと
し、その一部、この例では副雑音符号帳３９₁の出力を
周期化し、その他は非周期性のまゝとし、これらにそれ
ぞれ±１を乗算して、両基本ベクトルを加算して出力符
号ベクトルとしてもよい。As shown in FIG. 6, for example, each of four consisting of codevectors by-noise codebook 39 _1, 39 ₂ each one of the selected respective basis vectors each code vector, a portion, in this example the sub the output of the noise codebook 39 ₁ to periodic, others aperiodic orゝcity multiplies ± 1 each of these may be output code vector by adding the two basis vectors.

【００１８】この図６の構成において図７に示すよう
に、各副雑音符号帳３９₁，３９₂において一部の符号
ベクトルのみを周期化してもよい。図７では４個の符号
ベクトル中の各２個の符号ベクトルを周期化している。
上述では、符号化についてこの発明を説明したが、復号
化においても、符号化における雑音符号帳と同一とす
る。In the configuration of FIG. 6, as shown in FIG. 7, only a part of the code vectors in each of the sub-noise codebooks 39 ₁ and 39 ₂ may be made periodic. In FIG. 7, each of two code vectors in the four code vectors is periodic.
In the above description, the present invention has been described with respect to encoding, but it is assumed that decoding is the same as the random codebook in encoding.

【００１９】[0019]

【発明の効果】以上述べたように、この発明によれば、
駆動音源信号中の雑音符号ベクトルが周期化されるた
め、再生音声は滑らかなものとなる。その場合、駆動音
源信号の周期成分および非周期成分の比を任意に設定で
き、比を最適に近づけることができる。また、１個の雑
音符号帳の一部の符号ベクトルを周期化することによっ
て、この自由度比をフレームごとに変化させることがで
きる。さらに、周期・非周期それぞれの成分に対し、フ
レームごとに異なった重みをつけることができ、重み符
号帳の検索によってそのフレームに最適な重み比にする
ことが可能になる。As described above, according to the present invention,
Since the noise code vector in the driving sound source signal is made periodic, the reproduced sound becomes smooth. In that case, the ratio between the periodic component and the non-periodic component of the driving sound source signal can be set arbitrarily, and the ratio can be made closer to the optimum. In addition, by terminating some code vectors of one noise codebook, the degree of freedom can be changed for each frame. Furthermore, a different weight can be assigned to each component of the period and the non-period for each frame, and the weight ratio can be optimized by searching the weight codebook.

【００２０】４kbit/s程度の音声符号化の場合の音質改
善効果の一例を、図８に示す。図８Ａは周期化処理を行
ったＭ_S個の基本ベクトルのＶＳＥＬＰ形式雑音符号帳
と、周期化処理を行わない（１２−Ｍ_S）個の基本ベク
トルのＶＳＥＬＰ形式雑音符号帳とをそれぞれ１個ずつ
使った場合の、ＳＮＲおよびセグメンタルＳＮＲであ
る。また、図８Ｂは図４で、基本ベクトルの数Ｍを１２
とした１個の雑音符号帳を用い、その基本ベクトルの内
Ｍ_S個を周期化処理した場合のＳＮＲおよびセグメンタ
ルＳＮＲである。これらによれば、この発明は４kbit/s
程度の符号化で周期化処理を行わない従来方式（Ｍ_S＝
０）と比較して量子化雑音を１dB程度小さくすることが
でき、この発明によって合成音声品質を改善することが
できることがわかる。聴感から判断すると、Ｍ_S＝９か
１０程度が特によい。FIG. 8 shows an example of the sound quality improvement effect in the case of voice coding of about 4 kbit / s. FIG. 8A shows one VSELP-type noise codebook of M _S basic vectors subjected to the periodic processing and one VSELP-type noise codebook of (12−M _S ) basic vectors not subjected to the periodic processing. SNR and segmental SNR when each is used. FIG. 8B is FIG. 4 where the number M of basic vectors is 12
Using one of the noise codebook and an SNR and segmental SNR when treated inner M _S number of periods of the basic vectors. According to these, the present invention provides 4 kbit / s
The conventional method (M _S =
0), the quantization noise can be reduced by about 1 dB, and it can be seen that the present invention can improve the synthesized speech quality. Judging from hearing, it is particularly preferable that M _S = 9 or 10.

[Brief description of the drawings]

【図１】この発明の第１の実施例で、ＣＥＬＰ系雑音符
号帳における符号ベクトル周期化部を示す図。FIG. 1 is a diagram showing a code vector periodizing unit in a CELP noise codebook according to a first embodiment of the present invention.

【図２】この発明の第２の実施例で、複数の雑音符号帳
を用いたとき、一部の符号帳内のすべての符号ベクトル
に周期化処理をした場合の符号帳および符号帳検索部を
示す図。FIG. 2 is a diagram illustrating a codebook and a codebook search unit according to a second embodiment of the present invention when a plurality of noise codebooks are used and all code vectors in some codebooks are subjected to a periodic process; FIG.

【図３】この発明の第３の実施例で、１つの雑音符号帳
内の一部の符号ベクトルに周期化処理をした場合の符号
帳を示す図。FIG. 3 is a diagram showing a codebook in a case where a periodic process is performed on a part of code vectors in one noise codebook in the third embodiment of the present invention.

【図４】この発明の第４の実施例で、ＶＳＥＬＰ系雑音
符号帳における符号ベクトル周期化部を示す図。FIG. 4 is a diagram showing a code vector periodicization unit in a VSELP noise codebook according to a fourth embodiment of the present invention.

【図５】この発明の第５の実施例で、ＶＳＥＬＰ系雑音
符号帳の周期成分と非周期成分に別々に重みをつける場
合の、雑音符号帳、雑音符号帳検索部および駆動音源重
み検索部を示す図。FIG. 5 is a diagram illustrating a noise codebook, a noise codebook search unit, and a driving excitation weight search unit in a case where a periodic component and an aperiodic component of a VSELP noise codebook are separately weighted according to a fifth embodiment of the present invention; FIG.

【図６】この発明の第６の実施例で、主雑音符号帳の各
符号ベクトルを複数のＣＥＬＰ形式副雑音符号帳出力の
線形結合として構成したときに、複数の副雑音符号帳の
うち一部の符号帳内のすべての符号ベクトルを周期化し
た場合の例を示す図。FIG. 6 shows a sixth embodiment of the present invention, in which each code vector of the main noise codebook is configured as a linear combination of a plurality of CELP-format auxiliary noise codebook outputs, The figure which shows the example at the time of making all code vectors in the codebook of a part periodic.

【図７】この発明の第７の実施例で、主雑音符号帳の各
符号ベクトルを複数のＣＥＬＰ形式副雑音符号帳出力の
線形結合として構成したときに、それぞれの副雑音符号
帳の一部の符号ベクトルを周期化した場合の例を示す
図。FIG. 7 shows a seventh embodiment of the present invention, in which each code vector of the main noise codebook is configured as a linear combination of a plurality of CELP-format subnoise codebook outputs, and a part of each subnoise codebook. The figure which shows the example at the time of making the code vector of periodical.

【図８】この発明の効果を示すＳＮＲおよびセグメンタ
ルＳＮＲの図。FIG. 8 is a diagram of an SNR and a segmental SNR showing the effect of the present invention.

【図９】線形予測符号化装置の一般的構成を示すブロッ
ク図。FIG. 9 is a block diagram showing a general configuration of a linear prediction encoding device.

【図１０】ＣＥＬＰにおける雑音符号帳を示す図。FIG. 10 is a diagram showing a noise codebook in CELP.

【図１１】線形予測符号化の復号化装置の一般的構成を
示すブロック図。FIG. 11 is a block diagram showing a general configuration of a decoding device for linear predictive coding.

【図１２】ＶＳＥＬＰにおける雑音符号帳を示す図。FIG. 12 is a diagram showing a noise codebook in VSELP.

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開平２−84699（ＪＰ，Ａ) 特開昭63−37724（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 19/00 - 19/14 H03M 7/30 H04B 14/04 ──────────────────────────────────────────────────続き Continuation of the front page (56) References JP-A-2-84699 (JP, A) JP-A-63-37724 (JP, A) (58) Fields investigated (Int. Cl. ⁷ , DB name) G10L 19/00-19/14 H03M 7/30 H04B 14/04

Claims

(57) [Claims]

1. An output from an adaptive codebook for each frame.
Code vector and the output code vector from the random codebook
In a speech encoding method that uses a driving filter obtained by adding and driving a synthesis filter to reproduce speech, a pitch period is obtained for each frame, and a past driving sound source vector is cut based on the pitch period.
Output from the adaptive codebook as an output code vector, and the code vector of the noise codebook is cycled repeatedly for each of the pitch periods, so that distortion of the reproduced sound output from the synthesis filter with respect to the input sound is minimized. that searches over SL noise codebook output sign-vector, the noise code book has been encoded with the search indicating the pitch period
And a code indicating the code vector of the speech signal.

2. A method according to claim 1, wherein a plurality of random code books are stored on a frame basis .
Fetch the code vector, and fetch each code vector
Multiplied by a combination of +1 or -1
Force code vector and output code vector from adaptive codebook
In the speech coding method utilizing the driving of the synthesis filter with the drive vector obtained by weighting and adding the weights to reproduce the speech , a pitch cycle is obtained for each frame , and the past drive sound source vector is cut based on the pitch cycle.
Output from the adaptive codebook as an output code vector, and at least the code vector extracted from the noise codebook.
Is also partly cycled for each of the pitch periods, and is used as the input sound of the reproduced sound output from the synthesis filter.
The combination of +1 and -1 that minimizes the distortion
And the code indicating the combination and the pitch cycle
Speech encoder Goka method and outputting a code indicating.

3. The noise codebook according to claim 2,FromOf the above periodic
Linear between code vector and non-periodic code vector
Combine the above noise codebookOutput fromSignal vector
2. The method according to claim 1, whereinOrSpeech code described in 2Transformation
Law.

Wherein the said periodic been code not vector and periodic code vector from the noise code book by the weight ratio between constant heavy Mitsuki adds the output sign-vector of the noise codebook, the noise code On the book sign vector
Search for the code vector of the random codebook that minimizes the recording distortion ,
3. The speech encoding method according to claim 1, wherein said weight ratio is changed with respect to a code vector constituting said code vector having a minimum distortion to determine a weight ratio having a minimum distortion.

(5) Output code from adaptive codebook for each frame
Weighting the output vector from the random codebook
Drive the synthesis filter with the added drive vector
In a voice decoding method for reproducing voice, Past driving sound source vector based on the input pitch period
And output code vector from the above adaptive codebook
age, The code vector from the noise codebook is
The output code from the above random codebook
Signal vector A speech decoding method characterized by the above-mentioned.

6. The periodic code of the random codebook
Linear combination of vector and non-periodic code vector
Output from the random codebook
The speech decoding method according to claim 5, wherein