JP3088204B2

JP3088204B2 - Code-excited linear prediction encoding device and decoding device

Info

Publication number: JP3088204B2
Application number: JP04278455A
Authority: JP
Inventors: 浩桂川; 弘美青柳; 義博有山; 克俊伊東
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1992-10-16
Filing date: 1992-10-16
Publication date: 2000-09-18
Anticipated expiration: 2015-09-18
Also published as: JPH06130996A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、コード励振線形予測符
号化方式（ＣＥＬＰ符号化方式）に従うコード励振線形
予測符号化装置及び復号化装置に適用して好適なもので
ある。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention is suitable for application to a code-excited linear predictive coding apparatus and a decoding apparatus which comply with a code-excited linear predictive coding method (CELP coding method).

【０００２】[0002]

【従来の技術】８ｋビット／秒以下の圧縮率での、音声
信号の高能率符号化方式としては、Ａｔａｌ等によって
提案されたＣＥＬＰ符号化方式が有効な手法である。2. Description of the Related Art The CELP coding method proposed by Atal et al. Is an effective method as a high-efficiency coding method for audio signals at a compression rate of 8 kbit / s or less.

【０００３】これは、音声信号を固定の長さのフレーム
毎に符号化するフレーム符号化方式に属しており、フレ
ームに１回求められる声道のパラメータと、フレームを
いくつかに分解したサブフレーム毎に求められる励振源
のパラメータによって表現するものである。入力音声か
ら声道パラメータを得るフォワード型のＣＥＬＰ符号化
方式においては、励振源のパラメータとして、無声音の
符号化のための統計コードブックと、有声音の符号化の
ための適応コードブックの２つによってベクトル量子化
するものが多く、例えば、下記文献に記載されている。
音声信号は、定常的は有声音の区間と、過渡的な無声音
の区間に分けて考えることができ、両者はその統計的な
性質が大きく異なっており、そのため、上述のように、
統計コードブックと適応コードブックとが別個に用意さ
れている。[0003] This belongs to a frame encoding method for encoding a speech signal for each frame of a fixed length, and includes a vocal tract parameter required once for each frame and a subframe obtained by decomposing the frame into several subframes. It is expressed by the parameters of the excitation source obtained for each. In a forward-type CELP coding method for obtaining vocal tract parameters from input speech, two parameters are used as excitation source parameters: a statistical codebook for coding unvoiced sounds and an adaptive codebook for coding voiced sounds. In many cases, vector quantization is performed, for example, as described in the following document.
A speech signal can be considered as being divided into a stationary voiced section and a transient unvoiced section, and both have significantly different statistical properties. Therefore, as described above,
A statistical codebook and an adaptive codebook are separately prepared.

【０００４】文献：N.S.Jayant & J.H.Chen,"Speech Co
ding with Time-Varying BitAllocations to Excitatio
n and LPC parameters",Proc,ICASSP-89,(1989) 図２は適応コードブックによる符号化の概念を説明する
ためのものである。周期性（自己相関性）を有する有声
音区間の音質に対する寄与が大きい適応コードブックに
は、現時点で対象となっているサブフレームの直前過去
の長時間Ｔの励振源波形Ｓ１が例えばシフトレジスタに
よって格納されている。この長時間の励振源波形Ｓ１に
おいて、サブフレーム期間を有する部分的波形（統計励
振コードベクトル）Ｓ11〜Ｓ1xを考える。現時点で対象
となっているサブフレームの励振源波形として、いずれ
の部分的波形Ｓ11、Ｓ12、…、Ｓ1xを用いると、入力音
声と局部再生音声とが最も近似するかを探索し、探索さ
れた部分的波形Ｓ1nを特定する情報（例えば長時間Ｔの
最初のサンプルからの部分的波形Ｓ1nの最初のサンプル
までのサンプル数をビット表現したもの）を適応コード
ブックからの現時点の処理対象サブフレームについての
出力符号とする。ここで、出力符号は、結局、現時点の
処理対象サブフレームとほぼ同様な波形形状を有する過
去の時点との時間差（ラグ：ピッチ周期）である。Reference: NSJayant & JHChen, "Speech Co
ding with Time-Varying BitAllocations to Excitatio
n and LPC parameters ", Proc, ICASSP-89, (1989) Fig. 2 illustrates the concept of coding using an adaptive codebook. The sound quality of a voiced section having periodicity (autocorrelation) is shown. In the adaptive codebook having a large contribution, an excitation source waveform S1 for a long time T immediately before the current subframe is stored, for example, by a shift register. Consider partial waveforms (statistical excitation code vectors) S11 to S1x having a sub-frame period.If any of partial waveforms S11, S12,... , Searching for whether the input sound and the locally reproduced sound are the closest, and specifying the searched partial waveform S1n (eg, the first partial waveform S1n from the first sample of the long time T). The number of samples up to the sample is expressed in bits) as an output code for the current processing target subframe from the adaptive codebook, where the output code is a waveform substantially similar to the current processing target subframe. It is a time difference (lag: pitch cycle) from a past time point having a shape.

【０００５】[0005]

【発明が解決しようとする課題】ところで、適応コード
ブックが大きく寄与する有声音は定常性が高いため、ラ
グ（ピッチ周期）の変動速度は遅く相前後するサブフレ
ーム間のラグは近似したものとなる。Since the voiced sound to which the adaptive codebook greatly contributes has a high stationarity, the lag (pitch cycle) fluctuates slowly and the lag between adjacent subframes is similar. Become.

【０００６】しかしながら、従来では、このようなラグ
の性質を利用しておらず、サブフレーム毎に独立にラグ
を求めていた。すなわち、上述のように、サブフレーム
毎に、その処理時点での多くの部分的波形Ｓ11、Ｓ12、
…、Ｓ1xから最適なものを探索してラグを決定してい
た。そのため、ラグを得るために多くの処理及び時間が
かかっていた。However, conventionally, such a property of the lag is not used, and the lag is independently obtained for each subframe. That is, as described above, for each subframe, many partial waveforms S11, S12,
... The optimum lag was determined from S1x to determine the lag. Therefore, much processing and time were required to obtain a lag.

【０００７】また、上述のように、相前後するサブフレ
ーム間のラグは近似しているという性質を利用せずに、
サブフレーム毎に得られたラグを送信しているため、伝
送情報に冗長な情報が入り込んでいるということができ
る。ＣＥＬＰ符号化方式は、低ビットレートの音声符号
化として採用されることが多く、伝送情報の各ビットを
有効に利用することが求められ、この点、ラグに対する
符号化は不十分ということができる。Further, as described above, without utilizing the property that the lag between successive subframes is similar,
Since the lag obtained for each subframe is transmitted, it can be said that redundant information is included in the transmission information. The CELP coding method is often adopted as low bit rate voice coding, and it is required to effectively use each bit of transmission information. In this regard, coding for lag can be considered to be insufficient. .

【０００８】本発明は、以上の点を考慮してなされたも
のであり、有声音に寄与する励振源情報の符号を決定す
る処理及び時間を従来より削減することができるコード
励振線形予測符号化器を提供しようとしたものである。SUMMARY OF THE INVENTION The present invention has been made in view of the above points, and has a code excitation linear predictive coding which can reduce the processing and time required to determine the sign of excitation source information contributing to voiced sound. It was intended to provide a container.

【０００９】また、本発明は、有声音に寄与する励振源
情報の符号に必要なビット数を従来より削減することが
できるコード励振線形予測符号化器及び復号化器を提供
しようとしたものである。Another object of the present invention is to provide a code excitation linear prediction encoder and a decoder capable of reducing the number of bits required for encoding excitation source information contributing to voiced sound as compared with the prior art. is there.

【００１０】[0010]

【課題を解決するための手段】かかる課題を解決するた
め、本発明においては、適応コードブックを備えたコー
ド励振線形予測符号化装置において、適応コードブック
を利用した励振源情報の符号化周期の整数倍の周期であ
る長周期毎に入力音声信号のピッチ周期を求める長周期
分析器を備え、適応コードブックに格納されている複数
の適応励振コードベクトルから長周期中の各符号化周期
それぞれについて最適な適応励振コードベクトルを探索
する際、各符号化周期についての最適な適応励振コード
ベクトルを、各符号化周期に対してピッチ周期だけずれ
た時間位置の適応励振コードベクトルを中心とした前後
所定数の適応励振コードベクトルの中から探索するよう
にした。According to the present invention, there is provided a code excitation linear predictive coding apparatus having an adaptive codebook, comprising: a coding period of excitation source information using an adaptive codebook; integer multiple of the period der
Includes a long-period analyzer for determining the pitch period of the input speech signal for each long period that, each coding cycle in the long period of a plurality of adaptive excitation code vectors stored in the adaptive codebook
When searching for the optimal adaptive excitation code vector for each, the optimal adaptive excitation code for each coding cycle
Shift vector by pitch period for each encoding period
The search is performed from a predetermined number of adaptive excitation code vectors before and after the adaptive excitation code vector at the time position .

【００１１】ここで、コード励振線形予測復号化装置に
向けて伝送する符号化音声に、長周期毎に、ピッチ周期
と、長周期中の全ての各符号化周期に対応する、符号化
周期についての探索の結果得られた最適な適応励振コー
ドベクトルとピッチ周期だけずれた時間位置の適応励振
コードベクトルとの各時間差情報とを含めることが好ま
しい。Here, the coded speech transmitted to the code-excitation linear predictive decoding apparatus has a pitch cycle corresponding to every long cycle, and a pitch cycle corresponding to every coding cycle in the long cycle. Coding
It is preferable to include each time difference information between the optimal adaptive excitation code vector obtained as a result of the search for the period and the adaptive excitation code vector at a time position shifted by the pitch period .

【００１２】このような好ましい態様のコード励振線形
予測符号化装置に対応した復号化装置においては、符号
化音声に含まれているピッチ周期と長周期中の各符号化
周期に対応する各時間差情報とを加算して得た各値を、
それぞれ適応コードブックに対するインデックスとして
各符号化周期に対応する適応励振コードベクトルを出力
させることが好ましい。[0012] In the decoding apparatus corresponding to the code excitation linear predictive encoding apparatus of such a preferred embodiment , each of the encoding in the pitch period and the long period included in the encoded speech is performed .
Each value obtained by adding each time difference information corresponding to the cycle ,
Each as an index into the adaptive codebook
It is preferable to output an adaptive excitation code vector corresponding to each encoding cycle .

【００１３】[0013]

【作用】本発明は、有声音区間のピッチ周期を、励振源
情報の符号化周期の整数倍の周期である長周期で求めて
も、符号化周期で求めてもほとんど同じ値をとることに
着目してなされたものである。According to the present invention, the pitch period of a voiced sound section is
Obtained in a long cycle that is an integral multiple of the information encoding cycle
Are also based on the fact that they take almost the same value even when they are obtained in the coding cycle .

【００１４】そして、この性質を利用することで適応コ
ードブックの探索時間及び処理を軽減させることを意図
した。本発明のコード励振線形予測符号化装置におい
て、長周期分析器は、適応コードブックを利用した励振
源情報の符号化周期（探索周期）の整数倍の周期である
長周期毎に、入力音声信号のピッチ周期を求め、適応コ
ードブックについての符号を決定する構成は、適応コー
ドブックに格納されている複数の適応励振コードベクト
ルから長周期中の各符号化周期それぞれについて最適な
適応励振コードベクトルを探索する際、各符号化周期に
ついての最適な適応励振コードベクトルを、各符号化周
期に対してピッチ周期だけずれた時間位置の適応励振コ
ードベクトルを中心とした前後所定数の適応励振コード
ベクトルの中から探索するようにした。By using this property, it is intended to reduce the search time and processing of the adaptive codebook. In the code excitation linear prediction encoding device according to the present invention, the long-period analyzer has a period that is an integral multiple of the encoding period (search period) of the excitation source information using the adaptive codebook.
For each long period , the pitch period of the input speech signal is determined, and the code for the adaptive codebook is determined by using a plurality of adaptive excitation code vectors stored in the adaptive codebook from each of the encoding periods in the long period. When searching for the optimal adaptive excitation code vector for
The optimal adaptive excitation code vector for each encoding cycle.
The search is made from a predetermined number of adaptive excitation code vectors before and after the adaptive excitation code vector at the time position shifted by the pitch period from the period .

【００１５】ここで、コード励振線形予測復号化装置に
向けて伝送する符号化音声に、長周期毎に、ピッチ周期
と、長周期中の全ての各符号化周期に対応する、符号化
周期についての探索の結果得られた最適な適応励振コー
ドベクトルとピッチ周期だけずれた時間位置の適応励振
コードベクトルとの各時間差情報とを含めるようにする
と、従来に比べ、圧縮効率を高めることができて好まし
い。Here, the coded speech transmitted to the code-excitation linear predictive decoding apparatus has a pitch cycle corresponding to every long cycle, and a pitch cycle corresponding to every coding cycle in the long cycle. Coding
By including each time difference information between the optimal adaptive excitation code vector obtained as a result of the search for the period and the adaptive excitation code vector at the time position shifted by the pitch period , the compression efficiency can be increased compared to the conventional case. It is possible and preferable.

【００１６】このような好ましい態様のコード励振線形
予測符号化装置に対応した復号化装置においては、適応
コードブックを直接アクセスできる情報が伝送されてこ
ないので、符号化音声に含まれているピッチ周期と長周
期中の各符号化周期に対応する各時間差情報とを加算し
て得た各値を、それぞれ適応コードブックに対するイン
デックスとして各符号化周期に対応する適応励振コード
ベクトルをアクセスすることになる。[0016] In the decoding apparatus corresponding to the code-excitation linear predictive coding apparatus of such a preferable embodiment, since information for directly accessing the adaptive codebook is not transmitted, the pitch period included in the coded speech is not determined. Long circumference
Each value obtained by adding the respective time difference information corresponding to each coding cycle during the period will access the adaptive excitation code vector corresponding to each coding cycle as an index to each adaptive codebook.

【００１７】[0017]

【実施例】以下、本発明の実施例によるコード励振線形
予測符号化装置及び復号化装置を、図面を参照しながら
この順序で詳述する。BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a block diagram showing a linear excitation coding apparatus and a decoding apparatus according to an embodiment of the present invention;

【００１８】図１は、実施例のコード励振線形予測符号
化装置の構成を示すものである。FIG. 1 shows the configuration of a code excitation linear prediction encoding apparatus according to an embodiment.

【００１９】図１において、アナログ／デジタル変換さ
れた入力音声信号系列は、特定のフレーム長単位で当該
符号化装置に入力される。In FIG. 1, an input audio signal sequence that has been subjected to analog / digital conversion is input to the encoding apparatus in units of a specific frame length.

【００２０】声道分析器１１は、入力音声信号を声道分
析（例えばＬＰＣ分析）し、分析で得た声道パラメータ
αを、長周期分析器１２、励振源評価回路１３及び多重
化器１７に与える。ここで、声道パラメータαはフレー
ム毎に得られてフレーム毎に出力される。The vocal tract analyzer 11 performs vocal tract analysis (for example, LPC analysis) of the input speech signal, and converts the vocal tract parameter α obtained by the analysis into a long-period analyzer 12, an excitation source evaluation circuit 13, and a multiplexer 17 Give to. Here, the vocal tract parameter α is obtained for each frame and output for each frame.

【００２１】長周期分析器１２は、入力音声信号の長周
期の相関を計算し、入力音声信号のフレーム全体につい
てのピッチ周期情報であるラグ（以下、フレームラグと
呼ぶ）Ｌを求め、励振源評価回路１３及び多重化器１７
に与える。このようなフレームラグＬも各フレームで１
回ずつ求められる。なお、この長周期分析器１２には、
短周期パラメータである音道パラメータαも与えられて
おり、この音道パラメータαを利用して入力音声信号に
おける短周期成分の影響を除去して長周期の相関を検出
するようになされている。The long-period analyzer 12 calculates a long-period correlation of the input voice signal, obtains a lag (hereinafter, referred to as a frame lag) L which is pitch period information for the entire frame of the input voice signal, and Evaluation circuit 13 and multiplexer 17
Give to. Such a frame lug L is also 1 in each frame.
It is asked every time. The long-period analyzer 12 includes:
A sound path parameter α, which is a short-period parameter, is also given. Using this sound path parameter α, the influence of a short-period component in the input speech signal is removed to detect a long-period correlation.

【００２２】なお、長周期分析器１２によって求められ
るフレームラグＬの表記方法や検出精度については後述
する。The notation method and detection accuracy of the frame lag L obtained by the long period analyzer 12 will be described later.

【００２３】適応コードブック１４は、有声音区間に大
きく寄与する複数の適応励振コードベクトルを格納して
いるものであり、例えば、シフトレジスタやテーブル等
によって構成されている。後述するように、最適な適応
励振コードベクトルは、励振源評価回路１３による制御
下で１フレーム期間をいくつかに等分したサブフレーム
毎に求められる。The adaptive code book 14 stores a plurality of adaptive excitation code vectors that greatly contribute to a voiced sound section, and includes, for example, a shift register and a table. As will be described later, the optimum adaptive excitation code vector is obtained for each sub-frame obtained by equally dividing one frame period under the control of the excitation source evaluation circuit 13.

【００２４】統計コードブック１５は、無声音区間に大
きく寄与する複数の統計励振コードベクトルを格納して
いるものである。後述するように、最適な統計励振コー
ドベクトルも、励振源評価回路１３による制御下でサブ
フレーム毎に求められる。The statistical code book 15 stores a plurality of statistical excitation code vectors that greatly contribute to an unvoiced sound section. As will be described later, the optimum statistical excitation code vector is also obtained for each subframe under the control of the excitation source evaluation circuit 13.

【００２５】ゲインコードブック１６は、適応励振コー
ドベクトルと統計励振コードベクトルのそれぞれに対す
る複数のゲイン情報を格納している。後述するように、
最適な適応励振コードベクトル及び統計励振コードベク
トルに対する最適なゲイン（β，γ）も、励振源評価回
路１３による制御下でサブフレーム毎に求められる。The gain codebook 16 stores a plurality of pieces of gain information for each of the adaptive excitation code vector and the statistical excitation code vector. As described below,
The optimal gain (β, γ) for the optimal adaptive excitation code vector and the statistical excitation code vector is also obtained for each subframe under the control of the excitation source evaluation circuit 13.

【００２６】励振源評価回路１３は、フレームラグＬを
除いた励振源情報ΔＬ、Ｉ、（β，γ）をサブフレーム
毎に決定させるものである。励振源評価回路１３は、適
応コードブック１４に係る励振源情報（以下、サブフレ
ーム補正ラグと呼ぶ）ΔＬ、統計コードブック１５に係
る励振源情報（以下、統計コードインデックスと呼ぶ）
Ｉ、ゲインコードブック１６に係る励振源情報（以下、
最適度ゲインと呼ぶ）（β，γ）の順に決定動作を行な
うものである。The excitation source evaluation circuit 13 determines excitation source information ΔL, I, (β, γ) excluding the frame lag L for each subframe. The excitation source evaluation circuit 13 includes excitation source information (hereinafter, referred to as a subframe correction lag) ΔL relating to the adaptive codebook 14 and excitation source information (hereinafter, referred to as a statistical code index) relating to the statistical codebook 15.
I, excitation source information related to the gain codebook 16 (hereinafter, referred to as
The determination operation is performed in the order of (β, γ).

【００２７】励振源評価回路１３は、励振コードベクト
ル及び声道パラメータαから局部再生の音声を得る構成
や、その局部再生の音声と入力音声との類似度を求める
構成を備えており（例えば、特願平３−３２７４４３号
明細書及び図面等を参照）、類似度が最も大きくなるよ
うな情報を、適応コードブック１４、統計コードブック
１５、ゲインコードブック１６等について探索する。The excitation source evaluation circuit 13 has a configuration for obtaining a local reproduction voice from the excitation code vector and the vocal tract parameter α, and a configuration for calculating the similarity between the local reproduction voice and the input voice (for example, (See Japanese Patent Application No. 3-327443, drawings, etc.), and search for information that maximizes the similarity in the adaptive codebook 14, statistical codebook 15, gain codebook 16, and the like.

【００２８】多重化器１７は、フレームに１回与えられ
る声道パラメータα及びフレームラグＬ、サブフレーム
に１回与えられるサブフレーム補正ラグΔＬ、統計コー
ドインデックスＩ及び最適度ゲイン（β，γ）を適宜量
子化しながら多重化して符号化音声信号を得る。The multiplexor 17 includes a vocal tract parameter α and a frame lag L given once to a frame, a subframe correction lag ΔL given once to a subframe, a statistical code index I and an optimum gain (β, γ). Are multiplexed while appropriately quantizing to obtain an encoded audio signal.

【００２９】以上の構成を有する実施例のコード励振線
形予測符号化装置は、以下のように動作する。The code-excited linear prediction encoding apparatus according to the embodiment having the above-described configuration operates as follows.

【００３０】１フレーム分の入力音声信号が与えられる
と、音声分析器１１及び長周期分析器１２はそれぞれ、
分析処理して声道パラメータα及びフレームラグＬを求
めて出力する。When an input speech signal for one frame is given, the speech analyzer 11 and the long-period analyzer 12 respectively
The vocal tract parameter α and the frame lag L are obtained and analyzed and output.

【００３１】励振源評価回路１３は、サブフレーム毎に
励振源情報を決定する。まず、励振源評価回路１３は、
統計コードブック１５及びゲインコードブック１６から
の出力を停止させた状態で（又はデフォルト値を出力さ
せた状態で）、適応コードブック１４から最適な適応励
振コードベクトルの候補としての複数の適応励振コード
ベクトルを、例えば時間順次に又は並列に出力させ、各
候補の適応励振コードベクトルを利用してそれぞれ局部
再生の音声を得て評価し、入力音声信号に対する類似度
が最も大きい局部再生音声を合成させた適応励振コード
ベクトルを最適な適応励振コードベクトルとする。The excitation source evaluation circuit 13 determines excitation source information for each subframe. First, the excitation source evaluation circuit 13
With the output from the statistical codebook 15 and the gain codebook 16 stopped (or with the default value output), a plurality of adaptive excitation codes from the adaptive codebook 14 as candidates for the optimal adaptive excitation code vector. Vectors are output, for example, time-sequentially or in parallel, the respective local reproduction sounds are obtained and evaluated using the adaptive excitation code vector of each candidate, and the local reproduction sound having the highest similarity to the input sound signal is synthesized. The obtained adaptive excitation code vector is set as an optimal adaptive excitation code vector.

【００３２】ここで、励振源評価回路１３は、今回のサ
ブフレームに対してフレームラグＬだけずれた時間位置
の適応励振コードベクトルを中心とした前後所定数の適
応励振コードベクトルを候補とする。すなわち、従来と
は異なって、適応コードブック１４に格納されている全
ての適応励振コードベクトルを候補とすることはない。
なお、候補ベクトルの決定方法については後述する。Here, the excitation source evaluating circuit 13 sets a predetermined number of adaptive excitation code vectors before and after the adaptive excitation code vector at the time position shifted from the current subframe by the frame lag L as candidates. That is, unlike the related art, all adaptive excitation code vectors stored in the adaptive codebook 14 are not set as candidates.
The method for determining the candidate vector will be described later.

【００３３】励振源評価回路１３は、最適な適応励振コ
ードベクトルを決定すると、今回のサブフレームに対し
てフレームラグＬだけずれた時間位置の適応励振コード
ベクトルと、最適な適応励振コードベクトルとの時間差
情報を得てサブフレーム補正ラグΔＬとして多重化器１
７に出力する。Upon determining the optimal adaptive excitation code vector, the excitation source evaluation circuit 13 determines the optimal adaptive excitation code vector at the time position shifted by the frame lag L from the current subframe and the optimal adaptive excitation code vector. Multiplexer 1 obtains time difference information and sets it as subframe correction lag ΔL
7 is output.

【００３４】次に、励振源評価回路１３は、ゲインコー
ドブック１６からの出力を停止させた状態で（又はデフ
ォルト値を出力させた状態で）、統計コードブック１５
から最適な統計励振コードベクトルの候補として全ての
統計励振コードベクトルを、例えば時間順次に又は並列
に出力させ、各候補の統計励振コードベクトルと最適な
適応励振コードベクトルとを加算し、加算により得られ
た各励振コードベクトルを利用してそれぞれ局部再生の
音声を得て評価し、入力音声信号に対する類似度が最も
大きい局部再生音声を合成させた統計励振コードベクト
ルを最適な統計励振コードベクトルとする。そして、最
適な統計励振コードベクトルに対するインデックス（統
計コードインデックス）Ｉを多重化器１７に出力する。Next, the excitation source evaluation circuit 13 stops the output from the gain codebook 16 (or outputs the default value), and outputs the statistical codebook 15.
All statistical excitation code vectors are output as candidates for the optimal statistical excitation code vector from, for example, time-sequentially or in parallel, and the statistical excitation code vector of each candidate and the optimal adaptive excitation code vector are added and obtained by addition. Using the obtained excitation code vectors, a local reproduction audio is obtained and evaluated, and a statistical excitation code vector obtained by synthesizing a local reproduction audio having the highest similarity to the input audio signal is set as an optimal statistical excitation code vector. . Then, an index (statistical code index) I for the optimal statistical excitation code vector is output to the multiplexer 17.

【００３５】続いて、励振源評価回路１３は、ゲインコ
ードブック１６から最適度ゲインの候補として全てのゲ
イン情報を、例えば時間順次に又は並列に出力させ、各
候補のゲイン情報によって最適な統計励振コードベクト
ルと最適な適応励振コードベクトルとを重付け加算し、
加算により得られた各励振コードベクトルを利用してそ
れぞれ局部再生の音声を得て評価し、入力音声信号に対
する類似度が最も大きい局部再生音声を合成させたゲイ
ン情報を最適度ゲインに決定する。この実施例の場合に
は、最適度ゲイン（β，γ）そのものを多重化器１７に
与える。Subsequently, the excitation source evaluation circuit 13 causes the gain codebook 16 to output all the gain information as candidates for the optimality gain, for example, in time sequence or in parallel, and optimizes the statistical excitation based on the gain information of each candidate. Weighted addition of the code vector and the optimal adaptive excitation code vector,
Using each excitation code vector obtained by the addition, a locally reproduced sound is obtained and evaluated, and gain information obtained by synthesizing the locally reproduced sound having the highest similarity to the input sound signal is determined as the optimum gain. In the case of this embodiment, the optimum gain (β, γ) itself is given to the multiplexer 17.

【００３６】以上のようにして、今回のサブフレームに
ついての最適な適応励振コードベクトル、統計励振コー
ドベクトル及びゲインを得ると、これら情報から、今回
のサブフレームについての最終的な励振コードベクトル
（声道パラメータαと合成処理される励振コードベクト
ル）を得て適応コードブック１４に与える。適応コード
ブック１４は、この励振コードベクトルを最新のものと
して取込むと共に、最も古い方の適応励振コードベクト
ルを破棄させる。As described above, when the optimal adaptive excitation code vector, statistical excitation code vector, and gain for the current subframe are obtained, the final excitation code vector (voice voice) for the current subframe is obtained from these information. An excitation code vector (combined with the path parameter α) is obtained and given to the adaptive codebook 14. The adaptive codebook 14 takes this excitation code vector as the latest one, and discards the oldest adaptive excitation code vector.

【００３７】このような適応処理が終了すると、次のサ
ブフレームの処理に進む。When such adaptive processing is completed, the process proceeds to the processing of the next subframe.

【００３８】そして、多重化器１７によって、フレーム
に１回与えられる声道パラメータα及びフレームラグ
Ｌ、サブフレームに１回与えられるサブフレーム補正ラ
グΔＬ、統計コードインデックスＩ及び最適度ゲイン
（β，γ）から符号化音声信号が組み立てられて通信路
に出力される。Then, the multiplexer 17 gives the vocal tract parameter α and the frame lag L given once to the frame, the subframe correction lag ΔL given once to the subframe, the statistical code index I, and the optimality gain (β, The encoded audio signal is assembled from γ) and output to the communication path.

【００３９】図３は、長周期分析器１２によって求めら
れるフレームラグＬの表記方法（検出精度）についての
説明図である。図３は、フレームラグＬの表記方法や検
出精度を３種類示しており、実際上、いずれを採用する
こともできる。FIG. 3 is an explanatory diagram of the notation method (detection accuracy) of the frame lag L obtained by the long period analyzer 12. FIG. 3 shows three types of notation method and detection accuracy of the frame lag L, and any of them can be used in practice.

【００４０】音声のピッチ周波数が５０Ｈｚから４００
Ｈｚ程度なので、例えば、入力音声信号が８０００Ｈｚ
でアナログ／デジタル変換されている場合には、フレー
ムラグＬの値としては、２０（＝８０００÷４００）サ
ンプリング周期から１５０（８０００÷５０に近い値）
サンプリング周期を取り得る値の範囲とすることが適当
である。この実施例の場合、フレームラグＬも当然にデ
ジタル値として処理されるのでビット表現される。The pitch frequency of voice is from 50 Hz to 400
Hz, so for example, if the input audio signal is 8000 Hz
When the analog / digital conversion is performed, the value of the frame lag L is set to 150 (a value close to 8000/50) from the sampling cycle of 20 (= 8000/400).
It is appropriate to set the sampling period to a range of values that can be taken. In the case of this embodiment, since the frame lag L is naturally processed as a digital value, it is expressed in bits.

【００４１】図３（ａ）は、フレームラグＬの第１の表
記方法を示している。ビット表現を考慮して、フレーム
ラグＬの値の範囲を２０サンプリング周期から１４７サ
ンプリング周期とし、１サンプリング周期ずつ異なる全
ての値（１２８個の値）を取るものとする。従って、こ
の場合にはフレームラグＬは７ビットで表現される。FIG. 3A shows a first notation method of the frame lag L. In consideration of the bit representation, the range of the value of the frame lag L is set to 20 sampling periods to 147 sampling periods, and all values (128 values) different by one sampling period are taken. Therefore, in this case, the frame lag L is represented by 7 bits.

【００４２】図３（ｂ）は、フレームラグＬの第２の表
記方法を示している。ビット表現を考慮して、フレーム
ラグＬの値の範囲を２０サンプリング周期から１４６サ
ンプリング周期とし、２サンプリング周期ずつ異なる全
ての値（６４個の値）を取るものとする。従って、この
場合にはフレームラグＬは６ビットで表現される。フレ
ームラグＬは、適応コードブック１４における最適な適
応コードベクトルが存在する時間位置の目安を与えるも
のであるので、このように２サンプリング周期ずつ異な
るように表記しても（精度を落としても）問題となるこ
とはない。FIG. 3B shows a second notation method of the frame lag L. Considering the bit representation, the range of the value of the frame lag L is set to 20 sampling periods to 146 sampling periods, and all values (64 values) differing by two sampling periods are assumed. Therefore, in this case, the frame lag L is represented by 6 bits. Since the frame lag L gives an indication of the time position where the optimal adaptive code vector exists in the adaptive code book 14, even if the frame lag L is described so as to differ by two sampling periods (even if the precision is lowered). There is no problem.

【００４３】図３（ｃ）は、フレームラグＬの第３の表
記方法を示している。この方法は、フレームラグＬの最
小値を２０サンプリング周期とし、高いピッチ周波数に
対応する部分では１サンプリング周期ずつの間隔で隣合
うフレームラグＬの値を定め、ピッチ周波数が低い値に
対応する部分になるに従い広い間隔で隣合うフレームラ
グＬの値を定めるようにしたものである。このようにし
た場合には、少ないビット数でフレームラグＬを表現す
る場合に効率良く表現することができる。FIG. 3C shows a third notation method of the frame lag L. In this method, the minimum value of the frame lag L is set to 20 sampling periods, the value of the adjacent frame lag L is determined at intervals of one sampling period in the portion corresponding to the high pitch frequency, and the portion corresponding to the low pitch frequency is determined. , The values of the adjacent frame lags L are determined at wider intervals. In this case, when the frame lag L is expressed with a small number of bits, it can be efficiently expressed.

【００４４】図４は、最適な適応励振コードベクトルの
探索範囲及びサブフレーム補正ラグΔＬの表現方法を説
明するものであり、４種類の方法を示している。FIG. 4 explains the search range of the optimal adaptive excitation code vector and the method of expressing the subframe correction lag ΔL, and shows four types of methods.

【００４５】図４（ａ）は、第１の方法の説明図であ
り、フレームラグＬに対応した適応励振コードベクトル
の先頭サンプルから８サンプリング周期だけ前のサンプ
ルを先頭サンプルとする適応励振コードベクトルから、
フレームラグＬに対応した適応励振コードベクトルの先
頭サンプルから７サンプリング周期だけ後のサンプルを
先頭サンプルとする適応励振コードベクトルまでの計１
６個の適応励振コードベクトルを候補ベクトルとするこ
とを示している。従って、サブフレーム補正ラグΔＬを
４ビットで表現することができる。[0045] FIG. 4 (a) is an explanatory view of the first method, the adaptive excitation code the samples prior to only 8 sampling periods from the beginning samples of the adaptive excitation code vector corresponding to full Remuragu L and the top sample vector From
A total of 1 from the first sample of the adaptive excitation code vector corresponding to the frame lag L to the adaptive excitation code vector whose first sample is a sample after 7 sampling periods.
This shows that six adaptive excitation code vectors are used as candidate vectors. Therefore, the subframe correction lag ΔL can be expressed by 4 bits.

【００４６】図４（ｂ）は、第２の方法の説明図であ
り、フレームラグＬに対応した適応励振コードベクトル
の先頭サンプルから８サンプリング周期だけ前のサンプ
ルを先頭サンプルとする適応励振コードベクトルから、
フレームラグＬに対応した適応励振コードベクトルの先
頭サンプルから７．５サンプリング周期だけ後のサンプ
ルを先頭サンプルとする適応励振コードベクトルまで
の、適応励振コードベクトルであって相前後する適応励
振コードベクトルの時間差が０．５サンプリング周期で
ある計３２個の適応励振コードベクトルを候補ベクトル
とすることを示している。従って、サブフレーム補正ラ
グΔＬを５ビットで表現することができる。これは、ア
ップサンプリングフィルターを適応コードブック１４に
用いた場合の方法である。FIG. 4B is an explanatory diagram of the second method, in which the adaptive excitation code vector whose first sample is a sample eight sampling periods before the first sample of the adaptive excitation code vector corresponding to the frame lag L. From
The adaptive excitation code vector, which is 7.5 seconds after the first sample of the adaptive excitation code vector corresponding to the frame lag L to the adaptive excitation code vector having the first sample, This shows that a total of 32 adaptive excitation code vectors having a time difference of 0.5 sampling period are set as candidate vectors. Therefore, the subframe correction lag ΔL can be represented by 5 bits. This is a method when an upsampling filter is used for the adaptive codebook 14.

【００４７】図４（ｃ）は第３の方法の説明図である。
この方法も、アップサンプリングフィルターを適応コー
ドブック１４に用いることを前提としており、フレーム
ラグＬに対応した適応励振コードベクトルの近傍では、
１サンプリング周期より小さい時間ずつ先頭サンプルの
位置をずらして候補とする適応励振コードベクトルを考
え、フレームラグＬに対応した適応励振コードベクトル
から遠くなるに従い、隣合う候補の適応励振コードベク
トル間の時間差を大きくするようにしたものである。従
って、サブフレーム補正ラグΔＬを効率良く表現するこ
とができる。FIG. 4C is an explanatory diagram of the third method.
This method also assumes that an upsampling filter is used for the adaptive codebook 14, and in the vicinity of the adaptive excitation code vector corresponding to the frame lag L,
Consider an adaptive excitation code vector which is a candidate by shifting the position of the first sample by a time shorter than one sampling period, and as the distance from the adaptive excitation code vector corresponding to the frame lag L increases, the time difference between the adaptive excitation code vectors of adjacent candidates. Is made larger. Therefore, the subframe correction lag ΔL can be efficiently expressed.

【００４８】図４（ｄ）は第４の方法の説明図である。
この方法は、フレームラグＬが小さい範囲（ピッチ周波
数が高い範囲）では、隣合う候補の適応励振コードベク
トル間の時間差を小さくし、フレームラグＬが大きい範
囲（ピッチ周波数が低い範囲）では、隣合う候補の適応
励振コードベクトル間の時間差を大きく設定したもので
ある。従って、サブフレーム補正ラグΔＬを所定ビット
数で表現しても、フレームラグＬの値によって１ビット
が意味する値が異なったものとなっている。FIG. 4D is an explanatory diagram of the fourth method.
This method reduces the time difference between the adaptive excitation code vectors of adjacent candidates in a range where the frame lag L is small (range where the pitch frequency is high), and reduces the time difference in a range where the frame lag L is large (range where the pitch frequency is low). The time difference between adaptive excitation code vectors of matching candidates is set to be large. Therefore, even if the subframe correction lag ΔL is represented by a predetermined number of bits, the value of one bit differs depending on the value of the frame lag L.

【００４９】以上のように、この実施例においては、フ
レームラグＬを求めて適応コードブック１４の探索範囲
を設定している。これは、以下の理由による。有声音区
間におけるピッチ周期の変動は遅く、フレームについて
ピッチ周期（フレームラグＬ）を求めても、サブフレー
ムについてピッチ周期（フレームラグＬ＋サブフレーム
補正ラグΔＬ）を求めてもその値は似たものとなってい
る。従って、サブフレームのピッチ周期の予測値として
フレームについてのピッチ周期（フレームラグＬ）を用
い、その前後でサブフレームについてのピッチ周期を探
索しても正確なサブフレームについてのピッチ周期を得
ることができ、これにより探索時間及び探索処理量を軽
減することができる。As described above, in this embodiment, the search range of the adaptive codebook 14 is set by obtaining the frame lag L. This is for the following reason. The fluctuation of the pitch period in the voiced sound section is slow, and the values are similar whether the pitch period (frame lag L) is obtained for a frame or the pitch period (frame lag L + subframe correction lag ΔL) is obtained for a subframe. It has become. Therefore, even if the pitch period of the frame (frame lag L) is used as the predicted value of the pitch period of the subframe, and the pitch period of the subframe is searched before and after that, the accurate pitch period of the subframe can be obtained. Thus, the search time and the amount of search processing can be reduced.

【００５０】また、この実施例において、符号化音声信
号にフレームラグＬとサブフレーム補正ラグΔＬとを含
めて伝送するようにしたのは、以下の理由による。従来
では、サブフレーム毎にピッチ周期（フレームラグＬ＋
サブフレーム補正ラグΔＬに相当する）を伝送してい
た。仮に、１フレームが４サブフレームとし、ピッチ周
期情報を７ビットとすると、１フレーム当り従来では２
８ビット伝送することになる。一方、サブフレーム補正
ラグΔＬを４ビットとすると、この実施例の場合、１フ
レームについて、フレームについてのピッチ情報（フレ
ームラグＬ）を１回、サブフレームについてのピッチ情
報（サブフレーム補正ラグΔＬ）を４回伝送するので、
１フレームに伝送するピッチ情報のビット数は、７＋４
×４＝２３ビットとなる。すなわち、従来より圧縮効率
を高めることができる。Further, in this embodiment, the coded voice signal is transmitted including the frame lag L and the subframe correction lag ΔL for the following reason. Conventionally, the pitch period (frame lag L +
(Corresponding to the subframe correction lag ΔL). If one frame is composed of 4 subframes and the pitch period information is composed of 7 bits, 2 frames per frame are conventionally used.
8 bits will be transmitted. On the other hand, assuming that the subframe correction lag ΔL is 4 bits, in this embodiment, the pitch information (frame lag L) for the frame is once for one frame, and the pitch information (subframe correction lag ΔL) for the subframe. Is transmitted four times,
The number of bits of pitch information transmitted in one frame is 7 + 4
× 4 = 23 bits. That is, the compression efficiency can be increased as compared with the related art.

【００５１】次に、実施例のコード励振線形予測復号化
装置の構成及び動作を、図５を参照しながら詳述する。Next, the configuration and operation of the code-excited linear prediction decoding apparatus according to the embodiment will be described in detail with reference to FIG.

【００５２】コード励振線形予測符号化装置（図１）か
ら送信されてきた符号化音声信号は、多重分離器２０に
与えられ、声道パラメータα、フレームラグＬ、サブフ
レーム補正ラグΔＬ、統計コードインデックスＩ、最適
度ゲイン（β，γ）に分離される。分離された声道パラ
メータαは合成フィルタ２６に与えられ、分離されたフ
レームラグＬ及びサブフレーム補正ラグΔＬは多重分離
器２０内で加算されて適応コードブック２１に与えら
れ、分離された統計コードインデックスＩは統計コード
ブック２２に与えられ、分離された最適度ゲイン（β，
γ）の内適応励振コードベクトル用のゲインβは乗算器
２３に与えられ、他方の統計励振コードベクトル用のゲ
インγはの乗算器２５に与えられる。The coded speech signal transmitted from the code-excited linear predictive coding apparatus (FIG. 1) is supplied to the demultiplexer 20, where the vocal tract parameter α, the frame lag L, the subframe correction lag ΔL, the statistical code The index I is separated into the optimality gain (β, γ). The separated vocal tract parameter α is provided to the synthesis filter 26, and the separated frame lag L and subframe correction lag ΔL are added in the demultiplexer 20 and provided to the adaptive codebook 21, and the separated statistical code The index I is provided to the statistical codebook 22, and the separated optimality gain (β,
The gain β for the adaptive excitation code vector of γ) is given to the multiplier 23, and the gain γ for the other statistical excitation code vector is given to the multiplier 25 of γ).

【００５３】かくして、適応コードブック２１からその
サブフレームについて最適な適応励振コードベクトルが
出力され、これが乗算器２３によってβ倍されて加算器
２４に入力され、また、統計コードブック２２からその
サブフレームについて最適な統計励振コードベクトルが
出力され、これが乗算器２５によってγ倍されて加算器
２４に入力される。これにより、加算器２４において、
最終的にこのサブフレームについての励振コードベクト
ルが得られ、合成フィルタ２６がこの励振コードベクト
ルに対して声道パラメータαを適用して合成処理し、入
力音声信号に対応した再生音声信号を得る。加算器２４
から出力された励振コードベクトルは適応コードブック
２１にフィードバックされ、適応コードブック２１内の
適応コードベクトルが適応的に変更される。Thus, the adaptive adaptive excitation code vector for the subframe is output from the adaptive codebook 21, multiplied by β by the multiplier 23 and input to the adder 24, and the subframe is output from the statistical codebook 22. Is output, and this is multiplied by γ by the multiplier 25 and input to the adder 24. Thereby, in the adder 24,
Finally, an excitation code vector for this subframe is obtained, and the synthesis filter 26 synthesizes the excitation code vector by applying the vocal tract parameter α to obtain a reproduced audio signal corresponding to the input audio signal. Adder 24
Are fed back to the adaptive codebook 21, and the adaptive codevectors in the adaptive codebook 21 are adaptively changed.

【００５４】従って、上記実施例によれば、フレーム毎
に求めたフレームラグＬによって適応コードブック１４
の探索範囲を変更すると共に、符号化音声信号にフレー
ムラグＬ及びサブフレーム補正ラグΔＬを含めて伝送
し、復号化装置においてフレームラグＬ及びサブフレー
ム補正ラグΔＬからサブフレームラグ（サブフレームに
ついてのピッチ周期）Ｌ＋ΔＬを得て適応コードブック
２１をアクセスするようにしたので、有声音に寄与する
励振源情報の符号を決定する処理及び時間を従来より削
減することができ、また、有声音に寄与する励振源情報
の符号に必要なビット数を従来より削減することができ
る。Therefore, according to the above embodiment, the adaptive code book 14 is determined by the frame lag L obtained for each frame.
, The coded audio signal is transmitted together with the frame lag L and the subframe correction lag ΔL, and the decoding device converts the frame lag L and the subframe correction lag ΔL from the subframe lag (for the subframe). Since the adaptive codebook 21 is accessed by obtaining (pitch period) L + ΔL, the processing and time for determining the sign of the excitation source information contributing to the voiced sound can be reduced as compared with the conventional art, and the contribution to the voiced sound can be reduced. The number of bits required for the sign of the excitation source information to be performed can be reduced as compared with the related art.

【００５５】本発明は、適応コードブックを有するコー
ド励振線形予測符号化装置及び復号化装置であれば適用
することができ、フォワード型のものに限定されるもの
ではない。The present invention can be applied to any code-excited linear prediction encoding device and decoding device having an adaptive codebook, and is not limited to the forward type.

【００５６】上記実施例においては、長周期相関値を声
道パラメータの分析周期であるフレーム毎に求めるもの
を示したが、本発明はこれに限定されるものではなく、
声道パラメータの分析周期と長周期相関値を求める周期
とが異なっていても良い。但し、長周期相関値を求める
周期が、適応コードブックの探索周期の整数倍であるこ
とは必要である。In the above embodiment, the long-period correlation value is obtained for each frame which is the analysis period of the vocal tract parameters. However, the present invention is not limited to this.
The analysis cycle of the vocal tract parameters and the cycle for obtaining the long-period correlation value may be different. However, it is necessary that the cycle for obtaining the long-period correlation value is an integral multiple of the search cycle of the adaptive codebook.

【００５７】また、上記実施例においては、フレームラ
グＬとサブフレーム補正ラグΔＬを符号化音声信号に含
めて伝送するものを示したが、フレームラグＬとサブフ
レーム補正ラグΔＬとを加算したサブフレームラグＬ＋
ΔＬを符号化音声信号に含めて伝送するようにしても良
い。この場合には、コード励振線形予測復号化装置の構
成は従来と同様になる。このようにした場合、励振源情
報の圧縮効率を従来より高めるという効果を得ることが
できないが、適応コードブック１４の探索時間及び処理
を軽減できるという効果は得ることができる。In the above embodiment, the frame lag L and the sub-frame correction lag ΔL are transmitted while being included in the coded audio signal. However, the frame lag L and the sub-frame correction lag ΔL are added. Frame lug L +
ΔL may be included in the encoded audio signal and transmitted. In this case, the configuration of the code-excited linear predictive decoding device is the same as the conventional one. In this case, the effect of increasing the compression efficiency of the excitation source information cannot be obtained as compared with the related art, but the effect of reducing the search time and processing of the adaptive codebook 14 can be obtained.

【００５８】[0058]

【発明の効果】以上のように、請求項１の本発明によれ
ば、適応コードブックを利用した励振源情報の符号化周
期の整数倍の周期である長周期毎に入力音声信号のピッ
チ周期を求める長周期分析器を備え、適応コードブック
に格納されている複数の適応励振コードベクトルから長
周期中の各符号化周期それぞれについて最適な適応励振
コードベクトルを探索する際、各符号化周期についての
最適な適応励振コードベクトルを、各符号化周期に対し
てピッチ周期だけずれた時間位置の適応励振コードベク
トルを中心とした前後所定数の適応励振コードベクトル
の中から探索するようにしたので、適応コードブックに
対する探索時間及び処理を軽減することができるコード
励振線形予測符号化装置を実現できる。As described above, according to the first aspect of the present invention, the pitch of the input audio signal is changed every long period which is an integral multiple of the encoding period of the excitation source information using the adaptive codebook.
It includes a long-period analyzer for determining the switch period, the length of a plurality of adaptive excitation code vectors stored in the adaptive codebook
When searching for the optimal adaptive excitation code vector for each coding cycle in the cycle,
The optimal adaptive excitation code vector is calculated for each encoding cycle.
The search is performed from a predetermined number of adaptive excitation code vectors before and after the adaptive excitation code vector at the time position shifted by the pitch period , so that the search time and processing for the adaptive codebook can be reduced. An excitation linear prediction encoding device can be realized.

【００５９】請求項２の本発明によれば、さらに、コー
ド励振線形予測復号化装置に向けて伝送する符号化音声
に、長周期毎に、ピッチ周期と、長周期中の全ての各符
号化周期に対応する、符号化周期についての探索の結果
得られた最適な適応励振コードベクトルとピッチ周期だ
けずれた時間位置の適応励振コードベクトルとの各時間
差情報とを含めるようにしたので、圧縮効率を高めるこ
とができるという効果をも得ることができるコード励振
線形予測符号化装置を実現できる。According to the second aspect of the present invention, the coded speech transmitted to the code-excited linear predictive decoding apparatus is further provided with a pitch cycle for every long cycle and all the codes in the long cycle.
The optimal adaptive excitation code vector and pitch period obtained as a result of the search for the encoding period corresponding to the encoding period
Since each time difference information from the adaptive excitation code vector at the displaced time position is included, it is possible to realize a code excitation linear prediction encoding device that can also obtain the effect of increasing the compression efficiency.

【００６０】請求項３の本発明によれば、請求項２の本
発明のコード励振線形予測符号化装置から送信されてき
た符号化音声を復号処理するコード励振線形予測復号化
装置において、符号化音声に含まれているピッチ周期と
長周期中の各符号化周期に対応する各時間差情報とを加
算して得た各値を、それぞれ適応コードブックに対する
インデックスとして各符号化周期に対応する適応励振コ
ードベクトルを出力させるようにしたので、請求項２の
本発明による効果を実効あるものとすることができる。According to a third aspect of the present invention, there is provided a code excitation linear prediction decoding apparatus for decoding coded speech transmitted from the code excitation linear prediction encoding apparatus according to the second aspect of the present invention. The pitch period included in the voice
Each value obtained by adding the respective time difference information corresponding to each coding cycle in the long period, since so as to output an adaptive excitation code vector corresponding to each coding cycle as an index to each adaptive codebook Thus, the effect according to the second aspect of the present invention can be made effective.

[Brief description of the drawings]

【図１】実施例のコード励振線形予測符号化装置の構成
を示すブロック図である。FIG. 1 is a block diagram illustrating a configuration of a code excitation linear prediction encoding device according to an embodiment.

【図２】適応コードブックの概念の説明図である。FIG. 2 is an explanatory diagram of a concept of an adaptive code book.

【図３】実施例のフレームラグの表記方法（検出精度）
の説明図である。FIG. 3 is a notation method (detection accuracy) of a frame lag according to the embodiment.
FIG.

【図４】実施例の適応コードブックの探索範囲及びサブ
フレーム補正ラグの説明図である。FIG. 4 is an explanatory diagram of a search range of an adaptive codebook and a subframe correction lag according to the embodiment.

【図５】実施例のコード励振線形予測復号化装置の構成
を示すブロック図である。FIG. 5 is a block diagram illustrating a configuration of a code excitation linear prediction decoding device according to an embodiment.

[Explanation of symbols]

１２…長周期分析器、１３…励振源評価回路、１４…適
応コードブック、２０…多重分離器、Ｌ…フレームラグ
（長周期相関値）、ΔＬ…サブフレーム補正ラグ（最適
な適応励振コードベクトルと長周期相関値に応じた適応
励振コードベクトルとの時間差情報）。12: long-period analyzer, 13: excitation source evaluation circuit, 14: adaptive codebook, 20: demultiplexer, L: frame lag (long-period correlation value), ΔL: subframe correction lag (optimal adaptive excitation code vector (Time difference information between the adaptive excitation code vector corresponding to the long-period correlation value).

フロントページの続き (72)発明者伊東克俊東京都港区虎ノ門１丁目７番12号沖電気工業株式会社内 (56)参考文献特開平４−270398（ＪＰ，Ａ) 特開平１−179100（ＪＰ，Ａ) 特開平５−289700（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 11/00 - 21/06 H03M 7/30 H04B 14/04 ＪＩＣＳＴファイル（ＪＯＩＳ)Continuation of the front page (72) Inventor Katsutoshi Ito 1-7-12 Toranomon, Minato-ku, Tokyo Oki Electric Industry Co., Ltd. (56) References JP-A-4-270398 (JP, A) JP-A-1- 179100 (JP, A) JP-A-5-289700 (JP, A) (58) Fields investigated (Int. Cl. ⁷ , DB name) G10L 11/00-21/06 H03M 7/30 H04B 14/04 JICST File (JOIS)

Claims

(57) [Claims]

1. A code excitation linear predictive coding apparatus provided with an adaptive codebook, wherein the pitch of the input speech signal is changed every long period that is an integral multiple of the encoding period of the excitation source information using the adaptive codebook.
Includes a long-period analyzer for determining the switch period, the coding cycle in the long period of a plurality of adaptive excitation code vectors stored in said adaptive codebook it
Search for optimal adaptive excitation code vector
The optimal adaptive excitation code vector for each encoding cycle.
To the encoding period by the pitch period described above.
A code excitation linear predictive coding apparatus characterized in that a search is performed from a predetermined number of adaptive excitation code vectors before and after the adaptive excitation code vector at the time position .

2. The coded speech transmitted to the code-excited linear predictive decoding device includes, for each of the long periods, the pitch period and the code corresponding to all the coding periods in the long period.
A time difference information between the optimal adaptive excitation code vector obtained as a result of the search for the encoding period and the adaptive excitation code vector at a time position shifted by the pitch period. Item 2. A code excitation linear prediction encoding device according to item 1.

3. A code-excited linear prediction decoding device for decoding encoded speech transmitted from the code-excited linear prediction encoding device according to claim 2, wherein the pitch period included in the encoded speech is And the above long period
Code each value obtained by adding the respective time difference information, and characterized by outputting the adaptive excitation code vector corresponding to each coding cycle as an index to each adaptive codebook corresponding to each coding cycle in Excitation linear prediction decoding device.