JPH04301900A

JPH04301900A - Audio encoding device

Info

Publication number: JPH04301900A
Application number: JP3091470A
Authority: JP
Inventors: Masami Akamine; 政巳赤嶺; Masahiro Oshikiri; 正浩押切
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1991-03-29
Filing date: 1991-03-29
Publication date: 1992-10-26
Anticipated expiration: 2017-01-21
Also published as: JP3249144B2

Abstract

PURPOSE:To provide an audio encoding device which raises the resolution of an adaptive code book to improve the quality and does not increase the calculation volume neither the code volume. CONSTITUTION:The above device is provided with an adaptive code book 110 where driving signals of a synthesizing filter is stored, a minimum distortion search circuit 115 which searches the optimum driving signal in the adaptive code book 110, a synthesizing filter 112 which uses the searched optimum driving signal to synthesize audio signal, an up sampling circuit 131 which up-samples the driving signal read out from the adaptive code book 110, a delay circuit 132 which delays the up-sampled driving signal, a thinning circuit 133 which thins the delayed driving signal and stores the result in the adaptive code book 110, and a start point/end point determining circuit 105 which determines the start point and the end point of the driving signal stored in the adaptive code book 110.

Description

[Detailed description of the invention]

【０００１】0001

【産業上の利用分野】本発明は音声符号化装置に係り、
特に音声信号を８ｋｂｐｓ　程度以下の低ビットレート
で符号化するのに適した音声符号化装置に関する。[Industrial Application Field] The present invention relates to a speech encoding device.
In particular, the present invention relates to an audio encoding device suitable for encoding audio signals at a low bit rate of about 8 kbps or less.

【０００２】0002

【従来の技術】音声信号を低ビットレートで高能率に符
号化する技術は、自動車電話などの移動体通信や、企業
内通信において、電波の有効利用や通信コスト削減のた
めの重要な技術である。８ｋｂｐｓ　以下のビットレー
トで品質の優れた音声符号化方式として、ＣＥＬＰ（Ｃ
ｏｄｅ　Ｅｘｃｉｔｅｄ　Ｌｉｎｅａｒ　Ｐｒｅｄｉｃ
ｔｉｏｎ）方式が知られている。[Background Art] Technology to encode audio signals with high efficiency at a low bit rate is an important technology for effective use of radio waves and reduction of communication costs in mobile communications such as car phones and in-house communications. be. CELP (C
ode Excited Linear Predic
tion) method is known.

【０００３】このＣＥＬＰ方式は、ＡＴ＆Ｔベル研のＭ
．Ｒ．Ｓｃｈｒｏｅｄｅｒ　氏とＢ．Ｓ．Ａｔａｌ氏に
より“Ｃｏｄｅ−Ｅｘｃｉｔｅｄ　Ｌｉｎｅａｒ　Ｐｒ
ｅｄｉｃｔｉｏｎ（ＣＥＬＰ）“Ｈｉｇｈ−Ｑｕａｌｉ
ｔｙ　Ｓｐｅｅｃｈ　ａｔ　Ｖｅｒｙ　ＬｏｗＢｉｔ　
Ｒａｔｅｓ　”Ｐｒｏｃ．ＩＣＡＳＳＰ；１９８５，ｐ
ｐ．９３７−９３９　（文献１）で発表されて以来、高
品質の音声が合成できる方式として注目され、品質の改
善や、計算量の削減など、種々の検討がなされて来た。ＣＥＬＰ方式の特徴は、ＬＰＣ（Ｌｉｎｅｒ　Ｐｒｅｄ
ｉｃｔｉｖｅＣｏｄｉｎｇ：線形予測符号化）合成フィ
ルタの駆動信号を駆動信号ベクトルとしてコードブック
に格納し、合成音声信号と入力音声信号の誤差を評価し
ながら、最適な駆動信号ベクトルをコードブックから探
索する点にある。[0003] This CELP method was developed by AT&T Bell Labs' M
．． R. Schroeder and B. S. “Code-Excited Linear Pr” by Mr. Atal
edition(CELP) “High-Quali
ty Speech at Very LowBit
Rates “Proc. ICASSP; 1985, p.
p. Since it was announced in 937-939 (Reference 1), it has attracted attention as a method that can synthesize high-quality speech, and various studies have been conducted to improve the quality and reduce the amount of calculation. The CELP method is characterized by LPC (Liner Pred
(active coding: linear predictive coding) The drive signal of the synthesis filter is stored in the codebook as a drive signal vector, and the optimal drive signal vector is searched from the codebook while evaluating the error between the synthesized speech signal and the input speech signal. be.

【０００４】図５は、最新のＣＥＬＰ方式による音声符
号化装置のブロック図である。同図において、入力信号
であるサンプリングされた音声信号系列は入力端子６０
０からフレーム単位で入力される。フレームはＬ個の信
号サンプルからなり、サンプリング周波数が８ｋＨｚの
場合、一般にＬ＝１６０が用いられる。図５には示され
ていないが、駆動信号ベクトルの探索に先立ち、入力さ
れたＬサンプルの音声信号系列に対してＬＰＣ分析が行
われ、ＬＰＣ予測パラメータ｛αｉ　，ｉ＝１，２，…
ｐ｝が抽出される。このＬＰＣ予測パラメータαｉ　は
、ＬＰＣ合成フィルタ６３０に供給される。なお、ｐは
予測次数であり、一般にｐ＝１０が用いられる。ＬＰＣ
合成フィルタ６３０の伝達関数Ｈ（ｚ）　は、［数１］
で与えられる。FIG. 5 is a block diagram of a speech encoding device using the latest CELP method. In the same figure, the sampled audio signal sequence that is the input signal is input to the input terminal 60.
It is input in frame units starting from 0. A frame consists of L signal samples, and for a sampling frequency of 8 kHz, L=160 is typically used. Although not shown in FIG. 5, prior to searching for the drive signal vector, LPC analysis is performed on the input audio signal sequence of L samples, and LPC prediction parameters {αi, i=1, 2, . . .
p} is extracted. This LPC prediction parameter αi is supplied to the LPC synthesis filter 630. Note that p is the prediction order, and generally p=10 is used. LPC
The transfer function H(z) of the synthesis filter 630 is [Equation 1]
is given by

【０００５】[0005]

【数１】次に、音声信号を合成しながら最適な駆動信号ベクトル
を探索する過程について説明する。まず、入力端子６０
０に入力された１フレームの音声信号から、減算器６１
０で前フレームでの合成フィルタ６３０の内部状態が現
フレームに与える影響が減算される。減算器６１０から
得られた信号系列は４個のサブフレームに分割され、各
サブフレームの目標信号ベクトルとなる。[Equation 1] Next, a process of searching for an optimal drive signal vector while synthesizing audio signals will be described. First, input terminal 60
0, the subtracter 61
With 0, the influence of the internal state of the synthesis filter 630 in the previous frame on the current frame is subtracted. The signal sequence obtained from the subtracter 610 is divided into four subframes, and becomes a target signal vector for each subframe.

【０００６】ＬＰＣ合成フィルタ６３０の入力信号であ
る駆動信号ベクトルは、適応コードブック６４０から選
択された駆動信号ベクトルに乗算器６５０で所定のゲイ
ンを乗算したものと、白色雑音コードブック７１０から
選択された雑音ベクトルに乗算器７２０で所定のゲイン
を乗算したものとを加算器６６０で加算することで得ら
れる。The drive signal vector that is the input signal to the LPC synthesis filter 630 is obtained by multiplying the drive signal vector selected from the adaptive codebook 640 by a predetermined gain in the multiplier 650, and the drive signal vector selected from the white noise codebook 710. The noise vector obtained by multiplying the noise vector by a predetermined gain by the multiplier 720 is added by the adder 660.

【０００７】ここで、適応コードブック６４０は文献１
に記載されているピッチ予測分析を閉ループ動作または
合成による分析（Ａｎａｌｙｓｉｓ　ｂｙ　Ｓｙｎｔｈ
ｅｓｉｓ）　によって行うものであり、詳細はＷ．Ｂ．
Ｋｌｅｉｊｉｎ　Ｄ．Ｊ．Ｋｒａｓｉｎｓｋｉ　ａｎｄ
　Ｒ．Ｈ．Ｋｅｔｃｈｕｍ，”Ｉｍｐｒｏｖｅｄ　Ｓｐ
ｅｅｃｈＱｕａｌｉｔｙ　ａｎｄ　Ｅｆｆｉｃｉｅｎｔ
　Ｖｅｃｔｏｒ　Ｑｕａｎｔｉｚａｔｉｏｎ　ｉｎ　Ｃ
ＥＬＰ”，Ｐｒｏｃ．ＩＣＡＳＳＰ，１９８８，ｐｐ．
１５５−１５８　（文献２）に述べられている。この文
献２によると、ＬＰＣ合成フィルタ６３０の駆動信号を
ピッチ探索範囲ａ〜ｂ（ａ，ｂは駆動信号のサンプル番
号であり、通常ａ＝２０，ｂ＝１４７）にわたって遅延
回路６７０で１サンプルづつ遅延させることにより、ａ
〜ｂサンプルのピッチ周期に対する駆動信号ベクトルを
作成し、これがコードワードとして適応コードブックに
格納される。[0007] Here, the adaptive codebook 640 is
Analysis by Synth
esis), and the details can be found in W. B.
Kleijin D. J. Krasinski and
R. H. Ketchum, “Improved Sp.
eechQuality and Efficiency
Vector Quantization in C
ELP”, Proc. ICASSP, 1988, pp.
155-158 (Reference 2). According to this document 2, the drive signal of the LPC synthesis filter 630 is processed one sample at a time by the delay circuit 670 over the pitch search range a to b (a, b are the sample numbers of the drive signal, usually a=20, b=147). By delaying a
Create a drive signal vector for a pitch period of ~b samples, which is stored as a codeword in the adaptive codebook.

【０００８】最適な駆動信号ベクトルの探索を行う場合
、適応コードブック６４０から各ピッチ周期に対応する
駆動信号ベクトルのコードワードが１個ずつ読み出され
、乗算器６５０で所定のゲインと乗算される。そして、
ＬＰＣ合成フィルタ６３０によりフィルタ演算が行われ
、合成音声信号ベクトルが生成される。生成された合成
音声信号ベクトルは、減算器６２０で目標信号ベクトル
と減算される。この減算器６２０の出力は聴感重み付け
フィルタ６８０を経て誤差計算回路６９０に入力され、
平均２乗誤差が求められる。平均２乗誤差の情報は更に
最小歪探索回路７００に入力され、その最小値が検出さ
れる。When searching for an optimal drive signal vector, the codewords of the drive signal vector corresponding to each pitch period are read out one by one from the adaptive codebook 640 and multiplied by a predetermined gain in a multiplier 650. . and,
A filter operation is performed by the LPC synthesis filter 630 to generate a synthesized speech signal vector. The generated synthetic speech signal vector is subtracted from the target signal vector by a subtracter 620. The output of this subtracter 620 is input to an error calculation circuit 690 via an auditory weighting filter 680.
The mean squared error is determined. The information on the mean squared error is further input to the minimum distortion search circuit 700, and its minimum value is detected.

【０００９】以上の過程は、適応コードブック６４０中
の全ての駆動信号ベクトルのコードワードについて行わ
れ、最小歪探索回路７００において平均２乗誤差の最小
値を与えるコードワードの番号が求められる。また、乗
算器６５０で乗じられるゲインも平均２乗誤差が最小に
なるよう決定される。The above process is performed for all the codewords of the drive signal vectors in the adaptive codebook 640, and the minimum distortion search circuit 700 finds the number of the codeword that gives the minimum value of the mean squared error. Further, the gain multiplied by the multiplier 650 is also determined so that the mean square error is minimized.

【００１０】次に、同様の方法で最適な白色雑音ベクト
ルの探索が行われる。すなわち、白色雑音コードブック
７１０から雑音ベクトルのコードワードが１個ずつ読み
出され、乗算器７２０でのゲインとの乗算、ＬＰＣ合成
フィルタ６３０でのフィルタ演算を経て、合成音声信号
ベクトルの生成、目標ベクトルとの平均２乗誤差の計算
が全ての雑音ベクトルについて行われる。そして、平均
２乗誤差の最小値を与える雑音ベクトルの番号及びゲイ
ンが求められる。なお、聴感重み付けフィルタ６８０は
減算器６２０から出力される誤差信号のスペクトルを整
形して、人間に知覚される歪を低減するために用いられ
る。Next, a search for an optimal white noise vector is performed in a similar manner. That is, codewords of noise vectors are read out one by one from the white noise codebook 710, multiplied by a gain in a multiplier 720, and filtered in an LPC synthesis filter 630 to generate a synthesized speech signal vector and obtain the target. A calculation of the mean squared error with the vector is performed for all noise vectors. Then, the number and gain of the noise vector that gives the minimum value of the mean squared error are determined. Note that the perceptual weighting filter 680 is used to shape the spectrum of the error signal output from the subtracter 620 to reduce distortion perceived by humans.

【００１１】このようにＣＥＬＰ方式は、合成音声信号
と入力音声信号との誤差が最小になるような最適の駆動
信号ベクトルを求めているので、８ｋｂｐｓ　程度の低
ビットレートでも高品質の音声を合成することができる
。しかし、８ｋｂｐｓ　以下のビットレートでは品質の
劣化が知覚され、まだ不十分である。[0011] In this way, the CELP method seeks the optimal drive signal vector that minimizes the error between the synthesized speech signal and the input speech signal, so it is possible to synthesize high-quality speech even at a low bit rate of about 8 kbps. can do. However, at a bit rate of 8 kbps or less, quality deterioration is perceived, and this is still insufficient.

【００１２】そこで、適応コードブックの解像度を上げ
て品質を向上させる方法がＰ．ｋｒｏｏｎ氏とＢ．Ｓ．
Ａｔａｌ氏によって”ＰＩＴＣＨ　ＰＲＥＤＩＣＴＯＲ
Ｓ　ＷＩＴＨ　ＴＥＭＰＯＲＡＬ　ＲＥＳＯＬＵＴＩＯ
Ｎ”　，ｐｒｏｃ．ＩＣＡＳＳＰ，ｐｐ．６６１−６６
４，１９９０（文献３）および”ＩＭＰＲＯＶＥＤ　Ｐ
ＩＴＣＨ　ＰＲＥＤＩＣＴＩＯＮ　ＷＩＴＨＦＲＡＣＴ
ＩＯＮＡＬ　ＤＥＬＡＹＳ　ＩＮ　ＳＥＬＰ　ＣＯＤＩ
ＮＧ”　，ｐｒｏｃ．ＩＣＡＳＳＰ，ｐｐ．６６５−６
６８，１９９０（文献４）で提案されている。これらの
文献３および４に記載された方法は、図５におけるＬＰ
Ｃ合成フィルタ６３０の駆動信号をピッチ探索範囲ａ〜
ｂにわたって遅延させる際、駆動信号のサンプリング周
波数を入力音声信号のそれより高くし（アップサンプリ
ング）、このアップサンプリングされた駆動信号を１サ
ンプル以下の単位で遅延した後、適応コードブック６４
０に格納する点が特徴である。Therefore, P. Mr. kroon and B. S.
“PITCH PREDICTOR” by Mr. Atal
S WITH TEMPORAL RESOLUTIO
N”, proc. ICASSP, pp. 661-66
4, 1990 (Reference 3) and “IMPROVED P
ITCH PREDICTION WITHFRACT
IONAL DELAYS IN SELP CODI
NG”, proc. ICASSP, pp.665-6
68, 1990 (Reference 4). The methods described in these documents 3 and 4 are based on the LP in FIG.
The drive signal of the C synthesis filter 630 is set in the pitch search range a~
b, the sampling frequency of the drive signal is made higher than that of the input audio signal (up-sampling), and after delaying this up-sampled drive signal in units of one sample or less, the adaptive codebook 64
The feature is that it is stored as 0.

【００１３】具体的にはアップサンプリングは、サンプ
ル間に所定数の０を挿入し、内挿フィルタでサンプル値
間を内挿することによって行うことができる。アップサ
ンプリングされた駆動信号は１サンプル単位で遅延回路
によって遅延された後、所定数のサンプル単位でサンプ
ルが間引きされて元のサンプリング周波数に戻され、適
応コードブックに格納される。この場合、例えばアップ
サンプリングの倍率を２倍とすれば、適応コードブック
には入力音声信号と同一サンプル周期の駆動信号ベクト
ルと、これらを１／２サンプル周期ずらせた駆動信号ベ
クトルが格納されることになる。Specifically, upsampling can be performed by inserting a predetermined number of 0s between samples and interpolating between sample values using an interpolation filter. The upsampled drive signal is delayed by a delay circuit in units of one sample, and then the samples are thinned out in units of a predetermined number of samples to return to the original sampling frequency and stored in the adaptive codebook. In this case, for example, if the upsampling magnification is doubled, the adaptive codebook will store a drive signal vector with the same sample period as the input audio signal and a drive signal vector with a 1/2 sample period shift between these two. become.

【００１４】このように、適応コードブックのサンプリ
ング周波数を上げると、入力音声信号の１サンプル以下
の単位でピッチ探索を行うことができるので、ピッチ予
測の精度が向上し、符号化音声の品質が改善される。文
献３によれば、サンプリング周波数を２倍にすることに
よってセグメンタルＳＮＲが女性音声で０．９ｄＢ、男
性音声で０．６ｄＢ向上するとされている。[0014] In this way, by increasing the sampling frequency of the adaptive codebook, pitch search can be performed in units of one sample or less of the input speech signal, which improves pitch prediction accuracy and improves the quality of encoded speech. Improved. According to Document 3, doubling the sampling frequency improves the segmental SNR by 0.9 dB for female voices and by 0.6 dB for male voices.

【００１５】しかしながら、この方法ではアップサンプ
リングによって、適応コードブックのサイズがアップサ
ンプリングの倍率に比例して大きくなり、それに伴い適
応コードブックからの駆動信号ベクトル探索に要する計
算量がアップサンプリングの倍率に比例して増大すると
共に、受信側に駆動信号ベクトルのコード番号（インデ
ックス）を伝送するのに必要な符号量が増加してしまう
。例えばアップサンプリングの倍率を２倍にした場合、
駆動信号ベクトルの探索に要する計算量が２倍になると
共に、１フレーム当り４ビット符号量が増加する。However, in this method, due to upsampling, the size of the adaptive codebook increases in proportion to the upsampling magnification, and accordingly, the amount of calculation required to search for the drive signal vector from the adaptive codebook increases with the upsampling magnification. As the number increases proportionally, the amount of code required to transmit the code number (index) of the drive signal vector to the receiving side also increases. For example, if the upsampling magnification is doubled,
The amount of calculation required to search for the drive signal vector doubles, and the amount of 4-bit code per frame increases.

【００１６】[0016]

【発明が解決しようとする課題】上述したように、適応
コードブックを有する従来のＣＥＬＰ方式において、ア
ップサンプリングにより適応コードブックの分解能を上
げた場合、品質は向上するものの符号化処理の大部分を
占める適応コードブックからの駆動信号ベクトル探索に
要する計算量が増大する。[Problems to be Solved by the Invention] As mentioned above, in the conventional CELP method having an adaptive codebook, when the resolution of the adaptive codebook is increased by upsampling, although the quality improves, most of the encoding process is The amount of calculation required to search for the drive signal vector from the adaptive codebook increases.

【００１７】このため、音声符号化を実時間処理で実現
しようとすると高速のＤＳＰ（ディジタル信号処理回路
）が複数個必要となり、回路規模の大型化と価格上昇お
よび消費電力の増加を招くという問題があった。[0017] Therefore, if audio encoding is to be realized through real-time processing, a plurality of high-speed DSPs (digital signal processing circuits) will be required, which poses the problem of increasing the circuit scale, increasing prices, and increasing power consumption. was there.

【００１８】また、適応コードブックの分解能向上に伴
い、駆動信号ベクトルのコード番号を伝送するのに必要
な符号量が増加するため、伝送ビットレートが高くなる
という問題もあった。Furthermore, as the resolution of the adaptive codebook improves, the amount of code necessary to transmit the code number of the drive signal vector increases, resulting in a higher transmission bit rate.

【００１９】本発明は上記の問題点に鑑みてなされたも
ので、計算量および符号量を増加させることなく、適応
コードブックの分解能を上げて品質の向上を図ることが
できる音声符号化装置を提供することを目的とする。The present invention has been made in view of the above problems, and provides a speech encoding device that can improve the quality by increasing the resolution of an adaptive codebook without increasing the amount of calculation and code. The purpose is to provide.

【００２０】[0020]

【課題を解決するための手段】本発明は上記の課題を解
決するために、駆動信号をコードワードとして格納した
コードブック（適応コードブック）と、入力音声信号を
参照して適応コードブックから最適な駆動信号を探索す
る探索手段と、探索された最適な駆動信号を用いて音声
信号を合成する合成フィルタと、適応コードブックから
読み出される駆動信号をアップサンプリングするサンプ
リング変換手段と、アップサンプリングされた駆動信号
を遅延する遅延手段と、遅延された駆動信号を間引いて
適応コードブックに格納する手段と、適応コードブック
に格納される駆動信号の始点と終点を決定する始点／終
点決定手段とを具備することを特徴とする。[Means for Solving the Problems] In order to solve the above problems, the present invention provides a codebook (adaptive codebook) in which drive signals are stored as codewords, and an optimal codebook from the adaptive codebook by referring to an input audio signal. a search means for searching for a suitable drive signal; a synthesis filter for synthesizing an audio signal using the searched optimum drive signal; a sampling conversion means for upsampling the drive signal read from the adaptive codebook; The apparatus includes a delay means for delaying a drive signal, a means for thinning out the delayed drive signal and storing it in an adaptive codebook, and a start point/end point determining means for determining a start point and an end point of the drive signal to be stored in the adaptive codebook. It is characterized by

【００２１】適応コードブックに格納される駆動信号の
始点と終点の決定は、例えば入力音声信号をピッチ分析
して求められたピッチ周期、または駆動信号の探索過程
で求められた前フレームでのピッチ周期、あるいは前フ
レームで適応コードブックから探索された駆動信号ベク
トルの符号に基づいて行われる。The start and end points of the drive signal stored in the adaptive codebook are determined by, for example, the pitch cycle found by pitch analysis of the input audio signal, or the pitch in the previous frame found in the process of searching for the drive signal. This is done based on the cycle or the sign of the drive signal vector searched from the adaptive codebook in the previous frame.

【００２２】[0022]

【作用】本発明では駆動信号をアップサンプリングして
適応コードブックに格納することにより、入力音声信号
の１サンプル以下の高い分解能でピッチ周期を求めるこ
とができ、符号化品質が向上する。According to the present invention, by upsampling the drive signal and storing it in an adaptive codebook, the pitch period can be determined with a high resolution of one sample or less of the input audio signal, thereby improving the encoding quality.

【００２３】しかも、適応コードブックに格納する駆動
信号の始点と終点を予め決定することによって、アップ
サンプリングにもかかわらず適応コードブックのサイズ
またはピッチ探索数が一定に保たれるようになるので、
適応コードブックから駆動信号を探索する際の計算量の
増大や、符号量の増加が避けられる。Furthermore, by determining the start and end points of the drive signal stored in the adaptive codebook in advance, the size of the adaptive codebook or the number of pitch searches can be kept constant despite upsampling.
An increase in the amount of calculation and code amount when searching for a drive signal from an adaptive codebook can be avoided.

【００２４】また、ピッチ周期はフレーム単位の短い時
間では変化が少ないという音声信号の性質に基づいて、
入力音声信号に対するピッチ分析によって求められるピ
ッチ周期、または前フレームで求められたピッチ周期に
基づいて駆動信号の始点と終点を決定して適応コードブ
ック探索の範囲を定めることにより、適応コードブック
からの駆動信号の探索数をアップサンプリングに関らず
一定にすることに起因する品質劣化が防止される。[0024] Furthermore, based on the property of the audio signal that the pitch period does not change much in a short period of time in units of frames,
The start and end points of the drive signal are determined based on the pitch period obtained by pitch analysis of the input audio signal or the pitch period obtained in the previous frame, and the scope of the adaptive codebook search is determined. Quality deterioration caused by keeping the number of drive signal searches constant regardless of upsampling is prevented.

【００２５】[0025]

【実施例】以下、図面を参照しながら本発明の実施例を
説明する。図１は本発明の一実施例に係る音声符号化装
置のブロック図である。Embodiments Hereinafter, embodiments of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram of a speech encoding device according to an embodiment of the present invention.

【００２６】図１において、入力音声信号は入力端子１
００からフレームバッファ１０１に入力される。フレー
ムバッファ１０１では、入力音声信号系列をＬ個のサン
プル単位で切出し、１フレームの信号として記憶する。Ｌは、通常１６０である。フレームバッファ１０１から
の１フレームの入力音声信号系列は、ＬＰＣ分析回路１
０２および重み付けフィルタ１０６へ供給される。In FIG. 1, the input audio signal is input to input terminal 1.
00 to the frame buffer 101. The frame buffer 101 cuts out the input audio signal sequence in units of L samples and stores them as one frame signal. L is usually 160. One frame of input audio signal sequence from the frame buffer 101 is sent to the LPC analysis circuit 1
02 and weighting filter 106.

【００２７】ＬＰＣ分析回路１０２は、例えば自己相関
法を用いて入力音声信号に対してＬＰＣ（Ｌｉｎｅａｒ
　Ｐｒｅｄｉｃｔｉｖｅ　Ｃｏｄｉｎｇ：線形予測符号
化）分析を行い、Ｐ個のＬＰＣ予測係数｛αｉ　、ｉ＝
１，２，…ｐ、｝、または反射係数｛ｋｉ　、ｉ＝１，
２，…，ｐ｝を抽出する。抽出された予測係数または反
射係数は、符号化回路１０３において所定のビット数で
符号化された後、重み付けフィルタ１０６および重み付
け合成フィルタ１０７，１１２，１２２で利用される。[0027] The LPC analysis circuit 102 performs LPC (Linear
Predictive Coding (Linear Predictive Coding) analysis is performed, and P LPC prediction coefficients {αi, i=
1,2,...p,} or reflection coefficient {ki, i=1,
2,...,p} are extracted. The extracted prediction coefficients or reflection coefficients are encoded with a predetermined number of bits in the encoding circuit 103 and then used in the weighting filter 106 and the weighting synthesis filters 107, 112, and 122.

【００２８】重み付けフィルタ１０６は、適応コードブ
ック１１０および雑音コードブック１２０から合成フィ
ルタの駆動信号ベクトルを探索する際に、入力音声信号
系列に重み付けを行うものである。合成フィルタ１０７
，１１２，１２２の伝達関数Ｈ（ｚ）　は、［数１］で
記述される。この時、重み付けフィルタ１０６の伝達関
数Ｗ（ｚ）　は［数２］で表される。The weighting filter 106 weights the input speech signal sequence when searching for a driving signal vector for the synthesis filter from the adaptive codebook 110 and the noise codebook 120. Synthesis filter 107
, 112, 122 is described by [Equation 1]. At this time, the transfer function W(z) of the weighting filter 106 is expressed by [Equation 2].

【００２９】[0029]

【数２】但し、γは重み付けの強さを制御するパラメータである
（０≦γ≦１）。[Equation 2] However, γ is a parameter that controls the strength of weighting (0≦γ≦1).

【００３０】重み付け合成フィルタ１１２，１２２は、
Ｈ（ｚ）　なる伝達関数の合成フィルタと、Ｗ（ｚ）　
なる伝達関数の重み付けフィルタを縦続接続したフィル
タであり、その伝達関数ＨＷ　（ｚ）　は［数３］で記
述される。The weighted synthesis filters 112 and 122 are
A synthesis filter with a transfer function of H(z) and W(z)
This is a filter in which weighting filters with transfer functions are connected in cascade, and the transfer function HW (z) is described by [Equation 3].

【００３１】[0031]

【数３】本実施例のように重み付けフィルタ１０６を用いると、
聴感上の符号化歪を低減することが可能になる。また、
本実施例では重み付けフィルタを１０６を駆動信号ベク
トルの探索ループの外に設けた構成になっており、この
結果、探索に要する計算量が大幅に削減される。[Equation 3] When the weighting filter 106 is used as in this embodiment,
It becomes possible to reduce perceptual coding distortion. Also,
In this embodiment, the weighting filter 106 is provided outside the drive signal vector search loop, and as a result, the amount of calculation required for the search is significantly reduced.

【００３２】さらに、重み付け合成フィルタ１１２，１
２２が駆動信号ベクトルの探索に影響を与えないように
、初期メモリを持った重み付け合成フィルタ１０７が設
けられている。この重み付け合成フィルタ１０７は、前
フレームの最後に重み付け合成フィルタ１１２，１２２
が保持していた内部状態を初期状態として持つ。Furthermore, weighted synthesis filter 112,1
A weighted synthesis filter 107 having an initial memory is provided so that the filter 22 does not affect the search for the drive signal vector. This weighted synthesis filter 107 is applied to the weighted synthesis filters 112 and 122 at the end of the previous frame.
has as its initial state the internal state held by .

【００３３】そして、重み付け合成フィルタ１０７の零
入力応答ベクトルを作成し、減算器１０８において重み
付けフィルタ１０６の出力から上記零入力応答ベクトル
を減算する。これにより、重み付け合成フィルタ１１２
，１２２の初期状態を零とすることができ、前フレーム
の影響を考慮せずに駆動信号ベクトルの探索を行うこと
ができる。Then, a zero-input response vector for the weighted synthesis filter 107 is created, and the zero-input response vector is subtracted from the output of the weighting filter 106 in the subtracter 108. As a result, the weighted synthesis filter 112
, 122 can be set to zero, and the drive signal vector can be searched without considering the influence of the previous frame.

【００３４】ＬＰＣ分析回路１０２は、ＬＰＣ分析を行
うと共に予測残差信号を計算し、ピッチ分析回路１０４
に残差信号を供給する。ピッチ分析回路１０４は、共分
散法等公知の方法を用いてピッチ周期Ｔｐ　を求め、こ
れを始点／終点決定回路１０５とマルチプレクサ１４２
へ与える。The LPC analysis circuit 102 performs LPC analysis and calculates a prediction residual signal, and the pitch analysis circuit 104
supply the residual signal to. The pitch analysis circuit 104 determines the pitch period Tp using a known method such as the covariance method, and uses this to determine the pitch period Tp between the start point/end point determination circuit 105 and the multiplexer 142.
give to

【００３５】以上の処理は、全てフレーム単位で行われ
る。次に、フレームをＭ個（通常、Ｍ＝４）のサブフレ
ームに分割し、サブフレーム単位で行う駆動信号ベクト
ル探索の処理について説明する。All of the above processing is performed frame by frame. Next, a process of dividing a frame into M subframes (usually M=4) and searching for a drive signal vector performed in subframe units will be described.

【００３６】駆動信号ベクトルの探索は適応コードブッ
ク１１０、雑音コードブック１２０の順に行われる。ま
ず、適応コードブック１１０からピッチ周期ｊに対応す
る駆動信号ベクトルＸｊ　（ベクトルの次元は、Ｌ／Ｍ
＝Ｋ）を順次読み出し、乗算器１１１でＸｊ　に所定の
ゲインβを乗じた後、重み付け合成フィルタ１１２に供
給する。重み付け合成フィルタ１１２では、フィルタリ
ング演算を行って合成音声ベクトルを作成する。The search for the drive signal vector is performed in the order of adaptive codebook 110 and noise codebook 120. First, from the adaptive codebook 110, drive signal vector Xj corresponding to pitch period j (vector dimension is L/M
=K) are sequentially read out, Xj is multiplied by a predetermined gain β in a multiplier 111, and then supplied to a weighted synthesis filter 112. The weighted synthesis filter 112 performs a filtering operation to create a synthesized speech vector.

【００３７】一方、フレームバッファ１０１から読み出
された入力音声信号は、重み付けフィルタ１０６によっ
て重み付けられた後、減算器１０８で前フレームの影響
が差し引かれる。この減算器１０８から出力される音声
信号ベクトルＹを目標ベクトルとして、減算器１１３で
重み付け合成フィルタ１１２からの合成音声ベクトルと
の誤差ベクトルＥｊ　が計算される。そして、２乗誤差
計算回路１１４で誤差の２乗和‖Ｅｊ　‖が計算され、
この‖Ｅｊ　‖の最小値および最小値を与えるインデッ
クスｊが最小歪探索回路１１５で検出される。このイン
デックスｊが適応コードブック１１０とマルチプレクサ
１４２に与えられる。On the other hand, the input audio signal read from the frame buffer 101 is weighted by a weighting filter 106, and then the influence of the previous frame is subtracted by a subtracter 108. Using the audio signal vector Y output from the subtracter 108 as a target vector, a subtracter 113 calculates an error vector Ej with respect to the synthesized audio vector from the weighted synthesis filter 112. Then, the squared error calculation circuit 114 calculates the squared sum of errors ‖Ej ‖,
The minimum value of |Ej || and the index j giving the minimum value are detected by the minimum distortion search circuit 115. This index j is provided to adaptive codebook 110 and multiplexer 142.

【００３８】具体的には、誤差ベクトルＥｊ　は例えば
［数４］で表わされる。この誤差ベクトル‖Ｅｊ　‖を
βで偏微分して零と置くことによって、βを最適化した
場合の‖Ｅｊ　‖の最小値が［数５］で表される。但し
、βは乗算器１１１で与えられるゲインである。Specifically, the error vector Ej is expressed, for example, by [Equation 4]. By partially differentiating this error vector ‖Ej ‖ with respect to β and setting it to zero, the minimum value of ‖Ej ‖ when β is optimized is expressed by [Equation 5]. However, β is the gain given by the multiplier 111.

【００３９】[0039]

【数４】[Math 4]

【００４０】[0040]

【数５】ここで、‖Ｘ‖は２乗ノルム、（Ｘ，Ｙ）は内積をそれ
ぞれ表し、Ｈは［数６］で与えられる重み付け合成フィ
ルタ（伝達関数：ＨＷ　（ｚ）　）のインパルス応答行
列である。[Equation 5] Here, ‖X‖ represents the square norm, (X, Y) represent the inner product, and H is the impulse response of the weighted synthesis filter (transfer function: HW (z)) given by [Equation 6] It's a queue.

【００４１】[0041]

【数６】［数６］から明らかなように、適応コードブック１１０
からの駆動信号ベクトルの探索は、全てのコードワード
Ｘｊ　に対し［数６］の右辺第２項を計算し、それが最
大になるインデックスｊを検出することによって行う。[Equation 6] As is clear from [Equation 6], the adaptive codebook 110
The search for the drive signal vector from is performed by calculating the second term on the right side of [Equation 6] for all code words Xj and detecting the index j at which it is maximum.

【００４２】このようにして適応コードブック１１０か
ら最適な駆動信号ベクトルＸｏｐｔ　が探索されると、
減算器１１３で目標ベクトルＹからＸｏｐｔ　に対応す
る重み付け合成フィルタ１１２の出力が差し引かれ、こ
の減算器１１３の出力が雑音コードブック１２０からの
雑音ベクトル探索の目標ベクトルとされる。雑音コード
ブック１２０からの雑音ベクトルの探索も、適応コード
ブック１１０らの駆動信号ベクトルの探索と全く同様に
行うことができる。この雑音ベクトル１２０からの探索
で得られたコードベクトルをＮｏｐｔ　とすると、合成
フィルタの駆動信号ベクトルＸはＸ＝β・Ｘｏｐｔ　＋ｇ・Ｎｏｐｔ　と表される。但し、β，ｇはそれぞれ乗算器１１１，１
２１において適応コードブック１１０および雑音コード
ブック１２０から探索された駆動信号ベクトルおよび雑
音ベクトルに与えられるゲインである。When the optimal drive signal vector Xopt is searched from the adaptive codebook 110 in this way,
A subtracter 113 subtracts the output of the weighted synthesis filter 112 corresponding to Xopt from the target vector Y, and the output of the subtracter 113 is used as the target vector for noise vector search from the noise codebook 120. The search for the noise vector from the noise codebook 120 can be performed in exactly the same way as the search for the drive signal vector from the adaptive codebook 110 and the like. If the code vector obtained by searching from this noise vector 120 is Nopt, then the drive signal vector X of the synthesis filter is expressed as X=β·Xopt +g·Nopt. However, β and g are multipliers 111 and 1, respectively.
21 is the gain given to the drive signal vector and noise vector searched from the adaptive codebook 110 and the noise codebook 120.

【００４３】次に、駆動信号ベクトルＸを適応コードブ
ック１１０に格納する方法について説明する。図１にお
いて、加算器１３０から出力された駆動信号ベクトルは
、アップサンプリング回路１３１でサンプル間に零が挿
入された後、内挿フィルタを通過することによってアッ
プサンプリングされ、サンプリング周波数が例えば２倍
の信号系列とされる。ここで、内挿フィルタはナイキス
ト周波数にカットオフ周波数を持つローパスフィルタで
あり、ナイキストフィルタともいう。Next, a method of storing the drive signal vector X in the adaptive codebook 110 will be explained. In FIG. 1, the drive signal vector output from the adder 130 is upsampled by inserting zero between samples in an upsampling circuit 131, and then passed through an interpolation filter, so that the sampling frequency is doubled, for example. It is considered to be a signal sequence. Here, the interpolation filter is a low-pass filter having a cutoff frequency at the Nyquist frequency, and is also called a Nyquist filter.

【００４４】こうしてアップサンプリングされた駆動信
号系列は、遅延回路１３２において始点／終点決定回路
１０５により与えられる始点ａから終点ｂに渡って１サ
ンプル単位で遅延される。始点／終点決定回路１０５は
、この例ではピッチ分析回路１０４で求められたピッチ
周期Ｔｐ　のサンプルを中心として、適応コードブック
１１０のサイズがアップサンプリングによって変化しな
いように、適応コードブック１１０に格納する駆動信号
ベクトルの始点ａと終点ｂを例えば次式によって決定す
る。The drive signal sequence thus upsampled is delayed by one sample in the delay circuit 132 from the start point a given by the start point/end point determining circuit 105 to the end point b. In this example, the start point/end point determination circuit 105 stores samples of the pitch period Tp determined by the pitch analysis circuit 104 in the adaptive codebook 110 so that the size of the adaptive codebook 110 does not change due to upsampling. The starting point a and the ending point b of the drive signal vector are determined by, for example, the following equation.

【００４５】ａ＝ＤＴｐ　−Ｎｐ　／２ｂ＝ＤＴｐ　＋
Ｎｐ　／２但し、Ｎｐ　はアップサンプリングしない場合の適応コ
ードブックのサイズであり、一般的にピッチ探索は２０
サンプルから１４７サンプルまで行われることが多いこ
とから、Ｎｐ　＝１２８とする。また、Ｄはアップサン
プリングの倍率であり、本実施例ではＤ＝２である。a=DTp-Np/2b=DTp+
Np /2 However, Np is the size of the adaptive codebook without upsampling, and pitch search is generally 20
Since this is often done for samples up to 147 samples, Np is set to 128. Further, D is the upsampling magnification, and in this embodiment, D=2.

【００４６】遅延回路１３２の出力は、間引き回路１３
３で（Ｄ−１）サンプル毎に間引きされた後、適応コー
ドブック１１０の次元（４０サンプル＝１サブフレーム
）毎に切り出され、適応コードブック１１０に格納され
る。The output of the delay circuit 132 is sent to the thinning circuit 13.
After being thinned out every (D-1) sample in step 3, it is cut out every dimension of the adaptive codebook 110 (40 samples = 1 subframe) and stored in the adaptive codebook 110.

【００４７】前述したように、従来の技術による適応コ
ードブックの分解能を上げる方法では、アップサンプリ
ングによって適応コードブックのサイズがＤ倍になり、
適応コードブックの探索に要する計算量および符号量が
Ｄ倍に増大するという問題があった。サブフレーム長及
び適応コードブックの次元をＫ、入力音声信号のサンプ
リング周波数を８ｋＨｚとすると、［数５］の右辺第２
項を計算するのに必要な１秒間当りの乗算回数は、　　
［｛Ｋ（Ｋ＋１）／２｝＋２Ｋ］・Ｎｐ　・８×１０３
　／Ｋとなる。Ｋ，Ｎｐ　の値が通常用いられるＫ＝４
０，Ｎｐ　＝１２８の場合、この乗算回数は２２，５２
８，０００回となる。従って、この値がＤ倍になると計
算量は膨大になり、これを実現する回路も大規模になる
。As described above, in the conventional method of increasing the resolution of the adaptive codebook, the size of the adaptive codebook is increased by D times by upsampling.
There is a problem in that the amount of calculation and code required for searching the adaptive codebook increases by a factor of D. Assuming that the subframe length and the dimension of the adaptive codebook are K, and the sampling frequency of the input audio signal is 8kHz, the second right-hand side of [Equation 5]
The number of multiplications per second required to calculate the term is
[{K(K+1)/2}+2K]・Np・8×103
/K. The value of K, Np is usually used K=4
0, Np = 128, the number of multiplications is 22,52
8,000 times. Therefore, if this value is multiplied by D, the amount of calculation becomes enormous, and the circuit for realizing this also becomes large-scale.

【００４８】これに対し、本実施例によれば始点／終点
決定回路１０５で適応コードブック１１０に格納する駆
動信号ベクトルの始点ａと終点ｂを定めることにより、
アップサンプリング回路１３１で駆動信号をアップサン
プリングしているにも関わらず、適応コードブック１１
０のサイズを一定に保つことができる。従って、計算量
および符号量の増加を防止することができる。しかも、
アップサンプリングによる符号化品質の向上は享受でき
る。On the other hand, according to this embodiment, by determining the starting point a and the ending point b of the drive signal vector to be stored in the adaptive codebook 110 in the starting point/end point determining circuit 105,
Although the drive signal is upsampled by the upsampling circuit 131, the adaptive codebook 11
The size of 0 can be kept constant. Therefore, an increase in the amount of calculation and code can be prevented. Moreover,
Encoding quality can be improved by upsampling.

【００４９】以上の処理の過程で求められた符号化パラ
メータは、マルチプレクサ１４２で多重化され、出力端
子１４３から伝送路へ符号化出力として送出される。す
なわち、マルチプレクサ１４２ではＬＰＣ分析回路１０
２で求められたＬＰＣ予測係数の情報を符号化回路１０
３で符号化したコードと、ピッチ分析回路１０４で求め
られたフレーム単位のピッチ周期Ｔｐ　のコードと、最
小歪探索回路１１５で求められた適応コードブック１１
０のインデックスのコードと、乗算器１１１で乗じられ
るゲインの情報をゲイン符号化回路１４０で符号化した
コードと、最小歪探索回路１２５で求められた雑音コー
ドブック１２０のインデックスのコードおよび、乗算器
１２１で乗じられるゲインの情報をゲイン符号化回路１
４１で符号化したコードが多重化される。The encoding parameters obtained through the above processing are multiplexed by the multiplexer 142 and sent as encoded outputs from the output terminal 143 to the transmission path. That is, in the multiplexer 142, the LPC analysis circuit 10
The information on the LPC prediction coefficients obtained in step 2 is sent to the encoding circuit 10.
3, the code of the pitch period Tp for each frame found by the pitch analysis circuit 104, and the adaptive codebook 11 found by the minimum distortion search circuit 115.
0 index code, a code obtained by encoding the gain information multiplied by the multiplier 111 by the gain encoding circuit 140, a code of the index of the noise codebook 120 found by the minimum distortion search circuit 125, and the multiplier. Gain information multiplied by 121 is transmitted to gain encoding circuit 1.
The codes encoded in step 41 are multiplexed.

【００５０】次に、図１の音声符号化装置に対応した音
声復号化装置の構成を図２により説明する。図２におい
て、入力された符号化パラメータは、まずデマルチプレ
クサ２０１で個々のパラメータに分解された後、復号化
器２０２，２０３，２０４でそれぞれ復号化される。そ
して、復号化された適応コードブックのインデックス及
びゲイン、雑音コードブックのインデックスおよびゲイ
ンに基づいて駆動信号が作成される。この駆動信号が合
成フィルタ２１５でフィルタリングされることによって
、合成音声信号が作成される。この合成音声信号は、ポ
ストフィルタ２１６でスペクトルの整形が行われ、聴感
的な歪が抑圧された後、出力端子２１７より出力される
。Next, the configuration of a speech decoding device corresponding to the speech encoding device of FIG. 1 will be explained with reference to FIG. 2. In FIG. 2, input encoding parameters are first decomposed into individual parameters by a demultiplexer 201, and then decoded by decoders 202, 203, and 204, respectively. Then, a drive signal is created based on the index and gain of the decoded adaptive codebook and the index and gain of the noise codebook. This drive signal is filtered by a synthesis filter 215 to create a synthesized audio signal. This synthesized audio signal is subjected to spectrum shaping in a post filter 216 to suppress perceptual distortion, and then outputted from an output terminal 217.

【００５１】図２における始点／終点決定回路２０５、
アップサンプリング回路２２０、遅延回路２２１および
間引き回路２２２は、それぞれ図１の回路１０５，１３
１，１３２，１３３と同一の機能を有するので、説明を
省略する。The start point/end point determination circuit 205 in FIG.
The upsampling circuit 220, the delay circuit 221, and the thinning circuit 222 are the circuits 105 and 13 in FIG. 1, respectively.
Since it has the same function as No. 1, 132, and 133, the explanation will be omitted.

【００５２】図３に、本発明の他の実施例に係る音声符
号化装置のブロック図を示す。本実施例と先の実施例の
違いは、始点／終点決定回路１０５に用いるピッチ周期
Ｔｐ　の求め方にある。FIG. 3 shows a block diagram of a speech encoding device according to another embodiment of the present invention. The difference between this embodiment and the previous embodiment lies in the method of determining the pitch period Tp used in the start point/end point determining circuit 105.

【００５３】先の実施例においては、フレーム単位でピ
ッチ分析を行って、ピッチ周期を求めており、ピッチ周
期の情報をフレーム単位で伝送していた。これに対し、
本実施例では前フレームの適応コードブック探索によっ
て求められたサブフレーム毎のピッチ周期をメモリ１５
０に記憶し、記憶したピッチ周期に基づいてピッチ周期
推定回路１５１でフレーム単位のピッチ周期Ｔｐ　を推
定する。このＴｐ　の推定は、例えばサブフレーム毎の
ピッチ周期の平均をとることによって行えばよい。また
、サブフレーム毎のピッチ周期から外挿予測することに
よって、ピッチ周期Ｔｐ　を推定することもできる。In the previous embodiment, pitch analysis was performed on a frame-by-frame basis to determine the pitch period, and pitch period information was transmitted on a frame-by-frame basis. On the other hand,
In this embodiment, the pitch period for each subframe obtained by searching the adaptive codebook of the previous frame is stored in the memory 15.
0, and the pitch period estimating circuit 151 estimates the pitch period Tp in frame units based on the stored pitch period. This estimation of Tp may be performed, for example, by averaging the pitch period for each subframe. Furthermore, the pitch period Tp can also be estimated by extrapolating and predicting the pitch period for each subframe.

【００５４】このように前フレームでのピッチ周期に基
づいてピッチ周期Ｔｐ　を推定し、始点／終点を求める
方法は、Ｔｐ　の情報を伝送する必要がないため、伝送
ビットレートをより少なくできる効果がある。[0054] This method of estimating the pitch period Tp based on the pitch period of the previous frame and finding the start/end points has the effect of reducing the transmission bit rate because it is not necessary to transmit the information of Tp. be.

【００５５】なお、前フレームでのピッチ周期Ｔｐ　に
代えて、前フレームで適応コードブックから探索された
駆動信号ベクトルの符号を用いて始点／終点を決定して
も、同様の効果が期待できる。The same effect can be expected even if the start point/end point is determined using the sign of the drive signal vector searched from the adaptive codebook in the previous frame instead of the pitch period Tp in the previous frame.

【００５６】[0056]

【発明の効果】以上説明したように、本発明によれば適
応コードブックに格納する駆動信号の始点と終点を決定
することにより、駆動信号をアップサンプリングして符
号化品質の向上を図りながら、適応コードブックのサイ
ズが一定に保たれるので、計算量および符号量の増加を
避けることができる。従って、単一または少数のＤＳＰ
を用いて実時間処理を行うことが可能となり、低価格化
と消費電力の低減を図ることができるとともに、伝送ビ
ットレートの増加を抑えることもできる。As explained above, according to the present invention, by determining the start and end points of the drive signal to be stored in the adaptive codebook, the drive signal is upsampled to improve the encoding quality. Since the size of the adaptive codebook is kept constant, an increase in the amount of calculation and code can be avoided. Therefore, a single or small number of DSPs
It becomes possible to perform real-time processing using , making it possible to lower prices and power consumption, and also to suppress increases in transmission bit rate.

[Brief explanation of the drawing]

【図１】　　本発明の一実施例に係る音声符号化装置の
ブロック図[Fig. 1] Block diagram of a speech encoding device according to an embodiment of the present invention

【図２】　　同実施例に係る音声復号化装置のブロック
図[Figure 2] Block diagram of the audio decoding device according to the same embodiment

【図３】　　本発明の他の実施例に係る音声符号化装
置のブロック図[Fig. 3] Block diagram of a speech encoding device according to another embodiment of the present invention

【図４】　　同実施例に係る音声復号化装置のブロック
図[Figure 4] Block diagram of the audio decoding device according to the same embodiment

【図５】　　従来の音声符号化装置における駆動信号
ベクトル探索に係る構成を示すブロック図。FIG. 5 is a block diagram showing a configuration related to a drive signal vector search in a conventional audio encoding device.

[Explanation of symbols]

１００…音声信号入力端子　　　　　　　　　　　　１
０２…ＬＰＣ分析回路１０３…符号化回路　　　　　　　　　　　　　　　　
　　１０４…ピッチ分析回路１０５…始点／終点決定回路　　　　　　　　　　１０
６…重み付けフィルタ１０７…重み付け合成フィルタ　　　　　　　　１１０
…適応コードブック１１２…重み付け合成フィルタ　　　　　　　　１１４
…２乗誤差計算回路１１５…最小歪探索回路　　　　　　　　　　　　　　
１２０…雑音コードブック１２２…重み付け合成フィルタ　　　　　　　　１２４
…２乗誤差計算回路１２５…最小歪探索回路　　　　　　　　　　　　　　
１３１…アップサンプリング回路１３２…遅延回路　　　　　　　　　　　　　　　　　
　　　１３３…間引き回路１４０…ゲイン符号化回路　　　　　　　　　　　　１
４１…ゲイン符号化回路１４２…マルチプレクサ　　　　　　　　　　　　　　
１４３…出力端子１５０…メモリ　　　　　　　　　　　　　　　　　　
　　　　１５１…ピッチ周期推定回路100...Audio signal input terminal 1
02...LPC analysis circuit 103...encoding circuit
104...Pitch analysis circuit 105...Start point/end point determination circuit 10
6...Weighting filter 107...Weighting synthesis filter 110
...Adaptive codebook 112...Weighted synthesis filter 114
... Square error calculation circuit 115 ... Minimum distortion search circuit
120...Noise codebook 122...Weighting synthesis filter 124
... Square error calculation circuit 125 ... Minimum distortion search circuit
131... Upsampling circuit 132... Delay circuit
133... Thinning circuit 140... Gain encoding circuit 1
41...gain encoding circuit 142...multiplexer
143...Output terminal 150...Memory
151...Pitch period estimation circuit

Claims

[Claims]

1. A codebook storing drive signals as codewords, a search means for searching for an optimal drive signal from the codebook by referring to an input audio signal, and an optimal drive signal searched for by the search means. a synthesis filter for synthesizing audio signals using the above codebook, a sampling conversion means for upsampling the drive signal read from the codebook, a delay means for delaying the drive signal upsampled by the sampling conversion means, and the delay means. A speech code characterized by comprising means for thinning out drive signals delayed by and storing them in the codebook, and start point/end point determining means for determining the start point and end point of the drive signals to be stored in the codebook. conversion device.

2. A codebook storing drive signals as codewords, a search means for searching for an optimal drive signal from the codebook by referring to an input audio signal, and an optimal drive signal searched by the search means. a synthesis filter for synthesizing audio signals using the above codebook, a sampling conversion means for upsampling the drive signal read from the codebook, a delay means for delaying the drive signal upsampled by the sampling conversion means, and the delay means. means for thinning out the drive signal delayed by and storing it in the codebook; a pitch analysis means for pitch-analyzing the input audio signal to obtain a pitch period; and based on the pitch period obtained by the pitch analysis means, A speech encoding device comprising: a start point/end point determining means for determining a start point and an end point of a drive signal stored in the codebook.

3. A codebook in which drive signals are stored as codewords, a search means for searching for an optimal drive signal from the codebook by referring to an input audio signal input frame by frame, and a search means for searching for an optimal drive signal from the codebook. a synthesis filter that synthesizes an audio signal using the optimal drive signal,
a sampling conversion means for upsampling the drive signal read from the codebook; a delay means for delaying the drive signal upsampled by the sampling conversion means; and start point/end point determining means for determining the start point and end point of the drive signal to be stored in the codebook based on the pitch period in the previous frame found in the search process by the search means. A speech encoding device comprising: