JP3299099B2

JP3299099B2 - Audio coding device

Info

Publication number: JP3299099B2
Application number: JP33949295A
Authority: JP
Inventors: 一範小澤
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1995-12-26
Filing date: 1995-12-26
Publication date: 2002-07-08
Anticipated expiration: 2015-12-26
Also published as: JPH09179593A

Abstract

PROBLEM TO BE SOLVED: To provide a voice encoding device with less amount of search operation and memory size and less degradation of sound quality by providing a function to search at least one of code vectors while shifting its position, when a sound source quantizing part searches for a code book. SOLUTION: This device is provided with a spectral parameter calculation part 4 to obtain a spectral parameter from an input voice signal to quantize, and a sound source quantizing part 12 to search fox a code book 13 storing a sound source signal of the voice signal beforehand and quantizes to output it. And when a sound source quantizing part 12 searches for a sound source code book 13, it searches for it while shifting a sample position of at least one of the code vectors. And if a size of the entire code book is expressed by B bits and a shift amount is expressed by A bits, the size of the code book to be stored becomes not B bits but B-A bits and this can decreases a memory size necessary for a storage.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は音声符号化装置に関
し、特に音声信号を比較的少ない演算量およびメモリ量
で高品質に符号化する音声符号化装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech coding apparatus, and more particularly to a speech coding apparatus for coding a speech signal with a relatively small amount of computation and a small amount of memory in high quality.

【０００２】[0002]

【従来の技術】音声符号化装置は、音声復号化装置と対
向して使用され、音声符号化装置で符号化した音声を音
声復号化装置が復号するものである。ここで、音声信号
を高能率に符号化する方法としては、例えば、エム・シ
ュレーダー（M.Schroeder ）とビー・アタル（B.Atal）
等がアイイーイーイー・プロシーディングス(IEEE Pro
c.)ICASSP-85,1985年、９３７〜９４０頁にコード・エ
キサイテド・リニア・プリディクション：ハイ・クオリ
ティ・スピーチ・アット・ベリー・ロウ・ビット・レイ
ツ（Code-excited linear prediction: High quality s
peech at very lowbit rates ）と題して発表した論文
（文献１）や、クレイジン（Kleijn）等によるアイイー
イーイー・プロシーディングス(IEEE Proc.)ICASSP-88,
1988年、１５５〜１５８頁にインプルーブド・スピーチ
・クオリティ・アンド・エフィシェント・ベクトル・ク
オンタイゼイション・イン・エスイーエルピー(Improve
d speech quality and efficient vector quantization
in SELP) と題して発表した論文（文献２）等に記載さ
れているＣＥＬＰ(Code Excited Linear Prediction Co
ding) が知られている。この方法では、送信側では、フ
レーム毎（例えば20ms）に音声信号から線形予測（ＬＰ
Ｃ）分析を用いて、音声信号のスペクトル特性を表すス
ペクトルパラメータを抽出し、フレームをさらに複数の
サブフレーム（例えば5ms)に分割し、サブフレーム毎に
過去の音源信号をもとに適応コードブックにおけるパラ
メータ（ピッチ周期に対応する遅延パラメータとゲイン
パラメータ）を抽出し、適応コードブックにより該当の
サブフレームの音声信号をピッチ予測し、ピッチ予測し
て求めた残差信号に対して、音源量子化部では、予め定
められた種類の雑音信号からなる音源コードブック（ベ
クトル量子化コードブック）を格納しており、このコー
ドブックから最適音源コードベクトルを選択し、最適な
ゲインを計算することにより、音源信号を量子化する。
音源コードベクトルの選択の仕方は、選択した雑音信号
により合成した信号と、前述の残差信号との誤差電力を
最小化するように行う。そして選択されたコードベクト
ルの種類を表すインデックスとゲインならびに、スペク
トルパラメータと適応コードブックのパラメータとをマ
ルチプレクサ部により組み合わせて伝送する。受信側の
説明は省略する。2. Description of the Related Art A speech encoding device is used opposite to a speech decoding device, and a speech decoding device decodes speech encoded by the speech encoding device. Here, as a method of encoding a speech signal with high efficiency, for example, M. Schroeder and B. Atal
IEEE Proceedings (IEEE Pro
c.) ICASSP-85, pp. 937-940, Code Excited Linear Prediction: High Quality Speech at Very Low Bit Rates.
Peech at very lowbit rates) (1), and Claizin (Kleijn) et al., IEEE Proc. ICASSP-88,
Improve Speech Quality and Efficient Vector Quantization in SLP, 1988, pp. 155-158.
d speech quality and efficient vector quantization
in CELP (Code Excited Linear Prediction Co.)
ding) is known. In this method, the transmitting side performs linear prediction (LP) from a speech signal every frame (for example, 20 ms).
C) Using analysis, extract a spectral parameter representing a spectral characteristic of the audio signal, further divide the frame into a plurality of subframes (for example, 5 ms), and apply an adaptive codebook to each subframe based on a past sound source signal. (Delay parameter and gain parameter corresponding to the pitch period) are extracted, the speech signal of the corresponding subframe is pitch-predicted by the adaptive codebook, and the residual signal obtained by pitch prediction is subjected to sound source quantization. The unit stores an excitation codebook (vector quantization codebook) composed of a predetermined type of noise signal, selects an optimal excitation codevector from this codebook, and calculates an optimal gain. Quantize the sound source signal.
The excitation code vector is selected so as to minimize the error power between the signal synthesized from the selected noise signal and the above-mentioned residual signal. Then, an index and a gain indicating the type of the selected code vector, a spectrum parameter and a parameter of the adaptive codebook are combined and transmitted by the multiplexer unit. Description on the receiving side is omitted.

【０００３】[0003]

【発明が解決しようとする課題】上述した従来の音声符
号化装置は、良好な音質を得るためには、ビットレート
が８kb/s以上必要であった。これは音源コードブックの
ビット数としては、例えば５msサブフレーム当たり１０
ビット以上の大規模なコードブックを必要としていた。
このため、音源コードブックの探索や、格納に、多くの
演算量や、多くのメモリ量を必要とするといえ問題点が
あった。例えば、５msサブフレームで１０ビットのコー
ドブックを考えると、最も単純な２乗距離で探索して
も、１秒当たり1024x40x200=8,192,000回の乗算回数を
必要とし、また、メモリ量は1024x40=40,240ワードを必
要とした。一方、演算量やメモリ量を下げるために、ビ
ット数を低減化すると、音質が劣化することになるとい
う問題点も発生することになった。The above-mentioned conventional speech coding apparatus requires a bit rate of 8 kb / s or more in order to obtain good sound quality. This means that the number of bits in the sound source codebook is, for example, 10 per 5 ms subframe.
A bit more codebook was needed.
For this reason, there is a problem that a large amount of calculation and a large amount of memory are required for searching and storing the sound source codebook. For example, considering a 10-bit codebook with a 5 ms subframe, the search with the simplest square distance requires 1024x40x200 = 8,192,000 multiplications per second, and the memory amount is 1024x40 = 40,240 words. Needed. On the other hand, when the number of bits is reduced in order to reduce the amount of calculation and the amount of memory, there is a problem that sound quality is deteriorated.

【０００４】上述した従来の音声符号化装置で、良好な
符号化音質を得るためにビット数の大きなコードブック
が必要な理由としては、信号の位相関係により、音源信
号波形はサブフレーム内で色々な位相をとりうる点にあ
る。従って、これら異なる位相を音源コードベクトルの
パターンとして表現するためには、ある程度以上大規模
なコードブックを必要とした。In the above-mentioned conventional speech coding apparatus, a codebook having a large number of bits is required in order to obtain good coded sound quality. The point is that it can take a great phase. Therefore, in order to express these different phases as a pattern of a sound source code vector, a codebook of a certain scale or more is required.

【０００５】本発明の目的は、上述の問題を解決し、従
来方式よりも一層少ない探索演算量とメモリ量とで音質
の劣化の少ない音声符号化装置を提供することにある。SUMMARY OF THE INVENTION It is an object of the present invention to solve the above-mentioned problems and to provide a speech coding apparatus with less deterioration in sound quality with a smaller amount of search operation and less memory than conventional methods.

【０００６】[0006]

【課題を解決するための手段】本発明の音声符号化装置
は、入力した音声信号からスペクトルパラメータを求め
て量子化するスペクトルパラメータ計算部と、前記スペ
クトルパラメータを用いて前記音声信号の音源信号を予
め格納してあるコードブックを探索し量子化して出力す
る音源量子化部とを有する音声符号化装置において、前
記音源量子化部が前記コードブックを探索するときにこ
のコードブックに格納してあるコードベクトルの中の少
なくとも一つについて位置をシフトさせながら探索する
機能を有する構成である。According to the present invention, there is provided a speech coding apparatus comprising: a spectrum parameter calculating section for obtaining a spectrum parameter from an input speech signal and quantizing the spectrum parameter; and using the spectrum parameter to generate a sound source signal of the speech signal. In a speech encoding apparatus having a sound source quantization unit for searching, quantizing and outputting a codebook stored in advance, the sound source quantization unit stores the codebook when searching for the codebook. This is a configuration having a function of searching for at least one of the code vectors while shifting the position.

【０００７】本発明の音声符号化装置は、入力した音声
信号からスペクトルパラメータを求めて量子化するスペ
クトルパラメータ計算部と、前記スペクトルパラメータ
を用いて前記音声信号の音源信号を予め格納されたコー
ドブックを探索して量子化して出力する音源量子化部と
を有する音声符号化装置において、前記音声信号をフレ
ーム単位に聴感重み付けを行った聴感重み付け信号から
モードを判別しモード情報を出力するモード判別部と、
前記音源量子化部が前記コードブックを探索するときに
予め定められたモードではコードブックに格納されたコ
ードベクトルの少なくとも一つについて位置をシフトさ
せながら探索する機能を有する構成である。A speech coding apparatus according to the present invention comprises a spectrum parameter calculation unit for obtaining and quantizing a spectrum parameter from an input speech signal, and a codebook in which a sound source signal of the speech signal is stored in advance using the spectrum parameter. A sound source quantization unit that searches for, quantizes, and outputs an audio signal. A mode discrimination unit that discriminates a mode from an audibility weighted signal obtained by performing audibility weighting on the audio signal in frame units and outputs mode information. When,
In a predetermined mode when the sound source quantization unit searches the codebook, the sound source quantization unit has a function of searching for at least one of the code vectors stored in the codebook while shifting the position.

【０００８】本発明の音声符号化装置は、入力した音声
信号からスペクトルパラメータを求めて量子化するスペ
クトルパラメータ計算部と、前記スペクトルパラメータ
を用いて前記音声信号の音源信号を予め格納されたコー
ドブックを探索して量子化して出力する音源量子化部と
を有する音声符号化装置において、前記音声信号をフレ
ーム単位に聴感重み付けを行った聴感重み付け信号から
モードを判別しモード情報を出力するモード判別部と、
前記音源量子化部が前記コードブックを探索するときに
前記コードブックに格納されたコードベクトルの少なく
とも一つについて位置をシフトさせる量を前記モード情
報に応じて変化させながら探索する機能を有する構成で
ある。A speech coding apparatus according to the present invention comprises a spectrum parameter calculating section for obtaining and quantizing a spectrum parameter from an input speech signal, and a codebook in which a sound source signal of the speech signal is stored in advance using the spectrum parameter. A sound source quantization unit that searches for, quantizes, and outputs an audio signal. A mode discrimination unit that discriminates a mode from an audibility weighted signal obtained by performing audibility weighting on the audio signal in frame units and outputs mode information. When,
When the sound source quantization unit searches the codebook, the sound source quantization unit has a function of performing a search while changing an amount to shift a position of at least one of the code vectors stored in the codebook according to the mode information. is there.

【０００９】本発明の音声符号化装置は、入力した音声
信号からスペクトルパラメータを求めて量子化するスペ
クトルパラメータ計算部と、前記スペクトルパラメータ
を用いて前記音声信号の音源信号を予め格納してあるコ
ードブックを探索し量子化して出力する音源量子化部と
を有する音声符号化装置において、前記音源量子化部が
前記コードブックを探索するときにこのコードブックに
格納された各コードベクトルごとに定める値に従って位
置をシフトさせる量を変化させながら探索する機能を有
する構成である。A speech coding apparatus according to the present invention comprises a spectrum parameter calculation unit for obtaining and quantizing a spectrum parameter from an input speech signal, and a code which previously stores a sound source signal of the speech signal using the spectrum parameter. A sound source quantization unit for searching, quantizing, and outputting a book, wherein a value determined for each code vector stored in the code book when the excitation quantization unit searches the code book. And a function of searching while changing the amount by which the position is shifted in accordance with

【００１０】［作用］第１の発明では、音源量子化部に
おいて音源コードブックを探索するときに、少なくとも
一つのコードベクトルのサンプル位置をシフトさせなが
ら探索する。簡単のために、すべてのコードベクトルを
シフトさせながら探索するものとし、コードブック全体
のサイズをＢビット、このうちシフト量をＡビットで表
すとすれば、格納すべきコードブックのサイズはＢビッ
トではなく、Ｂ―Ａビットとなり、格納に必要なメモリ
を低減化できる。従って、一部のコードベクトルをシフ
トさせる場合にも、メモリ量を従来の方法よりも低減化
できることは明らかである。[Operation] In the first invention, when the excitation codebook is searched in the excitation quantization unit, the search is performed while shifting the sample position of at least one code vector. For the sake of simplicity, it is assumed that the search is performed while shifting all the code vectors. If the entire codebook size is represented by B bits and the shift amount is represented by A bits, the codebook size to be stored is B bits. Instead, the number of bits becomes BA, and the memory required for storage can be reduced. Therefore, it is clear that even when shifting some code vectors, the memory amount can be reduced as compared with the conventional method.

【００１１】次に音源コードブックの探索法について説
明する。探索には例えば自己相関近似法を使用するもの
とする。この方法では下式の右辺第２項を最大化するよ
うな音源コードベクトルc_k(n) を選択する。Next, a method of searching for a sound source codebook will be described. For the search, for example, an autocorrelation approximation method is used. In this method, a sound source code vector c _k (n) that maximizes the second term on the right side of the following equation is selected.

【００１２】 [0012]

【００１３】ここでWhere

【００１４】 [0014]

【００１５】である。さらに## EQU1 ## further

【００１６】 [0016]

【００１７】である。この方法の詳細は、アイ・トラン
コス(I.Trancoso)等によるアイイーイーイー・プロシー
ディングス(IEEE Proc.)ICASSP-86,1986年、２３７５〜
２３７８頁にイフィセント・プロシジャー・フォー・フ
ァインデング・ジ・アプティマム・イノベイション・イ
ン・スタカスティック・コーダーズ (Efficient proced
ures for finding the optimum innovation in stochas
tic coders) と題した論文（文献３）等を参照できるの
で、説明は省略する。## EQU1 ## Details of this method are described in I. Trancoso et al., IEEE Proc. ICASSP-86, 1986, 2375
Efficient Procedure for Finding the Uptime Innovation in Statistical Coders on page 2378
ures for finding the optimum innovation in stochas
The description is omitted because a paper (Reference 3) entitled "tic coders" can be referred to.

【００１８】ここで、位相シフトしたコードベクトル成
分については、分母の値P_kは同一であるので計算は不要
である。従って、分母の計算に必要な演算量はシフトの
ビット数Ａだけ低減化される。Here, for the phase-shifted code vector component, no calculation is necessary because the denominator value _Pk is the same. Therefore, the amount of calculation required for calculating the denominator is reduced by the number A of bits of the shift.

【００１９】第２の発明では、あらかじめ定められた時
間区間（以下フレームと呼ぶ）の入力音声から特徴量を
求め、フレームの音声を、複数種類のモードのうちの一
つに分類する。以下では、モードの種類は４種類とし、
これはモード情報として２ビットで表して伝送するもの
とする。予め定められたモードの場合に、音源量子化部
において、音源コードブックを探索するときに、少なく
とも一つのコードベクトルのサンプル位置をシフトさせ
ながら探索する。コードベクトルをシフトさせながら探
索する方法を第１の発明と同一である。In the second invention, a feature amount is obtained from input speech in a predetermined time section (hereinafter referred to as a frame), and the speech of the frame is classified into one of a plurality of types of modes. In the following, there are four modes,
This is represented by two bits as mode information and transmitted. In the case of the predetermined mode, the excitation quantization section searches while shifting the sample position of at least one code vector when searching the excitation codebook. The method of searching while shifting the code vector is the same as in the first invention.

【００２０】第３の発明では、第２の発明において、モ
ードごとにサンプル位置のシフト量Ａを変化させること
を特徴とする。例えば、Ａは、モード０では０ビット、
モード１では５ビット、モード２では４ビット、モード
３では３ビットという値をとる。According to a third aspect, in the second aspect, the shift amount A of the sample position is changed for each mode. For example, A is 0 bits in mode 0,
The value takes 5 bits in mode 1, 4 bits in mode 2, and 3 bits in mode 3.

【００２１】第４の発明では、第１の発明の音源量子化
部において、音源コードブックを探索するときに、コー
ドベクトルに応じてサンプル位置のシフト量を変化させ
ながら探索する。ただし、コードブック全体のシフト量
の合計は一定値、例えばＡビットとする。According to a fourth aspect of the present invention, in the excitation quantization section of the first invention, when searching the excitation codebook, the search is performed while changing the shift amount of the sample position according to the code vector. However, the total shift amount of the entire codebook is a fixed value, for example, A bits.

【００２２】[0022]

【発明の実施の形態】次に、本発明の実施の形態につい
て図面を参照して説明する。Next, embodiments of the present invention will be described with reference to the drawings.

【００２３】図１は本発明の第１の実施の形態を示すブ
ロック図である。FIG. 1 is a block diagram showing a first embodiment of the present invention.

【００２４】本発明の第１の実施の形態の音声符号化装
置１は、入力した音声信号を予め定める時間長のフレー
ムに分割するフレーム分割回路２と、フレームの音声信
号をフレームよりも短い時間長のサブフレームに分割す
るサブフレーム分割回路３と、フレーム分割回路２の出
力する一連のフレームの音声信号を受信し少なくとも１
つのサブフレームの音声信号に対してサブフレームの時
間長よりも長い窓をかけて音声信号を切り出してスペク
トルパラメータを予め定められた次数まで計算するスペ
クトルパラメータ計算回路４と、線スペクトル対パラメ
ータコードブック（以下ＬＳＰコードブックと記す）６
を用いてスペクトルパラメータ計算回路４の計算した予
め定めるサブフレームで量子化したＬＳＰパラメータを
ベクトル量子化するスペクトルパラメータ量子化回路５
と、スペクトルパラメータ計算回路４の計算した複数の
サブフレームの線形予測係数を受け各サブフレームの音
声信号に対して聴感重み付けを行い聴感重み付け信号を
出力する聴感重み付け回路７と、スペクトルパラメータ
計算回路４の計算した複数のサブフレームの線形予測係
数とスペクトルパラメータ量子化回路５が復元した線形
予測係数とを、サブフレームごとに入力し、応答信号を
１サブフレーム分計算し減算器８に出力する応答信号計
算回路９と、スペクトルパラメータ量子化回路５が復元
した線形予測係数を受け、聴感重み付けフィルタのイン
パルス応答を予め定める点数計算するインパルス応答計
算回路１０と、出力側から帰還する過去の音源信号と減
算器８の出力信号と聴感重み付けフィルタのインパルス
応答とを入力しピッチに対応する遅延を求め遅延を表す
インテックスを出力する適応コードブック回路１１と、
音源コードブック１３を用いて音源信号を量子化する音
源量子化回路１２と、ゲインコードブック１５からゲイ
ンコードベクトルを読みだし最適なゲインコードベクト
ルを選択し、この選択したゲインコードベクトルを表す
インデックスをマルチプレクサ１６に出力するゲイン量
子化回路１４と、ゲイン量子化回路１４の出力を入力し
インデックスからこれに対応するコードベクトルを読み
だし駆動音源信号を求める重み付け信号計算回路１７と
からなる。A speech encoding apparatus 1 according to a first embodiment of the present invention comprises a frame dividing circuit 2 for dividing an inputted speech signal into frames of a predetermined time length, and a speech signal of a frame having a shorter time than the frame. A sub-frame division circuit 3 for dividing into a long sub-frame, and receiving at least one audio signal of a series of frames output from the frame division circuit 2
A spectrum parameter calculation circuit 4 that cuts out the audio signal by applying a window longer than the subframe time length to the audio signal of one subframe and calculates spectral parameters to a predetermined order, and a line spectrum versus parameter codebook (Hereinafter referred to as LSP codebook) 6
A parameter quantization circuit 5 for vector-quantizing an LSP parameter quantized in a predetermined subframe calculated by the spectrum parameter calculation circuit 4 using
A perceptual weighting circuit 7 that receives the linear prediction coefficients of a plurality of subframes calculated by the spectrum parameter calculation circuit 4 and weights the perceptual weight of the audio signal of each subframe and outputs a perceptual weighting signal; Are input for each sub-frame, the linear prediction coefficients restored by the spectral parameter quantization circuit 5 are calculated for each sub-frame, the response signal is calculated for one sub-frame, and output to the subtractor 8 A signal calculation circuit 9, an impulse response calculation circuit 10 that receives the linear prediction coefficient restored by the spectrum parameter quantization circuit 5 and calculates a predetermined score of an impulse response of the perceptual weighting filter, and a past sound source signal that returns from the output side. The output signal of the subtractor 8 and the impulse response of the perceptual weighting filter are input and An adaptive codebook circuit 11 for outputting a Intex representing a delay determined delay corresponding to the switch,
A sound source quantization circuit 12 that quantizes a sound source signal using a sound source code book 13 and a gain code vector are read from a gain code book 15 to select an optimal gain code vector, and an index representing the selected gain code vector is set as an index. It comprises a gain quantization circuit 14 that outputs to the multiplexer 16, and a weighting signal calculation circuit 17 that receives the output of the gain quantization circuit 14 and reads a code vector corresponding to the output from the index to obtain a drive excitation signal.

【００２５】次に本装置の動作について説明する。Next, the operation of the present apparatus will be described.

【００２６】まず、入力端子から音声信号を入力し、フ
レーム分割回路２では音声信号をフレーム（例えば 10m
s ）ごとに分割し、サブフレーム分割回路３では、フレ
ームの音声信号をフレームよりも短いサブフレーム（例
えば 2.5ms）に分割する。スペクトルパラメータ計算回
路４では、少なくとも一つのサブフレームの音声信号に
対して、サブフレーム長よりも長い窓（例えば 24ms ）
をかけて音声を切り出してスペクトルパラメータをあら
かじめ定められた次数（例えば P=10 次）計算する。こ
こでスペクトルパラメータの計算には、周知のＬＰＣ分
析や、バーグ(Burg)分析等を用いることができる。ここ
では、バーグ(Burg)分析を用いることとする。バーグ(B
urg)分析の詳細については、中溝著による”信号解析と
システム同定”と題した単行本（コロナ社1988年刊）の
82〜87頁（文献４）等に記載されているので説明は省略
する。First, an audio signal is input from an input terminal, and the frame dividing circuit 2 converts the audio signal into a frame (for example, 10 m).
s), and the subframe division circuit 3 divides the audio signal of the frame into subframes (for example, 2.5 ms) shorter than the frame. In the spectrum parameter calculation circuit 4, a window (for example, 24 ms) longer than the subframe length is applied to the audio signal of at least one subframe.
, The speech is cut out, and the spectral parameters are calculated in a predetermined order (for example, P = 10th order). Here, a well-known LPC analysis, a Burg analysis, or the like can be used for calculating the spectrum parameters. Here, Burg analysis is used. Berg (B
urg) For details of the analysis, see the book “Corona Analysis and System Identification” written by Nakamizo (Corona Publishing Co., 1988).
The description is omitted since it is described on pages 82 to 87 (Document 4) and the like.

【００２７】さらにスペクトルパラメータ計算回路４で
は、バーグ(Burg)法により計算された線形予測係数α
_i(i=1,…,10)量子化や補間に適したＬＳＰパラメータに
変換する。ここで、線形予測係数からＬＳＰへの変換
は、菅村他による”線スペクトル対（ＬＳＰ）音声分析
合成方式による音声情報圧縮”と題した論文（電子通信
学会論文誌、J64-A、pp.599-606、1981年）（文献５）
を参照することができる。例えば、第２，４サブフレー
ムでバーグ(Burg)法により求めた線形予測係数を、ＬＳ
Ｐパラメータに変換し、第１，３サブフレームのＬＳＰ
を直線補間により求めて、第１，３サブフレームのＬＳ
Ｐを逆変換して線形予測係数に戻し、第１〜４サブフレ
ームの線形予測係数α_ili=1,…,10,l=1,…,5) を聴感重
み付け回路７に出力する。また、第４サブレームのＬＳ
Ｐをスペクトルパラメータ量子化回路５に出力する。Further, in the spectrum parameter calculation circuit 4, the linear prediction coefficient α calculated by the Burg method
_i (i = 1,..., 10) is converted to LSP parameters suitable for quantization and interpolation. Here, the conversion from the linear prediction coefficient to the LSP is performed by Sugamura et al. In a paper entitled "Speech Information Compression by Line Spectrum Pair (LSP) Speech Analysis / Synthesis Method" (Transactions of the Institute of Electronics, Information and Communication Engineers, J64-A, pp.599). -606, 1981) (Reference 5)
Can be referred to. For example, the linear prediction coefficient obtained by the Burg method in the second and fourth subframes is represented by LS
Convert to P parameter, LSP of 1st and 3rd subframe
Is obtained by linear interpolation, and the LS of the first and third sub-frames is calculated.
P is inversely transformed back to the linear prediction coefficient, and the linear prediction coefficients α _il i = 1,..., 10, l = 1,. Also, the LS of the fourth sub-frame
P is output to the spectrum parameter quantization circuit 5.

【００２８】スペクルパラメータ量子化回路５では、Ｌ
ＳＰレコードブック６を用いてあらかじめ定められたサ
ブフレームのＬＳＰパラメータを効率的にベクトル量子
化し、下式の歪みを最小化する量子化値を出力する。In the speckle parameter quantization circuit 5, L
The LSP parameter of a predetermined subframe is efficiently vector-quantized using the SP record book 6, and a quantized value that minimizes the distortion of the following equation is output.

【００２９】 [0029]

【００３０】ここで、LSP(i), QLSP(i)_j，W(i)はそれぞ
れ、量子化前のｉ次目のＬＳＰ，ＬＳＰコードブック６
のｊ番目のコードベクトル、重み係数である。Here, LSP (i), QLSP (i) _j and W (i) are the i-th LSP and LSP codebook 6 before quantization, respectively.
Are the j-th code vector and the weight coefficient.

【００３１】以下では、第４サブフレームのＬＳＰパラ
メータを量子化するものとする。ＬＳＰパラメータのベ
クトル量子化の手法は周知の手法を用いることができ
る。具体的な方法は例えば、特開平４―１７１５００号
公報（文献６）あるいは特開平４―３６３０００号公報
（文献７）や、特開平５―６１９９号公報（文献８）
や、ティー・ノムラ(T.Nomura)等によるアイイーイーイ
ー・プロシーディングス．モバイル・マルチメディア・
コミュニケーションズ(IEEE Proc．Mobile Multimedia
Communications.)1993年、Ｂ．２．５頁にエルエスピー
・コーディング・ユージング・ブイキュー−エスブイキ
ュー・ウイズ・インターポウレーション・イン・４．０
７５・ケービーピーエス・エム−エルシーイーエルピー
・スピーチ・コーダー (LSP Coding Using VQ-SVQ Wit
h Interpolation in 4.075 kbps M-LCELP Speech Code
r) と題した論文（文献９）等を参照できるのでここで
は説明は略する。In the following, it is assumed that the LSP parameter of the fourth sub-frame is quantized. A well-known method can be used for the method of vector quantization of LSP parameters. Specific methods are described in, for example, JP-A-4-171500 (Reference 6), JP-A-4-363000 (Reference 7), and JP-A-5-6199 (Reference 8).
And IEE Proceedings by T. Nomura and others. Mobile multimedia
Communications (IEEE Proc. Mobile Multimedia
Communications.) 1993; On page 2.5, LSP Coding Using Viku-SV-Q-with with Interpolation in 4.0
75KPS M-LSP Coding Using VQ-SVQ Wit
h Interpolation in 4.075 kbps M-LCELP Speech Code
r), the description of which is omitted here.

【００３２】また、スペクトルパラメータ量子化回路５
では、第４サブフレームで量子化したＬＳＰパラメータ
をもとに、第１〜第４サブフレームのＬＳＰパラメータ
を復元する。ここでは、現フレームの第４サブフレーム
の量子化ＬＳＰパラメータと１つ過去のフレームの第４
サブフレームの量子化ＬＳＰを直線補間して、第１〜第
３サブフレームのＬＳＰを復元する。ここで、量子化前
のＬＳＰと量子化後のＬＳＰとの誤差電力を最小化する
コードベクトルを１種類選択した後に、直線補間により
第１〜第４サブフレームのＬＳＰを復元できる。さらに
性能を向上させるためには、誤差電力を最小化するコー
ドベクトルを複数候補選択したのちに、各々の候補につ
いて、累積歪を評価し、累積歪を最小化する候補と補間
ＬＳＰの組を選択するようにすることができる。詳細
は、例えば、特願平５―８７３７号明細書（文献１０）
を参照することができる。The spectrum parameter quantization circuit 5
Then, the LSP parameters of the first to fourth subframes are restored based on the LSP parameters quantized in the fourth subframe. Here, the quantization LSP parameter of the fourth sub-frame of the current frame and the fourth
The LSPs of the first to third subframes are restored by linearly interpolating the quantized LSPs of the subframe. Here, after selecting one type of code vector that minimizes the error power between the LSP before quantization and the LSP after quantization, the LSPs of the first to fourth subframes can be restored by linear interpolation. In order to further improve performance, after selecting a plurality of candidates for the code vector that minimizes the error power, evaluate the cumulative distortion for each candidate, and select a combination of the candidate and the interpolation LSP that minimizes the cumulative distortion. You can make it. For details, see, for example, Japanese Patent Application No. 5-8737 (Reference 10).
Can be referred to.

【００３３】以上により復元した第１ー３サブフレーム
のＬＳＰと第４サブフレームの量子化ＬＳＰをサブフレ
ームごとに線形予測係数α'_il(i=1,…,10, l=,…,5) に
変換し、インパルス応答計算回路１０に出力する。ま
た、第４サブフレームの量子化ＬＳＰのコードベクトル
を表すインデクスをマルチプレクサ１６に出力する。聴
感重み付け回路７は、スペクトルパラメータ計算回路４
から、各サブフレームごとに量子化前の線形予測係数α
_il (i=1,…,10, l=,…,5) を入力し、文献１にもとづ
き、サブフレームの音声信号に対して聴感重み付けを行
い、聴感重み付け信号を出力する。The LSPs of the first to third sub-frames and the quantized LSPs of the fourth sub-frame, which have been reconstructed as described above, are converted into linear prediction coefficients α ′ _il (i = 1,..., 10, l =,. ) And outputs it to the impulse response calculation circuit 10. Further, an index representing the code vector of the quantized LSP of the fourth subframe is output to the multiplexer 16. The audibility weighting circuit 7 includes a spectrum parameter calculation circuit 4
From the linear prediction coefficient α before quantization for each subframe.
_il (i = 1,..., 10, l =,..., 5) is input, and the perceptual weighting is performed on the audio signal of the subframe based on Document 1 to output a perceptual weighting signal.

【００３４】応答信号計算回路９は、スペクトルパラメ
ータ計算回路４から、各サブフレームごとに線形予測係
数α_ilを入力し、スペクトルパラメータ量子化回路５か
ら、量子化、補間して復元した線形予測係数α'_il をサ
ブフレームごとに入力し、保存されているフィルタメモ
リの値を用いて、入力信号を零d(n)=0とした応答信号を
１サブフレーム分計算し、減算器８に出力する。ここ
で、応答信号x_z(n) は下式で表される。The response signal calculation circuit 9 receives the linear prediction coefficient α _il for each subframe from the spectrum parameter calculation circuit 4, and quantizes, interpolates and restores the linear prediction coefficient α _il from the spectrum parameter quantization circuit 5. α ′ _il is input for each sub-frame, a response signal with the input signal set to zero d (n) = 0 is calculated for one sub-frame using the stored value of the filter memory, and output to the subtractor 8 I do. Here, the response signal x _z (n) is represented by the following equation.

【００３５】 [0035]

【００３６】但し、n-i ≦ 0のときは y(n-i)=p(N+(n-i)) (8) x_z(n-i)=s_w(N+(n-i)) (9) ここでＮはサブフレーム長を示す。γは、聴感重み付け
量を制御する重み係数であり、下記の式(11)と同一の値
である。s_w(n) ，p(n)は、それぞれ、重み付け信号計算
回路１７の出力信号、後述の式(11)における右辺第１項
のフィルタの分母の項の出力信号をそれぞれ示す。However, when ni ≦ 0, y (ni) = p (N + (ni)) (8) x _z (ni) = s _w (N + (ni)) (9) where N is the subframe length Is shown. γ is a weight coefficient for controlling the perceptual weighting amount, and is the same value as the following equation (11). s _w (n) and p (n) represent the output signal of the weighting signal calculation circuit 17 and the output signal of the denominator term of the first term filter on the right-hand side in Expression (11) described later, respectively.

【００３７】減算器８は、下式により、聴感重み付け信
号から応答信号を１サブフレーム分減算し、x'_w(n)を適
応コードブック回路１１に出力する。 x'_w(n)=x_w(n)-x_z(n) (10) インパルス応答計算回路１０は、z 変換が下式で表され
る聴感重み付けフィルタのインパルス応答 h_w(n)をあら
かじめ定められた点数Ｌだけ計算し、適応コードブック
回路１１と音源量子化回路１２とゲイン量子化回路１４
とに出力する。The subtractor 8 subtracts the response signal by one subframe from the perceptual weighting signal according to the following equation, and outputs x ′ _w (n) to the adaptive codebook circuit 11. x ′ _w (n) = x _w (n) −x _z (n) (10) The impulse response calculation circuit 10 calculates in advance the impulse response h _w (n) of the auditory weighting filter whose z-transformation is expressed by the following equation. Calculation is performed for a predetermined number L, and the adaptive codebook circuit 11, the sound source quantization circuit 12, and the gain quantization circuit 14
And output to

【００３８】 [0038]

【００３９】適応コードブック回路１１では、ゲイン量
子化回路１４からは過去の音源信号v(n)を、減算器８か
らは出力信号x'_w(n)を、インパルス応答計算回路１０か
らは聴感重み付けインパルス応答 h_w(n)を入力する。ピ
ッチに対応する遅延Ｔを下式の歪みを最小化するように
求め、遅延を表すインデクスをマルチプレクサ１６に出
力する。In the adaptive code book circuit 11, the past sound source signal v (n) is output from the gain quantization circuit 14, the output signal x ′ _w (n) is output from the subtractor 8, and the audibility is output from the impulse response calculation circuit 10. Enter the weighted impulse response h _w (n). The delay T corresponding to the pitch is determined so as to minimize the distortion of the following expression, and an index representing the delay is output to the multiplexer 16.

【００４０】 [0040]

【００４１】ここで、 y_w(n−T)＝v(n −T)＊h_w(n) (13) であり、記号＊は畳み込み演算を表す。ゲインβを下式
に従い求める。Here, y _w (n−T) = v (n−T) * h _w (n) (13), and the symbol * represents a convolution operation. The gain β is obtained according to the following equation.

【００４２】 [0042]

【００４３】ここで、女性音や、子供の声に対して、遅
延の抽出精度を向上させるために、遅延を整数サンプル
ではなく、小数サンプル値で求めてもよい。具体的な方
法は、例えば、ピー・クルーン(P.Kroon) 等によるアイ
イーイーイー・プロシーディングス(IEEE Proc.)ICASSP
-90,1990年、６６１〜６６４頁にピッチ・プリディクタ
ーズ・ウイズ・ハイ・テンポラル・ソリューション(Pit
ch predictors with high temporal resolution)と題し
て発表した論文（文献１１）等を参照することができ
る。Here, in order to improve the accuracy of extracting delays for female sounds and children's voices, the delays may be determined by decimal sample values instead of integer samples. A concrete method is, for example, IACSSP by P. Kroon et al. (IEEE Proc.)
-90, 1990, Pitch Predictors with High Temporal Solution (Pit
For example, a paper (Reference 11) published under the title of "Ch predictors with high temporal resolution" can be referred to.

【００４４】さらに、適応コードブック回路１１では下
式に従いピッチ予測を行い、予測残差信号e_w(n) を音源
量子化回路１２に出力する。 e_w(n) ＝x'_w(n)- βv(n-T)*h_w(n) (15) 音源量子化回路１２では、作用で述べたように、音源コ
ードブックの探索に特徴がある。Further, the adaptive codebook circuit 11 performs pitch prediction according to the following equation, and outputs a prediction residual signal e _w (n) to the sound source quantization circuit 12. e _w (n) = x ′ _w (n) −βv (nT) * h _w (n) (15) The sound source quantization circuit 12 has a feature in the search of the sound source codebook as described in the operation.

【００４５】図２は図１内の音源量子化回路の構成を示
すブロック図である。FIG. 2 is a block diagram showing the configuration of the sound source quantization circuit in FIG.

【００４６】以下の説明では、音源コードブック全体の
伝送すべきインデクスをＢビット、シフト量をＡビット
とする。In the following description, it is assumed that the index to be transmitted of the entire sound source codebook is B bits and the shift amount is A bits.

【００４７】音源量子化回路１２の逆フィルタリング回
路１８は、適応コードブック予測残差信号e_w(n) および
聴感重み付けインパルス応答h_w(n) を入力し、下式の計
算を行う。The inverse filtering circuit 18 of the sound source quantization circuit 12 receives the adaptive codebook prediction residual signal e _w (n) and the perceptual weighting impulse response h _w (n) and calculates the following equation.

【００４８】 [0048]

【００４９】音源コードブック１３は、（Ｂ−Ａ）ビッ
トのサイズである。自己相関計算回路２０は、音源コー
ドベクトルc_k(n) を音源コードブック１３から読み出
し、下式を用いて自己相関を計算する。この値は、同一
のコードベクトルに対して位置をシフトしたコードベク
トルについても共通に使用する。The sound source code book 13 has a size of (BA) bits. The autocorrelation calculation circuit 20 reads out the sound source code vector c _k (n) from the sound source codebook 13 and calculates the autocorrelation using the following equation. This value is commonly used for a code vector whose position is shifted with respect to the same code vector.

【００５０】 [0050]

【００５１】また、下式により、聴感重み付けインパル
ス応答の自己相関も計算する。Also, the autocorrelation of the auditory weighting impulse response is calculated by the following equation.

【００５２】 [0052]

【００５３】位置シフト回路１９は、下式により、音源
コードベクトルc_k(n) の位置を順番にシフトする。 c_kl(n) = c_k(n + l), l=0,...,2^A-1 (19) ここで、Ａはシフト量を表すためのビット数を示す。相
互相関計算回路２１は、下式に従い相互相関を計算す
る。The position shift circuit 19 sequentially shifts the position of the excitation code vector c _k (n) by the following equation. c _kl (n) = c _k (n + 1), l = 0,..., 2 ^A −1 (19) Here, A represents the number of bits for representing the shift amount. The cross-correlation calculation circuit 21 calculates a cross-correlation according to the following equation.

【００５４】 [0054]

【００５５】２乗計算回路２２は、相互相関ＣＣの２乗
を計算する。割算回路２３は、ＣＣ²と式 (2)のP_kとの
割算結果を最大値判別回路２４に出力する。最大値判別
回路２４は、割算結果の最大を判別し、そのときの音源
コードブック１３のインデクスとシフト量を加味した合
計のインデクスをゲイン量子化回路１４に出力する。The square calculation circuit 22 calculates the square of the cross-correlation CC. The division circuit 23 outputs the result of the division between CC ² and P _{k in} Equation (2) to the maximum value discrimination circuit 24. The maximum value discriminating circuit 24 discriminates the maximum of the division result, and outputs to the gain quantization circuit 14 a total index in which the index of the sound source codebook 13 and the shift amount are added at that time.

【００５６】ゲイン量子化回路１４は、ゲインコードブ
ック１５からゲインコードベクトルを読みだし、選択さ
れた音源コードベクトルに対して、下式を最小化するよ
うにゲインコードベクトルを選択する。ここでは、適応
コードブックのゲインと音源のゲインの両者を同時にベ
クトル量子化する例について示す。The gain quantization circuit 14 reads a gain code vector from the gain code book 15 and selects a gain code vector for the selected excitation code vector so as to minimize the following equation. Here, an example in which both the gain of the adaptive codebook and the gain of the sound source are simultaneously vector-quantized will be described.

【００５７】 [0057]

【００５８】ここで、β'_k，G'_k は、ゲインコードブッ
ク１５に格納された２次元ゲインコードブックにおける
ｋ番目のコードベクトルである。選択されたゲインコー
ドベクトルを表すインデクスをマルチプレクサ１６に出
力する。Here, β ′ _k and G ′ _k are k-th code vectors in the two-dimensional gain codebook stored in the gain codebook 15. An index representing the selected gain code vector is output to the multiplexer 16.

【００５９】重み付け信号計算回路１７は、スペクトル
パラメータ計算回路４の出力パラメータおよびそれぞれ
のインデクスを入力し、インデクスからそれぞれに対応
するコードベクトルを読みだし、まず下式にもとづき駆
動音源信号v(n)を求める。 v(n)= β'_kv(n-T)+G'_kc_k(n) (22) v(n)は適応コードブック回路１１に出力される。The weighting signal calculation circuit 17 receives the output parameters of the spectrum parameter calculation circuit 4 and the respective indexes, reads out the corresponding code vectors from the indexes, and firstly obtains the driving sound source signal v (n) based on the following equation. Ask for. v (n) = β ′ _k v (nT) + G ′ _k c _k (n) (22) v (n) is output to the adaptive codebook circuit 11.

【００６０】次に、スペクトルパラメータ計算回路４の
出力パラメータおよびスペクトルパラメータ量子化回路
５の出力パラメータを用いて下式により、応答信号(s
_w(n))をサブフレームごとに計算し、応答信号計算回路
９に出力する。Next, using the output parameter of the spectrum parameter calculation circuit 4 and the output parameter of the spectrum parameter quantization circuit 5, the response signal (s
_w (n)) is calculated for each subframe, and output to the response signal calculation circuit 9.

【００６１】 [0061]

【００６２】以上により、本発明の第１の実施の形態の
説明を終える。Thus, the description of the first embodiment of the present invention is completed.

【００６３】図３は本発明の第２の実施の形態を示すブ
ロック図である。FIG. 3 is a block diagram showing a second embodiment of the present invention.

【００６４】第２の実施の形態である音声符号化装置２
５が、第１の実施の形態と異なる点は、モード判別回路
２６を新たに設け、音源量子化回路２７の機能の一部を
変更した点である。その他の図１と同一の番号を付した
構成要素は、図１と同じ動作をするので説明は省略す
る。Speech Encoding Apparatus 2 of Second Embodiment
5 is different from the first embodiment in that a mode discrimination circuit 26 is newly provided and a part of the function of a sound source quantization circuit 27 is changed. The other components having the same reference numerals as those in FIG. 1 perform the same operations as those in FIG.

【００６５】モード判別回路２６は、聴感重み付け回路
１７からフレーム単位で聴感重み付け信号を受取り、モ
ード判別情報を出力する。ここでは、モード判別に、現
在のフレームの特徴量を用いる。特徴量としては、例え
ば、フレームで平均したピッチ予測ゲインを用いる。ピ
ッチ予測ゲインの計算には、例えば下式を用いる。The mode discriminating circuit 26 receives the perceptual weighting signal from the perceptual weighting circuit 17 in frame units, and outputs mode discriminating information. Here, the feature amount of the current frame is used for mode determination. As the characteristic amount, for example, a pitch prediction gain averaged in a frame is used. For example, the following equation is used for calculating the pitch prediction gain.

【００６６】 [0066]

【００６７】ここで、Ｌはフレームに含まれるサブフレ
ームの個数である。P_i，E_iはそれぞれ、ｉ番目のサブフ
レームでの音声パワー、ピッチ予測誤差パワーを示す。Here, L is the number of subframes included in the frame. P _i and E _i indicate the speech power and the pitch prediction error power in the i-th subframe, respectively.

【００６８】 [0068]

【００６９】ここで、Ｔは予測ゲインを最大化する最適
遅延である。Here, T is an optimal delay for maximizing the prediction gain.

【００７０】つぎに、フレーム平均ピッチ予測ゲインＧ
をあらかじめ定められた複数個のしきい値と比較して複
数種類のモードに分類する。モードの個数としては、例
えば４を用いることができる。モード判別回路２６は、
モード判別情報を音源量子化回路２７およびマルチプレ
クサ１６に出力する。Next, the frame average pitch prediction gain G
Is compared with a plurality of predetermined threshold values, and classified into a plurality of types of modes. As the number of modes, for example, 4 can be used. The mode determination circuit 26
The mode determination information is output to the sound source quantization circuit 27 and the multiplexer 16.

【００７１】音源量子化回路２７は、モード判別情報が
予め定められたモードを示す場合に音源コードベクトル
をシフトしながら探索する。When the mode discrimination information indicates a predetermined mode, the sound source quantization circuit 27 searches while shifting the sound source code vector.

【００７２】図４は図３内の音源量子化回路の構成を示
すブロック図である。音源量子化回路２７が音源量子化
回路１２と異なる点は、位置シフト回路２８の機能の一
部を変更した点である。その他の図２と同一の番号を付
した構成要素は、図２と同じ動作を行うので説明は省略
する。FIG. 4 is a block diagram showing the configuration of the sound source quantization circuit in FIG. The sound source quantization circuit 27 differs from the sound source quantization circuit 12 in that a part of the function of the position shift circuit 28 is changed. The other components having the same reference numerals as those in FIG. 2 perform the same operations as those in FIG.

【００７３】位置シフト回路２８は、モード判別回路２
６からモード情報を入力し、予め定められたモードの場
合に音源コードベクトルの位置のシフトを行うようにす
る。以後の動作は図２の位置シフト回路１９と同一であ
る。以上で第２の発明の説明を終了する。The position shift circuit 28 includes a mode discriminating circuit 2
The mode information is input from step 6 to shift the position of the excitation code vector in the case of a predetermined mode. Subsequent operations are the same as those of the position shift circuit 19 in FIG. This concludes the description of the second invention.

【００７４】図５は本発明の第３の実施の形態を示すブ
ロック図である。FIG. 5 is a block diagram showing a third embodiment of the present invention.

【００７５】第３の実施の形態である音声符号化装置２
９が、第２の実施の形態と異なる点は、音源量子化回路
３０の機能の一部を変更した点である。その他の図３と
同一の番号を付した構成要素は、図３と同じ動作をする
ので説明は省略する。Speech coding apparatus 2 according to a third embodiment
9 differs from the second embodiment in that a part of the function of the sound source quantization circuit 30 is changed. The other components having the same reference numerals as those in FIG. 3 perform the same operations as those in FIG.

【００７６】図６は図５内の音源量子化回路の構成を示
すブロック図である。音源量子化回路３０が音源量子化
回路２７と異なる点は、位置シフト回路３１の機能の一
部を変更した点である。その他の図２および４と同一の
番号を付した構成要素は、図２および４と同じ動作を行
うので説明は省略する。FIG. 6 is a block diagram showing the configuration of the sound source quantization circuit in FIG. The sound source quantization circuit 30 differs from the sound source quantization circuit 27 in that a part of the function of the position shift circuit 31 is changed. The other components denoted by the same reference numerals as those in FIGS. 2 and 4 perform the same operations as those in FIGS.

【００７７】位置シフト回路３１は、作用の項で説明し
たようにモード情報を入力し、モードごとにコードベク
トルの位置のシフト量を変化させる。即ち、モードごと
に、シフトに要するビット数Ａを変化させる。例えば、
モード０では０ビット、モード１ではＡ₁ ビット、モー
ド２ではＡ₂ ビット、モード３ではＡ₃ ビットという値
をとる。図７は本発明の第４の実施の形態を示すブロッ
ク図である。The position shift circuit 31 inputs the mode information as described in the section of operation, and changes the shift amount of the position of the code vector for each mode. That is, the number of bits A required for the shift is changed for each mode. For example,
Mode 0, 0 bits, A ₁ bit in mode 1, A ₂ bit in mode 2, the value A _3-bit in mode 3 take. FIG. 7 is a block diagram showing a fourth embodiment of the present invention.

【００７８】第４の実施の形態である音声符号化装置３
２が、第１の実施の形態と異なる点は、音源量子化回路
３３の機能の一部を変更した点である。その他の図１と
同一の番号を付した構成要素は、図１と同じ動作をする
ので説明は省略する。Speech coding apparatus 3 according to a fourth embodiment
2 is different from the first embodiment in that a part of the function of the sound source quantization circuit 33 is changed. The other components having the same reference numerals as those in FIG. 1 perform the same operations as those in FIG.

【００７９】図８は図７内の音源量子化回路の構成を示
すブロック図である。音源量子化回路１２が音源量子化
回路３３と異なる点は、割り当て回路３４が、音源コー
ドブック１３に格納されたコードベクトルのインデクス
に応じて、位置をシフトさせる量を割り当てる。ただ
し、コードブック全体ではシフト量の合計をＡビットと
して定めておく。位置シフト回路３５は、割り当て回路
３４からシフト量を入力されると、コードベクトルの位
置をシフトさせる。FIG. 8 is a block diagram showing the configuration of the sound source quantization circuit in FIG. The difference between the sound source quantization circuit 12 and the sound source quantization circuit 33 is that the assignment circuit 34 assigns an amount to shift the position according to the index of the code vector stored in the sound source codebook 13. However, the total shift amount is determined as A bits in the entire codebook. When the shift amount is input from the assignment circuit 34, the position shift circuit 35 shifts the position of the code vector.

【００８０】以上で本発明の実施例の説明を終える。The description of the embodiment of the present invention has been completed.

【００８１】なお、本発明は、上述した実施の形態に限
らず、種々の変形が可能である。例えば、音源コードブ
ックは、従来から使用されているような構成でもよい
し、複数個のパルス列からなる構成でもよい。音源コー
ドブックのコードベクトルは音声信号データを用いてあ
らかじめ学習して構成してもよい。さらに、モード判別
情報を用いて適応コードブック回路、音源コードブック
や、ゲインコードブックを切替える構成とすることも可
能である。The present invention is not limited to the above-described embodiment, but can be variously modified. For example, the sound source codebook may have a configuration as conventionally used, or may have a configuration including a plurality of pulse trains. The code vector of the sound source codebook may be configured by learning in advance using audio signal data. Furthermore, it is also possible to adopt a configuration in which the adaptive codebook circuit, the sound source codebook, and the gain codebook are switched using the mode determination information.

【００８２】[0082]

【発明の効果】以上説明したように、本発明は、音源量
子化部が、コードブックを探索するときに、コードブッ
クに格納されたコードベクトルの少なくとも一つについ
て位置をシフトさせながら探索することにより、また、
シフトさせる量をモードごとに変化させることにより、
従来の方法と同一のビットレートでも、コードブックの
探索に必要な演算量と、コードブックの格納に必要なメ
モリ量の両者を低減化できるという効果がある。また、
この効果はシフトに費やすビット数を増すことにより増
大するという効果もある。As described above, according to the present invention, when searching for a codebook, the sound source quantization unit searches for at least one of the code vectors stored in the codebook while shifting the position. By and
By changing the shift amount for each mode,
Even with the same bit rate as the conventional method, there is an effect that both the amount of calculation required for searching the codebook and the amount of memory required for storing the codebook can be reduced. Also,
This effect is also increased by increasing the number of bits spent for shifting.

[Brief description of the drawings]

【図１】本発明の第１の実施の形態を示すブロック図で
ある。FIG. 1 is a block diagram showing a first embodiment of the present invention.

【図２】図１内の音源量子化回路の構成を示すブロック
図である。FIG. 2 is a block diagram showing a configuration of a sound source quantization circuit in FIG.

【図３】本発明の第２の実施の形態を示すブロック図で
ある。FIG. 3 is a block diagram showing a second embodiment of the present invention.

【図４】図３内の音源量子化回路の構成を示すブロック
図である。FIG. 4 is a block diagram illustrating a configuration of a sound source quantization circuit in FIG. 3;

【図５】本発明の第３の実施の形態を示すブロック図で
ある。FIG. 5 is a block diagram showing a third embodiment of the present invention.

【図６】図５内の音源量子化回路の構成を示すブロック
図である。FIG. 6 is a block diagram showing a configuration of a sound source quantization circuit in FIG. 5;

【図７】本発明の第４の実施の形態を示すブロック図で
ある。FIG. 7 is a block diagram showing a fourth embodiment of the present invention.

【図８】図７内の音源量子化回路の構成を示すブロック
図である。FIG. 8 is a block diagram showing a configuration of a sound source quantization circuit in FIG. 7;

[Explanation of symbols]

１，２５，２９，３２音声符号化装置２フレーム分割回路３サブフレーム分割回路４スペクトルパラメータ計算回路５スペクトルパラメータ量子化回路６線スペクトル対パラメータコードブック（ＬＳＰ
コードブック）７聴感重み付け回路８減算器９応答信号計算回路１０インパルス応答計算回路１１適応コードブック回路１２，２７，３０，３３音源量子化回路１３音源コードブック１４ゲイン量子化回路１５ゲインコードブック１６マルチプレクサ１７重み付け信号計算回路１８逆フィルタリング回路１９，２８，３１，３５位置シフト回路２０自己相関計算回路２１相互相関計算回路２２２乗計算回路２３割算回路２４最大値判別回路２６モード判別回路３４割り当て回路1, 25, 29, 32 Speech coding apparatus 2 Frame division circuit 3 Subframe division circuit 4 Spectrum parameter calculation circuit 5 Spectrum parameter quantization circuit 6 Line spectrum pair parameter codebook (LSP
Codebook) 7 auditory weighting circuit 8 subtractor 9 response signal calculation circuit 10 impulse response calculation circuit 11 adaptive codebook circuit 12, 27, 30, 33 sound source quantization circuit 13 sound source codebook 14 gain quantization circuit 15 gain codebook 16 Multiplexer 17 Weighting signal calculation circuit 18 Inverse filtering circuit 19, 28, 31, 35 Position shift circuit 20 Autocorrelation calculation circuit 21 Cross-correlation calculation circuit 22 Square calculation circuit 23 Division circuit 24 Maximum value determination circuit 26 Mode determination circuit 34 Assignment circuit

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開平２−79100（ＪＰ，Ａ) 特開平５−281999（ＪＰ，Ａ) 特開平６−222797（ＪＰ，Ａ) 特開平６−266395（ＪＰ，Ａ) 特表平２−501166（ＪＰ，Ａ) ────────────────────────────────────────────────── ─── Continuation of the front page (56) References JP-A-2-79100 (JP, A) JP-A-5-281999 (JP, A) JP-A-6-222797 (JP, A) JP-A-6-222797 266395 (JP, A) Special table Hei 2-501166 (JP, A)

Claims

(57) [Claims]

A spectrum parameter calculator for calculating and quantizing a spectrum parameter from an input speech signal;
A sound source quantization unit that searches for, quantizes, and outputs a codebook in which a sound source signal of the sound signal is stored in advance using the spectrum parameter, wherein the sound source quantization unit includes the codebook. When searching for, a part of the code vectors stored in this codebook is searched while shifting the position , and the
Search for other code vectors without shifting the position.
A speech encoding device having a search function.

2. A spectrum parameter calculation unit for obtaining and quantizing a spectrum parameter from an input speech signal,
A sound source quantization unit for searching, quantizing, and outputting a code book in which a sound source signal of the sound signal is stored in advance by using the spectrum parameter; and A mode discriminator that discriminates a mode from the perceived weighting signal and outputs mode information, and a code vector stored in the code book in a predetermined mode when the sound source quantization unit searches the code book. We searched while shifting the <br/> position for some in the other code vectors
An audio coding apparatus having a function of searching for a file without shifting its position .

3. A spectrum parameter calculator for obtaining and quantizing a spectrum parameter from an input speech signal,
A sound source quantization unit that searches for a code book in which the sound source signal of the sound signal is stored in advance by using the spectrum parameter, and quantizes and outputs the code book. A mode discrimination unit that discriminates a mode from the perceived weighting signal that has been subjected to and outputs mode information, and a position of at least one of the code vectors stored in the codebook when the sound source quantization unit searches the codebook. A function of performing a search while changing an amount by which to shift according to the mode information.

4. A spectrum parameter calculator for obtaining and quantizing a spectrum parameter from an input speech signal;
A sound source quantization unit that searches, quantizes, and outputs a codebook in which a sound source signal of the sound signal is stored in advance using the spectrum parameter, wherein the sound source quantization unit includes the codebook. A speech encoding apparatus having a function of performing a search while changing an amount of position shift according to a value determined for each code vector stored in the codebook when searching for.