JPH1011093A

JPH1011093A - Signal encoding device

Info

Publication number: JPH1011093A
Application number: JP8164840A
Authority: JP
Inventors: Kazunori Ozawa; 一範小澤
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1996-06-25
Filing date: 1996-06-25
Publication date: 1998-01-16
Anticipated expiration: 2016-06-25
Also published as: JP3092654B2

Abstract

PROBLEM TO BE SOLVED: To obtain good sound quality for speech from multiple speakers and for music signals produced by multiple instruments, by finding a plurality of delays from the input signal before calculating the excitation (sound source) signal. SOLUTION: A spectral parameter calculation circuit 200 applies a window longer than the subframe length to the speech signal of at least one subframe, cuts out the speech, and calculates spectral parameters of a predetermined order. A pitch extraction circuit 390 uses the output of a perceptual weighting circuit 230 to find delays and gains corresponding to pitch periods. An excitation quantization circuit 350 then vector-quantizes the excitation signal using an excitation codebook 351: the output of a subtractor 236 and the output of an impulse response calculation circuit 310 are used to search the excitation codebook 351 for an excitation code vector c_j(n). The index of the selected excitation code vector is output to a multiplexer 400.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to a signal encoding device for encoding speech and music signals with high quality at a low bit rate.

[0002]

2. Description of the Related Art

A known method for encoding speech signals efficiently is CELP (Code Excited Linear Predictive Coding), described for example in M. Schroeder and B. Atal, "Code-excited linear prediction: High quality speech at very low bit rates" (Proc. ICASSP, pp. 937-940, 1985) (Reference 1) and in Kleijn et al., "Improved speech quality and efficient vector quantization in SELP" (Proc. ICASSP, pp. 155-158, 1988) (Reference 2). In this conventional method, the transmitting side uses linear prediction (LPC) analysis to extract, for each frame (for example, 20 ms), spectral parameters representing the spectral characteristics of the speech signal. Each frame is further divided into subframes (for example, 5 ms). For each subframe, the adaptive codebook parameters (a delay parameter corresponding to the pitch period, and a gain parameter) are extracted based on the past excitation signal, and the speech signal of the subframe is pitch-predicted using the adaptive codebook. For the excitation signal obtained by pitch prediction, an optimal excitation code vector is selected from an excitation codebook (vector quantization codebook) consisting of predetermined kinds of noise signals, and an optimal gain is calculated, thereby quantizing the excitation signal. The excitation code vector is selected so as to minimize the error power between the signal synthesized from the selected noise signal and the residual signal. The index indicating the selected code vector, the gain, the spectral parameters, and the adaptive codebook parameters are then combined by a multiplexer and transmitted. Description of the receiving side is omitted.

[0003]

Problems to Be Solved by the Invention

The conventional methods have the problem that selecting the optimal excitation code vector from the excitation codebook requires an enormous amount of computation. This is because, in the methods of References 1 and 2, selecting an excitation code vector requires a filtering or convolution operation on each code vector, and this operation is repeated for every code vector stored in the codebook. For example, if the codebook has B bits and N dimensions, and the filter or impulse response length used in the filtering or convolution is K, the computation required per second is N × K × 2^B × 8000 / N. With B = 10, N = 40, and K = 10, this amounts to 81,920,000 operations per second, which is extremely large. The problem becomes more serious as the input signal bandwidth exceeds the telephone band and the sampling frequency increases.
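The operation count quoted above can be checked with a short calculation. This is a minimal sketch; the function name and the default 8000 Hz sampling rate (taken from the factor 8000 in the formula) are illustrative assumptions, not part of the patent.

```python
def codebook_search_ops_per_second(bits, dim, resp_len, sample_rate=8000):
    """Operation count for an exhaustive excitation codebook search.

    Each of the 2**bits code vectors of dimension `dim` is filtered with a
    response of length `resp_len` (dim * resp_len operations), and the search
    is repeated for each of the sample_rate / dim vectors per second.
    """
    return dim * resp_len * (2 ** bits) * sample_rate // dim


# Values from the text: B = 10, N = 40, K = 10.
print(codebook_search_ops_per_second(bits=10, dim=40, resp_len=10))  # 81920000
```

Note that N cancels out, so the cost scales with K, with the codebook size 2^B, and with the sampling rate, which is why wider-band input makes the problem worse.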

[0004] Various methods have been proposed to reduce the computation required for the excitation codebook search. One example is the ACELP (Algebraic Code Excited Linear Prediction) method, described for example in C. Laflamme et al., "16 kbps wideband speech coding technique based on algebraic CELP" (Proc. ICASSP, pp. 13-16, 1991) (Reference 3). In the method of Reference 3, the excitation signal is represented by a plurality of pulses, and the position of each pulse is represented by a predetermined number of bits and transmitted. Because the amplitude of each pulse is restricted to +1.0 or -1.0, the computation required for the pulse search is greatly reduced.
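As an illustration of the pulse representation described above, the following sketch builds an excitation vector from pulse positions and signs. The function name and argument layout are hypothetical and do not reproduce the search procedure of Reference 3.

```python
def build_pulse_excitation(length, positions, signs):
    """Build a sparse excitation of `length` samples from unit pulses.

    positions: sample index of each pulse
    signs:     +1 or -1 for each pulse (amplitudes are fixed at +/-1.0)
    """
    excitation = [0.0] * length
    for pos, sign in zip(positions, signs):
        excitation[pos] += 1.0 if sign >= 0 else -1.0
    return excitation


# Two pulses in an 8-sample vector: +1.0 at index 1 and -1.0 at index 5.
print(build_pulse_excitation(8, [1, 5], [+1, -1]))
```

Because only the positions and signs vary, the encoder transmits a few position bits per pulse instead of an index into a stored noise codebook.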

[0005] However, although each of the methods described above gives relatively good sound quality for speech with a single pitch, at low bit rates the sound quality deteriorates severely for signals containing multiple pitches, such as the voices of several speakers in a conference or music signals in which several kinds of instruments are mixed.

[0006] SUMMARY OF THE INVENTION An object of the present invention is to solve the above problems and to provide a signal coding method that, even at a low bit rate, suffers little degradation in sound quality for music signals as well as wideband speech, with a relatively small amount of computation.

[0007]

Means for Solving the Problems

According to a first aspect of the present invention, there is provided a signal encoding device comprising: a spectral parameter calculation unit that obtains spectral parameters from an input signal and quantizes them; a delay calculation unit that obtains a first delay corresponding to a first pitch period from the input signal, obtains a pitch prediction residual signal, obtains a second delay corresponding to a second pitch period from the pitch prediction residual signal, and obtains and outputs a predetermined plurality of delays including at least the first and second delays; and an excitation quantization unit that obtains, quantizes, and outputs an excitation signal for the residual signal pitch-predicted using the plurality of delays.

[0008] According to a second aspect of the present invention, there is provided a signal encoding device characterized in that the excitation signal of the signal is composed of a plurality of pulses with non-zero amplitude.

[0009] According to a third aspect of the present invention, there is provided a signal encoding device comprising: a spectral parameter calculation unit that obtains spectral parameters from an input signal and quantizes them; a mode discrimination unit that extracts a feature quantity from the input signal and discriminates a mode; a delay calculation unit that, in a predetermined mode, obtains a first delay corresponding to a first pitch period from the input signal, obtains a pitch prediction residual signal, obtains a second delay corresponding to a second pitch period from the pitch prediction residual signal, and obtains and outputs a predetermined plurality of delays including at least the first and second delays; and an excitation quantization unit that obtains, quantizes, and outputs an excitation signal for the residual signal pitch-predicted using the plurality of delays.

[0010] According to a fourth aspect of the present invention, there is provided a signal encoding device characterized in that the excitation signal of the signal is composed of a plurality of pulses with non-zero amplitude.

[0011] According to a fifth aspect of the present invention, there is provided a signal encoding device comprising: a spectral parameter calculation unit that obtains spectral parameters from an input signal and quantizes them; a pitch prediction unit that obtains a first delay corresponding to a first pitch period from the input signal, obtains a pitch prediction residual signal, calculates a first pitch prediction gain, obtains a second delay corresponding to a second pitch period from the pitch prediction residual signal, and repeats these processes; a discrimination unit that determines whether the pitch prediction gain satisfies a predetermined condition; and an excitation quantization unit that, when the pitch prediction gain does not satisfy the predetermined condition, obtains, quantizes, and outputs an excitation signal for the residual signal pitch-predicted using the delays.

[0012] According to a sixth aspect of the present invention, there is provided a signal encoding device characterized in that the excitation signal of the signal is composed of a plurality of pulses with non-zero amplitude.

[0013] According to a seventh aspect of the present invention, there is provided a signal encoding device comprising: a spectral parameter calculation unit that obtains spectral parameters from an input signal and quantizes them; a mode discrimination unit that extracts a feature quantity from the input signal and discriminates a mode; a pitch prediction unit that, in a predetermined mode, obtains a first delay corresponding to a first pitch period from the input signal, obtains a pitch prediction residual signal, calculates a first pitch prediction gain, obtains a second delay corresponding to a second pitch period from the pitch prediction residual signal, and repeats these processes; a discrimination unit that determines whether the pitch prediction gain satisfies a predetermined condition; and an excitation quantization unit that, when the pitch prediction gain does not satisfy the predetermined condition, obtains, quantizes, and outputs an excitation signal for the residual signal pitch-predicted using the delays.

[0014] According to an eighth aspect of the present invention, there is provided a signal encoding device characterized in that the excitation signal of the signal is composed of a plurality of pulses with non-zero amplitude.

[0015] In the first aspect of the present invention, the calculation of a delay corresponding to a pitch period from the input signal and the calculation of the pitch prediction residual signal are repeated to obtain a predetermined number of delays, and an excitation signal is obtained, quantized, and output for the residual signal pitch-predicted using the plurality of delays.

[0016] In the second aspect of the present invention, the excitation signal of the first aspect is composed of a train of M pulses with non-zero amplitude, and the excitation signal is quantized by obtaining the amplitudes and positions of the pulses.

[0017] In the third aspect of the present invention, a feature quantity is extracted from the input signal to discriminate a mode, and the same operation as in the first aspect is performed only in a predetermined mode.

[0018] In the fourth aspect of the present invention, the excitation signal of the third aspect is composed of a train of M pulses with non-zero amplitude, and the excitation signal is quantized by obtaining the amplitudes and positions of the pulses.

[0019] In the fifth aspect of the present invention, the calculation of a delay corresponding to a pitch period from the input signal, the calculation of the pitch prediction residual signal, and the calculation of the pitch prediction gain are repeated; whether the pitch prediction gain satisfies a predetermined condition is determined; and an excitation quantization unit obtains, quantizes, and outputs an excitation signal for the residual signal pitch-predicted using the delays when the pitch prediction gain does not satisfy the predetermined condition.

[0020] In the sixth aspect of the present invention, the excitation signal of the fifth aspect is composed of a train of M pulses with non-zero amplitude, and the excitation signal is quantized by obtaining the amplitudes and positions of the pulses.

[0021] In the seventh aspect of the present invention, a feature quantity is extracted from the input signal to discriminate a mode, and the same operation as in the fifth aspect is performed only in a predetermined mode.

[0022] In the eighth aspect of the present invention, the excitation signal of the seventh aspect is composed of a train of M pulses with non-zero amplitude, and the excitation signal is quantized by obtaining the amplitudes and positions of the pulses.

[0023]

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Embodiments of the present invention will now be described with reference to the drawings.

[0024] FIG. 1 is a block diagram showing an embodiment of a speech encoding device according to the present invention.

[0025] In the figure, a speech signal is input from an input terminal 100. A frame division circuit 110 divides the speech signal into frames (for example, 10 ms), and a subframe division circuit 120 divides the speech signal of each frame into subframes shorter than the frame (for example, 5 ms).
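The frame and subframe division can be sketched as follows. This is an illustrative sketch only: the 8 kHz sampling rate is an assumption (the text gives durations, not sample counts), and the function name is hypothetical.

```python
def split_frames(signal, sample_rate=8000, frame_ms=10, subframe_ms=5):
    """Split a signal into frames, each returned as a list of subframes.

    With the assumed 8 kHz rate, a 10 ms frame has 80 samples and a
    5 ms subframe has 40 samples. A trailing partial frame is dropped.
    """
    frame_len = sample_rate * frame_ms // 1000
    sub_len = sample_rate * subframe_ms // 1000
    frames = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        frames.append([frame[i:i + sub_len] for i in range(0, frame_len, sub_len)])
    return frames


frames = split_frames(list(range(200)))
print(len(frames), len(frames[0]), len(frames[0][0]))  # 2 2 40
```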

[0026] The spectral parameter calculation circuit 200 applies a window longer than the subframe length (for example, 24 ms) to the speech signal of at least one subframe, cuts out the speech, and calculates spectral parameters of a predetermined order (for example, P = 10). Well-known LPC analysis, Burg analysis, or the like can be used to calculate the spectral parameters; Burg analysis is used here. Details of Burg analysis are given on pp. 82-87 of Nakamizo, "Signal Analysis and System Identification" (Corona Publishing, 1988) (Reference 4), so the description is omitted. The spectral parameter calculation circuit further converts the linear prediction coefficients α_i (i = 1, ..., 10) calculated by the Burg method into LSP parameters, which are suited to quantization and interpolation. For the conversion from linear prediction coefficients to LSPs, see Sugamura et al., "Speech information compression by line spectrum pair (LSP) speech analysis-synthesis method" (Transactions of the Institute of Electronics and Communication Engineers of Japan, J64-A, pp. 599-606, 1981) (Reference 5). For example, the linear prediction coefficients obtained by the Burg method in the second subframe are converted into LSP parameters, the LSPs of the first subframe are obtained by linear interpolation, and the LSPs of the first subframe are converted back into linear prediction coefficients. The linear prediction coefficients α_il (i = 1, ..., 10; l = 1, ..., 2) of the first and second subframes are output to the perceptual weighting circuit 230. The LSPs of the second subframe are output to the spectral parameter quantization circuit 210.

[0027] The spectral parameter quantization circuit 210 efficiently quantizes the LSP parameters of a predetermined subframe. Here, vector quantization is used as the quantization method, and the LSP parameters of the second subframe are quantized. Well-known methods can be used for the vector quantization of LSP parameters. For specific methods, see for example JP-A-4-171500 (Japanese Patent Application No. 2-297600) (Reference 6), JP-A-4-36300 (Japanese Patent Application No. 3-261925) (Reference 7), JP-A-5-6199 (Japanese Patent Application No. 3-155049) (Reference 8), and T. Nomura et al., "LSP Coding Using VQ-AVQ With Interpolation in 4.075 kbps M-LCELP Speech Coder" (Proc. Mobile Multimedia Communications, pp. B.2.5, 1993) (Reference 9).

[0028] Using the LSP codebook 211, the code vector that minimizes the distortion given by Equation 1 below is selected and output.

[0029]

(Equation 1)

Here, LSP(i), QLSP(i)_j, and W(i) are, respectively, the i-th order LSP before quantization, the j-th result after quantization, and the weighting coefficient.
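The equation image is not reproduced in this text, so the following is only a sketch of a codebook search consistent with the symbol definitions above. It assumes the distortion is a weighted squared error, D_j = Σ_i W(i) [LSP(i) - QLSP(i)_j]^2; the function name and the toy codebook are hypothetical.

```python
def select_lsp_codevector(lsp, codebook, weights):
    """Return (index, distortion) of the code vector closest to `lsp`.

    Assumed distortion: sum over i of W(i) * (LSP(i) - QLSP(i)_j) ** 2.
    """
    best_j, best_d = -1, float("inf")
    for j, qlsp in enumerate(codebook):
        d = sum(w * (x - q) ** 2 for w, x, q in zip(weights, lsp, qlsp))
        if d < best_d:
            best_j, best_d = j, d
    return best_j, best_d


# Toy 2-dimensional example with three code vectors and unit weights.
codebook = [[0.10, 0.40], [0.20, 0.60], [0.25, 0.50]]
index, distortion = select_lsp_codevector([0.20, 0.50], codebook, [1.0, 1.0])
print(index)  # 2
```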

[0030] The spectral parameter quantization circuit 210 also restores the LSP parameters of the first subframe from the LSP parameters quantized in the second subframe. Here, the LSPs of the first subframe are restored by linear interpolation between the quantized LSP parameters of the second subframe of the current frame and the quantized LSPs of the second subframe of the previous frame. The LSPs of the first subframe can be restored by linear interpolation after selecting the one code vector that minimizes the error power between the LSPs before and after quantization.
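The interpolation step above can be sketched as follows. The interpolation weight is an assumption: the text says only that the interpolation is linear, so the midpoint (0.5) is used here as a plausible default, and the function name is hypothetical.

```python
def restore_first_subframe_lsp(prev_q_lsp, curr_q_lsp, weight=0.5):
    """Linearly interpolate first-subframe LSPs.

    prev_q_lsp: quantized LSPs of the previous frame's second subframe
    curr_q_lsp: quantized LSPs of the current frame's second subframe
    weight:     assumed interpolation weight toward the current frame
    """
    return [(1.0 - weight) * p + weight * c
            for p, c in zip(prev_q_lsp, curr_q_lsp)]


print(restore_first_subframe_lsp([0.10, 0.30], [0.30, 0.50]))
```

Because both inputs are already quantized, the decoder can repeat the same interpolation without any extra transmitted bits for the first subframe.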

[0031] The LSPs of the first subframe restored as above and the quantized LSPs of the second subframe are converted, subframe by subframe, into linear prediction coefficients α'_i (i = 1, ..., 10) and output to the impulse response calculation circuit 310. The index representing the code vector of the quantized LSPs of the second subframe is output to the multiplexer 400.

[0032] The perceptual weighting circuit 230 receives the unquantized linear prediction coefficients α_i (i = 1, ..., P) for each subframe from the spectral parameter calculation circuit 200, applies perceptual weighting to the subframe speech signal based on Reference 1, and outputs the perceptually weighted signal.

[0033] The response signal calculation circuit 240 receives the linear prediction coefficients α_i for each subframe from the spectral parameter calculation circuit 200, and receives the quantized, interpolated, and restored linear prediction coefficients α'_i for each subframe from the spectral parameter quantization circuit 210. Using the stored filter memory values, it calculates for one subframe the response signal obtained with the input signal set to zero, d(n) = 0, and outputs it to the subtractor 235. The response signal x_z(n) is given by Equation 2 below.

[0034]

(Equation 2)

where, when n - i ≤ 0,

y(n - i) = p(N + (n - i)) (3)

x_z(n - i) = s_w(N + (n - i)) (4)

Here N is the subframe length. γ is a weighting coefficient that controls the amount of perceptual weighting and has the same value as in equation (6) below. s_w(n) and p(n) are, respectively, the output signal of the weighting signal calculation circuit and the output signal of the denominator term of the filter in the first term on the right-hand side of equation (6) described later.

[0035] The pitch extraction circuit 390 uses the output of the perceptual weighting circuit 230 to obtain a plurality of delays and gains corresponding to pitch periods. In the following, this number is two. FIG. 2 shows the configuration of the pitch extraction circuit 390.

[0036] In FIG. 2, the perceptually weighted signal x_w(n) is input from a terminal 391. A first delay and gain calculation circuit 392 obtains the first delay T_1 so as to minimize equation (5), given by Equation 3 below, and obtains the first gain β_1 from equation (6), given by Equation 4 below.

[0037]

(Equation 3)

[0038]

(Equation 4)

Further, the first pitch prediction signal y_1(n) is obtained according to the following equation and output to a subtractor 394.

[0039]

y_1(n) = β_1 x_w(n - T_1) (7)

The subtractor 394 obtains the first pitch prediction residual signal e_1(n) by the following equation.

[0040]

e_1(n) = x_w(n) - y_1(n) (8)

A second delay and gain calculation circuit 393 obtains the second delay T_2 and gain β_2 from e_1(n). These are obtained from equations (5) and (6) by using e_1(n) in place of x_w(n).
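The two-stage delay extraction of equations (7) and (8) can be sketched as below. Since the images for equations (5) and (6) are not reproduced in this text, the search criterion here is an assumption: the delay is chosen to maximize the squared correlation divided by the energy of the delayed signal (equivalent to minimizing the one-tap prediction error), and the gain is the corresponding least-squares gain. All function names and the delay range are hypothetical.

```python
def delay_and_gain(x, start, length, min_delay, max_delay):
    """Return (T, beta) for the segment x[start:start + length].

    Assumed criterion: maximize corr(T)**2 / energy(T); the gain is then
    beta = corr(T) / energy(T). Requires start >= max_delay so that every
    index start + n - T stays non-negative.
    """
    best_t, best_score, best_beta = min_delay, -1.0, 0.0
    for t in range(min_delay, max_delay + 1):
        corr = sum(x[start + n] * x[start + n - t] for n in range(length))
        energy = sum(x[start + n - t] ** 2 for n in range(length))
        if energy <= 0.0:
            continue  # nothing to predict from at this delay
        score = corr * corr / energy
        if score > best_score:
            best_t, best_score, best_beta = t, score, corr / energy
    return best_t, best_beta


def two_stage_delays(x, start, length, min_delay, max_delay):
    """Find (T1, beta1) from x, then (T2, beta2) from the residual e1."""
    t1, beta1 = delay_and_gain(x, start, length, min_delay, max_delay)
    # e1(n) = x(n) - beta1 * x(n - T1), per equations (7) and (8);
    # samples without history (n < T1) are left unpredicted.
    e1 = [x[n] - beta1 * x[n - t1] if n >= t1 else x[n] for n in range(len(x))]
    t2, beta2 = delay_and_gain(e1, start, length, min_delay, max_delay)
    return (t1, beta1), (t2, beta2)


# Impulse train with period 20: the first stage should find T1 = 20.
x = [1.0 if n % 20 == 0 else 0.0 for n in range(200)]
(t1, b1), (t2, b2) = two_stage_delays(x, 120, 40, 16, 60)
print(t1, b1)  # 20 1.0
```

For a signal with a single pitch, as in this example, the first-stage residual is zero and the second stage finds no remaining periodicity; a second pitch component would survive in e_1(n) and be picked up by the second search.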

[0041] T_2 and T_1 are output from terminals 397 and 398, respectively.

[0042] The impulse response calculation circuit 310 calculates, for a predetermined number of points L, the impulse response h_w(n) of the perceptual weighting filter whose z-transform is given by Equation 5 below, and outputs it to the adaptive codebook circuit 300 and the excitation quantization circuit 350.

[0043]

(Equation 5)

The adaptive codebook circuit 300 calculates, among samples in the neighborhood of T_1, the delay T_c1 that minimizes Equation 6 below. Here, the order of the adaptive codebook is 1.

[0044]

(Equation 6)

Here,

y_w(n - T_c1) = v_1(n - T_c1) * h_w(n) (11)

and the symbol * denotes convolution.

[0045] The gain β_c1 is obtained according to Equation 7 below.

[0046]

(Equation 7)

Here, to improve the accuracy of delay extraction for female and children's voices, the delay may be obtained with fractional rather than integer sample resolution. For a specific method, see for example P. Kroon et al., "Pitch predictors with high temporal resolution" (Proc. ICASSP, pp. 661-664, 1990) (Reference 11).
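The images for Equations 6 and 7 are not reproduced in this text. As a hedged sketch only, the usual closed-loop adaptive codebook criterion, written with the symbols defined around equation (11) and assuming the target signal is the x'_w(n) that appears later in equation (14), would take the following form. The equation numbers (10) and (12) are inferred from the numbering of (11) and (13) and are likewise an assumption.

```latex
% Assumed form of Equation 6: error to be minimized over delays T near T_1
D_{T} = \sum_{n=0}^{N-1} x_w'^{\,2}(n)
      - \frac{\left[\sum_{n=0}^{N-1} x_w'(n)\, y_w(n-T)\right]^{2}}
             {\sum_{n=0}^{N-1} y_w^{2}(n-T)}
\tag{10}

% Assumed form of Equation 7: least-squares gain at the selected delay
\beta_{c1} = \frac{\sum_{n=0}^{N-1} x_w'(n)\, y_w(n-T_{c1})}
                  {\sum_{n=0}^{N-1} y_w^{2}(n-T_{c1})}
\tag{12}
```

Minimizing the first expression is equivalent to maximizing its second term, so in practice only the normalized correlation needs to be evaluated for each candidate delay.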

[0047] In the same manner, the delay T_c2 and gain β_c2 are searched for among samples in the neighborhood of T_2.

[0048] Next, the pitch prediction signal is calculated and output to the excitation quantization circuit 350.

[0049]

q_w1(n) = β_c1 v(n - T_c1) * h_w(n) + β_c2 v(n - T_c2) * h_w(n) (13)

The delays T_c1 and T_c2 are output to the multiplexer 400.

[0050] The subtractor 236 performs the following calculation and outputs the result to the excitation quantization circuit 350.

[0051]

z_w(n) = x'_w(n) - q_w(n) (14)

The excitation quantization circuit 350 vector-quantizes the excitation signal using the excitation codebook 351. Using the output of the subtractor 236 and the output of the impulse response calculation circuit 310, it searches the excitation codebook 351 for the excitation code vector c_j(n) that minimizes Equation 8 below.

[0052]

(Equation 8)

Here, the symbol * denotes convolution.
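The codebook search described above can be sketched as an exhaustive loop. The image for Equation 8 is not reproduced in this text, so the error measure used here, the sum over n of [z_w(n) - c_j(n) * h_w(n)]^2 with * denoting convolution, is an assumption based on the surrounding description; gains are ignored for simplicity, and the function names and toy codebook are hypothetical.

```python
def convolve_truncated(c, h):
    """Convolve code vector c with impulse response h, truncated to len(c)."""
    out = [0.0] * len(c)
    for n in range(len(c)):
        for k in range(min(n + 1, len(h))):
            out[n] += h[k] * c[n - k]
    return out


def search_excitation_codebook(z, codebook, h):
    """Return the index j minimizing sum_n (z(n) - (c_j * h)(n)) ** 2."""
    best_j, best_d = -1, float("inf")
    for j, c in enumerate(codebook):
        synth = convolve_truncated(c, h)  # filtering done once per code vector
        d = sum((zn - sn) ** 2 for zn, sn in zip(z, synth))
        if d < best_d:
            best_j, best_d = j, d
    return best_j


h = [1.0, 0.5]
codebook = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0]]
target = [0.0, 1.0, 0.5, 0.0]  # equals codebook[1] filtered by h
print(search_excitation_codebook(target, codebook, h))  # 1
```

The inner convolution inside the loop over all code vectors is what makes an exhaustive search of this kind expensive for large codebooks.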

[0053] The index of the selected excitation code vector is output to the multiplexer 400.

[0054] The gain quantization circuit 365 reads gain code vectors from the gain codebook 355 and, for the selected excitation code vector, selects the gain code vector that minimizes Equation 9 below.

[0055] Here, an example is shown in which the gain of the excitation code vector is vector-quantized.

[0056]

(Equation 9)

Here, G'_t is the t-th code vector in the gain codebook stored in the gain codebook 355.

[0057] The index representing the selected gain code vector is output to the multiplexer 400.

[0058] The weighting signal calculation circuit 360 receives the respective indices, reads out the code vectors corresponding to them, and first obtains the driving excitation signal v(n) from the following equation.

[0059]

v(n) = g(n) + G'_t c_j(n) (19)

v(n) is output to the adaptive codebook circuit 300.

[0060] Next, using the output parameters of the spectral parameter calculation circuit 200 and the output parameters of the spectral parameter quantization circuit 210, the response signal s_w(n) is calculated for each subframe by Equation 10 below and output to the response signal calculation circuit 240.

[0061]

(Equation 10)

This concludes the description of the embodiment corresponding to the first invention.

[0062] FIG. 3 is a block diagram showing the configuration of the second embodiment. It differs from FIG. 1 in the sound source quantization circuit 500, the amplitude codebook 540, the gain quantization circuit 550, and the gain codebook 560.

[0063] The sound source quantization circuit 500 calculates the positions and amplitudes of a pulse train consisting of M pulses with non-zero amplitude.

[0064] FIG. 4 is a block diagram showing the configuration of the sound source quantization circuit 500. In FIG. 4, the correlation coefficient calculation circuit 510 receives x_w(n) and h_w(n) from terminals 501 and 502, respectively, calculates two kinds of correlation coefficients, d(n) and φ, according to the following Equations 11 and 12, and outputs them to the position calculation circuit 520 and the amplitude quantization circuit 530.

[0065]

[Equation 11]

[0066]

[Equation 12]

The position calculation circuit 520 calculates the positions of a predetermined number M of non-zero-amplitude pulses. As in Reference 3, for each pulse it determines, from among predetermined position candidates, the position that maximizes the expression given below.

[0067] For example, with a subframe length of N = 40 and M = 5 pulses, the position candidates can be expressed as shown in Table 1 below.

[0068]

[Table 1]

For each pulse, the position candidates are examined and the position that maximizes the following expression is selected.

[0069] D = C_k² / E_k   (23)

Here, C_k and E_k are given by the following Equations 13 and 14.

[0070]

[Equation 13]

[0071]

[Equation 14]

Here, m_k denotes the position of the k-th pulse, and sgn(k) is the polarity of the k-th pulse.
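
The correlation calculation and the per-pulse position search described above might be sketched in Python as follows. Equations 11 to 14 and Table 1 are not reproduced in this text, so everything specific here is an assumption: d(n) is taken as the cross-correlation of x_w with h_w, φ as the autocorrelation of h_w, C_k and E_k as the accumulated correlation and energy of the pulses placed so far, and the candidate layout as simple interleaved tracks. The function names are hypothetical, and the search is a greedy pulse-by-pulse one rather than an exhaustive search.

```python
def interleaved_tracks(n_subframe, n_pulses):
    """Hypothetical candidate layout: pulse k may sit at k, k + M, k + 2M, ..."""
    return [list(range(k, n_subframe, n_pulses)) for k in range(n_pulses)]


def correlations(x_w, h_w):
    """Assumed forms: d(n) = sum_i x_w(i) h_w(i - n),
    phi(p, q) = sum_n h_w(n - p) h_w(n - q)."""
    n_len = len(x_w)
    h = list(h_w[:n_len]) + [0.0] * (n_len - len(h_w))
    d = [sum(x_w[i] * h[i - n] for i in range(n, n_len)) for n in range(n_len)]
    phi = [[sum(h[n - p] * h[n - q] for n in range(max(p, q), n_len))
            for q in range(n_len)] for p in range(n_len)]
    return d, phi


def place_pulses(d, phi, tracks):
    """Greedily pick one position per track, maximising C^2 / E at each step."""
    positions, signs = [], []
    c_acc, e_acc = 0.0, 0.0
    for track in tracks:
        best = None
        for m in track:
            sgn = 1.0 if d[m] >= 0.0 else -1.0    # polarity follows the sign of d(m)
            c = c_acc + sgn * d[m]
            e = e_acc + phi[m][m] + 2.0 * sum(
                sgn * s * phi[m][p] for p, s in zip(positions, signs))
            if e > 0.0 and (best is None or c * c / e > best[0]):
                best = (c * c / e, m, sgn, c, e)
        if best is None:
            raise ValueError("no usable position candidate in this track")
        _, m, sgn, c_acc, e_acc = best
        positions.append(m)
        signs.append(sgn)
    return positions, signs
```

For N = 40 and M = 5, `interleaved_tracks(40, 5)` gives eight candidates per pulse; the actual candidates of Table 1 may differ.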

[0072] The positions of the M pulses are output to the amplitude quantization circuit 530.

[0073] The amplitude quantization circuit 530 quantizes the pulse amplitudes using the amplitude codebook 540. It selects the amplitude code vector that maximizes the following expression.

[0074] C_j² / E_j   (26)

Here, C_j and E_j are given by the following Equations 15 and 16.

[0075]

[Equation 15]

[0076]

[Equation 16]

Here, g′_kj denotes the amplitude of the k-th pulse in the j-th amplitude code vector.
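
A minimal Python sketch of the amplitude codebook search follows. Equations 15 and 16 are not reproduced in this text; the sketch assumes C_j is the sum of g′_kj d(m_k) over the pulses and E_j is the double sum of g′_kj g′_ij φ(m_k, m_i). The function name and argument layout are hypothetical.

```python
def search_amplitude_codebook(d, phi, positions, amp_codebook):
    """Return (index, score) of the amplitude code vector maximising C_j^2 / E_j.

    d            : correlation values d(n), indexed by sample position
    phi          : nested list with phi[p][q]
    positions    : pulse positions m_k chosen beforehand
    amp_codebook : list of amplitude vectors, one amplitude per pulse
    """
    best_j, best_score = -1, -1.0
    for j, amps in enumerate(amp_codebook):
        c = sum(g * d[m] for g, m in zip(amps, positions))
        e = sum(gk * gi * phi[mk][mi]
                for gk, mk in zip(amps, positions)
                for gi, mi in zip(amps, positions))
        if e > 0.0 and c * c / e > best_score:
            best_j, best_score = j, c * c / e
    return best_j, best_score
```

Because the criterion is a ratio, it is insensitive to the overall scale of the amplitude vector; under this assumption the overall level is left to the gain quantization that follows.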

[0077] The amplitude codebook for quantizing the pulse amplitudes can also be trained in advance on speech signals and stored. For the codebook training method, see, for example, the paper by Linde et al. entitled "An algorithm for vector quantization design" (IEEE Trans. Commun., pp. 84-95, January 1980) (Reference 12).

[0078] The index of the amplitude code vector and the position information are output from terminals 503 and 504, respectively.

[0079] The gain quantization circuit 550 quantizes the pulse gain using the gain codebook 560. It selects the gain code vector that minimizes the following Equation 17 and outputs its index to the multiplexer 400.

[0080]

[Equation 17]

The weighting signal calculation circuit 570 receives the respective indices, reads the code vector corresponding to each index, and first obtains the driving sound source signal v(n) from the following equation.

[0081] v(n) = g(n) + G′_t g′_kj h_w(n − m_k)   (30)

v(n) is output to the adaptive codebook circuit 300.

[0082] Next, using the output parameters of the spectrum parameter calculation circuit 200 and the spectrum parameter quantization circuit 210, the response signal s_w(n) is calculated for each subframe according to the following Equation 18 and output to the response signal calculation circuit 240.

[0083]

[Equation 18]

FIG. 5 is a block diagram showing the configuration of the third embodiment.

[0084] The mode discrimination circuit 900 receives the perceptually weighted signal from the perceptual weighting circuit 230 frame by frame and outputs mode discrimination information to the pitch extraction circuit 600 and the multiplexer 400.

[0085] Here, a feature quantity of the current frame is used for mode discrimination. As the feature quantity, for example, the pitch prediction gain averaged over the frame is used. The pitch prediction gain is calculated using, for example, the following Equation 19.

[0086]

[Equation 19]

Here, L is the number of subframes in a frame. P_i and E_i denote the speech power and the pitch prediction error power in the i-th subframe, respectively, and are given by the following Equations 20 and 21.

[0087]

[Equation 20]

[0088]

[Equation 21]

Here, T is the optimal delay that maximizes the prediction gain.
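
The frame-averaged pitch prediction gain can be illustrated with the Python sketch below. Equations 19 to 21 are not reproduced in this text, so the forms used are assumptions: P_i is the subframe energy, E_i is the energy left after an optimal one-tap prediction at delay T, and G is 10 log10 of the mean of P_i / E_i over the L subframes. Accepting only positively correlated delays is a further assumption, and the function names are hypothetical.

```python
import math


def subframe_powers(x, start, length, t_min, t_max):
    """Return (P, E, T): power, prediction error power, and best delay."""
    seg = x[start:start + length]
    p = sum(v * v for v in seg)
    best_e, best_t = p, None
    for t in range(t_min, t_max + 1):
        if start - t < 0:
            break                                  # not enough past samples
        past = x[start - t:start - t + length]
        num = sum(a * b for a, b in zip(seg, past))
        den = sum(b * b for b in past)
        # Assumption: only positively correlated delays count as pitch candidates.
        if den > 0.0 and num > 0.0:
            e = p - num * num / den
            if e < best_e:
                best_e, best_t = e, t
    return p, best_e, best_t


def frame_pitch_gain_db(x, starts, length, t_min, t_max):
    """Average P_i / E_i over the subframes and convert to decibels."""
    ratios = []
    for start in starts:
        p, e, _ = subframe_powers(x, start, length, t_min, t_max)
        ratios.append(p / max(e, 1e-12))           # guard against E_i = 0
    mean_ratio = sum(ratios) / len(ratios)
    return 10.0 * math.log10(max(mean_ratio, 1e-12))
```

A strongly periodic frame gives a large gain, while a frame with no usable delay gives 0 dB.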

[0089] The frame-averaged pitch prediction gain G is compared with a plurality of predetermined thresholds and classified into one of a plurality of modes. The number of modes can be, for example, 4.
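
The threshold comparison can be sketched in Python as below. With three thresholds the gain falls into one of four modes. The threshold values in the usage note are purely illustrative, and treating a gain equal to a threshold as belonging to the higher mode is an assumption; the text does not give either.

```python
from bisect import bisect_right


def classify_mode(gain_db, thresholds):
    """Map a pitch prediction gain to a mode index 0 .. len(thresholds).

    Mode k means the gain is at or above k of the thresholds.
    """
    return bisect_right(sorted(thresholds), gain_db)
```

For example, with the hypothetical thresholds `[1.0, 4.0, 7.0]`, a gain of 5.0 dB lands between the second and third thresholds and is assigned mode 2.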

[0090] The pitch extraction circuit 600 receives the mode discrimination information and, in a predetermined mode, performs the same processing as in FIG. 2 and outputs a plurality of delays. In the other modes, no delay is output.

[0091] FIG. 6 is a block diagram showing the configuration of the fourth embodiment. It is obtained by adding the mode discrimination circuit 900 of FIG. 5 to the configuration of FIG. 3 and using the pitch extraction circuit 600, so its description is omitted.

[0092] FIG. 7 is a block diagram showing the configuration of the fifth embodiment. It differs from the configuration of FIG. 1 in the pitch extraction circuit 700, the sound source quantization circuit 850, the first sound source codebook 851, and the second sound source codebook 852, so these are described below.

[0093] FIG. 8 is a block diagram showing the configuration of the pitch extraction circuit 700.

[0094] The first pitch prediction gain calculation circuit 710 obtains the first pitch prediction gain from the following Equation 22, using the delay obtained by the first delay and gain calculation circuit 392.

[0095]

[Equation 22]

When the pitch prediction gain G₁ is larger than a predetermined threshold, the discrimination circuit 730 causes the second delay and gain calculation circuit 393 to continue processing.

[0096] When G₁ is smaller than the threshold, the second delay and gain calculation circuit 393 does not perform its processing, and the first delay T₁ is output.

[0097] When processing continues, the second delay and gain calculation circuit 393 operates on the output of the subtractor 394 and calculates the second delay and gain by replacing x_w(n) in Equations (5) and (6) with e₁(n).

[0098] The second pitch prediction gain calculation circuit 720 calculates the second pitch prediction gain G₂ by replacing x_w(n) in Equation (35) with e₁(n).

[0099] The discrimination circuit 730 compares G₂ with a predetermined threshold. When G₂ is larger than the threshold, it outputs the first delay and the second delay. When G₂ is smaller than the threshold, it outputs only the first delay. In addition, the number of delays is output from terminal 399.
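
The decision flow of the pitch extraction circuit 700 can be summarised by the Python sketch below. The two analysis stages are passed in as callables so that the sketch stays independent of how the delays and gains are actually computed; `first_stage`, `second_stage`, and both threshold parameters are hypothetical names. How a gain exactly equal to the threshold is handled is not stated in the text; here it is treated as not exceeding it.

```python
def select_delays(first_stage, second_stage, threshold_1, threshold_2):
    """Return the list of delays to output (its length is the delay count).

    first_stage()          -> (T1, G1, residual)
    second_stage(residual) -> (T2, G2)
    """
    t1, g1, residual = first_stage()
    if g1 <= threshold_1:
        return [t1]                   # second stage is skipped entirely
    t2, g2 = second_stage(residual)
    if g2 > threshold_2:
        return [t1, t2]               # both delays are output
    return [t1]                       # second delay rejected
```

The length of the returned list corresponds to the number of delays output from terminal 399.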

[0100] Returning to FIG. 7, the sound source quantization circuit 850 first checks the number of delays output from the pitch extraction circuit 700. If the number of delays is 2, it quantizes the sound source signal using the first sound source codebook 851, which has the normal number of bits (B₁ bits). If the number of delays is 1, the second sound source codebook 852, which has the same number of bits (B₂) as the number of bits representing a delay, is used together with the first sound source codebook 851.
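
Since B₂ equals the number of bits used for one delay, the bits freed when only one delay is sent appear to be spent on the second codebook, keeping the total constant. The Python sketch below works through that arithmetic. The function name and the bit counts in the usage note are hypothetical, and the constant-budget reading is an inference from the text rather than something it states.

```python
def excitation_plan(num_delays, delay_bits, b1_bits):
    """Return (codebooks, total_bits) for one subframe.

    codebooks is a list of (name, bits) pairs; total_bits counts the delay
    bits plus the sound source codebook bits.
    """
    b2_bits = delay_bits              # per the text, B2 equals the bits of one delay
    if num_delays == 2:
        codebooks = [("codebook 851", b1_bits)]
    elif num_delays == 1:
        codebooks = [("codebook 851", b1_bits), ("codebook 852", b2_bits)]
    else:
        raise ValueError("this sketch only handles one or two delays")
    total_bits = num_delays * delay_bits + sum(bits for _, bits in codebooks)
    return codebooks, total_bits
```

With, say, 7 bits per delay and B₁ = 10, both cases come to 24 bits: 2 × 7 + 10 with two delays, and 7 + 10 + 7 with one.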

[0101] FIG. 9 is a block diagram showing the configuration of the sixth embodiment. It uses the pitch extraction circuit 700 of FIG. 7 in the configuration of FIG. 3.

[0102] The sound source quantization circuit 860 receives the number of delays from the pitch extraction circuit 700 and, according to that number, switches the number of pulses between M₁ and M₂ (M₁ < M₂) and also switches between two amplitude codebooks with different numbers of bits. When the number of delays is 2, the number of pulses is M₁ and the first amplitude codebook 861 is used. When the number of delays is 1, the number of pulses is M₂ and the second amplitude codebook 862 is used.

[0103] FIG. 10 is a block diagram showing the configuration of the seventh embodiment.

[0104] The mode discrimination circuit 900 of FIG. 5 is added to the configuration of FIG. 7, and the pitch extraction circuit 800 performs the same operation as the pitch extraction circuit 700 of FIG. 7 when in a predetermined mode.

[0105] FIG. 11 is a block diagram showing the configuration of the eighth embodiment. FIG. 11 is obtained by adding the mode discrimination circuit 900 and the pitch extraction circuit 800 shown in FIG. 10 to the configuration of FIG. 9, so its description is omitted.

[0106] The present invention is not limited to the embodiments described above, and various modifications are possible.

[0107] A configuration in which the sound source quantization circuit or the gain codebook is switched using the mode information is also possible.

[0108] When a sound source codebook is used, a plurality of code vectors may be selected in ascending order of the distortion given by Equation (15), and the combination of sound source code vector and gain code vector that minimizes Equation (18) may then be selected while the gain is quantized by the gain quantization circuit.

[0109] When the sound source is represented by a pulse train, a plurality of sets of pulse positions may be obtained when quantizing the pulse amplitudes, the amplitude codebook may be searched for each set, and the combination that maximizes Equation (26) may be selected. Alternatively, a plurality of these combinations may be output to the gain quantization circuit, and the combination of positions, amplitude code vector, and gain code vector that minimizes Equation (29) may be selected while the gain is quantized.

[0110] Instead of the amplitude codebook 540, a polarity codebook with a predetermined number of bits may be used.

[0111] The plurality of delays obtained by the pitch extraction circuit can be differentially coded to reduce the number of quantization bits.
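
One possible form of such differential coding is sketched below in Python. The text does not specify the scheme, so this is an assumption: the first delay is coded as an offset from the minimum delay, and each further delay is coded as its difference from the previous one, restricted to a window of ±`diff_range`. The function names, `t_min`, and `diff_range` are hypothetical.

```python
def encode_delays(delays, t_min, diff_range):
    """Code the first delay absolutely and later delays as windowed differences.

    Requires at least one delay. Each difference must satisfy
    -diff_range <= diff < diff_range.
    """
    codes = [delays[0] - t_min]
    prev = delays[0]
    for t in delays[1:]:
        diff = t - prev
        if not -diff_range <= diff < diff_range:
            raise ValueError("difference outside the coded window")
        codes.append(diff + diff_range)   # shift to a non-negative code
        prev = t
    return codes


def decode_delays(codes, t_min, diff_range):
    """Invert encode_delays."""
    delays = [codes[0] + t_min]
    for c in codes[1:]:
        delays.append(delays[-1] + c - diff_range)
    return delays
```

With `diff_range = 16`, each difference fits in 5 bits. An encoder using this scheme would have to confine the search for the second delay to the window, which trades search freedom for bit rate.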

[0112]

[Effects of the Invention] As described above, according to the present invention, the sound source signal is calculated after a plurality of delays have been obtained from the input signal. This yields better sound quality than conventional methods for speech from multiple speakers and for music signals produced by multiple instruments.

[0113] Furthermore, the number of delays is made variable by obtaining the pitch prediction gain while obtaining the delays and determining whether the pitch prediction gain satisfies a predetermined condition. The number of delays can therefore be chosen appropriately for the characteristics of the input signal, and the input signal can be encoded well.

[Brief description of the drawings]

FIG. 1 is a block diagram of a signal encoding device according to a first embodiment of the present invention.

FIG. 2 is a block diagram of the pitch extraction circuit 390 of the signal encoding device of FIG. 1.

FIG. 3 is a block diagram of a signal encoding device according to a second embodiment of the present invention.

FIG. 4 is a block diagram of the sound source quantization circuit 500 of the signal encoding device of FIG. 3.

FIG. 5 is a block diagram of a signal encoding device according to a third embodiment of the present invention.

FIG. 6 is a block diagram of a signal encoding device according to a fourth embodiment of the present invention.

FIG. 7 is a block diagram of a signal encoding device according to a fifth embodiment of the present invention.

FIG. 8 is a block diagram of the pitch extraction circuit 700 of the signal encoding device of FIG. 7.

FIG. 9 is a block diagram of a signal encoding device according to a sixth embodiment of the present invention.

FIG. 10 is a block diagram of a signal encoding device according to a seventh embodiment of the present invention.

FIG. 11 is a block diagram of a signal encoding device according to an eighth embodiment of the present invention.

[Explanation of symbols]

110 frame division circuit
120 subframe division circuit
200 spectrum parameter calculation circuit
210 spectrum parameter quantization circuit
230 perceptual weighting circuit
235, 236 subtraction circuits
240 response signal calculation circuit
300 adaptive codebook circuit
310 impulse response calculation circuit
350, 500, 850, 860 sound source quantization circuits
351 sound source codebook
355, 560 gain codebooks
365, 550 gain quantization circuits
390, 600, 700, 800 pitch extraction circuits
392 first delay calculation circuit
393 second delay calculation circuit
400 multiplexer
510 correlation coefficient calculation circuit
520 position calculation circuit
530 amplitude quantization circuit
540 amplitude codebook
710 first pitch prediction gain calculation circuit
720 second pitch prediction gain calculation circuit
730 discrimination circuit
851 first sound source codebook
852 second sound source codebook
861 first amplitude codebook
862 second amplitude codebook
900 mode discrimination circuit

Claims

[Claims]

1. A signal encoding device comprising: a spectrum parameter calculation unit that obtains and quantizes a spectrum parameter from an input signal; a delay calculation unit that obtains a first delay corresponding to a first pitch period from the input signal, obtains a pitch prediction residual signal, obtains a second delay corresponding to a second pitch period from the pitch prediction residual signal, and obtains and outputs a predetermined plurality of delays including at least the first and second delays; and a sound source quantization unit that obtains a sound source signal for the residual signal pitch-predicted using the plurality of delays, and quantizes and outputs the sound source signal.

2. The signal encoding device according to claim 1, wherein the sound source signal is composed of a plurality of pulses having non-zero amplitudes.

3. A signal encoding device comprising: a spectrum parameter calculation unit that obtains and quantizes a spectrum parameter from an input signal; a mode discrimination unit that extracts a feature quantity from the input signal and discriminates a mode; a delay calculation unit that, in a predetermined mode, obtains a first delay corresponding to a first pitch period from the input signal, obtains a pitch prediction residual signal, obtains a second delay corresponding to a second pitch period from the pitch prediction residual signal, and obtains and outputs a predetermined plurality of delays including at least the first and second delays; and a sound source quantization unit that obtains a sound source signal for the residual signal pitch-predicted using the plurality of delays, and quantizes and outputs the sound source signal.

4. The signal encoding device according to claim 3, wherein the sound source signal is composed of a plurality of pulses having non-zero amplitudes.

5. A signal encoding device comprising: a spectrum parameter calculation unit that obtains and quantizes a spectrum parameter from an input signal; a pitch prediction unit that obtains a first delay corresponding to a first pitch period from the input signal, obtains a pitch prediction residual signal and calculates a pitch prediction gain, obtains a second delay corresponding to a second pitch period from the pitch prediction residual signal, and repeats these processes; a determination unit that determines whether the pitch prediction gain satisfies a predetermined condition; and a sound source quantization unit that, when the pitch prediction gain does not satisfy the predetermined condition, obtains a sound source signal for the residual signal pitch-predicted using the delays, and quantizes and outputs the sound source signal.

6. The signal encoding device according to claim 5, wherein the sound source signal is composed of a plurality of pulses having non-zero amplitudes.

7. A signal encoding device comprising: a spectrum parameter calculation unit that obtains and quantizes a spectrum parameter from an input signal; a mode discrimination unit that extracts a feature quantity from the input signal and discriminates a mode; a pitch prediction unit that, in a predetermined mode, obtains a first delay corresponding to a first pitch period from the input signal, obtains a pitch prediction residual signal and calculates a first pitch prediction gain, obtains a second delay corresponding to a second pitch period from the pitch prediction residual signal, and repeats these processes to obtain the delays; a determination unit that determines whether the pitch prediction gain satisfies a predetermined condition; and a sound source quantization unit that, when the pitch prediction gain does not satisfy the predetermined condition, obtains a sound source signal for the residual signal pitch-predicted using the delays, and quantizes and outputs the sound source signal.

8. The signal encoding device according to claim 7, wherein the sound source signal is composed of a plurality of pulses having non-zero amplitudes.