JP3471542B2

JP3471542B2 - Audio coding device

Info

Publication number: JP3471542B2
Application number: JP30714396A
Authority: JP
Inventors: 澤一範小
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1996-10-31
Filing date: 1996-10-31
Publication date: 2003-12-02
Anticipated expiration: 2016-10-31
Also published as: JPH10133696A

Abstract

PROBLEM TO BE SOLVED: To obtain excellent speech quality even when the bit rate is lowered by finding the position where predetermined conditions are met by a sound source quantization part, and searching for the best position in the search range of positions of pulses representing a sound source signal on the basis of the found position. SOLUTION: A spectrum parameter calculating circuit 200 finds spectrum parameters from an input speech signal and quantizes them. An adaptive code book circuit 300 finds delay corresponding to a pitch cycle from the speech signal and calculates a pitch prediction signal to predict a pitch. Then a sound source quantizing circuit 350 constitutes a sound source signal of the speech signal with M pulses whose amplitudes are not zero, finds the sample position corresponding to a pulse position where the predetermined conditions are met for the pitch prediction signal, and sets a range of a search for the position of a pulse on the basis of a position which is shifted from the found sample position by a predetermined number of samples, thereby searching for and outputting the best position for the set range.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、低ビットレート且
つ高品質で音声信号を符号化する音声符号化装置に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech coding apparatus for coding a speech signal with a low bit rate and a high quality.

[Prior art]

【０００２】音声信号を高能率に符号化する方式として
は、例えば、M.Schroeder and B.Atal氏による”Code-e
xcited linear prediction:High quality speech at ve
rylow bit rates"(Proc.ICASSP,pp. 937-940,1985年)と
題した論文（文献１）や、Klejin氏らによる”Improved
speech quality and efficeint vector quantization
in SELP”（Proc.ICASSP,pp,155-158,1988年）と題した
論文（文献２）等に記載されているCELP（Code Excited
Linear Predictive Coding)が知られている。この従来
例では、送信側では、フレーム毎（例えば２０ｍｓ）に
音声信号から線形予測（ＬＰＣ）分析を用いて、音声信
号のスペクトル特性を表わすスペクトルパラメータを抽
出する。フレームを更にサブフレーム（例えば５ｍｓ）
に分割し、サブフレーム毎に過去の音源信号を基に適応
コードブックにおけるパラメータ（ピッチ周期に対応す
る遅延パラメータとゲインパラメータ）を抽出し、適応
コードブックにより前記サブフレームの音声信号をピッ
チ予測する。ピッチ予測して求めた音源信号に対して、
予め定められた種類の雑音信号からなる音源コードブッ
ク（ベクトル量子化コードブック）から最適な音源コー
ドベクトルを選択し、最適なゲインを計算することによ
り、音源信号を量子化する。音源コードベクトルの選択
の仕方は、選択した雑音信号により合成した信号と、前
記残差信号との誤差電力を最小化するように行う。そし
て、選択されたコードベクトルの種類を表わすインデク
スとゲインならびに、前記スペクトルパラメータと適応
コードブックのパラメータをマルチプレクサ部により組
み合わせて伝送する。受信側の動作、構成は周知である
ので説明は省略する。As a method for efficiently encoding a voice signal, for example, "Code-e" by M. Schroeder and B. Atal.
xcited linear prediction: High quality speech at ve
rylow bit rates "(Proc.ICASSP, pp. 937-940, 1985) (Reference 1) and Klejin et al.," Improved
speech quality and efficeint vector quantization
CELP (Code Excited) described in a paper (reference 2) entitled "in SELP" (Proc.ICASSP, pp, 155-158,1988)
Linear Predictive Coding) is known. In this conventional example, the transmission side extracts a spectrum parameter representing a spectrum characteristic of the voice signal by using linear prediction (LPC) analysis from the voice signal for each frame (for example, 20 ms). Subframe (for example, 5ms)
And the parameters in the adaptive codebook (delay parameters and gain parameters corresponding to the pitch period) are extracted based on the past excitation signal for each subframe, and the pitch prediction of the speech signal of the subframe is performed by the adaptive codebook. . For the sound source signal obtained by pitch prediction,
The excitation signal is quantized by selecting an optimal excitation code vector from an excitation codebook (vector quantization codebook) consisting of noise signals of a predetermined type and calculating an optimal gain. The method of selecting the sound source code vector is to minimize the error power between the residual signal and the signal synthesized by the selected noise signal. Then, the index and the gain indicating the type of the selected code vector, the spectrum parameter and the parameter of the adaptive codebook are combined and transmitted by the multiplexer unit. Since the operation and configuration on the receiving side are well known, description thereof will be omitted.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、前記の
音声符号化装置では、音源コードブックから最適な音源
コードベクトルを選択するのに多大な演算量を要すると
いう問題があった。これは、文献１や２の方法では、音
源コードベクトルを選択するのに、各コードベクトルに
対してフィルタリングもしくは畳み込み演算を、コード
ブックに格納されているコードベクトルの個数だけ繰り
返さなければならないことに起因する。例えば、コード
ブックのビット数がＢビットで次元数がＮのときは、フ
ィルタリングあるいは畳み込み演算の時のフィルタある
いはインパルス応答長をＫとすると、演算量は１秒当た
り、Ｎ×Ｋ×２^B×８０００／Ｎだけ必要となる。一例
として、Ｂ＝１０、Ｎ＝４０、Ｋ＝１０とすると、１秒
当たり８１，９２０，０００回の演算が必要となり、極
めて膨大な演算量になってしまうという問題点があっ
た。However, the speech coding apparatus described above has a problem that a large amount of calculation is required to select the optimum excitation code vector from the excitation codebook. This is because, in the methods of Documents 1 and 2, in order to select a sound source code vector, filtering or convolution operation must be repeated for each code vector by the number of code vectors stored in the codebook. to cause. For example, when the number of bits of the codebook is B bits and the number of dimensions is N, if the filter or impulse response length in the filtering or convolution calculation is K, the calculation amount is N × K × 2 ^B × Only 8000 / N is required. As an example, if B = 10, N = 40, and K = 10, there is a problem that 819,20,000 times of calculations are required per second, resulting in an extremely large amount of calculations.

【０００４】そこで、音源コードブック探索に必要な演
算量を低減する方法として、種々のものが提案されてい
る。例えば、ACELP（Argebraic Code Excited Linear P
rediction）方式が、例えば、C.Laflammeらによる“16
kbps wideband speech coding technique based on al
gebraic CELP"と題した論文（Proc.ICASSP,pp.13-16,19
91)（文献３）等に開示されている。ACELP方式によれ
ば、音源信号を複数個のパルスで表わし、各パルスのた
つ位置は、各パルス毎に予め定められた位置の候補から
選択し、これを予め定められたビット数で表わして伝送
する。ここで、各パルスの振幅は＋１．０もしくは−
１．０に限定されているため、パルス探索の演算量を大
幅に低減化できる。Therefore, various methods have been proposed as methods for reducing the amount of calculation required for the search of the sound source codebook. For example, ACELP (Argebraic Code Excited Linear P
rediction) method is, for example, “16 by C. Laflamme et al.
kbps wideband speech coding technique based on al
gebraic CELP "(Proc.ICASSP, pp.13-16,19
91) (Reference 3) and the like. According to the ACELP method, a sound source signal is represented by a plurality of pulses, and the pulse position of each pulse is selected from a candidate of a position predetermined for each pulse, and this is expressed by a predetermined number of bits for transmission. To do. Here, the amplitude of each pulse is +1.0 or −
Since it is limited to 1.0, the calculation amount of pulse search can be significantly reduced.

【０００５】文献３の従来方式では、演算量を大幅に低
減化することが可能となるが、ビットレートを低減化す
ると、サブフレーム当たりのパルスの個数が急速に減少
し、音質が大幅に劣化するという問題がある。In the conventional method of Document 3, it is possible to significantly reduce the amount of calculation. However, when the bit rate is reduced, the number of pulses per subframe is rapidly reduced, and the sound quality is significantly deteriorated. There is a problem of doing.

【０００６】そこで、本発明の目的は、上述の問題点を
解決し、ビットレートが低い場合にも比較的少ない演算
量で音質の劣化の少ない音声符号化方式を提供すること
にある。SUMMARY OF THE INVENTION It is therefore an object of the present invention to solve the above problems and provide a voice encoding system with a relatively small amount of calculation even when the bit rate is low and with little deterioration in sound quality.

【０００７】[0007]

【課題を解決するための手段】前述の課題を解決するた
め、本発明の第１の態様による音声符号化方式は、入力
音声信号からスペクトルパラメータを求めて量子化する
スペクトルパラメータ計算部と、前記音声信号からピッ
チ周期に対応する遅延を求めてピッチ予測信号を計算し
ピッチ予測を行なう適応コードブック部と、前記音声信
号の音源信号を個数Ｍの振幅が非零のパルスから構成
し、前記ピッチ予測信号に対して予め定められた条件を
満たす前記パルス位置対応のサンプル位置を求め、求め
られたサンプル位置から予め定められたサンプル数だけ
ずらせた位置をもとに前記パルスの位置を探索する範囲
を設定し、設定された範囲に対して最良の位置を探索し
出力する音源量子化部とを有する。In order to solve the above-mentioned problems, the speech coding method according to the first aspect of the present invention comprises a spectrum parameter calculating unit for obtaining and quantizing a spectrum parameter from an input speech signal, and An adaptive codebook unit for calculating a pitch prediction signal by calculating a delay corresponding to a pitch period from a speech signal and performing pitch prediction; and a sound source signal of the speech signal is composed of a number M of non-zero-amplitude pulses. A range in which the sample position corresponding to the pulse position satisfying a predetermined condition with respect to the prediction signal is obtained, and the position of the pulse is searched based on the position shifted from the obtained sample position by a predetermined number of samples. And a sound source quantizer that searches for and outputs the best position for the set range.

【０００８】また、本発明の第２の態様による音声符号
化装置は、入力音声信号からスペクトルパラメータを求
めて量子化するスペクトルパラメータ計算部と、前記音
声信号からピッチ周期に当たる遅延を求めピッチ予測信
号を計算しピッチ予測を行なう適応コードブック部と、
前記音声信号の音源信号を個数Ｍの振幅が非零のパルス
で構成し、先頭からピッチ周期に等しい長さの区間にお
いて前記ピッチ予測信号に対して予め定められた条件を
満たすサンプル位置を求め前記位置から予め定められた
サンプル数だけずらせた位置をもとにパルスの位置を探
索する範囲を設定し、前記範囲に対して最良の位置を探
索し出力する音源量子化部とを有する。Further, the speech coding apparatus according to the second aspect of the present invention comprises a spectrum parameter calculation section for obtaining and quantizing a spectrum parameter from an input speech signal, and a pitch prediction signal for obtaining a delay corresponding to a pitch period from the speech signal. An adaptive codebook section that calculates
The sound source signal of the audio signal is composed of a number M of pulses having non-zero amplitude, and a sample position satisfying a predetermined condition for the pitch prediction signal is obtained in a section having a length equal to the pitch cycle from the beginning. A sound source quantizing unit that sets a range for searching the position of the pulse based on a position shifted from the position by a predetermined number of samples, and searches for and outputs the best position with respect to the range.

【０００９】本発明の第３の態様による音声符号化装置
は、入力音声信号からスペクトルパラメータを求めて量
子化するスペクトルパラメータ計算部と、前記音声信号
からピッチ周期に当たる遅延を求めピッチ予測信号を計
算しピッチ予測を行なう適応コードブック部と、前記音
声信号の音源信号を個数Ｍの振幅が非零のパルスで構成
し、先頭からピッチ周期に等しい長さの区間において前
記ピッチ予測信号に対して予め定められた条件を満たす
サンプル位置を求め、前記位置から予め定められたサン
プル数だけずらぜた位置をもとにパルスの位置の候補を
前記ピッチ周期だけずらせながら設定し、前記候補位置
を探索し最良の位置を出力する音源量子化部とを有す
る。A speech coder according to a third aspect of the present invention calculates a pitch prediction signal by obtaining a spectrum parameter calculation unit for obtaining and quantizing a spectrum parameter from an input speech signal and a delay corresponding to a pitch period from the speech signal. Then, an adaptive codebook unit for performing pitch prediction and a sound source signal of the speech signal are composed of a number M of pulses of which amplitude is non-zero, and the pitch prediction signal is preliminarily compared with the pitch prediction signal in a section having a length equal to the pitch period from the beginning. A sample position satisfying a predetermined condition is obtained, a pulse position candidate is set while being shifted by the pitch period based on a position shifted by a predetermined number of samples from the position, and the candidate position is searched. And a sound source quantizer that outputs the best position.

【００１０】ここで、音源量子化部において、複数個の
パルスの振幅もしくは極性をまとめて量子化するための
コードブックを有する。Here, the excitation quantizer has a codebook for collectively quantizing the amplitudes or polarities of a plurality of pulses.

【００１１】本発明の第４の態様による音声符号化装置
は、入力音声信号からスペクトルパラメータを求めて量
子化するスペクトルパラメータ計算部と、前記音声信号
からピッチ周期に当たる遅延を求めピッチ予測信号を計
算しピッチ予測を行なう適応コードブック部と、前記音
声信号の音源信号を個数Ｍの振幅が非零のパルスで構成
し、前記ピッチ予測信号に対して予め定められた条件を
満たすサンプル位置を求め、複数種のずらし量の各々を
用いて前記位置からずらした後の位置をもとに前記パル
スの位置を探索する範囲を設定し前記範囲に対して位置
を探索し、最良となるずらし量とパルスの位置の組合せ
を出力する音源量子化部とを有する。A speech coder according to a fourth aspect of the present invention calculates a pitch prediction signal by obtaining a spectrum parameter calculation unit for obtaining and quantizing a spectrum parameter from an input speech signal and a delay corresponding to a pitch period from the speech signal. Then, an adaptive codebook unit for performing pitch prediction and a sound source signal of the speech signal are composed of a number M of pulses having non-zero amplitude, and a sample position satisfying a predetermined condition for the pitch prediction signal is obtained. A range for searching the position of the pulse is set based on the position after shifting from the position by using each of a plurality of types of shift amounts, and a position is searched for the range, and the best shift amount and pulse And a sound source quantization unit that outputs a combination of positions.

【００１２】本発明の第５の態様による音声符号化装置
は、入力音声信号からスペクトルパラメータを求めて量
子化するスペクトルパラメータ計算部と、前記音声信号
からピッチ周期に当たる遅延を求めピッチ予測信号を計
算しピッチ予測を行なう適応コードブック部と、前記音
声信号の音源信号を個数Ｍの振幅が非零のパルスで構成
し、先頭からピッチ周期に等しい長さの区間において前
記ピッチ予測信号に対して予め定められた条件を満たす
サンプル位置を求め、複数種のずらし量の各々を用いて
前記位置からずらせた後の位置をもとに前記パルスの位
置を探索する範囲を設定し前記範囲に対して位置を探索
し、最良となるずらし量とパルスの位置の組合せを出力
する音源量子化部とを有する。A speech coder according to a fifth aspect of the present invention calculates a pitch prediction signal by obtaining a spectrum parameter calculation unit for obtaining and quantizing a spectrum parameter from an input speech signal and a delay corresponding to a pitch period from the speech signal. Then, an adaptive codebook unit for performing pitch prediction and a sound source signal of the speech signal are composed of a number M of pulses of which amplitude is non-zero, and the pitch prediction signal is preliminarily compared with the pitch prediction signal in a section having a length equal to the pitch period from the beginning. Obtain a sample position that satisfies the specified conditions, set a range to search the position of the pulse based on the position after shifting from the position using each of a plurality of types of shift amounts, and set the position with respect to the range. And a sound source quantization unit that outputs the best combination of the shift amount and the pulse position.

【００１３】本発明の第６の態様による音声符号化装置
は、入力音声信号からスペクトルパラメータを求めて量
子化するスペクトルパラメータ計算部と、前記音声信号
からピッチ周期に当たる遅延を求めピッチ予測信号を計
算しピッチ予測を行なう適応コードブック部と、前記音
声信号の音源信号を個数Ｍの振幅が非零のパルスで構成
し、先頭からピッチ周期に等しい長さの区間において前
記ピッチ予測信号に対して予め定められた条件を満たす
サンプル位置を求め、複数種のずらし量の各々を用いて
前記位置からずらせた後の位置をもとに、更に前記ピッ
チ周期だけずらせながら前記パルスをたてる位置の候補
を設定し、前記位置を探索し、最良となるずらし量とパ
ルスの位置の組合せを出力する音源量子化部とを有す
る。A speech coding apparatus according to a sixth aspect of the present invention calculates a pitch prediction signal by obtaining a spectrum parameter calculation unit for obtaining and quantizing a spectrum parameter from an input speech signal and a delay corresponding to a pitch period from the speech signal. Then, an adaptive codebook unit for performing pitch prediction and a sound source signal of the speech signal are composed of a number M of pulses of which amplitude is non-zero, and the pitch prediction signal is preliminarily compared with the pitch prediction signal in a section having a length equal to the pitch period from the beginning. Obtaining a sample position that satisfies the specified condition, based on the position after being shifted from the position using each of a plurality of types of shift amount, further candidates for the position to which the pulse is to be shifted while being shifted by the pitch period. A sound source quantization unit that sets and searches for the position and outputs the best combination of the shift amount and the pulse position.

【００１４】ここで、音源量子化部において、複数個の
パルスの振幅もしくは極性をまとめて量子化するための
コードブックを有する。Here, the excitation quantizer has a codebook for collectively quantizing the amplitudes or polarities of a plurality of pulses.

【００１５】本発明の第７の態様による音声符号化装置
は、入力音声信号からスペクトルパラメータを求めて量
子化するスペクトルパラメータ計算部と、入力音声信号
から特徴量を抽出して複数のモードを判別し出力するモ
ード判別部と、前記音声信号からピッチ周期に当たる遅
延を求めピッチ予測信号を計算しピッチ予測を行なう適
応コードブック部と、前記音声信号の音源信号を個数Ｍ
の振幅が非零のパルスで構成し、予め定められたモード
の場合に、前記ピッチ予測信号に対して予め定められた
条件を満たすサンプル位置を求め、前記位置をもとに、
前記パルスの位置を探索する範囲を設定し、前記範囲に
対して最良を探索し出力する音源量子化部とを有する。A speech coding apparatus according to a seventh aspect of the present invention comprises a spectrum parameter calculation section for obtaining and quantizing a spectrum parameter from an input speech signal, and a feature quantity extracted from the input speech signal to discriminate a plurality of modes. And a mode discriminator that outputs the speech signal, an adaptive codebook unit that calculates a pitch prediction signal by obtaining a delay corresponding to a pitch period from the speech signal, and performs pitch prediction,
The amplitude of a non-zero pulse, in the case of a predetermined mode, to obtain a sample position that satisfies a predetermined condition for the pitch prediction signal, based on the position,
A sound source quantizer that sets a range for searching the position of the pulse and searches for and outputs the best for the range.

【００１６】ここで、前記特徴量は平均ピッチ予測ゲイ
ンであり、また前記モード判別部は前記平均ピッチ予測
ゲインと予め定められた複数個のしきい値との比較結果
に基づいてモードを判別する。Here, the feature quantity is an average pitch prediction gain, and the mode discriminating section discriminates a mode based on a result of comparison between the average pitch prediction gain and a plurality of predetermined threshold values. .

【００１７】本発明の第８の態様による音声符号化装置
は、入力音声信号からスペクトルパラメータを求めて量
子化するスペクトルパラメータ計算部と、前記音声信号
からピッチ周期に対応する遅延を求めてピッチ予測信号
を計算し、ピッチ予測を行なう適応コードブック部と、
前記適応コードブックで求めたピッチ予測信号に対して
予め定められた条件を満たす位置を求め、求められた位
置に基づいて音源信号を表わす複数個のパルスの位置の
探索範囲を設定し、この探索範囲の中で前記複数個のパ
ルスの最良の位置を探索する音源量子化部とを備えて成
る。A speech coding apparatus according to an eighth aspect of the present invention comprises a spectrum parameter calculation unit for obtaining and quantizing a spectrum parameter from an input speech signal, and a pitch prediction by obtaining a delay corresponding to a pitch period from the speech signal. An adaptive codebook section that calculates a signal and performs pitch prediction,
A position satisfying a predetermined condition is found with respect to the pitch prediction signal obtained by the adaptive codebook, a search range of positions of a plurality of pulses representing a sound source signal is set based on the obtained position, and this search is performed. And a sound source quantizer for searching for the best position of the plurality of pulses in the range.

【００１８】[0018]

【実施態様】図１は本発明による音声符号化装置の第１
の実施の形態を示すブロック図である。図１において、
入力端子１００から音声信号が入カされ、フレーム分割
回路１１０では上記音声信号がフレーム（例えば１０ｍ
ｓ）毎に分割され、サブフレーム分割回路１２０では、
上記フレーム音声信号をフレームよりも短いサブフレー
ム（例えば、５ｍｓ）に分割される。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS FIG. 1 is a first block diagram of a speech coder according to the present invention.
It is a block diagram showing an embodiment of. In FIG.
An audio signal is input from the input terminal 100, and the frame division circuit 110 outputs the audio signal into a frame (for example, 10 m).
s), and the sub-frame division circuit 120
The frame audio signal is divided into subframes (for example, 5 ms) shorter than the frame.

【００１９】スペクトルパラメータ計算回路２００は、
少なくとも一つのサブフレームの音声信号に対して、サ
ブフレーム長よりも長い窓（例えば、２４ｍｓ）をかけ
て音声を切り出してスペクトルパラメータを予め定めら
れた次数（例えばＰ＝１０次）計算する。ここで、スペ
クトルパラメータの計算には、周知のＬＰＣ分析や、Ｂ
ｕｒｇ分析等を用いることができる。ここでは、Ｂｕｒ
ｇ分析を用いることとする。Ｂｕｒｇ分析の詳細につい
ては、中溝著による”信号解析とシステム同定”と題し
た単行本（コロナ社１９８８年刊）の８２〜８７頁（文
献４）等に記載されているので説明は省略する。更に、
スペクトルパラメータ計算部は、Ｂｕｒｇ法により計算
された線形予測係数α_i（ｉ＝１，・・・，１０）を量
子化や補間に適したＬＳＰパラメータに変換する。ここ
で、線形予測係数からＬＳＰへの変換は、菅村他によ
る”線スペクトル対（ＬＳＰ）音声分析合成方式による
音声情報圧縮”と題した論文（電子通信学会論文誌、Ｊ
６４−Ａ、ｐｐ．５９９−６０６、１９８１年）（文献
５）を参照することができる。例えば、第２サブフレー
ムでＢｕｒｇ法により求めた線形予測係数を、ＬＳＰパ
ラメータに変換し、第１サブフレームのＬＳＰを直線補
間により求めて、第１サブフレームのＬＳＰを逆変換し
て線形予測係数に戻し、第１，２サブフレームの線形予
測係数α_il、ｉ＝１，・・・，１０、ｌ＝１，・・・，
２）を聴感重み付け回路２３０に出力する。また、第２
サブフレームのＬＳＰをスペクトルパラメータ量子化回
路２１０へ出力する。The spectrum parameter calculation circuit 200 is
For a voice signal of at least one subframe, a voice is cut out by applying a window longer than the subframe length (for example, 24 ms), and a spectrum parameter is calculated in a predetermined order (for example, P = 10th order). Here, the calculation of the spectral parameters includes the well-known LPC analysis and B
Urg analysis or the like can be used. Here, Bur
g analysis will be used. The details of the Burg analysis are described in the book “Signal Analysis and System Identification” by Nakamizo (page 1988), pages 82 to 87 (Corona Publishing Co., Ltd.), and therefore the description thereof is omitted. Furthermore,
The spectrum parameter calculation unit converts the linear prediction coefficient α _i (i = 1, ..., 10) calculated by the Burg method into an LSP parameter suitable for quantization and interpolation. Here, the conversion from linear prediction coefficient to LSP is performed by Sugamura et al., "Speech information compression by line spectrum pair (LSP) speech analysis and synthesis method" (Journal of the Institute of Electronics and Communication Engineers, J.
64-A, pp. 599-606, 1981) (reference 5). For example, the linear prediction coefficient obtained by the Burg method in the second subframe is converted into an LSP parameter, the LSP of the first subframe is obtained by linear interpolation, and the LSP of the first subframe is inversely transformed to obtain the linear prediction coefficient. , The linear prediction coefficient α _il of the first and second subframes, i = 1, ..., 10, l = 1 ,.
2) is output to the perceptual weighting circuit 230. Also, the second
The LSP of the subframe is output to the spectrum parameter quantization circuit 210.

【００２０】スペクトルパラメータ量子化回路２１０
は、予め定められたサブフレームのＬＳＰパラメータを
コードブック２２０を用いて効率的に量子化し、下式の
歪みを景小化する量子化値を出力する。Spectral parameter quantization circuit 210
Efficiently quantizes the LSP parameter of a predetermined subframe using the codebook 220, and outputs a quantized value that reduces the distortion of the following equation.

【数１】ここで、ＬＳＰ（ｉ），ＱＬＳＰ（ｉ）、Ｗ（ｉ）はそ
れぞれ、量子化前のｉ次目のＬＳＰ、コードブック２２
０に格納されたｊ番目のコードベクトル、重み係数であ
る。[Equation 1] Here, LSP (i), QLSP (i), and W (i) are the i-th order LSP and the codebook 22 before quantization, respectively.
It is the j-th code vector and weighting coefficient stored in 0.

【００２１】以下では、量子化法として、ベクトル量子
化を用いるものとし、第２サブフレームのＬＳＰパラメ
ータを量子化するものとする。ＬＳＰパラメータのベク
トル量子化の手法としては周知の手法を用いることがで
きる。具体的な手法は、例えば、特開平４−１７１５０
０号公報（特願平５−２９７６００号）（文献６）、特
開平４−３６３０００号公報（特願平３−２６１９２５
号）（文献７）、特開平５−６１９９号公報（特願平３
−１５５０４９号）（文献８）、T.Nomura etal.,によ
る“LSP Coding Using VQSVQ with Interpolation in
4.075kbps M-LCELP Speech Coder”と題した論文（Pro
c.Mobile Multimedia Communications,pp.B.2.5,1993）
（文献９）等を参照できるのでここでは説明は略する。In the following, it is assumed that vector quantization is used as the quantization method and the LSP parameter of the second subframe is quantized. A well-known method can be used as a method for vector quantization of the LSP parameter. A specific method is, for example, Japanese Patent Laid-Open No. 4-17150.
No. 0 (Japanese Patent Application No. 5-297600) (Reference 6) and Japanese Patent Laid-Open No. 4-363000 (Japanese Patent Application No. 3-261925).
(Japanese Patent Application Laid-Open No. Hei 3-6199)
−155049) (reference 8), T. Nomura et al., “LSP Coding Using VQSVQ with Interpolation in
4.075kbps M-LCELP Speech Coder ”(Pro
c.Mobile Multimedia Communications, pp.B.2.5, 1993)
(Reference 9) and the like can be referred to, so the description is omitted here.

【００２２】また、スペクトルパラメータ量子化回路２
１０は、第２サブフレームで量子化したＬＳＰパラメー
タをもとに、第１サブフレームのＬＳＰパラメータを復
元する。ここでは、現フレームの第２サブフレームの量
子化ＬＳＰパラメータと１つ過去のフレームの第２サブ
フレームの量子化ＬＳＰを直線補間して、第１サブフレ
ームのＬＳＰを復元する。ここで、量子化前のＬＳＰと
量子化後のＬＳＰとの誤差電力を量子化するコードベク
トルを１種類選択した後に、直線補間により第１サブフ
レームのＬＳＰを復元できる。Further, the spectrum parameter quantization circuit 2
10 restores the LSP parameter of the first subframe based on the LSP parameter quantized in the second subframe. Here, the quantized LSP parameter of the second subframe of the current frame and the quantized LSP of the second subframe of the frame one previous frame are linearly interpolated to restore the LSP of the first subframe. Here, after selecting one type of code vector for quantizing the error power of the LSP before quantization and the LSP after quantization, the LSP of the first subframe can be restored by linear interpolation.

【００２３】以上により復元した第１サブフレームのＬ
ＳＰと第２サブフレームの量子化ＬＳＰをサブフレーム
毎に線形予測係数α_il'（ｉ＝１，・・・，１０，ｌ＝
１，…，２）に変換し、インパルス応答計算回路３１０
へ出力する。また、第２サブフレームの量子化ＬＳＰの
コードベクトルを表わすインデクスをマルチプレクサ４
００に出力する。The L of the first subframe restored as described above
A linear prediction coefficient α _il '(i = 1, ..., 10, l =
1, ..., 2), and the impulse response calculation circuit 310
Output to. In addition, the multiplexer 4 calculates the index representing the code vector of the quantized LSP of the second subframe.
Output to 00.

【００２４】聴感重み付け回路２３０は、スペクトルパ
ラメータ計算回路２００から、各サブフレーム毎に量子
化前の線形予測係数α_ij'（ｉ＝１，・・・，Ｐ）を入
力し、前記文献１に基づき、サブフレームの音声信号に
対して聴感重み付けを行い、聴感重み付け信号を出力す
る。The perceptual weighting circuit 230 inputs the linear prediction coefficient α _ij ′ (i = 1, ..., P) before quantization for each subframe from the spectrum parameter calculation circuit 200, and the above-mentioned reference 1 is used. Based on this, the perceptual weighting is performed on the audio signal of the sub-frame, and the perceptual weighting signal is output.

【００２５】応答信号計算回路２４０は、スペクトルパ
ラメータ計算回路２００から、各サブフレーム毎に線形
予測係数α_iを入力し、スペクトルパラメータ量子化回
路２１０から、量子化、補間して復元した線形予測係数
α_i'をサブフレーム毎に入力し、保存されているフィル
タメモリの値を用いて、入力信号を零ｄ（ｎ）＝０とし
た応答信号を１サブフレーム分計算し、減算器２３５へ
出力する。ここで、応答信号ｘ_z（ｎ）は下式で表され
る。The response signal calculation circuit 240 receives the linear prediction coefficient α _i for each sub-frame from the spectrum parameter calculation circuit 200, and the spectrum parameter quantization circuit 210 quantizes and interpolates and restores the linear prediction coefficient α _i. Input α _i 'for each subframe, calculate the response signal for one subframe with the input signal being zero d (n) = 0 using the value of the stored filter memory, and output it to the subtractor 235. To do. Here, the response signal x _z (n) is expressed by the following equation.

【数２】但し、ｎ−ｉ≦０のときは[Equation 2] However, when n−i ≦ 0

【数３】 [Equation 3]

【数４】ここで、Ｎはサブフレーム長を示す。γは、聴感重み付
け量を制御する重み係数であり、下記の式（６）と同一
の値である。ｓ_w（ｎ）、ｐ（ｎ）は、それぞれ、重み
付け信号計算回路の出力信号、後述の式（６）における
右辺第１項のフィルタの分母の項の出力信号をそれぞれ
示す。[Equation 4] Here, N indicates the subframe length. γ is a weighting coefficient that controls the perceptual weighting amount, and has the same value as the following equation (6). s _w (n) and p (n) respectively represent the output signal of the weighting signal calculation circuit, and the output signal of the denominator term of the filter of the first term on the right side in Expression (6) described later.

【００２６】滅算器２３５は、下式により、聴感重み付
け信号から応答信号を１サブフレーム分減算し、ｘ_w'
（ｎ）を適応コードブック回路３００へ出力する。The subtractor 235 subtracts the response signal for one subframe from the perceptual weighting signal according to the following equation, and x _w '
(N) is output to the adaptive codebook circuit 300.

【数５】 [Equation 5]

【００２７】インパルス応答計算回路３１０は、ｚ変換
が下式で表される聴感重み付けフィルタのインパルス応
答ｈ_w（ｎ）を予め定められた点数Ｌだけ計算し、適応
コードブック回路３００、音源量子化回路３５０へ出力
する。The impulse response calculation circuit 310 calculates the impulse response h _w (n) of the perceptual weighting filter whose z-transform is expressed by the following equation, by a predetermined score L, and the adaptive codebook circuit 300 and the excitation quantization. Output to the circuit 350.

【数６】 [Equation 6]

【００２８】適応コードブック回路３００は、重み付け
信号計算回路３６０から遇去の音源信号ｖ（ｎ）を、減
算器２３５から出力信号ｘ_w'（ｎ）を、インパルス応答
計算回路３１０から聴感重み付けインパルス応答ｈ
_w（ｎ）を入力する。ピッチ周期に対応する遅延Ｔを下
式の歪みを最小化するように求め、遅延を表わすインデ
クスをマルチプレクサ４００に出力する。The adaptive codebook circuit 300 receives the sound source signal v (n) from the weighted signal calculation circuit 360, the output signal x _w '(n) from the subtractor 235, and the perceptual weighted impulse from the impulse response calculation circuit 310. Response h
Enter _w (n). The delay T corresponding to the pitch period is calculated so as to minimize the distortion in the following equation, and the index representing the delay is output to the multiplexer 400.

【数７】ここで、[Equation 7] here,

【数８】はピッチ予測信号を示し、記号＊は畳み込み演算を表わ
す。ゲインβは下式に従い求める。[Equation 8] Indicates a pitch prediction signal, and the symbol * indicates a convolution operation. The gain β is calculated according to the following formula.

【数９】 [Equation 9]

【００２９】ここで、女性音や、子供の声に対して、遅
延の抽出楕度を向上させるために、遅延を整数サンブル
ではなく、小数サンブル値で求めてもよい。具体的な方
法は、例えぱ、P.Kroonらによる、“Pitch predictors
with high temporal resolution"と題した論文（Proc.I
CASSP,pp.661-664,1990年）（文献１０）等を参照する
ことができる。Here, in order to improve the extraction ellipticity of the delay with respect to a female sound or a child's voice, the delay may be obtained with a decimal sample number instead of an integer sample number. A concrete method is, for example, “Pitch predictors” by P. Kroon et al.
A paper entitled "with high temporal resolution" (Proc.I
CASSP, pp.661-664, 1990) (Reference 10) and the like can be referred to.

【００３０】更に、適応コードブック回路３００は、選
択された遅延とゲインを用いて下式に従いピッチ予測を
行ない、予測残差信号ｚ_w（ｎ）を音源量子化回路３５
０へ出力する。Further, the adaptive codebook circuit 300 performs pitch prediction according to the following equation using the selected delay and gain, and outputs the prediction residual signal z _w (n) to the excitation quantization circuit 35.
Output to 0.

【数１０】更に、選択された遅延を用いたピッチ予測信号を音源量
子化回路３５０へ出力する。[Equation 10] Further, the pitch prediction signal using the selected delay is output to the excitation quantization circuit 350.

【００３１】音源量子化回路３５０では、サブフレーム
に対して、振幅が非零のＭ個のパルスをたてる。Excitation quantization circuit 350 produces M pulses of non-zero amplitude for a subframe.

【００３２】音源量子化回路３５０の構成を示すブロッ
ク図を図２に示す。絶対値最大位置検出回路３５１は、
ピッチ予測信号ｙ_w（ｎ）に対して、予め定められた条
件を満たすサンプル位置を検出する。ここでは、「振幅
の絶対値が最大」という条件を使用し、それを満たすサ
ンプル位置を検出し、位置探索範囲設定回路３５２へ出
力する。A block diagram showing the configuration of the excitation quantization circuit 350 is shown in FIG. The absolute maximum position detection circuit 351
A sample position satisfying a predetermined condition is detected with respect to the pitch prediction signal y _w (n). Here, the condition that “the absolute value of the amplitude is maximum” is used, and the sample position satisfying the condition is detected and output to the position search range setting circuit 352.

【００３３】位置探索範囲設定回路３５２は、入力した
サンプル位置に対して予め定められた固定のサンプル数
Ｌだけ未来あるいは過去にずらした後に、各パルスの位
置の探索範囲を設定する。The position search range setting circuit 352 sets the search range of the position of each pulse after shifting the input sample position by a predetermined fixed number L of samples in the future or in the past.

【００３４】例えば、入力したサンプル位置をＤとし、
５ｍｓサブフレーム（４０サンプル）に５個のパルスを
求める例を考えると、各パルスの探索範囲に含まれる位
置の候補の例は下表のようになる。第１パルスＤ−Ｌ，Ｄ−Ｌ＋５，．．．第２パルスＤ−Ｌ＋１，Ｄ−Ｌ＋６，．．．第３パルスＤ−Ｌ＋２，Ｄ−Ｌ＋７，．．．第４パルスＤ−Ｌ＋３，Ｄ−Ｌ＋８，．．．第５パルスＤ−Ｌ＋４，Ｄ−Ｌ＋９，．．．For example, let D be the input sample position,
Considering an example in which five pulses are obtained in a 5 ms subframe (40 samples), examples of candidates for positions included in the search range of each pulse are as shown in the table below. First pulse D-L, D-L + 5 ,. ．． Second pulse D-L + 1, D-L + 6 ,. ．． Third pulse D-L + 2, D-L + 7 ,. ．． Fourth pulse D-L + 3, D-L + 8 ,. ．． Fifth pulse D-L + 4, D-L + 9 ,. ．．

【００３５】次に、ｚ_w（ｎ），ｈ_w（ｎ）を入力し、第
１の相関関数計算回路３５３、第２の相関関数計算回路
３５４は、それぞれ、下式に従い、第１の相関関数ｄ
（ｎ）、第２の相関関数φを計算する。Next, by inputting z _w (n) and h _w (n), the first correlation function calculating circuit 353 and the second correlation function calculating circuit 354 respectively calculate the first correlation function according to the following equation. Function d
(N), the second correlation function φ is calculated.

【数１１】 [Equation 11]

【数１２】 [Equation 12]

【００３６】パルス極性設定回路３５５は、位置探索範
囲設定回路３５２で設定された探索範囲における各パル
スの候補位置に対して、第１の相関関数ｄ（ｎ）の極性
を抽出し出力する。The pulse polarity setting circuit 355 extracts and outputs the polarity of the first correlation function d (n) for the candidate position of each pulse in the search range set by the position search range setting circuit 352.

【００３７】パルス位置探索回路３５６は、上表に示し
た候補位置の組合せに対して次式を計算し、次式を最大
化する位置を最適位置として選択する。The pulse position searching circuit 356 calculates the following formula for the combination of candidate positions shown in the above table, and selects the position that maximizes the following formula as the optimum position.

【数１３】ここで、パルスの個数をＭとすると、[Equation 13] Here, if the number of pulses is M,

【数１４】 [Equation 14]

【数１５】である。ここで、ｓｉｇｎ（ｋ）は、ｋ番目のパルスの
極性を示し、パルス極性設定回路３５５にて予め抽出し
たものを使用する。以上により、Ｍ個のパルスの極性と
位置がゲイン量子化回路３６５に出力される。[Equation 15] Is. Here, sign (k) indicates the polarity of the k-th pulse, and the one extracted in advance by the pulse polarity setting circuit 355 is used. By the above, the polarities and positions of the M pulses are output to the gain quantization circuit 365.

【００３８】また、パルスの位置を予め定められたビッ
ト数で量子化し、位置を表わすインデクスをマルチプレ
クサに出力する。また、パルスの極性をマルチプレクサ
４００に出力する。Further, the pulse position is quantized by a predetermined number of bits, and an index representing the position is output to the multiplexer. In addition, the polarity of the pulse is output to the multiplexer 400.

【００３９】ゲイン量子化回路３６５は、ゲインコード
ブック３６７からゲインコードベクトルを読み出し、選
択された位置に対して、下式を最小化するゲインコード
ベクトルを選択し、最終的に歪みを最小化する振幅コー
ドベクトルとゲインコードベクトルの組合せを選択す
る。The gain quantization circuit 365 reads the gain code vector from the gain code book 367, selects the gain code vector that minimizes the following expression for the selected position, and finally minimizes the distortion. Select the combination of amplitude code vector and gain code vector.

【００４０】ここでは、適応コードブックのゲインβ’
と、パルスで表わした音源のゲインＧ’の２種のゲイン
を同時にベクトル量子化する例について示す。Here, the gain β'of the adaptive codebook
And an example of simultaneously vector-quantizing two types of gains of a sound source gain G ′ represented by pulses.

【数１６】ここで、βt’、Ｇt’は、ゲインコードブック３６７に
格納された２次元ゲインコードベクトルにおけるｔ番目
の要素である。上式の計算を、ゲインコードベクトルの
各々に対して繰り返し、歪みＤｔを最小化するゲインコ
ードベクトルを選択する。選択されたゲインコードベク
トルを表わすインデクスをマルチプレクサ４００に出力
する。[Equation 16] Here, βt ′ and Gt ′ are the t-th element in the two-dimensional gain code vector stored in the gain codebook 367. The calculation of the above equation is repeated for each gain code vector, and the gain code vector that minimizes the distortion Dt is selected. The index representing the selected gain code vector is output to the multiplexer 400.

【００４１】重み付け信号計算回路３６０は、それぞれ
のインデクスを入力し、インデクスからそれに対応する
コードベクトルを読み出し、まず下式に基づき駆動音源
信号ｖ（ｎ）を求める。The weighting signal calculation circuit 360 inputs each index, reads the code vector corresponding to the index, and first obtains the driving sound source signal v (n) based on the following equation.

【数１７】ｖ（ｎ）は適応コードブック回路３００に出力される。[Equation 17] v (n) is output to the adaptive codebook circuit 300.

【００４２】次に、スペクトルパラメータ計算回路２０
０の出力パラメータ、スペクトルパラメータ量子化回路
２１０の出力パラメータを用いて下式により、応答信号
ｓ_w（ｎ）をサブフレーム毎に計算し、応答信号計算回
路２４０へ出力する。Next, the spectrum parameter calculation circuit 20
Using the output parameter of 0 and the output parameter of the spectrum parameter quantization circuit 210, the response signal s _w (n) is calculated for each subframe by the following equation and output to the response signal calculation circuit 240.

【数１８】 [Equation 18]

【００４３】第２の実施の形態を示すブロック図を図３
に示す。ここでは、音源量子化回路４５０の動作が図１
と異なる。FIG. 3 is a block diagram showing the second embodiment.
Shown in. Here, the operation of the excitation quantization circuit 450 is shown in FIG.
Different from

【００４４】音源量子化回路４５０の構成を図４に示
す。音源量子化回路４５０は、予測信号ｙ_w（ｎ）、予
測残差信号ｚ_w（ｎ）、聴感重み付けインパルス応答ｈ_w
（ｎ）のみならず、適応コードブックの遅延Ｔを入力す
る。The structure of the excitation quantization circuit 450 is shown in FIG. The excitation quantization circuit 450 has a prediction signal y _w (n), a prediction residual signal z _w (n), and a perceptual weighting impulse response h _w.
Input not only (n), but the delay T of the adaptive codebook.

【００４５】絶対値最大位置計算回路４５１は、ピッチ
周期に相当する遅延Ｔを入力し、ピッチ予測信号ｙ
_w（ｎ）に対して、サブフレームの先頭からＴサンプル
までの範囲で絶対値を最大にするサンプル位置を検出
し、位置探索範囲設定回路３５２出力する。The maximum absolute value position calculation circuit 451 inputs the delay T corresponding to the pitch period, and outputs the pitch prediction signal y.
_{For w} (n), the sample position that maximizes the absolute value is detected in the range from the beginning of the subframe to T samples, and the position search range setting circuit 352 is output.

【００４６】第３の実施の形態を示すブロック図を図５
に示す。ここでは、音源量子化回路５００の動作が図３
と異なる。音源量子化回路５５０の構成図を図６に示
す。FIG. 5 is a block diagram showing the third embodiment.
Shown in. Here, the operation of the excitation quantization circuit 500 is shown in FIG.
Different from FIG. 6 shows a configuration diagram of the excitation quantization circuit 550.

【００４７】位置探索範囲設定回路５５２は、入力した
サンプル位置に対して予め定められた固定のサンプル数
Ｌだけ未来あるいは過去にずらした位置を基点とし、遅
延Ｔだけずらしながら、各パルスの位置の候補を設定
し、パルス位置探索回路３５６に出力する。The position search range setting circuit 552 uses the position shifted in the future or the past by a predetermined fixed number of samples L with respect to the input sample position as a base point, and shifts by the delay T to determine the position of each pulse. The candidates are set and output to the pulse position search circuit 356.

【００４８】例えば、入力したサンプル位置をＤとし、
５ｍｓサブフレーム（４０サンプル）に５個のパルスを
求める例を考えると、各パルスの位置の候補の例は下表
のようになる。第１パルスＤ−Ｌ，Ｄ−Ｌ＋Ｔ，… 第２パルスＤ−Ｌ＋１，Ｄ−Ｌ＋Ｔ，… 第３パルスＤ−Ｌ＋２，Ｄ−Ｌ＋Ｔ，… 第４パルスＤ−Ｌ＋３，Ｄ−Ｌ＋Ｔ，… 第５パルスＤ−Ｌ＋４，Ｄ−Ｌ＋Ｔ，…For example, let the input sample position be D,
Considering an example in which 5 pulses are obtained in a 5 ms subframe (40 samples), examples of candidates for the position of each pulse are as shown in the table below. First pulse D-L, D-L + T, ... Second pulse D-L + 1, D-L + T, ... Third pulse D-L + 2, D-L + T, ... Fourth pulse D-L + 3, D-L + T, ... Fifth pulse Pulse D-L + 4, D-L + T, ...

【００４９】第４の実施の形態を示すブロック図を図７
に示す。ここでは、第１の実施の形態において、振幅コ
ードブックを使用する例について説明するが、第２、第
３の実施の形態に対して振幅コードブックを使用する場
合も同様の変更により実現できる。FIG. 7 is a block diagram showing the fourth embodiment.
Shown in. Here, an example in which the amplitude codebook is used in the first embodiment will be described, but the same changes can be made when the amplitude codebook is used in the second and third embodiments.

【００５０】図７は、図１に比べ、音源量子化回路３９
０と振幅コードブック３９５が異なっている。音源量子
化回路３９０の構成を図８に示す。振幅コードブック３
９５を用いてパルスの振幅を量子化する。FIG. 7 is different from FIG. 1 in that the excitation quantization circuit 39
0 and the amplitude codebook 395 are different. The configuration of the excitation quantization circuit 390 is shown in FIG. Amplitude Codebook 3
Quantize the amplitude of the pulse using 95.

【００５１】パルス位置探索回路３５６においてＭ個の
パルスに対して位置が求まった後で、振幅量子化回路３
９７において、次式を最大化するように、振幅コードベ
クトルを振幅コードブック３９５から選択し、インデク
スを出力する。After the positions have been obtained for M pulses in the pulse position search circuit 356, the amplitude quantization circuit 3
At 97, the amplitude codevector is selected from the amplitude codebook 395 so as to maximize the following expression and the index is output.

【数１９】ここで、[Formula 19] here,

【数２０】 [Equation 20]

【数２１】である。ここで、ｇ_k,jは、ｋ番目のパルスのｊ番目の
振幅コードベクトルである。[Equation 21] Is. Here, g _{k, j} is the j-th amplitude code vector of the k-th pulse.

【００５２】音源量子化回路３９０は、選択された振幅
コードベクトルを表わすインデクスをマルチプレクサ４
００に出力する。また、位置の値、振幅コードベクトル
の値をゲイン量子化回路４００に出力する。The source quantization circuit 390 uses the multiplexer 4 as an index representing the selected amplitude code vector.
Output to 00. Also, the position value and the amplitude code vector value are output to the gain quantization circuit 400.

【００５３】なお、本実施例では、振幅コードブックを
使用したが、代わりに、各パルスの極性を示す極性コー
ドブックを使用して探索してもよい。Although the amplitude codebook is used in this embodiment, a polarity codebook indicating the polarity of each pulse may be used instead for the search.

【００５４】図９は、第５の実施の形態を示すブロック
図である。図において、音源量子化回路６００の動作が
図１と異なるので、図１０を用いて構成を説明する。FIG. 9 is a block diagram showing the fifth embodiment. In the figure, the operation of the excitation quantization circuit 600 is different from that in FIG. 1, so the configuration will be described using FIG.

【００５５】図１０は音源量子化回路６００の構成を示
すブロック図である。位置探索範囲設定回路６５２は、
絶対値最大位置検出回路３５１の出力位置に対して、複
数種（例えばＱ種）のずらし量の各々の分だけずらした
位置を基点として各パルスの探索範囲ならびに位置のセ
ットを設定し、パルスの位置の設置のセットをずらし量
の種類分だけパルス極性設定回路６５５とパルス位置探
索回路６５６に出力する。FIG. 10 is a block diagram showing the configuration of the excitation quantization circuit 600. The position search range setting circuit 652 is
With respect to the output position of the absolute maximum position detection circuit 351, a search range of each pulse and a set of positions are set with the position shifted by each of a plurality of types (for example, Q types) of shift amounts as a base point, The position setting sets are output to the pulse polarity setting circuit 655 and the pulse position searching circuit 656 by the amount of the shift amount.

【００５６】パルス極性設定回路６５５は、位置探索回
路６５２の複数種の候補位置の各々に対して極性を抽出
し、パルス位置探索回路６５６へ出力する。The pulse polarity setting circuit 655 extracts the polarity for each of the plural types of candidate positions of the position searching circuit 652 and outputs it to the pulse position searching circuit 656.

【００５７】パルス位置探索回路６５６は、複数種の候
補位置の各々に対して、第１の相関関数、第２の相関関
数、極性を用いて、式（１３）を最大化する位置を探索
する。この処理をずらしの種類であるＱ回操り返し、Ｑ
種の中で、式（１３）を最大化する位置を最終的に選択
し、各パルスの位置と、ずらし量とを出力する。なお、
ずらし量はマルチプレクサ４００に出力される。The pulse position search circuit 656 searches for a position that maximizes the equation (13) by using the first correlation function, the second correlation function, and the polarity for each of the plurality of types of candidate positions. . This process is repeated Q times, which is a type of shift,
Among the seeds, the position that maximizes the equation (13) is finally selected, and the position of each pulse and the shift amount are output. In addition,
The shift amount is output to the multiplexer 400.

【００５８】図１１は、第６の実施の形態を示すブロッ
ク図である。図において、音源量子化回路６５０の動作
が図３と異なるので、図１２を用いて構成を説明する。FIG. 11 is a block diagram showing a sixth embodiment. In the figure, the operation of the excitation quantization circuit 650 is different from that in FIG. 3, so the configuration will be described using FIG.

【００５９】図１２は音源量子化回路６５０の構成を示
すブロック図である。位置探索範囲設定回路６５２は、
絶対値最大位置検出回路４５１の出力位置に対して、複
数種（例えばＱ種）のずらし量の各々の分だけずらした
位置を基点として、各パルスの位置を設定し、パルスの
位置のセットをずらし量の種類分だけパルス極性設定回
路６５５とパルス位置探索回路６５６に出力する。FIG. 12 is a block diagram showing the structure of the excitation quantization circuit 650. The position search range setting circuit 652 is
With respect to the output position of the absolute maximum position detection circuit 451, each pulse position is set with the position shifted by each of the displacement amounts of a plurality of types (for example, Q types) as a base point, and the pulse position is set. It outputs to the pulse polarity setting circuit 655 and the pulse position searching circuit 656 by the amount of the shift amount.

【００６０】パルス極性設定回路６５５は、位置探索回
路６５２の複数種の候補位置の各々に対して極性を抽出
し、パルス位置探索回路６５６へ出力する。The pulse polarity setting circuit 655 extracts the polarity for each of the plurality of types of candidate positions of the position searching circuit 652 and outputs it to the pulse position searching circuit 656.

【００６１】パルス位置探索回路６５６は、複数種の候
補位置の各々に対して、第１の相関関数、第２の相関関
数、極性を用いて、式（１３）を最大化する位置を探索
する。この処理をずらしの種類であるＱ回繰り返し、Ｑ
種の中で、式（１３）を最大化する位置を最終的に選択
し、各パルスの位置と、ずらし量とを出力する。なお、
ずらし量はマルチプレクサ４００に出力される。The pulse position searching circuit 656 searches for a position that maximizes the equation (13) by using the first correlation function, the second correlation function and the polarity for each of the plurality of types of candidate positions. . This process is repeated Q times, which is the type of shifting,
Among the seeds, the position that maximizes the equation (13) is finally selected, and the position of each pulse and the shift amount are output. In addition,
The shift amount is output to the multiplexer 400.

【００６２】図１３は、第７の実施の形態を示すブロッ
ク図である。図において、音源量子化回路７５０の動作
が図５と異なるので、図１４を用いて構成を説明する。FIG. 13 is a block diagram showing the seventh embodiment. In the figure, the operation of the excitation quantization circuit 750 is different from that in FIG. 5, so the configuration will be described using FIG.

【００６３】図１４は音源量子化回路７５０の構成を示
すブロック図である。位置探索範囲設定回路７５２は、
絶対値最大位置検出回路４５１の出力位置に対して、複
数種（例えば、Ｑ種）のずらし量の各々の分だけずらし
た位置を基点として、更に遅延Ｔだけずらしながら各パ
ルスの位置を設定する。このようにして各パルスの位置
のセットをＱ種類分パルス極性設定回路６５５とパルス
位置探索回路６５６に出力する。FIG. 14 is a block diagram showing the structure of the excitation quantization circuit 750. The position search range setting circuit 752
With respect to the output position of the absolute maximum position detecting circuit 451, the position of each pulse is set while further shifting by the delay T, with the position shifted by each of a plurality of types (for example, Q types) of shift amounts as a base point. . In this way, the set of the position of each pulse is output to the Q type pulse polarity setting circuit 655 and the pulse position searching circuit 656.

【００６４】パルス極性設定回路６５５は、位置探索回
路６５２の複数種の候補位置の各々に対して極性を抽出
し、パルス位置探索回路６５６へ抽出する。The pulse polarity setting circuit 655 extracts the polarity for each of the plurality of types of candidate positions of the position searching circuit 652, and extracts it into the pulse position searching circuit 656.

【００６５】パルス位置探索回路６５６は、複数種の候
補位置の各々に対して、第１の相関関数、第２の相関関
数、極性を用いて、式（１３）を最大化する位置を探索
する。この処理をずらしの種類であるＱ回繰り返し、Ｑ
種の中で、式（１３）を最大化する位置を最終的に選択
し、各パルスの位置と、ずらし量とを出力する。なお、
ずらし量はマルチプレクサ４００に出力される。The pulse position search circuit 656 searches for a position that maximizes the equation (13) by using the first correlation function, the second correlation function, and the polarity for each of the plurality of types of candidate positions. . This process is repeated Q times, which is the type of shifting,
Among the seeds, the position that maximizes the equation (13) is finally selected, and the position of each pulse and the shift amount are output. In addition,
The shift amount is output to the multiplexer 400.

【００６６】図１５は、第８の実施の形態を示すブロッ
ク図である。ここでは、第５の実施例の形態を示すブロ
ック図に、パルスの振幅を量子化する振幅コードブック
を付加する例について示すが、第６、第７の実施の形態
に付加することもできる。FIG. 15 is a block diagram showing the eighth embodiment. Here, an example in which an amplitude codebook for quantizing the pulse amplitude is added to the block diagram showing the form of the fifth embodiment is shown, but it is also possible to add to the sixth and seventh embodiments.

【００６７】図において、音源量子化回路８５０の動作
が図７と異なるので、音声量子化ｋ回路８５０の構成を
図１６を用いて説明する。In the figure, the operation of the excitation quantization circuit 850 is different from that of FIG. 7, so the configuration of the speech quantization k circuit 850 will be described with reference to FIG.

【００６８】図１６は音源量子化回路８５の構成を示す
ブロック図である。位置探索範囲設定回路６５２は、絶
対値最大位置検出回路３５１の出力位置に対して、複数
種（例えぱＱ種）のずらし量の各々の分だサずらした位
置を基点として、各パルスの位置を設定し、パルスの位
置のセットをずらし量の種類分だけパルス極性設定回路
６５５とパルス位置探索回路６５６に出力する。FIG. 16 is a block diagram showing the structure of the excitation quantization circuit 85. The position search range setting circuit 652 sets the position of each pulse with respect to the output position of the absolute maximum position detection circuit 351 as the base point of the position shifted by a plurality of types (for example, Q types) of shift amounts. Is set and the pulse position set is output to the pulse polarity setting circuit 655 and the pulse position searching circuit 656 by the amount of the shift amount.

【００６９】パルス極性設定回路６５５は、位置探索回
路６５２の複数種の候補位置の各々に対して極性を抽出
し、パルス位置探索回路６５６へ出力する。The pulse polarity setting circuit 655 extracts the polarity for each of the plural types of candidate positions of the position searching circuit 652 and outputs it to the pulse position searching circuit 656.

【００７０】パルス位置探索回路６５６は、複数種の候
補位置の各々に対して、第１の相関関数、第２の相関関
数、極性を用いて、式（１３）を最大化する位置を探索
する。この処理をずらしの種類であるＱ回操り返し、Ｑ
種の中で、式（１３）を最大化する位置を最終的に選択
し、各パルスの位置と、ずらし量とを出力する。なお、
ずらし量はマルチプレクサ４００に出力される。振幅量
子化回路３９７は図８と同一の動作を行なう。The pulse position search circuit 656 searches for a position that maximizes the equation (13) by using the first correlation function, the second correlation function, and the polarity for each of the plurality of types of candidate positions. . This process is repeated Q times, which is a type of shift,
Among the seeds, the position that maximizes the equation (13) is finally selected, and the position of each pulse and the shift amount are output. In addition,
The shift amount is output to the multiplexer 400. The amplitude quantization circuit 397 performs the same operation as in FIG.

【００７１】図１７は、第９の実施の形態を示すブロッ
ク図である。ここでは、第１の実施の形態をもとにする
例について示すが、他の実施の形態をもとにすることも
できる。FIG. 17 is a block diagram showing the ninth embodiment. Here, an example based on the first embodiment is shown, but other embodiments can also be used.

【００７２】モード判別回路９００は、聴感重み付け回
路２３０からフレーム単位で聴感重み付け信号を受け取
り、モード判別情報を適応コードブック回路９５０、音
源量子化回路９６０、ゲイン量子化回路９６５とマルチ
プレクサ４００へ出力する。ここでは、モード判別に、
現在のフレームの特徴量を用いる。特徴量としては、例
えば、フレームで平均したピッチ予測ゲインを用いる。
ピッチ予測ゲインの計算は例えば下式を用いる。The mode discrimination circuit 900 receives the perceptual weighting signal from the perceptual weighting circuit 230 on a frame-by-frame basis, and outputs the mode discrimination information to the adaptive codebook circuit 950, the excitation quantization circuit 960, the gain quantization circuit 965 and the multiplexer 400. . Here, for mode discrimination,
The feature amount of the current frame is used. As the feature amount, for example, a pitch prediction gain averaged in frames is used.
For example, the following formula is used to calculate the pitch prediction gain.

【数２２】ここで、Ｌはフレームに含まれるサブフレームの個数で
ある。Ｐi、Ｅiはそれぞれ、ｉ番目のサブフレームでの
音声パワー、ピッチ予測誤差パワーを示す。[Equation 22] Here, L is the number of subframes included in the frame. Pi and Ei indicate the speech power and the pitch prediction error power in the i-th subframe, respectively.

【数２３】 [Equation 23]

【数２４】ここで、Ｔは予測ゲインを最大化する最適遅延である。[Equation 24] Here, T is the optimum delay that maximizes the prediction gain.

【００７３】フレーム平均ピッチ予測ゲインＧを予め定
められた複数個のしきい値と比較して複数種類（例えば
Ｒ種）のモードに分類する。モードの個数Ｒとしては、
例えば４を用いることが出来る。The frame average pitch prediction gain G is compared with a plurality of predetermined threshold values and classified into a plurality of types (for example, R types) of modes. As the number of modes R,
For example, 4 can be used.

【００７４】適応コードブック回路９５０は、モード情
報を受け取り、予め定められたモードの場合に、図１の
適応コードブック回路３００と同一の動作を行い、遅
延、適応コードブック予測信号、予測残差信号を出力す
る。その他のモードに対しては、減算器２３５からの入
力信号をそのまま出力する。The adaptive codebook circuit 950 receives the mode information, and in the case of a predetermined mode, performs the same operation as the adaptive codebook circuit 300 of FIG. 1, delay, adaptive codebook prediction signal, prediction residual error. Output a signal. For other modes, the input signal from the subtractor 235 is output as it is.

【００７５】音源量子化回路９６０は、モード情報を受
け取り、予め定められたモードの際に図１の音源量子化
回路３５０と同一の動作を行う。The source quantization circuit 960 receives the mode information and performs the same operation as the source quantization circuit 350 of FIG. 1 in the predetermined mode.

【００７６】ゲイン量子化回路９６５は、モード情報を
入力し、モード毎に設計された複数種のゲインコードブ
ック３６７₁から３６７_Rを切り替えてゲイン量子化に使
用する。ゲイン量子化の動作は図１のゲイン量子化回路
３６５と同一である。The gain quantization circuit 965 inputs the mode information, switches a plurality of types of gain codebooks 367 ₁ to 367 _R designed for each mode, and uses them for gain quantization. The operation of gain quantization is the same as that of the gain quantization circuit 365 shown in FIG.

【００７７】上述した実施形態例に限らず、種々の変形
が可能である。例えば、複数パルスの振幅を量子化する
ためのコードブックを、音声信号を用いて予め学習して
格納しておくこともできる。コードブックの学習法は、
例えば、Linde氏らによる“An algorithm for vector
quantization design"と題した論文（IEEE Trans.Commu
n.,pp．８４−９５，Januay,１９８０）（文献１１）等
を参照できる。The present invention is not limited to the embodiment described above, but various modifications are possible. For example, a codebook for quantizing the amplitudes of a plurality of pulses can be learned and stored in advance using a voice signal. The codebook learning method is
For example, Linde et al. “An algorithm for vector
Paper entitled "quantization design" (IEEE Trans.Commu
n., pp. 84-95, Januay, 1980) (Reference 11) and the like.

【００７８】振幅コードブックの代わりに、パルスの個
数に等しいビット数だけ各パルスの極性の組み合わせを
用意した極性コードブックを有するようにしてもよい。Instead of the amplitude codebook, it is possible to have a polarity codebook in which a combination of the polarities of each pulse is prepared by the number of bits equal to the number of pulses.

【００７９】[0079]

【発明の効果】以上説明したように、本発明によれば、
音源量子化部において、適応コードブックで求めたピッ
チ予測信号に対して予め定められた条件を満たす位置を
求め、前記位置を基に、音源信号を表わす複数個のパル
スの位置の探索範囲を設定し、この範囲の中で最良の位
置を探索する。これにより、パルスの位置の探索範囲を
ピッチ波形に同期させて、ピッチ波形を表わすための音
源信号を良好に表わすことが出来るので、ビットレート
を低減化しても、従来方式に比べ良好な音質が得られ
る。As described above, according to the present invention,
In the excitation quantizer, a position satisfying a predetermined condition is obtained for the pitch prediction signal obtained by the adaptive codebook, and a search range of positions of a plurality of pulses representing the excitation signal is set based on the position. Then, search for the best position within this range. As a result, since the search range of the pulse position can be synchronized with the pitch waveform and the sound source signal for expressing the pitch waveform can be expressed satisfactorily, even if the bit rate is reduced, better sound quality than the conventional method can be obtained. can get.

【００８１】更に本発明によれば、入力音声から特徴量
を抽出して複数のモードを判別し、予め定められたモー
ドにおいて、音源量子化部で上述の処理を行うことによ
り、音声の周期性が強いモード部分に対する音質を改善
することが出来る。Further, according to the present invention, the feature quantity is extracted from the input speech to discriminate a plurality of modes, and the sound source quantizing section performs the above-described processing in a predetermined mode to thereby obtain the periodicity of the speech. It can improve the sound quality for the strong mode part.

[Brief description of drawings]

【図１】本発明による音声符号化装置の第１の実施形態
を示す構成ブロック図である。FIG. 1 is a configuration block diagram showing a first embodiment of a speech encoding apparatus according to the present invention.

【図２】第１の実施の形態における音源量子化回路３５
０の構成を示す図である。FIG. 2 is an excitation quantization circuit 35 according to the first embodiment.
It is a figure which shows the structure of 0.

【図３】本発明による音声符号化装置の第２の実施形態
を示す構成ブロック図である。FIG. 3 is a configuration block diagram showing a second embodiment of a speech encoding device according to the present invention.

【図４】第２の実施の形態における音源量子化回路４５
０の構成を示す図である。FIG. 4 is an excitation quantization circuit 45 according to the second embodiment.
It is a figure which shows the structure of 0.

【図５】本発明による音声符号化装置の第３の実施形態
を示す構成ブロック図である。FIG. 5 is a configuration block diagram showing a third embodiment of a speech encoding apparatus according to the present invention.

【図６】第３の実施の形態における音源量子化回路５５
０の構成を示す図である。FIG. 6 is a sound source quantization circuit 55 according to the third embodiment.
It is a figure which shows the structure of 0.

【図７】本発明による音声符号化装置の第４の実施形態
を示す構成ブロック図である。FIG. 7 is a configuration block diagram showing a fourth embodiment of a speech encoding apparatus according to the present invention.

【図８】第４の実施の形態における音源量子化回路３９
０の構成を示す図である。FIG. 8 is a sound source quantization circuit 39 according to the fourth embodiment.
It is a figure which shows the structure of 0.

【図９】本発明による音声符号化装置の第５の実施形態
を示す構成ブロック図である。[Fig. 9] Fig. 9 is a configuration block diagram showing a fifth embodiment of a speech encoding device according to the present invention.

【図１０】第５の実施の形態における音源量子化回路６
００の構成を示す図である。FIG. 10 is an excitation quantization circuit 6 according to the fifth embodiment.
It is a figure which shows the structure of 00.

【図１１】本発明による音声符号化装置の第６の実施形
態を示す構成ブロック図である。FIG. 11 is a configuration block diagram showing a sixth embodiment of a speech coding apparatus according to the present invention.

【図１２】第６の実施の形態における音源量子化回路６
５０の構成を示す図である。FIG. 12 is an excitation quantization circuit 6 according to the sixth embodiment.
It is a figure which shows the structure of 50.

【図１３】本発明による音声符号化装置の第７の実施形
態を示す構成ブロック図である。[Fig. 13] Fig. 13 is a configuration block diagram showing a seventh embodiment of a speech encoding device according to the present invention.

【図１４】第７の実施の形態における音源量子化回路７
５０の構成を示す図である。FIG. 14 is an excitation quantization circuit 7 according to the seventh embodiment.
It is a figure which shows the structure of 50.

【図１５】本発明による音声符号化装置の第８の実施形
態を示す構成ブロック図である。[Fig. 15] Fig. 15 is a configuration block diagram showing an eighth embodiment of a speech encoding device according to the present invention.

【図１６】第８の実施の形態における音源量子化回路８
５０の構成を示す図である。FIG. 16 is an excitation quantization circuit 8 according to the eighth embodiment.
It is a figure which shows the structure of 50.

【図１７】本発明による音声符号化装置の第９の実施形
態を示す構成ブロック図である。FIG. 17 is a configuration block diagram showing a ninth embodiment of a speech encoding device according to the present invention.

[Explanation of symbols]

１１０フレーム分割回路１２０サブフレーム分割回路２００スペクトルパラメータ計算回路２１０スペクトルパラメータ量子化回路２２０コードブック２３０聴感重み付け回路２３５減算回路２４０応答信号計算回路３１０インパルス応答計算回路３５０、３９０、４５０、５００、６００、６５０、７
５０、８５０、９６０音源量子化回路３６０重み付け信号計算回路３６５、９６５ゲイン量子化回路３９５振幅コードブック３６７ゲインコードブック４００マルチプレクサ９００モード判別回路110 frame division circuit 120 sub-frame division circuit 200 spectrum parameter calculation circuit 210 spectrum parameter quantization circuit 220 codebook 230 perceptual weighting circuit 235 subtraction circuit 240 response signal calculation circuit 310 impulse response calculation circuit 350, 390, 450, 500, 600, 650, 7
50, 850, 960 Excitation quantization circuit 360 Weighting signal calculation circuit 365, 965 Gain quantization circuit 395 Amplitude codebook 367 Gain codebook 400 Multiplexer 900 Mode discrimination circuit

Claims

(57) [Claims]

1. A spectrum parameter calculation unit for obtaining and quantizing a spectrum parameter from an input voice signal, and an adaptive codebook unit for obtaining a pitch prediction signal by obtaining a delay corresponding to a pitch period from the voice signal and calculating a pitch prediction signal. And the sound source signal of the audio signal is composed of a number M of pulses of which amplitude is non-zero, the sample position corresponding to the pulse position satisfying a predetermined condition for the pitch prediction signal is obtained, and the obtained sample And a sound source quantizer that sets a range for searching the position of the pulse based on a position shifted by a predetermined number of samples from the position and searches for and outputs the best position for the set range. Speech coding device.

2. A spectrum parameter calculation unit for obtaining and quantizing a spectrum parameter from an input voice signal, an adaptive codebook unit for obtaining a pitch prediction signal by obtaining a delay corresponding to a pitch period from the voice signal, and an adaptive codebook unit, The sound source signal of the voice signal is composed of M pulses of non-zero amplitude, and a sample position satisfying a predetermined condition for the pitch prediction signal is obtained in a section having a length equal to the pitch period from the beginning and the position A speech encoding apparatus having a sound source quantization unit that sets a range for searching a pulse position based on a position shifted by a predetermined number of samples from, and searches for and outputs the best position with respect to the range. .

3. A spectrum parameter calculation unit for obtaining and quantizing a spectrum parameter from an input voice signal, an adaptive codebook unit for obtaining a pitch prediction signal by obtaining a delay corresponding to a pitch period from the voice signal, and an adaptive codebook unit, A sound source signal of a voice signal is composed of a number M of pulses having non-zero amplitude, and a sample position satisfying a predetermined condition with respect to the pitch prediction signal is obtained in a section having a length equal to a pitch cycle from the beginning, Based on the position shifted by a predetermined number of samples from the position, the pulse position candidates are set while being shifted by the pitch period, and a sound source quantizer that searches the candidate position and outputs the best position is used. Speech coding apparatus having.

4. The speech coding apparatus according to claim 1, wherein the excitation quantizer has a codebook for collectively quantizing the amplitudes or polarities of a plurality of pulses.

5. A spectrum parameter calculation unit that obtains and quantizes a spectrum parameter from an input speech signal, an adaptive codebook unit that obtains a delay corresponding to a pitch period from the speech signal, calculates a pitch prediction signal, and performs pitch prediction. A sound source signal is composed of a number M of non-zero-amplitude pulses, a sample position satisfying a predetermined condition for the pitch prediction signal is obtained, and the position is determined using each of a plurality of types of shift amounts. A sound source quantization unit that sets a range for searching the position of the pulse based on the position after the shift and searches for a position with respect to the range, and outputs a combination of the best shift amount and the position of the pulse. A speech encoding apparatus having.

6. A spectrum parameter calculation unit for obtaining and quantizing a spectrum parameter from an input voice signal, an adaptive codebook unit for obtaining a pitch corresponding to a pitch period from the voice signal, calculating a pitch prediction signal and performing pitch prediction, A sound source signal of a voice signal is composed of a number M of pulses of which amplitude is non-zero, and a sample position satisfying a predetermined condition for the pitch prediction signal is obtained in a section having a length equal to a pitch cycle from the beginning, Using each of the shift amounts of the seed, set a range for searching the position of the pulse based on the position after shifting from the position, and search the position with respect to the range, and the best shift amount and pulse A speech coding apparatus having a sound source quantization unit that outputs a combination of positions.

7. A spectrum parameter calculating section for obtaining and quantizing a spectrum parameter from an input speech signal, an adaptive codebook section for obtaining a pitch corresponding to a pitch period from the speech signal, calculating a pitch prediction signal and performing pitch prediction, A sound source signal of a voice signal is composed of a number M of pulses having non-zero amplitude, and a sample position satisfying a predetermined condition with respect to the pitch prediction signal is obtained in a section having a length equal to a pitch cycle from the beginning, Based on the position after shifting from the position by using each of the shift amounts of the seeds, the position of the pulse is set while shifting the pitch period, and the position is searched for. A speech encoding apparatus having an excitation quantizer that outputs a combination of a shift amount and a pulse position.

8. The speech coding apparatus according to claim 5, wherein the excitation quantizer has a codebook for collectively quantizing the amplitudes or polarities of a plurality of pulses.

9. A spectrum parameter calculation unit that obtains and quantizes a spectrum parameter from an input voice signal, a mode determination unit that extracts a feature amount from the input voice signal and determines and outputs a plurality of modes, and a voice signal from the voice signal. An adaptive codebook unit that calculates a pitch prediction signal by calculating a delay corresponding to a pitch period and a pitch prediction signal, and a sound source signal of the voice signal is composed of a number M of pulses whose amplitude is non-zero, and in the case of a predetermined mode. , Obtaining a sample position satisfying a predetermined condition with respect to the pitch prediction signal, setting a range for searching the position of the pulse based on the position, searching for and outputting the best for the range A speech coding apparatus comprising: a sound source quantization unit.

10. The speech coding apparatus according to claim 9, wherein the feature amount is an average pitch prediction gain.

11. The speech coding apparatus according to claim 9, wherein the mode discrimination unit discriminates a mode based on a result of comparison between the average pitch prediction gain and a plurality of predetermined threshold values.

12. A spectrum parameter calculation unit for obtaining and quantizing a spectrum parameter from an input speech signal, an adaptive codebook for performing pitch estimation by calculating a pitch prediction signal by obtaining a delay corresponding to a pitch period from the speech signal. And a position for satisfying a predetermined condition with respect to the pitch prediction signal obtained by the adaptive codebook, and setting a search range of a plurality of pulse positions representing a sound source signal based on the obtained position. A speech coding apparatus, comprising: a sound source quantization unit that searches for the best positions of the plurality of pulses within the search range.