JPH1097294A

JPH1097294A - Voice coding device

Info

Publication number: JPH1097294A
Application number: JP9036726A
Authority: JP
Inventors: Hiroyuki Ebara; 原宏幸江; Toshiyuki Morii; 井利幸森
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1996-02-21
Filing date: 1997-02-20
Publication date: 1998-04-14
Anticipated expiration: 2017-02-20
Also published as: JP4063911B2

Abstract

PROBLEM TO BE SOLVED: To improve the sound quality of the sound source generating section in a CELP(code excited linear prediction) type voice coding device. SOLUTION: A pitch peak position calculator 12 determines the pitch peak position of an adaptive code vector, an amplitude emphasizing window generator 13 generates a window for emphasizing the amplitude of the pitch peak position and an amplitude emphasizing window applicator 16 emphasizes the amplitude of the noise code vector corresponding to the pitch peak position. Or it determines the search positions of pulses so that they are dense near the pitch peak position and sparse in the other regions, and, based on the determined search positions, searches pulse positions. Or it adaptively switches backward the source constitution to improve sound quality and suppress the propagation of the effects of transmission line error, making use of the pitch peak position and pitch period information in the just precedent subframe and the pitch period information in the present subframe.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、音声信号をコード
化して伝送する移動通信システム等におけるＣＥＬＰ
（Code Excited Linear Predicion)型音声符号化装置に
関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a CELP in a mobile communication system for coding and transmitting a voice signal.
The present invention relates to a (Code Excited Linear Predicion) type speech encoding device.

【０００２】[0002]

【従来の技術】ＣＥＬＰ型音声符号化装置は、音声をあ
る一定のフレーム長に区切り、各フレーム毎に音声の線
形予測を行い、フレーム毎の線形予測による予測残差
（励起信号）を既知の波形からなる適応符号ベクトルと
雑音符号ベクトルを用いて符号化するものである。適応
符号ベクトルと雑音符号ベクトルは、図３１に示すよう
に、それぞれ適応符号帳１および雑音符号帳２に格納さ
れた適応符号ベクトルと雑音符号ベクトルをそのまま使
用する場合と、図３２に示すように、適応符号帳１から
の適応符号ベクトルと、雑音符号帳２からの雑音符号ベ
クトルを適応符号帳１のピッチ同期Ｌに同期させた雑音
符号ベクトルを用いる場合とがある。図３２は特開平５
−１９７９５号公報および特開平５−１９７９６号公報
に開示されているＣＥＬＰ型音声符号化装置における雑
音音源ベクトル生成部の構成である。図３２において、
適応符号帳１から適応符号ベクトルが選択されるととも
に、ピッチ周期Ｌが出力され、雑音符号帳２から選択さ
れた雑音符号ベクトルが、周期化器３によりピッチ周期
Ｌを用いて周期化される。周期化は、雑音符号ベクトル
を先頭からピッチ周期分切り出し、それをサブフレーム
長に達するまで複数回繰り返して接続することによって
行われる。2. Description of the Related Art A CELP-type speech coding apparatus divides speech into a certain frame length, performs linear prediction of speech for each frame, and obtains a prediction residual (excitation signal) by linear prediction for each frame. The coding is performed using an adaptive code vector composed of a waveform and a noise code vector. The adaptive code vector and the noise code vector are, as shown in FIG. 31, the case where the adaptive code vector and the noise code vector stored in the adaptive codebook 1 and the noise codebook 2 are used as they are, respectively, as shown in FIG. In some cases, a noise code vector obtained by synchronizing the adaptive code vector from the adaptive codebook 1 and the noise code vector from the noise codebook 2 with the pitch synchronization L of the adaptive codebook 1 is used. FIG.
1 is a configuration of a noise excitation vector generation unit in a CELP-type speech coding apparatus disclosed in Japanese Patent Application Laid-Open No. 19795 and Japanese Patent Application Laid-Open No. 5-19796. In FIG. 32,
An adaptive code vector is selected from the adaptive codebook 1, and a pitch period L is output. The noise code vector selected from the noise codebook 2 is periodicized by the periodicizer 3 using the pitch period L. Periodization is performed by cutting out the noise code vector by the pitch period from the beginning and connecting it repeatedly a plurality of times until it reaches the subframe length.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、上記従
来の雑音符号ベクトルをピッチ周期化するＣＥＬＰ型音
声符号化装置では、適応符号ベクトル成分を取り除いた
後に残留するピッチ周期成分を、雑音符号ベクトルをピ
ッチ周期で周期化することによって取り除いているた
め、１ピッチ波形内に存在する位相情報、すなわちどこ
にピッチパルスのピークが存在するかという情報を積極
的に用いることがなく、音声品質の向上を図る上で限界
があった。However, in the above-described conventional CELP speech coding apparatus for pitch-performing a noise code vector, the pitch period component remaining after removing the adaptive code vector component is determined by using the noise code vector as a pitch. Since it is removed by cycling at intervals, the phase information existing in one pitch waveform, that is, information on where the peak of the pitch pulse exists, is not actively used, and the sound quality is improved. There was a limit.

【０００４】本発明は、このような従来の問題を解決す
るものであり、音声品質を一段と向上させることのでき
る音声符号化装置を提供することを目的とする。An object of the present invention is to solve such a conventional problem, and an object of the present invention is to provide a speech encoding device capable of further improving speech quality.

【０００５】[0005]

【課題を解決するための手段】上記目的を達成するため
に、本発明は、適応符号ベクトルのピッチピーク位置に
対応する雑音符号ベクトルの振幅を強調することによっ
て、１ピッチ波形内に存在する位相情報を利用して、音
質向上を図るようにしたものである。To achieve the above object, the present invention enhances the amplitude of a noise code vector corresponding to the pitch peak position of an adaptive code vector by increasing the phase present in one pitch waveform. The information is used to improve the sound quality.

【０００６】本発明はまた、適応符号ベクトルのピッチ
ピーク近傍のみに限定した雑音符号ベクトルを用いるこ
とにより、雑音符号ベクトルに割り当てられるビット数
が少ない場合でも、音質劣化を少なくするようにしたも
のである。Further, the present invention uses a noise code vector limited only to the vicinity of the pitch peak of an adaptive code vector to reduce sound quality deterioration even when the number of bits allocated to the noise code vector is small. is there.

【０００７】本発明はまた、適応符号ベクトルのピッチ
ピークの位置とピッチ周期を用いてパルス位置の探索範
囲を限定することにより、パルスの位置を表すビット数
が少ない場合でも、音質劣化を少なくしながら探索範囲
を狭めるようにしたものである。The present invention also limits the search range of the pulse position using the position of the pitch peak and the pitch period of the adaptive code vector, thereby reducing sound quality degradation even when the number of bits representing the pulse position is small. While narrowing the search range.

【０００８】本発明はまた、適応符号ベクトルのピッチ
ピークの位置とピッチ周期を用いてパルス位置の探索範
囲を限定する際に、特に１〜２ピッチ波形のパルス位置
探索精度を細かくとることによって、ピッチ周期が短い
音声の有声部の音質向上を図るようにしたものである。In the present invention, when the search range of the pulse position is limited by using the position of the pitch peak and the pitch period of the adaptive code vector, particularly, the pulse position search accuracy of the 1 to 2 pitch waveform is made fine, This is to improve the sound quality of a voiced portion of a voice having a short pitch cycle.

【０００９】本発明はまた、ピッチ周期の値によってパ
ルス音源のパルス本数を変化させることにより、音質向
上を図るようにしたものである。The present invention is also intended to improve the sound quality by changing the number of pulses of the pulse sound source according to the value of the pitch period.

【００１０】本発明はまた、適応符号ベクトルのピッチ
ピーク位置付近とそれ以外の部分のパルス振幅を予め決
定してからパルス音源探索を行うことによって、音質向
上を図るようにしたものである。In the present invention, the sound quality is improved by performing a pulse sound source search after determining the pulse amplitudes in the vicinity of the pitch peak position of the adaptive code vector and other portions in advance.

【００１１】本発明はまた、ピッチゲインを多段量子化
して初段の量子化情報を適応符号帳探索直後に行うこと
によって、ピッチゲインの初段量子化情報を雑音符号帳
切り替えの為のモード情報として利用できるようにして
符号化効率の向上を図るようにしたものである。The present invention also utilizes the first-stage quantized information of the pitch gain as mode information for switching the noise codebook by performing the multi-stage quantization of the pitch gain and performing the first-stage quantized information immediately after searching for the adaptive codebook. The encoding efficiency is improved as much as possible.

【００１２】本発明はまた、量子化ピッチ周期情報また
は直前のサブフレームあるいは現サブフレームにおける
量子化ピッチゲイン情報をもちいて、パルス音源の探索
位置を切り替える制御を行うことにより、音声品質の向
上を図るようにしたものである。The present invention also improves the voice quality by controlling the switching of the search position of the pulse sound source using the quantization pitch period information or the quantization pitch gain information in the immediately preceding subframe or the current subframe. It is designed to work.

【００１３】本発明はまた、サブフレーム間における位
相の連続性をバックワードで判定し、位相が連続してい
ると判定されたサブフレームについてのみ位相適応処理
を適用することにより、伝送する情報量を増やさずに位
相適応処理の切り替えを行い、音声品質の向上を図るよ
うにしたものである。なお、位相適応処理を行わない場
合に固定符号帳を使用すれば、伝送路誤りの伝播を防ぐ
効果を得ることも可能となる。According to the present invention, the amount of information to be transmitted is determined by determining the continuity of the phase between subframes in the backward direction and applying the phase adaptation process only to the subframes determined to have the continuous phase. The switching of the phase adaptive processing is performed without increasing the sound quality, thereby improving the sound quality. If the fixed codebook is used when the phase adaptation process is not performed, it is possible to obtain the effect of preventing transmission path error propagation.

【００１４】本発明はまた、適応符号ベクトルにおける
ピッチピーク位置近傍への信号パワーの集中度によっ
て、位相適応処理を適用するかしないかを決定すること
とにより、伝送する情報量を増やさずに位相適応処理の
切り替えを行い、音声品質の向上を図るようにしたもの
である。なお、位相適応処理を行わない場合に固定符号
帳を使用すれば、伝送路誤りの伝播を防ぐ効果を得るこ
とも可能となる。The present invention also determines whether or not to apply the phase adaptation processing based on the concentration of signal power in the vicinity of a pitch peak position in an adaptive code vector. The adaptive processing is switched to improve the voice quality. If the fixed codebook is used when the phase adaptation process is not performed, it is possible to obtain the effect of preventing transmission path error propagation.

【００１５】本発明はまた、ピッチピーク位置からの相
対位置で音源パルス探索を行うＣＥＬＰ型音声符号化装
置において、サブフレームの先頭側から順番にパルス位
置のインデックスを付けるようにすることによって、あ
るフレームにおいて発生した伝送路誤りの影響が後続の
伝送路誤りのないフレームに伝播することを防ぐように
したものである。The present invention also provides a CELP-type speech coding apparatus for searching for an excitation pulse at a relative position from a pitch peak position, by indexing pulse positions sequentially from the head of a subframe. This is to prevent the effect of a transmission path error occurring in a frame from propagating to a subsequent frame without a transmission path error.

【００１６】本発明はまた、ピッチピーク位置からの相
対位置で音源パルス探索を行うＣＥＬＰ型音声符号化装
置において、サブフレームの先頭側から順番にパルス位
置のインデックスを付けるとともに、同じインデックの
異なるパルスにおいてもサブフレームの先頭側から順番
にパルス番号を付けるようにすることによって、あるフ
レームにおいて発生した伝送路誤りの影響が後続の伝送
路誤りのないフレームに伝播することを防ぐようにした
ものである。The present invention also provides a CELP-type speech coding apparatus for searching for an excitation pulse at a relative position from a pitch peak position, in which a pulse position is indexed sequentially from the head of a subframe and different pulses of the same index. Also, by assigning pulse numbers sequentially from the beginning of the subframe, the effect of a transmission line error generated in a certain frame is prevented from propagating to a subsequent frame without a transmission line error. is there.

【００１７】本発明はまた、ピッチピーク位置からの相
対位置で音源パルス探索を行うＣＥＬＰ型音声符号化装
置において、パルス探索位置の全てを相対位置で表すの
ではなく、ピッチピーク近傍の一部のみを相対位置で表
現し、残りの部分は予め定められた固定位置にすること
により、あるフレームにおいて発生した伝送路誤りの影
響が後続の伝送路誤りのないフレームに伝播することを
防ぐようにしたものである。The present invention also provides a CELP-type speech coding apparatus that performs a sound source pulse search at a relative position from a pitch peak position, wherein not all the pulse search positions are represented by relative positions, but only a part near the pitch peak. Is expressed as a relative position, and the remaining portion is set at a predetermined fixed position, so that the effect of a transmission line error generated in a certain frame is prevented from propagating to a subsequent frame without a transmission line error. Things.

【００１８】本発明はまた、ピッチピーク位置を求める
際に、対象となる信号全体に対してピッチピーク位置の
探索を行うのではなく、切り出したピッチ周期長の信号
の中でピッチピーク位置の探索を行う手段を備えること
により、より正確に先頭のピッチピーク位置を抽出でき
るようにしたものである。According to the present invention, when the pitch peak position is obtained, the search for the pitch peak position is not performed for the entire target signal, but for the search for the pitch peak position in the extracted signal having the pitch period length. Is provided so that the leading pitch peak position can be more accurately extracted.

【００１９】本発明はまた、サブフレーム間でピッチ周
期が連続している部分、即ち有声定常部と思われる部分
において、直前のサブフレームにおけるピッチピーク位
置と直前のサブフレームにおけるピッチ周期と現在のサ
ブフレームにおけるピッチ周期を用いて現在のサブフレ
ームにおけるピッチピーク位置を予測し、予測されたピ
ッチピーク位置に基づいて現在のサブフレームにおける
ピッチピーク位置の存在範囲を限定することにより、有
声定常部における位相の不連続が生じないようにピッチ
ピーク位置を抽出できるようにしたものである。In the present invention, in a portion where the pitch period is continuous between subframes, that is, a portion considered to be a voiced stationary portion, the pitch peak position in the immediately preceding subframe, the pitch period in the immediately preceding subframe and the current By predicting the pitch peak position in the current sub-frame using the pitch period in the sub-frame, and limiting the range of the pitch peak position in the current sub-frame based on the predicted pitch peak position, in the voiced stationary part The pitch peak position can be extracted so that phase discontinuity does not occur.

【００２０】本発明はまた、サブフレーム長が１０ｍｓ
程度以上を有し、かつ雑音符号帳情報に割り当てられる
情報量がサブフレーム当たり１５ビット程度のように比
較的少なく、雑音符号帳としてパルス音源を適用する場
合において、パルス数を少なくして各パルスの位置情報
を十分にとるモードと各パルスの位置情報を粗くする代
わりにパルス数を増やしたモードとをそれぞれ少なくと
も１モード以上（合計２モード以上）備える構成とする
ことにより、音声信号の有声立ち上がり部分の品質向上
を図り、またパルス数を増やすことによって各パルスの
位置情報が粗くなることによる音声品質の劣化を抑える
ことを可能としたものである。The present invention also has a subframe length of 10 ms.
And when the amount of information allocated to the noise codebook information is relatively small, such as about 15 bits per subframe, and when applying a pulse excitation as a noise codebook, the number of pulses is reduced and each pulse is reduced. , And a mode in which the number of pulses is increased instead of coarsening the position information of each pulse. By improving the quality of the portion and increasing the number of pulses, it is possible to suppress the deterioration of the voice quality due to coarse position information of each pulse.

【００２１】[0021]

【発明の実施の形態】本発明の請求項１に記載の発明
は、ＣＥＬＰ型音声符号化装置において、適応符号ベク
トルのピッチピーク位置に対応する雑音符号ベクトルの
振幅を強調する音源生成部を備えた音声符号化装置であ
り、１ピッチ波形内に存在する位相情報を利用して、音
質向上を図ることができる。1 is a block diagram showing a configuration of a CELP type speech coding apparatus according to a first embodiment of the present invention; FIG. 2 is a block diagram showing a configuration of a CELP type speech coding apparatus according to a first embodiment of the present invention; The present invention provides a sound encoding device that can improve sound quality by using phase information existing in one pitch waveform.

【００２２】本発明の請求項２に記載の発明は、請求項
１記載の音声符号化装置において、音声生成部が、適応
符号ベクトルのピッチ周期と同期した振幅強調窓を雑音
符号ベクトルに乗ずることによって、適応符号ベクトル
のピッチピークの位置に対応する雑音符号ベクトルの振
幅を強調するものであり、雑音音源ベクトルの振幅をピ
ッチ周期に同期して強調することによって、音質向上を
図ることができる。According to a second aspect of the present invention, in the speech encoding apparatus according to the first aspect, the speech generating unit multiplies the noise code vector by an amplitude emphasis window synchronized with a pitch period of the adaptive code vector. Thus, the amplitude of the noise code vector corresponding to the position of the pitch peak of the adaptive code vector is emphasized, and the sound quality can be improved by emphasizing the amplitude of the noise excitation vector in synchronization with the pitch cycle.

【００２３】本発明の請求項３に記載の発明は、請求項
２記載の音声符号化装置において、音声生成部が、適応
符号ベクトルのピッチピーク位置を中心とする三角窓を
振幅強調窓として使用するものであり、振幅強調窓長の
制御を容易に行うことができる。According to a third aspect of the present invention, in the speech encoding apparatus according to the second aspect, the speech generation unit uses a triangular window centered on a pitch peak position of the adaptive code vector as an amplitude emphasizing window. This makes it possible to easily control the amplitude emphasis window length.

【００２４】本発明の請求項４に記載の発明は、ＣＥＬ
Ｐ型音声符号化装置において、適応符号ベクトルのピッ
チピーク近傍のみに限定した雑音符号ベクトルを用いる
音源生成部を備えた音声符号化装置であり、適応符号ベ
クトルのピッチピーク近傍のみに限定した雑音符号ベク
トルを用いることにより、雑音符号ベクトルに割り当て
られるビット数が少ない場合でも、音質劣化を少なくで
き、ピッチパルス近傍に残差パワーが集中するような有
声部で音質向上を図ることができる。According to a fourth aspect of the present invention, a CEL
In a P-type speech coding apparatus, a speech coding apparatus including a sound source generation unit that uses a noise code vector limited only to the vicinity of a pitch peak of an adaptive code vector is provided. By using the vector, even when the number of bits allocated to the noise code vector is small, deterioration of sound quality can be reduced, and sound quality can be improved in a voiced part where residual power is concentrated near the pitch pulse.

【００２５】本発明の請求項５に記載の発明は、パルス
音源を雑音符号帳に用いるＣＥＬＰ型音声符号化装置に
おいて、パルス位置の探索範囲を適応符号ベクトルのピ
ッチ周期およびピッチピーク位置によって決定する音源
生成部を備えた音声符号化装置であり、パルス位置に割
り当てられるビット数が少ない場合でも、音質劣化を少
なくできる。According to a fifth aspect of the present invention, in a CELP type speech coding apparatus using a pulse excitation as a noise codebook, a search range of a pulse position is determined by a pitch period and a pitch peak position of an adaptive code vector. This is a speech encoding device including a sound source generation unit, and can reduce sound quality degradation even when the number of bits allocated to pulse positions is small.

【００２６】本発明の請求項６に記載の発明は、請求項
５記載の音声符号化装置において、音源生成部が、適応
符号ベクトルのピッチピーク位置近傍は密に、それ以外
の部分は疎になるようにパルス位置の探索範囲を決定す
るものであり、パルスが立てられる確率が高くなる部分
を細かく探索するので、音声向上を図ることができる。According to a sixth aspect of the present invention, in the speech coding apparatus according to the fifth aspect, the excitation generating section is arranged so that the vicinity of the pitch peak position of the adaptive code vector is dense, and the other parts are sparse. The search range of the pulse position is determined in such a manner as to make it possible to finely search for a portion where the probability of forming a pulse is high, so that the sound can be improved.

【００２７】本発明の請求項７記載の発明は、ピッチ周
期によってパルス位置の探索範囲を切り替える請求項５
または請求項６記載の音声符号化装置であり、ピッチ周
期に基づいてパルス位置の探索範囲を伸縮するので、ピ
ッチ周期が短い場合に１〜２ピッチの波形をより細かく
表現することができ、音声品質の向上を図ることができ
る。According to a seventh aspect of the present invention, the search range of the pulse position is switched according to the pitch period.
7. The speech encoding apparatus according to claim 6, wherein the search range of the pulse position is expanded or contracted based on the pitch cycle. The quality can be improved.

【００２８】本発明の請求項８記載の発明は、適応符号
ベクトルに複数のピッチピークが存在する場合に、少な
くとも２つのピッチピークの位置が探索範囲に含まれる
ようにパルス位置の探索範囲を限定する請求項７記載の
音声符号化装置であり、検出された先頭のピッチピーク
の位置が誤っていた場合の影響を緩和することができ、
また、先頭のピッチピーク付近の波形と２番目のピッチ
ピーク付近の波形の形状変化にも対応することができる
ので、音声品質の向上を図ることができる。According to an eighth aspect of the present invention, when a plurality of pitch peaks exist in the adaptive code vector, the search range of the pulse position is limited so that the positions of at least two pitch peaks are included in the search range. 8. The speech encoding apparatus according to claim 7, wherein the effect of a case where the position of the detected leading pitch peak is incorrect is reduced.
Further, since it is possible to cope with shape changes of the waveform near the first pitch peak and the waveform near the second pitch peak, it is possible to improve the sound quality.

【００２９】本発明の請求項９記載の発明は、ＣＥＬＰ
型音声符号化装置において、音声の分析結果によって雑
音符号帳を切り替える音源生成部を備えた音声符号化装
置であり、入力音声の特徴に応じて雑音符号帳を切り替
えることができるので、音声品質の向上を図ることがで
きる。The ninth aspect of the present invention provides a CELP
In the speech coding apparatus of the type, the speech coding apparatus includes a sound source generation unit that switches a noise codebook according to a result of speech analysis. Improvement can be achieved.

【００３０】本発明の請求項１０記載の発明は、ＣＥＬ
Ｐ型音声符号化装置において、雑音符号帳探索を行う以
前に抽出された伝送パラメータを用いて雑音符号帳を切
り替える音源生成部を備えた音声符号化装置であり、す
でに伝送することが決定されている情報を用いて雑音符
号帳を切り替えるので、情報量を増加させることなく雑
音符号帳の切り替えが行うことができる。The invention according to claim 10 of the present invention provides a CEL
The P-type speech coding apparatus is a speech coding apparatus including a sound source generation unit that switches a random codebook using transmission parameters extracted before performing a random codebook search, and has already been determined to be transmitted. Since the random codebook is switched using the existing information, the random codebook can be switched without increasing the amount of information.

【００３１】本発明の請求項１１記載の発明は、音声信
号の分析結果によってパルス本数を切り替える構成の請
求項５から請求項８のいずれかに記載の音声符号化装置
であり、入力音声の特徴に応じてパルス本数を切り替え
るため、音声品質の向上を図ることができる。An eleventh aspect of the present invention is the audio encoding apparatus according to any one of the fifth to eighth aspects, wherein the number of pulses is switched according to an analysis result of the audio signal. Since the number of pulses is switched according to, the sound quality can be improved.

【００３２】本発明の請求項１２記載の発明は、雑音符
号帳探索を行う以前に抽出されている情報を用いてパル
ス本数を切り替える構成を有する、請求項５から請求項
８または請求項１１のいずれかに記載の音声符号化装置
であり、すでに伝送することが決定している情報を用い
てパルス本数を切り替えるため、伝送する情報量を増加
させることなくパルス本数の切り替えを行うことができ
る。The invention according to claim 12 of the present invention has a configuration in which the number of pulses is switched using information extracted before performing a random codebook search. The speech encoding apparatus according to any of the above, wherein the number of pulses is switched using information that has already been determined to be transmitted, so that the number of pulses can be switched without increasing the amount of information to be transmitted.

【００３３】本発明の請求項１３記載の発明は、ピッチ
周期によってパルス本数を切り替える音源生成部を備え
た請求項５から請求項８または請求項１１または請求項
１２のいずれかに記載の音声符号化装置であり、ピッチ
周期を用いてパルス数を切り替えるため、伝送情報を増
加させることなくパルス本数を切り替えることができ
る。また、ピッチ周期によって最適なパルス本数が異な
るため、音声品質の向上を図ることができる。According to a thirteenth aspect of the present invention, the speech codec according to any one of the fifth to eighth or eleventh or twelfth aspects, further comprising a sound source generation unit for switching the number of pulses according to a pitch period. Since the number of pulses is switched using the pitch period, the number of pulses can be switched without increasing transmission information. In addition, since the optimum number of pulses differs depending on the pitch period, the sound quality can be improved.

【００３４】本発明の請求項１４記載の発明は、連続す
るサブフレーム間でピッチ周期の変動が小さい場合とそ
うでない場合でパルス本数を切り替える請求項１３記載
の記載の音声符号化装置であり、音声信号の有声部の立
ち上がり部分と定常部分で使用するパルスの本数を切り
替えることになるので、音声品質の向上を図ることがで
きる。According to a fourteenth aspect of the present invention, there is provided the speech encoding apparatus according to the thirteenth aspect, wherein the number of pulses is switched between a case in which the pitch cycle varies between consecutive subframes and a case in which the variation is not so. Since the number of pulses used in the rising portion and the steady portion of the voiced portion of the voice signal is switched, the voice quality can be improved.

【００３５】本発明の請求項１５記載の発明は、雑音音
源としてパルス音源を用いる雑音符号ベクトル生成部に
おいて、パルス位置探索に先立ってパルス振幅を決定す
る請求項５から請求項８記載または請求項１１から請求
項１４のいずれかに記載の音声符号化装置であり、パル
ス音源に振幅のバリエーションを持たせるため、音声品
質の向上が図れる。また、パルス探索前に振幅を決定す
るため、その振幅に対して最適なパルス位置を決定する
ことができる。According to a fifteenth aspect of the present invention, in the noise code vector generation unit using a pulse excitation as a noise excitation, the pulse amplitude is determined prior to the pulse position search. The speech encoding apparatus according to any one of claims 11 to 14, wherein the pulse sound source has a variation in amplitude, thereby improving speech quality. Further, since the amplitude is determined before the pulse search, an optimum pulse position can be determined for the amplitude.

【００３６】本発明の請求項１６記載の発明は、雑音音
源としてパルス音源を用いる雑音符号ベクトル生成部に
おいて、適応符号ベクトルのピッチピーク近傍とそれ以
外の部分でパルス振幅を変える請求項１５記載の音声符
号化装置であり、音源信号のピッチピーク近傍とそれ以
外の部分の振幅を変化させるので、音源信号のピッチ構
造の形状を効率的に表現でき、音声品質の向上およびパ
ルス振幅情報の量子化の効率化を図ることができる。According to the invention of claim 16 of the present invention, in the noise code vector generation section using a pulse excitation as a noise excitation, the pulse amplitude is changed near the pitch peak of the adaptive code vector and other portions. Since it is a speech coding device, it changes the amplitude near the pitch peak of the sound source signal and the rest of the pitch, so that the pitch structure of the sound source signal can be expressed efficiently, improving speech quality and quantizing pulse amplitude information. Efficiency can be improved.

【００３７】本発明の請求項１７記載の発明は、統計的
にあるいは学習によって、使用するパルス音源のパルス
数をピッチ周期に基づいて決定する請求項１３記載の音
声符号化装置であり、各ピッチ周期に対する最適パルス
本数を統計的あるいはその他の学習方法によって決定す
るため、音声品質の向上を図ることができる。According to a seventeenth aspect of the present invention, there is provided the speech encoding apparatus according to the thirteenth aspect, wherein the number of pulses of the pulse sound source to be used is determined based on the pitch period, either statistically or by learning. Since the optimal number of pulses for the cycle is determined statistically or by another learning method, it is possible to improve speech quality.

【００３８】本発明の請求項１８記載の発明は、ＣＥＬ
Ｐ型音声符号化装置において、ピッチゲインを多段量子
化する音源生成部を備え、初段においては適応符号帳探
索直後に求められる値を量子化ターゲットとし、2段目
以降においては音源探索を全て終えた後に閉ループ探索
で決定されたピッチゲインと初段で量子化された値の差
分を量子化ターゲットとする音声符号化装置であり、適
応符号帳と固定符号帳（雑音符号帳）の和で駆動音源ベ
クトルを生成するＣＥＬＰ型音声符号化装置において
は、固定符号帳（雑音符号帳）探索前に得られる情報を
量子化して伝送するため、独立したモード情報を付加せ
ずに固定符号帳（雑音符号帳）の切り替え等を行うこと
が可能となり、効率的に音声情報を符号化することが可
能となる。The invention according to claim 18 of the present invention provides a CEL
The P-type speech coding apparatus includes a sound source generation unit that quantizes pitch gains in multiple stages. In the first stage, a value obtained immediately after the adaptive codebook search is used as a quantization target, and in the second and subsequent stages, all sound source searches are completed. This is a speech coding apparatus that uses a difference between a pitch gain determined in a closed loop search and a value quantized in a first stage as a quantization target, and is driven by the sum of an adaptive codebook and a fixed codebook (noise codebook). In a CELP type speech coding apparatus that generates a vector, information obtained before searching for a fixed codebook (noise codebook) is quantized and transmitted. Therefore, a fixed codebook (noise codebook) is added without adding independent mode information. ) Can be switched, and audio information can be efficiently encoded.

【００３９】本発明の請求項１９記載の発明は、請求項
１８記載の音声符号化装置において適応符号帳探索直後
に求められたピッチゲインの量子化値を用いて固定符号
帳を切り替える構成を有する、請求項９から請求項１２
または請求項１５から請求項１７のいずれかに記載の音
声符号化装置であり、固定符号帳探索前に求められるピ
ッチゲインと固定符号帳探索後に求められるピッチゲイ
ンの値が大きく異ならないことを利用して、モード情報
を付加することなく固定符号帳のモード切り替えを可能
とし、音声品質の向上を図ることができる。According to a nineteenth aspect of the present invention, in the speech coding apparatus according to the eighteenth aspect, a fixed codebook is switched by using a quantization value of a pitch gain obtained immediately after an adaptive codebook search. , Claims 9 to 12
Alternatively, the speech coding apparatus according to any one of claims 15 to 17, wherein the pitch gain obtained before searching for the fixed codebook and the pitch gain calculated after searching for the fixed codebook do not greatly differ. Thus, it is possible to switch the mode of the fixed codebook without adding mode information, and it is possible to improve speech quality.

【００４０】本発明の請求項２０記載の発明は、ピッチ
周期のサブフレーム間変化に基づいて固定符号帳を切り
替える請求項９から１２または請求項１５から請求項１
９のいずれかに記載の音声符号化装置であり、ピッチ周
期のサブフレーム間の連続性等を利用することによっ
て、有声・有声定常部であるか否かの判定を行い、有声
・有声定常部に有効な音源とそれ以外の部分（無声・立
ち上がり部等）に有効な音源との切り替えを行うことに
よって、音声品質の向上を図ることができる。According to a twentieth aspect of the present invention, the fixed codebook is switched based on a change in pitch period between subframes.
9. The speech coding apparatus according to any one of claims 9 to 9, wherein the speech encoding unit determines whether or not the voiced / voiced stationary unit is used by utilizing continuity between subframes of the pitch period and the like. The sound quality can be improved by switching between a sound source that is effective for the sound source and a sound source that is effective for the other parts (silent / rising portion, etc.).

【００４１】本発明の請求項２１記載の発明は、直前の
サブフレームで量子化されたピッチゲインを用いて固定
符号帳を切り替える請求項９から請求項１２または請求
項１５から請求項１７のいずれかに記載の音声符号化装
置であり、ピッチゲインのサブフレーム間の連続性等を
利用することによって、有声・有声定常部であるか否か
の判定を行い、有声・有声定常部に有効な音源とそれ以
外の部分（無声・立ち上がり部等）に有効な音源との切
り替えを行うことによって、音声品質の向上を図ること
ができる。According to a twenty-first aspect of the present invention, any one of the ninth to twelfth or fifteenth to seventeenth aspects switches the fixed codebook using the pitch gain quantized in the immediately preceding subframe. The voice coding apparatus according to any one of (1) to (3), by utilizing the continuity between pitch gain subframes and the like, to determine whether or not the voiced / voiced stationary section is effective. By switching between the sound source and a sound source that is effective for other parts (unvoiced, rising part, etc.), it is possible to improve the sound quality.

【００４２】本発明の請求項２２記載の発明は、ピッチ
周期のサブフレーム間変化および量子化ピッチゲインに
基づいて固定符号帳を切り替える請求項９から請求項１
２または請求項１５から請求項１７のいずれかに記載の
音声符号化装置であり、伝送パラメータであるピッチ周
期およびピッチゲインの情報を用いて、有声・有声定常
部であるか否かの判定を行い、有声・有声定常部に有効
な音源とそれ以外の部分（無声・立ち上がり部等）に有
効な音源との切り替えを行うことによって、音声品質の
向上を図ることができる。According to a twenty-second aspect of the present invention, the fixed codebook is switched based on a change in pitch period between subframes and a quantized pitch gain.
18. The speech coding apparatus according to claim 15, wherein a determination is made as to whether or not the voiced / voiced stationary section is to be used, using information on a pitch period and a pitch gain, which are transmission parameters. By performing the switching between the sound source effective for the voiced / voiced stationary part and the sound source effective for the other parts (unvoiced / rising part, etc.), the sound quality can be improved.

【００４３】本発明の請求項２３記載の発明は、固定符
号帳にパルス音源符号帳を用いる請求項１９から請求項
２２のいずれかに記載の音声符号化装置であり、雑音符
号帳にパルス音源を用いるので、雑音符号帳に要するメ
モリ量や雑音符号帳探索時の演算量を少なくすることが
でき、さらに有声部の立ち上がりの表現性を向上するこ
とができる。According to a twenty-third aspect of the present invention, the speech coding apparatus according to any one of the nineteenth to twenty-second aspects uses a pulse excitation codebook as the fixed codebook, and includes a pulse excitation codebook as the noise codebook. Is used, it is possible to reduce the amount of memory required for the random codebook and the amount of calculation at the time of searching for the random codebook, and it is possible to further improve the expression of the rise of the voiced part.

【００４４】本発明の請求項２４記載の発明は、所定の
時間長を有するサブフレーム毎に音声符号化処理を行う
ＣＥＬＰ型音声符号化装置において、現在のサブフレー
ムにおける位相と直前のサブフレームにおける位相とが
連続しているかどうかを判定し、連続していると判定さ
れた場合と連続していないと判定された場合とで用いる
音源を切り替える音声符号化装置であり、有声（定常）
部とそれ以外の部分を切り分けた音源構成が実現でき、
音質向上を図ることができる。According to a twenty-fourth aspect of the present invention, in a CELP type speech coding apparatus for performing speech coding processing for each subframe having a predetermined time length, a phase in a current subframe and a phase in a previous subframe are compared. A speech encoding device that determines whether a phase is continuous and switches a sound source used when it is determined that the phases are continuous and when it is determined that the phases are not continuous.
The sound source configuration can be realized by separating the part from the rest.
Sound quality can be improved.

【００４５】本発明の請求項２５記載の発明は、直前の
サブフレームにおけるピッチピーク位置と、直前のサブ
フレームにおけるピッチ周期と、現在のサブフレームに
おけるピッチ周期を用いて現在のサブフレームにおける
ピッチピーク位置を予測し、この予測によって得られた
現在のサブフレームにおけるピッチピーク位置が、現在
のサブフレームにおけるデータのみから求められたピッ
チピーク位置に近いかどうかによって、直前のサブフレ
ームにおける位相と現在のサブフレームにおける位相と
が連続しているかどうかを判定し、その判定結果によっ
て音源の符号化処理方法を切り替える請求項２４記載の
ＣＥＬＰ型音声符号化装置であり、既に伝送されたまた
は伝送されることになっている情報を用いて判定結果を
得るので、判定結果を新たな伝送情報を用いて伝送する
必要がない。According to a twenty-fifth aspect of the present invention, the pitch peak position in the current subframe is determined by using the pitch peak position in the immediately preceding subframe, the pitch period in the immediately preceding subframe, and the pitch period in the current subframe. The position is predicted, and the phase in the immediately preceding subframe and the current position are determined depending on whether or not the pitch peak position in the current subframe obtained by this prediction is close to the pitch peak position obtained only from the data in the current subframe. The CELP-type speech encoding apparatus according to claim 24, wherein it is determined whether or not the phase in the subframe is continuous, and the encoding processing method of the excitation is switched based on the determination result. The judgment result is obtained by using the information The do not need to be transmitted using the new transmission information.

【００４６】本発明の請求項２６記載の発明は、直前の
サブフレームにおける位相と現在のサブフレームにおけ
る位相とが連続していると判定された場合には、位相適
応処理を雑音符号帳に対して行い、直前のサブフレーム
における位相と現在のサブフレームにおける位相とが連
続していないと判定された場合には、位相適応処理を雑
音符号帳に対して行わない請求項２４または請求項２５
記載の音声符号化装置であり、効果的な位相適応処理を
行うことができる。なお、サブフレーム間の位相の連続
性はバックワードで判定されるため、位相適応処理を適
用するかしないかの切り替え情報を新たに伝送する必要
もない。さらに、位相適応処理を適用しない場合は固定
符号帳を使用することにより、伝送路誤りの影響の伝播
を抑える効果を得ることも可能である。According to a twenty-sixth aspect of the present invention, when it is determined that the phase in the immediately preceding subframe and the phase in the current subframe are continuous, the phase adaptation processing is performed on the noise codebook. 26. When it is determined that the phase in the immediately preceding subframe and the phase in the current subframe are not continuous, the phase adaptation processing is not performed on the noise codebook.
It is possible to perform effective phase adaptation processing. In addition, since the continuity of the phase between subframes is determined backward, there is no need to newly transmit switching information as to whether or not to apply the phase adaptation processing. Further, when the phase adaptation processing is not applied, by using a fixed codebook, it is possible to obtain the effect of suppressing the propagation of the influence of the transmission path error.

【００４７】本発明の請求項２７記載の発明は、所定の
時間長を有するサブフレーム毎に音声符号化処理を行う
ＣＥＬＰ型音声符号化装置において、現在のサブフレー
ムにおける適応符号ベクトルのピッチピーク付近におけ
る信号パワーの集中度を基準として、音源信号の符号化
処理方法を切り替える音声符号化装置であり、音源構成
（音源信号の符号化処理方法）の切り替えのために新た
な伝送情報を必要とせずに、適応的に音源構成の切り替
えを行うことができる。According to a twenty-seventh aspect of the present invention, there is provided a CELP-type speech coding apparatus for performing a speech coding process for each subframe having a predetermined time length, in the vicinity of a pitch peak of an adaptive code vector in a current subframe. Is a speech encoding apparatus that switches the encoding processing method of the excitation signal based on the signal power concentration level in, and does not require new transmission information for switching the excitation configuration (the encoding processing method of the excitation signal). In addition, the sound source configuration can be adaptively switched.

【００４８】本発明の請求項２８記載の発明は、現在の
サブフレームにおける適応符号ベクトルのピッチピーク
付近における信号パワーの１ピッチ周期長の信号全体に
占める割合が所定の値以上である場合には、位相適応処
理を雑音符号帳に対して行い、所定の値未満である場合
には、位相適応処理を雑音符号帳に対して行わない請求
項２７記載の音声符号化装置であり、適応符号ベクトル
のパルス性の強さによって適応的に位相適応処理を制御
する（切り替える）ことができ、音声品質の向上を図る
ことができる。また、位相適応処理の制御（切り替え）
のための新たな伝送情報も不要である。さらに、位相適
応処理を行わない場合に固定符号帳を用いれば、伝送路
誤りの影響の伝播を抑える効果を得ることも可能であ
る。[0048] The invention according to claim 28 of the present invention is arranged such that the ratio of the signal power in the vicinity of the pitch peak of the adaptive code vector in the current subframe to the entire signal of one pitch period length is equal to or more than a predetermined value. 28. The speech encoding apparatus according to claim 27, wherein the phase adaptation process is performed on the noise codebook and the phase adaptation process is not performed on the noise codebook if the value is less than a predetermined value. , The phase adaptive processing can be adaptively controlled (switched) depending on the strength of the pulse characteristic of, and the sound quality can be improved. Control (switching) of phase adaptation processing
No new transmission information is required. Furthermore, if the fixed codebook is used when the phase adaptation process is not performed, it is possible to obtain an effect of suppressing the propagation of the influence of the transmission path error.

【００４９】本発明の請求項２９記載の発明は、位相適
応処理として、ピッチピーク近傍は密にパルス位置探索
を行い、ピッチピーク近傍以外の部分は疎にパルス位置
探索を行う、パルス音源を雑音音源に適用した請求項２
６または請求項２８記載の音声符号化装置であり、雑音
符号帳にパルス音源を用いるので、雑音符号帳に要する
メモリ量や雑音符号帳探索時の演算量を少なくすること
ができ、さらに有声部の立ち上がりの表現性を向上する
ことができる。According to a twenty-ninth aspect of the present invention, as a phase adaptation process, a pulse position search is performed densely near a pitch peak, and a sparse pulse position search is performed in a portion other than the vicinity of a pitch peak. Claim 2 applied to a sound source
29. The speech coding apparatus according to claim 26, wherein a pulse excitation is used for the noise codebook, so that the amount of memory required for the noise codebook and the amount of calculation at the time of searching for the noise codebook can be reduced. The expression of the rising edge can be improved.

【００５０】本発明の請求項３０記載の発明は、パルス
の位置を表すインデックスを、サブフレームの先頭側か
ら順番に並ぶように付ける請求項５から請求項８または
請求項１１から請求項１７または請求項２３または請求
項２９のいずれかに記載の音声符号化装置であり、パル
スの位置を表すインデックスを、インデックスの番号が
若いほどサブフレームの先頭付近にあるように、サブフ
レームの先頭から付けることによって、ピッチピーク位
置が誤った場合に生じるパルス位置のずれを小さくする
ことが可能となり、伝送路誤りの影響の伝播を和らげる
ことができる。According to a thirty-fifth aspect of the present invention, the indices indicating the positions of the pulses are added so as to be arranged in order from the head of the subframe. 30. The speech encoding device according to claim 23, wherein an index representing a pulse position is assigned from the beginning of the subframe such that the index number is smaller near the beginning of the subframe. This makes it possible to reduce the deviation of the pulse position that occurs when the pitch peak position is incorrect, thereby reducing the propagation of the influence of the transmission path error.

【００５１】本発明の請求項３１記載の発明は、同じイ
ンデックス番号である場合、サブフレームの先頭側から
順番にパルスの番号を付け、さらにピッチピーク位置近
傍は密に、ピッチピーク近傍以外の部分は疎になるよう
に、各パルスの探索位置が決定されている請求項３０記
載の音声符号化装置であり、同じインデックス番号の場
合、パルスの番号が若いほどサブフレームの先頭側にな
るように、各パルスの番号が決められので、パルスのイ
ンデックスに加えてパルスの番号のつけかたも定義さ
れ、ピッチピーク位置が誤った場合に生じるパルス位置
のずれをさらに小さくすることが可能となり、伝送路誤
りの影響の伝播をさらに減らすことができる。According to the invention of claim 31 of the present invention, when the index numbers are the same, the pulse numbers are sequentially numbered from the head side of the subframe, and the vicinity of the pitch peak position is dense, and the parts other than the pitch peak are densely arranged. 31. The speech coding apparatus according to claim 30, wherein a search position of each pulse is determined so as to be sparse. Since the number of each pulse is determined, the numbering method of the pulse is also defined in addition to the index of the pulse. Can be further reduced.

【００５２】本発明の請求項３２記載の発明は、パルス
探索位置の一部をピッチピーク位置によって決定し、そ
の他のパルス探索位置はピッチピーク位置に関係なく予
め定められた固定位置である請求項５から請求項８また
は請求項１１から請求項１７または請求項２３または請
求項２９のいずれかに記載の音声符号化装置であり、ピ
ッチピーク位置が誤った場合においても、音源パルスの
位置を誤る確率が減るので、伝送路誤りの影響の伝播を
抑えることができる。In the invention according to claim 32 of the present invention, a part of the pulse search position is determined by the pitch peak position, and the other pulse search positions are predetermined fixed positions irrespective of the pitch peak position. The speech encoding apparatus according to any one of claims 5 to 8, 11 to 17, or 23 or 29, wherein the position of the excitation pulse is incorrect even when the pitch peak position is incorrect. Since the probability decreases, propagation of the influence of the transmission path error can be suppressed.

【００５３】本発明の請求項３３記載の発明は、所定の
時間長を有する音声あるいは音源信号のピッチピーク位
置を求める際に、当該信号からピッチ周期長のみを切り
出し、切り出した信号内においてピッチピーク位置を決
定するピッチピーク位置算出手段を有する請求項１から
請求項８または請求項１１から請求項１７または請求項
１９から請求項２３または請求項２５から請求項３２の
いずれかに記載の音声符号化装置であり、１ピッチ波形
の中からピッチピークを選択するため、単純に振幅値
（絶対値）が最大になる点を探索すれば良く、サブフレ
ームに１ピッチ周期を超える波形が含まれていても正確
にピッチピーク位置を求めることができる。According to a thirty-third aspect of the present invention, when determining the pitch peak position of a voice or sound source signal having a predetermined time length, only the pitch period length is cut out from the signal and the pitch peak is included in the cut out signal. The speech code according to any one of claims 1 to 8, or 11 to 17, or 19 to 23, or 25 to 32, comprising a pitch peak position calculation means for determining a position. In order to select a pitch peak from a one-pitch waveform, it is sufficient to simply search for a point where the amplitude value (absolute value) becomes maximum, and a subframe includes a waveform that exceeds one pitch period. However, the pitch peak position can be accurately obtained.

【００５４】本発明の請求項３４記載の発明は、当該信
号からピッチ周期長のみを切り出す場合に、まず１周期
長を切り出さずに当該信号全体を用いてピッチピーク位
置を決定し、この決定されたピッチピーク位置を切り出
し開始点として１ピッチ周期長を切り出し、切り出した
信号内においてピッチピーク位置を決定する請求項３３
記載の音声符号化装置であり、当該信号全体を用いてピ
ッチピーク位置を決定した場合に発生する、１ピッチ波
形内のセカンドピークをピッチピーク位置としてしまう
現象を回避することが可能となる。すなわち、ピッチ周
期とサブフレーム長が同期していないことに起因するピ
ッチピーク位置の抽出誤りを回避することが可能とな
る。According to the thirty-fourth aspect of the present invention, when only the pitch period length is cut out from the signal, the pitch peak position is determined by using the entire signal without cutting out one cycle length, and the pitch peak position is determined. 34. One pitch period length is cut out using the cut-out pitch peak position as a cut-out start point, and the pitch peak position is determined in the cut-out signal.
It is possible to avoid a phenomenon in which a second peak in one pitch waveform is set as a pitch peak position, which occurs when a pitch peak position is determined using the entire signal. That is, it is possible to avoid a pitch peak position extraction error caused by the synchronization between the pitch period and the subframe length.

【００５５】本発明の請求項３５記載の発明は、所定の
時間長を有するサブフレーム毎に音声符号化処理を行う
ＣＥＬＰ型音声符号化装置において、現在のサブフレー
ムにおけるピッチピーク位置を算出する際、直前のサブ
フレームにおけるピッチ周期と現在のサブフレームにお
けるピッチ周期との差が予め定められた範囲内である場
合は、直前のサブフレームにおけるピッチピーク位置
と、直前のサブフレームにおけるピッチ周期と、現在の
サブフレームにおけるピッチ周期を用いて現在のサブフ
レームにおけるピッチピーク位置を予測し、この予測に
よって得られた現在のサブフレームにおけるピッチピー
ク位置を用いて現在のサブフレームにおけるピッチピー
ク位置の存在範囲を予め限定し、その範囲内でピッチピ
ーク位置探索を行う請求項１から請求項８または請求項
１１から請求項１７または請求項１９から請求項２３ま
たは請求項２５から請求項３２のいずれかに記載の音声
符号化装置であり、直前のサブフレームのピッチピーク
位置を考慮して現在のサブフレームのピッチピーク位置
を決定するため、現在のサブフレームのみからピッチピ
ーク位置を求めると１ピッチピーク波形内のセカンドピ
ーク位置を誤検出してしまうような場合に、誤検出を回
避する手法となる。According to a thirty-fifth aspect of the present invention, there is provided a CELP-type speech coding apparatus for performing speech coding processing for each subframe having a predetermined time length when calculating a pitch peak position in a current subframe. If the difference between the pitch cycle in the immediately preceding subframe and the pitch cycle in the current subframe is within a predetermined range, the pitch peak position in the immediately preceding subframe, and the pitch cycle in the immediately preceding subframe, Predict the pitch peak position in the current subframe using the pitch period in the current subframe, and use the pitch peak position in the current subframe obtained by this prediction to determine the range of the pitch peak position in the current subframe Is limited in advance and a pitch peak position search is performed within that range. The speech encoding apparatus according to any one of claims 1 to 8 or claim 11 to claim 17 or claim 19 to claim 23 or claim 25 to claim 32, wherein the pitch of the immediately preceding subframe is In order to determine the pitch peak position of the current subframe in consideration of the peak position, if the pitch peak position is obtained only from the current subframe, the second peak position in one pitch peak waveform may be erroneously detected. This is a technique for avoiding erroneous detection.

【００５６】本発明の請求項３６記載の発明は、所定の
時間長を有するサブフレーム毎に音声符号化処理を行う
ＣＥＬＰ型音声符号化装置において、雑音符号帳として
パルス音源を用い、雑音符号帳のモードを少なくとも２
モード以上有し、音源パルスの本数はモードを切り替え
ることによって変化させることができ、少なくとも１つ
は各パルスの位置情報が十分に取ってあるパルス本数の
少ないモードであり、その他は各パルスの位置情報が不
足するがパルス数の多いモードであり、モードの切り替
え情報を伝送してモードの切り替えを行う音声符号化装
置であり、位置情報が十分な少ない音源パルス数のモー
ドを備えることによって、音声信号の有声立ち上がり部
の品質向上を図り、また位置情報が不十分である音源パ
ルス数の多いモードを有効利用することができる。According to a thirty-sixth aspect of the present invention, in a CELP type speech coding apparatus for performing speech coding processing for each subframe having a predetermined time length, a pulse excitation is used as a noise codebook and a noise codebook is used. Mode of at least 2
The number of sound source pulses can be changed by switching the mode. At least one is a mode in which the position information of each pulse is sufficiently small and the number of pulses is small, and the other is the position of each pulse. This is a speech encoding device that lacks information but has a large number of pulses, and is a speech encoding device that performs mode switching by transmitting mode switching information. The quality of the voiced rising portion of the signal can be improved, and a mode having a large number of sound source pulses with insufficient position information can be effectively used.

【００５７】本発明の請求項３７記載の発明は、ピッチ
周期が短い場合には、ピッチ周期に対応して音源パルス
の探索範囲を狭い範囲内に限定することによって、音源
パルスの位置情報を減らして音源パルスの本数を増やす
請求項３６記載の音声符号化装置であり、短いピッチ周
期のピッチ周期性を有する音源信号に対しては、１ピッ
チ周期当たりにおける音源パルスの位置情報を十分に保
ったまま音源パルスの本数を増やすことができ、音声品
質の向上を図ることができる。According to the thirty-seventh aspect of the present invention, when the pitch period is short, the search range of the source pulse is limited to a narrow range corresponding to the pitch period, thereby reducing the position information of the source pulse. 37. The speech encoding apparatus according to claim 36, wherein the number of excitation pulses is increased by increasing the number of excitation pulses, and position information of the excitation pulses per pitch period is sufficiently maintained for an excitation signal having a short pitch period. The number of sound source pulses can be increased as it is, and the sound quality can be improved.

【００５８】本発明の請求項３８記載の発明は、各パル
スの位置情報が不足するがパルス数の多いモードにおい
ては、ピッチピーク位置近傍は音源パルスの探索位置を
密に、それ以外の部分においては音源パルスの探索位置
を疎になるように、パルス位置の探索範囲を決定する請
求項３６または請求項３７記載の音声符号化装置であ
り、音源パルスが立てられる確率の高い部分に音源パル
スの位置情報を集中させるため、音源パルスの位置情報
が不十分である音源パルス数の多いモードの利用効率を
高めることができる。According to the thirty-eighth aspect of the present invention, in the mode in which the position information of each pulse is insufficient but the number of pulses is large, the search position of the sound source pulse is densely located near the pitch peak position, and in other parts. The speech encoding apparatus according to claim 36 or 37, wherein a search range of the pulse position is determined so that the search position of the excitation pulse is sparse. Since the position information is concentrated, the use efficiency of a mode having a large number of sound source pulses where the position information of the sound source pulse is insufficient can be improved.

【００５９】本発明の請求項３９記載の発明は、請求項
３６または請求項３７または請求項３８記載のＣＥＬＰ
型音声符号化装置において、パルス数が少なく位置情報
が十分である音源モードにおいて、位置情報の一部を雑
音性の音源コードベクトルを表すインデックスに割り当
てるようにした音声符号化装置であり、新たなモードを
設けることなく無声子音部や雑音的な入力信号にも対応
することができる。The invention according to claim 39 of the present invention provides the CELP according to claim 36 or 37 or 38.
In a speech coding apparatus of the type, in a sound source mode in which the number of pulses is small and the position information is sufficient, a part of the position information is assigned to an index representing a noisy excitation code vector, and a new It is possible to cope with an unvoiced consonant part or a noise-like input signal without providing a mode.

【００６０】本発明の請求項４０に記載の発明は、請求
項１から３９のいずれかに記載の音声符号化装置の機能
を実行させるためのプログラムを記録したコンピュータ
読み取り可能な記録媒体であり、このような記録媒体を
コンピュータで読み取ることにより音声符号化装置の機
能を実現することができる。According to a forty-third aspect of the present invention, there is provided a computer-readable recording medium having recorded thereon a program for executing the function of the speech coding apparatus according to any one of the first to thirty-ninth aspects. By reading such a recording medium by a computer, the function of the audio encoding device can be realized.

【００６１】以下、本発明の実施の形態における音声符
号化装置の音源生成部について、図１から図１０を用い
て説明する。Hereinafter, the excitation generator of the speech encoding apparatus according to the embodiment of the present invention will be described with reference to FIGS.

【００６２】（実施の形態１）図１は本発明の第１の実
施の形態を示し、適応符号ベクトルのピッチピーク位置
に対応する雑音符号ベクトルの振幅を強調する音声符号
化装置の音源生成部を示す。図１において、１１は適応
符号ベクトルをピッチピーク位置検出器１２に出力する
適応符号帳、１２は適応符号帳１１から出力された適応
符号ベクトルを入力として、ピッチピーク位置を振幅強
調窓生成器１３に出力するピッチピーク位置算出器、１
３はピッチピーク位置算出器１２から出力されたピッチ
ピーク位置を入力として、振幅強調窓を振幅強調窓掛け
器１６に出力する振幅強調窓生成器、１４は雑音符号ベ
クトルを格納し、周期化器１５へ出力する雑音符号帳、
１５は雑音符号帳１４から出力された雑音符号ベクトル
とピッチ周期Ｌを入力として、雑音符号ベクトルをピッ
チ周期化して振幅強調窓掛け器１６に出力する周期化
器、１６は振幅強調窓生成器１３から出力された振幅強
調窓と周期化器１５から出力された雑音符号ベクトルを
入力とし、雑音符号ベクトルに振幅強調窓を乗じて、最
終的な雑音符号ベクトルを出力する振幅強調窓掛け器で
ある。(Embodiment 1) FIG. 1 shows a first embodiment of the present invention, in which a sound source generator of a speech coding apparatus for enhancing the amplitude of a noise code vector corresponding to a pitch peak position of an adaptive code vector. Is shown. In FIG. 1, reference numeral 11 denotes an adaptive codebook that outputs an adaptive code vector to a pitch peak position detector 12. Pitch position calculator to output to
Reference numeral 3 denotes an amplitude emphasis window generator which receives the pitch peak position output from the pitch peak position calculator 12 and outputs an amplitude emphasis window to the amplitude emphasis window multiplier 16, and 14 stores a noise code vector and a periodizer. 15, a random codebook to be output to
Reference numeral 15 denotes a periodicizer which receives the noise code vector output from the noise codebook 14 and the pitch period L as input, converts the noise code vector to a pitch period, and outputs the pitch period to an amplitude emphasizing window multiplier 16. Is an amplitude emphasizing window multiplier that receives the amplitude emphasizing window output from the controller and the noise code vector output from the periodizer 15 as inputs, multiplies the noise code vector by the amplitude emphasizing window, and outputs a final noise code vector. .

【００６３】以上のように構成されたＣＥＬＰ型音声符
号化装置の音源生成部の動作について図１を用いて説明
する。ピッチピーク位置算出器１２は、入力された適応
符号ベクトルを用いて適応符号ベクトル内に存在するピ
ッチパルスの位置を決定する。ピッチパルスの位置は、
ピッチ周期で並べたインパルス列と適応符号ベクトルと
の正規化相互相関を最大化することによって行うことが
できる。また、ピッチ周期で並べたインパルス列を合成
フィルタに通したものと、適応符号ベクトルを合成フィ
ルタに通したもとの誤差を最小化することによっても可
能である。The operation of the excitation generating section of the CELP speech coding apparatus configured as described above will be described with reference to FIG. The pitch peak position calculator 12 determines the position of the pitch pulse existing in the adaptive code vector using the input adaptive code vector. The position of the pitch pulse is
This can be performed by maximizing the normalized cross-correlation between the impulse train arranged in the pitch period and the adaptive code vector. It is also possible to minimize the error caused by passing the impulse train arranged in the pitch cycle through the synthesis filter and by minimizing the error caused by passing the adaptive code vector through the synthesis filter.

【００６４】振幅強調窓生成器１３は、ピッチピーク位
置算出器１２によって決定されたピッチパルス位置に基
づいて振幅強調窓を生成する。振幅強調窓としては、種
々なものを用いることが可能であるが、例えば、ピッチ
パルス位置を中心とする三角窓が窓長の制御が容易な点
において有利である。The amplitude emphasis window generator 13 generates an amplitude emphasis window based on the pitch pulse position determined by the pitch peak position calculator 12. Various types of amplitude emphasis windows can be used. For example, a triangular window centered on the pitch pulse position is advantageous in that the window length can be easily controlled.

【００６５】図２は振幅強調窓生成器１３から出力され
る振幅強調窓の形状と適応符号ベクトルの形状の対応を
示す。図中破線の位置がピッチピーク位置算出器１２に
よって決定されたピッチパルス位置である。FIG. 2 shows the correspondence between the shape of the amplitude enhancement window output from the amplitude enhancement window generator 13 and the shape of the adaptive code vector. The position indicated by the broken line in the figure is the pitch pulse position determined by the pitch peak position calculator 12.

【００６６】周期化器１５は、雑音符号帳１４から出力
された雑音符号ベクトルをピッチ周期化する。ピッチ周
期化は、雑音符号ベクトルをピッチ周期で周期化するも
ので、雑音符号帳の格納ベクトルを先頭からピッチ周期
Ｌの分だけ切り出し、それをサブフレーム長に達するま
で複数回繰り返して接続することによって行われる。た
だし、ピッチ周期化が行われるのは、ピッチ周期がサブ
フレーム長以下の場合のみである。The periodizer 15 pitch-cycles the random code vector output from the random codebook 14. Pitch periodicization is a method of periodicizing a random code vector with a pitch period. The stored vector of the random codebook is cut out from the beginning by the pitch period L, and connected repeatedly a plurality of times until the subframe length is reached. Done by However, the pitch period is set only when the pitch period is equal to or less than the subframe length.

【００６７】振幅強調窓掛け器１６は、周期化器１５か
ら出力された雑音符号ベクトルに振幅強調窓生成器１３
から出力された振幅強調窓を乗ずる。The amplitude emphasizing window generator 16 adds the noise emphasizing window generator 13 to the noise code vector output from the periodizer 15.
Multiply by the amplitude emphasis window output from.

【００６８】このように、上記第１の実施の形態によれ
ば、１ピッチ波形内に存在する位相情報を利用して、音
質向上を図ることができる。As described above, according to the first embodiment, the sound quality can be improved by utilizing the phase information existing in one pitch waveform.

【００６９】なお、図１では、雑音符号ベクトルの周期
化を行うＣＥＬＰ型音声符号化装置の音源部分について
説明したが、図１１に示すような雑音符号帳に格納され
た雑音符号ベクトルをそのまま使用する一般的なＣＥＬ
Ｐ型音声符号化装置の音源部分に対しても実施は可能で
あり、その例を図３に示す。図３において、２１は適応
符号帳、２２はピーチピーク位置算出器、２３は振幅強
調窓生成器、２４は雑音符号帳、２５は振幅強調窓掛け
器であり、雑音音源をピッチ同期に同期させないことだ
けが図１の音源生成部と異なる。In FIG. 1, the excitation part of the CELP-type speech coding apparatus for performing the periodicization of the random code vector has been described. However, the random code vector stored in the random code book as shown in FIG. 11 is used as it is. Common CEL
The present invention is also applicable to the sound source portion of the P-type speech coding apparatus, and an example is shown in FIG. In FIG. 3, 21 is an adaptive codebook, 22 is a peach peak position calculator, 23 is an amplitude emphasis window generator, 24 is a noise codebook, and 25 is an amplitude emphasis window multiplier, which does not synchronize a noise source with pitch synchronization. This is different from the sound source generation unit in FIG.

【００７０】（実施の形態２）図４は本発明の第２の実
施の形態を示し、音声信号の有声部の立ち上がり部分に
対してパルス列音源と雑音音源を組み合わせた音源を適
用する構成を有するＣＥＬＰ型音声符号化装置に対し
て、パルス列音源のパルス位置に対応する雑音符号ベク
トルの振幅を強調する音声符号化装置の音源生成部を示
している。図４において、３１は振幅強調窓生成器３２
および加算器３３に出力されて、ピッチパルスの位置に
置かれたピッチ周期Ｌの間隔で並べられたインパルス列
からなるパルス列音源、３２はパルス列のパルス位置に
対応する位置の雑音符号ベクトル振幅を強調するための
振幅強調窓を生成して、乗算器３５に出力する振幅強調
窓生成器、３３はパルス列音源と乗算器３５から出力さ
れた振幅強調窓掛け後の雑音符号ベクトルを加算して、
励振ベクトルとして出力する加算器、３４は雑音符号ベ
クトルで表現され、乗算器３５へ出力される雑音音源、
３５は雑音音源３４から出力された雑音音源ベクトルに
対して振幅強調窓生成器３２から出力された振幅強調窓
を乗ずる乗算器である。(Embodiment 2) FIG. 4 shows a second embodiment of the present invention, which has a configuration in which a sound source combining a pulse train sound source and a noise sound source is applied to the rising portion of a voiced portion of a voice signal. FIG. 3 shows a sound source generation unit of a speech coding device that emphasizes the amplitude of a noise code vector corresponding to a pulse position of a pulse train excitation with respect to a CELP type speech coding device. In FIG. 4, reference numeral 31 denotes an amplitude enhancement window generator 32.
And a pulse train source composed of impulse trains output to the adder 33 and arranged at intervals of the pitch period L placed at the position of the pitch pulse. 32 emphasizes the noise code vector amplitude at a position corresponding to the pulse position of the pulse train. An amplitude emphasis window generator for generating an amplitude emphasis window for output to the multiplier 35 and adding the pulse train excitation and the noise code vector after the amplitude emphasis window output from the multiplier 35,
An adder that outputs as an excitation vector; 34 is a noise source represented by a noise code vector and output to a multiplier 35
A multiplier 35 multiplies the noise source vector output from the noise source 34 by the amplitude enhancement window output from the amplitude enhancement window generator 32.

【００７１】以上のように構成された音源生成部につい
て、図４を用いてその動作を説明する。パルス列音源３
１は、ピッチ周期Ｌと初期位相Ｐによってパルスの位置
と間隔が決定されているパルス列であり、ピッチ周期Ｌ
および初期位相Ｐは音源生成部の外部で別途計算され
る。なお、パルス列音源は、インパルスを並べたもので
も良いが、サンプリング点とサンプリング点の間に存在
するのインパルスを表現できる方が性能がよい。同様に
初期位相（最初のパルスの位置）も、サンプリング点と
サンプリング点の間を表現できる分数精度で表す方が性
能が良くなるが、この情報に割り当てることが可能なビ
ット数が十分でない場合は、整数精度でも良い性能が得
られ、位置決定のための探索も容易である。The operation of the sound source generator configured as described above will be described with reference to FIG. Pulse train sound source 3
1 is a pulse train whose pulse position and interval are determined by the pitch period L and the initial phase P.
And the initial phase P is separately calculated outside the sound source generation unit. The pulse train sound source may be one in which impulses are arranged, but the performance is better if the impulse existing between the sampling points can be expressed. Similarly, the performance of the initial phase (the position of the first pulse) is better if it is expressed with fractional precision that can represent between sampling points. However, if the number of bits that can be allocated to this information is not enough, Good performance can be obtained even with integer precision, and search for position determination is easy.

【００７２】振幅強調窓生成器３２は、パルス列音源ベ
クトルのパルスの位置に対応する位置の雑音音源ベクト
ルの振幅を強調するための窓であり、第１の実施の形態
で説明した振幅強調窓と同様のものである。パルスの位
置を中心とする三角窓などを用いることができる。The amplitude emphasis window generator 32 is a window for emphasizing the amplitude of the noise source vector at the position corresponding to the pulse position of the pulse train excitation vector, and includes the amplitude emphasis window described in the first embodiment. It is similar. A triangular window centered on the position of the pulse can be used.

【００７３】加算器３３は、パルス列音源ベクトル３１
と振幅強調窓が乗算器３５によって乗ぜられた雑音音源
ベクトル３４とを加算して、励振音源ベクトルとして出
力する。The adder 33 outputs the pulse train excitation vector 31
And the noise source vector 34 multiplied by the amplitude emphasizing window by the multiplier 35 and output as an excitation source vector.

【００７４】なお、図４には示されていないが、加算器
３３に入力される前にパルス列音源ベクトルおよび雑音
音源ベクトルのそれぞれに適切な利得を乗ずる構成にす
ると、より表現性の高い音源生成部となる。ただし、そ
の場合、利得情報を別途伝送する必要が生ずる。また、
パルス列音源ベクトルと雑音音源ベクトルの利得を固定
する場合は、パルス列音源ベクトルが雑音音源ベクトル
に埋もれてしまわないように、パルス列音源ベクトルの
パワーと雑音音源ベクトルのパワーが等しくなるように
調整するなどの利得調整は必要である。Although not shown in FIG. 4, if the pulse train excitation vector and the noise excitation vector are each multiplied by an appropriate gain before being input to the adder 33, the sound source generation with higher expressiveness can be achieved. Department. However, in that case, it is necessary to separately transmit the gain information. Also,
When fixing the gain of the pulse train excitation vector and the noise excitation vector, adjust the power of the pulse train excitation vector and the power of the noise excitation vector to be equal so that the pulse train excitation vector is not buried in the noise excitation vector. Gain adjustment is required.

【００７５】このように、上記第２の実施の形態によれ
ば、雑音音源ベクトルの振幅をピッチ周期に同期して強
調することによって、音質向上を図ることができる。As described above, according to the second embodiment, the sound quality can be improved by emphasizing the amplitude of the noise source vector in synchronization with the pitch period.

【００７６】（実施の形態３）図５は本発明の第３の実
施の形態を示し、ＣＥＬＰ型音声符号化装置において、
適応符号ベクトルのピッチピーク近傍のみに限定した雑
音符号ベクトルを用いた音声符号化装置の音源生成部を
示す。(Embodiment 3) FIG. 5 shows a third embodiment of the present invention.
5 shows a sound source generation unit of a speech coding apparatus using a noise code vector limited to only the vicinity of a pitch peak of an adaptive code vector.

【００７７】図５において、４１は適応符号ベクトルを
出力する適応符号帳、４２は適応符号帳４１から出力さ
れた適応符号ベクトルとピッチ周期Ｌを入力として、ピ
ッチピークの位置（位相情報）を雑音符号ベクトル生成
器４４に出力する位相探索器、４３はピッチピークの近
傍のみにベクトル長を限定した雑音符号ベクトルを格納
し、ピッチパルス位置近傍の雑音符号ベクトルを雑音符
号ベクトル生成器４４に出力するピッチパルス位置近傍
限定型雑音符号帳、４４はピッチパルス位置近傍限定型
雑音符号帳４３から出力された雑音符号ベクトルと位相
探索器４２から出力された位相情報とピッチ周期Ｌを入
力として、雑音符号ベクトルを周期化器４５に出力する
雑音符号ベクトル生成器、４５は雑音符号ベクトル生成
器４４から出力された雑音符号ベクトルとピッチ周期Ｌ
を入力として、最終的な雑音符号ベクトルを出力する周
期化器である。In FIG. 5, reference numeral 41 denotes an adaptive codebook for outputting an adaptive code vector, and reference numeral 42 denotes an input of the adaptive code vector output from the adaptive codebook 41 and the pitch period L, and the position (phase information) of a pitch peak is determined by noise. The phase searcher 43 outputs to the code vector generator 44 a noise code vector whose vector length is limited only in the vicinity of the pitch peak, and outputs the noise code vector near the pitch pulse position to the noise code vector generator 44. A noise codebook 44 limited to near the pitch pulse position is input with the noise code vector output from the noise codebook 43 limited to the near pitch pulse position, the phase information output from the phase searcher 42, and the pitch period L as inputs. A random code vector generator that outputs the vector to the periodicizer 45, and 45 is a signal output from the random code vector generator 44. Noise code vector and the pitch cycle L
, And outputs a final random code vector.

【００７８】以上のように構成された音声符号化装置の
音源生成部について、図５を用いてその動作を説明す
る。位相探索器４２は、適応符号帳４１から出力された
適応符号ベクトルを用いて、適応符号ベクトル内に存在
するピッチパルスの位置（位相）を決定する。ピッチパ
ルスの位置は、ピッチ周期で並べたインパルス列と適応
符号ベクトルとの正規化相互相関を最大化することによ
って行うことができる。また、ピッチ周期で並べたイン
パルス列を合成フィルタに通したものと、適応符号ベク
トルを合成フィルタに通したもとの誤差を最小化するこ
とによって、より精度良く求めることも可能である。The operation of the sound source generation unit of the speech coding apparatus configured as described above will be described with reference to FIG. The phase searcher 42 uses the adaptive code vector output from the adaptive codebook 41 to determine the position (phase) of the pitch pulse existing in the adaptive code vector. The position of the pitch pulse can be determined by maximizing the normalized cross-correlation between the impulse train arranged in the pitch cycle and the adaptive code vector. Further, it is possible to obtain the impulse trains arranged in the pitch cycle through a synthesis filter, and to minimize the error caused by passing the adaptive code vector through the synthesis filter, thereby obtaining the error more accurately.

【００７９】ピッチパルス位置近傍限定型雑音符号帳４
３は、適応符号ベクトルのピッチピーク近傍に適用する
ための雑音符号ベクトルを格納しており、ベクトル長
は、ピッチ周期やフレーム（サブフレーム）長によらず
固定長である。ピッチピーク近傍の範囲としては、ピッ
チピークを中心として前後等しい長さとしてもよいが、
ピッチピークの後の範囲を前よりも長く取る方が、音質
劣化が少ない。例えば、近傍の範囲を５ｍｓｅｃとした
場合、ピッチピークの前後を２．５ｍｓｅｃずつ取るよ
りも、ピッチピークの前を０．６２５ｍｓｅｃ、ピッチ
ピークの後ろを４．３７５ｍｓｅｃの様にした方が良
い。また、ベクトル長としては、サブフレーム長が１０
ｍｓｅｃの場合で、５ｍｓｅｃ程度であればベクトル長
を１０ｍｓｅｃ以上にした場合とほぼ同等の音質を実現
できる。Noise code book 4 limited to near pitch pulse position
Numeral 3 stores a noise code vector to be applied to the vicinity of the pitch peak of the adaptive code vector, and the vector length is fixed regardless of the pitch period and the frame (subframe) length. The range near the pitch peak may be equal in length before and after the pitch peak,
When the range after the pitch peak is longer than before, the sound quality is less deteriorated. For example, when the vicinity range is set to 5 msec, it is better to set 0.625 msec before the pitch peak and 4.375 msec after the pitch peak, rather than 2.5 msec before and after the pitch peak. As the vector length, the subframe length is 10
In the case of msec, if the vector length is about 5 msec, almost the same sound quality as when the vector length is 10 msec or more can be realized.

【００８０】雑音符号ベクトル生成器４４は、ピッチパ
ルス位置限定型雑音符号帳４３から出力された雑音符号
ベクトルを、位相探索器４２によって決定されたピッチ
パルスの位置に配置する。The noise code vector generator 44 arranges the noise code vector output from the pitch pulse position limited noise codebook 43 at the position of the pitch pulse determined by the phase searcher 42.

【００８１】図６および図７はピッチパルス位置限定型
雑音符号帳４３から出力された雑音符号ベクトルを、雑
音符号ベクトル生成器４４によってピッチパルス位置に
対応する位置に配置する方法を図解したものである。基
本的には、図６（ａ）に示すように、ピッチパルス位置
の近傍にピッチパルス位置限定雑音符号ベクトルを配置
する。図６において、ピッチ周期化範囲と示されている
部分（斜線部）は、周期化部４５においてピッチ周期化
する場合に対象となる部分である。図６（ａ）のような
場合、雑音符号ベクトル生成器４４においてピッチ周期
化を行う必要はないが、図６（ｂ）に示すような場合に
は、ピッチパルスの位置がサブフレーム境界の近くにあ
るため、ピッチパルス位置限定型雑音符号帳４３から出
力された雑音符号ベクトルの前半部（サブフレーム境界
より前の部分）を周期化部４５において周期化すること
ができないので（周期化部４５においては、サブフレー
ム境界からピッチ周期長だけ切り出したベクトルをピッ
チ周期で繰り返し並べる。）、雑音符号ベクトル生成器
４４において予めピッチ周期化するように動作させる。
また、サブフレーム境界の直前にピッチパルス位置があ
る場合、サブフレームの先頭からピッチ周期だけ切り出
して周期化すると、ピッチパルス位置近傍限定ベクトル
の後半部分が適切にピッチ周期化されないため、図７
（ａ）に示すように、雑音ベクトル生成器４４は時間軸
の負の方向にもピッチ周期化するように動作する。ただ
し、ピッチパルス位置がサブフレーム先頭からピッチ周
期長の間に存在しない場合はこの周期化は必要ない。こ
のようにピッチ周期化部４５に先立ってピッチ周期化を
行なっておくことにより、ピッチ位置近傍限定ベクトル
の全ての部分を有効に用いたピッチ周期化がピッチ周期
化部４５で行われるようにしている。なお、ピッチ周期
がピッチパルス位置近傍に限定したベクトル長より短い
場合は、限定ベクトルの中からピッチ周期長だけ切り出
してピッチ周期化を行う。この場合、切り出し方はいろ
いろ考えられるが、ピッチパルス位置が切り出したベク
トルに含まれるように切り出す。例えば、ピッチパルス
位置から４分の１ピッチ周期前の点から１ピッチ周期分
を切り出すというように、ピッチパルス位置とピッチ周
期を用いて切り出し開始点を決定する。FIGS. 6 and 7 illustrate a method of arranging the noise code vector output from the pitch pulse position limited type noise codebook 43 at a position corresponding to the pitch pulse position by the noise code vector generator 44. FIG. is there. Basically, as shown in FIG. 6A, a pitch pulse position limited noise code vector is arranged near the pitch pulse position. In FIG. 6, a portion (hatched portion) indicated as a pitch period range is a portion which is a target when the pitch period is set by the period unit 45. In the case of FIG. 6A, it is not necessary to perform the pitch period in the noise code vector generator 44. However, in the case of FIG. 6B, the position of the pitch pulse is near the subframe boundary. , The first half (the part before the subframe boundary) of the noise code vector output from the pitch pulse position limited type noise codebook 43 cannot be periodicized by the periodicizing unit 45 (the periodicizing unit 45). In, the vector cut out from the subframe boundary by the pitch period length is repeatedly arranged in the pitch period.) The noise code vector generator 44 is operated to make the pitch period in advance.
In addition, when the pitch pulse position is located immediately before the subframe boundary, if the pitch pulse is cut out from the head of the subframe and is periodicized, the latter half of the limited vector near the pitch pulse position is not properly pitched.
As shown in (a), the noise vector generator 44 operates to make the pitch period in the negative direction of the time axis. However, if the pitch pulse position does not exist between the head of the subframe and the pitch period length, this periodization is not necessary. As described above, by performing the pitch period prior to the pitch period unit 45, the pitch period unit 45 effectively uses all the parts of the pitch position vicinity limited vector so that the pitch period unit 45 performs the pitch period. I have. When the pitch cycle is shorter than the vector length limited to the vicinity of the pitch pulse position, the pitch cycle is performed by cutting out the limited vector by the pitch cycle length. In this case, although there are various ways of cutting out, the cutting is performed so that the pitch pulse position is included in the cut out vector. For example, the clipping start point is determined using the pitch pulse position and the pitch cycle, such that one pitch cycle is cut from a point one quarter pitch cycle before the pitch pulse position.

【００８２】図７（ｂ）はピッチ周期が限定ベクトル長
より短い場合の、雑音符号ベクトルの切り出し方法の一
例を示す。この場合、ピッチパルス位置近傍限定雑音符
号ベクトルの先頭からピッチ周期長を切り出すようにし
ている。このようにすると切り出し開始点を毎回算出す
る必要がなくなる。すなわち、上記したようにピッチパ
ルス位置から４分の１ピッチ周期前の点から１ピッチ周
期分を切り出す場合、ピッチ周期が変数であるため、４
分の１ピッチ周期を毎回計算する必要があるが、ピッチ
パルス位置近傍限定雑音符号ベクトルの先頭位置は固定
値であるため、この計算が不要となる。ただし、ピッチ
パルス位置近傍限定雑音符号ベクトルの先頭からピッチ
周期長だけ切り出したベクトルに、ピッチパルス位置に
対応する部分が含まれない場合は、ピッチパルス位置に
対応する部分が含まれるように切り出しを開始する位置
をずらす必要がある。FIG. 7B shows an example of a method for extracting a noise code vector when the pitch period is shorter than the limited vector length. In this case, the pitch period length is cut out from the head of the limited noise code vector near the pitch pulse position. This eliminates the need to calculate the cutout start point every time. That is, as described above, when one pitch period is cut out from a point one quarter pitch period before the pitch pulse position, the pitch period is a variable.
Although it is necessary to calculate the one-half pitch period every time, this calculation is unnecessary because the head position of the limited noise code vector near the pitch pulse position is a fixed value. However, if the vector extracted by the pitch period length from the head of the pitch pulse position vicinity limited noise code vector does not include the portion corresponding to the pitch pulse position, the extraction is performed so that the portion corresponding to the pitch pulse position is included. It is necessary to shift the starting position.

【００８３】周期化器４５は、雑音符号ベクトル生成器
４４から出力された雑音符号ベクトルをピッチ周期化す
る。ピッチ周期化は、雑音符号ベクトルをピッチ周期で
周期化するもので、雑音符号ベクトルを先頭からピッチ
周期Ｌの分だけ切り出し、それをサブフレーム長に達す
るまで複数回繰り返して接続することによって行われ
る。ただし、ピッチ周期化が行われるのは、ピッチ周期
がサブフレーム長以下の場合のみである。なお、分数精
度のピッチ周期の場合は、分数精度の点を補間によって
算出して得られるベクトルを接続する。The periodizer 45 pitch-cycles the noise code vector output from the noise code vector generator 44. Pitch periodization is a method of periodizing a noise code vector with a pitch period, and is performed by cutting out the noise code vector by the pitch period L from the head and repeating and connecting it a plurality of times until the subframe length is reached. . However, the pitch period is set only when the pitch period is equal to or less than the subframe length. In the case of a pitch cycle with fractional precision, vectors obtained by calculating points with fractional precision by interpolation are connected.

【００８４】このように、上記第３の実施の形態によれ
ば、適応符号ベクトルのピッチピーク近傍のみに限定し
た雑音符号ベクトルを用いることにより、雑音符号ベク
トルに割り当てられるビット数が少ない場合でも、音質
劣化を少なくでき、ピッチパルス近傍に残差パワーが集
中するような有声部で音質向上を図ることができる。As described above, according to the third embodiment, by using the noise code vector limited only to the vicinity of the pitch peak of the adaptive code vector, even when the number of bits allocated to the noise code vector is small, Sound quality deterioration can be reduced, and sound quality can be improved in voiced parts where residual power is concentrated near the pitch pulse.

【００８５】（実施の形態４）図８は本発明の第４の実
施の形態を示し、パルス位置の探索範囲を適応符号ベク
トルのピッチ周期およびピッチピーク位置によって決定
する音声符号化装置の音源生成部を示す。図８におい
て、５１は過去の励振音源ベクトルを保存し、選択され
た適応符号ベクトルをピッチピーク位置算出器５２およ
びピッチゲイン乗算器５５に出力する適応符号帳、５２
は適応符号帳５１から出力された適応符号ベクトルとピ
ッチ周期Ｌを入力としてピッチピーク位置を算出し、探
索範囲算出器５３に出力するピッチピーク位置算出器、
５３はピッチピーク位置算出器５２から出力されたピッ
チピーク位置とピッチ周期Ｌを入力としてパルス音源を
探索する範囲を算出し、パルス音源探索器５４へ出力す
る探索範囲算出器、５４は探索範囲算出器５３から出力
された探索範囲と、ピッチ周期Ｌを入力としてパルス音
源を探索し、パルス音源ベクトルをパルス音源ゲイン乗
算器５６に出力するパルス音源探索器、５５は適応符号
帳から出力された適応符号ベクトルにピッチゲインを乗
じて加算器５７に出力する乗算器、５６はパルス音源探
索器から出力されたパルス音源ベクトルにパルス音源ゲ
インを乗じて加算器５７に出力する乗算器、５７は乗算
器５５からの出力と乗算器５６からの出力を入力とし、
加算して、励振音源ベクトルとして出力する加算器であ
る。(Embodiment 4) FIG. 8 shows a fourth embodiment of the present invention, in which sound source generation of a speech coding apparatus for determining a search range of a pulse position by a pitch period and a pitch peak position of an adaptive code vector. Indicates a part. In FIG. 8, reference numeral 51 denotes an adaptive codebook which stores the past excitation source vector and outputs the selected adaptive code vector to the pitch peak position calculator 52 and the pitch gain multiplier 55.
Calculates a pitch peak position using the adaptive code vector output from the adaptive code book 51 and the pitch period L as inputs, and outputs a pitch peak position calculator to the search range calculator 53;
Reference numeral 53 denotes a search range calculator for calculating a range in which to search for a pulse sound source using the pitch peak position and the pitch period L output from the pitch peak position calculator 52 as inputs, and 54 a search range calculator. A pulse excitation searcher that searches for a pulse excitation by using the search range output from the multiplier 53 and the pitch period L as inputs, and outputs a pulse excitation vector to a pulse excitation gain multiplier 56. 55 is an adaptive excitation output from the adaptive codebook. A multiplier that multiplies the code vector by the pitch gain and outputs the result to the adder 57, 56 is a multiplier that multiplies the pulse excitation vector output from the pulse excitation searcher by the pulse excitation gain and outputs the result to the adder 57, 57 is a multiplier The output from 55 and the output from multiplier 56 are input,
This is an adder that adds and outputs as an excitation sound source vector.

【００８６】以上のように構成された、音源生成部の動
作について、図８を用いて説明する。図８において、適
応符号帳５１は、音源生成部の外部で予め算出されるピ
ッチ周期Ｌだけ過去にさかのぼった点から、適応符号ベ
クトルをサブフレーム長だけ切り出して、適応符号ベク
トルとして出力する。ピッチ周期Ｌがサブフレーム長に
満たない場合は、切り出したピッチ周期Ｌのベクトル
を、サブフレーム長に達するまで繰り返して接続したも
のを適応符号ベクトルとして出力する。The operation of the sound source generator configured as described above will be described with reference to FIG. In FIG. 8, adaptive codebook 51 cuts out an adaptive code vector by a subframe length from a point that has been traced back by a pitch period L calculated in advance outside the excitation generation unit, and outputs it as an adaptive code vector. If the pitch period L is less than the subframe length, a vector obtained by repeatedly connecting the extracted pitch period L until the subframe length is reached is output as an adaptive code vector.

【００８７】ピッチピーク位置算出器５２は、適応符号
帳５１から出力された適応符号ベクトルを用いて適応符
号ベクトル内に存在するピッチパルスの位置を決定す
る。ピッチパルスの位置は、ピッチ周期で並べたインパ
ルス列と適応符号ベクトルとの正規化相互相関を最大化
することによって行うことができる。また、ピッチ周期
で並べたインパルス列を合成フィルタに通したものと、
適応符号ベクトルを合成フィルタに通したもとの誤差を
最小化することによって、より精度良く求めることも可
能である。The pitch peak position calculator 52 determines the position of the pitch pulse existing in the adaptive code vector using the adaptive code vector output from the adaptive code book 51. The position of the pitch pulse can be determined by maximizing the normalized cross-correlation between the impulse train arranged in the pitch cycle and the adaptive code vector. In addition, an impulse train arranged in a pitch cycle is passed through a synthesis filter,
By minimizing the error caused by passing the adaptive code vector through the synthesis filter, it is possible to obtain the error more accurately.

【００８８】探索範囲算出器５３は、入力されたピッチ
ピーク位置およびピッチ周期Ｌを用いて、パルス音源を
探索する範囲を算出する。すなわち、ピッチピークの位
置情報から１ピッチ波形の中でも聴覚的に重要な範囲を
算出し、その範囲を探索範囲として決定する。探索範囲
算出器５３によって決定される具体的な探索範囲を図９
および図１０に示す。図９（ａ）においては、ピッチピ
ーク位置から５サンプル前の位置から始めて３２サンプ
ルの範囲を探索範囲として決定する場合を示している。
有声部においては、予めピッチ周期で並べたインパルス
列をパルス音源として用いるようにすれば、２番目のパ
ルスの探索範囲の同じ位置にパルスを立てられ、効率的
に音源を表現できる。図９（ｂ）は、ピッチ周期が図９
（ａ）の時よりも長くなった場合に決定される探索範囲
の一例を示している。ピッチ周期が長い場合、図９
（ａ）のようにピッチパルス近傍を集中的に探索するよ
うにすると、１ピッチ波形に対する相対的な探索範囲が
狭くなり、表現できる周波数帯域が狭まるなどして、特
定の帯域の周波数成分の表現性が悪くなる場合がある。
このような場合は、図９（ｂ）に示すように、ピッチ周
期に応じて探索範囲を広げる代わりに、全てのサンプル
点を探索せずに１つおきあるいは２つおきのサンプル点
を探索する部分を設けることで、探索する位置の数を増
やさずに、特定の帯域の周波数成分の表現性が悪くなる
ことを回避することができる。The search range calculator 53 uses the input pitch peak position and pitch period L to calculate the range in which to search for a pulsed sound source. That is, a perceptually important range in one pitch waveform is calculated from the pitch peak position information, and the range is determined as a search range. The specific search range determined by the search range calculator 53 is shown in FIG.
And FIG. FIG. 9A shows a case where a range of 32 samples is determined as a search range starting from a position 5 samples before the pitch peak position.
In a voiced part, if an impulse train arranged in advance at a pitch cycle is used as a pulse sound source, a pulse can be raised at the same position in the search range of the second pulse, and the sound source can be expressed efficiently. FIG. 9B shows that the pitch period is
FIG. 6 shows an example of a search range that is determined when the length is longer than that of FIG. When the pitch period is long, FIG.
When the vicinity of the pitch pulse is intensively searched as in (a), the relative search range for one pitch waveform is narrowed, and the expressible frequency band is narrowed. May deteriorate.
In such a case, as shown in FIG. 9B, instead of expanding the search range according to the pitch cycle, every other or every second sample point is searched without searching all the sample points. By providing the portion, it is possible to prevent the expressiveness of the frequency component of a specific band from being deteriorated without increasing the number of search positions.

【００８９】また、図１０にはピッチパルス位置近傍は
密に、それ以外の部分は疎に、パルス位置探索範囲を限
定する方法を示している。この限定方法は、パルスが立
てられる確率が高い位置がピッチパルス近傍に集中する
統計的結果に基づいている。パルス位置探索範囲を限定
しない場合、有声部においてはピッチパルス近傍にパル
スが立てられる確率がその他の部分に立てられる確率に
比べて高くなる。ただし、その他の部分にパルスが立て
られる確率が無視できるほど小さくなるわけではない。
図１０に示すパルス位置探索範囲限定方法は、図９
（ｂ）に示す方法において、パルスが立てられる確率分
布に基づいて探索範囲限定を行う一例と言える。なお、
図９（ａ）において、ピッチ周期が短く、最初のパルス
の探索範囲が２番目のパルスの探索範囲と重なるような
場合は、２番目のパルスの探索範囲に重ならないよう
に、最初のパルスの探索範囲を狭める代わりにパルス数
を増やすという方法と、２番目のパルスの探索範囲に重
なった探索範囲に決定する方法（図９（ａ）と同じ探索
範囲決定方法）とがある。FIG. 10 shows a method for limiting the pulse position search range densely in the vicinity of the pitch pulse position and sparsely in other portions. This limiting method is based on a statistical result in which positions where the pulse is likely to be raised are concentrated near the pitch pulse. If the pulse position search range is not limited, the probability that a pulse will be raised near the pitch pulse in a voiced part will be higher than the probability that it will be raised in other parts. However, the probability that a pulse is generated in other portions is not so small as to be negligible.
The pulse position search range limiting method shown in FIG.
In the method shown in (b), it can be said that the search range is limited based on the probability distribution of the pulse. In addition,
In FIG. 9A, when the pitch period is short and the search range of the first pulse overlaps the search range of the second pulse, the first pulse is searched so as not to overlap the search range of the second pulse. There is a method of increasing the number of pulses instead of narrowing the search range, and a method of determining a search range overlapping the search range of the second pulse (the same search range determination method as in FIG. 9A).

【００９０】パルス位置探索器５４は、探索範囲算出器
５３で決定された探索範囲（位置）にパルス音源を立て
て、合成音声が入力音声と最も近くなる位置を出力す
る。特に、サブフレーム長が複数のピッチパルスを含む
ような長さでかつ有声定常部においては、ピッチ周期間
隔で並べたインパルス列をパルス音源としてインパルス
列の１本目のパルス位置を探索範囲の中から決定するの
が効率的である。パルスの立て方としては種々考えら
れ、定数本例えば４本のパルスを探索範囲、例えば３２
箇所の位置のどこかに立てる場合、３２箇所を４つに分
けて１本のパルスを割り当てられた８箇所の中の１箇所
に決定するように全ての組み合わせ（８×８×８×８通
り）を探索する方法や、３２箇所の中から４箇所を選び
だす組み合わせ全てについて探索する方法などがある。
なお、振幅１のインパルスの組み合わせの他に、複数本
例えば２本のパルスを組み合わせたパルス対の組み合わ
せや、振幅の異なるインパルスの組み合わせによるパル
スの立て方も可能である。The pulse position searcher 54 sets a pulse sound source in the search range (position) determined by the search range calculator 53 and outputs a position where the synthesized voice is closest to the input voice. In particular, in the voiced stationary part, the subframe length is a length including a plurality of pitch pulses, and in the voiced stationary part, the first pulse position of the impulse train is determined from the search range using the impulse train arranged at the pitch period interval as the pulse sound source. It is efficient to decide. There are various ways of setting up the pulse.
When standing somewhere in the location, all combinations (8 × 8 × 8 × 8 ways) are made so that 32 locations are divided into four and one pulse is determined as one of eight assigned locations. ), And a method of searching for all combinations that select four locations from 32 locations.
In addition to the combination of the impulses having the amplitude of 1, a combination of a plurality of pulses, for example, two pulses, or a combination of impulses having different amplitudes may be used.

【００９１】乗算器５５および５６で乗ずる利得は、適
応符号帳５１から出力された適応符号ベクトルとパルス
位置探索器５４から出力されたパルス音源ベクトルとを
用いて音声合成を行って、入力音声との誤差が最小とな
るようにそれぞれのベクトルに対して決定された値であ
る。ここで、適応符号ベクトルに乗ずる利得をピッチゲ
イン、パルス音源ベクトルに乗ずる利得をパルス音源ゲ
インとすると、乗算器５５は、適応符号ベクトルにピッ
チゲインを乗じて、加算器５７に出力する。乗算器５６
は、パルス音源ベクトルにパルス音源ゲインを乗じて、
加算器５７に出力する。The gain to be multiplied by the multipliers 55 and 56 is determined by performing speech synthesis using the adaptive code vector output from the adaptive codebook 51 and the pulse excitation vector output from the pulse position searcher 54, and Is a value determined for each vector such that the error of Here, assuming that the gain multiplied by the adaptive code vector is a pitch gain and the gain multiplied by the pulse excitation vector is a pulse excitation gain, the multiplier 55 multiplies the adaptive code vector by the pitch gain and outputs the result to the adder 57. Multiplier 56
Multiplies the pulse source vector by the pulse source gain,
Output to the adder 57.

【００９２】加算器５７は、乗算器５５から出力された
最適利得乗算後の適応符号ベクトルと、乗算器５６から
出力された最適利得乗算後のパルス音源ベクトルとを加
算して、励振音源ベクトルとして出力する。The adder 57 adds the adaptive code vector after the optimal gain multiplication output from the multiplier 55 and the pulse excitation vector after the optimal gain multiplication output from the multiplier 56 to generate an excitation excitation vector. Output.

【００９３】このように、上記第４の実施の形態によれ
ば、パルスに割り当てられるビット数が少ない場合で
も、音質劣化を少なくできる。As described above, according to the fourth embodiment, even when the number of bits allocated to the pulse is small, deterioration in sound quality can be reduced.

【００９４】（実施の形態５）図１１（ａ）は本発明の
第５の実施の形態を示し、パルス位置の探索位置を適応
符号ベクトルのピッチ周期およびピッチピーク位置によ
って決定する音源生成部のパルス探索位置決定部を示
し、図８における探索範囲算出器５３をさらに細かく示
したものである。図１１（ａ）において、６１は、ピッ
チ周期Ｌを入力として、パルス探索位置決定器６２にパ
ルス探索位置パターンを出力する、パルス探索位置パタ
ーン選択器であり、６２は、パルス探索位置パターン選
択器６１からパルス探索位置パターンを、ピッチピーク
位置算出器５２からピッチピーク位置を、それぞれ入力
し、探索範囲（パルス探索位置）をパルス位置探索器５
４に出力する、パルス探索位置決定器である。(Embodiment 5) FIG. 11A shows a fifth embodiment of the present invention, in which a search position for a pulse position is determined by a pitch period and a pitch peak position of an adaptive code vector. FIG. 10 shows a pulse search position determination unit, and shows the search range calculator 53 in FIG. 8 in more detail. In FIG. 11A, reference numeral 61 denotes a pulse search position pattern selector which outputs a pulse search position pattern to a pulse search position determiner 62 with a pitch period L as an input, and 62 denotes a pulse search position pattern selector. The pulse search position pattern is inputted from the pulse search position pattern 61 and the pitch peak position from the pitch peak position calculator 52.
4 is a pulse search position determiner.

【００９５】以上のように構成された、音源生成部の探
索範囲算出器５３の動作について、図１１（ａ）、
（ｂ）、（ｃ）を用いて説明する。パルス探索位置パタ
ーン選択器６１は、複数種類のパルス探索位置パターン
をあらかじめ持っており（このパルス探索位置パターン
は、パルス探索を行うサンプル点の位置の集合から成
り、ピッチピーク位置を０とする相対位置でサンプル点
を表現している）、ピッチ分析によって得られたピッチ
周期Ｌを用いて、どのパルス探索位置パターンを使用す
るかを決定し、パルス探索位置パターンをパルス探索位
置決定器６２に出力する。The operation of the search range calculator 53 of the sound source generator configured as described above will be described with reference to FIG.
This will be described with reference to (b) and (c). The pulse search position pattern selector 61 has a plurality of types of pulse search position patterns in advance (the pulse search position pattern is composed of a set of sample point positions for performing pulse search, and is a relative position where the pitch peak position is 0). Using the pitch period L obtained by pitch analysis, which pulse search position pattern is to be used is determined, and the pulse search position pattern is output to the pulse search position determiner 62. I do.

【００９６】図１１（ｂ）、（ｃ）は、パルス探索位置
パターン選択器６１が、予め持っているパルス探索位置
パターンの一例を示したものである。図中の目盛りはサ
ンプル点の位置を示しており、矢印がつけられたサンプ
ル点がパルス探索位置である（矢印が付いていない部分
は探索しない）。目盛りの数値は適応符号ベクトルから
求められるピッチピーク位置を０とした相対位置を表す
数値である。また、図１１（ｂ）、（ｃ）では、１サブ
フレーム８０サンプルの場合を示している。図１１
（ｂ）ではピッチ周期Ｌが長い（たとえば４５サンプル
以上）の場合の探索位置パターンを示しており、図１１
（ｃ）ではピッチ周期Ｌが短い（たとえば４４サンプル
未満）の場合の探索位置パターンを示している。ピッチ
周期Ｌが短い場合はサブフレーム全体の探索をしないこ
ととなるが、ピッチ周期化処理を行うことによって、サ
ブフレーム全体にパルスを立てることが可能となる。ピ
ッチ周期化は、下記式（１）を用いることによって容易
に行うことができる（ITU-T STUDY GROUP15 - CONTRIBU
TION 152, "G.729-CODING OF SPEECH AT 8 KBIT/S USIN
G CONJUGATE-STRUCTURE ALGEBRAIC-CODE-EXCITED LINEA
R-PREDICTION(CS-ACELP)", COM 15-152-E July 199
5）。ｃｏｄｅ（ｉ）＝ｃｏｄｅ（ｉ）＋β×ｃｏｄｅ（ｉ−Ｌ）．．．（１）式（１）において、ｃｏｄｅ（）はパルス音源ベクトル
を表し、ｉはサンプル番号（図１１の例では、０〜７
９）を表す。また、βは周期化の強さを示す利得値で、
周期性が強い場合は大きく周期性が弱い場合は小さくす
る（一般的には０〜１．０の値を用いる）。図１１
（ｃ）では（−４）〜４８サンプルの範囲（５３サンプ
ルの範囲）でパルス探索を行うこととなる。このため、
ピッチ周期Ｌが５３（または５４）未満の場合に図１１
（ｃ）の探索範囲パターンを用いることも可能である。
しかし、ピッチ周期Ｌが４５サンプル程度未満の場合に
することによって、２つのピッチピーク位置を探索範囲
内に含むことができ、１周期目のピッチパルス波形と２
周期目のピッチパルス波形が変化する場合や、求められ
たピッチピーク位置が実際のピッチピーク位置よりも１
周期前の位置として誤検出された場合に対応することが
できる。FIGS. 11B and 11C show an example of a pulse search position pattern that the pulse search position pattern selector 61 has in advance. The scale in the figure indicates the position of the sample point, and the sample point with an arrow is the pulse search position (the part without the arrow is not searched). The scale value is a numerical value representing a relative position with the pitch peak position obtained from the adaptive code vector being 0. FIGS. 11B and 11C show the case of 80 samples per subframe. FIG.
FIG. 11B shows a search position pattern when the pitch period L is long (for example, 45 samples or more).
(C) shows a search position pattern when the pitch period L is short (for example, less than 44 samples). When the pitch period L is short, the search of the entire subframe is not performed. However, by performing the pitch period process, it is possible to raise a pulse in the entire subframe. Pitch periodicization can be easily performed by using the following equation (1) (ITU-T STUDY GROUP15-CONTRIBU
TION 152, "G.729-CODING OF SPEECH AT 8 KBIT / S USIN
G CONJUGATE-STRUCTURE ALGEBRAIC-CODE-EXCITED LINEA
R-PREDICTION (CS-ACELP) ", COM 15-152-E July 199
Five). code (i) = code (i) + β × code (i-L). . . (1) In equation (1), code () represents a pulse excitation vector, and i is a sample number (in the example of FIG. 11, 0 to 7).
9). Β is a gain value indicating the strength of the period,
When the periodicity is strong, the value is large, and when the periodicity is weak, the value is small (generally, a value of 0 to 1.0 is used). FIG.
In (c), a pulse search is performed in the range of (-4) to 48 samples (range of 53 samples). For this reason,
When the pitch period L is less than 53 (or 54), FIG.
It is also possible to use the search range pattern of (c).
However, by setting the pitch period L to be less than about 45 samples, two pitch peak positions can be included in the search range, and the pitch pulse waveform of the first cycle and 2
When the pitch pulse waveform in the cycle changes, or when the calculated pitch peak position is one
It is possible to cope with a case where the position is erroneously detected as the position before the cycle.

【００９７】パルス探索位置決定器６２は、パルス探索
位置パターン選択器から出力されたパルス探索位置パタ
ーンを用いて現サブフレームにおけるパルス探索位置を
決定し、パルス位置探索器５４に出力する。パルス探索
位置パターン選択器６２から出力されるパルス探索位置
パターンは、ピッチピーク位置を０とする相対位置で表
現されているため、そのままではパルス探索に用いるこ
とができない。このため、サブフレームの先頭を０とす
る絶対位置に変換してパルス位置探索器５４に出力す
る。The pulse search position determiner 62 determines the pulse search position in the current subframe using the pulse search position pattern output from the pulse search position pattern selector, and outputs the pulse search position to the pulse position searcher 54. Since the pulse search position pattern output from the pulse search position pattern selector 62 is expressed as a relative position where the pitch peak position is 0, it cannot be used for pulse search as it is. For this reason, the sub-frame is converted into an absolute position where the head of the sub-frame is 0 and output to the pulse position searcher 54.

【００９８】（実施の形態６）図１２は本発明の第６の
実施の形態を示し、パルス位置の探索位置を適応符号ベ
クトルのピッチ周期およびピッチピーク位置によって決
定するとともに、パルス音源に使用するパルス数を切り
替える構成を有する音声符号化装置の音源生成部を示
す。図１２において、７１は、適応符号ベクトルをピッ
チピーク位置算出器７２と乗算器７６に出力する、適応
符号帳、７２は、ピッチ分析あるいは適応符号帳探索に
よって外部で求められたピッチ周期Ｌと適応符号帳から
出力された適応符号ベクトルを入力とし、ピッチピーク
位置を探索位置算出器７４に出力する、ピッチピーク位
置算出器、７３はピッチ分析あるいは適応符号帳探索に
よって外部で求められたピッチ周期Ｌを入力として、パ
ルス数を探索位置算出器７４に出力するパルス数決定
器、７４はピッチ分析あるいは適応符号帳探索によって
外部で求められたピッチ周期Ｌとパルス数決定器７３か
ら出力されたパルス数とピッチピーク位置算出器７２か
ら出力されたピッチピーク位置を入力とし、パルスの探
索位置をパルス位置探索器７５に出力する探索位置算出
器、７５はピッチ分析あるいは適応符号帳探索によって
外部で求められたピッチ周期Ｌと探索位置算出器７４か
ら出力されたパルス探索位置を入力とし、パルス音源に
用いるパルスを立てる位置の組み合わせを決定してその
組み合わせによって生成されるパルス音源ベクトルを乗
算器７７に出力するパルス位置探索器、７６は、適応符
号帳から出力された適応符号ベクトルを入力とし、適応
符号ベクトル利得を乗じて加算器７８に出力する乗算
器、７７は、パルス位置探索器から出力されたパルス音
源ベクトルを入力とし、パルス音源ベクトル利得を乗じ
て加算器７８に出力する乗算器、７８は乗算器７６およ
び７７から出力されたベクトルを入力とし、ベクトル加
算をおこなって音源ベクトルとして出力する加算器であ
る。(Embodiment 6) FIG. 12 shows a sixth embodiment of the present invention, in which a search position of a pulse position is determined by a pitch period and a pitch peak position of an adaptive code vector and used for a pulse sound source. 3 shows a sound source generation unit of a speech encoding device having a configuration for switching the number of pulses. In FIG. 12, reference numeral 71 denotes an adaptive code vector which outputs an adaptive code vector to a pitch peak position calculator 72 and a multiplier 76. The adaptive code book 72 includes an adaptive code book and a pitch period L externally obtained by pitch analysis or adaptive code book search. The adaptive code vector output from the codebook is input, and the pitch peak position is output to a search position calculator 74. The pitch peak position calculator 73 has a pitch period L externally obtained by pitch analysis or adaptive codebook search. , And outputs the number of pulses to the search position calculator 74. The number of pulses 74 is the pitch period L obtained externally by pitch analysis or adaptive codebook search and the number of pulses output from the number of pulses determiner 73. And the pitch peak position output from the pitch peak position calculator 72 as an input, and a pulse search position is determined as a pulse position search. A search position calculator 75 outputs a pulse period L externally obtained by pitch analysis or adaptive codebook search and a pulse search position output from the search position calculator 74, and outputs a pulse used as a pulse source. A pulse position searcher 76 that determines a combination of standing positions and outputs a pulse excitation vector generated by the combination to a multiplier 77. The pulse position searcher 76 receives the adaptive code vector output from the adaptive codebook as an input, The multiplier 77 receives the pulse excitation vector output from the pulse position searcher as an input, multiplies the pulse excitation vector gain and outputs the result to the adder 78, and 78 is a multiplier. Inputs the vectors output from 76 and 77, performs vector addition, and outputs as a sound source vector It is an adder.

【００９９】以上のように構成されたＣＥＬＰ型音声符
号化装置の音源生成部の動作について、図１２を参照し
ながら説明する。適応符号帳７１から出力された適応符
号ベクトルは乗算器７６に出力され、適応符号ベクトル
利得が乗算されて加算器７８に出力される。ピッチピー
ク位置算出器７２は適応符号ベクトルからピッチピーク
を検出し、その位置を探索位置算出器７４に出力する。
ピッチピーク位置の検出（算出）は、ピッチ周期Ｌで並
べたインパルス列ベクトルと適応符号ベクトルの内積を
最大化することによって行うことができる。また、ピッ
チ周期Ｌで並べたインパルス列ベクトルに合成フィルタ
のインパルス応答を畳み込んだベクトルと適応符号ベク
トルに合成フィルタのインパルス応答を畳み込んだベク
トルの内積を最大化することによって、より精度良くピ
ッチピーク位置の検出を行うことも可能である。The operation of the excitation generating section of the CELP-type speech coding apparatus configured as described above will be described with reference to FIG. The adaptive code vector output from adaptive codebook 71 is output to multiplier 76, multiplied by the adaptive code vector gain, and output to adder 78. The pitch peak position calculator 72 detects a pitch peak from the adaptive code vector and outputs the position to the search position calculator 74.
The detection (calculation) of the pitch peak position can be performed by maximizing the inner product of the impulse train vector arranged in the pitch period L and the adaptive code vector. In addition, by maximizing the inner product of the vector obtained by convolving the impulse response of the synthesis filter with the impulse train vector arranged in the pitch period L and the vector obtained by convolving the impulse response of the synthesis filter with the adaptive code vector, the pitch can be more accurately determined. It is also possible to detect a peak position.

【０１００】パルス数決定器７３はピッチ周期Ｌの値に
基づいて、パルス音源に使用するパルスの本数を決定し
て、探索位置算出器７４に出力する。パルス数とピッチ
周期の関係は予め学習的あるいは統計的に定められてお
り、たとえばピッチ周期が４５サンプル以下の場合は５
本、４５サンプルを超えて８０サンプル未満の場合は４
本、８０サンプル以上の場合は３本、というようにピッ
チ周期の値の範囲によってそれぞれのパルス本数が定め
られている。ピッチ周期が短い場合はピッチ周期化処理
を用いることによってパルス探索範囲を１〜２ピッチ周
期に限定できるので、位置情報を減らす代わりにパルス
数を増やすことができる。また、波形的にもピッチ周期
が短い女声とピッチ周期の長い男声では、波形の特徴が
異なり、それぞれに適したパルス数が存在する。一般的
には、男声の方がパルス性が強いためパルス数よりもパ
ルス位置が重要となる傾向があり、女声ではパルス性が
弱い為パルス数を増やしてパワーの集中を避けた方が良
くなる傾向がある。これらのことから、ピッチ周期が長
い場合はパルス数を少なく、ピッチ周期が短い場合はあ
る程度パルス数を多くすることが有効となる。さらに、
連続するサブフレーム間のパルス本数の変化やピッチ周
期Ｌの変化などを考慮に入れてパルス数を決定すると、
連続するサブフレーム間の不連続性緩和や有声部の立ち
上がり部の品質向上を図ることができる。具体的には、
連続するサブフレームで、ピッチ周期Ｌから決定された
パルス数が５本から３本に減少したときは、パルス数の
減少にヒステリシスを持たせて、５本から急に３本に減
らすのではなく４本にすることによってサブフレーム間
でパルス数が大きく変化することを避けるようにする、
あるいは、連続するサブフレーム間でピッチ周期Ｌが大
きく異なる場合は、有声部の立ち上がりである可能性が
大きいので、パルス数を減らしてパルス位置の精度を向
上させた方が音声品質が向上するため、前サブフレーム
のピッチ周期Ｌと現サブフレームのピッチ周期Ｌが大き
く異なる場合は現サブフレームのピッチ周期Ｌの値に関
わらずパルス数を３本とする、等という手法によってパ
ルス数の決定を行うとより音声品質の向上を図ることが
可能である。なお、これらの手法を用いる場合には、ピ
ッチ分析における倍ピッチ誤りや半ピッチ誤り等の影響
を受けやすくなるので、これらの影響を緩和するパルス
数決定法（たとえば、半ピッチや倍ピッチの可能性を考
慮に入れてピッチ周期の連続性を判定するなど）を取り
入れたり、ピッチ分析の精度をでき得る限り上げると、
より効果的である。The number-of-pulses determiner 73 determines the number of pulses used for the pulse sound source based on the value of the pitch period L, and outputs it to the search position calculator 74. The relationship between the number of pulses and the pitch cycle is predetermined by learning or statistically. For example, when the pitch cycle is 45 samples or less, 5 is used.
Book, 4 if more than 45 samples and less than 80 samples
The number of each pulse is determined by the range of the pitch period value, such as three, for 80 samples or more. When the pitch period is short, the pulse search range can be limited to 1 to 2 pitch periods by using the pitch period processing, so that the number of pulses can be increased instead of reducing the position information. In terms of waveforms, a female voice having a short pitch cycle and a male voice having a long pitch cycle have different waveform characteristics, and the number of pulses suitable for each exists. In general, the pulse position tends to be more important than the number of pulses because a male voice has a stronger pulse, and it is better to increase the number of pulses and avoid power concentration in a female voice because the pulse is weak. Tend. From these facts, it is effective to reduce the number of pulses when the pitch period is long, and to increase the number of pulses to some extent when the pitch period is short. further,
When the number of pulses is determined in consideration of changes in the number of pulses between successive subframes, changes in the pitch period L, and the like,
It is possible to reduce the discontinuity between consecutive subframes and improve the quality of the rising part of the voiced part. In particular,
When the number of pulses determined from the pitch period L is reduced from 5 to 3 in consecutive subframes, instead of providing a hysteresis to the reduction in the number of pulses and reducing the number of pulses from 5 to 3 suddenly, By making the number of pulses four, it is possible to avoid a large change in the number of pulses between subframes.
Alternatively, when the pitch period L is greatly different between consecutive subframes, there is a high possibility that a voiced part is rising, so that the voice quality is improved by reducing the number of pulses and improving the accuracy of the pulse position. When the pitch period L of the previous sub-frame is greatly different from the pitch period L of the current sub-frame, the number of pulses is determined by a method of setting the number of pulses to 3 regardless of the value of the pitch period L of the current sub-frame. By doing so, it is possible to further improve the voice quality. When these methods are used, the influence of double pitch errors and half pitch errors in pitch analysis becomes liable to occur. Therefore, a pulse number determination method (for example, half pitch or double pitch To determine the continuity of the pitch cycle taking into account the characteristics of the pitch), or to increase the accuracy of pitch analysis as much as possible.
More effective.

【０１０１】探索位置算出器７４は、ピッチピーク位置
とパルス本数をもとにして、パルス探索を行う位置を決
定する。パルスの探索位置はピッチピーク付近は密に、
それ以外の部分は疎になるように配分される（全てのサ
ンプル点を探索するだけの十分なビット配分がないとき
に有効である）。すなわち、ピッチピーク位置近傍は全
てのサンプル点がパルス位置探索の対象となるが、ピッ
チピーク位置から離れた部分は2サンプル毎や３サンプ
ル毎というようにパルス位置探索の間隔を広くする（た
とえば、図１１（ｂ）、（ｃ）のように探索位置を決定
する）。また、パルス数が多いときは1本あたりのパル
スに配分されるビット数は少なくなるため、疎になる部
分の間隔がパルス数が少ない場合より広くなる（パルス
位置の精度が荒くなる）。なお、ピッチ周期が短い場合
は、実施の形態５に示したように、サブフレーム内の最
初のピッチピークから１ピッチ周期強の範囲のみに探索
範囲を限定すると、より音声品質を向上することが可能
である。The search position calculator 74 determines the position where the pulse search is to be performed based on the pitch peak position and the number of pulses. The search position of the pulse is dense near the pitch peak,
The other parts are sparsely allocated (effective when there is not enough bit allocation to search all sample points). That is, all sample points near the pitch peak position are subjected to the pulse position search, but the portions far from the pitch peak position are widened at intervals of the pulse position search such as every two samples or every three samples (for example, The search position is determined as in FIGS. 11B and 11C). Also, when the number of pulses is large, the number of bits allocated to one pulse is small, so that the interval between sparse portions is wider than when the number of pulses is small (the accuracy of the pulse position is rough). When the pitch period is short, as described in Embodiment 5, if the search range is limited to only a range slightly more than one pitch period from the first pitch peak in the subframe, the voice quality can be further improved. It is possible.

【０１０２】パルス位置探索器７５は、探索位置算出器
７４で決定された探索位置に基づいてパルスを立てる位
置の最適な組み合わせを決定する。パルス探索の方法は
「 ITU-T STUDY GROUP15 - CONTRIBUTION 152, "G.729-
CODING OF SPEECH AT 8 KBIT/S USING CONJUGATE-STRUC
TURE ALGEBRAIC-CODE-EXCITED LINEAR-PREDICTION (CS-
ACELP)", COM 15-152-E July 1995 」に示されているよ
うに、たとえばパルス数が４本の場合は式（２）を最大
化するようにｉ０からｉ３の組み合わせを決定する。（ＤＮ×ＤＮ）／ＲＲＤＮ＝ｄｎ（ｉ０）＋ｄｎ（ｉ１）＋ｄｎ（ｉ２）＋ｄｎ（ｉ３）ＲＲ＝ｒｒ（ｉ０，ｉ０）＋ｒｒ（ｉ１，ｉ１）＋２× ｒｒ（ｉ０，ｉ１）＋ｒｒ（ｉ２，ｉ２）＋２×（ｒｒ（ｉ０，ｉ２）＋ｒｒ（ｉ１，ｉ２））＋ｒｒ（ｉ３，ｉ３）＋２×（ｒｒ（ｉ０，ｉ３）＋ｒｒ（ｉ１，ｉ３）＋ｒｒ（ｉ２，ｉ３））．．．（２）ここで、ｄｎ（ｉ）（ｉ＝０〜７９：サブフレーム長８
０サンプルの場合）はパルス音源成分のターゲットベク
トルｘ‘（ｉ）を合成フィルタのインパルス応答でバッ
クワードフィルタリングしたもので、ｒｒ（ｉ，ｉ）
は、式（３）のようにインパルス応答の自己相関行列で
ある。また、ｉ０、ｉ１、ｉ２、ｉ３が取り得る位置の
範囲は探索位置算出器７４で求められたものである。具
体的にはパルス数が４本の場合、図１３（ａ）〜（ｄ）
のようになる（図中矢印をつけた部分が取り得る位置、
なお、目盛りの数値はピッチピーク位置を０とした相対
値である）。The pulse position search unit 75 determines an optimum combination of positions where pulses are to be formed based on the search position determined by the search position calculator 74. See “ITU-T STUDY GROUP15-CONTRIBUTION 152,” G.729-
CODING OF SPEECH AT 8 KBIT / S USING CONJUGATE-STRUC
TURE ALGEBRAIC-CODE-EXCITED LINEAR-PREDICTION (CS-
ACELP) ", COM 15-152-E July 1995", for example, when the number of pulses is four, the combination of i0 to i3 is determined so as to maximize Expression (2). (DN × DN) / RR DN = dn (i0) + dn (i1) + dn (i2) + dn (i3) RR = rr (i0, i0) + rr (i1, i1) + 2 × rr (i0, i1) + rr (I2, i2) + 2 × (rr (i0, i2) + rr (i1, i2)) + rr (i3, i3) + 2 × (rr (i0, i3) + rr (i1, i3) + rr (i2) i3)). . . (2) Here, dn (i) (i = 0 to 79: subframe length 8)
In the case of 0 sample), the target vector x ′ (i) of the pulse sound source component is subjected to backward filtering with the impulse response of the synthesis filter, and rr (i, i)
Is the autocorrelation matrix of the impulse response as in equation (3). The range of positions that i0, i1, i2, and i3 can take has been obtained by the search position calculator 74. Specifically, when the number of pulses is 4, FIGS. 13A to 13D
(Positions where the parts marked with arrows in the figure can be taken,
The scale value is a relative value when the pitch peak position is 0).

【数１】．．．（３）(Equation 1) . . . (3)

【０１０３】パルス位置探索器７５によって、最適パル
ス位置の組み合わせた決定されると、その組み合わせに
よって生成されるパルス音源ベクトルが乗算器７７に出
力され、パルス符号ベクトル利得が乗算され、加算器７
８に出力される。When the combination of the optimum pulse positions is determined by the pulse position search unit 75, the pulse excitation vector generated by the combination is output to the multiplier 77, multiplied by the pulse code vector gain, and
8 is output.

【０１０４】加算器７８は適応符号ベクトル成分とパル
ス音源ベクトル成分の加算を行い、励振音源ベクトルと
して出力する。An adder 78 adds the adaptive code vector component and the pulse excitation vector component, and outputs the result as an excitation excitation vector.

【０１０５】（実施の形態７）図１４は、本発明の第７
の発明の実施の形態を示し、パルス探索前にパルスの振
幅を決定する構成を有する、ＣＥＬＰ型音声符号化装置
の音源生成部を示している。図１４において、８１は過
去の励振音源信号のバッファから構成され、適応符号ベ
クトルをピッチピーク位置算出器８２と乗算器８８に出
力する適応符号帳、８２はピッチ分析あるいは適応符号
帳探索によって外部で求められたピッチ周期Ｌと適応符
号帳８１から出力された適応符号ベクトルを入力とし、
ピッチピーク位置を探索位置算出器８４とパルス振幅算
出器８７に出力するピッチピーク位置算出器、８３はピ
ッチ分析あるいは適応符号帳探索によって外部で求めら
れたピッチ周期Ｌを入力として、パルス数を探索位置算
出器８４に出力するパルス数決定器、８４はピッチ分析
あるいは適応符号帳探索によって外部で求められたピッ
チ周期Ｌとパルス数決定器８３から出力されたパルス数
とピッチピーク位置算出器８２から出力されたピッチピ
ーク位置を入力とし、パルスの探索位置をパルス位置探
索器８５に出力する探索位置算出器、８５はピッチ分析
あるいは適応符号帳探索によって外部で求められたピッ
チ周期Ｌと探索位置算出器８４から出力されたパルス探
索位置とパルス振幅算出器８７から出力されたパルス振
幅を入力とし、パルス音源に用いるパルスを立てる位置
の組み合わせを決定してその組み合わせによって生成さ
れるパルス音源ベクトルを乗算器８９に出力するパルス
位置探索器、８６は外部のＬＰＣ分析およびＬＰＣ量子
化器によって決定された線形予測フィルタによって得ら
れる予測残差信号から、乗算器８８から出力された（利
得乗算後の）適応符号ベクトルを減算し、差分信号をパ
ルス振幅算出器８７に出力する加算器、８７は加算器８
６から出力された差分信号を入力とし、パルス振幅情報
をパルス位置探索器８５に出力するパルス振幅算出器、
８８は適応符号帳８１から出力された適応符号ベクトル
を入力として適応符号ベクトル利得を乗算し、加算器９
０および８６に出力する乗算器、８９は、パルス位置探
索器８５から出力されたパルス音源ベクトルを入力とし
てパルス音源ベクトル利得を乗算し、加算器９０に出力
する乗算器、９０は乗算器８８および８９から出力され
たベクトルの加算をおこない、励振音源ベクトルとして
出力する加算器である。(Embodiment 7) FIG. 14 shows a seventh embodiment of the present invention.
1 shows an embodiment of the present invention, and shows an excitation generation unit of a CELP-type speech coding apparatus having a configuration for determining the amplitude of a pulse before searching for a pulse. In FIG. 14, reference numeral 81 denotes an adaptive codebook which is composed of a buffer of a past excitation source signal and outputs an adaptive code vector to a pitch peak position calculator 82 and a multiplier 88. Reference numeral 82 denotes an external codebook by pitch analysis or adaptive codebook search. The obtained pitch period L and the adaptive code vector output from the adaptive codebook 81 are input,
A pitch peak position calculator 83 that outputs the pitch peak position to a search position calculator 84 and a pulse amplitude calculator 87. The number 83 searches for the number of pulses by using a pitch period L externally obtained by pitch analysis or adaptive codebook search. The number-of-pulses determiner 84 which outputs to the position calculator 84 is composed of a pitch period L externally obtained by pitch analysis or adaptive codebook search, the number of pulses output from the number-of-pulses determiner 83, and the pitch-peak position calculator 82. A search position calculator that receives the output pitch peak position as an input and outputs a pulse search position to a pulse position searcher 85. The search position calculator 85 calculates a pitch period L and a search position externally obtained by pitch analysis or adaptive codebook search. The pulse search position output from the calculator 84 and the pulse amplitude output from the pulse amplitude calculator 87 are input, and the The pulse position searcher 86 determines the combination of the positions where the pulses to be used for the sound sources are formed, and outputs the pulse excitation vector generated by the combination to the multiplier 89. The pulse position searcher 86 is determined by an external LPC analysis and LPC quantizer. An adder that subtracts the adaptive code vector (after gain multiplication) output from the multiplier 88 from the prediction residual signal obtained by the linear prediction filter and outputs the difference signal to the pulse amplitude calculator 87. 8
6, a pulse amplitude calculator that receives the difference signal output from 6 and outputs pulse amplitude information to a pulse position searcher 85;
An adder 9 multiplies an adaptive code vector gain by an adaptive code vector output from the adaptive code book 81 as an input.
A multiplier 89 that outputs to 0 and 86, a multiplier that multiplies the pulse excitation vector gain output from the pulse position searcher 85 as an input and outputs the result to an adder 90, and 90 is a multiplier 88 and This is an adder that adds the vectors output from 89 and outputs the result as an excitation sound source vector.

【０１０６】以上のように構成されたＣＥＬＰ型音声符
号化装置の音源生成部について、図１４を用いてその動
作を説明する。適応符号帳８１から出力された適応符号
ベクトルは乗算器８８に出力され、適応符号ベクトル利
得が乗算されて加算器９０および８６に出力される。The operation of the excitation generator of the CELP-type speech coding apparatus configured as described above will be described with reference to FIG. The adaptive code vector output from adaptive codebook 81 is output to multiplier 88, multiplied by the adaptive code vector gain, and output to adders 90 and 86.

【０１０７】ピッチピーク位置算出器８２は適応符号ベ
クトルからピッチピークを検出し、その位置を探索位置
算出器８４およびパルス振幅算出器８７に出力する。ピ
ッチピーク位置の検出（算出）は、ピッチ周期Ｌで並べ
たインパルス列ベクトルと適応符号ベクトルの内積を最
大化することによって行うことができる。また、ピッチ
周期Ｌで並べたインパルス列ベクトルに合成フィルタの
インパルス応答を畳み込んだベクトルと適応符号ベクト
ルに合成フィルタのインパルス応答を畳み込んだベクト
ルの内積を最大化することによって、より精度良くピッ
チピーク位置の検出を行うことも可能である。The pitch peak position calculator 82 detects a pitch peak from the adaptive code vector, and outputs the position to the search position calculator 84 and the pulse amplitude calculator 87. The detection (calculation) of the pitch peak position can be performed by maximizing the inner product of the impulse train vector arranged in the pitch period L and the adaptive code vector. In addition, by maximizing the inner product of the vector obtained by convolving the impulse response of the synthesis filter with the impulse train vector arranged in the pitch period L and the vector obtained by convolving the impulse response of the synthesis filter with the adaptive code vector, the pitch can be more accurately determined. It is also possible to detect a peak position.

【０１０８】パルス数決定器８３は、ピッチ周期Ｌの値
に基づいて、パルス音源に使用するパルスの本数を決定
して、探索位置算出器８４に出力する。パルス数とピッ
チ周期の関係は予め学習的あるいは統計的に定められて
おり、たとえばピッチ周期が４５サンプル以下の場合は
５本、４５サンプルを超えて８０サンプル未満の場合は
４本、８０サンプル以上の場合は３本、というようにピ
ッチ周期の値の範囲によってそれぞれのパルス本数が定
められている。さらに、連続するサブフレーム間のパル
ス本数の変化やピッチ周期Ｌの変化などを考慮に入れて
パルス数を決定すると、連続するサブフレーム間の不連
続性緩和や有声部の立ち上がり部の品質向上を図ること
ができる。具体的には、連続するサブフレームで、ピッ
チ周期Ｌから決定されたパルス数が５本から３本に減少
したときは、パルス数の減少にヒステリシスを持たせ
て、５本から急に３本に減らすのではなく４本にするこ
とによってサブフレーム間でパルス数が大きく変化する
ことを避けるようにする、あるいは、連続するサブフレ
ーム間でピッチ周期Ｌが大きく異なる場合は、有声部の
立ち上がりである可能性が大きいので、パルス数を減ら
してパルス位置の精度を向上させた方が音声品質が向上
するため、前サブフレームのピッチ周期Ｌと現サブフレ
ームのピッチ周期Ｌが大きく異なる場合は現サブフレー
ムのピッチ周期Ｌの値に関わらずパルス数を３本とす
る、等という手法によってパルス数の決定を行うとより
音声品質の向上を図ることが可能である。なお、これら
の手法を用いる場合には、ピッチ分析における倍ピッチ
誤りや半ピッチ誤り等の影響を受けやすくなるので、こ
れらの影響を緩和するパルス数決定法（たとえば、半ピ
ッチや倍ピッチの可能性を考慮に入れてピッチ周期の連
続性を判定するなど）を取り入れたり、ピッチ分析の精
度をでき得る限り上げると、より効果的である。The number-of-pulses determiner 83 determines the number of pulses used for the pulse sound source based on the value of the pitch period L, and outputs it to the search position calculator 84. The relationship between the number of pulses and the pitch period is predetermined by learning or statistically. For example, when the pitch period is 45 samples or less, 5 lines, when the pitch period is more than 45 samples and less than 80 samples, 4 lines, and 80 samples or more. In the case of (3), the number of each pulse is determined by the range of the pitch period value, such as three pulses. Furthermore, when the number of pulses is determined in consideration of a change in the number of pulses between consecutive subframes, a change in the pitch period L, and the like, the discontinuity between consecutive subframes and the quality of the rising portion of the voiced portion can be improved. Can be planned. Specifically, in consecutive subframes, when the number of pulses determined from the pitch period L decreases from five to three, the decrease in the number of pulses is given a hysteresis, and three pulses suddenly change from five. By avoiding a large change in the number of pulses between subframes by reducing the number of pulses to four instead of reducing the number to four, or when the pitch period L differs greatly between consecutive subframes, Since there is a high possibility that the accuracy of the pulse position is improved by reducing the number of pulses, the voice quality is improved. Therefore, when the pitch period L of the previous subframe and the pitch period L of the current subframe are significantly different, the current If the number of pulses is determined by a method such as setting the number of pulses to three irrespective of the value of the pitch period L of the subframe, voice quality can be further improved. A. When these methods are used, the influence of double pitch errors and half pitch errors in pitch analysis becomes liable to occur. Therefore, a pulse number determination method (for example, half pitch or double pitch For example, it is more effective to adopt a method of determining the continuity of the pitch period in consideration of the characteristics, and to increase the accuracy of pitch analysis as much as possible.

【０１０９】探索位置算出器８４は、ピッチピーク位置
とパルス本数をもとにして、パルス探索を行う位置を決
定する。パルスの探索位置はピッチピーク付近は密に、
それ以外の部分は疎になるように配分される（全てのサ
ンプル点を探索するだけの十分なビット配分がないとき
に有効である）。すなわち、ピッチピーク位置近傍は全
てのサンプル点がパルス位置探索の対象となるが、ピッ
チピーク位置から離れた部分は２サンプル毎や３サンプ
ル毎というようにパルス位置探索の間隔を広くする（た
とえば、図１１（ｂ）、（ｃ）のように探索位置を決定
する）。また、パルス数が多いときは１本あたりのパル
スに配分されるビット数は少なくなるため、疎になる部
分の間隔がパルス数が少ない場合より広くなる（パルス
位置の精度が荒くなる）。なお、ピッチ周期が短い場合
は実施の形態５に示したように、サブフレーム内の最初
のピッチピークから１ピッチ周期強の範囲のみに探索範
囲を限定すると、より音声品質を向上することが可能で
ある。The search position calculator 84 determines the position where the pulse search is to be performed based on the pitch peak position and the number of pulses. The search position of the pulse is dense near the pitch peak,
The other parts are sparsely allocated (effective when there is not enough bit allocation to search all sample points). That is, all the sample points near the pitch peak position are subjected to the pulse position search, but the interval apart from the pitch peak position is widened every two samples or every three samples (for example, The search position is determined as in FIGS. 11B and 11C). Further, when the number of pulses is large, the number of bits allocated to one pulse is small, so that the interval between sparse portions is wider than when the number of pulses is small (the accuracy of the pulse position is rough). When the pitch period is short, as described in Embodiment 5, if the search range is limited to only a range slightly more than one pitch period from the first pitch peak in the subframe, voice quality can be further improved. It is.

【０１１０】パルス位置探索器８５は、探索位置算出器
８４で決定された探索位置と後述のパルス振幅算出器８
７で決定されたパルス振幅情報に基づいてパルスを立て
る位置の最適な組み合わせを決定する。パルス探索の方
法は「 ITU-T STUDY GROUP15- CONTRIBUTION 152,"G.72
9-CODING OF SPEECH AT 8 KBIT/S USING CONJUGATE-STR
UCTURE ALGEBRAIC-CODE-EXCITED LINEAR-PREDICTION (C
S-ACELP)", COM 15-152-E July 1995」に示されている
ように、たとえばパルス数が４本の場合は、式（４）を
最大化するようにｉ０からｉ３の組み合わせを決定す
る。ＤＮ×ＤＮ／ＲＲＤＮ＝a0×ｄｎ（ｉ０）＋a1×ｄｎ（ｉ１）＋a2×ｄｎ（ｉ２）＋a3×ｄｎ（ｉ３）ＲＲ＝ a0 ×a0×ｒｒ（ｉ０，ｉ０）＋ a1×a1×ｒｒ（ｉ１，ｉ１）＋２×a0×a1×ｒｒ（ｉ０，ｉ１）＋ a2×a2×ｒｒ（ｉ２，ｉ２）＋２×（ a0 ×a2×ｒｒ（ｉ０，ｉ２）＋ a1 ×a2×ｒｒ（ｉ１，ｉ２））＋ a3 ×a3×ｒｒ（ｉ３，ｉ３）＋２×（a0×a3×ｒｒ（ｉ０，ｉ３）＋ a1 ×a3×ｒｒ（ｉ１，ｉ３）＋ a2 ×a3×ｒｒ（ｉ２，ｉ３））．．．（４）ここで、ｄｎ（ｉ）（ｉ＝０〜７９：サブフレーム長８
０サンプルの場合）はパルス音源成分のターゲットベク
トルに合成フィルタのインパルス応答を畳み込んだもの
で、ｒｒ（ｉ，ｉ）は式（３）のようにインパルス応答
の自己相関行列である。また、ｉ０、ｉ１、ｉ２、ｉ３
が取り得る位置の範囲は探索位置算出器８４で求められ
たものである。具体的にはパルス数が４本の場合、図１
３（ａ）〜（ｄ）のようになる（図中矢印をつけた部分
が取りうる位置、なお、目盛りの数値はピッチピーク位
置を０とした相対値である）。また、a0、a1、a2、a3は
パルス振幅算出器８７で求められたパルス振幅である。The pulse position search unit 85 is connected to the search position determined by the search position calculator 84 and a pulse amplitude calculator 8 described later.
Based on the pulse amplitude information determined in step 7, an optimal combination of positions where pulses are to be set is determined. See “ITU-T STUDY GROUP15-CONTRIBUTION 152,” G.72
9-CODING OF SPEECH AT 8 KBIT / S USING CONJUGATE-STR
UCTURE ALGEBRAIC-CODE-EXCITED LINEAR-PREDICTION (C
S-ACELP) ", COM 15-152-E July 1995", for example, when the number of pulses is 4, the combination of i0 to i3 is determined so as to maximize the equation (4). I do. DN × DN / RR DN = a0 × dn (i0) + a1 × dn (i1) + a2 × dn (i2) + a3 × dn (i3) RR = a0 × a0 × rr (i0, i0) + a1 × a1 × rr ( i1, i1) + 2 × a0 × a1 × rr (i0, i1) + a2 × a2 × rr (i2, i2) + 2 × (a0 × a2 × rr (i0, i2) + a1 × a2 × rr (i1, i2 )) + A3 * a3 * rr (i3, i3) + 2 * (a0 * a3 * rr (i0, i3) + a1 * a3 * rr (i1, i3) + a2 * a3 * rr (i2, i3)). . . (4) Here, dn (i) (i = 0 to 79: subframe length 8)
In the case of 0 sample), the impulse response of the synthesis filter is convolved with the target vector of the pulse sound source component, and rr (i, i) is the autocorrelation matrix of the impulse response as shown in Expression (3). Also, i0, i1, i2, i3
The range of positions that can be taken by is obtained by the search position calculator 84. Specifically, when the number of pulses is four, FIG.
3 (a) to 3 (d) (possible positions indicated by arrows in the figure, and the scale values are relative values with the pitch peak position being 0). A0, a1, a2, and a3 are pulse amplitudes obtained by the pulse amplitude calculator 87.

【０１１１】パルス位置探索器８５によって、最適パル
ス位置の組み合わせた決定されると、その組み合わせに
よって生成されるパルス音源ベクトルが乗算器８９に出
力され、パルス符号ベクトル利得が乗算され、加算器９
０に出力される。When the combination of the optimum pulse positions is determined by the pulse position searcher 85, the pulse excitation vector generated by the combination is output to the multiplier 89, multiplied by the pulse code vector gain, and
Output to 0.

【０１１２】加算器８６は、外部で行われるＬＰＣ分析
によって得られる線形予測残差信号（予測残差ベクト
ル）から適応符号ベクトル成分（適応符号ベクトルに適
応符号ベクトル利得を乗算したもの）を減算し、差分信
号をパルス振幅算出器８７に出力する。なお、ＣＥＬＰ
型音声符号化装置の音源部においては、一般的には適応
符号ベクトル利得と雑音符号ベクトル（本発明ではパル
ス音源ベクトルに相当）利得は、適応符号帳探索と雑音
符号帳探索（本発明ではパルス位置探索に相当）の双方
が終わった後に決定されるため、適応部号ベクトルに適
応符号ベクトル利得を乗算したベクトルを、パルス位置
探索以前に得ることはできない。このため、加算器８６
で減算に使用する適応符号ベクトル成分は、適応符号帳
探索時に式（５）から求められる適応符号ベクトル利得
（最終的な最適適応符号ベクトル利得ではない）を適応
符号ベクトルに乗算したものである。An adder 86 subtracts an adaptive code vector component (a product obtained by multiplying an adaptive code vector by an adaptive code vector gain) from a linear prediction residual signal (prediction residual vector) obtained by an externally performed LPC analysis. , And outputs the difference signal to the pulse amplitude calculator 87. In addition, CELP
In the sound source section of the adaptive speech coding apparatus, in general, the adaptive code vector gain and the noise code vector (corresponding to the pulse excitation vector in the present invention) gain are adaptive codebook search and noise codebook search (the pulse code search in the present invention). (Corresponding to position search), the vector obtained by multiplying the adaptive signal vector by the adaptive code vector gain cannot be obtained before the pulse position search. Therefore, the adder 86
The adaptive code vector component used in the subtraction is obtained by multiplying the adaptive code vector by the adaptive code vector gain (not the final optimal adaptive code vector gain) obtained from Expression (5) at the time of searching the adaptive code book.

【数２】．．．（５）ここで、ｘ（ｎ）はいわゆるターゲットベクトルで、こ
こでは聴覚重みづけした入力信号から現サブフレームの
ＬＰＣ合成フィルタの零入力応答を除去したものであ
る。また、ｙ（ｎ）は合成音声信号のうち、適応符号ベ
クトルによって生成される成分で、ここでは適応符号ベ
クトルに、現サブフレームのＬＰＣ合成フィルタと聴覚
重みづけフィルタを縦続接続したフィルタのインパルス
応答を、畳み込んだものである。(Equation 2) . . . (5) Here, x (n) is a so-called target vector, which is obtained by removing the zero input response of the LPC synthesis filter of the current subframe from the input signal weighted in the auditory sense. Further, y (n) is a component generated by the adaptive code vector in the synthesized speech signal. Is convoluted.

【０１１３】パルス振幅算出器８７は、ピッチピーク位
置算出器８２によって求められたピッチピーク位置を用
いて、加算器８６から出力された差分信号をピッチピー
ク位置近傍とそれ以外の部分に分割し、それぞれの部分
のパワーの平均値またはそれぞれの部分に含まれる各サ
ンプル点における信号振幅の絶対値の平均値を求め、そ
れぞれの振幅をピッチピーク位置近傍のパルス振幅およ
びそれ以外の部分のパルス振幅としてパルス位置探索器
８５に出力する。パルス位置探索器８５では、ピッチパ
ルス近傍のパルスとそれ以外の部分のパルスとで異なる
振幅を用いて式（４）の評価を行い、パルス位置探索を
行う。パルス位置探索で決定されたパルス位置とその位
置のパルスに割り当てられたパルス振幅によって表現さ
れる、パルス音源ベクトルがパルス位置探索器８５から
出力される。Using the pitch peak position calculated by the pitch peak position calculator 82, the pulse amplitude calculator 87 divides the difference signal output from the adder 86 into the vicinity of the pitch peak position and other parts. The average value of the power of each part or the average value of the absolute value of the signal amplitude at each sample point included in each part is obtained, and the respective amplitudes are used as the pulse amplitude near the pitch peak position and the pulse amplitude of the other parts. Output to the pulse position searcher 85. The pulse position searcher 85 evaluates Expression (4) using different amplitudes between the pulse near the pitch pulse and the pulse in the other part, and performs a pulse position search. A pulse sound source vector, which is expressed by the pulse position determined in the pulse position search and the pulse amplitude assigned to the pulse at that position, is output from the pulse position search device 85.

【０１１４】加算器９０は適応符号ベクトル成分とパル
ス音源ベクトル成分の加算を行い、励振音源ベクトルと
して出力する。The adder 90 adds the adaptive code vector component and the pulse excitation vector component, and outputs the result as an excitation excitation vector.

【０１１５】（実施の形態８）図１５は、本発明の第８
の発明の実施の形態を示し、ピッチ周期の連続性の判定
結果に基づいてパルス探索に用いる探索位置を切り替え
る構成を有する、ＣＥＬＰ型音声符号化装置の音源生成
部を示している。図１５において、９１は適応符号ベク
トルをピッチピーク位置算出器９２と乗算器９９に出力
する適応符号帳、９２は適応符号帳９１から出力された
適応符号ベクトルとピッチ周期Ｌを入力として、適応符
号ベクトル内のピッチピーク位置を探索位置算出器９４
に出力するピッチピーク位置算出器、９３はピッチ周期
Ｌを入力として、パルス音源のパルス数を探索位置算出
器９４に出力するパルス数決定器、９４は、ピッチ周期
Ｌとピッチピーク位置算出器９２から出力されたピッチ
ピーク位置とパルス数決定器９３から出力されたパルス
数を入力として、パルスの探索位置をスイッチ９８を介
してパルス位置探索器９７に出力する探索位置算出器、
９５は、現サブフレームのピッチ周期Ｌを入力とし、1
サブフレーム分遅延させて判定器９６に出力する遅延
器、９６は現サブフレームのピッチ周期Ｌと遅延器９５
から出力された前サブフレームのピッチ周期を入力とし
て、ピッチ周期の連続性の判定結果をスイッチ９８に出
力する判定器、９７はスイッチ９８を介して探索位置算
出器９４から入力されるパルスの探索位置またはスイッ
チ９８を介して入力される固定探索位置と、スイッチ９
８を介して入力されるピッチ周期Ｌをそれぞれ入力と
し、入力された探索位置とピッチ周期Ｌを用いてパルス
位置の探索を行い、パルス音源ベクトルを乗算器１００
に出力するパルス位置探索器、９８は判定器９６から入
力される判定結果に基づいて切り替わる連動する２系統
のスイッチで、一方の系統のスイッチは、パルスの探索
位置を探索位置算出器９４によって算出された探索位置
と予め定められている固定探索位置との切り替えに用い
られ、もう一方の系統のスイッチは、ピッチ周期Ｌをパ
ルス位置探索器９７に入力するかしないかのＯＮ／ＯＦ
Ｆに用いられる。９９は適応符号帳９１から出力された
適応符号ベクトルを入力とし、適応符号ベクトル利得を
乗じて加算器１０１に出力する乗算器、１００はパルス
位置探索器９７から出力されたパルス音源ベクトルを入
力とし、パルス音源ベクトル利得を乗じて加算器１０１
に出力する乗算器、１０１は乗算器９９および１００か
ら入力されたベクトルの加算を行い、励振音源ベクトル
として出力する加算器である。(Embodiment 8) FIG. 15 shows an eighth embodiment of the present invention.
And shows an excitation generator of a CELP-type speech coding apparatus having a configuration in which a search position used for pulse search is switched based on a determination result of continuity of a pitch period. In FIG. 15, reference numeral 91 denotes an adaptive codebook that outputs an adaptive code vector to a pitch peak position calculator 92 and a multiplier 99. Reference numeral 92 denotes an adaptive codebook that receives the adaptive code vector output from the adaptive codebook 91 and the pitch period L as inputs. Search position calculator 94 for pitch peak position in vector
The pitch number calculator 93 outputs the pitch period L as an input, and the pulse number determiner 93 outputs the number of pulses of the pulse sound source to the search position calculator 94. The pitch number L and the pitch peak position calculator 92 A search position calculator that receives the pitch peak position output from the controller and the number of pulses output from the pulse number determiner 93 and outputs a pulse search position to a pulse position searcher 97 via a switch 98;
95 receives the pitch period L of the current subframe as an input,
A delay unit that delays by a subframe and outputs the result to the decision unit 96.
A decision unit which outputs the pitch cycle continuity determination result to the switch 98 using the pitch period of the previous subframe output from the switch 98 as an input, and a search unit 97 for searching for a pulse input from the search position calculator 94 via the switch 98 The position or fixed search position input via switch 98 and switch 9
8, a pitch position L is input, a search for a pulse position is performed using the input search position and the input pitch period L, and the pulse excitation vector is multiplied by the multiplier 100.
The pulse position searcher 98 outputs a pulse search position 98 which is switched based on the determination result input from the determiner 96. One of the switches is used to calculate the pulse search position by the search position calculator 94. Is used to switch between the searched search position and a predetermined fixed search position, and the other system of switches is used to determine whether or not to input the pitch period L to the pulse position searcher 97 or not.
Used for F. A multiplier 99 receives the adaptive code vector output from the adaptive codebook 91 as an input, multiplies the adaptive code vector gain and outputs the result to the adder 101, and 100 receives a pulse excitation vector output from the pulse position searcher 97 as an input. , Multiplying by the pulse source vector gain
Is an adder that adds the vectors input from the multipliers 99 and 100 and outputs the result as an excitation sound source vector.

【０１１６】以上の様に構成された、ＣＥＬＰ型音声符
号化装置の音源生成部について、図１５を用いてその動
作を説明する。適応符号帳９１は、過去の励振音源のバ
ッファにより構成され、外部のピッチ分析または適応符
号帳探索手段によって求められたピッチ周期またはピッ
チラグに基づいて励振音源のバッファから該当する部分
を取り出し、適応符号ベクトルとしてピッチピーク位置
算出器９２および乗算器９９に出力する。適応符号帳９
１から乗算器９９に出力された適応符号ベクトルは、適
応符号ベクトル利得が乗算されて加算器１０１に出力さ
れる。The operation of the excitation generator of the CELP-type speech coding apparatus configured as described above will be described with reference to FIG. The adaptive codebook 91 is composed of a buffer of a past excitation source, extracts a corresponding portion from the buffer of the excitation source based on a pitch cycle or pitch lag obtained by an external pitch analysis or an adaptive codebook search means, and generates an adaptive codebook. The vector is output to the pitch peak position calculator 92 and the multiplier 99 as a vector. Adaptive codebook 9
The adaptive code vector output from 1 to the multiplier 99 is multiplied by the adaptive code vector gain and output to the adder 101.

【０１１７】ピッチピーク位置算出器９２は、適応符号
ベクトルからピッチピークを検出し、その位置を探索位
置算出器９４に出力する。ピッチピーク位置の検出（算
出）は、ピッチ周期Ｌで並べたインパルス列ベクトルと
適応符号ベクトルの内積を最大化することによって行う
ことができる。また、ピッチ周期Ｌで並べたインパルス
列ベクトルに合成フィルタのインパルス応答を畳み込ん
だベクトルと適応符号ベクトルに合成フィルタのインパ
ルス応答を畳み込んだベクトルの内積を最大化すること
によって、より精度良くピッチピーク位置の検出を行う
ことも可能である。The pitch peak position calculator 92 detects a pitch peak from the adaptive code vector and outputs the position to the search position calculator 94. The detection (calculation) of the pitch peak position can be performed by maximizing the inner product of the impulse train vector arranged in the pitch period L and the adaptive code vector. In addition, by maximizing the inner product of the vector obtained by convolving the impulse response of the synthesis filter with the impulse train vector arranged in the pitch period L and the vector obtained by convolving the impulse response of the synthesis filter with the adaptive code vector, the pitch can be more accurately determined. It is also possible to detect a peak position.

【０１１８】パルス数決定器９３はピッチ周期Ｌの値に
基づいて、パルス音源に使用するパルスの本数を決定し
て、探索位置算出器９４に出力する。パルス数とピッチ
周期の関係は予め学習的あるいは統計的に定められてお
り、たとえばピッチ周期が４５サンプル以下の場合は５
本、４５サンプルを超えて８０サンプル未満の場合は４
本、８０サンプル以上の場合は３本、というようにピッ
チ周期の値の範囲によってそれぞれのパルス本数が定め
られている。The number-of-pulses determiner 93 determines the number of pulses used for the pulse sound source based on the value of the pitch period L, and outputs it to the search position calculator 94. The relationship between the number of pulses and the pitch cycle is predetermined by learning or statistically. For example, when the pitch cycle is 45 samples or less, 5 is used.
Book, 4 if more than 45 samples and less than 80 samples
The number of each pulse is determined by the range of the pitch period value, such as three, for 80 samples or more.

【０１１９】探索位置算出器９４は、ピッチピーク位置
とパルス本数をもとにして、パルス探索を行う位置を決
定する。パルスの探索位置はピッチピーク付近は密に、
それ以外の部分は疎になるように配分される（全てのサ
ンプル点を探索するだけの十分なビット配分がないとき
に有効である）。すなわち、ピッチピーク位置近傍は全
てのサンプル点がパルス位置探索の対象となるが、ピッ
チピーク位置から離れた部分は２サンプル毎や３サンプ
ル毎というようにパルス位置探索の間隔を広くする（た
とえば、図１１（ｂ）、（ｃ）のように探索位置を決定
する）。また、パルス数が多いときは１本あたりのパル
スに配分されるビット数は少なくなるため、疎になる部
分の間隔がパルス数が少ない場合より広くなる（パルス
位置の精度が荒くなる）。なお、ピッチ周期が短い場合
は実施の形態５に示したように、サブフレーム内の最初
のピッチピークから１ピッチ周期強の範囲のみに探索範
囲を限定すると、より音声品質を向上することが可能で
ある。The search position calculator 94 determines the position where the pulse search is to be performed based on the pitch peak position and the number of pulses. The search position of the pulse is dense near the pitch peak,
The other parts are sparsely allocated (effective when there is not enough bit allocation to search all sample points). That is, all the sample points near the pitch peak position are subjected to the pulse position search, but the interval apart from the pitch peak position is widened every two samples or every three samples (for example, The search position is determined as in FIGS. 11B and 11C). Further, when the number of pulses is large, the number of bits allocated to one pulse is small, so that the interval between sparse portions is wider than when the number of pulses is small (the accuracy of the pulse position is rough). When the pitch period is short, as described in Embodiment 5, if the search range is limited to only a range slightly more than one pitch period from the first pitch peak in the subframe, voice quality can be further improved. It is.

【０１２０】パルス位置探索器９７は、探索位置算出器
９４で決定された探索位置または予め決められている固
定探索位置とピッチ周期Ｌに基づいてパルスを立てる位
置の最適な組み合わせを決定する。パルス探索の方法は
「 ITU-T STUDY GROUP15 - CONTRIBUTION 152,"G.729-C
ODING OF SPEECH AT 8 KBIT/S USING CONJUGATE-STRUCT
URE ALGEBRAIC-CODE-EXCITED LINEAR-PREDICTION (CS-A
CELP)", COM 15-152-EJuly 1995」に示されているよう
に、たとえばパルス数が４本の場合は式（２）を最大化
するようにｉ０からｉ３の組み合わせを決定する。The pulse position searcher 97 determines an optimum combination of the search position determined by the search position calculator 94 or a predetermined fixed search position and a position where a pulse is to be formed based on the pitch period L. For the method of pulse search, see "ITU-T STUDY GROUP15-CONTRIBUTION 152," G.729-C
ODING OF SPEECH AT 8 KBIT / S USING CONJUGATE-STRUCT
URE ALGEBRAIC-CODE-EXCITED LINEAR-PREDICTION (CS-A
CELP) ", COM 15-152-EJuly 1995", for example, when the number of pulses is four, the combination of i0 to i3 is determined so as to maximize Expression (2).

【０１２１】スイッチ９８の切り替えは、判定器９６の
判定結果に基づいて行われる。判定器９６は、現サブフ
レームのピッチ周期Ｌと遅延器９５から入力された直前
のサブフレームにおけるピッチ周期を用いて、ピッチ周
期が連続しているか否かを判定する。具体的には、現サ
ブフレームのピッチ周期の値と直前のサブフレームのピ
ッチ周期の値の差が予め定められたあるいは計算により
求められた閾値以下の場合に、ピッチ周期が連続してい
ると判定する。ピッチ周期が連続であると判定された場
合、現サブフレームは有声・有声定常部であるとみな
し、スイッチ９８は探索位置算出器９４とパルス位置探
索器９７を接続し、ピッチ周期Ｌをパルス位置探索器９
７に入力する（スイッチ９８の一方の系統は探索位置算
出器９４に切り替えられ、もう一方の系統はＯＮ状態と
なってピッチ周期Ｌをパルス位置探索器９７に入力す
る）。ピッチ周期が連続でない（現サブフレームのピッ
チ周期と直前のサブフレームのピッチ周期の差が閾値を
超えてい）と判定された場合、現サブフレームは有声・
有声定常部でない（無声部・有声立ち上がり部である）
とみなし、スイッチ９８は予め定められている固定探索
位置をパルス探索器９７に入力し、ピッチ周期Ｌはパル
ス位置探索器に入力しない（スイッチ９８の一方の系統
は固定探索位置に切り替えられ、もう一方の系統はＯＦ
Ｆ状態となってピッチ周期Ｌはパルス位置探索器９７に
入力されない）。The switching of the switch 98 is performed on the basis of the judgment result of the judgment unit 96. The determiner 96 determines whether or not the pitch periods are continuous using the pitch period L of the current subframe and the pitch period of the immediately preceding subframe input from the delay unit 95. Specifically, when the difference between the value of the pitch period of the current subframe and the value of the pitch period of the immediately preceding subframe is equal to or less than a predetermined or calculated threshold, the pitch period is considered to be continuous. judge. If it is determined that the pitch period is continuous, the current subframe is regarded as a voiced / voiced stationary part, and the switch 98 connects the search position calculator 94 and the pulse position searcher 97 to change the pitch period L to the pulse position. Searcher 9
7 (one system of the switch 98 is switched to the search position calculator 94, and the other system is turned on to input the pitch period L to the pulse position search device 97). If it is determined that the pitch cycle is not continuous (the difference between the pitch cycle of the current subframe and the pitch cycle of the immediately preceding subframe exceeds a threshold), the current subframe is voiced.
Not voiced stationary part (voiceless part / voiced rising part)
The switch 98 inputs a predetermined fixed search position to the pulse searcher 97 and does not input the pitch period L to the pulse position searcher (one system of the switch 98 is switched to the fixed search position, and One system is OF
In the F state, the pitch period L is not input to the pulse position search unit 97).

【０１２２】パルス位置探索器９７によって、最適パル
ス位置の組み合わせた決定されると、その組み合わせに
よって生成されるパルス音源ベクトルが乗算器１００に
出力され、パルス符号ベクトル利得が乗算され、加算器
１０１に出力される。When the combination of the optimum pulse positions is determined by the pulse position search unit 97, the pulse excitation vector generated by the combination is output to the multiplier 100, multiplied by the pulse code vector gain, and added to the adder 101. Is output.

【０１２３】加算器１０１は適応符号ベクトル成分とパ
ルス音源ベクトル成分の加算を行い、励振音源ベクトル
として出力する。The adder 101 adds the adaptive code vector component and the pulse excitation vector component, and outputs the result as an excitation excitation vector.

【０１２４】なお、図１６に示した表は図１５の固定探
索位置の内容の一例を示している。図１６（ｂ）は、図
１３に示した探索位置と同様に１パルスあたり８個所の
位置を割り当てた場合に、サブフレーム全体に均等に探
索位置が散らばるように探索位置を固定したものである
（ピッチピーク近傍を密に、その他の部分を疎に、とい
うのではなく、全体的に均等な密度にしている）。ま
た、図１６（ａ）は、４パルスのうち２パルスに割り当
てる探索位置を４個所ずつに減らす代わりに、探索位置
の種類を４種類にして、サブフレーム内の全てのサンプ
ル点がどれかの探索位置グループに含まれる様にしたも
のである（パルス位置を表現する為のビット数は（ａ）
も（ｂ）も図１３も全て同じ）。このようにすると、図
１６（ｂ）のように、全く探索されない位置がなくなる
為、同一のビット数でも一般的に図１６（ａ）の方が性
能が良くなる。The table shown in FIG. 16 shows an example of the contents of the fixed search position in FIG. FIG. 16B shows a case where the search positions are fixed so that the search positions are evenly scattered over the entire subframe when eight positions are assigned per pulse in the same manner as the search positions shown in FIG. (Rather than densely near the pitch peak and sparsely in other parts, the density is made uniform as a whole). FIG. 16 (a) shows that, instead of reducing the search positions assigned to two of the four pulses to four, four search positions are used and all the sample points in the sub-frame are located at one of the four positions. It is included in the search position group (the number of bits for expressing the pulse position is (a)
13 (b) and FIG. 13). In this case, as shown in FIG. 16B, there are no positions that are not searched at all, so that the performance of FIG. 16A generally improves with the same number of bits.

【０１２５】なお、本実施例では、パルス数決定器９３
を有するパルス数可変型の音声符号化装置の音源生成部
を示したが、パルス数決定器９３を持たないパルス数固
定型のものにおいても、ピッチ周期の連続性を用いたパ
ルス探索位置切り替えは有効である。また、本実施例で
は、ピッチ周期の連続性を直前のサブフレームと現在の
サブフレームのピッチ周期のみから判定しているが、さ
らに過去のサブフレームのピッチ周期を利用することに
よって判定確度を向上させることも可能である。In this embodiment, the pulse number determiner 93 is used.
Although the excitation generation unit of the variable pulse number speech coding apparatus having the above is shown, the pulse search position switching using the continuity of the pitch period can be performed even in the fixed pulse number type without the pulse number determiner 93. It is valid. Further, in the present embodiment, the continuity of the pitch cycle is determined only from the pitch cycle of the immediately preceding subframe and the current subframe. However, the determination accuracy is further improved by using the pitch cycle of the past subframe. It is also possible to make it.

【０１２６】（実施の形態９）図１７は、本発明の第９
の発明の実施の形態を示し、ピッチゲイン（適応符号ベ
クトル利得）の量子化が２段量子化構成になっており、
初段のターゲットが適応符号帳探索直後に算出されるピ
ッチゲインであり、この初段の量子化ピッチゲインに基
づいてパルス探索に用いる探索位置を切り替える構成を
有する、ＣＥＬＰ型音声符号化装置の音源生成部を示し
ている。図１７において、１１１は、適応符号ベクトル
をピッチピーク位置算出器１１２とピッチゲイン算出器
１１６と乗算器１２３に出力する適応符号帳、１１２
は、適応符号帳１１１から出力された適応符号ベクトル
とピッチ周期Ｌを入力として、適応符号ベクトル内のピ
ッチピーク位置を探索位置算出器１１４に出力するピッ
チピーク位置算出器、１１３はピッチ周期Ｌを入力とし
て、パルス音源のパルス数を探索位置算出器１１４に出
力するパルス数決定器、１１４はピッチ周期Ｌとピッチ
ピーク位置算出器１１２から出力されたピッチピーク位
置とパルス数決定器１１３から出力されたパルス数を入
力として、パルスの探索位置をスイッチ１１５を介して
パルス位置探索器１１９に出力する探索位置算出器、１
１５は判定器１１８から入力される判定結果に基づいて
切り替わる連動する２系統のスイッチで、一方の系統の
スイッチはパルスの探索位置を探索位置算出器１１４に
よって算出された探索位置と予め定められている固定探
索位置との切り替えに用いられ、もう一方の系統のスイ
ッチは、ピッチ周期Ｌをパルス位置探索器１１９に入力
するかしないかのＯＮ／ＯＦＦに用いられる。１１６は
適応符号帳１１１から出力された適応符号ベクトルと現
フレームのターゲットベクトルとインパルス応答を入力
とし、ピッチゲインを量子化器１１７に出力するピッチ
ゲイン算出器、１１７はピッチゲイン算出器１１６から
出力されるピッチゲインを量子化して、判定器１１８と
加算器１２０および１２２に出力する量子化器、１１８
は量子化器１１７から出力された初段量子化ピッチゲイ
ンを入力として、ピッチ周期性の判定結果をスイッチ１
１５に出力する判定器、１１９はスイッチ１１５を介し
て探索位置算出器１１４から入力されるパルスの探索位
置またはスイッチ１１５を介して入力される固定探索位
置と、スイッチ１１５を介して入力されるピッチ周期Ｌ
をそれぞれ入力とし、入力された探索位置とピッチ周期
Ｌを用いてパルス位置の探索を行い、パルス音源ベクト
ルを乗算器１２４に出力するパルス位置探索器、１２０
は量子化器１１７から出力された初段量子化ピッチゲイ
ンと差分量子化器１２１から出力された差分量子化ピッ
チゲインとを入力として、加算結果を最適量子化ピッチ
ゲイン（適応符号ベクトル利得）として乗算器１２３に
出力する加算器、１２１は加算器１２２から出力された
差分値を入力とし、その量子化値を加算器１２０に出力
する差分量子化器、１２２は適応符号ベクトルとパルス
音源ベクトルが決定された後に外部で算出される最適ピ
ッチゲイン（適応符号ベクトル利得）と量子化器１１７
から出力された初段量子化ピッチゲイン（適応符号ベク
トル利得）とを入力とし、これらの差分を差分量子化器
１２１に出力する加算器、１２３は適応符号帳１１１か
ら出力された適応符号ベクトルを入力とし、加算器１２
０から出力された量子化ピッチゲイン（適応符号ベクト
ル利得）を乗じて加算器１２５に出力する乗算器、１２
４はパルス位置探索器１１９から出力されたパルス音源
ベクトルを入力とし、パルス音源ベクトル利得を乗じて
加算器１２５に出力する乗算器、１２５は乗算器１２３
および１２４から入力されたベクトルの加算を行い、励
振音源ベクトルとして出力する加算器である。(Embodiment 9) FIG. 17 shows a ninth embodiment of the present invention.
And the pitch gain (adaptive code vector gain) has a two-stage quantization configuration.
The first-stage target is a pitch gain calculated immediately after the adaptive codebook search, and has a configuration in which a search position used for pulse search is switched based on the first-stage quantization pitch gain. Is shown. In FIG. 17, reference numeral 111 denotes an adaptive codebook that outputs an adaptive code vector to a pitch peak position calculator 112, a pitch gain calculator 116, and a multiplier 123.
Is a pitch peak position calculator that receives the adaptive code vector and the pitch period L output from the adaptive codebook 111 and outputs a pitch peak position in the adaptive code vector to the search position calculator 114. As an input, a pulse number determiner that outputs the number of pulses of the pulse sound source to the search position calculator 114. The pitch period L and the pitch peak position output from the pitch peak position calculator 112 are output from the pulse number determiner 113. A search position calculator that outputs the pulse search position to the pulse position searcher 119 via the switch 115 with the number of pulses input as input,
Reference numeral 15 denotes an interlocking two-system switch that switches based on the judgment result input from the judgment unit 118. One of the switches is configured to set the pulse search position to the search position calculated by the search position calculator 114 in advance. The switch of the other system is used for ON / OFF of whether or not to input the pitch period L to the pulse position search unit 119. A pitch gain calculator 116 receives the adaptive code vector output from the adaptive codebook 111, the target vector of the current frame, and an impulse response as input, and outputs a pitch gain to a quantizer 117. 117 outputs from the pitch gain calculator 116. Quantizer 118 which quantizes the pitch gain to be output and outputs the quantized pitch gain to determiner 118 and adders 120 and 122.
The switch 1 receives the first-stage quantization pitch gain output from the quantizer 117 and determines the pitch periodicity determination result using the switch 1.
A decision unit 119 for outputting to the reference numeral 15 is a pulse search position input from the search position calculator 114 via the switch 115 or a fixed search position input via the switch 115, and a pitch input via the switch 115. Period L
, A pulse position searcher that searches for a pulse position using the input search position and the pitch period L, and outputs a pulse excitation vector to the multiplier 124.
Is input with the first-stage quantization pitch gain output from the quantizer 117 and the difference quantization pitch gain output from the difference quantizer 121, and multiplies the addition result as an optimum quantization pitch gain (adaptive code vector gain). An adder 121 outputs the difference value output from the adder 122, and a difference quantizer 121 outputs the quantized value to the adder 120. The adder 122 determines the adaptive code vector and the pulse excitation vector. Pitch gain (adaptive code vector gain) calculated externally after quantization and a quantizer 117
The adder 123 receives the first-stage quantization pitch gain (adaptive code vector gain) output from the input device and outputs a difference between them to a difference quantizer 121. The adder 123 inputs the adaptive code vector output from the adaptive codebook 111. And the adder 12
A multiplier that multiplies the quantization pitch gain (adaptive code vector gain) output from 0 and outputs the result to the adder 125;
4 is a multiplier which receives the pulse excitation vector output from the pulse position searcher 119 as input, multiplies the pulse excitation vector gain and outputs the result to the adder 125, and 125 is a multiplier 123
And an adder that adds the vectors input from the and 124 and outputs the result as an excitation sound source vector.

【０１２７】以上のように構成された音声符号化装置の
音源生成部について、図１７を用いてその動作を説明す
る。適応符号帳１１１は、過去の励振音源のバッファに
より構成され、外部のピッチ分析または適応符号帳探索
手段によって求められたピッチ周期またはピッチラグに
基づいて励振音源のバッファから該当する部分を取り出
し、適応符号ベクトルとしてピッチピーク位置算出器１
１２およびピッチゲイン算出器１１６および乗算器１２
３に出力する。適応符号帳１１１から乗算器１２３に出
力された適応符号ベクトルは、加算器１２０から出力さ
れる量子化ピッチゲイン（適応符号ベクトル利得）が乗
算されて加算器１２５に出力される。The operation of the sound source generation unit of the speech coding apparatus configured as described above will be described with reference to FIG. The adaptive codebook 111 is composed of a buffer of a past excitation source, extracts a corresponding portion from the buffer of the excitation source based on the pitch period or pitch lag obtained by an external pitch analysis or an adaptive codebook search means, Pitch peak position calculator 1 as vector
12 and pitch gain calculator 116 and multiplier 12
Output to 3. The adaptive code vector output from the adaptive codebook 111 to the multiplier 123 is multiplied by the quantization pitch gain (adaptive code vector gain) output from the adder 120 and output to the adder 125.

【０１２８】ピッチピーク位置算出器１１２は、適応符
号ベクトルからピッチピークを検出し、その位置を探索
位置算出器１１４に出力する。ピッチピーク位置の検出
（算出）は、ピッチ周期Ｌで並べたインパルス列ベクト
ルと適応符号ベクトルの内積を最大化することによって
行うことができる。また、ピッチ周期Ｌで並べたインパ
ルス列ベクトルに合成フィルタのインパルス応答を畳み
込んだベクトルと適応符号ベクトルに合成フィルタのイ
ンパルス応答を畳み込んだベクトルの内積を最大化する
ことによって、より精度良くピッチピーク位置の検出を
行うことも可能である。The pitch peak position calculator 112 detects the pitch peak from the adaptive code vector and outputs the position to the search position calculator 114. The detection (calculation) of the pitch peak position can be performed by maximizing the inner product of the impulse train vector arranged in the pitch period L and the adaptive code vector. In addition, by maximizing the inner product of the vector obtained by convolving the impulse response of the synthesis filter with the impulse train vector arranged in the pitch period L and the vector obtained by convolving the impulse response of the synthesis filter with the adaptive code vector, the pitch can be more accurately determined. It is also possible to detect a peak position.

【０１２９】パルス数決定器１１３はピッチ周期Ｌの値
に基づいて、パルス音源に使用するパルスの本数を決定
して、探索位置算出器１１４に出力する。パルス数とピ
ッチ周期の関係は予め学習的あるいは統計的に定められ
ており、たとえばピッチ周期が４５サンプル以下の場合
は５本、４５サンプルを超えて８０サンプル未満の場合
は４本、８０サンプル以上の場合は３本、というように
ピッチ周期の値の範囲によってそれぞれのパルス本数が
定められている。The number-of-pulses determiner 113 determines the number of pulses used for the pulse sound source based on the value of the pitch period L, and outputs it to the search position calculator 114. The relationship between the number of pulses and the pitch period is predetermined by learning or statistically. For example, when the pitch period is 45 samples or less, 5 lines, when the pitch period is more than 45 samples and less than 80 samples, 4 lines, and 80 samples or more. In the case of (3), the number of each pulse is determined by the range of the pitch period value, such as three pulses.

【０１３０】探索位置算出器１１４は、ピッチピーク位
置とパルス本数をもとにして、パルス探索を行う位置を
決定する。パルスの探索位置はピッチピーク付近は密
に、それ以外の部分は疎になるように配分される（全て
のサンプル点を探索するだけの十分なビット配分がない
ときに有効である）。すなわち、ピッチピーク位置近傍
は全てのサンプル点がパルス位置探索の対象となるが、
ピッチピーク位置から離れた部分は2サンプル毎や３サ
ンプル毎というようにパルス位置探索の間隔を広くする
（たとえば、図１１（ｂ）、（ｃ）のように探索位置を
決定する）。また、パルス数が多いときは１本あたりの
パルスに配分されるビット数は少なくなるため、疎にな
る部分の間隔がパルス数が少ない場合より広くなる（パ
ルス位置の精度が荒くなる）。なお、ピッチ周期が短い
場合は、実施の形態５に示したように、サブフレーム内
の最初のピッチピークから１ピッチ周期強の範囲のみに
探索範囲を限定すると、より音声品質を向上することが
可能である。The search position calculator 114 determines the position where the pulse search is to be performed based on the pitch peak position and the number of pulses. Pulse search positions are distributed so as to be dense near the pitch peak and sparse in other portions (effective when there is not enough bit allocation to search all sample points). That is, all sample points near the pitch peak position are subject to pulse position search,
In a portion away from the pitch peak position, the interval of the pulse position search is widened such as every two samples or every three samples (for example, the search position is determined as shown in FIGS. 11B and 11C). Further, when the number of pulses is large, the number of bits allocated to one pulse is small, so that the interval between sparse portions is wider than when the number of pulses is small (the accuracy of the pulse position is rough). When the pitch period is short, as described in Embodiment 5, if the search range is limited to only a range slightly more than one pitch period from the first pitch peak in the subframe, the voice quality can be further improved. It is possible.

【０１３１】パルス位置探索器１１９は、探索位置算出
器１１４で決定された探索位置または予め決められてい
る固定探索位置とピッチ周期Ｌに基づいてパルスを立て
る位置の最適な組み合わせを決定する。パルス探索の方
法は「 ITU-T STUDY GROUP15- CONTRIBUTION 152,"G.72
9-CODING OF SPEECH AT 8 KBIT/S USING CONJUGATE-STR
UCTURE ALGEBRAIC-CODE-EXCITED LINEAR-PREDICTION (C
S-ACELP)", COM 15-152-E July 1995」に示されている
ように、たとえばパルス数が４本の場合は、式（２）を
最大化するようにｉ０からｉ３の組み合わせを決定す
る。The pulse position search unit 119 determines an optimum combination of the search position determined by the search position calculator 114 or a predetermined fixed search position and a position where a pulse is to be formed based on the pitch period L. See “ITU-T STUDY GROUP15-CONTRIBUTION 152,” G.72
9-CODING OF SPEECH AT 8 KBIT / S USING CONJUGATE-STR
UCTURE ALGEBRAIC-CODE-EXCITED LINEAR-PREDICTION (C
S-ACELP) ", COM 15-152-E July 1995", for example, when the number of pulses is four, the combination of i0 to i3 is determined so as to maximize Expression (2). I do.

【０１３２】スイッチ１１５の切り替えは、判定器１１
８の判定結果に基づいて行われる。判定器１１８は、量
子化器１１７から出力された初段量子化ピッチゲインを
用いて、現サブフレームがピッチ周期性の強いサブフレ
ームか否かを判定する。具体的には、初段量子化ピッチ
ゲインが予め定められたあるいは計算により求められた
範囲内にある場合に、ピッチ周期性が強いと判定する。
ピッチ周期性が強いと判定された場合、現サブフレーム
は有声・有声定常部であるとみなし、スイッチ１１５は
探索位置算出器１１４とパルス位置探索器１１９を接続
し、ピッチ周期Ｌをパルス位置探索器に入力する（スイ
ッチ１１５の一方の系統は探索位置算出器１１４に切り
替えられ、もう一方の系統はＯＮ状態となってピッチ周
期Ｌをパルス位置探索器１１９に入力する）。ピッチ周
期が連続でない（現サブフレームのピッチ周期と直前の
サブフレームのピッチ周期の差が閾値を超えてい）と判
定された場合、現サブフレームは有声・有声定常部でな
い（無声部・有声立ち上がり部である）とみなし、スイ
ッチ１１５は予め定められている固定探索位置をパルス
探索器１１９に入力し、ピッチ周期Ｌはパルス位置探索
器に入力しない（スイッチ１１５の一方の系統は固定探
索位置に切り替えられ、もう一方の系統はＯＦＦ状態と
なってピッチ周期Ｌはパルス位置探索器１１９に入力さ
れない）。The switching of the switch 115 is performed by the decision unit 11.
The determination is performed based on the determination result of Step 8. The determiner 118 uses the first-stage quantization pitch gain output from the quantizer 117 to determine whether the current subframe is a subframe with a strong pitch periodicity. Specifically, when the first-stage quantization pitch gain is within a predetermined range or a range obtained by calculation, it is determined that the pitch periodicity is strong.
If it is determined that the pitch periodicity is strong, the current subframe is regarded as a voiced / voiced stationary part, and the switch 115 connects the search position calculator 114 and the pulse position searcher 119 to determine the pitch period L as the pulse position search. (One system of the switch 115 is switched to the search position calculator 114, and the other system is turned on to input the pitch period L to the pulse position search device 119). If it is determined that the pitch cycle is not continuous (the difference between the pitch cycle of the current subframe and the pitch cycle of the immediately preceding subframe exceeds a threshold), the current subframe is not a voiced / voiced stationary part (unvoiced part / voiced rising) The switch 115 inputs a predetermined fixed search position to the pulse searcher 119, and does not input the pitch period L to the pulse position searcher (one system of the switch 115 is set to the fixed search position). The other system is switched off, and the pitch period L is not input to the pulse position search unit 119).

【０１３３】パルス位置探索器１１９によって、最適パ
ルス位置の組み合わせた決定されると、その組み合わせ
によって生成されるパルス音源ベクトルが乗算器１２４
に出力され、パルス符号ベクトル利得が乗算され、加算
器１２５に出力される。When the combination of the optimum pulse positions is determined by the pulse position searcher 119, the pulse excitation vector generated by the combination is multiplied by the multiplier 124.
Are multiplied by the pulse code vector gain and output to the adder 125.

【０１３４】ピッチゲイン算出器１１６は、現サブフレ
ームの量子化ＬＰＣ合成フィルタと聴覚重みづけフィル
タを縦続接続したフィルタのインパルス応答とターゲッ
トベクトルと適応符号帳から出力された適応符号ベクト
ルとを用いて、式（５）によってピッチゲイン（適応符
号ベクトル利得）を算出する。算出されたピッチゲイン
は量子化器１１７で量子化され、ピッチ周期性の強さを
判定する判定器１１８と加算器１２０および１２２に出
力される。加算器１２２では、音源符号帳探索（適応符
号帳探索と雑音符号帳探索（本実施例ではパルス位置探
索））が全て終了した後に算出される最適量子化ピッチ
ゲインと、量子化器１１７から出力される（初段）量子
化ピッチゲインとの差分を計算し、差分量子化器１２１
に出力する。差分量子化器１２１で量子化された差分値
は、加算器１２０によって、量子化器１１７から出力さ
れる初段量子化ピッチゲインと加算されて、最適量子化
ピッチゲインとして乗算器１２３に出力される。The pitch gain calculator 116 uses the impulse response of the filter obtained by cascading the quantized LPC synthesis filter and the perceptual weighting filter of the current subframe, the target vector, and the adaptive code vector output from the adaptive codebook. , Equation (5) is used to calculate a pitch gain (adaptive code vector gain). The calculated pitch gain is quantized by the quantizer 117 and output to the determiner 118 for determining the strength of the pitch periodicity and to the adders 120 and 122. The adder 122 outputs the optimum quantization pitch gain calculated after all of the excitation codebook search (the adaptive codebook search and the noise codebook search (the pulse position search in this embodiment)) and the output from the quantizer 117. (First stage) and a quantized pitch gain, and calculates a difference quantizer 121.
Output to The difference value quantized by the difference quantizer 121 is added by the adder 120 to the first-stage quantization pitch gain output from the quantizer 117, and is output to the multiplier 123 as the optimum quantization pitch gain. .

【０１３５】乗算器１２３は、適応符号帳１１１から出
力された適応符号ベクトルに最適量子化ピッチゲインを
乗じて、加算器１２５に出力する。Multiplier 123 multiplies the adaptive code vector output from adaptive codebook 111 by the optimum quantization pitch gain, and outputs the result to adder 125.

【０１３６】加算器１２５は、適応符号ベクトル成分と
パルス音源ベクトル成分の加算を行い、励振音源ベクト
ルとして出力する。The adder 125 adds the adaptive code vector component and the pulse excitation vector component, and outputs the result as an excitation excitation vector.

【０１３７】なお、本実施例では、判定器１１８の入力
として、現サブフレームの初段量子化ピッチゲインを用
いたが、一般的なゲイン量子化を用いた場合（本実施例
に示したような多段量子化を用いない場合）には、直前
のサブフレームの量子化ピッチゲイン（適応符号ベクト
ル利得）を判定器１１８の入力とすることも可能であ
る。また、本実施例では、パルス数決定器を有するパル
ス数可変型の音声符号化装置の音源生成部を示したが、
パルス数決定器を持たないパルス数固定型のものにおい
ても、ピッチゲインの値を用いて周期性の強さを判定し
てパルス探索位置切り替えを行うことは有効である。In this embodiment, the first-stage quantization pitch gain of the current subframe is used as an input to the decision unit 118. However, when general gain quantization is used (as shown in this embodiment). In the case where multi-stage quantization is not used), the quantization pitch gain (adaptive code vector gain) of the immediately preceding subframe can be used as an input to the determiner 118. Also, in the present embodiment, the excitation generator of the variable pulse number speech coding apparatus having the pulse number determiner has been described.
Even in a fixed pulse number type having no pulse number determiner, it is effective to switch the pulse search position by determining the strength of the periodicity using the pitch gain value.

【０１３８】（実施の形態１０）図１８は本発明の第１
０の実施の形態を示し、連続するサブフレーム間の音源
信号波形の位相の連続性を利用して、バックワードで雑
音符号帳に対する位相適応処理の切り替えを行う音声符
号化装置の音源生成部を示す。図１８において、１８０
１は適応符号ベクトルをピッチピーク位置算出器１８０
２と乗算器１８１０に出力する適応符号帳、１８０２は
適応符号帳１８０１から出力された適応符号ベクトルと
ピッチ周期Ｌとを入力として、適応符号ベクトル内のピ
ッチピーク位置を遅延器１８０３と判定器１８０６と探
索位置算出器１８０７とに出力するピッチピーク位置算
出器、１８０３はピッチピーク位置算出器１８０２から
出力されたピッチピーク位置を入力として、１サブフレ
ーム分遅延させてピッチピーク位置予測器１８０５に出
力する遅延器、１８０４はピッチ周期Ｌを入力として、
１サブフレーム分遅延させてピッチピーク位置予測器１
８０５に出力する遅延器、１８０５は遅延器１８０３か
ら出力された直前のサブフレームにおけるピッチピーク
位置と遅延器１８０４から出力された直前のサブフレー
ムにおけるピッチ周期と現在のサブフレームにおけるピ
ッチ周期Ｌとを入力として、予測ピッチピーク位置を判
定器１８０６に出力するピッチピーク位置予測器、１８
０６はピッチピーク位置算出器１８０２から出力された
ピッチピーク位置とピッチピーク位置予測器１８０５か
ら出力された予測ピッチピーク位置とを入力として、直
前のサブフレームと現在のサブフレームで位相の連続性
があるかどうかを判定し、判定結果をスイッチ１８０８
に出力する判定器、１８０７はピッチピーク位置算出器
１８０２から出力されたピッチピーク位置とピッチ周期
Ｌとを入力として、音源パルスの探索位置をスイッチ１
８０８を介してパルス位置探索器１８０９に出力する探
索位置算出器、１８０８は判定器１８０６から出力され
た判定結果に基づいて切り替わるスイッチで、探索位置
算出器から出力された探索位置と予め定められている固
定探索位置との切り替えに用いられる。１８０９はスイ
ッチ１８０８を介して探索位置算出器１８０７から入力
される音源パルスの探索位置またはスイッチ１８０８を
介して入力される固定探索位置と、ピッチ周期Ｌをそれ
ぞれ入力とし、入力された音源パルス探索位置とピッチ
周期Ｌを用いて音源パルスの位置を探索し、パルス音源
ベクトルを乗算器１８１２に出力するパルス位置探索
器、１８１０は適応符号帳１８０１から出力された適応
符号ベクトルを入力として量子化適応符号ベクトル利得
を乗じて加算器１８１１に出力する乗算器、１８１２は
パルス位置探索器１８０９から出力されるパルス音源ベ
クトルを入力として量子化パルス音源ベクトル利得を乗
じて加算器１８１１に出力する乗算器、１８１１は乗算
器１８１０および１８１２から出力されたベクトルをそ
れぞれ入力とし、入力されたベクトルの加算を行い、励
振音源ベクトルとして出力する加算器である。(Embodiment 10) FIG. 18 shows the first embodiment of the present invention.
0, and uses a continuity of the phase of the excitation signal waveform between consecutive subframes to switch the phase adaptation process for the noise codebook in the backward direction. Show. In FIG. 18, 180
Reference numeral 1 denotes an adaptive code vector for calculating a pitch peak position calculator 180.
2 and an adaptive codebook 1802 to be output to the multiplier 1810, and the adaptive codebook 1802 receives the adaptive code vector output from the adaptive codebook 1801 and the pitch period L as inputs and determines the pitch peak position in the adaptive code vector by the delay unit 1803 and the decision unit 1806. A pitch peak position calculator 1803 that outputs the pitch peak position output from the pitch peak position calculator 1802 to the search position calculator 1807, and delays it by one subframe and outputs it to the pitch peak position predictor 1805 A delay device 1804 receives a pitch period L as an input,
Pitch peak position estimator 1 delayed by one subframe
The delay unit 1805 outputs the pitch peak position in the immediately preceding subframe output from the delay unit 1803, the pitch period in the immediately preceding subframe output from the delay unit 1804, and the pitch period L in the current subframe. A pitch peak position predictor that outputs a predicted pitch peak position to a determiner 1806 as an input;
06 receives the pitch peak position output from the pitch peak position calculator 1802 and the predicted pitch peak position output from the pitch peak position estimator 1805, and determines the phase continuity between the immediately preceding subframe and the current subframe. It is determined whether or not there is a switch.
, And 1807 receives the pitch peak position and the pitch period L output from the pitch peak position calculator 1802 as inputs, and switches the search position of the sound source pulse to the switch 1.
A search position calculator that outputs to the pulse position searcher 1809 via 808, a switch 1808 that switches based on the determination result output from the determiner 1806, and is set in advance as the search position output from the search position calculator. It is used for switching between fixed search positions. Reference numeral 1809 designates a search position of a sound source pulse input from the search position calculator 1807 via the switch 1808 or a fixed search position input via the switch 1808, and a pitch period L, respectively. A pulse position searcher that searches for the position of the excitation pulse by using the pulse period L and the pulse excitation vector, and outputs the pulse excitation vector to the multiplier 1812. The adaptive position detection unit 1810 receives the adaptive code vector output from the adaptive codebook 1801 as an input, and A multiplier 1812 which multiplies by a vector gain and outputs it to an adder 1811; a multiplier 1812 which receives the pulse excitation vector output from the pulse position searcher 1809 as an input, multiplies by a quantized pulse excitation vector gain, and outputs it to an adder 1811; Are the vectors output from the multipliers 1810 and 1812, respectively. As input, performs addition of the input vector, an adder which outputs as excitation excitation vector.

【０１３９】以上のように構成された音声符号化装置の
音源生成部について、図１８を用いてその動作を説明す
る。適応符号帳１８０１は、過去の励振音源のバッファ
により構成され、外部のピッチ分析または適応符号帳探
索手段によって求められたピッチ周期またはピッチラグ
に基づいて励振音源のバッファから該当する部分を取り
出し、適応符号ベクトルとしてピッチピーク位置算出器
１８０２および乗算器１８１０に出力する。適応符号帳
１８０１から乗算器１８１０に出力された適応符号ベク
トルは、外部のゲイン量子化器によって量子化された量
子化適応符号ベクトル利得が乗算されて加算器１８１１
に出力される。The operation of the sound source generation unit of the speech coding apparatus configured as described above will be described with reference to FIG. The adaptive codebook 1801 is constituted by a buffer of a past excitation source, extracts a corresponding part from the buffer of the excitation source based on a pitch cycle or pitch lag obtained by an external pitch analysis or an adaptive codebook search means, The vector is output to the pitch peak position calculator 1802 and the multiplier 1810 as a vector. The adaptive code vector output from the adaptive codebook 1801 to the multiplier 1810 is multiplied by a quantized adaptive code vector gain quantized by an external gain quantizer to obtain an adder 1811.
Is output to

【０１４０】ピッチピーク位置算出器１８０２は、適応
符号ベクトルからピッチピークを検出し、その位置を遅
延器１８０３と判定器１８０６と探索位置算出器１８０
７のそれぞれに出力する。ピッチピーク位置の検出（算
出）は、ピッチ周期Ｌで並べたインパルス列ベクトルと
適応符号ベクトルの正規化相互相関関数を最大化するこ
とによって行うことができる。また、ピッチ周期Ｌで並
べたインパルス列ベクトルに合成フィルタのインパルス
応答を畳み込んだベクトルと、適応符号ベクトルに合成
フィルタのインパルス応答を畳み込んだベクトルとの正
規化相互相関関数を最大化することによって、より精度
良くピッチピーク位置の検出を行うことも可能である。
さらに、検出されたピッチピーク位置を含む１ピッチ周
期波形の中から振幅値最大となる位置をピッチピークと
する後処理を加えれば、１ピッチ周期波形内のセカンド
ピークを誤検出することを回避することも可能である。A pitch peak position calculator 1802 detects a pitch peak from the adaptive code vector, and determines the position of the pitch peak by using a delay unit 1803, a determiner 1806, and a search position calculator 1802.
7 is output. The detection (calculation) of the pitch peak position can be performed by maximizing the normalized cross-correlation function between the impulse train vector arranged in the pitch period L and the adaptive code vector. Also, maximizing a normalized cross-correlation function between a vector obtained by convolving the impulse response of the synthesis filter with the impulse train vector arranged in the pitch period L and a vector obtained by convolving the impulse response of the synthesis filter with the adaptive code vector. Accordingly, the pitch peak position can be detected with higher accuracy.
Furthermore, if post-processing is performed to set the position where the amplitude value becomes maximum among the one-pitch periodic waveforms including the detected pitch-peak position as the pitch peak, erroneous detection of the second peak in the one-pitch periodic waveform is avoided. It is also possible.

【０１４１】遅延器１８０３は、ピッチピーク位置算出
器１８０２で算出されたピッチピーク位置を１サブフレ
ーム分だけ遅延させてピッチピーク位置予測器１８０５
に出力する。即ち、ピッチピーク位置予測器１８０５に
は直前のサブフレームにおけるピッチピーク位置が遅延
器１８０３から入力される。遅延器１８０４は、ピッチ
周期Ｌを１サブフレーム分だけ遅延させてピッチピーク
位置算出器１８０５に出力する。即ち、ピッチピーク位
置予測器１８０５には直前のサブフレームにおけるピッ
チ周期が遅延器１８０４から入力される。The delay unit 1803 delays the pitch peak position calculated by the pitch peak position calculator 1802 by one subframe, and delays the pitch peak position by a subframe.
Output to That is, the pitch peak position in the immediately preceding subframe is input from the delay unit 1803 to the pitch peak position predictor 1805. Delay device 1804 delays pitch period L by one subframe and outputs the result to pitch peak position calculator 1805. That is, the pitch period in the immediately preceding subframe is input from the delay unit 1804 to the pitch peak position predictor 1805.

【０１４２】ピッチピーク位置予測器１８０５は、遅延
器１８０３から入力される直前のサブフレームにおける
ピッチピーク位置と、遅延器１８０４から入力される直
前のサブフレームにおけるピッチ周期と、現在のサブフ
レームにおけるピッチ周期Ｌを入力として、現在のサブ
フレームにおけるピッチピーク位置を予測し、予測ピッ
チピーク位置を判定器１８０６に出力する。予測ピッチ
ピーク位置は（６）式によって求められる（図１９参
照）。 Φ（Ｎ）＝Φ（Ｎ−１）＋ｎ×Ｔ（Ｎ−１）＋Ｔ（Ｎ）−Ｌ，ｎ＝ＩＮＴ（（Ｌ−Φ（Ｎ−１））／Ｔ（Ｎ−１））．．．（６）The pitch peak position predictor 1805 calculates the pitch peak position in the subframe immediately before input from the delay unit 1803, the pitch period in the subframe immediately before input from the delay unit 1804, and the pitch in the current subframe. With the cycle L as an input, a pitch peak position in the current subframe is predicted, and the predicted pitch peak position is output to the determiner 1806. The predicted pitch peak position is obtained by Expression (6) (see FIG. 19). Φ (N) = Φ (N−1) + n × T (N−1) + T (N) −L, n = INT ((L−Φ (N−1)) / T (N−1)). . . (6)

【０１４３】上式において、Φ（ｋ）は第ｋ番目のサブ
フレームにおける最初のピッチピーク位置をそのサブフ
レームの先頭を０として表したものであり、Ｔ（ｋ）は
第ｋ番目のサブフレームにおける音源（音声）信号のピ
ッチ周期であり、Ｌはサブフレーム長である。また、ｎ
は第ｋ番目のサブフレームにおける最初のピッチピーク
位置（Φ（ｋ））から第ｋ番目のサブフレームの最後の
までの間にいくつのピッチ周期長が含まれるか（小数点
以下切り捨て）を示す整数値である。（ｋ＝０，１，
２，…）In the above equation, Φ (k) represents the first pitch peak position in the k-th subframe with the beginning of the subframe being 0, and T (k) is the k-th subframe. Is a pitch cycle of a sound source (sound) signal in L, and L is a subframe length. Also, n
Is an integer indicating how many pitch period lengths are included from the first pitch peak position (Φ (k)) in the k-th subframe to the end of the k-th subframe (rounded down to the decimal point). It is a numerical value. (K = 0,1,
2, ...)

【０１４４】判定器１８０６は、ピッチピーク位置算出
器１８０２から出力されたピッチピーク位置とピッチピ
ーク位置予測器１８０５から出力された予測ピッチピー
ク位置とを入力とし、ピッチピーク位置が予測ピッチピ
ーク位置と大きくかけ離れていない場合は位相が連続し
ていると判定し、ピッチピーク位置が予測ピッチピーク
位置と大きく異なる場合は位相が連続していないと判定
する。そして、判定結果をスイッチ１８０８に出力す
る。なお、ピッチピーク位置を予測ピッチピーク位置と
比較する際、ピッチピーク位置または予測ピッチピーク
位置がサブフレーム境界付近に存在する場合は、１ピッ
チ周期後の位置がピッチピーク位置である可能性も考慮
して、ピッチピーク位置と予測ピッチピーク位置の比較
を行って位相の連続性の判定を行う。The determiner 1806 receives the pitch peak position output from the pitch peak position calculator 1802 and the predicted pitch peak position output from the pitch peak position predictor 1805 as inputs, and determines the pitch peak position as the predicted pitch peak position. If not significantly different, it is determined that the phases are continuous. If the pitch peak position is significantly different from the predicted pitch peak position, it is determined that the phases are not continuous. Then, the determination result is output to the switch 1808. When comparing the pitch peak position with the predicted pitch peak position, when the pitch peak position or the predicted pitch peak position exists near the subframe boundary, the possibility that the position one pitch cycle later is the pitch peak position is also considered. Then, the continuity of the phase is determined by comparing the pitch peak position with the predicted pitch peak position.

【０１４５】探索位置算出器１８０７は、ピッチピーク
位置を基準として音源パルスの探索位置を決定し、探索
位置をスイッチ１８０８を介してパルス位置探索器１８
０９に出力する。探索位置の決定法としては、例えば実
施の形態６や実施の形態８に示したようにピッチピーク
近傍は密にそれ以外の部分は疎に探索位置が分布するよ
うに決定される。なお、実施の形態６や実施の形態８に
示したようにピッチ周期情報を用いて音源パルス数を変
化させたり、音源パルスの探索範囲を限定したりするこ
とを適用することも有効である。The search position calculator 1807 determines the search position of the sound source pulse based on the pitch peak position, and sets the search position via the switch 1808 to the pulse position searcher 18.
09 is output. As a method of determining the search position, for example, as described in the sixth and eighth embodiments, the search position is determined so that the vicinity of the pitch peak is densely distributed and the other portions are sparsely distributed. It is also effective to change the number of sound source pulses using the pitch period information or limit the search range of the sound source pulses as described in the sixth and eighth embodiments.

【０１４６】スイッチ１８０８は、判定器１８０６の判
定結果に基づいて位相適応型の音源パルス探索を行う
か、固定位置による音源パルス探索（または一般の雑音
符号帳探索）を行うかを切り替えるものである。即ち、
判定器１８０６の判定結果が、「位相の連続性あり」の
場合は探索位置算出器１８０７とパルス位置探索器１８
０９を接続して、探索位置算出器１８０７によって算出
された音源パルス探索位置をパルス位置探索器１８０９
に入力させる（つまり、位相適応型の音源パルス探索を
行わせる）。反対に、判定器１８０６の判定結果が、
「位相の連続性なし」の場合は固定探索位置をパルス位
置探索器１８０９に入力させるように切り替わる（一般
の雑音符号帳探索と切り替える場合は別途雑音符号帳探
索器を備える構成とし、パルス位置探索器１８０９と切
り替えて用いる構成にする）。The switch 1808 switches between performing a phase-adaptive excitation pulse search or a fixed-position excitation pulse search (or a general noise codebook search) based on the determination result of the determiner 1806. . That is,
When the determination result of the determiner 1806 is “with phase continuity”, the search position calculator 1807 and the pulse position searcher 18
09 and connects the sound source pulse search position calculated by the search position calculator 1807 to the pulse position searcher 1809.
(That is, a phase-adaptive sound source pulse search is performed). Conversely, the determination result of the determiner 1806 is
In the case of "no phase continuity", switching is performed so that the fixed search position is input to the pulse position searcher 1809 (when switching from the general noise codebook search, a configuration is provided with a separate noise codebook searcher. To be used by switching to the device 1809).

【０１４７】パルス位置探索器１８０９は、探索位置算
出器１８０７で決定された音源パルス探索位置または予
め決められている固定探索位置と、別途入力されるピッ
チ周期Ｌを用いて、音源パルスを立てる位置の最適な組
み合わせを決定する。パルス探索の方法は「ITU-T Reco
mmendation G.729: Coding of Speech at 8 kbits/susi
ng Conjugate-Structure Algebraic-Code-Excited Line
ar-Prediction (CS-ACELP), March 1996 」に示されて
いるように、例えばパルス数が４本の場合は実施の形態
６で示した式（２）を最大化するようにｉ０からｉ３の
組み合わせを決定する。なお、この時の各音源パルスの
極性は、雑音符号帳成分のターゲットベクトル、即ち聴
覚重みづけされた入力音声から聴覚重みづけ合成フィル
タの零入力応答信号と適応符号帳成分の信号を減じた信
号ベクトル、の各位置における極性と等しくなるように
パルス位置探索を行う前に予め決定している。また、ピ
ッチ周期がサブフレーム長より短い場合には実施の形態
５にも示したようにピッチ周期化フィルタをかけること
によって、音源パルスをインパルスではなくピッチ周期
のパルス列になるようにしている。このようなピッチ周
期化処理を行う場合は聴覚重みづけ合成フィルタのイン
パルス応答ベクトルにピッチ周期化フィルタを予めかけ
ておけば、ピッチ周期化を行わない場合と同様にして式
（２）の最大化によって音源パルスの探索を行うことが
できる。このようにして決定された各音源パルスの位置
に、決定された各音源パルスの極性にしたがってパルス
を立て、ピッチ周期Ｌを用いてピッチ周期化フィルタを
かければ、パルス音源ベクトルが生成される。生成され
たパルス音源ベクトルは乗算器１８１２に出力される。
パルス位置探索器１８０９から乗算器１８１２に出力さ
れたパルス音源ベクトルは、外部のゲイン量子化器によ
って量子化された量子化パルス音源ベクトル利得が乗算
されて加算器１８１１に出力される。The pulse position searcher 1809 uses the sound source pulse search position determined by the search position calculator 1807 or a predetermined fixed search position, and a position at which a sound source pulse is to be formed using the pitch period L input separately. To determine the optimal combination of See ITU-T Reco
mmendation G.729: Coding of Speech at 8 kbits / susi
ng Conjugate-Structure Algebraic-Code-Excited Line
ar-Prediction (CS-ACELP), March 1996 ”, for example, when the number of pulses is four, the values of i0 to i3 are set so as to maximize the expression (2) shown in the sixth embodiment. Determine the combination. Note that the polarity of each excitation pulse at this time is the target vector of the noise codebook component, that is, a signal obtained by subtracting the zero input response signal of the auditory weighting synthesis filter and the signal of the adaptive codebook component from the auditory weighted input speech. Is determined before performing the pulse position search so as to be equal in polarity at each position of the vector. When the pitch period is shorter than the subframe length, a pitch period filter is applied as described in the fifth embodiment, so that the excitation pulse is not an impulse but a pulse train of the pitch period. In the case of performing such a pitch periodization process, if the pitch periodization filter is applied to the impulse response vector of the auditory weighting synthesis filter in advance, the maximization of Expression (2) is performed in the same manner as the case where the pitch periodization is not performed. Thus, a search for a sound source pulse can be performed. A pulse is generated at the position of each sound source pulse determined in this way according to the polarity of each determined sound source pulse, and if a pitch period filter is applied using the pitch period L, a pulse sound source vector is generated. The generated pulse excitation vector is output to multiplier 1812.
The pulse excitation vector output from the pulse position searcher 1809 to the multiplier 1812 is multiplied by a quantized pulse excitation vector gain quantized by an external gain quantizer and output to the adder 1811.

【０１４８】加算器１８１１は、乗算器１８１０から出
力された適応符号ベクトル成分と、乗算器１８１２から
出力されたパルス音源ベクトル成分とのベクトル加算を
行い、励振音源ベクトルとして出力する。The adder 1811 performs vector addition of the adaptive code vector component output from the multiplier 1810 and the pulse excitation vector component output from the multiplier 1812, and outputs the result as an excitation excitation vector.

【０１４９】なお、本発明による音声符号化装置におい
ては、有声定常部以外の部分では、固定探索位置が選択
され続ける状態が生じ易いので、伝送路誤りの影響が伝
播している場合にはリセットをかける効果を得ることも
できる。（ピッチピーク位置を０とする相対位置でパル
ス位置を表現する場合は、一度伝送路誤りが生じて符号
器側と復号器側の適応符号帳の内容が大きく異なってし
まうと、後続のフレームにおいて伝送路誤りがなくても
ピッチピーク位置が符号器側と復号器側で一致しなくな
り続ける現象が発生することがあり、誤りの影響を長く
引きずることになる。）In the speech coding apparatus according to the present invention, a state where the fixed search position is continuously selected is apt to occur in portions other than the voiced stationary portion, so that the reset is performed when the influence of the transmission path error propagates. Can be obtained. (In the case where the pulse position is expressed by a relative position where the pitch peak position is 0, if a transmission path error occurs once and the contents of the adaptive codebooks on the encoder side and the decoder side greatly differ, the following frame is used. Even if there is no transmission path error, a phenomenon may occur in which the pitch peak position continues to be inconsistent between the encoder side and the decoder side, and the effect of the error is dragged for a long time.

【０１５０】また、パルスの立て方としては、定数本例
えば４本のパルスを探索範囲、例えば３２箇所の位置の
どこかに立てる場合においては、前述のように３２箇所
を４つに分けて１本のパルスを割り当てられた８箇所の
中の１箇所に決定するように全ての組み合わせ（８×８
×８×８通り）を探索する方法の他に、３２箇所の中か
ら４箇所を選びだす組み合わせ全てについて探索する方
法などがある。なお、振幅１のインパルスの組み合わせ
の他に、複数本例えば２本のパルスを組み合わせたパル
ス対の組み合わせや、振幅の異なるインパルスの組み合
わせによるパルスの立て方も可能である。As for the method of setting the pulses, when a fixed number of pulses, for example, four pulses are set in a search range, for example, somewhere in 32 positions, the 32 positions are divided into four as described above, and one pulse is set. All of the combinations (8 × 8
In addition to the method of searching (× 8 × 8 ways), there is a method of searching for all the combinations that select four places from 32 places. In addition to the combination of the impulses having the amplitude of 1, a combination of a plurality of pulses, for example, two pulses, or a combination of impulses having different amplitudes may be used.

【０１５１】（実施の形態１１）図２０は本発明の第１
１の実施の形態を示し、適応符号ベクトルの形状に強い
パルス性が存在するか否かで、位相適応処理を行うか行
わないかの切り替えを行うＣＥＬＰ型音声符号化装置の
音源生成部を示している。図２０において、２００１は
適応符号ベクトルをピッチピーク位置算出器２００２と
パルス性判定器２００３と乗算器２００７に出力する適
応符号帳、２００２は適応符号帳２００１から出力され
た適応符号ベクトルとピッチ周期Ｌとを入力として、適
応符号ベクトル内のピッチピーク位置をパルス性判定器
２００３と探索位置算出器２００４とに出力するピッチ
ピーク位置算出器、２００３は適応符号帳２００１から
出力された適応符号ベクトルとピッチピーク位置算出器
２００２から出力されたピッチピーク位置と外部から入
力するピッチ周期Ｌを入力として、適応符号ベクトルに
よいパルス性が存在するか否かを判定し、判定結果をス
イッチ２００５に出力するパルス性判定器、２００４は
外部から入力するピッチ周期Ｌとピッチピーク位置算出
器２００２から出力されるピッチピーク位置を入力とし
て、音源パルスの探索位置をスイッチ２００５を介して
パルス位置探索器２００６に出力する探索位置算出器、
２００５はパルス性判定器２００３から出力された判定
結果に基づいて切り替わるスイッチで、探索位置算出器
２００４から出力された探索位置と予め定められている
固定探索位置との切り替えに用いられる。２００６はス
イッチ２００５を介して探索位置算出器２００４から入
力される音源パルスの探索位置またはスイッチ２００５
を介して入力される固定探索位置と外部から入力される
ピッチ周期Ｌをそれぞれ入力とし、入力された音源パル
ス探索位置とピッチ周期Ｌを用いて音源パルスの位置を
探索し、パルス音源ベクトルを乗算器２００９に出力す
るパルス位置探索器、２００７は適応符号帳２００１か
ら出力された適応符号ベクトルを入力として量子化適応
符号ベクトル利得を乗じて加算器２００８に出力する乗
算器、２００９はパルス位置探索器２００６から出力さ
れるパルス音源ベクトルを入力として量子化パルス音源
ベクトル利得を乗じて加算器２００８に出力する乗算
器、２００８は乗算器２００７および２００９から出力
されたベクトルをそれぞれ入力とし、入力されたベクト
ルの加算を行い、励振音源ベクトルとして出力する加算
器である。(Embodiment 11) FIG. 20 shows a first embodiment of the present invention.
1 shows an embodiment, and shows a sound source generation unit of a CELP-type speech coding apparatus that switches between performing and not performing phase adaptation processing depending on whether or not a strong pulse property exists in the shape of an adaptive code vector. ing. In FIG. 20, reference numeral 2001 denotes an adaptive codebook that outputs an adaptive code vector to a pitch peak position calculator 2002, a pulse determination unit 2003, and a multiplier 2007. 2002 denotes an adaptive code vector output from the adaptive codebook 2001 and a pitch period L. And a pitch peak position calculator that outputs the pitch peak position in the adaptive code vector to the pulse property determiner 2003 and the search position calculator 2004, and the adaptive code vector and the adaptive code vector 2003 output from the adaptive codebook 2001. Using the pitch peak position output from the peak position calculator 2002 and the pitch period L input from the outside as inputs, it is determined whether or not the adaptive code vector has a good pulse property, and the determination result is output to the switch 2005. The sex determiner 2004 has a pitch period L and a pitch peak that are input from the outside. As input pitch peak position output from the position calculator 2002, search position calculator which outputs the search position of the sound source pulse in the pulse position searcher 2006 through the switch 2005,
A switch 2005 is switched based on the determination result output from the pulse determination unit 2003, and is used to switch between the search position output from the search position calculator 2004 and a predetermined fixed search position. Reference numeral 2006 denotes a search position of a sound source pulse input from the search position calculator 2004 via the switch 2005 or a switch 2005.
A fixed search position input through the input unit and a pitch period L input from the outside are input, and the input sound source pulse search position and the pitch period L are used to search for the position of the sound source pulse, and the pulse sound source vector is multiplied. A pulse position searcher that outputs to the adder 2009, a multiplier that receives the adaptive code vector output from the adaptive codebook 2001 as an input, multiplies the quantized adaptive code vector gain, and outputs the result to the adder 2008, and 2009 a pulse position searcher A multiplier that receives the pulse excitation vector output from 2006 as an input, multiplies the quantized pulse excitation vector gain, and outputs the result to the adder 2008. The input 2008 receives the vectors output from the multipliers 2007 and 2009, respectively, and receives the input vector. Is an adder that performs the addition and outputs as an excitation sound source vector.

【０１５２】以上のように構成された音声符号化装置の
音源生成部について、図２０を用いてその動作を説明す
る。適応符号帳２００１は、過去の励振音源のバッファ
により構成され、外部のピッチ分析または適応符号帳探
索手段によって求められたピッチ周期またはピッチラグ
に基づいて励振音源のバッファから該当する部分を取り
出し、適応符号ベクトルとしてピッチピーク位置算出器
２００２およびパルス性判定器２００３および乗算器２
００７に出力する。適応符号帳２００１から乗算器２０
０７に出力された適応符号ベクトルは、外部のゲイン量
子化器によって量子化された量子化適応符号ベクトル利
得が乗算されて加算器２００８に出力される。The operation of the sound source generating section of the speech coding apparatus configured as described above will be described with reference to FIG. The adaptive codebook 2001 includes a buffer of a past excitation source, extracts a corresponding portion from a buffer of the excitation source based on a pitch period or pitch lag obtained by an external pitch analysis or an adaptive codebook search means, and extracts an adaptive code. As a vector, a pitch peak position calculator 2002, a pulse property determiner 2003, and a multiplier 2
007. Multiplier 20 from adaptive codebook 2001
The adaptive code vector output to 07 is multiplied by a quantized adaptive code vector gain quantized by an external gain quantizer and output to an adder 2008.

【０１５３】ピッチピーク位置算出器２００２は、適応
符号ベクトルからピッチピークを検出し、その位置をパ
ルス性判定器２００３と探索位置算出器２００４のそれ
ぞれに出力する。ピッチピーク位置の検出（算出）は、
ピッチ周期Ｌで並べたインパルス列ベクトルと適応符号
ベクトルの正規化相互相関関数を最大化することによっ
て行うことができる。また、ピッチ周期Ｌで並べたイン
パルス列ベクトルに合成フィルタのインパルス応答を畳
み込んだベクトルと、適応符号ベクトルに合成フィルタ
のインパルス応答を畳み込んだベクトルとの正規化相互
相関関数を最大化することによって、より精度良くピッ
チピーク位置の検出を行うことも可能である。さらに、
検出されたピッチピーク位置を含む１ピッチ周期波形の
中から振幅値最大となる位置をピッチピークとする後処
理を加えれば、１ピッチ周期波形内のセカンドピークを
誤検出することを回避することも可能である。The pitch peak position calculator 2002 detects a pitch peak from the adaptive code vector, and outputs the position to each of the pulse determination unit 2003 and the search position calculator 2004. The detection (calculation) of the pitch peak position
This can be performed by maximizing the normalized cross-correlation function between the impulse train vector and the adaptive code vector arranged in the pitch period L. Also, maximizing a normalized cross-correlation function between a vector obtained by convolving the impulse response of the synthesis filter with the impulse train vector arranged in the pitch period L and a vector obtained by convolving the impulse response of the synthesis filter with the adaptive code vector. Accordingly, the pitch peak position can be detected with higher accuracy. further,
If post-processing is performed to set the position where the amplitude value becomes maximum from the one-pitch periodic waveform including the detected pitch-peak position as the pitch peak, it is possible to avoid erroneous detection of the second peak in the one-pitch periodic waveform. It is possible.

【０１５４】パルス性判定器２００３は、ピッチピーク
位置算出器２００２で算出されたピッチピーク位置付近
に適応符号ベクトルの信号パワーが集中しているかどう
かを判定し、信号パワーの集中がある場合は「パルス性
あり」の判定結果をスイッチ２００５に出力、信号パワ
ーの集中が見られない場合は「パルス性なし」の判定結
果をスイッチ２００５に出力する。信号パワーが集中し
ているかどうかを調べる手法としては、例えば以下のよ
うな方法が考えられる。まず、ピッチピーク位置を含む
１ピッチ周期長の適応符号ベクトルを切り出し、切り出
した信号全体のパワーを算出しこれをＰＷ０とする。次
に、ピッチピーク位置の近傍の２分の１から３分の１ピ
ッチ長の適応符号ベクトルを切り出し、この切り出した
信号パワーを算出しこれをＰＷ１とする。ＰＷ１／ＰＷ
０の値が所定の値（例えば０．５から０．６程度）以上
である場合は、ピッチピーク近傍に信号パワーが集中し
ているので、パルス性が高いを判定することができる。
また、別の判定方法としては、ピッチピーク位置に最初
のインパルスが立つピッチ周期間隔で並べたインパルス
列ベクトルで適応符号ベクトルを近似した場合の、イン
パルス列ベクトルと適応符号ベクトルのとの誤差を用い
た判定方法がある。さらに、ピッチ周期Ｌで並べたイン
パルス列ベクトルに合成フィルタのインパルス応答を畳
み込んだベクトルと、適応符号ベクトルに合成フィルタ
のインパルス応答を畳み込んだベクトルとの正規化相互
相関関数を最大化することによってピッチピーク位置を
求めた場合には、ピッチ周期Ｌで並べたインパルス列ベ
クトルに合成フィルタのインパルス応答を畳み込んだベ
クトルと、適応符号ベクトルに合成フィルタのインパル
ス応答を畳み込んだベクトルとの誤差を用いた判定方法
がある。これらベクトル間の誤差を評価する手段として
は、式（７）に示すような予測ゲインや式（８）に示す
ような正規化相互相関関数などをを利用する。式（７）
および（８）において、x(n)は適応符号ベクトルまたは
適応符号ベクトルに合成フィルタのインパルス応答を畳
み込んだベクトル、y(n)はインパルス列ベクトルまたは
インパルス列ベクトルに合成フィルタのインパルス応答
を畳み込んだベクトルである。どちらの式においても値
が例えば０．３〜０．４以上あれば、ある程度強いパル
ス性が適応符号ベクトルに存在していると判定できる。The pulse property determiner 2003 determines whether or not the signal power of the adaptive code vector is concentrated near the pitch peak position calculated by the pitch peak position calculator 2002. The determination result of "with pulse property" is output to the switch 2005, and the determination result of "no pulse property" is output to the switch 2005 when no signal power concentration is observed. As a method for checking whether or not the signal power is concentrated, for example, the following method can be considered. First, an adaptive code vector having one pitch cycle length including a pitch peak position is cut out, the power of the whole cut out signal is calculated, and this is set to PW0. Next, an adaptive code vector having a length of one-half to one-third of the pitch in the vicinity of the pitch peak position is cut out, and the cut-out signal power is calculated to be PW1. PW1 / PW
When the value of 0 is equal to or more than a predetermined value (for example, about 0.5 to 0.6), since the signal power is concentrated near the pitch peak, it is possible to determine that the pulse property is high.
As another determination method, an error between the impulse sequence vector and the adaptive code vector when the adaptive code vector is approximated by an impulse sequence vector arranged at a pitch cycle interval in which the first impulse is at a pitch peak position is used. There is a judgment method that was used. Further, maximizing a normalized cross-correlation function between a vector obtained by convolving the impulse response of the synthesis filter with the impulse train vector arranged at the pitch period L and a vector obtained by convolving the impulse response of the synthesis filter with the adaptive code vector. When the pitch peak position is obtained by the above, an error between a vector obtained by convolving the impulse response of the synthesis filter with the impulse train vector arranged in the pitch period L and a vector obtained by convolving the impulse response of the synthesis filter with the adaptive code vector is obtained. There is a determination method using As means for evaluating the error between these vectors, a prediction gain as shown in equation (7), a normalized cross-correlation function as shown in equation (8), and the like are used. Equation (7)
In (8) and (8), x (n) is the adaptive code vector or a vector obtained by convolving the adaptive code vector with the impulse response of the synthesis filter, and y (n) is the impulse train vector or the impulse response of the synthesis filter convolved with the impulse train vector. This is the embedded vector. In either case, if the value is, for example, 0.3 to 0.4 or more, it can be determined that the adaptive code vector has a somewhat strong pulse property.

【数３】．．．（７）(Equation 3) . . . (7)

【数４】．．．（８）(Equation 4) . . . (8)

【０１５５】探索位置算出器２００４は、ピッチピーク
位置を基準として音源パルスの探索位置を決定し、探索
位置をスイッチ２００５を介してパルス位置探索器２０
０６に出力する。探索位置の決定法としては、例えば実
施の形態６や実施の形態８に示したようにピッチピーク
近傍は密にそれ以外の部分は疎に探索位置が分布するよ
うに決定される。なお、実施の形態６や実施の形態８に
示したようにピッチ周期情報を用いて音源パルス数を変
化させたり、音源パルスの探索範囲を限定したりするこ
とを適用することも有効である。The search position calculator 2004 determines the search position of the sound source pulse with reference to the pitch peak position, and sets the search position to the pulse position searcher 20 via the switch 2005.
06 is output. As a method of determining the search position, for example, as described in the sixth and eighth embodiments, the search position is determined so that the vicinity of the pitch peak is densely distributed and the other portions are sparsely distributed. It is also effective to change the number of sound source pulses using the pitch period information or limit the search range of the sound source pulses as described in the sixth and eighth embodiments.

【０１５６】スイッチ２００５は、パルス性判定器２０
０３の判定結果に基づいて位相適応型の音源パルス探索
を行うか、固定位置による音源パルス探索を行うかを切
り替えるものである。即ち、パルス性判定器２００３の
判定結果が、「パルス性あり」の場合は探索位置算出器
２００４とパルス位置探索器２００６を接続して、探索
位置算出器２００４によって算出された音源パルス探索
位置をパルス位置探索器２００６に入力させる（つま
り、位相適応型の音源パルス探索を行わせる）。反対
に、パルス性判定器２００３の判定結果が、「パルス性
なし」の場合は固定探索位置をパルス位置探索器２００
６に入力させるように切り替わる。The switch 2005 is connected to the pulse
It is to switch between performing a phase-adaptive sound source pulse search or performing a sound source pulse search at a fixed position based on the determination result of 03. That is, when the determination result of the pulse property determiner 2003 is “with pulse property”, the search position calculator 2004 and the pulse position searcher 2006 are connected, and the sound source pulse search position calculated by the search position calculator 2004 is determined. The signal is input to the pulse position search unit 2006 (that is, a phase-adaptive sound source pulse search is performed). Conversely, when the determination result of the pulse property determination unit 2003 is “no pulse property”, the fixed search position is set to the pulse position search unit 200.
6 is switched to input.

【０１５７】パルス位置探索器２００６は、探索位置算
出器２００４で決定された音源パルス探索位置または予
め決められている固定探索位置と、別途入力されるピッ
チ周期Ｌを用いて、音源パルスを立てる位置の最適な組
み合わせを決定する。パルス探索の方法は「ITU-T Reco
mmendation G.729: Coding of Speech at 8 kbits/susi
ng Conjugate-Structure Algebraic-Code-Excited Line
ar-Prediction (CS-ACELP), March 1996 」に示されて
いるように、例えばパルス数が４本の場合は実施の形態
６で示した式（２）を最大化するようにｉ０からｉ３の
組み合わせを決定する。なお、この時の各音源パルスの
極性は、雑音符号帳成分のターゲットベクトル、即ち聴
覚重みづけされた入力音声から聴覚重みづけ合成フィル
タの零入力応答信号と適応符号帳成分の信号を減じた信
号ベクトル、の各位置における極性と等しくなるように
パルス位置探索を行う前に予め決定している。また、ピ
ッチ周期がサブフレーム長より短い場合には実施の形態
５にも示したようにピッチ周期化フィルタをかけること
によって、音源パルスをインパルスではなくピッチ周期
のパルス列になるようにしている。このようなピッチ周
期化処理を行う場合は、聴覚重みづけ合成フィルタのイ
ンパルス応答ベクトルにピッチ周期化フィルタを予めか
けておけば、ピッチ周期化を行わない場合と同様にして
式（２）の最大化によって音源パルスの探索を行うこと
ができる。このようにして決定された各音源パルスの位
置に、決定された各音源パルスの極性にしたがってパル
スを立て、ピッチ周期Ｌを用いてピッチ周期化フィルタ
をかければ、パルス音源ベクトルが生成される。生成さ
れたパルス音源ベクトルは乗算器２００９に出力され
る。パルス位置探索器２００６から乗算器２００９に出
力されたパルス音源ベクトルは、外部のゲイン量子化器
によって量子化された量子化パルス音源ベクトル利得が
乗算されて加算器２００８に出力される。The pulse position searcher 2006 uses the sound source pulse search position determined by the search position calculator 2004 or a predetermined fixed search position, and a position at which a sound source pulse is formed using a pitch period L separately input. To determine the optimal combination of See ITU-T Reco
mmendation G.729: Coding of Speech at 8 kbits / susi
ng Conjugate-Structure Algebraic-Code-Excited Line
ar-Prediction (CS-ACELP), March 1996 ”, for example, when the number of pulses is four, the values of i0 to i3 are set so as to maximize the expression (2) shown in the sixth embodiment. Determine the combination. Note that the polarity of each excitation pulse at this time is the target vector of the noise codebook component, that is, a signal obtained by subtracting the zero input response signal of the auditory weighting synthesis filter and the signal of the adaptive codebook component from the auditory weighted input speech. Is determined before performing the pulse position search so as to be equal in polarity at each position of the vector. When the pitch period is shorter than the subframe length, a pitch period filter is applied as described in the fifth embodiment, so that the excitation pulse is not an impulse but a pulse train of the pitch period. In the case of performing such pitch periodization processing, if a pitch periodization filter is previously applied to the impulse response vector of the auditory weighting synthesis filter, the maximum value of Expression (2) can be obtained in the same manner as when pitch periodization is not performed. The search for the sound source pulse can be performed by the conversion. A pulse is generated at the position of each sound source pulse determined in this way according to the polarity of each determined sound source pulse, and if a pitch period filter is applied using the pitch period L, a pulse sound source vector is generated. The generated pulse excitation vector is output to multiplier 2009. The pulse excitation vector output from the pulse position searcher 2006 to the multiplier 2009 is multiplied by a quantized pulse excitation vector gain quantized by an external gain quantizer, and output to the adder 2008.

【０１５８】加算器２００８は、乗算器２００７から出
力された適応符号ベクトル成分と、乗算器２００９から
出力されたパルス音源ベクトル成分とのベクトル加算を
行い、励振音源ベクトルとして出力する。The adder 2008 performs vector addition of the adaptive code vector component output from the multiplier 2007 and the pulse excitation vector component output from the multiplier 2009, and outputs the result as an excitation excitation vector.

【０１５９】なお、本発明による音声符号化装置におい
ては、有声定常部以外の部分では、固定探索位置が選択
され続ける状態が生じ易いので、伝送路誤りの影響が伝
播している場合にはリセットをかける効果を得ることも
できる。（ピッチピーク位置を０とする相対位置でパル
ス位置を表現する場合は、一度伝送路誤りが生じて符号
器側と復号器側の適応符号帳の内容が大きく異なってし
まうと、後続のフレームにおいて伝送路誤りがなくても
ピッチピーク位置が符号器側と復号器側で一致しなくな
り続ける現象が発生することがあり、誤りの影響を長く
引きずることになる。）In the speech coding apparatus according to the present invention, a state where the fixed search position is continuously selected is apt to occur in portions other than the voiced stationary portion, so that the reset is performed when the influence of the transmission path error is propagated. Can be obtained. (In the case where the pulse position is expressed by a relative position where the pitch peak position is 0, if a transmission path error occurs once and the contents of the adaptive codebooks on the encoder side and the decoder side greatly differ, the following frame is used. Even if there is no transmission path error, a phenomenon may occur in which the pitch peak position continues to be inconsistent between the encoder side and the decoder side, and the effect of the error is dragged for a long time.

【０１６０】なお、パルスの立て方としては、定数本例
えば４本のパルスを探索範囲、例えば３２箇所の位置の
どこかに立てる場合においては、前述のように３２箇所
を４つに分けて１本のパルスを割り当てられた８箇所の
中の１箇所に決定するように全ての組み合わせ（８×８
×８×８通り）を探索する方法の他に、３２箇所の中か
ら４箇所を選びだす組み合わせ全てについて探索する方
法などがある。なお、振幅１のインパルスの組み合わせ
の他に、複数本例えば２本のパルスを組み合わせたパル
ス対の組み合わせや、振幅の異なるインパルスの組み合
わせによるパルスの立て方も可能である。As for the method of setting pulses, when a fixed number of pulses, for example, four pulses are set in a search range, for example, somewhere in 32 positions, the 32 positions are divided into four as described above and one pulse is set. All of the combinations (8 × 8
In addition to the method of searching (× 8 × 8 ways), there is a method of searching for all the combinations that select four places from 32 places. In addition to the combination of the impulses having the amplitude of 1, a combination of a plurality of pulses, for example, two pulses, or a combination of impulses having different amplitudes may be used.

【０１６１】（実施の形態１２）図２１は本発明の第１
２の実施の形態を示し、パルス探索位置のインデックス
を付け替えるインデックス更新手段を備え、パルス位置
の探索範囲を適応符号ベクトルのピッチ周期およびピッ
チピーク位置によって決定するＣＥＬＰ型音声符号化装
置の符号器側の音源生成部を示す。より具体的には、ピ
ッチピーク位置からの相対位置で音源パルス探索を行う
ＣＥＬＰ型音声符号化装置において、サブフレームの先
頭側から順番にパルス位置のインデックスを付けるよう
にすることによって、あるフレームにおいて発生した伝
送路誤りの影響が後続の伝送路誤りのないフレームに伝
播することを防ぐようにした音源生成部を示す。(Embodiment 12) FIG. 21 shows a first embodiment of the present invention.
2 shows the second embodiment, comprising an index updating means for changing the index of the pulse search position, and the encoder side of the CELP type speech coding apparatus for determining the search range of the pulse position by the pitch period and the pitch peak position of the adaptive code vector. Is shown. More specifically, in a CELP-type speech coding apparatus that performs excitation pulse search at a relative position from a pitch peak position, a pulse position is indexed sequentially from the head of a subframe, so that in a certain frame, FIG. 6 shows a sound source generation unit configured to prevent the influence of a generated transmission path error from propagating to a subsequent frame having no transmission path error. FIG.

【０１６２】図２１において、２１０１は過去の励振音
源ベクトルを保存し、選択された適応符号ベクトルをピ
ッチピーク位置算出器２１０２およびピッチゲイン乗算
器２１０６に出力する適応符号帳、２１０２は適応符号
帳２１０１から出力された適応符号ベクトルとピッチ周
期Ｌを入力としてピッチピーク位置を算出し、探索位置
算出器２１０３に出力するピッチピーク位置算出器、２
１０３はピッチピーク位置算出器２１０２から出力され
たピッチピーク位置とピッチ周期Ｌを入力としてパルス
音源を探索する範囲を算出し、インデックス更新手段２
１０４へ出力する探索位置算出器、２１０４は探索位置
算出器２１０３から出力された、各音源パルスの各位置
のインデックスを付け替えてパルス位置探索器２１０５
に出力するインデックス更新手段、２１０５はインデッ
クス更新手段２１０４から出力された探索位置（パルス
位置を表すインデックスが付け直されている）と、音源
生成部の外部で別途算出されたピッチ周期Ｌとを入力と
してパルス音源を探索し、パルス音源ベクトルをパルス
音源ゲイン乗算器２１０７に出力し、符号化出力として
パルス音源ベクトルを表すインデックスを音源生成部の
外部に出力するパルス位置探索器、２１０６は適応符号
帳２１０１から出力された適応符号ベクトルに適応符号
ベクトル利得を乗じて加算器２１０８に出力する乗算
器、２１０７はパルス位置探索器２１０５から出力され
たパルス音源ベクトルにパルス音源ベクトル利得を乗じ
て加算器２１０８に出力する乗算器、２１０８は乗算器
２１０６からの出力と乗算器２１０７からの出力を入力
とし、ベクトル加算して励振音源ベクトルとして出力す
る加算器である。In FIG. 21, reference numeral 2101 denotes an adaptive codebook for storing a past excitation source vector and outputting the selected adaptive code vector to a pitch peak position calculator 2102 and a pitch gain multiplier 2106. Reference numeral 2102 denotes an adaptive codebook 2101. Calculates the pitch peak position by using the adaptive code vector and pitch period L output from the as input, and outputs the pitch peak position calculator 2103 to the search position calculator 2103.
103 calculates a range in which to search for a pulse sound source using the pitch peak position and the pitch period L output from the pitch peak position calculator 2102 as inputs, and
A search position calculator 2104 for outputting to the 104 is a pulse position searcher 2105 that changes the index of each position of each sound source pulse output from the search position calculator 2103.
The index updating means 2105 outputs the search position (the index indicating the pulse position is re-assigned) output from the index updating means 2104 and the pitch period L separately calculated outside the sound source generation unit. The pulse position searcher 2106 outputs a pulse excitation vector to the pulse excitation gain multiplier 2107 and outputs an index representing the pulse excitation vector to the outside of the excitation generation unit as an encoded output. A multiplier 2107 multiplies the adaptive code vector output from 2101 by an adaptive code vector gain and outputs the result to an adder 2108. A multiplier 2107 multiplies the pulse excitation vector output from the pulse position searcher 2105 by the pulse excitation vector gain, and 2108 is the output from the multiplier 2106 Receives the output from the multiplier 2107, an adder which outputs as excitation excitation vector by vector addition.

【０１６３】以上のように構成された、音源生成部の動
作について、図２１および図２２を用いて説明する。図
２１において、適応符号帳２１０１は、音源生成部の外
部で予め算出されるピッチ周期Ｌだけ過去に溯った点か
ら、適応符号ベクトルをサブフレーム長だけ切り出し
て、適応符号ベクトルとして出力する。ピッチ周期Ｌが
サブフレーム長に満たない場合は、切り出したピッチ周
期Ｌのベクトルを、サブフレーム長に達するまで繰り返
して接続したものを適応符号ベクトルとして出力する。The operation of the sound source generator configured as described above will be described with reference to FIGS. 21 and 22. In FIG. 21, adaptive codebook 2101 cuts out an adaptive code vector by a subframe length from a point that has been advanced in the past by a pitch period L calculated in advance outside the excitation generation unit, and outputs it as an adaptive code vector. If the pitch period L is less than the subframe length, a vector obtained by repeatedly connecting the extracted pitch period L until the subframe length is reached is output as an adaptive code vector.

【０１６４】ピッチピーク位置算出器２１０２は、適応
符号帳２１０１から出力された適応符号ベクトルを用い
て適応符号ベクトル内に存在するピッチピークの位置を
決定する。ピッチピークの位置は、ピッチ周期で並べた
インパルス列と適応符号ベクトルとの正規化相互相関を
最大化することによって行うことができる。また、ピッ
チ周期で並べたインパルス列を合成フィルタに通したも
のと、適応符号ベクトルを合成フィルタに通したもとの
誤差を最小化することによって、より精度良く求めるこ
とも可能である。Pitch peak position calculator 2102 uses the adaptive code vector output from adaptive code book 2101 to determine the position of the pitch peak existing in the adaptive code vector. The position of the pitch peak can be determined by maximizing the normalized cross-correlation between the impulse train arranged in the pitch cycle and the adaptive code vector. Further, it is possible to obtain the impulse trains arranged in the pitch cycle through a synthesis filter, and to minimize the error caused by passing the adaptive code vector through the synthesis filter, thereby obtaining the error more accurately.

【０１６５】探索位置算出器２１０３は、ピッチピーク
位置を基準として音源パルスの探索位置を決定し、イン
デックス更新手段２１０４に出力する。探索位置の決定
法としては、例えば実施の形態５や実施の形態６に示し
たようにピッチピーク近傍は密にそれ以外の部分は疎に
探索位置が分布するように決定される。なお、実施の形
態６や実施の形態８に示したようにピッチ周期情報を用
いて音源パルス数を変化させたり、音源パルスの探索範
囲を限定したりすることを適用することも有効である。
探索位置算出器２１０３によって決定される具体的な探
索位置の例は図１０、図１１（ｂ）、図１１（ｃ）、図
１３に示している。例えば図１０においては、ピッチパ
ルス位置近傍は密に、それ以外の部分は疎に、パルス位
置探索範囲を限定する方法を具体的に示している。この
限定方法は、パルスが立てられる確率が高い位置がピッ
チパルス近傍に集中する統計的結果に基づいている。パ
ルス位置探索範囲を限定しない場合、有声部においては
ピッチパルス近傍にパルスが立てられる確率がその他の
部分に立てられる確率に比べて高くなる。なお、探索位
置算出器で算出されるのは、ピッチピーク位置からの相
対位置を用いた、音源パルスの探索位置であり、この時
点では、ピッチピーク位置を０とする相対位置の数値が
小さいものから順にインデックスが付けられている（図
２２参照。なお図２２ではパルス数を４本とした場合の
図１３（ａ）に対応する場合を示している。）。The search position calculator 2103 determines the search position of the excitation pulse with reference to the pitch peak position, and outputs it to the index updating means 2104. As a method of determining the search position, for example, as shown in the fifth and sixth embodiments, the search position is determined so that the vicinity of the pitch peak is densely distributed and the other portions are sparsely distributed. It is also effective to change the number of sound source pulses using the pitch period information or limit the search range of the sound source pulses as described in the sixth and eighth embodiments.
Examples of specific search positions determined by the search position calculator 2103 are shown in FIGS. 10, 11B, 11C, and 13. FIG. For example, FIG. 10 specifically shows a method of limiting the pulse position search range densely in the vicinity of the pitch pulse position and sparsely in other portions. This limiting method is based on a statistical result in which positions where the pulse is likely to be raised are concentrated near the pitch pulse. If the pulse position search range is not limited, the probability that a pulse will be raised near the pitch pulse in a voiced part will be higher than the probability that it will be raised in other parts. The search position calculator calculates the sound source pulse search position using the relative position from the pitch peak position. At this time, the numerical value of the relative position where the pitch peak position is 0 is small. (See FIG. 22. FIG. 22 shows a case corresponding to FIG. 13 (a) where the number of pulses is four.)

【０１６６】インデックス更新手段２１０４は、ピッチ
ピーク位置からの相対位置が小さいものから順番にイン
デックスが付けられている（図２２の相対位置）音源パ
ルス探索位置を、サブフレームの先頭を０とする絶対位
置に変換した後に絶対位置が小さいものから順番にイン
デックスを付け直して（図２２の絶対位置）、パルス位
置探索器２１０５へ出力する。このようにすることによ
って、伝送路誤りが生じるなどして符号器側と復号器側
とで算出されるピッチピーク位置が異なった場合におい
て、パルス位置のずれを小さくすることができる。The index updating means 2104 sets the sound source pulse search position indexed in ascending order of relative position from the pitch peak position (relative position in FIG. 22) to the absolute position where the head of the subframe is 0. After conversion into the position, the index is re-indexed in ascending order of the absolute position (absolute position in FIG. 22) and output to the pulse position searcher 2105. In this way, when the pitch peak positions calculated on the encoder side and the decoder side are different due to a transmission path error or the like, the pulse position shift can be reduced.

【０１６７】パルス位置探索器２１０５は、インデック
ス更新手段２１０４によって各探索位置を示すインデッ
クの付け直しが行われた音源パルス探索位置と、別途入
力されるピッチ周期Ｌを用いて、音源パルスを立てる位
置の最適な組み合わせを決定する。パルス探索の方法は
「ITU-T Recommendation G.729: Coding of Speech at
8 kbits/s using Conjugate-Structure Algebraic-Code
-Excited Linear-Prediction (CS-ACELP), March 1996
」に示されているように、例えばパルス数が４本の場
合は実施の形態６で示した式（２）を最大化するように
ｉ０からｉ３の組み合わせを決定する。なお、この時の
各音源パルスの極性は、雑音符号帳成分のターゲットベ
クトル、即ち聴覚重みづけされた入力音声から聴覚重み
づけ合成フィルタの零入力応答信号と適応符号帳成分の
信号を減じた信号ベクトル、の各位置における極性と等
しくなるようにパルス位置探索を行う前に予め決定すれ
ば探索のための演算量を大幅に軽減できる。また、ピッ
チ周期がサブフレーム長より短い場合には実施の形態５
にも示したようにピッチ周期化フィルタをかけることに
よって、音源パルスをインパルスではなくピッチ周期の
パルス列になるようにしている。このようなピッチ周期
化処理を行う場合は、聴覚重みづけ合成フィルタのイン
パルス応答ベクトルにピッチ周期化フィルタを予めかけ
ておけば、ピッチ周期化を行わない場合と同様にして式
（２）の最大化によって音源パルスの探索を行うことが
できる。このようにして決定された各音源パルスの位置
に、決定された各音源パルスの極性にしたがってパルス
を立て、ピッチ周期Ｌを用いてピッチ周期化フィルタを
かければ、パルス音源ベクトルが生成される。生成され
たパルス音源ベクトルは乗算器２１０７に出力される。
パルス位置探索器２１０５から乗算器２１０７に出力さ
れたパルス音源ベクトルは、外部のゲイン量子化器によ
って量子化された量子化パルス音源ベクトル利得が乗算
されて加算器２１０８に出力される。なお、パルス位置
探索器２１０５においては、パルス音源ベクトルととも
にパルス音源ベクトルを表す各音源パルスの極性および
インデックス情報が別途音源生成部の外部に出力され
る。この音源パルスの極性およびインデックス情報は符
号化器や多重化器などを通って伝送路へ出力されるデー
タ系列に変換されて伝送路へ送り出される。The pulse position search unit 2105 generates a sound source pulse using the pitch period L input separately from the sound source pulse search position where the index update unit 2104 re-indexes the search position. To determine the optimal combination of See `` ITU-T Recommendation G.729: Coding of Speech at
8 kbits / s using Conjugate-Structure Algebraic-Code
-Excited Linear-Prediction (CS-ACELP), March 1996
", For example, when the number of pulses is four, the combination of i0 to i3 is determined so as to maximize the expression (2) shown in the sixth embodiment. Note that the polarity of each excitation pulse at this time is the target vector of the noise codebook component, that is, a signal obtained by subtracting the zero input response signal of the auditory weighting synthesis filter and the signal of the adaptive codebook component from the auditory weighted input speech. If the pulse position is determined in advance before performing the pulse position search so as to be equal to the polarity at each position of the vector, the calculation amount for the search can be greatly reduced. When the pitch period is shorter than the subframe length, the fifth embodiment
By applying a pitch period filter as described above, the sound source pulse is not an impulse but a pulse train having a pitch period. In the case of performing such pitch periodization processing, if a pitch periodization filter is previously applied to the impulse response vector of the auditory weighting synthesis filter, the maximum value of Expression (2) can be obtained in the same manner as when pitch periodization is not performed. The search for the sound source pulse can be performed by the conversion. A pulse is generated at the position of each sound source pulse determined in this way according to the polarity of each determined sound source pulse, and if a pitch period filter is applied using the pitch period L, a pulse sound source vector is generated. The generated pulse excitation vector is output to multiplier 2107.
The pulse excitation vector output from the pulse position searcher 2105 to the multiplier 2107 is multiplied by a quantized pulse excitation vector gain quantized by an external gain quantizer, and output to the adder 2108. In the pulse position searcher 2105, the polarity and index information of each sound source pulse representing the pulse sound source vector together with the pulse sound source vector are separately output to the outside of the sound source generation unit. The polarity and index information of the excitation pulse are converted into a data sequence to be output to the transmission path through an encoder, a multiplexer, and the like, and sent out to the transmission path.

【０１６８】加算器２１０８は、乗算器２１０６から出
力された適応符号ベクトル成分と、乗算器２１０７から
出力されたパルス音源ベクトル成分とのベクトル加算を
行い、励振音源ベクトルとして出力する。The adder 2108 performs vector addition of the adaptive code vector component output from the multiplier 2106 and the pulse excitation vector component output from the multiplier 2107, and outputs the result as an excitation excitation vector.

【０１６９】なお、本実施の形態に基づくインデックス
の割り当て方法は、音源の位置情報が相対的な値で表現
される全ての場合に適用することが可能であり、インデ
ックスの割り当てかたのみの違いであるので、性能に全
く影響を及ぼさずに伝送路誤りの伝播を抑える効果を得
ることができる。The index allocating method according to the present embodiment can be applied to all cases where the position information of the sound source is expressed by relative values, and the difference is only in the index allocating method. Therefore, it is possible to obtain an effect of suppressing propagation of a transmission line error without affecting performance at all.

【０１７０】なお、復号器側にも符号器側と同様のイン
デックス更新手段を備える。また、パルスの立て方とし
ては、定数本例えば４本のパルスを探索範囲、例えば３
２箇所の位置のどこかに立てる場合においては、前述の
ように３２箇所を４つに分けて１本のパルスを割り当て
られた８箇所の中の１箇所に決定するように全ての組み
合わせ（８×８×８×８通り）を探索する方法の他に、
３２箇所の中から４箇所を選びだす組み合わせ全てにつ
いて探索する方法などがある。なお、振幅１のインパル
スの組み合わせの他に、複数本例えば２本のパルスを組
み合わせたパルス対の組み合わせや、振幅の異なるイン
パルスの組み合わせによるパルスの立て方も可能であ
る。It should be noted that the decoder side has the same index updating means as the encoder side. In addition, as a method of setting up a pulse, a constant number of pulses, for example, four pulses are searched in a search range, for example, three pulses.
When standing at any of the two positions, as described above, 32 combinations are divided into four and one pulse is determined as one of eight assigned positions, and all combinations (8 × 8 × 8 × 8 ways)
There is a method of searching for all combinations in which four locations are selected from 32 locations. In addition to the combination of the impulses having the amplitude of 1, a combination of a plurality of pulses, for example, two pulses, or a combination of impulses having different amplitudes may be used.

【０１７１】（実施の形態１３）図２３は本発明の第１
３の実施の形態を示し、パルス探索位置のインデックス
およびパルス番号の割り当てを行うパルス番号およびイ
ンデックスの更新手段を備えた、パルス位置の探索範囲
を適応符号ベクトルのピッチ周期およびピッチピーク位
置によって決定するＣＥＬＰ型音声符号化装置の符号器
側の音源生成部を示す。より具体的には、ピッチピーク
位置からの相対位置で音源パルス探索を行うＣＥＬＰ型
音声符号化装置において、サブフレームの先頭側から順
番にパルス位置のインデックスを付けるとともに、同じ
インデックス番号である異なる番号のパルスに対して
は、サブフレームの先頭側から順番にパルスの番号を付
ける、即ち同じインデックス番号の場合パルスの番号が
若いほどサブフレームの先頭側になるように各パルスの
番号を決めるようにすることによって、あるフレームに
おいて発生した伝送路誤りの影響が後続の伝送路誤りの
ないフレームに伝播することを防ぐようにした音源生成
部を示す。(Embodiment 13) FIG. 23 shows a first embodiment of the present invention.
3 shows the third embodiment, and includes a pulse number and index updating means for assigning a pulse search position index and a pulse number, and determines a pulse position search range based on a pitch period and a pitch peak position of an adaptive code vector. 2 shows a sound source generation unit on the encoder side of the CELP type speech encoding device. More specifically, in a CELP-type speech coding apparatus that performs excitation pulse search at a relative position from a pitch peak position, a pulse position is indexed sequentially from the head of a subframe, and different numbers having the same index number are used. For each pulse, the pulse numbers are assigned in order from the head of the subframe, that is, in the case of the same index number, the pulse number is determined so that the smaller the pulse number, the closer to the head of the subframe. 2 shows a sound source generation unit that prevents the influence of a transmission path error occurring in a certain frame from propagating to a subsequent frame without a transmission path error.

【０１７２】図２３において、２３０１は過去の励振音
源ベクトルを保存し、選択された適応符号ベクトルをピ
ッチピーク位置算出器２３０２およびピッチゲイン乗算
器２３０６に出力する適応符号帳、２３０２は適応符号
帳２３０１から出力された適応符号ベクトルとピッチ周
期Ｌを入力としてピッチピーク位置を算出し、探索位置
算出器２３０３に出力するピッチピーク位置算出器、２
３０３はピッチピーク位置算出器２３０２から出力され
たピッチピーク位置とピッチ周期Ｌを入力としてパルス
音源を探索する範囲を算出し、パルス番号およびインデ
ックスの更新手段２３０４へ出力する探索位置算出器、
２３０４は探索位置算出器２３０３から出力された、各
音源パルスの番号と各音源パルスの各位置のインデック
スを付け替えてパルス位置探索器２３０５に出力するパ
ルス番号およびインデックスの更新手段、２３０５はパ
ルス番号およびインデックスの更新手段２３０４から出
力された探索位置（パルスの番号とパルス位置を表すイ
ンデックスが付け直されている）と、音源生成部の外部
で別途算出されたピッチ周期Ｌとを入力としてパルス音
源を探索し、パルス音源ベクトルをパルス音源ゲイン乗
算器２３０７に出力し、符号化出力としてパルス音源ベ
クトルを表すインデックスを音源生成部の外部に出力す
るパルス位置探索器、２３０６は適応符号帳２３０１か
ら出力された適応符号ベクトルに適応符号ベクトル利得
を乗じて加算器２３０８に出力する乗算器、２３０７は
パルス位置探索器２３０５から出力されたパルス音源ベ
クトルにパルス音源ベクトル利得を乗じて加算器２３０
８に出力する乗算器、２３０８は乗算器２３０６からの
出力と乗算器２３０７からの出力を入力とし、ベクトル
加算して励振音源ベクトルとして出力する加算器であ
る。In FIG. 23, reference numeral 2301 denotes an adaptive codebook for storing the past excitation source vector and outputting the selected adaptive code vector to the pitch peak position calculator 2302 and pitch gain multiplier 2306. Calculates the pitch peak position using the adaptive code vector and the pitch period L output from the as input, and outputs the pitch peak position calculator 2303 to the search position calculator 2303.
303, a search position calculator that calculates a range in which to search for a pulse sound source by using the pitch peak position and the pitch period L output from the pitch peak position calculator 2302 as input, and outputs the range to the pulse number and index updating unit 2304;
A pulse number and index updating unit 2304 outputs the pulse number and index output from the search position calculator 2303 to the pulse position search unit 2305 by changing the number of each sound source pulse and the index of each position of each sound source pulse. A pulse sound source is input using the search position (the pulse number and the index indicating the pulse position are output again) output from the index updating unit 2304 and the pitch period L separately calculated outside the sound source generation unit. A pulse position searcher that searches for and outputs a pulse excitation vector to a pulse excitation gain multiplier 2307 and outputs an index representing the pulse excitation vector to the outside of the excitation generator as an encoded output. Multiplied by the adaptive code vector gain to the adaptive code vector Multiplier outputs 08, 2307 adder multiplies a pulse excitation vector gain pulse excitation vector outputted from the pulse position searcher 2305 230
8 is an adder which receives the output from the multiplier 2306 and the output from the multiplier 2307 as inputs, adds the vectors, and outputs the result as an excitation sound source vector.

【０１７３】以上のように構成された、音源生成部の動
作について、図２３および図２４を用いて説明する。図
２３において、適応符号帳２３０１は、音源生成部の外
部で予め算出されるピッチ周期Ｌだけ過去に溯った点か
ら、適応符号ベクトルをサブフレーム長だけ切り出し
て、適応符号ベクトルとして出力する。ピッチ周期Ｌが
サブフレーム長に満たない場合は、切り出したピッチ周
期Ｌのベクトルを、サブフレーム長に達するまで繰り返
して接続したものを適応符号ベクトルとして出力する。The operation of the sound source generator configured as described above will be described with reference to FIGS. In FIG. 23, adaptive codebook 2301 cuts out an adaptive code vector by a subframe length from a point that has been traced back by a pitch period L calculated in advance outside the excitation generation unit, and outputs the cutout as an adaptive code vector. If the pitch period L is less than the subframe length, a vector obtained by repeatedly connecting the extracted pitch period L until the subframe length is reached is output as an adaptive code vector.

【０１７４】ピッチピーク位置算出器２３０２は、適応
符号帳２３０１から出力された適応符号ベクトルを用い
て適応符号ベクトル内に存在するピッチピークの位置を
決定する。ピッチピークの位置は、ピッチ周期で並べた
インパルス列と適応符号ベクトルとの正規化相互相関を
最大化することによって行うことができる。また、ピッ
チ周期で並べたインパルス列を合成フィルタに通したも
のと、適応符号ベクトルを合成フィルタに通したもとの
誤差を最小化することによって、より精度良く求めるこ
とも可能である。The pitch peak position calculator 2302 determines the position of the pitch peak existing in the adaptive code vector using the adaptive code vector output from the adaptive code book 2301. The position of the pitch peak can be determined by maximizing the normalized cross-correlation between the impulse train arranged in the pitch cycle and the adaptive code vector. Further, it is possible to obtain the impulse trains arranged in the pitch cycle through a synthesis filter, and to minimize the error caused by passing the adaptive code vector through the synthesis filter, thereby obtaining the error more accurately.

【０１７５】探索位置算出器２３０３は、ピッチピーク
位置を基準として音源パルスの探索位置を決定し、パル
ス番号およびインデックスの更新手段２３０４に出力す
る。探索位置の決定法としては、例えば実施の形態６や
実施の形態８に示したようにピッチピーク近傍は密にそ
れ以外の部分は疎に探索位置が分布するように決定され
る。なお、実施の形態６や実施の形態８に示したように
ピッチ周期情報を用いて音源パルス数を変化させたり、
音源パルスの探索範囲を限定したりすることを適用する
ことも有効である。探索位置算出器２３０３によって決
定される具体的な探索位置の例は図１０、図１１
（ｂ）、図１１（ｃ）、図１３に示している。例えば図
１０においては、ピッチパルス位置近傍は密に、それ以
外の部分は疎に、パルス位置探索範囲を限定する方法を
具体的に示している。この限定方法は、パルスが立てら
れる確率が高い位置がピッチパルス近傍に集中する統計
的結果に基づいている。パルス位置探索範囲を限定しな
い場合、有声部においてはピッチパルス近傍にパルスが
立てられる確率がその他の部分に立てられる確率に比べ
て高くなる。なお、探索位置算出器で算出されるのは、
ピッチピーク位置からの相対位置を用いた、音源パルス
の探索位置であり、この時点では、ピッチピーク位置を
０とする相対位置の数値が小さいものから順にパルス番
号およびインデックスがつけられている（図２４（ｂ）
参照）。なお図２４では、パルス数を４本とした場合の
図１１（ｂ）、図１３に対応する場合を示している。図
２４（ａ）はパルス数を４本とした場合に探索位置算出
器２１０３によって決定される音源パルス探索位置を示
しており、矢印の長短、上向き下向は４種類の各音源パ
ルス探索位置を示している。また、図２４（ａ）の相対
位置はピッチピーク位置を０として−４から＋７５の数
値で各サンプル点が表されており、−４より前の点はサ
ブフレーム境界より後ろにはみ出す点を折り返すことに
より＋の数値で表現している。The search position calculator 2303 determines the search position of the excitation pulse based on the pitch peak position, and outputs it to the pulse number and index updating means 2304. As a method of determining the search position, for example, as described in the sixth and eighth embodiments, the search position is determined so that the vicinity of the pitch peak is densely distributed and the other portions are sparsely distributed. Note that, as described in Embodiments 6 and 8, the number of sound source pulses is changed using the pitch period information,
It is also effective to limit the search range of the sound source pulse. Examples of specific search positions determined by the search position calculator 2303 are shown in FIGS.
(B), FIG. 11 (c), and FIG. For example, FIG. 10 specifically shows a method of limiting the pulse position search range densely in the vicinity of the pitch pulse position and sparsely in other portions. This limiting method is based on a statistical result in which positions where the pulse is likely to be raised are concentrated near the pitch pulse. If the pulse position search range is not limited, the probability that a pulse will be raised near the pitch pulse in a voiced part will be higher than the probability that it will be raised in other parts. In addition, what is calculated by the search position calculator is
This is the search position of the sound source pulse using the relative position from the pitch peak position. At this point, the pulse number and index are assigned in ascending order of the numerical value of the relative position with the pitch peak position being 0 (see FIG. 24 (b)
reference). FIG. 24 shows a case corresponding to FIGS. 11B and 13 when the number of pulses is four. FIG. 24A shows the sound source pulse search positions determined by the search position calculator 2103 when the number of pulses is four, and the length of the arrow and the upward and downward directions indicate the four types of sound source pulse search positions. Is shown. In the relative position of FIG. 24A, each sample point is represented by a numerical value from -4 to +75, with the pitch peak position being 0, and points before -4 are turned back to points protruding beyond the subframe boundary. This is represented by the numerical value of +.

【０１７６】パルス番号およびインデックスの更新手段
２３０４は、ピッチピーク位置からの相対位置が小さい
ものから順番にインデックスがつけられている（図２４
（ｂ））音源パルス探索位置を、サブフレームの先頭を
０とする絶対位置に変換した後に絶対位置が小さいもの
から順番にパルス番号およびインデックスを付け直して
（図２４（ｃ））、パルス位置探索器２３０５へ出力す
る。このようにすることによって、伝送路誤りが生じる
などして符号器側と復号器側とで算出されるピッチピー
ク位置が異なった場合において、パルス位置のずれを小
さくすることができる。The pulse number and index updating means 2304 is indexed in ascending order of relative position from the pitch peak position (FIG. 24).
(B)) The sound source pulse search position is converted to an absolute position where the head of the subframe is 0, and then the pulse number and index are re-assigned in ascending order of the absolute position (FIG. 24 (c)). Output to searcher 2305. In this way, when the pitch peak positions calculated on the encoder side and the decoder side are different due to a transmission path error or the like, the pulse position shift can be reduced.

【０１７７】パルス位置探索器２３０５は、パルス番号
およびインデックスの更新手段２３０４によって各探索
位置を示すインデックの付け直しが行われた音源パルス
探索位置と、別途入力されるピッチ周期Ｌを用いて、音
源パルスを立てる位置の最適な組み合わせを決定する。
パルス探索の方法は「ITU-T Recommendation G.729:Cod
ing of Speech at 8 kbits/s using Conjugate-Structu
re Algebraic-Code-Excited Linear-Prediction (CS-AC
ELP), March 1996」に示されているように、例えばパ
ルス数が４本の場合は実施の形態６で示した式（２）を
最大化するようにｉ０からｉ３の組み合わせを決定す
る。なお、この時の各音源パルスの極性は、雑音符号帳
成分のターゲットベクトル、即ち聴覚重みづけされた入
力音声から聴覚重みづけ合成フィルタの零入力応答信号
と適応符号帳成分の信号を減じた信号ベクトル、の各位
置における極性と等しくなるようにパルス位置探索を行
う前に予め決定すれば探索のための演算量を大幅に軽減
できる。また、ピッチ周期がサブフレーム長より短い場
合には実施の形態５にも示したようにピッチ周期化フィ
ルタをかけることによって、音源パルスをインパルスで
はなくピッチ周期のパルス列になるようにしている。こ
のようなピッチ周期化処理を行う場合は、聴覚重みづけ
合成フィルタのインパルス応答ベクトルにピッチ周期化
フィルタを予めかけておけば、ピッチ周期化を行わない
場合と同様にして式（２）の最大化によって音源パルス
の探索を行うことができる。このようにして決定された
各音源パルスの位置に、決定された各音源パルスの極性
にしたがってパルスを立て、ピッチ周期Ｌを用いてピッ
チ周期化フィルタをかければ、パルス音源ベクトルが生
成される。生成されたパルス音源ベクトルは乗算器２３
０７に出力される。パルス位置探索器２３０５から乗算
器２３０７に出力されたパルス音源ベクトルは、外部の
ゲイン量子化器によって量子化された量子化パルス音源
ベクトル利得が乗算されて加算器２３０８に出力され
る。なお、パルス位置探索器２３０５においては、パル
ス音源ベクトルとともにパルス音源ベクトルを表す各音
源パルスの極性およびインデックス情報が別途音源生成
部の外部に出力される。この音源パルスの極性およびイ
ンデックス情報は符号化器や多重化器などを通って伝送
路へ出力されるデータ系列に変換されて伝送路へ送り出
される。The pulse position searcher 2305 uses the pulse period L input separately from the sound source pulse search position at which the index indicating the search position has been re-assigned by the pulse number and index updating means 2304, and Determine the optimal combination of pulse positions.
For the method of pulse search, see `` ITU-T Recommendation G.729: Cod
ing of Speech at 8 kbits / s using Conjugate-Structu
re Algebraic-Code-Excited Linear-Prediction (CS-AC
ELP), March 1996, for example, when the number of pulses is four, the combination of i0 to i3 is determined so as to maximize the equation (2) shown in the sixth embodiment. Note that the polarity of each excitation pulse at this time is the target vector of the noise codebook component, that is, a signal obtained by subtracting the zero input response signal of the auditory weighting synthesis filter and the signal of the adaptive codebook component from the auditory weighted input speech. If the pulse position is determined in advance before performing the pulse position search so as to be equal to the polarity at each position of the vector, the calculation amount for the search can be greatly reduced. When the pitch period is shorter than the subframe length, a pitch period filter is applied as described in the fifth embodiment, so that the excitation pulse is not an impulse but a pulse train of the pitch period. In the case of performing such pitch periodization processing, if a pitch periodization filter is previously applied to the impulse response vector of the auditory weighting synthesis filter, the maximum value of Expression (2) can be obtained in the same manner as when pitch periodization is not performed. The search for the sound source pulse can be performed by the conversion. A pulse is generated at the position of each sound source pulse determined in this way according to the polarity of each determined sound source pulse, and if a pitch period filter is applied using the pitch period L, a pulse sound source vector is generated. The generated pulse sound source vector is
07. The pulse excitation vector output from the pulse position searcher 2305 to the multiplier 2307 is multiplied by a quantized pulse excitation vector gain quantized by an external gain quantizer, and output to the adder 2308. In the pulse position searcher 2305, the polarity and index information of each excitation pulse representing the pulse excitation vector together with the pulse excitation vector are separately output outside the excitation generator. The polarity and index information of the excitation pulse are converted into a data sequence to be output to the transmission path through an encoder, a multiplexer, and the like, and sent out to the transmission path.

【０１７８】加算器２３０８は、乗算器２３０６から出
力された適応符号ベクトル成分と、乗算器２３０７から
出力されたパルス音源ベクトル成分とのベクトル加算を
行い、励振音源ベクトルとして出力する。The adder 2308 performs vector addition of the adaptive code vector component output from the multiplier 2306 and the pulse excitation vector component output from the multiplier 2307, and outputs the result as an excitation excitation vector.

【０１７９】なお、本実施の形態に基づくインデックス
の割り当て方法は、音源の位置情報が相対的な値で表現
される全ての場合に適用することが可能であり、パルス
番号とインデックスの割り当てかたのみの違いであるの
で、性能に影響を及ぼさずに伝送路誤りの伝播を抑える
効果を得ることができる。また、固定探索位置のパルス
音源との切り替え使用を行えば、さらに伝送路誤りの影
響の伝播を抑えることも可能である。The index assignment method according to the present embodiment can be applied to all cases where the position information of the sound source is expressed by relative values. Therefore, the effect of suppressing propagation of transmission path errors can be obtained without affecting performance. Further, by switching and using the pulse sound source at the fixed search position, the propagation of the influence of the transmission path error can be further suppressed.

【０１８０】なお、復号器側も同様のパルス番号および
インデックスの更新手段２３０４を備える。また、パル
スの立て方としては、定数本例えば４本のパルスを探索
範囲、例えば３２箇所の位置のどこかに立てる場合にお
いては、前述のように３２箇所を４つに分けて１本のパ
ルスを割り当てられた８箇所の中の１箇所に決定するよ
うに全ての組み合わせ（８×８×８×８通り）を探索す
る方法の他に、３２箇所の中から４箇所を選びだす組み
合わせ全てについて探索する方法などがある。なお、振
幅１のインパルスの組み合わせの他に、複数本例えば２
本のパルスを組み合わせたパルス対の組み合わせや、振
幅の異なるインパルスの組み合わせによるパルスの立て
方も可能である。The decoder side also has a similar pulse number and index updating means 2304. In addition, as a method of setting a pulse, when a fixed number of, for example, four pulses are set in a search range, for example, somewhere in 32 positions, as described above, 32 positions are divided into four and one pulse is set. In addition to the method of searching all combinations (8 × 8 × 8 × 8 ways) so as to determine one of eight locations to which is assigned, all combinations that select four locations from 32 locations There are ways to search. In addition to the combination of the impulses of the amplitude 1, a plurality
It is also possible to form a pulse by a combination of a pulse pair obtained by combining these pulses or a combination of impulses having different amplitudes.

【０１８１】（実施の形態１４）図２５は本発明の第１
４の実施の形態を示し、固定探索位置と位相適応型探索
位置との両者によって生成される音源パルス探索位置を
用いてパルス探索を行うＣＥＬＰ型音声符号化装置の音
源生成部を示す。(Embodiment 14) FIG. 25 shows a first embodiment of the present invention.
7 shows the sound source generation unit of the CELP-type speech coding apparatus that performs a pulse search using the sound source pulse search positions generated by both the fixed search position and the phase adaptive search position.

【０１８２】図２５において、２５０１は過去の励振音
源ベクトルを保存し、選択された適応符号ベクトルをピ
ッチピーク位置算出器２５０２およびピッチゲイン乗算
器２５０６に出力する適応符号帳、２５０２は適応符号
帳２５０１から出力された適応符号ベクトルと外部から
入力されるピッチ周期Ｌを入力としてピッチピーク位置
を算出し、探索位置算出器２５０３に出力するピッチピ
ーク位置算出器、２５０３はピッチピーク位置算出器２
５０２から出力されたピッチピーク位置と外部から入力
されるピッチ周期Ｌを入力としてパルス音源を探索する
位置を算出し、加算器２５０４へ出力する探索位置算出
器、２５０４は探索位置算出器２５０３から出力され
た、ピッチピーク位置を０とする相対位置で表される探
索位置と固定位置で探索される探索位置とを合わせて
（数値加算をするものではなく、２種類の探索位置の集
合の和を求める）パルス位置探索器２５０５に出力する
加算器、２５０５は加算器２５０４から出力された探索
位置と、音源生成部の外部で別途算出されたピッチ周期
Ｌとを入力としてパルス音源を探索し、パルス音源ベク
トルをパルス音源ゲイン乗算器２５０７に出力するパル
ス位置探索器、２５０６は適応符号帳２５０１から出力
された適応符号ベクトルに適応符号ベクトル利得を乗じ
て加算器２５０８に出力する乗算器、２５０７はパルス
位置探索器２５０５から出力されたパルス音源ベクトル
にパルス音源ベクトル利得を乗じて加算器２５０８に出
力する乗算器、２５０８は乗算器２５０６からの出力と
乗算器２５０７からの出力を入力とし、ベクトル加算し
て励振音源ベクトルとして出力する加算器である。In FIG. 25, reference numeral 2501 denotes an adaptive codebook for storing the past excitation source vector and outputting the selected adaptive code vector to the pitch peak position calculator 2502 and pitch gain multiplier 2506. Reference numeral 2502 denotes the adaptive codebook 2501. A pitch peak position calculator that calculates a pitch peak position by using the adaptive code vector output from the input device and a pitch period L input from the outside as inputs, and outputs a pitch peak position calculator 2503 to the search position calculator 2503.
A search position calculator that calculates a pulse sound source search position using the pitch peak position output from 502 and the pitch period L input from the outside as inputs, and outputs the calculated position to an adder 2504, which outputs from the search position calculator 2503. The search position represented by the relative position with the pitch peak position being 0 and the search position searched at the fixed position are combined (not a numerical addition, but a sum of a set of two types of search positions. The adder outputs to the pulse position searcher 2505. The search position 2505 searches for a pulse sound source using the search position output from the adder 2504 and the pitch period L separately calculated outside the sound source generator as inputs. A pulse position searcher that outputs an excitation vector to a pulse excitation gain multiplier 2507, and an adaptive code vector 2506 output from an adaptive codebook 2501 Is multiplied by an adaptive code vector gain and output to an adder 2508. A multiplier 2507 multiplies the pulse excitation vector output from the pulse position searcher 2505 by a pulse excitation vector gain and outputs the result to an adder 2508. An adder that receives an output from the multiplier 2506 and an output from the multiplier 2507 as inputs, performs vector addition, and outputs the resultant as an excitation sound source vector.

【０１８３】以上のように構成された、音源生成部の動
作について、図２５および図２６を用いて説明する。図
２５において、適応符号帳２５０１は、音源生成部の外
部で予め算出されるピッチ周期Ｌだけ過去に溯った点か
ら、適応符号ベクトルをサブフレーム長だけ切り出し
て、適応符号ベクトルとして出力する。ピッチ周期Ｌが
サブフレーム長に満たない場合は、切り出したピッチ周
期Ｌのベクトルを、サブフレーム長に達するまで繰り返
して接続したものを適応符号ベクトルとして出力する。The operation of the sound source generator configured as described above will be described with reference to FIGS. 25 and 26. In FIG. 25, adaptive codebook 2501 cuts out an adaptive code vector by a subframe length from a point that has been advanced in the past by a pitch period L calculated in advance outside the excitation generation unit, and outputs it as an adaptive code vector. If the pitch period L is less than the subframe length, a vector obtained by repeatedly connecting the extracted pitch period L until the subframe length is reached is output as an adaptive code vector.

【０１８４】ピッチピーク位置算出器２５０２は、適応
符号帳２５０１から出力された適応符号ベクトルを用い
て適応符号ベクトル内に存在するピッチピークの位置を
決定する。ピッチピークの位置は、ピッチ周期で並べた
インパルス列と適応符号ベクトルとの正規化相互相関を
最大化することによって行うことができる。また、ピッ
チ周期で並べたインパルス列を合成フィルタに通したも
のと、適応符号ベクトルを合成フィルタに通したもとの
誤差を最小化する（正規化相互相関関数を最大化する）
ことによって、より精度良く求めることも可能である。The pitch peak position calculator 2502 uses the adaptive code vector output from the adaptive code book 2501 to determine the position of the pitch peak existing in the adaptive code vector. The position of the pitch peak can be determined by maximizing the normalized cross-correlation between the impulse train arranged in the pitch cycle and the adaptive code vector. Further, an error in which the impulse train arranged in the pitch cycle is passed through the synthesis filter and an original error in passing the adaptive code vector through the synthesis filter are minimized (maximize the normalized cross-correlation function).
By doing so, it is also possible to obtain it with higher accuracy.

【０１８５】探索位置算出器２５０３は、ピッチピーク
位置を基準として音源パルスの探索位置を決定し、加算
器２５０４に出力する。探索位置の決定法としては、例
えば図２６に示すようにピッチピーク近傍の固定探索位
置と重ならない点を出力するような決定法を用いる。な
お、実施の形態６や実施の形態８に示したようにピッチ
周期情報を用いて音源パルス数を変化させたり、音源パ
ルスの探索範囲を限定したりすることを適用する場合も
同様である。探索位置算出器２５０３によって決定され
る具体的な探索位置の例は図２６（ｂ）、（ｃ）に示し
ている。図２６においては固定探索位置を奇数サンプル
点に設定し（図２６（ａ））、ピッチピーク近傍の偶数
サンプル点に探索位置算出器２５０３が探索位置を設定
する様子（図２６（ｂ）、（ｃ））を示している。図２
６（ｂ）はピッチピーク位置が偶数サンプル点にある
（ピッチピーク位置が固定探索位置に含まれない）場合
を、図２６（ｃ）はピッチピーク位置が奇数サンプル点
にある（ピッチピーク位置が固定探索位置に含まれる）
場合を、それぞれ示している。図２６（ｂ）、（ｃ）の
比較から分かるように、ピッチピーク位置の場所によっ
て若干探索位置（ピッチピーク位置を０とする相対位
置）が異なる。The search position calculator 2503 determines the search position of the excitation pulse based on the pitch peak position, and outputs it to the adder 2504. As a method of determining the search position, for example, as shown in FIG. 26, a method of outputting a point that does not overlap with the fixed search position near the pitch peak is used. The same applies to the case where the number of sound source pulses is changed using the pitch period information or the search range of the sound source pulse is limited as described in the sixth and eighth embodiments. Examples of specific search positions determined by the search position calculator 2503 are shown in FIGS. In FIG. 26, the fixed search position is set to an odd-numbered sample point (FIG. 26A), and the search position calculator 2503 sets the search position to an even-numbered sample point near the pitch peak (FIGS. 26B and 26B). c)). FIG.
6B shows a case where the pitch peak position is at an even-numbered sample point (the pitch peak position is not included in the fixed search position), and FIG. 26C shows a case where the pitch peak position is at an odd-numbered sample point (when the pitch peak position is (Included in the fixed search position)
Each case is shown. As can be seen from the comparison between FIGS. 26B and 26C, the search position (the relative position where the pitch peak position is 0) slightly differs depending on the position of the pitch peak position.

【０１８６】加算器２５０４は、探索位置算出器２５０
３から出力された音源パルス探索位置の集合（図２６
（ｂ）、（ｃ））と予め定められている固定探索位置の
集合（図２６（ａ））との和集合（図２６（ｄ））を求
めて、パルス位置探索器２５０５へ出力する。このよう
にすることによってピッチピーク位置近傍は密に、それ
以外の部分は疎に、音源パルスの探索位置を限定してい
る。この限定方法は、パルスが立てられる確率が高い位
置がピッチパルス近傍に集中する統計的結果に基づいて
いる。パルス位置探索範囲を限定しない場合、有声部に
おいてはピッチパルス近傍にパルスが立てられる確率が
その他の部分に立てられる確率に比べて高くなる。な
お、伝送路誤り等の影響で復号器側におけるピッチピー
ク位置の算出が誤った場合、探索位置算出器２５０３で
算出される音源パルスの探索位置が符号器側と復号器側
で異なってしまうが、パルス位置探索器２５０５に入力
される音源パルス探索位置の一部は固定探索位置になっ
ているので、符号器側と復号器側のパルス位置が異なっ
てしまう確率を低くすることができ、伝送路誤りの影響
を緩和することができる。The adder 2504 is provided with a search position calculator 250
A set of sound source pulse search positions output from FIG.
(B), (c)) and a predetermined set of fixed search positions (FIG. 26 (a)) are obtained and output to the pulse position searcher 2505 (FIG. 26 (d)). By doing so, the search position of the sound source pulse is limited densely in the vicinity of the pitch peak position and sparsely in other portions. This limiting method is based on a statistical result in which positions where the pulse is likely to be raised are concentrated near the pitch pulse. If the pulse position search range is not limited, the probability that a pulse will be raised near the pitch pulse in a voiced part will be higher than the probability that it will be raised in other parts. If the calculation of the pitch peak position on the decoder side is erroneous due to the influence of a transmission path error or the like, the search position of the excitation pulse calculated by the search position calculator 2503 differs between the encoder side and the decoder side. Since a part of the excitation pulse search position input to the pulse position search unit 2505 is a fixed search position, it is possible to reduce the probability that the pulse positions of the encoder side and the decoder side differ from each other. The effect of a road error can be reduced.

【０１８７】パルス位置探索器２５０５は、加算器２５
０４から出力された音源パルス探索位置と、別途入力さ
れるピッチ周期Ｌを用いて、音源パルスを立てる位置の
最適な組み合わせを決定する。パルス探索の方法は「IT
U-T Recommendation G.729:Coding of Speech at 8 kbi
ts/s using Conjugate-Structure Algebraic-Code-Exci
ted Linear-Prediction (CS- ACELP), March 1996」に
示されているように、例えばパルス数が４本の場合は実
施の形態６で示した式（２）を最大化するようにｉ０か
らｉ３の組み合わせを決定する。なお、この時の各音源
パルスの極性は、雑音符号帳成分のターゲットベクト
ル、即ち聴覚重みづけされた入力音声から聴覚重みづけ
合成フィルタの零入力応答信号と適応符号帳成分の信号
を減じた信号ベクトル、の各位置における極性と等しく
なるようにパルス位置探索を行う前に予め決定すれば探
索のための演算量を大幅に軽減できる。また、ピッチ周
期がサブフレーム長より短い場合には実施の形態５にも
示したようにピッチ周期化フィルタをかけることによっ
て、音源パルスをインパルスではなくピッチ周期のパル
ス列になるようにしている。このようなピッチ周期化処
理を行う場合は、聴覚重みづけ合成フィルタのインパル
ス応答ベクトルにピッチ周期化フィルタを予めかけてお
けば、ピッチ周期化を行わない場合と同様にして式
（２）の最大化によって音源パルスの探索を行うことが
できる。このようにして決定された各音源パルスの位置
に、決定された各音源パルスの極性にしたがってパルス
を立て、ピッチ周期Ｌを用いてピッチ周期化フィルタを
かければ、パルス音源ベクトルが生成される。生成され
たパルス音源ベクトルは乗算器２５０７に出力される。
パルス位置探索器２５０５から乗算器２５０７に出力さ
れたパルス音源ベクトルは、外部のゲイン量子化器によ
って量子化された量子化パルス音源ベクトル利得が乗算
されて加算器２５０８に出力される。なお図２５では省
略しているが、パルス位置探索器２５０５においては、
パルス音源ベクトルとともにパルス音源ベクトルを表す
各音源パルスの極性およびインデックス情報が別途音源
生成部の外部に出力される。この音源パルスの極性およ
びインデックス情報は符号化器や多重化器などを通って
伝送路へ出力されるデータ系列に変換されて伝送路へ送
り出される。[0187] The pulse position search unit 2505 includes the adder 25.
Using the sound source pulse search position output from the signal generator 04 and the pitch period L separately input, an optimal combination of the positions where the sound source pulses are set is determined. The method of pulse search is "IT
UT Recommendation G.729: Coding of Speech at 8 kbi
ts / s using Conjugate-Structure Algebraic-Code-Exci
As shown in “ted Linear-Prediction (CS-ACELP), March 1996”, for example, when the number of pulses is four, i0 to i3 are used to maximize the equation (2) shown in the sixth embodiment. Is determined. Note that the polarity of each excitation pulse at this time is the target vector of the noise codebook component, that is, a signal obtained by subtracting the zero input response signal of the auditory weighting synthesis filter and the signal of the adaptive codebook component from the auditory weighted input speech. If the pulse position is determined in advance before performing the pulse position search so as to be equal to the polarity at each position of the vector, the calculation amount for the search can be greatly reduced. When the pitch period is shorter than the subframe length, a pitch period filter is applied as described in the fifth embodiment, so that the excitation pulse is not an impulse but a pulse train of the pitch period. In the case of performing such pitch periodization processing, if a pitch periodization filter is previously applied to the impulse response vector of the auditory weighting synthesis filter, the maximum value of Expression (2) can be obtained in the same manner as when pitch periodization is not performed. The search for the sound source pulse can be performed by the conversion. A pulse is generated at the position of each sound source pulse determined in this way according to the polarity of each determined sound source pulse, and if a pitch period filter is applied using the pitch period L, a pulse sound source vector is generated. The generated pulse excitation vector is output to multiplier 2507.
The pulse excitation vector output from the pulse position searcher 2505 to the multiplier 2507 is multiplied by the quantized pulse excitation vector gain quantized by the external gain quantizer, and output to the adder 2508. Although omitted in FIG. 25, in the pulse position searcher 2505,
The polarity and index information of each excitation pulse representing the pulse excitation vector together with the pulse excitation vector are separately output to the outside of the excitation generator. The polarity and index information of the excitation pulse are converted into a data sequence to be output to the transmission path through an encoder, a multiplexer, and the like, and sent out to the transmission path.

【０１８８】加算器２５０８は、乗算器２５０６から出
力された適応符号ベクトル成分と、乗算器２５０７から
出力されたパルス音源ベクトル成分とのベクトル加算を
行い、励振音源ベクトルとして出力する。Adder 2508 performs vector addition of the adaptive code vector component output from multiplier 2506 and the pulse excitation vector component output from multiplier 2507, and outputs the result as an excitation excitation vector.

【０１８９】なお、固定探索位置のパルス音源との切り
替え使用を行えば、さらに伝送路誤りの影響の伝播を抑
えることも可能である。[0189] It is possible to further suppress the propagation of the effect of the transmission path error by switching and using the pulse sound source at the fixed search position.

【０１９０】また、パルスの立て方としては、定数本例
えば４本のパルスを探索範囲、例えば３２箇所の位置の
どこかに立てる場合においては、前述のように３２箇所
を４つに分けて１本のパルスを割り当てられた８箇所の
中の１箇所に決定するように全ての組み合わせ（８×８
×８×８通り）を探索する方法の他に、３２箇所の中か
ら４箇所を選びだす組み合わせ全てについて探索する方
法などがある。なお、振幅１のインパルスの組み合わせ
の他に、複数本例えば２本のパルスを組み合わせたパル
ス対の組み合わせや、振幅の異なるインパルスの組み合
わせによるパルスの立て方も可能である。As for the method of setting the pulses, when a fixed number of pulses, for example, four pulses are set in a search range, for example, somewhere in 32 positions, the 32 positions are divided into four as described above, and one pulse is set. All of the combinations (8 × 8
In addition to the method of searching (× 8 × 8 ways), there is a method of searching for all the combinations that select four places from 32 places. In addition to the combination of the impulses having the amplitude of 1, a combination of a plurality of pulses, for example, two pulses, or a combination of impulses having different amplitudes may be used.

【０１９１】（実施の形態１５）図２７は本発明の第１
５の実施の形態を示し、ピッチピーク位置補正器を備え
た実施の形態５記載のＣＥＬＰ型音声符号化装置の音源
生成部を示している。(Embodiment 15) FIG. 27 shows a first embodiment of the present invention.
15 shows an excitation generator of the CELP-type speech coding apparatus according to the fifth embodiment, which is provided with a pitch peak position corrector.

【０１９２】図２７において、２７０１は過去の励振音
源ベクトルを保存し、選択された適応符号ベクトルをピ
ッチピーク位置算出器２７０２およびピッチピーク位置
補正器２７０３およびピッチゲイン乗算器２７０６に出
力する適応符号帳、２７０２は適応符号帳２７０１から
出力された適応符号ベクトルと外部から入力されるピッ
チ周期Ｌを入力としてピッチピーク位置を算出し、ピッ
チピーク位置補正器２７０３に出力するピッチピーク位
置算出器、２７０３は適応符号帳２７０１から出力され
る適応符号ベクトルとピッチピーク位置算出器２７０２
から出力されたピッチピーク位置と外部から入力される
ピッチ周期Ｌを入力としてピッチピーク位置を補正し、
探索位置算出器２７０４へ出力するピッチピーク位置補
正器、２７０４はピッチピーク位置補正器２７０３から
出力されたピッチピーク位置と別途入力されるピッチ周
期Ｌとを入力として、音源パルスの探索位置をパルス位
置探索器２７０５に出力する探索位置算出器、２７０５
は探索位置算出器２７０４から出力された探索位置と、
音源生成部の外部で別途算出されたピッチ周期Ｌとを入
力としてパルス音源を探索し、パルス音源ベクトルをパ
ルス音源ゲイン乗算器２７０７に出力するパルス位置探
索器、２７０６は適応符号帳２７０１から出力された適
応符号ベクトルに適応符号ベクトル利得を乗じて加算器
２７０８に出力する乗算器、２７０７はパルス位置探索
器２７０５から出力されたパルス音源ベクトルにパルス
音源ベクトル利得を乗じて加算器２７０８に出力する乗
算器、２７０８は乗算器２７０６からの出力と乗算器２
７０７からの出力を入力とし、ベクトル加算して励振音
源ベクトルとして出力する加算器である。In FIG. 27, reference numeral 2701 denotes an adaptive codebook which stores the past excitation source vector and outputs the selected adaptive code vector to pitch peak position calculator 2702, pitch peak position corrector 2703, and pitch gain multiplier 2706. , 2702 calculate the pitch peak position using the adaptive code vector output from the adaptive codebook 2701 and the externally input pitch period L as inputs, and output a pitch peak position calculator to a pitch peak position corrector 2703. Adaptive code vector output from adaptive codebook 2701 and pitch peak position calculator 2702
The pitch peak position output from and the pitch period L input from the outside are input to correct the pitch peak position,
Pitch peak position corrector output to search position calculator 2704. 2704 receives as input the pitch peak position output from pitch peak position corrector 2703 and separately input pitch period L, and sets the search position of the sound source pulse to the pulse position. Search position calculator for outputting to searcher 2705, 2705
Is the search position output from the search position calculator 2704,
A pulse position searcher that searches for a pulse excitation using the pitch period L separately calculated outside the excitation generation unit as an input and outputs a pulse excitation vector to a pulse excitation gain multiplier 2707 is output from the adaptive codebook 2701. The multiplier 2707 multiplies the adaptive code vector by the adaptive code vector gain and outputs the result to the adder 2708. The multiplier 2707 multiplies the pulse excitation vector output from the pulse position searcher 2705 by the pulse excitation vector gain and outputs the result to the adder 2708. And 2708 are the output from the multiplier 2706 and the multiplier 2706.
This is an adder that receives the output from 707 as an input, adds the vector, and outputs the result as an excitation sound source vector.

【０１９３】以上のように構成された、音源生成部の動
作について、図２７および図２８を用いて説明する。図
２７において、適応符号帳２７０１は、音源生成部の外
部で予め算出されるピッチ周期Ｌだけ過去に溯った点か
ら、適応符号ベクトルをサブフレーム長だけ切り出し
て、適応符号ベクトルとして出力する。ピッチ周期Ｌが
サブフレーム長に満たない場合は、切り出したピッチ周
期Ｌのベクトルを、サブフレーム長に達するまで繰り返
して接続したものを適応符号ベクトルとして出力する。The operation of the sound source generator configured as described above will be described with reference to FIGS. 27 and 28. In FIG. 27, adaptive codebook 2701 cuts out an adaptive code vector by a subframe length from a point that has been advanced in the past by a pitch period L calculated in advance outside the excitation generation unit, and outputs it as an adaptive code vector. If the pitch period L is less than the subframe length, a vector obtained by repeatedly connecting the extracted pitch period L until the subframe length is reached is output as an adaptive code vector.

【０１９４】ピッチピーク位置算出器２７０２は、適応
符号帳２７０１から出力された適応符号ベクトルを用い
て適応符号ベクトル内に存在するピッチピークの位置を
決定する。ピッチピークの位置は、ピッチ周期で並べた
インパルス列と適応符号ベクトルとの正規化相互相関を
最大化することによって行うことができる。また、ピッ
チ周期で並べたインパルス列を合成フィルタに通したも
のと、適応符号ベクトルを合成フィルタに通したもとの
誤差を最小化する（正規化相互相関関数を最大化する）
ことによって、より精度良く求めることも可能である。Pitch peak position calculator 2702 uses the adaptive code vector output from adaptive code book 2701 to determine the position of the pitch peak existing in the adaptive code vector. The position of the pitch peak can be determined by maximizing the normalized cross-correlation between the impulse train arranged in the pitch cycle and the adaptive code vector. Further, an error in which the impulse train arranged in the pitch cycle is passed through the synthesis filter and an original error in passing the adaptive code vector through the synthesis filter are minimized (maximize the normalized cross-correlation function).
By doing so, it is also possible to obtain it with higher accuracy.

【０１９５】ピッチピーク位置補正器２７０３は、適応
符号帳２７０１から出力された適応符号ベクトルから、
ピッチピーク位置算出器２７０２によって算出されたピ
ッチピーク位置の点を含む１ピッチ周期長Ｌの長さをも
つベクトルを切り出し、この切り出した波形の中から振
幅値が最大となる点を探し出して探索位置算出器２７０
４に出力する。なお、この処理はピッチ周期Ｌがサブフ
レーム長よりも短い場合についてのみ行われる。ピッチ
周期Ｌがサブフレーム長より長い場合はピッチピーク位
置算出器２７０２が出力したピッチピーク位置をそのま
まパルス位置探索器２７０５に出力する。ピッチピーク
位置算出器２７０２から出力されるピッチピーク位置
は、１サブフレーム長が1ピッチ周期程度の長さに相当
する場合、1ピッチ波形内の2番目に振幅が高い場所にな
っている可能性がある（図２８（ａ）、（ｂ）：ピッチ
ピークは１サブフレーム内に１個所しか存在しないが、
１ピッチ周期波形内で２番目に大きい振幅値を有する点
（セカンドピーク）が１サブフレーム内に２個所存在す
るために、セカンドピークをピッチピークと誤検出して
しまう）。このため、ピッチピーク位置補正器２７０３
により、ピッチピーク位置算出器２７０２から出力され
たピッチピーク位置から1ピッチ周期長以内により大き
い振幅値を有する点が存在しないかチェックし、ピッチ
ピーク位置算出器２７０２から出力されたピッチピーク
位置付近の点の振幅値より大きい振幅値を有する点が存
在する場合は、その大きい振幅値を有する点の方をピッ
チピーク位置とする。例えば図２８（ｃ）においてセカ
ンドピークをピッチピーク位置算出器２７０２が出力し
た場合は、このセカンドピークから１ピッチ周期分の適
応符号ベクトル（図２８（ｃ）の太線部）の中で振幅が
最大となる位置をピッチピークとする。Pitch peak position corrector 2703 calculates the adaptive code vector output from adaptive codebook 2701
A vector having a length of one pitch cycle length L including the point of the pitch peak position calculated by the pitch peak position calculator 2702 is cut out, and a point having the maximum amplitude value is searched out from the cut out waveform to search for a position. Calculator 270
4 is output. This process is performed only when the pitch period L is shorter than the subframe length. If the pitch period L is longer than the subframe length, the pitch peak position output from the pitch peak position calculator 2702 is output to the pulse position searcher 2705 as it is. The pitch peak position output from the pitch peak position calculator 2702 may be the second highest amplitude place in one pitch waveform when one subframe length is equivalent to about one pitch period. (FIGS. 28A and 28B: There is only one pitch peak in one subframe,
(Since there are two points (second peaks) having the second largest amplitude value in one pitch period waveform in one subframe, the second peak is erroneously detected as a pitch peak.) Therefore, the pitch peak position corrector 2703
By checking whether there is a point having a larger amplitude value within one pitch cycle length from the pitch peak position output from the pitch peak position calculator 2702, the vicinity of the pitch peak position output from the pitch peak position calculator 2702 is checked. When there is a point having an amplitude value larger than the amplitude value of the point, the point having the larger amplitude value is set as the pitch peak position. For example, when the pitch peak position calculator 2702 outputs the second peak in FIG. 28 (c), the amplitude is the largest in the adaptive code vector for one pitch period from the second peak (the thick line portion in FIG. 28 (c)). Is a pitch peak.

【０１９６】探索位置算出器２７０４は、ピッチピーク
位置補正器２７０３から出力されたピッチピーク位置を
基準として音源パルスの探索位置を決定し、パルス位置
探索器２７０５に出力する。探索位置の決定法として
は、実施の形態５または実施の形態６または実施の形態
１４などのように、ピッチピーク位置近傍は密に、それ
以外の部分は疎に、音源パルスの探索位置を限定する方
法がある。この限定方法は、パルスが立てられる確率が
高い位置がピッチパルス近傍に集中する統計的結果に基
づいている。パルス位置探索範囲を限定しない場合、有
声部においてはピッチパルス近傍にパルスが立てられる
確率がその他の部分に立てられる確率に比べて高くなる
ことを利用するものである。The search position calculator 2704 determines the search position of the excitation pulse based on the pitch peak position output from the pitch peak position corrector 2703, and outputs it to the pulse position searcher 2705. As a method of determining the search position, the search position of the sound source pulse is limited densely in the vicinity of the pitch peak position and sparsely in other portions, as in the fifth embodiment, the sixth embodiment or the fourteenth embodiment. There is a way to do that. This limiting method is based on a statistical result in which positions where the pulse is likely to be raised are concentrated near the pitch pulse. In the case where the pulse position search range is not limited, the fact that the probability that a pulse is raised near a pitch pulse in a voiced part is higher than the probability that a pulse is raised in other parts is used.

【０１９７】パルス位置探索器２７０５は、探索位置算
出器２７０４から出力された音源パルス探索位置と、別
途入力されるピッチ周期Ｌを用いて、音源パルスを立て
る位置の最適な組み合わせを決定する。パルス探索の方
法は「ITU-T RecommendationG.729: Coding of Speecha
t 8 kbits/s using Conjugate-Structure Algebraic-Co
de-Excited Linear-Prediction (CS-ACELP), March 199
6」に示されているように、例えばパルス数が４本の場
合は実施の形態６で示した式（２）を最大化するように
ｉ０からｉ３の組み合わせを決定する。なお、この時の
各音源パルスの極性は、雑音符号帳成分のターゲットベ
クトル、即ち聴覚重みづけされた入力音声から聴覚重み
づけ合成フィルタの零入力応答信号と適応符号帳成分の
信号を減じた信号ベクトル、の各位置における極性と等
しくなるようにパルス位置探索を行う前に予め決定すれ
ば探索のための演算量を大幅に軽減できる。また、ピッ
チ周期がサブフレーム長より短い場合には実施の形態５
にも示したようにピッチ周期化フィルタをかけることに
よって、音源パルスをインパルスではなくピッチ周期の
パルス列になるようにしている。このようなピッチ周期
化処理を行う場合は、聴覚重みづけ合成フィルタのイン
パルス応答ベクトルにピッチ周期化フィルタを予めかけ
ておけば、ピッチ周期化を行わない場合と同様にして式
（２）の最大化によって音源パルスの探索を行うことが
できる。このようにして決定された各音源パルスの位置
に、決定された各音源パルスの極性に従ってパルスを立
て、ピッチ周期Ｌを用いてピッチ周期化フィルタをかけ
れば、パルス音源ベクトルが生成される。生成されたパ
ルス音源ベクトルは乗算器２７０７に出力される。パル
ス位置探索器２７０５から乗算器２７０７に出力された
パルス音源ベクトルは、外部のゲイン量子化器によって
量子化された量子化パルス音源ベクトル利得が乗算され
て加算器２７０８に出力される。なお図２７では省略し
ているが、符号器のパルス位置探索器２７０５において
は、パルス音源ベクトルとともにパルス音源ベクトルを
表す各音源パルスの極性およびインデックス情報が別途
音源生成部の外部に出力される。この音源パルスの極性
およびインデックス情報は符号化器や多重化器などを通
って伝送路へ出力されるデータ系列に変換されて伝送路
へ送り出される。The pulse position search unit 2705 determines the optimum combination of the position where the sound source pulse is to be formed using the sound source pulse search position output from the search position calculator 2704 and the pitch period L separately input. The method of pulse search is described in "ITU-T Recommendation G.729: Coding of Speecha
t 8 kbits / s using Conjugate-Structure Algebraic-Co
de-Excited Linear-Prediction (CS-ACELP), March 199
As shown in “6”, for example, when the number of pulses is four, the combination of i0 to i3 is determined so as to maximize Expression (2) shown in the sixth embodiment. Note that the polarity of each excitation pulse at this time is the target vector of the noise codebook component, that is, a signal obtained by subtracting the zero input response signal of the auditory weighting synthesis filter and the signal of the adaptive codebook component from the auditory weighted input speech. If the pulse position is determined in advance before performing the pulse position search so as to be equal to the polarity at each position of the vector, the calculation amount for the search can be greatly reduced. When the pitch period is shorter than the subframe length, the fifth embodiment
By applying a pitch period filter as described above, the sound source pulse is not an impulse but a pulse train having a pitch period. In the case of performing such pitch periodization processing, if a pitch periodization filter is previously applied to the impulse response vector of the auditory weighting synthesis filter, the maximum value of Expression (2) can be obtained in the same manner as when pitch periodization is not performed. The search for the sound source pulse can be performed by the conversion. A pulse is generated at the position of each sound source pulse determined in this way according to the determined polarity of each sound source pulse, and a pitch sound source vector is generated by applying a pitch period filter using the pitch period L. The generated pulse excitation vector is output to multiplier 2707. The pulse excitation vector output from the pulse position searcher 2705 to the multiplier 2707 is multiplied by a quantized pulse excitation vector gain quantized by an external gain quantizer and output to the adder 2708. Although omitted in FIG. 27, in the pulse position searcher 2705 of the encoder, the polarity and index information of each excitation pulse representing the pulse excitation vector together with the pulse excitation vector are separately output outside the excitation generation unit. The polarity and index information of the excitation pulse are converted into a data sequence to be output to the transmission path through an encoder, a multiplexer, and the like, and sent out to the transmission path.

【０１９８】加算器２７０８は、乗算器２７０６から出
力された適応符号ベクトル成分と、乗算器２７０７から
出力されたパルス音源ベクトル成分とのベクトル加算を
行い、励振音源ベクトルとして出力する。Adder 2708 performs vector addition of the adaptive code vector component output from multiplier 2706 and the pulse excitation vector component output from multiplier 2707, and outputs the resultant as an excitation excitation vector.

【０１９９】なお、本実施の形態のおいて、実施の形態
１２または実施の形態１３または実施の形態１４のよう
にインデックス更新手段またはパルス番号およびインデ
ックスの更新手段または固定探索位置と位相適応探索位
置の併用を取り入れれば、伝送路誤りの影響を緩和する
ことができる。また、固定探索位置のパルス音源との切
り替え使用を行えば、さらに伝送路誤りの影響の伝播を
抑えることも可能である。In this embodiment, the index updating means or the pulse number and index updating means or the fixed search position and the phase adaptive search position are different from those in the twelfth, thirteenth, or fourteenth embodiment. By adopting the combination of the above, the influence of the transmission path error can be reduced. Further, by switching and using the pulse sound source at the fixed search position, the propagation of the influence of the transmission path error can be further suppressed.

【０２００】また、本発明のピッチピーク位置補正器
は、実施の形態３から実施の形態１１までのいずれの音
声符号化装置にも適用することが可能である。Further, the pitch peak position corrector of the present invention can be applied to any of the speech coding apparatuses according to the third to eleventh embodiments.

【０２０１】なお、パルスの立て方としては、定数本例
えば４本のパルスを探索範囲、例えば３２箇所の位置の
どこかに立てる場合においては、前述のように３２箇所
を４つに分けて１本のパルスを割り当てられた８箇所の
中の１箇所に決定するように全ての組み合わせ（８×８
×８×８通り）を探索する方法の他に、３２箇所の中か
ら４箇所を選びだす組み合わせ全てについて探索する方
法などがある。なお、振幅１のインパルスの組み合わせ
の他に、複数本例えば２本のパルスを組み合わせたパル
ス対の組み合わせや、振幅の異なるインパルスの組み合
わせによるパルスの立て方も可能である。As for the method of setting pulses, when a constant number of pulses, for example, four pulses are set in a search range, for example, somewhere in 32 positions, 32 positions are divided into four as described above, and one pulse is set. All of the combinations (8 × 8
In addition to the method of searching (× 8 × 8 ways), there is a method of searching for all the combinations that select four places from 32 places. In addition to the combination of the impulses having the amplitude of 1, a combination of a plurality of pulses, for example, two pulses, or a combination of impulses having different amplitudes may be used.

【０２０２】（実施の形態１６）図２９は本発明の第１
６の実施の形態を示し、連続するサブフレーム間の音源
信号波形の位相の連続性を利用して、ピッチピーク位置
の存在範囲をピッチピーク位置算出前に予め限定するＣ
ＥＬＰ型音声符号化装置の音源生成部を示す。図２９に
おいて、２９０１は適応符号ベクトルをピッチピーク位
置算出器２９０２と乗算器２９０８に出力する適応符号
帳、２９０２は適応符号帳２９０１から出力された適応
符号ベクトルと音声生成部の外部から入力されるピッチ
周期Ｌとピッチピーク探索範囲限定器２９０３から出力
されるピッチピーク探索範囲を入力として、適応符号ベ
クトル内のピッチピーク位置を算出して遅延器２９０４
と探索位置算出器２９０６とに出力するピッチピーク位
置算出器、２９０３は遅延器２９０４から出力された直
前のサブフレームにおけるピッチピーク位置と遅延器２
９０５から出力された直前のサブフレームにおけるピッ
チ周期と音源生成部の外部から入力される現在のサブフ
レームにおけるピッチ周期Ｌとを入力として、現在のサ
ブフレームにおけるピッチピーク位置を予測し、予測し
たピッチピーク位置に基づいてピッチピーク位置を探索
する範囲を限定して、その範囲をピッチピーク位置算出
器２９０２に出力するピッチピーク探索範囲限定器、遅
延器２９０４はピッチピーク位置算出器から出力された
ピッチピーク位置を入力として、１サブフレーム分遅延
させてピッチピーク探索範囲限定器２９０３に出力する
遅延器、２９０５は音声生成部の外部から入力されるピ
ッチ周期Ｌを入力として、１サブフレーム分遅延させて
ピッチピーク探索範囲限定器２９０３に出力する遅延
器、２９０６はピッチピーク位置算出器２９０２から出
力されたピッチピーク位置と音源生成部の外部から入力
されるピッチ周期Ｌとを入力として、音源パルスの探索
位置をパルス位置探索器２９０７に出力する探索位置算
出器、２９０７は探索位置算出器２９０６から入力され
る音源パルスの探索位置と音源生成部の外部から入力さ
れるピッチ周期Ｌとを入力とし、入力された音源パルス
探索位置とピッチ周期Ｌを用いて音源パルスの位置を探
索し、パルス音源ベクトルを乗算器２９０９に出力する
パルス位置探索器、２９０８は適応符号帳から出力され
た適応符号ベクトルを入力として量子化適応符号ベクト
ル利得を乗じて加算器２９１０に出力する乗算器、２９
０９はパルス位置探索器２９０７から出力されるパルス
音源ベクトルを入力として量子化パルス音源ベクトル利
得を乗じて加算器２９１０に出力する乗算器、２９１０
は乗算器２９０８および２９０９から出力されたベクト
ルをそれぞれ入力とし、入力されたベクトルの加算を行
い、励振音源ベクトルとして出力する加算器である。(Embodiment 16) FIG. 29 shows the first embodiment of the present invention.
6 shows an embodiment 6 in which the existence range of the pitch peak position is previously limited before calculating the pitch peak position by using the continuity of the phase of the sound source signal waveform between consecutive subframes.
2 shows a sound source generation unit of the ELP type speech coding apparatus. In FIG. 29, reference numeral 2901 denotes an adaptive codebook that outputs an adaptive code vector to a pitch peak position calculator 2902 and a multiplier 2908, and reference numeral 2902 denotes an adaptive code vector output from the adaptive codebook 2901 and input from outside the speech generation unit. Taking the pitch period L and the pitch peak search range output from the pitch peak search range limiter 2903 as input, calculate the pitch peak position in the adaptive code vector, and
And a search position calculator 2906 that outputs a pitch peak position calculator 2903, and 2903 is a pitch peak position in the immediately preceding subframe output from the delay unit 2904 and the delay unit 2906.
Using the pitch period in the immediately preceding subframe output from 905 and the pitch period L in the current subframe input from outside the sound source generation unit as input, predicts the pitch peak position in the current subframe, and predicts the predicted pitch. A pitch peak search range limiter that limits the range in which the pitch peak position is searched based on the peak position and outputs the range to the pitch peak position calculator 2902, and the delay unit 2904 outputs the pitch output from the pitch peak position calculator. A delay device that receives a peak position as input and delays it by one subframe and outputs it to a pitch peak search range limiter 2903. A delay unit 2905 receives a pitch period L input from outside the voice generation unit as input and delays it by one subframe. The delay device that outputs to the pitch peak search range limiter 2903 A search position calculator that outputs a search position of a sound source pulse to a pulse position searcher 2907 by using the pitch peak position output from the peak position calculator 2902 and the pitch period L input from outside the sound source generator as inputs. Receives the search position of the sound source pulse input from the search position calculator 2906 and the pitch period L input from outside the sound source generator, and uses the input sound source pulse search position and the pitch period L to generate the sound source pulse. A pulse position searcher that searches for a position and outputs a pulse excitation vector to a multiplier 2909, receives the adaptive code vector output from the adaptive codebook as an input, multiplies the quantized adaptive code vector gain, and outputs the result to an adder 2910. Multiplier, 29
09 is a multiplier which receives the pulse excitation vector output from the pulse position searcher 2907 as input, multiplies the quantization pulse excitation vector gain, and outputs the result to the adder 2910;
Is an adder that receives the vectors output from the multipliers 2908 and 2909 as inputs, performs addition of the input vectors, and outputs the result as an excitation sound source vector.

【０２０３】以上のように構成された音声符号化装置の
音源生成部について、図２９を用いてその動作を説明す
る。適応符号帳２９０１は、過去の励振音源のバッファ
により構成され、外部のピッチ分析または適応符号帳探
索手段によって求められたピッチ周期またはピッチラグ
に基づいて励振音源のバッファから該当する部分を取り
出し、適応符号ベクトルとしてピッチピーク位置算出器
２９０２および乗算器２９０８に出力する。適応符号帳
２９０１から乗算器２９０８に出力された適応符号ベク
トルは、外部のゲイン量子化器によって量子化された量
子化適応符号ベクトル利得が乗算されて加算器２９１０
に出力される。[0203] The operation of the sound source generation unit of the speech coding apparatus configured as described above will be described with reference to FIG. The adaptive codebook 2901 is constituted by a buffer of a past excitation source, extracts a corresponding portion from the buffer of the excitation source based on a pitch period or pitch lag obtained by an external pitch analysis or an adaptive codebook search means, and generates an adaptive codebook. The vector is output to the pitch peak position calculator 2902 and the multiplier 2908 as a vector. The adaptive code vector output from the adaptive codebook 2901 to the multiplier 2908 is multiplied by a quantized adaptive code vector gain quantized by an external gain quantizer and added to the adder 2910.
Is output to

【０２０４】ピッチピーク位置算出器２９０２は、適応
符号ベクトルからピッチピークを検出し、その位置を遅
延器２９０４と探索位置算出器２９０６のそれぞれに出
力する。ピッチピーク位置の検出（算出）は、ピッチ周
期Ｌで並べたインパルス列ベクトルと適応符号ベクトル
の正規化相互相関関数を最大化することによって行うこ
とができる。また、ピッチ周期Ｌで並べたインパルス列
ベクトルに合成フィルタのインパルス応答を畳み込んだ
ベクトルと、適応符号ベクトルに合成フィルタのインパ
ルス応答を畳み込んだベクトルとの正規化相互相関関数
を最大化することによって、より精度良くピッチピーク
位置の検出を行うことも可能である。さらに、検出され
たピッチピーク位置を含む１ピッチ周期波形の中から振
幅値最大となる位置をピッチピークとする後処理を加え
れば、１ピッチ周期波形内のセカンドピークを誤検出す
ることを回避することも可能である。[0204] Pitch peak position calculator 2902 detects a pitch peak from the adaptive code vector, and outputs the position to delay device 2904 and search position calculator 2906, respectively. The detection (calculation) of the pitch peak position can be performed by maximizing the normalized cross-correlation function between the impulse train vector arranged in the pitch period L and the adaptive code vector. Also, maximizing a normalized cross-correlation function between a vector obtained by convolving the impulse response of the synthesis filter with the impulse train vector arranged in the pitch period L and a vector obtained by convolving the impulse response of the synthesis filter with the adaptive code vector. Accordingly, the pitch peak position can be detected with higher accuracy. Furthermore, if post-processing is performed to set the position where the amplitude value becomes maximum among the one-pitch periodic waveforms including the detected pitch-peak position as the pitch peak, erroneous detection of the second peak in the one-pitch periodic waveform is avoided. It is also possible.

【０２０５】遅延器２９０４は、ピッチピーク位置算出
器２９０２で算出されたピッチピーク位置を１サブフレ
ーム分だけ遅延させてピッチピーク探索範囲限定器２９
０３に出力する。即ち、ピッチピーク探索範囲限定器２
９０３には直前のサブフレームにおけるピッチピーク位
置が遅延器２９０４から入力される。遅延器２９０５
は、音源生成部の外部から入力されるピッチ周期Ｌを１
サブフレーム分だけ遅延させてピッチピーク探索範囲限
定器２９０３に出力する。即ち、ピッチピーク探索範囲
限定器２９０３には直前のサブフレームにおけるピッチ
周期が遅延器２９０５から入力される。The delay unit 2904 delays the pitch peak position calculated by the pitch peak position calculator 2902 by one subframe, and sets the pitch peak search range limiter 29
03 is output. That is, the pitch peak search range limiter 2
To 903, the pitch peak position in the immediately preceding subframe is input from the delay unit 2904. Delay unit 2905
Sets the pitch period L input from outside the sound source generation unit to 1
Delayed by a subframe and output to pitch peak search range limiter 2903. That is, the pitch period in the immediately preceding subframe is input from the delay unit 2905 to the pitch peak search range limiter 2903.

【０２０６】ピッチピーク探索範囲限定器２９０３は、
まず始めに遅延器２９０５から入力される直前のサブフ
レームにおけるピッチ周期と現在のサブフレームにおけ
るピッチ周期の比較を行い、現在のサブフレームが有声
（定常）部であるかどうかの判定を行う。具体的には、
直前のサブフレームにおけるピッチ周期と現在のサブフ
レームにおけるピッチ周期との差が小さい場合（例えば
±５サンプル以内のとき）に有声（定常）部であると判
定する。なお、遅延器を増やして数サブフレーム前まで
のピッチ周期を用いて有声判定を行うこともできる。有
声（定常）部であると判定されると、ピッチピーク探索
範囲限定器２９０３は、遅延器２９０４から入力される
直前のサブフレームにおけるピッチピーク位置と、遅延
器２９０５から入力される直前のサブフレームにおける
ピッチ周期と、現在のサブフレームにおけるピッチ周期
Ｌを入力として、現在のサブフレームにおけるピッチピ
ーク位置を予測し、その予測位置の前後（例えば１０サ
ンプル）をピッチピーク位置の探索を行う範囲とする。
なお、予測したピッチピーク位置がサブフレーム先頭付
近にある場合は、１ピッチ周期後ろの付近も探索範囲に
加え、予測したピッチピーク位置がサブフレームの先頭
から１ピッチ周期後ろの位置の付近にある場合は、サブ
フレーム先頭付近も探索範囲に加える。なお、有声（定
常）部でないと判定された場合は、ピッチピーク探索範
囲の限定は行わずに、サブフレーム全体をピッチピーク
探索範囲とする。このようにしてピッチピーク探索範囲
限定器２９０３で求められたピッチピーク探索範囲は、
ピッチピーク位置算出器２９０２に出力される。なお、
音声符号化処理を開始した時点（最初のサブフレーム）
においては、過去に入力された（直前のサブフレームに
おける）ピッチ周期Ｌが存在しないため、適当な定数
（例えばピッチ周期の最大値や最小値あるいは０など有
り得ないピッチ周期）を遅延器２９０５が出力するよう
にしておく。遅延器２９０４についても同様である。な
お、予測ピッチピーク位置は実施の形態１０に示される
（６）式によって求められる（図１９参照）。The pitch peak search range limiter 2903 is
First, a comparison is made between the pitch cycle in the immediately preceding subframe input from the delay unit 2905 and the pitch cycle in the current subframe, and it is determined whether the current subframe is a voiced (stationary) part. In particular,
If the difference between the pitch cycle in the immediately preceding subframe and the pitch cycle in the current subframe is small (for example, within ± 5 samples), it is determined to be a voiced (stationary) part. Note that voiced determination can be performed using a pitch period up to several subframes before the number of delay units is increased. If it is determined that the voice frame is a voiced (stationary) part, the pitch peak search range limiter 2903 determines the pitch peak position in the subframe immediately before input from the delay unit 2904 and the subframe immediately before input from the delay unit 2905. , And the pitch period L in the current subframe are input, the pitch peak position in the current subframe is predicted, and the range before and after the predicted position (for example, 10 samples) is set as the range in which to search for the pitch peak position. .
When the predicted pitch peak position is near the head of the subframe, the vicinity after one pitch cycle is added to the search range, and the predicted pitch peak position is near the position one pitch cycle after the head of the subframe. In this case, the vicinity of the head of the subframe is also added to the search range. If it is determined that the pitch is not a voiced (stationary) part, the entire subframe is set as the pitch peak search range without limiting the pitch peak search range. The pitch peak search range obtained by the pitch peak search range limiter 2903 in this manner is
It is output to pitch peak position calculator 2902. In addition,
The point when the audio encoding process started (first subframe)
Since there is no pitch period L input in the past (in the immediately preceding subframe), the delay unit 2905 outputs an appropriate constant (for example, a pitch period that is impossible such as the maximum value or the minimum value of the pitch period or 0). Keep it. The same applies to the delay unit 2904. Note that the predicted pitch peak position is obtained by Expression (6) shown in Embodiment 10 (see FIG. 19).

【０２０７】探索位置算出器２９０６は、ピッチピーク
位置を基準として音源パルスの探索位置を決定し、探索
位置をパルス位置探索器２９０７に出力する。探索位置
の決定法としては、例えば実施の形態６や実施の形態８
に示したようにピッチピーク近傍は密にそれ以外の部分
は疎に探索位置が分布するように決定される。なお、実
施の形態６や実施の形態８に示したようにピッチ周期情
報を用いて音源パルス数を変化させたり、音源パルスの
探索範囲を限定したりすることを適用することも有効で
ある。また、実施の形態１２から実施の形態１４のいず
れかに示したように探索位置を決定すれば、伝送路誤り
の影響を緩和することも可能である。The search position calculator 2906 determines the search position of the sound source pulse based on the pitch peak position, and outputs the search position to the pulse position searcher 2907. As a method of determining a search position, for example, the sixth embodiment or the eighth embodiment
As shown in (1), the search position is determined so that the search position is distributed densely in the vicinity of the pitch peak and sparsely in other portions. It is also effective to change the number of sound source pulses using the pitch period information or limit the search range of the sound source pulses as described in the sixth and eighth embodiments. Further, if the search position is determined as described in any of Embodiments 12 to 14, it is also possible to reduce the influence of transmission path errors.

【０２０８】パルス位置探索器２９０７は、探索位置算
出器２９０６で決定された音源パルス探索位置または予
め決められている固定探索位置と、別途入力されるピッ
チ周期Ｌを用いて、音源パルスを立てる位置の最適な組
み合わせを決定する。パルス探索の方法は「ITU-T Reco
mmendation G.729: Coding of Speech at 8 kbits/susi
ng Conjugate-Structure Algebraic-Code-Excited Line
ar-Prediction (CS-ACELP), March 1996 」に示されて
いるように、例えばパルス数が４本の場合は実施の形態
６で示した式（２）を最大化するようにｉ０からｉ３の
組み合わせを決定する。なお、この時の各音源パルスの
極性は、雑音符号帳成分のターゲットベクトル、即ち聴
覚重みづけされた入力音声から聴覚重みづけ合成フィル
タの零入力応答信号と適応符号帳成分の信号を減じた信
号ベクトル、の各位置における極性と等しくなるように
パルス位置探索を行う前に予め決定している。また、ピ
ッチ周期がサブフレーム長より短い場合には実施の形態
５にも示したようにピッチ周期化フィルタをかけること
によって、音源パルスをインパルスではなくピッチ周期
のパルス列になるようにしている。このようなピッチ周
期化処理を行う場合は、聴覚重みづけ合成フィルタのイ
ンパルス応答ベクトルにピッチ周期化フィルタを予めか
けておけば、ピッチ周期化を行わない場合と同様にして
式（２）の最大化によって音源パルスの探索を行うこと
ができる。このようにして決定された各音源パルスの位
置に、決定された各音源パルスの極性にしたがってパル
スを立て、ピッチ周期Ｌを用いてピッチ周期化フィルタ
をかければ、パルス音源ベクトルが生成される。生成さ
れたパルス音源ベクトルは乗算器２９０９に出力され
る。パルス位置探索器２９０７から乗算器２９０９に出
力されたパルス音源ベクトルは、外部のゲイン量子化器
によって量子化された量子化パルス音源ベクトル利得が
乗算されて加算器２９１０に出力される。The pulse position search unit 2907 generates a sound source pulse using the pitch period L input separately from the sound source pulse search position determined by the search position calculator 2906 or a predetermined fixed search position. To determine the optimal combination of See ITU-T Reco
mmendation G.729: Coding of Speech at 8 kbits / susi
ng Conjugate-Structure Algebraic-Code-Excited Line
ar-Prediction (CS-ACELP), March 1996 ”, for example, when the number of pulses is four, the values of i0 to i3 are set so as to maximize the expression (2) shown in the sixth embodiment. Determine the combination. Note that the polarity of each excitation pulse at this time is the target vector of the noise codebook component, that is, a signal obtained by subtracting the zero input response signal of the auditory weighting synthesis filter and the signal of the adaptive codebook component from the auditory weighted input speech. Is determined before performing the pulse position search so as to be equal in polarity at each position of the vector. When the pitch period is shorter than the subframe length, a pitch period filter is applied as described in the fifth embodiment, so that the excitation pulse is not an impulse but a pulse train of the pitch period. In the case of performing such pitch periodization processing, if a pitch periodization filter is previously applied to the impulse response vector of the auditory weighting synthesis filter, the maximum value of Expression (2) can be obtained in the same manner as when pitch periodization is not performed. The search for the sound source pulse can be performed by the conversion. A pulse is generated at the position of each sound source pulse determined in this way according to the polarity of each determined sound source pulse, and if a pitch period filter is applied using the pitch period L, a pulse sound source vector is generated. The generated pulse excitation vector is output to multiplier 2909. The pulse excitation vector output from the pulse position searcher 2907 to the multiplier 2909 is multiplied by a quantized pulse excitation vector gain quantized by an external gain quantizer and output to the adder 2910.

【０２０９】加算器２９１０は、乗算器２９０８から出
力された適応符号ベクトル成分と、乗算器２９０９から
出力されたパルス音源ベクトル成分とのベクトル加算を
行い、励振音源ベクトルとして出力する。The adder 2910 performs vector addition of the adaptive code vector component output from the multiplier 2908 and the pulse excitation vector component output from the multiplier 2909, and outputs the result as an excitation excitation vector.

【０２１０】なお、パルスの立て方としては、定数本例
えば４本のパルスを探索範囲、例えば３２箇所の位置の
どこかに立てる場合においては、前述のように３２箇所
を４つに分けて１本のパルスを割り当てられた８箇所の
中の１箇所に決定するように全ての組み合わせ（８×８
×８×８通り）を探索する方法の他に、３２箇所の中か
ら４箇所を選びだす組み合わせ全てについて探索する方
法などがある。なお、振幅１のインパルスの組み合わせ
の他に、複数本例えば２本のパルスを組み合わせたパル
ス対の組み合わせや、振幅の異なるインパルスの組み合
わせによるパルスの立て方も可能である。As for the method of setting pulses, when a constant number of pulses, for example, 4 pulses are set in a search range, for example, somewhere in 32 positions, 32 positions are divided into four as described above, and 1 pulse is set. All combinations (8 × 8) are determined so that one pulse is determined to be one of eight assigned pulse positions.
In addition to the method of searching (× 8 × 8 ways), there is a method of searching for all the combinations that select four places from 32 places. In addition to the combination of the impulses having the amplitude of 1, a combination of a plurality of pulses, for example, two pulses, or a combination of impulses having different amplitudes may be used.

【０２１１】（実施の形態１７）図３０は本発明の第１
７の実施の形態を示し、パルス本数が少なくて各パルス
に割り当てられている位置情報が十分である固定探索位
置を用いたパルス探索器と、パルス本数が多くて各パル
スに割り当てられている位置情報が必ずしも十分でない
音源パルス探索位置を用いたパルス探索器と、これら複
数のパルス探索器から出力されたパルス音源ベクトルの
中から最適なパルス音源ベクトルを選択する選択器とを
備えたＣＥＬＰ型音声符号化装置の音源生成部を示して
いる。(Embodiment 17) FIG. 30 shows a first embodiment of the present invention.
7 shows a seventh embodiment in which a pulse searcher uses a fixed search position in which the number of pulses is small and the position information assigned to each pulse is sufficient, and a position in which the number of pulses is large and assigned to each pulse. A CELP-type speech comprising a pulse searcher using a sound source pulse search position where information is not always sufficient, and a selector for selecting an optimum pulse sound source vector from among the pulse sound source vectors output from the plurality of pulse searchers 3 shows a sound source generation unit of the encoding device.

【０２１２】図３０において、３００１は過去の励振音
源ベクトルを保存し、選択された適応符号ベクトルをピ
ッチピーク位置算出器３００２およびピッチゲイン乗算
器３００７に出力する適応符号帳、３００２は適応符号
帳３００１から出力された適応符号ベクトルと外部から
入力されるピッチ周期Ｌを入力としてピッチピーク位置
を算出し、探索位置算出器３００３に出力するピッチピ
ーク位置算出器、３００３はピッチピーク位置算出器３
００２から出力されたピッチピーク位置と音源生成部の
外部から入力されるピッチ周期Ｌとを入力として、音源
パルスの探索位置をパルス位置探索器３００４に出力す
る探索位置算出器、３００４は探索位置算出器３００３
から出力された探索位置と、音源生成部の外部で別途算
出されたピッチ周期Ｌとを入力としてパルス音源を探索
し、パルス音源ベクトル１を選択器３００５に出力する
パルス位置探索器、８００５はパルス位置探索器３００
４から出力されるパルス音源ベクトル１とパルス位置探
索器３００６から出力されるパルス音源ベクトル２とを
入力とし、最適であるパルス音源ベクトルを選択して乗
算器３００８に出力する選択器、３００６は予め定めら
れた固定探索位置と音源生成部の外部から入力されるピ
ッチ周期Ｌとを入力としてパルス音源を探索し、パルス
音源ベクトル２として選択器３００５へ出力するパルス
位置探索器、３００７は適応符号帳３００１から出力さ
れた適応符号ベクトルに適応符号ベクトル利得を乗じて
加算器３００９に出力する乗算器、３００８は選択器３
００５から出力されたパルス音源ベクトルにパルス音源
ベクトル利得を乗じて加算器３００９に出力する乗算
器、３００９は乗算器３００７からの出力と乗算器３０
０８からの出力を入力とし、ベクトル加算して励振音源
ベクトルとして出力する加算器である。In FIG. 30, reference numeral 3001 denotes an adaptive codebook for storing a past excitation source vector and outputting a selected adaptive code vector to a pitch peak position calculator 3002 and a pitch gain multiplier 3007; Calculates the pitch peak position by using the adaptive code vector output from the PID and the pitch period L input from the outside, and outputs the pitch peak position to the search position calculator 3003.
A search position calculator that outputs a search position of a sound source pulse to a pulse position searcher 3004 by using the pitch peak position output from 002 and the pitch period L input from outside the sound source generation unit as inputs, and 3004 calculates a search position. Container 3003
A pulse position searcher that searches for a pulsed sound source by using the search position output from the above and a pitch period L separately calculated outside the sound source generation unit and outputs a pulsed sound source vector 1 to a selector 3005, Position searcher 300
The selector 3006 receives the pulse excitation vector 1 output from the pulse excitation vector 4 and the pulse excitation vector 2 output from the pulse position searcher 3006, selects the optimal pulse excitation vector, and outputs the selected pulse excitation vector to the multiplier 3008. A pulse position searcher that searches for a pulse sound source by using a fixed fixed search position and a pitch period L input from the outside of the sound source generation unit and outputs it as a pulse sound source vector 2 to a selector 3005. Reference numeral 3007 denotes an adaptive codebook. A multiplier that multiplies the adaptive code vector output from 3001 by the adaptive code vector gain and outputs the result to the adder 3009;
A multiplier that multiplies the pulse excitation vector output from 005 by the pulse excitation vector gain and outputs the result to the adder 3009. The output 3009 is the output from the multiplier 3007 and the multiplier 309.
This is an adder that receives an output from the input device 08, adds a vector, and outputs the resultant as an excitation sound source vector.

【０２１３】以上のように構成された、音源生成部の動
作について、図３０を用いて説明する。図３０におい
て、適応符号帳３００１は、音源生成部の外部で予め算
出されるピッチ周期Ｌだけ過去に溯った点から、適応符
号ベクトルをサブフレーム長だけ切り出して、適応符号
ベクトルとして出力する。ピッチ周期Ｌがサブフレーム
長に満たない場合は、切り出したピッチ周期Ｌのベクト
ルを、サブフレーム長に達するまで繰り返して接続した
ものを適応符号ベクトルとして出力する。The operation of the sound source generator configured as described above will be described with reference to FIG. In FIG. 30, adaptive codebook 3001 cuts out an adaptive code vector by a subframe length from a point that has been retroactive by a pitch period L calculated in advance outside the excitation generation unit, and outputs the cutout as an adaptive code vector. If the pitch period L is less than the subframe length, a vector obtained by repeatedly connecting the extracted pitch period L until the subframe length is reached is output as an adaptive code vector.

【０２１４】ピッチピーク位置算出器３００２は、適応
符号帳３００１から出力された適応符号ベクトルを用い
て適応符号ベクトル内に存在するピッチピークの位置を
決定する。ピッチピークの位置は、ピッチ周期で並べた
インパルス列と適応符号ベクトルとの正規化相互相関を
最大化することによって行うことができる。また、ピッ
チ周期で並べたインパルス列を合成フィルタに通したも
のと、適応符号ベクトルを合成フィルタに通したもとの
誤差を最小化する（正規化相互相関関数を最大化する）
ことによって、より精度良く求めることも可能である。
なお、実施の形態１５に示したようなピッチピーク補正
器を備えるとピッチピーク位置の算出誤りを減らすこと
ができる。[0214] Pitch peak position calculator 3002 determines the position of the pitch peak existing in the adaptive code vector using the adaptive code vector output from adaptive code book 3001. The position of the pitch peak can be determined by maximizing the normalized cross-correlation between the impulse train arranged in the pitch cycle and the adaptive code vector. Further, an error in which the impulse train arranged in the pitch cycle is passed through the synthesis filter and an original error in passing the adaptive code vector through the synthesis filter are minimized (maximize the normalized cross-correlation function).
By doing so, it is also possible to obtain it with higher accuracy.
In addition, if the pitch peak corrector as shown in the fifteenth embodiment is provided, it is possible to reduce the calculation error of the pitch peak position.

【０２１５】探索位置算出器３００３は、ピッチピーク
位置算出器３００２から出力されたピッチピーク位置を
基準として音源パルスの探索位置を決定し、パルス位置
探索器３００４に出力する。探索位置の決定法として
は、実施の形態５または実施の形態６または実施の形態
１４などのように、ピッチピーク位置近傍は密に、それ
以外の部分は疎に、音源パルスの探索位置を限定する方
法がある。この限定方法は、パルスが立てられる確率が
高い位置がピッチパルス近傍に集中する統計的結果に基
づいている。パルス位置探索範囲を限定しない場合、有
声部においてはピッチパルス近傍にパルスが立てられる
確率がその他の部分に立てられる確率に比べて高くなる
ことを利用するものである。なお、実施の形態１２から
実施の形態１４のいずれかに示すような音源パルス探索
位置の決定法を用いれば、伝送路誤りの影響を緩和する
ことも可能である。The search position calculator 3003 determines the search position of the excitation pulse based on the pitch peak position output from the pitch peak position calculator 3002, and outputs it to the pulse position searcher 3004. As a method of determining the search position, the search position of the sound source pulse is limited densely in the vicinity of the pitch peak position and sparsely in other portions, as in the fifth embodiment, the sixth embodiment or the fourteenth embodiment. There is a way to do that. This limiting method is based on a statistical result in which positions where the pulse is likely to be raised are concentrated near the pitch pulse. In the case where the pulse position search range is not limited, the fact that the probability that a pulse is raised near a pitch pulse in a voiced part is higher than the probability that a pulse is raised in other parts is used. Note that the use of a method for determining a sound source pulse search position as described in any of Embodiments 12 to 14 can also reduce the effect of transmission path errors.

【０２１６】パルス位置探索器３００４は、探索位置算
出器３００３から出力された音源パルス探索位置と、別
途入力されるピッチ周期Ｌを用いて、音源パルスを立て
る位置の最適な組み合わせを決定する。パルス探索の方
法は「ITU-T RecommendationG.729: Coding of Speecha
t 8 kbits/s using Conjugate-Structure Algebraic-Co
de-Excited Linear-Prediction (CS-ACELP), March 199
6」に示されているように、例えばパルス数が４本の場
合は実施の形態６で示した式（２）を最大化するように
ｉ０からｉ３の組み合わせを決定する。なお、この時の
各音源パルスの極性は、雑音符号帳成分のターゲットベ
クトル、即ち聴覚重みづけされた入力音声から聴覚重み
づけ合成フィルタの零入力応答信号と適応符号帳成分の
信号を減じた信号ベクトル、の各位置における極性と等
しくなるようにパルス位置探索を行う前に予め決定すれ
ば探索のための演算量を大幅に軽減できる。また、ピッ
チ周期がサブフレーム長より短い場合には実施の形態５
にも示したようにピッチ周期化フィルタをかけることに
よって、音源パルスをインパルスではなくピッチ周期の
パルス列になるようにしている。このようなピッチ周期
化処理を行う場合は、聴覚重みづけ合成フィルタのイン
パルス応答ベクトルにピッチ周期化フィルタを予めかけ
ておけば、ピッチ周期化を行わない場合と同様にして式
（２）の最大化によって音源パルスの探索を行うことが
できる。このようにして決定された各音源パルスの位置
に、決定された各音源パルスの極性に従ってパルスを立
て、ピッチ周期Ｌを用いてピッチ周期化フィルタをかけ
れば、パルス音源ベクトルが生成される。生成されたパ
ルス音源ベクトルはパルス音源ベクトル１として選択器
３００５に出力される。なお、パルス位置探索器３００
４に用いられる音源パルス探索位置は、音源パルス数を
多くしているので各音源パルスに割り振られる位置情報
は必ずしも十分でないものである。すなわち、パルス位
置探索器３００４を使用するモードは、パルス数は多い
が各パルスの位置を必ずしも厳密に表すことはできない
モードである。このような各パルスの位置情報が不足し
ている場合は、探索位置算出器３００３で行われるよう
なパルス探索位置の決定法を用いることの効果を得るこ
とができる。The pulse position searcher 3004 determines the optimum combination of the position where the sound source pulse is to be formed using the sound source pulse search position output from the search position calculator 3003 and the pitch period L separately input. The method of pulse search is described in "ITU-T Recommendation G.729: Coding of Speecha
t 8 kbits / s using Conjugate-Structure Algebraic-Co
de-Excited Linear-Prediction (CS-ACELP), March 199
As shown in “6”, for example, when the number of pulses is four, the combination of i0 to i3 is determined so as to maximize Expression (2) shown in the sixth embodiment. Note that the polarity of each excitation pulse at this time is the target vector of the noise codebook component, that is, a signal obtained by subtracting the zero input response signal of the auditory weighting synthesis filter and the signal of the adaptive codebook component from the auditory weighted input speech. If the pulse position is determined in advance before performing the pulse position search so as to be equal to the polarity at each position of the vector, the calculation amount for the search can be greatly reduced. When the pitch period is shorter than the subframe length, the fifth embodiment
By applying a pitch period filter as described above, the sound source pulse is not an impulse but a pulse train having a pitch period. In the case of performing such pitch periodization processing, if a pitch periodization filter is previously applied to the impulse response vector of the auditory weighting synthesis filter, the maximum value of Expression (2) can be obtained in the same manner as when pitch periodization is not performed. The search for the sound source pulse can be performed by the conversion. A pulse is generated at the position of each sound source pulse determined in this way according to the determined polarity of each sound source pulse, and a pitch sound source vector is generated by applying a pitch period filter using the pitch period L. The generated pulse excitation vector is output to the selector 3005 as the pulse excitation vector 1. The pulse position searcher 300
The sound source pulse search position used in No. 4 has a large number of sound source pulses, so that the position information allocated to each sound source pulse is not always sufficient. That is, the mode in which the pulse position searcher 3004 is used is a mode in which the number of pulses is large but the position of each pulse cannot always be strictly represented. When the position information of each pulse is insufficient, the effect of using the method of determining the pulse search position as performed by the search position calculator 3003 can be obtained.

【０２１７】パルス位置探索器３００６は、予め定めら
れた固定探索位置と音源生成部の外部から別途入力され
るピッチ周期Ｌを用いて、音源パルスを立てる位置の最
適な組み合わせを決定する。パルス探索の方法は「ITU-
T Recommendation G.729: Coding of Speech at 8 kbit
s/s using Conjugate-Structure Algebraic-Code-Excit
ed Linear-Prediction (CS-ACELP),March 1996」に示さ
れているように、例えばパルス数が４本の場合は実施の
形態６で示した式（２）を最大化するようにｉ０からｉ
３の組み合わせを決定する。なお、この時の各音源パル
スの極性は、雑音符号帳成分のターゲットベクトル、即
ち聴覚重みづけされた入力音声から聴覚重みづけ合成フ
ィルタの零入力応答信号と適応符号帳成分の信号を減じ
た信号ベクトル、の各位置における極性と等しくなるよ
うにパルス位置探索を行う前に予め決定すれば探索のた
めの演算量を大幅に軽減できる。また、ピッチ周期がサ
ブフレーム長より短い場合には実施の形態５にも示した
ようにピッチ周期化フィルタをかけることによって、音
源パルスをインパルスではなくピッチ周期のパルス列に
なるようにしている。このようなピッチ周期化処理を行
う場合は聴覚重みづけ合成フィルタのインパルス応答ベ
クトルにピッチ周期化フィルタを予めかけておけば、ピ
ッチ周期化を行わない場合と同様にして式（２）の最大
化によって音源パルスの探索を行うことができる。この
ようにして決定された各音源パルスの位置に、決定され
た各音源パルスの極性にしたがってパルスを立て、ピッ
チ周期Ｌを用いてピッチ周期化フィルタをかければ、パ
ルス音源ベクトルが生成される。生成されたパルス音源
ベクトルは、パルス音源ベクトル２として選択器３００
５に出力される。ここで、パルス位置探索器３００６に
入力される固定探索位置は、各音源パルスに割り当てら
れる位置情報が十分になるように（具体的にはサブフレ
ーム内の全ての点がこの固定探索位置のパターンに含ま
れるように）音源パルスの数を絞り込んだものでなけれ
ばならない。パルス数を減らして、その分パルスを立て
る位置を正確に表せるようにすることによって、有声立
ち上がり部分などにおける合成音声品質を向上すること
が可能となる。また、このような位置情報が十分である
モードを設けることによって、位置情報が不足するモー
ドのみを使用した場合に生じる劣化を回避することも可
能となる。The pulse position searcher 3006 determines an optimum combination of a position where a sound source pulse is to be set, using a predetermined fixed search position and a pitch period L separately input from outside the sound source generation unit. The method of pulse search is “ITU-
T Recommendation G.729: Coding of Speech at 8 kbit
s / s using Conjugate-Structure Algebraic-Code-Excit
As shown in “ed Linear-Prediction (CS-ACELP), March 1996”, for example, when the number of pulses is four, i0 to i are set so as to maximize the equation (2) shown in the sixth embodiment.
3 is determined. Note that the polarity of each excitation pulse at this time is the target vector of the noise codebook component, that is, a signal obtained by subtracting the zero input response signal of the auditory weighting synthesis filter and the signal of the adaptive codebook component from the auditory weighted input speech. If the pulse position is determined in advance before performing the pulse position search so as to be equal to the polarity at each position of the vector, the calculation amount for the search can be greatly reduced. When the pitch period is shorter than the subframe length, a pitch period filter is applied as described in the fifth embodiment, so that the excitation pulse is not an impulse but a pulse train of the pitch period. In the case of performing such a pitch periodization process, if the pitch periodization filter is applied to the impulse response vector of the auditory weighting synthesis filter in advance, the maximization of Expression (2) is performed in the same manner as the case where the pitch periodization is not performed. Thus, a search for a sound source pulse can be performed. A pulse is generated at the position of each sound source pulse determined in this way according to the polarity of each determined sound source pulse, and if a pitch period filter is applied using the pitch period L, a pulse sound source vector is generated. The generated pulse excitation vector is selected as a pulse excitation vector 2 by the selector 300.
5 is output. Here, the fixed search position input to the pulse position searcher 3006 is set so that the position information assigned to each sound source pulse is sufficient (specifically, all points in the subframe are in the fixed search position pattern). Must be narrowed down to the number of source pulses. By reducing the number of pulses and accurately representing the position at which the pulse is raised, it is possible to improve the quality of synthesized speech in a voiced rising portion or the like. Further, by providing such a mode in which the positional information is sufficient, it is possible to avoid deterioration that occurs when only the mode in which the positional information is insufficient is used.

【０２１８】なお、図３０においてはパルス位置探索器
は２種類の場合を示しているが、３種類以上に増やして
入力信号の特徴に応じた切り替えを行うことも可能であ
る。また、パルス位置探索器３００４に入力する音源パ
ルス探索位置を、探索位置算出器３００３から出力され
たものの代わりに、予め定められている固定探索位置と
する構成であっても、各パルスに割り当てられる位置情
報が十分である少ないパルス数のモードを備える構成
は、有声立ち上がり部分などにおける合成音声品質を向
上する効果や位置情報が不足するモードのみを使用した
場合に生じる合成音声品質の劣化を回避する効果が得ら
れる。しかし、探索位置算出器３００３によって決定さ
れる音源パルス探索位置を用いてパルス位置探索器３０
０４がパルス位置探索を行う方が、ピッチピーク付近に
音源パルスが立てられやすい特徴を有する有声部分にお
いては、パルス数の多いモードの利用効率を上げること
ができる。Although FIG. 30 shows two types of pulse position searchers, it is possible to increase the number of types to three or more and perform switching according to the characteristics of the input signal. Also, even if the sound source pulse search position input to the pulse position search unit 3004 is set to a predetermined fixed search position instead of the one output from the search position calculator 3003, it is assigned to each pulse. The configuration having a mode with a small number of pulses with sufficient position information avoids the effect of improving the synthesized voice quality in a voiced rising portion or the like and the deterioration of the synthesized voice quality that occurs when only the mode with insufficient position information is used. The effect is obtained. However, using the sound source pulse search position determined by the search position calculator 3003, the pulse position searcher 30
In the case of the voice part where the sound source pulse is easily formed near the pitch peak, the use efficiency of the mode having a large number of pulses can be improved when the pulse position search is performed by the signal line 04.

【０２１９】選択器３００５は、パルス位置探索器３０
０４から出力されたパルス音源ベクトル１とパルス位置
探索器３００６から出力されたパルス音源ベクトル２と
を比較し、合成音声の歪みが小さくなる方を最適パルス
音源ベクトルとして乗算器３００８に出力する。選択器
３００５から乗算器３００８に出力されたパルス音源ベ
クトルは、外部のゲイン量子化器によって量子化された
量子化パルス音源ベクトル利得が乗算されて加算器３０
０９に出力される。なお図３０では省略しているが、符
号器のパルス位置探索器３００４および３００６におい
ては、パルス音源ベクトル１、２とともに各パルス音源
ベクトルを表す各音源パルスの極性およびインデックス
情報が別途選択器３００５に出力される。さらに選択器
３００５は、パルス音源ベクトル１と２のどちらを選択
したかという情報と、選択したパルス音源ベクトルを表
す、各パルスの極性およびインデックスが音源生成部の
外部に出力される。この選択情報および音源パルスの極
性およびインデックス情報は、符号化器や多重化器など
を通って伝送路へ出力されるデータ系列に変換されて伝
送路へ送り出される。The selector 3005 is the pulse position searcher 30
The pulse excitation vector 1 output from the signal generator 04 and the pulse excitation vector 2 output from the pulse position searcher 3006 are compared, and the one in which the distortion of the synthesized speech is smaller is output to the multiplier 3008 as the optimal pulse excitation vector. The pulse excitation vector output from the selector 3005 to the multiplier 3008 is multiplied by a quantized pulse excitation vector gain quantized by an external gain quantizer and added to the adder 30.
09 is output. Although omitted in FIG. 30, the pulse position searchers 3004 and 3006 of the encoder separately output the polarity and index information of each excitation pulse representing each pulse excitation vector together with the pulse excitation vectors 1 and 2 to the selector 3005. Is output. Further, the selector 3005 outputs information indicating which one of the pulse excitation vectors 1 and 2 has been selected, and the polarity and index of each pulse representing the selected pulse excitation vector, to the outside of the excitation generator. The selection information and the polarity and index information of the excitation pulse are converted into a data sequence to be output to the transmission path through an encoder, a multiplexer and the like, and sent out to the transmission path.

【０２２０】加算器３００９は、乗算器３００７から出
力された適応符号ベクトル成分と、乗算器３００８から
出力されたパルス音源ベクトル成分とのベクトル加算を
行い、励振音源ベクトルとして出力する。Adder 3009 performs vector addition of the adaptive code vector component output from multiplier 3007 and the pulse excitation vector component output from multiplier 3008, and outputs the result as an excitation excitation vector.

【０２２１】なお、本実施の形態において、実施の形態
１２または実施の形態１３または実施の形態１４のよう
にインデックス更新手段またはパルス番号およびインデ
ックスの更新手段または固定探索位置と位相適応探索位
置の併用をパルス位置探索器３００４の前段に備えれ
ば、探索位置算出器３００３を用いることに起因する伝
送路誤りの影響を受けやすいという性質を低くすること
ができる。In this embodiment, as in Embodiment 12 or Embodiment 13 or Embodiment 14, index updating means or pulse number and index updating means or a combination of fixed search position and phase adaptive search position is used. Is provided before the pulse position searcher 3004, it is possible to reduce the property of being easily affected by a transmission path error caused by using the search position calculator 3003.

【０２２２】また、パルスの立て方としては、定数本例
えば４本のパルスを探索範囲、例えば３２箇所の位置の
どこかに立てる場合においては、前述のように３２箇所
を４つに分けて１本のパルスを割り当てられた８箇所の
中の１箇所に決定するように全ての組み合わせ（８×８
×８×８通り）を探索する方法の他に、３２箇所の中か
ら４箇所を選びだす組み合わせ全てについて探索する方
法などがある。なお、振幅１のインパルスの組み合わせ
の他に、複数本例えば２本のパルスを組み合わせたパル
ス対の組み合わせや、振幅の異なるインパルスの組み合
わせによるパルスの立て方も可能である。As for the method of setting pulses, in the case where a constant number of pulses, for example, four pulses are set in a search range, for example, somewhere in 32 positions, 32 positions are divided into four as described above, and one pulse is set. All combinations (8 × 8) are determined so that one pulse is determined to be one of eight assigned pulse positions.
In addition to the method of searching (× 8 × 8 ways), there is a method of searching for all the combinations that select four places from 32 places. In addition to the combination of the impulses having the amplitude of 1, a combination of a plurality of pulses, for example, two pulses, or a combination of impulses having different amplitudes may be used.

【０２２３】なお、パルス数が少なくパルス位置情報が
十分であるモードにおいては、パルス位置情報が不足し
ない範囲内において、パルス位置情報の一部を雑音コー
ドベクトルを表すインデックスに割り当てることによ
り、有声立ち上がり部のみならず無声子音部や雑音的な
入力信号に対する性能向上を図ることも可能である。In a mode in which the number of pulses is small and the pulse position information is sufficient, a portion of the pulse position information is assigned to an index representing a noise code vector within a range where the pulse position information is not insufficient, so that voiced start-up is performed. It is possible to improve the performance not only for the unvoiced part but also for the unvoiced consonant part and the noise-like input signal.

【０２２４】また、上記実施の形態１から１７に示した
音声符号化装置の機能は、磁気ディスク、光磁気ディス
ク、ＩＣカード、ＲＯＭ、ＲＡＭ等の記録媒体にプログ
ラムとして記録することができる。よって、この記録媒
体をコンピュータで読み取ることにより、音声符号化装
置の機能を実現することができる。Further, the functions of the audio encoding apparatus described in the first to seventeenth embodiments can be recorded as a program on a recording medium such as a magnetic disk, a magneto-optical disk, an IC card, a ROM, and a RAM. Therefore, by reading the recording medium with a computer, the function of the audio encoding device can be realized.

【０２２５】[0225]

【発明の効果】本発明は、上記実施の形態から明らかな
ように、適応符号ベクトルのピッチピーク位置に対応す
る雑音符号ベクトルの振幅を強調するための振幅強調窓
を雑音符号ベクトルに乗ずるようにしたので、１ピッチ
波形内に存在する位相情報を利用して、音質向上を図る
ことができる。According to the present invention, as is apparent from the above embodiment, an amplitude emphasis window for emphasizing the amplitude of a noise code vector corresponding to a pitch peak position of an adaptive code vector is multiplied by the noise code vector. Therefore, the sound quality can be improved by using the phase information existing in one pitch waveform.

【０２２６】本発明はまた、適応符号ベクトルのピッチ
ピーク近傍のみに限定した雑音符号ベクトルを用いるよ
うにしたので、雑音符号ベクトルに割り当てられるビッ
ト数が少ない場合でも、音質劣化を少なくでき、ピッチ
ピーク近傍にパワーが集中する有声部の音声品質の向上
を図ることができる。In the present invention, since the noise code vector limited only to the vicinity of the pitch peak of the adaptive code vector is used, even if the number of bits allocated to the noise code vector is small, the sound quality degradation can be reduced and the pitch peak can be reduced. It is possible to improve the voice quality of a voiced part where power is concentrated in the vicinity.

【０２２７】本発明はまた、適応符号ベクトルのピッチ
ピーク位置とピッチ周期に基づいてパルス位置の探索範
囲を決定するようにしたので、１ピッチ波形内でピッチ
周期に応じたパルス位置探索を行うことができ、パルス
位置に割り当てられるビット数が少ない場合でも、音声
品質の劣化を抑えることができる。According to the present invention, the search range of the pulse position is determined based on the pitch peak position and the pitch cycle of the adaptive code vector. Therefore, the pulse position search according to the pitch cycle within one pitch waveform is performed. Therefore, even when the number of bits allocated to the pulse position is small, it is possible to suppress the deterioration of the voice quality.

【０２２８】本発明はまた、パルス探索の範囲を１ピッ
チ周期強の長さに限定することにより、ピッチ周期性の
ある音源信号を効率的に表現できる。また、探索範囲内
に２つのピッチピークを含む為、１つめのピッチピーク
と２つめのピッチピークの形が異なる場合や、１つめの
ピッチピークの位置を誤って検出した場合への対応が可
能である。According to the present invention, a sound source signal having a pitch periodicity can be efficiently represented by limiting the range of the pulse search to a length slightly longer than one pitch period. Also, since two pitch peaks are included in the search range, it is possible to handle cases where the shape of the first pitch peak is different from the shape of the second pitch peak, or where the position of the first pitch peak is erroneously detected. It is.

【０２２９】本発明はまた、入力音声信号のピッチ周期
に応じて適応的にパルス数を変化させる構成を有するの
で、パルス数の切り替えのために新たな情報を必要とせ
ずに音声品質の向上を図ることができる。The present invention also has a configuration in which the number of pulses is adaptively changed in accordance with the pitch period of the input audio signal. Can be planned.

【０２３０】本発明はまた、パルス位置探索の前にピッ
チピーク近傍とそれ以外の部分のパルス振幅を決定する
ため、１ピッチ波形の形状を効率的に表現することがで
きる。According to the present invention, since the pulse amplitude in the vicinity of the pitch peak and in the other portion is determined before the pulse position search, the shape of one pitch waveform can be efficiently expressed.

【０２３１】本発明はまた、ピッチ周期の連続性を用い
てパルスの探索位置を切り替えることによって、有声の
立ち上がり部・無声部と有声定常部・有声部のそれぞれ
に適したパルス音源探索を行うことができるので、音声
品質の向上を図ることができる。According to the present invention, a pulse sound source search suitable for a voiced rising part / unvoiced part and a voiced stationary part / voiced part is performed by switching the pulse search position using the continuity of the pitch period. Therefore, the sound quality can be improved.

【０２３２】本発明はまた、現サブフレームのピッチゲ
イン（適応符号ベクトル利得）を、適応符号帳探索直後
に求めたピッチゲインを用いて初段量子化を行い、音源
探索の最後に求められた最適ピッチゲインと初段量子化
ピッチゲインの差分を２段目で量子化することによっ
て、適応符号帳と固定符号帳（雑音符号帳）の和で駆動
音源ベクトルを生成するＣＥＬＰ型音声符号化装置にお
いては、固定符号帳（雑音符号帳）探索前に得られる情
報を量子化して伝送するため、独立したモード情報を付
加せずに固定符号帳（雑音符号帳）の切り替え等を行う
ことが可能となり、効率的に音声情報を符号化すること
が可能となる。In the present invention, the first stage quantization is performed on the pitch gain (adaptive code vector gain) of the current subframe by using the pitch gain obtained immediately after the adaptive codebook search, and the optimum gain obtained at the end of the excitation search is obtained. In a CELP-type speech coding apparatus that generates a driving excitation vector by the sum of an adaptive codebook and a fixed codebook (noise codebook) by quantizing the difference between the pitch gain and the first-stage quantization pitch gain in the second step, Since the information obtained before the fixed codebook (noise codebook) search is quantized and transmitted, it is possible to switch the fixed codebook (noise codebook) without adding independent mode information, Audio information can be efficiently encoded.

【０２３３】本発明はまた、過去に符号化したピッチ周
期の連続性あるいは過去に符号化したピッチゲインの大
きさ（あるいは連続性）に基づいて現在のサブフレーム
の音声信号のピッチ周期性を判定し、パルス音源の探索
位置を切り替えるため、ピッチ周期性が高いところと低
いところの判定に新たな情報を付加することなく、それ
ぞれの部分に適したパルス音源探索を行うことができる
ようになるので、同一情報量下での音声品質の向上を図
ることができる。The present invention also determines the pitch periodicity of the audio signal of the current subframe based on the continuity of the previously encoded pitch period or the magnitude (or continuity) of the previously encoded pitch gain. However, since the search position of the pulse sound source is switched, the pulse sound source search suitable for each part can be performed without adding new information to the determination of the high and low pitch periodicity. Thus, it is possible to improve the voice quality under the same information amount.

【０２３４】本発明はまた、直前のサブフレームにおけ
るピッチピーク位置と直前のサブフレームにおけるピッ
チ周期と現在のサブフレームにおけるピッチ周期を用い
ることにより、バックワードで現在のサブフレームにお
けるピッチピーク位置を予測でき、この予測ピッチピー
ク位置を用いて位相適応処理を行うか否かを切り替える
ため、切り替え情報の新たな伝送なしに位相適応処理の
切り替えを行うことができ、同一情報量下での音声品質
の向上を図ることができる。なお、位相適応処理を行わ
ないモードにおいては、固定符号帳を使用すれば良く、
無音部等において固定符号帳が使用され続ける様な状態
が生じることにより、位相適応型音源に対する誤りの伝
播をリセットする効果も得ることができる。According to the present invention, the pitch peak position in the current subframe is predicted backward by using the pitch peak position in the immediately preceding subframe, the pitch period in the immediately preceding subframe, and the pitch period in the current subframe. Since it is possible to switch whether or not to perform the phase adaptation process using the predicted pitch peak position, it is possible to switch the phase adaptation process without newly transmitting the switching information, and to improve the voice quality under the same information amount. Improvement can be achieved. In a mode in which the phase adaptation process is not performed, a fixed codebook may be used.
The occurrence of a state where the fixed codebook is continuously used in a silent part or the like occurs, whereby the effect of resetting the propagation of an error to the phase adaptive excitation can be obtained.

【０２３５】本発明はまた、適応符号ベクトルのピッチ
ピーク近傍への信号パワー集中度を用いて位相適応を行
うか否かを切り替えるため、切り替え情報の新たな伝送
無しに位相適応処理の切り替えを行うことができ、同一
情報量下での音声品質の向上を図ることができる。な
お、位相適応処理を行わないモードにおいては、固定符
号帳を使用すれば良く、無音部等において固定符号帳が
使用され続ける様な状態が生じることにより、位相適応
型音源に対する誤りの伝播をリセットする効果も得るこ
とができる。In the present invention, since the phase adaptation is switched by using the signal power concentration in the vicinity of the pitch peak of the adaptive code vector, the phase adaptation process is switched without newly transmitting switching information. Therefore, it is possible to improve the voice quality under the same information amount. In the mode in which the phase adaptation process is not performed, the fixed codebook may be used, and a state in which the fixed codebook is continuously used in a silent part or the like occurs. Can be obtained.

【０２３６】本発明はまた、ピッチピーク位置を０とす
る相対位置で音源パルスの位置を表現するＣＥＬＰ型音
声符号化装置において、音源パルスの各位置を表すイン
デックスをサブフレーム先頭から順番に並ぶように付け
ることにより、伝送路誤りの影響等によってピッチピー
ク位置を誤ってしまった場合において、音源パルス位置
のずれが非常に大きくならないようにすることができ
る。The present invention also provides a CELP-type speech coding apparatus that expresses the position of an excitation pulse by a relative position where the pitch peak position is 0, so that indices indicating the positions of the excitation pulse are arranged in order from the head of the subframe. In the case where the pitch peak position is erroneously set due to the influence of a transmission path error or the like, the deviation of the sound source pulse position can be prevented from becoming very large.

【０２３７】本発明はまた、ピッチピーク位置を０とす
る相対位置で音源パルスの位置を表現するＣＥＬＰ型音
声符号化装置において、音源パルスの各位置を表すイン
デックスをサブフレーム先頭から順番に並ぶように付け
るとともに、同じインデックス番号で表される別々のパ
ルスに付ける番号もサブフレームの先頭から順番になる
ように定義することにより、伝送路誤りの影響等によっ
てピッチピーク位置を誤ってしまった場合において、音
源パルス位置のずれが小さくなるようにすることができ
る。According to the present invention, in a CELP type speech coding apparatus which expresses the position of an excitation pulse by a relative position where a pitch peak position is 0, an index indicating each position of an excitation pulse is arranged in order from the head of a subframe. In addition to the above, when the pitch peak position is erroneously set due to the influence of a transmission line error, etc., by defining the numbers to be attached to the different pulses represented by the same index number in order from the head of the subframe. , The displacement of the sound source pulse position can be reduced.

【０２３８】本発明はまた、ピッチピーク位置を０とす
る相対位置で音源パルスの位置を表現するＣＥＬＰ型音
声符号化装置において、音源パルスの探索位置の全てを
相対位置で表現するのではなく、一部分のみを相対位置
で表現して残りの探索位置は予め定められた固定位置に
することにより、伝送路誤りの影響等によってピッチピ
ーク位置を誤ってしまった場合において、音源パルスの
位置がずれてしまう確率を減らすことにより、伝送路誤
りの影響が長く伝播することを防ぐことができる。The present invention also provides a CELP-type speech coding apparatus which expresses the position of an excitation pulse by a relative position where the pitch peak position is 0, instead of expressing all the search positions of the excitation pulse by a relative position. By expressing only a part as a relative position and setting the remaining search position to a predetermined fixed position, when the pitch peak position is incorrect due to the influence of a transmission path error, the position of the sound source pulse is shifted. By reducing the probability of occurrence, it is possible to prevent the effect of the transmission path error from propagating for a long time.

【０２３９】本発明はまた、１ピッチ波形内のピーク位
置をピッチピーク位置として探し出すため、サブフレー
ム長とピッチ周期とが一致しないことに起因するセカン
ドピークをピッチピークとしてしまう誤検出を防ぐこと
ができる。According to the present invention, since a peak position within one pitch waveform is searched for as a pitch peak position, it is possible to prevent erroneous detection in which a second peak caused by a mismatch between a subframe length and a pitch period is regarded as a pitch peak. it can.

【０２４０】本発明はまた、連続する有声定常部におい
ては、直前のサブフレームにおけるピッチピークの位置
と直前のサブフレームにおけるピッチ周期と現在のサブ
フレームにおけるピッチ周期の情報を用いて現在のピッ
チピーク位置の存在範囲を限定し、その範囲内でピッチ
ピーク位置を探索する構成とすることにより、現在のサ
ブフレームの信号のみを用いてピッチピーク位置を探索
したときに生じる、１ピッチ波形内のセカンドピークを
ピッチピークとする誤検出を防ぐことができる。According to the present invention, in a continuous voiced stationary part, the current pitch peak is obtained by using information on the position of the pitch peak in the immediately preceding subframe, the pitch period in the immediately preceding subframe, and the pitch period in the current subframe. By limiting the existing range of the position and searching for the pitch peak position within the range, the second peak in one pitch waveform generated when searching for the pitch peak position using only the signal of the current subframe is obtained. It is possible to prevent erroneous detection using a peak as a pitch peak.

【０２４１】本発明はまた、パルス音源を雑音符号帳に
適用したＣＥＬＰ型音声符号化装置において、音源パル
ス数が少ない代わりに各音源パルスの位置情報が十分な
モードと、各音源パルスの位置情報が粗い代わりに音源
パルス数が多いモードとの双方を有する雑音符号帳構成
としたので、有声立ち上がり部分の音声品質の向上と音
源パルス数が多いモードの有効利用との双方を実現でき
るものである。The present invention also provides a CELP-type speech coding apparatus in which a pulse excitation is applied to a noise codebook, in a mode in which the number of excitation pulses is small and the position information of each excitation pulse is sufficient. The noise codebook configuration has both a mode with a large number of excitation pulses instead of a coarse, so that it is possible to achieve both an improvement in the voice quality of voiced rising portions and an effective use of the mode with a large number of excitation pulses. .

[Brief description of the drawings]

【図１】本発明の第１の実施の形態におけるＣＥＬＰ音
声符号化装置の音源生成部の構成を示すブロック図FIG. 1 is a block diagram showing a configuration of a sound source generation unit of a CELP speech coding apparatus according to a first embodiment of the present invention.

【図２】本発明の第１の実施の形態における振幅強調窓
の形状と適応符号ベクトルおよびピッチパルス位置の関
係を表す模式図FIG. 2 is a schematic diagram illustrating the relationship between the shape of an amplitude enhancement window, an adaptive code vector, and a pitch pulse position according to the first embodiment of the present invention.

【図３】本発明の第１の実施の形態の変形例におけるＣ
ＥＬＰ音声符号化装置の音源生成部の構成を示すブロッ
ク図FIG. 3 shows C in a modification of the first embodiment of the present invention.
FIG. 2 is a block diagram illustrating a configuration of a sound source generation unit of the ELP speech encoding device.

【図４】本発明の第２の実施の形態におけるＣＥＬＰ音
声符号化装置の音源生成部の構成を示すブロック図FIG. 4 is a block diagram showing a configuration of a sound source generation unit of a CELP speech coding apparatus according to a second embodiment of the present invention.

【図５】本発明の第３の実施の形態におけるＣＥＬＰ音
声符号化装置の音源生成部の構成を示すブロック図FIG. 5 is a block diagram illustrating a configuration of a sound source generation unit of a CELP speech coding apparatus according to a third embodiment of the present invention.

【図６】本発明の第３の実施の形態におけるパルス位置
近傍限定ベクトルの配置の様子を示す模式図FIG. 6 is a schematic diagram showing an arrangement of a pulse position vicinity limited vector according to a third embodiment of the present invention.

【図７】本発明の第３の実施の形態におけるパルス位置
近傍限定ベクトルの配置の様子を示す模式図（続き）FIG. 7 is a schematic diagram showing an arrangement of a pulse position neighborhood limited vector according to the third embodiment of the present invention (continued).

【図８】本発明の第４の実施の形態におけるＣＥＬＰ音
声符号化装置の音源生成部の構成を示すブロック図FIG. 8 is a block diagram showing a configuration of a sound source generation unit of a CELP speech coding apparatus according to a fourth embodiment of the present invention.

【図９】本発明の第４の実施の形態におけるパルス音源
探索範囲を示す模式図FIG. 9 is a schematic diagram showing a pulse sound source search range according to a fourth embodiment of the present invention.

【図１０】本発明の第４の実施の形態におけるパルス音
源探索範囲を示す模式図（続き）FIG. 10 is a schematic diagram showing a pulse sound source search range according to the fourth embodiment of the present invention (continued).

【図１１】（ａ）本発明の第5の実施の形態における探
索位置算出器の構成を示すブロック図（ｂ）、（ｃ）パルス探索位置パターンの一例を示す模
式図11A is a block diagram illustrating a configuration of a search position calculator according to a fifth embodiment of the present invention. FIG. 11B is a schematic diagram illustrating an example of a pulse search position pattern.

【図１２】本発明の第6の実施の形態におけるＣＥＬＰ
型音声符号化装置の音源生成部の構成を示すブロック図FIG. 12 shows CELP according to a sixth embodiment of the present invention.
Block diagram showing a configuration of a sound source generation unit of the type speech coding apparatus

【図１３】（ａ）〜（ｄ）本発明の第６の実施の形態に
おける探索位置算出器で算出されるパルス探索位置の一
例を示す模式図13A to 13D are schematic diagrams illustrating an example of a pulse search position calculated by a search position calculator according to the sixth embodiment of the present invention.

【図１４】本発明の第７の実施の形態におけるＣＥＬＰ
型音声符号化装置の音源生成部の構成を示すブロック図FIG. 14 shows a CELP according to a seventh embodiment of the present invention.
Block diagram showing a configuration of a sound source generation unit of the type speech coding apparatus

【図１５】本発明の第８の実施の形態におけるＣＥＬＰ
型音声符号化装置の音源生成部の構成を示すブロック図FIG. 15 shows a CELP according to an eighth embodiment of the present invention.
Block diagram showing a configuration of a sound source generation unit of the type speech coding apparatus

【図１６】（ａ）、（ｂ）本発明の第8の実施の形態に
用いられる固定探索位置パターンの一例を示す一覧図FIGS. 16A and 16B are list views showing examples of fixed search position patterns used in the eighth embodiment of the present invention.

【図１７】本発明の第9の実施の形態におけるＣＥＬＰ
型音声符号化装置の音源生成部の構成を示すブロック図FIG. 17 shows a CELP according to a ninth embodiment of the present invention.
Block diagram showing a configuration of a sound source generation unit of the type speech coding apparatus

【図１８】本発明の第１０の実施の形態におけるＣＥＬ
Ｐ型音声符号化装置の音源生成部の構成を示すブロック
図FIG. 18 shows a CEL according to a tenth embodiment of the present invention.
FIG. 2 is a block diagram illustrating a configuration of a sound source generation unit of the P-type speech encoding device.

【図１９】本発明の第１０の実施の形態のピッチピーク
位置予測器における予測原理を表す模式図FIG. 19 is a schematic diagram illustrating a prediction principle in a pitch peak position predictor according to a tenth embodiment of the present invention.

【図２０】本発明の第１１の実施の形態におけるＣＥＬ
Ｐ型音声符号化装置の音源生成部の構成を示すブロック
図FIG. 20 shows a CEL according to an eleventh embodiment of the present invention.
FIG. 2 is a block diagram illustrating a configuration of a sound source generation unit of the P-type speech encoding device.

【図２１】本発明の第１２の実施の形態におけるＣＥＬ
Ｐ型音声符号化装置の音源生成部の構成を示すブロック
図FIG. 21 shows a CEL according to a twelfth embodiment of the present invention.
FIG. 2 is a block diagram illustrating a configuration of a sound source generation unit of the P-type speech encoding device.

【図２２】本発明の第１２の実施の形態における探索位
置算出器が出力するある音源パルスの探索位置パターン
と、インデックス更新手段を備えない場合の各位置に対
応するインデックスと、インデックス更新手段を備えた
場合の各位置に対応するインデックスをそれぞれ示す模
式図FIG. 22 shows a search position pattern of a sound source pulse output by a search position calculator in the twelfth embodiment of the present invention, an index corresponding to each position when no index updating means is provided, and an index updating means. Schematic diagram showing the index corresponding to each position when equipped

【図２３】本発明の第１３の実施の形態におけるＣＥＬ
Ｐ型音声符号化装置の音源生成部の構成を示すブロック
図FIG. 23 shows a CEL according to a thirteenth embodiment of the present invention.
FIG. 2 is a block diagram illustrating a configuration of a sound source generation unit of the P-type speech encoding device.

【図２４】（ａ）本発明の第１３の実施の形態における
探索位置算出器が出力する音源パルス探索位置のパター
ンおよび各位置に対応する相対位置と絶対位置の対応を
示す模式図（ｂ）本発明の第１３の実施の形態におけるパルス番号
およびインデックスの更新手段を備えない場合に、各音
源パルスに割り当てられるパルス番号およびインデック
スを示す模式図（ｃ）本発明の第１３の実施の形態におけるパルス番号
およびインデックスの更新手段を備えた場合に、各音源
パルスに割り当てられるパルス番号およびインデックス
を示す模式図FIG. 24 (a) is a schematic diagram showing a pattern of a sound source pulse search position outputted by a search position calculator in a thirteenth embodiment of the present invention, and a correspondence between a relative position and an absolute position corresponding to each position. Schematic diagram showing a pulse number and an index assigned to each sound source pulse in a case where the means for updating a pulse number and an index in the thirteenth embodiment of the present invention is not provided. (C) In the thirteenth embodiment of the present invention Schematic diagram showing pulse numbers and indices assigned to each sound source pulse when a pulse number and index updating means are provided.

【図２５】本発明の第１４の実施の形態におけるＣＥＬ
Ｐ型音声符号化装置の音源生成部の構成を示すブロック
図FIG. 25 shows a CEL according to a fourteenth embodiment of the present invention.
FIG. 2 is a block diagram illustrating a configuration of a sound source generation unit of the P-type speech encoding device.

【図２６】（ａ）本発明の第１４の実施の形態で用いら
れる固定探索位置パターンの一例を表す模式図（ｂ）、（ｃ）本発明の第１４の実施の形態で用いられ
る探索位置算出器で生成される音源パルス探索位置のパ
ターンの一例を示す模式図（ｄ）本発明の第１４の実施の形態のパルス位置探索器
において用いられる音源パルス探索位置のパターンの一
例を示す模式図FIGS. 26A and 26B are schematic diagrams showing an example of a fixed search position pattern used in the fourteenth embodiment of the present invention. FIGS. 26B and 26C are search positions used in the fourteenth embodiment of the present invention. Schematic diagram illustrating an example of a pattern of a sound source pulse search position generated by a calculator. (D) Schematic diagram illustrating an example of a pattern of a sound source pulse search position used in the pulse position search device according to the fourteenth embodiment of the present invention.

【図２７】本発明の第１５の実施の形態におけるＣＥＬ
Ｐ型音声符号化装置の音源生成部の構成を示すブロック
図FIG. 27 shows a CEL according to a fifteenth embodiment of the present invention.
FIG. 2 is a block diagram illustrating a configuration of a sound source generation unit of the P-type speech encoding device.

【図２８】（ａ）、（ｂ）ピッチピーク算出器において
ピッチピークとセカンドピークを誤る適応符号ベクトル
波形の一例を示す模式図（ｃ）ピッチピーク位置補正器においてピッチピーク位
置を探索する範囲を図示した適応符号ベクトル波形の一
例を示す模式図FIGS. 28A and 28B are schematic diagrams showing an example of an adaptive code vector waveform in which a pitch peak and a second peak are incorrect in a pitch peak calculator. FIG. Schematic diagram showing an example of the illustrated adaptive code vector waveform

【図２９】本発明の第１６の実施の形態におけるＣＥＬ
Ｐ型音声符号化装置の音源生成部の構成を示すブロック
図FIG. 29 shows a CEL according to a sixteenth embodiment of the present invention.
FIG. 2 is a block diagram illustrating a configuration of a sound source generation unit of the P-type speech encoding device.

【図３０】本発明の第１７の実施の形態におけるＣＥＬ
Ｐ型音声符号化装置の音源生成部の構成を示すブロック
図FIG. 30 shows a CEL according to a seventeenth embodiment of the present invention.
FIG. 2 is a block diagram illustrating a configuration of a sound source generation unit of the P-type speech encoding device.

【図３１】従来の一般的なＣＥＬＰ音声符号化装置の音
源生成部の構成を示すブロック図FIG. 31 is a block diagram showing a configuration of a sound source generation unit of a conventional general CELP speech coding apparatus.

【図３２】従来の雑音音源のピッチ周期化部を有するＣ
ＥＬＰ音声符号化装置の音源生成部の構成を示すブロッ
ク図FIG. 32 shows a conventional noise source having a pitch periodicization unit C
FIG. 2 is a block diagram illustrating a configuration of a sound source generation unit of the ELP speech encoding device.

[Explanation of symbols]

１１適応符号帳１２ピッチピーク位置算出器１３振幅強調窓生成器１４雑音符号帳１５周期化器１６振幅強調窓掛け器２１適応符号帳２２ピッチピーク位置算出器２３振幅強調窓生成器２４雑音符号帳２５振幅強調窓掛け器３１パルス列音源３２振幅強調窓生成器３３加算器３４雑音音源３５乗算器４１適応符号帳４２位相探索器４３ピッチパルス位置近傍限定型雑音符号帳４４雑音ベクトル生成器４５周期化器５１適応符号帳５２ピッチピーク位置算出器５３探索範囲算出器５４パルス位置探索器５５ピッチゲイン乗算器５６パルス音源ゲイン乗算器５７加算器６１パルス探索位置パターン選択器６２パルス探索位置決定器７１適応符号帳７２ピッチピーク位置算出器７３パルス数決定器７４探索位置算出器７５パルス位置探索器７６乗算器７７乗算器７８加算器８１適応符号帳８２ピッチピーク位置算出器８３パルス数決定器８４探索位置算出器８５パルス位置探索器８６加算器８７パルス振幅算出器８８乗算器８９乗算器９０加算器９１適応符号帳９２ピッチピーク位置算出器９３パルス数決定器９４探索位置算出器９５遅延器９６判定器９７パルス位置探索器９８スイッチ９９乗算器１００乗算器１０１加算器１１１適応符号帳１１２ピッチピーク位置算出器１１３パルス数決定器１１４探索位置算出器１１５スイッチ１１６ピッチゲイン算出器１１７量子化器１１８判定器１１９パルス位置探索器１２０加算器１２１差分量子化器１２２加算器１２３乗算器１２４乗算器１２５加算器１８０１適応符号帳１８０２ピッチピーク位置算出器１８０３遅延器１８０４遅延器１８０５ピッチピーク位置予測器１８０６判定器１８０７探索位置算出器１８０８スイッチ１８０９パルス位置探索器１８１０乗算器１８１１加算器１８１２乗算器２００１適応符号帳２００２ピッチピーク位置算出器２００３パルス性判定器２００４探索位置算出器２００５スイッチ２００６パルス位置探索器２００７乗算器２００８加算器２００９乗算器２１０１適応符号帳２１０２ピッチピーク位置算出器２１０３探索位置算出器２１０４インデックス更新手段２１０５パルス位置探索器２１０６乗算器２１０７乗算器２１０８加算器２３０１適応符号帳２３０２ピッチピーク位置算出器２３０３探索位置算出器２３０４パルス番号およびインデックスの更新手段２３０５パルス位置探索器２３０６乗算器２３０７乗算器２３０８加算器２５０１適応符号帳２５０２ピッチピーク位置算出器２５０３探索位置算出器２５０４加算器２５０５パルス位置探索器２５０６乗算器２５０７乗算器２５０８加算器２７０１適応符号帳２７０２ピッチピーク位置算出器２７０３ピッチピーク位置補正器２７０４探索位置算出器２７０５パルス位置探索器２７０６乗算器２７０７乗算器２７０８加算器２９０１適応符号帳２９０２ピッチピーク位置算出器２９０３ピッチピーク探索範囲限定器２９０４遅延器２９０５遅延器２９０６探索位置算出器２９０７パルス位置探索器２９０８乗算器２９０９乗算器２９１０加算器３００１適応符号帳３００２ピッチピーク位置算出器３００３探索位置算出器３００４パルス位置探索器３００５選択器３００６パルス位置探索器３００７乗算器３００８乗算器３００９加算器 DESCRIPTION OF SYMBOLS 11 Adaptive codebook 12 Pitch peak position calculator 13 Amplitude emphasis window generator 14 Noise codebook 15 Periodizer 16 Amplitude emphasis window multiplier 21 Adaptive codebook 22 Pitch peak position calculator 23 Amplitude emphasis window generator 24 Noise codebook 25 Amplitude emphasis window multiplier 31 Pulse train sound source 32 Amplitude emphasis window generator 33 Adder 34 Noise source 35 Multiplier 41 Adaptive codebook 42 Phase searcher 43 Pitch pulse position neighborhood limited noise codebook 44 Noise vector generator 45 Periodization Unit 51 adaptive codebook 52 pitch peak position calculator 53 search range calculator 54 pulse position searcher 55 pitch gain multiplier 56 pulse excitation gain multiplier 57 adder 61 pulse search position pattern selector 62 pulse search position determiner 71 adaptation Codebook 72 Pitch peak position calculator 73 Pulse number determiner 7 Search position calculator 75 Pulse position searcher 76 Multiplier 77 Multiplier 78 Adder 81 Adaptive codebook 82 Pitch peak position calculator 83 Number of pulses determiner 84 Search position calculator 85 Pulse position searcher 86 Adder 87 Pulse amplitude calculation Unit 88 multiplier 89 multiplier 90 adder 91 adaptive codebook 92 pitch peak position calculator 93 pulse number determiner 94 search position calculator 95 delay unit 96 determiner 97 pulse position searcher 98 switch 99 multiplier 100 multiplier 101 Adder 111 adaptive codebook 112 pitch peak position calculator 113 pulse number determiner 114 search position calculator 115 switch 116 pitch gain calculator 117 quantizer 118 determiner 119 pulse position searcher 120 adder 121 difference quantizer 122 Adder 123 Multiplier 124 Arithmetic unit 125 adder 1801 adaptive codebook 1802 pitch peak position calculator 1803 delay unit 1804 delay unit 1805 pitch peak position predictor 1806 determiner 1807 search position calculator 1808 switch 1809 pulse position searcher 1810 multiplier 1811 adder 1812 multiplication 2001 Adaptive codebook 2002 Pitch peak position calculator 2003 Pulsability determinator 2004 Search position calculator 2005 Switch 2006 Pulse position searcher 2007 Multiplier 2008 Adder 2009 Multiplier 2101 Adaptive codebook 2102 Pitch peak position calculator 2103 Search position Calculator 2104 Index updating means 2105 Pulse position searcher 2106 Multiplier 2107 Multiplier 2108 Adder 2301 Adaptive codebook 2302 Pitch peak position calculation 2303 Search position calculator 2304 Pulse number and index updating means 2305 Pulse position searcher 2306 Multiplier 2307 Multiplier 2308 Adder 2501 Adaptive codebook 2502 Pitch peak position calculator 2503 Search position calculator 2504 Adder 2505 Pulse position search 2506 Multiplier 2507 Multiplier 2508 Adder 2701 Adaptive codebook 2702 Pitch peak position calculator 2703 Pitch peak position corrector 2704 Search position calculator 2705 Pulse position searcher 2706 Multiplier 2707 Multiplier 2708 Adder 2901 Adaptive codebook 2902 Pitch peak position calculator 2903 Pitch peak search range limiter 2904 Delay device 2905 Delay device 2906 Search position calculator 2907 Pulse position search device 2908 Multiplier 2 09 multipliers 2910 adder 3001 adaptive codebook 3002 pitch peak position calculator 3003 search position calculator 3004 pulse position searcher 3005 selector 3006 pulse position searcher 3007 multiplier 3008 multiplier 3009 adder

Claims

[Claims]

1. A speech encoding apparatus comprising: a CELP-type speech encoding apparatus; and a sound source generation unit for enhancing an amplitude of a noise code vector corresponding to a pitch peak position of an adaptive code vector.

2. A sound source generating unit which multiplies a noise code vector by an amplitude emphasis window synchronized with a pitch period of the adaptive code vector, thereby enhancing the amplitude of the noise code vector corresponding to the position of the pitch peak of the adaptive code vector. The speech encoding device according to claim 1.

3. The speech encoding apparatus according to claim 2, wherein the sound source generation unit uses a triangular window centered on a pitch peak position of the adaptive code vector as an amplitude emphasizing window.

4. A speech encoding apparatus comprising a CELP-type speech encoding apparatus and an excitation generation unit using a noise code vector limited only to a vicinity of a pitch peak of an adaptive code vector.

5. A CEL using a pulse excitation as a noise codebook
A P-type speech encoding device, comprising a sound source generation unit that determines a search range of a pulse position by a pitch period and a pitch peak position of an adaptive code vector.

6. The speech coding apparatus according to claim 5, wherein the sound source generation unit determines the search range of the pulse position so that the vicinity of the pitch peak position of the adaptive code vector is dense and the other parts are sparse.

7. The speech coding apparatus according to claim 5, wherein a search range of a pulse position is switched according to a pitch period.

8. The speech coding according to claim 7, wherein when a plurality of pitch peaks are present in the adaptive code vector, the search range of the pulse position is limited so that at least two pitch peak positions are included in the search range. apparatus.

9. A CLP-type speech coding apparatus having a configuration in which a noise codebook is switched according to a speech analysis result.
ELP type speech coding device.

10. A CELP-type speech coding apparatus,
A speech coding apparatus including a sound source generation unit that switches a noise codebook using transmission parameters extracted before performing a noise codebook search.

11. The speech encoding apparatus according to claim 5, further comprising a sound source generation unit that switches the number of pulses according to a result of the speech signal analysis.

12. The speech according to claim 5, further comprising a sound source generation unit that switches the number of pulses using a transmission parameter extracted before performing a random codebook search. Encoding device.

13. The speech encoding apparatus according to claim 5, further comprising a sound source generation unit that switches the number of pulses according to a pitch period.

14. The speech coding apparatus according to claim 13, wherein the number of pulses is switched between a case where the fluctuation of the pitch period between successive subframes is small and a case where the fluctuation is not so.

15. The noise code vector generation unit using a pulsed sound source as a noise source determines the pulse amplitude prior to the pulse position search. A speech encoding device according to claim 1.

16. The speech coding apparatus according to claim 15, wherein the noise code vector generation unit using a pulse sound source as the noise sound source changes the pulse amplitude in the vicinity of the pitch peak of the adaptive code vector and in other parts.

17. The speech encoding apparatus according to claim 13, wherein the number of pulses of a pulse sound source to be used is determined based on a pitch period, either statistically or by learning.

18. A CELP-type speech coding apparatus, comprising:
Equipped with an excitation generator that quantizes the pitch gain in multiple stages, the first stage uses the value obtained immediately after the adaptive codebook search as the quantization target, and the second and subsequent stages are determined by closed loop search after completing all the excitation search An audio encoding device that uses a difference between a pitch gain and a value quantized in a first stage as a quantization target.

19. The fixed codebook is switched using the quantization value of the pitch gain obtained immediately after the adaptive codebook search of the speech coding apparatus according to claim 18.
The speech encoding device according to claim 2 or any one of claims 15 to 17.

20. The fixed codebook is switched based on a change in pitch period between subframes.
20. The speech encoding device according to any one of claims 15 to 19.

21. The speech encoding apparatus according to claim 9, wherein the fixed codebook is switched using the pitch gain quantized in the immediately preceding subframe.

22. The speech coding apparatus according to claim 9, wherein the fixed codebook is switched based on a change in pitch period between sub-frames and a quantization pitch gain. .

23. The speech coding apparatus according to claim 19, wherein a pulse excitation codebook is used as the fixed codebook.

24. A CELP-type speech coding apparatus that performs a speech coding process for each subframe having a predetermined time length, and determines whether a phase in a current subframe and a phase in a previous subframe are continuous. An audio encoding device that determines and switches a sound source to be used when it is determined to be continuous and when it is determined to be not continuous.

25. Predict a pitch peak position in a current subframe using a pitch peak position in the immediately preceding subframe, a pitch period in the immediately preceding subframe, and a pitch period in the current subframe.
Depending on whether the pitch peak position in the current subframe obtained by this prediction is close to the pitch peak position obtained only from data in the current subframe, the phase in the immediately preceding subframe and the phase in the current subframe are 26. The CELP-type speech encoding apparatus according to claim 24, wherein it is determined whether or not the sound source is continuous, and the encoding processing method of the excitation is switched according to the determination result.

26. When it is determined that the phase in the immediately preceding subframe and the phase in the current subframe are continuous, phase adaptive processing is performed on the noise codebook, and the phase in the immediately preceding subframe is determined. 26. The speech encoding apparatus according to claim 24, wherein when it is determined that the phase in the current subframe is not continuous with the phase in the current subframe, the phase adaptation processing is not performed on the noise codebook.

27. A CELP-type speech coding apparatus for performing speech coding processing for each subframe having a predetermined time length, based on a signal power concentration near a pitch peak of an adaptive code vector in a current subframe. , An audio encoding device that switches an encoding method of an excitation signal.

28. When the ratio of the signal power in the vicinity of the pitch peak of the adaptive code vector in the current subframe to the entire signal of one pitch cycle length is equal to or greater than a predetermined value, the phase adaptation processing is performed in the noise codebook. Done against
28. The speech encoding apparatus according to claim 27, wherein when the value is less than a predetermined value, the phase adaptation processing is not performed on the noise codebook.

29. A pulse sound source is applied to a noise sound source as a phase adaptation process, wherein a pulse position search is performed densely near a pitch peak and a pulse position search is performed sparsely in a portion other than the vicinity of a pitch peak. Item 30. The audio encoding device according to Item 28.

30. An index representing the position of a pulse is
30. The speech encoding apparatus according to claim 5, wherein the speech encoding apparatus is arranged so as to be arranged in order from the head side of the subframe.

31. In the case of the same index number, pulse numbers are assigned in order from the head of the subframe, and each pulse is so dense that the vicinity of the pitch peak position is dense and the parts other than the vicinity of the pitch peak are sparse. 31. The speech encoding device according to claim 30, wherein a search position is determined.

32. A part of the pulse search position is determined by the pitch peak position, and the other pulse search positions are fixed positions that are predetermined regardless of the pitch peak position.
Claims 5 to 8 or Claims 11 to 17
30. The speech encoding device according to claim 23.

33. A pitch peak position calculation for determining a pitch peak position of a voice or sound source signal having a predetermined time length, extracting only one pitch period length from the signal and determining a pitch peak position in the extracted signal. The speech encoding apparatus according to any one of claims 1 to 8, or 11 to 17, or 19 to 23, or 25 to 32, comprising means.

34. When extracting only one pitch period length from the signal, first determine a pitch peak position using the entire signal without extracting one pitch period length, and start extracting the determined pitch peak position. 34. The speech encoding apparatus according to claim 33, wherein one pitch period length is cut out as a point, and a pitch peak position is determined in the cut out signal.

35. A CELP-type speech coding apparatus that performs a speech coding process for each subframe having a predetermined time length, calculates a pitch peak position in a current subframe, and calculates a pitch period in a preceding subframe. If the difference from the pitch cycle in the current subframe is within a predetermined range, the pitch peak position in the immediately preceding subframe, the pitch cycle in the immediately preceding subframe, and the pitch cycle in the current subframe are used. Predict the pitch peak position in the current sub-frame, and using the pitch peak position in the current sub-frame obtained by this prediction to limit the range of the pitch peak position in the current sub-frame in advance,
Speech coding according to any one of claims 1 to 8, claim 11 to claim 17, or claim 19 to claim 23, or claim 25 to 32, wherein a pitch peak position search is performed within the range. apparatus.

36. A CELP-type speech coding apparatus for performing speech coding processing for each subframe having a predetermined time length, wherein a pulse excitation is used as a noise codebook, and the noise codebook has at least two or more modes. The number of sound source pulses can be changed by switching the mode. At least one is a mode in which the position information of each pulse is sufficient and the number of pulses is small, and the other is a mode in which the position information of each pulse is insufficient. A speech encoding device that has many modes and switches modes by transmitting mode switching information.

37. When the pitch period is short, the search range of the source pulse is limited to a narrow range corresponding to the pitch period, thereby reducing the position information of the source pulse and increasing the number of source pulses. 36. The speech encoding device according to 36.

38. In a mode in which the position information of each pulse is insufficient but the number of pulses is large, the search position of the sound source pulse is made dense near the pitch peak position, and the search position of the sound source pulse is made sparse in other portions. 38. The speech encoding device according to claim 36, wherein the search range of the pulse position is determined so as to be as follows.

39. A CELP-type speech coding apparatus according to claim 36, wherein in a sound source mode in which the number of pulses is small and the position information is sufficient, a part of the position information is converted to a noise source. An audio encoding device that is assigned to an index representing a code vector.

40. A computer-readable storage medium storing a program for executing the functions of the speech encoding device according to claim 1. Description: