JP3435310B2

JP3435310B2 - Voice coding method and apparatus

Info

Publication number: JP3435310B2
Application number: JP15555297A
Authority: JP
Inventors: 正浩押切; 公生三関; 政巳赤嶺
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1997-06-12
Filing date: 1997-06-12
Publication date: 2003-08-11
Anticipated expiration: 2017-06-12
Also published as: JPH113098A

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、音声信号の圧縮符
号化を行う音声符号化方法に係り、特に音声信号の符号
化処理の中のピッチ周期を求める処理に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice coding method for compressing and coding a voice signal, and more particularly to a process for obtaining a pitch period in a voice signal coding process.

【０００２】[0002]

【従来の技術】音声信号を低ビットレートで高能率に圧
縮符号化する技術は、自動車電話等の移動体通信や企業
内通信において、電波の有効利用や通信コストの削減に
重要な技術である。2. Description of the Related Art A technique of compressing and encoding a voice signal at a low bit rate with high efficiency is an important technique for effective use of radio waves and reduction of communication cost in mobile communications such as car telephones and in-house communications. .

【０００３】８ｋｂｐｓ以下のビットレートで品質の優
れた音声合成が可能な音声符号化方式として、ＣＥＬＰ
（Code Excited Linear Prediction）方式が知られてい
る。ＣＥＬＰ方式は、M.R.Schrodeder氏とB.S.Atal氏に
より、“Code-Excited Linear Prediction(CELP) High-
Quality Speech at Very low Bit Rates”,Proc.ICASS
P;1985,pp.937-939（文献１）で発表されて以来、高品
質な音声が合成できる方式として注目され、品質の改善
や計算量の削減について、種々の検討がなされてきた。CELP is used as a voice coding method capable of performing high quality voice synthesis at a bit rate of 8 kbps or less.
(Code Excited Linear Prediction) method is known. The CELP method was developed by MR Schrodeder and BSAtal in "Code-Excited Linear Prediction (CELP) High-
Quality Speech at Very low Bit Rates ”, Proc.ICASS
Since it was announced in P; 1985, pp.937-939 (reference 1), it has attracted attention as a method capable of synthesizing high-quality speech, and various studies have been made to improve quality and reduce the amount of calculation.

【０００４】ＣＥＬＰ方式による音声符号化を実現する
ための構成要素の一つとして、適応符号帳がある。適応
符号帳は、入力音声信号のピッチ予測分析を閉ループ動
作または合成による分析（Analysis by Synthesis)によ
って行うものである。一般に適応符号帳によるピッチ予
測分析は、２０〜１４７サンプルの探索範囲（１２８候
補）でピッチ周期の探索を行って、目標信号に対する歪
が最小となるピッチ周期を求め、このピッチ周期の情報
を７ビットの符号化データとして伝送することが多い。An adaptive codebook is one of the constituent elements for implementing the CELP audio coding. The adaptive codebook performs pitch prediction analysis of an input speech signal by closed loop operation or analysis by synthesis. In general, pitch prediction analysis using an adaptive codebook searches a pitch period in a search range of 20 to 147 samples (128 candidates) to find a pitch period that minimizes distortion with respect to a target signal, and obtains information on this pitch period from 7 It is often transmitted as bit encoded data.

【０００５】上述した従来のＣＥＬＰ方式では、サブフ
レーム単位に閉ループ動作によってピッチ周期を決定す
るため、前述のようにピッチ周期の探索範囲が１２８候
補と大きい場合、計算量が膨大になってしまう。さら
に、このような直接的なピッチ周期探索法では、ピッチ
周期の情報はサブフレーム当たり７ビット必要となり、
仮に４サブフレームで１フレームが構成されると、フレ
ーム当たり２８ビットものビット数が必要になってしま
う。In the above-mentioned conventional CELP method, the pitch period is determined by the closed loop operation in units of subframes, so that the calculation amount becomes enormous when the pitch period search range is as large as 128 candidates as described above. Furthermore, in such a direct pitch period search method, pitch period information requires 7 bits per subframe,
If one frame is composed of four subframes, a bit number of 28 bits is required for each frame.

【０００６】本来、音声信号のピッチ周期の変動はゆる
やかな部分が多く、サブフレーム毎に全探索を行う必要
はない。このようなピッチ周期の性質を利用して、計算
量の削減およびビット数の削減が可能である。そのよう
な観点から、ピッチ周期の探索範囲を限定する差分ピッ
チ表現を用いる方法が報告されている。Originally, there are many gradual changes in the pitch period of a voice signal, and it is not necessary to perform a full search for each subframe. It is possible to reduce the amount of calculation and the number of bits by using such a property of the pitch period. From such a viewpoint, a method using a differential pitch expression that limits the search range of the pitch cycle has been reported.

【０００７】その一つに、ピッチ周期の探索に際し奇数
サブフレームについては全ての候補探索を行い、偶数サ
ブフレームについては奇数サブフレームの近傍の候補の
みを探索することで、計算量とビット数を削減する方法
がJ.P.Campbell,Jr.氏らによって“An Expandable Erro
r-Protected 4800 bps CELP coder(U.S.Federal Standa
rd 4800 bps Voice Coder)”，Proc.ICASSP;1989,pp.73
5-738 （文献２）で報告されている。この方法によれ
ば、奇数サブフレームについては１２８候補の全探索、
偶数サブフレームについては例えば３２候補に限定して
ピッチ周期の探索を行うと、ピッチ周期の情報量をフレ
ーム当たり２４ビットに削減することができる。One of them is to search all the candidates for odd subframes and only the candidates near the odd subframes for even subframes in the pitch period search, thereby reducing the calculation amount and the number of bits. A way to reduce it is by JP Campbell, Jr. et al. “An Expandable Erro
r-Protected 4800 bps CELP coder (USFederal Standa
rd 4800 bps Voice Coder) ”, Proc.ICASSP; 1989, pp.73
5-738 (reference 2). According to this method, a full search of 128 candidates for odd subframes,
For an even number of subframes, for example, if the pitch period is limited to 32 candidates, the pitch period information amount can be reduced to 24 bits per frame.

【０００８】しかし、この方法では奇数サブフレームで
求めたピッチ周期が実際のピッチ周期と大きく異なる値
が選択された場合、次のサブフレームにまで影響を与え
てしまい、品質劣化が知覚されるという問題が生じる。
実際、サブフレームのように短い区間でピッチ分析を行
うと、実際と異なるピッチ周期が求まりやすくなってし
まう。However, according to this method, when a value in which the pitch period obtained in an odd number of sub-frames is significantly different from the actual pitch period is selected, it affects the next sub-frame and the quality deterioration is perceived. The problem arises.
In fact, if the pitch analysis is performed in a short section such as a subframe, it becomes easy to find a pitch cycle different from the actual one.

【０００９】また、安定的にフレームのピッチ周期を求
め、各サブフレームをそれぞれフレームピッチ周期から
の変動量として表し、その近傍のみピッチ周期を探索す
る方法がM.Yong氏らによって“Efficient Encoding of
the Long-Term Predictor inVector Excitation Coders
”，Boston,MA:Kluwer,1991,pp329-338 （文献３）で
報告されている。この方法によれば、フレームのピッチ
周期の情報を７ビットで表し、サブフレームのピッチ周
期の情報を５ビットで表している。Further, a method of stably obtaining the pitch period of a frame, expressing each subframe as a variation amount from the frame pitch period, and searching the pitch period only in the vicinity thereof is described in "Efficient Encoding of
the Long-Term Predictor inVector Excitation Coders
, Boston, MA: Kluwer, 1991, pp329-338 (reference 3). According to this method, the pitch period information of a frame is represented by 7 bits, and the pitch period information of a subframe is 5 bits. It is expressed in bits.

【００１０】この方法の利点は、全てのサブフレームの
ピッチ周期探索は３２候補だけに対して行えばよく、大
幅な計算量削減が可能となる点にある。しかしながら、
この方法には、フレーム内でピッチ周期が変動するよう
な場合に大きな劣化が生じてしまうという問題点があ
る。この問題点を図１（ａ）を用いて説明する。The advantage of this method is that the pitch period search for all subframes only needs to be performed for 32 candidates, and the amount of calculation can be greatly reduced. However,
This method has a problem that a large deterioration occurs when the pitch period varies within a frame. This problem will be described with reference to FIG.

【００１１】図１（ａ）に示すように、実際のピッチ周
期がフレーム内で変動するような場合、フレームのピッ
チ周期はその中での平均的な周期として求められる。各
サブフレームのピッチ周期は、フレームのピッチ周期の
近傍を探索範囲とする。しかし、探索範囲が十分な大き
さでない場合、図１（ａ）に示すように実際のピッチ周
期を求めることはできない。また、探索範囲を大きくし
てしまうと、それだけピッチ周期の情報を表すのに必要
なビット数が必要になり、計算量やビット削減の効果が
小さくなってしまう。As shown in FIG. 1A, when the actual pitch period fluctuates within a frame, the frame pitch period is obtained as an average period among them. The pitch period of each sub-frame has a search range in the vicinity of the frame pitch period. However, if the search range is not large enough, the actual pitch period cannot be obtained as shown in FIG. Further, if the search range is increased, the number of bits required to express the information of the pitch period is correspondingly increased, and the amount of calculation and the effect of bit reduction are reduced.

【００１２】[0012]

【発明が解決しようとする課題】上述したように、従来
の音声符号化方法であるＣＥＬＰ方式では、サブフレー
ム単位の閉ループ探索によってピッチ周期を求めるた
め、ピッチ周期を求めるのに必要な計算量が膨大とな
り、しかもピッチ周期情報のビット数が増大するという
問題点がある。As described above, in the CELP system, which is a conventional speech coding method, the pitch period is obtained by a closed loop search in subframe units, so that the amount of calculation required to obtain the pitch period is small. There is a problem that the number of bits becomes huge and the number of bits of pitch period information increases.

【００１３】また、文献２に記載されたような探索範囲
を限定してピッチ周期を求める方法では、ピッチ周期を
求めるための計算量およびピッチ周期情報のビット数が
減少するが、ピッチ周期を正しく求めることが難しいと
いう問題点がある。Further, in the method for obtaining the pitch period by limiting the search range as described in the reference 2, the amount of calculation for obtaining the pitch period and the number of bits of the pitch period information are reduced, but the pitch period is set correctly. There is a problem that it is difficult to ask.

【００１４】さらに、文献３に記載されたようにサブフ
レームのピッチ周期をフレームのピッチ周期からの変動
量としてフレームのピッチ周期近傍においてのみ探索す
る方法では、大幅な計算量の削減が可能であるが、フレ
ーム内でピッチ周期が変動するような場合はサブフレー
ムのピッチ周期を正しく求めることができないという問
題点があった。Further, the method of searching the pitch period of the sub-frame as the variation amount from the pitch period of the frame only in the vicinity of the pitch period of the frame as described in Document 3 can significantly reduce the calculation amount. However, there is a problem that the pitch period of the sub-frame cannot be correctly obtained when the pitch period varies within the frame.

【００１５】本発明は、このような従来技術の問題点を
解消するためになされたもので、音声信号のフレーム周
期を少ない計算量で正しく求めることができ、かつピッ
チ周期を少ない情報量で表現できる音声符号化方法を提
供することを目的とする。The present invention has been made in order to solve the above-mentioned problems of the prior art. The frame period of a voice signal can be correctly obtained with a small amount of calculation, and the pitch period can be expressed with a small amount of information. An object of the present invention is to provide a speech coding method capable of performing the speech coding.

【００１６】[0016]

【課題を解決するための手段】上記の課題を解決するた
め、本発明は入力音声信号を予め定められた長さのフレ
ームに分割し、入力音声信号のピッチ周期を求める処理
を含む音声符号化処理において、符号化しようとする現
フレームに対して時間的に未来のフレームのピッチ周期
を求める処理と、このピッチ周期を符号化する処理とを
含むことを特徴とする。In order to solve the above problems, the present invention is a speech coding including a process of dividing an input speech signal into frames of a predetermined length and obtaining a pitch period of the input speech signal. The process is characterized by including a process of obtaining a pitch cycle of a future frame in time with respect to a current frame to be coded, and a process of coding this pitch cycle.

【００１７】また、本発明は入力音声信号を予め定めら
れた長さのフレームに分割し、各フレームの音声信号を
さらにサブフレームに分割して、音声信号のピッチ周期
を求める処理を含む音声符号化処理において、符号化し
ようとする現フレームと現フレームに対して過去のフレ
ームおよび未来のフレームのうちの少なくとも二つのフ
レームのピッチ周期を用いて現フレーム中のサブフレー
ムのピッチ周期予測値を求め、このピッチ周期予測値を
用いて現フレーム中のサブフレームのピッチ周期を求め
ることを特徴とする。Further, according to the present invention, a voice code including a process of dividing an input voice signal into frames of a predetermined length, further dividing the voice signal of each frame into subframes, and obtaining a pitch period of the voice signal. In the coding process, the pitch period prediction value of the subframe in the current frame is obtained by using the pitch periods of at least two frames of the past frame and the future frame for the current frame to be encoded and the current frame. The pitch period prediction value is used to obtain the pitch period of the subframe in the current frame.

【００１８】このように本発明では、現フレームに対し
て未来のフレームのピッチ周期を求めるため、現フレー
ムのピッチ周期や過去のフレームのピッチ周期とともに
用いて、補間により現フレーム中のサブフレームのピッ
チ周期予測値を求め、このピッチ周期予測値を用いて現
フレーム中のサブフレームのピッチ周期を求めることに
より、フレーム内でピッチ周期が変動する場合であって
も、サブフレームのピッチ周期を精度よく、かつ少ない
計算量で求め、しかも少ない情報量で表すことができ
る。As described above, according to the present invention, in order to obtain the pitch period of the future frame with respect to the present frame, the pitch period of the present frame and the pitch period of the past frame are used to interpolate the subframes in the present frame. Even if the pitch cycle varies within a frame, the pitch cycle prediction value is calculated, and the pitch cycle of the subframe in the current frame is calculated using this pitch cycle prediction value. It can be calculated well and with a small amount of calculation, and can be expressed with a small amount of information.

【００１９】図１（ｂ）を用いて説明すると、同図に示
されるように図１（ａ）と同様に実際のピッチ周期はフ
レーム内で変化している。本発明では例えば現フレーム
のピッチ周期と過去のフレームのピッチ周期とで補間を
行い、各サブフレーム毎にサブフレームピッチ周期予測
値を求め、その近傍にサブフレームのピッチ周期の探索
範囲を決定する。このため、実際のピッチ周期は探索範
囲の中に含まれ、ピッチ周期の正確な探索を行うことが
できる。Explaining with reference to FIG. 1B, as shown in FIG. 1A, the actual pitch period changes within the frame as in FIG. 1A. In the present invention, for example, interpolation is performed with the pitch period of the current frame and the pitch period of the past frame, the subframe pitch period prediction value is obtained for each subframe, and the search range of the pitch period of the subframe is determined in the vicinity thereof. . Therefore, the actual pitch period is included in the search range, and the pitch period can be accurately searched.

【００２０】また、サブフレームピッチ周期予測値は実
際のピッチ周期をかなり正確に近似するため、サブフレ
ームのピッチ周期の探索範囲を例えば８候補のように狭
めても問題は生じない。サブフレームピッチ周期の探索
範囲を８候補とすると、フレームのピッチ周期は７ビッ
ト、サブフレーム当たり３ビットで表されるため、４サ
ブフレームで１フレームとすれば、サブフレームのピッ
チ周期は、従来ではフレーム当たり２８ビットで必要で
あったの対し、７ビット＋３ビット＊４＝１９ビットの
情報量で表すことができる。また、サブフレームピッチ
周期の探索範囲が８候補と小さいため、計算量の削減に
も大きく寄与する。Since the predicted value of the subframe pitch period approximates the actual pitch period quite accurately, there is no problem even if the search range of the pitch period of the subframe is narrowed to, for example, 8 candidates. If the search range of the subframe pitch period is 8 candidates, the pitch period of the frame is represented by 7 bits and 3 bits per subframe. Therefore, if 4 subframes are 1 frame, the pitch period of the subframe is In contrast to 28 bits required for each frame, the amount of information can be represented by 7 bits + 3 bits * 4 = 19 bits. Further, since the search range of the subframe pitch period is as small as 8 candidates, it greatly contributes to the reduction of the calculation amount.

【００２１】本発明においては、上記のようにして求め
られた現フレーム中のサブフレームのピッチ周期を符号
化してもよいし、入力される音声信号のピッチ周期成分
を強調するピッチフィルタを有する場合、上記のように
して求められた現フレーム中のサブフレームのピッチ周
期を用いてピッチフィルタの伝達関数を決定するように
してもよい。ピッチフィルタは、例えば聴感重みフィル
タやポストフィルタの一構成要素として知られる。In the present invention, the pitch period of the subframe in the current frame obtained as described above may be encoded, or a pitch filter for emphasizing the pitch period component of the input voice signal may be provided. The transfer function of the pitch filter may be determined using the pitch period of the subframe in the current frame obtained as described above. The pitch filter is known as one component of, for example, a perceptual weighting filter or a post filter.

【００２２】さらに、本発明は過去の駆動信号系列を予
め定められた範囲に含まれる周期で繰り返して生成され
る複数の適応ベクトルを格納した適応符号帳を有し、こ
の適応符号帳から取り出される適応ベクトルを所定のフ
ィルタに通して得られる信号と目標ベクトルとの誤差が
最小となる周期の適応ベクトルを所定の探索範囲から探
索する処理を含む音声符号化処理において、入力音声信
号を予め定められた長さのフレームに分割し、各フレー
ムの音声信号をさらにサブフレームに分割した後、符号
化しようとする現フレームと現フレームに対して過去の
フレームおよび未来のフレームのうちの少なくとも二つ
のフレームのピッチ周期を用いて現フレーム中のサブフ
レームのピッチ周期予測値を求め、このピッチ周期予測
値を用いて現フレーム中のサブフレームについての前記
探索範囲を決定することを特徴とする。Further, the present invention has an adaptive codebook in which a plurality of adaptive vectors generated by repeating a past drive signal sequence in a cycle included in a predetermined range are stored, and is extracted from this adaptive codebook. In a voice encoding process including a process of searching an adaptive vector having a period in which an error between a signal obtained by passing an adaptive vector through a predetermined filter and a target vector is a predetermined search range, an input voice signal is predetermined. At least two of the current frame to be encoded and the past frame and future frame after the audio signal of each frame is further divided into subframes. The pitch period prediction value of the subframe in the current frame is calculated using this pitch period, and this pitch period prediction value is used to calculate the current frame. And determines the search range for the sub-frame in the arm.

【００２３】本発明においては、フレームのピッチ周期
を求める際、フレーム毎にピッチ周期の分析位置を適応
的に決定してもよい。より具体的には、ピッチ周期分析
位置を音声信号、予測残差信号、または低域通過フィル
タ通過後の予測残差信号の短区間パワーの大きさで判定
する。このようにすることにより、ピッチ周期をより正
確に求めることができ、復号音声の品質向上が図られ
る。In the present invention, when the pitch period of the frame is obtained, the analysis position of the pitch period may be adaptively determined for each frame. More specifically, the pitch period analysis position is determined by the magnitude of the short-term power of the voice signal, the prediction residual signal, or the prediction residual signal after passing the low pass filter. By doing so, the pitch period can be obtained more accurately, and the quality of the decoded speech can be improved.

【００２４】また、ピッチ周期の連続性に応じて現フレ
ーム中のサブフレームのピッチ周期を求める方法を選択
してもよい。例えば、ピッチ周期が連続的に変化してい
ると判定された場合は、サブフレームピッチ周期予測値
を求め、この近傍を探索することによりサブフレームピ
ッチ周期を求める。逆に、ピッチ周期の変化が不連続で
あると判定された場合、サブフレームピッチ周期は全探
索によって求める。このような適応処理により、ピッチ
周期の連続性に応じて最適なサブフレームピッチ周期の
探索法が選択されるため、復号音声の品質が向上する。Further, a method of obtaining the pitch period of the subframe in the current frame may be selected according to the continuity of the pitch period. For example, when it is determined that the pitch period is continuously changing, the subframe pitch period prediction value is obtained, and the subframe pitch period is obtained by searching this neighborhood. On the contrary, when it is determined that the change of the pitch period is discontinuous, the subframe pitch period is obtained by the full search. By such an adaptive process, the optimum search method for the subframe pitch period is selected according to the continuity of the pitch period, so that the quality of the decoded speech is improved.

【００２５】また、複数のサブフレームのピッチ周期の
変動を表す相対ピッチパターンを複数個格納した相対ピ
ッチパターン符号帳をさらに有し、サブフレームのピッ
チ周期の変化を相対ピッチパターン符号帳から所定の指
標に基づいて選択した一つの相対ピッチパターンによっ
て表すようにしてもよく、これによりサブフレームピッ
チ周期を表す情報のさらなるビット数削減を図ってい
る。Further, the apparatus further comprises a relative pitch pattern codebook in which a plurality of relative pitch patterns representing fluctuations in the pitch cycle of a plurality of subframes are stored, and a change in the pitch cycle of the subframes is predetermined from the relative pitch pattern codebook. It may be represented by one relative pitch pattern selected on the basis of the index, thereby further reducing the number of bits of the information representing the subframe pitch period.

【００２６】すなわち、相対ピッチパターン符号帳は、
例えば出現頻度の高い相対ピッチパターンをベクトルと
して格納している。これと複数のサブフレームのピッチ
周期をベクトル化したものとマッチングをとり、最も適
した相対ピッチパターンで複数のサブフレームのピッチ
周期を表すようにする。例えば、サブフレームのピッチ
周期を個別に表すのにサブフレーム当たり３ビット必要
であれば、４サブフレーム当たり１２ビット必要になる
が、この４次元ベクトルを相対ピッチパターン符号帳内
の７ビットの大きさを持つ一つの相対ピッチパターンで
表現すれば、フレーム当たり５ビットのビット削減につ
ながる。That is, the relative pitch pattern codebook is
For example, a relative pitch pattern having a high appearance frequency is stored as a vector. This is matched with a vectorized pitch cycle of a plurality of sub-frames, and the pitch cycle of a plurality of sub-frames is represented by the most suitable relative pitch pattern. For example, if 3 bits are required for each subframe to individually represent the pitch period of the subframe, 12 bits are required for every 4 subframes. However, this 4D vector requires a size of 7 bits in the relative pitch pattern codebook. If expressed with one relative pitch pattern having a certain length, it leads to a bit reduction of 5 bits per frame.

【００２７】また、本発明においては入力音声信号を予
め定められた長さのフレームに分割し、入力音声信号の
ピッチ周期を求める処理を含む音声符号化処理を行うた
めのプログラムを記録したコンピュータ読み取り可能な
記録媒体であって、符号化しようとする現フレームに対
して時間的に未来のフレームのピッチ周期を求める処理
と、このピッチ周期を符号化する処理とを実行するため
のプログラムを記録した記録媒体が提供される。Further, according to the present invention, the input voice signal is divided into frames of a predetermined length, and a computer readable recording a program for performing a voice encoding process including a process for obtaining a pitch period of the input voice signal. It is a possible recording medium, and records a program for executing a process of obtaining a pitch period of a temporally future frame with respect to a current frame to be encoded and a process of encoding this pitch period. A recording medium is provided.

【００２８】また、本発明によると入力音声信号を予め
定められた長さのフレームに分割し、各フレームの音声
信号をさらにサブフレームに分割して、音声信号のピッ
チ周期を求める処理を含む音声符号化処理を行うための
プログラムを記録したコンピュータ読み取り可能な記録
媒体であって、符号化しようとする現フレームと現フレ
ームに対して過去のフレームおよび未来のフレームのう
ちの少なくとも二つのフレームのピッチ周期を用いて現
フレーム中のサブフレームのピッチ周期予測値を求め、
このピッチ周期予測値を用いて現フレーム中のサブフレ
ームのピッチ周期を求める処理を実行するためのプログ
ラムを記録した記録媒体が提供される。Further, according to the present invention, the input voice signal is divided into frames of a predetermined length, the voice signal of each frame is further divided into subframes, and a voice including a process of obtaining a pitch period of the voice signal is included. A computer-readable recording medium in which a program for performing an encoding process is recorded, and a current frame to be encoded and a pitch of at least two frames of a past frame and a future frame with respect to the current frame. Calculate the pitch period prediction value of the subframe in the current frame using the period,
There is provided a recording medium recording a program for executing a process of obtaining a pitch period of a subframe in a current frame using the pitch period prediction value.

【００２９】さらに、本発明においては過去の駆動信号
系列を予め定められた範囲に含まれる周期で繰り返して
生成される複数の適応ベクトルを格納した適応符号帳を
有し、この適応符号帳から取り出される適応ベクトルを
所定のフィルタに通して得られる信号と目標ベクトルと
の誤差が最小となる周期の適応ベクトルを所定の探索範
囲から探索する処理を含む音声符号化処理を行うための
プログラムを記録したコンピュータ読み取り可能な記録
媒体であって、入力音声信号を予め定められた長さのフ
レームに分割し、各フレームの音声信号をさらにサブフ
レームに分割した後、符号化しようとする現フレームと
現フレームに対して過去のフレームおよび未来のフレー
ムのうちの少なくとも二つのフレームのピッチ周期を用
いて現フレーム中のサブフレームのピッチ周期予測値を
求め、このピッチ周期予測値を用いて現フレーム中のサ
ブフレームについての前記探索範囲を決定する処理を実
行するためのプログラムを記録した記録媒体が提供され
る。Furthermore, the present invention has an adaptive codebook that stores a plurality of adaptive vectors generated by repeating the past drive signal sequence in a cycle included in a predetermined range, and extracts the adaptive codebook from this adaptive codebook. A program for performing a voice coding process including a process for searching an adaptive vector of a period in which the error between the target vector and the signal obtained by passing the adaptive vector through a predetermined filter from a predetermined search range is recorded. A computer-readable recording medium, in which an input audio signal is divided into frames of a predetermined length, the audio signal of each frame is further divided into subframes, and then the current frame and the current frame to be encoded For the current frame using the pitch period of at least two of the past and future frames Determine the pitch period prediction value of the sub-frame, a recording medium recording a program for executing processing for determining the search range for the sub-frame of the current frame by using the pitch period prediction value is provided.

【００３０】[0030]

【発明の実施の形態】以下、本発明による音声符号化方
法の実施形態について説明する。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of a speech coding method according to the present invention will be described below.

【００３１】（第１の実施形態）図２は、本発明の第１
の実施形態に係る音声符号化方法におけるピッチ周期分
析の基本動作を説明するためのブロック図である。同図
において、入力端子１１にはディジタル化された音声信
号（以下、入力音声信号という）が順次入力される。こ
の入力音声信号は、フレーム構成部１２で予め定められ
た長さのフレームと呼ばれる単位に分割される（フレー
ム化）。このフレーム構成部１２でフレーム化された音
声信号はピッチ周期分析部１３によりピッチ周期の分析
が行われ、ピッチ周期の情報が出力端子１４から出力さ
れる。(First Embodiment) FIG. 2 shows a first embodiment of the present invention.
6 is a block diagram for explaining a basic operation of pitch period analysis in the speech coding method according to the embodiment of FIG. In the figure, digitized audio signals (hereinafter referred to as input audio signals) are sequentially input to an input terminal 11. This input audio signal is divided by the frame configuration unit 12 into units called frames each having a predetermined length (frame formation). The pitch cycle analysis section 13 analyzes the pitch cycle of the voice signal framed by the frame configuration section 12, and the pitch cycle information is output from the output terminal 14.

【００３２】次に、フレーム構成部１２の動作を図３を
用いて説明する。便宜上、ここでは後述する第２の実施
形態におけるフレーム・サブフレーム構成部２２（図
４）におけるサブフレーム構成部の動作についても説明
する。Next, the operation of the frame forming section 12 will be described with reference to FIG. For convenience, the operation of the subframe configuration unit in the frame / subframe configuration unit 22 (FIG. 4) in the second embodiment described later will also be described here.

【００３３】図３は、本実施形態における入力音声信号
のフレーム構造とサブフレーム構造を表す。図３では、
フレームを特定するためにｆ(m) という記号を付してい
る。ここで、ｍ＝０のフレームはこれから符号化しよう
とするフレーム（現フレームという）を表し、ｍ≦−１
のフレームは既に符号化が終了している過去のフレーム
を表す。さらに、ｆｒは現フレームに対して時間的に未
来のフレーム（これを先読み部という）を表す。FIG. 3 shows a frame structure and a subframe structure of an input voice signal in this embodiment. In Figure 3,
The symbol f (m) is added to identify the frame. Here, a frame of m = 0 represents a frame to be encoded (hereinafter referred to as a current frame), and m ≦ −1.
Frame represents a past frame that has already been encoded. Further, fr represents a frame that is temporally future with respect to the current frame (this is referred to as a prefetch section).

【００３４】入力音声信号をｓ(n) と表し、ｓ(0) を現
フレームの先頭に位置する音声信号とすると、ｓ(n) と
ｆ(m) およびｆｒの間には次のような関係が成り立つ。Letting s (n) be the input voice signal and s (0) being the voice signal located at the beginning of the current frame, the following is given between s (n) and f (m) and fr: Relationship is established.

【００３５】ｆ(m) ＝｛ｓ(n) ；ｎ＝m*ＮＦ〜(m+1)*ＮＦ−１｝（１）（ｍ≦０）ｆｒ＝｛ｓ(n) ；ｎ＝ＮＦ〜ＮＦ＋ＮＤ−１｝（２）ここで、ＮＦはフレーム長、ＮＤは先読み部の長さを表
す。本実施形態ではＮＦ＝１６０、ＮＤ＝１２０として
説明する。F (m) = {s (n); n = m * NF to (m + 1) * NF-1} (1) (m ≦ 0) fr = {s (n); n = NF to NF + ND-1} (2) Here, NF represents the frame length, and ND represents the length of the prefetch section. In this embodiment, NF = 160 and ND = 120 will be described.

【００３６】また、フレームｆ(0) のサブフレームをｓ
ｆ(k) とすると、ｓ(m) とｓｆ(k)の間には次のような
関係が成り立つ。ｓｆ(k) ＝｛ｓ(n) ；ｎ＝k*ＮＳＦ〜(k+1)*ＮＳＦ−１｝（０≦ｋ≦Ｋ−１）（３）ここで、ＮＳＦはサブフレーム長、Ｋはサブフレーム数
をそれぞれ表す。本実施形態ではＮＳＦ＝４０、Ｋ＝４
として説明を行う。また、ｎはサンプルを表す変数、ｍ
はフレームを表す変数、ｋはサブフレームを表す変数で
ある。Also, the subframe of the frame f (0) is s
If f (k), the following relationship holds between s (m) and sf (k). sf (k) = {s (n); n = k * NSF to (k + 1) * NSF-1} (0≤k≤K-1) (3) where NSF is the subframe length and K is Each represents the number of subframes. In this embodiment, NSF = 40, K = 4
Will be described as. Further, n is a variable representing a sample, m
Is a variable that represents a frame, and k is a variable that represents a subframe.

【００３７】このようにフレームとサブフレームを構成
した後に、ピッチ周期分析部１３においてピッチ周期分
析を行う。ピッチ周期分析部１３は、先読み部ｆｒに分
析の中心を置いてピッチ周期の分析を行う点に特徴があ
る。分析の中心の定義は後で述べる。After the frame and sub-frames are constructed in this way, the pitch cycle analysis unit 13 performs pitch cycle analysis. The pitch period analysis unit 13 is characterized in that the pitch period is analyzed with the analysis centered on the look-ahead unit fr. The definition of the center of analysis will be described later.

【００３８】ピッチ周期の分析には既存の技術を適用で
きる。例えば、「音声のディジタル信号処理」コロナ社
発行（文献４）にいくつかピッチ分析法が紹介されてい
る。本実施形態では、次のようなピッチ周期分析法を用
いている。Existing techniques can be applied to the analysis of the pitch period. For example, some pitch analysis methods are introduced in "Digital signal processing of voice", published by Corona (Reference 4). In this embodiment, the following pitch period analysis method is used.

【００３９】まず、図示しないＬＰＣ分析部において、
入力音声信号ｓ(n) を分析してＬＰＣ係数を求め、この
ＬＰＣ係数を使って次式で表される伝達関数Ａ(z) の予
測フィルタを構成する。First, in an LPC analysis section (not shown),
The input speech signal s (n) is analyzed to find the LPC coefficient, and the LPC coefficient is used to construct a prediction filter of the transfer function A (z) represented by the following equation.

【００４０】[0040]

【数１】 [Equation 1]

【００４１】ここで、α(i) はＬＰＣ係数、ＩＰは分析
次数をそれぞれ表す。本実施形態では、ＩＰ＝１０とし
て説明を進める。この伝達関数Ａ(z) の予測フィルタに
音声信号ｓ(n) を通して、予測残差信号ｅ(n) を次式に
従い求める。Here, α (i) represents the LPC coefficient, and IP represents the analysis order. In this embodiment, the description will proceed with IP = 10. The prediction residual signal e (n) is obtained according to the following expression by passing the speech signal s (n) through the prediction filter of this transfer function A (z).

【００４２】[0042]

【数２】 [Equation 2]

【００４３】次に、予測残差信号ｅ(n) にハミング窓を
掛けて窓掛け信号ｕ(n) を求め、この窓掛け信号ｕ(n)
の相関分析を行い、次式に示す相関値ρ(t) を用いてピ
ッチ周期を抽出する。Next, a Hamming window is applied to the prediction residual signal e (n) to obtain a windowing signal u (n), and this windowing signal u (n) is obtained.
Then, the pitch period is extracted using the correlation value ρ (t) shown in the following equation.

【００４４】[0044]

【数３】 [Equation 3]

【００４５】ここで、ＮＷはピッチ周期分析長、ＮＣは
ピッチ周期分析位置をそれぞれ表す。本実施形態では、
ＮＷ＝１６０、ＮＣ＝２００として説明を行う。また、
相関値ρ(t) の分析範囲は｛２０≦ｔ≦１４７｝とす
る。Here, NW represents the pitch period analysis length, and NC represents the pitch period analysis position. In this embodiment,
The description will be made assuming that NW = 160 and NC = 200. Also,
The analysis range of the correlation value ρ (t) is {20 ≦ t ≦ 147}.

【００４６】そして、相関値ρ(t) が最大となるときの
ｔがフレームのピッチ周期Ｔの情報として出力端子１４
から出力される。このピッチ周期Ｔの情報は例えば符号
化され、図示しない復号器へ伝送される。Then, t when the correlation value ρ (t) becomes maximum is output terminal 14 as the information of the pitch period T of the frame.
Is output from. The information on the pitch period T is encoded, for example, and transmitted to a decoder (not shown).

【００４７】窓関数にハミング窓や方形窓のような対称
窓を用いた場合、ピッチ分析位置は窓関数の中心と考え
ることができる。また、窓の形が左と右で異なる非対称
窓を用いた場合、その分析位置は対称窓と同様に中心部
と考えることはできない。この場合、本実施形態では窓
関数の振幅値が最大となる位置を中心としてみなすこと
にする。When a symmetrical window such as a Hamming window or a rectangular window is used as the window function, the pitch analysis position can be considered as the center of the window function. Also, when an asymmetric window having different left and right window shapes is used, the analysis position cannot be considered to be the central portion like the symmetrical window. In this case, in the present embodiment, the position where the amplitude value of the window function is maximum is regarded as the center.

【００４８】このように本実施形態では、現フレームに
対して未来のフレームのピッチ周期を求めるため、これ
を後述するように現フレームのピッチ周期や過去のフレ
ームのピッチ周期とともに用いて、補間により現フレー
ム中のサブフレームのピッチ周期予測値を求め、このピ
ッチ周期予測値を用いて現フレーム中のサブフレームの
ピッチ周期を求めることにより、フレーム内でピッチ周
期が変動する場合であっても、サブフレームのピッチ周
期を精度よく、かつ少ない計算量で求め、しかも少ない
情報量で表すことができる。As described above, in the present embodiment, the pitch period of the future frame is calculated with respect to the present frame. Therefore, this is used together with the pitch period of the current frame and the pitch period of the past frame to interpolate, as will be described later. Obtaining the pitch period prediction value of the subframe in the current frame, by obtaining the pitch period of the subframe in the current frame using this pitch period prediction value, even if the pitch period varies in the frame, It is possible to obtain the pitch period of a subframe with high accuracy and a small amount of calculation, and to represent it with a small amount of information.

【００４９】（第２の実施形態）図４は、本発明の第２
の実施形態に係る音声符号化方法におけるサブフレーム
ピッチ抽出の基本動作を説明するためのブロック図であ
る。同図において、入力端子２１へのディジタル化され
た入力音声信号は、フレーム・サブフレーム構成部２２
でフレームに分割され、各フレームはさらにサブフレー
ムに分割される。このフレーム・サブフレーム構成部２
２でフレーム化された音声信号およびサブフレーム化さ
れた音声信号は、ピッチ周期分析部２３およびサブフレ
ームピッチ周期抽出部２４に入力される。ピッチ周期分
析部２３では、ピッチ周期Ｔの分析が行われ、ピッチ周
期Ｔの情報がサブフレームピッチ周期抽出部２４に入力
される。(Second Embodiment) FIG. 4 shows a second embodiment of the present invention.
6 is a block diagram for explaining a basic operation of subframe pitch extraction in the speech encoding method according to the embodiment of FIG. In the figure, the digitized input audio signal to the input terminal 21 is the frame / subframe configuration section 22.
Are divided into frames, and each frame is further divided into subframes. This frame / subframe construction part 2
The speech signal framed in 2 and the speech signal subframed are input to the pitch cycle analysis unit 23 and the subframe pitch cycle extraction unit 24. The pitch cycle analysis unit 23 analyzes the pitch cycle T, and the information of the pitch cycle T is input to the subframe pitch cycle extraction unit 24.

【００５０】サブフレームピッチ周期抽出部２４では、
ピッチ周期分析部２３で求められたピッチ周期Ｔを基
に、サブフレーム毎のピッチ周期ＳＴ(k) が求められ、
このサブフレームピッチ周期ＳＴ(k) の情報が出力端子
２５から出力される。In the subframe pitch period extraction section 24,
Based on the pitch cycle T calculated by the pitch cycle analysis unit 23, the pitch cycle ST (k) for each subframe is calculated,
Information on the subframe pitch period ST (k) is output from the output terminal 25.

【００５１】次に、図５を用いて本実施形態の特徴部分
であるサブフレームピッチ周期抽出部２４について説明
する。図５はサブフレームピッチ周期抽出部２４の構成
を示すブロック図であり、入力端子１０１に入力される
図４のピッチ周期分析部２３からのピッチ周期Ｔの情報
を用いて、サブフレームピッチ周期予測値算出部１０２
でサブフレーム毎にサブフレームピッチ周期予測値を算
出する。サブフレームピッチ周期算出部１０４では、サ
ブフレームピッチ周期予測値算出部１０２で求めたサブ
フレームピッチ周期予測値を基に、入力端子１０３に入
力される図４のフレーム・サブフレーム構成部２２から
の音声信号を用いてサブフレーム毎にサブフレームピッ
チ周期ＳＴ(k) を算出し、このサブフレームピッチ周期
ＳＴ(k)の情報を出力端子１０５へ出力する。Next, the subframe pitch period extraction section 24, which is a characteristic part of this embodiment, will be described with reference to FIG. FIG. 5 is a block diagram showing the configuration of the subframe pitch period extraction unit 24. Using the information of the pitch period T from the pitch period analysis unit 23 of FIG. 4 input to the input terminal 101, the subframe pitch period prediction is performed. Value calculation unit 102
Then, a subframe pitch cycle prediction value is calculated for each subframe. The subframe pitch period calculation unit 104 receives the subframe pitch period prediction value calculated by the subframe pitch period prediction value calculation unit 102 from the frame / subframe configuration unit 22 of FIG. 4 which is input to the input terminal 103. The subframe pitch period ST (k) is calculated for each subframe using the audio signal, and the information of the subframe pitch period ST (k) is output to the output terminal 105.

【００５２】図６を用いて、サブフレームピッチ周期の
算出方法をさらに詳細に説明する。まず、ピッチ周期分
析部２３において現フレームｆ(0) で求めたピッチ周期
をＴ(0) とし、過去のフレームｆ(-1)で求めたピッチ周
期をＴ(-1)とする。サブフレームピッチ周期抽出部２４
では、これらの２つのピッチ周期を用いてサブフレーム
ピッチ周期予測値ＳＴＰ(k) をサブフレーム毎に求め
る。本実施形態では、Ｔ(-1)とＴ(0) とから補間により
サブフレームピッチ周期予測値ＳＴＰ(k) を求める方法
について説明する。音声信号の性質から、本来のサブフ
レームピッチ周期ＳＴ(k) はサブフレームピッチ周期予
測値ＳＴＰ(k) の近傍にあると考えることができる。そ
の理由として、音声信号のピッチ周期は時間的な変動が
小さいということが挙げられる。The method of calculating the subframe pitch period will be described in more detail with reference to FIG. First, the pitch cycle analysis unit 23 sets the pitch cycle found in the current frame f (0) to T (0), and the pitch cycle found in the past frame f (-1) to T (-1). Subframe pitch period extraction unit 24
Then, a subframe pitch period prediction value STP (k) is obtained for each subframe using these two pitch periods. In the present embodiment, a method of obtaining a subframe pitch period prediction value STP (k) from T (-1) and T (0) by interpolation will be described. Due to the nature of the audio signal, the original subframe pitch period ST (k) can be considered to be in the vicinity of the subframe pitch period predicted value STP (k). The reason is that the pitch period of the audio signal has a small temporal variation.

【００５３】次に、サブフレームピッチ周期予測値ＳＴ
Ｐ(k) を用いてサブフレームピッチ周期ＳＴ(k) を求め
る方法を図７を用いて説明する。上述のようにサブフレ
ームピッチ周期ＳＴ(k) はサブフレームピッチ周期予測
値ＳＴＰ(k) の近傍にあるという仮定が成り立つことか
ら、その近傍のみをピッチ周期抽出のための範囲として
考えることができる。図７では、第０サブフレームｓｆ
(0) におけるサブフレームピッチ周期予測値ＳＴＰ(0)
と、サブフレームピッチ周期ＳＴ(0) を抽出するための
相関値計算の範囲ＮＲを表している。Next, the subframe pitch period prediction value ST
A method of obtaining the subframe pitch period ST (k) using P (k) will be described with reference to FIG. As described above, since it is assumed that the subframe pitch period ST (k) is in the vicinity of the subframe pitch period prediction value STP (k), only the neighborhood can be considered as the range for pitch period extraction. . In FIG. 7, the 0th sub-frame sf
Subframe pitch period prediction value STP (0) at (0)
And the correlation value calculation range NR for extracting the subframe pitch period ST (0).

【００５４】このように本実施形態では、サブフレーム
ピッチ周期予測値ＳＴＰ(0) を中心として±ＮＲ／２の
範囲をサブフレームピッチ周期ＳＴ(0) を求めるための
相関値計算の範囲としている。すなわち、ＳＴＰ(0) −
ＮＲ／２≦ｔ≦ＳＴＰ(0) ＋ＮＲ／２−１の範囲に含ま
れる周期についてのみ、先に示した式（６）で規定され
る相関値ρ(t) を算出し、これが最大値をとるときのＳ
ＴＰ(0) に対する相対ピッチ周期ΔＴ(0) の情報を出力
端子２５より出力する。従って、サブフレームｓｆ(0)
のサブフレームピッチ周期ＳＴ(0) は、サブフレームピ
ッチ周期予測値ＳＴＰ(0) と相対ピッチ周期ΔＴ(0) と
の和ＳＴＰ(0) ＋ΔＴ(0) として求めることができる。
同様にして、他のサブフレームｓｆ(k) の相対ピッチ周
期ΔＴ(k) を求め、この情報を出力端子２５より出力す
る。本実施形態ではＮＲ＝８としている。As described above, in this embodiment, the range of ± NR / 2 around the subframe pitch period prediction value STP (0) is set as the correlation value calculation range for obtaining the subframe pitch period ST (0). . That is, STP (0)-
Only for the period included in the range of NR / 2 ≦ t ≦ STP (0) + NR / 2−1, the correlation value ρ (t) defined by the equation (6) shown above is calculated, and this is the maximum value. S when taking
Information on the relative pitch period ΔT (0) with respect to TP (0) is output from the output terminal 25. Therefore, subframe sf (0)
The subframe pitch period ST (0) of can be obtained as the sum STP (0) + ΔT (0) of the subframe pitch period predicted value STP (0) and the relative pitch period ΔT (0).
Similarly, the relative pitch period ΔT (k) of the other subframe sf (k) is obtained, and this information is output from the output terminal 25. In this embodiment, NR = 8.

【００５５】本実施形態によると、次のような効果が得
られる。まず、サブフレームピッチ周期を求めるための
相関値計算の範囲をＮＲとすることができるため、大幅
な計算量削減が実現できる。仮に、サブフレーム毎にサ
ブフレームピッチ周期を式（６）により求めると、全て
のピッチ周期候補（１２８候補）について計算する場
合、約４０，０００回の積和演算が必要になる。これに
対し、本実施形態を適用した場合、相関値計算範囲をＮ
Ｒ＝３とすると積和演算は約４，０００回で済み、約９
０％計算量を削減できる。但し、これはピッチ周期分析
長ＮＷ＝１６０のもとで計算した結果である。According to this embodiment, the following effects can be obtained. First, since the range of correlation value calculation for obtaining the subframe pitch period can be set to NR, a significant reduction in the amount of calculation can be realized. If the sub-frame pitch period is calculated by the equation (6) for each sub-frame, about 40,000 product-sum operations are required to calculate all the pitch period candidates (128 candidates). On the other hand, when the present embodiment is applied, the correlation value calculation range is N
If R = 3, the sum of products operation will be about 4,000 times, and about 9
The amount of calculation can be reduced by 0%. However, this is the result of calculation under the pitch period analysis length NW = 160.

【００５６】また、従来ではサブフレームピッチ周期を
表すのに必要な情報量は、サブフレーム毎に１２８候補
の全範囲について計算する場合、４サブフレームで１フ
レームが構成されるとすると、フレーム当たり７ビット
＊４＝２８ビットが必要であった。これに対し、本実施
形態ではフレームピッチ周期Ｔ(0) を表すのに７ビッ
ト、相対ピッチ周期ΔＴ(k) を表すのにサブフレーム当
たり３ビットあればよいから、サブフレームピッチ周期
を表すのに必要な情報量はフレーム当たり７ビット＋３
ビット＊４＝１９ビットとなり、約３５％のビット数削
減につながる。なお、過去のフレームのピッチ周期Ｔ(-
1)は、１フレーム前の処理におけるピッチ周期Ｔ(0) で
代用することができるため、これを新たにビットを使っ
て表す必要はない。In addition, conventionally, the amount of information required to express the subframe pitch period is such that, if one subframe is composed of four subframes when the entire range of 128 candidates is calculated for each subframe. 7 bits * 4 = 28 bits were needed. On the other hand, in the present embodiment, the frame pitch period T (0) is represented by 7 bits, and the relative pitch period ΔT (k) is represented by 3 bits per subframe. Therefore, the subframe pitch period is represented. The amount of information required for each frame is 7 bits + 3
Bit * 4 = 19 bits, which leads to a reduction in the number of bits by about 35%. The pitch period T (-
Since 1) can be substituted with the pitch period T (0) in the processing one frame before, it is not necessary to newly represent this with a bit.

【００５７】次に、本実施形態におけるサブフレームピ
ッチ周期算出の処理手順を図８のフローチャートにより
説明する。まず、ステップＳ１１で入力音声信号をフレ
ーム化、サブフレーム化を行う。次に、ステップＳ１２
で現フレーム内に分析の中心を置いてピッチ周期Ｔ(0)
を算出する。次に、ステップＳ１３で現フレームのピッ
チ周期Ｔ(0) と過去のフレームのピッチ周期Ｔ(-1)とを
用いて、サブフレーム毎のサブフレームピッチ周期予測
値ＳＴＰ(k) を求める。そして、ステップＳ１４でカウ
ンタｋを０に設定した後、ステップＳ１５でサブフレー
ムｓｆ(k) の相対ピッチ周期ΔＴ(k) を算出し、ステッ
プＳ１６でカウンタｋを１だけインクリメントする。ス
テップＳ１７でカウンタｋがＫ（Ｋはサブフレーム数）
に一致しているかどうかの判定を行う。ここでｋがＫに
一致していなければ、ステップＳ１５に戻って処理を継
続し、一致したならば処理は終了する。Next, the processing procedure for calculating the sub-frame pitch period in this embodiment will be described with reference to the flowchart of FIG. First, in step S11, the input audio signal is framed and subframed. Next, step S12
With the center of analysis in the current frame, pitch period T (0)
To calculate. Next, in step S13, the subframe pitch period prediction value STP (k) for each subframe is obtained using the pitch period T (0) of the current frame and the pitch period T (-1) of the past frame. Then, after setting the counter k to 0 in step S14, the relative pitch period ΔT (k) of the sub-frame sf (k) is calculated in step S15, and the counter k is incremented by 1 in step S16. In step S17, the counter k is K (K is the number of subframes)
It is determined whether or not If k does not match K, the process returns to step S15 to continue the process. If they match, the process ends.

【００５８】（第３の実施形態）図９を用いて本発明の
第３の実施形態を説明する。本実施形態と第２の実施形
態の違いは、サブフレームピッチ周期予測値ＳＴＰ(k)
を求める際に、先読み部ｆｒのピッチ周期Ｔｒを利用し
ている点にある。(Third Embodiment) A third embodiment of the present invention will be described with reference to FIG. The difference between the present embodiment and the second embodiment is that the subframe pitch period prediction value STP (k)
The point is that the pitch period Tr of the prefetching unit fr is used when obtaining

【００５９】サブフレームの中心が過去のフレームｆ(-
1)のピッチ周期Ｔ(-1)の分析位置と現フレームｆ(0) の
ピッチ周期Ｔ(0) の分析位置の間にある場合、そのサブ
フレームのサブフレームピッチ周期予測値ＳＴＰ(k)
は、Ｔ(-1)とＴ(0) を用いて求められる。同様に、サブ
フレームの中心が現フレームｆ(0) のピッチ周期Ｔ(0)
の分析位置と先読み部ｆｒのピッチ周期Ｔｒの分析位置
の間にある場合、そのサブフレームのサブフレームピッ
チ周期予測値ＳＴＰ(k) は、Ｔ(0) とＴｒを用いて求め
られる。本実施形態では、直線補間を用いてサブフレー
ムピッチ周期予測値ＳＴＰ(k) を算出している。The center of the sub-frame is the past frame f (-
1) between the analysis position of the pitch period T (-1) and the analysis position of the pitch period T (0) of the current frame f (0), the subframe pitch period prediction value STP (k) of the subframe
Is calculated using T (-1) and T (0). Similarly, the center of the subframe is the pitch period T (0) of the current frame f (0).
, And the analysis position of the pitch period Tr of the prefetching unit fr, the subframe pitch period prediction value STP (k) of the subframe is obtained using T (0) and Tr. In the present embodiment, the subframe pitch period prediction value STP (k) is calculated using linear interpolation.

【００６０】このようにしてサブフレームピッチ周期予
測値ＳＴＰ(k) を求めた後に、第２の実施形態と同様に
して相対ピッチ周期ΔＴ(k) をサブフレーム毎に求め
る。After the subframe pitch period predicted value STP (k) is obtained in this way, the relative pitch period ΔT (k) is obtained for each subframe in the same manner as in the second embodiment.

【００６１】また、本実施形態においてサブフレームピ
ッチ周期ＳＴ(k) を表すのに必要な情報は、現フレーム
のピッチ周期Ｔ(0) を１フレーム前の処理において求め
た先読み部ｆｒのピッチ周期Ｔｒで代用することができ
るため、先読み部ｆｒのピッチ周期Ｔｒとサブフレーム
毎の相対ピッチ周期ΔＴ(k) だけでよい。The information necessary to represent the sub-frame pitch period ST (k) in this embodiment is the pitch period of the look-ahead part fr obtained by processing the pitch period T (0) of the current frame one frame before. Since Tr can be used as a substitute, only the pitch period Tr of the look-ahead unit fr and the relative pitch period ΔT (k) for each subframe are sufficient.

【００６２】次に、図１０に示すフローチャートを用い
て本実施形態における処理手順を説明する。図１０のス
テップＳ２１、Ｓ２４、Ｓ２５、Ｓ２６およびＳ２７の
処理は、図８のステップＳ１１、Ｓ１４、Ｓ１５、Ｓ１
６およびＳ１７とそれぞれ同じであるので、ここでは説
明を省略する。図１０の処理手順の特徴的な部分は、ス
テップＳ２２で先読み部ｆｒに分析の中心を置いてピッ
チ周期Ｔｒを算出する点と、ステップＳ２３でサブフレ
ームピッチ周期予測値ＳＴＰ(k) を先読み部ｆｒのピッ
チ周期Ｔｒと現フレームｆ(0) のピッチ周期Ｔ(0) とを
用いて求めるか、もしくは現フレームｆ(0) のピッチ周
期Ｔ(0) と過去のフレームｆ(-1)のピッチ周期Ｔ(-1)と
を用いて求めるかのいずれかを使う点にある。Next, the processing procedure in this embodiment will be described with reference to the flow chart shown in FIG. The processing of steps S21, S24, S25, S26 and S27 of FIG. 10 is performed by steps S11, S14, S15 and S1 of FIG.
6 and S17, which are the same as those in S6 and S17, respectively, and the description thereof is omitted here. The characteristic part of the processing procedure of FIG. 10 is that the pitch cycle Tr is calculated with the analysis centered on the look-ahead section fr in step S22, and the sub-frame pitch cycle predicted value STP (k) is read in the step S23. The pitch period Tr of fr and the pitch period T (0) of the current frame f (0) are used to determine the pitch period T (0) of the current frame f (0) and the past frame f (-1). One of the points is to use one of the pitch periods T (-1).

【００６３】本実施形態では、いずれのサブフレームピ
ッチ周期予測値ＳＴＰ(k) も過去のフレームｆ(-1)のピ
ッチ周期Ｔ(-1)と現フレームｆ(0) のピッチ周期Ｔ(0)
、またはＴ(0) と先読み部ｆｒのピッチ周期Ｔｒの内
挿値として表されるため、より正確な予測値を得ること
ができる。In this embodiment, any subframe pitch cycle prediction value STP (k) has a pitch cycle T (-1) of the past frame f (-1) and a pitch cycle T (0 of the current frame f (0). )
, Or T (0) and an interpolated value of the pitch period Tr of the prefetching unit fr, a more accurate predicted value can be obtained.

【００６４】例えば、実際のピッチ周期が図１１に示す
ように変動している場合、第２の実施形態のように過去
のフレームのピッチ周期Ｔ(-1)と現フレームのピッチ周
期Ｔ(0) を基にサブフレームピッチ周期予測値ＳＴＰ
(k) を求める方法では、サブフレームピッチ周期予測値
ＳＴＰ(k) の信頼性が低下してしまうという問題が生じ
ることがある。これに対し、本実施形態では先読み部ｆ
ｒのピッチ周期Ｔｒが実際のピッチ周期を正確に表すた
め、その補間値も実際のピッチ周期を正確に表すことが
できるようになる。For example, when the actual pitch period fluctuates as shown in FIG. 11, the pitch period T (-1) of the past frame and the pitch period T (0 of the present frame are changed as in the second embodiment. ) Subframe pitch period prediction value STP
The method of obtaining (k) may cause a problem that the reliability of the subframe pitch period prediction value STP (k) is lowered. On the other hand, in the present embodiment, the prefetch unit f
Since the pitch period Tr of r accurately represents the actual pitch period, the interpolation value thereof can also accurately represent the actual pitch period.

【００６５】（第４の実施形態）図１２を用いて本発明
の第４の実施形態を説明する。本実施形態と第３の実施
形態の違いは、サブフレームピッチ周期予測値ＳＴＰ
(k) を求める際に、先読み部ｆｒのピッチ周期Ｔｒと現
フレームｆ(0) のピッチ周期Ｔ(0) を用いる点にある。(Fourth Embodiment) A fourth embodiment of the present invention will be described with reference to FIG. The difference between the present embodiment and the third embodiment is that the subframe pitch period prediction value STP is
The point is that the pitch period Tr of the prefetching unit fr and the pitch period T (0) of the current frame f (0) are used when obtaining (k).

【００６６】本実施形態の処理手順を図１３に示すフロ
ーチャートを用いて説明する。図１３におけるステップ
Ｓ３１、Ｓ３４〜Ｓ３７の処理は、図８におけるステッ
プＳ１１、Ｓ１４〜Ｓ１７の処理と同様であるので、こ
こでは説明を省略する。The processing procedure of this embodiment will be described with reference to the flowchart shown in FIG. The processing of steps S31 and S34 to S37 in FIG. 13 is the same as the processing of steps S11 and S14 to S17 in FIG.

【００６７】ステップＳ３２において、先読み部ｆｒに
分析の中心を置いてピッチ周期Ｔｒを算出する。次に、
ステップＳ３３で先読み部ｆｒのピッチ周期Ｔｒと現フ
レームｆ(0) のピッチ周期Ｔ(0) を用いてサブフレーム
ピッチ周期予測値ＳＴＰ(k)を算出する。In step S32, the pitch period Tr is calculated with the analysis centered on the look-ahead part fr. next,
In step S33, the subframe pitch cycle prediction value STP (k) is calculated using the pitch cycle Tr of the prefetching unit fr and the pitch cycle T (0) of the current frame f (0).

【００６８】本実施形態では、先読み部ｆｒのピッチ周
期Ｔｒと現フレームｆ(0) のピッチ周期Ｔ(0) とから補
間によりサブフレームピッチ周期予測値ＳＴＰ(k) を求
めることができるため、第３の実施形態に比べ先読み部
ｆｒの長さを短くでき、従って遅延を小さくできるとい
うメリットがある。In the present embodiment, the subframe pitch period prediction value STP (k) can be obtained by interpolation from the pitch period Tr of the prefetching unit fr and the pitch period T (0) of the current frame f (0). Compared with the third embodiment, there is an advantage that the length of the prefetching unit fr can be shortened and therefore the delay can be reduced.

【００６９】（第５の実施形態）図１４を用いて本発明
の第５の実施形態を説明する。本実施形態と第４の実施
形態の違いは、サブフレームピッチ周期予測値ＳＴＰ
(k) を求める際に、先読み部ｆｒのピッチ周期Ｔｒと過
去のフレームｆ(-1)のピッチ周期Ｔ(-1)を用いている点
にある。(Fifth Embodiment) A fifth embodiment of the present invention will be described with reference to FIG. The difference between the present embodiment and the fourth embodiment is that the subframe pitch period prediction value STP
The point is that the pitch period Tr of the prefetching unit fr and the pitch period T (-1) of the past frame f (-1) are used when obtaining (k).

【００７０】本実施形態によると、次のような効果があ
る。ピッチ周期の正確な分析は一般に困難であり、現在
においても確立された方式があるわけではない。本実施
形態では式（６）で表される相関分析による方法につい
て説明しているが、この場合も常に正しいピッチ周期を
求められるわけではなく、実際のピッチ周期の整数分の
１のピッチ周期が求められてしまったり、逆に整数倍の
ピッチ周期が求められてしまう場合がある。According to this embodiment, the following effects can be obtained. Accurate analysis of pitch period is generally difficult, and there is no established method even now. In the present embodiment, the method based on the correlation analysis represented by the equation (6) is described, but in this case as well, the correct pitch period cannot always be obtained, and the pitch period which is an integer fraction of the actual pitch period is There is a case where the pitch period is obtained or, conversely, an integral multiple pitch period is obtained.

【００７１】図１４に示されるように、現フレームｆ
(0) のピッチ周期Ｔ(0) の分析が失敗し、実際のピッチ
周期から大きく外れてしまった場合、先読み部ｆｒのピ
ッチ周期Ｔｒと過去のフレームｆ(-1)のピッチ周期Ｔ(-
1)とで補間を行い、サブフレームピッチ周期予測値ＳＴ
Ｐ(k) を求めるようにすれば、現フレームｆ(0) のピッ
チ周期Ｔ(0) が正確に求まった場合と同様に効率的にピ
ッチ周期を表現することができる。As shown in FIG. 14, the current frame f
When the analysis of the pitch period T (0) of (0) fails and is largely deviated from the actual pitch period, the pitch period Tr of the look-ahead part fr and the pitch period T (-of the past frame f (-1) are
Interpolation with 1) and subframe pitch period prediction value ST
By determining P (k), the pitch period can be efficiently expressed as in the case where the pitch period T (0) of the current frame f (0) is accurately determined.

【００７２】本実施形態の処理手順を図１５に示すフロ
ーチャートを用いて説明する。図１５におけるステップ
Ｓ４１、Ｓ４２、Ｓ４５、Ｓ４７、Ｓ４８、Ｓ４９およ
びＳ５０の処理は、図１０におけるステップＳ２１、Ｓ
２２、Ｓ２３、Ｓ２４、Ｓ２５、Ｓ２６およびＳ２７の
処理と同じであるので、ここでは説明を省略する。本実
施形態で特徴的な部分は、ステップＳ４３、Ｓ４４およ
びＳ４６の処理であるので、これらの処理について説明
する。The processing procedure of this embodiment will be described with reference to the flowchart shown in FIG. The processes of steps S41, S42, S45, S47, S48, S49 and S50 in FIG. 15 are the same as steps S21, S in FIG.
Since it is the same as the processing of S22, S23, S24, S25, S26 and S27, the description thereof is omitted here. The characteristic part of the present embodiment is the processing of steps S43, S44, and S46, so these processings will be described.

【００７３】ステップＳ４３およびＳ４４において先読
み部で求めたピッチ周期Ｔｒと過去のフレームのピッチ
周期Ｔ(-1)との平均値をステップＳ４３で算出し、その
平均値と現フレームのピッチ周期Ｔ(0) との大きさを次
式（７）に従い比較する（ステップＳ４４）。In step S43 and S44, the average value of the pitch cycle Tr obtained by the look-ahead section and the pitch cycle T (-1) of the past frame is calculated in step S43, and the average value and the pitch cycle T () of the current frame are calculated. And 0) are compared according to the following equation (7) (step S44).

【００７４】[0074]

【数４】 [Equation 4]

【００７５】式（７）が成立する場合、現フレームｆ
(0) のピッチ周期Ｔ(0) は信頼できるものとして、ステ
ップＳ４５に示すようにサブフレームピッチ周期予測値
ＳＴＰ(k) を算出するために現フレームｆ(0) のピッチ
周期Ｔ(0) を用いる。逆に、式（７）が成立しないとき
は、ステップＳ４６で示すように先読み部ｆｒのピッチ
周期Ｔｒと過去のフレームｆ(-1)のピッチ周期Ｔ(-1)の
みを用いてサブフレームピッチ周期予測値ＳＴＰ(k) を
算出する。If the expression (7) is satisfied, the current frame f
Assuming that the pitch period T (0) of (0) is reliable, the pitch period T (0) of the current frame f (0) is calculated in order to calculate the subframe pitch period prediction value STP (k) as shown in step S45. To use. On the contrary, when the expression (7) is not established, as shown in step S46, only the pitch period Tr of the look-ahead part fr and the pitch period T (-1) of the past frame f (-1) are used to determine the sub-frame pitch. The cycle prediction value STP (k) is calculated.

【００７６】（第６の実施形態）図１６を用いて本発明
の第６の実施形態を説明する。図１６において、入力端
子２０１、フレーム・サブフレーム構成部２０２および
ピッチ周期分析部２０３は、図１中の参照符号１０１〜
１０３を付した要素と同じであるので、ここでは説明を
省略する。(Sixth Embodiment) The sixth embodiment of the present invention will be described with reference to FIG. 16, the input terminal 201, the frame / subframe configuration unit 202, and the pitch period analysis unit 203 are denoted by reference numerals 101 to 101 in FIG.
Since it is the same as the element denoted by 103, its explanation is omitted here.

【００７７】本実施形態の特徴は、本発明を適応符号帳
を用いた音声符号化方法に適用した点にある。適応符号
帳とは、過去の駆動信号系列を予め定められた範囲に含
まれる周期で繰り返して生成される複数の適応ベクトル
を格納したコードブックをいう。A feature of this embodiment is that the present invention is applied to a speech coding method using an adaptive codebook. The adaptive codebook refers to a codebook that stores a plurality of adaptive vectors generated by repeating a past drive signal sequence in a cycle included in a predetermined range.

【００７８】サブフレームピッチ周期予測値算出部２０
４では、ピッチ周期分析部２０３で求めたピッチ周期を
基に、各サブフレーム毎にサブフレームピッチ周期予測
値ＳＴＰ(k) を算出する。サブフレームピッチ周期予測
値ＳＴＰ(k) の算出法には、図６、図９、図１２および
図１４で説明した方法のいずれも適用可能である。ま
た、それによる効果は前述した通りである。ただし、図
６で説明した方法ではピッチ周期分析部２０３でのピッ
チ周期分析の中心が現フレームｆ(0) 上にあり、図９、
図１２、図１４で説明した方法ではピッチ周期分析の中
心は先読み部ｆｒ上にあるという違いはある。本実施形
態では、図６で説明した方法に基づいた場合の例につい
て説明する。Subframe pitch period prediction value calculation unit 20
In 4, the subframe pitch cycle prediction value STP (k) is calculated for each subframe based on the pitch cycle obtained by the pitch cycle analysis unit 203. Any of the methods described with reference to FIGS. 6, 9, 12, and 14 can be applied to the method of calculating the subframe pitch period prediction value STP (k). Moreover, the effect by it is as above-mentioned. However, in the method described in FIG. 6, the center of the pitch period analysis in the pitch period analysis unit 203 is on the current frame f (0), and
In the methods described with reference to FIGS. 12 and 14, there is a difference that the center of the pitch period analysis is on the prefetching unit fr. In this embodiment, an example based on the method described in FIG. 6 will be described.

【００７９】サブフレームピッチ周期予測値算出部２０
４では、ピッチ周期分析部２０３で求めたピッチ周期Ｔ
(0) とｆ(-1)フレームのピッチ周期Ｔ(-1)とから補間に
よりサブフレームピッチ周期予測値ＳＴＰ(k) を求め
る。Subframe pitch period prediction value calculation unit 20
4, the pitch period T obtained by the pitch period analysis unit 203
A sub-frame pitch period predicted value STP (k) is obtained by interpolation from (0) and the pitch period T (-1) of the f (-1) frame.

【００８０】探索範囲決定部２０５では、サブフレーム
ピッチ周期ＳＴ(0) を求める際の探索範囲、つまり適応
符号帳２０６に対する適応ベクトルの探索範囲を決定す
る。具体的には、サブフレームピッチ周期予測値ＳＴＰ
(0) を中心として±ＮＲ／２を探索範囲とする。すなわ
ち、ＳＴＰ(0) −ＮＲ／２≦ｔ≦ＳＴＰ(0) ＋ＮＲ／２
−１の範囲に含まれる周期についてのみ探索を行う。探
索は、以下のような方法で行われる。Search range determination section 205 determines the search range for obtaining subframe pitch period ST (0), that is, the search range of the adaptive vector for adaptive codebook 206. Specifically, the subframe pitch cycle prediction value STP
The search range is ± NR / 2 centered on (0). That is, STP (0) -NR / 2≤t≤STP (0) + NR / 2
Only the cycle included in the range of -1 is searched. The search is performed by the following method.

【００８１】探索範囲決定部２０５で決定された探索範
囲に含まれるピッチ周期ｔに対応する適応ベクトルｑ
(t,n) を適応符号帳２０６から取り出し、これを聴感重
み付き合成フィルタ２０７に通して合成信号ｐ(t,n) を
生成する。そして、この合成信号ｐ(t,n) と、入力音声
信号を聴感重みフィルタ２０８に通して得られる目標信
号ｒ(n) との寄与度ｃｎｔｂ(t) を減算器２０９を介し
て歪算出部２１０で次式に従い算出する。The adaptive vector q corresponding to the pitch period t included in the search range determined by the search range determining unit 205.
(t, n) is taken out from the adaptive codebook 206 and is passed through the perceptual weighting synthesis filter 207 to generate a synthesis signal p (t, n). Then, the contribution degree cntb (t) between this synthesized signal p (t, n) and the target signal r (n) obtained by passing the input voice signal through the perceptual weighting filter 208 is passed through the subtractor 209 to the distortion calculation unit. At 210, it is calculated according to the following equation.

【００８２】[0082]

【数５】 [Equation 5]

【００８３】探索範囲に含まれるピッチ周期ｔの全候補
の中で、寄与度ｃｎｔｂ(t) が最大となるときのピッチ
周期ｔをサブフレームピッチ周期ＳＴ(0) とし、サブフ
レームピッチ周期ＳＴ(0) からサブフレームピッチ周期
予測値ＳＴＰ(0) を差し引いた値を相対ピッチ周期ΔＴ
(0) とする。この相対ピッチ周期ΔＴ(0) を求める処理
を全てのサブフレームｓｆ(k) について行い、全サブフ
レームの相対ピッチ周期ΔＴ(k) を求める。Of all the candidates of pitch period t included in the search range, the pitch period t when the contribution cntb (t) becomes maximum is set as the subframe pitch period ST (0), and the subframe pitch period ST ( The value obtained by subtracting the subframe pitch cycle prediction value STP (0) from (0) is the relative pitch cycle ΔT.
(0) The process of obtaining this relative pitch period ΔT (0) is performed for all subframes sf (k) to obtain the relative pitch period ΔT (k) of all subframes.

【００８４】最後に、出力端子２１２からピッチ周期分
析部２０３で求めたピッチ周期Ｔ(0) の情報を出力し、
出力端子２１１から歪算出部２１０で寄与度ｃｎｔｂ
(t)が最大となるときの相対ピッチ周期ΔＴ(k) の情報
を出力する。Finally, the information of the pitch period T (0) obtained by the pitch period analysis unit 203 is output from the output terminal 212,
The contribution degree cntb from the output terminal 211 in the distortion calculation unit 210
Information on the relative pitch period ΔT (k) when (t) becomes maximum is output.

【００８５】次に、本実施形態の処理手順を図１７〜１
８に示すフローチャートを用いて説明する。図１７〜１
８におけるステップＳ１０１、Ｓ１０２、Ｓ１０３、Ｓ
１０４、Ｓ１１４およびＳ１１５の処理は、図８におけ
るステップＳ１１、Ｓ１２、Ｓ１３、Ｓ１４、Ｓ１６お
よびＳ１７の処理と同じであるので、ここでは説明を省
略する。以下、本実施形態特有の処理について説明す
る。Next, the processing procedure of this embodiment will be described with reference to FIGS.
This will be described using the flowchart shown in FIG. 17 to 1
8 in steps S101, S102, S103, S
Since the processing of 104, S114 and S115 is the same as the processing of steps S11, S12, S13, S14, S16 and S17 in FIG. 8, description thereof will be omitted here. The processing unique to this embodiment will be described below.

【００８６】ステップＳ１０５では、サブフレームｓｆ
(k) の探索範囲を決定する。探索範囲は、前述したよう
にサブフレームピッチ周期予測値ＳＴＰ(k) を用いて定
められる。ステップＳ１０６で、変数ｃｎｔｂｍａｘと
サブフレームピッチ周期ＳＴ(k) の初期値を与える。ｃ
ｎｔｂｍａｘには初期値になり得る十分に大きな値を与
え、ＳＴ(k) には最小ピッチ周期を与える。ステップＳ
１０７では、探索範囲に含まれる周期ｔに対応する適応
ベクトルｑ(t,n) を適応符号帳から選択し、ステップＳ
１０８で適応ベクトルｑ(t,n) を聴感重み付き合成フィ
ルタに通して合成信号ｐ(t,n) を生成する。ステップＳ
１０９では、予め求めておいた目標信号ｒ(n) と合成信
号ｐ(t,n) との寄与度ｃｎｔｂ(t) を算出し、ステップ
Ｓ１１０ではピッチ周期ｔにおける寄与度ｃｎｔｂ(t)
とｃｎｔｂｍａｘの大きさを比較する。In step S105, the subframe sf
Determine the search range of (k). The search range is determined using the subframe pitch period prediction value STP (k) as described above. In step S106, initial values of the variable cntbmax and the subframe pitch period ST (k) are given. c
ntbmax is given a sufficiently large value that can be an initial value, and ST (k) is given a minimum pitch period. Step S
In step 107, the adaptive vector q (t, n) corresponding to the cycle t included in the search range is selected from the adaptive codebook, and step S
At 108, the adaptive vector q (t, n) is passed through a perceptual weighting synthesis filter to generate a synthesis signal p (t, n). Step S
In step 109, the contribution cntb (t) between the target signal r (n) and the combined signal p (t, n) which has been obtained in advance is calculated. In step S110, the contribution cntb (t) in the pitch cycle t is calculated.
And cntbmax are compared.

【００８７】ここで、ｃｎｔｂ(t) ＞ｃｎｔｂｍａｘが
成り立つ場合、ステップＳ１１１においてｃｎｔｂｍａ
ｘにｃｎｔｂ(t) を与え、ＳＴ(k) にｔを与えてステッ
プＳ１１２に進む。ｃｎｔｂ(t) ＞ｃｎｔｂｍａｘが成
り立たない場合、ステップＳ１１２に進む。Here, if cntb (t)> cntbmax holds, then in step S111, cntbma
cntb (t) is given to x, t is given to ST (k), and the process proceeds to step S112. If cntb (t)> cntbmax is not satisfied, the process proceeds to step S112.

【００８８】ステップＳ１１２では、探索範囲に含まれ
るピッチ周期ｔの候補を全て探索したか否かを判定し、
そうであればステップＳ１１３に進み、そうでなければ
ステップＳ１０７に戻って、まだ探索を行っていないピ
ッチ周期ｔを用いて処理を継続する。ステップＳ１１３
では、サブフレームｓｆ(k) の相対ピッチ周期ΔＴ(k)
をサブフレームピッチ周期ＳＴ(k) とサブフレームピッ
チ周期予測値ＳＴＰ(k) を用いて算出し、ステップＳ１
１４に進む。In step S112, it is determined whether or not all the candidates of the pitch cycle t included in the search range have been searched,
If so, the process proceeds to step S113, and if not, the process returns to step S107 to continue the process using the pitch cycle t which has not been searched yet. Step S113
Then, the relative pitch period ΔT (k) of the subframe sf (k)
Is calculated using the subframe pitch period ST (k) and the subframe pitch period prediction value STP (k), and step S1
Proceed to 14.

【００８９】（第７の実施形態）図１９〜２０のフロー
チャートを用いて本発明の第７の実施形態を説明する。
本実施形態と第６の実施形態との違いは、サブフレーム
ピッチ周期予測値の算出に図９で説明した方法を適用し
ている点にある。(Seventh Embodiment) The seventh embodiment of the present invention will be described with reference to the flowcharts of FIGS.
The difference between the present embodiment and the sixth embodiment is that the method described in FIG. 9 is applied to the calculation of the subframe pitch period prediction value.

【００９０】本実施形態によると、前述したようにサブ
フレームピッチ周期予測値ＳＴＰ(k) の実際のピッチ周
期に対する正確性が向上するため、サブフレームｓｆ
(k)のサブフレームピッチ周期ＳＴ(k) の探索範囲を小
さくでき、結果的に相対ピッチ周期ΔＴ(k) を表現する
のに必要なビット数および計算量を少なくすることがで
きるという効果がある。図１９〜２０において、ステッ
プＳ２０１およびＳ２０４〜Ｓ２１５の処理は、それぞ
れ図１７〜１８におけるステップＳ１０１およびＳ１０
４〜Ｓ１１５の処理と同様であるので、ここでは説明を
省略する。According to the present embodiment, as described above, the accuracy of the subframe pitch period prediction value STP (k) with respect to the actual pitch period is improved, so that the subframe sf
It is possible to reduce the search range of the subframe pitch period ST (k) of (k), and consequently reduce the number of bits and the amount of calculation required to express the relative pitch period ΔT (k). is there. 19 to 20, the processes of steps S201 and S204 to S215 are the same as steps S101 and S10 of FIGS.
Since it is the same as the processing of 4 to S115, the description thereof is omitted here.

【００９１】本実施形態で特徴的な処理は、ステップＳ
２０２とＳ２０３にある。ステップＳ２０２では、ピッ
チ周期分析の中心を先読み部に置いてピッチ周期を分析
している。ステップＳ２０３では、サブフレームピッチ
周期予測値ＳＴＰ(k) の算出を先読み部ｆｒのピッチ周
期Ｔｒとフレームｆ(0) のピッチ周期Ｔ(0) を用いて決
定するか、もしくはフレームｆ(0) のピッチ周期Ｔ(0)
とフレームｆ(-1)のピッチ周期Ｔ(-1)を用いて決定する
かのいずれか一方を用いている。The processing characteristic of this embodiment is step S
202 and S203. In step S202, the pitch period is analyzed with the center of the pitch period analysis placed in the look-ahead part. In step S203, the calculation of the subframe pitch period prediction value STP (k) is determined by using the pitch period Tr of the prefetching unit fr and the pitch period T (0) of the frame f (0), or the frame f (0). Pitch period of T (0)
And the pitch period T (-1) of the frame f (-1) is used.

【００９２】（第８の実施形態）図２１〜２２のフロー
チャートを用いて第８の実施形態を説明する。本実施形
態と第７の実施形態との違いは、サブフレームピッチ周
期予測値算出部に図１２で説明した方法を適用している
点にある。(Eighth Embodiment) The eighth embodiment will be described with reference to the flowcharts of FIGS. The difference between the present embodiment and the seventh embodiment is that the method described in FIG. 12 is applied to the subframe pitch period prediction value calculation unit.

【００９３】本実施形態によると、前述したように先読
み部の大きさを小さくできるため、遅延を短くすること
ができるという効果がある。図２１〜２２において、ス
テップＳ３０１、Ｓ３０２およびＳ３０４〜Ｓ３１５の
処理は、図１９〜２０におけるステップＳ２０１、Ｓ２
０２およびＳ２０４〜Ｓ２１５の処理と同様であるの
で、ここでは説明を省略する。According to this embodiment, the size of the read-ahead portion can be reduced as described above, so that the delay can be shortened. 21 to 22, the processes of steps S301, S302 and S304 to S315 are the same as steps S201 and S2 of FIGS.
02 and S204 to S215, the description is omitted here.

【００９４】本実施形態で特徴的な処理はステップＳ３
０３にあり、このステップＳ３０３ではサブフレームピ
ッチ周期予測値ＳＴＰ(k) を先読み部ｆｒのピッチ周期
Ｔｒとフレームｆ(0) のピッチ周期Ｔ(0) を用いて算出
している。The processing characteristic of this embodiment is step S3.
In step S303, the sub-frame pitch cycle predicted value STP (k) is calculated using the pitch cycle Tr of the prefetching unit fr and the pitch cycle T (0) of the frame f (0).

【００９５】（第９の実施形態）図２３〜２４のフロー
チャートを用いて本発明の第９の実施形態を説明する。
本実施形態と第８の実施形態との違いは、サブフレーム
ピッチ周期予測値算出部に図１４で説明した方法を適用
している点にある。(Ninth Embodiment) The ninth embodiment of the present invention will be described with reference to the flowcharts of FIGS.
The difference between this embodiment and the eighth embodiment is that the method described with reference to FIG. 14 is applied to the subframe pitch period prediction value calculation unit.

【００９６】本実施形態の方法を用いれば、前述したよ
うにフレームｆ(0) でのピッチ周期Ｔ(0) の分析が失敗
し実際のピッチ周期から大きく外れてしまった場合で
も、先読み部ｆｒで求めたピッチ周期Ｔｒとフレームｆ
(-1)のピッチ周期Ｔ(-1)とで補間を行うことでサブフレ
ームピッチ周期予測値ＳＴＰ(k) を正確に求めることが
でき、フレームｆ(0) のピッチ周期Ｔ(0) が正確に求ま
った場合と同様に効率的にピッチ周期を表現することが
できる。According to the method of this embodiment, even if the analysis of the pitch period T (0) in the frame f (0) fails as described above and the pitch period T (0) largely deviates from the actual pitch period, the prefetch section fr is used. Pitch cycle Tr and frame f calculated in
By interpolating with the pitch period T (-1) of (-1), the subframe pitch period prediction value STP (k) can be accurately obtained, and the pitch period T (0) of the frame f (0) is The pitch period can be efficiently expressed as in the case where the pitch period is accurately obtained.

【００９７】図２３〜２４において、ステップＳ４０
１、Ｓ４０２およびＳ４０７〜Ｓ４１８の処理は、図２
１〜２２におけるステップＳ３０１、Ｓ３０２およびＳ
３０４〜Ｓ３１５の処理と同様であり、また図２３〜２
４におけるステップＳ４０３〜Ｓ４０６の処理は、図１
５におけるステップＳ４３〜Ｓ４６の処理と同じである
ので、ここでは説明を省略する。23 to 24, step S40
2, the processing of S402 and S407 to S418 is as shown in FIG.
1 to 22 steps S301, S302 and S
The processing is the same as the processing of 304 to S315, and also shown in FIGS.
The processing of steps S403 to S406 in FIG.
Since it is the same as the processing of steps S43 to S46 in 5, the description thereof will be omitted here.

【００９８】（第１０の実施形態）図２５を用いて本発
明の第１０の実施形態を説明する。図２５において入力
端子３０１、フレーム・サブフレーム構成部３０２、ピ
ッチ周期分析部３０３、サブフレームピッチ周期算出部
３０４、探索範囲決定部３０５、適応符号帳３０６、聴
感重み付き合成フィルタ３０７、聴感重みフィル３０
８、減算器３０９、歪算出部３１０、出力端子３１１，
３１２は、図１６中の参照符号２０１〜２１２を付した
要素と同一であるので、ここでは説明を省略する。本実
施形態の特徴は、ピッチ周期分析部３０３でのピッチ周
期の分析位置をピッチ周期分析位置決定部３１３によっ
て決定する点にある。(Tenth Embodiment) The tenth embodiment of the present invention will be described with reference to FIG. In FIG. 25, an input terminal 301, a frame / subframe configuration unit 302, a pitch period analysis unit 303, a subframe pitch period calculation unit 304, a search range determination unit 305, an adaptive codebook 306, a perceptual weighting synthesis filter 307, and a perceptual weight filter. Thirty
8, subtractor 309, distortion calculator 310, output terminal 311,
The reference numeral 312 is the same as the elements denoted by reference numerals 201 to 212 in FIG. 16, and thus the description thereof is omitted here. The feature of this embodiment lies in that the pitch period analysis position is determined by the pitch period analysis position determination unit 313 in the pitch period analysis unit 303.

【００９９】本実施形態の効果を図２６を用いて説明す
る。同図のように入力音声信号の特性がフレーム内で変
化している場合、ピッチ周期分析位置が固定であると正
確なピッチ周期を求められないことがある。図２６の場
合、ピッチ周期分析位置が固定のときのピッチ分析対象
の信号に周期成分が含まれていないことが正確なピッチ
周期を求めることができない原因になっている。しか
し、ピッチ分析位置を可変にできれば、図２６に示すよ
うにピッチ周期成分を有する波形が存在する位置にピッ
チ周期分析位置を設定することができるので、正確なピ
ッチ周期を求めることが可能になる。ただし、この場合
にはピッチ周期分析位置を表すための付加情報が必要に
なる。The effect of this embodiment will be described with reference to FIG. When the characteristics of the input speech signal change within the frame as shown in the figure, an accurate pitch cycle may not be obtained if the pitch cycle analysis position is fixed. In the case of FIG. 26, the fact that the signal to be pitch-analyzed when the pitch-period analysis position is fixed does not include a periodic component is the reason why an accurate pitch period cannot be obtained. However, if the pitch analysis position can be made variable, the pitch period analysis position can be set at a position where a waveform having a pitch period component exists as shown in FIG. 26, so that an accurate pitch period can be obtained. . However, in this case, additional information is required to represent the pitch period analysis position.

【０１００】図２５において、ピッチ周期分析部３１３
はピッチ周期分析位置を決定する。本実施形態では、入
力音声信号の短区間パワーもしくは低域通過フィルタを
通過した後の予測残差信号のパワーなどを指標にして、
そのパワーが大きくなる部分にピッチ周期分析位置を設
定する方法を用いている。そのピッチ周期分析位置の情
報をピッチ周期分析部３０３に送ると同時に、出力端子
３１４から出力する。ピッチ周期分析部３０３では、ピ
ッチ周期分析位置決定部３１３から送られてきたピッチ
周期分析位置の情報に従ってピッチ周期の分析を行う。In FIG. 25, the pitch period analysis unit 313
Determines the pitch period analysis position. In the present embodiment, the short-term power of the input audio signal or the power of the prediction residual signal after passing the low-pass filter is used as an index,
A method of setting a pitch period analysis position in a portion where the power becomes large is used. The information of the pitch cycle analysis position is sent to the pitch cycle analysis unit 303 and simultaneously output from the output terminal 314. The pitch period analysis unit 303 analyzes the pitch period according to the information of the pitch period analysis position sent from the pitch period analysis position determination unit 313.

【０１０１】（第１１の実施形態）図２７を用いて本発
明の第１１の実施形態を説明する。図２７において、入
力端子４０１、フレーム・サブフレーム構成部４０２、
ピッチ周期分析部４０３、サブフレームピッチ周期算出
部４０４、探索範囲決定部４０５、適応符号帳４０６、
聴感重み付き合成フィルタ４０７、聴感重みフィルタ４
０８、減算器４０９、歪算出部４１０、出力端子４１
１，４１２は、図１６中の参照符号２０１〜２１２を付
した要素と同じであるので、ここでは説明を省略する。(Eleventh Embodiment) The eleventh embodiment of the present invention will be described with reference to FIG. In FIG. 27, an input terminal 401, a frame / subframe configuration unit 402,
Pitch period analysis unit 403, subframe pitch period calculation unit 404, search range determination unit 405, adaptive codebook 406,
Perceptual weighting synthesis filter 407, perceptual weighting filter 4
08, subtractor 409, distortion calculator 410, output terminal 41
Since 1 and 412 are the same as the elements denoted by reference numerals 201 to 212 in FIG. 16, description thereof will be omitted here.

【０１０２】本実施形態の特徴は、ピッチ周期分析部４
０３で求めたピッチ周期の変化が連続的であるか否かを
連続性判定部４１３で判定し、その判定結果に応じて適
応符号帳４０６の探索範囲を決定する点にある。すなわ
ち、本実施形態ではピッチ周期の変化が連続的であると
判定された場合、前述したようにサブフレームピッチ周
期予測値を求め、その値を基に探索範囲決定部４０５で
適応符号帳４０６の探索範囲を決定する。連続的でない
と判定されたなら、全探索部４１４で適応符号帳４０６
内の全候補について探索を行うようにしている。The feature of this embodiment is that the pitch period analysis unit 4
The continuity determination unit 413 determines whether or not the change in pitch period obtained in 03 is continuous, and determines the search range of the adaptive codebook 406 according to the determination result. That is, in the present embodiment, when it is determined that the pitch period changes continuously, the subframe pitch period predicted value is obtained as described above, and the search range determination unit 405 determines the adaptive codebook 406 based on the value. Determine the search range. If it is determined that the adaptive codebook 406 is not continuous.
The search is performed for all the candidates in.

【０１０３】本実施形態は、次のような効果を有する。
ピッチ周期分析部４０３で求めたピッチ周期が連続的に
変化している場合、これまで述べてきたようにピッチ周
期を補間してその周辺のみを探索する方法では効果的に
機能する。しかしながら、音声信号は常に安定してピッ
チ周期が変化しているわけではなく、ピッチ周期が大き
く変化する場合もあり、また無声部のようにピッチ成分
が存在しない場合もある。このようなとき、ピッチ周期
は連続的に変化しているという仮定の基に強制的に補間
して探索範囲を限定すると、深刻な品質劣化を招いてし
まう。このようにピッチ周期が連続的でない場合、本実
施形態では適応符号帳４０６の全候補を探索するように
切り替えることで、上述の問題を回避することができ
る。This embodiment has the following effects.
When the pitch period calculated by the pitch period analysis unit 403 is continuously changing, the method of interpolating the pitch period and searching only the periphery thereof as described above works effectively. However, the pitch period of the voice signal does not always change stably, the pitch period may change greatly, and the pitch component may not exist like the unvoiced part. In such a case, if the search range is limited by forcibly interpolating based on the assumption that the pitch period is continuously changing, serious quality deterioration will be caused. When the pitch period is not continuous in this way, in the present embodiment, the above problem can be avoided by switching to search for all candidates in the adaptive codebook 406.

【０１０４】また、全候補探索を行うかどうかをピッチ
周期分析部４０３で求めたピッチ周期Ｔのときの周期性
の強さに応じて切り替えても、同様の効果が得られる。
すなわち、式（６）のρ(T) で表されるようなピッチ周
期性の強さを表す変数が予め定められた閾値以下である
場合には、全探索部４１４で全候補を探索するようにス
イッチ４１５を切り替えることで、ピッチ周期が存在し
ない領域での劣化を回避することができる。The same effect can be obtained by switching whether or not to search all candidates according to the strength of periodicity at the pitch period T obtained by the pitch period analysis unit 403.
That is, when the variable representing the strength of the pitch periodicity, which is represented by ρ (T) in the equation (6), is equal to or less than a predetermined threshold value, the full search unit 414 searches all candidates. By switching the switch 415 to, it is possible to avoid deterioration in the region where the pitch period does not exist.

【０１０５】連続性判定部４１３では、ピッチ周期分析
部４０３で求めたピッチ周期を使ってピッチ周期の連続
性を判定する。本実施形態では、フレームｆ(0) で求め
たピッチ周期Ｔ(0) とフレームｆ(-1)で求めたピッチ周
期Ｔ(-1)とを使う場合について説明するが、先読み部ｆ
ｒのピッチ周期Ｔｒとフレームｆ(0) のピッチ周期Ｔ
(0) を用いる場合、もしくは先読み部ｆｒのピッチ周期
Ｔｒとフレームｆ(-1)のピッチ周期Ｔ(-1)の連続性を用
いる場合、先読み部ｆｒのピッチ周期Ｔｒとフレームｆ
(0) のピッチ周期Ｔ(0) とフレームｆ(-1)のピッチ周期
Ｔ(-1)の連続性を用いる場合にも適用できる。ピッチ周
期の連続性は、次式（９）に従って判定される。The continuity judging section 413 judges the continuity of the pitch cycle using the pitch cycle obtained by the pitch cycle analyzing section 403. In the present embodiment, the case where the pitch period T (0) obtained in the frame f (0) and the pitch period T (-1) obtained in the frame f (-1) are used will be described.
pitch period Tr of r and pitch period T of frame f (0)
When (0) is used, or when the continuity of the pitch period Tr of the prefetch unit fr and the pitch period T (-1) of the frame f (-1) is used, the pitch period Tr of the prefetch unit fr and the frame f
It is also applicable when using the continuity of the pitch period T (0) of (0) and the pitch period T (-1) of the frame f (-1). The continuity of the pitch period is determined according to the following equation (9).

【０１０６】[0106]

【数６】 [Equation 6]

【０１０７】式（９）が成立する場合、ピッチ周期は連
続的に変化していると判定され、スイッチ４１５は探索
範囲決定部４０５の出力を選択して、これまで説明した
ように限定した探索範囲についてのみ適応符号帳４０６
の探索を行う。式（９）が成立しない場合、ピッチ周期
は連続的に変化していないと判定され、スイッチ４１５
は全探索部４１４を選択し、適応符号帳４０６内の全て
の候補について探索を行う。連続性判定部４１４の判定
結果は、出力端子４１６から出力される。さらに、連続
性判定部４１４からの出力がピッチ周期は連続的に変化
しているものではないというものであれば、ピッチ周期
分析部４０３で求めたピッチ周期は出力端子４１２から
出力する必要はない。When the equation (9) is satisfied, it is determined that the pitch cycle is continuously changing, and the switch 415 selects the output of the search range determining unit 405 to perform the limited search as described above. Adaptive codebook 406 only for range
Search for. When the expression (9) is not satisfied, it is determined that the pitch cycle does not change continuously, and the switch 415
Selects all search section 414 and searches for all candidates in adaptive codebook 406. The determination result of the continuity determination unit 414 is output from the output terminal 416. Furthermore, if the output from the continuity determination unit 414 is such that the pitch cycle does not change continuously, it is not necessary to output the pitch cycle obtained by the pitch cycle analysis unit 403 from the output terminal 412. .

【０１０８】（第１２の実施形態）図２８を用いて本発
明の第１２の実施形態を説明する。図２８において、入
力端子５０１、フレーム・サブフレーム構成部５０２、
ピッチ周期分析部５０３、サブフレームピッチ周期算出
部５０４、適応符号帳５０６、聴感重み付き合成フィル
タ５０７、聴感重みフィルタ５０８、減算器５０９、歪
算出部５１０、出力端子５１１，５１２は、図１６中の
参照符号２０１〜２０４、２０６〜２１２を付した要素
と同じであるので、ここでは説明を省略する。(Twelfth Embodiment) The twelfth embodiment of the present invention will be described with reference to FIG. In FIG. 28, an input terminal 501, a frame / subframe configuration unit 502,
The pitch cycle analysis unit 503, the subframe pitch cycle calculation unit 504, the adaptive codebook 506, the perceptual weighting synthesis filter 507, the perceptual weighting filter 508, the subtractor 509, the distortion calculation unit 510, and the output terminals 511 and 512 are shown in FIG. Since they are the same as the elements denoted by reference numerals 201 to 204 and 206 to 212, the description thereof will be omitted here.

【０１０９】本実施形態の特徴は、複数のサブフレーム
のサブフレームピッチ周期の変動を表す相対ピッチ周期
のパターン（相対ピッチパターンという）の複数個の集
合で構成される相対ピッチパターン符号帳５０５を有
し、複数のサブフレームの相対ピッチ周期をベクトルと
みなして、このベクトルと最も類似している相対ピッチ
パターン符号帳５０５内の相対ピッチパターンを選択す
るようにしている点にある。The feature of this embodiment is that the relative pitch pattern codebook 505 is composed of a plurality of sets of patterns of relative pitch periods (referred to as relative pitch patterns) that represent variations in subframe pitch periods of a plurality of subframes. That is, the relative pitch periods of a plurality of subframes are regarded as a vector, and the relative pitch pattern in the relative pitch pattern codebook 505 that is most similar to this vector is selected.

【０１１０】本実施形態によると、次のような効果があ
る。サブフレーム毎に相対ピッチ周期ΔＴ(k) をスカラ
ー量子化すると、例えば前述のように探索範囲ＮＲ＝８
とした場合、それを表現するのに３ビット必要になる。
すなわち、相対ピッチ周期ΔＴ(k) の情報を表すのに、
フレーム当たり３ビット×４サブフレーム＝１２ビット
だけ必要になる。According to this embodiment, the following effects can be obtained. If the relative pitch period ΔT (k) is scalar-quantized for each subframe, for example, the search range NR = 8 as described above.
, Then 3 bits are required to represent it.
That is, to represent the information of the relative pitch period ΔT (k),
Only 3 bits x 4 subframes = 12 bits are required per frame.

【０１１１】これに対し、本実施形態では出現頻度の高
い相対ピッチ周期のパターンを相対ピッチパターンとし
て持つ相対ピッチパターン符号帳５０５を有している。
ここで、仮に４つのサブフレーム（ｓｆ(0) 〜ｓｆ(3)
）の相対ピッチ周期ΔＴ(0)〜ΔＴ(3) で１ベクトル
（４次元）を表し、相対ピッチパターン符号帳５０５が
１２８（７ビット）種類の相対ピッチパターンで構成さ
れたとすると、本来４つのサブフレームの相対ピッチ周
期ΔＴ(k) を表すのに１２ビット必要であったものが７
ビットで表されることになり、大幅なビット削減が達成
されるという利点がある。On the other hand, in the present embodiment, the relative pitch pattern codebook 505 having the pattern of the relative pitch period having a high appearance frequency as the relative pitch pattern is provided.
Here, it is assumed that four subframes (sf (0) to sf (3)
) Relative pitch period ΔT (0) to ΔT (3) represents one vector (4 dimensions), and if the relative pitch pattern codebook 505 is composed of 128 (7 bits) types of relative pitch patterns, there are originally four 12 bits were required to represent the relative pitch period ΔT (k) of the subframe, but 7
Since it is expressed in bits, there is an advantage that a significant bit reduction is achieved.

【０１１２】相対ピッチパターン符号帳５０５は、次式
に示すように予め求めておいた代表的な相対ピッチパタ
ーンｐｖ(j) をＪ種類有している。ｐｖ(j) ＝｛ｖ(j,k) ；ｋ＝０〜K-1 ｝（１０）（０≦ｊ≦Ｊ−１）加算器５１３では、次式に示されるようにサブフレーム
ピッチ周期予測値ＳＴＰ(k) と第ｊ番目の相対ピッチパ
ターンｐｖ(j) とで加算を行い、Ｋ個のサブフレームの
ピッチ周期の組Ｔｘ(j) を求める。Ｔｘ(j) ＝ＳＴＰ(k) ＋ｖ(j,k) （１１）このようにして求めたピッチ周期の組Ｔｘ(j) に基づい
て、Ｋ個のサブフレームにまたがる寄与度が最大となる
ときの相対ピッチパターンを閉ループ的に探索する。こ
のとき、適応符号帳５０６の内部状態はサブフレーム毎
に更新される。このときのサブフレーム数はＫ個に限定
されることはなく、２以上、Ｋ以下のサブフレーム数に
対する相対ピッチ周期を求めることができる。また、計
算量を削減するために、まず計算量の少ない簡単な指標
（例えば寄与度の分子項）によって相対ピッチパターン
の候補を限定し、その残った候補に対して正確な寄与度
を求めて最終的に１つの相対ピッチパターンを決定して
もよい。The relative pitch pattern codebook 505 has J types of typical relative pitch patterns pv (j) which are obtained in advance as shown in the following equation. pv (j) = {v (j, k); k = 0 to K-1} (10) (0≤j≤J-1) In the adder 513, the subframe pitch period prediction is performed as shown in the following equation. The value STP (k) and the j-th relative pitch pattern pv (j) are added to obtain a set Tx (j) of pitch periods of K subframes. Tx (j) = STP (k) + v (j, k) (11) Based on the set of pitch periods Tx (j) thus obtained, when the contribution over K subframes is maximized The relative pitch pattern of is searched in a closed loop. At this time, the internal state of adaptive codebook 506 is updated for each subframe. The number of subframes at this time is not limited to K, and the relative pitch period can be obtained for the number of subframes of 2 or more and K or less. Moreover, in order to reduce the amount of calculation, first, the candidates of the relative pitch pattern are limited by a simple index with a small amount of calculation (for example, the numerator term of the contribution), and the accurate contribution is calculated for the remaining candidates. Finally, one relative pitch pattern may be determined.

【０１１３】（第１３の実施形態）図２９を用いて本発
明の第１３の実施形態を説明する。図２９において、入
力端子６０１、フレーム・サブフレーム構成部６０２、
ピッチ周期分析部６０３、サブフレームピッチ周期算出
部６０４、相対ピッチパターン符号帳６０５、適応符号
帳６０６、聴感重み付き合成フィルタ６０７、聴感重み
フィルタ６０８、減算器６０９、歪算出部６１０、出力
端子６１１，６１２、加算器６１３は、図２０中の参照
符号５０１〜５１３を付した要素と同じであるので、こ
こでは説明を省略する。(Thirteenth Embodiment) The thirteenth embodiment of the present invention will be described with reference to FIG. In FIG. 29, an input terminal 601, a frame / subframe configuration unit 602,
Pitch cycle analysis section 603, subframe pitch cycle calculation section 604, relative pitch pattern codebook 605, adaptive codebook 606, perceptual weighting synthesis filter 607, perceptual weighting filter 608, subtracter 609, distortion calculation section 610, output terminal 611. , 612 and the adder 613 are the same as the elements denoted by reference numerals 501 to 513 in FIG. 20, and therefore the description thereof is omitted here.

【０１１４】本実施形態の特徴は、適応符号帳６０６と
ともに雑音符号帳６１４の影響を考慮に入れて相対ピッ
チパターンを決定する点にある。このような構成によ
り、さらに正確な相対ピッチパターンを決定することが
可能になる。The feature of this embodiment is that the relative pitch pattern is determined in consideration of the influence of the noise codebook 614 together with the adaptive codebook 606. With such a configuration, it becomes possible to determine a more accurate relative pitch pattern.

【０１１５】このとき、適応ベクトルに乗算器６１５で
乗じられるゲインは端子６１６より与えられる。このゲ
インは、次式で表される理想ゲインｇｏｐｔ、または、
ここでは図示していない適応ベクトルゲイン符号帳の中
で最も理想ゲインｇｏｐｔに最も近い要素が用いられ
る。At this time, the gain by which the adaptive vector is multiplied by the multiplier 615 is given from the terminal 616. This gain is the ideal gain gopt expressed by the following equation, or
Here, the element closest to the ideal gain gopt in the adaptive vector gain codebook (not shown) is used.

【０１１６】[0116]

【数７】 [Equation 7]

【０１１７】ここで、ｒ(n) は目標ベクトル、ｐ(t,n)
はピッチ周期ｔにおける適応ベクトルを聴感重み付き合
成フィルタ６０７に通した信号を表す。Where r (n) is the target vector and p (t, n)
Represents a signal obtained by passing the adaptive vector in the pitch period t through the perceptual weighting synthesis filter 607.

【０１１８】同様に、雑音ベクトルに乗算器６１７で乗
じられるゲインは端子６１８より与えられる。このゲイ
ンは、次式で表される理想ゲインｂｏｐｔ、または、こ
こでは図示していない雑音ベクトルゲイン符号帳の中で
最も理想ゲインｂｏｐｔに最も近い要素が用いられる。Similarly, the gain by which the noise vector is multiplied by the multiplier 617 is given from the terminal 618. As this gain, an ideal gain bopt represented by the following expression or an element closest to the ideal gain bopt in a noise vector gain codebook not shown here is used.

【０１１９】[0119]

【数８】 [Equation 8]

【０１２０】ここで、ｔ(n）は目標ベクトル、ｅ(i,n)
はインデックスｉにおける雑音ベクトルを聴感重み付き
合成フィルタ６０７に通した信号を表す。Where t (n) is the target vector and e (i, n)
Represents a signal obtained by passing the noise vector at index i through the perceptually weighted synthesis filter 607.

【０１２１】複数サブフレームにまたがる寄与度が最大
となるときの相対ピッチパターンが決定されたときに、
出力端子６１１からそのときの相対ピッチパターンのイ
ンデックスが出力される。また、出力端子６１２から
は、ピッチ周期分析部６０３で求めたピッチ周期の情報
が出力される。When the relative pitch pattern when the degree of contribution over a plurality of subframes is maximum is determined,
The index of the relative pitch pattern at that time is output from the output terminal 611. Further, the output terminal 612 outputs the pitch cycle information obtained by the pitch cycle analysis unit 603.

【０１２２】このとき、適応符号帳６０６の内部状態は
サブフレーム毎に更新される。このときのサブフレーム
数はＫ個に限定されず、２以上、Ｋ以下のサブフレーム
数に対する相対ピッチ周期を求めることができる。ま
た、計算量を削減するために、まず計算量の少ない簡単
な指標（例えば寄与度の分子項）によって相対ピッチバ
ターンの候補を限定し、その残った候補に対して正確な
寄与度を求めて最終的に１つの相対ピッチパターンを決
定してもよい。At this time, the internal state of adaptive codebook 606 is updated for each subframe. The number of subframes at this time is not limited to K, and the relative pitch period can be obtained for the number of subframes of 2 or more and K or less. Also, in order to reduce the amount of calculation, first, the relative pitch pattern candidates are limited by a simple index with a small amount of calculation (for example, the numerator term of the contribution degree), and the accurate contribution degree is calculated for the remaining candidates. Finally, one relative pitch pattern may be determined.

【０１２３】（第１４の実施形態）図３０を用いて本発
明の第１４の実施形態を説明する。本実施形態は、ＣＥ
ＬＰ方式のエンコーダ（音声符号化装置）に本発明を適
用した例である。また、本実施形態ではこれまで説明し
てきた実施形態をいくつか組み合わせた一つの代表的な
構成になっているが、これに限定されるものではなく、
様々な組み合わせをＣＥＬＰ方式に適用することができ
る。(Fourteenth Embodiment) The fourteenth embodiment of the present invention will be described with reference to FIG. In this embodiment, the CE
This is an example in which the present invention is applied to an LP encoder (speech encoder). Further, although the present embodiment has one representative configuration in which some of the embodiments described above are combined, the present invention is not limited to this.
Various combinations can be applied to the CELP method.

【０１２４】図３０においては、入力端子５０１からデ
ィジタル化された音声信号が入力され、フレーム・サブ
フレーム構成部５０２においてフレームとサブフレーム
が構成される。その後、フレーム・サブフレームに分割
された音声信号はＬＰＣ係数分析部５２１に与えられ、
ＬＰＣ分析によってＬＰＣ係数が算出される。このＬＰ
Ｃ係数は、聴感重みフィルタ５０８および聴感重み付き
合成フィルタ５０７の伝達関数を決定する際に利用され
る。In FIG. 30, a digitized audio signal is input from an input terminal 501, and a frame / subframe forming section 502 forms a frame and a subframe. After that, the audio signal divided into the frames and subframes is given to the LPC coefficient analysis unit 521,
The LPC coefficient is calculated by the LPC analysis. This LP
The C coefficient is used when determining the transfer functions of the perceptual weighting filter 508 and the perceptual weighting synthesis filter 507.

【０１２５】ＬＰＣ係数分析部５２１で求められたＬＰ
Ｃ係数は、ＬＰＣ係数量子化部５２２において量子化さ
れ、量子化によって求められるインデックスはマルチプ
レクサ５２３に与えられ、後述する他の情報とともに多
重化される。また、量子化後に復号されたＬＰＣ係数
は、聴感重み付き合成フィルタ５０７の伝達関数を決定
する際に用いられる。LP determined by the LPC coefficient analysis unit 521
The C coefficient is quantized in the LPC coefficient quantization unit 522, and the index obtained by the quantization is given to the multiplexer 523, and is multiplexed with other information described later. The LPC coefficient decoded after quantization is used when determining the transfer function of the perceptual weighting synthesis filter 507.

【０１２６】入力音声信号は、ピッチ周期分析位置決定
部５２５にも与えられる。ピッチ周期分析位置決定部５
２５では、音声信号の予め定められた短区間のパワーを
規定個数求め、その中で短区間パワーが最も大きくなる
領域を表すインデックスをピッチ周期分析部５０３とマ
ルチプレクサ５２３に与える。ピッチ周期分析部５０３
では、ピッチ周期分析位置決定部５２５で決定された分
析位置を中心に音声信号、予測残差信号または低域通過
フィルタを通過後の予測残差信号等を用いてピッチ周期
の分析を行う。The input voice signal is also given to the pitch period analysis position determining unit 525. Pitch cycle analysis position determination unit 5
In 25, a prescribed number of powers of a predetermined short section of the audio signal are obtained, and an index representing a region in which the short section power is the largest is given to the pitch cycle analysis unit 503 and the multiplexer 523. Pitch cycle analysis unit 503
Then, the pitch period is analyzed using the voice signal, the prediction residual signal, the prediction residual signal after passing through the low pass filter, and the like, centering on the analysis position determined by the pitch period analysis position determining unit 525.

【０１２７】ここで求めたピッチ周期と前フレーム処理
で求めたピッチ周期との連続性を連続性判定部５２６で
判定し、その判定結果をマルチプレクサ５２３に与え
る。判定の結果が連続であるとなった場合、ピッチ周期
分析部５０３で求めたピッチ周期をマルチプレクサ５２
３に与えると共に、サブフレームピッチ周期予測値算出
部５０４においてサブフレーム毎のピッチ周期予測値を
求める。探索範囲決定部５３０では、サブフレームピッ
チ周期予測値を基にサブフレーム毎のピッチ周期探索範
囲を決定する。この場合、探索範囲決定部５３０で決定
された探索範囲に含まれるピッチ周期の適応ベクトルが
候補として選択される。The continuity of the pitch cycle obtained here and the pitch cycle obtained in the previous frame processing is judged by the continuity judging section 526, and the judgment result is given to the multiplexer 523. If the determination result is continuous, the pitch period obtained by the pitch period analysis unit 503 is set to the multiplexer 52.
3, the subframe pitch period prediction value calculation unit 504 calculates the pitch period prediction value for each subframe. Search range determination section 530 determines a pitch period search range for each subframe based on the subframe pitch period prediction value. In this case, the adaptive vector of the pitch period included in the search range determined by the search range determining unit 530 is selected as a candidate.

【０１２８】連続性判定部５２６の判定結果が連続でな
いとなった場合、全探索部５２７が選択されるようスイ
ッチ５３１を切り替え、適応符号帳５２８に含まれる全
て適応ベクトルが候補として選択される。この場合、ピ
ッチ周期分析部５０３で求めたピッチ周期はマルチプレ
クサ５２３に与える必要はない。When the determination result of the continuity determination unit 526 is not continuous, the switch 531 is switched so that the full search unit 527 is selected, and all adaptive vectors included in the adaptive codebook 528 are selected as candidates. In this case, the pitch cycle calculated by the pitch cycle analysis unit 503 does not need to be given to the multiplexer 523.

【０１２９】乗算器５３２では、適応符号帳５２８から
選択される適応ベクトルと適応ベクトルゲイン符号帳５
３３から選択される適応ベクトルゲインとの積がとられ
る。乗算器５３４では同様に、雑音符号帳５２９から選
択される雑音ベクトルと雑音ベクトルゲイン符号帳５３
５から選択される雑音ベクトルゲインとの積がとられ
る。加算器５３６では、乗算器５３２，５３４から得ら
れるそれぞれの信号の和をとり、駆動ベクトルを生成す
る。In the multiplier 532, the adaptive vector selected from the adaptive codebook 528 and the adaptive vector gain codebook 5
The product is multiplied by the adaptive vector gain selected from 33. Similarly, in the multiplier 534, the noise vector selected from the noise codebook 529 and the noise vector gain codebook 53
The product with the noise vector gain selected from 5 is taken. The adder 536 takes the sum of the signals obtained from the multipliers 532 and 534 to generate a drive vector.

【０１３０】このようにして生成された駆動ベクトルを
聴感重み付き合成フィルタ５０７に通して、合成ベクト
ルを生成する。減算器５０９では、音声信号を聴感重み
フィルタ５０８に通して得られる目標ベクトルと合成ベ
クトルとの差をとり、この差信号を基に歪算出部５１１
で歪を求める。この歪が最小となるときの適応ベクト
ル、適応ベクトルゲイン、雑音ベクトルおよび雑音ベク
トルゲインの組み合わせを効率的に探索する。The drive vector thus generated is passed through the perceptual weighting synthesis filter 507 to generate a synthesis vector. The subtractor 509 calculates the difference between the target vector obtained by passing the voice signal through the perceptual weighting filter 508 and the combined vector, and based on this difference signal, the distortion calculation unit 511.
To find the distortion. The combination of the adaptive vector, the adaptive vector gain, the noise vector, and the noise vector gain when this distortion is minimized is efficiently searched.

【０１３１】例えば、図３１のフローチャートに示され
るように、サブフレーム毎に適応ベクトル、適応ベクト
ルゲイン、雑音ベクトル、雑音ベクトルゲインの順に直
列的に求めていく方法がある。また、図３２のフローチ
ャートに示されるように、サブフレーム毎に適応ベクト
ルゲインと雑音ベクトルゲインをベクトル量子化によっ
て同時最適化を図る構成の場合には、適応ベクトル、雑
音ベクトル、ゲインベクトルの順に求める方法もある。For example, as shown in the flowchart of FIG. 31, there is a method of serially obtaining an adaptive vector, an adaptive vector gain, a noise vector, and a noise vector gain for each subframe. Further, as shown in the flowchart of FIG. 32, in the case of a configuration in which the adaptive vector gain and the noise vector gain are simultaneously optimized by vector quantization for each subframe, the adaptive vector, the noise vector, and the gain vector are obtained in this order. There is also a method.

【０１３２】すなわち、図３１ではまずステップＳ６０
１でカウンタｋを０に設定した後、ステップＳ６０２，
Ｓ６０３，Ｓ６０４，Ｓ６０５で適応ベクトル、適応ベ
クトルゲイン、雑音ベクトル、雑音ベクトルゲインを順
次求め、ステップＳ６０６でカウンタｋを１だけインク
リメントする。ステップＳ６０７でカウンタｋがＫに一
致しているかどうかの判定を行う。一致していなけれ
ば、ステップＳ６０２に戻って処理を継続し、一致した
ならば処理は終了する。That is, in FIG. 31, first, step S60.
After setting the counter k to 0 in step 1, step S602
An adaptive vector, an adaptive vector gain, a noise vector, and a noise vector gain are sequentially obtained in S603, S604, and S605, and the counter k is incremented by 1 in step S606. In step S607, it is determined whether the counter k matches K. If they do not match, the process returns to step S602 to continue the process, and if they match, the process ends.

【０１３３】また、図３２ではまずステップＳ７０１で
カウンタｋを０に設定した後、ステップＳ７０２，Ｓ７
０３，Ｓ７０４で適応ベクトル、雑音ベクトル、ゲイン
ベクトル（適応ベクトルゲインと雑音ベクトルゲイン）
を順次求め、ステップＳ７０５でカウンタｋを１だけイ
ンクリメントする。ステップＳ７０６でカウンタｋがＫ
に一致しているかどうかの判定を行う。一致していなけ
れば、ステップＳ７０２に戻って処理を継続し、一致し
たならば処理は終了する。Further, in FIG. 32, the counter k is first set to 0 in step S701, and then steps S702 and S7 are performed.
03, S704: Adaptive vector, noise vector, gain vector (adaptive vector gain and noise vector gain)
Are sequentially obtained, and the counter k is incremented by 1 in step S705. In step S706, the counter k is K.
It is determined whether or not If they do not match, the process returns to step S702 to continue the process, and if they match, the process ends.

【０１３４】このようにして歪が最小となるときの適応
ベクトル、適応ベクトルゲイン、雑音ベクトル、雑音ベ
クトルゲインを表すインデックスをマルチプレクサ５２
３に与える。連続性判定部５２６において連続であると
判定されている場合、適応ベクトルのインデックスはサ
ブフレームピッチ周期予測値に対する相対値として表さ
れる。In this way, the multiplexer 52 is provided with an index representing the adaptive vector, the adaptive vector gain, the noise vector, and the noise vector gain when the distortion is minimized.
Give to 3. When the continuity determination unit 526 determines that the adaptive vector is continuous, the index of the adaptive vector is represented as a relative value with respect to the subframe pitch cycle predicted value.

【０１３５】マルチプレクサ５２３では、連続性判定部
５２６の判定結果が連続である場合とそうでない場合と
で多重化する対象が異なる。すなわち、連続性判定部５
２６の判定結果が連続である場合、ＬＰＣ係数量子化部
５２２で求めたＬＰＣ係数インデックス、ピッチ周期分
析位置決定部５２５で求めた分析位置インデックス、連
続性判定部５２６で求めた判定情報、ピッチ周期分析部
５０３で求めたピッチ周期情報、適応ベクトルのサブフ
レームピッチ周期予測値に対する相対値、適応ベクトル
ゲインのインデックス、雑音ベクトルインデックスおよ
び雑音ベクトルゲインのインデックスを多重化して符号
化データ出力端子５２４から符号化データとして出力す
る。一方、連続性判定部５２６の判定結果が連続でない
場合、ＬＰＣ係数量子化部５２２で求めたＬＰＣ係数イ
ンデックス、連続性判定部５２６で求めた判定情報、適
応ベクトルのインデックス、適応ベクトルゲインのイン
デックス、雑音ベクトルのインデックスおよび雑音ベク
トルゲインのインデックスを多重化して出力端子５２４
から出力する。In the multiplexer 523, the object to be multiplexed differs depending on whether the determination result of the continuity determining unit 526 is continuous or not. That is, the continuity determination unit 5
26 is continuous, the LPC coefficient index obtained by the LPC coefficient quantizing unit 522, the analysis position index obtained by the pitch period analysis position determining unit 525, the determination information obtained by the continuity determining unit 526, the pitch period The pitch period information obtained by the analysis unit 503, the relative value of the adaptive vector to the predicted value of the subframe pitch period, the index of the adaptive vector gain, the noise vector index, and the index of the noise vector gain are multiplexed and encoded from the encoded data output terminal 524. Output as the converted data. On the other hand, when the determination result of the continuity determining unit 526 is not continuous, the LPC coefficient index obtained by the LPC coefficient quantizing unit 522, the determination information obtained by the continuity determining unit 526, the adaptive vector index, the adaptive vector gain index, The noise vector index and the noise vector gain index are multiplexed and output terminal 524
Output from.

【０１３６】（第１５の実施形態）図３３を用いて本発
明の第１５の実施形態を説明する。本実施形態は本発明
をＣＥＬＰ方式のデコーダ（音声復号化装置）に適用し
た例を示している。本実施形態は、これまで説明した実
施形態をいくつか組み合わせて構成されたＣＥＬＰエン
コーダ部に対応するデコーダ部を表しているが、これに
限定されるものではなく、様々な組み合せにより構成さ
れるＣＥＬＰエンコーダ部に対応するデコーダ部に適用
することができる。(Fifteenth Embodiment) The fifteenth embodiment of the present invention will be described with reference to FIG. The present embodiment shows an example in which the present invention is applied to a CELP type decoder (speech decoding device). Although the present embodiment shows a decoder unit corresponding to a CELP encoder unit configured by combining some of the embodiments described above, the present invention is not limited to this, and CELP configured by various combinations. It can be applied to the decoder unit corresponding to the encoder unit.

【０１３７】入力端子８０１から多重化された符号化デ
ータが入力され、デマルチプレクサ８０２において種々
のパラメータのインデックスに変換される。ＬＰＣ係数
復号部８１１では、ＬＰＣ係数インデックスを基にＬＰ
Ｃ係数を復号し、合成フィルタ８１２に与える。サブフ
レームピッチ周期生成部８０３では、ピッチ周期に関連
する情報が与えられ、適応符号帳８０４で用いられるピ
ッチ周期を生成する。このピッチ周期の生成についての
詳細は、図３４を用いて後述する。The multiplexed coded data is input from the input terminal 801 and converted into indexes of various parameters in the demultiplexer 802. The LPC coefficient decoding unit 811, based on the LPC coefficient index, sets the LP
The C coefficient is decoded and given to the synthesis filter 812. The sub-frame pitch cycle generation unit 803 is given information related to the pitch cycle, and generates the pitch cycle used in the adaptive codebook 804. Details of generation of this pitch period will be described later with reference to FIG. 34.

【０１３８】ここで生成されたピッチ周期を指標とし
て、適応符号帳８０４から適応ベクトルを求める。この
適応ベクトルに、適応ベクトルゲインインデックスを指
標にして適応ベクトルゲイン符号帳８０５から求められ
た適応ベクトルゲインを乗算器８０６で乗じる。同様
に、雑音符号帳８０７から雑音ベクトルインデックスを
指標にして求められた雑音ベクトルに、雑音ベクトルゲ
インインデックスを指標に雑音ベクトルゲイン符号帳８
０８から求められた雑音ベクトルゲインを乗算器８０９
で乗じる。An adaptive vector is obtained from adaptive codebook 804 using the pitch period generated here as an index. A multiplier 806 multiplies this adaptive vector by the adaptive vector gain obtained from the adaptive vector gain codebook 805 using the adaptive vector gain index as an index. Similarly, a noise vector obtained from the noise codebook 807 using the noise vector index as an index, and a noise vector gain codebook 8 using the noise vector gain index as an index.
The noise vector gain obtained from 08 is multiplied by a multiplier 809.
Ride with.

【０１３９】これら乗算後の２つの信号を加算器８１０
で加算して駆動信号を生成し、合成フィルタ８１２に駆
動信号を与えて合成信号を生成する。この合成信号をポ
ストフィルタ８１３に入力し、ホルマン卜強調、ピッチ
強調、ゲイン調整などの聴感的な改善を施した後、出力
端子８１５より出力する。The two signals after the multiplication are added to the adder 810.
Is added to generate a drive signal, and the drive signal is given to the synthesis filter 812 to generate a synthesis signal. This combined signal is input to the post filter 813, and after being subjected to perceptual improvement such as formman emphasis, pitch emphasis, and gain adjustment, it is output from the output terminal 815.

【０１４０】次に、図３４を用いてサブフレームピッチ
周期生成部８０３の説明を行う。入力端子９０１からピ
ッチ周期の連続性の判定情報が入力され、その情報に応
じてスイッチ９０４とスイッチ９１１を切り替える。ピ
ッチ周期の連続性判定情報が「不連続」を表す揚合、入
力端子９０２からはサブフレーム毎のピッチ周期インデ
ックスが入力され、スイッチ９０４を経由してサブフレ
ームピッチ周期復号部９１０でサブフレームピッチ周期
が復号され、スイッチ９１１を経由して出力端子９１２
から出力される。Next, the subframe pitch period generator 803 will be described with reference to FIG. Pitch cycle continuity determination information is input from the input terminal 901, and the switch 904 and the switch 911 are switched according to the information. When the pitch cycle continuity determination information indicates “discontinuity”, the pitch cycle index for each subframe is input from the input terminal 902, and the subframe pitch cycle decoding unit 910 inputs the subframe pitch via the switch 904. The cycle is decoded, and the output terminal 912 is passed through the switch 911.
Is output from.

【０１４１】ピッチ周期の連続性判定情報が「連続」を
表す揚合、入力端子９０２からはピッチ周期分析位置イ
ンデックス、ピッチ周期インデックス、相対ピッチ周期
インデックスが与えられ、スイッチ９０４を経由してそ
れぞれピッチ周期分析位置復号部９０５、ピッチ周期復
号部９０６および相対ピッチ周期復号部９０７にそれぞ
れ与えられる。その結果求められる、ピッチ周期分析位
置およびピッチ周期を用いてサブフレームピッチ周期予
測値算出部９０８でサブフレームピッチ周期予測値を求
める。サブフレームピッチ周期算出部９０９では、サブ
フレームピッチ周期予測値と相対ピッチ周期とを用いて
サブフレームピッチ周期を算出し、スイッチ９１１を経
由して出力端子９１２より出力する。When the pitch cycle continuity determination information indicates "continuous", a pitch cycle analysis position index, a pitch cycle index, and a relative pitch cycle index are given from the input terminal 902, and the pitch is analyzed via the switch 904. It is given to the period analysis position decoding unit 905, the pitch period decoding unit 906, and the relative pitch period decoding unit 907, respectively. The subframe pitch period prediction value calculation unit 908 obtains the subframe pitch period prediction value using the pitch period analysis position and the pitch period obtained as a result. The subframe pitch cycle calculation unit 909 calculates the subframe pitch cycle using the predicted value of the subframe pitch cycle and the relative pitch cycle, and outputs the subframe pitch cycle from the output terminal 912 via the switch 911.

【０１４２】図３５に、本発明に係る音声符号化／復号
化方法を実現するコンピュータシステムの構成を示す。
このコンピュータシステムは例えばパーソナルコンピュ
ータであり、演算処理を司るＣＰＵ１００１と、キーボ
ード、ポインティングデバイスおよびマイクロフォンな
どの入力部１００２と、ディスプレイおよびスピーカな
どの出力部１００３と、主記憶としてのＲＯＭ１００４
およびＲＡＭ１００５と、外部記憶装置としてのハード
ディスク装置１００６、フロッピディスク装置１００７
および光ディスク装置１００８をバス１００９により接
続して構成されている。FIG. 35 shows the configuration of a computer system which realizes the voice encoding / decoding method according to the present invention.
This computer system is, for example, a personal computer, and has a CPU 1001 that controls arithmetic processing, an input unit 1002 such as a keyboard, a pointing device, and a microphone, an output unit 1003 such as a display and a speaker, and a ROM 1004 as main memory.
And a RAM 1005, a hard disk device 1006 as an external storage device, and a floppy disk device 1007.
And an optical disk device 1008 connected by a bus 1009.

【０１４３】ここで、ハードディスク装置１００６、フ
ロッピディスク装置１００７および光ディスク装置１０
０８のいずれかの記録媒体に、上述した実施形態で説明
した音声符号化処理を実行するためのプログラムおよび
符号化データが格納される。そして、このプログラムに
従って入力部１００１から入力される入力音声信号に対
してＣＰＵ１００１で音声符号化処理が行われ、また記
録媒体から読み出された符号化データに対してＣＰＵ１
００１により音声復号化処理が行われ、出力部１００３
から出力される。このようにすることにより、通常のパ
ーソナルコンピュータを用いて本発明の音声符号化／復
号化処理を実施することができる。Here, the hard disk drive 1006, floppy disk drive 1007 and optical disk drive 10
Any one of the recording media 08 stores a program and encoded data for executing the audio encoding process described in the above embodiment. Then, according to this program, the CPU 1001 performs the audio encoding process on the input audio signal input from the input unit 1001, and the CPU 1 executes the encoded data read from the recording medium.
The audio decoding processing is performed by 001, and the output unit 1003
Is output from. By doing so, it is possible to carry out the voice encoding / decoding processing of the present invention using an ordinary personal computer.

【０１４４】本発明は、以下のように種々変形して実施
することができる。（１）以上の実施形態では、サブフレームピッチ周期予
測値を求める方法として補間もしくは外挿を用いたが、
これに限定されるものではなく、例えば予測に用いるピ
ッチ周期の数を増やして予測の精度を向上させたり、高
次の関数やスプライン関数といったより高度な予測を行
うことも可能である。こうすることにより、サブフレー
ムピッチの実測値と予測値との誤差が減少し、より少な
い情報量でサブフレームピッチ周期を表すことができ
る。The present invention can be implemented with various modifications as follows. (1) In the above embodiments, interpolation or extrapolation is used as a method of obtaining the subframe pitch period prediction value.
The present invention is not limited to this. For example, the number of pitch periods used for prediction may be increased to improve the accuracy of prediction, or higher-level prediction such as a higher-order function or spline function may be performed. By doing so, the error between the actually measured value and the predicted value of the subframe pitch is reduced, and the subframe pitch period can be expressed with a smaller amount of information.

【０１４５】（２）ピッチ周期分析部で求めたピッチ周
期を量子化する際に、過去に求めたピッチ周期で予測し
て、その予測値との差分を量子化することにより量子化
効率を向上させることもできる。(2) When quantizing the pitch period obtained by the pitch period analysis unit, prediction is performed with the pitch period obtained in the past, and the difference from the predicted value is quantized to improve the quantization efficiency. You can also let it.

【０１４６】（３）サブフレームピッチ周期予測値から
探索範囲を決定する際に、サブフレームピッチ周期予測
値の近傍にのみ設定するのではなく、サブフレームピッ
チ周期予測値の整数倍の近傍もしくは整数分の１の近傍
に設定することもできる。(3) When the search range is determined from the subframe pitch cycle predicted value, the search range is not set only in the vicinity of the subframe pitch cycle predicted value, but in the vicinity of an integer multiple of the subframe pitch cycle predicted value or an integer. It can also be set in the vicinity of one-half.

【０１４７】（４）聴感重みフィルタの一構成要素であ
るピッチフィルタのピッチ周期の算出を目的として、こ
れまで説明してきた実施形態を適用してもよい。同様
に、ここでは明示していないポストフィルタの一構成要
素であるピッチフィルタのピッチ周期の算出を目的とし
て、これまで説明してきた実施形態を適用してもよい。
その場合、付加情報は必要がないので、容易に音声符号
化装置に適用できる。(4) The embodiments described above may be applied for the purpose of calculating the pitch period of the pitch filter which is one component of the perceptual weighting filter. Similarly, the embodiments described so far may be applied for the purpose of calculating the pitch period of the pitch filter, which is one component of the post filter not explicitly shown here.
In that case, no additional information is required, so that it can be easily applied to a voice encoding device.

【０１４８】（５）以上述べてきた実施形態による音声
符号化方法をフレーム単位で切り替えて使用してもよ
い。その場合、どの実施形態の方法を使用したかを表す
ための付加情報が新たに必要となる。(5) The speech coding method according to the above-described embodiments may be switched and used in frame units. In that case, additional information is newly required to indicate which method of the embodiment is used.

【０１４９】[0149]

【発明の効果】以上説明したように、本発明の音声符号
化方法によれば、合成音の品質を維持しつつ、サブフレ
ームピッチ周期を探索する際の計算量を削減することが
できるとともに、サブフレームピッチ周期の情報を表す
ためのビット数を効果的に削減することができる。As described above, according to the speech coding method of the present invention, it is possible to reduce the amount of calculation when searching the subframe pitch period while maintaining the quality of synthesized speech. It is possible to effectively reduce the number of bits for expressing the information of the subframe pitch period.

【０１５０】[0150]

[Brief description of drawings]

【図１】本発明の基本原理を説明するための図FIG. 1 is a diagram for explaining the basic principle of the present invention.

【図２】本発明に係る音声符号化方法におけるピッチ
周期分析の基本動作を説明するためのブロック図FIG. 2 is a block diagram for explaining the basic operation of pitch period analysis in the speech coding method according to the present invention.

【図３】フレーム構造とサブフレーム構造を示す図FIG. 3 is a diagram showing a frame structure and a subframe structure.

【図４】本発明の第２の実施形態に係る音声符号化方
法を説明するためのブロック図FIG. 4 is a block diagram for explaining a speech encoding method according to a second embodiment of the present invention.

【図５】第２の実施形態におけるサブフレームピッチ
抽出部の構成を示すブロック図FIG. 5 is a block diagram showing a configuration of a subframe pitch extraction unit according to the second embodiment.

【図６】第２の実施形態におけるサブフレームピッチ
周期の算出方法を説明するための図FIG. 6 is a diagram for explaining a method of calculating a subframe pitch period according to the second embodiment.

【図７】第２の実施形態におけるサブフレームピッチ
周期の算出方法を説明するための図FIG. 7 is a diagram for explaining a method of calculating a subframe pitch period according to the second embodiment.

【図８】第２の実施形態におけるサブフレームピッチ
周期算出の処理手順を説明するためのフローチャートFIG. 8 is a flowchart for explaining a processing procedure of subframe pitch period calculation in the second embodiment.

【図９】本発明の第３の実施形態におけるサブフレー
ムピッチ周期の算出方法を説明するための図FIG. 9 is a diagram for explaining a method of calculating a subframe pitch period according to the third embodiment of the present invention.

【図１０】第３の実施形態におけるサブフレームピッ
チ周期算出の処理手順を説明するためのフローチャートFIG. 10 is a flowchart for explaining a processing procedure of subframe pitch period calculation in the third embodiment.

【図１１】第３の実施形態の効果を説明するための図FIG. 11 is a diagram for explaining the effect of the third embodiment.

【図１２】本発明の第４の実施形態におけるサブフレ
ームピッチ周期算出方法を説明するための図FIG. 12 is a diagram for explaining a subframe pitch period calculation method according to the fourth embodiment of the present invention.

【図１３】第４の実施形態におけるサブフレームピッ
チ周期算出の処理手順を説明するためのフローチャートFIG. 13 is a flowchart for explaining a processing procedure of subframe pitch period calculation in the fourth embodiment.

【図１４】本発明の第５の実施形態におけるサブフレ
ームピッチ周期算出方法を説明するための図FIG. 14 is a diagram for explaining a subframe pitch period calculation method according to the fifth embodiment of the present invention.

【図１５】第５の実施形態におけるサブフレームピッ
チ周期算出の処理手順を説明するためのフローチャートFIG. 15 is a flowchart for explaining a processing procedure of subframe pitch period calculation in the fifth embodiment.

【図１６】本発明の第６の実施形態に係る音声符号化
方法を説明するためのブロック図FIG. 16 is a block diagram for explaining a speech coding method according to a sixth embodiment of the present invention.

【図１７】第６の実施形態におけるサブフレームピッ
チ周期算出の処理手順を説明するためのフローチャート
の一部を示す図FIG. 17 is a diagram showing a part of a flowchart for explaining a processing procedure of subframe pitch period calculation in the sixth embodiment.

【図１８】第６の実施形態におけるサブフレームピッ
チ周期算出の処理手順を説明するためのフローチャート
の他の一部を示す図FIG. 18 is a diagram showing another part of the flowchart for explaining the processing procedure of subframe pitch period calculation in the sixth embodiment.

【図１９】本発明の第７の実施形態におけるサブフレ
ームピッチ周期算出の処理手順を説明するためのフロー
チャートの一部を示す図FIG. 19 is a diagram showing a part of a flowchart for explaining a processing procedure of subframe pitch period calculation in the seventh embodiment of the present invention.

【図２０】第７の実施形態におけるサブフレームピッ
チ周期算出の処理手順を説明するためのフローチャート
の他の一部を示す図FIG. 20 is a diagram showing another part of the flowchart for explaining the processing procedure of subframe pitch period calculation in the seventh embodiment.

【図２１】本発明の第８の実施形態におけるサブフレ
ームピッチ周期算出の処理手順を説明するためのフロー
チャートの一部を示す図FIG. 21 is a diagram showing a part of a flowchart for explaining a processing procedure of subframe pitch period calculation according to an eighth embodiment of the present invention.

【図２２】第８の実施形態におけるサブフレームピッ
チ周期算出の処理手順を説明するためのフローチャート
の他の一部を示す図FIG. 22 is a diagram showing another part of the flowchart for explaining the processing procedure of subframe pitch period calculation in the eighth embodiment.

【図２３】本発明の第９の実施形態におけるサブフレ
ームピッチ周期算出の処理手順を説明するためのフロー
チャートの一部を示す図FIG. 23 is a diagram showing a part of a flowchart for explaining a processing procedure of subframe pitch period calculation in the ninth embodiment of the present invention.

【図２４】第９の実施形態におけるサブフレームピッ
チ周期算出の処理手順を説明するためのフローチャート
の他の一部を示す図FIG. 24 is a diagram showing another part of the flowchart for explaining the processing procedure of subframe pitch period calculation in the ninth embodiment.

【図２５】本発明の第１０の実施形態に係る音声符号
化方法を説明するためのブロック図FIG. 25 is a block diagram for explaining a speech encoding method according to a tenth embodiment of the present invention.

【図２６】第１０の実施形態の効果を説明するための
図FIG. 26 is a diagram for explaining the effect of the tenth embodiment.

【図２７】本発明の第１１の実施形態に係る音声符号
化方法を説明するためのブロック図FIG. 27 is a block diagram for explaining a speech encoding method according to an eleventh embodiment of the present invention.

【図２８】本発明の第１２の実施形態に係る音声符号
化方法を説明するためのブロック図FIG. 28 is a block diagram for explaining a speech encoding method according to a twelfth embodiment of the present invention.

【図２９】本発明の第１３の実施形態に係る音声符号
化方法を説明するためのブロック図FIG. 29 is a block diagram for explaining a speech coding method according to a thirteenth embodiment of the present invention.

【図３０】本発明の音声符号化方法をＣＥＬＰ方式に
適用した第１４の実施形態に係る音声符号化装置の構成
を示すブロック図FIG. 30 is a block diagram showing the configuration of a speech encoding apparatus according to the fourteenth embodiment in which the speech encoding method of the present invention is applied to the CELP system.

【図３１】第１４の実施形態における適応ベクトル、
適応ベクトルゲイン、雑音ベクトルおよび雑音ベクトル
ゲインを求める手順を説明するためのフローチャートFIG. 31 is an adaptive vector according to the fourteenth embodiment,
Flowchart for explaining the procedure for obtaining adaptive vector gain, noise vector, and noise vector gain

【図３２】第１４の実施形態における適応ベクトル、
雑音ベクトルおよびゲインベクトルを求める手順を説明
するためのフローチャートFIG. 32 is an adaptive vector according to the fourteenth embodiment,
Flowchart for explaining the procedure for obtaining the noise vector and the gain vector

【図３３】本発明の音声復号化方法をＣＥＬＰ方式に
適用した第１５の実施形態に係る音声復号化装置の構成
を示すブロック図FIG. 33 is a block diagram showing the configuration of a speech decoding apparatus according to the fifteenth embodiment in which the speech decoding method of the present invention is applied to the CELP method.

【図３４】第１５の実施形態におけるサブフレームピ
ッチ周期生成部の構成を示すブロック図FIG. 34 is a block diagram showing the configuration of a subframe pitch period generation unit in the fifteenth embodiment.

【図３５】本発明に係る音声符号化／復号化方法を実
現するコンピュータシステムの構成を示すブロック図FIG. 35 is a block diagram showing the configuration of a computer system that realizes a speech encoding / decoding method according to the present invention.

[Explanation of symbols]

１１…音声信号入力端子１２…フレーム構成部１３…ピッチ周期分析部１４…ピッチ周期情報出力端子２１…音声信号入力端子２２…フレーム・サブフレーム構成部２３…ピッチ周期分析部２４…サブフレームピッチ周期抽出部２５…サブフレームピッチ周期情報出力端子２０１，３０１，４０１，５０１，６０１…音声信号入
力端子２０２，３０２，４０２，５０２，６０２…フレーム・
サブフレーム構成部２０３，３０３，４０３，５０３，６０３…ピッチ周期
分析部２０４，３０４，４０４，５０４，６０４…サブフレー
ムピッチ周期予測値算出部２０５，３０５，４０５，５３０…探索範囲決定部２０６，３０６，４０６，５０６，５２８，６０６…適
応符号帳２０７，３０７，４０７，５０７，６０７…聴感重み付
き合成フィルタ２０８，３０８，４０８，５０８，６０８…聴感重みフ
ィルタ２０９，３０９，４０９，５０９，６０９…減算器２１０，３１０，４１０，５１０，６１０…歪算出部２１１，３１１，４１１，５１１，６１１…歪最小時の
相対ピッチ周期情報出力端子２１２，３１２，４１２，５１２，６１２…ピッチ周期
情報出力端子３１３，５２５…ピッチ周期分析位置決定部３１４…ピッチ周期分析位置情報出力端子４１３，５２６…連続性判定部４１４，５２７…全探索部４１５，５３１…スイッチ４１６…連続性判定結果出力端子５０５，６０５…相対ピッチパターン符号帳５１３，５３６，６１３，６１９…加算器５２１…ＬＰＣ係数分析部５２２…ＬＰＣ係数量子化部５２３…マルチプレクサ５２４…符号化データ出力端子５２９，６１４…雑音符号帳５３２，５３４，６１５，６１７…乗算器５３３…適応ベクトル符号帳５３５…適応ベクトルゲイン符号帳８０１…符号化データ入力端子８０２…デマルチプレクサ８０３…サブフレームピッチ周期生成部８０４…適応符号帳８０５…適応ベクトルゲイン符号帳８０６，８０９…乗算器８０７…雑音符号帳８０８…雑音ベクトルゲイン符号帳８１０…加算器８１１…ＬＰＣ係数復号部８１２…合成フィルタ８１３…ポストフィルタ８１４…音声信号出力端子11 ... Voice signal input terminal 12 ... Frame configuration section 13 ... Pitch cycle analysis section 14 ... Pitch cycle information output terminal 21 ... Voice signal input terminal 22 ... Frame / subframe configuration section 23 ... Pitch cycle analysis section 24 ... Subframe pitch cycle Extraction unit 25 ... Subframe pitch period information output terminals 201, 301, 401, 501, 601 ... Audio signal input terminals 202, 302, 402, 502, 602 ... Frame
Subframe configuration units 203, 303, 403, 503, 603 ... Pitch period analysis units 204, 304, 404, 504, 604 ... Subframe pitch period prediction value calculation units 205, 305, 405, 530 ... Search range determination unit 206, 306, 406, 506, 528, 606 ... Adaptive codebook 207, 307, 407, 507, 607 ... Auditory weighted synthesis filter 208, 308, 408, 508, 608 ... Auditory weighted filter 209, 309, 409, 509, 609 ... Subtractors 210, 310, 410, 510, 610 ... Distortion calculation units 211, 311, 411, 511, 611 ... Relative pitch cycle information output terminals 212, 312, 412, 512, 612 when distortion is minimum ... Pitch cycle information output Terminals 313, 525 ... Pitch cycle analysis Position determination unit 314 ... Pitch cycle analysis Position information output terminals 413, 526 ... Continuity determination sections 414, 527 ... Full search section 415, 531 ... Switch 416 ... Continuity determination result output terminals 505, 605 ... Relative pitch pattern codebooks 513, 536, 613, 619 ... Addition 521 ... LPC coefficient analysis section 522 ... LPC coefficient quantization section 523 ... Multiplexer 524 ... Coded data output terminals 529, 614 ... Noise codebook 532, 534, 615, 617 ... Multiplier 533 ... Adaptive vector codebook 535 ... Adaptation Vector gain codebook 801 ... Encoded data input terminal 802 ... Demultiplexer 803 ... Subframe pitch period generation unit 804 ... Adaptive codebook 805 ... Adaptive vector gain codebook 806, 809 ... Multiplier 807 ... Noise codebook 808 ... Noise vector Gain codebook 810 ... Adder 811 ... LPC coefficient recovery Part 812 ... synthesis filter 813 ... postfilter 814 ... audio signal output terminal

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開平９−34499（ＪＰ，Ａ) 特開平８−202398（ＪＰ，Ａ) 特開平10−97296（ＪＰ，Ａ) 特開平８−328588（ＪＰ，Ａ) 特開平５−73098（ＪＰ，Ａ) 特開平４−30200（ＪＰ，Ａ) 特表平９−512644（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 19/04 G10L 19/12 ─────────────────────────────────────────────────── ─── Continuation of the front page (56) Reference JP-A-9-34499 (JP, A) JP-A-8-202398 (JP, A) JP-A-10-97296 (JP, A) JP-A-8- 328588 (JP, A) JP 5-73098 (JP, A) JP 4-30200 (JP, A) Special table 9-512644 (JP, A) (58) Fields investigated (Int.Cl. ⁷ , DB name) G10L 19/04 G10L 19/12

Claims

(57) [Claims]

1. A voice encoding method for performing an encoding process, comprising dividing an input voice signal into frames of a predetermined length and encoding a pitch period of the input voice signal, wherein the frame is divided into the frames. A speech coding method, characterized in that, at the time of coding the pitch cycle of the current frame of the input speech signal, the pitch cycle of the future frame is obtained with respect to the current frame, and the pitch cycle of the future frame is coded. .

2. A voice encoding method for performing an encoding process, which comprises dividing an input voice signal into frames of a predetermined length and encoding a pitch period of the input voice signal. A voice code, characterized in that, at the time of encoding the pitch period of the current frame of the input speech signal, the pitch period of the frame of the look-ahead part is obtained for the current frame, and the pitch period of the frame of the look-ahead part is encoded. Method.

3. The pitch period of the future frame is adaptively determined based on the magnitude of the short-term power of the input speech signal, the prediction residual signal, or the prediction difference signal after passing through the low pass filter. The speech coding method according to claim 1, wherein the speech coding method is obtained at the analyzed position.

4. The pitch period of the frame of the prefetching unit is:
The analysis position adaptively determined based on the magnitude of the short-term power of the input voice signal, the prediction difference signal, or the prediction difference signal after passing through the low-pass filter. 2. The voice encoding method according to 2.

5. A speech coder that divides an input speech signal into frames of a predetermined length and performs a coding process including a process of coding a pitch period of the input speech signal, wherein the speech coding device is divided into the frames. Means for obtaining a pitch period of a future frame with respect to the present frame when encoding the pitch period of the present frame of the input speech signal, and means for encoding the pitch period of the future frame obtained by this means A speech coding apparatus comprising:

6. A speech coder that divides an input speech signal into frames of a predetermined length and performs a coding process including a process of coding a pitch period of the input speech signal, wherein the speech coding device is divided into the frames. When the pitch period of the current frame of the input audio signal is encoded, a means for obtaining the pitch period of the frame of the look-ahead portion for the current frame and the pitch period of the frame of the look-ahead portion obtained by this means are encoded. A speech coding apparatus having means.

7. A computer readable program recorded with a program for performing a voice coding process including a process of dividing an input voice signal into frames of a predetermined length and coding a pitch period of the input voice signal. A storage medium, wherein when the pitch period of the current frame of the input audio signal that has been divided into the frames is encoded, a process of obtaining the pitch period of the future frame for the frame and the pitch period of the future frame are encoded. A storage medium in which a program for executing the processing to be converted is recorded.

8. A computer readable program recording a program for performing a voice coding process, which comprises a process of dividing an input voice signal into frames of a predetermined length and coding a pitch period of the input voice signal. A storage medium, wherein, when encoding the pitch period of the current frame of the frame-divided input audio signal, the process of obtaining the pitch period of the frame of the look-ahead unit for the frame, and the pitch period of the frame of the look-ahead unit A storage medium recording a program for executing the process of encoding.