JP3138574B2

JP3138574B2 - Linear prediction coefficient interpolator

Info

Publication number: JP3138574B2
Application number: JP05217373A
Authority: JP
Inventors: 修一河間; 吉伸木村
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1993-09-01
Filing date: 1993-09-01
Publication date: 2001-02-26
Anticipated expiration: 2016-02-26
Also published as: JPH0774642A

Abstract

PURPOSE:To provide a linear predictive coefficient interpolating device which can extremely reduce its calculation quantity even in a digital signal processor. CONSTITUTION:A linear predictive coefficient interpolating device is provided with a hyperbolic conversion part 61 which applies the hyperbolic conversion to a partial autocorrelation coefficient of a prescribed degree acquired from an input voice signal of a fixed time length, a linear interpolation part 62 which is connected to the part 61 and applies the linear interpolation to the hyperbolic conversion result of the part 61, and a hyperbolic reverse conversion part 63 which connected to the part 62 and applies the reverse conversion to the linear interpolation result of the part 62.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、線形予測係数を使用す
る音声情報処理システムに関し、特に音声情報処理シス
テムに用いることができる線形予測係数補間装置に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech information processing system using linear prediction coefficients, and more particularly to a linear prediction coefficient interpolation device which can be used in a speech information processing system.

【０００２】[0002]

【従来の技術】従来の技術では、入力された音声信号を
線形予測分析し、線形予測係数によりスペクトル情報を
得て、得られたスペクトル情報を伝送することにより実
際の音声信号を伝送するよりも低ビットレートで音声の
通信を行う方式が実現されている。2. Description of the Related Art In a conventional technique, an input speech signal is subjected to linear prediction analysis, spectrum information is obtained from linear prediction coefficients, and the obtained spectrum information is transmitted. A system for performing voice communication at a low bit rate has been realized.

【０００３】上記の線形予測分析を使用して、低ビット
レートの伝送を行う為の音声符号化復号化方法として
は、ＣＥＬＰ（ＣｏｄｅＥｘｃｉｔｅｄＬｉｎｅａ
ｒＰｒｅｄｉｃｔｉｏｎ）方式が知られている。[0003] As a speech encoding / decoding method for transmitting at a low bit rate using the above-described linear prediction analysis, CELP (Code Excited Linea) is used.
r Prediction) method is known.

【０００４】ＣＥＬＰ方式は、音声の生成をモデル化し
たもので、声門で生じる気流に相当する信号の候補をコ
ードブックに持っており、この中の一つの信号に声帯の
開閉の周期に相当するピッチを付加するピッチ予測（ま
たは長期予測）フィルタ、口腔での調音に相当する（短
期）予測フィルタを通すことによって合成音声を生成す
る。このとき気流に相当する信号の候補からなるコード
ブックの中から最適な信号をＡｂＳ（Ａｎａｌｙｓｉｓ
ｂｙＳｙｎｔｈｅｓｉｓ）法、即ち合成による分析
法で求め、この信号の番号、利得、ピッチ予測フィルタ
の係数、ラグ（ピッチ長に相当）、線形予測フィルタの
係数を量子化及び符号化している。[0004] The CELP system is a model of voice generation, in which codebooks have candidates for signals corresponding to air currents generated in the glottis, and one of these signals corresponds to a period of opening and closing a vocal cord. The synthesized speech is generated by passing through a pitch prediction (or long-term prediction) filter for adding a pitch and a (short-term) prediction filter corresponding to articulation in the oral cavity. At this time, an optimal signal is selected from AbS (Analysis) from a code book including signal candidates corresponding to the airflow.
The signal number, gain, pitch prediction filter coefficient, lag (corresponding to pitch length), and linear prediction filter coefficient are quantized and coded by a synthesis analysis method, that is, a signal of this signal.

【０００５】ＣＥＬＰ方式では、入力音声信号を約２０
〜４０ｍｓのフレームに分割し、各フレームを４〜５の
サブフレームに分割して処理を行う。また、ＣＥＬＰ方
式ではフレーム単位に線形予測係数を求め、それらを補
間することによってサブフレーム単位の線形予測係数を
求める。本来、線形予測係数は、サブフレーム毎に求め
る方が合成音質の良い符号化を行える。しかし、計算量
が多くなり、伝送する符号の量が増加する為、一般的に
上記方式が用いられている。In the CELP system, an input audio signal is reduced to about 20
The frame is divided into ４０40 ms frames, and each frame is divided into 4 to 5 subframes for processing. In the CELP method, a linear prediction coefficient is obtained for each frame, and a linear prediction coefficient is obtained for each subframe by interpolating them. Essentially, the linear prediction coefficient can be obtained for each subframe, so that encoding with good synthesized sound quality can be performed. However, since the amount of calculation increases and the amount of codes to be transmitted increases, the above method is generally used.

【０００６】従来、ＣＥＬＰ方式では、線形予測係数の
補間方法としてＬＡＲ（ＬｏｇＡｒｅａＲａｔｉ
ｏ：パーコール（ＰＡＲＣＯＲ）係数の対数変換形（対
数断面積比））による補間を用いている。即ち、線形予
測係数からＰＡＲＣＯＲ係数に変換し、ＰＡＲＣＯＲ係
数から下記（１）式によりＬＡＲを求める。Conventionally, in the CELP system, LAR (Log Area Ratio) is used as a method of interpolating linear prediction coefficients.
o: Interpolation using a logarithmic conversion type (logarithmic cross-sectional ratio) of a PARCOR coefficient is used. That is, the linear prediction coefficient is converted into the PARCOR coefficient, and the LAR is obtained from the PARCOR coefficient by the following equation (1).

【０００７】[0007]

【数１】 (Equation 1)

【０００８】そしてＬＡＲの上で線形補間を行い、補間
結果を（１）式の逆変換によりＰＡＲＣＯＲ係数に変換
してサブフレーム単位の線形予測係数を求める。[0008] Then, linear interpolation is performed on the LAR, and the interpolation result is converted into a PARCOR coefficient by the inverse conversion of equation (1) to obtain a linear prediction coefficient in subframe units.

【０００９】ここでＬＡＲについて説明する。Here, the LAR will be described.

【００１０】ＰＡＲＣＯＲ係数は、相関値を表し、その
値は−１〜＋１を取るが、ＰＡＲＣＯＲ係数の次数によ
りその分布に偏りが存在する為、偏りの大きい２次まで
のＰＡＲＣＯＲ係数に関して非線形変換を施して補間を
行う。ここで、（１）式で示すｇｍの２倍、即ち２ｇｍ
をＬＡＲと呼ぶ。The PARCOR coefficient represents a correlation value, and its value ranges from -1 to +1. However, since there is a bias in the distribution due to the order of the PARCOR coefficient, nonlinear transformation is performed on the PARCOR coefficient up to the second order having a large bias. To perform interpolation. Here, twice the gm shown in the equation (1), that is, 2 gm
Is called LAR.

【００１１】図５は、ＰＡＲＣＯＲ係数からＬＡＲの変
換結果を示す。図５に示すように、ＬＡＲは、±１付近
のＰＡＲＣＯＲ係数に関しては敏感な変換特性を持つの
で、このＬＡＲ変換を予め施すことにより補間誤差は減
少する。また、中高次のＰＡＲＣＯＲ係数は偏在が少な
く０付近に分散している。ＬＡＲは０付近のＰＡＲＣＯ
Ｒ係数に関しては線形に近い変換を施す為、中高次のＰ
ＡＲＣＯＲ係数の特性にも合致する。FIG. 5 shows a conversion result of PAR to LAR from a PARCOR coefficient. As shown in FIG. 5, since the LAR has a sensitive conversion characteristic with respect to the PARCOR coefficient near ± 1, the interpolation error is reduced by performing the LAR conversion in advance. Further, the middle and high order PARCOR coefficients are less ubiquitous and are dispersed around zero. LAR is PARCO near 0
As for the R coefficient, a nearly linear conversion is performed,
It also matches the characteristics of the ARCOR coefficient.

【００１２】[0012]

【発明が解決しようとする課題】しかしながら、上述し
た従来の技術では、現在、線形予測係数の補間において
ＣＥＬＰで一般的に使用されているＬＡＲは、補間誤差
は少ないが対数を使用する必要があるため、ディジタル
・シグナル・プロセッサ（ＤＳＰ）上で実現した場合に
計算量が多くなるという問題点があった。However, in the above-mentioned conventional technology, the LAR generally used in CELP in the interpolation of the linear prediction coefficient at present has a small interpolation error but needs to use a logarithm. For this reason, there is a problem in that the amount of calculation increases when implemented on a digital signal processor (DSP).

【００１３】本発明の目的は、上記従来の技術における
問題点に鑑み、ＤＳＰ上で実現しても計算量を大幅に減
少することができる線形予測係数補間装置を提供するこ
とにある。SUMMARY OF THE INVENTION An object of the present invention is to provide a linear prediction coefficient interpolation apparatus which can greatly reduce the amount of calculation even when implemented on a DSP in view of the above-mentioned problems in the conventional technology.

【００１４】[0014]

【課題を解決するための手段】本発明の目的は、一定時
間長の入力音声信号から得られた所定次数のＰＡＲＣＯ
Ｒ係数を双曲線変換する変換手段と、変換手段に接続さ
れており双曲線変換された変換結果を線形補間する補間
手段と、補間手段に接続されており線形補間された線形
補間結果を逆変換する逆変換手段とを備える線形予測係
数補間装置によって達成される。SUMMARY OF THE INVENTION An object of the present invention is to provide a PARCO of a predetermined order obtained from an input audio signal of a fixed time length.
Conversion means for hyperbolically converting the R coefficient, interpolation means connected to the conversion means for linearly interpolating the result of the hyperbolic conversion, and inverse means connected to the interpolation means for inversely converting the linearly interpolated linear interpolation result This is achieved by a linear prediction coefficient interpolation device including a conversion unit.

【００１５】本発明の線形予測係数補間装置では、双曲
線変換手段は、所定次数のＰＡＲＣＯＲ係数が±１の値
に近付く程、急傾斜を持つように双曲線変換を行うよう
に構成してもよい。In the linear predictive coefficient interpolating apparatus according to the present invention, the hyperbolic transform means may be configured to perform the hyperbolic transform such that the closer the PARCOR coefficient of a predetermined order approaches the value of ± 1, the steeper the slope.

【００１６】[0016]

【作用】本発明の線形予測係数補間装置では、変換手段
は一定時間長の入力音声信号から得られた所定次数のＰ
ＡＲＣＯＲ係数を双曲線変換し、補間手段は双曲線変換
された変換結果を線形補間し、逆変換手段は線形補間さ
れた線形補間結果を逆変換する。In the linear predictive coefficient interpolating apparatus according to the present invention, the converting means includes a predetermined-order P of a predetermined order obtained from an input audio signal having a fixed time length.
The ARCOR coefficient is hyperbolically transformed, the interpolation means linearly interpolates the hyperbolically transformed result, and the inverse transformation means inversely transforms the linearly interpolated linear interpolation result.

【００１７】本発明の線形予測係数補間装置では、双曲
線変換手段は、所定次数のＰＡＲＣＯＲ係数が±１の値
に近付く程、急傾斜を持つように双曲線変換する。In the linear predictive coefficient interpolating apparatus according to the present invention, the hyperbolic conversion means performs hyperbolic conversion so as to have a steep slope as the predetermined order PARCOR coefficient approaches a value of ± 1.

【００１８】[0018]

【実施例】以下、図面を参照して本発明の線形予測係数
補間装置の実施例を説明する。BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a block diagram showing an embodiment of a linear prediction coefficient interpolation apparatus according to the present invention.

【００１９】図１は、本発明の線形予測係数補間装置で
ある予測係数補間部の一実施例を示すブロック図であ
る。FIG. 1 is a block diagram showing an embodiment of a prediction coefficient interpolation section which is a linear prediction coefficient interpolation apparatus according to the present invention.

【００２０】図２は、図１の主要部を備えたＣＥＬＰ方
式の符号化復号化装置の一実施例の構成を示すブロック
図である。FIG. 2 is a block diagram showing the configuration of an embodiment of a CELP encoding / decoding apparatus having the main parts of FIG.

【００２１】ここで、図１の説明を行う前に、図２の符
号化復号化装置を参照して、図１の予測係数補間部の役
割を示す。Before explaining FIG. 1, the role of the prediction coefficient interpolation unit in FIG. 1 will be described with reference to the encoding / decoding apparatus in FIG.

【００２２】まず、図２の符号化部を説明する。First, the encoding unit shown in FIG. 2 will be described.

【００２３】図２の符号化部では、線形予測分析（短期
予測）部１において、サンプリング周波数ｆｓでサンプ
リングされたディジタル入力信号ｓ（ｔ）の線形予測分
析があるフレーム周期で行われ、量子化されたフレーム
単位のＰＡＲＣＯＲ係数が求められる。ここで、ｔはサ
ンプル時点を示す。In the encoding unit shown in FIG. 2, the linear prediction analysis (short-term prediction) unit 1 performs linear prediction analysis of the digital input signal s (t) sampled at the sampling frequency fs at a certain frame period, and performs quantization. The obtained PARCOR coefficient is determined for each frame. Here, t indicates the sampling time.

【００２４】その後、予測係数補間部６（図１参照）中
の、変換手段である双曲線変換部６１において、低次の
ＰＡＲＣＯＲ係数の偏在を緩和する変換を行い、それら
の変換結果を補間手段である線形補間部６２にて線形補
間を行う。その後、逆変換手段である双曲線逆変換部６
３にて補間結果の逆変換を行い、サブフレーム単位の補
間ＰＡＲＣＯＲ係数を求める。Thereafter, in the prediction coefficient interpolator 6 (see FIG. 1), a hyperbolic converter 61, which is a conversion means, performs a conversion for alleviating the uneven distribution of low-order PARCOR coefficients, and converts the conversion results by the interpolation means. A certain linear interpolation unit 62 performs linear interpolation. Thereafter, a hyperbolic inverse transform unit 6 as an inverse transform means
In step 3, the inverse conversion of the interpolation result is performed, and an interpolation PARCOR coefficient for each subframe is obtained.

【００２５】聴覚重み付けフィルタ２は、式（２）で示
す伝達関数Ｗ（ｚ）を持つ。The auditory weighting filter 2 has a transfer function W (z) represented by equation (2).

【００２６】[0026]

【数２】 (Equation 2)

【００２７】ここで、α^(m) _i （０≦ｍ＜Ｍ，０＜ｉ≦
Ｐ）は第ｍサブフレームのｉ次の補間後の線形予測係数
を表し、Ｍは１フレームに含まれるサブフレーム数を表
す。Here, α ^(m) _i (0 ≦ m <M, 0 <i ≦
P) represents the ith-order interpolated linear prediction coefficient of the m-th subframe, and M represents the number of subframes included in one frame.

【００２８】つまり、聴覚重み付けフィルタ２は、式
（３）に基づいて入力信号の残差信号を得る逆フィルタ
２０１と、That is, the auditory weighting filter 2 comprises: an inverse filter 201 for obtaining a residual signal of the input signal based on the equation (3);

【００２９】[0029]

【数３】 (Equation 3)

【００３０】式（４）に基づいて重み付け線形予測フィ
ルタ２０２と、A weighted linear prediction filter 202 based on equation (4)

【００３１】[0031]

【数４】 (Equation 4)

【００３２】からなり、入力信号ｓ（ｔ）のスペクトル
の谷の部分を強調させた信号ｕ（ｔ）を決める。The signal u (t) is formed by emphasizing the valley portion of the spectrum of the input signal s (t).

【００３３】ここで、λはスペクトルの谷の部分をどれ
だけ強調するかを決定するパラメータであり、０に近い
ほど谷の部分が強調される。この信号ｕ（ｔ）にできる
だけ似た信号（聴覚重み付けされた合成信号）ｕ′
（ｔ）を合成するように符号化が行われる。この処理は
サブフレーム単位で行われる。１フレームあたりのサン
プル数をＦとする。Here, λ is a parameter for determining how much the valley portion of the spectrum is emphasized, and the valley portion is emphasized as it approaches 0. A signal as similar as possible to this signal u (t) (a composite signal weighted by hearing) u ′
Encoding is performed to synthesize (t). This processing is performed in subframe units. The number of samples per frame is F.

【００３４】この聴覚重み付けフィルタ２により、後に
説明する合成信号ｓ′（ｔ）はパワーの小さいスペクト
ルほど入力信号ｓ（ｔ）との誤差が小さくなり、聴覚の
マスキング特性によりマスクされ難いこれらのスペクト
ル成分の雑音を小さくすることができるので、聴覚的に
合成音質は良くなる。With the auditory weighting filter 2, the error of the synthesized signal s' (t), which will be described later, from the input signal s (t) decreases as the power of the spectrum decreases, and these spectra are hard to be masked by the masking characteristics of the auditory sense. Since the noise of the component can be reduced, the synthesized sound quality is improved audibly.

【００３５】コードブック３には、励起信号ベクトルが
Ｎ個（ここで、Ｎは正の整数）入っており、この中の一
つの励起信号ベクトルｃｊ（ｔ）（０≦ｊ＜Ｎ）は、乗
算器４によりγ倍され、ピッチ予測フィルタ５でピッチ
成分が付け加えられ、重み付け合成フィルタ７（伝達関
数はフィルタ２０２と同じ）を通ることにより、聴覚重
み付けされた合成信号ｕ′ｊ（ｔ）が得られる。ここ
で、最適な励起信号ベクトルｃｊ（ｔ）を選択する為に
ＡｂＳ（ＡｎａｌｙｓｉｓｂｙＳｙｎｔｈｅｓｉ
ｓ）法を用いる。The codebook 3 contains N excitation signal vectors (where N is a positive integer), and one of the excitation signal vectors cj (t) (0 ≦ j <N) is Multiplied by γ by the multiplier 4, the pitch component is added by the pitch prediction filter 5 and passed through the weighting synthesis filter 7 (the transfer function is the same as that of the filter 202), so that the perceptually weighted synthesized signal u′j (t) is obtained. can get. Here, in order to select the optimal excitation signal vector cj (t), AbS (Analysis by Synthesis) is used.
s) Method is used.

【００３６】図２においてＡｂＳ法を説明する。コード
ブック３中の励起信号ベクトルｃｊ（ｔ）が乗算器４で
利得γによる乗算が行われ、ピッチ予測フィルタ５でピ
ッチ成分が付加される。その後、重み付け合成フィルタ
７により合成音声信号ｕ′ｊ（ｔ）が生成される。合成
音声信号とターゲットの入力音声信号ｕ（ｔ）の差分が
減算器８で計算され、誤差信号ｅｊ（ｔ）が生成され
る。この誤差信号ｅｊ（ｔ）のパワーＰｊがパワー計算
部９によって式（５）に基づいて計算される。The AbS method will be described with reference to FIG. The multiplier 4 multiplies the excitation signal vector cj (t) in the codebook 3 by the gain γ, and the pitch prediction filter 5 adds a pitch component. After that, the weighted synthesis filter 7 generates a synthesized speech signal u′j (t). The difference between the synthesized voice signal and the input voice signal u (t) of the target is calculated by the subtracter 8, and an error signal ej (t) is generated. The power Pj of the error signal ej (t) is calculated by the power calculator 9 based on equation (5).

【００３７】[0037]

【数５】 (Equation 5)

【００３８】そして、このパワーＰｊが最小となるコー
ドブック３の励起信号ベクトルｃｊ、γ、ピッチ予測フ
ィルタの係数をエラー最小化部１０で検索する。Then, the error minimizing unit 10 searches for the excitation signal vectors cj, γ of the codebook 3 and the coefficients of the pitch prediction filter in which the power Pj is minimized.

【００３９】線形予測係数、励起信号のインデックス
ｊ、利得γ、ピッチ予測フィルタの係数、ピッチ長が符
号化・マルチプレクサ部１１で符号化、マルチプレクサ
化されて伝送路１２に送られる。この伝送路１２とし
て、無線系、有線系、蓄積系がある。The linear prediction coefficient, the index j of the excitation signal, the gain γ, the coefficient of the pitch prediction filter, and the pitch length are encoded and multiplexed by the encoding / multiplexing section 11 and sent to the transmission line 12. The transmission path 12 includes a wireless system, a wired system, and a storage system.

【００４０】次に、図２の復号化部を説明する。Next, the decoding unit shown in FIG. 2 will be described.

【００４１】復号化部では、デマルチプレクサ・復号化
部１３において、伝送された符号列がデマルチプレク
ス、復号化され、線形予測係数、励起信号のインデック
スｊ、利得γ、ラグ（ピッチ長）、ピッチ予測フィルタ
の係数が得られる。符号化部のコードブック３と同じ励
起信号ベクトルを持つコードブック１４の中のｊで示さ
れる励起信号ベクトルｃｊ（ｔ）が乗算器１５によりγ
倍され、符号化部のピッチ予測フィルタ５と同じ構造の
ピッチ予測フィルタ１６でピッチ成分が付加され、更
に、式（６）で示される伝達関数Ｆ（ｚ）In the decoding unit, the transmitted code string is demultiplexed and decoded in the demultiplexer / decoding unit 13, and the linear prediction coefficient, the index j of the excitation signal, the gain γ, the lag (pitch length), The pitch prediction filter coefficients are obtained. An excitation signal vector cj (t) indicated by j in a codebook 14 having the same excitation signal vector as the codebook 3 of the encoding unit is converted by the multiplier 15 into γ.
The pitch component is added by a pitch prediction filter 16 having the same structure as that of the pitch prediction filter 5 of the encoding unit, and further, a transfer function F (z) represented by Expression (6) is added.

【００４２】[0042]

【数６】 (Equation 6)

【００４３】を持つ線形予測フィルタ（合成フィルタ）
１７を通ることにより、合成信号ｓ′（ｔ）が得られ
る。式（６）で用いられる線形予測係数α^(m) _i は、伝
送されたフレーム毎の線形予測係数を予測係数補間部１
８において各サブフレーム毎に補間したものである。Linear prediction filter (synthesis filter) having
17, the synthesized signal s' (t) is obtained. The linear prediction coefficient α ^(m) _i used in the equation (6) is obtained by calculating the linear prediction coefficient for each transmitted frame by the prediction coefficient interpolation unit 1
In FIG. 8, interpolation is performed for each subframe.

【００４４】予測係数補間部１８は予測係数補間部６と
同様の構造をしており、双曲線変換を施した後、線形補
間を行い、その後、双曲線逆変換を行いサブフレーム毎
のＰＡＲＣＯＲ係数を求める。The prediction coefficient interpolation unit 18 has the same structure as that of the prediction coefficient interpolation unit 6, and performs a hyperbolic transformation, performs a linear interpolation, and then performs a hyperbolic inverse transformation to obtain a PARCOR coefficient for each subframe. .

【００４５】次に、図１の予測係数補間部６を詳細に説
明する。Next, the prediction coefficient interpolation unit 6 of FIG. 1 will be described in detail.

【００４６】図２の短期予測部１では１フレームの音声
区間毎にＰ次（Ｐは正の整数）のＰＡＲＣＯＲ係数を求
める。その後、ＰＡＲＣＯＲ係数ｋｊ（ｊ＝１〜Ｐ）を
量子化する。以降、この量子化したＰＡＲＣＯＲ係数を
使用する。The short-term prediction unit 1 in FIG. 2 obtains a P-order (P is a positive integer) PARCOR coefficient for each voice section of one frame. Thereafter, the PARCOR coefficient kj (j = 1 to P) is quantized. Hereinafter, this quantized PARCOR coefficient is used.

【００４７】予測係数補間部６では直前のフレームで求
めたＰＡＲＣＯＲ係数と現在のフレームで求めたＰＡＲ
ＣＯＲ係数との線形補間を行なうことによりＭ個（Ｍは
正の整数）のサブフレーム単位のＰＡＲＣＯＲ係数を求
める。この線形補間ＰＡＲＣＯＲ係数は、低次のＰＡＲ
ＣＯＲ係数については双曲線変換部６１で変換し、その
結果を線形補間部６２で補間し、その後、双曲線逆変換
部６３で逆変換することにより得る。また、中・高次の
ＰＡＲＣＯＲ係数については線形補間部６２で線形補間
のみ行う。The prediction coefficient interpolating unit 6 calculates the PAROR coefficient obtained in the immediately preceding frame and the PAR coefficient obtained in the current frame.
By performing linear interpolation with COR coefficients, M (M is a positive integer) PARCOR coefficients in subframe units are obtained. This linearly interpolated PARCO coefficient is calculated using the low-order PAR
The COR coefficient is converted by the hyperbolic conversion unit 61, the result is interpolated by the linear interpolation unit 62, and then inversely converted by the hyperbolic inverse conversion unit 63. In addition, only the linear interpolation is performed by the linear interpolation unit 62 for the middle and higher order PARCOR coefficients.

【００４８】上記方法により決定されたＰＡＲＣＯＲ係
数を線形予測係数αｉ（ｉ＝０〜Ｐ）に変換し、各々の
サブフレームで求めた線形予測係数を使用する。但し、
１次のＰＡＲＣＯＲ係数ｋ１の補間は、各フレームのｋ
１を下記の式（７）で変換した後に線形補間し、式
（８）により逆変換する。The PARCOR coefficient determined by the above method is converted into a linear prediction coefficient αi (i = 0 to P), and the linear prediction coefficient obtained in each subframe is used. However,
Interpolation of the first-order PARCOR coefficient k1 is performed by calculating k for each frame.
1 is converted by the following equation (7), linearly interpolated, and inversely converted by the equation (8).

【００４９】[0049]

【数７】 (Equation 7)

【００５０】[0050]

【数８】 (Equation 8)

【００５１】上記式（７）及び式（８）により、＋１に
偏在するｋ１は図３に示されるように分配される。この
変換式において、定数ａの値が大きくなる程±１付近で
の傾斜が大きくなる。２次のＰＡＲＣＯＲ係数も同様に
変換、補間可能である。また、±１に非常に近い値のＰ
ＡＲＣＯＲ係数をＬＡＲで表現すると非常に大きな値と
なり、大きなダイナミックレンジが必要となるが、この
変換式によればそのようなＰＡＲＣＯＲ係数を変換して
小さなダイナミックレンジで表現可能である。According to the above equations (7) and (8), k1 unevenly distributed to +1 is distributed as shown in FIG. In this conversion formula, as the value of the constant a increases, the inclination around ± 1 increases. The secondary PARCOR coefficient can be similarly converted and interpolated. In addition, a P value very close to ± 1
When the ARCOR coefficient is expressed by the LAR, the value becomes a very large value, and a large dynamic range is required. According to this conversion formula, such a PARCOR coefficient can be converted and expressed with a small dynamic range.

【００５２】次に、図４のフローチャートを参照して、
本実施例によるＰＡＲＣＯＲ係数補間処理の動作を説明
する。Next, referring to the flowchart of FIG.
The operation of the PARCOR coefficient interpolation processing according to the present embodiment will be described.

【００５３】まず、ｎ番目のフレームのＰＡＲＣＯＲ係
数ｋｉ（次数ｉ＝１〜Ｐ）が入力処理で予測係数補間部
６（図１）に入力され（４０１）、ＰＡＲＣＯＲ係数の
次数を表す変換ｉを１に初期化する（４０２）。ＰＡＲ
ＣＯＲ係数の次数が判定され（４０３）、低次のＰＡＲ
ＣＯＲ係数は式（７）によるｙｉ（ｎ）への変換が行わ
れ（４０４）、中高次のＰＡＲＣＯＲ係数はｙｉ（ｎ）
への無変換代入が行われる（４０５）。そして、次数を
表す変数ｉをインクリメントする（４０６）。全ての次
数のＰＡＲＣＯＲ係数におけるｙｉ（ｎ）を求める。そ
の処理の終了を判定する（４０７）。その後、ｙｉ
（ｎ）は前フレームのｙｉ（ｎ−１）と補間されてｙ
ｉ′（ｎ）が求まる（４０８）。その補間式を式（９）
に示す。First, the PARCOR coefficient ki (order i = 1 to P) of the n-th frame is input to the prediction coefficient interpolator 6 (FIG. 1) in the input processing (401), and the transform i representing the order of the PARCOR coefficient is calculated. Initialized to 1 (402). PAR
The order of the COR coefficient is determined (403) and the lower order PAR is determined.
The COR coefficient is converted to yi (n) according to equation (7) (404), and the middle-high order PARCOR coefficient is yi (n).
Is performed (405). Then, the variable i representing the order is incremented (406). Find yi (n) in the PARCOR coefficients of all orders. The end of the process is determined (407). Then yi
(N) is interpolated with yi (n-1) of the previous frame to obtain y
i '(n) is obtained (408). Equation (9)
Shown in

【００５４】[0054]

【数９】 (Equation 9)

【００５５】再び図４に戻って、前フレームのＰＡＲＣ
ＯＲ係数ｙｉ（ｎ−１）を現在のフレームのＰＡＲＣＯ
Ｒ係数ｙｉ（ｎ）により更新して（４０９）、上記ステ
ップ４０２と同様に、次数を表す変数ｉに初期値１を代
入する（４１０）。Referring back to FIG. 4, the PARC of the previous frame
The OR coefficient yi (n-1) is calculated using the PARCO of the current frame.
It is updated with the R coefficient yi (n) (409), and the initial value 1 is substituted for the variable i representing the degree (410), as in the above step 402.

【００５６】その処理後、低次のｙｉ′（ｎ）は双曲線
変換（式（８））されてＰＡＲＣＯＲ係数へと変換され
る（４１１、４１２）。また、中高次のｙｉ′（ｎ）は
そのままＰＡＲＣＯＲ係数に代入される（４１３）。こ
れらの変換処理は全ての次数のｙｉ′（ｎ）についてな
され、その後、次数を表す変数ｉをインクリメントして
（４１４）、その処理終了を判定する（４１５）。これ
により求まったＰＡＲＣＯＲ係数がサブフレーム単位の
処理に適用される（４１６）。After the processing, the low-order yi '(n) is subjected to hyperbolic transformation (Equation (8)) and transformed into PARCOR coefficients (411, 412). The middle and high order yi '(n) is directly substituted for the PARCOR coefficient (413). These conversion processes are performed for all the orders yi '(n), and thereafter, the variable i representing the order is incremented (414), and the end of the process is determined (415). The PARCOR coefficient thus obtained is applied to the processing on a subframe basis (416).

【００５７】図４のフローチャートにおいて、ステップ
４０２から４０７までが図１の双曲線変換部６１、ステ
ップ４０８が図１の線形補間部６２、ステップ４０９か
ら４１５までが図１の双曲線逆変換部６３で処理され
る。In the flowchart of FIG. 4, steps 402 to 407 are processed by the hyperbolic converter 61 of FIG. 1, steps 408 are processed by the linear interpolation unit 62 of FIG. 1, and steps 409 to 415 are processed by the inverse hyperbolic converter 63 of FIG. Is done.

【００５８】双曲線変換ステップ４０４及び双曲線逆変
換ステップ４１２の変換式として、本実施例では式
（７）及び式（８）に示す双曲線関数を用いたが、図３
に示すような±１付近に偏在するＰＡＲＣＯＲ係数を分
配変換する特性を持つ関数ならば、補間結果は改善され
るので、このような特性を持つ高次の曲線を用いても良
いが実現の際の計算量の観点からは双曲線関数が適当で
ある。なぜなら、双曲線関数であれば、その逆変換も簡
易であり計算量の点からも適していると判断できる。In this embodiment, the hyperbolic functions shown in equations (7) and (8) are used as the conversion equations in the hyperbolic transformation step 404 and the hyperbolic inverse transformation step 412.
If the function has a characteristic of distributing and transforming the PARCOR coefficient unevenly distributed near ± 1 as shown in (1), the interpolation result is improved, and a higher-order curve having such a characteristic may be used. A hyperbolic function is appropriate from the viewpoint of the computational complexity of. The reason is that if the hyperbolic function is used, its inverse transformation is simple and it can be determined that it is suitable from the viewpoint of the amount of calculation.

【００５９】上述したように、本発明では、線形予測係
数を補間する前にその分布を考慮した線形変換を施すこ
とによりＬＡＲに変換することなく良い補間特性を示
し、またＬＡＲに変換しないため計算量を大幅に減ずる
ことができる。即ち、補間誤差の影響削減の効果とし
て、実施例において説明したように、±１付近に偏在す
る提示ＰＡＲＣＯＲ係数の補間による誤差を、その分布
を考慮した変換を行うことにより削減する。また、計算
量削減の効果として、本方式による線形予測係数の補間
ではＬＡＲの計算に必要である対数の計算が不必要とな
り、この方式を実際に計算機上で実現する際に計算量を
減少できる。As described above, according to the present invention, a linear interpolation taking into account the distribution of the linear prediction coefficients before interpolation is performed, thereby exhibiting good interpolation characteristics without conversion to LAR. The amount can be greatly reduced. That is, as an effect of reducing the influence of the interpolation error, as described in the embodiment, the error due to the interpolation of the presented PARCOR coefficient unevenly distributed around ± 1 is reduced by performing the conversion in consideration of the distribution. Further, as an effect of reducing the amount of calculation, in the interpolation of the linear prediction coefficient according to the present method, the calculation of the logarithm required for the calculation of the LAR becomes unnecessary. .

【００６０】[0060]

【発明の効果】本発明の線形予測係数補間装置は、一定
時間長の入力音声信号から得られた所定次数のＰＡＲＣ
ＯＲ係数を双曲線変換する変換手段と、変換手段に接続
されており双曲線変換された変換結果を線形補間する補
間手段と、補間手段に接続されており線形補間された線
形補間結果を逆変換する逆変換手段とを備えるので、Ｌ
ＡＲの計算に必要である対数の計算が不必要となり、こ
の方式を実際に計算機上で実現する際に計算量を減少で
きる。The linear predictive coefficient interpolating apparatus according to the present invention provides a PARC of a predetermined order obtained from an input speech signal of a fixed time length.
Conversion means for hyperbolically transforming the OR coefficient, interpolation means connected to the conversion means for linearly interpolating the result of the hyperbolic transformation, and inverse means for inversely converting the linearly interpolated linear interpolation result connected to the interpolation means Conversion means,
The calculation of the logarithm required for the calculation of the AR becomes unnecessary, and the amount of calculation can be reduced when this method is actually realized on a computer.

【００６１】また、本発明の線形予測係数補間装置で
は、双曲線変換手段は、所定次数のＰＡＲＣＯＲ係数が
±１の値に近付く程、急傾斜を持つように双曲線変換を
行うので、±１付近に偏在する提示ＰＡＲＣＯＲ係数の
補間による誤差を、その分布を考慮した変換を行うこと
により削減できる。In the linear predictive coefficient interpolating apparatus according to the present invention, the hyperbolic conversion means performs the hyperbolic conversion so as to have a steep slope as the predetermined-order PARCOR coefficient approaches the value of ± 1, so that it is close to ± 1. An error due to interpolation of unevenly distributed presentation PARCOR coefficients can be reduced by performing conversion in consideration of the distribution.

[Brief description of the drawings]

【図１】本発明の線形予測係数補間装置の一実施例の構
成を示すブロック図である。FIG. 1 is a block diagram illustrating a configuration of an embodiment of a linear prediction coefficient interpolation device according to the present invention.

【図２】図１の線形予測係数補間部を備えた音声符号化
復号化装置の一構成例を示すブロック図である。FIG. 2 is a block diagram illustrating a configuration example of a speech encoding / decoding device including the linear prediction coefficient interpolation unit illustrated in FIG. 1;

【図３】図１の線形予測係数補間部によるＰＡＲＣＯＲ
係数の分布を表す説明図である。FIG. 3 is a diagram showing a PARCOR by a linear prediction coefficient interpolation unit shown in FIG. 1;
FIG. 9 is an explanatory diagram illustrating a distribution of coefficients.

【図４】図１の線形予測係数補間部によるＰＡＲＣＯＲ
係数補間処理の動作を説明するためのフローチャートで
ある。FIG. 4 is a diagram showing a PARCOR by the linear prediction coefficient interpolation unit shown in FIG. 1;
It is a flow chart for explaining operation of coefficient interpolation processing.

【図５】ＰＡＲＣＯＲからＬＡＲへの変換結果の説明図
である。FIG. 5 is an explanatory diagram of a conversion result from PARCOR to LAR.

[Explanation of symbols]

１短期予測部２聴覚重み付けフィルタ３，１４コードブック４，１５乗算器５，１６ピッチ予測フィルタ６，１８予測係数補間部７，２０２重み付け合成フィルタ８減算器９パワー計算部１０エラー最小化部１１符号化、マルチプレクサ部１２伝送路１３デマルチプレクサ、復号化部１７合成フィルタ６１双曲線変換部６２線形補間部６３双曲線逆変換部２０１線形予測フィルタ DESCRIPTION OF SYMBOLS 1 Short-term prediction part 2 Perceptual weighting filter 3,14 Codebook 4,15 Multiplier 5,16 Pitch prediction filter 6,18 Prediction coefficient interpolation part 7,202 Weighting synthesis filter 8 Subtractor 9 Power calculation part 10 Error minimization part 11 Encoding and multiplexer unit 12 Transmission path 13 Demultiplexer, decoding unit 17 Synthesis filter 61 Hyperbolic transformation unit 62 Linear interpolation unit 63 Hyperbolic inverse transformation unit 201 Linear prediction filter

フロントページの続き (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 13/00 G10L 19/00 - 19/14 H03M 7/30 H04B 14/04 Continuation of front page (58) Fields investigated (Int. Cl. ⁷ , DB name) G10L 13/00 G10L 19/00-19/14 H03M 7/30 H04B 14/04

Claims

(57) [Claims]

1. A conversion means for performing hyperbolic conversion of a predetermined order Percoll coefficient obtained from an input audio signal having a predetermined time length, and an interpolation means connected to the conversion means for linearly interpolating the hyperbolically converted result. A linear prediction coefficient interpolation apparatus, comprising: an inverse conversion means connected to the interpolation means for inversely converting the linearly interpolated linear interpolation result.

2. The linear prediction coefficient according to claim 1, wherein the hyperbolic transformation means performs the hyperbolic transformation so as to have a steep slope as the Percoll coefficient of the predetermined order approaches a value of ± 1. Interpolator.