JP3186013B2

JP3186013B2 - Acoustic signal conversion encoding method and decoding method thereof

Info

Publication number: JP3186013B2
Application number: JP00388895A
Authority: JP
Inventors: 健弘守谷; 直樹岩上
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1995-01-13
Filing date: 1995-01-13
Publication date: 2001-07-11
Anticipated expiration: 2016-07-11
Also published as: JPH08194497A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】この発明は音楽信号や音声信号の
ような音響信号を周波数領域の信号に変換し、これを、
そのパワースペクトル包絡の平方根で正規化して残差信
号を得、この残差信号を量子化すると共に、上記パワー
スペクトル包絡を得るための情報を補助情報として量子
化する高能率音響信号符号化方法及びその復号化方法に
関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention converts an acoustic signal such as a music signal or a voice signal into a signal in a frequency domain,
A high-efficiency audio signal encoding method for normalizing by the square root of the power spectrum envelope to obtain a residual signal, quantizing the residual signal, and quantizing information for obtaining the power spectrum envelope as auxiliary information; It relates to the decoding method.

【０００２】[0002]

【従来の技術】オーディオ信号を高能率に符号化する方
法として、原音信号をフレームと呼ばれる５〜５０ms程
度の一定間隔の区間に分割し、その１フレームの信号に
ＭＤＣＴ（変形離散コサイン変換）を用いて時間−周波
数変換を行って周波数領域信号を得、これを、そのパワ
ースペクトル包絡（周波数特性の包絡）と、そのパワー
スペクトル包絡で周波数領域信号を平坦化した残差信号
との２つの情報に分離し、それぞれを符号化することが
提案されている。2. Description of the Related Art As a method for encoding an audio signal with high efficiency, an original sound signal is divided into frames, which are called frames, at a fixed interval of about 5 to 50 ms, and MDCT (Modified Discrete Cosine Transform) is applied to one frame signal. Is used to perform time-frequency conversion to obtain a frequency domain signal, which is divided into two information, a power spectrum envelope (envelope of frequency characteristics) and a residual signal obtained by flattening the frequency domain signal with the power spectrum envelope. And encoding each of them.

【０００３】この提案されている符号化方法及びその復
号化方法を図７を参照して説明する。符号器３１におい
て入力端子３３からディジタル化した音響入力信号系列
がフレーム分割手段３４に入力されて、Ｎ入力サンプル
ごとに過去２×Ｎサンプルの入力系列を抽出し、長さ２
×Ｎサンプルの入力フレームに生成され、窓掛手段３５
でその入力フレームに時間窓がかけられる。その窓形状
はハニング窓を用いるのが一般的である。その窓かけさ
れた入力信号系列はＭＤＣＴ手段３６で変形離散コサイ
ン変換されて、Ｎサンプルの周波数領域信号に変換され
る。[0003] The proposed encoding method and its decoding method will be described with reference to FIG. The audio input signal sequence digitized from the input terminal 33 in the encoder 31 is input to the frame dividing means 34, and the input sequence of the past 2 × N samples is extracted for every N input samples, and the length is 2
XN samples are generated in the input frame and the windowing means 35
A time window is applied to the input frame. The window shape is generally a Hanning window. The windowed input signal sequence is subjected to a modified discrete cosine transform by the MDCT means 36 to be converted into a frequency domain signal of N samples.

【０００４】また前記窓かけされた入力信号系列は線形
予測分析手段３７で線形予測分析され、Ｐ次の予測係数
が求められる。この線形予測分析は自己相関を求めた後
に行われる。その予測係数は量子化手段３８で量子化さ
れる。この量子化の方法としては、予測係数をＬＳＰパ
ラメータに変換して量子化するＬＳＰ量子化の方法、予
測係数をｋパラメータに変換してから量子化する方法な
どを用いることができる。この量子化された予測係数を
示すインデックス３９が送出される。[0004] The windowed input signal sequence is subjected to linear prediction analysis by a linear prediction analysis means 37, and a P-order prediction coefficient is obtained. This linear prediction analysis is performed after obtaining the autocorrelation. The prediction coefficient is quantized by the quantization means 38. As the quantization method, an LSP quantization method of converting a prediction coefficient into an LSP parameter and quantizing it, or a method of converting a prediction coefficient into a k parameter and then quantizing it can be used. An index 39 indicating the quantized prediction coefficient is transmitted.

【０００５】また前記量子化予測係数は周波数概形計算
手段４１によりパワースペクトルを計算して周波数特性
概形信号が求められる。具体的には、量子化手段３８の
量子化出力を逆量子化し、例えば図８に示すようにその
Ｐ＋１個の逆量子化予測係数（αパラメータ）の後に２
×Ｎ−Ｐ−１個の０をつなげて作った長さ２×Ｎのサン
プル系列をＦＦＴ分析し（高速フーリエ変換：離散フー
リエ変換）、更にそのＮ次のパワースペクトルを計算し
する。０番目から始まってｉ番目の周波数特性概形の逆
数の各点は、ｉ＝Ｎ−１以外ではｉ＋１番目とｉ番目の
各パワースペクトルの平方根を平均して、つまり補間し
て得る。Ｎ−１番目の周波数特性概形の逆数は、Ｎ−１
番目のパワースペクトルの平方根をとって得る。The quantized prediction coefficients are used to calculate a power spectrum by a frequency outline calculating means 41 to obtain a frequency characteristic outline signal. Specifically, the quantized output of the quantizing means 38 is inversely quantized, and for example, as shown in FIG. 8, after the P + 1 inversely quantized prediction coefficients (α parameters), 2
An FFT analysis is performed on a 2 × N sample sequence formed by connecting × NP−1 zeros (fast Fourier transform: discrete Fourier transform), and the Nth-order power spectrum is calculated. Each point of the reciprocal of the i-th frequency characteristic outline starting from the 0-th is obtained by averaging the square roots of the i + 1-th and i-th power spectra except for i = N−1, that is, by interpolating. The reciprocal of the (N-1) th approximate frequency characteristic is N-1
Obtained by taking the square root of the power spectrum.

【０００６】図７の説明に戻って、正規化手段４２にお
いて、ＭＤＣＴ手段３６からの周波数領域信号の各サン
プルが、前記周波数概形の逆数の各サンプルとかけあわ
せて正規化され、平坦化された残差信号とされる。パワ
ー正規化・ゲイン量子化手段４３でこの残差信号はその
振幅の平均値、またはパワーの平均値の平方根である正
規化ゲインで割算されて正規化され、正規化残差信号と
され、更にその正規化ゲインが量子化され、その量子化
された正規化ゲインを示すインデックス４４が出力され
る。Returning to FIG. 7, in the normalizing means 42, each sample of the frequency domain signal from the MDCT means 36 is multiplied by each sample of the reciprocal of the above-mentioned frequency outline to normalize and flatten. The residual signal. The residual signal is normalized by the power normalizing / gain quantizing means 43 by dividing by an average value of the amplitude or a normalized gain which is a square root of the average value of the power to obtain a normalized residual signal. Further, the normalized gain is quantized, and an index 44 indicating the quantized normalized gain is output.

【０００７】また周波数概形計算手段４１からの周波数
特性概形の逆数の信号は必要に応じて重み計算手段４５
で聴感制御が施されて重み付け信号とされる。正規化残
差量子化手段４６で、手段４３からの正規化残差信号を
手段４５からの重み付け信号により適応重みづけベクト
ル量子化する。量子化手段４６で量子化されたベクトル
値を示すインデックス４７が出力される。以上のように
符号器３１から、予測係数量子化インデックス３９と、
ゲイン量子化インデックス４４と残差量子化インデック
ス４７とが出力される。[0007] The signal of the reciprocal of the approximate frequency characteristic from the approximate frequency calculation means 41 is weighted by the weight calculation means 45 if necessary.
Is subjected to audibility control to obtain a weighted signal. In the normalized residual quantization means 46, the normalized residual signal from the means 43 is adaptively weighted vector-quantized by the weight signal from the means 45. An index 47 indicating the vector value quantized by the quantization means 46 is output. As described above, from the encoder 31, the prediction coefficient quantization index 39,
The gain quantization index 44 and the residual quantization index 47 are output.

【０００８】これらインデックス３９，４４，４７を入
力された復号器３２は図７に示すように次のように復号
する。即ち予測係数量子化インデックス３９は再生手段
５６で対応する量子化予測係数が逆量子化されて再生さ
れ、その逆量子化予測係数は周波数概形計算手段５７で
周波数概形計算手段４１と同じ方法で周波数特性概形の
逆数、つまりパワースペクトル包絡の平方根の逆数が計
算され、一方再生手段５８で入力されたインデックス４
７から量子化正規化残差信号が再生される。再生手段５
９で入力されたインデックス４４から正規化ゲインが再
生される。パワー逆正規化手段６１において再生された
量子化正規化残差信号に再生された正規化ゲインが掛け
合わされてパワー逆正規化され量子化残差信号が得られ
る。その量子化残差信号は逆正規化手段６２で周波数概
形計算手段５７から周波数概形の逆数、つまりパワース
ペクトル包絡の平方根の逆数により各対応サンプルごと
に割算されて逆平坦化される。その逆平坦化された残差
信号は逆ＭＤＣＴ手段６３でＮ次の逆変形離散コサイン
変換されて、時間領域信号とされ、この時間領域信号に
対し、窓掛け手段６４で時間窓がかけられる。ここでは
窓形状としてハニング窓が用いられている。この窓掛け
された信号はフレーム重ね合せ手段６５で長さ２×Ｎサ
ンプルのフレームの前半Ｎサンプルと前フレームの後半
Ｎサンプルとが加え合わされて出力端子６６に出力され
る。The decoder 32 to which these indices 39, 44 and 47 are input decodes as follows, as shown in FIG. That is, the prediction coefficient quantization index 39 is reproduced by dequantizing the corresponding quantized prediction coefficient by the reproducing means 56, and the inverse quantized prediction coefficient is obtained by the frequency rough shape calculating means 57 in the same manner as the frequency rough shape calculating means 41. Calculates the reciprocal of the general frequency characteristic, that is, the reciprocal of the square root of the power spectrum envelope.
7, a quantized normalized residual signal is reproduced. Reproduction means 5
The normalized gain is reproduced from the index 44 input at 9. The quantized normalized residual signal reproduced by the power denormalization means 61 is multiplied by the reproduced normalization gain to perform power denormalization to obtain a quantized residual signal. The quantized residual signal is inversely flattened by the inverse normalizing means 62 for each corresponding sample by the inverse of the frequency outline, that is, the inverse of the square root of the power spectrum envelope from the frequency outline calculating means 57. The inverse-flattened residual signal is subjected to an Nth-order inversely modified discrete cosine transform by an inverse MDCT unit 63 to obtain a time-domain signal, and a time window is applied to the time-domain signal by a windowing unit 64. Here, a Hanning window is used as the window shape. The windowed signal is added to the first half N samples of the frame of length 2 × N samples and the second half N samples of the previous frame by the frame superimposing means 65 and output to the output terminal 66.

【０００９】[0009]

【発明が解決しようとする課題】図７に示した符号化、
復号化方法においては、復号化器３２において、インデ
ックス３９から逆量子化予測係数を得、これを図８に示
したようにパワースペクトルを求め、その各サンプルご
との平方根を求め、これとの逆数をそれぞれ求めている
が、各サンプルごとの平方根演算はかなりの処理量を必
要とし、実時間動作させるのに障害となる。かつ、その
ハードウエア規模も大となり、高価なものとなる。例え
ば符号化音楽信号を復号化する上で、上述のように処理
時間が長くなることは実用に適せず、またハードウエア
規模が大、高価な多数の復号化器を普及させる点から好
ましくない。The encoding shown in FIG.
In the decoding method, the decoder 32 obtains the inverse quantized prediction coefficient from the index 39, obtains the power spectrum as shown in FIG. 8, obtains the square root of each sample, and calculates the reciprocal thereof. , Respectively, but the square root operation for each sample requires a considerable amount of processing, which hinders real-time operation. In addition, the hardware scale becomes large and expensive. For example, when decoding an encoded music signal, it is not preferable that the processing time is long as described above because it is not suitable for practical use, and the hardware scale is large and a large number of expensive decoders are spread. .

【００１０】また逆量子化予測係数からパワースペクト
ル包絡を直接求めるには高い演算精度が要求され、演算
語長が短い復号化器で実現するには障害となる。この発
明の目的は復号化器でパワースペクトル包絡のサンプル
ごとの平方根を演算する必要をなくす変換符号化方法を
提供することになる。この発明の他の目的はサンプルご
との平方根演算を必要としない変換復号化方法を提供す
ることにある。Further, a high calculation accuracy is required to directly obtain the power spectrum envelope from the inversely quantized prediction coefficients, which is an obstacle to realizing a decoder having a short calculation word length. SUMMARY OF THE INVENTION It is an object of the present invention to provide a transform coding method that eliminates the need for a decoder to calculate the square root of the power spectrum envelope for each sample. It is another object of the present invention to provide a transform decoding method that does not require a square root operation for each sample.

【００１１】この発明の更に他の目的はサンプルごとの
平方根演算のみならず、演算語長の短い変換復号化方法
を提供することにある。Still another object of the present invention is to provide a conversion decoding method having a short operation word length as well as a square root operation for each sample.

【００１２】[0012]

【課題を解決するための手段】請求項１の発明の符号化
方法によれば、変換符号化中のスペクトル包絡の符号化
のために、パワースペクトル包絡の平方根を表す包絡を
線形予測分析でモデル化し、その線形予測パラメータを
量子化して、平方根パワースペクトル包絡を得るための
符号化情報とし、その量子化されたパラメータを逆量子
化し、その逆量子化されたパラメータから平方根パワー
スペクトル包絡の逆数を求め、これを周波数領域に変換
された信号の正規化に用いる。According to the encoding method of the present invention, an envelope representing a square root of a power spectrum envelope is modeled by linear prediction analysis for encoding a spectrum envelope during transform encoding. , Quantize the linear prediction parameters, and use it as coding information to obtain a square root power spectrum envelope, dequantize the quantized parameters, and calculate the reciprocal of the square root power spectrum envelope from the dequantized parameters. This is used for normalization of the signal converted to the frequency domain.

【００１３】つまり、パワースペクトル包絡の平方根を
表す包絡を線形予測分析でモデル化することは、例えば
入力音響信号の自己相関を求め、これをフーリエ変換
し、その変換結果の各サンプルのパワーの平方根を求
め、その平方根の系列を逆フーリエ変換し、この変換結
果、つまりパワースペクトルの平方根について線形予測
分析を行うことである。この線形予測分析パラメータの
量子化出力が復号化側に入力されるから、復号器では平
方根演算を行うことなく、平方根パワースペクトル包絡
またはその逆数を得ることができる。That is, modeling the envelope representing the square root of the power spectrum envelope by linear predictive analysis involves, for example, obtaining the autocorrelation of the input audio signal, performing a Fourier transform on the autocorrelation, and calculating the square root of the power of each sample of the conversion result. , And the series of square roots is subjected to inverse Fourier transform, and linear prediction analysis is performed on the result of the conversion, that is, the square root of the power spectrum. Since the quantized output of the linear prediction analysis parameter is input to the decoding side, the square root power spectrum envelope or its reciprocal can be obtained without performing the square root operation in the decoder.

【００１４】パワースペクトル包絡の平方根を線形予測
分析でモデル化するには、入力音響信号を線形予測分析
し、その線形予測パラメータからＬＰＣケプストラムを
求め、そのＬＰＣケプストラムの各係数を２で割り、そ
の２で割ったＬＰＣケプストラムから最小自乗法で線形
予測パラメータを求める（請求項２）。逆量子化線形予
測パラメータから平方根パワースペクトル包絡の逆数を
求めるには、逆量子化された線形予測パラメータから線
形予測係数（αパラメータ）を求め、その線形予測係数
をフーリエ変換し、その各サンプルの絶対値を求める
（請求項３）。In order to model the square root of the power spectrum envelope by linear prediction analysis, the input acoustic signal is subjected to linear prediction analysis, an LPC cepstrum is obtained from the linear prediction parameters, and each coefficient of the LPC cepstrum is divided by two. A linear prediction parameter is obtained from the LPC cepstrum divided by 2 by the least square method (claim 2). In order to obtain the reciprocal of the square root power spectrum envelope from the inverse quantized linear prediction parameter, a linear prediction coefficient (α parameter) is obtained from the inversely quantized linear prediction parameter, and the linear prediction coefficient is subjected to a Fourier transform. An absolute value is obtained (claim 3).

【００１５】逆量子化線形予測係数から平方根パワース
ペクトル包絡の逆数を求めるには逆量子化された線形予
測パラメータからＬＳＰパラメータを求め、そのＬＳＰ
パラメータについて単位円上でスペクトルを計算する
（請求項４）。上記モデル化は入力音響信号を線形予測
分析し、その線形予測分析結果のパラメータからＬＰＣ
ケプストラムを求め、そのＬＰＣケプストラムの各係数
を２で割ることであり、その２で割ったＬＰＣケプスト
ラムを量子化して、スペクトル包絡の平方根を得るため
の符号（インデックス）として出力し、上記パワースペ
クトルの平方根を得るには上記量子化ＬＰＣケプストラ
ムを逆量子化し、更にフーリエ変換し、そのフーリエ変
換された各サンプルを指数変換して得る（請求項５）。To find the reciprocal of the square root power spectrum envelope from the inversely quantized linear prediction coefficient, an LSP parameter is obtained from the inversely quantized linear prediction parameter, and the LSP parameter is obtained.
A spectrum is calculated on the unit circle for the parameter (claim 4). In the above modeling, the input acoustic signal is subjected to linear prediction analysis, and LPC is performed based on the parameters of the linear prediction analysis result.
A cepstrum is obtained, and each coefficient of the LPC cepstrum is divided by 2. The LPC cepstrum divided by the 2 is quantized and output as a code (index) for obtaining a square root of a spectrum envelope. In order to obtain a square root, the quantized LPC cepstrum is inversely quantized, Fourier-transformed, and each Fourier-transformed sample is exponentially transformed (claim 5).

【００１６】請求項５におけるモデル化において、上記
２での割算を省略し、代わりに逆量子化ＬＰＣケプスト
ラムのフーリエ変換した各サンプルを２で割算した後、
指数変換してもよい（請求項６）。請求項７の復号化方
法によれば、入力された符号から線形予測係数またはこ
れと等価なパラメータ（例えばＬＳＰ）を逆量子化し、
これから平方根パワースペクトル包絡の逆数を求め、そ
の平方根パワースペクトル包絡の逆数で、入力符号の逆
量子化により得た残差信号を除算して逆正規化した周波
数領域信号を得る。ここで平方根パワースペクトル包絡
とは、本来のシステムのパワースペクトル包絡のサンプ
ルごとの平方根に対応する周波数領域の系列である。上
記平方根パワースペクトル包絡の逆数は、再生された線
形予測パラメータから線形予測係数（αパラメータ）を
得、この線形予測係数をフーリエ変換し、その各サンプ
ルの絶対値をとって得る（請求項８）。又は再生された
線形予測パラメータからＬＳＰパラメータを求め、その
ＬＳＰパラメータについて単位円上でスペクトルを計算
する（請求項９）。In the modeling according to claim 5, the division by 2 is omitted, and instead, each sample of the inversely quantized LPC cepstrum obtained by Fourier transform is divided by 2.
Exponential conversion may be performed (claim 6). According to the decoding method of the seventh aspect, a linear prediction coefficient or a parameter equivalent thereto (for example, LSP) is inversely quantized from the input code,
From this, the reciprocal of the square root power spectrum envelope is obtained, and the residual signal obtained by the inverse quantization of the input code is divided by the reciprocal of the square root power spectrum envelope to obtain a denormalized frequency domain signal. Here, the square root power spectrum envelope is a sequence in the frequency domain corresponding to the square root for each sample of the power spectrum envelope of the original system. The reciprocal of the square root power spectrum envelope is obtained by obtaining a linear prediction coefficient (α parameter) from the reproduced linear prediction parameter, performing a Fourier transform on the linear prediction coefficient, and taking an absolute value of each sample (claim 8). . Alternatively, an LSP parameter is obtained from the reproduced linear prediction parameter, and a spectrum is calculated on the unit circle for the LSP parameter (claim 9).

【００１７】請求項１０の復号化方法によれば、入力さ
れた符号から線形予測係数またはこれと等価なパラメー
タ（例えばＬＳＰ）を逆量子化し、これから対応するイ
ンパルス応答を算出し、そのインパルス応答をフーリエ
変換して平方根パワースペクトル包絡を求め、その平方
根パワースペクトル包絡を、入力符号の逆量子化により
得た残差信号に乗算して逆正規化された周波数領域信号
を得る。According to the decoding method of the tenth aspect, a linear prediction coefficient or a parameter equivalent thereto (for example, LSP) is inversely quantized from the input code, a corresponding impulse response is calculated from this, and the impulse response is calculated. Fourier transform is performed to obtain a square root power spectrum envelope, and the square root power spectrum envelope is multiplied by a residual signal obtained by inverse quantization of an input code to obtain a denormalized frequency domain signal.

【００１８】請求項１１の復号化方法では、平方根パワ
ースペクトル包絡は、再生線形予測パラメータまたはこ
れと等価なパラメータからＬＰＣケプストラムを求め、
そのＬＰＣケプストラムをフーリエ変換し、そのフーリ
エ変換された各サンプルを指数変換して得る。請求項１
２の復号化方法では、平方根パワースペクトル包絡は、
再生線形予測パラメータまたはこれと等価なパラメータ
からＬＰＣケプストラムを求め、そのＬＰＣケプストラ
ムをフーリエ変換し、そのフーリエ変換の各サンプルを
２分の１してから指数変換して得る。In the decoding method according to the eleventh aspect, the square root power spectrum envelope is obtained from an LPC cepstrum from a reconstructed linear prediction parameter or a parameter equivalent thereto.
The LPC cepstrum is Fourier-transformed, and each Fourier-transformed sample is obtained by exponential conversion. Claim 1
In the decoding method of 2, the square root power spectrum envelope is
The LPC cepstrum is obtained from the reconstructed linear prediction parameter or a parameter equivalent thereto, the LPC cepstrum is subjected to Fourier transform, and each sample of the Fourier transform is halved and then subjected to exponential transform.

【００１９】[0019]

【実施例】図１に請求項１の発明の実施例を示し、図７
と対応する部分に同一符号を付けてある。この実施例で
は窓掛け手段３５の出力信号系列は、そのパワースペク
トル包絡の平方根を表す包絡を線形予測分析でモデル化
する手段７１に分岐供給される。手段７１は例えばまず
相関関数手段７２で入力信号の自己相関関数を１フレー
ム中のＮ個の点まで求める。この代わりに図２Ａに示す
ように自己相関関数はＰ＋１個の点まで相関関数手段７
２ａで求め、その結果についてＰ次の線形予測分析を線
形予測分析手段７２ｂで行い、求まった線形予測係数
を、先のＰ＋１点までの自己相関関数に対し、Ｐ＋２次
以降Ｎ点まで相関関数として外挿手段７２ｃで外挿して
もよい。次にＮ点中のこの自己相関関数をこの系列にＮ
点のゼロを付加するか、Ｎ点を対称化して代入して２Ｎ
点の実フーリエ変換をフーリエ変換手段７３で行う。相
関関数を自己相関法で求めたのであれば、変換後の実部
がパワースペクトルであり、演算精度の誤差を除いてす
べて正の値をとる。このように入力信号の自己相関を求
め、これをフーリエ変換するとパワースペクトルが得ら
れることは良く知られていることである。FIG. 1 shows an embodiment of the invention of claim 1, and FIG.
The same reference numerals are given to portions corresponding to. In this embodiment, the output signal sequence of the windowing means 35 is branched and supplied to means 71 for modeling the envelope representing the square root of the power spectrum envelope by linear prediction analysis. The means 71 first obtains the autocorrelation function of the input signal up to N points in one frame by the correlation function means 72, for example. Instead, as shown in FIG. 2A, the auto-correlation function is performed by the correlation function means 7 up to P + 1 points.
2a, a P-order linear prediction analysis is performed on the result by the linear prediction analysis means 72b, and the obtained linear prediction coefficient is calculated as a correlation function from the P + 2 order to the Nth point with respect to the autocorrelation function up to the P + 1 point. The extrapolation means 72c may extrapolate. Next, this autocorrelation function in N points is added to this series by N
Add a zero to the point or symmetrically substitute the N point to get 2N
The real Fourier transform of the point is performed by the Fourier transform means 73. If the correlation function is obtained by the autocorrelation method, the real part after the conversion is the power spectrum, and all take positive values except for the error in the calculation accuracy. It is well known that a power spectrum is obtained by obtaining the autocorrelation of an input signal and performing Fourier transform on the autocorrelation.

【００２０】このパワースペクトルの各点の平方根を平
方根手段７４で求める。このとき虚部はすべてゼロとし
た後（対称化して代入した場合はもともと虚部はすべて
ゼロ）、逆フーリエ変換を逆フーリエ変換手段７５で行
い、平方根パワースペクトルに対応する自己相関関数を
得る。最後にこの自己相関関数に基づいて線形予測分析
を線形予測分析手段７６で行い予測パラメータを求め、
つまりパワースペクトル包絡の平方根を表す線形予測分
析でモデル化したものを得る。これを予測係数量子化手
段３８で量子化してインデックス３９を得る。このイン
デックス３９は入力信号系列の線形予測分析により得た
ものではなく、入力信号系列の周波数特性、つまりパワ
ースペクトル包絡の平方根と対応する信号系列を線形予
測分析したものを量子化したものである。The square root of each point of the power spectrum is obtained by a square root means 74. At this time, after all the imaginary parts are set to zero (or the imaginary parts are all zero when the symmetry is substituted), the inverse Fourier transform is performed by the inverse Fourier transform means 75 to obtain an autocorrelation function corresponding to the square root power spectrum. Finally, a linear prediction analysis is performed by the linear prediction analysis means 76 based on the autocorrelation function to obtain prediction parameters,
That is, a model obtained by linear prediction analysis representing the square root of the power spectrum envelope is obtained. This is quantized by the prediction coefficient quantization means 38 to obtain an index 39. The index 39 is not obtained by the linear prediction analysis of the input signal sequence, but is obtained by quantizing a frequency characteristic of the input signal sequence, that is, a signal obtained by performing a linear prediction analysis on the signal sequence corresponding to the square root of the power spectrum envelope.

【００２１】この量子化手段３８の量子化出力を逆量子
化手段７７で逆量子化し、その逆量子化線形予測係数
を、フーリエ変換、絶対値手段７８でフーリエ変換し、
その各サンプルの複素数の絶対値を取って平方根パワー
スペクトル包絡の逆数を得、これをＭＤＣＴ手段３６よ
りの周波数領域信号に各サンプルごとに乗算器４２で乗
算して正規化する。その他の処理は図５の場合と同一で
ある。The quantized output of the quantizing means 38 is inversely quantized by an inverse quantizing means 77, and the inversely quantized linear prediction coefficient is subjected to a Fourier transform and an absolute value means 78 to perform a Fourier transform.
The absolute value of the complex number of each sample is taken to obtain the reciprocal of the square root power spectrum envelope, and the frequency domain signal from the MDCT means 36 is multiplied by the multiplier 42 for each sample for normalization. Other processes are the same as those in FIG.

【００２２】図２Ｂに示すように、相関関数のフーリエ
変換手段７３で得たパワースペクトルを平方根手段７４
で平方根をとってもよい。更に図２Ｃに示すように、入
力信号系列をＰ点の自己相関関数を手段７２ａで求め、
これを手段７２ｂで線形予測分析し、その分析結果と対
応するインパルス応答を手段７２ｄで求めてＮ点の自己
相関関数を得てもよい。つまり手段７２ｂの線形予測分
析結果をフィルタ係数とするフィルタのインパルス応答
を求めてＮ点の自己相関関数を求めることもできる。線
形予測係数をフーリエ変換するとパワースペクトル包絡
の逆数となるが、手段７２ｄで求めたインパルス応答を
手段７３でフーリエ変換して絶対値をとってパワースペ
クトル包絡を得る。このフーリエ変換手段７３の代わり
に、手段７２ｄで得たインパルス応答の相関関数を求
め、その相関関数をフーリエ変換してパワースペクトル
包絡を得てもよい。As shown in FIG. 2B, the power spectrum obtained by the Fourier transform means 73 of the correlation function is converted to the square root means 74.
May take the square root. Further, as shown in FIG. 2C, the autocorrelation function of the input signal sequence at point P is obtained by means 72a,
This may be subjected to linear prediction analysis by means 72b, and an impulse response corresponding to the analysis result obtained by means 72d to obtain an N-point autocorrelation function. That is, the autocorrelation function at N points can be obtained by obtaining the impulse response of the filter using the linear prediction analysis result of the means 72b as the filter coefficient. The Fourier transform of the linear prediction coefficient results in the reciprocal of the power spectrum envelope. The impulse response obtained by the means 72d is Fourier transformed by the means 73 to obtain an absolute value to obtain a power spectrum envelope. Instead of the Fourier transform means 73, a correlation function of the impulse response obtained by the means 72d may be obtained, and the correlation function may be Fourier-transformed to obtain a power spectrum envelope.

【００２３】なお、何れの場合も手段７６での線形予測
分析の次数に対して、相関関数手段７２の次数は例えば
５割増し程度として、周波数成分によりかたよることな
く、なるべく長い相関関数を求め、なるべく忠実な情報
がフーリエ変換手段７３に入力されるようにする。平方
根パワースペクトル包絡の線形予測でのモデル化法の更
に他の手法を図３Ａに示す（請求項２）。この例ではＬ
ＰＣケプストラムを介して平方根スペクトルに対応する
線形予測係数を求める。つまり相関関数手段７２ａによ
り入力音響信号の自己相関関数を求め、これを手段７２
ｂで線形予測分析する。この線形予測パラメータから漸
化式演算手段９１によりＬＰＣケプストラムを求める。
つまり入力音響信号のｐ次線形予測における予測パラメ
ータα_k（ｋ＝１，…，ｐ）からｊ次ＬＰＣケプストラ
ム係数ｃ_jを求める計算手順は以下のように知られてい
る。通常ｐは１０から２０程度とするがＬＰＣケプスト
ラムの次数ｎはｐの２倍から３倍必要である。In any case, the order of the correlation function means 72 is increased by, for example, about 50% with respect to the order of the linear prediction analysis by the means 76, and a correlation function as long as possible is obtained without depending on the frequency component. Faithful information is input to the Fourier transform means 73. FIG. 3A shows still another method of the modeling method in the linear prediction of the square root power spectrum envelope (Claim 2). In this example, L
A linear prediction coefficient corresponding to the square root spectrum is obtained via the PC cepstrum. That is, the autocorrelation function of the input sound signal is obtained by the correlation function means 72a,
Perform linear prediction analysis with b. An LPC cepstrum is obtained from the linear prediction parameters by the recurrence formula calculation means 91.
That is, the calculation procedure for calculating the j-th order LPC cepstrum coefficient c _j from the prediction parameter α _k (k = 1,..., P) in the p-order linear prediction of the input acoustic signal is known as follows. Usually, p is about 10 to 20, but the order n of the LPC cepstrum needs to be 2 to 3 times p.

【００２４】[0024]

【数１】 (Equation 1)

【００２５】パワースペクトルの平方根はＬＰＣケプス
トラムの領域では単にすべての係数に0.５をかけるだけ
であり、手段９１よりのＬＰＣケプストラムを手段９２
において２で割算してパワースペクトル包絡の平方根を
求め、これを正規方程式演算手段９３で線形予測パラメ
ータを求め、更に必要に応じて例えばＬＳＰパラメータ
で量子化手段３８において量子化してインデックス３９
として出力する。手段９３ではＬＰＣケプストラムと線
形予測パラメータとの上記の関係を逆に使えばよいが、
ＬＰＣケプストラム係数の個数は線形予測パラメータα
の個数よりはるかに多いので、すべてのＬＰＣケプスト
ラムの拘束をみたす線形予測パラメータは一般に存在し
ない。そこで、上記の関係を回帰式とみなしてＬＰＣケ
プストラムの回帰誤差ｅ_iの自乗を最小化するように線
形予測パラメータを求める。この場合、求まった線形予
測パラメータの安定性は保障されないので、例えばＰＡ
ＲＣＯＲ係数に変換するなどの安定性チェックが必要で
ある。線形予測パラメータとＬＰＣケプストラムの関係
を以下のように行列で表す。The square root of the power spectrum in the region of the LPC cepstrum simply multiplies all the coefficients by 0.5, and the LPC cepstrum from the means 91 is divided by the means 92
, The square root of the power spectrum envelope is obtained, and this is calculated by the normal equation calculation means 93 to obtain a linear prediction parameter, and further, if necessary, quantized by the LSP parameter in the quantization means 38 and index 39
Output as Means 93 may use the above relationship between the LPC cepstrum and the linear prediction parameter in reverse,
The number of LPC cepstrum coefficients is determined by the linear prediction parameter α
, There is generally no linear prediction parameter that satisfies all LPC cepstrum constraints. Therefore, the above relationship is regarded as a regression equation, and a linear prediction parameter is determined so as to minimize the square of the regression error e _i of the LPC cepstrum. In this case, since the stability of the obtained linear prediction parameter is not guaranteed, for example, PA
A stability check such as conversion into RCOR coefficients is necessary. The relationship between the linear prediction parameters and the LPC cepstrum is represented by a matrix as follows.

【００２６】[0026]

【数２】 (Equation 2)

【００２７】以上の関係で、ＬＰＣケプストラムの回帰
誤差エネルギｄ＝Ｅ^TＥを最小化するために以下の正規
方程式を手段９３で解けば線形予測パラメータを得るこ
とができる。Ｄ^TＤＡ＝−Ｄ^TＣ上述ではパワースペクトル包絡の平方根を線形予測モデ
ル化したものの量子化出力として線形予測パラメータ、
またはＬＳＰパラメータなどを用いたが、図３Ｂに示す
ように、１／２手段９２の出力を量子化手段３８で量子
化してインデックス３９として出力してもよい。この場
合は符号化器側で逆量子化手段７７で逆量子化された１
／２ＬＰＣケプストラムを手段７８でフーリエ変換して
対数パワースペクトル包絡に変換し、更に手段９４で指
数変換を行って平方根パワースペクトル包絡を得、この
逆数を手段９５でとって図１の乗算器４２へ出力する。[0027] In the above relation, it is possible to obtain a linear prediction parameters by solving the following normal equation by means 93 in order to minimize the regression error energy d = E ^T E of the LPC cepstrum. D ^T DA = −D ^T C In the above description, a linear prediction parameter is obtained as a quantization output of a linear prediction model of the square root of the power spectrum envelope,
Alternatively, although the LSP parameter or the like is used, the output of the 1/2 means 92 may be quantized by the quantization means 38 and output as an index 39 as shown in FIG. 3B. In this case, 1 which is inversely quantized by the inverse quantization means 77 on the encoder side is used.
The / 2 LPC cepstrum is Fourier-transformed by means 78 into a logarithmic power spectrum envelope, and further exponentially transformed by means 94 to obtain a square root power spectrum envelope, and the reciprocal is taken by means 95 to the multiplier 42 of FIG. Output.

【００２８】更に図３Ｃに示すように手段９２を省略
し、代わりにフーリエ変換手段７８より対数パワースペ
クトル包絡の各サンプルを手段９６を１／２して指数変
換手段９４へ供給してもよい。以上のようにインデック
ス３９として、パワースペクトル包絡の平方根（平方根
パワースペクトル）を線形予測分析して量子化したもの
を用いることにより、以下に述べるように復号化器での
処理が図５の場合よりも簡単になる。Further, as shown in FIG. 3C, the means 92 may be omitted, and each sample of the logarithmic power spectrum envelope may be supplied from the Fourier transform means 78 to the exponential transform means 94 by halving the means 96. As described above, by using, as the index 39, a value obtained by performing linear prediction analysis on the square root of the power spectrum envelope (square root power spectrum) and quantizing it, the processing in the decoder is performed as shown in FIG. Is also easier.

【００２９】請求項７の発明の復号化方法実施例を図４
に示し、図７と対応する部分に同一符号を付けてある。
この実施例でも再生手段５６でインデックス３９が逆量
子化されて線形予測係数が求められるが、この線形予測
係数は図１の説明から明らかなように、平方根パワース
ペクトルを線形予測分析したものであるから、これをフ
ーリエ変換手段８２によりフーリエ変換し、その絶対値
を得ることにより平方根パワースペクトル包絡の逆数が
得られ、この平方根パワースペクトル包絡の逆数で乗算
器６１よりの再生された残差信号が割算器６２において
割算され周波数領域信号が再生される。その他は図５の
場合と同様である。FIG. 4 shows an embodiment of the decoding method according to the seventh aspect of the present invention.
The same reference numerals are given to the portions corresponding to those in FIG.
In this embodiment as well, the index 39 is inversely quantized by the reproducing means 56 to obtain a linear prediction coefficient. The linear prediction coefficient is obtained by performing linear prediction analysis on the square root power spectrum, as is apparent from the description of FIG. Is Fourier-transformed by the Fourier transform means 82, and the reciprocal of the square root power spectrum envelope is obtained by obtaining its absolute value. The reciprocal of the square root power spectrum envelope is used to generate the residual signal reproduced from the multiplier 61. The frequency domain signal is reproduced in the divider 62. Others are the same as in FIG.

【００３０】図５Ａに請求項１０の発明の実施例を示
し、図７と対応する部分に同一符号を付けてある。この
実施例では、図１で得られたインデックス３９が再生手
段５６で逆量子化され、この逆量子化線形予測係数のイ
ンパルス応答が手段８４で求められ、そのインパルス応
答はフーリエ変換手段８５でフーリエ変換され、その絶
対値がとられて平方根パワースペクトル包絡が得られ、
これが乗算器６１からの残差信号に乗算器８６で乗算さ
れて周波数領域信号が再生される。FIG. 5A shows an embodiment of the tenth aspect of the present invention, in which parts corresponding to those in FIG. 7 are denoted by the same reference numerals. In this embodiment, the index 39 obtained in FIG. 1 is inversely quantized by the reproducing means 56, the impulse response of the inversely quantized linear prediction coefficient is obtained by the means 84, and the impulse response is Fourier transformed by the Fourier transforming means 85. Is transformed, its absolute value is taken and the square root power spectrum envelope is obtained,
This is multiplied by the residual signal from the multiplier 61 by the multiplier 86 to reproduce the frequency domain signal.

【００３１】フーリエ変換手段８２では逆量子化線形予
測係数にゼロを付加してフーリエ変換を行う。量子化手
段３８での量子化は線形予測係数をＬＳＰ係数に変換し
て行うと便利である。この点でインデックス３９として
ＬＳＰ係数を示すものを用いてもよい。この場合、復号
器におけるフーリエ変換手段８２，８５の代わりに、単
位円上でスペクトルの値を計算して電力伝達関数（パワ
ースペクトル）を求める。この計算方法は、例えば１９
８５年東海大学出版会発行、古井貞煕著「ディジタル音
声処理」９１頁に示されている。The Fourier transform means 82 performs a Fourier transform by adding zero to the inverse quantized linear prediction coefficient. It is convenient that the quantization by the quantization means 38 is performed by converting the linear prediction coefficients into LSP coefficients. At this point, an index indicating an LSP coefficient may be used as the index 39. In this case, instead of the Fourier transform means 82 and 85 in the decoder, the value of the spectrum is calculated on the unit circle to obtain the power transfer function (power spectrum). This calculation method is, for example, 19
This is shown on page 91 of "Digital Audio Processing" by Sadahiro Furui, published by Tokai University Press in 1985.

【００３２】図３Ｂの符号化方法に対応した復号化方法
の一部を図５Ｂに示す。この場合は予測パラメータ再生
手段５６で１／２ＬＰＣケプストラムであり、これがフ
ーリエ変換手段８２により対数パワースペクトル包絡に
変換し、これを手段９７により各サンプルごとに指数変
換して平方根パワースペクトル包絡を得、これを乗算器
８６へ供給する。図３Ｃに示す符号化方法に対する復号
化法では図６Ａに示すように、図５Ｂにおけるフーリエ
変換手段８２よりの対数パワースペクトル包絡を手段９
８で１／２にしてから指数変換手段９７へ供給すればよ
い。FIG. 5B shows a part of a decoding method corresponding to the encoding method of FIG. 3B. In this case, the prediction parameter reproducing means 56 is a Ｌ LPC cepstrum, which is converted into a logarithmic power spectrum envelope by a Fourier transform means 82, which is exponentially converted for each sample by means 97 to obtain a square root power spectrum envelope. This is supplied to the multiplier 86. In the decoding method corresponding to the encoding method shown in FIG. 3C, as shown in FIG. 6A, the logarithmic power spectrum envelope from the Fourier transform means 82 in FIG.
It is sufficient to supply the data to the exponential conversion means 97 after halving with 8.

【００３３】図５Ａにおいてインパルス応答を求める代
わりに、図６Ｂに示すように再生手段５６からの再生さ
れたＰ次の線形予測係数からＰＡＲＩＯＲ係数に変換し
さらにＰ次までの相関関数を求め、手段９９で漸化式で
Ｐ次以上を外挿してフーリエ変換手段８５へ供給しても
よい。上述において線形予測パラメータは、線形予測係
数（αパラメータ）、ＬＳＰパラメータ、パーコール係
数などを意味し、従って予測インデックス３９はこれら
各種の形態の何れかで量子化されたものである。そして
符号化法、復合化法においても、その予測インデックス
３９を逆量子化し、その逆量子化したものをそのまま利
用する場合と他の線形予測パラメータに変換して利用し
てもよい。Instead of obtaining the impulse response in FIG. 5A, as shown in FIG. 6B, the P-order linear prediction coefficient reproduced from the reproducing means 56 is converted into a PARIOR coefficient, and a correlation function up to the P-order is obtained. At 99, the P-th order or higher may be extrapolated by the recurrence formula and supplied to the Fourier transform means 85. In the above description, the linear prediction parameter means a linear prediction coefficient (α parameter), an LSP parameter, a Percoll coefficient, and the like. Therefore, the prediction index 39 is quantized in any of these various forms. Also in the encoding method and the decoding method, the prediction index 39 may be inversely quantized, and the inversely quantized one may be used as it is, or may be converted to another linear prediction parameter and used.

【００３４】上述において相関関数手段７２（７２ａ）
として、ＭＤＣＴ手段３６の出力、つまり周波数領域信
号の絶対値をとり、これを逆フーリエ変換して自己相関
関数を求めてもよい。更に時間領域から周波数領域への
変換は離散的フーリエ変換（高速フーリエ変換）、離散
的コサイン変換など他の手法によってもよい。In the above description, the correlation function means 72 (72a)
As an alternative, the absolute value of the output of the MDCT means 36, that is, the frequency domain signal may be obtained, and this may be subjected to inverse Fourier transform to obtain the autocorrelation function. Further, the conversion from the time domain to the frequency domain may be performed by another method such as a discrete Fourier transform (fast Fourier transform), a discrete cosine transform, or the like.

【００３５】[0035]

【発明の効果】以上述べたように、この発明の符号化方
法によれば、周波数領域信号の平方根パワースペクトル
の包絡を線形予測パラメータでモデル化し、その線形予
測パラメータを量子化して符号化出力としているため、
この発明の復号化方法に示すように、復号化において平
方根演算を必要とせず、復号化器の処理が軽減される。As described above, according to the encoding method of the present invention, the envelope of the square root power spectrum of the frequency domain signal is modeled by the linear prediction parameter, and the linear prediction parameter is quantized to produce an encoded output. Because
As shown in the decoding method of the present invention, the square root operation is not required for decoding, and the processing of the decoder is reduced.

【００３６】更に線形予測パラメータやこれと等価なパ
ラメータからなるパワースペクトル包絡を計算する際に
は、一般に高い演算精度が必要となるが、平方根パワー
スペクトルは変動の範囲が通常のパワースペクトルより
少ないので要求される精度も少なくて済む。また同じ理
由により、平方根パワースペクトルはパワースペクトル
より線形予測パラメータを量子化する際、特にベクトル
量子化を用いるときのビット数を削減できる。Further, when calculating a power spectrum envelope composed of linear prediction parameters and parameters equivalent thereto, generally high calculation accuracy is required. However, the variation of the square root power spectrum is smaller than that of a normal power spectrum. Less accuracy is required. For the same reason, the square root power spectrum can reduce the number of bits when quantizing linear prediction parameters from the power spectrum, particularly when vector quantization is used.

[Brief description of the drawings]

【図１】請求項１の発明の実施例を適用した符号化器の
例を示すブロック図。FIG. 1 is a block diagram showing an example of an encoder to which an embodiment of the present invention is applied.

【図２】図１中の平方根パワースペクトル包絡の線形予
測モデル化手段７１の他の各種手法を示すブロック図。FIG. 2 is a block diagram showing various other methods of the linear prediction modeling unit 71 of the square root power spectrum envelope in FIG.

【図３】Ａはモデル化手段７１の更に他の例を示すブロ
ック図、Ｂ及びＣはその更に他の例及び正規化のための
平方根スペクトル包絡の逆数を得る手段を示す図であ
る。FIG. 3A is a block diagram showing still another example of the modeling unit 71, and FIGS. 3B and 3C are diagrams showing still another example and a unit for obtaining the reciprocal of the square root spectral envelope for normalization.

【図４】請求項６の発明の実施例を適用した復号化器の
例を示すブロック図。FIG. 4 is a block diagram showing an example of a decoder to which the embodiment of the invention of claim 6 is applied.

【図５】Ａは請求項８の発明を適用した復号化器の例を
示すブロック図、Ｂは他の復号化法の一部を示すブロッ
ク図である。FIG. 5A is a block diagram showing an example of a decoder to which the invention of claim 8 is applied, and FIG. 5B is a block diagram showing a part of another decoding method.

【図６】平方根パワースペクトル包絡の復号化の他の例
を示すブロック図。FIG. 6 is a block diagram showing another example of decoding a square root power spectrum envelope.

【図７】先に提案した音響信号変換符号化復号化器を示
すブロック図。FIG. 7 is a block diagram showing a previously proposed acoustic signal transform coding / decoding device.

【図８】線形予測係数からフーリエ変換によりパワース
ペクトル包絡値を求める様子を示す図。 FIG. 8 is a graph showing the powers obtained by Fourier transform from linear prediction coefficients .
The figure which shows a mode that a vector envelope value is calculated | required.

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩＧ１０Ｌ 9/18 Ｅ ──────────────────────────────────────────────────の Continued on the front page (51) Int.Cl. ⁷ Identification code Fig10L 9/18 E

Claims

(57) [Claims]

1. An input audio signal is converted from a time domain to a signal in a frequency domain, a square root of a power spectrum envelope of the converted frequency domain signal is obtained, and the square root of the power spectrum envelope is normalized with the frequency domain signal. In the audio signal transform encoding method for encoding the residual signal and the signal for obtaining the power spectrum envelope, respectively, a process of modeling an envelope representing a square root of the power spectrum envelope by linear predictive analysis, A process of quantizing the linear prediction parameters to obtain coding information for obtaining the power spectrum envelope; a process of inversely quantizing the quantized linear prediction parameters; and a process of calculating the power from the dequantized linear prediction parameters. Obtaining the reciprocal of the square root of the spectral envelope, and using the reciprocal of the square root, and Acoustic signal transform coding method that.

2. The step of modeling the square root of the power spectrum envelope by linear prediction analysis comprises the steps of: performing a linear prediction analysis of the input audio signal; and obtaining an LPC cepstrum from the linear prediction parameters obtained thereby. 2. The audio signal transform coding according to claim 1, further comprising a step of dividing each coefficient of the LPC cepstrum by 2, and a step of obtaining a linear prediction parameter from the LPC cepstrum divided by 2 by a least square method. Method.

3. The step of obtaining the reciprocal of the square root of the power spectrum envelope from the inversely quantized linear prediction parameters includes performing a Fourier transform on the linear prediction coefficients obtained from the inversely quantized linear prediction parameters, and calculating an absolute value of the linear prediction coefficient. 3. The acoustic signal conversion encoding method according to claim 1, further comprising:

4. The step of obtaining the reciprocal of the square root of the power spectrum envelope from the inversely quantized linear prediction parameters includes the step of obtaining L based on the inversely quantized linear prediction parameters.
3. The acoustic signal conversion method according to claim 1, wherein a step of obtaining a spectrum on a unit circle for the SP parameter is performed.

5. The step of modeling the envelope representing the square root of the power spectrum envelope by linear predictive analysis comprises the steps of linear predictive analysis of the input audio signal and LPC based on linear predictive parameters obtained by the linear predictive analysis. A process of obtaining a cepstrum, a process of dividing each coefficient of the obtained LPC cepstrum by 2, and a process of dividing the same by 2 as encoding information for quantizing the LPC cepstrum to obtain the power spectrum envelope. The process of obtaining the reciprocal of the square root power spectrum envelope from the inverse quantized linear prediction parameters includes a process of inversely quantizing the quantized LPC cepstrum, a process of performing a Fourier transform on the dequantized LPC cepstrum, and a process of performing a Fourier transform thereof. Converting the converted output to exponential conversion for each sample. 2. The acoustic signal conversion encoding method according to claim 1.

6. The step of modeling the envelope representing the square root of the power spectrum envelope by linear predictive analysis, the step of performing linear predictive analysis on the input audio signal and the LPC cepstrum based on linear predictive parameters obtained by the linear predictive analysis. The process of quantizing the LPC cepstrum as coding information for obtaining the power spectrum envelope, and calculating the reciprocal of the square root power spectrum envelope from the inverse quantized linear prediction parameter is performed by the quantization LPC Dequantizing the cepstrum, Fourier transforming the dequantized LPC cepstrum, dividing the Fourier transform output by 2 for each sample, and exponentially transforming each value divided by 2 2. The method according to claim 1, further comprising the steps of:

7. A residual signal is obtained by dequantizing a first index in an input code, a square root of a power spectrum envelope is obtained from a second index in the input code, and the residual signal is denormalized to obtain a frequency. An audio signal conversion decoding method for obtaining a domain signal and converting the frequency domain signal into a time domain signal to obtain an audio signal, wherein a square root power spectrum envelope is obtained from the linear prediction parameter indicated by the second index or a parameter equivalent thereto. Obtaining the reciprocal of the above, and performing the inverse normalization by dividing the residual signal by the square root power spectrum envelope.

8. The process of obtaining the reciprocal of the square root power spectrum envelope is a process of obtaining a linear prediction coefficient from the second index, performing a Fourier transform on the linear prediction coefficient, and obtaining an absolute value of each sample. The method for transforming and decoding an audio signal according to claim 7, characterized in that:

9. The method according to claim 7, wherein the step of obtaining the reciprocal of the square root power spectrum envelope is a step of obtaining an LSP parameter from the second index and obtaining a spectrum on the unit circle for the LSP parameter. Audio signal conversion decoding method.

10. A residual signal is obtained by dequantizing a first index in an input code, a square root of a power spectrum envelope is obtained from a second index in the input code, and the residual signal is denormalized to obtain a frequency. A sound signal conversion decoding method for obtaining a domain signal and converting the frequency domain signal into a time domain signal to obtain a sound signal, wherein a corresponding impulse response is obtained from a linear prediction parameter indicated by the second index or a parameter equivalent thereto. Calculating the Fourier transform of the calculated impulse response to obtain a square root power spectrum, and multiplying the residual signal by the square root power spectrum to perform the denormalization. Characteristic audio signal conversion decoding method.

11. A residual signal is obtained by dequantizing a first index in an input code, a square root of a power spectrum envelope is obtained from a second index in the input code, and the residual signal is denormalized to obtain a frequency. In an audio signal conversion decoding method for obtaining an area signal and converting the frequency domain signal into a time domain signal to obtain an audio signal, an LPC cepstrum is obtained from a linear prediction parameter indicated by the second index or a parameter equivalent thereto. A Fourier transform of the LPC cepstrum, an exponential transform of each of the Fourier transformed samples to obtain an envelope of the square root power spectrum, and a step of multiplying the residual signal by the square root power spectrum. Performing the inverse normalization.

12. A residual signal is obtained by dequantizing a first index in an input code, a square root of a power spectrum envelope is obtained from a second index in the input code, and the residual signal is denormalized. In an audio signal transform decoding method for obtaining a frequency domain signal and converting the frequency domain signal into a time domain signal to obtain an audio signal, an LPC cepstrum is obtained from a linear prediction parameter indicated by the second index or a parameter equivalent thereto. Obtaining, Fourier-transforming the LPC cepstrum, dividing each of the Fourier-transformed samples by 2, exponentially transforming each of the samples divided by 2 to obtain an envelope of the square root power spectrum. And a step of multiplying the residual signal by the square root power spectrum to perform the denormalization. Acoustic signal conversion decoding method comprising.