JPH0969782A

JPH0969782A - Audio data encoding device

Info

Publication number: JPH0969782A
Application number: JP7246953A
Authority: JP
Inventors: Toru Chinen; 徹知念
Original assignee: Nippon Steel Corp
Current assignee: Nippon Steel Corp
Priority date: 1995-08-31
Filing date: 1995-08-31
Publication date: 1997-03-11
Also published as: KR970013791A

Abstract

PROBLEM TO BE SOLVED: To reduce the degradation of the tone quality at the time of decoding by not performing the clipping processing at the time of difference encoding of exponent part data. SOLUTION: With respect to mantissa part data and exponent part data calculated by a block floating-point conversion circuit 13, a smoothing circuit 14 which performs such correction that the difference of exponent part data between at least adjacent bands is put in a preliminarily determined range is provided. Exponent part data smoothed by the smoothing circuit 14 is subjected to difference encoding by a difference encoding circuit 15, and thereby, exponent part data is not clipped at the time of difference encoding of exponent part data, and error does not occur when an MDCT coefficient is reconstituted from mantissa part data and exponent part data on the decoder side.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明はオーディオデータの
圧縮装置に関し、特に、ディジタルオーディオデータを
高能率符号化するオーディオデータ符号化装置に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an audio data compression apparatus, and more particularly to an audio data encoding apparatus for highly efficiently encoding digital audio data.

【０００２】[0002]

【従来の技術】オーディオデータの高能率符号化には、
オーディオデータを時間軸上で複数の帯域に分割して符
号化する帯域分割符号化や、オーディオデータを周波数
軸上に直交変換し複数の帯域に分割して符号化する変換
符号化がある。また、これらを組み合わせて、時間軸上
で複数の帯域に分割し、各帯域信号をさらに周波数軸上
に直交変換して符号化する高能率符号化もある。2. Description of the Related Art For high efficiency encoding of audio data,
There are band division coding in which audio data is divided into a plurality of bands on the time axis for coding, and transform coding in which audio data is orthogonally transformed on the frequency axis and divided into a plurality of bands for coding. There is also high-efficiency coding in which these are combined and divided into a plurality of bands on the time axis, and each band signal is further subjected to orthogonal transform on the frequency axis for coding.

【０００３】従来技術の一例として、図４に示すＭＤＣ
Ｔ（モディファイド離散余弦変換）を用いた変換符号化
について説明する。オーディオデータ入力端子１より入
力されたオーディオデータは、例えば、５１２サンプル
を１つのブロックとして、ブロック単位で、窓がけ回路
２により窓がけが行われる。窓がけ回路２では、前ブロ
ックに現ブロックの５０％がオーバーラップ加算され、
その加算結果がＭＤＣＴ回路３に入力される。As an example of the prior art, the MDC shown in FIG.
Transform coding using T (Modified Discrete Cosine Transform) will be described. The audio data input from the audio data input terminal 1 is subjected to windowing by the windowing circuit 2 in block units, for example, 512 samples as one block. In the windowing circuit 2, 50% of the current block is overlap-added to the previous block,
The addition result is input to the MDCT circuit 3.

【０００４】ＭＤＣＴ回路３では、ＭＤＣＴにより直交
変換が行われ、その結果得られるＭＤＣＴ係数データが
ブロック浮動小数点変換回路４に入力される。ブロック
浮動小数点変換回路４では、ＭＤＣＴ回路３より入力さ
れる５１２個のＭＤＣＴ係数データのうち、例えば、零
周波数からの２４０個がいくつかの帯域、例えば、６４
帯域に分割され、１帯域につき１つの指数部データと複
数の仮数部データとから成るブロック浮動小数点データ
に変換される。このとき、１帯域内の係数は、例えば、
１個から８個で、低周波数から高周波数へと１帯域内の
係数の個数が増加するような帯域分割方法をとる。In the MDCT circuit 3, orthogonal transformation is performed by MDCT, and the resulting MDCT coefficient data is input to the block floating point transformation circuit 4. In the block floating point conversion circuit 4, of the 512 MDCT coefficient data input from the MDCT circuit 3, for example, 240 from the zero frequency are in some bands, for example, 64.
It is divided into bands and converted into block floating point data consisting of one exponent part data and a plurality of mantissa part data per band. At this time, the coefficient in one band is, for example,
A band division method is adopted in which the number of coefficients in one band increases from low frequency to high frequency with 1 to 8.

【０００５】指数部データ差分符号化回路５では、ブロ
ック浮動小数点変換回路４より入力される指数部データ
を差分符号化する。また、ビット割り当て回路６では、
ブロック浮動小数点変換回路４より入力される指数部デ
ータを基に、人の聴覚マスキング特性等を利用して、仮
数部データの量子化ビット長を決定し、それを量子化符
号化回路７に入力する。量子化符号化回路７では、ビッ
ト割り当て回路６から入力される量子化ビット長の分、
ブロック浮動小数点変換回路４より入力される仮数部デ
ータのＭＳＢからデータを取り出して量子化符号化を行
う。The exponent part data differential encoding circuit 5 differentially encodes the exponent part data input from the block floating point conversion circuit 4. Further, in the bit allocation circuit 6,
Based on the exponent part data input from the block floating point conversion circuit 4, the quantization bit length of the mantissa part data is determined by utilizing human auditory masking characteristics and the like, and is input to the quantization coding circuit 7. To do. In the quantization coding circuit 7, the quantization bit length input from the bit allocation circuit 6 is
Data is extracted from the MSB of the mantissa data input from the block floating point conversion circuit 4 and quantized and encoded.

【０００６】以上の処理により、第１のデータ出力端子
８から指数部差分符号化データが出力され、第２のデー
タ出力端子９から仮数部符号化データが出力される。こ
れらの出力されたデータは、記録媒体に蓄積されたり、
伝送路を介して送信されたりする。By the above processing, the exponent part differential encoded data is output from the first data output terminal 8 and the mantissa encoded data is output from the second data output terminal 9. These output data are stored in a recording medium,
It is transmitted via a transmission line.

【０００７】[0007]

【発明が解決しようとする課題】ところで、指数部デー
タの差分符号化では、情報量圧縮のために、指数部デー
タをそのまま符号化するのではなく、低周波数から高周
波数へと差分値を符号化している。これにより指数部デ
ータの情報量を圧縮している。By the way, in the differential encoding of the exponent part data, in order to compress the information amount, the exponent part data is not encoded as it is, but the difference value is encoded from the low frequency to the high frequency. It has become. This compresses the information amount of the exponent part data.

【０００８】しかしながら、一般に、指数部データの差
分符号化では、ある値以上の差分値をクリッピングする
処理、例えば、差分値が４以上のものを４にし、−４以
下のものを−４にするという処理を行っている。このク
リッピング処理のために、復号器側で仮数部データと指
数部データとからＭＤＣＴ係数を再構成する際に誤差が
生じてしまい、音質を劣化させてしまうという問題があ
った。However, in the differential encoding of exponent part data, in general, a process of clipping a differential value of a certain value or more, for example, a differential value of 4 or more is set to 4, and a differential value of -4 or less is set to -4. Is being processed. Due to this clipping process, an error occurs when reconstructing the MDCT coefficient from the mantissa data and exponent data on the decoder side, and there is a problem that the sound quality is deteriorated.

【０００９】そこで、本発明は、指数部データを差分符
号化する際にクリッピング処理が行われないようにし、
復号化時に音質の劣化が少なくなるようなオーディオデ
ータ符号化装置を提供することを目的とする。Therefore, the present invention prevents clipping processing from being performed when differentially encoding exponent data,
It is an object of the present invention to provide an audio data encoding device that reduces deterioration of sound quality during decoding.

【００１０】[0010]

【課題を解決するための手段】本発明のオーディオデー
タ符号化装置は、指数部データの差分符号化の前に、隣
接する帯域の指数部データの差が所定範囲内に収まるよ
うに、仮数部データと指数部データとの平滑化処理を行
う。According to the audio data encoding apparatus of the present invention, before the differential encoding of exponent data, the mantissa part is set so that the difference between exponent data of adjacent bands falls within a predetermined range. The data and the exponent part data are smoothed.

【００１１】上記のように構成した本発明によれば、仮
数部データと指数部データとの平滑化処理を行うので、
指数部データの差分符号化の際に、指数部差分データが
クリッピング処理されることがなくなる。これにより、
復号器側で仮数部データと指数部データとからＭＤＣＴ
係数を再構成する際に誤差を生じることがなくなり、音
質を劣化させることがない。According to the present invention configured as described above, since the mantissa part data and the exponent part data are smoothed,
When the differential encoding of the exponent part data is performed, the exponent part difference data is not clipped. This allows
MDCT from the mantissa data and exponent data on the decoder side
No error occurs when reconstructing the coefficients, and sound quality is not deteriorated.

【００１２】[0012]

【発明の実施の形態】以下、本発明による一実施形態
を、図１を用いて説明する。オーディオデータ入力端子
１０より入力されたオーディオデータは、例えば、５１
２サンプルを１つのブロックとして、ブロック単位で、
窓がけ回路１１により窓がけが行われる。窓がけ回路１
１では、前ブロックに現ブロックの５０％がオーバーラ
ップ加算され、その加算結果がＭＤＣＴ回路１２に入力
される。DETAILED DESCRIPTION OF THE INVENTION An embodiment according to the present invention will be described below with reference to FIG. The audio data input from the audio data input terminal 10 is, for example, 51
2 samples as one block, block by block,
Windowing is performed by the windowing circuit 11. Windowing circuit 1
In 1, 50% of the current block is overlap-added to the previous block, and the addition result is input to the MDCT circuit 12.

【００１３】ＭＤＣＴ回路１２では、ＭＤＣＴにより直
交変換が行われ、その結果得られるＭＤＣＴ係数データ
がブロック浮動小数点変換回路１３に入力される。ブロ
ック浮動小数点変換回路１３では、ＭＤＣＴ回路１２よ
り入力される５１２個のＭＤＣＴ係数データのうち、例
えば、零周波数からの２４０個がいくつかの帯域、例え
ば、６４帯域に分割され、１帯域につき１つの指数部デ
ータと複数の仮数部データとから成るブロック浮動小数
点データに変換される。In the MDCT circuit 12, orthogonal transformation is performed by MDCT, and the resulting MDCT coefficient data is input to the block floating point transformation circuit 13. In the block floating-point conversion circuit 13, of the 512 MDCT coefficient data input from the MDCT circuit 12, for example, 240 from the zero frequency are divided into several bands, for example, 64 bands, and 1 per 1 band. It is converted into block floating point data composed of one exponent part data and a plurality of mantissa part data.

【００１４】ここで、ブロック浮動小数点変換について
説明する。まず、１帯域内のＭＤＣＴ係数データの絶対
値の中で最大値を見つける。次に、その最大値データＦ
を次式のように表現する。 F= M×2 exp (-N) ただし、 F:浮動小数点データ M:仮数部データ、 0.5≦M ＜1 , -1≦M ＜-0.5 N:指数部データ、正の整数The block floating point conversion will be described below. First, the maximum value is found among the absolute values of MDCT coefficient data within one band. Next, the maximum value data F
Is expressed as follows. F = M × 2 exp (-N) where F: Floating point data M: Mantissa part data, 0.5 ≦ M <1, -1 ≦ M <-0.5 N: Exponent part data, positive integer

【００１５】そして、最大値データＦの指数部 2exp(-
N) で１帯域内の他のＭＤＣＴ係数データを割って、そ
れらを仮数部データとする。この処理によって、１帯域
につき１つの指数部データと、１帯域内のＭＤＣＴ係数
データの個数に等しい数の仮数部データとが算出され
る。The exponent part 2exp (-of the maximum value data F is
NMD) divides the other MDCT coefficient data in one band to make them mantissa data. By this processing, one exponent part data for one band and mantissa part data of the number equal to the number of MDCT coefficient data in one band are calculated.

【００１６】このとき、１帯域に含まれる係数データの
個数は、例えば１個から８個で、低周波数から高周波数
へと１帯域内の係数の個数が２ｎ単位に増加する帯域分
割方法をとる。例えば、低周波数から順に、１帯域内の
係数が１個の帯域が１６個、１帯域内の係数が２個の帯
域が１６個、１帯域内の係数が４個の帯域が１６個、１
帯域内の係数が８個の帯域が１６個というふうにであ
る。この場合、指数部データは１６×４＝６４個、仮数
部データは１６×（１＋２＋４＋８）＝２４０個にな
る。この場合の帯域分割の様子を図２に示す。At this time, the number of coefficient data included in one band is, for example, 1 to 8, and a band division method is adopted in which the number of coefficients in one band increases from a low frequency to a high frequency in units of 2n. . For example, from low frequencies, 16 bands each having one coefficient in one band, 16 bands each having two coefficients in one band, 16 bands each having four coefficients in one band, 1
That is, there are 8 coefficients in the band and 16 in the band. In this case, the exponent part data is 16 × 4 = 64, and the mantissa part data is 16 × (1 + 2 + 4 + 8) = 240. The state of band division in this case is shown in FIG.

【００１７】次に、平滑化回路１４では、ブロック浮動
小数点変換回路１３より入力される仮数部データと指数
部データとを平滑化する。以下に、平滑化の方法につい
て説明する。一例として、差分符号化回路１５では、差
分値のR 以上のものをR にし、−R 以下のものを−R に
丸める処理を行うものとする。Next, the smoothing circuit 14 smoothes the mantissa part data and the exponent part data input from the block floating point conversion circuit 13. The smoothing method will be described below. As an example, it is assumed that the differential encoding circuit 15 performs a process of rounding a differential value greater than or equal to R and rounding a differential value less than or equal to −R.

【００１８】また、以下の説明のために、ブロック浮動
小数点変換回路１３より入力される６４個の指数部デー
タを、低周波数から高周波数へと順に、 N(x),x=0,
1,...,63とする。また、２４０個の仮数部データを、低
周波数から高周波数へと順に、 M(x,y),x=0,1,...,63,y
=0,1,...,L；ただし，x=0,..,15 のときL=0 ， x=1
6,..,31 のときL=1 ，x=32,..,47のときL=3 ，x=48,..,
63のときL=7 とする。Further, for the following description, the 64 exponent part data input from the block floating point conversion circuit 13 are N (x), x = 0, in order from low frequency to high frequency.
Let's say 1, ..., 63. In addition, 240 mantissa data are M (x, y), x = 0,1, ..., 63, y in order from low frequency to high frequency.
= 0,1, ..., L; However, when x = 0, .., 15, L = 0, x = 1
When 6, .., 31, L = 1, x = 32, .., 47, L = 3, x = 48, ..,
When 63, L = 7.

【００１９】ここで、仮数部データM(x,y)のx は、図２
の周波数の低い方から数えた何番目の帯域かを示す変数
であり、y はその帯域内で何番目の係数かを示す変数で
ある。例えば、図２の周波数の低い方から数えて３５番
目の帯域はx=35と表せる。このとき、この帯域には１つ
の帯域に４つの係数データが含まれているので、y は１
〜４の値をとることができる。Here, x of the mantissa data M (x, y) is as shown in FIG.
Is a variable indicating which band is counted from the lowest frequency of, and y is a variable indicating which coefficient is in that band. For example, the 35th band from the lowest frequency in FIG. 2 can be expressed as x = 35. At this time, since one band includes four coefficient data in this band, y is 1
It can take a value of ~ 4.

【００２０】このように、仮数部データを変数x,y で表
すと決めたときに、平滑化は、次に示す（１）〜（３）
式に従い、帯域x=0,1,..,63 の順に行われる。As described above, when it is determined that the mantissa data is represented by the variables x and y, smoothing is performed by the following (1) to (3).
According to the formula, the band x = 0,1 ,.

【００２１】N(x) > N(x+k) + k×R （１）の場合には、k 帯域離れた指数部データ間の値に k×R
以上の差が有ることになる。すなわち、k=1 の場合は隣
接する帯域間の指数部データの差を表すので、隣接する
帯域間にクリッピング処理の閾値であるR 以上の差が有
ることになる。In the case of N (x)> N (x + k) + k × R (1), k × R is set to the value between the exponent data separated by k bands.
There will be the above differences. That is, when k = 1, the difference in the exponent part data between adjacent bands is represented, and therefore there is a difference of R or more, which is the threshold value of clipping processing, between adjacent bands.

【００２２】また、k=2 の場合は両隣の帯域よりも更に
１つ離れた帯域との指数部データの差を表すので、この
差が 2×R 以上有ることは、２つ先の帯域との間で隣接
する帯域間にクリッピング処理の閾値であるR 以上の差
が必ず発生することを示していることになる。In the case of k = 2, the difference in the exponent part data from the band further apart from the adjacent bands on both sides is shown. Therefore, the difference of 2 × R or more means that the difference is 2 × R. This means that a difference equal to or greater than the threshold R of the clipping process always occurs between adjacent bands.

【００２３】同様に、k=ｉの場合は、両隣の帯域よりも
更にｉ-1離れた帯域との指数部データの差を表すので、
この差がｉ×R 以上有ることは、ｉだけ先の帯域との間
で隣接する２つの帯域間にクリッピング処理の閾値であ
るR 以上の差が発生する部分が存在することを示してい
ることになる。Similarly, in the case of k = i, the difference in the exponent part data from the band i−1 further apart than the bands on both sides is expressed.
The fact that this difference is i × R or more indicates that there is a portion in which a difference of R or more, which is the threshold value of the clipping process, occurs between two bands that are adjacent to the band that is ahead by i. become.

【００２４】図３に示すように、帯域１から見て、帯域
２との差Δ１がR 以下であったとしても、次の帯域２と
帯域３との差Δ２がR 以上であった場合には、帯域２と
帯域３との間だけで補正を行うよりも、帯域１と帯域３
の間、もしくは更に先の帯域４との間で補正を行った方
が、エンベロープをなだらかにできる。As shown in FIG. 3, even if the difference Δ1 from the band 1 to the band 2 is R or less as viewed from the band 1, if the difference Δ2 from the next band 2 to the band 3 is R or more, Is more effective than band 1 and band 3 rather than band 2 and band 3 only.
The envelope can be made smoother by performing the correction between the band 4 and the band 4 further ahead.

【００２５】そこで、本実施形態では、両隣だけではな
く、更に先の帯域間でのクリッピング処理を予測して、
指数部データの差がクリッピング処理の閾値であるR 以
下になるように、以下のようにして指数部データを事前
に補正する。Therefore, in the present embodiment, the clipping process is predicted not only on both sides, but also between bands further ahead,
The exponent part data is corrected in advance as follows so that the difference between the exponent part data becomes equal to or less than the threshold value R of the clipping process.

【００２６】 M(x,y) = M(x,y) × ( 2×exp(N(x+k) + k×R - N(x))), y=0,..,L （２） N(x) = N(x+k) + k×R （３）M (x, y) = M (x, y) × (2 × exp (N (x + k) + k × R-N (x))), y = 0, .., L (2 ) N (x) = N (x + k) + k × R (3)

【００２７】上記した（１）〜（３）式について詳しく
説明する。まず（１）式によって、平滑化する指数部デ
ータN(x)と、 k帯域離れた指数部データN(x+k)に k×R
を加えた値N(x+k) + k×R とを比較している。これによ
り、指数部データの差分符号化の際にクリッピング処理
の対象となる指数部データを予め検出している。The above formulas (1) to (3) will be described in detail. First, according to Eq. (1), the exponent data N (x) to be smoothed and the exponent data N (x + k) separated by k bands are k × R.
Is compared with the value N (x + k) + k × R. As a result, the exponent part data to be clipped is detected in advance when the differential encoding of the exponent part data is performed.

【００２８】上記（１）式を満足した場合には、（３）
式に示すように、平滑化する指数部データN(x)を、k 帯
域離れた指数部データN(x+k)に k×R を加えた値N(x+k)
+ k×R で置き換える。これにより、指数部データの差
分符号化の際にクリッピング処理の対象とならないよう
にすることができる。When the above equation (1) is satisfied, (3)
As shown in the equation, the exponent data N (x) to be smoothed is the value N (x + k) obtained by adding k × R to the exponent data N (x + k) separated by k bands.
Replace with + k × R. As a result, it is possible to prevent clipping processing from being the target of differential encoding of exponent part data.

【００２９】このとき、指数部データが置き換わるの
で、仮数部データもそれにあわせて修正する必要があ
る。これを（２）式によって行っている。（２）式で
は、k 帯域離れた指数部データN(x+k)に k×R を加えた
値N(x+k) + k×R からN(x)を減じた値(2×exp(N(x+k) +
k×R - N(x)))を、x 帯域すべての仮数部に乗じてい
る。At this time, since the exponent part data is replaced, the mantissa part data also needs to be corrected accordingly. This is done by the equation (2). In equation (2), the value obtained by subtracting N (x) from the value N (x + k) + k × R obtained by adding k × R to the exponent data N (x + k) separated by k bands (2 × exp (N (x + k) +
k × R-N (x))) multiplied by the mantissa of all x bands.

【００３０】これは、k 帯域離れた指数部データに許容
される差を超える部分をx 帯域の仮数部データで吸収し
ていることになる。実際の計算順序としては、（３）式
により指数部N(x)を置き換える前に、（２）式により仮
数部M(x,y)を修正しておかなくてはならない。This means that the mantissa data of the x band absorbs a portion exceeding the difference allowed for the exponent data separated by the k band. As an actual calculation order, the mantissa part M (x, y) must be corrected by the expression (2) before the exponent part N (x) is replaced by the expression (3).

【００３１】上記の平滑化処理では、x 帯域の指数部デ
ータN(x)に対して、k 帯域離れた指数部データN(x+k)
を、どこまで比較の対象とするか、すなわち、k の値を
１からいくつまで変化させるかが課題となる。In the above smoothing processing, the exponent data N (x + k) apart from the exponent data N (x) in the x band by k bands
The problem is how much is to be compared, that is, the value of k is changed from 1 to how many.

【００３２】x 帯域の指数部データN(x)を他のすべての
指数部データと比較して平滑化処理を行えば、差分符号
化の際にクリッピング処理の対象となるものは完全にな
くなる。しかし、その場合の計算量は膨大で、実際の符
号化処理には不向きなものとなる。そこで、本実施形態
では、 k=-2,-1,1,2だけを平滑化の対象範囲として計算
量を軽減させている。If the smoothing process is performed by comparing the exponent part data N (x) of the x band with all other exponent part data, there will be completely no clipping process target in the differential encoding. However, the amount of calculation in that case is enormous, which is unsuitable for actual encoding processing. Therefore, in the present embodiment, only k = -2, -1,1,2 is set as the target range for smoothing to reduce the calculation amount.

【００３３】音声信号を入力とする場合には、各指数部
データにはかなり相関があり、急激な変化が起きないた
め、ほとんどの場合、k=1,2 の平滑化処理によりクリッ
ピング処理の対象となることをなくすことができる。When a voice signal is input, the exponent data are highly correlated and do not change abruptly. Therefore, in most cases, the smoothing process of k = 1,2 is the target of the clipping process. Can be eliminated.

【００３４】次に、指数部データ差分符号化回路１５で
は、上記平滑化回路１４より入力される指数部平滑化デ
ータの差分値を算出し、算出した-RからR までの値域を
持つ指数部差分データを適当な符号に置き換えて符号化
する。また、ビット割り当て回路１６では、平滑化回路
１４より入力される指数部平滑化データを基に、人間の
聴覚マスキング特性等を利用して、仮数部データの量子
化ビット長を決定し、それを量子化符号化回路１７に入
力する。Next, the exponent part data difference encoding circuit 15 calculates the difference value of the exponent part smoothed data input from the smoothing circuit 14 and calculates the exponent part having a value range from -R to R. The difference data is encoded by replacing it with an appropriate code. Further, the bit allocation circuit 16 determines the quantized bit length of the mantissa data based on the exponential part smoothed data input from the smoothing circuit 14 by utilizing human auditory masking characteristics and the like. It is input to the quantization coding circuit 17.

【００３５】量子化符号化回路１７では、ビット割り当
て回路１６から入力される量子化ビット長の分、平滑化
回路１４より入力される仮数部平滑化データのＭＳＢか
らデータを取り出して量子化符号化を行う。そして、第
１のデータ出力端子１８から指数部差分符号化データが
出力され、第２のデータ出力端子１９から仮数部符号化
データが出力される。これらの出力されたデータは、記
録媒体に蓄積されたり、伝送路を介して送信されたりす
る。In the quantization coding circuit 17, data corresponding to the quantization bit length inputted from the bit allocation circuit 16 is taken out from the MSB of the mantissa smoothed data inputted from the smoothing circuit 14 and is quantization coded. I do. Then, the first data output terminal 18 outputs the exponential part difference encoded data, and the second data output terminal 19 outputs the mantissa part encoded data. These output data are stored in a recording medium or transmitted via a transmission path.

【００３６】[0036]

【実施例】以下、具体的な数値を用いて上述した平滑化
を説明する。ここでは、K=1 、N(x)=8、M(x,y)=0.8、N
(x+1)=2、R=4 とする。EXAMPLES The above-mentioned smoothing will be described below using specific numerical values. Here, K = 1, N (x) = 8, M (x, y) = 0.8, N
(x + 1) = 2 and R = 4.

【００３７】この場合、まず、帯域x の指数部データN
(x)と帯域x+1 の指数部データN(x+1)との差が６である
ので、クリッピング処理の閾値である R=4を超えている
ことになる。実際に（１）式に各値を代入すると、次式
のようになる。 8 > 2 + 1×4In this case, first, the exponent part data N of the band x
Since the difference between (x) and the exponent part data N (x + 1) of the band x + 1 is 6, it means that R = 4, which is the threshold value of the clipping process, is exceeded. Actually substituting each value into the equation (1) gives the following equation. 8> 2 + 1 x 4

【００３８】このように、（１）式の比較条件を満足す
るので、（２）式および（３）式の平滑化処理を行う。
すなわち、（２）式に各値を代入して、次式を得る。 M(x,y) = 0.8×(2×exp(2 + 1×4 - 8)) = 0.8×(2×exp(-2)) = 0.8×0.25 = 0.2As described above, since the comparison condition of the expression (1) is satisfied, the smoothing processing of the expressions (2) and (3) is performed.
That is, each value is substituted into the equation (2) to obtain the following equation. M (x, y) = 0.8 × (2 × exp (2 + 1 × 4-8)) = 0.8 × (2 × exp (-2)) = 0.8 × 0.25 = 0.2

【００３９】また、（３）式に各値を代入して、次式を
得る。 N(x) = 2 + 1×4 = 6Further, by substituting each value into the equation (3), the following equation is obtained. N (x) = 2 + 1 × 4 = 6

【００４０】これらの平滑化された仮数部データM(x,y)
=0.2と指数部データN(x)=6とから再構成したＭＤＣＴ係
数データ 0.2×2exp(-6) = 0.003125 は、平滑化前の仮
数部データM(x,y)=0.8と指数部データN(x)=8とから再構
成したＭＤＣＴ係数データ 0.8×2exp(-8) = 0.003125
と一致し、誤差が生じないことがわかる。These smoothed mantissa data M (x, y)
= 0.2 and exponent part data N (x) = 6, MDCT coefficient data 0.2 × 2exp (-6) = 0.003125 is the mantissa part data before smoothing M (x, y) = 0.8 and exponent part data MDCT coefficient data reconstructed from N (x) = 8 0.8 × 2 exp (-8) = 0.003125
It is found that there is no error.

【００４１】[0041]

【発明の効果】以上のように本発明によれば、指数部デ
ータの差分符号化の前に、仮数部データと指数部データ
との平滑化処理を行うようにしているので、指数部デー
タの差分符号化の際に、指数部差分データがクリッピン
グ処理されることをなくすことができる。これにより、
復号器側で仮数部データと指数部データとからＭＤＣＴ
係数を再構成する際に誤差が生じることをなくすことが
でき、音質の劣化を防止することができる。この場合、
実際の計算に適した形で平滑化の対象範囲を狭くして
も、音声信号を入力した場合の指数部データの相関性に
よって、復号器側でＭＤＣＴ係数データを再構成する際
の誤差をほとんどなくすことができる。As described above, according to the present invention, the smoothing process of the mantissa part data and the exponent part data is performed before the differential encoding of the exponent part data. It is possible to prevent the exponential part difference data from being clipped during the differential encoding. This allows
MDCT from the mantissa data and exponent data on the decoder side
It is possible to prevent an error from occurring when reconstructing the coefficient, and it is possible to prevent deterioration of sound quality. in this case,
Even if the smoothing target range is narrowed in a form suitable for actual calculation, there is almost no error when reconstructing MDCT coefficient data on the decoder side due to the correlativity of exponent data when a voice signal is input. It can be lost.

[Brief description of drawings]

【図１】本発明の一実施形態であるＭＤＣＴを用いたオ
ーディオデータ符号化装置の構成例を示すブロック図で
ある。FIG. 1 is a block diagram showing a configuration example of an audio data encoding device using MDCT according to an embodiment of the present invention.

【図２】ＭＤＣＴ係数の帯域分割例を示す図である。FIG. 2 is a diagram showing an example of band division of MDCT coefficients.

【図３】異なる帯域間における指数部データの差を説明
するための説明図である。FIG. 3 is an explanatory diagram for explaining a difference in exponent part data between different bands.

【図４】従来のＭＤＣＴを用いたオーディオデータ符号
化装置の構成例を示すブロック図である。FIG. 4 is a block diagram showing a configuration example of an audio data encoding device using a conventional MDCT.

【符号の説明】１０オーディオデータ入力端子１１窓がけ回路１２ＭＤＣＴ回路１３ブロック浮動小数点変換回路１４平滑化回路１５差分符号化回路１６ビット割り当て回路１７量子化符号化回路１８第１のデータ出力端子１９第２のデータ出力端子[Explanation of Codes] 10 Audio Data Input Terminal 11 Windowing Circuit 12 MDCT Circuit 13 Block Floating Point Conversion Circuit 14 Smoothing Circuit 15 Differential Encoding Circuit 16 Bit Allocation Circuit 17 Quantization Encoding Circuit 18 First Data Output Terminal 19 Second data output terminal

Claims

[Claims]

1. A windowing circuit for windowing audio data, and an MD for the data windowed by the windowing circuit.
MDCT circuit for orthogonal transformation by CT, block floating point conversion circuit for performing block floating point conversion on MDCT coefficient data orthogonally transformed by the MDCT circuit, and mantissa part data and exponent part data calculated by the block floating point conversion circuit And a smoothing circuit for correcting the difference of the exponent data so that the difference between the exponent data is at least between adjacent bands, and the exponent data smoothed by the smoothing circuit is subjected to a difference sign. Exponent part data differential encoding circuit for converting, bit allocation circuit for allocating bits by exponential part smoothed data calculated by the smoothing circuit, and quantization bit length of mantissa part data calculated by the bit allocation circuit And the mantissa part calculated by the smoothing circuit according to the band to which the mantissa part data belongs A smoothing data is quantized and coded, and a quantization coding circuit is provided. In the smoothing circuit, the mantissa part data and the exponent part data are smoothed, and the smoothed exponent part data is differentially coded. An audio data encoding device characterized by: