JP2001094432A

JP2001094432A - Sub-band coding and decoding method

Info

Publication number: JP2001094432A
Application number: JP26442699A
Authority: JP
Inventors: Yutaka Banba; 裕番場; Sadahiro Gomi; 貞博五味
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1999-09-17
Filing date: 1999-09-17
Publication date: 2001-04-06

Abstract

PROBLEM TO BE SOLVED: To provide a sub-band coding and decoding method for a digital audio signal that reduces a delay amount while maintaining its compression rate and to enhance decoding reproduction quality. SOLUTION: A sub-band division processing section 101 applies band split filtering to a received PCM audio signal to decimate the signal. A table value introduction section 106 introduces a table value from a scale factor obtained by a scale factor extract section 104 and a sub-band combining and processing section 107 calculates a combined waveform based on the table value of a preceding scale factor. An energy or maximum value calculation section 108 calculates a maximum value or energy of the combined waveform. A bit assignment processing section 110 calculates a sample assignment bit quantity based on a masking level calculated by an psychroacoustic model 109 and a quantization processing section 111 quantizes a sub-band signal. A frame forming section 112 forms a sign bit, a Huffman code and a sample into a bit stream and outputs the result. Since the processing is conducted with a very short frame, the delay quantity is small.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、サブバンド符号化
・復号方法に関し、特に、デジタルオーディオ信号を圧
縮伸張するサブバンド符号化・復号方法に関する。The present invention relates to a sub-band encoding / decoding method, and more particularly to a sub-band encoding / decoding method for compressing and expanding a digital audio signal.

【０００２】[0002]

【従来の技術】デジタルオーディオ信号を圧縮伸張する
サブバンド符号化・復号方法として、ＭＰＥＧ１オーデ
ィオ符号化(ISO/IEC 11172-3)が国際標準として規格化
されている。図４に、ＭＰＥＧ１layer1オーディオ符号
化のエンコーダ401の基本構造を示す。図４を参照し
て、従来のＭＰＥＧ１layer1オーディオ符号化方法を説
明する。2. Description of the Related Art MPEG1 audio encoding (ISO / IEC 11172-3) has been standardized as an international standard as a subband encoding / decoding method for compressing and expanding digital audio signals. FIG. 4 shows a basic structure of an encoder 401 for MPEG1 layer1 audio encoding. A conventional MPEG1 layer1 audio encoding method will be described with reference to FIG.

【０００３】デジタルオーディオ信号がエンコーダ401
に入力されると、サブバンド分析フィルタバンク402に
おいて、帯域分割フィルタリング及び間引き処理が行わ
れる。一方、入力デジタルオーディオ信号は、ＦＦＴ部
403にて周波数分析がなされ、周波数分析情報をもと
に、聴覚心理モデル405及び動的ビット割当部407よっ
て、適応的にサブバンドサンプルに対してビット割当数
が決定される。また、サブバンド分析フィルタバンク40
2の出力から、スケールファクタ値がスケールファクタ
抽出部404にて算出され、線形量子化器406にてサブバン
ド分析フィルタバンクの出力が量子化される。また、ス
ケールファクタやビット割当情報などは、サイド情報と
してサイド情報符号化部409にて符号化される。線形量
子化器406の出力であるサンプルと、サイド情報符号化
部409の出力であるサイド情報符号に、アンシラリデー
タを組み合わせて、ビットストリームがビットストリー
ム形成部408で生成される。A digital audio signal is transmitted from an encoder 401.
, The sub-band analysis filter bank 402 performs band division filtering and thinning processing. On the other hand, the input digital audio signal is
At 403, frequency analysis is performed, and the psychoacoustic model 405 and the dynamic bit allocation unit 407 adaptively determine the number of allocated bits for the subband samples based on the frequency analysis information. The sub-band analysis filter bank 40
From the output of 2, the scale factor value is calculated by the scale factor extraction unit 404, and the output of the sub-band analysis filter bank is quantized by the linear quantizer 406. Also, the scale factor, the bit allocation information, and the like are encoded by the side information encoding unit 409 as side information. A bit stream is generated by a bit stream forming unit 408 by combining ancillary data with a sample output from the linear quantizer 406 and a side information code output from the side information encoding unit 409.

【０００４】ビットストリームの構成を図６に示す。ビ
ットストリームは、ヘッダ601、アロケーション情報60
2、スケールファクタ603、サンプル604で構成されてい
る。このビットストリームを受信しオーディオデータを
復号するＭＰＥＧ１layer1オーディオ復号を行うデコー
ダを、図５に示す。図５を参照して、従来のＭＰＥＧ１
layer1オーディオ復号方法を説明する。FIG. 6 shows the structure of a bit stream. The bit stream is composed of a header 601, allocation information 60
2. It is composed of a scale factor 603 and a sample 604. FIG. 5 shows a decoder that receives the bit stream and decodes the audio data and performs MPEG1 layer 1 audio decoding. Referring to FIG. 5, a conventional MPEG1
The layer1 audio decoding method will be described.

【０００５】デコーダ501では、通信路または蓄積メデ
ィアを通して送られてきたストリーム情報を受信し、ビ
ットストリーム分解部502にてストリームが分解され
る。さらに、サイド情報としてスケールファクタ、ビッ
ト割当情報がサイド情報復号部503にて抽出され、逆量
子化部504にてサブバンド信号は逆量子化され、サブバ
ンド合成フィルタバンク505にて逆量子化されたサブバ
ンド信号は、もとのデジタルオーディオ信号に合成され
出力される。[0005] The decoder 501 receives the stream information sent through the communication path or the storage medium, and the bit stream decomposing unit 502 decomposes the stream. Further, a scale factor and bit allocation information are extracted as side information in the side information decoding unit 503, the subband signal is inversely quantized in the inverse quantization unit 504, and inversely quantized in the subband synthesis filter bank 505. The sub-band signal is synthesized with the original digital audio signal and output.

【０００６】[0006]

【発明が解決しようとする課題】しかしながら、従来の
サブバンド符号化・復号方法では、フレーム分析のため
のデジタルオーディオ信号のバッファリングが生じ、バ
ッファリング量が多いと、リアルタイム性が要求される
拡声システムなどでは、無視できない遅延が発生すると
いう問題があった。However, in the conventional subband encoding / decoding method, buffering of the digital audio signal for frame analysis occurs, and if the amount of buffering is large, real-time performance is required. In systems, there is a problem that non-negligible delay occurs.

【０００７】本発明は、上記従来の問題を解決するもの
で、デジタルオーディオ信号のサブバンド符号化・復号
方法において、圧縮率を維持したままで、遅延量を少な
くし、かつ復号再生品質を良好にしたサブバンド符号化
・復号方法を提供することを目的とする。SUMMARY OF THE INVENTION The present invention solves the above-mentioned conventional problems. In a method for encoding / decoding a sub-band of a digital audio signal, the delay amount is reduced and the decoding / reproduction quality is improved while maintaining a compression rate. It is an object of the present invention to provide a subband encoding / decoding method.

【０００８】[0008]

【課題を解決するための手段】上記の課題を解決するた
めに、本発明では、入力デジタルオーディオ信号をサブ
バンド分割したサブバンド信号より求めたサブバンド毎
のスケールファクタからテーブル値を導出し、導出され
たテーブル値と現フレーム以前の複数のフレームのスケ
ールファクタ値より導出されたテーブル値とを合成フィ
ルタリングして求めた波形のエネルギーまたは最大値を
もとにサンプルビット割当情報を算出し、サンプルビッ
ト割当情報を基にサブバンド信号を量子化してサンプル
値を算出するサブバンド符号化方法を、現フレームのス
ケールファクタと前フレームのサブバンド毎のスケール
ファクタの差分を求め、差分の絶対値のハフマン符号
と、差分の正負を示すサインビットと、サンプル値とに
よって構成されるストリームを出力する構成とした。According to the present invention, a table value is derived from a scale factor for each subband obtained from a subband signal obtained by dividing an input digital audio signal into subbands. Calculate sample bit allocation information based on the energy or maximum value of the waveform obtained by combining and filtering the derived table value and the table value derived from the scale factor values of a plurality of frames before the current frame, and A subband encoding method of quantizing a subband signal based on bit allocation information to calculate a sample value is performed by calculating a difference between a scale factor of a current frame and a scale factor of each subband of a previous frame, and calculating an absolute value of the difference. A code consisting of a Huffman code, a sign bit indicating whether the difference is positive or negative, and a sample value. Configured to output a stream.

【０００９】このように構成したことにより、フレーム
の時間的な長さがサブバンド分割数分のデジタルオーデ
ィオ信号分の時間となり、低遅延でかつ高品質なデジタ
ルオーディオ信号の復号が可能となる。さらに、エンコ
ーダから出力されるビットストリーム中のサンプル割当
情報を排除して、サンプルに割り当てるビット量を増加
するとともに、間引きされた波形部分のエネルギーや最
大値を考慮したビット割当量が算出できる。[0009] With this configuration, the temporal length of the frame is the time corresponding to the number of digital audio signals corresponding to the number of sub-band divisions, and a high-quality digital audio signal with low delay can be decoded. Furthermore, the sample allocation information in the bit stream output from the encoder is excluded, the bit amount allocated to the sample is increased, and the bit allocation amount in consideration of the energy and the maximum value of the thinned waveform portion can be calculated.

【００１０】[0010]

【発明の実施の形態】以下、本発明の実施の形態につい
て、図１〜図３を参照しながら詳細に説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, embodiments of the present invention will be described in detail with reference to FIGS.

【００１１】（第１の実施の形態）本発明の第１の実施
の形態は、サブバンド毎のスケールファクタから導出し
たテーブル値と、以前のスケールファクタ値より導出し
たテーブル値とを合成フィルタリングし、求めた波形の
エネルギーまたは最大値をもとに算出したサンプルビッ
ト割当情報を基に、サブバンド信号を量子化してサンプ
ル値を算出し、サブバンド毎のスケールファクタの差分
の絶対値のハフマン符号と、差分の正負を示すサインビ
ットと、サンプル値とからなるストリームを出力するサ
ブバンドエンコーダである。(First Embodiment) In a first embodiment of the present invention, a table value derived from a scale factor for each subband and a table value derived from a previous scale factor value are synthesized and filtered. Calculates the sample value by quantizing the sub-band signal based on the sample bit allocation information calculated based on the energy or the maximum value of the obtained waveform, and calculates the Huffman code of the absolute value of the difference between the scale factors for each sub-band. And a sub-band encoder that outputs a stream consisting of a sign value indicating the sign of the difference and a sample value.

【００１２】図１は、本発明の第１の実施の形態におけ
るサブバンドエンコーダの機能ブロック図である。図１
において、サブバンド分割処理部101は、入力されたデ
ジタルオーディオ信号を帯域分割フィルタリングし、間
引き処理を行う手段であり、分割数Ｍ個の帯域分割フィ
ルタと、Ｍ個のダウンサンプラから構成されている。ス
ケールファクタ抽出部104は、ダウンサンプリングされ
たサブバンド信号の倍率を表すスケールファクタを、サ
ブバンド毎に抽出する手段である。差分値抽出部103
は、前フレームのスケールファクタ値に対しての差分値
を算出し、その絶対値をとる手段である。サインビット
決定部102は、差分の正負の符号を求める手段である。
ハフマン符号化部105は、差分値抽出部103で算出された
スケールファクタ差分の絶対値を、ハフマン符号化する
手段である。テーブル値導出部106は、スケールファク
タよりテーブル値を導出する手段である。サブバンド合
成処理部107は、導出されたテーブル値と現フレーム以
前のスケールファクタより導出されたテーブル値をもと
に、サブバンド合成処理を行い、合成波形を算出する手
段である。エネルギーまたは最大値算出部108は、合成
波形よりフレーム内での合成波形の最大値またはエネル
ギーを算出する手段である。聴覚心理モデル109は、算
出された最大値またはエネルギーをもとに、聴覚心理に
基づいた量子化誤差のマスキングレベルを算出する手段
である。ビット割当処理部110は、サブバンド毎のサン
プル割当ビット数を算出する手段である。量子化処理部
111は、サンプル割当ビット数を基に、間引きされたサ
ブバンド信号をサンプル情報として量子化する手段であ
る。フレーム形成部112は、アンシラリデータ、サイン
ビット、ハフマン符号、サンプルを、ストリームに形成
する手段である。FIG. 1 is a functional block diagram of a sub-band encoder according to the first embodiment of the present invention. FIG.
, The sub-band division processing unit 101 is a unit that performs band division filtering on an input digital audio signal and performs thinning processing, and is configured by M division band filters and M down samplers. . The scale factor extraction unit 104 is a means for extracting a scale factor representing a magnification of a down-sampled sub-band signal for each sub-band. Difference value extraction unit 103
Is a means for calculating a difference value with respect to the scale factor value of the previous frame and taking the absolute value thereof. The sign bit determination unit 102 is a means for obtaining the sign of the difference.
The Huffman encoding unit 105 is a means for Huffman encoding the absolute value of the scale factor difference calculated by the difference value extracting unit 103. The table value deriving unit 106 is means for deriving a table value from a scale factor. The subband synthesis processing unit 107 is a unit that performs subband synthesis processing based on the derived table value and the table value derived from the scale factor before the current frame, and calculates a synthesized waveform. The energy or maximum value calculation unit 108 is means for calculating the maximum value or the energy of the combined waveform in the frame from the combined waveform. The psychoacoustic model 109 is means for calculating a masking level of a quantization error based on psychoacoustics based on the calculated maximum value or energy. The bit allocation processing section 110 is a means for calculating the number of sample allocated bits for each subband. Quantization processing section
Reference numeral 111 denotes means for quantizing the thinned subband signal as sample information based on the number of bits allocated to the sample. The frame forming unit 112 is a means for forming ancillary data, sign bits, Huffman codes, and samples into a stream.

【００１３】図２は、本発明の第１の実施の形態におけ
るサブバンドエンコーダから出力されるストリームの構
成を示す図である。FIG. 2 is a diagram showing a configuration of a stream output from the subband encoder according to the first embodiment of the present invention.

【００１４】以上のように構成された本発明の第１の実
施の形態におけるサブバンドエンコーダの動作を、図１
を参照しながら説明する。まず、分割数分のデジタルオ
ーディオ信号が、サブバンドエンコーダに入力される。
Ｍ分割フィルタバンクを用いている場合は、Ｍ個のＰＣ
Ｍ信号が入力される。入力信号は、サブバンド分割処理
部101にて帯域分割フィルタリングされ、Ｍサンプル毎
に間引きされた信号となって出力される。このとき、各
サブバンドフィルタ出力のフレームあたりの間引きされ
たサブバンド信号は、１サンプルとなっている。The operation of the subband encoder according to the first embodiment of the present invention configured as described above will be described with reference to FIG.
This will be described with reference to FIG. First, digital audio signals for the number of divisions are input to the sub-band encoder.
If an M-divided filter bank is used, M PCs
The M signal is input. The input signal is subjected to band division filtering in the sub-band division processing unit 101, and is output as a signal decimated every M samples. At this time, the thinned subband signal per frame of each subband filter output is one sample.

【００１５】この各サブバンド１サンプルの出力を、ス
ケールファクタ抽出部104にてスケーリングし、スケー
ルファクタを求める。各サブバンド分のＭ個のスケール
ファクタは、テーブル値導出部106に入力され、スケー
ルファクタにより決定されるテーブル値が導出される。
テーブル値は、現フレーム以前のスケールファクタより
導出されたテーブル値と、サブバンド合成処理部107に
おいてサブバンド合成処理が行われ、合成波形が算出さ
れる。The output of one sample of each subband is scaled by a scale factor extracting section 104 to obtain a scale factor. The M scale factors for each subband are input to table value deriving section 106, and a table value determined by the scale factor is derived.
The table value and the table value derived from the scale factor before the current frame and the subband synthesis processing are performed in the subband synthesis processing unit 107 to calculate a synthesized waveform.

【００１６】エネルギーまたは最大値算出部108では、
新しく生成されたフレーム分の合成波形部の最大値また
はエネルギーを算出し、聴覚心理モデル109にて、聴覚
心理に基づいた量子化誤差のマスキングレベルが計算さ
れる。このマスキングレベルをもとに、各サブバンド信
号のサンプル割当ビット量が、ビット割当処理部110に
て計算され、間引きされたサブバンド信号は、量子化処
理部111にて、サンプル割当ビット量に基づいて量子化
される。フレーム形成部112では、サインビットと、ハ
フマン符号、サンプルと、場合によりアンシラリデータ
を、ビットストリームに成形し出力する。このサブバン
ド符号化方法では、サンプル割当ビット数が増加し、か
つ非常に短いフレームで処理を行うため、遅延量が少な
い。In the energy or maximum value calculation unit 108,
The maximum value or energy of the synthesized waveform portion for the newly generated frame is calculated, and the masking level of the quantization error based on the psychoacoustic is calculated by the psychoacoustic model 109. Based on this masking level, the sample allocation bit amount of each subband signal is calculated by the bit allocation processing unit 110, and the decimated subband signal is converted into the sample allocation bit amount by the quantization processing unit 111. Quantized based on The frame forming section 112 shapes the sign bit, the Huffman code, the sample, and possibly the ancillary data into a bit stream and outputs it. In this subband encoding method, the number of bits allocated to the sample is increased and processing is performed in a very short frame, so that the amount of delay is small.

【００１７】図２に、このエンコーダから出力されるス
トリームの構成を示す。フレーム毎のハフマン符号化さ
れたスケールファクタ差分の絶対値201と、フレーム毎
のスケールファクタ差分の正負を示すサインビット202
と、サブバンド分のサンプル情報より構成されている。FIG. 2 shows the structure of a stream output from the encoder. The absolute value 201 of the Huffman-encoded scale factor difference for each frame and a sign bit 202 indicating the sign of the scale factor difference for each frame
And the sample information for the subband.

【００１８】上記のように、本発明の第１の実施の形態
では、サブバンドエンコーダを、サブバンド毎のスケー
ルファクタから導出したテーブル値と、以前のスケール
ファクタ値より導出したテーブル値とを合成フィルタリ
ングして求めた波形のエネルギーまたは最大値をもとに
算出したサンプルビット割当情報を基にサブバンド信号
を量子化してサンプル値を算出し、サブバンド毎のスケ
ールファクタの差分の絶対値のハフマン符号と、差分の
正負を示すサインビットと、サンプル値とからなるスト
リームを出力する構成としたので、サンプル割当ビット
数が増加し、低遅延で低ビットレートでサブバンド符号
化できる。As described above, in the first embodiment of the present invention, the subband encoder combines the table value derived from the scale factor of each subband with the table value derived from the previous scale factor value. The sample value is calculated by quantizing the subband signal based on the sample bit allocation information calculated based on the energy or the maximum value of the waveform obtained by filtering, and the Huffman of the absolute value of the difference of the scale factor difference for each subband is calculated. Since a stream composed of a code, a sign bit indicating the sign of the difference, and a sample value is output, the number of bits allocated to the sample is increased, and subband encoding can be performed at a low bit rate with a low delay.

【００１９】（第２の実施の形態）本発明の第２の実施
の形態は、スケールファクタ差分の絶対値のハフマン符
号とその増減符号とサンプルからなるストリームを受信
し、スケールファクタから導出したテーブル値と、以前
のフレームのテーブル値をサブバンド合成し、エネルギ
ーまたは最大値を求めてビット割当をするサブバンドデ
コーダである。(Second Embodiment) In a second embodiment of the present invention, a Huffman code of the absolute value of a scale factor difference, a stream composed of an increase / decrease code thereof, and a sample are received, and a table derived from the scale factor is received. This is a sub-band decoder that performs sub-band synthesis on the values and the table values of the previous frame to obtain the energy or the maximum value and allocate bits.

【００２０】図３は、本発明の第２の実施の形態におけ
るサブバンドデコーダの機能ブロック図である。図３に
おいて、アンシラリデータ抽出部301は、入力されたビ
ットストリームからアンシラリデータの抽出を行う手段
である。ハフマン復号部303は、受信したビットストリ
ームよりサブバンド分のハフマン符号を復号し、前フレ
ームのスケールファクタ値との差分の絶対値を算出する
手段である。サインビット抽出部302は、アンシラリデ
ータとハフマン符号抽出後のストリームより、前フレー
ムのスケールファクタ増減を示すサインビットを抽出す
る手段である。スケールファクタ算出部309は、前フレ
ームのスケールファクタ差分の絶対値と、サインビッ
ト、及び前フレームのスケールファクタ値より現フレー
ムのスケールファクタ値を算出する手段である。テーブ
ル値導出部308は、現フレームのスケールファクタ値よ
り、テーブル値を算出する手段である。サブバンド合成
処理部307は、算出されたテーブル値と現フレーム以前
のテーブル値をもとに、サブバンド合成処理を行い、合
成波形を算出する手段である。エネルギーまたは最大値
算出部306は、合成波形より新たに生成されたフレーム
区間での合成波形の最大値またはエネルギーを算出する
手段である。聴覚心理モデル305は、算出された最大値
またはエネルギーより、聴覚心理に基づいた量子化誤差
のマスキングレベルを算出する手段である。ビット割当
処理部304は、マスキングレベルをもとに、各サブバン
ド信号に割り当てられているサンプルのビット割当数を
算出する手段である。サンプル値抽出部310は、アンシ
ラリデータ、ハフマン符号、増減サインビットが除かれ
たストリームから、ビット割当情報をもとに、サンプル
値の抽出処理を行う手段である。逆量子化部311は、ス
ケールファクタ及び、サンプル値より、サブバンド信号
を逆量子化する手段である。サブバンド合成処理部312
は、逆量子化されたサブバンド信号の合成フィルタリン
グを行ない、デジタルオーディオ信号を出力する手段で
ある。FIG. 3 is a functional block diagram of a sub-band decoder according to the second embodiment of the present invention. In FIG. 3, an ancillary data extraction unit 301 is a means for extracting ancillary data from an input bit stream. The Huffman decoding unit 303 is means for decoding the Huffman code for the subband from the received bit stream and calculating the absolute value of the difference from the scale factor value of the previous frame. The sign bit extracting unit 302 is means for extracting a sign bit indicating the increase or decrease of the scale factor of the previous frame from the stream after extracting the ancillary data and the Huffman code. The scale factor calculation unit 309 is means for calculating the scale factor value of the current frame from the absolute value of the scale factor difference of the previous frame, the sign bit, and the scale factor value of the previous frame. The table value deriving unit 308 is means for calculating a table value from the scale factor value of the current frame. The subband synthesis processing unit 307 is a unit that performs subband synthesis processing based on the calculated table value and the table value before the current frame, and calculates a synthesized waveform. The energy or maximum value calculation unit 306 is means for calculating the maximum value or energy of the composite waveform in a frame section newly generated from the composite waveform. The psychoacoustic model 305 is means for calculating a masking level of a quantization error based on psychoacoustic from the calculated maximum value or energy. The bit allocation processing unit 304 is means for calculating the number of bits allocated to the samples allocated to each subband signal based on the masking level. The sample value extracting section 310 is a means for extracting a sample value from the stream from which the ancillary data, the Huffman code, and the increase / decrease sign bit have been removed, based on the bit allocation information. The inverse quantization unit 311 is a unit that inversely quantizes the subband signal based on the scale factor and the sample value. Subband synthesis processing unit 312
Is means for performing synthesis filtering of the dequantized sub-band signal and outputting a digital audio signal.

【００２１】以上のように構成された本発明の第２の実
施の形態におけるサブバンドデコーダの動作を、図３を
参照しながら説明する。まず、１フレーム分のビットス
トリームが、サブバンドデコーダに入力され、アンシラ
リデータが、アンシラリデータ抽出部301より抽出され
る。アンシラリデータを含まない場合は、アンシラリデ
ータ抽出部301を省略してもよい。さらに、アンシラリ
データ抽出後のストリームは、ハフマン復号部303に送
られ、ここで、前フレームに対するスケールファクタの
差分の絶対値が算出される。ハフマン符号抽出後のスト
リームデータは、サインビット抽出部302にて、前フレ
ームとのスケールファクタ値の増減を示すサインビット
が抽出される。The operation of the subband decoder according to the second embodiment of the present invention configured as described above will be described with reference to FIG. First, a bit stream for one frame is input to the subband decoder, and ancillary data is extracted by the ancillary data extraction unit 301. When ancillary data is not included, the ancillary data extraction unit 301 may be omitted. Further, the stream from which the ancillary data has been extracted is sent to the Huffman decoding unit 303, where the absolute value of the difference between the scale factor and the previous frame is calculated. From the stream data after the extraction of the Huffman code, a sign bit indicating an increase or decrease in the scale factor value from the previous frame is extracted by a sign bit extracting unit 302.

【００２２】スケールファクタ差分の絶対値と、サイン
ビット及び前フレームのスケールファクタ値をもとに、
スケールファクタ算出部309にて、現フレームのスケー
ルファクタが算出される。テーブル値導出部308では、
算出されたスケールファクタをもとにテーブル値が導出
され、現フレーム以前のテーブル値とから、サブバンド
合成処理部307にて合成波形が算出される。この波形
で、フレーム区間分の新しく生成された波形のエネルギ
ーまたは最大値が、エネルギーまたは最大値算出部306
において算出される。Based on the absolute value of the scale factor difference, the sign bit and the scale factor value of the previous frame,
Scale factor calculation section 309 calculates the scale factor of the current frame. In the table value derivation unit 308,
A table value is derived based on the calculated scale factor, and a subband synthesis processing unit 307 calculates a composite waveform from the table values before the current frame. In this waveform, the energy or maximum value of the newly generated waveform for the frame section is calculated by the energy or maximum value calculation unit 306.
Is calculated.

【００２３】聴覚心理モデル305では、この最大値また
はエネルギーをもとに、聴覚心理に基づいた量子化誤差
のマスキングレベルを算出する。ビット割当処理部304
では、聴覚心理モデル305の情報を基に、各サブバンド
サンプルに割り当てられているビット量を算出する。ア
ンシラリデータとハフマン符号と増減サインビットを抽
出した後のストリームデータは、サンプル値抽出部310
に入力され、ビット割当処理部304からのビット割当情
報を基に、各サブバンド分のサンプル値が抽出される。The psychoacoustic model 305 calculates a masking level of a quantization error based on psychoacoustic based on the maximum value or the energy. Bit allocation processing unit 304
Then, based on the information of the psychoacoustic model 305, the bit amount allocated to each subband sample is calculated. The stream data after extracting the ancillary data, the Huffman code, and the increase / decrease sign bit are sampled
, And a sample value for each subband is extracted based on the bit allocation information from the bit allocation processing unit 304.

【００２４】スケールファクタ値とサンプル値を基に、
逆量子化部311にてサブバンド信号が逆量子化される。
この逆量子化サブバンド信号は、サブバンド合成処理部
312にてＰＣＭデジタルデータに合成され、デコーダの
出力となる。Based on the scale factor value and the sample value,
The sub-band signal is inversely quantized by the inverse quantization unit 311.
This inversely quantized subband signal is supplied to a subband synthesis processing unit.
At 312, it is combined with the PCM digital data and becomes the output of the decoder.

【００２５】上記のように、本発明の第２の実施の形態
では、サブバンドデコーダを、スケールファクタ差分の
絶対値のハフマン符号とその増減符号とサンプルからな
るストリームを受信し、スケールファクタから導出した
テーブル値と、以前のフレームのテーブル値をサブバン
ド合成し、エネルギーまたは最大値を求めてビット割当
をする構成としたので、サンプル割当ビット量を増や
し、低遅延で低ビットレートでサブバンド復号すること
ができる。As described above, in the second embodiment of the present invention, the sub-band decoder receives the Huffman code of the absolute value of the scale factor difference, the stream composed of the increase / decrease code thereof, and the sample, and derives from the scale factor. The sub-band is synthesized by combining the table value obtained from the previous frame and the table value of the previous frame to determine the energy or the maximum value, thereby increasing the number of bits allocated to the sample, and performing sub-band decoding at a low bit rate with low delay. can do.

【００２６】[0026]

【発明の効果】以上の説明から明らかなように、本発明
では、入力デジタルオーディオ信号をサブバンド分割し
たサブバンド信号より求めたサブバンド毎のスケールフ
ァクタからテーブル値を導出し、導出されたテーブル値
と現フレーム以前の複数のフレームのスケールファクタ
値より導出されたテーブル値とを合成フィルタリングし
て求めた波形のエネルギーまたは最大値をもとにサンプ
ルビット割当情報を算出し、サンプルビット割当情報を
基にサブバンド信号を量子化してサンプル値を算出する
サブバンド符号化方法を、現フレームのスケールファク
タと前フレームのサブバンド毎のスケールファクタの差
分を求め、差分の絶対値のハフマン符号と、差分の正負
を示すサインビットと、サンプル値とによって構成され
るストリームを出力する構成としたので、ストリームに
ビット割当情報を割り当てる必要がなくなり、さらにサ
ンプル割当情報が増加して、低ビットレートでも高品質
低遅延で復号再生できるという効果が得られる。As is apparent from the above description, according to the present invention, a table value is derived from a scale factor for each subband obtained from a subband signal obtained by dividing an input digital audio signal into subbands. The sample bit allocation information is calculated based on the energy or the maximum value of the waveform obtained by combining and filtering the value and the table value derived from the scale factor values of a plurality of frames before the current frame. A sub-band encoding method of quantizing the sub-band signal based on the basis to calculate a sample value, obtains a difference between a scale factor of a current frame and a scale factor of each sub-band of a previous frame, and a Huffman code of an absolute value of the difference, A stream consisting of a sign value indicating the sign of the difference and a sample value is output. Since the configuration and the to, there is no need to allocate bit allocation information in the stream, further the sample allocation information is increased, the effect is obtained that decoding can be reproduced with high quality low delay even at low bit rates.

[Brief description of the drawings]

【図１】本発明の第１の実施の形態におけるサブバンド
エンコーダの機能ブロック図、FIG. 1 is a functional block diagram of a subband encoder according to a first embodiment of the present invention;

【図２】本発明の第１の実施の形態におけるサブバンド
エンコーダの出力ビットストリームの構成図、FIG. 2 is a configuration diagram of an output bit stream of a subband encoder according to the first embodiment of the present invention;

【図３】本発明の第２の実施の形態におけるサブバンド
デコーダの機能ブロック図、FIG. 3 is a functional block diagram of a subband decoder according to a second embodiment of the present invention;

【図４】従来のＭＰＥＧ１layer1エンコーダの機能ブロ
ック図、FIG. 4 is a functional block diagram of a conventional MPEG1 layer1 encoder;

【図５】従来のＭＰＥＧ１layer1デコーダの機能ブロッ
ク図、FIG. 5 is a functional block diagram of a conventional MPEG1 layer1 decoder;

【図６】従来のＭＰＥＧ１layer1エンコーダの出力ビッ
トストリームの構成図である。FIG. 6 is a configuration diagram of an output bit stream of a conventional MPEG1 layer1 encoder.

[Explanation of symbols]

101 サブバンド分割処理部 102 サインビット決定部 103 差分値抽出部 104 スケールファクタ抽出部 105 ハフマン符号化部 106 テーブル値導出部 107 サブバンド合成処理部 108 エネルギーまたは最大値算出部 109 聴覚心理モデル 110 ビット割当処理部 111 量子化処理部 112 フレーム形成部 201 フレーム毎のハフマン符号化されたスケールファ
クタの絶対値 202 フレーム毎のスケールファクタ差分の正負を示す
サインビット 203 サブバンド分のサンプル情報 301 アンシラリデータ抽出部 302 サインビット抽出部 303 ハフマン復号部 304 ビット割当処理部 305 聴覚心理モデル 306 エネルギーまたは最大値算出部 307 サブバンド合成処理部 308 テーブル値導出部 309 スケールファクタ算出部 310 サンプル値抽出部 311 逆量子化部 312 サブバンド合成処理部 401 エンコーダ 402 サブバンド分析フィルタバンク 403 ＦＦＴ部 404 スケールファクタ抽出部 405 聴覚心理モデル 406 線形量子化器 407 動的ビット割当部 408 ビットストリーム形成部 409 サイド情報符号化部 501 デコーダ 502 ビットストリーム分解部 503 サイド情報復号 504 逆量子化 505 サブバンド合成フィルタバンク 601 ヘッダ 602 アロケーション情報 603 スケールファクタ 604 サンプル101 Subband division processing unit 102 Sign bit determination unit 103 Difference value extraction unit 104 Scale factor extraction unit 105 Huffman coding unit 106 Table value derivation unit 107 Subband synthesis processing unit 108 Energy or maximum value calculation unit 109 Auditory psychological model 110 bits Allocation processing unit 111 Quantization processing unit 112 Frame formation unit 201 Huffman-coded absolute value of scale factor for each frame 202 Sign bit 203 indicating sign of scale factor difference for each frame 203 Sample information for subbands 301 Ancillary data Extraction unit 302 Sign bit extraction unit 303 Huffman decoding unit 304 Bit allocation processing unit 305 Psychoacoustic model 306 Energy or maximum value calculation unit 307 Subband synthesis processing unit 308 Table value derivation unit 309 Scale factor calculation unit 310 Sample value extraction unit 311 Inverse Quantization unit 312 Subband synthesis processing unit 401 Encoder 40 2 Subband analysis filter bank 403 FFT unit 404 Scale factor extraction unit 405 Psychological psychology model 406 Linear quantizer 407 Dynamic bit allocation unit 408 Bitstream formation unit 409 Side information encoding unit 501 Decoder 502 Bitstream decomposition unit 503 Side information Decoding 504 Inverse quantization 505 Subband synthesis filter bank 601 Header 602 Allocation information 603 Scale factor 604 samples

Claims

[Claims]

1. A table value is derived from a scale factor for each subband obtained from a subband signal obtained by dividing an input digital audio signal into subbands, and the derived table value and a scale factor of a plurality of frames before a current frame are derived. Calculate sample bit allocation information based on the energy of the waveform obtained by combining and filtering the table value derived from the value, and calculate the sample value by quantizing the sub-band signal based on the sample bit allocation information. In the sub-band encoding method, the difference between the scale factor of the current frame and the scale factor of each sub-band of the previous frame is obtained, the Huffman code of the absolute value of the difference, a sign bit indicating the sign of the difference, and the sample Output a stream composed of values and Band encoding method.

2. A table value is derived from a sub-band scale factor obtained from a sub-band signal obtained by dividing an input digital audio signal into sub-bands. The derived table value and a scale factor of a plurality of frames before a current frame are derived. Sample bit allocation information is calculated based on the maximum value of the waveform obtained by performing synthesis filtering with the table value derived from the value, and the subband signal is quantized based on the sample bit allocation information to obtain a sample value. In the subband encoding method to calculate, the difference between the scale factor of the current frame and the scale factor of each subband of the previous frame is obtained, the Huffman code of the absolute value of the difference, a sign bit indicating the sign of the difference, Outputting a stream composed of sample values Coding method.

3. A stream generated by the sub-band encoding method according to claim 1, wherein an absolute value of a difference between a scale factor value of a previous frame and a scale factor value of a current frame for each sub-band is Huffman-decoded. Ask
A Huffman-decoded value extracts a sign bit indicating whether the difference between the scale factors of the subbands other than 0 is positive or negative, and calculates a scale factor value for each subband of the current frame,
Deriving a table value, calculating sample bit allocation information based on the energy of the waveform obtained by performing synthesis filtering on the table value derived from the scale factor values of a plurality of frames before the current frame, and obtaining sample bit allocation information A sub-band decoding method comprising: extracting a sample from a stream based on a sub-band, dequantizing and obtaining a sub-band signal from a scale factor and a sample, and reproducing a digital audio signal through a sub-band synthesis filter.

4. A stream generated by the sub-band encoding method according to claim 2, wherein an absolute value of a difference between a scale factor value of a previous frame and a scale factor value of a current frame for each sub-band is Huffman-decoded. Ask
A Huffman-decoded value extracts a sign bit indicating whether the difference between the scale factors of the subbands other than 0 is positive or negative, and calculates a scale factor value for each subband of the current frame,
A table value is derived, and sample bit allocation information is calculated based on the maximum value of the waveform obtained by combining and filtering the table value derived from the scale factor value of a plurality of frames before the current frame and the sample bit allocation. A subband decoding method, comprising extracting a sample from a stream based on information, dequantizing and obtaining a subband signal from the scale factor and the sample, and reproducing the digital audio signal through a subband synthesis filter.