JP2001249699A

JP2001249699A - Sound compression device

Info

Publication number: JP2001249699A
Application number: JP2000062273A
Authority: JP
Inventors: Hirokuni Marumori; 宏晋丸森; Tsutomu Imai; 勉今井
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2000-03-07
Filing date: 2000-03-07
Publication date: 2001-09-14

Abstract

PROBLEM TO BE SOLVED: To improve sound quality by almost averaging the bit distribution number of each group in which sub-bands are divided, in layer-II mode of MPEG1 audio. SOLUTION: A scale factor(SF) is generated by normalizing audio data divided by a sub-band analysis part 2 in every plural sub-bands by a scaling part 3. A scale factor(SF) selection information calculating part 4 makes SF common in 3-frame coding units for each sub-band. A bit distribution table selection part 9 selects a bit distribution table corresponding to the number of the common SF from a bit distribution table part 8, and a bit distribution allocation part 10 allocates the bit distribution number of this bit distribution table, so that the sub-bands are divided into groups of different frequency bands and the bit distribution number of each group is almost averaged, and a quantization part 5 quantizes the data in coded units for each normalized sub-band according to this new bit distribution table.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、ＤＶＤ（Digital
Versatile Disc）カメラやＤＶ（デジタルビデオ）カメ
ラ，電子スチルカメラ，電話機などに用いる音声圧縮装
置に関する。The present invention relates to a DVD (Digital)
The present invention relates to an audio compression device used for a Versatile Disc (Digital Video) camera, a DV (Digital Video) camera, an electronic still camera, a telephone, and the like.

【０００２】[0002]

【従来の技術】音声データの圧縮方式としては、従来、
例えば、ISO/IEC標準方式（ISO/IEC 11172-3:1993(E)）
に準拠して圧縮符号化するＭＰＥＧ１オーディオが知ら
れている。人間の聴覚心理的な特性を利用してディジタ
ル音声データの高能率な圧縮を実現するものである。2. Description of the Related Art Conventionally, as a compression method for audio data,
For example, ISO / IEC standard method (ISO / IEC 11172-3: 1993 (E))
MPEG1 audio which is compression-encoded in conformity with the standard is known. The present invention realizes highly efficient compression of digital audio data by utilizing human psychoacoustic characteristics.

【０００３】図６はこの標準方式に基づく音声圧縮装置
を示すブロック図であって、１は音声圧縮装置、２はサ
ブバンド分析部、３はスケーリング部、４はスケールフ
ァクタ選択情報算出部、７は聴覚心理処理部、５は量子
化部、６はビットストリーム生成部である。なお、ここ
では、符号化モードをレイヤIIとする。FIG. 6 is a block diagram showing an audio compression apparatus based on this standard system, wherein 1 is an audio compression apparatus, 2 is a subband analysis section, 3 is a scaling section, 4 is a scale factor selection information calculation section, 7 Denotes an psychoacoustic processing unit, 5 denotes a quantization unit, and 6 denotes a bit stream generation unit. Here, the encoding mode is layer II.

【０００４】同図において、図示しないＡ／Ｄ変換回路
から出力されるディジタル音声データＡ_INはサブバンド
分析部２に供給される。このサブバンド分析部２では、
このディジタル音声データＡ_INを１２サンプル単位でデ
ィジタルフィルタ処理することにより、３２個のサブバ
ンドに細分化し、３２個の異なるサブバンドのオーディ
オデータを出力する。ＭＰＥＧ１オーディオでは、この
１２サンプル／３２サブバンド（即ち、１２×３２＝３
８４サンプル）を１フレームとするが、レイヤIIモード
の場合、サブバンド毎に、連続する３フレームずつまと
めて符号化される。かかる３フレーム単位が単独で復号
できる最小の単位AAU（Audio Access Unit：オーディオ
復号単位）である。しかし、ここでは、オーディオデー
タを符号化するものであるから、便宜上、オーディオ符
号化単位ということにする。In FIG. 1, digital audio data A _IN output from an A / D conversion circuit (not shown) is supplied to a sub-band analyzer 2. In this sub-band analyzer 2,
The digital audio data A _IN is digitally filtered in units of 12 samples, thereby dividing it into 32 sub-bands and outputting audio data of 32 different sub-bands. In MPEG1 audio, this 12 samples / 32 subbands (ie, 12 × 32 = 3)
(84 samples) is regarded as one frame, but in the case of the layer II mode, three consecutive frames are encoded collectively for each subband. The three-frame unit is the smallest unit AAU (Audio Access Unit) that can be decoded independently. However, in this case, since audio data is encoded, it is referred to as an audio encoding unit for convenience.

【０００５】３２個のサブバンドのオーディオデータ
は、スケーリング部３に供給される。このスケーリング
部３は、１フレーム（１２サンプル）毎に、各サブバン
ドのオーディオデータの波形を、その最大振幅値のサン
プルが同じ規定の振幅となるように、正規化し、また、
この正規化したオーディオデータを復号時にもとの波形
に戻すための倍率（ｄＢ）をスケールファクタ(Scale F
actor)として生成する。ＭＰＥＧオーディオでは、上記
のISO/IEC標準方式によると、スケールファクタが値０
〜６２の６３個の６ビットからなるインデックス（Inde
x）で規定され、これらインデックスに＋６ｄＢ〜−１
１８ｄＢの範囲で約２ｄＢ間隔のスケールファクタが対
応付けられており、スケールファクタが６ビットのイン
デックスという番号で表わされている。従って、以下で
は、かかるインデックスをスケールファクタの番号と
し、ＳＦ番号ということにするが、ＳＦ番号が小さいス
ケールファクタほどその値が大きい。スケーリング部３
は、各サブバンドについて、１フレーム単位のオーディ
オデータを上記のように正規化する際に、上記のように
ISO/IEC標準方式で規定されているスケールファクタの
うちの最も適応するスケールファクタをＳＦ番号で表わ
し、これをこのサブバンドのスケールファクタとするも
のである。[0005] Audio data of 32 sub-bands is supplied to a scaling unit 3. The scaling unit 3 normalizes the waveform of the audio data of each sub-band so that the sample of the maximum amplitude value has the same specified amplitude for each frame (12 samples).
The scaling factor (dB) for returning the normalized audio data to the original waveform at the time of decoding is represented by a scale factor (Scale F).
actor). In MPEG audio, according to the above-mentioned ISO / IEC standard method, the scale factor has a value of 0.
Index consisting of 63 6-bits (Inde)
x), these indexes have +6 dB to -1
A scale factor of about 2 dB is associated with a range of 18 dB, and the scale factor is represented by a number called a 6-bit index. Therefore, in the following, such an index is referred to as a scale factor number and is referred to as an SF number. The smaller the SF number, the larger the scale factor. Scaling unit 3
Is used to normalize the audio data of one frame unit for each subband as described above.
The most suitable scale factor among the scale factors defined in the ISO / IEC standard system is represented by an SF number, and this is used as the scale factor of this subband.

【０００６】このようにして得られた各サブバンドの正
規化されたオーディオデータは量子化部５に、また、ス
ケールファクタはスケールファクタ選択情報算出部４に
夫々供給される。[0006] The thus-normalized audio data of each sub-band is supplied to the quantization unit 5, and the scale factor is supplied to the scale factor selection information calculation unit 4.

【０００７】このスケールファクタ選択情報算出部４
は、各サブバンド毎に、オーディオ符号化単位となる連
続する３フレームのオーディオ符号化単位でのスケール
ファクタの変化をＳＦ番号を用いて検出し、その変化量
が規定よりも少ないときには、これら連続するフレーム
間でスケールファクタを共通化するものである。例え
ば、いま、オーディオ符号化単位をなす３フレームのＳ
Ｆ番号を順にＳＦ１，ＳＦ２，ＳＦ３とし、ＳＦ１から
ＳＦ２への変化量が規定範囲内にあるとすると、このオ
ーディオ符号化単位では、ＳＦ２のスケールファクタは
使用されず、ＳＦ１，ＳＦ３の２つのスケールファクタ
が使用されることになる。ここで、「使用される」と
は、符号化されたスケールファクタがオーディオストリ
ームＡ_STに付加されて出力され、また、このオーディオ
ストリームＡ_STの復号に使用されるということである。
この場合には、ＳＦ１のスケールファクタを、復号時、
ＳＦ２の代わりに用いるものであって、ＳＦ１のスケー
ルファクタを本来ＳＦ２のスケールファクタを用いて復
号再生されるフレームにも、このＳＦ２のスケールファ
クタの代わりに、ＳＦ１のスケールファクタを割り当て
る共通化を行なうものである。かかる共通化の状態を示
すために、スケーリング部３では、さらに、スケールフ
ァクタがどのように共通化されたかを示すスケールファ
クタ選択情報（SCFSI：SCALEFACTOR SELECTION IMFORMA
TION)が形成される。このスケールファクタ選択情報SCF
SIも、共通化されたスケールファクタ（ＳＦ）ととも
に、量子化部５に供給される。The scale factor selection information calculation section 4
Detects, for each sub-band, a change in scale factor in an audio coding unit of three consecutive frames, which is an audio coding unit, by using an SF number. This is to make the scale factor common among the frames to be processed. For example, now, S frames of three frames forming an audio coding unit
Assuming that the F numbers are SF1, SF2, and SF3 in order, and the amount of change from SF1 to SF2 is within a specified range, in this audio coding unit, the scale factor of SF2 is not used, and the two scales of SF1 and SF3 are used. The factor will be used. Here, “used” means that the encoded scale factor is added to the audio stream A _ST and output, and is used for decoding the audio stream A _ST .
In this case, the scale factor of SF1 is
It is used in place of SF2, and performs commonality of assigning the scale factor of SF1 instead of the scale factor of SF2 to a frame in which the scale factor of SF1 is originally decoded and reproduced using the scale factor of SF2. Things. In order to indicate such a state of commonality, the scaling unit 3 further includes scale factor selection information (SCFSI: SCALEFACTOR SELECTION IMFORMA) indicating how the scale factors are standardized.
TION) is formed. This scale factor selection information SCF
The SI is also supplied to the quantization unit 5 together with the common scale factor (SF).

【０００８】なお、３つのスケールファクタ間の変化量
が大きく、スケールファクタの共通化が行なわれないオ
ーディオ符号化単位のサブバンドに対しても、そのこと
を示す３ビットのスケールファクタ選択情報SCFSIが形
成される。[0008] Even for a sub-band of an audio coding unit in which the amount of change between the three scale factors is large and the scale factors are not shared, the 3-bit scale factor selection information SCFSI indicating the same is also provided. It is formed.

【０００９】そして、この共通化されたスケールファク
タ（ＳＦ番号）とスケールファクタ選択情報SCFSIとに
より、サブバンドで実際に使用されるスケールファクタ
を表わしていることになり、これが後述のビットストリ
ーム生成部６に供給される。The scale factor (SF number) and the scale factor selection information SCFSI represent the scale factor actually used in the subband, and this is represented by a bit stream generation unit described later. 6.

【００１０】以上のように、共通化処理がなされたスケ
ールファクタ（ＳＦ番号）とスケールファクタ選択情報
SCFSIとは、ビットストリーム生成部６に供給される。As described above, the scale factor (SF number) and the scale factor selection information subjected to the common processing
The SCFSI is supplied to the bit stream generator 6.

【００１１】一方、聴覚心理処理部７は、連続する３フ
レームのオーディオ符号化単位毎に、入力されるディジ
タル音声データＡ_IN から人間の聴覚心理を利用して、
次の量子化部５で量子化する際のサブバンド毎のビット
数を配分するビット配分テーブル（アロケーション：Al
location）を生成する。従って、同じサブバンドでも、
オーディオ符号化単位毎に配分されるビット数が異なる
場合がある。このような人間の聴覚心理を利用してビッ
ト数を配分する場合には、静寂時の最小可聴限界やこれ
と入力ディジタル音声データＡ_INのサンプルとから決ま
るマスキング・スレッショルド（実際に原音を聞いてい
るときのノイズなどの不要音の知覚可能な最小限界レベ
ル）から量子化部５で量子化するサブバンドの量子化ビ
ット数を配分したものであって（以下、このようにサブ
バンドに配分する量子化ビット数をビット配分数とい
う）、この場合、量子化ノイズのレベルがこのマスキン
グ・スレッショルドよりも小さくなるように、量子化ス
テップが決められる。これにより、量子化ノイズによる
音質劣化を防止することができる。入力されるオーディ
オ符号化単位のディジタル音声データＡ_INこのマスキン
グ・スレッショルドよりも小さいサブバンドでは、配分
する量子化ビット数が０（アロケーション＝０）とされ
る。このビット配分テーブルは、量子化部５とビットス
トリーム生成部６とに供給される。On the other hand, the psychoacoustic processing unit 7 uses the psychoacoustics of a human from the input digital audio data A _IN for each three consecutive audio coding units.
A bit allocation table (allocation: Al) for allocating the number of bits for each subband when performing quantization in the next quantization unit 5
location). Therefore, even in the same subband,
The number of bits allocated to each audio coding unit may be different. In the case of allocating the number of bits using such a human auditory psychology, the minimum audible limit during silence and a masking threshold determined by this and the sample of the input digital audio data A _IN (by actually listening to the original sound) The number of quantization bits of the sub-band to be quantized by the quantization unit 5 is distributed from the minimum perceptible level of unnecessary sound such as noise when the sub-band is quantized (hereinafter, distributed to the sub-bands in this manner). The number of quantization bits is referred to as a bit allocation number). In this case, the quantization step is determined so that the level of the quantization noise is smaller than the masking threshold. As a result, sound quality degradation due to quantization noise can be prevented. Digital audio data A _IN of an input audio coding unit In a sub-band smaller than the masking threshold, the number of quantization bits to be allocated is set to 0 (allocation = 0). This bit allocation table is supplied to the quantization unit 5 and the bit stream generation unit 6.

【００１２】量子化部５は、オーディオ符号化単位毎
に、各サブバンドのオーディオデータを量子化し、符号
化する。この量子化は、３フレーム／３２個のサブバン
ドの各サンプルを所定の順番で行ない、各サンプルのビ
ット配分数は、聴覚心理処理部７で形成されたビット配
分テーブルによってこれが属するサブバンドに配分され
た量子化ビット数である。この量子化されたサンプルか
らなるオーディオ符号化単位のオーディオデータは、ビ
ットストリーム生成部６に供給される。The quantization unit 5 quantizes and encodes audio data of each subband for each audio encoding unit. This quantization is performed for each sample of 3 frames / 32 sub-bands in a predetermined order, and the bit allocation number of each sample is allocated to the sub-band to which it belongs according to the bit allocation table formed by the psychoacoustic processor 7. Is the number of quantized bits. The audio data of the audio coding unit including the quantized samples is supplied to the bit stream generation unit 6.

【００１３】ビットストリーム生成部６では、量子化部
５で量子化されたオーディオデータに聴覚心理処理部７
からのビット配分テーブルとスケールファクタ選択情報
算出部４からのスケールファクタ，スケールファクタ選
択情報SCFSIとが付加され、さらに、同期ワードやレイ
ヤ指定，ビットレート，サンプリング周波数などの情報
を含むヘッダ、エラーチェックコード（オプション）が
付加され、オーディオストリームＡ_STとして出力され
る。なお、ビット配分数が０のサブバンドに対しては、
スケールファクタ（ＳＦ番号）が省略される。The bit stream generator 6 adds the audio data quantized by the quantizer 5 to the psychoacoustic processor 7.
, A scale factor selection information calculation section 4 and a scale factor selection information SCFSI, and a header including information such as a synchronization word, a layer designation, a bit rate, and a sampling frequency, and an error check. A code (optional) is added, and output as an audio stream _AST . In addition, for the sub-band where the bit allocation number is 0,
The scale factor (SF number) is omitted.

【００１４】かかるＭＰＥＧオーディオでは、レベルが
低くて人間の耳にほとんど聞こえない音声情報や近接し
た周波数帯での高いレベルの音声情報によって聞き取り
にくくなった音声情報などを除くように、聴覚心理処理
部７でサブバンド毎のビット配分が行なわれるので、オ
ーディオデータの符号量を大幅に削減でき、また、狭い
バンド幅内では、低いレベルの音声が高いレベルの音声
によってマスクされて聞こえなくなるといったマスキン
グ特性などを利用していることにより（サブバンド分
割）、符号化に際して発生する量子化ノイズの影響を除
くことができ、高能率の圧縮を可能としている。In such MPEG audio, the psychoacoustic processing unit removes audio information that is low in level and hardly audible to human ears, or audio information that is difficult to hear due to high-level audio information in a nearby frequency band. 7, the bit allocation for each sub-band is performed, so that the code amount of the audio data can be greatly reduced, and within a narrow bandwidth, a low-level voice is masked by a high-level voice and becomes inaudible. The use of such a technique (sub-band division) makes it possible to eliminate the influence of quantization noise generated at the time of encoding, thereby enabling highly efficient compression.

【００１５】ところで、かかる音声圧縮装置では、聴覚
心理処理装置７でのサブバンド毎のビット配分処理（ビ
ット配分テーブルの作成処理）に膨大な演算量を必要と
してその作成処理に非常な時間を要し、マイコンを使用
してソフトウエアでリアルタイムにディジタル音声デー
タＡ_INを圧縮処理することができないため、音声専用の
ＬＳＩが必要であることから、先に、本出願人は、この
演算量を大幅に低減可能とした音声圧縮装置が特開平１
１ー１９５９９５号公報で提案した。By the way, in such a voice compression device, the bit allocation processing (processing for creating a bit allocation table) for each sub-band in the psychoacoustic processing apparatus 7 requires an enormous amount of calculation, and the creation processing takes an extremely long time. However, since the digital audio data A _IN cannot be compressed in real time by software using a microcomputer, a dedicated LSI for audio is required. Japanese Patent Laid-Open No.
This is proposed in Japanese Patent Publication No. 1-195995.

【００１６】これは、上記の音声圧縮装置のように入力
音声データからビット配分テーブルを聴覚心理の特性を
利用して作成するのではなく、サンプリング周波数やビ
ットレートなどから予め１乃至複数のビット配分テーブ
ルを作成し、これを用いるようにするものである。複数
のビット配分テーブルを用いる場合には、これらのビッ
ト配分テーブルは各サブバンドに配分されるビット総数
が異なっており、サブバンドへの配分は、上記の最小可
聴限界をもとに、人間が検知し易いサブバンドに多くの
ビットが割り当てられる。従って、低域側のサブバンド
ほど多くのビットが割り当てられる。また、スケールフ
ァクタ選択情報算出部（図６参照）で共通化されたスケ
ールファクタの総数をもとに、最適なビット配分テープ
ルが選択されて使用される。This is not to create a bit allocation table from input audio data using the psychoacoustic characteristics as in the above-described audio compression apparatus, but to use one or more bit allocation tables in advance based on the sampling frequency and bit rate. A table is created and used. When a plurality of bit allocation tables are used, these bit allocation tables differ in the total number of bits allocated to each subband, and the allocation to the subbands is performed by humans based on the minimum audible limit described above. Many bits are allocated to subbands that are easy to detect. Therefore, more bits are allocated to the lower band subband. Further, based on the total number of scale factors shared by the scale factor selection information calculation unit (see FIG. 6), an optimal bit distribution table is selected and used.

【００１７】これによると、普段よく使われているビッ
トレート（例えば、１２８ｋｂｐｓ程度）では、サブバ
ンド毎のビット配分数が多くなるため、効果的に圧縮処
理を行なうことができて、復号の際の音質も向上する。According to this, at a commonly used bit rate (for example, about 128 kbps), the number of bits allocated to each subband increases, so that the compression process can be performed effectively and Sound quality is also improved.

【００１８】また、特願平１０−２５３６４２号には、
サブバンドの大きさによってビット配分を算出し、さら
に、音質を向上させる例も挙げられている。Japanese Patent Application No. 10-253,642 discloses that
There is also an example in which bit allocation is calculated according to the size of a subband, and the sound quality is further improved.

【００１９】[0019]

【発明が解決しようとする課題】上記特開平１１ー１９
５９９５号公報の従来技術では、上記のように、サブバ
ンド毎のビット配分テーブルの再生処理を不要とし、ま
た、音質も向上するという効果があるが、その反面、サ
ンプリング周波数がＣＤやＭＤなどで使用される４４.
１ｋＨｚの場合はまだしも、ＲＴＲＷ規格に準拠するＤ
ＶＤカメラで使用される音声サンプリング周波数である
４８ｋＨｚとなると、ユーザとしては、さらなる高音質
を期待するのは否めない。従って、さらなる音質向上を
はかることが必要となる。Problems to be Solved by the Invention Japanese Patent Laid-Open No. 11-19 / 1999
In the prior art of No. 5995, as described above, there is an effect that the reproduction process of the bit allocation table for each sub-band is not required and the sound quality is improved, but on the other hand, the sampling frequency is limited to CD or MD. Used 44.
In the case of 1 kHz, the D
At 48 kHz, which is the audio sampling frequency used in the VD camera, the user cannot help but expect higher sound quality. Therefore, it is necessary to further improve the sound quality.

【００２０】また、上記のような予め作成したビット配
分テーブルを用いた場合には、サブバンド全てに対して
一括したビット配分を行なうと、必然的に低域側のサブ
バンドにビット配分の偏りができ、高域側に配分される
ビット数が少なくなってしまい、音質向上を期待できな
い場合がある。In the case of using the bit allocation table prepared in advance as described above, if the bit allocation is performed for all the sub-bands at one time, the bias of the bit allocation is inevitably shifted to the lower band side sub-band. And the number of bits allocated to the high frequency side is reduced, so that improvement in sound quality may not be expected.

【００２１】本発明の目的は、以上のような点に鑑みて
なされたものであって、その目的は、ビット配分テーブ
ルを用いながら、確実にさらなる音質向上を達成可能と
した音声圧縮装置を提供することにある。An object of the present invention has been made in view of the above points, and an object of the present invention is to provide an audio compression apparatus capable of surely achieving further improvement in sound quality while using a bit allocation table. Is to do.

【００２２】[0022]

【課題を解決するための手段】上記目的を達成するため
に、本発明は、ディジタル音声データを一定サンプル数
のフレーム毎に処理して複数のサブバンドに分割し、所
定数のフレームの符号化単位毎に夫々のサブバンドのオ
ーディオデータを正規化して夫々毎のスケールファクタ
を形成し、符号化単位毎に該複数のサブバンドを帯域が
異なる複数のグループに区分けして、予め決められた複
数のビット配分数を、該グループ間でビット配分数の総
数がほぼ平均化するように、該グループに振り分けて、
かつ該グループ毎にスケールファクタに応じて該ビット
配分数をサブバンドに振り分けてビット配分テーブルを
作成し、該ビット配分テーブルのビット配分数の量子化
ビット数で、符号化単位毎に、各サブバンドの正規化さ
れたオーディオデータを量子化し、符号化単位の量子化
された該オーディオデータにヘッダやスケールファクタ
などを付加してオーディオストリームとするものであ
る。In order to achieve the above object, the present invention provides a method of processing digital audio data for each frame of a fixed number of samples, dividing the data into a plurality of sub-bands, and encoding a predetermined number of frames. Normalize the audio data of each sub-band for each unit to form a scale factor for each, and divide the plurality of sub-bands into a plurality of groups having different bands for each coding unit, and Are allocated to the groups such that the total number of allocated bits among the groups is substantially averaged,
In addition, a bit allocation table is created by allocating the number of allocated bits to sub-bands according to the scale factor for each group, and the quantization bit number of the number of allocated bits in the bit allocation table is used for each encoding unit. The quantized audio data of the band is quantized, and a header, a scale factor, and the like are added to the quantized audio data of a coding unit to form an audio stream.

【００２３】また、本発明は、ディジタル音声データを
一定サンプル数のフレーム毎に処理し、複数のサブバン
ドに分割するサブバンド分析部と、フレーム毎に、各サ
ブバンドのオーディオデータを正規化するとともに、正
規化のための倍率をスケールファクタとするスケーリン
グ部と、複数の連続するフレームを符号化単位とし、該
サブバンド毎に該符号化単位でのスケールファクタをフ
レーム間で共通化し、かつスケールファクタの共通化状
態を表わすスケールファクタ選択情報を生成するスケー
ルファクタ選択情報算出部と、予め各サブバンドに対す
るビット数が特定の方法で配分されてなるビット配分テ
ーブルが複数格納されたビット配分テーブル部と、符号
化単位毎に、該スケールファクタ選択情報に応じた該ビ
ット配分テーブルを該ビット配分テーブル部から選択す
るビット配分テーブル選択部と、該サブバンド分析部で
分割された該サブバンドを異なる複数の帯域のグループ
に区分けし、該符号化単位毎に、該サブバンドが属する
グループを判定するグループ判定部と、該符号化単位毎
に、該グループ判定部の判定結果に基づいて、該グルー
プ間で該量子化ビット数の配分がほぼ平均化するよう
に、該ビット配分テープル部で選択された該ビット配分
テーブルでのビット配分数を該サブバンドに振り分け、
新たなビット配分テーブルを作成するビット配分振り分
け部と、該符号化単位毎に、該サブバンド夫々のオーデ
ィオデータを該新たなビット配分テーブルによって配分
されるビット数で量子化する量子化部と、該量子化部で
量子化された該符号化単位のサブバンドのオーディオデ
ータに該新たなビット配分テーブル，共通化された該ス
ケールファクタ，該スケールファクタ選択情報などを付
加し、オーディオストリームを生成するビットストリー
ム生成部とからなる構成とするものである。Also, the present invention processes digital audio data for each frame of a fixed number of samples and divides the data into a plurality of subbands, and normalizes the audio data of each subband for each frame. A scaling unit that uses a scaling factor for normalization as a scale factor, and a plurality of consecutive frames as a coding unit, and a scale factor in the coding unit for each subband is shared between frames, and A scale factor selection information calculation unit for generating scale factor selection information representing a factor commonality state, and a bit distribution table unit storing a plurality of bit distribution tables in which the number of bits for each subband is allocated in a specific manner in advance And the bit allocation table corresponding to the scale factor selection information for each coding unit. A bit allocation table selecting unit for selecting from the bit allocation table unit, and the subband divided by the subband analyzing unit are divided into a plurality of different band groups, and the subband belongs to each coding unit. A group determining unit for determining a group; and a bit allocation table for each of the coding units, based on a determination result of the group determining unit, such that the distribution of the number of quantization bits among the groups is substantially averaged. Allocating the bit allocation number in the bit allocation table selected by the section to the sub-band,
A bit allocation allocating unit that creates a new bit allocation table; and a quantization unit that quantizes audio data of each of the subbands with the number of bits allocated by the new bit allocation table for each coding unit. The new bit allocation table, the common scale factor, the scale factor selection information, and the like are added to the audio data of the subband of the coding unit quantized by the quantization unit to generate an audio stream. And a bit stream generation unit.

【００２４】このように、グループ間でビット配分数の
総数をほぼ平均化し、各グループ内でスケールファクタ
に応じてサブバンドでのビット配分数を決めることによ
り、ビット配分数が低域側のサブバンドに偏ることがな
く、オーディオストリームを復号して得られる音声の音
質が向上する。また、量子化に用いるビット配分テーブ
ルは、予め作成されているビット配分テーブルのビット
配分数を振り分けることによって作成するものであるか
ら、量子化に用いるビット配分テーブルを少ない演算量
でほぼリアルタイムで得ることができる。As described above, the total number of bit allocations among the groups is substantially averaged, and the number of bits allocated in the subband is determined in each group according to the scale factor. The sound quality of the sound obtained by decoding the audio stream is improved without being biased to bands. In addition, since the bit allocation table used for quantization is created by allocating the number of bit allocations of the bit allocation table created in advance, the bit allocation table used for quantization is obtained almost in real time with a small amount of calculation. be able to.

【００２５】また、本発明は、ディジタル音声データか
ら分割されるサブバンドの個数は３２個とし、１フレー
ムを１２サンプル数として、符号化単位を連続する３フ
レームとすると、符号化単位での上記のスケールファク
タの共通化により、スケールファクタの符号量を削減す
ることができる。Further, according to the present invention, if the number of subbands divided from digital audio data is 32, one frame is 12 samples, and three consecutive coding units are used, By using the common scale factor, the code amount of the scale factor can be reduced.

【００２６】[0026]

【発明の実施の形態】以下、本発明の実施形態を図面を
用いて説明する。図１は本発明による音声圧縮装置の一
実施形態を示すブロック図であって、８はビット配分テ
ーブル部、９はビット配分テーブル選択部、１０はビッ
ト配分振り分け部、１１は帯域判定部であり、図６に対
応する部分には同一符号をつけて重複する説明を省略す
る。ここでも、先の各従来例と同様、モードをレイヤII
とする。Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing an embodiment of an audio compression apparatus according to the present invention, wherein 8 is a bit allocation table section, 9 is a bit allocation table selection section, 10 is a bit allocation distribution section, and 11 is a band determination section. 6, parts corresponding to those in FIG. In this case, too, the mode is changed to Layer II, as in the previous examples.
And

【００２７】同図において、ビット配分テーブル部８に
は、ビット配分数の総和が異なる複数のビット配分テー
ブルが設定されている。ビット配分テーブル選択部９
は、スケールファクタ選択情報算出部４で上記のように
算出されたスケールファクタ選択情報SCFSIをもとにし
てオーディオストリームＡ_STに付加されるスケールファ
クタ（ＳＦ番号）の個数を求め、その個数に応じたビッ
ト配分総数のビット配分テーブルをビット配分テーブル
部８から選択する。In the figure, a plurality of bit allocation tables having different sums of the number of allocated bits are set in the bit allocation table section 8. Bit distribution table selector 9
Determines the number of scale factor scalefactor selection information SCFSI that the scale factor selection information calculating section 4 is calculated as described above based on is added to the audio stream A _ST (SF number), depending on the number The bit allocation table of the total number of allocated bits is selected from the bit allocation table section 8.

【００２８】帯域判定部１１はサブバンドグループの判
定を行なうものである。この実施形態では、サブバンド
分析部２で細分化される３２個のサブバンドを帯域が異
なる複数のグループ（サブバンドグループ）に区分けす
るものであって、帯域判定部１１はビット配分数の振り
分け処理をするサブバンドがどのサブバンドグループに
属するか判定して指定するものである。The band determining unit 11 determines a subband group. In this embodiment, 32 subbands subdivided by the subband analysis unit 2 are divided into a plurality of groups (subband groups) having different bands, and the band determination unit 11 allocates the number of allocated bits. This is to determine and specify which subband group the subband to be processed belongs to.

【００２９】ビット配分振り分け部１０は、スケーリン
グ部３が算出したスケールファクタを基にして、ビット
配分テーブル選択部９で選択されたビット配分テーブル
のビット配分数を帯域判定部１１で指定されたサブバン
ドグループのサブバンド毎に振り分け、新たなビット配
分テーブルに作成し直すものである。The bit allocation allocating unit 10 determines the number of bits allocated in the bit allocation table selected by the bit allocation table selecting unit 9 on the basis of the scale factor calculated by the The distribution is performed for each sub-band of the band group, and a new bit distribution table is created.

【００３０】量子化部５は、ビット配分振り分け部１０
で新たに作成されたビット配分テーブルとを用い、オー
ディオ符号化単位毎に、スケーリング部３で正規化され
たオーディオデータを、図６に示した従来の音声圧縮装
置と同様にして量子化する。また、ビットストリーム生
成部６は、必要に応じてＣＲＣ（Cyclic RedundancyChe
ck）を用いて誤り検出語を生成し、量子化部５で量子化
されたオーディオ符号化単位毎のオーディオデータに、
ヘッダや誤り検出語、ビット配分振り分け部１０で形成
されたビット配分テーブル、スケールファクタ選択情報
検出部４から出力されるスケールファクタ（ＳＦ番号）
やスケールファクタ選択情報SCFSIなどを付加し、オー
ディオストリームＡ_STとして出力する。The quantizing section 5 includes a bit distribution allocating section 10
Then, the audio data normalized by the scaling unit 3 is quantized for each audio coding unit in the same manner as in the conventional audio compression apparatus shown in FIG. In addition, the bit stream generation unit 6 may use a CRC (Cyclic Redundancy Che
ck), an error detection word is generated, and the audio data for each audio coding unit quantized by the quantization unit 5 is
Header, error detection word, bit allocation table formed by bit allocation distribution unit 10, scale factor (SF number) output from scale factor selection information detection unit 4
And scale factor selection information SCFSI and the like, and output as an audio stream _AST .

【００３１】図２はビット配分テーブル部８に予め格納
されているビット配分テーブルのー具体例を示す図であ
る。FIG. 2 is a diagram showing a specific example of a bit distribution table stored in the bit distribution table section 8 in advance.

【００３２】同図において、ここでは、３個のビット配
分テーブルＡ，Ｂ，Ｃを示す。これらは、サンプリング
周波数が４８ｋＨｚ、ビットレートが１２８kbpsのレイ
ヤIIの場合のものであり、この場合には、量子化ビット
が配分される使用サブバンドは、MPEGオーディオの規格
書ISO/IEC 11172−3:1993(E)により、３２サブバンド
中、２７サブバンドであり、残りの高域側の２８番目〜
３１番目のサブバンド（サブバンド２７〜３１という。
以下同様）のビット配分数を０としている。また、ここ
では、説明を簡明にするために、残りのサブバンド０〜
２６の２７個のサブバンド中、サブバンド０〜１４の１
５個のサブバンド分について説明する。なお、サブバン
ド０〜２６について、ビット配分テーブルＡでのビット
配分総数は５０であり、ビット配分テーブルＢでのビッ
ト配分総数は５１、ビット配分テーブルＣでのビット配
分総数は５２である。また、聴覚心理によるビット配分
の最大値は、１チャンネル当り１２８kbpsで７ビット程
度となる。In FIG. 3, three bit distribution tables A, B, and C are shown. These are for the case of layer II having a sampling frequency of 48 kHz and a bit rate of 128 kbps. In this case, the subbands to which the quantization bits are allocated are the MPEG audio standard ISO / IEC 11172-3. : According to 1993 (E), 27 sub-bands out of 32 sub-bands, and the remaining
The 31st subband (referred to as subbands 27 to 31).
The same applies to the following). Here, for the sake of simplicity, the remaining subbands 0 to
26 of 27 subbands, 1 of subbands 0 to 14
The five subbands will be described. For the subbands 0 to 26, the total number of bits allocated in the bit allocation table A is 50, the total number of bits allocated in the bit allocation table B is 51, and the total number of bits allocated in the bit allocation table C is 52. Further, the maximum value of the bit allocation based on psychological perception is about 7 bits at 128 kbps per channel.

【００３３】ビット配分テーブル部８に格納されている
ビット配分テーブルは、ビット配分テーブルＡ，Ｂ，Ｃ
で示すように、スケールファクタが大きい順にビットの
配分数が多くなるように、ビット配分数が配列された構
成をなしている。このようなビット配分テーブルでは、
低域側のサブバンドに多くの量子化ビット数が配分され
て偏るようになるし、例えば、１ｋＨｚ〜５ｋＨｚのよ
うな人間の耳に感知し易い帯域では、配分されるビット
数が少なくなり、復号される音声の音質が損なわれるこ
とになる。The bit distribution tables stored in the bit distribution table section 8 are bit distribution tables A, B, and C.
As shown by, the bit allocation number is arranged such that the bit allocation number increases in descending order of the scale factor. In such a bit distribution table,
A large number of quantization bits are allocated to the low-frequency side sub-band and become biased. For example, in a band easily perceived by the human ear such as 1 kHz to 5 kHz, the number of allocated bits decreases, The sound quality of the decoded speech will be impaired.

【００３４】そこで、この実施形態では、ビット配分テ
ーブル部８から１つのビット配分テーブルがビット配分
テーブル選択部９で選択されると、ビット配分振り分け
部１０により、スケーリング部３で得られたスケールフ
ァクタ（ＳＦ番号）と帯域判定部１１の判定結果とに基
づいて、サブバンドでのビット配分数を振り分け直し、
新たなビット配分テーブルを作成するものである。Therefore, in this embodiment, when one bit allocation table is selected from the bit allocation table section 8 by the bit allocation table selection section 9, the bit allocation distribution section 10 causes the scale factor obtained by the scaling section 3 to be obtained. (SF number) and the number of bits allocated to the subband are re-allocated based on the determination result of the band determination unit 11,
This is to create a new bit distribution table.

【００３５】かかるビット配分数の振り分けは、まず、
サブバンド全体にわたってビット数の配分がほぼ平均化
されるようにする。このために、この実施形態では、全
サブバンドを帯域が異なる複数のグループ（サブバンド
グループ）に区分し、サブバンドグループ毎のビット配
分総数がほぼ平均化するようにする。一例としては、全
サブバンドを低域，中域，高域の３個のサブバンドグル
ープに区分する。図２に示すビット配分テーブルＡ，
Ｂ，Ｃにおいて、左端のサブバンドが１５個のサブバン
ドのうちで最も低域のサブバンド、右端のサブバンドが
同じく最も高域のサブバンドであり、ここでは、左側５
個のサブバンドを低域のサブバンドグループのサブバン
ド、真中５個のサブバンドを中域のサブバンドグループ
のサブバンド、右側５個のサブバンドを高域のサブバン
ドグループのサブバンドとする。The distribution of the number of bits to be allocated is as follows.
The distribution of the number of bits over the sub-band is approximately averaged. For this purpose, in this embodiment, all the sub-bands are divided into a plurality of groups (sub-band groups) having different bands, and the total number of bits allocated to each sub-band group is substantially averaged. As an example, all the sub-bands are divided into three sub-band groups of a low band, a middle band, and a high band. The bit distribution table A shown in FIG.
In B and C, the leftmost subband is the lowest subband of the 15 subbands, and the rightmost subband is also the highest subband of the 15 subbands.
Subbands are subbands of a lowband subband group, the middle five subbands are subbands of a middle subband group, and the right five subbands are subbands of a high subband group. .

【００３６】そこで、いま、図２のビット配分テーブル
Ｃがビット配分テーブル選択部９で選択されたとする
と、ビット配分振り分け部１０は、上記の３つのサブバ
ンドグループで夫々のビット配分数の総数がほぼ平均化
するように、夫々のサブバンドグループに振り分ける。
ここでは、低域のサブバンドグループにビット配分テー
ブルＣでのビット配分数７，６，５，４，３（合計２５
ビット）を、中域のサブバンドグループに同じくビット
配分数６，４，２，２，１（合計１５ビット）を、高域
のサブバンドグループに同じくビット配分数５，３，
２，１，１（合計１２ビット）を夫々振り分けるものと
する。これは、図２のビット配分テーブルＣについて同
様にグループ分けした場合（低域で２９ビット、中域で
１６ビット、高域で７ビット）に比べ、サブバンドグル
ープ間で平均化されている。但し、低域，中域，高域間
で完全にビット配分数を平均化するよりも、情報量の関
係から低域ほど配分数を増やした方が良いので、低域で
ビット配分総数を最大とし、中域，高域の順にビット配
分総数が小さくなるようにしている。If the bit allocation table C in FIG. 2 is selected by the bit allocation table selecting section 9, the bit allocation allocating section 10 determines that the total number of bits allocated to each of the three subband groups is three. Assign to each sub-band group so that they are almost averaged.
Here, the bit allocation number 7, 6, 5, 4, 3 in the bit allocation table C is assigned to the low band subband group (total 25).
Bit), the bit allocation number 6, 4, 2, 2, 1 (total 15 bits) for the middle band sub-band group, and the bit allocation number 5, 3, also for the high band sub-band group.
2, 1, and 1 (total of 12 bits) are respectively allocated. This is averaged between the sub-band groups as compared with the case where the bit distribution table C in FIG. 2 is similarly grouped (29 bits in the low band, 16 bits in the middle band, 7 bits in the high band). However, rather than completely averaging the number of allocated bits among the low band, the middle band, and the high band, it is better to increase the number of allocated bits in the lower band from the relationship of the amount of information. Thus, the total number of allocated bits becomes smaller in the order of the middle band and the high band.

【００３７】なお、スケーリング部３で得られたスケー
ルファクタはビット配分振り分け部１０にも送られ、サ
ブバンドグループ毎に区分されて格納されている。The scale factor obtained by the scaling unit 3 is also sent to the bit allocation distribution unit 10 and stored by being divided for each subband group.

【００３８】帯域判定部１１が低域を指定する。これに
より、ビット配分振り分け部１０では、低域のサブバン
ドグループ内の５個のサブバンドのスケールファクタが
ＳＦ番号で比較される。いま、低域のサブバンドグルー
プでのサブバンドを低域側から順にサブバンド０，１，
２，３，４とし、これらのＳＦ番号（０），（１），
（２），（３），（４）が（０）＜（１）＜（２）＜（３）＜（４）の順とすると、まず、ＳＦ番号が最も小さいサブバンド
０に対して低域のサブバンドグループに振り分けられた
最も大きいビット配分数７が割り当てられ、次に、２番
目に小さいＳＦ番号のサブバンド１に２番目に大きいビ
ット配分数６が、次に、３番目に小さいＳＦ番号のサブ
バンド２に３番目に大きいビット配分数５が、次に、４
番目に小さいＳＦ番号のサブバンド３に５番目に大きい
ビット配分数４が、次に、最も大きいＳＦ番号のサブバ
ンド４に最も小さいビット配分数３が夫々割り付けられ
る。The band judging section 11 designates a low band. As a result, the bit allocation distribution unit 10 compares the scale factors of the five subbands in the low frequency subband group with the SF numbers. Now, the subbands in the lowband subband group are sequentially assigned to subbands 0, 1,
2, 3, 4 and these SF numbers (0), (1),
Assuming that (2), (3), and (4) are in the order of (0) <(1) <(2) <(3) <(4), first, low SF number is lower than subband 0. The highest allocated bit number 7 allocated to the subband group of the area is allocated, the second highest allocated bit number 6 is assigned to the subband 1 having the second lowest SF number, and the third lowest allocated number is next. The sub-band 2 of the SF number has the third largest bit allocation number 5,
The fifth largest bit allocation number 4 is allocated to the subband 3 having the lowest SF number, and the smallest bit allocation number 3 is allocated to the subband 4 having the highest SF number.

【００３９】このようにして、低域のサブバンドグルー
プでのビット配分数の振り分けが終わると、次に、帯域
判定部１１は中域を指定する。これにより、ビット配分
振り分け部１０では、中域のサブバンドグループ内の５
個のサブバンドのスケールファクタのＳＦ番号が比較さ
れる。いま、中域のサブバンドグループでのサブバンド
を低域側から順にサブバンド５，６，７，８，９とし、
これらのＳＦ番号（５），（６），（７），（８），
（９）が（５）＜（６）＜（７）＜（８）＜（９）の順とすると、まず、ＳＦ番号が最も小さいサブバンド
５に対して中域のサブバンドグループに割り付けられた
最も大きいビット配分数６が割り当てられ、次に、２番
目に小さいＳＦ番号のサブバンド６に２番目に大きいビ
ット配分数６が、次に、３番目に小さいＳＦ番号のサブ
バンド７に３番目に大きいビット配分数５が、次に、４
番目に小さいＳＦ番号のサブバンド８に４番目に大きい
ビット配分数４が、次に、最も大きいＳＦ番号のサブバ
ンド９に最も小さいビット配分数３が夫々割り付けられ
る。After the distribution of the number of allocated bits in the low-frequency sub-band group is completed in this way, the band determination unit 11 next specifies the middle frequency band. As a result, the bit allocation distribution unit 10 selects 5
The SF numbers of the scale factors of the subbands are compared. Now, the subbands in the subband group in the middle band are referred to as subbands 5, 6, 7, 8, 9 in order from the low band side,
These SF numbers (5), (6), (7), (8),
Assuming that (9) is in the order of (5) <(6) <(7) <(8) <(9), subband 5 having the smallest SF number is first allocated to the subband group in the middle band. Then, the largest bit allocation number 6 is assigned, the second largest bit allocation number 6 is assigned to the subband 6 having the second lowest SF number, and the third is assigned to the subband 7 having the third lowest SF number. The 5th largest bit allocation number is 5
The fourth largest bit allocation number 4 is allocated to the subband 8 having the lowest SF number, and the smallest bit allocation number 3 is allocated to the subband 9 having the highest SF number.

【００４０】このようにして、中域のサブバンドグルー
プでのビット配分数の振り分けが終わると、次に、帯域
判定部１１は高域を指定する。これにより、ビット配分
振り分け部１０では、高域のサブバンドグループ内の５
個のサブバンドのスケールファクタのＳＦ番号が比較さ
れる。いま、高域のサブバンドグループでのサブバンド
を低域側から順にサブバンド１０，１１，１２，１３，
１４とし、これらのＳＦ番号（１０），（１１），（１
２），（１３），（１４）が（１０）＜（１１）＜（１２）＜（１３）＜（１４）の順とすると、まず、ＳＦ番号が最も小さいサブバンド
１０に対して高域のサブバンドグループに割り付けられ
た最も大きいビット配分数５が割り当てられ、次に、２
番目に小さいＳＦ番号のサブバンド１１に２番目に大き
いビット配分数３が、次に、３番目に小さいＳＦ番号の
サブバンド１２に３番目に大きいビット配分数２が、次
に、４番目に小さいＳＦ番号のサブバンド１３に４番目
に大きいビット配分数１が、次に、最も大きいＳＦ番号
のサブバンド１４に最も小さいビット配分数１が夫々割
り付けられる。After the distribution of the bit allocation numbers in the middle band sub-band group is completed in this way, the band determination unit 11 next specifies the high band. As a result, the bit allocation distribution unit 10 selects 5
The SF numbers of the scale factors of the subbands are compared. Now, the subbands in the highband subband group are sequentially arranged from the lowband side to subbands 10, 11, 12, 13,.
14 and these SF numbers (10), (11), (1)
2), (13), and (14) are in the order of (10) <(11) <(12) <(13) <(14). Are assigned the largest bit allocation number of 5 assigned to the subband groups of
The sub-band 11 with the lowest SF number has the second largest bit allocation number 3, the sub-band 12 with the third lowest SF number has the third largest bit allocation number 2, and the fourth next. The fourth largest bit allocation number 1 is allocated to the sub-band 13 having the lowest SF number, and the smallest bit allocation number 1 is allocated to the sub-band 14 having the highest SF number.

【００４１】このようにして、図３（ａ）に示すような
新たなビット配分テーブルが作成される。この新たなビ
ット配分テーブルでは、上記のように、低域のサブバン
ドグループで配分されるビット数が２５、中域のサブバ
ンドグループに配分されるビット数が１５、低域に配分
されるビット数が１２であり、図２に示すビット配分
（低域：２９ビット、中域：１６ビット、低域：７ビッ
ト）に比べて平均化される。この新たなビット配分テー
ブルが量子化部５での各サブバンドの量子化に用いられ
るとともに、ビットストリーム生成部６でオーディオス
トリームＡ_STに付加される。Thus, a new bit distribution table as shown in FIG. 3A is created. In this new bit allocation table, as described above, the number of bits allocated to the low band sub-band group is 25, the number of bits allocated to the middle band sub-band group is 15, and the number of bits allocated to the low band is 15 The number is 12, which is averaged as compared with the bit allocation shown in FIG. 2 (low frequency: 29 bits, middle frequency: 16 bits, low frequency: 7 bits). This new bit allocation table is used for quantization of each subband in the quantization unit 5, and is added to the audio stream _AST by the bit stream generation unit 6.

【００４２】なお、このビット配分数の振り分け方法は
一例に過ぎず、これ以外の方法であってもよい。例え
ば、上記の例は、低域のサブバンドグループについて振
り分けを行ない、これが終わると、中域のサブバンドグ
ループについて振り分けを行ない、最後に、高域のサブ
バンドグループについて振り分けを行なうものであった
が、低域のサブバンドグループ，中域のサブバンドグル
ープ，高域のサブバンドグループの順に１つのサブバン
ドずつ振り分けを行なうようにしてもよい。The method of assigning the number of allocated bits is merely an example, and other methods may be used. For example, in the above example, the distribution is performed for the low band sub-band group, and when this is completed, the distribution is performed for the middle band sub-band group, and finally, the distribution is performed for the high band sub-band group. However, one subband may be sorted in the order of the low band subband group, the middle band subband group, and the high band subband group.

【００４３】図３（ｂ）はこの振り分け方法によって得
られたビット配分テーブルを示すものであって、この場
合、低域，中域，高域のサブバンドグループでのサブバ
ンドのＳＦ番号が上記と同様の大小関係にあるとする
と、まず、帯域判定部１１が低域を指定する。これによ
り、ビット配分振り分け部１０では、まず、低域のサブ
バンドグループ内の５個のサブバンドのＳＦ番号が比較
され、低域のサブバンドグループでのＳＦ番号が最も小
さいサブバンド０に図２のビット配分テーブルＣでの最
も大きいビット配分数７が割り当てられる。次に、帯域
判定部１１が中域を指定する。これにより、ビット配分
振り分け部１０では、中域のサブバンドグループ内の５
個のサブバンドのＳＦ番号が比較され、中域のサブバン
ドグループでのＳＦ番号が最も小さいサブバンド５に図
２のビット配分テーブルＣでの２番目に大きいビット配
分数６が割り当てられる。次に、帯域判定部１１が高域
を指定する。これにより、ビット配分振り分け部１０で
は、高域のサブバンドグループ内の５個のサブバンドの
ＳＦ番号が比較され、高域のサブバンドグループでのＳ
Ｆ番号が最も小さいサブバンド１０に図２のビット配分
テーブルＣでの３番目に大きいビット配分数６が割り当
てられる。次に、帯域判定部１１が低域を指定する。こ
れにより、ビット配分振り分け部１０では、低域のサブ
バンドグループ内のＳＦ番号が２番目に小さいサブバン
ド１に図２のビット配分テーブルＣでの残りのビット配
分数のうちの最も大きいビット配分数５が割り当てられ
る。次に、帯域判定部１１が中域を指定する。これによ
り、ビット配分振り分け部１０では、中域のサブバンド
グループ内のＳＦ番号が２番目に小さいサブバンド６に
図２のビット配分テーブルＣでの残りのビット配分数の
うちの２番目に大きいビット配分数５が割り当てられ
る。次に、帯域判定部１１が高域を指定する。これによ
り、ビット配分振り分け部１０では、高域のサブバンド
グループ内のＳＦ番号が２番目に小さいサブバンド１１
に図２のビット配分テーブルＣでの残りのビット配分数
のうちの３番目に大きいビット配分数４が割り当てられ
る。FIG. 3B shows a bit distribution table obtained by this distribution method. In this case, the SF numbers of the sub-bands in the low-band, middle-band, and high-band sub-band groups are set as described above. First, the band determination unit 11 specifies a low band. As a result, the bit allocation distribution unit 10 first compares the SF numbers of the five subbands in the low band subband group, and maps the subband 0 having the smallest SF number in the low band subband group. The largest bit allocation number 7 in the bit allocation table C of 2 is assigned. Next, the band determination unit 11 specifies a middle band. As a result, the bit allocation distribution unit 10 selects 5
The SF numbers of the subbands are compared, and the subband 5 having the lowest SF number in the subband group in the middle band is assigned the second largest bit allocation number 6 in the bit allocation table C of FIG. Next, the band determination unit 11 specifies a high band. As a result, the bit allocation distribution unit 10 compares the SF numbers of the five subbands in the high band subband group and determines the S number in the high band subband group.
The third largest bit allocation number 6 in the bit allocation table C of FIG. 2 is allocated to the subband 10 with the smallest F number. Next, the band determination unit 11 specifies a low band. Thereby, the bit allocation allocating unit 10 assigns the largest bit allocation among the remaining bit allocation numbers in the bit allocation table C of FIG. 2 to the subband 1 having the second lowest SF number in the low band subband group. Equation 5 is assigned. Next, the band determination unit 11 specifies a middle band. As a result, the bit allocation distribution unit 10 assigns the subband 6 having the second lowest SF number in the subband group in the middle band to the second largest among the remaining bit allocation numbers in the bit allocation table C in FIG. A bit allocation number of 5 is assigned. Next, the band determination unit 11 specifies a high band. As a result, the sub-band 11 having the second lowest SF number in the high-band sub-band group
Is assigned the third largest bit allocation number 4 among the remaining bit allocation numbers in the bit allocation table C of FIG.

【００４４】以下同様にして、サブバンド２にビット配
分数４が、サブバンド７にビット配分数３が、サブバン
ド１２にビット配分数２が、サブバンド３にビット配分
数３が、サブバンド８にビット配分数２が、サブバンド
１３にビット配分数１が、サブバンド４にビット配分数
２が、サブバンド９にビット配分数１が、サブバンド１
４にビット配分数１が順番に割り当てられる。Similarly, the bit allocation number 4 is assigned to the subband 2, the bit allocation number 3 is assigned to the subband 7, the bit allocation number 2 is assigned to the subband 12, the bit allocation number 3 is assigned to the subband 3, and 8, bit allocation number 1 for subband 13, bit allocation number 2 for subband 4, bit allocation number 1 for subband 9, subband 1
4, bit allocation numbers 1 are sequentially assigned.

【００４５】このようにして、図３（ｂ）に示すような
新たなビット配分テーブルが得られる。この新たなビッ
ト配分テーブルでは、低域のサブバンドグループに配分
されるビット数が２２、中域のサブバンドグループに配
分されるビット数が１７、低域に配分されるビット数が
１３であり、図２に示すビット配分（低域：２９ビッ
ト、中域：１６ビット、低域：７ビット）に比べて平均
化される。In this way, a new bit distribution table as shown in FIG. 3B is obtained. In this new bit allocation table, the number of bits allocated to the low band sub-band group is 22, the number of bits allocated to the middle band sub-band group is 17, and the number of bits allocated to the low band is 13 2 (low frequency: 29 bits, middle frequency: 16 bits, low frequency: 7 bits) shown in FIG.

【００４６】図４は図１におけるビット配分振り分け部
１０が図３（ａ）に示したビット配分テーブルを作成す
る場合の動作を示すフローチャートである。FIG. 4 is a flow chart showing the operation when the bit distribution distribution unit 10 in FIG. 1 creates the bit distribution table shown in FIG.

【００４７】同図において、まず、帯域判定部１１から
低域サブバンドグループが指定され（ステップ１０
０）、これにより、ステップ１０４からの動作を次のよ
うに実行する。In the figure, first, the low band sub-band group is designated by the band judgment section 11 (step 10).
0), whereby the operation from step 104 is executed as follows.

【００４８】即ち、レイヤIIの場合、オーディオ符号化
単位（連続した３フレーム）毎に符号化が行なわれる
が、これら３フレーム毎にスケールファクタ（ＳＦ番
号）が設定されている。そこで、各サブバンドについ
て、オーディオ符号化単位の３つのフレームに対するス
ケールファクタのうち、最大値のスケールファクタ（最
小値のＳＦ番号）を検出し、これをサブバンドのスケー
ルファクタとする（ステップ１０５）。これにより、低
域のサブバンドグループでの５個のサブバンド（図３
（ｂ）の場合、サブバンド０〜４）夫々のスケールファ
クタが決まる。そして、かかるスケールファクタが大き
い順に、これら５個のサブバンド０〜４の順番を設定す
る（ステップ１０５）。That is, in the case of Layer II, encoding is performed for each audio encoding unit (three consecutive frames), and a scale factor (SF number) is set for each of these three frames. Thus, for each subband, the scale factor of the maximum value (SF number of the minimum value) among the scale factors for the three frames of the audio coding unit is detected, and this is set as the scale factor of the subband (step 105). . As a result, five subbands in the lowband subband group (FIG. 3)
In the case of (b), the scale factor of each of the subbands 0 to 4) is determined. Then, the order of these five sub-bands 0 to 4 is set in the descending order of the scale factor (step 105).

【００４９】次に、スケールファクタ選択情報算出部４
からのスケールファクタ選択情報SCFSIをもとに低域の
サブバンドグループでのスケールファクタの数を算出
し、このスケールファクタ数に応じたビット配分テーブ
ルをビット配分テーブル選択部９によってビット配分テ
ーブル部８から読み取り、このビット配分テーブルでの
ビット配分数を、低域，中域，高域のサブバンドグルー
プでのビット配分総数がほぼ平均化されるように、低
域，中域，高域のサブバンドグループ毎に区分する（ス
テップ１０６）。この読み取ったビット配分テーブルが
図２に示すビット配分テーブルＣとすると、このビット
配分テーブルＣのビット配分数を、上記のように、低
域，中域，高域のサブバンドグループに振り分ける。Next, the scale factor selection information calculation section 4
The number of scale factors in the low-frequency subband group is calculated based on the scale factor selection information SCFSI from, and a bit allocation table selecting unit 9 generates a bit allocation table corresponding to the number of scale factors by the bit allocation table selecting unit 9. , And the number of bits allocated in the bit allocation table is adjusted so that the total number of bits allocated in the low band, middle band, and high band sub-band groups is substantially averaged. Classification is performed for each band group (step 106). Assuming that the read bit allocation table is the bit allocation table C shown in FIG. 2, the bit allocation number of the bit allocation table C is allocated to the low band, middle band, and high band sub-band groups as described above.

【００５０】そして、低域のサブバンドグループについ
て、ステップ１０５で設定された順番に従って、順番が
早いサブバンドほど大きいビット配分数が割り当てられ
るように、低域のサブバンドグループに対して振り分け
られたビット配分数を夫々のサブバンドに割り当てる
（ステップ１０７）。図２に示すビット配分テーブルが
ビット配分テーブル部８から選択された場合には、図３
（ｂ）で説明したように、サブバンド０〜４にビット配
分テーブルが割り当てられる。Then, the low band sub-band groups are allocated to the low band sub-band groups in accordance with the order set in step 105, so that the larger the number of allocated bits is, the faster the sub-band is. A bit allocation number is assigned to each subband (step 107). When the bit distribution table shown in FIG. 2 is selected from the bit distribution table section 8, FIG.
As described in (b), the bit allocation table is allocated to subbands 0 to 4.

【００５１】低域のサブバンドグループでのかかるビッ
ト配分数の割り当て処理が終了すると（ステップ１０
８）、次に、帯域判定部１１によって中域のサブバンド
グループが指定され、この指定が判定されることにより
（ステップ１０２）、中域のサブバンドグループについ
て、同様のステップ１０４〜１０７の処理が行なわれ、
サブバンド５〜９にビット配分数が割り当てられる。中
域のサブバンドグループでのかかるビット配分数の割り
当て処理が終了すると（ステップ１０８）、次に、帯域
判定部１１によって高域のサブバンドグループが指定さ
れ、この指定が判定されることにより（ステップ１０
３）、高域のサブバンドグループについて、同様のステ
ップ１０４〜１０７の処理が行なわれ、サブバンド１０
〜１４にビット配分数が割り当てられる。そして、全て
のサブバンドに対するビット配分数の割り当てが終了す
ると、図３（ｂ）で示したような新たなビット配分テー
ブルが得られる。When the process of allocating the number of allocated bits in the low band subband group is completed (step 10).
8) Next, the middle band sub-band group is designated by the band decision unit 11, and the designation is decided (step 102), whereby the same processing of steps 104 to 107 is performed for the middle band sub-band group. Is performed,
Bit allocation numbers are assigned to subbands 5 to 9. When the process of allocating the number of allocated bits in the middle band sub-band group is completed (step 108), the band judging unit 11 next specifies the high band sub-band group, and the designation is determined ( Step 10
3) The same processing of steps 104 to 107 is performed for the high band sub-band group, and the sub-band 10
The number of allocated bits is assigned to .about.14. When the allocation of the bit allocation numbers to all the subbands is completed, a new bit allocation table as shown in FIG. 3B is obtained.

【００５２】なお、図３（ｂ）に示した割り付けのビッ
ト配分テーブルを作成する場合には、１つのサブバンド
にビット配分数を割り付ける毎にサブバンドグループを
替えていくものであるから、例えば、ステップ１００で
低域のサブバンドグループと判定されると、１つのサブ
バンドについてステップ１０４〜１０７の動作が行なわ
れ、ステップ１０８で次の中域のサブバンドグループが
指定されるから、ステップ１０２からステップ１０４に
入る。以下、１つのサブバンドでのステップ１０４〜１
０７の動作が行なわれる毎に、ステップ１００，１０
２，１０３の順に判定が行なわれてステップ１０４に進
む。但し、この場合、ステップ１０４，１０５での最大
値のスケールファクタの検出やサブバンドの順番付け
は、既にビット配分数が割り当てられたもの以外のもの
について行なわれる。また、ステップ１０６は、最初の
サブバンドへのビット配分数の割り付けのときに行なわ
れ、このとき各サブバンドグループに振り分けられて残
っているビット配分数について行なわれる。When the allocation bit allocation table shown in FIG. 3B is created, the sub-band group is changed every time the number of allocated bits is allocated to one sub-band. If it is determined in step 100 that the subband group is a low band subband group, the operations of steps 104 to 107 are performed for one subband, and the next middle band subband group is specified in step 108. From step 104. Hereinafter, steps 104-1 in one subband
Each time the operation of step 07 is performed, steps 100, 10
Determinations are made in the order of 2, 103, and the routine proceeds to step 104. However, in this case, the detection of the scale factor of the maximum value and the ordering of the subbands in steps 104 and 105 are performed for those other than those to which the number of allocated bits has already been assigned. Step 106 is performed at the time of assigning the number of allocated bits to the first subband. At this time, the number of allocated bits remaining after being allocated to each subband group is performed.

【００５３】図５は本発明による音声圧縮装置を用いた
システムの一具体例を示すブロック図であって、ここで
は、ＤＶＤカメラとしており、２０はＤＶＤカメラ、２
１はレンズ、２２は撮像素子、２３はカメラ信号処理
部、２４はＭＰＥＧＣＯＤＥＣ部、２５はマイクロホ
ン（以下、マイクという）、２６はＡ／Ｄ変換部、２７
はオーディオバッファ、２８はマイクロコンピュータ
（以下、マイコンという）、２９は本発明による音声圧
縮装置を用いたＭＰＥＧオーディオ部、３０は多重化
部、３１は記録媒体である。FIG. 5 is a block diagram showing a specific example of a system using the audio compression apparatus according to the present invention. Here, a DVD camera is shown.
1 is a lens, 22 is an image sensor, 23 is a camera signal processing unit, 24 is an MPEG CODEC unit, 25 is a microphone (hereinafter referred to as a microphone), 26 is an A / D converter, 27
, An audio buffer; 28, a microcomputer; 29, an MPEG audio unit using the audio compression apparatus according to the present invention; 30, a multiplexing unit; and 31, a recording medium.

【００５４】同図において、ＤＶＤカメラ２０は、映像
信号をＭＰＥＧ１または２規格に準拠して符号化し、ま
た、音声信号をＭＰＥＧオーディオ規格に準拠して符
号化した後、これらを多重化して出力し、あるいは内蔵
の記録媒体（例えば、ＤＶＤ−ＲＡＭなどの光ディスク
や磁気テープなどの大容量記録媒体）３１に記録するも
のである。In FIG. 1, the DVD camera 20 encodes a video signal in accordance with the MPEG1 or 2 standard, encodes an audio signal in accordance with the MPEG audio standard, and multiplexes and outputs these. Or a built-in recording medium (for example, a large-capacity recording medium such as an optical disk such as a DVD-RAM or a magnetic tape) 31.

【００５５】レンズ２１を介して入射される被写体像は
撮像素子２２によって電気信号に変換される。この電気
信号は、カメラ信号処理部２３で処理されて映像信号に
変換され、ＭＰＥＧＣＯＤＥＣ部２４で圧縮符号化さ
れてビデオストリームが生成される。このビデオストリ
ームはマイコン２８に供給される。また、音声情報を入
力するマイク２５から出力される音声信号は、Ａ／Ｄ変
換部２６でディジタル音声データに変換された後、オー
ディオバッファ２７に一時的に蓄えられ、マイコン２８
によって読み取られる。The subject image incident through the lens 21 is converted into an electric signal by the image pickup device 22. This electric signal is processed by the camera signal processing unit 23, converted into a video signal, and compression-encoded by the MPEG CODEC unit 24 to generate a video stream. This video stream is supplied to the microcomputer 28. An audio signal output from a microphone 25 for inputting audio information is converted into digital audio data by an A / D converter 26, and then temporarily stored in an audio buffer 27.
Read by.

【００５６】マイコン２８は先に説明した本発明による
音声圧縮装置からなるＭＰＥＧオーディオ部２９と多
重化部３０とを備えており、オーディオバッファ２７か
ら読み取ったディジタル音声データを、先に説明したよ
うにして、ＭＰＥＧオーディオ部２９でＭＰＥＧオー
ディオ規格に準拠して圧縮符号化することにより、オー
ディオストリームを形成し、このオーディオストリーム
とＭＰＥＧＣＯＤＥＣ部２４から取り込んだビデオス
トリームとを多重化部３０で多重化する。これによって
得られる多重化信号は、外部に出力したり、内蔵の記録
媒体３１に記録することができる。The microcomputer 28 has an MPEG audio unit 29 and a multiplexing unit 30 each of which is the above-described audio compression device according to the present invention, and converts digital audio data read from the audio buffer 27 as described above. Then, the audio stream is formed by compression encoding in accordance with the MPEG audio standard in the MPEG audio unit 29, and the audio stream and the video stream fetched from the MPEG CODEC unit 24 are multiplexed in the multiplexing unit 30. . The multiplexed signal thus obtained can be output to the outside or recorded on the built-in recording medium 31.

【００５７】以上、本発明の実施形態を説明したが、本
発明は、かかる実施形態のみに限定されるものではな
い。Although the embodiments of the present invention have been described above, the present invention is not limited to only such embodiments.

【００５８】例えば、上記実施形態では、全サブバンド
を低域，中域，高域の３個のサブバンドグループに振り
分けるとしたが、これに限るものではなく、２以上の異
なる帯域のサブバンドグループに振り分けるようにすれ
ばよく、勿論、これらサブバンドグループ毎に配分され
るビット数はほぼ等しくし、平均化したビット配分が行
なわれる。For example, in the above embodiment, all the sub-bands are allocated to three sub-band groups of low band, middle band, and high band. However, the present invention is not limited to this. The number of bits allocated to each of the subband groups is, of course, substantially equal, and averaged bit allocation is performed.

【００５９】また、上記実施形態では、３フレームをオ
ーディオ符号化単位とするレイヤIIのモードとしたが、
１フレームをオーディオ符号化単位とするレイヤＩであ
ってもよい。但し、この場合には、スケールファクタの
共通化は行なわれないので、スケールファクタ選択情報
算出部４は用いられず、また、符号化単位でのスケール
ファクタ数は一定であるので、１個のビット配分テーブ
ルを用い、これをビット配分振り分け部１０で処理し
て、そのときの各サブバンドグループでのスケールファ
クタに応じて各サブバンドのビット配分数が決まる新た
なビット配分テーブルを作成するようにしてもよい。In the above-described embodiment, the mode is the layer II in which three frames are used as an audio coding unit.
It may be a layer I in which one frame is an audio coding unit. However, in this case, since the scale factor is not shared, the scale factor selection information calculation unit 4 is not used, and since the number of scale factors in the coding unit is constant, one bit Using a distribution table, this is processed by the bit distribution allocating unit 10 to create a new bit distribution table in which the number of bits allocated to each subband is determined according to the scale factor of each subband group at that time. You may.

【００６０】[0060]

【発明の効果】以上説明したように、本発明によれば、
ビット配分数の特定のサブバンドへの偏りを緩和するこ
とができて、音質がさらに向上したオーディオデータの
圧縮符号化が可能となる。As described above, according to the present invention,
The bias of the number of allocated bits to a specific sub-band can be alleviated, and compression encoding of audio data with further improved sound quality can be performed.

【００６１】また、本発明によると、予め作成したビッ
ト配分テーブルを使用することができるので、オーディ
オデータの圧縮符号化処理の演算量の大半を占める聴覚
真理特性を利用したビット配分テーブルの作成処理が不
要となり、ほぼリアルタイムの圧縮符号化処理が可能と
なる。According to the present invention, since a bit distribution table prepared in advance can be used, a bit distribution table making use of the auditory truth characteristic which occupies most of the operation amount of the compression encoding processing of audio data. Becomes unnecessary, and almost real-time compression encoding processing becomes possible.

[Brief description of the drawings]

【図１】本発明による音声圧縮装置の一実施形態を示す
ブロック図である。FIG. 1 is a block diagram showing an embodiment of an audio compression device according to the present invention.

【図２】図１におけるビット配分テーブル部に格納され
たビット配分テーブルの一具体例を示す図である。FIG. 2 is a diagram showing a specific example of a bit distribution table stored in a bit distribution table unit in FIG.

【図３】図１におけるビット配分振り分けで形成される
ビット配分テーブルの一具体例である。FIG. 3 is a specific example of a bit distribution table formed by bit distribution in FIG. 1;

【図４】図１に示した実施形態の動作の一具体例を示す
フローチャートである。FIG. 4 is a flowchart illustrating a specific example of the operation of the embodiment illustrated in FIG. 1;

【図５】本発明を用いたシステムの一具体例を示すブロ
ック図である。FIG. 5 is a block diagram showing a specific example of a system using the present invention.

【図６】従来の音声圧縮装置の一例を示すブロック図で
ある。FIG. 6 is a block diagram illustrating an example of a conventional audio compression device.

[Explanation of symbols]

１音声圧縮装置２サブバンド分析部３スケーリング部４スケールファクタ選択情報算出部５量子化部６ビットストリーム生成部８ビット配分テーブル部９ビット配分テーブル選択部１０ビット配分振り分け部１１帯域判定部 DESCRIPTION OF SYMBOLS 1 Speech compression apparatus 2 Subband analysis part 3 Scaling part 4 Scale factor selection information calculation part 5 Quantization part 6 Bit stream generation part 8 Bit distribution table part 9 Bit distribution table selection part 10 Bit distribution distribution part 11 Band judgment part

フロントページの続きＦターム(参考） 5D045 DA20 5J064 AA01 BC01 BC06 BD03 9A001 BB02 EE05 FF05 GG05 HH15 JJ71 KK43 Continued on the front page F term (reference) 5D045 DA20 5J064 AA01 BC01 BC06 BD03 9A001 BB02 EE05 FF05 GG05 HH15 JJ71 KK43

Claims

[Claims]

1. An audio compression apparatus which divides digital audio data into a plurality of sub-bands and quantizes each sub-band to generate a compression-encoded audio stream. And a scale factor that is a scaling factor for normalization is generated. The plurality of sub-bands are divided into a plurality of groups having different bands, and a predetermined number of allocated bits is assigned to each group. In such a way that the sum of the bit allocations at
Creating a bit allocation table for the sub-band, quantizing the audio data of the sub-band for each coding unit with the number of bits allocated based on the bit allocation table, and quantizing the audio data for each coding unit. The audio compression apparatus characterized in that the audio stream is generated by adding the scale factor and the bit distribution table to the audio stream.

2. A sub-band analysis unit that processes digital audio data for each frame of a fixed number of samples and divides the digital audio data into a plurality of sub-bands, and normalizes audio data of each sub-band for each frame. And a scaling unit that uses a scaling factor as a scale factor for a plurality of consecutive frames as a coding unit, and for each of the subbands, a scale factor in the coding unit is shared between frames, and a common scale factor is used. A scale factor selection information calculation unit for generating scale factor selection information representing a state; a bit allocation table unit in which a plurality of bit allocation tables in which the number of bits for each subband is allocated in a specific manner in advance; For each unit, the bit allocation table according to the scale factor selection information is stored in the bit allocation table. A bit allocation table selecting unit that selects from the table unit, is divided into groups of the sub-band analysis unit in divided the subbands multiple bands different for each unit the coding,
A group determining unit that determines a group to which the subband belongs; and, for each of the coding units, based on a determination result of the group determining unit, the distribution of the number of quantization bits among the groups is substantially averaged. A bit allocation allocating unit for allocating the number of bits allocated in the bit allocation table selected by the bit allocation table unit to the sub-band and creating a new bit allocation table; A quantizer for quantizing each audio data by the number of bits allocated by the new bit allocation table; and a new bit for the audio data of the subband of the coding unit quantized by the quantizer. A bit stream for generating an audio stream by adding an allocation table, the common scale factor, and the scale factor selection information. Audio compression apparatus characterized by comprising a generating unit.

3. The bit allocation table according to claim 2, wherein the plurality of bit allocation tables stored in the bit allocation table section have a bit allocation number allocated to all of the available subbands in the coding unit. An audio compression device characterized by the following.

4. The bit allocation table according to claim 2, wherein the plurality of bit allocation tables stored in the bit allocation table unit have a bit allocation number smaller than the number of subbands available in the coding unit. An audio compression device comprising:

5. The bit allocation table selection unit according to claim 2, wherein the bit distribution table selection unit calculates the number of the common scale factors based on the scale factor selection information generated by the scale factor selection information calculation unit. An audio compression device, wherein an optimal bit distribution table is selected according to the number.

6. The sub-band analysis unit according to claim 2, wherein the sub-band analysis unit divides the digital audio data into 32 sub-bands for each coding unit, An audio compression apparatus comprising: dividing a band into a plurality of groups; and determining to which group the subband being processed belongs.

7. The bit allocation allocating unit according to claim 2, wherein the bit allocation allocating unit allocates, for each of the groups, each of the sub-bands based on the scale factor generated by the scaling unit. An audio compression device characterized by forming.

8. The method according to claim 2, wherein the bit allocation allocation unit allocates, for each group, a larger number of bit allocations for the subbands having the larger scale factor generated by the scaling unit. Audio compression device.

9. The method according to claim 2, wherein the bit stream generation unit performs a CRC as needed.
An audio compression device characterized by adding an error detection word using (Cyclic Redundancy Check).

10. The audio compression device according to claim 2, wherein the bit stream generation unit generates the audio stream to which a header and an error detection word are added.