JPH07175499A

JPH07175499A - Device and method for encoding, recording medium and device and method for decoding

Info

Publication number: JPH07175499A
Application number: JP20670294A
Authority: JP
Inventors: Shinji Miyamori; 慎二宮森; Masatoshi Ueno; 正俊上野
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1993-10-26
Filing date: 1994-08-31
Publication date: 1995-07-14
Anticipated expiration: 2019-05-17
Also published as: AU689134B2; AU7745894A; JP3528260B2

Abstract

PURPOSE:To eliminate the redundancy of a bit distribution amount at the time of compression encoding in multichannel and to make compression encoding/ decoding high definition. CONSTITUTION:This device is constituted of an amplitude information detection circuit 200 detecting energy at every digital audio signals of plural channels, a bit distribution decision circuit 500 deciding the bit distribution amounts to respective channels based on the detection result, an encoder 400 compression encoding based on the bit distribution amount distributed at every channel according to the decision of the bit distribution amount and a formater 600 multiplexing a compression encoded signal at every channel, and the bit distribution amount decision circuit 500 is constituted so that the relation between the energy of the signal and the bit distribution amount becomes a nonlinear characteristic where the bit distribution amount is increased according to the increase of the energy of the signal as a whole.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、映画フィルム映写シス
テム、ビデオテープレコーダ、ビデオディスクプレーヤ
等のステレオや、いわゆるマルチサウンド音響システム
において用いられる、マルチチャンネルのオーディオ信
号を圧縮符号化する符号化装置及び方法と、記録媒体、
並びに圧縮符号化された信号を復号化する復号化装置及
び方法に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a coding apparatus for compressing and coding a multi-channel audio signal, which is used in a stereo such as a motion picture film projection system, a video tape recorder, a video disc player and so-called multi-sound audio system. And method and recording medium,
The present invention also relates to a decoding device and method for decoding a compression-coded signal.

【０００２】[0002]

【従来の技術】オーディオ或いは音声等の信号の高能率
符号化の手法及び装置には種々のものが知られている。2. Description of the Related Art Various methods and devices for high-efficiency coding of signals such as audio or voice are known.

【０００３】その手法としては、例えば、時間領域のオ
ーディオ信号等を単位時間毎にブロック化してこのブロ
ック毎の時間軸の信号を周波数軸上の信号に変換（直交
変換）して複数の周波数帯域に分割し、各帯域毎に符号
化するブロック化周波数帯域分割方式、いわゆる変換符
号化（トランスフォームコーティング）がある。As a method thereof, for example, a time domain audio signal or the like is divided into blocks for each unit time, and the time axis signal of each block is converted into a signal on the frequency axis (orthogonal conversion) to obtain a plurality of frequency bands. There is a so-called transform coding (block coating), which is a block frequency band division method in which each frequency band is coded.

【０００４】また、時間領域のオーディオ信号等を単位
時間毎にブロック化しないで、複数の周波数帯域に分割
して符号化する非ブロック化周波数帯域分割方式である
帯域分割符号化（サブバンドコーディング：ＳＢＣ）等
を挙げることができる。Further, band division coding (sub-band coding), which is a non-blocking frequency band division method in which a time domain audio signal or the like is not divided into blocks for each unit time and is divided into a plurality of frequency bands for encoding SBC) etc. can be mentioned.

【０００５】さらに、上述の帯域分割符号化と変換符号
化とを組み合わせた高能率符号化の手法及び装置も考え
られている。この場合には、例えば、入力信号を上記帯
域分割符号化で帯域分割した後、各帯域毎の信号を周波
数領域の信号に直交変換し、この直交変換された各帯域
毎の成分に符号化を施す。Further, a high-efficiency coding method and apparatus combining the above-mentioned band division coding and transform coding has been considered. In this case, for example, after the input signal is band-divided by the band-division coding, the signal for each band is orthogonally transformed into a signal in the frequency domain, and the orthogonally transformed components for each band are coded. Give.

【０００６】ここで、上述した帯域分割符号化の帯域分
割用フィルタとしては、例えばＱＭＦ等のフィルタがあ
り、これは例えば、文献「ディジタル・コーディング・
オブ・スピーチ・イン・サブバンズ」("Digital coding
of speech in subbands" R.E.Crochiere, Bell Syst.
Tech. J., Vol.55,No.8 1976) に述べられている。この
ＱＭＦのフィルタは、帯域を等バンド幅に２分割するも
のであり、当該フィルタにおいては上記分割した帯域を
後に合成する際にいわゆるエリアシングが発生しないこ
とが特徴となっている。[0006] Here, as the band division filter of the above-mentioned band division encoding, there is a filter such as QMF, which is disclosed in, for example, the document "Digital Coding."
Of Speech in Subvans "(" Digital coding
of speech in subbands "RE Crochiere, Bell Syst.
Tech. J., Vol.55, No.8 1976). This QMF filter divides the band into two equal bandwidths, and is characterized in that so-called aliasing does not occur when the divided bands are combined later.

【０００７】また、文献「ポリフェイズ・クァドラチュ
ア・フィルターズ −新しい帯域分割符号化技術」("Po
lyphase Quadrature filters -A new subband coding t
echnique", Joseph H. Rothweiler ICASSP 83, BOSTON)
には、等帯域幅のフィルタ分割手法が述べられている。
このポリフェイズ・クァドラチュア・フィルタにおいて
は、信号を等バンド幅の複数の帯域に分割する際に一度
に分割できることが特徴となっている。In addition, the document "Polyphase Quadrature Filters-New Band Division Coding Technique"("Po
lyphase Quadrature filters -A new subband coding t
echnique ", Joseph H. Rothweiler ICASSP 83, BOSTON)
Describes an equal bandwidth filter partitioning technique.
This polyphase quadrature filter is characterized in that when a signal is divided into a plurality of bands of equal bandwidth, it can be divided at one time.

【０００８】さらに、上述した直交変換としては、例え
ば、入力オーディオ信号を所定単位時間（フレーム）で
ブロック化し、ブロック毎に高速フーリエ変換（ＦＦ
Ｔ）、離散コサイン変換（ＤＣＴ）、モディファイドＤ
ＣＴ変換（ＭＤＣＴ）などを行うことで時間軸を周波数
軸に変換するような直交変換がある。Further, as the above-mentioned orthogonal transform, for example, the input audio signal is divided into blocks in a predetermined unit time (frame), and fast Fourier transform (FF) is performed for each block.
T), discrete cosine transform (DCT), modified D
There is orthogonal transformation in which a time axis is transformed into a frequency axis by performing CT transformation (MDCT) or the like.

【０００９】このＭＤＣＴについては、文献「時間領域
エリアシング・キャンセルを基礎とするフィルタ・バン
ク設計を用いたサブバンド／変換符号化」("Subband/Tr
ansform Coding Using Filter Bank Designs Based on
Time Domain Aliasing Cancellation," J.P.Princen A.
B.Bradley, Univ. of Surrey Royal Melbourne Inst.of
Tech. ICASSP 1987)に述べられている。Regarding this MDCT, the document "Subband / Transform Coding Using Filter Bank Design Based on Time Domain Aliasing Cancellation"("Subband / Tr
ansform Coding Using Filter Bank Designs Based on
Time Domain Aliasing Cancellation, "JPPrincen A.
B. Bradley, Univ. Of Surrey Royal Melbourne Inst.of
Tech. ICASSP 1987).

【００１０】更に、周波数帯域分割された各周波数成分
を量子化する場合の周波数分割幅としては、例えば人間
の聴覚特性を考慮した帯域分割がある。すなわち、一般
に臨界帯域（クリティカルバンド）と呼ばれている高域
程帯域幅が広くなるような帯域幅で、オーディオ信号を
複数（例えば２５バンド）の帯域に分割することがあ
る。Further, as a frequency division width in the case of quantizing each frequency component divided into frequency bands, there is a band division considering human auditory characteristics, for example. That is, an audio signal may be divided into a plurality of bands (for example, 25 bands) with a bandwidth that increases in a higher frequency range generally called a critical band.

【００１１】また、この時の各帯域毎のデータを符号化
する際には、各帯域毎に所定のビット配分(Bit allocat
ion)或いは、各帯域毎に適応的なビット配分による符号
化が行われる。When encoding the data for each band at this time, a predetermined bit allocation (Bit allocat
ion) or coding is performed by adaptive bit allocation for each band.

【００１２】例えば、上記ＭＤＣＴ処理されて得られた
係数データを上記ビット配分によって符号化する際に
は、上記各ブロック毎のＭＤＣＴ処理により得られる各
帯域毎のＭＤＣＴ係数データに対して、適応的な配分ビ
ット数で符号化が行われることになる。For example, when the coefficient data obtained by the MDCT processing is encoded by the bit allocation, it is adaptive to the MDCT coefficient data for each band obtained by the MDCT processing for each block. Encoding will be performed with various allocation bit numbers.

【００１３】上記ビット配分手法及びそのための装置と
しては、次の２手法及び装置が知られている。The following two techniques and devices are known as the above-mentioned bit allocation technique and devices therefor.

【００１４】例えば、文献「音声信号の適応変換符号
化」（"Adaptive Transform Coding of Speech Signal
s", IEEE Transactions of Accoustics, Speech, and S
ignal Processing, vol.ASSP-25, No.4, August 1977
）では、各帯域毎の信号の大きさをもとに、ビット割
当を行っている。For example, the document "Adaptive Transform Coding of Speech Signal"
s ", IEEE Transactions of Accoustics, Speech, and S
ignal Processing, vol.ASSP-25, No.4, August 1977
), Bit allocation is performed based on the signal size of each band.

【００１５】また、例えば文献「臨界帯域符号化器 −
聴覚システムの知覚の要求に関するディジタル符号化」
（"The critical band coder --digital encoding of
theperceptual requirements of the auditory syste
m", M.A.Kransner MIT, ICASSP 1980）では、聴覚マス
キングを利用することで、各帯域毎に必要な信号対雑音
比を得て固定的なビット割当を行う手法及び装置が述べ
られている。Further, for example, in the document "Critical band encoder-
Digital encoding of the auditory system's perceptual requirements "
("The critical band coder --digital encoding of
the perceptual requirements of the auditory syste
m ", MAKransner MIT, ICASSP 1980) describes a method and a device for performing fixed bit allocation by obtaining a necessary signal-to-noise ratio for each band by using auditory masking.

【００１６】[0016]

【発明が解決しようとする課題】ところで、例えば上述
したようなサブバンドコーディング等を用いたオーディ
オ信号の高能率圧縮符号化方式においては、人間の聴覚
上の特性を利用し、オーディオデータを約１／５に圧縮
するような方式が既に実用化されている。By the way, for example, in the high-efficiency compression encoding system for audio signals using the above-mentioned sub-band coding or the like, the human auditory characteristic is utilized to convert the audio data into about 1 A method of compressing to / 5 has already been put into practical use.

【００１７】なお、このオーディオデータを約１／５に
圧縮する高能率符号化方式としては、例えばＭＤ（SONY
社商標、Mini Disc)規格に使用されている、ＡＴＲＡＣ
（SONY社商標、Adaptive TRansform Acoustic Coding)
と呼ばれる方式がある。As a high-efficiency encoding method for compressing the audio data to about 1/5, for example, MD (SONY
ATRAC used in the company's trademark, Mini Disc) standard
(Trademark of SONY, Adaptive TRansform Acoustic Coding)
There is a method called.

【００１８】しかし、上記人間の聴覚上の特性を利用し
た高能率符号化方式では、圧縮符号化してその後復号化
して得られる楽器や人間の声などが、わずかながら、原
音から変化してしまうといった事例も見られる。特に原
音の忠実な再現が必要な記録メデイアの記録フォーマッ
トに用いる場合には、その高音質化が要求されている。However, in the high-efficiency coding method utilizing the human auditory characteristics, the musical instrument or human voice obtained by compression coding and then decoding is slightly changed from the original sound. There are also cases. In particular, when used in a recording medium recording format that requires faithful reproduction of the original sound, high sound quality is required.

【００１９】これに対し、上記オーディオ信号を約１／
５に圧縮するような高能率符号化方式（ＡＴＲＡＣ方式
等）のフォーマットは、既に実用化されていて、このフ
ォーマットが採用されたハードウェアも広まりつつあ
る。On the other hand, the audio signal is about 1 /
A high-efficiency encoding format (such as ATRAC method) that compresses to 5 has already been put into practical use, and hardware adopting this format is becoming widespread.

【００２０】したがって、上記フォーマットの互換性の
無い変更や拡張をすることは、上記フォーマットを使用
してきた生産者だけでなく、一般の使用者にも不利益と
なる。Therefore, incompatible changes or expansions of the above formats are not only disadvantageous to the producers who have used the above formats, but also to general users.

【００２１】このため、フォーマット自身は変更せず
に、エンコードやデコードの際に工夫することによる高
音質化の達成が望まれている。Therefore, it is desired to achieve high sound quality by modifying the format itself without changing the format itself.

【００２２】なお、高音質化の方法としては、他にリニ
アＰＣＭ音声を混在させることが考えられる。しかし、
上記高能率符号化方式の圧縮データとリニアデータとで
は、フレームの長さや１フレーム当たりの時間長が異な
るため、再生時に同期を取ることが難しい。従って、こ
れら２つのフォーマットのデータを同時に用いることは
非常に困難である。As a method of improving the sound quality, it is conceivable to mix linear PCM sound. But,
Since the frame length and the time length per frame are different between the compressed data of the high efficiency coding system and the linear data, it is difficult to synchronize the reproduction. Therefore, it is very difficult to use the data in these two formats at the same time.

【００２３】さらに、通常のオーディオ機器の場合のみ
ならず、例えば映画フィルム映写システム、高品位テレ
ビジョン、ビデオテープレコーダ、ビデオディスクプレ
ーヤ等のステレオないしはマルチサウンド音響システム
においては、４〜８チャンネルの複数チャンネルのオー
ディオ信号を扱うようになりつつある。この場合におい
ても、ビットレートを削減する高能率符号化を行うこと
が望まれている。Further, not only in the case of ordinary audio equipment but also in stereo or multi-sound sound systems such as movie film projection systems, high-definition televisions, video tape recorders, video disc players, etc., a plurality of 4 to 8 channels are used. It is beginning to handle audio signals of channels. Even in this case, it is desired to perform high efficiency coding that reduces the bit rate.

【００２４】特に、上記映画フィルムにおいて、例えば
レフトチャンネル，レフトセンターチャンネル，センタ
ーチャンネル，ライトセンターチャンネル，ライトチャ
ンネル，サラウンドレフトチャンネル，サラウンドライ
トチャンネル，サブウーファーチャンネルの８チャンネ
ルのディジタルオーディオ信号を記録するような場合が
ある。この場合には、上記ビットレートを削減する高能
率符号化が必要となる。Particularly, in the above-mentioned motion picture film, for example, eight channels of digital audio signals of a left channel, a left center channel, a center channel, a right center channel, a right channel, a surround left channel, a surround right channel and a subwoofer channel are recorded. There is a case. In this case, high-efficiency coding that reduces the bit rate is required.

【００２５】すなわち、いわゆるＣＤ（コンパクトディ
スク）などで用いているようなサンプリング周波数４
４．１ｋＨｚで１６ビットの直線量子化されたオーディ
オデータの上記８チャンネル分を記録できる領域を、上
記映画フィルム上に確保することは困難である。したが
って、当該オーディオデータの圧縮が必要になる。That is, the sampling frequency 4 as used in so-called CDs (compact discs), etc.
It is difficult to secure an area on the motion picture film where the above 8 channels of 16-bit linearly quantized audio data at 4.1 kHz can be recorded. Therefore, it is necessary to compress the audio data.

【００２６】なお、上記映画フィルムに記録する８チャ
ンネルの各チャンネルは、例えば映画フィルムの画像記
録領域から再生された画像が映写機によって投影される
スクリーン側に配置された、レフトスピーカ、レフトセ
ンタースピーカ、センタスピーカ、ライトセンタスピー
カ、ライトスピーカ、サラウンドレフトスピーカ、サラ
ウンドライトスピーカ、サブウーファースピーカとそれ
ぞれ対応するものである。Each of the eight channels recorded on the movie film is, for example, a left speaker, a left center speaker, which is arranged on the screen side on which an image reproduced from the image recording area of the movie film is projected by a projector. The speaker corresponds to a center speaker, a right center speaker, a right speaker, a surround left speaker, a surround right speaker, and a subwoofer speaker.

【００２７】ここで、上記センタスピーカは、スクリー
ン側の中央に配置され、センタチャンネルのオーディオ
データによる再生音を出力するものである。例えば俳優
のせりふ等の最も重要な再生音を出力する。Here, the center speaker is arranged at the center of the screen side and outputs a reproduced sound based on the audio data of the center channel. For example, the most important reproduced sound such as the dialogue of an actor is output.

【００２８】上記サブウーファースピーカは、サブウー
ファーチャンネルのオーディオデータによる再生音を出
力するものである。例えば爆発音などの低域の音という
よりは振動として感じられる音を効果的に出力するもの
であり、爆発シーンなどに効果的に使用されることが多
いものである。The subwoofer speaker outputs a reproduced sound based on audio data of the subwoofer channel. For example, it effectively outputs a sound that is felt as vibration rather than a low-frequency sound such as an explosion sound, and is often used effectively in an explosion scene or the like.

【００２９】上記レフトスピーカ及びライトスピーカ
は、上記スクリーンの左右に配置され、レフトチャンネ
ルのオーディオデータによる再生音とライトチャンネル
のオーディオデータによる再生音を出力するもので、ス
テレオ音響効果を発揮する。The left speaker and the right speaker are arranged on the left and right sides of the screen and output a reproduced sound by the left channel audio data and a reproduced sound by the right channel audio data, and exhibit a stereo sound effect.

【００３０】上記レフトセンタスピーカは、上記レフト
スピーカとセンタスピーカとの間に配置され、また上記
ライトセンタスピーカは、上記センタスピーカとライト
スピーカとの間に配置されるものである。上記レフトセ
ンタスピーカは、レフトセンタチャンネルのオーディオ
データによる再生音を出力し、上記ライトセンタスピー
カは、ライトセンタチャンネルのオーディオデータによ
る再生音を出力するものである。それぞれ上記レフトス
ピーカ若しくはライトスピーカの補助的な役割を果た
す。The left center speaker is arranged between the left speaker and the center speaker, and the right center speaker is arranged between the center speaker and the right speaker. The left center speaker outputs a reproduced sound based on the audio data of the left center channel, and the right center speaker outputs a reproduced sound based on the audio data of the right center channel. Each plays an auxiliary role of the left speaker or the right speaker.

【００３１】特にスクリーンが大きく収容人数の多い映
画館等では、座席の位置によって音像の定位が不安定に
なるという欠点がある。しかし、上記レフトセンタスピ
ーカとライトセンタスピーカを付加することにより、音
像のよりリアルな定位を作り出すのに効果を発揮する。Particularly in a movie theater or the like having a large screen and a large number of people, there is a drawback that the localization of the sound image becomes unstable depending on the position of the seat. However, the addition of the left center speaker and the right center speaker is effective in creating a more realistic localization of the sound image.

【００３２】さらに、上記サラウンドレフトスピーカと
サラウンドライトスピーカは、観客席を取り囲むように
配置される。サラウンドレフトチャンネルのオーディオ
データによる再生音と、サラウンドライトチャンネルの
オーディオデータによる再生音を出力するもので、残響
音や拍手、歓声に包まれた印象を与える効果がある。こ
れにより、より立体的な音像を作り出すことができる。Further, the surround left speaker and the surround right speaker are arranged so as to surround the audience seats. It outputs the reproduced sound based on the audio data of the surround left channel and the reproduced sound based on the audio data of the surround right channel, and has the effect of giving the impression of reverberation, applause, and cheers. As a result, a more stereoscopic sound image can be created.

【００３３】また、映画フィルムという媒体は、表面に
傷などが発生しやすいため、ディジタルデータをオリジ
ナルのまま記録していたのでは、データ欠けが激しく実
用にならない。このため、エラー訂正符号の能力が非常
に重要である。Further, since a medium called a motion picture film is apt to have scratches on its surface, if digital data is recorded as it is, the data will be seriously lost and it will not be put to practical use. For this reason, the capability of the error correction code is very important.

【００３４】従って、上記データ圧縮は、その訂正符号
のためのビットも考慮して、上記フィルム上の記録領域
に記録可能な程度まで圧縮処理を行う必要がある。Therefore, in the data compression, it is necessary to perform the compression processing to the extent that the data can be recorded in the recording area on the film in consideration of the bit for the correction code.

【００３５】以上より、上記８チャンネルのディジタル
オーディオデータを圧縮する圧縮方法としては、上述し
たような人間の聴覚の特性を考慮して最適なビット割り
当てを行うことによって、ＣＤ並の音質を達成する前記
高能率符号化方式（例えば上記ＡＴＲＡＣ方式）を適用
するようにしている。From the above, as a compression method for compressing the above-mentioned 8-channel digital audio data, optimum bit allocation is performed in consideration of the characteristics of human hearing as described above, thereby achieving a sound quality comparable to that of a CD. The high efficiency coding method (for example, the ATRAC method described above) is applied.

【００３６】しかし、当該高能率符号化方式では、前述
同様に一般の楽器や人間の声などが原音からわずかなが
ら変化するため、特に原音に忠実な再現を必要とする記
録フォーマットに採用する場合には、何らかの高音質化
の手段が必要となってくる。However, in the high-efficiency coding method, the general musical instrument or human voice slightly changes from the original sound as described above. Therefore, when it is adopted in a recording format that requires faithful reproduction to the original sound. Requires some means of improving the sound quality.

【００３７】そしてこの問題は、上記映画フィルムにお
いて、マルチチャンネル記録フォーマットとして、上記
高能率符号化方式以外を用いた場合、記録領域確保の点
から非可逆圧縮を採用する以上、常に存在する問題であ
る。This problem is always present when lossy compression is adopted from the viewpoint of securing the recording area when a multi-channel recording format other than the above high-efficiency encoding method is used in the above-mentioned motion picture film. is there.

【００３８】また、上述のようなマルチチャンネルのオ
ーディオ信号を高能率符号化する方式では、各チャンネ
ルが独立して圧縮処理が行われる。Further, in the above-described method of highly efficient encoding of multi-channel audio signals, each channel is independently compressed.

【００３９】そのため、例えば、ある１つのチャンネル
が無音状態であっても、そのチャンネルに固定ビット
（バイト）配分量が割り当てられることになる。Therefore, for example, even if a certain channel is in a silent state, a fixed bit (byte) allocation amount is assigned to that channel.

【００４０】このように、無音状態のチャンネルに固定
のビット配分量を与えることは、冗長である。As described above, it is redundant to give a fixed bit allocation amount to a silent channel.

【００４１】また、レベルの低い信号のチャンネルと、
高い信号のチャンネルとについても、ビット配分量が同
じであるため、各チャンネルにわたってビット配分量を
評価すると、冗長なビットが存在する。In addition, a channel of a low level signal,
Since the bit allocation amount is the same for the high signal channel as well, when the bit allocation amount is evaluated over each channel, there are redundant bits.

【００４２】特に、各チャンネル毎にビット配分量が固
定されている場合には、上記のような冗長がさらに顕著
になると考えられる。In particular, when the bit allocation amount is fixed for each channel, it is considered that the above redundancy becomes more remarkable.

【００４３】そこで、本発明は、上述したようなことに
鑑み、マルチチャンネルでの圧縮符号化の際のビット配
分量の冗長を無くすと共に、圧縮符号化復号化の高品位
化を可能とする符号化装置及び方法、それに対応する復
号化装置と、圧縮符号化された信号が記録される記録媒
体を提供することを目的としている。Therefore, in view of the above, the present invention eliminates the redundancy of the bit allocation amount at the time of multi-channel compression encoding, and also enables the high-quality compression encoding / decoding. It is an object of the present invention to provide an encoding device and method, a corresponding decoding device, and a recording medium on which a compression-encoded signal is recorded.

【００４４】[0044]

【課題を解決するための手段】本発明は、上述の目的を
達成するために提案されたものであり、本発明の符号化
方法が適用される符号化装置（高能率符号化装置）は、
複数チャンネルの信号を各々圧縮符号化する圧縮符号化
手段と、上記圧縮符号化前の各チャンネルの信号のエネ
ルギを検出するエネルギ検出手段と、上記エネルギの時
間的な変化に基づいて各チャンネルへのビット配分量を
決定するビット配分量決定手段とを有し、上記エネルギ
とビット配分量との関係が非線形とされ、複数チャンネ
ルの信号の時間領域サンプル若しくは周波数領域サンプ
ルに対してチャンネル間で可変ビット配分を行うように
したものである。DISCLOSURE OF THE INVENTION The present invention has been proposed in order to achieve the above object, and an encoding device (high efficiency encoding device) to which the encoding method of the present invention is applied is
Compression encoding means for compressing and encoding the signals of a plurality of channels, energy detecting means for detecting the energy of the signal of each channel before the compression encoding, and for each channel based on the temporal change of the energy A bit allocation amount determining means for determining a bit allocation amount, wherein the relationship between the energy and the bit allocation amount is non-linear, and variable bits between channels for time domain samples or frequency domain samples of signals of a plurality of channels It is designed to be distributed.

【００４５】本発明の第１の実施例の高能率符号化装置
は、上記エネルギ検出手段が、上記圧縮符号化前の各チ
ャンネルの信号の振幅情報を検出する振幅情報検出手段
であり、これと上記振幅情報の時間的な変化に基づいて
各チャンネルへのビット配分量を決定するビット配分量
決定手段で構成されることを特徴とする。In the high efficiency coding apparatus according to the first embodiment of the present invention, the energy detecting means is amplitude information detecting means for detecting the amplitude information of the signal of each channel before the compression coding. It is characterized by comprising bit allocation amount determining means for determining the bit allocation amount to each channel based on the temporal change of the amplitude information.

【００４６】ここで、前記ビット配分量決定手段は、聴
覚特性に基づいて各チャンネルの振幅情報のピーク値に
対するビット配分量を所定の換算式より換算し、当該換
算結果に基づいて各チャンネルに配分すべきビット量を
決定する。Here, the bit allocation amount determining means converts the bit allocation amount with respect to the peak value of the amplitude information of each channel based on the auditory characteristic by a predetermined conversion formula, and allocates to each channel based on the conversion result. Determine the amount of bits to do.

【００４７】また、前記ビット配分量決定手段は、所定
の換算式から各チャンネルに配分すべきビット量の概算
量をそれぞれ求め、各チャンネルのビット配分量をそれ
ぞれの概算量に比例して配分することによって全チャン
ネルの総ビット配分量を一定とする。Further, the bit allocation amount determining means obtains an approximate amount of bits to be allocated to each channel from a predetermined conversion formula, and allocates the bit allocation amount of each channel in proportion to each approximate amount. As a result, the total bit allocation amount of all channels is made constant.

【００４８】さらに、本発明の第１の実施例の高能率復
号化装置は、上記第１の実施例の高能率符号化装置によ
って符号化された各チャンネルの信号を復号化する復号
化手段を有するものである。Further, the high-efficiency decoding apparatus according to the first embodiment of the present invention comprises decoding means for decoding the signals of the respective channels encoded by the high-efficiency encoding apparatus according to the first embodiment. I have.

【００４９】また、本発明の第２の実施例の高能率符号
化装置は、上記エネルギ検出手段が、上記各チャンネル
の信号に対する所定のスケールファクタ（時間と周波数
の２次元領域（ブロックフローティングユニット）の正
規化した値）の時間的変化を検出する手段であり、上記
スケールファクタの変化に応じて、チャンネル間で可変
ビット配分を行うようにしたものである。Further, in the high-efficiency coding apparatus according to the second embodiment of the present invention, the energy detecting means has a predetermined scale factor (two-dimensional area of time and frequency (block floating unit)) for the signal of each channel. Is a means for detecting a temporal change in (normalized value of), and is adapted to perform variable bit allocation among channels according to the change in the scale factor.

【００５０】ここで、第２の実施例の高能率符号化装置
でも、前記ビット配分量決定手段は、人間の聴覚の特性
に基づいて各チャンネルのスケールファクタの総和の時
間的変化に対するビット配分量を所定の換算式より換算
し、当該換算結果に基づいて各チャンネルに配分すべき
ビット量を決定する。Here, also in the high-efficiency coding apparatus of the second embodiment, the bit allocation amount determining means determines the bit allocation amount with respect to the temporal change of the sum of the scale factors of the respective channels based on the characteristics of human hearing. Is converted by a predetermined conversion formula, and the bit amount to be distributed to each channel is determined based on the conversion result.

【００５１】さらに、前記ビット配分量決定手段は、所
定の換算式から各チャンネルに配分すべきビット量の概
算量をそれぞれ求め、各チャンネルのビット配分量をそ
れぞれの概算量に比例して配分することによって全チャ
ンネルの総ビット配分量を一定とする。Further, the bit allocation amount determining means obtains an approximate amount of bit amount to be allocated to each channel from a predetermined conversion formula, and allocates the bit allocation amount of each channel in proportion to each approximate amount. As a result, the total bit allocation amount of all channels is made constant.

【００５２】また、本発明の第２の実施例の高能率復号
化装置は、上記第２の実施例の高能率符号化装置によっ
て符号化された各チャンネルの信号を復号化する復号化
手段を有するものである。Further, the high-efficiency decoding apparatus according to the second embodiment of the present invention comprises decoding means for decoding the signals of the respective channels encoded by the high-efficiency encoding apparatus according to the second embodiment. I have.

【００５３】[0053]

【作用】本発明によれば、複数チャンネルのオーディオ
データの圧縮符号化の際には、各チャンネルのエネルギ
の時間的な変化に基づいて、各チャンネルへのビット配
分量を決定して圧縮符号化を行うようにしているため、
各チャンネルに対してその情報量に見合ったビット配分
が可能となる。According to the present invention, when compressing and coding audio data of a plurality of channels, the bit allocation amount to each channel is determined based on the temporal change of the energy of each channel and the coding is performed. Because I am trying to do
It is possible to allocate bits corresponding to the amount of information to each channel.

【００５４】また、本発明によれば、複数チャンネルの
オーディオデータの圧縮符号化の際には、各チャンネル
でのエネルギとビット配分量とが非線形に関係付けら
れ、そのビット配分量に基づき圧縮符号化を行うように
しているため、各チャンネルに対してその情報量に見合
ったビット配分が可能となる。Further, according to the present invention, when compressing and coding audio data of a plurality of channels, energy and bit allocation amount in each channel are associated with each other in a non-linear manner, and the compression code is based on the bit allocation amount. Since the conversion is performed, it is possible to allocate bits to each channel according to the amount of information.

【００５５】[0055]

【実施例】以下、本発明の実施例について図面を参照し
ながら説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００５６】図１及び図２に本発明の第１の実施例装置
の基本的な構造を示す。図１には第１の実施例の高能率
符号化装置（エンコーダ）の構成を示し、図２には第１
の実施例の高能率復号化装置（デコーダ）の構成を示し
ている。FIG. 1 and FIG. 2 show the basic structure of the first embodiment device of the present invention. FIG. 1 shows the configuration of the high-efficiency coding apparatus (encoder) of the first embodiment, and FIG.
2 shows the configuration of a high-efficiency decoding device (decoder) of the embodiment.

【００５７】先ず、図１に示すエンコーダの構成につい
て説明する。First, the configuration of the encoder shown in FIG. 1 will be described.

【００５８】複数チャンネル（ｃｈ１，ｃｈ２，・・
・，ｃｈｎ）のオーディオ信号は、これら各チャンネル
に対応する各入力端子２０₁〜２０_n及び伝送線路１₁
〜１_nを経て、同じく各チャンネルに対応する標本化及
び量子化器１００₁〜１００_ｎに送られる。これら標本
化及び量子化器１００_１〜１００_nでは各チャンネル
のオーディオ信号が量子化信号に変換される。これら各
標本化及び量子化器１００₁〜１００_nからの量子化さ
れた信号は、各伝送線路２₁〜２_nを経て、振幅情報検
出回路２００と、ディレイライン３００₁〜３００_nに
送られる。Multiple channels (ch1, ch2, ...
- audio signal chn), each input terminal corresponding to the respective channels 20 ₁ to 20 _n and the transmission line 1 ₁
.About.1 _n and then sent to the sampling and quantizers 100 _{1 to} 100 _n , which also correspond to the respective channels. In these sampling and quantizers 100 _{1 to} 100 _n , the audio signal of each channel is converted into a quantized signal. These quantized signals from the respective sampling and quantizer 100 ₁ to 100 _n is, through the transmission lines 2 ₁ to 2 _n, the amplitude information detecting circuit 200, is sent to a delay line 300 ₁ to 300 _n .

【００５９】上記振幅情報検出回路２００は、上記各チ
ャンネルの量子化された信号から振幅情報を検出する。
すなわち、当該振幅情報検出回路２００では、後述する
符号化器４００₁〜４００_nが一度に処理するオーディ
オデータのサンプル数分の周期毎（以後時間ブロックと
呼ぶ）に振幅情報のピーク値を求め、各チャンネルに対
応する伝送線路４₁〜４_nを経て当該ピーク値をビット
配分決定回路５００へ渡す。なお、当該振幅情報検出回
路２００は、伝送線路１₁〜１_nからの信号によって振
幅情報を検出するような構成にすることも可能である。The amplitude information detection circuit 200 detects amplitude information from the quantized signal of each channel.
That is, in the amplitude information detection circuit 200, the peak value of the amplitude information is obtained for each cycle (hereinafter referred to as a time block) of the number of samples of audio data processed by the encoders 400 _{1 to} 400 _n described later at a time, through the transmission lines 4 ₁ to 4 _n corresponding to respective channels pass the peak value to the bit allocation determining circuit 500. The amplitude information detection circuit 200 can also be configured to detect the amplitude information from the signals from the transmission lines 1 ₁ to 1 _n .

【００６０】上記ビット配分決定回路５００では、上記
各チャンネル毎のピーク値から各チャンネル毎のビット
配分量を後述するように換算し、当該ビット配分量を伝
送線路５₁〜５_nを経て各符号化器４００₁〜４００_n
に渡す。In the bit allocation determining circuit 500, the bit allocation amount for each channel is converted from the peak value for each channel as described later, and the bit allocation amount is passed through the transmission lines 5 _{1 to} 5 _n for each code. Chemist 400 _{1 to} 400 _n
Pass to.

【００６１】また、上記ディレイライン３００₁〜３０
０_nでは、伝送線路２₁〜２_nを介して受け取った信号
を上記時間ブロック分だけ遅延させ、当該遅延させた信
号を各伝送線路３₁〜３_nを介して各符号化器４００₁
〜４００_nへ渡す。In addition, the delay lines 300 ₁ to 30
At 0 _n , the signals received via the transmission lines 2 ₁ to 2 _n are delayed by the time block, and the delayed signals are transmitted via the transmission lines 3 _{1 to} 3 _n to the encoders 400 ₁
Pass to ~ 400 _n .

【００６２】各符号化器４００₁〜４００_nでは、上記
時間ブロック毎に圧縮動作を行う。このときの伝送線路
５₁〜５_nを介して受け取るビット配分量は、各ディレ
イライン３００₁〜３００_nでの遅延によって上記伝送
線路３₁〜３_nで受け取る信号のピーク情報を反映し
たものとなっている。各符号化器４００₁〜４００_nで
は、上記伝送線路５₁〜５_nを介して受け取ったビット
配分量まで上記伝送線路３₁〜３_nを介して受け取った
信号を圧縮し、当該圧縮した信号を各伝送線路６₁〜６
_nを経てフォーマッタ６００へ渡す。In each of the encoders 400 _{1 to} 400 _n , the compression operation is performed for each time block. Bit allocation amount received via transmission line 5 ₁ to 5 _n In this case, as reflecting the peak information of the signal received by the transmission line 3 ₁ to 3 _n by the delay in each delay line 300 ₁ to 300 _n Has become. In each of the encoders 400 _{1 to} 400 _n , the signals received via the transmission lines 3 _{1 to} 3 _n are compressed up to the bit allocation amount received via the transmission lines 5 _{1 to} 5 _n , and the compressed signals are compressed. Each transmission line 6 _{1 to} 6
_It is passed to the formatter 600 via _n .

【００６３】上記フォーマッタ６００は、上記各伝送線
路６₁〜６_nを経て受け取った上記各チャンネル毎の被
圧縮信号を、所定のフォーマットに従って、エラー訂正
処理を施して、伝送又は記録媒体への記録のためのビッ
トストリームへ組み立てる。このビットストリームは、
伝送線路７を経て出力端子２１から出力される。The formatter 600 performs error correction processing on the compressed signals for each of the channels received via the transmission lines 6 _{1 to} 6 _n according to a predetermined format, and then transmits or records them on a recording medium. Assemble into a bitstream for. This bitstream is
It is output from the output terminal 21 via the transmission line 7.

【００６４】更にこのビットストリームは、例えばレー
ザー記録装置２６により、映画フィルム２７上の所定の
記録エリア２８に書き込まれる。尚、図中の指示符号２
９はパーフォレーションを示し、フィルム送りのために
図示しない映写機のスプロケットが噛み合うための孔で
あり、上記記録エリア２８は例えば上記パーフォレーシ
ョン２９間に設けられる。Further, this bit stream is written in a predetermined recording area 28 on the motion picture film 27 by the laser recording device 26, for example. The reference numeral 2 in the figure
Reference numeral 9 denotes a perforation, which is a hole for engaging a sprocket of a projector (not shown) for feeding the film, and the recording area 28 is provided between the perforations 29, for example.

【００６５】次に、本実施例のデコーダ（高能率復号化
装置）の構成について説明する。Next, the configuration of the decoder (high efficiency decoding apparatus) of this embodiment will be described.

【００６６】上記図１のエンコーダ（高能率符号化装
置）で組み立てられたビットストリームは、伝送又は記
録媒体に記録される。この記録されたビットストリーム
は、図示しない所定の再生装置を経て、入力端子２２に
供給され、この入力端子２２から伝送線路８を経て、デ
フォーマッタ７００に送られてくる。The bit stream assembled by the encoder of FIG. 1 (high efficiency coding apparatus) is transmitted or recorded on a recording medium. The recorded bit stream is supplied to the input terminal 22 via a predetermined reproducing device (not shown), and is sent from the input terminal 22 to the deformatter 700 via the transmission line 8.

【００６７】当該デフォーマッタ７００では、上記伝送
線路８を介して送られてきたビットストリームを、所定
のフォーマットに従って各チャンネル毎の被圧縮信号に
分解する。当該各チャンネル毎に分解された被圧縮信号
は、各チャンネルに対応する各伝送線路９₁〜９_nを経
て、各チャンネル毎に対応して設けられた復号器８００
₁〜８００_nへ送られる。In the deformatter 700, the bit stream sent via the transmission line 8 is decomposed into compressed signals for each channel according to a predetermined format. The compressed signal decomposed for each channel passes through the transmission lines 9 _{1 to} 9 _n corresponding to each channel, and the decoder 800 provided corresponding to each channel.
_{1 to} 800 _n .

【００６８】各復号器８００₁〜８００_nでは、上記各
伝送線路９₁〜９_nを経て送られてきた被圧縮信号を伸
長し、対応する各伝送線路１０₁〜１０_nを経て、Ｄ／
Ａ（ディジタル／アナログ）変換器９００₁〜９００_n
へ送る。In each of the decoders 800 _{1 to} 800 _n , the compressed signal sent via each of the above transmission lines 9 _{1 to} 9 _n is expanded, and through each corresponding transmission line 10 ₁ to 10 _n , D /
A (digital / analog) converter 900 _{1 to} 900 _n
Send to.

【００６９】各Ｄ／Ａ変換器９００₁〜９００_nでは、
上記各伝送線路１０₁〜１０_nを経て送られてきた上記
伸長された信号（ディジタル信号）を、アナログ信号に
変換する。これらアナログに戻された信号は、それぞれ
対応する各伝送線路１１₁〜１１_n及び出力端子２３₁
〜２３_nを介して、各チャンネルｃｈ１〜ｃｈｎの復号
化された信号として出力される。In each of the D / A converters 900 _{1 to} 900 _n ,
The expanded signal (digital signal) sent through each of the transmission lines 10 ₁ to 10 _n is converted into an analog signal. The signals returned to analog are respectively associated with the respective transmission lines 11 _{1 to} 11 _n and the output terminal 23 _1.
23 to 23 _n and output as decoded signals of the respective channels ch1 to chn.

【００７０】上述したような本実施例の高能率符号化装
置において利用する圧縮符号化手法は、ビットレートを
可変することが出来るものであればすべてに応用が可能
である。ここでは、前述した人間の聴覚特性を利用し、
ステレオ２チャンネルのオーディオ信号を固定ビットレ
ートで約１／５に圧縮する圧縮符号化手法（例えばいわ
ゆるＭＤ（ミニディスク：Mini Disc)に用いられるＡＴ
ＲＡＣ方式）を例に挙げ、当該固定ビットレートの圧縮
を可変ビットレートにする本実施例の圧縮符号化の方法
について述べる。The compression coding method used in the high-efficiency coding apparatus of the present embodiment as described above can be applied to all as long as the bit rate can be changed. Here, using the human auditory characteristics described above,
A compression encoding method for compressing a stereo two-channel audio signal to about 1/5 at a fixed bit rate (for example, AT used in so-called MD (Mini Disc)).
(RAC system) as an example, the compression encoding method of this embodiment in which the compression of the fixed bit rate is changed to the variable bit rate will be described.

【００７１】図３にはいわゆるＡＴＲＡＣ方式が適用さ
れる符号化の構成を示す。なお、この図３の帯域分割フ
ィルタ４０１から再量子化器４０６，フォーマッタ４０
７までの構成は、図１の各チャンネルの各符号化器４０
０₁〜４００_nと対応するものである。FIG. 3 shows a coding structure to which the so-called ATRAC method is applied. It should be noted that the band quantizing filter 401 of FIG.
The configuration up to 7 is applied to each encoder 40 of each channel of FIG.
This corresponds to 0 _{1 to} 400 _n .

【００７２】この図３において、入力端子２４を介して
供給された標本化及び量子化されたオーディオデータ
は、先ず、帯域分割フィルタ４０１によって０〜５．５
ｋＨｚの低域と、５．５ｋＨｚ〜１１ｋＨｚの中域と、
１１ｋＨｚ以上（１１ｋＨｚ〜２２ｋＨｚ）の３つの周
波数帯域に分割される。In FIG. 3, the sampled and quantized audio data supplied through the input terminal 24 is first 0-5.5 by the band division filter 401.
low range of kHz and mid range of 5.5 kHz to 11 kHz,
It is divided into three frequency bands of 11 kHz or higher (11 kHz to 22 kHz).

【００７３】これら３つの周波数帯域の信号のうち、上
記帯域分割フィルタ４０１からの上記低域の信号はＭＤ
ＣＴ（Modified Discrete Cosine Transform：改良型離
散余弦変換）演算を行うＭＤＣＴ回路４０２Ｌに、中域
の信号は同じくＭＤＣＴ演算を行うＭＤＣＴ回路４０２
Ｍに、また、高域の信号はＭＤＣＴ回路４０２Ｈに送ら
れ、これらＭＤＣＴ回路４０２Ｌ〜４０２Ｈでそれぞれ
周波数成分に分解される。Of the signals in these three frequency bands, the low-frequency signal from the band-dividing filter 401 is MD.
An MDCT circuit 402L that performs CT (Modified Discrete Cosine Transform) calculation, and an MDCT circuit 402L that similarly performs MDCT calculation for mid-range signals
The signal of M and the high frequency signal are sent to the MDCT circuit 402H and are decomposed into frequency components by these MDCT circuits 402L to 402H.

【００７４】このとき、上記ＭＤＣＴを施すときの時間
ブロック長は、各周波数帯域毎に可変であり、信号が急
激に変化する部分では、時間ブロック長を短くして、時
間分解能を高め、信号が定常的な部分では時間ブロック
長を長くして、信号成分の有効伝送と量子化雑音を制御
する。At this time, the time block length when applying the MDCT is variable for each frequency band, and in the portion where the signal changes abruptly, the time block length is shortened to improve the time resolution and the signal is In the stationary part, the time block length is increased to control the effective transmission of signal components and the quantization noise.

【００７５】この時間ブロック長は、ブロックサイズ評
価器４０３にて決定されている。すなわち、上記帯域分
割フィルタ４０１からの３つの周波数帯域の信号は、ブ
ロックサイズ評価器４０３にも送られ、当該ブロックサ
イズ評価器４０３が上記ＭＤＣＴの時間ブロック長を決
定し、この決定した時間ブロック長を示す情報を上記Ｍ
ＤＣＴ回路４０２Ｌ〜４０２Ｈに送るようにしている。The time block length is determined by the block size evaluator 403. That is, the signals of the three frequency bands from the band division filter 401 are also sent to the block size evaluator 403, the block size evaluator 403 determines the time block length of the MDCT, and the determined time block length is determined. The information indicating
The data is sent to the DCT circuits 402L to 402H.

【００７６】なお、上記ＭＤＣＴでの２種類の時間ブロ
ック長のうち、長い時間ブロック長を使用するモードは
ロングモードと呼ばれ、１１．６ｍｓの時間のブロック
長を有する。また、短い時間ブロック長を使用するモー
ドはショートモードと呼ばれ、高域（１１ｋＨｚ以上）
で１．４５ｍｓの時間のブロック長を有し、、低域
（５．５ｋＨｚ以下）及び中域（５．５ｋＨｚから１１
ｋＨｚ）では２．９ｍｓの時間のブロック長を有するこ
とで、時間分解能を上げるようにしている。A mode using a long time block length of the two types of time block lengths in the MDCT is called a long mode and has a block length of 11.6 ms. In addition, the mode that uses a short time block length is called the short mode, and it is in the high range (11 kHz or more).
With a block length of 1.45 ms at low frequencies (less than 5.5 kHz) and mid-range (5.5 kHz to 11 kHz).
At (kHz), the time resolution is improved by having a block length of 2.9 ms.

【００７７】このようにして、時間と周波数の２次元領
域（これをブロックフローティングユニット：Block Fl
oating Unit と呼ぶ）上の信号成分に分解されたオーデ
ィオ信号は、正規化回路４０４Ｌ〜４０４Ｈによって低
域，中域，高域で合計５２個のブロックフローティング
ユニットに分けられると共に、ユニット毎に正規化され
る（スケールファクタの決定がなされる）。In this way, a two-dimensional area of time and frequency (this is a block floating unit: Block Fl
The audio signal decomposed into the above signal components is divided into a total of 52 block floating units in the low band, the middle band, and the high band by the normalization circuits 404L to 404H, and is normalized for each unit. (The scale factor is determined).

【００７８】また、上記ビット配分器４０５では、人間
の聴覚の特性を利用して、そのオーディオ信号がどのよ
うな成分から構成されているかを分析する。この分析結
果が上記正規化回路４０４Ｌ〜４０４Ｈからの各ユニッ
ト毎の信号が供給される再量子化器４０６に送られる。Further, the bit allocator 405 analyzes what kind of component the audio signal is composed of by utilizing the characteristics of human hearing. The result of this analysis is sent to the requantizer 406 to which the signals for each unit from the normalization circuits 404L to 404H are supplied.

【００７９】当該再量子化器４０６は、上記分析結果に
基づいて、各ユニットをどの程度の精度で符号化するか
を求めて、即ちワードレングスの決定を行い、パラメー
タを得ると共に、再量子化を行う。The requantizer 406 determines, based on the analysis result, with what accuracy each unit is to be encoded, that is, determines the word length, obtains the parameter, and requantizes I do.

【００８０】最後に、フォーマッタ４０７では、各ユニ
ット毎の各パラメータ情報と再量子化されたスペクトラ
ム信号とを、所定のフォーマットに従って多重化し、ビ
ットストリームとする。このフォーマッタ４０７の出力
が出力端子２５から出力される。Finally, the formatter 407 multiplexes each parameter information for each unit and the requantized spectrum signal according to a predetermined format to form a bit stream. The output of the formatter 407 is output from the output terminal 25.

【００８１】ここで、上述したような符号化の動作はサ
ウンドフレームという単位毎に行われる。Here, the above-described encoding operation is performed for each unit called a sound frame.

【００８２】図４には、当該サウンドフレーム４０内の
データの記録の様子を示す。FIG. 4 shows how the data in the sound frame 40 is recorded.

【００８３】この図４において、１サウンドフレームは
２１２ビットからなり、ここに４４．１ｋＨｚのサンプ
リングレートで５１２サンプル、１チャンネル相当のオ
ーディオ再生用データが圧縮符号化されている。In FIG. 4, one sound frame consists of 212 bits, in which 512 samples and audio reproduction data corresponding to one channel are compression-encoded at a sampling rate of 44.1 kHz.

【００８４】上記２１２ビットのサウンドフレームデー
タは、ブロックサイズモード４１、サブブインフオメー
ション量４２、ワードレングスデータ４３、スケールフ
ァクタデータ４４、スペクトラムデータ４５、冗長スケ
ールファクタデータ４６、冗長ワードレングスデータ４
７、下部のサブインフオメーション量４８、及び、下部
のブロックサイズモード４９から構成される。The 212-bit sound frame data includes the block size mode 41, the sub-information amount 42, the word length data 43, the scale factor data 44, the spectrum data 45, the redundant scale factor data 46, and the redundant word length data 4.
7, a lower sub-information amount 48, and a lower block size mode 49.

【００８５】ここで、２１２ビットのデータの中には、
エラー訂正用の２度書き部分が含まれている。即ち、冗
長スケールファクタデータ４６、冗長ワードレングスデ
ータ４７、下部サブインフォメーション量４８、下部ブ
ロックサイズモード４９である。Here, in the 212-bit data,
It contains a double-writing part for error correction. That is, the redundant scale factor data 46, the redundant word length data 47, the lower sub information amount 48, and the lower block size mode 49.

【００８６】この例では、２１２ビットのうち、１８６
ビットが２度書きを除いた部分に相当し、実質的なビッ
トレートに換算すると１２８ｋｂｐｓになる。In this example, 186 out of 212 bits
The bit corresponds to the part excluding double writing, which is 128 kbps when converted to a substantial bit rate.

【００８７】上記ブロックサイズモードは、図３のブロ
ックサイズ評価器４０３の評価結果を記録するためのデ
ータで、その内容は表１に示すようなものとなってい
る。The block size mode is data for recording the evaluation result of the block size evaluator 403 of FIG. 3, and the contents thereof are as shown in Table 1.

【００８８】[0088]

【表１】 [Table 1]

【００８９】この表１を見ればわかるように、ロングモ
ードのとき、低域及び中域ではＭＤＣＴ演算によりそれ
ぞれ１２８個の周波数成分に、高域では２５６個の周波
数成分に分解される。As can be seen from Table 1, in the long mode, the frequency component is decomposed into 128 frequency components by the MDCT operation in the low frequency region and the medium frequency region, and is decomposed into 256 frequency components in the high frequency region.

【００９０】また、ショートモードのとき、低域、中域
及び高域はそれぞれ３２個の周波数成分に分解される。In the short mode, each of the low frequency band, the middle frequency band and the high frequency band is decomposed into 32 frequency components.

【００９１】また、サブインフォメーション量４２に
は、アマウント１、アマウント２、アマウント３の３つ
の情報が記録される。アマウント１は、記録されている
ワードレングス及びスケールファクタの個数を表し、ア
マウント２は２度書きされているワードレングスの個数
を表し、アマウント３は２度書きされているスケールフ
ァクタの個数を表している。この内容については、表２
に示す。In the sub information amount 42, three pieces of information of amount 1, amount 2, and amount 3 are recorded. Amount 1 represents the number of recorded word lengths and scale factors, amount 2 represents the number of word lengths written twice, and amount 3 represents the number of scale factors written twice. There is. See Table 2 for details.
Shown in.

【００９２】[0092]

【表２】 [Table 2]

【００９３】ワードレングスは、各ユニットの再量子化
されたときの語長を表す。この内容については表３に示
す。The word length represents the word length of each unit when requantized. The contents are shown in Table 3.

【００９４】[0094]

【表３】 [Table 3]

【００９５】スケールファクタは各ユニットの正規化し
た値を表す。その内容については表４に示す。The scale factor represents the normalized value of each unit. The contents are shown in Table 4.

【００９６】[0096]

【表４】 [Table 4]

【００９７】ところで、上記図３におけるビット配分器
４０５は、再量子化の際に、１サウンドフレームのビッ
ト量を２１２ビットになるように人間の聴感特性を考慮
してワードレングスの値を決定していく。この２１２ビ
ットという値を可変にすることで、可変長の符号化装置
を構成できる。By the way, the bit allocator 405 in FIG. 3 determines the word length value in consideration of human auditory perception characteristics so that the bit amount of one sound frame becomes 212 bits at the time of requantization. To go. By making the value of 212 bits variable, a variable-length coding device can be configured.

【００９８】すなわち、本発明実施例の高能率符号化装
置の構成である図１のビット配分決定回路５００の出力
を、図３のビット配分器４０５に接続するような構成に
すれば、可変長の符号化装置が構成できるようになる。That is, if the output of the bit allocation determining circuit 500 of FIG. 1 which is the structure of the high efficiency coding apparatus of the embodiment of the present invention is connected to the bit distributor 405 of FIG. The encoding device can be configured.

【００９９】以下、このように図３のビット配分器４０
５に接続される図１のビット配分決定回路５００の動作
について、図５のグラフ及び図６のフローチャートによ
り説明する。Hereinafter, the bit allocator 40 of FIG.
The operation of the bit allocation determining circuit 500 of FIG. 1 connected to the circuit 5 will be described with reference to the graph of FIG. 5 and the flowchart of FIG.

【０１００】まず、ビット配分決定回路５００は、図６
のステップＳ６１で処理を開始すると、ステップＳ６２
で各チャンネル毎のピーク値を検出する。一般にピーク
値は、各チャンネルにおけるオーディオ信号のエネルギ
に相当する。First, the bit allocation determining circuit 500 is shown in FIG.
When the process is started in step S61 of step S62,
Detects the peak value for each channel. Generally, the peak value corresponds to the energy of the audio signal in each channel.

【０１０１】次に、ステップＳ６３では、求めたピーク
値に対応するビット配分量を算出する。この算出には、
ピーク値／ビット配分量の対応グラフをテーブルにした
ものを用いる。Next, in step S63, the bit distribution amount corresponding to the obtained peak value is calculated. For this calculation,
A table in which a correspondence graph of peak value / bit allocation amount is used is used.

【０１０２】図５のグラフが上記ピーク値に対するビッ
ト配分量を換算するためのグラフである。なお、この図
５に示すビット配分量の換算のグラフは、符号化方式と
していわゆるＡＴＲＡＣ方式を採用した場合のものであ
る。The graph of FIG. 5 is a graph for converting the bit allocation amount for the peak value. The graph of the bit distribution conversion shown in FIG. 5 is for the case where the so-called ATRAC method is adopted as the encoding method.

【０１０３】この図５において、横軸は、入力信号のピ
ーク値であり、その取りうる最大値を１に正規化してい
る。In FIG. 5, the horizontal axis is the peak value of the input signal, and the maximum value that can be taken is normalized to 1.

【０１０４】また、縦軸は、ビット配分量であり、最大
配分量を１８６バイトとしている。この値はいわゆるＭ
Ｄ（ミニディスク）装置におけるＡＴＲＡＣ方式の１サ
ウンドフレームの情報量に等しい。The vertical axis represents the bit distribution amount, and the maximum distribution amount is 186 bytes. This value is so-called M
It is equal to the information amount of one sound frame of the ATRAC system in the D (mini disk) device.

【０１０５】ここで、図５に示す換算のグラフは、様々
なオーディオ信号を用いて実験をして決定したものであ
る。Here, the conversion graph shown in FIG. 5 is determined by an experiment using various audio signals.

【０１０６】この図５において、ビット配分量の全体的
な傾向としては、ピーク値の増加と共にビット配分量も
増加するようになってくるが、ピーク値が２のマイナス
３乗を越えた当たりで、減少に転じる。In FIG. 5, the overall tendency of the bit distribution amount is such that the bit distribution amount increases as the peak value increases, but when the peak value exceeds 2 minus the third power. , Turn to decrease.

【０１０７】これは、信号レベルが充分に大きい（かな
り大きい）ところでは、再量子化による量子化ノイズが
信号レベルによってマスクされるため、再量子化ノイズ
の注入量を増やしても聞こえにくいという実験結果に基
づいている。This is an experiment in which the quantization noise due to requantization is masked by the signal level when the signal level is sufficiently large (quite large), so that it is difficult to hear even if the injection amount of requantization noise is increased. Based on the results.

【０１０８】一方、図５において、信号レベルが充分に
小さい（かなり小さい）場合、例えばピーク値が２のマ
イナス１２乗以下になるとビット配分量を一定（平坦な
ビット配分）としている。これは、ＡＴＲＡＣ方式の各
パラメータ情報（図４に示すワードレングスデータやス
ケールファクタデータなど）に必要なビット量がほぼ一
定であるため、ある程度のビット量を確保しておく必要
があるためである。On the other hand, in FIG. 5, when the signal level is sufficiently small (very small), for example, when the peak value becomes 2 −12 or less, the bit allocation amount is fixed (flat bit allocation). This is because the bit amount required for each parameter information of the ATRAC system (word length data, scale factor data, etc. shown in FIG. 4) is almost constant, and it is necessary to secure a certain bit amount. .

【０１０９】また、レベルが低くなってくるとランダム
なノイズ（白色雑音）が聞こえるようになり、このよう
な信号は全周波数帯域に周波数成分が一様に分布する傾
向にあるので、レベルが小さい割に多くのビット量を必
要とするためである。Further, as the level becomes lower, random noise (white noise) comes to be heard, and the frequency component of such a signal tends to be uniformly distributed in the entire frequency band, so that the level is small. This is because it requires a relatively large amount of bits.

【０１１０】以上のように、ビット配分量とピーク値と
の関係は、非線形（略Ｓ字カーブ）に特徴付けられる。
即ち、聴覚特性を考慮しなければ、ビット配分量とピー
ク値との関係は、比例関係となる。しかし本発明では、
各チャンネル毎に最低限度のビット配分量を確保しつ
つ、エネルギが所定レベル以上の場合には、逆にビット
配分量を減少させる。As described above, the relationship between the bit allocation amount and the peak value is characterized by a non-linearity (substantially S-shaped curve).
That is, if the auditory characteristics are not taken into consideration, the relationship between the bit allocation amount and the peak value is proportional. However, in the present invention,
While keeping the minimum bit allocation amount for each channel, when the energy is equal to or higher than a predetermined level, the bit allocation amount is decreased.

【０１１１】次に、ステップＳ６４では全体のビット量
は固定か否かの判断を行い、このステップＳ６４で全チ
ャンネルの総ビット配分量を固定にする必要があると判
断したときには、ステップＳ６５に進み、上記の換算が
終わったあと、後述する式（１）の計算を行い、最終的
な各チャンネルごとのビット配分量を計算する。Next, in step S64, it is determined whether or not the total bit amount is fixed. When it is determined in step S64 that the total bit distribution amount of all channels needs to be fixed, the process proceeds to step S65. After the above conversion, the equation (1) described later is calculated, and the final bit allocation amount for each channel is calculated.

【０１１２】すなわち、ｎチャンネルあるシステムの１
サウンドフレーム当たりの総ビット配分量をＧとし、上
記換算によるビット配分量をＣｉ（ｉ＝１，２，・・
・、ｎ）とすると、最終的な各チャンネルに配分される
ビット配分量Ｓｉは、Ｓｉ＝Ｇ＊Ｃｉ／（Ｃ１＋Ｃ２＋・・・＋Ｃｎ）・・（１）となる。That is, 1 in a system with n channels
Let G be the total bit allocation amount per sound frame, and Ci (i = 1, 2, ...
, N), the final bit allocation amount Si allocated to each channel is Si = G * Ci / (C1 + C2 + ... + Cn) ... (1).

【０１１３】上記ステップＳ６５の後、又は上記ステッ
プＳ６４でノーと判断された後は、ステップＳ６６に進
んで処理を終了する。After step S65, or after determining NO in step S64, the process proceeds to step S66 to end the process.

【０１１４】また、上述したようなＡＴＲＡＣ方式に対
応することで、図１のフォーマッタ６００と図２のデフ
ォーマッタ７００は以下のように動作する。By supporting the ATRAC system as described above, the formatter 600 of FIG. 1 and the deformatter 700 of FIG. 2 operate as follows.

【０１１５】図１のフォーマッタ６００は、１サウンド
フレーム毎に各チャンネルの符号化器４００₁〜４００
_nから図４に示すような形で送られてきたデータをチャ
ンネル順に並べて、ビットストリームとして伝送する。
すなわち、マルチプレクサの働きをする。The formatter 600 shown in FIG. 1 includes the encoders 400 _{1 to} 400 of the respective channels for each sound frame.
_The data sent from _{n in the} form shown in FIG. 4 is arranged in the order of channels and transmitted as a bit stream.
That is, it functions as a multiplexer.

【０１１６】また、図２のデフォーマッタ７００は、上
記フォーマッタ６００でマルチプレクスされたデータを
各チャンネル毎に分解して各復号化器４００₁〜４００
_nに渡すデマルチプレクサの働きをする。Further, the deformatter 700 of FIG. 2 decomposes the data multiplexed by the formatter 600 for each channel and decodes each of the decoders 400 _{1 to} 400.
Acts as a demultiplexer to pass to _n .

【０１１７】上述したように、本発明の第１の実施例装
置によれば、複数のチャンネルを持つオーディオデータ
の圧縮において、各チャンネルの振幅情報の時間的な変
化により、各チャンネルへのビット配分量を決定して符
号化するようにしているため、各チャンネルにその情報
量に見合ったビット配分が可能となり、更なる高能率符
号化が可能となる。As described above, according to the apparatus of the first embodiment of the present invention, in the compression of audio data having a plurality of channels, the bit allocation to each channel is made by the temporal change of the amplitude information of each channel. Since the amount is determined and encoded, it is possible to allocate bits to each channel according to the amount of information, and it is possible to perform higher efficiency encoding.

【０１１８】すなわち、更なる高音質化もしくは、全チ
ャンネルトータルにおける低ビットレート化が可能とな
る。なお、記録するメデイアによっては固定長が望まし
い場合があるので、第１の実施例装置では、全チャンネ
ルの総ビット配分量を概ね一定とする様に符号化するこ
とも可能である。That is, it is possible to further improve the sound quality or reduce the bit rate in all channels. Since a fixed length may be desirable depending on the medium to be recorded, the first embodiment can perform encoding so that the total bit allocation amount of all channels is substantially constant.

【０１１９】次に、本発明の第２の実施例について説明
する。Next, a second embodiment of the present invention will be described.

【０１２０】図７には第２の実施例の高能率符号化装置
（エンコーダ）の構成を示している。FIG. 7 shows the configuration of the high efficiency coding apparatus (encoder) of the second embodiment.

【０１２１】図７において、複数チャンネル（ｃｈ１，
ｃｈ２，・・・，ｃｈｎ）のオーディオ信号は、これら
各チャンネルに対応する各入力端子３０₁〜３０_n及び
伝送線路１０１₁〜１０１_nを経て、同じく各チャンネ
ルに対応する標本化及び量子化器１２０₁〜１２０_nに
送られる。これら標本化及び量子化器１２０₁〜１２０
_nでは各チャンネルのオーディオ信号が量子化信号に変
換され、これら各標本化及び量子化器１２０₁〜１２０
_nからの量子化された信号は、各伝送線路１０２₁〜１
０２_nを経て、各符号化器２１０₁〜２１０_nに送られ
る。In FIG. 7, a plurality of channels (ch1,
ch2, · · ·, audio signals chn) passes through each of the input terminals 30 ₁ to 30 _n and the transmission line 101 ₁ to 101 _n corresponding to respective channels, sampling and quantization unit also corresponds to the respective channels 120 _{1 to} 120 _n . These sampling and quantizers 120 _{1 to} 120
_{In n} , the audio signal of each channel is converted into a quantized signal, and each of the sampling and quantizers 120 _{1 to} 120 ₁
The quantized signal from _n is transmitted by each transmission line 102 _{1 -1.}
02 _n, and is sent to each of the encoders 210 _{1 to} 210 _n .

【０１２２】各符号化器２１０₁〜２１０_nでは、各チ
ャンネルのオーディオ信号を時間と周波数の二次元領域
（ブロックフローティングユニット）に分割し、そのブ
ロックフローティングユニットに属する信号成分をブロ
ックフローティングユニット毎にスケールファクタを用
いて正規化する。ここで求められた各ブロックフローテ
ィングユニットのスケールファクタは、伝送線路１０３
₁〜１０３_nを通してビット配分決定回路３１０へ送ら
れる。In each of the encoders 210 _{1 to} 210 _n , the audio signal of each channel is divided into a two-dimensional area of time and frequency (block floating unit), and the signal component belonging to the block floating unit is divided into block floating units. Normalize using the scale factor. The scale factor of each block floating unit obtained here is the transmission line 103
_It is sent to the bit allocation decision circuit 310 through _{1 to} 103 _n .

【０１２３】当該ビット配分決定回路３１０では、伝送
線路１０３₁〜１０３_nを介して受け取ったスケールフ
ァクタの各チャンネル毎の総和を求め、当該総和から各
チャンネルのビット配分量を後述する換算式（換算のグ
ラフ）により換算し、そのビット配分量を伝送線路１０
４₁〜１０４_nから各符号化器２１０₁〜２１０_nへ渡
す。In the bit allocation determining circuit 310, the sum of scale factors received via the transmission lines 103 _{1 to} 103 _n is calculated for each channel, and the bit allocation amount of each channel is calculated from the sum by the conversion formula (conversion described below). Graph), and the bit allocation amount is converted to the transmission line 10
4 passed from ₁ -104 _n to each encoding unit 210 ₁ to 210 _n.

【０１２４】したがって、各符号化器２１０₁〜２１０
_nでは、上記ビット配分量に応じて、上記伝送線路１０
２₁〜１０２_nからの信号を再量子化し、当該再量子化
すなわち圧縮した信号を伝送線路１０５₁〜１０５_nを
介してフォーマッタ４１０へ渡す。Therefore, each of the encoders 210 _{1 to} 210
_{At n} , depending on the bit allocation amount, the transmission line 10
The signals from 2 _{1 to} 102 _n are requantized, and the requantized or compressed signals are passed to the formatter 410 via the transmission lines 105 _{1 to} 105 _n .

【０１２５】フォーマッタ４１０は、複数のチャンネル
の上記伝送線路１０５₁〜１０５_nを経て受け取った被
圧縮信号を、所定のフォーマットに従って伝送又は記録
媒体への記録のためにビットストリームへ組み立てる。
このビットストリームは、伝送線路１０６を介して出力
端子３１から出力される。The formatter 410 assembles the compressed signals received via the transmission lines 105 _{1 to} 105 _n of a plurality of channels into a bit stream for transmission or recording on a recording medium according to a predetermined format.
This bit stream is output from the output terminal 31 via the transmission line 106.

【０１２６】更にこのビットストリームは、例えばレー
ザー記録装置２６により、映画フィルム２７上の所定の
記録エリア２８に書き込まれる。Further, this bit stream is written in a predetermined recording area 28 on the motion picture film 27 by the laser recording device 26, for example.

【０１２７】なお、この第２の実施例におけるデコーダ
側の高能率復号化装置の基本構成については、前記図２
と同様であるため、詳細な説明は省略する。The basic structure of the high-efficiency decoding apparatus on the decoder side in the second embodiment is shown in FIG.
Since it is the same as, the detailed description will be omitted.

【０１２８】簡単に前記図２を用いて説明すると、当該
第２の実施例の高能率復号化装置のデフォーマッタ７０
０でも、上記第２の実施例の高能率符号化装置からのビ
ットストリームを、所定のフォーマットに従って各チャ
ンネル毎の被圧縮信号に分解する。Briefly described with reference to FIG. 2, the deformatter 70 of the high-efficiency decoding apparatus of the second embodiment.
Even with 0, the bit stream from the high-efficiency encoder of the second embodiment is decomposed into compressed signals for each channel according to a predetermined format.

【０１２９】当該各チャンネル毎に分解された被圧縮信
号は、各チャンネル毎に対応して設けられた復号器８０
０₁〜８００_nにて伸長され、さらにＤ／Ａ（ディジタ
ル／アナログ）変換器９００₁〜９００_nでアナログ信
号に変換される。この各アナログ信号が、各チャンネル
ｃｈ１〜ｃｈｎの復号化された信号として出力される。The compressed signal decomposed for each channel is the decoder 80 provided corresponding to each channel.
The signal is expanded at 0 _{1 to} 800 _n , and further converted to an analog signal at D / A (digital / analog) converters 900 _{1 to} 900 _n . Each analog signal is output as a decoded signal of each channel ch1 to chn.

【０１３０】また、この第２の実施例の高能率符号化装
置において利用する圧縮符号化手法は、スケールファク
タを用いて圧縮符号化する方式であれば、全てに応用が
可能である。The compression coding method used in the high efficiency coding apparatus of the second embodiment can be applied to all compression coding methods using a scale factor.

【０１３１】すなわち、この第２の実施例においても、
前記図３を用いて説明すれば、図７のビット配分決定回
路３１０の出力を、図３のビット配分器４０５に接続す
るような構成にすれば、可変長の符号化装置が構成でき
るようになる。That is, also in this second embodiment,
Referring to FIG. 3, a variable length coding apparatus can be configured by connecting the output of the bit allocation determination circuit 310 of FIG. 7 to the bit allocation unit 405 of FIG. Become.

【０１３２】以下、このように図３のビット配分器４０
５に接続される図７のビット配分決定回路３１０の詳細
な動作について、図８のグラフ及び図９のフローチャー
トにより説明する。Hereinafter, the bit allocator 40 of FIG.
The detailed operation of the bit allocation determination circuit 310 of FIG. 7 connected to No. 5 will be described with reference to the graph of FIG. 8 and the flowchart of FIG.

【０１３３】先ず、ビット配分決定回路３１０は、図９
のステップＳ９１で処理を開始すると、ステップＳ９２
において符号化器２１０₁〜２１０_nからのスケールフ
ァクタから、各チャンネル毎のスケールファクタの総和
を算出する。First, the bit allocation determination circuit 310 is shown in FIG.
When the processing is started in step S91 of step S92,
In, the sum of the scale factors for each channel is calculated from the scale factors from the encoders 210 _{1 to} 210 _n .

【０１３４】次のステップＳ９３では、求めた各チャン
ネルのスケールファクタの総和より、各チャンネル毎の
ビット配分量を算出する。In the next step S93, the bit distribution amount for each channel is calculated from the obtained sum of the scale factors of each channel.

【０１３５】ここで、スケールファクタは、前述したよ
うに５２個あるブロックフローティングユニットに含ま
れる周波数成分を正規化した値である。通常は、そのブ
ロックフローティングユニット内の周波数成分の絶対値
を求め、その絶対値の最大値以上の値であって、かつそ
の中で最小の値のものを、前記表４に示す値の中から選
ぶことになる。Here, the scale factor is a value obtained by normalizing the frequency components included in the 52 block floating units as described above. Usually, the absolute value of the frequency component in the block floating unit is calculated, and the value that is greater than or equal to the maximum value of the absolute values and that is the minimum value is selected from the values shown in Table 4 above. I will choose.

【０１３６】すなわち、スケールファクタは、ブロック
フローティングユニット内のデータの代表値的な性格、
即ちエネルギを示すと考えられる。従って、スケールフ
ァクタの和を求めれば、全体の情報量を推定することが
できると考えられる。That is, the scale factor is the typical character of the data in the block floating unit,
That is, it is considered to indicate energy. Therefore, it is considered that the total amount of information can be estimated by obtaining the sum of scale factors.

【０１３７】図８には、図７のビット配分決定回路３１
０でのスケールファクタの和に対するビット配分量を示
す。FIG. 8 shows the bit allocation determining circuit 31 of FIG.
The bit allocation amount for the sum of scale factors at 0 is shown.

【０１３８】この図８も、第１の実施例同様に符号化方
式として、ＡＴＲＡＣ方式を使ったときのものである。
なお、この図８の縦軸は前記図５同様のビット配分量
（最大配分量は１８６バイト）であるが、横軸はスケー
ルファクタの和である。This FIG. 8 also shows the case where the ATRAC system is used as the encoding system as in the first embodiment.
The vertical axis of FIG. 8 is the bit allocation amount (the maximum allocation amount is 186 bytes) as in FIG. 5, but the horizontal axis is the sum of the scale factors.

【０１３９】この図８に示す換算のグラフも、第１の実
施例の図５同様に、様々なオーディオ信号を用いて実験
をしながら決定したものである。The conversion graph shown in FIG. 8 is also determined through experiments using various audio signals, as in FIG. 5 of the first embodiment.

【０１４０】全体的な傾向としては、スケールファクタ
の和の値の増加とともに、ビット配分量も増加する。As an overall tendency, the bit allocation amount increases as the value of the sum of scale factors increases.

【０１４１】しかし、図８において上記スケールファク
タの和の値が約７０００を越えた当たりで、ビット配分
量は減少に転じる。これは、上記スケールファクタの和
の値がかなり大きい（信号レベルが充分に大きい）とこ
ろでは、信号レベルも比較的大きく、再量子化による量
子化ノイズが信号レベルによってマスクされるため、再
量子化ノイズの注入量を増やしても聞こえにくいという
実験結果に基づいている。However, in FIG. 8, when the sum of the scale factors exceeds about 7,000, the bit allocation amount starts to decrease. This is because when the sum of the above scale factors is quite large (the signal level is sufficiently large), the signal level is also relatively large, and the quantization noise due to requantization is masked by the signal level, so requantization is performed. It is based on the experimental results that it is difficult to hear even if the injection amount of noise is increased.

【０１４２】一方、図８において、上記スケールファク
タの和の値が１．５以下（信号レベルが充分に小さい場
合）になるとビット配分量が一定となるのは、ＡＴＲＡ
Ｃ方式の各パラメータ情報（前記図４に示すワードレン
グスデータやスケールファクタデータなど）のために必
要なビット量がほぼ一定であるため、このビット量を確
保しておく必要があるためである。On the other hand, in FIG. 8, when the sum value of the scale factors becomes 1.5 or less (when the signal level is sufficiently low), the bit allocation amount becomes constant because ATRA
This is because the bit amount required for each parameter information of the C method (word length data, scale factor data, etc. shown in FIG. 4) is almost constant, and it is necessary to secure this bit amount.

【０１４３】この例においても、ビット配分量とスケー
ルファクタの総和との関係は、略Ｓ字カーブの非線形特
性を成す。Also in this example, the relationship between the bit allocation amount and the sum of the scale factors has a non-linear characteristic of a substantially S-shaped curve.

【０１４４】なお、この第２の実施例においても、ステ
ップＳ９４において全体のビット量は固定か否かの判断
を行い、このステップＳ９４で全チャンネルの総ビット
配分量を固定にする必要があると判断したときには、ス
テップＳ９５に進み、上記の換算が終わったあと、前記
式（１）の計算を行い、最終的な各チャンネル毎のビッ
ト配分量を計算する。Also in the second embodiment, it is necessary to determine whether or not the total bit amount is fixed in step S94, and to fix the total bit distribution amount of all channels in step S94. When the determination is made, the process proceeds to step S95, and after the above conversion is completed, the formula (1) is calculated, and the final bit allocation amount for each channel is calculated.

【０１４５】上記ステップＳ９５の後、又は上記ステッ
プＳ９４でノーと判断された後は、ステップＳ９６に進
む。After the above step S95 or after the judgment in step S94 is NO, the process proceeds to step S96.

【０１４６】また、当該第２の実施例においても、図７
のフォーマッタ４１０は、１サウンドフレーム毎に各チ
ャンネルの符号化器２１０₁〜２１０_nから図４に示す
ような形で送られてきたデータをチャンネル順に並べ
て、ビットストリームとして伝送する。すなわち、マル
チプレクサの働きをする。Also in the second embodiment, as shown in FIG.
The formatter 410 arranges the data sent from the encoders 210 _{1 to} 210 _n of the respective channels in the form as shown in FIG. 4 for each sound frame, and transmits the data as a bit stream. That is, it functions as a multiplexer.

【０１４７】さらに、第２の実施例の高能率復号化装置
におけるデフォーマッタも、上記フォーマッタ４１０で
マルチプレクスされたデータを各チャンネル毎に分解し
て、各復号化器に渡すデマルチプレクサの働きをする。Further, the deformatter in the high-efficiency decoding apparatus of the second embodiment also functions as a demultiplexer which decomposes the data multiplexed by the formatter 410 for each channel and passes it to each decoder. To do.

【０１４８】上述したように、第２の実施例装置によれ
ば、複数のチャンネルを持つオーディオデータの圧縮に
おいて、各チャンネルのスケールファクタの総和の時間
的な変化により、各チャンネルのビット配分量を決定し
て符号化するようにしている。このため、各チャンネル
にその情報量に見合ったビット配分が可能となり、更な
る高能率符号化が可能となる。As described above, according to the apparatus of the second embodiment, in the compression of audio data having a plurality of channels, the bit allocation amount of each channel is determined by the temporal change of the sum of the scale factors of each channel. It is decided and encoded. Therefore, it is possible to allocate bits to each channel according to the amount of information, and it is possible to perform higher efficiency coding.

【０１４９】それによって更なる高音質化もしくは、低
ビットレート化を図ることができる。すなわち、当該第
２の実施例装置においても、全チャンネルトータルにお
ける低ビットレート化又は高音質化が可能となる。As a result, higher sound quality or lower bit rate can be achieved. That is, also in the apparatus of the second embodiment, it is possible to reduce the bit rate or improve the sound quality of all channels in total.

【０１５０】また、この第２の実施例の場合において
も、記録するメデイアによっては固定長が望ましい場合
がある。その場合、全チャンネルの総ビット配分量を概
ね一定とする様に符号化することも可能である。Also in the case of the second embodiment, a fixed length may be desirable depending on the medium to be recorded. In that case, it is possible to perform encoding so that the total bit allocation amount of all channels is substantially constant.

【０１５１】以上、本発明の第１の実施例、及び第２の
実施例において、記録媒体として映画フィルムを例示し
た。しかし、本発明の要旨を変更しない範囲において、
記録媒体は映画フィルムに止まらず、様々なものが使用
可能である。例えば、光ディスク、磁気テープ等であ
る。As described above, in the first and second embodiments of the present invention, the motion picture film is exemplified as the recording medium. However, within the scope of not changing the gist of the present invention,
The recording medium is not limited to a movie film, and various recording media can be used. For example, it is an optical disk, a magnetic tape, or the like.

【０１５２】[0152]

【発明の効果】本発明においては、各チャンネルへのビ
ット配分量をそれぞれのチャンネルのエネルギ、例えば
振幅情報又はスケールファクタの総和の時間的な変化に
より決定しているため、各チャンネルにその情報量に見
合ったビット配分が可能となり、更なる高能率符号化が
可能となる。それによって更なる高音質化もしくは、低
ビットレート化が可能となる。According to the present invention, the amount of bit allocation to each channel is determined by the energy of each channel, for example, the amplitude information or the sum of the scale factors, which changes with time. It becomes possible to perform bit allocation commensurate with the above, and it becomes possible to perform higher efficiency coding. Thereby, higher sound quality or lower bit rate can be achieved.

【０１５３】また、本発明でのマルチチャンネルのオー
ディオ信号とは、少なくとも２チャンネルをいい、望ま
しくは映画のサウンドトラツクのように、５チャンネル
以上において、本発明の効果が顕著になる。Further, the multi-channel audio signal in the present invention means at least two channels, and the effect of the present invention becomes remarkable in five or more channels like a sound track of a movie.

[Brief description of drawings]

【図１】本発明の第１の実施例の高能率符号化装置の概
略構成を示すブロック回路図である。FIG. 1 is a block circuit diagram showing a schematic configuration of a high efficiency coding apparatus according to a first embodiment of the present invention.

【図２】本発明の第１及び第２の実施例の高能率復号化
装置の概略構成を示すブロック回路図である。FIG. 2 is a block circuit diagram showing a schematic configuration of a high-efficiency decoding device according to first and second embodiments of the present invention.

【図３】ＡＴＲＡＣ方式の高能率符号化装置及び本発明
実施例の高能率符号化装置におけるビット配分について
説明するためのブロック回路図である。FIG. 3 is a block circuit diagram for explaining bit allocation in the ATRAC high-efficiency encoder and the high-efficiency encoder of the embodiment of the present invention.

【図４】サウンドフレーム内のデータの記録の様子を説
明するための図である。FIG. 4 is a diagram for explaining how data is recorded in a sound frame.

【図５】第１の実施例におけるビット配分量を説明する
ための図である。FIG. 5 is a diagram for explaining a bit allocation amount in the first embodiment.

【図６】第１の実施例におけるビット配分決定の動作を
説明するためのフローチヤートである。FIG. 6 is a flow chart for explaining the bit allocation determination operation in the first embodiment.

【図７】本発明の第２の実施例の高能率符号化装置の概
略構成を示すブロック回路図である。FIG. 7 is a block circuit diagram showing a schematic configuration of a high-efficiency coding apparatus according to a second embodiment of the present invention.

【図８】第２の実施例におけるビット配分量を説明する
ための図である。FIG. 8 is a diagram for explaining a bit allocation amount in the second embodiment.

【図９】第２の実施例におけるビット配分決定の動作を
説明するためのフローチヤートである。FIG. 9 is a flow chart for explaining the operation of bit allocation determination in the second embodiment.

[Explanation of symbols]

２６レーザ記録装置２７映画フィルム２８記録エリア２９パーフォレーション１００標本化及び量子化器２００振幅情報検出回路３００ディレイライン３１０，５００ビット配分決定回路４００符号化器４０１帯域分割フィルタ４０２Ｌ，４０２Ｍ，４０２ＨＭＤＣＴ回路４０３ブロックサイズ評価器４０４Ｌ，４０４Ｍ，４０４Ｈ正規化回路４０５ビット配分器４０６再量子化器６００，４０７，４１０フォーマッタ７００デフォーマッタ８００復号器９００Ｄ／Ａ変換器 26 laser recording device 27 motion picture film 28 recording area 29 perforation 100 sampling and quantizer 200 amplitude information detection circuit 300 delay line 310,500 bit allocation determination circuit 400 encoder 401 band division filter 402L, 402M, 402H MDCT circuit 403 Block size evaluator 404L, 404M, 404H Normalization circuit 405 Bit distributor 406 Requantizer 600, 407, 410 Formatter 700 Deformatter 800 Decoder 900 D / A converter

Claims

[Claims]

1. An encoding device for compressing and encoding digital audio signals of a plurality of channels by utilizing the characteristics of the audio signals and human hearing, respectively. Energy detecting means for detecting energy of an audio signal, bit allocation amount determining means for determining a bit allocation amount to each channel based on the detection result, and for each channel according to the determination of the bit allocation amount. The bit allocation amount determining means is composed of compression encoding means for performing compression encoding based on the distributed bit allocation amount, and multiplexing means for multiplexing the compression encoded signals for each channel. The relationship between the signal energy and the bit allocation amount is that the bit allocation amount increases as the signal energy increases as a whole. Encoding device which is a linear characteristic.

2. The encoding device according to claim 1, wherein the non-linear characteristic is approximated by a characteristic of a substantially S-shaped curve.

3. The coding apparatus according to claim 1, wherein the non-linear characteristic has a flat bit allocation characteristic when the signal energy is sufficiently small.

4. The encoding apparatus according to claim 1, wherein the non-linear characteristic has a characteristic that bit allocation decreases when the signal energy is sufficiently large.

5. The bit allocation amount determining means respectively obtains an approximate amount of a required bit amount for each channel, and distributes a total bit amount of all channels per unit time in proportion to each approximate amount. The encoding device according to claim 1, wherein the bit allocation amount of each channel is determined.

6. The encoding apparatus according to claim 1, wherein the signal energy has an amplitude characteristic.

7. The encoding device according to claim 4, wherein the amplitude characteristic is a peak value.

8. The encoding device according to claim 1, wherein the signal energy is a scale factor.

9. A means for detecting the energy of the digital audio signal for each of the digital audio signals of a plurality of channels, a means for determining a bit allocation amount to each channel based on the detection result, and the bit allocation amount. The means for compressing and coding on the basis of the bit distribution amount distributed to each of the channels in accordance with the above decision, and the means for multiplexing the compression coded signal for each of the channels. The deciding means determines the digital energy of the plurality of channels by an encoding device in which the relationship between the signal energy and the bit allocation amount is a non-linear characteristic in which the bit allocation amount increases as the signal energy increases. Each of the audio signals is compression-encoded by using the characteristics of the audio signal and the human sense of hearing, and the multiplexed compression-encoded signal is generated. , A recording medium characterized by being recorded.

10. A coding method for compressing and coding digital audio signals of a plurality of channels by using the characteristics of the audio signals and human hearing, respectively. An energy detecting step of detecting energy, a bit allocation amount determining step of determining a bit allocation amount to each channel based on the detection result, and a bit allocated to each channel according to the determination of the bit allocation amount. The bit allocation amount determining step includes a compression encoding step for performing compression encoding based on an allocation amount, and a multiplexing step for multiplexing the compression encoded signals of the respective channels. The relationship with the bit allocation is that the bit allocation increases as the signal energy increases as a whole. Encoding method, which is a characteristic.

11. The non-linear characteristic is approximated by a characteristic of a substantially S-shaped curve.
The described encoding method.

12. The encoding method according to claim 10, wherein the non-linear characteristic has a flat bit allocation characteristic when the signal energy is sufficiently small.

13. The encoding method according to claim 10, wherein the non-linear characteristic has a characteristic that bit allocation decreases when the signal energy is sufficiently large.

14. A means for detecting the energy of the digital audio signal for each of the digital audio signals of a plurality of channels, a means for determining a bit allocation amount to each channel based on the detection result, and the bit allocation amount. The means for compressing and coding on the basis of the bit distribution amount distributed to each of the channels in accordance with the above decision, and the means for multiplexing the compression coded signal for each of the channels. The deciding means determines the digital signals of the plurality of channels by an encoding device in which the relationship between the signal energy and the bit allocation amount is a nonlinear characteristic in which the bit allocation amount increases as the signal energy increases as a whole. Whether the audio signals are compressed and coded by using the characteristics of the audio signals and the human hearing, respectively And a decoding device for decoding the signal of each channel.

15. The encoding device according to any one of claims 1 to 5 or the encoding method according to any one of claims 10 to 13,
A decoding device comprising a decoding means for decoding a signal of each channel from a recording medium in which a multiplexed compressed coded signal is recorded.

16. A step of detecting energy of the digital audio signal for each of the digital audio signals of a plurality of channels, a step of determining a bit allocation amount for each channel based on the detection result, and the bit allocation amount. Determining the bit allocation amount, the step of compression-encoding based on the bit allocation amount allocated to each channel, the step of multiplexing the compression-encoded signal of each channel, and the bit allocation amount determined. The step of performing the coding is such that the relationship between the signal energy and the bit allocation amount is a non-linear characteristic in which the bit allocation amount increases as the signal energy increases as a whole. Each of them is compressed and encoded by using the characteristics of the audio signal and the human sense of hearing, and multiplexed compression is performed. A decoding method comprising a decoding step of decoding a signal of each channel from a recording medium on which an encoded signal is recorded.