JPH07161140A

JPH07161140A - Apparatuses and methods for transmission and receiving of digital audio signal

Info

Publication number: JPH07161140A
Application number: JP5306892A
Authority: JP
Inventors: Katsuaki Tsurushima; 克明鶴島
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1993-12-07
Filing date: 1993-12-07
Publication date: 1995-06-23

Abstract

PURPOSE:To facilitate compression coding of high tone quality and, further, facilitate compression coding of a high compression factor for an important sound by a method wherein the coding corresponding to a channel is performed with the predetermined compression factor for a signal having a large influence upon the sensation of hearing corresponding to a channel. CONSTITUTION:High sensation compression coding circuits 217-222 which apply compression coding with a first coding system using a first compression factor to the digital audio signals of (m) channels (n>m) which have a larger influence upon the sensation of hearing than the digital audio signals of the other (n-m) channels among the digital audio signals of (n) channels are provided. Further, high compression factor coding circuits 223 and 224 which apply compression coding with a second coding system using a second compression factor which is higher than the first compression factor to the digital audio signals of the (n-m) channels are provided. Then the respective compression coding outputs of both the circuits 217-222 and the circuits 223 and 224 are transmitted.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、例えば、映画フィルム
映写システム、ビデオテープレコーダ、ビデオディスク
プレーヤ等のステレオや、いわゆるマルチサラウンド音
響システムにおいて用いられるマルチチャネルのディジ
タルオーディオ信号を圧縮符号化して伝送するディジタ
ルオーディオ信号の伝送装置及びそれに対応する受信装
置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention, for example, compresses and transmits a multi-channel digital audio signal used in a stereo such as a movie film projection system, a video tape recorder, a video disc player, or a so-called multi surround sound system. The present invention relates to a digital audio signal transmitting apparatus and a corresponding receiving apparatus.

【０００２】[0002]

【従来の技術】通常のオーディオ機器の場合のみなら
ず、例えば映画フィルム映写システム、高品位テレビジ
ョン、ビデオテープレコーダ、ビデオディスクプレーヤ
等のステレオないしはマルチサラウンド音響システムに
おいては、例えば４〜８チャネル等の複数チャネルのオ
ーディオ或いは音声信号を扱うようになりつつある。2. Description of the Related Art Not only in the case of ordinary audio equipment, but also in stereo or multi-surround sound systems such as movie film projection systems, high definition televisions, video tape recorders, video disc players, etc., for example, 4 to 8 channels, etc. It is becoming increasingly common to handle multiple channels of audio or voice signals.

【０００３】特に、業務用においては、ディジタルオー
ディオのマルチチャネル化が進んでおり、例えば８チャ
ネルのディジタルオーディオ信号を扱う機器が浸透して
きている。上記８チャネルのディジタルオーディオ信号
を扱う機器としては、例えば映画フィルム映写システム
等がある。また、高品位テレビジョン、ビデオテープレ
コーダ、ビデオディスクプレーヤ等のステレオないしは
マルチサラウンド音響システムにおいても、例えば４〜
８チャネル等の複数チャネルのオーディオ或いは音声信
号を扱うようになりつつある。In particular, for business use, digital audio has become multi-channel, and, for example, equipment that handles 8-channel digital audio signals has become widespread. An example of a device that handles the 8-channel digital audio signal is a movie film projection system. Also, in stereo or multi-surround sound systems such as high-definition televisions, video tape recorders, video disc players, etc.
A plurality of channels of audio or voice signals such as 8 channels are being handled.

【０００４】上記８チャネルのディジタルオーディオ信
号を扱う映画フィルム映写システムにおいては、上記映
画フィルムに対して、例えばレフトチャネル，レフトセ
ンタチャネル，センタチャネル，ライトセンタチャネ
ル，ライトチャネル，サラウンドレフトチャネル，サラ
ウンドライトチャネル，サブウーファチャネルの８チャ
ネルのディジタルオーディオ信号を記録することが行わ
れつつある。なお、上記映画フィルムに記録する上記８
チャネルの各チャネルは、例えば当該映画フィルムの画
像記録領域から再生された画像が映写機によって投影さ
れるスクリーン側に配置されるレフトスピーカ、レフト
センタスピーカ、センタスピーカ、ライトセンタスピー
カ、ライトスピーカ、サブウーファスピーカ、観客席を
取り囲むように左側に配置されるサラウンドレフトスピ
ーカ及び右側に配置されるサラウンドライトスピーカと
対応するものである。In the motion picture film projection system that handles the 8-channel digital audio signals, for the motion picture film, for example, a left channel, a left center channel, a center channel, a right center channel, a right channel, a surround left channel, a surround right. Recording of 8-channel digital audio signals of channels and subwoofer channels is being performed. It should be noted that the above 8 recorded on the movie film
Each channel is, for example, a left speaker, a left center speaker, a center speaker, a right center speaker, a right speaker, a subwoofer speaker, which is arranged on the screen side on which an image reproduced from the image recording area of the movie film is projected by the projector. , A surround left speaker arranged on the left side and a surround right speaker arranged on the right side so as to surround the auditorium.

【０００５】[0005]

【発明が解決しようとする課題】ところが、映画フィル
ムに上記８チャネルのディジタルオーディオ信号を記録
する場合において、映画フィルムには、例えばいわゆる
ＣＤ（コンパクトディスク）などで用いているようなサ
ンプリング周波数４４．１ｋＨｚで１６ビットの直線量
子化されたオーディオデータを上記８チャネル分も記録
できる領域を確保することは困難であるため、上記８チ
ャネルのオーディオデータを圧縮して記録する必要があ
る。However, in the case of recording the above 8-channel digital audio signals on a motion picture film, the motion picture film has a sampling frequency of 44. Since it is difficult to secure an area in which 16-bit linearly quantized audio data of 1 kHz can be recorded for 8 channels, it is necessary to compress and record the 8 channels of audio data.

【０００６】また、フィルムという媒体は、表面に傷な
どが発生しやすいため、ディジタルデータをオリジナル
のまま記録していたのでは、データ欠けが激しく実用に
ならない。このため、エラー訂正符号の能力が非常に重
要になり、上記データ圧縮は、その訂正符号も含めて上
記フィルム上の記録領域に記録可能な程度まで行う必要
がある。Further, since a medium called a film is apt to have scratches on its surface, if digital data is recorded as it is, the data will be seriously lost and it will not be put to practical use. For this reason, the capability of the error correction code becomes very important, and the data compression must be performed to the extent that it can be recorded in the recording area on the film including the correction code.

【０００７】しかし、圧縮符号化を行うと楽器や人間の
声などが原音から変化するため、特に上記映画フィルム
のように原音の忠実な再現が必要とされるメディアの記
録フォーマットとして採用する場合において人間の声な
ど重要な音に対しては何らかの高音質化の手段が必要と
なってくる。However, when compression coding is performed, the musical instrument, human voice, or the like changes from the original sound. Therefore, when it is adopted as a recording format of a medium in which faithful reproduction of the original sound is required as in the above-mentioned movie film. For important sounds such as human voice, some means for improving the sound quality is required.

【０００８】そこで、本発明は、上述したようなことに
鑑み、特に重要な音に対しては高音質の圧縮符号化が可
能であると共に、さらに高圧縮率の圧縮符号化をも可能
とする高能率符号化を用いたディジタルオーディオ信号
の伝送装置及びこれに対応する受信装置を提供すること
を目的としている。In view of the above, the present invention enables high-quality compression coding for particularly important sounds and also enables high-compression compression coding. It is an object of the present invention to provide a digital audio signal transmission apparatus using high-efficiency coding and a receiving apparatus corresponding to the transmission apparatus.

【０００９】[0009]

【課題を解決するための手段】本発明は、上述の目的を
達成するために提案されたものであり、本発明のディジ
タルオーディオ信号の伝送装置は、複数チャネル（ｎチ
ャネル、ｎは３より大なる正の整数）のディジタルオー
ディオ信号を圧縮符号化して伝送する伝送装置であり、
上記ｎ個のチャネルのディジタルオーディオ信号のう
ち、ｍ個（ｎ＞ｍ）のチャネルのディジタルオーディオ
信号であって他のｎ−ｍ個のチャネルのディジタルオー
ディオ信号より聴感上影響力の高い信号に対しては第１
の圧縮率の第１の符号化方式で圧縮符号化する第１の圧
縮符号化手段と、上記ｎ−ｍ個のチャネルのディジタル
オーディオ信号に対しては上記第１の圧縮率よりも高い
第２の圧縮率を有する第２の符号化方式で圧縮符号化す
る第２の圧縮符号化手段とを有し、上記第１の圧縮符号
化手段からの圧縮符号化出力と上記第２の圧縮符号化手
段からの圧縮符号化出力を共に伝送することを特徴とす
るものである。The present invention has been proposed in order to achieve the above object, and a digital audio signal transmitting apparatus of the present invention is provided with a plurality of channels (n channels, n is greater than 3). Is a transmission device that compresses and encodes a digital audio signal of
Among the digital audio signals of the above n channels, for the digital audio signals of m (n> m) channels and having a higher audible influence than the digital audio signals of the other nm channels. The first
Second compression coding means for compressing and coding by the first coding method having a compression rate of 1) and a second compression rate higher than the first compression rate for the digital audio signals of the n-m channels. Second compression encoding means for performing compression encoding with a second encoding method having a compression rate of, and a compression encoding output from the first compression encoding means and the second compression encoding. It is characterized in that the compression encoded output from the means is transmitted together.

【００１０】ここで、本発明の伝送装置において、上記
ｎ個のチャネルはセンタチャネルとレフトチャネルとラ
イトチャネルとサラウンドレフトチャネルとサラウンド
ライトチャネルであり、上記ｎ−ｍ個のチャネルはサラ
ウンドレフトチャネルとサラウンドライトチャネルであ
る。或いは、上記ｎ個のチャネルはセンタチャネルとレ
フトチャネルとライトチャネルとレフトセンタチャネル
とライトセンタチャネルとサラウンドレフトチャネルと
サラウンドライトチャネルであり、上記ｎ−ｍ個のチャ
ネルはサラウンドレフトチャネルとサラウンドライトチ
ャネルである。In the transmission device of the present invention, the n channels are a center channel, a left channel, a right channel, a surround left channel and a surround right channel, and the n-m channels are surround left channels. Surround light channel. Alternatively, the n channels are a center channel, a left channel, a right channel, a left center channel, a right center channel, a surround left channel and a surround right channel, and the nm channels are a surround left channel and a surround right channel. Is.

【００１１】さらに、本発明の伝送装置において、上記
第１の符号化方式は、入力ディジタルオーディオ信号を
複数帯域に分割し、各帯域毎のディジタルオーディオ信
号を複数サンプル毎にブロック化し、各ブロック単位で
直交変換したスペクトル成分を、聴覚特性に応じて適応
的に圧縮符号化する方式であり、上記第２の符号化方式
は、入力ディジタルオーディオ信号を複数サンプル毎に
直交変換した係数情報とそれに関連するサブ情報を得、
各チャネルのエネルギに応じて各チャネルのビット配分
量を決定し、このビット配分量で適応的に圧縮符号化す
る方式である。なお、上記伝送は、記録媒体への記録を
含む。Further, in the transmission apparatus of the present invention, the first encoding method divides the input digital audio signal into a plurality of bands, divides the digital audio signal in each band into blocks for a plurality of samples, and sets each block unit. Is a system for adaptively compressing and coding the spectrum component orthogonally transformed in accordance with the auditory characteristic, and the second coding system is related to coefficient information obtained by orthogonally transforming an input digital audio signal for every plural samples and its relation. Get sub information to
This is a method in which the bit allocation amount of each channel is determined according to the energy of each channel, and compression coding is adaptively performed with this bit allocation amount. The above transmission includes recording on a recording medium.

【００１２】また、本発明のディジタルオーディオ信号
の受信装置は、複数チャネル（ｎチャネル、ｎは３より
大なる正の整数）のディジタルオーディオ信号のうち、
ｍ個（ｎ＞ｍ）のチャネルのディジタルオーディオ信号
であって他のｎ−ｍ個のチャネルのディジタルオーディ
オ信号より聴感上影響力の高い信号に対しては第１の圧
縮率の第１の符号化方式で圧縮符号化がなされていると
共に、上記ｎ−ｍ個のチャネルのディジタルオーディオ
信号に対しては上記第１の圧縮率よりも高い第２の圧縮
率を有する第２の符号化方式で圧縮符号化がなされた信
号を受信するディジタルオーディオ信号の受信装置であ
り、上記ｎ個のチャネルのディジタルオーディオ信号の
うち、上記ｍ個のチャネルのディジタルオーディオ信号
に対して上記第１の符号化方式に対応する第１の伸張復
号化を施す第１の復号化手段と、上記ｎ−ｍ個のチャネ
ルのディジタルオーディオ信号に対して上記第２の符号
化方式に対応する第２の伸張復号化を施す第２の復号化
手段とを有することを特徴とするものである。Further, the digital audio signal receiving apparatus of the present invention is a digital audio signal of a plurality of channels (n channels, n is a positive integer greater than 3),
The first code having the first compression ratio for a digital audio signal of m (n> m) channels, which has a higher auditory influence than other digital audio signals of nm channels. The second encoding method has a second compression rate higher than the first compression rate with respect to the digital audio signals of the n−m channels while being compression-encoded by the encoding method. A receiver for a digital audio signal for receiving a compression-encoded signal, wherein the first encoding method is applied to the digital audio signals of the m channels among the digital audio signals of the n channels. Corresponding to the second encoding method for the digital audio signals of the n−m channels and the first decoding means for performing the first decompression / decoding. It is characterized in that a second decoding means for performing a second decompression and decoding.

【００１３】[0013]

【作用】本発明のディジタルオーディオ信号の伝送装置
によれば、複数チャネルの信号のうち、聴感上影響力の
高い信号からなるチャネルの信号に対しては聴感上の劣
化が少ない第１の圧縮率の第１の符号化方式によって圧
縮符号化を行い、聴感上影響力の低い信号からなるチャ
ネルの信号に対しては第１の圧縮率よりも高い第２の圧
縮率の第２の符号化方式で圧縮符号化を行う。According to the digital audio signal transmission apparatus of the present invention, the first compression rate is less likely to be deteriorated in hearing for a signal of a channel consisting of a signal having a high hearing influence among a plurality of channels. Compression encoding is performed by the first encoding method of No. 2, and a second encoding method of a second compression rate higher than the first compression rate for a signal of a channel composed of a signal having a low auditory influence. The compression coding is performed with.

【００１４】また、本発明のディジタルオーディオ信号
の受信装置によれば、複数チャネルの信号のうち、聴感
上影響力の高い信号からなるチャネルの信号に対して聴
感上の劣化が少ない第１の圧縮率の第１の符号化方式に
よって圧縮符号化がされた信号と、聴感上影響力の低い
信号からなるチャネルの信号に対して第１の圧縮率より
も高い第２の圧縮率の第２の符号化方式で圧縮符号化が
なされた信号を、それぞれ復号化するようにしている。Further, according to the digital audio signal receiving apparatus of the present invention, the first compression which causes less deterioration in the auditory sense than the signal of the channel consisting of the signals having a high auditory influence among the signals of the plurality of channels. The second compression rate of the second compression rate higher than the first compression rate with respect to the signal of the channel composed of the signal compression-encoded by the first rate encoding method and the signal having a low auditory influence. The signals compressed and encoded by the encoding method are each decoded.

【００１５】[0015]

【実施例】以下、本発明の実施例について図面を参照し
ながら説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００１６】図１には、本発明のディジタルオーディオ
信号の伝送装置の構成を示す。本発明実施例のディジタ
ルオーディオ信号の伝送装置は、ｎ個（ｎは３より大な
る正の整数であり本実施例では例えば８個）のチャネル
のディジタルオーディオ信号を圧縮符号化して伝送する
伝送装置である。FIG. 1 shows the configuration of a digital audio signal transmission apparatus according to the present invention. A digital audio signal transmission apparatus according to an embodiment of the present invention is a transmission apparatus that compresses and encodes digital audio signals of n (n is a positive integer greater than 3; for example, 8 in this embodiment) channels. Is.

【００１７】ここで、本実施例では、上記ｎ個（８個）
のチャネルとして、例えば前述したレフトチャネル（Ｌ
ｃｈ）、レフトセンタチャネル（ＬＣｃｈ）、センタチ
ャネル（Ｃｃｈ）、サブウーファチャネル（ＳＷｃ
ｈ）、ライトセンタチャネル（ＲＣｃｈ）、ライトチャ
ネル（Ｒｃｈ）、サラウンドレフトチャネル（ＬＢｃ
ｈ）、サラウンドライトチャネル（ＲＢｃｈ）を用い
る。なお、本発明では、上記例に限らず、上記ｎ個のチ
ャネルを例えばセンタチャネルとレフトチャネルとライ
トチャネルとサラウンドレフトチャネルとサラウンドラ
イトチャネルの５チャネルとしたり、また、センタチャ
ネルとレフトチャネルとライトチャネルとレフトセンタ
チャネルとライトセンタチャネルとサラウンドレフトチ
ャネルとサラウンドライトチャネルの７チャネルとする
こともできる。Here, in this embodiment, the above n pieces (8 pieces) are used.
The left channel (L
ch), left center channel (LCch), center channel (Cch), subwoofer channel (SWc)
h), right center channel (RCch), right channel (Rch), surround left channel (LBc)
h), a surround right channel (RBch) is used. In the present invention, the n channels are not limited to the above example, but may be, for example, 5 channels of a center channel, a left channel, a right channel, a surround left channel and a surround right channel, or a center channel, a left channel and a right channel. It is also possible to use seven channels including a channel, a left center channel, a right center channel, a surround left channel and a surround right channel.

【００１８】本実施例のディジタルオーディオ信号の伝
送装置は、上記８チャネルのディジタルオーディオ信号
のうち、ｍ個（ｎ＞ｍで、図１の例ではｍ＝６）のチャ
ネルのディジタルオーディオ信号であって他のｎ−ｍ
（図１の例ではｎ−ｍ＝８−６＝２）個のチャネルのデ
ィジタルオーディオ信号より聴感上影響力の高い信号に
対しては第１の圧縮率の第１の符号化方式で圧縮符号化
する第１の圧縮符号化手段としての高聴感圧縮符号化回
路２１７〜２２２と、上記ｎ−ｍ個のチャネルのディジ
タルオーディオ信号に対しては上記第１の圧縮率よりも
高い第２の圧縮率を有する第２の符号化方式で圧縮符号
化する第２の圧縮符号化手段である高率圧縮符号化回路
２２３，２２４とを有し、上記高聴感圧縮符号化回路２
１７〜２２２からの圧縮符号化出力と上記高率圧縮符号
化回路２２３，２２４からの圧縮符号化出力を共に伝送
することを特徴とするものである。The digital audio signal transmitting apparatus according to the present embodiment is a digital audio signal of m (n> m, m = 6 in the example of FIG. 1) channels of the above eight channels of digital audio signals. Other nm
(In the example of FIG. 1, mn = 8−6 = 2) For a signal that has a higher auditory influence than a digital audio signal of the number of channels, a compression code is generated by the first encoding method with the first compression ratio. The high-audibility compression coding circuits 217 to 222 as the first compression coding means for converting the digital audio signals into the n-m channels and the second compression higher than the first compression ratio. A high-rate compression coding circuit 223, 224 which is a second compression coding means for performing compression coding by a second coding method having a rate.
It is characterized in that the compression coded outputs from 17 to 222 and the compression coded outputs from the high rate compression coding circuits 223 and 224 are both transmitted.

【００１９】なお、本実施例では、上記８チャネルのう
ち上記第１の符号化方式で圧縮する上記聴感上影響力の
高い信号からなるチャネル（６個のチャネル）を、上記
レフトチャネルとレフトセンタチャネルとセンタチャネ
ルとサブウーファチャネルとライトセンタチャネルとラ
イトチャネルとし、上記第２の符号化方式で圧縮するチ
ャネル（ｎ−ｍ個のチャネル）をサラウンドレフトチャ
ネルとサラウンドライトチャネルとしている。勿論、本
発明はこれらに限定されず、例えば上記ｎ個のチャネル
を例えばセンタチャネルとレフトチャネルとライトチャ
ネルとサラウンドレフトチャネルとサラウンドライトチ
ャネルの５チャネルとしたときには、上記ｍ個のチャネ
ルをセンタチャネルとレフトチャネルとライトチャネル
の３チャネルとし、上記ｎ−ｍ個のチャネルをサラウン
ドレフトチャネルとサラウンドライトチャネルとするこ
ともできる。同様に、上記ｎ個のチャネルを例えばセン
タチャネルとレフトチャネルとライトチャネルとレフト
センタチャネルとライトセンタチャネルとサラウンドレ
フトチャネルとサラウンドライトチャネルの７チャネル
としたときには、上記ｍ個のチャネルをセンタチャネル
とレフトチャネルとライトチャネルとし、上記ｎ−ｍ個
のチャネルをサラウンドレフトチャネルとサラウンドラ
イトチャネルとすることもできる。In the present embodiment, among the eight channels, the channels (six channels) made up of the signals having a high perceptual impact that are compressed by the first coding method are used as the left channel and the left center. A channel, a center channel, a subwoofer channel, a right center channel and a right channel, and channels (nm channels) compressed by the second encoding method are a surround left channel and a surround right channel. Of course, the present invention is not limited to these, and for example, when the above n channels are, for example, 5 channels of a center channel, a left channel, a right channel, a surround left channel and a surround right channel, the m channels are replaced by the center channel. The left channel and the right channel may be three channels, and the n−m channels may be the surround left channel and the surround right channel. Similarly, when the n channels are, for example, 7 channels of a center channel, a left channel, a right channel, a left center channel, a right center channel, a surround left channel and a surround right channel, the m channels are referred to as center channels. A left channel and a right channel may be used, and the above n−m channels may be a surround left channel and a surround right channel.

【００２０】さらに、本発明実施例の伝送装置におい
て、上記第１の符号化方式で圧縮符号化を行う高聴感圧
縮符号化回路２１７〜２２２には、前述したようなサブ
バンドコーディング等を用いたオーディオ信号の高能率
圧縮符号化方式であって人間の聴覚上の特性を利用し、
オーディオデータを約１／５に圧縮するような方式、す
なわち、入力ディジタルオーディオ信号を複数帯域に分
割し、各帯域毎のディジタルオーディオ信号を複数サン
プル毎にブロック化し、各ブロック単位で直交変換した
スペクトルデータを、人間の聴覚特性に応じて適応的に
圧縮符号化する方式（例えばいわゆるＡＴＲＡＣ：Adap
tive TRansform Acoustic Coding方式）を用いる。Further, in the transmission apparatus of the embodiment of the present invention, the above-described sub-band coding or the like is used for the high-audibility compression coding circuits 217 to 222 which perform compression coding by the first coding method. It is a high-efficiency compression encoding method for audio signals, which makes use of human auditory characteristics,
A method of compressing audio data to about ⅕, that is, a spectrum in which an input digital audio signal is divided into a plurality of bands, the digital audio signal of each band is divided into a plurality of samples, and orthogonally transformed in each block. A method of adaptively compression-coding data according to human auditory characteristics (for example, so-called ATRAC: Adap
tive TRansform Acoustic Coding method) is used.

【００２１】また、第２の符号化方式で圧縮符号化を行
う高率圧縮符号化回路２２３，２２４には、入力ディジ
タルオーディオ信号を複数サンプル毎に直交変換した係
数データとそれに関連するサブ情報（語長情報やスケー
ルファクタの情報）を得、各チャネルのエネルギに応じ
て各チャネルのビット配分を決定し、この各チャネルの
ビット配分によって適応的に圧縮符号化する方式を用い
る。勿論、本発明はこれら各符号化方式に限定されるも
のでないことは言うまでもない。Further, in the high-rate compression coding circuits 223 and 224 which carry out compression coding by the second coding method, coefficient data obtained by orthogonally transforming an input digital audio signal for every plural samples and sub-information related thereto ( The word length information and the scale factor information) are obtained, the bit allocation of each channel is determined according to the energy of each channel, and the compression encoding is adaptively performed by the bit allocation of each channel. Needless to say, the present invention is not limited to each of these encoding methods.

【００２２】さらに、本実施例では、上記伝送として、
例えば映画フィルムへの記録や、光ディスク，光磁気デ
ィスク，相変化型光ディスク，磁気ディスク等のディス
ク状記録媒体、磁気テープ等のテープ状記録媒体への記
録、半導体メモリ，ＩＣカードなどへの記録を挙げるこ
とができる。Further, in this embodiment, the above transmission is as follows.
For example, recording on motion picture film, recording on disk-shaped recording media such as optical disks, magneto-optical disks, phase-change optical disks, magnetic disks, tape-shaped recording media such as magnetic tape, recording on semiconductor memory, IC cards, etc. Can be mentioned.

【００２３】なお、上記伝送を映画フィルムへの記録と
した場合には、例えば図２に示すようにスピーカが配置
されるディジタルサラウンドシステムに対応することに
なる。各チャネルは、センタ（Ｃ）チャネル、サブウー
ファ（ＳＷ）チャネル、レフト（Ｌ）チャネル、レフト
センタ（ＣＬ）チャネル、ライト（Ｒ）チャネル、ライ
トセンタ（ＣＲ）チャネル、レフトサラウンド（ＬＢ）
チャネル、ライトサラウンド（ＲＢ）チャネルの８つで
ある。When the transmission is recorded on a motion picture film, it corresponds to a digital surround system in which speakers are arranged as shown in FIG. 2, for example. Each channel is a center (C) channel, a subwoofer (SW) channel, a left (L) channel, a left center (CL) channel, a right (R) channel, a right center (CR) channel, a left surround (LB) channel.
There are eight channels, a light surround (RB) channel.

【００２４】ここで、この図２において、上記映画フィ
ルムに記録する上記８チャネルの各チャネルは、例えば
当該映画フィルムの画像記録領域から再生された画像が
映写機（プロジェクタ１００）によって投影されるスク
リーン１０１側に配置されたレフトスピーカ１０６，レ
フトセンタースピーカ１０４，センタースピーカ１０
２，ライトセンタースピーカ１０５，ライトスピーカ１
０７，サラウンドレフトスピーカ１０８及び２００，サ
ラウンドライトスピーカ１０９及び２０１，サブウーフ
ァスピーカ１０３と対応するものである。In FIG. 2, each of the eight channels recorded on the motion picture film has a screen 101 on which an image reproduced from an image recording area of the motion picture film is projected by a projector (projector 100). Left speaker 106, left center speaker 104, center speaker 10 arranged on the side
2, light center speaker 105, light speaker 1
07, surround left speakers 108 and 200, surround right speakers 109 and 201, and subwoofer speaker 103.

【００２５】上記センタスピーカ１０２は、スクリーン
１０１側の中央に配置され、センタチャネルのオーディ
オデータによる再生音を出力するもので例えば俳優のせ
りふ等の最も重要な再生音を出力する。上記サブウーフ
ァスピーカ１０３は、サブウーファチャネルのオーディ
オデータによる再生音を出力するもので、例えば爆発音
などの低域の音というよりは振動として感じられる音を
効果的に出力するものであり、爆発シーンなどに効果的
に使用されることが多いものである。上記レフトスピー
カ１０６及びライトスピーカ１０７は、上記スクリーン
１０１の左右に配置され、レフトチャネルのオーディオ
データによる再生音とライトチャネルのオーディオデー
タによる再生音を出力するもので、ステレオ音響効果を
発揮する。上記レフトセンタスピーカ１０４とライトセ
ンタスピーカ１０５は、上記センタスピーカ１０２と上
記レフトスピーカ１０６及びライトスピーカ１０７との
間に配置され、レフトセンタチャネルのオーディオデー
タによる再生音とライトセンタチャネルのオーディオデ
ータによる再生音を出力するもので、それぞれ上記レフ
トスピーカ１０６及びライトスピーカ１０７の補助的な
役割を果たす。特にスクリーン１０１が大きく収容人数
の多い映画館等では、座席の位置によって音像の定位が
不安定になりやすいが、上記レフトセンタスピーカ１０
４とライトセンタスピーカ１０７を付加することによ
り、音像のよりリアルな定位を作り出すのに効果を発揮
する。さらに、上記サラウンドレフトスピーカ１０８と
サラウンドライトスピーカ１０９は、観客席を取り囲む
ように配置され、サラウンドレフトチャネルのオーディ
オデータによる再生音とサラウンドライトチャネルのオ
ーディオデータによる再生音を出力するもので、残響音
や拍手、歓声に包まれた印象を与える効果がある。これ
により、より立体的な音像を作り出すことができる。The center speaker 102 is arranged in the center of the screen 101 side and outputs a reproduced sound based on the audio data of the center channel, and outputs the most important reproduced sound such as an actor's dialogue. The subwoofer speaker 103 outputs a reproduced sound based on audio data of the subwoofer channel, and effectively outputs a sound that is felt as vibration rather than a low frequency sound such as an explosion sound, and an explosion scene. Is often used effectively. The left speaker 106 and the right speaker 107 are arranged on the left and right sides of the screen 101, and output a reproduced sound by the left channel audio data and a reproduced sound by the right channel audio data, and exhibit a stereo sound effect. The left center speaker 104 and the right center speaker 105 are arranged between the center speaker 102 and the left speaker 106 and the right speaker 107, and are reproduced by the left center channel audio data and reproduced by the right center channel audio data. It outputs sound and plays an auxiliary role for the left speaker 106 and the right speaker 107, respectively. Particularly in a movie theater or the like where the screen 101 is large and the number of persons accommodated is large, the localization of the sound image tends to become unstable depending on the position of the seat.
4 and the light center speaker 107 are added, it is effective in creating a more realistic localization of the sound image. Further, the surround left speaker 108 and the surround right speaker 109 are arranged so as to surround the spectators' seats, and output a reproduced sound by the audio data of the surround left channel and a reproduced sound by the audio data of the surround right channel. It has the effect of giving the impression of being wrapped in clapping and cheering. As a result, a more stereoscopic sound image can be created.

【００２６】図１に戻って、上述したような８チャネル
のディジタルオーディオ信号を扱う図１の伝送装置は、
上記８チャネルのディジタルオーディオ信号を得るため
に、各チャネルに対応するマイクロホン２０１〜２０８
からのアナログオーディオ信号をそれぞれ対応するＡ／
Ｄ変換器２０９〜２１６によってディジタルオーディオ
信号に変換する。Returning to FIG. 1, the transmission apparatus of FIG. 1 for handling the above-described 8-channel digital audio signal is
In order to obtain the above 8-channel digital audio signals, the microphones 201 to 208 corresponding to the respective channels are obtained.
Corresponding analog audio signals from A /
The D converters 209 to 216 convert the digital audio signals.

【００２７】各Ａ／Ｄ変換器２０９〜２１６からの各チ
ャネルのディジタルオーディオ信号は、それぞれ対応す
る圧縮符号化回路２１７〜２２４に送られて各々圧縮符
号化される。なお、これら各圧縮符号化回路２１７〜２
２４の具体的構成については後述する。The digital audio signals of the respective channels from the respective A / D converters 209 to 216 are sent to the corresponding compression encoding circuits 217 to 224 and compression encoded respectively. Note that each of these compression encoding circuits 217-2
The specific configuration of 24 will be described later.

【００２８】上記高聴感圧縮符号化回路２１７〜２２２
からの圧縮符号化されたオーディオデータは、それぞれ
対応する出力端子２３３〜２３８から後段の構成に送ら
れる。The above-mentioned high-audibility compression encoding circuits 217 to 222.
The compressed and encoded audio data from are sent to the subsequent stage from the corresponding output terminals 233 to 238.

【００２９】一方、上記高率圧縮符号化回路２２３，２
２４からの圧縮符号化されたオーディオデータは、それ
ぞれハイパスフィルタ２２５，２２６によって高域成分
のみが取り出された後、加算器２２７によって加算され
る。また、上記高率圧縮符号化回路２２３，２２４から
の圧縮符号化されたオーディオデータは、それぞれロー
パスフィルタ２２８，２２９にも送られる。上記加算器
２２７からのデータは、加算器２３０によって上記サラ
ウンドレフトチャネル側に対応するローパスフィルタ２
２８の出力と加算され、出力端子２３１から出力され、
上記サラウンドライトチャネルに対応するローパスフィ
ルタ２２９の出力は出力端子２３２から出力される。On the other hand, the high-rate compression coding circuits 223, 2
The compression-encoded audio data from 24 is added by the adder 227 after only the high frequency components are extracted by the high pass filters 225 and 226, respectively. The compression-encoded audio data from the high-rate compression encoding circuits 223 and 224 are also sent to low-pass filters 228 and 229, respectively. The data from the adder 227 is supplied to the low pass filter 2 corresponding to the surround left channel side by the adder 230.
28 output is added and output from the output terminal 231.
The output of the low pass filter 229 corresponding to the surround right channel is output from the output terminal 232.

【００３０】ここで、上記サラウンドレフトチャネルと
サラウンドライトチャネルにおいて上記ハイパスフィル
タ２２５，２２６からの出力を加算器２２７によって加
算し、当該加算器２２７の加算出力を、更に加算器２２
８によってサラウンドレフトチャネルのローパスフィル
タ２２８出力に加算するのは以下の理由による。Here, in the surround left channel and the surround right channel, the outputs from the high pass filters 225 and 226 are added by the adder 227, and the addition output of the adder 227 is further added.
The reason for adding to the output of the low-pass filter 228 of the surround left channel by 8 is as follows.

【００３１】すなわち、人間の耳は高域の成分に対する
定位感が少なく、このため高域成分については例えば複
数個のスピーカのうちのいずれか１つのスピーカからで
ていても人間にはどのスピーカから出てきているのか聞
き取り難いという性質がある。このため、複数チャネル
のオーディオ信号の各高域成分を、そのうちの例えば１
チャネルに対応するスピーカのみに送るようにしても、
人間には各チャネルのオーディオ信号の高域成分が当該
１つのスピーカのみから出力されているとは感じられな
い。したがって、本実施例では、上述のように、上記８
チャネルのうちの例えばサラウンドレフトチャネルとサ
ラウンドライトチャネルの音声の高域成分を加算して、
例えばサラウンドレフトチャネルの低域成分の音声に加
えるようにしている。これにより、サラウンドレフトチ
ャネルとサラウンドライトチャネルの２つのチャネルの
高域成分を１つのチャネル分に圧縮できることになる。
なお、このような高域成分におけるチャネル間のクロス
トーク処理は、図１の例のような上記サラウンドレフト
チャネルとサラウンドライトチャネル間のみならず、図
示は省略しているが他のチャネル間（例えば全チャネ
ル）についても同様に行うことができる。このように全
チャネルについて高域成分のチャネルクロストーク処理
を行うことで更に圧縮率を上げることが可能となる。That is, the human ear has less sense of localization for high-frequency components, and therefore, for a high-frequency component, no matter which speaker from a plurality of speakers, for example, which human speaker It is difficult to hear if it is coming out. Therefore, each high frequency component of the audio signals of a plurality of channels is
Even if you send only to the speaker corresponding to the channel,
It is not perceived by humans that the high frequency component of the audio signal of each channel is output from only the one speaker. Therefore, in this embodiment, as described above,
Of the channels, for example, add the high frequency components of the surround left channel and surround right channel audio,
For example, it is added to the sound of the low frequency component of the surround left channel. As a result, the high frequency components of the two channels, the surround left channel and the surround right channel, can be compressed into one channel.
The crosstalk processing between channels in such a high frequency component is performed not only between the surround left channel and the surround right channel as in the example of FIG. 1 but also between other channels (not shown) (for example, The same can be done for all channels). In this way, it is possible to further increase the compression rate by performing channel crosstalk processing of high frequency components for all channels.

【００３２】上述した図１の各出力端子２３３〜２３８
及び２３１，２３２からの各チャネルの圧縮符号化デー
タは、図３のそれぞれ対応する端子２３３〜２３８及び
２３１，２３２を介してマルチプレクサ２４１に送られ
てマルチプレクスされる。当該マルチプレクサ２４１の
出力は出力端子２４２を介して伝送路に伝送若しくは上
述した本発明の映画フィルムやディスク状記録媒体，テ
ープ状記録媒体などに記録される。The output terminals 233 to 238 shown in FIG.
, 231 and 232 of each channel are sent to the multiplexer 241 via the corresponding terminals 233 to 238 and 231, 232 of FIG. 3, respectively, and are multiplexed. The output of the multiplexer 241 is transmitted to the transmission line via the output terminal 242 or recorded on the above-mentioned movie film of the present invention, the disc-shaped recording medium, the tape-shaped recording medium, or the like.

【００３３】次に、上記図１の各高聴感圧縮符号化回路
２１７〜２２２について具体的に説明する。Next, the high-hearing-compression encoding circuits 217 to 222 shown in FIG. 1 will be described in detail.

【００３４】先ず、図４には、上記各高聴感圧縮符号化
回路２１７〜２２２の一具体例の構成を示す。First, FIG. 4 shows the configuration of a specific example of each of the above-described high-audibility compression encoding circuits 217 to 222.

【００３５】この図４に示す圧縮符号化回路では、入力
ディジタル信号をフィルタなどにより複数の周波数帯域
に分割すると共に、各周波数帯域毎に直交変換を行っ
て、得られた周波数軸のスペクトルデータを、後述する
人間の聴覚特性を考慮したいわゆる臨界帯域幅（クリテ
ィカルバンド）毎に適応的にビット配分して符号化して
いる。この時、高域では臨界帯域幅を更に分割した帯域
を用いる。もちろんフィルタなどによる非ブロッキング
の周波数分割幅は等分割幅としてもよい。さらに、本実
施例においては、直交変換の前に入力信号に応じて適応
的にブロックサイズ（ブロック長）を変化させると共
に、クリティカルバンド単位もしくは高域では臨界帯域
幅（クリティカルバンド）を更に細分化したブロックで
フローティング処理を行っている。In the compression encoding circuit shown in FIG. 4, the input digital signal is divided into a plurality of frequency bands by a filter or the like, and orthogonal transformation is performed for each frequency band to obtain the spectrum data on the frequency axis. The bits are adaptively allocated and encoded for each so-called critical band (critical band) in consideration of human auditory characteristics described later. At this time, a band obtained by further dividing the critical bandwidth is used in the high band. Of course, the non-blocking frequency division width by a filter or the like may be an equal division width. Further, in this embodiment, the block size (block length) is adaptively changed according to the input signal before the orthogonal transformation, and the critical bandwidth (critical band) is further subdivided in the critical band unit or in the high range. Floating process is performed in the block.

【００３６】なお、上記帯域分割用フィルタとしては、
例えばＱＭＦ等のフィルタがあり、これは1976 R.E.Cro
chiere Digital coding of speech in subbands Bell
Syst.Tech. J. Vol.55, No.8 1976に、述べられてい
る。同じくICASSP 83, BOSTONPolyphase Quadrature f
ilters-A new subband coding technique Joseph H.Ro
thweilerには等バンド幅のフィルタ分割手法及び装置が
述べられている。As the band division filter,
For example, there is a filter such as QMF, which is 1976 RECro
chiere Digital coding of speech in subbands Bell
Syst.Tech. J. Vol.55, No.8 1976. Also ICASSP 83, BOSTONPolyphase Quadrature f
ilters-A new subband coding technique Joseph H.Ro
Thweiler describes an equal bandwidth filter partitioning method and apparatus.

【００３７】さらに、上記クリティカルバンドとは、人
間の聴覚特性を考慮して分割された周波数帯域であり、
ある純音の周波数近傍の同じ強さの狭帯域バンドノイズ
によって当該純音がマスクされるときのそのノイズの持
つ帯域のことである。このクリティカルバンドは、高域
ほど帯域幅が広くなっており、上記０〜２２ｋＨｚの全
周波数帯域は例えば２５のクリティカルバンドに分割さ
れている。Further, the critical band is a frequency band divided in consideration of human auditory characteristics,
It is a band of a certain pure tone when the pure tone is masked by a narrow band noise having the same strength near the frequency of the pure tone. This critical band has a wider bandwidth as it goes higher, and the entire frequency band of 0 to 22 kHz is divided into, for example, 25 critical bands.

【００３８】すなわち、図４において、入力端子１０に
は例えば０〜２２ｋＨｚのオーディオＰＣＭ信号が供給
されている。この入力信号は、例えばいわゆるＱＭＦ等
の帯域分割フィルタ１１により０〜１１ｋＨｚ帯域と１
１ｋ〜２２ｋＨｚ帯域とに分割され、０〜１１ｋＨｚ帯
域の信号は同じくいわゆるＱＭＦ等の帯域分割フィルタ
１２により０〜５．５ｋＨｚ帯域と５．５ｋ〜１１ｋＨ
ｚ帯域とに分割される。帯域分割フィルタ１１からの１
１ｋ〜２２ｋＨｚ帯域の信号は、直交変換回路の一例で
あるＭＤＣＴ（Modified Discrete Cosine Transform）
回路１３に送られ、帯域分割フィルタ１２からの５．５
ｋ〜１１ｋＨｚ帯域の信号はＭＤＣＴ回路１４に送ら
れ、帯域分割フィルタ１２からの０〜５．５ｋＨｚ帯域
の信号はＭＤＣＴ回路１５に送られることにより、それ
ぞれＭＤＣＴ処理される。That is, in FIG. 4, an audio PCM signal of 0 to 22 kHz, for example, is supplied to the input terminal 10. This input signal is divided into 0 to 11 kHz band and 1 by a band division filter 11 such as so-called QMF.
The signal of 0 to 11 kHz band is divided into the 1 to 22 kHz band and the 0 to 5.5 kHz band and 5.5 to 11 kHz by the band dividing filter 12 such as so-called QMF.
and z-band. 1 from the band division filter 11
A signal in the 1 kHz to 22 kHz band is an MDCT (Modified Discrete Cosine Transform) that is an example of an orthogonal transformation circuit.
5.5 sent from the band division filter 12 to the circuit 13.
A signal in the k to 11 kHz band is sent to the MDCT circuit 14, and a signal in the 0 to 5.5 kHz band from the band division filter 12 is sent to the MDCT circuit 15 to be MDCT processed.

【００３９】各ＭＤＣＴ回路１３、１４、１５では、各
帯域毎に設けたブロック決定回路１９、２０、２１によ
り後述するように決定されたブロックサイズに基づいて
ＭＤＣＴ処理がなされる。In each MDCT circuit 13, 14, 15 MDCT processing is performed based on the block size determined as described later by the block determination circuits 19, 20, 21 provided for each band.

【００４０】なお、各ＭＤＣＴ回路１３、１４、１５に
おける上記ＭＤＣＴ処理については、ICASSP 1987 Sub
band/Transform Coding Using Filter Bank Designs B
asedon Time Domain Aliasing Cancellation J.P.Princ
en A.B.Bradley Univ. ofSurrey Royal Melbourne Ins
t.of Tech.に述べられている。The MDCT processing in each MDCT circuit 13, 14, 15 is described in ICASSP 1987 Sub.
band / Transform Coding Using Filter Bank Designs B
asedon Time Domain Aliasing Cancellation JPPrinc
en ABBradley Univ. of Surrey Royal Melbourne Ins
t.of Tech.

【００４１】上記ブロック決定回路１９、２０、２１で
決定されたブロックサイズを示す情報は、後述の適応ビ
ット配分符号化回路１６、１７、１８に送られると共
に、出力端子２３、２５、２７から出力される。The information indicating the block size decided by the block decision circuits 19, 20, 21 is sent to the adaptive bit allocation coding circuits 16, 17, 18 described later and output from the output terminals 23, 25, 27. To be done.

【００４２】ここで、上記適応ビット配分符号化回路１
６、１７、１８においては、各ＭＤＣＴ回路１３、１
４、１５の出力から、上記臨界帯域（クリティカルバン
ド）または高域では更にクリティカルバンドを分割した
帯域毎のエネルギが、例えば当該バンド内での各振幅値
の２乗平均の平方根を計算すること等により求められ、
この計算結果に基づいてビット配分が行われる。もちろ
ん、上記スケールファクタそのものを以後のビット配分
の為に用いるようにしてもよい。この場合には新たなエ
ネルギ計算の演算が不要となるため、ハード規模の節約
となる。また、各バンド毎のエネルギの代わりに、振幅
値のピーク値、平均値等を用いることも可能である。な
お、各ＭＤＣＴ回路１３、１４、１５にてＭＤＣＴ処理
されて得られた周波数領域のスペクトルデータあるいは
ＭＤＣＴ係数データは、いわゆる臨界帯域（クリティカ
ルバンド）または高域では更にクリティカルバンドを分
割した帯域毎にまとめられて当該適応ビット配分符号化
回路１６、１７、１８に送られている。Here, the adaptive bit allocation encoding circuit 1
6, 17 and 18, the MDCT circuits 13 and 1
From the outputs of 4 and 15, the energy of each critical band (critical band) or a band obtained by further dividing the critical band in the high band is calculated, for example, the root mean square of each amplitude value in the band is calculated. Sought by
Bit allocation is performed based on the calculation result. Of course, the scale factor itself may be used for subsequent bit allocation. In this case, the calculation of new energy is not required, and the hardware scale is saved. Further, instead of the energy for each band, it is also possible to use the peak value, the average value, etc. of the amplitude values. The spectrum data in the frequency domain or the MDCT coefficient data obtained by the MDCT processing in each MDCT circuit 13, 14, 15 is a so-called critical band (critical band) or a band in which a critical band is further divided in a high band. The data are collected and sent to the adaptive bit allocation coding circuits 16, 17, and 18.

【００４３】さらに、上記適応ビット配分符号化回路１
６、１７、１８では、上記ブロックサイズの情報、及び
臨界帯域（クリティカルバンド）または高域では更にク
リティカルバンドを分割した帯域毎に割り当てられたビ
ット数に応じて各スペクトルデータ（あるいはＭＤＣＴ
係数データ）を再量子化（正規化して量子化）するよう
にしている。適応ビット配分符号化回路１６、１７、１
８で符号化されたデータは、出力端子２２、２４、２６
を介して取り出される。また、当該適応ビット配分符号
化回路１６、１７、１８では、どのような信号の大きさ
に関する正規化がなされたかを示すスケールファクタ
と、どのようなビット長で量子化がされたかを示すビッ
ト長情報も求めており、これらも同時に出力端子２２、
２４、２６から出力される。Further, the adaptive bit allocation encoding circuit 1
6, 17, and 18 each spectrum data (or MDCT) according to the block size information and the number of bits assigned to each band obtained by further dividing the critical band (critical band) or the critical band in the high band.
The coefficient data) is requantized (normalized and quantized). Adaptive bit allocation coding circuits 16, 17, 1
The data encoded in 8 are output terminals 22, 24, 26
Taken out through. Further, in the adaptive bit allocation encoding circuits 16, 17, and 18, a scale factor indicating what kind of signal magnitude is normalized and a bit length indicating what kind of bit length is quantized. Information is also sought, and these are also output terminals 22,
It is output from 24 and 26.

【００４４】これら各出力端子２２〜２７からのデータ
はまとめられて上記各高聴感圧縮符号化回路２１７〜２
２２の出力となる。The data from the output terminals 22 to 27 are collected and combined into the above-described high-audibility compression encoding circuits 217 to 2 respectively.
22 output.

【００４５】ところで、上記図１及び図４の例では、各
高聴感圧縮符号化回路２１７〜２２２においては各チャ
ネル毎にビット配分を行って圧縮符号化を行っている
が、これら各高聴感圧縮符号化回路２１７〜２２２間で
ビット配分を行う（すなわち各回路２１７〜２２２に対
応するチャネル間でビット配分を行う）ことも可能であ
る。In the examples of FIGS. 1 and 4, the high-audibility compression encoding circuits 217 to 222 perform bit-wise bit allocation for each channel for compression encoding. It is also possible to perform bit allocation among the encoding circuits 217 to 222 (that is, perform bit allocation among channels corresponding to the circuits 217 to 222).

【００４６】この高聴感圧縮符号化回路２１７〜２２２
間でビット配分を行う場合の各回路構成について、以下
に説明する。図５には、チャネル間でビット配分を行う
高聴感圧縮符号化回路の構成を示す。なお、この図５の
構成のうち、適応ビット配分符号化回路１６，１７，１
８を除く他の構成要素は図４の対応する構成要素と基本
的には同一のものである。The high-audibility compression encoding circuits 217 to 222
Each circuit configuration in the case of performing bit allocation between channels will be described below. FIG. 5 shows the configuration of a high-audibility compression encoding circuit that allocates bits among channels. In the configuration of FIG. 5, adaptive bit allocation coding circuits 16, 17, 1
Other components except 8 are basically the same as the corresponding components in FIG.

【００４７】この図５に示される圧縮符号化回路におい
て、図４同様のブロック決定回路１９、２０、２１によ
り決定される各ＭＤＣＴ回路１３、１４、１５でのブロ
ックサイズの具体例を図６のＡ及びＢに示す。なお、図
６のＡには直交変換ブロックサイズが長い場合（ロング
モードにおける直交変換ブロックサイズ）を、図６のＢ
には直交変換ブロックサイズが短い場合（ショートモー
ドにおける直交変換ブロックサイズ）を示ししている。
この図６の具体例においては、３つのフィルタ出力は、
それぞれ２つの直交変換ブロックサイズを持つ。すなわ
ち、低域側の０〜５．５ｋＨｚ帯域の信号及び中域の
５．５ｋ〜１１ｋＨｚ帯域の信号に対しては、長いブロ
ック長の場合（図６のＡ）は１ブロック内のサンプル数
を１２８サンプルとし、短いブロックが選ばれた場合
（図６のＢ）には１ブロック内のサンプル数を３２サン
プル毎のブロックとしている。これに対して高域側の１
１ｋ〜２２ｋＨｚ帯域の信号に対しては、長いブロック
長の場合（図６のＡ）は１ブロック内のサンプル数を２
５６サンプルとし、短いブロックが選ばれた場合（図６
のＢ）には１ブロック内のサンプル数を３２サンプル毎
のブロックとしている。このようにして短いブロックが
選ばれた場合には各帯域の直交変換ブロックのサンプル
数を同じとして高域程時間分解能を上げ、なおかつブロ
ック化に使用するウインドウの種類を減らしている。な
お、図５の具体例のブロック決定回路１９、２０、２１
で決定されたブロックサイズを示す情報は、後述の適応
ビット配分符号化回路１６、１７、１８に送られると共
に、出力端子２３、２５、２７から出力される。In the compression encoding circuit shown in FIG. 5, a concrete example of the block size in each MDCT circuit 13, 14, 15 determined by the block determination circuits 19, 20, 21 similar to that in FIG. 4 is shown in FIG. Shown in A and B. 6A shows a case where the orthogonal transform block size is long (orthogonal transform block size in the long mode), FIG.
Shows the case where the orthogonal transform block size is short (orthogonal transform block size in the short mode).
In the specific example of FIG. 6, the three filter outputs are
Each has two orthogonal transform block sizes. That is, for a signal in the low band of 0 to 5.5 kHz and a signal in the mid band of 5.5 to 11 kHz, the number of samples in one block is set to a long block length (A in FIG. 6). When 128 blocks are selected and a short block is selected (B in FIG. 6), the number of samples in one block is a block for every 32 samples. On the other hand, the high side 1
For a signal in the 1 kHz to 22 kHz band, if the block length is long (A in FIG. 6), the number of samples in one block is 2
56 samples and a short block is selected (Fig. 6
In B), the number of samples in one block is a block for every 32 samples. When a short block is selected in this way, the number of samples of orthogonal transform blocks in each band is set to be the same, the time resolution is increased in the higher frequency range, and the number of windows used for blocking is reduced. In addition, the block determination circuits 19, 20, and 21 of the specific example of FIG.
The information indicating the block size determined in step 1 is sent to the adaptive bit allocation coding circuits 16, 17 and 18, which will be described later, and is also output from the output terminals 23, 25 and 27.

【００４８】この図５の具体例の適応ビット配分符号化
回路１６、１７、１８では、上記ブロックサイズの情
報、及び臨界帯域（クリティカルバンド）または高域で
は更にクリティカルバンドを分割した帯域毎に割り当て
られたビット数に応じて各スペクトルデータ（あるいは
ＭＤＣＴ係数データ）を再量子化（正規化して量子化）
するようにしている。この時、適応ビット配分符号化回
路１６、１７、１８では、各チャネル間でのチャネルビ
ット配分、すなわち各チャネルの信号全体を見ることに
より、チャネル毎の使用ビット量を適応的に最適に振り
分けるビット配分を同時に行う。この場合の当該チャネ
ルビット配分は、後述する適応ビット配分回路から端子
２８を介して供給されたチャネルビット配分信号に基づ
いて行われる。このようにして符号化されたデータは、
出力端子２２、２４、２６を介して取り出される。ま
た、当該適応ビット配分符号化回路１６、１７、１８で
は、どのような信号の大きさに関する正規化がなされた
かを示すスケールファクタと、どのようなビット長で量
子化がされたかを示すビット長情報も求めており、これ
らも同時に出力端子２２、２４、２６から出力される。In the adaptive bit allocation coding circuits 16, 17 and 18 of the specific example of FIG. 5, the block size information and the critical band (critical band) or the critical band in the high frequency band is further allocated. Re-quantization (normalized and quantized) of each spectrum data (or MDCT coefficient data) according to the number of bits
I am trying to do it. At this time, the adaptive bit allocation coding circuits 16, 17, and 18 adaptively and optimally allocate the used bit amount for each channel by observing the channel bit allocation among the channels, that is, the entire signal of each channel. Allocate simultaneously. The channel bit allocation in this case is performed based on the channel bit allocation signal supplied from the adaptive bit allocation circuit described later through the terminal 28. The data encoded in this way is
It is taken out through the output terminals 22, 24 and 26. Further, in the adaptive bit allocation encoding circuits 16, 17, and 18, a scale factor indicating what kind of signal magnitude is normalized and a bit length indicating what kind of bit length is quantized. Information is also sought, and these are also output from the output terminals 22, 24 and 26 at the same time.

【００４９】次に、上記ビット配分を行うための適応ビ
ット配分回路の具体的な構成及び動作を図７を用いて説
明する。なお、この図７の例では、図１に対応して前記
８チャネルのうちの６チャネルについてのビット配分に
対応している。すなわち、聴感上影響の高い信号からな
るチャネルであるレフトチャネルとレフトセンタチャネ
ルとセンタチャネルとサブウーファチャネルとライトセ
ンタチャネルとライトチャネルの６チャネルに対応して
いる。Next, the specific configuration and operation of the adaptive bit allocation circuit for performing the above bit allocation will be described with reference to FIG. The example of FIG. 7 corresponds to the bit allocation for 6 channels out of the 8 channels corresponding to FIG. That is, it corresponds to 6 channels of a left channel, a left center channel, a center channel, a subwoofer channel, a right center channel and a right channel, which are channels composed of signals that have a great influence on the auditory sense.

【００５０】この図７において、各チャネルの共通部に
ついて例えばレフトチャネル（Ｌｃｈ）を用いて説明す
る（他のチャネルについては同一の指示符号を付して説
明は省略している）と、レフトチャネルの入力情報信号
は当該レフトチャネル用の入力端子３１に与えられる。
なお、この端子３１は、図５の端子２９と対応してい
る。この入力情報信号はマッピング回路（Mapping)３２
により時間領域の信号から周波数領域に展開される。こ
こで、フィルタによる場合には、サブバンド信号として
時間領域サンプルが得られることになり、直交変換出力
の場合及びフィルタリング後に直交変換を行う場合には
周波数領域サンプルが得られることになる。In FIG. 7, the common part of each channel will be described by using, for example, the left channel (Lch) (the other channels will be denoted by the same reference numerals and description thereof will be omitted). Input information signal is applied to the input terminal 31 for the left channel.
The terminal 31 corresponds to the terminal 29 of FIG. This input information signal is a mapping circuit (Mapping) 32.
Is expanded from the time domain signal to the frequency domain. Here, in the case of using a filter, time domain samples are obtained as subband signals, and in the case of orthogonal transform output and in the case of performing orthogonal transform after filtering, frequency domain samples are obtained.

【００５１】これらのサンプルは、ブロッキング(Block
ing)回路３３によって複数サンプル毎にまとめられる。
ここで、フィルタによる場合には時間領域の複数サンプ
ルがまとめられることになり、直交変換出力の場合及び
フィルタリング後に直交変換を行う場合には周波数領域
の複数サンプルがまとめられることになる。These samples are
ing) circuit 33 collects a plurality of samples.
Here, a plurality of samples in the time domain are combined when using a filter, and a plurality of samples in the frequency domain are combined when an orthogonal transform output and when orthogonal transform is performed after filtering.

【００５２】また、本具体例では、マッピングの途中の
ＭＤＣＴ入力時間領域信号の時間変化を時間変化算出回
路３４により算出する。Further, in this specific example, the time change of the MDCT input time domain signal during the mapping is calculated by the time change calculation circuit 34.

【００５３】上記ブロッキング回路３３により複数のサ
ンプル毎にまとめられた各サンプルは正規化回路３７で
正規化される。ここで、正規化のための係数であるスケ
ールファクタは、スケールファクタ算出回路３５によっ
て得られる。同時にトーナリティの大きさがトーナリテ
ィ算出回路３６で算出される。Each sample collected by the blocking circuit 33 into a plurality of samples is normalized by a normalizing circuit 37. Here, the scale factor, which is a coefficient for normalization, is obtained by the scale factor calculation circuit 35. At the same time, the tonality magnitude is calculated by the tonality calculation circuit 36.

【００５４】以上で求められるパラメータは、ビット配
分回路３８でビット配分のために使用される。ここで、
ＭＤＣＴ係数を表現して伝送又は記録に使えるビット数
を、全チャネル（上記６チャネル）で８００Ｋｂｐｓと
すると、本具体例のビット配分回路３８では、チャネル
ビット配分を含む第１のビット配分（第１のビット配分
量）と、チャネルビット配分を含まない第２のビット配
分（第２のビット配分量）の２つを求める。The parameters obtained as described above are used by the bit allocation circuit 38 for bit allocation. here,
Assuming that the number of bits that can be used for transmission or recording by expressing MDCT coefficients is 800 Kbps for all channels (the above-mentioned 6 channels), the bit allocation circuit 38 of this specific example uses a first bit allocation (first bit allocation) including channel bit allocation (first bit allocation). Of the channel bit allocation) and a second bit allocation not including the channel bit allocation (second bit allocation amount).

【００５５】先ず、チャネルビット配分を含む第１のビ
ット配分の配分手法について説明する。ここではスケー
ルファクタの周波数領域の分布をみて適応的にビット配
分を行う。First, the allocation method of the first bit allocation including the channel bit allocation will be described. Here, bit distribution is adaptively performed by observing the distribution of the scale factor in the frequency domain.

【００５６】この場合、全チャネルのスケールファクタ
の周波数領域の分布をみてチャネル間でのビット配分を
行うことで有効なビット配分を行うことができる。この
とき、複数チャネルの信号情報が、スピーカの場合のよ
うに同一音場のなかで混合されて左右の耳に達する場合
を考えると、全チャネル信号の加算されたものでマスキ
ングが作用すると考えてよいから、図８のＡ，Ｆに示す
ように、同一帯域において各チャネルが同一のノイスレ
ベルになるようにビット配分を行うことが有効である。
このための一方法としてはスケールファクタ指標の大き
さに比例したビット配分を行えばよい。すなわち、以下
の式によってビット配分を行う。In this case, effective bit allocation can be performed by observing the frequency domain distribution of the scale factors of all channels and allocating bits among the channels. At this time, considering that the signal information of multiple channels reaches the left and right ears after being mixed in the same sound field as in the case of a speaker, it is thought that masking works with the sum of all channel signals. Therefore, as shown in A and F of FIG. 8, it is effective to perform bit allocation so that each channel has the same noise level in the same band.
One method for this purpose is to perform bit allocation proportional to the size of the scale factor index. That is, bit allocation is performed by the following formula.

【００５７】Ｂｍ＝Ｂ＊（ΣＳＦｎ）／ＳＳ＝Σ（ΣＳＦｎ）Bm = B * (ΣSFn) / S S = Σ (ΣSFn)

【００５８】ここで、Ｂｍは各チャネルへのビット配分
量、Ｂは全チャネルへのビット配分量、ＳＦｎはスケー
ルファクタ指標であり概略ピーク値の対数に対応してい
る。ｎは各チャネル内のブロックフローティングバンド
番号、ｍはチャネル番号、Ｓは全チャネルのスケールフ
ァクタ指標の和である。なお、図８には、レフトチャネ
ルとライトチャネルのみ示し他の４チャネルについては
図示を省略している。Here, Bm is the bit allocation amount to each channel, B is the bit allocation amount to all channels, and SFn is a scale factor index, which corresponds to the logarithm of the approximate peak value. n is the block floating band number in each channel, m is the channel number, and S is the sum of the scale factor indexes of all channels. In FIG. 8, only the left channel and the right channel are shown and the other four channels are not shown.

【００５９】以上に加えて、ビット配分回路３８は、各
チャネルの信号の時間変化特性を検出して、この指標に
よってチャネル毎のビット配分量を変えるプロセスを持
つ。この時間変化を表す指標は次のようにして求められ
る。In addition to the above, the bit allocation circuit 38 has a process of detecting the time change characteristic of the signal of each channel and changing the bit allocation amount for each channel by this index. The index showing this time change is obtained as follows.

【００６０】図９のＡ〜Ｆに示すように、チャネルが６
チャネルあるとすると、それぞれのチャネルの情報入力
信号についてビット配分の時間単位であるビット配分時
間ブロックを時間的に４分割し、それぞれの時間ブロッ
ク（サブブロック）のピーク値を得る。そして各サブブ
ロックのピーク値が小から大へと変わるところの差分の
大きさに応じてチャネル間でビットを分け合う。ここ
で、このビット配分のために６チャネル合計でＣビット
使えるとしたとき、各チャネルの各サブブロックのピー
ク値が小から大へと変わるところの差分の大きさがそれ
ぞれａ，ｂ，ｃ，ｄ，ｅ，ｆデシベル（ｄＢ）とする
と、それぞれＣ＊ａ／Ｔ，Ｃ＊ｂ／Ｔ，・・・・・，Ｃ
＊ｆ／Ｔビット（ビット）と配分することができる。こ
こで、Ｔ＝ａ＋ｂ＋ｃ＋ｄ＋ｅ＋ｆである。信号情報が
大きくなる程度が大であるほどそのチャネルに対しての
ビット配分量が大きくなる。なお、図９には、レフトチ
ャネルとレフトセンタチャネルとライトチャネルのみ示
し他の３つのチャネルについては図示を省略している。As shown in FIG. 9A to FIG.
Assuming that there is a channel, the bit allocation time block, which is the time unit of bit allocation for the information input signal of each channel, is temporally divided into four, and the peak value of each time block (sub-block) is obtained. Then, bits are shared among channels according to the magnitude of the difference where the peak value of each sub-block changes from small to large. Here, assuming that C bits can be used in a total of 6 channels for this bit allocation, the magnitudes of the differences at which the peak value of each sub-block of each channel changes from small to large are a, b, c, respectively. If d, e, and f decibel (dB), then C * a / T, C * b / T, ..., C, respectively.
* F / T bits (bits) can be allocated. Here, T = a + b + c + d + e + f. The larger the signal information becomes, the larger the bit allocation amount for the channel becomes. In FIG. 9, only the left channel, the left center channel and the right channel are shown and the other three channels are not shown.

【００６１】次に、チャネルビット配分を含まない第２
のビット配分の配分手法について説明する。ここでは、
チャネルビット配分を含まない第２のビット配分の手法
として更に２つのビット配分からなるビット配分手法に
ついて説明する。なお，この第２のビット配分は、前記
図４における適応ビット配分符号化回路でのビット配分
処理に対応している。Next, the second not including channel bit allocation
A method of allocating the bit will be described. here,
As a second bit allocation method not including channel bit allocation, a bit allocation method consisting of two bit allocations will be described. The second bit allocation corresponds to the bit allocation processing in the adaptive bit allocation encoding circuit in FIG.

【００６２】この２つのビット配分をそれぞれビット配
分(1) とビット配分(2) とする。以下のビット配分では
各チャネルで使用できるビットレートは事前にそれぞれ
のチャネルで固定的に決めておく。例えば、６チャネル
の内で音声など重要部分を担う２つのチャネルには１４
７ｋｂｐｓという比較的大きいビットを使い、サブウー
ハチャネルには高々２ｋｂｐｓ、それ以外のチャネルに
は１００ｋｂｐｓを割り当てておく。These two bit allocations are referred to as bit allocation (1) and bit allocation (2), respectively. In the following bit allocation, the bit rate that can be used in each channel is fixedly determined in advance for each channel. For example, of the 6 channels, 2 channels that play an important part such as voice have 14 channels.
A relatively large bit of 7 kbps is used, 2 kbps is assigned to the subwoofer channel at most, and 100 kbps is assigned to the other channels.

【００６３】先ず、ビット配分(1) に使うべきビット量
を確定する。そのためには、信号情報（ａ）のスペクト
ル情報のうちトーナリティ情報及び信号情報（ｂ）の時
間変化情報を使用する。First, the bit amount to be used for the bit allocation (1) is decided. For that purpose, the tonality information and the time change information of the signal information (b) of the spectrum information of the signal information (a) are used.

【００６４】ここで、トーナリティ情報について説明す
ると、指標としては、信号スペクトルの隣接値間の差の
絶対値の和を、信号スペクトル数で割った値を、指標と
して用いている。より簡単にはブロックフローティング
の為のブロックごとのスケールファクタの、隣接スケー
ルファクタ指標の間の差の平均値を用いる。スケールフ
ァクタ指標は、概略スケールファクタの対数値に対応し
ている。本実施例では、ビット配分(1) に使うべきビッ
ト量をこのトーナリティを表す値に対応させて最大８０
ｋｂｐｓ、最小１０ｋｂｐｓと設定している。ここでは
簡単のために、全チャネルそれぞれの割当を等しく１０
０ｋｂｐｓとしている。Tonality information will be described. As an index, a value obtained by dividing the sum of absolute values of differences between adjacent values of the signal spectrum by the number of signal spectra is used as the index. More simply, the average value of the difference between adjacent scale factor indexes of the scale factor for each block for block floating is used. The scale factor index corresponds to the logarithmic value of the rough scale factor. In the present embodiment, the maximum amount of bits to be used for bit allocation (1) is 80, corresponding to the value representing this tonality.
kbps, and the minimum is set to 10 kbps. Here, for simplification, the allocation of all channels is equalized to 10
It is set to 0 kbps.

【００６５】トーナリティ計算は次式のように行う。The tonality calculation is performed by the following equation.

【００６６】Ｔ＝（１／ＷＬｍａｘ）（ΣＡＢＳ（ＳＦｎ−１））T = (1 / WLmax) (ΣABS (SFn−1))

【００６７】なお、ＷＬｍａｘはワードレングス最大値
＝１６、ＳＦｎはスケールファクタ指標で概略ピーク値
の対数に対応している。ｎはブロックフローティングバ
ンド番号である。Note that WLmax is the maximum word length = 16, and SFn is a scale factor index and corresponds to the logarithm of the approximate peak value. n is a block floating band number.

【００６８】このようにして求められたトーナリティ情
報Ｔとビット配分(1)のビット配分量とは、図１０に示
すように対応付けられる。The tonality information T thus obtained is associated with the bit allocation amount of the bit allocation (1) as shown in FIG.

【００６９】これと共に本実施例においては、ビット配
分(1) とそれに付加するその他の少なくとも１つのビッ
ト配分との分割率は、情報信号の時間変化特性に依存す
る。本具体例では、直交変換時間ブロックサイズを更に
分割した時間区間毎に信号情報のピーク値を隣接ブロッ
ク毎に比較することにより情報信号の振幅が急激に大き
くなる時間領域を検出してその大きくなるときの状態の
程度により分割率を決定する。In addition, in this embodiment, the division ratio between the bit allocation (1) and at least one other bit allocation added thereto depends on the time change characteristic of the information signal. In this specific example, the peak value of the signal information is compared for each adjacent block for each time interval obtained by further dividing the orthogonal transform time block size, thereby detecting and increasing the time region in which the amplitude of the information signal rapidly increases. The division rate is determined according to the degree of the state.

【００７０】時間変化率計算は次式のように行う。The calculation of the time change rate is performed by the following equation.

【００７１】Ｖｔ＝ΣＶｍＶａｖ＝（１／Ｖｍａｘ）＊（１／Ｃｈ）ＶｔVt = ΣVm Vav = (1 / Vmax) * (1 / Ch) Vt

【００７２】ここで、Ｖｔは各チャネルの時間サブブロ
ックのピーク値の小から大への変化をｄＢ値で表しもの
のチャネルに関する和、Ｖｍは各チャネルの時間サブブ
ロックのピーク値の小から大への変化をｄＢ値で表しも
ので一番大きいものの大きさ（但し最大値を３０ｄＢに
制限しＶｍａｘであらわす。ｍはチャネル番号、Ｃｈは
チャネル数、Ｖａｖは時間サブブロックのピーク値の小
から大への変化をｄＢ値で表しもののチャネル平均であ
る。Here, Vt represents the change in peak value of the time sub-block of each channel from small to large in dB value, but Vm is the sum about the channel, and Vm is from small to large peak value of the time sub-block of each channel. The change in dB is expressed in dB and is the largest (however, the maximum value is limited to 30 dB and expressed as Vmax. M is the channel number, Ch is the number of channels, and Vav is the peak value of the time sub-block from small to large. It is the channel average of the change in dB in dB.

【００７３】このようにして求められた時間変化率Ｖａ
ｖとビット配分(1) の配分量とは、図１１に示すように
対応付けられる。最終的にビット配分(1) への配分量は
次の式で求められる。The time change rate Va thus obtained
v and the allocation amount of the bit allocation (1) are associated with each other as shown in FIG. Finally, the amount of allocation to bit allocation (1) is calculated by the following formula.

【００７４】Ｂ＝１／２（Ｂｆ＋Ｂｔ）B = 1/2 (Bf + Bt)

【００７５】ここで、Ｂは最終的なビット配分(1) への
配分量、ＢｆはＴｖａより求められたビット配分量、Ｂ
ｔはＶａｖより求められたビット配分量である。Here, B is the amount of allocation to the final bit allocation (1), Bf is the amount of bit allocation obtained from Tva, and B is
t is the bit allocation amount obtained from Vav.

【００７６】ここでのビット配分(1) はスケ−ルファク
タに依存した周波数、時間領域上の配分がなされる。The bit allocation (1) here is allocated in the frequency and time domain depending on the scale factor.

【００７７】このようにしてビット配分(1) に使用され
るビット量が決定されたならば、次にビット配分(1) で
使われなかったビットについての配分すなわちビット配
分(2) を決定する。ここでは多種のビット配分が行われ
る。After the bit amount used for the bit allocation (1) is determined in this way, the allocation for the bits not used in the bit allocation (1), that is, the bit allocation (2) is determined next. . Various bit allocations are performed here.

【００７８】第１に全てのサンプル値に対する均一配分
が行われる。この場合のビット配分に対する量子化雑音
スペクトルの一例を図１２に示す。この場合、全周波数
帯域で均一の雑音レベル低減が行える。First, a uniform distribution is made for all sample values. FIG. 12 shows an example of the quantization noise spectrum with respect to the bit allocation in this case. In this case, the noise level can be reduced uniformly over the entire frequency band.

【００７９】第２に信号情報の周波数スペクトル及びレ
ベルに対する依存性を持たせた聴覚的な効果を得るため
のビット配分が行われる。この場合のビット配分に対す
る量子化雑音スペクトルの一例を図１３に示す。この例
では情報信号のスペクトルに依存させたビット配分を行
っていて、特に情報信号のスペクトルの低域側にウエイ
トをおいたビット配分を行い、広域側に比して起きる低
域側でのマスキング効果の減少を補償している。これは
隣接臨界帯域間でのマスキングを考慮して、スペクトル
の低域側を重視したマスキングカーブの非対象性に基づ
いている。このように、図１３の例では低域を重視した
ビット配分が行われている。Secondly, bit allocation is performed to obtain an auditory effect having dependency on the frequency spectrum and level of signal information. FIG. 13 shows an example of the quantization noise spectrum with respect to the bit allocation in this case. In this example, bit allocation is performed depending on the spectrum of the information signal, and in particular, bit allocation with weighting is performed on the low frequency side of the spectrum of the information signal, and masking on the low frequency side that occurs compared to the wide area side is performed. It compensates for the decrease in effect. This is based on the asymmetry of the masking curve that attaches importance to the low frequency side of the spectrum, considering masking between adjacent critical bands. As described above, in the example of FIG. 13, bit allocation is performed with emphasis on the low frequency band.

【００８０】そして最終的にビット配分(1) とビット配
分(1) に付加されるビット配分の値の和が図７のビット
配分回路３８でとられる。最終的なビット配分は以上の
各ビット配分の和として与えられる。Finally, the sum of the bit allocation (1) and the value of the bit allocation added to the bit allocation (1) is taken by the bit allocation circuit 38 of FIG. The final bit allocation is given as the sum of the above bit allocations.

【００８１】なお、図１２，図１３の図中Ｓは信号スペ
クトルを、ＮＬ１は上記全てのサンプルに対する均一配
分による雑音レベルを、ＮＬ２は上記周波数スペクトル
及びレベルに対する依存正を持たせた聴覚的な高かを得
るためのビット配分による雑音レベルを示している。12 and 13, S is the signal spectrum, NL1 is the noise level due to uniform distribution to all the samples, and NL2 is an auditory sound having a positive dependence on the frequency spectrum and level. It shows the noise level due to the bit allocation to get high.

【００８２】次にチャネルビット配分を含まないビット
配分の別の手法を次に説明する。この場合の適応ビット
配分回路の動作を図１４で説明するとＭＤＣＴ係数の大
きさが各ブロックごとに求められ、そのＭＤＣＴ係数が
入力端子８０１に供給される。当該入力端子８０１に供
給されたＭＤＣＴ係数は、帯域毎のエネルギ算出回路８
０３に与えられる。帯域毎のエネルギ算出回路８０３で
は、クリティカルバンドまたは高域においてはクリティ
カルバンドを更に再分割したそれぞれの帯域に関する信
号エネルギを算出する。帯域毎のエネルギ算出回路８０
３で算出されたそれぞれの帯域に関するエネルギは、エ
ネルギ依存ビット配分回路８０４に供給される。Next, another method of bit allocation not including channel bit allocation will be described below. The operation of the adaptive bit allocation circuit in this case will be described with reference to FIG. 14, the magnitude of the MDCT coefficient is obtained for each block, and the MDCT coefficient is supplied to the input terminal 801. The MDCT coefficient supplied to the input terminal 801 is the energy calculation circuit 8 for each band.
Given to 03. The energy calculation circuit 803 for each band calculates the signal energy for each band obtained by further dividing the critical band or the critical band in the high band. Energy calculation circuit 80 for each band
The energy for each band calculated in 3 is supplied to the energy-dependent bit allocation circuit 804.

【００８３】エネルギ依存ビット配分回路８０４では、
使用可能総ビット発生回路８０２からの使用可能総ビッ
ト、本実施例では１２８Ｋｂｐｓの内のある割合（本実
施例では１００Ｋｂｐｓ）を用いて白色の量子化雑音を
作り出すようなビット配分を行う。このとき、入力信号
のトーナリティが高いほど、すなわち入力信号のスペク
トルの凸凹が大きいほど、このビット量が上記１２８Ｋ
ｂｐｓに占める割合が増加する。なお、入力信号のスペ
クトルの凸凹を検出するには、隣接するブロックのブロ
ックフローティング係数の差の絶対値の和を指標として
使う。そして、求められた使用可能なビット量につき、
各帯域のエネルギの対数値に比例したビット配分を行
う。In the energy-dependent bit allocation circuit 804,
The available total bit from the available total bit generation circuit 802, which is a ratio of 128 Kbps in this embodiment (100 Kbps in this embodiment), is used to perform bit allocation so as to generate white quantization noise. At this time, the higher the tonality of the input signal, that is, the larger the unevenness of the spectrum of the input signal, the more the bit amount becomes 128K.
The ratio to bps increases. In addition, in order to detect the unevenness of the spectrum of the input signal, the sum of the absolute values of the differences of the block floating coefficients of the adjacent blocks is used as an index. Then, regarding the available bit amount obtained,
Bit allocation is performed in proportion to the logarithmic value of the energy of each band.

【００８４】聴覚許容雑音レベルに依存したビット配分
算出回路８０５は、まず上記クリティカルバンド毎に分
割されたスペクトルデータに基づき、いわゆるマスキン
グ効果等を考慮した各クリティカルバンド毎の許容ノイ
ズ量を求め、次に聴覚許容雑音スペクトルを与えるよう
に使用可能総ビットからエネルギ依存ビットを引いたビ
ット分が配分される。このようにして求められたエネル
ギ依存ビットと聴覚許容雑音レベルに依存したビットは
加算されて、図５（図４の場合も同様）の適応ビット配
分符号化回路１６、１７、１８により各クリティカルバ
ンド毎もしくは高域においてはクリティカルバンドを更
に複数帯域に分割した帯域に割り当てられたビット数に
応じて各スペクトルデータ（あるいはＭＤＣＴ係数デー
タ）を再量子化するようにしている。このようにして符
号化されたデータは、図５の出力端子２２、２４、２６
を介して取り出される。The bit allocation calculation circuit 805 which depends on the permissible noise level for hearing first finds the permissible noise amount for each critical band in consideration of the so-called masking effect, etc., based on the spectral data divided for each critical band. Bits obtained by subtracting energy-dependent bits from the total available bits are distributed so as to give a perceptible noise spectrum to the. The energy-dependent bits thus obtained and the bits depending on the permissible noise level of the hearing are added, and the adaptive bit allocation coding circuits 16, 17, and 18 shown in FIG. In each or in the high frequency band, each spectrum data (or MDCT coefficient data) is requantized according to the number of bits assigned to the band obtained by further dividing the critical band into a plurality of bands. The data encoded in this way is output to the output terminals 22, 24, 26 of FIG.
Taken out through.

【００８５】さらに詳しく上記聴覚許容雑音スペクトル
依存のビット配分回路８０５中の聴覚許容雑音スペクト
ル算出回路について説明すると、ＭＤＣＴ回路１３、１
４、１５で得られたＭＤＣＴ係数が上記許容雑音算出回
路に与えられる。The auditory permissible noise spectrum calculation circuit in the bit allocation circuit 805 depending on the permissible auditory noise spectrum will be described in more detail. The MDCT circuits 13 and 1 will be described below.
The MDCT coefficients obtained in 4 and 15 are given to the allowable noise calculating circuit.

【００８６】図１５は上記許容雑音算出回路をまとめて
説明した一具体例の概略構成を示すブロック回路図であ
る。この図１５において、入力端子５２１には、ＭＤＣ
Ｔ回路１３、１４、１５からの周波数領域のスペクトル
データが供給されている。FIG. 15 is a block circuit diagram showing a schematic configuration of a specific example in which the allowable noise calculating circuits are collectively described. In FIG. 15, the input terminal 521 has an MDC
The spectrum data in the frequency domain is supplied from the T circuits 13, 14, and 15.

【００８７】この周波数領域の入力データは、帯域毎の
エネルギ算出回路５２２に送られて、上記クリティカル
バンド（臨界帯域）毎のエネルギが、例えば当該バンド
内での各振幅値２乗の総和を計算すること等により求め
られる。この各バンド毎のエネルギの代わりに、振幅値
のピーク値、平均値等が用いられることもある。このエ
ネルギ算出回路５２２からの出力として、例えば各バン
ドの総和値のスペクトルは、一般にバークスペクトルと
称されている。図１６はこのような各クリティカルバン
ド毎のバークスペクトルＳＢを示している。ただし、こ
の図１６では、図示を簡略化するため、上記クリティカ
ルバンドのバンド数を１２バンド（Ｂ1〜Ｂ12）で表現
している。The input data in the frequency domain is sent to the energy calculation circuit 522 for each band, and the energy of each critical band (critical band) is calculated, for example, as the sum of squared amplitude values in the band. It is required by doing. Instead of the energy for each band, a peak value, an average value, etc. of the amplitude value may be used. As an output from the energy calculation circuit 522, for example, the spectrum of the total sum value of each band is generally called a Bark spectrum. FIG. 16 shows the Bark spectrum SB for each such critical band. However, in FIG. 16, in order to simplify the illustration, the number of bands of the critical band is represented by 12 bands (B1 to B12).

【００８８】ここで、上記バークスペクトルＳＢのいわ
ゆるマスキングに於ける影響を考慮するために、該バー
クスペクトルＳＢに所定の重み付け関数を掛けて加算す
るような畳込み（コンボリューション）処理を施す。こ
のため、上記帯域毎のエネルギ算出回路５２２の出力す
なわち該バークスペクトルＳＢの各値は、畳込みフィル
タ回路５２３に送られる。該畳込みフィルタ回路５２３
は、例えば、入力データを順次遅延させる複数の遅延素
子と、これら遅延素子からの出力にフィルタ係数（重み
付け関数）を乗算する複数の乗算器（例えば各バンドに
対応する２５個の乗算器）と、各乗算器出力の総和をと
る総和加算器とから構成されるものである。なお、上記
マスキングとは、人間の聴覚上の特性により、ある信号
によって他の信号がマスクされて聞こえなくなる現象を
いうものであり、このマスキング効果には、時間領域の
オーディオ信号による時間軸マスキング効果と、周波数
領域の信号による同時刻マスキング効果とがある。これ
らのマスキング効果により、マスキングされる部分にノ
イズがあったとしても、このノイズは聞こえないことに
なる。このため、実際のオーディオ信号では、このマス
キングされる範囲内のノイズは許容可能なノイズとされ
る。Here, in order to consider the influence of the so-called masking of the Bark spectrum SB, a convolution process is performed such that the Bark spectrum SB is multiplied by a predetermined weighting function and added. Therefore, the output of the energy calculation circuit 522 for each band, that is, each value of the Bark spectrum SB is sent to the convolution filter circuit 523. The convolution filter circuit 523
Is, for example, a plurality of delay elements that sequentially delay input data, and a plurality of multipliers (for example, 25 multipliers corresponding to each band) that multiply outputs from these delay elements by a filter coefficient (weighting function). , A sum total adder that sums the outputs of the respective multipliers. Note that the masking is a phenomenon in which one signal is masked by another signal and becomes inaudible due to human auditory characteristics.The masking effect includes a time-axis masking effect by an audio signal in the time domain. And there is the same time masking effect by the signal in the frequency domain. Due to these masking effects, even if there is noise in the masked portion, this noise cannot be heard. Therefore, in the actual audio signal, the noise within the masked range is regarded as an acceptable noise.

【００８９】ここで、上記畳込みフィルタ回路５２３の
各乗算器の乗算係数（フィルタ係数）の一具体例を示す
と、任意のバンドに対応する乗算器Ｍの係数を１とする
とき、乗算器Ｍ−１で係数０．１５を、乗算器Ｍ−２で
係数０．００１９を、乗算器Ｍ−３で係数０．００００
０８６を、乗算器Ｍ＋１で係数０．４を、乗算器Ｍ＋２
で係数０．０６を、乗算器Ｍ＋３で係数０．００７を各
遅延素子の出力に乗算することにより、上記バークスペ
クトルＳＢの畳込み処理が行われる。ただし、Ｍは１〜
２５の任意の整数である。Here, a specific example of the multiplication coefficient (filter coefficient) of each multiplier of the convolution filter circuit 523 will be described. When the coefficient of the multiplier M corresponding to an arbitrary band is 1, the multiplier M-1 gives a coefficient of 0.15, multiplier M-2 gives a coefficient of 0.0019, and multiplier M-3 gives a coefficient of 0.0000.
086, multiplier M + 1 gives a coefficient of 0.4, multiplier M + 2
By multiplying the output of each delay element by a coefficient of 0.06 and a coefficient of 0.007 by a multiplier M + 3, the convolution processing of the Bark spectrum SB is performed. However, M is 1 to
It is an arbitrary integer of 25.

【００９０】次に、上記畳込みフィルタ回路５２３の出
力は引算器５２４に送られる。該引算器５２４は、上記
畳込んだ領域での後述する許容可能なノイズレベルに対
応するレベルαを求めるものである。なお、当該許容可
能なノイズレベル（許容ノイズレベル）に対応するレベ
ルαは、後述するように、逆コンボリューション処理を
行うことによって、クリティカルバンドの各バンド毎の
許容ノイズレベルとなるようなレベルである。ここで、
上記引算器５２４には、上記レベルαを求めるるための
許容関数（マスキングレベルを表現する関数）が供給さ
れる。この許容関数を増減させることで上記レベルαの
制御を行っている。当該許容関数は、次に説明するよう
な（ｎ−ａｉ）関数発生回路５２５から供給されている
ものである。Next, the output of the convolution filter circuit 523 is sent to the subtractor 524. The subtractor 524 calculates a level α corresponding to an allowable noise level described later in the convoluted area. The level α corresponding to the permissible noise level (permissible noise level) is a level at which the critical noise band becomes the permissible noise level for each band by performing inverse convolution processing, as described later. is there. here,
The subtractor 524 is supplied with an allowance function (function expressing a masking level) for obtaining the level α. The level α is controlled by increasing or decreasing this allowance function. The permissible function is supplied from the (n-ai) function generating circuit 525 as described below.

【００９１】すなわち、許容ノイズレベルに対応するレ
ベルαは、クリティカルバンドのバンドの低域から順に
与えられる番号をｉとすると、次の式で求めることがで
きる。 α＝Ｓ−（ｎ−ａｉ）この式において、ｎ，ａは定数でａ＞０、Ｓは畳込み処
理されたバークスペクトルの強度であり、式中(n-ai)が
許容関数となる。例としてｎ＝３８，ａ＝−0.5を用い
ることができる。That is, the level α corresponding to the allowable noise level can be obtained by the following equation, where i is the number given in order from the low band of the critical band. α = S- (n-ai) In this equation, n and a are constants, a> 0, and S is the intensity of the convolution-processed Bark spectrum. In the equation, (n-ai) is the tolerance function. As an example, n = 38 and a = -0.5 can be used.

【００９２】このようにして、上記レベルαが求めら
れ、このデータは、割算器５２６に伝送される。当該割
算器５２６では、上記畳込みされた領域での上記レベル
αを逆コンボリューションするためのものである。した
がって、この逆コンボリューション処理を行うことによ
り、上記レベルαからマスキングスレッショールドが得
られるようになる。すなわち、このマスキングスレッシ
ョールドが許容ノイズスペクトルとなる。なお、上記逆
コンボリューション処理は、複雑な演算を必要とする
が、本実施例では簡略化した割算器５２６を用いて逆コ
ンボリューションを行っている。In this way, the level α is obtained, and this data is transmitted to the divider 526. The divider 526 is for deconvolution of the level α in the convolved area. Therefore, by performing the inverse convolution processing, the masking threshold can be obtained from the level α. That is, this masking threshold becomes the allowable noise spectrum. Although the above-mentioned inverse convolution processing requires complicated calculation, in this embodiment, the inverse convolution is performed using the simplified divider 526.

【００９３】次に、上記マスキングスレッショールド
は、合成回路５２７を介して減算器５２８に伝送され
る。ここで、当該減算器５２８には、上記帯域毎のエネ
ルギ検出回路５２２からの出力、すなわち前述したバー
クスペクトルＳＢが、遅延回路５２９を介して供給され
ている。したがって、この減算器５２８で上記マスキン
グスレッショールドとバークスペクトルＳＢとの減算演
算が行われることで、図１７に示すように、上記バーク
スペクトルＳＢは、該マスキングスレッショールドＭＳ
のレベルで示すレベル以下がマスキングされることにな
る。なお、遅延回路５２９は上記合成回路５２７以前の
各回路での遅延量を考慮してエネルギ検出回路５２２か
らのバークスペクトルＳＢを遅延させるために設けられ
ている。Next, the masking threshold is transmitted to the subtractor 528 via the synthesizing circuit 527. Here, the output from the energy detection circuit 522 for each band, that is, the above-described Bark spectrum SB is supplied to the subtractor 528 via the delay circuit 529. Therefore, the subtractor 528 performs a subtraction operation on the masking threshold and the Bark spectrum SB, so that the Bark spectrum SB shows the masking threshold MS as shown in FIG.
The level below the level indicated by will be masked. The delay circuit 529 is provided in order to delay the Bark spectrum SB from the energy detection circuit 522 in consideration of the delay amount in each circuit before the synthesis circuit 527.

【００９４】当該減算器５２８からの出力は、許容雑音
補正回路５３０を介し、出力端子５３１を介して取り出
され、例えば配分ビット数情報が予め記憶されたＲＯＭ
等（図示せず）に送られる。このＲＯＭ等は、上記減算
回路５２８から許容雑音補正回路５３０を介して得られ
た出力（上記各バンドのエネルギと上記ノイズレベル設
定手段の出力との差分のレベル）に応じ、各バンド毎の
配分ビット数情報を出力する。The output from the subtracter 528 is taken out through the output terminal 531 through the allowable noise correction circuit 530, and, for example, the ROM in which the distribution bit number information is stored in advance.
Etc. (not shown). The ROM or the like is distributed for each band according to the output (the level of the difference between the energy of each band and the output of the noise level setting means) obtained from the subtraction circuit 528 through the allowable noise correction circuit 530. Outputs bit number information.

【００９５】このようにしてエネルギ依存ビットと聴覚
許容雑音レベルに依存したビットは加算されてその配分
ビット数情報が図５の端子２８を介して上記適応ビット
配分符号化回路１６、１７、１８に送られることで、こ
こでＭＤＣＴ回路１３、１４、１５からの周波数領域の
各スペクトルデータがそれぞれのバンド毎に割り当てら
れたビット数で量子化されるわけである。In this way, the energy-dependent bit and the bit depending on the permissible auditory noise level are added, and the distribution bit number information is supplied to the adaptive bit distribution coding circuits 16, 17, and 18 through the terminal 28 of FIG. By being transmitted, each spectrum data in the frequency domain from the MDCT circuits 13, 14, 15 is quantized by the number of bits assigned to each band.

【００９６】すなわち要約すれば、適応ビット配分符号
化回路１６、１７、１８では、上記クリティカルバンド
の各バンド帯域（クリティカルバンド）毎もしくは高域
においてはクリティカルバンドを更に複数帯域に分割し
た帯域のエネルギもしくはピーク値と上記ノイズレベル
設定手段の出力との差分のレベルに応じて配分されたビ
ット数で上記各バンド毎のスペクトルデータを量子化す
ることになる。In summary, in the adaptive bit allocation coding circuits 16, 17, and 18, the energy of each band band (critical band) of the above critical band or the band energy obtained by further dividing the critical band into a plurality of bands in the high band. Alternatively, the spectrum data for each band is quantized by the number of bits distributed according to the level of the difference between the peak value and the output of the noise level setting means.

【００９７】ところで、上述した合成回路５２７での合
成の際には、最小可聴カーブ発生回路５３２から供給さ
れる図１７に示すような人間の聴覚特性であるいわゆる
最小可聴カーブＲＣを示すデータと、上記マスキングス
レッショールドＭＳとを合成することができる。この最
小可聴カーブにおいて、雑音絶対レベルがこの最小可聴
カーブ以下ならば該雑音は聞こえないことになる。この
最小可聴カーブは、コーディングが同じであっても例え
ば再生時の再生ボリュームの違いで異なるものとなが、
現実的なディジタルシステムでは、例えば１６ビットダ
イナミックレンジへの音楽のはいり方にはさほど違いが
ないので、例えば４ｋＨｚ付近の最も耳に聞こえやすい
周波数帯域の量子化雑音が聞こえないとすれば、他の周
波数帯域ではこの最小可聴カーブのレベル以下の量子化
雑音は聞こえないと考えられる。したがって、このよう
に例えばシステムの持つダイナミックレンジの４ｋＨｚ
付近の雑音が聞こえない使い方をすると仮定し、この最
小可聴カーブＲＣとマスキングスレッショールドＭＳと
を共に合成することで許容ノイズレベルを得るようにす
ると、この場合の許容ノイズレベルは、図１７中の斜線
で示す部分までとすることができるようになる。なお、
本実施例では、上記最小可聴カーブの４ｋＨｚのレベル
を、例えば２０ビット相当の最低レベルに合わせてい
る。また、この図１７は、信号スペクトルＳＳも同時に
示している。By the way, at the time of synthesizing by the above-mentioned synthesizing circuit 527, data showing a so-called minimum audible curve RC which is the human auditory characteristic as shown in FIG. 17 supplied from the minimum audible curve generating circuit 532. The masking threshold MS can be combined. In this minimum audible curve, if the absolute noise level is below this minimum audible curve, the noise will not be heard. Even if the coding is the same, this minimum audible curve is different due to the difference in the playback volume during playback,
In a realistic digital system, there is not much difference in how music is put into a 16-bit dynamic range, so if the quantization noise in the most audible frequency band around 4 kHz is inaudible, for example, other It is considered that quantization noise below the level of this minimum audible curve is inaudible in the frequency band. Therefore, for example, the dynamic range of the system is 4 kHz.
Assuming that the noise is not heard in the vicinity, and the allowable noise level is obtained by synthesizing the minimum audible curve RC and the masking threshold MS together, the allowable noise level in this case is shown in FIG. It will be possible to go up to the part shown by the diagonal line of. In addition,
In this embodiment, the level of 4 kHz of the minimum audible curve is set to the minimum level equivalent to 20 bits, for example. Further, FIG. 17 also shows the signal spectrum SS at the same time.

【００９８】また、上記許容雑音補正回路５３０では、
補正情報出力回路５３３から送られてくる例えば等ラウ
ドネスカーブの情報に基づいて、上記減算器５２８から
の出力における許容雑音レベルを補正している。ここ
で、等ラウドネスカーブとは、人間の聴覚特性に関する
特性曲線であり、例えば１ｋＨｚの純音と同じ大きさに
聞こえる各周波数での音の音圧を求めて曲線で結んだも
ので、ラウドネスの等感度曲線とも呼ばれる。またこの
等ラウドネス曲線は、図１７に示した最小可聴カーブＲ
Ｃと略同じ曲線を描くものである。この等ラウドネス曲
線においては、例えば４ｋＨｚ付近では１ｋＨｚのとこ
ろより音圧が８〜１０ｄＢ下がっても１ｋHzと同じ大き
さに聞こえ、逆に、５０Ｈｚ付近では１ｋＨｚでの音圧
よりも約１５ｄＢ高くないと同じ大きさに聞こえない。
このため、上記最小可聴カーブのレベルを越えた雑音
（許容ノイズレベル）は、該等ラウドネス曲線に応じた
カーブで与えられる周波数特性を持つようにするのが良
いことがわかる。このようなことから、上記等ラウドネ
ス曲線を考慮して上記許容ノイズレベルを補正すること
は、人間の聴覚特性に適合していることがわかる。In the allowable noise correction circuit 530,
The allowable noise level in the output from the subtractor 528 is corrected based on the information of the equal loudness curve sent from the correction information output circuit 533, for example. Here, the equal loudness curve is a characteristic curve relating to human auditory characteristics, for example, a curve obtained by obtaining the sound pressure of sound at each frequency that sounds the same as a pure tone of 1 kHz, and connecting the curves. Also called sensitivity curve. Further, this equal loudness curve is the minimum audible curve R shown in FIG.
It draws a curve substantially the same as C. In this equal loudness curve, for example, in the vicinity of 4 kHz, even if the sound pressure is reduced by 8 to 10 dB from 1 kHz, it sounds as loud as 1 kHz. It doesn't sound the same.
Therefore, it is understood that the noise exceeding the level of the minimum audible curve (allowable noise level) should have the frequency characteristic given by the curve corresponding to the equal loudness curve. From this, it can be seen that correcting the permissible noise level in consideration of the equal loudness curve is suitable for human hearing characteristics.

【００９９】以上述べた聴覚許容雑音レベルに依存した
スペクトル形状を使用可能総ビット１２８Ｋｂｐｓの内
のある割合を用いるビット配分でつくる。この割合は入
力信号のトーナリティが高くなるほど減少する。The spectrum shape depending on the permissible noise level of hearing described above is created by bit allocation using a certain ratio of the total usable bits of 128 Kbps. This ratio decreases as the tonality of the input signal increases.

【０１００】次に２つのビット配分手法の間でのビット
量分割手法について説明する。図１４に戻って、ＭＤＣ
Ｔ回路出力が供給される入力端子８０１からの信号は、
スペクトルの滑らかさ算出回路８０８にも与えられ、こ
こでスペクトルの滑らかさが算出される。本実施例で
は、信号スペクトルの絶対値の隣接値間の差の絶対値の
和を、信号スペクトルの絶対値の和で割った値を、上記
スペクトルの滑らかさとして算出している。Next, a bit amount dividing method between the two bit allocation methods will be described. Returning to FIG. 14, the MDC
The signal from the input terminal 801 to which the T circuit output is supplied is
The spectrum smoothness calculation circuit 808 is also applied to calculate the spectrum smoothness. In this embodiment, a value obtained by dividing the sum of the absolute values of the differences between the adjacent values of the absolute value of the signal spectrum by the sum of the absolute values of the signal spectrum is calculated as the smoothness of the spectrum.

【０１０１】上記スペクトルの滑らかさ算出回路８０８
の出力は、ビット分割率決定回路８０９に与えられ、こ
こでエネルギ依存のビット配分と、聴覚許容雑音スペク
トルによるビット配分間のビット分割率とが決定され
る。ビット分割率はスペクトルの滑らかさ算出回路８０
８の出力値が大きいほど、スペクトルの滑らかさが無い
と考えて、エネルギ依存のビット配分よりも、聴覚許容
雑音スペクトルによるビット配分に重点をおいたビット
配分を行う。ビット分割率決定回路８０９は、それぞれ
エネルギ依存のビット配分及び聴覚許容雑音スペクトル
によるビット配分の大きさをコントロールするマルチプ
ライヤ８１１及び８１２に対してコントロール出力を送
る。ここで、仮にスペクトルが滑らかであり、エネルギ
依存のビット配分に重きをおくように、マルチプライヤ
８１１へのビット分割率決定回路８０９の出力が０．８
の値を取ったとき、マルチプライヤ８１２へのビット分
割率決定回路８０９の出力は１−０．８＝０．２とす
る。これら２つのマルチプライヤの出力はアダー８０６
で足し合わされて最終的なビット配分情報となって、出
力端子８０７から出力される。The above-described spectrum smoothness calculation circuit 808
Is supplied to the bit division rate determination circuit 809, and the bit division rate between the energy-dependent bit allocation and the bit allocation according to the perceptual noise spectrum is determined. The bit division rate is the smoothness calculation circuit 80 of the spectrum.
It is considered that the larger the output value of 8, the smoother the spectrum is, and the bit allocation is performed with more emphasis on the bit allocation by the perceptual noise spectrum than the energy-dependent bit allocation. The bit division ratio determination circuit 809 sends control outputs to multipliers 811 and 812 which control the amount of bit distribution depending on the energy and the permissible noise spectrum of the auditory sense, respectively. Here, if the spectrum is smooth and the output of the bit division rate determination circuit 809 to the multiplier 811 is 0.8 so as to emphasize the energy-dependent bit allocation.
, The output of the bit division rate determination circuit 809 to the multiplier 812 is 1-0.8 = 0.2. The output of these two multipliers is the adder 806.
Are added together to form final bit allocation information, which is output from the output terminal 807.

【０１０２】このときのビット配分の様子を図１８、図
１９に示す。また、これに対応する量子化雑音の様子を
図２０、図２１に示す。図１８は信号のスペクトルが割
合平坦である場合を示しており、図１９は信号スペクト
ルが高いトーナリティを示す場合を示している。また、
図１８及び図１９の図中ＱＳは信号レベル依存分のビッ
ト量を示し、図中ＱＮは聴覚許容雑音レベル依存のビッ
ト割当分のビット量を示している。図２０及び図２１の
図中Ｌは信号レベルを示し、図中ＮＳは信号レベル依存
分による雑音低下分を、図中ＮＮは聴覚許容雑音レベル
依存のビット割当分による雑音低下分を示している。The state of bit allocation at this time is shown in FIGS. 20 and 21 show the states of quantization noise corresponding to this. FIG. 18 shows a case where the spectrum of the signal is relatively flat, and FIG. 19 shows a case where the signal spectrum shows a high tonality. Also,
In FIG. 18 and FIG. 19, QS indicates the bit amount corresponding to the signal level, and QN in the diagrams indicates the bit amount corresponding to the bit allocation depending on the permissible noise level. 20 and 21, L represents the signal level, NS represents the noise reduction due to the signal level dependency, and NN represents the noise reduction due to the bit allocation that depends on the permissible hearing noise level. .

【０１０３】先ず、信号のスペクトルが、割合平坦であ
る場合を示す図１８において、聴覚許容雑音レベルに依
存したビット配分は、全帯域に渡り大きい信号雑音比を
取るために役立つ。しかし低域及び高域では比較的少な
いビット配分が使用されている。これは聴覚的にこの帯
域の雑音に対する感度が小さいためである。信号エネル
ギレベルに依存したビット配分の分は量としては少ない
が、ホワイトな雑音スペクトルを生じるように、この場
合には中低域の信号レベルの高い周波数領域に重点的に
配分されている。First, in FIG. 18 showing the case where the spectrum of the signal is flat, the bit allocation depending on the permissible noise level of the hearing aids in obtaining a large signal-to-noise ratio over the entire band. However, relatively low bit allocations are used in the low and high frequencies. This is because auditory sensitivity to noise in this band is low. The bit allocation depending on the signal energy level is small in amount, but in this case, it is concentrated in the high frequency region of the signal level in the middle and low frequencies so as to generate a white noise spectrum.

【０１０４】これに対して、図１９に示すように、信号
スペクトルが高いトーナリティを示す場合には、信号エ
ネルギレベルに依存したビット配分量が多くなり、量子
化雑音の低下は極めて狭い帯域の雑音を低減するために
使用される。聴覚許容雑音レベルに依存したビット配分
分の集中はこれよりもきつくない。On the other hand, as shown in FIG. 19, when the signal spectrum exhibits a high tonality, the bit allocation amount depending on the signal energy level increases, and the quantization noise is reduced in a very narrow band. Used to reduce The concentration of the bit allocation depending on the permissible noise level of the auditory sense is less intense.

【０１０５】図１４に示すように、この両者のビット配
分の和により、孤立スペクトル入力信号での特性の向上
が達成される。As shown in FIG. 14, the improvement of the characteristics in the isolated spectrum input signal is achieved by the sum of the bit allocations of the both.

【０１０６】以上の様にして得られたチャネルビット配
分を含むビット配分とチャネルビット配分を含まないビ
ット配分の２つを用いて、次のようにして第１と第２の
量子化を行う。Using the two bit allocations including the channel bit allocations and the bit allocations not including the channel bit allocations obtained as described above, the first and second quantization are performed as follows.

【０１０７】図２２を用いて説明する。この例では、全
６チャネルのうちでチャネルビット配分を含むビット配
分により１４７ｋｂｐｓを越えるビット配分がなされる
チャネルはセンタチャネルとサブウーファチャネルとラ
イトチャネルである。Description will be made with reference to FIG. In this example, among all 6 channels, the channels to which the bit allocation including the channel bit allocation exceeds 147 kbps are the center channel, the subwoofer channel, and the write channel.

【０１０８】まず、チャネルビット配分を含むビット配
分量が１４７ｋｂｐｓを越えるチャネルについて、ある
一定のビット量例えば１２８ｋｂｐｓを最大とする部分
と１２８ｋｂｐｓを越える部分に２分する。First, for a channel in which the bit allocation amount including the channel bit allocation exceeds 147 kbps, it is divided into a certain fixed bit amount, for example, a portion where the maximum is 128 kbps and a portion where it exceeds 128 kbps.

【０１０９】この処理を行う構成を図２３に示す。図２
３の構成では、チャネルビット配分を含むビット配分で
の配分量が１４７ｋｂｐｓを越えるビット配分の各サン
プルについて、複数サンプルごとのブロックについての
正規化処理すなわちブロックフローティングを行う。こ
の時どの程度のブロックフローティングが行われたかを
示す係数としてスケールファクタが得られる。FIG. 23 shows a configuration for performing this processing. Figure 2
In the configuration of No. 3, the normalization process, that is, the block floating, is performed on the block for each of a plurality of samples for each sample of the bit allocation in which the allocation amount in the bit allocation including the channel bit allocation exceeds 147 kbps. At this time, a scale factor is obtained as a coefficient indicating how much block floating has been performed.

【０１１０】この図２３において、入力端子９００に供
給されたＭＤＣＴ係数（ＭＤＣＴサンプル）は正規化回
路９０５によって複数サンプル毎に、ブロックについて
の正規化処理すなわちブロックフローティングが施され
る。この時どの程度のブロックフローティングが行われ
たかを示す係数としてスケールファクタが得られる。In FIG. 23, the MDCT coefficient (MDCT sample) supplied to the input terminal 900 is subjected to normalization processing for each block, that is, block floating, by the normalization circuit 905. At this time, a scale factor is obtained as a coefficient indicating how much block floating has been performed.

【０１１１】次段の第１の量子化器(quantizer) ９０１
は、前記チャネルビット配分を含まないビット配分の各
サンプル語長で量子化を行なう。この時、量子化雑音を
少なくするためには四捨五入による量子化が行われる。Next-stage first quantizer 901
Performs quantization with each sample word length of bit allocation not including the channel bit allocation. At this time, quantization by rounding is performed in order to reduce the quantization noise.

【０１１２】次に、上記正規化回路９０５の出力と上記
量子化器９０１の出力が差分器９０２に送られる。すな
わち、当該差分器９０２では、量子化器９０１の入力と
出力の差（量子化誤差）が取られる。この差分器９０２
からの出力は、さらに正規化回路９０６を介して第２の
量子化器９０３に送られる。Next, the output of the normalization circuit 905 and the output of the quantizer 901 are sent to the difference unit 902. That is, the difference unit 902 takes the difference (quantization error) between the input and output of the quantizer 901. This differencer 902
Is further sent to the second quantizer 903 via the normalization circuit 906.

【０１１３】当該第２の量子化器９０３では、前記チャ
ネルビット配分を含むビット配分の各サンプル語長と前
記チャネルビット配分を含まないビット配分の各サンプ
ル語長の差の語長が各サンプル毎に使用される。この時
のフローティング係数は第１の量子化器９０１で用いら
れたフローティング係数と語長から自動的に決定され
る。すなわち第１の量子化器９０１で用いられた語長が
Ｎビットであったときには、（２＊＊Ｎ）で第２の量子
化器９０３で用いられるフローティング係数が得られ
る。In the second quantizer 903, the word length of the difference between each sample word length of the bit allocation including the channel bit allocation and each sample word length of the bit allocation not including the channel bit allocation is determined for each sample. Used for. The floating coefficient at this time is automatically determined from the floating coefficient used in the first quantizer 901 and the word length. That is, when the word length used in the first quantizer 901 is N bits, the floating coefficient used in the second quantizer 903 is obtained at (2 ** N).

【０１１４】また、上記第２の量子化器９０３では、第
１の量子化器９０１と同じように四捨五入処理を含むビ
ット配分を行う。このようにして２つの量子化により、
前記チャネルビット半分を含むビット配分で１４７ｋｂ
ｐｓを越えるビット配分を受けたチャネルのビットは、
１２８ｋｂｐｓ以下になるべく１２８ｋｂｐｓに近いビ
ット配分と残りのビット配分とに分けられる。Further, in the second quantizer 903, the bit allocation including the rounding processing is performed similarly to the first quantizer 901. In this way, by two quantization,
147 kb in bit allocation including half of the channel bits
The bit of the channel which received the bit allocation exceeding ps is
It is divided into a bit allocation as close to 128 kbps as possible and a remaining bit allocation as low as 128 kbps or less.

【０１１５】ここで、１２８ｋｂｐｓと１４７ｋｂｐｓ
という２つのスレッショールドを設けているのは、以下
のような理由による。すなわち、前記残りのビット配分
データも語長を表すサブ情報が必要であるので、このサ
ブ情報量も含めてデータ領域がとれるようなビット配分
がされる最下限量として１４７ｋｂｐｓが設定されてい
る。また、前記チャネルビット配分を含むビット配分量
が１２８ｋｂｐｓを上回り１４７ｋｂｐｓを下回る場合
には、１２８ｋｂｐｓを越えたデータ部分にはサブ情報
しか書き込めないのでサンプル情報を書き込む余地がな
く意味がなくなってしまう。このため、このような場合
にはこのチャネルは前記チャネルビット配分を含まない
ビット配分で１２８ｋｂｐｓよりも小さく、できるだけ
１２８ｋｂｐｓ近いビット配分を行うために、上記１２
８ｋｂｐｓが設定されている。Here, 128 kbps and 147 kbps
The reason why two thresholds are provided is as follows. That is, since the remaining bit allocation data also needs sub information indicating the word length, 147 kbps is set as the lower limit amount for bit allocation so that the data area can be obtained including the sub information amount. Also, when the bit allocation amount including the channel bit allocation exceeds 128 kbps and falls below 147 kbps, only sub information can be written in the data portion exceeding 128 kbps, so that there is no room to write sample information, which is meaningless. Therefore, in such a case, in order to perform bit allocation which is smaller than 128 kbps and which is as close as possible to 128 kbps, the channel allocation does not include the channel bit allocation.
8 kbps is set.

【０１１６】また、前記チャネルビット配分を含むビッ
ト配分で１２８ｋｂｐｓよりも小さいビット配分となっ
たチャネルは、そのままそのビット配分を使用する。Further, a channel whose bit allocation including the channel bit allocation is smaller than 128 kbps uses the bit allocation as it is.

【０１１７】前に述べたように、前記残りのビット配分
の成分の大きさは図２３で示されるようにビット配分
(1) のスケールファクタとワードレングスからスケール
ファクタを算出できるのでワードレングスのみがデコ−
ダに必要とされる。As described above, the magnitudes of the components of the remaining bit allocation are as shown in FIG.
Since the scale factor can be calculated from the scale factor and the word length in (1), only the word length is
Needed by da.

【０１１８】このようにして量子化器９０１及び９０３
では、それぞれ四捨五入された効率の高い量子化出力が
得られる。In this way, the quantizers 901 and 903 are
In the above, the rounded and highly efficient quantized output is obtained.

【０１１９】なお、図２３の構成（エンコーダ）に対応
する構成（デコーダ）では、上記正規化回路９０５，９
０６に対応する逆正規化処理を行う逆正規化回路９０
８，９０７が設けられ、これら逆正規化回路９０８，９
０７の出力が加算器９０４で加算される。その加算出力
が出力端子９１０から取りだされることになる。In the structure (decoder) corresponding to the structure (encoder) of FIG. 23, the normalizing circuits 905, 9 are used.
Denormalization circuit 90 for performing denormalization processing corresponding to 06
8 and 907 are provided, and these denormalization circuits 908 and 9 are provided.
The output of 07 is added by the adder 904. The added output is taken out from the output terminal 910.

【０１２０】次に、図１に示したサラウンドレフトチャ
ネルとサラウンドライトチャネルとの間でビット配分を
行う高率圧縮符号化回路２２３，２２４の具体的構成を
図２４に示す。Next, FIG. 24 shows a specific configuration of the high-rate compression coding circuits 223 and 224 for allocating bits between the surround left channel and the surround right channel shown in FIG.

【０１２１】この図２４において、入力端子３０１には
サラウンドレフトチャネルのディジタルオーディオ信号
が、入力端子３１１にはサラウンドライトチャネルのデ
ィジタルオーディオ信号が供給される。In FIG. 24, an input terminal 301 is supplied with a surround left channel digital audio signal, and an input terminal 311 is supplied with a surround right channel digital audio signal.

【０１２２】上記入力端子３０１と３１１からのディジ
タルオーディオ信号は、それぞれ対応するバッファ３０
２，３１２に一旦記憶される。このバッファ３０２，３
１２からは、各々５０％オーバーラップしたＮポイント
（Ｎサンプル）毎のブロックでデータが取り出される。
このブロック単位のデータは、直交変換回路３０３，３
１３に送られ、当該直交変換回路３０３，３１３によっ
てそれぞれＭＤＣＴ及びＭＤＳＴ(Modified Discrete S
ine transform)の直交変換が施される。The digital audio signals from the input terminals 301 and 311 are sent to the corresponding buffers 30.
2, 312 once. This buffer 302,3
From No. 12, data is taken out in blocks of N points (N samples) each of which overlaps by 50%.
This block-unit data is used for the orthogonal transformation circuits 303 and 3
13 and MDCT and MDST (Modified Discrete S
ine transform).

【０１２３】上記直交変換回路３０３からの係数データ
は、それぞれ対応するサブバンド・ブロックフローティ
ングポイント圧縮回路３０４，３１４によって圧縮され
る。上記サブバンド・ブロックフローティングポイント
圧縮回路３０４，３１４からの係数データと語長情報や
スケールファクタ等のサブ情報は、対応する適応量子化
回路３０５，３１５に送られる。また、サブバンド・ブ
ロックフローティングポイント圧縮回路３０４，３１４
からは、スペクトル情報がｌｏｇスペクトラルエンベロ
ープ検出回路３０８に送られる。The coefficient data from the orthogonal transform circuit 303 is compressed by the corresponding subband block floating point compression circuits 304 and 314, respectively. The coefficient data from the sub-band / block floating point compression circuits 304 and 314 and sub-information such as word length information and scale factor are sent to the corresponding adaptive quantization circuits 305 and 315. Also, the sub-band block floating point compression circuits 304, 314
From, the spectral information is sent to the log spectral envelope detection circuit 308.

【０１２４】上記適応量子化回路３０５は、ｌｏｇスペ
クトラルエンベロープ検出回路３０８によって検出され
たエンベロープ情報に基づいてチャネル間ビット配分量
を決定する分配決定回路３０９からのビット配分情報に
基づいて、上記係数データとサブ情報を適応的に量子化
する。この適応量子化回路３０５，３１５からは、量子
化された係数データとサブ情報及びビット配分情報とが
出力される。この適応量子化回路３０５，３１５の各出
力は、上記マルチプレクス・エラーコレクション回路３
０６，３１６に送られる。The adaptive quantization circuit 305 determines the coefficient data based on the bit allocation information from the distribution determination circuit 309 which determines the inter-channel bit allocation amount based on the envelope information detected by the log spectral envelope detection circuit 308. And sub information is adaptively quantized. The adaptive quantization circuits 305 and 315 output quantized coefficient data, sub information, and bit allocation information. The outputs of the adaptive quantization circuits 305 and 315 are the same as the multiplex error correction circuit 3 described above.
It is sent to 06, 316.

【０１２５】これらマルチプレクス・エラーコレクショ
ン回路３０６，３１６では、各チャネル毎に量子化され
た係数データとサブ情報及びビット配分情報をマルチプ
レクスすると共に、エラー訂正符号を付加する。これら
マルチプレクス・エラーコレクション回路３０６，３１
６からの出力端子３０７，３１７を介した出力が、前記
図１のサラウンドレフトチャネル用とサラウンドライト
チャネル用の高率圧縮符号化回路２２３，２２４の出力
となる。The multiplex error correction circuits 306 and 316 multiplex the quantized coefficient data, sub information and bit allocation information for each channel, and add an error correction code. These multiplex error correction circuits 306 and 31
The outputs from the output terminals 6 and 6 via the output terminals 307 and 317 are the outputs of the high-rate compression encoding circuits 223 and 224 for the surround left channel and the surround right channel of FIG.

【０１２６】次に、図２５には、前記図１の各高聴感圧
縮符号化回路２１７〜２２２に対応する高聴感伸張復号
化回路の構成を示す。すなわち、この図２５の高聴感伸
張復号化回路は、本発明の伝送装置に対応する受信装置
に適用されるものであり、前記ｎ個の複数チャネルのデ
ィジタルオーディオ信号のうち、上記ｍ個のチャネルの
ディジタルオーディオ信号に対して上記第１の符号化方
式で圧縮符号化がなされた信号を復号化する回路（１チ
ャネル分）の構成である。Next, FIG. 25 shows a configuration of a high-audibility decompression / decoding circuit corresponding to each of the high-audibility compression / encoding circuits 217 to 222 shown in FIG. That is, the high-audibility expansion decoding circuit of FIG. 25 is applied to the receiving device corresponding to the transmitting device of the present invention, and the above-mentioned m channels among the n-channel digital audio signals are used. Is a circuit (for one channel) that decodes a signal that has been compression-encoded by the first encoding method for the digital audio signal.

【０１２７】この図２５において、各帯域の量子化され
たＭＤＣＴ係数は復号化装置入力端子１２２、１２４、
１２６に与えられ、また使用されたブロックサイズ情報
及び適応ビット配分情報は入力端子１２３、１２５、１
２７に与えられる。復号化回路１１６、１１７、１１８
では、適応ビット配分情報を用いてビット割当を解除
し、ブロックサイズ情報を用いて伸張復号化を行う。In FIG. 25, the quantized MDCT coefficients of each band are the decoding device input terminals 122 and 124,
The block size information and adaptive bit allocation information given to and used by 126 are input terminals 123, 125, 1
Given to 27. Decoding circuits 116, 117, 118
Then, the adaptive bit allocation information is used to cancel the bit allocation, and the block size information is used to perform decompression decoding.

【０１２８】次に、ＩＭＤＣＴ回路１１３、１１４、１
１５では、周波数領域の信号が時間領域の信号に変換さ
れる。これらの部分帯域の時間領域信号は、ＩＱＭＦ回
路１１２、１１１により、全体域信号に復号化される。Next, the IMDCT circuits 113, 114, 1
At 15, the frequency domain signal is transformed into a time domain signal. IQMF circuits 112 and 111 decode the time domain signals of these subbands into whole-domain signals.

【０１２９】ここで、高聴感伸張復号化回路では、前記
チャネルビット配分を含む１２８ｋｂｐｓ以下のビット
配分(1) が行われるチャネルと、前記チャネルビット配
分を含む１４７ｋｂｐｓ以上のビット配分(2) が行われ
るチャネルにおけるある一定のビット量例えば１２８ｋ
ｂｐｓを最大とする部分と１２８ｋｂｐｓを越える部分
のそれぞれが、上記復号化回路１１６，１１７，１１８
で復号化される。但し、ビット配分(2) の２つ部分はそ
れそれが復号化された後、それぞれのサンプルが加算さ
れて精度の高いサンプルとなる。Here, in the high-audibility decompression decoding circuit, a channel for which bit allocation (1) of 128 kbps or less including the channel bit allocation is performed and a bit allocation (2) of 147 kbps or more including the channel bit allocation are performed. A certain amount of bits in the channel to be addressed, eg 128k
The decoding circuit 116, 117, 118 has a maximum bps portion and a portion exceeding 128 kbps, respectively.
Is decrypted with. However, the two parts of the bit allocation (2) become high-precision samples by adding the respective samples after they are decoded.

【０１３０】また、得られた各チャネルのデータの並べ
方については、シンクブロック中に、先ず、（１）前記
チャネルビット配分を含む１２８ｋｂｐｓ以下のビット
配分が行われるチャネル、（２）前記チャネルビット配
分を含む１４７ｋｂｐｓ以上のビット配分が行われるチ
ャネルにおけるある一定のビット量例えば１２８ｋｂｐ
ｓを最大とする部分を、チャネル順に並べ、次に前記チ
ャネルビット配分を含む１４７ｋｂｐｓ以上のビット配
分が行われるチャネルにおける１２８ｋｂｐｓを越える
部分をチャネル順に並べる。Regarding the way of arranging the obtained data of each channel, in the sync block, first, (1) a channel in which the bit allocation of 128 kbps or less including the channel bit allocation is performed, and (2) the channel bit allocation Including a certain bit amount in a channel in which a bit allocation of 147 kbps or more is performed, for example, 128 kbp
The part having the maximum s is arranged in the order of channels, and then the part exceeding 128 kbps in the channel in which the bit allocation of 147 kbps or more including the channel bit allocation is performed is arranged in the channel order.

【０１３１】次に、図１の高率圧縮符号化回路２２３，
２２４に対応する高率伸張復号化回路の構成を図２６
（１チャネル分）に示す。すなわち、この図２６の高率
伸張復号化回路は、本発明の伝送装置に対応する受信装
置に適用されるものであり、前記ｎ−ｍ個のチャネルの
ディジタルオーディオ信号に対して上記第２の符号化方
式で圧縮符号化がされた信号を復号化する回路（１チャ
ネル分）の構成である。Next, the high-rate compression coding circuit 223 of FIG.
FIG. 26 shows the configuration of a high rate expansion decoding circuit corresponding to H.224.
(1 channel). That is, the high-rate decompression decoding circuit of FIG. 26 is applied to the receiving device corresponding to the transmitting device of the present invention, and the second above-mentioned second is applied to the digital audio signals of the nm channels. This is a configuration of a circuit (for one channel) that decodes a signal that has been compression-encoded by an encoding method.

【０１３２】この図２６において、入力端子４１０に
は、前記高率圧縮符号化が施されたディジタルオーディ
オ信号が供給される。この信号は、デマルチプレクス・
エラーコレクション回路４１１によってデマルチプレク
スとエラー訂正が行われる。In FIG. 26, the input terminal 410 is supplied with the high-rate compression-encoded digital audio signal. This signal is demultiplexed
The error correction circuit 411 performs demultiplexing and error correction.

【０１３３】当該デマルチプレクス・エラーコレクショ
ン回路４１１からは、適応量子化された係数データとサ
ブ情報及びビット配分情報とが出力される。係数データ
及びサブ情報は、適応逆量子化回路４１２に送られる。
また、ビット配分情報は量子化ステップサイズコントロ
ール回路４１３に送られる。上記適応逆量子化回路４１
２は、上記量子化ステップサイズコントロール回路４１
３からの量子化ステップサイズ情報に基づいて、上記量
子化変換係数情報に対して逆量子化を施す。この適応逆
量子化回路４１２からの量子化圧縮変換係数はサブバン
ド・ブロックフローティングポイント伸張回路４１４に
送られる。The demultiplex error correction circuit 411 outputs adaptively quantized coefficient data, sub information, and bit allocation information. The coefficient data and the sub information are sent to the adaptive inverse quantization circuit 412.
Also, the bit allocation information is sent to the quantization step size control circuit 413. The adaptive inverse quantization circuit 41
2 is the quantization step size control circuit 41
Inverse quantization is performed on the quantized transform coefficient information based on the quantization step size information from 3. The quantized compression transform coefficient from the adaptive dequantization circuit 412 is sent to the subband block floating point expansion circuit 414.

【０１３４】上記サブバンド・ブロックフローティング
ポイント伸張回路４１４では、前記図２４のサブバンド
・ブロックフローティングポイント圧縮回路３０４，３
１４の逆処理を行う。この伸張回路４１４の出力は、同
じく図２４の直交変換回路３０３，３１３の逆変換処理
を行う逆直交変換回路４１５によってＮポイントのサン
プルデータに変換され、ウインドウ・オーバーラップ加
算回路４１６に送られる。当該ウインドウ・オーバーラ
ップ加算回路４１６では、前記オーバーラップが解除さ
れて、ＰＣＭオーディオ信号として出力される。このＰ
ＣＭオーディオ信号が出力端子４１６から取り出され
る。In the sub-band / block floating-point expansion circuit 414, the sub-band / block floating-point compression circuits 304 and 3 shown in FIG.
The reverse process of 14 is performed. The output of the decompression circuit 414 is converted into N-point sample data by the inverse orthogonal transform circuit 415 which similarly performs the inverse transform processing of the orthogonal transform circuits 303 and 313 of FIG. 24, and is sent to the window overlap addition circuit 416. The window overlap adding circuit 416 removes the overlap and outputs the PCM audio signal. This P
The CM audio signal is taken out from the output terminal 416.

【０１３５】[0135]

【発明の効果】以上の説明からも明らかなように、本発
明のディジタルオーディオ信号の伝送装置においては、
複数チャネルの信号のうち、聴感上影響力の高い信号か
らなるチャネルの信号に対しては聴感上の劣化が少ない
第１の圧縮率の第１の符号化方式によって圧縮符号化を
行い、聴感上影響力の低い信号からなるチャネルの信号
に対しては第１の圧縮率よりも高い第２の圧縮率の第２
の符号化方式で圧縮符号化を行うようにしているため、
特に重要な音に対しては高音質の圧縮符号化が可能であ
ると共に、ビット配分量（バイト配分量）の無駄を無く
すことが可能となる。As is apparent from the above description, in the digital audio signal transmitting apparatus of the present invention,
Among the signals of a plurality of channels, a signal of a channel consisting of a signal having a high influence on the auditory perception is compression-encoded by the first encoding method with the first compression rate that causes less degradation on the auditory sense, and The second compression rate higher than the first compression rate is applied to the signal of the channel including the low-impact signal.
Since the compression encoding is performed with the encoding method of
High-quality compression coding can be performed for particularly important sounds, and waste of the bit allocation amount (byte allocation amount) can be eliminated.

【０１３６】また、本発明のディジタルオーディオ信号
の受信装置においては、複数チャネルの信号のうち、聴
感上影響力の高い信号からなるチャネルの信号に対して
聴感上の劣化が少ない第１の圧縮率の第１の符号化方式
によって圧縮符号化がされた信号と、聴感上影響力の低
い信号からなるチャネルの信号に対して第１の圧縮率よ
りも高い第２の圧縮率の第２の符号化方式で圧縮符号化
がなされた信号を、それぞれ復号化可能となる。Further, in the digital audio signal receiving apparatus of the present invention, the first compression rate, which is less deteriorated in hearing than the signal of the channel consisting of the signals having a high auditory influence among the signals of the plurality of channels. Second code having a second compression rate higher than the first compression rate with respect to a channel signal composed of a signal that has been compression-encoded by the first encoding method and a signal that has a low auditory influence. It becomes possible to decode the respective signals that have been compression-encoded by the encoding method.

[Brief description of drawings]

【図１】本発明実施例のディジタルオーディオ信号の伝
送装置の主要部の概略構成を示すブロック回路図であ
る。FIG. 1 is a block circuit diagram showing a schematic configuration of a main part of a digital audio signal transmission apparatus according to an embodiment of the present invention.

【図２】８チャネルディジタルサラウンドシステムにお
けるスピーカの配置を説明するための図である。FIG. 2 is a diagram for explaining the arrangement of speakers in an 8-channel digital surround system.

【図３】各チャネルの圧縮されたオーディオ信号をマル
チプレクスする構成を示すブロック回路図である。FIG. 3 is a block circuit diagram showing a configuration for multiplexing a compressed audio signal of each channel.

【図４】高聴感圧縮符号化回路の一具体例（チャネル間
ビット配分を行わない例）の概略構成を示すブロック回
路図である。FIG. 4 is a block circuit diagram showing a schematic configuration of a specific example (an example in which bit allocation between channels is not performed) of a high-audibility compression encoding circuit.

【図５】高聴感圧縮符号化回路の一具体例（チャネル間
ビット配分を行う例）の概略構成を示すブロック回路図
である。FIG. 5 is a block circuit diagram showing a schematic configuration of a specific example (an example of performing bit allocation between channels) of a high-audibility compression encoding circuit.

【図６】高聴感圧縮符号化回路での信号の周波数及び時
間分割を示す図である。FIG. 6 is a diagram showing frequency and time division of a signal in a high-audibility compression encoding circuit.

【図７】高聴感圧縮符号化回路のマルチチャネルでのビ
ット配分用パラメータを求める構成の一例を示すブロッ
ク回路図である。FIG. 7 is a block circuit diagram showing an example of a configuration for obtaining a parameter for bit allocation in multi-channel of a high-audibility compression encoding circuit.

【図８】高聴感圧縮符号化回路におけるチャネル間でス
ペクトルの大きさからビット配分を行う概念を示す図で
ある。[Fig. 8] Fig. 8 is a diagram illustrating the concept of bit allocation based on the magnitude of a spectrum among channels in a high-audibility compression encoding circuit.

【図９】高聴感圧縮符号化回路におけるチャネル間での
情報信号の時間特性を考慮したビット配分の為のパラメ
ータの求め方を示す図である。[Fig. 9] Fig. 9 is a diagram illustrating a method of obtaining a parameter for bit allocation in consideration of a time characteristic of an information signal between channels in a high-audibility compression encoding circuit.

【図１０】ビット配分(1) のビット配分量とトーナリテ
ィとの間の関係を示す図である。FIG. 10 is a diagram showing the relationship between the bit allocation amount and the tonality of bit allocation (1).

【図１１】ビット配分(1) のビット配分量と時間変化率
との間の関係を示す図である。FIG. 11 is a diagram showing a relationship between a bit allocation amount of bit allocation (1) and a time change rate.

【図１２】均一配分の時のノイズスペクトルを示す図で
ある。FIG. 12 is a diagram showing a noise spectrum at the time of uniform distribution.

【図１３】情報信号の周波数スペクトル及びレベルに対
する依存性を持たした聴覚的な効果を得るためのビット
配分によるノイズスペクトルの例を示す図である。FIG. 13 is a diagram showing an example of a noise spectrum due to bit allocation for obtaining an auditory effect having dependency on a frequency spectrum and a level of an information signal.

【図１４】情報信号の大きさ及び聴覚許容雑音スペクト
ルの二者を用いたビット配分手法を実現する構成を示す
ブロック回路図である。FIG. 14 is a block circuit diagram showing a configuration for realizing a bit allocation method using the two of the magnitude of an information signal and the permissible noise spectrum of hearing.

【図１５】許容雑音レベルを求める構成を示すブロック
回路図である。FIG. 15 is a block circuit diagram showing a configuration for obtaining an allowable noise level.

【図１６】各帯域の信号レベルによるマスキングスレシ
ョールドの例を示す図である。FIG. 16 is a diagram showing an example of a masking threshold depending on the signal level of each band.

【図１７】情報スペクトル、マスキングスレショール
ド、最小可聴限を示す図である。FIG. 17 is a diagram showing an information spectrum, a masking threshold, and a minimum audible limit.

【図１８】トーナリティが低い情報信号に対する信号レ
ベル依存および聴覚許容雑音レベル依存のビット配分を
示す図である。FIG. 18 is a diagram showing signal level-dependent and auditory permissible noise level-dependent bit allocation for an information signal having low tonality.

【図１９】トーナリティが高い情報信号に対する信号レ
ベル依存および聴覚許容雑音レベル依存のビット配分を
示す図である。FIG. 19 is a diagram showing signal level-dependent and auditory permissible noise level-dependent bit allocation for an information signal having high tonality.

【図２０】トーナリティが低い情報信号に対する量子化
雑音レベルを示す図である。FIG. 20 is a diagram showing a quantization noise level for an information signal having low tonality.

【図２１】トーナリティが高い情報信号に対する量子化
雑音レベルを示す図である。FIG. 21 is a diagram showing a quantization noise level for an information signal having high tonality.

【図２２】高聴感圧縮符号化回路でのマルチチャネルに
おけるビット配分の関係を示す図である。[Fig. 22] Fig. 22 is a diagram illustrating the relationship of bit allocation in multiple channels in the high-hearingness compression encoding circuit.

【図２３】ビット配分の分割を行う具体的構成を示すブ
ロック回路図である。FIG. 23 is a block circuit diagram showing a specific configuration for dividing bit allocation.

【図２４】高率圧縮符号化回路の具体的構成例を示すブ
ロック回路図である。FIG. 24 is a block circuit diagram showing a specific configuration example of a high-rate compression encoding circuit.

【図２５】本発明実施例の受信装置の高聴感伸張復号化
回路の構成例を示すブロック回路図である。FIG. 25 is a block circuit diagram showing a configuration example of a high-audibility extension decoding circuit of the receiving device according to the embodiment of the present invention.

【図２６】本発明実施例の受信装置の高率伸張復号化回
路の構成例を示すブロック回路図である。FIG. 26 is a block circuit diagram showing a configuration example of a high-rate expansion decoding circuit of the receiving apparatus according to the embodiment of the present invention.

[Explanation of symbols]

２０９〜２１６・・・Ａ／Ｄ変換器２１７〜２２２・・・高聴感圧縮符号化回路２２３，２２４・・・高率圧縮符号化回路２２５，２２６・・・ハイパスフィルタ２２８，２２９・・・ローパスフィルタ２２７，２３０・・・加算器２４１・・・マルチプレクサ３０２，３１３・・・バッファ３０３，３１３・・・直交変換回路３０４，３１４・・・サブバンドブロックフローティン
グポイント圧縮回路３０５，３１５・・・適応量子化回路３０６，３１６・・・マルチプレクス・エラーコレクシ
ョン回路３０８・・・ｌｏｇスペクトラルエンベロープ検出回路３０９・・・分配決定回路１１，１２・・・帯域分割フィルタ１３，１４，１５・・・ＭＤＣＴ回路１６，１７，１８・・・適応ビット配分符号化回路１９，２０，２１・・・ブロックサイズ決定回路３１・・・各チャネル情報信号入力端子３２・・・マッピング回路３３・・・ブロッキング回路３４・・・時間変化算出回路３５・・・スケ−ルファクタ算出回路３６・・・トーナリティ算出回路３７・・・正規化回路３８・・・ビット配分回路１１６，１１７，１１８・・・適応ビット配分復号化回
路１１３，１１４，１１５・・・ＩＭＤＣＴ回路１１２，１１１・・・ＩＱＭＦ回路４１１・・・デマルチプレクス・エラーコレクション回
路４１２・・・適応逆量子化回路４１３・・・量子化ステップサイズコントロール回路４１４・・・サブバンドブロックフローティングポイン
ト伸張回路４１５・・・逆直交変換回路４１６・・・ウインドウ・オーバーラップ加算回路５２０・・・許容雑音算出回路５２１・・・許容雑音算出回路入力端子５２２・・・帯域毎のエネルギ検出回路５２３・・・畳込みフィルタ回路５２４・・・引算器５２５・・・ｎ−ａｉ関数発生回路５２６・・・割算器５２７・・・合成回路５２８・・・減算器５３０・・・許容雑音補正回路５３２・・・最小可聴カーブ発生回路５３３・・・補正情報出力回路８０２・・・使用可能総ビット発生回路８０３・・・帯域毎のエネルギ算出回路８０４・・・エネルギ依存のビット配分回路８０５・・・聴覚許容雑音レベル依存のビット配分回路８０６・・・アダー８０８・・・スペクトルの滑らかさ算出回路８０９・・・ビット分割率決定回路８１１、８１２・・・マルチプライヤ９０５，９０６・・・正規化回路９０１・・・第１の量子化器９０３・・・第２の量子化器９０７，９０９・・・逆正規化回路９０４・・・加算器209 to 216 ... A / D converter 217 to 222 ... High-audibility compression encoding circuit 223, 224 ... High-rate compression encoding circuit 225, 226 ... High-pass filter 228, 229 ... Low-pass Filters 227, 230 ... Adders 241 ... Multiplexers 302, 313 ... Buffers 303, 313 ... Orthogonal transformation circuits 304, 314 ... Subband block floating point compression circuits 305, 315 ... Adaptation Quantization circuit 306, 316 ... Multiplex error correction circuit 308 ... Log spectral envelope detection circuit 309 ... Distribution decision circuit 11, 12 ... Band division filter 13, 14, 15 ... MDCT circuit 16, 17, 18 ... Adaptive bit allocation encoding circuit 19, 20, 2・・・ Block size determination circuit 31 ・・・ Each channel information signal input terminal 32 ・・・ Mapping circuit 33 ・・・ Blocking circuit 34 ・・・ Time change calculation circuit 35 ・・・ Scale factor calculation circuit 36 ・・・Tonality calculation circuit 37 ... Normalization circuit 38 ... Bit allocation circuit 116, 117, 118 ... Adaptive bit allocation decoding circuit 113, 114, 115 ... IMDCT circuit 112, 111 ... IQMF circuit 411 Demultiplex error correction circuit 412 Adaptive dequantization circuit 413 Quantization step size control circuit 414 Subband block floating point expansion circuit 415 Inverse orthogonal transformation circuit 416 ..Window overlap adding circuit 520 ... Allowable noise Calculation circuit 521 ... Allowable noise calculation circuit input terminal 522 ... Energy detection circuit for each band 523 ... Convolution filter circuit 524 ... Subtractor 525 ... n-ai function generation circuit 526 ... -Divider 527 ... Synthesis circuit 528 ... Subtractor 530 ... Allowable noise correction circuit 532 ... Minimum audible curve generation circuit 533 ... Correction information output circuit 802 ... Usable total bit generation Circuit 803 ... Energy calculation circuit for each band 804 ... Energy-dependent bit allocation circuit 805 ... Auditory permissible noise level-dependent bit allocation circuit 806 ... Adder 808 ... Spectral smoothness calculation circuit 809 ... Bit division rate determination circuits 811, 812 ... Multipliers 905, 906 ... Normalization circuit 901 ... First quantizer 90 3 ... Second quantizer 907, 909 ... Inverse normalization circuit 904 ... Adder

【手続補正書】[Procedure amendment]

【提出日】平成６年４月８日[Submission date] April 8, 1994

【手続補正１】[Procedure Amendment 1]

【補正対象書類名】明細書[Document name to be amended] Statement

【補正対象項目名】全文[Correction target item name] Full text

【補正方法】変更[Correction method] Change

【補正内容】[Correction content]

【書類名】明細書[Document name] Statement

【発明の名称】ディジタルオーディオ信号の伝送装置
及び受信装置、並びにディジタルオーディオ信号の伝送
方法及び受信方法Title: Digital audio signal transmitting apparatus and receiving apparatus, and digital audio signal transmitting method and receiving method

【特許請求の範囲】[Claims]

【発明の詳細な説明】Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、例えば、映画フィルム
映写システム、ビデオテープレコーダ、ビデオディスク
プレーヤ等のステレオや、いわゆるマルチサラウンド音
響システムにおいて用いられるマルチチャネルのディジ
タルオーディオ信号を圧縮符号化して伝送するディジタ
ルオーディオ信号の伝送装置及びそれに対応する受信装
置、並びにディジタルオーディオ信号の伝送方法及びそ
れに対応する受信方法に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention, for example, compresses and transmits a multi-channel digital audio signal used in a stereo such as a movie film projection system, a video tape recorder, a video disc player, or a so-called multi surround sound system. The present invention relates to a digital audio signal transmitting apparatus and a corresponding receiving apparatus, a digital audio signal transmitting method, and a corresponding receiving method.

【０００２】[0002]

【０００５】[0005]

【０００９】[0009]

【００１３】次に、本発明のディジタルオーディオ信号
の伝送方法は、複数チャネル（ｎチャネル、ｎは３より
大なる正の整数）のディジタルオーディオ信号を圧縮符
号化して伝送するディジタルオーディオ信号の伝送方法
であり、上記ｎ個のチャネルのディジタルオーディオ信
号のうち、ｍ個（ｎ＞ｍ）のチャネルのディジタルオー
ディオ信号であって他のｎ−ｍ個のチャネルのディジタ
ルオーディオ信号より聴感上影響力の高い信号に対して
は第１の圧縮率の第１の符号化方式で圧縮符号化する第
１の圧縮符号化工程と、上記ｎ−ｍ個のチャネルのディ
ジタルオーディオ信号に対しては上記第１の圧縮率より
も高い第２の圧縮率を有する第２の符号化方式で圧縮符
号化する第２の圧縮符号化工程とを有し、上記第１の圧
縮符号化工程からの圧縮符号化出力と上記第２の圧縮符
号化工程からの圧縮符号化出力を共に伝送することを特
徴とするものである。Next, a method of transmitting a digital audio signal according to the present invention is a method of transmitting a digital audio signal in which digital audio signals of a plurality of channels (n channels, n is a positive integer greater than 3) are compression-encoded and transmitted. Of the n channels of digital audio signals, m (n> m) channels of digital audio signals are more audibly influential than the other n−m channels of digital audio signals. A first compression coding step of compressing and coding a signal using a first coding method with a first compression ratio, and the first compression coding step for the digital audio signals of the n−m channels. A second compression encoding step of performing compression encoding with a second encoding method having a second compression rate higher than the compression rate, and from the first compression encoding step It is characterized in transmitting the compressed encoded output from the compression encoded output and the second compression encoding step together.

【００１４】また、本発明のディジタルオーディオ信号
の受信方法は、複数チャネル（ｎチャネル、ｎは３より
大なる正の整数）のディジタルオーディオ信号のうち、
ｍ個（ｎ＞ｍ）のチャネルのディジタルオーディオ信号
であって他のｎ−ｍ個のチャネルのディジタルオーディ
オ信号より聴感上影響力の高い信号に対しては第１の圧
縮率の第１の符号化方式で圧縮符号化がなされていると
共に、上記ｎ−ｍ個のチャネルのディジタルオーディオ
信号に対しては上記第１の圧縮率よりも高い第２の圧縮
率を有する第２の符号化方式で圧縮符号化がなされた信
号を受信するディジタルオーディオ信号の受信方法であ
り、上記ｎ個のチャネルのディジタルオーディオ信号の
うち、上記ｍ個のチャネルのディジタルオーディオ信号
に対して上記第１の符号化方式に対応する第１の伸張復
号化を施す第１の復号化工程と、上記ｎ−ｍ個のチャネ
ルのディジタルオーディオ信号に対して上記第２の符号
化方式に対応する第２の伸張復号化を施す第２の復号化
工程とを有することを特徴とするものである。The method of receiving a digital audio signal according to the present invention is, among digital audio signals of a plurality of channels (n channels, n is a positive integer greater than 3),
The first code having the first compression ratio for a digital audio signal of m (n> m) channels, which has a higher auditory influence than other digital audio signals of nm channels. The second encoding method has a second compression rate higher than the first compression rate with respect to the digital audio signals of the n−m channels while being compression-encoded by the encoding method. A method of receiving a digital audio signal for receiving a signal that has been compression-encoded, wherein the first encoding system is applied to the digital audio signals of the m channels among the digital audio signals of the n channels. Corresponding to the second encoding method for the digital audio signals of the n−m channels and the first decoding step of performing the first decompression decoding corresponding to It is characterized in that a second decoding step of performing a second decompression and decoding.

【００１５】[0015]

【作用】本発明のディジタルオーディオ信号の伝送装置
及び方法によれば、複数チャネルの信号のうち、聴感上
影響力の高い信号からなるチャネルの信号に対しては聴
感上の劣化が少ない第１の圧縮率の第１の符号化方式に
よって圧縮符号化を行い、聴感上影響力の低い信号から
なるチャネルの信号に対しては第１の圧縮率よりも高い
第２の圧縮率の第２の符号化方式で圧縮符号化を行う。According to the apparatus and method for transmitting a digital audio signal of the present invention, of the signals of a plurality of channels, a signal of a channel consisting of a signal having a great influence on the auditory sense is less deteriorated in the auditory sense. A second code having a second compression rate higher than the first compression rate is applied to a signal of a channel composed of a signal having a low auditory influence, by performing compression encoding by the first encoding method with a compression rate. The compression coding is performed by the encoding method.

【００１６】また、本発明のディジタルオーディオ信号
の受信装置及び方法によれば、複数チャネルの信号のう
ち、聴感上影響力の高い信号からなるチャネルの信号に
対して聴感上の劣化が少ない第１の圧縮率の第１の符号
化方式によって圧縮符号化がされた信号と、聴感上影響
力の低い信号からなるチャネルの信号に対して第１の圧
縮率よりも高い第２の圧縮率の第２の符号化方式で圧縮
符号化がなされた信号を、それぞれ復号化するようにし
ている。Further, according to the digital audio signal receiving apparatus and method of the present invention, of the signals of a plurality of channels, the first signal is less deteriorated in the auditory sense than the signal of the channel consisting of the signal having a high auditory influence. Of the second compression rate higher than the first compression rate with respect to the signal of the channel composed of the signal compression-encoded by the first encoding method of the The signals that have been compression-encoded by the encoding method of No. 2 are each decoded.

【００１７】[0017]

【００１８】図１には、本発明のディジタルオーディオ
信号の伝送方法を実現する本発明のディジタルオーディ
オ信号の伝送装置の構成を示す。本発明実施例のディジ
タルオーディオ信号の伝送装置は、ｎ個（ｎは３より大
なる正の整数であり本実施例では例えば８個）のチャネ
ルのディジタルオーディオ信号を圧縮符号化して伝送す
る伝送装置である。FIG. 1 shows the configuration of a digital audio signal transmission apparatus of the present invention for implementing the digital audio signal transmission method of the present invention. A digital audio signal transmission apparatus according to an embodiment of the present invention is a transmission apparatus that compresses and encodes digital audio signals of n (n is a positive integer greater than 3; for example, 8 in this embodiment) channels. Is.

【００１９】ここで、本実施例では、上記ｎ個（８個）
のチャネルとして、例えば前述したレフトチャネル（Ｌ
ｃｈ）、レフトセンタチャネル（ＬＣｃｈ）、センタチ
ャネル（Ｃｃｈ）、サブウーファチャネル（ＳＷｃ
ｈ）、ライトセンタチャネル（ＲＣｃｈ）、ライトチャ
ネル（Ｒｃｈ）、サラウンドレフトチャネル（ＬＢｃ
ｈ）、サラウンドライトチャネル（ＲＢｃｈ）を用い
る。なお、本発明では、上記例に限らず、上記ｎ個のチ
ャネルを例えばセンタチャネルとレフトチャネルとライ
トチャネルとサラウンドレフトチャネルとサラウンドラ
イトチャネルの５チャネルとしたり、また、センタチャ
ネルとレフトチャネルとライトチャネルとレフトセンタ
チャネルとライトセンタチャネルとサラウンドレフトチ
ャネルとサラウンドライトチャネルの７チャネルとする
こともできる。Here, in this embodiment, the above n pieces (8 pieces) are used.
The left channel (L
ch), left center channel (LCch), center channel (Cch), subwoofer channel (SWc)
h), right center channel (RCch), right channel (Rch), surround left channel (LBc)
h), a surround right channel (RBch) is used. In the present invention, the n channels are not limited to the above example, but may be, for example, 5 channels of a center channel, a left channel, a right channel, a surround left channel and a surround right channel, or a center channel, a left channel and a right channel. It is also possible to use seven channels including a channel, a left center channel, a right center channel, a surround left channel and a surround right channel.

【００２０】本実施例のディジタルオーディオ信号の伝
送装置は、上記８チャネルのディジタルオーディオ信号
のうち、ｍ個（ｎ＞ｍで、図１の例ではｍ＝６）のチャ
ネルのディジタルオーディオ信号であって他のｎ−ｍ
（図１の例ではｎ−ｍ＝８−６＝２）個のチャネルのデ
ィジタルオーディオ信号より聴感上影響力の高い信号に
対しては第１の圧縮率の第１の符号化方式で圧縮符号化
する第１の圧縮符号化手段としての高聴感圧縮符号化回
路２１７〜２２２と、上記ｎ−ｍ個のチャネルのディジ
タルオーディオ信号に対しては上記第１の圧縮率よりも
高い第２の圧縮率を有する第２の符号化方式で圧縮符号
化する第２の圧縮符号化手段である高率圧縮符号化回路
２２３，２２４とを有し、上記高聴感圧縮符号化回路２
１７〜２２２からの圧縮符号化出力と上記高率圧縮符号
化回路２２３，２２４からの圧縮符号化出力を共に伝送
することを特徴とするものである。The digital audio signal transmitting apparatus according to the present embodiment is a digital audio signal of m (n> m, m = 6 in the example of FIG. 1) channels of the above eight channels of digital audio signals. Other nm
(In the example of FIG. 1, mn = 8−6 = 2) For a signal that has a higher auditory influence than a digital audio signal of the number of channels, a compression code is generated by the first encoding method with the first compression ratio. The high-audibility compression coding circuits 217 to 222 as the first compression coding means for converting the digital audio signals into the n-m channels and the second compression higher than the first compression ratio. A high-rate compression coding circuit 223, 224 which is a second compression coding means for performing compression coding by a second coding method having a rate.
It is characterized in that the compression coded outputs from 17 to 222 and the compression coded outputs from the high rate compression coding circuits 223 and 224 are both transmitted.

【００２１】なお、本実施例では、上記８チャネルのう
ち上記第１の符号化方式で圧縮する上記聴感上影響力の
高い信号からなるチャネル（６個のチャネル）を、上記
レフトチャネルとレフトセンタチャネルとセンタチャネ
ルとサブウーファチャネルとライトセンタチャネルとラ
イトチャネルとし、上記第２の符号化方式で圧縮するチ
ャネル（ｎ−ｍ個のチャネル）をサラウンドレフトチャ
ネルとサラウンドライトチャネルとしている。勿論、本
発明はこれらに限定されず、例えば上記ｎ個のチャネル
を例えばセンタチャネルとレフトチャネルとライトチャ
ネルとサラウンドレフトチャネルとサラウンドライトチ
ャネルの５チャネルとしたときには、上記ｍ個のチャネ
ルをセンタチャネルとレフトチャネルとライトチャネル
の３チャネルとし、上記ｎ−ｍ個のチャネルをサラウン
ドレフトチャネルとサラウンドライトチャネルとするこ
ともできる。同様に、上記ｎ個のチャネルを例えばセン
タチャネルとレフトチャネルとライトチャネルとレフト
センタチャネルとライトセンタチャネルとサラウンドレ
フトチャネルとサラウンドライトチャネルの７チャネル
としたときには、上記ｍ個のチャネルをセンタチャネル
とレフトチャネルとライトチャネルとし、上記ｎ−ｍ個
のチャネルをサラウンドレフトチャネルとサラウンドラ
イトチャネルとすることもできる。In the present embodiment, among the eight channels, the channels (six channels) made up of the signals that have a high perceptual influence on the signals and are compressed by the first coding method are used as the left channel and the left center. A channel, a center channel, a subwoofer channel, a right center channel and a right channel, and channels (nm channels) compressed by the second encoding method are a surround left channel and a surround right channel. Of course, the present invention is not limited to these, and for example, when the above n channels are, for example, 5 channels of a center channel, a left channel, a right channel, a surround left channel and a surround right channel, the m channels are replaced by the center channel. The left channel and the right channel may be three channels, and the n−m channels may be the surround left channel and the surround right channel. Similarly, when the n channels are, for example, 7 channels of a center channel, a left channel, a right channel, a left center channel, a right center channel, a surround left channel and a surround right channel, the m channels are referred to as center channels. A left channel and a right channel may be used, and the above n−m channels may be a surround left channel and a surround right channel.

【００２２】さらに、本発明実施例の伝送装置におい
て、上記第１の符号化方式で圧縮符号化を行う高聴感圧
縮符号化回路２１７〜２２２には、前述したようなサブ
バンドコーディング等を用いたオーディオ信号の高能率
圧縮符号化方式であって人間の聴覚上の特性を利用し、
オーディオデータを約１／５に圧縮するような方式、す
なわち、入力ディジタルオーディオ信号を複数帯域に分
割し、各帯域毎のディジタルオーディオ信号を複数サン
プル毎にブロック化し、各ブロック単位で直交変換した
スペクトルデータを、人間の聴覚特性に応じて適応的に
圧縮符号化する方式（例えばいわゆるＡＴＲＡＣ：Adap
tive TRansform Acoustic Coding方式）を用いる。Further, in the transmission apparatus of the embodiment of the present invention, the above-described sub-band coding or the like is used for the high-audibility compression coding circuits 217 to 222 which perform compression coding by the first coding method. It is a high-efficiency compression encoding method for audio signals, which makes use of human auditory characteristics,
A method of compressing audio data to about ⅕, that is, a spectrum in which an input digital audio signal is divided into a plurality of bands, the digital audio signal of each band is divided into a plurality of samples, and orthogonally transformed in each block. A method of adaptively compression-coding data according to human auditory characteristics (for example, so-called ATRAC: Adap
tive TRansform Acoustic Coding method) is used.

【００２３】また、第２の符号化方式で圧縮符号化を行
う高率圧縮符号化回路２２３，２２４には、入力ディジ
タルオーディオ信号を複数サンプル毎に直交変換した係
数データとそれに関連するサブ情報（語長情報やスケー
ルファクタの情報）を得、各チャネルのエネルギに応じ
て各チャネルのビット配分を決定し、この各チャネルの
ビット配分によって適応的に圧縮符号化する方式を用い
る。勿論、本発明はこれら各符号化方式に限定されるも
のでないことは言うまでもない。Further, in the high-rate compression coding circuits 223 and 224 which carry out compression coding by the second coding method, coefficient data obtained by orthogonally transforming an input digital audio signal for every plural samples and sub-information related thereto ( The word length information and the scale factor information) are obtained, the bit allocation of each channel is determined according to the energy of each channel, and the compression encoding is adaptively performed by the bit allocation of each channel. Needless to say, the present invention is not limited to each of these encoding methods.

【００２４】さらに、本実施例では、上記伝送として、
例えば映画フィルムへの記録や、光ディスク，光磁気デ
ィスク，相変化型光ディスク，磁気ディスク等のディス
ク状記録媒体、磁気テープ等のテープ状記録媒体への記
録、半導体メモリ，ＩＣカードなどへの記録を挙げるこ
とができる。Further, in the present embodiment, the transmission is as follows.
For example, recording on motion picture film, recording on disk-shaped recording media such as optical disks, magneto-optical disks, phase-change optical disks, magnetic disks, tape-shaped recording media such as magnetic tape, recording on semiconductor memory, IC cards, etc. Can be mentioned.

【００２５】なお、上記伝送を映画フィルムへの記録と
した場合には、例えば図２に示すようにスピーカが配置
されるディジタルサラウンドシステムに対応することに
なる。各チャネルは、センタ（Ｃ）チャネル、サブウー
ファ（ＳＷ）チャネル、レフト（Ｌ）チャネル、レフト
センタ（ＣＬ）チャネル、ライト（Ｒ）チャネル、ライ
トセンタ（ＣＲ）チャネル、レフトサラウンド（ＬＢ）
チャネル、ライトサラウンド（ＲＢ）チャネルの８つで
ある。When the transmission is recorded on a movie film, it corresponds to a digital surround system in which speakers are arranged as shown in FIG. 2, for example. Each channel is a center (C) channel, a subwoofer (SW) channel, a left (L) channel, a left center (CL) channel, a right (R) channel, a right center (CR) channel, a left surround (LB) channel.
There are eight channels, a light surround (RB) channel.

【００２６】ここで、この図２において、上記映画フィ
ルムに記録する上記８チャネルの各チャネルは、例えば
当該映画フィルムの画像記録領域から再生された画像が
映写機（プロジェクタ１００）によって投影されるスク
リーン１０１側に配置されたレフトスピーカ１０６，レ
フトセンタースピーカ１０４，センタースピーカ１０
２，ライトセンタースピーカ１０５，ライトスピーカ１
０７，サラウンドレフトスピーカ１０８及び２００，サ
ラウンドライトスピーカ１０９及び２０１，サブウーフ
ァスピーカ１０３と対応するものである。In FIG. 2, each of the eight channels recorded on the motion picture film has a screen 101 on which an image reproduced from an image recording area of the motion picture film is projected by a projector (projector 100). Left speaker 106, left center speaker 104, center speaker 10 arranged on the side
2, light center speaker 105, light speaker 1
07, surround left speakers 108 and 200, surround right speakers 109 and 201, and subwoofer speaker 103.

【００２７】上記センタスピーカ１０２は、スクリーン
１０１側の中央に配置され、センタチャネルのオーディ
オデータによる再生音を出力するもので例えば俳優のせ
りふ等の最も重要な再生音を出力する。上記サブウーフ
ァスピーカ１０３は、サブウーファチャネルのオーディ
オデータによる再生音を出力するもので、例えば爆発音
などの低域の音というよりは振動として感じられる音を
効果的に出力するものであり、爆発シーンなどに効果的
に使用されることが多いものである。上記レフトスピー
カ１０６及びライトスピーカ１０７は、上記スクリーン
１０１の左右に配置され、レフトチャネルのオーディオ
データによる再生音とライトチャネルのオーディオデー
タによる再生音を出力するもので、ステレオ音響効果を
発揮する。上記レフトセンタスピーカ１０４とライトセ
ンタスピーカ１０５は、上記センタスピーカ１０２と上
記レフトスピーカ１０６及びライトスピーカ１０７との
間に配置され、レフトセンタチャネルのオーディオデー
タによる再生音とライトセンタチャネルのオーディオデ
ータによる再生音を出力するもので、それぞれ上記レフ
トスピーカ１０６及びライトスピーカ１０７の補助的な
役割を果たす。特にスクリーン１０１が大きく収容人数
の多い映画館等では、座席の位置によって音像の定位が
不安定になりやすいが、上記レフトセンタスピーカ１０
４とライトセンタスピーカ１０７を付加することによ
り、音像のよりリアルな定位を作り出すのに効果を発揮
する。さらに、上記サラウンドレフトスピーカ１０８と
サラウンドライトスピーカ１０９は、観客席を取り囲む
ように配置され、サラウンドレフトチャネルのオーディ
オデータによる再生音とサラウンドライトチャネルのオ
ーディオデータによる再生音を出力するもので、残響音
や拍手、歓声に包まれた印象を与える効果がある。これ
により、より立体的な音像を作り出すことができる。The center speaker 102 is arranged in the center of the screen 101 side and outputs a reproduced sound based on audio data of the center channel, and outputs the most important reproduced sound such as an actor's dialogue. The subwoofer speaker 103 outputs a reproduced sound based on audio data of the subwoofer channel, and effectively outputs a sound that is felt as vibration rather than a low frequency sound such as an explosion sound, and an explosion scene. Is often used effectively. The left speaker 106 and the right speaker 107 are arranged on the left and right sides of the screen 101, and output a reproduced sound by the left channel audio data and a reproduced sound by the right channel audio data, and exhibit a stereo sound effect. The left center speaker 104 and the right center speaker 105 are arranged between the center speaker 102 and the left speaker 106 and the right speaker 107, and are reproduced by the left center channel audio data and reproduced by the right center channel audio data. It outputs sound and plays an auxiliary role for the left speaker 106 and the right speaker 107, respectively. Particularly in a movie theater or the like where the screen 101 is large and the number of persons accommodated is large, the localization of the sound image tends to become unstable depending on the position of the seat.
4 and the light center speaker 107 are added, it is effective in creating a more realistic localization of the sound image. Further, the surround left speaker 108 and the surround right speaker 109 are arranged so as to surround the spectators' seats, and output a reproduced sound by the audio data of the surround left channel and a reproduced sound by the audio data of the surround right channel. It has the effect of giving the impression of being wrapped in clapping and cheering. As a result, a more stereoscopic sound image can be created.

【００２８】図１に戻って、上述したような８チャネル
のディジタルオーディオ信号を扱う図１の伝送装置は、
上記８チャネルのディジタルオーディオ信号を得るため
に、各チャネルに対応するマイクロホン２０１〜２０８
からのアナログオーディオ信号をそれぞれ対応するＡ／
Ｄ変換器２０９〜２１６によってディジタルオーディオ
信号に変換する。Returning to FIG. 1, the transmission apparatus of FIG. 1 which handles the 8-channel digital audio signal as described above,
In order to obtain the above 8-channel digital audio signals, the microphones 201 to 208 corresponding to the respective channels are obtained.
Corresponding analog audio signals from A /
The D converters 209 to 216 convert the digital audio signals.

【００２９】各Ａ／Ｄ変換器２０９〜２１６からの各チ
ャネルのディジタルオーディオ信号は、それぞれ対応す
る圧縮符号化回路２１７〜２２４に送られて各々圧縮符
号化される。なお、これら各圧縮符号化回路２１７〜２
２４の具体的構成については後述する。The digital audio signals of the respective channels from the respective A / D converters 209 to 216 are sent to the corresponding compression encoding circuits 217 to 224 and compressed and encoded respectively. Note that each of these compression encoding circuits 217-2
The specific configuration of 24 will be described later.

【００３０】上記高聴感圧縮符号化回路２１７〜２２２
からの圧縮符号化されたオーディオデータは、それぞれ
対応する出力端子２３３〜２３８から後段の構成に送ら
れる。The above-mentioned high-audibility compression encoding circuits 217 to 222.
The compressed and encoded audio data from are sent to the subsequent stage from the corresponding output terminals 233 to 238.

【００３１】一方、上記高率圧縮符号化回路２２３，２
２４からの圧縮符号化されたオーディオデータは、それ
ぞれハイパスフィルタ２２５，２２６によって高域成分
のみが取り出された後、加算器２２７によって加算され
る。また、上記高率圧縮符号化回路２２３，２２４から
の圧縮符号化されたオーディオデータは、それぞれロー
パスフィルタ２２８，２２９にも送られる。上記加算器
２２７からのデータは、加算器２３０によって上記サラ
ウンドレフトチャネル側に対応するローパスフィルタ２
２８の出力と加算され、出力端子２３１から出力され、
上記サラウンドライトチャネルに対応するローパスフィ
ルタ２２９の出力は出力端子２３２から出力される。On the other hand, the high-rate compression coding circuits 223, 2
The compression-encoded audio data from 24 is added by the adder 227 after only the high frequency components are extracted by the high pass filters 225 and 226, respectively. The compression-encoded audio data from the high-rate compression encoding circuits 223 and 224 are also sent to low-pass filters 228 and 229, respectively. The data from the adder 227 is supplied to the low pass filter 2 corresponding to the surround left channel side by the adder 230.
28 output is added and output from the output terminal 231.
The output of the low pass filter 229 corresponding to the surround right channel is output from the output terminal 232.

【００３２】ここで、上記サラウンドレフトチャネルと
サラウンドライトチャネルにおいて上記ハイパスフィル
タ２２５，２２６からの出力を加算器２２７によって加
算し、当該加算器２２７の加算出力を、更に加算器２２
８によってサラウンドレフトチャネルのローパスフィル
タ２２８出力に加算するのは以下の理由による。Here, in the surround left channel and the surround right channel, the outputs from the high pass filters 225 and 226 are added by the adder 227, and the added output of the adder 227 is further added.
The reason for adding to the output of the low-pass filter 228 of the surround left channel by 8 is as follows.

【００３３】すなわち、人間の耳は高域の成分に対する
定位感が少なく、このため高域成分については例えば複
数個のスピーカのうちのいずれか１つのスピーカからで
ていても人間にはどのスピーカから出てきているのか聞
き取り難いという性質がある。このため、複数チャネル
のオーディオ信号の各高域成分を、そのうちの例えば１
チャネルに対応するスピーカのみに送るようにしても、
人間には各チャネルのオーディオ信号の高域成分が当該
１つのスピーカのみから出力されているとは感じられな
い。したがって、本実施例では、上述のように、上記８
チャネルのうちの例えばサラウンドレフトチャネルとサ
ラウンドライトチャネルの音声の高域成分を加算して、
例えばサラウンドレフトチャネルの低域成分の音声に加
えるようにしている。これにより、サラウンドレフトチ
ャネルとサラウンドライトチャネルの２つのチャネルの
高域成分を１つのチャネル分に圧縮できることになる。
なお、このような高域成分におけるチャネル間のクロス
トーク処理は、図１の例のような上記サラウンドレフト
チャネルとサラウンドライトチャネル間のみならず、図
示は省略しているが他のチャネル間（例えば全チャネ
ル）についても同様に行うことができる。このように全
チャネルについて高域成分のチャネルクロストーク処理
を行うことで更に圧縮率を上げることが可能となる。That is, the human ear has less sense of localization for high-frequency components. Therefore, for high-frequency components, no matter which speaker is selected from among a plurality of speakers, human beings recognize which speaker. It is difficult to hear if it is coming out. Therefore, each high frequency component of the audio signals of a plurality of channels is
Even if you send only to the speaker corresponding to the channel,
It is not perceived by humans that the high frequency component of the audio signal of each channel is output from only the one speaker. Therefore, in this embodiment, as described above,
Of the channels, for example, add the high frequency components of the surround left channel and surround right channel audio,
For example, it is added to the sound of the low frequency component of the surround left channel. As a result, the high frequency components of the two channels, the surround left channel and the surround right channel, can be compressed into one channel.
The crosstalk processing between channels in such a high frequency component is performed not only between the surround left channel and the surround right channel as in the example of FIG. 1 but also between other channels (not shown) (for example, The same can be done for all channels). In this way, it is possible to further increase the compression rate by performing channel crosstalk processing of high frequency components for all channels.

【００３４】上述した図１の各出力端子２３３〜２３８
及び２３１，２３２からの各チャネルの圧縮符号化デー
タは、図３のそれぞれ対応する端子２３３〜２３８及び
２３１，２３２を介してマルチプレクサ２４１に送られ
てマルチプレクスされる。当該マルチプレクサ２４１の
出力は出力端子２４２を介して伝送路に伝送若しくは上
述した本発明の映画フィルムやディスク状記録媒体，テ
ープ状記録媒体などに記録される。The output terminals 233 to 238 shown in FIG.
, 231 and 232 of each channel are sent to the multiplexer 241 via the corresponding terminals 233 to 238 and 231, 232 of FIG. 3, respectively, and are multiplexed. The output of the multiplexer 241 is transmitted to the transmission line via the output terminal 242 or recorded on the above-mentioned movie film of the present invention, the disc-shaped recording medium, the tape-shaped recording medium, or the like.

【００３５】次に、上記図１の各高聴感圧縮符号化回路
２１７〜２２２について具体的に説明する。Next, the high-hearing-compression compression encoding circuits 217 to 222 shown in FIG. 1 will be specifically described.

【００３６】先ず、図４には、上記各高聴感圧縮符号化
回路２１７〜２２２の一具体例の構成を示す。First, FIG. 4 shows a configuration of a specific example of each of the above-described high-audibility compression encoding circuits 217 to 222.

【００３７】この図４に示す圧縮符号化回路では、入力
ディジタル信号をフィルタなどにより複数の周波数帯域
に分割すると共に、各周波数帯域毎に直交変換を行っ
て、得られた周波数軸のスペクトルデータを、後述する
人間の聴覚特性を考慮したいわゆる臨界帯域幅（クリテ
ィカルバンド）毎に適応的にビット配分して符号化して
いる。この時、高域では臨界帯域幅を更に分割した帯域
を用いる。もちろんフィルタなどによる非ブロッキング
の周波数分割幅は等分割幅としてもよい。さらに、本実
施例においては、直交変換の前に入力信号に応じて適応
的にブロックサイズ（ブロック長）を変化させると共
に、クリティカルバンド単位もしくは高域では臨界帯域
幅（クリティカルバンド）を更に細分化したブロックで
フローティング処理を行っている。In the compression coding circuit shown in FIG. 4, the input digital signal is divided into a plurality of frequency bands by a filter and the orthogonal transformation is performed for each frequency band to obtain the spectrum data of the frequency axis. The bits are adaptively allocated and encoded for each so-called critical band (critical band) in consideration of human auditory characteristics described later. At this time, a band obtained by further dividing the critical bandwidth is used in the high band. Of course, the non-blocking frequency division width by a filter or the like may be an equal division width. Further, in this embodiment, the block size (block length) is adaptively changed according to the input signal before the orthogonal transformation, and the critical bandwidth (critical band) is further subdivided in the critical band unit or in the high range. Floating process is performed in the block.

【００３８】なお、上記帯域分割用フィルタとしては、
例えばＱＭＦ等のフィルタがあり、これは1976 R.E.Cro
chiere Digital coding of speech in subbands Bell
Syst.Tech. J. Vol.55, No.8 1976に、述べられてい
る。同じくICASSP 83, BOSTONPolyphase Quadrature f
ilters-A new subband coding technique Joseph H.Ro
thweilerには等バンド幅のフィルタ分割手法及び装置が
述べられている。As the band division filter,
For example, there is a filter such as QMF, which is 1976 RECro
chiere Digital coding of speech in subbands Bell
Syst.Tech. J. Vol.55, No.8 1976. Also ICASSP 83, BOSTONPolyphase Quadrature f
ilters-A new subband coding technique Joseph H.Ro
Thweiler describes an equal bandwidth filter partitioning method and apparatus.

【００３９】さらに、上記クリティカルバンドとは、人
間の聴覚特性を考慮して分割された周波数帯域であり、
ある純音の周波数近傍の同じ強さの狭帯域バンドノイズ
によって当該純音がマスクされるときのそのノイズの持
つ帯域のことである。このクリティカルバンドは、高域
ほど帯域幅が広くなっており、上記０〜２２ｋＨｚの全
周波数帯域は例えば２５のクリティカルバンドに分割さ
れている。Further, the critical band is a frequency band divided in consideration of human auditory characteristics,
It is a band of a certain pure tone when the pure tone is masked by a narrow band noise having the same strength near the frequency of the pure tone. This critical band has a wider bandwidth as it goes higher, and the entire frequency band of 0 to 22 kHz is divided into, for example, 25 critical bands.

【００４０】すなわち、図４において、入力端子１０に
は例えば０〜２２ｋＨｚのオーディオＰＣＭ信号が供給
されている。この入力信号は、例えばいわゆるＱＭＦ等
の帯域分割フィルタ１１により０〜１１ｋＨｚ帯域と１
１ｋ〜２２ｋＨｚ帯域とに分割され、０〜１１ｋＨｚ帯
域の信号は同じくいわゆるＱＭＦ等の帯域分割フィルタ
１２により０〜５．５ｋＨｚ帯域と５．５ｋ〜１１ｋＨ
ｚ帯域とに分割される。帯域分割フィルタ１１からの１
１ｋ〜２２ｋＨｚ帯域の信号は、直交変換回路の一例で
あるＭＤＣＴ（Modified Discrete Cosine Transform）
回路１３に送られ、帯域分割フィルタ１２からの５．５
ｋ〜１１ｋＨｚ帯域の信号はＭＤＣＴ回路１４に送ら
れ、帯域分割フィルタ１２からの０〜５．５ｋＨｚ帯域
の信号はＭＤＣＴ回路１５に送られることにより、それ
ぞれＭＤＣＴ処理される。That is, in FIG. 4, an audio PCM signal of 0 to 22 kHz, for example, is supplied to the input terminal 10. This input signal is divided into 0 to 11 kHz band and 1 by a band division filter 11 such as so-called QMF.
The signal of 0 to 11 kHz band is divided into the 1 to 22 kHz band and the 0 to 5.5 kHz band and 5.5 to 11 kHz by the band dividing filter 12 such as so-called QMF.
and z-band. 1 from the band division filter 11
A signal in the 1 kHz to 22 kHz band is an MDCT (Modified Discrete Cosine Transform) that is an example of an orthogonal transformation circuit.
5.5 sent from the band division filter 12 to the circuit 13.
A signal in the k to 11 kHz band is sent to the MDCT circuit 14, and a signal in the 0 to 5.5 kHz band from the band division filter 12 is sent to the MDCT circuit 15 to be MDCT processed.

【００４１】各ＭＤＣＴ回路１３、１４、１５では、各
帯域毎に設けたブロック決定回路１９、２０、２１によ
り後述するように決定されたブロックサイズに基づいて
ＭＤＣＴ処理がなされる。In each of the MDCT circuits 13, 14 and 15, MDCT processing is performed based on the block size determined by the block determining circuits 19, 20 and 21 provided for each band as described later.

【００４２】なお、各ＭＤＣＴ回路１３、１４、１５に
おける上記ＭＤＣＴ処理については、ICASSP 1987 Sub
band/Transform Coding Using Filter Bank Designs B
asedon Time Domain Aliasing Cancellation J.P.Princ
en A.B.Bradley Univ. ofSurrey Royal Melbourne Ins
t.of Tech.に述べられている。The MDCT processing in each MDCT circuit 13, 14, 15 is described in ICASSP 1987 Sub.
band / Transform Coding Using Filter Bank Designs B
asedon Time Domain Aliasing Cancellation JPPrinc
en ABBradley Univ. of Surrey Royal Melbourne Ins
t.of Tech.

【００４３】上記ブロック決定回路１９、２０、２１で
決定されたブロックサイズを示す情報は、後述の適応ビ
ット配分符号化回路１６、１７、１８に送られると共
に、出力端子２３、２５、２７から出力される。The information indicating the block size decided by the block decision circuits 19, 20, 21 is sent to adaptive bit allocation coding circuits 16, 17, 18 which will be described later, and output from the output terminals 23, 25, 27. To be done.

【００４４】ここで、上記適応ビット配分符号化回路１
６、１７、１８においては、各ＭＤＣＴ回路１３、１
４、１５の出力から、上記臨界帯域（クリティカルバン
ド）または高域では更にクリティカルバンドを分割した
帯域毎のエネルギが、例えば当該バンド内での各振幅値
の２乗平均の平方根を計算すること等により求められ、
この計算結果に基づいてビット配分が行われる。もちろ
ん、上記スケールファクタそのものを以後のビット配分
の為に用いるようにしてもよい。この場合には新たなエ
ネルギ計算の演算が不要となるため、ハード規模の節約
となる。また、各バンド毎のエネルギの代わりに、振幅
値のピーク値、平均値等を用いることも可能である。な
お、各ＭＤＣＴ回路１３、１４、１５にてＭＤＣＴ処理
されて得られた周波数領域のスペクトルデータあるいは
ＭＤＣＴ係数データは、いわゆる臨界帯域（クリティカ
ルバンド）または高域では更にクリティカルバンドを分
割した帯域毎にまとめられて当該適応ビット配分符号化
回路１６、１７、１８に送られている。Here, the adaptive bit allocation encoding circuit 1
6, 17 and 18, the MDCT circuits 13 and 1
From the outputs of 4 and 15, the energy of each critical band (critical band) or a band obtained by further dividing the critical band in the high band is calculated, for example, the root mean square of each amplitude value in the band is calculated. Sought by
Bit allocation is performed based on the calculation result. Of course, the scale factor itself may be used for subsequent bit allocation. In this case, the calculation of new energy is not required, and the hardware scale is saved. Further, instead of the energy for each band, it is also possible to use the peak value, the average value, etc. of the amplitude values. The spectrum data in the frequency domain or the MDCT coefficient data obtained by the MDCT processing in each MDCT circuit 13, 14, 15 is a so-called critical band (critical band) or a band in which a critical band is further divided in a high band. The data are collected and sent to the adaptive bit allocation coding circuits 16, 17, and 18.

【００４５】さらに、上記適応ビット配分符号化回路１
６、１７、１８では、上記ブロックサイズの情報、及び
臨界帯域（クリティカルバンド）または高域では更にク
リティカルバンドを分割した帯域毎に割り当てられたビ
ット数に応じて各スペクトルデータ（あるいはＭＤＣＴ
係数データ）を再量子化（正規化して量子化）するよう
にしている。適応ビット配分符号化回路１６、１７、１
８で符号化されたデータは、出力端子２２、２４、２６
を介して取り出される。また、当該適応ビット配分符号
化回路１６、１７、１８では、どのような信号の大きさ
に関する正規化がなされたかを示すスケールファクタ
と、どのようなビット長で量子化がされたかを示すビッ
ト長情報も求めており、これらも同時に出力端子２２、
２４、２６から出力される。Further, the adaptive bit allocation encoding circuit 1
6, 17, and 18 each spectrum data (or MDCT) according to the block size information and the number of bits assigned to each band obtained by further dividing the critical band (critical band) or the critical band in the high band.
The coefficient data) is requantized (normalized and quantized). Adaptive bit allocation coding circuits 16, 17, 1
The data encoded in 8 are output terminals 22, 24, 26
Taken out through. Further, in the adaptive bit allocation encoding circuits 16, 17, and 18, a scale factor indicating what kind of signal magnitude is normalized and a bit length indicating what kind of bit length is quantized. Information is also sought, and these are also output terminals 22,
It is output from 24 and 26.

【００４６】これら各出力端子２２〜２７からのデータ
はまとめられて上記各高聴感圧縮符号化回路２１７〜２
２２の出力となる。The data from the output terminals 22 to 27 are put together and the high-audibility compression encoding circuits 217 to 2 are combined.
22 output.

【００４７】ところで、上記図１及び図４の例では、各
高聴感圧縮符号化回路２１７〜２２２においては各チャ
ネル毎にビット配分を行って圧縮符号化を行っている
が、これら各高聴感圧縮符号化回路２１７〜２２２間で
ビット配分を行う（すなわち各回路２１７〜２２２に対
応するチャネル間でビット配分を行う）ことも可能であ
る。In the examples of FIGS. 1 and 4, the high-audibility compression coding circuits 217 to 222 allocate bits to each channel to perform compression coding. It is also possible to perform bit allocation among the encoding circuits 217 to 222 (that is, perform bit allocation among channels corresponding to the circuits 217 to 222).

【００４８】この高聴感圧縮符号化回路２１７〜２２２
間でビット配分を行う場合の各回路構成について、以下
に説明する。図５には、チャネル間でビット配分を行う
高聴感圧縮符号化回路の構成を示す。なお、この図５の
構成のうち、適応ビット配分符号化回路１６，１７，１
８を除く他の構成要素は図４の対応する構成要素と基本
的には同一のものである。The high-audibility compression encoding circuits 217 to 222
Each circuit configuration in the case of performing bit allocation between channels will be described below. FIG. 5 shows the configuration of a high-audibility compression encoding circuit that allocates bits among channels. In the configuration of FIG. 5, adaptive bit allocation coding circuits 16, 17, 1
Other components except 8 are basically the same as the corresponding components in FIG.

【００４９】この図５に示される圧縮符号化回路におい
て、図４同様のブロック決定回路１９、２０、２１によ
り決定される各ＭＤＣＴ回路１３、１４、１５でのブロ
ックサイズの具体例を図６のＡ及びＢに示す。なお、図
６のＡには直交変換ブロックサイズが長い場合（ロング
モードにおける直交変換ブロックサイズ）を、図６のＢ
には直交変換ブロックサイズが短い場合（ショートモー
ドにおける直交変換ブロックサイズ）を示ししている。
この図６の具体例においては、３つのフィルタ出力は、
それぞれ２つの直交変換ブロックサイズを持つ。すなわ
ち、低域側の０〜５．５ｋＨｚ帯域の信号及び中域の
５．５ｋ〜１１ｋＨｚ帯域の信号に対しては、長いブロ
ック長の場合（図６のＡ）は１ブロック内のサンプル数
を１２８サンプルとし、短いブロックが選ばれた場合
（図６のＢ）には１ブロック内のサンプル数を３２サン
プル毎のブロックとしている。これに対して高域側の１
１ｋ〜２２ｋＨｚ帯域の信号に対しては、長いブロック
長の場合（図６のＡ）は１ブロック内のサンプル数を２
５６サンプルとし、短いブロックが選ばれた場合（図６
のＢ）には１ブロック内のサンプル数を３２サンプル毎
のブロックとしている。このようにして短いブロックが
選ばれた場合には各帯域の直交変換ブロックのサンプル
数を同じとして高域程時間分解能を上げ、なおかつブロ
ック化に使用するウインドウの種類を減らしている。な
お、図５の具体例のブロック決定回路１９、２０、２１
で決定されたブロックサイズを示す情報は、後述の適応
ビット配分符号化回路１６、１７、１８に送られると共
に、出力端子２３、２５、２７から出力される。In the compression encoding circuit shown in FIG. 5, a concrete example of the block size in each MDCT circuit 13, 14, 15 determined by the block determination circuits 19, 20, 21 similar to that in FIG. 4 is shown in FIG. Shown in A and B. 6A shows a case where the orthogonal transform block size is long (orthogonal transform block size in the long mode), FIG.
Shows the case where the orthogonal transform block size is short (orthogonal transform block size in the short mode).
In the specific example of FIG. 6, the three filter outputs are
Each has two orthogonal transform block sizes. That is, for a signal in the low band of 0 to 5.5 kHz and a signal in the mid band of 5.5 to 11 kHz, the number of samples in one block is set to a long block length (A in FIG. 6). When 128 blocks are selected and a short block is selected (B in FIG. 6), the number of samples in one block is a block for every 32 samples. On the other hand, the high side 1
For a signal in the 1 kHz to 22 kHz band, if the block length is long (A in FIG. 6), the number of samples in one block is 2
56 samples and a short block is selected (Fig. 6
In B), the number of samples in one block is a block for every 32 samples. When a short block is selected in this way, the number of samples of orthogonal transform blocks in each band is set to be the same, the time resolution is increased in the higher frequency range, and the number of windows used for blocking is reduced. In addition, the block determination circuits 19, 20, and 21 of the specific example of FIG.
The information indicating the block size determined in step 1 is sent to the adaptive bit allocation coding circuits 16, 17 and 18, which will be described later, and is also output from the output terminals 23, 25 and 27.

【００５０】この図５の具体例の適応ビット配分符号化
回路１６、１７、１８では、上記ブロックサイズの情
報、及び臨界帯域（クリティカルバンド）または高域で
は更にクリティカルバンドを分割した帯域毎に割り当て
られたビット数に応じて各スペクトルデータ（あるいは
ＭＤＣＴ係数データ）を再量子化（正規化して量子化）
するようにしている。この時、適応ビット配分符号化回
路１６、１７、１８では、各チャネル間でのチャネルビ
ット配分、すなわち各チャネルの信号全体を見ることに
より、チャネル毎の使用ビット量を適応的に最適に振り
分けるビット配分を同時に行う。この場合の当該チャネ
ルビット配分は、後述する適応ビット配分回路から端子
２８を介して供給されたチャネルビット配分信号に基づ
いて行われる。このようにして符号化されたデータは、
出力端子２２、２４、２６を介して取り出される。ま
た、当該適応ビット配分符号化回路１６、１７、１８で
は、どのような信号の大きさに関する正規化がなされた
かを示すスケールファクタと、どのようなビット長で量
子化がされたかを示すビット長情報も求めており、これ
らも同時に出力端子２２、２４、２６から出力される。In the adaptive bit allocation coding circuits 16, 17, and 18 of the specific example of FIG. 5, the above block size information and the critical band (critical band) or in the high frequency band, the critical band is further divided and allocated. Re-quantization (normalized and quantized) of each spectrum data (or MDCT coefficient data) according to the number of bits
I am trying to do it. At this time, the adaptive bit allocation coding circuits 16, 17, and 18 adaptively and optimally allocate the used bit amount for each channel by observing the channel bit allocation among the channels, that is, the entire signal of each channel. Allocate simultaneously. The channel bit allocation in this case is performed based on the channel bit allocation signal supplied from the adaptive bit allocation circuit described later through the terminal 28. The data encoded in this way is
It is taken out through the output terminals 22, 24 and 26. Further, in the adaptive bit allocation encoding circuits 16, 17, and 18, a scale factor indicating what kind of signal magnitude is normalized and a bit length indicating what kind of bit length is quantized. Information is also sought, and these are also output from the output terminals 22, 24 and 26 at the same time.

【００５１】次に、上記ビット配分を行うための適応ビ
ット配分回路の具体的な構成及び動作を図７を用いて説
明する。なお、この図７の例では、図１に対応して前記
８チャネルのうちの６チャネルについてのビット配分に
対応している。すなわち、聴感上影響の高い信号からな
るチャネルであるレフトチャネルとレフトセンタチャネ
ルとセンタチャネルとサブウーファチャネルとライトセ
ンタチャネルとライトチャネルの６チャネルに対応して
いる。Next, the specific configuration and operation of the adaptive bit allocation circuit for performing the above bit allocation will be described with reference to FIG. The example of FIG. 7 corresponds to the bit allocation for 6 channels out of the 8 channels corresponding to FIG. That is, it corresponds to 6 channels of a left channel, a left center channel, a center channel, a subwoofer channel, a right center channel and a right channel, which are channels composed of signals that have a great influence on the auditory sense.

【００５２】この図７において、各チャネルの共通部に
ついて例えばレフトチャネル（Ｌｃｈ）を用いて説明す
る（他のチャネルについては同一の指示符号を付して説
明は省略している）と、レフトチャネルの入力情報信号
は当該レフトチャネル用の入力端子３１に与えられる。
なお、この端子３１は、図５の端子２９と対応してい
る。この入力情報信号はマッピング回路（Mapping)３２
により時間領域の信号から周波数領域に展開される。こ
こで、フィルタによる場合には、サブバンド信号として
時間領域サンプルが得られることになり、直交変換出力
の場合及びフィルタリング後に直交変換を行う場合には
周波数領域サンプルが得られることになる。In FIG. 7, the common part of each channel will be described using, for example, a left channel (Lch) (the other channels are denoted by the same reference numerals and the description thereof will be omitted). Input information signal is applied to the input terminal 31 for the left channel.
The terminal 31 corresponds to the terminal 29 of FIG. This input information signal is a mapping circuit (Mapping) 32.
Is expanded from the time domain signal to the frequency domain. Here, in the case of using a filter, time domain samples are obtained as subband signals, and in the case of orthogonal transform output and in the case of performing orthogonal transform after filtering, frequency domain samples are obtained.

【００５３】これらのサンプルは、ブロッキング(Block
ing)回路３３によって複数サンプル毎にまとめられる。
ここで、フィルタによる場合には時間領域の複数サンプ
ルがまとめられることになり、直交変換出力の場合及び
フィルタリング後に直交変換を行う場合には周波数領域
の複数サンプルがまとめられることになる。These samples are
ing) circuit 33 collects a plurality of samples.
Here, a plurality of samples in the time domain are combined when using a filter, and a plurality of samples in the frequency domain are combined when an orthogonal transform output and when orthogonal transform is performed after filtering.

【００５４】また、本具体例では、マッピングの途中の
ＭＤＣＴ入力時間領域信号の時間変化を時間変化算出回
路３４により算出する。Further, in this example, the time change calculation circuit 34 calculates the time change of the MDCT input time domain signal during the mapping.

【００５５】上記ブロッキング回路３３により複数のサ
ンプル毎にまとめられた各サンプルは正規化回路３７で
正規化される。ここで、正規化のための係数であるスケ
ールファクタは、スケールファクタ算出回路３５によっ
て得られる。同時にトーナリティの大きさがトーナリテ
ィ算出回路３６で算出される。Each sample collected by the blocking circuit 33 into a plurality of samples is normalized by a normalizing circuit 37. Here, the scale factor, which is a coefficient for normalization, is obtained by the scale factor calculation circuit 35. At the same time, the tonality magnitude is calculated by the tonality calculation circuit 36.

【００５６】以上で求められるパラメータは、ビット配
分回路３８でビット配分のために使用される。ここで、
ＭＤＣＴ係数を表現して伝送又は記録に使えるビット数
を、全チャネル（上記６チャネル）で８００Ｋｂｐｓと
すると、本具体例のビット配分回路３８では、チャネル
ビット配分を含む第１のビット配分（第１のビット配分
量）と、チャネルビット配分を含まない第２のビット配
分（第２のビット配分量）の２つを求める。The parameters calculated as described above are used by the bit allocation circuit 38 for bit allocation. here,
Assuming that the number of bits that can be used for transmission or recording by expressing MDCT coefficients is 800 Kbps for all channels (the above-mentioned 6 channels), the bit allocation circuit 38 of this specific example uses a first bit allocation (first bit allocation) including channel bit allocation (first bit allocation). Of the channel bit allocation) and a second bit allocation not including the channel bit allocation (second bit allocation amount).

【００５７】先ず、チャネルビット配分を含む第１のビ
ット配分の配分手法について説明する。ここではスケー
ルファクタの周波数領域の分布をみて適応的にビット配
分を行う。First, the allocation method of the first bit allocation including the channel bit allocation will be described. Here, bit distribution is adaptively performed by observing the distribution of the scale factor in the frequency domain.

【００５８】この場合、全チャネルのスケールファクタ
の周波数領域の分布をみてチャネル間でのビット配分を
行うことで有効なビット配分を行うことができる。この
とき、複数チャネルの信号情報が、スピーカの場合のよ
うに同一音場のなかで混合されて左右の耳に達する場合
を考えると、全チャネル信号の加算されたものでマスキ
ングが作用すると考えてよいから、図８のＡ，Ｆに示す
ように、同一帯域において各チャネルが同一のノイスレ
ベルになるようにビット配分を行うことが有効である。
このための一方法としてはスケールファクタ指標の大き
さに比例したビット配分を行えばよい。すなわち、以下
の式によってビット配分を行う。In this case, effective bit allocation can be performed by observing the distribution of the scale factors of all channels in the frequency domain and allocating bits among the channels. At this time, considering that the signal information of multiple channels reaches the left and right ears after being mixed in the same sound field as in the case of a speaker, it is thought that masking works with the sum of all channel signals. Therefore, as shown in A and F of FIG. 8, it is effective to perform bit allocation so that each channel has the same noise level in the same band.
One method for this purpose is to perform bit allocation proportional to the size of the scale factor index. That is, bit allocation is performed by the following formula.

【００５９】Ｂｍ＝Ｂ＊（ΣＳＦｎ）／ＳＳ＝Σ（ΣＳＦｎ）Bm = B * (ΣSFn) / S S = Σ (ΣSFn)

【００６０】ここで、Ｂｍは各チャネルへのビット配分
量、Ｂは全チャネルへのビット配分量、ＳＦｎはスケー
ルファクタ指標であり概略ピーク値の対数に対応してい
る。ｎは各チャネル内のブロックフローティングバンド
番号、ｍはチャネル番号、Ｓは全チャネルのスケールフ
ァクタ指標の和である。なお、図８には、レフトチャネ
ルとライトチャネルのみ示し他の４チャネルについては
図示を省略している。Here, Bm is a bit allocation amount to each channel, B is a bit allocation amount to all channels, SFn is a scale factor index, and corresponds to the logarithm of the approximate peak value. n is the block floating band number in each channel, m is the channel number, and S is the sum of the scale factor indexes of all channels. In FIG. 8, only the left channel and the right channel are shown and the other four channels are not shown.

【００６１】以上に加えて、ビット配分回路３８は、各
チャネルの信号の時間変化特性を検出して、この指標に
よってチャネル毎のビット配分量を変えるプロセスを持
つ。この時間変化を表す指標は次のようにして求められ
る。In addition to the above, the bit allocation circuit 38 has a process of detecting the time change characteristic of the signal of each channel and changing the bit allocation amount for each channel by this index. The index showing this time change is obtained as follows.

【００６２】図９のＡ〜Ｆに示すように、チャネルが６
チャネルあるとすると、それぞれのチャネルの情報入力
信号についてビット配分の時間単位であるビット配分時
間ブロックを時間的に４分割し、それぞれの時間ブロッ
ク（サブブロック）のピーク値を得る。そして各サブブ
ロックのピーク値が小から大へと変わるところの差分の
大きさに応じてチャネル間でビットを分け合う。ここ
で、このビット配分のために６チャネル合計でＣビット
使えるとしたとき、各チャネルの各サブブロックのピー
ク値が小から大へと変わるところの差分の大きさがそれ
ぞれａ，ｂ，ｃ，ｄ，ｅ，ｆデシベル（ｄＢ）とする
と、それぞれＣ＊ａ／Ｔ，Ｃ＊ｂ／Ｔ，・・・・・，Ｃ
＊ｆ／Ｔビット（ビット）と配分することができる。こ
こで、Ｔ＝ａ＋ｂ＋ｃ＋ｄ＋ｅ＋ｆである。信号情報が
大きくなる程度が大であるほどそのチャネルに対しての
ビット配分量が大きくなる。なお、図９には、レフトチ
ャネルとレフトセンタチャネルとライトチャネルのみ示
し他の３つのチャネルについては図示を省略している。As shown in FIG. 9A to FIG.
Assuming that there is a channel, the bit allocation time block, which is the time unit of bit allocation for the information input signal of each channel, is temporally divided into four, and the peak value of each time block (sub-block) is obtained. Then, bits are shared among channels according to the magnitude of the difference where the peak value of each sub-block changes from small to large. Here, assuming that C bits can be used in a total of 6 channels for this bit allocation, the magnitudes of the differences at which the peak value of each sub-block of each channel changes from small to large are a, b, c, respectively. If d, e, and f decibel (dB), then C * a / T, C * b / T, ..., C, respectively.
* F / T bits (bits) can be allocated. Here, T = a + b + c + d + e + f. The larger the signal information becomes, the larger the bit allocation amount for the channel becomes. In FIG. 9, only the left channel, the left center channel and the right channel are shown and the other three channels are not shown.

【００６３】次に、チャネルビット配分を含まない第２
のビット配分の配分手法について説明する。ここでは、
チャネルビット配分を含まない第２のビット配分の手法
として更に２つのビット配分からなるビット配分手法に
ついて説明する。なお，この第２のビット配分は、前記
図４における適応ビット配分符号化回路でのビット配分
処理に対応している。Next, the second not including channel bit allocation
A method of allocating the bit will be described. here,
As a second bit allocation method not including channel bit allocation, a bit allocation method consisting of two bit allocations will be described. The second bit allocation corresponds to the bit allocation processing in the adaptive bit allocation encoding circuit in FIG.

【００６４】この２つのビット配分をそれぞれビット配
分(1) とビット配分(2) とする。以下のビット配分では
各チャネルで使用できるビットレートは事前にそれぞれ
のチャネルで固定的に決めておく。例えば、６チャネル
の内で音声など重要部分を担う２つのチャネルには１４
７ｋｂｐｓという比較的大きいビットを使い、サブウー
ハチャネルには高々２ｋｂｐｓ、それ以外のチャネルに
は１００ｋｂｐｓを割り当てておく。These two bit allocations are referred to as bit allocation (1) and bit allocation (2), respectively. In the following bit allocation, the bit rate that can be used in each channel is fixedly determined in advance for each channel. For example, of the 6 channels, 2 channels that play an important part such as voice have 14 channels.
A relatively large bit of 7 kbps is used, 2 kbps is assigned to the subwoofer channel at most, and 100 kbps is assigned to the other channels.

【００６５】先ず、ビット配分(1) に使うべきビット量
を確定する。そのためには、信号情報（ａ）のスペクト
ル情報のうちトーナリティ情報及び信号情報（ｂ）の時
間変化情報を使用する。First, the bit amount to be used for the bit allocation (1) is decided. For that purpose, the tonality information and the time change information of the signal information (b) of the spectrum information of the signal information (a) are used.

【００６６】ここで、トーナリティ情報について説明す
ると、指標としては、信号スペクトルの隣接値間の差の
絶対値の和を、信号スペクトル数で割った値を、指標と
して用いている。より簡単にはブロックフローティング
の為のブロックごとのスケールファクタの、隣接スケー
ルファクタ指標の間の差の平均値を用いる。スケールフ
ァクタ指標は、概略スケールファクタの対数値に対応し
ている。本実施例では、ビット配分(1) に使うべきビッ
ト量をこのトーナリティを表す値に対応させて最大８０
ｋｂｐｓ、最小１０ｋｂｐｓと設定している。ここでは
簡単のために、全チャネルそれぞれの割当を等しく１０
０ｋｂｐｓとしている。Here, the tonality information will be described. As the index, a value obtained by dividing the sum of absolute values of differences between adjacent values of the signal spectrum by the number of signal spectra is used as the index. More simply, the average value of the difference between adjacent scale factor indexes of the scale factor for each block for block floating is used. The scale factor index corresponds to the logarithmic value of the rough scale factor. In the present embodiment, the maximum amount of bits to be used for bit allocation (1) is 80, corresponding to the value representing this tonality.
kbps, and the minimum is set to 10 kbps. Here, for simplification, the allocation of all channels is equalized to 10
It is set to 0 kbps.

【００６７】トーナリティ計算は次式のように行う。The tonality calculation is performed by the following equation.

【００６８】Ｔ＝（１／ＷＬｍａｘ）（ΣＡＢＳ（ＳＦｎ−１））T = (1 / WLmax) (ΣABS (SFn−1))

【００６９】なお、ＷＬｍａｘはワードレングス最大値
＝１６、ＳＦｎはスケールファクタ指標で概略ピーク値
の対数に対応している。ｎはブロックフローティングバ
ンド番号である。WLmax is a word length maximum value = 16, and SFn is a scale factor index and corresponds to the logarithm of the approximate peak value. n is a block floating band number.

【００７０】このようにして求められたトーナリティ情
報Ｔとビット配分(1)のビット配分量とは、図１０に示
すように対応付けられる。The tonality information T thus obtained is associated with the bit allocation amount of the bit allocation (1) as shown in FIG.

【００７１】これと共に本実施例においては、ビット配
分(1) とそれに付加するその他の少なくとも１つのビッ
ト配分との分割率は、情報信号の時間変化特性に依存す
る。本具体例では、直交変換時間ブロックサイズを更に
分割した時間区間毎に信号情報のピーク値を隣接ブロッ
ク毎に比較することにより情報信号の振幅が急激に大き
くなる時間領域を検出してその大きくなるときの状態の
程度により分割率を決定する。In addition to this, in the present embodiment, the division ratio between the bit allocation (1) and at least one other bit allocation added thereto depends on the time change characteristic of the information signal. In this specific example, the peak value of the signal information is compared for each adjacent block for each time interval obtained by further dividing the orthogonal transform time block size, thereby detecting and increasing the time region in which the amplitude of the information signal rapidly increases. The division rate is determined according to the degree of the state.

【００７２】時間変化率計算は次式のように行う。The time change rate calculation is performed by the following equation.

【００７３】Ｖｔ＝ΣＶｍＶａｖ＝（１／Ｖｍａｘ）＊（１／Ｃｈ）ＶｔVt = ΣVm Vav = (1 / Vmax) * (1 / Ch) Vt

【００７４】ここで、Ｖｔは各チャネルの時間サブブロ
ックのピーク値の小から大への変化をｄＢ値で表したも
ののチャネルに関する和、Ｖｍは各チャネルの時間サブ
ブロックのピーク値の小から大への変化をｄＢ値で表し
たもので一番大きいものの大きさ（但し最大値を３０ｄ
Ｂに制限しＶｍａｘであらわす。ｍはチャネル番号、Ｃ
ｈはチャネル数、Ｖａｖは時間サブブロックのピーク値
の小から大への変化をｄＢ値で表したもののチャネル平
均である。Here, Vt is the sum of the values of the change of the peak value of the time subblock of each channel from small to large expressed in dB, and Vm is the sum of the peak value of the time subblock of each channel from small to large. Change in dB value, the size of the largest one (however, the maximum value is 30d
It is limited to B and is represented by Vmax. m is the channel number, C
h is the number of channels, and Vav is the channel average of the change in peak value of the time subblock from small to large expressed in dB.

【００７５】このようにして求められた時間変化率Ｖａ
ｖとビット配分(1) の配分量とは、図１１に示すように
対応付けられる。最終的にビット配分(1) への配分量は
次の式で求められる。The time change rate Va thus obtained
v and the allocation amount of the bit allocation (1) are associated with each other as shown in FIG. Finally, the amount of allocation to bit allocation (1) is calculated by the following formula.

【００７６】Ｂ＝１／２（Ｂｆ＋Ｂｔ）B = 1/2 (Bf + Bt)

【００７７】ここで、Ｂは最終的なビット配分(1) への
配分量、ＢｆはＴｖａより求められたビット配分量、Ｂ
ｔはＶａｖより求められたビット配分量である。Here, B is the final bit allocation amount (1), Bf is the bit allocation amount obtained from Tva, and B is
t is the bit allocation amount obtained from Vav.

【００７８】ここでのビット配分(1) はスケ−ルファク
タに依存した周波数、時間領域上の配分がなされる。The bit allocation (1) here is allocated in the frequency and time domains depending on the scale factor.

【００７９】このようにしてビット配分(1) に使用され
るビット量が決定されたならば、次にビット配分(1) で
使われなかったビットについての配分すなわちビット配
分(2) を決定する。ここでは多種のビット配分が行われ
る。When the bit amount used for the bit allocation (1) is determined in this way, the allocation for the bits not used in the bit allocation (1), that is, the bit allocation (2) is determined next. . Various bit allocations are performed here.

【００８０】第１に全てのサンプル値に対する均一配分
が行われる。この場合のビット配分に対する量子化雑音
スペクトルの一例を図１２に示す。この場合、全周波数
帯域で均一の雑音レベル低減が行える。First, a uniform distribution is made for all sample values. FIG. 12 shows an example of the quantization noise spectrum with respect to the bit allocation in this case. In this case, the noise level can be reduced uniformly over the entire frequency band.

【００８１】第２に信号情報の周波数スペクトル及びレ
ベルに対する依存性を持たせた聴覚的な効果を得るため
のビット配分が行われる。この場合のビット配分に対す
る量子化雑音スペクトルの一例を図１３に示す。この例
では情報信号のスペクトルに依存させたビット配分を行
っていて、特に情報信号のスペクトルの低域側にウエイ
トをおいたビット配分を行い、広域側に比して起きる低
域側でのマスキング効果の減少を補償している。これは
隣接臨界帯域間でのマスキングを考慮して、スペクトル
の低域側を重視したマスキングカーブの非対象性に基づ
いている。このように、図１３の例では低域を重視した
ビット配分が行われている。Secondly, bit allocation is performed to obtain an auditory effect having dependency on the frequency spectrum and level of signal information. FIG. 13 shows an example of the quantization noise spectrum with respect to the bit allocation in this case. In this example, bit allocation is performed depending on the spectrum of the information signal, and in particular, bit allocation with weighting is performed on the low frequency side of the spectrum of the information signal, and masking on the low frequency side that occurs compared to the wide area side is performed. It compensates for the decrease in effect. This is based on the asymmetry of the masking curve that attaches importance to the low frequency side of the spectrum, considering masking between adjacent critical bands. As described above, in the example of FIG. 13, bit allocation is performed with emphasis on the low frequency band.

【００８２】そして最終的にビット配分(1) とビット配
分(1) に付加されるビット配分の値の和が図７のビット
配分回路３８でとられる。最終的なビット配分は以上の
各ビット配分の和として与えられる。Finally, the sum of the bit allocation (1) and the value of the bit allocation added to the bit allocation (1) is taken by the bit allocation circuit 38 of FIG. The final bit allocation is given as the sum of the above bit allocations.

【００８３】なお、図１２，図１３の図中Ｓは信号スペ
クトルを、ＮＬ１は上記全てのサンプルに対する均一配
分による雑音レベルを、ＮＬ２は上記周波数スペクトル
及びレベルに対する依存正を持たせた聴覚的な高かを得
るためのビット配分による雑音レベルを示している。12 and 13, S is the signal spectrum, NL1 is the noise level due to uniform distribution for all the samples, and NL2 is the auditory sense with a positive dependence on the frequency spectrum and level. It shows the noise level due to the bit allocation to get high.

【００８４】次にチャネルビット配分を含まないビット
配分の別の手法を次に説明する。この場合の適応ビット
配分回路の動作を図１４で説明するとＭＤＣＴ係数の大
きさが各ブロックごとに求められ、そのＭＤＣＴ係数が
入力端子８０１に供給される。当該入力端子８０１に供
給されたＭＤＣＴ係数は、帯域毎のエネルギ算出回路８
０３に与えられる。帯域毎のエネルギ算出回路８０３で
は、クリティカルバンドまたは高域においてはクリティ
カルバンドを更に再分割したそれぞれの帯域に関する信
号エネルギを算出する。帯域毎のエネルギ算出回路８０
３で算出されたそれぞれの帯域に関するエネルギは、エ
ネルギ依存ビット配分回路８０４に供給される。Next, another method of bit allocation not including channel bit allocation will be described below. The operation of the adaptive bit allocation circuit in this case will be described with reference to FIG. 14, the magnitude of the MDCT coefficient is obtained for each block, and the MDCT coefficient is supplied to the input terminal 801. The MDCT coefficient supplied to the input terminal 801 is the energy calculation circuit 8 for each band.
Given to 03. The energy calculation circuit 803 for each band calculates the signal energy for each band obtained by further dividing the critical band or the critical band in the high band. Energy calculation circuit 80 for each band
The energy for each band calculated in 3 is supplied to the energy-dependent bit allocation circuit 804.

【００８５】エネルギ依存ビット配分回路８０４では、
使用可能総ビット発生回路８０２からの使用可能総ビッ
ト、本実施例では１２８Ｋｂｐｓの内のある割合（本実
施例では１００Ｋｂｐｓ）を用いて白色の量子化雑音を
作り出すようなビット配分を行う。このとき、入力信号
のトーナリティが高いほど、すなわち入力信号のスペク
トルの凸凹が大きいほど、このビット量が上記１２８Ｋ
ｂｐｓに占める割合が増加する。なお、入力信号のスペ
クトルの凸凹を検出するには、隣接するブロックのブロ
ックフローティング係数の差の絶対値の和を指標として
使う。そして、求められた使用可能なビット量につき、
各帯域のエネルギの対数値に比例したビット配分を行
う。In the energy-dependent bit allocation circuit 804,
The available total bit from the available total bit generation circuit 802, which is a ratio of 128 Kbps in this embodiment (100 Kbps in this embodiment), is used to perform bit allocation so as to generate white quantization noise. At this time, the higher the tonality of the input signal, that is, the larger the unevenness of the spectrum of the input signal, the more the bit amount becomes 128K.
The ratio to bps increases. In addition, in order to detect the unevenness of the spectrum of the input signal, the sum of the absolute values of the differences of the block floating coefficients of the adjacent blocks is used as an index. Then, regarding the available bit amount obtained,
Bit allocation is performed in proportion to the logarithmic value of the energy of each band.

【００８６】聴覚許容雑音レベルに依存したビット配分
算出回路８０５は、まず上記クリティカルバンド毎に分
割されたスペクトルデータに基づき、いわゆるマスキン
グ効果等を考慮した各クリティカルバンド毎の許容ノイ
ズ量を求め、次に聴覚許容雑音スペクトルを与えるよう
に使用可能総ビットからエネルギ依存ビットを引いたビ
ット分が配分される。このようにして求められたエネル
ギ依存ビットと聴覚許容雑音レベルに依存したビットは
加算されて、図５（図４の場合も同様）の適応ビット配
分符号化回路１６、１７、１８により各クリティカルバ
ンド毎もしくは高域においてはクリティカルバンドを更
に複数帯域に分割した帯域に割り当てられたビット数に
応じて各スペクトルデータ（あるいはＭＤＣＴ係数デー
タ）を再量子化するようにしている。このようにして符
号化されたデータは、図５の出力端子２２、２４、２６
を介して取り出される。The bit allocation calculation circuit 805, which depends on the permissible noise level for hearing, first determines the permissible noise amount for each critical band in consideration of the so-called masking effect, etc., based on the spectral data divided for each critical band. Bits obtained by subtracting energy-dependent bits from the total available bits are distributed so as to give a perceptible noise spectrum to the. The energy-dependent bits thus obtained and the bits depending on the permissible noise level of the hearing are added, and the adaptive bit allocation coding circuits 16, 17, and 18 shown in FIG. In each or in the high frequency band, each spectrum data (or MDCT coefficient data) is requantized according to the number of bits assigned to the band obtained by further dividing the critical band into a plurality of bands. The data encoded in this way is output to the output terminals 22, 24, 26 of FIG.
Taken out through.

【００８７】さらに詳しく上記聴覚許容雑音スペクトル
依存のビット配分回路８０５中の聴覚許容雑音スペクト
ル算出回路について説明すると、ＭＤＣＴ回路１３、１
４、１５で得られたＭＤＣＴ係数が上記許容雑音算出回
路に与えられる。The auditory permissible noise spectrum calculation circuit in the bit allocation circuit 805 depending on the permissible auditory noise spectrum will be described in more detail.
The MDCT coefficients obtained in 4 and 15 are given to the allowable noise calculating circuit.

【００８８】図１５は上記許容雑音算出回路をまとめて
説明した一具体例の概略構成を示すブロック回路図であ
る。この図１５において、入力端子５２１には、ＭＤＣ
Ｔ回路１３、１４、１５からの周波数領域のスペクトル
データが供給されている。FIG. 15 is a block circuit diagram showing a schematic configuration of a specific example in which the allowable noise calculating circuits are collectively described. In FIG. 15, the input terminal 521 has an MDC
The spectrum data in the frequency domain is supplied from the T circuits 13, 14, and 15.

【００８９】この周波数領域の入力データは、帯域毎の
エネルギ算出回路５２２に送られて、上記クリティカル
バンド（臨界帯域）毎のエネルギが、例えば当該バンド
内での各振幅値２乗の総和を計算すること等により求め
られる。この各バンド毎のエネルギの代わりに、振幅値
のピーク値、平均値等が用いられることもある。このエ
ネルギ算出回路５２２からの出力として、例えば各バン
ドの総和値のスペクトルは、一般にバークスペクトルと
称されている。図１６はこのような各クリティカルバン
ド毎のバークスペクトルＳＢを示している。ただし、こ
の図１６では、図示を簡略化するため、上記クリティカ
ルバンドのバンド数を１２バンド（Ｂ1〜Ｂ12）で表現
している。The input data in the frequency domain is sent to the energy calculation circuit 522 for each band, and the energy of each critical band (critical band) is calculated, for example, as the sum of squared amplitude values in the band. It is required by doing. Instead of the energy for each band, a peak value, an average value, etc. of the amplitude value may be used. As an output from the energy calculation circuit 522, for example, the spectrum of the total sum value of each band is generally called a Bark spectrum. FIG. 16 shows the Bark spectrum SB for each such critical band. However, in FIG. 16, in order to simplify the illustration, the number of bands of the critical band is represented by 12 bands (B1 to B12).

【００９０】ここで、上記バークスペクトルＳＢのいわ
ゆるマスキングに於ける影響を考慮するために、該バー
クスペクトルＳＢに所定の重み付け関数を掛けて加算す
るような畳込み（コンボリューション）処理を施す。こ
のため、上記帯域毎のエネルギ算出回路５２２の出力す
なわち該バークスペクトルＳＢの各値は、畳込みフィル
タ回路５２３に送られる。該畳込みフィルタ回路５２３
は、例えば、入力データを順次遅延させる複数の遅延素
子と、これら遅延素子からの出力にフィルタ係数（重み
付け関数）を乗算する複数の乗算器（例えば各バンドに
対応する２５個の乗算器）と、各乗算器出力の総和をと
る総和加算器とから構成されるものである。なお、上記
マスキングとは、人間の聴覚上の特性により、ある信号
によって他の信号がマスクされて聞こえなくなる現象を
いうものであり、このマスキング効果には、時間領域の
オーディオ信号による時間軸マスキング効果と、周波数
領域の信号による同時刻マスキング効果とがある。これ
らのマスキング効果により、マスキングされる部分にノ
イズがあったとしても、このノイズは聞こえないことに
なる。このため、実際のオーディオ信号では、このマス
キングされる範囲内のノイズは許容可能なノイズとされ
る。Here, in order to consider the influence of so-called masking of the Bark spectrum SB, a convolution process is performed such that the Bark spectrum SB is multiplied by a predetermined weighting function and added. Therefore, the output of the energy calculation circuit 522 for each band, that is, each value of the Bark spectrum SB is sent to the convolution filter circuit 523. The convolution filter circuit 523
Is, for example, a plurality of delay elements that sequentially delay input data, and a plurality of multipliers (for example, 25 multipliers corresponding to each band) that multiply outputs from these delay elements by a filter coefficient (weighting function). , A sum total adder that sums the outputs of the respective multipliers. Note that the masking is a phenomenon in which one signal is masked by another signal and becomes inaudible due to human auditory characteristics.The masking effect includes a time-axis masking effect by an audio signal in the time domain. And there is the same time masking effect by the signal in the frequency domain. Due to these masking effects, even if there is noise in the masked portion, this noise cannot be heard. Therefore, in the actual audio signal, the noise within the masked range is regarded as an acceptable noise.

【００９１】ここで、上記畳込みフィルタ回路５２３の
各乗算器の乗算係数（フィルタ係数）の一具体例を示す
と、任意のバンドに対応する乗算器Ｍの係数を１とする
とき、乗算器Ｍ−１で係数０．１５を、乗算器Ｍ−２で
係数０．００１９を、乗算器Ｍ−３で係数０．００００
０８６を、乗算器Ｍ＋１で係数０．４を、乗算器Ｍ＋２
で係数０．０６を、乗算器Ｍ＋３で係数０．００７を各
遅延素子の出力に乗算することにより、上記バークスペ
クトルＳＢの畳込み処理が行われる。ただし、Ｍは１〜
２５の任意の整数である。Here, a specific example of the multiplication coefficient (filter coefficient) of each multiplier of the convolution filter circuit 523 will be described. When the coefficient of the multiplier M corresponding to an arbitrary band is 1, the multiplier M-1 gives a coefficient of 0.15, multiplier M-2 gives a coefficient of 0.0019, and multiplier M-3 gives a coefficient of 0.0000.
086, multiplier M + 1 gives a coefficient of 0.4, multiplier M + 2
By multiplying the output of each delay element by a coefficient of 0.06 and a coefficient of 0.007 by a multiplier M + 3, the convolution processing of the Bark spectrum SB is performed. However, M is 1 to
It is an arbitrary integer of 25.

【００９２】次に、上記畳込みフィルタ回路５２３の出
力は引算器５２４に送られる。該引算器５２４は、上記
畳込んだ領域での後述する許容可能なノイズレベルに対
応するレベルαを求めるものである。なお、当該許容可
能なノイズレベル（許容ノイズレベル）に対応するレベ
ルαは、後述するように、逆コンボリューション処理を
行うことによって、クリティカルバンドの各バンド毎の
許容ノイズレベルとなるようなレベルである。ここで、
上記引算器５２４には、上記レベルαを求めるるための
許容関数（マスキングレベルを表現する関数）が供給さ
れる。この許容関数を増減させることで上記レベルαの
制御を行っている。当該許容関数は、次に説明するよう
な（ｎ−ａｉ）関数発生回路５２５から供給されている
ものである。Next, the output of the convolution filter circuit 523 is sent to the subtractor 524. The subtractor 524 calculates a level α corresponding to an allowable noise level described later in the convoluted area. The level α corresponding to the permissible noise level (permissible noise level) is a level at which the critical noise band becomes the permissible noise level for each band by performing inverse convolution processing, as described later. is there. here,
The subtractor 524 is supplied with an allowance function (function expressing a masking level) for obtaining the level α. The level α is controlled by increasing or decreasing this allowance function. The permissible function is supplied from the (n-ai) function generating circuit 525 as described below.

【００９３】すなわち、許容ノイズレベルに対応するレ
ベルαは、クリティカルバンドのバンドの低域から順に
与えられる番号をｉとすると、次の式で求めることがで
きる。 α＝Ｓ−（ｎ−ａｉ）この式において、ｎ，ａは定数でａ＞０、Ｓは畳込み処
理されたバークスペクトルの強度であり、式中(n-ai)が
許容関数となる。例としてｎ＝３８，ａ＝−0.5を用い
ることができる。That is, the level α corresponding to the allowable noise level can be obtained by the following equation, where i is the number given in order from the low band of the critical band. α = S- (n-ai) In this equation, n and a are constants, a> 0, and S is the intensity of the convolution-processed Bark spectrum. In the equation, (n-ai) is the tolerance function. As an example, n = 38 and a = -0.5 can be used.

【００９４】このようにして、上記レベルαが求めら
れ、このデータは、割算器５２６に伝送される。当該割
算器５２６では、上記畳込みされた領域での上記レベル
αを逆コンボリューションするためのものである。した
がって、この逆コンボリューション処理を行うことによ
り、上記レベルαからマスキングスレッショールドが得
られるようになる。すなわち、このマスキングスレッシ
ョールドが許容ノイズスペクトルとなる。なお、上記逆
コンボリューション処理は、複雑な演算を必要とする
が、本実施例では簡略化した割算器５２６を用いて逆コ
ンボリューションを行っている。In this way, the level α is obtained, and this data is transmitted to the divider 526. The divider 526 is for deconvolution of the level α in the convolved area. Therefore, by performing the inverse convolution processing, the masking threshold can be obtained from the level α. That is, this masking threshold becomes the allowable noise spectrum. Although the above-mentioned inverse convolution processing requires complicated calculation, in this embodiment, the inverse convolution is performed using the simplified divider 526.

【００９５】次に、上記マスキングスレッショールド
は、合成回路５２７を介して減算器５２８に伝送され
る。ここで、当該減算器５２８には、上記帯域毎のエネ
ルギ検出回路５２２からの出力、すなわち前述したバー
クスペクトルＳＢが、遅延回路５２９を介して供給され
ている。したがって、この減算器５２８で上記マスキン
グスレッショールドとバークスペクトルＳＢとの減算演
算が行われることで、図１７に示すように、上記バーク
スペクトルＳＢは、該マスキングスレッショールドＭＳ
のレベルで示すレベル以下がマスキングされることにな
る。なお、遅延回路５２９は上記合成回路５２７以前の
各回路での遅延量を考慮してエネルギ検出回路５２２か
らのバークスペクトルＳＢを遅延させるために設けられ
ている。Next, the masking threshold is transmitted to the subtractor 528 via the synthesizing circuit 527. Here, the output from the energy detection circuit 522 for each band, that is, the above-described Bark spectrum SB is supplied to the subtractor 528 via the delay circuit 529. Therefore, the subtractor 528 performs a subtraction operation on the masking threshold and the Bark spectrum SB, so that the Bark spectrum SB shows the masking threshold MS as shown in FIG.
The level below the level indicated by will be masked. The delay circuit 529 is provided in order to delay the Bark spectrum SB from the energy detection circuit 522 in consideration of the delay amount in each circuit before the synthesis circuit 527.

【００９６】当該減算器５２８からの出力は、許容雑音
補正回路５３０を介し、出力端子５３１を介して取り出
され、例えば配分ビット数情報が予め記憶されたＲＯＭ
等（図示せず）に送られる。このＲＯＭ等は、上記減算
回路５２８から許容雑音補正回路５３０を介して得られ
た出力（上記各バンドのエネルギと上記ノイズレベル設
定手段の出力との差分のレベル）に応じ、各バンド毎の
配分ビット数情報を出力する。The output from the subtracter 528 is taken out via the output terminal 531 via the allowable noise correction circuit 530, and, for example, the ROM in which the distribution bit number information is stored in advance.
Etc. (not shown). The ROM or the like is distributed for each band according to the output (the level of the difference between the energy of each band and the output of the noise level setting means) obtained from the subtraction circuit 528 through the allowable noise correction circuit 530. Outputs bit number information.

【００９７】このようにしてエネルギ依存ビットと聴覚
許容雑音レベルに依存したビットは加算されてその配分
ビット数情報が図５の端子２８を介して上記適応ビット
配分符号化回路１６、１７、１８に送られることで、こ
こでＭＤＣＴ回路１３、１４、１５からの周波数領域の
各スペクトルデータがそれぞれのバンド毎に割り当てら
れたビット数で量子化されるわけである。In this way, the energy-dependent bits and the bits depending on the permissible hearing noise level are added, and the distributed bit number information is supplied to the adaptive bit allocation coding circuits 16, 17, and 18 via the terminal 28 of FIG. By being transmitted, each spectrum data in the frequency domain from the MDCT circuits 13, 14, 15 is quantized by the number of bits assigned to each band.

【００９８】すなわち要約すれば、適応ビット配分符号
化回路１６、１７、１８では、上記クリティカルバンド
の各バンド帯域（クリティカルバンド）毎もしくは高域
においてはクリティカルバンドを更に複数帯域に分割し
た帯域のエネルギもしくはピーク値と上記ノイズレベル
設定手段の出力との差分のレベルに応じて配分されたビ
ット数で上記各バンド毎のスペクトルデータを量子化す
ることになる。In summary, in the adaptive bit allocation coding circuits 16, 17, and 18, the energy of each critical band of the critical band or the energy of a band obtained by further dividing the critical band into a plurality of bands in the high band. Alternatively, the spectrum data for each band is quantized by the number of bits distributed according to the level of the difference between the peak value and the output of the noise level setting means.

【００９９】ところで、上述した合成回路５２７での合
成の際には、最小可聴カーブ発生回路５３２から供給さ
れる図１７に示すような人間の聴覚特性であるいわゆる
最小可聴カーブＲＣを示すデータと、上記マスキングス
レッショールドＭＳとを合成することができる。この最
小可聴カーブにおいて、雑音絶対レベルがこの最小可聴
カーブ以下ならば該雑音は聞こえないことになる。この
最小可聴カーブは、コーディングが同じであっても例え
ば再生時の再生ボリュームの違いで異なるものとなが、
現実的なディジタルシステムでは、例えば１６ビットダ
イナミックレンジへの音楽のはいり方にはさほど違いが
ないので、例えば４ｋＨｚ付近の最も耳に聞こえやすい
周波数帯域の量子化雑音が聞こえないとすれば、他の周
波数帯域ではこの最小可聴カーブのレベル以下の量子化
雑音は聞こえないと考えられる。したがって、このよう
に例えばシステムの持つダイナミックレンジの４ｋＨｚ
付近の雑音が聞こえない使い方をすると仮定し、この最
小可聴カーブＲＣとマスキングスレッショールドＭＳと
を共に合成することで許容ノイズレベルを得るようにす
ると、この場合の許容ノイズレベルは、図１７中の斜線
で示す部分までとすることができるようになる。なお、
本実施例では、上記最小可聴カーブの４ｋＨｚのレベル
を、例えば２０ビット相当の最低レベルに合わせてい
る。また、この図１７は、信号スペクトルＳＳも同時に
示している。By the way, at the time of synthesizing by the above-mentioned synthesizing circuit 527, data indicating a so-called minimum audible curve RC which is a human auditory characteristic as shown in FIG. The masking threshold MS can be combined. In this minimum audible curve, if the absolute noise level is below this minimum audible curve, the noise will not be heard. Even if the coding is the same, this minimum audible curve is different due to the difference in the playback volume during playback,
In a realistic digital system, there is not much difference in how music is put into a 16-bit dynamic range, so if the quantization noise in the most audible frequency band around 4 kHz is inaudible, for example, other It is considered that quantization noise below the level of this minimum audible curve is inaudible in the frequency band. Therefore, for example, the dynamic range of the system is 4 kHz.
Assuming that the noise is not heard in the vicinity, and the allowable noise level is obtained by synthesizing the minimum audible curve RC and the masking threshold MS together, the allowable noise level in this case is shown in FIG. It will be possible to go up to the part shown by the diagonal line of. In addition,
In this embodiment, the level of 4 kHz of the minimum audible curve is set to the minimum level equivalent to 20 bits, for example. Further, FIG. 17 also shows the signal spectrum SS at the same time.

【０１００】また、上記許容雑音補正回路５３０では、
補正情報出力回路５３３から送られてくる例えば等ラウ
ドネスカーブの情報に基づいて、上記減算器５２８から
の出力における許容雑音レベルを補正している。ここ
で、等ラウドネスカーブとは、人間の聴覚特性に関する
特性曲線であり、例えば１ｋＨｚの純音と同じ大きさに
聞こえる各周波数での音の音圧を求めて曲線で結んだも
ので、ラウドネスの等感度曲線とも呼ばれる。またこの
等ラウドネス曲線は、図１７に示した最小可聴カーブＲ
Ｃと略同じ曲線を描くものである。この等ラウドネス曲
線においては、例えば４ｋＨｚ付近では１ｋＨｚのとこ
ろより音圧が８〜１０ｄＢ下がっても１ｋHzと同じ大き
さに聞こえ、逆に、５０Ｈｚ付近では１ｋＨｚでの音圧
よりも約１５ｄＢ高くないと同じ大きさに聞こえない。
このため、上記最小可聴カーブのレベルを越えた雑音
（許容ノイズレベル）は、該等ラウドネス曲線に応じた
カーブで与えられる周波数特性を持つようにするのが良
いことがわかる。このようなことから、上記等ラウドネ
ス曲線を考慮して上記許容ノイズレベルを補正すること
は、人間の聴覚特性に適合していることがわかる。In the allowable noise correction circuit 530,
The allowable noise level in the output from the subtractor 528 is corrected based on the information of the equal loudness curve sent from the correction information output circuit 533, for example. Here, the equal loudness curve is a characteristic curve relating to human auditory characteristics, for example, a curve obtained by obtaining the sound pressure of sound at each frequency that sounds the same as a pure tone of 1 kHz, and connecting the curves. Also called sensitivity curve. Further, this equal loudness curve is the minimum audible curve R shown in FIG.
It draws a curve substantially the same as C. In this equal loudness curve, for example, in the vicinity of 4 kHz, even if the sound pressure is reduced by 8 to 10 dB from 1 kHz, it sounds as loud as 1 kHz. It doesn't sound the same.
Therefore, it is understood that the noise exceeding the level of the minimum audible curve (allowable noise level) should have the frequency characteristic given by the curve corresponding to the equal loudness curve. From this, it can be seen that correcting the permissible noise level in consideration of the equal loudness curve is suitable for human hearing characteristics.

【０１０１】以上述べた聴覚許容雑音レベルに依存した
スペクトル形状を使用可能総ビット１２８Ｋｂｐｓの内
のある割合を用いるビット配分でつくる。この割合は入
力信号のトーナリティが高くなるほど減少する。The spectrum shape depending on the permissible hearing noise level described above is created by bit allocation using a certain ratio of the total usable bits of 128 Kbps. This ratio decreases as the tonality of the input signal increases.

【０１０２】次に２つのビット配分手法の間でのビット
量分割手法について説明する。図１４に戻って、ＭＤＣ
Ｔ回路出力が供給される入力端子８０１からの信号は、
スペクトルの滑らかさ算出回路８０８にも与えられ、こ
こでスペクトルの滑らかさが算出される。本実施例で
は、信号スペクトルの絶対値の隣接値間の差の絶対値の
和を、信号スペクトルの絶対値の和で割った値を、上記
スペクトルの滑らかさとして算出している。Next, a bit amount dividing method between the two bit allocation methods will be described. Returning to FIG. 14, the MDC
The signal from the input terminal 801 to which the T circuit output is supplied is
The spectrum smoothness calculation circuit 808 is also applied to calculate the spectrum smoothness. In this embodiment, a value obtained by dividing the sum of the absolute values of the differences between the adjacent values of the absolute value of the signal spectrum by the sum of the absolute values of the signal spectrum is calculated as the smoothness of the spectrum.

【０１０３】上記スペクトルの滑らかさ算出回路８０８
の出力は、ビット分割率決定回路８０９に与えられ、こ
こでエネルギ依存のビット配分と、聴覚許容雑音スペク
トルによるビット配分間のビット分割率とが決定され
る。ビット分割率はスペクトルの滑らかさ算出回路８０
８の出力値が大きいほど、スペクトルの滑らかさが無い
と考えて、エネルギ依存のビット配分よりも、聴覚許容
雑音スペクトルによるビット配分に重点をおいたビット
配分を行う。ビット分割率決定回路８０９は、それぞれ
エネルギ依存のビット配分及び聴覚許容雑音スペクトル
によるビット配分の大きさをコントロールするマルチプ
ライヤ８１１及び８１２に対してコントロール出力を送
る。ここで、仮にスペクトルが滑らかであり、エネルギ
依存のビット配分に重きをおくように、マルチプライヤ
８１１へのビット分割率決定回路８０９の出力が０．８
の値を取ったとき、マルチプライヤ８１２へのビット分
割率決定回路８０９の出力は１−０．８＝０．２とす
る。これら２つのマルチプライヤの出力はアダー８０６
で足し合わされて最終的なビット配分情報となって、出
力端子８０７から出力される。The spectrum smoothness calculation circuit 808 described above.
Is supplied to the bit division rate determination circuit 809, and the bit division rate between the energy-dependent bit allocation and the bit allocation according to the perceptual noise spectrum is determined. The bit division rate is the smoothness calculation circuit 80 of the spectrum.
It is considered that the larger the output value of 8, the smoother the spectrum is, and the bit allocation is performed with more emphasis on the bit allocation by the perceptual noise spectrum than the energy-dependent bit allocation. The bit division ratio determination circuit 809 sends control outputs to multipliers 811 and 812 which control the amount of bit distribution depending on the energy and the permissible noise spectrum of the auditory sense, respectively. Here, if the spectrum is smooth and the output of the bit division rate determination circuit 809 to the multiplier 811 is 0.8 so as to emphasize the energy-dependent bit allocation.
, The output of the bit division rate determination circuit 809 to the multiplier 812 is 1-0.8 = 0.2. The output of these two multipliers is the adder 806.
Are added together to form final bit allocation information, which is output from the output terminal 807.

【０１０４】このときのビット配分の様子を図１８、図
１９に示す。また、これに対応する量子化雑音の様子を
図２０、図２１に示す。図１８は信号のスペクトルが割
合平坦である場合を示しており、図１９は信号スペクト
ルが高いトーナリティを示す場合を示している。また、
図１８及び図１９の図中ＱＳは信号レベル依存分のビッ
ト量を示し、図中ＱＮは聴覚許容雑音レベル依存のビッ
ト割当分のビット量を示している。図２０及び図２１の
図中Ｌは信号レベルを示し、図中ＮＳは信号レベル依存
分による雑音低下分を、図中ＮＮは聴覚許容雑音レベル
依存のビット割当分による雑音低下分を示している。The state of bit allocation at this time is shown in FIGS. 20 and 21 show the states of quantization noise corresponding to this. FIG. 18 shows a case where the spectrum of the signal is relatively flat, and FIG. 19 shows a case where the signal spectrum shows a high tonality. Also,
In FIG. 18 and FIG. 19, QS indicates the bit amount corresponding to the signal level, and QN in the diagrams indicates the bit amount corresponding to the bit allocation depending on the permissible noise level. 20 and 21, L represents the signal level, NS represents the noise reduction due to the signal level dependency, and NN represents the noise reduction due to the bit allocation that depends on the permissible hearing noise level. .

【０１０５】先ず、信号のスペクトルが、割合平坦であ
る場合を示す図１８において、聴覚許容雑音レベルに依
存したビット配分は、全帯域に渡り大きい信号雑音比を
取るために役立つ。しかし低域及び高域では比較的少な
いビット配分が使用されている。これは聴覚的にこの帯
域の雑音に対する感度が小さいためである。信号エネル
ギレベルに依存したビット配分の分は量としては少ない
が、ホワイトな雑音スペクトルを生じるように、この場
合には中低域の信号レベルの高い周波数領域に重点的に
配分されている。First, in FIG. 18 showing the case where the spectrum of the signal is flat, the bit allocation depending on the permissible hearing noise level is useful for obtaining a large signal to noise ratio over the entire band. However, relatively low bit allocations are used in the low and high frequencies. This is because auditory sensitivity to noise in this band is low. The bit allocation depending on the signal energy level is small in amount, but in this case, it is concentrated in the high frequency region of the signal level in the middle and low frequencies so as to generate a white noise spectrum.

【０１０６】これに対して、図１９に示すように、信号
スペクトルが高いトーナリティを示す場合には、信号エ
ネルギレベルに依存したビット配分量が多くなり、量子
化雑音の低下は極めて狭い帯域の雑音を低減するために
使用される。聴覚許容雑音レベルに依存したビット配分
分の集中はこれよりもきつくない。On the other hand, as shown in FIG. 19, when the signal spectrum shows a high tonality, the bit allocation amount depending on the signal energy level becomes large, and the quantization noise is reduced in a very narrow band. Used to reduce The concentration of the bit allocation depending on the permissible noise level of the auditory sense is less intense.

【０１０７】図１４に示すように、この両者のビット配
分の和により、孤立スペクトル入力信号での特性の向上
が達成される。As shown in FIG. 14, the improvement of the characteristics in the isolated spectrum input signal is achieved by the sum of the bit allocations of the both.

【０１０８】以上の様にして得られたチャネルビット配
分を含むビット配分とチャネルビット配分を含まないビ
ット配分の２つを用いて、次のようにして第１と第２の
量子化を行う。Using the two bit allocations including the channel bit allocations and the bit allocations not including the channel bit allocations obtained as described above, the first and second quantization are performed as follows.

【０１０９】図２２を用いて説明する。この例では、全
６チャネルのうちでチャネルビット配分を含むビット配
分により１４７ｋｂｐｓを越えるビット配分がなされる
チャネルはセンタチャネルとサブウーファチャネルとラ
イトチャネルである。Description will be made with reference to FIG. In this example, among all 6 channels, the channels to which the bit allocation including the channel bit allocation exceeds 147 kbps are the center channel, the subwoofer channel, and the write channel.

【０１１０】まず、チャネルビット配分を含むビット配
分量が１４７ｋｂｐｓを越えるチャネルについて、ある
一定のビット量例えば１２８ｋｂｐｓを最大とする部分
と１２８ｋｂｐｓを越える部分に２分する。First, for a channel in which the bit allocation amount including the channel bit allocation exceeds 147 kbps, it is divided into a certain fixed bit amount, for example, a portion where the maximum is 128 kbps and a portion where it exceeds 128 kbps.

【０１１１】この処理を行う構成を図２３に示す。図２
３の構成では、チャネルビット配分を含むビット配分で
の配分量が１４７ｋｂｐｓを越えるビット配分の各サン
プルについて、複数サンプルごとのブロックについての
正規化処理すなわちブロックフローティングを行う。こ
の時どの程度のブロックフローティングが行われたかを
示す係数としてスケールファクタが得られる。FIG. 23 shows a configuration for performing this processing. Figure 2
In the configuration of No. 3, the normalization process, that is, the block floating, is performed on the block for each of a plurality of samples for each sample of the bit allocation in which the allocation amount in the bit allocation including the channel bit allocation exceeds 147 kbps. At this time, a scale factor is obtained as a coefficient indicating how much block floating has been performed.

【０１１２】この図２３において、入力端子９００に供
給されたＭＤＣＴ係数（ＭＤＣＴサンプル）は正規化回
路９０５によって複数サンプル毎に、ブロックについて
の正規化処理すなわちブロックフローティングが施され
る。この時どの程度のブロックフローティングが行われ
たかを示す係数としてスケールファクタが得られる。In FIG. 23, the MDCT coefficient (MDCT sample) supplied to the input terminal 900 is subjected to a normalization process for each block, that is, a block floating process by the normalization circuit 905 every plural samples. At this time, a scale factor is obtained as a coefficient indicating how much block floating has been performed.

【０１１３】次段の第１の量子化器(quantizer) ９０１
は、前記チャネルビット配分を含まないビット配分の各
サンプル語長で量子化を行なう。この時、量子化雑音を
少なくするためには四捨五入による量子化が行われる。Next-stage first quantizer 901
Performs quantization with each sample word length of bit allocation not including the channel bit allocation. At this time, quantization by rounding is performed in order to reduce the quantization noise.

【０１１４】次に、上記正規化回路９０５の出力と上記
量子化器９０１の出力が差分器９０２に送られる。すな
わち、当該差分器９０２では、量子化器９０１の入力と
出力の差（量子化誤差）が取られる。この差分器９０２
からの出力は、さらに正規化回路９０６を介して第２の
量子化器９０３に送られる。Next, the output of the normalization circuit 905 and the output of the quantizer 901 are sent to the difference unit 902. That is, the difference unit 902 takes the difference (quantization error) between the input and output of the quantizer 901. This differencer 902
Is further sent to the second quantizer 903 via the normalization circuit 906.

【０１１５】当該第２の量子化器９０３では、前記チャ
ネルビット配分を含むビット配分の各サンプル語長と前
記チャネルビット配分を含まないビット配分の各サンプ
ル語長の差の語長が各サンプル毎に使用される。この時
のフローティング係数は第１の量子化器９０１で用いら
れたフローティング係数と語長から自動的に決定され
る。すなわち第１の量子化器９０１で用いられた語長が
Ｎビットであったときには、（２＊＊Ｎ）で第２の量子
化器９０３で用いられるフローティング係数が得られ
る。In the second quantizer 903, the word length of the difference between each sample word length of the bit allocation including the channel bit allocation and each sample word length of the bit allocation not including the channel bit allocation is determined for each sample. Used for. The floating coefficient at this time is automatically determined from the floating coefficient used in the first quantizer 901 and the word length. That is, when the word length used in the first quantizer 901 is N bits, the floating coefficient used in the second quantizer 903 is obtained at (2 ** N).

【０１１６】また、上記第２の量子化器９０３では、第
１の量子化器９０１と同じように四捨五入処理を含むビ
ット配分を行う。このようにして２つの量子化により、
前記チャネルビット半分を含むビット配分で１４７ｋｂ
ｐｓを越えるビット配分を受けたチャネルのビットは、
１２８ｋｂｐｓ以下になるべく１２８ｋｂｐｓに近いビ
ット配分と残りのビット配分とに分けられる。In the second quantizer 903, bit allocation including rounding processing is performed in the same manner as the first quantizer 901. In this way, by two quantization,
147 kb in bit allocation including half of the channel bits
The bit of the channel which received the bit allocation exceeding ps is
It is divided into a bit allocation as close to 128 kbps as possible and a remaining bit allocation as low as 128 kbps or less.

【０１１７】ここで、１２８ｋｂｐｓと１４７ｋｂｐｓ
という２つのスレッショールドを設けているのは、以下
のような理由による。すなわち、前記残りのビット配分
データも語長を表すサブ情報が必要であるので、このサ
ブ情報量も含めてデータ領域がとれるようなビット配分
がされる最下限量として１４７ｋｂｐｓが設定されてい
る。また、前記チャネルビット配分を含むビット配分量
が１２８ｋｂｐｓを上回り１４７ｋｂｐｓを下回る場合
には、１２８ｋｂｐｓを越えたデータ部分にはサブ情報
しか書き込めないのでサンプル情報を書き込む余地がな
く意味がなくなってしまう。このため、このような場合
にはこのチャネルは前記チャネルビット配分を含まない
ビット配分で１２８ｋｂｐｓよりも小さく、できるだけ
１２８ｋｂｐｓ近いビット配分を行うために、上記１２
８ｋｂｐｓが設定されている。Here, 128 kbps and 147 kbps
The reason why two thresholds are provided is as follows. That is, since the remaining bit allocation data also needs sub information indicating the word length, 147 kbps is set as the lower limit amount for bit allocation so that the data area can be obtained including the sub information amount. Also, when the bit allocation amount including the channel bit allocation exceeds 128 kbps and falls below 147 kbps, only sub information can be written in the data portion exceeding 128 kbps, so that there is no room to write sample information, which is meaningless. Therefore, in such a case, in order to perform bit allocation which is smaller than 128 kbps and which is as close as possible to 128 kbps, the channel allocation does not include the channel bit allocation.
8 kbps is set.

【０１１８】また、前記チャネルビット配分を含むビッ
ト配分で１２８ｋｂｐｓよりも小さいビット配分となっ
たチャネルは、そのままそのビット配分を使用する。Further, a channel whose bit allocation including the channel bit allocation is smaller than 128 kbps uses the bit allocation as it is.

【０１１９】前に述べたように、前記残りのビット配分
の成分の大きさは図２３で示されるようにビット配分
(1) のスケールファクタとワードレングスからスケール
ファクタを算出できるのでワードレングスのみがデコ−
ダに必要とされる。As described above, the magnitudes of the components of the remaining bit allocation are as shown in FIG.
Since the scale factor can be calculated from the scale factor and the word length in (1), only the word length is
Needed by da.

【０１２０】このようにして量子化器９０１及び９０３
では、それぞれ四捨五入された効率の高い量子化出力が
得られる。In this way, the quantizers 901 and 903 are
In the above, the rounded and highly efficient quantized output is obtained.

【０１２１】なお、図２３の構成（エンコーダ）に対応
する構成（デコーダ）では、上記正規化回路９０５，９
０６に対応する逆正規化処理を行う逆正規化回路９０
８，９０７が設けられ、これら逆正規化回路９０８，９
０７の出力が加算器９０４で加算される。その加算出力
が出力端子９１０から取りだされることになる。In the structure (decoder) corresponding to the structure (encoder) of FIG. 23, the above normalizing circuits 905, 9 are used.
Denormalization circuit 90 for performing denormalization processing corresponding to 06
8 and 907 are provided, and these denormalization circuits 908 and 9 are provided.
The output of 07 is added by the adder 904. The added output is taken out from the output terminal 910.

【０１２２】次に、図１に示したサラウンドレフトチャ
ネルとサラウンドライトチャネルとの間でビット配分を
行う高率圧縮符号化回路２２３，２２４の具体的構成を
図２４に示す。Next, FIG. 24 shows a specific configuration of the high-rate compression coding circuits 223 and 224 for allocating bits between the surround left channel and the surround right channel shown in FIG.

【０１２３】この図２４において、入力端子３０１には
サラウンドレフトチャネルのディジタルオーディオ信号
が、入力端子３１１にはサラウンドライトチャネルのデ
ィジタルオーディオ信号が供給される。In FIG. 24, an input terminal 301 is supplied with a surround left channel digital audio signal, and an input terminal 311 is supplied with a surround right channel digital audio signal.

【０１２４】上記入力端子３０１と３１１からのディジ
タルオーディオ信号は、それぞれ対応するバッファ３０
２，３１２に一旦記憶される。このバッファ３０２，３
１２からは、各々５０％オーバーラップしたＮポイント
（Ｎサンプル）毎のブロックでデータが取り出される。
このブロック単位のデータは、直交変換回路３０３，３
１３に送られ、当該直交変換回路３０３，３１３によっ
てそれぞれＭＤＣＴ及びＭＤＳＴ(Modified Discrete S
ine transform)の直交変換が施される。Digital audio signals from the input terminals 301 and 311 are transferred to the corresponding buffers 30.
2, 312 once. This buffer 302,3
From No. 12, data is taken out in blocks of N points (N samples) each of which overlaps by 50%.
This block-unit data is used for the orthogonal transformation circuits 303 and 3
13 and MDCT and MDST (Modified Discrete S
ine transform).

【０１２５】上記直交変換回路３０３からの係数データ
は、それぞれ対応するサブバンド・ブロックフローティ
ングポイント圧縮回路３０４，３１４によって圧縮され
る。上記サブバンド・ブロックフローティングポイント
圧縮回路３０４，３１４からの係数データと語長情報や
スケールファクタ等のサブ情報は、対応する適応量子化
回路３０５，３１５に送られる。また、サブバンド・ブ
ロックフローティングポイント圧縮回路３０４，３１４
からは、スペクトル情報がｌｏｇスペクトラルエンベロ
ープ検出回路３０８に送られる。The coefficient data from the orthogonal transform circuit 303 is compressed by the corresponding subband block floating point compression circuits 304 and 314, respectively. The coefficient data from the sub-band / block floating point compression circuits 304 and 314 and sub-information such as word length information and scale factor are sent to the corresponding adaptive quantization circuits 305 and 315. Also, the sub-band block floating point compression circuits 304, 314
From, the spectral information is sent to the log spectral envelope detection circuit 308.

【０１２６】上記適応量子化回路３０５は、ｌｏｇスペ
クトラルエンベロープ検出回路３０８によって検出され
たエンベロープ情報に基づいてチャネル間ビット配分量
を決定する分配決定回路３０９からのビット配分情報に
基づいて、上記係数データとサブ情報を適応的に量子化
する。この適応量子化回路３０５，３１５からは、量子
化された係数データとサブ情報及びビット配分情報とが
出力される。この適応量子化回路３０５，３１５の各出
力は、上記マルチプレクス・エラーコレクション回路３
０６，３１６に送られる。The adaptive quantization circuit 305 determines the coefficient data based on the bit allocation information from the distribution determination circuit 309 which determines the inter-channel bit allocation amount based on the envelope information detected by the log spectral envelope detection circuit 308. And sub information is adaptively quantized. The adaptive quantization circuits 305 and 315 output quantized coefficient data, sub information, and bit allocation information. The outputs of the adaptive quantization circuits 305 and 315 are the same as the multiplex error correction circuit 3 described above.
It is sent to 06, 316.

【０１２７】これらマルチプレクス・エラーコレクショ
ン回路３０６，３１６では、各チャネル毎に量子化され
た係数データとサブ情報及びビット配分情報をマルチプ
レクスすると共に、エラー訂正符号を付加する。これら
マルチプレクス・エラーコレクション回路３０６，３１
６からの出力端子３０７，３１７を介した出力が、前記
図１のサラウンドレフトチャネル用とサラウンドライト
チャネル用の高率圧縮符号化回路２２３，２２４の出力
となる。These multiplex error correction circuits 306 and 316 multiplex the quantized coefficient data, sub information and bit allocation information for each channel, and add an error correction code. These multiplex error correction circuits 306 and 31
The outputs from the output terminals 6 and 6 via the output terminals 307 and 317 are the outputs of the high-rate compression encoding circuits 223 and 224 for the surround left channel and the surround right channel of FIG.

【０１２８】次に、図２５には、前記図１の各高聴感圧
縮符号化回路２１７〜２２２に対応する高聴感伸張復号
化回路の構成を示す。すなわち、この図２５の高聴感伸
張復号化回路は、本発明の伝送装置及び方法に対応する
受信装置及び方法に適用されるものであり、前記ｎ個の
複数チャネルのディジタルオーディオ信号のうち、上記
ｍ個のチャネルのディジタルオーディオ信号に対して上
記第１の符号化方式で圧縮符号化がなされた信号を復号
化する回路（１チャネル分）の構成である。Next, FIG. 25 shows the configuration of a high-audibility decompression / decoding circuit corresponding to each of the high-audibility compression / encoding circuits 217 to 222 shown in FIG. That is, the high-audibility expansion decoding circuit of FIG. 25 is applied to the receiving apparatus and method corresponding to the transmitting apparatus and method of the present invention, and among the n plural-channel digital audio signals, This is a configuration of a circuit (for one channel) that decodes a signal that is compression-encoded by the first encoding method with respect to digital audio signals of m channels.

【０１２９】この図２５において、各帯域の量子化され
たＭＤＣＴ係数は復号化装置入力端子１２２、１２４、
１２６に与えられ、また使用されたブロックサイズ情報
及び適応ビット配分情報は入力端子１２３、１２５、１
２７に与えられる。復号化回路１１６、１１７、１１８
では、適応ビット配分情報を用いてビット割当を解除
し、ブロックサイズ情報を用いて伸張復号化を行う。In FIG. 25, the quantized MDCT coefficients of each band are the decoding device input terminals 122 and 124,
The block size information and adaptive bit allocation information given to and used by 126 are input terminals 123, 125, 1
Given to 27. Decoding circuits 116, 117, 118
Then, the adaptive bit allocation information is used to cancel the bit allocation, and the block size information is used to perform decompression decoding.

【０１３０】次に、ＩＭＤＣＴ回路１１３、１１４、１
１５では、周波数領域の信号が時間領域の信号に変換さ
れる。これらの部分帯域の時間領域信号は、ＩＱＭＦ回
路１１２、１１１により、全体域信号に復号化される。Next, the IMDCT circuits 113, 114, 1
At 15, the frequency domain signal is transformed into a time domain signal. IQMF circuits 112 and 111 decode the time domain signals of these subbands into whole-domain signals.

【０１３１】ここで、高聴感伸張復号化回路では、前記
チャネルビット配分を含む１２８ｋｂｐｓ以下のビット
配分(1) が行われるチャネルと、前記チャネルビット配
分を含む１４７ｋｂｐｓ以上のビット配分(2) が行われ
るチャネルにおけるある一定のビット量例えば１２８ｋ
ｂｐｓを最大とする部分と１２８ｋｂｐｓを越える部分
のそれぞれが、上記復号化回路１１６，１１７，１１８
で復号化される。但し、ビット配分(2) の２つ部分はそ
れそれが復号化された後、それぞれのサンプルが加算さ
れて精度の高いサンプルとなる。Here, in the high-audibility decompression decoding circuit, a channel for which bit allocation (1) of 128 kbps or less including the channel bit allocation is performed and a bit allocation (2) of 147 kbps or more including the channel bit allocation are performed. A certain amount of bits in the channel to be addressed, eg 128k
The decoding circuit 116, 117, 118 has a maximum bps portion and a portion exceeding 128 kbps, respectively.
Is decrypted with. However, the two parts of the bit allocation (2) become high-precision samples by adding the respective samples after they are decoded.

【０１３２】また、得られた各チャネルのデータの並べ
方については、シンクブロック中に、先ず、（１）前記
チャネルビット配分を含む１２８ｋｂｐｓ以下のビット
配分が行われるチャネル、（２）前記チャネルビット配
分を含む１４７ｋｂｐｓ以上のビット配分が行われるチ
ャネルにおけるある一定のビット量例えば１２８ｋｂｐ
ｓを最大とする部分を、チャネル順に並べ、次に前記チ
ャネルビット配分を含む１４７ｋｂｐｓ以上のビット配
分が行われるチャネルにおける１２８ｋｂｐｓを越える
部分をチャネル順に並べる。Regarding the way of arranging the obtained data of each channel, in the sync block, first, (1) the channel in which the bit allocation of 128 kbps or less including the channel bit allocation is performed, and (2) the channel bit allocation Including a certain bit amount in a channel in which a bit allocation of 147 kbps or more is performed, for example, 128 kbp
The part having the maximum s is arranged in the order of channels, and then the part exceeding 128 kbps in the channel in which the bit allocation of 147 kbps or more including the channel bit allocation is performed is arranged in the channel order.

【０１３３】次に、図１の高率圧縮符号化回路２２３，
２２４に対応する高率伸張復号化回路の構成を図２６
（１チャネル分）に示す。すなわち、この図２６の高率
伸張復号化回路は、本発明の伝送装置に対応する受信装
置に適用されるものであり、前記ｎ−ｍ個のチャネルの
ディジタルオーディオ信号に対して上記第２の符号化方
式で圧縮符号化がされた信号を復号化する回路（１チャ
ネル分）の構成である。Next, the high-rate compression coding circuit 223 of FIG.
FIG. 26 shows the configuration of a high rate expansion decoding circuit corresponding to H.224.
(1 channel). That is, the high-rate decompression decoding circuit of FIG. 26 is applied to the receiving device corresponding to the transmitting device of the present invention, and the second above-mentioned second is applied to the digital audio signals of the nm channels. This is a configuration of a circuit (for one channel) that decodes a signal that has been compression-encoded by an encoding method.

【０１３４】この図２６において、入力端子４１０に
は、前記高率圧縮符号化が施されたディジタルオーディ
オ信号が供給される。この信号は、デマルチプレクス・
エラーコレクション回路４１１によってデマルチプレク
スとエラー訂正が行われる。In FIG. 26, the input terminal 410 is supplied with the high-rate compression-encoded digital audio signal. This signal is demultiplexed
The error correction circuit 411 performs demultiplexing and error correction.

【０１３５】当該デマルチプレクス・エラーコレクショ
ン回路４１１からは、適応量子化された係数データとサ
ブ情報及びビット配分情報とが出力される。係数データ
及びサブ情報は、適応逆量子化回路４１２に送られる。
また、ビット配分情報は量子化ステップサイズコントロ
ール回路４１３に送られる。上記適応逆量子化回路４１
２は、上記量子化ステップサイズコントロール回路４１
３からの量子化ステップサイズ情報に基づいて、上記量
子化変換係数情報に対して逆量子化を施す。この適応逆
量子化回路４１２からの量子化圧縮変換係数はサブバン
ド・ブロックフローティングポイント伸張回路４１４に
送られる。The demultiplex error correction circuit 411 outputs the adaptively quantized coefficient data, sub information, and bit allocation information. The coefficient data and the sub information are sent to the adaptive inverse quantization circuit 412.
Also, the bit allocation information is sent to the quantization step size control circuit 413. The adaptive inverse quantization circuit 41
2 is the quantization step size control circuit 41
Inverse quantization is performed on the quantized transform coefficient information based on the quantization step size information from 3. The quantized compression transform coefficient from the adaptive dequantization circuit 412 is sent to the subband block floating point expansion circuit 414.

【０１３６】上記サブバンド・ブロックフローティング
ポイント伸張回路４１４では、前記図２４のサブバンド
・ブロックフローティングポイント圧縮回路３０４，３
１４の逆処理を行う。この伸張回路４１４の出力は、同
じく図２４の直交変換回路３０３，３１３の逆変換処理
を行う逆直交変換回路４１５によってＮポイントのサン
プルデータに変換され、ウインドウ・オーバーラップ加
算回路４１６に送られる。当該ウインドウ・オーバーラ
ップ加算回路４１６では、前記オーバーラップが解除さ
れて、ＰＣＭオーディオ信号として出力される。このＰ
ＣＭオーディオ信号が出力端子４１６から取り出され
る。In the sub-band / block floating-point expansion circuit 414, the sub-band / block floating-point compression circuits 304 and 3 shown in FIG.
The reverse process of 14 is performed. The output of the decompression circuit 414 is converted into N-point sample data by the inverse orthogonal transform circuit 415 which similarly performs the inverse transform processing of the orthogonal transform circuits 303 and 313 of FIG. 24, and is sent to the window overlap addition circuit 416. The window overlap adding circuit 416 removes the overlap and outputs the PCM audio signal. This P
The CM audio signal is taken out from the output terminal 416.

【０１３７】[0137]

【発明の効果】以上の説明からも明らかなように、本発
明のディジタルオーディオ信号の伝送装置及び方法にお
いては、複数チャネルの信号のうち、聴感上影響力の高
い信号からなるチャネルの信号に対しては聴感上の劣化
が少ない第１の圧縮率の第１の符号化方式によって圧縮
符号化を行い、聴感上影響力の低い信号からなるチャネ
ルの信号に対しては第１の圧縮率よりも高い第２の圧縮
率の第２の符号化方式で圧縮符号化を行うようにしてい
るため、特に重要な音に対しては高音質の圧縮符号化が
可能であると共に、ビット配分量（バイト配分量）の無
駄を無くすことが可能となる。As is clear from the above description, in the digital audio signal transmitting apparatus and method of the present invention, among the signals of a plurality of channels, a signal of a channel consisting of a signal having a high auditory influence is applied. For example, the compression coding is performed by the first coding method with the first compression rate that causes less deterioration in the auditory sense, and the compression rate is lower than that of the first compression rate for the signal of the channel including the signal that has a low auditory influence. Since the compression encoding is performed by the second encoding method having the high second compression rate, it is possible to perform the high-quality sound compression encoding for the particularly important sound, and the bit allocation amount (byte It is possible to eliminate the waste of the allocation amount.

【０１３８】また、本発明のディジタルオーディオ信号
の受信装置及び方法においては、複数チャネルの信号の
うち、聴感上影響力の高い信号からなるチャネルの信号
に対して聴感上の劣化が少ない第１の圧縮率の第１の符
号化方式によって圧縮符号化がされた信号と、聴感上影
響力の低い信号からなるチャネルの信号に対して第１の
圧縮率よりも高い第２の圧縮率の第２の符号化方式で圧
縮符号化がなされた信号を、それぞれ復号化可能とな
る。Further, in the digital audio signal receiving apparatus and method of the present invention, of the signals of a plurality of channels, there is less deterioration in the auditory sense with respect to the signal of the channel consisting of the signal having a high auditory influence. A second compression rate higher than the first compression rate with respect to a channel signal composed of a signal compression-encoded by the first compression rate encoding method and a signal having a low auditory influence. It becomes possible to respectively decode the signals that have been compression-encoded by the encoding method of.

【図面の簡単な説明】[Brief description of drawings]

【符号の説明】２０９〜２１６・・・Ａ／Ｄ変換器２１７〜２２２・・・高聴感圧縮符号化回路２２３，２２４・・・高率圧縮符号化回路２２５，２２６・・・ハイパスフィルタ２２８，２２９・・・ローパスフィルタ２２７，２３０・・・加算器２４１・・・マルチプレクサ３０２，３１３・・・バッファ３０３，３１３・・・直交変換回路３０４，３１４・・・サブバンドブロックフローティン
グポイント圧縮回路３０５，３１５・・・適応量子化回路３０６，３１６・・・マルチプレクス・エラーコレクシ
ョン回路３０８・・・ｌｏｇスペクトラルエンベロープ検出回路３０９・・・分配決定回路１１，１２・・・帯域分割フィルタ１３，１４，１５・・・ＭＤＣＴ回路１６，１７，１８・・・適応ビット配分符号化回路１９，２０，２１・・・ブロックサイズ決定回路３１・・・各チャネル情報信号入力端子３２・・・マッピング回路３３・・・ブロッキング回路３４・・・時間変化算出回路３５・・・スケ−ルファクタ算出回路３６・・・トーナリティ算出回路３７・・・正規化回路３８・・・ビット配分回路１１６，１１７，１１８・・・適応ビット配分復号化回
路１１３，１１４，１１５・・・ＩＭＤＣＴ回路１１２，１１１・・・ＩＱＭＦ回路４１１・・・デマルチプレクス・エラーコレクション回
路４１２・・・適応逆量子化回路４１３・・・量子化ステップサイズコントロール回路４１４・・・サブバンドブロックフローティングポイン
ト伸張回路４１５・・・逆直交変換回路４１６・・・ウインドウ・オーバーラップ加算回路５２０・・・許容雑音算出回路５２１・・・許容雑音算出回路入力端子５２２・・・帯域毎のエネルギ検出回路５２３・・・畳込みフィルタ回路５２４・・・引算器５２５・・・ｎ−ａｉ関数発生回路５２６・・・割算器５２７・・・合成回路５２８・・・減算器５３０・・・許容雑音補正回路５３２・・・最小可聴カーブ発生回路５３３・・・補正情報出力回路８０２・・・使用可能総ビット発生回路８０３・・・帯域毎のエネルギ算出回路８０４・・・エネルギ依存のビット配分回路８０５・・・聴覚許容雑音レベル依存のビット配分回路８０６・・・アダー８０８・・・スペクトルの滑らかさ算出回路８０９・・・ビット分割率決定回路８１１、８１２・・・マルチプライヤ９０５，９０６・・・正規化回路９０１・・・第１の量子化器９０３・・・第２の量子化器９０７，９０９・・・逆正規化回路９０４・・・加算器[Description of Codes] 209 to 216 ... A / D converters 217 to 222 ... High-audibility compression coding circuit 223, 224 ... High-rate compression coding circuit 225, 226 ... High-pass filter 228, 229 ... Low-pass filter 227, 230 ... Adder 241 ... Multiplexer 302, 313 ... Buffer 303, 313 ... Orthogonal transformation circuit 304, 314 ... Subband block floating point compression circuit 305, 315 ... Adaptive quantization circuit 306, 316 ... Multiplex error correction circuit 308 ... Log spectral envelope detection circuit 309 ... Distribution decision circuit 11, 12 ... Band division filter 13, 14, 15 ... MDCT circuit 16, 17, 18 ... Adaptive bit allocation encoding circuit 9, 20, 21 ... Block size determination circuit 31 ... Each channel information signal input terminal 32 ... Mapping circuit 33 ... Blocking circuit 34 ... Time change calculation circuit 35 ... Scale factor calculation Circuit 36 ... Tonality calculation circuit 37 ... Normalization circuit 38 ... Bit allocation circuit 116, 117, 118 ... Adaptive bit allocation decoding circuit 113, 114, 115 ... IMDCT circuit 112, 111. ..IQMF circuit 411 ... Demultiplex error correction circuit 412 ... Adaptive inverse quantization circuit 413 ... Quantization step size control circuit 414 ... Subband block floating point expansion circuit 415 ... Inverse Orthogonal transformation circuit 416 ... Window overlap addition circuit 52 ... Allowable noise calculation circuit 521 ... Allowable noise calculation circuit input terminal 522 ... Energy detection circuit for each band 523 ... Convolution filter circuit 524 ... Subtractor 525 ... n-ai function Generation circuit 526 ... Divider 527 ... Synthesis circuit 528 ... Subtractor 530 ... Allowable noise correction circuit 532 ... Minimum audible curve generation circuit 533 ... Correction information output circuit 802 ... Total usable bit generation circuit 803 ... Energy calculation circuit for each band 804 ... Energy dependent bit allocation circuit 805 ... Auditory permissible noise level dependent bit allocation circuit 806 ... Adder 808 ... Spectrum Smoothness calculation circuit 809 ... Bit division rate determination circuit 811, 812 ... Multiplier 905, 906 ... Normalization circuit 901 ... First Quantizer 903 ... Second quantizer 907, 909 ... Inverse normalization circuit 904 ... Adder

Claims

[Claims]

1. A digital audio signal transmitting apparatus for compressing and encoding digital audio signals of a plurality of channels (n channels, n is a positive integer greater than 3) and transmitting the digital audio signals of the n channels. Of the digital audio signals of m (n> m) channels, which have a higher audible influence than the digital audio signals of the other nm channels, the first is applied.
Second compression coding means for compressing and coding by the first coding method with a compression ratio of 1) and a second compression ratio higher than the first compression ratio for the digital audio signals of the n-m channels. Second compression encoding means for performing compression encoding by the second encoding method having a compression rate of, and the compression encoding output from the first compression encoding means and the second compression encoding. A device for transmitting a digital audio signal, characterized in that the compressed and encoded output from the means is transmitted together.

2. The n channels are a center channel, a left channel, a right channel, a surround left channel and a surround right channel, and the n−m channels are a surround left channel and a surround right channel. The digital audio signal transmission device according to claim 1.

3. The n channels are a center channel, a left channel, a right channel, a left center channel, a right center channel, a surround left channel and a surround right channel, and the n−m channels are a surround left channel. 2. The digital audio signal transmission device according to claim 1, wherein the transmission device is a surround right channel.

4. The first encoding method divides an input digital audio signal into a plurality of bands, blocks the digital audio signal of each band into a plurality of samples, and orthogonally transforms spectrum components in each block. ,
This is a method of adaptively compression-coding according to the auditory characteristics, and the second coding method obtains coefficient information obtained by orthogonally transforming an input digital audio signal for each plurality of samples and sub-information related thereto, and obtains each channel. 2. The digital audio signal transmission apparatus according to claim 1, wherein the method is a system for adaptively compressing and coding by allocating the information amount of each channel according to the energy of.

5. The digital audio signal transmission apparatus according to claim 1, wherein the transmission includes recording on a recording medium.

6. A digital audio signal of a plurality of channels (n channels, n is a positive integer greater than 3),
The first code having the first compression ratio for a digital audio signal of m (n> m) channels, which has a higher auditory influence than other digital audio signals of nm channels. The second encoding method has a second compression rate higher than the first compression rate with respect to the digital audio signals of the n−m channels while being compression-encoded by the encoding method. A receiving device for a digital audio signal, which receives a signal encoded by compression, wherein the first encoding is performed on the digital audio signals of the m channels among the digital audio signals of the n channels. A first decoding means for performing a first decompression decoding corresponding to the system, and a second decoding system for the digital audio signals of the nm channels. That the receiving apparatus in a digital audio signal and having a second decoding means for performing a second decompression decoding.