JPH09102742A

JPH09102742A - Encoding method and device, decoding method and device and recording medium

Info

Publication number: JPH09102742A
Application number: JP25862095A
Authority: JP
Inventors: Masatoshi Ueno; 正俊上野; Shinji Miyamori; 慎二宮森
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1995-10-05
Filing date: 1995-10-05
Publication date: 1997-04-15

Abstract

PROBLEM TO BE SOLVED: To encode the multichannel signals with high efficiency. SOLUTION: A mixer 102 produces the mixture processing data based on the audio data supplied through the input terminals 101a to 101e, and these processing data are supplied to the processing data extractors 103a to 103e and the encoders 105f and 105g respectively. The extractors 103a to 103e supply the data which are not reproduced by the mixture processing data to the encoders 105a to 105e, and the reproduction parameters of other data which are reproduced by the mixture processing data are supplied to a multiplexer 106. The reproduction parameters and the data which are encoded by the encoders 105a to 105g are turned into a bit stream by the multiplexer 106 and then outputted.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、符号化方法および
装置、復号化方法および装置、並びに記録媒体に関し、
例えば、ビデオテープレコーダ、ビデオディスクプレー
ヤ等のステレオや、いわゆるマルチサウンド音響システ
ムにおいて、マルチチャネルのディジタル信号を圧縮符
号化する場合に用いて好適な符号化方法および装置、復
号化方法および装置、並びに記録媒体に関する。TECHNICAL FIELD The present invention relates to an encoding method and apparatus, a decoding method and apparatus, and a recording medium,
For example, a stereo such as a video tape recorder, a video disc player, or a so-called multi-sound acoustic system, which is suitable for use in compressing and encoding a multi-channel digital signal, an encoding method and apparatus, a decoding method and apparatus, and Recording medium

【０００２】[0002]

【従来の技術】従来より、オーディオあるいは音声等の
信号の高能率符号化の手法および装置には種々あるが、
例えば、時間領域のオーディオ信号等を単位時間毎にブ
ロック化し、このブロック毎の時間軸の信号を周波数軸
上の信号に変換（直交変換）して複数の周波数帯域に分
割し、各帯域毎に符号化するブロック化周波数帯域分割
方式、いわゆる変換符号化（トランスフォームコーディ
ング）や、時間領域のオーディオ信号等を単位時間毎に
ブロック化しないで、複数の周波数帯域に分割して符号
化する非ブロック化周波数帯域分割方式である帯域分割
符号化（サブバンドコーディング：ＳＢＣ）等を挙げる
ことができる。2. Description of the Related Art Conventionally, there are various techniques and apparatuses for highly efficient encoding of signals such as audio or voice.
For example, an audio signal in the time domain is divided into blocks for each unit time, the time axis signal of each block is converted into a signal on the frequency axis (orthogonal conversion), and divided into a plurality of frequency bands. Blocking frequency band division method for encoding, so-called transform coding (transform coding), or non-blocking for dividing into a plurality of frequency bands and encoding without dividing the time domain audio signal into blocks for each unit time Band division coding (sub-band coding: SBC), which is a generalized frequency band division method, can be used.

【０００３】また、上述の帯域分割符号化と変換符号化
とを組み合わせた高能率符号化の手法および装置も考え
られており、この場合には、例えば、帯域分割符号化で
帯域分割を行った後、分割された各帯域毎の信号を周波
数領域の信号に直交変換し、直交変換された信号が各領
域毎に符号化される。Further, a high-efficiency coding method and apparatus combining the above-described band-division coding and transform coding has been considered. In this case, for example, band-division is performed by band-division coding. After that, the divided signal for each band is orthogonally transformed into a signal in the frequency domain, and the orthogonally transformed signal is encoded for each region.

【０００４】ここで、上述した帯域分割符号化において
用いられる帯域分割用フィルタとしては、例えばＱＭＦ
（直交ミラーフィルタ：Quadrature Mirror Filter）な
どのフィルタがあり、このＱＭＦフィルタは、文献「デ
ィジタル・コーディング・オブ・スピーチ・イン・サブ
バンズ」("Digital coding of speech in subbands"R.
E.Crochiere, BellSyst.Tech. J., Vol.55,No.8 1976)
に述べられている。このＱＭＦフィルタは、帯域を等バ
ンド幅に２分割するものであり、このフィルタにおいて
は、分割した帯域を後に合成する際にいわゆるエリアシ
ングが発生しないことが特徴となっている。Here, as the band division filter used in the above band division encoding, for example, QMF is used.
There is a filter such as (Quadrature Mirror Filter), and this QMF filter is referred to as "Digital coding of speech in subbands" R.
E. Crochiere, BellSyst.Tech. J., Vol.55, No.8 1976)
It is described in. This QMF filter divides the band into two equal bandwidths, and this filter is characterized in that so-called aliasing does not occur when the divided bands are combined later.

【０００５】また、文献「ポリフェイズ・クァドラチュ
ア・フィルターズ−新しい帯域分割符号化技術」("Poly
phase Quadrature filters -A newsubband coding tech
nique", oseph H. Rothweiler ICASSP 83, BOSTON) に
は、等帯域幅のフィルタ分割方法が述べられている。こ
のポリフェイズ・クァドラチュア・フィルタにおいて
は、信号を等バンド幅の複数の帯域に分割する際に一度
に分割できることが特徴となっている。In addition, the document "Polyphase Quadrature Filters-New Band Division Coding Technique"("Poly
phase Quadrature filters -A newsubband coding tech
nique ", oseph H. Rothweiler ICASSP 83, BOSTON) describes a method for dividing equal-bandwidth filters. In this polyphase quadrature filter, the signal is divided into multiple bands of equal bandwidth. The feature is that it can be divided at one time.

【０００６】さらに、上述した直交変換のスペクトル変
換としては、例えば、入力オーディオ信号を所定単位時
間（フレーム）でブロック化し、ブロック毎に離散フー
リエ変換（ＤＦＴ）、離散コサイン変換（ＤＣＴ）、ま
たはモディファイド離散コサイン変換（ＭＤＣＴ）等を
行うことで時間軸を周波数軸に変換するものがある。な
お、上記ＭＤＣＴについては、文献「時間領域エリアシ
ング・キャンセルを基礎とするフィルタ・バンク設計を
用いたサブバンド／変換符号化」("Subband/Transform
Coding Using Filter Bank Designs Based on Time Dom
ain Aliasing Cancellation", J.P.Princen A.B.Bradla
y, Univ.of Surrey Royal Melbourne Inst.of Tech. IC
ASSP 1987) に述べられている。Further, as the spectrum transformation of the above-mentioned orthogonal transformation, for example, an input audio signal is divided into blocks in a predetermined unit time (frame), and discrete Fourier transform (DFT), discrete cosine transform (DCT), or modified for each block. There is one that transforms a time axis into a frequency axis by performing discrete cosine transform (MDCT) or the like. Regarding the MDCT, reference is made to the document "Subband / Transform Coding Using Filter Bank Design Based on Time Domain Aliasing Cancellation"("Subband / Transform
Coding Using Filter Bank Designs Based on Time Dom
ain Aliasing Cancellation ", JPPrincen ABBradla
y, Univ.of Surrey Royal Melbourne Inst.of Tech. IC
ASSP 1987).

【０００７】このようにフィルタやスペクトル変換によ
って帯域毎に分割された信号を量子化することにより、
量子化雑音が発生する帯域を制御することができ、いわ
ゆるマスキング効果などの性質を利用して聴覚的により
高能率な符号化を行うことができる。また、ここで量子
化を行う前に、各帯域毎に例えばその帯域における信号
成分の絶対値の最大値で正規化を行うようにすれば、さ
らに高能率な符号化を行うことができる。In this way, by quantizing the signal divided for each band by the filter and the spectrum conversion,
It is possible to control the band in which the quantization noise is generated, and it is possible to perform auditory and more efficient encoding by utilizing the properties such as the so-called masking effect. Further, if the normalization is performed for each band, for example, with the maximum absolute value of the signal component in that band before the quantization is performed here, more efficient encoding can be performed.

【０００８】ここで、周波数帯域分割された各周波数成
分を量子化する場合の周波数分割幅としては、例えば人
間の聴覚特性を考慮した帯域幅を用いることが多い。す
なわち、一般に高域ほど帯域幅が広くなるような臨界帯
域（クリティカルバンド）と呼ばれている帯域幅で、オ
ーディオ信号を複数（例えば２５バンド）の帯域に分割
することがある。また、この時の各帯域毎のデータを符
号化する際には、各帯域毎に所定のビット配分、あるい
は、各帯域毎に適応的なビット割当て（ビットアロケー
ション）による符号化が行われる。例えば、上記ＭＤＣ
Ｔ処理されて得られた係数データを上記ビット配分によ
って符号化する際には、上記各ブロック毎のＭＤＣＴ処
理により得られる各帯域毎のＭＤＣＴ係数データに対し
て、適応的な配分ビット数（適応的な配分ビット数）で
符号化が行われることになる。Here, as the frequency division width in the case of quantizing each frequency component divided into frequency bands, for example, a bandwidth considering human auditory characteristics is often used. That is, the audio signal may be divided into a plurality of bands (for example, 25 bands) with a bandwidth generally called a critical band in which the higher the band, the wider the bandwidth. Further, at the time of encoding the data for each band at this time, encoding is performed by predetermined bit allocation for each band or adaptive bit allocation (bit allocation) for each band. For example, the MDC
When the coefficient data obtained by the T processing is encoded by the bit allocation, the number of adaptive allocation bits (adaptation to the MDCT coefficient data for each band obtained by the MDCT processing for each block) The encoding is performed with the number of distributed bits).

【０００９】上記ビット割当手法（ビット配分手法）と
しては、次の２つの手法が知られている。The following two methods are known as the bit allocation method (bit allocation method).

【００１０】例えば、文献「音声信号の適応変換符号
化」("Adaptive Transform Coding ofSpeech Signals",
IEEE Transactions of Acoustics, Speech, and Signa
l Processing, vol.ASSP-25, No.4, August 1977) で
は、各帯域毎の信号の大きさに基づいて、ビット割当を
行っている。For example, the document "Adaptive Transform Coding of Speech Signals",
IEEE Transactions of Acoustics, Speech, and Signa
Processing, vol.ASSP-25, No.4, August 1977), bit allocation is performed based on the signal size of each band.

【００１１】また、例えば文献「臨界帯域符号化器 −
ディジタル・エンコーディング・オブ・パーセプチュア
ル・リクワイアメンツ・オブ・ジ・オーディトリィ・シ
ステム」("The critical band coder --digital encodi
ng of the perceptual requirements of the auditory
system", M.A. Kransner MIT, ICASSP 1980) では、聴
覚マスキングを利用することで、各帯域毎に必要な信号
対雑音比を得て固定的なビット割当を行う手法が述べら
れている。In addition, for example, the document "Critical band encoder-
Digital Encoding of Perceptual Requirements of the Auditory System "(" The critical band coder --digital encodi
ng of the perceptual requirements of the auditory
system ", MA Kransner MIT, ICASSP 1980) describes a method that uses auditory masking to obtain a necessary signal-to-noise ratio for each band and perform fixed bit allocation.

【００１２】[0012]

【発明が解決しようとする課題】ところで、例えば上述
したようなサブバンドコーディング等を用いたオーディ
オ信号の高能率圧縮符号化方式においては、人間の聴覚
上の特性を利用し、オーディオデータを約１／５に圧縮
するような方式が既に実用化されている。なお、このオ
ーディオデータを約１／５に圧縮する高能率符号化方式
としては、例えばＭＤ（SONY社商標、Mini Disc）規格
に使用されている、ＡＴＲＡＣ（SONY社商標、Adaptive
TRansform Acoustic Coding）と呼ばれる方式がある。By the way, for example, in the high-efficiency compression encoding system for audio signals using the above-mentioned sub-band coding or the like, the human auditory characteristic is utilized to convert the audio data into about 1 A method of compressing to / 5 has already been put into practical use. As a high-efficiency encoding method for compressing the audio data to about 1/5, for example, ATRAC (a trademark of Sony Corporation, Adaptive, which is used in MD (a trademark of Sony Corporation, Mini Disc) standard).
There is a method called TRansform Acoustic Coding).

【００１３】また、通常のオーディオ機器の場合のみな
らず、例えば映画フィルム映写システム、高品位テレビ
ジョン、ビデオテープレコーダ、ビデオディスクプレー
ヤ等のステレオ若しくはマルチサウンド音響システムに
おいては、例えば４乃至８チャネル等の複数チャネルの
オーディオ信号あるいは音声信号を扱うようになりつつ
あり、この場合においても、ビットレートを削減する高
能率符号化を行うことが望まれている。Not only in the case of ordinary audio equipment but also in stereo or multi-sound sound systems such as movie film projection systems, high-definition televisions, video tape recorders, video disc players, etc., for example, 4 to 8 channels, etc. In this case, audio signals or voice signals of a plurality of channels are being handled, and even in this case, it is desired to perform high efficiency coding that reduces the bit rate.

【００１４】そして、それに加えて、複数チャネルのオ
ーディオ信号あるいは音声信号とは別に、既存のステレ
オ若しくは音響システムにおいても再生を可能とするた
めに、例えば４乃至８チャネル等の複数チャネルのオー
ディオ信号あるいは音声信号を、予めダウンミキシング
などの手法により、例えば２チャネルのデータに変換
し、変換されたこの２チャネルのデータを先の複数チャ
ネルのオーディオ信号とは別に記録する手法が取られる
こともある。In addition to this, in addition to audio signals or audio signals of a plurality of channels, in order to enable reproduction even in an existing stereo or acoustic system, audio signals of a plurality of channels such as 4 to 8 channels or In some cases, the audio signal may be converted into, for example, 2-channel data by a method such as down-mixing, and the converted 2-channel data may be recorded separately from the audio signals of the plurality of channels.

【００１５】図１０は、各チャネルの信号を圧縮符号化
するとともに、各チャネルの信号に対してミキシングな
どの処理を施して得られた信号も同時に符号化するマル
チチャネルの符号化装置の構成例を示している。FIG. 10 shows an example of the configuration of a multi-channel coding apparatus for compressing and coding the signals of the respective channels, and simultaneously coding the signals obtained by subjecting the signals of the respective channels to processing such as mixing. Is shown.

【００１６】図１０に示した符号化装置は、入力端子１
０１ａ乃至１０１ｅより入力された各チャネルの信号を
混合し、２チャネルの信号に変換する混合器１０２、各
チャネルの信号をそれぞれ符号化する符号器１０５ａ乃
至１０５ｅ、混合器１０２より供給される混合された信
号を符号化する符号器１０５ｆ，１０５ｇ、符号器１０
５ａ乃至１０５ｇからの符号化された信号をビットスト
リームに変換するマルチプレクサ１０６、およびマルチ
プレクサ１０６からのビットストリームを出力する出力
端子１０７より構成されている。The encoder shown in FIG. 10 has an input terminal 1
Mixers 102 for mixing the signals of the respective channels input from 01a to 101e and converting them into signals of the two channels, encoders 105a to 105e for encoding the signals of the respective channels, and mixers supplied by the mixer 102. Encoders 105f and 105g for encoding the generated signal, encoder 10
It comprises a multiplexer 106 for converting the coded signals from 5a to 105g into a bit stream, and an output terminal 107 for outputting the bit stream from the multiplexer 106.

【００１７】入力端子１０１ａ乃至１０１ｅを介して供
給された、例えば、センタ（Ｃ）チャネル、レフト
（Ｌ）チャネル、ライト（Ｒ）チャネル、レフトサラウ
ンド（ＳＬ）チャネル、およびライトサラウンド（Ｓ
Ｒ）チャネルの各オーディオデータは、シングルチャネ
ル用符号器１０５ａ乃至１０５ｅにそれぞれ供給され
る。これら符号器１０５ａ乃至１０５ｅでは、入力信号
に対して符号化が行われ、符号化されたデータがマルチ
プレクサ１０６に供給される。マルチプレクサ１０６で
は各チャネルの符号化されたデータが１つのビットスト
リームにされ、このビットストリームが出力端子１０７
から出力される。For example, a center (C) channel, a left (L) channel, a right (R) channel, a left surround (SL) channel, and a right surround (S) supplied through the input terminals 101a to 101e.
Each audio data of the R) channel is supplied to each of the single-channel encoders 105a to 105e. In these encoders 105a to 105e, the input signal is encoded, and the encoded data is supplied to the multiplexer 106. The multiplexer 106 converts the encoded data of each channel into one bit stream, and this bit stream is output to the output terminal 107.
Output from

【００１８】一方、入力端子１０１ａ乃至１０１ｅより
入力されたオーディオデータは、混合器１０２にも供給
され、例えば、次のような割合で混合処理が施され、２
チャネルのデータとして再構成される。On the other hand, the audio data input from the input terminals 101a to 101e is also supplied to the mixer 102, and, for example, the mixing processing is performed at the following ratio,
Reconstructed as channel data.

【００１９】すなわち、一方のチャネル（Ｌ_mixチャネ
ル）は、Ｌチャネル、Ｒチャネル、Ｃチャネル、ＳＬチ
ャネル、およびＳＲチャネルが、１．００００，０．０
０００，０．７０７１，０．７０７１，および０．００
００の割合で混合される。他方のチャネル（Ｒ_mixチャ
ネル）は、Ｌチャネル、Ｒチャネル、Ｃチャネル、ＳＬ
チャネル、およびＳＲチャネルが、０．００００，１．
００００，０．７０７１，０．００００，および０．７
０７１の割合で混合される。That is, in one channel (L _mix channel), the L channel, the R channel, the C channel, the SL channel, and the SR channel are 1.0000 and 0.0.
000, 0.7071, 0.7071, and 0.00
It is mixed in the ratio of 00. The other channel (R _mix channel) is L channel, R channel, C channel, SL
Channels and SR channels are 0.0000, 1.
0000, 0.7071, 0.0000, and 0.7
071 mixed.

【００２０】これら混合処理によって再構成された２チ
ャネルのデータは、符号器１０５ｆ，１０５ｇにそれぞ
れ供給され、符号化される。符号器１０５ｆ，１０５ｇ
において符号化されたデータはマルチプレクサ１０６に
供給され、上記２チャネルの符号化されたデータが１つ
のビットストリームにされた後、出力端子１０７より出
力される。The two-channel data reconstructed by these mixing processes are supplied to the encoders 105f and 105g and encoded. Encoders 105f and 105g
The encoded data in 1 is supplied to the multiplexer 106, and the encoded data of the above two channels is converted into one bit stream, and then output from the output terminal 107.

【００２１】図１１は、各チャネル毎に復号化を行うマ
ルチチャネルの復号化装置の構成例を示している。ま
た、図１２は、複数チャネルのディジタル信号の一部ま
たは全部が混合処理された信号の復号化を行う２チャネ
ルの復号化装置の構成例を示している。FIG. 11 shows an example of the configuration of a multi-channel decoding device that performs decoding for each channel. Further, FIG. 12 shows a configuration example of a two-channel decoding device that decodes a signal in which a part or all of digital signals of a plurality of channels are mixed.

【００２２】図１１に示した復号化装置は、入力端子１
３１より入力されたビットストリームを各チャネルの符
号化データに分割するデマルチプレクサ１３２、デマル
チプレクサ１３２からの各チャネルに対応する符号化デ
ータをそれぞれ復号する復号器１３３ａ乃至１３３ｅ、
および復号器１３３ａ乃至１３３ｅにおいて復号化され
た信号を出力する出力端子１３６ａ乃至１３６ｅより構
成されている。The decoding device shown in FIG. 11 has an input terminal 1
Demultiplexer 132 that divides the bit stream input from 31 into encoded data of each channel, decoders 133a to 133e that respectively decode encoded data corresponding to each channel from demultiplexer 132,
And output terminals 136a to 136e for outputting the signals decoded by the decoders 133a to 133e.

【００２３】入力端子１３１を介して供給される符号化
されたビットストリームデータは、デマルチプレクサ１
３２において各チャネルに対応する符号化データに分割
され、復号器１３３ａ乃至１３３ｅにそれぞれ供給され
る。復号器１３３ａ乃至１３３ｅに供給された符号化デ
ータは、復号器１３３ａ乃至１３３ｅにおいてそれぞれ
復号化され、復号化されたオーディオデータは、出力端
子１３６ａ乃至１３６ｅからそれぞれ出力される。The encoded bit stream data supplied through the input terminal 131 is the demultiplexer 1
At 32, the data is divided into encoded data corresponding to each channel and is supplied to each of the decoders 133a to 133e. The encoded data supplied to the decoders 133a to 133e are decoded in the decoders 133a to 133e, respectively, and the decoded audio data are output from the output terminals 136a to 136e, respectively.

【００２４】また、図１２に示した復号化装置は、入力
端子１３１より入力された符号化されたビットストリー
ムデータを混合処理された２チャネルの符号化データに
分割するデマルチプレクサ１３２、混合処理された２チ
ャネルの符号化データをそれぞれ復号化する復号器１３
３ｆ，１３３ｇより構成される。In the decoding apparatus shown in FIG. 12, the demultiplexer 132 for dividing the encoded bit stream data input from the input terminal 131 into the mixed two-channel encoded data is subjected to the mixing processing. Decoder 13 for decoding each of the two-channel encoded data
It is composed of 3f and 133g.

【００２５】入力端子１３１を介して供給された符号化
されたビットストリームデータは、デマルチプレクサ１
３２において混合処理された２チャネルの符号化データ
に分割される。分割された２チャネルの符号化データ
は、復号器１３３ｆ，１３３ｇにそれぞれ供給される。
復号器１３３ｆ，１３３ｇに供給された符号化データは
それぞれ復号化され、復号化されたオーディオデータ
は、出力端子１３６ｆ，１３６ｇよりそれぞれ出力され
る。The encoded bit stream data supplied through the input terminal 131 is the demultiplexer 1
In 32, the mixed data is divided into two-channel encoded data. The divided 2-channel encoded data is supplied to the decoders 133f and 133g, respectively.
The encoded data supplied to the decoders 133f and 133g are decoded, and the decoded audio data is output from the output terminals 136f and 136g, respectively.

【００２６】このように、複数チャネルのオーディオ信
号あるいは音声信号とは別に、それらの信号が混合処理
された例えば２チャネルの信号を記録するような場合に
おいては、さらにビットレートを削減する高能率符号化
を行うことが望まれている。As described above, in the case of recording, for example, a two-channel signal in which these signals are mixed and processed separately from the audio signal or the voice signal of a plurality of channels, a high efficiency code for further reducing the bit rate. It is desired to implement

【００２７】しかしながら、図１０，図１１、および図
１２に示したような構成の符号化装置および復号化装置
においては、上記ディジタルオーディオデータを約１／
５に圧縮する高能率符号化方式は、シングルチャネル用
の符号化方式であり、これを用いてマルチチャネルオー
ディオデータを符号化する場合には、チャネル間のデー
タの依存関係や、各チャネルのデータ特性、フォーマッ
ト特性といった要素を用いた効果的なデータ符号化処理
をすることができない。すなわち、ＣチャネルとＬチャ
ネルのオーディオデータが似ているとか、あるいは、フ
ォーマット的に左右のサラウンドチャネルのオーディオ
データが似ているといった要素を利用することができな
い。However, in the encoding device and the decoding device having the configurations shown in FIGS. 10, 11 and 12, the digital audio data is reduced to about 1 /
The high-efficiency coding method of compressing to 5 is a coding method for a single channel, and when coding multi-channel audio data using this, data dependence between channels and data of each channel It is not possible to perform effective data coding processing using elements such as characteristics and format characteristics. That is, it is impossible to use an element that the audio data of the C channel and the audio data of the L channel are similar to each other, or the audio data of the left and right surround channels are similar in terms of format.

【００２８】本発明は、このような状況に鑑みてなされ
たものであり、マルチチャネルの信号の圧縮符号化にお
いて、マルチチャネル間のディジタルデータの相関関係
の程度に適した高圧縮を、既存の符号器および復号器を
利用して実現可能にするものである。The present invention has been made in view of such a situation, and in the compression coding of multi-channel signals, high compression suitable for the degree of correlation of digital data between multi-channels has been achieved. It can be realized by using an encoder and a decoder.

【００２９】[0029]

【課題を解決するための手段】請求項１に記載の符号化
方法は、ディジタル信号の特性および再生環境に対応し
て、少なくとも１つのチャネルのディジタル信号の一部
または全部の周波数帯域を少なくとも１つの混合チャネ
ルに混合し、ディジタル信号から、混合チャネルの混合
ディジタル信号によって再現する信号を除いた、個別に
符号化する個別符号化ディジタル信号を抽出し、混合デ
ィジタル信号および個別符号化ディジタル信号を符号化
することを特徴とする。According to a first aspect of the present invention, there is provided at least one frequency band of at least one part of a digital signal of at least one channel corresponding to a characteristic of the digital signal and a reproduction environment. Individual mixed coded digital signals that are mixed into two mixed channels and are not encoded by the mixed digital signals of the mixed channels are reproduced. It is characterized by

【００３０】請求項２に記載の符号化装置は、ディジタ
ル信号の特性および再生環境に対応して、少なくとも１
つのチャネルのディジタル信号の一部または全部の周波
数帯域を少なくとも１つの混合チャネルに混合する混合
手段と、ディジタル信号から、混合チャネルの混合ディ
ジタル信号によって再現する信号を除いた、個別に符号
化する個別符号化ディジタル信号を抽出する抽出手段
と、混合ディジタル信号および個別符号化ディジタル信
号を符号化する符号化手段とを備えることを特徴とす
る。According to a second aspect of the present invention, the encoding device has at least 1 in accordance with the characteristics of the digital signal and the reproduction environment.
Mixing means for mixing part or all of the frequency bands of the digital signals of one channel into at least one mixing channel, and individually encoding the digital signal excluding the signal reproduced by the mixed digital signal of the mixing channel It is characterized in that it comprises extraction means for extracting the encoded digital signal and encoding means for encoding the mixed digital signal and the individual encoded digital signal.

【００３１】請求項１１に記載の復号化方法は、符号化
情報に基づいて、混合ディジタル信号を復号化し、復号
化された混合ディジタル信号の一部または全部を用い
て、ディジタル信号を復元するための復元用データを作
成し、個別符号化ディジタル信号と復元用データを合成
し、ディジタル信号を復元することを特徴とする。The decoding method according to claim 11 is for decoding the mixed digital signal based on the coded information and for restoring the digital signal by using a part or all of the decoded mixed digital signal. It is characterized in that the data for restoration is created, the individually encoded digital signal and the data for restoration are combined, and the digital signal is restored.

【００３２】請求項１２に記載の復号化装置は、符号化
情報に基づいて、混合ディジタル信号を復号化する復号
化手段と、復号化手段により復号化された混合ディジタ
ル信号の一部または全部を用いて、ディジタル信号を復
元するための復元用データを作成する復元用データ作成
手段と、復元用データ作成手段により作成された復元用
データと、個別符号化ディジタル信号を合成し、ディジ
タル信号を復元する復元手段とを備えることを特徴とす
る。According to a twelfth aspect of the present invention, there is provided a decoding device for decoding the mixed digital signal based on the coded information, and a part or all of the mixed digital signal decoded by the decoding means. Using the restoring data creating means for creating the restoring data for restoring the digital signal, the restoring data created by the restoring data creating means, and the individual encoded digital signal are combined to restore the digital signal. And a restoring means for performing the restoration.

【００３３】請求項１８に記載の記録媒体は、複数チャ
ネルの少なくとも１つのチャネルのディジタル信号の一
部または全部の周波数帯域が、少なくとも１つの混合チ
ャネルに混合され、ディジタル信号の周波数特性および
再生環境に対応して、ディジタル信号から、混合チャネ
ルの混合ディジタル信号によって再現する信号を除い
た、個別に符号化する個別符号化ディジタル信号が抽出
され、所定の符号化情報に基づいて符号化された混合デ
ィジタル信号および個別符号化ディジタル信号が記録さ
れることを特徴とする。In the recording medium according to the eighteenth aspect, a part or all of the frequency bands of the digital signals of at least one channel of the plurality of channels are mixed in at least one mixing channel, and the frequency characteristics of the digital signal and the reproduction environment. Corresponding to, the individual coded digital signal to be individually coded is extracted from the digital signal excluding the signal reproduced by the mixed digital signal of the mixed channel, and is mixed based on the predetermined coded information. A digital signal and an individually encoded digital signal are recorded.

【００３４】請求項１に記載の符号化方法においては、
ディジタル信号の特性および再生環境に対応して、少な
くとも１つのチャネルのディジタル信号の一部または全
部の周波数帯域が少なくとも１つの混合チャネルに混合
され、ディジタル信号から、混合チャネルの混合ディジ
タル信号によって再現する信号が除かれた、個別に符号
化する個別符号化ディジタル信号が抽出され、混合ディ
ジタル信号および個別符号化ディジタル信号が符号化さ
れる。従って、混合ディジタル信号に基づいて元のディ
ジタル信号を再現することができ、ディジタル信号の圧
縮率を高めることができる。In the encoding method according to claim 1,
Depending on the characteristics of the digital signal and the reproduction environment, some or all of the frequency bands of the digital signal of the at least one channel are mixed into the at least one mixed channel and reproduced from the digital signal by the mixed digital signal of the mixed channel. The individual coded digital signals to be coded separately from which the signals have been removed are extracted, and the mixed digital signal and the individual coded digital signals are coded. Therefore, the original digital signal can be reproduced based on the mixed digital signal, and the compression rate of the digital signal can be increased.

【００３５】請求項２に記載の符号化装置においては、
ディジタル信号の特性および再生環境に対応して、混合
手段により、少なくとも１つのチャネルのディジタル信
号の一部または全部の周波数帯域が少なくとも１つの混
合チャネルに混合され、抽出手段により、ディジタル信
号から、混合チャネルの混合ディジタル信号によって再
現する信号が除かれた、個別に符号化する個別符号化デ
ィジタル信号が抽出され、符号化手段により、混合ディ
ジタル信号および個別符号化ディジタル信号が符号化さ
れる。従って、混合ディジタル信号に基づいて元のディ
ジタル信号を再現することができ、ディジタル信号の圧
縮率を高めることができる。In the encoding device according to the second aspect,
Depending on the characteristics of the digital signal and the reproduction environment, some or all of the frequency bands of the digital signal of the at least one channel are mixed by the mixing means into the at least one mixing channel, and the mixing means mixes the digital signal by the extraction means. The individual coded digital signal to be individually coded is extracted by removing the signal reproduced by the mixed digital signal of the channel, and the mixed digital signal and the individual coded digital signal are coded by the coding means. Therefore, the original digital signal can be reproduced based on the mixed digital signal, and the compression rate of the digital signal can be increased.

【００３６】請求項１１に記載の復号化方法において
は、符号化情報に基づいて、混合ディジタル信号が復号
化され、復号化された混合ディジタル信号の一部または
全部を用いて、ディジタル信号を復元するための復元用
データが作成され、個別符号化ディジタル信号と復元用
データが合成され、ディジタル信号が復元される。従っ
て、混合ディジタル信号に基づいて元のディジタル信号
を復元することができる。In the decoding method according to the eleventh aspect, the mixed digital signal is decoded based on the coded information, and the digital signal is restored by using a part or all of the decoded mixed digital signal. Restoration data is created, and the individually encoded digital signal and the restoration data are combined to restore the digital signal. Therefore, the original digital signal can be restored based on the mixed digital signal.

【００３７】請求項１２に記載の復号化装置において
は、復号化手段により、符号化情報に基づいて、混合デ
ィジタル信号が復号化され、復元用データ作成手段によ
り、混合ディジタル信号の一部または全部を用いて、デ
ィジタル信号を復元するための復元用データが作成さ
れ、復元手段により、復元用データと個別符号化ディジ
タル信号が合成され、ディジタル信号が復元される。従
って、混合ディジタル信号に基づいて、元のディジタル
信号を復元することができる。In the decoding device according to the twelfth aspect of the present invention, the decoding means decodes the mixed digital signal based on the coded information, and the restoration data creating means makes part or all of the mixed digital signal. Is used to create restoration data for restoring the digital signal, and the restoration means synthesizes the restoration data and the individual encoded digital signal to restore the digital signal. Therefore, the original digital signal can be restored based on the mixed digital signal.

【００３８】請求項１８に記載の記録媒体においては、
所定の符号化情報に基づいて符号化された混合ディジタ
ル信号および個別符号化ディジタル信号が記録される。
従って、高能率符号化されたディジタル信号を記録し、
それを再生することができる。In the recording medium according to claim 18,
The mixed digital signal and the individually encoded digital signal encoded based on predetermined encoded information are recorded.
Therefore, a high efficiency coded digital signal is recorded,
You can play it.

【００３９】[0039]

【発明の実施の形態】以下、本発明の実施例について図
面を参照しながら説明する。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below with reference to the drawings.

【００４０】図１は、本発明の符号化方法が適用される
符号化装置の一実施例の構成を示すブロック図である。
同図に示したように、本発明の符号化装置は、シングル
チャネル用の圧縮符号化器（例えば上述したいわゆるＡ
ＴＲＡＣ方式の符号化器）を複数用いて、マルチチャネ
ルの圧縮符号化を実現するものである。FIG. 1 is a block diagram showing the configuration of an embodiment of an encoding device to which the encoding method of the present invention is applied.
As shown in the figure, the encoding apparatus of the present invention is a single-channel compression encoder (for example, the so-called A described above).
A plurality of TRAC encoders are used to realize multi-channel compression encoding.

【００４１】すなわち、図１に示した符号化装置は、複
数チャネルのディジタルオーディオ信号を符号化し、符
号化されたディジタルオーディオ信号と共に符号化のパ
ラメータ情報を出力する装置であり、混合器１０２（混
合手段）は、複数チャネルのディジタル信号の一部また
は全部を混合する。処理データ抽出器１０３ａ乃至１０
３ｅ（抽出手段）は、混合器１０２より供給された混合
されたディジタル信号（混合処理データ）と、入力端子
１０１ａ乃至１０１ｅより供給されたオーディオデータ
を比較し、このオーディオデータから、混合処理データ
とは別に、個別に符号化する方が適切であると判断した
オーディオデータ（個別符号化データ）を抽出し、符号
化器１０５ａ乃至１０５ｅ（符号化手段）に供給する。
また、混合器１０２より供給された混合処理データによ
って元のオーディオデータを再現する方が有効であると
判断された残りのオーディオデータについては、それを
再現するための再現用のパラメータを生成し、マルチプ
レクサ１０６に供給するようになされている。That is, the encoding device shown in FIG. 1 is a device for encoding digital audio signals of a plurality of channels and outputting encoding parameter information together with the encoded digital audio signals. Means) mixes some or all of the digital signals of the plurality of channels. Processed data extractors 103a to 10
3e (extracting means) compares the mixed digital signal (mixing processing data) supplied from the mixer 102 with the audio data supplied from the input terminals 101a to 101e, and from this audio data, the mixing processing data is obtained. Separately, audio data (individually encoded data) determined to be more suitable for individual encoding is extracted and supplied to the encoders 105a to 105e (encoding means).
Further, for the remaining audio data for which it is judged that it is more effective to reproduce the original audio data by the mixed processing data supplied from the mixer 102, a reproduction parameter for reproducing the remaining audio data is generated, It is adapted to be supplied to the multiplexer 106.

【００４２】符号器１０５ａ乃至１０５ｅは、処理デー
タ抽出器１０３ａ乃至１０３ｅより供給された個別符号
化データを符号化し、出力する。符号器１０５ｆおよび
１０５ｇは、混合器１０２より供給された混合処理デー
タを符号化し、出力するようになされている。The encoders 105a to 105e encode the individual encoded data supplied from the processed data extractors 103a to 103e and output the encoded data. The encoders 105f and 105g are configured to encode and output the mixed processing data supplied from the mixer 102.

【００４３】マルチプレクサ１０６は、符号器１０５ａ
乃至１０５ｇより供給された符号化された符号化データ
および再現用のパラメータを１つのビットストリームに
した後、出力端子１０７より出力するようになされてい
る。The multiplexer 106 has an encoder 105a.
To 105g, the encoded data and the reproduction parameter supplied from the first to 105 g are converted into one bit stream, which is then output from the output terminal 107.

【００４４】ここでは、センタ（Ｃ）チャネル、レフト
（Ｌ）チャネル、ライト（Ｒ）チャネル、レフトサラウ
ンド（ＳＬ）チャネル、およびライトサラウンド（Ｓ
Ｒ）チャネルの５チャネルオーディオデータが、入力端
子１０１ａ乃至１０１ｅよりそれぞれ入力されるものと
して説明する。また、混合器１０２は、これら５チャネ
ルオーディオデータに混合処理を施して、２チャネルオ
ーディオデータを作成し、出力するものとする。Here, the center (C) channel, the left (L) channel, the right (R) channel, the left surround (SL) channel, and the right surround (S).
The description will be made assuming that the R channel 5-channel audio data is input from the input terminals 101a to 101e, respectively. Further, the mixer 102 performs a mixing process on these 5-channel audio data to create 2-channel audio data, and outputs the 2-channel audio data.

【００４５】図１に示した実施例においては、入力端子
１０１ａ乃至１０１ｅを介して供給されたセンタ
（Ｃ）、レフト（Ｌ）、ライト（Ｒ）、レフトサラウン
ド（ＳＬ）、およびライトサラウンド（ＳＲ）の各チャ
ネルのオーディオデータは、まず混合器１０２に入力さ
れる。混合器１０２は、基本的には図１０に示した混合
器１０２と同様のものであるのでその詳細な説明は省略
するが、図１０を参照して上述したように、入力端子１
０１ａ乃至１０１ｅより入力された各チャネルのオーデ
ィオデータが混合され、２チャネルのデータとして再構
成される。混合器１０２の出力は、符号器１０５ｆ乃至
１０５ｇに供給されるとともに、処理データ抽出器１０
３ａ乃至１０３ｅにも供給される。In the embodiment shown in FIG. 1, the center (C), the left (L), the right (R), the left surround (SL), and the right surround (SR) supplied via the input terminals 101a to 101e. The audio data of each channel is first input to the mixer 102. The mixer 102 is basically the same as the mixer 102 shown in FIG. 10, and a detailed description thereof will be omitted. However, as described above with reference to FIG.
The audio data of each channel input from 01a to 101e are mixed and reconfigured as two-channel data. The output of the mixer 102 is supplied to the encoders 105f to 105g and the processed data extractor 10
It is also supplied to 3a to 103e.

【００４６】処理データ抽出器１０３ａ乃至１０３ｅに
おいては、各チャネル毎に混合器１０２からの出力と入
力端子１０１ａ乃至１０１ｅより供給されたオーディオ
データとが比較され、個別に符号化するのが適切と判断
されたオーディオデータが抽出され、符号器１０５ａ乃
至１０５ｅに出力される。一方、混合器１０２からの出
力によって元のオーディオデータを再現する方が有効と
判断された残りのオーディオデータについては、再現用
のパラメータが生成され、マルチプレクサ１０６に供給
される。処理データ抽出器１０３ａ乃至１０３ｅの詳細
な構成および動作については後述する。In the processed data extractors 103a to 103e, the output from the mixer 102 is compared with the audio data supplied from the input terminals 101a to 101e for each channel, and it is judged appropriate to individually encode. The audio data thus extracted is extracted and output to the encoders 105a to 105e. On the other hand, for the remaining audio data for which it is determined that it is more effective to reproduce the original audio data by the output from the mixer 102, a reproduction parameter is generated and supplied to the multiplexer 106. Detailed configurations and operations of the processed data extractors 103a to 103e will be described later.

【００４７】符号器１０５ａ乃至１０５ｅにおいては、
処理データ抽出器１０３ａ乃至１０３ｅより供給された
個別に符号化すべきオーディオデータ（個別符号化デー
タ）がそれぞれ符号化され、マルチプレクサ１０６に供
給される。符号器１０５ｆ乃至１０５ｇにおいては、混
合器１０２からの出力データ（混合処理データ）がそれ
ぞれ符号化され、マルチプレクサ１０６に供給される。
符号器１０５ａ乃至１０５ｇは、図１０に示した符号器
１０２ａ乃至１０２ｇと基本的に同様のものである。そ
の詳細な構成および動作については後述する。In the encoders 105a to 105e,
The audio data (individually encoded data) to be encoded individually supplied from the processed data extractors 103a to 103e are each encoded and supplied to the multiplexer 106. In the encoders 105f to 105g, the output data (mixing processing data) from the mixer 102 is encoded and supplied to the multiplexer 106.
The encoders 105a to 105g are basically the same as the encoders 102a to 102g shown in FIG. The detailed configuration and operation will be described later.

【００４８】マルチプレクサ１０６においては、各符号
器１０５ａ乃至１０５ｇより供給された符号化された符
号化データと、処理データ抽出器１０３ａ乃至１０３ｅ
より供給された再現用のパラメータが１つのビットスト
リームにされ、出力端子１０７より出力される。In the multiplexer 106, the encoded data supplied from the respective encoders 105a to 105g and the processed data extractors 103a to 103e.
The parameters for reproduction supplied by the above are converted into one bit stream and output from the output terminal 107.

【００４９】図２は、図１の実施例における処理データ
抽出器１０３ａの内部構成例を示している。処理データ
抽出器１０３ｂ乃至１０３ｅの構成および動作は、処理
データ抽出器１０３ａの場合と基本的に同様であるの
で、その説明は省略する。FIG. 2 shows an internal configuration example of the processed data extractor 103a in the embodiment of FIG. The configurations and operations of the processed data extractors 103b to 103e are basically the same as those of the processed data extractor 103a, and therefore description thereof will be omitted.

【００５０】処理データ抽出器１０３ａを構成する帯域
分割フィルタ２０２ａ乃至２０２ｂは、入力端子２０１
ａより入力されたセンタチャネルのオーディオデータ
を、それを分析する周波数帯域毎に分割する。処理デー
タ分析器２０４ａ乃至２０４ｂは、混合処理が施された
混合処理データを、元のオーディオデータを再現するた
めの再現用信号として利用する各周波数帯域毎にあり、
当該チャネルのオーディオデータと混合処理データを比
較して、混合処理データから復元する周波数帯域につい
て、当該チャネルのオーディオデータを復元するのに有
効なパラメータ（例えばスケールファクタ）を分析し、
分析結果を処理データ作成器２０５に出力するようにな
されている。The band division filters 202a and 202b constituting the processed data extractor 103a are connected to the input terminal 201.
The center channel audio data input from a is divided for each frequency band in which it is analyzed. The processed data analyzers 204a and 204b are provided for each frequency band that uses the mixed processed data as a reproduction signal for reproducing the original audio data,
The audio data of the channel is compared with the mixed processing data, and for the frequency band restored from the mixed processing data, an effective parameter (for example, a scale factor) for restoring the audio data of the channel is analyzed,
The analysis result is output to the processed data generator 205.

【００５１】パラメータ記録メモリ２０３ａは、処理デ
ータ分析器２０４ａより供給された分析結果としてのパ
ラメータを記憶する。同様に、パラメータ記録メモリ２
０３ｂは、処理データ分析器２０４ｂより供給された分
析結果としてのパラメータを記憶するようになされてい
る。The parameter recording memory 203a stores the parameter as the analysis result supplied from the processed data analyzer 204a. Similarly, the parameter recording memory 2
03b stores parameters as analysis results supplied from the processed data analyzer 204b.

【００５２】処理データ作成器２０５は、処理データ分
析器２０４ａ乃至２０４ｂからの出力に基づいて、入力
端子２０１ａより入力された当該チャネルのオーディオ
データから、符号化に必要な周波数帯域成分のみを抜き
出し、出力端子２０６ａより出力する。また、処理デー
タ分析器２０４ａ乃至２０４ｂより供給された、当該チ
ャネルのオーディオデータを復元するためのパラメータ
を１つにまとめ、出力端子２０６ｂより出力するように
なされている。Based on the outputs from the processed data analyzers 204a and 204b, the processed data generator 205 extracts only the frequency band component required for encoding from the audio data of the channel input from the input terminal 201a, Output from the output terminal 206a. Further, the parameters for restoring the audio data of the channel, which are supplied from the processed data analyzers 204a and 204b, are put together into one and output from the output terminal 206b.

【００５３】入力端子２０１ａに、図１における入力端
子１０１ａからのセンタチャネルのオーディオデータが
入力され、入力端子２０１ｂ，２０１ｃに、混合器１０
２からの混合処理データが入力されると、入力端子２０
１ａ，２０１ｂ、および２０１ｃを介して入力されたデ
ータは、帯域分割フィルタ２０２ａ乃至２０２ｂにそれ
ぞれ供給され、このオーディオデータを分析する周波数
帯域（例えばクリティカルバンド）毎に分割され、処理
データ分析器２０４ａ乃至２０４ｂに供給される。The center channel audio data from the input terminal 101a in FIG. 1 is input to the input terminal 201a, and the mixer 10 is input to the input terminals 201b and 201c.
When the mixed processing data from 2 is input, the input terminal 20
The data input via 1a, 201b, and 201c are respectively supplied to band division filters 202a to 202b, divided into frequency bands (for example, critical bands) for analyzing the audio data, and processed data analyzers 204a to 204a. It is supplied to 204b.

【００５４】処理データ分析器２０４ａ乃至２０４ｂ
は、混合処理が施された混合処理データを、元のオーデ
ィオデータを再現するための再現用信号として利用する
各周波数帯域毎にあり、当該チャネルのオーディオデー
タと混合処理データを比較して、混合処理データから当
該チャネルのオーディオデータを復元するのに有効なパ
ラメータ（例えばスケールファクタ）が分析され、分析
結果が処理データ作成器２０５に出力される。Processed data analyzers 204a-204b
Is for each frequency band that uses the mixed processed data as a reproduction signal for reproducing the original audio data, compares the audio data of the channel and the mixed processed data, and mixes them. A parameter (for example, a scale factor) effective for restoring the audio data of the channel from the processed data is analyzed, and the analysis result is output to the processed data generator 205.

【００５５】このパラメータの分析には、それ以前のフ
レームにおける分析結果が用いられるため、分析結果は
これを保存するためのパラメータ記録メモリ２０３ａ乃
至２０３ｂにも供給され、記録される。処理データ分析
器２０４ａ乃至２０４ｂにおいては、パラメータ記録メ
モリ２０３ａ乃至２０３ｂに記録された分析結果が必要
に応じて適宜用いられ、パラメータが分析される。Since the analysis result of the previous frame is used for the analysis of this parameter, the analysis result is also supplied to and recorded in the parameter recording memories 203a and 203b for storing the analysis result. In the processed data analyzers 204a and 204b, the analysis results recorded in the parameter recording memories 203a and 203b are appropriately used as needed to analyze the parameters.

【００５６】処理データ作成器２０５においては、処理
データ分析器２０４ａ乃至２０４ｂからの出力に基づい
て、入力端子２０１ａより入力された当該チャネルのオ
ーディオデータから、符号化に必要な周波数帯域成分の
みが抜き出され、出力端子２０６ａより出力される。ま
た、処理データ分析器２０４ａ乃至２０４ｂより供給さ
れた、当該チャネルのオーディオデータを復元するため
のパラメータが１つにまとめられ、出力端子２０６ｂよ
り出力される。In the processed data generator 205, based on the outputs from the processed data analyzers 204a and 204b, only the frequency band component necessary for encoding is extracted from the audio data of the channel input from the input terminal 201a. And is output from the output terminal 206a. Further, the parameters for restoring the audio data of the channel, which are supplied from the processed data analyzers 204a and 204b, are put together into one and output from the output terminal 206b.

【００５７】従って、以前のフレームのパラメータを参
照しながら、再現用信号として使用する混合処理したチ
ャネル、またはその使用割合を、フレーム単位で変更し
たり、当該チャネルのデータを符号化する周波数帯域を
変更したり、各周波数帯域毎に適切なパラメータの値を
変更することができ、復元したときの違和感を軽減する
ことが可能となる。Therefore, with reference to the parameters of the previous frame, the mixed-processed channel to be used as the reproduction signal, or the ratio of its use, can be changed on a frame-by-frame basis, or the frequency band for coding the data of the channel can be changed. It is possible to change or change the value of an appropriate parameter for each frequency band, and it is possible to reduce the discomfort when restored.

【００５８】図３は、本発明の復号化方法が適用される
復号化装置の一実施例の構成を示すブロック図である。
図３に示した復号化装置は、シングルチャネル用の圧縮
符号復号化器（例えば上記のいわゆるＡＴＲＡＣ方式に
対応する復号化器）を複数用いて、マルチチャネルの復
号化を実現するものである。FIG. 3 is a block diagram showing the configuration of an embodiment of a decoding device to which the decoding method of the present invention is applied.
The decoding apparatus shown in FIG. 3 realizes multi-channel decoding by using a plurality of single-channel compression code decoders (for example, decoders corresponding to the so-called ATRAC system described above).

【００５９】デマルチプレクサ１３２は、入力端子１３
１より入力されたビットストリームに含まれる符号化さ
れた各チャネルのオーディオデータと、混合処理された
混合処理データから元のオーディオデータを再現するた
めの再現用のパラメータを、対応するチャネル毎に分割
し、符号化されたオーディオデータを、対応するチャネ
ルの復号器１３３ａ乃至１３３ｅ（復号手段）にそれぞ
れ供給し、再現用のパラメータを対応するチャネルの合
成データ作成器１３４ａ乃至１３４ｅ（復元用データ作
成手段）にそれぞれ供給する。また、符号化された混合
処理データを復号器１３３ｆ，１３３ｇに供給するよう
になされている。The demultiplexer 132 has the input terminal 13
The encoded audio data of each channel included in the bit stream input from 1 and the reproduction parameter for reproducing the original audio data from the mixed processed data are divided for each corresponding channel. Then, the encoded audio data is supplied to the decoders 133a to 133e (decoding means) of the corresponding channels, and the reproduction parameters are combined data generators 134a to 134e (restoring data generating means) of the corresponding channels. ) Respectively. Further, the encoded mixed processing data is supplied to the decoders 133f and 133g.

【００６０】復号器１３３ａ乃至１３３ｅは、デマルチ
プレクサ１３２より供給された符号化されたオーディオ
データを復号し、合成器１３５ａ乃至１３５ｅ（復元手
段）に供給する。復号器１３３ｆ，１３３ｇは、デマル
チプレクサ１３２より供給された混合処理データを復号
し、対応する合成データ作成器１３４ａ乃至１３４ｅに
それぞれ供給するようになされている。The decoders 133a to 133e decode the encoded audio data supplied from the demultiplexer 132, and supply the decoded audio data to the synthesizers 135a to 135e (restoring means). The decoders 133f and 133g are configured to decode the mixed processing data supplied from the demultiplexer 132 and supply it to the corresponding combined data generators 134a to 134e, respectively.

【００６１】合成データ作成器１３４ａ乃至１３４ｅ
は、復号器１３３ｆ，１３３ｇより供給された混合処理
データから、デマルチプレクサ１３２より供給された再
現用のパラメータに基づいて、当該チャネルのオーディ
オデータを再現するための再現用データを作成し、合成
器１３５ａ乃至１３５ｅにそれぞれ供給するようになさ
れている。Combined data generators 134a to 134e
Is based on the reproduction parameters supplied from the demultiplexer 132 from the mixed processing data supplied from the decoders 133f and 133g, and creates reproduction data for reproducing the audio data of the relevant channel. It is adapted to be supplied to each of 135a to 135e.

【００６２】合成器１３５ａ乃至１３５ｅは、合成デー
タ作成器１３４ａ乃至１３４ｅより供給された再現用デ
ータと、デマルチプレクサ１３２より供給された復号化
されたオーディオデータを合成し、各チャネルの元のオ
ーディオデータを復元し、出力端子１３６ａ乃至１３６
ｅより出力するようになされている。The synthesizers 135a to 135e synthesize the reproduction data supplied from the synthesized data generators 134a to 134e and the decoded audio data supplied from the demultiplexer 132, and generate the original audio data of each channel. And output terminals 136a through 136
It is designed to be output from e.

【００６３】入力端子１３１よりデマルチプレクサ１３
２にビットストリームが入力されると、ビットストリー
ム内には、各チャネルのオーディオデータと共に、混合
処理した混合処理データから元のデータを再現するため
のパラメータも含まれているので、デマルチプレクサ１
３２においては、符号化されたオーディオデータ、混合
処理された混合処理データ、および再現用パラメータが
チャネル毎にそれぞれ分割され、符号化されたオーディ
オデータと混合処理された混合処理データが復号器１３
３ａ乃至１３３ｇにそれぞれ供給され、再現用パラメー
タは合成器１３５ａ乃至１３５ｅにそれぞれ供給され
る。From the input terminal 131 to the demultiplexer 13
When the bitstream is input to the demultiplexer 1, the demultiplexer 1 receives the audio data of each channel and the parameters for reproducing the original data from the mixed processing data.
In 32, the encoded audio data, the mixed processing data subjected to the mixing processing, and the reproduction parameter are divided for each channel, and the encoded audio data and the mixed processing data subjected to the mixing processing are decoded by the decoder 13
3a to 133g, respectively, and the reproduction parameters are supplied to the combiners 135a to 135e, respectively.

【００６４】混合処理したチャネルに対応する復号器１
３３ｆ，１３３ｇにおいては、符号化された混合処理デ
ータが復号され、合成データ作成器１３４ａ乃至１３４
ｅにそれぞれ供給される。合成データ作成器１３４ａ乃
至１３４ｅにおいては、復号器１３３ｆ，１３３ｇより
供給された復号化された混合処理データと、デマルチプ
レクサ１３２より供給された、混合処理されたデータか
ら元のデータを再現するためのパラメータを用いて、当
該チャネル再現用のデータが作成され、合成器１３５ａ
乃至１３５ｅに供給される。合成データ作成器１３４ａ
乃至１３４ｅの詳細な構成、および動作については後述
する。Decoder 1 corresponding to the mixed-processed channel
In 33f and 133g, the encoded mixed processing data is decoded, and combined data generators 134a to 134
e respectively. In the synthesized data generators 134a to 134e, the original data is reproduced from the decoded mixed processing data supplied from the decoders 133f and 133g and the mixed processing data supplied from the demultiplexer 132. Data for reproducing the channel is created using the parameters, and the synthesizer 135a
To 135e. Synthetic data generator 134a
The detailed configurations and operations of the to 134e will be described later.

【００６５】合成器１３５ａ乃至１３５ｅにおいては、
復号器１３３ａ乃至１３３ｅにて復号化された各チャネ
ルのオーディオデータと、合成データ作成器１３４ａ乃
至１３４ｅにて作成された各チャネル再現用データがそ
れぞれ合成され、各チャネルの復号化されたオーディオ
データがそれぞれ復元され、出力端子１３６ａ乃至１３
６ｅより出力される。In the synthesizers 135a to 135e,
The audio data of each channel decoded by the decoders 133a to 133e and the data for reproducing each channel created by the combined data creating units 134a to 134e are respectively combined to obtain the decoded audio data of each channel. The output terminals 136a to 13a are respectively restored.
It is output from 6e.

【００６６】図４は、図３の合成データ作成器１３４ａ
の内部構成例を示している。合成データ作成器１３４ｂ
乃至１３４ｅの構成および動作は、合成データ作成器１
３４ａの場合と基本的に同様であるので、その説明は省
略する。FIG. 4 shows the combined data generator 134a of FIG.
2 shows an example of the internal configuration of FIG. Composite data generator 134b
To 134e are the same as those of the synthetic data generator 1
Since it is basically similar to the case of 34a, its description is omitted.

【００６７】合成データ作成器１３４ａを構成する帯域
分割フィルタ２１２ａ乃至２１２ｂは、入力端子２１１
ａ，２１１ｂより入力された２チャネルの混合処理デー
タを再現データを作成する周波数帯域毎に分割し、出力
する。パラメータ解析器２１３は、入力端子２１１ｃよ
り入力された再現用パラメータを解析し、周波数帯域毎
に分割し、出力するようになされている。The band division filters 212a and 212b constituting the combined data generator 134a are connected to the input terminal 211.
The mixed processing data of the two channels input from a and 211b is divided for each frequency band in which reproduction data is created and output. The parameter analyzer 213 analyzes the reproduction parameter input from the input terminal 211c, divides it for each frequency band, and outputs it.

【００６８】再現データ作成器２１４ａ乃至２１４ｂ
は、再現用パラメータを元に、各周波数帯域毎に、復号
化された混合処理データを変換し、出力する。再現デー
タ合成器２１５は、各周波数帯域毎の再現データ作成器
２１４ａ乃至２１４ｅからの出力を合成し、出力端子２
１６より出力するようになされている。Reproduction data generators 214a and 214b
Converts the decoded mixed processing data for each frequency band based on the reproduction parameter and outputs the converted mixed processing data. The reproduction data synthesizer 215 synthesizes the outputs from the reproduction data generators 214a to 214e for each frequency band, and outputs the output terminal 2
It is designed to output from 16.

【００６９】入力端子２１１ａ，２１１ｂには、図３に
示した復号器１３３ｆ，１３３ｇからの復号化された混
合処理データが入力され、入力端子２１１ｃには図３に
示したデマルチプレクサ１３２からの当該チャネルの再
現用パラメータが入力される。入力端子２１１ａ，２１
１ｂを介して供給された復号化された各混合処理データ
は、帯域分割フィルタ２１２ａ乃至２１２ｂにそれぞれ
供給され、再現データを作成する周波数帯域（例えばク
リティカルバンド）毎に分割された後、再現データ作成
器２１４ａ乃至２１４ｂにそれぞれ供給される。Decoded mixed processed data from the decoders 133f and 133g shown in FIG. 3 are input to the input terminals 211a and 211b, and the input mixed data from the demultiplexer 132 shown in FIG. 3 is input to the input terminal 211c. Channel reproduction parameters are input. Input terminals 211a, 21
The decoded mixed processing data supplied via 1b are supplied to the band division filters 212a to 212b, respectively, and are divided into frequency bands (for example, critical bands) for creating reproduction data, and then reproduction data creation is performed. To the containers 214a and 214b, respectively.

【００７０】一方、入力端子２１１ｃより入力された当
該チャネルの再現用パラメータは、パラメータ解析器２
１３に供給され、パラメータを解析して分析する周波数
帯域毎に分割され、該当帯域の再現データ作成器２１４
ａ乃至２１４ｂにそれぞれ供給される。On the other hand, the parameter for reproduction of the channel input from the input terminal 211c is the parameter analyzer 2.
13, the parameters are analyzed and divided into frequency bands for analysis, and a reproduction data generator 214 for the corresponding band is generated.
a to 214b, respectively.

【００７１】再現データ作成器２１４ａ乃至２１４ｂに
おいては、パラメータ解析器２１３より供給された再現
用パラメータを元に、各周波数帯域毎に、復号化された
混合処理データが変換され、再現データ合成器２１５に
供給される。In the reproduction data generators 214a and 214b, the decoded mixed processing data is converted for each frequency band based on the reproduction parameters supplied from the parameter analyzer 213, and the reproduction data synthesizer 215 is used. Is supplied to.

【００７２】再現データ合成器２１５においては、各周
波数帯域毎の再現データ作成器２１４ａ乃至２１４ｂか
らの出力が合成され、該当チャネルの再現用データが作
成された後、出力端子２１６より出力される。In the reproduction data synthesizer 215, the outputs from the reproduction data generators 214a to 214b for each frequency band are synthesized, the reproduction data of the corresponding channel is generated, and then output from the output terminal 216.

【００７３】このように、上記実施例においては、合成
データ作成器１３４ａ乃至１３４ｅにより、符号化装置
で決定された最適な再現方法に合わせて、混合処理デー
タが再現用データに変換され、合成器１３５により当該
チャネルのデータと合成され、元のオーディオデータが
復元される。従って、混合処理によって得られた混合処
理データを用いることにより、当該チャネルのビットレ
ートを削減することが可能となる。As described above, in the above embodiment, the combined data generators 134a to 134e convert the mixed processing data into reproduction data in accordance with the optimum reproduction method determined by the encoding device, and the combiner data is reproduced. The original audio data is restored by combining with the data of the channel by 135. Therefore, by using the mixed processing data obtained by the mixing processing, it is possible to reduce the bit rate of the channel.

【００７４】次に、本発明の他の実施例について述べ
る。上述した実施例では混合処理したチャネルは２チャ
ネルの場合で述べているが、混合処理したチャネルは少
なくとも１チャネルあれば本発明を使用することが可能
である。Next, another embodiment of the present invention will be described. In the above-described embodiment, the case where the number of the mixed-processed channels is two is described, but the present invention can be used as long as at least one channel is the mixed-processed channel.

【００７５】また、上述した実施例のチャネル選択は、
センタ（Ｃ）、レフト（Ｌ）、ライト（Ｒ）、レフトサ
ラウンド（ＳＬ）、およびライトサラウンド（ＳＲ）の
５チャネルの場合について述べているが、これにレフト
センタ（ＬＣ）チャネル、ライトセンタ（ＲＣ）チャネ
ルを加えた７チャネルの場合や、逆にセンタ（Ｃ）チャ
ネルを抜いた４チャネルの場合、さらにサブウーファー
（ＳＷ）チャネルが付加された場合など、複数チャネル
のディジタル信号を符号化し、かつ、これとは別に、こ
れらを混合処理したチャネルを持つマルチチャネルのデ
ィジタル信号を符号化する場合に使用可能である。The channel selection of the above embodiment is
The case of 5 channels of the center (C), the left (L), the right (R), the left surround (SL), and the right surround (SR) is described, and the left center (LC) channel and the right center ( For example, in the case of 7 channels including the RC) channel, in the case of 4 channels without the center (C) channel, and in the case where a subwoofer (SW) channel is further added, digital signals of a plurality of channels are encoded, In addition to this, it can be used when encoding a multi-channel digital signal having a channel obtained by mixing these.

【００７６】次に、上述した符号器１０５ａ乃至１０５
ｅの具体的構成および動作について、図５乃至図８を用
いて説明する。なお、図５には、１つのチャネルの符号
器１０５ａの構成例を示している。なお、符号器１０５
ｂ乃至１０５ｅの構成および動作は、符号器１０５ａの
場合と基本的に同様であるので、ここではその説明は省
略する。Next, the above-mentioned encoders 105a to 105
The specific configuration and operation of e will be described with reference to FIGS. Note that FIG. 5 shows a configuration example of the encoder 105a for one channel. The encoder 105
Since the configurations and operations of b to 105e are basically the same as those of the encoder 105a, the description thereof is omitted here.

【００７７】信号帯域分割フィルタ４０１は、入力端子
４２４を介して入力された信号を３つの周波数帯域に分
割するようになされている。低域ＭＤＣＴ（Modified D
iscrete Cosine Transform：改良型離散余弦変換）回路
４０２Ｌは、帯域分割フィルタ４０１より供給された０
ｋＨｚ乃至５．５ｋＨｚの低域の信号に対してＭＤＣＴ
演算を行う。中域ＭＤＣＴ回路４０２Ｍは、帯域分割フ
ィルタ４０１より供給された５．５ｋＨｚ乃至１１ｋＨ
ｚの中域の信号に対してＭＤＣＴ演算を行う。高域ＭＤ
ＣＴ回路４０２Ｈは、帯域分割フィルタ４０１より供給
された１１ｋＨｚ以上（１１ｋＨｚ乃至２２ｋＨｚ）の
高域の信号に対してＭＤＣＴ演算を行うようになされて
いる。The signal band division filter 401 is adapted to divide the signal inputted through the input terminal 424 into three frequency bands. Low frequency MDCT (Modified D
The iscrete Cosine Transform (improved discrete cosine transform) circuit 402L is 0 supplied from the band division filter 401.
MDCT for low frequency signals from kHz to 5.5 kHz
Perform the operation. The mid-range MDCT circuit 402M has 5.5 kHz to 11 kHz supplied from the band division filter 401.
MDCT operation is performed on the signal in the middle band of z. High range MD
The CT circuit 402H is configured to perform MDCT calculation on a high frequency signal of 11 kHz or higher (11 kHz to 22 kHz) supplied from the band division filter 401.

【００７８】ブロックサイズ評価器４０３は、後述する
時間ブロック長を決定する。正規化回路４０４Ｌ乃至４
０４Ｈは、低域、中域、および高域からなるオーディオ
信号を低域、中域、および高域の合計５２個のブロック
フローティングユニットに分けるとともに、ユニット毎
に規格化（正規化）するようになされている。The block size evaluator 403 determines a time block length described later. Normalization circuits 404L to 4
04H divides the audio signal consisting of low, mid, and high frequencies into a total of 52 block floating units of low, mid, and high frequencies, and normalizes (normalizes) each unit. Has been done.

【００７９】ビット配分器４０５は、後述する配分ビッ
ト数情報を求め、再量子化器４０６に供給する。再量子
化器４０６は、ビット配分器４０５より供給された配分
ビット数情報に基づいて、再量子化を行うようになされ
ている。フォーマッタ４０７は、再量子化器４０６から
の出力信号をビットストリームに変換するようになされ
ている。The bit allocator 405 obtains allocation bit number information, which will be described later, and supplies it to the requantizer 406. The requantizer 406 is adapted to perform requantization based on the distributed bit number information supplied from the bit distributor 405. The formatter 407 is adapted to convert the output signal from the requantizer 406 into a bitstream.

【００８０】図５において、入力端子４２４には、オー
ディオデータ（標本化および量子化されたオーディオデ
ータ）が供給される。入力端子４２４に供給されたデー
タは、先ず帯域分割フィルタ４０１によって０乃至５．
５ｋＨｚの低域と、５．５ｋＨｚ乃至１１ｋＨｚの中域
と、１１ｋＨｚ以上（１１ｋＨｚ乃至２２ｋＨｚ）の３
つの周波数帯域に分割される。In FIG. 5, the input terminal 424 is supplied with audio data (sampled and quantized audio data). The data supplied to the input terminal 424 is first supplied to the band splitting filter 401 through 0 to 5.
Low frequency of 5kHz, medium frequency of 5.5kHz to 11kHz, 11kHz or more (11kHz to 22kHz) 3
It is divided into two frequency bands.

【００８１】これら３つの周波数帯域の信号のうち、帯
域分割フィルタ４０１からの低域の信号はＭＤＣＴ演算
を行うＭＤＣＴ回路４０２Ｌに、中域の信号は同じくＭ
ＤＣＴ演算を行うＭＤＣＴ回路４０２Ｍに、また、高域
の信号はＭＤＣＴ回路４０２Ｈに供給され、これらＭＤ
ＣＴ回路４０２Ｌ乃至４０２Ｈにおいて、周波数成分に
それぞれ分解される。このとき、ＭＤＣＴ処理を施すと
きの時間ブロック長は、各周波数帯域毎に可変であり、
信号が急激に変化する部分では、時間ブロック長を短く
して、時間分解能を高め、信号が定常的な部分では時間
ブロック長を長くして、信号成分の有効伝送と量子化雑
音を制御するようにしている。Of these three frequency band signals, the low frequency signal from the band division filter 401 is input to the MDCT circuit 402L which performs MDCT operation, and the middle frequency signal is also M.
The MDCT circuit 402M that performs the DCT operation and the high-frequency signal are supplied to the MDCT circuit 402H.
Each of the CT circuits 402L to 402H is decomposed into frequency components. At this time, the time block length when performing MDCT processing is variable for each frequency band,
In the part where the signal changes abruptly, the time block length is shortened to improve the time resolution, and in the part where the signal is stationary, the time block length is lengthened to control the effective transmission of the signal component and the quantization noise. I have to.

【００８２】この時間ブロック長は、ブロックサイズ評
価器４０３にて決定される。すなわち、帯域分割フィル
タ４０１からの３つの周波数帯域の信号は、ブロックサ
イズ評価器４０３にも供給され、ブロックサイズ評価器
４０３がＭＤＣＴ処理の時間ブロック長を決定し、決定
された時間ブロック長を示す情報がＭＤＣＴ回路４０２
Ｌ乃至４０２Ｈに供給されるようにしている。The time block length is determined by the block size evaluator 403. That is, the signals of the three frequency bands from the band division filter 401 are also supplied to the block size evaluator 403, the block size evaluator 403 determines the time block length of MDCT processing, and indicates the determined time block length. Information is MDCT circuit 402
L to 402H are supplied.

【００８３】なお、ＭＤＣＴ処理での２種類の時間ブロ
ック長のうち、長い時間ブロック長はロングモードと呼
ばれ、例えば１１．６ｍｓの時間に相当する。また、短
い時間ブロック長はショートモードと呼ばれ、例えば高
域（１１ｋＨｚ以上）では１．４５ｍｓまで、低域
（５．５ｋＨｚ以下）および中域（５．５ｋＨｚから１
１ｋＨｚ）では２．９ｍｓまで時間分解能を上げるよう
にしている。Of the two types of time block lengths in MDCT processing, the long time block length is called the long mode and corresponds to a time of 11.6 ms, for example. The short time block length is called a short mode. For example, up to 1.45 ms in the high range (11 kHz or more), low range (5.5 kHz or less) and middle range (5.5 kHz to 1).
At 1 kHz), the time resolution is increased up to 2.9 ms.

【００８４】このようにして、時間と周波数の２次元領
域（これをブロックフローティングユニット：Block Fl
oating Unit と呼ぶ）上の信号成分に分解されたオーデ
ィオ信号は、正規化回路４０４Ｌ乃至４０４Ｈによって
低域、中域、および高域において合計５２個のブロック
フローティングユニットに分けられると共に、ユニット
毎に規格化（正規化）される（スケールファクタの決定
がなされる）。In this way, the two-dimensional area of time and frequency (this is a block floating unit: Block Fl
The audio signal decomposed into the above signal components is divided into a total of 52 block floating units in the low band, the middle band, and the high band by the normalization circuits 404L to 404H, and standardized for each unit. Is normalized (normalized) (scale factor is determined).

【００８５】また、ビット配分器４０５では、人間の聴
覚の特性を利用して、そのオーディオ信号がどのような
成分から構成されているかが分析される。この分析結果
は、正規化回路４０４Ｌ乃至４０４Ｈからの各ユニット
毎の信号が供給される再量子化器４０６に供給される。Further, in the bit allocator 405, the characteristics of the audio signal are analyzed by utilizing the characteristics of human hearing. The analysis result is supplied to the requantizer 406 to which the signals for each unit from the normalization circuits 404L to 404H are supplied.

【００８６】再量子化器４０６においては、上記分析結
果に基づいて、各ユニットをどの程度の精度で符号化す
るかが求められ、パラメータ化される（ワードレングス
の決定が行われる）と共に、再量子化が行われる。In the requantizer 406, the degree of accuracy with which each unit is coded is determined based on the above analysis result, parameterized (word length is determined), and re-quantized. Quantization is performed.

【００８７】最後に、フォーマッタ４０７においては、
各ユニット毎の各パラメータ情報と再量子化されたスペ
クトラム信号が、所定のフォーマットに従って、図１に
示したマルチプレクサ１０６に供給される１つのチャネ
ルにおけるビットストリームに組み立てられる。このフ
ォーマッタ４０７の出力が出力端子４２５から出力され
る。Finally, in the formatter 407,
The parameter information for each unit and the requantized spectrum signal are assembled into a bit stream in one channel supplied to the multiplexer 106 shown in FIG. 1 according to a predetermined format. The output of the formatter 407 is output from the output terminal 425.

【００８８】ここで、上述したような符号化の動作はサ
ウンドフレームという単位毎に行われる。Here, the above-described encoding operation is performed for each unit called a sound frame.

【００８９】また、上記ビット配分器４０５は具体的に
は図６に示すような構成を有するものである。すなわ
ち、エネルギ算出回路５２２は、入力端子５２１より入
力された信号のクリティカルバンド毎のエネルギを算出
するようになされている。畳み込みフィルタ回路５２３
は、エネルギ算出回路５２２からの出力信号に所定の重
み付け関数を掛けて加算する畳み込み処理を施すように
なされている。Further, the bit allocator 405 has a concrete structure as shown in FIG. That is, the energy calculation circuit 522 is configured to calculate energy for each critical band of the signal input from the input terminal 521. Convolution filter circuit 523
Performs a convolution process of multiplying an output signal from the energy calculation circuit 522 by a predetermined weighting function and adding the product.

【００９０】（ｎ−ａｉ）関数発生回路５２５は、許容
関数（ｎ−ａｉ）を発生し、出力する。引き算器５２４
は、畳み込みフィルタ回路５２３の出力から、（ｎ−ａ
ｉ）関数発生回路５２５からの出力を引き算し、その結
果を割算器５２６に供給するようになされている。割算
器５２６は、入力信号に対して、逆コンボリューション
処理を行い、マスキングスレショールドを得るようにな
されている。The (n-ai) function generating circuit 525 generates and outputs an allowable function (n-ai). Subtractor 524
From the output of the convolutional filter circuit 523, (n−a
i) The output from the function generating circuit 525 is subtracted, and the result is supplied to the divider 526. The divider 526 is adapted to perform inverse convolution processing on the input signal to obtain a masking threshold.

【００９１】最小可聴カーブ発生回路５３２は、最小可
聴カーブを示すデータを合成回路５２７に供給する。合
成回路５２７は、最小可聴カーブ発生回路５３２より出
力された最小可聴カーブを示すデータと、割算器５２６
より出力されたマスキングスレショールドを合成し、出
力する。減算器５２８は、合成回路５２７からの出力信
号と、遅延回路５２９を介して供給されたエネルギ算出
回路５２２からの出力に対して減算処理を施し、出力す
るようになされている。The minimum audible curve generating circuit 532 supplies the data indicating the minimum audible curve to the synthesizing circuit 527. The synthesizing circuit 527 and the data indicating the minimum audible curve output from the minimum audible curve generating circuit 532 and the divider 526.
The masking threshold output from the above is synthesized and output. The subtractor 528 subtracts the output signal from the synthesis circuit 527 and the output from the energy calculation circuit 522 supplied via the delay circuit 529, and outputs the result.

【００９２】補正情報出力回路５３３は、所定の等ラウ
ドネスカーブに対応するデータを出力する。許容雑音補
正回路５３０は、補正情報出力回路５３３からの出力信
号に基づいて、減算器５２８からの出力信号の許容雑音
レベルの補正を行い、出力端子５３１を介して出力する
ようになされている。The correction information output circuit 533 outputs data corresponding to a predetermined equal loudness curve. The permissible noise correction circuit 530 corrects the permissible noise level of the output signal from the subtractor 528 based on the output signal from the correction information output circuit 533, and outputs it via the output terminal 531.

【００９３】図６において、入力端子５２１には、ＭＤ
ＣＴ回路４０２Ｌ，４０２Ｍ、および４０２Ｈからの周
波数領域のスペクトルデータが供給されている。In FIG. 6, the input terminal 521 has an MD
The spectrum data in the frequency domain from the CT circuits 402L, 402M, and 402H is supplied.

【００９４】この周波数領域のデータは、帯域毎のエネ
ルギ算出回路５２２に供給されて、クリティカルバンド
（臨界帯域）毎のエネルギが、例えば当該バンド内での
各振幅値２乗（例えば振幅値のピーク値の２乗）の総和
を計算すること等により求められる。この各バンド毎の
エネルギの代わりに、振幅値のピーク値、平均値等が用
いられることもある。このエネルギ算出回路５２２から
の出力としての例えば各バンドの総和値のスペクトル
は、一般にバークスペクトル（ＳＢ）と称されている。
図７はこのような各クリティカルバンド毎のバークスペ
クトルを示している。ただし、図７では、図示を簡略化
するため、上記クリティカルバンドのバンド数を１２バ
ンド（バンドＢ1乃至バンドＢ12）で表現している。This frequency domain data is supplied to the energy calculation circuit 522 for each band, and the energy for each critical band (critical band) is, for example, squared for each amplitude value within the band (for example, the peak of the amplitude value). It can be obtained by calculating the sum of the squares of the values. Instead of the energy for each band, a peak value, an average value, etc. of the amplitude value may be used. The spectrum of the total sum value of each band as the output from the energy calculation circuit 522 is generally called a Bark spectrum (SB).
FIG. 7 shows the Bark spectrum for each such critical band. However, in FIG. 7, in order to simplify the illustration, the number of bands of the critical band is represented by 12 bands (bands B1 to B12).

【００９５】ここで、上記バークスペクトルのいわゆる
マスキングに於ける影響を考慮するために、このバーク
スペクトルに所定の重み付け関数を掛けて加算するよう
な畳込み（コンボリューション）処理を施す。このた
め、上記帯域毎のエネルギ算出回路５２２の出力、すな
わちバークスペクトルの各値は、畳込みフィルタ回路５
２３に供給される。畳込みフィルタ回路５２３は、例え
ば、入力データを順次遅延させる複数の遅延素子と、こ
れら遅延素子からの出力にフィルタ係数（重み付け関
数）を乗算する複数の乗算器（例えば各バンドに対応す
る２５個の乗算器）と、各乗算器出力の総和をとる総和
加算器とから構成されるものである。Here, in order to consider the influence of so-called masking of the Bark spectrum, a convolution processing is performed such that the Bark spectrum is multiplied by a predetermined weighting function and added. Therefore, the convolution filter circuit 5 outputs the output of the energy calculation circuit 522 for each band, that is, each value of the Bark spectrum.
23. The convolution filter circuit 523 includes, for example, a plurality of delay elements that sequentially delay input data, and a plurality of multipliers that multiply outputs from these delay elements by a filter coefficient (weighting function) (for example, 25 pieces corresponding to each band). And a sum total adder that sums the outputs of the respective multipliers.

【００９６】なお、上記マスキングとは、人間の聴覚上
の特性により、ある信号によって他の信号がマスクされ
て聞こえなくなる現象をいうものであり、このマスキン
グ効果には、時間領域のオーディオ信号による時間軸マ
スキング効果と、周波数領域の信号による同時刻マスキ
ング効果とがある。これらのマスキング効果により、マ
スキングされる部分にノイズがあったとしても、このノ
イズは聞こえないことになる。このため、実際のオーデ
ィオ信号では、このマスキングされる範囲内のノイズは
許容可能なノイズとされる。The above-mentioned masking is a phenomenon in which one signal is masked by another signal and becomes inaudible due to human auditory characteristics. The masking effect is due to the time domain audio signal. There are an axial masking effect and a simultaneous time masking effect by a signal in the frequency domain. Due to these masking effects, even if there is noise in the masked portion, this noise will not be heard. For this reason, in an actual audio signal, noise within the masked range is regarded as acceptable noise.

【００９７】ここで、上記畳込みフィルタ回路５２３の
各乗算器の乗算係数（フィルタ係数）の一具体例を示す
と、任意のバンドに対応する乗算器Ｍの係数を１とする
とき、乗算器Ｍ−１で係数０．１５を、乗算器Ｍ−２で
係数０．００１９を、乗算器Ｍ−３で係数０．００００
０８６を、乗算器Ｍ＋１で係数０．４を、乗算器Ｍ＋２
で係数０．０６を、乗算器Ｍ＋３で係数０．００７を各
遅延素子の出力に乗算することにより、上記バークスペ
クトルの畳込み処理が行われる。ただし、Ｍは１乃至２
５の任意の整数である。Here, a specific example of the multiplication coefficient (filter coefficient) of each multiplier of the convolution filter circuit 523 will be described. When the coefficient of the multiplier M corresponding to an arbitrary band is 1, the multiplier M-1 gives a coefficient of 0.15, multiplier M-2 gives a coefficient of 0.0019, and multiplier M-3 gives a coefficient of 0.0000.
086, multiplier M + 1 gives a coefficient of 0.4, multiplier M + 2
By multiplying the output of each delay element by a coefficient of 0.06 with a coefficient of 0.007 by a multiplier M + 3, the convolution processing of the Bark spectrum is performed. However, M is 1 to 2
It is an arbitrary integer of 5.

【００９８】次に、上記畳込みフィルタ回路５２３の出
力は引算器５２４に供給される。引算器５２４は、上記
畳込んだ領域での後述する許容可能なノイズレベル（許
容ノイズレベル）に対応するレベルαを求めるものであ
る。なお、この許容可能なノイズレベルに対応するレベ
ルαは、後述するように、逆コンボリューション処理を
行うことによって、クリティカルバンドの各バンド毎の
許容ノイズレベルとなるようなレベルである。ここで、
引算器５２４には、レベルαを求めるための許容関数
（マスキングレベルを表現する関数）が供給される。こ
の許容関数を増減させることでレベルαの制御を行って
いる。許容関数は、次に説明するような（ｎ−ａｉ）関
数発生回路５２５から供給されているものである。Next, the output of the convolution filter circuit 523 is supplied to the subtractor 524. The subtractor 524 calculates a level α corresponding to an allowable noise level (allowable noise level) described later in the convoluted area. It should be noted that the level α corresponding to this allowable noise level is a level that becomes an allowable noise level for each band of the critical band by performing inverse convolution processing, as described later. here,
The subtractor 524 is supplied with an allowance function (function expressing a masking level) for obtaining the level α. The level α is controlled by increasing or decreasing this allowance function. The allowance function is supplied from the (n-ai) function generating circuit 525 as described below.

【００９９】すなわち、許容ノイズレベルに対応するレ
ベルαは、クリティカルバンドのバンドの低域から順に
与えられる番号をｉとすると、次の式で求めることがで
きる。That is, the level α corresponding to the allowable noise level can be obtained by the following equation, where i is the number given in order from the low band of the critical band.

【０１００】α＝Ｓ−（ｎ−ａｉ）Α = S- (n-ai)

【０１０１】この式において、ｎ，ａは定数であり、ａ
＞０、Ｓは畳込み処理されたバークスペクトルの強度で
あり、式中（ｎ−ａｉ）が許容関数となる。例としてｎ
＝３８、ａ＝−０．５を用いることができる。In this equation, n and a are constants, and a
> 0 and S are the intensities of the bark spectrum subjected to the convolution processing, and (n-ai) in the formula is the tolerance function. N as an example
= 38, a = -0.5 can be used.

【０１０２】このようにして、上記レベルαが求めら
れ、このデータは、割算器５２６に供給される。割算器
５２６は、畳込みされた領域でのレベルαを逆コンボリ
ューションするためのものである。したがって、この逆
コンボリューション処理を行うことにより、レベルαか
らマスキングスレッショールドが得られるようになる。
すなわち、このマスキングスレッショールドが許容ノイ
ズスペクトルとなる。なお、逆コンボリューション処理
は、複雑な演算を必要とするが、本実施例では簡略化し
た割算器５２６を用いて逆コンボリューションを行って
いる。In this way, the level α is obtained, and this data is supplied to the divider 526. The divider 526 is for deconvoluting the level α in the convolved region. Therefore, by performing this inverse convolution processing, the masking threshold can be obtained from the level α.
That is, this masking threshold becomes the allowable noise spectrum. Although the inverse convolution processing requires a complicated calculation, the inverse convolution is performed using the simplified divider 526 in this embodiment.

【０１０３】次に、上記マスキングスレッショールド
は、合成回路５２７を介して減算器５２８に供給され
る。ここで、減算器５２８には、上記帯域毎のエネルギ
検出回路５２２からの出力、すなわち前述したバークス
ペクトル（ＳＢ）が、遅延回路５２９を介して供給され
ている。したがって、この減算器５２８で上記マスキン
グスレッショールドとバークスペクトルとの減算演算が
行われることで、図７に示すように、上記バークスペク
トルは、マスキングスレッショールド（ＭＳ）のレベル
で示すレベル以下がマスキングされることになる。な
お、遅延回路５２９は合成回路５２７以前の各回路での
遅延量を考慮して、エネルギ算出回路５２２からのバー
クスペクトルを遅延させるために設けられている。Next, the masking threshold is supplied to the subtractor 528 via the synthesizing circuit 527. Here, the output from the energy detection circuit 522 for each band, that is, the above-described Bark spectrum (SB) is supplied to the subtractor 528 via the delay circuit 529. Therefore, the subtractor 528 performs a subtraction operation on the masking threshold and the Bark spectrum, so that the Bark spectrum is equal to or lower than the level indicated by the masking threshold (MS) level, as shown in FIG. Will be masked. The delay circuit 529 is provided to delay the Bark spectrum from the energy calculation circuit 522 in consideration of the delay amount in each circuit before the combining circuit 527.

【０１０４】減算器５２８からの出力は、許容雑音補正
回路５３０を介し、さらに出力端子５３１を介して取り
出され、例えば配分ビット数情報が予め記憶されたＲＯ
Ｍ等（図示せず）に供給される。このＲＯＭ等は、上記
減算回路５２８から許容雑音補正回路５３０を介して得
られた出力（上記各バンドのエネルギと上記合成回路５
２７の出力との差分のレベル）に応じ、各バンド毎の配
分ビット数情報を出力する。The output from the subtracter 528 is taken out via the allowable noise correction circuit 530 and further via the output terminal 531. For example, RO in which the distribution bit number information is stored in advance.
It is supplied to M and the like (not shown). This ROM or the like outputs the energy obtained from the subtraction circuit 528 through the allowable noise correction circuit 530 (the energy of each band and the synthesis circuit 5).
The distribution bit number information for each band is output according to the level of the difference from the output of 27).

【０１０５】このようにして求められた配分ビット数情
報が図５に示した再量子化器４０６に供給されること
で、再量子化器４０６において、ＭＤＣＴ回路４０４
Ｌ，４０４Ｍ、および４０４Ｈからの周波数領域の各ス
ペクトルデータが、それぞれのバンド毎に割り当てられ
たビット数で再量子化されるわけである。The distribution bit number information thus obtained is supplied to the requantizer 406 shown in FIG. 5, so that the MDCT circuit 404 in the requantizer 406.
Each spectrum data in the frequency domain from L, 404M, and 404H is requantized with the number of bits assigned to each band.

【０１０６】すなわち、要約すれば、再量子化器４０６
では、上記クリティカルバンドの各バンド帯域（クリテ
ィカルバンド）毎もしくは高域においては、クリティカ
ルバンドを更に複数帯域に分割した帯域のエネルギもし
くはピーク値と、合成回路５２７の出力との差分のレベ
ルに応じて配分されたビット数で、上記各バンド毎のス
ペクトルデータを量子化することになる。That is, in summary, the requantizer 406
Then, in each band band (critical band) of the critical band or in the high band, depending on the level of the difference between the energy or peak value of the band obtained by further dividing the critical band into a plurality of bands and the output of the synthesizing circuit 527. The spectrum data for each band is quantized with the allocated number of bits.

【０１０７】ところで、上述した合成回路５２７での合
成の際には、最小可聴カーブ発生回路５３２から供給さ
れる図８に示すような人間の聴覚特性であるいわゆる最
小可聴カーブ（ＲＣ）を示すデータと、上記マスキング
スレッショールド（ＭＳ）とを合成することができる。
この最小可聴カーブにおいて、雑音絶対レベルが最小可
聴カーブ以下ならば雑音は聞こえないことになる。By the way, at the time of synthesizing by the above-mentioned synthesizing circuit 527, data indicating a so-called minimum audible curve (RC) which is the human auditory characteristic as shown in FIG. 8 supplied from the minimum audible curve generating circuit 532. And the masking threshold (MS) can be combined.
In this minimum audible curve, if the absolute noise level is below the minimum audible curve, no noise will be heard.

【０１０８】最小可聴カーブは、コーディング（符号化
方法）が同じであっても例えば再生時の再生ボリューム
の違いで異なるものとなるが、現実的なディジタルシス
テムでは、例えば１６ビットダイナミックレンジへの音
楽のはいり方にはさほど違いがないので、例えば４ｋＨ
ｚ付近の最も耳に聞こえやすい周波数帯域の量子化雑音
が聞こえないとすれば、他の周波数帯域ではこの最小可
聴カーブのレベル以下の量子化雑音は聞こえないと考え
られる。Even if the coding (coding method) is the same, the minimum audible curve is different due to the difference in the reproduction volume at the time of reproduction. However, in a realistic digital system, for example, music to a 16-bit dynamic range is generated. Since there is not much difference in how to go, for example 4kH
If the quantization noise in the most audible frequency band around z is not heard, it is considered that the quantization noise below the level of this minimum audible curve is not heard in other frequency bands.

【０１０９】したがって、このように例えばシステムの
持つダイナミックレンジの４ｋＨｚ付近の雑音が聞こえ
ない使い方をすると仮定し、この最小可聴カーブ（Ｒ
Ｃ）とマスキングスレッショールド（ＭＳ）とを共に合
成することで許容ノイズレベルを得るようにすると、こ
の場合の許容ノイズレベルは、図８中の斜線で示す部分
までとすることができるようになる。なお、本実施例で
は、上記最小可聴カーブの４ｋＨｚのレベルを、例えば
２０ビット相当の最低レベルに合わせている。また、図
８には、信号スペクトル（ＳＳ）も同時に示されてい
る。Therefore, assuming that the system is used in such a manner that noise near the dynamic range of the system of 4 kHz cannot be heard, the minimum audible curve (R
If the allowable noise level is obtained by synthesizing C) and the masking threshold (MS) together, the allowable noise level in this case can be up to the shaded portion in FIG. Become. In this embodiment, the level of 4 kHz of the minimum audible curve is set to the minimum level equivalent to 20 bits, for example. Further, FIG. 8 also shows the signal spectrum (SS) at the same time.

【０１１０】また、上記許容雑音補正回路５３０では、
補正情報出力回路５３３から供給される例えば等ラウド
ネスカーブの情報に基づいて、上記減算器５２８からの
出力における許容雑音レベルを補正している。ここで、
等ラウドネスカーブとは、人間の聴覚特性に関する特性
曲線であり、例えば１ｋＨｚの純音と同じ大きさに聞こ
える各周波数での音の音圧を求めて曲線で結んだもの
で、ラウドネスの等感度曲線（等ラウドネス曲線）とも
呼ばれる。またこの等ラウドネス曲線は、図８に示した
最小可聴カーブ（ＲＣ）と略同じ曲線を描くものであ
る。In the allowable noise correction circuit 530,
The allowable noise level in the output from the subtractor 528 is corrected based on the information on the equal loudness curve supplied from the correction information output circuit 533, for example. here,
The equal loudness curve is a characteristic curve relating to human auditory characteristics, and is obtained by, for example, obtaining the sound pressure of sound at each frequency heard at the same loudness as a pure tone of 1 kHz and connecting the curves, and the equal sensitivity curve of loudness ( Isoloudness curve) is also called. Further, this equal loudness curve draws a curve substantially the same as the minimum audible curve (RC) shown in FIG.

【０１１１】この等ラウドネス曲線においては、例えば
４ｋＨｚ付近では１ｋＨｚのところより音圧が８乃至１
０ｄＢ下がっても１ｋＨｚと同じ大きさに聞こえ、逆
に、５０Ｈｚ付近では１ｋＨｚでの音圧よりも約１５ｄ
Ｂ高くないと同じ大きさに聞こえない。このため、最小
可聴カーブのレベルを越えた雑音（許容ノイズレベル）
は、等ラウドネス曲線に応じたカーブで与えられる周波
数特性を持つようにするのが良いことがわかる。このよ
うなことから、上記等ラウドネス曲線を考慮して上記許
容ノイズレベルを補正することは、人間の聴覚特性に適
合していることがわかる。In this equal loudness curve, for example, in the vicinity of 4 kHz, the sound pressure is 8 to 1 at 1 kHz.
Even if it drops by 0 dB, it sounds the same as 1 kHz. Conversely, the sound pressure around 50 Hz is about 15 dB lower than the sound pressure at 1 kHz.
If it is not high, it will not sound the same size. For this reason, noise that exceeds the level of the minimum audible curve (allowable noise level)
It is understood that it is better to have a frequency characteristic given by a curve corresponding to the equal loudness curve. From this, it can be seen that correcting the allowable noise level in consideration of the equal loudness curve is suitable for human auditory characteristics.

【０１１２】次に、図９に、図１の符号器１０５ａに対
応する図３の復号器１３３ａの具体的な構成例を示す。
なお、復号器１３３ｂ乃至１３３ｅの構成および動作
は、復号器１３３ａの場合と基本的に同様であるので、
ここではその説明は省略する。Next, FIG. 9 shows a concrete configuration example of the decoder 133a of FIG. 3 corresponding to the encoder 105a of FIG.
Since the configurations and operations of the decoders 133b to 133e are basically the same as those of the decoder 133a,
Here, the description is omitted.

【０１１３】すなわち、図９に示した復号器は、例え
ば、映画フィルムから、磁気ヘッドや光学ヘッドなどに
よって読み取った各チャネルのうちの１チャネル分の符
号化された信号を復号化するものである。That is, the decoder shown in FIG. 9 is, for example, for decoding a coded signal for one channel out of each channel read from a motion picture film by a magnetic head or an optical head. .

【０１１４】図９において、入力端子６０１には図３に
示したデマルチプレクサ１３２からの符号化されている
データが供給され、これがデフォーマッタ６０２に供給
される。デフォーマッタ６０２では、図５に示したフォ
ーマッタ４０７において実行される処理とは逆の処理が
行われ、各ユニット毎の各パラメータ情報と再量子化さ
れたスペクトラム信号（すなわち量子化されたＭＤＣＴ
係数）が得られる。In FIG. 9, the encoded data from the demultiplexer 132 shown in FIG. 3 is supplied to the input terminal 601, and this is supplied to the deformatter 602. The deformatter 602 performs the reverse process of the process executed by the formatter 407 shown in FIG. 5, and each parameter information of each unit and the requantized spectrum signal (that is, the quantized MDCT).
Coefficient) is obtained.

【０１１５】上記デフォーマッタ６０２からの各ユニッ
ト毎の量子化されたＭＤＣＴ係数は、低域用の復号化回
路６０３Ｌ、中域用の復号化回路６０３Ｍ、および高域
用の復号化回路６０３Ｈにそれぞれ供給される。また、
これら復号化回路６０３Ｌ，６０３Ｍ、および６０３Ｈ
には、デフォーマッタ６０２からパラメータ情報も与え
られる。各復号化回路６０３Ｌ，６０３Ｍ、および６０
３Ｈは、このパラメータ情報を用いてビット配分を解除
すると共に復号化を行う。The quantized MDCT coefficients for each unit from the deformatter 602 are respectively supplied to the low frequency decoding circuit 603L, the middle frequency decoding circuit 603M, and the high frequency decoding circuit 603H. Supplied. Also,
These decoding circuits 603L, 603M, and 603H
Is also given parameter information from the deformatter 602. Each decoding circuit 603L, 603M, and 60
3H uses this parameter information to cancel bit allocation and perform decoding.

【０１１６】これら復号化回路６０３Ｌ乃至６０３Ｈの
出力は、それぞれ対応するＩＭＤＣＴ（逆ＭＤＣＴ）回
路６０４Ｌ乃至６０４Ｈに供給される。また、各ＩＭＤ
ＣＴ回路６０４Ｌ乃至６０４Ｈにも上記パラメータ情報
が供給され、ここでは周波数領域の信号が時間領域の信
号に変換される。これらの部分帯域の時間領域信号は、
帯域合成回路６０５により、全帯域信号に復号化され、
出力端子６０６より出力される。The outputs of these decoding circuits 603L to 603H are supplied to the corresponding IMDCT (inverse MDCT) circuits 604L to 604H. Also, each IMD
The parameter information is also supplied to the CT circuits 604L to 604H, where the frequency domain signal is converted into the time domain signal. The time domain signals of these subbands are
The band synthesis circuit 605 decodes into a full band signal,
It is output from the output terminal 606.

【０１１７】図１に示したような符号化装置によって符
号化されたデータは、ＤＶＤ（ディジタルビデオディス
ク：Digital Video Disc）やＣＤ−ＲＯＭ（コンパクト
ディスク：Compact Disc Read Only Memory）、あるい
はＭＤ（ミニディスク：MiniDisc）などの記録媒体に記
録することが可能である。従って、符号化装置により高
能率符号化されたデータを、ＤＶＤ等の記録媒体に記録
し、それをビデオディスクプレーヤなどの復号化装置に
よって再生することが可能となる。The data encoded by the encoding device as shown in FIG. 1 is a DVD (Digital Video Disc), a CD-ROM (Compact Disc: Read Only Memory), or an MD (Mini-disc). It is possible to record on a recording medium such as a disc: MiniDisc. Therefore, it becomes possible to record the data that has been encoded with high efficiency by the encoding device in a recording medium such as a DVD and reproduce it by a decoding device such as a video disc player.

【０１１８】以上のように、複数チャネルのディジタル
信号の一部または全部を混合処理した１つまたは複数チ
ャネルの混合処理データを、再現用データとして使用す
ることにより、圧縮率を高めることができる。また、オ
ーディオ信号の特性や再生環境に対応して、再現用デー
タとして使用する混合処理データの属するチャネルや、
その使用割合をフレーム単位で変更することにより、元
のオーディオ信号を再現した場合における音場の変化を
抑制することができる。As described above, the compression rate can be increased by using the mixed processed data of one or a plurality of channels, which is obtained by mixing a part or all of the digital signals of a plurality of channels, as the reproduction data. In addition, depending on the characteristics of the audio signal and the playback environment, the channel to which the mixed processing data used as the reproduction data belongs,
By changing the usage rate in units of frames, it is possible to suppress changes in the sound field when the original audio signal is reproduced.

【０１１９】また、フレームの前または前後両方のフレ
ームのディジタル信号の特性に対応して、再現用データ
として使用する混合処理データの属するチャネルやその
使用割合を変更したり、元のディジタル信号を記録しな
い周波数帯域を変更することにより、処理方法の急激な
変化によって不安定な再生音場となることを回避するこ
とができる。Further, in accordance with the characteristics of the digital signals of the frames both before and after the frame, the channel to which the mixed processing data used as the reproduction data belongs and the ratio of use thereof are changed, or the original digital signal is recorded. By changing the frequency band that is not used, it is possible to avoid an unstable reproduced sound field due to a sudden change in the processing method.

【０１２０】さらにまた、複数チャネルのディジタル信
号の一部または全部を混合処理した１つまたは複数チャ
ネルの混合処理データの一部または全部を使用して、元
のディジタル信号の再現方法のフレーム単位での変更に
対応することにより、再現時の音場の変化を抑制するこ
とができる。Furthermore, by using a part or all of the mixed processing data of one or a plurality of channels obtained by mixing a part or all of the digital signals of a plurality of channels, in a frame unit of the method of reproducing the original digital signal. By responding to the change of, it is possible to suppress the change of the sound field at the time of reproduction.

【０１２１】なお、上述した本発明の符号化方法および
復号化方法は、実施例に用いたいわゆるＡＴＲＡＣ方式
だけでなく、その他の符号化方式においても適用可能で
ある。特に直交変換により周波数情報に変換する符号化
方式において効果が高い。The above-described coding method and decoding method of the present invention can be applied not only to the so-called ATRAC method used in the embodiments but also to other coding methods. In particular, the effect is high in a coding method in which frequency information is converted by orthogonal conversion.

【０１２２】また、上記各実施例においては、複数チャ
ネルのオーディオデータを符号化または復号化する場合
について説明したが、オーディオデータに限定されるも
のではない。Further, although cases have been described with the above embodiments where the audio data of a plurality of channels are encoded or decoded, the invention is not limited to audio data.

【０１２３】また、上記各実施例においては、混合処理
したチャネル数を２チャネルとしたが、これに限定され
るものではなく、任意のチャネル数とすることが可能で
ある。その場合、上記各実施例の構成は、混合処理した
チャネル数に対応して変更されることになる。In each of the above embodiments, the number of mixed channels is two, but the number is not limited to this and any number of channels can be used. In that case, the configuration of each of the above-described embodiments is changed in accordance with the number of channels subjected to the mixing process.

【０１２４】[0124]

【発明の効果】請求項１に記載の符号化方法、および請
求項２に記載の符号化装置によれば、ディジタル信号の
特性および再生環境に対応して、少なくとも１つのチャ
ネルのディジタル信号の一部または全部の周波数帯域が
少なくとも１つの混合チャネルに混合され、ディジタル
信号から、混合チャネルの混合ディジタル信号によって
再現する信号が除かれた、個別に符号化する個別符号化
ディジタル信号が抽出され、混合ディジタル信号および
個別符号化ディジタル信号が符号化されるようにしたの
で、混合ディジタル信号に基づいて元のディジタル信号
を再現することができ、ディジタル信号の圧縮率を高め
ることができる。また、混合ディジタル信号によって復
元する元のディジタル信号の周波数帯域をフレーム毎に
変更することにより、再現時の音場の変化を抑制するこ
とが可能となる。According to the encoding method of the first aspect and the encoding apparatus of the second aspect, one of the digital signals of at least one channel is selected according to the characteristics of the digital signal and the reproduction environment. Partially or completely frequency bands are mixed into at least one mixing channel, and the individually coded individually coded digital signals are extracted from the digital signal, excluding the signal reproduced by the mixed digital signals of the mixing channels, and mixed. Since the digital signal and the individually encoded digital signal are encoded, the original digital signal can be reproduced based on the mixed digital signal, and the compression rate of the digital signal can be increased. Further, by changing the frequency band of the original digital signal restored by the mixed digital signal for each frame, it is possible to suppress the change of the sound field at the time of reproduction.

【０１２５】請求項１１に記載の復号化方法、および請
求項１２に記載の復号化装置によれば、符号化情報に基
づいて、混合ディジタル信号が復号化され、復号化され
た混合ディジタル信号の一部または全部を用いて、ディ
ジタル信号を復元するための復元用データが作成され、
個別符号化ディジタル信号と復元用データが合成され、
ディジタル信号が復元されるようにしたので、混合ディ
ジタル信号に基づいて元のディジタル信号を復元するこ
とができる。また、混合ディジタル信号を用いた元のデ
ィジタル信号の復元方法をフレーム毎に変更することに
より、復元による音場の変化を抑制することが可能とな
る。According to the decoding method of the eleventh aspect and the decoding apparatus of the twelfth aspect, the mixed digital signal is decoded based on the encoded information, and the decoded mixed digital signal The restoration data for restoring the digital signal is created using part or all of the
Individually encoded digital signal and restoration data are combined,
Since the digital signal is restored, the original digital signal can be restored based on the mixed digital signal. Also, by changing the method of restoring the original digital signal using the mixed digital signal for each frame, it is possible to suppress the change in the sound field due to the restoration.

【０１２６】請求項１８に記載の記録媒体によれば、所
定の符号化情報に基づいて符号化された混合ディジタル
信号および個別符号化ディジタル信号が記録されるよう
にしたので、高能率符号化されたディジタル信号を記録
し、それを再生することができる。また、より安定な再
生音場を提供することが可能となる。According to the recording medium of the eighteenth aspect, since the mixed digital signal and the individually coded digital signal coded based on the predetermined coded information are recorded, the high efficiency coding is performed. The recorded digital signal can be recorded and reproduced. Further, it becomes possible to provide a more stable reproduced sound field.

[Brief description of the drawings]

【図１】本発明の符号化装置の一実施例の構成を示すブ
ロック図である。FIG. 1 is a block diagram showing a configuration of an embodiment of an encoding device of the present invention.

【図２】図１の処理データ抽出器の構成例を示すブロッ
ク図である。FIG. 2 is a block diagram showing a configuration example of a processed data extractor of FIG.

【図３】本発明の復号化装置の一実施例の構成を示すブ
ロック図である。FIG. 3 is a block diagram showing the configuration of an embodiment of a decoding device of the present invention.

【図４】図３の合成データ作成器の構成例を示すブロッ
ク図である。FIG. 4 is a block diagram showing a configuration example of a combined data generator of FIG.

【図５】図１の符号器の構成例を示すブロック図であ
る。5 is a block diagram showing a configuration example of the encoder of FIG. 1. FIG.

【図６】図５の符号器を構成するビット配分器の構成例
を示すブロック図である。6 is a block diagram showing a configuration example of a bit allocator that constitutes the encoder of FIG.

【図７】バークスペクトル（ＳＢ）とマスキングスレッ
ショールドレベル（ＭＳ）について説明するための図で
ある。FIG. 7 is a diagram for explaining a Bark spectrum (SB) and a masking threshold level (MS).

【図８】信号レベル（ＳＳ）、最小可聴カーブ（Ｒ
Ｃ）、およびマスキングスレッショールド（ＭＳ）を示
した図である。FIG. 8: Signal level (SS), minimum audible curve (R
It is a figure showing C) and a masking threshold (MS).

【図９】図３の復号器の構成例を示すブロック図であ
る。9 is a block diagram showing a configuration example of the decoder of FIG.

【図１０】本発明を用いない場合のマルチチャネルオー
ディオ符号化装置の構成例を示すブロック図である。FIG. 10 is a block diagram showing a configuration example of a multi-channel audio encoding device when the present invention is not used.

【図１１】本発明を用いない場合のマルチチャネルオー
ディオの復号化装置の構成例を示すブロック図である。FIG. 11 is a block diagram showing a configuration example of a multi-channel audio decoding device when the present invention is not used.

【図１２】本発明を用いない場合の２チャネルオーディ
オの復号化装置の構成例を示すブロック図である。FIG. 12 is a block diagram showing a configuration example of a 2-channel audio decoding device in the case where the present invention is not used.

[Explanation of symbols]

１０１入力端子１０２混合器１０３処理データ抽出器１０５符号器１０６マルチプレクサ１０７出力端子１３１入力端子１３２デマルチプレクサ１３３復号器１３４合成データ作成器１３５合成器１３６出力端子２０１入力端子２０２帯域分割フィルタ２０３パラメータ記録メモリ２０４処理データ分析器２０５処理データ作成器２０６出力端子２１１入力端子２１２帯域分割フィルタ２１３パラメータ解析器２１４再現データ作成器２１５再現データ合成器２１６出力端子４０１帯域分割フィルタ４０２Ｌ低域ＭＤＣＴ回路４０２Ｍ中域ＭＤＣＴ回路４０２Ｈ高域ＭＤＣＴ回路４０３ブロックサイズ評価器４０４Ｌ，４０４Ｍ，４０４Ｈ正規化回路４０５ビット配分器４０６再量子化器４０７フォーマッタ４２４入力端子４２５出力端子５２１入力端子５２２帯域毎のエネルギ算出回路５２３畳込みフィルタ回路５２４引算器５２５（ｎ−ａｉ）関数発生回路５２６割算器５２７合成回路５２８減算器５２９遅延回路５３０許容雑音補正回路５３１出力端子５３２最小可聴カーブ発生回路５３３補正情報出力回路６０１入力端子６０２デフォーマッタ６０３Ｌ，６０３Ｍ，６０３Ｈ復号化回路６０４Ｌ，６０４Ｍ，６０４ＨＩＭＤＣＴ回路６０５帯域合成回路６０６出力端子 101 Input Terminal 102 Mixer 103 Processed Data Extractor 105 Encoder 106 Multiplexer 107 Output Terminal 131 Input Terminal 132 Demultiplexer 133 Decoder 134 Synthetic Data Generator 135 Synthesizer 136 Output Terminal 201 Input Terminal 202 Band Division Filter 203 Parameter Recording Memory 204 Processed data analyzer 205 Processed data generator 206 Output terminal 211 Input terminal 212 Band division filter 213 Parameter analyzer 214 Reproduction data generator 215 Reproduction data synthesizer 216 Output terminal 401 Band division filter 402L Low band MDCT circuit 402M Medium band MDCT Circuit 402H High-frequency MDCT circuit 403 Block size evaluator 404L, 404M, 404H Normalization circuit 405 Bit distributor 406 Requantizer 407 Formatter 424 Input terminal 425 Output terminal 521 Input terminal 522 Energy calculation circuit for each band 523 Convolution filter circuit 524 Subtractor 525 (n-ai) function generation circuit 526 Divider 527 Synthesis circuit 528 Subtractor 529 Delay circuit 530 Allowable Noise correction circuit 531 Output terminal 532 Minimum audible curve generation circuit 533 Correction information output circuit 601 Input terminal 602 Deformatter 603L, 603M, 603H Decoding circuit 604L, 604M, 604H IMDCT circuit 605 Band synthesis circuit 606 Output terminal

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁶ 識別記号庁内整理番号ＦＩ技術表示箇所Ｈ０４Ｎ 7/24 Ｈ０４Ｎ 7/13 Ｚ ──────────────────────────────────────────────────続き Continued on the front page (51) Int.Cl. ⁶ Identification code Agency reference number FI Technical display location H04N 7/24 H04N 7/13 Z

Claims

[Claims]

1. An encoding method for encoding a digital signal of a plurality of frames of a plurality of channels, comprising: corresponding to a characteristic of the digital signal and a reproduction environment,
A frequency band of a part or the whole of the digital signal of at least one of the channels is mixed into at least one mixing channel, and the digital signal except the signal reproduced by the mixing digital signal of the mixing channel is individually coded. An encoding method comprising extracting an individual encoded digital signal to be encoded and encoding the mixed digital signal and the individual encoded digital signal.

2. An encoding device for encoding digital signals of a plurality of frames of a plurality of channels, wherein:
Mixing means for mixing part or all of the frequency band of the digital signal of at least one of the channels into at least one mixing channel; and excluding the signal reproduced from the mixed digital signal of the mixing channel from the digital signal, An encoding device comprising: an extraction unit that extracts an individually encoded digital signal to be individually encoded, and an encoding unit that encodes the mixed digital signal and the individually encoded digital signal.

3. The individually coded digital signal, wherein the extraction means sets the mixing channel used for reproducing the digital signal in the frame unit, and excludes the mixing digital signal of the set mixing channel. The coding device according to claim 2, wherein the signal is extracted.

4. The extraction unit changes the ratio of the mixed channel to which the mixed digital signal used for reproducing the digital signal belongs, in units of the frames. Encoding device.

5. The extracting means changes the frequency band, in which the original digital signal is not encoded, in units of the frame, corresponding to the mixed digital signal used to reproduce the digital signal. The encoding device according to claim 2.

6. The encoding device according to claim 2, wherein the characteristic of the digital signal is a frequency characteristic.

7. The encoding device according to claim 2, wherein the characteristic of the digital signal is an amplitude characteristic.

8. The characteristic of the digital signal includes the characteristic of the digital signal of another frame before or after a predetermined one of the frame to be encoded, or both before and after. Item 2. The encoding device according to item 2.

9. The encoding device according to claim 2, wherein the digital signal is a digital audio signal.

10. The encoding apparatus according to claim 2, wherein the digital signal is a frequency component obtained by converting a time axis into a frequency axis.

11. A part or all of frequency bands of a digital signal of at least one channel of a plurality of channels are mixed in at least one mixing channel, and corresponding to a frequency characteristic of the digital signal and a reproduction environment,
An individual coded digital signal to be individually coded is extracted from the digital signal excluding a signal reproduced by the mixed digital signal of the mixed channel, and the mixed digital signal coded based on predetermined coding information. And a decoding method for decoding the individual coded digital signal, wherein the mixed digital signal is decoded based on the coded information, and a part or all of the decoded mixed digital signal is used. A decoding method comprising: creating restoration data for restoring the digital signal, synthesizing the individually encoded digital signal and the restoration data, and restoring the digital signal.

12. A part or all of frequency bands of digital signals of at least one channel of a plurality of channels are mixed in at least one mixing channel, and corresponding to frequency characteristics and reproduction environment of the digital signals,
An individual coded digital signal to be individually coded is extracted from the digital signal excluding a signal reproduced by the mixed digital signal of the mixed channel, and the mixed digital signal coded based on predetermined coding information. And a decoding device for decoding the individual coded digital signal, wherein the decoding means decodes the mixed digital signal based on the coded information, and the mixing means decoded by the decoding means. Restoration data creating means for creating restoration data for restoring the digital signal using part or all of the digital signal, the restoration data created by the restoration data creating means, and the individual Recovery means for synthesizing the encoded digital signals and restoring the digital signals. Apparatus.

13. The restoration data creating means changes the mixed channel to which the mixed digital signal used when creating the restoration data belongs, on a frame-by-frame basis. Decoding device.

14. The restoration data creating means changes the proportion of the mixed channel to which the mixed digital signal used when creating the restoration data belongs in the frame unit. Decoding device as described.

15. The decoding device according to claim 12, wherein a frequency band of the digital signal restored by the restoration data is changed in the frame unit.

16. The encoding device according to claim 12, wherein the digital signal is a digital audio signal.

17. The encoding apparatus according to claim 12, wherein the digital signal is a frequency component obtained by converting a time axis into a frequency axis.

18. A part or all of the frequency bands of digital signals of at least one channel of a plurality of channels are mixed in at least one mixing channel, and the digital signals are generated according to the frequency characteristics and reproduction environment of the digital signals. An individual coded digital signal to be individually coded is extracted from a signal excluding a signal reproduced by the mixed digital signal of the mixed channel, and the mixed digital signal coded based on predetermined coding information and the A recording medium on which an individually encoded digital signal is recorded.