JP3344581B2

JP3344581B2 - Audio coding device

Info

Publication number: JP3344581B2
Application number: JP2000323049A
Authority: JP
Inventors: 徳彦渕上; 昭治植野; 美昭田中
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 1998-10-13
Filing date: 2000-10-23
Publication date: 2002-11-11
Anticipated expiration: 2019-10-13
Also published as: JP2001195095A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、音声信号を予測符
号化して圧縮するための音声符号化装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech coding apparatus for predictively coding and compressing a speech signal.

【０００２】[0002]

【従来の技術】音声信号を予測符号化する方法として、
本発明者は先の出願（特願平９−２８９１５９号）にお
いて１チャネル（チャンネル）の原デジタル音声信号に
対して、特性が異なる複数の予測器により時間領域にお
ける過去の信号から現在の信号の複数の線形予測値を算
出し、原デジタル音声信号と、この複数の線形予測値か
ら予測器毎の予測残差を算出し、この複数の予測残差の
最小値を選択する方法を提案している。2. Description of the Related Art As a method of predictive encoding of a speech signal,
In the prior application (Japanese Patent Application No. 9-289159), the present inventor has applied a plurality of predictors having different characteristics to an original digital audio signal of one channel (channel) from a past signal in the time domain to a current signal. A method of calculating a plurality of linear prediction values, calculating a prediction residual for each predictor from the original digital audio signal and the plurality of linear prediction values, and selecting a minimum value of the plurality of prediction residuals; I have.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、上記方
法では原デジタル音声信号がサンプリング周波数＝９６
ｋＨｚ、量子化ビット数＝２０ビット程度の場合に、あ
る程度の圧縮効果を得ることができるが、近年のＤＶＤ
オーディオディスクではこの２倍のサンプリング周波数
（＝１９２ｋＨｚ）が使用され、また、量子化ビット数
も２４ビットが使用される傾向があるので、圧縮率を改
善する必要がある。また、近年のＤＶＤオーディオディ
スクでは、マルチチャネルが利用され、チャネル数が最
大６となるので圧縮率を改善する必要がある。However, in the above method, the original digital audio signal has a sampling frequency = 96.
In the case of kHz and the number of quantization bits = about 20 bits, a certain compression effect can be obtained.
Audio discs use twice this sampling frequency (= 192 kHz), and the number of quantization bits tends to use 24 bits. Therefore, it is necessary to improve the compression ratio. Further, in recent DVD audio disks, multi-channels are used, and the number of channels is up to six, so that it is necessary to improve the compression ratio.

【０００４】そこで本発明は、音声信号を予測符号化す
る場合に圧縮率を改善することができる音声符号化装置
を提供することを目的とする。Accordingly, an object of the present invention is to provide a speech coding apparatus capable of improving a compression ratio when predicting and coding a speech signal.

【０００５】[0005]

【課題を解決するための手段】本発明は上記目的を達成
するために、以下の手段よりなる。すなわち、The present invention comprises the following means to achieve the above object. That is,

【０００６】３以上のマルチチャネルの音声信号中の少
なくとも選択された第１及び第２の２つのチャネルの音
声信号をマトリクス演算して互いに相関ある２つの相関
チャネルに変換するチャネル相関手段と、前記チャネル
相関手段により変換された２つの相関チャネルを含む音
声信号を、チャネル毎に、入力される音声信号に応答し
て先頭サンプル値を得ると共に、特性が異なる複数の線
形予測方法により時間領域の過去から現在の信号の線形
予測値がそれぞれ予測され、その予測される線形予測値
と前記音声信号とから得られる予測残差が最小となるよ
うな線形予測方法を選択して予測符号化する予測符号化
手段と、ヘッダ情報と、圧縮ＰＣＭアクセスユニットを
含むユーザデータと、を含んだデータ構造にすると共
に、前記予測符号化手段により選択された各チャネルの
線形予測方法と予測残差と所定の先頭サンプル値を含む
予測符号化データを、前記圧縮ＰＣＭアクセスユニット
内に配置されるサブパケット内に格納するフォーマット
化手段とを、有する音声符号化装置。[0006] 3 or more multi-channel at least selected first and second of the two channels two correlation <br/> channels in correlation with each other a sound <br/> voice signal by matrix operation in the speech signal Channel correlation means for converting the audio signal including the two correlated channels converted by the channel correlation means into a plurality of channels having different characteristics while obtaining a leading sample value in response to an input audio signal for each channel. Line
Shape of past to present signal in time domain by shape prediction method
Each predicted value is predicted and its predicted linear predicted value
And the prediction residual obtained from the audio signal is minimized .
And predictive coding means for predictive coding by selecting a linear prediction method UNA, and header information, the compressed PCM access unit
And the user data that contains
In each of the channels selected by the predictive encoding means,
Includes linear prediction method, prediction residuals, and predetermined first sample value
The prediction coded data is stored in the compressed PCM access unit.
And a formatting means for storing in a sub-packet arranged in the audio encoding apparatus.

【０００７】[0007]

【発明の実施の形態】以下、図面を参照して本発明の実
施の形態を説明する。図１は本発明に係る音声符号化装
置とそれに対応した音声復号装置の第１の実施形態を示
すブロック図、図２は図１のエンコーダを詳しく示すブ
ロック図、図３は図２のマルチプレクサにより多重化さ
れる１フレームのフォーマットを示す説明図、図４はＤ
ＶＤのパックのフォーマットを示す説明図、図５はＤＶ
Ｄのオーディオパックのフォーマットを示す説明図、図
６は図１のデコーダを詳しく示すブロック図である。Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing a first embodiment of a speech coding apparatus according to the present invention and a speech decoding apparatus corresponding thereto, FIG. 2 is a block diagram showing the encoder of FIG. 1 in detail, and FIG. FIG. 4 is an explanatory diagram showing a format of one frame to be multiplexed, and FIG.
FIG. 5 is an explanatory diagram showing a format of a VD pack, and FIG.
FIG. 6 is an explanatory diagram showing the format of the audio pack D. FIG. 6 is a block diagram showing the decoder of FIG. 1 in detail.

【０００８】図１に示すチャネル相関回路Ａは加算回路
１ａと減算回路１ｂを有する。加算回路１ａは各チャネ
ル（以下、ch）が例えばサンプリング周波数＝１９２ｋ
Ｈｚ、量子化ビット数＝２４ビットのステレオ２ch信号
Ｌ、Ｒの和信号（Ｌ＋Ｒ）を算出して和ch用１chロスレ
ス・エンコーダ２Ｄ１に出力し、減算回路１ｂは差信号
（Ｌ−Ｒ）を算出して差ch用１chロスレス・エンコーダ
２Ｄ２に出力する。エンコーダ２Ｄ１、２Ｄ２は図２に
詳しく示すように、それぞれ和信号（Ｌ＋Ｒ）、差信号
（Ｌ−Ｒ）の差分Δ（Ｌ＋Ｒ）、Δ（Ｌ−Ｒ）を予測符
号化して記録媒体や通信媒体を介して伝送する。The channel correlation circuit A shown in FIG. 1 has an addition circuit 1a and a subtraction circuit 1b. In the addition circuit 1a, each channel (hereinafter, ch) has, for example, a sampling frequency = 192 k
Calculates the sum signal (L + R) of the stereo 2ch signals L and R with the frequency and the number of quantization bits = 24 bits and outputs the sum signal to the 1ch lossless encoder 2D1 for the sum channel, and the subtraction circuit 1b calculates the difference signal (LR). The calculated value is output to the 1ch lossless encoder 2D2 for the difference channel. As shown in detail in FIG. 2, the encoders 2D1 and 2D2 predictively encode differences Δ (L + R) and Δ (LR) of the sum signal (L + R) and the difference signal (LR), respectively, and Transmitted via.

【０００９】そして、復号側では、図６に詳しく示すよ
うにデコーダ３Ｄ１、３Ｄ２がそれぞれ各chの予測符号
化データを和信号（Ｌ＋Ｒ）、差信号（Ｌ−Ｒ）に復号
し、次いでチャネル相関回路Ｂがこの和信号（Ｌ＋
Ｒ）、差信号（Ｌ−Ｒ）をステレオ２ch信号Ｌ、Ｒに復
元する。On the decoding side, the decoders 3D1 and 3D2 decode the prediction coded data of each channel into a sum signal (L + R) and a difference signal (LR), respectively, as shown in detail in FIG. Circuit B outputs the sum signal (L +
R) and the difference signal (LR) are restored to stereo 2-ch signals L and R.

【００１０】図２を参照してエンコーダ２Ｄ１、２Ｄ２
について詳しく説明する。和信号（Ｌ＋Ｒ）と差信号
（Ｌ−Ｒ）は１フレーム毎に１フレームバッファ１０に
格納される。そして、１フレームの各サンプル値（Ｌ＋
Ｒ）、（Ｌ−Ｒ）がそれぞれ差分演算回路１１Ｄ１、１
１Ｄ２に印加され、今回と前回の差分Δ（Ｌ＋Ｒ）、Δ
（Ｌ−Ｒ）、すなわち差分ＰＣＭ（ＤＰＣＭ）データが
算出される。また、各フレームの先頭サンプル値（Ｌ＋
Ｒ）、（Ｌ−Ｒ）がマルチプレクサ１９に印加される。Referring to FIG. 2, encoders 2D1, 2D2
Will be described in detail. The sum signal (L + R) and the difference signal (LR) are stored in one frame buffer 10 for each frame. Then, each sample value (L +
R) and (LR) are the difference calculation circuits 11D1, 1D1,
1D2, the difference Δ (L + R), Δ
(LR), that is, differential PCM (DPCM) data is calculated. Also, the first sample value (L +
R) and (LR) are applied to the multiplexer 19.

【００１１】差分演算回路１１Ｄ１により算出された差
分Δ（Ｌ＋Ｒ）は、予測係数が異なる複数の予測器１２
ａ−１〜１２ａ−ｎと減算器１３ａ−１〜１３ａ−ｎに
印加される。そして、予測器１２ａ−１〜１２ａ−ｎで
はそれぞれ各予測係数に基づいて差分Δ（Ｌ＋Ｒ）の各
予測値が算出され、減算器１３ａ−１〜１３ｂ−ｎでは
それぞれこの各予測値と差分Δ（Ｌ＋Ｒ）の各予測残差
が算出される。バッファ・選択器１６Ｄ１はこの複数の
予測残差を一時記憶して、選択信号生成器１７により指
定されたサブフレーム毎に最小の予測残差を選択し、パ
ッキング回路１８に出力する。なお、このサブフレーム
はフレームの数十分の１程度のサンプル長であり、一例
として１フレームを８０サブフレームとする。ここで、
予測器１２ａ−１〜１２ａ−ｎと減算器１３ａ−１〜１
３ａ−ｎは和信号chの予測回路１５Ｄ１を構成し、ま
た、この予測回路１５Ｄ１とバッファ・選択器１６Ｄ１
は和信号chの予測符号化回路を構成している。The difference Δ (L + R) calculated by the difference calculation circuit 11D1 is calculated by a plurality of predictors 12 having different prediction coefficients.
a-1 to 12a-n and subtracters 13a-1 to 13a-n. Then, the predictors 12a-1 to 12a-n calculate the respective predicted values of the difference Δ (L + R) based on the respective prediction coefficients, and the subtractors 13a-1 to 13b-n calculate the respective predicted values and the difference Δ Each prediction residual of (L + R) is calculated. The buffer / selector 16D1 temporarily stores the plurality of prediction residuals, selects the minimum prediction residual for each subframe specified by the selection signal generator 17, and outputs the selected prediction residual to the packing circuit 18. Note that this subframe has a sample length of about one-tenth of a frame, and one frame is, for example, 80 subframes. here,
Predictors 12a-1 to 12a-n and subtracters 13a-1 to 13a-1
3a-n constitute a prediction circuit 15D1 for the sum signal ch, and the prediction circuit 15D1 and the buffer / selector 16D1
Constitutes a predictive encoding circuit for the sum signal ch.

【００１２】同様に、差分演算回路１１Ｄ２により算出
された差分Δ（Ｌ−Ｒ）は、予測係数が異なる複数の予
測器１２ｂ−１〜１２ｂ−ｎと減算器１３ｂ−１〜１３
ｂ−ｎに印加される。そして、予測器１２ｂ−１〜１２
ｂ−ｎではそれぞれ各予測係数に基づいて差分Δ（Ｌ−
Ｒ）の各予測値が算出され、減算器１３ｂ−１〜１３ｂ
−ｎではそれぞれこの各予測値と差分Δ（Ｌ−Ｒ）の各
予測残差が算出される。バッファ・選択器１６Ｄ２はこ
の複数の予測残差を一時記憶して、選択信号生成器１７
により指定されたサブフレーム毎に最小の予測残差を選
択し、パッキング回路１８に出力する。予測器１２ｂ−
１〜１２ｂ−ｎと減算器１３ｂ−１〜１３ｂ−ｎは差信
号chの予測回路１５Ｄ２を構成し、また、この予測回路
１５Ｄ２とバッファ・選択器１６Ｄ２は差信号chの予測
符号化回路を構成している。Similarly, the difference Δ (LR) calculated by the difference calculation circuit 11D2 is calculated by using a plurality of predictors 12b-1 to 12b-n and subtracters 13b-1 to 13b-13 having different prediction coefficients.
b-n. Then, the predictors 12b-1 to 12b-12
bn, the difference Δ (L−
R) are calculated, and the subtractors 13b-1 to 13b
In −n, each prediction residual of the difference Δ (LR) from each of the prediction values is calculated. The buffer / selector 16D2 temporarily stores the plurality of prediction residuals, and
The minimum prediction residual is selected for each sub-frame specified by, and is output to the packing circuit 18. Predictor 12b-
1 to 12b-n and the subtractors 13b-1 to 13b-n form a prediction circuit 15D2 for the difference signal ch, and the prediction circuit 15D2 and the buffer / selector 16D2 form a prediction encoding circuit for the difference signal ch. are doing.

【００１３】選択信号生成器１７は予測残差のビット数
フラグ（５ビット）をパッキング回路１８とマルチプレ
クサ１９に対して印加し、また、予測残差が最小の予測
器を示す予測器選択フラグ（その数ｎが２〜９個として
３ビット）をマルチプレクサ１９に対して印加する。パ
ッキング回路１８はバッファ・選択器１６Ｄ１、１６Ｄ
２により選択された２ch分の予測残差を、選択信号生成
器１７により指定されたビット数フラグに基づいて指定
ビット数でパッキングする。The selection signal generator 17 applies a bit number flag (5 bits) of the prediction residual to the packing circuit 18 and the multiplexer 19, and a predictor selection flag (predictor) indicating the predictor having the minimum prediction residual. The number n is 2 to 9 and 3 bits) are applied to the multiplexer 19. The packing circuit 18 includes buffer / selectors 16D1, 16D
The prediction residual for 2 ch selected by 2 is packed with the specified number of bits based on the bit number flag specified by the selection signal generator 17.

【００１４】続くマルチプレクサ１９は図３に示すよう
に１フレーム分に対して・フレームヘッダ（４０ビット）と、・和信号ｃｈ（Ｌ＋Ｒ）の１フレームの先頭サンプル値
（２５ビット）と、・差信号ｃｈ（Ｌ−Ｒ）の１フレームの先頭サンプル値
（２５ビット）と、・和信号ｃｈ（Ｌ＋Ｒ）のサブフレーム毎の予測器選択
フラグ（３ビット×８０）と、・差信号ｃｈ（Ｌ−Ｒ）のサブフレーム毎の予測器選択
フラグ（３ビット×８０）と、・和信号ｃｈ（Ｌ＋Ｒ）のサブフレーム毎のビット数フ
ラグ（５ビット×８０）と、・差信号ｃｈ（Ｌ−Ｒ）のサブフレーム毎のビット数フ
ラグ（５ビット×８０）と、・和信号ｃｈ（Ｌ＋Ｒ）の予測残差データ列（可変ビッ
ト数）と、・差信号ｃｈ（Ｌ−Ｒ）の予測残差データ列（可変ビッ
ト数）とをアクセスユニットとして多重化し、可変レー
トビットストリームとして出力する。上記予測残差デー
タ列はサブパケットを構成する。このような予測符号化
によれば、原信号が例えばサンプリング周波数＝１９２
ｋＨｚ、量子化ビット数＝２４ビット、２チャネルの場
合、５９％の圧縮率を実現することができる。As shown in FIG. 3, the following multiplexer 19 outputs a frame header (40 bits), a first sample value (25 bits) of one frame of the sum signal ch (L + R) for one frame, A head sample value (25 bits) of one frame of the signal ch (LR); a predictor selection flag (3 bits × 80) for each subframe of the sum signal ch (L + R); a difference signal ch (L) −R) a predictor selection flag (3 bits × 80) for each subframe; a sum signal ch (L + R) a bit number flag for each subframe (5 bits × 80); a difference signal ch (L− R) the number-of-bits flag (5 bits × 80) for each sub-frame; a prediction residual data sequence (variable number of bits) of the sum signal ch (L + R); and a prediction residual of the difference signal ch (LR). Difference data string (variable bit ) And multiplexed as an access unit, output as a variable rate bit stream. The prediction residual data sequence forms a subpacket. According to such predictive coding, the original signal is, for example, sampling frequency = 192.
In the case of kHz, the number of quantization bits = 24 bits, and two channels, a compression ratio of 59% can be realized.

【００１５】また、この可変レートビットストリームデ
ータをＤＶＤオーディオディスクに記録する場合には、
図４に示す圧縮ＰＣＭのオーディオ（Ａ）パックにパッ
キングされる。このパックは２０３４バイトのユーザデ
ータ（Ａパケット、Ｖパケット）に対して４バイトのパ
ックスタート情報と、６バイトのＳＣＲ（System Clock
Reference：システム時刻基準参照値）情報と、３バイ
トのMux レート（rate）情報と１バイトのスタッフィン
グの合計１４バイトのパックヘッダが付加されて構成さ
れている（１パック＝合計２０４８バイト）。この場
合、タイムスタンプであるＳＣＲ情報を、ＡＣＢユニッ
ト内の先頭パックでは「１」として同一タイトル内で連
続とすることにより同一タイトル内のＡパックの時間を
管理することができる。When recording the variable rate bit stream data on a DVD audio disk,
It is packed in the audio (A) pack of the compressed PCM shown in FIG. This pack has 4 bytes of pack start information for 2034 bytes of user data (A packet and V packet) and 6 bytes of SCR (System Clock).
Reference: system time reference value, 3-byte Mux rate (rate) information, and 1-byte stuffing for a total of 14-byte pack header (1 pack = 2048 bytes in total). In this case, the time of the A pack in the same title can be managed by setting the SCR information as the time stamp to be “1” in the first pack in the ACB unit so as to be continuous in the same title.

【００１６】圧縮ＰＣＭのＡパケットは図５に詳しく示
すように、１７、９又は１４バイトのパケットヘッダ
と、プライベートヘッダと、図３に示すフォーマットの
１ないし２０１５バイトのオーディオ圧縮ＰＣＭデータ
により構成されている。圧縮ＰＣＭのプライベートヘッ
ダは、・１バイトのサブストリームＩＤと、・２バイトのＵＰＣ／ＥＡＮ−ＩＳＲＣ（Universal Pr
oduct Code/European Article Number-International S
tandard Recording Code）番号、及びＵＰＣ／ＥＡＮ−
ＩＳＲＣデータと、・１バイトのプライベートヘッダ長と、・２バイトの第１アクセスユニットポインタと、・４バイトのオーディオデータ情報（ＡＤＩ）と、・０〜７バイトのスタッフィングバイトとに、より構成
されている。このように圧縮ＰＣＭのＡパケットのＡＤ
Ｉは、４バイトに選定され、通常の非圧縮のＰＣＭのＡ
パケットのＡＤＩよりも４バイトだけ短くされている。
したがってオーディオデータは４バイト分増加させるこ
とができる。As shown in detail in FIG. 5, the A packet of the compressed PCM is composed of a packet header of 17, 9, or 14 bytes, a private header, and audio compressed PCM data of 1 to 2015 bytes in the format shown in FIG. ing. The private header of the compressed PCM is: 1-byte substream ID, 2 bytes of UPC / EAN-ISRC (Universal Prism).
oduct Code / European Article Number-International S
tandard Recording Code) number and UPC / EAN-
ISRC data, 1-byte private header length, 2-byte first access unit pointer, 4 bytes of audio data information (ADI), and 0 to 7 bytes of stuffing bytes. ing. Thus, the AD of the A packet of the compressed PCM
I is chosen to be 4 bytes and is the normal uncompressed PCM A
It is shorter by 4 bytes than the ADI of the packet.
Therefore, the audio data can be increased by 4 bytes.

【００１７】次に図６を参照してデコーダ３Ｄ１、３Ｄ
２について説明する。図３に示したフォーマットの可変
レートビットストリームデータは、デマルチプレクサ２
１によりフレームヘッダに基づいて分離される。そし
て、和信号ｃｈ（Ｌ＋Ｒ）及び差信号ｃｈ（Ｌ−Ｒ）の
１フレームの先頭サンプル値はそれぞれ累積演算回路２
５ａ、２５ｂに印加され、和信号ｃｈ（Ｌ＋Ｒ）及び差
信号ｃｈ（Ｌ−Ｒ）の予測器選択フラグはそれぞれ予測
器（２４ａ−１〜２４ａ−ｎ）、（２４ｂ−１〜２４ｂ
−ｎ）の各選択信号として印加され、和信号ｃｈ（Ｌ＋
Ｒ）及び差信号ｃｈ（Ｌ−Ｒ）のビット数フラグと予測
残差データ列はアンパッキング回路２２に印加される。
ここで、予測器（２４ａ−１〜２４ａ−ｎ）、（２４ｂ
−１〜２４ｂ−ｎ）はそれぞれ、符号化側の予測器（１
２ａ−１〜１２ａ−ｎ）、（１２ｂ−１〜１２ｂ−ｎ）
と同一の特性であり、予測器選択フラグにより同一特性
のものが選択される。Next, referring to FIG. 6, decoders 3D1, 3D
2 will be described. The variable rate bit stream data of the format shown in FIG.
1 based on the frame header. Then, the leading sample values of one frame of the sum signal ch (L + R) and the difference signal ch (LR) are respectively calculated by the accumulation operation circuit 2.
5a and 25b, the predictor selection flags of the sum signal ch (L + R) and the difference signal ch (LR) are set to predictors (24a-1 to 24a-n) and (24b-1 to 24b, respectively).
−n), and the sum signal ch (L +
R) and the bit number flag of the difference signal ch (LR) and the prediction residual data string are applied to the unpacking circuit 22.
Here, the predictors (24a-1 to 24a-n), (24b
-1 to 24b-n) are predictors (1) on the encoding side, respectively.
2a-1 to 12a-n), (12b-1 to 12b-n)
And the same characteristic is selected by the predictor selection flag.

【００１８】アンパッキング回路２２は和信号ｃｈ（Ｌ
＋Ｒ）及び差信号ｃｈ（Ｌ−Ｒ）の予測残差データ列を
ビット数フラグ毎に基づいて分離してそれぞれ加算回路
２３ａ、２３ｂに出力する。加算回路２３ａ、２３ｂで
はそれぞれ、アンパッキング回路２２からの和信号ｃｈ
（Ｌ＋Ｒ）及び差信号ｃｈ（Ｌ−Ｒ）の今回の予測残差
データと、予測器（２４ａ−１〜２４ａ−ｎ）、（２４
ｂ−１〜２４ｂ−ｎ）の内、予測器選択フラグにより選
択された各１つにより予測された前回の予測値が加算さ
れて今回の予測値が算出される。この今回の予測値は、
図２に示す差分回路１１ａ、１１ｂによりそれぞれ算出
された差分Δ（Ｌ＋Ｒ）、Δ（Ｌ−Ｒ）すなわちＤＰＣ
Ｍデータであり、予測器（２４ａ−１〜２４ａ−ｎ）、
（２４ｂ−１〜２４ｂ−ｎ）と累積演算回路２５ａ、２
５ｂに印加される。The unpacking circuit 22 outputs a sum signal ch (L
+ R) and the prediction residual data sequence of the difference signal ch (LR) are separated based on the bit number flags and output to the adders 23a and 23b, respectively. The adder circuits 23a and 23b respectively add the sum signal ch from the unpacking circuit 22.
(L + R) and the current prediction residual data of the difference signal ch (LR), and the predictors (24a-1 to 24a-n), (24
b-1 to 24b-n), the previous predicted value predicted by each one selected by the predictor selection flag is added to calculate the current predicted value. This forecast is
The differences Δ (L + R) and Δ (LR) calculated by the difference circuits 11a and 11b shown in FIG.
M data, predictors (24a-1 to 24a-n),
(24b-1 to 24b-n) and cumulative operation circuits 25a, 2
5b.

【００１９】累積演算回路２５ａ、２５ｂはそれぞれ、
１フレームの先頭サンプル値に対して差分Δ（Ｌ＋
Ｒ）、Δ（Ｌ−Ｒ）をサンプル毎に累積加算して和信号
ｃｈ（Ｌ＋Ｒ）、差信号ｃｈ（Ｌ−Ｒ）の各ＰＣＭデー
タを出力する。この和信号（Ｌ＋Ｒ）、差信号（Ｌ−
Ｒ）は図１に示すように加算回路４ａにより２Ｌ信号が
算出されるとともに、減算回路４ｂにより２Ｒ信号が算
出される。そして、２Ｌ信号と２Ｒ信号がそれぞれ割り
算器５ａ、５ｂにより１／２に割り算され、元のステレ
オ２チャネル信号Ｌ、Ｒが復元される。The cumulative operation circuits 25a and 25b are respectively
The difference Δ (L +
R) and Δ (LR) are cumulatively added for each sample to output PCM data of a sum signal ch (L + R) and a difference signal ch (LR). The sum signal (L + R) and the difference signal (L−
As for R), as shown in FIG. 1, a 2L signal is calculated by the adding circuit 4a, and a 2R signal is calculated by the subtracting circuit 4b. Then, the 2L signal and the 2R signal are each divided by 1/2 by the dividers 5a and 5b, and the original stereo two-channel signals L and R are restored.

【００２０】次に図７、図８を参照して第２の実施形態
について説明する。上記の実施形態では、和信号（Ｌ＋
Ｒ）、差信号（Ｌ−Ｒ）の各差分Δ（Ｌ＋Ｒ）、Δ（Ｌ
−Ｒ）、すなわちＤＰＣＭデータのみを予測符号化する
ように構成されているが、この第２の実施形態では和信
号（Ｌ＋Ｒ）、差信号（Ｌ−Ｒ）すなわちＰＣＭデー
タ、又はその各差分Δ（Ｌ＋Ｒ）、Δ（Ｌ−Ｒ）すなわ
ちＤＰＣＭデータを選択的に予測符号化するように構成
されている。Next, a second embodiment will be described with reference to FIGS. In the above embodiment, the sum signal (L +
R), each difference Δ (L + R), Δ (L) of the difference signal (LR)
-R), that is, predictive encoding is performed only on the DPCM data. In the second embodiment, the sum signal (L + R), the difference signal (LR), ie, the PCM data, or each difference Δ (L + R), Δ (LR), that is, DPCM data is selectively and predictively encoded.

【００２１】このため図７に示す符号化装置では、図２
に示す構成に対して和信号（Ｌ＋Ｒ）、差信号（Ｌ−
Ｒ）をそれぞれ予測符号化するための予測回路１５Ａ、
１５Ｓとバッファ・選択器１６Ａ、１６Ｓが追加されて
いる。また、選択信号生成器１７はバッファ・選択器１
６Ａ、１６Ｓによりそれぞれ選択された和信号（Ｌ＋
Ｒ）、差信号（Ｌ−Ｒ）と、バッファ・選択器１６Ｄ
１、１６Ｄ２によりそれぞれ選択された差分Δ（Ｌ＋
Ｒ）、Δ（Ｌ−Ｒ）の各予測残差の最小値に基づいて、
ＰＣＭデータとＤＰＣＭデータのどちらが圧縮率が高い
か否かを判断し、高い方のデータを選択する。このと
き、そのＰＣＭ／ＤＰＣＭの選択フラグ（予測回路選択
フラグ）を追加して多重化する。For this reason, the encoding apparatus shown in FIG.
The sum signal (L + R) and the difference signal (L−
R) for predictive encoding, respectively.
15S and buffer / selectors 16A and 16S are added. The selection signal generator 17 is a buffer / selector 1
6A, the sum signal (L +
R), the difference signal (LR) and the buffer / selector 16D
1, ΔD (L +
R), Δ (LR) based on the minimum value of each prediction residual.
It is determined whether the compression ratio of PCM data or DPCM data is higher, and the higher data is selected. At this time, the PCM / DPCM selection flag (prediction circuit selection flag) is added and multiplexed.

【００２２】ここで、図７に示す和信号（Ｌ＋Ｒ）の予
測回路１５Ａと差分Δ（Ｌ＋Ｒ）の予測回路１５Ｄ１が
同一の構成であり、また、差信号（Ｌ−Ｒ）の予測回路
１５Ｓと差分Δ（Ｌ−Ｒ）の予測回路１５Ｄ２が同一の
構成である場合、復号装置では図８に示すようにＰＣＭ
データとＤＰＣＭデータの両方の予測回路を設ける必要
はなく、１つのデータ分の予測回路でよい。そして、符
号化装置から伝送された予測回路選択フラグに基づいて
セレクタ２６ａ、２６ｂにより、ＤＰＣＭデータの場合
には累積演算回路２５ａ、２５ｂの出力を選択し、ＰＣ
Ｍデータの場合には加算回路２３ａ、２３ｂの出力を選
択する。Here, the prediction circuit 15A for the sum signal (L + R) and the prediction circuit 15D1 for the difference Δ (L + R) shown in FIG. 7 have the same configuration, and the prediction circuit 15S for the difference signal (L−R) has the same configuration. When the difference Δ (LR) prediction circuits 15D2 have the same configuration, the decoding apparatus uses the PCM as shown in FIG.
It is not necessary to provide a prediction circuit for both data and DPCM data, and a prediction circuit for one data may be used. Then, based on the prediction circuit selection flag transmitted from the encoding device, the selectors 26a and 26b select the outputs of the accumulator circuits 25a and 25b in the case of DPCM data.
In the case of M data, the outputs of the adders 23a and 23b are selected.

【００２３】第３の実施形態では図９に示すように、原
信号Ｌ、Ｒ（ＰＣＭデータ）と、和信号（Ｌ＋Ｒ）、差
信号（Ｌ−Ｒ）（ＰＣＭデータ）と、その各差分Δ（Ｌ
＋Ｒ）、Δ（Ｌ−Ｒ）（ＤＰＣＭデータ）の３グループ
の１つを選択的に予測符号化するように構成されてい
る。In the third embodiment, as shown in FIG. 9, original signals L and R (PCM data), a sum signal (L + R), a difference signal (LR) (PCM data), and each difference Δ (L
+ R) and one of the three groups Δ (LR) (DPCM data) are selectively predictively encoded.

【００２４】このため図９に示す符号化装置では、図７
に示す構成に対して原信号Ｌ、Ｒをそれぞれ予測符号化
するための予測回路１５Ｌ、１５Ｒとバッファ・選択器
１６Ｌ、１６Ｒが追加されている。また、選択信号生成
器１７はバッファ・選択器１６Ｌ、１６Ｒにより選択さ
れた原信号Ｌ、Ｒと、バッファ・選択器１６Ａ、１６Ｓ
により選択された和信号（Ｌ＋Ｒ）、差信号（Ｌ−Ｒ）
と、バッファ・選択器１６Ｄ１、１６Ｄ２により選択さ
れた各差分Δ（Ｌ＋Ｒ）、Δ（Ｌ−Ｒ）の各予測残差の
最小値に基づいて圧縮率が高いグループのデータを選択
する。このとき、その選択フラグ（予測回路選択フラ
グ）を追加して多重化する。For this reason, in the encoding device shown in FIG.
In addition to the configuration shown in (1), prediction circuits 15L and 15R for predictively encoding the original signals L and R, and buffers / selectors 16L and 16R are added. Further, the selection signal generator 17 includes the original signals L and R selected by the buffer / selectors 16L and 16R and the buffer / selectors 16A and 16S.
Signal (L + R), difference signal (LR) selected by
And data of a group having a high compression rate based on the minimum values of the prediction residuals of the differences Δ (L + R) and Δ (LR) selected by the buffer / selectors 16D1 and 16D2. At this time, the selection flag (prediction circuit selection flag) is added and multiplexed.

【００２５】また、図９に示す３グループの予測回路が
同一の構成である場合、復号装置では図１０に示すよう
に３グループ分の予測回路を設ける必要はなく、１つの
グループ分の予測回路でよい。そして、符号化装置から
伝送された予測回路選択フラグに基づいて、ＤＰＣＭデ
ータの場合には累積演算回路２５ａ、２５ｂの出力を選
択し、ＰＣＭデータの場合には加算回路２３ａ、２３ｂ
の出力を選択してチャネル相関回路Ｂにより原信号Ｌ、
Ｒを復元する。そして、更にセレクタ２７ａ、２７ｂに
より原信号Ｌ、Ｒのグループの場合には加算回路２３
ａ、２３ｂの出力を選択し、他の場合にはチャネル相関
回路Ｂの出力を選択するWhen the three groups of prediction circuits shown in FIG. 9 have the same configuration, the decoding device does not need to provide three groups of prediction circuits as shown in FIG. Is fine. Then, based on the prediction circuit selection flag transmitted from the encoding device, the output of the accumulation operation circuits 25a and 25b is selected in the case of DPCM data, and the addition circuits 23a and 23b in the case of PCM data.
And the channel correlation circuit B selects the original signal L,
Restore R. Further, in the case of the group of the original signals L and R by the selectors 27a and 27b, the addition circuit 23
a, 23b are selected, and in other cases, the output of the channel correlation circuit B is selected.

【００２６】また、符号化側により予測符号化された可
変レートビットストリームデータをネットワークを介し
て伝送する場合には、符号化側では図１１に示すように
伝送用にパケット化し（ステップＳ４１）、次いでパケ
ットヘッダを付与し（ステップＳ４２）、次いでこのパ
ケットをネットワーク上に送り出す（ステップＳ４
３）。復号側では図１２に示すようにヘッダを除去し
（ステップＳ５１）、次いでデータを復元し（ステップ
Ｓ５２）、次いでこのデータをメモリに格納して復号を
待つ（ステップＳ５３）。When the variable-rate bit stream data predicted and coded by the coding side is transmitted through a network, the coding side packetizes the data for transmission as shown in FIG. 11 (step S41). Next, a packet header is added (step S42), and then this packet is sent out onto the network (step S4).
3). On the decoding side, the header is removed as shown in FIG. 12 (step S51), the data is restored (step S52), and the data is stored in a memory and decoding is waited (step S53).

【００２７】上記第１の実施の形態は２チャネルの場合
について説明したが、２以上のマルチチャネルの場合の
第２の実施の形態について以下説明する。図１３は、本
発明の第２の実施の形態を示すブロック図である。図１
３は、図１の２チャネル用の構成に対して後方の２チャ
ネルＳL、ＳRを加えた４チャネル用として構成され、よ
って入力側にはチャネル相関回路Ａに加えて、同様な構
成のチャネル相関回路Ａ２が設けられている。また、出
力側にもチャネル相関回路Ｂに加えて、同様な構成のチ
ャネル相関回路Ｂ２が設けられている。また、ロスレス
・エンコーダ２Ｄとロスレス・デコーダ３Ｄはマルチチ
ャネル対応型として構成されている。なお、チャネル相
関回路Ａ、Ａ２、Ｂ、Ｂ２は、それぞれＬとＲ、ＳLと
ＳRを組み合わせの対象としている。なお、ロスレス・
エンコーダ２Ｄとロスレス・デコーダ３Ｄにおける一連
の動作である、差分の算出、予測値の算出、最小予測残
差の選択、最小予測残差を用いた予測値の算出などは、
第１の実施の形態と同様に行われる。The first embodiment has been described for the case of two channels, but the second embodiment for the case of two or more multi-channels will be described below. FIG. 13 is a block diagram showing a second embodiment of the present invention. FIG.
3 is configured for four channels by adding the rear two channels SL and SR to the two-channel configuration of FIG. 1, so that the input side has a channel correlation circuit A of the same configuration in addition to the channel correlation circuit A. A circuit A2 is provided. In addition, a channel correlation circuit B2 having a similar configuration is provided on the output side in addition to the channel correlation circuit B. Further, the lossless encoder 2D and the lossless decoder 3D are configured as multi-channel compatible types. Note that the channel correlation circuits A, A2, B, and B2 target L and R and SL and SR, respectively. In addition, lossless
A series of operations in the encoder 2D and the lossless decoder 3D, such as calculation of a difference, calculation of a prediction value, selection of a minimum prediction residual, calculation of a prediction value using the minimum prediction residual, and the like, include:
This is performed in the same manner as in the first embodiment.

【００２８】次に、第２の実施の形態の変形例としての
第３の実施の形態について、そのブロック図を示す図１
４に沿って説明する。図１４は、図１３の４チャネル用
の構成に対して更にセンタチャネルＣ及び低音効果チャ
ネルＬFEを加えた合計６チャネル用として構成されてい
る。ただし、センタチャネルＣ、後方の２チャネルＳ
L、ＳR、及び低周波音効果チャネルＬFEはＬとＲのよう
に相関をとることなく、直接ロスレス・エンコーダ２Ｄ
に入力され、また直接ロスレス・デコーダ３Ｄから出力
される。FIG. 1 is a block diagram showing a third embodiment as a modification of the second embodiment.
4 will be described. FIG. 14 is configured for a total of six channels in which a center channel C and a bass effect channel LFE are further added to the configuration for four channels in FIG. However, center channel C, rear two channels S
The L, SR, and low-frequency sound effect channels LFE are directly related to the lossless encoder 2D without correlation like L and R.
And directly output from the lossless decoder 3D.

【００２９】次に、第２の実施の形態及び第３の実施の
形態の変形例としての第４の実施の形態について、その
ブロック図を示す図１５に沿って説明する。図１５に示
すチャネル相関回路Ａ−１は加算回路１ａと減算回路１
ｂを有する。加算回路１ａはステレオ２ch信号Ｌ、Ｒの
和信号（Ｌ＋Ｒ）を算出し、この和信号（Ｌ＋Ｒ）を割
り算器５ａにより１／２に割り算してから、ロスレス・
エンコーダ２Ｄに出力し、減算回路１ｂは差信号（Ｌ−
Ｒ）を算出し、この差信号（Ｌ−Ｒ）を割り算器５ｂに
より１／２に割り算してから、ロスレス・エンコーダ２
Ｄに出力する。ロスレス・エンコーダ２Ｄは、１／２
（Ｌ＋Ｒ）と１／２（Ｌ−Ｒ）を用いてこれらを多重化
して多重化信号２５０を作る。多重化信号２５０はロス
レス・デコーダ３Ｄによりデコードされて、元の１／２
（Ｌ＋Ｒ）と１／２（Ｌ−Ｒ）が得られ、これらが、チ
ャネル相関回路Ｂ−１を構成する加算回路４ａと減算回
路４ｂにそれぞれ与えられ、出力信号としてステレオ２
chのＬ信号とＲ信号が得られる。なお、ロスレス・エン
コーダ２Ｄとロスレス・デコーダ３Ｄにおける一連の動
作である、差分の算出、予測値の算出、最小予測残差の
選択、最小予測残差を用いた予測値の算出などは、第１
の実施の形態と同様に行われる。第４の実施の形態から
わかるように、第２、第３の実施の形態におけるチャネ
ル相関回路Ａ、Ａ２はＬ＋Ｒ及びＬ−Ｒを演算するもの
に限らず、１／２（Ｌ＋Ｒ）、１／２（Ｌ−Ｒ）を演算
するものに置き換えることができる。この場合、ロスレ
ス・デコーダ３Ｄ側のチャネル相関回路Ｂ−１では１／
２の演算は不要である。Next, a fourth embodiment as a modification of the second and third embodiments will be described with reference to a block diagram of FIG. The channel correlation circuit A-1 shown in FIG.
b. The adder circuit 1a calculates a sum signal (L + R) of the stereo 2ch signals L and R, divides the sum signal (L + R) by 1/2 by a divider 5a, and
The signal is output to the encoder 2D, and the subtraction circuit 1b outputs the difference signal (L−
R), the difference signal (L−R) is divided by により by the divider 5b, and then the lossless encoder 2
Output to D. Lossless encoder 2D is ２
These are multiplexed using (L + R) and 1/2 (LR) to produce a multiplexed signal 250. The multiplexed signal 250 is decoded by the lossless decoder 3D, and
(L + R) and 1/2 (LR) are obtained, and these are given to the addition circuit 4a and the subtraction circuit 4b constituting the channel correlation circuit B-1, respectively, and the stereo 2 is output as an output signal.
The L and R signals of ch are obtained. Note that a series of operations in the lossless encoder 2D and the lossless decoder 3D, such as calculation of a difference, calculation of a prediction value, selection of a minimum prediction residual, and calculation of a prediction value using the minimum prediction residual are performed in the first step.
This is performed in the same manner as in the embodiment. As can be seen from the fourth embodiment, the channel correlation circuits A and A2 in the second and third embodiments are not limited to those that calculate L + R and L−R, but are １／ (L + R), 1 / 2 (LR) can be replaced with the one that calculates 2 (LR). In this case, in the channel correlation circuit B-1 on the lossless decoder 3D side, 1 /
The operation of 2 is unnecessary.

【００３０】なお、先に図３で説明したフォーマットは
１例であって、本発明における信号処理において記録あ
るいは伝送される信号のフォーマットは、これに限られ
るものでない。マルチチャネルの場合は、図１３に対応
してＬ、Ｒ信号に加えて、後方２チャネルＳL、ＳRも和
信号（ＳL＋ＳR）と差信号（ＳL−ＳR）の形で収納され
る（図１６のａ）。また、同様に図１４に対応してＬ、
Ｒ信号は和信号と差信号の形で収納され、これに加え
て、センターチャネルＣ、後方２チャネルＳL、ＳR、低
周波効果チャネルＬFEは、そのまま、すなわち和信号や
差信号の形をとることなく収納される（図１６のｂ）。The format described above with reference to FIG. 3 is an example, and the format of a signal recorded or transmitted in the signal processing according to the present invention is not limited to this. In the case of a multi-channel, in addition to the L and R signals, the rear two channels SL and SR are also stored in the form of a sum signal (SL + SR) and a difference signal (SL-SR), as shown in FIG. a). Similarly, L, corresponding to FIG.
The R signal is stored in the form of a sum signal and a difference signal. In addition, the center channel C, the rear two channels SL and SR, and the low-frequency effect channel LFE must be in the form of a sum signal or a difference signal. (B in FIG. 16).

【００３１】図１７は、図１６に示すようなマルチチャ
ネルの信号を図４のＡパックのユーザデータのパケット
とするときのフォーマットを示す図である。ビットスト
リームＢＳ０には、和信号（Ｌ＋Ｒ）と差信号（Ｌ−
Ｒ）が収納され、また他のビットストリームＢＳ１に
は、図１６のａに対応する場合は、和信号（ＳL＋ＳR）
と差信号の（ＳL−ＳR）が、一方図１６のｂに対応する
場合は、センターチャネルＣ、後方２チャネルＳL、Ｓ
R、低周波効果チャネルＬFEが、そのまま収納される。FIG. 17 is a diagram showing a format when the multi-channel signal as shown in FIG. 16 is converted into a packet of the A-pack user data in FIG. The bit stream BS0 includes a sum signal (L + R) and a difference signal (L−R).
R) is stored, and in the other bit stream BS1, in the case corresponding to FIG. 16A, the sum signal (SL + SR)
If the difference signal (SL−SR) corresponds to b in FIG. 16, the center channel C and the two rear channels SL and S
The R, low frequency effect channel LFE is stored as it is.

【００３２】図５に示す圧縮ＰＣＭ（ＰＰＣＭ）のオー
ディオ（Ａ）パケットの図３と異なる態様を図１８に示
す。この異なる態様では、圧縮ＰＣＭ（ＰＰＣＭ）のオ
ーディオ（Ａ）パケットにおけるオーディオデータエリ
アは、図１８に示すように複数のＰＰＣＭアクセスユニ
ットにより構成され、ＰＰＣＭアクセスユニットはＰＰ
ＣＭシンク情報とサブパケットにより構成されている。
最初のＰＰＣＭアクセスユニット内のサブパケットは、
ディレクトリと、ビットストリームＢＳ０と、ＣＲＣ
と、ビットストリームＢＳ１と、ＣＲＣとエクストラ情
報により構成され、ビットストリームＢＳ０，ＢＳ１は
ＰＰＣＭブロックのみにより構成されている。２番目以
降のＰＰＣＭアクセスユニット内のサブパケットは、デ
ィレクトリを除いてビットストリームＢＳ０と、ＣＲＣ
と、ビットストリームＢＳ１と、ＣＲＣとエクストラ情
報により構成され、フレーム先頭のビットストリームＢ
Ｓ０及びＢＳ１はリスタートヘッダとＰＰＣＭブロック
により構成されている。フレーム先頭のＰＰＣＭブロッ
クにフレーム先頭サンプル値を配する。FIG. 18 shows an aspect of the compressed PCM (PPCM) audio (A) packet shown in FIG. 5 which is different from FIG. In this different aspect, the audio data area in the audio (A) packet of the compressed PCM (PPCM) is composed of a plurality of PPCM access units as shown in FIG.
It is composed of CM sync information and subpackets.
The subpacket in the first PPCM access unit is:
Directory, bitstream BS0, CRC
, Bit stream BS1, CRC and extra information, and bit streams BS0 and BS1 are composed only of PPCM blocks. Sub-packets in the second and subsequent PPCM access units include a bit stream BS0 except for a directory and a CRC.
, A bit stream BS1, a CRC and extra information, and a bit stream B at the head of the frame.
S0 and BS1 are configured by a restart header and a PPCM block. The frame head sample value is allocated to the PPCM block at the head of the frame.

【００３３】ＰＰＣＭシンク情報（以下、同期情報とも
いう）は次の情報を含む。・１パケット当たりのサンプル数：サンプリング周波数
ｆｓに応じて４０、８０又は１６０が選択される。・データレート：ＶＢＲの場合には「０」（サブパケッ
ト内のデータが圧縮データであることを示す識別子）・サンプリング周波数ｆｓ及び量子化ビット数Ｑｂ・チャネル割り当て情報ここで、リスタートヘッダはフレーム毎にチャネル相関
回路Ａが加算回路と減算回路で構成されることを明記し
た情報を有している。これらのオーディオデータは図１
３と図１４においてデマルチプレクサ２１以下の構成か
らなるロスレス・デコーダ３Ｄ（図８）により元のマル
チチャネルオーディオ信号に復号される。図１８に示し
たフォーマットの可変レートビットストリームデータ
は、図１のチャネル相関回路を用いたか、図１５のチャ
ネル相関回路を用いたかを、例えばＰＰＣＭアクセスユ
ニットのリスタートヘッダに格納した識別子（図示せ
ず）で識別するようにしているので、いずれであっても
デコーダは確実にデコードできる。なお、フレーム毎の
ロスレス圧縮を例に説明したが、固定の長さに限らず区
間は可変の長さであってもよい。The PPCM sync information (hereinafter, also referred to as synchronization information) includes the following information. -Number of samples per packet: 40, 80 or 160 is selected according to the sampling frequency fs. -Data rate: "0" in the case of VBR (identifier indicating that the data in the subpacket is compressed data)-Sampling frequency fs and number of quantization bits Qb-Channel allocation information Here, the restart header is a frame. Each channel has information specifying that the channel correlation circuit A is constituted by an addition circuit and a subtraction circuit. These audio data are shown in FIG.
3 and FIG. 14, the original multi-channel audio signal is decoded by a lossless decoder 3D (FIG. 8) having a configuration of the demultiplexer 21 and below. The variable rate bit stream data having the format shown in FIG. 18 indicates whether the channel correlation circuit shown in FIG. 1 or the channel correlation circuit shown in FIG. 15 is used, for example, an identifier stored in the restart header of the PPCM access unit (shown in FIG. 18). ), The decoder can reliably decode any case. Although lossless compression for each frame has been described as an example, the section is not limited to a fixed length but may be a variable length.

【００３４】[0034]

【発明の効果】以上説明したように本発明によれば、特
に、チャネル相関回路により算出された２つの相関信号
を、チャネル毎に入力される音声信号に応答して先頭サ
ンプル値を得ると共に、時間領域に過去の信号から予測
される現在の信号の複数の予測値の中でその予測残差が
最小となる線形予測方式によりロスレス圧縮するように
したので、音声信号を予測符号化する場合に圧縮率を改
善することができる。As described above, according to the present invention, in particular, the two correlation signals calculated by the channel correlation circuit are obtained by obtaining the first sample value in response to the audio signal input for each channel, Lossless compression is performed by the linear prediction method that minimizes the prediction residual among the multiple prediction values of the current signal predicted from the past signal in the time domain. The compression ratio can be improved.

[Brief description of the drawings]

【図１】本発明に係る音声符号化装置とそれに対応した
音声復号装置の第１の実施形態を示すブロック図であ
る。FIG. 1 is a block diagram showing a first embodiment of a speech encoding apparatus according to the present invention and a speech decoding apparatus corresponding thereto.

【図２】図１のエンコーダを詳しく示すブロック図であ
る。FIG. 2 is a block diagram showing the encoder of FIG. 1 in detail.

【図３】図２のマルチプレクサにより多重化される１フ
レームのフォーマットを示す説明図である。FIG. 3 is an explanatory diagram showing a format of one frame multiplexed by the multiplexer of FIG. 2;

【図４】ＤＶＤのパックのフォーマットを示す説明図で
ある。FIG. 4 is an explanatory diagram showing a format of a DVD pack.

【図５】ＤＶＤのオーディオパックのフォーマットを示
す説明図である。FIG. 5 is an explanatory diagram showing a format of a DVD audio pack.

【図６】図１のデコーダを詳しく示すブロック図であ
る。FIG. 6 is a block diagram illustrating the decoder of FIG. 1 in detail;

【図７】第２の実施形態のエンコーダを示すブロック図
である。FIG. 7 is a block diagram illustrating an encoder according to a second embodiment.

【図８】第２の実施形態のデコーダを示すブロック図で
ある。FIG. 8 is a block diagram illustrating a decoder according to a second embodiment.

【図９】第３の実施形態のエンコーダを示すブロック図
である。FIG. 9 is a block diagram illustrating an encoder according to a third embodiment.

【図１０】第３の実施形態のデコーダを示すブロック図
である。FIG. 10 is a block diagram illustrating a decoder according to a third embodiment.

【図１１】音声伝送方法を示すフローチャートである。FIG. 11 is a flowchart showing a voice transmission method.

【図１２】音声伝送方法を示すフローチャートである。FIG. 12 is a flowchart illustrating a voice transmission method.

【図１３】本発明に係る音声符号化装置とそれに対応し
た音声復号装置の第２の実施形態を示すブロック図であ
る。FIG. 13 is a block diagram showing a second embodiment of the speech encoding apparatus according to the present invention and a speech decoding apparatus corresponding thereto.

【図１４】本発明に係る音声符号化装置とそれに対応し
た音声復号装置の第３の実施形態を示すブロック図であ
る。FIG. 14 is a block diagram showing a third embodiment of the speech encoding apparatus according to the present invention and a speech decoding apparatus corresponding thereto.

【図１５】本発明に係る音声符号化装置とそれに対応し
た音声復号装置の第４の実施形態を示すブロック図であ
る。FIG. 15 is a block diagram showing a fourth embodiment of the speech encoding apparatus according to the present invention and a speech decoding apparatus corresponding thereto.

【図１６】本発明における信号処理において記録あるい
は伝送されるマルチチャネル信号のフォーマットの例を
示す図である。FIG. 16 is a diagram illustrating an example of a format of a multi-channel signal recorded or transmitted in signal processing according to the present invention.

【図１７】マルチチャネルの信号を図４のＡパックのユ
ーザデータのパケットとするときのフォーマットを示す
図である。FIG. 17 is a diagram showing a format when a multi-channel signal is used as a packet of user data of the A-pack in FIG. 4;

【図１８】図５に示す圧縮ＰＣＭ（ＰＰＣＭ）のオーデ
ィオ（Ａ）パケットの図３と異なる態様を示すフォーマ
ット説明図である。18 is an explanatory diagram showing a format of an audio (A) packet of the compressed PCM (PPCM) shown in FIG. 5 which is different from that shown in FIG. 3;

[Explanation of symbols]

１ａ、４ａ加算回路（加算手段）１ｂ、４ｂ減算回路（減算手段）５ａ、５ｂ割り算器１１Ｄ１差分演算回路（第１の差分演算手段）１１Ｄ２差分演算回路（第２の差分演算手段）１２ａ−１〜１２ａ−ｎ予測器（減算器１３ａ−１〜
１３ａ−ｎ、バッファ・選択器１６Ｄ１と共に第１の予
測符号化手段を構成する。）１２ｂ−１〜１２ｂ−ｎ予測器（減算器１３ｂ−１〜
１３ｂ−ｎ、バッファ・選択器１６Ｄ２と共に第２の予
測符号化手段を構成する。）１３ａ−１〜１３ａ−ｎ，１３ｂ−１〜１３ｂ−ｎ減
算器１６Ｄ１，１６Ｄ２，１６Ａ，１６Ｓ，１６Ｌ，１６Ｒ
バッファ・選択器１５Ａ予測回路（バッファ・選択器１６Ａと共に第３
の予測符号化手段を構成する。）１５Ｓ予測回路（バッファ・選択器１６Ｓと共に第４
の予測符号化手段を構成する。）１５Ｌ予測回路（バッファ・選択器１６Ｌと共に第５
の予測符号化手段を構成する。）１５Ｒ予測回路路（バッファ・選択器１６Ｒと共に第
６の予測符号化手段を構成する。）1a, 4a Addition circuit (addition means) 1b, 4b Subtraction circuit (subtraction means) 5a, 5b Divider 11D1 Difference calculation circuit (first difference calculation means) 11D2 Difference calculation circuit (second difference calculation means) 12a-1 -12a-n predictor (subtractors 13a-1
13a-n and the buffer / selector 16D1 constitute a first predictive encoding means. ) 12b-1 to 12b-n predictors (subtractors 13b-1 to 13b-1)
13b-n and the buffer / selector 16D2 constitute a second predictive encoding means. 13a-1 to 13a-n, 13b-1 to 13b-n Subtractors 16D1, 16D2, 16A, 16S, 16L, 16R
Buffer / selector 15A Prediction circuit (third with buffer / selector 16A)
Of predictive encoding means. ) 15S prediction circuit (4th with buffer / selector 16S)
Of predictive encoding means. ) 15L prediction circuit (fifth with buffer / selector 16L)
Of predictive encoding means. 15R prediction circuit (constitutes sixth predictive encoding means with buffer / selector 16R)

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開昭64−44499（ＪＰ，Ａ) 特開平６−133252（ＪＰ，Ａ) 特開平７−143596（ＪＰ，Ａ) 特開平９−139937（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 19/00 - 19/14 G11B 20/10 - 20/12 H03M 7/30 - 7/40 H04B 14/04 ──────────────────────────────────────────────────続き Continuation of front page (56) References JP-A-64-44499 (JP, A) JP-A-6-133252 (JP, A) JP-A-7-143596 (JP, A) JP-A-9-99 139937 (JP, A) (58) Fields investigated (Int. Cl. ⁷ , DB name) G10L 19/00-19/14 G11B 20/10-20/12 H03M 7/30-7/40 H04B 14/04

Claims

(57) [Claims]

An audio signal of at least selected first and second two channels among three or more multi-channel audio signals is matrix-operated to obtain two correlations which are correlated with each other. > a channel correlation means for converting the channel, two correlation tea converted by the channel correlation means
In response to the input audio signal for each channel, the audio signal containing the channel obtains the first sample value and has different characteristics.
Time domain past to present with multiple linear prediction methods
Of each signal is predicted, and
Predictive encoding means for selecting and predictively encoding a linear prediction method that minimizes a prediction residual obtained from the linear prediction value and the audio signal ; header information;
Data structure including the user data including the storage unit
And each selected by the predictive encoding means.
Linear prediction method of channel, prediction residual and predetermined head sample
The prediction encoded data including the
And a formatting means for storing the data in a subpacket arranged in the storage unit.