JP2001195092A

JP2001195092A - Voice coding device

Info

Publication number: JP2001195092A
Application number: JP2000321499A
Authority: JP
Inventors: Yoshiaki Tanaka; 美昭田中; Shoji Ueno; 昭治植野; Norihiko Fuchigami; 徳彦渕上
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 2000-10-20
Filing date: 2000-10-20
Publication date: 2001-07-19
Anticipated expiration: 2018-10-13
Also published as: JP3387086B2

Abstract

PROBLEM TO BE SOLVED: To improve a compression rate when voice signals in a plurality of channels are subjected to predictive coding using the fact that the correlation among the upper bit data of each channel of voice signals of the plurality of channels is strong and no correlation exists among the lower bit data. SOLUTION: An adding circuit 1a computes the sum signal (L+R) of stereophonic 2ch signals L and R and outputs the signal to a single ch lossless/encoder 2D1 of a sum ch. A subtracting circuit 1b computes the difference signal (L-R) and outputs the signals to a single ch lossless/encoder 2D2 of a difference ch. The encoders 2D1 and 2D2 conduct predictive coding of the signals (L+R) of upper 16 bits, difference Δ(L+R) of the difference signals (L-R) and the Δ(L-R) among 24 bits, and lower 8 bits are transferred as they are through a recording medium or a communication medium.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、音声信号を予測符
号化して圧縮するための音声符号化装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech coding apparatus for predictively coding and compressing a speech signal.

【０００２】[0002]

【従来の技術】音声信号を高能率符号化する方法として
は、例えば特開昭５８−７５３４１号公報に示されるよ
うにハフマン符号が知られている。また、音声信号を予
測符号化する方法として、本発明者は先の出願（特願平
９−２８９１５９号）において１チャネルの原デジタル
音声信号に対して、特性が異なる複数の予測器により時
間領域における過去の信号から現在の信号の複数の線形
予測値を算出し、原デジタル音声信号とこの複数の線形
予測値から予測器毎の予測残差を算出し、この複数の予
測残差の最小値を選択する方法を提案している。2. Description of the Related Art As a method for encoding a speech signal with high efficiency, for example, a Huffman code is known as disclosed in Japanese Patent Application Laid-Open No. 58-75341. As a method for predictive coding of an audio signal, the present inventor has proposed in a prior application (Japanese Patent Application No. 9-289159) that a plurality of predictors having different characteristics are used for a one-channel original digital audio signal in a time domain. Calculating a plurality of linear prediction values of the current signal from the past signal in, calculating a prediction residual for each predictor from the original digital audio signal and the plurality of linear prediction values, and calculating a minimum value of the plurality of prediction residuals Suggests how to choose.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、上記方
法では原デジタル音声信号がサンプリング周波数＝９６
ｋＨｚ、量子化ビット数＝２０ビット程度の場合に、あ
る程度の圧縮効果を得ることができるが、近年のＤＶＤ
オーディオディスクではこの２倍のサンプリング周波数
（＝１９２ｋＨｚ）が使用され、また、量子化ビット数
も２４ビットが使用される傾向があるので、圧縮率を改
善する必要がある。However, in the above method, the original digital audio signal has a sampling frequency = 96.
In the case of kHz and the number of quantization bits = about 20 bits, a certain compression effect can be obtained.
Audio discs use twice this sampling frequency (= 192 kHz), and the number of quantization bits tends to use 24 bits. Therefore, it is necessary to improve the compression ratio.

【０００４】ところで、ＣＤ再生装置のような簡易機種
と、ＤＶＤ再生装置のような上位機種の両方の再生装置
が再生可能にするために、例えば特開平９−３１２０６
６号公報に示されるように上位機種のみが再生可能な２
０ビット（又は２４ビット）のサンプルデータを、簡易
機種が再生可能な上位１６ビットと下位４ビット（又は
８ビット）に分離して伝送する方法が提案されている。Incidentally, in order to enable reproduction by both a simple model such as a CD reproducing apparatus and a higher model such as a DVD reproducing apparatus, for example, Japanese Patent Laid-Open No. 9-31206 is disclosed.
No. 6, as shown in Japanese Patent Publication No. 6
A method has been proposed in which sample data of 0 bits (or 24 bits) is separated into upper 16 bits and lower 4 bits (or 8 bits) which can be reproduced by a simple model and transmitted.

【０００５】そこで本発明は、複数チャネル（チャンネ
ル）の音声信号の各チャネルの上位ビットデータ間には
相関が強く、下位ビットデータ間には相関がないことに
鑑みて、複数チャネルの音声信号を予測符号化する場合
に圧縮率を改善することができる音声符号化装置を提供
することを目的とする。Accordingly, the present invention considers the fact that there is a strong correlation between the upper bit data of each channel of the audio signal of a plurality of channels (channels) and there is no correlation between the lower bit data, so that the audio signal of the plurality of channels (channels) is It is an object of the present invention to provide a speech encoding device capable of improving a compression ratio when performing predictive encoding.

【０００６】[0006]

【課題を解決するための手段】本発明は上記目的を達成
するために、以下の１）及び２）記載の手段よりなる。
すなわち、In order to achieve the above object, the present invention comprises the following means (1) and (2).
That is,

【０００７】１）同一のサンプリング周波数であると共
に２つの系統からなる第1の複数チャネルのデジタル音
声信号を所定のマトリクス演算により互いに相関性のあ
る第2の複数チャネルの音声信号に変換する相関手段
と、前記第2の複数チャネルの音声信号をチャネル毎に
必要に応じて上位ビットと下位ビットデータに分離する
分離手段と、前記分離手段により分離された場合には上
位ビットデータを各チャネル毎に、入力されるデータに
応答して、先頭サンプル値を得ると共に、時間領域の過
去の信号から現在の信号の複数の予測値の中でその予測
残差が最小となる線形予測方法を選択する予測符号化手
段と、前記予測符号化手段により選択された各チャネル
の先頭サンプル値と予測残差と線形予測方法を含む予測
化データと、前記分離手段により分離された場合は下位
ビットデータと分離フラグとを所定のフォーマットで多
重化する手段とを、有する音声符号化装置。２）同一のサンプリング周波数であると共に２つの系統
からなる第1の複数チャネルのデジタル音声信号を所定
のマトリクス演算により互いに相関性のある第2の複数
チャネルの音声信号に変換する相関手段と、前記変換さ
れた第2の複数チャネルの音声信号を、チャネル毎に入
力される音声信号に応答して先頭サンプル値を得ると共
に、時間領域の過去の信号から現在の信号の複数の予測
値の中でその予測残差が最小となる線形予測方法を選択
して予測符号化する予測符号化手段と、前記予測符号化
手段により選択された各チャネルの先頭サンプル値と予
測残差と線形予測方法を含む予測化データを所定のフォ
ーマットで多重化する手段とを、有する音声符号化装
置。1) Correlation means for converting digital audio signals of a first plurality of channels having the same sampling frequency and composed of two systems into audio signals of a plurality of second channels mutually correlated by a predetermined matrix operation. And separating means for separating the audio signals of the second plurality of channels into higher-order bits and lower-order bit data as needed for each channel, and, when separated by the separating means, separates the higher-order bit data for each channel. Prediction in response to input data, obtaining a first sample value, and selecting a linear prediction method that minimizes a prediction residual among a plurality of prediction values of a current signal from a past signal in a time domain. Encoding means; first sample value of each channel selected by the predictive encoding means; prediction residual; and prediction data including a linear prediction method; Speech encoding apparatus and a means for multiplexing the low-order bit data and the separation flag in a predetermined format, when it is separated by. 2) correlating means for converting digital audio signals of a first plurality of channels having the same sampling frequency and composed of two systems into audio signals of a plurality of channels correlated with each other by a predetermined matrix operation; The second plurality of channels of the converted audio signal, in response to the audio signal input for each channel, to obtain the first sample value, from a plurality of predicted values of the current signal from the past signal in the time domain A predictive encoding unit that selects and predictively codes a linear prediction method that minimizes the prediction residual; and includes a head sample value, a prediction residual, and a linear prediction method of each channel selected by the predictive encoding unit. Means for multiplexing the predicted data in a predetermined format.

【０００８】[0008]

【発明の実施の形態】以下、図面を参照して本発明の実
施の形態を説明する。図１は本発明に係る音声符号化装
置及びそれに対応する音声復号装置の第１の実施形態を
示すブロック図、図２は図１のエンコーダを詳しく示す
ブロック図、図３はＤＶＤのパックのフォーマットを示
す説明図、図４はＤＶＤのオーディオパックのフォーマ
ットを示す説明図、図５は図１のデコーダを詳しく示す
ブロック図である。Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing a first embodiment of an audio encoding device and a corresponding audio decoding device according to the present invention, FIG. 2 is a block diagram showing the encoder of FIG. 1 in detail, and FIG. 3 is a DVD pack format. FIG. 4 is an explanatory diagram showing the format of a DVD audio pack, and FIG. 5 is a block diagram showing the decoder of FIG. 1 in detail.

【０００９】図１に示すチャネル相関回路Ａは加算回路
１ａと減算回路１ｂを有する。加算回路１ａは各チャネ
ル（以下、ch）が例えばサンプリング周波数＝１９２ｋ
Ｈｚ、量子化ビット数＝２４ビットのステレオ２ch信号
Ｌ、Ｒの和信号（Ｌ＋Ｒ）を算出して和ch用１chロスレ
ス・エンコーダ２Ｄ１に出力し、減算回路１ｂは差信号
（Ｌ−Ｒ）を算出して差ch用１chロスレス・エンコーダ
２Ｄ２に出力する。エンコーダ２Ｄ１、２Ｄ２は図２に
詳しく示すように、それぞれ２４ビットの内、上位１６
ビットの和信号（Ｌ＋Ｒ）、差信号（Ｌ−Ｒ）の差分Δ
（Ｌ＋Ｒ）、Δ（Ｌ−Ｒ）を予測符号化するとともに、
下位８ビットをそのまま記録媒体や通信媒体を介して伝
送する。The channel correlation circuit A shown in FIG. 1 has an addition circuit 1a and a subtraction circuit 1b. In the addition circuit 1a, each channel (hereinafter, ch) has a sampling frequency of 192 k
Calculates the sum signal (L + R) of the stereo 2ch signals L and R with the frequency and the number of quantization bits = 24 bits and outputs the sum signal to the 1ch lossless encoder 2D1 for the sum channel, and the subtraction circuit 1b calculates the difference signal (LR). The calculated value is output to the 1ch lossless encoder 2D2 for the difference channel. As shown in detail in FIG. 2, the encoders 2D1 and 2D2 each have the upper 16 bits of 24 bits.
Difference Δ between bit sum signal (L + R) and difference signal (L−R)
(L + R) and Δ (LR) are predictively coded,
The lower 8 bits are directly transmitted via a recording medium or a communication medium.

【００１０】なお、原データが２０ビットの場合には上
位ビット数を固定にし、下位ビット数を可変にして上位
１６ビットと下位４ビットに分離する。また、原データ
が１６ビットの場合には分離することなく１６ビットデ
ータを予測符号化する。When the original data is 20 bits, the number of upper bits is fixed, and the number of lower bits is made variable to separate the upper 16 bits and lower 4 bits. When the original data is 16 bits, the 16-bit data is predictively encoded without separation.

【００１１】そして、復号側では、図６に詳しく示すよ
うにデコーダ３Ｄ１、３Ｄ２がそれぞれ各chの上位１６
ビット分の予測符号化データを和信号（Ｌ＋Ｒ）、差信
号（Ｌ−Ｒ）に復号し、この１６ビットデータに対して
下位８ビットを付加して元の２４ビットデータに復元す
る。次いでチャネル相関回路Ｂがこの２４ビットの和信
号（Ｌ＋Ｒ）、差信号（Ｌ−Ｒ）をステレオ２ch信号
Ｌ、Ｒに復元する。On the decoding side, as shown in detail in FIG. 6, the decoders 3D1 and 3D2 respectively include the upper 16 bits of each channel.
The bits of the prediction coded data are decoded into a sum signal (L + R) and a difference signal (LR), and the lower 8 bits are added to the 16-bit data to restore the original 24-bit data. Next, the channel correlation circuit B restores the 24-bit sum signal (L + R) and difference signal (LR) to stereo 2-ch signals L and R.

【００１２】図２を参照してエンコーダ２Ｄ１、２Ｄ２
について詳しく説明する。和信号（Ｌ＋Ｒ）と差信号
（Ｌ−Ｒ）は１フレーム毎に１フレームバッファ１０に
格納される。そして、１フレームの各上位１６ビットの
サンプル値（Ｌ＋Ｒ）、（Ｌ−Ｒ）がそれぞれ差分演算
回路１１Ｄ１、１１Ｄ２に印加され、今回と前回の差分
Δ（Ｌ＋Ｒ）、Δ（Ｌ−Ｒ）、すなわち差分ＰＣＭ（Ｄ
ＰＣＭ）データが算出される。また、各フレームの先頭
サンプルの２４ビットデータ（Ｌ＋Ｒ）、（Ｌ−Ｒ）
と、各サンプルの下位８ビットデータ（Ｌ＋Ｒ）、（Ｌ
−Ｒ）がマルチプレクサ１９に印加される。Referring to FIG. 2, encoders 2D1, 2D2
Will be described in detail. The sum signal (L + R) and the difference signal (LR) are stored in one frame buffer 10 for each frame. Then, the sample values (L + R) and (LR) of the upper 16 bits of one frame are applied to the difference calculation circuits 11D1 and 11D2, respectively, and the difference Δ (L + R), Δ (LR) between the current and previous times is calculated. That is, the difference PCM (D
PCM) data is calculated. Also, 24-bit data (L + R), (LR) of the first sample of each frame.
And the lower 8-bit data (L + R) of each sample, (L
−R) is applied to the multiplexer 19.

【００１３】差分演算回路１１Ｄ１により算出された差
分Δ（Ｌ＋Ｒ）は、予測係数が異なる複数の予測器１２
ａ−１〜１２ａ−ｎと減算器１３ａ−１〜１３ａ−ｎに
印加される。そして、予測器１２ａ−１〜１２ａ−ｎで
はそれぞれ各予測係数に基づいて差分Δ（Ｌ＋Ｒ）の各
予測値が算出され、減算器１３ａ−１〜１３ｂ−ｎでは
それぞれこの各予測値と差分Δ（Ｌ＋Ｒ）の各予測残差
が算出される。バッファ・選択器１６Ｄ１はこの複数の
予測残差を一時記憶して、選択信号生成器１７により指
定されたサブフレーム毎に最小の予測残差を選択し、パ
ッキング回路１８に出力する。なお、このサブフレーム
はフレームの数十分の１程度のサンプル長であり、一例
として１フレームを８０サブフレームとする。ここで、
予測器１２ａ−１〜１２ａ−ｎと減算器１３ａ−１〜１
３ａ−ｎは和信号chの予測回路１５Ｄ１を構成し、ま
た、この予測回路１５Ｄ１とバッファ・選択器１６Ｄ１
は和信号chの予測符号化回路を構成している。The difference Δ (L + R) calculated by the difference calculation circuit 11D1 is calculated by a plurality of predictors 12 having different prediction coefficients.
a-1 to 12a-n and subtracters 13a-1 to 13a-n. Then, the predictors 12a-1 to 12a-n calculate the respective predicted values of the difference Δ (L + R) based on the respective prediction coefficients, and the subtractors 13a-1 to 13b-n calculate the respective predicted values and the difference Δ Each prediction residual of (L + R) is calculated. The buffer / selector 16D1 temporarily stores the plurality of prediction residuals, selects the minimum prediction residual for each subframe specified by the selection signal generator 17, and outputs the selected prediction residual to the packing circuit 18. Note that this subframe has a sample length of about one-tenth of a frame, and one frame is, for example, 80 subframes. here,
Predictors 12a-1 to 12a-n and subtracters 13a-1 to 13a-1
3a-n constitute a prediction circuit 15D1 for the sum signal ch, and the prediction circuit 15D1 and the buffer / selector 16D1
Constitutes a predictive encoding circuit for the sum signal ch.

【００１４】同様に、差分演算回路１１Ｄ２により算出
された差分Δ（Ｌ−Ｒ）は、予測係数が異なる複数の予
測器１２ｂ−１〜１２ｂ−ｎと減算器１３ｂ−１〜１３
ｂ−ｎに印加される。そして、予測器１２ｂ−１〜１２
ｂ−ｎではそれぞれ各予測係数に基づいて差分Δ（Ｌ−
Ｒ）の各予測値が算出され、減算器１３ｂ−１〜１３ｂ
−ｎではそれぞれこの各予測値と差分Δ（Ｌ−Ｒ）の各
予測残差が算出される。バッファ・選択器１６Ｄ２はこ
の複数の予測残差を一時記憶して、選択信号生成器１７
により指定されたサブフレーム毎に最小の予測残差を選
択し、パッキング回路１８に出力する。予測器１２ｂ−
１〜１２ｂ−ｎと減算器１３ｂ−１〜１３ｂ−ｎは差信
号chの予測回路１５Ｄ２を構成し、また、この予測回路
１５Ｄ２とバッファ・選択器１６Ｄ２は差信号chの予測
符号化回路を構成している。Similarly, the difference Δ (LR) calculated by the difference calculation circuit 11D2 is calculated by a plurality of predictors 12b-1 to 12b-n and subtracters 13b-1 to 13b-13 having different prediction coefficients.
b-n. Then, the predictors 12b-1 to 12b-12
bn, the difference Δ (L−
R) are calculated, and the subtractors 13b-1 to 13b
In −n, each prediction residual of the difference Δ (LR) from each of the prediction values is calculated. The buffer / selector 16D2 temporarily stores the plurality of prediction residuals, and
The minimum prediction residual is selected for each sub-frame specified by, and is output to the packing circuit 18. Predictor 12b-
1 to 12b-n and the subtractors 13b-1 to 13b-n form a prediction circuit 15D2 for the difference signal ch, and the prediction circuit 15D2 and the buffer / selector 16D2 form a prediction encoding circuit for the difference signal ch. are doing.

【００１５】選択信号生成器１７は予測残差のビット数
フラグをパッキング回路１８とマルチプレクサ１９に対
して印加し、また、予測残差が最小の予測器を示す予測
器選択フラグをマルチプレクサ１９に対して印加する。
パッキング回路１８はバッファ・選択器１６Ｄ１、１６
Ｄ２により選択された２ch分の予測残差を、選択信号生
成器１７により指定されたビット数フラグに基づいて指
定ビット数でパッキングする。The selection signal generator 17 applies a bit number flag of the prediction residual to the packing circuit 18 and the multiplexer 19, and outputs a predictor selection flag indicating the predictor having the minimum prediction residual to the multiplexer 19. To apply.
The packing circuit 18 includes buffer / selectors 16D1, 16D.
The prediction residual for 2 ch selected by D2 is packed with the specified number of bits based on the bit number flag specified by the selection signal generator 17.

【００１６】続くマルチプレクサ１９は１フレーム分の
上位１６ビットデータに対して・フレームヘッダと、・元の２４ビットデータを上位１６ビットと下位８ビッ
トに分離したか否かを示す分離フラグ１２０と、・和信号ｃｈ（Ｌ＋Ｒ）の１フレームの先頭サンプル値
と、・差信号ｃｈ（Ｌ−Ｒ）の１フレームの先頭サンプル値
と、・和信号ｃｈ（Ｌ＋Ｒ）のサブフレーム毎の予測器選択
フラグと、・差信号ｃｈ（Ｌ−Ｒ）のサブフレーム毎の予測器選択
フラグと、・和信号ｃｈ（Ｌ＋Ｒ）のサブフレーム毎のビット数フ
ラグと、・差信号ｃｈ（Ｌ−Ｒ）のサブフレーム毎のビット数フ
ラグと、・和信号ｃｈ（Ｌ＋Ｒ）の予測残差データ列（可変ビッ
ト数）と、・差信号ｃｈ（Ｌ−Ｒ）の予測残差データ列（可変ビッ
ト数）とを、多重化し、可変レートビットストリームと
して出力する。The following multiplexer 19 provides a frame header for the upper 16-bit data of one frame; a separation flag 120 indicating whether or not the original 24-bit data has been separated into the upper 16 bits and the lower 8 bits; A head sample value of one frame of the sum signal ch (L + R); a head sample value of one frame of the difference signal ch (LR); and a predictor selection flag for each subframe of the sum signal ch (L + R). A predictor selection flag for each sub-frame of the difference signal ch (LR); a bit number flag for each sub-frame of the sum signal ch (L + R); A bit number flag for each frame; a prediction residual data sequence (variable number of bits) of the sum signal ch (L + R); and a prediction residual data sequence (variable number of bits) of the difference signal ch (LR). , Many However, output as a variable rate bit stream.

【００１７】また、予測符号化されていない下位８ビッ
トについては、別のビットストリームとして出力する。
このような予測符号化によれば、原信号が例えばサンプ
リング周波数＝１９２ｋＨｚ、量子化ビット数＝２４ビ
ット、２チャネルの場合、平均で５７％の圧縮率を実現
することができる。The lower 8 bits that have not been predictively coded are output as another bit stream.
According to such predictive coding, when the original signal has, for example, a sampling frequency of 192 kHz, a quantization bit number of 24 bits, and two channels, a compression rate of 57% on average can be realized.

【００１８】また、この可変レートビットストリームデ
ータをＤＶＤオーディオディスクに記録する場合には、
図３に示す圧縮ＰＣＭのオーディオ（Ａ）パックにパッ
キングされる。このパックは２０３４バイトのユーザデ
ータ（Ａパケット、Ｖパケット）に対して４バイトのパ
ックスタート情報と、６バイトのＳＣＲ（System Clock
Reference：システム時刻基準参照値）情報と、３バイ
トのMux レート（rate）情報と１バイトのスタッフィン
グの合計１４バイトのパックヘッダが付加されて構成さ
れている（１パック＝合計２０４８バイト）。この場
合、タイムスタンプであるＳＣＲ情報を、ＡＣＢユニッ
ト内の先頭パックでは「１」として同一タイトル内で連
続とすることにより同一タイトル内のＡパックの時間を
管理することができる。When recording the variable rate bit stream data on a DVD audio disc,
It is packed in the audio (A) pack of the compressed PCM shown in FIG. This pack has 4 bytes of pack start information for 2034 bytes of user data (A packet and V packet) and 6 bytes of SCR (System Clock).
Reference: system time reference value, 3-byte Mux rate (rate) information, and 1-byte stuffing for a total of 14-byte pack header (1 pack = 2048 bytes in total). In this case, the time of the A pack in the same title can be managed by setting the SCR information as the time stamp to be “1” in the first pack in the ACB unit so as to be continuous in the same title.

【００１９】Ａパケットは図４に詳しく示すように、１
７、９又は１４バイトのパケットヘッダと、圧縮ＰＣＭ
のプライベートヘッダと、上記のフォーマットの１ない
し２０１１バイトのオーディオ圧縮ＰＣＭデータにより
構成され、上位１６ビットデータと下位８ビットデータ
は別のＡパケットに収容されている。圧縮ＰＣＭのプラ
イベートヘッダは、・１バイトのサブストリームＩＤと、・２バイトのＵＰＣ／ＥＡＮ−ＩＳＲＣ（Universal Pr
oduct Code/European Article Number-International S
tandard Recording Code）番号、及びＵＰＣ／ＥＡＮ−
ＩＳＲＣデータと、・１バイトのプライベートヘッダ長と、・２バイトの第１アクセスユニットポインタと、・８バイトのオーディオデータ情報（ＡＤＩ）と・０〜７バイトのスタッフィングバイトとに、より構成
されている。As shown in detail in FIG.
7, 9 or 14 byte packet header and compressed PCM
, And 1 to 2011 bytes of audio-compressed PCM data in the above format, and upper 16-bit data and lower 8-bit data are contained in another A packet. The private header of the compressed PCM is: 1-byte substream ID, 2 bytes of UPC / EAN-ISRC (Universal Prism).
oduct Code / European Article Number-International S
tandard Recording Code) number and UPC / EAN-
ISRC data, 1-byte private header length, 2 bytes of first access unit pointer, 8 bytes of audio data information (ADI), and 0 to 7 bytes of stuffing bytes. I have.

【００２０】次に図５を参照してデコーダＤ１、Ｄ２に
ついて説明する。前述したフォーマットの可変レートビ
ットストリームデータは、デマルチプレクサ２１により
フレームヘッダに基づいて分離される。そして、和信号
ｃｈ（Ｌ＋Ｒ）及び差信号ｃｈ（Ｌ−Ｒ）の１フレーム
の先頭サンプル値はそれぞれ累積演算回路２５ａ、２５
ｂに印加され、和信号ｃｈ（Ｌ＋Ｒ）及び差信号ｃｈ
（Ｌ−Ｒ）の予測器選択フラグはそれぞれ予測器（２４
ａ−１〜２４ａ−ｎ）、（２４ｂ−１〜２４ｂ−ｎ）の
各選択信号として印加され、和信号ｃｈ（Ｌ＋Ｒ）及び
差信号ｃｈ（Ｌ−Ｒ）のビット数フラグと予測残差デー
タ列はアンパッキング回路２２に印加される。更に可変
レートビットストリームデータが下位ビットデータを含
む場合には、この下位ビットデータが分離されて加算器
２８ａ、２８ｂに印加され、また、分離フラグは制御信
号として加算器２８ａ、２８ｂに印加される。Next, the decoders D1 and D2 will be described with reference to FIG. The variable-rate bit stream data in the format described above is separated by the demultiplexer 21 based on the frame header. Then, the leading sample values of one frame of the sum signal ch (L + R) and the difference signal ch (LR) are calculated by the accumulator circuits 25a and 25, respectively.
b, the sum signal ch (L + R) and the difference signal ch
The predictor selection flags of (LR) are the predictors (24
a-1 to 24a-n) and (24b-1 to 24b-n) are applied as selection signals, and the bit number flags and the prediction residual data of the sum signal ch (L + R) and the difference signal ch (LR). The columns are applied to an unpacking circuit 22. Further, when the variable-rate bit stream data includes lower-order bit data, the lower-order bit data is separated and applied to the adders 28a and 28b, and the separation flag is applied as a control signal to the adders 28a and 28b. .

【００２１】ここで、予測器（２４ａ−１〜２４ａ−
ｎ）、（２４ｂ−１〜２４ｂ−ｎ）はそれぞれ、符号化
側の予測器（１２ａ−１〜１２ａ−ｎ）、（１２ｂ−１
〜１２ｂ−ｎ）と同一の特性であり、予測器選択フラグ
により同一特性のものが選択される。Here, the predictors (24a-1 to 24a-
n) and (24b-1 to 24b-n) are predictors (12a-1 to 12a-n) and (12b-1) on the encoding side, respectively.
To 12b-n), and those having the same characteristics are selected by the predictor selection flag.

【００２２】アンパッキング回路２２は和信号ｃｈ（Ｌ
＋Ｒ）及び差信号ｃｈ（Ｌ−Ｒ）の予測残差データ列を
ビット数フラグ毎に基づいて分離してそれぞれ加算回路
２３ａ、２３ｂに出力する。加算回路２３ａ、２３ｂで
はそれぞれ、アンパッキング回路２２からの和信号ｃｈ
（Ｌ＋Ｒ）及び差信号ｃｈ（Ｌ−Ｒ）の今回の予測残差
データと、予測器（２４ａ−１〜２４ａ−ｎ）、（２４
ｂ−１〜２４ｂ−ｎ）の内、予測器選択フラグにより選
択された各１つにより予測された前回の予測値が加算さ
れて今回の予測値が算出される。この今回の予測値は、
図２に示す差分演算回路１１Ｄ１、１１Ｄ２によりそれ
ぞれ算出された差分Δ（Ｌ＋Ｒ）、Δ（Ｌ−Ｒ）すなわ
ちＤＰＣＭデータであり、予測器（２４ａ−１〜２４ａ
−ｎ）、（２４ｂ−１〜２４ｂ−ｎ）と累積演算回路２
５ａ、２５ｂに印加される。The unpacking circuit 22 outputs the sum signal ch (L
+ R) and the prediction residual data sequence of the difference signal ch (LR) are separated based on the bit number flags and output to the adders 23a and 23b, respectively. The adder circuits 23a and 23b respectively add the sum signal ch from the unpacking circuit 22.
(L + R) and the current prediction residual data of the difference signal ch (LR), and the predictors (24a-1 to 24a-n), (24
b-1 to 24b-n), the previous predicted value predicted by each one selected by the predictor selection flag is added to calculate the current predicted value. This forecast is
The differences Δ (L + R) and Δ (LR) calculated by the difference calculation circuits 11D1 and 11D2 shown in FIG. 2, that is, DPCM data, are the predictors (24a-1 to 24a).
-N), (24b-1 to 24b-n) and the cumulative operation circuit 2
5a and 25b.

【００２３】累積演算回路２５ａ、２５ｂはそれぞれ、
１フレームの先頭サンプル値に対して差分Δ（Ｌ＋
Ｒ）、Δ（Ｌ−Ｒ）をサンプル毎に累積加算して和信号
ｃｈ（Ｌ＋Ｒ）、差信号ｃｈ（Ｌ−Ｒ）の各ＰＣＭデー
タ（上位１６ビットデータ）をそれぞれ加算器２８ａ、
２８ｂに出力する。加算器２８ａ、２８ｂは分離フラグ
が「分離有り」の場合にはこの和信号ｃｈ（Ｌ＋Ｒ）、
差信号ｃｈ（Ｌ−Ｒ）の各上位１６ビットＰＣＭデータ
と、デマルチプレクサ２１により分離された下位ビット
ＰＣＭデータを加算して出力し、他方、分離フラグが
「分離無し」の場合には上位１６ビットＰＣＭデータを
そのまま出力する。The accumulator circuits 25a and 25b are respectively
The difference Δ (L +
R) and Δ (LR) are cumulatively added for each sample, and the PCM data (upper 16-bit data) of the sum signal ch (L + R) and the difference signal ch (LR) are respectively added to the adder 28a.
28b. The adders 28a and 28b output the sum signal ch (L + R) when the separation flag is "separated".
The upper 16-bit PCM data of each of the difference signals ch (LR) and the lower-bit PCM data separated by the demultiplexer 21 are added and output. On the other hand, when the separation flag is "no separation", the upper 16 bits are output. The bit PCM data is output as it is.

【００２４】この和信号（Ｌ＋Ｒ）、差信号（Ｌ−Ｒ）
は図１に示すように加算回路４ａにより２Ｌ信号が算出
されるとともに、減算回路４ｂにより２Ｒ信号が算出さ
れる。そして、２Ｌ信号と２Ｒ信号がそれぞれ割り算器
５ａ、５ｂにより１／２に割り算され、元のステレオ２
チャネル信号Ｌ、Ｒが復元される。The sum signal (L + R) and the difference signal (LR)
As shown in FIG. 1, the 2L signal is calculated by the addition circuit 4a and the 2R signal is calculated by the subtraction circuit 4b. Then, the 2L signal and the 2R signal are divided by 1/2 by the dividers 5a and 5b, respectively, and the original stereo 2
The channel signals L and R are restored.

【００２５】次に図６、図７を参照して第２の実施形態
について説明する。上記の実施形態では、和信号（Ｌ＋
Ｒ）、差信号（Ｌ−Ｒ）の各差分Δ（Ｌ＋Ｒ）、Δ（Ｌ
−Ｒ）、すなわちＤＰＣＭデータのみを予測符号化する
ように構成されているが、この第２の実施形態では和信
号（Ｌ＋Ｒ）、差信号（Ｌ−Ｒ）すなわちＰＣＭデー
タ、又はその各差分Δ（Ｌ＋Ｒ）、Δ（Ｌ−Ｒ）すなわ
ちＤＰＣＭデータを選択的に予測符号化するように構成
されている。Next, a second embodiment will be described with reference to FIGS. In the above embodiment, the sum signal (L +
R), each difference Δ (L + R), Δ (L) of the difference signal (LR)
-R), that is, predictive encoding is performed only on the DPCM data. In the second embodiment, the sum signal (L + R), the difference signal (LR), ie, the PCM data, or each difference Δ (L + R), Δ (LR), that is, DPCM data is selectively and predictively encoded.

【００２６】このため図６に示す符号化装置では、図２
に示す構成に対して和信号（Ｌ＋Ｒ）、差信号（Ｌ−
Ｒ）をそれぞれ予測符号化するための予測回路１５Ａ、
１５Ｓとバッファ・選択器１６Ａ、１６Ｓが追加されて
いる。また、選択信号生成器１７はバッファ・選択器１
６Ａ、１６Ｓによりそれぞれ選択された和信号（Ｌ＋
Ｒ）、差信号（Ｌ−Ｒ）と、バッファ・選択器１６Ｄ
１、１６Ｄ２によりそれぞれ選択された差分Δ（Ｌ＋
Ｒ）、Δ（Ｌ−Ｒ）の各予測残差の最小値に基づいて、
ＰＣＭデータとＤＰＣＭデータのどちらが圧縮率が高い
か否かを判断し、高い方のデータを選択する。このと
き、そのＰＣＭ／ＤＰＣＭの選択フラグ（予測回路選択
フラグ）を追加して多重化する。For this reason, the encoding apparatus shown in FIG.
The sum signal (L + R) and the difference signal (L−
R) for predictive encoding, respectively.
15S and buffer / selectors 16A and 16S are added. The selection signal generator 17 is a buffer / selector 1
6A, the sum signal (L +
R), the difference signal (LR) and the buffer / selector 16D
1, ΔD (L +
R), Δ (LR) based on the minimum value of each prediction residual.
It is determined whether the compression ratio of PCM data or DPCM data is higher, and the higher data is selected. At this time, the PCM / DPCM selection flag (prediction circuit selection flag) is added and multiplexed.

【００２７】ここで、図６に示す和信号（Ｌ＋Ｒ）の予
測回路１５Ａと差分Δ（Ｌ＋Ｒ）の予測回路１５Ｄ１が
同一の構成であり、また、差信号（Ｌ−Ｒ）の予測回路
１５Ｓと差分Δ（Ｌ−Ｒ）の予測回路１５Ｄ２が同一の
構成である場合、復号装置では図７に示すようにＰＣＭ
データとＤＰＣＭデータの両方の予測回路を設ける必要
はなく、１つのデータ分の予測回路でよい。そして、符
号化装置から伝送された予測回路選択フラグに基づいて
セレクタ２６ａ、２６ｂにより、ＤＰＣＭデータの場合
には累積演算回路２５ａ、２５ｂの出力を選択し、ＰＣ
Ｍデータの場合には加算回路２３ａ、２３ｂの出力を選
択する。そして、セレクタ２６ａ、２６ｂによりそれぞ
れ選択された各チャネルの上位１６ビットデータと下位
ビットデータが加算器２８ａ、２８ｂにより加算され
る。Here, the prediction circuit 15A for the sum signal (L + R) and the prediction circuit 15D1 for the difference Δ (L + R) shown in FIG. 6 have the same configuration, and the prediction circuit 15S for the difference signal (L−R) has the same configuration. If the difference Δ (LR) prediction circuits 15D2 have the same configuration, the decoding apparatus uses the PCM as shown in FIG.
It is not necessary to provide a prediction circuit for both data and DPCM data, and a prediction circuit for one data may be used. Then, based on the prediction circuit selection flag transmitted from the encoding device, the selectors 26a and 26b select the outputs of the accumulator circuits 25a and 25b in the case of DPCM data.
In the case of M data, the outputs of the adders 23a and 23b are selected. Then, the upper 16-bit data and lower bit data of each channel selected by the selectors 26a and 26b are added by the adders 28a and 28b.

【００２８】第３の実施形態では図８に示すように、原
信号Ｌ、Ｒ（ＰＣＭデータ）と、和信号（Ｌ＋Ｒ）、差
信号（Ｌ−Ｒ）（ＰＣＭデータ）と、その各差分Δ（Ｌ
＋Ｒ）、Δ（Ｌ−Ｒ）（ＤＰＣＭデータ）の３グループ
の１つを選択的に予測符号化するように構成されてい
る。In the third embodiment, as shown in FIG. 8, the original signals L and R (PCM data), the sum signal (L + R), the difference signal (LR) (PCM data), and each difference Δ (L
+ R) and one of the three groups Δ (LR) (DPCM data) are selectively predictively encoded.

【００２９】このため図８に示す符号化装置では、図６
に示す構成に対して原信号Ｌ、Ｒをそれぞれ予測符号化
するための予測回路１５Ｌ、１５Ｒとバッファ・選択器
１６Ｌ、１６Ｒが追加されている。また、選択信号生成
器１７はバッファ・選択器１６Ｌ、１６Ｒにより選択さ
れた原信号Ｌ、Ｒと、バッファ・選択器１６Ａ、１６Ｓ
により選択された和信号（Ｌ＋Ｒ）、差信号（Ｌ−Ｒ）
と、バッファ・選択器１６Ｄ１、１６Ｄ２により選択さ
れた各差分Δ（Ｌ＋Ｒ）、Δ（Ｌ−Ｒ）の各予測残差の
最小値に基づいて圧縮率が高いグループのデータを選択
する。このとき、その選択フラグ（予測回路選択フラ
グ）を追加して多重化する。For this reason, the encoding apparatus shown in FIG.
In addition to the configuration shown in (1), prediction circuits 15L and 15R for predictively encoding the original signals L and R, and buffers / selectors 16L and 16R are added. Further, the selection signal generator 17 includes the original signals L and R selected by the buffer / selectors 16L and 16R and the buffer / selectors 16A and 16S.
Signal (L + R), difference signal (LR) selected by
And data of a group having a high compression rate based on the minimum values of the prediction residuals of the differences Δ (L + R) and Δ (LR) selected by the buffer / selectors 16D1 and 16D2. At this time, the selection flag (prediction circuit selection flag) is added and multiplexed.

【００３０】また、図８に示す３グループの予測回路が
同一の構成である場合、復号装置では図９に示すように
３グループ分の予測回路を設ける必要はなく、１つのグ
ループ分の予測回路でよい。そして、符号化装置から伝
送された予測回路選択フラグに基づいて、ＤＰＣＭデー
タの場合には累積演算回路２５ａ、２５ｂの出力を選択
し、ＰＣＭデータの場合には加算回路２３ａ、２３ｂの
出力を選択してチャネル相関回路Ｂにより原信号Ｌ、Ｒ
（上位１６ビットデータ）を復元する。そして、更にセ
レクタ２７ａ、２７ｂにより原信号Ｌ、Ｒのグループの
場合には加算回路２３ａ、２３ｂの出力を選択し、他の
場合にはチャネル相関回路Ｂの出力を選択する。次いで
セレクタ２７ａ、２７ｂによりそれぞれ選択された各チ
ャネルの上位１６ビットデータと下位ビットデータが加
算器２８ａ、２８ｂにより加算される。When the three groups of prediction circuits shown in FIG. 8 have the same configuration, the decoding device does not need to provide three groups of prediction circuits as shown in FIG. Is fine. Then, based on the prediction circuit selection flag transmitted from the encoding device, the output of the accumulation operation circuits 25a and 25b is selected in the case of DPCM data, and the output of the addition circuits 23a and 23b is selected in the case of PCM data. The original signals L and R are
(Upper 16-bit data). Further, the selectors 27a and 27b select the outputs of the adders 23a and 23b in the case of the group of the original signals L and R, and select the output of the channel correlation circuit B in other cases. Next, the upper 16-bit data and lower bit data of each channel selected by the selectors 27a and 27b are added by the adders 28a and 28b.

【００３１】なお、上記の第１〜第３の実施形態では、
原信号が２チャネルの場合について説明したが、マルチ
チャネル信号の場合にも適用することができる。ここ
で、マルチチャネル方式としては次の４つの方式が知ら
れている。（１）ドルビーサラウンド方式前方Ｌ、Ｃ、Ｒの３チャネル＋後方Ｓの１チャネルの合
計４チャネル（２）ドルビーＡＣ−３方式前方Ｌ、Ｃ、Ｒ、ＳＷの４チャネル＋後方ＳＬ、ＳＲの
２チャネルの合計６チャネル（３）ＤＴＳ（Digital Theater System）方式ドルビーＡＣ−３方式と同様に６チャネル（Ｌ、Ｃ、
Ｒ、ＳＷ、ＳＬ、ＳＲ）（４）ＳＤＤＳ（Sony Dynamic Digital Sound）方式前方Ｌ、ＬＣ、Ｃ、ＲＣ、Ｒ、ＳＷの６チャネル＋後方
ＳＬ、ＳＲの２チャネルの合計８チャネルIn the first to third embodiments,
Although the case where the original signal has two channels has been described, the present invention can also be applied to the case of a multi-channel signal. Here, the following four systems are known as multi-channel systems. (1) Dolby Surround System 3 channels of front L, C, R + 1 channel of rear S in total 4 channels (2) Dolby AC-3 system 4 channels of front L, C, R, SW + rear SL, SR 6 channels in total of 2 channels (3) DTS (Digital Theater System) system 6 channels (L, C,
R, SW, SL, SR) (4) Sony Dynamic Digital Sound (SDDS) system: 6 channels of front L, LC, C, RC, R, SW + rear SL, 2 channels of SR, totaling 8 channels

【００３２】そこで、図１に示す相関回路Ａは、マルチ
チャネル信号の一例としてレフト（Ｌ）、センタ
（Ｃ）、ライト（Ｒ）、サラウンドレフト（ＳＬ）及び
サラウンドライト（ＳＲ）の５chのＰＣＭデータを、Ｌ
chを基準として次の５ch（Ｌ）、（Ｄ１）〜（Ｄ４）に
変換する。Ｌ＝Ｌ（基準チャネル）Ｄ１＝Ｃ−（Ｌ＋Ｒ）／２Ｄ２＝Ｒ−ＬＤ３＝ＳＬ−ａ×ＬＤ４＝ＳＲ−ｂ×Ｒ但し、０≦ａ，ｂ≦１Therefore, the correlation circuit A shown in FIG. 1 uses a 5-channel PCM of left (L), center (C), right (R), surround left (SL) and surround right (SR) as an example of a multi-channel signal. The data is L
The channel is converted into the following 5 channels (L) and (D1) to (D4) based on the channel. L = L (reference channel) D1 = C-(L + R) / 2 D2 = R-L D3 = SL-a x L D4 = SR-b x R where 0 ≤ a, b ≤ 1

【００３３】そして、この５chの各チャネルデータを上
位１６ビットと下位ビットに分離し、上位１６ビットデ
ータを予測符号化して伝送し、下位ビットデータをその
まま伝送する。また、相関を求める場合、次の５chに変
換するようにしてもよい。Ｌ＝Ｌ（基準チャネル）Ｄ１＝Ｃ−ＬＤ２＝Ｒ−ＬＤ３＝ＳＬ−ＬＤ４＝ＳＲ−ＲThen, each channel data of the 5 channels is separated into upper 16 bits and lower bits, the upper 16 bits data is predictively encoded and transmitted, and the lower bit data is transmitted as it is. When obtaining the correlation, the correlation may be converted to the next 5 channels. L = L (reference channel) D1 = CL-D2 = RL-D3 = SL-LD4 = SR-R

【００３４】符号化側により予測符号化された可変レー
トビットストリームデータをネットワークを介して伝送
する場合には、符号化側では図１０に示すように伝送用
にパケット化し（ステップＳ４１）、次いでパケットヘ
ッダを付与し（ステップＳ４２）、次いでこのパケット
をネットワーク上に送り出す（ステップＳ４３）。復号
側では図１１に示すようにヘッダを除去し（ステップＳ
５１）、次いでデータを復元し（ステップＳ５２）、次
いでこのデータをメモリに格納して復号を待つ（ステッ
プＳ５３）。When the variable-rate bit stream data that has been predictively coded by the coding side is transmitted through a network, the coding side packetizes the data for transmission as shown in FIG. A header is added (step S42), and the packet is sent out to the network (step S43). On the decoding side, the header is removed as shown in FIG.
51) Then, the data is restored (step S52), and the data is stored in the memory and decoding is waited (step S53).

【００３５】[0035]

【発明の効果】以上説明したように本発明によれば、複
数チャネルの音声信号をチャネル毎に上位ビットデータ
と下位ビットデータに分離して、上位ビットデータを予
測符号化するようにしたので、複数チャネルの音声信号
を予測符号化する場合に圧縮率を改善することができ
る。As described above, according to the present invention, audio signals of a plurality of channels are separated into upper bit data and lower bit data for each channel, and the upper bit data is predictively coded. It is possible to improve the compression ratio when predictive coding is performed on audio signals of a plurality of channels.

[Brief description of the drawings]

【図１】本発明に係る音声符号化装置及びそれに対応し
た音声復号装置の第１の実施形態を示すブロック図であ
る。FIG. 1 is a block diagram showing a first embodiment of a speech encoding apparatus according to the present invention and a speech decoding apparatus corresponding thereto.

【図２】図１のエンコーダを詳しく示すブロック図であ
る。FIG. 2 is a block diagram showing the encoder of FIG. 1 in detail.

【図３】ＤＶＤのパックのフォーマットを示す説明図で
ある。FIG. 3 is an explanatory diagram showing a format of a DVD pack.

【図４】ＤＶＤのオーディオパックのフォーマットを示
す説明図である。FIG. 4 is an explanatory diagram showing a format of a DVD audio pack.

【図５】図１のデコーダを詳しく示すブロック図であ
る。FIG. 5 is a detailed block diagram illustrating the decoder of FIG. 1;

【図６】第２の実施形態のエンコーダを示すブロック図
である。FIG. 6 is a block diagram illustrating an encoder according to a second embodiment.

【図７】第２の実施形態のデコーダを示すブロック図で
ある。FIG. 7 is a block diagram illustrating a decoder according to a second embodiment.

【図８】第３の実施形態のエンコーダを示すブロック図
である。FIG. 8 is a block diagram illustrating an encoder according to a third embodiment.

【図９】第３の実施形態のデコーダを示すブロック図で
ある。FIG. 9 is a block diagram illustrating a decoder according to a third embodiment.

【図１０】音声伝送方法を示すフローチャートである。FIG. 10 is a flowchart showing a voice transmission method.

【図１１】音声伝送方法を示すフローチャートである。FIG. 11 is a flowchart showing a voice transmission method.

[Explanation of symbols]

１ａ加算回路（加算手段）１ｂ減算回路（減算手段）１０１フレームバッファ（分離手段）１１Ｄ１差分演算回路（第１の差分演算手段）１１Ｄ２差分演算回路（第２の差分演算手段）１２ａ−１〜１２ａ−ｎ予測器（減算器１３ａ−１〜
１３ａ−ｎ、バッファ・選択器１６Ｄ１と共に第１の予
測符号化手段を構成する。）１２ｂ−１〜１２ｂ−ｎ予測器（減算器１３ｂ−１〜
１３ｂ−ｎ、バッファ・選択器１６Ｄ２と共に第２の予
測符号化手段を構成する。）１３ａ−１〜１３ａ−ｎ，１３ｂ−１〜１３ｂ−ｎ減
算器１６Ｄ１，１６Ｄ２，１６Ａ，１６Ｓ，１６Ｌ，１６Ｒ
バッファ・選択器１５Ａ予測回路（バッファ・選択器１６Ａと共に第３
の予測符号化手段を構成する。）１５Ｓ予測回路（バッファ・選択器１６Ｓと共に第４
の予測符号化手段を構成する。）１５Ｌ予測回路（バッファ・選択器１６Ｌと共に第５
の予測符号化手段を構成する。）１５Ｒ予測回路（バッファ・選択器１６Ｒと共に第６
の予測符号化手段を構成する。）１９マルチプレクサ（多重化手段）Ａ相関回路（相関手段）1a Addition circuit (addition means) 1b Subtraction circuit (subtraction means) 10 1 frame buffer (separation means) 11D1 Difference calculation circuit (first difference calculation means) 11D2 Difference calculation circuit (second difference calculation means) 12a-1 12a-n predictor (subtractors 13a-1 to 13a-1)
13a-n and the buffer / selector 16D1 constitute a first predictive encoding means. ) 12b-1 to 12b-n predictors (subtractors 13b-1 to 13b-1)
13b-n and the buffer / selector 16D2 constitute a second predictive encoding means. 13a-1 to 13a-n, 13b-1 to 13b-n Subtractors 16D1, 16D2, 16A, 16S, 16L, 16R
Buffer / selector 15A Prediction circuit (third with buffer / selector 16A)
Of predictive encoding means. ) 15S prediction circuit (4th with buffer / selector 16S)
Of predictive encoding means. ) 15L prediction circuit (fifth with buffer / selector 16L)
Of predictive encoding means. ) 15R prediction circuit (6th with buffer / selector 16R)
Of predictive encoding means. 19) Multiplexer (multiplexing means) A Correlation circuit (correlation means)

─────────────────────────────────────────────────────
────────────────────────────────────────────────── ───

【手続補正書】[Procedure amendment]

【提出日】平成１３年２月７日（２００１．２．７）[Submission date] February 7, 2001 (2001.2.7)

【手続補正１】[Procedure amendment 1]

【補正対象書類名】明細書[Document name to be amended] Statement

【補正対象項目名】特許請求の範囲[Correction target item name] Claims

【補正方法】変更[Correction method] Change

【補正内容】[Correction contents]

【特許請求の範囲】[Claims]

【手続補正２】[Procedure amendment 2]

【補正対象書類名】明細書[Document name to be amended] Statement

【補正対象項目名】０００７[Correction target item name] 0007

【補正方法】変更[Correction method] Change

【補正内容】[Correction contents]

【０００７】１）同一のサンプリング周波数であると共
に２つの系統からなる第1の複数チャネルのデジタル音
声信号を所定のマトリクス演算により互いに相関性のあ
る第2の複数チャネルの音声信号に変換する相関手段
と、前記第2の複数チャネルの音声信号をチャネル毎に
上位ビットと下位ビットデータに分離する分離手段と、
前記分離手段により分離された上位ビットデータを各チ
ャネル毎に、入力されるデータに応答して、先頭サンプ
ル値を得ると共に、特性が異なる複数の線形予測方法に
より時間領域の過去から現在の信号の線形予測値がそれ
ぞれ予測され、その予測される線形予測値と前記音声信
号とから得られる予測残差が最小となるような線形予測
方法を選択する予測符号化手段と、前記予測符号化手段
により選択された各チャネルの先頭サンプル値と予測残
差と線形予測方法を含む予測符号化データと、前記分離
手段により分離された下位ビットデータと分離フラグと
を所定のフォーマットで多重化する手段とを、有する音
声符号化装置。２）同一のサンプリング周波数であると共に２つの系統
からなる第1の複数チャネルのデジタル音声信号を所定
のマトリクス演算により互いに相関性のある第2の複数
チャネルの音声信号に変換する相関手段と、前記変換さ
れた第2の複数チャネルの音声信号を、チャネル毎に入
力される音声信号に応答して先頭サンプル値を得ると共
に、特性が異なる複数の線形予測方法により時間領域の
過去から現在の信号の線形予測値がそれぞれ予測され、
その予測される線形予測値と前記音声信号とから得られ
る予測残差が最小となるような線形予測方法を選択して
予測符号化する予測符号化手段と、前記予測符号化手段
により選択された各チャネルの先頭サンプル値と予測残
差と線形予測方法を含む予測符号化データを所定のフォ
ーマットで多重化する手段とを、有する音声符号化装
置。1) Correlation means for converting digital audio signals of a first plurality of channels having the same sampling frequency and composed of two systems into audio signals of a plurality of second channels mutually correlated by a predetermined matrix operation. And the second plurality of audio signals for each channel.
Separating means for separating the upper and lower bits data,
The higher-order bit data separated by the separating means is obtained for each channel, in response to input data, to obtain a leading sample value and to perform a plurality of linear prediction methods having different characteristics.
A linear prediction of the signal from past to present in the more time domain
Respectively, and the predicted linear prediction value and the voice signal are predicted.
And a prediction encoding means for selecting a linear prediction method such that the prediction residual obtained from the signal is the minimum, and a head sample value, a prediction residual, and a linear prediction method for each channel selected by the prediction encoding means. A speech encoding apparatus comprising: a prediction coded data including the coded data; and means for multiplexing the lower bit data and the separation flag separated by the separation means in a predetermined format. 2) correlating means for converting digital audio signals of a first plurality of channels having the same sampling frequency and composed of two systems into audio signals of a plurality of mutually correlated channels by a predetermined matrix operation; The converted second plurality of channels of the audio signal, the head sample value is obtained in response to the audio signal input for each channel, and the current signal from the past in the time domain by a plurality of linear prediction methods having different characteristics . Each linear prediction is predicted,
Obtained from the predicted linear prediction value and the audio signal
That the predictive coding means for predictive residual is predictive coding by selecting a linear prediction method that minimizes the leading sample value and the prediction residual and linear prediction method for each channel selected by said predictive coding means Means for multiplexing predictive encoded data including a predetermined format in a predetermined format.

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｇ１０Ｌ 9/14 Ｊ 9/18 Ｄ ──────────────────────────────────────────────────続き Continued on the front page (51) Int.Cl. ⁷ Identification symbol FI Theme coat ゛ (Reference) G10L 9/14 J 9/18 D

Claims

[Claims]

[Claim 1] The same sampling frequency and 2
Digital audio signals of a first plurality of channels composed of two systems are correlated with each other by a predetermined matrix operation.
Correlating means for converting the audio signals of the second plurality of channels into audio signals of two or more channels; separating means for separating the audio signals of the second plurality of channels into upper bit and lower bit data as necessary for each channel; In this case, the upper bit data is obtained for each channel in response to the input data, to obtain a first sample value, and to calculate a prediction residual among a plurality of prediction values of a current signal from a past signal in a time domain. Predictive encoding means for selecting a linear prediction method that minimizes the difference; predictive data including a head sample value of each channel selected by the predictive encoding means, a prediction residual, and a linear prediction method; Means for multiplexing the lower-order bit data and the separation flag in a predetermined format when separated by the above-mentioned method.

2. The same sampling frequency and 2
Digital audio signals of a first plurality of channels composed of two systems are correlated with each other by a predetermined matrix operation.
Correlation means for converting into a plurality of two-channel audio signals, the converted second plurality of channels of audio signals, in response to the audio signal input for each channel to obtain a leading sample value, and in the time domain Prediction encoding means for selecting and predictively encoding a linear prediction method whose prediction residual is minimized among a plurality of prediction values of the current signal from the past signal, and each of the prediction encoding means selected by the prediction encoding means Means for multiplexing, in a predetermined format, predicted data including a head sample value of a channel, a prediction residual, and a linear prediction method.