JPH09120646A

JPH09120646A - Audio signal-compressing/recording apparatus and audio signal-compressing apparatus and optical recording medium

Info

Publication number: JPH09120646A
Application number: JP6527496A
Authority: JP
Inventors: Shoji Ueno; 昭治植野; Norihiko Fuchigami; 徳彦渕上
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 1995-08-22
Filing date: 1996-02-27
Publication date: 1997-05-06

Abstract

PROBLEM TO BE SOLVED: To record data to CD-ROMs or DVDs by formatting compressed quantized data to a user area by a formatting means, recording the data to a recording medium and compressing audio signals with a high compression rate. SOLUTION: An A/D converter 1 quantizes audio signals input to an input terminal IN with a frequency of not lower than 441kHz. The data are written to a memory 3 via a signal-processing circuit 2. The signal-processing circuit 3 sequentially writes fresh frames normalized quantized per band to the memory 3. A CD-ROM-coding circuit 4 reads out the data and adds a synchronous signal, a header, a sub header and the like to each sector. The data are input to a CD-coding circuit 5, formatted in a CD format and recorded by a recording head via a second output terminal OUT2.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、オーディオ信号を
高能率符号化して光記録媒体に記録するオーディオ信号
圧縮記録装置及びそのためのオーディオ信号圧縮装置、
さらに光記録媒体に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an audio signal compression recording apparatus for high efficiency encoding an audio signal and recording it on an optical recording medium, and an audio signal compression apparatus therefor.
Furthermore, it relates to an optical recording medium.

【０００２】[0002]

【従来の技術】ＣＤ（コンパクトディスク）は１９８２
年に登場して十数年が経過し、現在では様々な展開によ
りデジタルストレージメディアとして定着している。オ
ーディオメディアの用途を考えると、サンプリング周波
数ｆs ＝４４．１ｋＨｚ、量子化ビット数＝１６ビット
のこのメディアは完全に成熟期に入っている。さらに、
ＤＶＤと呼ばれる高密度ディスクがコンピュータなどの
データ用のデジタルディスクとして利用されようとして
いる。なお、デジタルディスクとはＣＤ、ＣＤ−ＲＯ
Ｍ、ＤＶＤなどオーディオやビデオ信号がデジタル信号
として記録された光ディスクをいうものとする。2. Description of the Related Art CD (Compact Disc) is 1982
Ten years have passed since it first appeared in the year, and now it has become established as a digital storage medium due to various developments. Considering the use of audio media, this media with sampling frequency fs = 44.1 kHz and quantization bit rate = 16 bits is completely in its maturity stage. further,
High density discs called DVDs are about to be used as digital discs for data in computers and the like. A digital disc is a CD or CD-RO.
An optical disc such as an M or a DVD in which audio or video signals are recorded as digital signals is referred to.

【０００３】[0003]

【発明が解決しようとする課題】通常の音楽（オーディ
オ）用のＣＤ（以下ＣＤ−ＤＡという）はサンプリング
周波数ｆs ＝４４．１ｋＨｚ、量子化ビット数＝１６ビ
ットで２チャンネルのオーディオ信号を記録することが
できるが、これまでのＣＤ−ＤＡの規格では同一データ
量をＣＤ−ＲＯＭのフォーマットで記録することができ
なかった。これは、ＣＤ−ＲＯＭのフォーマットには同
期信号（ＳＹＮＣ）やアドレスやモードを含むヘッダが
あるため、オーディオ信号を記録するための記録容量が
ＣＤ−ＤＡより少ないためである。一方、パソコンやそ
の周辺機器の発達と急速な普及により、ＣＤ−ＲＯＭド
ライブを介して、音楽などを高音質で楽しみたいという
要望がある。An ordinary music (audio) CD (hereinafter referred to as CD-DA) records a two-channel audio signal with a sampling frequency fs = 44.1 kHz and a quantization bit number = 16 bits. However, the same data amount cannot be recorded in the CD-ROM format according to the conventional CD-DA standard. This is because the format of the CD-ROM has a header including a synchronization signal (SYNC), an address, and a mode, so that the recording capacity for recording the audio signal is smaller than that of the CD-DA. On the other hand, with the development and rapid spread of personal computers and their peripheral devices, there is a demand to enjoy music with high sound quality through a CD-ROM drive.

【０００４】さらに、ＤＶＤと呼ばれるデジタルディス
クでは音声がリニアＰＣＭにより圧縮されずに記録され
ているため、よりハイファイ性の高い記録のためにはデ
ータ量を要し、記録時間が短くなる。このディスクのた
めには、直交変換及び／又はハフマン符号によりデータ
処理してデータ量を削減するための圧縮を行って、ＤＶ
Ｄのフォーマットで記録する記録装置並びにかかる方式
で記録された光ディスクが考えられる。Further, in a digital disc called a DVD, sound is recorded without being compressed by the linear PCM, so that a data amount is required for recording with high hi-finess and the recording time is shortened. For this disc, data is processed by orthogonal transform and / or Huffman code to perform compression for reducing the data amount, and DV is used.
A recording device for recording in the D format and an optical disc recorded by such a method are conceivable.

【０００５】したがって、本発明は現在のＣＤ−ＤＡ用
のデータ処理に比較して高い圧縮率により、量子化ビッ
ト数１６ビット、標本化周波数４４．１ｋＨｚ又はそれ
以上の周波数で量子化して得たデータをＣＤ−ＲＯＭに
記録することができるオーディオ信号圧縮記録装置及び
オーディオ信号圧縮装置並びに光記録媒体を提供するこ
とを目的とする。Therefore, the present invention is obtained by quantizing at a quantization bit number of 16 bits and a sampling frequency of 44.1 kHz or higher due to a higher compression rate than the current data processing for CD-DA. An object is to provide an audio signal compression recording device, an audio signal compression device, and an optical recording medium capable of recording data on a CD-ROM.

【０００６】また、本発明は、ＣＤバリエイション（サ
イズ・変調方式）の範囲内でデータフォーマットが一般
に異なると見られているＤＶＤオーディオに記録するこ
とができるオーディオ信号圧縮記録装置及びオーディオ
信号圧縮装置並びに光記録媒体を提供することを目的と
する。Further, the present invention provides an audio signal compression recording apparatus and an audio signal compression apparatus capable of recording on a DVD audio which is generally considered to have a different data format within the range of CD variation (size / modulation method). It is an object to provide an optical recording medium.

【０００７】[0007]

【課題を解決するための手段】本発明は上記目的を達成
するために、オーディオ信号を量子化ビット数１６ビッ
ト、標本化周波数４４．１ｋＨｚ又はそれ以上の周波数
で量子化し、量子化された所定量の量子化データ毎に直
交変換を適用してデータ量を削減・圧縮し、圧縮された
データをＣＤ−ＲＯＭＸＡ規格のモード２、フォーム２
のユーザデータ領域又はＣＤ−ＲＯＭ規格のモード２の
ユーザデータ領域、あるいはＤＶＤのユーザデータ領域
に配するようフォーマッティングし、フォーマッティン
グされたデータを記録媒体に記録するようにしている。In order to achieve the above object, the present invention quantizes an audio signal at a quantization bit number of 16 bits and a sampling frequency of 44.1 kHz or higher and quantizes it. Orthogonal transformation is applied to each fixed amount of quantized data to reduce / compress the amount of data, and the compressed data is converted into the CD-ROMXA standard mode 2, form 2
The user data area, the user data area of the mode 2 of the CD-ROM standard, or the user data area of the DVD is formatted, and the formatted data is recorded on the recording medium.

【０００８】すなわち、本発明によれば、オーディオ信
号を量子化ビット数１６ビット、標本化周波数４４．１
ｋＨｚ又はそれ以上の周波数で量子化する量子化手段
と、前記量子化手段で量子化された所定量の量子化デー
タ毎に直交変換を適用してデータ量を圧縮するデータ圧
縮手段と、前記データ圧縮手段で圧縮されたデータをデ
ジタルディスクのユーザデータ領域に配するようフォー
マッティングするフォーマッティング手段と、前記フォ
ーマッティング手段でフォーマッティングされたデータ
をＣＤフォーマットとして記録媒体に記録する手段と
を、有するオーディオ信号圧縮記録装置が提供される。That is, according to the present invention, the audio signal is quantized with 16 bits and the sampling frequency is 44.1.
a quantizing means for quantizing at a frequency of kHz or higher; a data compressing means for compressing the data quantity by applying orthogonal transformation for each predetermined quantity of quantized data quantized by the quantizing means; Audio signal compression recording having formatting means for formatting the data compressed by the compression means so as to be arranged in the user data area of the digital disc, and means for recording the data formatted by the formatting means in a recording medium as a CD format. A device is provided.

【０００９】また、本発明によればオーディオ信号を量
子化ビット数１６ビット、標本化周波数４４．１ｋＨｚ
又はそれ以上の周波数で量子化する量子化手段と、前記
量子化手段で量子化された所定量の量子化データ毎に直
交変換を適用してデータ量を圧縮するデータ圧縮手段
と、前記データ圧縮手段で圧縮されたデータをＣＤ−Ｒ
ＯＭＸＡ規格のモード２、フォーム２のユーザデータ領
域に配するようフォーマッティングするフォーマッティ
ング手段と、前記フォーマッティング手段でフォーマッ
ティングされたデータをＣＤフォーマットとして記録媒
体に記録する手段とを、有するオーディオ信号圧縮記録
装置が提供される。According to the present invention, the audio signal is quantized with 16 bits and the sampling frequency is 44.1 kHz.
Or a quantizing means for quantizing at a frequency higher than that, a data compressing means for compressing the data quantity by applying orthogonal transformation for each predetermined quantity of quantized data quantized by the quantizing means, and the data compressing means. CD-R data compressed by means
An audio signal compression recording apparatus having formatting means for formatting so as to be arranged in a user data area of mode 2 of the OMXA standard, and means for recording the data formatted by the formatting means as a CD format on a recording medium. Provided.

【００１０】さらに、本発明によれば、オーディオ信号
を量子化ビット数１６ビット、標本化周波数４４．１ｋ
Ｈｚ又はそれ以上の周波数で量子化する量子化手段と、
前記量子化手段で量子化された所定量の量子化データ毎
に直交変換を適用してデータ量を圧縮するデータ圧縮手
段と、前記データ圧縮手段で圧縮されたデータをＣＤ−
ＲＯＭ規格のモード２のユーザデータ領域に配するよう
フォーマッティングするフォーマッティング手段と、前
記フォーマッティング手段でフォーマッティングされた
データをＣＤフォーマットとして記録媒体に記録する手
段とを、有するオーディオ信号圧縮記録装置が提供され
る。Further, according to the present invention, the audio signal is quantized with 16 bits and the sampling frequency is 44.1k.
A quantizing means for quantizing at a frequency of Hz or higher;
A data compression unit that applies an orthogonal transformation to each predetermined amount of quantized data quantized by the quantization unit to compress the data amount; and a data compressed by the data compression unit on a CD-
There is provided an audio signal compression recording apparatus having formatting means for formatting so as to be arranged in a user data area of ROM standard mode 2, and means for recording the data formatted by the formatting means on a recording medium as a CD format. .

【００１１】また、本発明によれば、オーディオ信号を
量子化ビット数１６ビット、標本化周波数４４．１ｋＨ
ｚ又はそれ以上の周波数で量子化する量子化手段と、前
記量子化手段で量子化された所定量の量子化データ毎に
直交変換を適用してデータ量を圧縮するデータ圧縮手段
と、前記データ圧縮手段で圧縮されたデータをＤＶＤの
ユーザデータ領域に配するようフォーマッティングする
フォーマッティング手段と、前記フォーマッティング手
段でフォーマッティングされたデータをＣＤフォーマッ
トとして記録媒体に記録する手段とを、有するオーディ
オ信号圧縮記録装置が提供される。Further, according to the present invention, the audio signal is quantized with 16 bits and the sampling frequency is 44.1 kHz.
a quantizing means for quantizing at a frequency of z or higher; a data compressing means for compressing the data quantity by applying orthogonal transformation for each predetermined quantity of quantized data quantized by the quantizing means; Audio signal compression recording apparatus having formatting means for formatting the data compressed by the compression means so as to be arranged in the user data area of the DVD, and means for recording the data formatted by the formatting means as a CD format on a recording medium. Will be provided.

【００１２】なお、本発明は上記のようにオーディオ信
号圧縮記録装置として捉えられるが、さらに、再生専用
のディスクの製造のためには、ＣＤフォーマットとして
記録する工程はディスク製造工場側のタスクとなる。し
たがって、本発明はオーディオ信号圧縮装置としても捉
えることができる。Although the present invention can be regarded as an audio signal compression recording apparatus as described above, in order to manufacture a read-only disc, the step of recording as a CD format is a task of the disc manufacturing factory side. . Therefore, the present invention can be regarded as an audio signal compression device.

【００１３】すなわち、本発明によれば、オーディオ信
号を量子化ビット数１６ビット、標本化周波数４４．１
ｋＨｚ又はそれ以上の周波数で量子化する量子化手段
と、前記量子化手段で量子化された所定量の量子化デー
タ毎に直交変換を適用してデータ量を圧縮するデータ圧
縮手段と、前記データ圧縮手段で圧縮されたデータをデ
ジタルディスクのユーザデータ領域に配するようフォー
マッティングするフォーマッティング手段とを、有する
オーディオ信号圧縮装置が提供される。That is, according to the present invention, the audio signal is quantized with 16 bits and the sampling frequency is 44.1.
a quantizing means for quantizing at a frequency of kHz or higher; a data compressing means for compressing the data quantity by applying orthogonal transformation for each predetermined quantity of quantized data quantized by the quantizing means; An audio signal compression device is provided, which comprises formatting means for formatting the data compressed by the compression means so as to be arranged in a user data area of a digital disc.

【００１４】また、本発明によれば、オーディオ信号を
量子化ビット数１６ビット、標本化周波数４４．１ｋＨ
ｚ又はそれ以上の周波数で量子化する量子化手段と、前
記量子化手段で量子化された所定量の量子化データ毎に
直交変換を適用してデータ量を圧縮するデータ圧縮手段
と、前記データ圧縮手段で圧縮されたデータをＣＤ−Ｒ
ＯＭＸＡ規格のモード２、フォーム２のユーザデータ領
域に配するようフォーマッティングするフォーマッティ
ング手段とを、有するオーディオ信号圧縮装置が提供さ
れる。Further, according to the present invention, the audio signal is quantized with 16 bits and the sampling frequency is 44.1 kHz.
a quantizing means for quantizing at a frequency of z or higher; a data compressing means for compressing the data quantity by applying orthogonal transformation for each predetermined quantity of quantized data quantized by the quantizing means; The data compressed by the compression means is CD-R
An audio signal compression apparatus is provided, which has a formatting means for formatting to be arranged in a user data area of mode 2 of the OMXA standard.

【００１５】また、本発明によれば、オーディオ信号を
量子化ビット数１６ビット、標本化周波数４４．１ｋＨ
ｚ又はそれ以上の周波数で量子化する量子化手段と、前
記量子化手段で量子化された所定量の量子化データ毎に
直交変換を適用してデータ量を圧縮するデータ圧縮手段
と、前記データ圧縮手段で圧縮されたデータをＤＶＤの
ユーザデータ領域に配するようフォーマッティングする
フォーマッティング手段とを、有するオーディオ信号圧
縮装置が提供される。Further, according to the present invention, the audio signal is quantized with 16 bits and the sampling frequency is 44.1 kHz.
a quantizing means for quantizing at a frequency of z or higher; a data compressing means for compressing the data quantity by applying orthogonal transformation for each predetermined quantity of quantized data quantized by the quantizing means; An audio signal compression apparatus is provided, which includes formatting means for formatting the data compressed by the compression means so as to be arranged in the user data area of the DVD.

【００１６】また、本発明によれば、オーディオ信号を
量子化ビット数１６ビット、標本化周波数４４．１ｋＨ
ｚ又はそれ以上の周波数で量子化し、量子化された所定
量の量子化データ毎に直交変換を適用してデータ量を圧
縮し、圧縮されたデータをデジタルディスクのユーザデ
ータ領域に配するようフォーマッティングし、フォーマ
ッティングされたデータをＣＤフォーマットとして記録
した光記録媒体が提供される。Further, according to the present invention, the audio signal is quantized with 16 bits and the sampling frequency is 44.1 kHz.
Formatting is performed by quantizing at a frequency of z or higher, applying an orthogonal transform to each quantized predetermined amount of quantized data to compress the data amount, and arranging the compressed data in a user data area of a digital disc. Then, an optical recording medium in which the formatted data is recorded in the CD format is provided.

【００１７】また、本発明によれば、オーディオ信号を
量子化ビット数１６ビット、標本化周波数４４．１ｋＨ
ｚ又はそれ以上の周波数で量子化し、量子化された所定
量の量子化データ毎に直交変換を適用してデータ量を圧
縮し、圧縮されたデータをＣＤ−ＲＯＭＸＡ規格のモー
ド２、フォーム２のユーザデータ領域に配するようフォ
ーマッティングし、フォーマッティングされたデータを
ＣＤフォーマットとして記録した光記録媒体が提供され
る。Further, according to the present invention, the audio signal is quantized with 16 bits and the sampling frequency is 44.1 kHz.
Quantize at a frequency of z or higher, apply an orthogonal transform to each quantized predetermined amount of quantized data to compress the data amount, and compress the compressed data in the mode 2 of the CD-ROMXA standard, form 2. There is provided an optical recording medium which is formatted so as to be arranged in the user data area and in which the formatted data is recorded as a CD format.

【００１８】さらに、本発明によれば、オーディオ信号
を量子化ビット数１６ビット、標本化周波数４４．１ｋ
Ｈｚ又はそれ以上の周波数で量子化し、量子化された所
定量の量子化データ毎に直交変換を適用してデータ量を
圧縮し、圧縮されたデータをＤＶＤのユーザデータ領域
に配するようフォーマッティングし、フォーマッティン
グされたデータをＣＤフォーマットとして記録した光記
録媒体が提供される。Further, according to the present invention, the audio signal is quantized with 16 bits and the sampling frequency is 44.1k.
Quantize at a frequency of Hz or higher, apply orthogonal transformation to each quantized predetermined amount of quantized data, compress the data amount, and format the compressed data to be arranged in the user data area of the DVD. Provided is an optical recording medium in which formatted data is recorded in a CD format.

【００１９】[0019]

【発明の実施の形態】本発明のオーディオ信号圧縮記録
装置及びオーディオ信号圧縮装置並びに光記録媒体の実
施の形態を好ましい実施例によって説明する。図１は本
発明のオーディオ信号圧縮記録装置の好ましい実施例を
示すブロック図である。入力端子ＩＮには例えば音楽信
号などのアナログ信号が供給され、出力端子ＯＵＴは図
示省略のＣＤ原盤作成機、すなわちマスタリング装置に
必要に応じてプリマスタリング装置を介して接続され
る。マスタリング装置自体は従来のものと本質的に変ら
ないので、ここでは説明を省略する。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of an audio signal compression recording apparatus, an audio signal compression apparatus and an optical recording medium according to the present invention will be described with reference to preferred embodiments. FIG. 1 is a block diagram showing a preferred embodiment of an audio signal compression recording apparatus of the present invention. An analog signal such as a music signal is supplied to the input terminal IN, and the output terminal OUT is connected to a CD master making machine (not shown), that is, a mastering device via a premastering device as necessary. Since the mastering device itself is essentially the same as the conventional one, its explanation is omitted here.

【００２０】図１の装置は入力端子ＩＮに接続されたＡ
／Ｄ変換器１と、その出力に接続された信号処理回路２
と、信号処理回路２に接続されたメモリ３と、信号処理
回路２の出力に接続されたＣＤ−ＲＯＭ符号化回路４
と、ＣＤ−ＲＯＭ符号化回路４の出力に接続されたＣＤ
符号化回路５を有している。ＣＤ−ＲＯＭ符号化回路４
の出力は第１出力端子ＯＵＴ１に接続され、ＣＤ符号化
回路５の出力は第２出力端子ＯＵＴ２に接続されてい
る。なお、後述するように、ＣＤ符号化回路５は不要な
場合がある。The device of FIG. 1 has an A connected to the input terminal IN.
/ D converter 1 and signal processing circuit 2 connected to its output
A memory 3 connected to the signal processing circuit 2 and a CD-ROM encoding circuit 4 connected to the output of the signal processing circuit 2.
And a CD connected to the output of the CD-ROM encoding circuit 4.
It has an encoding circuit 5. CD-ROM encoding circuit 4
Is connected to the first output terminal OUT1 and the output of the CD encoding circuit 5 is connected to the second output terminal OUT2. As described later, the CD encoding circuit 5 may be unnecessary.

【００２１】Ａ／Ｄ変換器１はオーディオ信号を量子化
ビット数１６ビット、標本化周波数４４．１ｋＨｚ又は
それ以上の周波数で量子化する量子化手段として動作す
る。標本化周波数は実施例により４４．１ｋＨｚ（ＤＶ
Ｄの場合は４８ｋＨｚ）又は８８．２ｋＨｚ（ＤＶＤの
場合は９６ｋＨｚ）のいずれかになっているが、４４．
１ｋＨｚ以上の適当な値とすることができる。音楽信号
を対象とする場合は、通常左右の２チャンネルである
が、サラウンドその他の必要に応じて４チャンネルや６
チャンネルなどとすることができる。ここでは２チャン
ネルである場合について説明する。Ａ／Ｄ変換器１で得
られた量子化データは１チャンネルあたり２^m個(mは正
の整数）を単位として、信号処理回路２を介してメモリ
３に書き込まれる。その後、信号処理回路２がこの２^m
個のデータの処理を開始する。The A / D converter 1 operates as a quantizing means for quantizing an audio signal at a quantization bit number of 16 bits and a sampling frequency of 44.1 kHz or higher. The sampling frequency is 44.1 kHz (DV according to the embodiment).
It is either 48 kHz in the case of D) or 88.2 kHz (96 kHz in the case of DVD), but 44.
It can be set to an appropriate value of 1 kHz or more. When targeting music signals, there are usually 2 channels on the left and right, but 4 channels or 6 channels are required depending on the surround and other needs.
It can be a channel or the like. Here, the case of two channels will be described. The quantized data obtained by the A / D converter 1 is written in the memory 3 via the signal processing circuit 2 in units of 2 ^m pieces (m is a positive integer) per channel. Then, the signal processing circuit 2 is the 2 ^m
Start processing of this data.

【００２２】図２は信号処理回路２の一例を示すブロッ
ク図である。２^m個のデータは直交変換回路１０にて直
交変換が施され、周波数スペクトルが得られる。この周
波数スペクトルをバンド分割のための複数のフィルタ６
ａ，６ｂ，６ｃ．．．６ｎを有するフィルタバンク６と
選択手段としてのスイッチ回路７を介して正規化部・量
子化部１１に与え、バンド毎にまとめて正規化・量子化
する。ここで正規化レベル（ビット数）を補助情報、ス
ペクトルデータを主情報としてデータフレームとする。
このデータフレームからコードブックのインデックスを
補助情報、処理データを主情報として、新たなデータフ
レームを作成し、これを順次メモリ３に書き込む。次に
メモリ３からこの新たなデータフレームを読み出し、ア
ロケーション回路９を介して図１のＣＤ−ＲＯＭ符号化
回路４へ出力する。FIG. 2 is a block diagram showing an example of the signal processing circuit 2. The 2 ^m pieces of data are orthogonally transformed by the orthogonal transformation circuit 10 to obtain a frequency spectrum. A plurality of filters 6 for dividing the frequency spectrum into bands
a, 6b, 6c. . . It is given to the normalization section / quantization section 11 via the filter bank 6 having 6n and the switch circuit 7 as the selection means, and is normalized / quantized collectively for each band. Here, the normalization level (the number of bits) is used as auxiliary information, and the spectrum data is used as main information to form a data frame.
A new data frame is created from this data frame using the codebook index as auxiliary information and the processed data as main information, and this is sequentially written in the memory 3. Next, this new data frame is read from the memory 3 and output to the CD-ROM encoding circuit 4 of FIG. 1 via the allocation circuit 9.

【００２３】ＣＤ−ＲＯＭ符号化回路４では、図３によ
って後述する所定のフォーマットとなるように、各セク
タに同期信号（ＳＹＮＣ）やヘッダ、サブヘッダなどを
付加し、各セクタのユーザデータ領域に信号処理回路２
から与えられる圧縮オーディオデータを配して出力す
る。ＣＤ−ＲＯＭ符号化回路４の出力データは第１出力
端子ＯＵＴ１を介して出力され、例えば磁気テープに記
録されて、再生専用のＣＤを製造するためのプリマスタ
リング装置やマスタリング装置に供給される。一方、Ｃ
Ｄ−ＲＯＭ符号化回路４の出力データは、書込み可能
な、いわゆるライトワンスタイプのＣＤの場合は、ＣＤ
符号化回路５に与えられ、ＣＤフォーマット化され、第
２出力端子ＯＵＴ２を介して図示省略の記録ヘッドによ
り記録される。In the CD-ROM encoding circuit 4, a sync signal (SYNC), a header, a subheader, etc. are added to each sector so that a predetermined format described later with reference to FIG. Processing circuit 2
The compressed audio data given by is arranged and output. The output data of the CD-ROM encoding circuit 4 is output via the first output terminal OUT1, recorded on a magnetic tape, for example, and supplied to a premastering device or a mastering device for manufacturing a read-only CD. On the other hand, C
The output data of the D-ROM encoding circuit 4 is a CD in the case of a writable, so-called write-once type CD.
The data is supplied to the encoding circuit 5, is CD-formatted, and is recorded by a recording head (not shown) via the second output terminal OUT2.

【００２４】次に図３とともに本発明のいくつかの態様
について説明する。図３はＣＤの種々のフォーマットを
セクタ単位で示したもので、第１段には通常の音楽用Ｃ
Ｄである、ＣＤ−ＤＡを示し、以下第２段から第６段ま
で各種ＣＤ−ＲＯＭを示している。本発明の実施例とし
ては次の６つの態様がある。なお、ＤＶＤを示す図１７
については後述する。Next, some aspects of the present invention will be described with reference to FIG. FIG. 3 shows various formats of the CD in units of sectors. The first stage is a normal music C.
CD-DA, which is D, and various CD-ROMs from the second stage to the sixth stage are shown below. There are the following six modes as examples of the present invention. Note that FIG. 17 showing a DVD
Will be described later.

【００２５】[0025]

【表１】（１）ＣＤ−ＲＯＭＸＡモード２、フォーム２（図３の６段目）標本化周波数：４４．１ｋＨｚ量子化ビット数：１６ビット（２）ＣＤ−ＲＯＭＸＡモード２、フォーム２（図３の６段目）標本化周波数：８８．２ｋＨｚ量子化ビット数：１６ビット（３）ＣＤ−ＲＯＭモード２（図３の４段目）標本化周波数：４４．１ｋＨｚ量子化ビット数：１６ビット（４）ＣＤ−ＲＯＭモード２（図３の４段目）標本化周波数：８８．２ｋＨｚ量子化ビット数：１６ビット（５）ＤＶＤ（図１７）標本化周波数：４８ｋＨｚ量子化ビット数：１６ビット（６）ＤＶＤ（図１７）標本化周波数：９６ｋＨｚ量子化ビット数：１６ビット [Table 1] (1) CD-ROM XA mode 2, form 2 (6th stage in FIG. 3) Sampling frequency: 44.1 kHz Quantization bit number: 16 bits (2) CD-ROM XA mode 2, form 2 (6th stage of FIG. 3) Sampling frequency: 88.2 kHz Quantization bit number: 16 bits (3) CD-ROM mode 2 (4th stage of FIG. 3) Sampling frequency: 44.1 kHz Quantization bit number: 16 bits (4) CD-ROM mode 2 (4th stage of FIG. 3) Sampling frequency: 88.2 kHz Quantization bit number: 16 bits (5) DVD (FIG. 17) Sampling frequency: 48 kHz Quantization bit number: 16 bits (6) DVD (Fig. 17) Sampling frequency: 96 kHz Quantization bit number: 16 bits

【００２６】ＣＤ−ＲＯＭＸＡモード２、フォーム
２ではユーザデータは２３２４バイトである。また、Ｃ
Ｄ−ＲＯＭモード２では、ユーザデータは２３３６バ
イトである。これらの規格では、比較的ユーザデータの
データ量、すなわちバイト数が多いので、１枚のディス
クに記録収納可能なデータ量が多く、有利である。In the CD-ROM XA mode 2 and form 2, the user data is 2324 bytes. Also, C
In D-ROM mode 2, user data is 2336 bytes. According to these standards, since the data amount of user data, that is, the number of bytes is relatively large, a large amount of data can be recorded and stored in one disk, which is advantageous.

【００２７】また、上記（１）、（２）のＣＤ−ＲＯＭ
ＸＡモード２フォーム２を用いた場合は、独自の
割当てのサブヘッダを規定することができる。サブヘッ
ダの内容を表２に示す。The CD-ROM of the above (1) and (2)
When using XA Mode 2 Form 2, it is possible to specify a subheader with a unique allocation. Table 2 shows the contents of the subheader.

【００２８】[0028]

【表２】 [Table 2]

【００２９】上記サブヘッダ中、サブモードバイトのビ
ット５〜２をこの符号化ＩＤに用いることで、サブヘッ
ダを見ながら、このフォーマットのデコードを行うこと
ができる。以下の表３と表４に、サブヘッダ中のサブモ
ードと、コーディング情報の内容を示す。サブヘッダに
はフォーマット時の条件を記録することができるが、そ
の手法として２つの方法がある。その一つはそのセクタ
のフォーマット条件を入れる方法であり、他の方法はフ
ォーマット条件を複数のセクタに分けて記録する方法で
あり、この場合これら複数のセクタの情報を集合して解
読可能となる。By using bits 5 to 2 of the submode byte in the subheader as the coded ID, it is possible to decode this format while watching the subheader. Tables 3 and 4 below show the submode in the subheader and the content of the coding information. The conditions at the time of formatting can be recorded in the subheader, and there are two methods for that. One of them is a method of inserting the format condition of the sector, and the other is a method of recording the format condition by dividing it into a plurality of sectors. In this case, the information of the plurality of sectors can be collected and decoded. .

【００３０】[0030]

【表３】 [Table 3]

【００３１】[0031]

【表４】 [Table 4]

【００３２】上記４つの態様中、標本化周波数が８８．
２ｋＨｚである、（２）と（４）では、２ブロックで１
フレームを構成することとなる。したがって、４４．１
ｋＨｚの場合と比較して、記録できる時間は半分とな
る。In the above four modes, the sampling frequency is 88.
2 kHz, (2) and (4), 1 in 2 blocks
It constitutes a frame. Therefore, 44.1
The time that can be recorded is halved compared to the case of kHz.

【００３３】上記実施例は、信号処理回路２が可逆圧縮
方式である場合について説明したが、本発明者らが先に
開発したいわゆる準可逆符号化方式のものを適用するこ
とにより、更にデータ量を圧縮することができる。以下
にこの方式について説明する。In the above embodiment, the case where the signal processing circuit 2 is a reversible compression system has been described. However, by applying the so-called quasi-reversible coding system developed by the present inventors, the data amount can be further increased. Can be compressed. This method will be described below.

【００３４】図４は図１の信号処理回路に本発明者らが
開発した音声の準可逆符号化装置を適用する場合の例を
示すブロック図、図５は図４における聴覚心理分析と符
号量調整処理を説明するためのフローチャート、図６は
図４の準可逆符号化装置と従来例における符号量不足時
の再量子化ノイズレベルの比較例を示す説明図、図７は
図４の準可逆符号化装置と従来例における聴感上の音質
比較例を示す説明図である。FIG. 4 is a block diagram showing an example in which the quasi-reversible coding apparatus for speech developed by the present inventors is applied to the signal processing circuit of FIG. 1, and FIG. 5 is a psychoacoustic analysis and code amount in FIG. FIG. 6 is a flow chart for explaining the adjustment process, FIG. 6 is an explanatory view showing a comparative example of the requantization noise level when the code amount is insufficient in the semi-reversible coding apparatus of FIG. 4 and the conventional example, and FIG. It is explanatory drawing which shows the audio quality comparison example in an encoding device and a prior art example.

【００３５】図４に示す装置では先ず、従来の周波数領
域処理のエンコーダと同様に、バッファ２１が後段の窓
掛け・直交変換部２２が直交変換する際に必要なフレー
ム分のＰＣＭ信号をバッファリングし、窓掛け・直交変
換部２２はこのフレームデータに窓掛け（一般にはハニ
ング窓等の窓掛け）し、ＭＤＣＴ（変形離散コサイン変
換）等により直交変換し、この直交変換係数を複数のバ
ンドに分割する。正規化部２３はこのバンド毎の正規化
係数（スケールファクタ）を決定し、バンド内の直交変
換係数を正規化する。量子化・符号化部２４はこの正規
化後の係数を可逆に必要な精度で量子化し、必要であれ
ばエントロピー符号化する。In the apparatus shown in FIG. 4, first, similarly to the conventional frequency domain processing encoder, the buffer 21 buffers the PCM signals for the frames necessary for the windowing / orthogonal transformation unit 22 in the subsequent stage to perform the orthogonal transformation. Then, the windowing / orthogonal transformation unit 22 performs windowing (generally, windowing such as Hanning window) on this frame data, performs orthogonal transformation by MDCT (Modified Discrete Cosine Transform) or the like, and outputs the orthogonal transformation coefficients to a plurality of bands. To divide. The normalization unit 23 determines the normalization coefficient (scale factor) for each band and normalizes the orthogonal transform coefficient within the band. The quantizing / encoding unit 24 quantizes the coefficient after the normalization with a precision necessary for reversibility, and performs entropy coding if necessary.

【００３６】そして、図４の例では、聴覚心理分析部２
５と符号量制御部２６及び量子化・符号化部２４が図５
に示すような処理を行う。図５において、先ず、量子化
・符号化部２４により正規化された係数の１回目の量子
化ビット数（Bit[i]）を決定し、符号量を見積もって総
符号量（Total bit ）を算出する（ステップＳ１）。次
いでそのフレームの使用可能符号量（Avail bit ）を確
認又は算出し（ステップＳ２）、次いで総符号量（Tota
l bit ）と使用可能符号量（Avail bit ）を比較するこ
とにより符号量が不足するか否かをチェックする（ステ
ップＳ３）。In the example of FIG. 4, the psychoacoustic analysis unit 2
5, the code amount control unit 26, and the quantization / encoding unit 24 are shown in FIG.
The following processing is performed. In FIG. 5, first, the first quantization bit number (Bit [i]) of the coefficient normalized by the quantization / encoding unit 24 is determined, the code amount is estimated, and the total code amount (Total bit) is calculated. Calculate (step S1). Next, the usable code amount (Avail bit) of the frame is confirmed or calculated (step S2), and then the total code amount (Tota) is calculated.
It is checked whether or not the code amount is insufficient by comparing l bit) with the usable code amount (Avail bit) (step S3).

【００３７】そして、符号量が不足する場合（Total bi
t ＞Avail bit ）には、先ず、聴覚心理モデルのマスキ
ング効果と最小可聴限特性を考慮してバンドパワーｐ
[i] （＝正規化値² ＝scale[i]² ）からマスキングカー
ブｍ[i] を算出する（ステップＳ４）。この場合、マス
キングカーブｍ[i] は基準カーブcurve[i]とバンドパワ
ーｐ[i] を畳み込み演算することにより得られる。When the code amount is insufficient (Total bi
t> Avail bit), first, the band power p considering the masking effect of the psychoacoustic model and the minimum audible limit characteristic.
A masking curve m [i] is calculated from [i] (= normalized value ² = scale [i] ² ) (step S4). In this case, the masking curve m [i] is obtained by convolving the reference curve curve [i] and the band power p [i].

【００３８】次いで最小可聴限とマスキングカーブから
各バンドの標準ノイズレベルＮ[i]を算出し（ステップ
Ｓ５）、次いで標準ノイズレベルＮ[i] が高いバンドか
ら１ビットずつビット削減を行うことにより不足符号量
を各バンドに振り分ける。但し、バンドｉにおいて１ビ
ット削減を行う毎にＮ[i]から６．０を減算し、ビット
削減が標準ノイズレベルＮ[i] と相似形になるようにす
る（ステップＳ６）。そして、このように各バンド毎に
最終的に決定された量子化ビット数で、量子化・符号化
部２４で再量子化及び符号化する（ステップＳ７）。Next, the standard noise level N [i] of each band is calculated from the minimum audibility limit and the masking curve (step S5), and then the bit is reduced bit by bit from the band having the highest standard noise level N [i]. The insufficient code amount is distributed to each band. However, 6.0 is subtracted from N [i] every time one bit is reduced in band i so that the bit reduction is similar to the standard noise level N [i] (step S6). Then, the quantization / encoding unit 24 requantizes and encodes with the number of quantization bits finally determined for each band in this way (step S7).

【００３９】また、ステップＳ３において符号量が不足
しない場合には、余剰ビットを各バンドに割り当て又は
パディングし（ステップＳ８）、その量子化ビット数
で、量子化・符号化部２４で再量子化及び符号化する
（ステップＳ７）。フォーマット化出力部２７は一般
に、正規化係数（場合によっては量子化ビット数）と、
符号量制御部２６の符号量制御情報と、それにヘッダ等
の補助情報を付加してフォーマット化（ビットストリー
ム化）して伝送する。If the code amount is not insufficient in step S3, surplus bits are assigned or padded to each band (step S8), and requantization is performed by the quantization / encoding unit 24 with the number of quantization bits. And encoding (step S7). The formatted output unit 27 generally includes the normalization coefficient (the number of quantization bits in some cases),
The code amount control information of the code amount control unit 26 and auxiliary information such as a header are added to the code amount control information to format (bit stream) and transmit.

【００４０】図６は図４の例と、従来のエンコーダにお
いて符号量不足時の再量子化ノイズレベルの設定例を比
較した場合を示している。上記例によれば、再量子化ノ
イズ聴覚心理モデルに応じてシェーピングされており、
ノイズ量が同じであっても聴感上ではノイズレベルが下
がった場合と同等の効果を得ることができる。したがっ
て、聴感上の音質劣化を最小限にして準可逆的に符号化
することができる。FIG. 6 shows a comparison between the example of FIG. 4 and an example of setting the requantization noise level when the code amount is insufficient in the conventional encoder. According to the above example, the requantization noise is shaped according to the psychoacoustic model,
Even if the noise amount is the same, the same effect as when the noise level is lowered can be obtained in terms of hearing. Therefore, it is possible to perform quasi-reversible encoding while minimizing the sound quality deterioration in hearing.

【００４１】図７は従来例で非可逆符号化を行った場合
と、上記例の場合の音質の比較例を示し、図７（ａ）は
フレームの一部が非可逆となる場合、図７（ｂ）はフレ
ームの大部分が非可逆となる場合を示す。図のように非
可逆となる区間において太線で示す本発明の方が細線で
示す従来例より音質を改善することができ、したがっ
て、符号化全体として安定した音質を得ることができ
る。また、本発明によれば、非可逆符号化を行った場合
の音質を十分確保することができるので、各フレームの
使用可能符号量が一定の「固定伝送レート」で伝送する
ことができ、したがって、非可逆フレームが大幅に増加
しても音質上の問題は発生しない。この結果、オーサリ
ングや再生装置側の符号量制御に関わる処理を大幅に簡
略化することができる。FIG. 7 shows a comparative example of the sound quality between the case of lossy encoding in the conventional example and the case of the above example, and FIG. 7A shows the case where a part of the frame is lossy. (B) shows a case where most of the frame is irreversible. In the irreversible section as shown in the figure, the present invention shown by the thick line can improve the sound quality as compared with the conventional example shown by the thin line, and therefore stable sound quality can be obtained as a whole encoding. Further, according to the present invention, it is possible to sufficiently secure the sound quality when the lossy encoding is performed, and therefore, the usable code amount of each frame can be transmitted at a fixed “fixed transmission rate”. , Even if the number of irreversible frames is greatly increased, the sound quality problem does not occur. As a result, the processing relating to the authoring and the code amount control on the reproducing device side can be greatly simplified.

【００４２】次に、信号処理回路の他の例について図８
乃至図１４に沿って説明する。図８は図４同様、本発者
らが開発した音声の準可逆符号化装置の他の例を示すブ
ロック図、図９は図８における符号量補正値を算出する
処理を説明するためのフローチャート、図１０は符号量
偏差と符号量補正値の関係を示すグラフ、図１１〜図１
３は符号量補正前と補正後の符号量偏差ヒストグラムを
示す説明図、図１４は図８における聴覚心理分析と符号
量調整処理を説明するためのフローチャート、図１５は
図８の準可逆符号化装置と従来例における符号量過剰時
の再量子化ノイズレベルの比較例を示す説明図、図１６
は図８の準可逆符号化装置と従来例における聴感上の音
質比較例を示す説明図である。Next, another example of the signal processing circuit is shown in FIG.
It will be described with reference to FIG. Similar to FIG. 4, FIG. 8 is a block diagram showing another example of the quasi-reversible encoding device for speech developed by the present inventors, and FIG. 9 is a flowchart for explaining the process for calculating the code amount correction value in FIG. 10 is a graph showing the relationship between the code amount deviation and the code amount correction value, FIGS.
3 is an explanatory view showing the code amount deviation histogram before and after the code amount correction, FIG. 14 is a flowchart for explaining the psychoacoustic analysis and the code amount adjustment processing in FIG. 8, and FIG. 15 is the semi-reversible encoding in FIG. 16 is an explanatory view showing a comparative example of the requantization noise level when the code amount is excessive between the apparatus and the conventional example, FIG.
FIG. 9 is an explanatory diagram showing a comparative example of auditory sound quality between the semi-reversible encoding device of FIG. 8 and a conventional example.

【００４３】図８に示す装置では、先ず、従来の周波数
領域処理のエンコーダと同様に、バッファ２１が後段の
窓掛け・直交変換部２２が直交変換する際に必要なフレ
ーム分のＰＣＭ信号をバッファリングし、窓掛け・直交
変換部２２はこのフレームデータに窓掛け（一般にはハ
ニング窓等の窓掛け）し、ＭＤＣＴ（変形離散コサイン
変換）等により直交変換し、この直交変換係数を複数の
バンドに分割する。正規化部２３はこのバンド毎の正規
化係数（スケールファクタ）を決定し、バンド内の直交
変換係数を正規化する。量子化・符号化部２４はこの正
規化後の係数を可逆に必要な精度で量子化し、この場合
にも必要であればエントロピー符号化する。但し、図１
１に示す時間領域処理の場合よりエントロピ符号化の効
果は一般に少ない。In the apparatus shown in FIG. 8, first, similar to the conventional frequency domain processing encoder, the buffer 21 buffers the PCM signals for the frames necessary for the windowing / orthogonal transform unit 22 in the subsequent stage to perform the orthogonal transform. Then, the windowing / orthogonal transformation unit 22 performs windowing (generally, windowing such as Hanning window) on the frame data, performs orthogonal transformation by MDCT (Modified Discrete Cosine Transform), etc., and obtains the orthogonal transformation coefficients in a plurality of bands. Split into. The normalization unit 23 determines the normalization coefficient (scale factor) for each band and normalizes the orthogonal transform coefficient within the band. The quantizing / encoding unit 24 quantizes the coefficient after the normalization with an accuracy required for reversibility, and also in this case, performs entropy encoding if necessary. However, FIG.
The effect of entropy coding is generally less than that of the time domain processing shown in FIG.

【００４４】そして、この例では、聴覚心理分析部２５
と符号量制御部２６及び量子化・符号化部２４が区間毎
の符号量補正値Adj に基づいて以下のような処理を行
う。先ず、本発明では、オーディオメディアを制作する
場合に、１曲（例えば４〜６分）又は全曲（例えば４０
〜７４分）等の長時間平均で符号量が目標値になるよう
に制御する方法であり、エンコード処理は２パスで行
う。具体的には、（ａ）可逆符号化を仮定した１回目のエンコード処理を
行う。但し、各区間の使用符号量が得られればよく、実
際に量子化・符号化を行う必要はない。（ｂ）図９に示すように各区間の使用符号量と目標符号
量の差から各区間の符号量補正値Adj を算出する。（ｃ）２回目のエンコード処理を行う。この場合、可逆
符号化を仮定したビット割り当てを補正符号量と聴覚心
理モデルにより変更して量子化・符号化を行い、また、
ビット割り当て変更の情報を補助情報としてデコーダに
伝送する。In this example, the psychoacoustic analysis unit 25
The code amount control unit 26 and the quantization / encoding unit 24 perform the following processing based on the code amount correction value Adj for each section. First, in the present invention, when producing an audio medium, one song (for example, 4 to 6 minutes) or all songs (for example, 40 songs).
(~ 74 minutes) and the like so that the code amount is controlled to a target value on a long-term average, and the encoding process is performed in two passes. Specifically, (a) the first encoding process is performed assuming lossless encoding. However, it is only necessary to obtain the used code amount of each section, and it is not necessary to actually perform quantization / encoding. (B) As shown in FIG. 9, the code amount correction value Adj of each section is calculated from the difference between the used code amount of each section and the target code amount. (C) Perform the second encoding process. In this case, bit allocation assuming reversible coding is changed by the correction code amount and the psychoacoustic model to perform quantization / coding, and
The bit allocation change information is transmitted to the decoder as auxiliary information.

【００４５】次に、図９を参照して上記（ｂ）における
符号量補正値Adj を算出する処理について説明する。先ず、対象区間の使用符号量を入力して平均符号量Ｔ
ｍを算出し、目標符号量Ｔｄとの差を評価する（ステッ
プＳ１１、Ｓ１２）。次いで、符号量過剰な場合（平均符号量Ｔｍ＞目標符
号量Ｔｄ）には、各区間の使用符号量と目標符号量との
偏差Delta[bit]（但し、過剰な場合に正）を算出し、こ
の偏差Delta[bit]を適当なステップ幅step[bit] で量子
化し、ヒストグラムを作成する（ステップＳ１２→Ｓ１
３）。次いで、ヒストグラムの偏差が負の領域の偏差総量Ｓ
ｍと、正の領域の偏差総量Ｓｐを以下のように算出する
（ステップＳ１４）。Next, the process for calculating the code amount correction value Adj in (b) above will be described with reference to FIG. First, the average code amount T is input by inputting the used code amount of the target section.
m is calculated, and the difference from the target code amount Td is evaluated (steps S11 and S12). Next, when the code amount is excessive (average code amount Tm> target code amount Td), the deviation Delta [bit] between the used code amount and the target code amount of each section is calculated (however, positive when excess). , The deviation Delta [bit] is quantized with an appropriate step width step [bit] to create a histogram (steps S12 → S1).
3). Next, the total deviation S in the area where the deviation of the histogram is negative
m and the total deviation Sp of the positive region are calculated as follows (step S14).

【００４６】[0046]

【数１】 (Equation 1)

【００４７】次いで、負の領域の偏差総量Ｓｍの比率
Ｓｍ／（Ｓｍ＋Ｓｐ）が予め定めた値Bound （例えば
０．３３等）より大きい場合には、以下のように各区間
毎の符号量補正値Adj を求める（ステップＳ１５→Ｓ１
６）。Next, when the ratio Sm / (Sm + Sp) of the total deviation Sm in the negative region is larger than a predetermined value Bound (for example, 0.33), the code amount correction value for each section is as follows. Find Adj (step S15 → S1)
6).

【００４８】[0048]

【数２】 (Equation 2)

【００４９】’他方、比率Ｓｍ／（Ｓｍ＋Ｓｐ）が値
Bound より小さい場合には、比率Ｓｍ／（Ｓｍ＋Ｓｐ）
が値Bound より大きくなるようにヒストグラムのオフセ
ット値Off を決定し（ステップＳ１５→Ｓ１７）、以下
のように各区間毎の符号量補正値Adj を求める（ステッ
プＳ１８）。On the other hand, the ratio Sm / (Sm + Sp) is a value
If smaller than Bound, the ratio Sm / (Sm + Sp)
Is determined to be larger than the value Bound (step S15 → S17), and the code amount correction value Adj for each section is calculated as follows (step S18).

【００５０】[0050]

【数３】ここで、この手法を用いる理由は、ヒストグラムが極端
に「過剰」側に偏っている場合には、ある程度全フレー
ムにオフセットを掛けて補正する必要があるからであ
る。(Equation 3) Here, the reason for using this method is that when the histogram is extremely biased toward the "excessive" side, it is necessary to apply an offset to all frames to correct it.

【００５１】’また、ステップＳ１２において平均符
号量Ｔｍ＞目標符号量Ｔｄでない場合には、平均符号量
Ｔｍと目標符号量Ｔｄに基づいて以下のように各区間で
一定の符号量補正値Adj を求める（ステップＳ１９）。 Adj ＝（Ｔｄ−Ｔｍ） [bit]If the average code amount Tm> the target code amount Td is not satisfied in step S12, a constant code amount correction value Adj in each section is set as follows based on the average code amount Tm and the target code amount Td. Obtained (step S19). Adj = (Td-Tm) [bit]

【００５２】図１０は符号量偏差Delta[bit]と符号量補
正値Adj の関係を示し、偏差Delta[bit]が正であって大
きい程、補正値Adj も増大する。また、図１１〜図１３
は符号量補正前（実線）と補正後（破線）のヒストグラ
ムを示し、横軸がサンプル当たりの偏差（Delta ／区間
当たりのサンプル数）を、また、縦軸が度数を示す。詳
しくは図１１は上記のように補正値Adj を求めた場
合、また、図１２、図１３はそれぞれ上記 ’、’
のように補正値Adj を求めた場合を示している。FIG. 10 shows the relationship between the code amount deviation Delta [bit] and the code amount correction value Adj. As the deviation Delta [bit] is positive and larger, the correction value Adj also increases. Also, FIGS.
Shows histograms before correction (solid line) and after correction (broken line), the horizontal axis shows deviation per sample (Delta / number of samples per section), and the vertical axis shows frequency. Specifically, FIG. 11 shows the case where the correction value Adj is obtained as described above, and FIG. 12 and FIG.
The case where the correction value Adj is obtained is shown as.

【００５３】次に、図１４を参照して聴覚心理分析と符
号量調整処理を説明する。図１４において、先ず、量子
化・符号化部２４により正規化された係数の１回目（可
逆方式）の量子化ビット数（Bit[i]）を決定し、符号量
を見積もって総符号量（Total bit ）を算出する（ステ
ップＳ２１）。次いでそのフレームの符号量補正値Adj
を読み込み（ステップＳ２２）、補正値Adj が負（Adj
＜０）か否かをチェックする（ステップＳ２３）。Next, the psychoacoustic analysis and the code amount adjustment processing will be described with reference to FIG. In FIG. 14, first, the number of quantization bits (Bit [i]) for the first time (reversible method) of the coefficients normalized by the quantization / encoding unit 24 is determined, and the total amount of code ( Total bit) is calculated (step S21). Next, the code amount correction value Adj for that frame
(Step S22), the correction value Adj is negative (Adj
It is checked whether or not <0) (step S23).

【００５４】そして、補正値Adj が負の場合（符号量削
減）には、先ず、聴覚心理モデルのマスキング効果と最
小可聴限特性を考慮してバンドパワーｐ[i] （＝正規化
係数2 ＝scale[i]² ）からマスキングカーブｍ[i] を算
出する（ステップＳ４）。この場合、マスキングカーブ
ｍ[i] は基準カーブcurve[i]とバンドパワーｐ[i] を畳
み込み演算することにより得られる。When the correction value Adj is negative (code amount reduction
First, the masking effect of the psychoacoustic model and the maximum
Band power p [i] (= normalization considering small audible limit characteristics
coefficient2 ＝ scale [i]^Two ) Calculate masking curve m [i]
Take out (step S4). In this case, the masking curve
m [i] is the standard curve curve [i] and band power p [i]
It is obtained by performing a calculation.

【００５５】次いで最小可聴限とマスキングカーブから
各バンドの標準ノイズレベルＮ[i]を算出し（ステップ
Ｓ５）、次いで標準ノイズレベルＮ[i] が高いバンドか
ら１ビットずつビット削減を行うことにより符号量補正
値を各バンドに振り分ける。但し、バンドｉにおいて１
ビット削減を行う毎にＮ[i] から６．０を減算し、ビッ
ト削減が標準ノイズレベルＮ[i] と相似形になるように
する（ステップＳ６）。そして、このように各バンド毎
に最終的に決定された量子化ビット数で、量子化・符号
化部２４で再量子化及び符号化する（ステップＳ７）。Next, the standard noise level N [i] of each band is calculated from the minimum audibility limit and the masking curve (step S5), and then bit reduction is performed bit by bit from the band having the highest standard noise level N [i]. The code amount correction value is assigned to each band. However, 1 in band i
Each time bit reduction is performed, 6.0 is subtracted from N [i] so that the bit reduction becomes similar to the standard noise level N [i] (step S6). Then, the quantization / encoding unit 24 requantizes and encodes with the number of quantization bits finally determined for each band in this way (step S7).

【００５６】また、ステップＳ２３において補正値Adj
が負でない場合（符号量増加）には、余剰ビットを各バ
ンドに割り当て又はパディングし（ステップＳ８）、そ
の量子化ビット数で、量子化・符号化部２４で再量子化
及び符号化する（ステップＳ７）。フォーマット化出力
部２７は一般に、正規化係数（場合によっては量子化ビ
ット数）と、符号量制御部２６の符号量制御情報と、そ
れにヘッダ等の補助情報を付加してフォーマット化（ビ
ットストリーム化）して伝送する。Further, in step S23, the correction value Adj
If is not negative (increased code amount), surplus bits are assigned or padded to each band (step S8), and the quantization / encoding unit 24 requantizes and encodes with the number of quantization bits ( Step S7). The formatting output unit 27 generally adds a normalization coefficient (the number of quantization bits in some cases), the code amount control information of the code amount control unit 26, and auxiliary information such as a header to the formatting (bitstream conversion). ) And transmit.

【００５７】したがって、上記例によれば、算術エント
ロピーが大きく、聴感エントロピーが小さい区間ほど、
より多くの符号量補正（削減）を受けることになり、聴
感に対応した符号量配分を行うことができる。また、図
１５は図８の装置と、従来のエンコーダにおいて符号量
過剰時の再量子化ノイズレベルの設定例を比較した場合
を示し、この例によれば、非可逆符号化されるフレーム
においても再量子化ノイズ聴覚心理モデルに応じてシェ
ーピングされており、ノイズ量が同じであっても聴感上
ではノイズレベルが下がった場合と同等の効果を得るこ
とができる。したがって、聴感上の音質劣化を最小限に
して準可逆的に符号化することができる。Therefore, according to the above example, the interval where the arithmetic entropy is large and the auditory entropy is small,
Since more code amount correction (reduction) is performed, the code amount can be distributed according to the sense of hearing. Further, FIG. 15 shows a case where the apparatus of FIG. 8 is compared with an example of setting the requantization noise level when the code amount is excessive in the conventional encoder. According to this example, even in a frame to be lossy encoded. Requantization noise is shaped according to the psychoacoustic model, and even if the noise amount is the same, it is possible to obtain the same effect as when the noise level is lowered in the sense of hearing. Therefore, it is possible to perform quasi-reversible encoding while minimizing the sound quality deterioration in hearing.

【００５８】図１６は従来例において非可逆符号化を行
った場合と、上記図８の例の場合の音質の比較例を示
し、図１６（ａ）はフレームの一部が非可逆となる場
合、図１６（ｂ）はフレームの大部分が非可逆となる場
合を示す。図のように非可逆となる区間において太線で
示す本発明の方が細線で示す従来例より音質を改善する
ことができ、したがって、符号化全体として安定した音
質を得ることができる。FIG. 16 shows a comparative example of the sound quality between the case where lossy encoding is performed in the conventional example and the case of the example of FIG. 8 above. FIG. 16A shows the case where a part of the frame is lossy. 16 (b) shows a case where most of the frame is irreversible. In the irreversible section as shown in the figure, the present invention shown by the thick line can improve the sound quality as compared with the conventional example shown by the thin line, and therefore stable sound quality can be obtained as a whole encoding.

【００５９】次に本発明によるデータ圧縮率がどの程度
であるかについて検討する。表５は音楽の３つのジャン
ル別に、圧縮効果を実測した結果を示したものである。
なお、表中の１段目の１６４４はビット数が１６で、標
本化周波数が４４．１ｋＨｚｚであることを示してい
る。各ジャンルにおいて、５乃至１０曲を選定して調査
した。Next, the data compression rate according to the present invention will be examined. Table 5 shows the results of actually measuring the compression effect for each of the three music genres.
Note that the first stage 1644 in the table indicates that the number of bits is 16 and the sampling frequency is 44.1 kHzz. In each genre, 5 to 10 songs were selected and surveyed.

【００６０】[0060]

【表５】 [Table 5]

【００６１】表５中の数字は基のデータ量を１００とし
たときの圧縮後のデータ量を示している。この表から分
るように、例えば、１６４４をクラシック音楽に適用す
ると、平均で５０％、最大で６０％の圧縮が可能である
ことが分る。ポップスやジャズ・フュージョンではクラ
シック程の圧縮はできないが、平均的に２３％から３５
％の圧縮率が得られる。The numbers in Table 5 indicate the data amount after compression when the base data amount is 100. As can be seen from this table, for example, when 1644 is applied to classical music, it is possible to compress 50% on average and 60% at maximum. It's not possible to compress as much as classical music in pops and jazz fusion, but on average 23% to 35%
% Compression ratio is obtained.

【００６２】表５に示した圧縮の効果は、図４及び図８
に示した準可逆符号化装置を用いない場合のものであ
り、準可逆符号化装置を用いることにより、さらに圧縮
率を高くすることができる。The effect of compression shown in Table 5 is shown in FIGS.
This is a case where the quasi-lossless coding device shown in is not used, and the compression rate can be further increased by using the quasi-lossless coding device.

【００６３】上記本発明の実施例の４つの態様は、その
いずれかを選択できるように、本発明の圧縮記録装置又
は圧縮装置の使用者が手動で図示省略のセレクトボタン
などを操作することにより、切り替えて使用できる構成
とすることができる。なお、標本化周波数を４４．１ｋ
Ｈｚより高く設定した場合は、４４．１ｋＨｚのときの
一定線速度より更に速い線速度となるよう、ディスクの
回転数を制御する必要がある。標本化周波数を４４．１
ｋＨｚより高く設定した場合は、高域の周波数特性が改
善され、高音質化が図られる。図１７はＤＶＤのフォー
マットを図３と同様にセクタ単位で示すデータ配置模式
図である。図１７に示されるように、ＤＶＤでは通常１
パックが２０４８バイト（１論理セクタ）で構成され、
その中のパケット（ユーザデータ）２０３４バイトが利
用できる。なお、図１７において、「パックスタート」
は同期信号となるＳＹＮＣパターンを有し、「ＳＣＲ」
は時間情報であるシステム・クロック・レファレンスで
あり、「Ｍｕｘｒａｔｅ」は転送レート（マルチプレ
クシングレート）であり、「パケット（ユーザデー
タ）」はパケットヘッダとデータなどからなる。According to the four modes of the embodiment of the present invention described above, a user of the compression recording apparatus or the compression apparatus of the present invention manually operates a select button (not shown) or the like so that any one of them can be selected. , Can be switched and used. The sampling frequency is 44.1k.
When it is set higher than Hz, it is necessary to control the rotational speed of the disk so that the linear velocity becomes higher than the constant linear velocity at 44.1 kHz. Sampling frequency is 44.1
When the frequency is set higher than kHz, the high frequency characteristics are improved and the sound quality is improved. FIG. 17 is a schematic data layout diagram showing the format of a DVD in sector units as in FIG. As shown in FIG. 17, it is usually 1 for DVD.
The pack consists of 2048 bytes (1 logical sector),
2034 bytes of the packet (user data) in it can be used. In addition, in FIG. 17, "pack start"
Has a SYNC pattern that serves as a synchronization signal, and "SCR"
Is a system clock reference which is time information, "Mux rate" is a transfer rate (multiplexing rate), and "packet (user data)" is composed of a packet header and data.

【００６４】[0064]

【発明の効果】以上説明したように本発明によれば、オ
ーディオ信号を量子化ビット数１６ビット、標本化周波
数４４．１ｋＨｚ又はそれ以上の周波数で量子化し、量
子化された所定量の量子化データ毎に直交変換を適用し
てデータ量を圧縮し、圧縮されたデータをＣＤ−ＲＯＭ
ＸＡ規格のモード２、フォーム２のユーザデータ領域あ
るいはＣＤ−ＲＯＭ規格のモード２のユーザデータ領域
Ｍに配するようフォーマるいはＣＤ−ＲＯＭ規格のモー
ド２のユーザデータ領域、あるいはＤＶＤのユーザデー
タ領域に配するようフォーマッティングしているので、
音声信号を高圧縮率で圧縮して、ＣＤ−ＲＯＭやＤＶＤ
に記録することができる。したがって、音楽の内容によ
っては、ＣＤ−ＤＡより記録時間を長くすることも可能
である。また、新規格のＤＶＤオーディオを実現可能と
している。As described above, according to the present invention, an audio signal is quantized with a quantization bit number of 16 bits and a sampling frequency of 44.1 kHz or more, and a predetermined amount of quantized quantization is performed. Apply orthogonal transformation to each data to compress the data amount and compress the compressed data to CD-ROM
The format or the user data area of the CD-ROM standard or the user data area of the CD-ROM standard or the DVD or the user data area of the mode 2 of the XA standard or form 2 or the CD-ROM standard. Since it is formatted to be placed in
Audio signals are compressed at a high compression rate, CD-ROM or DVD
Can be recorded. Therefore, depending on the content of the music, the recording time can be longer than that of the CD-DA. In addition, the new standard DVD audio can be realized.

[Brief description of the drawings]

【図１】本発明のオーディオ信号圧縮記録装置の好まし
い実施例を示すブロック図である。FIG. 1 is a block diagram showing a preferred embodiment of an audio signal compression recording apparatus of the present invention.

【図２】図１中の信号処理回路２の一例を示すブロック
図である。FIG. 2 is a block diagram showing an example of a signal processing circuit 2 in FIG.

【図３】ＣＤの種々のフォーマットをセクタ単位で示し
たデータ配置摸式図である。FIG. 3 is a schematic diagram of data arrangement showing various formats of a CD in units of sectors.

【図４】図１中の信号処理回路２の他の例としての音声
の準可逆符号化装置の一例を示すブロック図である。FIG. 4 is a block diagram showing an example of a quasi-lossless coding apparatus for speech as another example of the signal processing circuit 2 in FIG.

【図５】図４における聴覚心理分析と符号量調整処理を
説明するためのフローチャートである。5 is a flowchart for explaining the psychoacoustic analysis and code amount adjustment processing in FIG.

【図６】図４の準可逆符号化装置と従来例における符号
量不足時の再量子化ノイズレベルの比較例を示す説明図
である。FIG. 6 is an explanatory diagram showing a comparative example of requantization noise levels when the code amount is insufficient in the semi-reversible encoding device of FIG. 4 and a conventional example.

【図７】図４の準可逆符号化装置と従来例における聴感
上の音質比較例を示す説明図である。FIG. 7 is an explanatory diagram showing a comparative example of auditory sound quality between the semi-reversible encoding device of FIG. 4 and a conventional example.

【図８】図１中の信号処理回路２のさらに他の例として
の音声の準可逆符号化装置の一例を示すブロック図であ
る。8 is a block diagram showing an example of a quasi-reversible coding apparatus for speech as still another example of the signal processing circuit 2 in FIG.

【図９】図８における符号量補正値を算出する処理を説
明するためのフローチャートである。9 is a flowchart for explaining a process of calculating a code amount correction value in FIG.

【図１０】符号量偏差と符号量補正値の関係を示すグラ
フである。FIG. 10 is a graph showing the relationship between code amount deviation and code amount correction value.

【図１１】符号量補正前と補正後の符号量偏差ヒストグ
ラムを示す説明図である。FIG. 11 is an explanatory diagram showing histograms of code amount deviations before and after code amount correction.

【図１２】符号量補正前と補正後の符号量偏差ヒストグ
ラムを示す説明図である。FIG. 12 is an explanatory diagram showing histograms of code amount deviations before and after code amount correction.

【図１３】符号量補正前と補正後の符号量偏差ヒストグ
ラムを示す説明図である。FIG. 13 is an explanatory diagram showing code amount deviation histograms before and after code amount correction.

【図１４】図８における聴覚心理分析と符号量調整処理
を説明するためのフローチャートである。FIG. 14 is a flowchart for explaining the psychoacoustic analysis and code amount adjustment processing in FIG.

【図１５】図８の準可逆符号化装置と従来例における符
号量不足時の再量子化ノイズレベルの比較例を示す説明
図である。FIG. 15 is an explanatory diagram showing a comparative example of the requantization noise level when the code amount is insufficient in the semi-reversible encoding device of FIG. 8 and the conventional example.

【図１６】図８の準可逆符号化装置と従来例における聴
感上の音質比較例を示す説明図である。16 is an explanatory diagram showing an audio quality comparison example between the semi-reversible encoding apparatus of FIG. 8 and a conventional example.

【図１７】ＤＶＤのフォーマットをセクタ単位で示した
データ配置摸式図である。FIG. 17 is a schematic diagram of data arrangement showing a DVD format in sector units.

[Explanation of symbols]

１Ａ／Ｄ変換回路（量子化手段）２信号処理部（メモリ３とともにデータ圧縮手段を構
成する）３メモリ４ＣＤ−ＲＯＭ符号化回路（フォーマッティング手
段）５ＣＤ符号化回路１０直交変換回路１１正規化部２２窓掛け・直交変換部２３正規化部２４量子化・符号化部２５聴覚心理分析部２６符号量制御部２７フォーマット化出力部ＩＮ入力端子ＯＵＴ１、ＯＵＴ２出力端子DESCRIPTION OF SYMBOLS 1 A / D conversion circuit (quantization means) 2 Signal processing part (it comprises a data compression means with the memory 3) 3 Memory 4 CD-ROM encoding circuit (formatting means) 5 CD encoding circuit 10 Orthogonal conversion circuit 11 Normal Conversion unit 22 Windowing / Orthogonal transformation unit 23 Normalization unit 24 Quantization / coding unit 25 Auditory psychological analysis unit 26 Code amount control unit 27 Formatting output unit IN input terminals OUT1 and OUT2 output terminals

Claims

[Claims]

1. Quantizing means for quantizing an audio signal at a quantization bit number of 16 bits and a sampling frequency of 44.1 kHz or higher, and a predetermined amount of quantized data quantized by the quantizing means. Data compression means for applying orthogonal transformation for each data to compress the data amount, formatting means for formatting the data compressed by the data compression means so as to be arranged in the user data area of the digital disc, and formatting by the formatting means. And a means for recording the recorded data as a CD format on a recording medium.

2. Quantizing means for quantizing an audio signal with a quantization bit number of 16 bits and a sampling frequency of 44.1 kHz or higher, and a predetermined amount of quantized data quantized by the quantizing means. An audio signal compression apparatus comprising: a data compression unit that applies an orthogonal transformation for each data to compress a data amount; and a formatting unit that formats the data compressed by the data compression unit so as to be arranged in a user data area of a digital disc. .

3. An audio signal is quantized with a quantization bit number of 16 bits and a sampling frequency of 44.1 kHz or higher, and an orthogonal transform is applied to each quantized predetermined amount of quantized data to obtain a data amount. An optical recording medium in which the compressed data is formatted, the compressed data is formatted so as to be arranged in a user data area of a digital disc, and the formatted data is recorded in a CD format.

4. Quantizing means for quantizing an audio signal at a quantization bit number of 16 bits and a sampling frequency of 44.1 kHz or higher, and a predetermined amount of quantized data quantized by the quantizing means. A data compression unit for compressing the data amount by applying orthogonal transformation for each of the data; and a CD-ROM for storing the data compressed by the data compression unit.
XA standard mode 2, form 2, or CD-RO
An audio signal compression recording apparatus comprising: formatting means for formatting so as to be arranged in a user data area of M standard mode 2; and means for recording the data formatted by the formatting means in a recording medium as a CD format.

5. Quantizing means for quantizing an audio signal at a quantization bit number of 16 bits and a sampling frequency of 44.1 kHz or higher, and a predetermined amount of quantized data quantized by the quantizing means. A data compression unit for compressing the data amount by applying orthogonal transformation for each of the data; and a CD-ROM for storing the data compressed by the data compression unit.
XA standard mode 2, form 2, or CD-RO
An audio signal compression apparatus, comprising: formatting means for formatting so as to be arranged in a user data area of mode 2 of M standard.

6. An audio signal is quantized with a quantization bit number of 16 bits and a sampling frequency of 44.1 kHz or higher, and orthogonal transform is applied to each quantized predetermined amount of quantized data to obtain a data amount. A compressed optical disk, and the compressed data is formatted so as to be arranged in the user data area of the CD-ROM XA standard mode 2, the form 2 or the CD-ROM standard mode 2, and the formatted data is recorded in the CD format. recoding media.

7. Quantizing means for quantizing an audio signal at a quantization bit number of 16 bits and a sampling frequency of 44.1 kHz or higher, and a predetermined amount of quantized data quantized by the quantizing means. A data compression unit that applies orthogonal transformation for each data to compress the data amount, a formatting unit that formats the data compressed by the data compression unit so as to be arranged in the user data area of the DVD, and a format that is formatted by the formatting unit. A device for recording data in a recording medium as a CD format, the audio signal compression recording device.

8. The audio signal is a two-channel signal, and the data compression means is 2 ^m per channel.
The audio signal compression recording according to any one of claims 1, 4, and 7, wherein the orthogonal transformation is applied to each piece (m is a positive integer) of quantized data to compress the data amount. apparatus.

9. A means for the data compressing means to frame the audio signal for each predetermined section length, and a code amount required to encode the signal in the frame by a reversible method, A code amount control means for comparing with the usable code amount, a psychoacoustic analysis means for analyzing the signal in the frame by a psychoacoustic model, and a signal in the frame is reversible when the frame code amount is less than or equal to the usable code amount. Irreversible quantizing means for quantizing the signal in the frame by an irreversible method based on the output of the psychoacoustic analyzing means when the frame code quantity exceeds the usable code quantity. Item 9. The audio signal compression recording device according to any one of items 1, 4, 7, and 8.

10. A framing means for framing the audio signal for each predetermined section length by the data compression means, and calculating a difference between a target code amount and an actual code amount by a reversible method for all sections to be encoded. , A code amount correction value calculation means for calculating a correction value according to the excess or deficiency of the code amount for each section, a psychoacoustic analysis means for analyzing a signal in a frame with a psychoacoustic model, and an average code quantity for all sections Irreversible quantization in which the signal in each section is quantized by a reversible method based on the code amount correction value so that the target code amount becomes, or irreversibly quantized based on the output of the psychoacoustic analysis means. An audio signal compression recording apparatus according to any one of claims 1, 4, 7, and 8, further comprising:

11. The data compression means comprises orthogonal transformation means for orthogonally transforming the quantized signal, band division means for dividing the data orthogonally transformed by the orthogonal transformation means into a plurality of bands, and the band. 11. The normalizing means capable of responding to the output signal of the dividing means, and the selecting means for supplying the band-divided data by the band dividing means to the normalizing means for each band.
The audio signal compression recording device according to any one of 1.

12. The number of quantization bits of an audio signal is 16
Bits, a quantizing means for quantizing at a sampling frequency of 44.1 kHz or higher, and orthogonal transformation for each predetermined quantity of quantized data quantized by the quantizing means to compress the data quantity. An audio signal compression apparatus comprising: a data compression unit; and a formatting unit that formats the data compressed by the data compression unit so as to be arranged in a user data area of a DVD.

13. An audio signal having a quantization bit number of 16
Bit, sampling frequency is quantized at a frequency of 44.1 kHz or higher, orthogonal transformation is applied to each quantized predetermined amount of quantized data to compress the data amount, and the compressed data is used as DVD user data. Format the data so that it will be placed in the area, and put the formatted data on a CD.
An optical recording medium recorded as a format.