JPH08125544A

JPH08125544A - Method/device for compressing digital signal and recording medium

Info

Publication number: JPH08125544A
Application number: JP26501294A
Authority: JP
Inventors: Hiroyuki Suzuki; 浩之鈴木
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1994-10-28
Filing date: 1994-10-28
Publication date: 1996-05-17
Anticipated expiration: 2017-10-15
Also published as: JP3334375B2

Abstract

PURPOSE: To prevent the efficiency of block floating and quantization from deteriorating by controlling the frequency size of the small blocks to which floating is applied according to the characteristics of the input signal on the frequency axis, so that optimum floating is performed. CONSTITUTION: Spectrum data supplied to an input terminal 301 is sent to a change quantity calculation circuit 303, which calculates the change quantity for each frequency and sends it to an integrating comparison circuit 304. The circuit 304 integrates the change quantity data and compares the result with a threshold; the frequency at which the integrated value exceeds the threshold is sent to a block floating unit decision circuit 305. A unit correction circuit 306 corrects the boundaries of the block floating units supplied from the circuit 305, using the outputs of an energy calculation circuit 307 and a standard unit output circuit 309, and outputs the result to an adaptive bit encoding circuit. Using the output of the circuit 307, the circuit 306 evaluates the quantity of auxiliary data in the corrected units, the efficiency of block floating, and the quantity of information required for quantization, and thereby decides whether to adopt the units.
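As an illustration of the boundary search described in the abstract, the following Python sketch accumulates a per-frequency change quantity and starts a new block floating unit whenever the running sum exceeds a threshold. The source does not define the change quantity; the absolute difference of log2 magnitudes between neighbouring spectral lines, the reset of the sum at each boundary, and the name `unit_boundaries` are all assumptions made for this example. The correction stage (circuit 306) is not modelled.

```python
import math


def unit_boundaries(spectrum, threshold):
    """Return the start indices of block floating units.

    Assumption: the change quantity at line k is |log2|X[k]| - log2|X[k-1]||.
    A new unit starts at the line where the integrated change exceeds
    `threshold`, and the integration then restarts from zero.
    """
    eps = 1e-12  # avoids log2(0) for empty spectral lines
    boundaries = [0]
    integrated = 0.0
    for k in range(1, len(spectrum)):
        previous = math.log2(abs(spectrum[k - 1]) + eps)
        current = math.log2(abs(spectrum[k]) + eps)
        integrated += abs(current - previous)
        if integrated > threshold:
            boundaries.append(k)
            integrated = 0.0
    return boundaries


# A flat spectrum with one prominent peak at line 4.
peaky = [1.0, 1.0, 1.0, 1.0, 64.0, 1.0, 1.0, 1.0]
print(unit_boundaries(peaky, threshold=3.0))
```

With these inputs the peak ends up in a unit of its own, which is the behaviour the abstract aims for: the lines around the peak are no longer normalized by the peak value.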

Description

Detailed Description of the Invention

[0001]

[Industrial Field of Application] The present invention relates to the recording and reproduction of compressed data obtained by bit-compressing digital audio signals and the like, to a recording medium on which the compressed data is recorded, and to a transmission system for the compressed data. In particular, it relates to a digital signal compression method and apparatus, and a recording medium, in which the frequency size of the small blocks, subdivided by time and frequency, that are used for floating and/or bit allocation in information compression is varied according to changes of the input signal on the frequency axis.

[0002]

[Prior Art] The present applicant has previously proposed techniques for bit-compressing an input digital audio signal and recording it in bursts, with a predetermined amount of data as the recording unit, for example in the specifications and drawings of Japanese Patent Application Nos. Hei 2-221364, Hei 2-221365, Hei 2-222821, and Hei 2-222823.

[0003] This technique uses a magneto-optical disk as the recording medium and records and reproduces the AD (adaptive differential) PCM audio data specified in the audio data formats of the so-called CD-I (CD-Interactive) and CD-ROM XA. The ADPCM data is recorded on the magneto-optical disk in bursts, with a recording unit consisting of, for example, 32 sectors of ADPCM data plus several linking sectors for interleave processing.

[0004] Several modes are selectable for ADPCM audio in a recording/reproducing apparatus using this magneto-optical disk. Relative to the playback time of an ordinary CD, for example, the following are specified: level A, with a twofold compression ratio and a sampling frequency of 37.8 kHz; level B, with a fourfold compression ratio and a sampling frequency of 37.8 kHz; and level C, with an eightfold compression ratio and a sampling frequency of 18.9 kHz. In level B, for example, the digital audio data is compressed to approximately 1/4, and the playback time (play time) of a disc recorded in level B mode is four times that of the standard CD format (CD-DA format). Because a smaller disc can then provide about the same recording and playback time as a standard 12 cm disc, the apparatus can be made smaller.

[0005] However, because the rotational speed of the disc is the same as that of a standard CD, in level B, for example, compressed data corresponding to four times the playback time is obtained per given period. For this reason, the same compressed data is read redundantly four times in units of time such as sectors or clusters, and only one of those readings is passed on to audio reproduction. Specifically, when the spiral recording track is scanned (tracked), a track jump back to the original track position is performed once per revolution, so that reproduction proceeds by tracking the same track four times in succession. It therefore suffices to obtain correct compressed data in at least one of, for example, four redundant readings, which makes the scheme robust against errors caused by disturbances and the like and particularly suitable for small portable equipment.

[0006] Furthermore, the present applicant has proposed, in the specification and drawings of Japanese Patent Application No. Hei 4-36952, a bit allocation technique for achieving efficient, good compression. In this technique, when bits are allocated, each small block, such as a so-called critical band, is normalized by a representative value within the block (so-called block floating), and bit allocation that depends on the magnitude of the signal within each small block is performed with weighting according to the band to which the small block corresponds. With this technique, good compression is possible as long as the spectral magnitudes within each small block do not vary extremely.
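The normalization step described above can be sketched in Python as follows. This is a minimal illustration, not the applicant's implementation: the use of the block peak as the representative value, a power-of-two scale factor, a uniform quantizer, and the names `block_float` and `reconstruct` are assumptions made for the example.

```python
import math


def block_float(block, bits):
    """Normalize one small block by a shared scale factor and quantize it.

    Assumptions: the representative value is the block peak, the scale factor
    is the smallest power of two not below that peak, and every sample is
    quantized uniformly to a signed code of `bits` bits.
    """
    peak = max(abs(x) for x in block)
    if peak == 0:
        return 0, [0] * len(block)
    exponent = math.ceil(math.log2(peak))  # shared scale factor (as a power of two)
    scale = 2.0 ** exponent
    levels = 2 ** (bits - 1) - 1           # largest positive code
    codes = [round(x / scale * levels) for x in block]
    return exponent, codes


def reconstruct(exponent, codes, bits):
    """Invert block_float up to quantization error."""
    levels = 2 ** (bits - 1) - 1
    scale = 2.0 ** exponent
    return [c / levels * scale for c in codes]


# Samples of similar size: every sample keeps a useful code.
print(block_float([0.5, -0.2, 0.1, 0.0], bits=4))
# One prominent peak: the shared scale factor pushes the small samples to zero.
print(block_float([64.0, 0.5, 0.25, 0.1], bits=4))
```

The second call illustrates the caveat in the last sentence of the paragraph: when one component dominates a block, the shared scale factor leaves almost no resolution for the remaining components.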

[0007]

[Problems to Be Solved by the Invention] However, when digital data is compressed using the above technique, problems arise for signals whose spectral magnitudes within the small blocks used for block floating vary extremely or contain prominent peak components, for example a single sine wave, multiple sine waves, or rectangular waves. Because the deviation of the data within each small block that contains a peak is large, the efficiency of block floating with the representative value of each small block may fall. Furthermore, when the state of the input signal changes, that is, when the variation in spectral magnitude within each small block or the prominent peak components change, the efficiency of block floating changes as well, and compression efficiency may become strongly uneven across processing blocks in the time-axis direction. Moreover, if bits are allocated for compression while block floating efficiency is degraded, redundant bits may have to be allocated in order to keep the resulting quantization noise below the allowable noise level, which may further lower compression efficiency or make it more uneven.

[0008] In most applications of the above technique, the upper limit of the usable bit rate is fixed by the recording medium or the transmission path. If that upper limit is matched to processing blocks with low compression efficiency, processing blocks with high compression efficiency have surplus bits, and overall compression efficiency falls. Conversely, if the upper limit is matched to processing blocks with high compression efficiency, processing blocks with low compression efficiency lack information, and audible problems may become impossible to ignore. The problem grows as the usable bit rate falls: the larger the input-dependent deviation in compression efficiency between processing blocks, the harder it is to achieve a lower bit rate, in other words a higher compression ratio.

[0009] The present invention has been made in view of these circumstances. Its object is to provide a digital signal compression method and apparatus, and a recording medium, that apply information compression and a bit allocation technique for information compression in which the efficiency of block floating and quantization does not fall regardless of the characteristics of the input signal.

[0010]

[Means for Solving the Problems] The digital signal compression method and apparatus of the present invention are proposed to achieve the above object. They calculate a characteristic of the input signal on the frequency axis, distribute the input signal into small blocks subdivided in time and frequency, apply floating for compression, and control the frequency size of the small blocks to which floating is applied according to the frequency-axis characteristic of the input signal, so that optimum floating is performed.

[0011] The digital signal compression method and apparatus of the present invention also obtain spectrum data of the input signal on the frequency axis, derive an allowable noise spectrum from the spectrum data, distribute the allowable noise spectrum into small blocks subdivided in time and frequency, allocate bits for compression, and optimize the bit allocation by varying the frequency size of the small blocks used for bit allocation according to the frequency-axis characteristic of the input signal.

[0012] Further, the digital signal compression method and apparatus of the present invention comprise: a step or means for calculating a characteristic of the input signal on the frequency axis; a step or means for distributing the input signal into small blocks subdivided in time and frequency and applying floating for compression; a step or means for obtaining spectrum data on the frequency axis; a step or means for deriving an allowable noise spectrum from the spectrum data; a step or means for distributing the allowable noise spectrum into small blocks subdivided in time and frequency and allocating bits for compression; and a step or means for varying, according to the frequency-axis characteristic of the input signal, the frequency size of the small blocks used for floating and/or of the small blocks used for bit allocation. By varying the small blocks for block floating and the small blocks for bit allocation together or independently, block floating and bit allocation are optimized.

[0013] In the digital signal compression method and apparatus of the present invention, the frequency size of the small blocks (subdivided in time and frequency) used for block floating and/or bit allocation is determined in advance. The compression efficiency obtained with this predetermined size is compared with the compression efficiency obtained when the frequency size of the small blocks used for floating and/or bit allocation is varied according to the frequency-axis characteristic of the input signal, and the small-block size that allows information compression with higher efficiency is selected according to the result of the comparison. The small blocks used for block floating and the small blocks used for bit allocation are configured in common or independently, according to the frequency-axis characteristic of the input signal. Furthermore, the method and apparatus make the time length of the processing block for information compression variable in adaptation to the input signal. The time length of a processing block is determined on the basis of the change in the input signal of that processing block and the change in the input signal of other processing blocks, and/or power, energy, or peak information; and/or on the basis of the change in the input signal of the processing block and change information obtained from an input signal whose time width is longer than the maximum processing block. When the processing block length is determined, the proportions in which the elements that determine the time length contribute to the decision are fixed or adapted to the input signal, and/or set to predetermined ratios (for example 2, 4, or 8 times), and these are used together or individually. The input signal is an audio signal, and the frequency width of the blocks that control the generation of at least most of the quantization noise is made wider toward higher frequencies.
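The comparison between a predetermined small-block layout and an input-adaptive one can be sketched as below. The cost model is hypothetical and chosen only to make the trade-off visible: each unit pays a fixed number of side-information bits (`SIDE_BITS_PER_UNIT`, an assumed value), and every spectral line in a unit is given the word length needed for the unit peak to reach an assumed allowable noise level. None of these names or figures come from the source.

```python
import math

SIDE_BITS_PER_UNIT = 10  # hypothetical: scale factor plus word-length field


def unit_cost(mags, noise_floor):
    """Bits for one unit: side information plus a common word length per line."""
    peak = max(mags)
    if peak <= noise_floor:
        return SIDE_BITS_PER_UNIT  # nothing audible to code in this unit
    word_length = math.ceil(math.log2(peak / noise_floor)) + 1  # +1 for the sign
    return SIDE_BITS_PER_UNIT + word_length * len(mags)


def partition_cost(mags, boundaries, noise_floor):
    """Total bits when `mags` is split at the given unit start indices."""
    edges = list(boundaries) + [len(mags)]
    return sum(unit_cost(mags[a:b], noise_floor) for a, b in zip(edges, edges[1:]))


def choose_partition(mags, standard, adaptive, noise_floor):
    """Select whichever layout needs fewer bits; ties keep the standard layout."""
    standard_bits = partition_cost(mags, standard, noise_floor)
    adaptive_bits = partition_cost(mags, adaptive, noise_floor)
    if adaptive_bits < standard_bits:
        return "adaptive", adaptive_bits
    return "standard", standard_bits


peaky = [1.0, 1.0, 1.0, 1.0, 64.0, 1.0, 1.0, 1.0]
print(choose_partition(peaky, standard=[0, 4], adaptive=[0, 4, 5], noise_floor=1.0))
flat = [4.0] * 8
print(choose_partition(flat, standard=[0, 4], adaptive=[0, 2, 4, 6], noise_floor=1.0))
```

Under this model, isolating the peak saves bits despite the extra side information, while splitting a flat spectrum only adds side information, so the predetermined layout is kept. This mirrors the selection rule in the paragraph: the adaptive layout is adopted only when it is actually more efficient.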

[0014] In the method and apparatus of the present invention, an orthogonal transform is used to divide the time-axis signal into a plurality of bands on the frequency axis, and the shape of the window function used in the orthogonal transform is changed together with the orthogonal transform size. In dividing the time-axis signal into a plurality of bands on the frequency axis, the time-axis signal is first divided into a plurality of bands, a block consisting of a plurality of samples is formed for each divided band, and an orthogonal transform is performed on each block of each band to obtain coefficient data. The division frequency widths used when dividing the time-axis signal into bands before the orthogonal transform are made generally wider toward higher frequencies, and are made equal for the two consecutive lowest bands. In the bit allocation, no main information and/or sub information of the compression code is allocated to signal components in bands at or above approximately the signal pass band, and quadrature mirror filters are used for the division into the plurality of bands. A modified discrete cosine transform is used as the orthogonal transform. When the time length of a processing block is determined from the change in its input signal, the predetermined boundary value used for that decision is made variable according to the amplitude and frequency of the input signal, and takes a plurality of stepped values according to that amplitude and frequency. When the processing block length is determined, the auditory effect that the signal of another processing block has on the signal of the processing block in question is calculated using the energy and/or power or peak information of the spectrum on the frequency axis and/or the orthogonal transform coefficients, and the time length of the processing block is determined accordingly. The frequency-axis spectrum and/or orthogonal transform coefficients used in calculating this auditory effect are shared with the spectrum and/or orthogonal transform coefficients obtained after the orthogonal transform that are used for bit allocation and/or block floating. Furthermore, when the time length of a processing block is determined from the change in its input signal, the decision takes into account periodic changes of the input signal and/or repeated pulses or periodic features.
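The pairing of transform size and window shape mentioned above can be illustrated with a direct-form modified discrete cosine transform in Python. The source does not specify the window; the sine window used here, and the function names, are assumptions chosen because the sine window satisfies the power-complementary condition commonly required for MDCT reconstruction. The direct O(N²) sum is for clarity only.

```python
import math


def sine_window(n_coeffs):
    """Sine window of length 2N for an MDCT that yields N coefficients.

    Assumption: the source only says the window shape follows the transform
    size; the sine window is one common choice, not necessarily the one used.
    """
    length = 2 * n_coeffs
    return [math.sin(math.pi * (n + 0.5) / length) for n in range(length)]


def mdct(block):
    """Direct-form MDCT of a block of 2N samples, returning N coefficients."""
    length = len(block)
    n_coeffs = length // 2
    window = sine_window(n_coeffs)  # window is regenerated for each block size
    phase = 0.5 + n_coeffs / 2
    return [
        sum(
            block[n] * window[n]
            * math.cos(math.pi / n_coeffs * (n + phase) * (k + 0.5))
            for n in range(length)
        )
        for k in range(n_coeffs)
    ]


# A long block and a short block use windows of matching length.
long_block = [math.sin(0.3 * n) for n in range(32)]
short_block = long_block[:8]
print(len(mdct(long_block)), len(mdct(short_block)))
```

Switching between the long and short block sizes changes both the number of coefficients and the window length together, which is the coupling the paragraph describes.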

[0015] For determining the time length of a processing block from the change in its input signal, the method and apparatus of the present invention may also combine two functions: making the boundary value variable according to the amplitude and frequency of the input signal, and determining the time length by calculating the auditory effect of the signals of other processing blocks on the signal of the processing block, using the energy and/or power or peak information of the frequency-axis spectrum and/or orthogonal transform coefficients.

[0016] Next, the recording medium of the present invention is one on which compressed data compressed by the digital signal compression method of the present invention described above is recorded.

[0017] The digital signal compression method and apparatus of the present invention also transmit the compressed data.

[0018] That is, the digital signal compression method and apparatus (high-efficiency coding method and apparatus) according to the present invention solve the above problems by calculating a characteristic of the input signal on the frequency axis, distributing the input signal into small blocks subdivided in time and frequency, applying floating for compression, and controlling the frequency size of the small blocks to which floating is applied according to the frequency-axis characteristic of the input signal, so that optimum floating is applied.

[0019] The above problems are also solved by varying, when bits are allocated for compression, the frequency size of the small blocks (subdivided in time and frequency) used for bit allocation according to the frequency-axis characteristic of the input signal.

[0020] It is even more effective to vary the small blocks used for block floating and the small blocks used for bit allocation, each subdivided in time and frequency, either together or independently.

[0021] It is more effective still to compare the frequency size of the small blocks for block floating and/or bit allocation obtained according to the frequency-axis characteristic of the input signal with a predetermined frequency size for those small blocks, and to select the size that gives the higher overall compression efficiency.

[0022]

[Operation] According to the digital signal compression method and apparatus of the present invention, for input signals that would cause a large deviation in the efficiency of block floating and/or bit allocation, the frequency size of the small blocks (subdivided in time and frequency) used for block floating and/or bit allocation is selected so as to keep that deviation small, so that compression with little deviation in efficiency is achieved. This prevents a fall in compression efficiency: better sound quality is obtained at the same bit rate, and recording, transmission, and the like at a lower bit rate become possible for the same sound quality.

[0023]

[Embodiments] First, FIG. 1 is a block circuit diagram showing the schematic configuration of an embodiment of the digital signal compression apparatus (compressed data recording/reproducing apparatus) of the present invention, to which the digital signal compression method of the present invention is applied.

[0024] In the compressed data recording/reproducing apparatus shown in FIG. 1, a magneto-optical disk 1 rotated by a spindle motor 51 is used as the recording medium. When data is recorded on the magneto-optical disk 1, so-called magnetic field modulation recording is performed, for example by applying a modulated magnetic field corresponding to the recording data with a magnetic head 54 while irradiating laser light with an optical head 53, and the data is recorded along the recording track of the magneto-optical disk 1. During reproduction, the recording track of the magneto-optical disk 1 is traced with laser light by the optical head 53 and reproduced magneto-optically.

[0025] The optical head 53 comprises, for example, a laser light source such as a laser diode; optical components such as a collimator lens, an objective lens, a polarization beam splitter, and a cylindrical lens; and a photodetector having a light-receiving section of a predetermined pattern. The optical head 53 is positioned facing the magnetic head 54 across the magneto-optical disk 1. When data is recorded on the magneto-optical disk 1, the magnetic head 54 is driven by a head drive circuit 66 of the recording system, described later, to apply a modulated magnetic field corresponding to the recording data, while the optical head 53 irradiates the target track of the magneto-optical disk 1 with laser light, so that thermomagnetic recording is performed by the magnetic field modulation method. The optical head 53 also detects the reflection of the laser light from the target track, detecting focus error by, for example, the so-called astigmatism method and tracking error by, for example, the so-called push-pull method. When data is reproduced from the magneto-optical disk 1, the optical head 53 detects the focus error and tracking error and, at the same time, detects differences in the polarization angle (Kerr rotation angle) of the laser light reflected from the target track to generate a reproduced signal.

[0026] The output of the optical head 53 is supplied to an RF circuit 55. The RF circuit 55 extracts the focus error signal and the tracking error signal from the output of the optical head 53 and supplies them to a servo control circuit 56, and also binarizes the reproduced signal and supplies it to a decoder 71 of the reproduction system, described later.

[0027] The servo control circuit 56 comprises, for example, a focus servo control circuit, a tracking servo control circuit, a spindle motor servo control circuit, and a sled servo control circuit. The focus servo control circuit controls the focus of the optical system of the optical head 53 so that the focus error signal becomes zero. The tracking servo control circuit controls the tracking of the optical system of the optical head 53 so that the tracking error signal becomes zero. The spindle motor servo control circuit controls the spindle motor 51 so that the magneto-optical disk 1 is rotated at a predetermined rotational speed (for example, a constant linear velocity). The sled servo control circuit moves the optical head 53 and the magnetic head 54 to the target track position on the magneto-optical disk 1 designated by a system controller 57. The servo control circuit 56, which performs these control operations, sends information indicating the operating state of each part it controls to the system controller 57.

[0028] A key input operation unit 58 and a display unit 59 are connected to the system controller 57. The system controller 57 controls the recording system and the reproduction system in the operating mode designated by operation input from the key input operation unit 58. The system controller 57 also manages the recording position and the reproduction position on the recording track being traced by the optical head 53 and the magnetic head 54, on the basis of sector-unit address information reproduced from the recording track of the magneto-optical disk 1 by means of header time, subcode Q data, and the like. In addition, the system controller 57 causes the display unit 59 to display the playback time on the basis of the data compression ratio and the reproduction position information on the recording track.

【００２９】この再生時間表示は、光磁気ディスク１の
記録トラックからいわゆるヘッダタイムやいわゆるサブ
コードＱデータ等により再生されるセクタ単位のアドレ
ス情報（絶対時間情報）に対し、データ圧縮率の逆数
（例えば１／４圧縮のときには４）を乗算することによ
り、実際の時間情報を求め、これを表示部５９に表示さ
せるものである。なお、記録時においても、例えば光磁
気ディスク等の記録トラックに予め絶対時間情報が記録
されている（プリフォーマットされている）場合に、こ
のプリフォーマットされた絶対時間情報を読み取ってデ
ータ圧縮率の逆数を乗算することにより、現在位置を実
際の記録時間で表示させることも可能である。The reproduction time is displayed by multiplying the sector-unit address information (absolute time information), reproduced from the recording track of the magneto-optical disk 1 by means of the so-called header time, so-called sub-code Q data and the like, by the reciprocal of the data compression rate (for example, 4 in the case of 1/4 compression) to obtain the actual time information, which is then displayed on the display unit 59. During recording as well, when absolute time information is recorded in advance (pre-formatted) on the recording track of, for example, a magneto-optical disk, the current position can be displayed as the actual recording time by reading the pre-formatted absolute time information and multiplying it by the reciprocal of the data compression rate.
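The time conversion described in this paragraph can be illustrated with a minimal Python sketch. The function names `actual_time_seconds` and `format_mmss`, and the representation of the compression rate as a fraction (0.25 for 1/4 compression), are assumptions made for illustration and do not appear in the specification.

```python
def actual_time_seconds(absolute_time_s: float, compression_rate: float) -> float:
    """Scale the absolute time read from the disc by the reciprocal of the compression rate.

    compression_rate is a fraction such as 0.25 for 1/4 compression (assumed representation).
    """
    if not 0.0 < compression_rate <= 1.0:
        raise ValueError("compression_rate must be in (0, 1]")
    return absolute_time_s * (1.0 / compression_rate)


def format_mmss(seconds: float) -> str:
    """Format a duration as MM:SS for a display unit (hypothetical display format)."""
    total = int(seconds)
    return f"{total // 60:02d}:{total % 60:02d}"
```

With 1/4 compression, 30 seconds of absolute disc time corresponds to 120 seconds of audio, so `format_mmss(actual_time_seconds(30.0, 0.25))` gives the string shown to the user.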

【００３０】次にこのディスク記録再生装置の記録系に
おいて、入力端子６０からのアナログオーディオ入力信
号ＡINがローパスフィルタ６１を介してＡ／Ｄ変換器６
２に供給され、このＡ／Ｄ変換器６２は上記アナログオ
ーディオ入力信号ＡINを量子化する。Ａ／Ｄ変換器６２
から得られたディジタルオーディオ信号は、ＡＴＣ（Ad
aptive Transform Coding ）ＰＣＭエンコーダ６３に供
給される。また、入力端子６７からのディジタルオーデ
ィオ入力信号ＤINがディジタル入力インターフェース回
路６８を介してＡＴＣエンコーダ６３に供給される。Ａ
ＴＣエンコーダ６３は、上記入力信号ＡINを上記Ａ／Ｄ
変換器６２により量子化した所定転送速度のディジタル
オーディオＰＣＭデータについて、ビット圧縮（データ
圧縮）処理を行う。ここではその圧縮率を４倍として説
明するが、本実施例はこの倍率には依存しない構成とな
っており、応用例により任意に選択が可能である。Next, in the recording system of this disc recording/reproducing apparatus, the analog audio input signal AIN from the input terminal 60 is supplied through the low-pass filter 61 to the A/D converter 62, which quantizes the analog audio input signal AIN. The digital audio signal obtained from the A/D converter 62 is supplied to the ATC (Adaptive Transform Coding) PCM encoder 63. In addition, the digital audio input signal DIN from the input terminal 67 is supplied to the ATC encoder 63 via the digital input interface circuit 68. The ATC encoder 63 performs bit compression (data compression) on the digital audio PCM data of a predetermined transfer rate obtained by quantizing the input signal AIN in the A/D converter 62. The compression ratio is described here as a factor of four, but the present embodiment does not depend on this factor, and it can be chosen freely according to the application.

【００３１】次にメモリ６４は、データの書き込み及び
読み出しがシステムコントローラ５７により制御され、
ＡＴＣエンコーダ６３から供給されるＡＴＣデータを一
時的に記憶しておき、必要に応じてディスク上に記録す
るためのバッファメモリとして用いられている。すなわ
ち、例えばＡＴＣエンコーダ６３から供給される圧縮オ
ーディオデータは、そのデータ転送速度が、標準的なＣ
Ｄ−ＤＡフォーマットのデータ転送速度（７５セクタ／
秒）の１／４、すなわち１８．７５セクタ／秒に低減さ
れており、この圧縮データがメモリ１４に連続的に書き
込まれる。この圧縮データ（ＡＴＣデータ）は、前述し
たように４セクタにつき１セクタの記録を行えば足りる
が、このような４セクタおきの記録は事実上不可能に近
いため、後述するようなセクタ連続の記録を行うように
している。この記録は、休止期間を介して、所定の複数
セクタ（例えば３２セクタ＋数セクタ）から成るクラス
タを記録単位として、標準的なＣＤ−ＤＡフォーマット
と同じデータ転送速度（７５セクタ／秒）でバースト的
に行われる。すなわちメモリ１４においては、上記ビッ
ト圧縮レートに応じた１８．７５（＝７５／４）セクタ
／秒の低い転送速度で連続的に書き込まれたＡＴＣオー
ディオデータが、記録データとして上記７５セクタ／秒
の転送速度でバースト的に読み出される。この読み出さ
れて記録されるデータについて、記録休止期間を含む全
体的なデータ転送速度は、上記１８．７５セクタ／秒の
低い速度となっているが、バースト的に行われる記録動
作の時間内での瞬時的なデータ転送速度は上記標準的な
７５セクタ／秒となっている。従って、ディスク回転速
度が標準的なＣＤ−ＤＡフォーマットと同じ速度（一定
線速度）のとき、該ＣＤ−ＤＡフォーマットと同じ記録
密度、記憶パターンの記録が行われることになる。Next, the memory 64, whose data writing and reading are controlled by the system controller 57, is used as a buffer memory that temporarily stores the ATC data supplied from the ATC encoder 63 and records it on the disk as needed. That is, the compressed audio data supplied from the ATC encoder 63 has its data transfer rate reduced to, for example, 1/4 of the data transfer rate of the standard CD-DA format (75 sectors/second), that is, to 18.75 sectors/second, and this compressed data is written continuously into the memory 64. For this compressed data (ATC data) it would suffice, as described above, to record one sector out of every four, but since recording every fourth sector is practically impossible, sector-continuous recording is performed as described below. This recording is performed in bursts, separated by pause periods, at the same data transfer rate as the standard CD-DA format (75 sectors/second), with a cluster consisting of a predetermined number of sectors (for example, 32 sectors plus several sectors) as the recording unit. That is, in the memory 64, the ATC audio data written continuously at the low transfer rate of 18.75 (= 75/4) sectors/second corresponding to the bit compression rate is read out in bursts as recording data at the transfer rate of 75 sectors/second. For the data read out and recorded in this way, the overall data transfer rate including the recording pause periods is the low rate of 18.75 sectors/second, but the instantaneous data transfer rate within the time of each burst recording operation is the standard 75 sectors/second. Therefore, when the disk rotation speed is the same as that of the standard CD-DA format (constant linear velocity), recording is performed with the same recording density and storage pattern as the CD-DA format.
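The relationship between the continuous write rate and the burst read rate of the buffer memory can be checked with a short Python sketch. The function name `burst_schedule` is hypothetical, and the sketch ignores the cluster-linking sectors for simplicity; only the 75 sectors/second rate, the factor of four and the 32-sector cluster come from the text.

```python
CD_DA_RATE = 75.0  # sectors per second, standard CD-DA transfer rate


def burst_schedule(cluster_sectors: int = 32, compression_factor: float = 4.0):
    """Return (fill_rate, fill_time_s, burst_time_s, duty_cycle) for one cluster.

    fill_rate    : continuous rate at which compressed data enters the buffer
    fill_time_s  : time needed to accumulate one cluster at fill_rate
    burst_time_s : time needed to write that cluster to disc at CD_DA_RATE
    duty_cycle   : fraction of time the recording head is actually writing
    """
    fill_rate = CD_DA_RATE / compression_factor
    fill_time_s = cluster_sectors / fill_rate
    burst_time_s = cluster_sectors / CD_DA_RATE
    return fill_rate, fill_time_s, burst_time_s, burst_time_s / fill_time_s
```

Under these assumptions the head writes for roughly a quarter of the time, which is why the average rate including pauses stays at 18.75 sectors/second while the instantaneous rate is 75 sectors/second.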

【００３２】メモリ６４から上記７５セクタ／秒の（瞬
時的な）転送速度でバースト的に読み出されたＡＴＣオ
ーディオデータすなわち記録データは、エンコーダ６５
に供給される。ここで、メモリ６４からエンコーダ６５
に供給されるデータ列において、１回の記録で連続記録
される単位は、複数セクタ（例えば３２セクタ）から成
るクラスタ及び該クラスタの前後位置に配されたクラス
タ接続用の数セクタとしている。このクラスタ接続用セ
クタは、エンコーダ６５でのインターリーブ長より長く
設定しており、インターリーブされても他のクラスタの
データに影響を与えないようにしている。The ATC audio data, that is, the recording data, read out in bursts from the memory 64 at the (instantaneous) transfer rate of 75 sectors/second is supplied to the encoder 65. In the data string supplied from the memory 64 to the encoder 65, the unit recorded continuously in one recording operation consists of a cluster made up of a plurality of sectors (for example, 32 sectors) and several cluster-linking sectors arranged before and after the cluster. These cluster-linking sectors are set longer than the interleave length in the encoder 65, so that interleaving does not affect the data of other clusters.

【００３３】エンコーダ６５は、メモリ６４から上述し
たようにバースト的に供給される記録データについて、
エラー訂正のための符号化処理（パリティ付加及びイン
ターリーブ処理）やＥＦＭ符号化処理などを施す。この
エンコーダ６５による符号化処理の施された記録データ
が磁気ヘッド駆動回路６６に供給される。この磁気ヘッ
ド駆動回路６６は、磁気ヘッド５４が接続されており、
上記記録データに応じた変調磁界を光磁気ディスク１に
印加するように磁気ヘッド５４を駆動する。The encoder 65 applies encoding for error correction (parity addition and interleaving), EFM encoding and the like to the recording data supplied in bursts from the memory 64 as described above. The recording data encoded by the encoder 65 is supplied to the magnetic head drive circuit 66. The magnetic head drive circuit 66, to which the magnetic head 54 is connected, drives the magnetic head 54 so as to apply a modulated magnetic field corresponding to the recording data to the magneto-optical disk 1.

【００３４】また、システムコントローラ５７は、メモ
リ６４に対する上述の如きメモリ制御を行うとともに、
このメモリ制御によりメモリ６４からバースト的に読み
出される上記記録データを光磁気ディスク２の記録トラ
ックに連続的に記録するように記録位置の制御を行う。
この記録位置の制御は、システムコントローラ５７によ
りメモリ６４からバースト的に読み出される上記記録デ
ータの記録位置を管理して、光磁気ディスク１の記録ト
ラック上の記録位置を指定する制御信号をサーボ制御回
路５６に供給することによって行われる。The system controller 57 performs the memory control described above on the memory 64, and also controls the recording position so that the recording data read out in bursts from the memory 64 under this memory control is recorded continuously on the recording track of the magneto-optical disk 1. This recording position control is performed by having the system controller 57 manage the recording position of the recording data read out in bursts from the memory 64 and supply the servo control circuit 56 with a control signal designating the recording position on the recording track of the magneto-optical disk 1.

【００３５】次に、この光磁気ディスク記録再生ユニッ
トの再生系について説明する。この再生系は、上述の記
録系により光磁気ディスク１の記録トラック上に連続的
に記録された記録データを再生するためのものであり、
光学ヘッド５３によって光磁気ディスク１の記録トラッ
クをレーザ光でトレースすることにより得られる再生出
力がＲＦ回路５５により２値化されて供給されるデコー
ダ７１を備えている。この時光磁気ディスクのみではな
く、コンパクトディクス（ＣＤ：COMPACT DISC）と同じ
再生専用光ディスクの読み出しも行うことができる。Next, the reproducing system of this magneto-optical disk recording/reproducing unit will be described. This reproducing system reproduces the recording data recorded continuously on the recording track of the magneto-optical disk 1 by the recording system described above, and includes a decoder 71 to which the reproduction output, obtained by tracing the recording track of the magneto-optical disk 1 with laser light from the optical head 53, is supplied after being binarized by the RF circuit 55. Not only magneto-optical disks but also read-only optical disks of the same kind as the compact disc (CD) can be read.

【００３６】デコーダ７１は、上述の記録系におけるエ
ンコーダ６５に対応するものであって、ＲＦ回路５５に
より２値化された再生出力について、エラー訂正のため
の上述の如き復号化処理やＥＦＭ復号化処理などの処理
を行いオーディオデータを、正規の転送速度よりも早い
７５セクタ／秒の転送速度で再生する。このデコーダ７
１により得られる再生データは、メモリ７２に供給され
る。The decoder 71 corresponds to the encoder 65 in the recording system described above. It applies the decoding for error correction described above, EFM decoding and the like to the reproduction output binarized by the RF circuit 55, and reproduces the audio data at a transfer rate of 75 sectors/second, which is faster than the normal transfer rate. The reproduction data obtained by the decoder 71 is supplied to the memory 72.

【００３７】メモリ７２は、データの書き込み及び読み
出しがシステムコントローラ５７により制御され、デコ
ーダ７１から７５セクタ／秒の転送速度で供給される再
生データがその７５セクタ／秒の転送速度でバースト的
に書き込まれる。また、このメモリ７２は、上記７５セ
クタ／秒の転送速度でバースト的に書き込まれた上記再
生データが正規の７５セクタ／秒の転送速度１８．７５
セクタ／秒で連続的に読み出される。In the memory 72, whose data writing and reading are controlled by the system controller 57, the reproduction data supplied from the decoder 71 at the transfer rate of 75 sectors/second is written in bursts at that transfer rate of 75 sectors/second. The reproduction data written in bursts at 75 sectors/second is then read out of the memory 72 continuously at the normal transfer rate of 18.75 sectors/second.

【００３８】システムコントローラ５７は、再生データ
をメモリ７２に７５セクタ／秒の転送速度で書き込むと
ともに、メモリ７２から上記再生データを上記１８．７
５セクタ／秒の転送速度で連続的に読み出すようなメモ
リ制御を行う。また、システムコントローラ５７は、メ
モリ７２に対する上述の如きメモリ制御を行うととも
に、このメモリ制御によりメモリ７２からバースト的に
書き込まれる上記再生データを光磁気ディスク１の記録
トラックから連続的に再生するように再生位置の制御を
行う。この再生位置の制御は、システムコントローラ５
７によりメモリ７２からバースト的に読み出される上記
再生データの再生位置を管理して、光磁気ディスク１も
しくは光ディスク１の記録トラック上の再生位置を指定
する制御信号をサーボ制御回路５６に供給することによ
って行われる。The system controller 57 performs memory control such that the reproduction data is written into the memory 72 at the transfer rate of 75 sectors/second and read out of the memory 72 continuously at the transfer rate of 18.75 sectors/second. In addition to this memory control of the memory 72, the system controller 57 controls the reproduction position so that the reproduction data written in bursts into the memory 72 under this memory control is reproduced continuously from the recording track of the magneto-optical disk 1. This reproduction position control is performed by having the system controller 57 manage the reproduction position of the reproduction data read out in bursts from the memory 72 and supply the servo control circuit 56 with a control signal designating the reproduction position on the recording track of the magneto-optical disk 1 or the optical disk 1.

【００３９】メモリ７２から１８．７５セクタ／秒の転
送速度で連続的に読み出された再生データとして得られ
るＡＴＣオーディオデータは、ＡＴＣデコーダ７３に供
給される。このＡＴＣデコーダ７３は、ＡＴＣデータを
４倍にデータ伸張（ビット伸張）することで１６ビット
のディジタルオーディオデータを再生する。このＡＴＣ
デコーダ７３からのディジタルオーディオデータは、Ｄ
／Ａ変換器７４に供給される。The ATC audio data obtained as the reproduction data read out continuously from the memory 72 at the transfer rate of 18.75 sectors/second is supplied to the ATC decoder 73. The ATC decoder 73 reproduces 16-bit digital audio data by expanding the ATC data by a factor of four (bit expansion). The digital audio data from the ATC decoder 73 is supplied to the D/A converter 74.

【００４０】Ｄ／Ａ変換器７４は、ＡＴＣデコーダ７３
から供給されるディジタルオーディオデータをアナログ
信号に変換して、アナログオーディオ出力信号ＡOUT を
形成する。このＤ／Ａ変換器７４により得られるアナロ
グオーデイオ信号ＡOUT は、ローパスフィルタ７５を介
して出力端子７６から出力される。The D/A converter 74 converts the digital audio data supplied from the ATC decoder 73 into an analog signal to form the analog audio output signal AOUT. The analog audio signal AOUT obtained by the D/A converter 74 is output from the output terminal 76 via the low-pass filter 75.

【００４１】次に、本発明のディジタル信号圧縮装置の
情報圧縮に適用される高能率圧縮符号化について詳述す
る。すなわち、オーディオＰＣＭ信号等の入力ディジタ
ル信号を、帯域分割符号化（ＳＢＣ）、適応変換符号化
（ＡＴＣ）及び適応ビット割当ての各技術を用いて高能
率符号化する技術について、図２以降を参照しながら説
明する。Next, the high-efficiency compression coding applied to information compression in the digital signal compression apparatus of the present invention will be described in detail. Specifically, a technique for high-efficiency coding of an input digital signal such as an audio PCM signal using sub-band coding (SBC), adaptive transform coding (ATC) and adaptive bit allocation will be described with reference to FIG. 2 and subsequent figures.

【００４２】図２に示す具体的な高能率符号化装置で
は、定常状態においては、入力ディジタル信号を複数の
周波数帯域に分割すると共に、最低域の隣接した２帯域
の帯域幅は同じで、より高い周波数帯域では高い周波数
帯域ほどバンド幅を広く選定し、各周波数帯域毎に直交
変換を行って、得られた周波数軸のスペクトルデータ
を、低域では、後述する人間の聴覚特性を考慮したいわ
ゆる臨界帯域幅（クリティカルバンド）毎に、中高域で
はブロックフローティング効率を考慮して臨界帯域幅を
細分化した帯域毎に、適応的にビット割当して符号化し
ている。通常このブロックが量子化雑音発生ブロックと
なる。このクリティカルバンドとは、人間の聴覚特性を
考慮して分割された周波数帯域であり、ある純音の周波
数近傍の同じ強さの狭帯域バンドノイズによって当該純
音がマスクされるときのそのノイズの持つ帯域のことで
ある。このクリティカルバンドは、高域ほど帯域幅が広
くなっており、上記０〜２２ｋＨｚの全周波数帯域は例
えば２５のクリティカルバンドに分割されている。In the specific high-efficiency coding apparatus shown in FIG. 2, in the steady state, the input digital signal is divided into a plurality of frequency bands such that the two adjacent lowest bands have the same bandwidth and, above them, the bandwidth is made wider for higher frequency bands. An orthogonal transform is performed for each frequency band, and the resulting spectrum data on the frequency axis is coded with adaptive bit allocation: in the low range, for each so-called critical band, which takes into account the human auditory characteristics described later, and in the middle and high ranges, for each band obtained by subdividing the critical bands in consideration of block floating efficiency. Normally this block is the block in which quantization noise is generated. A critical band is a frequency band divided in consideration of human auditory characteristics, namely the band occupied by a narrow-band noise that masks a pure tone when the noise has the same intensity as the tone and lies near its frequency. The critical bands become wider toward higher frequencies, and the entire frequency range of 0 to 22 kHz is divided into, for example, 25 critical bands.

【００４３】さらに、本発明実施例においては、直交変
換の前に入力信号に応じて適応的にブロックサイズ（処
理ブロック長）を変化させると共に、入力信号に応じ
て、フローテイング処理を行うブロックフローティング
ユニットの大きさも変化させる処理を行い、圧縮後の情
報の使用効率が最適となるように量子化を行っている。Furthermore, in the embodiment of the present invention, the block size (processing block length) is changed adaptively according to the input signal before the orthogonal transform, and the size of the block floating unit on which floating is performed is also changed according to the input signal, so that quantization is carried out in a way that optimizes the usage efficiency of the compressed information.

【００４４】即ち、図２において、入力端子２００には
例えばサンプリング周波数が４４．１ｋＨｚの時、０〜
２２ｋＨｚのオーディオＰＣＭ信号が供給されている。
この入力信号は、例えばいわゆるＱＭＦ等のフィルタか
らなる帯域分割フィルタ２０１により０〜１１ｋＨｚ帯
域と１１ｋＨｚ〜２２ｋＨｚ帯域とに分割され、０〜１
１ｋＨｚ帯域の信号は同じくいわゆるＱＭＦ等のフィル
タからなる帯域分割フィルタ２０２により０〜５．５ｋ
Ｈｚ帯域と５．５ｋＨｚ〜１１ｋＨｚ帯域とに分割され
る。帯域分割フィルタ２０１からの１１ｋＨｚ〜２２ｋ
Ｈｚ帯域の信号は直交変換回路の一例であるＭＤＣＴ回
路２０３に送られ、帯域分割フィルタ２０２からの５.
５ｋＨｚ〜１１ｋＨｚ帯域の信号はＭＤＣＴ回路２０４
に送られ、帯域分割フィルタ２０２からの０〜５．５ｋ
Ｈｚ域の信号はＭＤＣＴ回路２０５に送られることによ
り、それぞれＭＤＣＴ処理される。That is, in FIG. 2, an audio PCM signal of 0 to 22 kHz is supplied to the input terminal 200 when, for example, the sampling frequency is 44.1 kHz. This input signal is divided into a 0 to 11 kHz band and an 11 kHz to 22 kHz band by a band division filter 201 consisting of a filter such as a so-called QMF, and the signal in the 0 to 11 kHz band is likewise divided into a 0 to 5.5 kHz band and a 5.5 kHz to 11 kHz band by a band division filter 202 consisting of a filter such as a so-called QMF. The signal in the 11 kHz to 22 kHz band from the band division filter 201 is sent to the MDCT circuit 203, which is an example of an orthogonal transform circuit, the signal in the 5.5 kHz to 11 kHz band from the band division filter 202 is sent to the MDCT circuit 204, and the signal in the 0 to 5.5 kHz band from the band division filter 202 is sent to the MDCT circuit 205, so that each is subjected to MDCT processing.

【００４５】ここで、上述した入力ディジタル信号を複
数の周波数帯域に分割する手法の一例としてのＱＭＦの
フィルタは、例えば文献「ディジタル・コーディング・
オブ・スピーチ・イン・サブバンズ」("Digital coding
of speech in subbands" R.E.Crochiere, Bell Syst.
Tech. J., Vol.55,No.8 1976) に述べられている。この
ＱＭＦのフィルタは、帯域を等バンド幅に２分割するも
のであり、当該フィルタにおいては上記分割した帯域を
後に合成する際にいわゆるエリアシングが発生しないこ
とが特徴となっている。Here, the QMF filter, as an example of a technique for dividing the input digital signal into a plurality of frequency bands as described above, is described in, for example, "Digital coding of speech in subbands," R. E. Crochiere, Bell Syst. Tech. J., Vol. 55, No. 8, 1976. This QMF filter divides a band into two bands of equal bandwidth, and is characterized in that so-called aliasing does not occur when the divided bands are later recombined.
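The two-stage band division of FIG. 2 and the alias-free recombination property of a QMF pair can be sketched in Python as follows. This sketch uses the 2-tap Haar pair, which is the simplest QMF pair and is chosen here purely for illustration; the specification does not state which filter coefficients filters 201 and 202 use, and all function names are hypothetical.

```python
import math

_S = 1.0 / math.sqrt(2.0)


def qmf_split(x):
    """Split an even-length signal into half-rate low and high bands (2-tap Haar QMF pair)."""
    low = [(x[i] + x[i + 1]) * _S for i in range(0, len(x), 2)]
    high = [(x[i] - x[i + 1]) * _S for i in range(0, len(x), 2)]
    return low, high


def qmf_merge(low, high):
    """Recombine two half-rate bands; the aliasing of the two branches cancels."""
    x = []
    for lo, hi in zip(low, high):
        x.extend(((lo + hi) * _S, (lo - hi) * _S))
    return x


def three_band_split(x):
    """Mimic filters 201 and 202: first split in two, then split the lower band again."""
    low, high = qmf_split(x)            # 0-11 kHz and 11-22 kHz
    low_low, low_high = qmf_split(low)  # 0-5.5 kHz and 5.5-11 kHz
    return low_low, low_high, high


def three_band_merge(low_low, low_high, high):
    """Inverse of three_band_split."""
    return qmf_merge(qmf_merge(low_low, low_high), high)
```

The high band carries half of the input samples and each of the two lower bands a quarter, which matches the bandwidth ratio of the three bands fed to the MDCT circuits.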

【００４６】また、文献「ポリフェイズ・クァドラチュ
ア・フィルターズ −新しい帯域分割符号化技術」("Po
lyphase Quadrature filters -A new subband coding t
echnique", Joseph H. Rothweiler ICASSP 83, BOSTON)
には、等帯域幅のフィルタ分割手法が述べられている。
このポリフェイズ・クァドラチュア・フィルタにおいて
は、信号を等バンド幅の複数の帯域に分割する際に一度
に分割できることが特徴となっている。In addition, "Polyphase Quadrature filters - A new subband coding technique," Joseph H. Rothweiler, ICASSP 83, Boston, describes a technique for dividing a signal into bands of equal bandwidth with a filter. This polyphase quadrature filter is characterized in that a signal can be divided into a plurality of bands of equal bandwidth in a single step.

【００４７】さらに、上述した直交変換としては、例え
ば、入力オーディオ信号を所定単位時間（フレーム）で
ブロック化し、当該ブロック毎に高速フーリエ変換（Ｆ
ＦＴ）、離散コサイン変換（ＤＣＴ）、モディファイド
ＤＣＴ変換（ＭＤＣＴ）などを行うことで時間軸を周波
数軸に変換するような直交変換がある。このＭＤＣＴに
ついては、文献「時間領域エリアシング・キャンセルを
基礎とするフィルタ・バンク設計を用いたサブバンド／
変換符号化」("Subband/Transform Coding Using Filte
r Bank Designs Based on Time Domain Aliasing Cance
llation," J.P.Princen A.B.Bradley, Univ. of Surrey
Royal Melbourne Inst. of Tech. ICASSP 1987)に述べ
られている。Further, examples of the orthogonal transform mentioned above include transforms in which the input audio signal is divided into blocks of a predetermined unit time (frame) and a fast Fourier transform (FFT), discrete cosine transform (DCT), modified DCT (MDCT) or the like is applied to each block to convert the time axis to the frequency axis. The MDCT is described in "Subband/Transform Coding Using Filter Bank Designs Based on Time Domain Aliasing Cancellation," J. P. Princen and A. B. Bradley, Univ. of Surrey / Royal Melbourne Inst. of Tech., ICASSP 1987.

【００４８】ここで、各ＭＤＣＴ回路２０３、２０４、
２０５に供給する各帯域毎のブロックについての標準的
な入力信号に対する具体例を図３に示す。この図３の具
体例においては、３つのフィルタ出力信号は、各帯域ご
とに独立に各々複数の直交変換ブロックサイズを持ち、
信号の時間特性、周波数分布等により時間分解能を切り
換えられる様にしている。信号が時間的に準定常的であ
る場合には、直交変換ブロックサイズを１１．６ｍｓ、
即ち、図３の（Ａ）に示すロングモード（ＬｏｎｇＭ
ｏｄｅ）と大きくし、信号が非定常的である場合には、
直交変換ブロックサイズを更に２分割、４分割とする。
図３の（Ｂ）のショートモード（ＳｈｏｒｔＭｏｄ
ｅ）のごとく、すべてを４分割で２．９ｍｓとする場合
や、図３の（Ｃ）のミドルモードＡ（Ｍｉｄｄｌｅ
ＭｏｄｅＡ）、図３の（Ｄ）のミドルモードＢ（Ｍ
ｉｄｄｌｅＭｏｄｅＢ）のごとく、一部を２分割で
５．８ｍｓ、１部を４分割で２．９ｍｓの時間分解能と
することで、実際の複雑な入力信号に適応するようにな
っている。この直交変換ブロックサイズの分割は処理装
置の規模が許せば、さらに複雑な分割を行うと、より効
果的なことは明白である。このブロックサイズの決定は
図２のブロックサイズ決定回路２０６〜２０８で決定さ
れ、各ＭＤＣＴ回路２０３〜２０５に伝えられるととも
に、該当ブロックのブロックサイズ情報として出力端子
２１６〜２１８より出力される。FIG. 3 shows a specific example, for a standard input signal, of the blocks of each band supplied to the MDCT circuits 203, 204 and 205. In this example of FIG. 3, each of the three filter output signals has a plurality of orthogonal transform block sizes independently for its band, so that the time resolution can be switched according to the temporal characteristics, frequency distribution and so on of the signal. When the signal is quasi-stationary in time, the orthogonal transform block size is made large, 11.6 ms, that is, the long mode (Long Mode) shown in FIG. 3(A); when the signal is non-stationary, the orthogonal transform block size is further divided into two or four. The block can be divided entirely into four parts of 2.9 ms each, as in the short mode (Short Mode) of FIG. 3(B), or partly into two with a time resolution of 5.8 ms and partly into four with a time resolution of 2.9 ms, as in middle mode A (Middle Mode A) of FIG. 3(C) and middle mode B (Middle Mode B) of FIG. 3(D), so as to adapt to the complex input signals encountered in practice. Clearly, if the scale of the processing apparatus permits, dividing the orthogonal transform block size in still more complex ways would be more effective. The block size is determined by the block size determination circuits 206 to 208 of FIG. 2, conveyed to the MDCT circuits 203 to 205, and also output from the output terminals 216 to 218 as the block size information of the block concerned.
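The four block-size modes can be written down as a small Python table. The 11.6 ms, 5.8 ms and 2.9 ms durations come from the text; the order of the long and short segments within middle modes A and B is not stated in this passage and is an assumption here, as are the names `MODES` and `segment_durations_ms`.

```python
LONG_BLOCK_MS = 11.6  # duration of one long-mode orthogonal transform block

# Each mode lists its segments as fractions of the long block.
MODES = {
    "long": [1.0],
    "short": [0.25, 0.25, 0.25, 0.25],
    "middle_a": [0.5, 0.25, 0.25],  # assumed ordering: 5.8 ms first, then 2 x 2.9 ms
    "middle_b": [0.25, 0.25, 0.5],  # assumed ordering: 2 x 2.9 ms first, then 5.8 ms
}


def segment_durations_ms(mode: str):
    """Return the duration in milliseconds of each transform segment in the given mode."""
    return [round(fraction * LONG_BLOCK_MS, 2) for fraction in MODES[mode]]
```

Every mode covers exactly one long block, so switching modes changes only the time resolution inside the block, not the block boundaries.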

【００４９】次に、ブロックサイズ決定回路の一具体例
の概略構成を表すブロック回路図をを図４に示す。ここ
では図２のブロック決定回路２０６を例に説明する。図
２のＱＭＦからなる帯域分割フィルタ２０１の出力のう
ち、１１ｋＨｚ〜２２ｋＨｚの出力は図４の入力端子４
０１を介してパワー算出回路４０４に送られる。さら
に、図２の帯域分割フィルタ２０２の出力のうち、５．
５ｋＨｚ〜１１ｋＨｚの出力は図４の入力端子４０２を
介してパワー算出回路４０５へ、０〜５．５ｋＨｚの出
力は図４の入力端子４０３を介してパワー算出回路４０
６へとそれぞれ送られる。また、図２のブロックサイズ
決定回路２０７、２０８は図４の入力端子４０１〜４０
３へ入力される信号がブロックサイズ決定回路２０６の
場合と異なるだけで、動作は同一である。各ブロックサ
イズ決定回路２０６〜２０８におけるそれぞれの入力端
子４０１〜４０３はマトリクス構成となっており、即
ち、ブロックサイズ決定回路２０７の入力端子４０１に
は図２の帯域分割フィルタ２０２の５．５ｋＨｚ〜１１
ｋＨｚの出力が接続されており、同入力端子４０２には
０〜５．５ｋＨｚの出力が接続されている。ブロックサ
イズ決定回路２０８についても、同様である。Next, FIG. 4 is a block circuit diagram showing the schematic configuration of a specific example of the block size determination circuit. The block size determination circuit 206 of FIG. 2 is used as the example here. Of the outputs of the band division filter 201 of FIG. 2, which consists of a QMF, the 11 kHz to 22 kHz output is sent to the power calculation circuit 404 via the input terminal 401 of FIG. 4. Further, of the outputs of the band division filter 202 of FIG. 2, the 5.5 kHz to 11 kHz output is sent to the power calculation circuit 405 via the input terminal 402 of FIG. 4, and the 0 to 5.5 kHz output is sent to the power calculation circuit 406 via the input terminal 403 of FIG. 4. The block size determination circuits 207 and 208 of FIG. 2 operate in the same way as the block size determination circuit 206, differing only in the signals input to the input terminals 401 to 403 of FIG. 4. The input terminals 401 to 403 of the block size determination circuits 206 to 208 form a matrix arrangement: the 5.5 kHz to 11 kHz output of the band division filter 202 of FIG. 2 is connected to the input terminal 401 of the block size determination circuit 207, and the 0 to 5.5 kHz output is connected to its input terminal 402. The same applies to the block size determination circuit 208.

【００５０】図４において、各パワー算出回路４０４〜
４０６は入力された時間波形を一定時間、積分すること
によって、各周波数帯域のパワーを求めている。この
際、積分する時間幅は上述の直交変換ブロックサイズの
うち、最小時間ブロック以下である必要がある。また、
上述の算出法以外、例えば直交変換ブロックサイズの最
小時間幅内の最大振幅の絶対値或いは振幅の平均値を代
表パワーとして用いても同様の効果が得られる。パワー
算出回路４０４の出力は変化分抽出回路４０８及びパワ
ー比較回路４０９に、パワー算出回路４０５、４０６の
出力はパワー比較回路４０９にそれぞれ送られる。変化
分抽出回路４０８ではパワー算出回路４０４より送られ
たパワーの微係数を求めてパワーの変化情報として、ブ
ロックサイズ１次決定回路４１０及びメモリ４０７へ送
る。メモリ４０７では、変化分抽出回路４０８より送ら
れたパワーの変化情報を上述の直交変換ブロックサイズ
の最大時間以上、蓄積する。これは時間的に隣接する直
交変換ブロックが直交変換の際のウィンドウ処理によ
り、互いに影響を与え合うため、時間的に隣接する１つ
前のブロックのパワー変化情報をブロックサイズ１次決
定回路４１０において必要とするためである。ブロック
サイズ１次決定回路４１０では変化分抽出回路４０８よ
り送られた該当ブロックのパワー変化情報とメモリ４０
７より送られた時間的に隣接する該当ブロックの１つ前
のブロックのパワー変化情報をもとに、該当する周波数
帯域内のパワーの時間的変位から該当する周波数帯域の
直交変換ブロックサイズを決定する。この際、一定以上
の変位が認められた場合、より時間的に短い直交変換ブ
ロックイサイズを選択するわけであるが、その変位点
（境界値）は固定でも効果は得られる。さらに周波数に
比例した値、即ち、周波数が高い場合は大きな変位によ
って時間的に短いブロックサイズとなり、周波数が低い
場合は、高い場合のそれに比べ小さな変位で時間的に短
いブロックサイズに決定されると、より効果的である。
この値（境界値）はなめらかに変化することが望ましい
が、複数段階の階段状の変化であっても構わない。以上
のように決定されたブロックサイズはブロックサイズ修
正回路４１１へ伝送される。In FIG. 4, each of the power calculation circuits 404 to 406 obtains the power of its frequency band by integrating the input time waveform over a fixed time. The integration time width must be no longer than the minimum time block among the orthogonal transform block sizes described above. Besides this calculation method, the same effect can be obtained by using, for example, the absolute value of the maximum amplitude or the average amplitude within the minimum time width of the orthogonal transform block size as the representative power. The output of the power calculation circuit 404 is sent to the change extraction circuit 408 and the power comparison circuit 409, and the outputs of the power calculation circuits 405 and 406 are sent to the power comparison circuit 409. The change extraction circuit 408 obtains the differential coefficient of the power sent from the power calculation circuit 404 and sends it as power change information to the block size primary determination circuit 410 and the memory 407. The memory 407 stores the power change information sent from the change extraction circuit 408 for at least the maximum time of the orthogonal transform block sizes described above. This is because temporally adjacent orthogonal transform blocks influence each other through the windowing applied in the orthogonal transform, so the block size primary determination circuit 410 needs the power change information of the temporally adjacent preceding block. The block size primary determination circuit 410 determines the orthogonal transform block size of the frequency band concerned from the temporal displacement of the power within that band, on the basis of the power change information of the current block sent from the change extraction circuit 408 and the power change information of the temporally adjacent preceding block sent from the memory 407. When a displacement of a certain magnitude or more is observed, a temporally shorter orthogonal transform block size is selected; the effect is obtained even if this displacement point (boundary value) is fixed. It is more effective, however, to make the boundary value proportional to frequency, so that at high frequencies a temporally short block size is selected only for a large displacement, while at low frequencies a temporally short block size is selected for a smaller displacement than at high frequencies. This boundary value desirably varies smoothly, but it may also vary stepwise in a plurality of steps. The block size determined in this way is transmitted to the block size correction circuit 411.
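The chain of power calculation, change extraction and primary block size decision can be sketched in Python. This is a simplified illustration under stated assumptions, not the circuit of FIG. 4: power is taken as the sum of squared samples per segment, the "differential coefficient" is approximated by the difference between consecutive segment powers, only a two-way long/short choice is made, and all names and threshold values are hypothetical.

```python
def segment_powers(samples, seg_len):
    """Integrate squared amplitude over consecutive segments of seg_len samples."""
    return [
        sum(s * s for s in samples[i:i + seg_len])
        for i in range(0, len(samples) - seg_len + 1, seg_len)
    ]


def power_changes(powers):
    """Approximate the derivative of power by differences between adjacent segments."""
    return [powers[i] - powers[i - 1] for i in range(1, len(powers))]


def boundary_value(band_center_khz, slope):
    """Boundary value proportional to frequency (slope is a hypothetical tuning constant)."""
    return slope * band_center_khz


def primary_block_size(prev_changes, cur_changes, threshold):
    """Choose 'short' when the largest power rise in the previous or current block reaches the threshold."""
    largest_rise = max(prev_changes + cur_changes, default=0.0)
    return "short" if largest_rise >= threshold else "long"
```

A steady signal produces zero differences and keeps the long block, whereas a sudden attack produces one large positive difference and selects the short block. Because `boundary_value` grows with frequency, a higher band needs a larger displacement before it switches.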

【００５１】一方、パワー比較回路４０９において、各
パワー算出回路４０４〜４０６より送られた各周波数帯
域のパワー情報を同時刻及び時間軸上でマスキング効果
の発生する時間幅で比較を行い、パワー算出回路４０４
の出力周波数帯域に及ぼす他の周波数帯域の影響を求
め、ブロックサイズ修正回路４１１へ伝送する。ブロッ
クサイズ修正回路４１１ではパワー比較回路４０９より
送られたマスキング情報及びディレイ４１２〜４１４か
らなるディレイ群の各タップから送られた過去のブロッ
クサイズ情報を基に、ブロックサイズ１次決定回路４１
０より送られたブロックサイズをより時間的に長いブロ
ックサイズを選択するよう修正をかけ、ディレイ４１２
及びウィンドウ形状決定回路４１５へ出力している。ブ
ロックサイズ修正回路４１１における作用は、該当周波
数帯域においてプリエコーが問題となる場合でも、他の
周波数帯域、特に該当周波数帯域より低い帯域におい
て、大きな振幅を持つ信号が存在した場合、そのマスキ
ング効果により、プリエコーが聴感上問題とならない、
或いは問題が軽減される場合があるという特性を利用し
ている。なお、上記マスキングとは、人間の聴覚上の特
性により、ある信号によって他の信号がマスクされて聞
こえなくなる現象をいうものであり、このマスキング効
果には、時間軸上のオーデイオ信号による時間軸マスキ
ング効果と、周波数軸上の信号による同時刻マスキング
効果とがある。これらのマスキング効果により、マスキ
ングされる部分にノイズがあったとしても、このノイズ
は聞こえないことになる。このため、実際のオーデイオ
信号では、このマスキングされる範囲内のノイズは許容
可能なノイズとされる。Meanwhile, the power comparison circuit 409 compares the power information of each frequency band sent from the power calculation circuits 404 to 406, both at the same time and over the time width on the time axis within which the masking effect occurs, determines the influence of the other frequency bands on the output frequency band of the power calculation circuit 404, and transmits the result to the block size correction circuit 411. On the basis of the masking information sent from the power comparison circuit 409 and the past block size information sent from the taps of the delay group consisting of the delays 412 to 414, the block size correction circuit 411 corrects the block size sent from the block size primary determination circuit 410 so as to select a temporally longer block size, and outputs it to the delay 412 and the window shape determination circuit 415. The block size correction circuit 411 exploits the fact that, even when pre-echo would be a problem in the frequency band concerned, if a signal of large amplitude is present in another frequency band, particularly in a band lower than the band concerned, its masking effect may make the pre-echo audibly unproblematic or reduce the problem. Masking here refers to the phenomenon in which, owing to the characteristics of human hearing, one signal is masked by another and becomes inaudible. Masking effects include the temporal masking effect due to an audio signal on the time axis and the simultaneous masking effect due to a signal on the frequency axis. Because of these masking effects, any noise in the masked portion is inaudible. In an actual audio signal, therefore, noise within the masked range is regarded as allowable noise.

【００５２】次に、ディレイ４１２〜４１４では過去の
直交変換ブロックサイズを順に記録しておき、各タッ
プ、即ち、ディレイ４１２〜４１４の出力より、ブロッ
クサイズ決定回路４１１へ出力している。同時に、ディ
レイ４１２の出力は出力端子４１７へ、ディレイ４１
２、４１３の出力はウィンドウ形状決定回路４１５へ接
続している。このディレイ４１２〜４１４からの出力は
ブロックサイズ修正回路４１１においてより長い時間幅
でのブロックサイズの変化を該当ブロックのブロックサ
イズの決定に役立てる働き、例えば、過去頻繁により時
間的に短いブロックサイズが選択されている場合は、時
間的に短いブロックサイズの選択を増やし、過去におい
て時間的に短いブロックサイズの選択がなされてない場
合においては、時間的に長いブロックサイズの選択を増
やす等の判断を可能としている。なお、このディレイ群
はウィンドウ決定回路４１５及び出力端子４１７に必要
なディレイ４１２、４１３を除けば、そのタップ数は装
置の実際的な構成、規模により増減させて用いられる場
合もある。ウィンドウ形状決定回路４１５ではブロック
サイズ修正回路４１１の出力、即ち、該当ブロックの時
間的に隣接する１つ後のブロックサイズととディレイ４
１２の出力、即ち、該当ブロックのブロックサイズとデ
ィレイ４１３の出力、即ち、該当ブロックの時間的隣接
する１つ前のブロックサイズとから、上述の図２の各Ｍ
ＤＣＴ回路２０３〜２０５において使用されるウィンド
ウの形状を決定し、出力端子４１６へ出力する。図４の
出力端子４１７、即ち、ブロックサイズ情報と出力端子
４１６、即ち、ウィンドウ形状情報が、図２のブロック
サイズ決定回路２０６〜２０８の出力として各部へ接続
される。Next, the delays 412 to 414 record the past orthogonal transform block sizes in order and output them from their taps, that is, from the outputs of the delays 412 to 414, to the block size correction circuit 411. At the same time, the output of the delay 412 is connected to the output terminal 417, and the outputs of the delays 412 and 413 are connected to the window shape determination circuit 415. The outputs from the delays 412 to 414 allow the block size correction circuit 411 to use changes in block size over a longer time span when determining the block size of the block concerned: for example, if temporally short block sizes have been selected frequently in the past, the selection of temporally short block sizes is increased, and if no temporally short block size has been selected in the past, the selection of temporally long block sizes is increased. Apart from the delays 412 and 413, which are needed for the window shape determination circuit 415 and the output terminal 417, the number of taps in this delay group may be increased or decreased according to the practical configuration and scale of the apparatus. The window shape determination circuit 415 determines the shape of the window used in the MDCT circuits 203 to 205 of FIG. 2 from the output of the block size correction circuit 411, that is, the block size of the block temporally following the block concerned, the output of the delay 412, that is, the block size of the block concerned, and the output of the delay 413, that is, the block size of the block temporally preceding the block concerned, and outputs it to the output terminal 416. The output terminal 417 of FIG. 4, that is, the block size information, and the output terminal 416, that is, the window shape information, are connected to the respective parts as the outputs of the block size determination circuits 206 to 208 of FIG. 2.

【００５３】The window shape determined by the window shape determination circuit 415 is described next. FIG. 5 shows adjacent blocks and the shapes of their windows. As can be seen from (a) to (c) of FIG. 6, the windows used for the orthogonal transform, drawn with dotted and solid lines in the figure, overlap with the windows of temporally adjacent blocks. This embodiment adopts a shape that overlaps as far as the center of the adjacent block, so the window shape changes according to the orthogonal transform size of the adjacent blocks.

【００５４】FIG. 6 shows the window shape in detail. In FIG. 6, the window functions f(n) and g(n+N) are given as functions that satisfy the following equation (1).

【００５５】
$$f(n)\,f(L-1-n) = g(n)\,g(L-1-n)$$
$$f(n)\,f(n) + g(n)\,g(n) = 1, \qquad 0 \le n \le L-1 \tag{1}$$
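The two conditions of equation (1) can be checked numerically. The sketch below assumes a sine/cosine window pair, which this description does not specify; it is used only as one pair that happens to satisfy both conditions. The names `f`, `g` and `check_equation_1` are illustrative.

```python
import math

def f(n, L):
    """Hypothetical rising window: sine shape (an assumption, not from the patent)."""
    return math.sin(math.pi * (n + 0.5) / (2 * L))

def g(n, L):
    """Hypothetical falling window paired with f: cosine shape (also an assumption)."""
    return math.cos(math.pi * (n + 0.5) / (2 * L))

def check_equation_1(L, tol=1e-12):
    """Return True if both conditions of equation (1) hold for 0 <= n <= L-1."""
    for n in range(L):
        # First condition: f(n) f(L-1-n) = g(n) g(L-1-n)
        symmetric = abs(f(n, L) * f(L - 1 - n, L) - g(n, L) * g(L - 1 - n, L)) < tol
        # Second condition: f(n)^2 + g(n)^2 = 1
        power = abs(f(n, L) ** 2 + g(n, L) ** 2 - 1.0) < tol
        if not (symmetric and power):
            return False
    return True
```

Calling `check_equation_1(L)` with L set to the shorter of two adjacent transform block lengths tests the overlap region for that block pair.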

【００５６】When the adjacent transform block lengths are equal, L in equation (1) is simply the transform block length. When the adjacent transform block lengths differ, let L be the shorter transform block length and K the longer one. In the region where the windows do not overlap, the window functions are then given by the following equation (2).

【００５７】
$$f(n) = g(n) = 1, \qquad K \le n \le 3K/2 - L/2$$
$$f(n) = g(n) = 0, \qquad 3K/2 + L \le n \le 2K \tag{2}$$
Making the overlapping portion of the windows as long as possible in this way gives good frequency resolution of the spectrum in the orthogonal transform. As is clear from the above, the shape of the window used for the orthogonal transform is determined only after the orthogonal transform sizes of three temporally consecutive blocks have been fixed. Consequently, in this embodiment, the block of the signal input from the input terminals 401 to 403 of FIG. 4 and the block of the signal output from the output terminals 416 and 417 differ by one block.

【００５８】The block size determination circuits 206 to 208 of FIG. 2 can also be configured with the power calculation circuits 405 and 406 and the power comparison circuit 409 of FIG. 4 omitted. Furthermore, by fixing the window shape to that of the temporally smallest block size the orthogonal transform block can take, so that only one type of window is used, the delays 412 to 414, the block size correction circuit 411 and the window shape determination circuit 415 of FIG. 4 can be omitted as well. These omissions give a low-delay configuration, which is particularly effective in applications where processing delay is undesirable.

【００５９】Returning to FIG. 2, the spectral data on the frequency axis, or MDCT coefficient data, obtained by the MDCT processing in the MDCT circuits 203 to 205 are transmitted to the adaptive bit allocation encoding circuits 210 to 212, the block floating unit determination circuits 219 to 221, and the bit allocation calculation circuit 209. From these spectral data or MDCT coefficient data, the block floating unit determination circuits 219 to 221 determine the block floating units, that is, the units in which floating is performed, according to the degree of concentration of energy or power. The units are transmitted to the adaptive bit allocation encoding circuits 210 to 212 and, together with the block size information and window shape information described above, are output from the output terminals 216 to 218 as auxiliary information for decoding.

【００６０】FIG. 7 is a block circuit diagram showing the schematic configuration of a specific example of the block floating unit determination circuits 219 to 221 of FIG. 2. The operation of the block floating unit determination circuit is described with reference to FIG. 7. In FIG. 7, the input terminal 301 is supplied with the spectral data on the frequency axis, or MDCT coefficient data, from the MDCT circuits 203 to 205. These data are sent to the change amount calculation circuit 303. The change amount calculation circuit 303 differentiates the intensity of the spectral data or MDCT coefficient data with respect to frequency to obtain the differential coefficient, thereby calculating the amount of change at each frequency, and transmits it to the integration comparison circuit 304. The integration comparison circuit 304 accumulates the per-frequency change data in ascending order of frequency and compares the accumulated value with the threshold output from the threshold output circuit 308. When the accumulated value exceeds the threshold, the frequency at that point is transmitted to the block floating unit determination circuit 305 as the frequency of a unit boundary, the accumulated value is reset to zero, and accumulation starts again. This operation is carried out for the change data of all frequencies. The threshold output circuit 308 outputs the threshold so that the amount of change within a block floating unit stays at or below a fixed level. In this embodiment, good results are obtained with a sloped allocation in which the threshold becomes larger toward higher frequencies. Still better results are obtained if the threshold is also made variable according to the application, the input signal and so on; depending on the application, it may instead be a constant, which simplifies the configuration.
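The accumulate-and-compare procedure can be sketched as follows. This is a minimal sketch under stated assumptions rather than the circuit itself: the derivative is approximated by the absolute difference between neighbouring bins, the threshold table is linear in the bin index, and the names `sloped_thresholds` and `find_unit_boundaries` are hypothetical.

```python
def sloped_thresholds(num_bins, base, slope):
    """Hypothetical threshold table that grows linearly with the frequency index."""
    return [base + slope * k for k in range(num_bins)]

def find_unit_boundaries(spectrum, thresholds):
    """Return the bin indices at which a new block floating unit starts."""
    boundaries = []
    accumulated = 0.0
    for k in range(1, len(spectrum)):
        # The change between neighbouring bins stands in for dP/df
        # (using the absolute value is an assumption).
        accumulated += abs(spectrum[k] - spectrum[k - 1])
        if accumulated > thresholds[k]:
            boundaries.append(k)   # boundary lies between bin k-1 and bin k
            accumulated = 0.0      # reset and restart the accumulation
    return boundaries
```

With a flat threshold, both edges of an isolated peak become boundaries. With a threshold that rises toward high frequencies, the same change higher in the band may no longer trigger a split.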

【００６１】Next, the block floating unit determination circuit 305 determines the block floating units from the unit-boundary frequencies transmitted from the integration comparison circuit 304 and from the bit allocation, which is the output of the bit allocation calculation circuit 209 of FIG. 2 and is input from the input terminal 302. It transmits the units to the unit correction circuit 306 and the energy calculation circuit 307. The boundaries of the block floating units are basically determined by the boundary frequencies transmitted from the integration comparison circuit 304. However, when the bit allocation is zero in both of two adjacent units, the advantage of separating them disappears, so they are corrected to form a single block floating unit. The unit correction circuit 306 corrects the block floating unit boundaries transmitted from the block floating unit determination circuit 305 on the basis of the output of the energy calculation circuit 307 and the output of the standard unit output circuit 309, and outputs the result from the output terminal 310 to the adaptive bit allocation encoding circuits 210 to 212 and the output terminals 216 to 218 of FIG. 2. The energy calculation circuit 307 calculates the energy of each unit by accumulating, unit by unit, the spectral data or MDCT coefficient data within the units transmitted from the block floating unit determination circuit 305, and transmits it to the unit correction circuit 306. When the transmitted energy of a unit is at or below a fixed level, the unit correction circuit 306 does not make that unit independent but merges it with an adjacent unit. As a result, data of high intensity confined to a narrow frequency range form an independent unit, whereas data of low intensity in a similarly narrow range do not.
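The two merging rules described here (adjacent units that both receive zero bits, and units whose energy is at or below a fixed level) can be sketched as follows. The function name `merge_units`, the half-open `(start, end)` unit representation, and the choice to merge a weak unit into its lower-frequency neighbour are assumptions made for illustration; the description does not say which neighbour is used.

```python
def merge_units(units, bits, energies, min_energy):
    """Merge adjacent block floating units.

    units      : list of (start, end) bin ranges, half-open, in ascending order
    bits       : allocated bits per unit
    energies   : energy per unit
    min_energy : units at or below this energy are not kept independent
    """
    merged = [list(units[0])]
    merged_bits = [bits[0]]
    for (start, end), b, e in zip(units[1:], bits[1:], energies[1:]):
        both_zero = merged_bits[-1] == 0 and b == 0   # no benefit in separating
        too_weak = e <= min_energy                    # low-energy unit
        if both_zero or too_weak:
            merged[-1][1] = end                       # extend the previous unit
            merged_bits[-1] = max(merged_bits[-1], b)
        else:
            merged.append([start, end])
            merged_bits.append(b)
    return [tuple(u) for u in merged]
```

A full implementation would recompute the energy and bit allocation of each merged unit; the sketch keeps only the boundary logic.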

【００６２】The standard unit output circuit 309 outputs the standard block floating unit configuration. In the application example, the two adjacent lowest bands have the same bandwidth, and in the higher frequency bands the bandwidth is chosen wider the higher the band. The orthogonal transform is performed for each frequency band, and the resulting spectral data on the frequency axis are divided into units: in the low range, per so-called critical band, which takes the human auditory characteristics described earlier into account, and in the middle and high ranges, per band obtained by subdividing the critical bands in consideration of block floating efficiency. In the application example, these standard block floating units coincide with the bands used for quantization, which has the advantage that little data other than the compressed spectral data or MDCT coefficient data (auxiliary data) is required. If attention is paid only to the efficiency of block floating, on the other hand, making every portion with a large change an independent unit is best, but the auxiliary data then increase. The overall effective use of information is therefore determined by the trade-off with the amount of auxiliary data. For this reason, the unit correction circuit 306 of FIG. 7 evaluates, for the units corrected according to the output of the energy calculation circuit 307, the amount of auxiliary data, the efficiency of block floating, and the amount of information required for quantization, and adopts the units obtained above only when they are more efficient than the standard units.

【００６３】The effect of the block floating units is now described with reference to FIG. 8. FIG. 8(a) shows part of the spectral data or MDCT coefficient data input from the input terminal 301 of FIG. 7. Suppose that, in the standard block floating unit configuration, this frequency portion consists of three block floating units of equal frequency width, N-1, N and N+1, whose ranges are indicated by broken lines. In this state, only the datum N-1(2) in unit N-1 is larger than the other data, and the block floating coefficient is determined by this datum. Normalizing the data other than N-1(2) with that block floating coefficient is therefore inefficient: several of the most significant digits of the normalized data become zero.
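The inefficiency described here can be illustrated with a small calculation. The sketch below assumes integer magnitudes and a shared scale factor equal to the bit length of the largest value in the unit; the data and the name `wasted_bits` are hypothetical.

```python
def wasted_bits(values):
    """Leading zero bits per value when all values share one scale factor.

    The shared scale is taken as the bit length of the largest magnitude
    in the unit (an assumption made for this illustration).
    """
    shared = max(abs(v) for v in values).bit_length()
    return [shared - abs(v).bit_length() for v in values]

# One wide unit dominated by two large values (hypothetical data).
one_unit = wasted_bits([200, 180, 3, 5, 2])

# The same data split into two units at the large change.
split = wasted_bits([200, 180]) + wasted_bits([3, 5, 2])
```

In the single unit, the three small values each carry five or six leading zero bits. After the split they carry at most one.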

【００６４】In FIG. 8(b), the solid-line graph represents the differential coefficient dP/df obtained by differentiating the spectral data or MDCT coefficient data of FIG. 8(a) with respect to frequency. The bar graph in the same figure shows the accumulated value Σ(dP/df) of this differential coefficient. If the output value of the threshold output circuit 308 of FIG. 7 is the value shown by the dash-dot line in the figure, the accumulated value Σ(dP/df) exceeds the threshold between the data N-1(2) and N-1(3), so a block floating unit boundary is set there. FIG. 8(c) shows the block floating units formed by the boundaries obtained in this way. As is clear from the figure, in FIG. 8(c) the frequency width of the first unit, N'-1, is narrower and the deviation among the data within the unit is smaller, so the efficiency of block floating is improved compared with FIG. 8(a). In addition, because the changes are accumulated from low-frequency data toward high-frequency data, as described above, the unit boundary often falls immediately before a large change in the data. This makes block floating and quantization easier to control for the data just below a peak in frequency. The method therefore also matches the auditory characteristic that masking of lower-frequency sounds is less effective than masking of higher-frequency sounds, and good compression can be achieved.

【００６５】Returning to FIG. 2, the bit allocation calculation circuit 209 works from the spectral data divided in consideration of the critical bands and of the block floating units transmitted from the block floating unit determination circuits 219 to 221. Taking the so-called masking effect and the like into account, it obtains the masking amount for each of these divided bands. From this masking amount and the energy, peak value or the like of each divided band, it obtains the number of bits to allocate to each band and transmits it to the adaptive bit allocation encoding circuits 210 to 212. The adaptive bit allocation encoding circuits 210 to 212 quantize each spectral datum (or MDCT coefficient datum) according to the number of bits allocated to its band. The data encoded in this way are taken out via the output terminals 213 to 215.

【００６６】FIG. 9 is a block circuit diagram showing the schematic configuration of a specific example of the bit allocation calculation circuit 209. The operation of the bit allocation calculation circuit is described with reference to FIG. 9. In FIG. 9, the input terminal 701 is supplied with the spectral data on the frequency axis, or MDCT coefficient data, from the MDCT circuits 203 to 205 of FIG. 2. These input data are sent to the per-band energy calculation circuit 702, which obtains the energy of each divided band, defined in consideration of the masking amount, the critical bands and block floating, for example by calculating the sum of the amplitude values within the band. The peak value, the average value or the like of the amplitude values may be used instead of the energy of each band. As an example of the output of the energy calculation circuit 702, the spectrum of the sum values of the bands is shown as SB in FIG. 10. To simplify the illustration, FIG. 10 represents the divided bands as 12 bands (B1 to B12).
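The per-band measure can be sketched as below. The band edges, the function name `band_energies` and the `mode` switch are hypothetical; the three modes correspond to the sum, peak and average of the amplitude values mentioned in the text.

```python
def band_energies(spectrum, band_edges, mode="sum"):
    """Compute one value per band from the amplitude values inside it.

    band_edges : ascending bin indices; band b covers
                 spectrum[band_edges[b]:band_edges[b + 1]]
    mode       : "sum", "peak" or "mean" of the absolute amplitudes
    """
    result = []
    for lo, hi in zip(band_edges[:-1], band_edges[1:]):
        magnitudes = [abs(x) for x in spectrum[lo:hi]]
        if mode == "sum":
            result.append(sum(magnitudes))
        elif mode == "peak":
            result.append(max(magnitudes))
        elif mode == "mean":
            result.append(sum(magnitudes) / len(magnitudes))
        else:
            raise ValueError("unknown mode: " + mode)
    return result
```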

【００６７】To take the influence of the spectrum SB on masking into account, a convolution is applied in which the spectrum SB is multiplied by a predetermined weighting function and summed. For this purpose, the output of the per-band energy calculation circuit 702, that is, each value of the spectrum SB, is sent to the convolution filter circuit 703. The convolution filter circuit 703 consists, for example, of a plurality of delay elements that successively delay the input data, a plurality of multipliers (for example, 25 multipliers corresponding to the bands) that multiply the outputs of these delay elements by filter coefficients (the weighting function), and a summing adder that takes the sum of the multiplier outputs. This convolution takes the sum of the portions shown by dotted lines in FIG. 10.

【００６８】A specific example of the multiplication coefficients (filter coefficients) of the multipliers in the convolution filter circuit 703 is as follows. When the coefficient of the multiplier M corresponding to an arbitrary band is 1, the outputs of the delay elements are multiplied by 0.15 at multiplier M-1, by 0.0019 at multiplier M-2, by 0.0000086 at multiplier M-3, by 0.4 at multiplier M+1, by 0.06 at multiplier M+2, and by 0.007 at multiplier M+3. The spectrum SB is convolved in this way. Here M is an arbitrary integer from 1 to 25.
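The convolution with these seven coefficients can be sketched as follows. The coefficient values are those given in the text. How the tap labels map onto band indices is an assumption: here the tap labeled M+k is applied to the value of band M+k when the output for band M is formed, and bands outside the range contribute nothing. The names `TAPS` and `spread_spectrum` are hypothetical.

```python
# Filter coefficients from the description, keyed by tap offset k (multiplier M+k).
TAPS = {-3: 0.0000086, -2: 0.0019, -1: 0.15, 0: 1.0, 1: 0.4, 2: 0.06, 3: 0.007}

def spread_spectrum(sb):
    """Convolve the per-band spectrum SB with the weighting function."""
    out = []
    for m in range(len(sb)):
        total = 0.0
        for k, coeff in TAPS.items():
            j = m + k
            if 0 <= j < len(sb):          # bands outside the range are ignored
                total += coeff * sb[j]
        out.append(total)
    return out
```

Under this mapping, a single non-zero band contributes 0.4 of its value to the output of the band below it and 0.15 to the band above it. If the delay line is ordered the other way round, the offsets are simply mirrored.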

【００６９】The output of the convolution filter circuit 703 is then sent to the subtractor 704. The subtractor 704 obtains the level α, which corresponds to the allowable noise level (described later) in the convolved domain. As described later, the level α corresponding to the allowable noise level is the level that becomes the allowable noise level of each critical band once deconvolution has been performed. The subtractor 704 is supplied with an allowance function (a function expressing the masking level) for obtaining the level α. The level α is controlled by increasing or decreasing this allowance function. The allowance function is supplied from the (n-ai) function generation circuit 705 described next.

【００７０】That is, if i is the number assigned to the critical bands in order from the lowest band, the level α corresponding to the allowable noise level is obtained from the following equation (3).
$$\alpha = S - (n - a\,i) \tag{3}$$
In equation (3), n and a are constants with a > 0, S is the intensity of the convolved Bark spectrum, and (n - ai) is the allowance function. In this embodiment, n = 38 and a = 1; with these values there was no degradation of sound quality and good coding was achieved.
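Equation (3) is a per-band subtraction and can be sketched directly. The sketch assumes that the band number i starts at 1 for the lowest band and that S is expressed in logarithmic (dB-like) units, neither of which is stated explicitly; the function name `allowed_noise_level` is hypothetical.

```python
def allowed_noise_level(S, n=38.0, a=1.0):
    """Level alpha = S - (n - a*i) for each band, with i counted from 1.

    S    : convolved Bark spectrum, one value per band, lowest band first
    n, a : constants of the allowance function (n = 38, a = 1 in the embodiment)
    """
    return [s - (n - a * i) for i, s in enumerate(S, start=1)]
```

For a flat S of 60 in every band, the first three bands give 23, 24 and 25: the allowance shrinks by a per band, so α rises toward higher bands.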

【００７１】The level α is obtained in this way, and the data are transmitted to the divider 706. The divider 706 deconvolves the level α in the convolved domain. This deconvolution yields the masking spectrum from the level α; that is, the masking spectrum becomes the allowable noise spectrum. Deconvolution normally requires complicated computation, but in this embodiment it is carried out with the simplified divider 706.

【００７２】The masking spectrum is then transmitted to the subtractor 708 via the synthesis circuit 707. The subtractor 708 is also supplied, via the delay circuit 709, with the output of the per-band energy calculation circuit 702, that is, the spectrum SB described above. The subtractor 708 subtracts the masking spectrum from the spectrum SB, so that, as shown in FIG. 11, the portion of the spectrum SB at or below the level of the masking spectrum MS is masked.

【００７３】The output of the subtractor 708 is taken out through the allowable noise correction circuit 710 and the output terminal 711 and is sent, for example, to a ROM or the like (not shown) in which allocated bit number information is stored in advance. According to the output obtained from the subtractor 708 through the allowable noise correction circuit 710 (the level of the difference between the energy of each band and the output of the noise level setting means), this ROM or the like outputs the allocated bit number information for each band. This information is sent to the adaptive bit allocation encoding circuits 210 to 212 of FIG. 2, so that each spectral datum on the frequency axis from the MDCT circuits 203 to 205 of FIG. 2 is quantized with the number of bits allocated to its band.
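The table lookup performed by the ROM can be approximated with a simple rule for illustration. The sketch below is not the stored table, whose contents are not given. It assumes that levels are in dB and that each bit of word length lowers the quantization noise by about 6.02 dB, a common rule of thumb. The name `bits_from_difference` and the 16-bit cap are hypothetical.

```python
import math

DB_PER_BIT = 6.02  # approximate noise reduction per added bit (rule of thumb)

def bits_from_difference(band_energy_db, allowed_noise_db, max_bits=16):
    """Map the energy-minus-allowed-noise difference of each band to a bit count."""
    bits = []
    for energy, noise in zip(band_energy_db, allowed_noise_db):
        diff = energy - noise
        if diff <= 0:
            bits.append(0)    # the band lies below the allowable noise level
        else:
            bits.append(min(max_bits, math.ceil(diff / DB_PER_BIT)))
    return bits
```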

【００７４】In summary, the adaptive bit allocation encoding circuits 210 to 212 of FIG. 2 quantize the spectral data of each band with the number of bits allocated according to the level of the difference between the energy of each divided band, defined in consideration of the masking amount, the critical bands and block floating, and the output of the noise level setting means. The delay circuit 709 of FIG. 9 is provided to delay the spectrum SB from the energy calculation circuit 702 by the amount of delay in the circuits that precede the synthesis circuit 707.

【００７５】In the synthesis performed by the synthesis circuit 707, the masking spectrum MS can be combined with data supplied from the minimum audible curve generation circuit 712 that represent the so-called minimum audible curve RC, a human auditory characteristic, as shown in FIG. 12. Noise whose absolute level is at or below this minimum audible curve cannot be heard. Even for identical coding, the minimum audible curve differs with, for example, the playback volume. In a practical digital system, however, there is little variation in how music fits into, say, a 16-bit dynamic range. If the quantization noise in the frequency band to which the ear is most sensitive, around 4 kHz, is inaudible, quantization noise below the level of the minimum audible curve can therefore be assumed to be inaudible in the other frequency bands as well. Accordingly, assuming that the system is used so that, for example, noise around 4 kHz at the system's word length is inaudible, and obtaining the allowable noise level by combining the minimum audible curve RC with the masking spectrum MS, the allowable noise level can extend up to the hatched portion in FIG. 12. In this embodiment, the level of the minimum audible curve at 4 kHz is matched to the lowest level equivalent to, for example, 20 bits. FIG. 12 also shows the signal spectrum SS.
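One plausible reading of this synthesis is a per-band maximum: noise is acceptable if it lies below either the masking spectrum or the minimum audible curve. The sketch below shows that reading. The use of `max` and the name `combine_allowed_noise` are assumptions, since the description says only that the two curves are combined.

```python
def combine_allowed_noise(masking_spectrum, minimum_audible):
    """Combine MS and RC per band by taking the higher of the two levels.

    Both inputs are lists of levels in the same units, one entry per band.
    Taking the maximum is an assumption about what "synthesis" means here.
    """
    return [max(ms, rc) for ms, rc in zip(masking_spectrum, minimum_audible)]
```

In a band where the signal provides little masking, the minimum audible curve sets the allowable noise; elsewhere the masking spectrum does.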

【００７６】The allowable noise correction circuit 710 corrects the allowable noise level in the output of the subtractor 708 on the basis of, for example, equal loudness curve information sent from the correction information output circuit 713. The equal loudness curve is a characteristic curve of human hearing. It is obtained, for example, by finding the sound pressure at each frequency that sounds as loud as a 1 kHz pure tone and joining these points with a curve; it is also called the equal-sensitivity curve of loudness. The equal loudness curve follows roughly the same shape as the minimum audible curve RC shown in FIG. 12. On the equal loudness curve, a sound around 4 kHz, for example, sounds as loud as one at 1 kHz even when its sound pressure is 8 to 10 dB lower, whereas a sound around 50 Hz does not sound equally loud unless its sound pressure is about 15 dB higher than at 1 kHz. It follows that noise exceeding the level of the minimum audible curve (the allowable noise level) should have a frequency characteristic given by a curve corresponding to the equal loudness curve. Correcting the allowable noise level in consideration of the equal loudness curve is thus well suited to human hearing.

【００７７】Furthermore, the correction information output circuit 713 corrects the allowable noise level on the basis of information on the error between the detected amount of output information (data amount) produced by quantization in the adaptive bit allocation encoding circuits 210 to 212 and the bit rate target value of the final encoded data. The total number of bits obtained by a preliminary, provisional adaptive bit allocation over all bit allocation unit blocks may deviate from the fixed number of bits (the target value) determined by the bit rate of the final encoded output data, and the bits are reallocated so that this error becomes zero. That is, when the total number of allocated bits is smaller than the target value, the difference is distributed among the unit blocks and added; when the total number of allocated bits is larger than the target value, the difference is distributed among the unit blocks and removed.
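The reallocation toward a fixed bit budget can be sketched as a simple greedy loop. This is a simplification under stated assumptions: it adjusts the word length of each unit block directly instead of correcting the allowable noise level, it removes bits starting from the highest unit, and it adds bits starting from the lowest. The name `fit_to_budget` and its parameters are hypothetical.

```python
def fit_to_budget(word_lengths, widths, target_bits, max_word_length=16):
    """Adjust per-unit word lengths so the total bit count meets the target.

    word_lengths : provisional bits per sample for each unit block
    widths       : number of samples in each unit block
    target_bits  : total number of bits allowed by the output bit rate
    """
    wl = list(word_lengths)

    def total():
        return sum(w * n for w, n in zip(wl, widths))

    # Over budget: remove one bit at a time, cycling from the highest unit down.
    i = len(wl) - 1
    while total() > target_bits and any(w > 0 for w in wl):
        if wl[i] > 0:
            wl[i] -= 1
        i = (i - 1) % len(wl)

    # Under budget: add one bit at a time wherever it still fits.
    changed = True
    while changed:
        changed = False
        for j in range(len(wl)):
            if wl[j] < max_word_length and total() + widths[j] <= target_bits:
                wl[j] += 1
                changed = True
    return wl
```

A perceptually weighted order, for example one guided by the equal loudness curve, would be closer to the correction described in the text; the fixed order here only keeps the sketch short.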

[0078] To perform the above operation, the error of the total number of allocated bits from the target value is detected, and the correction information output circuit 713 outputs correction data for correcting each allocated bit count according to this error data. When the error data indicates a shortage of bits, many bits are being used per unit block, and the data amount can be considered to exceed the target value. When the error data indicates a surplus of bits, few bits suffice per unit block, and the data amount can be considered to fall below the target value. Accordingly, the correction information output circuit 713 outputs, in accordance with this error data, correction-value data for correcting the allowable noise level in the output of the subtractor 708, based for example on the information data of the equal-loudness curve. This correction value is transmitted to the allowable noise correction circuit 710, so that the allowable noise level from the subtractor 708 is corrected. In the system described above, two kinds of data are obtained and sent from the encoder to the decoder: the main information, which is the orthogonal transform output spectrum processed by the sub information, and the sub information, which consists of the scale factor indicating the block floating state and the word length indicating the number of bits per word.
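As a non-limiting illustration of the re-allocation described above, the following Python sketch distributes the difference between a provisional total and the target value over the unit blocks. It is a simplification under stated assumptions: the embodiment corrects the allowable noise level through circuits 713 and 710, whereas this sketch adjusts per-block bit counts directly. The function name `adjust_allocation`, the round-robin order, and the `max_bits` limit are hypothetical and do not appear in the specification.

```python
def adjust_allocation(bits, target_total, max_bits=16):
    """Add or remove bits, one per unit block in turn, until the total matches the target.

    bits         -- provisional bit count of each unit block (hypothetical representation)
    target_total -- fixed number of bits set by the output bit rate
    max_bits     -- assumed upper limit per unit block (not taken from the specification)
    """
    bits = list(bits)
    diff = target_total - sum(bits)      # > 0: bits left over to add, < 0: bits to remove
    step = 1 if diff > 0 else -1
    i = 0
    stalled = 0                          # consecutive blocks that could not be changed
    while diff != 0 and stalled < len(bits):
        candidate = bits[i] + step
        if 0 <= candidate <= max_bits:
            bits[i] = candidate
            diff -= step
            stalled = 0
        else:
            stalled += 1                 # block already at its limit, try the next one
        i = (i + 1) % len(bits)
    return bits


provisional = [4, 4, 4]
adjusted = adjust_allocation(provisional, target_total=14)
print(adjusted, sum(adjusted))
```

If every block is already at its limit, the loop stops and the remaining error is left uncorrected; a practical encoder would need some other policy for that case.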

[0079] FIG. 13 shows the ATC decoder 73 of FIG. 1, that is, a decoding circuit that decodes either a reproduced signal obtained by reproducing a signal that was high-efficiency encoded as described above and recorded on the recording medium of the present invention, or a received signal transmitted through a transmission medium. The quantized MDCT coefficients of each band, that is, data equivalent to the output signals at output terminals 213 to 215 of FIG. 2, are supplied to the decoding circuit input 107. The block size information that was used, together with the information on the frequency-direction length of the small blocks for block floating and quantization, that is, data equivalent to the output signals at output terminals 216 to 218 of FIG. 2, is supplied to the input terminal 108. The adaptive bit allocation decoding circuit 106 releases the bit allocation using the adaptive bit allocation information. Next, the inverse orthogonal transform (IMDCT) circuits 103 to 105 convert the signals on the frequency axis into signals on the time axis. These partial-band time-axis signals are decoded into a full-band signal by the band synthesis filter (IQMF) circuits 102 and 101.
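The role of the scale factor and word length in releasing the bit allocation can be illustrated with a minimal Python sketch of block floating for a single small block. This is an assumption-laden example, not the format used by the embodiment: the power-of-two scale factor, the symmetric rounding quantizer, and the function names `block_float_encode` and `block_float_decode` are all hypothetical.

```python
import math


def block_float_encode(coeffs, word_length):
    """Quantize one small block with a shared scale factor (assumed to be a power of two)."""
    peak = max(abs(c) for c in coeffs)
    # Exponent of the smallest power of two that is not below the block peak.
    exponent = 0 if peak == 0 else math.ceil(math.log2(peak))
    scale = 2.0 ** exponent
    levels = 2 ** (word_length - 1) - 1          # largest code magnitude (word_length >= 2)
    codes = [round(c / scale * levels) for c in coeffs]
    return exponent, codes


def block_float_decode(exponent, codes, word_length):
    """Undo the floating: rescale the integer codes with the transmitted sub information."""
    scale = 2.0 ** exponent
    levels = 2 ** (word_length - 1) - 1
    return [q / levels * scale for q in codes]


block = [0.5, -0.25, 0.1]
exponent, codes = block_float_encode(block, word_length=8)
restored = block_float_decode(exponent, codes, word_length=8)
print(exponent, codes, restored)
```

Here the codes play the part of the main information, while `exponent` and `word_length` correspond to the sub information (scale factor and word length) that the decoder needs before the inverse orthogonal transform.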

[0080] The present invention is not limited to the above embodiment. For example, the recording/reproducing medium need not be integrated with the signal compression device or the expansion device, nor need the signal compression device be integrated with the expansion device; instead of passing through a recording medium, they can be connected by a data transfer line, an optical cable, or communication by light or radio waves. Furthermore, the invention is applicable not only to audio PCM signals but also to signal processing devices for digital voice (speech) signals, digital video signals, and the like.

[0081] The recording medium of the present invention records data compressed by the above digital signal compression device, so that its recording capacity is used effectively. The recording medium of the present invention is not limited to the optical disk described above; it may be any of various recording media such as a magnetic disk, an IC memory, a card incorporating such a memory, or a magnetic tape.

【００８２】[0082]

[Effects of the Invention] As is apparent from the above description, the digital signal compression method and apparatus of the present invention address input signals for which the efficiency of block floating and/or bit allocation would otherwise vary widely, that is, signals in which the data on the frequency axis within each small block (subdivided in time and frequency for block floating and/or bit allocation) show extreme variation in magnitude or contain prominent peak components. For such signals, the frequency size of the small blocks is selected so as to keep that variation in efficiency small, which yields a bit allocation with little variation in compression efficiency. This prevents a loss of compression efficiency: better sound quality is obtained at the same bit rate, and recording or transmission at a lower bit rate becomes possible at the same sound quality. Highly efficient compression and expansion with perceptually superior sound quality can therefore be performed.
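The effect of a prominent peak inside a small block can be made concrete with a small numerical sketch in Python. The spectrum values, the 4-bit word length, and the helper `quantization_error` are invented for illustration and are not taken from the specification; the sketch only compares the quantization error of one wide block against the same data split into two narrower blocks, each with its own scale.

```python
def quantization_error(block, word_length):
    """Mean squared error after block floating with one shared scale (the block peak)."""
    peak = max(abs(x) for x in block) or 1.0     # avoid dividing by zero for a silent block
    levels = 2 ** (word_length - 1) - 1
    decoded = [round(x / peak * levels) / levels * peak for x in block]
    return sum((x - y) ** 2 for x, y in zip(block, decoded)) / len(block)


# Hypothetical spectrum: small components plus one prominent peak (0.9).
spectrum = [0.01, 0.012, 0.008, 0.011, 0.9, 0.02, 0.015, 0.01]

# One wide block: the peak sets the scale for all eight components.
wide = quantization_error(spectrum, word_length=4)

# Two narrower blocks of equal size: only the block containing the peak uses the large scale.
narrow = (quantization_error(spectrum[:4], word_length=4)
          + quantization_error(spectrum[4:], word_length=4)) / 2

print(wide, narrow)
```

With these made-up values the split gives the smaller error at the same word length, because the low-level components in the first half are no longer quantized against the peak. The split also costs an extra scale factor and word length as sub information, which is why the choice of block size has to weigh both quantities.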

[0083] Further, the recording medium of the present invention, which records data compressed by the digital signal compression apparatus of the present invention, uses its storage capacity more effectively than conventional media. Likewise, a path that transmits data compressed by the digital signal compression apparatus of the present invention achieves higher line utilization than conventional paths.

[Brief description of drawings]

FIG. 1 is a block circuit diagram showing a configuration example of a compressed data recording/reproducing apparatus (disk recording/reproducing apparatus) as an embodiment of the digital signal compression apparatus of the present invention.

FIG. 2 is a block circuit diagram showing a specific example of a high-efficiency compression encoder usable for the bit-rate compression encoding of the present embodiment.

FIG. 3 is a diagram showing the structure of the orthogonal transform blocks used in bit compression.

FIG. 4 is a block circuit diagram showing a configuration example of a circuit that determines the orthogonal transform block size.

FIG. 5 is a diagram showing the relationship between changes in the temporal length of temporally adjacent orthogonal transform blocks and the window shape used in the orthogonal transform.

FIG. 6 is a diagram showing a detailed example of the shape of the window used in the orthogonal transform.

FIG. 7 is a block circuit diagram showing a configuration example of a circuit that determines the block floating unit.

FIG. 8 is a diagram showing the spectrum, differentiation, and integration data that illustrate, step by step, the operation of the block floating decision circuit in the present embodiment.

FIG. 9 is a block circuit diagram showing an example of a bit allocation calculation circuit that uses a convolution operation to realize the bit allocation calculation function.

FIG. 10 is a diagram showing the spectrum of the bands divided in consideration of the critical bands and block floating.

FIG. 11 is a diagram showing a masking spectrum.

FIG. 12 is a diagram in which the minimum audible curve and the masking spectrum are combined.

FIG. 13 is a block circuit diagram showing a specific example of a high-efficiency compression decoder usable for the bit-rate compression encoding of the above embodiment.

[Explanation of symbols]

201, 202: Band division filter
203 to 205: Orthogonal transform circuit (MDCT)
206 to 208: Block determination circuit
209: Bit allocation calculation circuit
210 to 212: Adaptive bit allocation encoding circuit
303: Change amount calculation circuit
304: Integration comparison circuit
305: Block floating unit determination circuit
306: Unit correction circuit
307: Energy calculation circuit
308: Threshold output circuit
309: Standard unit output circuit
404 to 406: Power calculation circuit
407: Memory
408: Change extraction circuit
409: Power comparison circuit
410: Block size primary determination circuit
411: Block size correction circuit
412 to 414: Delay
415: Window shape determination circuit

Claims

[Claims]

1. A digital signal compression method for compressing the information of a digital signal, comprising: calculating a characteristic of an input signal on the frequency axis; distributing the input signal into small blocks subdivided in time and frequency and applying floating for compression; and controlling the frequency size of the small blocks to which the floating is applied in accordance with the characteristic of the input signal on the frequency axis, thereby performing optimum floating.

2. A digital signal compression method for compressing the information of a digital signal, comprising: obtaining spectrum data of an input signal on the frequency axis; obtaining an allowable noise spectrum from the spectrum data; distributing the obtained allowable noise spectrum into small blocks subdivided in time and frequency and allocating bits for compression; and changing the frequency size of the small blocks used for the bit allocation in accordance with a characteristic of the input signal on the frequency axis, thereby optimizing the bit allocation.

3. A digital signal compression method for compressing the information of a digital signal, comprising the steps of: calculating a characteristic of an input signal on the frequency axis; distributing the input signal into small blocks subdivided in time and frequency and applying floating for compression; obtaining spectrum data on the frequency axis; obtaining an allowable noise spectrum from the spectrum data; distributing the allowable noise spectrum into small blocks subdivided in time and frequency and allocating bits for compression; and changing, in accordance with the characteristic of the input signal on the frequency axis, the frequency size of the small blocks subdivided in time and frequency for the floating and/or of the small blocks subdivided in time and frequency for the bit allocation, wherein block floating and bit allocation are optimized by changing the small blocks for block floating and the small blocks for bit allocation either together or independently.

4. A frequency size of a small block subdivided in time and frequency for performing block floating for compression and / or a subdivided small block in time and frequency for bit allocation for compression is determined in advance. , A small block for performing floating for compression and / or a small block for allocating bits for compression according to the compression efficiency when adopting this predetermined size and the characteristics of the input signal on the frequency axis 2. The compression efficiency when the frequency size of is changed is compared, and the size of a small block that can perform information compression with higher efficiency is selected according to the comparison result. The digital signal compression method according to claim 3.

5. The digital signal compression method according to claim 3 or 4, wherein the small blocks subdivided in time and frequency for block floating and the small blocks subdivided in time and frequency for bit allocation are configured in common or independently in accordance with the characteristic of the input signal on the frequency axis.

6. The time length of a processing block for information compression is made variable by adapting to an input signal, and a change in the input signal of the processing block and a change in the input signal of another processing block, and / or power. , Or the time length of the processing block is determined based on energy or peak information.
A method for compressing a digital signal according to item.

7. A time length of a processing block for information compression is variable in accordance with an input signal, and a change in the input signal of the processing block and an input signal having a time width longer than the maximum of the processing block in terms of time. 6. The digital signal compression method according to claim 1, wherein the temporal length of the processing block is determined based on the change information of the input signal obtained by.

8. A time length of a processing block for information compression is made variable in accordance with an input signal, and a change of an input signal of the processing block and a change of an input signal of another processing block, and / or a power. Alternatively, the time length of the processing block is determined based on the energy or peak information, or the time length of the processing block is made variable by adapting to the input signal, and the input signal of the processing block is changed. The length of the processing block is determined based on change information and change information of the input signal obtained from an input signal having a time width longer than the maximum of the processing block in terms of time. 5. The digital signal compression method according to any one of 5 above.

9. When determining the processing block length, the ratio that is involved in the determination of the element that determines the temporal length of the processing block is fixed or applied to the input signal, and / or
9. The digital signal compression method according to claim 8, wherein the digital signal compression method is used in combination or alone at a predetermined ratio.

10. The input signal is an audio signal, and the frequency width of a block that controls the generation of at least most of the quantization noise is made wider in a higher frequency range. 10. The digital signal compression method according to any one of 1.

11. The digital signal compression method according to claim 10, wherein an orthogonal transform is used to divide a time-axis signal into a plurality of bands on the frequency axis, and the shape of the window function used in the orthogonal transform is changed together with the change of the orthogonal transform size.

12. When dividing the time axis signal into a plurality of bands on the frequency axis, the time axis signal is first divided into a plurality of bands, and each divided band is made up of a plurality of samples. A coefficient is obtained by forming a block and performing orthogonal transformation for each block of each band.
1. The digital signal compression method described in 1.

13. The digital signal compression method according to claim 12, wherein the division frequency width used in dividing the time-axis signal before orthogonal transformation into a plurality of bands on the frequency axis is made generally wider toward higher frequency bands.

14. The digital signal compression method according to claim 13, wherein the division frequency widths are the same in the two consecutive lowest bands.

15. The digital signal compression method according to claim 14, wherein, in the bit allocation, allocation of compression code to main information and/or sub information is blocked for signal components in bands substantially at or above the signal pass band.

16. The digital signal compression method according to claim 12, wherein a quadrature mirror filter is used for the division into the plurality of bands.

17. The method according to claim 11, wherein a modified discrete cosine transform is used as the orthogonal transform.
7. The digital signal compression method according to any one of 6 above.

18. When determining a temporal length of a processing block based on a change of an input signal of the processing block, a predetermined boundary value for determining the temporal length is set as an amplitude of the input signal,
10. The digital signal compression method according to claim 6, wherein the digital signal compression method is variable according to the frequency.

19. The digital signal compression method according to claim 18, wherein the boundary value takes a plurality of stepwise values according to the amplitude and frequency of the input signal.

20. When determining the processing block length,
The auditory characteristic that the signal of the other processing block exerts on the signal of the processing block is calculated by using the energy and / or power or peak information of the spectrum on the frequency axis and / or the orthogonal transformation coefficient, and the processing concerned. 9. The digital signal compression method according to claim 6, wherein the time length of the block is determined.

21. Allocation of bits for compression of spectrum and / or orthogonal transform coefficients on the frequency axis used when calculating the auditory characteristics of the signal of the other processing block on the signal of the processing block. 21. The digital signal compression method according to claim 20, wherein the digital signal compression method is shared with a spectrum on a time axis after orthogonal transformation and / or an orthogonal transformation coefficient used for block floating.

22. A digital signal compression method having the functions of the digital signal compression method according to claim 18 and the digital signal compression method according to claim 20.

23. The digital signal compression method according to claim 7, 8, 9, 18, 19, 20, 21 or 22, wherein, when the temporal length of a processing block is determined on the basis of the change in the input signal of the processing block, the determination uses a periodic change of the input signal and/or a repeating pulse or periodic characteristic.

24. A digital signal compression apparatus for compressing the information of a digital signal, comprising: characteristic calculation means for calculating a characteristic of an input signal on the frequency axis; and block floating means for distributing the input signal into small blocks subdivided in time and frequency and applying floating for compression, wherein the frequency size of the small blocks to which the floating is applied is controlled in accordance with the characteristic of the input signal on the frequency axis so that optimum floating is performed.

25. A digital signal compression apparatus for compressing the information of a digital signal, comprising: spectrum data forming means for obtaining spectrum data of an input signal on the frequency axis; allowable noise spectrum calculating means for obtaining an allowable noise spectrum from the spectrum data; and bit allocation means for distributing the allowable noise spectrum calculated by the allowable noise spectrum calculating means into small blocks subdivided in time and frequency and allocating bits for compression, wherein the bit allocation is optimized by changing the frequency size of the small blocks used for the bit allocation in accordance with a characteristic of the input signal on the frequency axis.

26. A digital signal compression apparatus for compressing the information of a digital signal, comprising: characteristic calculation means for calculating a characteristic of an input signal on the frequency axis; block floating means for distributing the input signal into small blocks subdivided in time and frequency and applying floating for compression; spectrum data forming means for obtaining spectrum data on the frequency axis; allowable noise spectrum calculating means for obtaining an allowable noise spectrum from the spectrum data; bit allocation means for distributing the allowable noise spectrum obtained by the allowable noise spectrum calculating means into small blocks subdivided in time and frequency and allocating bits for compression; and size control means for changing, in accordance with the characteristic of the input signal on the frequency axis, the frequency size of the small blocks subdivided in time and frequency for the floating and/or of the small blocks subdivided in time and frequency for the bit allocation, wherein block floating and bit allocation are optimized by changing the small blocks for block floating and the small blocks for bit allocation either together or independently.

27. A small block which is subdivided in time and frequency for applying block floating for compression, and / or
Alternatively, the frequency size of a small block subdivided with respect to the time and frequency for bit allocation for compression is set in advance, and the compression efficiency and the characteristics of the input signal on the frequency axis when this predetermined size is adopted. According to the above, a comparison is made with the compression efficiency when the frequency size of the small block to be subjected to the floating for compression and / or the small block to which the bit allocation for the compression is changed is changed. 27. The digital signal compressing apparatus according to claim 24, further comprising block determining means for selecting a size of a small block capable of performing information compression with higher efficiency.

28. The digital signal compression apparatus according to claim 26 or 27, wherein the small blocks subdivided in time and frequency for block floating and the small blocks subdivided in time and frequency for bit allocation are configured in common or independently in accordance with the characteristic of the input signal on the frequency axis.

29. A time length of a processing block for information compression is made variable according to an input signal, and a change of an input signal of the processing block and a change of an input signal of another processing block, and / or a power. Or a processing block length determining means for determining the temporal length of the processing block based on energy or peak information.
The digital signal compression device according to any one of claims 4 to 28.

30. A time length of a processing block for information compression is made variable according to an input signal, and a change in the input signal of the processing block and an input signal having a time width longer than the maximum of the processing block in terms of time. The processing block length determining means for determining the temporal length of the processing block based on the change information of the input signal obtained by the above step is provided.
The digital signal compression device according to any one of claims 4 to 28.

31. A time length of a processing block for information compression is made variable by adapting to an input signal, and a change of an input signal of the processing block and a change of an input signal of another processing block and / or a power. Alternatively, the function of determining the temporal length of the corresponding processing block based on energy or peak information, and the temporal length of the processing block is made variable by adapting to the input signal, and the input signal of the processing block is changed. A processing block length determining means having a function of determining the length of the processing block on the basis of change information and change information of the input signal obtained by an input signal having a time width longer than the maximum of the processing block in terms of time. 29. The digital signal compression apparatus according to claim 24, wherein the digital signal compression apparatus is a digital signal compression apparatus.

32. The processing block length determining means uses a combination of a ratio, which is fixed or adapted to an input signal, and / or a predetermined ratio, which is involved in the determination of an element that determines the temporal length of a processing block. 32. The digital signal compression device according to claim 31, which is also used alone.

33. The input signal is an audio signal, and the frequency width of a block that controls the generation of at least most of the quantization noise is made wider in a higher frequency range. One of
The digital signal compression device according to the item.

34. The digital signal compression apparatus according to claim 33, further comprising: division orthogonal transform means for dividing a time-axis signal into a plurality of bands on the frequency axis by using an orthogonal transform; and orthogonal transform size and window function shape changing means for making the orthogonal transform size variable and changing, together with it, the shape of the window function used in the orthogonal transform.

35. The digital signal compression apparatus according to claim 34, wherein the division orthogonal transform means first divides the time-axis signal into a plurality of bands, forms blocks each made up of a plurality of samples for each divided band, and obtains coefficient data by performing an orthogonal transform on each block of each band.

36. The digital signal compression apparatus according to claim 35, wherein the division frequency width used in dividing the time-axis signal before orthogonal transformation into a plurality of bands on the frequency axis is made generally wider toward higher frequency bands.

37. The digital signal compression apparatus according to claim 36, wherein the division frequency widths are the same in the two consecutive lowest bands.

38. The allocation of a compression code to main information and / or sub-information for a signal component in a band substantially equal to or more than a signal pass band is prevented in the bit allocation. Digital signal compression device.

39. The digital signal compression apparatus according to claim 35, wherein a quadrature mirror filter is used for the division into the plurality of bands.

40. The method according to claim 34, wherein a modified discrete cosine transform is used as the orthogonal transform.
9. The digital signal compression device according to any one of 9.

41. When determining a temporal length of a processing block based on a change of an input signal of the processing block, a predetermined boundary value for determining the temporal length is set as an amplitude and a frequency of the input signal. 33. The digital signal compressing device according to claim 29, wherein the digital signal compressing device is variable according to.

42. The digital signal compression apparatus according to claim 41, wherein the boundary value takes a plurality of stepwise values according to the amplitude and frequency of the input signal.

43. The processing block length determining means determines an auditory characteristic of a signal of the other processing block on a signal of the processing block, energy of spectrum and / or orthogonal transform coefficient on a frequency axis, and / or 32. The digital signal compression apparatus according to claim 29, wherein the processing is performed using power or peak information to determine the temporal length of the processing block.

44. Bit allocation for compression of spectrum and / or orthogonal transform coefficients on the frequency axis used in calculating the auditory characteristics of the signal of the other processing block on the signal of the processing block. 44. The digital signal compression apparatus according to claim 43, wherein the digital signal compression apparatus is also used as a spectrum on a time axis after orthogonal transformation and / or an orthogonal transformation coefficient used for block floating.

45. A digital signal compressing apparatus having the functions of the digital signal compressing apparatus according to claim 41 and the digital signal compressing apparatus according to claim 43.

46. The digital signal compression apparatus according to claim 30, 31, 32, 41, 42, 43, 44 or 45, wherein, when the temporal length of a processing block is determined on the basis of the change in the input signal of the processing block, the determination uses a periodic change of the input signal and/or a repeating pulse or periodic characteristic.

47. A recording medium on which is recorded compressed data compressed by the digital signal compression method according to any one of claims 1 to 23 or by the digital signal compression apparatus according to any one of claims 24 to 46.

48. The digital signal compression method according to any one of claims 1 to 23, wherein the compressed data is transmitted.

49. The digital signal compression apparatus according to claim 24, wherein the compressed data is transmitted.