JPH0591062A

JPH0591062A - Audio signal processing method

Info

Publication number: JPH0591062A
Application number: JP3276168A
Authority: JP
Inventors: Kiyouya Tsutsui; 京弥筒井
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1991-09-30
Filing date: 1991-09-30
Publication date: 1993-04-09
Anticipated expiration: 2016-08-20
Also published as: KR100275057B1; KR930007107A; JP3200886B2

Abstract

PURPOSE: To improve sound quality by reallocating, to the quantization of the audio signal, the bits freed by parameters that are not recorded or transmitted. CONSTITUTION: For each time frame, the block floating parameters BF of each block (the scale factor SF and the word length WL data) are recorded or transmitted only up to the band that requires the parameters BF, and the number of parameters recorded or transmitted is itself recorded or transmitted.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to an audio signal processing method that compresses an audio signal by so-called block floating processing.

[0002]

[Prior Art] Among high-efficiency coding techniques that compress and encode audio signals, there is a conventional method in which the input audio signal (digital audio data) is divided, for each predetermined time frame, into a plurality of blocks on the frequency axis, so-called block floating processing is applied to each block, and the data of each block is quantized with adaptive bit allocation.

[0003] Block floating processing basically multiplies every word in a block by a common value to enlarge it, which raises the precision of quantization. In one concrete example, the largest of the absolute values of the words in the block (the maximum absolute value) is found, and floating is performed with a floating coefficient common to all words in the block, chosen so that this maximum absolute value does not saturate. A simpler variant is floating in 6 dB steps using bit shifts.
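The bit-shift variant described above can be sketched as follows. This is a minimal illustration, not the circuit of the patent: the function name `block_float`, the 16-bit word size, and the use of signed integer words are assumptions made for the example.

```python
def block_float(words, word_bits=16):
    """Shift every integer word in a block left by one common amount.

    Each one-bit shift corresponds to about 6 dB of gain. The shift is the
    largest one for which the block's maximum absolute value still fits in
    a signed word of `word_bits` bits (an assumed word size).
    """
    limit = (1 << (word_bits - 1)) - 1       # largest positive value
    peak = max(abs(w) for w in words)        # maximum absolute value in the block
    shift = 0
    # Keep increasing the shift while the peak would not saturate.
    while peak and (peak << (shift + 1)) <= limit:
        shift += 1
    return shift, [w << shift for w in words]
```

The returned shift plays the role of the floating coefficient: a decoder would shift the words right by the same amount to undo the scaling.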

[0004] On the encoder side of a system that applies an audio signal processing method using block floating, the parameters BF related to block floating are normally recorded on a medium or transmitted together with the quantized block data (hereinafter called the main information). These parameters are, for example, the value of the scale factor SF, which serves as the floating coefficient, and the word length WL data, which indicates the difference between the scale factor SF and the allowable noise level determined for each block in consideration of the so-called masking effect.

[0005] The masking effect is the phenomenon in which, owing to the characteristics of human hearing, one sound is masked by another and becomes inaudible. In other words, masking is the phenomenon in which one signal masks another so that it cannot be heard. It comprises a temporal masking effect caused by audio signals along the time axis and a simultaneous masking effect caused by signals along the frequency axis. Because of these masking effects, any noise that falls within a masked portion is not heard. In an actual audio signal, therefore, noise within the masked range is treated as allowable noise.

[0006]

[Problems to Be Solved by the Invention] On the encoder side of systems using conventional audio signal processing methods, the number of parameters BF recorded or transmitted is generally fixed for all time frames. FIG. 9 shows how data within each time frame is recorded (or transmitted) on the encoder side of a conventional audio signal processing system. In the example of FIG. 9, the parameter BF values must be recorded or transmitted even for blocks to which no bits are actually allocated. The number of bits available for the audio data is reduced accordingly, and at high compression ratios (low bit rates) in particular it was difficult to secure sufficient sound quality at the corresponding decoder.

[0007] In contrast, a system has also been proposed, as shown in FIG. 10, that does not record or transmit the scale factor SF for blocks to which no bits are actually allocated (blocks with word length WL = 0) and allocates correspondingly more bits to the audio data. In FIG. 10, the number of recorded scale factors SF is reduced by four, the number of blocks with no bits assigned (blocks with a 0-bit allocation).

[0008] In this case, however, the word length WL data must always be recorded or transmitted. In addition, when the decoder later reads the scale factors SF, it must check for every block whether that block's word length WL is nonzero.

[0009] Furthermore, the encoder normally calculates the number of bits needed to quantize the audio signal of each block, by the masking calculation and so on, then compares it with the total number of bits assigned to the time frame and adjusts the bit allocation of each block. If, as described above, whether a block's scale factor SF is recorded changes with that block's bit allocation, the total number of bits that can be assigned to the main information (audio data) changes with it, and adjusting the bit allocation becomes very complicated.
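The coupling described here can be made concrete with a small sketch of the FIG. 10 style layout. The field widths (`sf_bits`, `wl_bits`) and the function name are hypothetical; the source gives no field sizes.

```python
def main_bits_prior_art(frame_bits, word_lengths, sf_bits=6, wl_bits=4):
    """Bits left for main information when SF is sent only where WL != 0.

    WL is sent for every block; SF is sent only for blocks with a nonzero
    word length. Field widths are assumed values for illustration.
    """
    side = wl_bits * len(word_lengths)                          # WL always sent
    side += sf_bits * sum(1 for wl in word_lengths if wl > 0)   # SF where WL != 0
    return frame_bits - side
```

Changing a single block's word length from 0 to 1 changes the budget that the allocation itself was computed against, which is the circular dependency the paragraph points out.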

[0010] The present invention has been proposed in view of these circumstances. Its object is to provide an audio signal processing method in which bit allocation is comparatively easy to adjust and sound quality does not deteriorate when the allocation is adjusted.

[0011]

[Means for Solving the Problems] The audio signal processing method of the present invention has been proposed to achieve the above object. It is an audio signal processing method that divides an input audio signal into a plurality of blocks on the frequency axis for each predetermined time frame, applies block floating processing to each block, quantizes the data of each block with adaptive bit allocation, and records or transmits the quantized audio signal together with the parameters related to the block floating processing. In this method, the per-block parameters related to the block floating processing are recorded or transmitted, for each time frame, up to the band in which the parameters are required, and the number of parameters recorded or transmitted is also recorded or transmitted.

[0012] That is, when the high band is not recorded or transmitted in a given time frame, for example because it is hard to perceive (hard to hear), the parameters (scale factor and word length values) of the high-band blocks of that time frame are not recorded or transmitted, and the bits saved are allocated to the main information (the quantized block data) of the low band, which is perceptually important. The data recorded or transmitted therefore consists of the low-band parameters and the main information. In this case, the number of parameters recorded or transmitted in each time frame is also recorded (or transmitted).

[0013]

[Operation] According to the audio signal processing method of the present invention, the parameters of each block are recorded or transmitted, for each time frame, up to the band in which they are required. That is, parameters of bands that do not need them are not recorded or transmitted, and the bits saved are allocated to the coding of the audio data.

[0014]

[Embodiments] Embodiments of the present invention are described below with reference to the drawings.

[0015] The audio signal processing method of this embodiment converts a time-series input audio signal TS into a frequency-axis signal (spectrum signal SP) for each predetermined time frame, divides the spectrum signal SP into blocks by band, applies block floating processing to each block, quantizes the data of each block with adaptive bit allocation, and records or transmits the quantized audio signal together with the floating coefficient (scale factor SF) and the word length WL data, which are the parameters BF related to the block floating processing. As shown in FIG. 1, the per-block parameters BF are recorded or transmitted, for each time frame, up to the band in which the parameters BF are required, and the number N of parameters BF recorded or transmitted is also recorded or transmitted.

[0016] FIG. 1 shows one time frame and the blocks of the plural frequency bands within that time frame. The main information in FIG. 1 is the quantized block data, that is, the quantized data of the spectrum signal SP of each block obtained by dividing the audio signal TS by frequency.

[0017] The effectiveness of this embodiment rests on the following facts.

[0018] In a high band, for example 10 kHz and above, the so-called minimum audible limit based on human auditory characteristics (described later) is high, and the masking effect of low-band signals acts effectively. Consequently, even if the quantization noise there is larger than in the bands below it (for example, below 10 kHz), the degradation in sound quality is hard to perceive. In particular, even if the signal components in the band at or above 15 kHz are removed entirely (0-bit allocation), in many cases no audible difference can be noticed.

[0019] In this embodiment, therefore, as shown in FIG. 1, the per-block parameters BF related to block floating, namely the scale factor SF serving as the floating coefficient and the word length WL data corresponding to the number of bits allocated for quantization, are recorded or transmitted for each time frame only up to the band that requires the parameters BF (that is, only for the perceptually important low band). More bits can then be assigned to the low-band codes, which are perceptually important and cannot be omitted, and sound quality improves as a result.

[0020] Noise whose absolute level is at or below the minimum audible limit cannot be heard. The minimum audible limit differs with, for example, the playback volume even for the same coding. In practical digital systems, however, there is little variation in how music fits into, say, a 16-bit dynamic range. If the quantization noise in the frequency band to which the ear is most sensitive, around 4 kHz, is inaudible, then quantization noise below the level of the minimum audible curve can be considered inaudible in the other frequency bands as well.

[0021] With this arrangement the number of parameters BF varies from time frame to time frame, so in this embodiment the number N of parameters BF is recorded or transmitted for each time frame. The amount of data needed for N is small. For example, if the number of parameters BF is on the order of several tens, 7 bits suffice, and if the band can take only two values in the time frame (two blocks), N can be recorded or transmitted with 1 bit.
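The size of the count field follows from the number of values it must distinguish. A minimal sketch, assuming a plain fixed-width binary field (the function name `count_field_bits` is hypothetical):

```python
def count_field_bits(choices):
    """Smallest fixed-width field, in bits, that can encode one of
    `choices` distinct values, i.e. ceil(log2(choices)), with a 1-bit minimum."""
    return max(1, (choices - 1).bit_length())
```

Two possible values need 1 bit, and a 7-bit field covers up to 128 values, which is comfortably more than "several tens" of parameters.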

[0022] As described above, in this embodiment, when the high band is not recorded or transmitted in a time frame, for example because it is hard to perceive (hard to hear), the bits that would have been used for the parameters BF (scale factor SF and word length WL values) of the high-band blocks of that time frame are allocated to the low-band main information, and the low-band parameters BF and the main information are recorded on a medium (or transmitted). The number N of parameters BF for each time frame is also recorded (or transmitted).
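The frame contents listed above (the number N, the N low-band parameters, and the main information) can be sketched as a simple record. The field order and the list representation are assumptions made for illustration; the source does not specify a bitstream layout.

```python
def pack_frame(scale_factors, word_lengths, main_info, n):
    """Keep the parameters of the n lowest blocks only; n is stored first.

    The layout (N, then N scale factors, then N word lengths, then the
    main information) is a hypothetical ordering.
    """
    return [n] + scale_factors[:n] + word_lengths[:n] + list(main_info)


def unpack_frame(frame):
    """Read N first, then exactly N scale factors and N word lengths."""
    n = frame[0]
    sf = frame[1:1 + n]
    wl = frame[1 + n:1 + 2 * n]
    main_info = frame[1 + 2 * n:]
    return n, sf, wl, main_info
```

Because the decoder reads N before anything else, it knows how many parameters follow without testing each block's word length, in contrast to the FIG. 10 scheme.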

[0023] When the signal level in the low band is low and the signal level in the high band is very high, removing the high-band signal makes the resulting degradation in sound quality easy to perceive. In that case, as shown in FIG. 2, the high-band signal and its parameters BF are also recorded when encoding.

[0024] FIG. 3 shows a concrete configuration of the encoder of an audio signal processing system to which the audio signal processing method of this embodiment is applied.

[0025] FIG. 3 shows a configuration in which a time-series audio signal (waveform data) is divided by band division filters, converted into a frequency-axis signal by modified discrete cosine transform (MDCT) processing, and the resulting spectrum signal (spectrum data) SP is encoded.

[0026] The encoder shown in FIG. 3 divides the time-series input audio signal (digital audio data) TS supplied to input terminal 1 into a plurality of blocks on the frequency axis for each predetermined time frame, applies block floating processing to each block, and encodes the data of each block with adaptive bit allocation. It has a data recording circuit 19 that records the per-block parameters BF related to the block floating processing (the scale factor SF serving as the floating coefficient and the word length WL corresponding to the number of allocated bits), for each time frame, up to the band that requires the parameters BF, and that also records the number of parameters recorded. The data recorded in the data recording circuit 19 is transmitted to the decoder described later.

[0027] In the encoder of this example, the band of each block obtained by dividing the time frame into a plurality of frequency bands is variable. When the band is narrow, the number of parameters BF recorded in the data recording circuit 19 is reduced as in the embodiment described above, and the bits saved are allocated to the main information (used when quantizing the spectrum signal SP of each block obtained by frequency-dividing the audio signal TS, as described later).

[0028] To this end, the time-series audio signal (waveform data) TS supplied to input terminal 1 for each predetermined time frame is converted into the spectrum signal SP by a time/frequency conversion circuit 11, which converts a supplied time-series signal into a spectrum signal. As shown in FIGS. 1 and 2, for example, the time/frequency conversion circuit 11 partitions the time-series audio signal TS into predetermined time frames, divides each time frame into a plurality of frequency bands to form blocks, and makes the band of each block variable.

[0029] When the time/frequency conversion circuit 11 makes the block bands variable, the block widths are the bands obtained by dividing the time frame with a band division that takes human auditory characteristics into account. That is, the spectrum signal SP is divided into a plurality of bands whose bandwidths widen toward higher frequencies, generally called critical bands; in this example the division is broadly into three bands: high, middle, and low. A critical band is a frequency band divided in consideration of human auditory characteristics: it is the bandwidth of the narrow-band noise of equal intensity in the vicinity of a pure tone's frequency that masks that pure tone. As a critical-band division, the full frequency range of 0 to 20 kHz can also be divided into, for example, 25 critical bands.

[0030] The spectrum signal SP from the time/frequency conversion circuit 11 is sent to a spectrum signal quantization circuit 15 and quantized. The spectrum signal quantization circuit 15 normalizes the supplied spectrum signal SP of each block by block floating processing and then quantizes it with an adaptive number of allocated bits that takes the masking effect into account.

[0031] The floating coefficient (scale factor SF) used for the block floating processing in the spectrum signal quantization circuit 15 is supplied from a scale factor calculation circuit 13. The spectrum signal SP is supplied to the scale factor calculation circuit 13, which outputs, for each block of the plural frequency bands in each time frame, a floating coefficient (scale factor SF) obtained by multiplying, for example, the peak or the average value of the block's spectrum signal SP by a predetermined coefficient.
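The scale factor rule described here (peak or average of each block, multiplied by a predetermined coefficient) can be sketched as below. The function name, the default coefficient of 1.0, and the use of absolute values for the average are assumptions for illustration, and each block is assumed to be non-empty.

```python
def scale_factors(blocks, coeff=1.0, use_peak=True):
    """One scale factor per block: coeff times the block's peak magnitude,
    or coeff times its mean magnitude when use_peak is False."""
    out = []
    for block in blocks:
        mags = [abs(x) for x in block]
        base = max(mags) if use_peak else sum(mags) / len(mags)
        out.append(coeff * base)
    return out
```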

[0032] So that the spectrum signal quantization circuit 15 can quantize with the adaptive number of allocated bits, the spectrum signal SP is also sent to a masking calculation circuit 17. As described later, the masking calculation circuit 17 obtains masking information MSKI for each block according to human auditory characteristics, and/or masking information MSKI for a given block of interest resulting from the masking effect of other blocks near it. The masking information MSKI from the masking calculation circuit 17 is sent to a bit allocation calculation circuit 14, which derives from it the word length WL data serving as the bit allocation information for each block. The spectrum signal quantization circuit 15 performs adaptive quantization of each block of the supplied spectrum signal SP on the basis of this word length WL information.

[0033] The processing in the masking calculation circuit 17 and the bit allocation calculation circuit 14 is, concretely, as follows.

[0034] For the spectrum signal SP sent to the masking calculation circuit 17, the energy of each block is calculated first. In this calculation, the energy of each critical band, for example, is obtained by computing the sum of the amplitude values within the band, or by a similar method. The peak value or average value of the amplitudes is sometimes used in place of the per-band energy. The spectrum of per-band sums obtained by this energy calculation is generally called the Bark spectrum.
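The per-band sum described above can be sketched as follows. The band edges are passed in as spectral-line indices; the function name and that representation are assumptions for illustration, not values from the source.

```python
def band_sums(spectrum, band_edges):
    """Sum of amplitude magnitudes within each band.

    band_edges holds the start index of every band plus one final end
    index, so band i covers spectrum[band_edges[i]:band_edges[i + 1]].
    """
    return [
        sum(abs(x) for x in spectrum[lo:hi])
        for lo, hi in zip(band_edges, band_edges[1:])
    ]
```

With critical-band edges, the returned list corresponds to what the text calls the Bark spectrum.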

[0035] Next, to take into account the influence of the Bark spectrum on masking, the masking calculation circuit 17 applies a convolution to the Bark spectrum, multiplying it by a predetermined weighting function and summing. The convolution can be implemented with, for example, a plurality of delay elements that successively delay the input data, a plurality of multipliers (for example 25, one per band) that multiply the outputs of the delay elements by filter coefficients (the weighting function), and a summing adder that totals the multiplier outputs.
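The delay, multiply, and sum structure described above amounts to a sliding weighted sum over neighbouring bands. A minimal sketch, in which the function name and the weight values used in any call are hypothetical (the source does not give the weighting function):

```python
def spread(bark, weights):
    """Weighted sum over neighbouring bands.

    weights[k] is applied to band i + k - len(weights) // 2, so the middle
    weight applies to the band itself. Bands outside the spectrum are
    treated as zero.
    """
    half = len(weights) // 2
    out = []
    for i in range(len(bark)):
        acc = 0.0
        for k, w in enumerate(weights):
            j = i + k - half
            if 0 <= j < len(bark):
                acc += w * bark[j]
        out.append(acc)
    return out
```

For example, with weights `[0.5, 1.0, 0.25]` a single strong band contributes half its level to the band above it and a quarter to the band below it.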

[0036] After the convolution, deconvolution is performed to obtain the masking threshold. This masking threshold is the allowable noise spectrum. Subtracting the masking threshold and the Bark spectrum then gives the level at which the Bark spectrum is masked by the masking threshold. This masking level is sent to the bit allocation calculation circuit 14 as the masking information MSKI.

[0037] When obtaining the masking information MSKI, data representing the minimum audible limit, a human auditory characteristic, can for example be combined with the masking threshold. As noted above, noise whose absolute level is at or below the minimum audible limit cannot be heard.
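One plausible way to combine the two curves is to take, in each band, whichever level is higher, since noise below either one is inaudible. The source only says the data "can be combined", so the per-band maximum and the function name below are assumptions.

```python
def allowable_noise(mask_threshold, min_audible):
    """Per-band allowable noise level: the higher of the masking threshold
    and the minimum audible limit (an assumed combination rule)."""
    return [max(m, q) for m, q in zip(mask_threshold, min_audible)]
```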

[0038] The bit allocation calculation circuit 14 is provided with, for example, a ROM in which allocated-bit-count information is stored in advance. According to the level of the difference between the masking level in the masking information MSKI and the energy of each band, the allocated-bit-count information for each band is read from the ROM. From this per-band information, the word length WL data corresponding to the number of bits allocated to each block, broadly the high-band, mid-band, and low-band blocks, is then obtained.
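The table lookup described above can be sketched as follows. The table contents and the 6 dB step are hypothetical: the source states only that a ROM is indexed by the difference between band energy and masking level.

```python
import math

# Hypothetical table: entry k is the bit count for a difference of up to 6*k dB.
ROM = [0, 1, 2, 3, 4, 5, 6, 7, 8]


def lookup_bits(band_energy_db, masking_level_db):
    """Bits per band, read from ROM by the energy-to-masking difference."""
    bits = []
    for energy, mask in zip(band_energy_db, masking_level_db):
        diff = max(0.0, energy - mask)            # fully masked bands get index 0
        index = min(len(ROM) - 1, math.ceil(diff / 6))
        bits.append(ROM[index])
    return bits
```

A band whose energy does not exceed its masking level receives no bits, which is the 0-bit allocation case discussed earlier in the text.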

[0039] The bit allocation calculation circuit 14 can also correct the allowable noise level based on the masking information MSKI using, for example, equal-loudness curve information. The equal-loudness curve is a characteristic curve of human hearing, obtained by finding, at each frequency, the sound pressure of a tone that sounds as loud as, for example, a 1 kHz pure tone and joining the points with a curve; it is also called an equal-sensitivity curve of loudness. The equal-loudness curve follows nearly the same shape as the minimum audible limit curve. On this curve, for example, a tone near 4 kHz sounds as loud as the 1 kHz tone even when its sound pressure is 8 to 10 dB lower, whereas a tone near 50 Hz does not sound equally loud unless its sound pressure is about 15 dB higher than at 1 kHz. It follows that noise exceeding the level of the minimum audible curve (the allowable noise level) should be given a frequency characteristic that follows the equal-loudness curve. Correcting the allowable noise level in consideration of the equal-loudness curve thus matches human auditory characteristics.

[0040] By performing the masking calculation, the masking calculation circuit 17 also determines the highest band HB that should be recorded when the spectrum signal quantization circuit 15 quantizes the spectrum signal SP for recording in the data recording circuit 19. That is, taking the masking effect and other factors into account, it separates the bands whose omission has little effect on perceived sound quality from the bands whose omission would audibly degrade it, and sends the information on the band HB to be recorded to a parameter number calculation circuit 18, which calculates the number of block floating parameters BF.

[0041] On the basis of the information on the band HB to be recorded, the parameter number calculation circuit 18 calculates the number N of parameters BF, among the parameters BF of all blocks, that belong to bands at or below the band HB (that is, the bands to be recorded, for example the low band). Alternatively, it may calculate the number of parameters BF in the bands above the band HB (that is, the bands that need not be recorded, for example the high band).

[0042] The information on the number N is sent to the bit allocation calculation circuit 14. Accordingly, in addition to the bit allocation calculation described above, the bit allocation calculation circuit 14 uses the number N to ensure that no bits are allocated to the bands above the band HB to be recorded (that is, the bands that need not be recorded, for example the high band). In other words, once the number N is known, the total number of bits that can be assigned to the main information is determined, so the bit allocation calculation circuit 14 distributes the bits freed according to N among the spectrum signals SP of the bands up to the highest band HB to be recorded.
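The reallocation described in this paragraph can be illustrated with a small budget calculation. The sketch below is only an illustration under assumed figures: the frame size, the number of blocks, the bit widths of the scale factor, word length, and count fields, and the function name `main_info_bits` are all hypothetical and are not taken from the specification.

```python
def main_info_bits(frame_bits, n_params, sf_bits=6, wl_bits=4, count_bits=6):
    """Bits left for the quantized spectrum (main information) in one frame.

    Assumed layout (hypothetical): a count field holding N, followed by
    N parameters BF, each made of one scale factor SF and one word length WL.
    """
    side_info = count_bits + n_params * (sf_bits + wl_bits)
    if side_info > frame_bits:
        raise ValueError("side information does not fit in the frame")
    return frame_bits - side_info


# Hypothetical frame of 1696 bits with 52 blocks in total.
all_bands = main_info_bits(frame_bits=1696, n_params=52)  # every block recorded
low_only = main_info_bits(frame_bits=1696, n_params=40)   # only N = 40 recorded
freed = low_only - all_bands  # bits released by omitting 12 high-band parameters
```

Under these assumed widths, each omitted parameter releases 10 bits that the bit allocation calculation can hand to the spectrum signals of the recorded bands.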

[0043] The data recording circuit 19 records the number N of the parameters BF, the N parameters BF, and the quantized spectrum signal QSP.

[0044] The coded data CDT from the data recording circuit 19 is output via the output terminal 2.

[0045] FIG. 4 shows a specific configuration of the time/frequency conversion circuit 11 of the encoder of FIG. 3.

[0046] The configuration of FIG. 4 compresses the signal by combining, for example, a band division filter such as a QMF filter with the modified discrete cosine transform (MDCT). The QMF filter is described in R. E. Crochiere, "Digital coding of speech in subbands," Bell Syst. Tech. J., Vol. 55, No. 8, 1976. An equal-bandwidth filter division technique is described in Joseph H. Rothweiler, "Polyphase Quadrature filter - A new subband coding technique," ICASSP 83, Boston. The MDCT is described in J. P. Princen and A. B. Bradley (Univ. of Surrey, Royal Melbourne Inst. of Tech.), "Subband/Transform Coding Using Filter Bank Designs Based on Time Domain Aliasing Cancellation," ICASSP 1987. Instead of the MDCT, the time axis may also be converted to the frequency axis by, for example, a fast Fourier transform (FFT) or a discrete cosine transform (DCT).

[0047] In the configuration of FIG. 4, the input audio signal TS, such as a time-series PCM signal, is divided in frequency so that, as described above, the bandwidth becomes wider toward higher frequencies, based on the so-called critical bands that take human auditory characteristics into account. In this specific example, with the critical bands in mind, the signal is roughly divided into three bands: a high band, a middle band, and a low band. The blocks used for this band division may be the critical bands themselves or, in the high range, blocks obtained by further subdividing the critical bandwidth.

[0048] That is, in FIG. 4, the input audio signal TS, for example an audio PCM signal of 0 to 20 kHz, is supplied to the input terminal 1. The input audio signal TS is divided by a band division filter 71, such as a so-called QMF filter, into, for example, a 0 to 10 kHz band and a 10 kHz to 20 kHz band (high band), and the signal in the 0 to 10 kHz band is divided by a band division filter 72, likewise a so-called QMF filter or the like, into, for example, a 0 to 5 kHz band (low band) and a 5 kHz to 10 kHz band (middle band). The high-band (10 kHz to 20 kHz) signal from the band division filter 71 is sent to an MDCT circuit 73, which is an example of an orthogonal transform circuit, the middle-band (5 kHz to 10 kHz) signal from the band division filter 72 is sent to an MDCT circuit 74, and the low-band (0 to 5 kHz) signal from the band division filter 72 is sent to an MDCT circuit 75, so that each signal is MDCT processed. Of the MDCT-processed signals, the high-band signal is output via a terminal 76, the middle-band signal via a terminal 77, and the low-band signal via a terminal 78.
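The two-stage band division tree of FIG. 4 can be sketched as follows. This is a minimal illustration, not the filter of the embodiment: the two-tap Haar pair used here is a stand-in chosen only because it is the simplest filter pair that splits a signal into two half-rate bands, and the function names are hypothetical.

```python
def split(x):
    """Split x into half-rate low and high bands with a two-tap Haar pair.

    Stand-in for band division filters 71 and 72; a practical QMF filter
    would use much longer filters with sharper band separation.
    """
    pairs = list(zip(x[0::2], x[1::2]))
    low = [(a + b) / 2 for a, b in pairs]
    high = [(a - b) / 2 for a, b in pairs]
    return low, high


def analysis_tree(x):
    """First stage: lower half / high band. Second stage: low / middle band."""
    lower_half, high_band = split(x)          # role of filter 71
    low_band, mid_band = split(lower_half)    # role of filter 72
    return low_band, mid_band, high_band


low_band, mid_band, high_band = analysis_tree([float(v) for v in range(8)])
```

Because each stage halves the sampling rate, the high band ends up with twice as many samples per unit time as the low and middle bands, which is the unequal bandwidth split the paragraph describes.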

[0049] In this specific example, the block sizes of the MDCT circuits 73, 74, and 75 are chosen so that the frequency band becomes wider and the time resolution higher (the block length shorter) toward the high-band side. That is, for the low-band signal of 0 to 5 kHz and the middle-band signal of 5 kHz to 10 kHz, one block contains, for example, 256 samples, while for the high-band signal of 10 kHz to 20 kHz, one block is half as long as a low-band or middle-band block. In this way, the number of orthogonal transform block samples is the same in every band. In addition, each band can be adaptively divided further into blocks of 1/2, 1/4, and so on, for cases where the signal changes greatly over time.
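The statement that the block sample counts come out equal can be checked with simple arithmetic. The sketch below assumes critically decimated band division filters, so that each band is sampled at twice its bandwidth; that assumption is not stated in the text, and the names are illustrative.

```python
# (band name, lower edge in Hz, upper edge in Hz) as in the FIG. 4 example
BANDS = [("low", 0, 5_000), ("mid", 5_000, 10_000), ("high", 10_000, 20_000)]

LOW_MID_BLOCK_SAMPLES = 256  # per the text, for the low and middle bands


def band_rate(lo_hz, hi_hz):
    """Sampling rate of a critically decimated band: twice its bandwidth."""
    return 2 * (hi_hz - lo_hz)


# Duration of a low/mid block in seconds, and half of it for the high band.
low_mid_duration = LOW_MID_BLOCK_SAMPLES / band_rate(0, 5_000)
high_duration = low_mid_duration / 2

block_samples = {}
for name, lo, hi in BANDS:
    duration = high_duration if name == "high" else low_mid_duration
    block_samples[name] = round(duration * band_rate(lo, hi))
```

Under this assumption the high band runs at twice the sample rate of the other two bands, so a block of half the duration still holds 256 samples.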

[0050] FIG. 5 shows the configuration of a decoder corresponding to the encoder of FIG. 3.

[0051] That is, in FIG. 5, the coded data CDT is supplied to the input terminal 51. The data CDT is sent to a quantized spectrum signal reading circuit 54, which reads (extracts) the quantized spectrum signal from the data CDT, to a parameter number reading circuit 52, which reads (extracts) the number N of the block floating parameters BF, and to a parameter reading circuit 53, which reads (extracts) the data of the block floating parameters BF. The quantized spectrum signal QSP from the quantized spectrum signal reading circuit 54, the data on the number N from the parameter number reading circuit 52, and the parameters BF from the parameter reading circuit 53, that is, the data on the scale factors SF and the word lengths WL, are sent to a spectrum signal restoration circuit 55. The spectrum signal restoration circuit 55 performs decoding using the supplied signals. The spectrum signal RSP decoded by the spectrum signal restoration circuit 55 is converted into a time-series audio signal RTS by a frequency/time conversion circuit 56 and output from an output terminal 57.

[0052] FIG. 6 shows a specific configuration of the frequency/time conversion circuit 56 of the decoder of FIG. 5.

[0053] In FIG. 6, the spectrum signals RSP of the blocks are applied to terminals 61, 62, and 63, and IMDCT (inverse MDCT) circuits 64, 65, and 66 convert the signals on the frequency axis into signals on the time axis. These partial-band time-axis signals are decoded into a full-band signal by IQMF (inverse QMF) circuits 67 and 68 and taken out from a terminal 69.

[0054] FIG. 7 shows a flowchart of the signal processing in the encoder of FIG. 3.

[0055] That is, in the flowchart of FIG. 7, in step S1 the time/frequency conversion circuit 11 converts the input time-series audio signal TS into the spectrum signal (spectrum data) SP. After step S1, the process proceeds to step S2, where the scale factor calculation circuit 13 performs the calculation for obtaining the scale factors SF. In step S3, the masking calculation circuit 17 performs the masking calculation, and in step S4, the parameter number calculation circuit 18 determines the bands to which bits are to be allocated and the number N of parameters BF of the block floating processing. In step S5, the bit allocation calculation circuit 14 calculates the bit allocation and the word lengths WL. In step S6, the spectrum signal quantization circuit 15 quantizes the spectrum signal (spectrum data). Finally, in step S7, the data recording circuit 19 records the data on the number N, the parameters BF of the block floating processing (the parameters BF of the bands to be recorded), and the data of the quantized spectrum signal (spectrum data) QSP.
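Steps S2, S6, and S7 can be sketched as a small frame writer. This is a simplified illustration under stated assumptions, not the recording format of the embodiment: the field widths, the power-of-two scale factor, the field order within the frame, and all function names are hypothetical, and the word lengths are simply passed in rather than derived from a masking calculation.

```python
import math

COUNT_BITS, SF_BITS, WL_BITS = 6, 6, 4  # hypothetical field widths


def scale_factor_index(block):
    """Smallest exponent e >= 0 such that every |x| in the block is <= 2**e."""
    peak = max(abs(x) for x in block)
    return max(0, math.ceil(math.log2(peak))) if peak > 0 else 0


def quantize(block, sf, wl):
    """Normalize by 2**sf and round to signed integers of wl bits (wl >= 2)."""
    top = (1 << (wl - 1)) - 1
    return [round(x / 2 ** sf * top) for x in block]


def to_bits(value, width):
    """Two's-complement bit string of the given width."""
    return format(value & ((1 << width) - 1), f"0{width}b")


def pack_frame(blocks, word_lengths, n):
    """Write N, then N (SF, WL) pairs, then the quantized data of N blocks.

    Blocks beyond the first n (the bands above HB) are not written at all.
    """
    sfs = [scale_factor_index(b) for b in blocks[:n]]
    out = [to_bits(n, COUNT_BITS)]
    for sf, wl in zip(sfs, word_lengths[:n]):
        out.append(to_bits(sf, SF_BITS) + to_bits(wl, WL_BITS))
    for block, sf, wl in zip(blocks[:n], sfs, word_lengths[:n]):
        out.extend(to_bits(q, wl) for q in quantize(block, sf, wl))
    return "".join(out)


# Three blocks, of which only the lowest two are recorded (N = 2).
frame = pack_frame([[3.0, -1.0], [0.5, 0.25], [0.01, 0.0]], [4, 3, 2], n=2)
```

The third block contributes neither parameters nor data to the frame, which is the saving the method relies on.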

[0056] FIG. 8 shows a flowchart of the processing in the decoder shown in FIG. 5.

[0057] That is, in the flowchart of FIG. 8, first, in step S11, the parameter number reading circuit 52 reads the number N of parameters BF, and then, in step S12, the parameter reading circuit 53 reads the data of N parameters BF. Next, in step S13, the quantized spectrum signal reading circuit 54 reads the quantized spectrum signal QSP in accordance with the word lengths WL among the parameters BF. In step S14, the spectrum signal restoration circuit 55 restores the read quantized spectrum signal QSP, based on the scale factors SF and the word lengths WL, as an approximation of the original spectrum signal SP (the spectrum signal RSP). Finally, in step S15, the frequency/time conversion circuit 56 applies to the spectrum signal RSP the inverse of the MDCT (inverse MDCT) and passes the result through the band synthesis filters, thereby restoring the audio signal RTS.
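Steps S11 to S14 can be sketched as a frame reader. As before, this is only an illustration under assumptions: the frame is taken to be a bit string holding N, then N pairs of scale factor and word length, then the quantized data, with hypothetical field widths and a power-of-two scale factor; none of these details come from the specification.

```python
COUNT_BITS, SF_BITS, WL_BITS = 6, 6, 4  # hypothetical field widths


def from_bits(bits):
    """Interpret a bit string as a two's-complement signed integer."""
    value = int(bits, 2)
    return value - (1 << len(bits)) if bits[0] == "1" else value


def unpack_frame(frame, block_len):
    """Read N, the N parameters BF, then restore the spectrum of N blocks."""
    pos = 0

    def take(width):
        nonlocal pos
        field = frame[pos:pos + width]
        pos += width
        return field

    n = int(take(COUNT_BITS), 2)                      # S11: number N
    params = [(int(take(SF_BITS), 2), int(take(WL_BITS), 2))
              for _ in range(n)]                      # S12: N parameters BF
    # S13: fetch each block's codes using its word length WL (assumed >= 2).
    codes = [[from_bits(take(wl)) for _ in range(block_len)]
             for _, wl in params]
    # S14: rescale with SF and WL to approximate the original spectrum.
    restored = [[q / ((1 << (wl - 1)) - 1) * 2 ** sf for q in block]
                for (sf, wl), block in zip(params, codes)]
    return n, restored


# Hypothetical 40-bit frame: N = 2, (SF=2, WL=4), (SF=0, WL=3), then data.
example = "000010" + "0000100100" + "0000000011" + "01011110" + "010001"
n, rsp = unpack_frame(example, block_len=2)
```

Because N is read first, the reader knows exactly how many parameters follow and where the quantized data begins, without any per-band presence flags.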

[0058] Although the embodiment described above concerns a system that encodes the signal obtained by converting the time-series input audio signal TS into the spectrum signal SP, the present invention can also be applied to a system that divides the time-series signal into subbands and encodes them (so-called subband coding).

[0059]

[Effects of the Invention] As described above, in the audio signal processing method of the present invention, the parameters of each block related to the block floating processing are recorded or transmitted, for each time frame, up to the band that requires the parameters, and the number of parameters recorded or transmitted is also recorded or transmitted. As a result, adjusting the bit allocation is relatively easy and does not degrade the sound quality. That is, when, for example, a high-band signal is not recorded because it is hardly perceived in practice, the bits for the high-band parameters can also be allocated to the quantization of the low-band audio signal, which improves the sound quality. Moreover, when the high band contains a high-level signal, it can be recorded without narrowing the bandwidth. Furthermore, this processing can be realized with a simple configuration.

[Brief description of drawings]

FIG. 1 is a diagram for explaining an example of data recording or transmission by the method of the embodiment of the present invention (when the high-band signal is not recorded or transmitted).

FIG. 2 is a diagram for explaining an example of data recording or transmission by the method of the embodiment of the present invention (when the high-band signal is also recorded or transmitted).

FIG. 3 is a block circuit diagram showing a schematic configuration of the encoder.

FIG. 4 is a block circuit diagram showing a specific configuration of the time/frequency conversion circuit of FIG. 3.

FIG. 5 is a block circuit diagram showing a schematic configuration of the decoder.

FIG. 6 is a block circuit diagram showing a specific configuration of the frequency/time conversion circuit of FIG. 5.

FIG. 7 is a flowchart of the processing in the encoder.

FIG. 8 is a flowchart of the processing in the decoder.

FIG. 9 is a diagram for explaining an example of recorded or transmitted data in a conventional system (a system in which the number of parameters is fixed).

FIG. 10 is a diagram for explaining an example of recorded or transmitted data in a conventional system (a system in which the number of scale factors is variable).

[Explanation of symbols]

WL: word length
SF: scale factor
11: time/frequency conversion circuit
13: scale factor calculation circuit
14: bit allocation calculation circuit
15: spectrum signal quantization circuit
17: masking calculation circuit
18: parameter number calculation circuit
19: data recording circuit

Claims

[Claims]

1. An audio signal processing method in which an input audio signal is divided into a plurality of blocks on the frequency axis for each predetermined time frame, block floating processing is performed on each block, the data of each block are quantized by adaptive bit allocation, and the quantized audio signal is recorded or transmitted together with parameters related to the block floating processing, characterized in that the parameters of each block related to the block floating processing are recorded or transmitted, for each time frame, up to the band that requires the parameters, and the number of parameters recorded or transmitted is also recorded or transmitted.