JP3060576B2

JP3060576B2 - Digital signal encoding method

Info

Publication number: JP3060576B2
Application number: JP3091187A
Authority: JP
Inventors: 健三赤桐
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1991-03-29
Filing date: 1991-03-29
Publication date: 2000-07-10
Anticipated expiration: 2015-07-10
Also published as: JPH04302534A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、入力ディジタル信号の
符号化を行うディジタル信号の高能率符号化装置に関す
るものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a high efficiency digital signal encoding apparatus for encoding an input digital signal.

【０００２】[0002]

【従来の技術】オーディオ_,音声等の信号の高能率符号
化においては、オーディオ，音声等の入力信号を時間軸
又は周波数軸で複数のチャンネルに分割すると共に、各
チャンネル毎のビット数を適応的に割当てるビットアロ
ケーシヨン（ビット割当て）による符号化技術がある。
例えば、オーディオ信号等の上記ビット割当てによる符
号化技術には、時間軸上のオーディオ信号等を複数の周
波数帯域に分割して符号化する帯域分割符号化（サブ・
バンド・コーディング：ＳＢＣ）や、時間軸の信号を周
波数軸上の信号に変換（直交変換）して複数の周波数帯
域に分割し各帯域毎で適応的に符号化するいわゆる適応
変換符号化（ＡＴＣ）、或いは、上記ＳＢＣといわゆる
適応予測符号化（ＡＰＣ）とを組み合わせ、時間軸の信
号を帯域分割して各帯域信号をベースバンド（低域）に
変換した後複数次の線形予測分析を行って予測符号化す
るいわゆる適応ビット割当て（ＡＰＣ−ＡＢ）等の符号
化技術がある。BACKGROUND ART _Audio, in high-efficiency encoding of the signal of voice, etc., audio, adaptive with, the number of bits of each channel is divided into a plurality of channels of the input signal in the time axis or the frequency axis such as voice There is a coding technique based on bit allocation (bit allocation).
For example, the encoding technique based on the above-mentioned bit allocation of an audio signal or the like includes band division encoding (sub-coding) in which an audio signal or the like on a time axis is divided into a plurality of frequency bands and encoded.
Band coding: SBC) or so-called adaptive conversion coding (ATC) in which a signal on the time axis is converted into a signal on the frequency axis (orthogonal conversion), divided into a plurality of frequency bands, and adaptively encoded for each band. Alternatively, the SBC is combined with so-called adaptive prediction coding (APC), a signal on the time axis is divided into bands, and each band signal is converted into a baseband (low band), and then a multi-order linear prediction analysis is performed. There is an encoding technique such as so-called adaptive bit allocation (APC-AB) for predictive encoding.

【０００３】上記高能率符号化においては、時間軸上の
オーディオ信号等を、所定の単位時間毎に例えば高速フ
ーリエ変換（ＦＦＴ）等の直交変換によって時間軸に直
交する軸（周波数軸）に変換し、その後複数の帯域に分
割して、これら分割された各帯域のＦＦＴ係数データを
適応的なビット割り当てによって符号化している。この
符号化データが伝送される。In the above-mentioned high efficiency coding, an audio signal or the like on a time axis is transformed into an axis (frequency axis) orthogonal to the time axis by an orthogonal transformation such as a fast Fourier transform (FFT) every predetermined unit time. Then, the data is divided into a plurality of bands, and the FFT coefficient data of each of the divided bands is encoded by adaptive bit allocation. This encoded data is transmitted.

【０００４】[0004]

【発明が解決しようとする課題】ところで、この各帯域
毎のＦＦＴ係数データを上記適応ビット割り当てにより
符号化する際には、例えば、上記周波数軸上のＦＦＴ係
数データをブロック化し、このブロック毎にいわゆるブ
ロックフローティング処理を施すことで、更にビット圧
縮を行う場合が多い。このため、後の復号化のための構
成には、上記帯域分割されると共に上記ブロック毎にブ
ロックフローティング処理されたＦＦＴ係数データと、
当該各ブロック毎のフローティング係数及び割り当てビ
ット数に応じたワード長情報からなるサブ情報とが伝送
されることになる。When the FFT coefficient data for each band is encoded by the adaptive bit allocation, for example, the FFT coefficient data on the frequency axis is divided into blocks, and In many cases, bit compression is further performed by performing a so-called block floating process. For this reason, the configuration for later decoding includes FFT coefficient data that has been band-divided and subjected to block floating processing for each block,
The floating coefficient for each block and the sub information including the word length information corresponding to the number of allocated bits are transmitted.

【０００５】しかし、上記高能率符号化においては、更
に圧縮効率を高めることが望まれている。However, in the high-efficiency coding, it is desired to further increase the compression efficiency.

【０００６】そこで、本発明は、上述のような実情に鑑
みて提案されたものであり、より高いビット圧縮が可能
なディジタル信号符号化方法を提供することを目的とす
るものである。Accordingly, the present invention has been proposed in view of the above-described circumstances, and has as its object to provide a digital signal encoding method capable of performing higher bit compression.

【０００７】[0007]

【課題を解決するための手段】本発明のディジタル信号
符号化方法は、上述の目的を達成するために提案された
ものであり、入力ディジタル信号を直交変換して臨界帯
域に分割し、当該臨界帯域毎のエネルギに基づいて設定
した各臨界帯域毎の許容ノイズレベルと当該各臨界帯域
毎のエネルギとの差分のレベルに応じたビット数で上記
各臨界帯域の信号成分を符号化すると共に、上記直交変
換後の信号成分をブロック化してこのブロック毎にブロ
ックフローティング処理を行い当該ブロック毎のフロー
ティング係数を伝送するディジタル信号符号化方法であ
って、上記ブロックフローティング処理を上記臨界帯域
よりも狭い帯域の小ブロック単位で行う場合には、上記
臨界帯域毎のフローティング係数の代わりに許容ノイズ
レベルに関する情報を伝送すると共に、上記小ブロック
毎に割り当てられるビット数に応じたワード長の情報を
伝送するようにしたものである。また、本発明は、上記
ブロックフローティング処理を上記臨界帯域よりも広い
帯域の大ブロック単位で行う場合には、上記大ブロック
毎のフローティング係数に関する情報を伝送すると共
に、上記臨界帯域毎に割り当てられるビット数に応じた
ワード長の情報を伝送するようにしたものである。ここ
で、上記割当ビット数決定の際には、例えば、各臨界帯
域毎のエネルギから人間の聴覚特性を考慮していわゆる
マスキング量を求め、このマスキング量に基づく許容ノ
イズレベルを用いて各臨界帯域の割当ビット数を決定す
ることが望ましい。SUMMARY OF THE INVENTION A digital signal encoding method according to the present invention has been proposed to achieve the above-mentioned object. An input digital signal is orthogonally transformed and divided into a critical band. Encoding the signal component of each critical band with the number of bits according to the level of the difference between the allowable noise level for each critical band and the energy for each critical band set based on the energy for each band, and A digital signal encoding method in which signal components after orthogonal transformation are divided into blocks, block floating processing is performed for each block, and floating coefficients for each block are transmitted, wherein the block floating processing is performed for a band narrower than the critical band. When the processing is performed in small block units, information on the permissible noise level is used instead of the floating coefficient for each critical band. While transmitting, is obtained so as to transmit word length information of corresponding to the number of bits allocated for each of the small blocks. In addition, according to the present invention, when the block floating process is performed in units of large blocks having a band wider than the critical band, information on a floating coefficient of each large block is transmitted, and bits allocated to each of the critical bands are transmitted. The information of the word length according to the number is transmitted. Here, when determining the number of allocated bits, for example, a so-called masking amount is determined from the energy of each critical band in consideration of human auditory characteristics, and each critical band is determined using an allowable noise level based on the masking amount. Is desirably determined.

【０００８】[0008]

【作用】本発明によれば、ブロックフローティング処理
が臨界帯域よりも狭い帯域の小ブロックについて行われ
る場合は、１つの臨界帯域内に複数の小ブロックが存在
することになり、この場合、各小ブロック毎のフローテ
ィング係数を伝送せずに、臨界帯域毎の許容ノイズレベ
ルに関する情報及びワード長情報を送るようにすること
で、フローティング係数のためのビット数を減らすこと
ができる。また、臨界帯域よりも広い大ブロックでフロ
ーティング処理を行う場合は、１つの大ブロック内に複
数の臨界帯域が存在することになり、この場合、各臨界
帯域毎のフローティング係数の情報を送らずに、大ブロ
ックで１つのフローティング係数に関する情報を伝送す
るようにすることで、各臨界帯域毎のフローティング係
数の情報を伝送するためのビット数を低減することがで
きる。According to the present invention, when the block floating processing is performed on a small block having a band narrower than the critical band, a plurality of small blocks exist in one critical band. By transmitting the information on the allowable noise level and the word length information for each critical band without transmitting the floating coefficient for each block, the number of bits for the floating coefficient can be reduced. In addition, when performing the floating process in a large block wider than the critical band, there are a plurality of critical bands in one large block, in this case, without sending the information of the floating coefficient for each critical band By transmitting information on one floating coefficient in a large block, the number of bits for transmitting information on the floating coefficient for each critical band can be reduced.

【０００９】[0009]

【実施例】以下、本発明を適用した実施例について図面
を参照しながら説明する。本発明のディジタル信号符号
化方法は、図１及び図２に示すように、入力ディジタル
信号を例えば高速フーリエ変換（ＦＦＴ）で直交変換し
て人間の聴覚特性を考慮した高域ほど帯域幅が広くなる
ようないわゆる臨界帯域（クリティカルバンド）Ｂ（Ｂ
１〜Ｂ４）に分割し、当該臨界帯域毎のエネルギに基づ
いて設定した各臨界帯域毎の許容ノイズレベルＮＬと当
該各臨界帯域毎のエネルギとの差分のレベルに応じたビ
ット数で上記各臨界帯域の信号成分（ＦＦＴ係数デー
タ）を符号化すると共に、上記直交変換後の信号成分を
ブロック化してこのブロックｂ毎にブロックフローティ
ング処理を行い当該ブロックｂ毎のフローティング係数
Ｆｃを伝送するディジタル信号符号化方法である。Embodiments of the present invention will be described below with reference to the drawings. In the digital signal encoding method of the present invention, as shown in FIGS. 1 and 2, an input digital signal is orthogonally transformed by, for example, a fast Fourier transform (FFT), and the bandwidth becomes wider in a higher frequency band in consideration of human auditory characteristics. A so-called critical band (critical band) B (B
1 to B4), and each of the thresholds is determined by the number of bits according to the level of the difference between the allowable noise level NL of each critical band and the energy of each critical band set based on the energy of each critical band. A digital signal code that encodes a band signal component (FFT coefficient data), blocks the orthogonally transformed signal component, performs block floating processing for each block b, and transmits a floating coefficient Fc for each block b. Method.

【００１０】ここで、本実施例は、例えば図１のよう
に、上記ブロックフローティング処理を上記臨界帯域Ｂ
よりも狭い帯域の小ブロック単位（小ブロックｂ１〜ｂ
４）で行う場合には、上記臨界帯域Ｂ毎のフローティン
グ係数の代わりに、許容ノイズレベルＮＬに関する情報
を伝送すると共に、上記小ブロックｂ１〜ｂ４毎に割り
当てられるビット数に応じたワード長Ｗ１〜Ｗ４の情報
を伝送するようにしたものである。In the present embodiment, for example, as shown in FIG.
A smaller block unit having a narrower band (small blocks b1 to b
In the case of 4), information on the allowable noise level NL is transmitted instead of the floating coefficient for each critical band B, and the word lengths W1 to W4 corresponding to the number of bits allocated to the small blocks b1 to b4. W4 information is transmitted.

【００１１】また、例えば図２に示すように、上記ブロ
ックフローティング処理を臨界帯域Ｂ１〜Ｂ４よりも広
い帯域の大ブロック単位（大ブロックｂ）で行う場合に
は、当該大ブロックｂ毎のフローティング係数に関する
情報を伝送すると共に、上記臨界帯域Ｂ１〜Ｂ４毎に割
り当てられるビット数に応じたワード長Ｗ１〜Ｗ４の情
報を伝送するようにしたものである。As shown in FIG. 2, for example, when the block floating processing is performed in units of large blocks (large blocks b) wider than the critical bands B1 to B4, the floating coefficient for each large block b is set. And information of word lengths W1 to W4 according to the number of bits allocated to each of the critical bands B1 to B4.

【００１２】なお、図１の例は、当該臨界帯域の帯域幅
の広い高域側の１つの臨界帯域Ｂを抜き出して示したも
のであり、図２の例は、帯域幅の狭い低域側の各帯域を
示したものである。また、図１，図２には、上記割り当
てビット数を求めるためのレベル差とワード長Ｗ１〜Ｗ
４とが対応するものであるため、該ワード長Ｗ１〜Ｗ４
を便宜的に図中に示している。The example of FIG. 1 shows one extracted critical band B on the high frequency side where the bandwidth of the critical band is wide, and the example of FIG. 2 shows the frequency band on the low band side where the bandwidth is narrow. Are shown in FIG. FIGS. 1 and 2 show a level difference and word lengths W1 to W for obtaining the number of allocated bits.
4 correspond to the word lengths W1 to W4.
Are shown in the figure for convenience.

【００１３】また、本実施例では前述したように、適応
的なビット割り当てによるＦＦＴ係数データの符号化処
理を行うようにしている。すなわち、本実施例での適応
的なビット割り当てによる符号化処理は、各臨界帯域毎
のエネルギに基づいて後述するような人間の聴覚特性を
考慮したいわゆるマスキング量を求め、このマスキング
量に基づいて設定される許容ノイズレベル（すなわち各
臨界帯域毎に略一定の許容ノイズレベル）と各臨界帯域
のエネルギとの差分のレベルに応じた割り当てビット数
を求めて、当該各臨界帯域毎の割り当てビット数に応じ
て行われる。Further, in this embodiment, as described above, encoding processing of FFT coefficient data by adaptive bit allocation is performed. That is, the encoding process by the adaptive bit allocation in the present embodiment obtains a so-called masking amount in consideration of human auditory characteristics as described later based on the energy of each critical band, and based on the masking amount. The number of bits to be allocated is determined according to the level of the difference between the set allowable noise level (that is, a substantially constant allowable noise level for each critical band) and the energy of each critical band, and the number of allocated bits for each critical band is determined. It is performed according to.

【００１４】更に、本実施例においては上記各臨界帯域
内の複数のＦＦＴ係数データの一定数毎にブロックを構
成し、このブロック毎にいわゆるブロックフローティン
グ処理を施して、ビット圧縮を行うようにしている。し
たがって、上記図１の例のように、臨界帯域の高域すな
わち帯域幅の広い臨界帯域Ｂ内には複数の上記ブロック
（例えば４つの上記小ブロックｂ１〜ｂ４）が存在する
ようになり、また図２の例のように帯域幅が狭い（臨界
帯域の低域）場合には、１つの大ブロックｂ内に複数臨
界帯域（例えば４つの臨界帯域Ｂ１〜Ｂ４）が存在する
ようになる。Further, in this embodiment, a block is formed for each fixed number of a plurality of FFT coefficient data in each of the above critical bands, and a so-called block floating process is performed for each block to perform bit compression. I have. Therefore, as in the example of FIG. 1 described above, a plurality of the blocks (for example, the four small blocks b1 to b4) are present in the high band of the critical band, that is, the critical band B having a wide bandwidth. When the bandwidth is narrow (low band of the critical band) as in the example of FIG. 2, a plurality of critical bands (for example, four critical bands B1 to B4) exist in one large block b.

【００１５】ところで、通常、このようにブロックフロ
ーティング処理を施すようにすると、後の復号化処理の
際には、該フローティング処理のフローティング係数
と、上記割り当てビット数に応じたワード長の情報とが
必要となる。すなわち、後の復号化のための構成に対し
ては、通常、上記ブロック毎のフローティング係数の情
報及び、該フローティング係数のレベルと該臨界帯域の
許容ノイズレベルとのレベル差に基づく割り当てビット
数に応じたワード長の情報を伝送することが必要とな
る。換言すれば、後の復号化の際には、上記フローティ
ング係数の情報から、上記ブロックフローティング処理
における最上位ビット（ＭＳＢ）が決まり、上記ワード
長の情報から最下位ビット（ＬＳＢ）が決まって上記許
容ノイズレベルが決定される。更に各ブロックのＦＦＴ
係数データ（メインデータ）から信号の大きさが決まる
ようになる。By the way, normally, when the block floating processing is performed as described above, in the subsequent decoding processing, the floating coefficient of the floating processing and the information of the word length according to the number of allocated bits are included. Required. That is, for a configuration for later decoding, usually, the information of the floating coefficient for each block and the number of bits allocated based on the level difference between the level of the floating coefficient and the allowable noise level of the critical band are set. It is necessary to transmit information of a corresponding word length. In other words, at the time of subsequent decoding, the most significant bit (MSB) in the block floating processing is determined from the information on the floating coefficient, and the least significant bit (LSB) is determined from the information on the word length. An allowable noise level is determined. FFT of each block
The magnitude of the signal is determined from the coefficient data (main data).

【００１６】ここで、通常、上記フローティング係数の
情報は例えば６ビットで表され、上記ワード長の情報は
それぞれ例えば４ビットで表される。なお、直交変換が
ＤＦＴ（離散的フーリエ変換）の場合、上記ワード長情
報は、大きさ（振幅）と位相若しくは実数部と虚数部を
上記４ビットで表すことになる。このため、例えば、１
つの臨界帯域を複数のフローティングブロックで分割し
た場合、当該ブロックフローティング処理のブロック数
（すなわち帯域の分割数）に応じた当該臨界帯域の全体
の伝送ビット数は、表１に示すようになる。Here, usually, the information of the floating coefficient is represented by, for example, 6 bits, and the information of the word length is represented by, for example, 4 bits. When the orthogonal transform is a DFT (Discrete Fourier Transform), the word length information indicates a magnitude (amplitude) and a phase or a real part and an imaginary part by the four bits. Thus, for example, 1
When one critical band is divided by a plurality of floating blocks, the total number of transmission bits of the critical band according to the number of blocks (that is, the number of divided bands) in the block floating process is as shown in Table 1.

【表１】 [Table 1]

【００１７】この表１において、臨界帯域を１つのブロ
ックで表した場合（１分割）は、フローティング係数に
６ビットで、ワード長に４ビットの合計１０ビットが伝
送されることになる。また、臨界帯域を２つのブロック
で表した場合（２分割）は、フローティング係数に６×
２（＝１２ビット）、ワード長に４×２（＝８ビット）
で合計２０ビットが伝送されることになる。以下同様
に、３分割した場合はフローティング係数に６×３（＝
１８ビット）でワード長に４×３（＝１２ビット）の合
計３０ビット、また、４分割（図１の例）の場合はフロ
ーティング係数に６×４（＝２４ビット）でワード長に
４×４（＝１６ビット）の合計４０ビットが伝送され
る。上述のように、１つの臨界帯域内のブロック数が増
加するほど伝送されるビット数も増加することになる。In Table 1, when the critical band is represented by one block (one division), a total of 10 bits of 6 bits for the floating coefficient and 4 bits for the word length are transmitted. When the critical band is represented by two blocks (divided into two), the floating coefficient is 6 ×
2 (= 12 bits), 4 × 2 (= 8 bits) in word length
Thus, a total of 20 bits are transmitted. Similarly, in the case of dividing into three, the floating coefficient is 6 × 3 (=
18 bits) and a word length of 4 × 3 (= 12 bits) for a total of 30 bits. In the case of 4 divisions (example of FIG. 1), a floating coefficient is 6 × 4 (= 24 bits) and a word length is 4 ×. A total of 40 bits of 4 (= 16 bits) are transmitted. As described above, as the number of blocks in one critical band increases, the number of transmitted bits also increases.

【００１８】これに対し、本発明実施例の上記図１の例
においては、臨界帯域Ｂ内の上記各小ブロックのフロー
ティング係数Ｆｃ１〜Ｆｃ４の情報を伝送せずに、臨界
帯域Ｂに対して１つのみ設定される上記許容ノイズレベ
ルＮＬの情報と上記割り当てビット数に応じたワード長
Ｗ１〜Ｗ４の情報を伝送するようにしている。すなわ
ち、後の復号化処理の際には、上記臨界帯域Ｂの許容ノ
イズレベルＮＬの情報が伝送されて来れば、当該許容ノ
イズレベルＮＬの情報と上記各小ブロックｂ１〜ｂ４の
ワード長Ｗ１〜Ｗ４の情報とに基づいて、上記各小ブロ
ックｂ１〜ｂ４毎のフローティング係数Ｆｃ１〜Ｆｃ４
の情報を求めることができるため、このフローティング
係数Ｆｃ１〜Ｆｃ４の情報を伝送しないようにしてい
る。これにより、上記臨界帯域Ｂに対して４つ必要な上
記フローティング係数Ｆｃ１〜Ｆｃ４を伝送するための
ビット数が低減できるようになる。On the other hand, in the example of FIG. 1 according to the embodiment of the present invention, the information of the floating coefficients Fc1 to Fc4 of each of the small blocks in the critical band B is not transmitted, and Only the information of the allowable noise level NL and the information of the word lengths W1 to W4 corresponding to the number of allocated bits are transmitted. That is, in the subsequent decoding process, if the information of the allowable noise level NL of the critical band B is transmitted, the information of the allowable noise level NL and the word length W1 of each of the small blocks b1 to b4 are transmitted. Based on the information of W4, the floating coefficients Fc1 to Fc4 of each of the small blocks b1 to b4
Therefore, the information of the floating coefficients Fc1 to Fc4 is not transmitted. Accordingly, the number of bits for transmitting the floating coefficients Fc1 to Fc4 required for the critical band B can be reduced.

【００１９】ここで、上記許容ノイズレベルＮＬは、上
述したように、人間の聴覚特性を考慮した臨界帯域毎に
求められており、当該臨界帯域では、１つの臨界帯域内
で許容ノイズレベルが略一定と考えることができるもの
である。したがって、上記図１の臨界帯域Ｂ内の各小ブ
ロックｂ１〜ｂ４においても、許容ノイズレベルＮＬは
同レベルであると考えることができる。ただし、全体の
ダイナミックレンジを例えば１２０ｄＢとし、上記フロ
ーティング係数を上記６ビットで表すと、該フローティ
ング係数は約２ｄＢの精度を持つことになり、また、上
記ワード長情報を上記４ビットで表すと、該ワード長情
報は約６ｄＢの精度となる。このため、上記図１の各小
ブロックｂ１〜ｂ４において、後の復号化の際には、フ
ローティング係数Ｆｃ１〜Ｆｃ４と、上記ワード長Ｗ１
〜Ｗ４の情報とから決まる許容ノイズレベルＮＬは、約
２ｄＢステップのズレを持つようになる。しかし、許容
ノイズレベルＮＬは、通常、略±３ｄＢの範囲内に収ま
る。このようなことから、本実施例では、許容ノイズレ
ベルＮＬを臨界帯域内で共通の荒い量子化と、臨界帯域
内のフローティング処理の各小ブロックそれぞれの持つ
細かい量子化との２段階で表して当該許容ノイズレベル
ＮＬを精度高く共通値として設定している。すなわち、
本実施例では、許容ノイズレベルＮＬが４ビットｌｏｇ
レベルであるため、この４ビットｌｏｇで表しきれなか
った許容ノイズレベルＮＬを２ビットｌｏｇで細かく表
すようにしている。したがって本実施例では、約６ｄＢ
を４分割して１．５ｄＢ精度とすることが可能となる。
このように、上記許容ノイズレベルＮＬが複数の小ブロ
ックに渡って略等しいことを用い、上記高精度のパラメ
ータを上記フローティング係数と許容ノイズレベルから
選択する事でビット数を低減することがてきることにな
る。このような図１の例におけるビット数低減の様子を
上記表１と比較して表２に示す。Here, the permissible noise level NL is determined for each critical band in consideration of human auditory characteristics, as described above. In the critical band, the permissible noise level is substantially within one critical band. It can be considered constant. Therefore, in each of the small blocks b1 to b4 in the critical band B in FIG. 1, the allowable noise level NL can be considered to be the same level. However, if the entire dynamic range is, for example, 120 dB and the floating coefficient is represented by the 6 bits, the floating coefficient has an accuracy of about 2 dB, and if the word length information is represented by the 4 bits, The word length information has an accuracy of about 6 dB. Therefore, in each of the small blocks b1 to b4 in FIG. 1, at the time of subsequent decoding, the floating coefficients Fc1 to Fc4 and the word length W1
The allowable noise level NL determined from the information of .about.W4 has a deviation of about 2 dB steps. However, the allowable noise level NL usually falls within a range of approximately ± 3 dB. For this reason, in the present embodiment, the allowable noise level NL is expressed in two stages: a common rough quantization in the critical band, and a fine quantization of each small block of the floating process in the critical band. The permissible noise level NL is set as a common value with high accuracy. That is,
In this embodiment, the allowable noise level NL is 4 bits log.
The allowable noise level NL that could not be represented by the 4-bit log is finely represented by the 2-bit log. Therefore, in this embodiment, about 6 dB
Can be divided into four to achieve 1.5 dB accuracy.
As described above, the number of bits can be reduced by using the fact that the allowable noise level NL is substantially equal over a plurality of small blocks and selecting the high-precision parameter from the floating coefficient and the allowable noise level. Will be. Table 2 shows how the number of bits is reduced in the example of FIG. 1 in comparison with Table 1 above.

【表２】 [Table 2]

【００２０】この表２において、臨界帯域Ｂを１つのブ
ロックで表した場合（１分割）は、上記許容ノイズレベ
ルＮＬを４ビットで、ワード長Ｗを４ビットで伝送す
る。ただし、上記許容ノイズレベルＮＬにおいては、上
述したように、２ｄＢ分のズレを補償するための２ビッ
トを加えて（４＋２ビット）いる。このため、該１分割
では合計１０ビットが伝送されることになる。同じく、
臨界帯域Ｂを２つの小ブロックで表した場合（２分割）
は、許容ノイズレベルＮＬに４＋２×２＝８ビット、ワ
ード長Ｗに４×２＝８ビットで合計１６ビットが伝送さ
れることになる。以下同様に、３分割した場合は許容ノ
イズレベルＮＬに４＋２×３＝１０ビット、ワード長Ｗ
に４×３＝１２ビットの合計２２ビット、４分割（図１
の例）の場合は許容ノイズレベルＮＬに４＋２×４＝１
２ビット、ワード長Ｗに４×４＝１６ビットの合計２８
ビットが伝送されることになる。このため、表１の例の
伝送ビット数を１００％としてこの表２の例と比較する
と、表２の例の場合、１分割では同じく１００％となる
のに対し、２分割では８０％、３分割では７３％、４分
割では７０％のように、分割数（ブロック数）が増加す
る程、ビット低減率が向上するようになる。したがっ
て、本実施例の方法は非常に有効であることが理解でき
る。In Table 2, when the critical band B is represented by one block (one division), the allowable noise level NL is transmitted by 4 bits and the word length W is transmitted by 4 bits. However, in the allowable noise level NL, as described above, 2 bits for compensating for a shift of 2 dB are added (4 + 2 bits). Therefore, a total of 10 bits are transmitted in the one division. Similarly,
When the critical band B is represented by two small blocks (divided into two)
Means that 4 + 2 × 2 = 8 bits for the allowable noise level NL and 4 × 2 = 8 bits for the word length W, for a total of 16 bits. Similarly, when the signal is divided into three parts, the allowable noise level NL becomes 4 + 2 × 3 = 10 bits and the word length W
4 × 3 = 12 bits, a total of 22 bits, divided into four (FIG. 1
In the case of Example 2), the allowable noise level NL is 4 + 2 × 4 = 1.
2 bits, word length W, 4 × 4 = 16 bits, total 28
Bits will be transmitted. Therefore, comparing the example of Table 2 with the example of Table 2 where the number of transmission bits in the example of Table 1 is 100%, the example of Table 2 has the same value of 100% for one division, but 80% and 3% for the two divisions. As the number of divisions (the number of blocks) increases, such as 73% for division and 70% for four divisions, the bit reduction rate increases. Therefore, it can be understood that the method of this embodiment is very effective.

【００２１】なお、ワード長のＷ１〜Ｗ４の情報の量子
化は適応的でなく一様な量子化とされている。The quantization of the information of word lengths W1 to W4 is not adaptive but uniform.

【００２２】また、上述した図１のように臨界帯域より
も狭い帯域の小ブロックでフローティング処理を行う場
合に対して、例えば、ブロックフローティング処理を上
記臨界帯域よりも広い帯域の大ブロックで行う場合に
は、後の復号化の際にも、通常は、各臨界帯域毎のブロ
ックフローティング係数と、各臨界帯域毎に求められる
上記ワード長の情報が必要となる。これに対し、本実施
例においては、図２のようにブロックフローティング処
理を臨界帯域よりも広い帯域の大ブロックｂで行う場合
には、前記フローティング係数に関する情報として当該
１つの大ブロックｂのフローティング係数Ｆｃの情報
と、臨界帯域Ｂ１〜Ｂ４毎に求められるワード長Ｗ１〜
Ｗ４の情報とを伝送するようにする。なお、各臨界帯域
Ｂ１〜Ｂ４の許容ノイズレベルＮＬ１〜ＮＬ４の情報は
伝送しない。すなわち、各臨界帯域Ｂ１〜Ｂ４ではフロ
ーティング係数Ｆｃが等しいため、該１つのフローティ
ング係数Ｆｃの情報と各臨界帯域毎のワード長Ｗ１〜Ｗ
４の情報から、上記許容ノイズレベルＮＬ１〜ＮＬ４を
求めることができる。したがって上記フローティング係
数Ｆｃの情報と各ワード長Ｗ１〜Ｗ４の情報のみをサブ
情報として伝送することにより、通常よりもサブ情報の
ためのビット数を減らすことが可能となる。In contrast to the case where the floating process is performed on a small block having a band narrower than the critical band as shown in FIG. 1, for example, the case where the block floating process is performed on a large block having a band wider than the critical band. In the subsequent decoding, usually, the block floating coefficient for each critical band and the word length information obtained for each critical band are required. On the other hand, in the present embodiment, when the block floating process is performed on the large block b having a band wider than the critical band as shown in FIG. 2, the floating coefficient of the one large block b is used as information on the floating coefficient. Fc information and word lengths W1 to W1 determined for each of the critical bands B1 to B4.
The information of W4 is transmitted. In addition, the information of the allowable noise levels NL1 to NL4 of each of the critical bands B1 to B4 is not transmitted. That is, since the floating coefficients Fc are equal in each of the critical bands B1 to B4, the information of the one floating coefficient Fc and the word length W1 to W for each critical band
4, the allowable noise levels NL1 to NL4 can be obtained. Therefore, by transmitting only the information of the floating coefficient Fc and the information of each of the word lengths W1 to W4 as sub-information, it is possible to reduce the number of bits for the sub-information more than usual.

【００２３】該図２の例の場合のビット数低減の様子を
上記表１と比較して表３に示す。Table 3 shows how the number of bits is reduced in the example of FIG. 2 in comparison with Table 1 above.

【表３】 [Table 3]

【００２４】この表３において、１つのブロックｂ内に
統合される臨界帯域の数（統合数）用いて説明すると、
１ブロックｂ内に統合される帯域が１つの場合（１帯
域）は、上記フローティング係数Ｆｃのレベルに対する
情報に６ビット、ワード長Ｗに４ビットとして合計１０
ビットを伝送する。同じく、臨界帯域数が２つ（２帯
域）の場合は、フローティング係数Ｆｃに６ビット、ワ
ード長Ｗに４×２＝８ビットで合計１４ビットが伝送さ
れることになる。以下同様に、３帯域とした場合はフロ
ーティング係数Ｆｃに６ビット、ワード長Ｗに４×３＝
１２ビットの合計１８ビット、４帯域（図２の例）の場
合はフローティング係数Ｆｃに６ビット、ワード長Ｗに
４×４＝１６ビットの合計２２ビットが伝送されること
になる。このため、表２の例の伝送ビット数を１００％
としてこの表３の例と比較すると、表３の例の場合、１
帯域は同じく１００％となるのに対し、２帯域では７０
％、３帯域では６０％、４帯域では５５％のように、帯
域数が増加する程、ビット低減率が向上するようにな
る。したがって、本実施例の方法は非常に有効であるこ
とが理解できる。In Table 3, using the number of critical bands integrated in one block b (the number of integration),
When one band is integrated in one block b (one band), the information on the level of the floating coefficient Fc is 6 bits and the word length W is 4 bits, for a total of 10 bits.
Transmit bits. Similarly, when the number of critical bands is two (two bands), six bits are transmitted for the floating coefficient Fc and 4 × 2 = 8 bits for the word length W, for a total of 14 bits. Similarly, when three bands are used, the floating coefficient Fc has 6 bits and the word length W has 4 × 3 =
In the case of a total of 18 bits of 12 bits and 4 bands (example of FIG. 2), a total of 22 bits of 6 bits for the floating coefficient Fc and 4 × 4 = 16 bits for the word length W are transmitted. Therefore, the number of transmission bits in the example of Table 2 is set to
As compared with the example of Table 3, in the case of Table 3,
The bandwidth is also 100%, while the bandwidth is 70%.
%, The band reduction rate increases as the number of bands increases, such as 60% for the three bands and 55% for the four bands. Therefore, it can be understood that the method of this embodiment is very effective.

【００２５】なお、上記各実施例において、許容ノイズ
レベルＮＬとして取り得るレベルはダイナミックレンジ
よりも小さい。すなわち、Ｓ／Ｎの最小値が約３０ｄＢ
程度なのでダイナミックレンジのピーク値（０ｄＢ）か
ら約３０ｄＢの領域には、許容ノイズレベルが来ること
がなく、エンコーダではこのように設定することが可能
となる。In each of the above embodiments, the level that can be taken as the allowable noise level NL is smaller than the dynamic range. That is, the minimum value of S / N is about 30 dB.
Therefore, the allowable noise level does not come in a region of about 30 dB from the peak value (0 dB) of the dynamic range, and the encoder can make such a setting.

【００２６】上述した許容ノイズレベルを求め、更にこ
の許容ノイズレベルに基づいた上記割当ビット数で信号
を符号化する本実施例の符号化方法が適用される構成の
例を図３に示す。FIG. 3 shows an example of a configuration to which the encoding method of the present embodiment is applied in which the above-mentioned allowable noise level is obtained and the signal is encoded with the above-mentioned number of allocated bits based on the allowable noise level.

【００２７】すなわちこの図３において、入力端子１に
供給された時間軸上のディジタルオーディオデータが、
高速フーリエ変換回路１１に伝送される。この高速フー
リエ変換回路１１では、上記時間軸上のオーディオデー
タが単位時間毎（単位ブロック）に周波数軸上のデータ
に変換され、実数成分値Ｒｅと虚数成分値Ｉｍとからな
るＦＦＴ係数データが得られる。これらＦＦＴ係数デー
タは振幅位相情報発生回路１２に伝送され、当該振幅位
相情報発生回路１２では上記実数成分値Ｒｅと虚数成分
値Ｉｍとから振幅情報Ａｍと位相情報Ｐｈとが得られ
て、該振幅情報Ａｍの情報のみが出力されるようにな
る。すなわち、一般に人間の聴覚は周波数領域の振幅
（パワー）には敏感であるが、位相についてはかなり鈍
感であるため、本実施例では上記振幅情報Ａｍのみを用
いて上記割当ビット数情報を得るようにしている。That is, in FIG. 3, the digital audio data on the time axis supplied to the input terminal 1 is:
The signal is transmitted to the fast Fourier transform circuit 11. The fast Fourier transform circuit 11 converts the audio data on the time axis into data on the frequency axis for each unit time (unit block), and obtains FFT coefficient data including a real component value Re and an imaginary component value Im. Can be These FFT coefficient data are transmitted to the amplitude / phase information generation circuit 12, and the amplitude / phase information generation circuit 12 obtains amplitude information Am and phase information Ph from the real component value Re and the imaginary component value Im. Only the information of the information Am is output. That is, although human hearing is generally sensitive to the amplitude (power) in the frequency domain, it is rather insensitive to the phase. Therefore, in the present embodiment, the above-mentioned allocated bit number information is obtained using only the amplitude information Am. I have to.

【００２８】上記振幅情報Ａｍは、先ず帯域分割回路１
３に伝送される。該帯域分割回路１３では、上記振幅情
報Ａｍで表現された入力ディジタル信号をいわゆる臨界
帯域幅（クリティカルバンド）に分割している。この臨
界帯域幅とは、人間の聴覚特性（周波数分析能力）を考
慮したものであり、例えば０〜２２ｋＨｚを２５帯域に
分け、高い周波数帯域ほど帯域幅を広く選定しているも
のである。すなわち人間の聴覚は、一種のバンドパスフ
ィルタのような特性を有していて、この各フィルタによ
って分けられたバンドを臨界帯域と呼んでいる。The amplitude information Am is first sent to the band dividing circuit 1
3 is transmitted. The band dividing circuit 13 divides the input digital signal represented by the amplitude information Am into a so-called critical bandwidth (critical band). The critical bandwidth is based on human auditory characteristics (frequency analysis capability). For example, 0 to 22 kHz is divided into 25 bands, and the higher the frequency band, the wider the bandwidth is selected. That is, human hearing has characteristics like a kind of band-pass filter, and the band divided by each filter is called a critical band.

【００２９】上記帯域分割回路１３で臨界帯域に分割さ
れた各帯域毎の上記振幅情報Ａｍは、各々上記総和検出
回路１４に伝送される。この総和検出回路１４では、各
帯域毎のエネルギ（各帯域でのスペクトル強度）が、各
帯域内のそれぞれの振幅情報Ａｍの総和（振幅情報Ａｍ
のピーク又は平均或いはエネルギ総和）をとることによ
り求められる。該総和検出回路１４の出力すなわち各帯
域の総和のスペクトルは、一般にバークスペクトルと呼
ばれ、この各帯域のバークスペクトルＳＢは例えば図４
に示すようになる。ただし、図４では図示を簡略化する
ため、上記クリティカルバンドのバンド数を１２の帯域
（Ｂ₁〜Ｂ₁₂）で表現している。The amplitude information Am for each band divided into the critical bands by the band division circuit 13 is transmitted to the sum detection circuit 14, respectively. In the sum detection circuit 14, the energy for each band (spectral intensity in each band) is calculated by summing the amplitude information Am in each band (amplitude information Am).
By taking the peak or the average or the total energy). The output of the sum detection circuit 14, that is, the sum spectrum of each band is generally called a bark spectrum, and the bark spectrum SB of each band is, for example, as shown in FIG.
It becomes as shown in. However, in FIG. 4, the number of the critical bands is represented by 12 bands (B _{1 to} B ₁₂ ) for simplicity of illustration.

【００３０】ここで、上記バークスペクトルＳＢのいわ
ゆるマスキングに於ける影響を考慮するため、該バーク
スペクトルＳＢに所定の重みづけの関数を畳込む（コン
ボリューション）。このため、上記総和検出回路１４の
出力すなわち該バークスペクトルＳＢの各値は、フィル
タ回路１５に送られる。該フィルタ回路１５は、例え
ば、入力データを順次遅延させる複数の遅延素子と、こ
れら遅延素子からの出力にフィルタ係数（重みづけの関
数）を乗算する複数の乗算器（例えば各帯域に対応する
２５個の乗算器）と、各乗算器出力の総和をとる総和加
算器とから構成されるものである。このフィルタ回路１
５の各乗算器において、例えば、任意の帯域に対応する
乗算器Ｍでフィルタ係数１を、乗算器Ｍ−１でフィルタ
係数０．１５を、乗算器Ｍ−２でフィルタ係数０．００
１９を、乗算器Ｍ−３でフィルタ係数０．０００００８
６を、乗算器Ｍ＋１でフィルタ係数０．４を、乗算器Ｍ
＋２でフィルタ係数０．０６を、乗算器Ｍ＋３でフィル
タ係数０．００７を各遅延素子の出力に乗算することに
より、上記バークスペクトルＳＢの畳込み処理が行われ
る。ただし、Ｍは１〜２５の任意の整数である。この畳
込み処理により、図４中点線で示す部分の総和がとられ
る。なお、上記マスキングとは、人間の聴覚上の特性に
より、ある信号によって他の信号がマスクされて聞こえ
なくなる現象をいうものであり、このマスキング効果に
は、時間軸上のオーディオ信号に対するマスキング効果
と周波数軸上の信号に対するマスキング効果とがある。
すなわち、該マスキング効果により、マスキングされる
部分にノイズがあったとしても、このノイズは聞こえな
いことになる。このため、実際のオーディオ信号では、
このマスキングされる部分内のノイズは許容可能なノイ
ズとされる。Here, in order to consider the influence of the bark spectrum SB on so-called masking, a function of a predetermined weight is convolved with the bark spectrum SB (convolution). Therefore, the output of the sum detection circuit 14, that is, each value of the bark spectrum SB is sent to the filter circuit 15. The filter circuit 15 includes, for example, a plurality of delay elements for sequentially delaying input data and a plurality of multipliers (for example, 25 corresponding to each band) for multiplying an output from these delay elements by a filter coefficient (weighting function). Multipliers) and a sum adder for summing the outputs of the multipliers. This filter circuit 1
In each of the multipliers 5, for example, the multiplier M corresponding to an arbitrary band has the filter coefficient 1, the multiplier M- 1 has the filter coefficient 0.15, and the multiplier M- 2 has the filter coefficient 0.00.
19 is multiplied by a filter coefficient 0.000008 by the multiplier M-3.
6, the filter coefficient 0.4 by the multiplier M + 1 and the multiplier M
By multiplying the output of each delay element by the filter coefficient 0.06 by +2 and the filter coefficient 0.007 by the multiplier M + 3, the convolution process of the bark spectrum SB is performed. Here, M is an arbitrary integer of 1 to 25. By this convolution processing, the sum of the parts shown by the dotted lines in FIG. 4 is obtained. The masking refers to a phenomenon that a certain signal masks another signal and makes it inaudible due to human auditory characteristics.This masking effect includes a masking effect for an audio signal on a time axis. There is a masking effect on signals on the frequency axis.
That is, even if there is noise in the masked portion due to the masking effect, this noise will not be heard. Therefore, in an actual audio signal,
The noise in this masked portion is considered acceptable noise.

【００３１】その後、上記フィルタ回路１５の出力は引
算器１６に送られる。該引算器１６は、上記畳込んだ領
域での後述する許容可能なノイズレベルに対応するレベ
ルαを求めるものである。なお、当該許容可能なノイズ
レベル（許容ノイズレベル）に対応するレベルαは、後
述するように、逆コンボリューション処理を行うことに
よって、臨界帯域の各帯域毎の許容ノイズレベルとなる
ようなレベルである。ここで、上記引算器１６には、上
記レベルαを求めるための許容関数（マスキングレベル
を表現する関数）が供給される。この許容関数を増減さ
せることで上記レベルαの制御を行っている。当該許容
関数は、後述する関数発生回路２９から供給されている
ものである。Thereafter, the output of the filter circuit 15 is sent to a subtractor 16. The subtracter 16 calculates a level α corresponding to an allowable noise level described later in the convolved area. The level α corresponding to the permissible noise level (permissible noise level) is, as described later, a level at which the permissible noise level of each critical band is obtained by performing inverse convolution processing. is there. Here, an allowance function (a function expressing a masking level) for obtaining the level α is supplied to the subtractor 16. The level α is controlled by increasing or decreasing the allowable function. The permissible function is supplied from a function generation circuit 29 described later.

【００３２】すなわち、許容ノイズレベルに対応するレ
ベルαは、臨界帯域幅の帯域の低域から順に与えられる
番号をｉとすると、次の式で求めることができる。 α＝Ｓ−（ｎ−ａｉ）この式において、ｎ，ａは定数でａ＞０、Ｓは畳込み処
理されたバークスペクトルの強度であり、該式中（ｎ−
ａｉ）が許容関数となる。本実施例ではｎ＝３８，ａ＝
１としており、この時の音質劣化はなく、良好な符号化
が行えた。That is, the level α corresponding to the allowable noise level can be obtained by the following equation, where i is a number sequentially given from the lower band of the critical bandwidth. α = S− (n−ai) In this equation, n and a are constants and a> 0, and S is the intensity of the convolution-processed Bark spectrum.
ai) is an allowable function. In this embodiment, n = 38, a =
It was set to 1. At this time, there was no sound quality deterioration, and good encoding was performed.

【００３３】このようにして、上記レベルαが求めら
れ、このデータは、割算器１７に伝送される。当該割算
器１７では、上記畳込みされた領域での上記レベルαを
逆コンボリューションするためのものである。したがっ
て、この逆コンボリューション処理を行うことにより、
上記レベルαからマスキングスペクトルが得られるよう
になる。すなわち、このマスキングスペクトルが許容ノ
イズスペクトルとなる。なお、上記逆コンボリューショ
ン処理は、複雑な演算を必要とするが、本実施例では簡
略化した割算器１７を用いて逆コンボリューションを行
っている。In this way, the level α is obtained, and this data is transmitted to the divider 17. The divider 17 is for inversely convolving the level α in the convolved region. Therefore, by performing this inverse convolution processing,
A masking spectrum can be obtained from the level α. That is, this masking spectrum becomes an allowable noise spectrum. Note that the above inverse convolution process requires a complicated operation, but in the present embodiment, inverse convolution is performed using a simplified divider 17.

【００３４】次に、上記マスキングスペクトルは、合成
回路１８を介して減算器１９に伝送される。ここで、当
該減算器１９には、上記総和検出回路１４の出力すなわ
ち前述した総和検出回路１４からのバークスペクトルＳ
Ｂが、遅延回路２１を介して供給されている。したがっ
て、この減算器１９で上記マスキングスペクトルとバー
クスペクトルＳＢとの減算演算が行われることで、図５
に示すように、上記バークスペクトルＳＢは、該マスキ
ングスペクトルＭＳのレベルで示すレベル以下がマスキ
ングされることになる。Next, the masking spectrum is transmitted to a subtractor 19 via a synthesis circuit 18. Here, the subtractor 19 outputs the output of the sum detection circuit 14, that is, the bark spectrum S from the sum detection circuit 14 described above.
B is supplied via a delay circuit 21. Therefore, the subtractor 19 performs a subtraction operation between the masking spectrum and the bark spectrum SB, thereby obtaining a signal shown in FIG.
As shown in (1), the bark spectrum SB is masked below the level indicated by the level of the masking spectrum MS.

【００３５】当該減算器１９の出力は、上記許容ノイズ
レベル補正回路２０を介してＲＯＭ３０に送られる。該
ＲＯＭ３０には、上記振幅情報Ａｍの量子化に用いる複
数の割当ビット数情報が格納されており、上記減算回路
１９の出力（上記各帯域のエネルギと上記ノイズレベル
設定手段の出力との差分のレベル）に応じた割当ビット
数情報を出力するようになっている。したがって、量子
化回路２４では、この割当ビット数情報に基づいて、上
記振幅情報Ａｍの量子化処理が行われ、この出力が出力
端子２から出力される。なお、遅延回路２１は上記合成
回路１８以前の各回路での遅延量を考慮して上記総和検
出回路１４からのバークスペクトルＳＢを遅延させ、上
記遅延回路２３は上記ＲＯＭ３０以前の各回路での遅延
量を考慮して上記振幅情報Ａｍを遅延させるために設け
られている。The output of the subtracter 19 is sent to the ROM 30 via the allowable noise level correction circuit 20. The ROM 30 stores a plurality of pieces of assigned bit number information used for quantization of the amplitude information Am, and outputs the information of the subtraction circuit 19 (the difference between the energy of each band and the output of the noise level setting means). ), And outputs information on the number of allocated bits according to the level. Therefore, in the quantization circuit 24, the amplitude information Am is quantized based on the allocated bit number information, and the output is output from the output terminal 2. The delay circuit 21 delays the bark spectrum SB from the sum detection circuit 14 in consideration of the delay amount in each circuit before the synthesis circuit 18, and the delay circuit 23 delays the bark spectrum SB in each circuit before the ROM 30. It is provided to delay the amplitude information Am in consideration of the amount.

【００３６】また、上述した合成回路１８での合成の際
には、最小可聴カーブ発生回路２２から供給される図６
に示すような人間の聴覚特性であるいわゆる最小可聴カ
ーブＲＣを示すデータと、上記マスキングスペクトルＭ
Ｓとを合成することができる。この最小可聴カーブにお
いて、雑音絶対レベルがこの最小可聴カーブ以下ならば
該雑音は聞こえないことになる。更に、該最小可聴カー
ブは、コーディングが同じであっても例えば再生時の再
生ボリュームの違いで異なるものとなる。ただし、現実
的なディジタルシステムでは、例えば１６ビットダイナ
ミックレンジへの音楽のはいり方にはさほど違いがない
ので、例えば４ｋＨｚ付近の最も耳に聞こえやすい周波
数帯域の量子化雑音が聞こえないとすれば、他の周波数
帯域ではこの最小可聴カーブのレベル以下の量子化雑音
は聞こえないと考えられる。したがって、このように例
えばシステムの持つワードレングスの４ｋＨｚ付近の雑
音が聞こえない使い方をすると仮定し、この最小可聴カ
ーブＲＣとマスキングスペクトルＭＳとを共に合成する
ことで許容ノイズレベルを得るようにすると、この場合
の許容ノイズレベルは、図中斜線で示す部分までとする
ことができるようになる。なお、本実施例では、上記最
小可聴カーブの４ｋＨｚのレベルを、例えば２０ビット
相当の最低レベルに合わせている。また、この図６は、
信号スペクトルＳＳも同時に示している。In addition, at the time of synthesizing by the synthesizing circuit 18 described above, FIG.
And data indicating a so-called minimum audible curve RC which is a human auditory characteristic as shown in FIG.
And S can be synthesized. At this minimum audible curve, if the absolute noise level is below this minimum audible curve, the noise will not be heard. Further, even if the coding is the same, the minimum audible curve differs depending on, for example, a reproduction volume at the time of reproduction. However, in a realistic digital system, for example, there is not much difference in how to enter music into the 16-bit dynamic range. For example, if quantization noise in the most audible frequency band around 4 kHz is not heard, It is considered that quantization noise below the level of the minimum audible curve is not audible in other frequency bands. Therefore, assuming that the system is used so that noise around 4 kHz of the word length of the system cannot be heard, and an allowable noise level is obtained by synthesizing the minimum audible curve RC and the masking spectrum MS together, In this case, the permissible noise level can be set up to the shaded portion in the figure. In this embodiment, the 4 kHz level of the minimum audible curve is adjusted to the lowest level corresponding to, for example, 20 bits. Also, FIG.
The signal spectrum SS is also shown.

【００３７】ここで、上記許容ノイズレベル補正回路２
０では、補正値決定回路２８から送られてくるいわゆる
等ラウドネス曲線の情報に基づいて、上記減算器１９か
らの許容ノイズレベルを補正している。すなわち、上記
補正値決定回路２８からは、上記減算器１９からの許容
ノイズレベルを、いわゆる等ラウドネス曲線の情報デー
タに基づいて補正させるための補正値データが出力さ
れ、この補正値データが上記許容ノイズレベル補正回路
２０に伝送されることで、上記減算器１９からの許容ノ
イズレベルの等ラウドネス曲線を考慮した補正がなされ
るようになる。なお、上記等ラウドネス曲線とは、人間
の聴覚特性に関するものであり、例えば１ｋＨｚの純音
と同じ大きさに聞こえる各周波数での音の音圧を求めて
曲線で結んだもので、ラウドネスの等感度曲線とも呼ば
れる。また、該等ラウドネス曲線は、図６に示した最小
可聴カーブＲＣと略同じ曲線を描くものである。該等ラ
ウドネス曲線においては、例えば４ｋＨｚ付近では１ｋ
Ｈｚのところより音圧が８〜１０ｄＢ下がっても１ｋＨ
ｚと同じ大きさに聞こえ、逆に５０ｋＨｚ付近では１ｋ
Ｈｚでの音圧よりも約１５ｄＢ高くないと同じ大きさに
聞こえない。このため、上記最小可聴カーブのレベルを
越えた雑音（許容ノイズレベル）は、該等ラウドネス曲
線に応じたカーブで与えられる周波数特性を持つように
するのが良いことがわかる。このようなことから、上記
等ラウドネス曲線を考慮して上記許容ノイズレベルを補
正することは、人間の聴覚特性に適合していることがわ
かる。Here, the allowable noise level correction circuit 2
In the case of 0, the allowable noise level from the subtractor 19 is corrected based on information of a so-called equal loudness curve sent from the correction value determination circuit 28. That is, the correction value determination circuit 28 outputs correction value data for correcting the allowable noise level from the subtractor 19 based on information data of a so-called equal loudness curve. By being transmitted to the noise level correction circuit 20, the allowable noise level from the subtracter 19 is corrected in consideration of the equal loudness curve. The above-mentioned equal loudness curve relates to human auditory characteristics. For example, the equal loudness curve is obtained by obtaining sound pressures of sounds at each frequency that sounds as loud as a pure sound of 1 kHz and connecting them with a curve. Also called a curve. In addition, the equal loudness curve draws substantially the same curve as the minimum audible curve RC shown in FIG. In the equal loudness curve, for example, 1 k
1 kHz even if the sound pressure falls 8-10 dB below the Hz
Sounds the same size as z, 1k around 50kHz
Unless it is higher than the sound pressure at Hz by about 15 dB, the sound cannot be heard at the same level. For this reason, it can be seen that noise exceeding the level of the minimum audible curve (allowable noise level) preferably has a frequency characteristic given by a curve corresponding to the equal loudness curve. From this, it can be seen that correcting the allowable noise level in consideration of the equal loudness curve is suitable for human auditory characteristics.

【００３８】なお、本実施例においては、上述した最小
可聴カーブの合成処理を行わない構成とすることもでき
る。すなわち、この場合には、最小可聴カーブ発生回路
２２，合成回路１８が不要となり、上記引算器１６から
の出力は、割算器１７で逆コンボリューションされた
後、すぐに減算器１９に伝送されることになる。In this embodiment, a configuration may be adopted in which the above-described minimum audible curve synthesizing process is not performed. That is, in this case, the minimum audible curve generating circuit 22 and the synthesizing circuit 18 become unnecessary, and the output from the subtractor 16 is inversely convolved by the divider 17 and immediately transmitted to the subtractor 19. Will be done.

【００３９】[0039]

【発明の効果】本発明のディジタル信号符号化方法にお
いては、入力ディジタル信号を直交変換して臨界帯域に
分割し、各臨界帯域毎の許容ノイズレベルに基づいた適
応的な割り当てビット数で各臨界帯域の信号成分を符号
化すると共に、直交変換後の信号成分をブロックフロー
ティング処理して得たフローティング係数を伝送するも
のであり、フローティング処理を臨界帯域よりも狭い帯
域の小ブロックで行う場合には、各臨界帯域毎のフロー
ティング係数の代わりに、各臨界帯域毎の許容ノイズレ
ベルに関する情報及び各小ブロックのワード長情報を伝
送するようにしたことにより、フローティング係数のた
めのビット数を減らすことができ、また、ブロックフロ
ーティング処理を臨界帯域よりも広い帯域の大ブロック
で行う場合には、大ブロック毎のフローティング係数に
関する情報を伝送するようにしたことにより、伝送ビッ
ト数を低減することができるようになる。したがって、
より高いビット圧縮が可能となる。According to the digital signal encoding method of the present invention, an input digital signal is orthogonally transformed and divided into critical bands, and each critical band is adaptively allocated based on the allowable noise level of each critical band. In addition to encoding the signal components of the band and transmitting the floating coefficient obtained by performing block floating processing on the signal components after the orthogonal transformation, when performing the floating processing on a small block of a band narrower than the critical band, By transmitting the information on the permissible noise level for each critical band and the word length information of each small block instead of the floating coefficient for each critical band, it is possible to reduce the number of bits for the floating coefficient. Yes, and when performing block floating processing in a large block with a band wider than the critical band, By which is adapted to transmit information relating to floating coefficients for each block, it is possible to reduce the number of transmission bits. Therefore,
Higher bit compression is possible.

[Brief description of the drawings]

【図１】本発明のフローティング処理を臨界帯域よりも
狭いブロックで行う場合を説明するための図である。FIG. 1 is a diagram for explaining a case where a floating process of the present invention is performed in a block narrower than a critical band.

【図２】本発明のフローティング処理を臨界帯域よりも
広いブロックで行う場合を説明するための図である。FIG. 2 is a diagram for explaining a case where the floating process of the present invention is performed in a block wider than a critical band.

【図３】許容ノイズレベル設定のための具体的構成を示
すブロック回路図である。FIG. 3 is a block circuit diagram showing a specific configuration for setting an allowable noise level.

【図４】バークスペクトルを示す図である。FIG. 4 is a diagram showing a bark spectrum.

【図５】マスキングスペクトルを示す図である。FIG. 5 is a diagram showing a masking spectrum.

【図６】最小可聴カーブ，マスキングスペクトルを合成
した図である。FIG. 6 is a diagram in which a minimum audible curve and a masking spectrum are combined.

[Explanation of symbols]

Ｂ，Ｂ１〜Ｂ４・・・・帯域ｂ，ｂ１〜ｂ４・・・・ブロックＷ１〜Ｗ４・・・・・・ワード長ＮＬ，ＮＬ１〜ＮＬ４・・・許容ノイズレベル B, B1 to B4 ... band b, b1 to b4 ... block W1 to W4 ... word length NL, NL1 to NL4 ... allowable noise level

───────────────────────────────────────────────────── フロントページの続き (58)調査した分野(Int.Cl.⁷，ＤＢ名) H03M 7/30 ──────────────────────────────────────────────────続き Continued on the front page (58) Field surveyed (Int.Cl. ⁷ , DB name) H03M 7/30

Claims

(57) [Claims]

1. An input digital signal is orthogonally transformed and divided into critical bands, and a difference between an allowable noise level for each critical band set based on energy for each critical band and energy for each critical band is determined. The digital component that encodes the signal components of each critical band with the number of bits according to the level, blocks the signal components after the orthogonal transformation, performs block floating processing for each block, and transmits a floating coefficient for each block. In the signal encoding method, when the block floating process is performed in small block units of a band narrower than the critical band, information on an allowable noise level is transmitted instead of the floating coefficient for each critical band, Transmission of word length information according to the number of bits allocated to each small block Digital signal encoding method.

2. An input digital signal is orthogonally transformed and divided into critical bands, and the difference between the allowable noise level for each critical band set based on the energy for each critical band and the energy for each critical band is calculated. The digital component that encodes the signal components of each critical band with the number of bits according to the level, blocks the signal components after the orthogonal transformation, performs block floating processing for each block, and transmits a floating coefficient for each block. In the signal encoding method, when the block floating processing is performed in units of large blocks having a band wider than the critical band, information on a floating coefficient of each large block is transmitted, and allocation is performed for each of the critical bands. Digital signal code for transmitting word length information according to the number of bits Method.