JP2913696B2

JP2913696B2 - Digital signal encoding method

Info

Publication number: JP2913696B2
Application number: JP25580089A
Authority: JP
Inventors: 義仁藤原; 知子梅沢; 正之西口
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1989-09-30
Filing date: 1989-09-30
Publication date: 1999-06-28
Anticipated expiration: 2014-06-28
Also published as: JPH03117921A

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は、入力ディジタル信号の符号化を行うディジ
タル信号符号化方法に関するものである。Description: TECHNICAL FIELD The present invention relates to a digital signal encoding method for encoding an input digital signal.

[Summary of the Invention]

本発明は、入力ディジタル信号を複数の周波数帯域に
分割すると共に、高い周波数帯域ほどバンド幅を広く選
定し、各バンド毎のエネルギに基づいて各バンド単位の
許容ノイズレベルを設定し、各バンドのエネルギと設定
された許容ノイズレベルの差のレベルに応じたビット数
で各バンドの成分を量子化するディジタル信号符号化方
法において、量子化後の出力情報量を検出し、この検出
出力と目標値の誤差に応じて量子化の際の割当てビット
数を補正するようにして、所定期間における情報量を一
定化するようにしたことにより、簡単な構成で、信号劣
化の目立たないビットレート調整（ビットパッキング）
を行うことが可能なディジタル信号符号化方法を提供す
るものである。The present invention divides an input digital signal into a plurality of frequency bands, selects a wider bandwidth for a higher frequency band, sets an allowable noise level for each band based on energy for each band, and In a digital signal encoding method for quantizing the components of each band with the number of bits according to the level of the difference between the energy and the set allowable noise level, the amount of output information after quantization is detected, and the detected output and the target value By correcting the number of allocated bits at the time of quantization in accordance with the error of, and by stabilizing the amount of information in a predetermined period, the bit rate adjustment (bit packing)
The present invention provides a digital signal encoding method capable of performing the following.

[Conventional technology]

オーディオ，音声等の信号の高能率符号化において
は、オーディオ，音声等の入力信号を時間軸又は周波数
軸で複数のチャンネルに分割すると共に、各チャンネル
毎のビット数を適応的に割当てるビットアロケーシヨン
（ビット割当て）による符号化技術がある。例えば、オ
ーディオ信号等の上記ビット割当てによる符号化技術に
は、時間軸上のオーディオ信号等を複数の周波数帯域に
分割して符号化する帯域分割符号化（サブ・バンド・コ
ーディング:SBC）や、時間軸の信号を周波数軸上の信号
に変換（直交変換）して複数の周波数帯域に分割し各帯
域毎で適応的に符号化するいわゆる適応変換符号化（AT
C）、或いは、上記SBCといわゆる適応予測符号化（AP
C）とを組み合わせ、時間軸の信号を帯域分割して各帯
域信号をベースバンド（低域）に変換した後複数次の線
形予測分析を行って予測符号化するいわゆる適応ビット
割当て（APC−AB）等の符号化技術がある。In high-efficiency coding of signals such as audio and voice, a bit allocator that divides an input signal such as audio and voice into a plurality of channels along a time axis or a frequency axis and adaptively allocates the number of bits for each channel. There is an encoding technique based on a shion (bit allocation). For example, coding techniques based on the above-mentioned bit allocation of audio signals and the like include band division coding (sub-band coding: SBC) in which an audio signal or the like on the time axis is divided into a plurality of frequency bands and encoded. A so-called adaptive transform coding (AT) that converts a signal on the time axis into a signal on the frequency axis (orthogonal transform), divides the signal into a plurality of frequency bands, and adaptively codes each band.
C) or the above-mentioned SBC and so-called adaptive prediction coding (AP
C), a so-called adaptive bit allocation (APC-AB) in which a signal on the time axis is band-divided, each band signal is converted into a baseband (low band), and then multi-order linear prediction analysis is performed to perform predictive coding. ).

ここで例えば、上記帯域分割符号化においては、圧縮
効率を上げるために、一定の単位時間ブロック毎のビッ
トレートを一定に保ちながら、帯域分割した各バンドに
与えるビット数を信号スペクトル強度の時間変動に応じ
てダイナミックに（適応的に）変化させている。また、
上記適応変換符号化においては、周波数軸上でダイナミ
ックに割当てビット数を変化させている。Here, for example, in the above-mentioned band division coding, in order to increase the compression efficiency, while keeping the bit rate per unit time block constant, the number of bits to be given to each band divided band is represented by the time variation of the signal spectrum intensity. Is dynamically (adaptively) changed according to the Also,
In the above adaptive transform coding, the number of allocated bits is dynamically changed on the frequency axis.

[Problems to be solved by the invention]

上述のようなビット数を適応的に割当てる高能率符号
化においては、ビット数の割当ての仕方しだいで単位ブ
ロック（単位時間ブロック或いは単位周波数ブロック）
当たりのビットレートが一定にならない場合が生じ、ビ
ットの過不足が起こる場合がある。すなわち、与えられ
たビットレートに対して、時間軸或いは周波数軸上で信
号レベルの低い所ではビットが余り、信号レベルの高い
所ではビットが足りないという現象が起こることがあ
る。In the high-efficiency coding that adaptively allocates the number of bits as described above, a unit block (a unit time block or a unit frequency block) depends on how the number of bits is allocated.
The bit rate per hit may not be constant, which may result in excess or deficiency of bits. In other words, for a given bit rate, there may be a phenomenon in which bits are left behind at low signal levels on the time axis or frequency axis, and bits are insufficient at high signal levels.

このようなビット数の過不足があった場合には、効果
的にビット数を調整するようなビットレート調整（いわ
ゆるビットパッキング）の手法を用いる必要があった。When there is such an excess or deficiency in the number of bits, it is necessary to use a bit rate adjustment (so-called bit packing) technique for effectively adjusting the number of bits.

このビットレート調整とは、単位ブロックでビットが
余ったときには、該単位ブロックに与えるビット数を減
らし、ビットが足りないときには与えるビット数を増加
させるようにすることにより、全体のビットレートを一
定に調整するものである。当該ビットレート調整は、例
えば、第９図に示すような機能ブロックを用いて行うこ
とができると考えられる。この第９図において、入力デ
ィジタル信号に対して、割当てビット数決定機能ブロッ
ク90で量子化の際の割当てビット数を決定し、ビットレ
ート調整機能ブロック91で、入力ディジタル信号の単位
ブロックのスペクトル強度に応じたビットレートの調整
（ビットパッキング）を行う。その後、ビットレート調
整されて決定されたビット数を用いて符号化機能ブロッ
ク92で符号化（再量子化）が行われて出力される。This bit rate adjustment is to reduce the number of bits to be given to the unit block when bits remain in the unit block and to increase the number of bits to be given when bits are insufficient, thereby keeping the overall bit rate constant. It is to adjust. It is considered that the bit rate adjustment can be performed using, for example, a functional block as shown in FIG. In FIG. 9, for the input digital signal, the number of allocated bits at the time of quantization is determined by the allocated bit number determining function block 90, and the spectrum intensity of the unit block of the input digital signal is determined by the bit rate adjusting function block 91. Of the bit rate (bit packing) according to. After that, the encoding function block 92 performs encoding (requantization) using the number of bits determined by adjusting the bit rate, and outputs the result.

ところが、上記ビットレート調整機能ブロック92で行
われるビットレートの調整としては、未だ、簡単でかつ
信号の劣化が目立たない効果的な方法がなく、このた
め、従来より、効果的なビットレート調整の方法の確立
が望まれている。However, as the bit rate adjustment performed by the bit rate adjustment function block 92, there is still no effective method that is simple and signal deterioration is not conspicuous. The establishment of a method is desired.

そこで、本発明は、上述のような事情に鑑みて提案さ
れたものであり、簡単な構成で、信号の劣化の目立たな
いビットレート調整（ビットパッキング）を行うことが
できるディジタル信号符号化方法を提供することを目的
とするものである。Accordingly, the present invention has been proposed in view of the above-described circumstances, and provides a digital signal encoding method capable of performing a bit rate adjustment (bit packing) with a simple configuration in which signal degradation is not noticeable. It is intended to provide.

［課題を解決するための手段］本発明に係るディジタル信号符号化方法は、上述の課
題を解決するために提案されたものであり、入力ディジ
タル信号を複数の周波数帯域に分割すると共に、高い周
波数帯域ほどバンド幅を広く選定し、当該各バンド毎の
エネルギに基づいて各バンド単位の許容ノイズレベルを
設定するノイズレベル設定工程と、上記各バンドのエネ
ルギと上記設定された許容ノイズレベルとの差のレベル
に応じたビット数で上記各バンドの成分を量子化する量
子化工程とを有するディジタル信号符号化方法であっ
て、上記量子化工程により得られる出力情報量を検出
し、この検出された誤差に応じで、上記量子化工程での
割当てビット数を、全帯域で均等に増減するように補正
して、所定期間における情報量を一定化するようにした
ことを特徴とするものである。[Means for Solving the Problems] A digital signal encoding method according to the present invention has been proposed to solve the above-described problems, and divides an input digital signal into a plurality of frequency bands and has a high frequency band. A noise level setting step of selecting a wider bandwidth for each band and setting an allowable noise level for each band based on the energy of each band; and a difference between the energy of each band and the set allowable noise level. And a quantization step of quantizing the components of each band with the number of bits according to the level of the digital signal, wherein the amount of output information obtained by the quantization step is detected, and Depending on the error, the number of bits allocated in the quantization step is corrected so as to increase or decrease evenly in all bands, so that the information amount in a predetermined period is made constant. It is characterized by having made it.

［作用］本発明によれば、量子化手段の出力情報量の検出出力
と目標値の誤差に応じて量子化の際の割当てビット数を
一定量増減することで、全体のビットレートが一定に保
たれている。[Operation] According to the present invention, the total bit rate is made constant by increasing or decreasing the number of bits allocated at the time of quantization by a fixed amount according to the error between the detection output of the output information amount of the quantization means and the target value. Is kept.

〔Example〕

以下、本発明を適用した実施例について図面を参照し
ながら説明する。Hereinafter, embodiments of the present invention will be described with reference to the drawings.

本実施例のディジタル信号符号化方法が適用されたデ
ィジタル信号符号化装置、オーディオ或いは音声等の入
力ディジタル信号を、例えば、帯域分割符号化（SBC）
や、適応変換符号化（ATC）、適応ビット割当て（APC−
AB）等により高能率符号化するものである。そのため、
本実施例装置では、入力ディジタル信号を複数の周波数
帯域に分割すると共に、高い周波数帯域ほどバンド幅を
広く選定している。すなわち、後述する人間の聴覚特性
を考慮したいわゆる臨界帯域幅（クリティカルバンド）
で上記入力ディジタル信号を分割している。また、第１
図に示すように、当該クリティカルバンドの各バンド毎
のエネルギ（又はピーク値，平均値）に基づいて各バン
ド単位の許容ノイズレベルを設定するノイズレベル設定
手段としての総和検出回路14及びフィルタ回路15と、上
記各バンドのエネルギと上記ノイズレベル設定手段の差
のレベルに応じて割当てられたビット数で上記各バンド
の成分を量子化する量子化回路24とを有するものであ
る。ここで、上記量子化回路24の出力情報量を後述する
データ量演算回路26で検出し、該検出出力と端子３から
の所定の目標値（ビットレート調整のためのビットレー
トの目標値）との誤差が後述する誤差検出回路27で検出
される。その後、割算器28でこの誤差出力を割り込むこ
とにより、本実施例装置では、当該誤差出力に応じて上
記量子化回路24の割当てビット数を補正するようにし
て、所定期間における情報量を一定化（ビットレートの
調整）を行うようにしている。A digital signal encoding apparatus to which the digital signal encoding method of the present embodiment is applied, an input digital signal such as audio or voice, for example, band division encoding (SBC)
And adaptive transform coding (ATC), adaptive bit allocation (APC-
AB) etc. to perform high-efficiency coding. for that reason,
In the present embodiment, the input digital signal is divided into a plurality of frequency bands, and the higher the frequency band, the wider the bandwidth is selected. That is, a so-called critical bandwidth (critical band) in consideration of human hearing characteristics described later.
Divides the input digital signal. Also, the first
As shown in the figure, a sum detection circuit 14 and a filter circuit 15 as noise level setting means for setting an allowable noise level for each band based on the energy (or peak value, average value) of each band of the critical band. And a quantization circuit 24 that quantizes the components of each band with the number of bits assigned according to the energy of each band and the level of the difference between the noise level setting means. Here, the output information amount of the quantization circuit 24 is detected by a data amount calculation circuit 26 described later, and the detected output and a predetermined target value (a target value of a bit rate for adjusting a bit rate) from a terminal 3 are output. Is detected by an error detection circuit 27 described later. Thereafter, the error output is interrupted by the divider 28, so that the apparatus of the present embodiment corrects the number of bits allocated to the quantization circuit 24 in accordance with the error output so that the information amount in a predetermined period is kept constant. (Adjustment of bit rate).

ところで、このビットレート調整を行った結果、例え
ば、ビット数が不足してビットを取り上げた（減らし
た）場合にはオーディオ信号のS/Nは悪化し、逆に、ビ
ット数が余ってビット数を増やした場合にはS/Nは良く
なる。すなわち、ビットレート調整を行うことで、信号
スペクトルに対するノイズスペクトルの形状が変化して
しまうために、S/Nの良い所と悪い所が発生してしま
う。このため、本実施例のディジタル信号符号化装置で
は、このオーディオ信号のS/Nの変化の周波数軸上での
変化を一定に保つことにより、再量子化時に発生するノ
イズスペクトルの形状が変化しないようにしている。By the way, as a result of adjusting the bit rate, for example, if the number of bits is insufficient and the bits are picked up (reduced), the S / N of the audio signal is degraded, and conversely, the number of bits is The S / N will be better if is increased. That is, by performing the bit rate adjustment, the shape of the noise spectrum with respect to the signal spectrum changes, so that good and bad S / N occur. For this reason, in the digital signal encoding device of the present embodiment, the shape of the noise spectrum generated at the time of requantization does not change by keeping the change of the S / N change of the audio signal on the frequency axis constant. Like that.

換言すれば、第２図に示すように、信号スペクトルSS
に対して、ノイズスペクトルNSは図中一点鎖線で示すよ
うになるが、ダイナミックなビット割当てにより、単位
ブロックでビットレートが一定にならないときに該ビッ
ト割当ての調整を周波数軸上で行っているものである。
例えば、信号の周波数分析を行って、ノイズスペクトル
NSから均等にビットを取り上げたり増やしたりすること
で、第３図に示すように、ビット数を取り上げた（減ら
した）場合には第３図中の点線で示すノイズスペクトル
NS_-のように補正され、ビット数を増やした場合にはノ
イズスペクトルNS₊のように補正される。In other words, as shown in FIG.
On the other hand, the noise spectrum NS is indicated by a dashed line in the figure, but the dynamic bit allocation adjusts the bit allocation on the frequency axis when the bit rate is not constant in the unit block. It is.
For example, by performing a frequency analysis of the signal,
If the number of bits is taken up (decreased) as shown in FIG. 3 by taking up or increasing bits evenly from NS, the noise spectrum shown by the dotted line in FIG.
NS _- is corrected as, in the case of increasing the number of bits are corrected as noise spectrum NS _+.

その後、上記量子化回路24からの量子化出力は、バッ
ファメモリ25を介して本実施例のディジタル信号符号化
装置の出力端子２から出力されるようになる。Thereafter, the quantized output from the quantizing circuit 24 is output from the output terminal 2 of the digital signal encoding device of the present embodiment via the buffer memory 25.

ここで、第１図に示す本実施例のディジタル信号符号
化装置は、オーディオ信号，音声信号等を高速フーリエ
変換（FFT）して、時間軸の信号を周波数軸に変換し、
符号化（再量子化）を行うものである。Here, the digital signal encoding apparatus of the present embodiment shown in FIG. 1 performs a fast Fourier transform (FFT) on an audio signal, a voice signal, and the like, and converts a time-axis signal into a frequency axis.
The encoding (requantization) is performed.

第１図において、入力端子１には、例えばオーディオ
信号が供給されており、該時間軸上のオーディオ信号が
高速フーリエ変換回路11に伝送される。この高速フーリ
エ変換回路11では、該時間軸上のオーディオ信号が所定
時間毎に周波数軸上の信号に変換され、実数成分値Reと
虚数成分値ImとからなるFFT係数が得られる。これらFFT
係数は振幅位相情報発生回路12に伝送され、当該振幅位
相情報発生回路12では上記実数成分値Reと虚数成分値Im
とから振幅値Amと位相値とが得られて、この振幅値Amの
情報が出力されるようになる。すなわち、一般の人間の
聴覚は周波数領域の振幅（パワー）には敏感であるが、
位相についてはかなり鈍感であるため、本実施例では上
記振幅位相情報発生回路12の出力から上記振幅値Amのみ
を取り出し、これを本発明実施例での入力ディジタル信
号としている。In FIG. 1, for example, an audio signal is supplied to an input terminal 1, and the audio signal on the time axis is transmitted to a fast Fourier transform circuit 11. The fast Fourier transform circuit 11 converts the audio signal on the time axis into a signal on the frequency axis at predetermined time intervals, and obtains an FFT coefficient including a real component value Re and an imaginary component value Im. These FFTs
The coefficient is transmitted to the amplitude / phase information generation circuit 12, where the real component value Re and the imaginary component value Im
Then, the amplitude value Am and the phase value are obtained from, and information on the amplitude value Am is output. In other words, general human hearing is sensitive to amplitude (power) in the frequency domain,
Since the phase is very insensitive, in the present embodiment, only the amplitude value Am is extracted from the output of the amplitude / phase information generating circuit 12, and this is used as the input digital signal in the embodiment of the present invention.

このようにして得られた振幅値Am等の入力ディジタル
信号は、帯域分割回路13に伝送される。この帯域分割回
路13では、上記振幅値Amで表現された入力ディジタル信
号をいわゆる臨界帯域幅（クリティカルバンド）に分割
している。当該クリティカルバンドとは、人間の聴覚特
性（周波数分析能力）を考慮したものであり、例えば０
〜16kHzを24バンドに分け、高い周波数帯域ほどバンド
幅を広く選定しているものである。すなわち人間の聴覚
は、一種のバンドパスフィルタのような特性を有してい
て、この各フィルタによって分けられたバンドを臨界帯
域と呼んでいる。ここで、第４図に上記クリティカルバ
ンドを示す。ただし、この第４図では図示を簡略化する
ため、上記クリティカルバンドのバンド数を12バンド
（B₁〜B₁₂）で表現している。The input digital signal such as the amplitude value Am obtained in this way is transmitted to the band dividing circuit 13. The band division circuit 13 divides the input digital signal represented by the amplitude value Am into a so-called critical bandwidth (critical band). The critical band is based on human auditory characteristics (frequency analysis ability), and is, for example, 0.
1616 kHz is divided into 24 bands, and the higher the frequency band, the wider the bandwidth is selected. That is, human hearing has characteristics like a kind of band-pass filter, and the band divided by each filter is called a critical band. Here, FIG. 4 shows the critical band. However, in FIG. 4, for simplicity of illustration, the number of the critical bands is represented by 12 bands (B _{1 to} B ₁₂ ).

上記帯域分割回路13でクリティカルバンドに分割され
た各バンド（例えば24バンド）毎の上記振幅値Amは、各
々上記総和検出回路14に伝送される。この総和検出回路
14では、上記各バンド毎のエネルギ（各バンドでのスペ
クトル強度）が、各バンドのそれぞれの振幅値Amの総和
（振幅値Amのピーク又は平均或いはエネルギ総和）をと
ることにより求められる。該総和検出回路14の出力すな
わち各バンドの総和のスペクトルは、一般にバークスペ
クトルと呼ばれ、この各バンドのバークスペクトルSBは
例えば第５図に示すようになる。The amplitude value Am for each band (for example, 24 bands) divided into critical bands by the band division circuit 13 is transmitted to the sum detection circuit 14, respectively. This sum detection circuit
In 14, the energy for each band (spectral intensity in each band) is obtained by taking the sum of the amplitude values Am of each band (peak or average or the energy sum of the amplitude values Am). The output of the sum detection circuit 14, that is, the spectrum of the sum of each band is generally called a bark spectrum, and the bark spectrum SB of each band is as shown in FIG. 5, for example.

ここで、上記バークスペクトルSBのいわゆるマスキン
グに於ける影響を考慮するため、該バークスペクトルSB
に所定の重みづけの関数を畳込む（コンボリューショ
ン）。このため、上記総和検出回路14の出力すなわち該
バークスペクトルSBの各値は、フィルタ回路15に送られ
る。該フィルタ回路15は、第６図に示すように、入力端
子100からの入力データを順次遅延させる遅延素子
（z^-1）101₁,101₂‥‥101_m-2〜101_m+3‥‥101₂₃,101₂₄
（クリティカルバンドに対応した例えば24個の遅延素
子）と、各遅延素子101₁〜101₂₄からの出力にフィルタ
係数（重みづけの関数）を乗算する例えば24個の乗算器
102₁,102₂‥‥102_m-3〜102_m+3‥‥102₂₃,102₂₄と、総和
加算器104とから構成されるものである。この時、上記
乗算器102_m-3〜102_m+3において、例えば、乗算器102_m-3
でフィルタ係数0.0000086を乗算し、乗算器102_m-2でフ
ィルタ係数0.0019を、乗算器102_m-1でフィルタ係数0.15
を、乗算器102_mでフィルタ係数１を、乗算器102_m+1でフ
ィルタ係数0.4を、乗算器102_m+2でフィルタ係数0.06
を、更に乗算器102_m+3でフィルタ係数0.007を各遅延素
子の出力に乗算することにより、上記バークスペクトル
SBの畳込み処理が行われる。この畳込み処理により、第
５図中点線で示す部分の総和がとられる。なお、上記マ
スキングとは、人間の聴覚上の特性により、ある信号に
よって他の信号がマスクされて聞こえなくなる現象をい
うものであり、このマスキング効果には、時間軸上のオ
ーディオ信号に対するマスキング効果と周波数軸上の信
号に対するマスキング効果とがある。すなわち、該マス
キング効果により、マスキングされる部分にノイズが合
ったとしても、このノイズは聞こえないことになる。こ
のため、実際のオーディオ信号では、このマスキングさ
れる部分内のノイズは許容可能なノイズとされる。Here, in order to consider the influence of the bark spectrum SB on so-called masking, the bark spectrum SB
Is convolved with a function of a predetermined weight (convolution). Therefore, the output of the sum detection circuit 14, that is, each value of the bark spectrum SB is sent to the filter circuit 15. The filter circuit 15, as shown in FIG. 6, the delay element (z ^-1) for sequentially delaying input data from the input terminal _{_{100 101 1, 101 2 ‥‥ 101}} m-2 ~101 m + 3 ‥‥ 101 ₂₃ , 101 ₂₄
(The example 24 delay elements corresponding to the critical band) and, for example, 24 multipliers for multiplying the filter coefficients (a function of the weighted) to the output from the delay elements 101 ₁ to 101 ₂₄
102 ₁ , 102 ₂ ‥‥ 102 _{m−3 to} 102 _m _{+ 3} ‥‥ 102 ₂₃ , 102 ₂₄ and the sum adder 104. At this time, in the multipliers 102 _{m−3 to} 102 _m _{+ 3} , for example, the multiplier 102 _m−3
Is multiplied by a filter coefficient of 0.0000086, the filter coefficient is 0.0019 by the multiplier 102 _m-2 , and the filter coefficient is 0.15 by the multiplier 102 _m-1.
The multiplier 102 the filter coefficient 1 at _m, a multiplier 102 the filter coefficients 0.4 _{m + 1,} filter coefficient 0.06 at multiplier 102 _{m + 2}
Is further multiplied by a filter coefficient of 0.007 with a multiplier 102 _{m + 3} to the output of each delay element, thereby obtaining the Bark spectrum.
SB convolution processing is performed. By this convolution processing, the sum of the parts indicated by the dotted lines in FIG. 5 is obtained. The masking refers to a phenomenon that a certain signal masks another signal and makes it inaudible due to human auditory characteristics.This masking effect includes a masking effect for an audio signal on a time axis. There is a masking effect on signals on the frequency axis. That is, even if noise is matched to a masked portion by the masking effect, this noise will not be heard. For this reason, in an actual audio signal, the noise in the masked portion is regarded as acceptable noise.

その後、上記フィルタ回路15の出力は引算器16に送ら
れる。当該引算器16は、上記畳込んだ領域での後述する
許容可能なノイズレベルに対応するレベルαを求めるも
のである。なお、当該許容可能なノイズレベル（許容ノ
イズレベル）に対応するレベルαは、後述するように、
逆コンボリューション処理を行うことによって、クリテ
ィカルバンドの各バンド毎の許容ノイズレベルとなるよ
うなレベルである。ここで、上記引算器16には、上記レ
ベルαを求めるための許容関数（マスキングレベルを表
現する関数）が供給される。この許容関数を増減させる
ことで上記レベルαの制御を行っている。当該許容関数
は、後述する関数発生回路29から供給されているもので
ある。Thereafter, the output of the filter circuit 15 is sent to the subtractor 16. The subtracter 16 calculates a level α corresponding to an allowable noise level described later in the convolved region. The level α corresponding to the allowable noise level (allowable noise level) is, as described later,
By performing the inverse convolution processing, the level is such that the permissible noise level of each critical band is obtained. Here, an allowance function (a function expressing a masking level) for obtaining the level α is supplied to the subtractor 16. The level α is controlled by increasing or decreasing the allowable function. The permissible function is supplied from a function generation circuit 29 described later.

すなわち、許容ノイズレベルに対応するレベルαは、
クリティカルバンドのバンドの低域から順に与えられる
番号をｉとすると、第（１）式で求めることができる。That is, the level α corresponding to the allowable noise level is
Assuming that the number given in order from the low band of the critical band is i, the critical band can be obtained by Expression (1).

α＝Ｓ−（ｎ−ai） ‥‥‥‥（１）この第（１）式において、n,aは定数でａ＞０、Ｓは
畳込み処理されたバークスペクトルの強度であり、第
（１）式中（ｎ−ai）が許容関数となる。本実施例では
ｎ＝38,a＝１としており、この時の音質劣化はなく、良
好な符号化が行えた。α = S− (n−ai) ‥‥‥‥ (1) In the equation (1), n and a are constants and a> 0, and S is the intensity of the convolution-processed bark spectrum. 1) (n-ai) in the equation is an allowable function. In the present embodiment, n = 38, a = 1, and there was no deterioration in sound quality at this time, and good encoding was performed.

このようにして、上記レベルαが求められ、このデー
タは、割算器17に伝送される。当該割算器17では、上記
畳込み処理された領域での許容ノイズスペクトルを逆コ
ンボリューションするためのものである。したがって、
この逆コンボリューション処理を行うことにより、上記
レベルαからマスキングスペクトルが得られるようにな
る。すなわち、このマスキングスペクトルが許容ノイズ
スペクトルとなる。なお、上記逆コンボリューション処
理は、複雑な演算を必要とするが、本実施例では、簡略
化した割算器17を用いて逆コンボリューションを行って
いる。Thus, the level α is obtained, and this data is transmitted to the divider 17. The divider 17 is for inversely convolving the allowable noise spectrum in the convolved region. Therefore,
By performing the inverse convolution processing, a masking spectrum can be obtained from the level α. That is, this masking spectrum becomes an allowable noise spectrum. Note that the above inverse convolution process requires a complicated operation, but in the present embodiment, inverse convolution is performed using a simplified divider 17.

次に、上記マスキングスペクトルは、合成回路18を介
して減算器19に伝送される。ここで、当該減算器19に
は、上記総和検出回路14の出力すなわち前述した総和検
出回路14からのバークスペクトルSBが、遅延回路21を介
して供給されている。したがって、この減算器19で上記
マスキングスペクトルとバークスペクトルSBとの減算演
算が行われることで、第７図に示すように、バースクペ
クトルSBは、該マスキングスペクトルMSの各レベルで示
すレベル以下がマスキングされることになる。Next, the masking spectrum is transmitted to the subtractor 19 via the synthesis circuit 18. Here, the output of the sum detection circuit 14, that is, the bark spectrum SB from the sum detection circuit 14 described above is supplied to the subtractor 19 via the delay circuit 21. Accordingly, by performing the subtraction operation of the masking spectrum and the bark spectrum SB in the subtractor 19, as shown in FIG. 7, the burspectrum SB has a level below the level indicated by each level of the masking spectrum MS. It will be masked.

当該減算器19の出力は、ROM20を介して量子化回路24
に供給されており、上記量子化回路24では、この減算器
19の出力に基づいて割当てられたビット数で、遅延回路
23を介して供給されている振幅値Amの量子化を行うこと
になる。すなわち、換言すれば、当該量子化回路24で
は、上記クリティカルバンドの各バンドのエネルギと上
記ノイズレベル設定手段の差のレベルに応じて割当てら
れたビット数で上記各バンドの成分を量子化することに
なる。なお、上記遅延回路21は上記合成回路18以前の各
回路での遅延量を考慮して上記総和検出回路14からのバ
ークスペクトルSBを遅延させ、上記遅延回路23は上記RO
M20以前の各回路での遅延量を考慮して上記振幅値Amを
遅延させるために設けられている。また、上記ROM20は
量子化の際の所定時間毎の上記減算器19の出力を一時格
納して送り出すために設けられている。The output of the subtracter 19 is supplied to the quantization circuit 24 via the ROM 20.
In the quantization circuit 24, the subtracter
Delay circuit with the number of bits allocated based on 19 outputs
The quantization of the amplitude value Am supplied via 23 is performed. In other words, in other words, the quantization circuit 24 quantizes the components of each band with the number of bits allocated according to the energy of each band of the critical band and the level of the difference between the noise level setting means. become. Note that the delay circuit 21 delays the bark spectrum SB from the sum detection circuit 14 in consideration of the delay amount in each circuit before the synthesis circuit 18, and the delay circuit 23
It is provided to delay the amplitude value Am in consideration of the delay amount in each circuit before M20. Further, the ROM 20 is provided for temporarily storing and sending out the output of the subtractor 19 every predetermined time at the time of quantization.

ここで、上記量子化回路24の量子化の際には、前述し
たように、上記量子化回路24の出力情報量を検出し、該
検出出力と目標値の誤差に応じて量子化の際に割当てビ
ット数を補正するようにして、所定期間における情報量
を一定化するようにしている。Here, at the time of quantization of the quantization circuit 24, as described above, the output information amount of the quantization circuit 24 is detected, and at the time of quantization according to the error between the detected output and the target value. By correcting the number of allocated bits, the amount of information in a predetermined period is made constant.

このようなことを行うため、本実施例装置において
は、上記バッファメモリ25からのデータは、データ量演
算回路26によってデータ量が求められた後、誤差検出回
路27に送られる。当該誤差検出回路27では、上記データ
量と端子３からの所定の目標値（ビットレート調整のた
めのビットレートの目標値）との誤差が検出され、その
誤差データは割算回路28に伝送される。当該割算回路28
では、所定期間としての単位時間ブロック（１フレー
ム）内の総サンプル数で上記誤差データが割り込まれ、
この割算データが、足算器31に伝送される。したがっ
て、当該足算器31では、上記ROM20の出力に上記割算デ
ータが加算されることで、量子化の際の割当てビット数
の補正（ビットレートの調整）が行われることになり、
上記所定期間における情報量を一定化するようになる。
ここで、当該足算器31の出力は、四捨五入回路32を介し
て上記量子化回路24に供給されているため、ビット数の
細かな変化が矯正されている。To do this, in the present embodiment, the data from the buffer memory 25 is sent to the error detection circuit 27 after the data amount is calculated by the data amount calculation circuit 26. The error detection circuit 27 detects an error between the data amount and a predetermined target value (a target value of a bit rate for adjusting a bit rate) from the terminal 3, and the error data is transmitted to a division circuit 28. You. The division circuit 28
Then, the error data is interrupted by the total number of samples in a unit time block (one frame) as a predetermined period,
The division data is transmitted to the adder 31. Therefore, in the adder 31, by adding the division data to the output of the ROM 20, the number of allocated bits at the time of quantization is corrected (adjustment of the bit rate).
The amount of information during the predetermined period is made constant.
Here, since the output of the adder 31 is supplied to the quantization circuit 24 via the rounding circuit 32, a small change in the number of bits is corrected.

なお、上述した合成回路18での合成の際には、最小可
聴カーブ発生回路22から供給される第８図に示すような
人間の聴覚特性であるいわゆる最小可聴カーブ（等ラウ
ドネス曲線）RCを示すデータと、上記マスキングスペク
トルMSとを合成することができる。したがって、この最
小可聴カーブRCとマスキングスペクトルMSとを共に合成
することで、許容ノイズレベルは図中斜線で示す部分ま
でとすることができるようになり、量子化の際に図中鎖
線で示す部分の割当てビット数を減らすことができるよ
うになる。なお、この第８図は、前述の第４図に示した
クリティカルバンドで表わされており、信号スペクトル
SSも同時に示している。At the time of the synthesis by the synthesizing circuit 18, the so-called minimum audible curve (equal loudness curve) RC, which is a human auditory characteristic as shown in FIG. The data and the masking spectrum MS can be synthesized. Therefore, by synthesizing the minimum audible curve RC and the masking spectrum MS together, the allowable noise level can be reduced to the portion indicated by the oblique line in the figure, and the portion indicated by the chain line in the diagram at the time of quantization. Can be reduced. FIG. 8 is represented by the critical band shown in FIG.
SS is also shown.

ただし、本実施例においては、上述した最小可聴カー
ブの合成処理を行わない構成とすることもできる。すな
わち、この場合には、最小可聴カーブ発生回路22,合成
回路18が不要となり、上記引算器16からの出力は、割算
器17で逆コンボリューションされた後、すぐに減算器19
に伝送されることになる。However, in the present embodiment, a configuration may be adopted in which the above-described minimum audible curve synthesis processing is not performed. That is, in this case, the minimum audible curve generating circuit 22 and the synthesizing circuit 18 become unnecessary, and the output from the subtracter 16 is deconvolved by the divider 17 and immediately thereafter,
Will be transmitted.

すなわち、本実施例のディジタル信号符号化装置にお
いては、オーディオ信号を符号化する際に、上述したよ
うにビットレート調整を行い、この時、ノイズスペクト
ルの形状が変化しないので、ビット数を減らした場合で
も、聴感上の劣化が少なくてすむ。また、ビットレート
調整のアルゴリズムが容易なのでハードウェアを簡単に
構成することができる。That is, in the digital signal encoding apparatus of the present embodiment, when encoding the audio signal, the bit rate was adjusted as described above, and at this time, the number of bits was reduced because the shape of the noise spectrum did not change. Even in this case, there is little deterioration in hearing. Also, since the algorithm for adjusting the bit rate is easy, the hardware can be easily configured.

本発明は、上述した第１図の実施例のように、入力デ
ィジタル信号を高速フーリエ変換して処理するいわゆる
適応変換符号化の他に、例えば、帯域分割符号化（SB
C）を行う装置にも適用することができる。この場合
は、信号をバンドパスフィルタ等で帯域分割して、この
各チャンネルに割り当てるビット数を量子化手段の出力
情報量の検出出力と目標値の誤差に応じて一定増減させ
るものとなる。当該帯域分割符号化の場合も上述同様の
効果を得ることができる。In the present invention, in addition to the so-called adaptive transform coding for processing an input digital signal by performing a fast Fourier transform as in the embodiment of FIG.
The present invention can be applied to an apparatus that performs C). In this case, the signal is band-divided by a band-pass filter or the like, and the number of bits allocated to each channel is increased or decreased by a certain amount in accordance with the error between the output output of the quantizer and the target value. In the case of the band division coding, the same effect as described above can be obtained.

［発明の効果］本発明のディジタル信号符号化方法によれば、量子化
後の出力情報量を検出し、該検出出力と目標値との誤差
を検出し、この検出されたに応じて、上記量子化工程で
の割当てビット数を、全帯域で均等に増減するように補
正して、所定期間における情報量を一定化するようにし
たことにより、計算量が少なく、簡単な構成にも拘ら
ず、信号劣化の目立たないビットレート調整（ビットパ
ッキング）を行うことが可能となる。したがって、例え
ば、量子化のビット数を減らしても聴感上の劣化を少な
くすることができるようになる。[Effects of the Invention] According to the digital signal encoding method of the present invention, the amount of output information after quantization is detected, an error between the detected output and a target value is detected, and in accordance with this detection, The number of bits allocated in the quantization step is corrected so as to increase or decrease evenly in all bands, so that the information amount in a predetermined period is made constant, so that the amount of calculation is small and despite the simple configuration, In addition, it is possible to perform bit rate adjustment (bit packing) in which signal deterioration is not conspicuous. Therefore, for example, even if the number of bits for quantization is reduced, deterioration in audibility can be reduced.

[Brief description of the drawings]

第１図は本発明の一実施例のディジタル信号符号化方法
が適用されたディジタル信号符号化装置の概略構成を示
すブロック回路図、第２図はビットレート調整する前の
ノイズスペクトルを示す特性図、第３図はビットレート
調整後のノイズスペクトルを示す特性図、第４図はクリ
ティカルバンドを示す図、第５図はバークスペクトルを
示す図、第６図はフィルタ回路を示す回路図、第７図は
マスキングスペクトルを示す図、第８図は最小可聴カー
ブ，マスキングスペクトルを合成した図、第９図はビッ
トレート調整のための機能ブロック図である。 11……高速フーリエ変換回路 12……振幅位相情報発生回路 13……帯域分割回路 14……総和検出回路 15……フィルタ回路 16……引算器 17,28……割算器 18……合成回路 19……減算器 20……ROM 21,23……遅延回路 22……最小可聴カーブ発生回路 24……量子化回路 25……バッファメモリ 26……データ量演算回路 27……誤差検出回路 29……関数発生回路 31……足算器 32……四捨五入回路FIG. 1 is a block circuit diagram showing a schematic configuration of a digital signal encoding apparatus to which a digital signal encoding method according to one embodiment of the present invention is applied, and FIG. 2 is a characteristic diagram showing a noise spectrum before bit rate adjustment. 3 is a characteristic diagram showing a noise spectrum after bit rate adjustment, FIG. 4 is a diagram showing a critical band, FIG. 5 is a diagram showing a bark spectrum, FIG. 6 is a circuit diagram showing a filter circuit, and FIG. The figure shows a masking spectrum, FIG. 8 is a view in which a minimum audible curve and a masking spectrum are combined, and FIG. 9 is a functional block diagram for adjusting a bit rate. 11 Fast Fourier transform circuit 12 Amplitude / phase information generation circuit 13 Band division circuit 14 Sum detection circuit 15 Filter circuit 16 Subtractor 17, 28 Divider 18 Synthesis Circuit 19 Subtractor 20 ROM 21, 23 Delay circuit 22 Minimum audible curve generation circuit 24 Quantization circuit 25 Buffer memory 26 Data amount calculation circuit 27 Error detection circuit 29 …… Function generation circuit 31 …… Adder 32 …… Rounding circuit

フロントページの続き (56)参考文献特開平３−117920（ＪＰ，Ａ) 特開昭60−96041（ＪＰ，Ａ) 特開昭62−57621（ＪＰ，Ａ) 特開昭61−110200（ＪＰ，Ａ) Ｐｒｏｃｅｅｄｉｎｇｓ．ＩＥＥＥＩｎｔｅｒｎａｔｉｏｎａｌＣｏｎｆｅｒｎｃｅｏｎＡｃｏｕｓｔｉｃｓ，ＳｐｅｅｃｈａｎｄＳｉｇｎａｌＰｒｏｃｅｓｓｉｎｇ（1988）Ｖｏｌ．５，ｐ．2524−2527 Ｐｒｏｃｅｅｄｉｎｇｓ．ＩＥＥＥＩｎｔｅｒｎａｔｉｏｎａｌＣｏｎｆｅｒｅｎｃｅｏｎＡｃｏｕｓｔｉｃｓ，ＳｐｅｅｃｈａｎｄＳｉｇｎａｌＰｒｏｃｅｓｓｉｎｇ（1989）Ｖｏｌ．３，ｐ．1993−1996 昭和59年度電子通信学会通信部門全国大会講演論文集［分冊１］第１−389〜 390Continuation of the front page (56) References JP-A-3-117920 (JP, A) JP-A-60-96041 (JP, A) JP-A-62-57621 (JP, A) JP-A-61-110200 (JP, A) , A) Proceedings. IEEE International Conferencing on Acoustics, Speech and Signal Processing (1988) Vol. 5, p. 2524-2527 Proceedings. IEEE International Conferencing on Acoustics, Speech and Signal Processing (1989) Vol. 3, p. 1993-1996 Proceedings of the National Meeting of the Institute of Electronics, Communications and Communication Engineers, 1984-1984 [Volume 1] 1-389-390

Claims

(57) [Claims]

1. A noise level for dividing an input digital signal into a plurality of frequency bands, selecting a wider bandwidth for a higher frequency band, and setting an allowable noise level for each band based on the energy of each band. A digital signal encoding method, comprising: a setting step; and a quantization step of quantizing components of each band with a number of bits corresponding to a level of a difference between the energy of each band and the set allowable noise level. Detecting the amount of output information obtained in the quantization step, detecting an error between the detected output and a target value, and calculating the number of bits allocated in the quantization step in accordance with the detected error. A digital signal encoding method, wherein a correction is performed so as to increase and decrease evenly in a band, and an information amount in a predetermined period is made constant.