JPH08167247A

JPH08167247A - High-efficiency encoding method and device as well as transmission medium

Info

Publication number: JPH08167247A
Application number: JP6311802A
Authority: JP
Inventors: Makoto Kono; 誠光野
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1994-12-15
Filing date: 1994-12-15
Publication date: 1996-06-25

Abstract

PURPOSE: To make it possible to execute high-efficiency encoding with good efficiency in effectively utilizing a sound hearing characteristic, to prevent the deterioration in sound quality at a low bit rate and to improve the sound quality at the same bit rate. CONSTITUTION: This high-efficiency encoding device breaks down input signals to plural frequency band components to provide signal components within plural two-dimensional blocks relating to time and frequency, determines the quantization coefft. expressing the characteristics of the signal components by every two-dimensional block, determines a bit distribution quantity in accordance therewith, compresses the information by quantizing the signal components therein by every two-dimensional block and outputs the compressed information together with the information compression parameters by every two-dimensional block. The device has a harmonic component detecting circuit 813 for detecting the harmonic relation possessed by the input signals and a bit fraction adjusting circuit 814 from a band-by-band energy calculating circuit 803 as constitution to determine the bit distribution quantity. The degree of concn. of the bit distribution is enhanced for the two-dimensional blocks in which the respective harmonic components are included at the time of determining the bit distribution quantity.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、ディジタルオーディオ
信号等のディジタル信号をビット圧縮する高能率符号化
方法及び装置、その圧縮データを伝送する伝送媒体（記
録媒体も含む）に関し、特に、処理ブロック毎のエネル
ギーの形状に応じて、該当処理ブロックのビット割当量
を変化させるような高能率符号化方法及び装置、並びに
伝送媒体に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a high-efficiency coding method and device for bit-compressing a digital signal such as a digital audio signal, and a transmission medium (including a recording medium) for transmitting the compressed data, and more particularly to a processing block. The present invention relates to a high-efficiency coding method and device, and a transmission medium, which changes the bit allocation amount of a corresponding processing block according to the shape of energy for each.

【０００２】[0002]

【従来の技術】オーディオ或いは音声等の信号の高能率
符号化の手法及び装置には種々あるが、例えば、時間領
域のオーディオ信号等を単位時間毎にブロック化してこ
のブロック毎の時間軸の信号を周波数軸上の信号に変換
（直交変換）して複数の周波数帯域に分割し、各帯域毎
に符号化するブロック化周波数帯域分割方式であるいわ
ゆる変換符号化方式や、時間領域のオーディオ信号等を
単位時間毎にブロック化しないで、複数の周波数帯域に
分割して符号化する非ブロック化周波数帯域分割方式で
ある帯域分割符号化（サブ・バンド・コーディング：Ｓ
ＢＣ）方式等を挙げることができる。また、上述の帯域
分割符号化と変換符号化とを組み合わせた高能率符号化
の手法及び装置も考えられており、この場合には、例え
ば、上記帯域分割符号化方式で帯域分割を行った後、該
各帯域毎の信号を上記変換符号化方式で周波数領域の信
号に直交変換し、この直交変換された各帯域毎に符号化
を施すことになる。2. Description of the Related Art There are various techniques and devices for high-efficiency coding of audio or voice signals. For example, a time domain audio signal is divided into blocks for each unit time, and a signal on the time axis of each block is used. Is converted into a signal on the frequency axis (orthogonal conversion), divided into multiple frequency bands, and coded for each band, so-called transform coding method, which is a so-called transform coding method, time domain audio signal, etc. , Which is a non-blocking frequency band division method that encodes by dividing into a plurality of frequency bands without blocking each unit time (sub-band coding: S
BC) method etc. can be mentioned. Further, a method and apparatus for high efficiency coding in which the above band division coding and transform coding are combined are also considered, and in this case, for example, after performing band division by the above band division coding method. The signals in the respective bands are orthogonally transformed into signals in the frequency domain by the transform coding method, and the respective orthogonally transformed bands are encoded.

【０００３】ここで、上述した帯域分割符号化方式に使
用される帯域分割用フィルタとしては、例えばＱＭＦ(Q
uadrature Mirror filter)等のフィルタがあり、これは
例えば文献「ディジタル・コーディング・オブ・スピー
チ・イン・サブバンズ」("Digital coding of speech i
n subbands" R.E.Crochiere, Bell Syst.Tech. J.,Vo
l.55,No.8 1976) に述べられている。このＱＭＦのフィ
ルタは、帯域を等バンド幅に２分割するものであり、当
該フィルタにおいては上記分割した帯域を後に合成する
際にいわゆるエリアシングが発生しないことが特徴とな
っている。また、文献「ポリフェーズ・クワドラチァ・
フィルタ −新しい帯域分割符号化技術」("Polyphase
Quadrature filters -A new subband coding techniqu
e", Joseph H. Rothweiler ICASSP 83, BOSTON)には、
等帯域幅のフィルタ分割手法が述べられている。このポ
リフェーズ・クワドラチァ・フィルタにおいては、信号
を等バンド幅の複数の帯域に分割する際に一度に分割で
きることが特徴となっている。Here, as a band division filter used in the above-mentioned band division encoding method, for example, QMF (Q
There is a filter such as uadrature Mirror filter), which is, for example, the document "Digital coding of speech in subvans".
n subbands "RE Crochiere, Bell Syst.Tech. J., Vo
l.55, No.8 1976). This QMF filter divides the band into two equal bandwidths, and is characterized in that so-called aliasing does not occur when the divided bands are combined later. In addition, the document “Polyphase Quadratic
Filter-New Band Division Coding Technique "(" Polyphase
Quadrature filters -A new subband coding techniqu
e ", Joseph H. Rothweiler ICASSP 83, BOSTON)
Equal bandwidth filter partitioning techniques are described. This polyphase quadrature filter is characterized in that when a signal is divided into a plurality of bands of equal bandwidth, it can be divided at once.

【０００４】また、上述した直交変換としては、例え
ば、入力オーディオ信号を所定単位時間（フレーム）で
ブロック化し、当該ブロック毎に高速フーリエ変換（Ｆ
ＦＴ）や、離散コサイン変換（ＤＣＴ）、モディファイ
ドＤＣＴ変換（ＭＤＣＴ）などを行うことで時間軸を周
波数軸に変換するような直交変換がある。このＭＤＣＴ
については、文献「時間領域エリアシング・キャンセル
を基礎とするフィルタ・バンク設計を用いたサブバンド
／変換符号化」("Subband/Transform Coding Using Fil
ter Bank Designs Based on Time Domain Aliasing Can
cellation," J.P.Princen A.B.Bradley, Univ. of Surr
ey Royal Melbourne Inst. of Tech. ICASSP 1987)に述
べられている。As the above-mentioned orthogonal transform, for example, the input audio signal is divided into blocks in a predetermined unit time (frame), and the fast Fourier transform (F
FT), Discrete Cosine Transform (DCT), Modified DCT Transform (MDCT), etc. are used to transform the time axis into the frequency axis. This MDCT
For the paper "Subband / Transform Coding Using Fil".
ter Bank Designs Based on Time Domain Aliasing Can
cellation, "JPPrincen ABBradley, Univ. of Surr
ey Royal Melbourne Inst. of Tech. ICASSP 1987).

【０００５】更に、周波数帯域分割された各周波数成分
を量子化する場合の周波数分割幅としては、例えば人間
の聴覚特性を考慮した帯域分割がある。すなわち、一般
に臨界帯域（クリティカルバンド）と呼ばれている高域
ほど帯域幅が広くなるような帯域幅で、オーディオ信号
を複数（例えば２５バント）の帯域に分割することがあ
る。また、この時の各帯域毎のデータを符号化する際に
は、各帯域毎に所定のビット配分或いは、各帯域毎に適
応的なビット配分による符号化が行われる。例えば、上
記ＭＤＣＴ処理されて得られたＭＤＣＴ係数データを上
記ビット配分によって符号化する際には、上記各ブロッ
ク毎のＭＤＣＴ処理により得られる各帯域毎のＭＤＣＴ
係数データに対して、適応的な配分ビット数で符号化が
行われることになる。Further, as a frequency division width in the case of quantizing each frequency component divided into frequency bands, for example, there is a band division considering human auditory characteristics. That is, an audio signal may be divided into a plurality of bands (for example, 25 bands) with a bandwidth that is generally called a critical band and has a wider bandwidth in a higher band. Further, at the time of encoding the data for each band at this time, encoding is performed by predetermined bit allocation for each band or adaptive bit allocation for each band. For example, when the MDCT coefficient data obtained by the MDCT process is encoded by the bit allocation, the MDCT for each band obtained by the MDCT process for each block is performed.
Coding is performed on the coefficient data with an adaptive distribution bit number.

【０００６】上記ビット配分手法及びそのための装置と
しては、次の２手法及び装置が知られている。例えば、
文献「音声信号の適応変換符号化」（"Adaptive Transf
ormCoding of Speech Signals", IEEE Transactions of
Accoustics, Speech, andSignal Processing, vol.ASS
P-25, No.4, August 1977 ）では、各帯域毎の信号の大
きさをもとに、ビット割当を行っている。また、例えば
文献「臨界帯域符号化器 −聴覚システムの知覚の要求
に関するディジタル符号化」（"The critical band cod
er --digital encoding of the perceptual requireme
nts of the auditory system", M.A.Kransner MIT, ICA
SSP 1980）では、聴覚マスキングを利用することで、各
帯域毎に必要な信号対雑音比を得て固定的なビット割当
を行う手法及び装置が述べられている。The following two methods and apparatuses are known as the above-mentioned bit allocation method and an apparatus therefor. For example,
Reference "Adaptive Transf.
ormCoding of Speech Signals ", IEEE Transactions of
Accoustics, Speech, and Signal Processing, vol.ASS
P-25, No.4, August 1977) allocates bits based on the signal size of each band. In addition, for example, the document "Critical Band Coding-Digital Coding for Perceptual Requirements of Auditory System"("The critical band cod
er --digital encoding of the perceptual requireme
nts of the auditory system ", MAKransner MIT, ICA
SSP 1980) describes a method and a device for performing fixed bit allocation by obtaining a necessary signal-to-noise ratio for each band by using auditory masking.

【０００７】[0007]

【発明が解決しようとする課題】ところで、上述した従
来の高能率符号化手法及び装置においては、倍音関係を
持った、例えば楽器の再生音等のディジタルオーディオ
信号を符号化する場合、聴感上重要な倍音成分を含んだ
量子化ブロックに対してビット配分の集中度を向上させ
ることが困難である。そのため入力信号に対しより適応
的なビット配分を行うことができないことになり、特に
例えばビットレートが低くく使用可能なビットが少ない
場合には、聴感上良好な音質を得ることができない。By the way, in the above-mentioned conventional high-efficiency coding method and device, when a digital audio signal having a harmonic relationship, such as a reproduced sound of a musical instrument, is coded, it is important for hearing. It is difficult to improve the degree of concentration of bit allocation with respect to a quantized block including a large overtone component. Therefore, more adaptive bit allocation cannot be performed for the input signal, and particularly when the bit rate is low and the number of usable bits is small, it is not possible to obtain good sound quality in terms of hearing.

【０００８】本発明は、このような実情に鑑みてなされ
たものであり、聴覚特性をより活かした効率の良い高能
率符号化方法及び装置、並びに伝送媒体の提供を行い、
低ビットレートにおける音質劣化防止、及び同一ビット
レートにおける音質向上を目的とする。The present invention has been made in view of the above circumstances, and provides an efficient high-efficiency encoding method and apparatus and a transmission medium that make better use of auditory characteristics.
The purpose is to prevent deterioration of sound quality at a low bit rate and improve sound quality at the same bit rate.

【０００９】[0009]

【課題を解決するための手段】本発明の高能率符号化方
法及び装置は、上述の目的を達成するために提案された
ものであり、入力信号を複数の周波数帯域成分に分解し
て、時間と周波数に関する複数の２次元ブロック内の信
号成分を得、上記２次元ブロック毎に当該２次元ブロッ
ク内の信号成分の特徴を表す量子化係数を求め、これに
基づいてビット配分量を決定し、上記２次元ブロック毎
に当該２次元ブロック内の信号成分を量子化して情報圧
縮し、上記２次元ブロック毎の圧縮情報を上記２次元ブ
ロック毎の情報圧縮パラメータと共に出力するものであ
り、入力信号に適応したビット配分を行う際には、入力
信号が持つ倍音関係を検出し、倍音成分が含まれる各２
次元ブロックに対してビット配分量の集中度を高めるよ
うにするものである。The high-efficiency coding method and apparatus of the present invention has been proposed in order to achieve the above-mentioned object, and decomposes an input signal into a plurality of frequency band components, And a signal component in a plurality of two-dimensional blocks regarding frequency are obtained, a quantized coefficient representing a feature of the signal component in the two-dimensional block is obtained for each of the two-dimensional blocks, and a bit allocation amount is determined based on the quantized coefficient. For each two-dimensional block, the signal component in the two-dimensional block is quantized and information is compressed, and the compression information for each two-dimensional block is output together with the information compression parameter for each two-dimensional block. When performing the adaptive bit allocation, the harmonic relationship of the input signal is detected, and each 2
The degree of concentration of the bit allocation amount is increased with respect to the dimensional block.

【００１０】ここで、上記入力信号に適応したビット配
分を行う際には、入力信号の周波数軸上のスペクトラム
情報や、２次元ブロック内の信号成分を表す量子化係数
に基づいて、上記入力信号が持つ倍音関係を検出する。
また、上記倍音関係を検出する際には、入力信号の全て
又は一部の周波数帯域のスペクトラム情報及び／又は２
次元ブロック内の信号成分を現す量子化情報に基づい
て、低次の倍音成分及び／又は基音と第一倍音の倍音成
分のみを検出し、これらの倍音成分に基づいて他の倍音
成分の周波数帯域を算出し、全周波数帯域が持つ倍音関
係を検出することができる。また、入力信号に適応した
ビット配分を行う際には、全て又は一部の倍音成分が含
まれる２次元ブロックや、偶数次の倍音成分が含まれる
全て又は一部の２次元ブロック、奇数次の倍音成分が含
まれる全て又は一部の２次元ブロックに対して、ビット
配分の集中度を高めるようにする。さらに、上記入力信
号に適応したビット配分量の決定の際に、既定ビットレ
ートに対してビット配分量が少ないときには、倍音成分
が含まれる全て又は一部の２次元ブロックに対して優先
的にビットを追加してビットの端数調整行ったり、偶数
倍音成分が含まれる全て又は一部の２次元ブロックに対
して優先的にビットを追加してビットの端数調整を行う
ようにする。逆に、上記入力信号に適応したビット配分
量の決定の際に、既定ビットレートに対してビット配分
量が多いときには、倍音成分が含まれない全て又は一部
の２次元ブロックに対して優先的にビットを削除してビ
ットの端数調整を行ったり、奇数倍音成分が含まれない
全て又は一部の２次元ブロックに対して優先的にビット
を削除してビットの端数調整を行うようにする。Here, when performing bit allocation adapted to the input signal, based on the spectrum information on the frequency axis of the input signal and the quantization coefficient representing the signal component in the two-dimensional block, the input signal is Detects the overtone relationship of.
Further, when detecting the above-mentioned overtone relationship, the spectrum information of all or part of the frequency band of the input signal and / or 2
Based on the quantized information representing the signal component in the dimension block, only the low-order harmonic component and / or the fundamental and the first harmonic overtone component are detected, and the frequency band of the other harmonic components based on these harmonic components. Can be calculated to detect the overtone relationship of all frequency bands. Further, when performing bit allocation adapted to an input signal, a two-dimensional block containing all or some harmonic components, all or some two-dimensional blocks containing even harmonic components, and odd-numbered The degree of concentration of bit allocation is increased for all or some of the two-dimensional blocks containing overtone components. Further, when the bit allocation amount adapted to the input signal is determined, when the bit allocation amount is small with respect to the predetermined bit rate, the bit is preferentially applied to all or some of the two-dimensional blocks including the overtone component. Is added to adjust the fraction of bits, or bits are preferentially added to all or some of the two-dimensional blocks including even harmonic components to adjust the fraction of bits. On the contrary, when determining the bit allocation amount adapted to the input signal, if the bit allocation amount is large with respect to the predetermined bit rate, priority is given to all or some of the two-dimensional blocks that do not include the overtone component. The bit is adjusted by deleting the bit, or the bit is adjusted by deleting the bit preferentially for all or some of the two-dimensional blocks that do not include the odd harmonic component.

【００１１】また、本発明の伝送媒体は、伝送メディア
のみならず記録メディアをも含み、本発明の高能率符号
化方法又は装置によって圧縮符号化された圧縮情報と情
報圧縮パラメータとを伝送又は記録するものである。Further, the transmission medium of the present invention includes not only the transmission medium but also the recording medium, and transmits or records the compressed information and the information compression parameter compression-encoded by the high efficiency encoding method or apparatus of the present invention. To do.

【００１２】[0012]

【作用】本発明によれば、ビット配分を行う際、入力信
号が持つ倍音関係を検出し、その各倍音成分を含む量子
化のための２次元ブロックに対するビット配分の集中度
を向上させることにより、聴感特性をより活かした効率
の良いビット配分を行うことができ、ブロックフローテ
ィング効率を落とすことのない、効率的な圧縮を行うこ
とが可能となる。According to the present invention, when bit allocation is performed, the harmonic relationship of the input signal is detected, and the degree of concentration of bit allocation to the two-dimensional block for quantization including each harmonic component is improved. , It is possible to perform efficient bit allocation by making better use of the auditory characteristics, and it is possible to perform efficient compression without lowering the block floating efficiency.

【００１３】以上のことにより、同一のビットレートに
おいてより良好な音質を得ることが可能となる。また、
同等の音質を得るために、より低いビットレートで実施
可能となる。From the above, it becomes possible to obtain better sound quality at the same bit rate. Also,
It can be performed at a lower bit rate to obtain the same sound quality.

【００１４】[0014]

【実施例】以下、図面を参照し、本発明の実施例につい
て説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００１５】本実施例では、オーディオＰＣＭ信号等の
入力ディジタル信号を、帯域分割符号化（ＳＢＣ）、適
応変換符号化（ＡＴＣ）、及び適応ビット配分（ＡＰＣ
−ＡＢ）の各技術を用いて高能率符号化する。この技術
について、図１を参照しながら説明する。In this embodiment, an input digital signal such as an audio PCM signal is subjected to band division coding (SBC), adaptive transform coding (ATC), and adaptive bit allocation (APC).
-High efficiency coding using each technique of AB). This technique will be described with reference to FIG.

【００１６】図１に示す本実施例の具体的な高能率符号
化装置では、入力ディジタル信号をフィルタなどにより
複数の周波数帯域に分割すると共に、各周波数帯域毎に
直交変換を行って、得られた周波数軸のスペクトルデー
タを、後述する人間の聴覚特性を考慮したいわゆる臨界
帯域幅（クリティカルバンド）毎に適応的にビット配分
して符号化している。この時、高域では臨界帯域幅を更
に分割した帯域を用いる。もちろん、上記フィルタなど
による非ブロッキングの周波数分割幅は、等分割幅とし
てもよい。In the concrete high-efficiency coding apparatus of the present embodiment shown in FIG. 1, the input digital signal is obtained by dividing the input digital signal into a plurality of frequency bands by a filter and performing orthogonal transformation for each frequency band. The frequency axis spectrum data is coded by adaptively allocating bits for each so-called critical band (critical band) in consideration of human auditory characteristics described later. At this time, a band obtained by further dividing the critical bandwidth is used in the high band. Of course, the non-blocking frequency division width by the filter or the like may be equal division width.

【００１７】さらに、本発明実施例においては、直交変
換の前に入力信号に応じて適応的にブロックサイズ（直
交変換のブロック長）を変化させると共に、クリティカ
ルバンド単位若しくは高域では臨界帯域幅（クリティカ
ルバンド）を更に細分化した帯域の信号で構成されるブ
ロックでフローティング処理を行っている。なお、この
クリティカルバンドとは、人間の聴覚特性を考慮して分
割された周波数帯域であり、ある純音の周波数近傍の同
じ強さの狭帯域バンドノイズによって当該純音がマスク
されるときのそのノイズの持つ帯域のことである。この
クリティカルバンドは、高域ほど帯域幅が広くなってお
り、例えば０〜２０ｋＨｚの全周波数帯域は例えば２５
のクリティカルバンドに分割されている。Further, in the embodiment of the present invention, the block size (block length of the orthogonal transform) is adaptively changed according to the input signal before the orthogonal transform, and the critical bandwidth (in the critical band unit or in the high band) is changed. The floating process is performed in a block composed of signals in a band in which the critical band) is further subdivided. The critical band is a frequency band divided in consideration of human auditory characteristics, and when the pure tone is masked by a narrow band noise of the same strength in the vicinity of the frequency of a pure tone, the noise of that pure tone is masked. It is the bandwidth that you have. The critical band has a wider bandwidth in a higher frequency range, and for example, the entire frequency band of 0 to 20 kHz is 25, for example.
Is divided into critical bands.

【００１８】すなわち、図１において、入力端子１０に
は例えば０〜２２ｋＨｚのオーディオＰＣＭ信号が供給
されている。この入力信号は、例えばいわゆるＱＭＦな
どの帯域分割フィルタ１１により０〜１１ｋＨｚ帯域と
１１ｋ〜２２ｋＨｚ帯域とに分割され、０〜１１ｋＨｚ
帯域の信号は同じくいわゆるＱＭＦ等の帯域分割フィル
タ１２により０〜５．５ｋＨｚ帯域と５．５ｋ〜１１ｋ
Ｈｚ帯域とに分割される。That is, in FIG. 1, an audio PCM signal of 0 to 22 kHz, for example, is supplied to the input terminal 10. This input signal is divided into a band of 0 to 11 kHz and a band of 11 to 22 kHz by a band dividing filter 11 such as a so-called QMF, and 0 to 11 kHz.
Similarly, the band signals are 0 to 5.5 kHz band and 5.5k to 11k by the band division filter 12 such as so-called QMF.
And the Hz band.

【００１９】上記帯域分割フィルタ１１からの１１ｋ〜
２２ｋＨｚ帯域の信号は、直交変換回路の一例であるＭ
ＤＣＴ（Modified Discrete Cosine Transform）回路１
３に送られ、上記帯域分割フィルタ１２からの５．５ｋ
〜１１ｋＨｚ帯域の信号はＭＤＣＴ回路１４に送られ、
上記帯域分割フィルタ１２からの０〜５．５ｋＨｚ帯域
の信号はＭＤＣＴ回路１５に送られることにより、それ
ぞれＭＤＣＴ処理される。なお、各ＭＤＣＴ回路１３〜
１５では、各帯域毎に設けたブロック決定回路１９、２
０、２１により決定されたブロックサイズ（直交変換ブ
ロックサイズ）に基づいてＭＤＣＴ処理がなされる。11k from the band division filter 11
The signal in the 22 kHz band is M, which is an example of an orthogonal transformation circuit.
DCT (Modified Discrete Cosine Transform) circuit 1
5.5k from the band division filter 12 described above.
The signal in the ~ 11 kHz band is sent to the MDCT circuit 14,
The signals in the 0 to 5.5 kHz band from the band division filter 12 are sent to the MDCT circuit 15 to be subjected to MDCT processing. Each MDCT circuit 13-
In block 15, block decision circuits 19 and 2 provided for each band
MDCT processing is performed based on the block size (orthogonal transform block size) determined by 0 and 21.

【００２０】ここで、上記ブロック決定回路１９〜２１
により決定される各ＭＤＣＴ回路１３〜１５でのブロッ
クサイズの具体例を図２の（Ａ）及び（Ｂ）に示す。な
お、図２の（Ａ）には直交変換ブロックサイズが長い場
合（ロングモードにおける直交変換ブロックサイズ）
を、図２の（Ｂ）には直交変換ブロックサイズが短い場
合（ショートモードにおける直交変換ブロックサイズ）
を示している。Here, the block decision circuits 19 to 21.
Specific examples of the block sizes in the MDCT circuits 13 to 15 determined by are shown in (A) and (B) of FIG. In addition, in FIG. 2A, when the orthogonal transform block size is long (orthogonal transform block size in the long mode).
2B shows a case where the orthogonal transform block size is short (orthogonal transform block size in short mode).
Is shown.

【００２１】この図２の具体例においては、３つのフィ
ルタ出力は、それぞれ２つの直交変換ブロックサイズを
持つ。すなわち、低域側の０〜５．５ｋＨｚ帯域の信号
及び中域の５．５ｋ〜１１ｋＨｚ帯域の信号に対して
は、長いブロック長の場合（図２の（Ａ））は１ブロッ
ク内のサンプル数を１２８サンプルとし、短いブロック
が選ばれた場合（図２の（Ｂ））には１ブロック内のサ
ンプル数を３２サンプル毎のブロックとしている。これ
に対して高域側の１１ｋ〜２２ｋＨｚ帯域の信号に対し
ては、長いブロック長の場合（図２の（Ａ））は１ブロ
ック内のサンプル数を２５６サンプルとし、短いブロッ
クが選ばれた場合（図２の（Ｂ））には１ブロック内の
サンプル数を３２サンプル毎のブロックとしている。こ
のようにして短いブロックが選ばれた場合には各帯域の
直交変換ブロックのサンプル数を同じとして高域程時間
分解能を上げ、なおかつブロック化に使用するウインド
ウの種類を減らしている。In the specific example of FIG. 2, the three filter outputs each have two orthogonal transform block sizes. That is, for a signal in the low frequency band of 0 to 5.5 kHz and a signal in the middle frequency band of 5.5 to 11 kHz, if the block length is long ((A) of FIG. 2), the samples in one block are sampled. The number of samples is 128, and when a short block is selected ((B) of FIG. 2), the number of samples in one block is a block for every 32 samples. On the other hand, for a signal in the 11 kHz to 22 kHz band on the high frequency side, in the case of a long block length ((A) of FIG. 2), the number of samples in one block is set to 256 samples, and a short block is selected. In this case ((B) of FIG. 2), the number of samples in one block is a block for every 32 samples. When a short block is selected in this way, the number of samples of orthogonal transform blocks in each band is set to be the same, the time resolution is increased in the higher frequency range, and the types of windows used for blocking are reduced.

【００２２】再び図１において、上記ブロック決定回路
１９〜２１で決定されたブロックサイズを示す情報は、
後述の適応ビット割当符号化回路１６、１７、１８に送
られると共に、出力端子２３、２５、２７から出力され
る。Referring again to FIG. 1, the information indicating the block size determined by the block determination circuits 19 to 21 is:
It is sent to adaptive bit allocation coding circuits 16, 17, and 18 which will be described later, and is also output from output terminals 23, 25, and 27.

【００２３】各ＭＤＣＴ回路１３〜１５にてＭＤＣＴ処
理されて得られた周波数領域のスペクトルデータあるい
はＭＤＣＴ係数データは、いわゆる臨界帯域（クリティ
カルバンド）または高域では更にクリティカルバンドを
分割した帯域毎にまとめられて、適応ビット割当符号化
回路１６〜１８に送られている。The spectrum data in the frequency domain or the MDCT coefficient data obtained by the MDCT processing in each of the MDCT circuits 13 to 15 is summarized by a so-called critical band or a band obtained by further dividing the critical band in the high band. And transmitted to the adaptive bit allocation encoding circuits 16-18.

【００２４】適応ビット割当符号化回路１６〜１８で
は、上記ブロックサイズの情報、及び臨界帯域（クリテ
ィカルバンド）または高域では更にクリティカルバンド
を分割した帯域毎に割り当てられたビット数に応じて、
各スペクトルデータ（あるいはＭＤＣＴ係数データ）を
再量子化（正規化して量子化）するようにしている。In the adaptive bit allocation encoding circuits 16 to 18, according to the information of the block size and the number of bits allocated for each band obtained by dividing the critical band (critical band) or the critical band in the high band,
Each spectrum data (or MDCT coefficient data) is requantized (normalized and quantized).

【００２５】これら各適応ビット割当符号化回路１６〜
１８によって符号化されたデータは、出力端子２２、２
４、２６を介して取り出される。また、当該適応ビット
割当符号化回路１６〜１８では、どのような信号の大き
さに関する正規化がなされたかを示すスケールファクタ
と、どのようなビット長で量子化がされたかを示すビッ
ト長情報も求めており、これらも同時に出力端子２２、
２４、２６から出力される。Each of these adaptive bit allocation encoding circuits 16 to
The data encoded by 18 is output to the output terminals 22, 2
It is taken out via 4, 26. Further, the adaptive bit allocation encoding circuits 16 to 18 also have a scale factor indicating what kind of signal magnitude is normalized and bit length information indicating what bit length is quantized. I am looking for these, and these are also output terminals 22,
It is output from 24 and 26.

【００２６】また、図１における各ＭＤＣＴ回路１３〜
１５の出力からは、上記臨界帯域（クリティカルバン
ド）または高域では更にクリティカルバンドを分割した
帯域毎のエネルギを、例えば当該バンド内での各振幅値
の２乗平均の平方根を計算すること等により求められ
る。もちろん、上記スケールファクタそのものを以後の
ビット配分の為に用いるようにしてもよい。この場合に
は新たなエネルギ計算の演算が不要となるため、ハード
規模の節約となる。また、各バンド毎のエネルギの代わ
りに、振幅値のピーク値、平均値等を用いることも可能
である。Further, the MDCT circuits 13 to 13 in FIG.
From the output of 15, by calculating the energy of each band obtained by further dividing the critical band in the above critical band (critical band) or high band, for example, calculating the square root of the root mean square of each amplitude value in the band. Desired. Of course, the scale factor itself may be used for subsequent bit allocation. In this case, the calculation of new energy is not required, and the hardware scale is saved. Further, instead of the energy for each band, it is also possible to use the peak value, the average value, etc. of the amplitude values.

【００２７】次に、適応ビット割当符号化回路１６〜１
８において、上記ビット配分の具体的な手法を説明す
る。Next, the adaptive bit allocation encoding circuits 16 to 1
8, a concrete method of the bit allocation will be described.

【００２８】この場合の適応ビット割当符号化回路の動
作を図３で説明すると、ＭＤＣＴ係数の大きさが各ブロ
ック毎に求められ、そのＭＤＣＴ係数が入力端子８０１
に供給される。当該入力端子８０１に供給されたＭＤＣ
Ｔ係数は、帯域毎のエネルギ算出回路８０３及びスペク
トルのなめらかさ算出回路８０８に与えられる。帯域毎
のエネルギ算出回路８０３では、クリティカルバンドま
たは高域においてはクリティカルバンドを更に再分割し
たそれぞれの帯域に関する信号エネルギを算出する。帯
域毎のエネルギ算出回路８０３で算出されたそれぞれの
帯域に関するエネルギは、エネルギ依存ビット配分回路
８０４、スペクトルのなめらかさ算出回路８０８、及び
本発明における倍音関係検出手段である倍音成分検出回
路８１３に供給される。The operation of the adaptive bit allocation encoding circuit in this case will be described with reference to FIG. 3. The magnitude of the MDCT coefficient is obtained for each block, and the MDCT coefficient is input terminal 801.
Is supplied to. MDC supplied to the input terminal 801
The T coefficient is supplied to the energy calculation circuit 803 and the spectrum smoothness calculation circuit 808 for each band. The energy calculation circuit 803 for each band calculates the signal energy for each band obtained by further dividing the critical band or the critical band in the high band. The energy related to each band calculated by the energy calculation circuit 803 for each band is supplied to the energy-dependent bit allocation circuit 804, the spectrum smoothness calculation circuit 808, and the overtone component detection circuit 813 which is the overtone relationship detection means in the present invention. To be done.

【００２９】倍音成分検出回路８１３では、帯域毎エネ
ルギ算出回路８０３からの帯域毎のエネルギを用いて入
力信号の倍音関係にある周波数帯域の検出を行う。The overtone component detection circuit 813 detects the frequency band related to the overtone of the input signal using the energy for each band from the energy calculation circuit 803 for each band.

【００３０】図４を用いて倍音について説明する。倍音
とは、周期性を有する信号、例えば楽音等が、図中のＦ
０，２Ｆ０，３Ｆ０，４Ｆ０，・・・といった整数倍の
関係を持つ周波数成分のことである。Ｆ０は基音又はピ
ッチ、２Ｆ０は第二倍音、３Ｆ０は第三倍音、４Ｆ０は
第四倍音と呼ばれている。つまり基音に対し、ｎ倍の周
波数を持つ成分を第ｎ倍音と呼ぶ。また、これらの倍音
は、基音Ｆ０の周波数に対し、偶数倍（２ｎ）又は奇数
倍（２ｎ＋１）の成分に分別され（ｎ＝１，２，３，・
・・）、それぞれ偶数次倍音、奇数次倍音と呼ばれる。
例えば、楽器のクラリネットの演奏音は、奇数次倍音に
比べ偶数次倍音の強度が著しく小さいという特徴を持
つ。Overtones will be described with reference to FIG. An overtone is a signal having periodicity, such as a musical sound, which is indicated by F in the figure.
It is a frequency component having an integral multiple relationship such as 0, 2F0, 3F0, 4F0, .... F0 is called the fundamental tone or pitch, 2F0 is called the second overtone, 3F0 is called the third overtone, and 4F0 is called the fourth overtone. That is, a component having a frequency n times that of the fundamental tone is called an nth harmonic. Further, these overtones are separated into even-numbered (2n) or odd-numbered (2n + 1) components (n = 1, 2, 3, ...) Of the frequency of the fundamental tone F0.
・・), And they are called even harmonics and odd harmonics, respectively.
For example, the performance sound of a clarinet of a musical instrument is characterized in that the intensity of even-order overtones is significantly smaller than that of odd-order overtones.

【００３１】倍音成分検出回路８１３では、入力信号の
倍音成分を検出する方法として、例えば、隣接するブロ
ックのブロックフローティング係数の差を指標として使
う。ブロックフローティング係数の差が大きくなるブロ
ックの周波数帯域が整数倍の関係にあれば、そのブロッ
クに入力信号の倍音成分が含まれている可能性が高い。
なおこの手法では、ＭＤＣＴ係数ではなくブロックフロ
ーティング係数の差を求めるので、計算量が少なくな
り、ハードウェア規模を小さくすることが可能である。The overtone component detection circuit 813 uses the difference between the block floating coefficients of adjacent blocks as an index as a method of detecting the overtone component of the input signal. If the frequency band of a block in which the difference between the block floating coefficients is large is in an integral multiple relationship, it is highly likely that the block contains a harmonic component of the input signal.
In this method, the difference between the block floating coefficients is obtained instead of the MDCT coefficient, so the amount of calculation is reduced and the hardware scale can be reduced.

【００３２】また、ブロックフローティング係数の代わ
りに、ＭＤＣＴ係数を直接使って倍音成分を検出するこ
とも可能である。この手法を用いれば、より精緻な検出
が可能となる。特に、１つのブロック内に複数の倍音成
分が含まれてしまうような場合には有効な手段である。It is also possible to detect the overtone component by directly using the MDCT coefficient instead of the block floating coefficient. If this method is used, more precise detection becomes possible. In particular, this is an effective means when one block contains a plurality of overtone components.

【００３３】入力信号の倍音成分を検出する別の方法と
して、基音と第一倍音だけを求め、これらの周波数関係
からすべての倍音成分の周波数を予想する方法も考えら
れる。これは、倍音の周波数は整数倍の関係を持つとい
う、上記した、倍音が持つ性質を利用している。この方
法を用いることにより、全帯域の隣接する各ブロックフ
ローティング係数又はＭＤＣＴ係数の差を調べなくて
も、基音及び第一倍音が含まれる帯域だけを調べるだけ
で済むので、計算量が少なくなり、ハードウェア規模を
小さくすることが可能である。As another method for detecting the overtone component of the input signal, a method of obtaining only the fundamental tone and the first overtone and predicting the frequencies of all the overtone components based on the frequency relationship between them can be considered. This utilizes the above-mentioned property of overtones that the frequencies of overtones have an integer multiple relationship. By using this method, it is only necessary to check the band containing the fundamental and the first overtone without checking the difference between adjacent block floating coefficients or MDCT coefficients in all bands, so the amount of calculation is reduced, It is possible to reduce the hardware scale.

【００３４】倍音成分検出回路８１３で検出された、倍
音成分が含まれる帯域に関する情報は、エネルギ依存の
ビット配分回路８０４及びビット端数調整回路８１４に
供給される。Information on the band containing the overtone component detected by the overtone component detection circuit 813 is supplied to the energy-dependent bit distribution circuit 804 and the bit fraction adjustment circuit 814.

【００３５】エネルギ依存ビット配分回路８０４では、
帯域毎エネルギ算出回路８０３からのエネルギ、使用可
能総ビット発生回路８０２からの使用可能総ビット、及
び倍音成分検出回路８１３からの倍音成分が含まれる帯
域を用いて、入力信号のエネルギに依存したビット配分
を、本実施例では１２８ｋｂｐｓの内のある割合を用い
て行う。入力信号のトーナリティが高い程、すなわち入
力信号のスペクトラムの凹凸が大きい程、このビット量
が上記１２８ｋｂｐｓに占める割合が増加する。なお、
入力信号の凹凸を検出するには、隣接するブロックのブ
ロックフローティング係数の差の絶対値の和を指標とし
て使う。そして、求められた使用可能なビット量につ
き、各帯域のエネルギの対数値に比例したビット配分を
行う。その際、倍音成分が含まれる帯域に対して、他の
帯域より多くのビットの配分を行う。図５を用いて説明
すると、図中ＳＰは入力信号のスペクトラム、Ｂ１〜Ｂ
８は量子化ブロックを示す。この図の入力信号の倍音成
分は、基音がブロックＢ２、第二倍音がブロックＢ４、
第三倍音がブロックＢ６、第四倍音がブロックＢ８にそ
れぞれ含まれている。図中Ｌ１は、倍音関係を考慮せず
にビット配分を行った場合の各ブロックにおける量子化
雑音のレベルを表し、図中Ｌ２は、倍音関係を考慮して
ビット配分を行った場合の各ブロックにおける量子化雑
音のレベルを表す。図中のＬ１とＬ２を比較すると分か
るように、倍音成分を考慮したビット配分を行った場合
には、倍音成分が含まれているブロック内の量子化雑音
レベルが低下し、聴感上良好な音質を得ることが可能で
ある。In the energy-dependent bit allocation circuit 804,
Bits depending on the energy of the input signal by using the band including the energy from the band-by-band energy calculation circuit 803, the total usable bit from the total usable bit generation circuit 802, and the overtone component from the overtone component detection circuit 813. In this embodiment, the distribution is performed using a certain ratio of 128 kbps. The higher the tonality of the input signal, that is, the larger the unevenness of the spectrum of the input signal, the more the ratio of this bit amount to the 128 kbps increases. In addition,
To detect the unevenness of the input signal, the sum of the absolute values of the differences between the block floating coefficients of adjacent blocks is used as an index. Then, for the obtained usable bit amount, bit distribution proportional to the logarithmic value of the energy of each band is performed. At that time, more bits are allocated to the band containing the overtone component than to the other bands. Explaining with reference to FIG. 5, SP in the figure is the spectrum of the input signal, B1 to B
Reference numeral 8 indicates a quantization block. As for the overtone component of the input signal in this figure, the fundamental tone is block B2, the second overtone is block B4,
The third overtone is included in the block B6, and the fourth overtone is included in the block B8. In the figure, L1 represents the level of the quantization noise in each block when bit allocation is performed without considering the overtone relationship, and L2 in the figure is each block when bit allocation is performed in consideration of the overtone relationship. Represents the level of quantization noise at. As can be seen by comparing L1 and L2 in the figure, when bit distribution is performed in consideration of the overtone component, the quantization noise level in the block including the overtone component is lowered, and the sound quality is good for hearing. It is possible to obtain

【００３６】また、倍音成分を考慮してビット配分を行
う際、偶数次倍音と奇数次倍音に分別してビット配分を
行うことも可能である。上述したように倍音は、偶数次
倍音と奇数次倍音という成分に分別される。種々の入力
信号の中には、例えば楽器のクラリネットの演奏音のよ
うに、奇数次倍音の方が偶数次倍音と比較して強度が大
きい性質を持つものがある。このような入力信号に対し
てビット配分する場合、偶数次倍音より奇数次倍音が含
まれるブロックに対しより多くのビットが配分されるよ
うに設定することにより、聴感上良好な音質を得ること
が可能な場合がある。また逆に、偶数次倍音の方が奇数
次倍音と比較して強度が大きい性質を持つ入力信号もあ
る。このような入力信号に対し、奇数次倍音を含むブロ
ックよりも偶数次倍音を含むブロックに対しより多くの
ビットを配分することにより、聴感上良好な音質を得る
ことが可能な場合もある。In addition, when bit distribution is performed in consideration of harmonic components, it is possible to divide into even-order harmonic overtones and odd-order harmonic overtones. As described above, the overtones are divided into even-order overtones and odd-order overtones. Among various input signals, for example, there is a property that the odd harmonics have a greater strength than the even harmonics, such as the playing sound of a clarinet of a musical instrument. When bits are distributed to such an input signal, by setting so that more bits are distributed to a block including odd-order overtones than even-order harmonics, it is possible to obtain a good sound quality in terms of hearing. May be possible. On the other hand, on the contrary, there is an input signal in which the even-numbered overtone has a greater strength than the odd-numbered overtone. In such an input signal, it may be possible to obtain a good sound quality by allocating more bits to a block including an even-order harmonic than a block including an odd-order harmonic.

【００３７】また、倍音成分検出回路８１３によって検
出した、各倍音が含まれる全ブロックに対してビット配
分の集中度を向上させることが望ましいが、例えば、基
音成分及び低次の倍音成分が含まれるブロックのみ、及
び又は偶数次倍音のみ又は奇数次倍音のみなど、一部の
ブロックだけに対して適用することによっても聴感上良
好な音質を得ることが可能である。Further, it is desirable to improve the degree of concentration of bit distribution with respect to all blocks containing each overtone detected by the overtone component detecting circuit 813. For example, a fundamental component and a lower harmonic component are included. It is also possible to obtain good sound quality by applying it to only some blocks such as only blocks and / or only even harmonics or odd harmonics.

【００３８】聴覚許容雑音レベルに依存したビット配分
算出回路８０５は、まず上記クリティカルバンド毎に分
割されたスペクトルデータに基づき、いわゆるマスキン
グ効果等を考慮した各クリティカルバンド毎の許容ノイ
ズ量を求め、次に聴覚許容雑音スペクトルを与えるよう
に使用可能総ビットからエネルギ依存ビットを引いたビ
ット分が配分される。The bit allocation calculation circuit 805, which depends on the permissible noise level for hearing, first determines the permissible noise amount for each critical band in consideration of so-called masking effect based on the spectrum data divided for each of the critical bands. Bits obtained by subtracting energy-dependent bits from the total available bits are distributed so as to give a perceptible noise spectrum to the.

【００３９】このようにして求められたエネルギ依存の
ビット数と聴覚許容雑音レベルに依存したビット数は加
算器８０６において加算され、ビット端数調整回路８１
４に供給される。The energy-dependent bit number and the hearing-acceptable noise level-dependent bit number thus obtained are added in the adder 806, and the bit fraction adjustment circuit 81 is added.
4 is supplied.

【００４０】ビット端数調整回路８１４では、加算器８
０６からのビット、使用可能総ビット発生回路８０２か
らの使用可能総ビット、及び倍音成分検出回路８１３か
らの倍音成分が含まれる帯域を用いて、本実施例では１
２８ｋｂｐｓのビットレートに合わせるようにビットの
端数調整を行う。In the bit fraction adjusting circuit 814, the adder 8
0 in this embodiment, the total available bit from the available total bit generation circuit 802, and the band containing the harmonic component from the harmonic component detection circuit 813.
Fractional adjustment of bits is performed to match the bit rate of 28 kbps.

【００４１】例えば、加算器８０６の出力が１２８ｋｂ
ｐｓを越えるビット数になっているような場合には、聴
感上最も影響を及ぼさない帯域、例えば高域側のブロッ
クから低域側に向かって順次１ビットずつ減少させる方
法、又、高域側のブロックから配分ビット数を０にす
る、つまり帯域制限を行う方法等がある。本実施例で
は、倍音成分が含まれていないブロックに対し優先的に
高域側から低域側に向かって１ビットずつ減少させる方
法を用い、１２８ｋｂｐｓと同等のビットレートになる
ように端数調整を行う。図５を用いて説明すると、例え
ば、１ビット分だけ１２８ｋｂｐｓを越えるビット数に
なっていると仮定する。まず最初に最も高域側のブロッ
クＢ９に着目すると、このブロックＢ９は倍音成分を含
んでいるので、ビットの削減は行わない。次に、隣接す
る低域側のブロックＢ８に着目すると、このブロックＢ
８は倍音成分を含んでいないので、１ビット分の削減を
行う。この時点で１２８ｋｂｐｓのビットレートと同等
のビット数となり端数調整は完了する。なお、例えば高
域側から低域側へ順次ビット数を削減していき、最も低
い周波数成分を含むブロックＢ１まで到達した時、さら
にビット数削減の必要性がある場合には、再度最も高い
周波数成分を含むブロックＢ９に戻り、同様の動作を繰
り返し行う。なおこのようにビットが大幅に不足してい
る場合など、倍音成分が含まれるブロックに対しビット
を削減する可能性もある。また、倍音を奇数次倍音と偶
数次倍音とに分別し、全て又は一部の奇数次倍音又は偶
数次倍音が含まれていないブロックに対し優先的に高域
側から１ビットずつ減少させる方法等も考えられる。For example, the output of the adder 806 is 128 kb
In the case where the number of bits exceeds ps, the band that has the least effect on hearing, for example, a method of sequentially decreasing one bit from the block on the high frequency side toward the low frequency side, or the high frequency side There is a method of setting the number of allocated bits to 0 from the block, that is, performing band limitation. In the present embodiment, a method of preferentially decreasing each block from the high frequency side to the low frequency side by 1 bit is used for the block not including the overtone component, and the fraction adjustment is performed so that the bit rate becomes equal to 128 kbps. To do. Explaining with reference to FIG. 5, it is assumed that the number of bits exceeds 128 kbps by one bit. First, focusing on the block B9 on the highest frequency side, since this block B9 contains harmonic components, no bit reduction is performed. Next, focusing attention on the adjacent low-frequency block B8, this block B8
Since 8 does not include the overtone component, reduction by 1 bit is performed. At this point, the bit number becomes equal to the bit rate of 128 kbps, and the fraction adjustment is completed. Note that, for example, when the number of bits is sequentially reduced from the high frequency side to the low frequency side, and when the block B1 including the lowest frequency component is reached, it is necessary to further reduce the number of bits. Returning to the block B9 including the component, the same operation is repeated. It should be noted that there is a possibility that the number of bits may be reduced for a block including an overtone component, such as when the number of bits is significantly insufficient. In addition, a method of classifying overtones into odd-order overtones and even-order overtones, and preferentially decreasing one or more bits from the high frequency side for all or some of the blocks that do not include odd-order overtones or even-order overtones Can also be considered.

【００４２】逆に、例えば加算器８０６の出力が１２８
ｋｂｐｓを下回るビット数の場合には、聴感上最も影響
を及ぼす帯域、例えば低域側のブロックから高域側に向
かって順次１ビットずつ増加させる方法等がある。本実
施例では、倍音成分が含まれているブロックに対し優先
的に低域側から１ビットずつ増加させる方法を用い、１
２８ｋｂｐｓと同じビット数に調整する。図５を用いて
説明すると、例えば、１ビット分だけ１２８ｋｂｐｓを
越えるビット数になっていると仮定する。まず最初に最
も低域側のブロックＢ１に着目すると、このブロックＢ
１は倍音成分を含んでいないので、ビット数の増加は行
わない。次に、隣接する高域側のブロックＢ２に着目す
ると、このブロックＢ２も倍音成分を含んでいないの
で、ビット数の増加は行わない。次に、隣接する高域側
んブロックＢ３に着目すると、このブロックＢ３は倍音
成分（基音）を含んでいるので、１ビット分の増加を行
う。この時点で１２８ｋｂｐｓのビットレートと同等の
ビット数となり端数調整は完了する。なお、例えば低域
側から高域側へ順次ビット数を増加させていき、最も高
い周波数帯域を含むブロックＢ９まで到達した時、さら
にビット数増加の必要性がある場合には、再度最も低い
周波数帯域を含むブロックＢ１に戻り、同様の動作を繰
り返し行う。なおこのようにビットが大幅に余っている
場合など、倍音成分が含まれないブロックに対してもビ
ット数を増加する可能性もある。また、倍音を奇数次倍
音と偶数次倍音とに分別し、全て又は一部の奇数次倍音
又は偶数次倍音が含まれているブロックに対し優先的に
低域側から１ビットずつ増加させる方法等も考えられ
る。On the contrary, for example, the output of the adder 806 is 128
When the number of bits is less than kbps, there is a method of sequentially increasing the bit by 1 bit from a block on the low frequency side toward a high frequency side, for example, a band that most affects the hearing. In the present embodiment, a method of preferentially increasing by 1 bit from the low frequency side for a block containing a harmonic component is used.
Adjust to the same number of bits as 28 kbps. Explaining with reference to FIG. 5, it is assumed that the number of bits exceeds 128 kbps by one bit. First, focusing on the block B1 on the lowest frequency side, this block B1
Since 1 does not include a harmonic component, the number of bits is not increased. Next, focusing on the adjacent block B2 on the high frequency side, since this block B2 also does not include a harmonic component, the number of bits is not increased. Next, focusing on the adjacent high-frequency side block B3, since this block B3 contains a harmonic component (fundamental tone), it is increased by one bit. At this point, the bit number becomes equal to the bit rate of 128 kbps, and the fraction adjustment is completed. Note that, for example, when the number of bits is sequentially increased from the low frequency side to the high frequency side, and when it reaches the block B9 including the highest frequency band, if it is necessary to further increase the number of bits, the lowest frequency is again set. Returning to the block B1 including the band, the same operation is repeated. It should be noted that there is a possibility that the number of bits may be increased even for a block that does not include an overtone component, such as when the number of bits is excessive. In addition, a method of classifying overtones into odd-order harmonic overtones and even-order harmonic overtones, and preferentially increasing by 1 bit from the low-frequency side for all or some of the blocks containing odd-order harmonic overtones or even-order harmonic overtones Can also be considered.

【００４３】この様に倍音関係を考慮するビット端数調
整手法を用いることにより、倍音成分が含まれる帯域の
量子化雑音を減少させることができ、聴感上良好な音質
を得ることが可能となる。特に、加算器８０６の出力が
１２８ｋｂｐｓを越えるビットレートになっている場
合、高域の倍音成分を削除する場合が少なくなり、入力
信号が持つ倍音構造を破壊してしまう可能性が減少する
ため、良好な音質を得ることができる。特に、本実施例
でのビットレート１２８ｋｂｐｓよりも低いビットレー
トを用いる高能率符号化装置において有効に作用する。As described above, by using the fractional bit adjustment method considering the overtone relation, it is possible to reduce the quantization noise in the band including the overtone component, and it is possible to obtain a good sound quality in hearing. In particular, when the output of the adder 806 has a bit rate exceeding 128 kbps, it is less likely that high-frequency overtone components will be deleted, and the possibility of destroying the overtone structure of the input signal is reduced. Good sound quality can be obtained. In particular, it works effectively in a high-efficiency coding apparatus using a bit rate lower than the bit rate of 128 kbps in this embodiment.

【００４４】このようにして１２８ｋｂｐｓのビットレ
ートに調整されたビットは、図１の適応ビット割当符号
化回路１６〜１８によって各クリティカルバンド毎若し
くは高域においてはクリティカルバンドを更に複数帯域
に分割した帯域に割り当てられたビット数に応じて、各
スペクトルデータ（あるいはＭＤＣＴ係数データ）が再
量子化されるようになっている。このようにして符号化
されたデータは、図１の出力端子２２、２４、２６を介
して取り出される。The bits adjusted to the bit rate of 128 kbps in this way are divided into a plurality of bands by dividing the critical band in each critical band or in the high band by the adaptive bit allocation coding circuits 16 to 18 in FIG. Each spectrum data (or MDCT coefficient data) is re-quantized according to the number of bits assigned to. The data encoded in this way is taken out via the output terminals 22, 24 and 26 of FIG.

【００４５】さらに詳しく上記聴覚許容雑音スペクトル
依存のビット配分回路８０５中の聴覚許容雑音スペクト
ル算出回路について説明すると、ＭＤＣＴ回路１３〜１
５で得られたＭＤＣＴ係数が当該ビット配分回路８０５
中の許容雑音スペクトル算出回路に与えられる。The auditory permissible noise spectrum calculating circuit in the bit allocation circuit 805 depending on the permissible auditory noise spectrum will be described in more detail. The MDCT circuits 13 to 1 will be described below.
The MDCT coefficient obtained in 5 is the bit allocation circuit 805.
It is given to the allowable noise spectrum calculation circuit therein.

【００４６】図６は上記許容雑音スペクトル算出回路を
まとめて説明した一具体例の概略構成を示すブロック回
路図である。この図６において、入力端子５２１には、
ＭＤＣＴ回路１３、１４、１５からの周波数領域のスペ
クトルデータが供給されている。FIG. 6 is a block circuit diagram showing a schematic configuration of a specific example in which the allowable noise spectrum calculating circuits are collectively described. In FIG. 6, the input terminal 521 has:
The spectrum data in the frequency domain is supplied from the MDCT circuits 13, 14, and 15.

【００４７】この周波数領域の入力データは、帯域毎の
エネルギ算出回路５２２に送られて、上記クリティカル
バンド（臨界帯域）毎のエネルギが、例えば当該バンド
内での各振幅値２乗の総和を計算すること等により求め
られる。この各バンド毎のエネルギの代わりに、振幅値
のピーク値、平均値等が用いられることもある。このエ
ネルギ算出回路５２２からの出力として、例えば各バン
ドの総和値のスペクトルは、一般にバークスペクトルと
称されている。図７はこのような各クリティカルバンド
毎のバークスペクトルＳＢを示している。ただし、この
図７では、図示を簡略化するため、上記クリティカルバ
ンドのバンド数を１２バンド（Ｂ１〜Ｂ１２）で表現し
ている。The input data in the frequency domain is sent to the energy calculation circuit 522 for each band, and the energy of each critical band (critical band) is calculated, for example, as the sum of the squared amplitude values in the band. It is required by doing. Instead of the energy for each band, a peak value, an average value, etc. of the amplitude value may be used. As an output from the energy calculation circuit 522, for example, the spectrum of the total sum value of each band is generally called a Bark spectrum. FIG. 7 shows the Bark spectrum SB for each such critical band. However, in FIG. 7, in order to simplify the illustration, the number of bands of the critical band is represented by 12 bands (B1 to B12).

【００４８】ここで、上記バークスペクトルＳＢのいわ
ゆるマスキングに於ける影響を考慮するために、該バー
クスペクトルＳＢに所定の重み付け関数を掛けて加算す
るような畳込み（コンボリューション）処理を施す。こ
のため、上記帯域毎のエネルギ算出回路５２２の出力す
なわち該バークスペクトルＳＢの各値は、畳込みフィル
タ回路５２３に送られる。該畳込みフィルタ回路５２３
は、例えば、入力データを順次遅延させる複数の遅延素
子と、これら遅延素子からの出力にフィルタ係数（重み
付け関数）を乗算する複数の乗算器（例えば各バンドに
対応する２５個の乗算器）と、各乗算器出力の総和をと
る総和加算器とから構成されるものである。Here, in order to consider the influence of so-called masking of the Bark spectrum SB, a convolution process is performed such that the Bark spectrum SB is multiplied by a predetermined weighting function and added. Therefore, the output of the energy calculation circuit 522 for each band, that is, each value of the Bark spectrum SB is sent to the convolution filter circuit 523. The convolution filter circuit 523
Is, for example, a plurality of delay elements that sequentially delay input data, and a plurality of multipliers (for example, 25 multipliers corresponding to each band) that multiply outputs from these delay elements by a filter coefficient (weighting function). , A sum total adder that sums the outputs of the respective multipliers.

【００４９】なお、上記マスキングとは、人間の聴覚上
の特性により、ある信号によって他の信号がマスクされ
て聞こえなくなる現象をいうものであり、このマスキン
グ効果には、時間領域のオーディオ信号による時間軸マ
スキング効果と、周波数領域の信号による同時刻マスキ
ング効果とがある。これらのマスキング効果により、マ
スキングされる部分にノイズがあったとしても、このノ
イズは聞こえないことになる。このため、実際のオーデ
ィオ信号では、このマスキングされる範囲内のノイズは
許容可能なノイズとされる。The above-mentioned masking means a phenomenon in which one signal is masked by another signal and becomes inaudible due to human auditory characteristics. This masking effect is caused by a time domain audio signal. There are an axial masking effect and a simultaneous time masking effect by a signal in the frequency domain. Due to these masking effects, even if there is noise in the masked portion, this noise cannot be heard. Therefore, in the actual audio signal, the noise within the masked range is regarded as an acceptable noise.

【００５０】また、上記畳込みフィルタ回路５２３の各
乗算器の乗算係数（フィルタ係数）の一具体例を示す
と、任意のバンドに対応する乗算器Ｍの係数を１とする
とき、乗算器Ｍ−１で係数０．１５を、乗算器Ｍ−２で
係数０．００１９を、乗算器Ｍ−３で係数０．００００
０８６を、乗算器Ｍ＋１で係数０．４を、乗算器Ｍ＋２
で係数０．０６を、乗算器Ｍ＋３で係数０．００７を各
遅延素子の出力に乗算することにより、上記バークスペ
クトルＳＢの畳込み処理が行われる。ただし、Ｍは１〜
２５の任意の整数である。A specific example of the multiplication coefficient (filter coefficient) of each multiplier of the convolution filter circuit 523 will be described. When the coefficient of the multiplier M corresponding to an arbitrary band is 1, the multiplier M −1 gives a coefficient of 0.15, multiplier M-2 gives a coefficient of 0.0019, and multiplier M-3 gives a coefficient of 0.0000.
086, multiplier M + 1 gives a coefficient of 0.4, multiplier M + 2
By multiplying the output of each delay element by a coefficient of 0.06 and a coefficient of 0.007 by a multiplier M + 3, the convolution processing of the Bark spectrum SB is performed. However, M is 1 to
It is an arbitrary integer of 25.

【００５１】次に、上記畳込みフィルタ回路５２３の出
力は引算器５２４に送られる。該引算器５２４は、上記
畳込んだ領域での後述する許容可能なノイズレベルに対
応するレベルαを求めるものである。なお、当該許容可
能なノイズレベル（許容ノイズレベル）に対応するレベ
ルαは、後述するように、逆コンボリューション処理を
行うことによって、クリティカルバンドの各バンド毎の
許容ノイズレベルとなるようなレベルである。Next, the output of the convolution filter circuit 523 is sent to the subtractor 524. The subtractor 524 calculates a level α corresponding to an allowable noise level described later in the convoluted area. The level α corresponding to the permissible noise level (permissible noise level) is a level at which the critical noise band becomes the permissible noise level for each band by performing inverse convolution processing, as described later. is there.

【００５２】ここで、上記引算器５２４には、上記レベ
ルαを求めるるための許容関数（マスキングレベルを表
現する関数）が供給される。この許容関数を増減させる
ことで上記レベルαの制御を行っている。当該許容関数
は、次に説明するような（ｎ−ａｉ）関数発生回路５２
５から供給されているものである。Here, the subtractor 524 is supplied with an allowance function (function expressing a masking level) for obtaining the level α. The level α is controlled by increasing or decreasing this allowance function. The permissible function is the (n-ai) function generating circuit 52 as described below.
It is supplied from No. 5.

【００５３】すなわち、許容ノイズレベルに対応するレ
ベルαは、クリティカルバンドのバンドの低域から順に
与えられる番号をｉとすると、次の式で求めることがで
きる。That is, the level α corresponding to the allowable noise level can be obtained by the following equation, where i is the number given in order from the low band of the critical band.

【００５４】α＝Ｓ−（ｎ−ａｉ）この式において、ｎ，ａは定数でａ＞０、Ｓは畳込み処
理されたバークスペクトルの強度であり、式中（ｎ−ａ
ｉ）が許容関数となる。例としてｎ＝３８，ａ＝−０．
５を用いることができる。Α = S- (n-ai) In this equation, n and a are constants and a> 0, and S is the intensity of the convolved Bark spectrum.
i) is the tolerance function. As an example, n = 38, a = -0.
5 can be used.

【００５５】このようにして、上記レベルαが求めら
れ、このデータは、割算器５２６に伝送される。当該割
算器５２６では、上記畳込みされた領域での上記レベル
αを逆コンボリューションするためのものである。した
がって、この逆コンボリューション処理を行うことによ
り、上記レベルαからマスキングスレッショールドが得
られるようになる。すなわち、このマスキングスレッシ
ョールドが許容ノイズスペクトルとなる。なお、上記逆
コンボリューション処理は、複雑な演算を必要とする
が、本実施例では簡略化した割算器５２６を用いて逆コ
ンボリューションを行っている。In this way, the level α is obtained, and this data is transmitted to the divider 526. The divider 526 is for deconvolution of the level α in the convolved area. Therefore, by performing the inverse convolution processing, the masking threshold can be obtained from the level α. That is, this masking threshold becomes the allowable noise spectrum. Although the above-mentioned inverse convolution processing requires complicated calculation, in this embodiment, the inverse convolution is performed using the simplified divider 526.

【００５６】次に、上記マスキングスレッショールド
は、合成回路５２７を介して減算器５２８に伝送され
る。ここで、当該減算器５２８には、上記帯域毎のエネ
ルギ検出回路５２２からの出力、すなわち前述したバー
クスペクトルＳＢが、遅延回路５２９を介して供給され
ている。したがって、この減算器５２８で上記マスキン
グスレッショールドとバークスペクトルＳＢとの減算演
算が行われることで、図８に示すように、上記バークス
ペクトルＳＢは、当該マスキングスレッショールドＭＳ
のレベルで示すレベル以下がマスキングされることにな
る。なお、上記遅延回路５２９は、上記合成回路５２７
以前の各回路での遅延量を考慮してエネルギ検出回路５
２２からのバークスペクトルＳＢを遅延させるために設
けられている。Next, the masking threshold is transmitted to the subtractor 528 via the synthesizing circuit 527. Here, the output from the energy detection circuit 522 for each band, that is, the above-described Bark spectrum SB is supplied to the subtractor 528 via the delay circuit 529. Therefore, the subtractor 528 performs a subtraction operation on the masking threshold and the Bark spectrum SB, so that the Bark spectrum SB shows the masking threshold MS as shown in FIG.
The level below the level indicated by will be masked. The delay circuit 529 is the same as the synthesis circuit 527.
Energy detection circuit 5 considering the delay amount in each circuit before
It is provided to delay the Bark spectrum SB from 22.

【００５７】当該減算器５２８からの出力は、許容雑音
補正回路５３０を介し、出力端子５３１を介して取り出
され、例えば配分ビット数情報が予め記憶されたＲＯＭ
等（図示せず）に送られる。このＲＯＭ等は、上記減算
回路５２８から許容雑音補正回路５３０を介して得られ
た出力（上記各バンドのエネルギと上記ノイズレベル設
定手段の出力との差分のレベル）に応じ、各バンド毎の
配分ビット数情報を出力する。The output from the subtractor 528 is taken out via the allowable noise correction circuit 530 and the output terminal 531. For example, the ROM in which the distribution bit number information is stored in advance.
Etc. (not shown). The ROM or the like is distributed for each band according to the output (the level of the difference between the energy of each band and the output of the noise level setting means) obtained from the subtraction circuit 528 through the allowable noise correction circuit 530. Outputs bit number information.

【００５８】このようにしてエネルギ依存ビットと聴覚
許容雑音レベルに依存したビットは加算されてその配分
ビット数情報が上記適応ビット割当符号化回路１６〜１
８に送られることで、ここでＭＤＣＴ回路１３〜１５か
らの周波数領域の各スペクトルデータがそれぞれのバン
ド毎に割り当てられたビット数で量子化されるわけであ
る。In this way, the energy-dependent bits and the bits depending on the permissible noise level of the hearing are added, and the information on the number of allocated bits thereof is the adaptive bit allocation coding circuits 16-1.
By sending the data to the M.C.R.8, the spectrum data in the frequency domain from the MDCT circuits 13 to 15 are quantized by the number of bits assigned to each band.

【００５９】すなわち要約すれば、上記適応ビット割当
符号化回路１６〜１８では、上記クリティカルバンドの
各バンド帯域毎（クリティカルバンド毎）若しくは高域
においては当該クリティカルバンドを更に複数帯域に分
割した帯域のエネルギ若しくはピーク値と、上記ノイズ
レベル設定手段の出力との差分のレベルに応じて配分さ
れたビット数で上記各バンド毎のスペクトルデータを量
子化することになる。In summary, in the adaptive bit allocation encoding circuits 16 to 18, in each band band (each critical band) of the critical band or in the high band, the critical band is further divided into a plurality of bands. The spectrum data for each band is quantized by the number of bits distributed according to the level of the difference between the energy or peak value and the output of the noise level setting means.

【００６０】ところで、上述した合成回路５２７での合
成の際には、最小可聴カーブ発生回路５３２から供給さ
れる図９に示すような人間の聴覚特性であるいわゆる最
小可聴カーブＲＣを示すデータと、上記マスキングスレ
ッショールドＭＳとを合成することができる。この最小
可聴カーブにおいて、雑音絶対レベルがこの最小可聴カ
ーブ以下ならば該雑音は聞こえないことになる。この最
小可聴カーブは、コーディングが同じであっても例えば
再生時の再生ボリュームの違いで異なるものとなるが、
現実的なディジタルシステムでは、例えば１６ビットダ
イナミックレンジへの音楽のはいり方にはさほど違いが
ないので、例えば４ｋＨｚ付近の最も耳に聞こえやすい
周波数帯域の量子化雑音が聞こえないとすれば、他の周
波数帯域ではこの最小可聴カーブのレベル以下の量子化
雑音は聞こえないと考えられる。したがって、このよう
に例えばシステムの持つダイナミックレンジの４ｋＨｚ
付近の雑音が聞こえない使い方をすると仮定し、この最
小可聴カーブＲＣとマスキングスレッショールドＭＳと
を共に合成することで許容ノイズレベルを得るようにす
ると、この場合の許容ノイズレベルは、図９中の斜線で
示す部分までとすることができるようになる。なお、本
実施例では、上記最小可聴カーブの４ｋＨｚのレベル
を、例えば２０ビット相当の最低レベルに合わせてい
る。また、この図９は、信号スペクトルＳＳも同時に示
している。By the way, at the time of synthesizing by the above-mentioned synthesizing circuit 527, data showing a so-called minimum audible curve RC which is the human auditory characteristic as shown in FIG. The masking threshold MS can be combined. In this minimum audible curve, if the absolute noise level is below this minimum audible curve, the noise will not be heard. This minimum audible curve is different even if the coding is the same, for example, due to the difference in playback volume during playback,
In a realistic digital system, there is not much difference in how music is put into a 16-bit dynamic range, so if the quantization noise in the most audible frequency band around 4 kHz is inaudible, for example, other It is considered that quantization noise below the level of this minimum audible curve is inaudible in the frequency band. Therefore, for example, the dynamic range of the system is 4 kHz.
Assuming that the noise is not heard in the vicinity, and the allowable noise level is obtained by synthesizing the minimum audible curve RC and the masking threshold MS together, the allowable noise level in this case is shown in FIG. It will be possible to go up to the part shown by the diagonal line of. In this embodiment, the level of 4 kHz of the minimum audible curve is set to the minimum level equivalent to 20 bits, for example. Further, FIG. 9 also shows the signal spectrum SS at the same time.

【００６１】また、上記許容雑音補正回路５３０では、
補正情報出力回路５３３から送られてくる例えば等ラウ
ドネスカーブの情報に基づいて、上記減算器５２８から
の出力における許容雑音レベルを補正している。ここ
で、等ラウドネスカーブとは、人間の聴覚特性に関する
特性曲線であり、例えば１ｋＨｚの純音と同じ大きさに
聞こえる各周波数での音の音圧を求めて曲線で結んだも
ので、ラウドネスの等感度曲線とも呼ばれる。またこの
等ラウドネス曲線は、図９に示した最小可聴カーブＲＣ
と略同じ曲線を描くものである。この等ラウドネス曲線
においては、例えば４ｋＨｚ付近では１ｋＨｚのところ
より音圧が８〜１０ｄＢ下がっても１ｋＨｚと同じ大き
さに聞こえ、逆に、５０Ｈｚ付近では１ｋＨｚでの音圧
よりも約１５ｄＢ高くないと同じ大きさに聞こえない。
このため、上記最小可聴カーブのレベルを越えた雑音
（許容ノイズレベル）は、この等ラウドネス曲線に応じ
たカーブで与えられる周波数特性を持つようにするのが
良いことがわかる。このようなことから、上記等ラウド
ネス曲線を考慮して上記許容ノイズレベルを補正するこ
とは、人間の聴覚特性に適合していることがわかる。Further, in the allowable noise correction circuit 530,
The allowable noise level in the output from the subtractor 528 is corrected based on the information of the equal loudness curve sent from the correction information output circuit 533, for example. Here, the equal loudness curve is a characteristic curve relating to human auditory characteristics, for example, a curve obtained by obtaining the sound pressure of sound at each frequency that sounds the same as a pure tone of 1 kHz, and connecting the curves. Also called sensitivity curve. Further, this equal loudness curve is the minimum audible curve RC shown in FIG.
It draws almost the same curve as. In this equal loudness curve, for example, at 4 kHz, even if the sound pressure drops by 8 to 10 dB from 1 kHz, it sounds as loud as 1 kHz, and conversely, at around 50 Hz, it must be about 15 dB higher than the sound pressure at 1 kHz. It doesn't sound the same.
Therefore, it is understood that the noise exceeding the level of the minimum audible curve (allowable noise level) should have the frequency characteristic given by the curve corresponding to the equal loudness curve. From this, it can be seen that correcting the permissible noise level in consideration of the equal loudness curve is suitable for human hearing characteristics.

【００６２】以上述べた聴覚許容雑音レベルに依存した
スペクトル形状を使用可能総ビット１２８Ｋｂｐｓの内
のある割合を用いるビット配分でつくる。この割合は入
力信号のトーナリティが高くなるほど減少する。The spectrum shape depending on the permissible hearing noise level described above is created by bit allocation using a certain ratio of the total usable bits of 128 Kbps. This ratio decreases as the tonality of the input signal increases.

【００６３】次に２つのビット配分手法の間でのビット
量分割手法について説明する。Next, a bit amount dividing method between the two bit allocation methods will be described.

【００６４】図３に戻って、ＭＤＣＴ回路出力が供給さ
れる入力端子８０１からの信号は、スペクトルの滑らか
さ算出回路８０８にも与えられ、ここでスペクトルの滑
らかさが算出される。本実施例では、信号スペクトルの
絶対値の隣接値間の差の絶対値の和を信号スペクトルの
絶対値の和で割った値を、上記スペクトルの滑らかさと
して算出している。Returning to FIG. 3, the signal from the input terminal 801 to which the output of the MDCT circuit is supplied is also given to the spectrum smoothness calculating circuit 808, and the smoothness of the spectrum is calculated here. In this embodiment, a value obtained by dividing the sum of the absolute values of the differences between the adjacent values of the signal spectrum by the sum of the absolute values of the signal spectrum is calculated as the smoothness of the spectrum.

【００６５】上記スペクトルの滑らかさ算出回路８０８
の出力は、ビット分割率決定回路８０９に与えられ、こ
こでエネルギ依存のビット配分と、聴覚許容雑音スペク
トルによるビット配分間のビット分割率とが決定され
る。ビット分割率はスペクトルの滑らかさ算出回路８０
８の出力値が大きいほど、スペクトルの滑らかさが無い
と考えて、エネルギ依存のビット配分よりも、聴覚許容
雑音スペクトルによるビット配分に重点をおいたビット
配分を行う。ビット分割率決定回路８０９は、それぞれ
エネルギ依存のビット配分及び聴覚許容雑音スペクトル
によるビット配分の大きさをコントロールするマルチプ
ライヤ８１１及び８１２に対してコントロール出力を送
る。ここで、仮にスペクトルが滑らかであり、エネルギ
依存のビット配分に重きをおくように、マルチプライヤ
８１１へのビット分割率決定回路８０９の出力が０．８
の値を取ったとき、マルチプライヤ８１２へのビット分
割率決定回路８０９の出力は１−０．８＝０．２とす
る。これら２つのマルチプライヤの出力は加算器８０６
で足し合わされて最終的なビット配分情報となって、出
力端子８０７から出力される。The above-mentioned spectrum smoothness calculation circuit 808
Is supplied to the bit division rate determination circuit 809, and the bit division rate between the energy-dependent bit allocation and the bit allocation according to the perceptual noise spectrum is determined. The bit division rate is the smoothness calculation circuit 80 of the spectrum.
It is considered that the larger the output value of 8, the smoother the spectrum is, and the bit allocation is performed with more emphasis on the bit allocation by the perceptual noise spectrum than the energy-dependent bit allocation. The bit division ratio determination circuit 809 sends control outputs to multipliers 811 and 812 which control the amount of bit distribution depending on the energy and the permissible noise spectrum of the auditory sense, respectively. Here, if the spectrum is smooth and the output of the bit division rate determination circuit 809 to the multiplier 811 is 0.8 so as to emphasize the energy-dependent bit allocation.
, The output of the bit division rate determination circuit 809 to the multiplier 812 is 1-0.8 = 0.2. The outputs of these two multipliers are the adders 806.
Are added together to form final bit allocation information, which is output from the output terminal 807.

【００６６】このときのビット配分の様子を図１０、図
１１に示す。また、これに対応する量子化雑音の様子を
図１２、図１３に示す。図１０は信号のスペクトルが割
合平坦である場合を示しており、図１１は信号スペクト
ルが高いトーナリティを示す場合を示している。また、
図１０及び図１１の図中ＱＳは信号レベル依存分のビッ
ト量を示し、図中ＱＮは聴覚許容雑音レベル依存のビッ
ト割当分のビット量を示している。図１２及び図１３の
図中Ｌは信号レベルを示し、図中ＮＳは信号レベル依存
分による雑音低下分を、図中ＮＮは聴覚許容雑音レベル
依存のビット割当分による雑音低下分を示している。The state of bit allocation at this time is shown in FIGS. 12 and 13 show the state of quantization noise corresponding to this. FIG. 10 shows the case where the spectrum of the signal is flat, and FIG. 11 shows the case where the signal spectrum shows high tonality. Also,
In FIGS. 10 and 11, QS indicates the bit amount corresponding to the signal level, and QN in the diagrams indicates the bit amount corresponding to the bit allocation depending on the permissible hearing noise level. 12 and 13, L represents the signal level, NS in the figures represents the noise reduction due to the signal level dependency, and NN in the figures represents the noise reduction due to the permissible noise level dependent bit allocation. .

【００６７】先ず、信号のスペクトルが、割合平坦であ
る場合を示す図１２において、聴覚許容雑音レベルに依
存したビット配分は、全帯域に渡り大きい信号雑音比を
取るために役立つ。しかし低域及び高域では比較的少な
いビット配分が使用されている。これは聴覚的にこの帯
域の雑音に対する感度が小さいためである。信号エネル
ギレベルに依存したビット配分の分は量としては少ない
が、ホワイトな雑音スペクトルを生じるように、この場
合には中低域の信号レベルの高い周波数領域に重点的に
配分されている。First, in FIG. 12, which shows a case where the spectrum of the signal is flat, the bit allocation depending on the permissible noise level of the hearing aids in obtaining a large signal-to-noise ratio over the entire band. However, relatively low bit allocations are used in the low and high frequencies. This is because auditory sensitivity to noise in this band is low. The bit allocation depending on the signal energy level is small in amount, but in this case, it is concentrated in the high frequency region of the signal level in the middle and low frequencies so as to generate a white noise spectrum.

【００６８】これに対して、図１３に示すように、信号
スペクトルが高いトーナリティを示す場合には、信号エ
ネルギレベルに依存したビット配分量が多くなり、量子
化雑音の低下は極めて狭い帯域の雑音を低減するために
使用される。聴覚許容雑音レベルに依存したビット配分
の集中はこれよりもきつくない。On the other hand, as shown in FIG. 13, when the signal spectrum shows a high tonality, the amount of bit allocation depending on the signal energy level increases, and the quantization noise is reduced in a very narrow band. Used to reduce The concentration of bit allocation depending on the permissible noise level of the hearing is less tight than this.

【００６９】図１３に示すように、この両者のビット配
分の和により、孤立スペクトル入力信号での特性の向上
が達成される。As shown in FIG. 13, the improvement of the characteristics in the isolated spectrum input signal is achieved by the sum of the bit allocations of the both.

【００７０】図１４は、図１に示した高能率符号化装置
によって符号化された信号を、再び復号化するための基
本的な本発明実施例の高能率復号化装置を示している。FIG. 14 shows a basic high efficiency decoding apparatus of the present invention for decoding the signal coded by the high efficiency coding apparatus shown in FIG. 1 again.

【００７１】この図１４において、各帯域の量子化され
たＭＤＣＴ係数は復号化装置入力端子１２２、１２４、
１２６に与えられ、使用されたブロックサイズ情報は入
力端子１２３、１２５、１２７に与えられる。復号化回
路１１６、１１７、１１８では適応ビット配分情報を用
いてビット割当を解除する。In FIG. 14, the quantized MDCT coefficients of each band are the decoding device input terminals 122 and 124,
The block size information given to 126 and used is given to the input terminals 123, 125, 127. The decoding circuits 116, 117 and 118 cancel the bit allocation using the adaptive bit allocation information.

【００７２】次に、ＩＭＤＣＴ（逆変更離散コサイン変
換）回路１１３、１１４、１１５では周波数領域の信号
が時間領域の信号に変換される。これらの部分帯域の時
間領域信号は、ＩＱＭＦ（逆ＱＭＦ）回路１１２、１１
１により、全体域信号に復号化され、出力端子１１０へ
送られる。Next, the IMDCT (Inverse Change Discrete Cosine Transform) circuits 113, 114 and 115 convert the frequency domain signals into time domain signals. The time domain signals of these sub-bands are IQMF (inverse QMF) circuits 112 and 11
1, it is decoded into a full-range signal and sent to the output terminal 110.

【００７３】次に、本発明実施例の伝送媒体は、上述し
たような本発明実施例の高能率符号化装置により符号化
された信号を伝送するものであり、ここでの伝送には記
録も含まれ、したがって、本発明の記録媒体（記録メデ
ィア）としては、例えば、光ディスク，光磁気ディス
ク，磁気ディスク等のディスク状の記録媒体に上記符号
化信号が記録されたものや、磁気テープ等のテープ状記
録媒体に上記符号化信号が記録されたもの、或いは、符
号化信号が記憶された半導体メモリ，ＩＣカードなどを
挙げることができる。また、本発明の伝送媒体として伝
送系を挙げた場合、その伝送媒体（伝送メディア）とし
ては、例えば電線若しくは光ケーブルや電波等を挙げる
ことができる。Next, the transmission medium of the embodiment of the present invention transmits the signal coded by the high-efficiency encoder of the embodiment of the present invention as described above. Therefore, the recording medium (recording medium) of the present invention includes, for example, a disc-shaped recording medium such as an optical disc, a magneto-optical disc, and a magnetic disc in which the above-mentioned encoded signal is recorded, and a magnetic tape. Examples thereof include a tape-shaped recording medium in which the above-mentioned coded signal is recorded, a semiconductor memory in which the coded signal is stored, an IC card, and the like. When a transmission system is used as the transmission medium of the present invention, examples of the transmission medium (transmission medium) include electric wires, optical cables, and radio waves.

【００７４】なお、本発明はこの実施例にのみ限定され
るものではなく、例えば、上記の記録再生媒体と上記他
の記録再生媒体とは一体化されている必要はなく、その
間をデータ転送用回線等で結ぶことも可能である。更に
例えば、オーディオＰＣＭ信号のみならず、ディジタル
音声（スピーチ）信号やディジタルビデオ信号等の信号
処理装置にも適用可能である。また、上述した最小可聴
カーブの合成処理を行わない構成としてもよい。この場
合には、図６中の最小可聴カーブ発生回路５３２、合成
回路５２７が不要となり、上記引算器５２６からの出力
は、直ちに減算器５２８へ伝送されることになる。The present invention is not limited to this embodiment. For example, the recording / reproducing medium described above and the other recording / reproducing medium do not need to be integrated, and data is transferred between them. It is also possible to connect by a line or the like. Furthermore, for example, the present invention can be applied not only to audio PCM signals but also to signal processing devices for digital audio (speech) signals, digital video signals, and the like. Further, the above-described minimum audible curve synthesizing process may not be performed. In this case, the minimum audible curve generating circuit 532 and the synthesizing circuit 527 in FIG. 6 are unnecessary, and the output from the subtractor 526 is immediately transmitted to the subtractor 528.

【００７５】また、ビット配分手法は多種多様であり、
最も簡単には固定のビット配分もしくは信号の各帯域エ
ネルギによる簡単なビット配分もしくは固定分と可変分
を組み合わせたビット配分などを使うことが可能であ
る。There are various bit allocation methods,
In the simplest case, it is possible to use fixed bit allocation, simple bit allocation by each band energy of the signal, or bit allocation combining fixed and variable components.

【００７６】また、入力信号が倍音関係を持つか否かを
判断し、ビット配分を変える手法も考えられる。例え
ば、図３中の倍音成分検出回路８１３において明確な倍
音関係が検出できなかった場合、エネルギ依存のビット
配分回路８０４及びビット端数調整回路８１４への出力
を行わない構成にすることも可能である。A method is also conceivable in which it is determined whether or not the input signal has a harmonic relationship and the bit allocation is changed. For example, when a clear overtone relation cannot be detected by the overtone component detection circuit 813 in FIG. 3, it is possible to adopt a configuration in which output to the energy-dependent bit distribution circuit 804 and the bit fraction adjustment circuit 814 is not performed. .

【００７７】本発明実施例は、以上のような種々の変形
が考えられる。Various modifications as described above can be considered in the embodiment of the present invention.

【００７８】[0078]

【発明の効果】以上の説明からも明らかなように、本発
明においては以下の効果を得ることができる。すなわ
ち、入力信号が持つ倍音関係を考慮してビット配分を行
うことにより、入力信号に対し適応的なビットの配分を
行うことが可能となり、同一のビットレートでは、聴感
上、良好な音質を得ることが可能となる。また、低いビ
ットレートでは、音質劣化を防止することができる。As is clear from the above description, the following effects can be obtained in the present invention. In other words, by allocating bits in consideration of the harmonic relationship of the input signal, it becomes possible to allocate bits adaptively to the input signal, and at the same bit rate, good sound quality can be obtained in terms of hearing. It becomes possible. Further, at a low bit rate, it is possible to prevent sound quality deterioration.

[Brief description of drawings]

【図１】本発明実施例の高能率符号化装置の構成例を示
すブロック回路図である。FIG. 1 is a block circuit diagram showing a configuration example of a high-efficiency coding apparatus according to an embodiment of the present invention.

【図２】本実施例装置での周波数及び時間領域における
直交変換ブロックサイズの具体例を示す図である。FIG. 2 is a diagram showing a specific example of orthogonal transform block sizes in the frequency and time domains in the device of this embodiment.

【図３】本発明実施例のビット配分機能の構成例を示す
ブロック回路図である。FIG. 3 is a block circuit diagram showing a configuration example of a bit allocation function of the embodiment of the present invention.

【図４】入力信号の倍音関係を示す図である。FIG. 4 is a diagram showing a harmonic relationship of input signals.

【図５】入力信号の倍音関係を考慮した場合と考慮しな
い場合における、各ブロックの量子化雑音量の変化を示
す図である。FIG. 5 is a diagram showing changes in the amount of quantization noise in each block, with and without consideration of a harmonic relationship of an input signal.

【図６】本発明実施例の聴覚マスキングスレッショール
ド算定機能の構成例を示すブロック回路図である。FIG. 6 is a block circuit diagram showing a configuration example of a hearing masking threshold calculation function of an embodiment of the present invention.

【図７】各臨界帯域信号によるマスキングを示す図であ
る。FIG. 7 is a diagram showing masking by each critical band signal.

【図８】各臨界帯域信号によるマスキングスレッショー
ルドを示す図である。FIG. 8 is a diagram showing a masking threshold by each critical band signal.

【図９】情報スペクトル、マスキングスレショールド、
最小可聴限を示す図である。FIG. 9: Information spectrum, masking threshold,
It is a figure which shows the minimum audible limit.

【図１０】信号スペクトルが平坦な情報信号に対する信
号レベル依存及び聴覚許容雑音レベル依存のビット配分
を示す図である。10 is a diagram showing signal level-dependent and auditory permissible noise level-dependent bit allocation for an information signal having a flat signal spectrum. FIG.

【図１１】信号スペクトルのトーナリティが高い情報信
号に対する信号レベル依存及び聴覚許容雑音レベル依存
のビット配分を示す図である。FIG. 11 is a diagram showing signal level-dependent and auditory permissible noise level-dependent bit allocation for an information signal having a high tonality of a signal spectrum.

【図１２】信号スペクトルが平坦な情報信号に対する量
子化雑音レベルを示す図である。FIG. 12 is a diagram showing a quantization noise level for an information signal having a flat signal spectrum.

【図１３】トーナリティが高い情報信号に対する量子化
雑音レベルを示す図である。FIG. 13 is a diagram showing a quantization noise level for an information signal having high tonality.

【図１４】本発明実施例の高能率復号化装置の構成例を
示すブロック回路図である。FIG. 14 is a block circuit diagram showing a configuration example of a high efficiency decoding device according to an embodiment of the present invention.

[Explanation of symbols]

８０１ＭＤＣＴ回路出力入力端子８０２使用可能総ビット発生回路８０３帯域毎のエネルギ算出回路８０４エネルギ依存のビット配分回路８０５聴覚許容雑音レベル依存のビット配分回路８０６加算器８０７各帯域のビット割当量出力端子８０８スペクトルの滑らかさ算出回路８０９ビット分割率決定回路８１１、８１２マルチプライヤ８１３倍音成分検出回路８１４ビット端数調整回路 801 MDCT circuit output input terminal 802 Total usable bit generation circuit 803 Energy calculation circuit for each band 804 Energy-dependent bit allocation circuit 805 Hearing permissible noise level-dependent bit allocation circuit 806 Adder 807 Bit allocation amount output terminal for each band 808 Spectrum smoothness calculation circuit 809 Bit division rate determination circuit 811, 812 Multiplier 813 Overtone component detection circuit 814 Bit fraction adjustment circuit

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁶ 識別記号庁内整理番号ＦＩ技術表示箇所Ｈ０３Ｍ 7/02 9382−5Ｋ 7/30 Ａ 9382−5ＫＨ０４Ｂ 14/00 Ｅ ─────────────────────────────────────────────────── ─── Continuation of the front page (51) Int.Cl. ⁶ Identification code Office reference number FI Technical indication location H03M 7/02 9382-5K 7/30 A 9382-5K H04B 14/00 E

Claims

[Claims]

1. An input signal is decomposed into a plurality of frequency band components to obtain signal components in a plurality of two-dimensional blocks with respect to time and frequency, and features of the signal components in the two-dimensional block for each of the two-dimensional blocks. , The bit allocation amount is determined based on the quantized coefficient, and the signal component in the two-dimensional block is quantized for information compression for each two-dimensional block, and the compression information for each two-dimensional block is obtained. In the high-efficiency coding method for outputting together with the information compression parameter for each two-dimensional block, when performing bit allocation adapted to the input signal, the overtone relation of the input signal is detected, and each two-dimensional block containing the overtone component is detected. A high-efficiency coding method characterized by increasing the degree of concentration of bit allocation amounts for blocks.

2. The harmonic relation of the input signal is detected on the basis of spectrum information on the frequency axis of the input signal when performing bit allocation adapted to the input signal. High efficiency coding method.

3. When performing bit allocation adapted to an input signal, a harmonic relationship of the input signal is detected based on a quantized coefficient representing a signal component in a two-dimensional block. The high-efficiency coding method according to Item 1.

4. When performing bit allocation adapted to an input signal, all or part of the frequency band is examined to detect the overtone relationship of the input signal.
High efficiency coding method described.

5. When performing bit allocation adapted to an input signal, only low-order harmonic components are detected, the frequency bands of other harmonic components are calculated based on these harmonic components, and the entire frequency band is calculated. The high-efficiency coding method according to claim 1, wherein the overtone relation possessed by is detected.

6. When performing bit allocation adapted to an input signal, only the overtone components of the fundamental tone and the first overtone are detected, and the frequency bands of other overtone components are calculated based on these overtone components, The high-efficiency coding method according to claim 4, wherein the harmonic relations of all frequency bands are detected.

7. When performing bit allocation adapted to an input signal, based on spectrum information of all or a part of the frequency band of the input signal and / or quantization information representing a signal component in a two-dimensional block, Detect only low-order harmonic components and / or fundamental and first harmonic overtone components, calculate the frequency band of other harmonic components based on these harmonic components, and detect the harmonic relationship of all frequency bands. The high-efficiency coding method according to claim 1.

8. The method according to claim 1, wherein, when performing bit allocation adapted to an input signal, the degree of bit allocation is increased with respect to a two-dimensional block containing all or a part of overtone components. High efficiency coding method.

9. When performing bit allocation adapted to an input signal, the degree of concentration of bit allocation is increased for all or some of the two-dimensional blocks containing even harmonic components. The high-efficiency coding method described in 1.

10. When performing bit allocation adapted to an input signal, all or a part of 2 including an odd-order harmonic component is included.
The high-efficiency coding method according to claim 1, wherein the degree of concentration of bit allocation is increased with respect to the dimensional block.

11. When determining a bit allocation amount adapted to an input signal, when the bit allocation amount is small with respect to a predetermined bit rate, priority is given to all or some of the two-dimensional blocks containing overtone components. 2. The high-efficiency coding method according to claim 1, further comprising adding bits to the bits to adjust the fraction of the bits.

12. When determining the bit allocation amount adapted to an input signal, when the bit allocation amount is large with respect to a predetermined bit rate, priority is given to all or some of the two-dimensional blocks that do not include overtone components. 2. The high-efficiency coding method according to claim 1, wherein the bits are deleted to adjust the fraction of the bits.

13. When determining a bit allocation amount adapted to an input signal, when the bit allocation amount is small with respect to a predetermined bit rate, priority is given to all or some two-dimensional blocks containing even harmonic components. 2. The high-efficiency encoding method according to claim 1, wherein the fractional adjustment of bits is performed by adding bits.

14. When determining a bit allocation amount adapted to an input signal, when the bit allocation amount is large with respect to a predetermined bit rate, for all or some of the two-dimensional blocks that do not include an odd harmonic component. 2. The high efficiency encoding method according to claim 1, wherein the bits are preferentially deleted to adjust the fraction of the bits.

15. A block signal component forming means for decomposing an input signal into a plurality of frequency band components to form signal components in a plurality of two-dimensional blocks regarding time and frequency, and the two-dimensional block for each two-dimensional block. An information compression means for obtaining a quantized coefficient representing a characteristic of a signal component in a block, determining a bit allocation amount based on the quantized coefficient, and quantizing the signal component in the two-dimensional block for each two-dimensional block to compress information. In the high-efficiency coding apparatus for outputting the compression information for each of the two-dimensional blocks together with the information compression parameter for each of the two-dimensional blocks, the information compression means detects the overtone relationship of the input signal. Detecting means and bit allocation amount determining means for determining the bit allocation amount, and when performing bit allocation adapted to the input signal, harmonic components are included. High-efficiency encoding apparatus characterized by increasing the concentration of the bit allocation with respect to the original block.

16. The overtone relationship detecting means detects an overtone relationship of the input signal based on spectrum information on the frequency axis of the input signal.
The high-efficiency coding device described.

17. The high-efficiency code according to claim 15, wherein the overtone relationship detecting means detects an overtone relationship of the input signal based on a quantization coefficient representing a signal component in a two-dimensional block. Device.

18. The high-efficiency coding apparatus according to claim 1, wherein the overtone relation detecting means detects the overtone relation of the input signal by examining all or a part of frequency bands.

19. The overtone relationship detecting means detects only low-order overtone components, calculates frequency bands of other overtone components based on these overtone components, and detects overtone relationships of all frequency bands. The high-efficiency coding device according to claim 15, wherein

20. The overtone relation detecting means detects only the overtone components of the fundamental tone and the first overtone, calculates the frequency bands of the other overtone components based on these overtone components, and the overtones of the entire frequency band. The high efficiency encoding device according to claim 18, wherein the relationship is detected.

21. The harmonic overtone detection unit detects low order harmonic overtone components based on spectrum information of frequency bands of all or a part of an input signal and / or quantized information representing a signal component in a two-dimensional block. And / or detecting only the overtone components of the fundamental tone and the first overtone, calculating the frequency bands of other overtone components based on these overtone components, and detecting the overtone relationship of all frequency bands. 15. The high-efficiency coding device according to item 15.

22. The bit allocation amount determining means enhances the degree of concentration of bit allocation with respect to a two-dimensional block including all or some of the overtone components.
The high-efficiency coding device described.

23. The high efficiency according to claim 15, wherein the bit allocation amount determining means increases the degree of concentration of bit allocation for all or some of the two-dimensional blocks including even harmonic components. Encoding device.

24. The high efficiency according to claim 15, wherein the bit allocation amount determining means increases the degree of bit allocation concentration for all or some of the two-dimensional blocks containing odd harmonic components. Encoding device.

25. When the bit allocation amount adapted to an input signal is determined, the bit allocation amount determining means includes all or a part of the 2 or more parts which include harmonic components when the bit allocation amount is smaller than a predetermined bit rate. 16. The high-efficiency encoding apparatus according to claim 15, wherein bits are preferentially added to the dimensional block to adjust the fraction of the bits.

26. When determining the bit allocation amount adapted to an input signal, the bit allocation amount determining means does not include all or a part of overtone components when the bit allocation amount is large with respect to a predetermined bit rate. 16. The high-efficiency coding device according to claim 15, wherein bits are preferentially deleted from the two-dimensional block to adjust the fraction of the bits.

27. When determining the bit allocation amount adapted to an input signal, the bit allocation amount determining means includes all or a part of even harmonic components if the bit allocation amount is smaller than a predetermined bit rate. 16. The high-efficiency coding apparatus according to claim 15, wherein bits are preferentially added to the two-dimensional block to adjust the fraction of the bits.

28. The bit allocation amount determining means, when determining a bit allocation amount adapted to an input signal, does not include an odd harmonic component when the bit allocation amount is large with respect to a predetermined bit rate, or all or part of the odd harmonic components are not included. 16. The high efficiency encoding device according to claim 15, wherein bits are preferentially deleted in the two-dimensional block to adjust the fraction of the bits.

29. A transmission medium, which transmits compression information and information compression parameters compression-encoded by the high-efficiency encoding method according to any one of claims 1 to 14. .

30. A transmission medium, which transmits compression information and information compression parameters compression-encoded by the high-efficiency encoding device according to any one of claims 15 to 28. .