KR100351772B1

KR100351772B1 - Digital encoding device, digital recording signal derivation device and digital signal data derivation method

Info

Publication number: KR100351772B1
Application number: KR1019920013473A
Authority: KR
Inventors: 아까기리겐쪼
Original assignee: 소니 가부시끼 가이샤
Priority date: 1991-08-02
Filing date: 1992-07-28
Publication date: 2002-12-16
Also published as: JPH0537396A; JP3134383B2; KR930005379A

Abstract

(구성) 직교 변환되어 입력 단자(301)에서 공급된 스펙트럼 데이터를 크리티컬 밴드마다 적응적으로 비트 할당할 때, 비트 할당에 사용 가능한 전체 비트에 관하여, 신호 에너지에 의존한 비트 배분 회로(304)에서의 할당분과, 고정 비트 배분 회로(305)에서의 할당분으로 분할되어 배분된다.(Configuration) In the bit distribution circuit 304 depending on the signal energy, in terms of the total bits available for bit allocation, when adaptively bit-aligning the spectrum data supplied from the input terminal 301 by orthogonal conversion Is divided into allocations and the allocations in the fixed bit allocation circuit 305 are divided.

(효과) 신호 스펙트럼이 분산되어 있는 경우에도, 신호의 토낼러티가 높은 경우에도, 각각 양호한 특성을 얻을 수 있다.(Effect) Even when the signal spectrum is dispersed, even when the tonality of the signal is high, good characteristics can be obtained, respectively.

Description

Digital encoding device, digital record signal derivation device and digital signal data derivation method

본 발명은, 소위 고능률 부호화에 의해 입력 디지털 본 발명에 따른의 부호화를 행하고, 전송 또는 기록 재생하고, 복호화하여 재생신호를 얻는, 디지털 부호화 장치, 디지털 입력 신호 압축 시스템, 압축 디지털 기록 신호 도출 장치 및 방법에 관한 것이다.The present invention provides a digital encoding device, a digital input signal compression system, and a compressed digital recording signal deriving device, which perform encoding, transmission, recording, reproducing, and decoding according to an input digital according to the present invention by so-called high efficiency encoding to obtain a reproduction signal. And to a method.

오디오 또는 음성 등의 신호의 고능률 부호화의 수법에는 여러 가지가 있지만, 예를 들면, 시간축 상의 오디오 신호등을 블록화하지 않고, 복수의 주파수 대역에서 필터로 분할하여 부호화하는 비(非) 블록화 주파수 대역 분할 방식으로서, 대역 분할 부호화(서브 대역 코딩 : SBC)등을 들 수 있고, 또한 시간축의 신호를 주파수 축 상의 신호로 변환(직교 변환)하여 복수의 주파수 대역으로 분할하고, 각각의 대역마다 부호화하는 블록화 주파수 대역 분할 방식, 소위 변환 부호화 등을 들 수 있다.There are various methods of high-efficiency encoding of signals such as audio or voice. For example, non-blocking frequency band division in which a plurality of frequency bands are divided and encoded in a plurality of frequency bands without blocking the audio signals on the time axis. Examples of the scheme include band division coding (sub band coding: SBC), and a block that converts a signal on a time axis into a signal on a frequency axis (orthogonal transformation), divides the signal into a plurality of frequency bands, and encodes each band. Frequency band division, so-called transform coding, and the like.

또한, 상술의 대역 분할 부호화와 변환 부호화를 조합한 고능률 부호화의 수법도 생각할 수 있고, 이 경우에는, 예를 들면, 상기 대역 분할 부호화로 대역 분할을 행한 후, 그 각각의 대역마다의 신호를 주파수 축 상의 신호로 직교 변환하고, 이 직교 변환된 각각의 대역마다 부호화가 행해진다.In addition, a method of high efficiency coding combining the above-described band division coding and transform coding can also be considered. In this case, for example, after band division is performed by the band division coding, signals for each band are obtained. Orthogonal transformation is performed on the signal on the frequency axis, and encoding is performed for each of the orthogonally transformed bands.

여기서, 상술한 대역 분할을 위한 필터로서는, 예를 들면, QMF 필터가 있고, 1976 R.E.Crochiere, Digital coding of speech in subbands, Bell Syst. Tech. J. Vol. 55, No.8 1976 에 기술되어 있다. 또한, ICASSP 83, BOSTON Polyphase Quadrature filters - A new subband coding technique, Joseph H. Rothweiler 에는 등(等) 대역폭의 필터 분할 수법이 기술되어 있다. 다음에 상술한 직교 변환으로서는, 예를 들면, 입력 오디오 신호를 소정 단위시간(프레임)으로 블록화하고, 해당 블록마다 고속 푸리에 변환(FFT), 이산적 코사인 변환(DCT), 수정 DCT 변환 (MDCT)등을 행함으로써 시간축을 주파수축으로 변환하는 것을 들 수 있다. MDCT에 관해서는 ICASSP 1987 Subband/Transform Coding Using Filter Bank Designs Based On Time Domain Aliasing Cancellation. J.P.Princen, A.B.Bradley, Univ. of Surrey Royal Melbourne Inst. of Tech.에 기술되어 있다.Here, as the above-described filter for band division, for example, there is a QMF filter, 1976 R. E. Crochiere, Digital coding of speech in subbands, Bell Syst. Tech. J. Vol. 55, No. 8, 1976. In addition, ICASSP 83, BOSTON Polyphase Quadrature filters-A new subband coding technique, Joseph H. Rothweiler, describes an equal bandwidth filter division technique. Next, as the orthogonal transform described above, for example, the input audio signal is blocked at a predetermined unit time (frame), and for each block, a fast Fourier transform (FFT), a discrete cosine transform (DCT), and a modified DCT transform (MDCT). And the like to convert the time axis into a frequency axis. Regarding MDCT, ICASSP 1987 Subband / Transform Coding Using Filter Bank Designs Based On Time Domain Aliasing Cancellation. J. P. Princen, A. B. Bradley, Univ. of Surrey Royal Melbourne Inst. of Tech.

더욱이, 주파수 대역 분할된 각각의 주파수 성분을 양자화하는 주파수 분할폭으로서는, 예를 들면, 인간의 청각 특성을 고려한 대역 분할이 행해진다. 즉, 일반적으로 임계 대역(크리티컬 밴드)이라고 하는 고역일수록 대역폭이 넓어지는 대역폭으로, 오디오 신호를 복수의 대역(예를 들면, 25개의 대역)으로 분할하는 것이 있다. 또한, 이때 각각의 대역마다의 본 발명에 따른을 부호화할 때는 각각의 대역마다 소정의 비트 배분 또는 각각의 대역마다 적응적 비트 할당(bit allocation)에 의한 부호화가 행해진다. 예를 들면, 상기 MDCT 처리되어 얻어진 계수 데이터를 상기 비트 할당에 의해 부호화할 때는, 상기 각각의 블록마다의 MDCT 처리에 의해 얻을 수 있는 각각의 대역마다의 MDCT 계수 데이터에 대해 적응적 할당 비트수로 부호화가 행해지게 된다.Furthermore, as the frequency division width for quantizing each frequency component divided into frequency bands, for example, band division in consideration of human auditory characteristics is performed. In other words, the higher bandwidth generally referred to as the critical band (critical band) is a bandwidth in which the bandwidth is wider, and there is a case where an audio signal is divided into a plurality of bands (for example, 25 bands). Further, at this time, when encoding according to the present invention for each band, encoding by predetermined bit allocation for each band or adaptive bit allocation for each band is performed. For example, when encoding the coefficient data obtained by the MDCT processing by the bit allocation, the number of adaptive allocation bits for the MDCT coefficient data for each band that can be obtained by the MDCT processing for each block is obtained. Encoding is done.

비트 할당 수법으로서는, 다음의 2 수법이 알려져 있다.As the bit allocation method, the following two methods are known.

먼저, IEEE Transaction of Acoustics, Speech, and Signal Processing, vol. ASSP-25, No.4, August 1977에서는, 각각의 대역마다의 크기를 기초로 비트 할당을 행하고 있다. 이 방식에서는, 양자화 잡음 스펙트럼이 평탄하게 되고, 잡음 에너지가 최소로 되지만, 청각적으로는 마스킹 효과가 이용되고 있지 않기 때문에 실제의 잡음감은 최적이 아니다.First, IEEE Transaction of Acoustics, Speech, and Signal Processing, vol. In ASSP-25, No. 4, and August 1977, bit allocation is performed based on the size of each band. In this system, the quantization noise spectrum is flattened and the noise energy is minimized. However, since the masking effect is not used acoustically, the actual noise is not optimal.

다음에, ICASSP 1980 The critical hand coder-digital encoding of the perceptual requirements of the auditory system, M.A.Kransner, MIT 에서는, 청각 마스킹을 이용하는 것으로, 각각의 대역마다 필요한 신호 대 잡음비를 얻고 고정적인 비트 할당을 행하는 수법이 기술되어 있다. 그러나 이 수법으로는 사인파 입력 특성을 측정하는 경우라도, 비트 할당이 고정적이기 때문에 특성값은 그만큼 좋은 값으로 되지 않는다.Next, in ICASSP 1980 The critical hand coder-digital encoding of the perceptual requirements of the auditory system, MAKransner, MIT, auditory masking is used to obtain the required signal-to-noise ratio for each band and to perform fixed bit allocation. This is described. However, even when the sine wave input characteristic is measured by this method, the characteristic value does not become such a good value because the bit allocation is fixed.

이와 같이, 각각의 대역의 신호의 크기로 비트 배분을 행하고 양자화 잡음 에너지를 최소로 하면 청각적 잡음 레벨은 최소로 되지 않고, 마스킹 효과를 고려하여 각각의 대역에 고정적인 비트 배분을 행하면 사인파 입력시에는 좋은 신호대 잡음 특성을 얻기 어렵다.In this way, if the bit allocation is performed by the signal size of each band and the quantization noise energy is minimized, the auditory noise level is not minimized. If the bit allocation is fixed to each band in consideration of the masking effect, It is difficult to obtain good signal-to-noise characteristics.

본 발명은 이와 같은 실정을 감안하여 이루어진 것으로, 청각적으로도 바람직하고, 또한 1kHz 사인파 입력과 같은 고립 스펙트럼 입력에 대해서도 양호한 특성이 얻어지는 비트 배분 수법을 가지는 디지털 데이터의 고능률 부호화 방법 및장치의 제공을 목적으로 한다.SUMMARY OF THE INVENTION The present invention has been made in view of the above circumstances, and provides a method and apparatus for efficiently encoding digital data having a bit allocation method that is acoustically preferable and obtains good characteristics even for an isolated spectral input such as a 1 kHz sine wave input. For the purpose.

본 발명에 따른 디지털 데이터의 고능률 부호화 방법은, 입력 디지털 신호를 복수의 주파수 대역으로 분할하는 동시에, 각각의 주파수 대역마다 적응적으로 블록 사이즈를 변화시킨 후, 직교 변환을 행함으로써 스펙트럼 데이터를 구하고, 이 스펙트럼 데이터를 크리티컬 밴드마다 적응적으로 비트를 할당하는 디지털 데이터의 고능률 부호화 방법에 있어서, 비트 할당에 사용할 수 있는 전체 비트에 대해, 어떠한 시간에 대해서도 미리 정해진 고정 비트 할당 패턴분과, 시간과 주파수에 관련하여 세분화된 소블록 중의 신호의 크기에 의존하는 할당분으로 배분됨으로써 상술한 과제를 해결한다. 또한 고정 비트 할당 패턴을 복수 개 가짐으로써, 보다 효과적인 할당이 가능해 진다.The high efficiency coding method of digital data according to the present invention divides an input digital signal into a plurality of frequency bands, adaptively changes the block size for each frequency band, and then obtains spectrum data by performing orthogonal transformation. In the high-efficiency encoding method of digital data in which the spectral data is adaptively assigned bits for each critical band, a predetermined bit allocation pattern for a predetermined time, and a time for all the bits that can be used for bit allocation, The above-mentioned problem is solved by being distributed to allocations which depend on the magnitude of the signal in the small block subdivided with respect to frequency. In addition, by having a plurality of fixed bit allocation patterns, more efficient allocation is possible.

또한, 입력 디지털 신호를 복수의 주파수 대역으로 분할하는 수단과, 상기 각각의 주파수 대역마다 적응적으로 블록 사이즈를 변화시키는 수단과, 상기 블록 사이즈를 변화시킨 각각의 주파수 대역마다 직교 변환을 행하는 스펙트럼 데이터를 구하는 수단과, 상기 스펙트럼 데이터를 크리티컬 밴드마다 적응적으로 비트를 할당하는 수단을 구비하고, 상기 비트 할당 수단은 비트 할당에 사용할 수 있는 전체 비트에 대해, 어떠한 시간에 대해서도 미리 정해진 고정 비트 할당 패턴분과, 시간과 주파수에 관련하여 세분화된 소블록중의 신호의 크기에 의존하는 할당분으로 배분하는 디지털 데이터의 고능률 부호화 장치에 의해서도 상술의 과제를 해결한다.Further, means for dividing the input digital signal into a plurality of frequency bands, means for adaptively changing the block size for each frequency band, and spectral data for performing orthogonal transformation for each frequency band for which the block size is changed And means for adaptively allocating the spectral data to every critical band, wherein the bit allocation means is predetermined fixed bit allocation pattern for any time, for all the bits available for bit allocation. The above-mentioned problem is also solved by a high efficiency encoding apparatus for digital data distributed to an allocation portion depending on the magnitude of the signal in the small block subdivided with respect to the branch and time and frequency.

이 경우, 신호의 크기가 작을수록, 고역 비트 할당비가 작은 패턴을 선택하는 것은, 유효 배분을 행한 다음에 유익하다. 비 블록킹 주파수 분할 출력을 더욱블록킹 주파수 분할하는 주파수 분해 수법을 이용하는 고능률 부호에 있어서는, 상기 비 블록킹 주파수 분할 출력의 크기로 패턴을 선택하는 것도 유익하다. 또한, 복수 개의 고정 비트 할당 패턴의 비트수를 같게 함으로써 하드웨어 규모를 작게 할 수 있다.In this case, the smaller the signal size is, the more advantageous it is to select a pattern having a small high frequency bit allocation ratio after effective distribution. In a high efficiency code using a frequency decomposition technique of further blocking frequency dividing the non-blocking frequency division output, it is also advantageous to select a pattern by the magnitude of the non-blocking frequency division output. In addition, the hardware scale can be reduced by making the number of bits of a plurality of fixed bit allocation patterns the same.

본 발명에 의하면, 음악 신호와 같이 스펙트럼이 분산되어 있는 경우에도 마스킹 효과에 의해 청각적으로 잡음 레벨을 낮게 할 수 있고, 또한 사인파 입력시에도 신호가 큰 대역으로 비트를 모을 수 있기 때문에 신호 대 잡음비를 크게 할 수 있다.According to the present invention, even when the spectrum is dispersed like a music signal, the noise level can be lowered acoustically due to the masking effect, and the signal-to-noise ratio can be gathered into a larger band even when a sine wave is input. Can be increased.

오디오 PCM 신호 등의 입력 디지털 신호를, 대역 분할 부호화(SBC), 적응변환 부호화(ATC) 및 적응 비트 할당(APC-AB)의 각각의 기술을 이용하여 고능률 부호화하는 실시예에 대해 제 1 도를 참조하여 설명한다.1 shows a high efficiency encoding of an input digital signal such as an audio PCM signal using respective techniques of band division coding (SBC), adaptive transform coding (ATC), and adaptive bit allocation (APC-AB). It demonstrates with reference to.

제 1 도에 나타낸 구체적인 고능률 부호화 장치에서는, 입력 디지털 신호를 필터등에 의해 복수의 주파수 대역으로 분할하는 동시에, 각각의 주파수 대역마다 직교 변환을 행하고, 얻어진 주파수 축의 스펙트럼 데이터를, 후술하는 인간의 청각 특성을 고려한 소위 임계 대역(크리티컬 밴드) 폭마다 적응적으로 비트 할당하여 부호화하고 있다. 물론 필터 등에 의한 비블록킹의 주파수 분할폭은 등(等)분할 폭으로 해도 좋다. 더욱이, 본 발명의 실시예에 있어서는, 직교 변환 전에 입력 신호에 따라 적응적으로 블록 사이즈(블록 길이)를 변화시키는 동시에, 크리티컬 밴드 단위 또는 고역에서는 임계 대역폭을 더욱 세분화 한 블록으로 플로팅 처리를 행하고 있다. 이 크리티컬 밴드란, 인간의 청각 특성을 고려하여 분할된 주파수 대역이고, 어떤 순음(純音)의 주파수 근방의 같은 강도의 협대역 노이즈에 의해 해당 순음이 마스크될 때의 그 노이즈가 갖는 대역의 것이다. 이 크리티컬 밴드는 고역일수록 대역폭이 넓게 되어 있으며, 상기 0∼20kHz의 전체 주파수 대역은 예를 들면 25 크리티컬 밴드로 분할되어 있다.In the specific high efficiency encoding apparatus shown in FIG. 1, an input digital signal is divided into a plurality of frequency bands by a filter or the like, orthogonal transformation is performed for each frequency band, and the spectral data of the obtained frequency axis is described below. The encoding is performed by adaptively bit allocation for each so-called critical band (critical band) width in consideration of characteristics. Of course, the frequency division width of non-blocking by a filter or the like may be an equal division width. Furthermore, in the embodiment of the present invention, the block size (block length) is adaptively changed in accordance with the input signal before the orthogonal transform, and the floating processing is performed in the critical band unit or the high frequency block to further refine the threshold bandwidth. . This critical band is a frequency band divided in consideration of human auditory characteristics, and is a band of the noise when the pure tone is masked by narrow-band noise of the same intensity near the frequency of a certain pure tone. This critical band has a wider bandwidth at higher frequencies, and the entire frequency band of 0 to 20 kHz is divided into 25 critical bands, for example.

즉, 제 1 도에서, 입력 단자(10)에는 예를 들면 0∼20kHz의 오디오 PCM 신호가 공급되고 있다. 이 입력 신호는, 예를 들면 소위 QMF 필터 등의 대역 분할 필터(11)에 의해, 0∼10kHz 대역과 10k∼20kHz 대역으로 분할되고, 0∼10 kHz 대역의 신호는 마찬가지로 소위 QMF 필터등의 대역 분할 필터(12)에 의해 0∼5 kHz 대역과 5k∼10kHz 대역으로 분할된다. 대역 분할 필터(11)에서의 10k∼20kHz 대역의 신호는, 직교 변환 회로의 일예인 MDCT(Modified Discrete Cosine Transform) 회로(13)에 보내지고, 대역 분할 필터(12)에서의 5k∼10kHz 대역의 신호는 MDCT 회로(14)에 보내지며, 대역 분할 필터(12)에서의 0∼5 kHz 대역의 신호는 MDCT 회로(15)에 보내짐으로써 각각 MDCT 처리된다.That is, in Fig. 1, an audio PCM signal of 0 to 20 kHz is supplied to the input terminal 10, for example. This input signal is divided into a band of 0 to 10 kHz and a band of 10 k to 20 kHz by a band dividing filter 11 such as a QMF filter, for example, and a signal of the band of 0 to 10 kHz is similarly a band of a so-called QMF filter. The split filter 12 divides the signal into 0 to 5 kHz bands and 5 to 10 kHz bands. Signals in the 10 k to 20 kHz band of the band split filter 11 are sent to the Modified Discrete Cosine Transform (MDCT) circuit 13, which is an example of an orthogonal transform circuit, and has a band of 5 k to 10 kHz in the band split filter 12. The signal is sent to the MDCT circuit 14, and the signals of the 0 to 5 kHz band in the band division filter 12 are sent to the MDCT circuit 15 to be MDCT processed, respectively.

여기서, 각각의 MDCT 회로(13, 14, 15)의 블록 사이즈의 구체예를 제 2 도에 나타낸다. 제 2 도의 구체예에서는, 고역측일수록 주파수 대역을 넓힘과 동시에 시간 분해능을 높게(블록 길이를 짧게)하고 있다. 즉, 저역측의 0∼5kHz 대역의 신호 및 중역의 5k∼10kHz 대역의 신호에 대해서는 1 블록 b_L, b_M의 샘플수를 예를 들면 256 샘플로 하고, 고역측의 10k∼20kHz 대역의 신호에 대해서는, b_H를 상기 저역 및 중역측의 블록 b_L, b_M의 각각 1/2 의 길이 BL_L/2, BL_M/2의 길이로 블록화하고였다. 이와 같이 하여 각각의 대역의 직교 변환 블록 샘플수를 같게 하고 있다. 또한, 각각의 대역은, 신호의 시간적 변화가 큰 경우를 상정하여 다시 1/2, 1/4 등의 적응적 블록 분할이 가능하다.Here, the specific example of the block size of each MDCT circuit 13, 14, 15 is shown in FIG. In the specific example of FIG. 2, the higher frequency side makes the frequency band wider and the time resolution higher (shorter block length). That is, for the signals in the low band 0-5 kHz band and the signals in the mid-range 5k-10 kHz band, the number of samples in one block b _L and b _M is 256 samples, for example, and the signals in the high band 10k-20 kHz band are used. Regarding, b _H was blocked by the lengths BL _L / 2 and BL _M / 2 of 1/2 of the blocks b _L and b _M on the low and mid-range sides, respectively. In this way, the number of orthogonal transform block samples in each band is made the same. In addition, each band is capable of adaptive block division, such as 1/2, 1/4, again, assuming a large temporal change in the signal.

또한, 제 1 도에서, 각각의 MDCT 회로(13, 14, 15)에서 MDCT 처리되어 얻어진 주파수 축상의 스펙트럼 데이터 또는 MDCT 계수 데이터는, 소위 임계 대역(크리티컬 밴드)마다 또는 고역에서는 임계 대역폭을 더욱 세분화한 블록마다 합쳐져서 (그룹화되어), 적응비트 할당 부호화 회로(16, 17, 18)에 보내지고 있다.Further, in FIG. 1, spectrum data or MDCT coefficient data on the frequency axis obtained by MDCT processing in each MDCT circuit 13, 14, 15 is further subdivided into a so-called critical band (critical band) or in a high band. Each block is combined (grouped) and sent to the adaptive bit allocation coding circuits 16, 17, and 18.

적응 비트 할당 부호화 회로(16, 17, 18)에 의해 각각의 임계 대역마다 또는 고역에서는 임계 대역폭을 더욱 세분화한 블록마다 할당된 비트수에 따라, 각각의 스펙트럼 데이터(또는, MDCT 계수 데이터)를 재양자화하도록 하고 있다. 이와 같이 하여 부호화된 데이터는 출력 단자(22, 24, 26)를 통해 꺼내진다. 이 때 어떤 신호의 크기에 관한 정규화가 되었는지를 나타낸다. 플로팅 정보와 어떤 비트 길이로 양자화가 되었는지를 나타내는 비트 길이 정보가 동시에 보내진다.Each spectral data (or MDCT coefficient data) is re-coded by the adaptive bit allocation coding circuits 16, 17, and 18 according to the number of bits allocated for each threshold band or for each block in which the threshold bandwidth is further subdivided in the high range. I'm trying to quantize. The data encoded in this manner is taken out through the output terminals 22, 24, and 26. In this case, it indicates which signal has been normalized. Floating information and bit length information indicating which bit length has been quantized are sent simultaneously.

제 3 도는, 적응 비트 할당 부호화 회로(16, 17, 18)의 내부 기능의 구체예를 나타낸 기능 블록도이고, 제 1 도에서의 각각의 MDCT 회로(13, 14, 15)의 출력은, 제 3 도의 적응 비트 할당 기능부(300)의 입력 단자(301)를 통해 대역마다의 에너지 산출 회로(303)에 보내지고, 상기 크리티컬 밴드(임계 대역)마다의 에너지가, 예를 들면 해당 대역 내에서의 각각의 진폭값의 2 승(乘) 평균의 평방근을 계산하는 것 등에 의해 구해진다. 이 각각의 대역마다의 에너지 대신에, 진폭값의 피크값, 평균값 등이 이용되는 것도 있다. 제 4 도는, 상기 에너지 산출회로(303)에서의 출력으로서, 예를 들면 임계 대역(크리티컬 밴드)내의 또는 고역에서는 임계 대역(크리티컬 밴드)폭을 더욱 세분화한 블록 내의 총합 값의 스펙트럼의 예를 나타낸 것이지만, 이 제 4 도에는 도시를 간략화하기 위해, 상기 임계 대역(크리티컬 밴드)의 수 또는 고역에서는 임계 대역(크리티컬 밴드)폭을 더욱 세분화한 블록 대역 수를 12 밴드(B1∼B2)로 표현하고 있다.FIG. 3 is a functional block diagram showing a specific example of internal functions of the adaptive bit allocation coding circuits 16, 17, and 18. The output of each of the MDCT circuits 13, 14, and 15 in FIG. It is sent to the energy calculation circuit 303 for each band through the input terminal 301 of the adaptive bit allocation function unit 300 of 3 degrees, and the energy for each said critical band (critical band) is for example in the said band. It is calculated | required by calculating the square root of the square root mean of each amplitude value of, and the like. Instead of the energy for each of these bands, a peak value, an average value, or the like of an amplitude value may be used. 4 shows an example of the spectrum of the sum value in the block as an output from the energy calculation circuit 303, for example, within a critical band (critical band) or in a more detailed subdivision of the critical band (critical band) width. However, in FIG. 4, for the sake of simplicity, the number of the critical bands (critical bands) or the number of block bands in which the critical bands (critical bands) are further subdivided in a high band is represented by 12 bands (B1 to B2). have.

적응비트 할당 동작을 제 3 도를 참조하여 더욱 상세히 설명한다. 지금 MDCT 계수를 표현하여 전송 또는 기록에 사용하는 총 비트수(비트 레이트)를 100Kbps로 한다. 이를 제 3 도에서는 사용가능 총 비트를 나타내는 블록(302)에서의 총 비트 100 Kbps로 표시하고 있다. 본 실시예에서는, 그 중 60Kbps 를 고정 비트 배분 회로(305)에서의 고정 비트 배분을 위해 이용하고, 나머지 40Kbps를 대역마다의 버크 스펙트럼에 의존한 비트 배분 회로(304)에서의 비트 배분에 이용하는 것으로 한다. 고정 비트 배분을 위한 비트 할당 패턴은 복수 개 준비되어 있고, 신호의 성질에 의해, 여러 가지의 선택을 할 수 있다. 실시예에서는, 100Kbps에 대응하는 짧은 시간의 블록 비트량을, 각각의 주파수에 분포시킨, 여러 가지의 패턴을 갖는다.The adaptive bit allocation operation will be described in more detail with reference to FIG. Now, the total number of bits (bit rate) used for transmission or recording by expressing the MDCT coefficients is 100 Kbps. This is illustrated in Figure 3 as 100 Kbps total bits in block 302 representing the total available bits. In this embodiment, 60 Kbps is used for the fixed bit allocation in the fixed bit distribution circuit 305, and the remaining 40 Kbps is used for the bit allocation in the bit distribution circuit 304 depending on the buck spectrum for each band. do. A plurality of bit allocation patterns for fixed bit allocation are prepared, and various selections can be made depending on the nature of the signal. In the embodiment, there are various patterns in which a short amount of block bits corresponding to 100 Kbps is distributed to each frequency.

특히, 본 실시예에서는, 중저역과 고역의 비트 배분율을 다르게 한 패턴을 복수개 준비하고 있다. 그리고, 신호의 크기가, 작을수록, 고역에의 할당량이 적은 패턴을 선택하도록 한다. 이와 같이 하여 작은 신호 때일수록 고역의 감도가 저하하는 라우드니스 효과를 생기게 한다. 이때의 신호의 크기로는, 전체 대역의 신호의 크기를 사용할 수도 있지만, 그 위에 필터 등이 이용되고 있는, 비 블록킹 주파수 분할 회로의 출력, 또는 MDCT 출력을 이용한다.In particular, in the present embodiment, a plurality of patterns having different bit distribution ratios in the mid-low range and the high range are prepared. And, the smaller the signal size is, the more the pattern of the allocation to the high range is selected. In this way, a small signal time results in a loudness effect in which the sensitivity of the high range is lowered. Although the magnitude | size of the signal of the full band can also be used as a magnitude | size of the signal at this time, the output of a non-blocking frequency division circuit in which a filter etc. are used on it, or MDCT output is used.

그리고 고정 비트 배분과 임계 대역(크리티컬 밴드) 또는 고역에서는 임계 대역(크리티컬 밴드)폭을 더욱 세분화한 블록의 신호 값에 의존한 비트배분 값의 합이 합 산출 회로(306)에서 얻어지고, 출력 단자(307)를 통해 꺼내져서, 양자화 시에 사용된다. 이어서, 신호 스펙트럼이 평탄할 때의 비트 할당의 상태를 제 5 도에 나타내고, 이에 대응하는 양자화 잡음(노이즈 스펙트럼)의 상태를 제 6 도에 나타낸다. 또한, 신호 스펙트럼의 토낼러티(tonality)가 높을 때, 즉 청각적으로 신호의 음정감이 있고 주파수에 편향이 생기고 있을 때의 비트 할당의 상태를 제 7 도에 나타내고, 이에 대응하는 양자화 잡음(노이즈 스펙트럼)의 상태를 제 8 도에 나타낸다. 여기서, 제 5 도, 제 7 도에서, 흰부분은 고정 비트 할당분의 비트량을, 또한 사선 부분은 신호 레벨 의존분의 비트량을 각각 나타내고 있다. 또한, 제 6 도, 제 8 도내에 있어서, 곡선 a는 신호 레벨을, 곡선 b는 고정 비트 할당분에 의한 잡음 레벨을, 사선부 c는 신호 레벨 의존분에 의한 잡음 저하분을 각각 나타내고 있다.In the fixed bit allocation and the critical band (critical band) or high frequency, the sum of the bit allocation values depending on the signal value of the block further subdivided in the critical band (critical band) width is obtained by the sum calculating circuit 306, and the output terminal It is taken out through 307 and used for quantization. Next, FIG. 5 shows a state of bit allocation when the signal spectrum is flat, and FIG. 6 shows a state of quantization noise (noise spectrum) corresponding thereto. In addition, Fig. 7 shows the state of bit allocation when the tonality of the signal spectrum is high, i.e., when the signal is pitched acoustically and the frequency is deflected, and corresponding quantization noise (noise) is shown. The state of the spectrum) is shown in FIG. Here, in Figs. 5 and 7, the white portion represents the bit amount of the fixed bit allocation, and the diagonal portion represents the bit amount of the signal level dependency. 6 and 8, curve a represents a signal level, curve b represents a noise level due to fixed bit allocation, and an oblique section c represents noise reduction due to signal level dependency.

즉, 제 5 도 및 제 6 도는, 신호 스펙트럼이 비율 평탄인 경우를 도시하고 있으며, 고정 비트 할당분에 의한 잡음 레벨은 전체 대역에 걸쳐 어느 정도의 신호 잡음비를 취하는데 도움이 된다. 그러나 저역 및 고역에서는 비교적 작은 비트 할당이 사용되고 있다. 이는 청각적으로 이 대역의 중요도가 작기 때문이다. 동시에 신호 레벨 의존의 비트 배분을 행하는 정도에 의해, 신호의 크기가 큰 대역의 잡음 레벨이 선택적으로 저하될 수 있다. 그러나, 신호의 스펙트럼이 비율 평탄인 경우에는, 이 선택성도 비율이 넓은 대역에 걸쳐 작용하게 된다.That is, FIGS. 5 and 6 show the case where the signal spectrum is ratio flat, and the noise level by the fixed bit allocation helps to take some signal noise ratio over the entire band. However, relatively low bit allocations are used in the low and high frequencies. This is due to the low importance of this band acoustically. At the same time, the degree of bit allocation dependent on the signal level can selectively lower the noise level of the band having a large signal size. However, when the spectrum of the signal is ratio flat, this selectivity ratio acts over a wide band.

이에 대해, 제 7 도, 제 8 도에 나타낸 바와 같이, 신호 스펙트럼이 높은 토낼러티를 나타내는 경우에는, 신호 레벨 의존의 비트 배분을 행하는 정도에 의한 양자화 잡음의 저하는 극히 좁은 대역의 잡음을 저감하기 위해서 사용된다. 이에 의해 고립 스펙트럼 입력 신호에서의 특성의 향상이 달성된다.On the other hand, as shown in Figs. 7 and 8, when the signal spectrum exhibits high tonality, the reduction of the quantization noise due to the degree of bit allocation depending on the signal level can reduce the noise of the extremely narrow band. Used for This achieves an improvement in the characteristics in the isolated spectral input signal.

제 9 도는 이와 같이 하여 고능률 부호화된 신호를, 전송 또는 기록 재생한 후에, 다시 복호화하기 위한 복호 회로를 나타내고 있다. 각각의 대역의 양자화된 상기 MDCT 계수는, 복호 회로 입력 단자(122, 124, 126)에 부여되고, 사용된 블록 사이즈 정보는, 입력 단자(123, 125, 127)에 주어진다. 복호화 회로(116, 117, 118)에서는 적응 비트 할당 정보를 이용하여 비트 할당을 해제한다. 다음에, IMDCT(역 MDCT) 회로(113, 114, 115)에서 주파수 축 상의 신호가 시간축 상의 신호로 변환된다. 이들의 부분 대역의 시간축 상의 신호는, IQMF(역 QMF) 회로(112, 111)에 의해 전체 대역 신호로 복호화되고, 출력 단자(110)에서 꺼내진다.9 shows a decoding circuit for decoding the high efficiency coded signal in this manner and then decoding it again after transmission or recording and reproduction. The quantized MDCT coefficients of the respective bands are provided to the decoding circuit input terminals 122, 124, and 126, and the used block size information is given to the input terminals 123, 125, and 127. The decoding circuits 116, 117, and 118 release bit allocation using the adaptive bit allocation information. Next, in the IMDCT (inverse MDCT) circuits 113, 114, and 115, the signal on the frequency axis is converted into a signal on the time axis. The signals on the time axis of these partial bands are decoded into full band signals by the IQMF (inverse QMF) circuits 112 and 111 and taken out from the output terminal 110.

이상의 설명에서도 명백한 바와 같이, 본 발명에 따른 디지털 데이터의 고능률 부호화 방법에 의하면, 입력 디지털 신호를 복수의 주파수 대역으로 분할한 후, 직교 변환을 행하고, 얻어진 스펙트럼 데이터를 크리티컬 밴드마다 적응적으로 비트를 할당하는 디지털 데이터의 고능률 부호화 방법으로서, 비트 할당에 사용할 수 있는 전체 비트에 대해, 어떠한 시간에 대해서도 미리 정해진 고정 비트 할당 패턴분과, 시간과 주파수에 관련하여 세분화된 소블록 중의 신호의 크기에 의존하는 할당분으로 배분하도록 하고 있다. 이 비트 배분 수법은 청각적으로도 바람직하고, 또한 1 kHz 사인파 입력과 같은 고립 스펙트럼 입력에 대해서도 양호한 특성을, 몇번이나 반복해서 비트량 조정을 하지 않고 오직 1 회의 연산으로 얻는 비트 배분을 실현할 수 있다. 즉, 음악 신호와 같이 스펙트럼이 분산되어 있는 경우에도, 마스킹 효과에 의해 청각적으로 잡음 레벨을 낮출 수 있고, 또한 사인파 입력시에도 신호가 큰 대역에 비트를 모을 수 있기 때문에 신호 대 잡음비를 크게 할 수 있다.As is clear from the above description, according to the high-efficiency encoding method of digital data according to the present invention, after dividing an input digital signal into a plurality of frequency bands, performing orthogonal transformation, and adaptively bitting the obtained spectrum data for each critical band. A high-efficiency encoding method of digital data that allocates a number of bits to a predetermined fixed bit allocation pattern for any time for all bits that can be used for bit allocation, and to a signal size in a small block subdivided with respect to time and frequency. It is distributed among allocating dependencies. This bit allocation method is audibly preferable, and can realize the bit allocation obtained by only one operation without repeatedly adjusting the bit quantity many times with good characteristics for an isolated spectral input such as a 1 kHz sine wave input. . In other words, even when the spectrum is dispersed like a music signal, the masking effect can lower the noise level aurally, and even when the sine wave input collects bits in a large band, the signal-to-noise ratio can be increased. Can be.

또한, 상기 디지털 데이터의 고능률 부호화 방법에 의해, 디지털 데이터를 부호화하고, 전송 또는 기록을 행하도록 한 디지털 데이터의 고능률 부호화에 의해서도, 같은 효과를 얻을 수 있다.The same effect can also be obtained by the high efficiency encoding of the digital data which encodes the digital data and transmits or records the digital data by the high efficiency encoding method.

제 1 도는 본 발명의 일실시예에 따른 부호화 장치의 구성예를 나타낸 블록 회로도.1 is a block circuit diagram showing an example of a configuration of an encoding apparatus according to an embodiment of the present invention.

제 2 도는 그 실시예 장치의 신호의 주파수 및 시분할의 구체예를 나타낸 도면.2 is a diagram showing a specific example of the frequency and time division of a signal of the example apparatus.

제 3 도는 그 실시예 장치에 사용되는 적응 비트 할당 부호화 회로의 비트 배분 알고리즘의 일예를 설명하기 위한 기능 블록도.3 is a functional block diagram for explaining an example of a bit allocation algorithm of an adaptive bit allocation coding circuit used in the embodiment apparatus.

제 4 도는 버크(Berke) 스펙트럼을 나타낸 도면.4 shows a Berke spectrum.

제 5 도는 상기 실시예의 대략 평탄한 스펙트럼의 신호입력시의 비트 배분의 예를 나타낸 도면.5 is a diagram showing an example of bit allocation at the time of signal input of the substantially flat spectrum of the embodiment.

제 6 도는 상기 실시예의 대략 평탄한 스펙트럼의 신호입력시의 양자화 잡음 스펙트럼의 예를 나타낸 도면.6 is a diagram showing an example of a quantization noise spectrum at the time of input of a signal of approximately flat spectrum of the embodiment.

제 7 도는 상기 실시예의 높은 토낼러티를 갖는 신호입력시의 비트 배분의 예를 나타낸 도면.7 is a diagram showing an example of bit allocation at the time of signal input having the high tonality of the embodiment.

제 8 도는 상기 실시예의 높은 토낼러티를 갖는 신호입력시의 양자화 잡음 스펙트럼의 예를 나타낸 도면.8 is a diagram showing an example of a quantization noise spectrum at the time of signal input having the high tonality of the embodiment.

제 9 도는 상기 실시예의 부호화 장치에 대한 복호화 장치의 구성예를 나타낸 블록 회로도.9 is a block circuit diagram showing an example of the configuration of a decoding apparatus for the encoding apparatus of the embodiment.

* 도면의 주요 부분에 대한 부호의 설명** Explanation of symbols for the main parts of the drawings *

10 : 고능률 부호화 회로 입력 단자10: high efficiency coding circuit input terminal

11, 12 : QMF 회로11, 12: QMF circuit

13, 14, 15 : MDCT 회로13, 14, 15: MDCT circuit

16, 17, 18 : 적응 비트 할당 부호화 회로16, 17, 18: adaptive bit allocation coding circuit

19, 20, 21 : 블록 결정 회로19, 20, 21: block determination circuit

22, 24, 26 : 부호화 출력 단자22, 24, 26: coded output terminal

23, 25, 27 : 블록 사이즈 정보 출력 단자23, 25, 27: block size information output terminal

122, 124, 126 : 부호화 입력 단자122, 124, 126: coding input terminal

123, 125, 127 : 블록 사이즈 정보 입력 단자123, 125, 127: block size information input terminal

116, 117, 118 : 적응 비트 할당 복호화 회로116, 117, 118: adaptive bit allocation decoding circuit

113, 114, 115 : IMDCT 회로113, 114, 115: IMDCT circuit

111, 112 : IQMF 회로111, 112: IQMF Circuit

110 : 고능률 복호화 회로 출력 단자110: high efficiency decoding circuit output terminal

300 : 적응 비트 할당 기능부300: adaptive bit allocation function

302 : 사용 가능한 총 비트수를 나타낸 블록302: Block indicating total number of bits available

303 : 대역마다의 에너지 산출 회로303: energy output circuit for each band

304 : 에너지 의존 비트 배분 회로304: energy dependent bit distribution circuit

305 : 고정 비트 배분 회로305: fixed bit distribution circuit

306 : 비트의 합 연산회로306: sum operation circuit of bits

Claims

A digital encoding device for compressing a digital input signal to provide a compressed output signal,

Frequency dividing means for receiving the digital input signal and dividing the digital input signal into a plurality of frequency ranges;

Time division means for time division of at least one of the frequency ranges of the digital input signal into a plurality of blocks;

Orthogonal transform means for orthogonally transforming each block to provide a plurality of spectral coefficients;

Means for grouping the plurality of spectral coefficients from the orthogonal transforming means into threshold bands each having a band energy and a band frequency by frequency;

Bit allocation pattern selecting means for selecting one predetermined bit allocation pattern from the plurality of predetermined bit allocation patterns;

A first bit allocation means for allocating a total number of fixed bits between the threshold bands to assign a plurality of fixed bits to the threshold bands for quantizing each spectral coefficient within each threshold band, each threshold The first bit allocation means, the number of fixed bits allocated to a band determined according to the selected predetermined bit allocation pattern;

A second bit allocation means for allocating a total number of variable bits between the threshold bands for quantizing each spectral coefficient in each threshold band, for allocating a plurality of variable bits to the threshold band, each threshold The second bit allocation means, the number of variable bits allocated to a band determined according to the band energy of the threshold band;

Each using a plurality of bits equal to the sum of the number of fixed bits allocated to quantize each spectral coefficient in the threshold band and the number of variable bits allocated to quantize each spectral coefficient in the threshold band, Means for quantizing each spectral coefficient in a band, wherein the spectral coefficients in at least one of the threshold bands are quantized using more than zero fixed bits and more than zero variable bits, And a quantization means.

The method of claim 1,

And a total number of the fixed bits and a total number of the variable bits are each kept constant over time.

The method of claim 2,

Further comprising energy determining means for determining energy of the digital input signal,

The bit pattern selection means is arranged in threshold bands with band frequencies on higher frequencies, rather than threshold bands with band frequencies on intermediate frequencies when the energy determining means determines that the energy of the digital input signal is low. And selecting a predetermined bit allocation pattern to which fewer bits are allocated.

The method of claim 2,

Energy determining means for determining a band energy of one of the threshold bands,

The bit pattern selecting means, when the energy determining means determines that the band energy in the critical band of one of the critical bands is low, is a band toward higher frequencies than threshold bands with band frequencies toward intermediate frequencies. Selecting one bit allocation pattern of the plurality of predetermined bit allocation patterns to which fewer bits are allocated to the threshold bands having the frequency.

The method of claim 2,

Further comprising energy determining means for deriving a derived signal by frequency dividing one of the frequency ranges of the digital input signal, and for determining the energy of the derived signal,

The bit allocation pattern selecting means is that, when the energy determining means determines that the energy of the derived signal is low, fewer bits are allocated to the critical bands on the higher frequencies than to the critical bands on the intermediate frequencies. And a bit allocation pattern of one of the plurality of predetermined bit allocation patterns.

The method of claim 2,

Each block has a duration of time,

And said time division means for changing the duration of time of blocks in at least one of said frequency ranges in response to said digital input signal.

The method of claim 1,

Each block has a duration of time,

An apparatus for deriving a compressed digital recording signal from a digital input signal for recording on a recording medium, the apparatus comprising:

Means for dividing the digital input signal into a plurality of frequency ranges;

Means for time-dividing each of the frequency ranges of the digital input signal into a plurality of blocks;

Bit allocation pattern selecting means for selecting a predetermined bit allocation pattern from the plurality of predetermined bit allocation patterns;

First bit allocation means for allocating a total number of fixed bits between the threshold bands for quantizing each spectral coefficient within each threshold band, for allocating a plurality of fixed bits in the threshold band, each threshold The first bit allocation means, the number of fixed bits allocated to a band determined according to the selected predetermined bit allocation pattern;

Second bit allocation means for allocating a total number of variable bits between the threshold bands to quantize each spectral coefficient within each threshold band, to assign a plurality of variable bits to the threshold band, The second bit allocation means, wherein the total number of the variable bits allocated to a critical band is determined according to the band energy;

Each using a plurality of bits equal to the sum of the number of fixed bits allocated to quantize each spectral coefficient in the threshold band and the number of variable bits allocated to quantize each spectral coefficient in the threshold band, Means for quantizing each spectral coefficient in a band, the spectral coefficients in at least one of the critical bands being quantized using more than zero fixed bits and more than zero variable bits The quantization means for generating word length data indicative of the sum for each threshold band;

Means for multiplexing the quantized spectral coefficients and the word length data to provide the compressed digital signal;

And recording means for recording the compressed digital signal on the recording medium.

The method of claim 8,

And the recording medium is an optical disk.

The method of claim 8,

And the recording medium comprises a semiconductor memory.

The method of claim 8,

And the total number of fixed bits and the total number of variable bits are each kept constant over time.

The method of claim 11,

Each block has a duration of time,

And means for time-dividing each frequency range comprises means for changing a duration of time of blocks in at least one of the frequency ranges in response to the digital input signal.

A method for deriving compressed digital signal data from a digital input signal, the method comprising:

Dividing the digital input signal into a plurality of frequency ranges;

Time dividing each of the frequency ranges of the uncompressed digital input signal into a plurality of blocks;

Orthogonally transforming each block to provide a plurality of spectral coefficients;

Grouping the plurality of spectral coefficients provided by the orthogonal transformation by frequency into threshold bands each having a band energy and a band frequency;

Providing a total number of variable bits and a total number of fixed bits, wherein the total number of fixed bits is arranged in a plurality of predetermined bit allocation patterns;

Selecting one of the plurality of predetermined bit allocation patterns as the selected predetermined bit allocation pattern;

Allocating the total number of fixed bits between the threshold bands to assign a plurality of fixed bits to the threshold band to quantize each spectral coefficient within each threshold band, the allocation to each threshold band The fixed bit allocation step being determined according to the selected predetermined bit allocation pattern;

Allocating a total number of variable bits between the threshold bands to assign a plurality of variable bits to the threshold band, to quantize each spectral coefficient within each threshold band, The variable bit allocation step, wherein the total number of the variable bits is determined according to the band energy of the threshold band;

Each using a number and bits equal to the sum of the number of fixed bits allocated to quantize each spectral coefficient in the threshold band and the number of variable bits allocated to quantize each spectral coefficient in the threshold band, Quantizing each spectral coefficient in a critical band, wherein the spectral coefficients in at least one of the critical bands are quantized using more than zero fixed bits and more than zero variable bits The quantization step;

Generating word length data indicative of the sum for each threshold band;

Multiplexing the quantized spectral coefficients and the word length data to provide a compressed digital signal.

The method of claim 13,

In the providing of the total number of fixed bits and the total number of variable bits, the total number of provided fixed bits and the total number of provided variable bits are each kept constant over time.

The method of claim 14,

In the time division step, each block has a duration of time,

And wherein said time division comprises changing a duration of time of said blocks within at least one of said frequency ranges in response to said digital input signal.

The method of claim 13,

Determining the energy of the digital input signal;

Selecting a bit allocation pattern of one of the plurality of predetermined bit allocation patterns, when the energy determining step determines that the energy of the digital input signal is low, rather than threshold bands having band frequencies toward intermediate frequencies And a bit allocation pattern of one of a plurality of predetermined bit allocation patterns to which fewer bits are allocated to threshold bands having band frequencies toward higher frequencies is selected.

The method of claim 13,

Determining the band energy of one of the threshold bands;

Selecting a bit allocation pattern of one of the plurality of predetermined bit allocation patterns, when the energy determining step determines that the band energy of one of the critical bands is low, the band frequency toward the intermediate frequencies Digital signal data in which one bit allocation pattern of one of the plurality of predetermined bit allocation patterns to which fewer bits are allocated is selected in the threshold bands having the band frequencies toward higher frequencies, rather than the threshold bands with Derivation method.

The method of claim 13,

Deriving a derived signal by frequency dividing one of the frequency ranges of the digital input signal;

Determining the energy of the derived signal;

In the step of selecting one bit allocation pattern of the plurality of predetermined bit allocation patterns, when the energy determination step determines that the energy of the derived signal is low, rather than threshold bands having band frequencies toward intermediate frequencies, And in the critical bands with band frequencies towards higher frequencies, one bit allocation pattern of the plurality of predetermined bit allocation patterns to which fewer bits are allocated is selected.