KR100346733B1

KR100346733B1 - Audio coding/decoding method and apparatus capable of controlling scale of bit stream

Info

Publication number: KR100346733B1
Application number: KR1019950031353A
Authority: KR
Inventors: 김상욱; 김연배; 서양석
Original assignee: 삼성전자 주식회사
Priority date: 1995-09-22
Filing date: 1995-09-22
Publication date: 2002-11-23
Also published as: KR970019124A

Abstract

PURPOSE: An audio coding/decoding method and an apparatus capable of controlling scale of a bit stream are provided to improve the service quality by forming the bit stream in compatible with various transmitting paths and restoring devices. CONSTITUTION: A coding apparatus is comprised of a coder(21) for forming a bit stream and a bit stream forming unit(22). The bit stream forming unit(22) capable of controlling a scale thereof changes the formed bit stream to a bit stream capable of controlling the scale. A decoding apparatus is comprised of a bit stream decoding unit(23) capable of controlling the scale for restoring the bit stream and capable of controlling the coded scale and a decoder(24) for getting original data from the restored bit stream.

Description

Audio encoding / decoding method and apparatus capable of scaling a bitstream

본 발명은 오디오 부호화/복호화방법 및 장치에 관한 것으로서, 특히 일반적인 오디오 코덱에서 처리되는 데이타를 스케일 조절이 가능한 비트스트림으로 표현하기 위한 비트스트림의 스케일 조절이 가능한 오디오 부호화/복호화방법 및 장치에 관한 것이다.The present invention relates to an audio encoding / decoding method and apparatus, and more particularly, to an audio encoding / decoding method and apparatus capable of scaling a bitstream for representing data processed by a general audio codec as a scalable bitstream. .

자연계에 존재하는 신호들은 일반적으로 연속된 신호형태인 아날로그 신호로 존재한다. 이러한 아날로그 신호들은 재현장치 또는 전송장치에서 사용할 경우 열화가 쉽게 발생하고, 발생가능한 오차신호에 대한 보상이 어려우며, 시스템 구성시 시스템의 크기가 커지는 문제가 있다. 이와 같은 문제들은 아날로그 신호를 샘플링과 양자화 처리를 통해 디지탈 신호로 변형해 줌으로써 개선해 줄 수 있다. 이때 입수되는 정보는 시간의 흐름에 따라 변화하기 때문에 매시간 간격마다 데이터를 입수해 주어야 하고, 이러한 점을 고려하여 임의의 시스템을 구현할 경우 기준이 되는 데이타의 실시간 처리에 적합만 샘플링 간격이 결정된다. 결정된 샘플링 간격에 의해 시간영역의 데이타들이 얻어지는데, 이와 같이 인간이 받아들이는 데이타정보는 시간영역의 값들이지만 인간이 실제로 느끼는 정보는 신호의 전력이다. 신호를 디지탈 방식으로 처리하면, 아날로그 방식에 비해 음질이 향상되는 효과가 있는 반면, 아날로그 방식에 비해 데이타양이 증가하는 문제가 있다.Signals in nature exist as analog signals, usually in the form of continuous signals. When the analog signals are used in the reproducing apparatus or the transmitting apparatus, deterioration occurs easily, compensation for possible error signals is difficult, and the size of the system increases when the system is configured. These problems can be improved by transforming analog signals into digital signals through sampling and quantization processing. At this time, since the information obtained changes with the passage of time, the data must be obtained at every hour interval. In consideration of this, when implementing any system, the sampling interval is determined only for real-time processing of the reference data. Data in the time domain are obtained by the determined sampling interval. The data information accepted by human beings is values in the time domain, but the information actually felt by human beings is the power of the signal. When the signal is processed in the digital method, the sound quality is improved compared to the analog method, but the data amount is increased in comparison with the analog method.

이러한 데이타양의 증가에 대한 영향을 감소시키기 위해 압축을 수행하는데, 압축을 수행하지 않을 경우 영상정보 혹은 오디오정보의 전달에 있어서 채널이 요구하는 대역폭을 제공하지 못하기 때문에 전송이 불가능한 문제가 발생한다. 이에 데이타 압축에 관한 많은 연구들이 진행되고 있다. 일예로 1000 × 1000 라인의 해상도를 갖는 그레이 스케일 사진영상에 있어서, 각 화소가 8 비트로 구성되어 각 화소에 대해 256 계조의 다른 밀도로 표현되는 경우가 있다고 가정한다. 이 경우,압축을 하지 않으면 이 사진영상의 전송에 사용되는 비트수는 8백만 비트에 달하고, 일반적인 전화선의 경우, 1초에 9600비트들을 전송할 수 있기 때문에 이 사진영상을 전송하기 위해서는 10분 이상이 걸린다. 다른 예로, 44. 1 kHz로 샘플링되는 오디오의 경우, 샘플링된 정보가 16비트로 샘플링되어 65536 계조를 갖는다고 할 때, 한개의 채널에 대한 비트수가 약 700 kbits 정도로 일반 전화선 사용시 약 73초가 걸려 실시간 처리가 불가능하다고 할 수 있다.Compression is performed to reduce the influence of the increase in data volume, but if compression is not performed, transmission cannot be performed because the channel does not provide the bandwidth required for the transmission of video information or audio information. . Many studies on data compression have been conducted. For example, in a gray scale photographic image having a resolution of 1000 × 1000 lines, it is assumed that each pixel is composed of 8 bits and is represented with different density of 256 gray levels for each pixel. In this case, without compression, the number of bits used to transmit this photographic image is 8 million bits, and in the case of a general telephone line, 9600 bits can be transmitted per second. Takes As another example, for audio sampled at 44.1 kHz, assuming that the sampled information is sampled with 16 bits and has 65536 gradations, the number of bits for one channel is about 700 kbits, which takes about 73 seconds when using a regular telephone line. Can be said to be impossible.

이러한 이유로 데이타 압축에 관한 연구들이 활발히 진행되어 왔고, 파형 부호화, 변환 부호화, 서브밴드 부호화, 프랙탈 부호화, 예측 부호화 및 움직임보상 부호화 등 여러가지 데이터 압축방식들이 제안되었다. 최근 이러한 데이타 압축방식들은 필립스에 의한 MPEG 오디오 1(ISO/IEC 11172)을 사용한 디지탈 컴팩트 카세트(DCC)와, 소니의 미니 디스크(MD)의 상품화에 사용되었고, HDTV 등을 위한 ISO/IEC 13818, 동영상 전화기 또는 멀티미디어기기에 사용하기 위한 MPEG-IV 등에 관한 연구가 화발히 진행되고 있다.For this reason, studies on data compression have been actively conducted, and various data compression schemes such as waveform coding, transform coding, subband coding, fractal coding, predictive coding, and motion compensation coding have been proposed. Recently, these data compression methods have been used in the commercialization of digital compact cassettes (DCC) using MPEG Audio 1 (ISO / IEC 11172) by Philips, Sony's mini-disc (MD), ISO / IEC 13818 for HDTV, etc. Research on MPEG-IV and the like for use in video telephones or multimedia devices is in progress.

이와 같이 압축된 데이타를 전송할 경우, 전송채널의 특징에 따라 전송에 사용되는 비트율과, 실시간 처리를 위한 비트율이 결정되기 때문에 결정된 비트율에서 최대의 음질 또는 화질을 보장할 수 있는 상태로 고정시켜 놓고 데이타를 전송하였다. 이러한 전송채널의 결정은 전송되는 데이타를 최적으로 복원해내기 위한 장치와 그 전송률을 최대한 효과적으로 사용하도록 부호화해 주는 장치들을 개발하게 되었다.In the case of transmitting the compressed data as described above, the bit rate used for transmission and the bit rate for real-time processing are determined according to the characteristics of the transmission channel, and the data is fixed at a state that can guarantee the maximum sound quality or image quality at the determined bit rate. Sent. The determination of the transport channel has led to the development of devices for optimally reconstructing the data to be transmitted and devices for encoding the most efficient use of the data rate.

그러나, 이러한 경우 전송률이 고정되어 있기 때문에 그 전송률에서 효과적인 특정 재현장치의 설계는 가능하였지만, 여러가지 시스템에서 사용되는 전송률에 맞도록 데이타를 형성해 주는 방법 및 여러가지 종류의 비트스트림을 연속해서 보내주는 방법은 제시되지 않았다. 즉, 임의의 정보가 전송되는 경우 전송되는 데이타의 비트율을 사용자가 임의로 선택하여 복원해 낼 수 없었다. 그리고, 복원하는 시스템에 있어서 일예로 광디스크에 영상과 음성정보가 저장되어 있을때 비트율로 조절하여 재현해 낼 수 없었다. 즉, 영상정보에 할당된 비트율과 음성정보에 할당된 비트율을 조절하면서 질을 조절하는 처리가 불가능했다.However, in this case, since the transmission rate is fixed, it is possible to design a specific reproducing apparatus that is effective at the transmission rate.However, the method of forming the data to match the transmission rate used in various systems and the method of continuously sending various kinds of bitstreams Not presented. That is, when arbitrary information is transmitted, the user cannot arbitrarily select and restore the bit rate of the transmitted data. In the restoring system, for example, when video and audio information are stored on an optical disc, the data cannot be reproduced by adjusting the bit rate. That is, the process of adjusting the quality while adjusting the bit rate assigned to the video information and the bit rate assigned to the audio information was not possible.

이러한 여러가지의 비트율에 적용가능한 비트스트림을 스케일 조절이 가능한 비트스트림이라 한다. 이러한 스케일 조절이 가능한 비트스트림에 대하여 K. Brandenburg은 "First Ideas on Scalable Audio Coding"(AES 97th Conv. Preprint 3924, San Francisco, 1994)에서 다양한 비트율에 대한 처리가 가능하도록 처리예를 보였다. 제1도에 도시된 이 예에서는, 먼저 코아 부분에 대한 비트스트림을 구한 뒤 그때의 오차신호를 계산하고, 그 오차신호들에 대해 발생된 오차를 보상해 주기 위한 비트스트림을 형성한다. 그 결과, 여러가지 비트율에 대한 비트스트림을 표현하기 위해서는 해당 비트스트림이 가지고 있는 비트율의 종류에 따라서 하드웨어의 복잡도 및 연산의 복잡도가 증가하고, 따라서 처리시간이 증가하는 문제점이 있었다. 예로서, 스케일 조절이 4단계로 가능한 비트스트림을 꾸밀 경우 4배의 복잡도를 갖는다.A bitstream applicable to these various bit rates is called a bitstream capable of scaling. For such a scalable bitstream, K. Brandenburg has shown an example of processing to enable various bit rates in "First Ideas on Scalable Audio Coding" (AES 97th Conv. Preprint 3924, San Francisco, 1994). In this example shown in FIG. 1, a bitstream for a core portion is first obtained, then an error signal at that time is calculated, and a bitstream for compensating for the errors generated for the error signals is formed. As a result, in order to express bitstreams for various bit rates, the complexity of hardware and the complexity of operations increase according to the kind of bit rate of the corresponding bit stream, and thus, the processing time increases. For example, if the bitstream can be scaled in four steps, it has four times the complexity.

따라서 본 발명의 목적은 상술한 문제점을 해결하기 위하여 사용자에 의해 비트율을 조절하여 전송되는 데이타의 음질 또는 화질을 바꾸고자 할 경우, 저장되어 있는 데이타 또는 전송되고 있는 데이터를 미리 정해진 종류의 비트율 중 선택된 비트율의 비트스트림으로 형성하기 위한 비트스트림의 스케일 조절이 가능한 오디오 부호화/복호화방법을 제공하는데 있다.Accordingly, an object of the present invention is to select the stored data or the data being transmitted from a predetermined type of bit rate when changing the sound quality or image quality of the transmitted data by adjusting the bit rate by the user in order to solve the above problems. The present invention provides an audio encoding / decoding method capable of scaling a bitstream to form a bitstream of a bit rate.

본 발명의 다른 목적은 상기 비트스트림의 스케일 조절이 가능한 오디오 부호화/복호화방법을 실현하는데 가장 적합한 장치를 제공하는데 있다.Another object of the present invention is to provide an apparatus most suitable for realizing an audio encoding / decoding method capable of scaling the bitstream.

상기 목적을 달성하기 위하여 본 발명에 의한 비트스트림의 스케일 조절이 가능한 오디오 부호화/복호화방법은In order to achieve the above object, an audio encoding / decoding method capable of scaling a bitstream according to the present invention is provided.

입력되는 오디오신호를 압축부호화하여 비트스트림을 형성하기 위한 부호화과정;An encoding process for compressing and encoding the input audio signal to form a bitstream;

상기 부호화과정에 의해 형성된 비트스트림을 스케일 조절이 가능한 비트스트림으로 바꾸는 스케일 조절이 가능한 비트스트림 형성과정;A scaleable bitstream forming step of changing the bitstream formed by the encoding process into a bitstream capable of scaling;

상기 부호화된 비트스트림을 복호화에 적합한 형태로 복원하기 위한 스케일 조절이 가능한 비트스트림 복원과정; 및A bitstream reconstruction process capable of scaling to reconstruct the encoded bitstream into a form suitable for decoding; And

상기 확원된 비트스트림를 복원하여 원래의 데이타를 구하기 위한 복호화과정으로 이루어지는 것을 특징으로 한다.And a decoding process for recovering the original bitstream to obtain the original data.

상기 다른 목적을 달성하기 위하여 본 발명에 의한 비트스트림의 스케일 조절이 가능한 오디오 부호화/복호화장치는In order to achieve the above another object, an audio encoding / decoding apparatus capable of scaling a bitstream according to the present invention is provided.

오디오 부호화장치는Audio encoder

입력되는 오디오신호를 압축부호화하여 비트스트림을 형성하기 위한 부호화기; 및An encoder for compressing and encoding the input audio signal to form a bitstream; And

상기 부호화기에 의해 형성된 비트스트림을 스케일 조절이 가능한 비트스트림으로 바꾸는 스케일 조절이 가능한 비트스트림 형성부를 포함하고,A scaleable bitstream forming unit for converting the bitstream formed by the encoder into a scaleable bitstream,

오디오 복호화장치는Audio decoder

상기 부호화된 비트스트림을 복호화에 적합한 형태로 복원하기 위한 스케일 조절이 가능한 비트스트림 복원부; 및A bitstream reconstruction unit capable of scaling to reconstruct the encoded bitstream into a form suitable for decoding; And

상기 복원된 비트스트림를 복원하여 원래의 데이타를 구하기 위한 복호화기를 포함하는 것을 특징으로 한다.And a decoder for reconstructing the recovered bitstream to obtain original data.

이하 첨부된 도면을 참조하여 본 발명에 대하여 상세히 설명하기로 한다.Hereinafter, the present invention will be described in detail with reference to the accompanying drawings.

제2도는 본 발명에 의한 비트스트림의 스케일 조절이 가능한 오디오 부호화/복호화장치를 나타낸 블럭도로서, 부호화부분은 비트스트림을 형성하는 부호화기(21), 형성된 비트스트림을 스케일 조절이 가능한 비트스트림으로 바꾸는 스케일 조절이 가능한 비트스트림 형성부(22)로 구성되고, 스케일 조절이 가능한 비트스트림 형성부(22)는 비트율 조절을 위해 비트율에 따른 비트스트림의 길이를 정해 주기 위한 제어신호를 입력하는 부분(미도시), 제어신호에 따라 비트스트림의 길이를 조절하기 위한 부분(미도시), 전체 프레임의 길이를 조절하기 위한 부분(미도시)으로 구성된다. 한편, 복호화부분은 부호화된 스케일 조절이 가능한 비트스트림을 복원하기 위한 스케일 조절이 가능한 비트스트림 복원부(23)와 복원된 비트스트림으로 부터 원래의 데이타를 구하기 위한 복호화기(24)로 구성된다.2 is a block diagram illustrating an audio encoding / decoding apparatus capable of scaling a bitstream according to an embodiment of the present invention, in which an encoding part includes an encoder 21 for forming a bitstream and a bitstream for scaling a bitstream. Composed of a bitstream forming unit 22 capable of scaling, the bitstream forming unit 22 capable of adjusting the scale inputs a control signal for determining the length of the bitstream according to the bit rate for bit rate adjustment (not shown). At the same time, a portion (not shown) for adjusting the length of the bitstream according to the control signal and a portion (not shown) for adjusting the length of the entire frame are provided. On the other hand, the decoding part is composed of a scalable bitstream recovery unit 23 for recovering the coded bit-adjustable bitstream and a decoder 24 for obtaining the original data from the recovered bitstream.

제2도의 구성에 의거하여 각 부분의 동작을 살펴보면 다음과 같다.The operation of each part based on the configuration of FIG. 2 is as follows.

제2도에 있어서, 부호화기(21)와 복호화기(24)는 데이터를 압축 혹은 복원하기 위한 것으로서, 채널의 전송능력 또는 저장능력을 최대한 활용하기 위한 것이다. 이 부분은 기존의 여러 압축방식을 사용하는 것이 가능하나 본 발명에서는 설명의 편의상 ISO/IEC 11172 오디오를 예로 들기로 한다.In FIG. 2, the encoder 21 and the decoder 24 are for compressing or reconstructing data, and make the best use of the channel transmission capacity or storage capacity. This part can use a variety of existing compression schemes, but in the present invention, for convenience of description, ISO / IEC 11172 audio will be taken as an example.

ISO/IEC 11172 오디오 부호화기(21)에 의해 부호화된 비트스트림은 제3토에 도시된 바와 같다. 제4도는 이러한 비트스트림을 복호화하는 일예를 나타낸 것으로서, 비트스트림 가운데 많은 부분이 항상 미리 정해진 형태에 의해 같은 크기의 비트가 정보의 표현에 사용된다는 것을 알 수 있다. 여기서 중요한 값들은 신호에 따라 변화하는 중요도에 의해 비트들을 할당해 준 값들을 저장하고 있는 allocation[ch][sb]의 값으로, 이 값에 의해 샘플들에 비트들을 할당해 준다.The bitstream encoded by the ISO / IEC 11172 audio encoder 21 is as shown in the third soil. 4 shows an example of decoding such a bitstream, and it can be seen that many parts of the bitstream are always used in the representation of information with the same size by a predetermined form. The important values here are allocation [ch] [sb], which stores the values assigned to the bits by their varying importance according to the signal, and assigns bits to the samples by this value.

이러한 정보를 이용하여 결정된 스케일 조절이 가능한 비트스트림을 구성하는 여러가지의 비트율, 즉 제1비트율, 제2비트율,.., 제n비트율이 있을 경우, 부호화된 비트스트림을 가지고 스케일 조절이 가능한 비트스트림 형성부(22)에서 제1비트율에서 부터 제n비트율을 모두 포함하는 스케일 조절이 가능한 비트스트림을 형성할 수 있다. 이 스케일 조절이 가능한 비트스트림 형성방법은 다음의 일련의 단계에 의해 수행된다.If there are various bit rates constituting the scalable bitstream determined using this information, that is, the first bit rate, the second bit rate, ..., the n-th bit rate, the bit stream is scalable with the encoded bit stream In the forming unit 22, a bitstream capable of adjusting a scale including all of the nth bit rate from the first bit rate may be formed. The scaleable bitstream forming method is performed by the following series of steps.

1. 사이드 정보(sideinfo)에 대하여 비트를 할당한다.1. Allocate bits for sideinfo.

2. 사이드 정보로 부터 각 처리대역별로 할당된 비트수 BB[ch][sb]=allocation[ch][sb]를 읽어 온다.2. From the side information, the number of bits BB [ch] [sb] = allocation [ch] [sb] allocated for each processing band is read.

3. 각 BB[ch][sb]에 따라 각 처리대역별로 샘플수 sample[ch][sb][s]을 읽어온다. 이 단계는 다음 a 내지 i 단계에 의해 수행된다.3. Read the number of samples sample [ch] [sb] [s] for each processing band according to each BB [ch] [sb]. This step is performed by the following steps a to i.

a. CC[.]=0, M=1로 초기화한다.a. Initialize to CC [.] = 0 and M = 1.

b. 저주파수대역에서 부터 고주파대역의 모든 BB[ch][sb]에 대하여 최대 BB[ch][sb]를 찾는다.b. Find the maximum BB [ch] [sb] for all BB [ch] [sb] in the high frequency band from the low frequency band.

c. 최대 BB[.]값을 갖는 대역의 샘플들에게 할당되는 비트수를 다음과 같이 조정한다.c. The number of bits allocated to the samples of the band having the maximum BB [.] Value is adjusted as follows.

CC[ch][sb] = CC[ch][sb] + 1CC [ch] [sb] = CC [ch] [sb] + 1

d. BB[ch][sb] 값을 다음과 같이 조정한다.d. Adjust the BB [ch] [sb] value as follows:

BB[ch][sb] = BB[ch][sb] = 1BB [ch] [sb] = BB [ch] [sb] = 1

e. 사용되는 비트수(Bits_used)를 다음과 같이 추정한다.e. The number of bits _used (Bits _used ) is estimated as follows.

여기서, SPL[.]은 각 처리대역별로 고정된 샘플의 수를 의미한다.Here, SPL [.] Means a fixed number of samples for each processing band.

f. Bits_used값이 비트율 M의 값을 넘는지를 비교하여 넘지 않을 경우 b 단계로 복귀한다.f. If the Bits _used value does not exceed the value of the bit rate M, the process returns to step b.

g. CC[ch][sb] 값을 이용하여 샘플수를 읽어준다. 이때 각 처리대역의 샘플들을 최상위비트(MSB)부터 CC[ch][sb] 만큼씩 읽어준다. 이후, 각 sample[ch][sb][s] 값을 CC[ch][sb] 만큼씩 좌로 이동시킨다.g. Read the number of samples using CC [ch] [sb]. At this time, samples of each processing band are read by the most significant bit (MSB) by CC [ch] [sb]. Thereafter, each sample [ch] [sb] [s] value is moved left by CC [ch] [sb].

h. CC[.]=0로 두고, 만약 M이 최종비트율이 아니면 b 단계로 복귀한다.h. Set CC [.] = 0 and return to step b if M is not the last bit rate.

i. 종료i. End

이와 같이 비트할당이 행해짐으로써 제5A도에서와 같이 각 대역이 비트들이 할당된 경우는 제5B도에서와 같이 표현된다. 처리결과에 의해 얻어진 비트스트림은 제6도에 도시된 바와 같다.In this way, when bit allocation is performed, as shown in FIG. 5A, bits are allocated to each band as in FIG. 5A. The bitstream obtained by the processing result is as shown in FIG.

여기서, 비트스트림을 형성하는 비트수가 정해지고, 각 비트율에 따른 비트정보는 원래 데이타의 의미있는 최고값(MSB)으로 부터 k번째 비트까지가 되기 때문에 최종 비트율이 N인 경우 원래의 부호화된 비트스트림을 스케일 조절이 가능한 비트스트림으로 바꾸면서 총 비트수는 변화하지 않는다. 그러나, 비트율 k에 대한 비트수를 넘기 전까지는 비트들이 많이 할당된 대역과 저주파수대역에서부타 각 대역의 샘플들의 수에 따라 비트들이 할당되다가 경우에 따라서 한 대역에 1 비트씩 더 할당되면서 경계치를 넘는 경우가 발생한다. 예로서,Here, since the number of bits forming the bitstream is determined, and the bit information according to each bit rate is from the meaningful maximum value (MSB) of the original data to the kth bit, the original coded bitstream when the final bit rate is N. Is changed to a scalable bitstream while the total number of bits does not change. However, until the number of bits for the bit rate k is exceeded, the bits are allocated according to the number of samples of each band in the band where the bits are allocated a lot and the low frequency band. The case occurs. For example,

k 단계; 대역을 찾아 비트를 할당한다. 이때 Bits_used를 m이라 한다.k step; Find the band and allocate bits. Bits _used is m.

K+1 단계 : 대역을 찾아 비트를 할당한다. 이때 Bits_used를 m'이라 하고, m'은 Bits(L)보다 크다. 여기서 Bits(L)은 비트율 L에서의 비트수이다 .K + 1 step: find the band and allocate bits. Bits _used is called m ', and m' is larger than Bits (L). Where Bits (L) is the number of bits at the bit rate L.

이때, 비트율이 L인 경우가 읽어들이는 비트수는 Bits(L)이므로 (Bits(L)-m)인 k+1 단계에서 증가한 비트들은 비트율이 L인 경우 의미없는 데이타로 인정하여 복호화시 처리를 하지 많고, L보다 높은 비트율의 경우에서는 유호한 데이타로 사용하게 된다. 제7도는 비트들이 경계에 위치할 경우에 대한 처리방식을 나타낸 것이다.In this case, since the number of bits read when the bit rate is L is Bits (L), the bits increased in the step k + 1 of (Bits (L) -m) are regarded as meaningless data when the bit rate is L, and processed during decoding. In many cases, a bit rate higher than L is used as a valid data. 7 shows a processing method for the case where the bits are located at the boundary.

한편, 스케일 조절이 가능한 비트스트림 복원부(23)에서 스케일 조절이 가능한 비트스트림으로 부터 복호화기(24)가 사용하는 비트스트림을 형성하는 방법은 다음 일련의 단계에 의해 수행된다.Meanwhile, the method of forming the bitstream used by the decoder 24 from the scalable bitstream in the scalable bitstream recovery unit 23 is performed by the following series of steps.

a. CC[.]=0, M=1로 초기화한다.a. Initialize to CC [.] = 0 and M = 1.

CC[ch][sb] = CC[ch][sb] + 1CC [ch] [sb] = CC [ch] [sb] + 1

BB[ch][sb] = BB[ch][sb] + 1BB [ch] [sb] = BB [ch] [sb] + 1

g. CC[ch][sb] 값을 이용하여 샘플수를 읽어준다.g. Read the number of samples using CC [ch] [sb].

h. CC[ch][sb] 값과 allocation[ch][sb] 를 이용하여 신호의 크기를 조절한다.h. The signal is adjusted using the CC [ch] [sb] value and allocation [ch] [sb].

i. 복호화기 입력 후 종료i. Decryptor input and exit

여기서, h 단계에서 하는 역할은 전체 할당된 비트들 가운데 일부분만을 임의의 복호화처리에서 사용하는 경우, 발생하는 오차를 최소화하기 위한 것이다. 예로, 임의의 대역에 각 샘플당 K개씩 비트들이 할당되었을때 가장 큰 비트율에서 그 대역에 각 샘플당 K+4개씩의 비트들이 할당된 경우 발생하는 오차신호를 최소화하기 위해 서는 MSB에서부터 K개의 비트가 같아야 하고, 이는 CC[ch][sb] 값을 이용한 데이타들을 가지고 왼쪽으로 allocation[ch][sb]-CC[ch][sb] 만큼 이동시킴으로써 크기를 맞추어 줄 수 있다.Here, the role played in step h is to minimize an error that occurs when only a part of all allocated bits are used in any decoding process. For example, if K bits are allocated to each sample in a band, KB to K bits are used to minimize the error signal generated when K + 4 bits are allocated to each band in that band at the highest bit rate. Must be the same, and this can be scaled by moving allocation [ch] [sb] -CC [ch] [sb] to the left with data using CC [ch] [sb] values.

상술한 바와 같이 본 발명에 의한 비트스트림의 스케일 조절이 가능한 오디오 부호화/복호화방법 및 장치에서는 여러가지 전송선로 및 다양한 복원장치에 호환성을 갖도록 하기 위한 비트스트림을 형성함으로써 사용자의 욕구에 따라 재공되는 서비스의 질이 향상되고, 지불되는 비용에 따라 서비스의 차별화를 이룰수 있고 전송 및 저장에 있어서 효과적인 채널의 이음 및 저장되는 비트수를 조절할 수 있는 등의 효과를 얻을 수 있다. 또한, 사용되는 비트율이 다른 현존하는 여러 종류의 다양한 기기들을 연동시켜 사용가능하게 함으로써 사용자에게 다양한 서비스를 제공할 수 있다.As described above, in the audio encoding / decoding method and apparatus capable of adjusting the scale of a bitstream according to the present invention, by forming a bitstream to be compatible with various transmission lines and various reconstruction devices, a service provided according to a user's desire can be obtained. The quality can be improved, and the service can be differentiated according to the cost paid, and the effective channel transmission and storage number of bits can be controlled in transmission and storage. In addition, it is possible to provide a variety of services to the user by enabling the use of a variety of existing devices of different types of different bit rates used.

제1도는 비트스트림의 스케일 조절이 가능한 종래의 오디오 부호화기를 나타낸 블럭도.1 is a block diagram showing a conventional audio encoder capable of scaling a bitstream.

제2도는 본 발명에 의한 비트스트림의 스케일 조절이 가능한 오디오 부호화/복호화장치를 나타낸 블럭도.2 is a block diagram showing an audio encoding / decoding apparatus capable of scaling a bitstream according to the present invention.

제3도는 ISO/IEC 11172의 오디오 비트스트림을 나타낸 도면.3 illustrates an audio bitstream of ISO / IEC 11172.

제4도는 제3도에 도시된 비트스트림을 복호화하는 일예를 나타낸 도면.4 is a diagram illustrating an example of decoding a bitstream shown in FIG. 3;

제5도는 비트수 할당의 일예.5 is an example of bit number allocation.

제6도는 본 발명에 의해 스케일 조절이 가능한 비트스트림의 일예.6 is an example of a bitstream capable of scaling according to the present invention.

제7도는 비트들이 경계에 위치하는 경우 스케일 조절이 가능하도록 비트스트림을 표현하는 일예.7 is an example of representing a bitstream to enable scaling if bits are located at a boundary.

Claims

(a) encoding the received audio signal and converting the received audio signal into a bitstream;

(b) classifying the audio data of the coded bitstream according to a predetermined importance level and then reconstructing the audio data according to the order of importance to convert the coded bitstream into a scalable bitstream;

(c) restoring the scaleable bitstream into a bitstream suitable for decoding according to predetermined selected scale information; And

and (d) decoding the reconstructed bitstream.

The method of claim 1, wherein step (b)

A first step of allocating bits for sideinfo;

A second step of reading the number of bits BB [ch] [sb] = allocation [ch] [sb] allocated to each processing band from the side information; And

And a third step of reading the number of samples sample [ch] [sb] [s] for each processing band according to each BB [ch] [sb]. Way.

The method of claim 2, wherein the third step

Initializing CC [.] = 0 and M = 1;

Finding a maximum BB [ch] [sb] for all BB [ch] [sb] in the high frequency band from the low frequency band;

Increasing the number of bits allocated to samples of the band having the maximum BB [.] By one;

Increasing the BB [ch] [sb] value by 1;

Estimating the number of bits used from the side information, CC [ch] [sb], and a fixed number of samples for each processing band;

Returning to the step of finding the maximum BB [ch] [sb] if the used number of bits does not exceed the value of bit rate M;

Reading samples of each of the processing bands by the most significant bit (MSB) by CC [ch] [sb]; And

If the CC [.] = 0 is set and M is not the final bit rate, the method returns to the step of finding the maximum BB [ch] [sb]. .

The method of claim 1, wherein step (c)

A first step of allocating bits for sideinfo;

The method of claim 4, wherein the third step

Initializing CC [.] =: 0, M = 1;

Increasing the BB [ch] [sb] value by 1;

Returning to the step of finding the maximum BB [ch] [sb] if the number of used bits exceeds the value of the bit rate M;

And adjusting the magnitude of the signal by using the CC [ch] [sb] value and allocation [ch] [sb].

An encoder for encoding an input audio signal and converting the received audio signal into a bitstream; And

An audio encoding including a bitstream forming unit which classifies the audio data of the encoded bitstream according to a predetermined importance and then reconstructs the audio data according to an important order and converts the encoded bitstream into a scalable bitstream Device and

A bitstream recovery unit for restoring the scaleable bitstream into a bitstream suitable for decoding according to predetermined selected scale information; And

And an audio decoding device including a decoder for decoding the reconstructed bitstream.

The apparatus of claim 6, wherein the bitstream forming unit comprises: a control signal input unit configured to input a control signal for determining a length of the bitstream according to the bit rate for bit rate adjustment;

A first adjusting unit for adjusting the length of the bitstream according to the control signal; And

An audio encoding / decoding apparatus capable of adjusting a scale of a bitstream, characterized in that it comprises a second adjusting unit for adjusting the length of an entire frame.

The apparatus of claim 6, wherein the bitstream recovery unit comprises: a control signal input unit configured to input a control signal for determining a length of a bitstream according to a bit rate for bit rate adjustment;

An audio encoding / decoding apparatus capable of adjusting the scale of a bitstream, characterized in that it comprises a second adjusting unit for adjusting the length of the entire frame.