KR101434834B1

KR101434834B1 - Method and apparatus for encoding/decoding multi channel audio signal

Info

Publication number: KR101434834B1
Application number: KR1020070088315A
Authority: KR
Inventors: 김중회; 오은미
Original assignee: 삼성전자주식회사
Priority date: 2006-10-18
Filing date: 2007-08-31
Publication date: 2014-09-02
Also published as: KR20080035448A

Abstract

The present invention relates to a method of decoding a multi-channel audio signal. The method detects the type of spatial extension data included in the result of encoding an audio signal; when the spatial extension data is data indicating a core audio object type, that is, the scheme by which the core audio data was encoded, detects the core audio object type; decodes the core audio data with the decoding scheme corresponding to the detected core audio object type; when the spatial extension data is residual coding data, decodes the residual coding data with the decoding scheme corresponding to the core audio object type; and upmixes the decoded core audio data using the decoded residual coding data. Because the core audio data and the residual coding data are decoded with the same decoding scheme, the complexity of the decoder can be reduced.

Description

[0001] Method and apparatus for encoding/decoding a multi-channel audio signal

The present invention relates to a method and apparatus for encoding/decoding a multi-channel audio signal, and more particularly, to a method and apparatus for encoding/decoding a residual signal used for upmixing an audio signal.

MPEG (Moving Picture Experts Group) Surround is a technology that compresses audio data for spatial sound sources in the encoding of an audio signal. It converts an audio signal compressed with MP3 (MPEG Audio Layer-3), MPEG-4 AAC (Advanced Audio Coding), or MPEG-4 HE-AAC (High Efficiency AAC) into high-quality multi-channel surround audio. MPEG Surround maintains backward compatibility with existing stereo equipment and can reduce the bitrate, that is, the transmission rate, required for high-quality multi-channel audio compression while existing equipment remains in use.

According to the MPEG Surround standard, the core audio signal is encoded using any one of several encoding schemes such as BSAC (Bit Sliced Arithmetic Coding), AAC, and MP3 (MPEG Audio Layer-3), whereas the residual signal is encoded only with AAC.

Therefore, when the core audio signal is encoded according to the MPEG Surround standard with an encoding scheme other than AAC, the encoder must encode the core audio signal and the residual signal with different encoding schemes. Likewise, the decoder must decode the core audio signal and the residual signal with different decoding schemes.

The problem to be solved by the present invention is to provide a method and apparatus for decoding a multi-channel audio signal that can reduce the complexity of the decoder when a residual signal is decoded.

Another problem to be solved by the present invention is to provide a method and apparatus for encoding a multi-channel audio signal that can reduce the complexity of the encoder when a residual signal is encoded.

To solve the above problem, a method of decoding a multi-channel audio signal according to the present invention includes: detecting the type of spatial extension data included in the result of encoding an audio signal; when the spatial extension data is data indicating a core audio object type, that is, the scheme by which the core audio data was encoded, detecting the core audio object type; decoding the core audio data with the decoding scheme corresponding to the detected core audio object type; when the spatial extension data is residual coding data, decoding the residual coding data with the decoding scheme corresponding to the core audio object type; and upmixing the decoded core audio data using the decoded residual coding data.

The above problem is also solved by a computer-readable recording medium on which is recorded a program for executing a method of decoding a multi-channel audio signal, the method including: detecting the type of spatial extension data included in the result of encoding an audio signal; when the spatial extension data is data indicating a core audio object type, that is, the scheme by which the core audio data was encoded, detecting the core audio object type; decoding the core audio data with the decoding scheme corresponding to the detected core audio object type; when the spatial extension data is residual coding data, decoding the residual coding data with the decoding scheme corresponding to the core audio object type; and upmixing the decoded core audio data using the decoded residual coding data.

In addition, to solve the other problem above, an apparatus for decoding a multi-channel audio signal according to the present invention includes: a spatial extension data type detecting unit that detects the type of spatial extension data included in the result of encoding an audio signal; a core audio object type detecting unit that, when the spatial extension data is data indicating a core audio object type, that is, the scheme by which the core audio data was encoded, detects the core audio object type; a core audio data decoding unit that decodes the core audio data with the decoding scheme corresponding to the detected core audio object type; a residual coding data decoding unit that, when the spatial extension data is residual coding data, decodes the residual coding data with the decoding scheme corresponding to the core audio object type; and an upmixing unit that upmixes the decoded core audio data using the decoded residual coding data.

In addition, to solve the further problem above, a method of encoding a multi-channel audio signal according to the present invention includes: downmixing an input audio signal to generate core audio data and residual data; encoding the core audio data according to a predetermined encoding scheme; encoding the residual data according to the same predetermined encoding scheme, which corresponds to the core audio object type, that is, the scheme by which the core audio data is encoded; and outputting the encoded core audio data and the encoded residual data as the result of encoding the audio signal.

The further problem above is also solved by a computer-readable recording medium on which is recorded a program for executing a method of encoding a multi-channel audio signal, the method including: downmixing an input audio signal to generate core audio data and residual data; encoding the core audio data according to a predetermined encoding scheme; encoding the residual data according to the same predetermined encoding scheme, which corresponds to the core audio object type, that is, the scheme by which the core audio data is encoded; and outputting the encoded core audio data and the encoded residual data as the result of encoding the audio signal.

In addition, to solve the further problem above, an apparatus for encoding a multi-channel audio signal according to the present invention includes: a downmixing unit that downmixes an input audio signal to generate core audio data and residual data; a core audio data encoding unit that encodes the core audio data according to a predetermined encoding scheme; a residual data encoding unit that encodes the residual data according to the same predetermined encoding scheme, which corresponds to the core audio object type, that is, the scheme by which the core audio data is encoded; and a multiplexing unit that outputs the encoded core audio data and the encoded residual data as the result of encoding the audio signal.

In addition, to solve the further problem above, a method of decoding a multi-channel audio signal according to the present invention includes: receiving a bitstream corresponding to a downmixed audio core signal and a bitstream containing side information for multi-channel generation; detecting a core object type from the bitstream corresponding to the downmixed audio core signal; decoding the downmixed audio core signal with the decoding scheme determined by the detected core object type; when spatial extension data included in the side information for multi-channel generation is residual coding data, decoding the residual coding data with the decoding scheme corresponding to the core audio object type; and upmixing the decoded core audio data using the decoded residual coding data.

According to the present invention, the type of spatial extension data included in the result of encoding an audio signal is detected; when the spatial extension data is data indicating a core audio object type, that is, the scheme by which the core audio data was encoded, the core audio object type is detected; the core audio data is decoded with the decoding scheme corresponding to the detected core audio object type; when the spatial extension data is residual coding data, the residual coding data is decoded with the decoding scheme corresponding to the core audio object type; and the decoded core audio data is upmixed using the decoded residual coding data. Because the core audio data and the residual coding data are decoded with the same decoding scheme, the complexity of the decoder can be reduced.

Also according to the present invention, an input audio signal is downmixed to generate core audio data and residual data; the core audio data is encoded according to a predetermined encoding scheme; the residual data is encoded according to the same predetermined encoding scheme, which corresponds to the core audio object type, that is, the scheme by which the core audio data is encoded; and the encoded core audio data and the encoded residual data are output as the result of encoding the audio signal. Because the core audio data and the residual data are encoded with the same encoding scheme, the complexity of the encoder can be reduced.
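The encoder-side idea can be illustrated with a minimal Python sketch. It assumes a toy sum/difference downmix of two channels and a placeholder `toy_encode` function; neither is the MPEG Surround downmix nor a real BSAC/AAC encoder, and the names `encode_frame` and `ENCODERS` are hypothetical. The only point shown is that one encoder, selected by the core audio object type, is applied to both the core audio data and the residual data.

```python
from typing import Callable, Dict, List

Encoder = Callable[[List[int]], bytes]


def toy_encode(samples: List[int]) -> bytes:
    """Placeholder for a real BSAC/AAC/MP3 encoder (assumption).

    Only accepts sample values in the range 0..255.
    """
    return bytes(samples)


# Hypothetical registry: core audio object type -> encoder.
# 22 is the value the description quotes for BSAC.
ENCODERS: Dict[int, Encoder] = {22: toy_encode}


def encode_frame(left: List[int], right: List[int], core_aot: int,
                 encoders: Dict[int, Encoder] = ENCODERS) -> dict:
    """Downmix, then encode core and residual with the SAME encoder."""
    encode = encoders[core_aot]
    # Toy downmix (assumption): core = mid signal, residual = side signal.
    core = [(l + r) // 2 for l, r in zip(left, right)]
    residual = [(l - r) // 2 for l, r in zip(left, right)]
    # "Multiplexing" is modelled as returning both payloads together.
    return {"core_aot": core_aot,
            "core": encode(core),
            "residual": encode(residual)}
```

A call such as `encode_frame([4, 6], [2, 2], 22)` runs both signals through the single encoder registered for type 22, so no second, AAC-only residual encoder is needed.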

With respect to the embodiments of the present invention disclosed in this text, specific structural and functional descriptions are given only for the purpose of describing those embodiments. The embodiments of the present invention may be practiced in various forms and should not be construed as limited to the embodiments described in this text.

The present invention can be modified in various ways and can take various forms; specific embodiments are illustrated in the drawings and described in detail in the text. This is not intended to limit the present invention to the particular forms disclosed, and the invention should be understood to include all modifications, equivalents, and alternatives falling within its spirit and technical scope. Similar reference numerals are used for similar components in describing each drawing.

Unless defined otherwise, all terms used herein, including technical or scientific terms, have the same meaning as commonly understood by a person of ordinary skill in the art to which the present invention belongs. Terms such as those defined in commonly used dictionaries should be interpreted as having a meaning consistent with their meaning in the context of the related art, and are not to be interpreted in an idealized or overly formal sense unless expressly so defined in this application.

Hereinafter, preferred embodiments of the present invention are described in more detail with reference to the accompanying drawings. The same reference numerals are used for the same components in the drawings, and redundant descriptions of the same components are omitted.

FIG. 1 is a block diagram illustrating an apparatus for decoding a multi-channel audio signal according to an embodiment of the present invention.

Referring to FIG. 1, the apparatus for decoding a multi-channel audio signal includes a demultiplexing unit 100, a spatial extension data type detecting unit 110, a core audio object type detecting unit 120, a core audio data decoding unit 130, a residual coding data decoding unit 140, an arbitrary downmix residual coding data decoding unit 150, and an upmixing unit 160.

The demultiplexing unit 100 receives a bitstream from the encoder through an input terminal IN and demultiplexes it.

FIG. 2 illustrates syntax for detecting the spatial extension data type according to an embodiment of the present invention. FIG. 3 illustrates an embodiment of a table in which values corresponding to "bsSacExtType" shown in FIG. 2 are assigned. The operation of the spatial extension data type detecting unit 110 is described below with reference to FIGS. 1 to 3.

The spatial extension data type detecting unit 110 detects the type of spatial extension data in the header of the data demultiplexed by the demultiplexing unit 100. More specifically, the spatial extension data type detecting unit 110 can detect the type of the spatial extension data in the header of the demultiplexed data by means of the function SpatialExtensionConfig() shown in FIG. 2. In the function SpatialExtensionConfig(), "bsSacExtType" indicates the type of the spatial extension data.

Referring to FIG. 3, in an embodiment of the present invention, if "bsSacExtType" is '0', the spatial extension data is residual coding data; if "bsSacExtType" is '1', the spatial extension data is arbitrary downmix residual coding data; and if "bsSacExtType" is '12', the spatial extension data is the core audio object type of MPEG-4 audio. Here, the core audio object type is the audio object type with which the encoder encodes the downmixed signal. This is only one embodiment of the present invention, and a person of ordinary skill in the art to which this embodiment belongs will understand that various modifications are possible.

In other words, the spatial extension data type detecting unit 110 determines that the type of the spatial extension data is residual coding data when '0' is assigned to "bsSacExtType", that it is arbitrary downmix residual coding data when '1' is assigned to "bsSacExtType", and that it is data indicating the core audio object type of MPEG-4 audio when '12' is assigned to "bsSacExtType".
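The mapping just described can be written as a small lookup. The following Python sketch uses only the three values named in the text for the embodiment of FIG. 3; the names `SacExtKind` and `classify_sac_ext_type` are hypothetical, and any other value is simply reported as "other" because the text does not enumerate further types.

```python
from enum import Enum


class SacExtKind(Enum):
    RESIDUAL = "residual coding data"
    ARBITRARY_DOWNMIX_RESIDUAL = "arbitrary downmix residual coding data"
    CORE_AUDIO_OBJECT_TYPE = "core audio object type"
    OTHER = "other spatial extension data"


# bsSacExtType values as given in the description of FIG. 3.
_SAC_EXT_TYPES = {
    0: SacExtKind.RESIDUAL,
    1: SacExtKind.ARBITRARY_DOWNMIX_RESIDUAL,
    12: SacExtKind.CORE_AUDIO_OBJECT_TYPE,
}


def classify_sac_ext_type(bs_sac_ext_type: int) -> SacExtKind:
    """Return the kind of spatial extension data for a bsSacExtType value."""
    return _SAC_EXT_TYPES.get(bs_sac_ext_type, SacExtKind.OTHER)
```

The `OTHER` fallback corresponds to the case in which the detected type is none of the three listed types and is decoded in whatever manner corresponds to that type.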

The operation of the audio signal decoding apparatus for each spatial extension data type detected by the spatial extension data type detecting unit 110 is described below.

First, consider the case in which the spatial extension data type detected by the spatial extension data type detecting unit 110 is data indicating the core audio object type of MPEG-4 audio. In this case, "bsSacExtType" is '12'.

FIG. 4 illustrates syntax for reading the core audio object type according to an embodiment of the present invention. The operation of the core audio object type detecting unit 120 is described below with reference to FIGS. 1 and 4.

If the spatial extension data type detecting unit 110 determines, as a result of detecting the type of the spatial extension data, that the spatial extension data is data indicating the core audio object type of MPEG-4 audio, the core audio object type detecting unit 120 detects the core audio object type.

More specifically, the core audio object type detecting unit 120 can read the core audio object type by means of the function "SpatialExtensionConfigData(12)" shown in FIG. 4. Here, "coreAudioObjectType" indicates the core audio object type of MPEG-4 audio.

Referring again to FIG. 1, the core audio data decoding unit 130 decodes the core audio data demultiplexed by the demultiplexing unit 100. More specifically, the core audio data decoding unit 130 decodes the demultiplexed core audio data according to the core audio object type detected by the core audio object type detecting unit 120.

As described above, the core audio object type is the audio object type with which the encoder encodes the downmixed signal. The core audio data may be encoded at the encoder by any one of various encoding schemes such as BSAC (Bit Sliced Arithmetic Coding), AAC (Advanced Audio Coding), and MP3 (MPEG Audio Layer-3). BSAC, AAC, and MP3 are only examples, and a person of ordinary skill in the art to which this embodiment belongs will understand that the core audio data may be encoded by various encoding schemes.

Second, consider the case in which the spatial extension data type detected by the spatial extension data type detecting unit 110 is residual coding data. In this case, "bsSacExtType" is '0'.

FIG. 5 illustrates syntax for decoding residual coding data according to an embodiment of the present invention. The operation of the residual coding data decoding unit 140 is described below with reference to FIGS. 1 and 5.

The residual coding data decoding unit 140 includes a first core audio object type determining unit 141, a first BSAC decoding unit 142, and a first AAC decoding unit 143, and decodes the residual coding data.

If the spatial extension data type detecting unit 110 determines, as a result of detecting the type of the spatial extension data, that the spatial extension data is residual coding data, the first core audio object type determining unit 141 determines whether the core audio object type is 'BSAC'.

Referring to FIG. 5, because '22' is assigned as the core audio object type of 'BSAC', the first core audio object type determining unit 141 determines whether the "coreAudioObjectType" detected by the core audio object type detecting unit 120 equals '22'.

If the first core audio object type determining unit 141 determines that the core audio object type is 'BSAC', the first BSAC decoding unit 142 decodes the residual signal using BSAC. For example, the first BSAC decoding unit 142 may be implemented by reference numeral 500 or 520 of the syntax shown in FIG. 5. At reference numeral 500 or 520, the first BSAC decoding unit 142 decodes the residual coding data by means of the function bsac_raw_data_block() defined in MPEG-4 ER BSAC. Here, "nch" of bsac_raw_data_block() must always be set to '1', where "nch" denotes the number of channels.

If the first core audio object type determining unit 141 determines that the core audio object type is not 'BSAC', the first AAC decoding unit 143 decodes the residual coding data using AAC. For example, the first AAC decoding unit 143 may be implemented by reference numeral 510 or 530 of the syntax shown in FIG. 5. At reference numeral 510 or 530, the first AAC decoding unit 143 decodes the residual coding data by means of individual_channel_stream(0), defined in the "MPEG-2 AAC Low Complexity profile bitstream syntax" described in subclause 6.3 of ISO/IEC 13818-7.

However, 'AAC' in the first AAC decoding unit 143 is merely one embodiment. When the first core audio object type determining unit 141 determines that the core audio object type is not 'BSAC', the first AAC decoding unit 143 may decode the residual coding data with the decoding scheme corresponding to the core audio object type detected by the first core audio object type determining unit 141. For example, when the detected core audio object type is 'MP3', the first AAC decoding unit 143 decodes the residual coding data using MP3.
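The branch taken by the residual coding data decoding unit 140 can be summarized in a short Python sketch. It returns only the name of the syntax element that the text associates with each branch; it does not parse a bitstream, and the names `BSAC_AOT` and `residual_syntax_element` are hypothetical.

```python
# Value the description gives for 'BSAC' as coreAudioObjectType.
BSAC_AOT = 22


def residual_syntax_element(core_audio_object_type: int) -> str:
    """Name the syntax element used to decode residual coding data.

    BSAC branch: reference numerals 500/520 in FIG. 5.
    Non-BSAC branch: reference numerals 510/530 in FIG. 5 (AAC in the
    described embodiment; the text notes that another scheme such as
    MP3 could be used instead when it matches the core type).
    """
    if core_audio_object_type == BSAC_AOT:
        return "bsac_raw_data_block() with nch = 1"
    return "individual_channel_stream(0)"
```

The design point is that the choice depends only on the already-detected "coreAudioObjectType", so the residual path reuses the decoder that the core path needs anyway.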

In this way, the core audio data decoded by the core audio data decoding unit 130 can be upmixed into a multi-channel signal using the residual coding data decoded by the first BSAC decoding unit 142 or the first AAC decoding unit 143.
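A minimal end-to-end sketch of this decode-then-upmix path is given below in Python. The decoder and the upmix are toy stand-ins (assumptions): `toy_decode` just turns bytes into integers, and `toy_upmix` rebuilds two channels as sum and difference of core and residual, which is not the MPEG Surround upmix. The names `decode_frame`, `toy_decode`, and `toy_upmix` are hypothetical.

```python
from typing import Callable, Dict, List, Optional

Decoder = Callable[[bytes], List[int]]


def toy_decode(payload: bytes) -> List[int]:
    """Placeholder for a real BSAC/AAC/MP3 decoder (assumption)."""
    return list(payload)


def toy_upmix(core: List[int],
              residual: Optional[List[int]]) -> List[List[int]]:
    """Toy two-channel upmix (assumption): core +/- residual."""
    if residual is None:
        return [list(core), list(core)]
    return [[c + r for c, r in zip(core, residual)],
            [c - r for c, r in zip(core, residual)]]


def decode_frame(core_payload: bytes,
                 residual_payload: Optional[bytes],
                 core_aot: int,
                 decoders: Dict[int, Decoder],
                 upmix=toy_upmix) -> List[List[int]]:
    """Decode core and residual with the SAME decoder, then upmix."""
    decode = decoders[core_aot]          # chosen by core audio object type
    core = decode(core_payload)
    residual = (decode(residual_payload)
                if residual_payload is not None else None)
    return upmix(core, residual)
```

For example, `decode_frame(bytes([3, 4]), bytes([1, 2]), 22, {22: toy_decode})` looks up a single decoder for type 22 and applies it to both payloads before upmixing.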

Third, consider the case in which the spatial extension data type detected by the spatial extension data type detecting unit 110 is arbitrary downmix residual coding data. In this case, "bsSacExtType" is '1'.

FIG. 6 illustrates syntax for decoding arbitrary downmix residual data according to an embodiment of the present invention. The operation of the arbitrary downmix residual coding data decoding unit 150 is described below with reference to FIGS. 1 and 6.

The arbitrary downmix residual coding data decoding unit 150 includes a second core audio object type determining unit 151, a second BSAC decoding unit 152, and a second AAC decoding unit 153, and decodes the arbitrary downmix residual coding data.

If the second core audio object type determining unit 151 determines that the core audio object type is 'BSAC', the second BSAC decoding unit 152 decodes the arbitrary downmix residual coding data using BSAC. For example, the second BSAC decoding unit 152 may be implemented by at least one of reference numerals 600, 620, 640, and 660 of the syntax shown in FIG. 6. At one or more of reference numerals 600, 620, 640, and 660, the second BSAC decoding unit 152 decodes the arbitrary downmix residual coding data by means of the function bsac_raw_data_block() defined in MPEG-4 ER BSAC. Here, "nch" of bsac_raw_data_block() must always be set to '1', where "nch" denotes the number of channels.

If the second core audio object type determining unit 151 determines that the core audio object type is not 'BSAC', the second AAC decoding unit 153 decodes the arbitrary downmix residual coding data using AAC. For example, the second AAC decoding unit 153 may be implemented by at least one of reference numerals 610, 630, 650, and 670 of the syntax shown in FIG. 6. At reference numeral 610 or 650, the second AAC decoding unit 153 decodes the arbitrary downmix residual coding data by means of individual_channel_stream(0), defined in the "MPEG-2 AAC Low Complexity profile bitstream syntax" described in subclause 6.3 of ISO/IEC 13818-7. At reference numeral 630 or 670, the second AAC decoding unit 153 decodes the arbitrary downmix residual coding data by means of channel_pair_element(), defined in the same bitstream syntax. Here, the parameter "common_window" is set to '1'.

However, 'AAC' in the second AAC decoding unit 153 is merely one embodiment. When the second core audio object type determining unit 151 determines that the core audio object type is not 'BSAC', the second AAC decoding unit 153 may decode the arbitrary downmix residual coding data using the decoding scheme that corresponds to the core audio object type detected by the second core audio object type determining unit 151. For example, if the detected core audio object type is 'MP3', the second AAC decoding unit 153 decodes the arbitrary downmix residual coding data using 'MP3'.
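The dispatch described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the `CoreType` enum, the `select_residual_syntax` function, and the layout of the returned dictionary are hypothetical names introduced here. Only the syntax element names (`bsac_raw_data_block`, `individual_channel_stream`, `channel_pair_element`) and the fixed values `nch = 1` and `common_window = 1` come from the text.

```python
from enum import Enum


class CoreType(Enum):
    """Core audio object types named in the text (hypothetical enum)."""
    BSAC = "BSAC"
    AAC = "AAC"
    MP3 = "MP3"


def select_residual_syntax(core_type: CoreType, channel_pair: bool = False) -> dict:
    """Describe which syntax element would carry the arbitrary downmix
    residual coding data for a given core audio object type."""
    if core_type is CoreType.BSAC:
        # Identification numbers 600/620/640/660: nch is always 1.
        return {"codec": "BSAC", "element": "bsac_raw_data_block", "nch": 1}
    if core_type is CoreType.AAC:
        if channel_pair:
            # Identification numbers 630/670: common_window is set to 1.
            return {"codec": "AAC", "element": "channel_pair_element",
                    "common_window": 1}
        # Identification numbers 610/650: individual_channel_stream(0).
        return {"codec": "AAC", "element": "individual_channel_stream",
                "argument": 0}
    # Any other core type (for example MP3) reuses its own decoding scheme;
    # the text names no syntax element for it, so none is given here.
    return {"codec": core_type.value, "element": None}
```

The point of the sketch is that the residual path never chooses a codec of its own: it only follows the core audio object type that was already detected.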

In this way, the core audio data decoded by the core audio data decoding unit 130 can be upmixed into a multi-channel signal using the arbitrary downmix residual coding data decoded by the second BSAC decoding unit 152 or the second AAC decoding unit 153.

Fourth, consider the case where the spatial extension data type detected by the spatial extension data type detection unit 110 is none of the following: data indicating the core audio object type of MPEG-4 audio, residual coding data, or arbitrary downmix residual coding data.

The spatial extension data decoding unit 160 performs decoding in a manner corresponding to the type of spatial extension data detected by the spatial extension data type detection unit 110. The core audio data decoded by the core audio data decoding unit 130 can then be upmixed into a multi-channel signal using the data decoded by the spatial extension data decoding unit 160.

The upmixing unit 170 upmixes the core audio data decoded by the core audio data decoding unit 130 into a multi-channel signal, using the results decoded by the first and second BSAC decoding units 142 and 152, the first and second AAC decoding units 143 and 153, or the spatial extension data decoding unit 160. Here, upmixing is the counterpart of downmixing: it generates a stereo signal of two or more channels from a mono signal.
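The text does not give an upmixing formula, so the following is only a toy sketch of how a residual signal can let an upmixer recover two channels from a mono downmix. It assumes a plain sum/difference downmix (mono = (L + R) / 2, residual = (L - R) / 2); this is an assumption made for illustration, and the function names `downmix` and `upmix` are hypothetical. A real MPEG Surround decoder also uses spatial parameters and operates on subband signals.

```python
from typing import List, Tuple


def downmix(left: List[float], right: List[float]) -> Tuple[List[float], List[float]]:
    """Toy downmix: return (mono, residual) under the sum/difference assumption."""
    mono = [(l + r) / 2 for l, r in zip(left, right)]
    residual = [(l - r) / 2 for l, r in zip(left, right)]
    return mono, residual


def upmix(mono: List[float], residual: List[float]) -> Tuple[List[float], List[float]]:
    """Toy upmix: rebuild (left, right) from the mono signal and the residual."""
    left = [m + d for m, d in zip(mono, residual)]
    right = [m - d for m, d in zip(mono, residual)]
    return left, right
```

Under this assumption the residual carries exactly the information that the mono downmix discarded, which is why it is worth coding alongside the core audio data.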

FIG. 7 is a flowchart illustrating a method of decoding a multi-channel audio signal according to an embodiment of the present invention.

Referring to FIG. 7, the method of decoding a multi-channel audio signal according to the present embodiment consists of steps processed in time series by the audio signal decoding apparatus shown in FIG. 1. Therefore, even where omitted below, the description given above of the audio signal decoding apparatus shown in FIG. 1 also applies to the decoding method according to the present embodiment.

In step 700, the spatial extension data type detection unit 110 detects the type of spatial extension data included in the encoding result of the audio signal.

In step 710, if the spatial extension data is data indicating the core audio object type, that is, the scheme by which the core audio data was encoded, the core audio object type detection unit 120 detects the core audio object type.

In step 720, the core audio data decoding unit 130 decodes the core audio data using the decoding scheme that corresponds to the detected core audio object type.

In step 730, if the spatial extension data is residual coding data, the residual coding data decoding unit 140 decodes the residual coding data using the decoding scheme that corresponds to the core audio object type.

In step 740, the upmixing unit 170 upmixes the decoded core audio data using the decoded residual coding data.

The decoding method according to the present embodiment may further include, when the spatial extension data is arbitrary downmix residual coding data, decoding the arbitrary downmix residual coding data using the decoding scheme that corresponds to the core audio object type. In this case, the upmixing unit 170 may upmix the decoded core audio data using the decoded residual coding data and the decoded arbitrary downmix residual coding data.

The decoding method according to the present embodiment may also include, when the spatial extension data is data other than the data indicating the core audio object type, the residual coding data, and the arbitrary downmix coding data, decoding the spatial extension data using a decoding scheme that corresponds to the type of the spatial extension data. In this case, the upmixing unit 170 may upmix the decoded core audio data using the decoded residual coding data, the decoded arbitrary downmix residual coding data, and the decoded spatial extension data.
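Steps 700 to 730, together with the two optional steps, can be sketched as a single routine. This is a hedged sketch under stated assumptions: the extension type labels, the `decode_frame` function, and the representation of the bitstream as a list of `(type, payload)` pairs are all hypothetical, and the decoders are passed in as plain callables rather than real BSAC or AAC decoders.

```python
from typing import Any, Callable, Dict, List, Tuple

# Hypothetical labels for the spatial extension data types.
EXT_CORE_TYPE = "core_audio_object_type"
EXT_RESIDUAL = "residual"
EXT_ARBITRARY_DOWNMIX_RESIDUAL = "arbitrary_downmix_residual"

Decoder = Callable[[bytes], Any]


def decode_frame(core_payload: bytes,
                 extensions: List[Tuple[str, Any]],
                 core_decoders: Dict[str, Decoder],
                 other_decoders: Dict[str, Decoder]) -> Tuple[Any, Dict[str, Any]]:
    """Return the decoded core audio data and the decoded side data
    that an upmixer (step 740) would consume."""
    core_type = None
    pending = []
    for ext_type, payload in extensions:           # step 700: detect the type
        if ext_type == EXT_CORE_TYPE:
            core_type = payload                    # step 710: core object type
        else:
            pending.append((ext_type, payload))
    if core_type is None:
        raise ValueError("no core audio object type was signalled")

    decode = core_decoders[core_type]              # one decoder for both paths
    core = decode(core_payload)                    # step 720
    side: Dict[str, Any] = {}
    for ext_type, payload in pending:
        if ext_type in (EXT_RESIDUAL, EXT_ARBITRARY_DOWNMIX_RESIDUAL):
            side[ext_type] = decode(payload)       # step 730 and its variant
        else:
            # Any other extension type uses a decoder chosen by its own type.
            side[ext_type] = other_decoders[ext_type](payload)
    return core, side
```

Because `decode` is looked up once and reused for the residual payloads, the sketch mirrors the stated benefit: the core audio data and the residual data share one decoding scheme.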

FIG. 8 is a block diagram illustrating an apparatus for encoding a multi-channel audio signal according to an embodiment of the present invention.

Referring to FIG. 8, the apparatus for encoding a multi-channel audio signal includes a downmixing unit 800, a core audio data encoding unit 810, a residual data encoding unit 820, an arbitrary downmix residual data encoding unit 830, and a multiplexing unit 840.

The downmixing unit 800 downmixes the input signal IN. The input signal IN may be a PCM (Pulse Code Modulation) signal obtained by converting an analog speech or audio signal into a digital signal. Downmixing here means generating a one-channel mono signal from a stereo signal of two or more channels; it reduces the number of bits allocated in the encoding process.

The core audio data encoding unit 810 encodes the core audio data output from the downmixing unit 800 according to a predetermined encoding scheme. The core audio data may be encoded by any one of various encoding schemes such as BSAC (Bit Sliced Arithmetic Coding), AAC (Advanced Audio Coding), and MP3 (MPEG Audio Layer-3). BSAC, AAC, and MP3 are merely examples, and those of ordinary skill in the art will understand that the core audio data can be encoded by various other encoding schemes.

The residual data encoding unit 820 includes a first core audio object type determining unit 821, a first BSAC encoding unit 822, and a first AAC encoding unit 823, and encodes the residual data.

The first core audio object type determining unit 821 identifies the core audio object type, that is, the scheme by which the core audio data encoding unit 810 encodes the core audio data, and thereby determines the encoding scheme for the residual data. For example, the first core audio object type determining unit 821 sets the encoding scheme for the residual data to 'BSAC' if the core audio object type is 'BSAC', and to 'AAC' if the core audio object type is 'AAC'.

The first BSAC encoding unit 822 encodes the residual data using 'BSAC' when the first core audio object type determining unit 821 determines that the core audio object type is 'BSAC'. Because the core audio data and the residual data are encoded with the same scheme, the complexity of the encoding stage is reduced.

The first AAC encoding unit 823 encodes the residual data using 'AAC' when the first core audio object type determining unit 821 determines that the core audio object type is 'AAC'. Because the core audio data and the residual data are encoded with the same scheme, the complexity of the encoding stage is reduced.

However, 'AAC' in the first AAC encoding unit 823 is merely one embodiment. When the first core audio object type determining unit 821 determines that the core audio object type is not 'BSAC', the first AAC encoding unit 823 may encode the residual data using the encoding scheme that corresponds to the core audio object type detected by the first core audio object type determining unit 821. For example, if the detected core audio object type is 'MP3', the first AAC encoding unit 823 encodes the residual data using 'MP3'.
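The rule applied by the residual data encoding unit 820 can be sketched as a lookup that reuses the core encoder for the residual data. The function name `encode_core_and_residual` and the `encoders` registry are hypothetical, and the encoders themselves are placeholder callables, not real BSAC, AAC, or MP3 implementations.

```python
from typing import Any, Callable, Dict, Sequence, Tuple

Encoder = Callable[[Sequence[float]], Any]


def encode_core_and_residual(core_type: str,
                             core: Sequence[float],
                             residual: Sequence[float],
                             encoders: Dict[str, Encoder]) -> Tuple[Any, Any]:
    """Encode the residual data with the same scheme as the core audio data."""
    if core_type not in encoders:
        raise KeyError(f"no encoder registered for core type {core_type!r}")
    encode = encoders[core_type]   # chosen once, from the core audio object type
    return encode(core), encode(residual)
```

Only one encoder per core audio object type needs to exist in the encoding stage, which is the complexity reduction the text refers to.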

The arbitrary downmix residual data encoding unit 830 includes a second core audio object type determining unit 831, a second BSAC encoding unit 832, and a second AAC encoding unit 833, and encodes the arbitrary downmix residual data.

The second core audio object type determining unit 831 identifies the core audio object type, that is, the scheme by which the core audio data encoding unit 810 encodes the core audio data, and thereby determines the encoding scheme for the arbitrary downmix residual data. For example, the second core audio object type determining unit 831 sets the encoding scheme to 'BSAC' if the core audio object type is 'BSAC', and to 'AAC' if the core audio object type is 'AAC'.

The second BSAC encoding unit 832 encodes the arbitrary downmix residual data using 'BSAC' when the second core audio object type determining unit 831 determines that the core audio object type is 'BSAC'. Because the core audio data and the arbitrary downmix residual data are encoded with the same scheme, the complexity of the encoding stage is reduced.

The second AAC encoding unit 833 encodes the arbitrary downmix residual data using 'AAC' when the second core audio object type determining unit 831 determines that the core audio object type is 'AAC'. Because the core audio data and the arbitrary downmix residual data are encoded with the same scheme, the complexity of the encoding stage is reduced.

However, 'AAC' in the second AAC encoding unit 833 is merely one embodiment. When the second core audio object type determining unit 831 determines that the core audio object type is not 'BSAC', the second AAC encoding unit 833 may encode the arbitrary downmix residual data using the encoding scheme that corresponds to the core audio object type detected by the second core audio object type determining unit 831. For example, if the detected core audio object type is 'MP3', the second AAC encoding unit 833 encodes the arbitrary downmix residual data using 'MP3'.

The multiplexing unit 840 multiplexes the result encoded by the core audio data encoding unit 810, the results encoded by the first and second BSAC encoding units 822 and 832, and the results encoded by the first and second AAC encoding units 823 and 833, generates a bitstream, and outputs it through the output terminal OUT.

FIG. 9 is a flowchart illustrating a method of encoding a multi-channel audio signal according to an embodiment of the present invention.

Referring to FIG. 9, the method of encoding a multi-channel audio signal according to the present embodiment consists of steps processed in time series by the audio signal encoding apparatus shown in FIG. 8. Therefore, even where omitted below, the description given above of the audio signal encoding apparatus shown in FIG. 8 also applies to the encoding method according to the present embodiment.

In step 900, the downmixing unit 800 downmixes the input audio signal to generate core audio data and residual data.

In step 910, the core audio data encoding unit 810 encodes the core audio data according to a predetermined encoding scheme.

In step 920, the residual data encoding unit 820 encodes the residual data according to the encoding scheme that corresponds to the core audio object type, that is, the scheme by which the core audio data was encoded.

In step 930, the multiplexing unit 840 multiplexes the encoded core audio data and the encoded residual data, and outputs the result as the encoding result for the audio signal.

Step 900 may downmix the input audio signal to generate core audio data, residual data, and arbitrary downmix residual data. In this case, the encoding method according to the present embodiment may further include encoding the arbitrary downmix residual data according to the encoding scheme that corresponds to the core audio object type. The multiplexing unit 840 may then multiplex the encoded core audio data, the encoded residual data, and the encoded arbitrary downmix residual data, and output the result as the encoding result for the audio signal.
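Steps 900 to 930 can be sketched end to end as follows. Everything here is an illustrative assumption: the sum/difference downmix stands in for the unspecified downmixing of step 900, the encoder is a placeholder callable, and the "bitstream" is modelled as a list of tagged entries rather than the actual multiplexed syntax. The name `encode_frame` and the tag strings are hypothetical.

```python
from typing import Any, Callable, Dict, List, Optional, Sequence, Tuple

Encoder = Callable[[Sequence[float]], Any]


def encode_frame(left: Sequence[float],
                 right: Sequence[float],
                 core_type: str,
                 encoders: Dict[str, Encoder],
                 arbitrary_downmix_residual: Optional[Sequence[float]] = None
                 ) -> List[Tuple[str, Any]]:
    """Return a toy multiplexed stream as a list of (tag, payload) entries."""
    # Step 900 (toy assumption): sum/difference downmix.
    core = [(l + r) / 2 for l, r in zip(left, right)]
    residual = [(l - r) / 2 for l, r in zip(left, right)]

    encode = encoders[core_type]            # same scheme for every payload
    stream: List[Tuple[str, Any]] = [
        ("core_audio_object_type", core_type),
        ("core", encode(core)),             # step 910
        ("residual", encode(residual)),     # step 920
    ]
    if arbitrary_downmix_residual is not None:
        # Optional step: arbitrary downmix residual data, same scheme again.
        stream.append(("arbitrary_downmix_residual",
                       encode(arbitrary_downmix_residual)))
    return stream                           # step 930: multiplexed output
```

Carrying the core audio object type in the stream is what lets a decoder pick the matching decoding scheme for the residual payloads without any extra codec signalling.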

The present invention is not limited to the embodiments described above, and those skilled in the art may of course modify it within the spirit of the present invention.

The present invention can also be embodied as computer-readable code on a computer-readable recording medium. The computer-readable recording medium includes every kind of recording device that stores data readable by a computer system. Examples of the computer-readable recording medium include ROM, RAM, CD-ROM, magnetic tape, hard disks, floppy disks, flash memory, and optical data storage devices, and also include media implemented in the form of a carrier wave (for example, transmission over the Internet). The computer-readable recording medium may also be distributed over computer systems connected by a network, so that the computer-readable code is stored and executed in a distributed manner.

FIG. 2 illustrates a syntax for detecting the spatial extension data type according to an embodiment of the present invention.

FIG. 3 illustrates an embodiment of a table of values assigned to "bsSacExtType" shown in FIG. 2.

FIG. 4 illustrates a syntax for reading the core audio object type according to an embodiment of the present invention.

FIG. 5 illustrates a syntax for decoding residual coding data according to an embodiment of the present invention.

FIG. 6 illustrates a syntax for decoding arbitrary downmix residual data according to an embodiment of the present invention.

Claims

Detecting a type of spatial extension data included in the bitstream;

Decoding core audio data by a decoding method according to a core audio object type;

Decoding the residual coding data using a first decoding scheme when the spatial extension data is residual coding data; And

And upmixing the decoded core audio data using the decoded residual coding data.

2. The method of claim 1,

Further comprising decoding the arbitrary downmix residual coding data using the decoding method according to the core audio object type when the spatial extension data is arbitrary downmix residual coding data.

3. The method of claim 2,

The step of upmixing

And upmixing the decoded core audio data using the decoded residual coding data or the decoded arbitrary downmix residual coding data.

4. The method of claim 2,

Further comprising decoding the spatial extension data by a decoding method according to the type of the spatial extension data when the spatial extension data is data other than the data indicating the core audio object type, the residual coding data, and the arbitrary downmix coding data.

5. The method of claim 4,

The step of upmixing

And upmixing the decoded core audio data using the decoded residual coding data, the decoded arbitrary downmix residual coding data, or the decoded spatial extension data.

Detecting a type of spatial extension data included in an encoding result of the audio signal;

Decoding the audio data using a decoding method according to a core audio object type;

And upmixing the decoded core audio data using the decoded residual coding data, recorded on a computer-readable recording medium.

A spatial extension data type detection unit for detecting a type of spatial extension data included in an encoding result of an audio signal;

A core audio data decoding unit decoding the core audio data by a decoding method according to a core audio object type;

A residual coding data decoding unit decoding the residual coding data using a first decoding scheme when the spatial expansion data is residual coding data; And

And an upmixing unit for upmixing the decoded core audio data using the decoded residual coding data.

8. The apparatus of claim 7,

Further comprising an arbitrary downmix residual coding data decoding unit for decoding the arbitrary downmix residual coding data using the decoding method according to the core audio object type when the spatial extension data is arbitrary downmix residual coding data.

9. The apparatus of claim 8,

The upmixing unit

Upmixes the decoded core audio data using the decoded residual coding data or the decoded arbitrary downmix residual coding data.

10. The apparatus of claim 8,

Further comprising a spatial extension data decoding unit for decoding the spatial extension data by a decoding method according to the type of the spatial extension data when the spatial extension data is data other than the data indicating the core audio object type, the residual coding data, and the arbitrary downmix coding data.

11. The apparatus of claim 10,

The upmixing unit

Upmixes the decoded core audio data using the decoded residual coding data, the decoded arbitrary downmix residual coding data, or the decoded spatial extension data.

Downmixing the input audio signal to generate core audio data and residual data;

Encoding the core audio data according to a predetermined encoding scheme;

Encoding the residual data according to the predetermined encoding scheme according to a core audio object type in which the core audio data is encoded; And

And encoding the encoded core audio data and the encoded residual data into a bitstream together with information indicating that residual coding is applied.

13. The method of claim 12,

The downmixing step

Wherein the core audio data, the residual data, and the arbitrary down mix residual data are generated by downmixing the input audio signal.

14. The method of claim 13,

Further comprising the step of encoding the arbitrary down-mix residual data according to the predetermined encoding scheme according to the core audio object type.

15. The method of claim 14,

The bit stream

And outputting the encoded core audio data, the encoded residual data, or the encoded arbitrary down mix residual data as a result of encoding the audio signal.

Encoding the core audio data according to a predetermined encoding scheme;

And encoding the encoded core audio data and the encoded residual data into a bitstream together with information indicating that residual coding is applied. Recorded on a computer readable recording medium.

A downmixing unit for downmixing an input audio signal to generate core audio data and residual data;

A core audio data encoding unit for encoding the core audio data according to a predetermined encoding scheme;

A residual data encoding unit for encoding the residual data according to the predetermined encoding scheme according to a core audio object type in which the core audio data is encoded; And

And a multiplexer for multiplexing the encoded core audio data and the encoded residual data into a bitstream together with information indicating that residual coding is applied.

18. The apparatus of claim 17,

The downmixing unit

And generates the core audio data, the residual data, and the arbitrary down mix residual data by downmixing the input audio signal.

19. The apparatus of claim 17,

Further comprising an arbitrary down-mix residual data encoding unit for encoding the arbitrary down-mix residual data according to the predetermined encoding scheme according to the core audio object type.

20. The apparatus of claim 19,

The multiplexer

And outputs the encoded core audio data, the encoded residual data, and the encoded arbitrary down mix residual data as a result of encoding the audio signal.

Receiving a bitstream including a bitstream corresponding to a downmixed audio core signal and additional information for generating a multi-channel;

Detecting a core object type from a bitstream corresponding to the downmixed audio core signal;

Decoding the downmixed audio core signal by a decoding method determined by the detected core object type;

Decoding the residual coding data using the decoding method according to the core audio object type if the spatial extension data included in the additional information for generating the multi-channel is residual coding data; And

The method of claim 1, wherein the first decoding method is AAC.

8. The apparatus of claim 7, wherein the first decoding scheme is AAC.

Decoding the downmixed mono signal;

Decoding the residual signal by referring to information indicating whether residual coding is applied;

Decoding the additional information for generating a plurality of channel signals from the decoded downmixed mono signal; And

And restoring the plurality of channel signals from the decoded downmixed mono signal using the decoded additional information and the decoded residual signal.