KR100923156B1

KR100923156B1 - System and Method for Encoding and Decoding for multi-channel audio

Info

Publication number: KR100923156B1
Application number: KR1020070042787A
Authority: KR
Inventors: 서정일; 백승권; 장인선; 장대영; 홍진우
Original assignee: 한국전자통신연구원
Priority date: 2006-05-02
Filing date: 2007-05-02
Publication date: 2009-10-23
Also published as: KR20070107615A

Abstract

1. 청구범위에 기재된 발명이 속한 기술분야1. TECHNICAL FIELD OF THE INVENTION

멀티채널 오디오 인코딩 및 디코딩 시스템 및 방법에 관한 것임.A multichannel audio encoding and decoding system and method.

2. 발명이 해결하고자 하는 기술적 과제2. Technical problem to be solved by the invention

멀티채널 오디오 신호를 디코딩하면서 동시에 음질을 향상시키는 멀티채널 오디오 인코딩 및 디코딩 시스템 및 방법을 제공함.Provided are a multichannel audio encoding and decoding system and method for decoding a multichannel audio signal while simultaneously improving sound quality.

3. 발명의 해결방법의 요지3. Summary of Solution to Invention

입력 신호인 다중화된 비트스트림을 낮은 샘플링 주파수의 멀티채널 오디오 비트스트림 및 SAC(Spatial Audio Coding) 비트스트림 및 SBR(Spectral Band Replication) 비트스트림으로 역다중화하는 비트스트림 역다중화부; 상기 멀티채널 오디오 비트스트림을 디코딩하여 다운믹스 신호 및 멀티채널 오디오 신호를 출력하는 멀티채널 오디오 디코딩부; 상기 다운믹스 신호 및 상기 SBR 비트스트림을 SBR 방식으로 디코딩하여 고주파 영역이 복원된 다운믹스 신호를 출력하는 SBR 디코딩부; 및 상기 SAC 비트스트림에 포함된 공간큐 및 상기 고주파 영역이 복원된 다운믹스 신호를 이용하여 상기 멀티채널 오디오 신호를 SAC 방식으로 디코딩하여 높은 샘플링 주파수의 멀티채널 오디오 신호를 출력하는 SAC 디코딩부를 포함함.A bitstream demultiplexer for demultiplexing the multiplexed bitstream as an input signal into a multichannel audio bitstream having a low sampling frequency, a spatial audio coding (SAC) bitstream, and a spectra band replication (SBR) bitstream; A multichannel audio decoder for decoding the multichannel audio bitstream and outputting a downmix signal and a multichannel audio signal; An SBR decoding unit for decoding the downmix signal and the SBR bitstream by an SBR method and outputting a downmix signal in which a high frequency region is restored; And a SAC decoding unit for outputting a multi-channel audio signal having a high sampling frequency by decoding the multi-channel audio signal using a spatial cue included in the SAC bitstream and a downmix signal from which the high frequency region is restored by the SAC method. .

4. 발명의 중요한 용도4. Important uses of the invention

멀티채널 오디오 신호의 인코딩 및 디코딩에 이용됨.Used to encode and decode multichannel audio signals.

멀티채널, 오디오, 인코딩, 디코딩, SAC, SBR Multichannel, Audio, Encoding, Decoding, SAC, SBR

Description

System and Method for Encoding and Decoding for multi-channel audio}

도 1은 종래의 스테레오 오디오 신호 디코더부의 일실시예 구성도,1 is a configuration diagram of an embodiment of a conventional stereo audio signal decoder;

도 2는 종래의 멀티채널 오디오 신호 디코더부의 일실시예 구성도,2 is a configuration diagram of an embodiment of a conventional multi-channel audio signal decoder;

도 3은 본 발명의 일실시예에 따른 멀티채널 오디오 디코딩 시스템의 구성도,3 is a block diagram of a multi-channel audio decoding system according to an embodiment of the present invention;

도 4는 상기 도 3의 멀티채널 오디오 디코딩부에 AAC 기술이 적용된 경우의 상세 구성도,4 is a detailed configuration diagram when AAC technology is applied to the multi-channel audio decoding unit of FIG. 3;

도 5는 본 발명의 일실시예에 따른 멀티채널 오디오 인코딩 시스템의 구성도,5 is a block diagram of a multi-channel audio encoding system according to an embodiment of the present invention;

도 6은 상기 도 5의 멀티채널 오디오 인코딩부에 AAC 기술이 적용된 경우의 상세 구성도.FIG. 6 is a detailed configuration diagram when AAC technology is applied to the multichannel audio encoding unit of FIG. 5. FIG.

도 7은 본 발명에 따른 멀티채널 오디오 비트스트림의 일실시예 구성도.7 is a diagram illustrating an embodiment of a multichannel audio bitstream in accordance with the present invention.

도 8은 도 3의 멀티채널 오디오 디코딩 시스템에서 수행되는 디코딩 과정을 나타내는 흐름도.8 is a flowchart illustrating a decoding process performed in the multichannel audio decoding system of FIG. 3.

도 9는 도 5의 멀티채널 오디오 인코딩 시스템에서 수행되는 인코딩 과정을 나타내는 흐름도.9 is a flowchart illustrating an encoding process performed in the multichannel audio encoding system of FIG. 5.

본 발명은 멀티채널 오디오 인코딩 및 디코딩 시스템 및 방법에 관한 것으로, 보다 상세하게는 SAC(Spatial Audio Coding)와 SBR(Spectral Band Replication)을 이용함으로써 기존의 멀티채널 오디오 수신기와 호환성을 유지하면서 고품질의 멀티채널 오디오 신호를 인코딩 및 디코딩하는 방법에 관한 것이다.The present invention relates to a multi-channel audio encoding and decoding system and method, and more particularly, by using SAC (Spatial Audio Coding) and SBR (Spectral Band Replication), high quality multi A method of encoding and decoding a channel audio signal.

도 1은 종래의 스테레오 오디오 신호 디코더의 일실시예 구성도이다. 1 is a configuration diagram of an embodiment of a conventional stereo audio signal decoder.

도 1에 도시된 바와 같이, MPEG-1 Audio Layer II 디코더(101)는 입력신호인 MPEG-2 Layer II 비트스트림으로부터 MPEG-1 Layer II 비트스트림만을 디코딩하여 스테레오 오디오 신호를 출력한다. 종래의 스테레오 오디오 신호 디코더는 24kHz, 32kHz, 44.1kHz, 48kHz의 다양한 샘플링 주파수를 지원하면서 고품질의 오디오 신호을 제공할 수 있으나 멀티채널 신호를 디코딩 할 수 없다는 단점이 있다.As shown in FIG. 1, the MPEG-1 Audio Layer II decoder 101 decodes only an MPEG-1 Layer II bitstream from an MPEG-2 Layer II bitstream as an input signal and outputs a stereo audio signal. The conventional stereo audio signal decoder can provide a high quality audio signal while supporting various sampling frequencies of 24 kHz, 32 kHz, 44.1 kHz, and 48 kHz, but has a disadvantage in that it cannot decode a multi-channel signal.

도 2는 종래의 멀티채널 DAB 수신기에서 오디오 신호 디코더의 일실시예 구성도이다. 2 is a diagram illustrating an embodiment of an audio signal decoder in a conventional multichannel DAB receiver.

도 2에 도시된 바와 같이, MPEG-2 Audio Layer II 디코더(201)는 입력 신호인 MPEG-2 Layer II 비트스트림으로부터 멀티채널 오디오 신호를 출력한다.As shown in FIG. 2, the MPEG-2 Audio Layer II decoder 201 outputs a multichannel audio signal from an MPEG-2 Layer II bitstream as an input signal.

한편, DAB 표준에 따르면 멀티채널 오디오 신호는 스테레오 오디오 신호에 대한 샘플링 주파수의 1/2로 샘플링된다. 따라서 도 2의 멀티채널 오디오 신호 디코더(201)로 DAB 표준에 따른 오디오 신호가 입력되는 경우, 스테레오 신호에 비하여 1/2로 다운샘플링 되어 음질이 열화된 멀티채널 오디오 신호 및 스테레오 오디오 신호가 출력된다는 단점이 있다. Meanwhile, according to the DAB standard, a multichannel audio signal is sampled at 1/2 of a sampling frequency for a stereo audio signal. Therefore, when the audio signal according to the DAB standard is input to the multi-channel audio signal decoder 201 of FIG. 2, the multi-channel audio signal and the stereo audio signal are downsampled to 1/2 compared to the stereo signal and the sound quality is degraded. There are disadvantages.

본 발명은, 상기 문제점을 해결하기 위하여 제안된 것으로, 멀티채널 오디오 신호를 디코딩하면서 동시에 음질을 향상시키는 멀티채널 오디오 인코딩 및 디코딩 시스템 및 방법을 제공하는데 그 목적이 있다.The present invention has been proposed to solve the above problems, and an object thereof is to provide a multichannel audio encoding and decoding system and method for decoding a multichannel audio signal and simultaneously improving sound quality.

상기 목적을 달성하기 위한 본 발명은, 멀티채널 오디오 디코딩 시스템에 있어서, 입력 신호인 다중화된 비트스트림을 낮은 샘플링 주파수의 멀티채널 오디오 비트스트림 및 SAC(Spatial Audio Coding) 비트스트림 및 SBR(Spectral Band Replication) 비트스트림으로 역다중화하는 비트스트림 역다중화부; 상기 멀티채널 오디오 비트스트림을 디코딩하여 다운믹스 신호 및 멀티채널 오디오 신호를 출력하는 멀티채널 오디오 디코딩부; 상기 다운믹스 신호 및 상기 SBR 비트스트림을 SBR 방식으로 디코딩하여 고주파 영역이 복원된 다운믹스 신호를 출력하는 SBR 디코딩부; 및 상기 SAC 비트스트림에 포함된 공간큐 및 상기 고주파 영역이 복원된 다운믹스 신호를 이용하여 상기 멀티채널 오디오 신호를 SAC 방식으로 디코딩하여 높은 샘플링 주파수의 멀티채널 오디오 신호를 출력하는 SAC 디코딩부를 포함한다.The present invention for achieving the above object, in the multi-channel audio decoding system, the multiplexed bitstream that is the input signal is a multi-channel audio bitstream and SAC (Spatial Audio Coding) bitstream and SBR (Spectral Band Replication) of low sampling frequency A bitstream demultiplexer for demultiplexing into a bitstream; A multichannel audio decoder for decoding the multichannel audio bitstream and outputting a downmix signal and a multichannel audio signal; An SBR decoding unit for decoding the downmix signal and the SBR bitstream by an SBR method and outputting a downmix signal in which a high frequency region is restored; And a SAC decoding unit which decodes the multichannel audio signal by the SAC method using a spatial cue included in the SAC bitstream and the downmix signal from which the high frequency region is restored, and outputs a multichannel audio signal having a high sampling frequency. .

또한, 상기 목적을 달성하기 위한 본 발명은, 멀티채널 오디오 인코딩 시스템 에 있어서, 입력 신호인 높은 샘플링 주파수의 멀티채널 오디오 신호의 샘플링 주파수를 다운샘플링하는 다운샘플링부; 상기 다운샘플링된 멀티채널 오디오 신호를 인코딩하여 멀티채널 오디오 비트스트림으로 출력하는 멀티채널 오디오 인코딩부; 상기 입력 신호인 멀티채널 오디오 신호를 SAC 방식으로 인코딩하여 SAC 비트스트림 및 다운믹스 신호를 출력하는 SAC 인코딩부; 상기 SAC 인코딩부로부터 출력되는 다운믹스 신호를 이용하여 상기 입력 신호인 멀티채널 오디오 신호를 SBR 방식으로 인코딩하여 SBR 비트스트림을 출력하는 SBR 인코딩부; 및 상기 멀티채널 오디오 비트스트림 및 상기 SAC 비트스트림 및 상기 SBR 비트스트림을 다중화하는 비트스트림 다중화부를 포함한다.The present invention also provides a multichannel audio encoding system comprising: a downsampling unit for downsampling a sampling frequency of a multichannel audio signal having a high sampling frequency as an input signal; A multichannel audio encoding unit for encoding the downsampled multichannel audio signal and outputting the multichannel audio bitstream; A SAC encoder for outputting a SAC bitstream and a downmix signal by encoding the input multi-channel audio signal using a SAC scheme; An SBR encoding unit for outputting an SBR bitstream by encoding the multichannel audio signal, which is the input signal, by using the downmix signal output from the SAC encoder; And a bitstream multiplexer configured to multiplex the multichannel audio bitstream, the SAC bitstream, and the SBR bitstream.

또한, 상기 목적을 달성하기 위한 본 발명은, 멀티채널 오디오 디코딩 방법에 있어서, 입력 신호인 다중화된 비트스트림을 낮은 샘플링 주파수의 멀티채널 오디오 비트스트림 및 SAC(Spatial Audio Coding) 비트스트림 및 SBR(Spectral Band Replication) 비트스트림으로 역다중화하는 비트스트림 역다중화단계; 상기 멀티채널 오디오 비트스트림을 디코딩하여 다운믹스 신호 및 멀티채널 오디오 신호를 출력하는 멀티채널 오디오 디코딩단계; 상기 다운믹스 신호 및 상기 SBR 비트스트림을 SBR 방식으로 디코딩하여 고주파 영역이 복원된 다운믹스 신호를 출력하는 SBR 디코딩단계; 및 상기 SAC 비트스트림에 포함된 공간큐 및 상기 고주파 영역이 복원된 다운믹스 신호를 이용하여 상기 멀티채널 오디오 신호를 SAC 방식으로 디코딩하여 높은 샘플링 주파수의 멀티채널 오디오 신호를 출력하는 SAC 디코딩단계를 포함한다.In addition, the present invention for achieving the above object, in the multi-channel audio decoding method, the multiplexed bitstream that is the input signal multi-channel audio bitstream and SAC (Spatial Audio Coding) bitstream and SBR (Spectral) Band Replication) a bitstream demultiplexing step of demultiplexing into a bitstream; A multichannel audio decoding step of decoding the multichannel audio bitstream to output a downmix signal and a multichannel audio signal; An SBR decoding step of decoding the downmix signal and the SBR bitstream in an SBR manner and outputting a downmix signal in which a high frequency region is restored; And a SAC decoding step of outputting a multi-channel audio signal having a high sampling frequency by decoding the multi-channel audio signal by a SAC method using a spatial cue included in the SAC bitstream and a downmix signal from which the high frequency region is restored. do.

또한, 상기 목적을 달성하기 위한 본 발명은, 멀티채널 오디오 인코딩 방법에 있어서, 입력 신호인 높은 샘플링 주파수의 멀티채널 오디오 신호의 샘플링 주파수를 다운샘플링하는 다운샘플링단계; 상기 다운샘플링된 멀티채널 오디오 신호를 인코딩하여 멀티채널 오디오 비트스트림으로 출력하는 멀티채널 오디오 인코딩단계; 상기 입력 신호인 멀티채널 오디오 신호를 SAC 방식으로 인코딩하여 SAC 비트스트림 및 다운믹스 신호를 출력하는 SAC 인코딩단계; 상기 SAC 인코딩단계에 의해 출력되는 다운믹스 신호를 이용하여 상기 입력 신호인 멀티채널 오디오 신호를 SBR 방식으로 인코딩하여 SBR 비트스트림을 출력하는 SBR 인코딩단계; 및 상기 멀티채널 오디오 비트스트림 및 상기 SAC 비트스트림 및 상기 SBR 비트스트림을 다중화하는 비트스트림 다중화단계를 포함한다.The present invention also provides a multichannel audio encoding method comprising: a downsampling step of downsampling a sampling frequency of a multichannel audio signal having a high sampling frequency as an input signal; A multichannel audio encoding step of encoding the downsampled multichannel audio signal and outputting the multichannel audio bitstream; A SAC encoding step of encoding the input multi-channel audio signal by the SAC method and outputting a SAC bitstream and a downmix signal; An SBR encoding step of outputting an SBR bitstream by encoding the multichannel audio signal, which is the input signal, by using the downmix signal output by the SAC encoding step by an SBR method; And a bitstream multiplexing step of multiplexing the multichannel audio bitstream, the SAC bitstream, and the SBR bitstream.

상술한 목적, 특징들 및 장점은 첨부된 도면과 관련한 다음의 상세한 설명을 통하여 보다 분명해 질 것이다. 이하, 첨부된 도면을 참조하여 본 발명에 따른 바람직한 일실시예를 상세히 설명한다.The above objects, features and advantages will become more apparent from the following detailed description taken in conjunction with the accompanying drawings. Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.

도 3은 본 발명의 일실시예에 따른 멀티채널 오디오 디코딩 시스템의 구성도이다.3 is a block diagram of a multi-channel audio decoding system according to an embodiment of the present invention.

도 3에 도시된 바와 같이, 본 발명에 따른 멀티채널 오디오 디코딩 시스템은 비트스트림 역다중화부(Bitstream De-Multiplexer, 301), 멀티채널 오디오 디코딩부(303), SBR 디코딩부(305), SAC 디코딩부(307)를 포함한다.As shown in FIG. 3, the multichannel audio decoding system according to the present invention includes a bitstream de-multiplexer 301, a multichannel audio decoding unit 303, an SBR decoding unit 305, and a SAC decoding. A portion 307 is included.

본 발명에 따른 디코딩 시스템의 입력 신호인 다중화된 비트스트림은 낮은 샘플링 주파수(Fs, 예를 들어, 24kHz)의 멀티채널 오디오 비트스트림, SBR 비트스트 림 및 SAC 비트스트림이 멀티플렉싱된 비트스트림이다. 이러한 다중화된 비트스트림의 생성에 대해서는 본 발명에 따른 멀티채널 오디오 인코딩 시스템에서 설명된다. 상기 다중화된 비트스트림은 DAB 오디오 신호가 될 수 있다.The multiplexed bitstream, which is an input signal of the decoding system according to the present invention, is a bitstream in which a multichannel audio bitstream, an SBR bitstream, and a SAC bitstream are multiplexed at a low sampling frequency (Fs, for example, 24 kHz). The generation of such multiplexed bitstreams is described in the multichannel audio encoding system according to the present invention. The multiplexed bitstream may be a DAB audio signal.

상기 비트스트림 역다중화부(301)는 상기 입력 신호인 다중화된 비트스트림으로부터 SAC 비트스트림 및 SBR 비트스트림을 추출한다.The bitstream demultiplexer 301 extracts a SAC bitstream and an SBR bitstream from the multiplexed bitstream as the input signal.

상기 멀티채널 오디오 디코딩부(303)는 상기 입력 신호인 다중화된 비트스트림을 디코딩하여 낮은 샘플링 주파수(예를 들어, 24kHz)를 갖는 스테레오 또는 모노 다운믹스 신호 및 멀티채널 오디오 신호를 출력한다.The multichannel audio decoding unit 303 decodes the multiplexed bitstream as the input signal and outputs a stereo or mono downmix signal having a low sampling frequency (eg, 24 kHz) and a multichannel audio signal.

상기 멀티채널 오디오 비트스트림이 MPEG-2 Layer II 비트스트림 또는 AAC 비트스트림인 경우, 상기 멀티채널 오디오 디코딩부(303)에는 MPEG-2 Audio Layer II 디코딩 기술 또는 AAC(Advanced Audio Coding) 디코딩 기술이 이용될 수 있다.When the multichannel audio bitstream is an MPEG-2 Layer II bitstream or an AAC bitstream, the multichannel audio decoding unit 303 uses MPEG-2 Audio Layer II decoding technology or AAC (Advanced Audio Coding) decoding technology. Can be.

상기 SBR 디코딩부(305)는 상기 멀티채널 오디오 디코딩부(303)로부터 출력된 스테레오 또는 모노 다운믹스 신호 및 상기 비트스트림 역다중화부(301)로부터 추출된 SBR 비트스트림을 이용하여 고주파 영역이 복원된 다운믹스 스테레오 또는 모노 신호(예를 들면, Fs = 48kHz)를 디코딩한다.The SBR decoder 305 restores a high frequency region by using a stereo or mono downmix signal output from the multichannel audio decoder 303 and an SBR bitstream extracted from the bitstream demultiplexer 301. Decode downmixed stereo or mono signals (e.g., Fs = 48kHz).

SBR(Spectral Band Replication)는 오디오 신호의 저주파 대역 성분을 분석하여 고주파 대역 성분을 복원하는 기술이다. SBR에 대해서는 국제 표준[ISO/IEC 14496-3 AMENDMENT 1: Bandwidth Extension]에 개시되어 있다.SBR (Spectral Band Replication) is a technique of restoring high frequency band components by analyzing low frequency band components of an audio signal. SBR is described in the international standard [ISO / IEC 14496-3 AMENDMENT 1: Bandwidth Extension].

상기 SAC 디코딩부(307)는 상기 비트스트림 역다중화부(301)로부터 추출된 SAC 비트스트림에 포함된 공간큐(spatial cue) 및 상기 SBR 디코딩부(305)로부터 출력된 다운믹스 스테레오 또는 모노 신호를 이용하여 오디오 신호에 대한 정보를 추출하고 제어함으로써, 상기 멀티채널 오디오 디코딩부(303)로부터 출력된 낮은 샘플링 주파수(예를 들어, 24kHz)를 갖는 멀티채널 오디오 신호를 높은 샘플링 주파수(예를 들어, 48kHz)를 갖는 멀티채널 오디오 신호로 디코딩한다.The SAC decoding unit 307 may output a spatial cue included in the SAC bitstream extracted from the bitstream demultiplexing unit 301 and a downmix stereo or mono signal output from the SBR decoding unit 305. By extracting and controlling information about the audio signal using the multi-channel audio signal having a low sampling frequency (eg, 24 kHz) output from the multi-channel audio decoding unit 303, a high sampling frequency (eg, 48 kHz) into a multichannel audio signal.

SAC(Spatial Audio Coding)는 멀티채널 오디오 신호를 다운믹스된 모노 또는 스테레오 신호 및 공간큐 정보로 표현, 전송 및 복원하는 방법으로 낮은 비트율에서도 고품질의 멀티채널 오디오 신호를 전송할 수 있다.SAC (Spatial Audio Coding) is a method of representing, transmitting, and restoring a multichannel audio signal as a downmixed mono or stereo signal and spatial cue information to transmit high quality multichannel audio signals at low bit rates.

상기 스테레오 또는 모노 다운믹스 신호 및 SAC 비트스트림에 포함된 공간큐(spatial cue) 파라미터를 이용하여 멀티채널 오디오 신호를 디코딩하는 방법은 Baumgarte와 Faller의 논문(C. Faller and F. Baumgarte, “Binaural Cue Coding applied to stereo and multi-channel audio compression,” 112th AES Convention, Munich, prepreint 5574, May 3002) 이나 MPEG Surround 표준 (ISO/IEC JTC1/SC29/WG11, N7947, ISO/IEC 23003-1:3006/FCD, MPEG Surround, Jan., 3006) 에 개시되어 있는 방법을 적용할 수 있으며, 이에 따라 청각적으로 원음과 차이가 없는 멀티채널 오디오 신호를 디코딩할 수 있다.A method for decoding a multichannel audio signal using the spatial cue parameter included in the stereo or mono downmix signal and the SAC bitstream is described by Baumgarte and Faller (C. Faller and F. Baumgarte, “Binaural Cue”). Coding applied to stereo and multi-channel audio compression, ”112th AES Convention, Munich, prepreint 5574, May 3002) or MPEG Surround standards (ISO / IEC JTC1 / SC29 / WG11, N7947, ISO / IEC 23003-1: 3006 / FCD , MPEG Surround, Jan., 3006) can be applied, and thus it is possible to decode a multi-channel audio signal that is acoustically indistinguishable from the original sound.

도 4는 상기 도 3의 멀티채널 오디오 디코딩부(303)에 AAC 기술이 적용된 경우의 상세 구성도이다.FIG. 4 is a detailed configuration diagram when AAC technology is applied to the multichannel audio decoding unit 303 of FIG. 3.

AAC(Advanced Audio Coding)는 MPEG-2 또는 MPEG-4에서 사용되는 오디오 신호 압축 방식으로, MPEG-1에 비해 압축률이 높으면서도 음질이 열화되지 않으며, 다양한 대역과 많은 채널에 대응할 수 있는 특징이 있다.AAC (Advanced Audio Coding) is an audio signal compression method used in MPEG-2 or MPEG-4. It has a high compression ratio and no sound quality deterioration compared to MPEG-1, and has a feature that can cope with various bands and many channels. .

도 4에 도시된 바와 같이, 상기 멀티채널 오디오 디코딩부(303)는 AAC 디코더(401), 채널 리믹서(403)를 포함한다.As shown in FIG. 4, the multichannel audio decoding unit 303 includes an AAC decoder 401 and a channel remixer 403.

상기 AAC 디코더(401)는 상기 비트스트림 역다중화부(301)으로부터 출력된 멀티채널 오디오 비트스트림(AAC 비트스트림)으로부터 낮은 샘플링 주파수(예를 들어, 24kHz)를 갖는 스테레오 또는 모노 다운믹스 신호 및 멀티채널(LO, RO, T, Q1, Q2) 오디오 신호를 디코딩한다.The AAC decoder 401 is a stereo or mono downmix signal having a low sampling frequency (eg, 24 kHz) and a multi-channel from the multichannel audio bitstream (AAC bitstream) output from the bitstream demultiplexer 301. Decode the channel (LO, RO, T, Q1, Q2) audio signals.

상기 채널 리믹서(403)는 상기 AAC 디코더(401)로부터 디코딩된 멀티채널(LO, RO, T, Q1, Q2) 오디오 신호를 멀티채널(L, R, C, Ls, Rs) 오디오 신호로 리믹싱하여 상기 SAC 디코딩부(307)에 전달한다.The channel remixer 403 converts the multichannel (LO, RO, T, Q1, Q2) audio signals decoded from the AAC decoder 401 into a multichannel (L, R, C, Ls, Rs) audio signal. The mixture is transferred to the SAC decoding unit 307.

상기 도 3의 멀티채널 오디오 디코딩 시스템으로 입력되는 다중화된 비트스트림은 후술되는 바와 같이 본 발명에 따른 멀티채널 오디오 인코딩 시스템에 의해 생성된다.The multiplexed bitstream input to the multichannel audio decoding system of FIG. 3 is generated by the multichannel audio encoding system according to the present invention as described below.

도 5는 본 발명의 일실시예에 따른 멀티채널 오디오 인코딩 시스템의 구성도이다. 5 is a block diagram of a multichannel audio encoding system according to an embodiment of the present invention.

도 5에 도시된 바와 같이, 본 발명에 따른 멀티채널 오디오 인코더 시스템은 다운샘플링부(501), 멀티채널 오디오 인코딩부(503), SAC 인코딩부(505), SBR 인코딩부(507), 비트스트림 다중화부(509)를 포함한다.As shown in FIG. 5, the multichannel audio encoder system according to the present invention includes a downsampling unit 501, a multichannel audio encoding unit 503, an SAC encoding unit 505, an SBR encoding unit 507, and a bitstream. The multiplexer 509 is included.

상기 다운샘플링부(501)는 입력 신호인 멀티채널 오디오 신호(예를 들면, Fs = 48kHz)의 샘플링 주파수를 1/2배로 다운샘플링하여 멀티채널 오디오 신호(예를 들면, Fs = 24kHz)를 출력한다.The downsampling unit 501 outputs a multichannel audio signal (for example, Fs = 24kHz) by downsampling a sampling frequency of a multichannel audio signal (for example, Fs = 48kHz) that is an input signal by 1/2 times. do.

상기 멀티채널 오디오 인코딩부(503)는 상기 다운샘플링부(501)로부터 출력되는 멀티채널 오디오 신호(예를 들면, Fs = 24kHz)를 멀티채널 오디오 비트스트림으로 인코딩하고, 상기 인코딩 과정에서 스테레오(또는 모노) 다운믹스 신호(예를 들면, Fs = 24kHz)를 생성한다.The multichannel audio encoding unit 503 encodes a multichannel audio signal (for example, Fs = 24kHz) output from the downsampling unit 501 into a multichannel audio bitstream, and performs stereo (or Mono) to generate a downmix signal (e.g., Fs = 24 kHz).

상기 멀티채널 오디오 인코딩부(303)에는 MPEG-2 Audio Layer II 인코딩 기술 또는 AAC 인코딩 기술이 이용될 수 있으며, 이 경우, 상기 멀티채널 오디오 비트스트림은 MPEG-2 Layer II 비트스트림 또는 AAC 비트스트림이다.The multichannel audio encoding unit 303 may use MPEG-2 Audio Layer II encoding technology or AAC encoding technology. In this case, the multichannel audio bitstream is an MPEG-2 Layer II bitstream or an AAC bitstream. .

상기 SAC 인코딩부(505)는 입력 신호인 멀티채널 오디오 신호로부터 공간큐(spatial cue) 파라미터를 추출하고 인코딩함으로써 SAC 부가정보 비트스트림을 생성하고, 상기 멀티채널 오디오 신호로부터 스테레오(또는 모노) 다운믹스 신호를 생성한다. 이때 SAC 부가정보 비트스트림을 구성하는 공간큐 파라미터는 상기 다운샘플링부(501)의 다운샘플링 과정에서 상쇄된 고주파수 성분만으로 구성된다.The SAC encoder 505 extracts and encodes a spatial cue parameter from a multichannel audio signal as an input signal to generate a SAC side information bitstream, and generates a stereo (or mono) downmix from the multichannel audio signal. Generate a signal. In this case, the spatial cue parameter constituting the SAC side information bitstream includes only high frequency components canceled during the downsampling process of the downsampling unit 501.

상기 멀티채널 오디오 인코딩부(503) 및 상기 SAC 인코딩부(505)가 멀티채널(예를 들어 5.1채널) 신호(L, R, C, Ls, Rs, Lfe)를 스테레오(또는 모노)로 다운믹스하는 방법은 ITU-R BS. 775-1에서 정의된 방법과 동일하며 아래 [수학식 1]에 기초하여 다운믹스한다. The multichannel audio encoding unit 503 and the SAC encoding unit 505 downmix a multichannel (for example, 5.1 channel) signal L, R, C, Ls, Rs, Lfe to stereo (or mono). How to ITU-R BS. Same as the method defined in 775-1 and downmixed based on Equation 1 below.

이때, L0와 R0는 스테레오 다운믹스 신호, L과 R은 좌우 메인채널, C(center)는 중앙 채널, Ls(left surround)는 좌측 서라운드 채널, Rs(right surround)는 우측 서라운드 채널이다. (5.1채널) In this case, L0 and R0 are stereo downmix signals, L and R are left and right main channels, C (center) is a center channel, Ls (left surround) is a left surround channel, and Rs (right surround) is a right surround channel. (5.1 channel)

상기 SBR 인코딩부(507)는 상기 SAC 인코딩부(505)로부터 출력되는 스테레오(또는 모노) 다운믹스 신호 및 상기 멀티채널 오디오 인코딩부(503)로부터 출력되는 스테레오(또는 모노) 다운믹스 신호를 이용하여 SBR 비트스트림을 생성한다.The SBR encoding unit 507 uses a stereo (or mono) downmix signal output from the SAC encoding unit 505 and a stereo (or mono) downmix signal output from the multichannel audio encoding unit 503. Create an SBR bitstream.

상기 비트스트림 다중화부(509)는 상기 멀티채널 오디오 인코딩부(503)로부터 출력되는 멀티채널 오디오 비트스트림 및 상기 SAC 인코딩부(505)로부터 출력되는 SAC 부가정보 비트스트림 및 상기 SBR 인코딩부(507)로부터 출력되는 SBR 비트스트림을 멀티플렉싱하여 다중화된 비트스트림을 생성한다. The bitstream multiplexer 509 is a multichannel audio bitstream output from the multichannel audio encoder 503 and an SAC side information bitstream output from the SAC encoder 505 and the SBR encoder 507. A multiplexed bitstream is generated by multiplexing the SBR bitstream output from the multiplexer.

상기 다중화된 비트스트림은 DAB 오디오 신호가 될 수 있다.The multiplexed bitstream may be a DAB audio signal.

도 6은 상기 도 5의 멀티채널 오디오 인코딩부(503)에 AAC 기술이 적용된 경우의 상세 구성도이다.FIG. 6 is a detailed configuration diagram when AAC technology is applied to the multi-channel audio encoding unit 503 of FIG. 5.

도 6에 도시된 바와 같이, 상기 멀티채널 오디오 인코딩부(503)는 채널 믹서(601) 및 AAC 인코더(603)를 포함한다.As shown in FIG. 6, the multichannel audio encoding unit 503 includes a channel mixer 601 and an AAC encoder 603.

상기 채널 믹서(601)는 상기 다운샘플링부(501)로부터 출력된 멀티채널 오디 오 신호(L, R, C, Ls, Rs)를 상기 [수학식 1] 및 다음의 [수학식 2]에 따라 멀티채널 오디오 신호(LO, RO, T, Q1, Q2)로 믹싱하여 출력한다.The channel mixer 601 outputs the multichannel audio signals L, R, C, Ls, and Rs output from the downsampling unit 501 according to Equation 1 and Equation 2 below. The multichannel audio signals LO, RO, T, Q1, and Q2 are mixed and output.

이때, C(center)는 중앙 채널, Ls(left surround)는 좌측 서라운드 채널, Rs(right surround)는 우측 서라운드 채널, T, Q1, Q2는 스테레오 다운믹스 신호를 제외한 나머지 멀티채널 신호이다. (5.1채널) In this case, C (center) is the center channel, Ls (left surround) is the left surround channel, Rs (right surround) is the right surround channel, T, Q1, Q2 are the remaining multi-channel signals except for the stereo downmix signal. (5.1 channel)

상기 AAC 인코더(603)는 상기 채널 믹서(601)로부터 출력된 멀티채널 오디오 신호를 AAC 비트스트림으로 인코딩하고, 상기 인코딩 과정에서 스테레오 또는 모노 다운믹스 신호를 생성한다.The AAC encoder 603 encodes the multichannel audio signal output from the channel mixer 601 into an AAC bitstream, and generates a stereo or mono downmix signal in the encoding process.

도 7은 본 발명에 따른 멀티채널 오디오 비트스트림의 일실시예 구성도로서, 1/2 다운샘플링된 후 MPEG-2 Layer II 표준으로 부호화된 멀티채널 오디오 신호, SBR 비트스트림 및 SAC 비트스트림을 부가데이터 영역(Ancillary Data)에 다중화한 경우의 비트스트림 구성도이다. 7 is a diagram illustrating an embodiment of a multichannel audio bitstream according to the present invention, in which a multichannel audio signal, an SBR bitstream, and a SAC bitstream, which are half-sampled and encoded in the MPEG-2 Layer II standard, are added. The bitstream configuration diagram when multiplexing the data area (Ancillary Data).

도 7에 도시된 바와 같이, 1/2 다운샘플링된 후 MPEG-2 Layer II로 부호화된 멀티채널 오디오 신호(T, Q1, Q2)는 Ancillary Data 1 영역에 다중화되고, 다운믹스 스테레오 신호의 고주파수 영역을 부호화한 SBR 비트스트림은 Ancillary Data 2 영역에 다중화되고, 멀티채널 부호화 정보인 SAC 비트스트림은 Ancillary Data 3 영역에 다중화된다.As shown in FIG. 7, the multi-channel audio signals T, Q1, and Q2 encoded by MPEG-2 Layer II after 1/2 down-sampling are multiplexed into an Ancillary Data 1 region, and a high frequency region of the downmix stereo signal. The SBR bitstream encoded by the multiplexer is multiplexed into an Ancillary Data 2 region, and the SAC bitstream, which is multichannel encoded information, is multiplexed into an Ancillary Data 3 region.

도 8은 도 3의 멀티채널 오디오 디코딩 시스템에서 수행되는 디코딩 과정을 나타내는 흐름도이다.8 is a flowchart illustrating a decoding process performed in the multichannel audio decoding system of FIG. 3.

도 8에 도시된 바와 같이, 상기 비트스트림 역다중화부(301)는 상기 입력 신호인 다중화된 비트스트림으로부터 SAC 비트스트림 및 SBR 비트스트림을 추출한다(801).As illustrated in FIG. 8, the bitstream demultiplexer 301 extracts an SAC bitstream and an SBR bitstream from the multiplexed bitstream that is the input signal (801).

상기 멀티채널 오디오 디코딩부(303)는 상기 입력 신호인 다중화된 비트스트림을 디코딩하여 낮은 샘플링 주파수를 갖는 스테레오 또는 모노 다운믹스 신호 및 멀티채널 오디오 신호를 출력한다(803).The multichannel audio decoding unit 303 decodes the multiplexed bitstream as the input signal and outputs a stereo or mono downmix signal having a low sampling frequency and a multichannel audio signal (803).

상기 SBR 디코딩부(305)는 상기 멀티채널 오디오 디코딩부(303)로부터 출력된 스테레오 또는 모노 다운믹스 신호 및 상기 비트스트림 역다중화부(301)로부터 추출된 SBR 비트스트림을 이용하여 고주파 영역이 복원된 다운믹스 스테레오 또는 모노 신호를 디코딩한다(805). The SBR decoder 305 restores a high frequency region by using a stereo or mono downmix signal output from the multichannel audio decoder 303 and an SBR bitstream extracted from the bitstream demultiplexer 301. Decode the downmix stereo or mono signal (805).

상기 SAC 디코딩부(307)는 상기 비트스트림 역다중화부(301)로부터 추출된 SAC 비트스트림에 포함된 공간큐(spatial cue) 및 상기 SBR 디코딩부(305)로부터 출력된 다운믹스 스테레오 또는 모노 신호를 이용하여 오디오 신호에 대한 정보를 추출하고 제어함으로써, 상기 멀티채널 오디오 디코딩부(303)로부터 출력된 낮은 샘플링 주파수를 갖는 멀티채널 오디오 신호를 높은 샘플링 주파수를 갖는 멀티채널 오디오 신호로 디코딩한다(807).The SAC decoding unit 307 may output a spatial cue included in the SAC bitstream extracted from the bitstream demultiplexing unit 301 and a downmix stereo or mono signal output from the SBR decoding unit 305. By extracting and controlling information on the audio signal using the multichannel audio decoding unit 303, the multichannel audio signal having the low sampling frequency is decoded into the multichannel audio signal having the high sampling frequency (807). .

도 9는 도 5의 멀티채널 오디오 인코딩 시스템에서 수행되는 인코딩 과정을 나타내는 흐름도이다.9 is a flowchart illustrating an encoding process performed in the multichannel audio encoding system of FIG. 5.

도 9에 도시된 바와 같이, 상기 다운샘플링부(501)는 입력 신호인 멀티채널 오디오 신호의 샘플링 주파수를 1/2배로 다운샘플링하여 멀티채널 오디오 신호를 출력한다(901).As illustrated in FIG. 9, the downsampling unit 501 outputs a multichannel audio signal by downsampling a sampling frequency of a multichannel audio signal that is an input signal by a factor of 1/2 (operation 901).

상기 SAC 인코딩부(505)는 입력 신호인 멀티채널 오디오 신호로부터 공간큐(spatial cue) 파라미터를 추출하고 인코딩함으로써 SAC 부가정보 비트스트림을 생성하고, 상기 멀티채널 오디오 신호로부터 스테레오(또는 모노) 다운믹스 신호를 생성한다(903).The SAC encoder 505 extracts and encodes a spatial cue parameter from a multichannel audio signal as an input signal to generate a SAC side information bitstream, and generates a stereo (or mono) downmix from the multichannel audio signal. Generate a signal (903).

상기 멀티채널 오디오 인코딩부(503)는 상기 다운샘플링부(501)로부터 출력되는 멀티채널 오디오 신호를 멀티채널 오디오 비트스트림으로 인코딩하고, 상기 인코딩 과정에서 스테레오(또는 모노) 다운믹스 신호를 생성한다(905).The multichannel audio encoding unit 503 encodes the multichannel audio signal output from the downsampling unit 501 into a multichannel audio bitstream and generates a stereo (or mono) downmix signal in the encoding process ( 905).

상기 SBR 인코딩부(507)는 상기 SAC 인코딩부(505)로부터 출력되는 스테레오(또는 모노) 다운믹스 신호 및 상기 멀티채널 오디오 인코딩부(503)로부터 출력되는 스테레오(또는 모노) 다운믹스 신호를 이용하여 SBR 비트스트림을 생성한다(907).The SBR encoding unit 507 uses a stereo (or mono) downmix signal output from the SAC encoding unit 505 and a stereo (or mono) downmix signal output from the multichannel audio encoding unit 503. A SBR bitstream is generated (907).

상기 비트스트림 다중화부(509)는 상기 멀티채널 오디오 인코딩부(503)로부터 출력되는 멀티채널 오디오 비트스트림 및 상기 SAC 인코딩부(505)로부터 출력되는 SAC 부가정보 비트스트림 및 상기 SBR 인코딩부(507)로부터 출력되는 SBR 비트스트림를 멀티플렉싱하여 다중화된 비트스트림을 생성한다(909).The bitstream multiplexer 509 is a multichannel audio bitstream output from the multichannel audio encoder 503 and an SAC side information bitstream output from the SAC encoder 505 and the SBR encoder 507. The multiplexed SBR bitstream is multiplexed to generate a multiplexed bitstream (909).

상술한 바와 같은 본 발명의 방법은 프로그램으로 구현되어 컴퓨터로 읽을 수 있는 형태로 기록매체(씨디롬, 램, 롬, 플로피 디스크, 하드 디스크, 광자기 디스크 등)에 저장될 수 있다. 이러한 과정은 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자가 용이하게 실시할 수 있으므로 더 이상 상세히 설명하지 않기로 한다.As described above, the method of the present invention may be implemented as a program and stored in a recording medium (CD-ROM, RAM, ROM, floppy disk, hard disk, magneto-optical disk, etc.) in a computer-readable form. Since this process can be easily implemented by those skilled in the art will not be described in more detail.

이상에서 설명한 본 발명은 전술한 실시예 및 첨부된 도면에 의해 한정되는 것이 아니고, 본 발명의 기술적 사상을 벗어나지 않는 범위 내에서 여러 가지 치환, 변형 및 변경이 가능하다는 것이 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 있어 명백할 것이다.The present invention described above is not limited to the above-described embodiments and the accompanying drawings, and various substitutions, modifications, and changes can be made in the art without departing from the technical spirit of the present invention. It will be clear to those of ordinary knowledge.

상기와 같은 본 발명은, 모노 또는 스테레오 다운믹스 신호를 제공하므로 종래의 모노, 스테레오 및 멀티채널 오디오 수신기와 호환성을 유지하면서 48kHz의 샘플링 주파수를 지원하는 고품질의 멀티채널 오디오 서비스를 제공할 수 있다. 또한, SBR과 SAC기술을 이용하므로 비트레이트를 줄이면서 음질이 향상된 멀티채널 오디오 신호로 인코딩 및 디코딩할 수 있다.Since the present invention provides a mono or stereo downmix signal, it is possible to provide a high quality multichannel audio service supporting a sampling frequency of 48 kHz while maintaining compatibility with conventional mono, stereo and multichannel audio receivers. In addition, the SBR and SAC technology enables encoding and decoding of multichannel audio signals with improved sound quality while reducing bitrate.

Claims

In a multichannel audio decoding system,

A bitstream demultiplexer for demultiplexing a multiplexed bitstream of a DAB scheme as an input signal into a spatial audio coding (SAC) bitstream and a spectra band replication (SBR) bitstream;

A multichannel audio decoder for decoding the multiplexed bitstream and outputting a downmix signal and a multichannel audio signal;

An SBR decoding unit for decoding the downmix signal and the SBR bitstream by an SBR method and outputting a downmix signal in which a high frequency region is restored; And

SAC decoding unit for outputting a multi-channel audio signal of a high sampling frequency by decoding the multi-channel audio signal by the SAC method using the spatial cue included in the SAC bitstream and the downmix signal from which the high frequency region is restored

Multichannel audio decoding system comprising a.

delete

The method of claim 1,

The multichannel audio decoding unit

An AAC decoder for decoding the multichannel audio bitstream and outputting the downmix signal and the multichannel audio signal; And

The channel remixer for remixing the channels of the multi-channel audio signal decoded from the AAC decoder and outputs to the SAC decoding unit

Multichannel audio decoding system comprising a.

The method of claim 1,

The downmix signal is

Which can be either stereo or mono

Multichannel Audio Decoding System.

In a multichannel audio encoding system,

A downsampling unit for downsampling a sampling frequency of a multichannel audio signal having a high sampling frequency as an input signal;

A multichannel audio encoding unit for encoding the downsampled multichannel audio signal and outputting the multichannel audio bitstream;

A SAC encoder for outputting a SAC bitstream and a downmix signal by encoding the input multi-channel audio signal using a SAC scheme;

An SBR encoding unit for outputting an SBR bitstream by encoding the multichannel audio signal, which is the input signal, by using the downmix signal output from the SAC encoder; And

A bitstream multiplexer for outputting a DAB multiplexed bitstream for multiplexing the multichannel audio bitstream, the SAC bitstream, and the SBR bitstream.

Multichannel audio encoding system comprising a.

The method of claim 5,

The multichannel audio encoding unit

A channel mixer for mixing channels of the downsampled multichannel audio signal; And

AAC encoder for decoding the multi-channel audio signal output from the channel mixer and outputs the multi-channel audio bitstream

Multichannel audio encoding system comprising a.

The method of claim 5,

The multichannel audio encoding unit

Further outputting a downmix signal by encoding the downsampled multichannel audio signal,

The SBR encoding unit

Encoding a multichannel audio signal which is the input signal by further using a downmix signal output from the multichannel audio encoding unit

Multichannel Audio Encoding System.

The method according to claim 5 or 7,

The downmix signal is

Which can be either stereo or mono

Multichannel Audio Encoding System.

The method according to claim 5 or 7,

The downmix signal is

Generated by Equation 1 below

Multichannel Audio Encoding System.

[Equation 1]

In this case, L0 and R0 are stereo downmix signals, L and R are left and right main channels, C (center) is a center channel, Ls (left surround) is a left surround channel, and Rs (right surround) is a right surround channel.

The method of claim 6,

The channel mixer

Mixing channels of the multichannel audio signal by Equation 2 below

Multichannel Audio Encoding System.

[Equation 2]

In this case, C (center) is the center channel, Ls (left surround) is the left surround channel, Rs (right surround) is the right surround channel, T, Q1, Q2 are the remaining multi-channel signals except stereo downmix signal.

delete

In the multi-channel audio decoding method,

A bitstream demultiplexing step of demultiplexing a multiplexed bitstream of a DAB scheme as an input signal into a spatial audio coding (SAC) bitstream and a spectra band replication (SBR) bitstream;

A multichannel audio decoding step of decoding the multiplexed multichannel audio bitstream to output a downmix signal and a multichannel audio signal;

An SBR decoding step of decoding the downmix signal and the SBR bitstream in an SBR manner and outputting a downmix signal in which a high frequency region is restored; And

SAC decoding step of outputting a multi-channel audio signal of a high sampling frequency by decoding the multi-channel audio signal by the SAC method using a spatial cue included in the SAC bitstream and the downmix signal from which the high frequency region is restored

Multichannel audio decoding method comprising a.

delete

The method of claim 12,

The multichannel audio decoding step

An AAC decoding step of decoding the multichannel audio bitstream and outputting the downmix signal and the multichannel audio signal; And

The channel remixing step of remixing channels of the multichannel audio signal decoded by the AAC decoding step and outputting the SAC decoding step

Multichannel audio decoding method comprising a.

The method of claim 12,

The downmix signal is

Which can be either stereo or mono

Multichannel audio decoding method.

In the multi-channel audio encoding method,

A downsampling step of downsampling a sampling frequency of a multi-channel audio signal having a high sampling frequency as an input signal;

A multichannel audio encoding step of encoding the downsampled multichannel audio signal and outputting the multichannel audio bitstream;

A SAC encoding step of encoding the input multi-channel audio signal by the SAC method and outputting a SAC bitstream and a downmix signal;

An SBR encoding step of outputting an SBR bitstream by encoding the multichannel audio signal, which is the input signal, by using the downmix signal output by the SAC encoding step by an SBR method; And

A bitstream multiplexing step of outputting a DAB multiplexed bitstream by multiplexing the multichannel audio bitstream, the SAC bitstream, and the SBR bitstream

Multi-channel audio encoding method comprising a.

The method of claim 16,

The multichannel audio encoding step

A channel mixing step of mixing channels of the downsampled multichannel audio signal; And

AAC encoder for decoding the multi-channel audio signal output by the channel mixing step and outputs the multi-channel audio bitstream

Multi-channel audio encoding method comprising a.

The method of claim 16,

The multichannel audio encoding step

The SBR encoding step

Encoding a multichannel audio signal which is the input signal by further using a downmix signal output by the multichannel audio encoding step

Multichannel audio encoding method.

The method according to claim 16 or 18,

The downmix signal is

Which can be either stereo or mono

Multichannel audio encoding method.

The method according to claim 16 or 18,

The downmix signal is

Generated by Equation 1 below

Multichannel audio encoding method.

[Equation 1]

The method of claim 17,

The channel mixing step

Mixing the channels of the downsampled multichannel audio signal by Equation 2

Multichannel audio encoding method.

[Equation 2]

delete