KR101435893B1

KR101435893B1 - Method and apparatus for encoding and decoding audio signal using band width extension technique and stereo encoding technique

Info

Publication number: KR101435893B1
Application number: KR1020070086337A
Authority: KR
Inventors: 오은미; 주기현; 김중회
Original assignee: 삼성전자주식회사
Priority date: 2006-09-22
Filing date: 2007-08-28
Publication date: 2014-09-02
Also published as: CN101518083A; KR20080027129A; CN101518083B

Abstract

본 발명은 오디오 신호의 부호화 방법에 관한 것으로, 입력 신호에서 스테레오 파라미터를 추출하여 부호화하고, 입력 신호를 다운믹싱하며, 다운믹싱된 신호를 고주파수 밴드 신호와 저주파수 밴드 신호로 분할하고, 고주파수 밴드 신호와 저주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인으로 변환하며, 변환된 저주파수 밴드 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하고, 변환된 저주파수 밴드 신호를 이용하여 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하며, 부호화된 스테레오 파라미터, 부호화된 비트플레인 및 부호화된 대역폭 확장 정보를 입력 신호에 대한 부호화 결과로써 출력함으로써, 한정된 비트율에서 고주파수 성분 및 스테레오 성분을 효율적으로 부호화하여 음질을 향상시킬 수 있다.The present invention relates to a method of encoding an audio signal, which comprises extracting and encoding a stereo parameter from an input signal, downmixing the input signal, dividing the downmixed signal into a high frequency band signal and a low frequency band signal, Frequency band signal from the time domain to the frequency domain, quantizes the converted low-frequency band signal, and encodes the low-frequency band signal into a bit plane on the basis of the context, and outputs a bandwidth representing the characteristics of the converted high-frequency band signal using the converted low- And outputs the encoded stereo parameter, the encoded bit plane, and the encoded bandwidth extension information as the encoding result for the input signal, thereby efficiently encoding the high frequency component and the stereo component at a limited bit rate, Can be improved.

Description

[0001] The present invention relates to a method and apparatus for encoding and decoding an audio signal using a bandwidth extension technique and a stereo coding technique,

본 발명은 오디오 신호의 부호화/복호화 방법 및 장치에 관한 것으로, 보다 상세하게는 대역폭 확장 기법 및 스테레오 부호화 기법을 이용한 오디오 신호의 부호화/복호화 방법 및 장치에 관한 것이다.The present invention relates to a method and apparatus for encoding / decoding an audio signal, and more particularly, to a method and apparatus for encoding / decoding an audio signal using a bandwidth extension technique and a stereo coding technique.

오디오 신호를 부호화하거나 복호화함에 있어서 한정된 비트율을 이용하여 최대한 음질을 향상시키는 것이 요구된다. 저비트율에서는 가용 비트가 적으므로 부호화되는 오디오 신호의 주파수 대역을 줄여서 부호화해야 하므로, 음질이 저하될 수 있다. It is required to improve the sound quality as much as possible by using a limited bit rate in encoding or decoding an audio signal. Since the number of available bits is small at a low bit rate, the frequency band of an audio signal to be encoded must be reduced and encoded, which may degrade the sound quality.

일반적으로, 저주파수 성분은 고주파수 성분에 비하여 인간이 지각하는데 중요하게 영향을 미친다. 따라서, 저주파수 성분을 부호화하는데 할당되는 비트를 늘리고, 고주파수 성분을 부호화하는데 할당되는 비트를 줄이면서 음질을 향상시킬 수 있는 방법이 요구된다. Generally, the low frequency components significantly affect the human perception compared to the high frequency components. Therefore, there is a need for a method capable of increasing the bit allocated for encoding the low-frequency component and improving the sound quality while reducing the bits allocated for encoding the high-frequency component.

또한, 두 채널 이상의 스테레오 신호는 한 채널의 모노 신호에 비하여 부호 화하거나 복호화함에 있어서 많은 비트가 할당된다. 따라서, 스테레오 신호를 부호화하는데 할당되는 비트를 줄이면서 음질을 향상시킬 수 있는 방법이 요구된다.In addition, many bits of stereo signals of more than two channels are allocated in encoding or decoding as compared with a mono signal of one channel. Accordingly, there is a need for a method that can improve the sound quality while reducing the bits allocated for encoding the stereo signal.

본 발명이 해결하고자 하는 과제는 한정된 비트율에서 스테레오 신호 및 고주파수 성분을 효율적으로 부호화하여 음질을 향상시킬 수 있는 오디오 신호의 부호화 방법 및 장치를 제공하는데 있다.SUMMARY OF THE INVENTION It is an object of the present invention to provide a method and apparatus for encoding an audio signal capable of efficiently encoding a stereo signal and a high frequency component at a limited bit rate to improve sound quality.

본 발명이 해결하고자 하는 다른 과제는 한정된 비트율에서 부호화된 비트스트림으로부터 고주파수 성분 및 스테레오 성분을 효율적으로 복호화하는 오디오 신호의 복호화 방법 및 장치를 제공하는데 있다.Another object of the present invention is to provide a method and apparatus for decoding an audio signal that efficiently decodes a high frequency component and a stereo component from a bit stream encoded at a limited bit rate.

상기 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 방법은 (a) 입력 신호에서 스테레오 파라미터를 추출하여 부호화하고, 상기 입력 신호를 다운믹싱하는 단계; (b) 상기 다운믹싱된 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 단계; (c) 상기 고주파수 밴드 신호 및 상기 저주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인으로 변환하는 단계; (d) 상기 변환된 저주파수 밴드 신호에 대하여 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 단계; (e) 상기 변환된 저주파수 밴드 신호를 이용하여 상기 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 단계; 및 (f) 상기 부호화된 스테레오 파라미터, 상기 부호화된 비트플레인 및 상기 부호화된 대역폭 확장 정보를 상기 입력 신호에 대한 부호화 결과로써 출력하는 단계를 포함한다.According to an aspect of the present invention, there is provided a method of encoding an audio signal, the method including: (a) extracting and encoding a stereo parameter from an input signal and downmixing the input signal; (b) dividing the downmixed signal into a high frequency band signal and a low frequency band signal; (c) converting the high-frequency band signal and the low-frequency band signal into a frequency domain in a time domain; (d) quantizing the transformed low frequency band signal and encoding the transformed low frequency band signal into a bit plane based on a context; (e) generating and encoding bandwidth extension information indicating characteristics of the converted high frequency band signal using the converted low frequency band signal; And (f) outputting the encoded stereo parameter, the encoded bit plane, and the encoded bandwidth extension information as a result of encoding the input signal.

또한, 상기 과제는 (a) 입력 신호에서 스테레오 파라미터를 추출하여 부호화하고, 상기 입력 신호를 다운믹싱하는 단계; (b) 상기 다운믹싱된 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 단계; (c) 상기 고주파수 밴드 신호 및 상기 저주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인으로 변환하는 단계; (d) 상기 변환된 저주파수 밴드 신호에 대하여 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 단계; (e) 상기 변환된 저주파수 밴드 신호를 이용하여 상기 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 단계; 및 (f) 상기 부호화된 스테레오 파라미터, 상기 부호화된 비트플레인 및 상기 부호화된 대역폭 확장 정보를 상기 입력 신호에 대한 부호화 결과로써 출력하는 단계를 포함하는 오디오 신호의 부호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록 매체에 의해 달성된다.According to another aspect of the present invention, there is provided a method for encoding a stereo signal, the method comprising: (a) extracting a stereo parameter from an input signal and encoding the input signal; (b) dividing the downmixed signal into a high frequency band signal and a low frequency band signal; (c) converting the high-frequency band signal and the low-frequency band signal into a frequency domain in a time domain; (d) quantizing the transformed low frequency band signal and encoding the transformed low frequency band signal into a bit plane based on a context; (e) generating and encoding bandwidth extension information indicating characteristics of the converted high frequency band signal using the converted low frequency band signal; And (f) outputting the encoded stereo parameter, the encoded bit plane, and the encoded bandwidth extension information as a result of encoding the input signal. The present invention is achieved by a recording medium readable by a computer.

또한, 상기 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 방법은 (a) 입력 신호에서 스테레오 파라미터를 추출하여 부호화하고, 상기 입력 신호를 다운믹싱하는 단계; (b) 상기 다운믹싱된 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 단계; (c) 상기 저주파수 밴드 신호를 제1 변환 방식에 의해 시간 도메인에서 주파수 도메인으로 변환하는 단계; (d) 상기 제1 변환 방식에 의해 주파수 도메인으로 변환된 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 단계; (e) 상기 고주파수 밴드 신호 및 상기 저주파수 밴드 신호를 제2 변환 방식에 의해 각각 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 단계; (f) 상기 제2 변환 방식에 의해 변환된 저주파수 밴 드 신호를 이용하여 상기 제2 변환 방식에 의해 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 단계; 및 (g) 상기 부호화된 스테레오 파라미터, 상기 부호화된 비트플레인 및 상기 부호화된 대역폭 확장 정보를 상기 입력 신호에 대한 부호화 결과로써 출력하는 단계를 포함한다.According to another aspect of the present invention, there is provided a method of encoding an audio signal, the method including: (a) extracting a stereo parameter from an input signal, encoding the input signal, and downmixing the input signal; (b) dividing the downmixed signal into a high frequency band signal and a low frequency band signal; (c) converting the low frequency band signal from the time domain to the frequency domain by a first conversion method; (d) quantizing a signal converted into the frequency domain by the first conversion method and encoding the signal into a bit plane based on a context; (e) converting the high frequency band signal and the low frequency band signal into a frequency domain or a time / frequency domain in a time domain by a second conversion method; (f) generating and encoding bandwidth extension information indicating characteristics of the high frequency band signal converted by the second conversion method using the low frequency band signal converted by the second conversion method; And (g) outputting the encoded stereo parameter, the encoded bit plane, and the encoded bandwidth extension information as a result of encoding the input signal.

또한, 상기 다른 과제는 (a) 입력 신호에서 스테레오 파라미터를 추출하여 부호화하고, 상기 입력 신호를 다운믹싱하는 단계; (b) 상기 다운믹싱된 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 단계; (c) 상기 저주파수 밴드 신호를 제1 변환 방식에 의해 시간 도메인에서 주파수 도메인으로 변환하는 단계; (d) 상기 제1 변환 방식에 의해 주파수 도메인으로 변환된 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 단계; (e) 상기 고주파수 밴드 신호 및 상기 저주파수 밴드 신호를 제2 변환 방식에 의해 각각 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 단계; (f) 상기 제2 변환 방식에 의해 변환된 저주파수 밴드 신호를 이용하여 상기 제2 변환 방식에 의해 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 단계; 및 (g) 상기 부호화된 스테레오 파라미터, 상기 부호화된 비트플레인 및 상기 부호화된 대역폭 확장 정보를 상기 입력 신호에 대한 부호화 결과로써 출력하는 단계를 포함하는 오디오 신호의 부호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록 매체에 의해 달성된다.According to another aspect of the present invention, there is provided a method of generating a stereo signal, the method including: (a) extracting a stereo parameter from an input signal, encoding the signal, and downmixing the input signal; (b) dividing the downmixed signal into a high frequency band signal and a low frequency band signal; (c) converting the low frequency band signal from the time domain to the frequency domain by a first conversion method; (d) quantizing a signal converted into the frequency domain by the first conversion method and encoding the signal into a bit plane based on a context; (e) converting the high frequency band signal and the low frequency band signal into a frequency domain or a time / frequency domain in a time domain by a second conversion method; (f) generating and encoding bandwidth extension information indicating characteristics of the high frequency band signal converted by the second conversion method using the low frequency band signal converted by the second conversion method; And (g) outputting the encoded stereo parameter, the encoded bit plane, and the encoded bandwidth extension information as a result of encoding the input signal. The present invention is achieved by a recording medium readable by a computer.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 방법은 (a) 입력 신호에서 스테레오 파라미터를 추출하여 부호화하고, 상기 입력 신호를 다운믹싱하는 단계; (b) 상기 다운믹싱된 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 단계; (c) 상기 저주파수 밴드 신호를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정하는 단계; (d) 상기 저주파수 밴드 신호를 시간 도메인에서 부호화하는 것으로 결정된 경우 상기 저주파수 밴드 신호를 시간 도메인에서 부호화하는 단계; (e) 상기 저주파수 밴드 신호를 주파수 도메인에서 부호화하는 것으로 결정된 경우 상기 저주파수 밴드 신호를 제1 변환 방식에 의해 시간 도메인에서 주파수 도메인으로 변환하고, 상기 제1 변환 방식에 의해 주파수 도메인으로 변환된 저주파수 밴드 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 단계; (f) 상기 저주파수 밴드 신호 및 상기 고주파수 밴드 신호를 제2 변환 방식에 의해 각각 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 단계; (g) 상기 변환된 저주파수 밴드 신호를 이용하여 상기 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 단계; 및 (h) 상기 부호화된 스테레오 파라미터, 상기 시간 도메인에서 부호화된 결과, 상기 부호화된 비트플레인 및 상기 부호화된 대역폭 확장 정보를 상기 입력 신호에 대한 부호화 결과로써 출력하는 단계를 포함한다.According to another aspect of the present invention, there is provided a method of encoding an audio signal, the method comprising the steps of: (a) extracting a stereo parameter from an input signal and encoding the input signal, and downmixing the input signal; (b) dividing the downmixed signal into a high frequency band signal and a low frequency band signal; (c) determining whether to encode the low frequency band signal in the time domain or the frequency domain; (d) encoding the low frequency band signal in the time domain if it is determined to encode the low frequency band signal in the time domain; (e) if it is determined to encode the low-frequency band signal in the frequency domain, the low-frequency band signal is converted into a frequency domain in the time domain by a first conversion method, and a low- Encoding a signal into a bit plane based on a context; (f) converting the low-frequency band signal and the high-frequency band signal into a frequency domain or a time / frequency domain in a time domain by a second conversion method; (g) generating and encoding bandwidth extension information indicating characteristics of the converted high frequency band signal using the converted low frequency band signal; And (h) outputting the encoded stereo parameter, the time domain encoded result, the encoded bit plane, and the encoded bandwidth extension information as a result of encoding the input signal.

또한, 상기 또 다른 과제는 (a) 입력 신호에서 스테레오 파라미터를 추출하여 부호화하고, 상기 입력 신호를 다운믹싱하는 단계; (b) 상기 다운믹싱된 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 단계; (c) 상기 저주파수 밴드 신호를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결 정하는 단계; (d) 상기 저주파수 밴드 신호를 시간 도메인에서 부호화하는 것으로 결정된 경우 상기 저주파수 밴드 신호를 시간 도메인에서 부호화하는 단계; (e) 상기 저주파수 밴드 신호를 주파수 도메인에서 부호화하는 것으로 결정된 경우 상기 저주파수 밴드 신호를 제1 변환 방식에 의해 시간 도메인에서 주파수 도메인으로 변환하고, 상기 제1 변환 방식에 의해 주파수 도메인으로 변환된 저주파수 밴드 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 단계; (f) 상기 저주파수 밴드 신호 및 상기 고주파수 밴드 신호를 제2 변환 방식에 의해 각각 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 단계; (g) 상기 변환된 저주파수 밴드 신호를 이용하여 상기 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 단계; 및 (h) 상기 부호화된 스테레오 파라미터, 상기 시간 도메인에서 부호화된 결과, 상기 부호화된 비트플레인 및 상기 부호화된 대역폭 확장 정보를 상기 입력 신호에 대한 부호화 결과로써 출력하는 단계를 포함하는 오디오 신호의 부호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록 매체에 의해 달성된다.According to another aspect of the present invention, there is provided a method of generating a stereo signal, the method including: (a) extracting a stereo parameter from an input signal, encoding the signal, and downmixing the input signal; (b) dividing the downmixed signal into a high frequency band signal and a low frequency band signal; (c) determining whether to encode the low frequency band signal in the time domain or the frequency domain; (d) encoding the low frequency band signal in the time domain if it is determined to encode the low frequency band signal in the time domain; (e) if it is determined to encode the low-frequency band signal in the frequency domain, the low-frequency band signal is converted into a frequency domain in the time domain by a first conversion method, and a low- Encoding a signal into a bit plane based on a context; (f) converting the low-frequency band signal and the high-frequency band signal into a frequency domain or a time / frequency domain in a time domain by a second conversion method; (g) generating and encoding bandwidth extension information indicating characteristics of the converted high frequency band signal using the converted low frequency band signal; And (h) outputting the encoded stereo parameter, the time domain encoded result, the encoded bit plane, and the encoded bandwidth extension information as a result of encoding the input signal. The present invention is achieved by a computer-readable recording medium having recorded thereon a program for executing the program.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 방법은 (a) 입력 신호를 시간 도메인에서 주파수 도메인으로 변환하는 단계; (b) 상기 변환된 신호에서 스테레오 파라미터를 추출하여 부호화하고, 상기 변환된 신호를 다운믹싱하는 단계; (c) 상기 다운믹싱된 신호에서 대역폭 확장 정보를 추출하여 부호화하는 단계; (d) 상기 다운믹싱된 신호를 시간 도메인으로 역변환하는 단계; (e) 상기 역변환된 신호를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정하고, 상기 결정 결과에 따라 상기 역변환된 신호를 서브 밴드 별로 시간 도메인 또는 주파수 도메인으로 변환하는 단계; (f) 상기 역변환된 신호를 시간 도메인에서 부호화하는 것으로 결정된 경우 상기 시간 도메인으로 변환된 신호를 시간 도메인에서 부호화하는 단계; (g) 상기 역변환된 신호를 주파수 도메인으로 부호화하는 것으로 결정된 경우 상기 주파수 도메인으로 변환된 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 단계; 및 (h) 상기 부호화된 스테레오 파라미터, 상기 부호화된 대역폭 확장 정보, 상기 시간 도메인에서 부호화된 결과 및 상기 부호화된 비트플레인을 상기 입력 신호에 대한 부호화 결과로써 출력하는 단계를 포함한다.According to another aspect of the present invention, there is provided a method of encoding an audio signal, comprising: (a) converting an input signal from a time domain to a frequency domain; (b) extracting and encoding a stereo parameter from the converted signal, and downmixing the converted signal; (c) extracting bandwidth extension information from the downmixed signal and encoding the bandwidth extension information; (d) inversely transforming the downmixed signal into a time domain; (e) determining whether to encode the inverse-converted signal in a time domain or a frequency domain, and converting the inverse-transformed signal into a time domain or a frequency domain on a subband-by-subband basis according to the determination result; (f) encoding the signal transformed into the time domain in the time domain if it is determined to encode the inverse transformed signal in the time domain; (g) quantizing the signal transformed into the frequency domain if the inverse-transformed signal is determined to be encoded in the frequency domain, and encoding the quantized signal into a bit plane based on the context; And (h) outputting the encoded stereo parameter, the encoded bandwidth extension information, the encoded result in the time domain, and the encoded bit plane as a result of encoding the input signal.

또한, 상기 또 다른 과제는 (a) 입력 신호를 시간 도메인에서 주파수 도메인으로 변환하는 단계; (b) 상기 변환된 신호에서 스테레오 파라미터를 추출하여 부호화하고, 상기 변환된 신호를 다운믹싱하는 단계; (c) 상기 다운믹싱된 신호에서 대역폭 확장 정보를 추출하여 부호화하는 단계; (d) 상기 다운믹싱된 신호를 시간 도메인으로 역변환하는 단계; (e) 상기 역변환된 신호를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정하고, 상기 결정 결과에 따라 상기 역변환된 신호를 서브 밴드 별로 시간 도메인 또는 주파수 도메인으로 변환하는 단계; (f) 상기 역변환된 신호를 시간 도메인에서 부호화하는 것으로 결정된 경우 상기 시간 도메인으로 변환된 신호를 시간 도메인에서 부호화하는 단계; (g) 상기 역변환된 신호를 주파수 도메인으로 부호화하는 것으로 결정된 경우 상기 주파수 도메인으로 변환된 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 단계; 및 (h) 상기 부호화된 스테레오 파라미터, 상기 부호화된 대역폭 확장 정보, 상기 시간 도메인에서 부호화된 결과 및 상기 부호화된 비트플레인을 상기 입력 신호에 대한 부호화 결과로써 출력하는 단계를 포함하는 오디오 신호의 부호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록 매체에 의해 달성된다.In another aspect of the present invention, there is provided a method of converting an input signal into a frequency domain from a time domain; (b) extracting and encoding a stereo parameter from the converted signal, and downmixing the converted signal; (c) extracting bandwidth extension information from the downmixed signal and encoding the bandwidth extension information; (d) inversely transforming the downmixed signal into a time domain; (e) determining whether to encode the inverse-converted signal in a time domain or a frequency domain, and converting the inverse-transformed signal into a time domain or a frequency domain on a subband-by-subband basis according to the determination result; (f) encoding the signal transformed into the time domain in the time domain if it is determined to encode the inverse transformed signal in the time domain; (g) quantizing the signal transformed into the frequency domain if the inverse-transformed signal is determined to be encoded in the frequency domain, and encoding the quantized signal into a bit plane based on the context; And (h) outputting the encoded stereo parameter, the encoded bandwidth extension information, the result encoded in the time domain, and the encoded bit plane as a result of encoding the input signal. The present invention is achieved by a computer-readable recording medium having recorded thereon a program for executing the program.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 방법은 (a) 입력 신호를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정하고, 상기 결정 결과에 따라 상기 입력 신호를 서브 밴드 별로 시간 도메인 또는 주파수 도메인으로 변환하는 단계; (b) 상기 변환된 신호에서 스테레오 파라미터를 추출하여 부호화하고, 상기 변환된 신호를 다운믹싱하는 단계; (c) 상기 다운믹싱된 신호에서 대역폭 확장 정보를 추출하여 부호화하는 단계; (d) 상기 다운믹싱된 신호가 시간 도메인에서 부호화하는 것으로 결정된 경우 상기 다운믹싱된 신호를 시간 도메인에서 부호화하는 단계; (e) 상기 다운믹싱된 신호가 주파수 도메인에서 부호화하는 것으로 결정된 경우 상기 다운믹싱된 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 단계; 및 (f) 상기 부호화된 스테레오 파라미터, 상기 부호화된 대역폭 확장 정보, 상기 시간 도메인에서 부호화된 결과 및 상기 부호화된 비트플레인을 상기 입력 신호에 대한 부호화 결과로써 출력하는 단계를 포함한다.According to another aspect of the present invention, there is provided a method of encoding an audio signal, comprising: (a) determining whether to encode an input signal in a time domain or a frequency domain; Transforming each subband into a time domain or a frequency domain; (b) extracting and encoding a stereo parameter from the converted signal, and downmixing the converted signal; (c) extracting bandwidth extension information from the downmixed signal and encoding the bandwidth extension information; (d) encoding the downmixed signal in the time domain if it is determined that the downmixed signal is to be encoded in the time domain; (e) if the downmixed signal is determined to be encoded in the frequency domain, encoding the downmixed signal into a bit plane based on a context; And (f) outputting the encoded stereo parameter, the encoded bandwidth extension information, the result encoded in the time domain, and the encoded bit plane as a coding result for the input signal.

또한, 상기 또 다른 과제는 (a) 입력 신호를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정하고, 상기 결정 결과에 따라 상기 입력 신호를 서브 밴드 별로 시간 도메인 또는 주파수 도메인으로 변환하는 단계; (b) 상기 변환된 신호에서 스테레오 파라미터를 추출하여 부호화하고, 상기 변환된 신호를 다운믹싱하는 단계; (c) 상기 다운믹싱된 신호에서 대역폭 확장 정보를 추출하여 부호화하는 단계; (d) 상기 다운믹싱된 신호가 시간 도메인에서 부호화하는 것으로 결정된 경우 상기 다운믹싱된 신호를 시간 도메인에서 부호화하는 단계; (e) 상기 다운믹싱된 신호가 주파수 도메인에서 부호화하는 것으로 결정된 경우 상기 다운믹싱된 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 단계; 및 (f) 상기 부호화된 스테레오 파라미터, 상기 부호화된 대역폭 확장 정보, 상기 시간 도메인에서 부호화된 결과 및 상기 부호화된 비트플레인을 상기 입력 신호에 대한 부호화 결과로써 출력하는 단계를 포함하는 오디오 신호의 부호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록 매체에 의해 달성된다.According to another aspect of the present invention, there is provided a method of encoding an input signal, the method comprising: (a) determining whether to encode the input signal in a time domain or a frequency domain; and converting the input signal into a time domain or a frequency domain on a subband- (b) extracting and encoding a stereo parameter from the converted signal, and downmixing the converted signal; (c) extracting bandwidth extension information from the downmixed signal and encoding the bandwidth extension information; (d) encoding the downmixed signal in the time domain if it is determined that the downmixed signal is to be encoded in the time domain; (e) if the downmixed signal is determined to be encoded in the frequency domain, encoding the downmixed signal into a bit plane based on a context; And (f) outputting the encoded stereo parameter, the encoded bandwidth extension information, the encoded result in the time domain, and the encoded bit plane as a result of encoding the input signal. The present invention is achieved by a computer-readable recording medium having recorded thereon a program for executing the program.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 복호화 방법은 (a) 오디오 신호의 부호화 결과를 입력받는 단계; (b) 상기 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하는 단계; (c) 상기 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 단계; (d) 상기 저주파수 밴드 신호 및 상기 고주파수 밴드 신호 각각을 제1 역변환 방식에 의해 주파수 도메인에서 시간 도메인으로 변환하는 단계; (e) 상기 역변환된 저주파수 밴드 신호 및 상기 역변환된 고주파수 밴드 신호를 합성하는 단계; 및 (f) 상기 부호화 결과에 포함된 부호화된 스테레오 파라미터를 복호화하고, 상기 복호화된 스테레오 파라미터를 이용하여 상기 합성된 신호를 업믹싱하는 단계를 포함한다.According to another aspect of the present invention, there is provided a method of decoding an audio signal, the method including: (a) receiving a result of encoding an audio signal; (b) generating a low frequency band signal by decoding the encoded bit plane included in the encoding result based on the context and dequantizing the encoded bit plane; (c) decoding the encoded bandwidth extension information included in the encoding result, and generating a high frequency band signal from the low frequency band signal using the decoded bandwidth extension information; (d) converting each of the low-frequency band signal and the high-frequency band signal into a time domain from a frequency domain by a first inverse transformation method; (e) combining the inversely transformed low frequency band signal and the inverse transformed high frequency band signal; And (f) decoding the encoded stereo parameter included in the encoding result and upmixing the synthesized signal using the decoded stereo parameter.

또한, 상기 또 다른 과제는 (a) 오디오 신호의 부호화 결과를 입력받는 단계; (b) 상기 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하는 단계; (c) 상기 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 단계; (d) 상기 저주파수 밴드 신호 및 상기 고주파수 밴드 신호 각각을 제1 역변환 방식에 의해 주파수 도메인에서 시간 도메인으로 변환하는 단계; (e) 상기 역변환된 저주파수 밴드 신호 및 상기 역변환된 고주파수 밴드 신호를 합성하는 단계; 및 (f) 상기 부호화 결과에 포함된 부호화된 스테레오 파라미터를 복호화하고, 상기 복호화된 스테레오 파라미터를 이용하여 상기 합성된 신호를 업믹싱하는 단계를 포함하는 오디오 신호의 복호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록 매체에 의해 달성된다.According to another aspect of the present invention, there is provided a method of encoding an audio signal, (b) generating a low frequency band signal by decoding the encoded bit plane included in the encoding result based on the context and dequantizing the encoded bit plane; (c) decoding the encoded bandwidth extension information included in the encoding result, and generating a high frequency band signal from the low frequency band signal using the decoded bandwidth extension information; (d) converting each of the low-frequency band signal and the high-frequency band signal into a time domain from a frequency domain by a first inverse transformation method; (e) combining the inversely transformed low frequency band signal and the inverse transformed high frequency band signal; And (f) decoding the encoded stereo parameter included in the encoding result and upmixing the synthesized signal using the decoded stereo parameter, the method comprising: recording a program for executing an audio signal decoding method And is achieved by a computer-readable recording medium.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 복호화 방법은 (a) 오디오 신호의 부호화 결과를 입력받는 단계; (b) 상기 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하는 단계; (c) 상기 저주파수 밴드 신호를 제1 역변환 방식에 의해 주파수 도메인에서 시간 도메인으로 변환하는 단계; (d) 상기 제1 역변 환 방식에 의해 역변환된 저주파수 밴드 신호를 제1 변환 방식에 의해 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 단계; (e) 상기 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 제1 변환 방식에 의해 주파수 도메인 또는 시간/주파수 도메인으로 변환된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 단계; (f) 상기 고주파수 밴드 신호를 제2 역변환 방식에 의해 시간 도메인으로 역변환하는 단계; (g) 상기 역변환된 고주파수 밴드 신호 및 상기 변환된 저주파수 밴드 신호를 합성하는 단계; 및 (h) 상기 부호화 결과에 포함된 부호화된 스테레오 파라미터를 복호화하고, 상기 복호화된 스테레오 파라미터를 이용하여 상기 합성된 신호를 업믹싱하는 단계를 포함한다.According to another aspect of the present invention, there is provided a method of decoding an audio signal, the method including: (a) receiving a result of encoding an audio signal; (b) generating a low frequency band signal by decoding the encoded bit plane included in the encoding result based on the context and dequantizing the encoded bit plane; (c) converting the low frequency band signal from the frequency domain to the time domain by a first inverse transformation method; (d) converting a low-frequency band signal inversely transformed by the first inverse transformation method into a frequency domain or a time / frequency domain by a first conversion method; (e) decoding the encoded bandwidth extension information included in the encoding result, and extracting a high frequency band from the low frequency band signal converted into the frequency domain or the time / frequency domain by the first conversion method using the decoded bandwidth extension information, Generating a signal; (f) inversely transforming the high frequency band signal into a time domain by a second inverse transformation method; (g) combining the inversely transformed high frequency band signal and the converted low frequency band signal; And (h) decoding the encoded stereo parameter included in the encoding result, and upmixing the synthesized signal using the decoded stereo parameter.

또한, 상기 또 다른 과제는 (a) 오디오 신호의 부호화 결과를 입력받는 단계; (b) 상기 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하는 단계; (c) 상기 저주파수 밴드 신호를 제1 역변환 방식에 의해 주파수 도메인에서 시간 도메인으로 변환하는 단계; (d) 상기 제1 역변환 방식에 의해 역변환된 저주파수 밴드 신호를 제1 변환 방식에 의해 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 단계; (e) 상기 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 제1 변환 방식에 의해 주파수 도메인 또는 시간/주파수 도메인으로 변환된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 단계; (f) 상기 고주파수 밴드 신호를 제2 역변환 방식에 의해 시간 도메인으로 역변환하는 단계; (g) 상기 역변환된 고주파수 밴드 신호 및 상기 변환된 저주파수 밴드 신호를 합성하는 단계; 및 (h) 상기 부호화 결과에 포함된 부호화된 스테레오 파라미터를 복호화하고, 상기 복호화된 스테레오 파라미터를 이용하여 상기 합성된 신호를 업믹싱하는 단계를 포함하는 오디오 신호의 복호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록 매체에 의해 달성된다.According to another aspect of the present invention, there is provided a method of encoding an audio signal, (b) generating a low frequency band signal by decoding the encoded bit plane included in the encoding result based on the context and dequantizing the encoded bit plane; (c) converting the low frequency band signal from the frequency domain to the time domain by a first inverse transformation method; (d) converting a low-frequency band signal inversely transformed by the first inverse transform method into a frequency domain or a time / frequency domain by a first conversion method; (e) decoding the encoded bandwidth extension information included in the encoding result, and extracting a high frequency band from the low frequency band signal converted into the frequency domain or the time / frequency domain by the first conversion method using the decoded bandwidth extension information, Generating a signal; (f) inversely transforming the high frequency band signal into a time domain by a second inverse transformation method; (g) combining the inversely transformed high frequency band signal and the converted low frequency band signal; And (h) decoding the encoded stereo parameters included in the encoding result, and upmixing the synthesized signal using the decoded stereo parameters, and recording a program for executing the decoding method of the audio signal And is achieved by a computer-readable recording medium.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 복호화 방법은 (a) 오디오 신호의 시간 도메인 또는 주파수 도메인에서의 부호화 결과를 입력받는 단계; (b) 상기 주파수 도메인에서의 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하는 단계; (c) 상기 저주파수 밴드 신호를 제1 역변환 방식에 의해 시간 도메인으로 역변환하는 단계; (d) 상기 제1 역변환 방식에 의해 시간 도메인으로 역변환된 신호를 제1 변환 방식에 의해 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 단계; (e) 상기 주파수 도메인에서의 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 제1 변환 방식에 의해 주파수 도메인 또는 시간/주파수 도메인으로 변환된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 단계; (f) 상기 고주파수 밴드 신호를 제2 역변환 방식에 의해 시간 도메인으로 역변환하는 단계; (g) 상기 시간 도메인에서의 부호화 결과를 시간 도메인에서 복호화하여 상기 저주파수 밴드 신호를 생성하는 단계; (h) 상기 제1 역변환 방식에 의해 시간 도메인으로 역변환된 신호, 상기 제2 역변환 방식에 의해 시간 도메인으로 역변환된 고주파수 밴드 신호, 및 상기 시간 도메인에서 복호화된 저주파수 밴드 신호를 합성하는 단계; 및 (i) 상기 부호화 결과에 포함된 부호화된 스테레오 파라미터를 복호화하고, 상기 복호화된 스테레오 파라미터를 이용하여 상기 합성된 신호를 업믹싱하는 단계를 포함한다.According to another aspect of the present invention, there is provided a method of decoding an audio signal, the method including the steps of: (a) receiving a result of encoding an audio signal in a time domain or a frequency domain; (b) generating a low frequency band signal by decoding and inverse-quantizing the encoded bit planes included in the encoding result in the frequency domain based on the context; (c) inversely transforming the low frequency band signal into a time domain by a first inverse transform method; (d) converting a signal inversely transformed into a time domain by the first inverse transform method into a frequency domain or a time / frequency domain by a first transformation method; (e) decoding the encoded bandwidth extension information included in the encoding result in the frequency domain, and decoding the encoded low frequency band information in the frequency domain or the time / frequency domain by the first conversion method using the decoded bandwidth extension information. Generating a high frequency band signal from the signal; (f) inversely transforming the high frequency band signal into a time domain by a second inverse transformation method; (g) generating the low frequency band signal by decoding the encoding result in the time domain in the time domain; (h) synthesizing a signal inversely transformed into a time domain by the first inverse transform method, a high frequency band signal inversely transformed into a time domain by the second inverse transform method, and a low frequency band signal decoded in the time domain; And (i) decoding the encoded stereo parameter included in the encoding result, and upmixing the synthesized signal using the decoded stereo parameter.

또한, 상기 또 다른 과제는 (a) 오디오 신호의 시간 도메인 또는 주파수 도메인에서의 부호화 결과를 입력받는 단계; (b) 상기 주파수 도메인에서의 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하는 단계; (c) 상기 저주파수 밴드 신호를 제1 역변환 방식에 의해 시간 도메인으로 역변환하는 단계; (d) 상기 제1 역변환 방식에 의해 시간 도메인으로 역변환된 신호를 제1 변환 방식에 의해 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 단계; (e) 상기 주파수 도메인에서의 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 제1 변환 방식에 의해 주파수 도메인 또는 시간/주파수 도메인으로 변환된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 단계; (f) 상기 고주파수 밴드 신호를 제2 역변환 방식에 의해 시간 도메인으로 역변환하는 단계; (g) 상기 시간 도메인에서의 부호화 결과를 시간 도메인에서 복호화하여 상기 저주파수 밴드 신호를 생성하는 단계; (h) 상기 제1 역변환 방식에 의해 시간 도메인으로 역변환된 신호, 상기 제2 역변환 방식에 의해 시간 도메인으로 역변환된 고주파수 밴드 신호, 및 상기 시간 도메인에서 복호화된 저주파수 밴드 신호를 합성하는 단계; 및 (i) 상기 부호화 결과에 포함된 부호화된 스테레오 파라미터를 복호화하고, 상기 복호화된 스테레오 파라미터를 이용하여 상기 합성된 신호를 업믹싱 하는 단계를 포함하는 오디오 신호의 복호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록 매체에 의해 달성된다.According to another aspect of the present invention, there is provided a method for encoding an audio signal, the method comprising: (a) receiving a result of encoding an audio signal in a time domain or a frequency domain; (b) generating a low frequency band signal by decoding and inverse-quantizing the encoded bit planes included in the encoding result in the frequency domain based on the context; (c) inversely transforming the low frequency band signal into a time domain by a first inverse transform method; (d) converting a signal inversely transformed into a time domain by the first inverse transform method into a frequency domain or a time / frequency domain by a first transformation method; (e) decoding the encoded bandwidth extension information included in the encoding result in the frequency domain, and decoding the encoded low frequency band information in the frequency domain or the time / frequency domain by the first conversion method using the decoded bandwidth extension information. Generating a high frequency band signal from the signal; (f) inversely transforming the high frequency band signal into a time domain by a second inverse transformation method; (g) generating the low frequency band signal by decoding the encoding result in the time domain in the time domain; (h) synthesizing a signal inversely transformed into a time domain by the first inverse transform method, a high frequency band signal inversely transformed into a time domain by the second inverse transform method, and a low frequency band signal decoded in the time domain; And (i) decoding the encoded stereo parameter included in the encoding result and upmixing the synthesized signal using the decoded stereo parameter, and recording a program for executing the decoding method of the audio signal And is achieved by a computer-readable recording medium.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 복호화 방법은 (a) 오디오 신호의 주파수 도메인 또는 시간 도메인에서의 부호화 결과를 입력받는 단계; (b) 상기 주파수 도메인에서의 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하는 단계; (c) 상기 시간 도메인에서의 부호화 결과를 시간 도메인에서 복호화하는 단계; (d) 상기 (b) 단계에서 역양자화된 신호 또는 상기 (c) 단계에서 복호화된 신호에 대하여 역 FV-MLT를 수행하여 시간 도메인으로 변환하는 단계; (e) 상기 변환된 신호를 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 단계; (f) 상기 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 주파수 도메인 또는 시간/주파수 도메인으로 변환된 신호로부터 전 대역의 신호를 생성하는 단계; (g) 상기 부호화 결과에 포함된 부호화된 스테레오 파라미터를 복호화하고, 상기 복호화된 스테레오 파라미터를 이용하여 상기 복호화된 신호를 업믹싱하는 단계; 및 (h) 상기 업믹싱된 신호를 시간 도메인으로 역변환하는 단계를 포함한다.According to another aspect of the present invention, there is provided a method of decoding an audio signal, the method comprising the steps of: (a) receiving an encoding result in a frequency domain or a time domain of an audio signal; (b) decoding and inverse-quantizing the encoded bit planes included in the encoding result in the frequency domain based on the context; (c) decoding the encoded result in the time domain in the time domain; (d) performing an inverse FV-MLT on the signal dequantized in the step (b) or the signal decoded in the step (c) so as to convert the signal into the time domain; (e) converting the transformed signal into a frequency domain or a time / frequency domain; (f) decoding the encoded bandwidth extension information included in the encoding result, and generating a full-band signal from the frequency domain or time / frequency domain converted signal using the decoded bandwidth extension information; (g) decoding the encoded stereo parameter included in the encoding result, and upmixing the decoded signal using the decoded stereo parameter; And (h) inversely transforming the upmixed signal into a time domain.

또한, 상기 또 다른 과제는 (a) 오디오 신호의 주파수 도메인 또는 시간 도메인에서의 부호화 결과를 입력받는 단계; (b) 상기 주파수 도메인에서의 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하는 단계; (c) 상기 시간 도메인에서의 부호화 결과를 시간 도메인에서 복호화하는 단 계; (d) 상기 (b) 단계에서 역양자화된 신호 또는 상기 (c) 단계에서 복호화된 신호에 대하여 역 FV-MLT를 수행하여 시간 도메인으로 변환하는 단계; (e) 상기 변환된 신호를 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 단계; (f) 상기 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 주파수 도메인 또는 시간/주파수 도메인으로 변환된 신호로부터 전 대역의 신호를 생성하는 단계; (g) 상기 부호화 결과에 포함된 부호화된 스테레오 파라미터를 복호화하고, 상기 복호화된 스테레오 파라미터를 이용하여 상기 복호화된 신호를 업믹싱하는 단계; 및 (h) 상기 업믹싱된 신호를 시간 도메인으로 역변환하는 단계를 포함하는 오디오 신호의 복호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록 매체에 의해 달성된다.According to another aspect of the present invention, there is provided a method of encoding an audio signal, the method comprising: (a) receiving an encoding result in a frequency domain or a time domain of an audio signal; (b) decoding and inverse-quantizing the encoded bit planes included in the encoding result in the frequency domain based on the context; (c) decoding the encoded result in the time domain in the time domain; (d) performing an inverse FV-MLT on the signal dequantized in the step (b) or the signal decoded in the step (c) so as to convert the signal into the time domain; (e) converting the transformed signal into a frequency domain or a time / frequency domain; (f) decoding the encoded bandwidth extension information included in the encoding result, and generating a full-band signal from the frequency domain or time / frequency domain converted signal using the decoded bandwidth extension information; (g) decoding the encoded stereo parameter included in the encoding result, and upmixing the decoded signal using the decoded stereo parameter; And (h) inversely transforming the upmixed signal into a time domain. The above and other objects, features and advantages of the present invention will become more apparent from the following detailed description when read in conjunction with the accompanying drawings.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 복호화 방법은 (a) 오디오 신호의 주파수 도메인 또는 시간 도메인에서의 부호화 결과를 입력받는 단계; (b) 상기 주파수 도메인에서의 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하는 단계; (c) 상기 시간 도메인에서의 부호화 결과를 시간 도메인에서 복호화하는 단계; (d) 상기 (c) 단계에서 복호화된 신호에 대하여 MDCT를 수행하여 시간 도메인에서 주파수 도메인으로 변환하는 단계; (e) 상기 주파수 도메인에서의 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 주파수 도메인에서 복호화된 신호 또는 상기 주파수 도메인으로 변환된 신호로부터 전 대역의 신호를 생성하는 단계; (f) 상기 부호화 결과에 포함된 부호화된 스테레 오 파라미터를 복호화하고, 상기 복호화된 스테레오 파라미터를 이용하여 상기 복호화된 신호를 업믹싱하는 단계; 및 (g) 상기 업믹싱된 신호에 대하여 역 FV-MLT를 적용하여 시간 도메인으로 변환하는 단계를 포함한다.According to another aspect of the present invention, there is provided a method of decoding an audio signal, the method comprising the steps of: (a) receiving an encoding result in a frequency domain or a time domain of an audio signal; (b) decoding and inverse-quantizing the encoded bit planes included in the encoding result in the frequency domain based on the context; (c) decoding the encoded result in the time domain in the time domain; (d) performing MDCT on the signal decoded in the step (c) to convert the time domain to the frequency domain; (e) decoding the encoded bandwidth extension information included in the encoding result in the frequency domain, extracting the encoded bandwidth extension information from the decoded signal in the frequency domain or the signal converted in the frequency domain using the decoded bandwidth extension information, Generating a signal; (f) decoding the encoded stereo parameter included in the encoding result, and upmixing the decoded signal using the decoded stereo parameter; And (g) transforming the upmixed signal into a time domain by applying an inverse FV-MLT.

또한, 상기 또 다른 과제는 (a) 오디오 신호의 주파수 도메인 또는 시간 도메인에서의 부호화 결과를 입력받는 단계; (b) 상기 주파수 도메인에서의 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하는 단계; (c) 상기 시간 도메인에서의 부호화 결과를 시간 도메인에서 복호화하는 단계; (d) 상기 (c) 단계에서 복호화된 신호에 대하여 MDCT를 수행하여 시간 도메인에서 주파수 도메인으로 변환하는 단계; (e) 상기 주파수 도메인에서의 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 주파수 도메인에서 복호화된 신호 또는 상기 주파수 도메인으로 변환된 신호로부터 전 대역의 신호를 생성하는 단계; (f) 상기 부호화 결과에 포함된 부호화된 스테레오 파라미터를 복호화하고, 상기 복호화된 스테레오 파라미터를 이용하여 상기 복호화된 신호를 업믹싱하는 단계; 및 (g) 상기 업믹싱된 신호에 대하여 역 FV-MLT를 적용하여 시간 도메인으로 변환하는 단계를 포함하는오디오 신호의 복호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록 매체에 의해 달성된다.According to another aspect of the present invention, there is provided a method of encoding an audio signal, the method comprising: (a) receiving an encoding result in a frequency domain or a time domain of an audio signal; (b) decoding and inverse-quantizing the encoded bit planes included in the encoding result in the frequency domain based on the context; (c) decoding the encoded result in the time domain in the time domain; (d) performing MDCT on the signal decoded in the step (c) to convert the time domain to the frequency domain; (e) decoding the encoded bandwidth extension information included in the encoding result in the frequency domain, extracting the encoded bandwidth extension information from the decoded signal in the frequency domain or the signal converted in the frequency domain using the decoded bandwidth extension information, Generating a signal; (f) decoding the encoded stereo parameter included in the encoding result, and upmixing the decoded signal using the decoded stereo parameter; And (g) transforming the upmixed signal into a time domain by applying an inverse FV-MLT to the upmixed signal. .

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 장치는 입력 신호에서 스테레오 파라미터를 추출하여 부호화하고, 상기 입력 신호를 다운믹싱하는 스테레오 부호화부; 상기 다운믹싱된 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 밴드 분할부; 상기 고주파수 밴드 신호 및 상기 저주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인으로 변환하는 변환부; 상기 변환된 저주파수 밴드 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 저주파수 밴드 부호화부; 및 상기 변환된 저주파수 밴드 신호를 이용하여 상기 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 대역폭 확장 부호화부를 포함한다.According to another aspect of the present invention, there is provided an apparatus for encoding an audio signal, the apparatus including: a stereo encoding unit for extracting and encoding a stereo parameter from an input signal and downmixing the input signal; A band dividing unit dividing the downmixed signal into a high frequency band signal and a low frequency band signal; A converter for converting the high-frequency band signal and the low-frequency band signal into time domain to frequency domain; A low frequency band encoding unit for quantizing the converted low frequency band signal and encoding the converted low frequency band signal into a bit plane based on a context; And a bandwidth extension encoding unit for generating and encoding bandwidth extension information indicating the characteristics of the converted high frequency band signal using the converted low frequency band signal.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 장치는 입력 신호에서 스테레오 파라미터를 추출하여 부호화하고, 상기 입력 신호를 다운믹싱하는 스테레오 부호화부; 상기 다운믹싱된 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 밴드 분할부; 상기 저주파수 밴드 신호에 대하여 MDCT를 수행하여 시간 도메인에서 주파수 도메인으로 변환하는 MDCT 적용부; 상기 MDCT가 수행된 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 저주파수 밴드 부호화부; 상기 고주파수 밴드 신호 및 상기 저주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 변환부; 및 상기 변환된 저주파수 밴드 신호를 이용하여 상기 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 대역폭 확장 부호화부를 포함한다.According to another aspect of the present invention, there is provided an apparatus for encoding an audio signal, the apparatus including: a stereo encoding unit for extracting and encoding a stereo parameter from an input signal and downmixing the input signal; A band dividing unit dividing the downmixed signal into a high frequency band signal and a low frequency band signal; An MDCT applying unit for performing MDCT on the low frequency band signal to convert the time domain into a frequency domain; A low frequency band encoding unit for quantizing the signal subjected to the MDCT and encoding the signal into a bit plane based on a context; A converter for converting the high frequency band signal and the low frequency band signal into a frequency domain or a time / frequency domain in a time domain; And a bandwidth extension encoding unit for generating and encoding bandwidth extension information indicating the characteristics of the converted high frequency band signal using the converted low frequency band signal.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 장치는 입력 신호에서 스테레오 파라미터를 추출하여 부호화하고, 상기 입력 신호를 다운믹싱하는 스테레오 부호화부; 상기 다운믹싱된 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 밴드 분할부; 상기 저주파수 밴드 신호를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정하는 모드 결정부; 상기 저주파수 밴드 신호를 시간 도메인에서 부호화하는 것으로 결정된 경우 상기 저주파수 밴드 신호를 CELP 방식에 따라 부호화하는 CELP 부호화부; 상기 저주파수 밴드 신호를 주파수 도메인에서 부호화하는 것으로 결정된 경우 상기 저주파수 밴드 신호에 대하여 MDCT를 수행하여 상기 저주파수 밴드 신호를 시간 도메인에서 주파수 도메인으로 변환하는 MDCT 적용부; 상기 MDCT가 수행된 저주파수 밴드 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 저주파수 밴드 부호화부; 상기 저주파수 밴드 신호 및 상기 고주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 변환부; 및 상기 변환된 저주파수 밴드 신호를 이용하여 상기 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 대역폭 확장 부호화부를 포함한다.According to another aspect of the present invention, there is provided an apparatus for encoding an audio signal, the apparatus including: a stereo encoding unit for extracting and encoding a stereo parameter from an input signal and downmixing the input signal; A band dividing unit dividing the downmixed signal into a high frequency band signal and a low frequency band signal; A mode determination unit for determining whether to encode the low frequency band signal in the time domain or the frequency domain; A CELP encoding unit for encoding the low frequency band signal according to the CELP scheme when it is determined to encode the low frequency band signal in the time domain; An MDCT applying unit for performing MDCT on the low frequency band signal and converting the low frequency band signal from a time domain into a frequency domain when it is determined to encode the low frequency band signal in the frequency domain; A low frequency band encoding unit for quantizing the low frequency band signal on which the MDCT has been performed and encoding the low frequency band signal into a bit plane based on a context; A converter for converting the low frequency band signal and the high frequency band signal into a frequency domain or a time / frequency domain in a time domain; And a bandwidth extension encoding unit for generating and encoding bandwidth extension information indicating the characteristics of the converted high frequency band signal using the converted low frequency band signal.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 장치는 입력 신호를 시간 도메인에서 주파수 도메인으로 변환하는 변환부; 상기 변환된 신호에서 스테레오 파라미터를 추출하여 부호화하고, 상기 변환된 신호를 다운믹싱하는 스테레오 부호화부; 상기 다운믹싱된 신호에서 대역폭 확장 정보를 추출하여 부호화하는 대역폭 확장 부호화부; 상기 다운믹싱된 신호를 시간 도메인으로 역변환하는 역변환부; 상기 역변환된 신호를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정하는 모드 결정부; 상기 결정 결과에 따 라 상기 역변환된 신호에 대하여 FV-MLT를 적용하여 서브 밴드 별로 시간 도메인 또는 주파수 도메인으로 변환하는 FV-MLT 적용부; 상기 역변환된 신호를 시간 도메인에서 부호화하는 것으로 결정된 경우 상기 시간 도메인으로 변환된 신호를 CELP 방식에 따라 부호화하는 CELP 부호화부; 및 상기 역변환된 신호를 주파수 도메인으로 부호화하는 것으로 결정된 경우 상기 주파수 도메인으로 변환된 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 주파수 도메인 부호화부를 포함한다.According to another aspect of the present invention, there is provided an apparatus for encoding an audio signal, the apparatus including: a transform unit for transforming an input signal from a time domain to a frequency domain; A stereo encoding unit for extracting and encoding a stereo parameter from the converted signal, and downmixing the converted signal; A bandwidth extension encoder for extracting bandwidth extension information from the downmixed signal and encoding the bandwidth extension information; An inverse transformer for inversely transforming the downmixed signal into a time domain; A mode decision unit for deciding whether to encode the inversely transformed signal in a time domain or a frequency domain; An FV-MLT applying unit for applying FV-MLT to the inversely transformed signal according to the determination result and transforming it into a time domain or a frequency domain for each subband; A CELP encoding unit for encoding the signal converted into the time domain according to the CELP scheme when it is determined to encode the inversely converted signal in the time domain; And a frequency domain encoding unit for quantizing the signal converted into the frequency domain and encoding the signal into a bit plane based on the context, when it is determined to encode the inverse converted signal in the frequency domain.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 장치는 입력 신호를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정하는 모드 결정부; 상기 결정 결과에 따라 상기 입력 신호에 대하여 FV-MLT를 적용하여 서브 밴드 별로 시간 도메인 또는 주파수 도메인으로 변환하는 FV-MLT 적용부; 상기 변환된 신호에서 스테레오 파라미터를 추출하여 부호화하고, 상기 변환된 신호를 다운믹싱하는 스테레오 부호화부; 상기 다운믹싱된 신호에서 대역폭 확장 정보를 추출하여 부호화하는 대역폭 확장 부호화부; 상기 다운믹싱된 신호가 시간 도메인에서 부호화하는 것으로 결정된 경우 상기 다운믹싱된 신호를 CELP 방식에 따라 부호화하는 CELP 부호화부; 및 상기 다운믹싱된 신호가 주파수 도메인에서 부호화하는 것으로 결정된 경우 상기 다운믹싱된 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 주파수 도메인 부호화부를 포함한다.According to another aspect of the present invention, there is provided an apparatus for encoding an audio signal, the apparatus including: a mode determination unit for determining whether to encode an input signal in a time domain or a frequency domain; An FV-MLT applying unit for applying FV-MLT to the input signal according to the determination result and transforming the input signal into a time domain or a frequency domain for each subband; A stereo encoding unit for extracting and encoding a stereo parameter from the converted signal, and downmixing the converted signal; A bandwidth extension encoder for extracting bandwidth extension information from the downmixed signal and encoding the bandwidth extension information; A CELP encoding unit for encoding the downmixed signal according to the CELP scheme when it is determined that the downmixed signal is encoded in the time domain; And a frequency domain encoding unit that quantizes the downmixed signal and encodes the downmixed signal into a bit plane based on the context, when the downmixed signal is determined to be encoded in the frequency domain.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 복호화 장치는 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저 주파수 밴드 신호를 생성하는 저주파수 밴드 복호화부; 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 대역폭 확장 복호화부; 상기 저주파수 밴드 신호 및 상기 고주파수 밴드 신호 각각에 대하여 역 MDCT를 수행하여 주파수 도메인에서 시간 도메인으로 변환하는 역 MDCT 적용부; 상기 변환된 저주파수 밴드 신호 및 상기 변환된 고주파수 밴드 신호를 합성하는 밴드 합성부; 및 부호화된 스테레오 파라미터를 복호화하고, 상기 복호화된 스테레오 파라미터를 이용하여 상기 합성된 신호를 업믹싱하는 스테레오 복호화부를 포함한다.According to another aspect of the present invention, there is provided an apparatus for decoding an audio signal, the apparatus comprising: a low frequency band decoding unit decoding a coded bit plane based on a context and generating a low frequency band signal by dequantizing the coded bit plane; A bandwidth extension decoder for decoding the encoded bandwidth extension information and generating a high frequency band signal from the low frequency band signal using the decoded bandwidth extension information; An inverse MDCT applying unit for performing inverse MDCT on each of the low frequency band signal and the high frequency band signal and converting the frequency band to the time domain; A band synthesizer for synthesizing the converted low frequency band signal and the converted high frequency band signal; And a stereo decoding unit decoding the encoded stereo parameter and upmixing the synthesized signal using the decoded stereo parameter.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 복호화 장치는 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하는 저주파수 밴드 복호화부; 상기 저주파수 밴드 신호에 대하여 역 MDCT를 수행하여 주파수 도메인에서 시간 도메인으로 변환하는 역 MDCT 적용부; 상기 역 MDCT가 수행된 저주파수 밴드 신호를 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 변환부; 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 주파수 도메인 또는 시간/주파수 도메인으로 변환된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 대역폭 확장 복호화부; 상기 생성된 고주파수 밴드 신호를 시간 도메인으로 역변환하는 역변환부; 상기 역변환된 고주파수 밴드 신호 및 상기 변환된 저주파수 밴드 신호를 합성하는 밴드 합성부; 및 부호화된 스테레오 파라미터를 복호화하고, 상기 복호화된 스테레오 파라미터를 이용하여 상기 합성된 신호를 업믹싱하는 스테레오 복호화 부를 포함한다.According to another aspect of the present invention, there is provided an apparatus for decoding an audio signal, the apparatus comprising: a low frequency band decoding unit decoding a coded bit plane based on a context and generating a low frequency band signal by dequantizing the coded bit plane; An inverse MDCT applying unit that performs inverse MDCT on the low frequency band signal and converts the inverse MDCT from the frequency domain to the time domain; A transform unit for transforming the low frequency band signal in which the inverse MDCT is performed into a frequency domain or a time / frequency domain; A bandwidth extension decoder for decoding the encoded bandwidth extension information and generating a high frequency band signal from the low frequency band signal converted into the frequency domain or the time / frequency domain using the decoded bandwidth extension information; An inverse transformer for inversely transforming the generated high frequency band signal into a time domain; A band synthesizer for synthesizing the inversely converted high frequency band signal and the converted low frequency band signal; And a stereo decoding unit decoding the encoded stereo parameter and upmixing the synthesized signal using the decoded stereo parameter.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 복호화 장치는 주파수 도메인에서 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하는 저주파수 밴드 복호화부; 상기 저주파수 밴드 신호에 대하여 역 MDCT를 수행하여 시간 도메인으로 역변환하는 역 MDCT 적용부; 상기 역 MDCT가 수행된 신호를 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 변환부; 주파수 도메인에서 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 주파수 도메인 또는 시간/주파수 도메인으로 변환된 신호로부터 고주파수 밴드 신호를 생성하는 대역폭 확장 복호화부; 상기 생성된 고주파수 밴드 신호를 시간 도메인으로 역변환하는 역변환부; 시간 도메인에서 부호화된 CELP 부호화 정보를 복호화하여 상기 저주파수 밴드 신호를 생성하는 CELP 복호화부; 상기 역 MDCT가 수행된 신호, 상기 역변환된 고주파수 밴드 신호 및 상기 CELP 방식으로 생성된 저주파수 밴드 신호를 합성하는 밴드 합성부; 및 부호화된 스테레오 파라미터를 복호화하고, 상기 복호화된 스테레오 파라미터를 이용하여 상기 합성된 신호를 업믹싱하는 스테레오 복호화부를 포함한다.According to another aspect of the present invention, there is provided an apparatus for decoding an audio signal, the apparatus comprising: a low frequency band decoding unit decoding a bit plane encoded in a frequency domain based on a context and generating a low frequency band signal by dequantizing the bit plane; An inverse MDCT applying unit that performs inverse MDCT on the low frequency band signal and inverse transforms the inverse MDCT into a time domain; A transform unit for transforming the signal in which the inverse MDCT is performed into a frequency domain or a time / frequency domain; A bandwidth extension decoder for decoding the bandwidth extension information encoded in the frequency domain and generating a high frequency band signal from the frequency domain or time / frequency domain converted signal using the decoded bandwidth extension information; An inverse transformer for inversely transforming the generated high frequency band signal into a time domain; A CELP decoding unit for decoding the CELP encoded information encoded in the time domain to generate the low frequency band signal; A band synthesizer for synthesizing the inverse MDCT-processed signal, the inverse transformed high-frequency band signal, and the low-frequency band signal generated by the CELP method; And a stereo decoding unit decoding the encoded stereo parameter and upmixing the synthesized signal using the decoded stereo parameter.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 복호화 장치는 주파수 도메인에서 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하는 주파수 도메인 복호화부; 시간 도메인에서 부호화된 CELP 부호화 정보를 복호화하는 CELP 복호화부; 상기 주파수 도메인 복호화부 또는 상기 CELP 복호화부에서 복호화된 신호에 대하여 역 FV-MLT를 수행하여 시간 도메인으로 변환하는 역 FV-MLT 적용부; 상기 변환된 신호를 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 변환부; 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 주파수 도메인 또는 시간/주파수 도메인으로 변환된 신호로부터 전 대역의 신호를 생성하는 대역폭 확장 복호화부; 부호화된 스테레오 파라미터를 복호화하고, 상기 복호화된 스테레오 파라미터를 이용하여 상기 생성된 신호를 업믹싱하는 스테레오 복호화부; 및 상기 업믹싱된 신호를 시간 도메인으로 역변환하는 역변환부를 포함한다.According to another aspect of the present invention, there is provided an apparatus for decoding an audio signal, the apparatus including: a frequency domain decoding unit decoding and dequantizing a bit plane encoded in a frequency domain based on a context; A CELP decoding unit decoding the CELP encoded information encoded in the time domain; An inverse FV-MLT applying unit for performing an inverse FV-MLT on a signal decoded by the frequency domain decoding unit or the CELP decoding unit and converting the decoded signal into a time domain; A converter for converting the converted signal into a frequency domain or a time / frequency domain; A bandwidth extension decoder for decoding the coded bandwidth extension information and generating a signal of the entire band from the frequency domain or the time / frequency domain converted signal using the decoded bandwidth extension information; A stereo decoding unit decoding the encoded stereo parameter and upmixing the generated signal using the decoded stereo parameter; And an inverse transformer for inversely transforming the upmixed signal into a time domain.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 복호화 장치는 주파수 도메인에서 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하는 주파수 도메인 복호화부; 시간 도메인에서 부호화된 CELP 부호화 정보를 복호화하는 CELP 복호화부; 상기 CELP 복호화부에서 출력된 신호에 대하여 MDCT를 수행하여 시간 도메인에서 주파수 도메인으로 변환하는 MDCT 적용부; 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 주파수 도메인 복호화부에서 출력된 신호 또는 상기 MDCT 적용부에서 출력된 신호로부터 전 대역의 신호를 생성하는 대역폭 확장 복호화부; 부호화된 스테레오 파라미터를 복호화하고, 상기 복호화된 스테레오 파라미터를 이용하여 상기 생성된 신호를 업믹싱하는 스테레오 복호화부; 및 상기 업믹싱된 신호에 대하여 역 FV-MLT를 적용하여 시간 도메인으로 변환하는 역 FV-MLT 적용부를 포함한다. According to another aspect of the present invention, there is provided an apparatus for decoding an audio signal, the apparatus including: a frequency domain decoding unit decoding and dequantizing a bit plane encoded in a frequency domain based on a context; A CELP decoding unit decoding the CELP encoded information encoded in the time domain; An MDCT applying unit for performing MDCT on the signal output from the CELP decoding unit and converting the time domain into a frequency domain; A bandwidth extension decoder for decoding the coded bandwidth extension information and generating a signal of the entire band from the signal output from the frequency domain decoding unit or the signal output from the MDCT application unit using the decoded bandwidth extension information; A stereo decoding unit decoding the encoded stereo parameter and upmixing the generated signal using the decoded stereo parameter; And an inverse FV-MLT applying unit for applying the inverse FV-MLT to the upmixed signal to convert the upmixed signal into the time domain.

본 발명에 따르면, 입력 신호에서 스테레오 파라미터를 추출하여 부호화하고, 입력 신호를 다운믹싱하며, 다운믹싱된 신호를 고주파수 밴드 신호와 저주파수 밴드 신호로 분할하고, 고주파수 밴드 신호와 저주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인으로 변환하며, 변환된 저주파수 밴드 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하고, 변환된 저주파수 밴드 신호를 이용하여 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하며, 부호화된 스테레오 파라미터, 부호화된 비트플레인 및 부호화된 대역폭 확장 정보를 입력 신호에 대한 부호화 결과로써 출력함으로써, 한정된 비트율에서 고주파수 성분 및 스테레오 성분을 효율적으로 부호화하여 음질을 향상시킬 수 있다.According to the present invention, stereo parameters are extracted and encoded in an input signal, downmixed input signals are divided into a high frequency band signal and a low frequency band signal, and a high frequency band signal and a low frequency band signal are divided into a time domain Quantizes the converted low frequency band signal and encodes the converted low frequency band signal into a bit plane based on the context and generates and encodes bandwidth extension information indicating the characteristics of the converted high frequency band signal using the converted low frequency band signal The coded stereo parameter, the encoded bit plane, and the coded bandwidth extension information as the encoding result for the input signal, thereby efficiently encoding the high frequency component and the stereo component at a limited bit rate, thereby improving the sound quality.

또한, 본 발명에 따르면, 오디오 신호의 부호화 결과를 입력받고, 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하며, 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하며, 저주파수 밴드 신호 및 고주파수 밴드 신호 각각을 제1 역변환 방식에 의해 주파수 도메인에서 시간 도메인으로 변환하고, 변환된 저주파수 밴드 신호 및 변환된 고주파수 밴드 신호를 합성하며, 부호화 결과에 포함된 스테레오 파라미터를 복호화하고, 복호화된 스테레오 파라미터를 이용하여 합성된 신호를 업믹싱함으로써, 한정된 비트율에서 부호화된 비트스트림으로부터 고주파수 성분 및 스테레오 성분을 효율적으로 복호화할 수 있다.According to another aspect of the present invention, there is provided an encoding method for encoding an encoded bit plane included in an encoding result by receiving an encoding result of an audio signal, decoding the encoded bit plane based on a context, and generating a low frequency band signal by inverse- Frequency band signal from the low-frequency band signal using the decoded bandwidth extension information, converts each of the low-frequency band signal and the high-frequency band signal from the frequency domain to the time domain by the first inverse transformation method, Frequency band signal and the converted high-frequency band signal, decodes the stereo parameter included in the encoding result, and upmixes the synthesized signal using the decoded stereo parameter to generate a high frequency component And Stas The Leo component can be efficiently decoded.

본문에 개시되어 있는 본 발명의 실시예들에 대해서, 특정한 구조적 내지 기능적 설명들은 단지 본 발명의 실시예를 설명하기 위한 목적으로 예시된 것으로, 본 발명의 실시예들은 다양한 형태로 실시될 수 있으며 본문에 설명된 실시예들에 한정되는 것으로 해석되어서는 아니 된다. For the embodiments of the invention disclosed herein, specific structural and functional descriptions are set forth for the purpose of describing an embodiment of the invention only, and it is to be understood that the embodiments of the invention may be practiced in various forms, The present invention should not be construed as limited to the embodiments described in Figs.

본 발명은 다양한 변경을 가할 수 있고 여러 가지 형태를 가질 수 있는 바, 특정 실시예들을 도면에 예시하고 본문에 상세하게 설명하고자 한다. 그러나, 이는 본 발명을 특정한 개시 형태에 대해 한정하려는 것이 아니며, 본 발명의 사상 및 기술 범위에 포함되는 모든 변경, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다. 각 도면을 설명하면서 유사한 참조부호를 구성요소에 대해 사용하였다. The present invention is capable of various modifications and various forms, and specific embodiments are illustrated in the drawings and described in detail in the text. It should be understood, however, that the invention is not intended to be limited to the particular forms disclosed, but includes all modifications, equivalents, and alternatives falling within the spirit and scope of the invention. Similar reference numerals have been used for the components in describing each drawing.

다르게 정의되지 않는 한, 기술적이거나 과학적인 용어를 포함해서 여기서 사용되는 모든 용어들은 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에 의해 일반적으로 이해되는 것과 동일한 의미를 가지고 있다. 일반적으로 사용되는 사전에 정의되어 있는 것과 같은 용어들은 관련 기술의 문맥 상 가지는 의미와 일치하는 의미를 가지는 것으로 해석되어야 하며, 본 출원에서 명백하게 정의하지 않는 한, 이상적이거나 과도하게 형식적인 의미로 해석되지 않는다. Unless defined otherwise, all terms used herein, including technical or scientific terms, have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Terms such as those defined in commonly used dictionaries are to be interpreted as having a meaning consistent with the contextual meaning of the related art and are to be interpreted as either ideal or overly formal in the sense of the present application Do not.

이하, 첨부한 도면들을 참조하여, 본 발명의 바람직한 실시예를 보다 상세하게 설명하고자 한다. 도면상의 동일한 구성요소에 대해서는 동일한 참조부호를 사용하고 동일한 구성요소에 대해서 중복된 설명은 생략한다. Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. The same reference numerals are used for the same constituent elements in the drawings and redundant explanations for the same constituent elements are omitted.

도 1은 본 발명의 일 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다.1 is a block diagram illustrating an apparatus for encoding an audio signal according to an embodiment of the present invention.

도 1을 참조하면, 오디오 신호의 부호화 장치는 스테레오 부호화부(100), 밴드 분할부(110), 제1 MDCT 적용부(120), 주파수 선형 예측 수행부(130), 멀티-레졸루션 분석부(140), 양자화부(150), 문맥-기반 비트플레인 부호화부(160), 제2 MDCT 적용부(170), 대역폭 확장 부호화부(180) 및 다중화부(190)를 포함한다.1, an apparatus for encoding an audio signal includes a stereo encoding unit 100, a band dividing unit 110, a first MDCT applying unit 120, a frequency linear prediction performing unit 130, a multi-resolution analyzing unit A second quantization unit 140, a quantization unit 150, a context-based bit-plane coding unit 160, a second MDCT applying unit 170, a bandwidth extension coding unit 180 and a multiplexing unit 190.

스테레오 부호화부(100)는 입력 신호(IN)에서 스테레오 파라미터를 추출하여 부호화하고, 입력 신호(IN)를 다운믹싱(down-mixing)한다. 여기서, 입력 신호(IN)는 아날로그의 음성 신호 또는 오디오 신호를 디지털 신호로 변조한 PCM(Pulse Code Modulation) 신호일 수 있다. 여기서, 다운믹싱은 두 채널 이상의 스테레오 신호로부터 한 채널의 모노 신호를 생성하는 것이며, 다운믹싱을 통하여 부호화 과정에 할당되는 비트량을 줄일 수 있다. The stereo encoding unit 100 extracts and encodes stereo parameters from the input signal IN and down-mixes the input signal IN. Here, the input signal IN may be a PCM (Pulse Code Modulation) signal obtained by modulating an analog audio signal or an audio signal into a digital signal. Here, downmixing is to generate mono signals of one channel from two or more stereo signals, and the amount of bits allocated to the encoding process can be reduced through downmixing.

구체적으로, 스테레오 파라미터는 스테레오 신호에 대한 부가 정보(side information)를 나타내는 것으로, 부가 정보는 좌채널 신호 및 우채널 신호의 채널 간의 위상차 또는 강도차 등의 다양한 정보를 포함할 수 있음은 본 실시예가 속하는 기술 분야에서 통상의 지식을 가진 자는 이해할 수 있을 것이다.Specifically, the stereo parameter indicates side information for the stereo signal, and the additional information may include various information such as a phase difference or intensity difference between the left channel signal and the right channel signal, Those of ordinary skill in the art will understand.

밴드 분할부(110)는 스테레오 부호화부(100)에서 다운믹싱된 신호를 저주파수 밴드 신호(LB, low frequency band signal) 및 고주파수 밴드 신호(HB, high frequency band signal)로 분할한다. 여기서, 저주파수 밴드 신호는 임의의 임계값 보다 낮은 주파수에 해당하는 신호이며, 고주파수 밴드 신호는 임의의 임계값 보다 높은 주파수에 해당하는 신호일 수 있다.The band dividing unit 110 divides the downmixed signal in the stereo encoding unit 100 into a low frequency band signal LB and a high frequency band signal HB. Here, the low frequency band signal is a signal corresponding to a frequency lower than a certain threshold value, and the high frequency band signal may be a signal corresponding to a frequency higher than a certain threshold value.

제1 MDCT 적용부(120)는 밴드 분할부(110)에서 분할된 저주파수 밴드 신호(LB)에 대하여 MDCT를 수행하여, 저주파수 밴드 신호(LB)를 시간 도메인에서 주파수 도메인으로 변환한다. 여기서, 시간 도메인은 시간의 경과에 따라 입력 신호의 크기(예를 들어, 에너지 또는 음압 등)를 나타내는 도메인이고, 주파수 도메인은 주파수의 변화에 따라 입력 신호의 크기를 나타내는 도메인이다.The first MDCT applying unit 120 performs MDCT on the low frequency band signal LB divided in the band dividing unit 110 to convert the low frequency band signal LB from the time domain to the frequency domain. Here, the time domain is a domain indicating the size of an input signal (for example, energy or sound pressure) with passage of time, and the frequency domain is a domain indicating the size of an input signal according to a change in frequency.

주파수 선형 예측 수행부(130)는 제1 MDCT 적용부(120)에서 주파수 도메인으로 변환된 저주파수 밴드 신호에 대하여 주파수 선형 예측을 수행한다. 여기서, 주파수 선형 예측은 현재 주파수에서의 신호를 이전 주파수에서의 신호의 선형 조합(linear combination)으로 근사하는 방법이다. 구체적으로, 주파수 선형 예측 수행부(130)는 선형 예측이 수행된 신호와 현재 주파수에서의 신호 사이의 차이인 예측 오류(prediction error)가 최소가 되도록 선형 예측 필터의 계수(coefficient)를 계산하고, 계산된 계수에 따라 주파수 도메인으로 변환된 저주파수 밴드 신호를 선형 예측 필터링한다. 이 때, 주파수 선형 예측 수행부(130)는 선형 예측 필터의 계수에 대응되는 값에 대하여 벡터 인덱스로 표현하는 벡터 양자화를 수행하여 부호화의 효율을 향상시킬 수 있다.The frequency linear prediction performing unit 130 performs frequency linear prediction on the low frequency band signal converted into the frequency domain in the first MDCT applying unit 120. [ Here, the frequency linear prediction is a method of approximating a signal at a current frequency to a linear combination of a signal at a previous frequency. Specifically, the frequency linear prediction performing unit 130 calculates a coefficient of the linear prediction filter so that a prediction error, which is a difference between the signal subjected to the linear prediction and the signal at the current frequency, is minimized, The low-frequency band signal converted into the frequency domain is linearly predictively filtered according to the calculated coefficients. In this case, the frequency linear prediction performing unit 130 may improve the coding efficiency by performing vector quantization which expresses a value corresponding to the coefficient of the linear prediction filter by a vector index.

구체적으로, 주파수 선형 예측 수행부(130)는 제1 MDCT 적용부(120)에서 주파수 도메인으로 변환된 저주파수 밴드 신호(LB)가 음성(speech) 신호 또는 피치드(pitched) 신호인 경우에 주파수 선형 예측을 수행할 수 있다. 다시 말해, 주파수 선형 예측 수행부(130)는 입력되는 신호의 특성에 따라 주파수 선형 예측을 수행하여 부호화의 효율을 향상시킬 수 있다.In detail, the frequency linear prediction performing unit 130 performs frequency linear prediction when the low frequency band signal LB converted into the frequency domain in the first MDCT applying unit 120 is a speech signal or a pitched signal. Prediction can be performed. In other words, the frequency linear prediction performing unit 130 may improve the efficiency of encoding by performing frequency linear prediction according to characteristics of an input signal.

멀티-레졸루션 분석부(140)는 제1 MDCT 적용부(120)에서 주파수 도메인으로 변환된 저주파수 밴드 신호 또는 주파수 선형 예측 수행부(130)에서 필터링된 신호를 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수를 다해상도(multi-resolution)로 분석한다. 구체적으로, 멀티-레졸루션 분석부(140)는 오디오 스펙트럼의 변화의 격렬한 정도에 따라 주파수 선형 예측 수행부(130)에서 필터링된 신호를 두 가지 유형(예를 들어, 스테이빌(stabile) 유형과 숏(short) 유형)으로 나누어 분석할 수 있다. The multi-resolution analyzer 140 receives the low frequency band signal converted into the frequency domain or the signal filtered by the frequency linear prediction unit 130 in the first MDCT applying unit 120 and outputs an audio spectral coefficient To multi-resolution. In detail, the multi-resolution analyzing unit 140 may classify the filtered signal in the frequency linear prediction performing unit 130 into two types (for example, stabile type and short type) according to the intensity of change of the audio spectrum. (short) type).

구체적으로, 멀티-레졸루션 분석부(140)는 제1 MDCT 적용부(120)에서 주파수 도메인으로 변환된 저주파수 밴드 신호 또는 주파수 선형 예측 수행부(130)에서 필터링된 신호가 트렌젼트(transient) 신호인 경우에 멀티-레졸루션으로 분석할 수 있다. 다시 말해, 멀티-레졸루션 분석부(140)는 입력되는 신호의 특성에 따라 멀티-레졸루션 분석을 수행하여 부호화의 효율을 향상시킬 수 있다.More specifically, the multi-resolution analyzing unit 140 may be configured such that the low frequency band signal converted into the frequency domain in the first MDCT applying unit 120 or the signal filtered in the frequency linear prediction unit 130 is a transient signal In this case, it can be analyzed by multi-resolution. In other words, the multi-resolution analyzer 140 may improve the coding efficiency by performing multi-resolution analysis according to characteristics of an input signal.

양자화부(150)는 주파수 선형 예측 수행부(130)에서 필터링된 신호 또는 멀티-레졸루션 분석부(140)에서 출력된 결과를 양자화한다.The quantization unit 150 quantizes the signal filtered by the frequency linear prediction unit 130 or the result output from the multi-resolution analysis unit 140.

문맥-기반 비트플레인 부호화부(160)는 문맥을 기반으로 하여 양자화부(150)에서 양자화된 결과를 비트플레인으로 부호화한다. 구체적으로, 문맥-기반 비트 플레인 부호화부(160)는 허프만 코딩(Huffman Coding)을 이용하여 양자화부(150)에서 양자화된 결과를 비트플레인으로 부호화할 수 있다.The context-based bit-plane encoding unit 160 encodes the quantized result in the bit-plane by the quantization unit 150 based on the context. Specifically, the context-based bit-plane encoding unit 160 may encode the quantized result in the quantization unit 150 into a bit plane using Huffman coding.

이와 같이, 주파수 선형 예측 수행부(130), 멀티-레졸루션 분석부(140), 양자화부(150) 및 문맥-기반 비트플레인 부호화부(160)는 제1 MDCT 적용부(120)에서 출력된 변환된 저주파수 밴드 신호에 대해서 부호화를 수행하므로, 저주파수 밴드 부호화부라고 할 수 있다.In this manner, the frequency linear prediction unit 130, the multi-resolution analyzer 140, the quantization unit 150, and the context-based bit-plane encoding unit 160 may perform the transforms output from the first MDCT applying unit 120, Frequency band signal, and thus can be regarded as a low-frequency band encoding unit.

제2 MDCT 적용부(170)는 밴드 분할부(110)에서 분할된 고주파수 밴드 신호(HB)에 대하여 MDCT를 수행하여 고주파수 밴드 신호(HB)를 시간 도메인에서 주파수 도메인으로 변환한다.The second MDCT applying unit 170 performs MDCT on the high frequency band signal HB divided by the band dividing unit 110 to convert the high frequency band signal HB from the time domain to the frequency domain.

대역폭 확장 부호화부(180)는 제2 MDCT 적용부(170)에서 주파수 도메인으 로변환된 고주파수 밴드 신호의 성분을 전달하기 위하여, 제1 MDCT 적용부(120)에서 주파수 도메인으로 변환된 저주파수 밴드 신호를 이용하여 제2 MDCT 적용부(170)에서 주파수 도메인으로 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화한다. 여기서, 대역폭 확장 정보는 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 다양한 정보를 포함할 수 있음은 본 실시예가 속하는 기술분야에서 통상의 지식을 가진 자는 이해할 수 있을 것이다. 보다 상세하게는, 대역폭 확장 부호화부(180)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 저주파수 밴드의 정보를 이용하여 상술한 바와 같은 대역폭 확장 정보를 생성할 수 있다. 본 발명의 다른 실시예에서, 대역폭 확장 부호화부(180)는 저주파수 밴드 신호에 대하여 부호화된 결과를 이용하여 대역폭 확장 정보를 생성할 수 있다.The bandwidth extension encoding unit 180 encodes the low frequency band signal converted into the frequency domain in the first MDCT applying unit 120 to transmit the components of the high frequency band signal converted into the frequency domain in the second MDCT applying unit 170. [ The second MDCT applying unit 170 generates and encodes bandwidth extension information indicating the characteristics of the high frequency band signal converted into the frequency domain. It will be understood by those skilled in the art that the bandwidth extension information may include various information such as energy level or envelope for the high frequency band signal. More specifically, the bandwidth extension encoding unit 180 can generate the bandwidth extension information as described above by using the information of the low frequency band based on the characteristic that a high correlation exists between the high frequency and low frequency bands of the audio signal . In another embodiment of the present invention, the bandwidth extension encoding unit 180 may generate the bandwidth extension information using the encoded result of the low frequency band signal.

다중화부(190)는 스테레오 부호화부(100), 주파수 선형 예측 수행부(130), 문맥-기반 비트플레인 부호화부(160) 및 대역폭 확장 부호화부(180)에서 부호화를 수행한 결과를 다중화하여 비트 스트림을 생성하고 출력 단자 OUT을 통해 출력한 다.The multiplexing unit 190 multiplexes the results of the encoding performed by the stereo encoding unit 100, the frequency linear prediction performing unit 130, the context-based bit plane encoding unit 160, and the bandwidth extension encoding unit 180, And outputs the stream through the output terminal OUT.

도 2는 본 발명의 다른 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다. 2 is a block diagram illustrating an apparatus for encoding an audio signal according to another embodiment of the present invention.

도 2를 참조하면, 오디오 신호의 부호화 장치는 스테레오 부호화부(200), 밴드 분할부(210), MDCT 적용부(220), 주파수 선형 예측 수행부(230), 멀티-레졸루션 분석부(240), 양자화부(250), 문맥-기반 비트플레인 부호화부(260), 저주파수 밴드 변환부(270), 고주파수 밴드 변환부(275), 대역폭 확장 부호화부(280) 및 다중화부(290)를 포함한다.2, an apparatus for encoding an audio signal includes a stereo encoding unit 200, a band dividing unit 210, an MDCT applying unit 220, a frequency linear prediction performing unit 230, a multi-resolution analyzing unit 240, A quantization unit 250, a context-based bit-plane encoding unit 260, a low-frequency band transforming unit 270, a high-frequency band transforming unit 275, a bandwidth extension encoding unit 280, and a multiplexing unit 290 .

스테레오 부호화부(200)는 입력 신호(IN)에서 스테레오 파라미터를 추출하여 부호화하고, 입력 신호(IN)를 다운믹싱한다. 여기서, 입력 신호(IN)는 아날로그의 음성 신호 또는 오디오 신호를 디지털 신호로 변조한 PCM 신호일 수 있다. 여기서, 다운믹싱은 두 채널 이상의 스테레오 신호로부터 한 채널의 모노 신호를 생성하는 것이며, 다운믹싱을 통하여 부호화 과정에 할당되는 비트량을 줄일 수 있다. The stereo encoding unit 200 extracts a stereo parameter from the input signal IN and encodes it, and down-mixes the input signal IN. Here, the input signal IN may be an analog audio signal or a PCM signal obtained by modulating an audio signal into a digital signal. Here, downmixing is to generate mono signals of one channel from two or more stereo signals, and the amount of bits allocated to the encoding process can be reduced through downmixing.

구체적으로, 스테레오 파라미터는 스테레오 신호에 대한 부가 정보를 나타내는 것으로, 부가 정보는 좌채널 신호 및 우채널 신호의 채널 간의 위상차 또는 강도차 등의 다양한 정보를 포함할 수 있음은 본 실시예가 속하는 기술 분야에서 통상의 지식을 가진 자는 이해할 수 있을 것이다.Specifically, the stereo parameter indicates additional information for a stereo signal, and the additional information may include various information such as a phase difference or intensity difference between channels of the left channel signal and the right channel signal. Those skilled in the art will understand.

밴드 분할부(210)는 스테레오 부호화부(200)에서 출력된 다운믹싱된 신호를 저주파수 밴드 신호(LB) 및 고주파수 밴드 신호(HB)로 분할한다. 여기서, 저주파수 밴드 신호는 임의의 임계값 보다 낮은 주파수에 해당하는 신호이며, 고주파수 밴드 신호는 임의의 임계값 보다 높은 주파수에 해당하는 신호일 수 있다.The band division unit 210 divides the downmixed signal output from the stereo encoding unit 200 into a low frequency band signal LB and a high frequency band signal HB. Here, the low frequency band signal is a signal corresponding to a frequency lower than a certain threshold value, and the high frequency band signal may be a signal corresponding to a frequency higher than a certain threshold value.

MDCT 적용부(220)는 밴드 분할부(210)에서 분할된 저주파수 밴드 신호(LB)에 대하여 MDCT를 수행하여, 저주파수 밴드 신호(LB)를 시간 도메인에서 주파수 도메인으로 변환한다.The MDCT applying unit 220 performs MDCT on the low frequency band signal LB divided in the band dividing unit 210 to convert the low frequency band signal LB from the time domain to the frequency domain.

주파수 선형 예측 수행부(230)는 MDCT 적용부(220)에서 주파수 도메인으로 변환된 저주파수 밴드 신호에 대하여 주파수 선형 예측을 수행한다. 여기서, 주파수 선형 예측은 현재 주파수에서의 신호를 이전 주파수에서의 신호의 선형 조합으로 근사하는 방법이다. 구체적으로, 주파수 선형 예측 수행부(230)는 선형 예측이 수행된 신호와 현재 주파수에서의 신호 사이의 차이인 예측 오류가 최소가 되도록 선형 예측 필터의 계수를 계산하고, 계산된 계수에 따라 주파수 도메인으로 변환된 저주파수 밴드 신호를 선형 예측 필터링한다. 이 때, 주파수 선형 예측 수행부(230)는 선형 예측 필터의 계수에 대응되는 값에 대하여 벡터 인덱스로 표현하는 벡터 양자화를 수행하여 부호화의 효율을 향상시킬 수 있다.The frequency linear prediction performing unit 230 performs frequency linear prediction on the low frequency band signal converted into the frequency domain in the MDCT applying unit 220. [ Here, the frequency linear prediction is a method of approximating a signal at a current frequency with a linear combination of signals at a previous frequency. In detail, the frequency linear prediction performing unit 230 calculates the coefficients of the linear prediction filter so that the prediction error, which is the difference between the signal subjected to the linear prediction and the signal at the current frequency, is minimized, Lt; RTI ID = 0.0 > low-frequency band < / RTI > In this case, the frequency linear prediction performing unit 230 may improve the efficiency of coding by performing vector quantization which expresses a value corresponding to the coefficient of the linear prediction filter by a vector index.

구체적으로, 주파수 선형 예측 수행부(230)는 MDCT 적용부(220)에서 주파수 도메인으로 변환된 저주파수 밴드 신호가 음성(speech) 신호 또는 피치드(pitched) 신호인 경우에 주파수 선형 예측을 수행할 수 있다. 다시 말해, 주파수 선형 예측 수행부(230)는 입력되는 신호의 특성에 따라 주파수 선형 예측을 수행하여 부호화의 효율을 향상시킬 수 있다.More specifically, the frequency linear prediction performing unit 230 may perform frequency linear prediction when the low frequency band signal converted into the frequency domain in the MDCT applying unit 220 is a speech signal or a pitched signal have. In other words, the frequency linear prediction performing unit 230 may improve the efficiency of encoding by performing frequency linear prediction according to characteristics of an input signal.

멀티-레졸루션 분석부(240)는 MDCT 적용부(220)에서 주파수 도메인으로 변환된 저주파수 밴드 신호 또는 주파수 선형 예측 수행부(230)에서 필터링된 신호를 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수를 다해상도로 분석한다. 구체적으로, 멀티-레졸루션 분석부(240)는 오디오 스펙트럼의 변화의 격렬한 정도에 따라 주파수 선형 예측 수행부(230)에서 필터링된 오디오 스펙트럼을 두 가지 유형(예를 들어, 스테이빌 유형과 숏 유형)으로 나누어 분석할 수 있다.The multi-resolution analyzer 240 receives the low-frequency band signal converted into the frequency domain or the signal filtered by the frequency linear prediction unit 230 in the MDCT application unit 220 and calculates an audio spectral coefficient Resolution. Specifically, the multi-resolution analyzing unit 240 divides the filtered audio spectrum in the frequency linear prediction performing unit 230 into two types (for example, stabill type and short type) according to the intensity of change of the audio spectrum. .

구체적으로, 멀티-레졸루션 분석부(240)는 MDCT 적용부(220)에서 주파수 도메인으로 변환된 저주파수 밴드 신호 또는 주파수 선형 예측 수행부(230)에서 필터링된 신호가 트렌젼트(transient) 신호인 경우에 멀티-레졸루션으로 분석할 수 있다. 다시 말해, 멀티-레졸루션 분석부(240)는 입력되는 신호의 특성에 따라 멀티-레졸루션 분석을 수행하여 부호화의 효율을 향상시킬 수 있다.More specifically, when the low frequency band signal converted into the frequency domain in the MDCT applying unit 220 or the signal filtered in the frequency linear prediction unit 230 is a transient signal, the multi- It can be analyzed with multi-resolution. In other words, the multi-resolution analyzer 240 may improve the coding efficiency by performing multi-resolution analysis according to characteristics of an input signal.

양자화부(250)는 주파수 선형 예측 수행부(520)에서 필터링된 신호 또는 멀티-레졸루션 분석부(240)에서 출력된 결과를 양자화한다.The quantization unit 250 quantizes the filtered signal or the result output from the multi-resolution analyzer 240 by the frequency linear prediction unit 520.

문맥-기반 비트플레인 부호화부(260)는 문맥을 기반으로 하여 양자화부(250)에서 양자화된 결과를 비트플레인으로 부호화한다. 구체적으로, 문맥-기반 비트 플레인 부호화부(260)는 허프만 코딩을 이용하여 양자화부(250)에서 양자화된 결과를 비트플레인으로 부호화할 수 있다.The context-based bit-plane encoding unit 260 encodes the quantized result in the bit-plane by the quantization unit 250 based on the context. In detail, the context-based bit-plane encoding unit 260 can encode the quantized result in the quantization unit 250 into a bit plane using Huffman coding.

이와 같이, 주파수 선형 예측 수행부(230), 멀티-레졸루션 분석부(240), 양자화부(250) 및 문맥-기반 비트플레인 부호화부(260)는 MDCT 적용부(220)에서 출력된 변환된 저주파수 밴드 신호에 대해서 부호화를 수행하므로, 저주파수 밴드 부호화부라고 할 수 있다.In this manner, the frequency linear prediction performing unit 230, the multi-resolution analyzing unit 240, the quantizing unit 250, and the context-based bit-plane coding unit 260 may be configured to convert the converted low- Frequency band encoding unit, since the encoding is performed on the band signal.

저주파수 밴드 변환부(270)는 밴드 분할부(210)에서 분할된 저주파수 밴드 신호(LB)를 MDCT 이외의 변환 기법을 이용하여 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다. 예를 들어, 저주파수 밴드 변환부(270)는 MDST(Modified Discrete Sine Transform), FFT(Fast Fourier Transform), 및 QMF(Quadrature Mirror Filter) 등을 이용하여 저주파수 밴드 신호(LB)를 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환할 수 있다. 여기서, 시간 도메인은 시간의 경과에 따라 저주파수 밴드 신호(LB)의 크기(예를 들어, 에너지 또는 음압 등)를 나타내는 도메인이다. 이에 비해, 주파수 도메인은 주파수의 변화에 따라 저주파수 밴드 신호(LB)의 크기를 나타내는 도메인이다. 시간/주파수 도메인은 시간의 경과 및 주파수의 변화에 따라 저주파수 밴드 신호(LB)의 크기를 나타내는 도메인이다.The low frequency band transform unit 270 transforms the low frequency band signal LB divided in the band dividing unit 210 from the time domain into the frequency domain or the time / frequency domain using a conversion scheme other than MDCT. For example, the low-frequency band transformer 270 transforms a low-frequency band signal LB in the time domain into a frequency domain using Modified Discrete Sine Transform (MDST), Fast Fourier Transform (FFT), and Quadrature Mirror Filter (QMF) Or time / frequency domain. Here, the time domain is a domain representing the size (e.g., energy or sound pressure) of the low frequency band signal LB with the lapse of time. On the other hand, the frequency domain is a domain indicating the magnitude of the low frequency band signal LB in accordance with the change in frequency. The time / frequency domain is a domain indicating the size of the low-frequency band signal LB according to the passage of time and the change in frequency.

고주파수 밴드 변환부(275)는 밴드 분할부(210)에서 분할된 고주파수 밴드 신호(HB)를 MDCT 이외의 변환 기법을 이용하여 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다. 여기서, 고주파밴드 변환부(275)는 저주파밴드 변환부(270)에서 이용하는 동일한 변환 기법을 이용한다. 예를 들어, 고주파수 밴드 변환부(275)는 MDST, FFT, 및 QMF 등을 이용하여 고주파수 밴드 신호(HB)를 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환할 수 있다.The high frequency band converter 275 converts the high frequency band signal HB divided by the band dividing unit 210 into a frequency domain or a time / frequency domain from a time domain using a conversion technique other than MDCT. Here, the high-frequency band converting unit 275 uses the same conversion technique used in the low-frequency band converting unit 270. For example, the high frequency band converter 275 may convert the high frequency band signal HB from the time domain to the frequency domain or the time / frequency domain using MDST, FFT, and QMF.

대역폭 확장 부호화부(280)는 저주파수 밴드 변환부(270)에서 주파수 영역 또는 시간/주파수 영역으로 변환된 저주파수 밴드 신호를 이용하여, 고주파수 밴드 변환부(275)에서 주파수 영역 또는 시간/주파수 영역으로 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화한다. 여기서, 대역폭 확장 정보는 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 다양한 정보를 포함할 수 있음은 본 실시예가 속하는 기술분야에서 통상의 지식을 가진 자는 이해할 수 있을 것이다. 보다 상세하게는, 대역폭 확장 부호화부(280)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 상술한 바와 같은 저주파수 밴드의 정보를 이용하여 대역폭 확장 정보를 생성할 수 있다. 본 발명의 다른 실시예에서, 대역폭 확장 부호화부(280)는 저주파수 밴드 신호에 대하여 부호화된 결과를 이용하여 대역폭 확장 정보를 생성할 수 있다.The bandwidth extension encoding unit 280 converts the low frequency band signal from the high frequency band conversion unit 275 into the frequency domain or the time / frequency domain using the low frequency band signal converted into the frequency domain or the time / frequency domain by the low frequency band conversion unit 270. [ And generates bandwidth extension information indicating the characteristics of the high frequency band signal. It will be understood by those skilled in the art that the bandwidth extension information may include various information such as energy level or envelope for the high frequency band signal. More specifically, the bandwidth extension encoding unit 280 can generate the bandwidth extension information using the information of the low frequency band as described above based on the characteristic that a high correlation exists between high and low frequency bands of the audio signal . In another embodiment of the present invention, the bandwidth extension encoding unit 280 may generate the bandwidth extension information using the encoded result of the low frequency band signal.

다중화부(290)는 스테레오 부호화부(200), 주파수 선형 예측 수행부(230), 문맥-기반 비트플레인 부호화부(260) 및 대역폭 확장 부호화부(280)에서 부호화를 수행한 결과를 다중화하여 비트 스트림을 생성하고 출력 단자 OUT을 통해 출력한다.The multiplexing unit 290 multiplexes the results of the encoding performed by the stereo encoding unit 200, the frequency linear prediction performing unit 230, the context-based bit plane encoding unit 260 and the bandwidth extension encoding unit 280, And outputs the stream through the output terminal OUT.

도 3은 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다. 3 is a block diagram illustrating an apparatus for encoding an audio signal according to another embodiment of the present invention.

도 3을 참조하면, 오디오 신호의 부호화 장치는 스테레오 부호화부(300), 밴드 분할부(310), 모드 결정부(320), MDCT 적용부(325), 주파수 선형 예측 수행부(330), 멀티-레졸루션 분석부(340), 양자화부(350), 문맥-기반 비트플레인 부호화부(360), 저주파수 밴드 변환부(370), 고주파수 밴드 변환부(375), 대역폭 확장 부호화부(380), CELP(Code Excited Linear Prediction) 부호화부(385) 및 다중화부(390)를 포함한다.3, an apparatus for encoding an audio signal includes a stereo encoding unit 300, a band dividing unit 310, a mode determining unit 320, an MDCT applying unit 325, a frequency linear prediction performing unit 330, Based bit-plane encoding unit 360, a low-frequency band transforming unit 370, a high-frequency band transforming unit 375, a bandwidth extension encoding unit 380, a CELP (Code Excited Linear Prediction) encoding unit 385 and a multiplexing unit 390.

스테레오 부호화부(300)는 입력 신호(IN)에서 스테레오 파라미터를 추출하여 부호화하고, 입력 신호(IN)를 다운믹싱한다. 여기서, 입력 신호(IN)는 아날로그의 음성 신호 또는 오디오 신호를 디지털 신호로 변조한 PCM 신호일 수 있다. 여기서, 다운믹싱은 두 채널 이상의 스테레오 신호로부터 한 채널의 모노 신호를 생성하는 것이며, 다운믹싱을 통하여 부호화 과정에 할당되는 비트량을 줄일 수 있다. The stereo encoding unit 300 extracts a stereo parameter from the input signal IN and encodes it, and down-mixes the input signal IN. Here, the input signal IN may be an analog audio signal or a PCM signal obtained by modulating an audio signal into a digital signal. Here, downmixing is to generate mono signals of one channel from two or more stereo signals, and the amount of bits allocated to the encoding process can be reduced through downmixing.

밴드 분할부(310)는 스테레오 부호화부(300)에서 출력된 다운믹싱된 신호를 저주파수 밴드 신호(LB) 및 고주파수 밴드 신호(HB)로 분할한다. 여기서, 저주파수 밴드 신호는 임의의 임계값 보다 낮은 주파수에 해당하는 신호이며, 고주파수 밴드 신호는 임의의 임계값 보다 높은 주파수에 해당하는 신호일 수 있다.The band dividing unit 310 divides the downmixed signal output from the stereo encoding unit 300 into a low frequency band signal LB and a high frequency band signal HB. Here, the low frequency band signal is a signal corresponding to a frequency lower than a certain threshold value, and the high frequency band signal may be a signal corresponding to a frequency higher than a certain threshold value.

모드 결정부(320)는 소정의 기준에 따라 밴드 분할부(310)에서 분할된 저주파수 밴드 신호(LB)를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정한다. 예를 들어, 모드 결정부(320)는 MDCT 적용부(325)에서 출력된 정보에 따라 밴드 분할부(310)에서 분할된 저주파수 밴드 신호(LB)를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정한다.The mode determining unit 320 determines whether to encode the low frequency band signal LB divided in the band dividing unit 310 in the time domain or the frequency domain according to a predetermined criterion. For example, the mode determination unit 320 determines whether to encode the low frequency band signal LB divided in the band division unit 310 in the time domain or the frequency domain according to the information output from the MDCT application unit 325 .

MDCT 적용부(325)는 모드 결정부(320)에서 저주파수 밴드 신호(LB)를 주파수 도메인으로 부호화하는 것으로 결정된 경우, 저주파수 밴드 신호(LB)에 대하여 MDCT를 수행하여, 저주파수 밴드 신호(LB)를 시간 도메인에서 주파수 도메인으로 변환한다. 여기서, MDCT가 수행된 결과는 모드 결정부(320)에서 부호화 도메인을 결정하는데 이용될 수 있다.The MDCT applying unit 325 performs MDCT on the low frequency band signal LB and outputs the low frequency band signal LB to the low frequency band signal LB when it is determined that the low frequency band signal LB is encoded in the frequency domain by the mode determination unit 320 Time domain to the frequency domain. Here, the result of performing the MDCT may be used for determining the encoding domain in the mode determination unit 320. [

주파수 선형 예측 수행부(330)는 MDCT 적용부(325)에서 주파수 도메인으로 변환된 저주파수 밴드 신호에 대하여 주파수 선형 예측을 수행한다. 여기서, 주파수 선형 예측은 현재 주파수에서의 신호를 이전 주파수에서의 신호의 선형 조합으로 근사하는 방법이다. 구체적으로, 주파수 선형 예측 수행부(330)는 선형 예측이 수행된 신호와 현재 주파수에서의 신호 사이의 차이인 예측 오류가 최소가 되도록 선형 예측 필터의 계수를 계산하고, 계산된 계수에 따라 주파수 도메인으로 변환된 저주파수 밴드 신호를 선형 예측 필터링한다. 이 때, 주파수 선형 예측 수행부(330)는 선형 예측 필터의 계수에 대응되는 값에 대하여 벡터 인덱스로 표현하는 벡터 양자화를 수행하여 부호화의 효율을 향상시킬 수 있다.The frequency linear prediction performing unit 330 performs frequency linear prediction on the low frequency band signal converted into the frequency domain by the MDCT applying unit 325. [ Here, the frequency linear prediction is a method of approximating a signal at a current frequency with a linear combination of signals at a previous frequency. In detail, the frequency linear prediction performing unit 330 calculates the coefficients of the linear prediction filter so that the prediction error, which is the difference between the signal subjected to the linear prediction and the signal at the current frequency, is minimized, Lt; RTI ID = 0.0 > low-frequency band < / RTI > In this case, the frequency linear prediction performing unit 330 may improve the efficiency of coding by performing vector quantization which expresses a value corresponding to the coefficient of the linear prediction filter by a vector index.

구체적으로, 주파수 선형 예측 수행부(330)는 MDCT 적용부(325)에서 주파수 도메인으로 변환된 저주파수 밴드 신호가 음성(speech) 신호 또는 피치드(pitched) 신호인 경우에 주파수 선형 예측을 수행할 수 있다. 다시 말해, 주파수 선형 예측 수행부(330)는 입력되는 신호의 특성에 따라 주파수 선형 예측을 수행하여 부호화의 효율을 향상시킬 수 있다.More specifically, the frequency linear prediction performing unit 330 may perform frequency linear prediction when the low frequency band signal converted into the frequency domain in the MDCT applying unit 325 is a speech signal or a pitched signal have. In other words, the frequency linear prediction performing unit 330 may perform frequency linear prediction according to characteristics of an input signal, thereby improving coding efficiency.

멀티-레졸루션 분석부(340)는 MDCT 적용부(325)에서 주파수 도메인으로 변환된 저주파수 밴드 신호 또는 주파수 선형 예측 수행부(330)에서 필터링된 신호를 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수를 다해상도로 분석한다. 구체적으로, 멀티-레졸루션 분석부(340)는 오디오 스펙트럼의 변화의 격렬한 정도 에 따라 주파수 선형 예측 수행부(330)에서 필터링된 오디오 스펙트럼을 두 가지 유형(예를 들어, 스테이빌 유형과 숏 유형)으로 나누어 분석할 수 있다. The multi-resolution analyzer 340 receives the low frequency band signal converted into the frequency domain or the signal filtered by the frequency linear prediction unit 330 in the MDCT applying unit 325 and calculates an audio spectral coefficient of the instantaneous varying signal Resolution. In detail, the multi-resolution analyzing unit 340 divides the filtered audio spectrum in the frequency linear prediction performing unit 330 into two types (for example, the stabill type and the short type) according to the intensity of change of the audio spectrum. .

구체적으로, 멀티-레졸루션 분석부(340)는 MDCT 적용부(325)에서 주파수 도메인으로 변환된 저주파수 밴드 신호 또는 주파수 선형 예측 수행부(330)에서 필터링된 신호가 트렌젼트(transient) 신호인 경우에 멀티-레졸루션으로 분석할 수 있다. 다시 말해, 멀티-레졸루션 분석부(340)는 입력되는 신호의 특성에 따라 멀티-레졸루션 분석을 수행하여 부호화의 효율을 향상시킬 수 있다.More specifically, when the low frequency band signal converted into the frequency domain in the MDCT applying unit 325 or the signal filtered by the frequency linear prediction unit 330 is a transient signal, the multi- It can be analyzed with multi-resolution. In other words, the multi-resolution analyzing unit 340 may perform the multi-resolution analysis according to the characteristics of the inputted signal to improve the coding efficiency.

양자화부(350)는 주파수 선형 예측 수행부(330)에서 필터링된 신호 또는 멀티-레졸루션 분석부(340)에서 출력된 결과를 양자화한다.The quantization unit 350 quantizes the signal filtered by the frequency linear prediction unit 330 or the result output from the multi-resolution analyzer 340.

문맥-기반 비트플레인 부호화부(360)는 문맥을 기반으로 하여 양자화부(350)에서 양자화된 결과를 비트플레인으로 부호화한다. 구체적으로, 문맥-기반 비트 플레인 부호화부(360)는 허프만 코딩을 이용하여 양자화부(350)에서 양자화된 결과를 비트플레인으로 부호화할 수 있다.The context-based bit-plane encoding unit 360 encodes the quantized result in the bit-plane by the quantization unit 350 based on the context. In detail, the context-based bit-plane encoding unit 360 can encode the quantized result in the quantization unit 350 into a bit plane using Huffman coding.

이와 같이, 주파수 선형 예측 수행부(330), 멀티-레졸루션 분석부(340), 양자화부(350) 및 문맥-기반 비트플레인 부호화부(360)는 MDCT 적용부(325)에서 변환된 저주파수 밴드 신호에 대해서 부호화를 수행하므로, 저주파수 밴드 부호화부라고 할 수 있다.In this manner, the frequency linear prediction performing unit 330, the multi-resolution analyzing unit 340, the quantizing unit 350, and the context-based bit plane coding unit 360 may be configured so as to convert the low- The low frequency band encoding unit can be regarded as a low frequency band encoding unit.

저주파수 밴드 변환부(370)는 밴드 분할부(310)에서 분할된 저주파수 밴드 신호(LB)를 MDCT 이외의 변환 기법을 이용하여 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다. 예를 들어, 저주파수 밴드 변환부(370)는 MDST, FFT, 및 QMF 등을 이용하여 저주파수 밴드 신호(LB)를 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환할 수 있다. 여기서, 시간 도메인은 시간의 경과에 따라 저주파수 밴드 신호(LB)의 크기(예를 들어, 에너지 또는 음압 등)를 나타내는 도메인이다. 이에 비해, 주파수 도메인은 주파수의 변화에 따라 저주파수 밴드 신호(LB)의 크기를 나타내는 도메인이다. 시간/주파수 도메인은 시간의 경과 및 주파수의 변화에 따라 저주파수 밴드 신호(LB)의 크기를 나타내는 도메인이다.The low frequency band transformer 370 transforms the low frequency band signal LB divided by the band divider 310 into a frequency domain or a time / frequency domain in a time domain using a conversion technique other than MDCT. For example, the low-frequency band converter 370 may convert the low-frequency band signal LB from the time domain into the frequency domain or the time / frequency domain using MDST, FFT, and QMF. Here, the time domain is a domain representing the size (e.g., energy or sound pressure) of the low frequency band signal LB with the lapse of time. On the other hand, the frequency domain is a domain indicating the magnitude of the low frequency band signal LB in accordance with the change in frequency. The time / frequency domain is a domain indicating the size of the low-frequency band signal LB according to the passage of time and the change in frequency.

고주파수 밴드 변환부(375)는 밴드 분할부(310)에서 분할된 고주파수 밴드 신호(HB)를 MDCT 이외의 변환 기법을 이용하여 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다. 여기서, 고주파밴드 변환부(375)는 저주파밴드 변환부(370)에서 이용하는 동일한 변환 기법을 이용한다. 예를 들어, 고주파수 밴드 변환부(375)는 MDST, FFT, 및 QMF 등을 이용하여 고주파수 밴드 신호(HB)를 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환할 수 있다.The high frequency band converting unit 375 converts the high frequency band signal HB divided in the band dividing unit 310 into a frequency domain or a time / frequency domain from a time domain using a conversion scheme other than MDCT. Here, the high-frequency band converting unit 375 uses the same conversion technique used in the low-frequency band converting unit 370. For example, the high frequency band converter 375 may convert the high frequency band signal HB from the time domain into the frequency domain or the time / frequency domain using MDST, FFT, and QMF.

대역폭 확장 부호화부(380)는 저주파수 밴드 변환부(370)에서 주파수 도메인또는 시간/주파수 도메인으로 변환된 저주파수 밴드 신호를 이용하여, 고주파수 밴드 변환부(375)에서 주파수 도메인 또는 시간/주파수 도메인으로 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화한다. 여기서, 대역폭 확장 정보는 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 다양한 정보를 포함할 수 있음은 본 실시예가 속하는 기술분야에서 통상의 지식을 가진 자는 이해할 수 있을 것이다. 보다 상세하게는, 대역폭 확장 부호화부(380)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 저주파수 밴드의 정보를 이용하여 상술한 바와 같은 대역폭 확장 정보를 생성할 수 있다. 본 발명의 다른 실시예에서, 대역폭 확장 부호화부(380)는 저주파수 밴드 신호에 대하여 부호화된 결과를 이용하여 대역폭 확장 정보를 생성할 수 있다.The bandwidth extension encoding unit 380 performs conversion from the high frequency band conversion unit 375 to the frequency domain or the time / frequency domain using the low frequency band signal converted into the frequency domain or the time / frequency domain in the low frequency band conversion unit 370 And generates bandwidth extension information indicating the characteristics of the high frequency band signal. It will be understood by those skilled in the art that the bandwidth extension information may include various information such as energy level or envelope for the high frequency band signal. More specifically, the bandwidth extension encoding unit 380 can generate the bandwidth extension information as described above using the information of the low frequency band based on the characteristic that there is a high correlation between the high frequency and low frequency bands of the audio signal . In another embodiment of the present invention, the bandwidth extension encoding unit 380 may generate the bandwidth extension information using the encoded result of the low frequency band signal.

CELP 부호화부(385)는 모드 결정부(320)에서 저주파수 밴드 신호(LB)를 시간 도메인으로 부호화하는 것으로 결정된 경우, 저주파수 밴드 신호(LB)를 CELP 방식에 의해 부호화한다. 여기서, CELP 방식은 입력된 저주파수 밴드 신호(LB)에 대하여 선형 예측을 수행하여 계산된 선형 예측 필터의 계수를 이용하여 저주파수 밴드 신호(LB)를 필터링하여 포먼트 성분을 부호화하고, 필터링된 신호에 대하여 적응 코드북(adaptive codebook) 및 고정 코드북(fixed codebook)를 검색하여 피치 성분을 부호화한다.The CELP encoding unit 385 encodes the low frequency band signal LB by the CELP method when it is determined that the low frequency band signal LB is encoded in the time domain by the mode determination unit 320. [ In the CELP method, the low-frequency band signal (LB) is filtered by using the coefficients of the linear prediction filter calculated by performing linear prediction on the input low-frequency band signal (LB), and the formant component is encoded. An adaptive codebook and a fixed codebook are searched for the pitch component to encode the pitch component.

다중화부(390)는 스테레오 부호화부(300), 주파수 선형 예측 수행부(330), 문맥-기반 비트플레인 부호화부(360), 대역폭 확장 부호화부(380) 및 CELP 부호화부(385)에서 부호화를 수행한 결과를 다중화하여 비트 스트림을 생성하고 출력 단자 OUT을 통해 출력한다.The multiplexing unit 390 performs encoding in the stereo encoding unit 300, the frequency linear prediction performing unit 330, the context-based bit plane encoding unit 360, the bandwidth extension encoding unit 380 and the CELP encoding unit 385 Multiplexes the result, and generates a bit stream and outputs it through the output terminal OUT.

도 4는 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다. 4 is a block diagram illustrating an apparatus for encoding an audio signal according to another embodiment of the present invention.

도 4를 참조하면, 오디오 신호의 부호화 장치는 스테레오 부호화부(400), 밴드 분할부(410), 모드 결정부(420), 제1 MDCT 적용부(425), 주파수 선형 예측 수행 부(430), 멀티-레졸루션 분석부(440), 양자화부(450), 문맥-기반 비트플레인 부호화부(460), 제2 MDCT 적용부(470), 제3 MDCT 적용부(475), 대역폭 확장 부호화부(480), CELP 부호화부(485) 및 다중화부(490)를 포함한다.4, an apparatus for encoding an audio signal includes a stereo encoding unit 400, a band dividing unit 410, a mode determining unit 420, a first MDCT applying unit 425, a frequency linear prediction performing unit 430, A second MDCT applying unit 470, a third MDCT applying unit 475, a bandwidth extension encoding unit 440, a context-based bit-plane encoding unit 470, a multi-resolution analyzing unit 440, a quantization unit 450, 480), a CELP encoding unit 485, and a multiplexing unit 490.

스테레오 부호화부(400)는 입력 신호(IN)에서 스테레오 파라미터를 추출하여 부호화하고, 입력 신호(IN)를 다운믹싱한다. 여기서, 입력 신호(IN)는 아날로그의 음성 신호 또는 오디오 신호를 디지털 신호로 변조한 PCM 신호일 수 있다. 여기서, 다운믹싱은 두 채널 이상의 스테레오 신호로부터 한 채널의 모노 신호를 생성하는 것이며, 다운믹싱을 통하여 부호화 과정에 할당되는 비트량을 줄일 수 있다. The stereo encoding unit 400 extracts a stereo parameter from the input signal IN and encodes it, and down-mixes the input signal IN. Here, the input signal IN may be an analog audio signal or a PCM signal obtained by modulating an audio signal into a digital signal. Here, downmixing is to generate mono signals of one channel from two or more stereo signals, and the amount of bits allocated to the encoding process can be reduced through downmixing.

밴드 분할부(410)는 스테레오 부호화부(400)에서 출력된 다운믹싱된 신호를 저주파수 밴드 신호(LB) 및 고주파수 밴드 신호(HB)로 분할한다. 여기서, 저주파수 밴드 신호는 임의의 임계값 보다 낮은 주파수에 해당하는 신호이며, 고주파수 밴드 신호는 임의의 임계값 보다 높은 주파수에 해당하는 신호일 수 있다.The band dividing unit 410 divides the downmixed signal output from the stereo encoding unit 400 into a low frequency band signal LB and a high frequency band signal HB. Here, the low frequency band signal is a signal corresponding to a frequency lower than a certain threshold value, and the high frequency band signal may be a signal corresponding to a frequency higher than a certain threshold value.

모드 결정부(420)는 소정의 기준에 따라 밴드 분할부(410)에서 분할된 저주파수 밴드 신호(LB)를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정한다. 예를 들어, 모드 결정부(420)는 제1 MDCT 적용부(425)에서 출력된 결과에 따라 밴드 분할부(410)에서 분할된 저주파수 밴드 신호(LB)를 시간 도메 인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정한다.The mode determining unit 420 determines whether to encode the low frequency band signal LB divided in the band dividing unit 410 in the time domain or the frequency domain according to a predetermined criterion. For example, the mode determining unit 420 may determine whether the low frequency band signal LB divided in the band dividing unit 410 is coded in the time domain or not in the frequency domain according to the result output from the first MDCT applying unit 425 And decides whether or not to encode it.

제1 MDCT 적용부(425)는 모드 결정부(420)에서 저주파수 밴드 신호(LB)를 주파수 도메인으로 부호화하는 것으로 결정된 경우, 저주파수 밴드 신호(LB)에 대하여 MDCT를 수행하여, 저주파수 밴드 신호(LB)를 시간 도메인에서 주파수 도메인으로 변환한다. 여기서, 시간 도메인은 시간의 경과에 따라 저주파수 밴드 신호(LB)의 크기(예를 들어, 에너지 또는 음압 등)를 나타내는 도메인이다. 이에 비해, 주파수 도메인은 주파수의 변화에 따라 저주파수 밴드 신호(LB)의 크기를 나타내는 도메인이다. 여기서, MDCT가 수행된 결과는 모드 결정부(420)에서 부호화 도메인을 결정하는데 이용될 수 있다.The first MDCT applying unit 425 performs MDCT on the low frequency band signal LB and outputs the low frequency band signal LB when the mode determining unit 420 determines to encode the low frequency band signal LB in the frequency domain. ) From the time domain to the frequency domain. Here, the time domain is a domain representing the size (e.g., energy or sound pressure) of the low frequency band signal LB with the lapse of time. On the other hand, the frequency domain is a domain indicating the magnitude of the low frequency band signal LB in accordance with the change in frequency. Here, the result of performing the MDCT may be used for determining the encoding domain in the mode determination unit 420.

주파수 선형 예측 수행부(430)는 MDCT 적용부(425)에서 주파수 도메인으로 변환된 저주파수 밴드 신호에 대하여 주파수 선형 예측을 수행한다. 여기서, 주파수 선형 예측은 현재 주파수에서의 신호를 이전 주파수에서의 신호의 선형 조합으로 근사하는 방법이다. 구체적으로, 주파수 선형 예측 수행부(430)는 선형 예측이 수행된 신호와 현재 주파수에서의 신호 사이의 차이인 예측 오류가 최소가 되도록 선형 예측 필터의 계수를 계산하고, 계산된 계수에 따라 주파수 도메인으로 변환된 저주파수 밴드 신호를 선형 예측 필터링한다. 이 때, 주파수 선형 예측 수행부(430)는 선형 예측 필터의 계수에 대응되는 값에 대하여 벡터 인덱스로 표현하는 벡터 양자화를 수행하여 부호화의 효율을 향상시킬 수 있다.The frequency linear prediction performing unit 430 performs frequency linear prediction on the low frequency band signal converted into the frequency domain by the MDCT applying unit 425. [ Here, the frequency linear prediction is a method of approximating a signal at a current frequency with a linear combination of signals at a previous frequency. In detail, the frequency linear prediction performing unit 430 calculates the coefficients of the linear prediction filter so that the prediction error, which is the difference between the signal subjected to the linear prediction and the signal at the current frequency, is minimized, Lt; RTI ID = 0.0 > low-frequency band < / RTI > In this case, the frequency linear prediction performing unit 430 may improve the coding efficiency by performing vector quantization which expresses a value corresponding to the coefficient of the linear prediction filter as a vector index.

구체적으로, 주파수 선형 예측 수행부(430)는 MDCT 적용부(425)에서 주파수 도메인으로 변환된 저주파수 밴드 신호가 음성(speech) 신호 또는 피치드(pitched) 신호인 경우에 주파수 선형 예측을 수행할 수 있다. 다시 말해, 주파수 선형 예측 수행부(430)는 입력되는 신호의 특성에 따라 주파수 선형 예측을 수행하여 부호화의 효율을 향상시킬 수 있다.More specifically, the frequency linear prediction performing unit 430 may perform frequency linear prediction when the low frequency band signal converted into the frequency domain in the MDCT applying unit 425 is a speech signal or a pitched signal have. In other words, the frequency linear prediction performing unit 430 may perform frequency linear prediction according to characteristics of an input signal, thereby improving coding efficiency.

멀티-레졸루션 분석부(440)는 MDCT 적용부(425)에서 주파수 도메인으로 변환된 저주파수 밴드 신호 또는 주파수 선형 예측 수행부(430)에서 필터링된 신호를 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수를 다해상도(multi-resolution)로 분석한다. 구체적으로, 멀티-레졸루션 분석부(440)는 오디오 스펙트럼의 변화의 격렬한 정도에 따라 주파수 선형 예측 수행부(430)에서 필터링된 오디오 스펙트럼을 두 가지 유형(예를 들어, 스테이빌 유형과 숏 유형)으로 나누어 분석할 수 있다. The multi-resolution analyzer 440 receives the low-frequency band signal converted into the frequency domain or the signal filtered by the frequency linear prediction unit 430 in the MDCT application unit 425 and calculates an audio spectral coefficient Resolution (multi-resolution). In detail, the multi-resolution analyzer 440 may classify the audio spectrum filtered by the frequency linear prediction performing unit 430 into two types (for example, stabill type and short type) according to the intensity of change of the audio spectrum. .

구체적으로, 멀티-레졸루션 분석부(440)는 MDCT 적용부(425)에서 주파수 도메인으로 변환된 저주파수 밴드 신호 또는 주파수 선형 예측 수행부(430)에서 필터링된 신호가 트렌젼트(transient) 신호인 경우에 멀티-레졸루션으로 분석할 수 있다. 다시 말해, 멀티-레졸루션 분석부(440)는 입력되는 신호의 특성에 따라 멀티-레졸루션 분석을 수행하여 부호화의 효율을 향상시킬 수 있다.More specifically, when the low frequency band signal converted into the frequency domain in the MDCT application unit 425 or the signal filtered in the frequency linear prediction unit 430 is a transient signal, the multi- It can be analyzed with multi-resolution. In other words, the multi-resolution analyzer 440 may improve the coding efficiency by performing multi-resolution analysis according to characteristics of an input signal.

양자화부(450)는 주파수 선형 예측 수행부(430)에서 필터링된 신호 또는 멀티-레졸루션 분석부(440)에서 출력된 결과를 양자화한다.The quantization unit 450 quantizes the filtered signal or the result output from the multi-resolution analysis unit 440 by the frequency linear prediction unit 430.

문맥-기반 비트플레인 부호화부(460)는 문맥을 기반으로 하여 양자화부(450)에서 양자화된 결과를 비트플레인으로 부호화한다. 구체적으로, 문맥-기반 비트 플레인 부호화부(460)는 허프만 코딩을 이용하여 양자화부(450)에서 양자화된 결과를 비트플레인으로 부호화할 수 있다.The context-based bit-plane encoding unit 460 encodes the quantized result in the bit-plane by the quantization unit 450 based on the context. In detail, the context-based bit-plane encoding unit 460 can encode the quantized result in the quantization unit 450 into a bit plane using Huffman coding.

이와 같이, 주파수 선형 예측 수행부(430), 멀티-레졸루션 분석부(440), 양자화부(450) 및 문맥-기반 비트플레인 부호화부(460)는 MDCT 적용부(425)에서 변환된 저주파수 밴드 신호에 대해서 부호화를 수행하므로, 저주파수 밴드 부호화부라고 할 수 있다.In this manner, the frequency linear prediction unit 430, the multi-resolution analyzing unit 440, the quantizing unit 450 and the context-based bit-plane coding unit 460 may be configured so as to convert the low- The low frequency band encoding unit can be regarded as a low frequency band encoding unit.

제2 MDCT 적용부(470)는 밴드 분할부(410)에서 분할된 저주파수 밴드 신호(LB)에 대하여 MDCT를 수행하여, 저주파수 밴드 신호(LB)를 시간 도메인에서 주파수 도메인으로 변환한다. 모드 결정부(420)에서 저주파수 밴드 신호(LB)를 주파수 도메인으로 부호화하는 것으로 결정된 경우, 제2 MDCT 적용부(470)는 저주파수 밴드 신호(LB)에 대해 별도의 MDCT를 수행하지 않는다. 이 경우, 제1 MDCT 적용부(470)의 출력을 제2 MDCT 적용부(470)의 출력으로 대체한다.The second MDCT applying unit 470 performs MDCT on the low frequency band signal LB divided by the band dividing unit 410 to convert the low frequency band signal LB from the time domain to the frequency domain. When it is determined that the low frequency band signal LB is encoded in the frequency domain by the mode determination unit 420, the second MDCT applying unit 470 does not perform a separate MDCT on the low frequency band signal LB. In this case, the output of the first MDCT applying section 470 is replaced with the output of the second MDCT applying section 470.

제3 MDCT 적용부(475)는 밴드 분할부(410)에서 분할된 고주파수 밴드 신호(HB)에 대하여 MDCT를 수행하여, 고주파수 밴드 신호(HB)를 시간 도메인에서 주파수 도메인으로 변환한다. The third MDCT applying unit 475 performs MDCT on the divided high frequency band signal HB in the band dividing unit 410 to convert the high frequency band signal HB from the time domain to the frequency domain.

대역폭 확장 부호화부(480)는 제2 MDCT 적용부(470)에서 주파수 도메인으로 변환된 저주파수 밴드 신호를 이용하여, 제3 MDCT 적용부(475)에서 주파수 도메인으로 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화한다. 여기서, 대역폭 확장 정보는 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 다양한 정보를 포함할 수 있음은 본 실시예가 속하는 기술분야에서 통상의 지식을 가진 자는 이해할 수 있을 것이다. 보다 상세하게는, 대역폭 확 장 부호화부(480)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성을 기초로 저주파수 밴드의 정보를 이용하여 상술한 바와 같은 대역폭 확장 정보를 생성할 수 있다. 본 발명의 다른 실시예에서, 대역폭 확장 부호화부(480)는 저주파수 밴드 신호에 대하여 부호화된 결과를 이용하여 대역폭 확장 정보를 생성할 수 있다.The bandwidth extension encoding unit 480 uses the low frequency band signal converted into the frequency domain in the second MDCT applying unit 470 to represent the characteristics of the high frequency band signal converted into the frequency domain in the third MDCT applying unit 475 And generates and encodes bandwidth extension information. It will be understood by those skilled in the art that the bandwidth extension information may include various information such as energy level or envelope for the high frequency band signal. More specifically, the bandwidth extension encoding unit 480 can generate the bandwidth extension information as described above by using the information of the low frequency bands based on the characteristic that there is a high correlation between the high frequency and the low frequency bands of the audio signal have. In another embodiment of the present invention, the bandwidth extension encoding unit 480 may generate the bandwidth extension information using the encoded result of the low frequency band signal.

CELP 부호화부(485)는 모드 결정부(420)에서 저주파수 밴드 신호(LB)를 시간 도메인으로 부호화하는 것으로 결정된 경우, 저주파수 밴드 신호(LB)를 CELP 방식에 의해 부호화한다. 여기서, CELP 방식은 입력된 저주파수 밴드 신호(LB)에 대하여 선형 예측을 수행하여 계산된 선형 예측 필터의 계수를 이용하여 저주파수 밴드 신호(LB)를 필터링하여 포먼트 성분을 부호화하고, 필터링된 신호에 대하여 적응 코드북 및 고정 코드북를 검색하여 피치 성분을 부호화한다.The CELP encoding unit 485 encodes the low frequency band signal LB by the CELP method when it is determined that the low frequency band signal LB is encoded in the time domain by the mode determination unit 420. [ In the CELP method, the low-frequency band signal (LB) is filtered by using the coefficients of the linear prediction filter calculated by performing linear prediction on the input low-frequency band signal (LB), and the formant component is encoded. The adaptive codebook and the fixed codebook are searched to encode the pitch component.

다중화부(490)는 스테레오 부호화부(400), 주파수 선형 예측 수행부(430), 문맥-기반 비트플레인 부호화부(460), 대역폭 확장 부호화부(480) 및 CELP 부호화부(485)에서 부호화를 수행한 결과를 다중화하여 비트 스트림을 생성하고 출력 단자 OUT을 통해 출력한다.The multiplexing unit 490 performs encoding in the stereo encoding unit 400, the frequency linear prediction performing unit 430, the context-based bit plane encoding unit 460, the bandwidth extension encoding unit 480, and the CELP encoding unit 485 Multiplexes the result, and generates a bit stream and outputs it through the output terminal OUT.

도 5는 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다. 5 is a block diagram illustrating an apparatus for encoding an audio signal according to another embodiment of the present invention.

도 5를 참조하면, 오디오 신호의 부호화 장치는 변환부(500), 스테레오 부호화부(510), 역변환부(520), 모드 결정부(530), FV-MLT 적용부(535), 주파수 선형 예측 수행부(540), 멀티-레졸루션 분석부(550), 양자화부(560), 문맥-기반 비트플 레인 부호화부(570), 대역폭 확장 부호화부(580), CELP 부호화부(585) 및 다중화부(590)를 포함한다.5, an apparatus for encoding an audio signal includes a transform unit 500, a stereo encoding unit 510, an inverse transform unit 520, a mode determination unit 530, an FV-MLT application unit 535, a frequency linear prediction Based bit plane encoding unit 570, a bandwidth extension encoding unit 580, a CELP encoding unit 585, and a multiplexing unit 540. The context-based bit plane encoding unit 570, the multi-resolution analysis unit 550, the quantization unit 560, (590).

변환부(500)는 입력 신호(IN)를 MDCT 이외의 변환 기법을 이용하여 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다. 구체적으로, 변환부(500)는 MDST, FFT 및 QMF 등을 이용하여 입력 신호(IN)를 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다. 이 경우, 변환부(500)는 MDCT를 이용할 수 없는 것은 아니지만, MDCT를 사용할 경우에는 다른 실시예가 보다 효율적이다. The conversion unit 500 converts the input signal IN from the time domain to the frequency domain or the time / frequency domain using a conversion technique other than MDCT. Specifically, the transform unit 500 transforms the input signal IN from the time domain into the frequency domain or the time / frequency domain using MDST, FFT, QMF, or the like. In this case, the converting unit 500 can not use the MDCT, but other embodiments are more efficient when the MDCT is used.

여기서, 입력 신호(IN)는 아날로그의 음성 신호 또는 오디오 신호를 디지털 신호로 변조한 PCM 신호일 수 있다. 여기서, 시간 도메인은 시간의 경과에 따라 입력 신호(IN)의 크기(예를 들어, 에너지 또는 음압 등)를 나타내는 도메인이다. 이에 비해, 주파수 도메인은 주파수의 변화에 따라 입력 신호(IN)의 크기를 나타내는 도메인이다. 시간/주파수 도메인은 시간의 경과 및 주파수의 변화에 따라 입력 신호(IN)의 크기를 나타내는 도메인이다.Here, the input signal IN may be an analog audio signal or a PCM signal obtained by modulating an audio signal into a digital signal. Here, the time domain is a domain representing the size (e.g., energy or sound pressure) of the input signal IN with the lapse of time. On the other hand, the frequency domain is a domain representing the magnitude of the input signal IN according to the frequency change. The time / frequency domain is a domain indicating the size of the input signal IN according to the passage of time and the change in frequency.

스테레오 부호화부(510)는 변환부(500)에서 주파수 도메인 또는 시간/주파수 도메인으로 변환된 신호에서 스테레오 파라미터를 추출하여 부호화하고, 변환된 신호를 다운믹싱한다. 여기서, 다운믹싱은 두 채널 이상의 스테레오 신호로부터 한 채널의 모노 신호를 생성하는 것이며, 다운믹싱을 통하여 부호화 과정에 할당되는 비트량을 줄일 수 있다. The stereo encoding unit 510 extracts and encodes stereo parameters in the frequency domain or time / frequency domain converted signals in the transform unit 500, and down-mixes the converted signals. Here, downmixing is to generate mono signals of one channel from two or more stereo signals, and the amount of bits allocated to the encoding process can be reduced through downmixing.

구체적으로, 스테레오 파라미터는 스테레오 신호에 대한 부가 정보를 나타내 는 것으로, 부가 정보는 좌채널 신호 및 우채널 신호의 채널 간의 위상차 또는 강도차 등의 다양한 정보를 포함할 수 있음은 본 실시예가 속하는 기술 분야에서 통상의 지식을 가진 자는 이해할 수 있을 것이다.Specifically, the stereo parameter indicates additional information for a stereo signal, and the additional information may include various information such as a phase difference or intensity difference between channels of the left channel signal and the right channel signal. Those skilled in the art will understand.

역변환부(520)는 스테레오 부호화부(510)에서 다운믹싱된 신호를 주파수 도메인 또는 시간/주파수 도메인에서 시간 도메인으로 역변환한다. 이 경우, 역변환부(520)는 변환부(500)에서 이용하는 변환 기법에 대응되는 역변환 기법을 이용할 수 있다. 예를 들어, 변환부(500)에서 QMF를 이용한 경우, 역변환부(520)는 역 QMF를 이용할 수 있다.The inverse transform unit 520 inverse transforms the downmixed signal in the frequency domain or the time domain into the time domain in the stereo encoding unit 510. In this case, the inverse transform unit 520 may use an inverse transform technique corresponding to the transform technique used in the transform unit 500. For example, when the QMF is used in the transform unit 500, the inverse transform unit 520 may use the inverse QMF.

모드 결정부(530)는 소정의 기준에 따라 역변환부(520)에서 역변환된 신호를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정한다. 예를 들어, 모드 결정부(530)는 FV-MLT 적용부(535)에서 출력된 결과에 따라 역변환부(520)에서 역변환된 신호를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정할 수 있다.The mode determination unit 530 determines whether to encode a signal that is inversely transformed by the inverse transform unit 520 in a time domain or a frequency domain according to a predetermined criterion. For example, the mode determination unit 530 can determine whether to inverse-transform the signal in the time domain or the frequency domain according to the result output from the FV-MLT application unit 535 .

FV-MLT 적용부(535)는 모드 결정부(530)에서 부호화 도메인이 결정된 신호에 대하여 FV-MLT(Frequency Varying Modulated Lapped Transform)를 수행하여, 모드 결정부(420)에서 부호화 도메인이 결정된 신호를 서브 밴드 별로 시간 도메인 또는 주파수 도메인 변환한다. 보다 상세하게 설명하면, FV-MLT는 시간 도메인으로 표현된 신호를 주파수 도메인으로 변환한 후 밴드 별로 적절히 시간 해상도(temporal resolution)를 조절하여 소정의 서브 밴드에 대하여 시간 도메인 또는 주파수 도메인으로 표현할 수 있는 적응적인(flexible) 변환 방식이다. 여기서, FV-MLT가 수행 된 결과는 모드 결정부(530)에서 부호화 도메인을 결정하는데 이용될 수 있다.The FV-MLT applying unit 535 performs FV-MLT (Frequency Variant Modulated Lapped Transform) on the signal whose encoding domain is determined in the mode determining unit 530, and outputs the signal whose encoding domain is determined in the mode determining unit 420 And performs time domain or frequency domain conversion for each subband. More specifically, the FV-MLT can transform a signal represented by a time domain into a frequency domain and then adjust the temporal resolution appropriately for each band, so that it can be expressed in a time domain or a frequency domain for a predetermined subband It is an adaptive (flexible) conversion scheme. Here, the result of performing the FV-MLT may be used to determine an encoding domain in the mode determination unit 530. [

주파수 선형 예측 수행부(540)는 모드 결정부(530)에서 신호를 주파수 도메인으로 부호화하는 것으로 결정된 경우, FV-MLT 적용부(535)에서 주파수 도메인으로 변환된 신호에 대하여 주파수 선형 예측을 수행한다. 여기서, 주파수 선형 예측은 현재 주파수에서의 신호를 이전 주파수에서의 신호의 선형 조합으로 근사하는 방법이다. 구체적으로, 주파수 선형 예측 수행부(540)는 선형 예측이 수행된 신호와 현재 주파수에서의 신호 사이의 차이인 예측 오류가 최소가 되도록 선형 예측 필터의 계수를 계산하고, 계산된 계수에 따라 주파수 도메인으로 변환된 저주파수 밴드 신호를 선형 예측 필터링한다. 이 때, 주파수 선형 예측 수행부(540)는 선형 예측 필터의 계수에 대응되는 값에 대하여 벡터 인덱스로 표현하는 벡터 양자화를 수행하여 부호화의 효율을 향상시킬 수 있다.When it is determined that the signal is encoded in the frequency domain by the mode determination unit 530, the frequency linear prediction performing unit 540 performs frequency linear prediction on the signal converted into the frequency domain in the FV-MLT applying unit 535 . Here, the frequency linear prediction is a method of approximating a signal at a current frequency with a linear combination of signals at a previous frequency. In detail, the frequency linear prediction performing unit 540 calculates the coefficients of the linear prediction filter so that the prediction error, which is the difference between the signal subjected to the linear prediction and the signal at the current frequency, is minimized, Lt; RTI ID = 0.0 > low-frequency band < / RTI > In this case, the frequency linear prediction performing unit 540 may improve the coding efficiency by performing vector quantization which expresses a value corresponding to the coefficient of the linear prediction filter by a vector index.

구체적으로, 주파수 선형 예측 수행부(540)는 FV-MLT 적용부(535)에서 주파수 도메인으로 변환된 신호가 음성(speech) 신호 또는 피치드(pitched) 신호인 경우에 주파수 선형 예측을 수행할 수 있다. 다시 말해, 주파수 선형 예측 수행부(540)는 입력되는 신호의 특성에 따라 주파수 선형 예측을 수행하여 부호화의 효율을 향상시킬 수 있다.More specifically, the frequency linear prediction performing unit 540 may perform frequency linear prediction when the signal converted into the frequency domain in the FV-MLT applying unit 535 is a speech signal or a pitched signal have. In other words, the frequency linear prediction performing unit 540 may improve the efficiency of encoding by performing frequency linear prediction according to characteristics of an input signal.

멀티-레졸루션 분석부(550)는 FV-MLT 적용부(535)에서 주파수 도메인으로 변환된 신호 또는 주파수 선형 예측 수행부(540)에서 필터링된 신호를 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수를 다해상도로 분석한다. 구체적으로, 멀티-레졸루션 분석부(550)는 오디오 스펙트럼의 변화의 격렬한 정도에 따라 주파 수 선형 예측 수행부(540)에서 필터링된 오디오 스펙트럼을 두 가지 유형(예를 들어, 스테이빌 유형과 숏 유형)으로 나누어 분석할 수 있다. The multi-resolution analyzing unit 550 receives the signal transformed into the frequency domain or the signal filtered by the frequency linear prediction unit 540 in the FV-MLT applying unit 535 and calculates an audio spectral coefficient Resolution. Specifically, the multi-resolution analyzing unit 550 may classify the filtered audio spectrum in the frequency-domain linear prediction performing unit 540 into two types (for example, a stave type and a short type) according to the intensity of change of the audio spectrum ).

구체적으로, 멀티-레졸루션 분석부(550)는 FV-MLT 적용부(535)에서 주파수 도메인으로 변환된 신호 또는 주파수 선형 예측 수행부(540)에서 필터링된 신호가 트렌젼트(transient) 신호인 경우에 멀티-레졸루션으로 분석할 수 있다. 다시 말해, 멀티-레졸루션 분석부(550)는 입력되는 신호의 특성에 따라 멀티-레졸루션 분석을 수행하여 부호화의 효율을 향상시킬 수 있다.More specifically, when the FV-MLT application unit 535 converts the signal into the frequency domain or the signal filtered by the frequency linear prediction unit 540 is a transient signal, the multi- It can be analyzed with multi-resolution. In other words, the multi-resolution analyzing unit 550 may perform the multi-resolution analysis according to the characteristics of the inputted signal to improve the coding efficiency.

양자화부(560)는 주파수 선형 예측 수행부(540)에서 필터링된 신호 또는 멀티-레졸루션 분석부(550)에서 출력된 결과를 양자화한다.The quantization unit 560 quantizes the signal output from the frequency linear prediction unit 540 or the result output from the multi-resolution analysis unit 550.

문맥-기반 비트플레인 부호화부(570)는 문맥을 기반으로 하여 양자화부(560)에서 양자화된 결과를 비트플레인으로 부호화한다. 구체적으로, 문맥-기반 비트 플레인 부호화부(570)는 허프만 코딩을 이용하여 양자화부(560)에서 양자화된 결과를 비트플레인으로 부호화할 수 있다.The context-based bit-plane encoding unit 570 encodes the quantized result in the bit-plane by the quantization unit 560 based on the context. In detail, the context-based bit-plane encoding unit 570 can encode the quantized result in the quantization unit 560 into a bit plane using Huffman coding.

대역폭 확장 부호화부(580)는 스테레오 부호화부(510)에서 다운믹싱된 신호에서 대역폭 확장 정보를 추출하여 부호화한다. 여기서, 대역폭 확장 정보는 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 다양한 정보를 포함할 수 있음은 본 실시예가 속하는 기술분야에서 통상의 지식을 가진 자는 이해할 수 있을 것이다. The bandwidth extension encoding unit 580 extracts bandwidth extension information from the downmixed signal in the stereo encoding unit 510 and encodes the bandwidth extension information. It will be understood by those skilled in the art that the bandwidth extension information may include various information such as energy level or envelope for the high frequency band signal.

CELP 부호화부(585)는 모드 결정부(530)에서 신호를 시간 도메인으로 부호화하는 것으로 결정된 경우, FV-MLT 적용부(535)에서 시간 도메인으로 변환된 신호를 CELP 방식에 의해 부호화한다. 여기서, CELP 방식은 입력된 신호에 대하여 선형 예측을 수행하여 계산된 선형 예측 필터의 계수를 이용하여 신호를 필터링하여 포먼트 성분을 부호화하고, 필터링된 신호에 대하여 적응 코드북 및 고정 코드북를 검색하여 피치 성분을 부호화한다.When it is determined that the signal is to be encoded in the time domain by the mode determination unit 530, the CELP encoding unit 585 encodes the signal converted into the time domain by the FV-MLT application unit 535 according to the CELP method. In the CELP method, the formant component is encoded by filtering the signal using the coefficients of the linear prediction filter calculated by performing linear prediction on the input signal, and an adaptive codebook and a fixed codebook are searched for the filtered signal, .

다중화부(590)는 스테레오 부호화부(500), 주파수 선형 예측 수행부(540), 문맥-기반 비트플레인 부호화부(570), 대역폭 확장 부호화부(580) 및 CELP 부호화부(585)에서 부호화를 수행한 결과를 다중화하여 비트 스트림을 생성하고 출력 단자 OUT을 통해 출력한다.The multiplexing unit 590 performs encoding in the stereo encoding unit 500, the frequency linear prediction performing unit 540, the context-based bit plane encoding unit 570, the bandwidth extension encoding unit 580, and the CELP encoding unit 585 Multiplexes the result, and generates a bit stream and outputs it through the output terminal OUT.

도 6은 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다. 6 is a block diagram illustrating an apparatus for encoding an audio signal according to another embodiment of the present invention.

도 6을 참조하면, 오디오 신호의 부호화 장치는 모드 결정부(600), FV-MLT 적용부(610), 스테레오 부호화부(620), 주파수 선형 예측 수행부(630), 멀티-레졸루션 분석부(640), 양자화부(650), 문맥-기반 비트플레인 부호화부(660), 대역폭 확장 부호화부(670), CELP 부호화부(680) 및 다중화부(690)를 포함한다.6, an apparatus for encoding an audio signal includes a mode determination unit 600, an FV-MLT application unit 610, a stereo encoding unit 620, a frequency linear prediction unit 630, a multi- 640, a quantization unit 650, a context-based bit-plane coding unit 660, a bandwidth extension coding unit 670, a CELP coding unit 680 and a multiplexing unit 690.

모드 결정부(600)는 소정의 기준에 따라 입력 신호(IN)를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정한다. 여기서, 입력 신호(IN)는 아날로그의 음성 신호 또는 오디오 신호를 디지털 신호로 변조한 PCM 신호일 수 있다. 예를 들어, 모드 결정부(600)는 FV-MLT 적용부(610)에서 출력된 결과에 따라 입력 신호(IN)를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정한다.The mode determination unit 600 determines whether to encode the input signal IN in the time domain or the frequency domain according to a predetermined criterion. Here, the input signal IN may be an analog audio signal or a PCM signal obtained by modulating an audio signal into a digital signal. For example, the mode determination unit 600 determines whether to encode the input signal IN in the time domain or the frequency domain according to the result output from the FV-MLT application unit 610.

FV-MLT 적용부(610)는 모드 결정부(600)에서 부호화 도메인이 결정된 신호에 대하여 FL-MLT(Frequency Varying Modulated Lapped Transform)를 수행하여, 모드 결정부(600)에서 부호화 도메인이 결정된 신호를 서브 밴드 별로 시간 도메인 또는 주파수 도메인으로 변환한다. 보다 상세하게 설명하면, FV-MLT는 시간 도메인으로 표현된 신호를 주파수 도메인으로 변환한 후 밴드 별로 적절히 시간 해상도를 조절하여 소정의 서브 밴드에 대하여 시간 도메인 또는 주파수 도메인으로 표현할 수 있는 적응적인 변환 방식이다. 여기서, FV-MLT가 수행된 결과는 모드 결정부(600)에서 부호화 도메인을 결정하는데 이용될 수 있다. 여기서, 시간 도메인은 시간의 경과에 따라 입력 신호(IN)의 크기(예를 들어, 에너지 또는 음압 등)를 나타내는 도메인이다. 이에 비해, 주파수 도메인은 주파수의 변화에 따라 신호의 크기를 나타내는 도메인이다. 시간/주파수 도메인은 시간의 경과 및 주파수의 변화에 따라 신호의 크기를 나타내는 도메인이다.The FV-MLT application unit 610 performs a FL-MLT (Frequency Variant Modulated Lapped Transform) on a signal whose encoding domain is determined in the mode determination unit 600, and outputs the signal whose encoding domain is determined in the mode determination unit 600 Into a time domain or a frequency domain for each subband. More specifically, the FV-MLT converts an input signal represented by a time domain into a frequency domain and then adjusts a time resolution appropriately for each band, thereby generating an adaptive conversion scheme to be. Here, the result of performing the FV-MLT may be used to determine the encoding domain in the mode determination unit 600. Here, the time domain is a domain representing the size (e.g., energy or sound pressure) of the input signal IN with the lapse of time. On the other hand, the frequency domain is a domain indicating the magnitude of a signal in accordance with a change in frequency. The time / frequency domain is a domain representing the size of a signal according to the passage of time and the change of frequency.

스테레오 부호화부(620)는 FV-MLT 적용부(610)에서 변환된 신호에서 스테레오 파라미터를 추출하여 부호화하고, 변환된 신호를 다운믹싱한다. 여기서, 다운믹싱은 두 채널 이상의 스테레오 신호로부터 한 채널의 모노 신호를 생성하는 것이며, 다운믹싱을 통하여 부호화 과정에 할당되는 비트량을 줄일 수 있다. The stereo encoding unit 620 extracts a stereo parameter from the converted signal by the FV-MLT applying unit 610, encodes the stereo parameter, and downmixes the converted signal. Here, downmixing is to generate mono signals of one channel from two or more stereo signals, and the amount of bits allocated to the encoding process can be reduced through downmixing.

주파수 선형 예측 수행부(630)는 모드 결정부(600)에서 입력 신호(IN)를 주파수 도메인으로 부호화하는 것으로 결정된 경우, FV-MLT 적용부(610)에서 주파수 도메인으로 변환된 신호에 대하여 주파수 선형 예측을 수행한다. 여기서, 주파수 선형 예측은 현재 주파수에서의 신호를 이전 주파수에서의 신호의 선형 조합으로 근사하는 방법이다. 구체적으로, 주파수 선형 예측 수행부(630)는 선형 예측이 수행된 신호와 현재 주파수에서의 신호 사이의 차이인 예측 오류가 최소가 되도록 선형 예측 필터의 계수를 계산하고, 계산된 계수에 따라 주파수 도메인으로 변환된 저주파수 밴드 신호를 선형 예측 필터링한다. 이 때, 주파수 선형 예측 수행부(630)는 선형 예측 필터의 계수에 대응되는 값에 대하여 벡터 인덱스로 표현하는 벡터 양자화를 수행하여 부호화의 효율을 향상시킬 수 있다.When it is determined that the input signal IN is to be encoded in the frequency domain by the mode determination unit 600, the frequency linear prediction performing unit 630 performs frequency linear interpolation on the signal converted into the frequency domain in the FV-MLT applying unit 610, Perform the prediction. Here, the frequency linear prediction is a method of approximating a signal at a current frequency with a linear combination of signals at a previous frequency. Specifically, the frequency linear prediction performing unit 630 calculates the coefficients of the linear prediction filter so that the prediction error, which is the difference between the signal subjected to the linear prediction and the signal at the current frequency, is minimized, Lt; RTI ID = 0.0 > low-frequency band < / RTI > In this case, the frequency linear prediction performing unit 630 may improve the coding efficiency by performing vector quantization which expresses a value corresponding to the coefficient of the linear prediction filter by a vector index.

구체적으로, 주파수 선형 예측 수행부(630)는 FV-MLT 적용부(610)에서 주파수 도메인으로 변환된 신호가 음성(speech) 신호 또는 피치드(pitched) 신호인 경우에 주파수 선형 예측을 수행할 수 있다. 다시 말해, 주파수 선형 예측 수행부(630)는 입력되는 신호의 특성에 따라 주파수 선형 예측을 수행하여 부호화의 효율을 향상시킬 수 있다.More specifically, the frequency linear prediction performing unit 630 can perform frequency linear prediction when the signal converted into the frequency domain in the FV-MLT applying unit 610 is a speech signal or a pitched signal have. In other words, the frequency linear prediction performing unit 630 may perform frequency linear prediction according to characteristics of an input signal to improve encoding efficiency.

멀티-레졸루션 분석부(640)는 FV-MLT 적용부(610)에서 주파수 도메인으로 변환된 신호 또는 주파수 선형 예측 수행부(630)에서 필터링된 신호를 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수를 다해상도(multi-resolution)로 분석한다. 구체적으로, 멀티-레졸루션 분석부(640)는 오디오 스펙트럼의 변화의 격렬한 정도에 따라 주파수 선형 예측 수행부(630)에서 필터링된 오디오 스펙트럼을 두 가 지 유형(예를 들어, 스테이빌 유형과 숏 유형)으로 나누어 분석할 수 있다. The multi-resolution analyzing unit 640 receives the signal transformed into the frequency domain or the signal filtered by the frequency linear prediction unit 630 in the FV-MLT applying unit 610 and calculates an audio spectral coefficient of the instantaneous varying signal Resolution (multi-resolution). In more detail, the multi-resolution analyzing unit 640 may classify the filtered audio spectrum in the frequency linear prediction performing unit 630 into two types (for example, a stave type and a short type) according to the intensity of change of the audio spectrum. ).

구체적으로, 멀티-레졸루션 분석부(640)는 FV-MLT 적용부(610)에서 주파수 도메인으로 변환된 신호 또는 주파수 선형 예측 수행부(630)에서 필터링된 신호가 트렌젼트(transient) 신호인 경우에 멀티-레졸루션으로 분석할 수 있다. 다시 말해, 멀티-레졸루션 분석부(640)는 입력되는 신호의 특성에 따라 멀티-레졸루션 분석을 수행하여 부호화의 효율을 향상시킬 수 있다.More specifically, when the FV-MLT applying unit 610 transforms the signal into the frequency domain or the signal filtered by the frequency linear prediction unit 630 is a transient signal, the multi- It can be analyzed with multi-resolution. In other words, the multi-resolution analyzing unit 640 may improve the coding efficiency by performing multi-resolution analysis according to characteristics of an input signal.

양자화부(650)는 주파수 선형 예측 수행부(630)에서 필터링된 신호 또는 멀티-레졸루션 분석부(640)에서 출력된 결과를 양자화한다.The quantization unit 650 quantizes the filtered signal or the result output from the multi-resolution analysis unit 640 by the frequency linear prediction unit 630.

문맥-기반 비트플레인 부호화부(660)는 문맥을 기반으로 하여 양자화부(650)에서 양자화된 결과를 비트플레인으로 부호화한다. 구체적으로, 문맥-기반 비트 플레인 부호화부(660)는 허프만 코딩을 이용하여 양자화부(650)에서 양자화된 결과를 비트플레인으로 부호화할 수 있다.The context-based bit-plane encoding unit 660 encodes the quantized result in the bit-plane by the quantization unit 650 based on the context. Specifically, the context-based bit-plane encoding unit 660 can encode the quantized result in the quantization unit 650 into a bit plane using Huffman coding.

대역폭 확장 부호화부(670)는 스테레오 부호화부(620)에서 다운믹싱된 신호에서 대역폭 확장 정보를 추출하여 부호화한다. 여기서, 대역폭 확장 정보는 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 다양한 정보를 포함할 수 있음은 본 실시예가 속하는 기술분야에서 통상의 지식을 가진 자는 이해할 수 있을 것이다. The bandwidth extension encoding unit 670 extracts bandwidth extension information from the downmixed signal in the stereo encoding unit 620 and encodes the bandwidth extension information. It will be understood by those skilled in the art that the bandwidth extension information may include various information such as energy level or envelope for the high frequency band signal.

CELP 부호화부(680)는 모드 결정부(530)에서 입력 신호(IN)를 시간 도메인으로 부호화하는 것으로 결정된 경우, 스테레오 부호화부(620)에서 다운믹싱된 신호를 CELP 방식에 의해 부호화한다. 여기서, CELP 방식은 다운믹싱된 신호에 대하여 선형 예측을 수행하여 계산된 선형 예측 필터의 계수를 이용하여 다운믹싱된 신호를 필터링하여 포먼트 성분을 부호화하고, 필터링된 신호에 대하여 적응 코드북 및 고정 코드북을 검색하여 피치 성분을 부호화한다.The CELP encoding unit 680 encodes the downmixed signal in the stereo encoding unit 620 according to the CELP method when it is determined that the input signal IN is to be encoded in the time domain by the mode determination unit 530. In the CELP method, the formant component is encoded by filtering the downmixed signal using the coefficients of the linear prediction filter calculated by performing linear prediction on the downmixed signal, and the adaptive codebook and the fixed codebook And the pitch component is encoded.

다중화부(690)는 스테레오 부호화부(620), 주파수 선형 예측 수행부(630), 문맥-기반 비트플레인 부호화부(660), 대역폭 확장 부호화부(670) 및 CELP 부호화부(680)에서 부호화를 수행한 결과를 다중화하여 비트 스트림을 생성하고 출력 단자 OUT을 통해 출력한다.The multiplexing unit 690 performs encoding in the stereo encoding unit 620, the frequency linear prediction performing unit 630, the context-based bit plane encoding unit 660, the bandwidth extension encoding unit 670, and the CELP encoding unit 680 Multiplexes the result, and generates a bit stream and outputs it through the output terminal OUT.

도 7은 본 발명의 일 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다. 7 is a block diagram illustrating an apparatus for decoding an audio signal according to an embodiment of the present invention.

도 7을 참조하면, 오디오 신호의 복호화 장치는 역다중화부(700), 문맥-기반 비트플레인 복호화부(710), 역양자화부(720), 멀티-레졸루션 합성부(730), 역 주파수 선형 예측 수행부(740), 대역폭 확장 복호화부(750), 제1 역 MDCT 적용부(760), 제2 역 MDCT 적용부(770), 밴드 합성부(780) 및 스테레오 복호화부(790)를 포함한다.7, an apparatus for decoding an audio signal includes a demultiplexer 700, a context-based bit plane decoder 710, an inverse quantizer 720, a multi-resolution synthesizer 730, an inverse frequency linear prediction A bandwidth extension decoder 750, a first inverse MDCT applying unit 760, a second inverse MDCT applying unit 770, a band combining unit 780, and a stereo decoding unit 790 .

역다중화부(700)는 부호화단으로부터 출력된 비트 스트림(bit stream)을 입력받아 역다중화한다. 여기서, 역다중화부(700)가 출력하는 정보는 오디오 스펙트럼 스트림의 설명 분석, 양자화 값과 기타 복원(reconstruction) 정보, 양자화 스펙트럼의 복원 정보, 문맥-기반 비트플레인 복호화의 정보, 신호 타입 정보(Signal type information), 주파수 선형 예측 및 벡터 양자화(Frequency-domain linear prediction and vector quantization)의 정보, 부호화된 대역폭 확장 정보 및 부호 화된 스테레오 파라미터 등이 있다.The demultiplexer 700 receives the bit stream output from the encoder and demultiplexes the bit stream. Here, the information output by the demultiplexing unit 700 may include description analysis of the audio spectrum stream, quantization value and other reconstruction information, quantization spectrum reconstruction information, context-based bit plane decoding information, signal type information Signal type information, information on frequency-domain linear prediction and vector quantization, encoded bandwidth extension information, and coded stereo parameters.

문맥-기반 비트플레인 복호화부(710, Context-dependent Bit plane decoding unit)는 부호화된 비트플레인을 문맥을 기반으로 복호화한다. 여기서, 문맥-기반 비트플레인 복호화부(710)는 역다중화부(700)로부터 출력된 정보를 입력받아 허프먼 복호화(Huffman decoding)를 진행하여 주파수 스펙트럼, 코딩 밴드 모드(coding band mode) 정보 및 스케일 팩터(scale factor) 등을 복원한다. 구체적으로, 문맥-기반 비트플레인 복호화부(710)는 프레주디스 코딩 밴드 모드(Prejudice coding band mode) 정보, 프레주디스 코딩(Prejudice coding)의 스케일 팩터, 및 프레주디스 코딩(Prejudice coding)의 주파수 스펙트럼을 입력받아 코딩 밴드 모드 수치, 스케일 팩터의 복호화 코스메틱(decoding cosmetic) 표시, 주파수 스펙트럼의 양자화 값을 출력한다.The context-dependent bit plane decoding unit 710 decodes the coded bit plane based on the context. The context-based bit-plane decoding unit 710 receives the information output from the demultiplexer 700 and performs Huffman decoding to obtain a frequency spectrum, coding band mode information, And restores a scale factor and the like. Specifically, the context-based bit-plane decoding unit 710 decodes a frequency spectrum of prejudice coding band mode information, a scale factor of prejudice coding, and a prejudice coding And outputs a coding band mode value, a decoding cosmetic expression of a scale factor, and a quantization value of the frequency spectrum.

역양자화부(720)는 문맥-기반 비트플레인 복호화부(710)에서 출력된 결과를 역양자화한다.The inverse quantization unit 720 dequantizes the result output from the context-based bit plane decoding unit 710.

멀티-레졸루션 합성부(730, Multi-resolution synthesis unit)는 역양자화부(720)의 출력을 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수의 멀티-레졸루션(Multi-resolution)을 처리한다. 구체적으로, 멀티-레졸루션 합성부(730)는 오디오 신호가 부호화단에서 멀티-레졸루션으로 분석된 경우에 역양자화부(720)의 출력을 멀티-레졸루션으로 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 멀티-레졸루션 합성부(740)는 역 양자화 스펙트럼/차 스텍트럼(reserve quantization spectrum/difference spectrum)을 입력받아 복원 스펙트럼/차 스펙트 럼(reconstruction spectrum/difference spectrum)을 출력한다.The multi-resolution synthesis unit 730 receives the output of the inverse quantization unit 720 and processes the multi-resolution of the audio spectral coefficients of the instantaneous varying signals. In detail, when the audio signal is analyzed in multi-resolution at the encoding end, the multi-resolution synthesizer 730 may synthesize the output of the dequantizer 720 in multi-resolution to improve decoding efficiency. Here, the multi-resolution synthesizer 740 receives the inverse quantization spectrum / difference spectrum and outputs a reconstruction spectrum / difference spectrum.

역 주파수 선형 예측 수행부(740, inverse frequency linear prediction performing unit)은 멀티-레졸루션 합성부(730)의 출력과 역다중화부(700)로부터 입력받은 부호화단에서 주파수 선형 예측을 수행한 결과를 합성한다. 구체적으로, 역 주파수 선형 예측 수행부(740)는 오디오 신호가 부호화단에서 주파수 선형 예측이 수행된 경우에 상기 주파수 선형 예측이 수행된 결과를 역양자화부(720)의 출력 또는 멀티-레졸루션 합성부(730)의 출력과 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 역 주파수 선형 예측 수행부(740)은 주파수 도메인 예측 기술과 예측 계수의 벡터 양자화 기술을 채용하여 코딩 효율을 유효하게 제고하였다. 역 주파수 선형 예측 수행부(740)은 차 스펙트럼(difference spectrum) 계수, 벡터의 인덱스(index)를 입력받아 MDCT 스펙트럼 계수를 출력한다.The inverse frequency linear prediction performing unit 740 combines the output of the multi-resolution synthesizing unit 730 and the result of performing the frequency linear prediction on the encoding end input from the demultiplexing unit 700 . In more detail, the inverse linear prediction unit 740 may perform a frequency linear prediction on the output of the inverse quantization unit 720 or the output of the inverse quantization unit 720 when the audio signal is frequency- (730) to improve decoding efficiency. Here, the inverse frequency linear prediction performance unit 740 effectively utilizes the frequency domain prediction technique and the vector quantization technique of the prediction coefficients to effectively improve the coding efficiency. The inverse frequency linear prediction performing unit 740 receives the difference spectrum coefficient and index of the vector and outputs the MDCT spectrum coefficient.

대역폭 확장 부호화부(750)는 역다중화부(700)로부터 입력받은 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 역 주파수 선형 예측 수행부(740)에서 출력된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성한다. 여기서, 대역폭 확장 복호화부(750)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 복호화된 대역폭 확장 정보를 저주파수 밴드 신호에 적용함으로써 고주파 밴드 신호를 생성한다. 여기서, 대역폭 확장 정보는 고주파수 밴드의 고주파수 밴드의 특성을 나타낼 수 있는 정보로써 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 정보이다.The bandwidth extension encoding unit 750 decodes the encoded bandwidth extension information received from the demultiplexing unit 700 and extracts the low frequency band signal output from the inverse frequency linear prediction unit 740 using the decoded bandwidth extension information Thereby generating a high frequency band signal. Here, the bandwidth extension decoder 750 generates the high-frequency band signal by applying the decoded bandwidth extension information to the low-frequency band signal based on a characteristic that there is a high correlation between the high-frequency and low-frequency bands of the audio signal. Here, the bandwidth extension information is information such as an energy level or an envelope for the high frequency band signal, which is information capable of exhibiting characteristics of a high frequency band of a high frequency band.

제1 역 MDCT 적용부(760)는 부호화단에 대한 역변환 과정으로서 역 주파수 선형 예측 수행부(740)에서 출력된 저주파수 밴드 신호에 대하여 역 MDCT(inverse MDCT)를 수행하여 주파수 도메인에서 시간 도메인으로 역변환한다. 여기서, 제1 역 MDCT 적용부(760)는 역 주파수 선형 예측 수행부(740)에서 역양자화한 결과로 얻은 주파수 스펙트럼 계수를 입력받아 저주파수 밴드에 해당하는 복원된 오디오 데이터로 출력한다.The first inverse MDCT applying unit 760 performs an inverse MDCT on the low frequency band signal output from the inverse frequency linear prediction performing unit 740 as an inverse transform process on the encoding end, do. The first inverse MDCT applying unit 760 receives the frequency spectrum coefficient obtained as a result of inverse quantization by the inverse frequency linear prediction performing unit 740, and outputs the reconstructed audio data corresponding to the low frequency band.

제2 역 MDCT 적용부(770)는 대역폭 확장 복호화부(750)에서 복호화된 고주파수 밴드 신호를 역 MDCT에 의해 주파수 도메인에서 시간 도메인으로 역변환한다.The second inverse MDCT applying unit 770 inversely converts the high frequency band signal decoded by the bandwidth extension decoding unit 750 into a time domain in the frequency domain by the inverse MDCT.

밴드 합성부(780)는 제1 역 MDCT 적용부(760)에서 시간 도메인으로 변환된 저주파수 밴드 신호와 제2 역 MDCT 적용부(770)에서 시간 도메인으로 변환된 고주파수 밴드 신호를 합성한다.The band combining unit 780 combines the low frequency band signal converted into the time domain in the first inverse MDCT applying unit 760 and the high frequency band signal converted into the time domain in the second inverse MDCT applying unit 770.

스테레오 복호화부(790)는 역다중화부(700)로부터 입력받은 부호화된 스테레오 파라미터를 복호화하고, 복호화된 스테레오 파라미터를 이용하여 밴드 합성부(780)에서 합성된 신호를 업믹싱(up-mixing)하여 출력단자 OUT을 통해 출력한다. 여기서, 업믹싱은 다운믹싱에 상반되는 개념으로, 모노 신호로부터 두 채널 이상의 스테레오 신호를 생성하는 것이다.The stereo decoding unit 790 decodes the encoded stereo parameters received from the demultiplexing unit 700 and up-mixes the signals synthesized by the band synthesizing unit 780 using the decoded stereo parameters And output through the output terminal OUT. Here, upmixing is a concept contrary to downmixing, in which a stereo signal of two or more channels is generated from a mono signal.

도 8은 본 발명의 다른 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다. 8 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 8을 참조하면, 오디오 데이터 복호화 장치는 역다중화부(800), 문맥-기반 비트플레인 복호화부(810), 역양자화부(820), 멀티-레졸루션 합성부(830), 역 주파수 선형 예측 수행부(840), 역 MDCT 적용부(850), 변환부(855), 대역폭 확장 복호 화부(860), 역변환부(870), 밴드 합성부(880) 및 스테레오 복호화부(890)를 포함한다.8, an audio data decoding apparatus includes a demultiplexer 800, a context-based bit plane decoder 810, an inverse quantizer 820, a multi-resolution synthesizer 830, an inverse frequency linear prediction An inverse transform unit 840, an inverse MDCT applying unit 850, a transforming unit 855, a bandwidth extension decoding unit 860, an inverse transforming unit 870, a band combining unit 880 and a stereo decoding unit 890.

역다중화부(800)는 부호화단으로부터 출력된 비트 스트림(bit stream)을 입력받아 역다중화한다. 역다중화부(800)는 데이터 레벨의 각 부분을 각 단위에 대응하는 데이터 부분으로 분리하며, 해당 단위에 그와 관련된 비트스트림의 정보를 분석하여 출력한다. 여기서, 역다중화부(800)가 출력하는 정보는 오디오 스펙트럼 스트림의 설명 분석, 양자화 값과 기타 복원 정보, 양자화 스펙트럼의 복원 정보, 문맥-기반 비트플레인 복호화의 정보, 신호 타입 정보, 주파수 선형 예측 및 벡터 양자화의 정보, 부호화된 대역폭 확장 정보 및 부호화된 스테레오 파라미터 등이 있다.The demultiplexer 800 receives the bit stream output from the encoder and demultiplexes the bit stream. The demultiplexer 800 separates each portion of the data level into data portions corresponding to the respective units, and analyzes and outputs the bitstream information related to the unit. Here, the information output by the demultiplexing unit 800 may include a description analysis of the audio spectrum stream, quantization value and other reconstruction information, quantization spectrum reconstruction information, context-based bit plane decoding information, signal type information, frequency linear prediction, Vector quantization information, encoded bandwidth extension information, and encoded stereo parameters.

문맥-기반 비트플레인 복호화부(810)는 부호화된 비트플레인을 문맥을 기반으로 복호화한다. 여기서, 문맥-기반 비트플레인 복호화부(810)는 역다중화부(800)로부터 출력된 정보를 입력받아 허프먼 복호화를 진행하여 주파수 스펙트럼, 코딩 밴드 모드 정보, 및 스케일 팩터를 복원한다. 구체적으로, 문맥-기반 비트플레인 복호화부(810)는 프레주디스 코딩 밴드 모드(Prejudice coding band mode) 정보, 프레주디스 코딩(Prejudice coding)의 스케일 팩터, 및 프레주디스 코딩(Prejudice coding)의 주파수 스펙트럼을 입력받아 코딩 밴드 모드 수치, 스케일 팩터의 복호화 코스메틱(decoding cosmetic) 표시, 주파수 스펙트럼의 양자화 값을 출력한다.The context-based bit-plane decoding unit 810 decodes the coded bit-plane based on the context. Here, the context-based bit-plane decoding unit 810 receives information output from the demultiplexing unit 800 and performs Huffman decoding to recover a frequency spectrum, coding band mode information, and a scale factor. Specifically, the context-based bit-plane decoding unit 810 decodes the frequency spectrum of the Prejudice coding band mode information, the scale factor of the Prejudice coding, and the Prejudice coding And outputs a coding band mode value, a decoding cosmetic expression of a scale factor, and a quantization value of the frequency spectrum.

역양자화부(820)는 문맥-기반 비트플레인 복호화부(710)에서 출력된 결과를 역양자화한다.The inverse quantization unit 820 dequantizes the result output from the context-based bit plane decoding unit 710.

멀티-레졸루션 합성부(830)는 역양자화부(820)의 출력을 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수에 대하여 멀티-레졸루션을 처리한다. 구체적으로, 멀티-레졸루션 합성부(830)는 오디오 신호가 부호화단에서 멀티-레졸루션으로 분석된 경우에 역양자화부(820)의 출력을 멀티-레졸루션으로 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 멀티-레졸루션 합성부(830)는 역 양자화 스펙트럼/차 스펙트럼(reserve quantization spectrum/difference spectrum)을 입력받아 복원 스펙트럼/차 스펙트럼(reconstruction spectrum/difference spectrum)을 출력한다.The multi-resolution synthesis unit 830 receives the output of the inverse quantization unit 820 and processes the multi-resolution with respect to the audio spectral coefficients of the instantaneous varying signals. More specifically, when the audio signal is analyzed in multi-resolution at the encoding end, the multi-resolution synthesizer 830 may synthesize the output of the dequantizer 820 in multi-resolution to improve decoding efficiency. Here, the multi-resolution synthesizer 830 receives the inverse quantization spectrum / difference spectrum and outputs a reconstruction spectrum / difference spectrum.

역 주파수 선형 예측 수행부(840)는 멀티-레졸루션 합성부(830)의 출력과 역다중화부(800)로부터 부호화단에서 주파수 선형 예측을 수행한 결과를 합성하고 역-벡터 양자화를 수행한다. 구체적으로, 역 주파수 선형 예측 수행부(840)는 오디오 신호가 부호화단에서 주파수 선형 예측이 수행된 경우에 상기 주파수 선형 예측이 수행된 결과를 역양자화부(820)의 출력 또는 멀티-레졸루션 합성부(830)의 출력과 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 역 주파수 선형 예측 수행부(840)은 주파수 도메인 예측 기술과 예측 계수의 벡터 양자화 기술을 채용하여 코딩 효율을 유효하게 제고하였다. 역 주파수 선형 예측 수행부(840)은 차 스펙트럼(difference spectrum) 계수, 벡터의 인덱스를 입력받아 MDCT 스펙트럼 계수를 출력한다.The inverse frequency linear prediction performing unit 840 combines the output of the multi-resolution synthesizing unit 830 and the result of performing the frequency linear prediction at the encoding end from the demultiplexing unit 800 and performs inverse-vector quantization. In more detail, the inverse linear prediction unit 840 performs the inverse quantization on the output of the inverse quantization unit 820 or the output of the inverse quantization unit 820 in the case where the audio signal is frequency- (830) to improve decoding efficiency. Herein, the inverse frequency linear prediction performing unit 840 effectively utilizes the frequency domain prediction technique and the vector quantization technique of the prediction coefficients to effectively improve the coding efficiency. The inverse frequency linear prediction performing unit 840 receives the difference spectrum coefficient and index of the vector and outputs the MDCT spectral coefficient.

역 MDCT 적용부(850)는 역 주파수 선형예측 수행부(840)에서 출력된 저주파수 밴드 신호를 역 MDCT에 의해 주파수 도메인에서 시간 도메인으로 역변환한다. 여기서, 역 MDCT 적용부(850)는 역 주파수 선형 예측 수행부(840)에서 역양자화한 결과로 얻은 주파수 스펙트럼 계수를 입력받아 저주파수 밴드에 해당하는 복원된 오디오 데이터로 출력한다.The inverse MDCT applying unit 850 inversely converts the low frequency band signal output from the inverse frequency linear prediction performing unit 840 into a time domain in the frequency domain by the inverse MDCT. Here, the inverse MDCT applying unit 850 receives the frequency spectrum coefficient obtained as a result of inverse quantization by the inverse frequency linear prediction performing unit 840, and outputs the reconstructed audio data corresponding to the low frequency band.

변환부(855)는 역 MDCT 적용부(850)에서 시간 도메인으로 변환된 저주파수 밴드 신호를 MDCT 이외의 변환 기법을 이용하여 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다. 변환부(855)에서 이용하는 변환 기법의 예로 MDST, FFT, 및 QMF 등이 있다. 변환부(855)에서 MDCT를 적용하여 실시할 수 없는 것은 아니지만 MDCT를 사용할 경우에는 도 7에 도시된 실시예가 보다 효율적이다.The transforming unit 855 transforms the low frequency band signal converted into the time domain in the inverse MDCT applying unit 850 into a frequency domain or a time / frequency domain from a time domain using a conversion scheme other than MDCT. Examples of the conversion technique used in the conversion unit 855 include MDST, FFT, and QMF. Although the MDCT can not be applied in the converting unit 855, the embodiment shown in FIG. 7 is more efficient when MDCT is used.

대역폭 확장 복호화부(860)는 역다중화부(800)에서 출력된 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 변환부(855)에서 주파수 영역 또는 시간/주파수 영역으로 변환된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성한다. 여기서, 대역폭 확장 복호화부(860)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 복호화된 대역폭 확장 정보를 저주파수 밴드 신호에 적용함으로써 고주파수 밴드 신호를 생성한다. 여기서, 대역폭 확장 정보는 고주파수 밴드의 고주파수 밴드의 특성을 나타낼 수 있는 정보로써 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 정보이다.The bandwidth extension decoding unit 860 decodes the encoded bandwidth extension information output from the demultiplexing unit 800 and converts the encoded bandwidth extension information into a frequency domain or a time / frequency domain in the converting unit 855 using the decoded bandwidth extension information. And generates a high frequency band signal from the low frequency band signal. Here, the bandwidth extension decoding unit 860 generates the high frequency band signal by applying the decoded bandwidth extension information to the low frequency band signal based on the characteristic that there is a high correlation between the high frequency and the low frequency band of the audio signal. Here, the bandwidth extension information is information such as an energy level or an envelope for the high frequency band signal, which is information capable of exhibiting characteristics of a high frequency band of a high frequency band.

역변환부(870)는 대역폭 확장 복호화부(860)에서 생성된 고주파수 밴드 신호를 MDCT 이외의 변환 기법을 이용하여 주파수 도메인 또는 시간/주파수 도메인에서 시간 도메인으로 역변환한다. 여기서, 역변환부(870)는 변환부(855)에서 이용하는 동일한 변환 기법을 이용한다. 역변환부(870)에서 이용하는 변환 기법의 예로 MDST, FFT, 및 QMF 등이 있다.The inverse transform unit 870 inversely transforms the high frequency band signal generated by the bandwidth extension decoding unit 860 into a time domain in a frequency domain or a time / frequency domain using a conversion scheme other than MDCT. Here, the inverse transform unit 870 uses the same transform technique used in the transform unit 855. Examples of the conversion technique used in the inverse transform unit 870 include MDST, FFT, and QMF.

밴드 합성부(880)는 역 MDCT 적용부(850)에서 시간 도메인으로 변환된 저주파수 밴드 신호와 역변환부(870)에서 시간 도메인으로 변환된 고주파수 밴드 신호를 합성한다.The band synthesizing unit 880 synthesizes the low frequency band signal converted into the time domain in the inverse MDCT applying unit 850 and the high frequency band signal converted into the time domain in the inverse transforming unit 870.

스테레오 복호화부(890)는 역다중화부(800)에서 출력된 부호화된 스테레오 파라미터를 복호화하고, 복호화된 스테레오 파라미터를 이용하여 밴드 합성부(880)에서 합성된 신호를 업믹싱하여 출력단자 OUT을 통해 출력한다. 여기서, 업믹싱은 다운믹싱에 상반되는 개념으로, 모노 신호로부터 두 채널 이상의 스테레오 신호를 생성하는 것이다.The stereo decoding unit 890 decodes the encoded stereo parameter output from the demultiplexing unit 800 and upmixes the signal synthesized by the band synthesizing unit 880 using the decoded stereo parameter and outputs the upmixed signal through the output terminal OUT Output. Here, upmixing is a concept contrary to downmixing, in which a stereo signal of two or more channels is generated from a mono signal.

도 9는 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다. 9 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 9를 참조하면, 오디오 데이터 복호화 장치는 역다중화부(900), 문맥-기반 비트플레인 복호화부(910), 역양자화부(920), 멀티-레졸루션 합성부(930), 역 주파수 선형 예측 수행부(940), 역 MDCT 적용부(950), 변환부(955), 대역폭 확장 복호화부(960), 역변환부(965), CELP 복호화부(970), 밴드 합성부(980) 및 스테레오 복호화부(990)를 포함한다.9, the audio data decoding apparatus includes a demultiplexer 900, a context-based bit plane decoder 910, an inverse quantizer 920, a multi-resolution synthesizer 930, an inverse frequency linear prediction A CELP decoding unit 970, a band combining unit 980, and a stereo decoding unit 940. The inverse MDCT applying unit 950, the transforming unit 955, the bandwidth extension decoding unit 960, the inverse transforming unit 965, the CELP decoding unit 970, (990).

역다중화부(900)는 부호화단으로부터 출력된 비트 스트림을 입력받아 역다중화한다. 구체적으로, 역다중화부(900)는 데이터 레벨의 각 부분을 각 단위에 대응하는 데이터 부분으로 분리하며, 해당 단위에 그와 관련된 비트스트림의 정보를 분 석하여 출력한다. 여기서, 역다중화부(900)가 출력하는 정보는 오디오 스펙트럼 스트림의 설명 분석, 양자화 값과 기타 복원 정보, 양자화 스펙트럼의 복원 정보, 문맥-기반 비트플레인 복호화의 정보, 신호 타입 정보, 주파수 선형 예측 및 벡터 양자화의 정보, 부호화된 대역폭 확장 정보, CELP 부호화 정보 및 부호화된 스테레오 파라미터 등이 있다.The demultiplexer 900 receives the bitstream output from the encoder and demultiplexes the bitstream. Specifically, the demultiplexer 900 demultiplexes each portion of the data level into data portions corresponding to the respective units, and analyzes the information of the bitstreams related to the respective units, and outputs the information. Here, the information output by the demultiplexing unit 900 may include a descriptive analysis of the audio spectrum stream, quantization and other reconstruction information, quantization spectrum reconstruction information, context-based bit plane decoding information, signal type information, frequency linear prediction, Vector quantization information, encoded bandwidth extension information, CELP encoding information, and encoded stereo parameters.

문맥-기반 비트플레인 복호화부(910)는 역다중화부(900)에서 역다중화된 결과가 주파수 도메인에서 부호화된 경우, 부호화된 비트플레인을 문맥을 기반으로 복호화한다. 여기서, 문맥-기반 비트플레인 복호화부(910)는 역다중화부(900)로부터 출력된 정보를 입력받아 허프먼 복호화를 진행하여 주파수 스펙트럼, 코딩 밴드 모드 정보, 및 스케일 팩터를 복원한다. 구체적으로, 문맥-기반 비트플레인 복호화부(910)는 프레주디스(Prejudice coding band mode) 정보, 프레주디스 코딩(Prejudice coding)의 스케일 팩터, 및 프레주드시 코딩(Prejudice coding)의 주파수 스펙트럼을 입력받아 코딩 밴드 모드 수치, 스케일 팩터의 복호화 코스메틱(decoding cosmetic) 표시, 주파수 스펙트럼의 양자화 값을 출력한다.If the result of demultiplexing in the demultiplexer 900 is encoded in the frequency domain, the context-based bit-plane decoding unit 910 decodes the encoded bit-plane based on the context. Here, the context-based bit-plane decoding unit 910 receives the information output from the demultiplexer 900 and performs Huffman decoding to recover the frequency spectrum, the coding band mode information, and the scale factor. Specifically, the context-based bit-plane decoding unit 910 receives a frequency spectrum of prejudice coding band mode information, a scale factor of prejudice coding, and a prejudice coding A coding band mode value, a decoding cosmetic expression of a scale factor, and a quantization value of a frequency spectrum.

역양자화부(920)는 문맥-기반 비트플레인 복호화부(910)에서 출력된 결과를 역양자화한다.The inverse quantization unit 920 inversely quantizes the result output from the context-based bit plane decoding unit 910.

멀티-레졸루션 합성부(930)는 역양자화부(920)에서 출력된 결과를 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수의 멀티-레졸루션을 처리한다. 구체적으로, 멀티-레졸루션 합성부(930)는 오디오 신호가 부호화단에서 멀티-레졸루션으로 분석된 경우에 역양자화부(920)의 출력을 멀티-레졸루션으로 합성하여 복호 화의 효율을 향상시킬 수 있다. 여기서, 멀티-레졸루션 합성부(930)는 역 양자화 스펙트럼/차 스펙트럼(reserve quantization spectrum/difference spectrum)을 입력받아 복원 스펙트럼/차 스펙트럼(reconstruction spectrum/difference spectrum)을 출력한다.The multi-resolution synthesis unit 930 receives the result output from the inverse quantization unit 920 and processes the multi-resolution of the audio spectral coefficients of the instantaneous varying signal. More specifically, when the audio signal is analyzed in multi-resolution at the encoding end, the multi-resolution synthesizer 930 may synthesize the output of the dequantizer 920 in multi-resolution to improve the decoding efficiency . Here, the multi-resolution synthesizer 930 receives the inverse quantization spectrum / difference spectrum and outputs a reconstruction spectrum / difference spectrum.

역 주파수 선형 예측 수행부(940)는 멀티-레졸루션 합성부(930)의 출력과 역다중화부(900)로부터 입력받은 부호화단에서 주파수 선형 예측을 수행한 결과를 합성하고 역-벡터 양자화를 수행한다. 구체적으로, 역 주파수 선형 예측 수행부(940)는 오디오 신호가 부호화단에서 주파수 선형 예측이 수행된 경우에 상기 주파수 선형 예측이 수행된 결과를 역양자화부(920)의 출력 또는 멀티-레졸루션 합성부(930)의 출력과 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 역 주파수 선형 예측 수행부(940)은 주파수 도메인 예측 기술과 예측 계수의 벡터 양자화 기술을 채용하여 코딩 효율을 유효하게 제고하였다. 역 주파수 선형 예측 수행부(940)은 차 스펙트럼(difference spectrum) 계수, 벡터의 인덱스를 입력받아 MDCT 스펙트럼 계수를 출력한다.The inverse frequency linear prediction performing unit 940 combines the output of the multi-resolution synthesizing unit 930 and the result of the frequency linear prediction performed at the encoding end input from the demultiplexing unit 900 and performs inverse-vector quantization . In more detail, the inverse linear prediction unit 940 performs the inverse quantization (IF) on the output of the inverse quantization unit 920 or the output of the inverse quantization unit 920 in the case where the audio signal is frequency- (930) to improve decoding efficiency. Here, the inverse frequency linear prediction performing unit 940 effectively employs the frequency domain prediction technique and the vector quantization technique of the prediction coefficients to effectively improve the coding efficiency. The inverse frequency linear prediction performing unit 940 receives the difference spectrum coefficient and index of the vector and outputs the MDCT spectrum coefficient.

역 MDCT 적용부(950)는 역 주파수 선형 예측 수행부(940)에서 출력된 저주파수 밴드 신호를 역 MDCT에 의해 주파수 도메인에서 시간 도메인으로 역변환한다. 여기서, 역 MDCT 적용부(950)는 역 주파수 선형 예측 수행부(740)에서 역양자화한 결과로 얻은 주파수 스펙트럼 계수를 입력받아 저주파수 밴드에 해당하는 복원된 오디오 데이터로 출력한다.The inverse MDCT applying unit 950 inversely transforms the low frequency band signal output from the inverse frequency linear prediction performing unit 940 from the frequency domain to the time domain by the inverse MDCT. Here, the inverse MDCT applying unit 950 receives the frequency spectrum coefficient obtained as a result of inverse quantization by the inverse frequency linear prediction performing unit 740, and outputs it as reconstructed audio data corresponding to the low frequency band.

변환부(955)는 역 MDCT 적용부(950)에서 시간 도메인으로 변환된 저주파수 밴드 신호를 MDCT 이외의 변환 기법을 이용하여 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다. 변환부(955)에서 이용하는 변환 기법의 예로 MDST, FFT, 및 QMF 등이 있다. 변환부(955)에서 MDCT를 적용하여 실시할 수 없는 것은 아니지만 MDCT를 사용할 경우에는 도 7에 도시된 실시예가 보다 효율적이다.The transforming unit 955 transforms the low frequency band signal converted into the time domain from the time domain into the frequency domain or the time / frequency domain using a conversion scheme other than MDCT in the inverse MDCT applying unit 950. Examples of the conversion technique used in the conversion unit 955 include MDST, FFT, and QMF. Although the MDCT can not be applied in the converting unit 955, the embodiment shown in Fig. 7 is more efficient when MDCT is used.

대역폭 확장 복호화부(960)는 역다중화부(900)에서 출력된 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 변환부(955)에서 주파수 영역 또는 시간/주파수 영역으로 변환된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성한다. 여기서, 대역폭 확장 복호화부(960)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 복호화된 대역폭 확장 정보를 저주파수 밴드 신호에 적용함으로써 고주파수 밴드 신호를 생성한다. 여기서, 대역폭 확장 정보는 고주파수 밴드의 고주파수 밴드의 특성을 나타낼 수 있는 정보로써 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 정보이다.The bandwidth extension decoding unit 960 decodes the encoded bandwidth extension information output from the demultiplexing unit 900 and converts the encoded bandwidth extension information output from the demultiplexing unit 900 into a frequency domain or a time / And generates a high frequency band signal from the low frequency band signal. Here, the bandwidth extension decoding unit 960 generates the high frequency band signal by applying the decoded bandwidth extension information to the low frequency band signal based on the characteristic that there is a high correlation between high and low frequency bands of the audio signal. Here, the bandwidth extension information is information such as an energy level or an envelope for the high frequency band signal, which is information capable of exhibiting characteristics of a high frequency band of a high frequency band.

역변환부(965)는 대역폭 확장 복호화부(960)에서 복호화된 고주파수 밴드 신호를 MDCT 이외의 변환 기법을 이용하여 주파수 도메인 또는 시간-주파수 도메인에서 시간 도메인으로 역변환한다. 여기서, 역변환부(965)는 변환부(955)에서 이용하는 동일한 변환 기법을 이용한다. 역변환부(965)에서 이용하는 변환 기법의 예로 MDST, FFT, 및 QMF 등이 있다.The inverse transform unit 965 inversely transforms the high frequency band signal decoded by the bandwidth enhancement decoding unit 960 into a time domain in a frequency domain or a time-frequency domain using a conversion scheme other than MDCT. Here, the inverse transform unit 965 uses the same transform technique used in the transform unit 955. Examples of the conversion technique used in the inverse transform unit 965 include MDST, FFT, and QMF.

CELP 복호화부(970)는 역다중화부(900)에서 역다중화된 결과가 시간 도메인에서 부호화된 경우, CELP 부호화 정보를 수신하여 저주파수 밴드 신호를 CELP 복 호화 방식에 의해 복호화한다. 여기서, CELP 부호화 정보는 시간 도메인에서 CELP 방식에 의해 부호화된 결과로서, 고정 코드북의 인덱스 및 게인, 적응 코드북의 지연 및 게인, 및 선형 예측 필터의 계수 등을 포함한다. 구체적으로, CELP 복호화 방식은 고정 코드북의 인덱스 및 게인, 적응 코드북의 지연 및 게인을 이용하여 신호를 복원하고, 선형 예측 필터의 계수를 이용하여 복원된 신호를 합성하여 CELP 부호화 방식에 의해 부호화된 신호를 복호화한다. When the demultiplexed result in the demultiplexer 900 is coded in the time domain, the CELP decoding unit 970 receives the CELP encoded information and decodes the low frequency band signal by the CELP decoding method. Here, the CELP encoded information is a result of coding in the time domain by the CELP method, and includes the index and gain of the fixed codebook, the delay and gain of the adaptive codebook, and coefficients of the linear prediction filter. Specifically, the CELP decoding method restores a signal using the index and gain of the fixed codebook, the delay and gain of the adaptive codebook, synthesizes the recovered signal using the coefficients of the linear prediction filter, and outputs the signal encoded by the CELP coding method .

밴드 합성부(980)는 역 MDCT 적용부(950)에서 출력된 저주파수 밴드 신호, 역변환부(965)에서 역변환된 고주파수 밴드 신호 및 CELP 복호화부(970)에서 복호화된 신호를 합성한다.The band synthesizing unit 980 synthesizes the low frequency band signal outputted from the inverse MDCT applying unit 950, the high frequency band signal inverted by the inverse transforming unit 965, and the signal decoded by the CELP decoding unit 970.

스테레오 복호화부(990)는 역다중화부(900)에서 출력된 부호화된 스테레오 파라미터를 복호화하고, 복호화된 스테레오 파라미터를 이용하여 밴드 합성부(980)에서 합성된 신호를 업믹싱하여 출력단자 OUT을 통해 출력한다. 여기서, 업믹싱은 다운믹싱에 상반되는 개념으로, 모노 신호로부터 두 채널 이상의 스테레오 신호를 생성하는 것이다.The stereo decoding unit 990 decodes the encoded stereo parameters output from the demultiplexing unit 900 and upmixes the signals synthesized by the band synthesizing unit 980 using the decoded stereo parameters, Output. Here, upmixing is a concept contrary to downmixing, in which a stereo signal of two or more channels is generated from a mono signal.

도 10은 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다.10 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 10을 참조하면, 오디오 신호의 복호화 장치는 역다중화부(1000), 문맥-기반 비트플레인 복호화부(1010), 역양자화부(1020), 멀티-레졸루션 합성부(1030), 역 주파수 선형 예측 수행부(1040), 제1 역 MDCT 적용부(1050), CELP 복호화부(1060), MDCT 적용부(1065), 대역폭 확장 복호화부(1070), 제2 역 MDCT 적용 부(1075), 밴드 합성부(1080) 및 스테레오 복호화부(1090)를 포함한다.10, an apparatus for decoding an audio signal includes a demultiplexer 1000, a context-based bit plane decoder 1010, an inverse quantizer 1020, a multi-resolution synthesizer 1030, an inverse frequency linear prediction The first reverse MDCT applying unit 1050, the CELP decoding unit 1060, the MDCT applying unit 1065, the bandwidth extension decoding unit 1070, the second inverse MDCT applying unit 1075, (1080) and a stereo decoding unit (1090).

역다중화부(1000)는 부호화단으로부터 출력된 비트 스트림을 입력받아 역다중화한다. 구체적으로, 역다중화부(1000)는 데이터 레벨의 각 부분을 각 단위에 대응하는 데이터 부분으로 분리하며, 해당 단위에 그와 관련된 비트스트림의 정보를 분석하여 출력한다. 여기서, 역다중화부(1000)가 출력하는 정보는 오디오 스펙트럼 스트림의 설명 분석, 양자화 값과 기타 복원 정보, 양자화 스펙트럼의 복원 정보, 문맥-기반 비트플레인 복호화의 정보, 신호 타입 정보, 주파수 선형 예측 및 벡터 양자화의 정보, 부호화된 대역폭 확장 정보, CELP 부호화 정보 및 부호화된 스테레오 파라미터 등이 있다.The demultiplexer 1000 receives the bitstream output from the encoder and demultiplexes the bitstream. Specifically, the demultiplexer 1000 separates each portion of the data level into data portions corresponding to the respective units, and analyzes and outputs the bitstream information related to the unit. Here, the information output by the demultiplexing unit 1000 may include description analysis of the audio spectrum stream, quantization value and other reconstruction information, reconstruction information of the quantization spectrum, information of context-based bit plane decoding, signal type information, Vector quantization information, encoded bandwidth extension information, CELP encoding information, and encoded stereo parameters.

문맥-기반 비트플레인 복호화부(1010)는 역다중화부(1000)에서 역다중화된 결과가 주파수 도메인에서 부호화된 경우, 부호화된 비트플레인을 문맥을 기반으로 복호화한다. 여기서, 문맥-기반 비트플레인 복호화부(1010)는 역다중화부(1000)로부터 출력된 정보를 입력받아 허프먼 복호화를 진행하여 주파수 스펙트럼, 코딩 밴드 모드 정보 및 스케일 팩터를 복원한다. 구체적으로, 문맥-기반 비트플레인 복호화부(1010)는 프레주디스 코딩 밴드 모드(Prejudice coding band mode) 정보, 프레주디스 코딩(Prejudice coding)의 스케일 팩터, 및 프레주디스 코딩(Prejudice coding)의 주파수 스펙트럼을 입력받아 코딩 밴드 모드 수치, 스케일 팩터의 복호화 코스메틱(decoding cosmetic) 표시, 주파수 스펙트럼의 양자화 값을 출력한다.The context-based bit-plane decoding unit 1010 decodes the encoded bit-plane based on the context when the result of demultiplexing in the demultiplexing unit 1000 is encoded in the frequency domain. The context-based bit-plane decoding unit 1010 receives the information output from the demultiplexing unit 1000 and performs Huffman decoding to recover the frequency spectrum, the coding band mode information, and the scale factor. Specifically, the context-based bit-plane decoding unit 1010 decodes a frequency spectrum of prejudice coding band mode information, a scale factor of prejudice coding, and a prejudice coding And outputs a coding band mode value, a decoding cosmetic expression of a scale factor, and a quantization value of the frequency spectrum.

역양자화부(1020)는 문맥-기반 비트플레인 복호화부(1010)에서 출력된 결과를 역양자화한다.The inverse quantization unit 1020 dequantizes the result output from the context-based bit plane decoding unit 1010. [

멀티-레졸루션 합성부(1030)는 역양자화부(1020)의 출력을 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수의 멀티-레졸루션을 처리한다. 구체적으로, 멀티-레졸루션 합성부(1030)는 오디오 신호가 부호화단에서 멀티-레졸루션으로 분석된 경우에 역양자화부(1020)의 출력을 멀티-레졸루션으로 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 멀티-레졸루션 합성부(1030)는 역 양자화 스펙트럼/차 스펙트럼(reserve quantization spectrum/difference spectrum)을 입력받아 복원 스펙트럼/차 스펙트럼(reconstruction spectrum/difference spectrum)을 출력한다.The multi-resolution synthesis unit 1030 receives the output of the inverse quantization unit 1020 and processes the multi-resolution of the audio spectral coefficients of the instantaneous varying signal. More specifically, when the audio signal is analyzed in multi-resolution at the encoding end, the multi-resolution synthesis unit 1030 may synthesize the output of the inverse quantization unit 1020 in a multi-resolution manner to improve decoding efficiency. Here, the multi-resolution synthesizer 1030 receives the inverse quantization spectrum / difference spectrum and outputs a reconstruction spectrum / difference spectrum.

역 주파수 선형 예측 수행부(1040)은 멀티-레졸루션 합성부(1030)의 출력과 역다중화부(1000)로부터 입력받은 부호화단에서 주파수 선형 예측을 수행한 결과를 합성한다. 구체적으로, 역 주파수 선형 예측 수행부(1040)는 오디오 신호가 부호화단에서 주파수 선형 예측이 수행된 경우에 상기 주파수 선형 예측이 수행된 결과를 역양자화부(1020)의 출력 또는 멀티-레졸루션 합성부(1030)의 출력과 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 역 주파수 선형 예측 수행부(1040)은 주파수 도메인 예측 기술과 예측 계수의 벡터 양자화 기술을 채용하여 코딩 효율을 유효하게 제고하였다. 역 주파수 선형 예측 수행부(1040)은 차 스펙트럼(difference spectrum) 계수, 벡터의 인덱스를 입력받아 MDCT 스펙트럼 계수를 출력한다.The inverse frequency linear prediction unit 1040 synthesizes the output of the multi-resolution synthesis unit 1030 and the result of performing the frequency linear prediction at the encoding end input from the demultiplexing unit 1000. [ In more detail, the inverse linear prediction unit 1040 performs the inverse quantization (IF) on the output of the inverse quantization unit 1020 or the output of the inverse quantization unit 1020 in the case where the audio signal is frequency- (1030) to improve decoding efficiency. Here, the inverse frequency linear prediction performing unit 1040 effectively employs the frequency domain prediction technique and the vector quantization technique of the prediction coefficients to effectively improve the coding efficiency. The inverse frequency linear prediction performing unit 1040 receives the difference spectrum coefficient and index of the vector and outputs the MDCT spectrum coefficient.

제1 역 MDCT 적용부(1050)는 역 주파수 선형 예측 수행부(1040)에서 출력된 신호를 역 MDCT에 의해 주파수 도메인에서 시간 도메인으로 변환한다. 여기서, 제1 역 MDCT 적용부(1050)는 역 주파수 선형 예측 수행부(1040)에서 역양자화한 결과로 얻은 주파수 스펙트럼 계수를 입력받아 저주파수 밴드에 해당하는 복원된 오디오 데이터로 출력한다.The first inverse MDCT applying unit 1050 transforms the signal output from the inverse frequency linear prediction performing unit 1040 from the frequency domain to the time domain by the inverse MDCT. Here, the first inverse MDCT applying unit 1050 receives the frequency spectrum coefficient obtained as a result of inverse quantization by the inverse frequency linear prediction performing unit 1040, and outputs the reconstructed audio data corresponding to the low frequency band.

CELP 복호화부(1060)는 역다중화부(1000)에서 역다중화된 결과가 시간 도메인에서 부호화된 경우, CELP 부호화 정보를 수신하여 저주파수 밴드 신호를 CELP 복호화 방식에 의해 복호화한다. 여기서, CELP 부호화 정보는 시간 도메인에서 CELP 방식에 의해 부호화된 결과이다.When the demultiplexed result in the demultiplexer 1000 is coded in the time domain, the CELP decoding unit 1060 receives the CELP encoded information and decodes the low frequency band signal by the CELP decoding method. Here, the CELP encoded information is the result of being encoded by the CELP method in the time domain.

MDCT 적용부(1065)는 역다중화부(1000)에서 역다중화된 결과가 시간 도메인에서 부호화된 경우, CELP 복호화부(1060)에서 복호화된 신호에 대하여 MDCT를 수행하여 시간 도메인에서 주파수 도메인으로 변환한다. 또한, MDCT 적용부(1065)는 역다중화부(1000)에서 역다중화된 결과가 주파수 도메인에서 부호화된 경우, 별도로 MDCT를 수행하지 않고, 역 주파수 선형 예측 수행부(1040)의 출력을 MDCT 적용부(1065)의 출력으로 대체한다.When the demultiplexed result in the demultiplexing unit 1000 is coded in the time domain, the MDCT applying unit 1065 performs MDCT on the signal decoded by the CELP decoding unit 1060 to convert the decoded signal from the time domain to the frequency domain . When the demultiplexed result in the demultiplexing unit 1000 is encoded in the frequency domain, the MDCT applying unit 1065 performs the MDCT on the MDCT applying unit 1065 and outputs the output of the inverse frequency linear prediction performing unit 1040 to the MDCT applying unit 1065. [ Gt; 1065 < / RTI >

대역폭확장 복호화부(1070)는 역다중화부(1000)에서 출력된 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 MDCT 적용부(1065)에서 출력된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성한다. 여기서, 대역폭확장 복호화부(1070)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 복호화된 대역폭 확장 정보를 저주파수 밴드 신호에 적용함으로써 고주파수 밴드 신호를 생성한다. 여기서, 대역폭 확장 정보는 고주파수 밴드의 고주파수 밴드의 특성을 나타낼 수 있는 정보로써 고 주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 정보이다.The bandwidth extension decoding unit 1070 decodes the encoded bandwidth extension information output from the demultiplexing unit 1000 and extracts a high frequency band signal from the low frequency band signal output from the MDCT applying unit 1065 using the decoded bandwidth extension information, . Here, the bandwidth extension decoding unit 1070 generates the high frequency band signal by applying the decoded bandwidth extension information to the low frequency band signal based on the characteristic that a high correlation exists between the high frequency and the low frequency band of the audio signal. Here, the bandwidth extension information is information such as an energy level or an envelope for the high frequency band signal, which can indicate characteristics of a high frequency band of a high frequency band.

제2 역 MDCT 적용부(1075)는 대역폭 확장 복호화부(1070)에서 복호화된 고주파수 밴드 신호를 역 MDCT에 의해 주파수 도메인에서 시간 도메인으로 역변환한다.The second inverse MDCT applying unit 1075 inversely converts the high frequency band signal decoded by the bandwidth extension decoding unit 1070 from the frequency domain to the time domain by the inverse MDCT.

밴드 합성부(1080)는 제1 역 MDCT 적용부(1050)에서 시간 도메인으로 변환된 저주파수 밴드 신호와 제2 역 MDCT 적용부(1075)에서 시간 도메인으로 변환된 고주파수 밴드 신호를 합성한다.The band combining unit 1080 combines the low frequency band signal converted into the time domain in the first inverse MDCT applying unit 1050 and the high frequency band signal converted into the time domain in the second inverse MDCT applying unit 1075.

스테레오 복호화부(1090)는 역다중화부(1000)에서 출력된 부호화된 스테레오 파라미터를 복호화하고, 복호화된 스테레오 파라미터를 이용하여 밴드 합성부(1080)에서 합성된 신호를 업믹싱하여 출력단자 OUT을 통해 출력한다. 여기서, 업믹싱은 다운믹싱에 상반되는 개념으로, 모노 신호로부터 두 채널 이상의 스테레오 신호를 생성하는 것이다.The stereo decoding unit 1090 decodes the encoded stereo parameter output from the demultiplexing unit 1000 and upmixes the signal synthesized by the band synthesizing unit 1080 using the decoded stereo parameter and outputs the resultant signal through the output terminal OUT Output. Here, upmixing is a concept contrary to downmixing, in which a stereo signal of two or more channels is generated from a mono signal.

도 11은 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다.11 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 11을 참조하면, 오디오 데이터 복호화 장치는 역다중화부(1100), 문맥-기반 비트플레인 복호화부(1110), 역양자화부(1120), 멀티-레졸루션 합성부(1130), 역 주파수 선형 예측 수행부(1140), CELP 복호화부(1150), 역 FV-MLT 적용부(1160), 변환부(1165), 대역폭 확장 복호화부(1170), 스테레오 복호화부(1180) 및 역변환부(1190)를 포함한다.11, an audio data decoding apparatus includes a demultiplexer 1100, a context-based bit plane decoder 1110, an inverse quantizer 1120, a multi-resolution synthesizer 1130, an inverse frequency linear prediction A CELP decoding unit 1150, an inverse FV-MLT applying unit 1160, a transforming unit 1165, a bandwidth extension decoding unit 1170, a stereo decoding unit 1180 and an inverse transforming unit 1190 do.

역다중화부(1100)는 부호화단으로부터 출력된 비트 스트림을 입력받아 역다중화한다. 역다중화부(1100)는 데이터 레벨의 각 부분을 각 단위에 대응하는 데이 터 부분으로 분리하며, 해당 단위에 그와 관련된 비트스트림의 정보를 분석하여 출력한다. 여기서, 역다중화부(1100)가 출력하는 정보는 오디오 스펙트럼 스트림의 설명 분석, 양자화 값과 기타 복원 정보, 양자화 스펙트럼의 복원 정보, 문맥-기반 비트플레인 복호화의 정보, 신호 타입 정보(Signal type information), 주파수 선형 예측 및 벡터 양자화의 정보, CELP 부호화 정보, 부호화된 대역폭 확장 정보 및 부호화된 스테레오 파라미터 등이 있다.The demultiplexer 1100 receives the bitstream output from the encoder and demultiplexes the bitstream. The demultiplexing unit 1100 demultiplexes each portion of the data level into data portions corresponding to the respective units, and analyzes and outputs the bitstream information related to the unit. Here, the information output by the demultiplexing unit 1100 may include description analysis of the audio spectrum stream, quantization value and other reconstruction information, quantization spectrum reconstruction information, context-based bit plane decoding information, signal type information, , Frequency linear prediction and vector quantization information, CELP coding information, coded bandwidth extension information, and coded stereo parameters.

문맥-기반 비트플레인 복호화부(1110)는 역다중화부(1000)에서 역다중화된 결과가 주파수 도메인에서 부호화된 경우, 부호화된 비트플레인을 문맥을 기반으로 복호화한다. 여기서, 문맥-기반 비트플레인 복호화부(1110)는 역다중화부(1100)로부터 출력된 정보를 입력받아 허프먼 복호화를 진행하여 주파수 스펙트럼, 코딩 밴드 모드 정보, 및 스케일 팩터를 복원한다. 구체적으로, 문맥-기반 비트플레인 복호화부(1110)는 프레주디스 코딩 밴드 모드(Prejudice coding band mode) 정보, 프레주디스 코딩(Prejudice coding)의 스케일 팩터, 및 프레주디스 코딩(Prejudice coding)의 주파수 스펙트럼을 입력받아 코딩 밴드 모드 수치, 스케일 팩터의 복호화 코스메틱(decoding cosmetic) 표시, 주파수 스펙트럼의 양자화 값을 출력한다.If the demultiplexed result in the demultiplexer 1000 is encoded in the frequency domain, the context-based bit plane decoder 1110 decodes the encoded bit plane based on the context. Here, context-based bit-plane decoding unit 1110 receives information output from demultiplexing unit 1100 and performs Huffman decoding to recover frequency spectrum, coding band mode information, and scale factor. Specifically, the context-based bit-plane decoding unit 1110 decodes a frequency spectrum of prejudice coding band mode information, a scale factor of prejudice coding, and a prejudice coding And outputs a coding band mode value, a decoding cosmetic expression of a scale factor, and a quantization value of the frequency spectrum.

역양자화부(1120)는 문맥-기반 비트플레인 복호화부(1110)에서 출력된 결과를 역양자화한다.The inverse quantization unit 1120 dequantizes the result output from the context-based bit plane decoding unit 1110.

멀티-레졸루션 합성부(1130)는 역양자화부(1120)의 출력을 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수에 대하여 멀티-레졸루션을 처리한다. 구체적으로, 멀티-레졸루션 합성부(1130)는 오디오 신호가 부호화단에서 멀티-레졸루 션으로 분석된 경우에 역양자화부(1120)의 출력을 멀티-레졸루션으로 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 멀티-레졸루션 합성부(1130)는 역 양자화 스펙트럼/차 스펙트럼(reserve quantization spectrum/difference spectrum)을 입력받아 복원 스펙트럼/차 스펙트럼(reconstruction spectrum/difference spectrum)을 출력한다.The multi-resolution synthesis unit 1130 receives the output of the inverse quantization unit 1120 and processes the multi-resolution with respect to the audio spectral coefficients of the instantaneous varying signals. More specifically, when the audio signal is analyzed as multi-resolution at the encoding end, the multi-resolution synthesis unit 1130 synthesizes the output of the dequantization unit 1120 in a multi-resolution manner to improve decoding efficiency have. Here, the multi-resolution synthesizer 1130 receives the inverse quantization spectrum / difference spectrum and outputs a reconstruction spectrum / difference spectrum.

역 주파수 선형 예측 수행부(1140)는 멀티-레졸루션 합성부(1130)의 출력과 역다중화부(1100)로부터 부호화단에서 주파수 선형 예측을 수행한 결과를 합성하고 역-벡터 양자화를 수행한다. 구체적으로, 역 주파수 선형 예측 수행부(1140)는 오디오 신호가 부호화단에서 주파수 선형 예측이 수행된 경우에 상기 주파수 선형 예측이 수행된 결과를 역양자화부(1120)의 출력 또는 멀티-레졸루션 합성부(1130)의 출력과 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 역 주파수 선형 예측 수행부(1140)은 주파수 도메인 예측 기술과 예측 계수의 벡터 양자화 기술을 채용하여 코딩 효율을 유효하게 제고하였다. 역 주파수 선형 예측 수행부(1140)은 차 스펙트럼(difference spectrum) 계수, 벡터의 인덱스를 입력받아 MDCT 스펙트럼 계수를 출력한다.The inverse frequency linear prediction performing unit 1140 combines the output of the multi-resolution combining unit 1130 and the result of performing frequency linear prediction at the encoding end from the demultiplexing unit 1100, and performs inverse-vector quantization. In more detail, the inverse linear prediction unit 1140 may output the result of performing the frequency linear prediction when the audio signal is frequency-linearly predicted at the encoding end, or may output the result of the inverse quantization unit 1120 or the multi- (1130) to improve decoding efficiency. Here, the inverse frequency linear prediction performing unit 1140 effectively employs the frequency domain prediction technique and the vector quantization technique of the prediction coefficients to effectively improve the coding efficiency. The inverse frequency linear prediction unit 1140 receives the difference spectrum coefficient and index of the vector and outputs the MDCT spectrum coefficient.

CELP 복호화부(1150)는 역다중화부(1000)에서 역다중화된 결과가 시간 도메인에서 부호화된 경우, CELP 부호화 정보를 복호화한다. 여기서, CELP 부호화 정보는 시간 도메인에서 CELP 방식에 의해 부호화된 결과이다.The CELP decoding unit 1150 decodes the CELP encoded information when the demultiplexed result in the demultiplexing unit 1000 is encoded in the time domain. Here, the CELP encoded information is the result of being encoded by the CELP method in the time domain.

역 FV-MLT 적용부(1160)는 역 주파수 선형 예측 수행부(1140)에서 출력된 신호에 대하여 역 FV-MLT를 수행하여 주파수 도메인에서 시간 도메인으로 변환하고, 상기 시간 도메인으로 변환된 신호와 CELP 복호화부(1150)에서 출력된 신호를 합성하여 시간 도메인으로 변환된 신호를 출력한다.The inverse FV-MLT applying unit 1160 performs inverse FV-MLT on the signal output from the inverse frequency linear prediction performing unit 1140 and converts the signal into the time domain in the frequency domain, And outputs the signal converted into the time domain by combining the signals output from the decoding unit 1150.

변환부(1165)는 역 FV-MLT 적용부(1160)에서 시간 도메인으로 변환된 신호를 MDCT 이외의 변환 기법을 이용하여 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다. 변환부(1165)에서 이용하는 변환 기법의 예로 MDST, FFT, 및 QMF 등이 있다. 변환부(1165)에서 MDCT를 적용하여 실시할 수 없는 것은 아니지만 MDCT를 사용할 경우에는 도 10에 도시된 실시예가 보다 효율적이다.The transforming unit 1165 transforms the signal converted into the time domain in the inverse FV-MLT applying unit 1160 into a frequency domain or a time / frequency domain from a time domain using a conversion scheme other than MDCT. Examples of the conversion technique used in the conversion unit 1165 include MDST, FFT, and QMF. Although the MDCT can not be applied in the converting unit 1165, the embodiment shown in FIG. 10 is more efficient when MDCT is used.

대역폭 확장 복호화부(1170)는 역다중화부(1100)에서 출력된 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 변환부(1165)에서 주파수 영역 또는 시간/주파수 영역으로 변환된 신호로부터 전 대역의 신호를 생성한다. 여기서, 대역폭 확장 복호화부(1170)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 복호화된 대역폭 확장 정보를 변환부(1165)에서 출력된 신호에 적용함으로써 전 대역의 신호를 생성한다. 여기서, 대역폭 확장 정보는 고주파수 밴드의 고주파수 밴드의 특성을 나타낼 수 있는 정보로써 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 정보이다.The bandwidth extension decoding unit 1170 decodes the encoded bandwidth extension information output from the demultiplexing unit 1100 and converts the bandwidth extension information output from the demultiplexing unit 1100 into a frequency domain or a time / Band signal from the signal. Here, the bandwidth extension decoding unit 1170 applies the decoded bandwidth extension information to the signal output from the conversion unit 1165 based on the characteristic that a high correlation exists between high and low frequency bands of the audio signal, . Here, the bandwidth extension information is information such as an energy level or an envelope for the high frequency band signal, which is information capable of exhibiting characteristics of a high frequency band of a high frequency band.

스테레오 복호화부(1180)는 역다중화부(1100)에서 출력된 부호화된 스테레오 파라미터를 복호화하고, 복호화된 스테레오 파라미터를 이용하여 대역폭 확장 복호화부(1170)에서 출력된 신호를 업믹싱한다. 여기서, 업믹싱은 다운믹싱에 상반되는 개념으로, 모노 신호로부터 두 채널 이상의 스테레오 신호를 생성하는 것이다.The stereo decoding unit 1180 decodes the encoded stereo parameter output from the demultiplexing unit 1100 and upmixes the signal output from the bandwidth extension decoding unit 1170 using the decoded stereo parameter. Here, upmixing is a concept contrary to downmixing, in which a stereo signal of two or more channels is generated from a mono signal.

역변환부(1190)는 스테레오 복호화부(1180)에서 업믹싱된 신호를 MDCT 이외의 변환 기법을 이용하여 주파수 도메인 또는 시간/주파수 도메인에서 시간 도메인으로 역변환하여 출력 단자 OUT을 통해 출력한다. 여기서, 역변환부(1190)는 변환부(1165)에서 이용하는 동일한 변환 기법을 이용한다. 역변환부(1190)에서 이용하는 변환 기법의 예로 MDST, FFT, 및 QMF 등이 있다.The inverse transform unit 1190 inverse transforms the upmixed signal from the stereo decoding unit 1180 into a time domain in a frequency domain or a time / frequency domain using a conversion scheme other than MDCT, and outputs the inverse signal through an output terminal OUT. Here, the inverse transform unit 1190 uses the same transform technique used in the transform unit 1165. Examples of the conversion technique used in the inverse transform unit 1190 include MDST, FFT, and QMF.

도 12는 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다. 12 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 12를 참조하면, 오디오 데이터 복호화 장치는 역다중화부(1200), 문맥-기반 비트플레인 복호화부(1210), 역양자화부(1220), 멀티-레졸루션 합성부(1230), 역 주파수 선형 예측 수행부(1240), CELP 복호화부(1250), MDCT 적용부(1260), 대역폭 확장 복호화부(1270), 스테레오 복호화부(1280) 및 역 FV-MLT 적용부(1290)를 포함한다.12, an audio data decoding apparatus includes a demultiplexer 1200, a context-based bit plane decoder 1210, an inverse quantizer 1220, a multi-resolution synthesizer 1230, an inverse frequency linear prediction A CELP decoding unit 1250, an MDCT applying unit 1260, a bandwidth extension decoding unit 1270, a stereo decoding unit 1280, and an inverse FV-MLT applying unit 1290.

역다중화부(1200)는 부호화단으로부터 출력된 비트 스트림을 입력받아 역다중화한다. 역다중화부(1200)는 데이터 레벨의 각 부분을 각 단위에 대응하는 데이터 부분으로 분리하며, 해당 단위에 그와 관련된 비트스트림의 정보를 분석하여 출력한다. 여기서, 역다중화부(1200)가 출력하는 정보는 오디오 스펙트럼 스트림의 설명 분석, 양자화 값과 기타 복원 정보, 양자화 스펙트럼의 복원 정보, 문맥-기반 비트플레인 복호화의 정보, 신호 타입 정보, 주파수 선형 예측 및 벡터 양자화의 정보, CELP 부호화 정보, 부호화된 대역폭 확장 정보 및 부호화된 스테레오 파라미터 등이 있다.The demultiplexer 1200 receives the bitstream output from the encoder and demultiplexes the bitstream. The demultiplexer 1200 demultiplexes each portion of the data level into data portions corresponding to the respective units, and analyzes and outputs the bitstream information related to the unit. Here, the information output by the demultiplexing unit 1200 may include a descriptive analysis of the audio spectrum stream, quantization and other reconstruction information, reconstruction information of the quantization spectrum, information of context-based bit plane decoding, signal type information, Vector quantization information, CELP encoding information, encoded bandwidth extension information, and encoded stereo parameters.

문맥-기반 비트플레인 복호화부(1210)는 역다중화부(1000)에서 역다중화된 결과가 주파수 도메인에서 부호화된 경우, 부호화된 비트플레인을 문맥을 기반으로 복호화한다. 여기서, 문맥-기반 비트플레인 복호화부(1210)는 역다중화부(1200)로부터 출력된 정보를 입력받아 허프먼 복호화를 진행하여 주파수 스펙트럼, 코딩 밴드 모드 정보, 및 스케일 팩터를 복원한다. 문맥-기반 비트플레인 복호화부(1210)는 프레주디스 코딩 밴드 모드(Prejudice coding band mode) 정보, 프레주디스 코딩(Prejudice coding)의 스케일 팩터, 및 프레주디스 코딩(Prejudice coding)의 주파수 스펙트럼을 입력받아 코딩 밴드 모드 수치, 스케일 팩터의 복호화 코스메틱(decoding cosmetic) 표시, 주파수 스펙트럼의 양자화 값을 출력한다.If the result of demultiplexing in the demultiplexer 1000 is encoded in the frequency domain, the context-based bit plane decoder 1210 decodes the encoded bit plane based on the context. The context-based bit-plane decoding unit 1210 receives the information output from the demultiplexing unit 1200 and performs Huffman decoding to recover a frequency spectrum, coding band mode information, and a scale factor. The context-based bit-plane decoding unit 1210 receives the pre-judged coding band mode information, the scale factor of the pre-judged coding, and the frequency spectrum of the pre-judged coding, A band mode value, a decoding cosmetic expression of a scale factor, and a quantization value of a frequency spectrum.

역양자화부(1220)는 문맥-기반 비트플레인 복호화부(1210)에서 출력된 결과를 역양자화한다.The inverse quantization unit 1220 dequantizes the result output from the context-based bit plane decoding unit 1210.

멀티-레졸루션 합성부(1230)는 역양자화부(1220)의 출력을 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수의 멀티-레졸루션을 처리한다. 구체적으로, 멀티-레졸루션 합성부(1230)는 오디오 신호가 부호화단에서 멀티-레졸루션으로 분석된 경우에 역양자화부(1220)의 출력을 멀티-레졸루션으로 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 멀티-레졸루션 합성부(1230)는 역 양자화 스펙트럼/차 스펙트럼(reserve quantization spectrum/difference spectrum)을 입력받아 복원 스펙트럼/차 스펙트럼(reconstruction spectrum/difference spectrum)을 출력한다.The multi-resolution synthesis unit 1230 receives the output of the inverse quantization unit 1220 and processes the multi-resolution of the audio spectral coefficients of the instantaneous varying signal. In detail, when the audio signal is analyzed in multi-resolution at the encoding end, the multi-resolution synthesizer 1230 may synthesize the output of the dequantizer 1220 in multi-resolution to improve decoding efficiency. Here, the multi-resolution synthesizer 1230 receives the inverse quantization spectrum / difference spectrum and outputs a reconstruction spectrum / difference spectrum.

역 주파수 선형 예측 수행부(1240)는 멀티-레졸루션 합성부(1230)의 출력과 역다중화부(1200)로부터 입력받은 부호화단에서 주파수 선형 예측을 수행한 결과를 합성하고 역-벡터 양자화를 수행한다. 구체적으로, 역 주파수 선형 예측 수행부(1240)는 오디오 신호가 부호화단에서 주파수 선형 예측이 수행된 경우에 상기 주파수 선형 예측이 수행된 결과를 역양자화부(1220)의 출력 또는 멀티-레졸루션 합성부(1230)의 출력과 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 역 주파수 선형 예측 수행부(1240)은 주파수 도메인 예측 기술과 예측 계수의 벡터 양자화 기술을 채용하여 코딩 효율을 유효하게 제고하였다. 역 주파수 선형 예측 수행부(1240)은 차 스펙트럼(difference spectrum) 계수, 벡터의 인덱스(index)를 입력받아 MDCT 스펙트럼 계수를 출력한다.The inverse frequency linear prediction performing unit 1240 combines the output of the multi-resolution synthesizing unit 1230 and the result of the frequency linear prediction performed at the encoding end input from the demultiplexing unit 1200, and performs inverse-vector quantization . In more detail, the inverse linear prediction unit 1240 performs the inverse quantization (IF) on the output of the inverse quantization unit 1220 or the output of the inverse quantization unit 1220 in the case where the audio signal is frequency- (1230) to improve decoding efficiency. Here, the inverse frequency linear prediction performing unit 1240 effectively utilizes the frequency domain prediction technique and the vector quantization technique of the prediction coefficients to effectively improve the coding efficiency. The inverse frequency linear prediction performing unit 1240 receives the difference spectrum coefficient and index of the vector and outputs the MDCT spectrum coefficient.

CELP 복호화부(1250)는 역다중화부(1200)에서 역다중화된 결과가 시간 도메인에서 부호화된 경우, CELP 부호화 정보를 복호화한다. 여기서, CELP 부호화 정보는 시간 도메인에서 CELP 방식에 의해 부호화된 결과이다.The CELP decoding unit 1250 decodes the CELP encoded information when the demultiplexed result in the demultiplexing unit 1200 is coded in the time domain. Here, the CELP encoded information is the result of being encoded by the CELP method in the time domain.

MDCT 적용부(1260)는 CELP 복호화부(1250)에서 출력된 신호에 대하여 MDCT를 수행하여 시간 도메인에서 주파수 도메인으로 변환한다.The MDCT application unit 1260 performs MDCT on the signal output from the CELP decoding unit 1250 to convert the time domain to the frequency domain.

대역폭 확장 복호화부(1270)는 역다중화부(1200)에서 출력된 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 역 주파수 선형 예측 수행부(1240)에서 출력된 신호 또는 MDCT 적용부(1260)에서 주파수 도메인으로 변환된 신호로부터 전 대역의 신호를 생성한다. 구체적으로, 역다중화부(1200)에서 출력된 결과가 주파수 도메인에서 부호화된 경우, 대역폭 확장 복호화부(1270)는 복호화된 대역폭 확장 정보를 역 주파수 선형 예측 수행부(1240)에서 출력된 신호에 적용함으로써 전 대역의 신호를 생성한다. 또한, 역다중화부(1200)에서 출력된 결과가 시간 도메인에서 부호화된 경우, 대역폭 확장 복호화부(1270)는 복호화된 대역폭 확장 정보를 MDCT 적용부(1260)에서 주파수 도메인으로 변환된 신호에 적용함으로써 전 대역의 신호를 생성한다. 여기서, 대역폭 확장 정보는 고주파수 밴드의 고주파수 밴드의 특성을 나타낼 수 있는 정보로써 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 정보이다.The bandwidth extension decoding unit 1270 decodes the encoded bandwidth extension information output from the demultiplexing unit 1200 and outputs the signal output from the inverse frequency linear prediction unit 1240 or the MDCT applied by the inverse frequency linear prediction unit 1240 using the decoded bandwidth extension information. And generates a signal of the entire band from the signal converted into the frequency domain in the frequency domain 1260. In detail, when the result output from the demultiplexing unit 1200 is encoded in the frequency domain, the bandwidth extension decoding unit 1270 applies the decoded bandwidth extension information to the signal output from the inverse frequency linear prediction unit 1240 Thereby generating a signal of the entire band. When the result output from the demultiplexing unit 1200 is coded in the time domain, the bandwidth extension decoding unit 1270 applies the decoded bandwidth extension information to the frequency domain-converted signal in the MDCT applying unit 1260 Thereby generating a signal of the entire band. Here, the bandwidth extension information is information such as an energy level or an envelope for the high frequency band signal, which is information capable of exhibiting characteristics of a high frequency band of a high frequency band.

스테레오 복호화부(1280)는 역다중화부(1200)에서 출력된 부호화된 스테레오 파라미터를 복호화하고, 복호화된 스테레오 파라미터를 이용하여 대역폭 확장 복호화부(1270)에서 출력된 신호를 업믹싱한다. 여기서, 업믹싱은 다운믹싱에 상반되는 개념으로, 모노 신호로부터 두 채널 이상의 스테레오 신호를 생성하는 것이다.The stereo decoding unit 1280 decodes the encoded stereo parameters output from the demultiplexing unit 1200 and upmixes the signals output from the bandwidth extension decoding unit 1270 using the decoded stereo parameters. Here, upmixing is a concept contrary to downmixing, in which a stereo signal of two or more channels is generated from a mono signal.

역 FV-MLT 적용부(1290)는 스테레오 복호화부(1280)에서 업믹싱된 신호에 대하여 역 FV-MLT를 수행하여 주파수 도메인에서 시간 도메인으로 변환하여 출력단자 OUT을 통해 출력한다.The inverse FV-MLT applying unit 1290 performs an inverse FV-MLT on the upmixed signal in the stereo decoding unit 1280, converts the upmixed signal into a time domain in the frequency domain, and outputs the transformed signal through the output terminal OUT.

도 13은 본 발명의 일 실시예에 따른 오디오 신호의 부호화 방법을 나타내는 흐름도이다. 13 is a flowchart illustrating a method of encoding an audio signal according to an embodiment of the present invention.

도 13을 참조하면, 본 실시예에 따른 오디오 신호의 부호화 방법은 도 1에 도시된 오디오 신호의 부호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 1에 도시된 오디오 신호의 부호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 부호화 방법에도 적용된다.Referring to FIG. 13, the method of encoding an audio signal according to the present embodiment is comprised of steps of time series processing in the apparatus for encoding an audio signal shown in FIG. Therefore, the contents described above with respect to the audio signal encoding apparatus shown in Fig. 1 are also applied to the audio signal encoding method according to this embodiment, even if omitted below.

1300 단계에서 스테레오 부호화부(100)는 입력 신호에서 스테레오 파라미터를 추출하여 부호화하고, 입력 신호를 다운믹싱한다.In operation 1300, the stereo encoding unit 100 extracts a stereo parameter from the input signal, encodes the stereo parameter, and downmixes the input signal.

1310 단계에서 밴드 분할부(110)는 다운믹싱된 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할한다.In step 1310, the band dividing unit 110 divides the downmixed signal into a high frequency band signal and a low frequency band signal.

1320 단계에서 변환부(120, 170)는 고주파수 밴드 신호 및 저주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인으로 변환한다. 구체적으로, 변환부는 고주파수 밴드 신호 및 저주파수 밴드 신호에 대하여 MDCT를 수행하여 각각 시간 도메인에서 주파수 도메인으로 변환할 수 있다.In step 1320, the transformers 120 and 170 convert the high frequency band signal and the low frequency band signal from the time domain to the frequency domain, respectively. Specifically, the converting unit may perform MDCT on the high-frequency band signal and the low-frequency band signal, respectively, and convert the time domain to the frequency domain.

1330 단계에서 저주파수 밴드 부호화부는 변환된 저주파수 밴드 신호를 양자화하야 문맥을 기반으로 비트플레인으로 부호화한다. 구체적으로, 저주파수 밴드 부호화부는 변환된 저주파수 밴드 신호에 대하여 주파수 선형 예측을 수행하여 필터링하는 주파수 선형 예측 수행부, 변환된 저주파수 밴드 신호 또는 필터링된 신호에 대하여 멀티-레졸루션으로 분석하는 멀티-레졸루션 분석부, 멀티-레졸루션으로 분석된 신호를 양자화하는 양자화부, 및 양자화된 신호를 문맥을 기반으로 하여 비트플레인으로 부호화하는 문맥-기반 비트플레인 부호화부를 포함할 수 있다. 여기서, 주파수 선형 예측 수행부는 변환된 저주파수 밴드 신호에 대하여 주파수 선형 예측을 수행하여 계산된 선형 예측 필터의 계수에 대응되는 값을 벡터 인덱스로 표현할 수 있다.In step 1330, the low-frequency band encoding unit quantizes the converted low-frequency band signal to encode it into a bit plane based on the context. Specifically, the low-frequency band encoding unit includes a frequency linear prediction unit for performing frequency-linear-prediction and filtering on the converted low-frequency band signal, a multi-resolution analyzing unit for performing multi-resolution analysis on the converted low- A quantization unit for quantizing the signal analyzed by the multi-resolution, and a context-based bit-plane encoding unit for encoding the quantized signal into a bit-plane based on a context. Here, the frequency linear prediction performing unit may represent a value corresponding to the coefficient of the linear prediction filter calculated by performing frequency linear prediction on the converted low frequency band signal, as a vector index.

1340 단계에서 대역폭 확장 부호화부(180)는 변환된 저주파수 밴드 신호를 이용하여 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성 하여 부호화한다.In step 1340, the bandwidth extension encoding unit 180 generates and encodes bandwidth extension information indicating the characteristics of the converted high frequency band signal using the converted low frequency band signal.

1350 단계에서 다중화부(190)는 부호화된 스테레오 파라미터, 부호화된 비트플레인 및 부호화된 대역폭 확장 정보를 다중화하여 입력 신호에 대한 부호화 결과로써 출력한다.In step 1350, the multiplexer 190 multiplexes the encoded stereo parameters, the encoded bit planes, and the encoded bandwidth extension information, and outputs the result as an encoding result for the input signal.

도 14는 본 발명의 다른 실시예에 따른 오디오 신호의 부호화 방법을 나타내는 흐름도이다.FIG. 14 is a flowchart illustrating a method of encoding an audio signal according to another embodiment of the present invention.

도 14를 참조하면, 본 실시예에 따른 오디오 신호의 부호화 방법은 도 2에 도시된 오디오 신호의 부호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 2에 도시된 오디오 신호의 부호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 부호화 방법에도 적용된다.Referring to FIG. 14, a method of encoding an audio signal according to an embodiment of the present invention includes steps of time series processing in the apparatus for encoding an audio signal shown in FIG. Therefore, the contents described above with respect to the audio signal encoding apparatus shown in Fig. 2 are applied to the audio signal encoding method according to the present embodiment, even if omitted below.

1400 단계에서 스테레오 부호화부(200)는 입력 신호에서 스테레오 파라미터를 추출하여 부호화하고, 입력 신호를 다운믹싱한다.In operation 1400, the stereo encoding unit 200 extracts a stereo parameter from the input signal, encodes the stereo parameter, and downmixes the input signal.

1410 단계에서 밴드 분할부(210)는 다운믹싱된 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할한다.In operation 1410, the band division unit 210 divides the downmixed signal into a high frequency band signal and a low frequency band signal.

1420 단계에서 MDCT 적용부(220)는 저주파수 밴드 신호에 대하여 MDCT를 수행하여 시간 도메인에서 주파수 도메인으로 변환한다.In operation 1420, the MDCT application unit 220 performs MDCT on the low frequency band signal to convert the time domain to the frequency domain.

1430 단계에서 저주파수 밴드 부호화부는 MDCT가 수행된 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화한다.In step 1430, the low-frequency band coding unit quantizes the signal subjected to the MDCT and encodes the signal into a bit plane based on the context.

1440 단계에서 변환부(270, 275)는 고주파수 밴드 신호 및 저주파수 밴드 신 호를 각각 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다.In step 1440, the transformers 270 and 275 convert the high-frequency band signal and the low-frequency band signal into the frequency domain or the time / frequency domain, respectively, from the time domain.

1450 단계에서 대역폭 확장 부호화부(280)는 변환된 저주파수 밴드 신호를 이용하여 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화한다.In step 1450, the bandwidth extension encoding unit 280 generates and encodes bandwidth extension information indicating the characteristics of the converted high frequency band signal using the converted low frequency band signal.

1460 단계에서 다중화부(290)는 부호화된 스테레오 파라미터, 부호화된 비트플레인 및 부호화된 대역폭 확장 정보를 다중화하여 입력 신호에 대한 부호화 결과로써 출력한다.In step 1460, the multiplexer 290 multiplexes the encoded stereo parameter, the encoded bit plane, and the encoded bandwidth extension information, and outputs the multiplexed result as an encoding result for the input signal.

도 15는 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 방법을 나타내는 흐름도이다.15 is a flowchart illustrating a method of encoding an audio signal according to another embodiment of the present invention.

도 15를 참조하면, 본 실시예에 따른 오디오 신호의 부호화 방법은 도 3 또는 4에 도시된 오디오 신호의 부호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 3 또는 4에 도시된 오디오 신호의 부호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 부호화 방법에도 적용된다.Referring to FIG. 15, the method of encoding an audio signal according to the present embodiment is comprised of steps of time series processing in the apparatus for encoding an audio signal shown in FIG. 3 or 4. Therefore, the contents described above with respect to the audio signal encoding apparatus shown in FIG. 3 or 4 apply to the audio signal encoding method according to the present embodiment, even if omitted below.

1500 단계에서 스테레오 부호화부(300)는 입력 신호에서 스테레오 파라미터를 추출하여 부호화하고, 입력 신호를 다운믹싱한다.In operation 1500, the stereo encoding unit 300 extracts a stereo parameter from an input signal, and encodes the extracted stereo parameter to down-mix the input signal.

1510 단계에서 밴드 분할부(310)는 다운믹싱된 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할한다.In step 1510, the band dividing unit 310 divides the downmixed signal into a high frequency band signal and a low frequency band signal.

1520 단계에서 모드 결정부(320)는 저주파수 밴드 신호를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정한다.In step 1520, the mode determination unit 320 determines whether to encode the low frequency band signal in the time domain or the frequency domain.

1530 단계에서 CELP 부호화부(385)는 저주파수 밴드 신호를 시간 도메인에서 부호화하는 것으로 결정된 경우 저주파수 밴드 신호를 CELP 방식에 따라 부호화한다.If it is determined in step 1530 that the low frequency band signal is encoded in the time domain, the CELP encoding unit 385 codes the low frequency band signal according to the CELP method.

1540 단계에서 MDCT 적용부(325)는 저주파수 밴드 신호를 주파수 도메인에서 부호화하는 것으로 결정된 경우 저주파수 밴드 신호에 대하여 MDCT를 수행하여 저주파수 밴드 신호를 시간 도메인에서 주파수 도메인으로 변환하고, 저주파수 밴드 부호화부는 MDCT가 수행된 저주파수 밴드 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화한다.If it is determined in step 1540 that the low frequency band signal is encoded in the frequency domain, the MDCT application unit 325 converts the low frequency band signal from the time domain into the frequency domain by performing MDCT on the low frequency band signal, The low-frequency band signal is quantized and encoded into a bit-plane based on the context.

1550 단계에서 변환부(370, 375)는 저주파수 밴드 신호 및 고주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다. 여기서, 변환부는 저주파수 밴드 신호 및 상기 고주파수 밴드 신호에 대하여 각각 MDCT를 수행하여 시간 도메인에서 주파수 도메인으로 변환할 수 있다. 이 경우, 저주파수 밴드 신호를 주파수 도메인에서 부호화하는 것으로 결정된 경우 변환부는 저주파수 밴드 신호에 대한 출력을 MDCT 적용부에서 주파수 도메인으로 변환된 신호로 대체할 수 있다.In step 1550, the converting units 370 and 375 convert the low frequency band signal and the high frequency band signal into the frequency domain or the time / frequency domain, respectively, in the time domain. Here, the converting unit may perform MDCT on the low frequency band signal and the high frequency band signal, respectively, and convert the time domain to the frequency domain. In this case, when it is decided to encode the low frequency band signal in the frequency domain, the converting unit may replace the output for the low frequency band signal with the frequency domain converted signal in the MDCT applying unit.

1560 단계에서 대역폭 확장 부호화부(380)는 변환된 저주파수 밴드 신호를 이용하여 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화한다.In step 1560, the bandwidth extension encoding unit 380 generates and encodes the bandwidth extension information indicating the characteristics of the converted high frequency band signal using the converted low frequency band signal.

1570 단계에서 다중화부(390)는 부호화된 스테레오 파라미터, CELP 방식에 따라 부호화된 결과, 부호화된 비트플레인 및 부호화된 대역폭 확장 정보를 다중화 하여 입력 신호에 대한 부호화 결과로써 출력한다.In step 1570, the multiplexing unit 390 multiplexes the encoded stereo parameter, the encoded result according to the CELP scheme, the encoded bit plane, and the encoded bandwidth extension information, and outputs the multiplexed result as the encoding result for the input signal.

도 16은 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 방법을 나타내는 흐름도이다.16 is a flowchart illustrating a method of encoding an audio signal according to another embodiment of the present invention.

도 16을 참조하면, 본 실시예에 따른 오디오 신호의 부호화 방법은 도 5에 도시된 오디오 신호의 부호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 5에 도시된 오디오 신호의 부호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 부호화 방법에도 적용된다.Referring to FIG. 16, the method of encoding an audio signal according to the present embodiment is comprised of steps of time series processing in the apparatus for encoding an audio signal shown in FIG. Therefore, even if omitted below, the contents described above with respect to the audio signal encoding apparatus shown in Fig. 5 also apply to the audio signal encoding method according to the present embodiment.

1600 단계에서 변환부(500)는 입력 신호를 시간 도메인에서 주파수 도메인으로 변환한다.In operation 1600, the transform unit 500 transforms the input signal from the time domain to the frequency domain.

1610 단계에서 스테레오 부호화부(510)는 변환된 신호에서 스테레오 파라미터를 추출하여 부호화하고, 변환된 신호를 다운믹싱한다.In operation 1610, the stereo encoding unit 510 extracts a stereo parameter from the converted signal, encodes the stereo parameter, and downmixes the converted signal.

1620 단계에서 대역폭 확장 부호화부(580)는 다운믹싱된 신호에서 대역폭 확장 정보를 추출하여 부호화한다.In step 1620, the bandwidth extension encoding unit 580 extracts bandwidth extension information from the downmixed signal and encodes the bandwidth extension information.

1630 단계에서 역변환부(520)는 다운믹싱된 신호를 시간 도메인으로 역변환한다.In step 1630, the inverse transform unit 520 inversely transforms the downmixed signal into the time domain.

1640 단계에서 모드 결정부(530)는 역변환된 신호를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정하고, FV-MLT 적용부(535)는 결정 결과에 따라 역변환된 신호에 대하여 FV-MLT를 적용하여 서브 밴드 별로 시간 도메인 또는 주파수 도메인으로 변환한다.In step 1640, the mode determination unit 530 determines whether to encode the inverse-transformed signal in the time domain or the frequency domain, and the FV-MLT application unit 535 applies FV-MLT And transforms it into a time domain or a frequency domain for each subband.

1650 단계에서 CELP 부호화부(585)는 역변환된 신호를 시간 도메인에서 부호화하는 것으로 결정된 경우 시간 도메인으로 변환된 신호를 CELP 방식에 따라 부호화한다.In step 1650, if it is determined that the inverse transformed signal is to be encoded in the time domain, the CELP encoding unit 585 codes the transformed signal in the time domain according to the CELP method.

1660 단계에서 주파수 도메인 부호화부는 역변환된 신호를 주파수 도메인으로 부호화하는 것으로 결정된 경우 주파수 도메인으로 변환된 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화한다.If it is determined in step 1660 that the inverse transformed signal is to be encoded in the frequency domain, the frequency domain encoding unit quantizes the signal transformed into the frequency domain and encodes the signal into the bit plane based on the context.

1670 단계에서 다중화부(590)는 부호화된 스테레오 파라미터, 부호화된 대역폭 확장 정보, CELP 방식에 따라 부호화된 결과 및 부호화된 비트플레인을 다중화하여 입력 신호에 대한 부호화 결과로써 출력한다.In step 1670, the multiplexer 590 multiplexes the encoded stereo parameters, the encoded bandwidth extension information, the encoded result according to the CELP scheme, and the encoded bit planes, and outputs the multiplexed result as an encoding result for the input signal.

도 17은 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 방법을 나타내는 흐름도이다.17 is a flowchart illustrating a method of encoding an audio signal according to another embodiment of the present invention.

도 17을 참조하면, 본 실시예에 따른 오디오 신호의 부호화 방법은 도 6에 도시된 오디오 신호의 부호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 6에 도시된 오디오 신호의 부호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 부호화 방법에도 적용된다.Referring to FIG. 17, an audio signal encoding method according to the present embodiment is comprised of steps of time series processing in the audio signal encoding apparatus shown in FIG. Therefore, the contents described above with respect to the audio signal encoding apparatus shown in FIG. 6 are applied to the audio signal encoding method according to the present embodiment, even if omitted below.

1700 단계에서 모드 결정부(600)는 입력 신호를 시간 도메인에서 부호화할지 주파수 도메인에서 부호화할지 여부를 결정하고, FV-MLT 적용부(610)는 결정 결과에 따라 입력 신호에 대하여 FV-MLT를 적용하여 서브 밴드 별로 시간 도메인 또는 주파수 도메인으로 변환한다.In step 1700, the mode determination unit 600 determines whether to encode the input signal in the time domain or the frequency domain, and the FV-MLT application unit 610 applies the FV-MLT to the input signal according to the determination result And converts it into a time domain or a frequency domain for each subband.

1710 단계에서 스테레오 부호화부(620)는 변환된 신호에서 스테레오 파라미터를 추출하여 부호화하고, 변환된 신호를 다운믹싱한다.In step 1710, the stereo encoding unit 620 extracts and encodes stereo parameters from the converted signal, and down-mixes the converted signals.

1720 단계에서 대역폭 확장 부호화부(670)는 다운믹싱된 신호에서 대역폭 확장 정보를 추출하여 부호화한다.In step 1720, the bandwidth extension encoding unit 670 extracts the bandwidth extension information from the downmixed signal and encodes it.

1730 단계에서 CELP 부호화부(680)는 다운믹싱된 신호가 시간 도메인에서 부호화하는 것으로 결정된 경우 다운믹싱된 신호를 CELP 방식에 따라 부호화한다.If it is determined in step 1730 that the downmixed signal is to be encoded in the time domain, the CELP encoding unit 680 codes the downmixed signal according to the CELP scheme.

1740 단계에서 주파수 도메인 부호화부는 다운믹싱된 신호가 주파수 도메인에서 부호화하는 것으로 결정된 경우 다운믹싱된 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화한다.In step 1740, if it is determined that the downmixed signal is to be encoded in the frequency domain, the frequency domain encoding unit quantizes the downmixed signal and encodes the downmixed signal into a bit plane based on the context.

1750 단계에서 다중화부(690)는 부호화된 스테레오 파라미터, 부호화된 대역폭 확장 정보, CELP 방식에 따라 부호화된 결과 및 부호화된 비트플레인을 다중화하여 입력 신호에 대한 부호화 결과로써 출력한다.In step 1750, the multiplexer 690 multiplexes the encoded stereo parameter, the encoded bandwidth extension information, the encoded result according to the CELP scheme, and the encoded bit plane, and outputs the multiplexed result as the encoding result for the input signal.

도 18은 본 발명의 일 실시예에 따른 오디오 신호의 복호화 방법을 나타내는 흐름도이다.18 is a flowchart illustrating a method of decoding an audio signal according to an embodiment of the present invention.

도 18을 참조하면, 본 실시예에 따른 오디오 신호의 복호화 방법은 도 7에 도시된 오디오 신호의 복호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 7에 도시된 오디오 신호의 복호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 복호화 방법에도 적용된다.Referring to FIG. 18, the method for decoding an audio signal according to the present embodiment is comprised of steps that are performed in a time-series manner in the apparatus for decoding an audio signal shown in FIG. Therefore, the contents described above with respect to the audio signal decoding apparatus shown in FIG. 7 are also applied to the decoding method of the audio signal according to the present embodiment, even if omitted below.

1800 단계에서 역다중화부(700)는 오디오 신호의 부호화 결과를 입력받는다. 여기서, 부호화 결과는 저주파수 밴드 신호에 대하여 문맥을 기반으로 부호화된 비트플레인, 부호화된 대역폭 확장 정보 및 부호화된 스테레오 파라미터 등을 포함할 수 있다.In operation 1800, the demultiplexing unit 700 receives the encoding result of the audio signal. Here, the encoding result may include a bit plane encoded based on the context, a coded bandwidth extension information, and a coded stereo parameter for a low frequency band signal.

1810 단계에서 저주파수 밴드 복호화부는 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성한다. 구체적으로, 저주파수 밴드 복호화부는 부호화된 비트플레인을 문맥을 기반으로 복호화하는 문맥-기반 비트플레인 복호화부, 복호화된 신호를 역양자화하는 역양자화부, 역양자화된 신호를 멀티-레졸루션으로 합성하는 멀티-레졸루션 합성부, 및 벡터 인덱스를 이용하여 부호화단에서 주파수 선형 예측이 수행된 결과를 역양자화된 신호 또는 상기 합성된 신호에 합성하는 역 주파수 선형 예측 수행부를 포함할 수 있다.In step 1810, the low-frequency band decoding unit decodes the encoded bit-plane based on the context and generates a low-frequency band signal by inverse-quantizing the encoded bit-plane. Specifically, the low-frequency band decoding unit includes a context-based bit-plane decoding unit for decoding a coded bit-plane based on a context, an inverse quantization unit for dequantizing the decoded signal, a multi-resolution decoding unit for multi- And an inverse frequency linear prediction unit that synthesizes the result of performing the frequency linear prediction at the encoding end using the vector index, to the dequantized signal or the synthesized signal.

1820 단계에서 대역폭 확장 복호화부(750)는 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 생성된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성한다. In step 1820, the bandwidth extension decoding unit 750 decodes the encoded bandwidth extension information and generates a high frequency band signal from the generated low frequency band signal using the decoded bandwidth extension information.

1830 단계에서 역 MDCT 적용부(760)는 저주파수 밴드 신호 및 고주파수 밴드 신호 각각에 대하여 역 MDCT를 수행하여 주파수 도메인에서 시간 도메인으로 변환한다.In step 1830, the inverse MDCT applying unit 760 performs an inverse MDCT on each of the low frequency band signal and the high frequency band signal to convert the frequency domain to the time domain.

1840 단계에서 밴드 합성부(780)는 변환된 저주파수 밴드 신호 및 변환된 고주파수 밴드 신호를 합성한다.In step 1840, the band synthesizer 780 synthesizes the converted low frequency band signal and the converted high frequency band signal.

1850 단계에서 스테레오 복호화부(790)는 부호화된 스테레오 파라미터를 복호화하고, 복호화된 스테레오 파라미터를 이용하여 합성된 신호를 업믹싱한다.In step 1850, the stereo decoding unit 790 decodes the encoded stereo parameters and upmixes the synthesized signals using the decoded stereo parameters.

도 19는 본 발명의 다른 실시예에 따른 오디오 신호의 복호화 방법을 나타내는 흐름도이다.19 is a flowchart illustrating a method of decoding an audio signal according to another embodiment of the present invention.

도 19를 참조하면, 본 실시예에 따른 오디오 신호의 복호화 방법은 도 8에 도시된 오디오 신호의 복호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 8에 도시된 오디오 신호의 복호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 복호화 방법에도 적용된다.Referring to FIG. 19, the method for decoding an audio signal according to the present embodiment is comprised of steps of time series processing in the apparatus for decoding an audio signal shown in FIG. Therefore, even if omitted in the following description, the contents described above with respect to the audio signal decoding apparatus shown in FIG. 8 also apply to the method of decoding an audio signal according to the present embodiment.

1900 단계에서 역다중화부(800)는 오디오 신호의 부호화 결과를 입력받는다. 여기서, 부호화 결과는 저주파수 밴드 신호에 대하여 문맥을 기반으로 부호화된 비트플레인, 부호화된 대역폭 확장 정보 및 부호화된 스테레오 파라미터 등을 포함할 수 있다.In operation 1900, the demultiplexing unit 800 receives the encoding result of the audio signal. Here, the encoding result may include a bit plane encoded based on the context, a coded bandwidth extension information, and a coded stereo parameter for a low frequency band signal.

1910 단계에서 저주파수 밴드 복호화부는 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성한다.In operation 1910, the low frequency band decoding unit decodes the encoded bit plane based on the context, and dequantizes the low frequency band signal to generate a low frequency band signal.

1920 단계에서 역 MDCT 적용부(850)는 생성된 저주파수 밴드 신호에 대하여 역 MDCT를 수행하여 주파수 도메인에서 시간 도메인으로 변환한다.In step 1920, the inverse MDCT applying unit 850 performs inverse MDCT on the generated low frequency band signal to convert the frequency domain to the time domain.

1930 단계에서 변환부(855)는 역 MDCT가 수행된 저주파수 밴드 신호를 주파수 도메인 또는 시간/주파수 도메인으로 변환한다.In step 1930, the transforming unit 855 transforms the low frequency band signal in which the inverse MDCT has been performed into the frequency domain or the time / frequency domain.

1940 단계에서 대역폭 확장 복호화부(860)는 대역폭 확장 정보를 이용하여 주파수 도메인 또는 시간/주파수 도메인으로 변환된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성한다.In step 1940, the bandwidth extension decoding unit 860 generates a high frequency band signal from the low frequency band signal converted into the frequency domain or the time / frequency domain using the bandwidth extension information.

1950 단계에서 역변환부(870)는 생성된 고주파수 밴드 신호를 시간 도메인으로 역변환한다.In step 1950, the inverse transform unit 870 inversely transforms the generated high frequency band signal into the time domain.

1960 단계에서 밴드 합성부(880)는 역변환된 고주파수 밴드 신호 및 변환된 저주파수 밴드 신호를 합성한다.In step 1960, the band combining unit 880 combines the inversely transformed high frequency band signal and the converted low frequency band signal.

1970 단계에서 스테레오 복호화부(890)는 부호화된 스테레오 파라미터를 복호화하고, 복호화된 스테레오 파라미터를 이용하여 합성된 신호를 업믹싱한다.In step 1970, the stereo decoding unit 890 decodes the encoded stereo parameters and upmixes the synthesized signals using the decoded stereo parameters.

도 20은 본 발명의 다른 실시예에 따른 오디오 신호의 복호화 방법을 나타내는 흐름도이다.20 is a flowchart illustrating a method of decoding an audio signal according to another embodiment of the present invention.

도 20을 참조하면, 본 실시예에 따른 오디오 신호의 복호화 방법은 도 9 또는 10에 도시된 오디오 신호의 복호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 9 또는 10에 도시된 오디오 신호의 복호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 복호화 방법에도 적용된다.Referring to FIG. 20, a method for decoding an audio signal according to the present embodiment is comprised of steps of time series processing in the apparatus for decoding an audio signal shown in FIG. 9 or 10. FIG. Therefore, the contents described above with respect to the audio signal decoding apparatus shown in FIG. 9 or 10 are also applied to the decoding method of the audio signal according to the present embodiment, even if omitted below.

2000 단계에서 역다중화부(900)는 오디오 신호의 시간 도메인 또는 주파수 도메인에서의 부호화 결과를 입력받는다. 여기서, 부호화 결과는 저주파수 밴드 신호에 대하여 문맥을 기반으로 부호화된 비트플레인, 부호화된 대역폭 확장 정보, CELP 부호화 정보 및 부호화된 스테레오 파라미터 등을 포함할 수 있다.In operation 2000, the demultiplexer 900 receives the encoding result in the time domain or the frequency domain of the audio signal. Here, the encoding result may include a bit plane, a coded bandwidth extension information, a CELP encoding information, a coded stereo parameter, and the like, which are coded based on a context, for a low frequency band signal.

2010 단계에서 저주파수 밴드 복호화부는 저주파수 밴드 신호가 주파수 도메인에서 부호화된 경우, 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성한다. 여기서, 저주파수 밴드 복호화부는 부호화 된 비트플레인을 문맥을 기반으로 복호화하는 문맥-기반 비트플레인 복호화부, 복호화된 신호를 역양자화하는 역양자화부, 역양자화된 신호를 멀티-레졸루션으로 합성하는 멀티-레졸루션 합성부, 및 벡터 인덱스를 이용하여 부호화단에서 주파수 선형 예측이 수행된 결과를 역양자화된 신호 또는 합성된 신호에 합성하는 역 주파수 선형 예측 수행부를 포함할 수 있다. In step 2010, when the low frequency band signal is encoded in the frequency domain, the low frequency band decoding unit decodes the encoded bit plane based on the context and generates a low frequency band signal by inverse-quantizing the encoded bit plane. Here, the low-frequency band decoding unit includes a context-based bit-plane decoding unit for decoding a coded bit-plane based on a context, an inverse quantization unit for dequantizing the decoded signal, a multi-resolution decoding unit for multi- And an inverse frequency linear prediction unit that synthesizes the result of performing the frequency linear prediction at the encoding end using the vector index, to the dequantized signal or the synthesized signal.

2020 단계에서 역 MDCT 적용부(950)는 생성된 저주파수 밴드 신호에 대하여 역 MDCT를 수행하여 시간 도메인으로 역변환한다.In step 2020, the inverse MDCT applying unit 950 performs inverse MDCT on the generated low frequency band signal and performs inverse conversion to the time domain.

2030 단계에서 변환부(955)는 역 MDCT가 수행된 신호를 주파수 도메인 또는 시간/주파수 도메인으로 변환한다.In step 2030, the transforming unit 955 transforms the signal subjected to the inverse MDCT into a frequency domain or a time / frequency domain.

2040 단계에서 대역폭 확장 복호화부(960)는 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 주파수 도메인 또는 시간/주파수 도메인으로 변환된 신호로부터 고주파수 밴드 신호를 생성한다.In step 2040, the bandwidth extension decoding unit 960 decodes the encoded bandwidth extension information and generates a high frequency band signal from the frequency domain or time / frequency domain converted signal using the decoded bandwidth extension information.

2050 단계에서 역변환부(965)는 생성된 고주파수 밴드 신호를 시간 도메인으로 역변환한다.In step 2050, the inverse transform unit 965 inversely converts the generated high frequency band signal into the time domain.

2060 단계에서 CELP 복호화부(970)는 저주파수 밴드 신호가 시간 도메인에서 부호화된 경우, CELP 부호화 정보를 복호화하여 저주파수 밴드 신호를 생성한다.In step 2060, when the low frequency band signal is coded in the time domain, the CELP decoding unit 970 decodes the CELP encoded information to generate a low frequency band signal.

2070 단계에서 밴드 합성부(980)는 역 MDCT가 수행된 신호, 역변환된 고주파수 밴드 신호 및 CELP 방식으로 복호화된 저주파수 밴드 신호를 합성한다.In step 2070, the band combining unit 980 combines the signal subjected to the inverse MDCT, the inverse transformed high frequency band signal, and the low frequency band signal decoded by the CELP method.

2080 단계에서 스테레오 복호화부(990)는 부호화된 스테레오 파라미터를 복호화하고, 복호화된 스테레오 파라미터를 이용하여 합성된 신호를 업믹싱한다.In step 2080, the stereo decoding unit 990 decodes the encoded stereo parameters and upmixes the synthesized signals using the decoded stereo parameters.

도 21은 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 방법을 나타내는 흐름도이다.21 is a flowchart illustrating a method of decoding an audio signal according to another embodiment of the present invention.

도 21을 참조하면, 본 실시예에 따른 오디오 신호의 복호화 방법은 도 11에 도시된 오디오 신호의 복호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 11에 도시된 오디오 신호의 복호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 복호화 방법에도 적용된다.Referring to FIG. 21, the method for decoding an audio signal according to the present embodiment is comprised of steps of time series processing in the apparatus for decoding an audio signal shown in FIG. Therefore, the contents described above with respect to the audio signal decoding apparatus shown in FIG. 11 are applied to the decoding method of the audio signal according to the present embodiment, even if omitted below.

2100 단계에서 역다중화부(1100)는 오디오 신호의 주파수 도메인 또는 시간 도메인에서의 부호화 결과를 입력받는다. 여기서, 부호화 결과는 저주파수 밴드 신호에 대하여 문맥을 기반으로 부호화된 비트플레인, 부호화된 대역폭 확장 정보, CELP 부호화 정보 및 부호화된 스테레오 파라미터 등을 포함할 수 있다.In step 2100, the demultiplexing unit 1100 receives the encoding result in the frequency domain or the time domain of the audio signal. Here, the encoding result may include a bit plane, a coded bandwidth extension information, a CELP encoding information, a coded stereo parameter, and the like, which are coded based on a context, for a low frequency band signal.

2110 단계에서 주파수 도메인 복호화부는 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화한다. In step 2110, the frequency domain decoding unit decodes and dequantizes the encoded bit plane based on the context.

2120 단계에서 CELP 복호화부(1150)는 CELP 부호화 정보를 복호화한다.In step 2120, the CELP decoding unit 1150 decodes the CELP coding information.

2130 단계에서 역 FV-MLT 적용부(1160)는 주파수 도메인 복호화부 또는 CELP 복호화부에서 복호화된 신호에 대하여 역 FV-MLT를 수행하여 시간 도메인으로 변환한다.In step 2130, the inverse FV-MLT applying unit 1160 performs an inverse FV-MLT on the signal decoded by the frequency domain decoding unit or the CELP decoding unit, and transforms the decoded signal into the time domain.

2140 단계에서 변환부(1165)는 변환된 신호를 주파수 도메인 또는 시간/주파수 도메인으로 변환한다.In step 2140, the transforming unit 1165 transforms the transformed signal into a frequency domain or a time / frequency domain.

2150 단계에서 대역폭 확장 복호화부(1170)는 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 주파수 도메인 또는 시간/주파수 도메인으로 변환된 신호로부터 전 대역의 신호를 복호화한다.In step 2150, the bandwidth extension decoding unit 1170 decodes the encoded bandwidth extension information and decodes the entire band signal from the frequency-domain or time / frequency-domain-converted signal using the decoded bandwidth extension information.

2160 단계에서 스테레오 복호화부(1180)는 부호화된 스테레오 파라미터를 복호화하고, 복호화된 스테레오 파라미터를 이용하여 복호화된 전 대역의 신호를 업믹싱한다.In step 2160, the stereo decoding unit 1180 decodes the encoded stereo parameters and upmixes the decoded signals of all bands using the decoded stereo parameters.

2170 단계에서 역변환부(1190)는 업믹싱된 신호를 시간 도메인으로 역변환한다.In step 2170, the inverse transform unit 1190 inversely transforms the upmixed signal into the time domain.

도 22는 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 방법을 나타내는 흐름도이다.22 is a flowchart illustrating a method of decoding an audio signal according to another embodiment of the present invention.

도 22를 참조하면, 본 실시예에 따른 오디오 신호의 복호화 방법은 도 12에 도시된 오디오 신호의 복호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 12에 도시된 오디오 신호의 복호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 복호화 방법에도 적용된다.Referring to FIG. 22, a method of decoding an audio signal according to the present embodiment is comprised of steps of time series processing in the audio signal decoding apparatus shown in FIG. Therefore, the contents described above with respect to the audio signal decoding apparatus shown in FIG. 12 are also applied to a method of decoding an audio signal according to the present embodiment, even if omitted below.

2200 단계에서 역다중화부(1200)는 오디오 신호의 주파수 도메인 또는 시간 도메인에서의 부호화 결과를 수신한다. 여기서, 부호화 결과는 저주파수 밴드 신호에 대하여 문맥을 기반으로 부호화된 비트플레인, 부호화된 대역폭 확장 정보, CELP 부호화 정보 및 부호화된 스테레오 파라미터 등을 포함할 수 있다.In operation 2200, the demultiplexing unit 1200 receives the encoding result in the frequency domain or the time domain of the audio signal. Here, the encoding result may include a bit plane, a coded bandwidth extension information, a CELP encoding information, a coded stereo parameter, and the like, which are coded based on a context, for a low frequency band signal.

2210 단계에서 주파수 도메인 복호화부는 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화한다.In operation 2210, the frequency domain decoding unit decodes and dequantizes the encoded bit plane based on the context.

2220 단계에서 CELP 복호화부(1250)는 CELP 부호화 정보를 복호화한다.In step 2220, the CELP decoding unit 1250 decodes the CELP encoded information.

2230 단계에서 MDCT 적용부(1260)는 CELP 복호화부(1250)에서 출력된 신호에 대하여 MDCT를 수행하여 시간 도메인에서 주파수 도메인으로 변환한다.In step 2230, the MDCT application unit 1260 performs MDCT on the signal output from the CELP decoding unit 1250 to convert the time domain to the frequency domain.

2240 단계에서 대역폭 확장 복호화부(1270)는 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 주파수 도메인 복호화부 또는 MDCT 적용부(1260)에서 출력된 신호로부터 전 대역의 신호를 생성한다.In step 2240, the bandwidth extension decoding unit 1270 decodes the encoded bandwidth extension information and generates a signal of the entire band from the signal output from the frequency domain decoding unit or the MDCT applying unit 1260 using the decoded bandwidth extension information do.

2250 단계에서 스테레오 복호화부(1280)는 부호화된 스테레오 파라미터를 복호화하고, 복호화된 스테레오 파라미터를 이용하여 복호화된 전 대역의 신호를 업믹싱한다.In step 2250, the stereo decoding unit 1280 decodes the encoded stereo parameters and up-mixes the decoded signals of all bands using the decoded stereo parameters.

2260 단계에서 역 FV-MLT 적용부(1290)는 업믹싱된 신호에 대하여 역 FV-MLT를 적용하여 시간 도메인으로 변환한다.In step 2260, the inverse FV-MLT applying unit 1290 transforms the upmixed signal into the time domain by applying the inverse FV-MLT.

본 발명은 상술한 실시예에 한정되지 않으며, 본 발명의 사상 내에서 당업자에 의한 변형이 가능함은 물론이다.It is needless to say that the present invention is not limited to the above-described embodiments, and can be modified by those skilled in the art within the scope of the present invention.

본 발명은 또한 컴퓨터로 읽을 수 있는 기록매체에 컴퓨터가 읽을 수 있는 코드로서 구현하는 것이 가능하다. 컴퓨터가 읽을 수 있는 기록매체는 컴퓨터 시스템에 의하여 읽혀질 수 있는 데이터가 저장되는 모든 종류의 기록장치를 포함한다. 컴퓨터가 읽을 수 있는 기록매체의 예로는 ROM, RAM, CD-ROM, 자기 테이프, 하드디스크, 플로피디스크, 플래쉬 메모리, 광 데이터 저장장치 등이 있으며, 또한 캐리어 웨이브(예를 들어 인터넷을 통한 전송)의 형태로 구현되는 것도 포함한다. 또한 컴퓨터가 읽을 수 있는 기록매체는 네트워크로 연결된 컴퓨터 시스템에 분산되어, 분산방식으로 컴퓨터가 읽을 수 있는 코드로서 저장되고 실행될 수 있다.The present invention can also be embodied as computer-readable codes on a computer-readable recording medium. A computer-readable recording medium includes all kinds of recording apparatuses in which data that can be read by a computer system is stored. Examples of the computer-readable recording medium include ROM, RAM, CD-ROM, magnetic tape, hard disk, floppy disk, flash memory, optical data storage, And the like. The computer readable recording medium may also be distributed over a networked computer system and stored and executed as computer readable code in a distributed manner.

도 1은 본 발명의 일 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다. 1 is a block diagram illustrating an apparatus for encoding an audio signal according to an embodiment of the present invention.

도 2는 본 발명의 다른 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다.2 is a block diagram illustrating an apparatus for encoding an audio signal according to another embodiment of the present invention.

도 3은 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다.3 is a block diagram illustrating an apparatus for encoding an audio signal according to another embodiment of the present invention.

도 5는 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다.5 is a block diagram illustrating an apparatus for encoding an audio signal according to another embodiment of the present invention.

도 6은 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다.6 is a block diagram illustrating an apparatus for encoding an audio signal according to another embodiment of the present invention.

도 7은 본 발명의 일 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다.7 is a block diagram illustrating an apparatus for decoding an audio signal according to an embodiment of the present invention.

도 8은 본 발명의 다른 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다.8 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 9는 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다.9 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 10은 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 장치를 나 타내는 블록도이다.10 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 12는 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다.12 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 13는 본 발명의 일 실시예에 따른 오디오 신호의 부호화 방법을 나타내는 흐름도이다. 13 is a flowchart illustrating a method of encoding an audio signal according to an embodiment of the present invention.

도 16는 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 방법을 나타내는 흐름도이다.16 is a flowchart illustrating a method of encoding an audio signal according to another embodiment of the present invention.

도 20은 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 방법을 나 타내는 흐름도이다.20 is a flowchart illustrating a method of decoding an audio signal according to another embodiment of the present invention.

Claims

delete

(a) extracting and encoding a stereo parameter from an input signal, and downmixing the input signal;

(b) dividing the downmixed signal into a high frequency band signal and a low frequency band signal;

(c) converting the low frequency band signal from the time domain to the frequency domain by a first conversion method;

(d) quantizing a signal converted into the frequency domain by the first conversion method and encoding the signal into a bit plane based on a context;

(e) converting the high frequency band signal and the low frequency band signal into a frequency domain or a time / frequency domain in a time domain by a second conversion method;

(f) generating and encoding bandwidth extension information indicating characteristics of the high frequency band signal converted by the second conversion method using the low frequency band signal converted by the second conversion method; And

(g) outputting the encoded stereo parameter, the encoded bit plane, and the encoded bandwidth extension information as a result of encoding the input signal.

A computer-readable recording medium having recorded thereon a program for executing the method of encoding an audio signal according to claim 6.

(c) determining whether to encode the low frequency band signal in the time domain or the frequency domain;

(d) encoding the low frequency band signal in the time domain if it is determined to encode the low frequency band signal in the time domain;

(e) if it is determined to encode the low-frequency band signal in the frequency domain, the low-frequency band signal is converted into a frequency domain in the time domain by a first conversion method, and a low- Encoding a signal into a bit plane based on a context;

(f) converting the low frequency band signal and the high frequency band signal into a frequency domain or a time / frequency domain in a time domain by a second conversion method;

(g) generating and encoding bandwidth extension information indicating characteristics of the converted high frequency band signal using the converted low frequency band signal; And

(h) outputting the encoded stereo parameter, the result encoded in the time domain, the encoded bit plane, and the encoded bandwidth extension information as a result of encoding the input signal, / RTI >

9. The method of claim 8,

The step (f)

The low frequency band signal and the high frequency band signal are converted into a frequency domain in the time domain by the first conversion method,

Frequency band signal converted into the frequency domain by the first conversion method in step (e) when the low-frequency band signal is determined to be encoded in the frequency domain. A method of coding a signal.

A computer-readable recording medium having recorded thereon a program for executing the method of encoding an audio signal according to any one of claims 8 and 9.

(a) converting an input signal from a time domain to a frequency domain;

(b) extracting and encoding a stereo parameter from the converted signal, and downmixing the converted signal;

(c) extracting bandwidth extension information from the downmixed signal and encoding the bandwidth extension information;

(d) inversely transforming the downmixed signal into a time domain;

(e) determining whether to encode the inverse-converted signal in a time domain or a frequency domain, and converting the inverse-transformed signal into a time domain or a frequency domain on a subband-by-subband basis according to the determination result;

(f) encoding the signal transformed into the time domain in the time domain if it is determined to encode the inverse transformed signal in the time domain;

(g) quantizing the signal transformed into the frequency domain if the inverse-transformed signal is determined to be encoded in the frequency domain, and encoding the quantized signal into a bit plane based on the context; And

(h) outputting the encoded stereo parameter, the encoded bandwidth extension information, the result encoded in the time domain, and the encoded bit plane as a result of encoding the input signal, / RTI >

12. The method of claim 11,

The step (e)

And performing an FV-MLT (Frequency Varying-Modulated Lapped Transform) on the inversely transformed signal according to the determination result to transform the time domain or frequency domain for each subband.

A computer-readable recording medium having recorded thereon a program for executing the method of encoding an audio signal according to any one of claims 11 to 12.

(a) determining whether to encode an input signal in a time domain or a frequency domain, and converting the input signal into a time domain or a frequency domain on a subband basis according to the determination result;

(d) encoding the downmixed signal in the time domain if it is determined that the downmixed signal is to be encoded in the time domain;

(e) if the downmixed signal is determined to be encoded in the frequency domain, encoding the downmixed signal into a bit plane based on a context; And

(f) outputting the encoded stereo parameter, the encoded bandwidth extension information, the result encoded in the time domain, and the encoded bit plane as a result of encoding the input signal, / RTI >

15. The method of claim 14,

The step (a)

And performing an FV-MLT on the input signal according to a result of the determination, thereby converting the input signal into a time domain or a frequency domain on a subband-by-subband basis.

A computer-readable recording medium having recorded thereon a program for executing the method of encoding an audio signal according to any one of claims 14 to 15.

(a) receiving an encoding result of an audio signal;

(b) generating a low frequency band signal by decoding the encoded bit plane included in the encoding result based on the context and dequantizing the encoded bit plane;

(c) decoding the encoded bandwidth extension information included in the encoding result, and generating a high frequency band signal from the low frequency band signal using the decoded bandwidth extension information;

(d) converting each of the low-frequency band signal and the high-frequency band signal into a time domain from a frequency domain by a first inverse transformation method;

(e) combining the inversely transformed low frequency band signal and the inverse transformed high frequency band signal; And

(f) decoding the encoded stereo parameter included in the encoding result, and upmixing the synthesized signal using the decoded stereo parameter.

18. The method of claim 17,

The step (b)

Synthesizing the dequantized signal with multi-resolution; And

And synthesizing the result of frequency-linear-prediction performed at the encoding end using the vector index included in the encoding result, into the inverse-quantized signal or the multi-resolution synthesized signal, And decodes the audio signal.

A computer-readable recording medium having recorded thereon a program for executing a method of decoding an audio signal according to any one of claims 17 and 18.

(a) receiving an encoding result of an audio signal;

(c) converting the low frequency band signal from the frequency domain to the time domain by a first inverse transformation method;

(d) converting a low-frequency band signal inversely transformed by the first inverse transform method into a frequency domain or a time / frequency domain by a first conversion method;

(e) decoding the encoded bandwidth extension information included in the encoding result, and extracting a high frequency band from the low frequency band signal converted into the frequency domain or the time / frequency domain by the first conversion method using the decoded bandwidth extension information, Generating a signal;

(f) inversely transforming the high frequency band signal into a time domain by a second inverse transformation method;

(g) combining the inversely transformed high frequency band signal and the converted low frequency band signal; And

(h) decoding the encoded stereo parameter included in the encoding result, and upmixing the synthesized signal using the decoded stereo parameter.

A computer-readable recording medium having recorded thereon a program for executing the decoding method of an audio signal according to claim 20.

(a) receiving a result of encoding an audio signal in a time domain or a frequency domain;

(b) generating a low frequency band signal by decoding and inverse-quantizing the encoded bit planes included in the encoding result in the frequency domain based on the context;

(c) inversely transforming the low frequency band signal into a time domain by a first inverse transform method;

(d) converting a signal inversely transformed into a time domain by the first inverse transform method into a frequency domain or a time / frequency domain by a first transformation method;

(e) decoding the encoded bandwidth extension information included in the encoding result in the frequency domain, and decoding the encoded low frequency band information in the frequency domain or the time / frequency domain by the first conversion method using the decoded bandwidth extension information. Generating a high frequency band signal from the signal;

(g) generating the low frequency band signal by decoding the encoding result in the time domain in the time domain;

(h) synthesizing a signal inversely transformed into a time domain by the first inverse transform method, a high frequency band signal inversely transformed into a time domain by the second inverse transform method, and a low frequency band signal decoded in the time domain; And

(i) decoding the encoded stereo parameter included in the encoding result, and upmixing the synthesized signal using the decoded stereo parameter.

23. The method of claim 22,

The step (b)

(b1) synthesizing the dequantized signal with multi-resolution; And

(b2) synthesizing the result of frequency-linear-prediction performed at the encoding end using the vector index included in the encoding result, into the dequantized signal or the signal synthesized with the multi-resolution ,

The step (e)

And the high frequency band signal is generated from the signal synthesized in the step (b1) or (b2) using the decoded bandwidth extension information.

A computer-readable recording medium having recorded thereon a program for executing the method of decoding an audio signal according to any one of claims 22 and 23.

(a) receiving an encoding result in a frequency domain or a time domain of an audio signal;

(b) decoding and inverse-quantizing the encoded bit planes included in the encoding result in the frequency domain based on the context;

(c) decoding the encoded result in the time domain in the time domain;

(d) performing an inverse FV-MLT on the signal dequantized in the step (b) or the signal decoded in the step (c) so as to convert the signal into the time domain;

(e) converting the transformed signal into a frequency domain or a time / frequency domain;

(f) decoding the encoded bandwidth extension information included in the encoding result, and generating a full-band signal from the frequency domain or time / frequency domain converted signal using the decoded bandwidth extension information;

(g) decoding the encoded stereo parameter included in the encoding result, and upmixing the decoded signal using the decoded stereo parameter; And

(h) inversely transforming the upmixed signal into a time domain.

A computer-readable recording medium having recorded thereon a program for executing the decoding method of an audio signal according to claim 25.

(c) decoding the encoded result in the time domain in the time domain;

(d) performing MDCT on the signal decoded in the step (c) to convert the time domain to the frequency domain;

(e) decoding the encoded bandwidth extension information included in the encoding result in the frequency domain, extracting the encoded bandwidth extension information from the decoded signal in the frequency domain or the signal converted in the frequency domain using the decoded bandwidth extension information, Generating a signal;

(f) decoding the encoded stereo parameter included in the encoding result, and upmixing the decoded signal using the decoded stereo parameter; And

(g) transforming the upmixed signal into a time domain by applying an inverse FV-MLT to the upmixed signal.

A computer-readable recording medium having recorded thereon a program for executing the decoding method of an audio signal according to claim 27.

delete

A stereo encoding unit for extracting and encoding a stereo parameter from an input signal and down-mixing the input signal;

A band dividing unit dividing the downmixed signal into a high frequency band signal and a low frequency band signal;

An MDCT applying unit for performing MDCT on the low frequency band signal to convert the time domain into a frequency domain;

A low frequency band encoding unit for quantizing the signal subjected to the MDCT and decoding the signal into a bit plane based on a context;

A converter for converting the high frequency band signal and the low frequency band signal into a frequency domain or a time / frequency domain in a time domain; And

And a bandwidth extension encoding unit for generating and encoding bandwidth extension information indicating the characteristics of the converted high frequency band signal using the converted low frequency band signal.

A mode determination unit for determining whether to encode the low frequency band signal in the time domain or the frequency domain;

A CELP encoding unit for encoding the low frequency band signal according to the CELP scheme when it is determined to encode the low frequency band signal in the time domain;

An MDCT applying unit for performing MDCT on the low frequency band signal and converting the low frequency band signal from a time domain into a frequency domain when it is determined to encode the low frequency band signal in the frequency domain;

A low frequency band encoding unit for quantizing the low frequency band signal on which the MDCT has been performed and encoding the low frequency band signal into a bit plane based on a context;

A converter for converting the low frequency band signal and the high frequency band signal into a frequency domain or a time / frequency domain in a time domain; And

32. The method of claim 31,

The conversion unit

Performing MDCT on the low-frequency band signal and the high-frequency band signal, respectively, to convert the time domain to the frequency domain,

Wherein the low frequency band signal is replaced with a frequency domain converted signal by the MDCT application unit when it is determined to encode the low frequency band signal in the frequency domain.

A converter for converting an input signal from a time domain to a frequency domain;

A stereo encoding unit for extracting and encoding a stereo parameter from the converted signal, and downmixing the converted signal;

A bandwidth extension encoder for extracting bandwidth extension information from the downmixed signal and encoding the bandwidth extension information;

An inverse transformer for inversely transforming the downmixed signal into a time domain;

A mode decision unit for deciding whether to encode the inversely transformed signal in a time domain or a frequency domain;

An FV-MLT applying unit for applying FV-MLT to the inverse transformed signal according to the determination result and transforming the inverse transformed signal into a time domain or a frequency domain for each subband;

A CELP encoding unit for encoding the signal converted into the time domain according to the CELP scheme when it is determined to encode the inversely converted signal in the time domain; And

And a frequency domain coding unit for quantizing the signal converted into the frequency domain and encoding the signal into a bit plane based on the context, when it is determined that the inverse transformed signal is to be encoded in the frequency domain.

A mode decision unit for deciding whether to encode the input signal in the time domain or the frequency domain;

An FV-MLT applying unit for applying FV-MLT to the input signal according to the determination result and transforming the input signal into a time domain or a frequency domain for each subband;

A CELP encoding unit for encoding the downmixed signal according to the CELP scheme when it is determined that the downmixed signal is encoded in the time domain; And

And a frequency domain coding unit for quantizing the downmixed signal and encoding the downmixed signal into a bit plane based on a context when the downmixed signal is determined to be encoded in the frequency domain.

A low frequency band decoding unit decoding a coded bit plane based on a context and dequantizing the coded bit plane to generate a low frequency band signal;

A bandwidth extension decoder for decoding the encoded bandwidth extension information and generating a high frequency band signal from the low frequency band signal using the decoded bandwidth extension information;

An inverse MDCT applying unit for performing inverse MDCT on each of the low frequency band signal and the high frequency band signal and converting the frequency band to the time domain;

A band synthesizer for synthesizing the converted low frequency band signal and the converted high frequency band signal; And

And a stereo decoding unit decoding the encoded stereo parameter and upmixing the synthesized signal using the decoded stereo parameter.

36. The method of claim 35,

The low-frequency band decoding unit

A multi-resolution synthesizer for synthesizing the dequantized signal with multi-resolution; And

Further comprising at least one of an inverse frequency linear prediction unit for combining a result obtained by performing frequency linear prediction at an encoding end using the vector index into the dequantized signal or the signal synthesized with the multi- An apparatus for decoding an audio signal.

An inverse MDCT applying unit that performs inverse MDCT on the low frequency band signal and converts the inverse MDCT from the frequency domain to the time domain;

A transform unit for transforming the low frequency band signal in which the inverse MDCT is performed into a frequency domain or a time / frequency domain;

A bandwidth extension decoder for decoding the encoded bandwidth extension information and generating a high frequency band signal from the low frequency band signal converted into the frequency domain or the time / frequency domain using the decoded bandwidth extension information;

An inverse transformer for inversely transforming the generated high frequency band signal into a time domain;

A band synthesizer for synthesizing the inversely converted high frequency band signal and the converted low frequency band signal; And

A low frequency band decoding unit decoding a bit plane encoded in a frequency domain based on a context and generating a low frequency band signal by dequantizing the bit plane;

An inverse MDCT applying unit that performs inverse MDCT on the low frequency band signal and inverse transforms the inverse MDCT into a time domain;

A transform unit for transforming the signal in which the inverse MDCT is performed into a frequency domain or a time / frequency domain;

A bandwidth extension decoder for decoding the bandwidth extension information encoded in the frequency domain and generating a high frequency band signal from the frequency domain or time / frequency domain converted signal using the decoded bandwidth extension information;

A CELP decoding unit for decoding the CELP encoded information encoded in the time domain to generate the low frequency band signal;

A band synthesizer for synthesizing the inverse MDCT-processed signal, the inverse transformed high-frequency band signal, and the low-frequency band signal generated by the CELP method; And

A frequency domain decoding unit decoding and inverse-quantizing the bit plane encoded in the frequency domain based on the context;

A CELP decoding unit decoding the CELP encoded information encoded in the time domain;

An inverse FV-MLT applying unit for performing an inverse FV-MLT on a signal decoded by the frequency domain decoding unit or the CELP decoding unit and converting the decoded signal into a time domain;

A converter for converting the converted signal into a frequency domain or a time / frequency domain;

A bandwidth extension decoder for decoding the coded bandwidth extension information and generating a signal of the entire band from the frequency domain or the time / frequency domain converted signal using the decoded bandwidth extension information;

A stereo decoding unit decoding the encoded stereo parameter and upmixing the generated signal using the decoded stereo parameter; And

And an inverse transformer for inversely transforming the upmixed signal into a time domain.

An MDCT applying unit for performing MDCT on the signal output from the CELP decoding unit and converting the time domain into a frequency domain;

A bandwidth extension decoder for decoding the coded bandwidth extension information and generating a signal of the entire band from the signal output from the frequency domain decoding unit or the signal output from the MDCT application unit using the decoded bandwidth extension information;

And an inverse FV-MLT applying unit for applying the inverse FV-MLT to the upmixed signal to convert the upmixed signal into a time domain.