KR20080025636A

KR20080025636A - Method and apparatus for encoding and decoding audio signal using band width extension technique

Info

Publication number: KR20080025636A
Application number: KR1020070079781A
Authority: KR
Inventors: 주기현; 오은미; 레이 미아오
Original assignee: 삼성전자주식회사
Priority date: 2006-09-18
Filing date: 2007-08-08
Publication date: 2008-03-21
Also published as: CN101162584A; KR101346358B1

Abstract

A method and an apparatus for encoding/decoding an audio signal by using bandwidth expansion technique are provided to output an encoded bitplane and encoded bandwidth expansion information as an encoded result of an input signal, thereby efficiently encoding high frequency components at a limited bit rate and accordingly improving sound quality. An audio signal encoding method comprises the following steps of: splitting an input signal into a high band signal and a low band signal(1300); converting each of the high and low band signals from a time domain to a frequency domain(1310); performing quantization and context-dependent bitplane encoding of the converted low band signal(1320); generating and encoding bandwidth expansion information showing a characteristic of the converted high band signal by using the converted low band signal(1330); and outputting the encoded bitplane and the encoded bandwidth expansion information as an encoded result of the input signal(1340).

Description

Method and apparatus for encoding / decoding audio signal using bandwidth extension technique {Method and apparatus for encoding and decoding audio signal using band width extension technique}

본 발명은 오디오 신호의 부호화/복호화 방법 및 장치에 관한 것으로, 보다 상세하게는 대역폭 확장 기법을 이용한 오디오 신호의 부호화/복호화 방법 및 장치에 관한 것이다.The present invention relates to a method and apparatus for encoding / decoding an audio signal, and more particularly, to a method and apparatus for encoding / decoding an audio signal using a bandwidth extension technique.

오디오 신호를 부호화하거나 복호화함에 있어서 한정된 비트율을 이용하여 최대한 음질을 향상시키는 것이 요구된다. 저비트율에서는 가용 비트가 적으므로 부호화되는 오디오 신호의 주파수 대역을 줄여서 부호화해야 하므로, 음질이 저하될 수 있다. In encoding or decoding an audio signal, it is required to improve sound quality as much as possible using a limited bit rate. At low bit rates, since the available bits are small, the frequency band of the audio signal to be encoded must be reduced so that the sound quality can be degraded.

일반적으로, 저주파수 성분은 고주파수 성분에 비하여 인간이 지각하는데 중요하게 영향을 미친다. 따라서, 저주파수 성분을 부호화하는데 할당되는 비트를 늘리고, 고주파수 성분을 부호화하는데 할당되는 비트를 줄이면서 음질을 향상시킬 수 있는 방법이 요구된다. In general, low frequency components have a significant effect on human perception compared to high frequency components. Therefore, there is a need for a method capable of improving sound quality while increasing the bits allocated for encoding low frequency components and reducing the bits allocated for encoding high frequency components.

본 발명이 해결하고자 하는 과제는 한정된 비트율에서 고주파수 성분을 효율적으로 부호화하여 음질을 향상시킬 수 있는 오디오 신호의 부호화 방법 및 장치를 제공하는데 있다.SUMMARY OF THE INVENTION An object of the present invention is to provide a method and apparatus for encoding an audio signal capable of improving sound quality by efficiently encoding high frequency components at a limited bit rate.

본 발명이 해결하고자 하는 다른 과제는 한정된 비트율에서 부호화된 비트스트림으로부터 고주파수 성분을 효율적으로 복호화하는 오디오 신호의 복호화 방법 및 장치를 제공하는데 있다.Another object of the present invention is to provide a method and apparatus for decoding an audio signal which efficiently decodes high frequency components from a bitstream encoded at a limited bit rate.

상기 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 방법은 (a) 입력 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 단계; (b) 상기 고주파수 밴드 신호 및 상기 저주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인으로 변환하는 단계; (c) 상기 변환된 저주파수 밴드 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 단계; (d) 상기 변환된 저주파수 밴드 신호를 이용하여 상기 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 단계; 및 (e) 상기 부호화된 비트플레인 및 상기 부호화된 대역폭 확장 정보를 상기 입력 신호에 대한 부호화 결과로써 출력하는 단계를 포함한다.According to an aspect of the present invention, there is provided a method of encoding an audio signal, the method including: (a) dividing an input signal into a high frequency band signal and a low frequency band signal; (b) converting the high frequency band signal and the low frequency band signal from the time domain to the frequency domain, respectively; (c) quantizing the converted low frequency band signal and encoding the bitplane based on a context; (d) generating and encoding bandwidth extension information indicating a characteristic of the converted high frequency band signal using the converted low frequency band signal; And (e) outputting the encoded bitplane and the encoded bandwidth extension information as an encoding result of the input signal.

또한, 상기 과제는 (a) 입력 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 단계; (b) 상기 고주파수 밴드 신호 및 상기 저주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인으로 변환하는 단계; (c) 상기 변환된 저주 파수 밴드 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 단계; (d) 상기 변환된 저주파수 밴드 신호를 이용하여 상기 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 단계; 및 (e) 상기 부호화된 비트플레인 및 상기 부호화된 대역폭 확장 정보를 상기 입력 신호에 대한 부호화 결과로써 출력하는 단계를 포함하는 오디오 신호의 부호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 의해 달성된다.In addition, the object is (a) dividing the input signal into a high frequency band signal and a low frequency band signal; (b) converting the high frequency band signal and the low frequency band signal from the time domain to the frequency domain, respectively; (c) quantizing the transformed low frequency band signal and encoding the bitplane based on a context; (d) generating and encoding bandwidth extension information indicating a characteristic of the converted high frequency band signal using the converted low frequency band signal; And (e) outputting the encoded bitplane and the encoded bandwidth extension information as an encoding result of the input signal. A computer-readable recording medium having recorded thereon a program for executing an encoding method of an audio signal. Is achieved by.

또한, 상기 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 방법은 (a) 입력 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 단계; (b) 상기 저주파수 밴드 신호에 대하여 MDCT를 수행하여 시간 도메인에서 주파수 도메인으로 변환하는 단계; (c) 상기 MDCT가 수행된 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 단계; (d) 상기 고주파수 밴드 신호 및 상기 저주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 단계; (e) 상기 변환된 저주파수 밴드 신호를 이용하여 상기 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 단계; 및 (f) 상기 부호화된 비트플레인 및 상기 부호화된 대역폭 확장 정보를 상기 입력 신호에 대한 부호화 결과로써 출력하는 단계를 포함한다.In addition, the method for encoding an audio signal according to the present invention for solving the other problem comprises the steps of: (a) dividing an input signal into a high frequency band signal and a low frequency band signal; (b) performing MDCT on the low frequency band signal and converting from the time domain to the frequency domain; (c) quantizing the signal on which the MDCT is performed and encoding the bitplane based on a context; (d) converting the high frequency band signal and the low frequency band signal from time domain to frequency domain or time / frequency domain, respectively; (e) generating and encoding bandwidth extension information representing a characteristic of the converted high frequency band signal using the converted low frequency band signal; And (f) outputting the encoded bitplane and the encoded bandwidth extension information as an encoding result of the input signal.

또한, 상기 다른 과제는 (a) 입력 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 단계; (b) 상기 저주파수 밴드 신호에 대하여 MDCT를 수행하여 시간 도메인에서 주파수 도메인으로 변환하는 단계; (c) 상기 MDCT가 수행된 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 단계; (d) 상기 고주 파수 밴드 신호 및 상기 저주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 단계; (e) 상기 변환된 저주파수 밴드 신호를 이용하여 상기 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 단계; 및 (f) 상기 부호화된 비트플레인 및 상기 부호화된 대역폭 확장 정보를 상기 입력 신호에 대한 부호화 결과로써 출력하는 단계를 포함하는 오디오 신호의 부호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 의해 달성된다.In addition, the other problem is (a) dividing the input signal into a high frequency band signal and a low frequency band signal; (b) performing MDCT on the low frequency band signal and converting from the time domain to the frequency domain; (c) quantizing the signal on which the MDCT is performed and encoding the bitplane based on a context; (d) converting the high frequency band signal and the low frequency band signal from time domain to frequency domain or time / frequency domain, respectively; (e) generating and encoding bandwidth extension information representing a characteristic of the converted high frequency band signal using the converted low frequency band signal; And (f) outputting the encoded bitplane and the encoded bandwidth extension information as an encoding result of the input signal. A computer-readable recording medium having recorded thereon a program for executing an encoding method of an audio signal. Is achieved by.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 방법은 (a) 입력 신호를 시간 도메인에서 주파수 도메인으로 변환하는 단계; (b) 상기 변환된 입력 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 단계; (c) 상기 저주파수 밴드 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 단계; (d) 상기 저주파수 밴드 신호를 이용하여 상기 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 단계; 및 (e) 상기 부호화된 비트플레인 및 상기 부호화된 대역폭 확장 정보를 상기 입력 신호에 대한 부호화 결과로써 출력하는 단계를 포함한다.In addition, the method for encoding an audio signal according to the present invention for solving the another problem comprises the steps of (a) converting the input signal from the time domain to the frequency domain; (b) dividing the converted input signal into a high frequency band signal and a low frequency band signal; (c) quantizing the low frequency band signal and encoding the bitplane based on a context; (d) generating and encoding bandwidth extension information representing the characteristics of the high frequency band signal using the low frequency band signal; And (e) outputting the encoded bitplane and the encoded bandwidth extension information as an encoding result of the input signal.

또한, 상기 또 다른 과제는 (a) 입력 신호를 시간 도메인에서 주파수 도메인으로 변환하는 단계; (b) 상기 변환된 입력 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 단계; (c) 상기 저주파수 밴드 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 단계; (d) 상기 저주파수 밴드 신호를 이용하여 상기 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호 화하는 단계; 및 (e) 상기 부호화된 비트플레인 및 상기 부호화된 대역폭 확장 정보를 상기 입력 신호에 대한 부호화 결과로써 출력하는 단계를 포함하는 오디오 신호의 부호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 의해 달성된다.In addition, the further object is (a) converting the input signal from the time domain to the frequency domain; (b) dividing the converted input signal into a high frequency band signal and a low frequency band signal; (c) quantizing the low frequency band signal and encoding the bitplane based on a context; (d) generating and encoding bandwidth extension information representing the characteristics of the high frequency band signal using the low frequency band signal; And (e) outputting the encoded bitplane and the encoded bandwidth extension information as an encoding result of the input signal. A computer-readable recording medium having recorded thereon a program for executing an encoding method of an audio signal. Is achieved by.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 복호화 방법은 (a) 오디오 신호의 부호화 결과를 입력받는 단계, (b) 상기 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하는 단계; (c) 상기 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 단계; (d) 상기 저주파수 밴드 신호 및 상기 고주파수 밴드 신호 각각에 대하여 역 MDCT(Inverse Modified Discrete Cosine Transform)를 수행하여 주파수 도메인에서 시간 도메인으로 변환하는 단계; 및 (e) 상기 변환된 저주파수 밴드 신호 및 상기 변환된 고주파수 밴드 신호를 합성하는 단계를 포함한다.In addition, according to another aspect of the present invention, there is provided a method of decoding an audio signal, the method comprising: (a) receiving an encoding result of an audio signal; Decoding and inverse quantizing to generate a low frequency band signal; (c) decoding the encoded bandwidth extension information included in the encoding result and generating a high frequency band signal from the low frequency band signal using the decoded bandwidth extension information; (d) performing an Inverse Modified Discrete Cosine Transform (MDCT) on each of the low frequency band signal and the high frequency band signal to convert from the frequency domain to the time domain; And (e) synthesizing the converted low frequency band signal and the converted high frequency band signal.

또한, 상기 또 다른 과제는 (a) 오디오 신호의 부호화 결과를 입력받는 단계; (b) 상기 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하는 단계; (c) 상기 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 단계; (d) 상기 저주파수 밴드 신호 및 상기 고주파수 밴드 신호 각각에 대하여 역 MDCT(Inverse Modified Discrete Cosine Transform)를 수행하여 주파수 도메인에서 시간 도메인으로 변환하는 단계; 및 (e) 상기 변환된 저주파수 밴드 신호 및 상기 변환된 고주파수 밴드 신호를 합성하는 단계를 포함하는 오디오 신호의 복호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 의해 달성된다.In addition, the another object is (a) receiving an encoding result of the audio signal; (b) generating a low frequency band signal by decoding and inverse quantizing the encoded bitplane included in the encoding result based on a context; (c) decoding the encoded bandwidth extension information included in the encoding result and generating a high frequency band signal from the low frequency band signal using the decoded bandwidth extension information; (d) performing an Inverse Modified Discrete Cosine Transform (MDCT) on each of the low frequency band signal and the high frequency band signal to convert from the frequency domain to the time domain; And (e) synthesizing the converted low frequency band signal and the converted high frequency band signal, by a computer readable recording medium having recorded thereon a program for executing a method of decoding an audio signal.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 복호화 방법은 (a) 오디오 신호의 부호화 결과를 입력받는 단계; (b) 상기 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하는 단계; (c) 상기 저주파수 밴드 신호에 대하여 역 MDCT를 수행하여 주파수 도메인에서 시간 도메인으로 변환하는 단계; (d) 상기 역 MDCT가 수행된 저주파수 밴드 신호를 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 단계; (e) 상기 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 정보를 이용하여 상기 주파수 도메인 또는 시간/주파수 도메인으로 변환된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 단계; (f) 상기 고주파수 밴드 신호를 시간 도메인으로 역변환하는 단계; 및 (g) 상기 역변환된 고주파수 밴드 신호 및 상기 변환된 저주파수 밴드 신호를 합성하는 단계를 포함한다.In addition, the decoding method of an audio signal according to the present invention for solving the another problem comprises the steps of: (a) receiving an encoding result of the audio signal; (b) generating a low frequency band signal by decoding and inverse quantizing the encoded bitplane included in the encoding result based on a context; (c) performing inverse MDCT on the low frequency band signal and converting from the frequency domain to the time domain; (d) converting the low frequency band signal subjected to the inverse MDCT into a frequency domain or a time / frequency domain; (e) decoding the encoded bandwidth extension information included in the encoding result and generating a high frequency band signal from the low frequency band signal converted into the frequency domain or the time / frequency domain using the decoded bandwidth information; (f) inversely converting the high frequency band signal into a time domain; And (g) synthesizing the inverse transformed high frequency band signal and the converted low frequency band signal.

또한, 상기 또 다른 과제는 (a) 오디오 신호의 부호화 결과를 입력받는 단계; (b) 상기 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하는 단계; (c) 상기 저주파수 밴 드 신호에 대하여 역 MDCT를 수행하여 주파수 도메인에서 시간 도메인으로 변환하는 단계; (d) 상기 역 MDCT가 수행된 저주파수 밴드 신호를 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 단계; (e) 상기 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 주파수 도메인 또는 시간/주파수 도메인으로 변환된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 단계; (f) 상기 고주파수 밴드 신호를 시간 도메인으로 역변환하는 단계; 및 (g) 상기 역변환된 고주파수 밴드 신호 및 상기 변환된 저주파수 밴드 신호를 합성하는 단계를 포함하는 오디오 신호의 복호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 의해 달성된다.In addition, the another object is (a) receiving an encoding result of the audio signal; (b) generating a low frequency band signal by decoding and inverse quantizing the encoded bitplane included in the encoding result based on a context; (c) performing inverse MDCT on the low frequency band signal to convert from frequency domain to time domain; (d) converting the low frequency band signal subjected to the inverse MDCT into a frequency domain or a time / frequency domain; (e) decoding the encoded bandwidth extension information included in the encoding result and generating a high frequency band signal from the low frequency band signal converted into the frequency domain or the time / frequency domain using the decoded bandwidth extension information; (f) inversely converting the high frequency band signal into a time domain; And (g) synthesizing the inversely transformed high frequency band signal and the converted low frequency band signal, by a computer readable recording medium having recorded thereon a program for executing a method for decoding an audio signal.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 복호화 방법은 (a) 오디오 신호의 부호화 결과를 입력받는 단계; (b) 상기 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하는 단계; (c) 상기 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 단계; (d) 상기 저주파수 밴드 신호 및 상기 고주파수 밴드 신호를 합성하는 단계; 및 (d) 상기 합성된 신호에 대하여 역 MDCT를 수행하여 주파수 도메인에서 시간 도메인으로 변환하는 단계를 포함한다.In addition, the decoding method of an audio signal according to the present invention for solving the another problem comprises the steps of: (a) receiving an encoding result of the audio signal; (b) generating a low frequency band signal by decoding and inverse quantizing the encoded bitplane included in the encoding result based on a context; (c) decoding the encoded bandwidth extension information included in the encoding result and generating a high frequency band signal from the low frequency band signal using the decoded bandwidth extension information; (d) synthesizing the low frequency band signal and the high frequency band signal; And (d) performing inverse MDCT on the synthesized signal to convert from the frequency domain to the time domain.

또한, 상기 또 다른 과제는 (a) 오디오 신호의 부호화 결과를 입력받는 단계; (b) 상기 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호 화하고 역양자화하여 저주파수 밴드 신호를 생성하는 단계; (c) 상기 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 정보를 이용하여 상기 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 단계; (d) 상기 저주파수 밴드 신호 및 상기 고주파수 밴드 신호를 합성하는 단계; 및 (d) 상기 합성된 신호에 대하여 역 MDCT를 수행하여 주파수 도메인에서 시간 도메인으로 변환하는 단계를 포함하는 오디오 신호의 복호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 의해 달성된다.In addition, the another object is (a) receiving an encoding result of the audio signal; (b) generating a low frequency band signal by decoding and inverse quantizing the encoded bitplane included in the encoding result based on a context; (c) decoding the encoded bandwidth extension information included in the encoding result and generating a high frequency band signal from the low frequency band signal using the decoded bandwidth information; (d) synthesizing the low frequency band signal and the high frequency band signal; And (d) performing inverse MDCT on the synthesized signal and converting the frequency signal from the frequency domain to the time domain, by means of a computer-readable recording medium having recorded thereon a program for executing a method of decoding an audio signal. .

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 장치는 입력 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 밴드 분할부; 상기 고주파수 밴드 신호 및 상기 저주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인으로 변환하는 변환부; 상기 변환된 저주파수 밴드 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 저주파수 밴드 부호화부; 및 상기 변환된 저주파수 밴드 신호를 이용하여 상기 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 대역폭 확장 부호화부를 포함한다.In addition, an apparatus for encoding an audio signal according to the present invention for solving the above another problem comprises: a band dividing unit for dividing an input signal into a high frequency band signal and a low frequency band signal; A converter for converting the high frequency band signal and the low frequency band signal from a time domain to a frequency domain, respectively; A low frequency band encoder which quantizes the converted low frequency band signal and encodes the bit frequency into a bit plane based on a context; And a bandwidth extension encoder for generating and encoding bandwidth extension information indicating a characteristic of the converted high frequency band signal by using the converted low frequency band signal.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 장치는 입력 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 밴드 분할부; 상기 저주파수 밴드 신호에 대하여 MDCT를 수행하여 시간 도메인에서 주파수 도메인으로 변환하는 MDCT 적용부; 상기 MDCT 적용부의 출력을 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 저주파수 밴드 부호화부; 상기 고주파 수 밴드 신호 및 상기 저주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 변환부; 및 상기 변환된 저주파수 밴드 신호를 이용하여 상기 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 대역폭 확장 부호화부를 포함한다.In addition, an apparatus for encoding an audio signal according to the present invention for solving the above another problem comprises: a band dividing unit for dividing an input signal into a high frequency band signal and a low frequency band signal; An MDCT application unit performing an MDCT on the low frequency band signal and converting the time domain from the time domain to the frequency domain; A low frequency band encoder for quantizing the output of the MDCT applier and encoding the bitplane based on a context; A converter for converting the high frequency band signal and the low frequency band signal from a time domain into a frequency domain or a time / frequency domain; And a bandwidth extension encoder for generating and encoding bandwidth extension information indicating a characteristic of the converted high frequency band signal by using the converted low frequency band signal.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 장치는 입력 신호를 시간 도메인에서 주파수 도메인으로 변환하는 변환부; 상기 변환된 입력 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하는 밴드 분할부; 상기 저주파수 밴드 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하는 저주파수 밴드 부호화부; 및 상기 저주파수 밴드 신호를 이용하여 상기 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하는 대역폭 확장 부호화부를 포함한다.In addition, an apparatus for encoding an audio signal according to an embodiment of the present invention for solving the another problem includes a converter for converting the input signal from the time domain to the frequency domain; A band dividing unit dividing the converted input signal into a high frequency band signal and a low frequency band signal; A low frequency band encoder for quantizing the low frequency band signal and encoding the bit frequency based on a context; And a bandwidth extension encoder for generating and encoding bandwidth extension information indicating a characteristic of the high frequency band signal by using the low frequency band signal.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 복호화 장치는 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하는 저주파수 밴드 복호화부; 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 대역폭 확장 복호화부; 상기 저주파수 밴드 신호 및 상기 고주파수 밴드 신호 각각에 대하여 역 MDCT를 수행하여 주파수 도메인에서 시간 도메인으로 변환하는 역 MDCT 수행부; 및 상기 변환된 저주파수 밴드 신호 및 상기 변환된 고주파수 밴드 신호를 합성하는 밴드 합성부를 포함한다.In addition, an apparatus for decoding an audio signal according to the present invention for solving the another object is a low frequency band decoder for generating a low frequency band signal by decoding and inverse quantization of the encoded bit plane based on the context; A bandwidth extension decoder for decoding encoded bandwidth extension information and generating a high frequency band signal from the low frequency band signal using the decoded bandwidth extension information; An inverse MDCT performing unit performing inverse MDCT on each of the low frequency band signal and the high frequency band signal and converting from the frequency domain to the time domain; And a band synthesizer for synthesizing the converted low frequency band signal and the converted high frequency band signal.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 복호화 장치는 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하는 저주파수 밴드 복호화부; 상기 저주파수 밴드 신호에 대하여 역 MDCT를 수행하여 주파수 도메인에서 시간 도메인으로 변환하는 역 MDCT 적용부; 상기 역 MDCT가 수행된 저주파수 밴드 신호를 주파수 도메인 또는 시간/주파수 도메인으로 변환하는 변환부; 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 주파수 도메인 또는 시간/주파수 도메인으로 변환된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 대역폭 확장 복호화부; 상기 복호화된 고주파수 밴드 신호를 시간 도메인으로 역변환하는 역변환부; 및 상기 역변환된 고주파수 밴드 신호 및 상기 변환된 저주파수 밴드 신호를 합성하는 밴드 합성부를 포함한다.In addition, an apparatus for decoding an audio signal according to the present invention for solving the another object is a low frequency band decoder for generating a low frequency band signal by decoding and inverse quantization of the encoded bit plane based on the context; An inverse MDCT application unit performing inverse MDCT on the low frequency band signal and converting from the frequency domain to the time domain; A converter for converting the low frequency band signal subjected to the inverse MDCT to a frequency domain or a time / frequency domain; A bandwidth extension decoder for decoding encoded bandwidth extension information and generating a high frequency band signal from the low frequency band signal converted into the frequency domain or the time / frequency domain using the decoded bandwidth extension information; An inverse transformer for inversely transforming the decoded high frequency band signal into a time domain; And a band synthesizer configured to synthesize the inverse transformed high frequency band signal and the converted low frequency band signal.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 복호화 장치는 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하는 저주파수 밴드 복호화부; 부호화된 대역폭 확장 정보를 복호화하고, 상기 복호화된 대역폭 확장 정보를 이용하여 상기 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하는 대역폭 확장 복호화부; 상기 저주파수 밴드 신호 및 상기 고주파수 밴드 신호를 합성하는 밴드 합성부; 및 상기 합성된 신호에 대하여 역 MDCT를 수행하여 주파수 도메인에서 시간 도메인으로 변환하는 역 MDCT 적용부를 포함한다.In addition, an apparatus for decoding an audio signal according to the present invention for solving the another object is a low frequency band decoder for generating a low frequency band signal by decoding and inverse quantization of the encoded bit plane based on the context; A bandwidth extension decoder for decoding encoded bandwidth extension information and generating a high frequency band signal from the low frequency band signal using the decoded bandwidth extension information; A band synthesizer for synthesizing the low frequency band signal and the high frequency band signal; And an inverse MDCT application unit performing inverse MDCT on the synthesized signal and converting from the frequency domain to the time domain.

본 발명에 따르면, 입력 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할하고, 고주파수 밴드 신호 및 저주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인으로 변환하며, 변환된 저주파수 밴드 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화하고, 변환된 저주파수 밴드 신호를 이용하여 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화하며, 부호화된 비트플레인 및 부호화된 대역폭 확장 정보를 입력 신호에 대한 부호화 결과로써 출력함으로써, 한정된 비트율에서 고주파수 성분을 효율적으로 부호화하여 음질을 향상시킬 수 있다.According to the present invention, the input signal is divided into a high frequency band signal and a low frequency band signal, the high frequency band signal and the low frequency band signal are respectively converted from the time domain to the frequency domain, and the converted low frequency band signal is quantized to perform bit based on context. And encoding and generating bandwidth extension information representing the characteristics of the transformed high frequency band signal using the transformed low frequency band signal, and outputting the encoded bitplane and the encoded bandwidth extension information as the encoding result of the input signal. As a result, the sound quality can be improved by efficiently encoding the high frequency components at a limited bit rate.

또한, 본 발명에 따르면, 오디오 신호의 부호화 결과를 입력받고, 부호화 결과에 포함된 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성하며, 부호화 결과에 포함된 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성하고, 저주파수 밴드 신호 및 고주파수 밴드 신호 각각에 대하여 역 MDCT(Inverse Modified Discrete Cosine Transform)를 수행하여 주파수 도메인에서 시간 도메인으로 변환하며, 변환된 저주파수 밴드 신호 및 변환된 고주파수 밴드 신호를 합성함으로써, 한정된 비트율에서 부호화된 비트스트림으로부터 고주파수 성분을 효율적으로 복호화할 수 있다.In addition, according to the present invention, the encoding result of the audio signal is input, the encoded bitplane included in the encoding result is decoded and dequantized based on a context to generate a low frequency band signal, and the encoded bandwidth included in the encoding result. Decode the extension information, generate a high frequency band signal from the low frequency band signal using the decoded bandwidth extension information, and perform an Inverse Modified Discrete Cosine Transform (Inverse MDCT) on each of the low frequency band signal and the high frequency band signal in the frequency domain By converting to the time domain and synthesizing the converted low frequency band signal and the converted high frequency band signal, it is possible to efficiently decode the high frequency component from the bitstream encoded at the limited bit rate.

본문에 개시되어 있는 본 발명의 실시예들에 대해서, 특정한 구조적 내지 기능적 설명들은 단지 본 발명의 실시예를 설명하기 위한 목적으로 예시된 것으로, 본 발명의 실시예들은 다양한 형태로 실시될 수 있으며 본문에 설명된 실시예들에 한정되는 것으로 해석되어서는 아니 된다. With respect to the embodiments of the present invention disclosed in the text, specific structural to functional descriptions are merely illustrated for the purpose of describing embodiments of the present invention, embodiments of the present invention may be implemented in various forms and It should not be construed as limited to the embodiments described in.

본 발명은 다양한 변경을 가할 수 있고 여러 가지 형태를 가질 수 있는 바, 특정 실시예들을 도면에 예시하고 본문에 상세하게 설명하고자 한다. 그러나, 이는 본 발명을 특정한 개시 형태에 대해 한정하려는 것이 아니며, 본 발명의 사상 및 기술 범위에 포함되는 모든 변경, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다. 각 도면을 설명하면서 유사한 참조부호를 구성요소에 대해 사용하였다. As the inventive concept allows for various changes and numerous embodiments, particular embodiments will be illustrated in the drawings and described in detail in the text. However, this is not intended to limit the present invention to the specific disclosed form, it should be understood to include all modifications, equivalents, and substitutes included in the spirit and scope of the present invention. In describing the drawings, similar reference numerals are used for the components.

다르게 정의되지 않는 한, 기술적이거나 과학적인 용어를 포함해서 여기서 사용되는 모든 용어들은 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에 의해 일반적으로 이해되는 것과 동일한 의미를 가지고 있다. 일반적으로 사용되는 사전에 정의되어 있는 것과 같은 용어들은 관련 기술의 문맥 상 가지는 의미와 일치하는 의미를 가지는 것으로 해석되어야 하며, 본 출원에서 명백하게 정의하지 않는 한, 이상적이거나 과도하게 형식적인 의미로 해석되지 않는다. Unless defined otherwise, all terms used herein, including technical or scientific terms, have the same meaning as commonly understood by one of ordinary skill in the art. Terms such as those defined in the commonly used dictionaries should be construed as having meanings consistent with the meanings in the context of the related art and shall not be construed in ideal or excessively formal meanings unless expressly defined in this application. Do not.

이하, 첨부한 도면들을 참조하여, 본 발명의 바람직한 실시예를 보다 상세하게 설명하고자 한다. 도면상의 동일한 구성요소에 대해서는 동일한 참조부호를 사용하고 동일한 구성요소에 대해서 중복된 설명은 생략한다. Hereinafter, with reference to the accompanying drawings, it will be described in detail a preferred embodiment of the present invention. The same reference numerals are used for the same elements in the drawings, and duplicate descriptions of the same elements are omitted.

도 1은 본 발명의 일 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다.1 is a block diagram illustrating an apparatus for encoding an audio signal according to an embodiment of the present invention.

도 1을 참조하면, 오디오 신호의 부호화 장치는 밴드 분할부(100), 제1 MDCT 적용부(110), 주파수 선형 예측 수행부(120), 멀티-레졸류션 분석부(130), 양자화 부(140), PQ-SPSC 모듈(150), 문맥-기반 비트플레인 부호화부(160), 제2 MDCT 적용부(170), 대역폭 확장 부호화부(180), 및 다중화부(190)를 포함한다.Referring to FIG. 1, an apparatus for encoding an audio signal includes a band splitter 100, a first MDCT applier 110, a frequency linear prediction performer 120, a multi-resolution analyzer 130, and a quantization unit ( 140, a PQ-SPSC module 150, a context-based bitplane encoder 160, a second MDCT applier 170, a bandwidth extension encoder 180, and a multiplexer 190.

밴드 분할부(100, band splitting unit)는 입력 신호(IN)를 저주파수 밴드 신호(LB, Low Band signal) 및 고주파수 밴드 신호(HB, high band signal)로 분할한다. 여기서, 입력 신호(IN)는 아날로그의 음성 신호 또는 오디오 신호를 디지털 신호로 변조한 PCM(Pulse Code Modulation) 신호일 수 있다. 여기서, 저주파수 밴드 신호는 임의의 임계값 보다 낮은 주파수에 해당하는 신호이며, 고주파수 밴드 신호는 임의의 임계값 보다 높은 주파수에 해당하는 신호일 수 있다.The band splitting unit 100 divides the input signal IN into a low band signal (LB) and a high frequency band signal (HB). Here, the input signal IN may be a pulse code modulation (PCM) signal obtained by modulating an analog voice signal or an audio signal into a digital signal. Here, the low frequency band signal may be a signal corresponding to a frequency lower than an arbitrary threshold value, and the high frequency band signal may be a signal corresponding to a frequency higher than an arbitrary threshold value.

제1 MDCT 적용부(110)는 밴드 분할부(100)에서 분할된 저주파수 밴드 신호(LB)에 대하여 MDCT(Modified Discrete Cosine Transform)를 수행하여, 저주파수 밴드 신호(LB)를 시간 도메인에서 주파수 도메인으로 변환한다.The first MDCT applying unit 110 performs a modified discrete cosine transform (MDCT) on the low frequency band signal LB divided by the band dividing unit 100 to convert the low frequency band signal LB from the time domain to the frequency domain. To convert.

주파수 선형 예측 수행부(120, frequency linear prediction performing unit)은 제1 MDCT 적용부(110)에서 주파수 도메인으로 변환된 저주파수 밴드 신호에 대하여 주파수 선형 예측(frequency linear prediction)을 수행한다. 여기서, 주파수 선형 예측은 현재 주파수에서의 신호를 이전 주파수에서의 신호의 선형 조합(linear combination)으로 근사하는 방법이다. 구체적으로, 주파수 선형 예측 수행부(120)는 선형 예측이 수행된 신호와 현재 주파수에서의 신호 사이의 차이인 예측 오류(prediction error)가 최소가 되도록 선형 예측 필터의 계수(coefficient)를 계산하고, 계산된 계수에 따라 주파수 도메인으로 변환된 저주파수 밴드 신호를 선형 예측 필터링한다. 이 때, 주파수 선형 예측 수행부(120)는 선형 예측 필터의 계수에 대응되는 값에 대하여 벡터 인덱스로 표현하는 벡터 양자화를 수행하여 부호화의 효율을 향상시킬 수 있다.The frequency linear prediction performing unit 120 performs frequency linear prediction on the low frequency band signal converted into the frequency domain by the first MDCT applying unit 110. Here, frequency linear prediction is a method of approximating a signal at a current frequency to a linear combination of signals at a previous frequency. In detail, the frequency linear prediction performing unit 120 calculates a coefficient of the linear prediction filter such that a prediction error, which is a difference between the signal on which the linear prediction is performed and the signal at the current frequency, is minimized, Linear predictive filtering is performed on the low frequency band signal converted into the frequency domain according to the calculated coefficients. In this case, the frequency linear prediction performing unit 120 may improve the efficiency of encoding by performing vector quantization expressed by a vector index on values corresponding to coefficients of the linear prediction filter.

구체적으로, 주파수 선형 예측 수행부(120)는 제1 MDCT 적용부(110)에서 주파수 도메인으로 변환된 저주파수 밴드 신호가 음성(speech) 신호 또는 피치드(pitched) 신호인 경우에 주파수 선형 예측을 수행할 수 있다. 다시 말해, 주파수 선형 예측 수행부(120)는 입력되는 신호의 특성에 따라 주파수 선형 예측을 수행하여 부호화의 효율을 향상시킬 수 있다.In detail, the frequency linear prediction performing unit 120 performs frequency linear prediction when the low frequency band signal converted into the frequency domain by the first MDCT applying unit 110 is a speech signal or a pitched signal. can do. In other words, the frequency linear prediction performing unit 120 may improve the efficiency of encoding by performing frequency linear prediction according to the characteristics of the input signal.

멀티-레졸루션 분석부(130, multi-resolution analysis unit)는 제1 MDCT 적용부(110)에서 주파수 도메인으로 변환된 저주파수 밴드 신호 또는 주파수 선형 예측 수행부(120)에서 필터링된 신호를 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수를 다해상도(Multi-resolution)로 분석한다. 구체적으로, 멀티-레졸루션 분석부(130)는 오디오 스펙트럼의 변화의 격렬한 정도에 따라 주파수 선형 예측 수행부(120)에서 필터링된 신호를 두 가지 유형(예를 들어, 스테이빌(stabile) 유형과 숏(short) 유형)으로 나누어 분석할 수 있다. The multi-resolution analysis unit 130 receives the low-frequency band signal converted into the frequency domain from the first MDCT application unit 110 or the signal filtered by the frequency linear prediction execution unit 120 and instantly. Analyze audio spectral coefficients of varying signals in multi-resolution. Specifically, the multi-resolution analysis unit 130 may filter the signal filtered by the frequency linear prediction performing unit 120 according to the intensity of the change in the audio spectrum in two types (eg, a stabilization type and a short). (short) type can be analyzed.

구체적으로, 멀티-레졸루션 분석부(130)는 제1 MDCT 적용부(110)에서 주파수 도메인으로 변환된 저주파수 밴드 신호 또는 주파수 선형 예측 수행부(120)에서 필터링된 신호가 트렌젼트(transient) 신호인 경우에 멀티-레졸루션으로 분석할 수 있다. 다시 말해, 멀티-레졸루션 분석부(130)는 입력되는 신호의 특성에 따라 멀티-레졸루션 분석을 수행하여 부호화의 효율을 향상시킬 수 있다.Specifically, the multi-resolution analysis unit 130 is a low-frequency band signal transformed into the frequency domain in the first MDCT application unit 110 or a signal filtered by the frequency linear prediction performing unit 120 is a transient signal. In the case of multi-resolution. In other words, the multi-resolution analyzer 130 may improve the efficiency of encoding by performing multi-resolution analysis according to the characteristics of the input signal.

양자화부(140)는 주파수 선형 예측 수행부(120) 또는 멀티-레졸류션 분석 부(130)에서 출력된 결과를 양자화한다.The quantization unit 140 quantizes the result output from the frequency linear prediction execution unit 120 or the multi-resolution analysis unit 130.

PQ-SPSC 모듈(150, Post-quantization Square Polar Stereo Coding module)은 양자화부(140)에서 출력된 결과에 대하여 주파수 스펙트럼의 side-polar coordination stereo encoding을 수행한다.The PQ-SPSC module 150 performs a side-polar coordination stereo encoding of a frequency spectrum on the result output from the quantization unit 140.

문맥-기반 비트플레인 부호화부(160, Context-dependent Bit plane Coding unit)는 문맥을 기반으로 하여 PQ-SPSC 모듈(150)의 출력을 비트플레인으로 부호화한다. 여기서, 문맥-기반 비트플레인 부호화부(160)는 허프만 코딩(Huffman Coding)을 이용하여 PQ-SPSC 모듈(150)에서 출력된 결과를 비트플레인으로 부호화할 수 있다.The context-based bitplane coding unit 160 encodes the output of the PQ-SPSC module 150 into a bitplane based on a context. Here, the context-based bitplane encoder 160 may encode a result output from the PQ-SPSC module 150 into a bitplane by using Huffman coding.

제2 MDCT 적용부(170)는 밴드 분할부(100)에서 분할된 고주파수 밴드 신호(HB)에 대하여 MDCT를 수행하여, 고주파수 밴드 신호(HB)를 시간 도메인에서 주파수 도메인으로 변환한다.The second MDCT applying unit 170 performs MDCT on the high frequency band signal HB divided by the band dividing unit 100 to convert the high frequency band signal HB from the time domain to the frequency domain.

대역폭 확장 부호화부(180)는 제2 MDCT 적용부(170)에서 주파수 영역으로 변환된 고주파수 밴드 신호의 성분을 전달하기 위하여, 제1 MDCT 적용부(110)에서 주파수 영역으로 변환된 저주파수 밴드 신호를 이용하여 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화한다. 여기서, 대역폭 확장 정보는 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 다양한 정보를 포함할 수 있음은 본 실시예가 속하는 기술분야에서 통상의 지식을 가진 자는 이해할 수 있을 것이다. 보다 상세하게는, 대역폭 확장 부호화부(180)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 저주파 수 밴드 신호를 이용하여 대역폭 확장 정보를 생성할 수 있다. The bandwidth extension encoder 180 may convert the low frequency band signal converted into the frequency domain by the first MDCT applier 110 to transmit components of the high frequency band signal converted into the frequency domain by the second MDCT applier 170. Bandwidth extension information representing the characteristics of the high frequency band signal is generated and encoded. Here, it will be understood by those skilled in the art that the bandwidth extension information may include various information such as energy level or envelope for the high frequency band signal. More specifically, the bandwidth extension encoder 180 may generate bandwidth extension information by using a low frequency band signal based on a property that a high correlation exists between a high frequency and a low frequency band of an audio signal.

다중화부(190)는 주파수 선형 예측 수행부(120), PQ-SPSC 모듈(150), 문맥-기반 비트플레인 부호화부(160) 및 대역폭 확장 부호화부(180)에서 부호화를 수행한 결과를 다중화하여 비트 스트림(bit stream)을 생성하고 출력단자 OUT을 통해 출력한다.The multiplexer 190 multiplexes the results of the encoding performed by the frequency linear prediction performer 120, the PQ-SPSC module 150, the context-based bitplane encoder 160, and the bandwidth extension encoder 180. Generates a bit stream and outputs it through the output terminal OUT.

도 2는 본 발명의 다른 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다.2 is a block diagram illustrating an apparatus for encoding an audio signal according to another embodiment of the present invention.

도 2를 참조하면, 오디오 신호의 부호화 장치는 밴드 분할부(200), MDCT 적용부(210), 주파수 선형 예측 수행부(220), 멀티-레졸류션 분석부(230), 양자화부(240), PQ-SPSC 모듈(250), 문맥-기반 비트플레인 부호화부(260), 저주파수 밴드 변환부(270), 고주파수 밴드 변환부(275), 대역폭 확장 부호화부(280) 및 다중화부(290)를 포함한다.Referring to FIG. 2, the apparatus for encoding an audio signal includes a band splitter 200, an MDCT applier 210, a frequency linear prediction performer 220, a multi-resolution analyzer 230, and a quantizer 240. The PQ-SPSC module 250, the context-based bitplane encoder 260, the low frequency band converter 270, the high frequency band converter 275, the bandwidth extension encoder 280, and the multiplexer 290. Include.

밴드 분할부(200)는 입력 신호(IN)를 저주파수 밴드 신호(LB) 및 고주파수 밴드 신호(HB)로 분할한다. 여기서, 입력 신호(IN)는 아날로그의 음성 신호 또는 오디오 신호를 디지털 신호로 변조한 PCM 신호일 수 있다. 여기서, 저주파수 밴드 신호는 임의의 임계값 보다 낮은 주파수에 해당하는 신호이며, 고주파수 밴드 신호는 임의의 임계값 보다 높은 주파수에 해당하는 신호일 수 있다.The band dividing unit 200 divides the input signal IN into a low frequency band signal LB and a high frequency band signal HB. The input signal IN may be a PCM signal obtained by modulating an analog voice signal or an audio signal into a digital signal. Here, the low frequency band signal may be a signal corresponding to a frequency lower than an arbitrary threshold value, and the high frequency band signal may be a signal corresponding to a frequency higher than an arbitrary threshold value.

MDCT 적용부(210)는 밴드 분할부(200)에서 분할된 저주파수 밴드 신호(LB)에 대하여 MDCT(Modified Discrete Cosine Transform)를 수행하여, 저주파수 밴드 신호(LB)를 시간 도메인에서 주파수 도메인으로 변환한다.The MDCT application unit 210 performs a modified discrete cosine transform (MDCT) on the low frequency band signal LB divided by the band divider 200 to convert the low frequency band signal LB from the time domain to the frequency domain. .

주파수 선형 예측 수행부(220)는 MDCT 적용부(210)에서 주파수 도메인으로 변환된 저주파수 밴드 신호에 대하여 주파수 선형 예측을 수행한다. 여기서, 주파수 선형 예측은 현재 주파수에서의 신호를 이전 주파수에서의 신호의 선형 조합으로 근사하는 방법이다. 구체적으로, 주파수 선형 예측 수행부(220)는 선형 예측이 수행된 신호와 현재 주파수에서의 신호 사이의 차이인 예측 오류가 최소가 되도록 선형 예측 필터의 계수를 계산하고, 계산된 계수에 따라 주파수 도메인으로 변환된 저주파수 밴드 신호를 선형 예측 필터링한다. 이 때, 주파수 선형 예측 수행부(520)는 선형 예측 필터의 계수에 대응되는 값에 대하여 벡터 인덱스로 표현하는 벡터 양자화를 수행하여 부호화의 효율을 향상시킬 수 있다.The frequency linear prediction performing unit 220 performs frequency linear prediction on the low frequency band signal converted into the frequency domain by the MDCT application unit 210. Here, frequency linear prediction is a method of approximating a signal at a current frequency to a linear combination of signals at a previous frequency. Specifically, the frequency linear prediction performing unit 220 calculates the coefficients of the linear prediction filter such that the prediction error, which is the difference between the signal on which the linear prediction is performed and the signal at the current frequency, is minimal, and according to the calculated coefficients Linear predictive filtering is performed on the low-frequency band signal transformed by. In this case, the frequency linear prediction performing unit 520 may improve the efficiency of encoding by performing vector quantization expressed by a vector index on a value corresponding to the coefficient of the linear prediction filter.

구체적으로, 주파수 선형 예측 수행부(220)는 MDCT 적용부(210)에서 주파수 도메인으로 변환된 저주파수 밴드 신호가 음성(speech) 신호 또는 피치드(pitched) 신호인 경우에 주파수 선형 예측을 수행할 수 있다. 다시 말해, 주파수 선형 예측 수행부(220)는 입력되는 신호의 특성에 따라 주파수 선형 예측을 수행하여 부호화의 효율을 향상시킬 수 있다.Specifically, the frequency linear prediction performing unit 220 may perform frequency linear prediction when the low frequency band signal converted into the frequency domain by the MDCT application unit 210 is a speech signal or a pitched signal. have. In other words, the frequency linear prediction performing unit 220 may improve the efficiency of encoding by performing frequency linear prediction according to the characteristics of the input signal.

멀티-레졸루션 분석부(230)는 주파수 선형 예측 수행부(220)에서 필터링된 신호를 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수를 다해상도(multi-resolution)로 분석한다. 구체적으로, 멀티-레졸루션 분석부(230)는 오디오 스펙트럼의 변화의 격렬한 정도에 따라 주파수 선형 예측 수행부(220)에서 필터링된 오디오 스펙트럼을 두 가지 유형(예를 들어, 스테이빌 유형과 숏 유형)으로 나누어 분석할 수 있다.The multi-resolution analyzer 230 receives the filtered signal from the frequency linear prediction performer 220 and analyzes the audio spectral coefficients of the instantaneously changing signal with multi-resolution. In detail, the multi-resolution analyzer 230 may filter the audio spectrum filtered by the frequency linear prediction performer 220 according to the intensity of the change in the audio spectrum in two types (for example, a stabil type and a short type). It can be analyzed by dividing by.

구체적으로, 멀티-레졸루션 분석부(230)는 MDCT 적용부(210)에서 주파수 도메인으로 변환된 저주파수 밴드 신호 또는 주파수 선형 예측 수행부(220)에서 필터링된 신호가 트렌젼트(transient) 신호인 경우에 멀티-레졸루션으로 분석할 수 있다. 다시 말해, 멀티-레졸루션 분석부(230)는 입력되는 신호의 특성에 따라 멀티-레졸루션 분석을 수행하여 부호화의 효율을 향상시킬 수 있다.Specifically, the multi-resolution analysis unit 230 is a case where the low-frequency band signal converted into the frequency domain in the MDCT application unit 210 or the signal filtered by the frequency linear prediction performing unit 220 is a transient signal. It can be analyzed by multi-resolution. In other words, the multi-resolution analyzer 230 may improve the efficiency of encoding by performing multi-resolution analysis according to the characteristics of the input signal.

양자화부(240)는 주파수 선형 예측 수행부(220)에서 필터링된 신호 또는 멀티-레졸루션 분석부(230)에서 출력된 결과를 양자화한다.The quantization unit 240 quantizes the signal filtered by the frequency linear prediction execution unit 220 or the result output from the multi-resolution analyzer 230.

PQ-SPSC 모듈(250)은 양자화부(240)에서 출력된 결과에 대하여 주파수 스펙트럼의 side-polar coordination stereo encoding을 수행한다.The PQ-SPSC module 250 performs side-polar coordination stereo encoding of the frequency spectrum on the result output from the quantization unit 240.

문맥-기반 비트플레인 부호화부(260)는 문맥을 기반으로 하여 PQ-SPSC 모듈(250)의 출력을 비트플레인으로 부호화한다. 여기서, 문맥-기반 비트플레인 부호화부(260)는 허프만 코딩(Huffman Coding)을 이용하여 PQ-SPSC 모듈(250)에서 양자화된 결과를 비트플레인으로 부호화할 수 있다.The context-based bitplane encoder 260 encodes the output of the PQ-SPSC module 250 into a bitplane based on the context. Here, the context-based bitplane encoder 260 may encode the quantized result in the PQ-SPSC module 250 into a bitplane using Huffman Coding.

저주파수 밴드 변환부(270)는 밴드 분할부(200)에서 분할된 저주파수 밴드 신호(LB)를 MDCT 이외의 변환 기법을 이용하여 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다. 예를 들어, 저주파수 밴드 변환부(270)는 MDST(Modified Discrete Sine Transform), FFT(Fast Fourier Transform), 및 QMF(Quadrature Mirror Filters) 등을 이용하여 저주파수 밴드 신호(LB)를 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환할 수 있다.The low frequency band converter 270 converts the low frequency band signal LB divided by the band divider 200 from a time domain to a frequency domain or a time / frequency domain using a conversion technique other than MDCT. For example, the low frequency band converter 270 uses the Modified Discrete Sine Transform (MDST), Fast Fourier Transform (FFT), Quadrature Mirror Filters (QMF), and the like to generate a low frequency band signal LB in the time domain Or convert to the time / frequency domain.

고주파수 밴드 변환부(275)는 밴드 분할부(200)에서 분할된 고주파수 밴드 신호(HB)를 MDCT 이외의 변환 기법을 이용하여 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다. 여기서, 고주파수 밴드 변환부(275)는 저주파수 밴드 변환부(270)과 동일한 변환 기법을 이용한다. 예를 들어, 고주파수 밴드 변환부(275)는 MDST, FFT, 및 QMF 등을 이용하여 고주파수 밴드 신호(HB)를 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환할 수 있다.The high frequency band converter 275 converts the high frequency band signal HB split by the band divider 200 from the time domain to the frequency domain or the time / frequency domain using a conversion technique other than MDCT. Here, the high frequency band converter 275 uses the same conversion technique as the low frequency band converter 270. For example, the high frequency band converter 275 may convert the high frequency band signal HB from the time domain to the frequency domain or the time / frequency domain using MDST, FFT, and QMF.

대역폭 확장 부호화부(280)는 저주파수 밴드 변환부(270)에서 주파수 도메인으로 변환된 저주파수 밴드 신호를 이용하여, 고주파수 밴드 변환부(275)에서 주파수 도메인으로 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화한다. 여기서, 대역폭 확장 정보는 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 다양한 정보를 포함할 수 있음은 본 실시예가 속하는 기술분야에서 통상의 지식을 가진 자는 이해할 수 있을 것이다. 보다 상세하게는, 대역폭 확장 부호화부(280)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 저주파수 밴드 신호를 이용하여 대역폭 확장 정보를 생성할 수 있다.The bandwidth extension encoder 280 uses the low frequency band signal converted into the frequency domain by the low frequency band converter 270 to increase the bandwidth of the high frequency band signal converted into the frequency domain by the high frequency band converter 275. Generate and encode information. Here, it will be understood by those skilled in the art that the bandwidth extension information may include various information such as energy level or envelope for the high frequency band signal. More specifically, the bandwidth extension encoder 280 may generate bandwidth extension information by using the low frequency band signal based on the property that there is a high correlation between the high frequency and the low frequency band of the audio signal.

다중화부(290)는 주파수 선형 예측 수행부(220), PQ-SPSC 모듈(250), 문맥-기반 비트플레인 부호화부(260), 및 대역폭 확장 부호화부(280)에서 부호화를 수행한 결과를 다중화하여 비트 스트림(bit stream)을 생성하고 출력단자 OUT을 통해 출력한다.The multiplexer 290 multiplexes the result of the encoding performed by the frequency linear prediction performer 220, the PQ-SPSC module 250, the context-based bitplane encoder 260, and the bandwidth extension encoder 280. To generate a bit stream and output it through the output terminal OUT.

도 3은 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다.3 is a block diagram showing an apparatus for encoding an audio signal according to another embodiment of the present invention.

도 3을 참조하면, 오디오 신호의 부호화 장치는 MDCT 적용부(300), 밴드 분할부(310), 주파수 선형 예측 수행부(320), 멀티-레졸루션 분석부(330), 양자화부(340), PQ-SPSC 모듈(350), 문맥-기반 비트플레인 부호화부(360), 대역폭 확장 부호화부(370) 및 다중화부(380)를 포함한다.Referring to FIG. 3, the apparatus for encoding an audio signal includes an MDCT application unit 300, a band divider 310, a frequency linear prediction performer 320, a multi-resolution analyzer 330, a quantizer 340, A PQ-SPSC module 350, a context-based bitplane encoder 360, a bandwidth extension encoder 370, and a multiplexer 380 are included.

MDCT 적용부(300)는 입력 신호(IN)에 대하여 MDCT를 수행하여, 입력 신호(IN)를 시간 도메인에서 주파수 도메인으로 변환한다. 여기서, 입력 신호(IN)는 아날로그의 음성 신호 또는 오디오 신호를 디지털 신호로 변조한 PCM 신호일 수 있다.The MDCT application unit 300 performs an MDCT on the input signal IN to convert the input signal IN from the time domain to the frequency domain. The input signal IN may be a PCM signal obtained by modulating an analog voice signal or an audio signal into a digital signal.

밴드 분할부(310)는 MDCT 적용부(300)에서 주파수 도메인으로 변환된 신호를 저주파수 밴드 신호(LB)와 고주파수 밴드 신호(HB)로 분할한다.The band dividing unit 310 divides the signal converted into the frequency domain by the MDCT applying unit 300 into a low frequency band signal LB and a high frequency band signal HB.

주파수 선형 예측 수행부(320)는 밴드 분할부(310)에서 분할된 저주파수 밴드 신호(LB)에 대하여 주파수 선형 예측을 수행한다. 여기서, 주파수 선형 예측은 현재 주파수에서의 신호를 이전 주파수에서의 신호의 선형 조합으로 근사하는 방법이다. 구체적으로, 주파수 선형 예측 수행부(320)는 선형 예측이 수행된 신호와 현재 주파수에서의 신호 사이의 차이인 예측 오류가 최소가 되도록 선형 예측 필터의 계수를 계산하고, 계산된 계수에 따라 주파수 도메인으로 변환된 저주파수 밴드 신호를 선형 예측 필터링한다. 이 때, 주파수 선형 예측 수행부(320)는 선형 예측 필터의 계수에 대응되는 값에 대하여 벡터 인덱스로 표현하는 벡터 양자화를 수행하여 부호화의 효율을 향상시킬 수 있다.The frequency linear prediction performer 320 performs frequency linear prediction on the low frequency band signal LB divided by the band divider 310. Here, frequency linear prediction is a method of approximating a signal at a current frequency to a linear combination of signals at a previous frequency. Specifically, the frequency linear prediction performing unit 320 calculates coefficients of the linear prediction filter such that the prediction error, which is the difference between the signal on which the linear prediction is performed and the signal at the current frequency, is minimized, and according to the calculated coefficients Linear predictive filtering is performed on the low-frequency band signal transformed by. In this case, the frequency linear prediction performing unit 320 may improve the efficiency of encoding by performing vector quantization expressed by a vector index on values corresponding to coefficients of the linear prediction filter.

구체적으로, 주파수 선형 예측 수행부(320)는 밴드 분할부(310)에서 분할된 저주파수 밴드 신호(LB)가 음성(speech) 신호 또는 피치드(pitched) 신호인 경우에 주파수 선형 예측을 수행할 수 있다. 다시 말해, 주파수 선형 예측 수행부(320)는 입력되는 신호의 특성에 따라 주파수 선형 예측을 수행하여 부호화의 효율을 향상시킬 수 있다.In detail, the frequency linear prediction performing unit 320 may perform frequency linear prediction when the low frequency band signal LB divided by the band dividing unit 310 is a speech signal or a pitched signal. have. In other words, the frequency linear prediction performing unit 320 may improve the efficiency of encoding by performing frequency linear prediction according to the characteristics of the input signal.

멀티-레졸루션 분석부(330)는 주파수 선형 예측 수행부(320)에서 필터링된 신호를 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수를 다해상도로 분석한다. 구체적으로, 멀티-레졸루션 분석부(330)는 오디오 스펙트럼의 변화의 격렬한 정도에 따라 주파수 선형 예측 수행부(320)에서 필터링된 오디오 스펙트럼을 두 가지 유형(예를 들어, 스테이빌 유형과 숏 유형)으로 나누어 분석할 수 있다. The multi-resolution analyzer 330 receives the filtered signal from the frequency linear prediction performer 320 and analyzes the audio spectral coefficients of the signal that is instantaneously changed in multi resolution. In detail, the multi-resolution analyzer 330 may filter the audio spectrum filtered by the frequency linear prediction performer 320 according to the intensity of the change in the audio spectrum in two types (for example, a stabil type and a short type). It can be analyzed by dividing by.

구체적으로, 멀티-레졸루션 분석부(330)는 밴드 분할부(310)에서 분할된 저주파수 밴드 신호(LB) 또는 주파수 선형 예측 수행부(320)에서 필터링된 신호가 트렌젼트(transient) 신호인 경우에 멀티-레졸루션으로 분석할 수 있다. 다시 말해, 멀티-레졸루션 분석부(330)는 입력되는 신호의 특성에 따라 멀티-레졸루션 분석을 수행하여 부호화의 효율을 향상시킬 수 있다.In detail, the multi-resolution analyzer 330 is a low frequency band signal LB divided by the band divider 310 or a signal filtered by the frequency linear prediction performer 320 when the signal is a transient signal. It can be analyzed by multi-resolution. In other words, the multi-resolution analyzer 330 may improve the encoding efficiency by performing multi-resolution analysis according to the characteristics of the input signal.

양자화부(340)는 주파수 선형 예측 수행부(320)에서 필터링된 신호 또는 멀티-레졸루션 분석부(330)에서 출력된 결과를 양자화한다.The quantizer 340 quantizes the signal filtered by the frequency linear prediction performer 320 or the result output from the multi-resolution analyzer 330.

PQ-SPSC 모듈(350)은 양자화부(340)에서 출력된 결과에 대하여 주파수 스펙트럼의 side-polar coordination stereo encoding을 수행한다.The PQ-SPSC module 350 performs side-polar coordination stereo encoding of the frequency spectrum on the result output from the quantization unit 340.

문맥-기반 비트플레인 부호화부(360)는 문맥을 기반으로 하여 PQ-SPSC 모듈(350)에서 출력된 결과를 비트플레인으로 부호화한다. 여기서, 문맥-기반 비트플 레인 부호화부(360)는 허프먼 코딩(Huffman coding)을 이용하여 PQ-SPSC 모듈(350)에서 출력된 결과를 비트플레인으로 부호화할 수 있다.The context-based bitplane encoder 360 encodes a result output from the PQ-SPSC module 350 into a bitplane based on the context. Here, the context-based bitplane encoder 360 may encode a result output from the PQ-SPSC module 350 into a bitplane by using Huffman coding.

대역폭 확장 부호화부(370)는 밴드 분할부(310)에서 분할된 저주파수 밴드 신호(LB)를 이용하여, 밴드 분할부(310)에서 분할된 고주파수 밴드 신호(HB)의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화한다. 여기서, 대역폭 확장 정보는 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 다양한 정보를 포함할 수 있음은 본 실시예가 속하는 기술분야에서 통상의 지식을 가진 자는 이해할 수 있을 것이다. 보다 상세하게는, 대역폭 확장 부호화부(370)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 저주파수 밴드 신호를 이용하여 대역폭 확장 정보를 생성할 수 있다.The bandwidth extension encoder 370 uses the low frequency band signal LB divided by the band divider 310 to obtain bandwidth extension information indicating the characteristics of the high frequency band signal HB split by the band divider 310. Generate and encode. Here, it will be understood by those skilled in the art that the bandwidth extension information may include various information such as energy level or envelope for the high frequency band signal. More specifically, the bandwidth extension encoder 370 may generate bandwidth extension information by using the low frequency band signal based on the property that there is a high correlation between the high frequency and the low frequency band of the audio signal.

다중화부(380)는 주파수 선형 예측 수행부(320), PQ-SPSC 모듈(350), 문맥-기반 비트플레인 부호화부(360) 및 대역폭 확장 부호화부(370)에서 부호화를 수행한 결과를 다중화하여 비트 스트림(bit stream)을 생성하고 출력단자 OUT을 통해 출력한다.The multiplexer 380 multiplexes the results of the encoding performed by the frequency linear prediction performer 320, the PQ-SPSC module 350, the context-based bitplane encoder 360, and the bandwidth extension encoder 370. Generates a bit stream and outputs it through the output terminal OUT.

도 4는 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다. 4 is a block diagram illustrating an apparatus for encoding an audio signal according to another embodiment of the present invention.

도 4를 참조하면, 오디오 신호의 부호화 장치는 밴드 분할부(400), 제1 MDCT 적용부(410), 주파수 선형 예측 수행부(420), 멀티-레졸루션 분석부(430), 양자화부(440), 문맥-기반 비트플레인 부호화부(450), 제2 MDCT 적용부(460), 대역폭 확장 부호화부(470) 및 다중화부(480)를 포함한다. Referring to FIG. 4, the apparatus for encoding an audio signal includes a band splitter 400, a first MDCT applier 410, a frequency linear prediction performer 420, a multi-resolution analyzer 430, and a quantizer 440. ), A context-based bitplane encoder 450, a second MDCT applier 460, a bandwidth extension encoder 470, and a multiplexer 480.

밴드 분할부(400)는 입력 신호(IN)를 저주파수 밴드 신호(LB) 및 고주파수 밴드 신호(HB)로 분할한다. 여기서, 입력 신호(IN)는 아날로그의 음성 신호 또는 오디오 신호를 디지털 신호로 변조한 PCM 신호일 수 있다. 여기서, 저주파수 밴드 신호는 임의의 임계값 보다 낮은 주파수에 해당하는 신호이며, 고주파수 밴드 신호는 임의의 임계값 보다 높은 주파수에 해당하는 신호일 수 있다.The band divider 400 divides the input signal IN into a low frequency band signal LB and a high frequency band signal HB. The input signal IN may be a PCM signal obtained by modulating an analog voice signal or an audio signal into a digital signal. Here, the low frequency band signal may be a signal corresponding to a frequency lower than an arbitrary threshold value, and the high frequency band signal may be a signal corresponding to a frequency higher than an arbitrary threshold value.

제1 MDCT 적용부(410)는 밴드 분할부(400)에서 분할된 저주파수 밴드 신호(LB)에 대하여 MDCT를 수행하여, 저주파수 밴드 신호(LB)를 시간 도메인에서 주파수 도메인으로 변환한다. 여기서, 시간 도메인은 시간의 경과에 따라 입력 신호(IN)의 크기(예를 들어, 에너지 또는 음압 등)를 나타내는 도메인이고, 주파수 도메인은 주파수의 변화에 따라 입력 신호(IN)의 크기를 나타내는 도메인이다.The first MDCT applying unit 410 performs MDCT on the low frequency band signal LB divided by the band dividing unit 400 to convert the low frequency band signal LB from the time domain to the frequency domain. Here, the time domain is a domain representing the magnitude of the input signal IN (for example, energy or sound pressure) over time, and the frequency domain is a domain representing the magnitude of the input signal IN according to the change of frequency. to be.

주파수 선형 예측 수행부(420)는 제1 MDCT 적용부(410)에서 주파수 도메인으로 변환된 저주파수 밴드 신호에 대하여 주파수 선형 예측을 수행한다. 여기서, 주파수 선형 예측은 현재 주파수에서의 신호를 이전 주파수에서의 신호의 선형 조합(linear combination)으로 근사하는 방법이다. 구체적으로, 주파수 선형 예측 수행부(420)는 선형 예측이 수행된 신호와 현재 주파수에서의 신호 사이의 차이인 예측 오류(prediction error)가 최소가 되도록 선형 예측 필터의 계수(coefficient)를 계산하고, 계산된 계수에 따라 주파수 도메인으로 변환된 저주파수 밴드 신호를 선형 예측 필터링한다. 이 때, 주파수 선형 예측 수행부(420)는 선형 예측 필터의 계수에 대응되는 값에 대하여 벡터 인덱스로 표현하는 벡터 양자화를 수행하여 부호화의 효율을 향상시킬 수 있다.The frequency linear prediction performing unit 420 performs frequency linear prediction on the low frequency band signal converted into the frequency domain by the first MDCT applying unit 410. Here, frequency linear prediction is a method of approximating a signal at a current frequency to a linear combination of signals at a previous frequency. Specifically, the frequency linear prediction performing unit 420 calculates a coefficient of the linear prediction filter such that a prediction error, which is a difference between the signal on which the linear prediction is performed and the signal at the current frequency, is minimized, Linear predictive filtering is performed on the low frequency band signal converted into the frequency domain according to the calculated coefficients. In this case, the frequency linear prediction performing unit 420 may improve the efficiency of encoding by performing vector quantization expressed by a vector index on a value corresponding to the coefficient of the linear prediction filter.

구체적으로, 주파수 선형 예측 수행부(420)는 제1 MDCT 적용부(410)에서 주파수 도메인으로 변환된 저주파수 밴드 신호가 음성(speech) 신호 또는 피치드(pitched) 신호인 경우에 주파수 선형 예측을 수행할 수 있다. 다시 말해, 주파수 선형 예측 수행부(420)는 입력되는 신호의 특성에 따라 주파수 선형 예측을 수행하여 부호화의 효율을 향상시킬 수 있다.In detail, the frequency linear prediction performing unit 420 performs frequency linear prediction when the low frequency band signal converted into the frequency domain by the first MDCT applying unit 410 is a speech signal or a pitched signal. can do. In other words, the frequency linear prediction performing unit 420 may improve the efficiency of encoding by performing frequency linear prediction according to the characteristics of the input signal.

멀티-레졸루션 분석부(430)는 주파수 선형 예측 수행부(420)에서 필터링된 신호를 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수를 다해상도(multi-resolution)로 분석한다. 구체적으로, 멀티-레졸루션 분석부(430)는 오디오 스펙트럼의 변화의 격렬한 정도에 따라 주파수 선형 예측 수행부(420)에서 필터링된 신호를 두 가지 유형(예를 들어, 스테이빌(stabile) 유형과 숏(short) 유형)으로 나누어 분석할 수 있다. The multi-resolution analyzer 430 receives the filtered signal from the frequency linear prediction performer 420 and analyzes the audio spectral coefficients of the instantaneously changing signal in multi-resolution. In detail, the multi-resolution analyzer 430 may filter the signal filtered by the frequency linear prediction performer 420 according to the intensity of the change in the audio spectrum in two types (eg, a stabilization type and a short). (short) type can be analyzed.

구체적으로, 멀티-레졸루션 분석부(430)는 제1 MDCT 적용부(410)에서 주파수 도메인으로 변환된 저주파수 밴드 신호 또는 주파수 선형 예측 수행부(420)에서 필터링된 신호가 트렌젼트(transient) 신호인 경우에 멀티-레졸루션으로 분석할 수 있다. 다시 말해, 멀티-레졸루션 분석부(430)는 입력되는 신호의 특성에 따라 멀티-레졸루션 분석을 수행하여 부호화의 효율을 향상시킬 수 있다.Specifically, the multi-resolution analysis unit 430 is a low-frequency band signal transformed into a frequency domain in the first MDCT application unit 410 or a signal filtered by the frequency linear prediction performing unit 420 is a transient signal. In the case of multi-resolution. In other words, the multi-resolution analyzer 430 may improve encoding efficiency by performing multi-resolution analysis according to the characteristics of the input signal.

양자화부(440)는 주파수 선형 예측 수행부(420)에서 필터링된 신호 또는 멀티-레졸루션 분석부(430)에서 출력된 결과를 양자화한다.The quantizer 440 quantizes the signal filtered by the frequency linear prediction performer 420 or the result output from the multi-resolution analyzer 430.

문맥-기반 비트플레인 부호화부(450)는 문맥을 기반으로 하여 양자화부(440)에서 양자화된 결과를 비트플레인으로 부호화한다. 구체적으로, 문맥-기반 비트 플 레인 부호화부(450)는 허프만 코딩(Huffman Coding)을 이용하여 양자화부(440)에서 양자화된 결과를 비트플레인으로 부호화할 수 있다.The context-based bitplane encoder 450 encodes the result of the quantization in the quantizer 440 into a bitplane based on the context. In detail, the context-based bit plane encoder 450 may encode the result quantized by the quantizer 440 into a bitplane using Huffman coding.

이와 같이, 주파수 선형 예측 수행부(420), 멀티-레졸루션 분석부(430), 양자화부(440) 및 문맥-기반 비트플레인 부호화부(450)는 MDCT 적용부(410)에서 출력된 변환된 저주파수 밴드 신호에 대해서 부호화를 수행하므로, 저주파수 밴드 부호화부라고 할 수 있다.As such, the frequency linear prediction execution unit 420, the multi-resolution analysis unit 430, the quantization unit 440, and the context-based bitplane encoder 450 are converted low frequencies output from the MDCT application unit 410. Since the encoding is performed on the band signal, it may be referred to as a low frequency band encoder.

제2 MDCT 적용부(460)는 밴드 분할부(400)에서 분할된 고주파수 밴드 신호(HB)에 대하여 MDCT를 수행하여 고주파수 밴드 신호(HB)를 시간 도메인에서 주파수 도메인으로 변환한다.The second MDCT application unit 460 converts the high frequency band signal HB from the time domain to the frequency domain by performing MDCT on the high frequency band signal HB divided by the band divider 400.

대역폭 확장 부호화부(470)는 제1 MDCT 적용부(410)에서 주파수 도메인으로 변환된 저주파수 밴드 신호를 이용하여, 제2 MDCT 적용부(460)에서 주파수 도메인으로 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화한다. 여기서, 대역폭 확장 정보는 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 다양한 정보를 포함할 수 있음은 본 실시예가 속하는 기술분야에서 통상의 지식을 가진 자는 이해할 수 있을 것이다. 보다 상세하게는, 대역폭 확장 부호화부(470)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 저주파수 밴드 신호를 이용하여 대역폭 확장 정보를 생성할 수 있다.The bandwidth extension encoder 470 indicates the characteristics of the high frequency band signal converted into the frequency domain by the second MDCT applier 460 using the low frequency band signal converted into the frequency domain by the first MDCT applier 410. Generate and encode bandwidth extension information. Here, it will be understood by those skilled in the art that the bandwidth extension information may include various information such as energy level or envelope for the high frequency band signal. More specifically, the bandwidth extension encoder 470 may generate bandwidth extension information by using the low frequency band signal based on the property that there is a high correlation between the high frequency and the low frequency band of the audio signal.

다중화부(480)는 주파수 선형 예측 수행부(420), 문맥-기반 비트플레인 부호화부(450) 및 대역폭 확장 부호화부(470)에서 부호화를 수행한 결과를 다중화하여 비트 스트림을 생성하고 출력 단자 OUT을 통해 출력한다.The multiplexer 480 multiplexes the result of the encoding performed by the frequency linear prediction performer 420, the context-based bitplane encoder 450, and the bandwidth extension encoder 470 to generate a bit stream, and output terminal OUT. Output through

도 5는 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다. 5 is a block diagram illustrating an apparatus for encoding an audio signal according to another embodiment of the present invention.

도 5를 참조하면, 오디오 신호의 부호화 장치는 밴드 분할부(500), MDCT 적용부(510), 주파수 선형 예측 수행부(520), 멀티-레졸루션 분석부(530), 양자화부(540), 문맥-기반 비트플레인 부호화부(550), 저주파수 밴드 변환부(560), 고주파수 밴드 변환부(570), 대역폭 확장 부호화부(580) 및 다중화부(590)를 포함한다.Referring to FIG. 5, the apparatus for encoding an audio signal includes a band splitter 500, an MDCT applier 510, a frequency linear prediction performer 520, a multi-resolution analyzer 530, a quantizer 540, A context-based bitplane encoder 550, a low frequency band converter 560, a high frequency band converter 570, a bandwidth extension encoder 580, and a multiplexer 590 are included.

밴드 분할부(500)는 입력 신호(IN)를 저주파수 밴드 신호(LB) 및 고주파수 밴드 신호(HB)로 분할한다. 여기서, 입력 신호(IN)는 아날로그의 음성 신호 또는 오디오 신호를 디지털 신호로 변조한 PCM 신호일 수 있다. 여기서, 저주파수 밴드 신호는 임의의 임계값 보다 낮은 주파수에 해당하는 신호이며, 고주파수 밴드 신호는 임의의 임계값 보다 높은 주파수에 해당하는 신호일 수 있다.The band divider 500 divides the input signal IN into a low frequency band signal LB and a high frequency band signal HB. The input signal IN may be a PCM signal obtained by modulating an analog voice signal or an audio signal into a digital signal. Here, the low frequency band signal may be a signal corresponding to a frequency lower than an arbitrary threshold value, and the high frequency band signal may be a signal corresponding to a frequency higher than an arbitrary threshold value.

MDCT 적용부(510)는 밴드 분할부(400)에서 분할된 저주파수 밴드 신호(LB)에 대하여 MDCT를 수행하여, 저주파수 밴드 신호(LB)를 시간 도메인에서 주파수 도메인으로 변환한다.The MDCT application unit 510 performs MDCT on the low frequency band signal LB divided by the band divider 400 to convert the low frequency band signal LB from the time domain to the frequency domain.

주파수 선형 예측 수행부(520)는 MDCT 적용부(510)에서 주파수 도메인으로 변환된 저주파수 밴드 신호에 대하여 주파수 선형 예측을 수행한다. 여기서, 주파수 선형 예측은 현재 주파수에서의 신호를 이전 주파수에서의 신호의 선형 조합으로 근사하는 방법이다. 구체적으로, 주파수 선형 예측 수행부(520)는 선형 예측이 수행된 신호와 현재 주파수에서의 신호 사이의 차이인 예측 오류가 최소가 되도록 선형 예측 필터의 계수를 계산하고, 계산된 계수에 따라 주파수 도메인으로 변환된 저주파수 밴드 신호를 선형 예측 필터링한다. 이 때, 주파수 선형 예측 수행부(520)는 선형 예측 필터의 계수에 대응되는 값에 대하여 벡터 인덱스로 표현하는 벡터 양자화를 수행하여 부호화의 효율을 향상시킬 수 있다.The frequency linear prediction performing unit 520 performs frequency linear prediction on the low frequency band signal converted into the frequency domain by the MDCT application unit 510. Here, frequency linear prediction is a method of approximating a signal at a current frequency to a linear combination of signals at a previous frequency. Specifically, the frequency linear prediction performing unit 520 calculates coefficients of the linear prediction filter such that the prediction error, which is a difference between the signal on which the linear prediction is performed and the signal at the current frequency, is minimized, and according to the calculated coefficients, the frequency domain Linear predictive filtering is performed on the low-frequency band signal transformed by. In this case, the frequency linear prediction performing unit 520 may improve the efficiency of encoding by performing vector quantization expressed by a vector index on a value corresponding to the coefficient of the linear prediction filter.

구체적으로, 주파수 선형 예측 수행부(520)는 MDCT 적용부(510)에서 주파수 도메인으로 변환된 저주파수 밴드 신호가 음성(speech) 신호 또는 피치드(pitched) 신호인 경우에 주파수 선형 예측을 수행할 수 있다. 다시 말해, 주파수 선형 예측 수행부(520)는 입력되는 신호의 특성에 따라 주파수 선형 예측을 수행하여 부호화의 효율을 향상시킬 수 있다.In detail, the frequency linear prediction performing unit 520 may perform frequency linear prediction when the low frequency band signal converted into the frequency domain by the MDCT application unit 510 is a speech signal or a pitched signal. have. In other words, the frequency linear prediction performing unit 520 may improve the efficiency of encoding by performing frequency linear prediction according to the characteristics of the input signal.

멀티-레졸루션 분석부(530)는 주파수 선형 예측 수행부(520)에서 필터링된 신호를 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수를 다해상도(multi-resolution)로 분석한다. 구체적으로, 멀티-레졸루션 분석부(530)는 오디오 스펙트럼의 변화의 격렬한 정도에 따라 주파수 선형 예측 수행부(420)에서 필터링된 오디오 스펙트럼을 두 가지 유형(예를 들어, 스테이빌 유형과 숏 유형)으로 나누어 분석할 수 있다.The multi-resolution analyzer 530 receives the filtered signal from the frequency linear prediction performer 520 and analyzes the audio spectral coefficients of the instantaneously changing signal in multi-resolution. In detail, the multi-resolution analyzer 530 may filter the audio spectrum filtered by the frequency linear prediction performer 420 according to the intensity of the change in the audio spectrum in two types (for example, a stabil type and a short type). It can be analyzed by dividing by.

구체적으로, 멀티-레졸루션 분석부(530)는 MDCT 적용부(510)에서 주파수 도메인으로 변환된 저주파수 밴드 신호 또는 주파수 선형 예측 수행부(520)에서 필터링된 신호가 트렌젼트(transient) 신호인 경우에 멀티-레졸루션으로 분석할 수 있다. 다시 말해, 멀티-레졸루션 분석부(530)는 입력되는 신호의 특성에 따라 멀티-레졸루션 분석을 수행하여 부호화의 효율을 향상시킬 수 있다.In detail, the multi-resolution analyzer 530 is a case where the low frequency band signal transformed into the frequency domain by the MDCT application unit 510 or the signal filtered by the frequency linear prediction performer 520 is a transient signal. It can be analyzed by multi-resolution. In other words, the multi-resolution analyzer 530 may improve the encoding efficiency by performing multi-resolution analysis according to the characteristics of the input signal.

양자화부(540)는 주파수 선형 예측 수행부(520)에서 필터링된 신호 또는 멀티-레졸루션 분석부(530)에서 출력된 결과를 양자화한다.The quantizer 540 quantizes the signal filtered by the frequency linear prediction performer 520 or the result output from the multi-resolution analyzer 530.

문맥-기반 비트플레인 부호화부(550)는 문맥을 기반으로 하여 양자화부(540)에서 양자화된 결과를 비트플레인으로 부호화한다. 구체적으로, 문맥-기반 비트 플레인 부호화부(550)는 허프만 코딩(Huffman Coding)을 이용하여 양자화부(540)에서 양자화된 결과를 비트플레인으로 부호화할 수 있다.The context-based bitplane encoder 550 encodes the quantized result in the quantizer 540 into a bitplane based on the context. In detail, the context-based bit plane encoder 550 may encode the quantized result in the quantizer 540 into a bit plane using Huffman Coding.

이와 같이, 주파수 선형 예측 수행부(520), 멀티-레졸루션 분석부(530), 양자화부(540) 및 문맥-기반 비트플레인 부호화부(550)는 MDCT 적용부(510)에서 출력된 변환된 저주파수 밴드 신호에 대해서 부호화를 수행하므로, 저주파수 밴드 부호화부라고 할 수 있다.As such, the frequency linear prediction execution unit 520, the multi-resolution analysis unit 530, the quantization unit 540, and the context-based bitplane encoder 550 are converted low frequencies output from the MDCT application unit 510. Since the encoding is performed on the band signal, it may be referred to as a low frequency band encoder.

저주파수 밴드 변환부(560)는 밴드 분할부(500)에서 분할된 저주파수 밴드 신호(LB)를 MDCT 이외의 변환 기법을 이용하여 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다. 예를 들어, 저주파수 밴드 변환부(560)는 MDST, FFT, 및 QMF 등을 이용하여 저주파수 밴드 신호(LB)를 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환할 수 있다. 여기서, 시간 도메인은 시간의 경과에 따라 저주파수 밴드 신호(LB)의 크기(예를 들어, 에너지 또는 음압 등)를 나타내는 도메인이다. 이에 비해, 주파수 도메인은 주파수의 변화에 따라 저주파수 밴드 신호(LB)의 크기를 나타내는 도메인이다. 시간/주파수 도메인은 시간의 경과 및 주파수의 변화에 따라 저주파수 밴드 신호(LB)의 크기를 나타내는 도메인이다.The low frequency band converter 560 converts the low frequency band signal LB divided by the band divider 500 from the time domain to the frequency domain or the time / frequency domain using a conversion technique other than MDCT. For example, the low frequency band converter 560 may convert the low frequency band signal LB from the time domain to the frequency domain or the time / frequency domain using MDST, FFT, and QMF. Here, the time domain is a domain representing the magnitude (eg, energy or sound pressure) of the low frequency band signal LB over time. In contrast, the frequency domain is a domain indicating the magnitude of the low frequency band signal LB according to the change of frequency. The time / frequency domain is a domain indicating the magnitude of the low frequency band signal LB according to the passage of time and the change of frequency.

고주파수 밴드 변환부(570)는 밴드 분할부(500)에서 분할된 고주파수 밴드 신호(HB)를 MDCT 이외의 변환 기법을 이용하여 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다. 여기서, 고주파밴드 변환부(570)는 저주파밴드 변환부(560)에서 이용하는 동일한 변환 기법을 이용한다. 예를 들어, 고주파수 밴드 변환부(570)는 MDST, FFT, 및 QMF 등을 이용하여 고주파수 밴드 신호(HB)를 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환할 수 있다.The high frequency band converter 570 converts the high frequency band signal HB split by the band divider 500 from the time domain to the frequency domain or the time / frequency domain using a conversion technique other than MDCT. Here, the high frequency band converter 570 uses the same conversion technique used by the low frequency band converter 560. For example, the high frequency band converter 570 may convert the high frequency band signal HB from the time domain to the frequency domain or the time / frequency domain using MDST, FFT, and QMF.

대역폭 확장 부호화부(580)는 저주파수 밴드 변환부(560)에서 주파수 도메인으로 변환된 저주파수 밴드 신호를 이용하여, 고주파수 밴드 변환부(570)에서 주파수 도메인으로 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화한다. 여기서, 대역폭 확장 정보는 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 다양한 정보를 포함할 수 있음은 본 실시예가 속하는 기술분야에서 통상의 지식을 가진 자는 이해할 수 있을 것이다. 보다 상세하게는, 대역폭 확장 부호화부(580)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 저주파수 밴드 신호를 이용하여 대역폭 확장 정보를 생성할 수 있다.The bandwidth extension encoder 580 uses the low frequency band signal converted into the frequency domain by the low frequency band converter 560 to increase the bandwidth of the high frequency band signal converted into the frequency domain by the high frequency band converter 570. Generate and encode information. Here, it will be understood by those skilled in the art that the bandwidth extension information may include various information such as energy level or envelope for the high frequency band signal. More specifically, the bandwidth extension encoder 580 may generate bandwidth extension information using a low frequency band signal based on a property that a high correlation exists between a high frequency and a low frequency band of the audio signal.

다중화부(590)는 주파수 선형 예측 수행부(520), 문맥-기반 비트플레인 부호화부(550) 및 대역폭 확장 부호화부(580)에서 부호화를 수행한 결과를 다중화하여 비트 스트림을 생성하고 출력 단자 OUT을 통해 출력한다.The multiplexer 590 multiplexes the result of the encoding performed by the frequency linear prediction performer 520, the context-based bitplane encoder 550, and the bandwidth extension encoder 580 to generate a bit stream, and outputs the output terminal OUT. Output through

도 6은 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다. 6 is a block diagram illustrating an apparatus for encoding an audio signal according to another embodiment of the present invention.

도 6을 참조하면, 오디오 신호의 부호화 장치는 MDCT 적용부(600), 밴드 분할부(610), 주파수 선형 예측 수행부(620), 멀티-레졸루션 분석부(630), 양자화부(640), 문맥-기반 비트플레인 부호화부(650), 대역폭 확장 부호화부(660) 및 다중화부(670)를 포함한다.Referring to FIG. 6, the apparatus for encoding an audio signal includes an MDCT application unit 600, a band divider 610, a frequency linear prediction performer 620, a multi-resolution analyzer 630, a quantizer 640, The context-based bitplane encoder 650, the bandwidth extension encoder 660, and the multiplexer 670 are included.

MDCT 적용부(600)는 입력 신호(IN)에 대하여 MDCT를 수행하여, 입력 신호(IN)를 시간 도메인에서 주파수 도메인으로 변환한다. 여기서, 입력 신호(IN)는 아날로그의 음성 신호 또는 오디오 신호를 디지털 신호로 변조한 PCM 신호일 수 있다.The MDCT application unit 600 performs an MDCT on the input signal IN to convert the input signal IN from the time domain to the frequency domain. The input signal IN may be a PCM signal obtained by modulating an analog voice signal or an audio signal into a digital signal.

밴드 분할부(610)는 MDCT 적용부(600)에서 주파수 도메인으로 변환된 신호를 저주파수 밴드 신호(LB) 및 고주파수 밴드 신호(HB)로 분할한다. 여기서, 저주파수 밴드 신호는 임의의 임계값 보다 낮은 주파수에 해당하는 신호이며, 고주파수 밴드 신호는 임의의 임계값 보다 높은 주파수에 해당하는 신호일 수 있다.The band dividing unit 610 divides the signal converted into the frequency domain by the MDCT applying unit 600 into a low frequency band signal LB and a high frequency band signal HB. Here, the low frequency band signal may be a signal corresponding to a frequency lower than an arbitrary threshold value, and the high frequency band signal may be a signal corresponding to a frequency higher than an arbitrary threshold value.

주파수 선형 예측 수행부(620)는 밴드 분할부(610)에서 분할된 저주파수 밴드 신호(LB)에 대하여 주파수 선형 예측을 수행한다. 여기서, 주파수 선형 예측은 현재 주파수에서의 신호를 이전 주파수에서의 신호의 선형 조합으로 근사하는 방법이다. 구체적으로, 주파수 선형 예측 수행부(620)는 선형 예측이 수행된 신호와 현재 주파수에서의 신호 사이의 차이인 예측 오류가 최소가 되도록 선형 예측 필터의 계수를 계산하고, 계산된 계수에 따라 주파수 도메인으로 변환된 저주파수 밴드 신호를 선형 예측 필터링한다. 이 때, 주파수 선형 예측 수행부(620)는 선형 예측 필터의 계수에 대응되는 값에 대하여 벡터 인덱스로 표현하는 벡터 양자화를 수행하 여 부호화의 효율을 향상시킬 수 있다.The frequency linear prediction performer 620 performs frequency linear prediction on the low frequency band signal LB divided by the band divider 610. Here, frequency linear prediction is a method of approximating a signal at a current frequency to a linear combination of signals at a previous frequency. Specifically, the frequency linear prediction performing unit 620 calculates coefficients of the linear prediction filter such that the prediction error, which is the difference between the signal on which the linear prediction is performed and the signal at the current frequency, is minimized, and according to the calculated coefficients, the frequency domain Linear predictive filtering is performed on the low-frequency band signal transformed by. In this case, the frequency linear prediction performing unit 620 may improve the efficiency of encoding by performing vector quantization expressed by a vector index on values corresponding to coefficients of the linear prediction filter.

구체적으로, 주파수 선형 예측 수행부(620)는 밴드 분할부(610)에서 분할된 저주파수 밴드 신호가 음성(speech) 신호 또는 피치드(pitched) 신호인 경우에 주파수 선형 예측을 수행할 수 있다. 다시 말해, 주파수 선형 예측 수행부(620)는 입력되는 신호의 특성에 따라 주파수 선형 예측을 수행하여 부호화의 효율을 향상시킬 수 있다.In detail, the frequency linear prediction performing unit 620 may perform frequency linear prediction when the low frequency band signal divided by the band dividing unit 610 is a speech signal or a pitched signal. In other words, the frequency linear prediction performing unit 620 may improve the efficiency of encoding by performing frequency linear prediction according to the characteristics of the input signal.

멀티-레졸루션 분석부(630)는 주파수 선형 예측 수행부(620)에서 필터링된 신호를 입력받아 순간적으로 변하는 신호의 오디오 스펙트럼 계수를 다해상도로 분석한다. 구체적으로, 멀티-레졸루션 분석부(630)는 오디오 스펙트럼의 변화의 격렬한 정도에 따라 주파수 선형 예측 수행부(620)에서 필터링된 오디오 스펙트럼을 두 가지 유형(예를 들어, 스테이빌 유형과 숏 유형)으로 나누어 분석할 수 있다. The multi-resolution analyzer 630 receives the filtered signal from the frequency linear prediction performer 620 and analyzes the audio spectral coefficients of the signal that is instantaneously changed in multi resolution. In detail, the multi-resolution analyzer 630 may filter the audio spectrum filtered by the frequency linear prediction performer 620 according to the intensity of the change in the audio spectrum in two types (for example, a stabil type and a short type). It can be analyzed by dividing by.

구체적으로, 멀티-레졸루션 분석부(630)는 밴드 분할부(610)에서 분할된 저주파수 밴드 신호 또는 주파수 선형 예측 수행부(620)에서 필터링된 신호가 트렌젼트(transient) 신호인 경우에 멀티-레졸루션으로 분석할 수 있다. 다시 말해, 멀티-레졸루션 분석부(630)는 입력되는 신호의 특성에 따라 멀티-레졸루션 분석을 수행하여 부호화의 효율을 향상시킬 수 있다.In detail, the multi-resolution analyzer 630 performs the multi-resolution when the low frequency band signal divided by the band divider 610 or the signal filtered by the frequency linear prediction performer 620 is a transient signal. Can be analyzed. In other words, the multi-resolution analyzer 630 may improve the efficiency of encoding by performing multi-resolution analysis according to the characteristics of the input signal.

양자화부(640)는 주파수 선형 예측 수행부(620)에서 필터링된 신호 또는 멀티-레졸루션 분석부(630)에서 출력된 결과를 양자화한다.The quantizer 640 quantizes the signal filtered by the frequency linear prediction performer 620 or the result output from the multi-resolution analyzer 630.

문맥-기반 비트플레인 부호화부(650)는 문맥을 기반으로 하여 양자화부(640)에서 양자화된 결과를 비트플레인으로 부호화한다. 구체적으로, 문맥-기반 비트 플 레인 부호화부(650)는 허프만 코딩(Huffman Coding)을 이용하여 양자화부(640)에서 양자화된 결과를 비트플레인으로 부호화할 수 있다.The context-based bitplane encoder 650 encodes the quantized result in the quantizer 640 into a bitplane based on the context. In detail, the context-based bitplane encoder 650 may encode the quantized result in the quantizer 640 into a bitplane using Huffman Coding.

이와 같이, 주파수 선형 예측 수행부(620), 멀티-레졸루션 분석부(630), 양자화부(640) 및 문맥-기반 비트플레인 부호화부(650)는 MDCT 적용부(610)에서 출력된 변환된 저주파수 밴드 신호에 대해서 부호화를 수행하므로, 저주파수 밴드 부호화부라고 할 수 있다.As such, the frequency linear prediction performing unit 620, the multi-resolution analysis unit 630, the quantization unit 640, and the context-based bitplane encoder 650 are converted low frequencies output from the MDCT application unit 610. Since the encoding is performed on the band signal, it may be referred to as a low frequency band encoder.

대역폭 확장 부호화부(660)는 저주파수 밴드 신호(LB)를 이용하여, 고주파수 밴드 신호(HB)의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화한다. 여기서, 대역폭 확장 정보는 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 다양한 정보를 포함할 수 있음은 본 실시예가 속하는 기술분야에서 통상의 지식을 가진 자는 이해할 수 있을 것이다. 보다 상세하게는, 대역폭 확장 부호화부(660)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 저주파수 밴드 신호를 이용하여 대역폭 확장 정보를 생성할 수 있다. The bandwidth extension encoder 660 generates and encodes bandwidth extension information indicating a characteristic of the high frequency band signal HB using the low frequency band signal LB. Here, it will be understood by those skilled in the art that the bandwidth extension information may include various information such as energy level or envelope for the high frequency band signal. More specifically, the bandwidth extension encoder 660 may generate bandwidth extension information by using the low frequency band signal based on the property that there is a high correlation between the high frequency and the low frequency band of the audio signal.

다중화부(670)는 주파수 선형 예측 수행부(620), 문맥-기반 비트플레인 부호화부(650) 및 대역폭 확장 부호화부(660)에서 부호화를 수행한 결과를 다중화하여 비트 스트림을 생성하고 출력 단자 OUT을 통해 출력한다.The multiplexer 670 multiplexes the result of the encoding performed by the frequency linear prediction performer 620, the context-based bitplane encoder 650, and the bandwidth extension encoder 660 to generate a bit stream, and output terminal OUT. Output through

도 7은 본 발명의 일 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다. 7 is a block diagram illustrating an apparatus for decoding an audio signal according to an embodiment of the present invention.

도 7을 참조하면, 오디오 신호의 복호화 장치는 역다중화부(700), 문맥-기반 비트플레인 복호화부(710), PQ-SPSC 모듈(720), 역양자화부(730), 멀티-레졸루션 합성부(740), 역 주파수 선형 예측 수행부(750), 제1 역 MDCT 적용부(760), 대역폭 확장 복호화부(770), 제2 역 MDCT 적용부(780) 및 밴드 합성부(790)를 포함하여 이루어진다.Referring to FIG. 7, an apparatus for decoding an audio signal includes a demultiplexer 700, a context-based bitplane decoder 710, a PQ-SPSC module 720, a dequantizer 730, and a multi-resolution synthesizer. 740, an inverse frequency linear prediction execution unit 750, a first inverse MDCT application unit 760, a bandwidth extension decoding unit 770, a second inverse MDCT application unit 780, and a band synthesis unit 790. It is done by

역다중화부(700)는 부호화단으로부터 출력된 비트 스트림(bit stream)을 입력받아 역다중화한다. 구체적으로, 역다중화부(700)는 데이터 레벨의 각 부분을 각 단위에 대응하는 데이터 부분으로 분리하며, 해당 단위에 그와 관련된 비트스트림의 정보를 분석하여 출력한다. 여기서, 역다중화부(700)가 출력하는 정보는 PQ-SPSC 모듈(720)에서 이용할 오디오 스펙트럼 스트림의 설명 분석, 양자화 값과 기타 복원(reconstruction) 정보, 양자화 스펙트럼의 복원 정보, 문맥-기반 비트플레인 복호화의 정보, Quantization wide-polar coordination stereo decoding의 정보, 신호 타입 정보(Signal type information), 주파수 선형 예측 및 벡터 양자화(Frequency-domain linear prediction and vector quantization)의 정보, 및 부호화된 대역폭 확장 정보 등이 있다.The demultiplexer 700 demultiplexes a bit stream output from an encoding stage. In detail, the demultiplexer 700 separates each part of the data level into data parts corresponding to each unit, and analyzes and outputs information of a bitstream related thereto in the corresponding unit. Here, the information output from the demultiplexer 700 includes description analysis of the audio spectrum stream to be used by the PQ-SPSC module 720, quantization values and other reconstruction information, reconstruction information of the quantization spectrum, and context-based bitplanes. Information of decoding, information of quantization wide-polar coordination stereo decoding, signal type information, information of frequency-domain linear prediction and vector quantization, and coded bandwidth extension information have.

문맥-기반 비트플레인 복호화부(710, Context-dependent Bit plane decoding unit)는 부호화된 비트플레인을 문맥을 기반으로 복호화한다. 여기서, 문맥-기반 비트플레인 복호화부(710)는 역다중화부(700)로부터 출력된 정보를 입력받아 허프먼 복호화(Huffman decoding)를 진행하여 주파수 스펙트럼, 코딩 밴드 모드(coding band mode) 정보 및 스케일 팩터(scale factor)를 복원한다. 문맥-기반 비트플레인 복호화부(710)는 Prejudice coding band mode 정보, Prejudice coding의 스케일 팩터, 및 Prejudice coding의 주파수 스펙트럼을 입력받아 코딩 밴드 모드 수치, 스 케일 팩터의 decoding cosmetic 표시, 주파수 스펙트럼의 양자화 값을 출력한다.The context-based bitplane decoding unit 710 decodes the encoded bitplane based on the context. Here, the context-based bitplane decoder 710 receives information output from the demultiplexer 700 and performs Huffman decoding to perform frequency spectrum, coding band mode information, and scale. Restore the scale factor. The context-based bitplane decoder 710 receives Prejudice coding band mode information, a scale factor of Prejudice coding, and a frequency spectrum of Prejudice coding, and receives a coding band mode value, a decoding cosmetic display of a scale factor, and a quantization value of a frequency spectrum. Outputs

PQ-SPSC 모듈(720, Post-quantization Square Polar Stereo Coding module)은 문맥-기반 비트플레인 복호화부(710)에서 출력된 결과를 수신하고, 주파수 스펙트럼의 side-polar coordination stereo decoding을 수행한다. 여기서, PQ-SPSC 모듈(720)은 주파수 스펙트럼의 side-polar coordination stereo와 연결(coupling)의 정보를 입력받아 side-polar coordination stereo decoding을 수행한 후 양자화 주파수 스펙트럼을 출력한다.The PQ-SPSC module 720 receives a result output from the context-based bitplane decoder 710 and performs side-polar coordination stereo decoding of the frequency spectrum. Here, the PQ-SPSC module 720 receives side-polar coordination stereo and coupling information of the frequency spectrum, performs side-polar coordination stereo decoding, and then outputs a quantized frequency spectrum.

역양자화부(730)는 PQ-SPSC 모듈(720)에서 출력된 결과를 역양자화한다.The inverse quantization unit 730 inverse quantizes the result output from the PQ-SPSC module 720.

멀티-레졸루션 합성부(740, Multi-resolution synthesis unit)는 역양자화부(730)의 출력을 수신하여, 순간적으로 변하는 신호의 오디오 스펙트럼 계수의 멀티-레졸루션(Multi-resolution)으로 합성한다. 구체적으로, 멀티-레졸루션 합성부(740)는 오디오 신호가 부호화단에서 멀티-레졸루션으로 분석된 경우에 역양자화부(730)의 출력을 멀티-레졸루션으로 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 멀티-레졸루션 합성부(740)는 reserve quantization spectrum/difference spectrum을 입력받아 reconstruction spectrum/difference spectrum을 출력한다.The multi-resolution synthesis unit 740 receives the output of the inverse quantization unit 730 and synthesizes the multi-resolution of the audio spectral coefficients of the instantaneously changing signal. In detail, the multi-resolution synthesis unit 740 may improve the decoding efficiency by synthesizing the output of the dequantization unit 730 into the multi-resolution when the audio signal is analyzed as the multi-resolution at the encoding end. Here, the multi-resolution synthesis unit 740 receives a reserve quantization spectrum / difference spectrum and outputs a reconstruction spectrum / difference spectrum.

역 주파수 선형 예측 수행부(750, inverse frequency linear prediction performing unit)은 역양자화부(730)의 출력 또는 멀티-레졸루션 합성부(740)의 출력과 역다중화부(700)로부터 입력받은 부호화단에서 주파수 선형 예측을 수행한 결과를 합성한다. 구체적으로, 역 주파수 선형 예측 수행부(750)는 오디오 신호가 부호화단에서 주파수 선형 예측이 수행된 경우에 상기 주파수 선형 예측이 수행된 결 과를 역양자화부(730)의 출력 또는 멀티-레졸루션 합성부(740)의 출력과 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 역 주파수 선형 예측 수행부(750)은 주파수 도메인 예측 기술과 예측 계수의 벡터 양자화 기술을 채용하여 코딩 효율을 유효하게 제고하였다. 역 주파수 선형 예측 수행부(750)은 difference spectrum 계수, 벡터의 인덱스(index)를 입력받아 MDCT 스펙트럼 계수를 출력한다.An inverse frequency linear prediction performing unit (750) is a frequency output from an inverse quantizer 730 or an output of the multi-resolution synthesis unit 740 and an encoder received from the demultiplexer 700. Synthesize the result of performing the linear prediction. In detail, the inverse frequency linear prediction execution unit 750 outputs the result of the frequency linear prediction when the audio signal is encoded in the encoding stage, or multi-resolution synthesis of the inverse quantization unit 730. The efficiency of the decoding may be improved by combining with the output of the unit 740. Here, the inverse frequency linear prediction performing unit 750 effectively improves coding efficiency by employing a frequency domain prediction technique and a vector quantization technique of prediction coefficients. The inverse frequency linear prediction performing unit 750 receives a difference spectrum coefficient and an index of a vector and outputs an MDCT spectrum coefficient.

제1 역 MDCT 적용부(760)는 부호화단에 대한 역변환 과정으로서 멀티-레졸루션 합성부(740) 또는 역 주파수 선형 예측 수행부(750)에서 출력된 저주파수 밴드 신호에 대하여 역 MDCT(inverse MDCT)를 수행하여 주파수 도메인에서 시간 도메인으로 역변환한다. 여기서, 제1 역 MDCT 적용부(760)는 멀티-레졸루션 합성부(740) 또는 역 주파수 선형 예측 수행부(750)에서 역양자화한 결과로 얻은 주파수 스펙트럼 계수를 입력받아 저주파수 밴드에 해당하는 복원된 오디오 데이터로 출력한다.The first inverse MDCT application unit 760 performs an inverse MDCT (inverse MDCT) on the low frequency band signal output from the multi-resolution synthesis unit 740 or the inverse frequency linear prediction execution unit 750 as an inverse transform process for the encoding end. Perform inverse transform from frequency domain to time domain. Here, the first inverse MDCT applying unit 760 receives the frequency spectrum coefficients obtained as a result of the inverse quantization by the multi-resolution synthesis unit 740 or the inverse frequency linear prediction performing unit 750, and restores the frequency spectrum coefficients corresponding to the low frequency bands. Output as audio data.

대역폭 확장 복호화부(770)는 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 멀티-레졸루션 합성부(740) 또는 역 주파수 선형 예측 수행부(750)에서 출력된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성한다. 여기서, 대역폭 확장 복호화부(770)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 복호화된 대역폭 확장 정보를 저주파수 밴드 신호에 적용함으로써 고주파수 밴드 신호를 생성한다. 여기서, 대역폭 확장 정보는 고주파수 밴드의 고주파수 밴드의 특성을 나타낼 수 있는 정보로써 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 정보이다.The bandwidth extension decoder 770 decodes the encoded bandwidth extension information and uses the decoded bandwidth extension information to output the low frequency band signal output from the multi-resolution synthesis unit 740 or the inverse frequency linear prediction performer 750. Generate a high frequency band signal. Here, the bandwidth extension decoder 770 generates the high frequency band signal by applying the decoded bandwidth extension information to the low frequency band signal based on the property that there is a high correlation between the high frequency and the low frequency band of the audio signal. Here, the bandwidth extension information is information that can indicate the characteristics of the high frequency band of the high frequency band and is information such as energy level or envelope of the high frequency band signal.

제2 역 MDCT 적용부(780)는 대역폭 확장 복호화부(770)에서 복호화된 고주파수 밴드 신호를 역 MDCT에 의해 주파수 도메인에서 시간 도메인으로 역변환한다.The second inverse MDCT application unit 780 inversely converts the high frequency band signal decoded by the bandwidth extension decoder 770 from the frequency domain to the time domain by inverse MDCT.

밴드 합성부(790)는 제1 역 MDCT 적용부(760)에서 시간 도메인으로 변환된 저주파수 밴드 신호와 제2 역 MDCT 적용부(780)에서 시간 도메인으로 변환된 고주파수 밴드 신호를 합성하여 출력단자 OUT을 통해 출력한다.The band synthesizing unit 790 synthesizes the low frequency band signal converted into the time domain in the first inverse MDCT applying unit 760 and the high frequency band signal converted into the time domain in the second inverse MDCT applying unit 780 and outputs the output terminal OUT. Output through

도 8은 본 발명의 다른 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다. 8 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 8을 참조하면, 오디오 데이터 복호화 장치는 역다중화부(800), 문맥-기반 비트플레인 복호화부(810), PQ-SPSC 모듈(820), 역양자화부(830), 멀티-레졸루션 합성부(840), 역 주파수 선형 예측 수행부(850), 역 MDCT 적용부(860), 변환부(865), 대역폭 확장 복호화부(870), 역변환부(880) 및 밴드 합성부(890)를 포함하여 이루어진다.Referring to FIG. 8, the audio data decoding apparatus includes a demultiplexer 800, a context-based bitplane decoder 810, a PQ-SPSC module 820, an inverse quantizer 830, and a multi-resolution synthesizer ( 840, an inverse frequency linear prediction execution unit 850, an inverse MDCT application unit 860, a transformer 865, a bandwidth extension decoder 870, an inverse transformer 880, and a band synthesizer 890. Is done.

역다중화부(800)는 부호화단으로부터 출력된 비트 스트림(bit stream)을 입력받아 역다중화한다. 역다중화부(800)는 데이터 레벨의 각 부분을 각 단위에 대응하는 데이터 부분으로 분리하며, 해당 단위에 그와 관련된 비트스트림의 정보를 분석하여 출력한다. 여기서, 역다중화부(800)가 출력하는 정보는 PQ-SPSC 모듈(820)에서 이용할 오디오 스펙트럼 스트림의 설명 분석, 양자화 값과 기타 복원(reconstruction) 정보, 양자화 스펙트럼의 복원 정보, 문맥-기반 비트플레인 복호화(Context-dependent Bit plane decoding without decoding)의 정보, Quantization wide-polar coordination stereo decoding의 정보, Signal type information, Frequency-domain linear prediction and vector quantization의 정보, 및 부호화된 대역폭 확장 정보 등이 있다.The demultiplexer 800 demultiplexes a bit stream output from an encoding stage. The demultiplexer 800 separates each part of the data level into data parts corresponding to each unit, and analyzes and outputs information of a bitstream related thereto in the corresponding unit. In this case, the information output from the demultiplexer 800 may include description analysis of an audio spectrum stream to be used by the PQ-SPSC module 820, quantization values and other reconstruction information, reconstruction information of the quantization spectrum, and context-based bitplanes. Information of context-dependent bit plane decoding without decoding, information of quantization wide-polar coordination stereo decoding, signal type information, information of frequency-domain linear prediction and vector quantization, and coded bandwidth extension information.

문맥-기반 비트플레인 복호화부(810)는 부호화된 비트플레인을 문맥을 기반으로 복호화한다. 여기서, 문맥-기반 비트플레인 복호화부(810)는 역다중화부(800)로부터 출력된 정보를 입력받아 허프먼 복호화(Huffman decoding)를 진행하여 주파수 스펙트럼, 코딩 밴드 모드(coding band mode) 정보, 및 스케일 팩터(scale factor)를 복원한다. 문맥-기반 비트플레인 복호화부(810)는 Prejudice coding band mode 정보, Prejudice coding의 스케일 팩터, 및 Prejudice coding의 주파수 스펙트럼을 입력받아 코딩 밴드 모드 수치, 스케일 팩터의 decoding cosmetic 표시, 주파수 스펙트럼의 양자화 값을 출력한다.The context-based bitplane decoder 810 decodes the encoded bitplane based on the context. Here, the context-based bitplane decoder 810 receives the information output from the demultiplexer 800 and performs Huffman decoding to obtain frequency spectrum, coding band mode information, and Restore the scale factor. The context-based bitplane decoder 810 receives prejudice coding band mode information, a scale factor of prejudice coding, and a frequency spectrum of prejudice coding to obtain a coding band mode value, a decoding cosmetic display of a scale factor, and a quantization value of a frequency spectrum. Output

PQ-SPSC 모듈(820)은 문맥-기반 비트플레인 복호화부(810)의 출력을 수신하고, 주파수 스펙트럼의 side-polar coordination stereo decoding을 수행한다. 여기서, PQ-SPSC 모듈(820)은 주파수 스펙트럼의 side-polar coordination stereo와 연결(coupling)의 정보를 입력받아 side-polar coordination stereo decoding을 수행한 후 양자화 주파수 스펙트럼을 출력한다.The PQ-SPSC module 820 receives the output of the context-based bitplane decoder 810 and performs side-polar coordination stereo decoding of the frequency spectrum. Here, the PQ-SPSC module 820 receives side-polar coordination stereo and coupling information of the frequency spectrum, performs side-polar coordination stereo decoding, and then outputs a quantized frequency spectrum.

역양자화부(830)는 PQ-SPSC 모듈(820)에서 출력된 결과를 역양자화한다.The dequantization unit 830 dequantizes the result output from the PQ-SPSC module 820.

멀티-레졸루션 합성부(840)는 역양자화부(830)의 출력을 수신하고, 순간적으로 변하는 신호의 오디오 스펙트럼 계수에 대하여 멀티-레졸루션(Multi-resolution)을 처리한다. 구체적으로, 멀티-레졸루션 합성부(840)는 오디오 신호가 부호화단에서 멀티-레졸루션으로 분석된 경우에 역양자화부(830)의 출력을 멀티-레 졸루션으로 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 멀티-레졸루션 합성부(840)는 reserve quantization spectrum/difference spectrum을 입력받아 reconstruction spectrum/difference spectrum을 출력한다.The multi-resolution synthesizer 840 receives the output of the inverse quantizer 830 and processes multi-resolution for the audio spectral coefficients of the instantaneously changing signal. In detail, the multi-resolution synthesis unit 840 may improve the decoding efficiency by synthesizing the output of the dequantization unit 830 into a multi-resolution when the audio signal is analyzed as a multi-resolution at the encoding end. have. Here, the multi-resolution synthesis unit 840 receives a reserve quantization spectrum / difference spectrum and outputs a reconstruction spectrum / difference spectrum.

역 주파수 선형 예측 수행부(850)는 멀티-레졸루션 합성부(840)의 출력과 역다중화부(800)로부터 부호화단에서 주파수 선형 예측을 수행한 결과를 합성하고 역-벡터 양자화를 수행한다. 구체적으로, 역 주파수 선형 예측 수행부(850)는 오디오 신호가 부호화단에서 주파수 선형 예측이 수행된 경우에 상기 주파수 선형 예측이 수행된 결과를 역양자화부(830)의 출력 또는 멀티-레졸루션 합성부(840)의 출력과 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 역 주파수 선형 예측 수행부(850)은 주파수 도메인 예측 기술과 예측 계수의 벡터 양자화 기술을 채용하여 코딩 효율을 유효하게 제고하였다. 역 주파수 선형 예측 수행부(850)은 difference spectrum 계수, 벡터의 인덱스(index)를 입력받아 MDCT 스펙트럼 계수를 출력한다.The inverse frequency linear prediction performing unit 850 synthesizes the output of the multi-resolution synthesis unit 840 and the result of performing the frequency linear prediction in the encoding stage from the demultiplexing unit 800 and performs inverse-vector quantization. In detail, the inverse frequency linear prediction performing unit 850 outputs the result of the frequency linear prediction when the audio signal is encoded in the encoding stage or outputs the inverse quantization unit 830 or the multi-resolution synthesis unit. In combination with the output of 840, the efficiency of decoding may be improved. Here, the inverse frequency linear prediction execution unit 850 effectively improves coding efficiency by employing a frequency domain prediction technique and a vector quantization technique of prediction coefficients. The inverse frequency linear prediction performing unit 850 receives a difference spectrum coefficient and an index of a vector and outputs an MDCT spectrum coefficient.

역 MDCT 적용부(860)는 부호화단에 대한 역변환 과정으로서 멀티-레졸루션 합성부(840) 또는 역 주파수 선형예측 수행부(850)에서 출력된 저주파수 밴드 신호를 역 MDCT에 의해 주파수 도메인에서 시간 도메인으로 역변환한다. 여기서, 역 MDCT 적용부(860)는 멀티-레졸루션 합성부(840) 또는 역 주파수 선형 예측 수행부(850)에서 역양자화한 결과로 얻은 주파수 스펙트럼 계수를 입력받아 저주파수 밴드에 해당하는 복원된 오디오 데이터로 출력한다.The inverse MDCT application unit 860 converts the low frequency band signal output from the multi-resolution synthesis unit 840 or the inverse frequency linear prediction unit 850 from the frequency domain to the time domain by inverse MDCT as an inverse transform process on the encoding end. Invert Here, the inverse MDCT application unit 860 receives the frequency spectrum coefficients obtained as a result of inverse quantization by the multi-resolution synthesis unit 840 or the inverse frequency linear prediction performing unit 850, and then reconstructed audio data corresponding to the low frequency band. Will output

변환부(865)는 역 MDCT 적용부(860)에서 시간 도메인으로 변환된 저주파수 밴드 신호를 MDCT 이외의 변환 기법을 이용하여 시간 도메인에서 주파수 도메인 또 는 시간-주파수 도메인으로 변환한다. 변환부(865)에서 이용하는 변환 기법의 예로 MDST, FFT, 및 QMF 등이 있다. 변환부(865)에서 MDCT를 적용하여 실시할 수 없는 것은 아니지만 MDCT를 사용할 경우에는 도 7에 도시된 실시예가 보다 효율적이다.The conversion unit 865 converts the low frequency band signal converted into the time domain in the inverse MDCT application unit 860 from the time domain to the frequency domain or the time-frequency domain using a conversion technique other than MDCT. Examples of conversion techniques used by the conversion unit 865 include MDST, FFT, and QMF. Although the transform unit 865 may not be implemented by applying MDCT, the embodiment shown in FIG. 7 is more efficient when using MDCT.

대역폭 확장 복호화부(870)는 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 변환부(865)에서 출력된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성한다. 여기서, 대역폭 확장 복호화부(870)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 복호화된 대역폭 확장 정보를 저주파수 밴드 신호에 적용함으로써 고주파 밴드 신호를 생성한다. 여기서, 대역폭 확장 정보는 고주파수 밴드의 고주파수 밴드의 특성을 나타낼 수 있는 정보로써 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 정보이다.The bandwidth extension decoder 870 decodes the encoded bandwidth extension information and generates a high frequency band signal from the low frequency band signal output from the converter 865 using the decoded bandwidth extension information. Here, the bandwidth extension decoder 870 generates the high frequency band signal by applying the decoded bandwidth extension information to the low frequency band signal based on the property that there is a high correlation between the high frequency and the low frequency band of the audio signal. Here, the bandwidth extension information is information that can indicate the characteristics of the high frequency band of the high frequency band and is information such as energy level or envelope of the high frequency band signal.

역변환부(880)는 대역폭 확장 복호화부(870)에서 복호화된 고주파수 밴드 신호를 MDCT 이외의 변환 기법을 이용하여 주파수 도메인 또는 시간-주파수 도메인에서 시간 도메인으로 역변환한다. 여기서, 역변환부(880)는 변환부(865)에서 이용하는 동일한 변환 기법을 이용한다. 역변환부(880)에서 이용하는 변환 기법의 예로 MDST, FFT, 및 QMF 등이 있다.The inverse transformer 880 inversely converts the high frequency band signal decoded by the bandwidth extension decoder 870 from the frequency domain or the time-frequency domain to the time domain using a conversion technique other than MDCT. Here, the inverse transformer 880 uses the same transformation technique used by the transformer 865. Examples of the conversion technique used by the inverse transform unit 880 include MDST, FFT, and QMF.

밴드 합성부(890)는 역 MDCT 적용부(860)에서 시간 도메인으로 변환된 저주파수 밴드 신호와 역변환부(880)에서 시간 도메인으로 변환된 고주파수 밴드 신호를 합성하여 출력단자 OUT을 통해 출력한다.The band synthesizer 890 synthesizes the low frequency band signal converted into the time domain by the inverse MDCT application unit 860 and the high frequency band signal converted into the time domain by the inverse converter 880 and outputs the same through the output terminal OUT.

도 9는 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 장치를 나타 내는 블록도이다. 9 is a block diagram showing an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 9를 참조하면, 오디오 데이터 복호화 장치는 역다중화부(900), 문맥-기반 비트플레인 복호화부(910), PQ-SPSC 모듈(920), 역양자화부(930), 멀티-레졸루션 합성부(940), 역 주파수 선형 예측 수행부(950), 대역폭 확장 복호화부(960), 밴드 합성부(970) 및 역 MDCT 적용부(980)를 포함한다.Referring to FIG. 9, an audio data decoding apparatus includes a demultiplexer 900, a context-based bitplane decoder 910, a PQ-SPSC module 920, an inverse quantizer 930, and a multi-resolution synthesis unit ( 940, an inverse frequency linear prediction performer 950, a bandwidth extension decoder 960, a band synthesizer 970, and an inverse MDCT application 980.

역다중화부(900)는 부호화단으로부터 출력된 비트 스트림(bit stream)을 입력받아 역다중화한다. 역다중화부(900)는 데이터 레벨의 각 부분을 각 단위에 대응하는 데이터 부분으로 분리하며, 해당 단위에 그와 관련된 비트스트림의 정보를 분석하여 출력한다. 여기서, 역다중화부(900)가 출력하는 정보는 PQ-SPSC 모듈(920)에서 이용할 오디오 스펙트럼 스트림의 설명 분석, 양자화 값과 기타 복원(reconstruction) 정보, 양자화 스펙트럼의 복원 정보, 문맥-기반 비트플레인 복호화(Context-dependent Bit plane decoding without decoding)의 정보, Quantization wide-polar coordination stereo decoding의 정보, Signal type information, Frequency-domain linear prediction and vector quantization의 정보, 및 부호화된 대역폭 확장 정보 등이 있다.The demultiplexer 900 demultiplexes a bit stream output from an encoding stage. The demultiplexer 900 separates each part of the data level into data parts corresponding to each unit, and analyzes and outputs information of a bitstream related thereto in the corresponding unit. Here, the information output from the demultiplexer 900 includes description analysis of the audio spectrum stream to be used by the PQ-SPSC module 920, quantization values and other reconstruction information, reconstruction information of the quantization spectrum, and context-based bitplanes. Information of context-dependent bit plane decoding without decoding, information of quantization wide-polar coordination stereo decoding, signal type information, information of frequency-domain linear prediction and vector quantization, and coded bandwidth extension information.

문맥-기반 비트플레인 복호화부(910)는 부호화된 비트플레인을 문맥을 기반으로 복호화한다. 여기서, 문맥-기반 비트플레인 복호화부(910)는 역다중화부(900)로부터 출력된 정보를 입력받아 허프먼 복호화(Huffman decoding)를 진행하여 주파수 스펙트럼, 코딩 밴드 모드(coding band mode) 정보, 및 스케일 팩터(scale factor)를 복원한다. 문맥-기반 비트플레인 복호화부(910)는 Prejudice coding band mode 정보, Prejudice coding의 스케일 팩터, 및 Prejudice coding의 주파수 스펙트럼을 입력받아 코딩 밴드 모드 수치, 스케일 팩터의 decoding cosmetic 표시, 주파수 스펙트럼의 양자화 값을 출력한다.The context-based bitplane decoder 910 decodes the encoded bitplane based on the context. Here, the context-based bitplane decoder 910 receives the information output from the demultiplexer 900 and performs Huffman decoding to obtain frequency spectrum, coding band mode information, and Restore the scale factor. The context-based bitplane decoder 910 receives prejudice coding band mode information, a scale factor of prejudice coding, and a frequency spectrum of prejudice coding, and receives a coding band mode value, a decoding cosmetic display of a scale factor, and a quantization value of a frequency spectrum. Output

PQ-SPSC 모듈(920)은 문맥-기반 비트플레인 복호화부(910)의 출력을 수신하고, 주파수 스펙트럼의 side-polar coordination stereo decoding을 수행한다. 여기서, PQ-SPSC 모듈(920)은 주파수 스펙트럼의 side-polar coordination stereo와 연결(coupling)의 정보를 입력받아 side-polar coordination stereo decoding을 수행한 후 양자화 주파수 스펙트럼을 출력한다.The PQ-SPSC module 920 receives the output of the context-based bitplane decoder 910 and performs side-polar coordination stereo decoding of the frequency spectrum. Here, the PQ-SPSC module 920 receives side-polar coordination stereo and coupling information of the frequency spectrum, performs side-polar coordination stereo decoding, and then outputs a quantized frequency spectrum.

역양자화부(930)는 PQ-SPSC 모듈(920)에서 출력된 결과를 역양자화한다.The dequantizer 930 dequantizes the result output from the PQ-SPSC module 920.

멀티-레졸루션 합성부(940)는 역양자화부(930)의 출력을 수신하여, 순간적으로 변하는 신호의 오디오 스펙트럼 계수의 멀티-레졸루션(Multi-resolution)을 처리한다. 구체적으로, 멀티-레졸루션 합성부(90)는 오디오 신호가 부호화단에서 멀티-레졸루션으로 분석된 경우에 역양자화부(930)의 출력을 멀티-레졸루션으로 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 멀티-레졸루션 합성부(940)는 reserve quantization spectrum/difference spectrum을 입력받아 reconstruction spectrum/difference spectrum을 출력한다.The multi-resolution synthesis unit 940 receives the output of the inverse quantization unit 930 and processes multi-resolution of audio spectral coefficients of a signal that is instantaneously changed. In detail, the multi-resolution synthesis unit 90 may improve the decoding efficiency by synthesizing the output of the dequantization unit 930 into a multi-resolution when the audio signal is analyzed as a multi-resolution at the encoding end. Here, the multi-resolution synthesis unit 940 receives a reserve quantization spectrum / difference spectrum and outputs a reconstruction spectrum / difference spectrum.

역 주파수 선형 예측 수행부(950)는 멀티-레졸루션 합성부(940)의 출력과 역다중화부(900)로부터 입력받은 부호화단에서 주파수 선형 예측을 수행한 결과를 합성하고 역-벡터 양자화를 수행한다. 구체적으로, 역 주파수 선형 예측 수행부(950)는 오디오 신호가 부호화단에서 주파수 선형 예측이 수행된 경우에 상기 주파수 선 형 예측이 수행된 결과를 역양자화부(930)의 출력 또는 멀티-레졸루션 합성부(940)의 출력과 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 역 주파수 선형 예측 수행부(950)은 주파수 도메인 예측 기술과 예측 계수의 벡터 양자화 기술을 채용하여 코딩 효율을 유효하게 제고하였다. 역 주파수 선형 예측 수행부(950)은 difference spectrum 계수, 벡터의 인덱스(index)를 입력받아 MDCT 스펙트럼 계수를 출력한다.The inverse frequency linear prediction performing unit 950 synthesizes the output of the multi-resolution synthesis unit 940 and the result of performing the frequency linear prediction in the coding stage received from the demultiplexing unit 900 and performs inverse-vector quantization. . In detail, the inverse frequency linear prediction performing unit 950 outputs the result of the frequency linear prediction when the audio signal is encoded in the encoding stage, or outputs the multi-resolution synthesis of the inverse quantization unit 930. The efficiency of decoding may be improved by combining with the output of the unit 940. Here, the inverse frequency linear prediction performing unit 950 effectively improves coding efficiency by employing a frequency domain prediction technique and a vector quantization technique of prediction coefficients. The inverse frequency linear prediction performing unit 950 receives a difference spectrum coefficient and an index of a vector and outputs an MDCT spectrum coefficient.

대역폭 확장 복호화부(960)는 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 멀티-레졸루션 합성부(940) 또는 역 주파수 선형 예측 수행부(950)에서 출력된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성한다. 여기서, 대역폭 확장 복호화부(960)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 복호화된 대역폭 확장 정보를 저주파수 밴드 신호에 적용함으로써 고주파 밴드 신호를 생성한다. 여기서, 대역폭 확장 정보는 고주파수 밴드의 고주파수 밴드의 특성을 나타낼 수 있는 정보로써 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 정보이다.The bandwidth extension decoder 960 decodes the encoded bandwidth extension information and uses the decoded bandwidth extension information to output the low frequency band signal output from the multi-resolution synthesis unit 940 or the inverse frequency linear prediction performer 950. Generate a high frequency band signal. Here, the bandwidth extension decoder 960 generates the high frequency band signal by applying the decoded bandwidth extension information to the low frequency band signal based on the property that there is a high correlation between the high frequency and the low frequency band of the audio signal. Here, the bandwidth extension information is information that can indicate the characteristics of the high frequency band of the high frequency band and is information such as energy level or envelope of the high frequency band signal.

밴드 합성부(970)는 멀티-레졸루션 합성부(940) 또는 역 주파수 선형 예측 수행부(950)에서 출력된 저주파수 밴드 신호와 대역폭 확장 복호화부(960)에서 복호화된 고주파수 밴드 신호를 합성한다.The band synthesis unit 970 synthesizes the low frequency band signal output from the multi-resolution synthesis unit 940 or the inverse frequency linear prediction execution unit 950 and the high frequency band signal decoded by the bandwidth extension decoder 960.

역 MDCT 적용부(980)는 밴드 합성부(970)에서 출력된 신호를 역 MDCT(inverse MDCT)에 의해 주파수 도메인에서 시간 도메인으로 역변환하여 출력단자 OUT을 통해 출력한다. 여기서, 역 MDCT 적용부(980)는 역 주파수 선형 예측 수 행부(950)에서 역양자화한 결과로 얻은 주파수 스펙트럼 계수를 입력받아 저주파수 밴드에 해당하는 복원된 오디오 데이터로 출력한다.The inverse MDCT application unit 980 inversely converts the signal output from the band synthesis unit 970 from the frequency domain to the time domain by an inverse MDCT and outputs it through the output terminal OUT. Here, the inverse MDCT application unit 980 receives the frequency spectrum coefficients obtained as the result of inverse quantization by the inverse frequency linear prediction performing unit 950 and outputs the recovered audio data corresponding to the low frequency band.

도 10은 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다.10 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 10을 참조하면, 오디오 신호의 복호화 장치는 역다중화부(1000), 문맥-기반 비트플레인 복호화부(1010), 역양자화부(1020), 멀티-레졸루션 합성부(1030), 역 주파수 선형 예측 수행부(1040), 대역폭 확장 복호화부(1050), 제1 역 MDCT 적용부(1060), 제2 역 MDCT 적용부(1070) 및 밴드 합성부(1080)를 포함한다.Referring to FIG. 10, an apparatus for decoding an audio signal includes a demultiplexer 1000, a context-based bitplane decoder 1010, a dequantizer 1020, a multi-resolution synthesizer 1030, and inverse frequency linear prediction. An execution unit 1040, a bandwidth extension decoding unit 1050, a first inverse MDCT application unit 1060, a second inverse MDCT application unit 1070, and a band synthesis unit 1080 are included.

역다중화부(1000)는 부호화단으로부터 출력된 비트 스트림(bit stream)을 입력받아 역다중화한다. 구체적으로, 역다중화부(1000)는 데이터 레벨의 각 부분을 각 단위에 대응하는 데이터 부분으로 분리하며, 해당 단위에 그와 관련된 비트스트림의 정보를 분석하여 출력한다. 여기서, 역다중화부(1000)가 출력하는 정보는 오디오 스펙트럼 스트림의 설명 분석, 양자화 값과 기타 복원(reconstruction) 정보, 양자화 스펙트럼의 복원 정보, 문맥-기반 비트플레인 복호화(Context-dependent Bit plane decoding without decoding)의 정보, 신호 타입 정보(Signal type information), 주파수 선형 예측 및 벡터 양자화(Frequency-domain linear prediction and vector quantization)의 정보, 및 부호화된 대역폭 확장 정보 등이 있다.The demultiplexer 1000 demultiplexes a bit stream output from an encoding stage. Specifically, the demultiplexer 1000 separates each part of the data level into data parts corresponding to each unit, and analyzes and outputs information of a bitstream related thereto in the corresponding unit. In this case, the information output from the demultiplexer 1000 includes an explanatory analysis of the audio spectrum stream, quantization values and other reconstruction information, reconstruction information of the quantization spectrum, and context-dependent bit plane decoding without. information of decoding, signal type information, information of frequency-linear prediction and vector quantization, and coded bandwidth extension information.

문맥-기반 비트플레인 복호화부(1010)는 부호화된 비트플레인을 문맥을 기반으로 복호화한다. 여기서, 문맥-기반 비트플레인 복호화부(1010)는 역다중화 부(1000)로부터 출력된 정보를 입력받아 허프먼 복호화(Huffman decoding)를 진행하여 주파수 스펙트럼, 코딩 밴드 모드(coding band mode) 정보 및 스케일 팩터(scale factor)를 복원한다. 문맥-기반 비트플레인 복호화부(1010)는 Prejudice coding band mode 정보, Prejudice coding의 스케일 팩터(scale factor), 및 Prejudice coding의 주파수 스펙트럼을 입력받아 코딩 밴드 모드 수치, 스케일 팩터의 decoding cosmetic 표시, 주파수 스펙트럼의 양자화 값을 출력한다.The context-based bitplane decoder 1010 decodes the encoded bitplane based on the context. Here, the context-based bitplane decoder 1010 receives the information output from the demultiplexer 1000 and performs Huffman decoding to perform frequency spectrum, coding band mode information, and scale. Restore the scale factor. The context-based bitplane decoder 1010 receives prejudice coding band mode information, a scale factor of prejudice coding, and a frequency spectrum of prejudice coding, and displays a coding band mode value, a decoding cosmetic display of a scale factor, and a frequency spectrum. Output the quantization value of.

역양자화부(1020)는 문맥-기반 비트플레인 복호화부(1010)에서 출력된 결과를 역양자화한다.The inverse quantizer 1020 dequantizes the result output from the context-based bitplane decoder 1010.

멀티-레졸루션 합성부(1030)는 역양자화부(1020)에서 역양자화된 신호를 수신하여 순간적으로 변하는 신호의 오디오 스펙트럼 계수의 멀티-레졸루션(Multi-resolution)을 처리한다. 구체적으로, 멀티-레졸루션 합성부(1030)는 오디오 신호가 부호화단에서 멀티-레졸루션으로 분석된 경우에 역양자화부(1020)의 출력을 멀티-레졸루션으로 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 멀티-레졸루션 합성부(1030)는 reserve quantization spectrum/difference spectrum을 입력받아 reconstruction spectrum/difference spectrum을 출력한다.The multi-resolution synthesis unit 1030 receives the dequantized signal from the inverse quantization unit 1020 and processes multi-resolution of audio spectral coefficients of a signal that is instantaneously changed. In detail, the multi-resolution synthesis unit 1030 may improve the decoding efficiency by synthesizing the output of the inverse quantizer 1020 into multi-resolution when the audio signal is analyzed as multi-resolution at the encoding end. Here, the multi-resolution synthesis unit 1030 receives a reserve quantization spectrum / difference spectrum and outputs a reconstruction spectrum / difference spectrum.

역 주파수 선형 예측 수행부(1040)은 멀티-레졸루션 합성부(1030)의 출력과 역다중화부(1000)로부터 입력받은 부호화단에서 주파수 선형 예측을 수행한 결과를 합성한다. 구체적으로, 역 주파수 선형 예측 수행부(1040)는 오디오 신호가 부호화단에서 주파수 선형 예측이 수행된 경우에 상기 주파수 선형 예측이 수행된 결과를 역양자화부(1020)의 출력 또는 멀티-레졸루션 합성부(1030)의 출력과 합성하여 복 호화의 효율을 향상시킬 수 있다. 여기서, 역 주파수 선형 예측 수행부(1040)은 주파수 도메인 예측 기술과 예측 계수의 벡터 양자화 기술을 채용하여 코딩 효율을 유효하게 제고하였다. 역 주파수 선형 예측 수행부(1040)은 difference spectrum 계수, 벡터의 인덱스(index)를 입력받아 MDCT 스펙트럼 계수를 출력한다.The inverse frequency linear prediction execution unit 1040 synthesizes the output of the multi-resolution synthesis unit 1030 and the result of performing the frequency linear prediction in the coding stage received from the demultiplexer 1000. In detail, the inverse frequency linear prediction execution unit 1040 outputs the result of the frequency linear prediction when the audio signal is encoded in the encoding stage, or outputs the inverse quantization unit 1020 or the multi-resolution synthesis unit. Combined with the output of 1030, the efficiency of copying can be improved. Here, the inverse frequency linear prediction execution unit 1040 effectively improves coding efficiency by employing a frequency domain prediction technique and a vector quantization technique of prediction coefficients. The inverse frequency linear prediction execution unit 1040 receives a difference spectrum coefficient and an index of a vector and outputs an MDCT spectrum coefficient.

이와 같이, 문맥-기반 비트플레인 복호화부(1010), 역양자화부(1020), 멀티-레졸루션 합성부(1030) 및 역 주파수 선형 예측 수행부(1040)는 부호화된 저주파수 밴드 신호에 대해서 복호화를 수행하므로, 저주파수 밴드 복호화부라고 할 수 있다.As such, the context-based bitplane decoder 1010, the inverse quantizer 1020, the multi-resolution synthesizer 1030, and the inverse frequency linear prediction performer 1040 perform decoding on the encoded low frequency band signal. Therefore, it can be referred to as a low frequency band decoder.

대역폭 확장 복호화부(1050)는 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 멀티-레졸루션 합성부(1030) 또는 역 주파수 선형 예측 수행부(1040)에서 출력된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성한다. 여기서, 대역폭 확장 복호화부(1050)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 복호화된 대역폭 확장 정보를 저주파수 밴드 신호에 적용함으로써 고주파 밴드 신호를 생성한다. 여기서, 대역폭 확장 정보는 고주파수 밴드의 고주파수 밴드의 특성을 나타낼 수 있는 정보로써 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 정보이다.The bandwidth extension decoder 1050 decodes the encoded bandwidth extension information and uses the decoded bandwidth extension information to output the low frequency band signal output from the multi-resolution synthesis unit 1030 or the inverse frequency linear prediction execution unit 1040. Generate a high frequency band signal. Here, the bandwidth extension decoder 1050 generates the high frequency band signal by applying the decoded bandwidth extension information to the low frequency band signal based on the property that there is a high correlation between the high frequency and the low frequency band of the audio signal. Here, the bandwidth extension information is information that can indicate the characteristics of the high frequency band of the high frequency band and is information such as energy level or envelope of the high frequency band signal.

제1 역 MDCT 적용부(1060)는 부호화단에 대한 역변환 과정으로서 멀티-레졸루션 합성부(1030) 또는 역 주파수 선형 예측 수행부(1040)에서 출력된 저주파수 밴드 신호에 대하여 역 MDCT(inverse MDCT)를 수행하여 주파수 도메인에서 시간 도메인으로 역변환한다. 여기서, 제1 역 MDCT 적용부(1060)는 멀티-레졸루션 합성 부(1030) 또는 역 주파수 선형 예측 수행부(1040)에서 역양자화한 결과로 얻은 주파수 스펙트럼 계수를 입력받아 저주파수 밴드에 해당하는 복원된 오디오 데이터로 출력한다.The first inverse MDCT application unit 1060 performs an inverse MDCT on the low frequency band signal output from the multi-resolution synthesis unit 1030 or the inverse frequency linear prediction unit 1040 as an inverse transform process for the encoding end. Perform inverse transform from frequency domain to time domain. Here, the first inverse MDCT application unit 1060 receives a frequency spectrum coefficient obtained as a result of inverse quantization by the multi-resolution synthesis unit 1030 or the inverse frequency linear prediction execution unit 1040, and restores the frequency spectrum coefficient corresponding to the low frequency band. Output as audio data.

제2 역 MDCT 적용부(1070)는 대역폭 확장 복호화부(1050)에서 생성된 고주파수 밴드 신호를 역 MDCT에 의해 주파수 도메인에서 시간 도메인으로 역변환한다.The second inverse MDCT application unit 1070 inversely converts the high frequency band signal generated by the bandwidth extension decoding unit 1050 from the frequency domain to the time domain by inverse MDCT.

밴드 합성부(1080)는 제1 역 MDCT 적용부(1060)에서 시간 도메인으로 변환된 저주파수 밴드 신호와 제2 역 MDCT 적용부(1070)에서 시간 도메인으로 변환된 고주파수 밴드 신호를 합성하여 출력단자 OUT을 통해 출력한다.The band synthesizing unit 1080 synthesizes the low frequency band signal converted into the time domain in the first inverse MDCT applying unit 1060 and the high frequency band signal converted into the time domain in the second inverse MDCT applying unit 1070 and outputs the terminal OUT. Output through

도 11은 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다.11 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 11을 참조하면, 오디오 데이터 복호화 장치는 역다중화부(1100), 문맥-기반 비트플레인 복호화부(1110), 역양자화부(1120), 멀티-레졸루션 합성부(1130), 역 주파수 선형 예측 수행부(1140), 역 MDCT 적용부(1150), 변환부(1160), 대역폭 확장 복호화부(1170), 역변환부(1180) 및 밴드 합성부(1190)를 포함한다.Referring to FIG. 11, the audio data decoding apparatus performs demultiplexer 1100, context-based bitplane decoder 1110, dequantizer 1120, multi-resolution synthesizer 1130, and inverse frequency linear prediction. A unit 1140, an inverse MDCT application unit 1150, a transform unit 1160, a bandwidth extension decoder 1170, an inverse transform unit 1180, and a band synthesizer 1190 are included.

역다중화부(1100)는 부호화단으로부터 출력된 비트 스트림(bit stream)을 입력받아 역다중화한다. 역다중화부(1100)는 데이터 레벨의 각 부분을 각 단위에 대응하는 데이터 부분으로 분리하며, 해당 단위에 그와 관련된 비트스트림의 정보를 분석하여 출력한다. 여기서, 역다중화부(1100)가 출력하는 정보는 오디오 스펙트럼 스트림의 설명 분석, 양자화 값과 기타 복원(reconstruction) 정보, 양자화 스펙트럼의 복원 정보, 문맥-기반 비트플레인 복호화(Context-dependent Bit plane decoding without decoding)의 정보, 신호 타입 정보(Signal type information), 주파수 선형 예측 및 벡터 양자화(Frequency-domain linear prediction and vector quantization)의 정보, 및 부호화된 대역폭 확장 정보 등이 있다.The demultiplexer 1100 receives a bit stream output from an encoding stage and demultiplexes the bit stream. The demultiplexer 1100 divides each part of the data level into data parts corresponding to each unit, and analyzes and outputs information of a bitstream related thereto in the corresponding unit. In this case, the information output from the demultiplexer 1100 may include descriptive analysis of the audio spectrum stream, quantization values and other reconstruction information, reconstruction information of the quantization spectrum, and context-dependent bit plane decoding without. information of decoding, signal type information, information of frequency-linear prediction and vector quantization, and coded bandwidth extension information.

문맥-기반 비트플레인 복호화부(1110)는 문맥을 기반으로 부호화된 비트플레인을 복호화한다. 여기서, 문맥-기반 비트플레인 복호화부(1110)는 역다중화부(1100)로부터 출력된 정보를 입력받아 허프먼 복호화(Huffman decoding)를 진행하여 주파수 스펙트럼, 코딩 밴드 모드(coding band mode) 정보, 및 스케일 팩터(scale factor)를 복원한다. 문맥-기반 비트플레인 복호화부(1110)는 Prejudice coding band mode 정보, Prejudice coding의 스케일 팩터, 및 Prejudice coding의 주파수 스펙트럼을 입력받아 코딩 밴드 모드 수치, 스케일 팩터의 decoding cosmetic 표시, 주파수 스펙트럼의 양자화 값을 출력한다.The context-based bitplane decoder 1110 decodes the bitplane encoded based on the context. Here, the context-based bitplane decoder 1110 receives the information output from the demultiplexer 1100 and performs Huffman decoding to perform frequency spectrum, coding band mode information, and Restore the scale factor. The context-based bitplane decoder 1110 receives prejudice coding band mode information, a scale factor of prejudice coding, and a frequency spectrum of prejudice coding, and receives a coding band mode value, a decoding cosmetic display of a scale factor, and a quantization value of a frequency spectrum. Output

역양자화부(1120)는 문맥-기반 비트플레인 복호화부(1110)에서 출력된 결과를 역양자화한다.The dequantizer 1120 dequantizes the result output from the context-based bitplane decoder 1110.

멀티-레졸루션 합성부(1130)는 역양자화부(1120)에서 역양자화된 신호를 수신하여, 순간적으로 변하는 신호의 오디오 스펙트럼 계수에 대하여 멀티-레졸루션(Multi-resolution)을 처리한다. 구체적으로, 멀티-레졸루션 합성부(1130)는 오디오 신호가 부호화단에서 멀티-레졸루션으로 분석된 경우에 역양자화부(1120)의 출력을 멀티-레졸루션으로 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 멀티-레졸루션 합성부(1130)는 reserve quantization spectrum/difference spectrum을 입력받아 reconstruction spectrum/difference spectrum을 출력한다.The multi-resolution synthesis unit 1130 receives the dequantized signal from the inverse quantization unit 1120 and processes multi-resolution with respect to the audio spectral coefficients of a signal that is instantaneously changed. In detail, the multi-resolution synthesis unit 1130 may improve the decoding efficiency by synthesizing the output of the inverse quantization unit 1120 into a multi-resolution when the audio signal is analyzed as a multi-resolution at the encoding end. Here, the multi-resolution synthesis unit 1130 receives a reserve quantization spectrum / difference spectrum and outputs a reconstruction spectrum / difference spectrum.

역 주파수 선형 예측 수행부(1140)는 멀티-레졸루션 합성부(1130)에서 출력된 결과와 역다중화부(1100)로부터 부호화단에서 주파수 선형 예측을 수행한 결과를 합성하고 역-벡터 양자화를 수행한다. 구체적으로, 역 주파수 선형 예측 수행부(1140)는 오디오 신호가 부호화단에서 주파수 선형 예측이 수행된 경우에 상기 주파수 선형 예측이 수행된 결과를 역양자화부(1120)의 출력 또는 멀티-레졸루션 합성부(1130)의 출력과 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 역 주파수 선형 예측 수행부(1140)은 주파수 도메인 예측 기술과 예측 계수의 벡터 양자화 기술을 채용하여 코딩 효율을 유효하게 제고하였다. 역 주파수 선형 예측 수행부(1140)은 difference spectrum 계수, 벡터의 인덱스(index)를 입력받아 MDCT 스펙트럼 계수를 출력한다.The inverse frequency linear prediction performing unit 1140 synthesizes the result output from the multi-resolution synthesis unit 1130 and the result of performing the frequency linear prediction in the encoding stage from the demultiplexing unit 1100 and performs inverse-vector quantization. . In detail, the inverse frequency linear prediction performing unit 1140 outputs the result of the frequency linear prediction when the audio signal is performed at the encoding end by the inverse quantization unit 1120 or the multi-resolution synthesis unit. The efficiency of the decoding may be improved by combining with the output of 1130. Here, the inverse frequency linear prediction performing unit 1140 effectively improves coding efficiency by employing a frequency domain prediction technique and a vector quantization technique of prediction coefficients. The inverse frequency linear prediction performing unit 1140 receives a difference spectrum coefficient and an index of a vector and outputs an MDCT spectrum coefficient.

이와 같이, 문맥-기반 비트플레인 복호화부(1110), 역양자화부(1120), 멀티-레졸루션 합성부(1130) 및 역 주파수 선형 예측 수행부(1140)는 부호화된 저주파수 밴드 신호에 대해서 복호화를 수행하므로, 저주파수 밴드 복호화부라고 할 수 있다.As such, the context-based bitplane decoder 1110, the inverse quantizer 1120, the multi-resolution synthesizer 1130, and the inverse frequency linear prediction performer 1140 decode the encoded low frequency band signal. Therefore, it can be referred to as a low frequency band decoder.

역 MDCT 적용부(1150)는 부호화단에 대한 역변환 과정으로서 멀티-레졸루션 합성부(1130) 또는 역 주파수 선형예측 수행부(1140)에서 출력된 저주파수 밴드 신호를 역 MDCT에 의해 주파수 도메인에서 시간 도메인으로 역변환한다. 여기서, 역 MDCT 적용부(1150)는 멀티-레졸루션 합성부(1130) 또는 역 주파수 선형 예측 수행부(1140)에서 역양자화한 결과로 얻은 주파수 스펙트럼 계수를 입력받아 저주파수 밴드에 해당하는 복원된 오디오 데이터로 출력한다.The inverse MDCT application unit 1150 converts the low frequency band signal output from the multi-resolution synthesis unit 1130 or the inverse frequency linear prediction unit 1140 from the frequency domain to the time domain by inverse MDCT as an inverse transform process for the encoding end. Invert Here, the inverse MDCT application unit 1150 receives the frequency spectrum coefficients obtained as a result of inverse quantization by the multi-resolution synthesis unit 1130 or the inverse frequency linear prediction execution unit 1140, and restores the audio data corresponding to the low frequency band. Will output

변환부(1160)는 역 MDCT 적용부(1150)에서 시간 도메인으로 변환된 저주파수 밴드 신호를 MDCT 이외의 변환 기법을 이용하여 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다. 변환부(1160)에서 이용하는 변환 기법의 예로 MDST, FFT, 및 QMF 등이 있다. 변환부(1160)에서 MDCT를 적용하여 실시할 수 없는 것은 아니지만 MDCT를 사용할 경우에는 도 10에 도시된 실시예가 보다 효율적이다.The transform unit 1160 converts the low frequency band signal converted into the time domain in the inverse MDCT application unit 1150 into the frequency domain or the time / frequency domain using a conversion technique other than MDCT. Examples of the conversion technique used by the transform unit 1160 include MDST, FFT, and QMF. Although the transformation unit 1160 may not implement and apply MDCT, the embodiment shown in FIG. 10 is more efficient when using MDCT.

대역폭 확장 복호화부(1170)는 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 변환부(1160)에서 출력된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성한다. 여기서, 대역폭 확장 복호화부(1170)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 복호화된 대역폭 확장 정보를 저주파수 밴드 신호에 적용함으로써 고주파 밴드 신호를 생성한다. 여기서, 대역폭 확장 정보는 고주파수 밴드의 고주파수 밴드의 특성을 나타낼 수 있는 정보로써 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 정보이다.The bandwidth extension decoder 1170 decodes the encoded bandwidth extension information and generates a high frequency band signal from the low frequency band signal output from the converter 1160 using the decoded bandwidth extension information. Here, the bandwidth extension decoder 1170 generates the high frequency band signal by applying the decoded bandwidth extension information to the low frequency band signal based on the property that there is a high correlation between the high frequency and the low frequency band of the audio signal. Here, the bandwidth extension information is information that can indicate the characteristics of the high frequency band of the high frequency band and is information such as energy level or envelope of the high frequency band signal.

역변환부(1180)는 대역폭 확장 복호화부(1170)에서 복호화된 고주파수 밴드 신호를 MDCT 이외의 변환 기법을 이용하여 주파수 도메인 또는 시간-주파수 도메인에서 시간 도메인으로 역변환한다. 여기서, 역변환부(1180)는 변환부(1160)에서 이용하는 동일한 변환 기법을 이용한다. 역변환부(1180)에서 이용하는 변환 기법의 예로 MDST, FFT, 및 QMF 등이 있다.The inverse transformer 1180 inversely transforms the high frequency band signal decoded by the bandwidth extension decoder 1170 from the frequency domain or the time-frequency domain to the time domain using a conversion technique other than MDCT. Here, the inverse transform unit 1180 uses the same transform technique used by the transform unit 1160. Examples of the conversion technique used by the inverse transform unit 1180 include MDST, FFT, and QMF.

밴드 합성부(1190)는 역 MDCT 적용부(1150)에서 시간 도메인으로 변환된 저 주파수 밴드 신호와 역변환부(1180)에서 시간 도메인으로 변환된 고주파수 밴드 신호를 합성하여 출력단자 OUT을 통해 출력한다.The band synthesizer 1190 synthesizes the low frequency band signal converted into the time domain in the inverse MDCT application unit 1150 and the high frequency band signal converted into the time domain in the inverse transform unit 1180 and outputs the same through the output terminal OUT.

도 12는 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다. 12 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 12를 참조하면, 오디오 데이터 복호화 장치는 역다중화부(1200), 문맥-기반 비트플레인 복호화부(1210), 역양자화부(1220), 멀티-레졸루션 합성부(1230), 역 주파수 선형 예측 수행부(1240), 대역폭 확장 복호화부(1250), 밴드 합성부(1260) 및 역 MDCT 적용부(1270)를 포함한다.Referring to FIG. 12, the audio data decoding apparatus performs demultiplexer 1200, context-based bitplane decoder 1210, dequantizer 1220, multi-resolution synthesizer 1230, and inverse frequency linear prediction. A unit 1240, a bandwidth extension decoder 1250, a band synthesizer 1260, and an inverse MDCT application unit 1270 are included.

역다중화부(1200)는 부호화단으로부터 출력된 비트 스트림(bit stream)을 입력받아 역다중화한다. 역다중화부(1200)는 데이터 레벨의 각 부분을 각 단위에 대응하는 데이터 부분으로 분리하며, 해당 단위에 그와 관련된 비트스트림의 정보를 분석하여 출력한다. 여기서, 역다중화부(1200)가 출력하는 정보는 오디오 스펙트럼 스트림의 설명 분석, 양자화 값과 기타 복원(reconstruction) 정보, 양자화 스펙트럼의 복원 정보, 문맥-기반 비트플레인 복호화(Context-dependent Bit plane decoding without decoding)의 정보, 신호 타입 정보(Signal type information), 주파수 선형 예측 및 벡터 양자화(Frequency-domain linear prediction and vector quantization)의 정보, 및 부호화된 대역폭 확장 정보 등이 있다.The demultiplexer 1200 demultiplexes a bit stream output from an encoding stage. The demultiplexer 1200 divides each part of the data level into data parts corresponding to each unit, and analyzes and outputs information of a bitstream related thereto in the corresponding unit. In this case, the information output from the demultiplexer 1200 includes an explanatory analysis of the audio spectrum stream, quantization values and other reconstruction information, reconstruction information of the quantization spectrum, and context-dependent bit plane decoding without information of decoding, signal type information, information of frequency-linear prediction and vector quantization, and coded bandwidth extension information.

문맥-기반 비트플레인 복호화부(1210)는 문맥을 기반으로 부호화된 비트플레인을 복호화한다. 여기서, 문맥-기반 비트플레인 복호화부(1210)는 역다중화부(1200)로부터 출력된 정보를 입력받아 허프먼 복호화(Huffman decoding)를 진행 하여 주파수 스펙트럼, 코딩 밴드 모드(coding band mode) 정보, 및 스케일 팩터(scale factor)를 복원한다. 문맥-기반 비트플레인 복호화부(1210)는 Prejudice coding band mode 정보, Prejudice coding의 스케일 팩터, 및 Prejudice coding의 주파수 스펙트럼을 입력받아 코딩 밴드 모드 수치, 스케일 팩터의 decoding cosmetic 표시, 주파수 스펙트럼의 양자화 값을 출력한다.The context-based bitplane decoder 1210 decodes the bitplane encoded based on the context. Here, the context-based bitplane decoder 1210 receives the information output from the demultiplexer 1200 and performs Huffman decoding to perform frequency spectrum, coding band mode information, and Restore the scale factor. The context-based bitplane decoder 1210 receives prejudice coding band mode information, a scale factor of prejudice coding, and a frequency spectrum of prejudice coding, and receives a coding band mode value, a decoding cosmetic display of a scale factor, and a quantization value of a frequency spectrum. Output

역양자화부(1220)는 문맥-기반 비트플레인 복호화부(1210)에서 출력된 결과를 역양자화한다.The dequantizer 1220 dequantizes the result output from the context-based bitplane decoder 1210.

멀티-레졸루션 합성부(1230)는 역양자화부(1220)에서 역양자화된 결과를 수신하여, 순간적으로 변하는 신호의 오디오 스펙트럼 계수의 멀티-레졸루션(Multi-resolution)을 처리한다. 구체적으로, 멀티-레졸루션 합성부(1230)는 오디오 신호가 부호화단에서 멀티-레졸루션으로 분석된 경우에 역양자화부(1220)의 출력을 멀티-레졸루션으로 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 멀티-레졸루션 합성부(1230)는 reserve quantization spectrum/difference spectrum을 입력받아 reconstruction spectrum/difference spectrum을 출력한다.The multi-resolution synthesis unit 1230 receives the dequantized result from the inverse quantization unit 1220 and processes multi-resolution of the audio spectral coefficients of a signal that is instantaneously changed. In detail, the multi-resolution synthesis unit 1230 may improve the decoding efficiency by synthesizing the output of the dequantization unit 1220 into multi-resolution when the audio signal is analyzed as multi-resolution at the encoding end. Here, the multi-resolution synthesis unit 1230 receives a reserve quantization spectrum / difference spectrum and outputs a reconstruction spectrum / difference spectrum.

역 주파수 선형 예측 수행부(1240)는 멀티-레졸루션 합성부(1230)에서 출력된 결과와 역다중화부(1200)로부터 입력받은 부호화단에서 주파수 선형 예측을 수행한 결과를 합성하고 역-벡터 양자화를 수행한다. 구체적으로, 역 주파수 선형 예측 수행부(1240)는 오디오 신호가 부호화단에서 주파수 선형 예측이 수행된 경우에 상기 주파수 선형 예측이 수행된 결과를 역양자화부(1220)의 출력 또는 멀티-레졸루션 합성부(1230)의 출력과 합성하여 복호화의 효율을 향상시킬 수 있다. 여기서, 역 주파수 선형 예측 수행부(1240)은 주파수 도메인 예측 기술과 예측 계수의 벡터 양자화 기술을 채용하여 코딩 효율을 유효하게 제고하였다. 역 주파수 선형 예측 수행부(1240)은 difference spectrum 계수, 벡터의 인덱스(index)를 입력받아 MDCT 스펙트럼 계수를 출력한다.The inverse frequency linear prediction performing unit 1240 synthesizes the result output from the multi-resolution synthesis unit 1230 and the result of performing the frequency linear prediction in the coding stage input from the demultiplexing unit 1200 and performs inverse-vector quantization. Perform. In detail, the inverse frequency linear prediction execution unit 1240 outputs the result of the frequency linear prediction when the audio signal is encoded in the encoding stage by the inverse quantization unit 1220 or the multi-resolution synthesis unit. The efficiency of decoding can be improved by combining with the output of 1230. Here, the inverse frequency linear prediction performing unit 1240 effectively improves coding efficiency by employing a frequency domain prediction technique and a vector quantization technique of prediction coefficients. The inverse frequency linear prediction performing unit 1240 receives a difference spectrum coefficient and an index of a vector and outputs an MDCT spectrum coefficient.

이와 같이, 문맥-기반 비트플레인 복호화부(1210), 역양자화부(1220), 멀티-레졸루션 합성부(1230) 및 역 주파수 선형 예측 수행부(1240)는 부호화된 저주파수 밴드 신호에 대해서 복호화를 수행하므로, 저주파수 밴드 복호화부라고 할 수 있다.As such, the context-based bitplane decoder 1210, the inverse quantizer 1220, the multi-resolution synthesis unit 1230, and the inverse frequency linear prediction performer 1240 perform decoding on the encoded low frequency band signal. Therefore, it can be referred to as a low frequency band decoder.

대역폭 확장 복호화부(1250)는 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 멀티-레졸루션 합성부(1230) 또는 역 주파수 선형 예측 수행부(1240)에서 출력된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성한다. 여기서, 대역폭 확장 복호화부(1250)는 오디오 신호의 고주파수와 저주파수 밴드 사이에 높은 연관성이 존재한다는 특성에 기초하여 복호화된 대역폭 확장 정보를 저주파수 밴드 신호에 적용함으로써 고주파 밴드 신호를 생성한다. 여기서, 대역폭 확장 정보는 고주파수 밴드의 고주파수 밴드의 특성을 나타낼 수 있는 정보로써 고주파수 밴드 신호에 대한 에너지 레벨 또는 포락선 등의 정보이다.The bandwidth extension decoder 1250 decodes the encoded bandwidth extension information and uses the decoded bandwidth extension information from the low frequency band signal output from the multi-resolution synthesis unit 1230 or the inverse frequency linear prediction performer 1240. Generate a high frequency band signal. Here, the bandwidth extension decoder 1250 generates the high frequency band signal by applying the decoded bandwidth extension information to the low frequency band signal based on the property that there is a high correlation between the high frequency and the low frequency band of the audio signal. Here, the bandwidth extension information is information that can indicate the characteristics of the high frequency band of the high frequency band and is information such as energy level or envelope of the high frequency band signal.

밴드 합성부(1260)는 멀티-레졸루션 합성부(1230) 또는 역 주파수 선형 예측 수행부(1240)에서 출력된 저주파수 밴드 신호와 대역폭 확장 복호화부(1250)에서 생성된 고주파수 밴드 신호를 합성한다.The band synthesizer 1260 synthesizes the low frequency band signal output from the multi-resolution synthesis unit 1230 or the inverse frequency linear prediction performer 1240 and the high frequency band signal generated by the bandwidth extension decoder 1250.

역 MDCT 적용부(1270)는 밴드 합성부(1260)에서 출력된 신호를 역 MDCT(inverse MDCT)에 의해 주파수 도메인에서 시간 도메인으로 역변환하여 출력단자 OUT을 통해 출력한다. 여기서, 역 MDCT 적용부(1270)는 역 주파수 선형 예측 수행부(1240)에서 역양자화한 결과로 얻은 주파수 스펙트럼 계수를 입력받아 저주파수 밴드에 해당하는 복원된 오디오 데이터로 출력한다.The inverse MDCT application unit 1270 converts the signal output from the band synthesizing unit 1260 from the frequency domain to the time domain by inverse MDCT and outputs it through the output terminal OUT. Here, the inverse MDCT application unit 1270 receives the frequency spectrum coefficients obtained as the result of inverse quantization by the inverse frequency linear prediction performing unit 1240 and outputs the restored audio data corresponding to the low frequency band.

도 13은 본 발명의 일 실시예에 따른 오디오 신호의 부호화 방법을 나타내는 흐름도이다. 13 is a flowchart illustrating a method of encoding an audio signal according to an embodiment of the present invention.

도 13을 참조하면, 본 실시예에 따른 오디오 신호의 부호화 방법은 도 4에 도시된 오디오 신호의 부호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 4에 도시된 오디오 신호의 부호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 부호화 방법에도 적용된다.Referring to FIG. 13, the audio signal encoding method according to the present embodiment includes steps that are processed in time series in the audio signal encoding apparatus shown in FIG. 4. Therefore, even if omitted below, the above description of the audio signal encoding apparatus shown in FIG. 4 is also applied to the audio signal encoding method according to the present embodiment.

1300 단계에서 밴드 분할부(400)는 입력 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할한다.In operation 1300, the band divider 400 divides the input signal into a high frequency band signal and a low frequency band signal.

1310 단계에서 제1 및 제2 MDCT 적용부(410, 460)는 고주파수 밴드 신호 및 저주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인으로 변환한다. 여기서, 상기 제1 및 제2 MDCT 적용부(410, 460)는 고주파수 밴드 신호 및 저주파수 밴드 신호에 대하여 MDCT(Modified Discrete Cosine Transform)를 수행하여 각각 시간 도메인에서 주파수 도메인으로 변환할 수 있다.In operation 1310, the first and second MDCT applying units 410 and 460 convert the high frequency band signal and the low frequency band signal from the time domain to the frequency domain, respectively. Herein, the first and second MDCT applying units 410 and 460 may perform a Modified Discrete Cosine Transform (MDCT) on the high frequency band signal and the low frequency band signal to convert the time domain into the frequency domain.

1320 단계에서 저주파수 밴드 부호화부는 변환된 저주파수 밴드 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화한다. 여기서, 저주파수 밴드 부호 화부는 주파수 선형 예측 수행부(420), 멀티-레졸루션 분석부(430), 양자화부(440) 및 문맥-기반 비트플레인 부호화부(450)를 포함할 수 있다. 구체적으로, 주파수 선형 예측 수행부(420)는 변환된 저주파수 밴드 신호의 특성에 따라 주파수 선형 예측을 수행하여 필터링하고, 멀티-레졸루션 분석부(430)는 변환된 저주파수 밴드 신호 또는 필터링된 신호의 특성에 따라 멀티-레졸루션(multi-resolution)으로 분석한다. 또한, 양자화부(440)는 멀티-레졸루션으로 분석된 신호를 양자화하고, 문맥-기반 비트플레인 부호화부(450)는 양자화된 신호를 문맥을 기반으로 하여 비트플레인으로 부호화한다.In operation 1320, the low frequency band encoder quantizes the converted low frequency band signal and encodes the bit plane based on a context. Here, the low frequency band coder may include a frequency linear prediction performer 420, a multi-resolution analyzer 430, a quantizer 440, and a context-based bitplane encoder 450. In detail, the frequency linear prediction performing unit 420 performs the filtering by performing the frequency linear prediction according to the characteristics of the converted low frequency band signal, and the multi-resolution analyzer 430 performs the characteristics of the converted low frequency band signal or the filtered signal. According to the multi-resolution analysis. In addition, the quantization unit 440 quantizes the signal analyzed by the multi-resolution, and the context-based bitplane encoder 450 encodes the quantized signal into the bitplane based on the context.

1330 단계에서 대역폭 확장 부호화부(470)는 변환된 저주파수 밴드 신호를 이용하여 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화한다.In operation 1330, the bandwidth extension encoder 470 generates and encodes bandwidth extension information indicating a characteristic of the converted high frequency band signal using the converted low frequency band signal.

1340 단계에서 다중화부(480)는 부호화된 비트플레인 및 부호화된 대역폭 확장 정보를 다중화하여 입력 신호에 대한 부호화 결과로써 출력한다.In operation 1340, the multiplexer 480 multiplexes the encoded bitplane and the encoded bandwidth extension information and outputs the result of encoding the input signal.

도 14는 본 발명의 다른 실시예에 따른 오디오 신호의 부호화 방법을 나타내는 흐름도이다.14 is a flowchart illustrating a method of encoding an audio signal according to another embodiment of the present invention.

도 14를 참조하면, 본 실시예에 따른 오디오 신호의 부호화 방법은 도 5에 도시된 오디오 신호의 부호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 5에 도시된 오디오 신호의 부호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 부호화 방법에도 적용된다.Referring to FIG. 14, the method of encoding an audio signal according to the present embodiment includes the steps of time-series processing in the apparatus for encoding an audio signal illustrated in FIG. 5. Therefore, even if omitted below, the above description of the audio signal encoding apparatus shown in FIG. 5 is also applied to the audio signal encoding method according to the present embodiment.

1400 단계에서 밴드 분할부(500)는 입력 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할한다.In operation 1400, the band divider 500 divides the input signal into a high frequency band signal and a low frequency band signal.

1410 단계에서 MDCT 적용부(510)는 저주파수 밴드 신호에 대하여 MDCT를 수행하여 시간 도메인에서 주파수 도메인으로 변환한다.In step 1410, the MDCT application unit 510 performs an MDCT on the low frequency band signal to convert from the time domain to the frequency domain.

1420 단계에서 저주파수 밴드 부호화부는 MDCT가 수행된 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화한다. 여기서, 저주파수 밴드 부호화부는 주파수 선형 예측 수행부(520), 멀티-레졸루션 분석부(530), 양자화부(540) 및 문맥-기반 비트플레인 부호화부(550)를 포함할 수 있다. 구체적으로, 주파수 선형 예측 수행부(520)는 변환된 저주파수 밴드 신호의 특성에 따라 주파수 선형 예측을 수행하여 필터링하고, 멀티-레졸루션 분석부(530)는 변환된 저주파수 밴드 신호 또는 필터링된 신호의 특성에 따라 멀티-레졸루션(multi-resolution)으로 분석한다. 또한, 양자화부(540)는 멀티-레졸루션으로 분석된 신호를 양자화하고, 문맥-기반 비트플레인 부호화부(550)는 양자화된 신호를 문맥을 기반으로 하여 비트플레인으로 부호화한다.In step 1420, the low frequency band encoder quantizes the signal on which the MDCT is performed and encodes the bit plane based on the context. Here, the low frequency band encoder may include a frequency linear prediction performer 520, a multi-resolution analyzer 530, a quantizer 540, and a context-based bitplane encoder 550. Specifically, the frequency linear prediction performing unit 520 performs the filtering by performing the frequency linear prediction according to the characteristics of the converted low frequency band signal, and the multi-resolution analyzer 530 performs the characteristics of the converted low frequency band signal or the filtered signal. According to the multi-resolution analysis. In addition, the quantization unit 540 quantizes the signal analyzed by the multi-resolution, and the context-based bitplane encoder 550 encodes the quantized signal into the bitplane based on the context.

1430 단계에서 저주파수 밴드 변환부(560) 및 고주파수 밴드 변환부(570)는 고주파수 밴드 신호 및 저주파수 밴드 신호를 각각 시간 도메인에서 주파수 도메인 또는 시간/주파수 도메인으로 변환한다.In operation 1430, the low frequency band converter 560 and the high frequency band converter 570 convert the high frequency band signal and the low frequency band signal from the time domain to the frequency domain or the time / frequency domain, respectively.

1440 단계에서 대역폭 확장 부호화부(580)는 변환된 저주파수 밴드 신호를 이용하여 변환된 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화한다.In operation 1440, the bandwidth extension encoder 580 generates and encodes bandwidth extension information indicating a characteristic of the converted high frequency band signal using the converted low frequency band signal.

1450 단계에서 다중화부(590)는 부호화된 비트플레인 및 부호화된 대역폭 확장 정보를 다중화하여 입력 신호에 대한 부호화 결과로써 출력한다.In operation 1450, the multiplexer 590 multiplexes the encoded bitplane and the encoded bandwidth extension information and outputs the result of encoding the input signal.

도 15는 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 방법을 나타내는 흐름도이다.15 is a flowchart illustrating a method of encoding an audio signal according to another embodiment of the present invention.

도 15를 참조하면, 본 실시예에 따른 오디오 신호의 부호화 방법은 도 6에 도시된 오디오 신호의 부호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 6에 도시된 오디오 신호의 부호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 부호화 방법에도 적용된다.Referring to FIG. 15, an audio signal encoding method according to the present embodiment includes steps that are processed in time series in an audio signal encoding apparatus of FIG. 6. Therefore, even if omitted below, the above description of the audio signal encoding apparatus shown in FIG. 6 is also applied to the audio signal encoding method according to the present embodiment.

1500 단계에서 MDCT 적용부(600)는 입력 신호를 시간 도메인에서 주파수 도메인으로 변환한다. 구체적으로, MDCT 적용부(600)는 입력 신호에 대하여 MDCT를 수행하여 입력 신호를 시간 도메인에서 주파수 도메인으로 변환할 수 있다.In operation 1500, the MDCT applying unit 600 converts the input signal from the time domain to the frequency domain. In detail, the MDCT applying unit 600 may perform an MDCT on the input signal to convert the input signal from the time domain to the frequency domain.

1510 단계에서 밴드 분할부(610)는 변환된 입력 신호를 고주파수 밴드 신호 및 저주파수 밴드 신호로 분할한다.In operation 1510, the band divider 610 divides the converted input signal into a high frequency band signal and a low frequency band signal.

1520 단계에서 저주파수 밴드 부호화부는 저주파수 밴드 신호를 양자화하여 문맥을 기반으로 비트플레인으로 부호화한다. 여기서, 저주파수 밴드 부호화부는 주파수 선형 예측 수행부(620), 멀티-레졸루션 분석부(630), 양자화부(640) 및 문맥-기반 비트플레인 부호화부(650)를 포함할 수 있다. 구체적으로, 주파수 선형 예측 수행부(620)는 변환된 저주파수 밴드 신호의 특성에 따라 주파수 선형 예측을 수행하여 필터링하고, 멀티-레졸루션 분석부(630)는 변환된 저주파수 밴드 신호 또 는 필터링된 신호의 특성에 따라 멀티-레졸루션(multi-resolution)으로 분석한다. 또한, 양자화부(640)는 멀티-레졸루션으로 분석된 신호를 양자화하고, 문맥-기반 비트플레인 부호화부(650)는 양자화된 신호를 문맥을 기반으로 하여 비트플레인으로 부호화한다.In operation 1520, the low frequency band encoder quantizes the low frequency band signal and encodes the bit frequency based on the context. Here, the low frequency band encoder may include a frequency linear prediction performer 620, a multi-resolution analyzer 630, a quantizer 640, and a context-based bitplane encoder 650. Specifically, the frequency linear prediction performing unit 620 performs the frequency linear prediction according to the characteristics of the converted low frequency band signal, and filters the multi-resolution analyzer 630 of the converted low frequency band signal or the filtered signal. Depending on the characteristics, it is analyzed by multi-resolution. In addition, the quantization unit 640 quantizes the signal analyzed by the multi-resolution, and the context-based bitplane encoder 650 encodes the quantized signal into the bitplane based on the context.

1530 단계에서 대역폭 확장 부호화부(660)는 저주파수 밴드 신호를 이용하여 고주파수 밴드 신호의 특성을 나타내는 대역폭 확장 정보를 생성하여 부호화한다.In operation 1530, the bandwidth extension encoder 660 generates and encodes bandwidth extension information indicating characteristics of the high frequency band signal using the low frequency band signal.

1540 단계에서 다중화부(670)는 부호화된 비트플레인 및 부호화된 대역폭 확장 정보를 다중화하여 입력 신호에 대한 부호화 결과로써 출력한다.In operation 1540, the multiplexer 670 multiplexes the encoded bitplane and the encoded bandwidth extension information and outputs the result of encoding the input signal.

도 16은 본 발명의 일 실시예에 따른 오디오 신호의 복호화 방법을 나타내는 흐름도이다.16 is a flowchart illustrating a method of decoding an audio signal according to an embodiment of the present invention.

도 16을 참조하면, 본 실시예에 따른 오디오 신호의 복호화 방법은 도 10에 도시된 오디오 신호의 복호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 10에 도시된 오디오 신호의 복호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 복호화 방법에도 적용된다.Referring to FIG. 16, an audio signal decoding method according to the present embodiment includes steps processed in time series in an audio signal decoding apparatus illustrated in FIG. 10. Therefore, even if omitted below, the above description of the audio signal decoding apparatus shown in FIG. 10 is also applied to the audio signal decoding method according to the present embodiment.

1600 단계에서 역다중화부(1000)는 오디오 신호가 부호화된 결과를 입력받는다. 여기서, 부호화된 결과는 저주파수 밴드에 대하여 문맥을 기반으로 부호화된 비트플레인 및 부호화된 대역폭 확장 정보 등을 포함한다.In operation 1600, the demultiplexer 1000 receives a result of encoding the audio signal. Here, the encoded result includes a bit plane encoded based on the context and encoded bandwidth extension information for the low frequency band.

1610 단계에서 저주파수 밴드 복호화부는 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역다중화하여 저주파수 밴드 신호를 생성한다. 여기서, 저주파 수 밴드 복호화부는 문맥-기반 비트플레인 복호화부(1010), 역양자화부(1020), 멀티-레졸루션 합성부(1030) 및 역 주파수 선형 예측 수행부(1040)를 포함할 수 있다. 구체적으로, 문맥-기반 비트플레인 복호화부(1010)는 저주파수 밴드 신호에 대하여 문맥을 기반으로 부호화된 비트플레인을 복호화하고, 역양자화부(1020)는 복호화된 신호를 역양자화한다. 또한, 멀티-레졸루션 합성부(1030)는 부호화단에서 멀티-레졸루션 분석이 수행된 경우 역양자화된 신호를 멀티-레졸루션으로 합성하고, 역 주파수 선형 예측 수행부(1040)는 부호화단에서 주파수 선형 예측이 수행된 경우 벡터 인덱스를 이용하여 부호화단에서 주파수 선형 예측이 수행된 결과를 양자화된 신호 또는 멀티-레졸루션 합성된 신호에 합성한다.In operation 1610, the low frequency band decoder generates a low frequency band signal by decoding and demultiplexing the encoded bitplane based on a context. Here, the low frequency band decoder may include a context-based bitplane decoder 1010, an inverse quantizer 1020, a multi-resolution synthesizer 1030, and an inverse frequency linear prediction performer 1040. In detail, the context-based bitplane decoder 1010 decodes the bitplane encoded based on the context with respect to the low frequency band signal, and the dequantizer 1020 dequantizes the decoded signal. In addition, when the multi-resolution analysis is performed in the encoding stage, the multi-resolution synthesis unit 1030 synthesizes the dequantized signal into multi-resolution, and the inverse frequency linear prediction execution unit 1040 performs frequency linear prediction in the encoding stage. In this case, the result of performing the frequency linear prediction in the encoding end using the vector index is synthesized into a quantized signal or a multi-resolution synthesized signal.

1620 단계에서 대역폭 확장 복호화부(1050)는 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성한다.In operation 1620, the bandwidth extension decoder 1050 decodes the encoded bandwidth extension information and generates a high frequency band signal from the low frequency band signal using the decoded bandwidth extension information.

1630 단계에서 제1 및 제2 역 MDCT 적용부(1060, 1070)는 저주파수 밴드 신호 및 고주파수 밴드 신호 각각에 대하여 역 MDCT를 수행하여 주파수 도메인에서 시간 도메인으로 변환한다.In operation 1630, the first and second inverse MDCT applying units 1060 and 1070 perform inverse MDCT on each of the low frequency band signal and the high frequency band signal to convert from the frequency domain to the time domain.

1640 단계에서 밴드 합성부(1080)는 변환된 저주파수 밴드 신호 및 변환된 고주파수 밴드 신호를 합성한다.In operation 1640, the band synthesis unit 1080 synthesizes the converted low frequency band signal and the converted high frequency band signal.

도 17은 본 발명의 다른 실시예에 따른 오디오 신호의 복호화 방법을 나타내는 흐름도이다.17 is a flowchart illustrating a method of decoding an audio signal according to another embodiment of the present invention.

도 17을 참조하면, 본 실시예에 따른 오디오 신호의 복호화 방법은 도 11에 도시된 오디오 신호의 복호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 11에 도시된 오디오 신호의 복호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 복호화 방법에도 적용된다.Referring to FIG. 17, an audio signal decoding method according to the present embodiment includes steps that are processed in time series in an audio signal decoding apparatus illustrated in FIG. 11. Therefore, even if omitted below, the above description of the audio signal decoding apparatus shown in FIG. 11 is also applied to the audio signal decoding method according to the present embodiment.

1700 단계에서 역다중화부(1100)는 오디오 신호가 부호화된 결과를 입력받는다. 여기서, 부호화된 결과는 저주파수 밴드에 대하여 문맥을 기반으로 부호화된 비트플레인 및 부호화된 대역폭 확장 정보 등을 포함한다.In operation 1700, the demultiplexer 1100 receives a result of encoding an audio signal. Here, the encoded result includes a bit plane encoded based on the context and encoded bandwidth extension information for the low frequency band.

1710 단계에서 저주파수 밴드 복호화부는 부호화된 비트플레인을 문맥을 기반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성한다. 여기서, 저주파수 밴드 복호화부는 문맥-기반 비트플레인 복호화부(1110), 역양자화부(1120), 멀티-레졸루션 합성부(1130) 및 역 주파수 선형 예측 수행부(1140)를 포함할 수 있다. 구체적으로, 문맥-기반 비트플레인 복호화부(1110)는 저주파수 밴드 신호에 대하여 문맥을 기반으로 부호화된 비트플레인을 복호화하고, 역양자화부(1120)는 복호화된 신호를 역양자화한다. 또한, 멀티-레졸루션 합성부(1130)는 부호화단에서 멀티-레졸루션 분석이 수행된 경우 역양자화된 신호를 멀티-레졸루션으로 합성한다. 그리고, 역 주파수 선형 예측 수행부(1140)는 부호화단에서 주파수 선형 예측이 수행된 경우 벡터 인덱스를 이용하여 부호화단에서 주파수 선형 예측이 수행된 결과를 역양자화된 신호 또는 멀티-레졸루션 합성된 신호에 합성한다.In operation 1710, the low frequency band decoder generates a low frequency band signal by decoding and inverse quantizing the encoded bitplane based on a context. Here, the low frequency band decoder may include a context-based bitplane decoder 1110, an inverse quantizer 1120, a multi-resolution synthesizer 1130, and an inverse frequency linear prediction performer 1140. In detail, the context-based bitplane decoder 1110 decodes the bitplane encoded based on the context with respect to the low frequency band signal, and the inverse quantizer 1120 dequantizes the decoded signal. In addition, the multi-resolution synthesis unit 1130 synthesizes the dequantized signal into a multi-resolution when the multi-resolution analysis is performed at the encoding end. In addition, when frequency linear prediction is performed at the encoding end, the inverse frequency linear prediction performing unit 1140 may convert the result of frequency linear prediction at the encoding end into a dequantized signal or a multi-resolution synthesized signal using a vector index. Synthesize

1720 단계에서 역 MDCT 적용부(1150)는 저주파수 밴드 신호에 대하여 역 MDCT를 수행하여 주파수 도메인에서 시간 도메인으로 변환한다.In operation 1720, the inverse MDCT application unit 1150 performs inverse MDCT on the low frequency band signal to convert from the frequency domain to the time domain.

1730 단계에서 변환부(1160)는 역 MDCT가 수행된 저주파수 밴드 신호를 주파수 도메인 또는 시간/주파수 도메인으로 변환한다.In operation 1730, the converter 1160 converts the low frequency band signal in which the inverse MDCT is performed into the frequency domain or the time / frequency domain.

1740 단계에서 대역폭 확장 복호화부(1170)는 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 주파수 도메인 또는 시간/주파수 도메인으로 변환된 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성한다.In operation 1740, the bandwidth extension decoder 1170 decodes the encoded bandwidth extension information and generates a high frequency band signal from the low frequency band signal converted into the frequency domain or the time / frequency domain using the decoded bandwidth extension information.

1750 단계에서 역변환부(1180)는 고주파수 밴드 신호를 시간 도메인으로 역변환한다.In operation 1750, the inverse transformer 1180 inversely converts the high frequency band signal into the time domain.

1760 단계에서 밴드 합성부(1190)는 역변환된 고주파수 밴드 신호 및 변환된 저주파수 밴드 신호를 합성한다.In operation 1760, the band synthesizer 1190 synthesizes the inversely transformed high frequency band signal and the converted low frequency band signal.

도 18은 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 방법을 나타내는 흐름도이다.18 is a flowchart illustrating a method of decoding an audio signal according to another embodiment of the present invention.

도 18을 참조하면, 본 실시예에 따른 오디오 신호의 복호화 방법은 도 12에 도시된 오디오 신호의 복호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 12에 도시된 오디오 신호의 복호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 복호화 방법에도 적용된다.Referring to FIG. 18, an audio signal decoding method according to the present embodiment includes steps processed in time series in an audio signal decoding apparatus shown in FIG. 12. Therefore, even if omitted below, the above description of the audio signal decoding apparatus shown in FIG. 12 is also applied to the audio signal decoding method according to the present embodiment.

1800 단계에서 역다중화부(1200)는 오디오 신호가 부호화된 결과를 입력받는다. 여기서, 부호화된 결과는 저주파수 밴드에 대하여 문맥을 기반으로 부호화된 비트플레인 및 부호화된 대역폭 확장 정보 등을 포함한다.In operation 1800, the demultiplexer 1200 receives a result of encoding an audio signal. Here, the encoded result includes a bit plane encoded based on the context and encoded bandwidth extension information for the low frequency band.

1810 단계에서 저주파수 밴드 복호화부는 부호화된 비트플레인을 문맥을 기 반으로 복호화하고 역양자화하여 저주파수 밴드 신호를 생성한다. 여기서, 저주파수 밴드 복호화부는 문맥-기반 비트플레인 복호화부(1210), 역양자화부(1220), 멀티-레졸루션 합성부(1230) 및 역 주파수 선형 예측 수행부(1240)를 포함할 수 있다. 구체적으로, 문맥-기반 비트플레인 복호화부(1210)는 저주파수 밴드 신호에 대하여 문맥을 기반으로 부호화된 비트플레인을 복호화하고, 역양자화부(1220)는 복호화된 신호를 역양자화한다. 또한, 멀티-레졸루션 합성부(1230)는 부호화단에서 멀티-레졸루션 분석이 수행된 경우 역양자화된 신호를 멀티-레졸루션으로 합성한다. 그리고, 역 주파수 선형 예측 수행부(1240)는 부호화단에서 주파수 선형 예측이 수행된 경우 벡터 인덱스를 이용하여 부호화단에서 주파수 선형 예측이 수행된 결과를 역양자화된 신호 또는 멀티-레졸루션 합성된 신호에 합성한다.In step 1810, the low frequency band decoder generates a low frequency band signal by decoding and inverse quantizing the encoded bitplane based on the context. Here, the low frequency band decoder may include a context-based bitplane decoder 1210, an inverse quantizer 1220, a multi-resolution synthesizer 1230, and an inverse frequency linear prediction performer 1240. In detail, the context-based bitplane decoder 1210 decodes the bitplane encoded based on the context with respect to the low frequency band signal, and the dequantizer 1220 dequantizes the decoded signal. In addition, the multi-resolution synthesis unit 1230 synthesizes the dequantized signal into a multi-resolution when the multi-resolution analysis is performed at the encoding end. In addition, when frequency linear prediction is performed at the encoding end, the inverse frequency linear prediction performing unit 1240 may convert the result of frequency linear prediction at the encoding end to a dequantized signal or a multi-resolution synthesized signal using a vector index. Synthesize

1820 단계에서 대역폭 확장 복호화부(1250)는 부호화된 대역폭 확장 정보를 복호화하고, 복호화된 대역폭 확장 정보를 이용하여 저주파수 밴드 신호로부터 고주파수 밴드 신호를 생성한다.In operation 1820, the bandwidth extension decoder 1250 decodes the encoded bandwidth extension information and generates a high frequency band signal from the low frequency band signal using the decoded bandwidth extension information.

1830 단계에서 밴드 합성부(1260)는 저주파수 밴드 신호 및 고주파수 밴드 신호를 합성한다.In operation 1830, the band synthesizer 1260 synthesizes the low frequency band signal and the high frequency band signal.

1840 단계에서 역 MDCT 적용부(1270)는 합성된 신호에 대하여 역 MDCT를 수행하여 주파수 도메인에서 시간 도메인으로 변환한다.In operation 1840, the inverse MDCT applying unit 1270 performs inverse MDCT on the synthesized signal and converts the frequency domain from the frequency domain to the time domain.

본 발명은 상술한 실시예에 한정되지 않으며, 본 발명의 사상 내에서 당업자에 의한 변형이 가능함은 물론이다.The present invention is not limited to the above-described embodiment, and of course, modifications may be made by those skilled in the art within the spirit of the present invention.

본 발명은 또한 컴퓨터로 읽을 수 있는 기록매체에 컴퓨터가 읽을 수 있는 코드로서 구현하는 것이 가능하다. 컴퓨터가 읽을 수 있는 기록매체는 컴퓨터 시스템에 의하여 읽혀질 수 있는 데이터가 저장되는 모든 종류의 기록장치를 포함한다. 컴퓨터가 읽을 수 있는 기록매체의 예로는 ROM, RAM, CD-ROM, 자기 테이프, 하드디스크, 플로피디스크, 플래쉬 메모리, 광 데이터 저장장치 등이 있으며, 또한 캐리어 웨이브(예를 들어 인터넷을 통한 전송)의 형태로 구현되는 것도 포함한다. 또한 컴퓨터가 읽을 수 있는 기록매체는 네트워크로 연결된 컴퓨터 시스템에 분산되어, 분산방식으로 컴퓨터가 읽을 수 있는 코드로서 저장되고 실행될 수 있다.The invention can also be embodied as computer readable code on a computer readable recording medium. The computer-readable recording medium includes all kinds of recording devices in which data that can be read by a computer system is stored. Examples of computer-readable recording media include ROM, RAM, CD-ROM, magnetic tape, hard disk, floppy disk, flash memory, optical data storage device, and also carrier waves (for example, transmission over the Internet). It also includes the implementation in the form of. The computer readable recording medium can also be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion.

도 1은 본 발명의 일 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다. 1 is a block diagram illustrating an apparatus for encoding an audio signal according to an embodiment of the present invention.

도 5는 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다.5 is a block diagram illustrating an apparatus for encoding an audio signal according to another embodiment of the present invention.

도 6은 본 발명의 또 다른 실시예에 따른 오디오 신호의 부호화 장치를 나타내는 블록도이다.6 is a block diagram illustrating an apparatus for encoding an audio signal according to another embodiment of the present invention.

도 7은 본 발명의 일 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다.7 is a block diagram illustrating an apparatus for decoding an audio signal according to an embodiment of the present invention.

도 8은 본 발명의 다른 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다.8 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 9는 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다.9 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 10은 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 장치를 나 타내는 블록도이다.10 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 12는 본 발명의 또 다른 실시예에 따른 오디오 신호의 복호화 장치를 나타내는 블록도이다.12 is a block diagram illustrating an apparatus for decoding an audio signal according to another embodiment of the present invention.

도 13는 본 발명의 일 실시예에 따른 오디오 신호의 부호화 방법을 나타내는 흐름도이다. 13 is a flowchart illustrating a method of encoding an audio signal according to an embodiment of the present invention.

Claims

(a) dividing an input signal into a high frequency band signal and a low frequency band signal;

(b) converting the high frequency band signal and the low frequency band signal from the time domain to the frequency domain, respectively;

(c) quantizing the converted low frequency band signal and encoding the bitplane based on a context;

(d) generating and encoding bandwidth extension information indicating a characteristic of the converted high frequency band signal using the converted low frequency band signal; And

and (e) outputting the encoded bitplane and the encoded bandwidth extension information as an encoding result of the input signal.

The method of claim 1,

Step (b) is

And encoding the high frequency band signal and the low frequency band signal by performing a Modified Discrete Cosine Transform (MDCT) to transform them from the time domain to the frequency domain.

The method of claim 1,

(f) performing frequency linear prediction on the transformed low frequency band signal and filtering the result; And

(g) further analyzing at least one of the converted low frequency band signals with multi-resolution,

Step (c) is

And encoding the filtered signal or the signal analyzed by the multi-resolution by bit quantization based on a context.

The method of claim 3,

Step (f)

A coefficient of a linear prediction filter is calculated by performing frequency linear prediction on the transformed low frequency band signal, and a value corresponding to the coefficient is expressed as a vector index.

Step (e) is

And encoding the encoded bitplane, the encoded bandwidth extension information, and the vector index as an encoding result of the input signal.

A computer-readable recording medium having recorded thereon a program for executing an audio signal encoding method according to any one of claims 1 to 4.

(b) performing MDCT on the low frequency band signal and converting from the time domain to the frequency domain;

(c) quantizing the signal on which the MDCT is performed and encoding the bitplane based on a context;

(d) converting the high frequency band signal and the low frequency band signal from time domain to frequency domain or time / frequency domain, respectively;

(e) generating and encoding bandwidth extension information representing a characteristic of the converted high frequency band signal using the converted low frequency band signal; And

(f) outputting the encoded bitplane and the encoded bandwidth extension information as an encoding result of the input signal.

The method of claim 6,

Step (c) is

(g) performing frequency linear prediction on the signal on which the MDCT is performed and filtering the signal; And

(h) further analyzing at least one step of analyzing the signal on which the MDCT has been performed by multi-resolution,

Step (c) is

And encoding the filtered signal or the signal analyzed by the multi-resolution to a bit plane based on a context.

The method of claim 7, wherein

Step (g)

A coefficient of a linear prediction filter is calculated by performing frequency linear prediction on the low frequency band signal on which the MDCT is performed, and a value corresponding to the coefficient is expressed as a vector index.

Step (f)

A computer-readable recording medium having recorded thereon a program for executing the method of encoding an audio signal according to any one of claims 6 to 8.

(a) converting an input signal from time domain to frequency domain;

(b) dividing the converted input signal into a high frequency band signal and a low frequency band signal;

(c) quantizing the low frequency band signal and encoding the bitplane based on a context;

(d) generating and encoding bandwidth extension information representing the characteristics of the high frequency band signal using the low frequency band signal; And

The method of claim 10,

Step (a) is

And performing an MDCT on the input signal to convert the input signal from a time domain to a frequency domain.

The method of claim 10,

(f) performing frequency linear prediction on the low frequency band signal and filtering the result; And

(g) further analyzing at least one of the low frequency band signals by multi-resolution,

Step (c) is

The method of claim 12,

Step (f)

Calculating a coefficient of a linear prediction filter by performing frequency linear prediction on the low frequency band signal, and expressing a value corresponding to the coefficient as a vector index,

Step (e) is

A computer-readable recording medium having recorded thereon a program for executing an audio signal encoding method according to any one of claims 10 to 13.

(a) receiving an encoding result of an audio signal;

(b) generating a low frequency band signal by decoding and inverse quantizing the encoded bitplane included in the encoding result based on a context;

(c) decoding the encoded bandwidth extension information included in the encoding result and generating a high frequency band signal from the low frequency band signal using the decoded bandwidth extension information;

(d) performing an Inverse Modified Discrete Cosine Transform (MDCT) on each of the low frequency band signal and the high frequency band signal to convert from the frequency domain to the time domain; And

(e) synthesizing the converted low frequency band signal and the converted high frequency band signal.

The method of claim 15,

Step (b) is

Synthesizing the dequantized signal into a multi-resolution; And

And synthesizing a result of the frequency linear prediction performed by the encoding end into the dequantized signal by using the vector index included in the encoding result.

A computer-readable recording medium having recorded thereon a program for executing a method for decoding an audio signal according to any one of claims 15 and 16.

(a) receiving an encoding result of an audio signal;

(c) performing inverse MDCT on the low frequency band signal and converting from the frequency domain to the time domain;

(d) converting the low frequency band signal subjected to the inverse MDCT into a frequency domain or a time / frequency domain;

(e) decoding the encoded bandwidth extension information included in the encoding result and generating a high frequency band signal from the low frequency band signal converted into the frequency domain or the time / frequency domain using the decoded bandwidth extension information;

(f) inversely converting the high frequency band signal into a time domain; And

and (g) synthesizing the inverse transformed high frequency band signal and the transformed low frequency band signal.

The method of claim 18,

Step (b) is

Synthesizing the dequantized signal into a multi-resolution; And

And synthesizing a result of performing frequency linear prediction at the encoding end into the dequantized signal by using the vector index included in the decoding result.

A computer-readable recording medium having recorded thereon a program for executing the method for decoding an audio signal according to any one of claims 18 and 19.

(a) receiving an encoding result of an audio signal;

(d) synthesizing the low frequency band signal and the high frequency band signal; And

(e) performing an inverse MDCT on the synthesized signal and converting from the frequency domain to the time domain.

The method of claim 21,

Step (b) is

Synthesizing the dequantized signal into a multi-resolution; And

A computer-readable recording medium having recorded thereon a program for executing the method for decoding an audio signal according to any one of claims 21 and 22.

A band dividing unit dividing the input signal into a high frequency band signal and a low frequency band signal;

A converter for converting the high frequency band signal and the low frequency band signal from a time domain to a frequency domain, respectively;

A low frequency band encoder which quantizes the converted low frequency band signal and encodes the bit frequency into a bit plane based on a context; And

And a bandwidth extension encoder configured to generate and encode bandwidth extension information indicating a characteristic of the converted high frequency band signal by using the converted low frequency band signal.

The method of claim 24,

The conversion unit

A first MDCT performing unit performing an MDCT on the high frequency band signal and converting from the time domain to the frequency domain; And

And a second MDCT performing unit performing an MDCT on the low frequency band signal and converting from the time domain to the frequency domain.

The method of claim 24,

The low frequency band encoder

A frequency linear prediction performing unit performing frequency linear prediction on the converted low frequency band signal to filter the converted low frequency band signal; And

Further comprising at least one of a multi-resolution analysis unit for analyzing the converted low frequency band signal by a multi-resolution,

The method of claim 26,

The frequency linear prediction performing unit performs frequency linear prediction on the transformed low frequency band signal to calculate coefficients of a linear prediction filter, and expresses a value corresponding to the coefficients as a vector index. Chi.

The method of claim 27,

And a multiplexer which multiplexes the encoded bitplane, the encoded bandwidth extension information, and the vector index.

An MDCT application unit performing an MDCT on the low frequency band signal and converting the time domain from the time domain to the frequency domain;

A low frequency band encoder for quantizing the output of the MDCT applier and encoding the bitplane based on a context;

A converter for converting the high frequency band signal and the low frequency band signal from a time domain into a frequency domain or a time / frequency domain; And

The method of claim 29,

The low frequency band encoder

A frequency linear prediction performing unit performing frequency linear prediction on the output of the MDCT application unit and filtering the output; And

Further comprising at least one of a multi-resolution analysis unit for analyzing the output of the MDCT application unit with a multi-resolution,

The method of claim 30,

The frequency linear prediction performing unit calculates a coefficient of the linear prediction filter by performing frequency linear prediction on the low frequency band signal on which the MDCT is performed, and expresses a value corresponding to the coefficient as a vector index. Encoding device.

The method of claim 31, wherein

And encoding multiplexing the encoded bitplane, the encoded bandwidth extension information, and the vector index.

A converter for converting an input signal from a time domain to a frequency domain;

A band dividing unit dividing the converted input signal into a high frequency band signal and a low frequency band signal;

A low frequency band encoder for quantizing the low frequency band signal and encoding the bit frequency based on a context; And

And a bandwidth extension encoder configured to generate and encode bandwidth extension information indicating a characteristic of the high frequency band signal by using the low frequency band signal.

The method of claim 33, wherein

And the conversion unit performs an MDCT on the input signal to convert the input signal from the time domain to the frequency domain.

The method of claim 33, wherein

The low frequency band encoder

A frequency linear prediction performing unit performing filtering on the low frequency band signal by performing frequency linear prediction; And

Further comprising at least one of a multi-resolution analysis unit for analyzing the low-frequency band signal in a multi-resolution,

36. The method of claim 35 wherein

The frequency linear prediction performing unit performs frequency linear prediction on the low frequency band signal to calculate coefficients of a linear prediction filter, and expresses a value corresponding to the coefficients as a vector index.

The method of claim 36,

A low frequency band decoder configured to decode and dequantize the encoded bitplane based on a context to generate a low frequency band signal;

A bandwidth extension decoder for decoding encoded bandwidth extension information and generating a high frequency band signal from the low frequency band signal using the decoded bandwidth extension information;

An inverse MDCT performing unit performing inverse MDCT on each of the low frequency band signal and the high frequency band signal and converting from the frequency domain to the time domain; And

And a band synthesizer for synthesizing the converted low frequency band signal and the converted high frequency band signal.

The method of claim 38,

The low frequency band decoder

A multi-resolution synthesis unit for synthesizing the dequantized signal into a multi-resolution; And

And at least one of an inverse frequency linear prediction performing unit configured to synthesize the result of the frequency linear prediction performed by the encoding end using the vector index to the dequantized signal.

An inverse MDCT application unit performing inverse MDCT on the low frequency band signal and converting from the frequency domain to the time domain;

A converter for converting the low frequency band signal subjected to the inverse MDCT to a frequency domain or a time / frequency domain;

A bandwidth extension decoding unit for decoding encoded bandwidth extension information and generating a high frequency band signal from the low frequency band signal converted into the frequency domain or the time / frequency domain using the decoded bandwidth extension information;

An inverse transformer for inversely converting the generated high frequency band signal into a time domain; And

And a band synthesizer configured to synthesize the inverse transformed high frequency band signal and the converted low frequency band signal.

The method of claim 40,

The low frequency band decoder

And at least one of an inverse frequency linear prediction performing unit for synthesizing a result of the frequency linear prediction performed by the encoding end using the vector index to the dequantized signal.

A band synthesizer for synthesizing the low frequency band signal and the high frequency band signal; And

And an inverse MDCT applying unit for performing inverse MDCT on the synthesized signal and converting the frequency signal from the frequency domain to the time domain.

The method of claim 42, wherein

The low frequency band decoder