KR101413969B1

KR101413969B1 - Method and apparatus for decoding audio signal

Info

Publication number: KR101413969B1
Application number: KR1020120149763A
Authority: KR
Inventors: 이건형; 이철우; 정종훈; 이남숙; 문한길
Original assignee: 삼성전자주식회사
Priority date: 2012-12-20
Filing date: 2012-12-20
Publication date: 2014-07-08
Also published as: KR20130007521A

Abstract

오디오 신호의 복호화 방법에 있어서, 소정의 코어 디코더를 이용하여 상기 오디오 신호의 저주파수 신호에 대한 복호화를 수행하는 단계; 상기 복호화된 저주파 신호를 이용하여 상기 오디오 신호의 고주파수 신호의 잔차 신호를 생성하는 단계; 비트스트림에 구비된 상기 고주파수 신호의 선형 예측 코딩 계수 및 상기 잔차 신호를 이용한 선형 예측 코딩 합성을 수행하여 상기 고주파수 신호를 복호화하는 단계; 및 상기 복호화된 저주파 신호와 고주파 신호를 결합하여 상기 오디오 신호를 복원하는 단계를 포함하는 것을 특징으로 하는 본 발명의 일 실시예에 따른 오디오 신호의 복호화 방법이 개시된다.A method of decoding an audio signal, the method comprising: decoding a low frequency signal of the audio signal using a predetermined core decoder; Generating a residual signal of a high frequency signal of the audio signal using the decoded low frequency signal; Decoding the high frequency signal by performing LPC synthesis using the LPC coefficient of the high frequency signal included in the bitstream and the residual signal; And reconstructing the audio signal by combining the decoded low-frequency signal and the high-frequency signal. According to another aspect of the present invention, there is provided a method of decoding an audio signal according to an embodiment of the present invention.

Description

[0001] The present invention relates to a method and apparatus for decoding audio signals,

본 발명은 오디오 신호를 부호화하거나 복호화하는 방법 및 장치에 관한 것으로, 보다 상세하게는 오디오 신호 중 소정 크로스 오버 주파수 이상의 고주파수 대역의 신호를 보다 효율적으로 부호화하거나 복호화하는 방법 및 장치에 관한 것이다.BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method and apparatus for encoding or decoding an audio signal, and more particularly, to a method and apparatus for efficiently encoding or decoding a signal in a high frequency band of a predetermined crossover frequency or higher.

오디오 신호 중에서 고주파수 신호는 저주파수 신호에 비하여 인간의 심리음향적인(Psychoacoustic) 특성상 상대적으로 덜 중요하다. 따라서, 오디오 신호를 부호화할 때 이용가능한 비트량의 한계를 극복하면서 코딩 효율을 향상시키기 위해서, 저주파수 신호에 많은 비트를 할당하고 고주파수 신호에는 적은 비트를 할당하여 부호화를 수행하는 방식이 도입되었다. 이러한 방식의 일 예로 SBR(Spectral Band Replication)이 있다. Among audio signals, high frequency signals are relatively less important than human low frequency signals in terms of psychoacoustic characteristics. Therefore, in order to improve the coding efficiency while overcoming the limit of the amount of bits available for encoding an audio signal, a method of assigning a large number of bits to a low frequency signal and a small number of bits to a high frequency signal has been adopted. One example of this approach is Spectral Band Replication (SBR).

도 1은 종래 기술에 따른 SBR을 설명하기 위한 참조도이다.1 is a reference diagram for explaining an SBR according to the prior art.

SBR은 오디오 신호의 고주파수 신호와 저주파수 신호 사이에 높은 연관성이 존재한다는 가정에 기반을 둔다. 따라서, SBR에 따르면 이러한 연관성을 이용하여 저주파 대역의 정보를 이용해 고주파 대역 성분을 추정할 수 있다고 가정하여, 저주파수 신호는 소정의 코어 코덱으로 부호화하고 고주파수 신호는 저주파수 신호로부터 추정할 때 필요한 부가 정보만을 부호화한다. 여기서, 코어 코덱으로는 mp3, AAC(Advanced Audio Coding)에 기반한 코덱이 이용되며, 고주파수 신호의 복원을 위해 이용되는 부가 정보로는 저주파수 신호를 복사할 고주파수 신호의 대역 정보 등이 포함된다.SBR is based on the assumption that there is a high correlation between high and low frequency signals of an audio signal. Therefore, according to the SBR, it is assumed that high-frequency band components can be estimated using low-frequency band information by using this correlation, and only low-frequency signals are encoded with a predetermined core codec and high- . Here, codecs based on mp3 and AAC (Advanced Audio Coding) are used as the core codec, and band information of a high frequency signal for copying a low frequency signal is included as additional information used for reconstruction of a high frequency signal.

도 1을 참조하면, 부호화기에서는 오디오 신호에 포함된 소정의 크로스 오버 주파수(fc) 이하의 저주파수 신호 A(10)는 코어 코덱을 이용하여 부호화하고, 크로스 오버 주파수(fc) 이상의 고주파수 신호 B(11)는 직접 부호화하지 않고 저주파수 신호로부터 추정될 때 필요한 부가 정보만을 부호화한다.Referring to FIG. 1, in the encoder, a low frequency signal A 10 of a predetermined crossover frequency fc or lower included in an audio signal is encoded using a core codec, and a high frequency signal B 11 ) Encodes only the additional information required when it is estimated from the low-frequency signal without directly encoding.

복호화기에서는 이러한 SBR 방식에 의하여 부호화된 비트스트림을 수신하고, 코어 코덱을 이용하여 저주파수 신호 A'(20)를 복원하며, 복원된 저주파수 신호 A'(20)를 고주파수 대역으로 복사한 후 비트스트림에 구비된 부가 정보를 이용하여 복사된 고주파수 대역의 신호를 조정함으로써 고주파수 신호 B'(21)을 생성한다.The decoder receives the bitstream encoded by the SBR method, restores the low-frequency signal A '20 using the core codec, copies the restored low-frequency signal A' 20 into the high frequency band, The high frequency signal B '21 is generated by adjusting the signal of the copied high frequency band using the additional information.

이러한 SBR과 같이 저주파수 신호로부터 고주파수 신호를 추정하여 부호화하는 방식은 저주파수 신호의 하모닉스(harmonics)가 고주파수 신호에 비해서 강하거나, 저주파수 신호의 주파수 대역별 에너지의 편차가 심한 경우에는 음질이 열화되는 문제점이 있다.A method of estimating and encoding a high frequency signal from a low frequency signal, such as SBR, is problematic in that the harmonic of the low frequency signal is stronger than that of the high frequency signal, or the sound quality is deteriorated when the energy of each low frequency signal varies greatly. have.

그러므로 고주파수 영역에 해당하는 신호를 부호화함에 있어서 적은 비트를 이용하고도 인간이 인식하는 음질을 최대한 향상시킬 수 있는 방법 및 장치가 요구된다.Therefore, there is a need for a method and an apparatus capable of maximizing the sound quality recognized by humans even when using a small number of bits in coding a signal corresponding to a high frequency region.

본 발명이 해결하고자 하는 과제는 큰 음질의 손실없이 적은 비트레이트로 오디오 신호의 고주파수 성분을 효율적으로 부호화하거나 복호화하는 방법 및 장치를 제공하는 것이다.SUMMARY OF THE INVENTION It is an object of the present invention to provide a method and apparatus for efficiently encoding and decoding high frequency components of an audio signal at a low bit rate without loss of a large sound quality.

전술한 과제를 해결하기 위한 본 발명에 따른 오디오 신호의 부호화 방법은 상기 오디오 신호에 구비된 소정 임계 주파수 이상의 고주파수 신호에 대하여 선형 예측 코딩(Linear Predictive Coding:LPC) 분석을 수행함으로써 상기 고주파수 신호의 선형 예측 코딩 계수 및 잔차 신호를 출력하는 단계, 상기 잔차 신호의 진폭 변화를 나타내는 이득 정보를 추출하는 단계 및 상기 고주파수 신호의 선형 예측 코딩 계수 및 상기 잔차 신호의 이득 정보를 다중화하는 단계를 포함하는 것을 특징으로 한다.According to another aspect of the present invention, there is provided a method of encoding an audio signal, the method comprising: performing a linear predictive coding (LPC) analysis on a high-frequency signal of a predetermined threshold frequency or more included in the audio signal, A step of outputting a predictive coding coefficient and a residual signal, extracting gain information indicating an amplitude change of the residual signal, and multiplexing the LPC coefficient of the high frequency signal and the gain information of the residual signal .

본 발명에 따른 오디오 신호의 부호화 장치는 상기 오디오 신호에 구비된 소정 임계 주파수 이상의 고주파수 신호에 대하여 선형 예측 코딩(Linear Predictive Coding:LPC) 분석을 수행함으로써 상기 고주파수 신호의 선형 예측 코딩 계수 및 잔차 신호를 출력하는 선형 예측 코딩 분석부, 상기 잔차 신호의 진폭 변화를 나타내는 이득 정보를 추출하는 이득 정보 추출부 및 상기 고주파수 신호의 선형 예측 코딩 계수 및 상기 잔차 신호의 이득 정보를 다중화하는 다중화부를 포함하는 것을 특징으로 한다.An apparatus for encoding an audio signal according to the present invention performs linear predictive coding (LPC) analysis on a high frequency signal of a predetermined frequency or more included in the audio signal, thereby obtaining a linear predictive coding coefficient and a residual signal of the high frequency signal And a multiplexing unit for multiplexing the LPC coefficients of the high frequency signal and the gain information of the residual signal, and a multiplexing unit for multiplexing the gain information of the high frequency signal and the gain information of the residual signal. .

본 발명에 따른 오디오 신호의 복호화 방법은 소정의 코어 디코더를 이용하여 상기 오디오 신호의 저주파수 신호에 대한 복호화를 수행하는 단계, 상기 복호화된 저주파 신호를 이용하여 상기 오디오 신호의 고주파수 신호의 잔차 신호를 생성하는 단계, 비트스트림에 구비된 상기 고주파수 신호의 선형 예측 코딩 계수 및 상기 잔차 신호를 이용한 선형 예측 코딩 합성을 수행하여 상기 고주파수 신호를 복호화하는 단계 및 상기 복호화된 저주파 신호와 고주파 신호를 결합하여 상기 오디오 신호를 복원하는 단계를 포함하는 것을 특징으로 한다.A method for decoding an audio signal according to the present invention includes decoding a low frequency signal of the audio signal using a predetermined core decoder, generating a residual signal of the high frequency signal of the audio signal using the decoded low frequency signal, Decoding the high frequency signal by performing a LPC synthesis using the LPC coefficient of the high frequency signal included in the bitstream and the residual signal, and combining the decoded low frequency signal and the high frequency signal, And recovering the signal.

본 발명에 따르면 선형 예측 코딩 분석을 이용하여 고주파수 신호에 대한 부호화를 수행함으로써 발생되는 비트량을 상대적으로 감소시키면서 고주파수 신호에 대한 음질의 열화를 방지할 수 있다.According to the present invention, it is possible to prevent deterioration of the quality of a high-frequency signal while relatively reducing the amount of bits generated by performing encoding on the high-frequency signal using the LPC analysis.

도 1은 종래 기술에 따른 SBR을 설명하기 위한 참조도이다.
도 2는 본 발명에 따른 오디오 신호의 부호화 장치의 일 실시예를 나타낸 블록도이다.
도 3은 본 발명에 따른 잔차 신호의 시간 포락선의 일 예를 나타낸 도면이다.
도 4는 도 2의 이득 정보 추출부(250)의 구성을 구체적으로 나타낸 블록도이다.
도 5는 본 발명에 따른 오디오 신호의 부호화 방법을 나타낸 플로우 차트이다.
도 6은 본 발명에 따른 오디오 신호의 복호화 장치의 일 실시예를 나타낸 블록도이다.
도 7은 본 발명에 따른 오디오 신호의 복호화 방법을 나타낸 플로우 차트이다.
도 8은 도 7의 단계 720을 구체화한 플로우차트이다.1 is a reference diagram for explaining an SBR according to the prior art.
2 is a block diagram showing an embodiment of an apparatus for encoding an audio signal according to the present invention.
3 is a diagram illustrating an example of a temporal envelope of a residual signal according to the present invention.
4 is a block diagram specifically showing a configuration of the gain information extraction unit 250 of FIG.
5 is a flowchart illustrating a method of encoding an audio signal according to the present invention.
6 is a block diagram illustrating an apparatus for decoding an audio signal according to an embodiment of the present invention.
7 is a flowchart illustrating a method of decoding an audio signal according to the present invention.
8 is a flow chart embodying step 720 of FIG.

이하, 첨부된 도면들을 참조하여 본 발명의 바람직한 실시예에 대하여 구체적으로 설명한다.Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings.

본 발명에서는 종래 SBR과 같이 저주파 신호를 기반으로 고주파수 신호를 복사함으로써 고주파수 신호를 생성하는 대신에, 선형 예측 코딩(Linear Prediction Coding:LPC)을 이용해서 고주파수 신호의 부호화 및 복호화를 수행하는 방법 및 장치를 제안한다.A method and an apparatus for performing encoding and decoding of a high frequency signal using Linear Prediction Coding (LPC), instead of generating a high frequency signal by copying a high frequency signal based on a low frequency signal like the conventional SBR, Lt; / RTI >

도 2는 본 발명에 따른 오디오 신호의 부호화 장치의 일 실시예를 나타낸 블록도이다.2 is a block diagram showing an embodiment of an apparatus for encoding an audio signal according to the present invention.

도 2를 참조하면, 본 발명에 따른 오디오 신호의 부호화 장치(200)는 필터부(210), 코어 코더(220), 감산부(230), 선형 예측 코딩 분석부(240), 이득 정보 추출부(250), 양자화부(260) 및 다중화부(270)를 포함한다.2, an apparatus 200 for encoding an audio signal according to the present invention includes a filter unit 210, a core coder 220, a subtractor 230, a LPC analysis unit 240, A demultiplexer 250, a quantizer 260, and a multiplexer 270.

필터부(210)는 입력된 오디오 신호를 소정의 크로스 오버 주파수(임계 주파수)를 기준으로 하여 저주파수 신호와 고주파수 신호로 분할한다. 코어 코더(220)는 소정의 크로스 오버 주파수 이하의 저주파수 신호를 코어 코덱을 이용하여 부호화한다. 여기서, 코어 코덱으로는 MP3, AAC 등의 다양한 오디오 압축 코덱이 이용될 수 있다.The filter unit 210 divides the input audio signal into a low frequency signal and a high frequency signal based on a predetermined crossover frequency (a critical frequency). The core coder 220 encodes a low-frequency signal of a predetermined crossover frequency or lower using a core codec. As the core codec, various audio compression codecs such as MP3 and AAC can be used.

선형 예측 코딩 분석부(240)는 오디오 신호에 구비된 크로스 오버 주파수 이상의 고주파수 신호에 대하여 선형 예측 코딩(Linear Predictive Coding:LPC) 분석을 수행함으로써 고주파수 신호의 선형 예측 코딩 계수 및 잔차 신호를 출력한다. 여기서, 고주파수 신호로서 필터부(210)를 통해 필터링된 고주파수 신호를 이용하거나, 감산부(230)을 통해서 입력 오디오 신호로부터 코어 코더(220)에서 부호화된 후 복원된 저주파수 신호를 뺌으로서 생성된 고주파수 성분의 신호를 이용할 수 있다.The LPC analysis unit 240 performs linear predictive coding (LPC) analysis on a high frequency signal of a crossover frequency or higher frequency included in an audio signal to output a LPC coefficient and a residual signal of a high frequency signal. Here, a high frequency signal filtered through the filter unit 210 may be used as the high frequency signal, a high frequency signal generated by subtracting the low frequency signal reconstructed after being encoded by the core coder 220 from the input audio signal through the subtraction unit 230, Component signal can be used.

선형 예측 분석은 음성의 기본적인 파라미터를 음성 발생의 선형적인 모델에 기초하여 추출해내는 방법으로, 현재의 음성 신호 샘플값은 과거 M개(M은 양의 정수)의 음성 출력 샘플 값과의 선형 결합으로 근사할 수 있다는 가정에 기반한 음성 신호 모델링 방식을 말한다. 본 발명에 따른 오디오 신호의 부호화 방법 및 장치에서는 이러한 선형 예측 코딩 분석 방식을 코어 코더(220)에 의해 부호화되지 못한 고주파수 신호에 적용하여 고주파수 신호를 부호화한다. 선형 예측 코딩 분석부(240)는 공분산 방식(covariance method), 자기 상관 방식(autocorrelation method), 래티스 필터(Lattice filter), 레빈슨-더빈 알고리즘(Levinson-Durbin algorithm) 등을 이용하여 고주파수 신호로부터 선형 예측 코딩 계수(LPC 계수) 및 잔차 신호를 추출하여 출력한다.Linear prediction analysis is a method of extracting the basic parameters of speech based on a linear model of speech generation. The current speech signal sample value is a linear combination of past M (M is a positive integer) The speech signal modeling method is based on the assumption that it can be approximated. In the method and apparatus for encoding an audio signal according to the present invention, the linear prediction coding analysis method is applied to a high frequency signal that can not be encoded by the core coder 220 to encode a high frequency signal. The LPC analysis unit 240 performs linear prediction coding from the high frequency signal using a covariance method, an autocorrelation method, a Lattice filter, a Levinson-Durbin algorithm, Extracts coding coefficients (LPC coefficients) and residual signals and outputs them.

구체적으로, 본 발명에 따른 선형 예측 코딩 분석부(240)은 현재의 고주파수 신호 샘플값을 s(n)은 다음과 같이 그 이전의 p(p는 양의 정수)개의 고주파수 신호 샘플들(s(n-1), s(n-2),..., s(n-p))을 이용하여 다음의 수학식 1과 같이 모델링된다고 가정한다.Specifically, the LPC analysis unit 240 according to the present invention estimates a current high frequency signal sample value s (n) as follows: p (p is a positive integer) high frequency signal samples s (n-1), s (n-2), ..., s (np)

수학식 1에서 u(n)은 선형 예측 코딩 분석에 따라서 이전의 p개의 고주파수 신호 샘플들로부터 현재의 고주파수 신호 샘플값을 예측하였을 때의 예측 오차값에 해당하는 것으로 여기 신호(excitation signal) 또는 잔차 신호(residual signal)라고 한다. 이하, 본 발명을 설명함에 있어서 Gu(n)은 잔차 신호로 정의하기로 한다. G는 잔차 신호의 에너지에 따른 이득값(gain)을 의미한다. a_i는 선형 예측 코딩 계수(LPC 계수)를 나타내며, p는 선형 예측 코딩 계수의 차수로서 일반적으로 10~16의 값을 갖는다. In Equation (1), u (n) corresponds to the prediction error value when the current high frequency signal sample value is predicted from the previous p high frequency signal samples according to the LPC analysis. The excitation signal or residual It is called a residual signal. In the following description of the present invention, Gu (n) is defined as a residual signal. And G denotes a gain according to the energy of the residual signal. a _i denotes a linear predictive coding coefficient (LPC coefficient), and p is a degree of a linear predictive coding coefficient and generally has a value of 10 to 16.

수학식 1을 z-변환을 통해 변환하면 다음의 수학식 2와 같다.The equation (1) can be transformed through z-transform as shown in the following equation (2).

수학식 2에서 전달함수 H(z)의 분모 부분을 A(z)로 표시하였다.In Equation (2), the denominator part of the transfer function H (z) is denoted by A (z).

한편, 수학식 1로부터 잔차 신호 Gu(n)(또는 e(n))으로 표시함)은 다음의 수학식 3와 같다.On the other hand, the residual signal Gu (n) (or e (n)) from Equation 1 is expressed as Equation 3 below.

예측 오차에 해당하는 잔차 신호의 전달 함수는 다음의 수학식 4와 같이 표현될 수 있다.The transfer function of the residual signal corresponding to the prediction error can be expressed by the following Equation (4).

수학식 2와 수학식 4를 고려할 때, 잔차 신호의 전달 함수는 전달 함수 H(z)의 분모 부분에 해당됨을 알 수 있다. 따라서, 선형 예측 코딩 분석을 통해 선형 예측 코딩 계수 a_i들을 계산하여 A(z)를 결정하고, A(z)에 고주파수 신호를 입력하여 필터링하면 잔차 신호 Gu(n)이 추출된다.Considering equations (2) and (4), it can be seen that the transfer function of the residual signal corresponds to the denominator part of the transfer function H (z). Therefore, A (z) is calculated by calculating the LPC coefficients a _i through linear predictive coding analysis, and a residual signal Gu (n) is extracted by filtering the high frequency signal by inputting A (z).

이와 같이, 선형 예측 코딩 분석부(240)는 고주파수 신호에 대하여 선형 예측 코딩 분석을 수행하여 고주파수 신호의 예측 신호를 생성하기 위한 선형 예측 코딩 계수 및 예측 에러에 해당하는 잔차 신호를 출력한다.In this manner, the LPC analysis unit 240 performs LPC analysis on the high frequency signal to output a LPC coefficient for generating a prediction signal of the high frequency signal and a residual signal corresponding to the prediction error.

이득 정보 추출부(250)는 잔차 신호로부터 이득값(G)을 추출하여 부호화한다. The gain information extracting unit 250 extracts and codes the gain value G from the residual signal.

도 3은 본 발명에 따른 잔차 신호의 시간 포락선의 일 예를 나타낸 도면이며, 도 4는 도 2의 이득 정보 추출부(250)의 구성을 구체적으로 나타낸 블록도이다.FIG. 3 is a diagram illustrating an example of a temporal envelope of a residual signal according to an embodiment of the present invention. FIG. 4 is a block diagram specifically illustrating a configuration of the gain information extraction unit 250 of FIG.

도 3 및 도 4를 참조하면, 잔차 신호의 진폭 변화는 잔차 신호의 개략적인 외형을 나타내는 시간 포락선(time envelope)을 모델링함으로써 표현될 수 있다. 따라서, 이득 정보 추출부(250)에 구비된 분할부(251)는 잔차 신호의 시간 포락선을 소정 시간 단위로 분할하고, 포락선 파라미터 검출부(252)는 분할된 각 구간의 에너지를 이용하여 잔차 신호의 시간 포락선의 진폭 변화를 나타내는 파라미터를 생성한다. 일 예로 포락선 파라미터 검출부(252)는 잔차 신호의 분할된 각 구간의 평균 에너지를 계산하고, 이를 각 구간의 진폭을 나타내는 대표값으로 이용할 수 있다.Referring to FIGS. 3 and 4, the amplitude change of the residual signal can be represented by modeling a time envelope representing a rough outline of the residual signal. Therefore, the divider 251 included in the gain information extracting unit 250 divides the time envelope of the residual signal by a predetermined time unit, and the envelope parameter detecting unit 252 divides the time- And generates a parameter indicating the amplitude change of the time envelope. For example, the envelope parameter detector 252 may calculate the average energy of each divided interval of the residual signal and use it as a representative value representing the amplitude of each interval.

양자화부(260)는 선형 예측 코딩 분석부(240)에서 출력된 고주파수 신호의 선형 예측 코딩 계수 및 이득 정보 추출부(250)에서 출력된 이득 정보를 양자화하여 출력한다.The quantization unit 260 quantizes and outputs the LPC coefficients of the high frequency signal output from the LPC analysis unit 240 and the gain information output from the gain information extraction unit 250.

다중화부(270)는 저주파수 신호의 부호화된 데이터, 고주파수 신호의 선형 예측 코딩 계수 및 이득 정보 등을 다중화하여 비트스트림을 생성하여 출력한다. 이 때, 다중화부(270)는 선형 예측 코딩 분석의 역과정인 선형 예측 코딩 합성 과정을 통해 고주파수 신호의 복원을 위해 필요한 다양한 파라미터 정보들, 예를 들어 선형 예측 코딩 계수의 차수 정보, 복사 대역 정보 등을 부호화된 비트스트림에 부가하는 것이 바람직하다.The multiplexing unit 270 multiplexes the encoded data of the low-frequency signal, the LPC coefficient of the high-frequency signal, and the gain information to generate and output a bitstream. In this case, the multiplexing unit 270 performs various kinds of parameter information necessary for restoration of the high frequency signal, for example, order information of the LPC coefficient, copy band information Or the like to the coded bit stream.

이와 같이 본 발명에 따른 오디오 신호의 부호화 장치에 따르면, 코어 코더에 의하여 부호화되지 않는 고주파수 신호를 선형 예측 코딩 분석을 통해 부호화함으로써 비트량의 큰 증가없이 고주파수 신호의 코딩 효율을 향상시킨다.As described above, according to the apparatus for encoding an audio signal according to the present invention, a high-frequency signal that is not encoded by a core coder is encoded through a LPC analysis, thereby improving a coding efficiency of a high-frequency signal without increasing a bit amount.

도 5는 본 발명에 따른 오디오 신호의 부호화 방법을 나타낸 플로우 차트이다.5 is a flowchart illustrating a method of encoding an audio signal according to the present invention.

도 5를 참조하면, 단계 510에서 오디오 신호에 구비된 임계 주파수 이상의 고주파수 신호에 대하여 선형 예측 코딩 분석을 수행함으로써 고주파수 신호의 선형 예측 코딩 계수 및 잔차 신호를 출력한다. 전술한 바와 같이, 고주파수 신호로서 필터링된 고주파수 신호를 이용하거나, 입력 오디오 신호로부터 코어 코덱을 이용하여 부호화된 후 복원된 저주파수 신호를 뺌으로서 생성된 고주파수 성분의 신호를 이용할 수 있다.Referring to FIG. 5, in step 510, a LPC analysis is performed on a high-frequency signal having a frequency equal to or higher than a threshold frequency included in an audio signal, thereby outputting a LPC coefficient and a residual signal of a high-frequency signal. As described above, it is possible to use a filtered high-frequency signal as a high-frequency signal, or a high-frequency component signal generated by decoding a low-frequency signal that has been encoded using an input audio signal using a core codec.

단계 520에서 잔차 신호의 진폭 변화를 나타내는 이득 정보를 추출한다. 이득 정보로서 잔차 신호의 시간 포락선을 모델링한 파라미터 정보를 이용할 수 있다. 이 경우 잔차 신호의 시간 포락선을 소정 구간으로 분할하고, 분할된 각 구간의 평균 에너지를 계산하여 계산된 평균 에너지를 잔차 신호의 시간 포락선의 진폭 변화를 나타내는 파라미터로 이용할 수 있다.In step 520, gain information indicating the amplitude change of the residual signal is extracted. The parameter information obtained by modeling the time envelope of the residual signal can be used as the gain information. In this case, the time envelope of the residual signal is divided into a predetermined section, and the calculated average energy of each divided section is used as a parameter representing the amplitude change of the temporal envelope of the residual signal.

단계 540에서 고주파수 신호의 선형 예측 코딩 분석을 통해 생성된 선형 예측 코딩 계수 및 잔차 신호의 이득 정보를 양자화한다.In step 540, the linear prediction coding coefficient generated through the LPC analysis of the high frequency signal and the gain information of the residual signal are quantized.

단계 550에서, 부호화된 저주파수 신호의 데이터, 양자화된 고주파수 신호의 선형 예측 코딩 계수 및 잔차 신호의 이득 정보를 다중화한다. 이 때, 선형 예측 코딩 분석의 역과정인 선형 예측 코딩 합성 과정을 통해 고주파수 신호의 복원을 위해 필요한 다양한 파라미터 정보들, 예를 들어 선형 예측 코딩 계수의 차수 정보, 복사 대역 정보 등을 부호화된 비트스트림에 부가한다.In step 550, the encoded low-frequency signal data, the linear predictive coding coefficients of the quantized high-frequency signal, and the gain information of the residual signal are multiplexed. At this time, various parameter information necessary for restoration of a high-frequency signal, for example, degree information of linear prediction coding coefficients, copy band information, and the like, through the LPC synthesis process, which is an inverse process of the LPC analysis, .

도 6은 본 발명에 따른 오디오 신호의 복호화 장치의 일 실시예를 나타낸 블록도이다.6 is a block diagram illustrating an apparatus for decoding an audio signal according to an embodiment of the present invention.

도 6을 참조하면, 본 발명에 따른 오디오 신호의 복호화 장치는 역다중화부(610), 코어 디코더(620), 스펙트럼 화이트닝 수행부(630), 고주파수 대역 복사부(640), 포락선 조정부(650), 선형 예측 코딩 합성부(660) 및 결합부(670)를 포함한다. 여기서, 스펙트럼 화이트닝 수행부(630), 고주파수 대역 복사부(640), 포락선 조정부(650)는 복호화된 저주파수 신호를 이용하여 고주파수 신호의 잔차 신호를 생성하는데에 이용된다.6, an apparatus for decoding an audio signal according to the present invention includes a demultiplexer 610, a core decoder 620, a spectrum whitening unit 630, a high frequency band copy unit 640, an envelope adjustment unit 650, A linear prediction coding synthesis unit 660, and a combining unit 670. Here, the spectrum whitening performing unit 630, the high frequency band copying unit 640, and the envelope adjusting unit 650 are used to generate a residual signal of the high frequency signal using the decoded low frequency signal.

역다중화부(610)는 비트스트림에 대한 역다중화를 수행하여, 부호화된 저주파수 신호의 데이터, 고주파수 신호의 복원을 위해 필요한 선형 예측 코딩 계수의 차수(LPC 차수) 정보, 복사 대역 정보, 이득 정보 및 부호화시에 고주파수 신호에 대한 선형 예측 코딩 분석을 통해 생성된 선형 예측 코딩 계수(LPC 계수) 정보 등을 추출하여 출력한다.The demultiplexer 610 performs demultiplexing on the bitstream to generate the encoded low frequency signal data, order (LPC order) information of the LPC coefficients required for reconstruction of the high frequency signal, copy band information, gain information, (LPC coefficient) information generated by the LPC analysis on the high frequency signal during encoding and outputs the LPC coefficient information.

코어 디코더(620)는 코어 코덱을 이용하여 부호화된 오디오 신호의 저주파수 신호를 복호화한다.The core decoder 620 decodes the low-frequency signals of the audio signal encoded using the core codec.

스펙트럴 화이트닝 수행부(630)는 복호화된 저주파수 신호로부터 포락선을 제거하고 남은 잔차 신호를 추출한다. 일 예로, 스펙트럴 화이트닝 수행부(630)는 선형 예측 코딩 분석을 수행하여 복호화된 저주파수 신호의 잔차 신호를 생성할 수 있다. 이 때, 스펙트럴 화이트닝 수행부(630)는 비트스트림으로부터 출력된 선형 예측 코딩 계수의 차수 정보를 이용하여 부호화된 고주파수 신호와 동일한 선형 예측 코딩 계수 차수를 적용하여 선형 예측 코딩 분석을 수행하는 것이 바람직하다.The spectral whitening performing unit 630 removes the envelope from the decoded low frequency signal and extracts the residual signal. For example, the spectral whitening performing unit 630 may perform a LPC analysis to generate a residual signal of the decoded low frequency signal. In this case, the spectral whitening performing unit 630 preferably performs the LPC analysis using the same LPC coefficient order as the high frequency signal encoded using the order information of the LPC coefficient outputted from the bitstream Do.

고주파수 대역 복사부(640)는 스펙트럴 화이트닝 수행부(630)에서 출력된 저주파수 신호의 잔차 신호를 소정의 고주파수 대역으로 복사한다. 이때, 고주파수 대역 복사부(640)는 소정의 크로스 오버 주파수 이상의 고주파수 대역들 중 부호화화된 고주파수 대역을 나타내는 복사 대역 정보를 이용하여 해당 복사 대역에 저주파수 신호의 잔차 신호를 복사한다. 고주파수 대역 복사부(640)를 통해 저주파수 잔차 신호로부터 복사된 고주파수 신호는 고주파수 신호의 잔차 신호의 예측 신호에 해당하게 된다.The high frequency band copying unit 640 copies the residual signal of the low frequency signal output from the spectral whitening performing unit 630 into a predetermined high frequency band. At this time, the high frequency band copying unit 640 copies the residual signal of the low frequency signal to the corresponding copy band using the copy band information indicating the encoded high frequency band out of the high frequency bands above a predetermined crossover frequency. The high frequency signal copied from the low frequency residual signal through the high frequency band copying unit 640 corresponds to the prediction signal of the residual signal of the high frequency signal.

포락선 조정부(650)는 비트스트림으로부터 추출된 이득 정보를 이용하여, 복사된 고주파수 신호를 소정 구간으로 분할하고, 각 구간이 비트스트림으로부터 추출된 해당 구간의 이득 정보와 동일하게 되도록 복사된 고주파수 신호의 진폭을 조정한다. 전술한 바와 같이, 이득 정보로서 각 구간의 평균 에너지를 이용하는 경우, 복사된 고주파수 신호를 분할한 각 구간의 평균 에너지가 이득 정보에 구비된 해당 구간의 평균 에너지와 일치되도록 복사된 고주파수 신호의 진폭을 조정한다. 이와 같이 복사된 고주파수 신호의 진폭을 이득 정보를 통해 조정하여 시간 포락선을 조정함으로써 고주파수 신호의 잔차 신호가 생성된다.The envelope adjustment unit 650 divides the copied high frequency signal into a predetermined section using the gain information extracted from the bitstream, and outputs the copied high frequency signal so that each section is equal to the gain information of the corresponding section extracted from the bit stream. Adjust the amplitude. As described above, when the average energy of each section is used as the gain information, the amplitude of the copied high frequency signal is set so that the average energy of each section obtained by dividing the copied high frequency signal coincides with the average energy of the corresponding section included in the gain information Adjust. By adjusting the amplitude of the copied high frequency signal through the gain information and adjusting the time envelope, a residual signal of the high frequency signal is generated.

선형 예측 코딩 합성부(660)는 선형 예측 코딩 분석의 역과정인 선형 예측 코딩 합성을 통해 비트스트림으로부터 추출된 고주파수 신호의 선형 예측 코딩 계수 및 잔차 신호로부터 고주파수 신호를 복원한다. 전술한 수학식 1을 참조하면, 현재 고주파수 신호의 샘플값은 선형 예측 코딩 계수(a_i) 및 잔차 신호(Gu(n))가 결정되면 이전의 고주파수 신호의 샘플값을 통해 복원될 수 있다. 한편, 선형 예측 코딩 합성부(660)는 선형 예측 코딩 계수를 라인 스펙트럼 주파수(Line Spectral Frequencies:LSF)로 변환하고, 변환된 라인 스펙트럼 주파수를 보간하여 선형 예측 코딩 합성을 수행하는 것이 바람직하다.The LPC synthesis unit 660 reconstructs the LPC signals from the LPC coefficients of the high frequency signal extracted from the bitstream and the residual signal through the LPC synthesis, which is an inverse process of the LPC analysis. Referring to Equation (1), the sample value of the current high frequency signal can be recovered through the sample value of the previous high frequency signal when the LPC coefficient a _i and the residual signal Gu (n) are determined. Meanwhile, the LPC synthesis unit 660 preferably converts the LPC coefficients into Line Spectral Frequencies (LSF), and interpolates the converted line spectrum frequencies to perform LPC synthesis.

결합부(670)는 코어 디코더(620)를 통해 복원된 저주파수 신호와 선형 예측 코딩 합성부(660)를 통해 복원된 고주파수 신호를 결합하여 복호화된 오디오 신호를 출력한다.The combining unit 670 combines the low frequency signal reconstructed through the core decoder 620 and the high frequency signal reconstructed through the LPC synthesis unit 660 to output a decoded audio signal.

도 7은 본 발명에 따른 오디오 신호의 복호화 방법을 나타낸 플로우 차트이다.7 is a flowchart illustrating a method of decoding an audio signal according to the present invention.

도 7을 참조하면, 단계 710에서 코어 코덱을 이용하여 부호화된 비트스트림에 구비된 오디오 신호의 저주파수 신호에 대한 복호화를 수행한다.Referring to FIG. 7, in operation 710, a low-frequency signal of an audio signal included in a bitstream encoded using a core codec is decoded.

단계 720에서, 복호화된 저주파 신호를 이용하여 오디오 신호의 고주파수 신호의 잔차 신호를 생성한다. 구체적으로, 도 7의 단계 720을 구체화한 도 8을 참조하면 단계 721에서, 복호화된 저주파수 신호에 대한 스펙트럴 화이트닝을 수행하여 복호화된 저주파수 신호의 잔차 신호를 생성한다. 전술한 바와 같이, 선형 예측 코딩 분석을 이용하여 복호화된 저주파수 신호에서 포락선을 제거한 잔차 신호를 생성할 수 있다. 단계 722에서, 저주파수 신호의 잔차 신호를 복사 대역 정보를 이용하여 소정의 고주파수 대역으로 복사한다. 단계 723에서, 비트스트림에 구비된 고주파수 신호의 잔차 신호의 이득 정보를 이용하여 고주파수 대역으로 복사된 신호의 포락선을 조정한다.In step 720, the decoded low-frequency signal is used to generate a residual signal of the high-frequency signal of the audio signal. Specifically, referring to FIG. 8 illustrating step 720 of FIG. 7, at step 721, spectral whitening is performed on the decoded low frequency signal to generate a decoded residual signal of the low frequency signal. As described above, it is possible to generate a residual signal from which the envelope is removed in the low-frequency signal decoded by using the LPC analysis. In step 722, the residual signal of the low frequency signal is copied to a predetermined high frequency band using the copy band information. In step 723, the envelope of the signal copied to the high frequency band is adjusted using the gain information of the residual signal of the high frequency signal included in the bit stream.

다시 도 7을 참조하면, 단계 730에서 비트스트림에 구비된 고주파수 신호의 선형 예측 코딩 계수와 포락선이 조정 과정을 통해 생성된 고주파수 신호의 잔차 신호를 이용한 선형 예측 코딩 합성을 수행하여 고주파수 신호를 복호화한다. 선형 예측 코딩 합성시에 선형 예측 코딩 계수를 라인 스펙트럼 주파수(LSF)로 변환하고, 변환된 라인 스펙트럼 주파수를 보간하여 선형 예측 코딩 합성을 수행하는 것이 바람직하다.Referring again to FIG. 7, in step 730, the LPC coding of the high frequency signal included in the bitstream and the LPC synthesis using the residual signal of the high frequency signal generated through the adjustment process are performed to decode the high frequency signal . It is preferable to convert the LPC coefficient to a line spectrum frequency (LSF) at the time of linear prediction coding synthesis, and to perform linear prediction coding synthesis by interpolating the converted line spectrum frequency.

단계 740에서 복호화된 저주파 신호와 고주파 신호를 결합하여 오디오 신호를 복원한다.The low-frequency signal decoded in step 740 is combined with the high-frequency signal to restore the audio signal.

본 발명에 따르면 고주파수 신호에 대한 선형 예측 코딩 분석을 통해서 고주파수 대역의 톤 성분 등을 효율적으로 부호화할 수 있으며, 이를 통해 종래 SBR 방식 등에서는 부호화되지 않았던 고주파수 신호의 성분을 부호화할 수 있으므로 전체 오디오 신호의 음질이 향상된다.According to the present invention, it is possible to efficiently encode a tone component of a high frequency band through a linear predictive coding analysis on a high frequency signal, and to encode a component of a high frequency signal which has not been encoded by the conventional SBR method, Is improved.

이상과 같이 본 발명은 비록 한정된 실시예와 도면에 의해 설명되었으나, 본 발명이 상기의 실시예에 한정되는 것은 아니며, 이는 본 발명이 속하는 분야에서 통상의 지식을 가진 자라면 이러한 기재로부터 다양한 수정 및 변형이 가능하다. 따라서, 본 발명의 사상은 아래에 기재된 특허청구범위에 의해서만 파악되어야 하고, 이와 균등하거나 또는 등가적인 변형 모두는 본 발명 사상의 범주에 속한다 할 것이다. 또한, 본 발명에 따른 시스템은 컴퓨터로 읽을 수 있는 기록매체에 컴퓨터가 읽을 수 있는 코드로서 구현하는 것이 가능하다. 컴퓨터가 읽을 수 있는 기록매체는 컴퓨터 시스템에 의하여 읽혀질 수 있는 데이터가 저장되는 모든 종류의 기록장치를 포함한다. 기록매체의 예로는 ROM, RAM, CD-ROM, 자기 테이프, 플로피 디스크, 광데이터 저장장치 등이 있으며, 또한 캐리어 웨이브(예를 들어 인터넷을 통한 전송)의 형태로 구현되는 것도 포함한다. 또한 컴퓨터가 읽을 수 있는 기록매체는 네트워크로 연결된 컴퓨터 시스템에 분산되어 분산방식으로 컴퓨터가 읽을 수 있는 코드가 저장되고 실행될 수 있다.While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is to be understood that the invention is not limited to the disclosed exemplary embodiments, but, on the contrary, Modification is possible. Accordingly, the spirit of the present invention should be understood only in accordance with the following claims, and all of the equivalent or equivalent variations will fall within the scope of the present invention. In addition, the system according to the present invention can be embodied as computer-readable codes on a computer-readable recording medium. A computer-readable recording medium includes all kinds of recording apparatuses in which data that can be read by a computer system is stored. Examples of the recording medium include a ROM, a RAM, a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, and the like, and a carrier wave (for example, transmission via the Internet). The computer-readable recording medium may also be distributed over a networked computer system so that computer readable code can be stored and executed in a distributed manner.

Claims

A method of decoding an audio signal,
Extracting encoded low frequency signal data and a linear predictive coding coefficient generated through a linear predictive coding analysis on a high frequency signal during encoding from the bit stream in which the audio signal is encoded;
Performing decoding on the low-frequency signals of the audio signal using a predetermined core decoder;
Generating a residual signal of a high frequency signal of the audio signal using the decoded low frequency signal;
Decoding the high frequency signal by performing LPC synthesis using the LPC coefficient of the high frequency signal included in the bitstream and the residual signal; And
And reconstructing the audio signal by combining the decoded low-frequency signal and the high-frequency signal.

The method according to claim 1,
The step of generating a residual signal of the high frequency signal of the audio signal
Performing spectral whitening on the decoded low frequency signal to generate a residual signal of the decoded low frequency signal;
Copying the residual signal of the low frequency signal into a predetermined high frequency band;
And adjusting the envelope of the signal copied to the high frequency band using gain information of the residual signal of the high frequency signal included in the bit stream.

3. The method of claim 2,
Wherein the gain information of the high frequency signal is parameter information modeled by a time envelope of the residual signal of the high frequency signal included in the bit stream.

3. The method of claim 2,
The step of adjusting the envelope of the signal copied in the high frequency band
Dividing the signal copied in the high frequency band into a predetermined period; And
By adjusting the envelope of each divided section of the signal copied in the high frequency band by using parameter information indicating the energy of each section of the time envelope of the high frequency signal included in the bit stream, Further comprising the step of adjusting a time envelope.

The method according to claim 1,
The step of decoding the high frequency signal
Wherein the linear predictive coding coefficient included in the bitstream is converted into Line Spectral Frequencies, and the converted line spectrum frequency is interpolated to perform the LPC synthesis.

An apparatus for decoding an audio signal,
A demultiplexer 610 for extracting the encoded low-frequency signal data from the bitstream in which the audio signal is encoded, and the LPC coefficients generated through the LPC analysis on the high-frequency signal during encoding;
A core decoder 620 for decoding a low frequency signal of the audio signal;
A high frequency residual signal generating unit for generating a residual signal of the high frequency signal of the audio signal using the decoded low frequency signal;
A LPC synthesis unit 660 for decoding the high frequency signal by performing LPC synthesis using the LPC coefficient of the high frequency signal included in the bitstream and the residual signal; And
And a combining unit (670) for combining the decoded low-frequency signal and the high-frequency signal to reconstruct the audio signal.

The method according to claim 6,
The high-frequency residual signal generating unit
A spectral whitening performing unit 630 for performing spectral whitening on the decoded low frequency signal to generate a residual signal of the decoded low frequency signal;
A high frequency band copying unit 640 for copying the residual signal of the low frequency signal into a predetermined high frequency band;
And an envelope adjustment unit (650) for adjusting an envelope of a signal copied to the high frequency band using gain information of the residual signal of the high frequency signal included in the bitstream.

8. The method of claim 7,
Wherein the gain information of the high frequency signal is parameter information modeled by a time envelope of the residual signal of the high frequency signal included in the bit stream.

8. The method of claim 7,
The envelope adjustment unit 650
And dividing the signal copied in the high frequency band into a predetermined section and using parameter information indicating energy of each section of the time envelope of the high frequency signal included in the bit stream, And adjusting the time envelope of the signal copied to the high frequency band by adjusting the envelope of the section.

The method according to claim 6,
The LPC synthesis unit 660
Wherein the decoding unit performs linear prediction coding synthesis by converting the LPC coefficients included in the bitstream into Line Spectral Frequencies and interpolating the converted line spectrum frequencies.