KR20160030499A

KR20160030499A - Apparatus and method for coding audio signal by swithcing transform scheme among frequency domain transform and time domain transform

Info

Publication number: KR20160030499A
Application number: KR1020160025746A
Authority: KR
Inventors: 김중회; 성호상; 오은미; 주기현
Original assignee: 삼성전자주식회사
Priority date: 2016-03-03
Filing date: 2016-03-03
Publication date: 2016-03-18
Also published as: KR101702565B1

Abstract

Provided are an apparatus for coding an audio signal by switching between a time-domain transform technique and a frequency-domain transform technique and an apparatus for decoding a coded audio signal by switching between a time-domain transform technique and a frequency-domain transform technique. When switching from one transform technique to the other transform technique from among the time-domain transform technique or the frequency-domain transform technique, an audio signal before and after the switching can be coded or the coded audio signal can be decoded by using an additionally transformed audio signal. In addition, provided are various embodiments for reducing the amount of audio signals to be additionally transformed.

Description

BACKGROUND OF THE INVENTION 1. Field of the Invention [0001] The present invention relates to an apparatus and a method for encoding an audio signal by switching a frequency domain transform technique and a time domain transform technique,

본 발명에 따른 실시예들은 주파수 영역 변환 기법 및 시간 영역 변환 기법을 전환하며 오디오 신호를 부호화하는 장치 및 방법과 관련된 것이다.Embodiments according to the present invention relate to an apparatus and method for encoding an audio signal by switching between a frequency domain transform technique and a time domain transform technique.

기존의 음성/음악 압축 방식은 크게 시간 영역 변환 기법과 주파수 영역 변환 기법으로 분류된다. 주파수 영역 변환 기법은 주파수 영역상의 신호를 압축하는 알고리즘으로, 심리 음향 모델을 적용하는데, 음악신호에 대한 압축 성능이 우수하지만, 음성 신호에 대한 압축 성능은 떨어진다.The conventional speech / music compression method is classified into a time domain transformation method and a frequency domain transformation method. The frequency domain transform technique is an algorithm for compressing a signal in the frequency domain. The psychoacoustic model is applied. Although the compression performance is excellent for a music signal, the compression performance for a voice signal is poor.

시간 영역 변환 기법은 시간 영역 상의 신호를 압축하는 알고리즘으로, 음성 발성 모델을 적용한다. 시간 영역 변환 기법은 사람의 말소리, 즉 음성 신호에 대한 압축 성능이 우수하나, 음악 신호에 대한 압축 성능은 떨어진다.The time domain transform scheme is an algorithm that compresses the signal in the time domain and applies the speech model. The time domain transform technique has excellent compression performance for human speech, that is, speech signal, but has poor compression performance for music signals.

이러한 특성을 반영하여 음성 및 음악 신호를 포함하는 오디오 신호에 대한 압축을 효율적으로 수행하기 위한 기법들이 연구되고 있다.Techniques for efficient compression of audio signals including voice and music signals are being researched to reflect these characteristics.

종래의 기술에 따르면 오디오 신호에 대하여 주파수 영역 변환을 수행하는 것이 효율적인지, 시간 영역 변환을 수행하는 것이 효율적인지 여부를 판단하고, 판단 결과에 따라서 주파수 영역 변환 또는 시간 영역 변환을 수행할 수 있다.According to the related art, it is possible to determine whether it is efficient to perform the frequency domain transform on the audio signal, whether it is efficient to perform the time domain transform, and perform the frequency domain transform or the time domain transform according to the determination result.

종래의 기술에 따르면 오디오 신호에 대하여 주파수 영역 변환을 수행하다가 시간 영역 변환으로 전환하는 경우 및 시간 영역 변환을 수행하다가 주파수 영역 변환으로 전환하는 경우에, 예측 데이터 부족으로 인한 전환 노이즈가 발생하였다.According to the related art, when a frequency domain transform is performed on an audio signal and then the time domain transform is performed and the frequency domain transform is performed after the time domain transform is performed, transition noise due to insufficient predicted data occurs.

따라서 주파수 영역 변환 및 시간 영역 변환 중에서 어느 한 변환 기법을 수행하다가 다른 변환 기법으로 전환하는 경우에도 전환 노이즈 없이 오디오 신호를 효과적으로 압축 부호화할 수 있는 변환 기법이 요구되고 있다.Accordingly, there is a need for a conversion technique capable of efficiently compressing and encoding an audio signal without switching noise even when switching from one frequency domain transform to another is performed.

주파수 영역 변환 기법 및 시간 영역 변환 기법을 전환하며 오디오 신호를 부호화하는 경우에, 추가적으로 부호화하는 데이터의 양을 최소화하기 위한 방법 및 장치를 제공하는데 있다.And a method and apparatus for minimizing the amount of data to be encoded when an audio signal is encoded by switching the frequency domain transform technique and the time domain transform technique.

본 발명은 제1 시간 구간의 제1 오디오 신호에 대하여 시간 영역 변환 또는 주파수 영역 변환 중 어느 한 변환을 수행하는 제1 변환부, 상기 제1 시간 구간 이후에 상기 제1 시간 구간과 인접하는 제2 시간 구간의 제2 오디오 신호에 대하여 시간 영역 변환 또는 주파수 영역 변환 중 상기 제1 변환부의 변환과 상이한 변환을 수행하는 제2 변환부 및 상기 제1 오디오 신호 또는 제2 오디오 신호에 대하여 상기 제1 변환부 및 상기 제2 변환부의 변환 기법을 고려하여 시간 영역 변환 또는 주파수 영역 변환 중 어느 한 변환을 수행하는 제3 변환부를 포함하고, 상기 제1 오디오 신호는 상기 제1 변환부가 변환한 오디오 신호 및 상기 제3 변환부가 변환한 오디오 신호에 기반하여 복원되거나 제2 오디오 신호는 상기 제2 변환부가 변환한 오디오 신호 및 상기 제3 변환부가 변환한 오디오 신호에 기반하여 복원되는 것을 특징으로 하는 오디오 신호 부호화기를 제공한다.The present invention provides a speech signal processing apparatus comprising a first conversion unit for performing a time domain transformation or a frequency domain transformation on a first audio signal in a first time interval, A second transformer for performing a transform different from the transform of the first transformer during a time domain transform or a frequency domain transform for a second audio signal of a time interval, and a second transformer for performing a transform on the first audio signal or the second audio signal, And a third converter for performing either a time domain transform or a frequency domain transform considering a transform technique of the second transformer, wherein the first audio signal is a signal obtained by transforming the audio signal transformed by the first transformer, The third conversion unit is restored based on the converted audio signal, or the second audio signal is restored based on the audio signal converted by the second conversion unit and the audio signal converted by the third conversion And the audio signal is restored based on the audio signal converted by the addition.

본 발명의 일측에 따르면 시간 영역 변환 또는 주파수 영역 변환된 제1 시간 구간의 제1 오디오 신호에 대하여 시간 영역 역변환 또는 주파수 영역 역변환을 수행하는 제1 역변환부, 상기 제1 시간 구간 이후에 상기 제1 시간 구간과 인접하는 제2 시간 구간의 제2 오디오 신호에 대하여 시간 영역 역변환 또는 주파수 영역 역변환 중에서 상기 제1 역변환부의 역변환과 상이한 역변환을 수행하는 제2 역변환부 및 상기 제1 오디오 신호 또는 상기 제2 오디오 신호에 대하여 상기 제1 역변환부 및 상기 제2 역변환부의 역변환 기법을 고려하여 시간 영역 역변환 또는 주파수 영역 역변환을 수행하는 제3 역변환부, 상기 제1 역변환부가 역변환한 제1 오디오 신호 및 상기 제3 역변환부가 역변환한 제1 오디오 신호에 기반하여 상기 제1 시간 구간의 오디오 신호를 복원하거나, 상기 제2 역변환부가 역변환한 제2 오디오 신호 빛 상기 제3 역변환부가 역변환한 제2 오디오 신호에 기반하여 상기 제2 시간 구간의 오디오 신호를 복원하는 신호 복원부를 포함하는 것을 특징으로 하는 오디오 신호 복호화기가 제공된다.According to an aspect of the present invention, there is provided a first inverse transformer or a frequency domain inverse transformer for performing a time domain inverse transform or a frequency domain inverse transform on a first audio signal in a time domain transform or a frequency domain transformed first time domain, A second inverse transformer for performing inverse transform different from an inverse transformation of the first inverse transformer in a time domain inverse transform or a frequency domain inverse transform with respect to a second audio signal in a second time interval adjacent to the time domain, A third inverse transformer performing a time domain inverse transform or a frequency domain inverse transform with respect to an audio signal in consideration of an inverse transformation technique of the first inverse transformer and the second inverse transformer, The audio signal of the first time interval is reproduced based on the first audio signal inversely transformed by the inverse transform unit And a signal reconstruction unit for reconstructing the audio signal of the second time interval based on the second audio signal inversely transformed by the third inverse transform unit, A signal decoder is provided.

본 발명의 실시예들에 따르면 주파수 영역 변환 기법 및 시간 영역 변환 기법을 전환하며 오디오 신호를 부호화할 수 있다.According to embodiments of the present invention, an audio signal can be encoded by switching between a frequency domain transform technique and a time domain transform technique.

본 발명의 실시예들에 따르면 주파수 영역 변환 기법 및 시간 영역 변환 기법을 전환하며 오디오 신호를 부호화하는 경우에, 추가적으로 부호화하는 데이터의 양을 최소화할 수 있다.According to embodiments of the present invention, when the audio signal is encoded by switching the frequency domain transform technique and the time domain transform technique, the amount of additional data to be encoded can be minimized.

도 1은 본 발명에 따른 오디오 신호 부호화기 및 오디오 신호 복호화기가 이용하는 겹침 변환 기법을 설명하기 위한 개념도이다.
도 2는 주파수 영역 변환 기법에서 시간 영역 변환 기법으로 전환된 경우에, 전환전의 신호에 기반하여 전환후의 오디오 신호를 추가적으로 부호화하는 본 발명의 일 실시예를 도시한 도면이다.
도 3은 주파수 영역 변환 기법에서 시간 영역 변환 기법으로 전환된 경우에, 전환 전후의 오디오 신호를 주파수 영역 변환하는 본 발명의 일 실시예를 도시한 도면이다.
도 4는 시간 영역 변환 기법에서 주파수 영역 변환 기법으로 전환된 경우에, 전환 이후의 오디오 신호를 시간 영역 변환하는 본 발명의 일 실시예를 도시한 도면이다.
도 5는 주파수 영역 변환 기법에서 시간 영역 변환 기법으로 전환된 경우에, 전환 이전 시간 영역의 길이를 전환 이후 시간 영역의 길이보다 작게 결정하는 본 발명의 일 실시예를 도시한 도면이다.
도 6은 주파수 영역 변환 기법에서 시간 영역 변환 기법으로 전환된 경우에, 전환 이전 오디오 신호를 분할하여 각각을 변환하는 본 발명의 일 실시예를 도시한 도면이다.
도 7은 시간 영역 변환 기법 또는 주파수 영역 변환 기법을 서로 전환하여 수행하는 본 발명의 일 실시예에 따른 부호화기의 구조를 도시한 블록도이다.
도 8은 시간 영역 변환 기법 또는 주파수 영역 변환 기법을 서로 전환하여 부호화된 오디오 신호를 본 발명의 일 실시예에 따라서 복호화하는 복호화기의 구조를 도시한 블록도이다.
도 9는 본 발명의 일 실시예에 따라서 시간 영역 변환 기법 또는 주파수 영역 변환 기법을 서로 전환하여 오디오 신호를 부호화하는 방법을 단계별로 도시한 순서도이다.
도 10은 본 발명의 일 실시예에 따라서 시간 영역 변환 기법 또는 주파수 영역 변환 기법을 서로 전환하여 부호화된 오디오 신호를 복호화하는 방법을 단계별로 도시한 순서도이다.
도 11은 본 발명의 일 실시예에 따른 오디오 신호 부호화기의 구조를 도시한 블록도이다.
도 12는 입력 신호에 대한 주파수 영역 역변환 또는 시간 영역 역변환을 수행하는 본 발명의 일 실시예를 단계별로 도시한 순서도 이다.
도 13은 입력 신호에 대한 주파수 영역 역변환 또는 시간 영역 역변환을 수행하는 본 발명의 다른 실시예를 단계별로 도시한 순서도이다.
도 14는 입력 신호에 대한 주파수 영역 변환 또는 시간 영역 변환을 수행하는 본 발명의 일 실시예를 단계별로 도시한 순서도이다.
도 15는 입력 신호에 대한 주파수 영역 변환 또는 시간 영역 변환을 수행하는 본 발명의 다른 실시예를 단계별로 도시한 순서도이다.FIG. 1 is a conceptual diagram for explaining a lapped transform technique used by an audio signal encoder and an audio signal decoder according to the present invention.
2 is a diagram illustrating an embodiment of the present invention for additionally encoding an audio signal after switching based on a signal before switching when the time-domain transform technique is switched in the frequency domain transform technique.
FIG. 3 is a diagram illustrating an embodiment of the present invention for frequency-domain-transforming an audio signal before and after a switch in a time domain transform technique in a frequency domain transform technique.
FIG. 4 is a diagram illustrating an embodiment of the present invention for time-domain-transforming an audio signal after switching when the time-domain transform technique is switched to a frequency-domain transform technique.
5 is a diagram illustrating an embodiment of the present invention for determining the length of the pre-switching time area to be smaller than the length of the time area after the switching when the time-domain conversion technique is used in the frequency domain conversion technique.
6 is a diagram illustrating an embodiment of the present invention for dividing an audio signal before conversion and converting each audio signal when the audio signal is converted into a time domain transformation scheme in a frequency domain transformation scheme.
FIG. 7 is a block diagram illustrating a structure of an encoder according to an embodiment of the present invention, which is performed by switching between a time domain transform technique or a frequency domain transform technique.
8 is a block diagram illustrating a structure of a decoder that decodes an encoded audio signal according to an embodiment of the present invention by switching a time domain transform technique or a frequency domain transform technique.
9 is a flowchart illustrating a method of encoding an audio signal by switching a time domain transform technique or a frequency domain transform technique to each other according to an embodiment of the present invention.
10 is a flowchart illustrating a method of decoding an encoded audio signal by switching the time domain transform technique or the frequency domain transform technique according to an embodiment of the present invention.
11 is a block diagram illustrating a structure of an audio signal encoder according to an embodiment of the present invention.
FIG. 12 is a flowchart showing an embodiment of the present invention for performing an inverse frequency domain transform or a time domain inverse transform on an input signal in steps.
FIG. 13 is a flowchart showing another embodiment of the present invention for performing an inverse frequency domain transform or a time domain inverse transform on an input signal.
FIG. 14 is a flowchart showing steps of an embodiment of the present invention for performing a frequency domain transformation or a time domain transformation on an input signal.
FIG. 15 is a flowchart showing steps of another embodiment of the present invention for performing frequency domain transformation or time domain transformation on an input signal.

본 발명의 다양한 실시예들에서 부호화되는 각각의 오디오 신호는 동일한 음원(audio source)으로부터 전송된 오디오 신호로서, 각각의 시간 구간에 따라서 구분된 것이다. 즉, 제1 오디오 신호 및 제2 오디오 신호는 각각 특정 음원에서 제1 시간 구간 및 제2 시간 구간에 대응되는 오디오 신호이다. 또한, 제1 변환 신호 및 제2 변환 신호는 각각 제1 오디오 신호 및 제2 오디오 신호가 시간 영역 변환 또는 주파수 영역 변환된 신호로서 제1 시간 구간 및 제2 시간 구간에 대응된다.Each of the audio signals encoded in the various embodiments of the present invention is an audio signal transmitted from the same audio source and is divided according to each time interval. That is, the first audio signal and the second audio signal are audio signals corresponding to the first time interval and the second time interval, respectively, in a specific sound source. The first transform signal and the second transform signal correspond to the first time interval and the second time interval, respectively, as time-domain transformed or frequency-domain transformed signals of the first audio signal and the second audio signal.

이하에서는 첨부된 도면을 참조하여 본 발명의 실시예를 상세히 설명한다.Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.

도 1은 본 발명에 따른 오디오 신호 부호화기 및 오디오 신호 복호화기가 이용하는 겹침 변환 기법을 설명하기 위한 개념도이다. 이하 도 1을 참조하여 겹침 변환 기법을 상세히 설명하기로 한다.FIG. 1 is a conceptual diagram for explaining a lapped transform technique used by an audio signal encoder and an audio signal decoder according to the present invention. Hereinafter, the lapped transform technique will be described in detail with reference to FIG.

오디오 신호를 주파수 영역으로 변환하기 위해서 변환의 기본 단위로 일정한 시간 구간을 설정하고, 시간 구간 내의 오디오 신호에 대한 주파수 영역 변환을 수행할 수 있다. 이때 시간 구간이 서로 겹쳐져 있지 않다면 부호화/복호화에서의 양자화로 인해 각 블록간에 불연속 신호가 발생하게 된다. 비디오 압축의 경우 이를 블러킹아티팩트 (Blocking Artifact)라고 하며, 이러한 열화는 오디오 신호에 있어서는 매우 치명적일 수 있다.In order to convert the audio signal into the frequency domain, a constant time interval may be set as a basic unit of the transformation, and the frequency domain transformation may be performed on the audio signal within the time interval. If the time intervals are not overlapped with each other, a discontinuous signal is generated between each block due to quantization in encoding / decoding. In the case of video compression, this is called a blocking artifact, and such degradation can be very lethal for audio signals.

따라서 오디오 압축을 위해서는 각 시간 구간이 서로 중첩되도록 설정하는 겹침 변환 기법이 사용된다. Therefore, for audio compression, a lapped transform technique is used to set each time interval to overlap each other.

겹침 변환 기법을 사용하면 변환된 신호를 복원할 때 인접 프레임간 중첩을 통해 복원하여 불연속이 발생하지 않고 적절히 디자인된 윈도우를 이용할 경우 원신호와 동일한 신호로 복원해 낼 수 있다. 예를 들어 오디오 압축에 흔히 쓰이는 Modified Discrete Cosine Transform는 sine 윈도우를 이용하여 프레임 길이의 1/2을 겹쳐 변환을 하게 되면 변환 및 역변환을 통해 원신호와 동일한 신호를 생성할 수 있다.When the transformed signal is reconstructed by using the lapped transform technique, it can be reconstructed by superimposing between neighboring frames to recover the same signal as the original signal when a properly designed window is used without discontinuity. For example, Modified Discrete Cosine Transform, which is commonly used for audio compression, can generate the same signal as the original signal through conversion and inverse transformation when a half of the frame length is transformed by using the sine window.

도 1의 (a)는 인접한 두 시간 구간의 오디오 신호를 함께 주파수 영역 변환하는 본 발명의 일 실시예를 도시한 도면이다. 각 시간 구간(111, 112, 113, 114) 동안 두 시간 구간 동안의 오디오 신호(121, 122, 123)가 변환된다. 즉, 특정한 시간 구간(112)에는 특정 시간 구간(112) 및 이전 시간 구간(111)의 오디오 신호가 주파수 영역으로 변환된다.FIG. 1 (a) illustrates an embodiment of the present invention in which audio signals of two adjacent time intervals are frequency-domain-transformed. The audio signals 121, 122, and 123 during two time intervals are converted during the respective time periods 111, 112, 113, That is, in a specific time period 112, the audio signals of the specific time period 112 and the previous time period 111 are converted into the frequency domain.

도 1의 (a)에 도시된 각각의 사선(131, 132, 133, 134, 135, 136)은 각 윈도우의 계수를 나타낸다. MDCT (Modified Discrete Cosine Transform)의 완전 복원 (Perfect Reconstruction)를 위해서는 하기 수학식 1과 같은 조건을 만족해야 한다. 여기에서 x는 입력 신호, X는 변환된 주파수 성분, y는 역MDCT를 통해 역변환된 신호를 의미한다. N번째 시간 구간에서 역변환된 제1 역변환 신호를 이전 N-1 번째 시간 구간에서 역변환된 제2 역변환 신호는 일정 시간 동안 겹친다. 제1 역변환 신호와 제2 역변환 신호가 겹치는 시간에 대해서 두 역변환 신호를 더하면 원신호를 복원할 수 있다. 2M은 변환의 길이를 의미한다.
Each slant 131, 132, 133, 134, 135, 136 shown in FIG. 1 (a) represents the coefficient of each window. For the perfect reconstruction of MDCT (Modified Discrete Cosine Transform), the following condition should be satisfied. Here, x denotes an input signal, X denotes a converted frequency component, and y denotes a signal that is inversely transformed through an inverse MDCT. The inverse transformed first inverse transformed signal in the Nth time interval overlaps the inverse transformed second inverse transformed signal in the (N-1) th time interval for a predetermined period of time. The original signal can be restored by adding two inverse transform signals to the time when the first inverse transform signal and the second inverse transform signal overlap each other. 2M means the length of the transformation.

[수학식 1]
[Equation 1]

h의 값은 131-136의 선을 의미하며 하기 수학식 2와 같은 조건을 만족하야 완전 복원 (Perfect Reconstruction)이 가능하다.
The value of h means the line 131-136, and a perfect reconstruction is possible if the condition of Equation 2 is satisfied.

[수학식 2]
&Quot; (2) "

h는 이전 프레임의 중첩된 부분의 윈도우와 좌우 대칭이어야 하며 각 계수의 제곱의 합이 1이 되어야 한다. 132와 133은 각 대칭의 모습을 보인다. 도1 (b)의 경우는 162와 163은 서로 대칭이고 각 계수의 제곱의 합이 1이므로 이러한 윈도우 계수 또한 완전 복원 (Perfect Reconstruction)을 가능하게 한다. 도1 (b)는 142부분에서 중첩이 되어 있으나 163부분의 윈도우 계수가 0이므로 중첩의 효과가 없어 그만큼 오버 코딩의 비율을 줄 일 수 있는 것이다.h must be symmetrical with the window of the overlapping part of the previous frame and the sum of the squares of each coefficient should be 1. 132 and 133 show the appearance of each symmetry. In the case of FIG. 1 (b), 162 and 163 are symmetrical with each other, and the sum of the squares of the coefficients is 1, so that this window coefficient also enables perfect reconstruction. 1 (b) is overlapped in the 142 portion, but since the window coefficient of the 163 portion is 0, there is no effect of overlap, so that the ratio of overcoding can be reduced.

도 2는 주파수 영역 변환 기법에서 시간 영역 변환 기법으로 전환된 경우에, 전환전의 신호에 기반하여 전환후의 오디오 신호를 추가적으로 부호화하는 본 발명의 일 실시예를 도시한 도면이다. 이하 도 2를 참조하여 전환후의 오디오 신호를 추가적으로 부호화하는 본 발명의 실시예를 상세히 설명하기로 한다.2 is a diagram illustrating an embodiment of the present invention for additionally encoding an audio signal after switching based on a signal before switching when the time-domain transform technique is switched in the frequency domain transform technique. Hereinafter, an embodiment of the present invention for additionally encoding an audio signal after switching will be described in detail with reference to FIG.

도 2의 실시예에서는 제4 시간 구간(211), 제3 시간 구간(212), 제1 시간 구간(213)에서는 주파수 영역 변환을 수행하고, 제2 시간 구간(214)에서는 시간 영역 변환으로 전환된 실시예가 도시되었다.In the embodiment of FIG. 2, the frequency domain transformation is performed in the fourth time interval 211, the third time interval 212 and the first time interval 213, and the time domain transformation is performed in the second time interval 214 Lt; / RTI >

제3 시간 구간(212)의 오디오 신호는 제4 시간 구간(211)의 오디오 신호와 함께 주파수 영역 변환되고(220), 제1 시간 구간(213)의 오디오 신호와 함께 주파수 영역 변환된다(230). 두 번 주파수 영역 변환된 제3 시간 구간(212)의 오디오 신호는 각각 역변환된 오디오 신호를 더하여 복원할 수 있다(271).The audio signal of the third time interval 212 is frequency domain transformed 220 together with the audio signal of the fourth time interval 211 and is frequency domain transformed 230 together with the audio signal of the first time interval 213. [ . The audio signal of the third time interval 212, which has been frequency-domain-converted twice, can be reconstructed by adding the inverse-transformed audio signal (271).

제1 시간 구간(213)의 오디오 신호는 제3 시간 구간(212)의 오디오 신호와 함께 주파수 영역 변환될 뿐이다(230). 시간 영역 신호로 역변환 할 수는 있으나 역변환된 신호는 1개뿐이므로 시간 영역 신호를 복원할 수는 없다.The audio signal of the first time interval 213 is only frequency-domain-transformed with the audio signal of the third time interval 212 (230). Although it can be inversely transformed into a time domain signal, there is only one inverse transformed signal. Therefore, the time domain signal can not be restored.

제2 시간 구간(214)의 오디오 신호는 시간 영역 변환 된다(250). 시간 영역 변환된 신호는 인접한 시간 구간(214)의 오디오 신호와는 관계없이 단독으로 복원될 수 있다.The audio signal of the second time interval 214 is time domain transformed (250). The time domain transformed signal can be recovered independently of the audio signal of the adjacent time interval 214. [

본 발명의 일 실시예에 따르면 주파수 영역 변환된 제1 시간 구간(213)의 오디오 신호를 복원하기 위하여 제1 시간 구간(213)의 오디오 신호를 추가적으로 변환할 수 있다.According to an embodiment of the present invention, the audio signal of the first time interval 213 may be further transformed to recover the audio signal of the first time interval 213 after the frequency domain transformation.

본 발명의 일 실시예에 따르면 제1 시간 구간(213)의 오디오 신호를 시간 영역 변환(240)할 수 있다. 제1 시간 구간(213)의 시간 영역 변환된 오디오 신호(240)에 기반하여 제1 시간 구간(213)의 오디오 신호를 복원할 수 있다(272).According to an embodiment of the present invention, the audio signal of the first time interval 213 may be time-domain transformed 240. The audio signal of the first time interval 213 may be recovered (272) based on the time domain transformed audio signal 240 of the first time interval 213.

본 발명의 일 실시예에 따르면 제1 시간 구간(213) 이전의 제3 시간 구간(212)의 참조 신호(231)를 참조하여 제1 시간 구간(213)의 오디오 신호를 효율적으로 변환할 수 있다.The audio signal of the first time interval 213 can be efficiently converted by referring to the reference signal 231 of the third time interval 212 before the first time interval 213 according to an embodiment of the present invention .

본 발명의 일 실시예에 따른 오디오 신호 부호화기는 제3 시간 구간의 제3 오디오 신호 및 제4 시간 구간의 제4 오디오 신호에 대하여 주파수 영역 변환을 수행하여 제4 변환 신호를 생성할 수 있다. 또한 제1 시간 구간의 제1 오디오 신호 및 제3 시간 구간의 제3 오디오 신호에 대하여 주파수 영역 변환을 수행하여 제3 오디오 신호를 생성할 수 있다.The audio signal encoder according to an embodiment of the present invention may perform a frequency domain transform on the third audio signal of the third time interval and the fourth audio signal of the fourth time interval to generate the fourth transformed signal. The first audio signal of the first time interval and the third audio signal of the third time interval may be subjected to frequency domain transformation to generate the third audio signal.

본 발명의 일 실시예에 따른 오디오 신호 복호화기는 제4 변환 신호 및 제1 변환 신호를 이용하여 제3 시간 구간의 제3 오디오 신호를 복원할 수 있다. 본 발명의 일 실시예에 따른 오디오 신호 복호화기는 제3 시간 구간의 제3 오디오 신호에 기반하여 제1 시간 구간의 제1 오디오 신호를 복원할 수 있다.The audio signal decoder according to the embodiment of the present invention can recover the third audio signal of the third time interval using the fourth transform signal and the first transform signal. The audio signal decoder according to the embodiment of the present invention can restore the first audio signal of the first time interval based on the third audio signal of the third time interval.

본 발명의 일 실시예에 따르면 오디오 신호 부호화기는 제3 오디오 신호에 기반하여 제1 오디오 신호에 대한 예측 신호를 생성할 수 있다. 본 발명의 일 실시예에 따르면 오디오 신호 부호화기는 예측 신호와 제1 오디오 신호와 예측 신호와의 차이만을 시간 영역 부호화할 수 있다. 이전 시간 구간의 정보에 기반하여 제1 오디오 신호를 부호화하므로 더욱 효율적으로 제1 오디오 신호를 부호화할 수 있다.According to an embodiment of the present invention, the audio signal encoder may generate a prediction signal for the first audio signal based on the third audio signal. According to an embodiment of the present invention, the audio signal encoder can temporally encode only the difference between the prediction signal, the first audio signal, and the prediction signal. The first audio signal is encoded based on the information of the previous time interval, so that the first audio signal can be encoded more efficiently.

도 3은 주파수 영역 변환 기법에서 시간 영역 변환 기법으로 전환된 경우에, 전환 전후의 오디오 신호를 주파수 영역 변환하는 본 발명의 일 실시예를 도시한 도면이다. 이하 도 3을 참조하여 전환 전후의 오디오 신호를 주파수 영역 변환하는 본 발명의 일 실시예를 상세히 설명하기로 한다. FIG. 3 is a diagram illustrating an embodiment of the present invention for frequency-domain-transforming an audio signal before and after a switch in a time domain transform technique in a frequency domain transform technique. Hereinafter, an embodiment of the present invention for frequency-domain-transforming an audio signal before and after switching will be described in detail with reference to FIG.

도 3의 실시예에서는 제4 시간 구간(311), 제2 시간 구간(312), 제1 시간 구간(313)에서는 주파수 영역 시간 변환을 수행하고, 제2 시간 구간(314)에서는 시간 영역 변환으로 전환된 실시예가 도시되었다.In the embodiment of FIG. 3, the frequency domain time transform is performed in the fourth time interval 311, the second time interval 312, and the first time interval 313, and the time domain transform is performed in the second time interval 314 The switched embodiment is shown.

본 발명의 일 실시예에 따른 오디오 신호 부호화기는 제3 시간 구간(312)의 제3 오디오 신호를 제4 시간 구간의 제4 오디오 신호와 함께 주파수 영역 변환한다. 따라서 본 발명의 일 실시예에 따른 오디오 신호 복호화기는 제3 시간 구간(312)의 제3 오디오 신호를 복원할 수 있다.The audio signal encoder according to the embodiment of the present invention frequency-converts the third audio signal of the third time interval 312 together with the fourth audio signal of the fourth time interval. Accordingly, the audio signal decoder according to the embodiment of the present invention can recover the third audio signal in the third time interval 312. [

본 발명의 일 실시예에 따른 오디오 신호 부호화기는 제2 시간 구간의 제2 오디오 신호를 시간 영역 변환할 수 있다. 시간 영역 변환된 오디오 신호는 다른 시간 구간의 오디오 신호와 관계없이 독립적으로 복원될 수 있으므로, 본 발명의 일 실시예에 따른 오디오 신호 복호화기는 제2 시간 구간의 제2 오디오 신호를 복원할 수 있다.The audio signal encoder according to an embodiment of the present invention may time-domain convert a second audio signal of a second time interval. Since the time-domain-converted audio signal can be independently restored regardless of the audio signal of another time interval, the audio signal decoder according to the embodiment of the present invention can recover the second audio signal of the second time interval.

본 발명의 일 실시예에 따른 오디오 신호 부호화기는 제1 시간 구간(313)의 제1 오디오 신호를 제2 시간 구간(314)의 제2 오디오 신호와 함께 추가적으로 주파수 영역 변환할 수 있다. 제1 시간 구간(313)의 제1 오디오 신호는 2번 주파수 영역 부호화되므로, 본 발명의 일 실시예에 따른 오디오 신호 복호화기는 제1 시간 구간(313)의 제1 오디오 신호를 복원할 수 있다.The audio signal encoder according to the embodiment of the present invention may further convert the first audio signal of the first time interval 313 together with the second audio signal of the second time interval 314 in the frequency domain. Since the first audio signal of the first time interval 313 is frequency-domain-encoded twice, the audio signal decoder of the embodiment of the present invention can recover the first audio signal of the first time interval 313.

도 4는 시간 영역 변환 기법에서 주파수 영역 변환 기법으로 전환된 경우에, 전환 이후의 오디오 신호를 시간 영역 변환하는 본 발명의 일 실시예를 도시한 도면이다. 이하 도 4를 참조하여 전환 이후의 오디오 신호를 시간 영역 변환하는 본 발명의 일 실시예를 상세히 설명하기로 한다.FIG. 4 is a diagram illustrating an embodiment of the present invention for time-domain-transforming an audio signal after switching when the time-domain transform technique is switched to a frequency-domain transform technique. Hereinafter, an embodiment of the present invention will be described in detail with reference to FIG.

도 4의 실시예에서는 제4 시간 구간(411) 및 제1 시간 구간(412)에서는 시간 영역 변환을 수행하고, 제2 시간 구간(413) 및 제3 시간 구간(414)에서는 주파수 영역 변환을 수행하는 실시예가 도시되었다.In the embodiment of FIG. 4, the time domain transform is performed in the fourth time period 411 and the first time period 412, and the frequency domain transform is performed in the second time period 413 and the third time period 414 Is shown.

본 발명의 일 실시예에 따른 오디오 신호 부호화기는 제4 시간 구간(411)의 제4 오디오 신호를 시간 영역 부호화하고, 제1 시간 구간(412)의 제1 오디오 신호를 시간 영역 부호화할 수 있다. 시간 영역 부호화된 오디오 신호는 다른 시간 구간의 오디오 신호와 관계없이 독립적으로 복원될 수 있다. 따라서 본 발명의 일 실시예에 따른 오디오 신호 복호화기는 제4 시간 구간(411) 및 제1 시간 구간(412)의 오디오 신호를 복원할 수 있다.The audio signal encoder according to an embodiment of the present invention may time-domain-encode the fourth audio signal of the fourth time interval 411 and time-domain-encode the first audio signal of the first time interval 412. [ The time-domain encoded audio signal can be independently restored regardless of the audio signal of the other time interval. Therefore, the audio signal decoder according to the embodiment of the present invention can recover the audio signals of the fourth time interval 411 and the first time interval 412. [

본 발명의 일 실시예에 따른 오디오 신호 부호화기는 제2 시간 구간(413)의 제2 오디오 신호는 제3 시간 구간(414)의 제3 오디오 신호와 함께 주파수 영역 부호화 할 수 있다. 본 발명의 일 실시예에 따른 오디오 신호 부호화기는 제3 시간 구간(414)의 제3 오디오 신호를 제3 시간 구간(414)이후의 오디오 신호와 함께 다시 주파수 영역 변환하므로 제3 시간 구간(414)의 제3 오디오 신호는 결과적으로 2번 부호화 된다. 따라서 본 발명의 일 실시예에 따른 오디오 신호 부호화기는 제3 오디오 신호를 복원할 수 있다.The audio signal encoder according to the embodiment of the present invention can frequency-domain-encode the second audio signal in the second time interval 413 together with the third audio signal in the third time interval 414. The audio signal encoder according to the embodiment of the present invention frequency-converts the third audio signal of the third time interval 414 together with the audio signals after the third time interval 414, so that the third time interval 414, Lt; / RTI > is encoded twice as a result. Therefore, the audio signal encoder according to the embodiment of the present invention can recover the third audio signal.

그러나 제2 시간 구간(413)의 제2 오디오 신호는 1번만 부호화 되었으므로, 본 발명의 일 실시예에 따른 오디오 신호 부호화기는 제2 시간 구간(413)의 제2 오디오 신호를 주파수 영역 역변환해도 제2 오디오 신호를 복원할 수 없다.However, since the second audio signal in the second time interval 413 is encoded once, the audio signal encoder according to the embodiment of the present invention can perform frequency domain inverse transform of the second audio signal in the second time interval 413, The audio signal can not be restored.

본 발명의 일 실시예에 따른 오디오 신호 부호화기는 제2 시간 구간(413)의 제2 오디오 신호에 대하여 추가적으로 시간 영역 변환을 수행할 수 있다. 제2 오디오 신호가 시간 영역 부호화되면, 본 발명의 일 실시예에 따른 오디오 신호 복호화기는 제2 시간 구간(413)의 제2 오디오 신호를 복원할 수 있다.The audio signal encoder according to the embodiment of the present invention may further perform time domain transformation on the second audio signal of the second time period 413. If the second audio signal is temporally coded, the audio signal decoder according to an embodiment of the present invention can recover the second audio signal of the second time period 413.

본 발명의 일 실시예에 따른 오디오 신호 부호화기는 제1 시간 구간(412)의 제1 오디오 신호에 기반하여 제2 오디오 신호에 대한 예측 신호를 생성하고, 제2 오디오 신호와 제2 오디오 신호에 대한 예측 신호의 차를 시간 영역 부호화할 수 있다. 제1 오디오 신호를 복원한 경우에만 제2 오디오 신호를 복원할 수 있으나, 제2 오디오 신호를 좀더 효율적으로 부호화할 수 있다.The audio signal encoder according to an embodiment of the present invention generates a prediction signal for the second audio signal based on the first audio signal in the first time period 412 and generates a prediction signal for the second audio signal, The difference between the prediction signals can be time-domain encoded. The second audio signal can be restored only when the first audio signal is restored, but the second audio signal can be more efficiently encoded.

도 5는 주파수 영역 변환 기법에서 시간 영역 변환 기법으로 전환된 경우에, 전환 이전 시간 영역의 길이를 전환 이후 시간 영역의 길이보다 짧게 결정하는 본 발명의 일 실시예를 도시한 도면이다. 이하 도 5를 참조하여 전환 이전의 시간 영역의 길이를 전환 이후 시간 영역의 길이보다 짧게 결정하는 본 발명의 일 실시예를 상세히 설명하기로 한다.5 is a diagram illustrating an embodiment of the present invention in which the length of the pre-switching time region is shorter than the length of the time region after the switching in the frequency domain conversion scheme. Hereinafter, an embodiment of the present invention will be described in detail with reference to FIG. 5, in which the length of the time domain before switching is determined to be shorter than the length of the time domain after switching.

도5에서는 도2에서의 실시예와 유사하게 실시예에서는 제4 시간 구간(511), 제3 시간 구간(512) 및 제1 시간 구간(513)에서는 주파수 영역 변환을 수행하고, 제2 시간 구간(514)에서는 시간 영역 변환을 수행하는 실시예가 도시되었다.In FIG. 5, similar to the embodiment of FIG. 2, in the embodiment, the frequency domain transformation is performed in the fourth time interval 511, the third time interval 512 and the first time interval 513, (514) illustrates an embodiment that performs time domain transformation.

1번만 주파수 영역 변환된 제1 시간 구간(513)의 제1 오디오 신호를 복원하기 위하여 본 발명에 따른 오디오 신호 부호화기는 추가적으로 시간 영역 변환을 수행할 수 있다.The audio signal encoder according to the present invention may further perform time domain transformation to recover the first audio signal of the first time period 513 that has undergone frequency domain transformation only once.

본 발명의 일 실시예에 따른 오디오 신호 부호화기는 제1 시간 구간(513)의 시간 길이(521)를 제2 시간 구간(513)의 시간 길이(522) 또는 제3 시간 구간(512)의 시간 길이보다 짧게 결정할 수 있다.The audio signal encoder according to an exemplary embodiment of the present invention may store the time length 521 of the first time interval 513 in the time length 522 of the second time interval 513 or the time length 522 of the third time interval 512 It can be determined shorter.

제1 시간 구간(513)의 길이가 짧으면 추가적으로 시간 영역 변환되는 오디오 신호(560)의 양이 감소한다. 더 적은 오디오 신호만을 추가적으로 변환하므로 오디오 신호 부호화기의 부호화 효율이 향상된다.If the length of the first time interval 513 is short, the amount of the audio signal 560 that is further time-domain transformed decreases. The encoding efficiency of the audio signal encoder is improved because only a few audio signals are additionally converted.

도 6은 주파수 영역 변환 기법에서 시간 영역 변환 기법으로 전환된 경우에, 전환 이전 오디오 신호를 분할하여 각각을 변환하는 본 발명의 일 실시예를 도시한 도면이다. 이하 도 6을 참조하여 전환 이전의 오디오 신호를 분할하여 각각을 변환하는 본 발명의 일 실시예를 상세히 설명하기로 한다.6 is a diagram illustrating an embodiment of the present invention for dividing an audio signal before conversion and converting each audio signal when the audio signal is converted into a time domain transformation scheme in a frequency domain transformation scheme. Hereinafter, an embodiment of the present invention will be described in detail with reference to FIG.

도 6의 실시예에서는 제4 시간 구간(611), 제3 시간 구간(612) 및 제1 시간 구간(613)의 오디오 신호에 대하여 주파수 영역 변환을 수행하고, 제1 시간 구간(613) 및 제2 시간 구간(614)의 오디오 신호에 대하여 시간 영역 변환을 수행하는 실시예가 도시되었다.In the embodiment of FIG. 6, the audio signal of the fourth time interval 611, the third time interval 612, and the first time interval 613 is subjected to frequency domain transformation, and the first time interval 613 and the second time interval 613, An embodiment of performing a time domain transform on the audio signal of the two time period 614 is shown.

본 발명의 일 실시예에 따른 오디오 신호 부호화기는 제3 시간 구간(612)의 제3 오디오 신호는 제4 시간 구간(611)의 제4 오디오 신호와 한번, 제1 시간 구간(613)의 제1 오디오 신호와 다시 한번 주파수 영역 변환된다. 따라서 본 발명의 일 실시예에 따른 오디오 신호 복호화기는 제3 시간 구간(612)의 제3 오디오 신호를 복원할 수 있다.The audio signal encoder according to the embodiment of the present invention is configured such that the third audio signal of the third time interval 612 is multiplexed with the fourth audio signal of the fourth time interval 611 The audio signal is again frequency domain transformed. Accordingly, the audio signal decoder according to the embodiment of the present invention can recover the third audio signal of the third time interval 612. [

본 발명의 일 실시예에 따른 오디오 신호 부호화기는 제1 시간 구간(613)을 제1 시간 영역(621) 및 제2 시간 영역(622)으로 구분할 수 있다. 본 발명의 일 실시예에 따른 오디오 신호 부호화기는 제1 시간 영역(621)의 제1 오디오 신호를 제3 시간 구간의 제3 오디오 신호와 함께 주파수 영역 부호화할 수 있다. 본 발명의 일 실시예에 따른 오디오 신호 부호화기는 제2 시간 영역(621)의 제1 오디오 신호에 대하여 시간 영역 변환을 수행할 수 있다.The audio signal encoder according to the embodiment of the present invention may divide the first time interval 613 into a first time interval 621 and a second time interval 622. [ The audio signal encoder according to the embodiment of the present invention can frequency-domain-code the first audio signal in the first time domain 621 together with the third audio signal in the third time domain. The audio signal encoder according to the embodiment of the present invention may perform time domain transformation on the first audio signal in the second time domain 621. [

본 발명의 일 실시예에 따른 오디오 신호 복호화기는 시간 영역 변환된 제2 시간 영역(622)의 제1 오디오 신호를 복원할 수 있다.The audio signal decoder according to the embodiment of the present invention can recover the first audio signal in the time domain converted second time domain 622. [

도 1의 (b)를 참조하면, 본 발명의 일 실시예에 따른 오디오 신호 부호화기는 직각 시간 윈도우(Rectangular time window)를 이용하여 제1 시간 영역 및 제2 시간 영역을 구분할 수 있다. 제4 시간 구간(611)및 제3 시간 구간(612)에 사용되는 싸인 (sine) 시간 윈도우(141) 는 특정 시간 구간 전부에 걸쳐서 0이상의 계수를 가지지만, 제1 시간 구간에 사용되는 직각 시간 윈도우(162, 163, 164, 165)는 특정 시간 구간(142) 중에서 특정 시간 영역(162, 163)에서는 0이상의 계수를 가지나, 다른 시간 영역(162, 163)에서는 0의 이득을 가진다.Referring to FIG. 1 (b), the audio signal encoder according to an embodiment of the present invention can distinguish the first time region and the second time region using a rectangular time window. The sine time window 141 used for the fourth time interval 611 and the third time interval 612 has a coefficient of zero or more over all the predetermined time intervals, The windows 162, 163, 164 and 165 have a coefficient of zero or more in the specific time period 162 and 163 in the specific time period 142 but have a gain of zero in the other time periods 162 and 163.

본 발명의 일 실시예에 따른 오디오 신호 부호화기는 특정 시간 영역(141, 142)내에서 싸인 시간 윈도우(141)와 직각 시간 윈도우(162, 163, 164, 165)는 MDCT의 완전복원(Perfect Reconstruction)을 만족하기 위한 윈도우 계수의 제약에 의해 디자인된다.In the audio signal encoder according to the embodiment of the present invention, the sine time window 141 and the orthogonal time windows 162, 163, 164, and 165 are perfectly reconstructed in the specific time regions 141 and 142, The constraint of the window coefficient is satisfied.

본 발명의 일 실시예에 따른 오디오 신호 부호화기가 직각 시간 윈도우를 사용하면 제1 시간 구간(613)의 제1 오디오 신호의 전체 성분 중에서 절반만을 주파수 영역 변환할 수 있으므로, 제3 시간 구간(612)의 제3 오디오 신호에 대한 주파수 영역 변환과 유사하게, 하드웨어의 변화를 최소화하여 주파수 영역 변환할 수 있다.Since the audio signal encoder according to the embodiment of the present invention can perform frequency domain conversion only on half of the entire components of the first audio signal of the first time interval 613 using the orthogonal time window, Similar to the frequency domain transform for the third audio signal of FIG.

도 6에서 도시된 실시예에 따르면 오디오 신호를 추가적으로 변환하지 않고, 오디오 신호를 부호화할 수 있다.According to the embodiment shown in FIG. 6, the audio signal can be encoded without further conversion of the audio signal.

도 7은 시간 영역 변환 기법 또는 주파수 영역 변환 기법을 서로 전환하여 수행하는 본 발명의 일 실시예에 따른 부호화기의 구조를 도시한 블록도이다. 이하 도 7을 참조하여 본 발명의 일 실시예에 따른 부호화기의 동작을 상세히 설명하기로 한다. 본 발명의 일 실시예에 따른 부호화기(700)는 변환 기법 결정부(710), 시간 구간 분할부(720), 제1 변환부(730), 제2 변환부(740), 제3 변환부(750), 제4 변환부(760), 참조 신호 생성부(770), 신호 예측부(780)를 포함한다.FIG. 7 is a block diagram illustrating a structure of an encoder according to an embodiment of the present invention, which is performed by switching between a time domain transform technique or a frequency domain transform technique. Hereinafter, the operation of the encoder according to an embodiment of the present invention will be described in detail with reference to FIG. The encoder 700 according to an embodiment of the present invention includes a conversion technique determination unit 710, a time interval division unit 720, a first conversion unit 730, a second conversion unit 740, A fourth conversion unit 760, a reference signal generation unit 770, and a signal prediction unit 780.

본 발명의 일 실시예에 따르면 각각의 변환부(730, 740, 750)는 겹침 변환 기법을 이용하여 주파수 영역 변환을 수행할 수 있다. 겹침 변환 기법에 대해서는 도 1에서 상세히 설명한바 있으므로 이하 자세한 설명은 생략하기로 한다.According to an embodiment of the present invention, the transformers 730, 740, and 750 may perform the frequency domain transform using the lapped transform technique. Since the lapped transform technique has been described in detail with reference to FIG. 1, a detailed description will be omitted.

제1 변환부(730)는 시간 영역 변환 또는 주파수 영역 변환된 제1 시간 구간의 제1 변환 신호에 대하여 시간 영역 역변환 또는 제1 시간 구간의 제1 오디오 신호와 제3 시간 구간의 제3 오디오 신호에 대하여 주파수 영역 역변환을 수행하여 제1 역변환 신호를 생성한다.The first transformer 730 transforms the first transformed signal of the first time interval or the first transformed signal of the third time interval into the first transformed signal or the first transformed signal of the third time interval, Domain inverse transform to generate a first inverse transform signal.

제2 변환부(740)는 제1 시간 구간 이후에 제1 시간 구간과 인접하는 제2 시간 구간의 제2 오디오 신호에 대하여 시간 영역 역변환 또는 주파수 영역 역변환 중에서 제1 역변환부의 역변환과 상이한 역변환을 수행하여 제2 역변환 신호를 생성한다. 제2 변환부(740)에서 주파수 영역 변환을 수행할 경우 제1 시간 구간의 제1 오디오 신호와 제2 시간 구간의 제2 오디오 신호를 이용하여 변환을 수행한다.The second transformer 740 performs inverse transform of the second audio signal of the second time interval adjacent to the first time interval after the first time interval, different from the inverse transformation of the first inverse transforming unit in the time domain inverse transform or the frequency domain inverse transform Thereby generating a second inverse transform signal. When performing the frequency domain transform in the second transform unit 740, the transform is performed using the first audio signal in the first time interval and the second audio signal in the second time interval.

제1 변환부(730)가 수행하는 변환과 제2 변환부(740)가 수행하는 변환이 상이하면, 제1 변환부(730) 또는 제2 변환부(740)가 변환한 오디오 신호를 복원하기에 불충분한 정보만이 변환될 수 있다. 즉, 각각의 변환부(730, 740)가 변환한 오디오 신호 만으로는 원 오디오 신호를 복원할 수 없다.When the transform performed by the first transforming unit 730 is different from the transform performed by the second transforming unit 740, the first transforming unit 730 or the second transforming unit 740 reconstructs the transformed audio signal Only information that is insufficient for the user can be converted. That is, the original audio signal can not be restored by only the audio signal converted by each of the conversion units 730 and 740.

본 발명의 일 실시예에 따른 오디오 신호 복호화기(700)는 제1 변환부(730) 또는 제2 변환부(740)가 변환한 오디오 신호를 추가적으로 변환하고, 본 발명의 일 실시예에 따른 오디오 신호 복호화기(800)는 추가적으로 변환된 오디오 신호에 기반하여 제1 변환부(730) 또는 제2 변환부(740)가 변환한 오디오 신호를 정확히 복원할 수 있다.The audio signal decoder 700 according to the embodiment of the present invention further converts the audio signal converted by the first converter 730 or the second converter 740 and outputs the audio signal according to the embodiment of the present invention, The signal decoder 800 can accurately recover the audio signal converted by the first converter 730 or the second converter 740 based on the converted audio signal.

제3 변환부(750)는 제1 오디오 신호 또는 제2 오디오 신호에 대하여 상기 제1 변환부 및 상기 제2 변환부의 변환을 고려하여 시간 영역 변환 또는 주파수 영역 변환 중 어느 한 변환을 수행하여 제3 변환 신호를 생성한다. 제3 변환부(750)는 제1 오디오 신호가 주파수 영역 변환 되었는지, 시간 영역 변환 되었는지 여부에 따라서 제1 오디오 신호를 변환할지, 제2 오디오 신호를 변환할 수 있다. 또한 제3 변환부(750)는 제1 오디오 신호가 주파수 영역 변환 되었는지, 시간 영역 변환 되어 있는 여부에 따라서 주파수 영역 변환을 수행할지, 시간 영역 변환을 수행할 수 있다. 있다. 여러 가지 경우에 대하여 제1 오디오 신호 또는 제2 오디오 신호를 추가적으로 변환하는 다양한 실시예가 도 2 내지 도 6에서 설명된바 있다.The third transform unit 750 performs either the time domain transform or the frequency domain transform on the first audio signal or the second audio signal in consideration of the transformations of the first transform unit and the second transform unit, And generates a conversion signal. The third conversion unit 750 may convert the first audio signal or the second audio signal depending on whether the first audio signal is frequency domain transformed or time domain transformed. Also, the third transform unit 750 may perform frequency domain transform or time domain transform according to whether the first audio signal is frequency domain transformed or time domain transformed. have. Various embodiments for additionally converting the first audio signal or the second audio signal in various cases have been described with reference to FIGS.

제3 변환부(750)가 제1 오디오 신호를 추가적으로 변환하는 경우에, 제1 오디오 신호는 제1 변환부(730)가 변환한 제1 변환 신호 및 제3 변환부(750)가 변환한 제3 변환 신호에 기반하여 복원될 수 있다.When the third converter 750 further converts the first audio signal, the first audio signal is converted into the first audio signal by the first conversion signal converted by the first converter 730 and the first audio signal converted by the third converter 750 3 conversion signal.

제3 변환부(750)가 제2 오디오 신호를 추가적으로 변환하는 경우에, 제2 오디오 신호는 제2 변환부(740)가 변환한 제2 변환 신호 및 제3 변환부(750)가 변환한 제3 변환 신호에 기반하여 복원될 수 있다.In the case where the third conversion unit 750 further converts the second audio signal, the second audio signal is converted into a second audio signal by the second conversion unit 740 and the second audio signal converted by the third conversion unit 750 3 conversion signal.

본 발명의 일 실시예에 따르면 변환 기법 결정부(710)는 제1 시간 구간의 제1 오디오 신호 및 제2 시간 구간의 제2 오디오 신호에 대하여 제1 변환부(730), 제2 변환부(740) 및 제3 변환부(350)가 수행하는 변환 기법을 결정할 수 있다.According to an embodiment of the present invention, the conversion scheme determination unit 710 may determine the conversion scheme of the first audio signal in the first time interval and the second audio signal in the second time interval, using the first conversion unit 730, 740 and the third conversion unit 350 may be determined.

본 발명의 일 실시예에 따르면, 변환 기법 결정부(710)는 사람의 말소리 등의 음성 신호에 대해서는 각각의 변환부(730, 740, 750)가 시간 영역 변환을 수행하고, 음악 소리 등에 대해서는 각각의 변환부(730, 740, 750)가 주파수 영역 변환을 수행하도록 변환 기법을 결정할 수 있다.According to an embodiment of the present invention, the conversion technique determination unit 710 performs time-domain conversion on the voice signals such as a human speech or the like by the converting units 730, 740, and 750, The transforming unit 730, 740, and 750 of FIG.

본 발명의 일 실시예에 따르면 제1 변환부(730)는 제1 오디오 신호 및 제1 시간 구간이전에 제1 시간 구간과 인접한 제3 시간 구간의 제3 오디오 신호에 대하여 주파수 영역 변환을 수행할 수 있다. 또한 제4 변환부(760)는 제3 오디오 신호 및 제4 시간 구간의 제4 오디오 신호에 대하여 주파수 영역 변환을 수행할 수 있다. 제4 시간 구간은 제3 시간 구간 이전이며, 제3 시간 구간에 인접한 시간 구간이다.According to an embodiment of the present invention, the first transform unit 730 performs frequency domain transform on the first audio signal and the third audio signal of the third time interval adjacent to the first time interval before the first time interval . The fourth transform unit 760 may perform frequency domain transform on the third audio signal and the fourth audio signal of the fourth time interval. The fourth time interval is before the third time interval, and is a time interval adjacent to the third time interval.

본 발명의 일 실시예에 따르면 오디오 신호 부호화기(700)는 제3 오디오 신호에 기반하여 추가적으로 제1 오디오 신호를 시간 영역 부호화할 수 있다. 제4 변환부(760)가 주파수 영역 변환을 수행하여 제4 변환 신호를 생성하고, 참조 신호 생성부(770)는 제4 변환 신호 및 제1 변환 신호에 기반하여 시간 영역 참조 신호를 생성할 수 있다. 시간 영역 참조 신호는 참조 신호 생성부(770)가 제1 변환 신호 및 제4 변환 신호에 기반하여 제3 시간 구간의 제3 오디오 신호를 복원한 신호이다.According to an embodiment of the present invention, the audio signal encoder 700 may additionally time-domain-encode the first audio signal based on the third audio signal. The fourth transform unit 760 performs frequency domain transform to generate a fourth transform signal, and the reference signal generator 770 can generate a time domain reference signal based on the fourth transform signal and the first transform signal have. The time-domain reference signal is a signal obtained by restoring the third audio signal of the third time interval based on the first and fourth converted signals by the reference signal generating unit 770.

신호 예측부(780)는 시간 영역 참조 신호에 기반하여 제1 시간 구간의 오디오 신호에 대한 예측 신호를 생성한다. 본 발명의 일 실시예에 따르면 신호 예측부(780)는 선형 예측법(Linear Prediction) 또는 장시간 예측법(Long Term Prediction)등을 이용하여 시간 영역 참조 신호로부터 예측 신호를 생성할 수 있다. 제3 오디오 신호에 기반하여 제1 시간 구간의 오디오 신호를 시간 영역 변환하면, 변환된 후의 데이터 크가가 감소하므로 효율적으로 부호화할 수 있다.The signal predicting unit 780 generates a prediction signal for the audio signal of the first time interval based on the time-domain reference signal. According to an embodiment of the present invention, the signal predicting unit 780 may generate a prediction signal from a time domain reference signal using a linear prediction method or a long term prediction method. If the audio signal of the first time interval is time domain transformed based on the third audio signal, the data size after the transform is reduced, so that efficient coding can be performed.

본 발명의 일 실시예에 따르면 제3 변환부(750)는 제1 오디오 신호와 예측 신호의 차이를 시간 영역 변환하여 제1 오디오 신호를 추가적으로 변환할 수 있다.According to an embodiment of the present invention, the third transform unit 750 may further convert the first audio signal by time-domain transforming the difference between the first audio signal and the prediction signal.

본 발명의 다른 실시예에 따르면 제3 변환부(750)는 제3 시간 구간의 제3 오디오 신호에 기반하지 않고, 제1 오디오 신호를 단독으로 시간 영역 변환할 수 있다. 본 발명의 일 실시예에 따르면 제3 변환부(750)는 ADPCM 부호화 기법, Mu-Law 부호화 기법 및 A-law 부호화 기법 중 적어도 하나의 부호화 기법에 따라서 상기 시간 영역 변환을 수행할 수 있다. 제1 오디오 신호가 제3 오디오 신호를 참조하지 않고 단독으로 시간 영역 변환 되므로, 간단히 변환할 수 있다.According to another embodiment of the present invention, the third conversion unit 750 can convert the first audio signal alone in time domain, not based on the third audio signal in the third time interval. According to an embodiment of the present invention, the third transform unit 750 may perform the time domain transform according to at least one of an ADPCM coding scheme, a Mu-law coding scheme, and an A-law coding scheme. Since the first audio signal is time-domain converted alone without referring to the third audio signal, it can be simply converted.

본 발명의 일 실시예에 따른 오디오 신호 부호화기(700)는 추가적으로 변환되는 오디오 신호가 포함된 시간 구간의 길이를 다른 시간 구간의 길이보다 짧게 결정할 수 있다. 즉, 제1 시간 구간의 제1 오디오 신호는 주파수 영역 변환되고, 제2 시간 구간의 제2 오디오 신호는 시간 영역 변환되는 경우에, 본 발명에 따른 오디오 신호 부호화기는 제1 오디오 신호를 추가적으로 시간 영역 변환할 수 있다. 이 경우, 본 발명에 따른 오디오 신호 부호화기는 제1 시간 구간의 길이를 제2 시간 구간의 길이보다 짧게 결정할 수 있다. 제1 시간 구간의 길이가 짧아지면 추가적으로 변환되는 제1 오디오 신호의 길이가 감소하므로, 좀더 적은 계산을 수행하여 오디오 신호를 변환할 수 있다. 변환된 오디오 신호가 통신망을 통해 전송되는 경우 적은 대역폭으로도 변환된 오디오 신호를 전송할 수 있다. 또한 변환된 오디오 신호가 저장 매체에 저장되는 경우 저장 공간을 최소화할 수 있다.The audio signal encoder 700 according to the embodiment of the present invention may determine the length of the time interval including the audio signal to be converted to be shorter than the length of the other time intervals. That is, when the first audio signal in the first time interval is frequency-domain transformed and the second audio signal in the second time interval is time-domain transformed, the audio signal encoder according to the present invention further adds the first audio signal to the time domain Can be converted. In this case, the audio signal encoder according to the present invention can determine the length of the first time interval to be shorter than the length of the second time interval. If the length of the first time interval is shortened, the length of the first audio signal to be additionally converted is reduced, so that the audio signal can be converted by performing less calculation. When the converted audio signal is transmitted through a communication network, the converted audio signal can be transmitted even with a small bandwidth. Also, when the converted audio signal is stored in the storage medium, the storage space can be minimized.

본 발명의 일 실시예에 따르면 시간 구간 분할부(720)는 제1 시간 구간을 제1 시간 영역 및 제2 시간 영역으로 분할할 수 있다. 제1 변환부(730)는 제1 시간 구간의 제1 오디오 신호 중에서 제2 시간 영역의 제1 오디오 신호를 제외하고 제1 시간 영역의 제1 오디오 신호만으로 변환을 수행할 수 있다. 또한 제3 변환부(750)는 제1 시간 구간의 제1 오디오 신호 중에서 제1 시간 영역의 제1 오디오 신호를 제외하고 제2 시간 영역의 제1 오디오 신호 만으로 변환을 수행할 수 있다. 제3 변환부(750)가 수행하는 변환은 제1 변환부(730)가 수행하는 변환과 상이한 변환이다.According to an exemplary embodiment of the present invention, the time period division unit 720 may divide the first time period into a first time period and a second time period. The first converter 730 may convert only the first audio signal of the first time domain except for the first audio signal of the second time domain among the first audio signals of the first time domain. Also, the third converter 750 may convert only the first audio signal of the second time domain, excluding the first audio signal of the first time domain, among the first audio signals of the first time domain. The transform performed by the third transform unit 750 is a transform different from the transform performed by the first transform unit 730.

제1 변환부(730)는 제1 시간 영역의 오디오 신호에 대하여 주파수 영역 변환을 수행하고, 제3 변환부(750)는 제2 시간 영역의 오디오 신호에 대하여 시간 영역 변환을 수행한다고 가정하자. 제1 시간 영역의 오디오 신호는 1번만 변환된다. 제1 변환부(730)는 1번의 주파수 영역 변환을 이용하여 제1 오디오 신호의 일부만을 변환한다. 따라서 싸인 시간 윈도우를 사용하는 일반적인 겹침 변환 기법에 따르면 성공적으로 복원될 수 없다.It is assumed that the first transforming unit 730 performs a frequency domain transform on the audio signal in the first time domain and the third transforming unit 750 performs the time domain transform on the audio signal in the second time domain. The audio signal in the first time domain is converted only once. The first transforming unit 730 transforms only a part of the first audio signal using one frequency domain transform. Therefore, it can not be restored successfully by the general lapped transform technique using the sign-time window.

본 발명의 일 실시예에 따르면 시간 구간 분할부(720)는 직각 시간 윈도우를 이용하여 제1 시간 영역의 오디오 신호와 제2 시간 영역의 오디오 신호를 구분할 수 있다. 제1 변환부(730)가 직각 시간 윈도우를 이용하면 제1 오디오 신호 중에서 제1 시간 영역에 위치하는 오디오 신호를 전부 변환할 수 있다. 따라서 본 발명에 따른 오디오 신호 복호화기가 제1 시간 영역에 위치하는 오디오 신호를 복원할 수 있다.According to an embodiment of the present invention, the time period division unit 720 can separate the audio signal of the first time domain from the audio signal of the second time domain using the rectangular time window. When the first converter 730 uses the rectangular time window, all of the audio signals located in the first time region among the first audio signals can be converted. Therefore, the audio signal decoder according to the present invention can recover the audio signal located in the first time domain.

본 발명의 일 실시예에 따르면 제3 변환부(750)는 제2 시간 영역에 포함된 오디오 신호를 시간 영역 변환할 수 있다. 본 실시예에 따르면 추가적으로 오디오 신호를 변환하지 않을 수 있다.According to an embodiment of the present invention, the third conversion unit 750 may time-domain convert an audio signal included in the second time region. According to the present embodiment, the audio signal may not be further converted.

부호화부(790)는 각각의 변환부(730, 740, 750)에서 변환된 신호를 부호화한다. 본 발명의 일 실시예에 따르면 부호화부(790)는 주파수 영역 변환된 신호는 주파수 영역 부호화할 수 있다. 본 발명의 일 실시예에 따르면 부호화부(790)는 MPEG등에서 사용되는 AAC부호화 기법등에 따라서 주파수 영역 부호화를 수행할 수 있다.The encoding unit 790 encodes the signals converted by the converting units 730, 740, and 750, respectively. According to an embodiment of the present invention, the encoder 790 can frequency-domain-code the frequency-domain-converted signal. According to an embodiment of the present invention, the encoding unit 790 can perform frequency domain encoding according to an AAC encoding technique used in MPEG and the like.

본 발명의 일 실시예에 따르면 부호화부(790)는 시간 영역 변환된 신호는 시간 영역 부호화할 수 있다. 본 발명의 일 실시예에 따르면 부호화부(790)는 AMR(Adaptive MultiRate), EVRC(Enhanced Variable Rate Codec)등의 CELP 기반 시간 영역 부호화 기법에 따라서 시간 영역 부호화를 수행할 수 있다.According to an embodiment of the present invention, the encoding unit 790 can time-domain-encode the time-domain-converted signal. According to an embodiment of the present invention, the encoding unit 790 may perform time-domain encoding according to a CELP-based time-domain encoding scheme such as AMR (Adaptive MultiRate) and EVRC (Enhanced Variable Rate Codec).

도 8은 시간 영역 변환 기법 또는 주파수 영역 변환 기법을 서로 전환하여 부호화된 오디오 신호를 본 발명의 일 실시예에 따라서 복호화하는 복호화기의 구조를 도시한 블록도이다. 본 발명의 일 실시예에 따른 복호화기(800)는 변환 모드 판단부(810), 제1 역변환부(820), 제2 역변환부(830), 제3 역변환부(840), 제4 역변환부(850), 참조 신호 생성부(860), 신호 예측부(870), 신호 복원부(890)를 포함한다.8 is a block diagram illustrating a structure of a decoder that decodes an encoded audio signal according to an embodiment of the present invention by switching a time domain transform technique or a frequency domain transform technique. The decoder 800 according to an embodiment of the present invention includes a conversion mode determination unit 810, a first inverse transformation unit 820, a second inverse transformation unit 830, a third inverse transformation unit 840, A reference signal generating unit 860, a signal predicting unit 870, and a signal restoring unit 890.

본 발명의 일 실시예에 따르면 각각의 역변환부(810, 820, 830)는 겹침 변환 기법에 기반하여 주파수 영역 역변환을 수행할 수 있다. 겹침 변환 기법에 대해서는 도 1에서 상세히 설명한바 있으므로 이하 자세한 설명은 생략하기로 한다.In accordance with an embodiment of the present invention, each inverse transform unit 810, 820, 830 may perform frequency domain inverse transform based on the lapped transform technique. Since the lapped transform technique has been described in detail with reference to FIG. 1, a detailed description will be omitted.

제1 역변환부(820)는 시간 영역 변환 또는 주파수 영역 변환된 제1 시간 구간의 제1 변환 신호에 대하여 시간 영역 역변환 또는 주파수 영역 역변환을 수행하여 제1 역변환 신호를 생성한다.The first inverse transformer 820 performs a time domain inverse transform or a frequency domain inverse transform on the first transform signal in the time domain transform or the frequency domain transformed first time domain to generate a first inverse transform signal.

제2 역변환부(830)는 제2 시간 구간의 제2 변환 신호에 대하여 시간 영역 역변환 또는 주파수 영역 역변환 중에서 제1 역변환부(820)의 역변환과 상이한 역변환을 수행하여 제2 역변환 신호를 생성한다. 제2 시간 구간은 제1 시간 구간과 인접하지만, 제2 시간 구간은 제1 시간 구간 이후의 시간 구간이다.The second inverse transformer 830 performs a second inverse transform on the second transformed signal in the second time interval from the inverse transformation of the first inverse transforming unit 820 in the time domain inverse transform or the frequency domain inverse transform to generate a second inverse transform signal. The second time interval is adjacent to the first time interval, while the second time interval is a time interval after the first time interval.

제1 역변환부(820)가 수행하는 역변환과 제2 역변환부(830)가 수행하는 역변환 기법이 상이하면, 제1 역변환부(820)와 제2 역변환부(830)가 수행하는 역변환만으로는 변환전 오디오 신호를 복원하기에 충분한 정보를 생성할 수 없다. 이 경우, 제3 역변환부(840)의 역변환 결과를 더 고려하여 변환전 오디오 신호를 복원할 수 있다.When the inverse transformation performed by the first inverse transforming unit 820 differs from the inverse transforming performed by the second inverse transforming unit 830, only the inverse transform performed by the first inverse transformation unit 820 and the second inverse transformation unit 830, It can not generate sufficient information to restore the audio signal. In this case, the audio signal before conversion can be restored by further considering the inverse conversion result of the third inverse transformation unit 840.

제3 변환부(840)는 시간 영역 변환 또는 주파수 영역 변환된 제1 변환 신호 또는 제2 변환 신호에 대하여 제1 역변환부(820) 및 제2 역변환부(830)의 역변환 기법을 고려하여 시간 영역 역변환 또는 주파수 영역 역변환을 수행한다. 제3 변환부(840)는 시간 영역 역변환 또는 주파수 영역 역변환을 수행하여 제3 역변환 신호를 생성한다. 제3 변환부(840)는 제1 변환 신호가 주파수 영역 변환 되었는지, 시간 영역 변환 되었는지 여부에 따라서 주파수 영역 역변환을 또는 시간 영역 역변환을 수행한다. 여러 가지 경우에 대하여 제1 변환 신호 또는 제2 변환 신호를 추가적으로 역변환하는 다양한 실시예가 도 2 내지 도 6에서 설명된바 있다.The third transforming unit 840 transforms the first transformed signal or the second transformed signal, which is time-domain transformed or frequency-domain transformed, into a time domain transformed by the inverse transformation method of the first inverse transform unit 820 and the second inverse transformed transformed transformed transformed transformed signal 830, Performs inverse transform or frequency domain inverse transform. The third transform unit 840 performs a time domain inverse transform or a frequency domain inverse transform to generate a third inverse transform signal. The third transforming unit 840 performs a frequency domain inverse transform or a time domain inverse transform according to whether the first transformed signal is frequency domain transformed or time domain transformed. Various embodiments for additionally inversely transforming the first converted signal or the second converted signal for various cases have been described with reference to FIGS.

신호 복원부(890)는 제1 역변환 신호 및 제3 역변환 신호에 기반하여 제1 오디오 신호를 복원하거나, 제2 역변환 신호 및 제3 역변환 신호에 기반하여 제2 오디오 신호를 복원할 수 있다.The signal restoring unit 890 may restore the first audio signal based on the first inverse transform signal and the third inverse transform signal or may restore the second audio signal based on the second inverse transform signal and the third inverse transform signal.

본 발명의 일 실시예에 따르면 변환 모드 판단부(810)는 각각의 시간 구간의 오디오 신호가 시간 영역 변환 되었는지, 주파수 영역 변환되었는지 여부를 판단한다. 각각의 역변환부(820, 830, 840, 850)는 변환 모드 판단부(810)의 판단 결과에 따라서 각각의 변환 신호에 대한 역변환을 수행한다.According to an embodiment of the present invention, the conversion mode determination unit 810 determines whether the audio signal of each time interval is time-domain-converted or frequency-domain-converted. Each of the inverse transforming units 820, 830, 840, and 850 performs inverse transform on each transformed signal according to the determination result of the transforming mode determining unit 810.

본 발명의 일 실시예에 따르면 제1 역변환부(820)는 제1 변환 신호 및 제3 변환 신호에 대하여 주파수 영역 역변환을 수행할 수 있다. 또한, 제4 역변환부(850)는 제3 변환 신호 및 제4 변환 신호에 대하여 주파수 영역 역변환을 수행할 수 있다. 제4 역변환부(850)는 주파수 영역 역변환을 수행하여 제4 역변환 신호를 생성한다.According to an embodiment of the present invention, the first inverse transform unit 820 may perform frequency domain inverse transform on the first transform signal and the third transform signal. In addition, the fourth inverse transform unit 850 may perform frequency domain inverse transform on the third transform signal and the fourth transform signal. The fourth inverse transform unit 850 performs a frequency domain inverse transform to generate a fourth inverse transform signal.

참조 신호 생성부(860)는 제4 역변환 신호 및 제1 역변환 신호에 기반하여 제3 시간 구간의 오디오 신호에 대응되는 시간 영역 참조 신호를 생성한다.The reference signal generator 860 generates a time domain reference signal corresponding to the audio signal of the third time interval based on the fourth inverse transform signal and the first inverse transform signal.

신호 예측부(870)는 시간 영역 참조 신호에 기반하여 제1 시간 구간의 오디오 신호에 대한 예측 신호를 생성한다.The signal predicting unit 870 generates a prediction signal for the audio signal of the first time interval based on the time-domain reference signal.

신호 복원부(860)는 제3 역변환 신호와 예측 신호에 기반하여 제1 시간 구간의 오디오 신호를 복원할 수 있다. 제1 시간 구간의 오디오 신호가 선형 예측법(Linear Prediction) 또는 장시간 예측법(Long Term Prediction)등을 이용하여 시간 영역 변환된 경우에, 제3 역변환부(840)가 생성한 제3 역변환 신호는 제1 시간 구간의 오디오 신호와 예측 신호의 차이에 대응된다. 신호 복원부(860)는 제3 역변환 신호와 예측 신호를 더하여 제1 시간 구간의 오디오 신호를 복원할 수 있다. 예측 신호에 기반하여 오디오 신호를 복원하면 변환 전후의 데이터 크가가 작아지므로 변환된 신호를 통신망을 통해 전송하거나 저장매체에 저장하는 경우 매우 유리하다.The signal restoring unit 860 may restore the audio signal of the first time interval based on the third inverse signal and the prediction signal. When the audio signal of the first time interval is time domain transformed using linear prediction or long term prediction, the third inverse transform signal generated by the third inverse transformer 840 is And corresponds to the difference between the audio signal and the prediction signal in the first time interval. The signal restoring unit 860 may restore the audio signal of the first time period by adding the third inverse signal and the prediction signal. If the audio signal is restored based on the prediction signal, the data size before and after the conversion becomes small. Therefore, it is very advantageous to transmit the converted signal through a communication network or to store the signal in a storage medium.

본 발명의 일 실시예에 따르면, 추가적으로 오디오 신호가 변환되는 시간 구간의 길이를 다른 시간 구간의 길이보다 짧을 수 있다. 추가적으로 변환되는 오디오 신호가 적어지므로 좀더 적은 계산을 수행하여 오디오 신호를 역변환할 수 있다. 또한 변환된 오디오 신호가 통신망을 통해 전송되는 경우 적은 대역폭으로도 변환된 오디오 신호를 전송할 수 있고, 변환된 오디오 신호가 저장 매체에 저장되는 경우 저장 공간을 최소화할 수 있다.According to an embodiment of the present invention, the length of the time interval during which the audio signal is further converted may be shorter than the length of the other time intervals. In addition, since the audio signal to be converted is less, the audio signal can be inversely transformed by performing fewer calculations. Also, when the converted audio signal is transmitted through a communication network, the converted audio signal can be transmitted even with a small bandwidth, and the storage space can be minimized when the converted audio signal is stored in the storage medium.

본 발명의 일 실시예에 따른 오디오 신호 복호화기는 시간 구간을 복수의 시간 영역으로 분할하고, 각 시간 영역에 상응하는 변환 신호에 대하여 상이한 역변환을 수행할 수 있다. 예를 들어 제1 시간 구간을 제1 시간 구간 및 제1 시간 구간 이후의 제2 시간 구간으로 분할한다고 가정하자.The audio signal decoder according to an embodiment of the present invention may divide a time interval into a plurality of time regions and perform different inverse transforms on the transformed signals corresponding to the respective time regions. For example, it is assumed that the first time period is divided into a first time period and a second time period after the first time period.

제1 시간 구간의 제1 변환 신호 중에서 제1 시간 영역의 제1 변환 신호는 주파수 영역 변환되고, 제1 역변환부(820)는 제1 시간 영역의 제1 변환 신호에 대하여 주파수 영역 역변환을 수행할 수 있다. 또한 제1 시간 구간의 제1 변환 신호 중에서 제2 시간 영역의 제1 변환 신호는 시간 영역 변환되고, 제3 역변환부(840)는 제2 시간 영역의 제1 변환 신호에 대하여 시간 영역 역변환을 수행할 수 있다.Of the first transformed signals of the first time domain, the first transformed signal of the first time domain is frequency-domain transformed, and the first inverse transformer 820 performs frequency domain inverse transform of the first transformed signal of the first time domain . Also, among the first converted signals in the first time interval, the first converted signal in the second time domain is time domain transformed, and the third inverse transformer 840 performs time domain inverse transform on the first transformed signal in the second time domain can do.

만약 제1 시간 영역의 제1 변환 신호가 도 1의 (b)에서 설명된 직각 시간 윈도우를 이용하여 변환된 신호라면, 신호 복원부(890)는 제1 역변환 신호 만으로 제1 시간 영역의 오디오 신호를 복원할 수 있다. 또한 제3 역변환부(840)는 시간 영역 변환된 제2 시간 영역의 오디오 신호를 역변환한다. 신호 복원부(890)는 시간 영역 역변환된 제2 변환 신호에 기반하여 제2 시간 영역의 오디오 신호를 복원할 수 있다.If the first transformed signal in the first time domain is a transformed signal using the orthogonal time window described in FIG. 1B, the signal reconstructor 890 reconstructs the audio signal in the first time domain using only the first inverse transform signal, Can be restored. The third inverse transform unit 840 inversely transforms the audio signal of the time domain transformed second time domain. The signal restoring unit 890 may restore the audio signal of the second time domain based on the time-domain inversely transformed second transformed signal.

도 9는 본 발명의 일 실시예에 따라서 시간 영역 변환 기법 또는 주파수 영역 변환 기법을 서로 전환하여 오디오 신호를 부호화하는 방법을 단계별로 도시한 순서도이다. 이하 도 9를 참조하여 본 발명의 일 실시예에 따른 오디오 신호 부호화 방법을 상세히 설명하기로 한다.9 is a flowchart illustrating a method of encoding an audio signal by switching a time domain transform technique or a frequency domain transform technique to each other according to an embodiment of the present invention. Hereinafter, an audio signal encoding method according to an embodiment of the present invention will be described in detail with reference to FIG.

단계(S910)에서는 제1 시간 구간의 제1 오디오 신호에 대하여 시간 영역 변환 또는 주파수 영역 변환 중에서 어느 한 변환을 수행하여 제1 변환 신호를 생성한다.In step S910, the first audio signal of the first time interval is subjected to either a time domain transformation or a frequency domain transformation to generate a first transformed signal.

단계(S920)에서는 제2 시간 구간의 제2 오디오 신호에 대하여 시간 영역 변환 또는 주파수 영역 변환 중에서 제1 변환부의 변환과 상이한 변환을 수행한다. 제2 시간 구간의 제1 시간 구간 이후의 시간 구간으로서 제1 시간 구간과 인접한 시간 구간이다.In step S920, the second audio signal of the second time interval performs a different transformation from the time domain transformation or the frequency domain transformation than the transformation of the first transformation unit. A time interval after the first time interval of the second time interval, and a time interval adjacent to the first time interval.

즉, 제1 시간 구간과 제2 시간 구간의 경계에서 오디오 신호에 대한 변환 기법이 전환된다.That is, the conversion scheme for the audio signal is switched at the boundary between the first time period and the second time period.

단계(S930)에서는 단계(S910) 및 단계(S920)에서의 변환 기법에 기반하여 제1 오디오 신호 또는 제2 오디오 신호에 대하여 추가적으로 시간 영역 변환 또는 주파수 영역 변환을 수행하여 제3 변환 신호를 생성한다. 단계(S930)에서 수행되는 변환의 종류와 변환의 대상은 단계(S910) 및 단계(S920)에서의 변환 기법에 따라서 결정된다. 단계(S910) 및 단계(S920)에서의 다양한 변환 기법에 따라서 단계(S930)에서 수행되는 변환의 종류와 대상이 결정되는 것은 도 2내지 도 6에서 상세히 설명되었으므로 이하 설명을 생략하기로 한다.In step S930, a time-domain transform or a frequency-domain transform is additionally performed on the first audio signal or the second audio signal based on the transform technique in steps S910 and S920 to generate a third transform signal . The type of conversion performed in step S930 and the object of the conversion are determined according to the conversion technique in steps S910 and S920. Since the types of transformations and objects to be performed in step S930 are determined in detail in FIGS. 2 to 6 according to various conversion schemes in steps S910 and S920, the description will be omitted.

단계(S910), 단계(S920)에서 변환된 제1 변환 신호 또는 제2 변환 신호는 제3 변환 신호에 기반하여 복원될 수 있다. 제1 변환 신호 및 제3 변환 신호에 기반하여 제1 시간 구간의 제1 오디오 신호가 복원될 수 있다. 또한 제2 변환 신호 및 제3 변환 신호에 기반하여 제2 오디오 신호가 복원될 수 있다.The first conversion signal or the second conversion signal converted in steps S910 and S920 may be restored based on the third conversion signal. The first audio signal of the first time interval may be reconstructed based on the first transform signal and the third transform signal. And the second audio signal may be restored based on the second converted signal and the third converted signal.

도 10은 본 발명의 일 실시예에 따라서 시간 영역 변환 기법 또는 주파수 영역 변환 기법을 서로 전환하여 부호화된 오디오 신호를 복호화하는 방법을 단계별로 도시한 순서도이다. 이하 도 10을 참조하여 본 발명의 일 실시예에 따르면 오디오 신호 복호화 방법을 상세히 설명하기로 한다.10 is a flowchart illustrating a method of decoding an encoded audio signal by switching the time domain transform technique or the frequency domain transform technique according to an embodiment of the present invention. Hereinafter, an audio signal decoding method according to an embodiment of the present invention will be described in detail with reference to FIG.

단계(S1010)에서는 시간 영역 변환 또는 주파수 영역 변환된 제1 시간 구간의 제1 변환 신호에 대하여 시간 영역 역변환 또는 주파수 영역 역변환을 수행하여 제1 역변환 신호를 생성한다.In step S1010, a first inverse transform or a frequency domain inverse transform is performed on the first transform signal in the time domain transform or the frequency domain transformed first time domain to generate a first inverse transform signal.

단계(S1020)에서는 시간 영역 변환 또는 주파수 영역 변환된 제2 시간 구간의 제2 변환 신호에 대하여 시간 영역 역변환 또는 주파수 영역 역변환을 수행하여 제2 역변환 신호를 생성한다. 제2 변환 신호는 제1 변환 신호와 상이한 변환 기법에 의하여 변환되고, 단계(S1020)에서는 단계(S1010)에서의 역변환 기법과 상이한 역변환 기법에 따라서 제2 역변환 신호를 생성한다. 제2 시간 구간은 제1 시간 구간에 인접한 시간 구간으로서, 제1 시간 구간 이후의 시간 구간이다.In operation S1020, a second inverse transform or a frequency domain inverse transform is performed on the second transformed signal in the time domain transform or the frequency domain transformed second time domain to generate a second inverse transform signal. The second transform signal is transformed by a transform technique different from the first transform signal. In step S1020, a second inverse transform signal is generated according to an inverse transform technique different from the inverse transform technique of step S1010. The second time interval is a time interval adjacent to the first time interval, which is a time interval after the first time interval.

단계(S1030)에서는 단계(S1010) 및 단계(S1020)에서의 역변환 기법을 고려하여 제1 변환 신호 또는 제2 변환 신호에 대하여 시간 영역 역변환 또는 주파수 영역 역변환을 수행한다. 단계(S1010) 및 단계(S1020)에서의 역변환 기법에 따라서 단계(S1030)의 역변환 기법 및 역변환 대상이 결정되는 것은 도 2내지 도 6에서의 실시예에서 상세히 설명된바 있다.In step S1030, a time domain inverse transform or a frequency domain inverse transform is performed on the first transform signal or the second transform signal in consideration of the inverse transformation technique in steps S1010 and S1020. The inverse transformation method and the inverse transformation object of the step S1030 according to the inverse transformation technique in the step S1010 and the step S1020 have been described in detail in the embodiment in Figs.

단계(S1040)에서는 제1 역변환 신호 및 제3 역변환 신호에 기반하여 제1 시간 구간의 오디오 신호를 복원하거나, 제2 역변환 신호 및 제3 역변환 신호에 기반하여 제2 시간 구간의 오디오 신호를 복원할 수 있다.In operation S1040, the audio signal of the first time interval is restored based on the first inverse transform signal and the third inverse transform signal, and the audio signal of the second time interval is restored based on the second inverse transform signal and the third inverse transform signal .

도 11은 본 발명의 일 실시예에 따른 오디오 신호 부호화기의 구조를 도시한 블록도이다. 이하 도 11을 참고하여 본 발명의 일 실시예에 따른 오디오 신호 부호화기의 동작을 상세히 설명하기로 한다. 본 발명에 따른 오디오 신호 부호화기(1100)는 변환 기법 결정부(1110), 주파수 영역 변환부(1120), 주파수 영역 부호화부(1130), 시간 영역 변환부(1140), 시간 영역 부호화부(1150), 참조 신호 생성부(1160), 버퍼(1170), 추가 변환부(1180)를 포함한다.11 is a block diagram illustrating a structure of an audio signal encoder according to an embodiment of the present invention. Hereinafter, the operation of the audio signal encoder according to an embodiment of the present invention will be described in detail with reference to FIG. The audio signal encoder 1100 according to the present invention includes a conversion scheme determining unit 1110, a frequency domain transforming unit 1120, a frequency domain encoding unit 1130, a time domain transforming unit 1140, a time domain encoding unit 1150, A reference signal generation unit 1160, a buffer 1170, and an additional conversion unit 1180.

본 발명의 일 실시예에 따른 오디오 신호 부호화기(1100)는 각 시간 구간마다 각 시간 구간에 상응하는 오디오 신호를 수신한다.The audio signal encoder 1100 according to an embodiment of the present invention receives an audio signal corresponding to each time interval for each time interval.

변환 기법 결정부(1110)는 각 시간 구간에 상응하는 오디오 신호에 대하여 변환 기법을 결정한다. 본 발명의 일 실시예에 따르면 입력된 오디오 신호가 음악 신호에 가까운 성질을 가지고 있는 경우에, 변환 기법 결정부(1110)는 입력된 오디오 신호에 대하여 주파수 영역 변환을 수행하기로 결정할 수 있다. 본 발명의 다른 실시예에 따르면 입력된 오디오 신호가 음성 신호에 가까운 성질을 가지고 있는 경우에, 변환 기법 결정부(1110)는 입력된 오디오 신호에 대하여 시간 영역 변환을 수행하기로 결정할 수 있다.The conversion scheme determining unit 1110 determines a conversion scheme for an audio signal corresponding to each time interval. According to an embodiment of the present invention, when the input audio signal has a property close to that of the music signal, the conversion technique determination unit 1110 may determine to perform frequency domain conversion on the input audio signal. According to another embodiment of the present invention, when the input audio signal has a property close to that of the audio signal, the conversion scheme determination unit 1110 may determine to perform time domain conversion on the input audio signal.

변환부(1120)는 변환 기법 결정부(1110)의 결정에 따라서 입력된 각 오디오 신호에 대한 변환을 수행한다. 본 발명의 일 실시에 따르면 입력된 오디오 신호에 대하여 주파수 영역 변환을 수행하기로 결정한 경우에, 변환부(1120)는 겹침 변환 기법을 이용하여 주파수 영역 변환을 수행할 수 있다. 겹침 변환 기법에 따르면 입력된 오디오 신호는 이전 시간 구간의 오디오 신호와 함께 겹침 변환된다.The conversion unit 1120 performs conversion for each input audio signal according to the determination of the conversion scheme determination unit 1110. [ According to one embodiment of the present invention, when it is determined to perform frequency domain transform on the input audio signal, the transforming unit 1120 can perform frequency domain transform using the lapped transform technique. According to the lapped transform technique, the input audio signal is overlapped with the audio signal of the previous time interval.

주파수 영역 부호화부(1130)는 변환부(1120)가 주파수 영역 변환한 오디오 신호를 부호화한다. 본 발명의 일 실시예에 따르면 주파수 영역 부호호부(1130)는 MPEG 등에서 사용되는 AAC 부호화 기법에 따라서 주파수 영역 변환된 오디오 신호를 부호화할 수 있다.The frequency-domain encoding unit 1130 encodes the frequency-domain-converted audio signal by the transforming unit 1120. According to an embodiment of the present invention, the frequency domain codebook unit 1130 can encode an audio signal that is frequency-domain-transformed according to an AAC coding technique used in MPEG and the like.

변환 기법 결정부(1110)가 입력된 오디오 신호에 대하여 시간 영역 변환을 수행하기로 결정한 경우에, 변환부(1120)는 입력된 오디오 신호에 대한 시간 영역 변환을 수행할 수 있다. 본 발명의 일 실시예에 따르면 변환부(1120)는 입력된 오디오 신호를 다운 샘플링(down-sampling)하여 시간 영역 변환을 수행할 수 있다. When the conversion scheme determining unit 1110 decides to perform time domain transform on the input audio signal, the transforming unit 1120 may perform time domain transform on the input audio signal. According to an embodiment of the present invention, the transforming unit 1120 may perform time-domain transform by down-sampling the input audio signal.

시간 영역 부호화부(1140)는 시간 영역 변환된 오디오 신호를 시간 영역 부호화한다. 본 발명의 일 실시예에 따르면, 시간 영역 부호화부(1140)는 AMR(Adaptive MultiRate), EVRC(Enhanced Variable Rate Codec)등의 CELP 기반 시간 영역 부호화 기법에 따라서 시간 영역 부호화를 수행할 수 있다.The time-domain encoding unit 1140 time-domain-encodes the time-domain-converted audio signal. According to an embodiment of the present invention, the time-domain encoding unit 1140 may perform time-domain encoding according to a CELP-based time-domain encoding scheme such as AMR (Adaptive MultiRate) or EVRC (Enhanced Variable Rate Codec).

본 발명에 따른 오디오 신호 부호화기(1100)에 입력되는 오디오 신호는 음성 신호 및 음악 신호를 모두 포함할 수 있다. 특정 오디오 신호가 음성 신호 및 음악 신호를 모두 포함하는 경우에, 본 발명에 따른 오디오 신호 부호화기(1100)는 특정 시간 구간에서는 입력된 오디오 신호를 시간 영역 변환하고, 다른 시간 구간에서는 입력된 오디오 신호를 주파수 영역 변환할 수 있다.The audio signal input to the audio signal encoder 1100 according to the present invention may include both a voice signal and a music signal. In a case where a specific audio signal includes both a voice signal and a music signal, the audio signal encoder 1100 according to the present invention time-domain converts the input audio signal in a specific time period, Frequency domain conversion.

변환 기법 결정부(1110)는 각 시간 구간 별로 입력된 각각의 오디오 신호에 대한 변환 기법을 결정한다. 본 발명의 일 실시예에 따르면 변환 기법 결정부(1110)가 N-1번째 시간 구간에 입력된 오디오 신호에 대해서는 주파수 영역 변환을 수행하기로 결정한 경우에도, N-1번째 시간 구간과 연속하는 N번째 시간 구간의 오디오 신호에 대해서는 시간 영역 변환을 수행하기로 결정할 수 있다.The conversion scheme determining unit 1110 determines a conversion scheme for each audio signal input for each time interval. According to an embodiment of the present invention, even if the conversion scheme determination unit 1110 determines to perform frequency domain transformation on the audio signal input in the (N-1) < th & Th time interval for the audio signal.

이하 N-1번째 시간 구간 및 N-1번째 시간 구간 이전의 오디오 신호에 대해서는 주파수 영역 변환을 수행하고, N번째 시간 구간의 오디오 신호부터는 시간 영역 변환을 수행하는 실시예에 대하여 설명하기로 한다.Hereinafter, an embodiment will be described in which the frequency domain transform is performed on the audio signals before the (N-1) th time interval and the (N-1) th time interval, and the time domain transform is performed after the audio signal of the Nth time interval.

N-1 번째 시간 구간의 오디오 신호가 입력되면, 변환 기법 결정부(1110)는 N-1 번째 시간 구간의 오디오 신호에 대하여 주파수 영역 변환을 수행할 것을 결정하고, 변환부(1120)는 N-2 번째 시간 구간의 오디오 신호와 N-1 번째 시간 구간의 오디오 신호를 주파수 영역 변환한다.When the audio signal of the (N-1) < th > time interval is input, the conversion scheme determination unit 1110 determines to perform frequency domain conversion on the audio signal of the (N-1) The audio signal of the second time interval and the audio signal of the (N-1) < th > time interval are frequency domain transformed.

N번째 시간 구간의 오디오 신호가 입력되면, 변환 기법 결정부(1110)는 N번째 시간 구간의 오디오 신호에 대하여 시간 영역 변환을 수행할 것을 결정한다. N-1번째 시간 구간의 오디오 신호는 N번째 시간 구간의 오디오 신호와 함께 주파수 영역 변환되지 못한다. 따라서 부호화된 오디오 신호를 수신한 오디오 신호 복호화기는 N-1 번째 시간 구간의 오디오 신호를 주파수 영역 역변환하지 못한다.When the audio signal of the Nth time interval is input, the conversion scheme determination unit 1110 determines to perform time domain conversion on the audio signal of the Nth time interval. The audio signal of the (N-1) th time interval can not be frequency-domain-transformed together with the audio signal of the N-th time interval. Therefore, the audio signal decoder receiving the encoded audio signal can not perform frequency domain inverse transform of the audio signal of the (N-1) th time interval.

본 발명의 일 실시예에 따른 오디오 신호 부호화기(1100)는 N-1번째 시간 구간의 오디오 신호를 시간 영역 변환할 수 있다. N-1 번째 시간 구간의 오디오 신호는 주파수 영역 역변환에 의하여 복원되지는 못하지만, 시간 영역 역변환에 의하여 복원될 수 있다.The audio signal encoder 1100 according to an embodiment of the present invention may time-domain convert the audio signal of the (N-1) th time interval. The audio signal of the (N-1) th time interval is not reconstructed by the frequency domain inverse transform, but can be reconstructed by the time domain inverse transform.

본 발명의 일 실시예에 따르면 오디오 신호 부호화기(1100)는 주파수 영역 부호화된 신호에 기반하여 N-1 번째 시간 구간의 오디오 신호를 시간 영역 변환할 수 있다. 참조 신호 생성부(1150)는 주파수 영역 부호화된 N-2 번째 시간 구간의 오디오 신호를 수신한다. 참조 신호 생성부(1150)는 주파수 영역 부호화된 신호에 기반하여 N-2 번째 시간 구간의 시간 영역 참조 신호를 생성한다.According to an embodiment of the present invention, the audio signal encoder 1100 may time-domain convert the audio signal of the (N-1) th time interval based on the frequency-domain encoded signal. The reference signal generator 1150 receives the frequency-domain encoded audio signal of the (N-2) th time interval. The reference signal generator 1150 generates a time-domain reference signal of the (N-2) th time interval based on the frequency-domain encoded signal.

버퍼(1160)는 참조 신호 생성부(1150)가 생성한 N-2 번째 시간 구간의 시간 영역 참조 신호를 저장한다. 저장된 N-2 번째 시간 구간의 시간 영역 참조 신호는 N-1 번째 시간 구간의 오디오 신호를 변환하기 위하여 사용될 수 있다.The buffer 1160 stores the time domain reference signal of the (N-2) th time interval generated by the reference signal generating unit 1150. The time domain reference signal of the stored N-2 < th > time period may be used to convert the audio signal of the (N-1) < th > time period.

시간 영역 신호 생성부(1170)는 시간 영역 참조 신호를 참조하여 N-1 번째 시간 구간의 오디오 신호에 대한 시간 영역 신호를 생성한다.The time-domain signal generator 1170 refers to the time-domain reference signal and generates a time-domain signal for the audio signal of the (N-1) th time interval.

시간 영역 부호화부(1140)는 N-1번째 시간 구간의 오디오 신호를 부호화한다.The time domain encoding unit 1140 encodes the audio signal of the (N-1) th time domain.

입력된 오디오 신호의 특성에 따라서 시간 영역 변환 또는 주파수 영역 변환을 번갈아 가며 수행할 수 있으므로, 단일 코덱을 사용하여 오디오 신호를 부호화하는 경우에도 높은 효율로 오디오 신호를 부호화할 수 있다.The time domain transform or the frequency domain transform can be performed alternately according to the characteristics of the input audio signal. Therefore, even when the audio signal is encoded using a single codec, the audio signal can be encoded with high efficiency.

도 12는 입력 신호에 대한 주파수 영역 역변환 또는 시간 영역 역변환을 수행하는 본 발명의 일 실시예를 단계별로 도시한 순서도 이다. 이하 도 12를 참조하여 본 발명의 일 실시예에 따른 역변환 방법을 상세히 설명하기로 한다.FIG. 12 is a flowchart showing an embodiment of the present invention for performing an inverse frequency domain transform or a time domain inverse transform on an input signal in steps. Hereinafter, an inverse conversion method according to an embodiment of the present invention will be described in detail with reference to FIG.

단계(S1201)에서는 N번째 시간 구간의 변환된 오디오 신호가 입력되면, 변환된 오디오 신호가 주파수 영역 변환되었는지, 시간 영역 변환되었는지 여부를 판단한다.In step S1201, when the converted audio signal of the Nth time interval is inputted, it is determined whether the converted audio signal is frequency-domain-converted or time-domain-converted.

만약 변환된 오디오 신호가 주파수 영역 변환된 경우에는 단계(S1202)에서 주파수 영역 역변환을 수행한다. 본 발명의 일 실시예에 따르면 N번째 시간 구간의 주파수 영역 변환된 오디오 신호는 N-1번째 시간 구간의 주파수 영역 변환된 오디오 신호와 함께 주파수 영역 역변환될 수 있다.If the converted audio signal is frequency-domain transformed, the frequency domain inverse transform is performed in step S1202. According to an embodiment of the present invention, the frequency-domain-transformed audio signal of the N-th time interval may be frequency-domain-inverted together with the frequency-domain-transformed audio signal of the (N-1)

만약 변환된 오디오 신호가 시간 영역 변환된 경우에는 단계(S1203)에서 시간 영역 역변환된다. 본 발명의 일 실시예에 따르면 단계(S1203)에서는 업샘플링을 수행하여 변환된 오디오 신호의 샘플링 레이트(sampling rate)를 변환 하는 것으로 시간 영역 역변환을 수행할 수 있다.If the converted audio signal is time-domain transformed, it is time-domain inversely transformed in step S1203. According to an embodiment of the present invention, in step 1203, it is possible to perform time-domain inverse transform by performing upsampling and converting a sampling rate of the converted audio signal.

도 13은 입력 신호에 대한 주파수 영역 역변환 또는 시간 영역 역변환을 수행하는 본 발명의 다른 실시예를 단계별로 도시한 순서도이다. 이하 도 13을 참조하여 본 발명의 일 실시예에 따른 역변환 방법을 상세히 설명하기로 한다.FIG. 13 is a flowchart showing another embodiment of the present invention for performing an inverse frequency domain transform or a time domain inverse transform on an input signal. Hereinafter, an inverse conversion method according to an embodiment of the present invention will be described in detail with reference to FIG.

N번째 시간 구간의 변환된 오디오 신호가 입력되면, 단계(S1301)에서는 변환된 오디오 신호가 주파수 영역 변환되었는지, 시간 영역 변환되었는지 여부를 판단한다.When the converted audio signal of the Nth time interval is input, it is determined in step S1301 whether the converted audio signal is frequency domain transformed or time domain transformed.

만약 변환된 오디오 신호가 주파수 영역 변환된 경우에는 단계(S1302)에서 주파수 영역 역변환을 수행한다. 본 발명의 일 실시예에 따르면 단계(S1302)에서 수행하는 주파수 영역 역변환은 M개의 샘플을 이용한 주파수 영역 역변환일 수 있다.If the converted audio signal is frequency domain transformed, the frequency domain inverse transform is performed in step S1302. According to an embodiment of the present invention, the frequency domain inverse transform performed in step S1302 may be a frequency domain inverse transform using M samples.

만약 변환된 오디오 신호가 시간 영역 변환된 경우에는 단계(S1303)에서 시간 영역 변환된 오디오 신호에 대한 주파수 영역 변환을 수행한다. 본 발명의 일 실시예에 따르면 단계(S1303)에서는 K개의 샘플을 이용한 주파수 영역 변환을 수행할 수 있다.If the converted audio signal has been time-domain transformed, the frequency domain transform is performed on the time-domain transformed audio signal in step S1303. According to an embodiment of the present invention, in step S1303, frequency domain transform using K samples may be performed.

단계(S1303)에서 주파수 영역 변환된 오디오 신호는 단계(S1302)에서 다시 주파수 영역 역변환된다. 단계(S1302)에서 수행되는 주파수 영역 역변환은 M개의 샘플을 이용한 주파수 영역 역변환이다. M > K인 경우에, 시간 영역 변환된 오디오 신호는 단계(S1303) 및 단계(S1302)를 거치면서 업샘플링 된다. 본 발명의 일 실시예에 따르면 M과 K는 변환 전후 샘플링 레이트에 기반하여 결정될 수 있다.In step S1303, the frequency-domain-converted audio signal is frequency-domain-inverted again in step S1302. The frequency domain inverse transform performed in step S1302 is a frequency domain inverse transform using M samples. If M > K, the time-domain transformed audio signal is upsampled through step S1303 and step S1302. According to an embodiment of the present invention, M and K can be determined based on the sampling rate before and after the conversion.

도 14는 입력 신호에 대한 주파수 영역 변환 또는 시간 영역 변환을 수행하는 본 발명의 일 실시예를 단계별로 도시한 순서도이다. 이하 도 14를 참조하여 본 발명의 일 실시예에 따른 변환 방법을 상세히 설명하기로 한다.FIG. 14 is a flowchart showing steps of an embodiment of the present invention for performing a frequency domain transformation or a time domain transformation on an input signal. Hereinafter, a conversion method according to an embodiment of the present invention will be described in detail with reference to FIG.

N번째 시간 구간의 오디오 신호가 입력되면, 단계(S1410)에서는 입력된 오디오 신호에 대하여 시간 영역 변환할지, 주파수 영역 변환할지 결정한다.When the audio signal of the Nth time interval is input, in step S1410, it is determined whether the input audio signal is time-domain transformed or frequency-domain transformed.

만약 주파수 영역 변환하는 경우에는 단계(S1420)에서 주파수 영역 변환을 수행한다. 본 발명의 일 실시예에 따르면 N번째 시간 구간의 오디오 신호는 N-1번째 시간 구간의 오디오 신호와 함께 겹침 변환될 수 있다.If frequency domain conversion is performed, frequency domain conversion is performed in step S1420. According to an embodiment of the present invention, the audio signal of the N-th time interval may be lapped-transformed together with the audio signal of the (N-1) -th time interval.

만약 시간 영역 변환하는 경우에는 단계(S1430)에서 시간 영역 변환을 수행한다. 본 발명의 일 실시예에 따르면 단계(S1430)에서는 N번째 시간 구간의 오디오 신호에 대한 다운 샘플링을 수행하여 샘플링 레이트를 변환함으로써, 시간 영역 변환을 수행할 수 있다.If the time domain transformation is performed, the time domain transformation is performed in step S1430. According to an embodiment of the present invention, in step S1430, time-domain conversion may be performed by performing down-sampling on the audio signal of the Nth time interval to convert the sampling rate.

도 15는 입력 신호에 대한 주파수 영역 변환 또는 시간 영역 변환을 수행하는 본 발명의 다른 실시예를 단계별로 도시한 순서도이다. 이하 도 15를 참조하여 본 발명의 일 실시예에 따른 변환 방법을 상세히 설명하기로 한다.FIG. 15 is a flowchart showing steps of another embodiment of the present invention for performing frequency domain transformation or time domain transformation on an input signal. Hereinafter, a conversion method according to an embodiment of the present invention will be described in detail with reference to FIG.

N번째 시간 구간의 오디오 신호가 입력되면, 단계(S1510)에서는 오디오 신호에 대한 주파수 영역 변환을 수행한다. 본 발명의 일 실시예에 따르면 단계(S1510)에서 수행하는 주파수 영역 변환은 M개의 샘플을 이용하는 주파수 영역 변환일 수 있다.When the audio signal of the Nth time interval is inputted, the frequency domain transform of the audio signal is performed in step S1510. According to an embodiment of the present invention, the frequency domain transform performed in step S1510 may be a frequency domain transform using M samples.

단계(S1520)에서는 입력된 오디오 신호에 대하여 주파수 영역 변환을 수행할지, 시간 영역 변환을 수행할지 결정한다. 만약 입력된 오디오 신호에 대하여 주파수 영역 변환을 수행하는 경우에는, 단계(S1510)에서 이미 주파수 영역 변환된 오디오 신호가 출력된다.In step S1520, it is determined whether to perform frequency domain transform or time domain transform on the input audio signal. If the frequency domain transform is performed on the input audio signal, the frequency domain transformed audio signal is output in step S1510.

만약 입력된 오디오 신호에 대하여 시간 영역 변환하는 경우에는 주파수 영역 변환된 입력 오디오 신호에 대하여 주파수 영역 역변환을 수행한다. 본 발명의 일 실시예에 따르면 단계(S1530)에서 수행되는 주파수 영역 역변환은 K개의 샘플을 이용하는 주파수 영역 역변환일 수 있다.If the input audio signal is transformed in the time domain, the frequency domain inverse transform is performed on the input audio signal transformed in the frequency domain. According to an embodiment of the present invention, the frequency domain inverse transform performed in step S1530 may be a frequency domain inverse transform using K samples.

입력된 오디오 신호가 시간 영역 변환되는 경우에, M개의 샘플을 이용하여 주파수 영역 변환되고, K개의 샘플을 이용하여 주파수 영역 역변환된다. M > K인 경우에, 입력된 오디오 신호의 샘플링 레이트는 감소된다.When the input audio signal is time-domain transformed, it is frequency-domain transformed using M samples and frequency-domain inversely transformed using K samples. If M > K, the sampling rate of the input audio signal is reduced.

이상과 같이 본 발명은 비록 한정된 실시예와 도면에 의해 설명되었으나, 본 발명은 상기의 실시예에 한정되는 것은 아니며, 본 발명이 속하는 분야에서 통상의 지식을 가진 자라면 이러한 기재로부터 다양한 수정 및 변형이 가능하다.While the invention has been shown and described with reference to certain preferred embodiments thereof, it will be understood by those of ordinary skill in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims. This is possible.

또한 본 발명의 실시예들은 다양한 컴퓨터로 구현되는 동작을 수행하기 위한 프로그램 명령을 포함하는 컴퓨터 판독 가능 매체를 포함한다. 상기 컴퓨터 판독 가능 매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 상기 프로그램 명령은 본 발명을 위하여 특별히 설계되고 구성된 것들이거나 당업자에게 공지되어 사용 가능한 것일 수도 있다. 컴퓨터 판독 가능 기록 매체의 예에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체(magnetic media), CD-ROM, DVD와 같은 광기록 매체(optical media), 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical media), 및 롬(ROM), 램(RAM), 플래시 메모리 등과 같은 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다. 프로그램 명령의 예에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다. 본 발명에서 설명된 오디오 신호 부호화기 및 오디오 신호 복호화기의 동작의 전부 또는 일부가 컴퓨터 프로그램으로 구현된 경우, 상기 컴퓨터 프로그램을 저장한 컴퓨터 판독 가능 기록 매체도 본 발명에 포함된다.Embodiments of the present invention also include a computer readable medium having program instructions for performing various computer implemented operations. The computer-readable medium may include program instructions, data files, data structures, and the like, alone or in combination. The program instructions may be those specially designed and constructed for the present invention or may be available to those skilled in the art. Examples of computer-readable media include magnetic media such as hard disks, floppy disks and magnetic tape; optical media such as CD-ROMs and DVDs; magnetic media such as floppy disks; Magneto-optical media, and hardware devices specifically configured to store and execute program instructions such as ROM, RAM, flash memory, and the like. Examples of program instructions include machine language code such as those produced by a compiler, as well as high-level language code that can be executed by a computer using an interpreter or the like. When all or a part of the operations of the audio signal encoder and the audio signal decoder described in the present invention are implemented by a computer program, the computer readable recording medium storing the computer program is also included in the present invention.

그러므로, 본 발명의 범위는 설명된 실시예에 국한되어 정해져서는 아니 되며, 후술하는 특허청구범위뿐 아니라 이 특허청구범위와 균등한 것들에 의해 정해져야 한다.Therefore, the scope of the present invention should not be limited to the described embodiments, but should be determined by the equivalents of the claims, as well as the claims.

810 ... 변환모드 판단부 820 ... 제1 역변환부
830 ... 제2 역변환부 840 ... 제3 역변환부
850 ... 제4 역변환부 860 ... 참조신호 생성부
870 ... 신호 예측부 890 ... 신호 복원부810 ... conversion mode determination unit 820 ... first inverse conversion unit
830 ... second inverse transform unit 840 ... third inverse transform unit
850 ... fourth inverse transform unit 860 ... reference signal generating unit
870 ... Signal prediction unit 890 ... Signal reconstruction unit

Claims

At least one processor, wherein the processor
The method comprising: determining whether a current frame is coded in a time domain or a frequency domain; decoding the current frame in the time domain if the current frame is coded in the time domain; In decoding the current frame in the frequency domain and decoding the current frame in the time domain or the frequency domain, when domain switching occurs between the current frame and a previous frame of the current frame or a next frame of the current frame And processes the additional data at the boundary of the current frame so that domain switching noise is removed at the boundary of the previous frame and the current frame or at the boundary of the next frame and the current frame. Signal decoding apparatus.

The apparatus of claim 1, wherein the additional data is processed in the time domain or the frequency domain.

The audio signal decoding apparatus of claim 1, wherein the additional data includes a prediction signal based on a decoded previous frame or a decoded next frame.

At least one processor, wherein the processor
Determining whether a current frame is encoded in a time domain or a frequency domain;
Decoding the current frame in the time domain if the current frame is coded in the time domain;
Decoding the current frame in the frequency domain if the current frame is encoded in the frequency domain; And
Wherein when the previous frame of the current frame is decoded in the time domain, the domain switching noise is removed at the boundary between the previous frame and the current frame in decoding the current frame in the frequency domain, And processes the additional data.

5. The apparatus of claim 4, wherein the additional data is processed in the frequency domain.

5. The audio signal decoding apparatus of claim 4, wherein the additional data includes a prediction signal based on a decoded previous frame.

At least one processor, wherein the processor
The method comprising: determining whether a current frame is coded in a time domain or a frequency domain; decoding the current frame in the time domain if the current frame is coded in the time domain; Wherein the decoding unit decodes the current frame in the frequency domain and decodes the current frame in the frequency domain when the next frame of the current frame is decoded in the time domain, And processes the additional data at the boundary of the current frame so that noise is removed.

8. The apparatus of claim 7, wherein the additional data is processed in the frequency domain.

8. The audio signal decoding apparatus of claim 7, wherein the additional data includes a prediction signal based on a decoded previous frame.