KR20070003600A

KR20070003600A - Method and apparatus for encoding and decoding an audio signal

Info

Publication number: KR20070003600A
Application number: KR1020060058606A
Authority: KR
Inventors: 방희석; 임재현; 오현오; 김동수
Original assignee: 엘지전자 주식회사
Priority date: 2005-06-30
Filing date: 2006-06-28
Publication date: 2007-01-05

Abstract

An apparatus and a method for decoding an audio signal, and an apparatus and a method for encoding the audio signal are provided to reduce the calculative amount for obtaining space information by correcting the space information using a weight value. For decoding an audio signal, the audio signal is received(301). Space information included in the received audio signal is corrected by using a weight value(305). It is determined whether information regarding the need for correction of the space information is included in the audio signal(501). The correction of the space information is performed by grouping frequency bands and then applying weight values to respective groups of frequency bands.

Description

Method and apparatus for encoding and decoding an audio signal {Method and apparatus for encoding and decoding an audio signal}

도 1은 본 발명의 일 실시 예에 따른 오디오 신호 인코딩 장치의 다운믹싱부를 나타낸 블럭도이다.1 is a block diagram illustrating a downmixing unit of an audio signal encoding apparatus according to an embodiment of the present invention.

도 2는 본 발명의 다른 실시예에 따른 오디오 신호 디코딩 장치의 다운믹싱부를 나타낸 블럭도이다. 2 is a block diagram illustrating a downmixing unit of an audio signal decoding apparatus according to another embodiment of the present invention.

도 3은 본 발명의 또 다른 실시예에 따른 오디오 신호 디코딩 방법을 나타낸 순서도이다.3 is a flowchart illustrating an audio signal decoding method according to another embodiment of the present invention.

도 4는 본 발명의 또 다른 실시예에 따른 오디오 신호 디코딩 방법을 나타낸 순서도이다.4 is a flowchart illustrating a method of decoding an audio signal according to another embodiment of the present invention.

도 5는 본 발명의 또 다른 실시예에 따른 오디오 신호 디코딩 방법을 나타낸 순서도이다.5 is a flowchart illustrating a method of decoding an audio signal according to another embodiment of the present invention.

도 6은 본 발명의 또 다른 실시예에 따른 오디오 신호 인코딩 방법을 나타낸 순서도이다.6 is a flowchart illustrating an audio signal encoding method according to another embodiment of the present invention.

도 7은 본 발명의 또 다른 실시예에 따른 오디오 신호 디코딩 장치를 나타낸 블럭도이다.7 is a block diagram showing an audio signal decoding apparatus according to another embodiment of the present invention.

도 8은 본 발명의 또 다른 실시예에 따른 오디오 신호 디코딩 장치의 공간 정보 디코딩부를 나타낸 블럭도이다.8 is a block diagram illustrating a spatial information decoding unit of an audio signal decoding apparatus according to another embodiment of the present invention.

도 9는 본 발명의 또 다른 실시예에 따른 오디오 신호 인코딩 장치를 나타낸 블럭도이다.9 is a block diagram showing an audio signal encoding apparatus according to another embodiment of the present invention.

도 10은 본 발명의 또 다른 실시예에 따른 오디오 신호 인코딩 장치의 공간 정보 인코딩부를 나타낸 블럭도이다.10 is a block diagram illustrating a spatial information encoding unit of an audio signal encoding apparatus according to another embodiment of the present invention.

<도면의 주요 부분에 대한 부호의 설명><Explanation of symbols for main parts of the drawings>

705 : 공간 정보 디코딩부 801 : 디코딩부705: spatial information decoding unit 801: decoding unit

803 : 역양자화부 805, 1001 : 가중치 계산부803: inverse quantization unit 805, 1001: weight calculation unit

807, 1003 : 공간 정보 수정부 907 : 공간 정보 인코딩부807 and 1003: spatial information correction unit 907: spatial information encoding unit

1005 : 양자화부 1007 : 인코딩부1005: quantization unit 1007: encoding unit

본 발명은 오디오 신호의 처리에 관한 것으로서, 특히 오디오 신호 인코딩 및 디코딩 방법 및 장치에 관한 것이다.The present invention relates to the processing of audio signals, and more particularly to a method and apparatus for encoding and decoding audio signals.

일반적으로 오디오 신호의 경우, 인코딩 장치에서 다채널 오디오 신호의 채널들 각각을 압축하는 대신에, 오디오 신호를 모노 혹은 스테레오 형태의 다운믹스 신호로 압축하고, 압축된 다운믹스 신호와 공간 정보(spatial information)(또는 부가 정보)를 디코딩 장치로 함께 전송하거나 저장 매체에 저장한다.In general, in the case of an audio signal, instead of compressing each channel of a multichannel audio signal in an encoding apparatus, the audio signal is compressed into a downmix signal in mono or stereo form, and the compressed downmix signal and spatial information (Or additional information) together with the decoding apparatus or stored in the storage medium.

여기서, 공간 정보는 멀티 채널 오디오 신호를 다운믹스할 때 추출되는 것으 로 오디오 신호 디코딩 장치에서 압축된 다운믹스 신호로부터 원래의 다채널 오디오 신호를 복원할 때 사용된다. 공간 정보로는 CLD(Channel Level Differences), ICC(Interchannel Correlations), CPC(Channel Prediction Coefficients) 등이 있다. CLD는 오디오 신호들 사이의 에너지 차이를 나타내고, ICC는 오디오 신호들 간의 긴밀성 내지는 유사성을 나타낸다. 또한, CPC는 다른 신호를 이용하여 오디오 신호를 예상하는 계수를 나타낸다. 오디오 신호 인코딩 장치의 다운믹싱부는 멀티 채널들을 다운믹싱하면서 공간 정보를 추출한다. 즉, 오디오 신호 인코딩 장치의 다운믹싱부에 포함된 복수의 OTT BOX 또는 TTT BOX는 입력되는 멀티 채널을 이용하여 다운믹스 신호를 추출하는데, 입력되는 멀티 채널이 복수의 OTT BOX 또는 TTT BOX를 거치는 단계에서 각각의 BOX 별로 공간 정보가 추출된다. 이때 각각의 단계에서 추출되는 공간 정보들 사이에는 단계별 가중치가 존재한다. 따라서, 오디오 신호 디코딩 장치가 멀티 채널을 복원할 때 이용하는 공간 정보는 오디오 신호 인코딩 장치에서 멀티 채널을 다운믹싱할 때 단계별로 존재하는 가중치가 고려된 값이다. 오디오 신호 디코딩 장치는 다운믹스 신호의 주파수 밴드별로 가중치가 부여된 공간 정보를 각각 적용하여 다운믹스 신호를 멀티 채널로 복원한다.Here, the spatial information is extracted when downmixing the multichannel audio signal, and is used when the audio signal decoding apparatus restores the original multichannel audio signal from the compressed downmix signal. The spatial information includes channel level differences (CLD), interchannel correlations (ICC), channel prediction coefficients (CPC), and the like. CLD represents energy difference between audio signals, and ICC represents tightness or similarity between audio signals. In addition, CPC represents a coefficient for predicting an audio signal using another signal. The downmixing unit of the audio signal encoding apparatus extracts spatial information while downmixing the multichannels. That is, the plurality of OTT BOX or TTT BOX included in the downmixing unit of the audio signal encoding apparatus extracts the downmix signal using the input multichannel, and the input multichannel passes through the plurality of OTT BOX or TTT BOX. Spatial information is extracted for each box in. At this time, there is a step weight among the spatial information extracted in each step. Therefore, the spatial information used when the audio signal decoding apparatus restores the multi-channel is a value in consideration of weights that exist in stages when downmixing the multi-channel in the audio signal encoding apparatus. The audio signal decoding apparatus restores the downmix signal to the multi-channel by applying spatial information weighted for each frequency band of the downmix signal.

본 발명이 이루고자 하는 기술적 과제는, 공간 정보를 가중치를 이용하여 수정함으로써 공간 정보를 구할 때 요구되는 계산량이 감소하는 오디오 신호 인코딩 및 디코딩 방법 및 장치를 제공하는 데 있다.SUMMARY OF THE INVENTION The present invention has been made in an effort to provide an audio signal encoding and decoding method and apparatus for reducing the amount of computation required to obtain spatial information by modifying the spatial information using weights.

상기 과제를 이루기 위한 본 발명에 의한 오디오 신호 디코딩 방법은, 오디오 신호를 수신하는 단계 및 상기 수신한 오디오 신호에 포함되어 있는 공간 정보를 가중치를 이용하여 수정하는 단계를 포함하는 것이 바람직하다.The audio signal decoding method according to the present invention for achieving the above object preferably comprises the steps of receiving an audio signal and modifying the spatial information contained in the received audio signal by using a weight.

또한, 본 발명에 의한 오디오 신호 인코딩 방법은 멀티 채널을 다운믹싱하여 다운믹스 신호와 공간 정보를 생성하는 단계 및 상기 생성된 공간 정보를 가중치를 이용하여 수정하는 단계를 포함하는 것이 바람직하다.In addition, the audio signal encoding method according to the present invention preferably comprises downmixing the multi-channel to generate a downmix signal and spatial information and modifying the generated spatial information by using a weight.

또한, 본 발명에 의한 오디오 신호 디코딩 장치는 오디오 신호를 다운믹스 신호와 공간 정보로 분리하는 역다중화부 및 상기 역다중화부에서 분리된 공간 정보를 가중치를 이용하여 수정하는 공간 정보 디코딩부를 포함하는 것이 바람직하다.In addition, the audio signal decoding apparatus according to the present invention includes a demultiplexer for separating an audio signal into a downmix signal and spatial information, and a spatial information decoder for modifying the spatial information separated by the demultiplexer using a weight. desirable.

또한, 본 발명에 의한 오디오 신호 인코딩 장치는 멀티 채널을 다운믹싱하는 단계에서 공간 정보를 추출하는 공간 정보 추출부 및 상기 추출된 공간 정보를 가중치를 이용하여 수정하는 공간 정보 인코딩부를 포함하는 것이 바람직하다.In addition, the audio signal encoding apparatus according to the present invention preferably includes a spatial information extracting unit for extracting spatial information in the step of downmixing the multi-channel and a spatial information encoding unit for correcting the extracted spatial information by using a weight. .

이어서, 첨부한 도면들을 참조하여 본 발명의 바람직한 실시 예를 상세히 설명하기로 한다.Next, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings.

도 1은 본 발명의 일 실시 예에 따른 오디오 신호 인코딩 장치의 다운믹싱부를 나타낸 블럭도이다. 도 1에 따른 오디오 신호 인코딩 장치의 다운믹싱부는 제1 다운믹싱부(101)부터 제5 다운믹싱부(109)를 포함한다. 본 도면에서는 5.1 채널이 오디오 신호 인코딩 장치로 인가된다. 일반적으로 5.1 채널의 경우, 앞 중앙( FC:Front Center) 채널, 앞 왼쪽(FL:Front Left channel) 채널, 앞 오른 쪽(FR:Front Right channel) 채널, 배후 왼쪽(SL:Surround left channel) 채널, 배후 오른쪽(SR:Surround right channel) 채널 및 우퍼(LFE:Low Frequency Enhancement) 채널로 구성된다. 입력되는 멀티 채널들은 제1 다운믹싱부(101)내지 제3 다운믹싱부(105)에 인가된다. 1 is a block diagram illustrating a downmixing unit of an audio signal encoding apparatus according to an embodiment of the present invention. The downmixing unit of the audio signal encoding apparatus of FIG. 1 includes a first downmixing unit 101 to a fifth downmixing unit 109. In this figure, 5.1 channel is applied to the audio signal encoding apparatus. In general, for 5.1 channels, the front center (FC) channel, the front left channel (FL), the front right channel (FR), and the rear left channel (SL) It consists of a surround right channel (SR) channel and a low frequency enhancement (LFE) channel. The input multichannels are applied to the first downmixer 101 to the third downmixer 105.

제1 다운믹싱부(101)내지 제3 다운믹싱부(105)는 OTT(One-To-Two) BOX를 이용하여 두 개의 입력 채널들을 하나의 채널로 다운믹싱한다. 오디오 신호 인코딩 장치는 멀티 채널 오디오 신호를 다운믹스할 때 두 개의 채널을 하나의 채널로 또는 세 개의 채널을 두 개의 채널로 만들기 위해 OTT(One-To-Two) BOX 또는 TTT(Two-To-Three) BOX를 사용한다. The first downmixer 101 to the third downmixer 105 downmix two input channels into one channel by using a one-to-two box. When downmixing multichannel audio signals, the audio signal encoding device uses one-to-two box or two-to-three to make two channels into one channel or three channels into two channels. ) Use a BOX.

OTT BOX 또는 TTT BOX는 오디오 신호 디코딩 장치의 업믹싱부에 포함되어 다운믹스 신호와 공간 정보를 이용하여 원래의 멀티 채널을 복원할 때 사용되는 개념적인 BOX이다. 즉, 오디오 신호 인코딩 장치로부터 수신한 오디오 신호는 역다중화부에서 인코딩된 다운믹스 신호와 인코딩된 공간정보로 파싱되어 각각 복호화된 후 멀티 채널 생성부로 보내지는데, 멀티 채널 생성부는 복호화된 다운믹스 신호와 공간 정보를 이용하여 원래의 멀티 채널을 복원할 때 OTT BOX 또는 TTT BOX를 사용하여 하나의 입력 채널을 두 개의 채널로 또는 두 개의 입력 채널을 세 개의 채널로 출력한다. 오디오 신호 디코딩 장치의 업믹싱부에 OTT BOX 또는 TTT BOX가 사용되는 것과 대응하여 OTT BOX 또는 TTT BOX는 오디오 신호 인코딩 장치의 다운믹싱부에 포함되어 입력되는 멀티 채널을 하나 또는 두 개의 다운믹스 신호로 출력하는데 사용된다. 즉, 각각은 두 개의 입력 채널을 하나의 채널로 또는 세 개의 입력 채널 을 두 개의 채널로 출력하는데 사용된다. 이하 OTT BOX 또는 TTT BOX가 오디오 신호 인코딩 장치에서 사용될 때에는 제 몇 다운믹싱부라 부르고 오디오 신호 디코딩 장치에서 사용될 때에는 제 몇 업믹싱부라 부르기로 한다. 입력 채널들이 각각의 다운믹싱부를 거칠 때 입력 채널들 사이의 관계를 나타내는 공간 정보(113)가 추출된다. 제1 다운믹싱부(101)와 제2 다운믹싱부(103)에서 나온 신호들은 제4 다운믹싱부(107)로 들어간다. 그리고 제4 다운믹싱부(107)와 제3 다운믹싱부(105)에서 나온 신호는 제5 다운믹싱부(109)로 들어간다. 이때 제1 다운믹싱부(103), 제2 다운믹싱부(103)와 제4 다운믹싱부(107) 사이에는 공간 정보(113)들 간에 단계별 가중치가 존재한다. 그리고 제4 다운믹싱부(107)와 제5 다운믹싱부(109) 사이에도 공간 정보(113)들 간에 단계별 가중치가 존재한다. 즉, 각각의 다운믹싱부를 거칠 때 공간 정보(113)가 추출되는데 추출되는 공간 정보(113)들 간에는 단계별 가중치가 존재한다.The OTT BOX or TTT BOX is a conceptual box that is included in the upmixing unit of the audio signal decoding apparatus and used when reconstructing the original multichannel by using downmix signals and spatial information. That is, the audio signal received from the audio signal encoding apparatus is parsed into the downmix signal encoded by the demultiplexer and the encoded spatial information and decoded, respectively, and then decoded and sent to the multichannel generator. When restoring the original multi-channel using spatial information, one input channel is output to two channels or two input channels are output to three channels using OTT BOX or TTT BOX. Corresponding to the use of the OTT BOX or TTT BOX in the upmixing section of the audio signal decoding apparatus, the OTT BOX or TTT BOX is included in the downmixing section of the audio signal encoding apparatus and inputs multiple channels into one or two downmix signals. Used to output That is, each is used to output two input channels to one channel or three input channels to two channels. Hereinafter, when the OTT BOX or the TTT BOX is used in the audio signal encoding apparatus, a few downmixing units are referred to, and when used in the audio signal decoding apparatus, a few upmixing units will be referred to. When the input channels pass through each downmixing unit, spatial information 113 representing the relationship between the input channels is extracted. The signals from the first downmixer 101 and the second downmixer 103 enter the fourth downmixer 107. The signal from the fourth downmixing unit 107 and the third downmixing unit 105 enters the fifth downmixing unit 109. In this case, there is a step weight between the space information 113 between the first downmixing unit 103, the second downmixing unit 103, and the fourth downmixing unit 107. In addition, there is a step weight among the spatial information 113 between the fourth downmixer 107 and the fifth downmixer 109. That is, the spatial information 113 is extracted when the downmixing unit passes through each of the downmixing units, and there are stepwise weights between the extracted spatial information 113.

오디오 신호 인코딩 장치는 각 OTT BOX 간에 존재하는 단계별 가중치를 고려하여 공간 정보(113)를 생성하고 가중치가 고려된 공간 정보(113)를 다운믹스 신호(111)와 함께 비트 스트림 형태로 오디오 신호 디코딩 장치로 전송한다. 오디오 신호 디코딩 장치는 가중치가 고려된 공간 정보(113)를 이용하여 다운믹스 신호(111)를 멀티 채널로 복원한다. 또는 오디오 신호 인코딩 장치는 가중치를 고려하지 않은 공간 정보(113)를 다운믹스 신호(111)와 함께 오디오 신호 디코딩 당치로 전송할 수도 있다. 이 경우, 오디오 신호 디코딩 장치는 다운믹싱부들 사이에 존재하는 가중치를 계산하여 이를 공간 정보(113)에 반영하고 가중치가 고려된 공 간 정보(113)를 이용하여 다운믹스 신호(111)를 멀티 채널로 복원한다.The audio signal encoding apparatus generates spatial information 113 in consideration of stepwise weights existing between each OTT box, and decodes the spatial information 113 in consideration of the weighted spatial information 113 in the form of a bit stream along with the downmix signal 111. To send. The audio signal decoding apparatus restores the downmix signal 111 to the multi-channel by using the spatial information 113 in consideration of the weight. Alternatively, the audio signal encoding apparatus may transmit the spatial information 113 without considering the weight together with the downmix signal 111 to the audio signal decoding value. In this case, the audio signal decoding apparatus calculates the weights existing between the downmixing units, reflects the weights in the spatial information 113, and multi-channels the downmix signal 111 using the space information 113 in consideration of the weights. Restore to.

도 2는 본 발명의 다른 실시예에 따른 오디오 신호 디코딩 장치의 업믹싱부를 나타낸 블럭도이다. 도 2에 따른 오디오 신호 디코딩 장치의 업믹싱부는 제1 업믹싱부(209)부터 제5 업믹싱부(201)를 포함한다. 오디오 신호 디코딩 장치는 오디오 신호를 수신하여 다운믹스 신호(111)와 공간 정보(113)로 역다중화하고 이들 각각을 복호화한다. 복호화된 다운믹스 신호는 복호화된 공간 정보(113)를 이용하여 멀티 채널을 생성한다.2 is a block diagram illustrating an upmixing unit of an audio signal decoding apparatus according to another embodiment of the present invention. The upmixing unit of the audio signal decoding apparatus according to FIG. 2 includes the first upmixing unit 209 to the fifth upmixing unit 201. The audio signal decoding apparatus receives the audio signal, demultiplexes the downmix signal 111 and the spatial information 113, and decodes each of them. The decoded downmix signal generates a multi-channel using the decoded spatial information 113.

다운믹스 신호(111)는 제5 업믹싱부(201)로 인가되고, 제5 업믹싱부(201)는 OTT(One-To-Two) BOX를 이용하여 입력 신호를 업믹싱한다. 제5 업믹싱부(201)는 입력 신호를 두 신호로 분리하여 제4 업믹싱부(203)와 제3 업믹싱부(205)로 인가한다. 제4 업믹싱부(203)로 인가되는 신호는 제1 업믹싱부(209)와 제2 업믹싱부(207)로 분리되어 인가된다. 제1 업믹싱부(209) 내지 제5 업믹싱부(201)에는 각각 공간 정보(113)가 인가된다. 즉, 업믹싱부에는 입력 신호가 각각의 업믹싱부를 거칠 때 분리할 신호들 사이의 관계를 나타내는 공간 정보(113)가 인가된다. 제1 업믹싱부(209) 내지 제5 업믹싱부(205)를 거친 신호들은 공간 정보(113)를 이용하여 5.1 채널로 복원된다.The downmix signal 111 is applied to the fifth upmixing unit 201, and the fifth upmixing unit 201 upmixes the input signal using the one-to-two box. The fifth upmixer 201 divides the input signal into two signals and applies the input signal to the fourth upmixer 203 and the third upmixer 205. The signal applied to the fourth upmixing unit 203 is separately applied to the first upmixing unit 209 and the second upmixing unit 207. Spatial information 113 is applied to each of the first upmixing unit 209 to the fifth upmixing unit 201. That is, the spatial information 113 indicating the relationship between the signals to be separated when the input signal passes through each upmixing unit is applied to the upmixing unit. The signals passed through the first upmixer 209 to the fifth upmixer 205 are restored to 5.1 channels using the spatial information 113.

이 때, 제5 업믹싱부(201)와 제4 업믹싱부(203)로 인가되는 공간 정보(113) 사이에는 단계별 가중치가 존재한다. 예를 들어 제4 업믹싱부(203)가 front 신호와 center 신호를 분리하여 각각의 신호를 제1 업믹싱부(209)와 제2 업믹싱부(207)로 인가하는 경우, 제4 업믹싱부(203)는 front 신호와 center 신호들 사이의 관계를 나타내는 공간 정보(113)를 필요로한다. 이 때, front 신호와 center 신호들 사이의 관계를 나타내는 공간 정보(113)는 오디오 신호 인코딩 장치에서 전송하는 공간 정보(113)에 각 단계별로 존재하는 가중치를 고려한 값이다. 즉, 제4 업믹싱부(203)로 인가되는 다운믹스 신호는 제5 업믹싱부(201)를 거쳐 인가되므로, 제5 업믹싱부(201)에 인가되는 공간 정보(113)가 고려되어야한다. 이와 유사하게 제1 업믹싱부(209) 또는 제2 업믹싱부(207)에 인가되는 신호는 제5 업믹싱부(201)와 제4 업믹싱부(203)를 거쳐 인가되므로 제1 업믹싱부(209) 또는 제2 업믹싱부(207)에 인가되는 공간 정보(113)는 제5 업믹싱부(201)와 제4 업믹싱부(203)에 인가된 공간 정보(113)가 고려된 값이어야 한다. 또, 제3 업믹싱부(205)에 인가되는 신호는 제5 업믹싱부(201)를 거쳐 인가되므로 제3 업믹싱부(205)에 인가되는 공간 정보(113) 또한 제5 업믹싱부(201)에 인가된 공간 정보(113)가 고려된 값이어야 한다.At this time, there is a step weight between the fifth upmixer 201 and the spatial information 113 applied to the fourth upmixer 203. For example, when the fourth upmixing unit 203 separates the front signal and the center signal and applies each signal to the first upmixing unit 209 and the second upmixing unit 207, the fourth upmixing unit 203. The unit 203 needs spatial information 113 representing a relationship between the front signal and the center signals. In this case, the spatial information 113 representing the relationship between the front signal and the center signal is a value considering weights present at each step in the spatial information 113 transmitted from the audio signal encoding apparatus. That is, since the downmix signal applied to the fourth upmixer 203 is applied through the fifth upmixer 201, the spatial information 113 applied to the fifth upmixer 201 should be considered. . Similarly, since the signal applied to the first upmixing unit 209 or the second upmixing unit 207 is applied through the fifth upmixing unit 201 and the fourth upmixing unit 203, the first upmixing unit is applied. The spatial information 113 applied to the unit 209 or the second upmixing unit 207 may include the spatial information 113 applied to the fifth upmixing unit 201 and the fourth upmixing unit 203. It must be a value. In addition, since the signal applied to the third upmixing unit 205 is applied through the fifth upmixing unit 201, the spatial information 113 applied to the third upmixing unit 205 is also applied to the fifth upmixing unit ( The spatial information 113 applied to 201 should be a value considered.

앞에서 언급한 바와 같이 오디오 신호 인코딩 장치가 다운믹싱부 사이에 존재하는 단계별 가중치를 고려한 공간 정보(113)를 다운믹스 신호(111)와 함께 오디오 신호 디코딩 장치로 전송한 경우에는 오디오 신호 디코딩 장치는 오디오 신호 인코딩 장치가 전송한 공간 정보(113)를 이용하여 멀티 채널을 복원한다.As mentioned above, when the audio signal encoding apparatus transmits the spatial information 113 considering the stepwise weights present between the downmixing units together with the downmix signal 111 to the audio signal decoding apparatus, the audio signal decoding apparatus performs the audio. The multi-channel is restored using the spatial information 113 transmitted by the signal encoding apparatus.

도 3은 본 발명의 또 다른 실시예에 따른 오디오 신호 디코딩 방법을 나타낸 순서도이다. 오디오 신호 디코딩 장치는 오디오 신호 인코딩 장치가 전송하는 오디오 신호를 수신한다(단계 301). 오디오 신호 디코딩 장치는 수신한 오디오 신호를 다운믹스 신호(111)와 공간 정보(113)로 역다중화하고 각각을 디코딩한다.3 is a flowchart illustrating an audio signal decoding method according to another embodiment of the present invention. The audio signal decoding apparatus receives an audio signal transmitted by the audio signal encoding apparatus (step 301). The audio signal decoding apparatus demultiplexes the received audio signal into the downmix signal 111 and the spatial information 113 and decodes each of them.

오디오 신호 디코딩 장치는 업믹싱부 단계별로 존재하는 공간 정보(113)들 사이의 관련성을 이용하여 가중치를 계산한다(단계 303). 오디오 신호 디코딩 장치는 가중치를 이용하여 오디오 신호 인코딩 장치로부터 수신한 공간 정보(113)를 수정한다(단계 305). 공간 정보(113)를 수정하는 방법으로는 다운믹스 신호(111)의 주파수 밴드를 그룹으로 묶어서 묶은 그룹에는 동일한 가중치를 적용하거나, 또는 다운믹스 신호(111)의 주파수 밴드 중 정해진 주파수 밴드에 대해서만 가중치를 적용하고 다른 주파수 밴드에는 가중치를 적용하지 않는 방법 등을 이용할 수 있다. 공간 정보(113)는 앞에서 언급한 바와 같이 CLD, ICC, CPC 등이 있는데 이 중에서 특히 CLD는 오디오 신호 인코딩 장치 또는 오디오 신호 디코딩 장치의 다운믹싱부 또는 멀티 채널 생성부에 포함되어 있는 OTT BOX 사이에 존재하는 가중치의 영향을 많이 받는다. 그러나 반드시 본 발명에서 사용하는 공간 정보가 CLD에 한정되는 것은 아니다.The audio signal decoding apparatus calculates a weight using the relation between the spatial information 113 existing in each stage of the upmixing unit (step 303). The audio signal decoding apparatus modifies the spatial information 113 received from the audio signal encoding apparatus by using the weight (step 305). As a method of modifying the spatial information 113, the same weight is applied to a group in which the frequency bands of the downmix signal 111 are grouped and grouped or weighted only for a predetermined frequency band among the frequency bands of the downmix signal 111. May be applied, and no weight is applied to other frequency bands. As mentioned above, the spatial information 113 includes CLD, ICC, CPC, etc. Among them, CLD is between the OTT BOX included in the downmixing unit or the multi-channel generating unit of the audio signal encoding apparatus or the audio signal decoding apparatus. It is heavily influenced by the weights that exist. However, the spatial information used in the present invention is not necessarily limited to the CLD.

도 4는 본 발명의 또 다른 실시예에 따른 오디오 신호 디코딩 방법을 나타낸 순서도이다. 오디오 신호 디코딩 장치는 오디오 신호 인코딩 장치가 전송하는 오디오 신호를 수신한다(단계 301). 오디오 신호 디코딩 장치는 수신한 오디오 신호를 다운믹스 신호(111)와 공간 정보(113)로 역다중화하고 각각을 디코딩한다. 오디오 신호 디코딩 장치는 공간 정보(113)에 존재하는 업믹싱부 단계별 가중치를 수정한다(단계 401). 앞에서도 언급한 바와 같이 공간 정보(113)에는 각각의 다운믹싱부 및 업믹싱부들 사이에 단계별 가중치가 존재한다. 이 때 단계별 가중치를 그대로 공간 정보(113)에 적용할 수도 있지만 단계별 가중치를 수정하여 수정된 가중치를 공간 정보(113)에 적용할 수도 있다. 가중치를 수정하기 위해서는 가중치를 계산하 는 함수값에 포함되어 있는 주파수 밴드나 타임 슬롯 등의 변수 값을 바꾸는 방법 등이 있다. 오디오 신호 디코딩 장치는 수정된 가중치를 이용하여 공간 정보(113)를 수정한다(단계 403). 4 is a flowchart illustrating a method of decoding an audio signal according to another embodiment of the present invention. The audio signal decoding apparatus receives an audio signal transmitted by the audio signal encoding apparatus (step 301). The audio signal decoding apparatus demultiplexes the received audio signal into the downmix signal 111 and the spatial information 113 and decodes each of them. The audio signal decoding apparatus modifies the upmixer step weights present in the spatial information 113 (step 401). As mentioned above, the spatial information 113 has stepwise weights between the respective downmixing units and the upmixing units. In this case, the step weights may be applied to the spatial information 113 as it is, but the modified weights may be applied to the spatial information 113 by modifying the step weights. In order to modify the weight, there is a method of changing a variable value such as frequency band or time slot included in a function value for calculating the weight. The audio signal decoding apparatus modifies the spatial information 113 using the modified weight (step 403).

도 5는 본 발명의 또 다른 실시예에 따른 오디오 신호 디코딩 방법을 나타낸 순서도이다. 오디오 신호 디코딩 장치는 오디오 신호 인코딩 장치가 전송하는 오디오 신호를 수신한다(단계 301). 오디오 신호 디코딩 장치는 수신한 오디오 신호를 다운믹스 신호(111)와 공간 정보(113)로 역다중화하고 각각을 디코딩한다. 오디오 신호 디코딩 장치는 업믹싱부 단계별로 존재하는 공간 정보(113)들 사이의 관련성을 이용하여 가중치를 계산한다(단계 303). 오디오 신호 디코딩 장치는 멀티 채널을 복원하는데 사용되는 공간 정보(113)를 수정할 필요가 있는지를 판단한다(단계 501). 오디오 신호 디코딩 장치는 공간 정보(113)를 수정할 필요가 있는지를 판단하기 위해 오디오 신호에 포함되어 있는 식별자를 이용할 수 있다. 즉, 오디오 신호 인코딩 장치는 오디오 신호 디코딩 장치로 전송하는 오디오 신호에 포함되어 있는 공간 정보(113)가 멀티 채널을 생성하기 위해서 수정될 필요성이 있는 경우에는 공간 정보(113)에 가해지는 단계별 가중치를 구하는 함수를 수정하거나 또는 공간 정보(113)를 구하는 함수를 수정해야 함을 표시하는 식별자를 전송한다. 오디오 신호 디코딩 장치는 가중치나 공간 정보(113)를 수정할 필요성이 있는 경우에는 공간 정보(113)를 수정한다(단계 305). 5 is a flowchart illustrating a method of decoding an audio signal according to another embodiment of the present invention. The audio signal decoding apparatus receives an audio signal transmitted by the audio signal encoding apparatus (step 301). The audio signal decoding apparatus demultiplexes the received audio signal into the downmix signal 111 and the spatial information 113 and decodes each of them. The audio signal decoding apparatus calculates a weight using the relation between the spatial information 113 existing in each stage of the upmixing unit (step 303). The audio signal decoding apparatus determines whether it is necessary to correct the spatial information 113 used to recover the multi-channel (step 501). The audio signal decoding apparatus may use an identifier included in the audio signal to determine whether the spatial information 113 needs to be corrected. That is, when the spatial information 113 included in the audio signal transmitted to the audio signal decoding apparatus needs to be modified in order to generate a multi-channel, the audio signal encoding apparatus applies a stepwise weight to the spatial information 113. The identifier indicating that the function to obtain or the spatial information 113 is to be modified is transmitted. The audio signal decoding apparatus modifies the spatial information 113 when it is necessary to correct the weight or the spatial information 113 (step 305).

도 6은 본 발명의 또 다른 실시예에 따른 오디오 신호 인코딩 방법을 나타낸 순서도이다. 오디오 신호 인코딩 장치는 오디오 신호 인코딩 장치로 입력되는 멀티 채널을 다운믹싱한다(단계 601). 오디오 신호 인코딩 장치는 멀티 채널을 다운믹싱하는 단계에서 공간 정보(113)를 추출하고 추출한 공간 정보(113) 사이에 존재하는 가중치를 계산한다(단계 603). 오디오 신호 인코딩 장치는 계산한 가중치를 이용하여 공간 정보(113)를 수정하고(단계 605), 수정된 공간 정보(113)를 다운믹스 신호(111)와 함께 비트 스트림 형태로 오디오 신호 디코딩 장치로 전송한다. 오디오 신호 디코딩 장치는 수신한 공간 정보(113)와 다운믹스 신호(111)를 이용하여 멀티 채널을 복원한다.6 is a flowchart illustrating an audio signal encoding method according to another embodiment of the present invention. The audio signal encoding apparatus downmixes the multi-channels input to the audio signal encoding apparatus (step 601). The audio signal encoding apparatus extracts the spatial information 113 in the step of downmixing the multi-channels and calculates a weight existing between the extracted spatial information 113 (step 603). The audio signal encoding apparatus corrects the spatial information 113 using the calculated weights (step 605), and transmits the modified spatial information 113 together with the downmix signal 111 to the audio signal decoding apparatus in the form of a bit stream. do. The audio signal decoding apparatus restores the multi-channel by using the received spatial information 113 and the downmix signal 111.

도 7은 본 발명의 또 다른 실시예에 따른 오디오 신호 디코딩 장치를 나타낸 블럭도이다. 도 7을 참조하면, 오디오 신호 디코딩 장치는 역다중화부(701), 코어 디코딩부(703), 공간 정보 디코딩부(705) 및 멀티 채널 생성부(707)를 포함한다. 7 is a block diagram showing an audio signal decoding apparatus according to another embodiment of the present invention. Referring to FIG. 7, an audio signal decoding apparatus includes a demultiplexer 701, a core decoder 703, a spatial information decoder 705, and a multi-channel generator 707.

오디오 신호 인코딩 장치가 오디오 신호를 오디오 신호 디코딩 장치로 전송하면, 오디오 신호 디코딩 장치의 역다중화부(701)는 수신한 오디오 신호를 다운믹스 신호와 공간 정보(113)로 분리하여 각각 코어 디코딩부(703)와 공간 정보 디코딩부(705)로 보낸다. 코어 디코딩부(703)는 다운믹스 신호를 디코딩하여 복호화된 다운믹스 신호를 멀티 채널 생성부(707)로 보내고 공간 정보 디코딩부(705)는 공간 정보(113)를 디코딩하여 복호화된 공간 정보(113)를 가중치를 이용하여 수정한 후, 수정된 공간 정보(113)를 멀티 채널 생성부(707)로 보낸다. 멀티 채널 생성부(707)는 복호화된 다운믹스 신호와 수정된 공간 정보(113)를 이용하여 멀티 채널을 복원한다. When the audio signal encoding apparatus transmits the audio signal to the audio signal decoding apparatus, the demultiplexer 701 of the audio signal decoding apparatus separates the received audio signal into the downmix signal and the spatial information 113 and respectively the core decoding unit ( 703 and the spatial information decoding unit 705. The core decoding unit 703 decodes the downmix signal and sends the decoded downmix signal to the multi-channel generating unit 707, and the spatial information decoding unit 705 decodes the spatial information 113 to decode the spatial information 113. ) Is modified using weights, and then the modified spatial information 113 is sent to the multi-channel generator 707. The multi-channel generator 707 restores the multi-channel by using the decoded downmix signal and the modified spatial information 113.

도 8은 본 발명의 또 다른 실시예에 따른 오디오 신호 디코딩 장치의 공간 정보 디코딩부를 나타낸 블럭도이다. 도 8을 참조하면, 공간 정보 디코딩부(705)는 디코딩부(801), 역양자화부(803), 가중치 계산부(805) 및 공간 정보 수정부(807)를 포함한다. 공간 정보 디코딩부(705)의 디코딩부(801)는 역다중화부(701)가 오디오 신호에서 분리한 공간 정보(113)를 입력받아 이를 복호화한다. 복호화된 공간 정보(113)는 역양자화부(803)에서 데시벨(db) 단위로 역양자화된다. 가중치 계산부(805)는 역양자화된 공간 정보(113)를 이용하여 공간 정보(113)들 간에 존재하는 단계별 가중치를 계산한다. 가중치 계산부(805)는 계산한 가중치를 공간 정보 수정부(807)로 보내고, 공간 정보 수정부(807)는 가중치 계산부(805)가 구한 가중치를 이용하여 공간 정보(113)를 수정한다. 가중치 계산부(805)는 공간 정보(113)를 이용하여 가중치를 계산할 때, 가중치를 수정하여 계산할 수 있다. 즉, 가중치를 구하는 함수에 포함된 변수들을 수정하여 가중치를 수정할 수 있다. 이때, 가중치 계산부(805)는 가중치 수정부(미도시)를 포함할 수 있다. 공간 정보 수정부(807)는 가중치 계산부(805)가 수정하여 계산한 가중치를 이용하여 공간 정보(113)를 수정한다. 공간 정보 수정부(807)는 수정된 공간 정보(113)를 멀티 채널 생성부(707)로 전송한다. 8 is a block diagram illustrating a spatial information decoding unit of an audio signal decoding apparatus according to another embodiment of the present invention. Referring to FIG. 8, the spatial information decoding unit 705 includes a decoding unit 801, an inverse quantization unit 803, a weight calculation unit 805, and a spatial information correction unit 807. The decoding unit 801 of the spatial information decoding unit 705 receives the spatial information 113 separated from the audio signal by the demultiplexer 701 and decodes it. The decoded spatial information 113 is dequantized in decibels by the dequantizer 803. The weight calculator 805 calculates the step-by-step weights present between the spatial information 113 using the dequantized spatial information 113. The weight calculation unit 805 sends the calculated weight to the spatial information correction unit 807, and the spatial information correction unit 807 modifies the spatial information 113 by using the weight obtained by the weight calculation unit 805. The weight calculator 805 may calculate the weight by modifying the weight when calculating the weight using the spatial information 113. That is, the weight may be modified by modifying variables included in the function for obtaining the weight. In this case, the weight calculator 805 may include a weight correction unit (not shown). The spatial information correction unit 807 modifies the spatial information 113 by using the weight calculated by the weight calculation unit 805. The spatial information correction unit 807 transmits the modified spatial information 113 to the multi-channel generator 707.

도 9는 본 발명의 또 다른 실시예에 따른 오디오 신호 인코딩 장치를 나타낸 블럭도이다. 도 9를 참조하면, 오디오 신호 인코딩 장치는 다운믹싱부(901), 코어 인코딩부(903), 공간 정보 추출부(905), 공간 정보 인코딩부(907) 및 다중화부(909)를 포함한다. 다운믹싱부(901)는 멀티 채널을 입력받아 이를 다운믹싱하여 다운믹스 신호를 생성하여 이를 코어 인코딩부(903)로 보낸다. 공간 정보 추출 부(905)는 다운믹싱부(901)가 멀티 채널을 다운믹싱할 때 입력 채널들 사이의 관계를 나타내는 공간 정보(113)를 추출한다. 코어 인코딩부(903)는 다운믹스 신호를 부호화하여 다중화부(909)로 전송한다.9 is a block diagram showing an audio signal encoding apparatus according to another embodiment of the present invention. Referring to FIG. 9, an audio signal encoding apparatus includes a downmixing unit 901, a core encoding unit 903, a spatial information extracting unit 905, a spatial information encoding unit 907, and a multiplexing unit 909. The downmixer 901 receives a multi-channel and downmixes the multichannel to generate a downmix signal and sends it to the core encoder 903. The spatial information extracting unit 905 extracts the spatial information 113 representing the relationship between the input channels when the downmixing unit 901 downmixes the multi-channels. The core encoder 903 encodes the downmix signal and transmits the downmix signal to the multiplexer 909.

공간 정보 추출부(905)는 추출한 공간 정보(113)를 공간 정보 인코딩부(907)로 보낸다. 공간 정보 인코딩부(907)는 공간 정보(113)를 수정하여 이를 인코딩하고 인코딩한 공간 정보(113)를 다중화부(909)로 보낸다. 다중화부(909)는 다운믹스 신호와 공간 정보(113)를 이용하여 멀티 채널을 복원한다. The spatial information extracting unit 905 sends the extracted spatial information 113 to the spatial information encoding unit 907. The spatial information encoding unit 907 modifies and encodes the spatial information 113, and sends the encoded spatial information 113 to the multiplexer 909. The multiplexer 909 reconstructs the multi-channel by using the downmix signal and the spatial information 113.

도 10은 본 발명의 또 다른 실시예에 따른 오디오 신호 인코딩 장치의 공간 정보 인코딩부를 나타낸 블럭도이다. 도 10을 참조하면, 공간 정보 인코딩부(907)는 가중치 계산부(1001), 공간 정보 수정부(1003), 양자화부(1005) 그리고 인코딩부(1007)를 포함한다. 공간 정보 인코딩부(907)는 공간 정보 추출부(905)가 추출한 공간 정보(113)를 입력받는다. 가중치 계산부(1001)는 공간 정보(113)를 이용하여 멀티 채널들 사이에 존재하는 가중치를 계산한다. 가중치 계산부(1001)는 가중치를 구할 때, 가중치 함수에 포함된 변수 값을 수정하여 수정된 가중치를 구할 수 있다. 이때, 가중치 계산부(1001)는 가중치 수정부(미도시)를 포함할 수 있다. 공간 정보 수정부(1003)는 가중치 계산부(1001)가 계산한 가중치를 이용하여 공간 정보(113)를 수정한다. 수정된 공간 정보(113)는 양자화부(1005)에서 양자화되고 인코딩부(1007)부로 전송된다. 인코딩부(1007)는 양자화된 공간 정보(113)를 부호화하여 부호화된 공간 정보(113)를 다중화부(909)로 전송한다. 10 is a block diagram illustrating a spatial information encoding unit of an audio signal encoding apparatus according to another embodiment of the present invention. Referring to FIG. 10, the spatial information encoding unit 907 includes a weight calculation unit 1001, a spatial information correction unit 1003, a quantization unit 1005, and an encoding unit 1007. The spatial information encoder 907 receives the spatial information 113 extracted by the spatial information extractor 905. The weight calculator 1001 calculates a weight existing between the multi-channels using the spatial information 113. When the weight calculator 1001 calculates the weight, the weight calculator 1001 may obtain the modified weight by modifying a variable value included in the weight function. In this case, the weight calculator 1001 may include a weight correction unit (not shown). The spatial information correction unit 1003 modifies the spatial information 113 by using the weight calculated by the weight calculation unit 1001. The modified spatial information 113 is quantized by the quantization unit 1005 and transmitted to the encoding unit 1007. The encoder 1007 encodes the quantized spatial information 113 and transmits the encoded spatial information 113 to the multiplexer 909.

본 발명에 의한 오디오 신호 인코딩 및 디코딩 방법은 공간 정보를 가중치를 이용하여 수정함으로써 공간 정보를 구할 때 요구되는 계산량이 감소되는 효과를 갖는다.The audio signal encoding and decoding method according to the present invention has the effect of reducing the amount of computation required when obtaining spatial information by modifying the spatial information using weights.

Claims

Receiving an audio signal; And

And correcting the spatial information included in the received audio signal by using a weight.

The method of claim 1, wherein the audio signal decoding method is

And identifying whether information on whether the correction of the spatial information is necessary is included in the audio signal.

The method of claim 1, wherein modifying the spatial information

And grouping frequency bands into groups and applying the weights to the groups, respectively.

The method of claim 1, wherein modifying the spatial information

And applying the weight only to a predetermined frequency band.

The method of claim 3 or 4, wherein the modifying the spatial information

Audio signal decoding method characterized in that performed using the modified weight.

Downmixing the multi-channels to generate a downmix signal and spatial information; And

And modifying the generated spatial information by using a weight.

7. The method of claim 6, wherein modifying the spatial information

Audio signal encoding method characterized in that performed using the modified weight.

A demultiplexer for separating an audio signal into a downmix signal and spatial information; And

And a spatial information decoding unit for modifying the spatial information separated by the demultiplexer using a weight.

The method of claim 8, wherein the spatial information decoding unit

And a weight correction unit for correcting the weight and correcting the spatial information using the weight corrected by the weight correction unit.

A spatial information extracting unit extracting spatial information in the step of downmixing the multi-channels; And

And a spatial information encoding unit which modifies the extracted spatial information by using a weight.

The method of claim 10, wherein the spatial information encoding unit

And a weight correction unit for correcting the weights, and correcting the spatial information using the weights corrected by the weight correction unit.