KR100857102B1

KR100857102B1 - Method for generating encoded audio signal and method for processing audio signal

Info

Publication number: KR100857102B1
Application number: KR1020087004585A
Authority: KR
Inventors: 오현오; 방희석; 김동수; 임재현; 정양원; 김효진
Original assignee: 엘지전자 주식회사
Priority date: 2005-07-29
Filing date: 2006-07-28
Publication date: 2008-09-08
Also published as: CN101233568A; KR100857104B1; KR20080035656A; EP1915757A1; CN101233569A; CN101233570A; KR20080033452A; CN101233571B; AU2006273012A1; EP1920437A4; CN101233567B; WO2007013781A1; AU2006273012B2; KR100841332B1; WO2007013783A1; CN101233568B; EP1920438A4; CA2617050C; EP1915756A4; EP1920439A4

Abstract

본 발명은 다채널 오디오 코딩에서 부호화된 오디오 신호의 생성방법 및 오디오 신호의 처리방법에 관한 것이다.The present invention relates to a method for generating an audio signal encoded in multichannel audio coding and a method for processing an audio signal.

본 발명은 미리 설정된 출력채널의 구성정보인 고정채널 구성정보; 및 임의채널 구성정보를 포함하는 부호화된 오디오 신호의 생성방법을 제공한다.The present invention provides fixed channel configuration information which is configuration information of a preset output channel; And a method of generating an encoded audio signal including arbitrary channel configuration information.

Description

{METHOD FOR GENERATING ENCODED AUDIO SIGNAL AND METHOD FOR PROCESSING AUDIO SIGNAL}

본 발명은 다채널 코딩에 관한 것으로, 특히 부호화된 오디오 신호의 생성방법 및 오디오 신호의 처리방법에 관한 것이다.TECHNICAL FIELD The present invention relates to multichannel coding, and more particularly, to a method of generating an encoded audio signal and a method of processing an audio signal.

일반적으로 신호는 블록, 밴드 채널 등의 형태로 존재할 수 있는데, 이러한 신호들은 일정한 통계적 특성을 유지하는 스테셔너리(stationary) 구간에는 신호를 분할하지 않고 처리하는 것이 압축관점에서 유리하지만, 신호의 특성이 급격히 변화하는 트렌지언트(transient) 구간에서는 가급적 신호를 분할하여 처리하는 것이 신호의 왜곡 방지 차원에서 유리하다. 특히, 오디오 신호의 경우는 채널 구성 및 밴드 구성을 위해 신호를 분할하여 처리하는 경우가 있다.In general, a signal may exist in the form of a block, a band channel, and the like. Although these signals are advantageous in compression terms, they may be advantageously processed in a stationary section that maintains certain statistical characteristics. In this rapidly changing transient section, it is advantageous to divide and process the signal as much as possible in order to prevent distortion of the signal. In particular, in the case of an audio signal, a signal may be divided and processed for a channel configuration and a band configuration.

그러나 상기의 신호들을 분할하여 처리하고자 할 경우, 이 분할된 정보를 표현하는 방법이 구체적으로 제시된 바 없어, 상기의 신호들을 효율적으로 처리하는데 문제점이 있었다.However, in the case where the signals are to be divided and processed, there is no specific method for expressing the divided information, and there is a problem in efficiently processing the signals.

본 발명은 상기와 같은 문제점을 해결하기 위한 것으로서, 분할된 신호의 정보를 효율적으로 표현하는 방법 및 그 정보를 이용하여 신호처리를 하는 방법을 제공하는데 있다.The present invention is to solve the above problems, and to provide a method for efficiently expressing the information of the divided signal and a signal processing method using the information.

상기 목적을 달성하기 위해 본 발명은 미리 설정된 출력채널의 구성정보인 고정채널 구성정보; 및 임의채널 구성정보를 포함하는 부호화된 오디오 신호의 생성방법을 제공한다.In order to achieve the above object, the present invention provides fixed channel configuration information which is configuration information of a preset output channel; And a method of generating an encoded audio signal including arbitrary channel configuration information.

도 1은 본 발명의 일실시예에 의한 블록분할 정보의 시그널링 방법을 설명하기 위한 개념도,1 is a conceptual diagram illustrating a signaling method of block division information according to an embodiment of the present invention;

도 2는 내지 도 3은 본 발명의 일실시예에 의한 밴드/채널 분할 정보의 시그널링 방법을 설명하기 위한 개념도,2 to 3 are conceptual views illustrating a signaling method of band / channel segmentation information according to an embodiment of the present invention;

도 4는 본 발명의 일실시예를 이용하여 다채널 신호를 생성하는 방법에 대한 개념도,4 is a conceptual diagram for a method of generating a multi-channel signal using an embodiment of the present invention;

도 5는 본 발명의 일실시예에 의한 채널 분할 정보의 시그널링 방법을 설명하기 위한 개념도이다.5 is a conceptual diagram illustrating a signaling method of channel division information according to an embodiment of the present invention.

Best Mode for Carrying Out the InventionBest Mode for Carrying Out the Invention

이하, 상기와 같은 목적 달성을 위한 본 발명의 구체적인 실시예를 도면을 참조하여 설명하기로 한다.Hereinafter, specific embodiments of the present invention for achieving the above object will be described with reference to the drawings.

본 발명의 일실시예에 의한 분할 정보의 시그널링 방법을 편의상 신호의 종류별로 나누어 설명하기로 한다. 상기 신호는 블록, 밴드, 채널 등의 형태로 존재 가능하다. 본 명세서에서 시그널링 방법이란 시그널링을 하거나, 시그널링된 신호를 인식하는 것을 포함한다.A signaling method of split information according to an embodiment of the present invention will be described by dividing the type of signal for convenience. The signal may exist in the form of a block, band, channel, or the like. In the present specification, a signaling method includes signaling or recognizing a signaled signal.

한편, 본 명세서에 사용되는 용어인 "노드"는 신호의 분할여부가 표현되는 지점을 의미한다. 그리고 본 명세서에 사용되는 용어인 "공간정보"라 함은 다채널을 다운믹스(Down-mix) 하거나 다채널 신호를 생성하기 위해 업믹스(Up-mix) 하는 과정에서 필요한 정보를 의미한다. 상기 공간 정보로 공간 파라미터를 기준으로 설명하나, 본 발명이 이에 한정되지 않음은 자명한 사실임을 밝혀둔다.On the other hand, the term "node" as used herein refers to the point where the division of the signal is expressed. In addition, the term "spatial information" used in the present specification refers to information required in the process of down-mixing multiple channels or up-mixing to generate a multi-channel signal. Although the spatial information is described based on the spatial parameters, it is to be understood that the present invention is not limited thereto.

또한, 상기 공간 파라미터는 두 채널간의 에너지 차이를 의미하는CLD(channel level difference), 두 채널간의 상관관계(correlation)를 의미하는 ICC(inter channel coherences) 및 두 채널로부터 세 채널을 생성할 때 이용되는 예측 계수인 CPC(channel prediction coefficients) 등이 있다.In addition, the spatial parameter is used when generating three channels from two channels and a channel level difference (CLD) representing an energy difference between two channels, inter channel coherences (ICC) representing a correlation between two channels, and two channels. Channel prediction coefficients (CPC), which are prediction coefficients.

이하, 블록 분할, 밴드 분할, 채널 분할에 대해 살펴보기로 한다.Hereinafter, block division, band division, and channel division will be described.

(1) 블록 분할(1) block division

오디오 신호처럼 시간 축에서 연속적인 데이터에 대해 신호의 압축과 같은 처리를 하기 위해서는 블록 프로세싱(block processing)을 수행한다. 상기 블록 프로세싱(block processing)은 입력된 신호를 일정구간 또는 일정간격으로 나누어 처리하는 것을 의미한다. 이때 사용되는 구간을 블록이라 정의하며, 한 개 혹은 복수 개의 블록이 모여 프레임(frame)을 구성할 수 있다. 상기 프레임은 데이터의 전송 및 저장을 위해 사용되는 단위를 말한다.Block processing is performed to process data such as compression of a continuous data on a time axis like an audio signal. The block processing means processing the input signal by dividing the input signal into a predetermined section or a predetermined interval. In this case, the interval used is defined as a block, and one or a plurality of blocks may be gathered to form a frame. The frame refers to a unit used for transmitting and storing data.

본 발명에서 "블록 분할(block splitting)"이란, 입력된 신호의 블록을 가변시키면서 신호를 처리할 때, 서로 다른 크기의 블록으로 변화하는 과정을 의미한다. 또한, 본 발명에서 "블록 크기 정보(block size information)"란 입력된 신호 의 블록 크기를 가변시키면서 신호를 처리하는 경우에 블록의 크기를 나타내는 정보이다.In the present invention, "block splitting" refers to a process of changing a block of an input signal into blocks of different sizes when the signal is processed while varying a block of an input signal. In addition, in the present invention, "block size information" is information indicating a block size when processing a signal while varying a block size of an input signal.

일반적으로 신호가 블록형태로 존재하는 경우 장블록(long block)과 단블록(short block) 중 하나를 사용하여 신호처리를 수행한다. 이 때, 상기 단블록(short block)을 사용하는 경우, 복수 개의 단블록을 묶어 하나의 장블록 크기에 대응되도록 한다.In general, when a signal exists in the form of a block, signal processing is performed using one of a long block and a short block. In this case, when the short block is used, a plurality of short blocks are bundled to correspond to one long block size.

그러나 신호의 특성이 구간구간 마다 다양하기에 모든 신호에 대해 장블록에 의한 신호처리 또는 단블록 의한 신호처리로 이분하여 구분짓기는 어렵다.However, since the characteristics of the signals vary from section to section, it is difficult to divide them into signal processing by long block or signal processing by short block for all signals.

따라서, 임의의 구간에서 신호 특성에 맞는 보다 다양한 블록 크기 가운데서 선택하여 블록 분할을 수행하는 것이 바람직하다. 즉, 두 개 이상의 다른 크기를 갖는 블록들이 존재하고, 이들 가운데 적절한 크기의 블록을 프레임 내에서 다양한 조합으로 선택할 수 있도록 할 수 있다.Therefore, it is desirable to perform block division by selecting among more various block sizes suitable for signal characteristics in an arbitrary section. That is, there are blocks having two or more different sizes, and among them, an appropriate size block can be selected in various combinations within a frame.

이를 위해서는 현재의 프레임이 어떠한 블록들의 조합에 의해 구성되었는지를 알려줄 필요가 있고, 이를 위한 시그널링(signaling) 방법이 필요하다.To this end, it is necessary to inform which combination of blocks the current frame is composed, and a signaling method for this is needed.

상기 시그널링 방법에는 순차적 시그널링 방법과 계층적 시그널링 방법이 있다.The signaling method includes a sequential signaling method and a hierarchical signaling method.

순차적 시그널링 방법은 프레임의 크기(길이, N)를 미리 정의하고, 최소 크기 블록 (M)의 개수로써 시그널링(signaling) 하는 방법이다. 이때, 상기 프레임의 길이 N은 특정 M의 배수이며, 상기 프레임의 크기는 고정된 값일 수도 있고, 별도의 정보로써 전송되는 값일 수도 있다.The sequential signaling method is a method of previously defining the size (length, N) of a frame and signaling the number of minimum size blocks (M). In this case, the length N of the frame is a multiple of a specific M, and the size of the frame may be a fixed value or a value transmitted as separate information.

예를 들어, N=2048, M=256이고, 프레임 내에 앞에서부터 256, 256, 1024, 512의 순서로 블록이 구성된다고 하면, 블록 크기 정보는 M*1, M*1, M*4, M*2 =＞ 1,1,4,2 =＞0,0,3,1로 시그널링 하는 방법이 있을 수 있다.For example, if N = 2048, M = 256, and blocks are configured in the frame in the order of 256, 256, 1024, and 512 from the front, the block size information is M * 1, M * 1, M * 4, M * 2 => 1,1,4,2 => 0,0,3,1 may be a method for signaling.

또한, 계층적 시그널링 방법은 계층의 깊이 정보를 보내는 방법과 계층의 깊이 정보를 보내지 않는 방법이 있는데, 이에 대해서는 도면을 통해 상세히 설명하도록 하겠다.In addition, the hierarchical signaling method includes a method of sending depth information of a layer and a method of not sending depth information of a layer, which will be described in detail with reference to the accompanying drawings.

도 1은 본 발명의 일실시예에 의한 블록분할 정보의 시그널링 방법을 설명하기 위한 개념도이다.1 is a conceptual diagram illustrating a signaling method of block division information according to an embodiment of the present invention.

이에 도시된 바와 같이, 각 계층은 레이어(layer)로 나타내고, 상기 레이어의 깊이(depth)는 5이다.As shown here, each layer is represented by a layer, and the depth of the layer is five.

레이어 1(layer 1)은 블록 분할의 기본이 되는 가장 장블록이며, 그 길이는 N인 제1블록(210)을 포함한다. 또한, (1), (2), .., (a), (b), (c), (d)는 바이너리 시그널링(binary signaling) 순서의 일례를 나타낸다. 도시된 본 실시예에서 블록의 분할여부를 나타내는 블록 분할 정보를 분할 식별자와 미분할 식별자로 표현하는데, 분할 식별자로는 1을 사용하고, 미분할 식별자로는 0을 사용한다.Layer 1 is the longest block on which block division is based, and includes a first block 210 having a length of N. In addition, (1), (2), ..., (a), (b), (c), and (d) show an example of a binary signaling sequence. In the illustrated embodiment, block division information indicating whether a block is divided is represented by a partition identifier and an undivided identifier. 1 is used as a partition identifier and 0 is used as an undivided identifier.

그리고 상기 분할 식별자와 상기 미분할 식별자는 각 계층의 노드에서 표현된다.The split identifier and the undivided identifier are represented at nodes of each layer.

상기 분할 식별자는 상위 계층의 임의의 블록이 하위 계층에서 절반으로 분할됨과 아울러 하위 계층에 하위 대응 노드가 할당됨을 의미한다. 그리고 미분할 식별자는 상위 계층의 임의의 블록이 하위 계층에서 분할되지 않는 것을 의미함과 동시에 미분할 식별자가 표현된 노드에 대한 하위 대응 노드가 할당되지 않는 것을 의미한다. 하위 대응 노드가 할당되지 않는다는 것은 더이상의 추가적인 시그널링을 수행하지 않는다는 것을 뜻한다.The partition identifier means that any block of the upper layer is divided in half in the lower layer and a lower corresponding node is assigned to the lower layer. The undivided identifier means that any block of the upper layer is not divided in the lower layer and that the lower corresponding node for the node in which the undivided identifier is expressed is not allocated. The fact that the lower correspondent node is not assigned means that no further signaling is performed.

최상위 계층인 레이어 1에서 제1블록(210)에 대한 블록 분할 정보(1)가 '1'이므로 제1블록(210)의 블록 분할을 수행한다. 상기 레이어 1의 하위 계층인 레이어 2는 N/2의 길이를 가지는 제2블록(220)과 제3블록(221)을 포함한 2개의 블록으로 구성된다.Since the block partitioning information 1 for the first block 210 is '1' in Layer 1, which is the highest layer, block partitioning of the first block 210 is performed. Layer 2, which is a lower layer of the layer 1, is composed of two blocks including a second block 220 and a third block 221 having a length of N / 2.

레이어 2(layer 2)에서 제2-1블록(220)의 블록 분할 정보(2)가 '1'이고, 제2-2블록(221)의 블록 분할 정보(3)가 '1'이므로, 상기 레이어 2의 하위 계층인 레이어 3(layer 3)은 N/4의 길이를 가지는 제3-1블록(230), 제3-2블록(231), 제3-3블록(232), 제3-4블록(233)을 포함한 4개의 블록으로 구성된다.Since the block partitioning information 2 of the second-first block 220 is '1' in the layer 2 and the block partitioning information 3 of the second-two block 221 is '1', Layer 3 (layer 3), which is a lower layer of layer 2, has 3-1 blocks 230, 3-2 blocks 231, 3-3 blocks 232, and 3-3 having a length of N / 4. It consists of four blocks including four blocks 233.

레이어 3(layer 3)에서 제3-1블록(230)에 대한 블록 분할 정보(4)가 '0', 제3-2블록(231)에 대한 블록 분할 정보(5)가 '1', 제3-3블록(232)에 대한 블록 분할 정보(6)가 '1', 제3-4블록(233)에 대한 블록 분할 정보(7)가 '0'이다. 따라서,상기 레이어 3의 블록 분할 정보에 따르면, 레이어 3의 제3-1블록(230)과 제3-4블록(233)은 블록 분할을 수행하지 않고, 레이어 3의 제3-2블록(231)과 제3-3블록(232)에 대해서만 블록 분할을 수행한다. 이 때, 레이어 3에서 블록 분할을 하지않은 제3-1블록(230)과 제3-4블록(233) 이후의 하위 계층(레이어 4)에서는 하위 대응 노드가 할당되지 않는다. 또한, 레이어 3에서 블록 분할을 수행한 제3-2블록(231)과 제3-3블록(232)은 이후의 하위 계층에 하위 대응 노드를 할당하여 상기 하위 대응 노 드에서 블록 분할여부가 표현된다.In the layer 3 (block 3), the block partitioning information 4 for the third-1 block 230 is '0', and the block partitioning information 5 for the third-2 block 231 is '1', The block partitioning information 6 for the 3-3 block 232 is '1', and the block partitioning information 7 for the 3-4 block 233 is '0'. Therefore, according to the block division information of the layer 3, the 3-1 block 230 and the 3-4 block 233 of the layer 3 does not perform block division, but the 3-2 block 231 of the layer 3 ) And the third-3 blocks 232 only. At this time, the lower correspondent node is not allocated in the lower layer (layer 4) after the 3-1 block 230 and the 3-4 block 233 which do not block division in the layer 3. In addition, the 3-2 block 231 and the 3-3 block 232 having performed the block division in the layer 3 are assigned a lower correspondent node to a later lower layer to express block division in the lower correspondent node. do.

레이어 4는 N/8의 길이를 가지고, 레이어 3의 제3-2블록(231)을 블록 분할한 제4-1블록(240)과 제4-2블록(241), 제3-3블록(232)을 블록 분할한 제4-3블록(242)과 제4-4블록(243)을 포함하여 구성된다. 상기 레이어 4에서 제4-1블록(240)에 대한 블록 분할 정보(8)는 '0', 제4-2블록(241)에 대한 블록 분할 정보(9)는 '1', 제4-3블록(242)에 대한 블록 분할 정보(a)는 '0', 제4-4블록(243)에 대한 블록 분할 정보(b)는 '0'이다. 따라서, 상기 레이어 4의 블록 분할 정보에 따르면, 레이어 4의 제4-1블록(240), 제4-3블록(242), 제4-4블록(243)은 블록 분할을 수행하지 않고, 레이어 4의 제4-2블록(241)은 블록 분할을 수행한다. 이 때, 레이어 4에서 블록 분할을 하지않은 제4-1블록(240), 제4-3블록(242), 제4-4블록(243) 이후의 하위 계층(레이어 5)에서는 하위 대응 노드를 할당하지 않으며, 레이어 4에서 블록 분할을 수행한 제4-2블록(241)은 이후의 하위 계층에서 하위 대응 노드를 할당하여, 상기 하위 대응 노드에서 블록 분할여부가 표현된다.The layer 4 has a length of N / 8, and the 4-1 block 240, the 4-2 block 241, and the 3-3 block (the block division of the 3-2 block 231 of the layer 3) And a 4-3 block 242 and a 4-4 block 243 obtained by dividing the block 232 into blocks. In the layer 4, the block partition information 8 for the 4-1 block 240 is '0', and the block partition information 9 for the 4-2 block 241 is '1', and 4-3. The block partitioning information (a) for the block 242 is '0', and the block partitioning information (b) for the fourth to fourth blocks 243 is '0'. Therefore, according to the block partitioning information of the layer 4, the 4-1 block 240, the 4-3 block 242, and the 4-4 block 243 of the layer 4 does not perform block division, but the layer 4th-4-2 block 241 performs block division. In this case, the lower correspondent node (layer 5) after the 4-1 block 240, the 4-3 block 242, and the 4-4 block 243 that do not block division in the layer 4 is selected. 4-2 block 241, which does not allocate and performs block division in layer 4, allocates a lower correspondent node in a subsequent lower layer, whereby block division is expressed in the lower correspondent node.

레이어 5는 N/16의 길이를 가지고, 레이어 4의 제4-2블록(241)을 블록 분할한 제5-1블록(250)과 제5-2블록(251)을 포함하여 구성된다. 상기 레이어 5에서 제5-1블록(250)에 대한 블록 분할 정보(c)는 '0', 제5-2블록(251)에 대한 블록 분할 정보(d)는 '0'이다. 그러므로, 레이어 5의 모든 블록 분할 정보가 '0'이므로 더 이상 계층적으로(hierarchically) 블록 분할을 하지 않게 되고, 블록의 블록 분할 깊이를 알 수 있다.The layer 5 has a length of N / 16 and includes a 5-1 block 250 and a 5-2 block 251 in which the 4-2 block 241 of the layer 4 is divided into blocks. In the layer 5, the block partitioning information (c) for the 5-1 block 250 is '0', and the block partitioning information (d) for the 5-2 block 251 is '0'. Therefore, since all block partition information of the layer 5 is '0', the block partition depth is no longer hierarchically, and the block partition depth of the block can be known.

그러므로, 상기에서 계층적으로 블록 분할을 수행하여 구성될 수 있는 블록 의 구조(block layout)를 살펴보면, N/4 블록, N/8 블록, N/16 블록, N/16 블록, N/8 블록, N/8 블록, N/8 블록으로 구성된다.Therefore, looking at the block layout of the block that can be configured by performing block division in a hierarchical manner, N / 4 block, N / 8 block, N / 16 block, N / 16 block, N / 8 block , N / 8 block, N / 8 block.

신호의 길이가 N인 경우, 블록 분할된 복수 개의 블록 길이는 N/2, N/4, N/8, N/16, N/32... 중 하나의 길이를 가진다. 이를 수식으로 나타내면 N/xⁱ로 표현할 수 있다. 상기 수식에서 i = 1, 2, ..., p 중 어느 하나이며, 상기 p는 정수인 것을 특징으로 한다. 그리고 여기서 x는 2이다.When the length of the signal is N, the plurality of block divided block lengths have the length of one of N / 2, N / 4, N / 8, N / 16, N / 32... If this is expressed as an expression, it can be expressed as N / x ⁱ . In the above formula, i = 1, 2, ..., p is any one, characterized in that p is an integer. And where x is 2.

또한, 2진수로 표현되는 블록 분할 정보를 바이너리 시그널링 순서인 (1)(2)(3)(4)(5)(6)(7)(8)(9)(a)(b)(c)(d)로 나타내면, '1110110010000'의 13비트로 표현하는 것이 가능하다.In addition, the block division information expressed in binary numbers is expressed in binary signaling order (1) (2) (3) (4) (5) (6) (7) (8) (9) (a) (b) (c ), it can be represented by 13 bits of '1110110010000'.

이상은 계층의 깊이정보는 별도로 표현하지 않고, 분할 식별자와 미분할 식별자로 표현되는 블록 분할 정보로 계층의 깊이를 파악 가능한 경우를 설명한 것이다.The foregoing has described the case where the depth of the layer can be grasped by the block division information expressed by the split identifier and the undivided identifier, without separately expressing the depth information of the layer.

그러나 계층의 깊이 정보를 별도로 표현하는 블록 분할 정보의 시그널링도 가능하다. 예를 들어 상기 계층의 깊이 정보는 분할 종료 식별자와 분할 계속 식별자로 표현한다. 상기 분할 종료 식별자라 함은 더 이상 블록 분할이 이루어지지 않는 최하위 계층을 의미하고, 분할 계속 식별자라 함은 최하위 계층이 아닌 계층을 의미한다. 이 경우에도 분할 계속 식별자는 '1'로, 분할 종료 식별자는 '0'으로 표현할 수 있다. 도 1에 도시된 계층의 깊이는 5이고, 이를 상기와 같은 방식으로 표현하면 11110이 된다. 상술한 시그널링 방식에 의해 서브 블록의 크기를 인식할 수 있다. 이와 같이, 깊이 정보를 별도로 표현하는 경우는 최하위 계층에 할당된 노드에서는 결국 미분할 식별자만이 표현되기 때문에 최하위 계층의 이전 계층까지만 시그널링을 할 수 있다. 그 일례로 분할 식별자를 '1'로, 미분할 식별자를 '0'으로 표현하고, 분할 계속 식별자를 '1'로, 분할 종료 식별자를 '0'으로 표현한 경우는 최하위 계층에 할당된 노드의 분할여부 표현값이 분할 종료를 의미하는 '0'으로 대표될 수 있다.However, signaling of block partitioning information expressing depth information of a layer is also possible. For example, the depth information of the layer is represented by a split end identifier and a split duration identifier. The segmentation end identifier refers to the lowest layer where no block division is performed anymore, and the segmentation continuing identifier refers to a layer other than the lowest layer. Even in this case, the segmentation continued identifier may be expressed as '1' and the segmentation end identifier may be represented as '0'. The depth of the hierarchy illustrated in FIG. 1 is 5, which is 11110 when expressed in the above manner. The size of the subblock can be recognized by the above-described signaling scheme. As such, when the depth information is separately expressed, only the undivided identifier is finally expressed in the node allocated to the lowest layer, so that only the previous layer of the lowest layer may be signaled. For example, when the partition identifier is represented by '1', the undivided identifier is represented by '0', the partition segmentation identifier is represented by '1', and the partition end identifier is represented by '0', the partition of the node allocated to the lowest layer is divided. Whether the expression value may be represented by '0' which means the end of the division.

(2) 밴드 분할(2) band division

밴드 분할에 대해 도 2 내지 도 3을 참조하여 살펴보면 다음과 같다.Looking at the band division with reference to Figures 2 to 3 as follows.

도 2는 본 발명의 일실시예에 의한 밴드 분할 정보의 시그널링 방법을 설명하기 위한 개념도이다.2 is a conceptual diagram illustrating a signaling method of band division information according to an embodiment of the present invention.

도 2를 참조하면, 서브밴드 필터뱅크(subband filterbank)에서 트리(tree) 구조를 가지는 계층적인 밴드 분할에 관한 것이다. 이하, 후술하는 방법에 의하면 서브밴드의 주파수 해상도를 자유롭게 정의할 수 있다.Referring to FIG. 2, the present invention relates to hierarchical band division having a tree structure in a subband filterbank. In the following method, the frequency resolution of the subband can be freely defined.

도 2를 도 1와 비교하면, 도 1은 하나의 장블록이 최상의 계층을 이루는 반면, 도 2에 도시된 밴드 분할 방식은 최상위 계층에 복수 개의 밴드를 포함하는 경우를 예로 하였다.Comparing FIG. 2 with FIG. 1, FIG. 1 illustrates a case in which one long block forms the best layer, while the band division scheme illustrated in FIG. 2 includes a plurality of bands in the top layer.

도시된 본 실시예에서 밴드의 분할여부를 나타내는 밴드 분할 정보를 분할 식별자와 미분할 식별자로 표현하는데, 분할 식별자로는 1을 사용하고, 미분할 식별자로는 0을 사용한다. 그리고 상기 분할 식별자와 상기 미분할 식별자는 각 계층의 노드에서 표현된다.상기 분할 식별자는 제 M 계층의 임의의 밴드가 제 M+1 계층 에서 절반으로 분할됨을 의미한다. 그리고 미분할 식별자는 제 M 계층의 임의의 밴드가 제 M+1 계층에서 분할되지 않는 것을 의미함과 동시에 미분할 식별자가 표현된 노드에 대한 하위 대응 노드가 할당되지 않는 것을 의미한다. 하위 대응 노드가 할당되지 않는다는 것은 더 이상의 추가적인 시그널링을 수행하지 않는다는 것을 뜻한다.In the illustrated embodiment, band splitting information indicating whether a band is divided is represented by a splitting identifier and an undivided identifier. 1 is used as a splitting identifier and 0 is used as a splitting identifier. The splitting identifier and the undivided identifier are expressed in nodes of each layer. The splitting identifier means that an arbitrary band of the Mth layer is divided in half in the M + 1 layer. The undivided identifier means that any band of the Mth layer is not divided in the M + 1 layer and that the lower correspondent node for the node in which the undivided identifier is expressed is not allocated. The fact that the lower correspondent node is not assigned means that no further signaling is performed.

최상위 계층인 레이어 1(layer 1)은 제1-1밴드(310), 제1-2밴드(311), 제1-3밴드(312), 제1-4밴드(313), 제1-5밴드(314), 제1-6밴드(315)를 포함한 6개의 밴드로 구성된다. 제1-1밴드(310)의 밴드 분할 정보(1)는 '1', 제1-2밴드(311)의 밴드 분할 정보(2)는 '1', 제1-3밴드(312)의 밴드 분할 정보(3)는 '0', 제1-4밴드(313)의 밴드 분할 정보(4)는 '0', 제1-5밴드(314)의 밴드 분할 정보(5)는 '0', 제1-4밴드(315)의 밴드 분할 정보(6)는 '0'으로 표현된다.Layer 1, which is the highest layer, includes the first-first band 310, the first-second band 311, the first-third band 312, the first-fourth band 313, and the first- fifth band. It consists of six bands, including the band 314 and the first to sixth bands 315. The band division information 1 of the 1-1 st band 310 is '1', the band division information 2 of the 1-2 band 311 is '1', and the band of the 1-3 band 312 The division information 3 is '0', the band division information 4 of the first-4 bands 313 is '0', the band division information 5 of the first-5 bands 314 is '0', The band division information 6 of the first to fourth bands 315 is represented by '0'.

상술한 밴드 분할 정보는 레이어 1에 할당된 노드에서 표현된다. 밴드 분할 정보(1, 2)에 따라 제1-1밴드(310)와 제1-2밴드(311)는 신호 변환모듈(310T, 311T: 본 실시예에서는 '밴드 변환모듈'이라 부른다)을 생성하여 레이어 2에 하위밴드(320, 321, 322, 323)를 생성한다. 그리고 하위밴드(320, 321, 322, 323)에는 하위 대응 노드가 할당된다. 이에 반해, 밴드 분할을 수행하지 않는 제1-3밴드, 제1-4밴드, 제1-5밴드, 제1-6밴드는 밴드 변환모듈을 생성하지 않고, 이후 계층(레이어 2)에 대응하는 하위밴드도 생성되지 않는다. 따라서 하위 대응 노드 또한 할당되지 않는다.The above-described band division information is expressed in a node assigned to layer 1. The first-first band 310 and the first-second band 311 generate the signal conversion modules 310T and 311T (called 'band conversion modules' in this embodiment) according to the band division information (1, 2). The lower bands 320, 321, 322, and 323 are generated in the layer 2. Subordinate corresponding nodes are allocated to the lower bands 320, 321, 322, and 323. On the other hand, the 1-3 bands, the 1-4 bands, the 1-5 bands, and the 1-6 bands which do not perform band division do not generate a band conversion module, and correspond to a later layer (layer 2). No subbands are created. Therefore, the lower correspondent node is also not assigned.

레이어 2는 제1-1밴드(310)가 밴드 분할되어 형성된 제2-1밴드(320), 제2-2 밴드(321)와, 제1-2밴드(311)가 밴드 분할되어 형성된 제2-3밴드(322), 제2-4밴드(323)를 포함하여 구성된다. 제2-1밴드(320)의 밴드 분할 정보(7)는 '1', 제2-2밴드(321)의 밴드 분할 정보(8)는 '1', 제2-3밴드(322)의 밴드 분할 정보(9)는 '0', 제2-4밴드(323)의 밴드 분할 정보(10)는 '0'으로 표현된다.In the layer 2, the second-first band 320 formed by banding the first-first band 310 and the second-second band 321 and the second-second band formed by banding the first-second band 311 are separated. And a third band 322 and a second-4 band 323. The band division information 7 of the second-1 band 320 is '1', the band division information 8 of the second-2 band 321 is '1', and the band of the second-3 band 322. The segmentation information 9 is represented by '0', and the band segmentation information 10 of the second through fourth bands 323 is represented by '0'.

밴드 분할 정보(7, 8)에 따라 제2-1밴드(320)와 제2-2밴드(321)는 밴드 변환모듈(320T, 321T)을 생성하여 레이어 3에 하위밴드(330, 331, 332, 333)를 생성한다. 그리고 하위밴드(330, 331, 332, 333)에는 하위 대응 노드가 할당된다. 이에 반해, 밴드 분할을 수행하지 않는 제2-3밴드와 제2-4밴드는 밴드 변환모듈을 생성하지 않고, 이후 계층(레이어 3)에 대응하는 하위밴드도 생성되지 않는다. 따라서 하위 대응 노드 또한 할당되지 않는다.The second-first band 320 and the second-second band 321 generate the band conversion modules 320T and 321T according to the band division information 7 and 8 to generate the lower bands 330, 331, and 332 in Layer 3. , 333). Subordinate corresponding nodes are allocated to the lower bands 330, 331, 332, and 333. On the other hand, the 2-3 bands and the 2-4 bands that do not perform the band division do not generate the band conversion module, and no lower band corresponding to the layer (layer 3) is also generated. Therefore, the lower correspondent node is also not assigned.

레이어 3은 제2-1밴드(320)가 밴드 분할되어 형성된 제3-1밴드(330), 제3-2밴드(331)와, 제2-2밴드(321)가 밴드 분할되어 형성된 제3-3밴드(332), 제3-4밴드(333)을 포함하여 구성된다. 제3-1밴드(330)의 밴드 분할 정보(11)는 '1', 제3-2밴드(331)의 밴드 분할 정보(12)는 '0', 제3-3밴드(332)의 밴드 분할 정보(13)는 '0', 제3-4밴드(333)의 밴드 분할 정보(14)는 '0'으로 표현된다.In the layer 3, a third-1 band 330, a third-2 band 331, and a second-2 band 321 are formed by band-dividing the second-first band 320. And a third band 332 and a third-4 band 333. The band division information 11 of the third-1 band 330 is '1', the band division information 12 of the third-2 band 331 is '0', and the band of the third-3 band 332 The division information 13 is represented by '0', and the band division information 14 of the third to fourth bands 333 is represented by '0'.

밴드 분할 정보(11)에 따라 제3-1밴드(330)는 신호 변환모듈(330T)을 생성하여 레이어 4에 하위밴드(340, 341)를 생성한다. 그리고 하위밴드(340, 341)에는 하위 대응 노드가 할당된다. 이에 반해, 밴드 분할을 수행하지 않는 제3-2밴드, 제3-3밴드, 제3-4밴드는 밴드 변환모듈을 생성하지 않고, 이후 계층(레이어 4)에 대응하는 하위밴드도 생성되지 않는다. 따라서 하위 대응 노드 또한 할당되지 않는다.According to the band division information 11, the third-first band 330 generates the signal conversion module 330T to generate lower bands 340 and 341 in the layer 4. Sub-bands 340 and 341 are allocated sub-corresponding nodes. In contrast, the 3-2 band, the 3-3 band, and the 3-4 band which do not perform the band division do not generate the band conversion module, and no lower band corresponding to the layer (layer 4) is generated later. . Therefore, the lower correspondent node is also not assigned.

레어어 4는 제3-1밴드(330)가 밴드 분할되어 형성된 제4-1밴드(340)와 제4-2밴드(341)를 포함하여 구성된다 상기 제4-1밴드(340)의 밴드 분할 정보(15)는 '0', 상기 제4-2밴드(341)의 밴드 분할 정보(16)는 '0'으로 표현된다. 따라서, 밴드 분할을 수행하는 하위 계층이 더 이상 존재하지 않고 시그널링도 종료하게 된다. 이 경우 최하위 계층은 레이어 4가 된다.The rare 4 includes a 4-1 band 340 and a 4-2 band 341 formed by splitting the 3-1 band 330 into bands. The band of the 4-1 band 340 The division information 15 is expressed as '0', and the band division information 16 of the fourth-2 band 341 is represented as '0'. Therefore, the lower layer that performs band division no longer exists and the signaling ends. In this case, the lowest layer is layer 4.

그리고, 2진수로 표현되는 밴드 분할 정보를 바이너리 시그널링 순서인 (1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)(13)(14)(15)(16)로 나타내면 '1100001100100000'의 16비트로 표현하는 것이 가능하다.Then, the band division information expressed in binary is (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) which are binary signaling sequences. (13) (14) (15) (16) can be represented by 16 bits of '1100001100100000'.

도 3은 본 발명의 또 다른 일실시예에 의한 밴드 분할 정보의 시그널링 방법을 설명하기 위한 개념도이다.3 is a conceptual diagram illustrating a signaling method of band division information according to another embodiment of the present invention.

도 3은 도 2와 비교하여, 밴드 분할을 수행하는 과정 등이 모두 유사하다. 다만, 밴드 분할 정보를 바이너리 시그널링(binary signaling)하는 순서가 차이가 난다. 그 순서는 도면에 기재되어 있다.3 is similar to the process of performing band division as compared to FIG. 2. However, the order of binary signaling of the band split information is different. The order is described in the figure.

따라서, 2진수로 표현되는 밴드 분할 정보를 바이너리 시그널링 순서인 (1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)(13)(14)(15)(16)으로 나타내면 '1110001001000000'의 16비트로 표현하는 것이 가능하다.Therefore, the band division information expressed in binary numbers is (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12). (13) (14) (15) (16) can be represented by 16 bits of '1110001001000000'.

이상은 계층의 깊이정보는 별도로 표현하지 않고, 분할 식별자와 미분할 식별자로 표현되는 밴드 분할 정보로 계층의 깊이를 파악 가능한 경우를 설명한 것이다.The foregoing has described the case where the depth of the layer can be grasped by band division information expressed by the split identifier and the undivided identifier, without separately expressing the depth information of the layer.

그러나 계층의 깊이 정보를 별도로 표현하는 밴드 분할 정보의 시그널링도 가능하다. 예를 들어 상기 계층의 깊이 정보는 분할 종료 식별자와 분할 계속 식별자로 표현한다. 상기 분할 종료 식별자라 함은 더 이상 밴드 분할이 이루어지지 않는 최하위 계층을 의미하고, 분할 계속 식별자라 함은 최하위 계층이 아닌 계층을 의미한다. 이 경우에도 분할 계속 식별자는 '1'로, 분할 종료 식별자는 '0'으로 표현할 수 있다. 도 2 내지 도 3에 도시된 계층의 깊이는 4이고, 이를 상기와 같은 방식으로 표현하면 1110이 된다. 상술한 시그널링 방식에 의해 서브 밴드의 크기를 인식할 수 있다. 이와 같이, 깊이 정보를 별도로 표현하는 경우는 최하위 계층에 할당된 노드에서는 결국 미분할 식별자만이 표현되기 때문에 최하위 계층의 이전 계층까지만 시그널링을 할 수 있다. 그 일례로 분할 식별자를 '1'로, 미분할 식별자를 '0'으로 표현하고, 분할 계속 식별자를 '1'로, 분할 종료 식별자를 '0'으로 표현한 경우는 최하위 계층에 할당된 노드의 분할여부 표현값이 분할 종료를 의미하는 '0'으로 대표될 수 있다.However, signaling of the band splitting information representing the depth information of the layer may also be performed. For example, the depth information of the layer is represented by a split end identifier and a split duration identifier. The segmentation end identifier refers to the lowest layer where no further band division is performed, and the segmentation continuing identifier refers to a layer other than the lowest layer. Even in this case, the segmentation continued identifier may be expressed as '1' and the segmentation end identifier may be represented as '0'. The depth of the layer illustrated in FIGS. 2 to 3 is 4, which is 1110 when expressed in the above manner. The size of the subband can be recognized by the above-described signaling scheme. As such, when the depth information is separately expressed, only the undivided identifier is finally expressed in the node allocated to the lowest layer, so that only the previous layer of the lowest layer may be signaled. For example, when the partition identifier is represented by '1', the undivided identifier is represented by '0', the partition segmentation identifier is represented by '1', and the partition end identifier is represented by '0', the partition of the node allocated to the lowest layer is divided. Whether the expression value may be represented by '0' which means the end of the division.

(3) 채널 분할(3) channel division

채널 분할 정보는 채널을 구성할 때 이용되는 채널 구성 정보와 관련이 있으므로 채널 구성정보를 설명하면서 채널 분할에 대해 상세히 설명하도록 하겠다. 특히, 다채널 오디오 신호의 인코딩 및 디코딩 하는 경우의 채널구성에 대한 정보를 일례로 설명하겠다.Since the channel segmentation information is related to the channel configuration information used when configuring the channel, the channel segmentation information will be described in detail while describing the channel configuration information. In particular, the information on the channel configuration in the case of encoding and decoding the multi-channel audio signal will be described as an example.

먼저, 다채널 오디오 코딩에서는 필수적으로 요구되는 기본 공간정보가 있다.First, there is a basic spatial information that is essential in multichannel audio coding.

상기 기본 공간정보는 상기 기본 환경에 대한 구성정보를 나타내는 기본환경 설정정보와 그 기본환경 설정정보에 대응되는 기본 데이터로 구성된다. 또한, 다채널 오디오 코딩에서는 선택적으로 요구되는 확장 공간정보가 있다. 상기 확장 공간정보는 상기 확장 환경에 대한 구성정보를 나타내는 확장환경 설정정보와 그 확장환경 설정정보에 대응되는 확장 데이터로 구성된다. 그리고 상술한 확장 환경에 대한 구성정보는 적어도 하나 이상이 존재할 수 있고, 상술한 확장환경은 타입 식별자에 의해 식별 가능하다.The basic spatial information includes basic environment setting information representing configuration information of the basic environment and basic data corresponding to the basic environment setting information. In addition, there is extended spatial information that is optionally required in multichannel audio coding. The extended space information includes extended environment setting information representing configuration information of the extended environment and extended data corresponding to the extended environment setting information. At least one configuration information regarding the above-described extended environment may exist, and the above-described extended environment may be identified by a type identifier.

한편, 다채널 오디오 신호의 코딩에서 언급되는 채널구성은 크게 2가지 경우로 나누어 생각해 볼 수 있다. 그 하나는 기본 채널구성고, 또 다른 하나는 확장 채널구성이다.On the other hand, the channel configuration mentioned in the coding of the multi-channel audio signal can be divided into two cases. One is the basic channel configuration and the other is the extended channel configuration.

기본 채널구성에 대한 정보는 적어도 하나의 채널구성에 대한 정보가 설정되어 있는데, 그 중에서 선택된 하나의 채널구성에 대한 정보를 말한다. 편의상 기본 채널구성 정보를 '고정 채널구성 정보'라 칭하고, 상기 고정 채널구성 정보에 따라 생성된 다채널을 '고정출력 채널'이라고 부르기로 한다. 상술한 고정출력 채널을 생성하기 위해서는 고정 채널구성 정보와 이에 상응하는 고정 채널구성 데이터가 필요하다.The information on the basic channel configuration has information on at least one channel configuration, which means information on one channel configuration selected from among them. For convenience, basic channel configuration information will be referred to as 'fixed channel configuration information', and multi-channels generated according to the fixed channel configuration information will be referred to as 'fixed output channels'. In order to generate the above-described fixed output channel, fixed channel configuration information and corresponding fixed channel configuration data are required.

고정 채널구성에 대한 정보는 이미 설정된 채널구성에 대한 정보 중 선택된 하나의 채널구성에 대한 정보이다. 상기 설정된 채널구성은 다양한 경우를 상정할 수 있다. 그 예로 들 수 있는 채널구성은 5-1-5, 5-2-5, 7-2-7, 7-5-7 구성 등이 있다. 5-2-5 구성이라 함은 여섯 개의 입력채널을 두 개의 채널로 다운믹스하고, 상기 다운믹스된 채널을 여섯 개의 채널로 출력하는 채널구성을 말한다. 나머지 채 널구성도 마찬가지로 설명된다. 상술한 고정 채널구성 정보는 기본환경 설정정보 내에 포함되고, 상기 고정 채널구성 정보에 대응하는 데이터는 기본 데이터 내에 포함된다. 상술한 기본 데이터로는 두 채널간의 에너지 차이를 의미하는CLD, 두 채널간의 상관관계를 의미하는 ICC 및 두 채널로부터 세 채널을 생성할 때 이용되는 예측 계수인 CPC 등이 사용될 수 있다.The information on the fixed channel configuration is information on one channel configuration selected from the information on the channel configuration already set. The set channel configuration may assume various cases. Examples of channel configurations include 5-1-5, 5-2-5, 7-2-7, and 7-5-7. The 5-2-5 configuration refers to a channel configuration for downmixing six input channels to two channels and outputting the downmixed channels to six channels. The rest of the channel configuration is likewise described. The fixed channel configuration information described above is included in basic configuration information, and data corresponding to the fixed channel configuration information is included in basic data. As the basic data described above, CLD which means energy difference between two channels, ICC which means correlation between two channels, and CPC which is a prediction coefficient used when generating three channels from two channels may be used.

또한, 확장 채널 구성은 상술한 고정 채널구성 이후에 형성되는 채널구성을 말한다. 상술한 확장 채널구성은 인코딩된 신호에 따라 임의적으로 채널 구성이 형성된다. 따라서 편의상 확장 채널 구성에 대한 정보를 '임의 채널구성 정보'라 부르기로 한다. 그리고 상기 임의 채널구성 정보에 의해 생성된 다채널을 '임의출력 채널'이라 부르기로 한다. 상술한 임의 채널구성 정보는 확장환경 설정정보 내에 포함되고, 이는 채널 식별자라는 타입 식별자에 의해 식별된다.In addition, the extended channel configuration refers to a channel configuration formed after the above-described fixed channel configuration. In the above-described extended channel configuration, a channel configuration is arbitrarily formed according to the encoded signal. Therefore, for convenience, the information on the extended channel configuration will be referred to as 'arbitrary channel configuration information'. The multi-channel generated by the arbitrary channel configuration information will be referred to as 'arbitrary output channel'. The arbitrary channel configuration information described above is included in the extended configuration information, which is identified by a type identifier called a channel identifier.

그리고 임의 채널구성 정보에 대응되는 임의 채널구성 데이터는 상기 확장 데이터 내에 포함된다. 상기 임의 채널구성 데이터는 연산량의 간소화를 위해 두 채널간의 에너지 차이를 의미하는CLD만을 사용할 수도 있다.The arbitrary channel configuration data corresponding to the arbitrary channel configuration information is included in the extension data. The random channel configuration data may use only CLD, which means an energy difference between two channels, for the purpose of simplifying the calculation amount.

상기 임의 채널구성 정보는 분할 식별자와 미분할 식별자로 표현한다. 상기 임의 채널구성 정보의 구성요소인 분할 식별자는 채널 수 증가를 의미하고, 미분할 식별자는 채널 수의 변화가 없는 경우를 말한다. 예를 들어, 분할 식별자는 하나의 입력채널이 두 개의 채널로 변환되어 출력되는 것을 의미하고, 미분할 식별자는 입력채널이 그대로 출력되는 것을 의미한다. 그리고 상위 계층의 채널에 할당된 상위 노드에 분할 식별자가 표현된 경우는 하위 계층에 하위 채널을 생성하고, 생성된 채널에 대응되게 하위 대응 노드도 할당한다.The arbitrary channel configuration information is expressed by a split identifier and an undivided identifier. The split identifier, which is a component of the arbitrary channel configuration information, means an increase in the number of channels, and the unsplit identifier indicates a case where there is no change in the number of channels. For example, the split identifier means that one input channel is converted into two channels and outputted, and the unsplit identifier means that the input channel is output as it is. When the partition identifier is expressed in the upper node allocated to the channel of the upper layer, the lower channel is generated in the lower layer, and the lower corresponding node is allocated corresponding to the generated channel.

그러나 상위 계층의 채널에 할당된 상위 노드에 미분할 식별자가 표현된 경우는 하위 계층에 하위 채널을 생성하지 않는다. 따라서 하위 대응 노드 또한 할당되지 않는다. 상기 임의 채널구성 정보를 분할 식별자와 미분할 식별자로 표현하는 방법에 대해 도 2 내지 도 3을 참조하여 설명하기로 한다. 도 2 내지 도 3은 전술한 밴드 분할에 대한 설명뿐만 아니라 채널 분할에 대해서도 설명이 가능한 도면이기에 이를 이용하는 것이다.However, if an undivided identifier is expressed in the upper node allocated to the channel of the upper layer, the lower channel is not created in the lower layer. Therefore, the lower correspondent node is also not assigned. A method of expressing the arbitrary channel configuration information by the split identifier and the undivided identifier will be described with reference to FIGS. 2 to 3. 2 to 3 illustrate not only the above-described band division but also channel division, and thus are used.

먼저 도 2를 설명하면 다음과 같다.First, FIG. 2 will be described.

최상위 계층인 레이어 1(layer 1)은 제1-1채널(310), 제1-2채널(311), 제1-3채널(312), 제1-4채널(313), 제1-5채널(314), 제1-6채널(315)로 구성된다.Layer 1, which is the highest layer, includes the first-first channel 310, the first-second channel 311, the first-third channel 312, the first-fourth channel 313, and the first- fifth channel. Channel 314, the first to sixth channel (315).

상기 제1-1채널(310) 내지 제1-6채널(315)는 상술한 고정 다채널이 될 수 있다.The first-first channel 310 to the first- sixth channel 315 may be the fixed multi-channels described above.

본 실시 예에서 분할 식별자는 1로, 미분할 식별자는 0으로 표현한다. 도시된 방법에 의해 임의 채널구성 정보를 표현하는 방법은 레이어 1의 채널(310, 311, 312, 313, 314, 315)에 할당된 노드에 표현된 0 또는 1을 순차적으로 표현하고, 레이어 2의 채널(320, 321, 322, 323)에 할당된 노드에 표현된 0 또는 1을 순차적으로 표현한다. 그리고 레이어 3의 채널(330, 331, 332, 333)에 할당된 노드에 표현된 0 또는 1을 순차적으로 표현하고, 레이어 4의 채널(340, 341)에 할당된 노드에 표현된 0 또는 1을 순차적으로 표현한다. 즉, 상기 상위 계층의 노드에서 채널 수 증가여부를 순차적으로 표현한 후, 하위 계층의 노드에서 채널 수 증가여부를 순차 적으로 표현하는 방법이다.In the present embodiment, the partition identifier is represented by 1, and the unpartitioned identifier is represented by 0. The method of expressing arbitrary channel configuration information by the illustrated method sequentially expresses 0 or 1 expressed in nodes allocated to the channels 310, 311, 312, 313, 314, and 315 of the layer 1, and the layer 2 of the layer 2. 0 or 1 expressed in nodes allocated to the channels 320, 321, 322, and 323 are sequentially expressed. In addition, 0 or 1 expressed in nodes allocated to the channels 330, 331, 332, and 333 of the layer 3 are sequentially expressed, and 0 or 1 expressed in the nodes allocated to the channels 340 and 341 of the layer 4 are sequentially represented. Express them sequentially. That is, after sequentially expressing whether the number of channels is increased in the nodes of the upper layer, the method of sequentially expressing whether the number of channels is increased in the nodes of the lower layer.

따라서 상기 방법에 의한 임의 채널구성 정보는 '1100001100100000'의 16비트로 표현된다. 도 2와 같이 임의 채널구성 정보를 표현하는 방식을 편의상 '계층우선 방식'이라 부르기로 한다.Therefore, the arbitrary channel configuration information by the above method is represented by 16 bits of '1100001100100000'. A method of expressing arbitrary channel configuration information as shown in FIG. 2 will be referred to as a "layer priority method" for convenience.

도 3에 도시된 상기 임의 채널구성 정보의 표현방법은 상기 상위 계층의 제 1 노드에서 시그널링한 결과, 상기 제 1 노드가 1로 표현된 경우는 상기 제 1 노드의 하위 대응 노드를 순차적으로 채널 수 증가여부를 표현하고 하고, 상기 제 1 노드가 미분할 식별자로 표현된 경우는 상기 상위 계층의 제 2 노드로 이동하여 채널 수 증가여부를 순차적으로 표현한다. 따라서 상기 방법에 의한 임의 채널구성 정보는 '1110001001000000'의 16비트로 표현된다. 도 3과 같이 임의 채널구성 정보를 표현하는 방식을 편의상 '가지우선 방식'이라 부르기로 한다.In the method of expressing the arbitrary channel configuration information illustrated in FIG. 3, when the first node is represented by 1 as a result of signaling by the first node of the upper layer, the number of sub-corresponding nodes of the first node is sequentially numbered. When the first node is represented by an undivided identifier, the first node is moved to the second node of the upper layer to sequentially indicate whether the number of channels is increased. Therefore, the arbitrary channel configuration information by the above method is represented by 16 bits of '1110001001000000'. A method of expressing arbitrary channel configuration information as shown in FIG. 3 will be referred to as a "branch-first method" for convenience.

다음으로 고정출력 채널과 임의출력 채널을 생성하는 방법에 대해 살펴보기로 하겠다.Next, we will look at how to create a fixed output channel and a random output channel.

도 4는 본 발명의 일실시예를 이용하여 다채널 신호를 생성하는 방법에 대한 개념도이다.4 is a conceptual diagram of a method of generating a multi-channel signal using an embodiment of the present invention.

도시된 바와 같이, 다운믹스 신호(x)와 기본 매트릭스(ml)의 연산을 통해 임의출력 채널(y)를 생성하고, 고정출력 채널(y)과 포스트 매트릭스(m2)의 연산을 통해 임의출력 채널(z)을 생성한다. 기본 매트릭스(ml)는 2개 이상으로 구성될 수도 있다. 기본 매트릭스(m2)의 구성요소는 CLD, ICC, CPC 중 적어도 하나와 상술한 고정 채널구성 정보를 이용하여 유도될 수 있다.As shown, a random output channel (y) is generated through the operation of the downmix signal (x) and the base matrix (ml), and a random output channel through the operation of the fixed output channel (y) and the post matrix (m2). (z) is generated. The base matrix ml may be composed of two or more. The element of the base matrix m2 may be derived using at least one of CLD, ICC, and CPC and the fixed channel configuration information described above.

그리고 포스트 매트릭스(m2)의 구성요소는 CLD와 상기 임의 채널구성 정보를 이용하여 유도될 수 있다.The component of the post matrix m2 may be derived using the CLD and the arbitrary channel configuration information.

이하, 임의출력 채널을 생성하는 방법에 대해 좀 더 상세히 살펴보기로 한다.Hereinafter, a method of generating a random output channel will be described in more detail.

먼저, 임의 채널구성 정보를 이용하여 임의 채널구성을 하는 방법을 살펴본다.First, a method of configuring a random channel using random channel configuration information will be described.

임의 채널구성 정보가 가지우선 방식에 의해 표현된 경우를 예로 설명하기로 하겠다.The case where the arbitrary channel configuration information is expressed by the branch-first method will be described as an example.

상기 임의 채널구성 정보의 구성요소인 분할 식별자 또는 미분할 식별자를 순차적으로 인식하여 상기 식별자에 따른 신호처리를 한다. 상기 식별자가 분할 식별자인 경우는 하나의 입력채널이 채널 변환모듈과 연결되어 두 개의 하위채널을 생성한다. 한편, 상기 식별자가 미분할 식별자인 경우는 상기 입력채널이 그대로 상기 임의출력 채널이 된다.Signal processing according to the identifier is sequentially performed by recognizing a partition identifier or an undivided identifier which are components of the arbitrary channel configuration information. When the identifier is a partition identifier, one input channel is connected to the channel conversion module to generate two subchannels. On the other hand, when the identifier is an undivided identifier, the input channel becomes the random output channel as it is.

좀 더 구체적으로 설명하면 다음과 같다.More specifically, it is as follows.

먼저, 디코딩해야할 식별자 개수의 초기 값을 1로 하고, 상기 임의출력 채널의 수의 초기값을 0으로 하며, 상기 채널 변환모듈의 개수의 초기값은 0으로 하여 세팅한다(제1단계).First, an initial value of the number of identifiers to be decoded is set to 1, an initial value of the number of random output channels is set to 0, and an initial value of the number of channel conversion modules is set to 0 (first step).

디코딩해야 할 식별자를 인식한다(제2단계).Recognize the identifier to be decoded (step 2).

그리고 상기 인식된할 식별자가 분할 식별자인 경우 상기 채널 변환모듈의 개수 및 상기 인식해야할 식별자의 개수를 1 증가시킨다.If the identifier to be recognized is a partition identifier, the number of the channel conversion module and the number of identifiers to be recognized are increased by one.

만약, 상기 인식된 식별자가 미분할 식별자인 경우 상기 임의출력 채널의 개수를 1 증가시키며, 상기 인식해야할 식별자의 개수를 1 감소시킨다(제3단계).If the recognized identifier is an undivided identifier, the number of random output channels is increased by one, and the number of identifiers to be recognized is decreased by one (step 3).

그리고 디코딩해야 할 식별자의 개수가 0이 될 때가지 제2단계 및 제3단계를 반복 수행한다.The second and third steps are repeatedly performed until the number of identifiers to be decoded becomes zero.

그리고 상술한 신호처리 방법은 고정출력 채널 수에 대응되게 반복 수행한다.The signal processing method described above is repeatedly performed corresponding to the number of fixed output channels.

예를 들어, 임의 채널구성 정보가 '11100010010000'일 때의 상기 임의 채널구성에 대한 개략도가 도 3에 도시되어 있다. 여기서 1은 분할 식별자를 의미하고, 0은 미분할 식별자를 의미한다. 또한, '1'의 개수는 채널변환 모듈(도 3에는 신호 변환모듈을 의미함)의 개수를 말하고, 0은 상기 임의출력 채널의 개수를 의미한다.For example, a schematic diagram of the arbitrary channel configuration when the arbitrary channel configuration information is '11100010010000' is shown in FIG. 3. Here, 1 means a partition identifier, and 0 means an undivided identifier. In addition, the number of '1' refers to the number of channel conversion module (signal conversion module in Fig. 3), 0 means the number of the arbitrary output channel.

한편, 고정출력 채널은 그 순서를 재배치(re-mapping)한 후, 임의출력 채널을 생성할 수도 있는데, 도 5가 이를 표현하고 있다.Meanwhile, the fixed output channel may generate an arbitrary output channel after re-mapping the order, which is represented by FIG. 5.

도시된 바와 같이, 고정출력 채널(310, 311, 312, 313, 314, 315)은 리매핑 모듈(100)을 거쳐 재배치된다. 그리고 재배치된 고정출력 채널(310',311',312',313',314',315')이 최상의 계층의 채널로 되어 상기 임의출력 채널을 생성한다. 물론 임의출력 채널의 순서를 재배치할 수도 있다.As shown, the fixed output channels 310, 311, 312, 313, 314, 315 are relocated via the remapping module 100. The rearranged fixed output channels 310 ', 311', 312 ', 313', 314 ', and 315' become the channels of the highest layer to generate the random output channel. Of course, you can rearrange the order of the arbitrary output channels.

한편, 상기 임의 채널구성 정보 내에 채널을 스피커와 매칭시키는 채널매핑 정보가 포함된 경우는 상기 임의출력 채널을 스피커와 매핑시킬 수도 있다.Meanwhile, when channel mapping information for matching a channel with a speaker is included in the arbitrary channel configuration information, the arbitrary output channel may be mapped with the speaker.

이상은 계층의 깊이정보는 별도로 표현하지 않고, 분할 식별자와 미분할 식별자로 표현되는 임의채널 구성정보로 계층의 깊이를 파악 가능한 경우를 설명한 것이다.The foregoing has described a case where the depth of a layer can be grasped by arbitrary channel configuration information represented by a split identifier and an undivided identifier, without separately expressing the depth information of the layer.

그러나 계층의 깊이 정보를 별도로 표현하는 임의채널 구성정보의 표현도 가능하다.However, it is also possible to express arbitrary channel configuration information expressing the depth information of the layer separately.

예를 들어 상기 계층의 깊이 정보는 분할 종료 식별자와 분할 계속 식별자로 표현한다. 상기 분할 종료 식별자라 함은 더 이상 채널 수 증가가 이루어지지 않는 최하위 계층을 의미하고, 분할 계속 식별자라 함은 최하위 계층이 아닌 계층을 의미한다. 이 경우에도 분할 계속 식별자는 '1'로, 분할 종료 식별자는 '0'으로 표현할 수 있다. 도 3에 도시된 계층의 깊이는 4이고, 이를 상기와 같은 방식으로 표현하면 1110이 된다. 이와 같이, 깊이 정보를 별도로 표현하는 경우는 최하위 계층에 할당된 노드에서는 결국 미분할 식별자만이 표현되기 때문에 최하위 계층의 이전 계층까지만 시그널링을 할 수 있다. 그 일례로 분할 식별자를 '1'로, 미분할 식별자를 '0'으로 표현하고, 분할 계속 식별자를 '1'로, 분할 종료 식별자를 '0'으로 표현한 경우는 최하위 계층에 할당된 노드의 분할여부 표현값이 분할 종료를 의미하는 '0'으로 대표될 수 있다.For example, the depth information of the layer is represented by a split end identifier and a split duration identifier. The split end identifier refers to the lowest layer where the number of channels is not increased any more, and the split continue identifier refers to a layer that is not the lowest layer. Even in this case, the segmentation continued identifier may be expressed as '1' and the segmentation end identifier may be represented as '0'. The depth of the hierarchy illustrated in FIG. 3 is 4, which is 1110 when expressed in the above manner. As such, when the depth information is separately expressed, only the undivided identifier is finally expressed in the node allocated to the lowest layer, so that only the previous layer of the lowest layer may be signaled. For example, when the partition identifier is represented by '1', the undivided identifier is represented by '0', the partition segmentation identifier is represented by '1', and the partition end identifier is represented by '0', the partition of the node allocated to the lowest layer is divided. Whether the expression value may be represented by '0' which means the end of the division.

만약, 그렇게 표현된다 하더라도 상기 깊이 정보를 이용하여 최하위 계층을 알 수 있고, 생략된 0은 있는 것으로 간주하여 상기 임의출력 채널을 구성할 수 있다.If it is expressed as such, the lowest layer may be known using the depth information, and the random output channel may be configured by considering the omitted zero.

한편, 임의채널 구성정보가 디코더에 전송되더라도 디코더는 이를 이용하지 않을 수도 있다. 이는 임의채널 구성정보와 이에 대응되는 임의채널 구성 데이터의 크기를 디코더에서 인식하되, 그 크기만큼을 스킵하고 디코딩하는 경우이다.Meanwhile, even if arbitrary channel configuration information is transmitted to the decoder, the decoder may not use it. This is a case where the decoder recognizes the size of the arbitrary channel configuration information and the corresponding arbitrary channel configuration data, but skips and decodes the corresponding size.

본 발명은 상술한 실시예에 한정되지 않으며, 첨부된 청구범위에서 알 수 있는 바와 같이 본 발명이 속한 분야의 통상의 지식을 가진 자에 의해 변형이 가능하고 이러한 변형은 본 발명의 범위에 속한다.The present invention is not limited to the above-described embodiments, and as can be seen in the appended claims, modifications can be made by those skilled in the art to which the invention pertains, and such modifications are within the scope of the present invention.

이상에서 설명한 바와 같이, 본 발명에 의한 분할 정보의 시그널링 방법은 첫째, 특정 길이를 가지는 장 블록(long block)으로부터 서로 다른 복수 개의 길이를 가지는 단 블록(short block)으로 세분화할 때, 계층적인 구조를 갖는 블록 분할(block splitting) 과정에 대한 정보를 최소의 비트를 사용하여 시그널링하는 것이 가능하다.As described above, the signaling method of the partitioning information according to the present invention includes, first, a hierarchical structure when subdividing a long block having a specific length from a short block having a plurality of different lengths. It is possible to signal information on a block splitting process with a minimum bit.

둘째, 신호의 시그널링에 사용된 비트 수에 대한 정보를 별도로 전송할 필요없이, 시그널링 신호 자체만으로 분할이 수행된 계층의 깊이와 시그널링 신호의 끝을 파악하는 것이 가능하다.Second, it is possible to grasp the depth of the layer on which the division is performed and the end of the signaling signal only by the signaling signal itself, without having to separately transmit information on the number of bits used for signaling of the signal.

셋째, 복수 개로 구성된 서브밴드로부터 서로 다른 크기(예를 들어, 주파수 폭)를 갖는 임의 개수의 복수 개 서브밴드로의 세분화 전개 과정을 최소의 비트를 사용하여 시그널링하는 것이 가능하다.Third, it is possible to signal the segmentation development process from a plurality of subbands to any number of subbands having different sizes (for example, frequency widths) using a minimum number of bits.

넷째, 입력채널보다 많은 수를 갖는 출력채널로의 업믹스(up-mix) 과정에 대해 그 진행 과정의 정보를 최소의 비트를 사용하여 시그널링하는 것이 가능하다.Fourth, it is possible to signal the information of the process using the minimum bits for the up-mix process to the output channel having a larger number than the input channel.

Claims

Including fixed channel configuration information which is configuration information of a preset output channel; And

And including the arbitrary channel configuration information.

delete

Receiving an encoded audio signal including fixed channel configuration information and arbitrary channel configuration information; And

And configuring an output channel by using the fixed channel configuration information and the arbitrary channel configuration information.

The method of claim 8, wherein the random channel configuration information

Whether to increase the number of channels in the node of the layer is expressed as a split identifier and an undivided identifier,

When the upper node of the upper layer is represented by the partition identifier, the lower corresponding node is assigned to correspond to the number divided in the lower layer.

And a lower correspondent node is not assigned to the lower layer when the upper node of the upper layer is represented by the undivided identifier.

The method of claim 9, wherein the random channel configuration information

And a channel mapping information for mapping an arbitrary output channel to a location of a speaker.

9. The method of claim 8, wherein configuring the output channel

Generating a fixed output channel corresponding to a preset output channel using the fixed channel configuration information; And

And generating a random output channel using the fixed output channel and the arbitrary channel configuration information.

The method of claim 11,

Generating the random output channel,

Signal processing according to the identifier by sequentially recognizing a partition identifier or an undivided identifier that is a component of the arbitrary channel configuration information,

If the identifier is a partition identifier, one input channel is connected to the channel conversion module to generate two lower corresponding channels.

And if the identifier is an undivided identifier, the input channel becomes the random output as it is.

12. The method of claim 11, wherein generating the random output channel

A first step of setting an initial value of the number of identifiers, an initial value of the number of random outputs, and an initial value of the number of channel conversion modules;

A second step of recognizing the identifier;

If the recognized identifier is a split identifier, the number of identifiers and the channel conversion module are increased in preset increments.

If the recognized identifier is an undivided identifier, a third step of increasing the number of random output channels by a preset increment and decreasing the number of identifiers by the preset increment;

And repeating the second and third steps until the number of identifiers becomes zero.

The method of claim 13, wherein generating the random output channel

And mapping the arbitrary output channel to a speaker according to the channel mapping information.

delete

An audio signal receiver configured to receive an encoded audio signal including fixed channel configuration information and arbitrary channel configuration information; And

And a channel configuration unit configured to configure an output channel by using the fixed channel configuration information and the arbitrary channel configuration information.

The method of claim 16,

The random channel configuration information

Whether the number of channels increases in the node of the hierarchy is expressed by the partition identifier and the undivided identifier. When the upper node of the upper layer is represented by the partition identifier, the lower correspondent node is allocated to correspond to the divided number in the lower layer.

18. The method of claim 17, wherein the random channel configuration information

And channel mapping information for mapping an arbitrary output channel to a location of a speaker.

17. The apparatus of claim 16, wherein the channel component is

Generating a fixed output channel corresponding to a preset output channel by using the fixed channel configuration information;

And an arbitrary output channel is generated using the fixed output channel and the arbitrary channel configuration information.

The method of claim 19,

The channel configuration unit,

And the input channel is the random output as is if the identifier is an undivided identifier.

The method of claim 19,

The channel configuration unit,

And the random output channel is mapped to a speaker according to the channel mapping information.