KR20070003594A

KR20070003594A - Method of clipping sound restoration for multi-channel audio signal

Info

Publication number: KR20070003594A
Application number: KR1020060058140A
Authority: KR
Inventors: 방희석; 오현오; 김동수; 임재현
Original assignee: 엘지전자 주식회사
Priority date: 2005-06-30
Filing date: 2006-06-27
Publication date: 2007-01-05
Also published as: CN101243490A; CN101297352B; CN101243490B; KR20070003593A; CN101243491A; CN101243491B; CN101297352A

Abstract

An apparatus and a method for decoding an audio signal, and an apparatus and a method for encoding the audio signal are provided to effectively prevent the clipping problem occurring while downmixing a multi-channel audio signal by determining whether clipping is prevented or performed after testing the occurrence of clipping and the degree of clipping. For decoding an audio signal, a downmix audio signal is separated from a bit stream of the audio signal(602). After decoding the downmix audio signal, the occurrence of clipping and the degree of clipping are tested during a decoding process(605). A clipped portion of the downmix audio signal is restored(606). The restoring of the clipped portion is capable of being performed through soft clipping. The clipped portion is capable of being restored by using a filter.

Description

Restoration of Clipped Signals in Multichannel Audio Signals {METHOD OF CLIPPING SOUND RESTORATION FOR MULTI-CHANNEL AUDIO SIGNAL}

도 1은 본 발명에서의 오디오 신호에 대한 공간 정보를 인간이 인식하는 방법을 나타내는 도면.BRIEF DESCRIPTION OF THE DRAWINGS Fig. 1 is a diagram illustrating a method for a human to recognize spatial information about an audio signal in the present invention.

도 2는 클리핑 발생과정을 나타내는 도면.2 is a diagram illustrating a clipping process.

도 3은 본 발명에 따른 다운믹스 오디오 신호의 디코딩과정 중 발생한 클리핑된 부분을 복원하는 디코딩 방법에 대한 도면.3 is a diagram of a decoding method for restoring a clipped portion generated during a decoding process of a downmix audio signal according to the present invention;

도 4는 본 발명에 따른 멀티채널 오디오 신호의 디코딩과정 중 발생한 클리핑된 부분을 복원하는 디코딩 방법에 대한 도면.4 is a diagram illustrating a decoding method for recovering a clipped portion generated during a decoding process of a multichannel audio signal according to the present invention.

도 5는 본 발명에 따른 멀티채널 오디오 신호의 다운믹스과정 중 클리핑된 부분을 복원하는 것에 관한 인코딩 방법에 대한 도면.5 is a diagram illustrating an encoding method for restoring a clipped portion of a downmix process of a multichannel audio signal according to the present invention.

도 6은 본 발명에 따른 다운믹스 오디오 신호의 디코딩과정 중 발생한 클리핑된 부분을 복원하는 디코딩 방법에 대한 흐름도.6 is a flowchart illustrating a decoding method for recovering a clipped portion generated during a decoding process of a downmix audio signal according to the present invention.

도 7은 본 발명에 따른 멀티채널 오디오 신호의 디코딩과정 중 발생한 클리핑된 부분을 복원하는 디코딩 방법에 대한 흐름도.7 is a flowchart illustrating a decoding method for reconstructing a clipped portion generated during a decoding process of a multichannel audio signal according to the present invention.

도 8은 본 발명에 따른 멀티채널 오디오 신호의 다운믹스과정 중 클리핑된 부분을 복원하는 것에 관한 인코딩 방법에 대한 흐름도.8 is a flowchart illustrating an encoding method for restoring a clipped portion of a downmix process of a multichannel audio signal according to the present invention.

*도면의 주요부분에 대한 부호의 설명* Explanation of symbols for main parts of the drawings

101.원거리 음원 102.직접적인 음파101.Remote sound source 102.Direct sound wave

104.반사된 음파 301.비트스트림104.Reflected sound waves 301.Bitstream

302.비트스트림디코더 303.다운믹스 오디오 신호302 Bitstream decoder 303 Downmix audio signal

304.공간 정보 신호 305.코어 코덱 디코더304 Spatial information signal 305 Core codec decoder

307.클리핑탐지부 308.클리핑처리부307.Clipping Detector 308.Clipping Process

309.멀티채널생성부 311.멀티채널 오디오 신호309.Multichannel generator 311.Multichannel audio signal

502.다운믹스부 503.공간정보 발생부502. Downmix unit 503. Spatial information generator

504.클리핑탐지부 505.클리핑방지판단부504. Clipping detection unit 505. Clipping prevention unit

506.비트스트림 포맷터506.Bitstream Formatter

본 발명은 멀티채널 오디오 신호의 공간 정보에 대한 부호-복호화(encoding-decoding)방법에 관한 것으로서, 더욱 상세하게는 다운믹스 오디오 신호 또는 멀티채널 오디오 신호에 대한 클리핑복원방법을 갖는 멀티채널 오디오 신호의 부호화-복호화 방법에 대한 것이다.The present invention relates to an encoding-decoding method for spatial information of a multichannel audio signal, and more particularly to a multichannel audio signal having a clipping recovery method for a downmix audio signal or a multichannel audio signal. The present invention relates to an encoding-decoding method.

최근에 디지털 오디오 신호에 대한 다양한 코딩기술 및 방법들이 개발되고 있으며, 이와 관련된 제품들이 생산되고 있다. 또한 심리음향 모델(Psychoacoustic model)을 이용한 멀티채널 오디오 신호(multi-channel audio signal)의 코딩방법들 이 개발되고 있으며, 이에 대한 표준화 작업이 진행되고 있다. 상기 심리음향 모델은 인간이 소리를 인식하는 방식, 예를 들면 큰 소리 다음에 오는 작은 소리는 들리지 않으며, 20Hz 내지 20000Hz의 주파수에 해당되는 소리만 들을 수 있다는 사실을 이용하여, 코딩과정에서 불필요한 부분에 대한 오디오 신호를 제거함으로써 필요한 데이터의 양을 효과적으로 줄일 수 있는 것이다.Recently, various coding techniques and methods for digital audio signals have been developed, and related products have been produced. In addition, coding methods for multi-channel audio signals have been developed using a psychoacoustic model, and standardization thereof has been conducted. The psychoacoustic model is an unnecessary part of the coding process by using a method of recognizing a sound, for example, a small sound following a loud sound, and only a sound corresponding to a frequency of 20 Hz to 20000 Hz. By eliminating the audio signal for, the amount of data needed can be effectively reduced.

현재 MPEG-1 오디오(MEPG-1 레이어 Ⅲ), MPEG-4 AAC(Advanced Audio Coding) 및 MPEG-4 HE-AAC(High-Efficiency AAC)와 같은 오디오 표준 기술이 개발되어 상용화되고 있다. 또한 공간 정보를 이용하는 멀티채널 오디오 신호의 코딩방법이 개발되고 있다. 상기 멀티채널 오디오 신호의 코딩방법은 압축된 오디오 신호(예를 들면, 모노 또는 스테레오 오디오 신호) 및 낮은 비트-레이트의 부가정보(low-rate side information)(예를 들면, 공간 정보) 채널을 이용하여 멀티채널 오디오 신호의 전송 효율을 매우 효과적으로 향상시키는 것이다.Currently, audio standard technologies such as MPEG-1 Audio (MEPG-1 Layer III), MPEG-4 Advanced Audio Coding (AAC), and MPEG-4 High-Efficiency AAC (HE-AAC) have been developed and commercialized. In addition, a method of coding a multichannel audio signal using spatial information has been developed. The multi-channel audio signal coding method uses a compressed audio signal (e.g., mono or stereo audio signal) and a low bit-rate side information (e.g., spatial information) channel. Therefore, the transmission efficiency of the multichannel audio signal is greatly improved.

그러나, 상기 멀티채널 오디오 신호의 비트스트림을 구성하는데 있어서, 종래에는 멀티채널을 모노 또는 스테레오 오디오 신호로 다운믹스하면 클리핑(Clipping) 문제가 발생하였었다. 특히 부호화된 신호는 16비트 등으로 크기가 제한되어야하므로, 상기 부호화된 신호는 코어 코덱 인코딩 이후에도 클리핑이 지속된다. 또한, 상기 다운믹스 오디오 신호 또는 멀티채널 오디오 신호의 디코딩과정에서도 클리핑 문제가 발생하였었다. 상기 클리핑은 오디오 신호의 출력에도 영향을 주며, 음질 저하의 원인이 되었었다. However, in configuring the bitstream of the multichannel audio signal, a conventional clipping problem occurs when downmixing the multichannel to a mono or stereo audio signal. In particular, since the coded signal should be limited in size to 16 bits or the like, the coded signal continues clipping even after core codec encoding. In addition, a clipping problem occurs in the decoding process of the downmix audio signal or the multichannel audio signal. The clipping also affects the output of the audio signal, and has been a cause of sound quality degradation.

따라서 상기와 같은 문제점을 해결하기 위해 제안된 본 발명은, 멀티채널 오디오 신호를 복호화하는데 있어서, 다운믹스 오디오 신호의 디코딩과정 중 발생한 클리핑된 부분을 복원하거나, 멀티채널 오디오 신호의 디코딩과정 중 발생한 클리핑된 부분을 복원함으로써, 멀티채널 오디오 신호에서 일어나는 클리핑 문제를 해결하는 방법 및 장치를 제공하는데 그 목적이 있다. 또한, 본 발명은 멀티채널 오디오 신호의 다운믹스과정 중 발생한 클리핑된 부분을 복원하는 방법 및 장치를 제공하는데 그 목적이 있다. Accordingly, the present invention proposed to solve the above problems, in decoding the multi-channel audio signal, to restore the clipped portion generated during the decoding process of the downmix audio signal, or clipping generated during the decoding process of the multi-channel audio signal It is an object of the present invention to provide a method and apparatus for solving a clipping problem occurring in a multichannel audio signal by reconstructing a portion of an image. Another object of the present invention is to provide a method and apparatus for recovering a clipped portion generated during a downmixing process of a multichannel audio signal.

상기의 목적을 달성하기 위하여, 본 발명은 오디오 신호의 비트스트림으로부터 다운믹스 오디오 신호를 분리하는 단계와; 상기 다운믹스 오디오 신호를 디코딩한 후에, 상기 디코딩 과정에서의 클리핑 발생 여부 및 클리핑 정도를 검사하는 단계;를 포함하는 것을 특징으로 하는 오디오 신호의 디코딩 방법을 제공한다.In order to achieve the above object, the present invention comprises the steps of: separating the downmix audio signal from the bitstream of the audio signal; And after the decoding of the downmix audio signal, checking whether clipping occurs and the degree of clipping during the decoding process.

또한, 상기의 목적을 달성하기 위하여, 본 발명은 비트스트림으로부터 다운믹스 오디오 신호 및 공간 정보 신호를 분리하는 단계와; 상기 공간 정보 신호를 이용하여 상기 다운믹스 오디오 신호를 멀티채널 신호로 변환하는 단계와; 상기 변환과정에서의 클리핑 발생 여부 및 클리핑 정도를 검사하는 단계;를 포함하는 것을 특징으로 하는 오디오 신호의 디코딩 방법을 제공한다.In addition, to achieve the above object, the present invention comprises the steps of: separating the downmix audio signal and the spatial information signal from the bitstream; Converting the downmix audio signal into a multichannel signal using the spatial information signal; And a step of checking whether clipping occurs and the degree of clipping in the conversion process.

또한, 상기의 목적을 달성하기 위하여, 본 발명은 상기 오디오 신호를 다운믹스하여 다운믹스 오디오 신호를 생성하는 단계와; 상기 다운믹스 과정 중 생성되는 상기 다운믹스 오디오 신호의 클리핑을 방지할지 또는 클리핑을 감수할지 판단 하는 단계;를 포함하는 것을 특징으로 하는 오디오 신호의 인코딩 방법을 제공한다.In addition, to achieve the above object, the present invention comprises the steps of downmixing the audio signal to generate a downmix audio signal; And determining whether to prevent clipping or to take the clipping of the downmix audio signal generated during the downmixing process.

또한, 상기의 목적을 달성하기 위하여, 본 발명은 비트스트림으로부터 다운믹스 오디오 신호 분리하는 비트스트림디코더; 상기 다운믹스 오디오 신호를 디코딩한 후에, 상기 디코딩 과정에서 클리핑이 일어났는지를 검사하는 클리핑탐지부; 및 상기 다운믹스 오디오 신호 중 클리핑된 부분을 복원하는 클리핑처리부;를 포함하는 것을 특징으로 하는 오디오 신호의 디코딩 장치를 제공한다.In addition, in order to achieve the above object, the present invention is a bitstream decoder for separating the downmix audio signal from the bitstream; A clipping detector to check whether clipping has occurred in the decoding process after decoding the downmix audio signal; And a clipping processor for restoring a clipped portion of the downmix audio signal.

또한, 상기의 목적을 달성하기 위하여, 본 발명은 비트스트림으로부터 다운믹스 오디오 신호 및 공간 정보 신호를 분리하는 비트스트림디코더; 상기 공간 정보 신호를 이용하여 상기 다운믹스 오디오 신호를 멀티채널 오디오 신호로 변환하는 멀티채널생성부; 상기 변환과정에서 클리핑이 일어났는지를 검사하는 클리핑탐지부; 및 상기 멀티채널 오디오 신호 중 클리핑된 부분을 복원하는 클리핑처리부;를 포함하는 것을 특징으로 하는 오디오 신호의 디코딩 장치를 제공한다.In addition, in order to achieve the above object, the present invention is a bitstream decoder for separating the downmix audio signal and spatial information signal from the bitstream; A multichannel generator converting the downmix audio signal into a multichannel audio signal using the spatial information signal; A clipping detector to check whether clipping has occurred in the conversion process; And a clipping processing unit for restoring a clipped portion of the multichannel audio signal.

또한, 상기의 목적을 달성하기 위하여, 본 발명은 오디오 신호를 다운믹스하여 다운믹스 오디오 신호를 생성하는 다운믹스부; 상기 다운믹스 오디오 신호의 클리핑 빈도 및 클리핑 정도를 판단하는 클리핑탐지부; 및 상기 다운믹스 오디오 신호의 클리핑을 방지할지 또는 클리핑을 감수할지 판단하는 클리핑방지판단부;를 포함하는 것을 특징으로 하는 오디오 신호의 인코딩 장치를 제공한다.In addition, to achieve the above object, the present invention provides a downmix unit for downmixing an audio signal to generate a downmix audio signal; A clipping detector to determine a clipping frequency and a clipping degree of the downmix audio signal; And an anti-clipping decision unit to determine whether to prevent clipping of the downmix audio signal or to take the clipping.

이하 상기의 목적을 구체적으로 실현할 수 있는 본 발명의 바람직한 실시예를 첨부한 도면을 참조하여 설명한다.Hereinafter, with reference to the accompanying drawings, preferred embodiments of the present invention that can specifically realize the above object will be described.

도 1 은 본 발명에서의 오디오 신호에 대한 공간 정보를 인간이 인식하는 방법을 도시한다. 멀티채널 오디오 신호에 대한 코딩방법은 인간이 오디오 신호를 3차원적 공간으로 인지한다는 사실을 바탕으로, 복수의 파라미터 세트(parameter sets)를 통하여 상기 오디오 신호를 3차원적 공간 정보로 표현할 수 있다는 것을 이용한다. 멀티채널 오디오 신호의 공간 정보를 표시하기 위한 "공간 파라미터"라고 불리는 상기 파라미터에는 ICLD(Inter Channel level differences), ICC(Inter Channel Coherences) 및 ICTD(Inter Channel Time Difference)등이 있다. 상기 ICLD는 두 채널간의 에너지 차이를 의미하고, 상기 ICC는 두 채널 간의 상관관계(correlation)를 의미하며, ICTD는 두 채널간의 시간 차이를 의미한다.1 shows a method for a human to recognize spatial information about an audio signal in the present invention. The coding method for a multichannel audio signal is based on the fact that a human perceives the audio signal as a three-dimensional space. I use it. Such parameters, called "spatial parameters" for indicating spatial information of a multichannel audio signal, include ICLD (Inter Channel level differences), ICC (Inter Channel Coherences), ICTD (Inter Channel Time Difference), and the like. The ICLD means an energy difference between two channels, the ICC means a correlation between two channels, and the ICTD means a time difference between two channels.

인간이 오디오 신호를 어떻게 공간적으로 인식하며, 상기 공간 파라미터의 개념이 어떻게 생성되는지가 도 1에 도시된다. 원거리에 있는 음원(105)으로부터의 직접적인 음파(direct sound wave)(103)가 인간의 왼쪽 귀(107)에 도달하고, 또 다른 직접적인 음파(102)는 머리 주위에서 회절되어 오른쪽 귀(106)에 도달하게 된다. 상기 두 음파(102 및 103)는 도달시간 및 에너지 레벨에서 차이를 보이게 되며, 이와 같은 차이가 상기 CLD, CPC 및 CTD 파라미터를 생성하게 된다.How a human perceives an audio signal spatially and how the concept of the spatial parameter is generated is shown in FIG. 1. Direct sound wave 103 from the remote source 105 arrives at the human left ear 107, and another direct sound wave 102 is diffracted around the head to the right ear 106. Will be reached. The two sound waves 102 and 103 show a difference in arrival time and energy level, and this difference generates the CLD, CPC and CTD parameters.

또한 만일 반사된 음파(104 및 105)가 양 귀에 도달되거나, 또는 상기 음원(105)이 분산되어 있다면, 서로 상관관계가 없는 음파가 양 귀에 도달될 것이고, 이것이 상기 ICC 파라미터를 생성하게 된다. 상기와 같이 원리로 생성된 공간 파라미터들은 멀티채널 오디오 신호를 모노 또는 스테레오 신호로 전송한 후 다시 멀티채널로 출력하는데 있어서, 강력한 비트 수 감소를 가능하게 한다는 것이 알려져 있다. 본 발명은 상기 공간 정보를 이용하는 멀티채널 오디오 신호에 있어서, 멀티채널을 다운믹스하여 코딩하는 과정, 다운믹스 오디오 신호를 디코딩하는 과정 또는 멀티채널 오디오 신호를 디코딩하는 과정에서 발생하는 클리핑(Clipping) 현상을 방지하기 위한 방법을 제시한다.Also, if the reflected sound waves 104 and 105 reach both ears, or if the sound source 105 is dispersed, sound waves that do not correlate with each other will reach both ears, which will generate the ICC parameter. Spatial parameters generated on the principle as described above are known to enable a strong number of bits in transmitting a multichannel audio signal as a mono or stereo signal and then outputting the multichannel audio signal back to the multichannel. According to an embodiment of the present invention, a clipping phenomenon occurs during downmixing and coding of multichannels, decoding downmixed audio signals, or decoding multichannel audio signals in a multichannel audio signal using the spatial information. It suggests ways to prevent this.

도 2는 클리핑 발생과정을 도시한다. 클리핑은 주로 두 가지 원인으로 발생한다. 첫 번째는 원래 신호(original signal)의 음량(sound level)이 높은 경우에 발생한다. 두 번째는 다운믹스 과정 중에 입력 채널(input channel)의 수가 많은 경우에 발생한다. 예를 들면, 3개의 채널을 1개의 채널도 다운믹스하는 경우보다, 7개의 채널을 1개의 채널도 다운믹스하는 경우에 클리핑이 더 자주 발생한다. 도 2의 클리핑 발생과정은 5개 채널을 1개의 채널로 다운믹스하는 경우를 도시하나, 본 발명은 이 경우에만 한정되지는 않는다. 2 shows a clipping process. Clipping occurs mainly for two reasons. The first occurs when the sound level of the original signal is high. The second occurs when the number of input channels is large during the downmix process. For example, clipping occurs more often when downmixing seven channels to one channel than when three channels are downmixed. The clipping generation process of FIG. 2 illustrates a case of downmixing five channels into one channel, but the present invention is not limited thereto.

도 2의 (a)는 5개의 채널로 구성된 원래 신호의 음량을 도시한다. 각각의 채널은 제한된 크기(예를 들면, 16비트)의 거의 전 범위를 사용할 수 있다. 도 2의 (b)는 상기 5개의 채널을 다운믹스하여 생성된 다운믹스 오디오 신호를 도시한다. 도시된 것처럼, 상기 다운믹스 오디오 신호는 많은 클리핑 지점들을 가질 수 있다. 도 2의 (c)는 상기 다운믹스 오디오 신호를 코어 코덱(예를 들면, AAC 코덱)을 이용하여 인코딩/디코딩한 오디오 신호를 도시한다. 상기 코어 코덱을 이용하여 인코딩/디코딩된 오디오 신호도 제한된 크기(예를 들면, 16비트)로 표현되므로, 클리핑이 지속될 수 있다. 상기 클리핑은 멀티채널 오디오 신호의 재생부에서의 출력에도 영향을 주며, 음질 저하의 원인이 될 수 있다.2 (a) shows the volume of the original signal consisting of five channels. Each channel can use almost the entire range of limited size (eg 16 bits). 2B illustrates a downmix audio signal generated by downmixing the five channels. As shown, the downmix audio signal can have many clipping points. FIG. 2C illustrates an audio signal obtained by encoding / decoding the downmix audio signal using a core codec (eg, an AAC codec). Since the audio signal encoded / decoded using the core codec is also represented in a limited size (eg, 16 bits), clipping can be continued. The clipping also affects the output from the reproduction unit of the multi-channel audio signal and may cause sound quality degradation.

도 3은 본 발명에 따른 다운믹스 오디오 신호의 디코딩과정 중 발생한 클리핑된 부분을 복원하는 디코딩 방법을 도시한다. 도시된 것처럼, 비트스트림디코더(302)는 다운믹스 오디오 신호(303) 및 공간 정보 신호(304)를 포함하는 비트스트림(301)을 수신하고, 상기 다운믹스 오디오 신호(303)와 상기 공간 정보 신호(304)를 분리한다. 상기 다운믹스 오디오 신호(303)는 모노 또는 스테레오 오디오 신호를 포함하며, 멀티채널 오디오 신호가 될 수도 있다. 상기 다운믹스 오디오 신호(303)는 코어코덱디코더(305)에서 디코딩된다. 디코딩된 상기 다운믹스 오디오 신호는 PCM 데이터 형태(306)가 될 수 있다. 그 다음에, 클리핑탐지부(307)는 상기 다운믹스 오디오 신호의 디코딩과정에서 클리핑 발생 여부 및 클리핑 영향 정도를 검사할 수 있다. 클리핑처리부(308)는 상기 검사결과를 적절한 기준과 비교하여 클리핑된 부분에 대한 복원을 수행하지 않고 원래 신호를 그대로 출력하거나, 또는 적절한 방법으로 상기 클리핑된 부분을 복원한 후에 멀티채널 오디오 신호(311)로 출력할 수 있다. 만일, 멀티채널 오디오 신호(311)로 출력할 수 없는 경우에는 모노 또는 스테레오 다운믹스 오디오 신호(310)로 출력할 수 있다. 3 illustrates a decoding method for restoring a clipped portion generated during a decoding process of a downmix audio signal according to the present invention. As shown, the bitstream decoder 302 receives a bitstream 301 that includes a downmix audio signal 303 and a spatial information signal 304, and the downmix audio signal 303 and the spatial information signal. Separate 304. The downmix audio signal 303 includes a mono or stereo audio signal and may be a multichannel audio signal. The downmix audio signal 303 is decoded in the core codec decoder 305. The demixed downmix audio signal may be in PCM data form 306. Then, the clipping detector 307 may check whether clipping occurs and the degree of clipping influence in the decoding process of the downmix audio signal. The clipping processor 308 outputs the original signal as it is without restoring the clipped portion by comparing the inspection result with an appropriate reference, or multi-channel audio signal 311 after restoring the clipped portion in an appropriate manner. Can be printed as If the multi-channel audio signal 311 cannot be output, the mono or stereo downmix audio signal 310 may be output.

상기 복원방법으로는, 첫째, 소프트클리핑(Soft-Clipping) 방법을 이용할 수 있다. "소프트클리핑"이란 클리핑된 부분의 데이터를 잘라내지 않고, 상기 데이터 값을 작은 값으로 스케일링하여 클리핑을 방지하는 방법을 말한다. 예를 들면, 특정한 한계값(Threshold)을 정해 놓고, 오디오 신호의 데이터 값이 상기 한계값 이상이 되는 경우, 적절한 소프트클리핑함수를 적용하여 작은 값으로 스케일링하는 것이다. 이때, 스케일링된 값은 상기 한계값과 클리핑이 일어나지 않는 최대값 사 이에 해당되어야 한다. 즉, 상기 다운믹스 오디오 신호의 클리핑된 부분에 상기 소프트클리핑함수를 적용하여 클리핑된 부분을 복원할 수 있다. 둘째, 상기 복원방법으로 스무딩(smoothing) 효과를 나타내는 방법을 이용할 수 있다. 클리핑된 부분은 윗부분이 잘린 일직선 형태의 모양을 갖는데, "스무딩"이란 상기 일직선 부분을 곡선 형태로 부드럽게 해주는 것을 말한다. 상기 스무딩을 위하여 싱크(sinc)함수 또는 다항(Polynomial)함수 등의 필터를 이용할 수 있다. 셋째, 상기 복원방법으로 클리핑된 부분 주위의 시간(time) 또는 주파수(frequency) 정보를 분석하고, 이를 이용하여 클리핑된 부분을 재합성하는 방법을 이용할 수 있다. 예를 들면, 다운믹스 오디오 신호의 시간에 따른 시간포락선(Time envelope) 또는 주파수포락선(Frequency envelope) 정보의 양자화된 값을 이용하여 클리핑된 부분을 복원할 수 있다.As the restoration method, first, a soft-clipping method may be used. "Softclipping" refers to a method of preventing clipping by scaling the data value to a small value without cutting the data of the clipped portion. For example, if a specific threshold is set, and the data value of the audio signal is greater than or equal to the threshold value, it is scaled to a small value by applying an appropriate soft clipping function. In this case, the scaled value must correspond between the threshold value and the maximum value at which clipping does not occur. That is, the clipped portion may be restored by applying the soft clipping function to the clipped portion of the downmix audio signal. Secondly, a method of showing a smoothing effect may be used as the restoration method. The clipped portion has a straight shape with the top cut off, and "smoothing" refers to smoothing the straight portion in a curved form. For the smoothing, a filter such as a sink function or a polynomial function may be used. Third, the time or frequency information around the clipped portion may be analyzed by the reconstruction method, and a method of resynthesizing the clipped portion may be used. For example, the clipped portion may be reconstructed by using quantized values of time envelope or frequency envelope information of the downmix audio signal over time.

도 4는 본 발명에 따른 멀티채널 오디오 신호의 디코딩과정 중 발생한 클리핑된 부분을 복원하는 디코딩 방법을 도시한다. 도시된 것처럼, 비트스트림디코더(402)는 다운믹스 오디오 신호(403) 및 공간 정보 신호(404)를 포함하는 비트스트림(401)을 수신하고, 상기 다운믹스 오디오 신호(403)와 상기 공간 정보 신호(404)를 분리한다. 상기 다운믹스 오디오 신호(403)는 코어코덱디코더(405)에서 디코딩된다. 디코딩된 상기 다운믹스 오디오 신호는 모노 또는 스테레오 PCM 데이터 형태(406)가 될 수 있으며, 또한 멀티채널 PCM 데이터 형태도 될 수 있다. 그 다음에 멀티채널생성부(407)에서 상기 공간 정보 신호(404)을 디코딩하여 얻어진 공간 정보를 이용하여 상기 PCM 데이터 형태의 다운믹스 오디오 신호(406)를 멀티 채널 오디오 신호로 변환할 수 있다. 변환된 멀티채널 오디오 신호는 멀티채널 PCM 데이터 형태(408)가 될 수 있다. 클리핑탐지부(409)는 상기 PCM 데이터 형태의 멀티채널 오디오 신호에 클리핑의 발생 여부 및 클리핑의 영향 정도를 검사할 수 있다. 클리핑처리부(410)는 상기 검사결과를 적절한 기준과 비교하여 클리핑된 부분에 대한 복원을 수행하지 않고 원래 신호를 그대로 출력하거나, 또는 적절한 방법으로 상기 클리핑된 부분을 복원한 후에 멀티채널 오디오 신호(412)로 출력할 수 있다. 만일, 멀티채널 오디오 신호(412)로 출력할 수 없는 경우에는 모노 또는 스테레오 다운믹스 오디오 신호(411)로 출력할 수 있다.4 illustrates a decoding method for reconstructing a clipped portion generated during a decoding process of a multichannel audio signal according to the present invention. As shown, the bitstream decoder 402 receives a bitstream 401 comprising a downmix audio signal 403 and a spatial information signal 404, and the downmix audio signal 403 and the spatial information signal. 404 is separated. The downmix audio signal 403 is decoded in the core codec decoder 405. The decoded downmix audio signal may be in mono or stereo PCM data form 406 and may also be in multichannel PCM data form. Thereafter, the multichannel generator 407 may convert the downmix audio signal 406 in the PCM data form into a multichannel audio signal using the spatial information obtained by decoding the spatial information signal 404. The converted multichannel audio signal may be in multichannel PCM data form 408. The clipping detector 409 may check whether clipping occurs and the influence of clipping on the multichannel audio signal in the PCM data format. The clipping processor 410 outputs the original signal as it is without performing restoration of the clipped portion by comparing the inspection result with an appropriate reference, or multi-channel audio signal 412 after restoring the clipped portion in an appropriate manner. Can be printed as If the multichannel audio signal 412 cannot be output, the mono or stereo downmix audio signal 411 may be output.

상기 복원방법으로는, ⅰ)소프트클리핑(Soft-Clipping) 방법을 이용하거나, ⅱ)스무딩(smoothing) 효과를 나타내는 방법을 이용하거나, ⅲ)싱크(sinc)함수 또는 다항(Polynomial)함수 등의 필터를 이용하거나, 또는 ⅳ)클리핑된 부분 주위의 시간(time) 또는 주파수(frequency) 정보를 분석하고, 이를 이용할 수 있다. 또한, 상기 멀티채널생성부(407) 내부의 함수를 이용하여 클리핑된 부분을 복원할 수 있다. As the restoration method, i) a soft-clipping method, i) a method of exhibiting a smoothing effect, i) a filter such as a sink function or a polynomial function, or the like. Alternatively, or iii) time or frequency information around the clipped portion may be analyzed and used. In addition, the clipped portion may be restored by using a function inside the multi-channel generator 407.

상기 내부 함수를 이용하는 방법으로는, 첫째, QMF(Qaudrature Mirror Filter) 또는 하이브리드 필터(Hybrid Filter)들의 게인(Gain)을 조정한 후에, 상기 필터들을 적용하는 것이다. 상기 QMF는 시간영역(Time Domain)의 오디오 신호를 QMF 영역(QMF Domain)으로 변환하거나, 또는 그 역으로 변환하는데 이용되는 필터를 말한다. 상기 하이브리드 필터는 QMF 영역의 오디오 신호를 하이브리드 영역(Hybrid Domain)으로 변환하거나, 또는 그 역으로 변환하는데 이용되는 필터를 말한다. 오디오 신호에 상기 필터들을 적용할 때, 게인 값을 조정하여 클리핑을 복원할 수 있다. 둘째, 전체 게인을 조정한 후에, 이를 적용할 수 있다. 예를 들면, 멀티채널 오디오 신호의 모든 채널에 적용될 수 있는 게인 값을 조절함으로써 클리핑복원을 수행할 수 있다. 셋째, 상기 다운믹스 오디오 신호를 멀티채널로 바꾸는 과정에서, 상기 다운믹스 오디오 신호는 디코릴레이션(De-correlation) 과정을 거치게 되는데, 상기 디코릴레이션 전에 사용되는 매트릭스(Pre-matrix)(이하, "제1 매트릭스"라 한다) 또는 상기 디코릴레이션 후에 사용되는 매트릭스(Post-Matrix)(이하, "제2 매트릭스"라 한다)의 값을 조정하거나, 또는 상기 제1 매트릭스와 제2 매트릭스 값 모두를 조정한 후에, 이를 적용할 수 있다. 넷째, 상기 디코릴레이션 과정에 이용되는 디코릴레이션 함수를 조정한 후에, 이를 적용할 수 있다. 다섯째, 출력되는 상기 멀티채널 오디오 신호의 템포럴 구조(Temporal Structure)를 보존하기 위해 시간 포락선(time envelope) 툴(tool)을 적용할 수 있는데, 상기 시간 포락선 툴을 위한 함수를 조정하고, 이를 적용할 수 있다. 상기 시간 포락선 툴은 주파수 영역에서 동작하거나, 또는 시간영역에서 동작할 수 있다. 여섯째, 상기 디코릴레이션 후의 오디오 신호와 원래 신호를 합할 때, 각각의 게인을 조정하고, 이를 적용할 수 있다. As a method of using the internal function, first, after adjusting gains of QMF (Qaudrature Mirror Filter) or Hybrid Filters, the filters are applied. The QMF refers to a filter used to convert an audio signal in a time domain into a QMF domain or vice versa. The hybrid filter refers to a filter used to convert an audio signal of a QMF region into a hybrid domain or vice versa. When applying the filters to an audio signal, the gain value can be adjusted to restore clipping. Second, after adjusting the overall gain, it can be applied. For example, clipping restoration may be performed by adjusting a gain value applicable to all channels of the multichannel audio signal. Third, in the process of converting the downmix audio signal into a multi-channel, the downmix audio signal is subjected to a de-correlation process, which is a matrix (hereinafter, referred to as “first”). 1 matrix "or post-matrix (hereinafter, referred to as" second matrix ") used after the decoration or adjusting both the first matrix value and the second matrix value. Later, this can be applied. Fourth, after adjusting the decoration function used in the decoration process, it can be applied. Fifth, a time envelope tool can be applied to preserve the temporal structure of the multi-channel audio signal that is output. Adjust a function for the time envelope tool, and apply the time envelope tool. can do. The temporal envelope tool can operate in the frequency domain or can operate in the time domain. Sixth, when summation of the audio signal after the decoration and the original signal, the respective gains may be adjusted and applied.

도 5는 본 발명에 따른 멀티채널 오디오 신호의 다운믹스과정 중 클리핑된 부분을 복원하는 것에 관한 인코딩 방법을 도시한다. 도시된 것처럼, 멀티채널 오디오 신호(501)가 인코더(508)에 입력된 후에, 다운믹스부(502)에서 다운믹스되어 다운믹스 오디오 신호를 생성한다. 또한, 공간정보발생부(503)는 상기 멀티채널 오 디오 신호로부터 공간 정보를 추출하여 공간 정보 신호 생성한다. 그 다음에 클리핑탐지부(504)는 상기 멀티채널 오디오 신호를 다운믹스하는 중에 발생한 클리핑의 빈도 및 영향력 정도(예를 들면, 음질 저하 정도)를 판단한다. 상기 클리핑탐지부(504)를 포함하는 클리핑방지판단부(505)는 클리핑이 발생한 상기 다운믹스 오디오 신호에 대하여, 게인(Gain)을 조정하여 클리핑을 방지할지, 또는 게인을 바꾸지 않고 클리핑을 감수할지를 판단할 수 있다. 만일, 클리핑을 방지하는 것이 유리하다고 판단되면, 게인을 조절하여 클리핑이 발생하지 않은 다운믹스 오디오 신호를 생성할 수 있다. 만일, 클리핑을 감수하는 것이 유리하다고 판단되면, 게인을 조절하지 않고 클리핑이 발생한 다운믹스 오디오 신호를 이용한다. 그 다음에 비트스트림포맷터(506)는 상기 다운믹스 오디오 신호와 상기 공간 정보 신호를 포함하는 비트스트림(507)을 생성하여 전송한다. 5 illustrates an encoding method for restoring a clipped portion of a downmix process of a multichannel audio signal according to the present invention. As shown, after the multichannel audio signal 501 is input to the encoder 508, it is downmixed by the downmix unit 502 to generate a downmix audio signal. In addition, the spatial information generator 503 extracts spatial information from the multichannel audio signal to generate a spatial information signal. Then, the clipping detector 504 determines the frequency and influence of clipping (eg, deterioration in sound quality) generated during downmixing the multichannel audio signal. The anti-clipping determination unit 505 including the clipping detection unit 504 adjusts a gain with respect to the downmix audio signal in which clipping has occurred, to prevent clipping or to take clipping without changing the gain. You can judge. If it is determined that it is advantageous to prevent clipping, the gain may be adjusted to generate a downmix audio signal in which clipping does not occur. If it is determined that taking a clipping is advantageous, use a downmix audio signal in which clipping has occurred without adjusting gain. The bitstream formatter 506 then generates and transmits a bitstream 507 comprising the downmix audio signal and the spatial information signal.

도 6은 본 발명에 따른 다운믹스 오디오 신호의 디코딩과정 중 발생한 클리핑된 부분을 복원하는 디코딩 방법에 대한 흐름도이다. 먼저 다운믹스 오디오 신호 및 공간 정보 신호를 포함하는 비트스트림을 수신(601)하고, 상기 비트스트림으로부터 다운믹스 오디오 신호 및 공간 정보 신호를 추출(602 및 603)한다. 상기 다운믹스 오디오 신호를 디코딩(604)한 후에, 상기 디코딩 과정에서 클리핑 발생여부 및 클리핑 영향정도를 검사(605)한다. 상기 검사결과를 적절한 기준과 비교하여, 원래의 신호를 그냥 출력하거나 또는 클리핑된 부분을 복원(606)한다. 그 다음에, 상기 공간 정보 신호를 이용하여 상기 다운믹스 오디오 신호를 멀티채널 오디오 신호로 변환(607)한다. 6 is a flowchart illustrating a decoding method for recovering a clipped portion generated during the decoding process of a downmix audio signal according to the present invention. First, a bitstream including a downmix audio signal and a spatial information signal is received (601), and a downmix audio signal and spatial information signal are extracted (602 and 603) from the bitstream. After decoding the downmix audio signal (604), it is checked whether clipping occurs and the degree of clipping influence in the decoding process (605). The test result is compared with an appropriate criterion to either output the original signal or restore 606 the clipped portion. The downmixed audio signal is then converted into a multichannel audio signal 607 using the spatial information signal.

도 7은 본 발명에 따른 멀티채널 오디오 신호의 디코딩과정 중 발생한 클리핑된 부분을 복원하는 디코딩 방법에 대한 흐름도이다. 먼저 다운믹스 오디오 신호 및 공간 정보 신호를 포함하는 비트스트림을 수신(701)하고, 상기 비트스트림으로부터 다운믹스 오디오 신호 및 공간 정보 신호를 추출(702 및 703)한다. 상기 다운믹스 오디오 신호를 디코딩(604)한 후에, 상기 공간 정보 신호를 이용하여 상기 다운믹스 오디오 신호를 멀티채널 오디오 신호로 변환(705)한다. 그 다음에, 상기 멀티채널 오디오 신호로 변환하는 과정에서의 클리핑 발생 여부 및 클리핑의 영향 정도를 검사(706)한다. 상기 검사결과를 적절한 기준과 비교하여, 원래의 신호를 그대로 출력하거나 또는 클리핑된 부분을 복원(706)하여 출력한다.7 is a flowchart illustrating a decoding method for recovering a clipped portion generated during the decoding process of a multichannel audio signal according to the present invention. First, a bitstream including a downmix audio signal and a spatial information signal is received (701), and a downmix audio signal and spatial information signal are extracted (702 and 703) from the bitstream. After decoding the downmix audio signal (604), the downmix audio signal is converted into a multichannel audio signal using the spatial information signal (705). Thereafter, in operation 706, the clipping occurs and the degree of the influence of clipping in the process of converting the multi-channel audio signal. The inspection result is compared with an appropriate standard, and the original signal is output as it is or the clipped part is restored (706) and output.

도 8은 본 발명에 따른 멀티채널 오디오 신호의 다운믹스과정 중 클리핑된 부분을 복원하는 것에 관한 인코딩 방법에 대한 흐름도이다. 먼저 멀티채널 오디오 신호(801)를 다운믹스하여 다운믹스 오디오 신호를 생성(802)한다. 또한, 상기 멀티채널 오디오 신호로부터 공간 정보를 추출(803)하고, 추출된 공간 정보를 이용하여 공간 정보 신호를 생성(806)한다. 그 다음에 상기 멀티채널 오디오 신호를 다운믹스하는 중에 발생한 클리핑 빈도 및 클리핑 정도를 판단(804)한 후에, 게인을 조절하여 클리핑을 방지할지, 또는 게인을 조절하지 않고 클리핑을 감수할지를 판단(805)한다. 상기 판단결과에 대한 정보는 상기 공간 정보 신호에 포함될 수 있다. 그 다음에 상기 다운믹스 오디오 신호 및 공간 정보 신호를 포함하는 비트스트림을 전송(807)한다.8 is a flowchart illustrating an encoding method for restoring a clipped portion of a downmix process of a multichannel audio signal according to the present invention. First, the multichannel audio signal 801 is downmixed to generate a downmix audio signal (802). In addition, spatial information is extracted from the multi-channel audio signal (803), and the spatial information signal is generated (806) using the extracted spatial information. Then, after determining the clipping frequency and the degree of clipping that occur during downmixing the multichannel audio signal (804), it is determined whether to adjust the gain to prevent clipping or to take the clipping without adjusting the gain (805). do. Information about the determination result may be included in the spatial information signal. The bitstream including the downmix audio signal and the spatial information signal is then transmitted 807.

지금까지 본 발명에 대하여 몇몇 실시예들을 들어 구체적으로 설명하였으나, 상기 실시예들은 본 발명을 이해하기 위한 설명을 위해 제시된 것이며, 본 발명의 범위가 상기 실시예에 제한되는 것은 아니다. 당업자라면 본 발명의 기술적 사상의 범위를 벗어나지 않고도 다양한 변형이 가능함을 이해할 수 있을 것이며, 본 발명의 범위는 첨부된 특허청구범위에 의해서 해석되어야 할 것이다.Although the present invention has been described in detail with reference to some embodiments, the above embodiments are presented for the purpose of understanding the present invention, and the scope of the present invention is not limited to the above embodiments. Those skilled in the art will understand that various modifications are possible without departing from the scope of the technical idea of the present invention, and the scope of the present invention should be interpreted by the appended claims.

이상에서 기술된 것과 같이, 본 발명에 따른 멀티채널 오디오 신호를 코딩하는데 있어서, 다운믹스 오디오 신호를 디코딩하는 과정에서 발생한 클리핑된 부분을 복원하거나, 또는 상기 다운믹스 오디오 신호를 공간 정보 신호를 이용하여 멀티채널 오디오 신호로 변환하는 과정에서 발생한 클리핑된 부분을 적절한 방법으로 복원함으로써 멀티채널 오디오 신호에 대한 클리핑 문제를 효과적으로 방지할 수 있다.As described above, in coding the multi-channel audio signal according to the present invention, the clipping portion generated in the process of decoding the downmix audio signal is restored, or the downmix audio signal is stored using a spatial information signal. By restoring the clipped portion generated during the conversion to the multichannel audio signal in an appropriate manner, the clipping problem for the multichannel audio signal can be effectively prevented.

또한, 멀티채널 오디오 신호를 다운믹스하는 과정에서 발생한 클리핑 빈도 및 클리핑 정도를 검사한 후에, 클리핑을 방지할지 아니면 클리핑을 감수할지를 판단하는 단계를 수행함으로써 멀티채널 오디오 신호를 다운믹스하는 과정에서 발생되는 클리핑 문제를 효과적으로 방지할 수 있다.In addition, after checking the clipping frequency and the degree of clipping generated during the downmixing of the multichannel audio signal, a step of determining whether to prevent clipping or take clipping is performed. The clipping problem can be effectively prevented.

Claims

Separating the downmix audio signal from the bitstream of the audio signal; And

After decoding the downmix audio signal, checking whether clipping occurs and the degree of clipping in the decoding process.

The method of claim 1,

The decoding method,

Restoring a clipped portion of the downmix audio signal.

The method of claim 2,

And restoring said clipped portion by a method by softclipping.

The method of claim 2,

And recovering the clipped portion using a filter.

The method of claim 2,

Recovering the clipped portion using a method that exhibits a smoothing effect.

The method of claim 2,

And restoring the clipped portion using time or frequency information around the portion where the clipping occurred.

The method of claim 1,

The decoding method,

And outputting the original audio signal without restoring the clipped portion of the downmix audio signal.

Separating the downmix audio signal and the spatial information signal from the bitstream;

Converting the downmix audio signal into a multichannel signal using the spatial information signal; And

And checking whether clipping occurs and the degree of clipping during the conversion process.

The method of claim 8,

The decoding method,

Restoring a clipped portion of the multichannel audio signal.

The method of claim 9,

And restoring said clipped portion by a method by softclipping.

The method of claim 9,

And recovering the clipped portion using a filter.

The method of claim 9,

Recovering the clipped portion using a method that exhibits a smoothing effect.

The method of claim 9,

And restoring the clipped portion using an internal function of a multichannel audio signal generator for converting the downmix audio signal into a multichannel audio signal.

The method of claim 14,

And adjusting the gain of the QMF filter or the hybrid filter of the internal function, and restoring the clipped portion by applying the QMF filter or the hybrid filter.

The method of claim 14,

And restoring the clipped portion by adjusting the overall gain of the internal function and applying it.

The method of claim 14,

And reconstructing the clipped portion by adjusting a value of one or more of a first matrix or a second matrix of the internal functions, and applying the same.

The method of claim 14,

And reconstructing the clipped portion by adjusting a decorrelation function of the internal function and applying the same.

The method of claim 14,

And restoring the clipped portion by adjusting a temporal envelope function of the inner function and applying it.

The method of claim 14,

And reconstructing the clipped portion by adjusting each gain when applying the decoded audio signal of the internal function and the original audio signal.

The method of claim 14,

The decoding method,

And outputting the multichannel audio signal without restoring a clipped portion of the multichannel audio signal.

Downmixing the audio signal to produce a downmix audio signal; And

And determining whether to prevent clipping of the downmix audio signal generated during the downmixing process or to take clipping.

The method of claim 22,

The encoding method is

And determining a clipping frequency and a clipping degree of the downmix audio signal.

A bitstream decoder for separating the downmix audio signal from the bitstream;

A clipping detector to check whether clipping has occurred in the decoding process after decoding the downmix audio signal; And

And a clipping processor for restoring a clipped portion of the downmix audio signal.

A bitstream decoder for separating the downmix audio signal and the spatial information signal from the bitstream;

A multichannel generator converting the downmix audio signal into a multichannel audio signal using the spatial information signal;

A clipping detector to check whether clipping has occurred in the conversion process; And

And a clipping processor for restoring a clipped portion of the multi-channel audio signal.

A downmix unit which downmixes the audio signal to generate a downmix audio signal;

A clipping detector to determine a clipping frequency and a clipping degree of the downmix audio signal; And

And an anti-clipping decision unit to determine whether to prevent clipping of the downmix audio signal or to take clipping.