KR20110010077A

KR20110010077A - Apparatus and method of decompressing distributed way coded video

Info

Publication number: KR20110010077A
Application number: KR1020100071433A
Authority: KR
Inventors: 전병우; 심혁재; 박천호
Original assignee: 성균관대학교산학협력단
Priority date: 2009-07-23
Filing date: 2010-07-23
Publication date: 2011-01-31
Also published as: WO2011010895A3; WO2011010895A2; KR101180539B1

Abstract

PURPOSE: A device and a method of decoding a dispersion video encoding image are provided to effectively remove noise error of assistance information as minimizing the waste of compression data, thereby remarkably increasing the compression performance. CONSTITUTION: A key feature restoring unit restores key feature information transmitted from a decoding device(20). An assistance information generating unit(22) generates assistance information by using the key feature restored in the key feature restoring unit. A multi noise model based noise model providing unit provides noise model to be use in a certain channel coding unit. A channel code decoding unit decodes certain channel coding units about a given decoding unit.

Description

Apparatus and method for decoding distributed video encoded image {APPARATUS AND METHOD OF DECOMPRESSING DISTRIBUTED WAY CODED VIDEO}

본 발명은 다중잡음모델에 기반하여 압축 성능을 향상시키는 분산비디오 부호화 영상의 복호화 장치 및 그 방법에 관한 것이다.The present invention relates to an apparatus and method for decoding a distributed video encoded image that improves compression performance based on a multi-noise model.

일반적으로 압축전의 디지털 영상데이터는 상당한 양의 크기를 갖기 때문에, 종래의 MPEG, H.26X와 같은 효율적인 압축 방법에 의하여 압축되었다. 이러한 기술들은 DVD, 방송들과 같은 응용에 사용되었으며 최근에는 2.5G/3G 등의 무선 통신환경의 발달로 무선 모바일 영상전송에도 사용되고 있다.In general, since digital image data before compression has a considerable amount of size, it is compressed by an efficient compression method such as MPEG and H.26X. These technologies have been used for applications such as DVDs and broadcasts. Recently, wireless technologies such as 2.5G / 3G have been used for wireless mobile video transmission.

디지털 영상데이터의 압축은, 압축전의 영상데이터로부터 다음 세가지 관점에서 필요 없거나 나중에 찾아낼 수 있는 정보를 찾아 줄임으로써 달성되는데, 첫 번째로 시간적인 중복성(Temporal redundancy)을 줄이는 것, 다음으로 공간적인 중복성(Spatial redundancy)을 줄이는 것, 마지막으로 통계적인 중복성(Statistical redundancy)을 줄이는 것이 그것이다. Compression of digital image data is achieved by finding and reducing information that is not needed or can be found later from the image data before compression, firstly by reducing temporal redundancy and then by spatial redundancy. To reduce spatial redundancy, and finally to reduce statistical redundancy.

MPEG, H.26X로 대표되는 기존의 압축 기술들은 움직임 예측/보상에 기반하는 시간적 중복성의 제거로 높은 압축효율을 달성하고 있지만, 동영상 부호화 장치에 많은 연산부담을 준다. 따라서 최근 사용량이 늘어나거나 향후 폭발적으로 늘어날 센서 네트워크나 휴대폰 영상통화와 같이, 부호화 장치의 연산량 및/또는 전력 자원이 한정된 응용에서는, 기존의 압축 기술들이 부적합하여, 부호화 장치의 복잡도를 줄이는 것이 중요한 기술적 문제로 대두 되어왔다.Conventional compression techniques represented by MPEG and H.26X achieve high compression efficiency by eliminating temporal redundancy based on motion prediction / compensation, but they put a lot of computational burden on the video encoding apparatus. Therefore, in applications where the amount of computation and / or power resources of the encoding device are limited, such as sensor networks or cellular video calls, which are recently increasing or will explode in the future, it is important to reduce the complexity of the encoding device because existing compression technologies are inadequate. It has been a problem.

이러한 부호화 장치의 복잡도 문제를 해결하기 위한 방법의 하나로 분산 소스 부호화(DSC: Distributed Source Coding) 기술이 주목 받고 있다. 분산 소스 부호화 기술은 Slepian-Wolf 이론을 기반으로 한다. Slepian-Wolf 이론에 따르면 상관성(correlation)이 존재하는 소스들을 독립적으로 각각 압축 부호화하여도 복호화시 상호 상관성을 이용한다면 각 소스를 서로 연계하여 예측 부호화를 한 것과 동일한 압축 부호화 이득을 얻을 수 있음을 알 수 있다.Distributed source coding (DSC) technology has attracted attention as one of methods for solving the complexity problem of the encoding apparatus. Distributed source coding techniques are based on the Slepian-Wolf theory. According to the Slepian-Wolf theory, even if each of the sources having correlation is independently compressed and encoded, if the cross correlation is used when decoding, the same compression coding gains as those of the prediction coding are linked to each other can be obtained. Can be.

Wyner-Ziv 이론을 기반으로 한 분산 비디오 부호화(DVC: Distributed Video Coding) 기술은, 상기의 분산 소스 부호화 기술이 무손실 부/복호화에 해당하던 것을 손실 압축 부호화의 경우까지 확대한 것이다. 이는 픽춰 간 중복성을 줄이기 위하여 지금까지 비디오 부호화 장치에서 이루어졌던 움직임 예측/보상 같은 일련의 신호처리 절차를 큰 압축 부호화 이득/손실 없이 복호화 장치 쪽으로 옮길 수 있음을 의미한다.The Distributed Video Coding (DVC) technique based on the Wyner-Ziv theory is an extension of the lossy coding / decoding to the case of lossy compression coding. This means that in order to reduce the redundancy between pictures, a series of signal processing procedures, such as motion prediction / compensation, which have been performed in the video encoding apparatus, can be moved to the decoding apparatus without large compression coding gain / loss.

분산 비디오 부호화 기술 중 가장 대표적으로 알려진 것은 A. Aaron, S. Rane, R. Zhang, and B. Girod 등이 Proc. IEEE Data Compression Conference, 2003을 통해 발표한 논문인 "Wyner-Ziv coding for video: Applications to compression and error resilience"에서 소개한 Wyner-Ziv 코딩 기술이다. The most representative examples of distributed video encoding techniques are A. Aaron, S. Rane, R. Zhang, and B. Girod et al. Wyner-Ziv coding technology introduced by the paper, "Wyner-Ziv coding for video: Applications to compression and error resilience," published at the IEEE Data Compression Conference, 2003.

이 분산 비디오 부호화 기술은 복호화 장치에서 압축된 현재 부호화 단위의 전,후 픽춰 간의 유사도를 이용하여 현재 부호화 단위에 해당하는 보조정보를 생성한 후, 보조정보와 현재 부호화 단위에 해당하는 영상데이터의 차를 가상의 채널에서 발생한 잡음으로 간주한 뒤, 부호화 장치에서 전송된 패리티 데이터를 이용하여 보조정보내의 잡음을 제거하는 방식으로 현재 부호화 단위에 해당하는 압축 전 영상데이터를 복원하는 것이다.The distributed video encoding technique generates auxiliary information corresponding to the current coding unit by using similarity between before and after pictures of the current coding unit compressed by the decoding apparatus, and then, the difference between the auxiliary information and the image data corresponding to the current coding unit. Is regarded as noise generated in the virtual channel, and then the pre-compression image data corresponding to the current coding unit is reconstructed by removing the noise in the auxiliary information using the parity data transmitted from the encoding apparatus.

도 1은 종래의 Wyner-Ziv 코딩기술의 부호화 장치(110)와 복호화 장치(120)의 구성을 보여주는 도면이다. FIG. 1 is a diagram illustrating a configuration of a coding apparatus 110 and a decoding apparatus 120 of a conventional Wyner-Ziv coding technique.

도 1에 도시된 바와 같이, 종래의 Wyner-Ziv 코딩기술의 부호화 장치(110) 는 키픽춰 부호화부(111), 양자화부(112), 블록단위화 부(113), 그리고 채널코드 부호화부(114)를 포함하고, 이에 상응하는 복호화 장치(120)는 키픽춰 복원부(121), 보조정보 생성부(122), 잡음모델 생성부(123), 채널코드 복호화부(124), 그리고 영상 복원부(125)를 포함한다.As shown in FIG. 1, the encoding apparatus 110 of the conventional Wyner-Ziv coding technique includes a key picture encoding unit 111, a quantization unit 112, a block unitization unit 113, and a channel code encoding unit ( 114, and the corresponding decoding apparatus 120 includes a key picture reconstructor 121, an auxiliary information generator 122, a noise model generator 123, a channel code decoder 124, and an image reconstruction. The unit 125 is included.

Wyner-Ziv 코딩기술에 따른 부호화 장치(120)는 압축할 데이터를 두 개의 부류로 나누는데, 하나는 분산비디오 부호화 방식에 의해 부호화 되는 영상데이터(이하, 'WZ픽춰'라 함)이고, 또 다른 하나는 분산비디오 부호화 방식 이외의 종래의 부호화방식에 의하여 부호화 되는 영상데이터(이하, '키픽춰'라 함)이다. 키픽춰들은 일반적으로 H.264/AVC의 인트라픽춰 부호화 방식과 같이 사용자가 미리 선택한 소정의 방법으로 키픽춰 부호화부(111)에 의해 부호화되어 복호화 장치(120)로 전송된다. The encoding apparatus 120 according to the Wyner-Ziv coding technique divides data to be compressed into two classes, one of which is image data encoded by distributed video encoding (hereinafter referred to as 'WZ picture'), and the other. Denotes image data (hereinafter, referred to as a "key picture") encoded by a conventional encoding method other than the distributed video encoding method. In general, the key pictures are encoded by the key picture encoder 111 in a predetermined method selected by a user and transmitted to the decoding apparatus 120, such as H.264 / AVC.

종래의 Wyner-Ziv 코딩기술에 따른 부호화 장치(110)에 상응하는 복호화 장치(120)의 키픽춰 복원부(121)는 H.264/AVC의 인트라픽춰 부호화 방식과 같이 사용자가 미리 선택한 소정의 방법으로 부호화되어 전송된 키픽춰를 복원하고, 보조정보 생성부(122)에서는 키픽춰 복원부(121)에 의하여 복원된 키픽춰들을 이용하여 보조정보(Side Information)를 생성한다. 이 보조정보는 복원할 WZ픽춰와 유사할 것이라고 간주되는, 복호화 장치에서 생성한 추정정보이며, 압축 전 영상데이터와 이에 대한 보조정보와의 차를 잡음오류로 간주한다. 이 잡음오류는 실제 통신채널상의 전송에 의하여 발생하는 잡음오류가 아니므로, 이런 오류를 발생시키는 잡음을 가상의 채널잡음(Virtual Channel Noise)이라 한다.The key picture reconstructing unit 121 of the decoding device 120 corresponding to the encoding device 110 according to the conventional Wyner-Ziv coding technique may select a predetermined method previously selected by the user, such as the H.264 / AVC intra picture coding method. The auxiliary picture generator 122 restores the key picture encoded and transmitted, and generates side information using the key pictures restored by the key picture restorer 121. This auxiliary information is estimation information generated by the decoding apparatus, which is considered to be similar to the WZ picture to be reconstructed, and regards the difference between the pre-compression image data and the auxiliary information thereof as a noise error. Since this noise error is not the noise error caused by the transmission on the actual communication channel, the noise causing this error is called virtual channel noise.

일반적으로 보조정보 생성부(122)에서 보조정보의 생성은 WZ픽춰의 전후에 위치하는 키픽춰 간의 선형적인 움직임을 가정한 보간법(interpolation)을 이용하여 이루어지며 보외법(extrapolation) 역시 사용할 수도 있지만 성능 면에서 보간법이 보외법 보다 뛰어나므로 대부분 보간법이 사용된다.In general, the generation of auxiliary information in the auxiliary information generator 122 is performed by using interpolation, which assumes linear movement between key pictures located before and after the WZ picture. Extrapolation may also be used. Since interpolation is superior to extrapolation in most respects, interpolation is mostly used.

한편, WZ픽춰들을 부호화하기 위하여, 부호화 장치(110)의 양자화부(112)는 WZ 픽춰 정보에 대하여 양자화를 실행하고, 블록단위화부(113)는 양자화된 값을 부호화 단위로 구분한다. 채널코드 부호화부(114)는 양자화된 값을 소정 채널코딩단위로 나누며 채널코드를 이용하여 이에 대한 압축데이터를 생성하고 이를 버퍼(미도시)에 저장한다. 이때, 압축데이터는 압축할 부호화 단위에 해당하는 영상데이터에 대한 패리티데이터이다. Meanwhile, in order to encode WZ pictures, the quantization unit 112 of the encoding apparatus 110 performs quantization on the WZ picture information, and the block unitization unit 113 divides the quantized values into coding units. The channel code encoder 114 divides the quantized value into predetermined channel coding units, generates compressed data for the channel code, and stores the compressed data in a buffer (not shown). In this case, the compressed data is parity data of image data corresponding to a coding unit to be compressed.

버퍼에 저장된 압축데이터, 즉 패리티데이터는 역방향 채널(즉, 피드백 채널)을 통한 복호화 장치(120)의 요청에 따라 순차적으로 전송된다. 도 1의 채널코드 복호화부(124)는 잡음모델 생성부(123)에서 생성한 가상의 채널에 대한 하나의 잡음모델에 기반하여, 부호화 장치(110)로부터 전송되는 패리티데이터를 사용하여 보조정보에 존재하는 잡음오류를 정정하여 양자화된 압축 이전의 영상데이터의 부호화 단위에 해당하는 중간영상데이터 값을 추정한다. 여기서 압축 이전의 영상데이터는 이후 원본영상과 동일한 개념으로 혼용한다. 도 1의 영상 복원부(125)는 채널코드 복호화부(124)에 의해 출력된 소정 채널코딩단위들의 집합인 중간영상데이터 값을 입력 받아 이를 역 양자화 하여 WZ 픽춰를 복원한다. Compressed data stored in the buffer, that is, parity data is sequentially transmitted according to a request of the decoding apparatus 120 through a reverse channel (ie, a feedback channel). The channel code decoder 124 of FIG. 1 uses the parity data transmitted from the encoding apparatus 110 based on one noise model for the virtual channel generated by the noise model generator 123 to provide auxiliary information. By correcting the existing noise error, the intermediate image data value corresponding to the coding unit of the quantized image data before compression is estimated. Here, the image data before compression is mixed with the same concept as the original image. The image reconstructor 125 of FIG. 1 receives an intermediate image data value, which is a set of predetermined channel coding units output by the channel code decoder 124, and inversely quantizes the intermediate image data value to reconstruct the WZ picture.

상기 과정에 있어 역 양자화시에 발생하는 모호성(Ambiguity)은 보조정보 생성부(122)에 의해 생성한 상기 보조정보와 잡음모델 생성부(123)에서 추정한 잡음모델을 참고하여 제거한다.Ambiguity generated during inverse quantization in the above process is removed by referring to the auxiliary information generated by the auxiliary information generator 122 and the noise model estimated by the noise model generator 123.

Wyner-Ziv 코딩 기술의 복호화 방법은 기본적으로 보조정보 내의 잡음오류를 채널코드를 이용하여 정정하는 것이다. 그러나 복호화 장치에는 잡음오류를 정정하는데 필요한 압축 전 영상데이터에 대한 정보가 없기 때문에 복호화 장치는 잡음오류를 정정하기 위해 필요한 패리티데이터의 양 역시 알 수 없다. 그러므로, 복호화 장치는 역방향 채널을 통해 점진적으로 패리티데이터를 부호화 장치에 요청하는 구조를 갖는다. 이에 대한 구체적인 설명은 A. Aaron, S. Rane, R. Zhang, and B. Girod 등이 Proc. IEEE Data Compression Conference, 2003을 통해 발표한 논문인 "Wyner-Ziv coding for video: Applications to compression and error resilience"를 참조한다.The decoding method of the Wyner-Ziv coding technique basically corrects a noise error in auxiliary information using a channel code. However, since the decoding apparatus does not have information on the pre-compression image data necessary to correct the noise error, the decoding apparatus may not know the amount of parity data necessary to correct the noise error. Therefore, the decoding apparatus has a structure for gradually requesting parity data to the encoding apparatus through a reverse channel. For a detailed description, see A. Aaron, S. Rane, R. Zhang, and B. Girod et al. In Proc. See "Wyner-Ziv coding for video: Applications to compression and error resilience" published in the IEEE Data Compression Conference, 2003.

한편, Wyner-Ziv 복호화 장치에서 가상의 채널에 대한 정보를 정확히 예측하는 것은 채널 복호화의 성능에 큰 영향을 미친다. 하지만 복호화 장치에서는 원본영상에 대한 정보가 없으므로 채널(잡음)에 대한 정확한 예측은 이루어 질 수 없으며, 그에 따라 선형적인 움직임을 가정한 전후 키픽춰 간의 차분을 통한 예측을 수행하는 것이 일반적이다. 이에 대한 구체적인 설명은 C. Brites, and F. Pereira가 Proc. IEEE Transactions on Circuits and Systems for Video Technology, 2008을 통해 발표한 논문인 "Correlation Noise Modeling for Efficient Pixel and Transform Domain Wyner-Ziv Video Coding"을 참조한다.On the other hand, accurately predicting the information on the virtual channel in the Wyner-Ziv decoding apparatus greatly affects the performance of channel decoding. However, since there is no information on the original image, the decoding apparatus cannot accurately predict a channel (noise), and accordingly, it is common to perform prediction through difference between front and rear key pictures assuming linear motion. For details, see C. Brites, and F. Pereira, Proc. See "Correlation Noise Modeling for Efficient Pixel and Transform Domain Wyner-Ziv Video Coding", a paper published in IEEE Transactions on Circuits and Systems for Video Technology, 2008.

하지만 종래의 방법에 따른 잡음 모델은 실험적으로 하나의 잡음모델을 미리 생성하여 사용하는 것이므로 소정 채널코딩단위로 입력되는 압축데이터 각 단위에 대해서 모두 좋은 성능을 보장하지는 못한다. 이와 같이, 항상 좋은 성능을 보장하지 못하기 때문에 채널코드 복호화시 잡음오류 정정에 사용되는 패리티데이터의 낭비가 많았다. 또한, 항상 정확한 잡음모델을 생성할 수가 없고, 설사 하나의 적합한 잡음모델을 생성한다 하더라도 그 잡음모델을 생성할 때 확률적으로 비교적 낮게 판단되는 잡음오류가 압축데이터를 실제 복원시킬 때 발생한 경우 채널코드 복호화에 심각한 문제가 발생했다.However, since the noise model according to the conventional method is used to experimentally generate a noise model in advance, it does not guarantee good performance for each unit of compressed data input in a predetermined channel coding unit. As such, since it is not always possible to guarantee good performance, there is a lot of waste of parity data used for noise error correction during channel code decoding. In addition, it is not always possible to generate an accurate noise model, and even if one suitable noise model is generated, a channel code is generated when a noise error, which is determined to be relatively low when generating the noise model, is actually generated when the compressed data is actually recovered. There was a serious problem with decryption.

상기와 같은 문제점을 해결하기 위해, 본 발명은 다중잡음모델을 사용하여 복호화 성능을 향상시키기 위한 분산 비디오 부호화 영상의 복호화 장치 및 방법을 제공한다. In order to solve the above problems, the present invention provides an apparatus and method for decoding a distributed video encoded image for improving decoding performance using a multiple noise model.

본 발명은, 분산비디오 부호화 영상을 복호하는 복호화 장치 및 방법을 제공한다. 상기 복호화 장치는 부호화 장치로부터 전송되는 키픽춰 정보를 복원하는 키픽춰 복원부와, 상기 키픽춰 복원부에서 복원한 상기 키픽춰를 이용하여 보조정보를 생성하는 보조정보 생성부와, 상기 보조정보에 있는 잡음오류들을 추정하기 위한 복수개의 후보 잡음모델들로 구성된 다중잡음모델을 생성하는 다중잡음모델 생성부와, 상기 다중잡음모델 생성부에서 생성된 복수개의 후보 잡음모델들 중에서 소정 채널코딩단위에 사용할 잡음모델을 결정하는 잡음모델 결정부와, 상기 결정된 상기 잡음모델에 기반하여 상기 보조정보의 잡음오류를 상기 부호화 장치로부터 전송되는 압축데이터를 사용하여 정정하고, 주어진 복호화 단위에 대한 소정 채널코딩단위들을 복호화하는 채널코드 복호화부와, 상기 채널코드 복호화부에서 복호화된 상기 소정 채널코딩단위들로 구성된 중간영상데이터 값과, 상기 보조정보 생성부에서 생성한 상기 보조정보와 상기 결정된 잡음모델들을 이용하여 WZ픽춰를 복원하는 영상 복원부를 포함한다.The present invention provides a decoding apparatus and method for decoding a distributed video encoded image. The decoding apparatus includes a key picture restoring unit for restoring key picture information transmitted from an encoding device, an auxiliary information generating unit for generating auxiliary information using the key picture restored by the key picture restoring unit, and the auxiliary information. A multiple noise model generator for generating a multiple noise model composed of a plurality of candidate noise models for estimating noise errors, and a plurality of candidate noise models generated by the multiple noise model generator for use in a predetermined channel coding unit A noise model determination unit for determining a noise model, and correcting a noise error of the auxiliary information based on the determined noise model using compressed data transmitted from the encoding apparatus, and correcting predetermined channel coding units for a given decoding unit. A channel code decoding unit to decode, and the predetermined decoded by the channel code decoding unit The board using the middle value and the image data, the noise model and the side information generated by the side information generation unit consisting of the determined coding unit comprises image restoring section for restoring the WZ picture.

또한, 분산비디오 부호화 영상을 복호하는 복호화 방법은 부호화 장치로부터 전송되는 키픽춰 정보를 복원하는 단계와; 상기 복원한 상기 키픽춰를 이용하여 보조정보를 생성하는 단계와; 상기 보조정보에 있는 잡음오류들을 추정하기 위한 복수개의 후보 잡음모델들로 구성된 다중잡음모델에 기초하여 소정 채널코딩단위에 대해 사용할 잡음 모델을 제공하는 단계와; 상기 제공된 잡음 모델에 기초하여 상기 부호화 장치로부터 전송되는 압축데이터를 사용하여 보조 정보의 잡음 오류를 정정하는 단계와; 소정의 복호화 단위에 대한 소정 채널코딩단위들을 복호하는 단계와; 상기 복호화된 상기 소정 채널코딩단위들로 구성된 중간영상데이터 값과, 상기 보조정보와 상기 제공된 잡음모델들을 이용하여 WZ픽춰를 복원하는 단계를 포함한다.In addition, the decoding method for decoding a distributed video encoded image comprises the steps of: reconstructing key picture information transmitted from an encoding device; Generating auxiliary information using the reconstructed key picture; Providing a noise model to be used for a predetermined channel coding unit based on a multiple noise model consisting of a plurality of candidate noise models for estimating noise errors in the auxiliary information; Correcting a noise error of the auxiliary information using the compressed data transmitted from the encoding apparatus based on the provided noise model; Decoding predetermined channel coding units for a predetermined decoding unit; And reconstructing a WZ picture using the decoded intermediate image data value composed of the predetermined channel coding units, the auxiliary information, and the provided noise model.

전술한 바와 같이 본 발명에 따른 복호화 장치는 부호화 장치로부터 전송되는 확인정보 및/또는 복호화 장치에서 생성 또는 복원한 정보를 이용하여 가상채널의 특성을 효과적으로 반영하는 복수 개의 후보 잡음모델들을 생성한다. 그리고 복호화 과정에서 생성 또는 복원하는 정보 및/또는 부호화 장치로부터 전송받는 확인정보를 이용하여 채널코드 복호화부에 입력된 소정 채널코딩단위에 적합한 잡음모델을 선택하게 된다. 이를 통해 채널코드 복호화기는 압축데이터의 낭비를 최소화 하면서 보조정보의 잡음오류를 효과적으로 제거함으로써 압축성능을 상당히 증가시킨다.As described above, the decoding apparatus according to the present invention generates a plurality of candidate noise models that effectively reflect the characteristics of the virtual channel by using the identification information transmitted from the encoding apparatus and / or the information generated or reconstructed by the decoding apparatus. A noise model suitable for a predetermined channel coding unit input to the channel code decoding unit is selected by using information generated or restored during decoding and / or confirmation information received from the encoding apparatus. Through this, the channel code decoder significantly increases the compression performance by effectively eliminating the noise error of the auxiliary information while minimizing the waste of the compressed data.

도 1은 기존의 Wyner-Ziv 코딩기술에 해당하는 부호화 장치와 그에 따른 복호화 장치의 구성을 도시한 도면이다.
도 2는 본 발명에 따른 화소영역에서의 Wyner-Ziv 코딩기술의 부호화 장치와 그에 따른 다중잡음모델을 사용하여 소정 채널코딩단위에 적응적인 잡음모델을 사용함으로써 복호화 효율을 높이는 분산 비디오 부호화 영상의 복호화 장치의 구성을 도시한 도면이다.
도 3은 도 2의 부/복호화 장치에서 신호처리 과정을 거치는 중간영상데이터 및 소정 채널코딩단위의 예를 도시한 도면이다.
도 4는 도 2의 복호화 장치의 다중잡음모델 생성부에서 생성하는 다중잡음모델의 일 예를 도시한 도면이다.
도 5는 도 2의 복호화 장치의 채널코드 복호화부의 구성을 도시한 도면이다.
도 6는 도 2의 복호화 장치의 영상 복원부의 구성을 도시한 도면이다.
도 7은 본 발명에 따른 변환영역에서의 Wyner-Ziv 코딩기술의 부호화 장치와 그에 따른 다중잡음모델을 사용하여 소정 채널코딩단위에 적응적인 잡음모델을 사용함으로써 복호화 효율을 높이는 분산 비디오 부호화 영상의 복호화 장치의 구성을 도시한 도면이다.
도 8은 본 발명의 일 실시예에 따른 복호화 방법의 흐름도를 도시한 도면이다.
도 9는 본 발명의 일 실시예에 따른 다중 잡음 모델들을 생성하기 위한 흐름도를 도시한다.
도 10은 본 발명의 일 실시예에 따른 분산비디오 부호화 영상의 복호화 방법에 대한 성능 평가 결과를 나타내는 그래프이다.FIG. 1 is a diagram illustrating a configuration of a coding apparatus corresponding to a conventional Wyner-Ziv coding technique and a decoding apparatus according to the same.
2 is a block diagram of a distributed video coded image which improves decoding efficiency by using a noise model adaptive to a predetermined channel coding unit using a coding apparatus of a Wyner-Ziv coding technique in a pixel region and a multiple noise model according to the present invention. It is a figure which shows the structure of an apparatus.
FIG. 3 is a diagram illustrating an example of intermediate image data and a predetermined channel coding unit that undergo a signal processing process in the encoding / decoding apparatus of FIG. 2.
4 is a diagram illustrating an example of a multiple noise model generated by a multiple noise model generator of the decoding apparatus of FIG. 2.
5 is a diagram illustrating a configuration of a channel code decoder of the decoding apparatus of FIG. 2.
FIG. 6 is a diagram illustrating a configuration of an image reconstruction unit of the decoding apparatus of FIG. 2.
7 is a block diagram of a distributed video coded image which improves decoding efficiency by using a noise model adaptive to a predetermined channel coding unit using a coding apparatus of a Wyner-Ziv coding technique and a multiple noise model according to the present invention. It is a figure which shows the structure of an apparatus.
8 is a flowchart illustrating a decoding method according to an embodiment of the present invention.
9 shows a flowchart for generating multiple noise models according to an embodiment of the present invention.
10 is a graph illustrating a performance evaluation result of a method of decoding a distributed video encoded image according to an embodiment of the present invention.

본 발명은 다양한 변경을 가할 수 있고 여러 가지 실시예를 가질 수 있는 바, 특정 실시예들을 도면에 예시하고 상세한 설명에 상세하게 설명하고자 한다. 그러나, 이는 본 발명을 특정한 실시 형태에 대해 한정하려는 것이 아니며, 본 발명의 사상 및 기술 범위에 포함되는 모든 변경, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다. 각 도면을 설명하면서 유사한 참조부호를 유사한 구성요소에 대해 사용하였다.As the invention allows for various changes and numerous embodiments, particular embodiments will be illustrated in the drawings and described in detail in the written description. However, this is not intended to limit the present invention to specific embodiments, it should be understood to include all modifications, equivalents, and substitutes included in the spirit and scope of the present invention. Like reference numerals are used for like elements in describing each drawing.

제1, 제2 등의 용어는 다양한 구성요소들을 설명하는데 사용될 수 있지만, 상기 구성요소들은 상기 용어들에 의해 한정되어서는 안 된다. 상기 용어들은 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로만 사용된다. 예를 들어, 본 발명의 권리 범위를 벗어나지 않으면서 제1 구성요소는 제2 구성요소로 명명될 수 있고, 유사하게 제2 구성요소도 제1 구성요소로 명명될 수 있다. 및/또는 이라는 용어는 복수의 관련된 기재된 항목들의 조합 또는 복수의 관련된 기재된 항목들 중의 어느 항목을 포함한다.The terms first, second, etc. may be used to describe various components, but the components should not be limited by the terms. The terms are used only for the purpose of distinguishing one component from another. For example, without departing from the scope of the present invention, the first component may be referred to as the second component, and similarly, the second component may also be referred to as the first component. And / or < / RTI > includes any combination of a plurality of related listed items or any of a plurality of related listed items.

어떤 구성요소가 다른 구성요소에 "연결되어" 있다거나 "접속되어"있다고 언급된 때에는, 그 다른 구성요소에 직접적으로 연결되어 있거나 또는 접속되어 있을 수도 있지만, 중간에 다른 구성요소가 존재할 수도 있다고 이해되어야 할 것이다. 반면에, 어떤 구성요소가 다른 구성요소에 "직접 연결되어"있다거나 "직접 접속되어"있다고 언급된 때에는, 중간에 다른 구성요소가 존재하지 않는 것으로 이해되어야 할 것이다.It is to be understood that when an element is referred to as being "connected" or "connected" to another element, it may be directly connected or connected to the other element, . On the other hand, when a component is referred to as being "directly connected" or "directly connected" to another component, it should be understood that there is no other component in between.

본 출원에서 사용한 용어는 단지 특정한 실시예를 설명하기 위해 사용된 것으로, 본 발명을 한정하려는 의도가 아니다. 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. 본 출원에서, "포함하다" 또는 "가지다" 등의 용어는 명세서상에 기재된 특징, 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것이 존재함을 지정하려는 것이지, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다.The terminology used herein is for the purpose of describing particular example embodiments only and is not intended to be limiting of the present invention. Singular expressions include plural expressions unless the context clearly indicates otherwise. In this application, the terms "comprise" or "having" are intended to indicate that there is a feature, number, step, operation, component, part, or combination thereof described in the specification, and one or more other features. It is to be understood that the present invention does not exclude the possibility of the presence or the addition of numbers, steps, operations, components, components, or a combination thereof.

이하, 첨부한 도면들을 참조하여, 본 발명의 바람직한 실시예를 보다 상세하게 설명하고자 한다. 이하, 도면상의 동일한 구성요소에 대해서는 동일한 참조부호를 사용하고 동일한 구성요소에 대해서 중복된 설명은 생략한다.Hereinafter, with reference to the accompanying drawings, it will be described in detail a preferred embodiment of the present invention. Hereinafter, the same reference numerals are used for the same components in the drawings, and duplicate descriptions of the same components are omitted.

본 발명에 따른 분산 비디오 부호화 영상의 복호화 장치 및 방법은 입력되는 소정 채널코딩단위 각각에 대하여, 발생할 수 있는 잡음의 정도를 파악하고 그에 알맞는 후보 잡음모델을 복수 개 생성한 후 이들 중에서 적응적으로 잡음모델을 선택하여 사용함으로써 영상데이터의 압축 성능을 개선할 수 있다. An apparatus and method for decoding a distributed video coded image according to the present invention identify a degree of noise that may occur for each of predetermined input channel coding units, generate a plurality of candidate noise models suitable for the adaptive channel coding, and adaptively select among them. By selecting and using the noise model, the compression performance of image data can be improved.

구체적으로, 본 발명에 따른 분산 비디오 부호화 영상의 복호화 장치 및 방법은 부호화 장치로부터 전송 받은 정보 및/또는 복호화 장치에서 생성 또는 복원되는 정보를 이용하여 채널의 특성을 반영하는 복수개의 후보 잡음모델들을 생성한 후, 복수개의 후보 잡음 모델들 중에서 복호화 장치에서 생성 또는 복원되는 정보 및/또는 부호화 장치로부터 전송 받은 정보를 이용하여 채널코드 복호화부에 입력된 소정 채널코딩단위 각각에 적합한 잡음모델을 선택하여, 채널 복호화를 수행함으로써, 전송되는 압축데이터 양의 낭비를 줄인다.Specifically, the apparatus and method for decoding a distributed video encoded image according to the present invention generates a plurality of candidate noise models reflecting channel characteristics by using information received from an encoding apparatus and / or information generated or reconstructed by the decoding apparatus. Then, a noise model suitable for each of the predetermined channel coding units input to the channel code decoding unit is selected by using information generated or restored by the decoding apparatus and / or information received from the encoding apparatus from among a plurality of candidate noise models. By performing channel decoding, waste of the amount of compressed data transmitted is reduced.

이하에서는 첨부된 도면들을 참조하여 본 발명을 보다 상세하게 설명한다. Hereinafter, with reference to the accompanying drawings will be described in detail the present invention.

도 2는 본 발명에 따른 Wyner-Ziv 부호화 장치(10)와 다중잡음모델을 사용하여 복호화 성능을 향상시키는 분산 비디오 부호화 영상의 복호화 장치(20)의 구성을 도시한 도면이다. 2 is a diagram illustrating a configuration of a decoder 20 of a distributed video encoded image for improving decoding performance by using the Wyner-Ziv encoding apparatus 10 and the multiple noise model according to the present invention.

도 2를 참조하여 설명하면, 본 발명에 따른 Wyner-Ziv 부호화 장치는 키픽춰 부호화부(11)와 WZ픽춰 부호화부(12)를 포함할 수 있다. Referring to FIG. 2, the Wyner-Ziv encoding apparatus according to the present invention may include a key picture encoder 11 and a WZ picture encoder 12.

부호화 장치(120)는 압축할 데이터를 두 개의 부류로 나누는데, 하나는 분산비디오 부호화 방식에 의해 부호화 되는 WZ픽춰이고, 또 다른 하나는 분산비디오 부호화 방식 이외의 종래의 부호화방식에 의하여 부호화 되는 키픽춰이다. 키픽춰 부호화부(11)는 키픽춰들은 일반적으로 H.264/AVC의 인트라픽춰 부호화 방식과 같이 사용자가 미리 선택한 소정의 방법으로 부호화하여 복호화 장치(120)로 전송한다.The encoding apparatus 120 divides the data to be compressed into two classes, one of which is a WZ picture encoded by a distributed video encoding scheme, and the other of which is a key picture encoded by a conventional encoding scheme other than the distributed video encoding scheme. to be. The key picture encoder 11 encodes the key pictures by a predetermined method selected by a user in advance, such as the H.264 / AVC intra picture encoding method, and transmits the encoded pictures to the decoding apparatus 120.

WZ픽춰 부호화부(12)는 WZ 픽춰 정보에 대하여 양자화를 실행하고, 양자화된 값을 부호화 단위로 구분한다. 여기서 부호화 단위는 영상을 부호화하고 복호화하는 단위로서, 프레임 또는 프레임을 세분한 다양한 크기의 블록을 포함하며, 화소영역 또는 이를 DCT(Discrete Cosine Transform)와 같은 변환(Transform)을 사용하여 얻어진 변환영역을 포함한다. 또한, 부호화 단위는 복호화 장치에서 복호화 대상이 되는 단위가 되어, 복호화 단위라는 용어와 같은 의미로 혼용하여 사용한다.The WZ picture encoder 12 performs quantization on the WZ picture information, and divides the quantized values into coding units. The coding unit is a unit for encoding and decoding an image. The coding unit includes a frame or a block having various sizes subdivided into frames, and includes a pixel region or a transform region obtained by using a transform such as a discrete cosine transform (DCT). Include. In addition, the coding unit becomes a unit to be decoded by the decoding apparatus and is used interchangeably in the same meaning as the term decoding unit.

또한, WZ픽춰 부호화부(12)는 채널코드를 이용하여 양자화된 값에 대한 압축데이터를 소정 채널코딩 단위로 생성하고 이를 버퍼(미도시)에 저장한다. 이때, 압축데이터는 압축할 부호화 단위에 해당하는 영상데이터에 대한 패리티데이터이다.In addition, the WZ picture encoder 12 generates compressed data about a quantized value using a channel code in a predetermined channel coding unit and stores the compressed data in a buffer (not shown). In this case, the compressed data is parity data of image data corresponding to a coding unit to be compressed.

도 3은 도 2의 부호화 장치에서 신호처리 과정을 거치는 중간영상데이터 및 소정 채널코딩단위의 예를 도시한 도면이다.3 is a diagram illustrating an example of intermediate image data and a predetermined channel coding unit that undergo a signal processing in the encoding apparatus of FIG. 2.

도 3을 참조하면, 프레임은 하나 또는 그 이상의 소정의 크기의 블록으로 이루어지며, 이 때, 블록내 및/또는 블록간 동일성분의 화소값 또는 변환영역 계수값은 양자화를 거친다. Referring to FIG. 3, a frame is composed of one or more blocks of a predetermined size, wherein pixel values or transform region coefficient values of the same components in and / or between blocks are quantized.

상기 양자화는 소정 채널코딩단위 간에 및 소정 채널코딩단위 내에서 동일한 크기로 이루어 질 필요가 없으며 양자화가 이루어지지 않는 경우도 포함한다. 또한, 블록 내 및 블록간 동일성분의 화소값 또는 변환영역 계수값에 대하여 지그재그 스캔방식 또는 Alternate scan과 같은 미리 정해진 스캔방식에 따라 데이터 나열화가 이루어지며, 이는 필수적이지 않다. The quantization does not need to be the same size between predetermined channel coding units and within a predetermined channel coding unit, and also includes a case where quantization is not performed. In addition, data enumeration is performed according to a predetermined scan method such as a zigzag scan method or an alternate scan for pixel values or transform region coefficient values of the same component in and between blocks, which is not essential.

여러 개의 양자화된 화소 값 혹은 변환영역 계수값이 나열화 된 것을 심벌 스트림이라 한다. 이 때 각각의 심벌 값(양자화된 화소 값 혹은 변환영역 계수값)은 이진화하였을 때 서로 다른 2^M의 가중치를 갖는 비트들로 나타나며, 심벌 스트림은 동일하거나 서로 다른 M의 값을 갖는 비트 스트림(이후 비트 플레인이라는 용어와 혼용)으로 구성된다. 이 때 심벌 스트림을 구성하는 심벌 각각에 대하여 M이 최대값을 갖는 비트 플레인을 최상위 비트 플레인이라 하며, M이 최소값을 갖는 비트 플레인을 최하위 비트 플레인이라 한다. 이는 또한, 심볼 스트림을 구성하는 심볼 각각의 최하위 비트 플레인의 M이 0인 양자화가 이루어지지 않은 경우 역시 포함한다.A plurality of quantized pixel values or transform region coefficient values are listed in a symbol stream. At this time, each symbol value (quantized pixel value or transform region coefficient value) is represented as bits having different weights of 2 ^M when binarized, and the symbol stream is a bit stream having a same or different M value (hereinafter, Mixed with the term bit plane). In this case, the bit plane having the maximum value M is referred to as the most significant bit plane, and the bit plane having the minimum value M is referred to as the least significant bit plane for each symbol constituting the symbol stream. This also includes the case where no quantization is performed in which M of the least significant bit plane of each symbol constituting the symbol stream is zero.

본 발명에서 채널코딩단위는 WZ픽춰 부호화부(12)에서 압축데이터, 즉 패리티 데이터를 생성하는 데이터 단위를 지칭하는 것으로서, 부호화 단위인 프레임 또는 프레임을 세분한 다양한 크기의 블록, 또는 부호화 단위를 세분화한 다양한 길이의 비트플레인, 또는 비트플레인의 일부를 포함한다. 또한, 패리티데이터를 생성하는 영상데이터는 화소영역에서의 값 또는 이를 DCT(Discrete Cosine Transform)와 같은 변환(Transform)을 사용하여 얻어진 변환영역에서의 계수값을 포함한다. In the present invention, the channel coding unit refers to a data unit for generating compressed data, that is, parity data, by the WZ picture encoder 12. The channel coding unit subdivides a block or a coding unit having various sizes, which are divided into frames or frames, which are coding units. One bitplane, or part of a bitplane, of various lengths. In addition, the image data generating the parity data includes a value in the pixel region or a coefficient value in the transform region obtained by using a transform such as a discrete cosine transform (DCT).

중간영상데이터는 소정 채널코딩단위의 집합으로 영상이 복원되기 이전의 데이터 값이다. 영상 복원부(25)의 영상복원이 이루어지는 단위(상기 부호화 단위)에 따라 양자화된 화소 값으로 이루어진 프레임 또는 프레임을 세분한 다양한 크기의 블록이 될 수 있으며, 이는 화소영역에서의 값뿐만 아니라 이를 DCT와 같은 변환(Transform)을 사용하여 얻어진 변환영역에서의 계수값을 포함한다.The intermediate image data is a data value before the image is restored to a set of predetermined channel coding units. According to the unit (the coding unit) in which the image restoration unit 25 performs image restoration, the image restoration unit 25 may be a frame having various quantized pixel values or blocks having various sizes, which are not only values in the pixel region but also DCT. Contains the coefficient values in the transform domain obtained using a transform such as

또한, 본 발명은 채널코딩단위를 더 세분하여, 채널코딩단위 내 최소구성단위를 정의한다. 이것은 채널코딩단위 내에서 교차 확률(Cross-over Probability)를 계산하는 최소의 단위를 의미한다. 최소구성 단위는 채널코드 복호화기에 입력되는 채널코딩단위에 따라 프레임 또는 프레임을 세분한 다양한 크기의 블록을 구성하는 화소 값, 또는 다양한 길이의 비트플레인을 구성하는 비트의 경우를 포함한다. 또한 최소구성 단위는 화소영역에서의 값 또는 이를 DCT와 같은 변환(Transform)을 사용하여 얻어진 변환영역에서의 계수값을 포함한다. Further, the present invention further subdivides the channel coding unit to define the minimum structural unit in the channel coding unit. This means the minimum unit for calculating the cross-over probability in the channel coding unit. The minimum configuration unit includes a pixel value constituting a frame or a block having various sizes subdivided into frames according to a channel coding unit input to the channel code decoder, or a bit constituting a bit plane of various lengths. The minimum constituent unit also includes a value in the pixel region or a coefficient value in the transform region obtained by using a transform such as DCT.

다시 도 2를 참조하면, 본 발명에 따른 복호화 장치(20)는 키픽춰 복원부(21), 보조정보 생성부(22), 다중잡음 모델 기반 잡음모델 제공부(30), 채널코드 복호화부(24), 그리고 영상 복원부(25)를 포함한다.Referring back to FIG. 2, the decoding apparatus 20 according to the present invention includes a key picture reconstructing unit 21, an auxiliary information generating unit 22, a multiple noise model based noise model providing unit 30, and a channel code decoding unit ( 24, and an image restoring unit 25.

키픽춰 복원부(21)는 키픽춰 부호화부(11)로부터 전송받은 정보를 이용하여 키픽춰를 복원한다. 보조정보 생성부(22)는 복원된 키픽춰들을 이용하여 현재 WZ픽춰로 간주되는 보조정보를 생성한다. 이 때, 상기 생성된 보조정보는 소정 채널코딩단위에 해당하는 원본영상에 가상의 채널에 의한 잡음오류가 첨가된 것으로 간주된다. 이 잡음오류를 오류정정기술을 이용하여 제거하는 목적의 채널코드 복호화를 위하여 가상의 채널에 존재하는 잡음의 예측이 필요하다.The key picture reconstructor 21 reconstructs the key picture using the information received from the key picture encoder 11. The auxiliary information generator 22 generates auxiliary information that is regarded as a current WZ picture by using the restored key pictures. At this time, the generated auxiliary information is regarded as a noise error due to a virtual channel is added to the original image corresponding to a predetermined channel coding unit. Prediction of the noise present in the virtual channel is necessary for channel code decoding for the purpose of eliminating this noise error using error correction techniques.

다중잡음 모델 기반 잡음모델 제공부(30)는 상기 보조정보에 있는 잡음오류들을 추정하기 위한 복수개의 후보 잡음모델들로 구성된 다중잡음모델에 기초하여 소정 채널코딩단위에 사용할 잡음모델을 제공한다.The noise model provider 30 based on the multiple noise model provides a noise model to be used for a predetermined channel coding unit based on the multiple noise model including a plurality of candidate noise models for estimating noise errors in the auxiliary information.

이를 위해, 다중잡음 모델 기반 잡음모델 제공부(30)는 다중잡음모델 생성부(23) 및 잡음모델 결정부(26)를 포함한다.To this end, the multi-noise model based noise model providing unit 30 includes a multi-noise model generation unit 23 and a noise model determination unit 26.

다중잡음모델 생성부(23)은 복호화 장치에서 생성 또는 복원한 정보 및/ 또는 상기 부호화 장치에서 전송된 확인정보를 이용하여 소정 채널코딩단위에 존재하는 잡음에 대하여 N개의 후보 잡음모델들을 생성한다. The multi-noise model generation unit 23 generates N candidate noise models for noise existing in a predetermined channel coding unit by using information generated or restored by the decoding apparatus and / or confirmation information transmitted by the encoding apparatus.

여기에서, 확인정보는 복호화 장치에서 추정하거나 얻을 수 있는 원본영상(즉, 영상데이터)의 일반적/통계적 정보의 전체 혹은 일부를 말하며, 원본영상에 대한 CRC(Cyclic Redundancy Check) 값이 그 한 예이다.Here, the confirmation information refers to all or part of the general / statistical information of the original image (ie, image data) that can be estimated or obtained by the decoding apparatus, and a CRC (Cyclic Redundancy Check) value of the original image is one example. .

분산 비디오 부호화 영상의 복호화 장치(20)는 정확한 잡음정보를 알 수 없다. 그러므로, 다중잡음모델 생성부(23)는 보조 정보에 있는 잡음 오류를 추정하기 위한 복수개의 후보 잡음 모델들을 생성하기 위해, 소정 채널코딩단위 내 최소구성단위인 비트 플레인을 구성하는 비트 각각에 대하여 잡음 확률분포(PMF)나, 잡음의 분포 특성(Stationary/Non-stationary), 실제의 잡음정보 추정 오류를 보상하기 위한 파라메터를 통하여 후보 잡음모델을 생성한다.The decoding apparatus 20 of the distributed video encoded image may not know accurate noise information. Therefore, the multi-noise model generation unit 23 generates noise for each bit constituting the bit plane which is the minimum component unit in the predetermined channel coding unit, in order to generate a plurality of candidate noise models for estimating the noise error in the auxiliary information. A candidate noise model is generated through parameters for compensating for probability distribution (PMF), noise distribution characteristics (stationary / non-stationary), and actual noise information estimation error.

상기 잡음 모델들을 생성하기 위한 잡음 분포 PMF의 몇 가지 예로서 라플라시안(Laplacian), 가우시안(Gaussian), Two-sided Gamma 분포 등이 있다.Some examples of noise distribution PMF for generating the noise models include Laplacian, Gaussian, and Two-sided Gamma distributions.

상기 잡음모델들은 채널코딩단위 내 잡음 분포 특성에 따라 채널코딩단위 내 최소구성단위 별로 각각 잡음추정을 하여 잡음 모델이 구성되는 Non-stationary 잡음모델과 채널코딩단위내의 구성요소들에 대하여 하나의 잡음추정 하에 잡음모델이 구성되는 Stationary 잡음모델을 포함한다.. 이외에 다른 분포 특성에 의한 구분이 존재할 수 있으며, 이를 이용하여 여러 개의 잡음모델을 생성하는 본 특허의 본질적인 의미를 벗어나지 않는다.The noise models are estimated according to the noise distribution characteristics in the channel coding unit for each of the minimum component units in the channel coding unit to estimate the noise of the non-stationary noise model and the components in the channel coding unit. In addition, the stationary noise model includes a stationary noise model. In addition, there may exist distinctions based on other distribution characteristics, and it does not deviate from the essential meaning of the present patent which generates several noise models by using the same.

상기 파라메터는 복호화 장치에서 행하여지는 잡음예측은 정확할 수 없으므로 이를 보정하기 위하여 사용하게 되는 각종 보정변수를 의미한다. 이하, 다중잡음모델 생성부(23)의 동작을 도 4를 참조하여 더 상세히 설명한다. 설명의 용이함을 위해, 변환영역의 영상데이터에 대해 패리티데이터가 생성된다는 것과, 소정 채널코딩단위로서 한 프레임 내 양자화된 동일 주파수 밴드(동일 변환영역 계수값)의 비트 플레인을 사용하는 것을 가정한다.The parameter refers to various correction variables that are used to correct the noise prediction performed by the decoding apparatus. Hereinafter, the operation of the multi-noise model generation unit 23 will be described in more detail with reference to FIG. 4. For ease of explanation, it is assumed that parity data is generated for the image data of the transform region, and that bit planes of the same frequency band (same transform region coefficient value) quantized in one frame are used as predetermined channel coding units.

다중잡음모델 생성부(23)는 소정 채널코딩단위인 비트 플레인에 대해 생성된 보조정보상의 잡음을 예측하기 위하여, 해당 키픽춰 블록 간의 차이를 구한다. 예컨대, 다중잡음모델 생성부(23)는 복호화 장치의 보조 정보 생성부(22)에서 보조정보 생성시 참조한 전후 키픽춰에 대한 움직임벡터가 가리키는 전후 키픽춰 상의 해당 블록을 찾고, 이 블록들의 동일한 주파수 밴드간의 차이를 잡음으로 예측한다(도 4의 310). 여기에서 하나의 블록은 부호화 장치에서 사용된 채널코딩 단위일 수 있다.The multi-noise model generation unit 23 obtains the difference between the corresponding key picture blocks in order to predict the noise on the auxiliary information generated for the bit plane which is a predetermined channel coding unit. For example, the multi-noise model generation unit 23 finds a corresponding block on the front-back key picture indicated by the motion vector for the front-back key picture referred to when the auxiliary information generation unit 22 of the decoding apparatus generates the auxiliary information, and the same frequency of these blocks. The difference between the bands is predicted as noise (310 in FIG. 4). Here, one block may be a channel coding unit used in the encoding apparatus.

예컨대, 다중잡음모델 생성부(23)는 블록 간의, 예컨대, i번째 블록 간의 오차 블록을 생성한 후 오차 블록에 대한 변환을 수행하고 그 분산을 구한다. 이 분산이 i-번째 블록에 위치한 잡음의 분산값이 된다. 이어서, 다중잡음모델 생성부(23)는 상기 후보 잡음모델들을 생성할 때에는 적어도, 채널 잡음의 확률분포(PMF: Probability Mass Function) 및/또는 잡음모델의 잡음 분포 특성 등을 고려한다.For example, the multi-noise model generation unit 23 generates an error block between blocks, for example, an i-th block, performs transformation on the error block, and obtains the variance thereof. This variance becomes the variance of the noise located in the i-th block. Subsequently, when generating the candidate noise models, the multi-noise model generation unit 23 considers at least a probability distribution function (PMF) of the channel noise and / or a noise distribution characteristic of the noise model.

구체적으로 채널 잡음의 확률분포에 관련하여, 다중잡음모델 생성부(23)는 소정 채널코딩단위 내 최소구성단위인 비트 플레인을 구성하는 비트 각각에 대하여 잡음 확률분포(PMF)에 따라 잡음 모델을 생성한다(도 4의 320).Specifically, in relation to the probability distribution of the channel noise, the multi-noise model generation unit 23 generates a noise model according to the noise probability distribution (PMF) for each of the bits constituting the bit plane that is the minimum component unit within the predetermined channel coding unit. (320 in FIG. 4).

전술한 바와 같이 상기 후보 잡음모델들을 생성하기 위한 잡음 확률분포(PMF)의 몇 가지 예로서 라플라시안(Laplacian), 가우시안(Gaussian), Two-sided Gamma 분포 등이 있다.As described above, some examples of noise probability distributions (PMF) for generating the candidate noise models include Laplacian, Gaussian, and Two-sided Gamma distributions.

복수개의 잡음 확률분포(PMF) 중 예컨대, 라플라시안 분포에 관련하여 설명한다. 잡음예측에서 생성된 소정 채널코딩단위를 구성하는 i-번째 블록에 위치한 잡음의 분산값을

라고 하면, 다중잡음모델 생성부(23)는 라플라시안 분포에 따라, 하기 [수학식 1]에 의하여 소정 채널코딩단위 내 최소구성단위에 대한 잡음모델을 생성할 수 있다.Among the plurality of noise probability distributions (PMFs), for example, the Laplacian distribution will be described. The variance of the noise located in the i-th block constituting the predetermined channel coding unit generated by the noise prediction

In this case, the multi-noise model generation unit 23 may generate a noise model for the smallest unit within a predetermined channel coding unit according to Equation 1 according to the Laplacian distribution.

상기 [수학식 1] 에서, x 는 잡음을 의미하고,

는 소정 채널코딩단위 중i번째 블록의 라플라시안 계수값을 의미한다. In Equation 1, x means noise,

Denotes the Laplacian coefficient value of the i-th block of the predetermined channel coding unit.

이와 같이, 다중잡음모델 생성부(23)는 비트 플레인 내의 비트 각각에 대하여 분산값

을 구함으로써 잡음을 예측하고, 예측된 잡음 즉, 비트 각각에 대한 분산값들에 대해 라플라시안 분포에 따른 잡음모델을 생성한다. In this way, the multi-noise model generation section 23 has a variance value for each bit in the bit plane.

By estimating the noise, we generate a noise model based on the Laplacian distribution for the predicted noise, that is, the variance values for each bit.

이와 같은 방식으로 다중잡음모델 생성부(23)는 라플라시안(Laplacian) 분포 이외에도 가우시안(Gaussian) 분포, Two-sided Gamma 분포 등을 이용하여 잡음 모델들을 생성할 수 있다. In this manner, the multi-noise model generator 23 may generate noise models using a Gaussian distribution, a two-sided gamma distribution, and the like in addition to the Laplacian distribution.

상기의 후보 잡음모델들을 생성할 때 고려하는 상기의 잡음 확률분포(PMF)는 잡음의 분포 형태를 결정하며, 이 확률분포를 이용하여 복호화 대상이 되는 소정 채널코딩단위 내 최소구성단위에 대한 교차 확률(Cross-over probability)을 계산한다. 예를 들면, 보조정보를 이루는 하나의 비트 플레인이 소정 채널코딩단위인 경우 이를 복호화 할 때에는, 그 비트 플레인을 이루는 각각의 비트가 소정 채널코딩단위 내 최소구성단위가 되어 이에 대한 교차 확률을 계산한다. The noise probability distribution (PMF) that is considered when generating the candidate noise models determines the distribution form of noise, and the probability distribution is used to determine the crossover probability for the smallest unit within a predetermined channel coding unit to be decoded. Calculate the cross-over probability. For example, when one bit plane constituting the auxiliary information is a predetermined channel coding unit, when decoding the bit plane, each bit constituting the bit plane becomes a minimum constituent unit in the predetermined channel coding unit and calculates an intersection probability thereof. .

이어서, 다중잡음모델 생성부(23)는 잡음 분포 특성에 관련하여 생성된 잡음 모델의 잡음 분포 특성에 따라 추가의 잡음 모델들을 더 생성한다(도 4의 330). 구체적으로 다중잡음모델 생성부(23)는 잡음 확률 분포에 따라 생성된 잡음 모델의 Stationary/Non-stationary 특성 여부를 판단한다. 그리고, 다중잡음모델 생성부(23)는 Stationary 및 Non-stationary 특성들 중 잡음 모델이 가지지 않은 특성에 대해 잡음 모델을 더 생성한다. Subsequently, the multiple noise model generator 23 further generates additional noise models according to the noise distribution characteristic of the noise model generated in relation to the noise distribution characteristic (330 of FIG. 4). In detail, the multi-noise model generation unit 23 determines whether the stationary / non-stationary characteristics of the generated noise model are based on the noise probability distribution. In addition, the multi-noise model generation unit 23 further generates a noise model for a characteristic that the noise model does not have among the stationary and non-stationary characteristics.

예컨대, 상기 실시예에서 생성된 잡음 모델은 채널코딩단위 내 최소구성단위 별로 각각 잡음을 예측하여 잡음 모델이 구성되었으므로, 생성된 잡음 모델은 Non-stationary 잡음모델의 특성을 갖는다. 다중잡음모델 생성부(23)는 생성된 잡음 모델에 기초하여, stationary 특성을 갖도록 잡음 모델을 더 생성한다. For example, the noise model generated in the above embodiment has a noise model configured by predicting noise for each of the minimum component units in the channel coding unit, and thus the generated noise model has the characteristics of a non-stationary noise model. The multiple noise model generator 23 further generates a noise model based on the generated noise model to have a stationary characteristic.

예컨대, 다중잡음모델 생성부(23)는 하나의 비트 플레인 전체에 대한 잡음추정을 이용하여 비트 플레인 모든 블록에 대하여 동일한 채널코딩단위 내 블록의 잡음 분산값(

) 하나만을 사용한 잡음모델을 적용하면 stationary 특성을 갖는 잡음모델을 생성할 수 있다. 여기서 B는 비트 플레인을 구성하는 비트를 나타내기 위함이 아니며 해당 비트 플레인을 구성하는 각각의 블록에 위치한 잡음을 샘플로 하는 랜덤변수의 분산값을 나타내고 있음을 의미한다.For example, the multi-noise model generation unit 23 uses the noise estimation for the entire bit plane to determine the noise variance value of the block in the same channel coding unit for all the blocks in the bit plane.

By applying a noise model using only one, a noise model with stationary characteristics can be generated. Here, B is not intended to represent the bits constituting the bit plane, but means that the variance value of the random variable that samples the noise located in each block constituting the bit plane is sampled.

모델의 Stationary/Non-stationary 여부 결정에 의한, Stationary 잡음 분포는 소정 채널코딩단위 내 최소구성단위들에 대하여 공통적으로 하나의 교차 확률(Cross-over Probability)을 사용하는 것을 의미하며, Non-stationary 잡음 분포는 소정 채널코딩단위 내 최소구성단위에 대하여 각각의 교차 확률을 사용하는 것을 의미한다. 일반적으로는 Non-stationary 잡음모델의 성능이 좋지만, 잡음 예측이 매우 부정확 할 경우에는 stationary 잡음모델을 사용하는 것이 오히려 이득이 된다.Stationary noise distribution by determining whether the model is stationary or non-stationary means that a single cross-over probability is used for the minimum component units within a given channel coding unit, and non-stationary noise Distribution means using the respective crossing probabilities for the smallest unit within a predetermined channel coding unit. In general, the performance of the non-stationary noise model is good, but if the noise prediction is very inaccurate, the use of the stationary noise model is rather beneficial.

이러한 잡음예측은 추정된 움직임 벡터에 대응하는 전/후 키 픽춰상의 블록간 주파수 밴드의 차를 이용하여 실제의 잡음을 추정하는데 사용하므로 움직임 벡터의 추정이 정확하지 않거나, 이러한 잡음 예측방법이 잘 맞지 않는 경우에는 잘못된 잡음예측의 가능성이 높아, 이를 효과적으로 정정하는 것이 바람직하다. Since the noise prediction is used to estimate the actual noise by using the difference of the inter-block frequency bands on the front and back key pictures corresponding to the estimated motion vector, the estimation of the motion vector is not accurate or the noise prediction method is not suitable. If not, there is a high possibility of false noise prediction, and it is desirable to correct this effectively.

그러므로, 다중잡음모델 생성부(23)는 잡음 확률 분포 및 잡음 분포 특성에 따라 여러 잡음 모델들을 생성한 후 실제의 잡음정보 추정 오류를 보상하기 위해 생성된 잡음 모델들을 변화시킴으로써 최종적인 후보 잡음 모델들을 생성한다.Therefore, the multi-noise model generator 23 generates several noise models according to the noise probability distribution and the noise distribution characteristics, and then changes the generated noise models to compensate for the actual noise information estimation error. Create

구체적으로 설명하면 예측을 통하여 생성된 후보 잡음모델들이 실제 잡음모델과 상이할 수 있으며, 이 차이를 수치화 하여 나중에 이를 보상하기 위한 목적으로 소정의 파라메터들을 정의하여 후보잡음 모델 생성시에 사용할 수 있다. 예를 들어, 잡음 모델을 생성하는 중 발생할 수 있는 스케일링 인자(scaling factor)를 정의하여 사용할 수 있다.In detail, candidate noise models generated through prediction may be different from actual noise models, and predetermined parameters may be defined and used to generate candidate noise models for the purpose of quantifying the difference and later compensating for the difference. For example, a scaling factor that may occur while generating a noise model may be defined and used.

이와 같이, 상기 잡음 예측은 정확한 잡음예측이 아니므로 다중잡음모델 생성부(23)는 상기 잡음예측치(잡음의 통계적인 값)를 기반으로 하여 실제의 잡음정보 추정 오류를 보상하기 위해 생성된 복수개의 잡음 모델들을 변화시킴으로써 복수개의 후보 잡음 모델들을 생성한다. As such, since the noise prediction is not accurate noise prediction, the multiple noise model generator 23 generates a plurality of noise generated to compensate for the actual noise information estimation error based on the noise prediction value (the statistical value of the noise). A plurality of candidate noise models are generated by changing the noise models.

예컨대, 다중잡음모델 생성부(23)는 실제의 잡음정보 추정 오류를 보상하기 위한 파라메터를 변화시키면서 생성된 잡음 모델들에 적용함으로써복수개의 후보 잡음 모델들을 생성한다(도 4의 340). For example, the multiple noise model generator 23 generates a plurality of candidate noise models by applying the generated noise models while changing a parameter for compensating an actual noise information estimation error (340 of FIG. 4).

이런 정정의 일 실시예로서, 주변의 잡음 예측치에 비하여 판이하게 다른 예측치를 갖는 블록을 찾아 이를 정정하는 경우가 있다. 다른 실시예로서, 생성된 보조정보에 대한 각 블록별 신뢰도, 부호화 장치에서 전송된 확인정보, 복호화 장치에서 추정한 움직임 벡터에 기반한 주변 블록 간 유사도 정보 등에 기반하여 정정하는 경우가 가능하다. As an example of such correction, there may be a case where a block having a different prediction value is found and corrected compared to the surrounding noise prediction value. As another embodiment, the correction may be performed based on reliability of each block of the generated auxiliary information, confirmation information transmitted from the encoding apparatus, and similarity information between neighboring blocks based on the motion vector estimated by the decoding apparatus.

이하, 상기의 주변 잡음 예측치와의 비교를 통한 정정의 실시예를 설명한다. 주변 블록의 잡음 예측치를 비트 플레인에 존재하는 잡음의 분산값(

)을 사용하여 계산할 경우, 다음의 세 가지의 경우가 존재할 수 있다. Hereinafter, an embodiment of correction through comparison with the above-described ambient noise prediction value will be described. The noise estimates of the neighboring blocks are estimated from the variance of the noise

), There are three possible cases:

다중잡음모델 생성부(23)는 주변 블록의 잡음 예측치를 비트 플레인에 존재하는 잡음의 분산값(

)을 사용하여 계산할 경우, 다음과 같이 예측 잡음을 정정한다.The multi-noise model generation unit 23 calculates the noise estimates of the neighboring blocks by the variance value of the noise present in the bit plane.

), Correct the prediction noise as follows.

a)

의 경우:

를

로 정정a)

In the case of:

To

Corrected by

b)

의 경우:

를 임계범위(

) 내의 값으로 정정b)

In the case of:

Is the critical range (

To the value in

c)

의 경우:

를

로 정정c)

In the case of:

To

Corrected by

이 때, Non-stationary의 특성을 갖는 잡음모델에서 사용하는 주변 잡음 예측치(상기의 실시예의 경우

)와 stationary의 특성을 갖는 잡음모델에서 사용하는 비트 플레인의 잡음 예측치(

)는 실제 원본영상과 보조정보의 차를 통해 얻어진 값이 아니므로, 이를 보상하기 위한 보상계수(

, Compensation factor)가 필요하다. At this time, the ambient noise prediction value used in the noise model having a non-stationary characteristic (in the case of the above embodiment)

) And noise prediction of the bit plane used in the noise model with stationary characteristics

) Is not a value obtained through the difference between the original image and the supplementary information, so a compensation coefficient (

, A compensation factor is required.

그에 따라, 다중잡음모델 생성부(23)는 비트 플레인에 존재하는 실제 잡음의 분산값을

라 하고, 라플라시안(Laplacian) 분포를 따른다면 [수학식 2]의 조건을 만족하는 보상계수를 사용한다.Accordingly, the multi-noise model generation unit 23 calculates the variance of the actual noise present in the bit plane.

If the Laplacian distribution is followed, a compensation factor that satisfies the condition of [Equation 2] is used.

그러므로, 다중잡음모델 생성부(23)는 실제의 잡음정보 추정 오류를 보상하기 위해 [수학식 2]의 조건을 만족하는 보상계수

를 변화시키면서 생성된 후보 잡음 모델들에 적용함으로써 복수개의 최종 후보 잡음 모델들을 생성한다. 따라서, 라플라시안(Laplacian) 잡음모델에 대한 후보 잡음모델은 [수학식 2]를 만족하는

의 값의 범위에 대하여 생성된다. Therefore, the multi-noise model generation unit 23 is a compensation coefficient that satisfies the condition of [Equation 2] to compensate for the actual noise information estimation error

A plurality of final candidate noise models are generated by applying to candidate noise models generated while varying. Therefore, the candidate noise model for the Laplacian noise model satisfies [Equation 2].

Generated for a range of values.

분산비디오 복호화 장치내의 원본정보 부재로 [수학식 2]를 만족하는 정확한 보상계수

의 값을 알 수는 없으나, 적어도 [수학식 2]를 만족하는

의 값의 범위는 알 수 있다. 따라서, 다중잡음모델 생성부(23)는

값의 최소값에서 최대값 사이의 값을

에 대입함으로써 여러 개의 최종 후보 잡음모델들을 생성할 수 있으며, 이는 Non-stationary/Stationary 특성 어느 경우에 상관없이 두 경우 모두에 대하여 적용할 수 있다. 도 4에서는

은 0, 0.01 및 0.02 등에 대해 최종 후보 잡음 모델들이 생성되는 경우를 예시적으로 나타낸다.Accurate compensation factor that satisfies Equation 2 due to absence of original information in distributed video decoding apparatus

Is unknown, but at least satisfies Equation 2

The range of values is known. Therefore, the multi-noise model generation unit 23

Value between the minimum and maximum values

Multiple final candidate noise models can be generated by substituting for, which can be applied to both cases regardless of non-stationary / stationary characteristics. In Figure 4

Exemplarily illustrates a case where final candidate noise models are generated for 0, 0.01, 0.02, and the like.

이때, 보상계수

는 라플라시안 잡음분포에 해당하는 하나의 예로써 다른 PDF에서도 이와 유사한 파라메터가 사용될 수 있으므로, 위의 보상계수

이외의 다른 파라메터를 이용하여 실제잡음모델과 생성된 잡음모델사이의 차이를 보상하는 것도 역시 가능하다. 따라서, 실제잡음모델과 생성된 잡음모델사이의 값의 보상을 수행하기 위하여 다른 파라메터 값을 사용하는 것은 본 특허 기본 사상에 어긋나는 것이 아니며, 본 특허의 기본 사상을 구현하는 개개의 특수한 예의 범위를 벗어나지 않는다.At this time, the compensation factor

Is an example of the Laplacian noise distribution, and similar compensation parameters can be used in other PDFs.

It is also possible to use other parameters to compensate for the difference between the actual noise model and the generated noise model. Therefore, the use of other parameter values to compensate for the value between the actual noise model and the generated noise model is not contrary to the basic idea of the present patent, and does not depart from the scope of each particular example of implementing the basic idea of the present patent. Do not.

이때, 보상계수 는 라플라시안 잡음분포에 해당하는 하나의 예로써 다른 PDF에서도 이와 유사한 파라메터가 사용될 수 있으므로, 위의 보상계수 외의 다른 파라메터를 이용하여 실제잡음모델과 생성된 잡음모델사이의 차이를 보상하는 것도 역시 가능하다. 따라서, 실제잡음모델과 생성된 잡음모델사이의 값의 보상을 수행하기 위하여 다른 파라메터 값을 사용하는 것은 본 특허 기본 사상에 어긋나는 것이 아니며, 본 특허의 기본 사상을 구현하는 개개의 특수한 예의 범위를 벗어나지 않는다.In this case, the compensation factor is an example corresponding to the Laplacian noise distribution, and similar parameters may be used in other PDFs. Therefore, the compensation factor compensates for the difference between the actual noise model and the generated noise model by using a parameter other than the above compensation factor. It is also possible. Therefore, the use of other parameter values to compensate for the value between the actual noise model and the generated noise model is not contrary to the basic idea of the present patent, and does not depart from the scope of each particular example of implementing the basic idea of the present patent. Do not.

이러한 방식으로 다른 잡음 확률분포(PMF)에 대하여도 여러 개의 후보 잡음모델들을 생성할 수 있으며 이렇게 생성된 모든 후보 잡음모델들을 모아서 다중잡음모델이 생성된다. 다중잡음모델을 생성하는 방법의 구현에는 여러 방법이 있을 수 있으며, 따라서 상기 실시 예에 의해 본 특허가 어떠한 범위로도 제한 받는 것은 아니다. In this way, several candidate noise models can be generated for different PMFs, and a multiple noise model is generated by collecting all the candidate noise models. There may be a number of ways to implement a method for generating a multi-noise model, and thus the present invention is not limited to any range by the above embodiments.

이와 같이, 다중잡음모델 생성부(23)는 보조 정보에 있는 잡음 오류를 추정하기 위한 복수개의 후보 잡음 모델들을 생성하고 이들 후보 잡음 모델들을 잡음 모델 결정부(26)로 제공한다.As such, the multiple noise model generator 23 generates a plurality of candidate noise models for estimating noise error in the auxiliary information and provides these candidate noise models to the noise model determiner 26.

잡음모델 결정부(26)는 생성된 복수개의 후보 잡음모델들 중 현재 소정 채널코딩단위의 복호화에 알맞은 잡음모델을 결정하거나 후보 잡음모델의 개수를 줄인다. The noise model determiner 26 determines a noise model suitable for decoding a current channel coding unit among a plurality of generated candidate noise models or reduces the number of candidate noise models.

채널코드 복호화부(24)는 결정된 상기 잡음모델에 기반하여 상기 보조정보 생성부에서 생성된 상기 보조정보의 잡음오류를 상기 부호화 장치로부터 전송되는 압축데이터를 사용하여 정정하여, 주어진 복호화 단위에 대한 소정 채널코딩단위들을 복호한다.The channel code decoder 24 corrects a noise error of the auxiliary information generated by the auxiliary information generator based on the determined noise model by using the compressed data transmitted from the encoding apparatus, and determines a predetermined decoding unit for a given decoding unit. Decode the channel coding units.

이러한 채널코드 복호화부(24)의 구성 및 동작을 도 5를 참조하여 설명한다.The configuration and operation of the channel code decoder 24 will be described with reference to FIG.

도 5는 도 2의 복호화 장치의 채널코드 복호화부의 구성을 도시한 도면이다.5 is a diagram illustrating a configuration of a channel code decoder of the decoding apparatus of FIG. 2.

도 5를 참조하여 설명하면, 채널코드 복호화부(24)는 잡음모델 결정부(26)에서 선택된 잡음모델을 기반으로 부호화 장치(10)에서 압축데이터(즉, 패리티데이터)를 점진적으로 전송받으며 보조정보의 잡음을 제거하는 채널코드 복호화기(510)와, 채널코드 복호화기(510)에서 출력된 연판정 출력값 및/또는 부호화기로부터 전송된 확인정보를 이용하여 채널코드 복호화 신뢰도를 측정하는 채널코드 복호화 신뢰도 계산부(520)를 포함한다. 여기서, 채널코드 복호화 신뢰도 계산부(520)는 채널코드 복호화기에서 출력된 연판정 출력값 및/또는 부호화기로부터 전송된 CRC(Cyclic Redundancy Code) 정보와 같은 확인정보를 이용하여 복호화 신뢰도를 측정한다. Referring to FIG. 5, the channel code decoder 24 gradually receives compressed data (ie, parity data) from the encoding apparatus 10 based on the noise model selected by the noise model determiner 26. Channel code decoding for measuring channel code decoding reliability using a channel code decoder 510 for removing noise of information and a soft decision output value output from the channel code decoder 510 and / or confirmation information transmitted from the encoder. The reliability calculator 520 is included. Here, the channel code decoding reliability calculator 520 measures the decoding reliability using the soft decision output value output from the channel code decoder and / or identification information such as cyclic redundancy code (CRC) information transmitted from the encoder.

연판정 출력을 통한 신뢰도 측정은 연판정 출력을 임계값

과 비교하여 임계값 보다 크면 그 복호화 결과를 신뢰하고 낮으면 신뢰하지 않는 방법으로 이루어진다. 이는 연판정 출력값의 경우 신뢰도가 낮을 수 있으므로 출력값을 바로 이용하기 보다는 일정이상의 신뢰도를 보이는 경우만 그 값을 이용하는 것이며, 이를 통하여 복호화 결과에 대한 에러율 산출이 가능하다. 이러한 방법은 복호화 결과의 신뢰도를 측정하는 하나의 예로 상기와 같은 실시 예 이외에도 다양한 접근이 가능하다.Reliability measurements with soft decision outputs threshold the soft decision outputs.

If it is larger than the threshold, the decoding result is trusted and if it is low, it is not trusted. In the case of the soft decision output value, the reliability may be low. Therefore, the value of the soft decision output value is used only when a certain level of reliability is shown, rather than using the output value immediately. This method is one example of measuring the reliability of the decoding result, various approaches are possible in addition to the above-described embodiment.

이 경우, 잡음모델 결정부(26)는 상기 다중잡음모델 생성부(23)에서 생성된 복수개의 후보 잡음모델들 중에서 주어진 소정 채널코딩단위에 적합한 잡음모델을 결정한다. 구체적으로, 잡음모델 결정부(26)에서는 복호화 장치에서 생성 또는 복원한 정보 및/또는 부호화 장치에서 전송된 확인정보에 기반하여 생성된 복수개의 후보 잡음모델들 중 현재 소정 채널코딩단위의 복호화에 알맞은 잡음모델을 선택하거나 후보 잡음모델의 개수를 줄인다. In this case, the noise model determiner 26 determines a noise model suitable for a given channel coding unit among a plurality of candidate noise models generated by the multiple noise model generator 23. In detail, the noise model determiner 26 is suitable for decoding a current channel coding unit among a plurality of candidate noise models generated based on information generated or restored by the decoding apparatus and / or confirmation information transmitted by the encoding apparatus. Choose a noise model or reduce the number of candidate noise models.

이에 대한 실시예는 다음과 같다.An embodiment thereof is as follows.

실시예Example 1 One

채널코드 복호화기(510)는 패리티데이터를 점진적으로 수신하므로 최초의 패리티데이터를 수신 받아 다중잡음모델을 사용하여 채널복호화를 수행한다. 이 때 최초의 패리티데이터는 점진적으로 여러번에 나누어 패리티데이터를 수신할 경우 1번째 수신되는 패리티데이터를 의미하며, 이후 복호화 장치의 요청에 따라 추가의 패리티데이터가 수신된다. 패리티데이터 수신 후 채널 복호화의 결과로 연판정 출력값이 나오면 채널코드 복호화 신뢰도 계산부(520)에서 신뢰도를 측정하여 임계값

보다 신뢰도가 높은 값들은 신뢰하고 낮은 값들은 신뢰하지 않는 방법으로 채널 복호화 결과(연판정 출력값)의 에러율을 계산한다. 이 때 잡음모델 결정부(26)에서는 수신된 패리티데이터로 채널 복호화가 이루어진 후 추정된 에러율을 바탕으로 1~N번의 잡음모델 중 수렴속도가 빠른 순으로 잡음모델을 추려나간다. 최종적으로 먼저 목표 에러율 이하의 추정된 에러율을 달성한 잡음모델이 소정 채널코딩단위에 대한 최적의 잡음모델이 된다. 이 때 최적의 잡음모델은 하나이상이 될 수도 있으며 소정 채널코딩단위에 대한 잡음모델로 모두 결정이 된다.Since the channel code decoder 510 gradually receives the parity data, the channel code decoder 510 receives the first parity data and performs channel decoding using the multiple noise model. In this case, the first parity data means parity data received first when the parity data is gradually divided into several times and additional parity data is received at the request of the decoding apparatus. If the soft decision output value is obtained as a result of channel decoding after the parity data is received, the channel code decoding reliability calculation unit 520 measures the reliability to determine a threshold value.

The error rate of the channel decoding result (soft decision output value) is calculated in such a way that the higher reliability values are reliable and the lower reliability values are not. At this time, the noise model determiner 26 deducts the noise model in the order of convergence speed among the 1 ~ N noise models based on the estimated error rate after channel decoding is performed on the received parity data. Finally, the noise model that first achieves the estimated error rate below the target error rate is the optimum noise model for a given channel coding unit. At this time, there may be more than one optimal noise model, and all of them are determined by the noise model for a given channel coding unit.

실시예Example 2 2

채널코드 복호화기(510)는 패리티데이터를 점진적으로 수신하므로 최초의 패리티데이터를 수신 받아 다중잡음모델을 사용하여 채널복호화를 수행한다. 이 때 최초의 패리티데이터는 점진적으로 여러번에 나누어 패리티데이터를 수신할 경우 1번째 수신되는 패리티데이터를 의미하며, 이후 복호화 장치의 요청에 따라 추가의 패리티데이터가 수신된다. 패리티데이터 수신 후 채널복호화의 결과로 연판정 출력값이 나오면 채널코드 복호화 신뢰도 계산부(520)에서 신뢰도를 측정하여 임계값

보다 신뢰도가 높은 값들을 신뢰하고 낮은 값들은 신뢰하지 않는 방법으로 채널 복호화 결과의 에러율을 계산한다. 이 때 추정된 에러율이 일정 에러율 이하를 갖게 되면 부호화기에서 전송된 CRC정보를 사용하여 CRC검사를 통하여 채널 복호화 결과의 오류 여부를 결정하게 된다. 이때 잡음모델 결정부(26)에서는 1~N번의 복수개의 후보 잡음모델들 각각을 사용하여 독립적으로 복호화하여 최소의 패리티데이터를 사용하여 CRC 검사를 통과한 잡음모델을 소정 채널코딩대상에 대한 최적의 잡음모델로 선택한다. 이 때 최적의 잡음모델은 하나이상이 될 수도 있으며 소정 채널코딩단위에 대한 잡음모델로 모두 결정이 된다.Since the channel code decoder 510 gradually receives the parity data, the channel code decoder 510 receives the first parity data and performs channel decoding using the multiple noise model. In this case, the first parity data means parity data received first when the parity data is gradually divided into several times and additional parity data is received at the request of the decoding apparatus. After receiving the parity data, if the soft decision output value is obtained as a result of channel decoding, the channel code decoding reliability calculation unit 520 measures the reliability to determine the threshold value.

The error rate of the channel decoding result is calculated by trusting more reliable values and not trusting lower values. At this time, if the estimated error rate has a predetermined error rate or less, it is determined whether an error of the channel decoding result is performed through the CRC check using the CRC information transmitted from the encoder. At this time, the noise model determiner 26 independently decodes each of a plurality of candidate noise models from 1 to N times, and then optimizes the noise model that has passed the CRC check using the minimum parity data for a predetermined channel coding object. Select to noise model. At this time, there may be more than one optimal noise model, and all of them are determined by the noise model for a given channel coding unit.

실시예Example 3 3

잡음모델 결정부(26)에서는 부호화 장치(10)에서 전송한 확인정보에 기반하여 생성된 복수개의 후보 잡음모델들 중 현재 소정 채널코딩단위의 복호화에 알맞은 잡음모델을 선택하거나 후보 잡음모델의 개수를 줄인다. 예를 들어 상기 다중잡음모델 생성부(23)의 다중잡음모델 생성 실시예 중 라플라시안 분포를 기반으로 한 후보 잡음모델들의 생성에서 사용된 값은 비트 플레인의 잡음예측치 (

)를 사용 하였을 때 이를 실제 잡음의 분산값()과 동일한 수준으로 만들기 위한 보상 계수이다. 따라서, 잡음모델 결정부(26)는 부호화 장치(10)로부터 부호화 단위에 해당하는 원본영상의 통계적인 데이터(

)를 전송 받은 후, [수학식 3]에 의한 실제의 분산값 추정이 가능하며, 이를 이용하여 사용할 다중잡음모델의 범위를 줄이거나 잡음모델을 선택 할 수 있다. 이 때, 사용 예에 따라 [수학식3]에서 사용한 [가정1], [가정2]는 바뀔 수 있으며 부호화 장치에서 전송된 확인정보를 사용한다는 점에서 본 특허의 본질적인 내용을 벗어나지 않는다.The noise model determiner 26 selects a noise model suitable for decoding a current channel coding unit from among a plurality of candidate noise models generated based on the confirmation information transmitted from the encoding apparatus 10, or selects the number of candidate noise models. Reduce For example, in the multi-noise model generation embodiment of the multi-noise model generation unit 23, it is used in the generation of candidate noise models based on the Laplacian distribution. The value is the noise prediction of the bit plane (

) Is a compensation factor to make it equal to the variance of the actual noise (). Accordingly, the noise model determiner 26 receives statistical data (the original image corresponding to the coding unit) from the encoding apparatus 10 (

After receiving), it is possible to estimate the actual variance value by [Equation 3], and use it to reduce the range of multiple noise model or select the noise model. At this time, [Assumption 1], [Assumption 2] used in [Equation 3] according to the use example can be changed and does not deviate from the essential contents of the present patent in that it uses the confirmation information transmitted from the encoding device.

E[x] : x값의 기대값, 평균E [x]: expected value, mean of x values

X : 복호화 단위에 해당하는 WZ픽춰의 원본정보X: Original information of WZ picture corresponding to decoding unit

Y : 복호화 단위에 해당하는 보조정보의 정보Y: information of auxiliary information corresponding to decoding unit

COV(X,Y) : X와 Y의 공분산COV (X, Y): Covariance of X and Y

[가정 1][Home 1]

E[N]=0 : 라플라시안 가정에서는 잡음에 대한 평균은 0이며, 사용에 따라 예측된 비트 플레인의 잡음예측치의 평균값 등 복호화 장치에서 생성되거나 복원되는 정보 및/또는 부호화 장치에서 전송된 다른 확인정보를 사용할 수 있으며 현재의 가정에 본 특허가 제약을 받는 것은 아니다.E [N] = 0: In the Laplacian hypothesis, the average for noise is 0, and information generated or restored by the decoding device, such as the average value of the noise prediction value of the predicted bit plane, and / or other confirmation information transmitted from the encoding device. May be used and this patent is not limited to the present assumption.

[가정 2][Home 2]

COV(N, X)=0 : 보조정보의 영상에서는 잡음과 원본영상의 상관관계는 없으므로 공분산의 값은 0이며, 사용에 따라 복호화 장치에서 생성되거나 복원되는 정보 및/또는 부호화 장치에서 전송된 다른 확인정보를 다른 확인정보를 사용할 수 있으며 현재의 가정에 본 특허가 제약을 받는 것은 아니다.COV (N, X) = 0: In the auxiliary information image, there is no correlation between noise and the original image, so the value of covariance is 0, and information generated or restored by the decoding device and / or other information transmitted by the encoding device according to use. The identification information may use other identification information, and this patent is not limited to the present assumption.

라플라시안 잡음 분포의 경우 외에도 가우시안, Two-sided Gamma분포 등 다른 잡음분포의 다중잡음모델의 선택 및 범위제한에 부호화 장치에서 전송된 확인정보를 사용하는 것도 가능하다.In addition to the Laplacian noise distribution, it is also possible to use the confirmation information transmitted from the coding device to select and range-limit multiple noise models of other noise distributions, such as Gaussian and Two-sided Gamma distributions.

이외에 다른 실시예가 존재할 수 있으며 다중잡음모델 중 소정 채널코딩단위에 대한 최적의 잡음모델을 선택하는 본 발명의 목적에서 벗어나지 않는다.Other embodiments may exist and do not depart from the object of the present invention of selecting an optimal noise model for a given channel coding unit among multiple noise models.

이와 같이, 채널코드 복호화부(24)는 주어진 복호화 단위에 대한 소정 채널코딩단위들을 복호화하여 영상 복원부(25)로 출력한다.As described above, the channel code decoder 24 decodes the predetermined channel coding units for the given decoding unit and outputs the decoded channel coding units to the image reconstruction unit 25.

영상 복원부(25)는 상기 채널코드 복호화부에서 복호화된 상기 소정 채널코딩단위들로 구성된 중간영상데이터 값과, 상기 보조정보 생성부에서 생성한 상기 보조정보와 상기 결정된 잡음모델들을 이용하여 WZ픽춰를 복원한다.The image reconstructor 25 uses a WZ picture by using an intermediate image data value composed of the predetermined channel coding units decoded by the channel code decoder, the auxiliary information generated by the auxiliary information generator, and the determined noise models. Restore it.

상기 영상 복원부(25)의 구성 및 동작을 도 6을 참조하여 설명한다.The configuration and operation of the image restoration unit 25 will be described with reference to FIG. 6.

도 6는 도 2의 복호화 장치의 영상 복원부의 구성을 도시한 도면이다.FIG. 6 is a diagram illustrating a configuration of an image reconstruction unit of the decoding apparatus of FIG. 2.

도 6을 참조하면, 영상 복원부(25)는 상기 채널코드 복호화부(24)에서 출력된 소정 채널코딩단위들을 저장하여 이들로 구성된 중간영상데이터를 출력하는 데이터 버퍼(610)와, 상기 잡음모델 결정부(26)에서 상기 소정 채널코딩단위를 복호화시 결정한 잡음모델을 저장하는 잡음모델 버퍼(620)를 포함한다. 영상 복원부(25)는 또한, 상기 잡음모델 버퍼에 저장된 결정된 잡음모델들 중 영상 복원에 가장 적합한 잡음모델을 결정하는 영상복원 잡음모델 결정부(630)와, 데이터 버퍼에서 출력된 중간영상데이터와 보조정보와 상기 영상복원 잡음모델 결정부에서 결정된 잡음모델을 이용하여 영상을 복원하는 영상복원기(640)를 포함한다.Referring to FIG. 6, the image reconstructor 25 stores a predetermined channel coding unit output from the channel code decoder 24 and outputs intermediate image data including the data buffer 610 and the noise model. The decision unit 26 includes a noise model buffer 620 that stores the noise model determined when the predetermined channel coding unit is decoded. The image restoration unit 25 may further include an image restoration noise model determination unit 630 for determining a noise model most suitable for image restoration among the determined noise models stored in the noise model buffer, and the intermediate image data output from the data buffer. And an image restorer 640 for restoring the image using the auxiliary information and the noise model determined by the image restoration noise model determiner.

여기서 영상복원 잡음모델 결정부(630)는, 상기 잡음모델 버퍼에 저장된 잡음 모델들 중 부호화 단위를 복원할 때 가장 유리한 잡음 모델을 결정한다. 결정을 위한 방법은 여러 가지가 존재할 수 있다. 이를 좀 더 구체적으로 설명하기 위하여, 변환영역에서의 동일 주파수 밴드의 비트 플레인을 소정 채널코딩단위로 사용하였을 경우에 대하여 다음의 서로 다른 네 가지 결정방법의 실시예들이 있다. 다음의 실시예를 벗어나는 방법도 존재할 수 있으며, 이외의 방법으로 결정을 한 경우라도 본 특허 기본 사상에 어긋나는 것이 아니며, 본 특허의 기본 사상을 구현하는 개개의 특수한 예의 범위를 벗어나지 않는다. Here, the image restoration noise model determiner 630 determines a noise model that is most advantageous when reconstructing a coding unit among the noise models stored in the noise model buffer. There are many ways to make a decision. In order to explain this in more detail, there are four different embodiments of the following determination method for the case where a bit plane of the same frequency band in a conversion region is used as a predetermined channel coding unit. There may also exist methods that deviate from the following embodiments, and the decision made by other methods does not deviate from the basic idea of the present patent, and does not depart from the scope of individual specific examples for implementing the basic idea of the present patent.

1. 최상위 비트 플레인이 도 3의 심벌 스트림에서 차지하는 값이 크므로 잡음모델 버퍼에 저장된 잡음 모델들 중 최상위 비트 플레인을 복호화시 결정된 잡음모델을 결정한다.1. Since the most significant bit plane occupies the symbol stream of FIG. 3, the noise model determined when decoding the most significant bit plane among the noise models stored in the noise model buffer is determined.

2. 중간영상 데이터와 중간영상 데이터에 해당하는 보조정보를 비교하여 보조정보의 잡음의 분포를 추정한 뒤 잡음모델 버퍼에 저장된 잡음모델들 중 중간영상데이터와 보조정보의 차에 해당하는 잡음을 가장 잘 추정하여 영상복원시 MSE(Mean Squared Error)값을 작게 할 수 있을 것으로 예상되는 잡음모델을 결정한다. 이 때 MSE는 복원영상의 화질을 평가하는 일예로써 다른 방법에 의한 평가도 가능하며, 이는 본 특허의 기본사상에 어긋나지 않는다.2. After estimating the distribution of noise of the supplementary information by comparing the supplementary information corresponding to the intermediate image data and the intermediate image data, the noise corresponding to the difference between the intermediate image data and the supplementary information among the noise models stored in the noise model buffer is estimated. By well estimating, we determine the noise model that is expected to reduce the mean squared error (MSE) during image restoration. At this time, the MSE is an example of evaluating the image quality of the reconstructed image and can be evaluated by other methods, which does not deviate from the basic idea of the present patent.

3. Stationary 잡음모델의 경우 소정 채널코딩단위 내 최소구성단위 각각에 대한 잡음 예측이 이루어 지지 않았으므로 블록 각각의 영상을 복원하기에 부적합 하다. 따라서non-stationary 잡음 모델 중 가장 최상위 비트 플레인을 복호화시 결정된 잡음모델을 사용한다.3. The stationary noise model is not suitable for reconstructing the image of each block because the noise prediction for each of the minimum constituent units within a given channel coding unit is not made. Therefore, the noise model determined when decoding the most significant bit plane of the non-stationary noise model is used.

4. 최하위 비트 플레인은 도 3을 참조하였을 경우, 심벌 스트림 혹은 양자화 된 원본영상을 구성하는 심벌 각각에 M이 최소값을 갖는 비트 플레인을 의미하며, 최하위 비트플레인 (예1)과 (예2)의 경우를 포함한다. 여기에서, M은 각각의 심벌 값(양자화된 화소 값 혹은 변환영역 계수값)이 2^M으로 양자화되었을 경우 심벌의 최하의 비트에서 부터 M번째 위치하는 비트가 양자화된 값의 최하위 비트를 의미한다. 따라서 중간 복원영상을 이루는 심볼에서 가장 작은 값을 표현하는 데이터이므로 잡음모델 버퍼에 저장된 잡음모델들 중 채널 복호화에 실패하지 않은 최하위 비트 플레인의 복호화 시 결정된 잡음모델을 사용한다.4. When referring to FIG. 3, the least significant bit plane means a bit plane in which M has a minimum value in each symbol constituting the symbol stream or the quantized original image, and the least significant bit plane of the least significant bit plane (Example 1) and (Example 2). Includes cases. Here, when M is a symbol value (quantized pixel value or transform region coefficient value) quantized to ^2M , M means the least significant bit of the quantized value of the bit located from the lowest bit of the symbol. Therefore, since the data represent the smallest value in the symbols of the intermediate reconstructed image, the noise model determined when decoding the least significant bit plane that does not fail channel decoding among the noise models stored in the noise model buffer is used.

이외에 다른 실시 예가 존재할 수 있으며 다중잡음모델 중 복호화 단위를 복원하는데 최적의 잡음모델을 선택하는 본 발명의 목적에서 벗어나지 않는다.Other embodiments may exist and do not depart from the object of the present invention of selecting an optimal noise model for reconstructing a decoding unit among multiple noise models.

본 발명에 따른 실시 예에서는 화소영역 및 변환영역에서의 예를 들었으며 이는 변환 영역의 경우 도 2와는 다르나, 도 7에 도시된 바와 같이 변환부(27) 및 역변환부(28)를 추가함으로써 설명할 수 있으며 이는 본 발명의 요지를 벗어나지 않아 별도로 설명하지 않는다. 또한 이에 대한 다양한 구성이 가능하다.In the exemplary embodiment of the present invention, examples of the pixel region and the conversion region are given, which are different from those of FIG. 2, but are illustrated by adding the transform unit 27 and the inverse transform unit 28 as shown in FIG. 7. This is not to be described separately without departing from the spirit of the invention. In addition, various configurations are possible.

따라서 본 발명의 설명에 기술된 부호화 단위는 본 발명의 구현에 따라 화소영역뿐 아니라 정수변환, DCT변환 또는 웨이블렛 변환에서의 변환계수를 포함하며 이러한 경우 역시 도 7의 변환부(27) 및 역변환부(28)로써 구현이 가능하다.Therefore, the coding unit described in the description of the present invention includes not only the pixel region but also the transform coefficients in the integer transform, DCT transform or wavelet transform according to the implementation of the present invention. In this case, the transform unit 27 and the inverse transform unit in FIG. This can be implemented as (28).

그리고 본 발명에 따른 실시 예에서는 일반적인 WZ 복호화 과정을 다루었으나 다양한 방식의 분산 부호화 비디오 영상의 복호화 장치에서도 복호화 과정 중 원본영상의 복호화를 위하여 다중잡음모델을 적용할 수 있으며 이러한 경우 역시 본 발명의 요지를 벗어나지 않는다.In the embodiment according to the present invention, a general WZ decoding process is dealt with, but a decoding apparatus of a distributed coded video image of various methods may also apply a multi-noise model to decode the original image during the decoding process. Do not escape.

상기와 같이 본 발명의 바람직한 실시 예에 따른 Wyner-Ziv 코딩기술에서의 다중잡음모델에 기반한 잡음추정 및 최적의 잡음선택이 이루어 질 수 있다.As described above, noise estimation and optimal noise selection based on a multiple noise model in the Wyner-Ziv coding technique according to the preferred embodiment of the present invention can be made.

도 8은 본 발명의 일 실시예에 따른 복호화 방법의 흐름도를 도시한 도면이다.8 is a flowchart illustrating a decoding method according to an embodiment of the present invention.

복호화 장치는 단계 810에서 부호화 장치로부터 전송되는 키픽춰 정보를 복원한다. 복호화 장치는 단계 820에서 상기 복원한 상기 키픽춰를 이용하여 보조정보를 생성한다. In operation 810, the decoding apparatus restores key picture information transmitted from the encoding apparatus. In operation 820, the decoding apparatus generates auxiliary information using the restored key picture.

이어서 복호화 장치는 단계 830에서 상기 보조정보에 있는 잡음오류들을 추정하기 위한 복수개의 후보 잡음모델들로 구성된 다중잡음모델을 생성한다. 복호화 장치는 단계 840에서 복수개의 후보 잡음 모델들 중 소정 채널코딩단위에 대해 사용할 잡음 모델을 결정한다. 일 실시예에서, 복호화 장치는 부호화 장치로부터 수신된 패리티데이터로 채널 복호화가 이루어진 후 추정된 에러율을 바탕으로 1~N번의 후보 잡음모델 중 수렴속도가 빠른 순으로 잡음모델을 선택한다. In operation 830, the decoding apparatus generates a multiple noise model including a plurality of candidate noise models for estimating noise errors in the auxiliary information. In operation 840, the decoding apparatus determines a noise model to be used for a predetermined channel coding unit among the plurality of candidate noise models. In one embodiment, after decoding the channel with the parity data received from the encoding apparatus, the decoding apparatus selects the noise models in the order of fast convergence speed from 1 to N candidate noise models based on the estimated error rate.

다른 실시예에서, 복호화 장치는 복수개의 후보 잡음모델들 각각을 사용하여 독립적으로 복호화하여 최소의 패리티데이터를 사용하여 CRC 검사를 통과한 잡음모델을 소정 채널코딩대상에 대한 최적의 잡음모델로 선택한다. 또 다른 실시예에서, 복호화 장치는 부호화 장치에서 전송한 확인정보, 예컨대, 부호화 단위에 해당하는 원본영상의 통계적인 데이터를 전송 받은 후, 실제의 분산값 추정이 수행하고, 이를 이용하여 사용할 다중잡음모델의 범위를 줄이거나 잡음모델을 선택 할 수 있다.In another embodiment, the decoding apparatus independently decodes each of the plurality of candidate noise models to select a noise model that passes the CRC check using the minimum parity data as an optimal noise model for a predetermined channel coding object. . In another embodiment, the decoding apparatus receives the identification information transmitted by the encoding apparatus, for example, statistical data of the original image corresponding to the coding unit, and then performs an actual variance estimation and uses the multiple noise to use the same. You can narrow the model or select a noise model.

복호화 장치는 사용할 잡음 모델을 선택한 후 단계 850에서 결정된 잡음 모델에 기초하여 상기 부호화 장치로부터 전송되는 압축데이터를 사용하여 보조 정보의 잡음 오류를 정정하고, 단계 860에서 주어진 복호화 단위에 대한 소정 채널코딩단위들을 복호한다. After selecting the noise model to be used, the decoding apparatus corrects the noise error of the auxiliary information by using the compressed data transmitted from the encoding apparatus based on the noise model determined in operation 850, and in step 860, the predetermined channel coding unit for the decoding unit. Decrypt them.

이어서, 복호화 장치는 단계 870에서 복호화된 상기 소정 채널코딩단위들로 구성된 중간영상데이터 값과, 상기 보조정보 생성부에서 생성한 상기 보조정보와 상기 결정된 잡음모델들을 이용하여 WZ픽춰를 복원한다.Subsequently, the decoding apparatus reconstructs the WZ picture by using the intermediate image data value composed of the predetermined channel coding units decoded in operation 870, the auxiliary information generated by the auxiliary information generator, and the determined noise models.

상기 후보 잡음모델들로 구성된 다중잡음모델을 생성하는 단계를 도 9를 참조하여 설명한다.The generation of the multi-noise model composed of the candidate noise models will be described with reference to FIG.

도 9는 본 발명의 일 실시예에 따른 다중 잡음 모델들을 생성하기 위한 흐름도를 도시한다.9 shows a flowchart for generating multiple noise models according to an embodiment of the present invention.

다중 잡음 모델을 생성하기 위해 복호화 장치는 단계 910에서 보조정보 전후의 키픽춰들 간의 차분을 이용하여 잡음을 예측한다. 구체적으로, 복호화 장치는 소정 채널코딩단위인 비트 플레인에 대해 생성된 보조정보상의 잡음을 예측하기 위하여, 해당 키픽춰 블록 간의 차이를 구한다. 예컨대, 복호화 장치는 보조정보 생성시 참조한 전후 키픽춰에 대한 움직임벡터가 가리키는 전후 키픽춰 상의 해당 블록을 찾고, 이 블록들의 동일한 주파수 밴드간의 차이를 잡음으로 예측한다. In order to generate a multiple noise model, the decoding apparatus predicts noise using a difference between key pictures before and after the auxiliary information in step 910. Specifically, the decoding apparatus obtains the difference between the corresponding key picture blocks in order to predict noise on the auxiliary information generated for the bit plane which is a predetermined channel coding unit. For example, the decoding apparatus searches for a corresponding block on the front and back key pictures indicated by the motion vector for the front and back key pictures referenced in generating the auxiliary information, and predicts the difference between the same frequency bands of the blocks as noise.

이어서, 복호화 장치는 단계 920에서 예측 잡음에 기초하여 잡음 확률 분포에 따른 잡음 모델을 생성한다. 일 실시예에서, 복호화 장치는 소정 채널코딩단위 내 최소구성단위인 비트 플레인을 구성하는 비트 각각에 대하여 잡음 확률분포(PMF)에 따라 잡음 모델을 생성한다. 전술한 바와 같이 상기 후보 잡음모델들을 생성하기 위한 잡음 확률분포(PMF)의 몇 가지 예로서 라플라시안(Laplacian), 가우시안(Gaussian), Two-sided Gamma 분포 등이 있다.In operation 920, the decoding apparatus generates a noise model based on the noise probability distribution based on the predicted noise. In one embodiment, the decoding apparatus generates a noise model according to the noise probability distribution (PMF) for each of the bits constituting the bit plane that is the smallest constituent unit in the predetermined channel coding unit. As described above, some examples of noise probability distributions (PMF) for generating the candidate noise models include Laplacian, Gaussian, and Two-sided Gamma distributions.

이어서, 복호화 장치는 단계 930에서 생성된 잡음 모델의 잡음 분포 특성에 따라 추가의 잡음 모델들을 더 생성한다. 구체적으로 복호화 장치는 잡음 확률 분포에 따라 생성된 잡음 모델의 Stationary/Non-stationary 특성 여부를 판단한다. 그리고, 복호화 장치는 Stationary 및 Non-stationary 특성들 중 잡음 모델이 가지지 않은 특성에 대해 잡음 모델을 더 생성한다. Subsequently, the decoding apparatus further generates additional noise models according to the noise distribution characteristic of the noise model generated in step 930. In detail, the decoding apparatus determines whether the stationary / non-stationary characteristics of the generated noise model are based on the noise probability distribution. The decoding apparatus further generates a noise model for a characteristic that the noise model does not have among the stationary and non-stationary characteristics.

예컨대, 복호화 장치는 채널코딩단위 내 최소구성단위 별로 각각 잡음을 예측하여 잡음 모델이 구성되었으면, Non-stationary 특성을 가지므로, 해당 채널코딩단위 내 모든 비트에 대하여 하나의 잡음을 사용하여 잡음 모델을 구성한다. For example, the decoding apparatus predicts noise for each of the minimum constituent units in the channel coding unit, and if the noise model is configured, it has a non-stationary characteristic. Configure.

이어서, 복호화 장치는 단계 940에서 실제의 잡음정보 추정 오류를 보상하기 위해 생성된 잡음 모델들을 변화시킴으로써 최종적인 후보 잡음 모델들을 생성한다. Subsequently, the decoding apparatus generates final candidate noise models by changing the generated noise models to compensate for the actual noise information estimation error in step 940.

전술한 바와 같이, 잡음예측은 추정된 움직임 벡터에 대응하는 전/후 키 픽춰상의 블록간 주파수 밴드의 차를 이용하여 실제의 잡음을 추정하는데 사용하므로 움직임 벡터의 추정이 정확하지 않거나, 이러한 잡음 예측방법이 잘 맞지 않는 경우에는 잘못된 잡음예측의 가능성이 높아, 이를 효과적으로 정정하는 것이 바람직하다. 그러므로, 복호화 장치는 예측을 통하여 생성된 후보 잡음모델들과 실제 잡음모델 간의 차이를 수치화하고 나중에 이를 보상하기 위한 목적으로 소정의 파라메터들을 정의하여 후보잡음 모델 생성시에 사용할 수 있다. 예를 들어, 잡음 모델을 생성하는 중 발생할 수 있는 스케일링 인자(scaling factor)를 정의하여 사용할 수 있다.As described above, the noise prediction is used to estimate the actual noise by using the difference between the inter-block frequency bands on the front and back key pictures corresponding to the estimated motion vector, so the estimation of the motion vector is not accurate or such noise prediction If the method does not fit well, there is a high possibility of false noise prediction, and it is desirable to correct it effectively. Therefore, the decoding apparatus may define the predetermined parameters for the purpose of quantifying the difference between the candidate noise models generated through the prediction and the actual noise model, and later compensate for this, and use them in generating the candidate noise model. For example, a scaling factor that may occur while generating a noise model may be defined and used.

예컨대, 복호화 장치는 실제의 잡음정보 추정 오류를 보상하기 위한 파라메터의 값을 변화시키면서 생성된 잡음 모델들에 적용함으로써 복수개의 후보 잡음 모델들을 생성한다. For example, the decoding apparatus generates a plurality of candidate noise models by applying the generated noise models while changing a parameter value for compensating an actual noise information estimation error.

따라서, 실제잡음모델과 생성된 잡음모델사이의 값의 보상을 수행하기 위하여 다른 파라메터 값을 사용하는 것은 본 특허 기본 사상에 어긋나는 것이 아니며, 본 특허의 기본 사상을 구현하는 개개의 특수한 예의 범위를 벗어나지 않는다.Therefore, the use of other parameter values to compensate for the value between the actual noise model and the generated noise model is not contrary to the basic idea of the present patent, and does not depart from the scope of each particular example of implementing the basic idea of the present patent. Do not.

도 10은 본 발명의 일 실시예에 따른 분산비디오 부호화 영상의 복호화 방법에 대한 성능 평가 결과를 나타내는 그래프이다.10 is a graph illustrating a performance evaluation result of a method of decoding a distributed video encoded image according to an embodiment of the present invention.

본 발명에서는 성능을 평가하기 위하여 Foreman, Stefan, Salesman, 그리고 Mobile & calendar의 시퀀스를 사용하여 모의실험을 실시하였다. 실험에 사용한 영상은 Salesman 시퀀스가 225 프레임인 것을 제외하면 나머지 것들은 모두 각각 150 프레임이며 크기는 QCIF, 프레임 율은 15Hz 이다. 본 모의실험을 통하여 기존의 잡음 모델과 본 발명의 일 실시예에 따른 잡음 모델 생성방법을 율-왜곡 성능면세서 상호 비교하고 그 결과를 분석하였다.In the present invention, a simulation was conducted using Foreman, Stefan, Salesman, and Mobile & calendar sequences to evaluate the performance. The images used in the experiment are 150 frames each, except that the Salesman sequence is 225 frames, the size is QCIF, and the frame rate is 15 Hz. Through the simulation, the noise model generation method according to an embodiment of the present invention and the noise model are compared with each other in the rate-distortion performance exemption and the results are analyzed.

도 10을 참조하면 율-왜곡(Rate-PSNR) 그래프 결과를 보면, 상대적인 율의 감소는 높은 QP(Quantization Parameter)가 낮은 QP의 결과에 비하여 컸는데 이는 잡음 모델 선택방법에 의한 성능이 어느 하나의 비트 플레인에 집중되기 보다는 전체 비트 플레인에 걸쳐 약간의 율을 감소시키므로 상대적으로 잡음제거에 패리티 정보를 덜 소모하는 상위의 비트 플레인에서 더 큰 효과가 있음을 알 수 있다. 또한, 동일한 이유로 제안하는 잡음 모델 선택방법은 부/복호화가 쉬운 선형적인 움직임이 많은 시퀀스에서 더 좋은 성능을 나타내며 여기서는 Mobile & calendar 영상이 이에 해당한다.Referring to FIG. 10, the rate-distortion (Rate-PSNR) graph results show that the decrease of the relative rate is larger than that of the high QP (low QP), which is a function of the noise model selection method. Rather than focusing on the bit plane, we reduce the rate slightly over the entire bit plane, so we can see a greater effect on the upper bit plane, which consumes less parity information for noise reduction. For the same reason, the proposed noise model selection method shows better performance in the sequence with many linear motions that are easy to encode and decode.

Claims

A decoding apparatus for decoding a distributed video encoded image,
A key picture reconstruction unit for reconstructing key picture information transmitted from the encoding apparatus;
An auxiliary information generating unit generating auxiliary information using the key picture restored by the key picture restoring unit;
A multi-noise model-based noise model providing unit for providing a noise model for use in a predetermined channel coding unit based on the multiple-noise model composed of a plurality of candidate noise models for estimating noise errors in the auxiliary information;
A channel code decoder for correcting a noise error of the auxiliary information based on the provided noise model using compressed data transmitted from the encoding apparatus, and decoding predetermined channel coding units for a given decoding unit;
And an image reconstruction unit for reconstructing a WZ picture by using the intermediate image data value composed of the predetermined channel coding units decoded by the channel code decoder, the auxiliary information generated by the auxiliary information generator, and the provided noise models. Decoding apparatus, characterized in that.

The method of claim 1, wherein the multiple noise model based noise model providing unit,
A multiple noise model generator for generating a multiple noise model composed of a plurality of candidate noise models for estimating noise errors in the auxiliary information;
And a noise model determiner configured to determine a noise model to be used for a predetermined channel coding unit among a plurality of candidate noise models generated by the multiple noise model generator.

The noise model of claim 1, wherein the multiple noise model generator predicts noise by using differences between key pictures before and after the auxiliary information, generates a noise model based on a noise probability distribution based on prediction noise, and generates the noise. Decoding apparatus, characterized in that for generating additional noise models further in accordance with the noise distribution characteristics of the model,

3. The method of claim 2, wherein the multiple noise model generator determines whether the generated noise model is a stationary / non-stationary characteristic according to the noise distribution characteristic of the generated noise model, and wherein the multiple noise model generator determines whether the generated noise model is a stationary / non-stationary characteristic. And a noise model is further generated for a characteristic that the generated noise model does not have.

The method of claim 3, And the multiple noise model generator generates a plurality of candidate noise models by applying the generated noise models while changing a parameter value for compensating an actual noise information estimation error.

3. The decoding apparatus of claim 2, wherein the noise probability distribution comprises at least one of Laplacian, Gaussian, generalized gaussian, and Two-sided Gamma distributions.

The decoding method of claim 1, wherein the noise model determiner selects the noise models in the order of convergence speed among candidate noise models based on the estimated error rate after channel decoding is performed on the parity data received from the encoding apparatus. Device.

The decoding method of claim 1, wherein the noise model determiner selects a noise model that has passed a CRC test using minimum parity data after being independently decoded using each of the plurality of candidate noise models. Device.

The method of claim 1, wherein the channel code decoding unit
A channel code decoder for correcting a noise error in auxiliary information corresponding to the predetermined channel coding unit by using the compressed data transmitted from the encoding apparatus and the noise model selected by the noise model determiner;
And a channel code decoding reliability measuring unit for measuring decoding reliability of a predetermined channel coding unit decoded by the channel code decoder.

10. The apparatus of claim 9, wherein the channel code decoding reliability measuring unit measures a decoding reliability of a channel code based on a soft output of the channel code decoder and / or a cyclic redundancy check (CRC) received from an encoding device. Decoding apparatus characterized in that for measuring.

The image restoring apparatus of claim 1, wherein the image restoration unit is used.
A data buffer for storing predetermined channel coding units output from the channel code decoder and outputting intermediate image data including the predetermined channel coding units;
A noise model buffer configured to store a noise model determined by the noise model determiner when decoding the predetermined channel coding unit;
An image restoration noise model determination unit for determining a noise model most suitable for image restoration among the determined noise models stored in the noise model buffer;
And an image restorer for reconstructing an image by using intermediate image data including auxiliary channel coding units stored in a data buffer, auxiliary information, and noise models determined by the image restoration noise model determiner.

A decoding method for decoding a distributed video encoded image,
Restoring key picture information transmitted from the encoding apparatus;
Generating auxiliary information using the reconstructed key picture;
Providing a noise model to be used for a predetermined channel coding unit based on a multiple noise model consisting of a plurality of candidate noise models for estimating noise errors in the auxiliary information;
Correcting a noise error of the auxiliary information using the compressed data transmitted from the encoding apparatus based on the provided noise model;
Decoding predetermined channel coding units for a predetermined decoding unit;
And reconstructing a WZ picture by using the decoded intermediate image data value composed of the predetermined channel coding units, the auxiliary information, and the provided noise model.

The method of claim 12, wherein providing the noise model comprises:
Generating a multiple noise model composed of a plurality of candidate noise models for estimating noise errors in the auxiliary information;
And determining a noise model to be used for a predetermined channel coding unit among the plurality of candidate noise models.

The method of claim 12, wherein generating the multiple noise model
Predicting noise using a difference between key pictures before and after the auxiliary information;
Generating a noise model according to a noise probability distribution based on the predicted noise, and further generating additional noise models according to the noise distribution characteristic of the generated noise model.

15. The method of claim 14, wherein generating the additional noise models further comprises:
Determining whether the generated noise model is a stationary / non-stationary characteristic according to the noise distribution characteristic of the generated noise model;
And generating a noise model for a characteristic of the stationary and non-stationary characteristics that the generated noise model does not have.

The method of claim 14, The generating of the multiple noise model may further include generating a plurality of candidate noise models by applying the generated noise models while changing a value of a parameter for compensating an actual noise information estimation error. Decoding device.

13. The decoding method of claim 12, wherein the noise probability distribution comprises at least one of Laplacian, Gaussian, generalized gaussian, and Two-sided Gamma distributions.

The method of claim 12, wherein the determining of the noise model includes selecting a noise model in order of convergence speed among candidate noise models based on an estimated error rate after channel decoding is performed on the parity data received from the encoding apparatus. Decoding method comprising a.

The method of claim 12, wherein the determining of the noise model comprises selecting a noise model that has passed a CRC test using minimum parity data after being independently decoded using each of the plurality of candidate noise models. Decoding method comprising a.

The method of claim 12, wherein the decoding step
And measuring decoding reliability of the channel code based on a soft output according to the channel code decoding and / or a cyclic redundancy check (CRC) received from an encoding device.