KR101285810B1

KR101285810B1 - A Super-Resolution method by Motion Estimation with Sub-pixel Accuracy using 6-Tap FIR Filter

Info

Publication number: KR101285810B1
Application number: KR1020110146474A
Authority: KR
Inventors: 권순찬; 유지상
Original assignee: 광운대학교 산학협력단
Priority date: 2011-12-29
Filing date: 2011-12-29
Publication date: 2013-07-12
Also published as: KR20130077646A

Abstract

연속적인 시간의 저해상도 영상을 입력받아, 6-Tap FIR 필터를 이용한 부화소 단위 움직임 추정을 통하여 고해상도 영상으로 복원하는 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법에 관한 것으로서, (a) 연속적인 시간의 저해상도 영상을 입력받되, 저해상도 영상은 목표 영상과 목표 영상의 전후 시간 영상으로 구성되는 단계; (b) 저해상도 영상들에 보간법을 적용하여 부화소를 구하여, 저해상도 영상들의 화소와 부화소가 포함된 영상(이하 부화소 영상)을 구하는 단계; (c) 목표 영상의 부화소 영상(이하 목표 부화소 영상)과 전후 시간 영상(이하 전후 부화소 영상)들 각각에 대하여, 최적의 블록정합을 탐색하여 움직임 벡터를 구하는 단계; 및, (d) 목표 부화소 영상의 각 화소(이하 목표점)에 대하여, 움직임 벡터에 의해 목표점에 정합되는 전후 부화소 영상의 화소(이하 정합점)를 찾고, 목표점을 정합점으로 보간하는 단계를 포함하는 구성을 마련한다.
상기와 같은 초해상도 영상 복원 방법에 의하여, 부화소 정밀도의 움직임 추정을 통하여 연속적인 저해상도 영상으로부터 보다 정확한 고주파성분을 복원할 수 있다.The present invention relates to a method for reconstructing a subpixel-based motion estimation-based superresolution image which receives continuous low-resolution images and reconstructs a high-resolution image by subpixel motion estimation using a 6-Tap FIR filter. Receiving a low resolution image, wherein the low resolution image comprises a target image and a time image before and after the target image; (b) obtaining a subpixel by applying interpolation to the low resolution images, and obtaining an image (hereinafter referred to as a subpixel image) including pixels and subpixels of the low resolution images; (c) obtaining a motion vector by searching for an optimal block match for each of the subpixel images (hereinafter referred to as the target subpixel images) and the before and after time images (hereinafter, referred to as subpixel images) of the target image; And (d) finding a pixel (hereinafter, a matching point) of the front and rear subpixel image matched to the target point by a motion vector, for each pixel of the target subpixel image (hereinafter, the target point), and interpolating the target point to the matching point. Provide a configuration to include.
By using the super resolution image reconstruction method as described above, more accurate high frequency components can be reconstructed from a continuous low resolution image through motion estimation of subpixel precision.

Description

Super-Resolution method by Motion Estimation with Sub-pixel Accuracy using 6-Tap FIR Filter}

본 발명은 연속적인 시간의 저해상도 영상(프레임)을 입력받아, 6-Tap FIR(finite impulse response) 필터를 이용한 부화소 단위 움직임 추정을 통하여 고해상도 영상으로 복원하는 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법에 관한 것이다.
According to the present invention, a subpixel motion estimation-based superresolution image reconstruction that receives continuous low resolution images (frames) and reconstructs a high resolution image through subpixel motion estimation using a 6-tap finite impulse response (FIR) filter It is about a method.

일반적으로, 영상을 확대하기 위해 기존 화소들 사이에 적절한 값을 삽입하는 기술을 영상 보간법(interpolation)이라 한다. 근래, 디지털 카메라 같은 디지털 영상 획득 매체의 개발로 고해상도 영상 보간의 중요성은 다양한 분야에서 점점 커지고 있다. 예를 들어 의료 분야에서 고해상도의 의료 영상은 의사들에게 있어 매우 중요하다. 이 분야에서 빈번히 사용되는 컴퓨터 패턴 인식의 수행 능력은 입력 영상의 해상도에 크게 좌우되기 때문이다.In general, a technique of inserting an appropriate value between existing pixels to enlarge an image is called image interpolation. Recently, with the development of digital image acquisition media such as digital cameras, the importance of high resolution image interpolation is increasing in various fields. In the medical field, for example, high-resolution medical images are very important for doctors. This is because the performance of computer pattern recognition, which is frequently used in this field, is highly dependent on the resolution of the input image.

영상 보간은 비월주사 방식의 영상을 순차주사 방식으로 전환하는 디-인터레이싱(de-interlacing)과 여러 장의 저해상도 영상들로부터 고해상도의 영상을 복원하는 초해상도(super-resolution) 기법과도 밀접한 관계가 있다. 디지털 영상 매체에 의해 획득된 영상은 고해상도 영상이 저주파 대역필터를 통과한 후, 다운 샘플링 된 영상으로 모델링 될 수 있다. 이 과정을 거치면서 획득된 영상은 앨리어싱(aliasing) 현상에 의해 많은 고주파 성분의 정보가 손실된다. 따라서 이러한 고주파 성분을 효과적으로 복원하는 것이 영상 보간에서 가장 중요한 기술이다.Image interpolation is also closely related to de-interlacing, which converts interlaced images into sequential scans, and super-resolution techniques to reconstruct high-resolution images from multiple low-resolution images. have. An image obtained by the digital image medium may be modeled as a down sampled image after the high resolution image passes through the low frequency band pass filter. An image obtained through this process loses information of many high frequency components due to an aliasing phenomenon. Therefore, effective reconstruction of such high frequency components is the most important technique in image interpolation.

전통적인 영상 보간법은 저해상도 영상 화소의 가중치 합(weighted-sum)에 기반을 두고 있으며, 그 중 대표적인 방법으로 최단입점(nearest neighborhood) 보간법, 이중선형(bi-linear) 보간법, 고등차수(bi-cubic) 보간법 등이 있다. 이러한 방법들은 선형 필터의 개념으로 해석될 수 있다. 즉 저해상도 영상은 필터의 입력 신호가 되고 이 영상에 곱해지는 가중치들은 필터 계수가 되며, 전체적인 영상의 확대 과정은 저해상도의 입력 영상을 업 샘플링한 후 필터를 적용하는 것으로 볼 수 있다. 따라서 필터 계수가 되는 가중치들의 주파수 특성을 확인함으로써 해당 보간법의 성능을 분석할 수 있다. 이러한 관점에서 볼 때 가중치 기반 방법들은 대부분 저주파 통과 필터 특성을 가지고 있으므로 앨리어싱에 의해 손상된 고주파 성분을 효과적으로 복원할 수 없다.Traditional image interpolation is based on the weighted-sum of low resolution image pixels, and the representative methods are the nearest neighborhood interpolation, bi-linear interpolation, and bi-cubic. Interpolation, etc. These methods can be interpreted as the concept of a linear filter. That is, the low resolution image becomes an input signal of the filter, and the weights multiplied by the image become filter coefficients. The process of enlarging the entire image may be regarded as applying the filter after upsampling the low resolution input image. Therefore, the performance of the interpolation method can be analyzed by confirming the frequency characteristics of the weights that become the filter coefficients. From this point of view, most of the weight-based methods have low-pass filter characteristics and thus cannot effectively recover high frequency components damaged by aliasing.

이러한 문제를 개선하기 위해, 연속된 다수의 프레임을 이용하여 단일 영상을 사용한 경우보다 정확도가 높은 고주파 성분을 복원할 수 있다.
To solve this problem, a plurality of consecutive frames can be used to restore high frequency components with higher accuracy than using a single image.

본 발명의 목적은 상술한 바와 같은 문제점을 해결하기 위한 것으로, 연속적인 시간의 저해상도 영상(프레임)을 입력받아, 6-Tap FIR(finite impulse response) 필터를 이용한 부화소 단위 움직임 추정을 통하여 고해상도 영상으로 복원하는 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법을 제공하는 것이다.SUMMARY OF THE INVENTION An object of the present invention is to solve the problems described above, and receives a low-resolution image (frame) of continuous time, and performs a high-resolution image through sub-pixel unit motion estimation using a 6-Tap finite impulse response (FIR) filter. The present invention provides a method of reconstructing a sub-resolution image based on subpixel unit motion estimation.

즉, 본 발명의 목적은 목표 영상의 앞, 뒤의 다수 프레임에서, 목표 영상 내 고주파 성분을 부화소(sub-pixel) 단위 움직임 추정을 통해 인접한 다른 영상들에서 찾아 고주파 성분 보간 시 이용하는 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법을 제공하는 것이다.That is, an object of the present invention is to find a high-frequency component in a target image in a plurality of frames before and after the target image in sub-pixel unity motions, and then use the sub-pixel unit to interpolate the high-frequency component. It is to provide a method for motion estimation based super resolution image reconstruction.

이때, 보다 정확한 움직임 추정이 결과 영상의 화질을 결정하기 때문에, 정수단위가 아닌 부화소 정밀도의 움직임 추정이 필요하고, 이를 위해, 입력 영상들의 부화소를 구하기 위해 H.264/AVC의 6-tap FIR 필터를 사용한다. 이 후 부화소 정밀도의 움직임 추정을 적용하여 목표영상에 정합시킨 후 D. Shepard에 의해 제안된 2차원 보간법을 이용해 목표영상의 빈공간을 보간하게 된다.In this case, since more accurate motion estimation determines the quality of the resultant image, motion estimation with sub-pixel precision is required, not an integer unit. For this purpose, 6-tap of H.264 / AVC is required to obtain sub-pixels of input images. Use an FIR filter. After that, after sub-pixel precision motion estimation is applied to the target image, the empty space of the target image is interpolated using the two-dimensional interpolation method proposed by D. Shepard.

상기 목적을 달성하기 위해 본 발명은 연속적인 시간의 저해상도 영상을 입력받아, 부화소 단위 움직임 추정을 통하여 고해상도 영상으로 복원하는 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법에 관한 것으로서, (a) 연속적인 시간의 저해상도 영상을 입력받되, 상기 저해상도 영상은 목표 영상과 상기 목표 영상의 전후 시간 영상으로 구성되는 단계; (b) 상기 저해상도 영상들에 보간법을 적용하여 부화소를 구하여, 상기 저해상도 영상들의 화소와 상기 부화소가 포함된 영상(이하 부화소 영상)을 구하는 단계; (c) 상기 목표 영상의 부화소 영상(이하 목표 부화소 영상)과 상기 전후 시간 영상(이하 전후 부화소 영상)들 각각에 대하여, 최적의 블록정합을 탐색하여 움직임 벡터를 구하는 단계; 및, (d) 상기 목표 부화소 영상의 각 화소(이하 목표점)에 대하여, 상기 움직임 벡터에 의해 상기 목표점에 정합되는 상기 전후 부화소 영상의 화소(이하 정합점)를 찾고, 상기 목표점을 상기 정합점으로 보간하는 단계를 포함하는 것을 특징으로 한다.In order to achieve the above object, the present invention relates to a sub-pixel unit motion estimation-based super-resolution image reconstruction method for receiving continuous low-resolution images and reconstructing a high-resolution image through sub-pixel unit motion estimation. Receiving a low resolution image having a normal time, wherein the low resolution image comprises a target image and a time image before and after the target image; (b) obtaining a subpixel by applying interpolation to the low resolution images, and obtaining an image including a pixel of the low resolution images and the subpixel (hereinafter referred to as a subpixel image); (c) obtaining a motion vector by searching an optimal block match for each of the subpixel images (hereinafter, referred to as target subpixel images) and the before and after time images (hereinafter, referred to as subpixel images) of the target image; And (d) for each pixel (hereinafter, referred to as a target point) of the target subpixel image, to find a pixel (hereinafter referred to as a matching point) of the front and rear subpixel image matched to the target point by the motion vector, and matching the target point to the match. And interpolating to points.

또, 본 발명은 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법에 있어서, 상기 전후 부화소 영상은 상기 목표 영상의 연속적인 이전 시간 영상 2개와 이후 시간 영상 2개로 구성되는 것을 특징으로 한다.The present invention also provides a method for reconstructing a subpixel-based motion estimation-based super resolution image, wherein the before and after subpixel image is composed of two consecutive previous time images and two subsequent time images of the target image.

또, 본 발명은 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법에 있어서, 상기 (b)단계에서, 상기 부화소 중 1/2 부화소는 6탭(tap) FIR(finite impulse response) 필터를 적용하고, 1/4 부화소는 상기 1/2 부화소에 이중선형(bi-linear) 보간법을 적용하여 구하는 것을 특징으로 한다.In addition, according to the present invention, in the subpixel unit motion estimation-based super resolution image reconstruction method, in step (b), one-half subpixels of the subpixels are applied with a 6-tap FIR filter. The quarter subpixel is obtained by applying a bi-linear interpolation method to the half subpixel.

또, 본 발명은 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법에 있어서, 상기 (c)단계에서, 상기 최적의 블록정합은 상기 목표 부화소 영상과 상기 전후 부화소 영상 간에 블록단위로 SAD(sum of absolute difference) 연산을 하여, 상기 SAD가 최소인 블록을 최적의 블록으로 선택하는 것을 특징으로 한다.In addition, according to the present invention, in the subpixel unit motion estimation based super resolution image reconstruction method, in step (c), the optimal block matching is performed in units of blocks between the target subpixel image and the front and rear subpixel images. of absolute difference) operation to select a block having the minimum SAD as an optimal block.

또, 본 발명은 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법에 있어서, 상기 블록단위는 2*2인 것을 특징으로 한다.In addition, the present invention is a subpixel unit motion estimation-based super-resolution image reconstruction method, characterized in that the block unit is 2 * 2.

또, 본 발명은 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법에 있어서, 상기 최적 블록을 선택하기 위해, 전역 탐색 방식 또는 나선형 탐색 방식으로 정합블록을 탐색하는 것을 특징으로 한다.In addition, the present invention is characterized in that in the sub-pixel unit motion estimation-based super-resolution image reconstruction method, the matching block is searched by a global search method or a spiral search method to select the optimal block.

또, 본 발명은 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법에 있어서, 상기 (d)단계에서, 상기 목표점은 상기 목표점과 정합점 사이의 거리에 의한 가중치에 의해 상기 정합점들을 보간하는 것을 특징으로 한다.The present invention also provides a subpixel unit motion estimation based super resolution image reconstruction method, wherein in step (d), the target point interpolates the matching points by a weight based on a distance between the target point and the matching point. It is done.

또, 본 발명은 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법에 있어서, 상기 (d)단계에서, 상기 목표점은 [수식 1]에 의해 상기 정합점들을 보간하는 것을 특징으로 한다.In another aspect of the present invention, in the subpixel unit motion estimation based super resolution image reconstruction method, in step (d), the target point is interpolated between the matching points by Equation 1.

[수식 1][Equation 1]

또한, 본 발명은 상기 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법을 수행하는 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 관한 것이다.
The present invention also relates to a computer-readable recording medium having recorded thereon a program for performing the method of performing subpixel unit motion estimation based super resolution image reconstruction.

상술한 바와 같이, 본 발명에 따른 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법에 의하면, 부화소 정밀도의 움직임 추정을 통하여 연속적인 저해상도 영상으로부터 보다 정확한 고주파성분을 복원할 수 있는 효과가 얻어진다.As described above, according to the subpixel unit motion estimation based super resolution image reconstruction method according to the present invention, an effect of reconstructing a high frequency component more accurately from a continuous low resolution image through motion estimation with subpixel precision is obtained.

특히, 실험 결과 기존 제시된 최단입점(nearest neighborhood) 보간법, 이중선형(bi-linear) 보간법, 고등차수(bi-cubic) 보간법을 사용한 결과 영상보다 주관적 및 객관적으로 약간 더 우수한 것을 확인할 수 있다.
In particular, the experimental results show that subjective and objective results are slightly superior to images obtained using the existing nearest neighborhood interpolation, bi-linear interpolation, and bi-cubic interpolation.

도 1은 본 발명을 실시하기 위한 전체 시스템의 구성을 도시한 도면이다.
도 2는 본 발명의 일실시예에 따른 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법을 설명하는 흐름도이다.
도 3은 본 발명의 일실시예에 따라 저해상도 영상이 만들어지는 과정을 설명하는 도면이다.
도 4는 본 발명에 따른 ADC 과정에서 발생하는 앨리어싱의 일례를 도시한 것이다.
도 4는 본 발명에 따른 ADC 과정에서 발생하는 앨리어싱의 일례를 도시한 것이다.
도 5는 본 발명에 따른 정수 단위 또는 부화소 단위로 화소가 이동하는 일례를 도시한 것이다.
도 6은 본 발명에 따른 1/2 화소 보간시의 6-tap FIR 필터 적용한 일례를 도시한 것이다.
도 7은 본 발명에 따른 전역 탐색 방식의 일례를 도시한 것이다.
도 8은 본 발명에 따른 나선형 탐색 방식의 일례를 도시한 것이다.
도 9는 본 발명에 따른 움직임 추정에 따른 영상간 위치관계와 보간 대상점의 위치의 일례를 도시한 것이다.
도 10은 본 발명에 따른 데이터 점 4개, 8개, 12개를 사용한 경우의 PSNR 측정결과의 일례를 도시한 것이다.
도 11은 본 발명에 따른 영상외곽의 격자현상 제거 전 및, 제거 후의 일례를 도시한 것이다.
도 12 내지 도 15는 본 발명에 따라 "Foreman (9번째 프레임)", "Car (8번째 프레임)", "Mother-daughter (6번째 프레임)"에 대한 원본영상, 최단입점, 이중선형, 고등차수, 및 본 발명의 고해상도 방법을 적용한 영상의 일례를 도시한 것이다.1 is a diagram showing a configuration of an overall system for carrying out the present invention.
2 is a flowchart illustrating a method of reconstructing a super resolution image based on subpixel unit motion estimation according to an embodiment of the present invention.
3 is a diagram illustrating a process of making a low resolution image according to an embodiment of the present invention.
Figure 4 illustrates an example of aliasing that occurs during the ADC process according to the present invention.
Figure 4 illustrates an example of aliasing that occurs during the ADC process according to the present invention.
5 illustrates an example in which a pixel moves in an integer unit or a subpixel unit according to the present invention.
FIG. 6 shows an example of applying a 6-tap FIR filter during 1/2 pixel interpolation according to the present invention.
7 illustrates an example of a global search method according to the present invention.
8 shows an example of a spiral search method according to the present invention.
9 shows an example of the positional relationship between the image and the position of the interpolation target point according to the motion estimation according to the present invention.
10 shows an example of the PSNR measurement results when four, eight, and twelve data points according to the present invention are used.
11 shows an example before and after removing the grid phenomenon of the image outline according to the present invention.
12 to 15 show the original image, the shortest-entry, double-linear, and higher for "Foreman (9th frame)", "Car (8th frame)", "Mother-daughter (6th frame)" according to the present invention. An example of an order and an image to which the high resolution method of the present invention is applied are shown.

이하, 본 발명의 실시를 위한 구체적인 내용을 도면에 따라서 설명한다.DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, the present invention will be described in detail with reference to the drawings.

또한, 본 발명을 설명하는데 있어서 동일 부분은 동일 부호를 붙이고, 그 반복 설명은 생략한다.
In the description of the present invention, the same parts are denoted by the same reference numerals, and repetitive description thereof will be omitted.

먼저, 본 발명을 실시하기 위한 전체 시스템의 구성의 예들에 대하여 도 1을 참조하여 설명한다.First, examples of the configuration of the entire system for carrying out the present invention will be described with reference to Fig.

도 1에서 보는 바와 같이, 본 발명에 따른 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법은 저해상도 영상(또는 프레임)(10)을 입력받아 상기 프레임(또는 영상)에 대하여 부화소 단위의 움직임 추정을 하고 고해상도 영상으로 복원하는 컴퓨터 단말(20) 상의 프로그램 시스템으로 실시될 수 있다. 즉, 초해상도 영상 복원 방법은 프로그램으로 구성되어 컴퓨터 단말(20)에 설치되어 실행될 수 있다. 컴퓨터 단말(20)에 설치된 프로그램은 하나의 프로그램 시스템(30)과 같이 동작할 수 있다.As shown in FIG. 1, the subpixel unit motion estimation based super resolution image reconstruction method according to the present invention receives a low resolution image (or frame) 10 and performs motion estimation of a subpixel unit with respect to the frame (or image). And a program system on the computer terminal 20 to restore the high resolution image. That is, the super resolution image restoration method may be configured as a program and installed and executed in the computer terminal 20. The program installed in the computer terminal 20 may operate like one program system 30.

한편, 다른 실시예로서, 초해상도 영상 복원 방법은 프로그램으로 구성되어 범용 컴퓨터에서 동작하는 것 외에 ASIC(주문형 반도체) 등 하나의 전자회로로 구성되어 실시될 수 있다. 또는 연속적인 프레임으로 구성된 저해상도 영상을 고해상도 영상으로 복원 등만을 전용으로 처리하는 전용 컴퓨터 단말(20)로 개발될 수도 있다. 이를 초해상도 영상 복원 장치라 부르기로 한다. 그 외 가능한 다른 형태도 실시될 수 있다.On the other hand, as another embodiment, the super-resolution image restoration method may be implemented as a program and operate in a general-purpose computer, and may be implemented as one electronic circuit such as an ASIC (custom-type semiconductor). Alternatively, the present invention may be developed as a dedicated computer terminal 20 which processes only a low resolution image composed of continuous frames to be restored to a high resolution image. This is called a super resolution image reconstruction device. Other possible forms may also be practiced.

저해상도 영상(10)은 연속적인 시간상에서 다수의 프레임으로 구성된 영상이다. 예를 들어, 현재 프레임이 t시간에서의 프레임이면, 이전 프레임은 t-1, t-2, t-3 등의 시간에서의 프레임이고, 이후 프레임은 t+1, t+2, t+3, ... 등의 시간에서의 프레임이다. 이들 프레임을 전후 프레임이라고 부르기로 한다. 영상은 시간적으로 연속된 다수의 프레임으로 구성된다. 즉, 영상은 ..., t-3, t-2, t-1, t, t+1, t+2, t+3, ... 시간 대의 프레임으로 구성된다. 이때, 현재 프레임, 즉, 관심 대상인 프레임을 목표 프레임이라고도 부르기로 한다. 또한, 영상은 다수의 프레임으로 구성되나, 이하에서, 특별히 구별의 필요성이 없으면, 프레임과 영상을 혼용하기로 한다.
The low resolution image 10 is an image composed of a plurality of frames in continuous time. For example, if the current frame is a frame at time t, the previous frame is a frame at times t-1, t-2, t-3, and the like, and the frames after t + 1, t + 2, t + 3. , ..., and so on at time. These frames will be referred to as front and rear frames. An image consists of a number of frames that are contiguous in time. That is, the image is composed of frames of time zones ..., t-3, t-2, t-1, t, t + 1, t + 2, t + 3, .... In this case, the current frame, that is, the frame of interest is referred to as a target frame. In addition, although the image is composed of a plurality of frames, in the following, unless there is a need for distinction, the frame and the image are mixed.

다음으로, 본 발명의 일실시예에 따른 부화소 단위 움직임 추정 기반 초해상도 영상 복원 방법을 도 2를 참조하여 설명한다.Next, a method of reconstructing a sub-resolution image based on subpixel unit motion estimation according to an embodiment of the present invention will be described with reference to FIG. 2.

도 2에서 보는 바와 같이, 본 발명의 일실시예에 따른 회전불변 질감특성 추출 방법은 (a) 시간상 일련의 저해상도 영상을 입력받는 단계(S10); (b) 저행상도 영상들에 보간법을 적용하여 부화소가 포함된 부화소 영상을 구는 단계(S20); (c) 목표 영상의 부화소 영상과 전후 시간 영상에 대해 최적의 블록정합을 탐색하여 움직임 벡터를 구는 단계(S30); 및, (d) 목표 부화소 영상의 목표점에 정합되는 전후 부화소 영상의 정합점으로 목표점을 보는 단계(S40)로 구성된다.As shown in Figure 2, the method of extracting the rotational invariant texture feature according to an embodiment of the present invention (a) receiving a series of low-resolution images in time (S10); (b) obtaining a subpixel image including a subpixel by applying interpolation to low resolution images (S20); (c) obtaining a motion vector by searching an optimal block match with respect to the subpixel image and the before and after time image of the target image (S30); And (d) viewing the target point as a matching point of the before and after subpixel image matched with the target point of the target subpixel image (S40).

보다 구체적으로, 본 발명의 일실시예에서는 우선 초해상도 방법을 적용하고자 하는 영상(또는 목표 영상, 목표 프레임)을 기준으로 하여 전, 후 각 2장의 프레임(이하 전후 프레임 또는 전후 영상, 전후 시간 영상)을 포함하여 총 다섯 장의 저해상도 영상을 입력으로 한다(S10). 이 다섯 장의 영상에 부화소(sub-pixel)를 구하기 위해 6-tap FIR(finite impulse response) 필터를 적용한다(S20). 그 후 1/4 화소 단위에서의 움직임 추정으로(S30) 목표 영상의 고주파 부분에 해당하는 부분을 다른 저해상도 영상에서 찾아, 이 점들을 이용하여 목표영상의 해상도를 증가시키게 된다(S40).
More specifically, in an embodiment of the present invention, first, after each of two frames (hereinafter, before and after frames, before and after images based on the image (or target image and target frame) to which the super resolution method is to be applied A total of five low resolution images, including), is input (S10). In order to obtain sub-pixels on the five images, a 6-tap finite impulse response (FIR) filter is applied (S20). Afterwards, the high frequency portion of the target image is found in another low-resolution image by the motion estimation in the unit of 1/4 pixel (S30), and the resolution of the target image is increased by using these points (S40).

본 발명의 일실시예에 따른 초해상도 영상 복원 방법을 구체적으로 설명하기 전에, 초해상도 영상을 복원하는 원리를 도 3 내지 도 5를 참조하여 보다 구체적으로 설명한다.Before specifically describing a method of restoring a super resolution image according to an embodiment of the present invention, the principle of restoring the super resolution image will be described in more detail with reference to FIGS. 3 to 5.

본 발명에서 저해상도 영상은 [수학식 1]과 같은 관측모델을 갖는다고 가정한다.In the present invention, it is assumed that the low resolution image has an observation model as shown in [Equation 1].

[수학식 1][Equation 1]

여기서 y_k는 k번째의 저해상도 영상, D는 다운샘플링 행렬, x_k와 n_k는 각각 k번째 고해상도 영상과 해당 영상에 더해지는 잡음이다. 본 발명에서는 도 3으로 나타내어진 [수학식 1]의 관측모델을 역으로 하여 목표영상인 x_k를 복원하는 과정과 그 결과를 보인다. Where y _k is the k-th low resolution image, D is the downsampling matrix, and x _k and n _k are the k-th high resolution image and the noise added to the corresponding image, respectively. In the present invention, the process of restoring the target image x _k is shown by the inverse of the observation model of [Equation 1] shown in FIG. 3 and the result.

사람이 육안으로 보는 영상은 연속적인 아날로그 신호이다. 영상신호를 컴퓨터 처리가 가능한 디지털 신호로 변환하기 위해서는 음성신호와 같이 신호를 디지털화 하는 과정이 필요하다. 이 과정에서 도 4와 같이 이미지 센서 등과 같은 디지털 기기의 성능적 제한으로 저해상도 영상이 되는 과정에서 샘플링 간격으로 인한 필연적인 앨리어싱(aliasing)이 발생한다. 이로 인해 원 영상신호의 고주파 성분이 손실된다. The image that the human eye sees is a continuous analog signal. In order to convert a video signal into a digital signal that can be processed by a computer, a process of digitizing the signal such as a voice signal is required. In this process, aliasing due to the sampling interval occurs in the process of becoming a low resolution image due to the performance limitation of a digital device such as an image sensor as shown in FIG. 4. As a result, high frequency components of the original video signal are lost.

동영상에서 현재의 프레임을 f(t)라하고, 바로 전의 프레임을 f(t-1)이라고 가정하면 장면의 전환부분이 아닌 경우 두영상은 서로 근소한 차이만을 가질 것이다. 도 5는 프레임간 존재하는 객체의 움직임을 나타낸다. 도 5에서 왼쪽 열은 f(t-1) 프레임, 오른쪽 열은 f(t) 프레임을 나타낸다. Assuming that the current frame in the video is f (t) and the previous frame is f (t-1), the two images will have only a slight difference when they are not transition parts of the scene. 5 illustrates the movement of an object existing between frames. In FIG. 5, the left column represents an f (t-1) frame and the right column represents an f (t) frame.

도 5(a)의 경우는 프레임 간 객체의 이동이 샘플링 간격인 정수단위로 이동한 경우를 나타낸다. 이 경우 디지털화 된 영상은 f(t-1)과 f(t)의 해당 객체가 같은 윤곽선 정보를 가지게 된다. 도 5(b)의 경우는 프레임 간 객체가 샘플링 간격보다 적은 부화소(sub-pixel) 단위로 이동한 경우이다. 이 경우 첫 번째 경우와 달리 두 영상은 같은 객체에 대하여 서로 다른 윤곽선 정보를 가지게 된다. 5 (a) shows a case in which the movement of an object between frames moves by an integer unit which is a sampling interval. In this case, the digitized image has the same outline information of the corresponding object of f (t-1) and f (t). In the case of FIG. 5B, an object between frames moves in a sub-pixel unit smaller than the sampling interval. In this case, unlike the first case, the two images have different contour information about the same object.

첫 번째 경우처럼 복수의 영상이 서로 같은 고주파 성분을 가지고 있다면, 해당 영상들로는 초해상도(super-resolution) 기법 적용이 불가능하다. 그러나 두 번째 경우는 서로 다른 고주파 정보를 모아서 초해상도 기법을 적용할 수 있다. 따라서 본 발명에서는 부화소 단위로 이동한 두 번째 경우를 이용하여 고주파 성분을 복원하게 된다.
If multiple images have the same high frequency components as in the first case, super-resolution techniques cannot be applied to the images. However, in the second case, super-resolution technique can be applied by collecting different high frequency information. Therefore, in the present invention, the high frequency component is restored by using the second case moved to the subpixel unit.

다음으로, 본 발명의 일실시예에 따른 초해상도 영상 복원 방법을 구체적으로 설명한다. 이하에서, 연속적인 시간의 저해상도 영상을 입력받고, 입력받은 저해상도 영상을 대상으로 초해상도 영상으로 복원한다. 이때, 상기 저해상도 영상은 목표 영상과 상기 목표 영상의 전후 시간 영상으로 구성된다. 특히, 이하의 실시예에서, 본 발명의 방법을 적용하고자 하는 영상(목표 영상)을 기준으로 하여 전, 후 각 2장의 프레임(이하 전후 시간 영상)을 포함하여 총 다섯 장의 저해상도 영상을 입력으로 한다. 즉, 상기 전후 부화소 영상은 상기 목표 영상의 연속적인 이전 시간 영상 2개와 이후 시간 영상 2개로 구성된다.Next, a method of reconstructing a super resolution image according to an embodiment of the present invention will be described in detail. Hereinafter, a low resolution image of a continuous time is input, and the received low resolution image is restored to a super resolution image. In this case, the low resolution image is composed of a target image and a time image before and after the target image. In particular, in the following embodiments, a total of five low-resolution images are input, including two frames (before and after time images) before and after the images (target images) to which the method of the present invention is to be applied. . That is, the before and after subpixel image is composed of two consecutive previous time images and two subsequent time images of the target image.

먼저, 저해상도 영상들에 보간법을 적용하여 부화소를 구하여, 상기 저해상도 영상들의 화소와 상기 부화소가 포함된 영상(이하 부화소 영상)을 구한다(S20). 특히, 상기 부화소 중 1/2 부화소는 6탭(tap) FIR(finite impulse response) 필터를 적용하고, 1/4 부화소는 상기 1/2 부화소에 이중선형(bi-linear) 보간법을 적용하여 구한다.First, a subpixel is obtained by applying interpolation to low resolution images, and an image (hereinafter referred to as a subpixel image) including a pixel of the low resolution images and the subpixel is obtained (S20). In particular, one-half subpixel of the subpixel applies a six-tap finite impulse response (FIR) filter, and one-quarter subpixel applies bi-linear interpolation to the half subpixel. Obtain by applying

즉, 앞서 언급한 부화소(sub-pixel) 단위의 움직임 정보를 얻기 위해서는 우선 입력 영상들의 부화소를 찾아야 한다. 본 발명의 일실시예에서는 입력 영상들마다 [수학식 2]의 H.264/AVC 의 6-tap FIR(finite impulse response) 필터를 적용하여 부화소를 찾는다.That is, in order to obtain the above-described sub-pixel motion information, the sub-pixels of the input images must first be found. In an embodiment of the present invention, a sub-pixel is found by applying a 6-tap finite impulse response (FIR) filter of H.264 / AVC of Equation 2 for each input image.

[수학식 2] &Quot; (2) "

여기서 b와 h는 각각 1/2 화소 단위의 화소값을 의미하고, A,C,G,M,R,T,E,F,H,I,J는 각각 정수단위의 원본 화소값을 의미한다. 각 정수단위의 화소에 곱해지는 숫자는 가중치를 나타낸다.
Where b and h each represent pixel values in units of 1/2 pixel, and A, C, G, M, R, T, E, F, H, I, and J represent original pixel values in integer units, respectively. . The number multiplied by the pixel of each integer unit represents a weight.

[수학식 2]의 이해를 돕기 위해 도 6을 가지고 설명한다. 먼저 가로 방향과 세로 방향의 원본 화소에 가중치를 주어 b와 h를 구하게 된다. 1/4 화소는 1/2 화소를 구한 다음 여기에 이중선형(bi-linear) 보간법을 적용하여 얻게 된다.It will be described with reference to Figure 6 to help understand the equation (2). First, the original pixels in the horizontal and vertical directions are weighted to find b and h. A quarter pixel is obtained by obtaining a half pixel and then applying bi-linear interpolation.

상기와 같이, 1/4 화소, 1/2 화소 등 부화소를 구하고, 상기 부화소와 원 영상의 화소들로 구성된 영상(또는 프레임)을 부화소 영상이라고 부르기로 한다. 부화소 영상은 저해상도 영상별로 각각 구한다. 즉, 앞서의 일례에서 5개의 저해상도 영상을 이용하므로, 부화소 영상도 모두 5개가 구해진다. 이때, 목표 영상에 대한 부화소 영상을 목표 부화소 영상이라고 부르기로 하고, 전후 시간 영상에 대한 부화소 영상을 전후 부화소 영상이라고 부르기로 한다.
As described above, subpixels such as 1/4 pixels and 1/2 pixels are obtained, and an image (or frame) composed of the subpixels and the pixels of the original image is called a subpixel image. Subpixel images are obtained for each low resolution image. That is, since five low resolution images are used in the above example, five subpixel images are also obtained. In this case, the subpixel image of the target image is called a target subpixel image, and the subpixel image of the before and after time image is called a before and after subpixel image.

다음으로, 상기 목표 영상의 부화소 영상(이하 목표 부화소 영상)과 상기 전후 시간 영상(이하 전후 부화소 영상)들 각각에 대하여, 최적의 블록정합을 탐색하여 움직임 벡터를 구한다(S30). 특히, 상기 최적의 블록정합은 상기 목표 부화소 영상과 상기 전후 부화소 영상 간에 블록단위로 SAD(sum of absolute difference) 연산을 하여, 상기 SAD가 최소인 블록을 최적의 블록으로 선택한다.Next, an optimal block match is searched for each of the subpixel images (hereinafter referred to as the target subpixel images) and the before and after time images (hereinafter, referred to as subpixel images) of the target image to obtain a motion vector (S30). In particular, the optimal block matching performs a sum of absolute difference (SAD) operation in units of blocks between the target subpixel image and the front and rear subpixel images, and selects the block having the smallest SAD as an optimal block.

1/2 와 1/4 화소를 모두 구한 후 2*2 블록 단위의 움직임 추정을 하게 된다. 우선 SAD(sum of absolute difference) 연산을 통해 최적 정합블록을 선택한다. 도 7과 도 8은 각각 다른 최적 블록의 탐색 방법을 나타낸다. 도 7은 전역 탐색(full search) 방식을 나타내며 만약 64*64 크기의 탐색 블록(search block)을 사용한다고 하면, 64*64개의 SAD 값이 필요하다. 하나의 SAD 값을 산출하기 위해 [수학식 3]을 사용한다. After both 1/2 and 1/4 pixels are obtained, motion estimation is performed in units of 2 * 2 blocks. First, an optimal matching block is selected through a sum of absolute difference (SAD) operation. 7 and 8 illustrate a method of searching for different optimal blocks, respectively. FIG. 7 illustrates a full search method. If a 64 * 64 search block is used, 64 * 64 SAD values are required. Equation 3 is used to calculate a single SAD value.

[수학식 3] &Quot; (3) "

여기서 SAD(i, j) 는 (i, j) 위치의 SAD 값이다. 본 발명의 일실시예에서는 64*64의 탐색 블록을 사용하므로 i, j 는 각각 0~63의 값을 가진다. B_t 와 B_t-1 은 각각 t와 t-1 시각에서의 프레임의 2*2 크기의 매크로 블록(macro block)을 나타낸다. 따라서 x, y 는 0~1의 값을 가진다. 특히, B_t _-1은 (i, j) 위치에서의 매크로 블록이다.Where SAD (i, j) is the SAD value at position (i, j). In an embodiment of the present invention, since 64 * 64 search blocks are used, i and j have values of 0 to 63, respectively. B _t and B _t-1 denote macroblocks of size 2 * 2 of the frame at time t and t-1, respectively. Therefore, x and y have values of 0-1. In particular, B _t _-1 is a macroblock at position (i, j).

여기서 i와 j는 탐색영역(search range)로, B_t를 탐색영역(search range)의 중심으로 하여 64*64의 크기를 가지는 정사각형이다. [수학식 3]에서 B_t는 기준 영상에서의 ‘고정된’2*2 블록, B_t _-1은 탐색영역(search range)의 (0, 0)부터 (63, 63)까지 한 픽셀 단위로 이동하며 총 64*64개의 SAD 값을 구해내는 2*2의 매크로 블록이다. [수학식 3]에서는 B_t가 고정되어있다는 것을 전제로 한 것이다. 결국, i와 j는 탐색영역(search range), x와 y는 움직이는 블록 및 고정된 블록내의 좌표 (여기서는 x, y 모두 0~1의 값을 가짐)가 된다.Where i and j is a square having a size of 64 * 64 as a center of the search region (search range), the search region (search range) of B _t. In Equation 3, B _t is a 'fixed' 2 * 2 block in the reference image, and B _t _-1 is one pixel unit from (0, 0) to (63, 63) in the search range. It is a 2 * 2 macroblock that moves and yields 64 * 64 SAD values. Equation 3 is based on the assumption that B _t is fixed. As a result, i and j are search ranges, x and y are moving blocks and coordinates in fixed blocks (where x and y both have values of 0 to 1).

본 발명의 일실시예에서는 이전 두 프레임과 이후 두 프레임을 사용하므로 B_t-2, B_t-1, B_t+1, B_t+2 로 각각의 SAD 식을 갖는다. 즉 목표 영상의 B_t 의 위치를 고정한 상태에서 기타 저해상도 영상의 매크로 블록(macro block)을 탐색 블록 내에서 움직이며 SAD값을 채우는 것이다. 이렇게 64*64의 SAD 값을 모두 구한 뒤, 최소값을 찾아내 그 방향으로 영상이 천이되었다고 가정하여 움직임 벡터를 구하게 된다.In an embodiment of the present invention, since the previous two frames and the subsequent two frames are used, B _t-2 , B _t-1 , B _{t + 1} , and B _{t + 2} have respective SAD equations. That is, while the position of B _t of the target image is fixed, the macro block of the other low resolution image is moved in the search block to fill the SAD value. After all the 64 * 64 SAD values are obtained, the minimum value is found and the motion vector is calculated assuming that the image has shifted in that direction.

탐색 방식에는 크게 두 가지가 있는데, 대표적으로 도 7에 나타낸 전역 탐색은 시간을 많이 소모하므로, 본 발명의 일실시예에서는 도 8과 같이 고정 블록을 중심으로 일정 범위의 가까운 곳을 탐색하는 나선형 탐색(spiral search)을 사용한다.There are two types of searching methods. Since the global search shown in FIG. 7 is time consuming, a spiral search for searching near a predetermined range around a fixed block is shown in FIG. 8. Use (spiral search).

도 8(a)는 나선형 탐색 방식을 택했을 경우 2*2 블록의 좌측 상단 점을 기준점으로 하여 이동시의 좌표 변화를 나타낸 것이다. 도 8(b)는 나선형 탐색에서 탐색 범위의 단위를 나타낸 것으로 보통 가로 방향과 세로 방향으로 한 번씩 움직일 때 1킥(kick)을 탐색했다고 정한다.FIG. 8 (a) shows a change in coordinates when moving with the upper left point of a 2 * 2 block as a reference point when the spiral search method is selected. FIG. 8 (b) shows the unit of the search range in the spiral search, and it is determined that one kick is searched when moving in the horizontal direction and the vertical direction once.

상기와 같이 탐색된 블록을 이용하여, 상기 목표 영상을 기준으로 나머지 4개의 저해상도 입력 영상들에 대하여 1/4 화소 단위의 움직임 추정을 한다. 이 과정을 통하여 목표 저해상도 영상과 나머지 저해상도 영상들 간의 위치관계를 움직임 벡터를 통해 도 9(a)와 같이 얻을 수 있다.Using the searched block as described above, motion estimation in units of 1/4 pixels is performed on the remaining four low resolution input images based on the target image. Through this process, the positional relationship between the target low resolution image and the remaining low resolution images can be obtained as shown in FIG. 9 (a) through a motion vector.

도 9(a)에서 원 모양의 화소는 목표 영상의 원본 화소를, 사각형과 다이아몬드 모양의 화소는 각각 나머지 저해상도 영상의 움직임 벡터를 바탕으로 한 상대 위치를 나타낸다. 도 9(b)에서 표시된 부분은 보간해야 할 대상 점들을 나타낸다.In FIG. 9 (a), the circular pixels represent the original pixels of the target image, and the square and diamond pixels each represent a relative position based on the motion vectors of the remaining low resolution images. The portion indicated in FIG. 9B represents target points to be interpolated.

이때, 목표 영상의 원본 화소(또는 보간해야 할 점)를 목표점이라 하고, 움직임 벡터에 의한 상대 위치, 즉, 전후 영상에서의 상대 위치(또는 대응되는 위치)의 화소를 정합점이라고 부르기로 한다. 정합점은 움직임 벡터에 의해 목표점 위치와 대응되는 위치이다.
In this case, an original pixel (or a point to be interpolated) of the target image is called a target point, and a pixel of a relative position by a motion vector, that is, a pixel of a relative position (or a corresponding position) in the before and after image is called a matching point. The matching point is a position corresponding to the target point position by the motion vector.

다음으로, 상기 목표 부화소 영상의 각 화소(이하 목표점)에 대하여, 상기 움직임 벡터에 의해 상기 목표점에 정합되는 상기 전후 부화소 영상의 화소(이하 정합점)를 찾고, 상기 목표점을 상기 정합점으로 보간한다(S40). 특히, 상기 목표점은 상기 목표점과 정합점 사이의 거리에 의한 가중치에 의해 상기 정합점들을 보간한다.Next, for each pixel (hereinafter, referred to as a target point) of the target subpixel image, a pixel (hereinafter referred to as a matching point) of the front and rear subpixel image matched to the target point by the motion vector is found, and the target point is referred to as the registration point. Interpolate (S40). In particular, the target point interpolates the matching points by a weight by the distance between the target point and the matching point.

상기 목표 영상의 해상도를 보간하고자 할 때 부족한 고주파 성분을 인접한 다른 프레임에서 보충한다. 그러나 도 9(a)처럼 단순히 다른 저해상도 영상들과 정합(registration)만 완료된 상태에서는 목표 보간점의 화소 값으로 사용할 수 없다. 정합된 화소 데이터를 참고로 하여 목표 보간점의 화소 값을 구하는 과정이 필요하다. 우선 목표 보간점을 기준으로 하여 가장 가까운 점들 중 몇 개를 선택할 것인지를 정해야 한다.When the resolution of the target image is interpolated, insufficient high frequency components are compensated for in another adjacent frame. However, as shown in FIG. 9 (a), when only registration with other low resolution images is completed, it cannot be used as a pixel value of the target interpolation point. It is necessary to obtain a pixel value of the target interpolation point with reference to the matched pixel data. First, determine how many of the closest points to select based on the target interpolation point.

화소간의 거리를 구하는 기준은 [수학식 4]와 같이 유클리드 거리(Euclidian distance)를 사용한다.As a criterion for calculating the distance between pixels, Euclidian distance is used as shown in [Equation 4].

[수학식 4] &Quot; (4) "

여기서 (x-x_i) 는 가로방향의 움직임을, (y-y_i)는 세로방향의 움직임을 나타낸다.Where (xx _i ) represents horizontal movement and (yy _i ) represents vertical movement.

목표점의 보간에 사용되는 데이터의 수는 결과 영상에 영향을 준다. 그 수가 너무 적으면 고주파성분을 보간하기에 정보가 부족하고, 반대로 그 수가 너무 많다면 결과적으로 구하고자 하는 성분과는 관계가 적은 점을 참고하게 되어 오히려 보간법의 정확도를 떨어뜨린다.The number of data used for interpolation of the target point affects the resulting image. If the number is too small, the information is insufficient to interpolate the high frequency components. On the contrary, if the number is too large, the interpolation method may reduce the accuracy of the interpolation method.

정합된 화소 값들을 목표점으로 보간하는 데에는 여러 방법이 있는데, 본 논문에서는 [수학식 5]와 같이 D. Shepard에 의해 제시된 보간법을 사용한다.There are several methods for interpolating matched pixel values to target points. In this paper, we use the interpolation method proposed by D. Shepard as shown in [Equation 5].

[수학식 5] [Equation 5]

[수학식 5]는 2차원 공간에서 해당위치를 기준으로 하여 불규칙적으로 퍼져있는 점들과의 거리에 의한 가중치를 이용한 보간법이다. [수학식 5]에서 f₁(P) 는 목표 보간점(또는 목표점)의 값, d_i는 각 목표 보간점과 정합점과의 거리, z_i 는 해당 데이터 점의 값을 나타낸다. u는 보간 식의 계수로서 이 수치에 따라 결과 영상의 평탄함이 다르게 나타난다. 만약 수집된 정합점 중 목표 보간점과의 거리가 0인 경우, 해당 값을 그대로 목표 보간점에 대입한다. N은 목표점에 사용하는 정합점의 개수이다.Equation 5 is an interpolation method using weights based on distances from irregularly spread points based on a corresponding position in a two-dimensional space. In Equation 5, f ₁ (P) represents a value of a target interpolation point (or target point), d _i represents a distance between each target interpolation point and a matching point, and z _i represents a value of a corresponding data point. u is the coefficient of the interpolation equation, and the flatness of the resultant image varies according to this value. If the distance from the matching interpolation point among the collected matching points is 0, the corresponding value is substituted into the target interpolation point. N is the number of matching points used for the target point.

다음으로, 본 발명의 효과를 도 10 내지 도 17을 참조하여 보다 구체적으로 설명한다. Next, the effects of the present invention will be described in more detail with reference to FIGS. 10 to 17.

본 발명의 일실시예의 성능을 실험하기 위해 사용된 실험 영상은 352*288 크기의 'Foreman', 'Car', 'Mother-daughter' 영상이다. 실험은 원본 영상을 176*144 크기의 영상으로 다운 샘플링 한 후 각 실험 영상의 최초 10장의 프레임을 최단입점(nearest neighborhood), 이중선형(bi-linear), 고등차수(bi-cubic) 보간법과 본 발명을 사용하여 다시 원래 해상도로 복원 후 주관적, 객관적 결과를 비교하는 것으로 진행한다. Experimental images used to test the performance of an embodiment of the present invention are '352', 'Foreman', 'Car', 'Mother-daughter' image. The experiment downsamples the original image into a 176 * 144 size image, and then extracts the first 10 frames of each experimental image from the nearest neighborhood, bi-linear, and bi-cubic interpolation. The invention proceeds back to the original resolution and then compares the subjective and objective results.

객관적인 성능 비교를 위해 각 결과들의 원본과의 PSNR을 측정한 결과를 도 10(a), 도 10(b), 도 10(c)에 각각 나타내었다. 각 도의 표의 두 번째 행의 열은 [수학식 5]에서 계수 u의 값을 나타낸다. 각 표의 3행부터 실험에 사용된 영상의 장면(sequence) 번호를 나타내며, 각 장면을 이루고 있는 행에 순서대로 제안된 기법, 이중선형, 고등차수, 최단입점 보간법을 사용한 결과를 나타내었다. 도 10(a)부터 도 10(c)까지를 참고하면 계수 u가 작을수록, 사용한 데이터 점의 수가 많을수록 PSNR 수치가 상승하는 것을 확인할 수 있다. 기존 보간법들과 제안된 기법간의 PSNR 차이는 도 10(a)부터 도 10(c)까지 그 정도의 차이는 있으나, 실험 한 모든 조건에서 다른 보간법 보다 PSNR 수치가 높다는 것을 확인할 수 있다.For objective performance comparison, the results of the PSNR measurement with the original of the results are shown in FIGS. 10 (a), 10 (b) and 10 (c), respectively. The column of the second row of the table of each figure represents the value of the coefficient u in [Equation 5]. From the third row of each table, the sequence numbers of the images used in the experiments are shown, and the results of using the proposed technique, the double linearity, the higher order, and the shortest-entry point interpolation method are shown in order for the rows forming each scene. 10 (a) to 10 (c), it can be seen that the smaller the coefficient u and the higher the number of used data points, the higher the PSNR value. The difference in PSNR between the existing interpolation methods and the proposed method is similar to that of FIGS. 10 (a) to 10 (c), but it can be confirmed that PSNR values are higher than other interpolation methods under all the experimental conditions.

도 11(a)는 6-tap FIR(finite impulse response) 필터 보간 과정에서 영상의 최외곽에서 나타나는 격자현상을 붉은 사각형으로 나타낸다. 대부분의 영상에서 관심 영역은 외곽보다는 중앙 혹은 그 외의 영역에 위치하기 때문에 외곽 격자현상의 제거와 움직임 추정시간의 단축을 위해 영상 최외곽 부분의 보간에는 이중선형 보간법을 적용하여 이 현상을 제거한 결과를 도 11(b)에 나타내었다.FIG. 11 (a) shows a lattice phenomenon appearing at the outermost part of an image in a red square in a 6-tap finite impulse response (FIR) filter interpolation process. In most images, the region of interest is located in the center or other region rather than the outer region. Therefore, the bilinear interpolation method is applied to the interpolation of the outermost portion of the image to remove the outer lattice phenomenon and shorten the motion estimation time. It is shown in Figure 11 (b).

도 12부터 도 17은 실험에 사용된 각 영상의 주관적 화질 비교 결과를 보여준다. 각 실험에 사용된 영상마다 열 장의 프레임 중에서 PSNR 수치가 가장 우수한 프레임을 한 장씩 선택하였다. 도 12, 도 14, 도 16은 결과 영상 전체 부분을 나타낸다. 도 12, 도 14, 도 16에서는 고등차수 보간법과 제안된 기법의 성능 차이를 쉽게 비교하기가 어렵다. 12 to 17 show results of subjective quality comparison of each image used in the experiment. For each image used in each experiment, one frame with the best PSNR value was selected. 12, 14 and 16 show the entire part of the resultant image. 12, 14, and 16, it is difficult to easily compare the performance difference between the higher order interpolation and the proposed technique.

따라서 도 13, 도 15, 도 17에 결과 영상을 부분적으로 6배 확대하여 보였다. 최단입점 보간법을 사용한 결과는 단순히 원본 정수 화소값을 복사하여 목표 보간점에 그대로 대입하였기 때문에 고주파성분이 다른 결과들에 비해 두드러지게 손상되는 것을 확인할 수 있다. 이중선형 보간법의 경우, 최단입점 보간법에서 나타난 계단현상을 상당부분 감소시킬 수 있지만 객체의 윤곽선이 원본에 비해 흐려져 역시 결과적으로 고주파성분이 많이 손실된다. 각 영상에서 고등차수 보간법을 사용한 결과는 이중선형 보간법을 사용한 결과보다 윤곽선이 흐려지는 고주파 성분 손실이 비교적 덜 발생하지만, 각 그림의 사각형으로 표시된 부분에 나타낸 것과 같이 윤곽선상에 격자현상 혹은 블록현상이 발생함을 알 수 있다. 본 발명에 의한 초해상도 영상 복원 방법이 이중선형 보간법과 고등차수 보간법에서 나타나는 결점을 상당히 보완해 주는 것을 도 13, 도 15, 도 17에서 확인할 수 있다.
Therefore, the resultant image is partially enlarged 6 times in FIGS. 13, 15, and 17. The result of using the shortest-point interpolation method is that the original integer pixel value is copied and inserted into the target interpolation point as it is, so it can be seen that the high frequency component is significantly damaged compared with other results. In the case of the bilinear interpolation method, the step phenomena shown in the shortest point interpolation method can be greatly reduced, but the contour of the object is blurred compared to the original, and as a result, a large amount of high frequency components are lost. The result of using higher-order interpolation in each image is relatively less likely to lose the high-frequency components that blur the contour than the result of using the bilinear interpolation method. It can be seen. It can be seen from FIGS. 13, 15, and 17 that the super resolution image reconstruction method of the present invention substantially compensates for the drawbacks of the bilinear interpolation method and the higher order interpolation method.

이상, 본 발명자에 의해서 이루어진 발명을 상기 실시 예에 따라 구체적으로 설명하였지만, 본 발명은 상기 실시 예에 한정되는 것은 아니고, 그 요지를 이탈하지 않는 범위에서 여러 가지로 변경 가능한 것은 물론이다.
As mentioned above, although the invention made by this inventor was demonstrated concretely according to the said Example, this invention is not limited to the said Example, Of course, a various change is possible in the range which does not deviate from the summary.

10 : 저해상도 영상 20 : 컴퓨터 단말
30 : 프로그램 시스템 10: low resolution image 20: computer terminal
30: Program system

Claims

Sub-pixel-based motion estimation-based super-resolution image reconstruction method that receives a continuous time image (hereinafter, referred to as a low resolution image) and reconstructs an image having a higher resolution than the low resolution image (hereinafter referred to as a high resolution image) through subpixel unit motion estimation. In
(a) receiving a low resolution image of a continuous time, wherein the low resolution image comprises a target image and a time image before and after the target image;
(b) obtaining a subpixel by applying interpolation to the low resolution images, and obtaining an image including a pixel of the low resolution images and the subpixel (hereinafter referred to as a subpixel image);
(c) obtaining a motion vector by searching an optimal block match for each of the subpixel images (hereinafter, referred to as target subpixel images) and the before and after time images (hereinafter, referred to as subpixel images) of the target image; And
(d) For each pixel (hereinafter, referred to as a target point) of the target subpixel image, the pixel (hereinafter referred to as a matching point) of the front and rear subpixel image matched to the target point by the motion vector is found, and the target point is referred to as the matching point. Sub-pixel motion estimation based super resolution image reconstruction method comprising the step of interpolating.

The method of claim 1,
The before-and-after subpixel image is composed of two consecutive images of a previous time of the target image and two consecutive images of a subsequent time of the target image.

The method of claim 1,
In the step (b), one-half subpixel of the subpixel applies a 6-tap FIR filter, and one-quarter subpixel is bilinear to the half subpixel. A subpixel-based motion estimation based super resolution image reconstruction method characterized by applying a linear interpolation method.

The method of claim 1,
In the step (c), the optimal block matching performs a sum of absolute difference (SAD) operation in units of blocks between the target subpixel image and the front and rear subpixel images, so that the block having the smallest SAD is an optimal block. Sub-pixel unit motion estimation based super-resolution image reconstruction method characterized in that the selection.

5. The method of claim 4,

The sub-pixel motion estimation based super resolution image reconstruction method of claim 2, wherein the block unit is 2 * 2.

5. The method of claim 4,
The subpixel unit motion estimation based super resolution image reconstruction method according to claim 1, wherein the matching block is searched by a global search method or a spiral search method to select the optimal block.

The method of claim 1,
In the step (d), the target point is a subpixel unit motion estimation based super-resolution image reconstruction method, characterized in that for interpolating the matching points by the weight of the distance between the target point and the matching point.

The method of claim 7, wherein
In the step (d), the target point is a subpixel unit motion estimation based super-resolution image reconstruction method, characterized in that for interpolating the matching points by [Equation 1].
[Equation 1]

Where P is the target point,
f ₁ (P) is the value obtained by interpolating the target point P (hereinafter referred to as the value of the target interpolation point),
f ₁ () is the interpolation function,
D _i is each registration point,
d _i is the distance between the target point P and each registration point D _i ,
z _i is the value of its matching point D _i ,
u is the coefficient of the interpolation equation,
N is the number of matching points used for the target point,
i is the index of the registration point.

A computer-readable recording medium having recorded thereon a program for performing the subpixel unit motion estimation based super resolution image reconstruction method according to any one of claims 1 to 8.