KR101996730B1

KR101996730B1 - Method and apparatus for reconstructing single image super-resolution based on artificial neural network

Info

Publication number: KR101996730B1
Application number: KR1020170129743A
Authority: KR
Inventors: 박인규; 정준영
Original assignee: 인하대학교 산학협력단
Priority date: 2017-10-11
Filing date: 2017-10-11
Publication date: 2019-07-04
Also published as: KR20190040586A

Abstract

인공신경망을 이용한 단일 영상 고해상도 복원 방법 및 시스템이 제시된다. 일 실시예에 따른 인공신경망(Artificial neural network)을 이용한 단일 영상 고해상도 복원(Single Image Super-Resolution) 방법은, 입력된 영상을 전처리하여 저해상도 영상 및 자가 고해상도(Self High-Resolution) 영상을 제공하는 단계; 상기 저해상도 영상 및 상기 자가 고해상도 영상에 대하여 기설정된 학습 데이터를 이용하여 잔여영상을 복원하는 단계; 및 상기 저해상도 영상 및 상기 자가 고해상도 영상과 잔여영상을 복원한 영상을 통해 상기 입력된 영상을 고해상도 영상으로 복원하는 단계를 포함하고, 상기 저해상도 영상 및 상기 자가 고해상도 영상에 대하여 기설정된 학습 데이터를 이용하여 잔여영상을 복원하는 단계는, 전체 잔여영상 복원 네트워크를 통해 외부 예시영상 기반으로 넓은 수용 영역과 잔여영상 학습을 이용하여 상기 저해상도 영상의 전반적인 전체 영상구조의 고주파 성분을 복원하는 단계; 및 지역 잔여영상 복원 네트워크를 통해 내부 예시영상 기반의 상기 자가 고해상도 영상을 반복적으로 나타나거나 영상 전체에 분포되어 있는 세밀한 지역적 영상구조의 고주파 성분을 복원하는 단계를 포함하고, 상기 내부 예시영상 및 상기 외부 예시영상의 상호보완 관계를 이용하여 영상을 고해상도 복원할 수 있다. A single image high resolution reconstruction method and system using artificial neural network is presented. A single image super-resolution method using an artificial neural network according to an embodiment includes a step of pre-processing an input image to provide a low resolution image and a self high-resolution image ; Reconstructing a residual image using predetermined learning data for the low-resolution image and the self-high-resolution image; And reconstructing the input image into a high-resolution image through an image reconstructed from the low-resolution image, the self-high-resolution image, and the residual image, wherein the low-resolution image and the self- Reconstructing the residual image may include restoring a high frequency component of the overall image structure of the low resolution image using a wide acceptance area and residual image learning based on the external exemplar image through the entire residual image restoration network; And reconstructing the high frequency component of the detailed local image structure repeatedly appearing in the self-high-resolution image based on the internal example image through the local residual image restoration network or distributed over the entire image, The image can be restored at a high resolution using the complementary relationship of the example images.

Description

[0001] METHOD AND APPARATUS FOR RECONSTRUCTING SINGLE IMAGE SUPER-RESOLUTION [0002] BASED ON ARTIFICIAL NEURAL NETWORK [

아래의 실시예들은 인공신경망을 이용한 단일 영상 고해상도 복원 방법 및 시스템에 관한 것으로, 더욱 상세하게는, 저해상도 영상과 고해상도 영상의 맵핑 관계를 학습하는 깊은 컨볼루셔널 신경망(deep convolutional neural network) 기반의 단일 영상 고해상도 복원 방법 및 시스템에 관한 것이다.The following embodiments relate to a single image high resolution reconstruction method and system using an artificial neural network and more particularly to a method and system for reconstructing a single image based on a deep convolutional neural network that learns a mapping relationship between a low resolution image and a high resolution image And more particularly, to a method and system for reconstructing high-resolution images.

최근 메모리 집적화 기술과 인공지능 연구의 발전을 통해 빅 데이터를 기반으로 한 영상 기반의 컴퓨터 비전 기법 연구의 중요성이 강조되고 있다. 그러나 이러한 연구들은 선명한 영상 취득을 기반으로 연구되어 영상 취득 시 발생하는 기계적 제한 및 환경적 요인에 의한 화질 저하에 대한 연구의 필요성을 수반한다.Recent development of memory integration technology and artificial intelligence research has emphasized the importance of image based computer vision technique research based on Big Data. However, these studies have been studied based on clear image acquisition, and involve the necessity of research on image quality deterioration due to mechanical limitations and environmental factors during image acquisition.

단일 영상의 고해상도 복원 기술은 환경적 또는 기계적 제약에 의해 저해상도 영상을 취득할 수밖에 없는 경우 해상도의 한계를 알고리즘을 통하여 해결하기 위해 연구되어 왔다. 최근의 고해상도 복원 기술은 컨볼루셔널 신경망(Convolutional Neural Network, CNN)을 이용한 외부 예시영상 기반의 고해상도 복원 기술이 기존 기술들의 성능을 넘어서는 새로운 발전 방향으로 그 성능을 입증하였음에도 불구하고 여전히 작은 영상 구조에 대하여 왜곡된 고해상도 복원 결과를 보이는 단점을 가지고 있다.A high-resolution reconstruction technique of a single image has been studied to solve the limitations of resolution through an algorithm when the low-resolution image can not be obtained due to environmental or mechanical constraints. Although the recent high resolution reconstruction technology has proved its performance in the direction of new developments beyond the performance of existing technologies, the reconstruction technology based on external exemplar images using Convolutional Neural Network (CNN) And a high-resolution reconstruction result is distorted.

한국등록특허 10-1780057호는 이러한 고해상도 영상 복원 방법 및 장치에 관한 것으로, 저해상도 영상에 복수 개의 영상처리 레이어(layer)를 순차적으로 적용하여 고해상도 영상을 복원하는 고해상도 영상 복원에 관한 기술을 기재하고 있다. Korean Patent No. 10-1780057 discloses such a high resolution image restoration method and apparatus and describes a technique for restoring a high resolution image by sequentially applying a plurality of image processing layers to a low resolution image to restore a high resolution image .

W. Shi, J. caballero, F. Huszr, J. totz, A. P. Aitken, R. Bishop, D. Rueckert, and Z. Wang, "Real-time single image and video superresolution using an efficient sub-pixel convolutional neural network," In Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1874-1883, July, 2016. W. Shi, J. Caballero, F. Huszr, J. Totz, AP Aitken, R. Bishop, D. Rueckert, and Z. Wang, "An efficient sub-pixel convolutional neural network , "In Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1874-1883, July, 2016. S. Schulter, C. Leistner, and H. Bischof, "Fast and accurate image upscaling with super-resolution forests," In Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 3791-3799, June, 2015. S. Schulter, C. Leistner, and H. Bischof, "Fast and accurate image upscaling with super-resolution forests," In Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 3791-3799, June, 2015. C. Dong, C. Loy, K. He, and X. Tang, "Learning a deep convolutional network for image super-resolution," In Proc. of European Conference on Computer Vision, pp. 184-199, September, 2014. C. Dong, C. Loy, K. He, and X. Tang, "Learning a deep convolutional network for image super-resolution," In Proc. of European Conference on Computer Vision, pp. 184-199, September, 2014. J. Kim, J. K. Lee, and K. M. Lee, "Accurate image super-resolution using very deep convolutional networks," In Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1646-1654, July, 2016. J. Kim, J. K. Lee, and K. M. Lee, " Accurate image super-resolution using very deep convolutional networks, "In Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1646-1654, July, 2016. D. Glasner, S. Bagon, and M. Irani, "Super-resolution from a single image," In Proc. of IEEE International Conference on Computer Vision, pp. 349-356, September, 2009. D. Glasner, S. Bagon, and M. Irani, "Super-resolution from a single image," In Proc. IEEE International Conference on Computer Vision, pp. 349-356, September, 2009. A. Singh and N. Ahuja, "Super-resolution using sub-band self-similarity," In Proc. of Asian Conference on Computer Vision, pp. 552-568, November, 2014. A. Singh and N. Ahuja, "Super-resolution using sub-band self-similarity," In Proc. of Asian Conference on Computer Vision, pp. 552-568, November, 2014. J. Yang, Z. Lin, and S. Cohen, "Fast image super-resolution based on in-place example regression," In Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1059-1066, June, 2013. J. Yang, Z. Lin, and S. Cohen, "Fast image super-resolution based on in-place example regression," In Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1059-1066, June, 2013. J.-B. Huang, A. Singh, and N.Ahuja, "Single image super-resolution from transformed self-exemplars," In Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 5197-5206, June, 2015. J.-B. Huang, A. Singh, and N. Ahuja, "Single image super-resolution from transformed self-exemplars," In Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 5197-5206, June, 2015. A. Vedaldi and K. Lenc, "MatConvnet-Convolutional Neural Networks for MATLAB," In Proc. of the ACM International Conference on Multimedia, pp. 689-692, October, 2015. A. Vedaldi and K. Lenc, "MatConvnet-Convolutional Neural Networks for MATLAB," In Proc. of the ACM International Conference on Multimedia, pp. 689-692, October, 2015. J.-B. Huang, S. B. Kang, N. Ahuja, and J. Kopf. "Image completion using planar structure guidance," ACM Trans. on Graphics, vol. 33, no. 4, pp.129, July, 2014. J.-B. Huang, S. B. Kang, N. Ahuja, and J. Kopf. "Image completion using planar structure guidance," ACM Trans. on Graphics, vol. 33, no. 4, pp. 129, July, 2014. Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," In Proc. of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998. Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," In Proc. of the IEEE, vol. 86, no. 11, pp. 2278-2324,1998. M. Bevilacqua, A. Roumy, C. Guillemot, and M.-L. Alberi-Morel, "Low-complexity single-image super-resolution based on nonnegative neighbor embedding," In Proc. of British Machine Vision Conference, pp. 1-10, September, 2012. M. Bevilacqua, A. Roumy, C. Guillemot, and M.-L. Alberi-Morel, "Low-complexity single-image super-resolution based on nonnegative neighbor embedding," In Proc. of British Machine Vision Conference, pp. 1-10, September, 2012. R. Zeyde, M. Elad, and M. Protter, "On single image scale-up using sparse representations," In Proc. of International Conference on Curves and Surfaces, pp. 711-730, June, 2010. R. Zeyde, M. Elad, and M. Protter, "On single image scale-up using sparse representations," In Proc. of International Conference on Curves and Surfaces, pp. 711-730, June, 2010. D. Martin, C. Fowlkes, D. Tal, and J. Malik, "A database of human segmented natural images and its application to evaluating segmentation methods and measuring ecological statistics," In Proc. of IEEE International Conference on Computer Vision, pp. 416-423, July, 2001. D. Martin, C. Fowlkes, D. Tal, and J. Malik, "A database of human segmented natural images and their application to evaluating segmentation methods and measuring ecological statistics," In Proc. IEEE International Conference on Computer Vision, pp. 416-423, July, 2001. L. Sun and J. Hays, "Super-resolution from internet-scale scene matching," In Proc. of IEEE International Conference on Computational Photography, pp. 1-12, April, 2012. L. Sun and J. Hays, "Super-resolution from internet-scale scene matching," In Proc. IEEE International Conference on Computational Photography, pp. 1-12, April, 2012. https://github.com/jbhuang0604/SelfExSR. https://github.com/jbhuang0604/SelfExSR.

실시예들은 인공신경망을 이용한 단일 영상 고해상도 복원 방법 및 시스템에 관하여 기술하며, 보다 구체적으로 저해상도 영상과 고해상도 영상의 맵핑 관계(mapping function)를 학습하는 깊은 컨볼루셔널 신경망(deep convolutional neural network) 기반의 단일 영상 고해상도 복원(single image super-resolution: SISR) 기술을 제공한다. Embodiments describe a single image high resolution reconstruction method and system using an artificial neural network, and more specifically, a deep convolutional neural network based learning method that learns a mapping function between a low resolution image and a high resolution image And single image super-resolution (SISR) technology.

실시예들은 외부 및 내부 예시영상 기반의 단일 영상 고해상도 복원(external and internal examples-driven SISR) 기술의 상호 보완적인 관계를 이용하여 다양한 영상구조에서 강인한 고해상도 영상 복원 능력을 갖는 인공신경망을 이용한 단일 영상 고해상도 복원 방법 및 시스템을 제공하는데 있다.Embodiments use a mutually complementary relationship between external and internal examples-driven SISR (SISR) techniques based on external and internal example images to provide a single high resolution image using an artificial neural network having robust high resolution image restoration capability in various image structures And to provide a restoration method and system.

일 실시예에 따른 인공신경망(Artificial neural network)을 이용한 단일 영상 고해상도 복원(Single Image Super-Resolution) 방법은, 입력된 영상을 전처리하여 저해상도 영상 및 자가 고해상도(Self High-Resolution) 영상을 제공하는 단계; 상기 저해상도 영상 및 상기 자가 고해상도 영상에 대하여 기설정된 학습 데이터를 이용하여 잔여영상을 복원하는 단계; 및 상기 저해상도 영상 및 상기 자가 고해상도 영상과 잔여영상을 복원한 영상을 통해 상기 입력된 영상을 고해상도 영상으로 복원하는 단계를 포함하고, 상기 저해상도 영상 및 상기 자가 고해상도 영상에 대하여 기설정된 학습 데이터를 이용하여 잔여영상을 복원하는 단계는, 전체 잔여영상 복원 네트워크를 통해 외부 예시영상 기반으로 넓은 수용 영역과 잔여영상 학습을 이용하여 상기 저해상도 영상의 전반적인 전체 영상구조의 고주파 성분을 복원하는 단계; 및 지역 잔여영상 복원 네트워크를 통해 내부 예시영상 기반의 상기 자가 고해상도 영상을 반복적으로 나타나거나 영상 전체에 분포되어 있는 세밀한 지역적 영상구조의 고주파 성분을 복원하는 단계를 포함하고, 상기 내부 예시영상 및 상기 외부 예시영상의 상호보완 관계를 이용하여 영상을 고해상도 복원할 수 있다. A single image super-resolution method using an artificial neural network according to an embodiment includes a step of pre-processing an input image to provide a low resolution image and a self high-resolution image ; Reconstructing a residual image using predetermined learning data for the low-resolution image and the self-high-resolution image; And reconstructing the input image into a high-resolution image through an image reconstructed from the low-resolution image, the self-high-resolution image, and the residual image, wherein the low-resolution image and the self- Reconstructing the residual image may include restoring a high frequency component of the overall image structure of the low resolution image using a wide acceptance area and residual image learning based on the external exemplar image through the entire residual image restoration network; And reconstructing the high frequency component of the detailed local image structure repeatedly appearing in the self-high-resolution image based on the internal example image through the local residual image restoration network or distributed over the entire image, The image can be restored at a high resolution using the complementary relationship of the example images.

상기 입력된 영상을 전처리하여 저해상도 영상 및 자가 고해상도(Self High-Resolution) 영상을 제공하는 단계는, 보간 기술을 통해 보간된 상기 저해상도 영상을 전반적인 영상구조를 유지하기 위한 가이드 영상으로 제공하고, 상기 내부 예시영상 기반의 고해상도 복원 기술을 이용하여 복원한 상기 자가 고해상도 영상을 지역적 영상구조의 세밀한 부분을 복원하기 위해 제공하며, 상기 외부 예시영상 및 내부 예시영상의 상호보완 관계를 이용하여 깊은(deep) 컨볼루셔널 신경망(Convolutional Neural Network, CNN) 기반으로 상기 입력된 영상을 고해상도 복원할 수 있다. The step of pre-processing the input image to provide a low-resolution image and a self-high-resolution image may include providing the low-resolution image interpolated through an interpolation technique as a guide image for maintaining an overall image structure, A method for reconstructing a self-reconstructed high-resolution image using a high-resolution reconstruction technique based on an exemplar image, the method comprising the steps of: reconstructing a deep-reconstructed image using a complementary relationship between the external exemplar image and an internal- The input image can be restored with high resolution based on a Convolutional Neural Network (CNN).

다른 실시예에 따른 인공신경망(Artificial neural network)을 이용한 단일 영상 고해상도 복원(Single Image Super-Resolution) 시스템에 있어서, 입력된 영상을 전처리하여 저해상도 영상 및 자가 고해상도(Self High-Resolution) 영상을 제공하는 영상 제공부; 상기 저해상도 영상 및 상기 자가 고해상도 영상에 대하여 기설정된 학습 데이터를 이용하여 잔여영상을 복원하는 잔여영상 복원부; 및 상기 저해상도 영상 및 상기 자가 고해상도 영상과 잔여영상을 복원한 영상을 통해 상기 입력된 영상을 고해상도 영상으로 복원하는 고해상도 복원부를 포함하고, 상기 잔여영상 복원부는, 외부 예시영상 기반으로 넓은 수용 영역과 잔여영상 학습을 이용하여 상기 저해상도 영상의 전반적인 전체 영상구조의 고주파 성분을 복원하는 전체 잔여영상 복원 네트워크; 및 내부 예시영상 기반의 상기 자가 고해상도 영상을 반복적으로 나타나거나 영상 전체에 분포되어 있는 세밀한 지역적 영상구조의 고주파 성분을 복원하는 지역 잔여영상 복원 네트워크를 포함하고, 상기 내부 예시영상 및 상기 외부 예시영상의 상호보완 관계를 이용하여 영상을 고해상도 복원할 수 있다. In a single image super-resolution system using an artificial neural network according to another embodiment, an input image is pre-processed to provide a low-resolution image and a self-high-resolution image Image supply; A residual image restoring unit for restoring the residual image using the preset learning data for the low resolution image and the self high resolution image; And a high-resolution reconstruction unit that reconstructs the input image into a high-resolution image through the low-resolution image, the self-high-resolution image, and the reconstructed residual image, wherein the residual image reconstructing unit reconstructs the wide- An entire residual image restoration network for restoring a high frequency component of the entire overall image structure of the low resolution image using image learning; And a local residual image restoration network for restoring the high frequency component of the detailed local image structure repeatedly appearing on the self-high-resolution image based on the internal example image or distributed over the entire image, wherein the internal exemplar image and the external exemplar image The image can be restored at a high resolution using the complementary relation.

상기 영상 제공부는, 보간 기술을 통해 보간된 상기 저해상도 영상을 전반적인 영상구조를 유지하기 위한 가이드 영상으로 제공하고, 상기 내부 예시영상 기반의 고해상도 복원 기술을 이용하여 복원한 상기 자가 고해상도 영상을 지역적 영상구조의 세밀한 부분을 복원하기 위해 제공하며, 상기 외부 예시영상 및 내부 예시영상의 상호보완 관계를 이용하여 깊은(deep) 컨볼루셔널 신경망(Convolutional Neural Network, CNN) 기반으로 상기 입력된 영상을 고해상도 복원할 수 있다. Wherein the image providing unit provides the low-resolution image interpolated through the interpolation technique as a guide image for maintaining the overall image structure, and the high-resolution restoration technique using the internal example image-based high- And reconstructs the input image based on a deep convolutional neural network (CNN) using the complementary relationship between the external exemplar image and the internal exemplar image. .

상기 지역 잔여영상 복원 네트워크는, 컨볼루셔널 레이어(convolutional layer)와 ReLu(Rectified Linear Unit) 함수로 이루어진 복수의 화소 부호화 레이어로 구성되어, 출력 특징 맵은 주변 화소 정보의 간섭 없이 부호화된 상기 자가 고해상도 영상의 화소 정보를 전달할 수 있다. Wherein the region residual image restoration network is composed of a plurality of pixel encoding layers composed of a convolutional layer and a ReLu (Rectified Linear Unit) function, and the output feature map includes the self-high resolution The pixel information of the image can be transmitted.

상기 전체 잔여영상 복원 네트워크는, 상기 자가 고해상도 영상의 계단 왜곡 현상에 의해 왜곡된 전반적인 전체영상구조를 보완하여 왜곡 없는 고해상도 영상구조 복원하도록 외부 예시영상 기반의 넓은 수용영역을 갖도록 직렬로 연결된 복수의 상기 잔여 블록(residual block)이 형성될 수 있다. The entire remaining image restoration network may include a plurality of the plurality of the plurality of the plurality of the plurality of the plurality of the plurality of the plurality of the plurality of the plurality of the plurality of the plurality of the plurality of the plurality A residual block may be formed.

상기 전체 잔여영상 복원 네트워크는, 상기 잔여 블록을 통해 스킵 컨넥션(skip-connection)과 집단 정규화(Batch Normalization) 레이어를 이용하여 안정적인 그래디언트(gradient) 값의 학습 과정을 수행하고, 각각의 상기 잔여 블록을 통해 입력 특징 맵은 복수의 컨볼루셔널 레이어를 통과하며, 출력 특징 맵의 수는 입력 특징 맵의 수와 동일하게 유지되고, 소정 크기의 수용 영역은 N개의 잔여 블록과 마지막으로 컨볼루셔널 레이어를 통과하여 크기가 확장됨에 따라 전체 잔여영상을 복원하여 고해상도 영상구조를 복원할 수 있다.Wherein the entire residual image restoration network performs a process of learning a gradient value using a skip-connection and a batch normalization layer through the residual block, The input feature map passes through a plurality of convolutional layers, the number of output feature maps remains the same as the number of input feature maps, the predetermined size of coverage area includes N remaining blocks and finally the convolutional layer As the size increases, the entire residual image can be restored and the high-resolution image structure can be restored.

실시예들에 따르면 저해상도 영상과 고해상도 영상의 맵핑 관계를 학습하는 깊은 컨볼루셔널 신경망 기반의 단일 영상 고해상도 복원(single image super-resolution: SISR) 방법 및 시스템을 제공할 수 있다.According to embodiments, a single convolutional neural network-based single image super-resolution (SISR) method and system that learns the mapping relationship between a low-resolution image and a high-resolution image can be provided.

실시예들에 따르면 외부 및 내부 예시영상 기반의 단일 영상 고해상도 복원(external and internal examples-driven SISR) 기술의 상호 보완적인 관계를 이용하여 다양한 영상구조에서 강인한 고해상도 영상 복원 능력을 갖는 인공신경망을 이용한 단일 영상 고해상도 복원 방법 및 시스템을 제공할 수 있다.According to the embodiments, by using the mutually complementary relation of the external and internal examples-driven SISR techniques based on the external and internal example images, a single image using artificial neural network having robust high resolution image restoration ability in various image structures It is possible to provide an image restoration method and system.

도 1은 복원된 고해상도 영상의 예를 나타낸다.
도 2a는 일 실시예에 따른 깊은 CNN 모델을 설명하기 위한 개략도이다.
도 2b는 일 실시예에 따른 깊은 CNN 모델의 전체 잔여영상 복원 네트워크를 설명하기 위한 개략도이다.
도 3은 일 실시예에 따른 인공신경망을 이용한 단일 영상 고해상도 복원 시스템을 나타내는 블록도이다.
도 4a 및 도 4b는 일 실시예에 따른 인공신경망을 이용한 단일 영상 고해상도 복원 방법을 나타내는 흐름도이다.
도 5는 일 실시예에 따른 깊은 CNN 모델의 분석을 설명하기 위한 도면이다.
도 6은 일 실시예에 따른 영상의 고해상도 복원 결과를 비교하여 나타내는 도면이다.
도 7은 일 실시예에 따른 Urban100 데이터 셋에 대한 4배 고해상도 복원 결과를 나타낸다.
도 8은 일 실시예에 따른 Sun-hays80 데이터 셋에 대한 8배 고해상도 복원 결과를 나타낸다.
도 9는 일 실시예에 따른 Sun-hays80 데이터 셋에 대한 고해상도 복원 결과를 나타낸다.FIG. 1 shows an example of a reconstructed high-resolution image.
2A is a schematic diagram for explaining a deep CNN model according to an embodiment.
FIG. 2B is a schematic diagram illustrating an entire residual image restoration network of a deep CNN model according to an embodiment.
3 is a block diagram illustrating a single image high resolution reconstruction system using an artificial neural network according to an exemplary embodiment.
4A and 4B are flowcharts illustrating a single image high resolution restoration method using an artificial neural network according to an exemplary embodiment.
FIG. 5 is a diagram for explaining an analysis of a deep CNN model according to an embodiment.
FIG. 6 is a diagram illustrating a result of high resolution reconstruction of an image according to an exemplary embodiment.
FIG. 7 shows a 4x high resolution reconstruction result for the Urban100 data set according to an embodiment.
FIG. 8 shows an 8x high resolution reconstruction for a Sun-hays80 dataset according to one embodiment.
Figure 9 shows the high resolution reconstruction results for the Sun-hays80 dataset according to one embodiment.

이하, 첨부된 도면을 참조하여 실시예들을 설명한다. 그러나, 기술되는 실시예들은 여러 가지 다른 형태로 변형될 수 있으며, 본 발명의 범위가 이하 설명되는 실시예들에 의하여 한정되는 것은 아니다. 또한, 여러 실시예들은 당해 기술분야에서 평균적인 지식을 가진 자에게 본 발명을 더욱 완전하게 설명하기 위해서 제공되는 것이다. 도면에서 요소들의 형상 및 크기 등은 보다 명확한 설명을 위해 과장될 수 있다.
Hereinafter, embodiments will be described with reference to the accompanying drawings. However, the embodiments described may be modified in various other forms, and the scope of the present invention is not limited by the embodiments described below. In addition, various embodiments are provided to more fully describe the present invention to those skilled in the art. The shape and size of elements in the drawings may be exaggerated for clarity.

아래의 실시예들에 따르면 깊은(deep) 컨볼루셔널 신경망(Convolutional Neural Network, CNN) 기반의 고해상도 복원 기술은 외부 및 내부 예시영상의 상호보완 관계를 이용하여 고해상도 복원 문제를 해결할 수 있다. 이하에서는, 외부 및 내부 예시영상 기반의 고해상도 복원 기술의 장단점을 각각 분석하고, 이를 이용한 새로운 CNN 기반의 고해상도 복원 기술을 제공한다. 여기에서, 깊은 컨볼루셔널 신경망(CNN)은 딥 러닝(deep learning)의 일종으로, 영상 처리 분야에서 이미지를 이해하거나 이미지로부터 각종 정보를 추출하기 위한 인공신경망을 의미할 수 있다.According to the embodiments described below, a high-resolution reconstruction technique based on a deep convolutional neural network (CNN) can solve the high-resolution reconstruction problem by using the complementary relationship between the external and internal example images. Hereinafter, the advantages and disadvantages of the high-resolution reconstruction technology based on the external and internal example images are analyzed, respectively, and a new CNN-based high-resolution reconstruction technique using the same is provided. Here, deep convolutive neural network (CNN) is a kind of deep learning, which can mean an artificial neural network for understanding images in the field of image processing or extracting various information from images.

먼저, 외부 예시영상 기반의 고해상도 복원 기술의 장단점을 분석한다. First, we analyze the strengths and weaknesses of the high resolution reconstruction technology based on the external image.

지난 10년간, CNN 기반의 고해상도 복원 기술(비특허문헌 1,3,4)은 기존 기술의 한계를 극복하는 새로운 해결책으로 각광받았다. 보다 깊은 CNN 모델 기반의 복원 기술은 저해상도와 고해상도 영상의 보다 정교한 비선형 맵핑 관계를 학습하여 단순한 CNN 모델 기반의 기술들보다 높은 성능을 보였다. VDSR(비특허문헌 4)은 이러한 깊은 CNN 모델 기반의 고해상도 복원 기술의 예시로 3x3 컨볼루셔널 레이어와 ReLu 함수의 반복적인 깊은 구조를 이용하였다. 특히, 전체의 고해상도 영상 복원 대신 고해상도 영상과 보간된 저해상도 영상의 잔여영상을 복원하여 고해상도 영상의 고주파 영상구조에 집중하였으며, 이는 CNN 모델의 수용성을 향상시켜 기존의 외부 예시영상 기반의 다양한 복원 기술 중 최고의 성능을 보였다. 따라서, 본 실시예에서는 VDSR의 복원 성능을 기준으로 외부 예시영상 기반의 고해상도 복원 기술을 분석할 수 있다.Over the past decade, CNN-based high-resolution restoration technologies (non-patent documents 1, 3 and 4) have attracted attention as new solutions to overcoming the limitations of existing technologies. The deeper CNN model - based restoration technique learned the more sophisticated nonlinear mapping relationship between low resolution and high resolution images and showed higher performance than simple CNN model based techniques. VDSR (Non-Patent Document 4) used a repetitive deep structure of a 3x3 convolutional layer and a ReLu function as an example of such a deep CNN model-based high-resolution reconstruction technique. In particular, instead of restoring the entire high-resolution image, the high-resolution image and the residual image of the interpolated low-resolution image are restored to focus on the high-frequency image structure of the high-resolution image. This enhances the acceptability of the CNN model, It showed the best performance. Therefore, in this embodiment, it is possible to analyze the high-resolution restoration technology based on the external exemplar image based on the restoration performance of the VDSR.

도 1은 복원된 고해상도 영상의 예를 나타낸다. FIG. 1 shows an example of a reconstructed high-resolution image.

도 1을 참조하면, 복원된 고해상도 영상의 예를 나타내는 것으로, (a)는 입력영상을 나타내며, (b)는 내부 예시영상 기반의 복원 결과(비특허문헌 8)를 나타내고, (c)는 외부 예시영상 기반의 복원 결과(비특허문헌 4)를 나타낸다. (B) shows a restoration result based on an internal example image (non-patent document 8), (c) shows an example of a restored An example image-based restoration result (non-patent document 4) is shown.

깊은 CNN 모델을 통한 넓은 수용 영역은 고해상도 영상의 1개의 화소를 복원하기 위해 넓은 영역의 저해상도 영역을 볼 수 있어 보다 정교한 맵핑 관계를 학습할 수 있다. 또한 고주파 영상구조 복원에 특화된 깊은 CNN 모델은 고주파 영상구조인 에지(edge) 영역에 대하여 선명한 고해상도 영상을 복원할 수 있다(122).In the wide acceptance area through the deep CNN model, a wide area of low resolution area can be seen in order to restore one pixel of a high resolution image, so that a more sophisticated mapping relation can be learned. In addition, the deep CNN model, which is specialized for restoration of high-frequency image structure, can reconstruct a clear high-resolution image of the edge region, which is a high-frequency image structure (122).

그러나, 깊은 CNN 모델은 넓은 수용 영역을 기반으로 한 고해상도 영상을 복원할 수 있지만 여전히 영상의 전체 영역에 대한 자가 예시영상 정보를 대체하기에는 부족하여 반복적으로 나타나거나 영상 전체에 분포되어 있는 영상구조에 대하여 고해상도 영상 복원 시 복원된 고해상도 영상구조가 왜곡되는 한계를 보인다(112).However, the deep CNN model can reconstruct a high-resolution image based on a wide coverage area, but is still insufficient to replace the self-imaged image information of the whole area of the image. The reconstructed high-resolution image structure in the reconstruction of the high-resolution image has a limit of distortion (112).

다음으로, 내부 예시영상 기반의 고해상도 복원 기술의 장단점을 분석한다.Next, we analyze the advantages and disadvantages of the high-resolution reconstruction technology based on the internal example image.

연구(비특허문헌 5)의 실험에 따르면, 자연 영상의 작은 고해상도 패치는 다양한 스케일의 영상 내에서 반복적으로 나타난다. 대부분의 내부 예시영상 기반의 고해상도 복원 기술들은 이러한 자가 유사성 특성을 바탕으로 고안되어 보다 확장된 자가 예시 고해상도 패치 공간 생성은 고해상도 영상 복원 성능 향상에 중요한 부분이다. 이를 위하여, 최근 SelfExSR(비특허문헌 8)은 입력영상 내 다중 평면(비특허문헌 10)을 기반으로 한 원근 왜곡을 고려하여 패치 공간을 확장시켜 기존의 내부 예시영상 기반의 복원 기술들보다 높은 성능 향상을 보였다. 따라서, 본 실시예에서는 SelfExSR의 복원 성능을 기준으로 내부 예시영상 기반의 고해상도 복원 기술을 분석할 수 있다.According to the experiment of the study (Non-Patent Document 5), a small high resolution patch of a natural image appears repeatedly in various scale images. Most internal image-based high-resolution reconstruction techniques are devised based on such self-similarity characteristics, so that a more extended example of the authorship. Creating a high resolution patch space is an important part to improve the high-resolution image reconstruction performance. To this end, the SelfExSR (Non-Patent Document 8) recently proposed a method of expanding the patch space in consideration of perspective distortion based on a multi-plane (non-patent document 10) in the input image, Improvement. Therefore, in the present embodiment, it is possible to analyze the high-resolution restoration technology based on the internal example image based on the restoration performance of SelfExSR.

자가 유사성 특성을 기반으로 고해상도 영상 패치를 탐색하여 한정된 고해상도 영상 탐색공간이 아닌 영상 전체를 탐색공간으로 사용할 수 있다. 이는, 반복적으로 나타나거나 영상 전체에 분포되어 있는 영상구조에 대하여 풍부한 고해상도 정보를 기반으로 고해상도 영상을 복원할 수 있어 세밀한 영상구조 복원에 높은 성능을 보인다(121).Based on the self similarity characteristic, a high resolution image patch can be searched to use the entire image as a search space rather than a limited high resolution image search space. This is because high-resolution images can be reconstructed based on rich high-resolution information for image structures repeatedly appearing or distributed throughout the image, thus exhibiting high performance for fine image reconstruction (121).

한편, 내부 예시영상 기반의 고해상도 복원 기술은 탐색한 작은 고해상도 패치 기반으로 영상의 해상도를 단계적으로 증가시켜가며 고해상도 영상을 복원할 수 있다. 이러한 작은 고해상도 패치 기반의 복원 방식은 패치보다 큰 전반적인 고해상도 영상구조를 겹쳐진 작은 고해상도 패치를 관찰하여 복원해야 하므로 왜곡된 고해상도 영상구조를 복원하기 쉬우며 단계적인 복원 방식에 따라 왜곡된 고해상도 영상구조 정보가 누적된다. On the other hand, the high-resolution reconstruction technique based on the internal example image can restore a high-resolution image by incrementally increasing the resolution of the image based on the small high resolution patch that is searched. This small, high-resolution patch-based reconstruction method requires restoration by observing a small high-resolution patch that overlaps the overall high-resolution image structure larger than the patch, so that it is easy to reconstruct distorted high-resolution image structure and distorted high- .

이러한 단점은 복원된 고해상도 영상의 고주파 영상구조에 대하여 계단 왜곡 현상(step artifacts)을 야기한다(111).This disadvantage causes step artifacts in the high-frequency image structure of the reconstructed high-resolution image (111).

이와 같이, 외부 예시영상 기반의 고해상도 복원 기술은 넓은 수용 영역과 잔여영상 학습을 기반으로 전반적인 고주파 영상구조를 선명하게 구현할 수 있지만, 내부 예시영상 기반의 고해상도 복원 기술은 계단 왜곡 현상을 야기할 수 있다. 반대로, 반복적으로 나타나거나 영상 전체에 분포되어 있는 세밀한 영상구조에 대하여 내부 예시영상 기반의 고해상도 복원 기술은 뛰어난 성능을 보이는 반면, 외부 예시영상 기반의 고해상도 복원 기술은 왜곡된 영상구조를 복원하기 쉽다.
As described above, although the high-resolution reconstruction technique based on the external exemplar image can clearly implement the overall high-frequency image structure based on the wide acceptance region and the residual image learning, the high-resolution reconstruction technique based on the internal exemplar image may cause the step- . On the contrary, the high-resolution reconstruction technique based on the internal example image has excellent performance for the detailed image structure repeatedly or distributed over the entire image, whereas the high-resolution reconstruction technique based on the external exemplar image is easy to reconstruct the distorted image structure.

도 2a는 일 실시예에 따른 깊은 CNN 모델을 설명하기 위한 개략도이다. 2A is a schematic diagram for explaining a deep CNN model according to an embodiment.

도 2a를 참조하면, 일 실시예에 따른 깊은 CNN 모델은 평행한 2개의 깊은 CNN 구조의 부분 네트워크로 이루어질 수 있다. 깊은 CNN 구조의 부분 네트워크는 전체 잔여영상 복원 네트워크(global residual network)이며, 다른 부분 네트워크는 지역 잔여영상 복원 네트워크(local residual network)이다. Referring to FIG. 2A, a deep CNN model according to an exemplary embodiment may be formed of a partial network of two parallel deep CNN structures. The partial network of the deep CNN structure is a global residual network, and the other partial network is a local residual network.

먼저, 전체 잔여영상 복원 네트워크는 깊은 CNN 모델의 넓은 수용영역을 활용하여 큰 영상구조의 고주파 성분을 복원할 수 있다. 그리고, 지역 잔여영상 복원 네트워크는 내부 예시영상 기반의 고해상도 정보를 3개의 1x1 컨볼루셔널 레이어(convolutional layer)를 활용하여 주변 화소의 간섭 없이 인코딩된 선명한 작은 영상구조의 고주파 성분을 복원할 수 있다. 아래에서 보다 구체적으로 설명한다. First, the entire residual image restoration network can recover a high frequency component of a large image structure by utilizing the wide receiving area of the deep CNN model. In addition, the local residual image restoration network can recover high frequency components of a clear image structure that is encoded without interference of peripheral pixels by utilizing three 1x1 convolutional layers of high resolution information based on the internal example image. This will be described in more detail below.

일 실시예에 따른 깊은 CNN 모델의 입력은 Bicubic 보간 기술로 보간된 저해상도(Interpolated Low-Resolution: Interpolated LR) 영상(

)을 전반적인 영상구조를 유지하기 위한 가이드 영상으로, 내부 예시영상 기반의 고해상도 복원 기술을 이용하여 복원한 자가 고해상도(Self High-Resolution: Self HR) 영상(

)을 지역적 영상구조의 세밀한 부분을 복원하기 위해 이용할 수 있다.The input of the deep CNN model according to one embodiment is an interpolated low-resolution (Interpolated LR) image interpolated with a Bicubic interpolation technique

) Is a guide image for maintaining the overall image structure, and the self-high-resolution (Self High-Resolution) image (Self HR) image

) Can be used to reconstruct the fine detail of the local image structure.

주어진 학습 데이터

를 이용하여 입력영상

에 대하여 잔여영상을 복원하는 깊은 CNN 모델(

)을 다음 식을 최적화하여 학습할 수 있다. Given learning data

The input image

Deep CNN model to reconstruct the residual image

) Can be learned by optimizing the following equation.

[식 1] [Formula 1]

지역 잔여영상 복원 네트워크는 자가 고해상도 영상의 지역적 영상정보의 전달을 목적으로 한다. 예를 들어, 지역 잔여영상 복원 네트워크는 연구(비특허문헌 7)에 기반하여 1x1 컨볼루션 레이어와 ReLu(Rectified Linear Unit) 함수로 이루어진 3개의 화소 부호화 레이어로 구성될 수 있다. 수학적으로 i 번째 화소 부호화 레이어의 출력 특징 맵은 다음 식과 같이 표현될 수 있다.The local residual image restoration network aims to transmit local image information of self high resolution image. For example, the local residual image restoration network may be composed of three pixel encoding layers composed of a 1x1 convolution layer and a ReLu (Rectified Linear Unit) function based on the research (Non-Patent Document 7). Mathematically, the output feature map of the i- th pixel encoding layer can be expressed by the following equation.

[식 2][Formula 2]

여기서, F _i 은 i 번째 화소 부호화 레이어의 출력 특징 맵을 의미하며, W _i 와 B _i 은 i 번째 화소 부호화 레이어의 컨볼루션 필터와 바이어스를 의미할 수 있다. 출력 특징 맵은 주변 화소 정보의 간섭 없이 부호화된 지역적 자가 고해상도 영상의 화소 정보를 전달할 수 있다.Here, F _i denotes an output feature map of the i-th pixel encoding layer, and W _i and B _i can denote a convolution filter and a bias of the i- th pixel encoding layer. The output feature map can convey the pixel information of the localized high-resolution image that is encoded without interfering with surrounding pixel information.

전체 잔여영상 복원 네트워크는 자가 고해상도 영상의 계단 왜곡 현상에 의해 왜곡된 전반적인 영상구조를 보완하여 왜곡 없는 고해상도 영상구조 복원을 목적으로 한다. 전체 잔여영상 복원 네트워크는 앞선 외부 예시영상 기반의 고해상도 복원 기술의 분석을 기반하여 넓은 수용 영역을 갖기 위해 여러 개의 잔여 블록(residual block)의 직렬 연결로 구성될 수 있다.The entire residual image restoration network aims at restoring distortion-free high-resolution image structure by complementing the distorted overall image structure by the step distortion of self-high resolution image. The entire residual image restoration network can be composed of a series connection of several residual blocks to have a wide acceptance area based on the analysis of the high resolution restoration technology based on the preceding external image.

도 2b는 일 실시예에 따른 깊은 CNN 모델의 전체 잔여영상 복원 네트워크를 설명하기 위한 개략도이다. FIG. 2B is a schematic diagram illustrating an entire residual image restoration network of a deep CNN model according to an embodiment.

도 2b를 참조하면, 전체 잔여영상 복원 네트워크는 넓은 수용 영역을 갖기 위해 여러 개의 잔여 블록의 직렬 연결로 구성될 수 있다. 여기서, 잔여 블록은 스킵 컨넥션(skip-connection)과 집단 정규화(Batch Normalization: BN) 레이어(202)를 이용하여 안정적인 그래디언트 값의 학습 과정을 도모할 수 있다. 각각의 잔여 블록을 통해 입력 특징 맵은 2 번의 3x3 컨볼루셔널 레이어(201, 204)를 지나며, 출력 특징 맵의 수는 입력 특징 맵의 수와 동일하게 64개로 유지될 수 있다. 3x3 크기의 수용 영역은 N개의 잔여 블록과 마지막 3x3 컨볼루셔널 레이어(222)를 지나 (4N+5)x(4N+5)의 크기로 확장되어 고해상도 영상구조를 복원할 수 있다. Referring to FIG. 2B, the entire residual image restoration network may include a plurality of residual blocks connected in series to have a wide coverage area. Here, the residual block can be steadily learned by using a skip-connection and batch normalization (BN) layer 202. FIG. Through each residual block, the input feature map passes through two 3x3 convolutional layers 201 and 204, and the number of output feature maps can be maintained at 64 equal to the number of input feature maps. The 3x3 receiving area can be expanded to (4N + 5) x (4N + 5) through the N remaining blocks and the last 3x3 convolutional layer 222 to restore the high resolution image structure.

이를 바탕으로 아래에서 일 실시예에 따른 인공신경망을 이용한 단일 영상 고해상도 복원 시스템 및 방법에 대해 설명하기로 한다.
Hereinafter, a single image high resolution restoration system and method using an artificial neural network according to an embodiment will be described.

도 3은 일 실시예에 따른 인공신경망을 이용한 단일 영상 고해상도 복원 시스템을 나타내는 블록도이다. 3 is a block diagram illustrating a single image high resolution reconstruction system using an artificial neural network according to an exemplary embodiment.

도 3을 참조하면, 일 실시예에 따른 인공신경망(Artificial neural network)을 이용한 단일 영상 고해상도 복원(Single Image Super-Resolution) 시스템(300)은 영상 제공부(310), 잔여영상 복원부(320) 및 고해상도 복원부(330)를 포함하여 이루어질 수 있다. 여기에서 일 실시예에 따른 인공신경망을 이용한 단일 영상 고해상도 복원 시스템(300)은 도 2a 및 도 2b에서 설명한 일 실시예에 따른 깊은 CNN 모델이 되거나 깊은 CNN 모델의 적어도 한 부분이 될 수 있다. Referring to FIG. 3, a single image super-resolution system 300 using an artificial neural network according to an exemplary embodiment includes an image providing unit 310, a residual image restoring unit 320, And a high-resolution reconstruction unit 330. FIG. Here, the single image high resolution reconstruction system 300 using the artificial neural network according to an exemplary embodiment may be a deep CNN model or at least one part of a deep CNN model according to the embodiment illustrated in FIGS. 2A and 2B.

영상 제공부(310)는 입력된 영상을 전처리하여 저해상도 영상 및 자가 고해상도 영상을 제공할 수 있다. The image providing unit 310 may pre-process the input image to provide a low-resolution image and a self-high-resolution image.

영상 제공부(310)는 보간 기술을 통해 보간된 저해상도 영상을 전반적인 영상구조를 유지하기 위한 가이드 영상으로 제공하고, 내부 예시영상 기반의 고해상도 복원 기술을 이용하여 복원한 자가 고해상도 영상을 지역적 영상구조의 세밀한 부분을 복원하기 위해 제공할 수 있다. The image providing unit 310 provides the guide image for maintaining the overall image structure of the low resolution image interpolated through the interpolation technique, and provides the high resolution image restored by the internal example image based high resolution reconstruction technique to the local image structure Can be provided to restore details.

잔여영상 복원부(320)는 저해상도 영상 및 자가 고해상도 영상에 대하여 기설정된 학습 데이터를 이용하여 잔여영상을 복원할 수 있다. The residual image restoration unit 320 may restore the residual image using the preset learning data for the low resolution image and the self high resolution image.

여기서, 잔여영상 복원부(320)는 전체 잔여영상 복원 네트워크(331) 및 지역 잔여영상 복원 네트워크(332)를 포함할 수 있다. Here, the residual image restoration unit 320 may include an entire residual image restoration network 331 and a region residual image restoration network 332. [

전체 잔여영상 복원 네트워크(331)는 외부 예시영상 기반으로 넓은 수용 영역과 잔여영상 학습을 기반으로 저해상도 영상을 이용하여 전반적인 전체 영상구조의 고주파 성분을 복원할 수 있다. The entire residual image restoration network 331 can restore the high frequency components of the overall image structure using the low resolution image based on the wide acceptance area and the residual image learning based on the external exemplar image.

이러한 전체 잔여영상 복원 네트워크(331)는 자가 고해상도 영상의 계단 왜곡 현상에 의해 왜곡된 전반적인 전체영상구조를 보완하여 왜곡 없는 고해상도 영상구조 복원하도록 외부 예시영상 기반의 넓은 수용영역을 갖도록 직렬로 연결된 복수의 잔여 블록이 형성될 수 있다. The entire residual image restoration network 331 may include a plurality of serial-connected image restoration areas 331, 332, 333, and 333 having a wide reception area based on the external image to compensate for the distortion of the entire image structure distorted by the step distortion of the self- A residual block may be formed.

더 구체적으로, 전체 잔여영상 복원 네트워크(331)는 잔여 블록을 통해 스킵 컨넥션과 집단 정규화 레이어를 이용하여 안정적인 그래디언트 값의 학습 과정을 수행할 수 있다. 각각의 잔여 블록을 통해 입력 특징 맵은 복수의 컨볼루셔널 레이어를 통과하며, 출력 특징 맵의 수는 입력 특징 맵의 수와 동일하게 유지될 수 있다. 또한, 전체 잔여영상 복원 네트워크(331)는 소정 크기의 수용 영역이 구성되어 N개의 잔여 블록과 마지막으로 컨볼루셔널 레이어를 통과하여 크기가 확장됨에 따라 전체 잔여영상을 복원하여 고해상도 영상구조를 복원할 수 있다. 예를 들어, 전체 잔여영상 복원 네트워크(331)는 각각의 잔여 블록을 통해 입력 특징 맵은 2 번의 3x3 컨볼루셔널 레이어(201, 204)를 지나며, 출력 특징 맵의 수는 입력 특징 맵의 수와 동일하게 64개로 유지될 수 있다. 3x3 크기의 수용 영역은 N개의 잔여 블록과 마지막 3x3 컨볼루셔널 레이어(222)를 지나 (4N+5)x(4N+5)의 크기로 확장되어 고해상도 영상구조를 복원할 수 있다.More specifically, the entire residual image restoration network 331 can perform a stable gradient value learning process using the skip connection and the group normalization layer through the remaining blocks. Through each residual block, the input feature map passes through a plurality of convolutional layers, and the number of output feature maps can remain the same as the number of input feature maps. In addition, the entire remaining image restoration network 331 reconstructs the entire residual image as the size of the entire remaining image restoration network 331 is expanded through the N remaining blocks and finally the convolutional layer, and restores the high resolution image structure . For example, the entire residual image restoration network 331 passes through two remaining 3x3 convolutional layers 201 and 204 through the respective remaining blocks, and the number of output feature maps becomes equal to the number of input feature maps It can be kept at 64 in the same manner. The 3x3 receiving area can be expanded to (4N + 5) x (4N + 5) through the N remaining blocks and the last 3x3 convolutional layer 222 to restore the high resolution image structure.

지역 잔여영상 복원 네트워크(332)는 내부 예시영상 기반의 자가 고해상도 영상을 반복적으로 나타나거나 영상 전체에 분포되어 있는 세밀한 지역적 영상구조의 고주파 성분을 복원할 수 있다. The local residual image restoration network 332 can restore a high-frequency component of a fine local image structure repeatedly appearing in the self-high-resolution image based on the internal example image or distributed over the entire image.

더 구체적으로, 지역 잔여영상 복원 네트워크(332)는 컨볼루셔널 레이어와 ReLu 함수로 이루어진 복수의 화소 부호화 레이어로 구성될 수 있다. 이 때, 출력 특징 맵은 주변 화소 정보의 간섭 없이 부호화된 자가 고해상도 영상의 화소 정보를 전달할 수 있다. 예를 들어, 지역 잔여영상 복원 네트워크(332)는 내부 예시영상 기반의 고해상도 정보를 3개의 1x1 컨볼루셔널 레이어(convolutional layer)를 활용하여 주변 화소의 간섭 없이 인코딩된 선명한 작은 영상구조의 고주파 성분을 복원할 수 있다.More specifically, the region residual image restoration network 332 may be composed of a plurality of pixel encoding layers made up of a convolutional layer and a ReLu function. At this time, the output feature map can transmit the pixel information of the self-encoded high-resolution image without interference of neighboring pixel information. For example, the local residual image restoration network 332 uses high-resolution information based on the internal example image using three 1x1 convolutional layers to generate a high-frequency component of a clear small image structure encoded without intervention of peripheral pixels Can be restored.

고해상도 복원부(330)는 저해상도 영상 및 자가 고해상도 영상과 잔여영상을 복원한 영상을 통해 입력된 영상을 고해상도 영상으로 복원할 수 있다. The high-resolution reconstruction unit 330 reconstructs a low-resolution image, a high-resolution image, and a reconstructed residual image into a high-resolution image.

실시예들에 따르면, 외부 예시영상 및 내부 예시영상의 상호보완 관계를 이용하여 깊은 컨볼루셔널 신경망(CNN) 기반으로 입력된 영상을 고해상도 복원할 수 있다.
According to embodiments, a high-resolution reconstruction of a deep convoluted neural network (CNN) -based image can be performed using the complementary relationship between the external exemplar image and the internal exemplar image.

도 4a 및 도 4b는 일 실시예에 따른 인공신경망을 이용한 단일 영상 고해상도 복원 방법을 나타내는 흐름도이다. 4A and 4B are flowcharts illustrating a single image high resolution restoration method using an artificial neural network according to an exemplary embodiment.

도 4a 및 도 4b를 참조하면, 일 실시예에 따른 인공신경망을 이용한 단일 영상 고해상도 복원 방법은 입력된 영상을 전처리하여 저해상도 영상 및 자가 고해상도 영상을 제공하는 단계(410), 저해상도 영상 및 자가 고해상도 영상에 대하여 기설정된 학습 데이터를 이용하여 잔여영상을 복원하는 단계(420) 및 저해상도 영상 및 자가 고해상도 영상과 잔여영상을 복원한 영상을 통해 입력된 영상을 고해상도 영상으로 복원하는 단계(430)를 포함할 수 있다. 내부 예시영상 및 외부 예시영상의 상호보완 관계를 이용하여 영상을 고해상도 복원할 수 있다.4A and 4B, a single image high resolution reconstruction method using an artificial neural network according to an exemplary embodiment includes a step 410 of providing a low resolution image and a high resolution image by pre-processing an input image, a low resolution image and a high resolution image (420) reconstructing a residual image using preset learning data and reconstructing a high-resolution image into a high-resolution image through a reconstructed low-resolution image, a self-high-resolution image, and a residual image . It is possible to reconstruct the image with high resolution by using the complementary relationship between the internal example image and the external example image.

여기서 상기 저해상도 영상 및 상기 자가 고해상도 영상에 대하여 기설정된 학습 데이터를 이용하여 잔여영상을 복원하는 단계(420)는, 전체 잔여영상 복원 네트워크를 통해 외부 예시영상 기반으로 넓은 수용 영역과 잔여영상 학습을 이용하여 상기 저해상도 영상의 전반적인 전체 영상구조의 고주파 성분을 복원하는 단계(421) 및 지역 잔여영상 복원 네트워크를 통해 내부 예시영상 기반의 상기 자가 고해상도 영상을 반복적으로 나타나거나 영상 전체에 분포되어 있는 세밀한 지역적 영상구조의 고주파 성분을 복원하는 단계(422)를 포함할 수 있다. In operation 420, restoring the residual image using the preset learning data for the low-resolution image and the self-high-resolution image may include using a wide acceptance region and residual image learning based on the external exemplar image through the entire residual image restoration network (421) reconstructing a high-frequency component of the entire overall image structure of the low-resolution image, and reconstructing the high-resolution local image of the low-resolution image repeatedly or through the local residual image restoration network, And recovering the high-frequency component of the structure (step 422).

여기서 일 실시예에 따른 인공신경망을 이용한 단일 영상 고해상도 복원 방법은 도 3에서 설명한 일 실시예에 따른 인공신경망을 이용한 단일 영상 고해상도 복원 시스템(300)을 통해 보다 구체적으로 설명할 수 있다. 일 실시예에 따른 인공신경망을 이용한 단일 영상 고해상도 복원 시스템(300)은 영상 제공부(310), 잔여영상 복원부(320) 및 고해상도 복원부(330)를 포함하여 이루어질 수 있다. 여기서, 잔여영상 복원부(320)는 전체 잔여영상 복원 네트워크(331) 및 지역 잔여영상 복원 네트워크(332)를 포함할 수 있다. Here, the single image high resolution reconstruction method using the artificial neural network according to one embodiment can be explained more concretely through the single image high resolution reconstruction system 300 using the artificial neural network according to the embodiment shown in FIG. The single image high resolution reconstruction system 300 using the artificial neural network according to an exemplary embodiment may include an image providing unit 310, a residual image restoring unit 320, and a high resolution restoring unit 330. Here, the residual image restoration unit 320 may include an entire residual image restoration network 331 and a region residual image restoration network 332. [

단계(410)에서, 영상 제공부(310)는 입력된 영상을 전처리하여 저해상도 영상 및 자가 고해상도 영상을 제공할 수 있다. In operation 410, the image providing unit 310 may process the input image to provide a low-resolution image and a self-high-resolution image.

더 구체적으로, 영상 제공부(310)는 보간 기술을 통해 보간된 저해상도 영상을 전반적인 영상구조를 유지하기 위한 가이드 영상으로 제공하고, 내부 예시영상 기반의 고해상도 복원 기술을 이용하여 복원한 자가 고해상도 영상을 지역적 영상구조의 세밀한 부분을 복원하기 위해 제공할 수 있다. More specifically, the image data providing unit 310 provides the low-resolution image interpolated through the interpolation technology as a guide image for maintaining the overall image structure, and the high-resolution image reconstructed using the internal example image-based high- Can be provided to reconstruct the details of the local image structure.

단계(420)에서, 잔여영상 복원부(320)는 저해상도 영상 및 자가 고해상도 영상에 대하여 기설정된 학습 데이터를 이용하여 잔여영상을 복원할 수 있다. In operation 420, the residual image restoration unit 320 may restore the residual image using the preset training data for the low-resolution image and the self-high-resolution image.

더 구체적으로, 단계(421)에서, 전체 잔여영상 복원 네트워크(331)를 통해 외부 예시영상 기반으로 넓은 수용 영역과 잔여영상 학습을 기반으로 저해상도 영상을 이용하여 전반적인 전체 영상구조의 고주파 성분을 복원할 수 있다. More specifically, in step 421, a high-frequency component of the overall image structure is restored using a low-resolution image based on the wide acceptance area and the residual image learning based on the external exemplar image based on the entire remaining image restoration network 331 .

또한, 단계(422)에서, 지역 잔여영상 복원 네트워크(332)를 통해 내부 예시영상 기반의 자가 고해상도 영상을 반복적으로 나타나거나 영상 전체에 분포되어 있는 세밀한 지역적 영상구조의 고주파 성분을 복원할 수 있다.Also, in step 422, the high-frequency component of the detailed local image structure repeatedly appearing on the internal example-image-based self-high-resolution image repeatedly or distributed over the entire image can be restored through the local residual image restoration network 332.

단계(430)에서, 고해상도 복원부(330)는 저해상도 영상 및 자가 고해상도 영상과 잔여영상을 복원한 영상을 통해 입력된 영상을 고해상도 영상으로 복원할 수 있다. In operation 430, the high-resolution reconstruction unit 330 reconstructs a low-resolution image, a self-high-resolution image, and a reconstructed residual image into a high-resolution image.

이에 따라 외부 예시영상 및 내부 예시영상의 상호보완 관계를 이용하여 깊은 컨볼루셔널 신경망(CNN) 기반으로 입력된 영상을 고해상도 복원할 수 있다.
Accordingly, it is possible to reconstruct a high-resolution image based on a deep convolutive neural network (CNN) using the complementary relationship between the external exemplar image and the internal exemplar image.

도 5는 일 실시예에 따른 깊은 CNN 모델의 분석을 설명하기 위한 도면이다. FIG. 5 is a diagram for explaining an analysis of a deep CNN model according to an embodiment.

도 5는 일 실시예에 따른 깊은 CNN 모델의 결과를 비교 분석하기 위한 것으로, 보다 구체적으로 (a)는 Bicubic 보간 영상을 나타내고, (b)는 지역 잔여영상 복원 네트워크 결과를 나타내며, (c)는 다중 입력 VDSR 모델 결과를 나타내고, (d)는 일 실시예에 따른 깊은 CNN 모델을 이용한 결과를 나타낸다. 여기서 깊은 CNN 모델은 인공신경망을 이용한 단일 영상 고해상도 복원 시스템 또는 방법을 의미할 수 있으며, 간단히 영상 고해상도 복원 기술이라 할 수 있다. Figure 5 is a block diagram (A) shows a Bicubic interpolation image, (b) shows a residual residual image restoration network result, (c) shows a result of a multiple input VDSR model, (d) shows the result using the deep CNN model according to one embodiment. Here, the deep CNN model can represent a single image high resolution reconstruction system or method using an artificial neural network, and is simply an image high resolution reconstruction technique.

고해상도 영상을 복원하기 위한 개별적인 부분 네트워크의 기여도를 분석하기 위해 동일한 입력영상에 대해 지역 잔여영상 복원 네트워크만을 이용한 고해상도 복원을 진행할 수 있다. 반복적인 고해상도 영상구조 영역의 경우, 지역적 잔여영상 복원 네트워크는 자가 고해상도 영상의 부호화된 화소 정보전달을 통해 (a)의 지역적인 작은 영상구조의 모호성을 제거할 수 있다(b). 그 결과, 전체 잔여영상 복원 네트워크는 보다 선명한 영상에 대하여 영상구조의 고주파 성분 복원을 수행할 수 있다. In order to analyze the contribution of individual partial networks for reconstructing a high-resolution image, a high-resolution reconstruction using only the region-residual image restoration network can be performed on the same input image. In the case of a repetitive high-resolution image structure region, the local residual image restoration network can remove the ambiguity of the local small image structure of (a) through the transmission of the encoded pixel information of the self-high resolution image (b). As a result, the entire residual image restoration network can perform high frequency component restoration of the image structure for a clearer image.

이러한 2개의 부분 네트워크의 협력을 통한 일 실시예에 따른 영상 고해상도 복원 기술은, (d)에 도시된 바와 같이, 영상구조 왜곡 및 계단 왜곡 현상 없이 개별적인 외부 또는 내부 예시영상 기반의 복원 기술의 결과보다 선명한 고해상도 영상을 복원할 수 있다. The image high-resolution reconstruction technique according to an embodiment through cooperation of the two partial networks can be achieved by using a reconstruction technique based on the result of the reconstruction technique based on an individual external or internal example image without image structure distortion and step- A clear high-resolution image can be restored.

추가적으로 일 실시예에 따른 깊은 CNN 모델의 대안으로 다중 입력 레이어의 VDSR 모델을 고려할 수도 있다(c). Additionally, a VDSR model of a multiple input layer may be considered as an alternative to the deep CNN model according to one embodiment (c).

도 6은 일 실시예에 따른 영상의 고해상도 복원 결과를 비교하여 나타내는 도면이다. FIG. 6 is a diagram illustrating a result of high resolution reconstruction of an image according to an exemplary embodiment.

도 6을 참조하면, (a)는 입력영상을 나타내고, (b)는 다중 입력 VDSR 모델 결과를 나타내며, (c)는 일 실시예에 따른 깊은 CNN 모델을 이용한 결과를 나타낸다. Referring to FIG. 6, (a) shows an input image, (b) shows a result of a multiple input VDSR model, and (c) shows a result using a deep CNN model according to an embodiment.

다중 입력 VDSR 모델과 일 실시예에 따른 깊은 CNN 모델을 이용한 결과를 비교한 결과, 다중 입력 레이어의 VDSR 모델은 작은 영상구조에 대하여 성능향상을 보일 수 있지만, 고주파 영상구조의 에지 영역에 대하여 내부 예시영상 기반의 고해상도 기술의 단점을 동일하게 학습하는 단점을 가지고 있다. As a result of comparing the results using the multi-input VDSR model and the deep CNN model according to an embodiment, the VDSR model of the multi-input layer can show a performance improvement with respect to a small image structure. However, Has the disadvantage of learning the disadvantages of image-based high-resolution technology equally.

반면, 일 실시예에 따른 깊은 CNN 모델을 이용한 결과의 경우 작은 영상구조뿐 아니라, 고주파 영상구조의 에지 영역에 대하여 성능향상을 보인다.
On the other hand, in the case of using the deep CNN model according to an embodiment, the performance is improved not only for the small image structure but also for the edge area of the high frequency image structure.

아래에서는 일 실시예에 따른 깊은 CNN 모델의 학습 방법에 대해 하나의 예를 들어 보다 구체적으로 설명한다. Below One example of a learning method of a deep CNN model according to an embodiment will be described in more detail.

예를 들어, 일 실시예에 따른 깊은 CNN 모델의 학습데이터는 연구(비특허문헌 2)의 291개의 자연 영상의 작은 영상 패치로 구성하며 패치에 대하여 반전 및 회전 변환을 적용하여 그 수를 확대할 수 있다. 깊은 CNN 모델의 입력을 위한 자가 고해상도 영상은 SelfExSR을 이용하여 복원하며 매개 변수는 연구(비특허문헌 8)의 기본 설정 값을 이용할 수 있다. 깊은 CNN 모델을 다양한 해상도 복원 환산 계수에 대하여 학습하기 위해 2, 3 및 4배의 해상도로 축소 후 Bicubic 보간 기술로 복원된 학습데이터를 사용할 수 있다. 학습은 역전파 기술(back-propagation)(비특허문헌 11)을 기반으로 작은 집단 기반 경사 하강법(min-batch gradient descent)을 이용하며, 이때 집단 크기는 64, 가속도(momentum) 0.9와 L2 가중치 정규화 파라미터(L2-weighted decay parameter)는 0.0001로 설정할 수 있다. 또한 빠른 학습 속도를 위해 그래디언트 클리핑(gradient clipping)(비특허문헌 4) 기술을 이용할 수 있다. 초기 학습율(learning rate)은 0.01로 설정하며 매 10번의 학습 에포크(epoch)마다 0.1씩 감소시킬 수 있다. 학습율은 40번의 학습 에포크 이후 고정하며 60번의 학습 에포크 이후 학습을 중단할 수 있다.
For example, the training data of the deep CNN model according to one embodiment is composed of a small image patch of 291 natural images of the study (Non-Patent Document 2), and the number of the training images is increased by applying inversion and rotation transformation to the patch . Self-high resolution images for deep CNN model input are restored using SelfExSR, and parameters can be used as default values of the study (Non-Patent Document 8). In order to study deep CNN model with various resolution restoration conversion factors, it is possible to use restored learning data by Bicubic interpolation technique after reducing to 2, 3 and 4 times resolution. The learning uses a min-batch gradient descent based on back-propagation (non-patent document 11), where the population size is 64, the momentum is 0.9 and the L2 weight The L2-weighted decay parameter can be set to 0.0001. Also, gradient clipping (non-patent document 4) technique can be used for fast learning speed. The initial learning rate is set to 0.01 and can be reduced by 0.1 for every 10 learning epochs. The learning rate is fixed after 40 learning epochs and can be stopped after 60 learning epochs.

아래에서는 다양한 고해상도 영상 복원 기술과 일 실시예에 따른 영상 고해상도 복원 기술의 성능 비교분석을 수행한다. Hereinafter, performance comparison and analysis of various high resolution image restoration techniques and image high resolution restoration techniques according to an embodiment will be performed.

일례로, 실험 환경은 Intel Core i5 CPU(3.2GHz), 12GB RAM과 NVIDIA Titan GPU이며, 다양한 자연 영상 데이터 셋(비특허문헌 12,13,14) 및 도심 영상 데이터 셋(비특허문헌 8) 및 8배의 고해상도 복원 환산 계수에 대한 정량적 평가를 위해 Sun-Hays80(비특허문헌 15)을 이용한다.For example, the experimental environment is an Intel Core i5 CPU (3.2 GHz), 12 GB RAM and an NVIDIA Titan GPU, and various natural image data sets (non-patent documents 12, 13 and 14) and an urban image data set We use Sun-Hays80 (non-patent document 15) for quantitative evaluation of 8x high resolution reconstruction conversion factor.

VDSR을 벤치마크(benchmark)하기 위해 전반적 잔여영상 복원 네트워크의 잔여 블록을 9개로 설정하여 41x41의 동일한 수용 영역을 갖도록 구성하며, RGB 3채널의 영상 대신 단일 휘도 영상만을 복원할 수 있다. 분석을 위한 (비특허문헌 5,6,8)의 고해상도 복원 결과는 Huang의 웹 페이지(비특허문헌 16) 벤치마크 결과를 이용할 수 있다. 일 실시예에 따른 영상 고해상도 복원 기술은 MatConvNet Deep Learning Toolbox(비특허문헌 9)를 이용하여 구현할 수 있으며, 학습시간은 약 10시간 소요될 수 있다. 나아가 일 실시예에 따른 영상 고해상도 복원 기술을 이용하여 다양한 내부 예시영상 기반의 고해상도 영상 복원 기술들의 성능향상 실험을 수행할 수 있다.
In order to benchmark the VDSR, the residual block of the overall residual image restoration network is set to 9, so that it has the same receiving area of 41x41, and only a single luminance image can be restored instead of the RGB three channel image. The high-resolution restoration results for the analysis (non-patent documents 5, 6, 8) can use the benchmark results of Huang's web page (non-patent document 16). The high-resolution image restoration technique according to one embodiment can be implemented using the MatConvNet Deep Learning Toolbox (Non-Patent Document 9), and the learning time can take about 10 hours. Further, performance enhancement experiments of various high resolution image restoration techniques based on the internal example images can be performed using the image high resolution restoration technique according to the embodiment.

정성적 비교평가Qualitative comparison evaluation

도 7은 일 실시예에 따른 Urban100 데이터 셋에 대한 4배 고해상도 복원 결과를 나타낸다. FIG. 7 shows a 4x high resolution reconstruction result for the Urban100 data set according to an embodiment.

보다 구체적으로, 도 7의 (a)는 VDSR(비특허문헌 4)이고, (b)는 SelfExSR(비특허문헌 8)이며, (c)는 일 실시예에 따른 영상 고해상도 복원 기술이고, (d)는 원본 영상을 나타낸다. 이는, 반복적인 고해상도 영상구조를 가지는 도로 외부 및 내부 예시영상 기반의 고해상도 복원 기술들의 성능평가에 효과적이다. More specifically, FIG. 7A is a VDSR (non-patent document 4), FIG. 7B is a SelfExSR (non-patent document 8), FIG. 7C is an image high resolution restoration technique according to an embodiment, ) Represents the original image. This is effective for evaluating the performance of high resolution reconstruction techniques based on road external and internal example images having repetitive high resolution image structures.

도 7a에 도시된 바와 같이, VDSR의 경우, 고주파 영상구조에 특화된 복원 기술로 에지 부분의 큰 고해상도 영상구조에서 선명한 복원 결과를 보인다. 그러나, 지역적인 작은 고해상도 영상구조에 대하여 충분한 고해상도 정보를 취득하지 못해 왜곡된 복원 결과를 나타낸다. As shown in FIG. 7A, in the case of VDSR, a restoration technique specialized for a high-frequency image structure shows a vivid restoration result in a large high-resolution image structure of an edge portion. However, sufficient high-resolution information can not be obtained for a local small-resolution image structure, and thus a distorted restoration result is displayed.

도 7b에 도시된 바와 같이, SelfExSR의 경우, 작은 고해상도 영상구조에 대하여 자가 예시 고해상도 패치 공간을 통해 충분한 고해상도 정보를 취득하여 선명한 고해상도 영상을 복원한다. 반면, 작은 자가 예시 고해상도 패치 기반의 복원 방식에 의해 큰 고해상도 영상구조에 대하여 계단 왜곡 현상을 야기한다. As shown in FIG. 7B, in the case of SelfExSR, sufficient high resolution information is acquired through a self-illustrated high resolution patch space for a small high resolution image structure to restore a clear high resolution image. On the other hand, a small-sized high-resolution patch-based reconstruction method causes a staircase distortion phenomenon for a large high-resolution image structure.

도 7c에 도시된 바와 같이, 일 실시예에 따른 영상 고해상도 복원 기술의 경우, 단순한 각 복원 기술의 단점 보완에서 나아가 각 복원 기술의 장점을 활용하여 성능향상을 도모하여 보다 세밀한 고해상도 영상구조를 복원할 수 있다. As shown in FIG. 7C, in the case of the image high-resolution restoration technique according to the embodiment, the merit of the simple restoration technique is supplemented, and the performance of the restoration technique is improved by utilizing the merits of each restoration technique. .

나아가, 8배의 해상도 복원 환산 계수에 대하여 고해상도 복원 실험을 수행할 수 있다. 일 실시예에 따른 영상 고해상도 복원 기술은 4배 이상의 해상도 복원 환산 계수에 대하여 학습을 수행하지 않았으므로 SelfExSR의 단계적 고해상도 영상 복원 방법을 이용할 수 있다. 일 실시예에 따른 영상 고해상도 복원 기술은 SelfExSR의 중간 결과인 2, 4와 8배의 복원된 고해상도 영상에 적용하여 중간 결과를 개선할 수 있다. 비슷한 접근방법으로 VDSR은 8배의 고해상도 영상을 복원을 위해 2배의 해상도 환산 계수에 대하여 총 3번의 고해상도 복원을 수행한다. Furthermore, a high resolution reconstruction experiment can be performed on an 8x resolution reconstruction conversion factor. Since the image high resolution reconstruction technique according to the embodiment does not perform learning on the resolution reconstruction conversion factor of 4 times or more, the stepwise high resolution image restoration method of SelfExSR can be used. The image high resolution reconstruction technique according to the embodiment can improve the intermediate result by applying the intermediate result of 2, 4 and 8 times to the restored high resolution image of SelfExSR. In a similar approach, VDSR performs a total of three high-resolution reconstructions for a 2x resolution conversion factor to restore 8x high-resolution images.

도 8은 일 실시예에 따른 Sun-hays80 데이터 셋에 대한 8배 고해상도 복원 결과를 나타낸다. FIG. 8 shows an 8x high resolution reconstruction for a Sun-hays80 dataset according to one embodiment.

보다 구체적으로, 도 8의 (a)는 VDSR(비특허문헌 4)를 나타내고, (b)는 SelfExSR(비특허문헌 8)를 나타내며, (c)는 일 실시예에 따른 영상 고해상도 복원 기술을 나타내고, (d)는 원본 영상을 나타낸다. More specifically, FIG. 8A shows a VDSR (non-patent document 4), FIG. 8B shows a SelfExSR (non-patent document 8), FIG. 8C shows an image high- , (d) show the original image.

도 8을 통해, 8배의 고해상도 영상 복원 시, 일 실시예에 따른 영상 고해상도 복원 기술이 기존 기술들(비특허문헌 4,8)에 비해 선명한 울타리 형태를 복원하는 것을 확인할 수 있다. 추가적인 일 실시예에 따른 Sun-hays80 데이터 셋에 대한 고해상도 복원 결과는 도 9에 나타나 있다.
8, it can be seen that the image reconstruction technique according to the embodiment of the present invention restores a clear fence shape compared to the existing techniques (Non-Patent Documents 4 and 8) in the case of restoring an 8 times higher resolution image. The high resolution reconstruction results for the Sun-hays80 dataset according to a further embodiment are shown in FIG.

정량적 비교평가Quantitative comparison evaluation

다양한 외부 예시영상(비특허문헌 3,4) 및 내부 예시영상(비특허문헌 5,6,8) 기반의 고해상도 복원 기술에 대하여 정량적 비교평가를 수행할 수 있다. It is possible to perform quantitative comparison evaluation on high resolution reconstruction techniques based on various external exemplar images (non-patent documents 3 and 4) and internal example images (non-patent documents 5, 6 and 8).

표 1은 Set5(비특허문헌 12), Set14(비특허문헌 13), BSD100(비특허문헌 14)과 Urban100(비특허문헌 8) 데이터 셋에 대한 정량적 비교평가 결과(PSNR/SSIM)를 나타낸다. Table 1 shows the quantitative comparison evaluation result (PSNR / SSIM) for Set 5 (non-patent document 12), Set 14 (non-patent document 13), BSD 100 (non-patent document 14) and Urban 100 (non-patent document 8) data sets.

[표 1][Table 1]

표 1에서 가장 높은 정량적 수치는 빨간색으로, 두 번째 높은 정량적 수치는 파란색으로 표기되어 있다. In Table 1, the highest quantitative figure is indicated in red, and the second highest quantitative figure is indicated in blue.

일 실시예에 따른 영상 고해상도 복원 기술은 자연 영상 데이터 셋에 대하여 기존 기술 중 최고의 성능을 보이는 VDSR보다 미세하게 높은 정량적 수치를 보인다. 그러나, 반복적인 고해상도 영상구조가 많은 Urban100 데이터 셋에 대하여 일 실시예에 따른 영상 고해상도 복원 기술은 VDSR보다 큰 PSNR 증가 폭(0.3dB)을 보여 확연한 성능 차이를 확인할 수 있다. According to one embodiment, the image high resolution restoration technique has a finer quantitative value than the VDSR which shows the best performance among the existing technologies for the natural image data set. However, for the Urban 100 data set having many repetitive high-resolution image structures, the image high-resolution restoration technique according to one embodiment shows a PSNR increase width (0.3 dB) larger than the VDSR.

나아가, 일 실시예에 따른 깊은 CNN 모델의 재학습을 진행하지 않고 기존의 내부 예시영상 기반의 고해상도 복원 기술들(비특허문헌 5,6) 기반의 자가 고해상도 영상을 입력으로 이용하여 기존 기술들의 성능 향상 실험을 수행할 수 있다. Furthermore, without re-learning of the deep CNN model according to an embodiment, it is possible to use a self-high-resolution image based on existing internal example image-based high-resolution restoration techniques (non-patent documents 5 and 6) An improvement experiment can be performed.

실험 결과, 일 실시예에 따른 영상 고해상도 복원 기술을 이용한 기존 복원 기술들의 성능이 기존 성능 대비 1dB 이상 향상된 높은 정량적 수치를 확인할 수 있다.As a result, it can be seen that the performance of existing restoration techniques using the image high resolution restoration technique according to one embodiment is higher than the existing performance by a high quantitative value which is improved by 1 dB or more.

실시예들에 따르면, 기존의 다양한 고해상도 복원 기술 중 예시영상을 이용한 고해상도 복원 기법(example-driven SISR)을 기반으로 CNN을 이용한 새로운 고해상도 복원 기술을 제공할 수 있다. 이러한 실시예들에 따른 기술은 외부 및 내부 예시영상 기반의 고해상도 복원 기법(external and internal example-based SISR)의 상호보완적인 관계를 이용하여 기존의 외부 또는 내부 예시영상 기반의 고해상도 복원 기법들의 단점을 보완하고 각 기법의 장점을 보강하여 향상된 고해상도 영상을 복원할 수 있다.
According to embodiments, a new high-resolution reconstruction technique using CNN based on an example-driven SISR using an exemplary image among various existing high-resolution reconstruction techniques can be provided. The techniques according to these embodiments utilize the complementary relationship between the external and internal example-based SISR (external and internal example-based SISR) to overcome the drawbacks of the existing external or internal example image-based high resolution reconstruction techniques And enhances the merits of each technique to restore an enhanced high-resolution image.

이상에서 설명된 장치는 하드웨어 구성요소, 소프트웨어 구성요소, 및/또는 하드웨어 구성요소 및 소프트웨어 구성요소의 조합으로 구현될 수 있다. 예를 들어, 실시예들에서 설명된 장치 및 구성요소는, 예를 들어, 프로세서, 컨트롤러, ALU(arithmetic logic unit), 디지털 신호 프로세서(digital signal processor), 마이크로컴퓨터, FPA(field programmable array), PLU(programmable logic unit), 마이크로프로세서, 또는 명령(instruction)을 실행하고 응답할 수 있는 다른 어떠한 장치와 같이, 하나 이상의 범용 컴퓨터 또는 특수 목적 컴퓨터를 이용하여 구현될 수 있다. 처리 장치는 운영 체제(OS) 및 상기 운영 체제 상에서 수행되는 하나 이상의 소프트웨어 애플리케이션을 수행할 수 있다. 또한, 처리 장치는 소프트웨어의 실행에 응답하여, 데이터를 접근, 저장, 조작, 처리 및 생성할 수도 있다. 이해의 편의를 위하여, 처리 장치는 하나가 사용되는 것으로 설명된 경우도 있지만, 해당 기술분야에서 통상의 지식을 가진 자는, 처리 장치가 복수 개의 처리 요소(processing element) 및/또는 복수 유형의 처리 요소를 포함할 수 있음을 알 수 있다. 예를 들어, 처리 장치는 복수 개의 프로세서 또는 하나의 프로세서 및 하나의 컨트롤러를 포함할 수 있다. 또한, 병렬 프로세서(parallel processor)와 같은, 다른 처리 구성(processing configuration)도 가능하다.The apparatus described above may be implemented as a hardware component, a software component, and / or a combination of hardware components and software components. For example, the apparatus and components described in the embodiments may be implemented within a computer system, such as, for example, a processor, controller, arithmetic logic unit (ALU), digital signal processor, microcomputer, field programmable array (FPA) A programmable logic unit (PLU), a microprocessor, or any other device capable of executing and responding to instructions. The processing device may execute an operating system (OS) and one or more software applications running on the operating system. The processing device may also access, store, manipulate, process, and generate data in response to execution of the software. For ease of understanding, the processing apparatus may be described as being used singly, but those skilled in the art will recognize that the processing apparatus may have a plurality of processing elements and / As shown in FIG. For example, the processing apparatus may comprise a plurality of processors or one processor and one controller. Other processing configurations are also possible, such as a parallel processor.

소프트웨어는 컴퓨터 프로그램(computer program), 코드(code), 명령(instruction), 또는 이들 중 하나 이상의 조합을 포함할 수 있으며, 원하는 대로 동작하도록 처리 장치를 구성하거나 독립적으로 또는 결합적으로(collectively) 처리 장치를 명령할 수 있다. 소프트웨어 및/또는 데이터는, 처리 장치에 의하여 해석되거나 처리 장치에 명령 또는 데이터를 제공하기 위하여, 어떤 유형의 기계, 구성요소(component), 물리적 장치, 가상 장치(virtual equipment), 컴퓨터 저장 매체 또는 장치에 구체화(embody)될 수 있다. 소프트웨어는 네트워크로 연결된 컴퓨터 시스템 상에 분산되어서, 분산된 방법으로 저장되거나 실행될 수도 있다. 소프트웨어 및 데이터는 하나 이상의 컴퓨터 판독 가능 기록 매체에 저장될 수 있다.The software may include a computer program, code, instructions, or a combination of one or more of the foregoing, and may be configured to configure the processing device to operate as desired or to process it collectively or collectively Device can be commanded. The software and / or data may be in the form of any type of machine, component, physical device, virtual equipment, computer storage media, or device As shown in FIG. The software may be distributed over a networked computer system and stored or executed in a distributed manner. The software and data may be stored on one or more computer readable recording media.

실시예에 따른 방법은 다양한 컴퓨터 수단을 통하여 수행될 수 있는 프로그램 명령 형태로 구현되어 컴퓨터 판독 가능 매체에 기록될 수 있다. 상기 컴퓨터 판독 가능 매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 상기 매체에 기록되는 프로그램 명령은 실시예를 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 당업자에게 공지되어 사용 가능한 것일 수도 있다. 컴퓨터 판독 가능 기록 매체의 예에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체(magnetic media), CD-ROM, DVD와 같은 광기록 매체(optical media), 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical media), 및 롬(ROM), 램(RAM), 플래시 메모리 등과 같은 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다. 프로그램 명령의 예에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다. The method according to an embodiment may be implemented in the form of a program command that can be executed through various computer means and recorded in a computer-readable medium. The computer-readable medium may include program instructions, data files, data structures, and the like, alone or in combination. The program instructions to be recorded on the medium may be those specially designed and configured for the embodiments or may be available to those skilled in the art of computer software. Examples of computer-readable media include magnetic media such as hard disks, floppy disks and magnetic tape; optical media such as CD-ROMs and DVDs; magnetic media such as floppy disks; Magneto-optical media, and hardware devices specifically configured to store and execute program instructions such as ROM, RAM, flash memory, and the like. Examples of program instructions include machine language code such as those produced by a compiler, as well as high-level language code that can be executed by a computer using an interpreter or the like.

이상과 같이 실시예들이 비록 한정된 실시예와 도면에 의해 설명되었으나, 해당 기술분야에서 통상의 지식을 가진 자라면 상기의 기재로부터 다양한 수정 및 변형이 가능하다. 예를 들어, 설명된 기술들이 설명된 방법과 다른 순서로 수행되거나, 및/또는 설명된 시스템, 구조, 장치, 회로 등의 구성요소들이 설명된 방법과 다른 형태로 결합 또는 조합되거나, 다른 구성요소 또는 균등물에 의하여 대치되거나 치환되더라도 적절한 결과가 달성될 수 있다.While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is to be understood that the invention is not limited to the disclosed exemplary embodiments. For example, it is to be understood that the techniques described may be performed in a different order than the described methods, and / or that components of the described systems, structures, devices, circuits, Lt; / RTI > or equivalents, even if it is replaced or replaced.

그러므로, 다른 구현들, 다른 실시예들 및 특허청구범위와 균등한 것들도 후술하는 특허청구범위의 범위에 속한다.Therefore, other implementations, other embodiments, and equivalents to the claims are also within the scope of the following claims.

Claims

In a single image super-resolution method using an artificial neural network,
Processing the input image to provide a low-resolution image and a self-high-resolution image;
Reconstructing a residual image using predetermined learning data for the low-resolution image and the self-high-resolution image; And
And reconstructing the input image into a high-resolution image through the reconstructed low-resolution image, the self-high-resolution image, and the residual image
Lt; / RTI >
And restoring the residual image using the preset learning data for the low-resolution image and the self-high-
A high frequency component of the entire overall image structure of the low resolution image is restored using a wide acceptance area and residual image learning through a plurality of residual block structures connected in series based on the external exemplar image through the entire residual image restoration network step; And
The local residual image restoration network is used to display the self-high-resolution image based on the internal example image repeatedly using three 1x1 convolutional layers so as to be encoded without interference of neighboring pixels, or a detailed local Restoring the high-frequency component of the image structure
Lt; / RTI >
Reconstructing a high-resolution image using the complementary relationship between the internal exemplar image and the external exemplar image,
Processing the input image to provide a low-resolution image and a self-high-resolution image,
The interpolation technique provides the low resolution image as a guide image for maintaining the overall image structure and restores the detailed high resolution image of the local image structure using the high resolution reconstruction technique based on the internal example image , &Lt; / RTI >
The high-resolution reconstruction of the input image based on a deep convolutional neural network (CNN) using the complementary relationship between the external exemplar image and the internal exemplar image
A single image high resolution reconstruction method using an artificial neural network.

delete

In a single image super-resolution system using an artificial neural network,
An image providing unit for providing a low-resolution image and a self-high-resolution image by preprocessing the input image;
A residual image restoring unit for restoring the residual image using the preset learning data for the low resolution image and the self high resolution image; And
And a high-resolution reconstruction unit that reconstructs the input image into a high-resolution image through the reconstructed low-resolution image, the self-high-
Lt; / RTI >
The residual image reconstructing unit reconstructs the residual image,
An entire residual image restoration network for restoring a high frequency component of an entire overall image structure of the low resolution image using a wide acceptance area and residual image learning through a plurality of residual block structures connected in series based on an external exemplar image; And
The high-frequency component of the detailed local image structure that appears repeatedly or distributed all over the image is restored by utilizing three 1x1 convolutional layers so that the self-high resolution image based on the internal example image is encoded without interference of peripheral pixels. Local residual image restoration network
Lt; / RTI >
Reconstructing a high-resolution image using the complementary relationship between the internal exemplar image and the external exemplar image,
The image providing unit,
The interpolation technique provides the low resolution image as a guide image for maintaining the overall image structure and restores the detailed high resolution image of the local image structure using the high resolution reconstruction technique based on the internal example image , &Lt; / RTI >
The high-resolution reconstruction of the input image based on a deep convolutional neural network (CNN) using the complementary relationship between the external exemplar image and the internal exemplar image
A Single Image High Resolution Restoration System Using Artificial Neural Network.

delete

The method of claim 3,
The area residual image restoration network includes:
And a plurality of pixel encoding layers each of which is composed of a convolutional layer and a ReLu (Rectified Linear Unit) function. The output feature map includes pixel information of the self-high-resolution image encoded without interfering with neighboring pixel information
A Single Image High Resolution Restoration System Using Artificial Neural Network.

The method of claim 3,
Wherein the entire residual image restoration network comprises:
A plurality of the residual blocks connected in series so as to have a wide receiving area based on the external exemplar image are formed so as to recover the distortionless high-resolution image structure by complementing the overall overall image structure distorted by the step-distortion phenomenon of the self-high- Being
A Single Image High Resolution Restoration System Using Artificial Neural Network.

The method according to claim 6,
Wherein the entire residual image restoration network comprises:
A gradient gradient value learning process is performed using a skip-connection and a batch normalization layer through the residual block, and an input feature map is classified into a plurality of consecutive blocks The number of output feature maps remains the same as the number of input feature maps, and the acceptance area of a predetermined size passes through the N remaining blocks and finally the convolutional layer, Restoring the residual image to restore the high-resolution image structure
A Single Image High Resolution Restoration System Using Artificial Neural Network.