KR102206792B1

KR102206792B1 - Method for image denoising using parallel feature pyramid network, recording medium and device for performing the method

Info

Publication number: KR102206792B1
Application number: KR1020190175355A
Authority: KR
Inventors: 고성제; 엄광현; 조성진
Original assignee: 고려대학교 산학협력단
Priority date: 2019-12-26
Filing date: 2019-12-26
Publication date: 2021-01-22

Abstract

A method of removing noise in a network image by using a parallel feature pyramid includes: generating a feature map by performing a preset number of convolution operations on an input image; generating n-layer pyramid feature maps by downsizing the feature map to have mutually different scales through a pyramid (spatial pyramid pooling) module; converting the n-layer pyramid feature maps having mutually different scales into unique scales of corresponding layers through n multi-scale context aggregation (MSCA) modules; and integrating information of the feature maps output from the MSCA modules to output the integrated information as one image data. Accordingly, in consideration of various scales in the image, noise removal performance is increased, and unnecessary artifacts are reduced, so that an image having excellent quality is provided.

Description

A method of removing noise in a network image using a parallel feature pyramid, a recording medium and a device for performing it

본 발명은 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 방법, 이를 수행하기 위한 기록 매체 및 장치에 관한 것으로서, 더욱 상세하게는 합성곱 신경망(Convolutional Neural Network, CNN) 중 DnCNN(DeNoise CNN)을 기반으로 영상의 잡음 제거 성능을 향상시키는 기술에 관한 것이다.The present invention relates to a method for removing noise in a network image using a parallel feature pyramid, a recording medium and an apparatus for performing the same, and more particularly, based on a DeNoise CNN (DnCNN) among a convolutional neural network (CNN). It relates to a technology for improving the noise reduction performance of an image.

[국가지원 연구개발에 대한 설명][Explanation of nationally supported R&D]

본 연구는 2019년도 정부(과학기술정보통신부)의 재원으로 정보통신기획평가원의 지원을 받아 수행된 연구이다(No. 2014-3-00077, 대규모 실시간 비디오 분석에 의한 전역적 다중 관심객체 추적 및 상황 예측 기술 개발).This study was conducted with the support of the Information Communication Planning and Evaluation Institute with the financial resources of the government (Ministry of Science, ICT, and ICT) in 2019 (No. 2014-3-00077, Global multi-object tracking and situation based on large-scale real-time video analysis. Development of predictive skills).

영상 내 잡음 제거는 카메라를 활용하는 장비에서 필수적인 기술이다. 카메라 센서들이 많이 발전하고 있지만 여전히 이미지 획득 과정에서 영상에 잡음이 섞이는 것은 불가피하다. 따라서, 영상의 잡음을 제거하는 기술은 카메라의 이미지 신호 처리 모듈에도 적용되기도 한다. Noise removal in the image is an essential technology in equipment using cameras. Although camera sensors are developing a lot, it is still inevitable that noise is mixed in the image during the image acquisition process. Therefore, a technology for removing noise from an image is also applied to an image signal processing module of a camera.

영상 내 잡음 제거 기술에서 가장 중요한 점은 영상의 디테일을 유지하면서 영상의 잡음만 제거하는 것이다. 잡음 제거에만 신경을 쓴다면 영상의 디테일이 사라져버려 자칫 흐릿한 영상이 되어버릴 수 있다. The most important point in image noise removal technology is to remove only image noise while maintaining image detail. If you only pay attention to noise removal, the details of the image may disappear, resulting in a blurry image.

최근에는 영상의 활용도가 매우 높아지고 있고 영상을 활용한 다양한 기술이 발전하고 있기 때문에 깨끗한 영상을 얻는 것의 중요성은 더욱더 높아지고 있다.In recent years, the use of images is very high, and since various technologies using images are developing, the importance of obtaining a clean image is increasing more and more.

영상 내 잡음 제거를 위해 수십년 전부터 다양한 기술들이 연구되었다. 이러한 기법들은 영상의 잡음을 제거하고 디테일은 보존하는 필터를 만들어 영상에 적용시키기도 하고 최적화 기법을 통해 잡음을 제거하기도 한다. Various technologies have been researched for decades to remove noise in images. These techniques remove noise from the image and apply it to the image by creating a filter that preserves detail, and sometimes remove the noise through optimization techniques.

최근에는 합성곱 신경망(Convolutional Neural Network, CNN)을 이용하여 영상의 잡음 제거 성능을 향상시키고 있다. DnCNN(DeNoise CNN)은 CNN 기반 영상 잡음 네트워크 중 가장 대표적이다. Recently, a convolutional neural network (CNN) is used to improve image noise removal performance. DnCNN (DeNoise CNN) is the most representative of CNN-based image noise networks.

DnCNN은 배치 정규화(Batch Normalization)과 잔차 학습(residual learning)을 활용한다. 잔차 학습을 통해 네트워크는 잡음이 섞여있는 영상에서 깨끗한 영상을 뺀 차이를 학습한다. 이 과정을 통해 네트워크는 잡음이 섞여있는 영상에서 영상의 내용물들을 지우고 잡음 성분만 남긴다. DnCNN은 좋은 성능을 보이기 때문에 널리 활용되고 있는 기법이다.DnCNN utilizes batch normalization and residual learning. Through residual learning, the network learns the difference between the noisy image and the clean image. Through this process, the network erases the contents of the image from the noisy image and leaves only the noise component. DnCNN is a widely used technique because it shows good performance.

한편, DnCNN은 단일 스케일의 특징 맵을 사용해 영상 내 잡음을 제거한다. 그러나, 영상에는 다양한 스케일의 내용들이 있기 때문에 단일 스케일의 특징 맵은 이런 영상의 특징을 모두 담아내기 어렵다는 한계가 있다.Meanwhile, DnCNN uses a single-scale feature map to remove noise in the image. However, since there are contents of various scales in an image, a feature map of a single scale has a limitation in that it is difficult to capture all the features of such an image.

US 9,984,325 B1US 9,984,325 B1 JP 6103243 B2JP 6103243 B2 KR 10-1938945 B1KR 10-1938945 B1

K.Zhang,W.Zuo,Y.Chen,D.Meng,andL.Zhang,,"Beyond a Gaussian denoiser: Residual learning of deep CNN for image denoising," IEEE Trans. on Image Process., vol. 26, no. 7, pp. 3142-3155, July 2017 K.Zhang,W.Zuo,Y.Chen,D.Meng,andL.Zhang,,"Beyond a Gaussian denoiser: Residual learning of deep CNN for image denoising," IEEE Trans. on Image Process., vol. 26, no. 7, pp. 3142-3155, July 2017 K. He, X. Zhang, S. Ren, and J. Sun, "Spatial pyramid pooling in deep convolutional network for visual recognition," in Proc. Eur. Conf. Comput. Vis., 2014, pp. 346-361 K. He, X. Zhang, S. Ren, and J. Sun, "Spatial pyramid pooling in deep convolutional network for visual recognition," in Proc. Eur. Conf. Comput. Vis., 2014, pp. 346-361 S.-W. Kim, H.-K Kook, J.-Y. Sun, M.-C. Kang, and S.-Jea Ko, "Parallel feature pyramid network for object detection," in Proc. Eur. Conf. Comput. Vis., 2018, pp. 234-350 S.-W. Kim, H.-K Kook, J.-Y. Sun, M.-C. Kang, and S.-Jea Ko, "Parallel feature pyramid network for object detection," in Proc. Eur. Conf. Comput. Vis., 2018, pp. 234-350

이에, 본 발명의 기술적 과제는 이러한 점에서 착안된 것으로 본 발명의 목적은 영상 내 다양한 스케일을 반영하기 위해 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 방법을 제공하는 것이다.Accordingly, the technical problem of the present invention is conceived in this respect, and an object of the present invention is to provide a method for removing noise in a network image using a parallel feature pyramid to reflect various scales in the image.

본 발명의 다른 목적은 상기 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 방법을 수행하기 위한 컴퓨터 프로그램이 기록된 기록 매체를 제공하는 것이다.Another object of the present invention is to provide a recording medium in which a computer program for performing a method for removing noise in a network image using the parallel feature pyramid is recorded.

본 발명의 또 다른 목적은 상기 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 방법을 수행하기 위한 장치를 제공하는 것이다.Another object of the present invention is to provide an apparatus for performing a method for removing noise in a network image using the parallel feature pyramid.

상기한 본 발명의 목적을 실현하기 위한 일 실시예에 따른 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 방법은, 입력된 영상에 대해 미리 설정된 수만큼의 합성곱(convolution) 연산을 수행하여 특징 맵을 생성하는 단계; 피라미드(Spatial pyramid pooling) 모듈을 통해 상기 특징 맵을 서로 다른 스케일을 갖도록 다운 사이징하여 n층의 피라미드 특징 맵들을 생성하는 단계; n개의 MSCA(multi-scale context aggregation) 모듈을 통해 서로 다른 스케일을 갖는 n층의 피라미드 특징 맵들을 각 해당 층의 고유 스케일로 변환하는 단계; 및 상기 각 MSCA 모듈로부터 출력되는 특징 맵들의 정보를 통합하여 하나의 영상 데이터로 출력하는 단계;를 포함한다.A method for removing noise in a network image using a parallel feature pyramid according to an embodiment for realizing the object of the present invention is to generate a feature map by performing a predetermined number of convolution operations on an input image. Generating; Generating n-layer pyramid feature maps by downsizing the feature maps to have different scales through a pyramid (Spatial pyramid pooling) module; converting the pyramid feature maps of n layers having different scales into a unique scale of each corresponding layer through n multi-scale context aggregation (MSCA) modules; And integrating information on the feature maps output from each of the MSCA modules and outputting one image data.

본 발명의 실시예에서, 상기 n층의 피라미드 특징 맵들을 각 해당 층의 고유 스케일로 변환하는 단계는, 각 MSCA 모듈에서 서로 다른 스케일을 갖는 n층의 피라미드 특징 맵들을 각각 합성곱 연산하는 단계; 상기 합성곱 연산된 각 특징 맵들을 각 해당 층의 고유 스케일로 변환하는 단계; 상기 고유 스케일로 변환된 특징 맵들의 접합(concatenation) 연산을 수행하는 단계; 및 상기 접합 연산된 특징 맵을 합성곱 연산하여 정제하는 단계;를 포함할 수 있다.In an embodiment of the present invention, the converting of the n-layer pyramid feature maps to a unique scale of each corresponding layer may include performing convolution calculations of the n-layer pyramid feature maps having different scales in each MSCA module; Converting each of the convolutional feature maps into a unique scale of each corresponding layer; Performing a concatenation operation of the feature maps converted to the intrinsic scale; And performing a convolution operation and refinement of the convolutional feature map.

본 발명의 실시예에서, 상기 각 MSCA 모듈로부터 출력되는 특징 맵들의 정보를 통합하여 하나의 영상 데이터로 출력하는 단계는, 상기 각 MSCA 모듈로부터 출력되는 특징 맵을 기준 스케일로 변환하는 단계; 상기 기준 스케일로 변환된 특징 맵들의 접합(concatenation) 연산을 수행하는 단계; 상기 접합 연산된 특징 맵을 미리 설정된 수만큼 합성곱 연산하는 단계; 및 상기 합성곱 연산된 특징 맵을 가산 연산하여 출력하는 단계;를 포함할 수 있다.In an embodiment of the present invention, the step of integrating information of feature maps output from each MSCA module and outputting the information as one image data includes: converting feature maps output from each MSCA module to a reference scale; Performing a concatenation operation of feature maps converted to the reference scale; Performing a convolution operation on the joint-operated feature map by a preset number; And adding and outputting the convolutional feature map.

본 발명의 실시예에서, 상기 n층의 피라미드 특징 맵들을 각 해당 층의 고유 스케일로 변환하는 단계는, 상기 입력된 영상의 크기가

인 경우,

(여기서, n은 피라미드의 층 수이며, 1층은 n=0으로 정의됨)의 크기를 갖는 특징 맵들을 생성할 수 있다.In an embodiment of the present invention, the step of converting the n-layer pyramid feature maps to a unique scale of each corresponding layer includes a size of the input image

If is,

(Where n is the number of layers of the pyramid, and the first layer is defined as n=0), feature maps having a size can be generated.

본 발명의 실시예에서, 상기 n층의 피라미드 특징 맵들을 생성하는 단계는, 상기 입력된 영상과 동일한 스케일을 갖는 제1 피라미드 특징 맵을 생성하는 단계; 상기 제1 피라미드 특징 맵을 다운 사이징하여 상기 입력된 영상의 1/2 스케일을 갖는 제2 피라미드 특징 맵을 생성하는 단계; 및 상기 제2 피라미드 특징 맵을 다운 사이징하여 상기 입력된 영상의 1/4 스케일을 갖는 제3 피라미드 특징 맵을 생성하는 단계;를 포함할 수 있다.In an embodiment of the present invention, the generating of the n-layer pyramid feature maps includes: generating a first pyramid feature map having the same scale as the input image; Downsizing the first pyramid feature map to generate a second pyramid feature map having a 1/2 scale of the input image; And generating a third pyramidal feature map having a 1/4 scale of the input image by downsizing the second pyramidal feature map.

본 발명의 실시예에서, 상기 n층의 피라미드 특징 맵들을 각 해당 층의 고유 스케일로 변환하는 단계는, 상기 제1 피라미드 특징 맵의 합성곱 연산 후 다운 사이징하는 단계; 상기 제2 피라미드 특징 맵의 합성곱 연산을 수행하는 단계; 및 상기 제3 피라미드 특징 맵의 합성곱 연산 후 업 사이징하는 단계;를 포함할 수 있다.In an embodiment of the present invention, the converting of the n-layer pyramid feature maps to a unique scale of each corresponding layer includes: downsizing after a convolution operation of the first pyramid feature map; Performing a convolution operation of the second pyramidal feature map; And upsizing after a convolution operation of the third pyramid feature map.

본 발명의 실시예에서, 상기 n층의 피라미드 특징 맵들을 각 해당 층의 고유 스케일로 변환하는 단계는, 다운 사이징된 제1 피라미드 특징 맵, 제2 피라미드 특징 맵 및 업 사이징된 제3 피라미드 특징 맵의 접합(concatenation) 연산을 수행하는 단계; 및 상기 접합 연산된 특징 맵을 합성곱 연산하여 정제하는 단계;를 포함할 수 있다.In an embodiment of the present invention, the step of converting the n-layer pyramid feature maps to a unique scale of each corresponding layer comprises: a downsized first pyramid feature map, a second pyramid feature map, and an upsized third pyramid feature map. Performing a concatenation operation of; And performing a convolution operation and refinement of the convolutional feature map.

본 발명의 실시예에서, 상기 각 MSCA 모듈로부터 출력되는 특징 맵들의 정보를 통합하여 하나의 영상 데이터로 출력하는 단계는, 상기 각 MSCA 모듈로부터 출력되는 특징 맵을 필요에 따라 업 사이징하는 단계; 상기 업 사이징된 특징 맵들의 접합(concatenation) 연산을 수행하는 단계; 상기 접합 연산된 특징 맵을 미리 설정된 수만큼 합성곱 연산하는 단계; 및 상기 합성곱 연산된 특징 맵을 가산 연산하여 출력하는 단계;를 포함할 수 있다.In an embodiment of the present invention, the step of integrating information of feature maps outputted from each MSCA module and outputting it as one image data may include: upsizing feature maps output from each MSCA module as necessary; Performing a concatenation operation of the upsized feature maps; Performing a convolution operation on the joint-operated feature map by a preset number; And adding and outputting the convolutional feature map.

상기한 본 발명의 다른 목적을 실현하기 위한 일 실시예에 따른 컴퓨터로 판독 가능한 저장 매체에는, 상기 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 방법을 수행하기 위한 컴퓨터 프로그램이 기록되어 있다. In a computer-readable storage medium according to an embodiment for realizing another object of the present invention, a computer program for performing a method of removing noise in a network image using the parallel feature pyramid is recorded.

상기한 본 발명의 또 다른 목적을 실현하기 위한 일 실시예에 따른 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 장치는, 입력된 영상에 대해 미리 설정된 수만큼의 합성곱(convolution) 연산을 수행하여 특징 맵을 생성하는 특징 맵 생성부; 상기 특징 맵을 서로 다른 스케일을 갖도록 다운 사이징하여 n층의 피라미드 특징 맵들을 생성하는 피라미드(Spatial pyramid pooling) 모듈; 서로 다른 스케일을 갖는 n층의 피라미드 특징 맵들을 각 해당 층의 고유 스케일로 변환하는 n개의 MSCA(multi-scale context aggregation) 모듈; 및 상기 각 MSCA 모듈로부터 출력되는 특징 맵들의 정보를 통합하여 하나의 영상 데이터로 출력하는 융합부;를 포함한다.In an embodiment for realizing another object of the present invention, an apparatus for removing noise in a network image using a parallel feature pyramid is characterized by performing a predetermined number of convolution operations on an input image. A feature map generator for generating a map; A pyramid (Spatial pyramid pooling) module for generating n-layer pyramid feature maps by downsizing the feature maps to have different scales; N multi-scale context aggregation (MSCA) modules for converting pyramid feature maps of n layers having different scales into unique scales of each corresponding layer; And a fusion unit for integrating information of feature maps output from each MSCA module and outputting one image data.

본 발명의 실시예에서, 상기 n개의 MSCA의 각 MSCA 모듈은, 서로 다른 스케일을 갖는 n층의 피라미드 특징 맵들을 각각 합성곱 연산하고, 상기 합성곱 연산된 각 특징 맵들을 다운 사이징 또는 업 사이징하여 각 해당 층의 고유 스케일로 변환할 수 있다.In an embodiment of the present invention, each MSCA module of the n MSCAs performs convolutional operation on each of the n-layer pyramid feature maps having different scales, and downsizing or upsizing each of the convolutional feature maps. Each layer can be converted to its own scale.

본 발명의 실시예에서, 상기 n개의 MSCA의 각 MSCA 모듈은, 상기 각 해당 층의 고유 스케일로 변환된 특징 맵들을 접합(concatenation) 연산 및 합성곱 연산하여 정제할 수 있다.In an embodiment of the present invention, each MSCA module of the n MSCAs may refine the feature maps converted to the intrinsic scales of the respective layers by concatenation and convolution operations.

본 발명의 실시예에서, 상기 융합부는, 상기 각 MSCA 모듈로부터 출력되는 특징 맵을 업 사이징하여 기준 스케일로 변환하고, 특징 맵들을 접합(concatenation) 연산 및 합성곱 연산한 후 가산 연산하여 출력할 수 있다.In an embodiment of the present invention, the fusion unit may upsize feature maps output from each of the MSCA modules to convert them to a reference scale, perform concatenation and convolution operations, and then add and output the feature maps. have.

이와 같은 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 방법에 따르면, 영상 내에 다양한 스케일을 고려하여 잡음 제거 성능이 높을 뿐만 아니라 불필요한 아티팩트들을 감소시켜 우수한 품질의 영상을 제공할 수 있다.According to such a method for removing noise in a network image using a parallel feature pyramid, not only has high noise removal performance in consideration of various scales in the image, but also reduces unnecessary artifacts to provide an image of excellent quality.

도 1은 본 발명의 일 실시예에 따른 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 장치의 블록도이다.
도 2는 일반적인 DnCNN의 네트워크 구조를 보여주는 개념도이다.
도 3은 도 1의 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 장치의 구체적인 구성도이다.
도 4는 도 3의 MSCA 모듈의 구체적인 구성도이다.
도 5는 본 발명의 일 실시예에 따른 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 방법의 흐름도이다.
도 6은 테스트 영상에 대해 기존 기법과 본 발명에서 제안하는 방법의 잡음 제거 예시를 보여주는 도면이다.
도 7은 기존 기법과 본 발명에서 제안하는 방법의 영상 내 잡음 제거 성능을 비교하는 표이다.1 is a block diagram of an apparatus for removing noise in a network image using a parallel feature pyramid according to an embodiment of the present invention.
2 is a conceptual diagram showing a general network structure of DnCNN.
3 is a detailed configuration diagram of an apparatus for removing noise in a network image using a parallel feature pyramid of FIG. 1.
4 is a detailed configuration diagram of the MSCA module of FIG. 3.
5 is a flowchart of a method for removing noise in a network image using a parallel feature pyramid according to an embodiment of the present invention.
6 is a diagram showing an example of noise reduction of a conventional technique and a method proposed by the present invention for a test image.
7 is a table comparing the noise reduction performance in an image between the conventional technique and the method proposed by the present invention.

후술하는 본 발명에 대한 상세한 설명은, 본 발명이 실시될 수 있는 특정 실시예를 예시로서 도시하는 첨부 도면을 참조한다. 이들 실시예는 당업자가 본 발명을 실시할 수 있기에 충분하도록 상세히 설명된다. 본 발명의 다양한 실시예는 서로 다르지만 상호 배타적일 필요는 없음이 이해되어야 한다. 예를 들어, 여기에 기재되어 있는 특정 형상, 구조 및 특성은 일 실시예에 관련하여 본 발명의 정신 및 범위를 벗어나지 않으면서 다른 실시예로 구현될 수 있다. 또한, 각각의 개시된 실시예 내의 개별 구성요소의 위치 또는 배치는 본 발명의 정신 및 범위를 벗어나지 않으면서 변경될 수 있음이 이해되어야 한다. 따라서, 후술하는 상세한 설명은 한정적인 의미로서 취하려는 것이 아니며, 본 발명의 범위는, 적절하게 설명된다면, 그 청구항들이 주장하는 것과 균등한 모든 범위와 더불어 첨부된 청구항에 의해서만 한정된다. 도면에서 유사한 참조부호는 여러 측면에 걸쳐서 동일하거나 유사한 기능을 지칭한다.DETAILED DESCRIPTION OF THE INVENTION The detailed description of the present invention to be described later refers to the accompanying drawings, which illustrate specific embodiments in which the present invention may be practiced. These embodiments are described in detail sufficient to enable a person skilled in the art to practice the present invention. It is to be understood that the various embodiments of the present invention are different from each other, but need not be mutually exclusive. For example, specific shapes, structures, and characteristics described herein may be implemented in other embodiments without departing from the spirit and scope of the present invention in relation to one embodiment. In addition, it is to be understood that the location or arrangement of individual components within each disclosed embodiment may be changed without departing from the spirit and scope of the present invention. Accordingly, the detailed description to be described below is not intended to be taken in a limiting sense, and the scope of the present invention, if properly described, is limited only by the appended claims, along with all scopes equivalent to those claimed by the claims. Like reference numerals in the drawings refer to the same or similar functions over several aspects.

이하, 도면들을 참조하여 본 발명의 바람직한 실시예들을 보다 상세하게 설명하기로 한다. Hereinafter, preferred embodiments of the present invention will be described in more detail with reference to the drawings.

도 1은 본 발명의 일 실시예에 따른 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 장치의 블록도이다.1 is a block diagram of an apparatus for removing noise in a network image using a parallel feature pyramid according to an embodiment of the present invention.

본 발명에 따른 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 장치(10, 이하 장치)는 영상 내 서로 다른 스케일의 정보를 고려한 DnCNN(DeNoise CNN) 기반의 학습기일 수 있다.The apparatus 10 for removing noise in a network image using a parallel feature pyramid according to the present invention may be a learner based on a DnCNN (DeNoise CNN) that considers information of different scales in the image.

도 1을 참조하면, 본 발명에 따른 장치(10)는 특징 맵 생성부(100), 피라미드(Spatial pyramid pooling, 이하 SPP) 모듈(300), n개의 MSCA(multi-scale context aggregation, 이하 MSCA) 모듈(500) 및 융합부(700)를 포함한다.Referring to FIG. 1, an apparatus 10 according to the present invention includes a feature map generator 100, a spatial pyramid pooling (SPP) module 300, and n multi-scale context aggregation (MSCA) It includes a module 500 and a fusion unit 700.

본 발명의 상기 장치(10)는 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거를 수행하기 위한 소프트웨어(애플리케이션)가 설치되어 실행될 수 있으며, 상기 특징 맵 생성부(100), 상기 SPP 모듈(300), 상기 n개의 MSCA 모듈(500) 및 상기 융합부(700)의 구성은 상기 장치(10)에서 실행되는 상기 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거를 수행하기 위한 소프트웨어에 의해 제어될 수 있다. The device 10 of the present invention may be executed by installing software (application) for performing noise reduction in a network image using a parallel feature pyramid, and the feature map generator 100, the SPP module 300, The configurations of the n MSCA modules 500 and the fusion unit 700 may be controlled by software for performing noise reduction in a network image using the parallel feature pyramid executed in the device 10.

상기 장치(10)는 별도의 단말이거나 또는 단말의 일부 모듈일 수 있다. 또한, 상기 특징 맵 생성부(100), 상기 SPP 모듈(300), 상기 n개의 MSCA 모듈(500) 및 상기 융합부(700)의 구성은 통합 모듈로 형성되거나, 하나 이상의 모듈로 이루어 질 수 있다. 그러나, 이와 반대로 각 구성은 별도의 모듈로 이루어질 수도 있다.The device 10 may be a separate terminal or some modules of the terminal. In addition, the configuration of the feature map generation unit 100, the SPP module 300, the n MSCA modules 500, and the fusion unit 700 may be formed as an integrated module or one or more modules. . However, on the contrary, each component may be formed as a separate module.

상기 장치(10)는 이동성을 갖거나 고정될 수 있다. 상기 장치(10)는, 서버(server) 또는 엔진(engine) 형태일 수 있으며, 디바이스(device), 기구(apparatus), 단말(terminal), UE(user equipment), MS(mobile station), 무선기기(wireless device), 휴대기기(handheld device) 등 다른 용어로 불릴 수 있다. The device 10 may be mobile or stationary. The device 10 may be in the form of a server or an engine, and may be a device, an apparatus, a terminal, a user equipment (UE), a mobile station (MS), and a wireless device. (wireless device), portable device (handheld device), etc. can be called by other terms.

상기 장치(10)는 운영체제(Operation System; OS), 즉 시스템을 기반으로 다양한 소프트웨어를 실행하거나 제작할 수 있다. 상기 운영체제는 소프트웨어가 장치의 하드웨어를 사용할 수 있도록 하기 위한 시스템 프로그램으로서, 안드로이드 OS, iOS, 윈도우 모바일 OS, 바다 OS, 심비안 OS, 블랙베리 OS 등 모바일 컴퓨터 운영체제 및 윈도우 계열, 리눅스 계열, 유닉스 계열, MAC, AIX, HP-UX 등 컴퓨터 운영체제를 모두 포함할 수 있다.The device 10 may execute or manufacture various software based on an operating system (OS), that is, a system. The operating system is a system program for allowing software to use the hardware of the device, and mobile computer operating systems such as Android OS, iOS, Windows Mobile OS, Bada OS, Symbian OS, Blackberry OS, Windows series, Linux series, Unix series, It can include all computer operating systems such as MAC, AIX, and HP-UX.

최근 영상 잡음 제거 분야에서 연구된 DnCNN은 CNN을 활용한 영상 내 노이즈 제거 네트워크로 좋은 성능을 보인다. DnCNN은 단일 스케일의 특징 맵을 사용해 영상 내 잡음을 제거한다. 그러나, 영상에는 다양한 스케일의 내용들이 있기 때문에 단일 스케일의 특징 맵은 이런 영상의 특징을 모두 담아내기 어렵다. 따라서, 본 발명에서는 영상 내 다양한 스케일의 내용들을 담아내는 것에 주안점을 둔다.DnCNN, recently studied in the field of image noise reduction, shows good performance as an intra-image noise reduction network using CNN. DnCNN uses a single-scale feature map to remove noise in the image. However, since there are contents of various scales in an image, a feature map of a single scale is difficult to capture all the features of such an image. Therefore, the present invention focuses on containing contents of various scales in an image.

도 2는 기존의 네트워크인 DnCNN의 구조이다. 기존의 네트워크는 입력 영상이 주어 졌을 때, 합성곱 연산을 통해 영상과 같은 크기의 특징 맵들을 만들어 내고 각각의 특징 맵 들은 잡음이 포함된 영상에서 배경을 지워 잡음 성분만 추출한다.2 is a structure of an existing network, DnCNN. When an input image is given, the existing network generates feature maps of the same size as the image through a convolution operation, and each feature maps extracts only the noise component by erasing the background from the image containing noise.

그러나, 영상에는 다양한 크기의 컨텐츠 성분들이 존재하기 때문에 이렇게 단일 스케일로는 잡음이 포함된 영상에서 배경 성분만 지워 내기는 쉽지 않다.However, since content components of various sizes exist in an image, it is not easy to erase only the background component from an image containing noise with this single scale.

이에 본 발명은 네트워크 구조가 영상 내 서로 다른 스케일의 정보를 포함하는 것에 주안점을 둔다. 이하에서는 본 발명의 각 구성에 대해 자세히 설명한다.Accordingly, the present invention focuses on the network structure including information of different scales in an image. Hereinafter, each configuration of the present invention will be described in detail.

상기 특징 맵 생성부(100)는 입력된 영상에 대해 미리 설정된 수만큼의 합성곱(convolution) 연산을 수행하여 특징 맵을 생성한다. The feature map generator 100 generates a feature map by performing a convolution operation as many as a preset number on the input image.

상기 SPP 모듈(300)은 상기 특징 맵을 서로 다른 스케일을 갖도록 다운 사이징하여 n층의 피라미드 특징 맵들을 생성하고, 상기 n개의 MSCA 모듈(500)은 서로 다른 스케일을 갖는 n층의 피라미드 특징 맵들을 각 해당 층의 고유 스케일로 변환한다.The SPP module 300 downsizes the feature maps to have different scales to generate n-layer pyramid feature maps, and the n MSCA modules 500 generate n-layer pyramid feature maps having different scales. Convert to a unique scale for each layer.

상기 융합부(700)는 상기 각 MSCA 모듈로부터 출력되는 특징 맵들의 정보를 통합하여 하나의 영상 데이터로 출력한다.The fusion unit 700 integrates information of feature maps output from each MSCA module and outputs one image data.

상기 SPP 모듈(300)은 SPP는 기존 객체 검출 분야에서 사용되는 기법으로 본 발명의 상기 SPP 모듈(300)은 기존의 특징 맵을 여러 크기로 리사이징한다. 본 발명은 상기 SPP 모듈(300)을 활용하여 앞서 만든 특징 맵을 다른 크기로 다운사이즈를 수행한다. The SPP module 300 is a technique used in the field of existing object detection, and the SPP module 300 of the present invention resizes the existing feature map to various sizes. In the present invention, the previously created feature map is downsized to a different size using the SPP module 300.

예를 들어, 처음 입력 영상의 크기를

라고 하면, 상기 SPP 모듈(300)의 결과물은

의 크기를 갖는다. 여기서, n은 피라미드의 층 수에 해당하며, 피라미드의 1층은 n=0으로, 피라미드의 2층은 n=1으로 피라미드의 3층은 n=2으로 정의된다. 이렇게 만들어진 각각의 특징 맵들은 영상 내에 서로 다른 스케일의 정보를 포함하게 된다. For example, the size of the first input image

If so, the result of the SPP module 300 is

Has the size of. Here, n corresponds to the number of layers of the pyramid, the first layer of the pyramid is defined as n=0, the second layer of the pyramid is n=1, and the third layer of the pyramid is defined as n=2. Each of the feature maps created in this way includes information of a different scale in the image.

각각의 스케일 정보를 포함할 때, 영상의 문맥 정보가 도움이 되는 경우가 많다. 따라서, 본 발명에서는 상기 n개의 MSCA 모듈(500)을 만들어 각각의 피라미드 층에 적용시켜 서로 다른 층에 있는 다른 스케일 정보를 융합하여 그 층에 맞는 스케일 정보로 변환한다. When including each scale information, the context information of the image is often helpful. Accordingly, in the present invention, the n MSCA modules 500 are created and applied to each pyramid layer, and different scale information in different layers is fused and converted into scale information suitable for the layer.

상기 n개의 MSCA 모듈(500)의 입력은 서로 다른 피라미드 층의 특징 맵이기 때문에 그 크기가 다르다. 따라서, 상기 n개의 MSCA 모듈(500)에서는 먼저 그 층에 맞게 특징 맵의 크기를 맞춰주고 합성곱 연산으로 정제해준다. Since the inputs of the n MSCA modules 500 are feature maps of different pyramid layers, their sizes are different. Accordingly, in the n MSCA modules 500, the size of the feature map is first adjusted to fit the layer and refined by a convolution operation.

이후 합성곱 연산을 통해 몇 번의 추상화 정도를 조금 더 높이고, 최종적으로 하나의 결과를 만들어 내기 위해 결과물들을 하나로 통합한다. 이러한 구조는 기존의 구조보다 다른 스케일 정보가 많은 영상에서 더 강인한 결과를 얻을 수 있다.After that, the degree of abstraction a few times is slightly higher through the convolution operation, and the results are combined into one to finally produce a single result. This structure can obtain a more robust result in an image having a lot of different scale information than the conventional structure.

도 3은 도 1의 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 장치의 구체적인 구성도이다. 도 4는 도 3의 MSCA 모듈의 구체적인 구성도이다.3 is a detailed configuration diagram of an apparatus for removing noise in a network image using a parallel feature pyramid of FIG. 1. 4 is a detailed configuration diagram of the MSCA module of FIG. 3.

도 3을 참조하면, 상기 특징 맵 생성부(100)는 합성곱 연산(110) 블록으로 구성되어 몇 번의 합성곱 연산을 통해 피라미드를 만들기 위한 특징 맵을 만들어 낸다. Referring to FIG. 3, the feature map generator 100 is composed of a convolution operation 110 block and generates a feature map for creating a pyramid through several convolution operations.

상기 SPP(300)은 상기 특징 맵을 서로 다른 스케일을 갖도록 리사이징하여 n층의 피라미드 특징 맵들을 생성한다. 처음 입력 영상의 크기를

라고 하면, 상기 SPP 모듈(300)은 특징 맵을 다른 사이즈로 다운 사이징하여

의 크기를 갖는 결과물을 출력한다.The SPP 300 generates n-layer pyramid feature maps by resizing the feature maps to have different scales. The size of the first input image

If so, the SPP module 300 downsizes the feature map to a different size

Outputs the result with the size of.

예를 들어, 3개 층의 피라미드 특징 맵을 생성하는 경우, 상기 입력된 영상과 동일한 스케일을 갖는 제1 피라미드 특징 맵(f ₀ ), 상기 제1 피라미드 특징 맵(f ₀ )을 다운 사이징하여 상기 입력된 영상의 1/2 스케일을 갖는 제2 피라미드 특징 맵(f ₁ ), 상기 제2 피라미드 특징 맵(f ₁ )을 다운 사이징하여 상기 입력된 영상의 1/4 스케일을 갖는 제3 피라미드 특징 맵(f ₂ )을 생성할 수 있다.For example, when generating a three-layered pyramid feature map, the first pyramid feature map ( f ₀ ) and the first pyramid feature map ( f ₀ ) having the same scale as the input image are downsized to A second pyramid feature map ( f ₁ ) having a 1/2 scale of the input image, and a third pyramid feature map having a 1/4 scale of the input image by downsizing the second pyramid feature map ( f ₁ ). ( f ₂ ) can be created.

상기 n개의 MSCA 모듈(500)은 서로 다른 스케일을 갖는 n층의 피라미드 특징 맵들을 각 해당 층의 고유 스케일로 변환한다. 상기 n개의 MSCA 모듈(500)의 입력은 서로 다른 피라미드 층의 특징 맵이기 때문에 그 크기가 다르다. 따라서, 상기 n개의 MSCA 모듈(500)에서는 먼저 그 층에 맞게 특징 맵의 크기를 맞춰주고 합성곱 연산으로 정제해준다. The n MSCA modules 500 convert the pyramid feature maps of n layers having different scales into a unique scale of each corresponding layer. Since the inputs of the n MSCA modules 500 are feature maps of different pyramid layers, their sizes are different. Accordingly, in the n MSCA modules 500, the size of the feature map is first adjusted to fit the layer and refined by a convolution operation.

예를 들어, 상기 n개의 MSCA 모듈(500)은 제1 MSCA 모듈(510), 제2 MSCA 모듈(530) 및 제3 MSCA 모듈(550)의 3개를 가질 수 있고, 각 MSCA 모듈은 상기 제1 피라미드 특징 맵(f ₀ ), 상기 제2 피라미드 특징 맵(f ₁ ) 및 상기 제3 피라미드 특징 맵(f ₂ )을 합성곱 연산한 후 스케일을 변경한다.For example, the n MSCA modules 500 may have three of a first MSCA module 510, a second MSCA module 530, and a third MSCA module 550, and each MSCA module The scale is changed after a convolution operation is performed on the first pyramidal feature map ( f ₀ ), the second pyramidal feature map ( f ₁ ), and the third pyramidal feature map ( f ₂ ).

구체적으로, 도 4를 참조하면, 상기 제2 MSCA 모듈(530)은 상기 제1 피라미드 특징 맵(f ₀ ), 상기 제2 피라미드 특징 맵(f ₁ ) 및 상기 제3 피라미드 특징 맵(f ₂ )을 각각 제1 내지 제3 합성곱 연산 블록(501, 503, 505)을 통해 연산곱을 수행한다. Specifically, referring to FIG. 4, the second MSCA module 530 includes the first pyramid feature map ( f ₀ ), the second pyramid feature map ( f ₁ ), and the third pyramid feature map ( f ₂ ). Operation is performed through the first to third convolution operation blocks 501, 503, and 505, respectively.

이후, 상기 제1 합성곱 연산 블록(501)의 결과물을 다운 사이징 블록(502)을 통해 다운 사이징하고, 상기 제3 합성곱 연산 블록(505)의 결과물을 업 사이징 블록(506)을 통해 업 사이징하여, 해당 층에 맞는 상기 제2 피라미드 특징 맵(f ₁ )의 스케일 정보로 바꿔준다. 이 경우, 상기 제2 합성곱 연산 블록(503)의 결과물은 다운 사이징 또는 업 사이징을 할 필요가 없다.Thereafter, the result of the first convolution operation block 501 is downsized through the downsizing block 502, and the result of the third convolution operation block 505 is upsized through the upsizing block 506 Thus, the scale information of the second pyramid feature map f ₁ suitable for the corresponding layer is changed. In this case, the result of the second convolution operation block 503 does not need to be downsized or upsized.

또한, 상기 다운 사이징 블록(502)의 결과물, 상기 제2 합성곱 연산 블록(503)의 결과물 및 상기 업 사이징 블록(506)의 결과물을 접합(concatenation) 연산 블록(507)을 통해 접합한 후 합성곱 연산 블록(509)을 통해 정제하여 출력한다.In addition, the result of the down sizing block 502, the result of the second convolution operation block 503, and the result of the up sizing block 506 are joined through a concatenation operation block 507 and then synthesized. It is refined and output through the product operation block 509.

그 결과, 상기 제2 MSCA 모듈(530)은 대응하는 층의 스케일을 가진 특허 맵(g ₁ )을 출력하고, 상기 제1 MSCA 모듈(510) 및 제3 MSCA 모듈(550)도 상기 제2 MSCA 모듈(530)과 동일한 구성을 포함하여, 상기 제1 피라미드 특징 맵(f ₀ ), 상기 제2 피라미드 특징 맵(f ₁ ) 및 상기 제3 피라미드 특징 맵(f ₂ )의 스케일 변환을 대응하는 층의 스케일로 변환하여, 각각 특징 맵(g ₀ ) 및 특징 맵(g ₂ )을 출력한다.As a result, the second MSCA module 530 outputs a patent map ( g ₁ ) having a scale of a corresponding layer, and the first MSCA module 510 and the third MSCA module 550 are also A layer corresponding to the scale transformation of the first pyramid feature map ( f ₀ ), the second pyramid feature map ( f ₁ ), and the third pyramid feature map ( f ₂ ), including the same configuration as the module 530 By converting to the scale of, a feature map ( g ₀ ) and a feature map ( g ₂ ) are output, respectively.

상기 융합부(700)는 합성곱 연산을 통해 몇 번의 추상화 정도를 조금 더 높이고, 최종적으로 하나의 결과를 만들어 내기 위해 결과물들을 하나로 통합한다. 이러한 구조는 기존의 구조보다 다른 스케일 정보가 많은 영상에서 더 강인한 결과를 얻을 수 있다.The fusion unit 700 slightly increases the degree of abstraction several times through a convolution operation, and finally integrates the results into one in order to produce a single result. This structure can obtain a more robust result in an image having a lot of different scale information than the conventional structure.

구체적으로, 상기 융합부(700)는 상기 제1 MSCA 모듈(510), 상기 제2 MSCA 모듈(530) 및 상기 제3 MSCA 모듈(550)로부터 각각 출력되는 특징 맵(f _0, g _1, g ₂ )을 필요에 따라 업 사이징(711, 712)하고, 접합(concatenation) 연산(703)을 수행한다.Specifically, the fusion unit 700 is a feature map ( f _0, g _1, g ) output from the first MSCA module 510, the second MSCA module 530, and the third MSCA module 550, respectively. ₂ ) is upsized as necessary (711 and 712), and a concatenation operation 703 is performed.

이후, 상기 접합 연산된 특징 맵을 미리 설정된 수만큼 합성곱 연산(704)하고, 상기 합성곱 연산된 특징 맵을 가산 연산(705)하여 최종 영상을 출력한다.Thereafter, a convolution operation 704 is performed on the conjugated feature maps by a preset number, and an addition operation 705 is performed on the convolutional feature maps to output a final image.

이에 따라, 본 발명은 영상 내 서로 다른 스케일의 정보를 포함하도록 하여, 잡음 제거 성능이 높을 뿐만 아니라 불필요한 아티팩트들을 감소시켜 우수한 품질의 영상을 제공할 수 있다.Accordingly, according to the present invention, by including information of different scales in an image, noise removal performance is high, and unnecessary artifacts are reduced to provide an image of excellent quality.

도 5는 본 발명의 일 실시예에 따른 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 방법의 흐름도이다.5 is a flowchart of a method for removing noise in a network image using a parallel feature pyramid according to an embodiment of the present invention.

본 실시예에 따른 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 방법은, 도 3의 장치(10)와 실질적으로 동일한 구성에서 진행될 수 있다. 따라서, 도 3의 장치(10)와 동일한 구성요소는 동일한 도면부호를 부여하고, 반복되는 설명은 생략한다. The method for removing noise in a network image using a parallel feature pyramid according to the present embodiment may be performed in substantially the same configuration as the apparatus 10 of FIG. 3. Accordingly, the same components as those of the device 10 of FIG. 3 are given the same reference numerals, and repeated descriptions are omitted.

또한, 본 실시예에 따른 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 방법은 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거를 수행하기 위한 소프트웨어(애플리케이션)에 의해 실행될 수 있다.Further, the method for removing noise in a network image using a parallel feature pyramid according to the present embodiment may be executed by software (application) for performing noise removal in a network image using a parallel feature pyramid.

도 5를 참조하면, 본 실시예에 따른 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 방법은, 학습 시작을 위한 입력 영상이 주어지면, 입력된 영상에 대해 미리 설정된 수만큼의 합성곱(convolution) 연산을 수행하여 특징 맵을 생성한다(단계 S10). Referring to FIG. 5, in the method for removing noise in a network image using a parallel feature pyramid according to the present embodiment, when an input image for starting learning is given, a predetermined number of convolutions are calculated on the input image. To generate a feature map (step S10).

상기 특징 맵이 생성되면, 피라미드(Spatial pyramid pooling) 모듈을 통해 상기 특징 맵을 서로 다른 스케일을 갖도록 다운 사이징하여 n층의 피라미드 특징 맵들을 생성한다(단계 S20).When the feature map is generated, the feature map is downsized to have different scales through a pyramid (Spatial pyramid pooling) module to generate n-layer pyramid feature maps (step S20).

n개의 MSCA(multi-scale context aggregation) 모듈을 통해 서로 다른 스케일을 갖는 n층의 피라미드 특징 맵들을 각 해당 층의 고유 스케일로 변환한다(단계 S30).Pyramid feature maps of n layers having different scales are converted into unique scales of each corresponding layer through n multi-scale context aggregation (MSCA) modules (step S30).

그 결과, 상기 입력된 영상의 크기가

인 경우,

(여기서, n은 피라미드의 층 수이며, 1층은 n=0으로 정의됨)의 크기를 갖는 특징 맵들을 생성할 수 있다.As a result, the size of the input image

If is,

단계 S30은 각 MSCA 모듈에서 서로 다른 스케일을 갖는 n층의 피라미드 특징 맵들을 각각 합성곱 연산하는 단계, 상기 합성곱 연산된 각 특징 맵들을 각 해당 층의 고유 스케일로 변환하는 단계, 상기 고유 스케일로 변환된 특징 맵들의 접합(concatenation) 연산을 수행하는 단계 및 상기 접합 연산된 특징 맵을 합성곱 연산하여 정제하는 단계를 포함할 수 있다.In step S30, each MSCA module performs a convolution operation of each of the pyramid feature maps of n layers having different scales, converting the convolutional feature maps to a unique scale of each corresponding layer, and converting the convolutional feature maps to a unique scale of each corresponding layer. It may include performing a concatenation operation of the transformed feature maps, and refining the conjugated feature maps by performing a convolution operation.

상기 각 MSCA 모듈로부터 출력되는 특징 맵들의 정보를 통합하여 하나의 영상 데이터로 출력한다(단계 S40).The information on the feature maps output from each MSCA module is combined and output as one image data (step S40).

단계 S40은 상기 각 MSCA 모듈로부터 출력되는 특징 맵을 기준 스케일로 변환하는 단계, 상기 기준 스케일로 변환된 특징 맵들의 접합(concatenation) 연산을 수행하는 단계, 상기 접합 연산된 특징 맵을 미리 설정된 수만큼 합성곱 연산하는 단계 및 상기 합성곱 연산된 특징 맵을 가산 연산하여 출력하는 단계를 포함할 수 있다.Step S40 is a step of converting a feature map output from each of the MSCA modules to a reference scale, performing a concatenation operation of feature maps converted to the reference scale, and performing a concatenation operation of the feature maps by a preset number. It may include performing a convolution operation and performing an addition operation and outputting the convolutional feature map.

상기와 같은 과정으로 네크워크 학습을 완료하면, 이후 실시간 입력 영상에 대해 학습 결과를 이용하여 우수한 품질의 영상을 출력할 수 있다. 이하에서는, 본 발명의 성능을 평가하기 위한 실험 결과를 기술한다.When network learning is completed in the above process, an image of excellent quality may be output using the learning result for the real-time input image. In the following, the experimental results for evaluating the performance of the present invention will be described.

도 6은 테스트 영상에 대해 기존 기법과 본 발명에서 제안하는 방법의 잡음 제거 예시를 보여주는 도면이다. 도 6에서 (a) 잡음이 없는 영상, (b) 잡음이 있는 영상, (c) DnCNN의 잡음제거 결과 및 (d) 본 발명에서 제안하는 네트워크의 잡음제거 결과이다.6 is a diagram showing an example of noise reduction of a conventional technique and a method proposed by the present invention for a test image. In FIG. 6, (a) an image without noise, (b) an image with noise, (c) a noise reduction result of DnCNN, and (d) a noise reduction result of the network proposed by the present invention.

도 6을 참조하면, berkeley segmentation dataset(BSD)의 테스트 영상에 대해 기존 기법과 제안하는 기법의 잡음 제거 예시를 보여준다. 기존 기법인 DnCNN의 경우 확대된 영역과 같이 잡음 제거 이외에도 불필요한 아티팩트를 생성해 낸다. Referring to FIG. 6, an example of noise reduction of the existing technique and the proposed technique for a test image of the berkeley segmentation dataset (BSD) is shown. In the case of DnCNN, which is an existing technique, unnecessary artifacts are generated in addition to noise removal, such as an enlarged area.

반면, 본 발명에서 제안하는 방법의 결과물에서는 불필요한 아티팩트가 적은 것을 확인할 수 있다. 도 3의 PSNR 결과는 (b)의 경우 14.99dB, (c)의 경우 28.38, (d)의 경우 28.88로, 본 발명에서 제안하는 방법의 PSNR 또한 더 높은 것을 확인할 수 있다.On the other hand, in the result of the method proposed in the present invention, it can be confirmed that there are few unnecessary artifacts. The PSNR result of FIG. 3 is 14.99dB in the case of (b), 28.38 in the case of (c), and 28.88 in the case of (d), and it can be seen that the PSNR of the method proposed in the present invention is also higher.

도 7은 기존 기법과 본 발명에서 제안하는 방법의 영상 내 잡음 제거 성능을 비교하는 표이다.7 is a table comparing the noise reduction performance in an image between the conventional technique and the method proposed by the present invention.

실험을 위해 영상 내 다양한 정도의 잡음을 추가하였고, 모든 정도의 잡음에서 본 발명에서 제안하는 방법의 성능이 가장 좋았다. 성능 평가는 영상 내 잡음 제거 분야에서 공인하게 사용하는 PSNR을 통해 측정했으며, 데이터 셋을 비롯한 기존 검출 분야들에서 데이터 셋으로 많이 사용하는 BSD 68 셋을 통해 성능을 검증했다.Various levels of noise were added in the image for the experiment, and the performance of the method proposed by the present invention was the best at all levels of noise. The performance evaluation was measured through the PSNR, which is officially used in the field of noise reduction in the image, and the performance was verified through the BSD 68 set, which is widely used as a data set in existing detection fields including data sets.

이와 같은, 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 방법은 애플리케이션으로 구현되거나 다양한 컴퓨터 구성요소를 통하여 수행될 수 있는 프로그램 명령어의 형태로 구현되어 컴퓨터 판독 가능한 기록 매체에 기록될 수 있다. 상기 컴퓨터 판독 가능한 기록 매체는 프로그램 명령어, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. Such a method of removing noise in a network image using a parallel feature pyramid may be implemented as an application or implemented in the form of program instructions that can be executed through various computer components and recorded in a computer-readable recording medium. The computer-readable recording medium may include program instructions, data files, data structures, etc. alone or in combination.

상기 컴퓨터 판독 가능한 기록 매체에 기록되는 프로그램 명령어는 본 발명을 위하여 특별히 설계되고 구성된 것들이거니와 컴퓨터 소프트웨어 분야의 당업자에게 공지되어 사용 가능한 것일 수도 있다. The program instructions recorded in the computer-readable recording medium may be specially designed and constructed for the present invention, and may be known and usable to those skilled in the computer software field.

컴퓨터 판독 가능한 기록 매체의 예에는, 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체, CD-ROM, DVD와 같은 광기록 매체, 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical media), 및 ROM, RAM, 플래시 메모리 등과 같은 프로그램 명령어를 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다. Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks, and magnetic tapes, optical recording media such as CD-ROMs and DVDs, magnetic-optical media such as floptical disks. media), and a hardware device specially configured to store and execute program instructions such as ROM, RAM, flash memory, and the like.

프로그램 명령어의 예에는, 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드도 포함된다. 상기 하드웨어 장치는 본 발명에 따른 처리를 수행하기 위해 하나 이상의 소프트웨어 모듈로서 작동하도록 구성될 수 있으며, 그 역도 마찬가지이다.Examples of the program instructions include not only machine language codes such as those produced by a compiler, but also high-level language codes that can be executed by a computer using an interpreter or the like. The hardware device may be configured to operate as one or more software modules to perform processing according to the present invention, and vice versa.

이상에서는 실시예들을 참조하여 설명하였지만, 해당 기술 분야의 숙련된 당업자는 하기의 특허 청구의 범위에 기재된 본 발명의 사상 및 영역으로부터 벗어나지 않는 범위 내에서 본 발명을 다양하게 수정 및 변경시킬 수 있음을 이해할 수 있을 것이다.Although the above has been described with reference to embodiments, it is understood that those skilled in the art can variously modify and change the present invention without departing from the spirit and scope of the present invention described in the following claims. You can understand.

본 발명은 영상 내에 다양한 스케일을 고려하여 잡음 제거 성능이 높을 뿐만 아니라 불필요한 아티팩트들을 감소시켜 우수한 품질의 영상을 제공할 수 있다.The present invention not only has high noise removal performance in consideration of various scales in an image, but also reduces unnecessary artifacts to provide an image of excellent quality.

10: 병렬 특징 피라미드를 이용한 네트워크 영상 내 잡음 제거 장치
100: 특징 맵 생성부
300: SPP 모듈
500: MSCA 모듈
700: 융합부10: Device for removing noise in network image using parallel feature pyramid
100: feature map generator
300: SPP module
500: MSCA module
700: fusion unit

Claims

Generating a feature map by performing a predetermined number of convolution operations on the input image;
Generating n-layer pyramid feature maps by downsizing the feature maps to have different scales through a pyramid (Spatial pyramid pooling) module;
converting the pyramid feature maps of n layers having different scales into a unique scale of each corresponding layer through n multi-scale context aggregation (MSCA) modules; And
Including; integrating information of feature maps output from each MSCA module and outputting as one image data; Including,
The step of integrating information of feature maps output from each MSCA module and outputting it as one image data,
Converting a feature map output from each MSCA module into a reference scale;
Performing a concatenation operation of feature maps converted to the reference scale;
Performing a convolution operation on the joint-operated feature map by a preset number; And
Comprising; adding and outputting the convolutional feature map. A method of removing noise in a network image using a parallel feature pyramid.

The method of claim 1, wherein converting the n-layer pyramid feature maps to a unique scale of each corresponding layer,
Performing a convolution operation of each of the n-layer pyramid feature maps having different scales in each MSCA module;
Converting each of the convolutional feature maps into a unique scale of each corresponding layer;
Performing a concatenation operation of the feature maps converted to the intrinsic scale; And
Comprising, Comprising; Comprising the convolutional computation of the spliced feature map to remove noise in a network image using a parallel feature pyramid.

delete

The method of claim 1, wherein converting the n-layer pyramid feature maps to a unique scale of each corresponding layer,
The size of the input image

If is,

(Here, n is the number of pyramidal layers, and the first layer is defined as n=0) A noise removal method in a network image using a parallel feature pyramid to generate feature maps.

The method of claim 1, wherein generating the n-layer pyramid feature maps comprises:
Generating a first pyramid feature map having the same scale as the input image;
Downsizing the first pyramid feature map to generate a second pyramid feature map having a 1/2 scale of the input image; And
Generating a third pyramidal feature map having a 1/4 scale of the input image by downsizing the second pyramidal feature map. Including, Method for removing noise in a network image using a parallel feature pyramid.

The method of claim 5, wherein converting the n-layer pyramid feature maps to a unique scale of each corresponding layer,
Downsizing after a convolution operation of the first pyramid feature map;
Performing a convolution operation of the second pyramidal feature map; And
Comprising, Upsizing after a convolution operation of the third pyramidal feature map. A method of removing noise in a network image using a parallel feature pyramid.

The method of claim 6, wherein converting the n-layer pyramid feature maps to a unique scale of each corresponding layer,
Performing a concatenation operation of the downsized first pyramid feature map, the second pyramid feature map, and the upsized third pyramid feature map; And
Comprising, Comprising; Comprising the convolutional computation of the spliced feature map to remove noise in a network image using a parallel feature pyramid.

The method of claim 7, wherein the step of integrating information of feature maps output from each MSCA module and outputting it as one image data,
Upsizing a feature map output from each of the MSCA modules;
Performing a concatenation operation of the upsized feature maps;
Performing a convolution operation on the joint-operated feature map by a preset number; And
Comprising; adding and outputting the convolutional feature map. A method of removing noise in a network image using a parallel feature pyramid.

A computer-readable storage medium in which a computer program for performing the method for removing noise in a network image using the parallel feature pyramid according to any one of claims 1 to 2 and 4 to 8 is recorded.

A feature map generator configured to generate a feature map by performing convolution operations as many as a preset number on the input image;
A pyramid (Spatial pyramid pooling) module for generating n-layer pyramid feature maps by downsizing the feature maps to have different scales;
N multi-scale context aggregation (MSCA) modules for converting pyramid feature maps of n layers having different scales into unique scales of each corresponding layer; And
Containing; a fusion unit for integrating the information of the feature maps output from each MSCA module and outputting as one image data,
The fusion unit,
An apparatus for removing noise in a network image using a parallel feature pyramid for upsizing the feature maps output from each MSCA module and converting them to a reference scale, performing concatenation and convolution of feature maps, and then performing addition operation. .

The method of claim 10, wherein each MSCA module of the n MSCAs,
Network image using a parallel feature pyramid in which each of the pyramid feature maps of n layers having different scales is convoluted, and each of the convolutional feature maps is down-sized or up-sized to convert to a unique scale of each corresponding layer My noise canceling device.

The method of claim 10, wherein each MSCA module of the n MSCAs,
An apparatus for removing noise in a network image using a parallel feature pyramid, which refines the feature maps converted to the intrinsic scale of each corresponding layer by performing concatenation and convolution.

delete